Query gi|254780922|ref|YP_003065335.1| glucose-1-phosphate thymidylyltransferase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 292 No_of_seqs 222 out of 7407 Neff 7.5 Searched_HMMs 39220 Date Mon May 30 01:15:40 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780922.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG1209 RfbA dTDP-glucose pyro 100.0 0 0 583.6 8.8 284 1-287 1-284 (286) 2 TIGR01208 rmlA_long glucose-1- 100.0 0 0 518.9 9.6 236 2-239 1-240 (361) 3 TIGR01207 rmlA glucose-1-phosp 100.0 0 0 495.8 9.7 286 2-287 1-286 (286) 4 cd02538 G1P_TT_short G1P_TT_sh 100.0 0 0 472.2 7.9 240 1-240 1-240 (240) 5 PRK10122 UTP--glucose-1-phosph 100.0 0 0 436.9 10.3 232 1-235 4-274 (297) 6 PRK13389 UTP--glucose-1-phosph 100.0 0 0 426.5 9.8 232 2-237 10-279 (302) 7 TIGR01099 galU UTP-glucose-1-p 100.0 0 0 414.8 6.8 230 1-233 2-270 (270) 8 pfam00483 NTP_transferase Nucl 100.0 0 0 404.7 7.1 237 2-238 1-245 (247) 9 cd04189 G1P_TT_long G1P_TT_lon 100.0 0 0 399.3 10.6 235 1-239 1-235 (236) 10 cd02541 UGPase_prokaryotic Pro 100.0 0 0 397.7 10.6 232 2-237 2-264 (267) 11 cd06425 M1P_guanylylT_B_like_N 100.0 0 0 388.3 7.0 228 1-237 1-232 (233) 12 cd06428 M1P_guanylylT_A_like_N 100.0 0 0 382.4 7.7 230 3-236 1-256 (257) 13 PRK05293 glgC glucose-1-phosph 100.0 0 0 380.9 8.1 241 1-243 4-263 (381) 14 TIGR02091 glgC glucose-1-phosp 100.0 0 0 382.5 4.7 245 1-246 1-304 (421) 15 cd06915 NTP_transferase_WcbM_l 100.0 0 0 373.2 8.0 223 3-234 1-223 (223) 16 COG1210 GalU UDP-glucose pyrop 100.0 0 0 362.0 7.9 233 1-237 5-269 (291) 17 COG1208 GCD1 Nucleoside-diphos 100.0 0 0 353.1 8.5 233 1-240 2-237 (358) 18 cd04181 NTP_transferase NTP_tr 100.0 0 0 349.0 8.4 217 3-225 1-217 (217) 19 cd06422 NTP_transferase_like_1 100.0 0 0 347.7 8.2 218 2-233 1-221 (221) 20 cd06426 NTP_transferase_like_2 100.0 0 0 344.9 8.0 220 3-234 1-220 (220) 21 cd02540 GT2_GlmU_N_bac N-termi 100.0 0 0 346.1 4.7 218 3-230 1-229 (229) 22 PRK03282 consensus 100.0 0 0 343.4 6.5 242 1-243 7-283 (406) 23 PRK00844 glgC glucose-1-phosph 100.0 0 0 340.0 7.4 242 1-243 6-281 (409) 24 cd02524 G1P_cytidylyltransfera 100.0 0 0 337.8 9.0 225 3-239 1-247 (253) 25 KOG1322 consensus 100.0 0 0 340.1 6.5 235 1-243 10-250 (371) 26 PRK04928 consensus 100.0 0 0 335.4 7.5 241 2-243 5-277 (405) 27 PRK00725 glgC glucose-1-phosph 100.0 0 0 331.0 6.7 241 3-246 22-299 (431) 28 PRK03701 consensus 100.0 0 0 330.5 6.8 245 1-246 18-299 (431) 29 PRK01884 consensus 100.0 0 0 327.4 5.3 244 2-246 19-298 (435) 30 PRK02862 glgC glucose-1-phosph 100.0 0 0 317.5 7.7 232 2-239 5-274 (429) 31 cd04183 GT2_BcE_like GT2_BcbE_ 100.0 0 0 311.8 6.5 222 3-230 1-230 (231) 32 COG0448 GlgC ADP-glucose pyrop 100.0 0 0 316.7 1.4 238 3-243 8-266 (393) 33 PRK09451 glmU bifunctional N-a 100.0 4.2E-45 0 303.0 7.1 225 1-239 6-241 (456) 34 cd02523 PC_cytidylyltransferas 100.0 5.2E-43 0 290.0 6.3 222 3-234 1-229 (229) 35 TIGR01173 glmU UDP-N-acetylglu 100.0 1.9E-40 4.2E-45 274.0 7.4 236 1-242 1-250 (461) 36 cd02508 ADP_Glucose_PP ADP-glu 100.0 1.2E-36 3.1E-41 250.1 5.1 184 3-224 1-200 (200) 37 COG1207 GlmU N-acetylglucosami 100.0 3.5E-36 9E-41 247.2 6.8 231 1-239 3-244 (460) 38 cd04198 eIF-2B_gamma_N The N-t 100.0 1.3E-34 3.2E-39 237.4 5.0 137 1-139 1-140 (214) 39 cd04197 eIF-2B_epsilon_N The N 100.0 2.9E-33 7.5E-38 228.9 5.8 178 1-181 1-217 (217) 40 cd02507 eIF-2B_gamma_N_like Th 100.0 1.5E-32 3.7E-37 224.5 5.5 179 1-181 1-216 (216) 41 TIGR02623 G1P_cyt_trans glucos 100.0 8E-32 2E-36 219.8 7.2 217 1-230 1-239 (256) 42 TIGR01105 galF regulatory prot 100.0 4.4E-31 1.1E-35 215.2 6.5 232 1-235 4-274 (297) 43 KOG1460 consensus 100.0 6.6E-31 1.7E-35 214.1 4.8 239 2-243 4-270 (407) 44 COG1213 Predicted sugar nucleo 99.9 5.9E-27 1.5E-31 189.3 6.6 218 1-238 4-229 (239) 45 cd02509 GDP-M1P_Guanylyltransf 99.9 1.4E-27 3.6E-32 193.2 3.1 226 1-230 1-274 (274) 46 COG4750 LicC CTP:phosphocholin 99.9 6.3E-24 1.6E-28 170.3 -0.1 170 1-187 1-171 (231) 47 COG0836 {ManC} Mannose-1-phosp 99.8 9.5E-22 2.4E-26 156.6 3.7 232 1-235 2-283 (333) 48 TIGR02092 glgD glucose-1-phosp 99.8 3.9E-21 1E-25 152.8 3.8 240 1-246 3-274 (383) 49 KOG1462 consensus 99.8 6.1E-20 1.6E-24 145.3 4.2 246 1-248 10-322 (433) 50 pfam01128 IspD Uncharacterized 99.6 7.7E-16 2E-20 119.6 6.6 210 3-238 3-220 (221) 51 TIGR03310 matur_ygfJ molybdenu 99.6 4.1E-15 1E-19 115.0 6.8 184 2-232 1-187 (188) 52 PRK00155 ispD 2-C-methyl-D-ery 99.5 1.5E-14 3.9E-19 111.4 6.8 211 3-239 7-225 (228) 53 TIGR00453 ispD 2-C-methyl-D-er 99.5 1.1E-14 2.7E-19 112.4 5.6 209 3-235 2-224 (226) 54 KOG1461 consensus 99.5 1.1E-14 2.8E-19 112.3 4.9 132 2-137 26-168 (673) 55 cd04182 GT_2_like_f GT_2_like_ 99.5 3.2E-14 8.2E-19 109.4 6.5 182 1-230 1-185 (186) 56 cd02516 CDP-ME_synthetase CDP- 99.5 7.1E-14 1.8E-18 107.2 5.9 206 3-233 3-218 (218) 57 COG2266 GTP:adenosylcobinamide 99.4 2.6E-13 6.7E-18 103.7 6.8 175 1-245 1-175 (177) 58 PRK13385 2-C-methyl-D-erythrit 99.4 2.4E-13 6E-18 104.0 5.7 217 1-238 1-232 (238) 59 PRK13368 3-deoxy-manno-octulos 99.4 4.2E-13 1.1E-17 102.4 6.2 221 1-236 1-237 (238) 60 TIGR00454 TIGR00454 conserved 99.4 3.8E-13 9.8E-18 102.6 4.4 117 1-124 1-124 (204) 61 COG1211 IspD 4-diphosphocytidy 99.4 1.2E-12 3.1E-17 99.5 6.5 212 2-238 6-227 (230) 62 TIGR03202 pucB xanthine dehydr 99.4 9.7E-13 2.5E-17 100.1 5.3 184 2-230 2-190 (190) 63 PRK05450 3-deoxy-manno-octulos 99.3 1.6E-12 4.2E-17 98.7 5.8 224 1-238 1-247 (248) 64 PRK00317 mobA molybdopterin-gu 99.3 3.4E-12 8.7E-17 96.7 7.4 186 1-236 4-192 (193) 65 cd02517 CMP-KDO-Synthetase CMP 99.3 2.1E-12 5.4E-17 98.0 6.2 218 3-235 4-238 (239) 66 COG2068 Uncharacterized MobA-r 99.3 1.1E-11 2.8E-16 93.5 7.7 187 3-238 8-198 (199) 67 PRK09382 ispDF bifunctional 2- 99.2 1.2E-10 3.1E-15 87.0 7.4 196 3-237 9-216 (382) 68 cd02503 MobA MobA catalyzes th 99.1 1.4E-10 3.7E-15 86.5 5.3 105 1-120 1-105 (181) 69 PRK00560 molybdopterin-guanine 99.0 1.3E-09 3.2E-14 80.6 7.9 181 3-240 11-194 (196) 70 COG0746 MobA Molybdopterin-gua 99.0 6.7E-10 1.7E-14 82.3 5.9 105 1-122 5-110 (192) 71 pfam02348 CTP_transf_3 Cytidyl 99.0 7.4E-10 1.9E-14 82.0 4.7 176 3-191 2-195 (197) 72 PRK02726 molybdopterin-guanine 98.9 1.4E-09 3.7E-14 80.2 5.1 99 3-116 10-110 (200) 73 COG1212 KdsB CMP-2-keto-3-deox 98.8 1.1E-08 2.9E-13 74.6 6.0 222 3-239 6-244 (247) 74 TIGR01479 GMP_PMI mannose-1-ph 98.8 3.4E-09 8.7E-14 77.9 3.1 232 2-236 2-292 (478) 75 COG1083 NeuA CMP-N-acetylneura 98.6 3.9E-08 1E-12 71.2 4.1 202 3-239 6-224 (228) 76 cd04193 UDPGlcNAc_PPase UDPGlc 98.6 6.2E-08 1.6E-12 70.0 5.0 179 3-184 18-254 (323) 77 TIGR02665 molyb_mobA molybdopt 98.6 1.9E-08 5E-13 73.1 1.6 117 2-132 2-129 (202) 78 cd02518 GT2_SpsF SpsF is a gly 98.5 5.8E-07 1.5E-11 63.9 8.7 221 3-275 2-226 (233) 79 cd02513 CMP-NeuAc_Synthase CMP 98.5 1.4E-07 3.6E-12 67.7 4.8 200 3-236 4-222 (223) 80 PTZ00339 UDP-N-acetylglucosami 98.4 2.9E-07 7.4E-12 65.8 4.9 178 3-184 112-356 (499) 81 TIGR03584 PseF pseudaminic aci 98.3 7E-07 1.8E-11 63.4 4.2 202 3-237 2-220 (222) 82 cd04180 UGPase_euk_like Eukary 98.2 8.3E-07 2.1E-11 62.9 2.9 157 3-162 3-201 (266) 83 TIGR00466 kdsB 3-deoxy-D-manno 98.1 5.5E-06 1.4E-10 57.7 4.7 214 3-230 5-245 (246) 84 cd06424 UGGPase UGGPase cataly 98.0 6.7E-06 1.7E-10 57.2 4.7 75 3-81 3-93 (315) 85 cd00761 Glyco_tranf_GTA_type G 97.7 2.9E-05 7.3E-10 53.2 3.6 93 26-119 3-98 (156) 86 COG1861 SpsF Spore coat polysa 97.7 1.6E-05 4E-10 54.9 2.1 119 1-130 3-124 (241) 87 COG4284 UDP-glucose pyrophosph 97.7 8.3E-05 2.1E-09 50.3 5.5 173 3-181 108-334 (472) 88 cd00897 UGPase_euk Eukaryotic 97.3 0.00035 8.9E-09 46.4 4.8 156 3-162 6-201 (300) 89 KOG2388 consensus 96.5 0.0028 7.1E-08 40.8 3.6 75 3-81 100-192 (477) 90 pfam01704 UDPGP UTP--glucose-1 95.6 0.027 7E-07 34.5 5.1 51 3-56 56-112 (416) 91 cd04184 GT2_RfbC_Mx_like Myxoc 95.2 0.033 8.4E-07 34.0 4.3 103 25-127 6-112 (202) 92 pfam00535 Glycos_transf_2 Glyc 94.7 0.037 9.4E-07 33.7 3.5 106 25-131 3-111 (168) 93 cd02526 GT2_RfbF_like RfbF is 94.0 0.11 2.7E-06 30.8 4.8 189 31-227 10-208 (237) 94 cd06433 GT_2_WfgS_like WfgS an 93.9 0.067 1.7E-06 32.1 3.5 101 25-130 3-107 (202) 95 cd06442 DPM1_like DPM1_like re 92.8 0.12 3E-06 30.6 3.3 186 25-214 2-195 (224) 96 PRK00576 molybdopterin-guanine 92.6 0.18 4.5E-06 29.5 4.0 36 20-56 2-39 (178) 97 cd04188 DPG_synthase DPG_synth 92.3 0.1 2.6E-06 31.0 2.5 190 25-223 2-206 (211) 98 cd04195 GT2_AmsE_like GT2_AmsE 91.2 0.16 4E-06 29.8 2.4 101 25-126 3-108 (201) 99 cd04187 DPM1_like_bac Bacteria 90.3 0.22 5.5E-06 28.9 2.5 157 25-188 2-166 (181) 100 PRK10073 predicted glycosyl tr 89.9 0.18 4.5E-06 29.5 1.8 104 25-134 11-121 (329) 101 cd04186 GT_2_like_c Subfamily 89.7 0.44 1.1E-05 27.0 3.7 93 25-122 2-98 (166) 102 TIGR03111 glyc2_xrt_Gpos1 puta 89.0 0.23 5.7E-06 28.8 1.7 102 24-126 53-159 (439) 103 cd06434 GT2_HAS Hyaluronan syn 87.3 0.46 1.2E-05 26.8 2.5 96 24-123 4-102 (235) 104 cd04179 DPM_DPG-synthase_like 87.2 0.35 8.8E-06 27.6 1.8 107 25-132 2-113 (185) 105 PRK10018 predicted glycosyl tr 86.7 0.33 8.3E-06 27.8 1.4 99 25-125 10-112 (279) 106 cd06913 beta3GnTL1_like Beta 1 86.5 0.81 2.1E-05 25.3 3.4 103 25-128 2-114 (219) 107 cd06435 CESA_NdvC_like NdvC_li 85.7 0.71 1.8E-05 25.7 2.7 97 25-122 3-108 (236) 108 cd06423 CESA_like CESA_like is 85.6 0.72 1.8E-05 25.6 2.7 100 25-128 2-108 (180) 109 PRK11204 N-glycosyltransferase 84.7 0.36 9.3E-06 27.5 0.9 97 25-123 59-159 (421) 110 cd04196 GT_2_like_d Subfamily 84.6 0.59 1.5E-05 26.2 1.9 102 25-128 3-109 (214) 111 cd06421 CESA_CelA_like CESA_Ce 84.1 0.75 1.9E-05 25.5 2.3 98 24-125 5-111 (234) 112 cd06427 CESA_like_2 CESA_like_ 81.7 1.1 2.8E-05 24.5 2.3 98 24-123 5-109 (241) 113 cd04192 GT_2_like_e Subfamily 81.7 1.2 3E-05 24.2 2.5 102 25-128 2-112 (229) 114 cd06439 CESA_like_1 CESA_like_ 79.6 1.1 2.8E-05 24.5 1.8 95 25-123 34-134 (251) 115 cd06437 CESA_CaSu_A2 Cellulose 79.3 1.2 3.1E-05 24.2 1.9 99 24-125 5-113 (232) 116 cd04185 GT_2_like_b Subfamily 78.2 2.3 6E-05 22.4 3.1 95 26-122 3-103 (202) 117 COG1216 Predicted glycosyltran 76.1 3.2 8.1E-05 21.6 3.3 200 22-229 6-226 (305) 118 cd02525 Succinoglycan_BP_ExoA 72.9 4.1 0.0001 20.9 3.2 101 25-128 5-111 (249) 119 COG1215 Glycosyltransferases, 67.5 4.5 0.00011 20.6 2.4 105 24-129 58-168 (439) 120 PRK10063 predicted glycosyl tr 65.5 4.2 0.00011 20.8 2.0 80 30-114 12-98 (248) 121 TIGR03552 F420_cofC 2-phospho- 62.8 4.9 0.00012 20.4 1.9 109 2-123 1-116 (195) 122 PRK10714 undecaprenyl phosphat 62.4 10 0.00027 18.4 3.5 157 25-188 11-176 (324) 123 cd06438 EpsO_like EpsO protein 61.0 8.8 0.00022 18.8 3.0 94 25-123 2-105 (183) 124 cd02522 GT_2_like_a GT_2_like_ 58.5 7 0.00018 19.4 2.1 93 25-127 4-101 (221) 125 pfam01983 CofC Guanylyl transf 52.2 7.2 0.00018 19.4 1.3 107 1-122 1-112 (217) 126 cd02520 Glucosylceramide_synth 51.9 12 0.00031 17.9 2.4 95 24-121 5-109 (196) 127 KOG2148 consensus 50.4 20 0.0005 16.6 5.2 61 224-288 744-804 (867) 128 cd06436 GlcNAc-1-P_transferase 49.2 12 0.0003 18.1 2.0 93 25-120 2-111 (191) 129 cd06420 GT2_Chondriotin_Pol_N 48.0 16 0.00041 17.2 2.6 89 25-116 2-96 (182) 130 COG2179 Predicted hydrolase of 47.1 12 0.00031 17.9 1.8 106 23-143 37-146 (175) 131 cd05133 RasGAP_IQGAP1 IQGAP1 i 45.7 6.8 0.00017 19.5 0.3 111 139-264 190-300 (360) 132 TIGR01326 OAH_OAS_sulfhy O-ace 41.9 16 0.00042 17.1 1.8 56 30-91 81-141 (434) 133 PRK09860 putative alcohol dehy 41.9 18 0.00047 16.8 2.1 93 14-108 1-97 (383) 134 PRK04015 chromatin protein; Pr 41.6 27 0.00069 15.8 2.9 31 24-54 5-37 (91) 135 cd03409 Chelatase_Class_II Cla 40.2 28 0.00072 15.6 3.5 48 5-52 2-65 (101) 136 cd05131 RasGAP_IQGAP2 IQGAP2 i 39.5 9.7 0.00025 18.6 0.3 111 139-264 190-300 (339) 137 pfam11931 DUF3449 Domain of un 38.8 14 0.00036 17.5 1.1 39 30-69 76-117 (187) 138 KOG2638 consensus 38.7 30 0.00076 15.5 4.1 59 4-65 107-171 (498) 139 COG1788 AtoD Acyl CoA:acetate/ 37.9 31 0.00079 15.4 4.0 35 4-56 20-54 (220) 140 TIGR02429 pcaI_scoA_fam 3-oxoa 36.1 31 0.00078 15.4 2.5 76 3-82 20-105 (222) 141 pfam04007 DUF354 Protein of un 34.4 35 0.00088 15.1 2.5 12 35-46 17-28 (335) 142 TIGR00677 fadh2_euk methylenet 34.2 35 0.00089 15.0 2.5 25 31-56 79-103 (312) 143 pfam07959 Fucokinase L-fucokin 31.7 39 0.00099 14.8 2.9 29 205-234 224-252 (414) 144 PRK05802 hypothetical protein; 30.1 41 0.0011 14.6 2.6 20 198-217 189-208 (328) 145 COG1581 Ssh10b Archaeal DNA-bi 29.6 42 0.0011 14.5 4.1 33 22-54 3-37 (91) 146 KOG2961 consensus 29.4 40 0.001 14.7 2.1 23 219-241 163-185 (190) 147 TIGR02529 EutJ ethanolamine ut 28.4 44 0.0011 14.4 2.6 29 167-198 144-172 (240) 148 COG3981 Predicted acetyltransf 27.9 45 0.0011 14.4 2.4 28 96-123 65-93 (174) 149 cd03413 CbiK_C Anaerobic cobal 27.8 45 0.0011 14.4 3.2 26 26-51 36-61 (103) 150 PRK05282 peptidase E; Validate 27.7 41 0.0011 14.6 2.0 87 28-124 11-110 (233) 151 COG0299 PurN Folate-dependent 27.5 44 0.0011 14.4 2.1 50 1-51 1-56 (200) 152 KOG2978 consensus 27.2 46 0.0012 14.3 2.5 103 25-132 8-122 (238) 153 PRK06263 sdhA succinate dehydr 26.9 42 0.0011 14.6 1.9 19 2-20 187-205 (539) 154 PRK12305 thrS threonyl-tRNA sy 26.4 48 0.0012 14.2 2.3 52 223-289 456-508 (576) 155 PRK11253 ldcA L,D-carboxypepti 25.5 50 0.0013 14.1 2.2 14 34-47 19-32 (305) 156 TIGR01186 proV glycine betaine 24.7 36 0.00091 15.0 1.2 28 244-271 334-362 (372) 157 TIGR01819 F420_cofD LPPG:Fo 2- 24.3 33 0.00085 15.2 1.0 12 4-15 2-14 (359) 158 TIGR03029 EpsG chain length de 23.2 55 0.0014 13.8 3.4 16 222-237 245-260 (274) 159 PRK00413 thrS threonyl-tRNA sy 22.6 56 0.0014 13.7 2.4 28 81-108 362-389 (639) 160 PRK10624 L-1,2-propanediol oxi 22.2 56 0.0014 13.7 1.8 79 30-108 13-95 (381) 161 cd02514 GT13_GLCNAC-TI GT13_GL 22.2 57 0.0015 13.7 3.0 118 25-154 5-143 (334) 162 TIGR00639 PurN phosphoribosylg 21.7 35 0.0009 15.0 0.7 48 1-49 1-56 (215) 163 pfam02829 3H 3H domain. This d 21.4 60 0.0015 13.6 2.2 18 128-146 20-37 (98) 164 TIGR01236 D1pyr5carbox1 1-pyrr 21.0 44 0.0011 14.4 1.0 42 141-182 369-420 (551) 165 COG3315 O-Methyltransferase in 20.5 53 0.0014 13.9 1.4 12 168-179 174-186 (297) 166 KOG1447 consensus 20.0 50 0.0013 14.1 1.2 14 1-14 64-77 (412) No 1 >COG1209 RfbA dTDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=583.58 Aligned_cols=284 Identities=49% Similarity=0.907 Sum_probs=278.9 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |||||||||.||||+|+|...||+|+||+|||||+|+|+.|+.+||+||+||+++++...+++++|+|++||++|+|++| T Consensus 1 mKgiILAgG~GTRL~PlT~~~~KqLlpV~~KPmi~y~l~~L~~aGI~dI~II~~~~~~~~~~~llGdgs~~gv~itY~~Q 80 (286) T COG1209 1 MKGVILAGGSGTRLRPLTRVVPKQLLPVYDKPMIYYPLETLMLAGIRDILIVVGPEDKPTFKELLGDGSDFGVDITYAVQ 80 (286) T ss_pred CCCEEECCCCCCCCCCCCCCCCCCCCEECCCCHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHCCCCCCCCCEEEEEC T ss_conf 97279637675443323136775332116862158579999975985699996688505666333671003863699965 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCCC Q ss_conf 33224468521011123356752132144321111000011100000111001222210001222122221000100133 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEEK 160 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~EK 160 (292) .+|+|+|||+..|++|+++++|+|+||||+|++ ++.+++..+.+...++++++++|+||+||||+++|+++++++++|| T Consensus 81 ~~p~GlA~Av~~a~~fv~~~~f~l~LGDNi~~~-~l~~~~~~~~~~~~ga~i~~~~V~dP~rfGV~e~d~~~~v~~l~EK 159 (286) T COG1209 81 PEPDGLAHAVLIAEDFVGDDDFVLYLGDNIFQD-GLSELLEHFAEEGSGATILLYEVDDPSRYGVVEFDEDGKVIGLEEK 159 (286) T ss_pred CCCCCHHHHHHHHHHHCCCCCEEEEECCCEECC-CHHHHHHHHHCCCCCCEEEEEECCCCCCCEEEEECCCCCEEEEEEC T ss_conf 898768999998886458986599926844105-7799999985248994899998489530448998689968776788 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHHH Q ss_conf 33322222202455410002243220046753333333432000001320463135412563278457799999999998 Q gi|254780922|r 161 PNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFVRNI 240 (292) Q Consensus 161 P~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i~~~ 240 (292) |++|+||||++|+|+|++++|+.++.++||+|||+||||+++.|+++|.+.+..+.+| .|+|+||+++|++|++|+++. T Consensus 160 P~~P~SNlAvtGlY~~d~~Vf~~~~~ikPS~RGElEITd~i~~~i~~G~~~~~~~~~G-~WlDtGt~~slleA~~~i~~~ 238 (286) T COG1209 160 PKEPKSNLAVTGLYFYDPSVFEAIKQIKPSARGELEITDAIDLYIEKGYLVVAILIRG-WWLDTGTPESLLEANNFVRTV 238 (286) T ss_pred CCCCCCCEEEEEEEEECHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCCCEEEEEECCC-EEEECCCHHHHHHHHHHHHHH T ss_conf 9999876268899970858999987189887785675288999997696888987151-688638701289999999888 Q ss_pred HHHCCCEEEEHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 88809866209999988689898999998864258848999999986 Q gi|254780922|r 241 ENRLGLYVACPEEIAYRHDFINESQFFQLIDHFGNSPYGLYLRQVVE 287 (292) Q Consensus 241 e~~~g~~i~~~ee~a~~~~~i~~~~l~~~~~~~~~~~y~~~l~~~~~ 287 (292) +++||..+.||||+|| +|||+..++......+.+|.||.||..... T Consensus 239 ~~~~G~~~~~~~~~~~-~~~i~~~~~~~~~~~l~~~~~G~y~~~~~~ 284 (286) T COG1209 239 SKRQGFKIACPEEIAW-NGWIDGPGLIGLASQLEKSGYGQYLLELLR 284 (286) T ss_pred HHHCCCEEECHHHEEE-EEEEECHHHHCCCCCHHHCCCCHHHHHHHC T ss_conf 8634877727037888-248835686232420443673066656533 No 2 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=100.00 E-value=0 Score=518.85 Aligned_cols=236 Identities=37% Similarity=0.671 Sum_probs=226.3 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 07992488874553344389822224783747999999999789868999977765643111222222221111222333 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) ||+|||||.||||||||.+.||||+||.|||+|+|+|++++.|||+||-||+++...+.+++++++|++||++|+|+.|. T Consensus 1 KaliL~aG~GTRLRPLTFt~pK~LiPvAnKPi~~Yaie~~~~AGI~diGIvvg~~~~e~i~~~~g~g~~fg~kityI~Q~ 80 (361) T TIGR01208 1 KALILAAGKGTRLRPLTFTRPKQLIPVANKPILQYAIEDLIEAGITDIGIVVGPETGEEIKEIVGEGERFGAKITYIVQG 80 (361) T ss_pred CCEECCCCCCCCCCCCCCCCCCCEEEECCCCHHHHHHHHHHHCCCEEEEEEECCCCCHHHEEEECCCCCCCEEEEEEECC T ss_conf 92442587867677766578843066168732367688887469769999846988212202324883023289898227 Q ss_pred CCCCCCCCCCCCCCC-CCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCC-HHHCCC Q ss_conf 322446852101112-335675213214432111100001110000011100122221000122212222100-010013 Q gi|254780922|r 82 VPAGLAQSYILGAEF-IGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQ-AISIEE 159 (292) Q Consensus 82 ~p~Gta~Ai~~a~~~-i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~-i~~i~E 159 (292) +|+|+|||++.|++| ++|++|+|-||||++ ..++.++++.+.++..++.|++.+|+||++|||++++++|+ |+.++| T Consensus 81 ~plGlAHAv~~A~~fGlgd~~FvvYLGDNl~-~~gi~~fv~~F~~~~~da~ILL~~V~dP~~FGVA~l~d~G~~i~~LvE 159 (361) T TIGR01208 81 EPLGLAHAVYVARDFGLGDEDFVVYLGDNLI-QDGIKRFVKSFEEKDYDALILLKKVEDPTAFGVAELEDDGKRILKLVE 159 (361) T ss_pred CCCHHHHHHHHHHHHCCCCCCEEEECCCCHH-HHHHHHHHHHHCCCCCCCEEECCCCCCCCCCCEEEEECCCCEEEEEEE T ss_conf 8760467645788847899860797374210-423789988533148003011043887884555899248979999883 Q ss_pred CCCCCCCCCCHHHHHHHHH--HHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHH Q ss_conf 3333222222024554100--02243220046753333333432000001320463135412563278457799999999 Q gi|254780922|r 160 KPNNPKSSFAVTGIYFYDQ--EVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFV 237 (292) Q Consensus 160 KP~~~~sn~a~~GiY~f~~--~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i 237 (292) ||++||||||+.|+|+|.| .+|+.+++++||+||||||||+++.++++|..+......| +|.|+|++|||++||..| T Consensus 160 KPk~PPSNLAvvGlY~F~pPe~if~~~~~~kPSwRGElEITD~IQ~lIe~Gy~V~~~~v~G-WWkDTGk~eDLL~AN~~i 238 (361) T TIGR01208 160 KPKEPPSNLAVVGLYMFRPPELIFEAIKNIKPSWRGELEITDAIQYLIEKGYKVGGSKVKG-WWKDTGKPEDLLDANRLI 238 (361) T ss_pred CCCCCCCCCEEEEEEECCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCEEEEEEEEE-EECCCCCCHHHHHHHHHH T ss_conf 3865787701233453387779888985288877662157757756531583777799978-870159802389999997 Q ss_pred HH Q ss_conf 99 Q gi|254780922|r 238 RN 239 (292) Q Consensus 238 ~~ 239 (292) +. T Consensus 239 Ld 240 (361) T TIGR01208 239 LD 240 (361) T ss_pred HH T ss_conf 44 No 3 >TIGR01207 rmlA glucose-1-phosphate thymidylyltransferase; InterPro: IPR005907 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily (here designated as short form) of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme. The enzyme is active as a homotetramer.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005908 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. ; GO: 0008879 glucose-1-phosphate thymidylyltransferase activity, 0045226 extracellular polysaccharide biosynthetic process. Probab=100.00 E-value=0 Score=495.82 Aligned_cols=286 Identities=57% Similarity=1.012 Sum_probs=281.8 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 07992488874553344389822224783747999999999789868999977765643111222222221111222333 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) |+||||||-||||+|.|..+.|+|+||+|||||+|+|+.|+.+||++|+||+.+++...|++++|||++||+++.|..|. T Consensus 1 ~Gi~laGG~Gtrl~P~t~~~skqllP~ydkPmiyyPl~~lmlaG~r~i~~i~~~~~~~~f~~llGdG~~~G~~~~y~~q~ 80 (286) T TIGR01207 1 KGIILAGGSGTRLYPITLAVSKQLLPVYDKPMIYYPLSVLMLAGIRDILIISTPEDTPRFKRLLGDGSQFGVELSYAVQP 80 (286) T ss_pred CCEEEECCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHEEEEECCCHHHHHHHHHCCCCHHCEEEEEECCC T ss_conf 92687268874012434544321031015750123189998730222013521100057898713520104578871157 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCCCC Q ss_conf 32244685210111233567521321443211110000111000001110012222100012221222210001001333 Q gi|254780922|r 82 VPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEEKP 161 (292) Q Consensus 82 ~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~EKP 161 (292) .|.|+|.|+..+++|+++++.+|++|||+||+.++.+.+........++++|++.|.||++|||+++|.++++++++||| T Consensus 81 ~P~Gla~af~ig~~f~~~~~~~l~lGdn~fyG~~~~~~l~~~~~~~~Ga~vf~y~v~~P~~yGv~~f~~~~~~~~~eekP 160 (286) T TIGR01207 81 SPDGLAQAFIIGEEFIGDDKSALVLGDNIFYGHDLSELLKRAAAREEGATVFAYQVKDPERYGVVEFDEEGRAISLEEKP 160 (286) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCEEEECCCCCEEEEECCC T ss_conf 86325888877688726776179960500102148999999875306857999875076412306663873178641378 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 33222222024554100022432200467533333334320000013204631354125632784577999999999988 Q gi|254780922|r 162 NNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFVRNIE 241 (292) Q Consensus 162 ~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i~~~e 241 (292) ..|+|+||++|+|+++.++.++++.++||.|||+||||+++.|+++|.+.+..+.||..|+|+||.+++.+|+.||.+++ T Consensus 161 ~~P~s~y~v~Gly~yd~~~~~~a~~~~~s~rGelei~~~n~~yl~~G~l~v~~~~rG~aWldtGt~~sl~~a~~f~~t~~ 240 (286) T TIGR01207 161 AKPKSNYAVTGLYFYDNRVVEIARELKPSARGELEITDLNRVYLEEGKLSVELLGRGYAWLDTGTHDSLLEASSFIETIE 240 (286) T ss_pred CCCCCCEEEEEHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCEEEHHHHCCCCEECCCHHHHHHHHHHHHHHHH T ss_conf 88763413401110106899999731324566402888999998636502014526511000111688999999999988 Q ss_pred HHCCCEEEEHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 8809866209999988689898999998864258848999999986 Q gi|254780922|r 242 NRLGLYVACPEEIAYRHDFINESQFFQLIDHFGNSPYGLYLRQVVE 287 (292) Q Consensus 242 ~~~g~~i~~~ee~a~~~~~i~~~~l~~~~~~~~~~~y~~~l~~~~~ 287 (292) +|||+.|+|+||+||++|||+.+++.+.+..+.++.||+||.++.+ T Consensus 241 ~~q~~~~~~~ee~a~~~~~~~~~~~~~~a~~~~k~~yG~yl~~~~~ 286 (286) T TIGR01207 241 KRQGLKVACPEEIAFRKGWIDAEQLEELAEKLAKNGYGQYLLELLE 286 (286) T ss_pred HHCCEEEEEHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 5246155414456666641037899999998886335589999829 No 4 >cd02538 G1P_TT_short G1P_TT_short is the short form of glucose-1-phosphate thymidylyltransferase. This family is the short form of glucose-1-phosphate thymidylyltransferase. Glucose-1-phosphate thymidylyltransferase catalyses the formation of dTDP-glucose, from dTTP and glucose 1-phosphate. It is the first enzyme in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.There are two forms of Glucose-1-phosphate thymidylyltransferase in bacteria and archeae; short form and long form. The homotetrameric, feedback inhibited short form is found in numerous bacterial species that produce dTDP-L-rhamnose. The long form, which has an extra 50 amino acids c-terminal, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. Probab=100.00 E-value=0 Score=472.23 Aligned_cols=240 Identities=62% Similarity=1.065 Sum_probs=231.2 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |||||||||+||||+|+|..+||||+||+|||||+|+|++|..+||++|+|++++++.+++.++++++++++++++|+.| T Consensus 1 MkavILAgG~GtRL~PlT~~~PKpLlpi~~kPli~~~l~~l~~~Gi~~i~iv~~~~~~~~~~~~~~~~~~~~~~i~y~~e 80 (240) T cd02538 1 MKGIILAGGSGTRLYPLTKVVSKQLLPVYDKPMIYYPLSTLMLAGIREILIISTPEDLPLFKELLGDGSDLGIRITYAVQ 80 (240) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCEECCEEEHHHHHHHHHHCCCCCCEEECCHHHHHHHHHHHHCCCCCCEEEEEEEE T ss_conf 90999888786678732379874327799997189999999985996231002342699999997447637527999998 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCCC Q ss_conf 33224468521011123356752132144321111000011100000111001222210001222122221000100133 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEEK 160 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~EK 160 (292) ++|+|||+|+.+|++++++++|++++|||++++.++.+++..|.+...++|++++++++|++|||+++|++++|++|+|| T Consensus 81 ~~~~Gt~~al~~a~~~~~~~~~~v~~gD~i~~~~~~~~~l~~~~~~~~~~ti~~~~v~~~~~yGvv~~d~~~~v~~~~EK 160 (240) T cd02538 81 PKPGGLAQAFIIGEEFIGDDPVCLILGDNIFYGQGLSPILQRAAAQKEGATVFGYEVNDPERYGVVEFDENGRVLSIEEK 160 (240) T ss_pred ECCCCCCHHHHHHHHHCCCCCEEEECCCCEECCCCHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCCCCEEEEEEC T ss_conf 62686313788888762898734443874010656899999999666884787777236113672311478857898889 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHHH Q ss_conf 33322222202455410002243220046753333333432000001320463135412563278457799999999998 Q gi|254780922|r 161 PNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFVRNI 240 (292) Q Consensus 161 P~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i~~~ 240 (292) |++|+||++++|+|+|++++|+++++++|+.|||+++||+++.++++|++.+..+..+++|+|||||++|++||+||+.+ T Consensus 161 P~~~~s~~~~~GiYi~~~~i~~~i~~~~~~~~ge~~i~d~~~~~i~~g~~~~~~~~~~~~W~DiGtpe~~~~An~~i~~~ 240 (240) T cd02538 161 PKKPKSNYAVTGLYFYDNDVFEIAKQLKPSARGELEITDVNNEYLEKGKLSVELLGRGFAWLDTGTHESLLEASNFVQTI 240 (240) T ss_pred CCCCCCCEEEEEEEEECHHHHHHHHHCCCCCCCEEEHHHHHHHHHHCCCCEEEEECCCCEEEECCCHHHHHHHHHHHHHC T ss_conf 99999887999999989899999985899999907588999999985997399968998089799999999999889649 No 5 >PRK10122 UTP--glucose-1-phosphate uridylyltransferase subunit GalF; Provisional Probab=100.00 E-value=0 Score=436.87 Aligned_cols=232 Identities=24% Similarity=0.437 Sum_probs=198.0 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCC--------- Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222--------- Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKW--------- 71 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~--------- 71 (292) |||||||||+||||+|+|+.+||+||||.|||+|+|+++.+..+||++|+||+++.+ ..+++|+..+.++ T Consensus 4 MKaVI~aAG~GTRl~P~Tk~~PKemLPv~dkPii~~~vee~~~aGI~eiiiV~~~~k-~~ie~~F~~~~~le~~l~~~~k 82 (297) T PRK10122 4 LKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASK-NAVENHFDTSYELESLLEQRVK 82 (297) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEECCCC-CHHHHHCCCHHHHHHHHHHHHH T ss_conf 549998986763568531578924462898536999999999879978999917971-1577746671888999876117 Q ss_pred -------------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC-------CCCCHHHHHCCCHHHHH Q ss_conf -------------1111222333322446852101112335675213214432111-------10000111000001110 Q gi|254780922|r 72 -------------GVQFSYIEQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGS-------DISDIFHKARARRNSAT 131 (292) Q Consensus 72 -------------g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~-------~l~~~l~~~~~~~~~~t 131 (292) |++|+|+.|.+|+|||||+++|++++++++|++++||+++.+. ++..++..+.+....++ T Consensus 83 ~~~l~~~~~~~~~gv~i~yv~Q~eplGlGhAV~~A~~~v~depF~VllgD~l~~~~~~~~~~~~l~~~i~~~~~~~~~~v 162 (297) T PRK10122 83 RQLLAEVQSICPPGVTIMNVRQGQPLGLGHSILCARPAIGDNPFVVVLPDVVIDDASADPLRYNLAAMIARFNETGRSQV 162 (297) T ss_pred HHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCEECCCCCCHHHHHHHHHHHHHHHCCCCEE T ss_conf 99999887524489649998779998678999988873499607997266142278641678889999999971499599 Q ss_pred CCCCCCCCCCCCCCCCCC----CCC---CHHHCCCCCCCC---CCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 012222100012221222----210---001001333332---2222202455410002243220046753333333432 Q gi|254780922|r 132 VVGCHVQNPQRYGVVEVD----SSN---QAISIEEKPNNP---KSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVN 201 (292) Q Consensus 132 i~~~~v~~p~~yGvv~~d----~~~---~i~~i~EKP~~~---~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~ 201 (292) ++...++||++|||+..+ .++ +|++++|||..| +||+|++|+|+|++++|+++++++||.|||+|+||++ T Consensus 163 ~~~~v~~d~~~yGVv~~~~~~d~~g~v~~I~~ivEKP~~P~~apSnlai~GrYi~~p~IF~~L~~~~p~~~GEiqLTDAi 242 (297) T PRK10122 163 LAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWPELERTQPGAWGRIQLTDAI 242 (297) T ss_pred EEEEECCCCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEECHHHHHHHHHCCCCCCCEEEHHHHH T ss_conf 99995787651579955655466775333565146778986677773377899728799999970899999858899999 Q ss_pred CHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHH Q ss_conf 0000013204631354125632784577999999 Q gi|254780922|r 202 SYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAV 235 (292) Q Consensus 202 ~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~ 235 (292) +.+++++.+ ..+..+| .|+|||+|.+|++|+. T Consensus 243 ~~l~~~~~v-~~~~~~G-~~~D~G~~~g~l~A~i 274 (297) T PRK10122 243 AELAKKQSV-DAMLMTG-DSYDCGKKMGYMQAFV 274 (297) T ss_pred HHHHCCCCE-EEEEECC-EEECCCCHHHHHHHHH T ss_conf 999701987-9999551-7975798626999999 No 6 >PRK13389 UTP--glucose-1-phosphate uridylyltransferase subunit GalU; Provisional Probab=100.00 E-value=0 Score=426.50 Aligned_cols=232 Identities=23% Similarity=0.421 Sum_probs=199.4 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCC----------- Q ss_conf 079924888745533443898222247837479999999997898689999777656431112222222----------- Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEK----------- 70 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~----------- 70 (292) ||||||||+||||+|+|+++||+|+||+|||+|+|+++.++.+||++|+||+++.+ +.+++|++.+.+ T Consensus 10 KaVI~AAG~GTRl~P~Tk~~PKemLPi~dkPii~~~vee~~~aGI~ei~iV~~~~K-~~I~d~F~~~~ele~~L~~~~k~ 88 (302) T PRK13389 10 KAVIPVAGLGTRMLPATKAIPKEMLPLVDKPLIQYVVNECIAAGITEIVLVTHSSK-NSIENHFDTSFELEAMLEKRVKR 88 (302) T ss_pred EEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEECCCC-CHHHHHCCCCHHHHHHHHHHHHH T ss_conf 79998986753668650469812154898875999999999869978999927984-13556347708888888763257 Q ss_pred -----------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCC-------CCCCCHHHHHCCCHHHHHC Q ss_conf -----------2111122233332244685210111233567521321443211-------1100001110000011100 Q gi|254780922|r 71 -----------WGVQFSYIEQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYG-------SDISDIFHKARARRNSATV 132 (292) Q Consensus 71 -----------~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~-------~~l~~~l~~~~~~~~~~ti 132 (292) .+++++|+.|.+|+|+|||+++|++|+++++|++++||+++.. .++.++++.+.+. ..++| T Consensus 89 ~~l~~~~~i~~~~~~i~~v~Q~~~~GlGhAV~~A~~~v~depF~VllgD~li~~~~~~l~~~~l~~li~~~~~~-g~~~I 167 (302) T PRK13389 89 QLLDEVQSICPPHVTIMQVRQGLAKGLGHAVLCAHPVVGDEPVAVILPDVILDEYESDLSQDNLAEMIRRFDET-GHSQI 167 (302) T ss_pred HHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECCHHHCCCCCCCCHHHHHHHHHHHHHC-CCCEE T ss_conf 77777653035883599986789876899999889862996579993633413556652056899999998614-98689 Q ss_pred CCCCCCCCCCCCCCCCC-------CCCCHHHCCCCCC--CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCH Q ss_conf 12222100012221222-------2100010013333--32222220245541000224322004675333333343200 Q gi|254780922|r 133 VGCHVQNPQRYGVVEVD-------SSNQAISIEEKPN--NPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSY 203 (292) Q Consensus 133 ~~~~v~~p~~yGvv~~d-------~~~~i~~i~EKP~--~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~ 203 (292) .+.+|+||++|||++++ +..+|.+|+|||+ +++||+|++|+|+|++++|+++++++||.|||+|+||+++. T Consensus 168 ~v~~Ved~s~YGVv~~~g~~~~~~~~~~I~~iVEKP~~e~aPSnlAi~GrYif~p~IF~~L~~~~p~~~GEiqLTDAi~~ 247 (302) T PRK13389 168 MVEPVADVTAYGVVDCKGVELAPGESVPMVGVVEKPKADVAPSNLAIVGRYVLSADIWPLLAKTPPGAGDEIQLTDAIDM 247 (302) T ss_pred EEEECCCCCCEEEEECCCCCCCCCCCCCEEEEEECCCCCCCCCCEEEEEEEEECHHHHHHHHCCCCCCCCCEEHHHHHHH T ss_conf 99986776324479635664456763322456743788789877578767887889999996079999980849999999 Q ss_pred HCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHH Q ss_conf 0001320463135412563278457799999999 Q gi|254780922|r 204 YLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFV 237 (292) Q Consensus 204 ~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i 237 (292) +++++.+. .+..+| .|+|||++.+|++|+... T Consensus 248 l~~~~~v~-~~~~~G-~r~D~G~~~gyl~A~i~~ 279 (302) T PRK13389 248 LIEKETVE-AYHMKG-KSHDCGNKLGYMQAFVEY 279 (302) T ss_pred HHHHCCEE-EEEEEE-EEECCCCCCHHHHHHHHH T ss_conf 98429769-999655-797876801379999999 No 7 >TIGR01099 galU UTP-glucose-1-phosphate uridylyltransferase; InterPro: IPR005771 Uridine 5'-triphosphate:glucose-1-phosphate uridylyltransferase (uridine diphosphoglucose pyrophosphorylase, 2.7.7.9 from EC) is responsible for the synthesis of UDP-glucose, a key compound in the biosynthesis of polysaccharides. Glucose is fermented through the glycolysis step to pyruvate, which in turn is converted to lactate. The glycolytic intermediate glucose 6-phosphate is converted to glucose 1-phosphate by phosphoglucomutase activity, and this metabolite is subsequently converted to UDP-glucose by UDP-glucose pyrophosphorylase (GalU) activity. ; GO: 0003983 UTP:glucose-1-phosphate uridylyltransferase activity, 0006011 UDP-glucose metabolic process. Probab=100.00 E-value=0 Score=414.80 Aligned_cols=230 Identities=25% Similarity=0.456 Sum_probs=198.1 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCC------------ Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222------------ Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSG------------ 68 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g------------ 68 (292) |||||||||+||||.|.|+++||.||||.|||+|+|+++.++.|||++|+|||+++++++.+|+--+. T Consensus 2 rKAViPaAGlGTRfLPATKA~PKEMLPiVdKP~IQY~VEEav~aGie~i~~VTGr~K~aIEDHFD~s~ELE~~L~~~~K~ 81 (270) T TIGR01099 2 RKAVIPAAGLGTRFLPATKAIPKEMLPIVDKPLIQYIVEEAVEAGIEDILIVTGRGKRAIEDHFDTSYELEHQLEKKNKE 81 (270) T ss_pred CEEEECCCCCCCCCCHHHHCCCCCCCCCCCCCEEHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 77885276753100234516886567645874210457989864893479996588501014258788899998751048 Q ss_pred ---------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC------CCCCHHHHHCCCHHHHHCC Q ss_conf ---------2221111222333322446852101112335675213214432111------1000011100000111001 Q gi|254780922|r 69 ---------EKWGVQFSYIEQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGS------DISDIFHKARARRNSATVV 133 (292) Q Consensus 69 ---------~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~------~l~~~l~~~~~~~~~~ti~ 133 (292) .+.+++|.|+.|.+++|+|||+++|+.+++|++|+++||||++.+. .+..|++.+.+.++.. |. T Consensus 82 ~lL~~v~~I~~~~~~i~yvRQke~kGLGhAvL~a~~~vGdEpFaV~LgDdi~~~~~~D~~~~Lkqm~~~Y~~~g~s~-ia 160 (270) T TIGR01099 82 ELLKEVRKISNLNANIFYVRQKEQKGLGHAVLCARPLVGDEPFAVLLGDDIVVSEEPDLQEALKQMIDLYEKYGCSI-IA 160 (270) T ss_pred HHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHHCCCCCCCCCEEEEECCEEEECCCCCHHHHHHHHHHHHHHHCCEE-EE T ss_conf 89999998626997289981388885512754352125888606763526563588754689999999998638817-88 Q ss_pred C--CCCCCCCCCCCCCCCC--C----CC--HHHCCCCCC--CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 2--2221000122212222--1----00--010013333--322222202455410002243220046753333333432 Q gi|254780922|r 134 G--CHVQNPQRYGVVEVDS--S----NQ--AISIEEKPN--NPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVN 201 (292) Q Consensus 134 ~--~~v~~p~~yGvv~~d~--~----~~--i~~i~EKP~--~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~ 201 (292) + ++.++.++|||+.... - .+ |.+++|||+ ++|||||++|+|+|+|+||+.+++++|++.||+||||++ T Consensus 161 V~~Vp~e~V~~YGii~~~~dy~~~~~y~s~i~~mVEKP~~e~APSNLAi~GRYvL~P~IF~~L~~t~~G~GgEIQLTDAl 240 (270) T TIGR01099 161 VEEVPKEEVSKYGIIDGEGDYLEEGLYESVIKDMVEKPKPEEAPSNLAIVGRYVLTPDIFDLLEETPPGKGGEIQLTDAL 240 (270) T ss_pred EEECCHHHCCCCEEECCCCCHHHCCCEEEEEECCEECCCCCCCCCCEEEECHHCCCHHHHHHHHHCCCCCCCCCHHHHHH T ss_conf 86344551775515727420001232234221414076678887421022001168568888761799999731188899 Q ss_pred CHHCCCCCEEEEECCCCEEEECCCCHHHHHHH Q ss_conf 00000132046313541256327845779999 Q gi|254780922|r 202 SYYLDKGLLAVEFLREGSAWFDAGTPESLLDT 233 (292) Q Consensus 202 ~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~A 233 (292) +.++++... .++..+| ..||||+..+|++| T Consensus 241 ~~l~~~e~v-~A~~f~G-~ryD~G~~~Gy~~A 270 (270) T TIGR01099 241 RKLLKKERV-LAYKFEG-KRYDCGSKLGYLKA 270 (270) T ss_pred HHHHHCCCE-EEEEEEC-CEEEECCCCCCEEC T ss_conf 998623977-8887503-16654483331009 No 8 >pfam00483 NTP_transferase Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars. Probab=100.00 E-value=0 Score=404.69 Aligned_cols=237 Identities=42% Similarity=0.678 Sum_probs=213.5 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 079924888745533443898222247837-4799999999978986899997776564311122222222111122233 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) ||||||||+||||+|+|..+||||+||+|+ |||+|+|++|..+|+++|++|+++++.++++++++++.+++++++|+.| T Consensus 1 KavIlAgG~GtRl~PlT~~~pKpllpv~~~~pli~~~l~~l~~~g~~~~~~v~~~~~~~~i~~~~~~~~~~~~~i~~~~e 80 (247) T pfam00483 1 KAIILAGGSGTRLWPLTRTLAKPLVPVLDKYPMIQYTLSRLMNAGIREPIVICTQEHRFLVAEQLGDGSKFGLQVTYALQ 80 (247) T ss_pred CEEEECCCCCCCCCCCCCCCCCCEEEECCCCCHHHHHHHHHHHCCCCHHEEECCHHHHHHHHHHHHHCCCCCCEEEEEEE T ss_conf 97998897777788010688988518289978999999999865995203313367788899998633667743777431 Q ss_pred CCCCCCCCCCCCCCCCCCCCC--CCEEECCCCCCCCCCCCHHHHHCCCHHHH--HCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 332244685210111233567--52132144321111000011100000111--00122221000122212222100010 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSS--SVLILGDNVFYGSDISDIFHKARARRNSA--TVVGCHVQNPQRYGVVEVDSSNQAIS 156 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~--~~lilgDni~~~~~l~~~l~~~~~~~~~~--ti~~~~v~~p~~yGvv~~d~~~~i~~ 156 (292) ++|+|||||+.+|++++.+++ .++++++|++++.++.++++.|.++.+++ ++..+++++|++||++++|++++|.+ T Consensus 81 ~~~~Gta~ai~~a~~~l~~~~~~~~lv~~~D~~~~~~~~~~l~~h~~~~~~~~~~~~~~~~~~~~~yGvv~~d~~~~V~~ 160 (247) T pfam00483 81 PEPRGTAPAVALAADFLGDDDPELVLVLGGDHIYRMDFEEAVQKARAKAADGTVTFGIVPVEDPTGYGVIEFDENGRVIR 160 (247) T ss_pred CCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCEEEECCCCCCCC T ss_conf 27777056899999864347876279977853344409999999998479986888556555763451489888665030 Q ss_pred CCCCCCCCC-CCCCHHHHHHHHHHHHHHHH-HCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCC-EEEECCCCHHHHHHH Q ss_conf 013333322-22220245541000224322-004675333333343200000132046313541-256327845779999 Q gi|254780922|r 157 IEEKPNNPK-SSFAVTGIYFYDQEVVNIAR-NIRPSARGELEITDVNSYYLDKGLLAVEFLREG-SAWFDAGTPESLLDT 233 (292) Q Consensus 157 i~EKP~~~~-sn~a~~GiY~f~~~i~~~~~-~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g-~~W~D~Gt~e~l~~A 233 (292) |+|||+.++ ||++++|+|+|++++|+++. .++++.++|++++|+++.++++|...+.+...+ .+|+|+|||++|++| T Consensus 161 ~~EKP~~~~~~~~~~~G~Y~~~~~i~~~~~~~~~~~~~~e~~~~d~i~~~i~~g~~~~~~~~~~~~~W~DiGt~e~~~~a 240 (247) T pfam00483 161 FVEKPDLPKASNYASMGIYFFNSGVFLFLAKYLKELARGEDEITDAIPAALEDGDLDLAFLFRGYAYWSDVGTWDSLWEA 240 (247) T ss_pred EEECCCCHHHHHHEEEEEEEECHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCEEEEECCCHHHHHHH T ss_conf 46779962443425611499889999999965777767994499999999987996399960798846869899999999 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780922|r 234 AVFVR 238 (292) Q Consensus 234 s~~i~ 238 (292) |+.+. T Consensus 241 n~~ll 245 (247) T pfam00483 241 NLKLL 245 (247) T ss_pred HHHHH T ss_conf 99986 No 9 >cd04189 G1P_TT_long G1P_TT_long represents the long form of glucose-1-phosphate thymidylyltransferase. This family is the long form of Glucose-1-phosphate thymidylyltransferase. Glucose-1-phosphate thymidylyltransferase catalyses the formation of dTDP-glucose, from dTTP and glucose 1-phosphate. It is the first enzyme in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.There are two forms of Glucose-1-phosphate thymidylyltransferase in bacteria and archeae; short form and long form. The long form, which has an extra 50 amino acids c-terminal, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced.The long from enzymes also have a left-handed parallel helix domain at the c-terminus, whereas, th eshort form enzymes do not have this domain. The homotetrameric, feedback inhibited short form is found in Probab=100.00 E-value=0 Score=399.30 Aligned_cols=235 Identities=38% Similarity=0.693 Sum_probs=218.1 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |||||||||+||||+|+|..+||||+||+|||||+|+|++|..+|+++|++++++.. +.++++++++.++++++.|+.| T Consensus 1 MkaiILAaG~GtRl~p~t~~~PK~Ll~i~gkpli~~~i~~l~~~gi~~Iiiv~~~~~-~~i~~~~~~~~~~~~~i~~v~~ 79 (236) T cd04189 1 MKGLILAGGKGTRLRPLTYTRPKQLIPVAGKPIIQYAIEDLREAGIEDIGIVVGPTG-EEIKEALGDGSRFGVRITYILQ 79 (236) T ss_pred CEEEEECCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCCEEEEEEECCH-HHCCCCCCCCCCCCCCEEEEEE T ss_conf 909998886876887000799802038998609999999999869958999854141-2201223466647875489862 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCCC Q ss_conf 33224468521011123356752132144321111000011100000111001222210001222122221000100133 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEEK 160 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~EK 160 (292) .+|+|||+|+..|++++++++|++++||+ ++..++.+++..|.+.+.++++++.++++|++||++.+| +++|++|+|| T Consensus 80 ~~~~Gt~~al~~a~~~l~~~~~iv~~gD~-~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~g~~~~~-~~~v~~i~eK 157 (236) T cd04189 80 EEPLGLAHAVLAARDFLGDEPFVVYLGDN-LIQEGISPLVRDFLEEDADASILLAEVEDPRRFGVAVVD-DGRIVRLVEK 157 (236) T ss_pred CCCCCHHHHHHHHHHHCCCCCEEEEECCC-EECCCHHHHHHHHHHCCCCCEEEEEEEECCCCCCEEEEC-CCEEEECCCC T ss_conf 15532457877766514899758972794-654689999999986278611478986246676289973-6235431325 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHH Q ss_conf 3332222220245541000224322004675333333343200000132046313541256327845779999999999 Q gi|254780922|r 161 PNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFVRN 239 (292) Q Consensus 161 P~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i~~ 239 (292) |+.+.|+++++|+|+|++++++.+++++++.++|++++|+++.++++|.....+...| .|+|+|||++|.+|++++.. T Consensus 158 ~~~~~~~~~~~Giy~f~~~i~~~~~~~~~~~~~e~~i~d~i~~li~~g~~v~~~~~~g-~W~Digtpedl~~A~~~l~~ 235 (236) T cd04189 158 PKEPPSNLALVGVYAFTPAIFDAISRLKPSWRGELEITDAIQWLIDRGRRVGYSIVTG-WWKDTGTPEDLLEANRLLLD 235 (236) T ss_pred CCCCCCCEEEEEEEEECHHHHHHHHHCCCCCCCCEEHHHHHHHHHHCCCCEEEEEECC-EEEECCCHHHHHHHHHHHHC T ss_conf 8898767599999999989999998479987881338999999998799889997199-79979899999999899871 No 10 >cd02541 UGPase_prokaryotic Prokaryotic UGPase catalyses the synthesis of UDP-glucose. Prokaryotic UDP-Glucose Pyrophosphorylase (UGPase) catalyzes a reversible production of UDP-Glucose and pyrophosphate (PPi) from glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids , glycoproteins , and proteoglycans. UGPase is found in both prokaryotes and eukaryotes, although prokaryotic and eukaryotic forms of UGPase catalyze the same reaction, they share low sequence similarity. Probab=100.00 E-value=0 Score=397.72 Aligned_cols=232 Identities=26% Similarity=0.480 Sum_probs=198.3 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCC------------ Q ss_conf 07992488874553344389822224783747999999999789868999977765643111222222------------ Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGE------------ 69 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~------------ 69 (292) ||||||||+||||+|+|..+||||+||+|||||+|+|++|.++||++|+|+++|.... +.+|++... T Consensus 2 KAvILAgG~GTRL~PlT~~~PKpLlpi~gkPii~~~i~~l~~~Gi~~iii~~~~~~~~-I~~~f~~~~~~~~~l~~~~k~ 80 (267) T cd02541 2 KAVIPAAGLGTRFLPATKAIPKEMLPIVDKPVIQYIVEEAVAAGIEDIIIVTGRGKRA-IEDHFDRSYELEETLEKKGKT 80 (267) T ss_pred EEEEECCCCCCCCCHHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEEECCHHH-HHHHHCCCHHHHHHHHHCCCH T ss_conf 6999898774667842469984364899998699999999986998899991246666-244506658899998526856 Q ss_pred ---------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC--CCCCHHHHHCCCHHHHHCCCCC-- Q ss_conf ---------221111222333322446852101112335675213214432111--1000011100000111001222-- Q gi|254780922|r 70 ---------KWGVQFSYIEQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGS--DISDIFHKARARRNSATVVGCH-- 136 (292) Q Consensus 70 ---------~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~--~l~~~l~~~~~~~~~~ti~~~~-- 136 (292) .++++++|+.|++|+|||||+.+|++++++++|++++||+++... .+.+++..|.++.+. ++.+.. T Consensus 81 ~~~~~~~~~~~~~~i~~~~e~~~lGtggai~~a~~~~~~~~flv~~gD~l~~~~~~~l~~li~~~~~~~~~-~~~~~~v~ 159 (267) T cd02541 81 DLLEEVRIISDLANIHYVRQKEPLGLGHAVLCAKPFIGDEPFAVLLGDDLIDSKEPCLKQLIEAYEKTGAS-VIAVEEVP 159 (267) T ss_pred HHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCCEEEEECCEEEECCCHHHHHHHHHHHHHCCE-EEEEEEEC T ss_conf 55555422478955999977898876789872101049986799968988853750668999999730872-68999968 Q ss_pred CCCCCCCCCCCCCCCC----CHHHCCCCCC--CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCE Q ss_conf 2100012221222210----0010013333--322222202455410002243220046753333333432000001320 Q gi|254780922|r 137 VQNPQRYGVVEVDSSN----QAISIEEKPN--NPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLL 210 (292) Q Consensus 137 v~~p~~yGvv~~d~~~----~i~~i~EKP~--~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~ 210 (292) .+++++|||++++..+ +|++|+|||+ +|+||++++|+|+|++++|+++++..++.++|++++|+++.+++++++ T Consensus 160 ~~~~~~yGvv~~~~~~~~~~~i~~~~EKP~~~~~~s~l~~~GiYi~~~~i~~~i~~~~~~~~~e~~l~d~i~~li~~~~v 239 (267) T cd02541 160 PEDVSKYGIVKGEKIDGDVFKVKGLVEKPKPEEAPSNLAIVGRYVLTPDIFDILENTKPGKGGEIQLTDAIAKLLEEEPV 239 (267) T ss_pred CCCCCCCCEEEECCCCCCEEEEEEEEECCCCCCCCCCEEEEEEEEECHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCE T ss_conf 66787364389667787068888764068876798531663449988999878972899999833077999999857996 Q ss_pred EEEECCCCEEEECCCCHHHHHHHHHHH Q ss_conf 463135412563278457799999999 Q gi|254780922|r 211 AVEFLREGSAWFDAGTPESLLDTAVFV 237 (292) Q Consensus 211 ~~~~~~~g~~W~D~Gt~e~l~~As~~i 237 (292) ..+...| .|+|+|||++|++||..+ T Consensus 240 -~~~~~~g-~W~DiGtp~~y~~A~~~~ 264 (267) T cd02541 240 -YAYVFEG-KRYDCGNKLGYLKATVEF 264 (267) T ss_pred -EEEEECC-EEEECCCHHHHHHHHHHH T ss_conf -9998598-889579989999999998 No 11 >cd06425 M1P_guanylylT_B_like_N N-terminal domain of the M1P-guanylyltransferase B-isoform like proteins. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain and a C-terminal Lefthanded-beta-Helix fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repression of GDP-mannose pyrophosphorylase in yeast leads to phenotypes, such as cell lysis, defective cell wall, and failure of polarized growth and cell separation. Probab=100.00 E-value=0 Score=388.28 Aligned_cols=228 Identities=26% Similarity=0.454 Sum_probs=199.0 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCC-CCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222-22211112223 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSG-EKWGVQFSYIE 79 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g-~~~g~~i~y~~ 79 (292) |||||||||+||||+|+|+.+||||+||+|||+|+|+|+.|..+||++|+|+++|.. +++.+++++. .+++++|+|+. T Consensus 1 MkavIlagG~GtRl~plT~~~PKpLlpv~gkpli~~~i~~l~~~gi~~iii~~~y~~-~~i~~~~~~~~~~~~~~i~~~~ 79 (233) T cd06425 1 MKALILVGGYGTRLRPLTLTVPKPLVEFCNKPMIEHQIEALAKAGVKEIILAVNYRP-EDMVPFLKEYEKKLGIKITFSI 79 (233) T ss_pred CEEEEECCCCCCCCCCHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCCCCCHHHHHH-HHHHHHHHHHHHHCCCEEEECC T ss_conf 969998897765788013299830108999869999999999859973210112357-7999999877774396056522 Q ss_pred CCCCCCCCCCCCCCCCCCC--CCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCC-CCHHH Q ss_conf 3332244685210111233--56752132144321111000011100000111001222210001222122221-00010 Q gi|254780922|r 80 QLVPAGLAQSYILGAEFIG--DSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSS-NQAIS 156 (292) Q Consensus 80 q~~p~Gta~Ai~~a~~~i~--~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~-~~i~~ 156 (292) |++|+|||||+..|+++++ +++|++++||+ +++.++.+++..|.++++.+++++.++++|++||++.+|++ ++|.+ T Consensus 80 e~~~lGtagal~~a~~~i~~~~~~flVlngD~-~~~~~l~~~~~~h~~~~~~~ti~~~~~~~~~~yGv~~~d~~~g~v~~ 158 (233) T cd06425 80 ETEPLGTAGPLALARDLLGDDDEPFFVLNSDV-ICDFPLAELLDFHKKHGAEGTILVTKVEDPSKYGVVVHDENTGRIER 158 (233) T ss_pred CCCCCCCHHHHHHHHHHHCCCCCCEEEECCCH-HCCHHHHHHHHHHHHCCCCEEEEEEECCCHHHCCEEEECCCCCCCCC T ss_conf 57666721699999998468998689977832-10203699999998558984999997678455680886687674222 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHH Q ss_conf 01333332222220245541000224322004675333333343200000132046313541256327845779999999 Q gi|254780922|r 157 IEEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVF 236 (292) Q Consensus 157 i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~ 236 (292) |+|||+.+.|+++++|+|+|++++|+.++... .+++ .|+++.+++++++. .+..+| +|+|+|||++|++|+++ T Consensus 159 f~EKP~~~~~~~i~~Giyi~~~~i~~~i~~~~----~~~~-~di~~~l~~~~~v~-~~~~~g-~W~DiGtp~~~l~A~~~ 231 (233) T cd06425 159 FVEKPKVFVGNKINAGIYILNPSVLDRIPLRP----TSIE-KEIFPKMASEGQLY-AYELPG-FWMDIGQPKDFLKGMSL 231 (233) T ss_pred EEECCCCCCCCCEEEEEEEECHHHHHHCCCCC----CCCH-HHHHHHHHHCCCEE-EEEECC-EEEECCCHHHHHHHHHH T ss_conf 03778888898703689999899996566688----8746-99999998479979-998599-99949798999999998 Q ss_pred H Q ss_conf 9 Q gi|254780922|r 237 V 237 (292) Q Consensus 237 i 237 (292) . T Consensus 232 ~ 232 (233) T cd06425 232 Y 232 (233) T ss_pred H T ss_conf 6 No 12 >cd06428 M1P_guanylylT_A_like_N N-terminal domain of M1P_guanylyl_A_ like proteins are likely to be a isoform of GDP-mannose pyrophosphorylase. N-terminal domain of the M1P-guanylyltransferase A-isoform like proteins: The proteins of this family are likely to be a isoform of GDP-mannose pyrophosphorylase. Their sequences are highly conserved with mannose-1-phosphate guanyltransferase, but generally about 40-60 bases longer. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repre Probab=100.00 E-value=0 Score=382.38 Aligned_cols=230 Identities=22% Similarity=0.407 Sum_probs=198.3 Q ss_pred EEEECCC--CCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHH-CCCCEEEEEECCCCCCCCCCCCCCCC-CCCCCCCCC Q ss_conf 7992488--8745533443898222247837479999999997-89868999977765643111222222-221111222 Q gi|254780922|r 3 GIVLAGG--SGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMD-AGIREILIISTPRDLPVLKEFLGSGE-KWGVQFSYI 78 (292) Q Consensus 3 aiIlAaG--~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~-~Gi~~I~ii~~~~~~~~~~~~l~~g~-~~g~~i~y~ 78 (292) ||||||| +||||+|+|+.+||||+||+|||||+|++++|.+ +|+++|++++++. .+++.+++.+++ .++++++|+ T Consensus 1 AvILaGG~~~GTRL~plT~~~PKpllpv~gkPiI~~~i~~l~~~~gi~~i~i~~~~~-~~~~~~~~~~~~~~~~~~i~~~ 79 (257) T cd06428 1 AVILVGGPQKGTRFRPLSLDVPKPLFPVAGKPMIHHHIEACAKVPDLKEVLLIGFYP-ESVFSDFISDAQQEFNVPIRYL 79 (257) T ss_pred CEEECCCCCCCCCCCHHHCCCCCCCCEECCEEEHHHHHHHHHHHCCCCEEEEECCCC-HHHHHHHHHCCCCCCCCCEEEE T ss_conf 989788999973167433799844257999954199999999747984899980575-7999998742343369605998 Q ss_pred CCCCCCCCCCCCCCCCCCCCC---CCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCCCCCCCCCCC-CC Q ss_conf 333322446852101112335---67521321443211110000111000001110012222--1000122212222-10 Q gi|254780922|r 79 EQLVPAGLAQSYILGAEFIGD---SSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQRYGVVEVDS-SN 152 (292) Q Consensus 79 ~q~~p~Gta~Ai~~a~~~i~~---~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~~yGvv~~d~-~~ 152 (292) .|.+|+|||||+..+++++.. +.|++++|| ++++.++.++++.|.++++.+|+++.++ ++|++||++..|+ ++ T Consensus 80 ~e~~~lGtagal~~~~~~i~~~~~~~~lv~~gD-~l~~~dl~~l~~~h~~~~~~~ti~~~~v~~~~~s~fGvvv~~~~~~ 158 (257) T cd06428 80 QEYKPLGTAGGLYHFRDQILAGNPSAFFVLNAD-VCCDFPLQELLEFHKKHGASGTILGTEASREQASNYGCIVEDPSTG 158 (257) T ss_pred ECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCC-CCCCCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEEEEEECCCC T ss_conf 647666736789999999734899769998885-0038789999999997799869999987821365402899978999 Q ss_pred CHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCC----------------CCCCHHCCCCCEEEEECC Q ss_conf 0010013333322222202455410002243220046753333333----------------432000001320463135 Q gi|254780922|r 153 QAISIEEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEIT----------------DVNSYYLDKGLLAVEFLR 216 (292) Q Consensus 153 ~i~~i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eit----------------D~~~~~l~~~~~~~~~~~ 216 (292) +|++|+|||++++||+|++|+|+|++++|++++++.+|.|+|++++ |+++.++++|++ ..+.. T Consensus 159 ~i~~f~EKP~~~~s~lin~G~Yvf~~~if~~i~~~~~~~~~e~~~~d~~~~~~~~~~~~~e~dvlp~l~~~g~l-~~y~~ 237 (257) T cd06428 159 EVLHYVEKPETFVSDLINCGVYLFSPEIFDTIKKAFQSRQQEAQLGDDNNREGRAEVIRLEQDVLTPLAGSGKL-YVYKT 237 (257) T ss_pred CEEEEEECCCCCCCCEEEEEEEEECHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEECHHHHHHHHHCCCE-EEEEC T ss_conf 59999988999888808999999988999999751845467410134333346664440003267999857988-99938 Q ss_pred CCEEEECCCCHHHHHHHHHH Q ss_conf 41256327845779999999 Q gi|254780922|r 217 EGSAWFDAGTPESLLDTAVF 236 (292) Q Consensus 217 ~g~~W~D~Gt~e~l~~As~~ 236 (292) +| +|.|+|||++|++||.. T Consensus 238 ~g-~W~DiGTp~~~~~AN~~ 256 (257) T cd06428 238 DD-FWSQIKTAGSAIYANRL 256 (257) T ss_pred CC-EEECCCCHHHHHHHHCC T ss_conf 98-38759898999998743 No 13 >PRK05293 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=380.93 Aligned_cols=241 Identities=27% Similarity=0.447 Sum_probs=203.0 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC--- Q ss_conf 9079924888745533443898222247837-479999999997898689999777656431112222222211112--- Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFS--- 76 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~--- 76 (292) |||||||||+||||+|+|..+||||+||+|| |||+|+|++|.++||++|.+++++.. +.+.+|+++|++|+++.. T Consensus 4 MkAvILAgG~GtRlrPLT~~rPK~llPv~gk~~lId~~L~~~~~~gI~~v~v~~~~~~-~~i~~h~g~g~~w~l~~~~~g 82 (381) T PRK05293 4 MLAMILAGGQGTRLGKLTKNIAKPAVPFGGKYRIIDFTLSNCANSGIYTVGVLTQYQP-LELNNHIGIGSPWDLDRKNGG 82 (381) T ss_pred CEEEEECCCCCCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCH-HHHHHHHCCCCCCCCCCCCCC T ss_conf 0999978888767254458974203377891512899999999869999999868997-999998717650364344487 Q ss_pred ------CCCCCCC---CCCCCCCCCCCCCCCC--CCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCCCC Q ss_conf ------2233332---2446852101112335--67521321443211110000111000001110012222--100012 Q gi|254780922|r 77 ------YIEQLVP---AGLAQSYILGAEFIGD--SSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQRY 143 (292) Q Consensus 77 ------y~~q~~p---~Gta~Ai~~a~~~i~~--~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~~y 143 (292) |..|..+ +|||||++++++|+.+ ++++|++++|++++.++.++++.|.++++++|+++.++ ++|++| T Consensus 83 ~~il~~~~~~~~~~~~~Gta~al~~~~~~i~~~~~e~vlv~~GD~i~~~dl~~ll~~H~~~~ad~Ti~~~~v~~~~~~~y 162 (381) T PRK05293 83 VTILPPYSESSGGKWYKGTAHAIYQNIEYIDQYDPEYVLILSGDHIYKMDYDKMLDYHKEKEADVTIAVIEVPWEEASRF 162 (381) T ss_pred EEECCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCEEECCCHHHHHHHHHHCCCCCCEEEEEECCCCCCCC T ss_conf 79834300146776667428999999998644899989997699886789999999998646665037898051264416 Q ss_pred CCCCCCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHH-HHHHHCCCCCCCC-CCCCCCCCHHCCCCCEEEEECCCCEEE Q ss_conf 22122221000100133333222222024554100022-4322004675333-333343200000132046313541256 Q gi|254780922|r 144 GVVEVDSSNQAISIEEKPNNPKSSFAVTGIYFYDQEVV-NIARNIRPSARGE-LEITDVNSYYLDKGLLAVEFLREGSAW 221 (292) Q Consensus 144 Gvv~~d~~~~i~~i~EKP~~~~sn~a~~GiY~f~~~i~-~~~~~l~~s~rgE-~eitD~~~~~l~~~~~~~~~~~~g~~W 221 (292) ||+..|++++|++|+|||+.++|+++++|+|+|+++++ +++....++...+ ...+|+++.+++++.....+..+| +| T Consensus 163 Gvv~~d~~g~I~~~~EKp~~~~s~l~n~GiYif~~~~L~~~l~~~~~~~~~~~~~~~d~i~~ll~~~~~v~~y~~~g-~w 241 (381) T PRK05293 163 GIMNTDEEMRIVEFEEKPKFPKSNLASMGIYIFNWKRLKEYLEEDEKNPNSSHDFGKDVIPLYLEEGEKLYAYPFEG-YW 241 (381) T ss_pred EEEEECCCCCEEEEEECCCCCCCCCCCCCEEEECHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECC-EE T ss_conf 08998799979999978899876654331699569999999987513886334668999999997799689998558-68 Q ss_pred ECCCCHHHHHHHHHHHHHHHHH Q ss_conf 3278457799999999998888 Q gi|254780922|r 222 FDAGTPESLLDTAVFVRNIENR 243 (292) Q Consensus 222 ~D~Gt~e~l~~As~~i~~~e~~ 243 (292) .|+|||++|++||+.+..-++. T Consensus 242 ~dig~~~~~~~an~~lL~~~~~ 263 (381) T PRK05293 242 KDVGTIESLWEANMELLRPENP 263 (381) T ss_pred EECCCHHHHHHHHHHHHCCCCC T ss_conf 8889889999999998666764 No 14 >TIGR02091 glgC glucose-1-phosphate adenylyltransferase; InterPro: IPR011831 This enzyme, glucose-1-phosphate adenylyltransferase, is also called ADP-glucose pyrophosphorylase. The plant form is an alpha2, beta2 heterodimer, allosterically regulated in plants. Both subunits are homologous and included in this entry. In bacteria, both homomeric forms of GlgC and more active heterodimers of GlgC and GlgD have been described. This entry describes the GlgC subunit only. This enzyme appears in variants of glycogen synthesis pathways that use ADP-glucose, rather than UDP-glucose as in animals.; GO: 0008878 glucose-1-phosphate adenylyltransferase activity. Probab=100.00 E-value=0 Score=382.55 Aligned_cols=245 Identities=24% Similarity=0.441 Sum_probs=201.7 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCC---- Q ss_conf 9079924888745533443898222247837-47999999999789868999977765643111222222221111---- Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQF---- 75 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i---- 75 (292) |=|||||||.||||+|||+.+.||-||++|| +|||++|+||+++||.+|+|+|.|+.... .+|++.|+.|++.= T Consensus 1 ~la~iLAGG~GsRL~pLT~~rAKPAVpFGGkYRiIDF~LSNc~NSGi~ri~VLTQY~~~sL-~~Hi~iG~~W~~~~~~~~ 79 (421) T TIGR02091 1 VLAMILAGGQGSRLSPLTKRRAKPAVPFGGKYRIIDFALSNCINSGINRIGVLTQYKSHSL-NRHIQIGRGWDFDGPEIG 79 (421) T ss_pred CEEEEECCCCCCCCCHHHHHHCCCCCCCCCCEEEECCCHHHHHHCCCCEEEEEEECCHHHH-HHHHHCCCCCCCCCCCCC T ss_conf 9788943888754613346525776555630145210013555415445788850516788-888406876567744247 Q ss_pred CCC-----CCC----CC---CCCCCCCCCCCCCCCC-C--CCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--C Q ss_conf 222-----333----32---2446852101112335-6--7521321443211110000111000001110012222--1 Q gi|254780922|r 76 SYI-----EQL----VP---AGLAQSYILGAEFIGD-S--SSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--Q 138 (292) Q Consensus 76 ~y~-----~q~----~p---~Gta~Ai~~a~~~i~~-~--~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~ 138 (292) .|+ .|. .+ +|||+|++++..+|.+ . +++|||++|++|.+|+.+||+.|.++++++||.+.+| + T Consensus 80 ~fv~~Lpa~~~~~~~~~~WY~GTADAvYQNl~~i~~~~~p~yvlILsGDHiYkMDy~~mL~~H~e~~Ad~Tia~~~Vp~~ 159 (421) T TIGR02091 80 GFVELLPAQQRFEESGKDWYQGTADAVYQNLDLIEETYDPEYVLILSGDHIYKMDYEKMLDYHIEKGADLTIACIPVPRK 159 (421) T ss_pred CCEEEECCCEEECCCCCCCEEECHHHHHHHHHHHHCCCCCCEEEEECCCCEECCCHHHHHHHHHHCCCCEEEEEEECCHH T ss_conf 62676437511126788532112378899999874016924899934661010487899999985679879985427855 Q ss_pred CCCCCCCCCCCCCCCHHHCCCCCCCCCCC----------------------CCHHHHHHHHHHHH-HHHHHCCCCCCCCC Q ss_conf 00012221222210001001333332222----------------------22024554100022-43220046753333 Q gi|254780922|r 139 NPQRYGVVEVDSSNQAISIEEKPNNPKSS----------------------FAVTGIYFYDQEVV-NIARNIRPSARGEL 195 (292) Q Consensus 139 ~p~~yGvv~~d~~~~i~~i~EKP~~~~sn----------------------~a~~GiY~f~~~i~-~~~~~l~~s~rgE~ 195 (292) ++|+|||+++|++++|++|+|||++|++. +|+||||+|+.+++ ++|.+-..+...-. T Consensus 160 eAs~fGvm~vD~~g~i~~F~EKP~~P~~~~~~~~~~s~~~~~~~~~~~~~yLASMGiYiF~~~~L~~~L~~d~~~~~s~~ 239 (421) T TIGR02091 160 EASRFGVMQVDEDGRIVDFEEKPANPPSIPGDPDKSSDLGLSKLDAQKGPYLASMGIYIFDKDVLKELLEEDAKDPESSH 239 (421) T ss_pred HCCCCCEEEECCCCCEEEEECCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCEEECCEEEECHHHHHHHHHHHHCCCCCCC T ss_conf 51528728887899688888268888667777677444045514533477447555136457899999999753876777 Q ss_pred CC-CCCCCHHCCCC-CEEEEECCC--------CEEEECCCCHHHHHHHHHHHHHHH----HHCCC Q ss_conf 33-34320000013-204631354--------125632784577999999999988----88098 Q gi|254780922|r 196 EI-TDVNSYYLDKG-LLAVEFLRE--------GSAWFDAGTPESLLDTAVFVRNIE----NRLGL 246 (292) Q Consensus 196 ei-tD~~~~~l~~~-~~~~~~~~~--------g~~W~D~Gt~e~l~~As~~i~~~e----~~~g~ 246 (292) +. .|+|+.++.+| .+.+..+.+ ..||.||||.++|+|||+++-... ...++ T Consensus 240 DFGkdiIP~~~~~g~~~~Ay~F~~s~v~g~~~e~YWrDVGTidsfweANmdL~~~~~PqvP~f~l 304 (421) T TIGR02091 240 DFGKDIIPKLLEEGKSVQAYDFSQSCVPGEEKEGYWRDVGTIDSFWEANMDLVSEVKPQVPPFDL 304 (421) T ss_pred CCCHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCEEEECHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 54435688985089718998046760568788876223004065667518762788888887776 No 15 >cd06915 NTP_transferase_WcbM_like WcbM_like is a subfamily of nucleotidyl transferases. WcbM protein of Burkholderia mallei is involved in the biosynthesis, export or translocation of capsule. It is a subfamily of nucleotidyl transferases that transfer nucleotides onto phosphosugars. Probab=100.00 E-value=0 Score=373.19 Aligned_cols=223 Identities=25% Similarity=0.413 Sum_probs=203.7 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431112222222211112223333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLV 82 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~ 82 (292) |||||||+||||+|+|..+||||+||+|||||+|+|++|.++||++|+|+++++. +++.+++++..++++++.|+.|++ T Consensus 1 AvIlAaG~GtRl~p~T~~~PKpLl~i~gkpli~~~l~~l~~~gi~~iii~~~~~~-~~i~~~~~~~~~~~~~i~~~~e~~ 79 (223) T cd06915 1 AVILAGGLGTRLRSVVKDLPKPLAPVAGRPFLEYLLEYLARQGISRIVLSVGYLA-EQIEEYFGDGYRGGIRIYYVIEPE 79 (223) T ss_pred CEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEEECCHH-HHHHHHHHCCCCCCCCCEEEEEEE T ss_conf 9894787886788110799831128999989999999999759977887301023-433443310234577614788642 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCC Q ss_conf 22446852101112335675213214432111100001110000011100122221000122212222100010013333 Q gi|254780922|r 83 PAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEEKPN 162 (292) Q Consensus 83 p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~EKP~ 162 (292) |+|||+|+..|++++++++|++++||++ ++.++.+++..|.++++.++++++++++|++||++++|++++|+++.|||. T Consensus 80 ~lGt~gal~~a~~~i~~~~flv~~gD~~-~~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~g~v~~d~~~~i~~~~eK~~ 158 (223) T cd06915 80 PLGTGGAIKNALPKLPEDQFLVLNGDTY-FDVDLLALLAALRASGADATMALRRVPDASRYGNVTVDGDGRVIAFVEKGP 158 (223) T ss_pred ECCHHHHHHHHHHHCCCCCEEEECCCEE-ECCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCCEEECCCCCEEEEEECCC T ss_conf 3646889988886369998899827711-267789999999857997899999899830278589879731423475489 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHH Q ss_conf 322222202455410002243220046753333333432000001320463135412563278457799999 Q gi|254780922|r 163 NPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTA 234 (292) Q Consensus 163 ~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As 234 (292) .+.++++++|+|+|++++|+.++...++ ..+|+++.+++++++ ..+...| .|+|+||||+|.+|. T Consensus 159 ~~~~~~i~~Giyi~~~~i~~~~~~~~~~-----~~~d~~~~li~~~~v-~~~~~~g-~w~DiGt~e~~~~A~ 223 (223) T cd06915 159 GAAPGLINGGVYLLRKEILAEIPADAFS-----LEADVLPALVKRGRL-YGFEVDG-YFIDIGIPEDYARAQ 223 (223) T ss_pred CCCCCEEEEEEEEECHHHHHHHHCCCCC-----CHHHHHHHHHHCCCE-EEEEECC-EEEECCCHHHHHHHC T ss_conf 8766824463899989999741425997-----279999999834997-9999799-199798989999739 No 16 >COG1210 GalU UDP-glucose pyrophosphorylase [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=362.02 Aligned_cols=233 Identities=27% Similarity=0.486 Sum_probs=196.1 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCC-----------C Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222-----------2 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSG-----------E 69 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g-----------~ 69 (292) |||||+|||+||||.|.|+++||.|+||.|||+|+|+++.++.+||++|+|||++++..+ ++||... + T Consensus 5 rKAViPaAGlGTRfLPATKaiPKEMLPIvdKP~IqYiVeEa~~aGIe~i~iVTgr~K~~I-eDhFD~s~ELE~~L~~~~K 83 (291) T COG1210 5 RKAVIPAAGLGTRFLPATKAIPKEMLPIVDKPLIQYIVEEAVAAGIEEILIVTGRGKRAI-EDHFDTSYELENTLEKRGK 83 (291) T ss_pred CEEEEECCCCCCCCCCCCCCCCHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCHH-HHHCCCCHHHHHHHHHHCH T ss_conf 179887467645546600357664436667221999999999869977999966983568-8867686999999987277 Q ss_pred ----------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC--CCCCHHHHHCCCHHHHHCCCCCC Q ss_conf ----------221111222333322446852101112335675213214432111--10000111000001110012222 Q gi|254780922|r 70 ----------KWGVQFSYIEQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGS--DISDIFHKARARRNSATVVGCHV 137 (292) Q Consensus 70 ----------~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~--~l~~~l~~~~~~~~~~ti~~~~v 137 (292) .-.++++|+.|.+|+|+|||++||++|++++||+++||||++.+. .+..+++.+.+... ..+.+.+| T Consensus 84 ~~~L~~v~~i~~~~~i~~vRQ~e~~GLGhAVl~A~~~vg~EpFaVlL~Ddl~~~~~~~l~qmi~~y~~~g~-svi~v~ev 162 (291) T COG1210 84 RELLEEVRSIPPLVTISFVRQKEPLGLGHAVLCAKPFVGDEPFAVLLPDDLVDSEKPCLKQMIELYEETGG-SVIGVEEV 162 (291) T ss_pred HHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCEEEEECCEEECCCCHHHHHHHHHHHHHCC-CEEEEEEC T ss_conf 99999997235775399986587676017888535432898569981772632780699999999998599-27999988 Q ss_pred --CCCCCCCCCC----CCCC-CCHHHCCCCC--CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCC Q ss_conf --1000122212----2221-0001001333--33222222024554100022432200467533333334320000013 Q gi|254780922|r 138 --QNPQRYGVVE----VDSS-NQAISIEEKP--NNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKG 208 (292) Q Consensus 138 --~~p~~yGvv~----~d~~-~~i~~i~EKP--~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~ 208 (292) ++.++|||+. .+++ .+|.+++||| ++++||+|++|.|+|+++||+.+++.+|.++||+++||.+..++++. T Consensus 163 ~~e~v~kYGvi~~g~~~~~~~~~v~~~VEKP~~~~APSnlai~GRYil~p~IF~~L~~~~~G~ggEiQLTDai~~L~~~~ 242 (291) T COG1210 163 PPEDVSKYGVIDPGEPVEKGVYKVKGMVEKPKPEEAPSNLAIVGRYVLTPEIFDILEETKPGAGGEIQLTDAIKKLLKKE 242 (291) T ss_pred CHHHCCCCCEEECCCCCCCCEEEEEEEEECCCCCCCCCCEEEEEEEECCHHHHHHHHHCCCCCCCEEEHHHHHHHHHHHC T ss_conf 87877656267248503577289977776699999985415765465689999999627999998766799999998538 Q ss_pred CEEEEECCCCEEEECCCCHHHHHHHHHHH Q ss_conf 20463135412563278457799999999 Q gi|254780922|r 209 LLAVEFLREGSAWFDAGTPESLLDTAVFV 237 (292) Q Consensus 209 ~~~~~~~~~g~~W~D~Gt~e~l~~As~~i 237 (292) . ...+..+| ..+|||++.+|++|+... T Consensus 243 ~-v~a~~~~G-kryD~G~k~Gyi~a~v~~ 269 (291) T COG1210 243 P-VLAYVFEG-KRYDCGSKLGYIKANVEF 269 (291) T ss_pred C-EEEEEECC-CEECCCCCCCHHHHHHHH T ss_conf 5-79999456-276168712489999999 No 17 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=100.00 E-value=0 Score=353.06 Aligned_cols=233 Identities=33% Similarity=0.568 Sum_probs=205.2 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |||||||||+||||+|+|.++||||+||+|||||+|+|+.|..+|+++|+++++|.. ++++++++++..++.+|.|+.| T Consensus 2 mkavILagG~GtRLrPlT~~~PKPllpI~gkPii~~~l~~L~~~Gv~eivi~~~y~~-~~i~~~~~d~~~~~~~I~y~~e 80 (358) T COG1208 2 MKAVILAGGYGTRLRPLTDDRPKPLLPIAGKPLIEYVLEALAAAGVEEIVLVVGYLG-EQIEEYFGDGEGLGVRITYVVE 80 (358) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCH-HHHHHHHHHCCCCCCCEEEEEC T ss_conf 359999466566635544899744657998645999999999879978999877468-9999998626356986599716 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCC-CCHHHCCC Q ss_conf 33224468521011123356752132144321111000011100000111001222210001222122221-00010013 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSS-NQAISIEE 159 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~-~~i~~i~E 159 (292) .+|+|||+|+..+.+++++++|++++||. +++.++..+++.|.++...+++....+.+|+.||++..+.+ ++|.+|.| T Consensus 81 ~~~lGTag~l~~a~~~l~~~~f~v~~GDv-~~~~dl~~l~~~~~~~~~~~~~~~~~~~~~~~~Gvv~~~~~~~~v~~f~e 159 (358) T COG1208 81 KEPLGTAGALKNALDLLGGDDFLVLNGDV-LTDLDLSELLEFHKKKGALATIALTRVLDPSEFGVVETDDGDGRVVEFRE 159 (358) T ss_pred CCCCCCHHHHHHHHHHHCCCCEEEEECCC-CCCCCHHHHHHHHHHCCCCCEEEEEECCCCCCCCEEEECCCCCEEEEEEE T ss_conf 87666489999998860799789997886-22468899999987356642799875277655863997599848988997 Q ss_pred CCC--CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHH Q ss_conf 333--322222202455410002243220046753333333432000001320463135412563278457799999999 Q gi|254780922|r 160 KPN--NPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFV 237 (292) Q Consensus 160 KP~--~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i 237 (292) ||. .+.|+++++|+|+|++++|+++....++. ++- |+++.+++++.....+..+| +|+|+|||++|.+|+..+ T Consensus 160 kp~~~~~~~~~in~Giyi~~~~v~~~i~~~~~~~---~~~-~~~~~l~~~~~~v~~~~~~g-~W~dig~p~d~~~a~~~~ 234 (358) T COG1208 160 KPGPEEPPSNLINAGIYIFDPEVFDYIEKGERFD---FEE-ELLPALAAKGEDVYGYVFEG-YWLDIGTPEDLLEANELL 234 (358) T ss_pred CCCCCCCCCCEEEEEEEEECHHHHCCCCCCCCCC---CHH-HHHHHHHHCCCCEEEEEECC-EEEECCCHHHHHHHHHHH T ss_conf 2667678886699999998978943013357432---147-78999987699569997578-799699989999999999 Q ss_pred HHH Q ss_conf 998 Q gi|254780922|r 238 RNI 240 (292) Q Consensus 238 ~~~ 240 (292) ..- T Consensus 235 ~~~ 237 (358) T COG1208 235 LRG 237 (358) T ss_pred HHH T ss_conf 851 No 18 >cd04181 NTP_transferase NTP_transferases catalyze the transfer of nucleotides onto phosphosugars. Nucleotidyltransferases transfer nucleotides onto phosphosugars. The enzyme family includes Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. The products are activated sugars that are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Probab=100.00 E-value=0 Score=349.03 Aligned_cols=217 Identities=35% Similarity=0.649 Sum_probs=198.6 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431112222222211112223333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLV 82 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~ 82 (292) |||||||+||||+|+|..+||||+||+|||||+|+|++|..+|+++|+|++++.. +++.+++.+...++.+++|+.|++ T Consensus 1 AiIlaaG~GtRl~~~t~~~PK~Ll~v~gkplI~~~l~~l~~~g~~~iiiv~~~~~-~~i~~~~~~~~~~~~~i~~~~e~~ 79 (217) T cd04181 1 AVILAAGKGTRLRPLTDTRPKPLLPIAGKPILEYIIERLARAGIDEIILVVGYLG-EQIEEYFGDGSKFGVNIEYVVQEE 79 (217) T ss_pred CEEECCCCCCCCCHHHCCCCCCCCEECCCHHHHHHHHHHHHCCCCEEEEEEECCC-CCHHEEEECCCCCCCEEEEEECCC T ss_conf 9890688866578213699840238999449999999999749977999861233-100001100125796599953488 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCC Q ss_conf 22446852101112335675213214432111100001110000011100122221000122212222100010013333 Q gi|254780922|r 83 PAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEEKPN 162 (292) Q Consensus 83 p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~EKP~ 162 (292) |+|||+|+.+|++++.+++|++++||++ ++.++.+++..|.++.+++++++++.++|++||++++|++++|.+|+|||. T Consensus 80 ~~Gt~~al~~a~~~i~~~~~lv~~~D~i-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~yg~v~~d~~~~v~~i~EKp~ 158 (217) T cd04181 80 PLGTAGAVRNAEDFLGDDDFLVVNGDVL-TDLDLSELLRFHREKGADATIAVKEVEDPSRYGVVELDDDGRVTRFVEKPT 158 (217) T ss_pred CCCCHHHHHHHHCCCCCCCEEEECCCCC-CCCCHHHHHHHHHHCCCCEEEEEEEECCCCCCEEEEECCCCCEEEEEECCC T ss_conf 7763455432100269997899727822-250899999999757996799987403677630899888510436898899 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCC Q ss_conf 322222202455410002243220046753333333432000001320463135412563278 Q gi|254780922|r 163 NPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAG 225 (292) Q Consensus 163 ~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~G 225 (292) .++|+++++|+|+|++++|+++++.. .++|++++|+++.++++|++.+ +...| .|+|+| T Consensus 159 ~~~~~~~~~G~y~~~~~~f~~i~~~~--~~~~~~l~d~~~~l~~~~kv~~-~~~~g-~W~DiG 217 (217) T cd04181 159 LPESNLANAGIYIFEPEILDYIPEIL--PRGEDELTDAIPLLIEEGKVYG-YPVDG-YWLDIG 217 (217) T ss_pred CCCCCEEEEEEEEECHHHHHHHHHCC--CCCEEEHHHHHHHHHHCCCEEE-EEECC-EEEECC T ss_conf 98788489989999999999765158--8875118999999985799699-99799-999179 No 19 >cd06422 NTP_transferase_like_1 NTP_transferase_like_1 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. Probab=100.00 E-value=0 Score=347.72 Aligned_cols=218 Identities=25% Similarity=0.442 Sum_probs=188.9 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC- Q ss_conf 0799248887455334438982222478374799999999978986899997776564311122222222111122233- Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ- 80 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q- 80 (292) ||||||||+||||+|+|+.+||||+||+|||||+|+|++|.++||++|+|+++|.. +++++++++ +.++++++|+.| T Consensus 1 KAvIlAgG~GtRl~plT~~~PKpllpv~~kpii~~~i~~l~~~gi~~iii~~~y~~-~~i~~~~~~-~~~~~~i~~~~e~ 78 (221) T cd06422 1 KAMILAAGLGTRMRPLTDTRPKPLVPVAGKPLIDHALDRLAAAGIRRIVVNTHHLA-DQIEAHLGD-SRFGLRITISDEP 78 (221) T ss_pred CEEEECCCCCCCCCHHHCCCCHHEEEECCEEHHHHHHHHHHHCCCCEEEEECCCCH-HHHHHHHHC-CCCCCCEEEEEEC T ss_conf 98998884766887445799802379999899999999999829967998325327-889998722-5578737997403 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCC--CCCCCCCCCCCCCHHHCC Q ss_conf 332244685210111233567521321443211110000111000001110012222100--012221222210001001 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNP--QRYGVVEVDSSNQAISIE 158 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p--~~yGvv~~d~~~~i~~i~ 158 (292) ..|+|||||+.+|++++++++|++++||+ +++.++.+++..|..+.+.++++...+++| ++||+++.|+++.+ . T Consensus 79 ~~~lGT~~ai~~a~~~~~~d~flv~~gD~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~i---~ 154 (221) T cd06422 79 DELLETGGGIKKALPLLGDEPFLVVNGDI-LWDGDLAPLLLLHAWRMDALLLLLPLVRNPGHNGVGDFSLDADGRL---R 154 (221) T ss_pred CCCCCCCHHHHHHHHHCCCCCEEEEECCE-EECCCHHHHHHHHHHHCCCCEEEEEEECCCCCCCCCEEEECCCEEE---E T ss_conf 32366505888888634998689980675-7818989999999970689779999867876557537987686589---9 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHH Q ss_conf 333332222220245541000224322004675333333343200000132046313541256327845779999 Q gi|254780922|r 159 EKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDT 233 (292) Q Consensus 159 EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~A 233 (292) +||+.+.++++++|+|+|++++|+.++ ++|++++|+++.++++|++. .+...| +|+|||||++|++| T Consensus 155 ~~~~~~~~~~~~~G~yif~~~if~~i~------~~~~~~~d~~~~li~~~~v~-~~~~~g-~w~DiGtpe~~~~A 221 (221) T cd06422 155 RGGGGAVAPFTFTGIQILSPELFAGIP------PGKFSLNPLWDRAIAAGRLF-GLVYDG-LWFDVGTPERLLAA 221 (221) T ss_pred ECCCCCCCEEEEEEEEEEEHHHHHHCC------CCCCCHHHHHHHHHHCCCEE-EEECCC-EEEECCCHHHHHHC T ss_conf 778887742788779998189995478------88775589999999669849-997299-79949999999749 No 20 >cd06426 NTP_transferase_like_2 NTP_trnasferase_like_2 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. Probab=100.00 E-value=0 Score=344.87 Aligned_cols=220 Identities=28% Similarity=0.463 Sum_probs=195.5 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431112222222211112223333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLV 82 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~ 82 (292) .||||||+||||+|+|+.+||||+||+|||||+|+|+.|..+|+++|+|+++|+. +++++++++++.++++++|+.|++ T Consensus 1 viILAaG~GtRl~plT~~~PK~Ll~i~gkpli~~~i~~l~~~gi~~i~i~~~y~~-e~i~~~~~~~~~~~~~i~~i~e~~ 79 (220) T cd06426 1 VVIMAGGKGTRLRPLTENTPKPMLKVGGKPILETIIDRFIAQGFRNFYISVNYLA-EMIEDYFGDGSKFGVNISYVREDK 79 (220) T ss_pred CEEECCCCCCCCCHHHCCCCCCCEEECCCHHHHHHHHHHHHCCCCEEEEEECCCC-HHHHHHHCCCCCCCCEEEEEECCC T ss_conf 9893775755788444699810329999539999999999839968999501461-132334315665784388721466 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCC Q ss_conf 22446852101112335675213214432111100001110000011100122221000122212222100010013333 Q gi|254780922|r 83 PAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEEKPN 162 (292) Q Consensus 83 p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~EKP~ 162 (292) |+|||+|+..+++++ +++|++++||. +++.++.++++.|.++++++++.+.+.+.+.+||++..+ +++|.++.|||. T Consensus 80 ~lGt~gal~~~~~~~-~~~~iv~~gD~-i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gv~~~~-~~~v~~~~ekp~ 156 (220) T cd06426 80 PLGTAGALSLLPEKP-TDPFLVMNGDI-LTNLNYEHLLDFHKENNADATVCVREYEVQVPYGVVETE-GGRITSIEEKPT 156 (220) T ss_pred CCCHHHHHHHHHHCC-CCCEEEECCCE-ECCCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEEEC-CCCEEEEEECCC T ss_conf 555789998667618-99789977773-047599999999997499669999862568777659934-883767776688 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHH Q ss_conf 322222202455410002243220046753333333432000001320463135412563278457799999 Q gi|254780922|r 163 NPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTA 234 (292) Q Consensus 163 ~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As 234 (292) . ++++++|+|+|++++++.+++ +++++++|+++.++++|.....+..++ .|+|+|||++|.+|| T Consensus 157 ~--~~~~~~Giy~~~~~~~~~i~~-----~~~~~~~d~~~~li~~g~~v~~~~~~~-~W~DiGt~e~l~~AN 220 (220) T cd06426 157 H--SFLVNAGIYVLEPEVLDLIPK-----NEFFDMPDLIEKLIKEGKKVGVFPIHE-YWLDIGRPEDYEKAN 220 (220) T ss_pred C--CCCCCEEEEEECHHHHHHCCC-----CCCCCCCHHHHHHHHCCCCEEEEEECC-EEEECCCHHHHHHHC T ss_conf 8--652331699989999974689-----994454589999998799889997499-999698989999749 No 21 >cd02540 GT2_GlmU_N_bac N-terminal domain of bacterial GlmU. The N-terminal domain of N-Acetylglucosamine-1-phosphate uridyltransferase (GlmU). GlmU is an essential bacterial enzyme with both an acetyltransferase and an uridyltransferase activity which have been mapped to the C-terminal and N-terminal domains, respectively. This family represents the N-terminal uridyltransferase. GlmU performs the last two steps in the synthesis of UDP-N-acetylglucosamine (UDP-GlcNAc), which is an essential precursor in both the peptidoglycan and the lipopolysaccharide metabolic pathways in Gram-positive and Gram-negative bacteria, respectively. Probab=100.00 E-value=0 Score=346.09 Aligned_cols=218 Identities=21% Similarity=0.337 Sum_probs=188.6 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431112222222211112223333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLV 82 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~ 82 (292) |||||||+||||+| .+||||+||+|||||+|++++|..+|+++|+||+++.. +.+++++++ .+++|+.|++ T Consensus 1 AvIlAaG~GtRl~~---~~PKpllpi~~kpii~~ii~~l~~~gi~~i~iv~~~~~-e~i~~~~~~-----~~i~~v~Q~~ 71 (229) T cd02540 1 AVILAAGKGTRMKS---DLPKVLHPLAGKPMLEHVLDAARALGPDRIVVVVGHGA-EQVKKALAN-----PNVEFVLQEE 71 (229) T ss_pred CEEECCCCCCCCCC---CCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCH-HHHHHHHCC-----CCEEEEEECC T ss_conf 98946878757998---97841518998799999999999769975996357689-999987543-----8726998068 Q ss_pred CCCCCCCCCCCCCCCCC--CCCCEEECCC-CCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCC Q ss_conf 22446852101112335--6752132144-32111100001110000011100122221000122212222100010013 Q gi|254780922|r 83 PAGLAQSYILGAEFIGD--SSSVLILGDN-VFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEE 159 (292) Q Consensus 83 p~Gta~Ai~~a~~~i~~--~~~~lilgDn-i~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~E 159 (292) |+|||||+.+|++++++ ++|++++||+ ++...++.++++.|.+.+++++++++++++|++||++..|++++|.+|+| T Consensus 72 ~lGta~Av~~a~~~i~~~~~~~lVl~gD~pli~~~~l~~l~~~~~~~~~~~ti~~~~~~~p~~YG~v~~d~~g~v~~ivE 151 (229) T cd02540 72 QLGTGHAVKQALPALKDFEGDVLVLYGDVPLITPETLQRLLEAHREAGADVTVLTAELEDPTGYGRIIRDGNGKVLRIVE 151 (229) T ss_pred CCCCHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCEEEECCCCCEEEEEE T ss_conf 89737899999874458887299995896453678899999999965996699998604876786799889997899998 Q ss_pred CC----CCCCCCCCHHHHHHHHHH-HHHHHHHCCCC-CCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCC--CHHHH Q ss_conf 33----332222220245541000-22432200467-53333333432000001320463135412563278--45779 Q gi|254780922|r 160 KP----NNPKSSFAVTGIYFYDQE-VVNIARNIRPS-ARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAG--TPESL 230 (292) Q Consensus 160 KP----~~~~sn~a~~GiY~f~~~-i~~~~~~l~~s-~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~G--t~e~l 230 (292) || ..++++++++|+|+|+++ +++++++++++ .|||+++||+++.++++|.....+...+ .|-|+| |+++| T Consensus 152 kk~~~~~~~~~~~~n~GiYif~~~~l~~~l~~l~~~~~~gE~~ltD~i~~~i~~g~kv~~~~~~~-~~~~~Gin~~~dl 229 (229) T cd02540 152 EKDATEEEKAIREVNAGIYAFDAEFLFEALPKLTNNNAQGEYYLTDIIALAVADGLKVAAVLADD-EEEVLGVNDRVQL 229 (229) T ss_pred CCCCCCCCCCCCEEEEEEEEECHHHHHHHHHHCCCCCCCCEEEHHHHHHHHHHCCCCEEEEEECC-HHHEECCCCHHHC T ss_conf 87876543454322227999859999999984798766887889999999998899189999288-7661378897879 No 22 >PRK03282 consensus Probab=100.00 E-value=0 Score=343.38 Aligned_cols=242 Identities=24% Similarity=0.403 Sum_probs=195.1 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCC-CCCCCC Q ss_conf 9079924888745533443898222247837-47999999999789868999977765643111222222221-111222 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWG-VQFSYI 78 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g-~~i~y~ 78 (292) |.|||||||+||||+|||..+||||+|++|+ |||+|+|++|.++||++|+|+++|... .+.+|++++.+++ ....|+ T Consensus 7 v~avILagG~GtRLrPLT~~rpKp~vP~~g~yrlIdf~Lsnl~~~Gi~~v~v~~~y~~~-sl~~h~g~~w~~~~~~g~~v 85 (406) T PRK03282 7 VLGIVLAGGEGKRLMPLTADRAKPAVPFGGAYRLIDFVLSNLVNAGYLRICVLTQYKSH-SLDRHISQTWRMSGLLGNYI 85 (406) T ss_pred EEEEEECCCCCCCCCCCCCCCCHHHCEECCCCCHHHHHHHHHHHCCCCEEEEECCCCHH-HHHHHHHHCCCCCCCCCCEE T ss_conf 79999789987723302279835400658966457999998887699889998177779-99999862137556566379 Q ss_pred CCCC---------CCCCCCCCCCCCCCCCCC--CCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCCCCCC Q ss_conf 3333---------224468521011123356--7521321443211110000111000001110012222--10001222 Q gi|254780922|r 79 EQLV---------PAGLAQSYILGAEFIGDS--SSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQRYGV 145 (292) Q Consensus 79 ~q~~---------p~Gta~Ai~~a~~~i~~~--~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~~yGv 145 (292) .+.. .+|||+|++.+.+++.++ ++++++++|++++.|+.+++++|.++++++|+...++ ++|++||+ T Consensus 86 ~~~~~~~~~g~~~~~Gta~ai~~~~~~l~~~~~~~vvv~~gD~v~~~D~~~~l~~H~~~~a~~Ti~~~~v~~~d~s~~Gv 165 (406) T PRK03282 86 TPVPAQQRLGPRWYTGSADAIYQSLNLIYDEDPDYVVVFGADHVYRMDPEQMVQQHIESGAGVTVAGIRVPRSEATAFGV 165 (406) T ss_pred EECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEEECCHHHHHHHHHHCCCCEEEEEEECCHHHHHHCCE T ss_conf 75652123674214577899999999986269987999789799807999999999976998599999827677623781 Q ss_pred CCCCCC-CCHHHCCCCCCCC-------CCCCCHHHHHHHHHHH-HHHHHHCCCCCCCCCCC-CCCCCHHCCCCCEEEEEC Q ss_conf 122221-0001001333332-------2222202455410002-24322004675333333-343200000132046313 Q gi|254780922|r 146 VEVDSS-NQAISIEEKPNNP-------KSSFAVTGIYFYDQEV-VNIARNIRPSARGELEI-TDVNSYYLDKGLLAVEFL 215 (292) Q Consensus 146 v~~d~~-~~i~~i~EKP~~~-------~sn~a~~GiY~f~~~i-~~~~~~l~~s~rgE~ei-tD~~~~~l~~~~~~~~~~ 215 (292) +++|++ ++|..|+|||.++ .++++++|+|+|++++ ++.+.....+.++++++ .|+++.+++++.....-+ T Consensus 166 v~~d~d~~~V~~f~EKP~~~~~~~~~~~~~l~n~GiYvf~~~~l~~~l~~~~~~~~~~~d~~~di~p~l~~~g~~~~y~~ 245 (406) T PRK03282 166 IDADDDSGRIREFLEKPADPPGLPDDPDETFASMGNYVFTTKALIDALRADAEDEDSDHDMGGDIIPRFVARGEAAVYDF 245 (406) T ss_pred EEECCCCCEEEEEEECCCCCCCCCCCCCCEECCCEEEEECHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCEEEEEE T ss_conf 88828999799999557876578898010020210699849999999986501777644226778999997098279985 Q ss_pred C----------CCEEEECCCCHHHHHHHHHHHHHHHHH Q ss_conf 5----------412563278457799999999998888 Q gi|254780922|r 216 R----------EGSAWFDAGTPESLLDTAVFVRNIENR 243 (292) Q Consensus 216 ~----------~g~~W~D~Gt~e~l~~As~~i~~~e~~ 243 (292) . ..++|.|+||+++|++||+.+...... T Consensus 246 ~~~~v~g~~~~~~GYW~dIgt~~~y~~an~dll~~~~~ 283 (406) T PRK03282 246 SDNEVPGATDRDRGYWRDVGTLDAFYDAHMDLISVHPV 283 (406) T ss_pred ECCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCCC T ss_conf 03535575445551798679899999988986277853 No 23 >PRK00844 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=340.02 Aligned_cols=242 Identities=25% Similarity=0.409 Sum_probs=192.3 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCC---C Q ss_conf 9079924888745533443898222247837-4799999999978986899997776564311122222222-111---1 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKW-GVQ---F 75 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~-g~~---i 75 (292) |+|||||||.||||+|||..+||||+||+|| |||+|+|++|.++|+++|+|+++++. ..+.+|++++..+ +.. + T Consensus 6 ~~aVILAgG~GtRLrPLT~~rpKpllPvagkyplIdf~Ls~l~~aGi~~v~v~~~y~~-~sl~~hi~~~w~~~~~~~~~i 84 (409) T PRK00844 6 VLGIVLAGGEGKRLMPLTADRAKPAVPFGGSYRLIDFVLSNLVNAGLLRICVLTQYKS-HSLDRHISQGWRLSGLLGEYI 84 (409) T ss_pred EEEEEECCCCCCCCCCCCCCCCHHCCEECCCCCHHHHHHHHHHHCCCCEEEEECCCCH-HHHHHHHHCCCCCCCCCCCEE T ss_conf 6999988998774430006980440065883607899999998769988999907787-999999862726567677358 Q ss_pred CCCCCCC------CCCCCCCCCCCCCCCCCC--CCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCCCCCC Q ss_conf 2223333------224468521011123356--7521321443211110000111000001110012222--10001222 Q gi|254780922|r 76 SYIEQLV------PAGLAQSYILGAEFIGDS--SSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQRYGV 145 (292) Q Consensus 76 ~y~~q~~------p~Gta~Ai~~a~~~i~~~--~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~~yGv 145 (292) .++.+++ .+|||||++++.+++.++ ++++++++|++++.++.++++.|.++++++|+...++ ++|++||+ T Consensus 85 ~~~~~~~~~~~~~~~Gtadai~~~~~~i~~~~~d~vlv~~gD~i~~~dl~~~l~~H~~~~a~~Ti~~~~v~~~~~~~~Gv 164 (409) T PRK00844 85 TPVPAQQRLGKRWYEGSADAIYQSLNLIEDEDPDYVVVFGADHVYRMDPEQMVEFHIESGAGLTVAGIRVPREEASAFGV 164 (409) T ss_pred EECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCCHHHHHHHHHHCCCCCEEEEEEECHHHCCCCCE T ss_conf 75672124676545577899999998874179988999789989707999999999756886347899926687453678 Q ss_pred CCCCCCCCHHHCCCCCCCCCC-------CCCHHHHHHHHHHHH-HHHHHCCCCCCCCCCC-CCCCCHHCCCCCEEEEECC Q ss_conf 122221000100133333222-------222024554100022-4322004675333333-3432000001320463135 Q gi|254780922|r 146 VEVDSSNQAISIEEKPNNPKS-------SFAVTGIYFYDQEVV-NIARNIRPSARGELEI-TDVNSYYLDKGLLAVEFLR 216 (292) Q Consensus 146 v~~d~~~~i~~i~EKP~~~~s-------n~a~~GiY~f~~~i~-~~~~~l~~s~rgE~ei-tD~~~~~l~~~~~~~~~~~ 216 (292) +++|++++|+.|.|||++|++ ++|++|+|+|++++| +.+........+.+++ +|+++.+++++.+...-+. T Consensus 165 v~~d~~g~V~~f~EKP~~~~~~~~~~~~~l~n~GiYvf~~~~L~~~l~~~~~~~~~~~d~~~diip~l~~~g~~~~y~~~ 244 (409) T PRK00844 165 IEVDEDGRIRGFLEKPADPPGLPDDPDETLASMGNYIFTTDVLIDALREDAEDEDSSHDMGGDIIPKLVPRGEAAVYDFS 244 (409) T ss_pred EEECCCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCEEEEEEC T ss_conf 99889996999981578876778971201120135995199999999875016776332067789999863886899841 Q ss_pred ----------CCEEEECCCCHHHHHHHHHHHHHHHHH Q ss_conf ----------412563278457799999999998888 Q gi|254780922|r 217 ----------EGSAWFDAGTPESLLDTAVFVRNIENR 243 (292) Q Consensus 217 ----------~g~~W~D~Gt~e~l~~As~~i~~~e~~ 243 (292) ..++|.|+||+++|++||+.+..-+.. T Consensus 245 ~~~v~g~~~~~~GYW~digt~~~y~~an~dlL~~~~~ 281 (409) T PRK00844 245 DNEVPGATERDRGYWRDVGTIDSFYDAHMDLLSVHPV 281 (409) T ss_pred CCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCCH T ss_conf 4544564345650589789989999988998678842 No 24 >cd02524 G1P_cytidylyltransferase G1P_cytidylyltransferase catalyzes the production of CDP-D-Glucose. Alpha-D-Glucose-1-phosphate Cytidylyltransferase catalyzes the production of CDP-D-Glucose from alpha-D-Glucose-1-phosphate and MgCTP as substrate. CDP-D-Glucose is the precursor for synthesizing four of the five naturally occurring 3,6-dideoxy sugars-abequose (3,6-dideoxy-D-Xylo-hexose), ascarylose (3,6-dideoxy-L-arabino-hexose), paratose (3,6-dideoxy-D-ribohexose), and tyvelose (3,6-dideoxy-D-arabino-hexose. Deoxysugars are ubiquitous in nature where they function in a variety of biological processes, including cell adhesion, immune response, determination of ABO blood groups, fertilization, antibiotic function, and microbial pathogenicity. Probab=100.00 E-value=0 Score=337.81 Aligned_cols=225 Identities=21% Similarity=0.318 Sum_probs=187.6 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCC---------- Q ss_conf 7992488874553344389822224783747999999999789868999977765643111222222221---------- Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWG---------- 72 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g---------- 72 (292) .||||||+||||+|+|..+||||+||+|||+|+|+++.|.++||++|+++++|.. +++++++++....+ T Consensus 1 VvILaGG~GtRL~pLT~~~PKpLvpi~gkPii~~ii~~l~~~gi~~iil~~gy~~-~~i~~~f~~~~~~~~~~~~~~~~~ 79 (253) T cd02524 1 VVILAGGLGTRLSEETELKPKPMVEIGGRPILWHIMKIYSHYGHNDFILCLGYKG-HVIKEYFLNYFLHNSDVTIDLGTN 79 (253) T ss_pred CEEECCCCCCCCCHHHCCCCCCCEEECCEEHHHHHHHHHHHCCCCEEEEECCCCC-HHHHHHHHHHHHHCCCEEEEECCC T ss_conf 9893478755414312799800318999989999999999869987999532251-137999877776158558993377 Q ss_pred -----------CCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCC Q ss_conf -----------1112223333224468521011123356-7521321443211110000111000001110012222100 Q gi|254780922|r 73 -----------VQFSYIEQLVPAGLAQSYILGAEFIGDS-SSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNP 140 (292) Q Consensus 73 -----------~~i~y~~q~~p~Gta~Ai~~a~~~i~~~-~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p 140 (292) .++.++.+..+.|||+|+..++++++++ +|++++||+ +++.++.+++..|.++++.+|++. +++| T Consensus 80 ~~~~~~~~~~~~~i~~~~~~~~~~tgg~l~~~~~~l~~~e~flv~nGD~-l~d~dl~~l~~~h~~~~~~~tl~~--v~~~ 156 (253) T cd02524 80 RIELHNSDIEDWKVTLVDTGLNTMTGGRLKRVRRYLGDDETFMLTYGDG-VSDVNINALIEFHRSHGKLATVTA--VHPP 156 (253) T ss_pred EEEEECCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCEEEEEECCC-CCCCCHHHHHHHHHHCCCEEEEEE--CCCC T ss_conf 2798336677860589853665503389999998748997399991552-343687999999986698599996--5888 Q ss_pred CCCCCCCCCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEE Q ss_conf 01222122221000100133333222222024554100022432200467533333334320000013204631354125 Q gi|254780922|r 141 QRYGVVEVDSSNQAISIEEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSA 220 (292) Q Consensus 141 ~~yGvv~~d~~~~i~~i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~ 220 (292) ++||++++|++++|.+|.|||+.+ ++++++|+|+|++++|++++.-. ..++ +|+++.++++|++. .+...| + T Consensus 157 ~~yG~v~~d~~~~v~~f~EKP~~~-~~~in~GiYv~~~~if~~i~~~~----~~~e-~d~~p~li~~g~l~-~y~~~g-~ 228 (253) T cd02524 157 GRFGELDLDDDGQVTSFTEKPQGD-GGWINGGFFVLEPEVFDYIDGDD----TVFE-REPLERLAKDGELM-AYKHTG-F 228 (253) T ss_pred CCCCEEEECCCCEEEEEEECCCCC-CCCCEEEEEEECHHHHHHCCCCC----CCHH-HHHHHHHHHCCCEE-EEECCC-E T ss_conf 888679998998599987378887-77421499998899995466788----7426-78999999669979-996698-8 Q ss_pred EECCCCHHHHHHHHHHHHH Q ss_conf 6327845779999999999 Q gi|254780922|r 221 WFDAGTPESLLDTAVFVRN 239 (292) Q Consensus 221 W~D~Gt~e~l~~As~~i~~ 239 (292) |+|+|||+||.+|++++++ T Consensus 229 W~DigT~~d~~~l~~~~~~ 247 (253) T cd02524 229 WQCMDTLRDKQTLEELWNS 247 (253) T ss_pred EECCCCHHHHHHHHHHHHC T ss_conf 8189997999999999973 No 25 >KOG1322 consensus Probab=100.00 E-value=0 Score=340.10 Aligned_cols=235 Identities=29% Similarity=0.479 Sum_probs=201.9 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCC-CCCCCCCCCCCC Q ss_conf 9079924888745533443898222247837479999999997898689999777656431112222-222211112223 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGS-GEKWGVQFSYIE 79 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~-g~~~g~~i~y~~ 79 (292) |||+||.||.||||+|||.++|||++|+++||||+|++++|+++||++|++.++|+..++..++..+ +..+|+++.++. T Consensus 10 vkaiILvGG~GTRLrPLT~t~pKPlVpfgn~pmI~hqieal~nsGi~~I~la~~y~s~sl~~~~~k~y~~~lgVei~~s~ 89 (371) T KOG1322 10 VKAIILVGGYGTRLRPLTLTRPKPLVPFGNKPMILHQIEALINSGITKIVLATQYNSESLNRHLSKAYGKELGVEILAST 89 (371) T ss_pred EEEEEEECCCCCEEECEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 15899954777446000136888665427630527789998757974799997138289999999976124555999982 Q ss_pred CCCCCCCCCCCCCCCCCCCCC---CCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCC-CCCHH Q ss_conf 333224468521011123356---75213214432111100001110000011100122221000122212222-10001 Q gi|254780922|r 80 QLVPAGLAQSYILGAEFIGDS---SSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDS-SNQAI 155 (292) Q Consensus 80 q~~p~Gta~Ai~~a~~~i~~~---~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~-~~~i~ 155 (292) |.+|+||+|.+..|++++-.. +|+ +|+.|++|..+|.+++++|.++++++|+.+.+|++|++||++.+|+ .|+|. T Consensus 90 eteplgtaGpl~laR~~L~~~~~~~ff-VLnsDvi~~~p~~~~vqfH~~~gae~TI~~t~vdepSkyGvv~~d~~~grV~ 168 (371) T KOG1322 90 ETEPLGTAGPLALARDFLWVFEDAPFF-VLNSDVICRMPYKEMVQFHRAHGAEITIVVTKVDEPSKYGVVVIDEDTGRVI 168 (371) T ss_pred CCCCCCCCCHHHHHHHHHHHCCCCCEE-EECCCEEECCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEEEECCCCCEE T ss_conf 068876546699999886403798679-9557726437799999999855996589987446853263699935888255 Q ss_pred HCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHH Q ss_conf 00133333222222024554100022432200467533333334320000013204631354125632784577999999 Q gi|254780922|r 156 SIEEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAV 235 (292) Q Consensus 156 ~i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~ 235 (292) +|+|||++.+||-+++|+|+|++++++.+. +.|.+ =|- .+++.+.++.++ ..+..+| +|+|+|+|.+|++|-. T Consensus 169 ~F~EKPkd~vsnkinaGiYi~~~~vL~ri~-~~ptS-iek---EifP~~a~~~~l-~a~~l~g-fWmDIGqpkdf~~g~~ 241 (371) T KOG1322 169 RFVEKPKDLVSNKINAGIYILNPEVLDRIL-LRPTS-IEK---EIFPAMAEEHQL-YAFDLPG-FWMDIGQPKDFLTGFS 241 (371) T ss_pred EEHHCCHHHHHCCCCCEEEEECHHHHHHHH-HCCCC-HHH---HHHHHHHHCCCE-EEEECCC-HHHHCCCHHHHHHHHH T ss_conf 802175656533124158997799986765-06663-011---111556411725-8996276-1320688899988898 Q ss_pred -HHHHHHHH Q ss_conf -99998888 Q gi|254780922|r 236 -FVRNIENR 243 (292) Q Consensus 236 -~i~~~e~~ 243 (292) |.++.... T Consensus 242 ~Yl~s~~~~ 250 (371) T KOG1322 242 FYLRSLPKY 250 (371) T ss_pred HHHHHCCCC T ss_conf 887537433 No 26 >PRK04928 consensus Probab=100.00 E-value=0 Score=335.41 Aligned_cols=241 Identities=25% Similarity=0.441 Sum_probs=190.8 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCC-CCCCCCCC Q ss_conf 079924888745533443898222247837-4799999999978986899997776564311122222222-11112223 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKW-GVQFSYIE 79 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~-g~~i~y~~ 79 (292) -|||||||.||||+|||.++||||+|++|+ |||+|+|++|.++||++|.|++++... .+.+|++++..+ ++...|+. T Consensus 5 ~aiILagG~gtRLrPLT~~rpKp~lP~~g~yrlIdf~Ls~l~~sGi~~V~v~~~~~~~-sl~~hl~~gw~l~~~~~~~~~ 83 (405) T PRK04928 5 LGMILAGGEGSRLRPLTESRTKPAVPFGGSYRLIDFALNNFVNADLMRIYVLTQFKSQ-SLYIHMKKGWNINGITDRFID 83 (405) T ss_pred EEEEECCCCCCCCCHHHCCCCHHHEEECCCCCEEHHHHHHHHHCCCCEEEEECCCCHH-HHHHHHHCCCCCCCCCCCEEE T ss_conf 9999789987733232159856523878857131899999998799889999088879-999997466376641351787 Q ss_pred CC---------CCCCCCCCCCCCCCCCC--CCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCC--CCCCCCCCCC Q ss_conf 33---------32244685210111233--56752132144321111000011100000111001222--2100012221 Q gi|254780922|r 80 QL---------VPAGLAQSYILGAEFIG--DSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCH--VQNPQRYGVV 146 (292) Q Consensus 80 q~---------~p~Gta~Ai~~a~~~i~--~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~--v~~p~~yGvv 146 (292) .. ...|||||+..+.+|++ ++++++++++|++++.++.++++.|.++++.+|+...+ +++|++|||+ T Consensus 84 ~~p~~~~~~~~~~~Gta~ai~~~~~~l~~~~~e~~lv~~gD~l~~~Dl~~ll~~H~~~~a~iTi~~~~~~~e~~~~~Gvv 163 (405) T PRK04928 84 PIPAQMRTGKRWYEGTADAIYQNLRFIELSEPEHVCIFGSDHIYKMDIKQMLDFHKEKEAALTVSALRMPLEEASQFGVI 163 (405) T ss_pred ECCHHHCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCCHHHHHHHHHHCCCCCEEEEEECCHHCCCCCCEE T ss_conf 36224125764456568999999999854699889997898896689999999998568873168998241125537768 Q ss_pred CCCCCCCHHHCCCCCCCCCC-------CCCHHHHHHHHHHHHH-HHHHCCCCCCCCCCC-CCCCCHHCCCCCEEEEEC-- Q ss_conf 22221000100133333222-------2220245541000224-322004675333333-343200000132046313-- Q gi|254780922|r 147 EVDSSNQAISIEEKPNNPKS-------SFAVTGIYFYDQEVVN-IARNIRPSARGELEI-TDVNSYYLDKGLLAVEFL-- 215 (292) Q Consensus 147 ~~d~~~~i~~i~EKP~~~~s-------n~a~~GiY~f~~~i~~-~~~~l~~s~rgE~ei-tD~~~~~l~~~~~~~~~~-- 215 (292) ++|++++|++|.|||++|++ +++++|+|+|+++++. .+..-.....+++++ +|+++.++..+.+...-+ T Consensus 164 ~~D~~g~V~~f~EKP~~~~~~~g~~~~~li~~GiYvf~~~vl~~~l~~~~~~~~~~~d~~~dvip~l~~~g~~~~y~~~~ 243 (405) T PRK04928 164 EVDAEGRMIGFEEKPANPKSIPGDPDHALVSMGNYIFEAEVLFKELIEDAANENSSHDFGKDIIPKMFPRGKVFVYDFST 243 (405) T ss_pred EECCCCCEEEEEECCCCCCCCCCCCCCEEEECCEEEECHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCEEEEEECC T ss_conf 88899989999976898756679833117503226751999999998650147876741578899998728868998024 Q ss_pred ------CCCEEEECCCCHHHHHHHHHHHHHHHHH Q ss_conf ------5412563278457799999999998888 Q gi|254780922|r 216 ------REGSAWFDAGTPESLLDTAVFVRNIENR 243 (292) Q Consensus 216 ------~~g~~W~D~Gt~e~l~~As~~i~~~e~~ 243 (292) ...++|.|+||+++|++||+.+..-+.+ T Consensus 244 ~~i~g~~~~gYW~digt~~~y~~an~dlL~~~~~ 277 (405) T PRK04928 244 NRIPGEKEEVYWRDVGTIDSYWQAHMDLLKKDAP 277 (405) T ss_pred CCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCCH T ss_conf 4335756650589789878999999998668856 No 27 >PRK00725 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=330.98 Aligned_cols=241 Identities=24% Similarity=0.402 Sum_probs=196.5 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCC-------- Q ss_conf 79924888745533443898222247837-479999999997898689999777656431112222222211-------- Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGV-------- 73 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~-------- 73 (292) |||||||.||||+|||..+|||++|++|+ +||||+|++|+++||++|.|+++++... +.+|+++| |+. T Consensus 22 aviLAGG~GtRL~pLT~~RpkpavPfgGryRlIDF~LSn~vnsGI~~V~V~t~y~~~S-L~dHlg~G--W~~~~~~~g~~ 98 (431) T PRK00725 22 ALILAGGRGSRLKELTDKRAKPAVYFGGKFRIIDFALSNCLNSGIRRIGVITQYQAHS-LVRHIQRG--WSFFREERNEF 98 (431) T ss_pred EEEECCCCCCCCHHHHCCCCCCCCEECCCCEEEHHHHHHHHHCCCCEEEEEECCCHHH-HHHHHHCC--CCCCCCCCCCC T ss_conf 9996388877551554278400315378235402367668875999899992778799-99987534--24444468960 Q ss_pred -CCCC-----CCCCCCCCCCCCCCCCCCCCCC--CCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCCCC Q ss_conf -1122-----2333322446852101112335--67521321443211110000111000001110012222--100012 Q gi|254780922|r 74 -QFSY-----IEQLVPAGLAQSYILGAEFIGD--SSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQRY 143 (292) Q Consensus 74 -~i~y-----~~q~~p~Gta~Ai~~a~~~i~~--~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~~y 143 (292) .+.. ..+..++||++|+.++.+|+.. .+++++++.|++|+.|+.+++++|.++++++|+...++ ++|++| T Consensus 99 ~~~lp~~~~~~~~~~~~Gtadav~~n~~~l~~~~~eyvvI~~gD~I~~~D~~~~l~~H~~~gAdiTi~~~~v~~~~~~~f 178 (431) T PRK00725 99 VDLLPAQQRVSEENWYRGTADAVYQNLDIIRRYKAEYVVILAGDHIYKMDYSRMLIDHVEKGADCTVGCLEVPREEASAF 178 (431) T ss_pred EEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCCHHHHHHHHHHCCCCEEEEEEECCHHHCCCC T ss_conf 79777611258876546858999999999975699879994598898179999999998779987999998487885547 Q ss_pred CCCCCCCCCCHHHCCCCCCCC-------CCCCCHHHHHHHHHH-HHHHHHHCCCCCCCCCCC-CCCCCHHCCCCCEEEEE Q ss_conf 221222210001001333332-------222220245541000-224322004675333333-34320000013204631 Q gi|254780922|r 144 GVVEVDSSNQAISIEEKPNNP-------KSSFAVTGIYFYDQE-VVNIARNIRPSARGELEI-TDVNSYYLDKGLLAVEF 214 (292) Q Consensus 144 Gvv~~d~~~~i~~i~EKP~~~-------~sn~a~~GiY~f~~~-i~~~~~~l~~s~rgE~ei-tD~~~~~l~~~~~~~~~ 214 (292) ||+++|++++|++|.|||++| .+++|++|+|+|+++ +++.+.+.....++++++ .|+++.+++++.+...- T Consensus 179 Gv~~~D~~grV~~f~EKP~~p~~~~g~~~~~l~smGiYvf~~~~L~~~L~~~~~~~~~~~Dfg~diip~ll~~~~~~ay~ 258 (431) T PRK00725 179 GVMAVDENDKIIAFVEKPADPPAMPGDPDKSLASMGIYVFDADYLYELLERDAADPNSSHDFGKDIIPKIVEEGKAYAHP 258 (431) T ss_pred CEEEECCCCCEEEEEECCCCCCCCCCCCCCCEECCEEEEECHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCEEEEE T ss_conf 74999999989999933788434689755330001589985999999998652177642102677899999749878986 Q ss_pred CC---------CCEEEECCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 35---------412563278457799999999998888098 Q gi|254780922|r 215 LR---------EGSAWFDAGTPESLLDTAVFVRNIENRLGL 246 (292) Q Consensus 215 ~~---------~g~~W~D~Gt~e~l~~As~~i~~~e~~~g~ 246 (292) +. ..++|.|+||+++|++||+.+...+.+..+ T Consensus 259 f~~~~~~~~~~~~GYw~dIgt~~sY~~AnmdLL~~~~~~~l 299 (431) T PRK00725 259 FPLSCVRSDPEAEPYWRDVGTLDAYWQANLDLASVVPELDM 299 (431) T ss_pred CCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCCHHHC T ss_conf 25333456656664178879999999998777457845432 No 28 >PRK03701 consensus Probab=100.00 E-value=0 Score=330.47 Aligned_cols=245 Identities=24% Similarity=0.380 Sum_probs=197.4 Q ss_pred CE--EEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCC---- Q ss_conf 90--79924888745533443898222247837-479999999997898689999777656431112222222211---- Q gi|254780922|r 1 MK--GIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGV---- 73 (292) Q Consensus 1 Mk--aiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~---- 73 (292) || |||||||+||||+|||..+|||++|++|+ |||||+|++|+++||++|.|++.++..++ .+|+++|..|.. T Consensus 18 ~~~~aviLAGG~gtRL~pLT~~RpkpaVPf~GrYRlIDF~LSn~vnsGI~~V~V~tq~~~~SL-~dHlg~Gw~~~~~~~~ 96 (431) T PRK03701 18 LKSVALILAGGRGTRLKDLTNKRAKPAVHFGGKFRIIDFALSNCLNSGIRRIGVITQYQSHTL-VQHIQRGWSFFNEEMN 96 (431) T ss_pred CCEEEEEECCCCCCCCCHHHCCCCHHCCCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCHHHH-HHHHHCCCCHHCCCCC T ss_conf 126999943898764405543884421100785234799999999869998999927888999-9998636210022578 Q ss_pred ---CCC-----CCCCCCCCCCCCCCCCCCCCCC--CCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCC Q ss_conf ---112-----2233332244685210111233--567521321443211110000111000001110012222--1000 Q gi|254780922|r 74 ---QFS-----YIEQLVPAGLAQSYILGAEFIG--DSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQ 141 (292) Q Consensus 74 ---~i~-----y~~q~~p~Gta~Ai~~a~~~i~--~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~ 141 (292) .+. +..|..++||++|+.++.+|+. .++++++++.|++|+.|+.++++.|.++++++|++..++ ++|+ T Consensus 97 ~~~~~lp~~~~~~~~~~~~Gtadai~~n~~~l~~~~~eyVvI~~gD~I~~~D~~~~l~~H~~~gAdiTv~~~~v~~e~~~ 176 (431) T PRK03701 97 EFVDLLPAQQRMKGENWYRGTADAVTQNLDIIRRYKAEYVVILAGDHIYKQDYSRMLIDHVEKGARCTVACMPVPIEEAS 176 (431) T ss_pred CEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCCHHHHHHHHHHCCCCCEEEEEECCHHHCC T ss_conf 54997777410478864357489999999999746998599965988875899999999997599826999874878832 Q ss_pred CCCCCCCCCCCCHHHCCCCCCCCC-------CCCCHHHHHHHHHHHH-HHHHHCCCCCCCCCCC-CCCCCHHCCCCCEEE Q ss_conf 122212222100010013333322-------2222024554100022-4322004675333333-343200000132046 Q gi|254780922|r 142 RYGVVEVDSSNQAISIEEKPNNPK-------SSFAVTGIYFYDQEVV-NIARNIRPSARGELEI-TDVNSYYLDKGLLAV 212 (292) Q Consensus 142 ~yGvv~~d~~~~i~~i~EKP~~~~-------sn~a~~GiY~f~~~i~-~~~~~l~~s~rgE~ei-tD~~~~~l~~~~~~~ 212 (292) +|||+++|++++|++|.|||.+|+ ++++++|+|+|+++++ +++++.....++.+++ .|+++.+++.+.... T Consensus 177 ~~GVl~~D~~grV~~f~EKP~~p~~~~~~~~~~~~smgiYVf~~~~L~~ll~~~~~~~~~~~Df~~diip~l~~~~~v~~ 256 (431) T PRK03701 177 AFGVMAVDENDKIIEFVEKPANPPSMPGDPSKSLASMGIYVFDADYLYELLEEDDRDENSSHDFGKDIIPKITEAGLAYA 256 (431) T ss_pred CCCEEEECCCCCEEEEEECCCCCCCCCCCCCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCEEE T ss_conf 06859988999799999558886556898553321005899759999999863301566765437889999986096899 Q ss_pred EECC---------CCEEEECCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 3135---------412563278457799999999998888098 Q gi|254780922|r 213 EFLR---------EGSAWFDAGTPESLLDTAVFVRNIENRLGL 246 (292) Q Consensus 213 ~~~~---------~g~~W~D~Gt~e~l~~As~~i~~~e~~~g~ 246 (292) ..+. ..+||.|+||+++|++||+.+...+.+..+ T Consensus 257 y~f~~~~v~~~~~~~gYw~dIgti~sY~~anmdlL~~~~~~~l 299 (431) T PRK03701 257 HPFPLSCVQSDPDAEPYWRDVGTLEAYWKANLDLASVTPELDM 299 (431) T ss_pred EEECCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHCCCCCCCC T ss_conf 8504444446766777214579999999989987377765544 No 29 >PRK01884 consensus Probab=100.00 E-value=0 Score=327.36 Aligned_cols=244 Identities=23% Similarity=0.371 Sum_probs=194.8 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECC-EEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCC-----CCCC Q ss_conf 07992488874553344389822224783-74799999999978986899997776564311122222222-----1111 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYN-KPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKW-----GVQF 75 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~g-kplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~-----g~~i 75 (292) =|||||||+||||+|||..+|||++|++| +|||||+|++|+++||++|.|+++++..+ +.+|+++|..| |.-+ T Consensus 19 ~avILAGG~GtRL~pLT~~RpkpavPfgGryRiIDF~LSNlvnsGI~~V~V~t~y~~~S-L~dHl~~Gw~~~~~~~g~~i 97 (435) T PRK01884 19 LVLILAGGRGSRLHELTDKRAKPALYFGGNRRIIDFALSNCINSGLNRIGVVTQYAAHS-LLRHLQTGWSFLPQERGEFI 97 (435) T ss_pred EEEEECCCCCCCCHHHHCCCCHHCCCCCCCCEEEEHHHHHHHHCCCCEEEEECCCCHHH-HHHHHHCCCCCCCCCCCCEE T ss_conf 89996588876151655387151162488871210168678876998899981768799-99998616365555799527 Q ss_pred CCCCC-------CCCCCCCCCCCCCCCCCCC---CCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCCCC Q ss_conf 22233-------3322446852101112335---67521321443211110000111000001110012222--100012 Q gi|254780922|r 76 SYIEQ-------LVPAGLAQSYILGAEFIGD---SSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQRY 143 (292) Q Consensus 76 ~y~~q-------~~p~Gta~Ai~~a~~~i~~---~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~~y 143 (292) .+..+ ...+|||+|+.++..++.+ .+++++++.|++|+.|+.+++++|.++++++|+...++ ++|++| T Consensus 98 ~~lp~~~~~~~~~~~~Gtadai~~n~~~i~~~~~~~yVlVl~gD~I~~~D~~~~l~~H~~~~AdiTva~~~v~~e~~~~~ 177 (435) T PRK01884 98 DMLPARQQIDDSTWYRGTADAVYQNMAIIKNHYRPKYILILAGDHIYKQDYSQMLLDHVNSGAKCTVGCIEVPREEAHEF 177 (435) T ss_pred EECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCEEEECCHHHHHHHHHHCCCCEEEEEEECCHHHHHHC T ss_conf 86787422588751037399999889999724688879995599898078999999999759985899998687883007 Q ss_pred CCCCCCCCCCHHHCCCCCCCC-------CCCCCHHHHHHHHHHHH-HHHHHCCCCCCCCCCC-CCCCCHHCCCCCEEEEE Q ss_conf 221222210001001333332-------22222024554100022-4322004675333333-34320000013204631 Q gi|254780922|r 144 GVVEVDSSNQAISIEEKPNNP-------KSSFAVTGIYFYDQEVV-NIARNIRPSARGELEI-TDVNSYYLDKGLLAVEF 214 (292) Q Consensus 144 Gvv~~d~~~~i~~i~EKP~~~-------~sn~a~~GiY~f~~~i~-~~~~~l~~s~rgE~ei-tD~~~~~l~~~~~~~~~ 214 (292) ||+.+|++++|++|.|||++| .++++++|+|+|+++++ +.+++...+..+++++ .|+++.+++++.+.... T Consensus 178 Gvv~~d~~~rV~~f~EKP~~~~~~~g~~~~~lismGiYvf~~~~L~e~L~~~~~~~~~~~Dfg~diip~~l~~g~v~ay~ 257 (435) T PRK01884 178 GVMAVNENLKVKAFVEKPKDPPAMVGKPDSSLASMGIYVFDADYLYKMLEREVNTPQTSHDFGKDILPKALEEGVLYAHP 257 (435) T ss_pred CEEEECCCCCEEEEEECCCCCCCCCCCCCCCEECCEEEEECHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCEEEEE T ss_conf 74998999978899956888110379866422023259973999999999763177751257898999998519389997 Q ss_pred CCC---------CEEEECCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 354---------12563278457799999999998888098 Q gi|254780922|r 215 LRE---------GSAWFDAGTPESLLDTAVFVRNIENRLGL 246 (292) Q Consensus 215 ~~~---------g~~W~D~Gt~e~l~~As~~i~~~e~~~g~ 246 (292) +.. .++|.|+||+++|++||+.+..-+.+..+ T Consensus 258 f~~~~~G~~~~~~~Yw~dIgt~~sY~~AnmdLL~~~~~~~l 298 (435) T PRK01884 258 FSRSCMGRNTEGEIYWRDVGTLDSFWQSNIDLVSENPQLDI 298 (435) T ss_pred CCCCCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCCHHHC T ss_conf 36764454456772354589989999989998678844432 No 30 >PRK02862 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=100.00 E-value=0 Score=317.53 Aligned_cols=232 Identities=23% Similarity=0.386 Sum_probs=179.2 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCC-----CCCC Q ss_conf 079924888745533443898222247837-4799999999978986899997776564311122222222-----1111 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKW-----GVQF 75 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~-----g~~i 75 (292) -|||||||+||||+|||..+||||+|++|| +|||++|++|+++||++|.+++++....+.+ |++.+..+ |.-. T Consensus 5 ~avILaGG~GtRL~PLT~~rpKp~vPf~GrYRlIDf~Lsn~~nsgI~~V~v~tqy~~~sl~~-Hl~~~w~~~~~~~g~v~ 83 (429) T PRK02862 5 LAIILGGGAGTRLYPLTKLRAKPAVPLAGKYRLIDIPISNCINSGINKIYVLTQFNSASLNR-HISRTYNFSGFSQGFVE 83 (429) T ss_pred EEEEECCCCCCCCCHHHCCCCHHHEEECCCCHHHHHHHHHHHHCCCCEEEEEECCCHHHHHH-HHHCCCCCCCCCCCEEE T ss_conf 99996688886042544588677325888242089999989886998899994658899999-86242157666786068 Q ss_pred CCCCCCCC------CCCCCCCCCCCCCCCCC--CCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCCCCCC Q ss_conf 22233332------24468521011123356--7521321443211110000111000001110012222--10001222 Q gi|254780922|r 76 SYIEQLVP------AGLAQSYILGAEFIGDS--SSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQRYGV 145 (292) Q Consensus 76 ~y~~q~~p------~Gta~Ai~~a~~~i~~~--~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~~yGv 145 (292) .+..|..| +|||+|+.++..++... +.++++++|.+|+.|+.++++.|.++++++|++..+| ++|++||+ T Consensus 84 ~l~~~~~~~~~~~~~Gtadai~~~~~~l~~~~~~~~lV~~GD~l~~~D~~~~l~~H~~~~AdiTi~~~~v~~~~~~~fGv 163 (429) T PRK02862 84 VLAAQQTPDNPSWFQGTADAVRQYLWLFQEWDVDEYLILSGDQLYRMDYSLFVQHHRETGADITLAVLPVDEKQASGFGL 163 (429) T ss_pred ECCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCEEECCCHHHHHHHHHHCCCCEEEEEEECCHHHCCCCCE T ss_conf 60435514787655664899999999997379973999748877826999999999974998279987558576553628 Q ss_pred CCCCCCCCHHHCCCCCCC---------------------CCCCCCHHHHHHHHHHHH-HHHHHCCCCCCCCCCCCCCCCH Q ss_conf 122221000100133333---------------------222222024554100022-4322004675333333343200 Q gi|254780922|r 146 VEVDSSNQAISIEEKPNN---------------------PKSSFAVTGIYFYDQEVV-NIARNIRPSARGELEITDVNSY 203 (292) Q Consensus 146 v~~d~~~~i~~i~EKP~~---------------------~~sn~a~~GiY~f~~~i~-~~~~~l~~s~rgE~eitD~~~~ 203 (292) +++|++++|++|+|||+. .++.+|++|+|+|++++| +.++.. ++. .+|. .|+++. T Consensus 164 v~~D~~grV~~f~EKP~~~~~~~~~~~~~~~~~~~~~~~~~~~lasmGiYif~~~~L~~ll~~~-~~~-~Dfg-~diip~ 240 (429) T PRK02862 164 MKTDDDGRITEFSEKPKGEELKAMAVDTSRLGLSPEEAKEKPYLASMGIYVFSRDVLFDLLNKN-PEH-TDFG-KEIIPE 240 (429) T ss_pred EEECCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECHHHHHHHHHHC-CCC-CHHH-HHHHHH T ss_conf 9999999799999578875444211465223567322356764012259998599999999758-654-2238-889999 Q ss_pred HCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHH Q ss_conf 000132046313541256327845779999999999 Q gi|254780922|r 204 YLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFVRN 239 (292) Q Consensus 204 ~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i~~ 239 (292) +++..+ ...+..+| +|.|+||+++|++||+.+.. T Consensus 241 ~~~~~~-v~~y~~~G-Yw~dIgti~sy~~AnmdLl~ 274 (429) T PRK02862 241 AARGYN-LQAYLFDD-YWEDIGTIEAFYEANLALTQ 274 (429) T ss_pred HHCCCE-EEEEEECC-EEEECCCHHHHHHHHHHHHC T ss_conf 847783-89998147-26878998999999899864 No 31 >cd04183 GT2_BcE_like GT2_BcbE_like is likely involved in the biosynthesis of the polysaccharide capsule. GT2_BcbE_like: The bcbE gene is one of the genes in the capsule biosynthetic locus of Pasteurella multocida. Its deducted product is likely involved in the biosynthesis of the polysaccharide capsule, which is found on surface of a wide range of bacteria. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=100.00 E-value=0 Score=311.84 Aligned_cols=222 Identities=22% Similarity=0.321 Sum_probs=176.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCC--CCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431--1122222222111122233 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVL--KEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~--~~~l~~g~~~g~~i~y~~q 80 (292) -||||||+||||+|+|..+||||+||+|||||+|+|+.|..+|++++++|++..+.... ...+... ..+.++ +..| T Consensus 1 iVIlaaG~GtRl~~~t~~~PK~Ll~v~gkp~i~~~i~~l~~~~~~~~i~v~~~~~~~~~~~~~~~~~~-~~~~~i-~~~~ 78 (231) T cd04183 1 IIIPMAGLGSRFKKAGYTYPKPLIEVDGKPMIEWVIESLAKIFDSRFIFICRDEHNTKFHLDESLKLL-APNATV-VELD 78 (231) T ss_pred CEEECCCCCCCCCCCCCCCCCEEEEECCEEHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHCCC-CCCCEE-EEEC T ss_conf 98986788736475679988750179995789999999986499246861454666678887775155-897189-9956 Q ss_pred CCCCCCCCCCCCCCCCCCCC-CCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCC Q ss_conf 33224468521011123356-75213214432111100001110000011100122221000122212222100010013 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDS-SSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEE 159 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~-~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~E 159 (292) .+++|+|||+++|.++++++ ++++++||+ ++..++.+++..|.++..+++++++...+| +||++.+|++++|.+++| T Consensus 79 ~~~~G~a~av~~a~~~i~~~~~~lv~~gD~-~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~-~~g~v~~~~~~~v~~i~E 156 (231) T cd04183 79 GETLGAACTVLLAADLIDNDDPLLIFNCDQ-IVESDLLAFLAAFRERDLDGGVLTFFSSHP-RWSYVKLDENGRVIETAE 156 (231) T ss_pred CCCCCHHHHHHHHHHHCCCCCEEEEEECCC-CCCCCHHHHHHHHHHCCCCEEEEEEEECCC-CCCCEEECCCCCEEECCC T ss_conf 999967999999998648998099970761-136388999999985699879999991257-776357847984765023 Q ss_pred CCCCCCCCCCHHHHHHHHHH--HHHHHHHC---CCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHH Q ss_conf 33332222220245541000--22432200---4675333333343200000132046313541256327845779 Q gi|254780922|r 160 KPNNPKSSFAVTGIYFYDQE--VVNIARNI---RPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESL 230 (292) Q Consensus 160 KP~~~~sn~a~~GiY~f~~~--i~~~~~~l---~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l 230 (292) || +.|++|++|+|+|++. +++.+.++ ..+.+||+++||+++.++++|.....+......|+|+|||++| T Consensus 157 K~--~~s~~~~~GiY~F~~~~~f~~~~~~~~~~~~~~~gE~yltd~i~~li~~g~kv~~~~i~~~~~~d~GtPedl 230 (231) T cd04183 157 KE--PISDLATAGLYYFKSGSLFVEAAKKMIRKDDSVNGEFYISPLYNELILDGKKVGIYLIDKDDYHSFGTPEDL 230 (231) T ss_pred CC--CCCHHHCCEEEEECCHHHHHHHHHHHHHCCCCCCCEEEHHHHHHHHHHCCCEEEEEEECCCEEEECCCHHHC T ss_conf 58--875002014699746799999999986448655999985899999998889189999789969989696984 No 32 >COG0448 GlgC ADP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=100.00 E-value=0 Score=316.70 Aligned_cols=238 Identities=27% Similarity=0.452 Sum_probs=198.7 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC----C Q ss_conf 79924888745533443898222247837-479999999997898689999777656431112222222211112----2 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFS----Y 77 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~----y 77 (292) |+|||||.||||.|||+.++||-+|++|| .|||++|+||+++||++|.|+++|+.. ...+|++.|.+|+.... + T Consensus 8 aiILaGg~G~rL~~LT~~RakpAVpFgGkYRiIDF~LSN~vNSGi~~I~VltQy~~~-SL~~Hi~~G~~w~l~~~~~~v~ 86 (393) T COG0448 8 AIILAGGRGSRLSPLTKDRAKPAVPFGGKYRIIDFALSNCVNSGIRRIGVLTQYKSH-SLNDHIGRGWPWDLDRKNGGVF 86 (393) T ss_pred EEEECCCCCCCCCHHHHCCCCCCCCCCCEEEEEEEECCCCCCCCCCEEEEEECCCHH-HHHHHHHCCCCCCCCCCCCCEE T ss_conf 999758987755224547503444427646898677000010388769999435536-7999862898656655558479 Q ss_pred CC--C---CC---CCCCCCCCCCCCCCCCC--CCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCCCCCC Q ss_conf 23--3---33---22446852101112335--67521321443211110000111000001110012222--10001222 Q gi|254780922|r 78 IE--Q---LV---PAGLAQSYILGAEFIGD--SSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQRYGV 145 (292) Q Consensus 78 ~~--q---~~---p~Gta~Ai~~a~~~i~~--~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~~yGv 145 (292) +. | .. -.|||+|+++...++.. .+.+|||+.|++|..|+..+++.|.++++++|+...+| +++++||+ T Consensus 87 ilp~~~~~~~~~wy~Gtadai~Qnl~~i~~~~~eyvlIlsgDhIYkmDy~~ml~~H~~~gadiTv~~~~Vp~~eas~fGi 166 (393) T COG0448 87 ILPAQQREGGERWYEGTADAIYQNLLIIRRSDPEYVLILSGDHIYKMDYSDMLDFHIESGADVTVAVKEVPREEASRFGV 166 (393) T ss_pred EECCHHCCCCCCCEECCHHHHHHHHHHHHHCCCCEEEEECCCEEEECCHHHHHHHHHHCCCCEEEEEEECCHHHHHHCCC T ss_conf 95711125787312044799997689997439888999528778944899999999981998799999879676002481 Q ss_pred CCCCCCCCHHHCCCCCCC-CCCC-CCHHHHHHHHHHHH-HHHHHCCCCCCCCCCC-CCCCCHHCCCCCEEEEECCCCEEE Q ss_conf 122221000100133333-2222-22024554100022-4322004675333333-343200000132046313541256 Q gi|254780922|r 146 VEVDSSNQAISIEEKPNN-PKSS-FAVTGIYFYDQEVV-NIARNIRPSARGELEI-TDVNSYYLDKGLLAVEFLREGSAW 221 (292) Q Consensus 146 v~~d~~~~i~~i~EKP~~-~~sn-~a~~GiY~f~~~i~-~~~~~l~~s~rgE~ei-tD~~~~~l~~~~~~~~~~~~g~~W 221 (292) +++|++++|++|+|||.+ |++| +|++|+|+|+.+.+ +++..-..++.+..+. .|+++.+++++. ...+...| || T Consensus 167 m~~D~~~~i~~F~eKp~~~~~~~~laSMgiYIf~~~~L~~~L~~~~~~~~~~~DfgkdiIp~~~~~~~-v~AY~f~g-Yw 244 (393) T COG0448 167 MNVDENGRIIEFVEKPADGPPSNSLASMGIYIFNTDLLKELLEEDAKDPNSSHDFGKDIIPKLLERGK-VYAYEFSG-YW 244 (393) T ss_pred EEECCCCCEEEEEECCCCCCCCCCEEEEEEEEECHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCC-EEEEECCC-HH T ss_conf 58879997887352667688665313556699829999999998741667733106778898875087-77874256-66 Q ss_pred ECCCCHHHHHHHHHHHHHHHHH Q ss_conf 3278457799999999998888 Q gi|254780922|r 222 FDAGTPESLLDTAVFVRNIENR 243 (292) Q Consensus 222 ~D~Gt~e~l~~As~~i~~~e~~ 243 (292) .|+||.++|++||+++...++. T Consensus 245 ~dVgTi~syy~aNmdLl~~~~~ 266 (393) T COG0448 245 RDVGTIDSYYEANMDLLSPQPE 266 (393) T ss_pred HHCCCHHHHHHHHHHHCCCCCC T ss_conf 5266399999843876289975 No 33 >PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional Probab=100.00 E-value=4.2e-45 Score=302.98 Aligned_cols=225 Identities=18% Similarity=0.272 Sum_probs=178.5 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |+|||||||+||||++ ++||||+||+|||||+|+|+.+..+|+++|++|++|.. +.++++++++ +++|+.| T Consensus 6 m~aVILAAGkGTRM~s---~~PKvL~pi~gkPml~hvi~~l~~~g~~~ivvVvg~~~-e~i~~~~~~~-----~i~~v~Q 76 (456) T PRK09451 6 MSVVILAAGKGTRMYS---DLPKVLHTLAGKPMVQHVIDAANELGAAHVHLVYGHGG-DLLKQTLKDE-----PLNWVLQ 76 (456) T ss_pred CEEEEECCCCCCCCCC---CCCHHHHEECCHHHHHHHHHHHHHCCCCEEEEEECCCH-HHHHHHHCCC-----CCEEEEE T ss_conf 5399985778777799---97957504489869999999998769980999969987-9999874458-----8449995 Q ss_pred CCCCCCCCCCCCCCCCCCCC-CCCEEECCC-CCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCC Q ss_conf 33224468521011123356-752132144-3211110000111000001110012222100012221222210001001 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDS-SSVLILGDN-VFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIE 158 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~-~~~lilgDn-i~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~ 158 (292) ++|+|||||+.+|.+++.++ +|++++||. ++....+.++++.+. ..++++++..++||+.||++.. ++|++.+|+ T Consensus 77 ~eqlGTghAV~~A~~~l~~~~~vLVl~GD~PLi~~~tl~~l~~~~~--~~~~~llt~~~~dP~~YGrIi~-~~g~v~~IV 153 (456) T PRK09451 77 AEQLGTGHAMQQAAPFFADDEDILMLYGDVPLISVETLQRLRDAKP--QGGIGLLTVKLDNPTGYGRITR-ENGKVVGIV 153 (456) T ss_pred CCCCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHCC--CCCEEEEEEECCCCCCCEEEEE-CCCCEEEEE T ss_conf 8889729999998886066885899959801569999999986251--3885999997689654648994-389579999 Q ss_pred CCCC----CCCCCCCHHHHHHHHHHHHH-HHHHCC-CCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEEC---CCCHHH Q ss_conf 3333----32222220245541000224-322004-67533333334320000013204631354125632---784577 Q gi|254780922|r 159 EKPN----NPKSSFAVTGIYFYDQEVVN-IARNIR-PSARGELEITDVNSYYLDKGLLAVEFLREGSAWFD---AGTPES 229 (292) Q Consensus 159 EKP~----~~~sn~a~~GiY~f~~~i~~-~~~~l~-~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D---~Gt~e~ 229 (292) ||+. ....+.+++|+|+|+.+.|. ++.+++ +++++||++||++..+.++|.....+.... |.+ +.+..+ T Consensus 154 E~kda~~~e~~i~eiNaGIy~f~~~~l~~~L~~i~~~n~~gEyyLTDii~~~~~~g~~v~~v~~~~--~~e~~GvN~~~~ 231 (456) T PRK09451 154 EHKDATDEQRQIQEINTGILVANGADLKRWLAKLTNNNAQGEYYITDIIALAYQEGREIVAVHPTR--LSEVEGVNNRLQ 231 (456) T ss_pred ECCCCCHHHHCCCEECCEEEEECHHHHHHHHHHCCCCCCCCCEEECHHHHHHHHCCCEEEEEECCC--HHHHCCCCCHHH T ss_conf 826798455211122024899579999998875067554562563056788874695799998288--799446798999 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780922|r 230 LLDTAVFVRN 239 (292) Q Consensus 230 l~~As~~i~~ 239 (292) |.++....+. T Consensus 232 La~~e~~~~~ 241 (456) T PRK09451 232 LARLERVYQA 241 (456) T ss_pred HHHHHHHHHH T ss_conf 9998999999 No 34 >cd02523 PC_cytidylyltransferase Phosphocholine cytidylyltransferases catalyze the synthesis of CDP-choline. This family contains proteins similar to prokaryotic phosphocholine (P-cho) cytidylyltransferases. Phosphocholine (PC) cytidylyltransferases catalyze the transfer of a cytidine monophosphate from CTP to phosphocholine to form CDP-choline. PC is the most abundant phospholipid in eukaryotic membranes and it is also important in prokaryotic membranes. For pathogenic prokaryotes, the cell surface PC facilitates the interaction with host surface and induces attachment and invasion. In addition cell wall PC serves as scaffold for a group of choline-binding proteins that are secreted from the cells. Phosphocholine (PC) cytidylyltransferase is a key enzyme in the prokaryotic choline metabolism pathway. It has been hypothesized to consist of a choline transport system, a choline kinase, CTP:phosphocholine cytidylyltransferase, and a choline phosphotransferase that transfers P-Cho from CDP Probab=100.00 E-value=5.2e-43 Score=290.01 Aligned_cols=222 Identities=23% Similarity=0.329 Sum_probs=170.9 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431112222222211112223333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLV 82 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~ 82 (292) |||||||+||||+|+|..+||||+||+|+|||+|+|+.|..+||++|+|+++|.. +++++++++. .++++.+..|.. T Consensus 1 AiILAaG~GtRl~plT~~~PK~L~~i~gkpli~~~i~~l~~~gi~~i~iv~gy~~-e~i~~~~~~~--~~i~~i~n~~~~ 77 (229) T cd02523 1 AIILAAGRGSRLRPLTEDRPKCLLEINGKPLLERQIETLKEAGIDDIVIVTGYKK-EQIEELLKKY--PNIKFVYNPDYA 77 (229) T ss_pred CEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCCCEEECCCCH-HHHHHHHHHC--CCCCCEEEEECC T ss_conf 9893687765688645798803217998799999999999849985366555489-9999998531--365403400023 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCC--CCCCCCCCCCCCCCCCCCHHHCCCC Q ss_conf 22446852101112335675213214432111100001110000011100122--2210001222122221000100133 Q gi|254780922|r 83 PAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGC--HVQNPQRYGVVEVDSSNQAISIEEK 160 (292) Q Consensus 83 p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~--~v~~p~~yGvv~~d~~~~i~~i~EK 160 (292) +.|+++|+++|+++++ ++|++++||++ ++.++.+.+. +.....+++.. ....+.+|+++.. ..+.+..++|| T Consensus 78 ~~g~~~sl~~a~~~~~-~~~lv~~gD~i-~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ 151 (229) T cd02523 78 ETNNIYSLYLARDFLD-EDFLLLEGDVV-FDPSILERLL---SSPADNAILVDKKTKEWEDEYVKDLD-DAGVLLGIISK 151 (229) T ss_pred CCCHHHHHHHHHHHCC-CCEEEEECCEE-CCHHHHHHHH---HCCCCCEEEEEECCCCCCCCEEEEEC-CCCCEEEEEEC T ss_conf 6782999998887558-98599957765-4667999998---47788879999564577874479942-68708999868 Q ss_pred CCCCCC-CCCHHHHHHHHHHHHH----HHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHH Q ss_conf 333222-2220245541000224----3220046753333333432000001320463135412563278457799999 Q gi|254780922|r 161 PNNPKS-SFAVTGIYFYDQEVVN----IARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTA 234 (292) Q Consensus 161 P~~~~s-n~a~~GiY~f~~~i~~----~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As 234 (292) |+.+.+ +.+.+|+|+|++++++ .+.++.++.+++++++|+++.+++++.+.+.....+ .|+|+|||++|.+|. T Consensus 152 ~~~~~~~~~~~~Gi~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~i~~li~~~~~~~~~~~~~-~W~dIgt~edl~~Ae 229 (229) T cd02523 152 AKNLEEIQGEYVGISKFSPEDADRLAEALEELIEAGRVNLYYEDALQRLISEEGVKVKDISDG-FWYEIDDLEDLERAE 229 (229) T ss_pred CCCCCCCCCEEEEEEEECHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCEEEECCCC-EEEECCCHHHHHHHC T ss_conf 999887462899999998999999999999728677787239999999980799669984893-199696999999649 No 35 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=100.00 E-value=1.9e-40 Score=274.00 Aligned_cols=236 Identities=17% Similarity=0.298 Sum_probs=187.8 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |.+||||||+||||+ +..||.|+||+||||+.|+|+.+...+.++|.+|.+|.. +++++-+.+... -.+.|+.| T Consensus 1 ~~~vILAAGkGTRMk---S~lPKVLH~laGkpMl~hVi~~A~~L~~~~i~vV~GH~~-~~V~~~l~~~~d--~t~~~v~Q 74 (461) T TIGR01173 1 LSVVILAAGKGTRMK---SKLPKVLHPLAGKPMLEHVIDAARKLSPEKIHVVLGHGA-EQVRKALAEEND--KTVNWVLQ 74 (461) T ss_pred CEEEEEECCCCCCCC---CCCCCCCCHHHCCCHHHHHHHHHHCCCCCCEEEEECCCH-HHHHHHHHCCCC--CEEEEEEE T ss_conf 938998068883111---386860120103227899999985378341599981686-999998503799--67999996 Q ss_pred CCCCC-CCCCCCCCCCCCC--CCC-CCEEECCC-CCCCCCCC-CHHHHHCC-CHHHHHCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 33224-4685210111233--567-52132144-32111100-00111000-0011100122221000122212222100 Q gi|254780922|r 81 LVPAG-LAQSYILGAEFIG--DSS-SVLILGDN-VFYGSDIS-DIFHKARA-RRNSATVVGCHVQNPQRYGVVEVDSSNQ 153 (292) Q Consensus 81 ~~p~G-ta~Ai~~a~~~i~--~~~-~~lilgDn-i~~~~~l~-~~l~~~~~-~~~~~ti~~~~v~~p~~yGvv~~d~~~~ 153 (292) .+|+| ||||+.+|.+|+. ++. .++++||. ++..+.|. +|++.|.+ +...+|+++...+||+.||.|-.+++|+ T Consensus 75 ~~qlGGTGHAv~~a~~~l~~~~~~~vLvLyGDvPLi~~eTL~m~Ll~~~~~~~~~~~tlLt~~l~DP~GYGRI~r~~~g~ 154 (461) T TIGR01173 75 AEQLGGTGHAVLQALPFLSEDDDGRVLVLYGDVPLISAETLEMRLLESHRQLNGAKVTLLTAKLEDPTGYGRIIRENDGA 154 (461) T ss_pred CCCCCCHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCEEEECCCCC T ss_conf 88978728999871167888668608999588787756778779888630001045179998718889543589848995 Q ss_pred HHHCCCC-CCCCC---CCCCHHHHHHHHHHHH-HHHHHCCC-CCCCCCCCCCCCCHHCCCCCEEEEECCCCE-EEECCCC Q ss_conf 0100133-33322---2222024554100022-43220046-753333333432000001320463135412-5632784 Q gi|254780922|r 154 AISIEEK-PNNPK---SSFAVTGIYFYDQEVV-NIARNIRP-SARGELEITDVNSYYLDKGLLAVEFLREGS-AWFDAGT 226 (292) Q Consensus 154 i~~i~EK-P~~~~---sn~a~~GiY~f~~~i~-~~~~~l~~-s~rgE~eitD~~~~~l~~~~~~~~~~~~g~-~W~D~Gt 226 (292) |..|+|- -.++. =.-+|+|+|+|+...+ .+|+++.+ ++.|||++||++..+.++|.....+..... --.=+-+ T Consensus 155 V~~IVE~KDA~~eqk~I~eiNtG~y~~~~~~L~~~L~~l~n~NaqgEYYLTD~ia~a~~~g~~v~~~~~~d~~E~~GvNd 234 (461) T TIGR01173 155 VQAIVEEKDANEEQKAIKEINTGVYVFDGAALKRWLPKLSNNNAQGEYYLTDVIALAVADGEEVRAVQVDDSEEVLGVND 234 (461) T ss_pred EEEEECCCCCCHHHHCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHCCCCEEEEEEECCHHHHCCCCC T ss_conf 89997335988698035278887999832899988876287704443147899999850894789998087598336679 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 5779999999999888 Q gi|254780922|r 227 PESLLDTAVFVRNIEN 242 (292) Q Consensus 227 ~e~l~~As~~i~~~e~ 242 (292) -..|-++.++++.-+. T Consensus 235 R~qLa~lE~~~q~r~~ 250 (461) T TIGR01173 235 RLQLAQLERILQRRIA 250 (461) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 8899999999999999 No 36 >cd02508 ADP_Glucose_PP ADP-glucose pyrophosphorylase is involved in the biosynthesis of glycogen or starch. ADP-glucose pyrophosphorylase (glucose-1-phosphate adenylyltransferase) catalyzes a very important step in the biosynthesis of alpha 1,4-glucans (glycogen or starch) in bacteria and plants: synthesis of the activated glucosyl donor, ADP-glucose, from glucose-1-phosphate and ATP. ADP-glucose pyrophosphorylase is a tetrameric allosterically regulated enzyme. While a homotetramer in bacteria, in plant chloroplasts and amyloplasts, it is a heterotetramer of two different, yet evolutionary related, subunits. There are a number of conserved regions in the sequence of bacterial and plant ADP-glucose pyrophosphorylase subunits. It is a subfamily of a very diverse glycosy transferase family 2. Probab=100.00 E-value=1.2e-36 Score=250.10 Aligned_cols=184 Identities=25% Similarity=0.422 Sum_probs=147.4 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCC------- Q ss_conf 79924888745533443898222247837-4799999999978986899997776564311122222222111------- Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQ------- 74 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~------- 74 (292) |||||||+||||+|+|..+||||+||+|| |||+|+|++|.++||++|+|+++|.. +++.+|+++|++|+++ T Consensus 1 AvIlagG~GtRL~pLT~~~PKp~lpi~gk~~iId~~l~~l~~~Gi~~i~i~~~y~~-~~i~~hl~~g~~~~~~~~~~gi~ 79 (200) T cd02508 1 AIILAGGEGTRLSPLTKKRAKPAVPFGGRYRLIDFPLSNMVNSGIRNVGVLTQYKS-RSLNDHLGSGKEWDLDRKNGGLF 79 (200) T ss_pred CEEECCCCCCCCHHHHCCCCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCH-HHHHHHHHCCCCCCCCCCCCCEE T ss_conf 98957888863625447997413088991620799999887679767998345358-99999985640125765668589 Q ss_pred CC----CCCCCCCCCCCCCCCCCCCCCCC---CCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCC Q ss_conf 12----22333322446852101112335---675213214432111100001110000011100122221000122212 Q gi|254780922|r 75 FS----YIEQLVPAGLAQSYILGAEFIGD---SSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVE 147 (292) Q Consensus 75 i~----y~~q~~p~Gta~Ai~~a~~~i~~---~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~ 147 (292) +. +..+..+.||++|+..+.+++.+ +.|+++.|| ++++.|+.++++.|.++++++|++. T Consensus 80 i~~~~~~~~~~~~~Gt~~~i~~~~~~l~~~~~~~~lv~ngD-~l~~~D~~~~l~~H~~~~a~~Ti~~------------- 145 (200) T cd02508 80 ILPPQQRKGGDWYRGTADAIYQNLDYIERSDPEYVLILSGD-HIYNMDYREMLDFHIESGADITVVY------------- 145 (200) T ss_pred EEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCC-EEEECCHHHHHHHHHHCCCCEEEEE------------- T ss_conf 96313336876553478999999998721788729997699-7870699999999987699989999------------- Q ss_pred CCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHH-HHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECC Q ss_conf 2221000100133333222222024554100022-4322004675333333343200000132046313541256327 Q gi|254780922|r 148 VDSSNQAISIEEKPNNPKSSFAVTGIYFYDQEVV-NIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDA 224 (292) Q Consensus 148 ~d~~~~i~~i~EKP~~~~sn~a~~GiY~f~~~i~-~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~ 224 (292) .+++|+|+|++++| +++++..+.. .....+|+++.++++++ ...+..+| +|+|+ T Consensus 146 --------------------~in~GiYi~~~~~l~~~l~~~~~~~-~~d~~~d~ip~l~~~~~-v~~y~~~g-yw~DI 200 (200) T cd02508 146 --------------------KASMGIYIFSKDLLIELLEEDAADG-SHDFGKDIIPAMLKKLK-IYAYEFNG-YWADI 200 (200) T ss_pred --------------------EEECEEEEEEHHHHHHHHHHHCCCC-CCCCHHHHHHHHHHCCC-EEEEECCC-EEECC T ss_conf --------------------8504899980899999998632467-67558999999960398-89996688-57494 No 37 >COG1207 GlmU N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=3.5e-36 Score=247.17 Aligned_cols=231 Identities=19% Similarity=0.336 Sum_probs=185.8 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |.+||||||+||||+ +..||.|+|++||||++|+|+.+...+.+++.+|+++.. +..+..+.... ++.|+.| T Consensus 3 ~~~vILAAGkGTRMk---S~lPKVLH~vaGkpMl~hVi~~a~~l~~~~i~vVvGh~a-e~V~~~~~~~~----~v~~v~Q 74 (460) T COG1207 3 LSAVILAAGKGTRMK---SDLPKVLHPVAGKPMLEHVIDAARALGPDDIVVVVGHGA-EQVREALAERD----DVEFVLQ 74 (460) T ss_pred CEEEEEECCCCCCCC---CCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEECCCH-HHHHHHHCCCC----CCEEEEE T ss_conf 169999448875345---798411100168008999999876268661899986877-89998756354----7339996 Q ss_pred CCCCCCCCCCCCCCCCC-CCCC--CCEEECCC-CCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 33224468521011123-3567--52132144-32111100001110000011100122221000122212222100010 Q gi|254780922|r 81 LVPAGLAQSYILGAEFI-GDSS--SVLILGDN-VFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAIS 156 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i-~~~~--~~lilgDn-i~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~ 156 (292) .+|+|||||+.+|++++ .+.+ .++++||. ++....+..++..|....+.+++++...+||+.||.+..++++.|.. T Consensus 75 ~eqlGTgHAV~~a~~~l~~~~~g~vLVl~GD~PLit~~TL~~L~~~~~~~~~~~tvLt~~~~dP~GYGRIvr~~~g~V~~ 154 (460) T COG1207 75 EEQLGTGHAVLQALPALADDYDGDVLVLYGDVPLITAETLEELLAAHPAHGAAATVLTAELDDPTGYGRIVRDGNGEVTA 154 (460) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCEEEECCCCCEEE T ss_conf 25587389999666764047897389996996667899999999863403775599998738998754699939986999 Q ss_pred CCCCCC-CC---CCCCCHHHHHHHHHH-HHHHHHHCC-CCCCCCCCCCCCCCHHCCCCCEEEEECCCCE-EEECCCCHHH Q ss_conf 013333-32---222220245541000-224322004-6753333333432000001320463135412-5632784577 Q gi|254780922|r 157 IEEKPN-NP---KSSFAVTGIYFYDQE-VVNIARNIR-PSARGELEITDVNSYYLDKGLLAVEFLREGS-AWFDAGTPES 229 (292) Q Consensus 157 i~EKP~-~~---~sn~a~~GiY~f~~~-i~~~~~~l~-~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~-~W~D~Gt~e~ 229 (292) |+|--. ++ .-+.+++|+|+|+.. .+++++++. .++.|||++||++..+..+|.....+....+ --.-+.+-.. T Consensus 155 IVE~KDA~~eek~I~eiNtGiy~f~~~~L~~~L~~l~nnNaqgEYYLTDvI~i~~~~g~~V~a~~~~d~~E~~GVN~R~q 234 (460) T COG1207 155 IVEEKDASEEEKQIKEINTGIYAFDGAALLRALPKLSNNNAQGEYYLTDVIAIARNEGEKVRAVHVDDEEEVLGVNDRVQ 234 (460) T ss_pred EEECCCCCHHHHCCCEEEEEEEEECHHHHHHHHHHHCCCCCCCCEEHHHHHHHHHHCCCEEEEEECCCHHHHCCCCCHHH T ss_conf 99747899789508388206899867999999987256665574868999999985797699996696688627674999 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254780922|r 230 LLDTAVFVRN 239 (292) Q Consensus 230 l~~As~~i~~ 239 (292) |-+|....+. T Consensus 235 La~~e~~~q~ 244 (460) T COG1207 235 LAEAERIMQR 244 (460) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 38 >cd04198 eIF-2B_gamma_N The N-terminal domain of gamma subunit of the eIF-2B is a subfamily of glycosyltransferase 2. N-terminal domain of gamma subunit of the eukaryotic translation initiation factor 2B (eIF-2B): eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit gamma shares sequence similarity with epsilon subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=100.00 E-value=1.3e-34 Score=237.44 Aligned_cols=137 Identities=27% Similarity=0.397 Sum_probs=118.6 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCC---CCCCCCCCCC Q ss_conf 9079924888745533443898222247837479999999997898689999777656431112222---2222111122 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGS---GEKWGVQFSY 77 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~---g~~~g~~i~y 77 (292) |||||||||+||||+|+|..+||||+||+|+|||+|+|+.|.++|+++|+|+|++...+.+++++.. .......+.+ T Consensus 1 ~qAVILagG~GtRl~plT~~~PK~Llpi~n~P~i~y~l~~l~~~G~~~i~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 80 (214) T cd04198 1 FQAVILAGGGGSRLYPLTDNIPKALLPVANKPMIWYPLDWLEKAGFEDVIVVVPEEEQAEISTYLRSFPLNLKQKLDEVT 80 (214) T ss_pred CEEEEECCCCCCCCCHHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHCCCCCCCEEEEE T ss_conf 95999868887665643569986466899997499999999986998899995753599999999970456677628999 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCC Q ss_conf 23333224468521011123356752132144321111000011100000111001222210 Q gi|254780922|r 78 IEQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQN 139 (292) Q Consensus 78 ~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~ 139 (292) +.+++|.|||||+....+++.+ +|+++.|| ++++.++.++++.|.++++.+|++.+++.. T Consensus 81 ~~~~~~~gT~~aLr~~~~~i~~-dflvl~gD-~i~~i~l~~ll~~Hr~~~a~~T~~l~~~~~ 140 (214) T cd04198 81 IVLDEDMGTADSLRHIRKKIKK-DFLVLSCD-LITDLPLIELVDLHRSHDASLTVLLYPPPV 140 (214) T ss_pred EECCCCCCHHHHHHHHHHCCCC-CEEEECCC-EEECCCHHHHHHHHHHHCCCEEEEEEECCC T ss_conf 6579878719999999851899-98998599-836688899999999709948999953576 No 39 >cd04197 eIF-2B_epsilon_N The N-terminal domain of epsilon subunit of the eIF-2B is a subfamily of glycosyltransferase 2. N-terminal domain of epsilon subunit of the eukaryotic translation initiation factor 2B (eIF-2B): eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit epsilon shares sequence similarity with gamma subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=99.98 E-value=2.9e-33 Score=228.86 Aligned_cols=178 Identities=20% Similarity=0.314 Sum_probs=135.9 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCC-----CCC Q ss_conf 907992488874553344389822224783747999999999789868999977765643111222222221-----111 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWG-----VQF 75 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g-----~~i 75 (292) |||||||||+||||+|+|..+||||+||+|+|||+|+|+.|..+|+++|+|+++++ .+++++|+++....+ ..+ T Consensus 1 ~qAVIlAgg~gtRl~PlT~~~PK~LlPv~n~Pli~y~l~~L~~~G~~ei~v~~~~~-~~~i~~~~~~~~~~~~~~~~~~v 79 (217) T cd04197 1 LQAVVLADSFNRRFRPLTKEKPRCLLPLANVPLIDYTLEFLALNGVEEVFVFCCSH-SDQIKEYIEKSKWSKPKSSLMIV 79 (217) T ss_pred CEEEEECCCCCCCCCHHHCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCC-HHHHHHHHHHHHHCCCCCCCCEE T ss_conf 95999878887725620358986533799998399999999987997799994689-99999999843221566777348 Q ss_pred CCCCCCCCCCCCCCCCC--CCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCC-----HHHHHCCCCCCCCCC------- Q ss_conf 22233332244685210--1112335675213214432111100001110000-----011100122221000------- Q gi|254780922|r 76 SYIEQLVPAGLAQSYIL--GAEFIGDSSSVLILGDNVFYGSDISDIFHKARAR-----RNSATVVGCHVQNPQ------- 141 (292) Q Consensus 76 ~y~~q~~p~Gta~Ai~~--a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~-----~~~~ti~~~~v~~p~------- 141 (292) .+...++..++|+|+.. ++.++.+ +|+|+.||. +++.++.++++.|.++ .+.+|++..++..+. T Consensus 80 ~~~~~~~~~~~Gdalr~l~~~~~i~~-dFlv~~gD~-it~~~l~~~l~~Hr~~r~~dk~a~~T~~~~~~~~~~~~~~~~~ 157 (217) T cd04197 80 IIIMSEDCRSLGDALRDLDAKGLIRG-DFILVSGDV-VSNIDLKEILEEHKERRKKDKNAIMTMVLKEASPPHRTRRTGE 157 (217) T ss_pred EEEECCCCCCHHHHHHHHHHHCCCCC-CEEEEECCC-EECCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCC T ss_conf 99756888765169998876044789-999997990-4137889999999863133757448998704688742345776 Q ss_pred CCCCCCCCCCCCHHHCCCCCCCC--------------------CCCCCHHHHHHHHHHHH Q ss_conf 12221222210001001333332--------------------22222024554100022 Q gi|254780922|r 142 RYGVVEVDSSNQAISIEEKPNNP--------------------KSSFAVTGIYFYDQEVV 181 (292) Q Consensus 142 ~yGvv~~d~~~~i~~i~EKP~~~--------------------~sn~a~~GiY~f~~~i~ 181 (292) .++++.-++++++..+.++|... .+++..+|+|+++|+++ T Consensus 158 ~~~v~id~~~~rll~~~~~~~~~~~~~~~l~~~ll~~~~~~~i~~dL~D~hIyIcsp~VL 217 (217) T cd04197 158 EFVIAVDPKTSRLLHYEELPGSKYRSITDLPSELLGSNSEVEIRHDLLDCHIDICSPDVL 217 (217) T ss_pred CCEEEEECCCCEEEEEECCCCCCCCCCCCCCHHHHCCCCCEEEECCCEECEEEEEECCCC T ss_conf 508999589981999842466554443345799970599589984885033899602549 No 40 >cd02507 eIF-2B_gamma_N_like The N-terminal of eIF-2B_gamma_like is predicted to have glycosyltransferase activity. N-terminal domain of eEIF-2B epsilon and gamma, subunits of eukaryotic translation initiators, is a subfamily of glycosyltranferase 2 and is predicted to have glycosyltranferase activity. eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit epsilon shares sequence similarity with gamma subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=99.97 E-value=1.5e-32 Score=224.47 Aligned_cols=179 Identities=22% Similarity=0.299 Sum_probs=127.9 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCC-----CCCCCCCC Q ss_conf 9079924888745533443898222247837479999999997898689999777656431112222-----22221111 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGS-----GEKWGVQF 75 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~-----g~~~g~~i 75 (292) |||||||||+||||+|+|...||||+||+|+|||+|+|+.|..+||++|+|+++++..++++++... .....+.+ T Consensus 1 ~qAVIlagg~gtrl~Plt~~~PK~lLpi~n~P~i~y~l~~l~~~gi~~i~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (216) T cd02507 1 FQAVVLADGFGSRFLPLTSDIPKALLPVANVPLIDYTLEWLEKAGVEEVFVVCCEHSQAIIEHLLKSKWSSLSSKMIVDV 80 (216) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCEECCEEHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCEEE T ss_conf 93999827897736600069986556799999799999999987998899995888899999998624345676764478 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCC--HHHHHCCCCCCC--CC----------C Q ss_conf 222333322446852101112335675213214432111100001110000--011100122221--00----------0 Q gi|254780922|r 76 SYIEQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARAR--RNSATVVGCHVQ--NP----------Q 141 (292) Q Consensus 76 ~y~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~--~~~~ti~~~~v~--~p----------~ 141 (292) .+..+.++.|||+++..+++.+.+ +|+++.|| ++++.++..+++.|.+. ...+++.....+ ++ . T Consensus 81 ~~~~~~~~~Gta~~l~~~~~~i~~-dflvl~gD-~i~~~~l~~~l~~~~~~~~~~~~tl~~~~~~~~~~~~~~~~~~~~~ 158 (216) T cd02507 81 ITSDLCESAGDALRLRDIRGLIRS-DFLLLSCD-LVSNIPLSELLEERRKKDKNAIATLTVLLASPPVSTEQSKKTEEED 158 (216) T ss_pred EECCCCCCCCCHHHHHHHHHCCCC-CEEEECCC-EEECCCHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCC T ss_conf 965568876557999987615799-98998599-7876898999999987485774499999823777654333467663 Q ss_pred CCCCCCCCCCCCHHH----------------CCCCCC--CCCCCCCHHHHHHHHHHHH Q ss_conf 122212222100010----------------013333--3222222024554100022 Q gi|254780922|r 142 RYGVVEVDSSNQAIS----------------IEEKPN--NPKSSFAVTGIYFYDQEVV 181 (292) Q Consensus 142 ~yGvv~~d~~~~i~~----------------i~EKP~--~~~sn~a~~GiY~f~~~i~ 181 (292) -+|+...+....... +.+++. .-.||++.+|+|+|+++++ T Consensus 159 ~~~~d~~~~~~l~i~~~~~~~~~~~~~i~~~~l~~~~~~~i~snL~D~giYI~sp~VL 216 (216) T cd02507 159 VIAVDSKTQRLLLLHYEEDLDEDLELIIRKSLLSKHPNVTIRTDLLDCHIYICSPDVL 216 (216) T ss_pred EEEEECCCCEEEEEEECCCCCCCCCCCCCHHHHHCCCCEEEECCCEEEEEEEECCCCC T ss_conf 7999379987999961003454344213278861699749844884857998241009 No 41 >TIGR02623 G1P_cyt_trans glucose-1-phosphate cytidylyltransferase; InterPro: IPR013446 Glucose-1-phosphate cytidylyltransferases, also known as CDP-glucose pyrophosphorylase, are the product of the rfbF gene and produce CDP-D-glucose.. Probab=99.97 E-value=8e-32 Score=219.84 Aligned_cols=217 Identities=24% Similarity=0.366 Sum_probs=173.7 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCC-- Q ss_conf 907992488874553344389822224783747999999999789868999977765643111222222221111222-- Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYI-- 78 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~-- 78 (292) |||||||||+|||+..-|.-+||||++|+|||||.|+++.....||+|++|.++|+. ..+++|+.+..-.--++++- T Consensus 1 MKaViLAGGlGTRisEEt~lrPKPM~EiGgkPIlWHI~k~Y~~~Gi~~FiiC~GYkG-y~IKeyF~NY~l~~SDvT~~l~ 79 (256) T TIGR02623 1 MKAVILAGGLGTRISEETHLRPKPMVEIGGKPILWHIMKIYSAHGIKEFIICLGYKG-YVIKEYFANYFLHMSDVTIDLE 79 (256) T ss_pred CEEEEEECCCCCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHCCCCEEEEEECCCC-CEEEEECCCCEEECCEEEEEEC T ss_conf 937997178644322435358976077679637788999997528330278853433-2201001063231131678751 Q ss_pred --------CCCCC-----------CCCCCCCCCCCCCCC-CCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCC Q ss_conf --------33332-----------244685210111233-5675213214432111100001110000011100122221 Q gi|254780922|r 79 --------EQLVP-----------AGLAQSYILGAEFIG-DSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQ 138 (292) Q Consensus 79 --------~q~~p-----------~Gta~Ai~~a~~~i~-~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~ 138 (292) ...+| .-|||-+..-++|+. +++||+.+||-+ -+.++..++..|.+....||+.+++ T Consensus 80 ~n~~~~H~~~~EPW~VTLvDTG~~t~TGGRl~Rv~~~~~~d~~Fc~TYGDGv-~~~~I~~~~~~H~~~g~~AT~tAv~-- 156 (256) T TIGR02623 80 DNTIEVHEKRAEPWRVTLVDTGESTQTGGRLKRVREYLEDDEAFCLTYGDGV-ADIDIKALIAFHRKHGKKATVTAVQ-- 156 (256) T ss_pred CCEEEECCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHCCCCCEEEEECCCC-CCCCHHHHHHHHHHCCCCEEEEEEC-- T ss_conf 5704531213897168888568776678327889998437876898715850-3325899999988629806788607-- Q ss_pred CCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCC Q ss_conf 00012221222210001001333332222220245541000224322004675333333343200000132046313541 Q gi|254780922|r 139 NPQRYGVVEVDSSNQAISIEEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREG 218 (292) Q Consensus 139 ~p~~yGvv~~d~~~~i~~i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g 218 (292) -|-|||-+.+++ ..|.+|.|||... .-++|.|+++++|++++.+..-... +| .+.+..+.++|.+.+ +-+.| T Consensus 157 PPGRfGaL~~~~-~~V~~F~EKP~gd-~g~iNGGFFVL~P~V~d~I~~D~~~----wE-~~~L~~L~~~g~L~a-Y~H~G 228 (256) T TIGR02623 157 PPGRFGALELEG-ESVTSFQEKPLGD-GGLINGGFFVLNPSVLDLIDGDATV----WE-SEPLETLAQRGELSA-YEHSG 228 (256) T ss_pred CCCCEEEEEECC-CCEEEECCCCCCC-CEEEECCEEEECCCEEEEECCCCEE----EC-CHHHHHHHHCCCCEE-EEECC T ss_conf 888302367725-5111101474889-8168065687483202022488502----01-037899974488137-85177 Q ss_pred EEEECCCCHHHH Q ss_conf 256327845779 Q gi|254780922|r 219 SAWFDAGTPESL 230 (292) Q Consensus 219 ~~W~D~Gt~e~l 230 (292) +|--+.|.-|= T Consensus 229 -FW~PMDTLRDk 239 (256) T TIGR02623 229 -FWQPMDTLRDK 239 (256) T ss_pred -CCCCCHHHHHH T ss_conf -56852225667 No 42 >TIGR01105 galF regulatory protein GalF; InterPro: IPR005774 This family of proteins, GalF, represents a non-catalytic subunit of the UTP-glucose pyrophosphorylase modulating the enzyme activity to increase the formation of UDP-glucose . ; GO: 0030234 enzyme regulator activity, 0006011 UDP-glucose metabolic process. Probab=99.97 E-value=4.4e-31 Score=215.19 Aligned_cols=232 Identities=23% Similarity=0.437 Sum_probs=185.8 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCC----------- Q ss_conf 907992488874553344389822224783747999999999789868999977765643111222222----------- Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGE----------- 69 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~----------- 69 (292) +||||+.||.|-.|.|.|+.+||.|+|+.|||||+|+++..+.+||++|++|++..+..+ ++|+.... T Consensus 4 lkaviPvaGlGmhmlPatkaiPkemlP~~dkPmiqyivdeivaaGikei~lvth~sknav-enhfdtsyele~lleqrvk 82 (297) T TIGR01105 4 LKAVIPVAGLGMHMLPATKAIPKEMLPIVDKPMIQYIVDEIVAAGIKEIVLVTHASKNAV-ENHFDTSYELESLLEQRVK 82 (297) T ss_pred CCEECCCCCCHHHHHHHHHCCCHHHCCCCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHH-HHHCCCHHHHHHHHHHHHH T ss_conf 410122112012320011102112122114327889999988603010323420103445-4430203778999988888 Q ss_pred -----------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC-------CCCCHHHHHCCCHHHHH Q ss_conf -----------221111222333322446852101112335675213214432111-------10000111000001110 Q gi|254780922|r 70 -----------KWGVQFSYIEQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGS-------DISDIFHKARARRNSAT 131 (292) Q Consensus 70 -----------~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~-------~l~~~l~~~~~~~~~~t 131 (292) +-|+.|--+.|.+|+|+||+++||++.+++.||+++++|-++.+. ++..++..+.+.+..-. T Consensus 83 rqllaevq~iCPPGv~imnvrq~~PlGlGhsilCarP~~GdnPf~vvlPd~~~d~~~adPlrynlaam~arfnetGrsqv 162 (297) T TIGR01105 83 RQLLAEVQSICPPGVTIMNVRQAQPLGLGHSILCARPVVGDNPFVVVLPDIIIDDATADPLRYNLAAMIARFNETGRSQV 162 (297) T ss_pred HHHHHHHHHCCCCCCEEEEEECCCCCCCCCHHEECCCCCCCCCEEEEECCEEECCCCCCHHHHHHHHHHHHHHCCCHHHH T ss_conf 99988765228997179863047765444011011553268875898052356177766167889999987520330233 Q ss_pred CCCCCCCCCCCCCCCCCC----CCC---CHHHCCCCCCCCC---CCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCC Q ss_conf 012222100012221222----210---0010013333322---222202455410002243220046753333333432 Q gi|254780922|r 132 VVGCHVQNPQRYGVVEVD----SSN---QAISIEEKPNNPK---SSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVN 201 (292) Q Consensus 132 i~~~~v~~p~~yGvv~~d----~~~---~i~~i~EKP~~~~---sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~ 201 (292) +.-.-..|-+.|.|++.. .+| ++++|+|||.+|. |++...|.|+++.+++.-++...|++-|..++||++ T Consensus 163 lakrm~GdlseysviqtkePl~~eG~v~rivefiekPd~Pqtldsd~mavGryvlsadiW~elert~PGaWGriqltdai 242 (297) T TIGR01105 163 LAKRMPGDLSEYSVIQTKEPLDREGKVSRIVEFIEKPDQPQTLDSDLMAVGRYVLSADIWAELERTEPGAWGRIQLTDAI 242 (297) T ss_pred HHHHCCCCCCHHEEEECCCCHHHCCCEEEEEHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 22103665200001102340212474122100020889875213467766346543444555431689863101477899 Q ss_pred CHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHH Q ss_conf 0000013204631354125632784577999999 Q gi|254780922|r 202 SYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAV 235 (292) Q Consensus 202 ~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~ 235 (292) ..+.++.....-.+. |. -+|||..-+|++|-. T Consensus 243 aelakkqsvdamlmt-G~-sydCGkkmGymqafv 274 (297) T TIGR01105 243 AELAKKQSVDAMLMT-GD-SYDCGKKMGYMQAFV 274 (297) T ss_pred HHHHHHHHHHHHEEC-CC-CCHHHHHHHHHHHHH T ss_conf 988743102231011-67-410012446889999 No 43 >KOG1460 consensus Probab=99.96 E-value=6.6e-31 Score=214.08 Aligned_cols=239 Identities=24% Similarity=0.401 Sum_probs=185.4 Q ss_pred EEEEECCC--CCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHH-CCCCEEEEEECCCCCCCCCCCCCC-CCCCCCCCCC Q ss_conf 07992488--8745533443898222247837479999999997-898689999777656431112222-2222111122 Q gi|254780922|r 2 KGIVLAGG--SGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMD-AGIREILIISTPRDLPVLKEFLGS-GEKWGVQFSY 77 (292) Q Consensus 2 kaiIlAaG--~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~-~Gi~~I~ii~~~~~~~~~~~~l~~-g~~~g~~i~y 77 (292) |||||.|| +|||+||++..+||||.||+|.|||+|+|+.|.. .|..+|+++-- ...+.|..++.. ..++.+.+.| T Consensus 4 ~AVIlVGGP~kGTRFRPLSf~vPKPLfpiaG~pmI~Hhi~ac~qi~~l~eI~LvGF-y~e~~f~~fis~~~~e~~~pvrY 82 (407) T KOG1460 4 KAVILVGGPQKGTRFRPLSFNVPKPLFPIAGVPMIHHHISACKQISGLAEILLVGF-YEERVFTDFISAIQQEFKVPVRY 82 (407) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCHHHEEEEEC-CCCHHHHHHHHHHHHHCCCCHHH T ss_conf 79999558888851034435899875431885140406898751656023367740-45058999999998644652554 Q ss_pred CCCCCCCCCCCCCCCCCCCCC-CCC-CCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCC--CCCCCCCCCCCC-CCC Q ss_conf 233332244685210111233-567-521321443211110000111000001110012222--100012221222-210 Q gi|254780922|r 78 IEQLVPAGLAQSYILGAEFIG-DSS-SVLILGDNVFYGSDISDIFHKARARRNSATVVGCHV--QNPQRYGVVEVD-SSN 152 (292) Q Consensus 78 ~~q~~p~Gta~Ai~~a~~~i~-~~~-~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v--~~p~~yGvv~~d-~~~ 152 (292) ..|..|+||||+++.-++.|- +++ .+.+++.|+.++..+.+|++.|...+.-.++++.++ ++.+.||-+.-| ..+ T Consensus 83 L~E~~plGtaGgLyhFrdqIl~g~ps~vFvlnaDVCcsfPl~~ml~ahr~~g~~~tll~tkvs~e~asnfG~lV~dP~t~ 162 (407) T KOG1460 83 LREDNPLGTAGGLYHFRDQILAGSPSAVFVLNADVCCSFPLQDMLEAHRRYGGIGTLLVTKVSREQASNFGCLVEDPSTG 162 (407) T ss_pred HCCCCCCCCCCCEEEHHHHHHCCCCCEEEEEECCEECCCCHHHHHHHHHHCCCCEEEEEEEECHHHHHCCCEEEECCCCC T ss_conf 34578877665322444577469986289970550158747999999863388459999975576751057154068767 Q ss_pred CHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC-----------------CCCCCCCCC-CCCCCHHCCCCCEEEEE Q ss_conf 001001333332222220245541000224322004-----------------675333333-34320000013204631 Q gi|254780922|r 153 QAISIEEKPNNPKSSFAVTGIYFYDQEVVNIARNIR-----------------PSARGELEI-TDVNSYYLDKGLLAVEF 214 (292) Q Consensus 153 ~i~~i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~-----------------~s~rgE~ei-tD~~~~~l~~~~~~~~~ 214 (292) ++...+|||....|+.+++|+|+|++++|+.+++.- +...+-+-+ +|++..+..++.+ +.+ T Consensus 163 evlHYveKPsTfvSd~InCGvYlF~~eif~~i~~v~~q~~~~~~~~~~~~~l~~g~~d~irLeqDvlspLag~k~l-Y~y 241 (407) T KOG1460 163 EVLHYVEKPSTFVSDIINCGVYLFTPEIFNAIAEVYRQRQDLLEVEKDLPLLQPGPADFIRLEQDVLSPLAGSKQL-YAY 241 (407) T ss_pred CEEEEECCCCHHHHCCCCEEEEEECHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEECHHHHHHCCCCCE-EEE T ss_conf 3578505862353335011489946899788999999877655454216344877666078430010011277724-787 Q ss_pred CCCCEEEECCCCHHHHHHHHH-HHHHHHHH Q ss_conf 354125632784577999999-99998888 Q gi|254780922|r 215 LREGSAWFDAGTPESLLDTAV-FVRNIENR 243 (292) Q Consensus 215 ~~~g~~W~D~Gt~e~l~~As~-~i~~~e~~ 243 (292) ...+ .|--+-|+-+-+.||. |++..+.+ T Consensus 242 ~t~~-fW~QiKtagsal~as~lYLs~yk~t 270 (407) T KOG1460 242 ETTD-FWSQIKTAGSALYASRLYLSQYKRT 270 (407) T ss_pred ECCC-HHHHHCCCCCEEEHHHHHHHHHHHC T ss_conf 0610-8887346621333326688887630 No 44 >COG1213 Predicted sugar nucleotidyltransferases [Cell envelope biogenesis, outer membrane] Probab=99.94 E-value=5.9e-27 Score=189.30 Aligned_cols=218 Identities=20% Similarity=0.364 Sum_probs=144.9 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |||||||||+||||+| .+||+|+.|+|+|+|+|+|++|+.+||+++++|++....+.++.+++. +.++..+++- T Consensus 4 ~kavILAAG~GsRlg~---~~PK~Lvev~gr~ii~~~i~~L~~~gi~e~vvV~~g~~~~lve~~l~~---~~~~~~iv~N 77 (239) T COG1213 4 MKAVILAAGFGSRLGP---DIPKALVEVGGREIIYRTIENLAKAGITEFVVVTNGYRADLVEEFLKK---YPFNAKIVIN 77 (239) T ss_pred EEEEEEECCCCCCCCC---CCCCHHHHCCCEEEHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHC---CCCCEEEEEC T ss_conf 0478770344553479---997144315886738999999987698639999623027899999851---8961699968 Q ss_pred CCC--CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCC-----CCCCCCCCCCCCCCCCCC Q ss_conf 332--2446852101112335675213214432111100001110000011100122-----221000122212222100 Q gi|254780922|r 81 LVP--AGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGC-----HVQNPQRYGVVEVDSSNQ 153 (292) Q Consensus 81 ~~p--~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~-----~v~~p~~yGvv~~d~~~~ 153 (292) ..+ .+++.++++|++++.. .|++++||.+ |...+.+.+. +......++.. .+++.+. + .+++|+ T Consensus 78 ~~y~ktN~~~Sl~~akd~~~~-~fii~~sD~v-ye~~~~e~l~---~a~~~~li~d~~~~~~~~~ea~k---v-~~e~G~ 148 (239) T COG1213 78 SDYEKTNTGYSLLLAKDYMDG-RFILVMSDHV-YEPSILERLL---EAPGEGLIVDRRPRYVGVEEATK---V-KDEGGR 148 (239) T ss_pred CCCCCCCCEEEEEEEHHHHCC-CEEEEECCEE-ECHHHHHHHH---HCCCCCEEEECCCCCCCCCCEEE---E-EECCCE T ss_conf 985557764677454346338-5799957875-0689999998---47577678952333355675069---9-844978 Q ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHC-CCCCEEEEECCCCEEEECCCCHHHHHH Q ss_conf 0100133333222222024554100022432200467533333334320000-013204631354125632784577999 Q gi|254780922|r 154 AISIEEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYL-DKGLLAVEFLREGSAWFDAGTPESLLD 232 (292) Q Consensus 154 i~~i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l-~~~~~~~~~~~~g~~W~D~Gt~e~l~~ 232 (292) +..|--+-. .-.-..+|+..|+.++|+.+-++. ..+.+++..++.+.+. ......+. .+|..|+|+.||++|.+ T Consensus 149 i~~igK~l~--e~~~e~iGi~~l~~~i~~~~~~~~-~e~~~~~~~~~~~~~~~~~~~~di~--~~g~~w~EVDtpeDl~~ 223 (239) T COG1213 149 IVEIGKDLT--EYDGEDIGIFILSDSIFEDTYELL-VERSEYDYREVEKEAGLPFTEVDIH--VDGLFWMEVDTPEDLER 223 (239) T ss_pred EEHHCCCCC--CCCCEEEEEEEECHHHHHHHHHHH-HHHHHHHHHHHHHHHCCCEEEEECC--CCCCEEEECCCHHHHHH T ss_conf 762107855--136216535883457878789887-6556677999999858761785102--36852486388899999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254780922|r 233 TAVFVR 238 (292) Q Consensus 233 As~~i~ 238 (292) |..++. T Consensus 224 ar~~~~ 229 (239) T COG1213 224 ARKYLV 229 (239) T ss_pred HHHHHH T ss_conf 999999 No 45 >cd02509 GDP-M1P_Guanylyltransferase GDP-M1P_Guanylyltransferase catalyzes the formation of GDP-Mannose. GDP-mannose-1-phosphate guanylyltransferase, also called GDP-mannose pyrophosphorylase (GDP-MP), catalyzes the formation of GDP-Mannose from mannose-1-phosphate and GTP. Mannose is a key monosaccharide for glycosylation of proteins and lipids. GDP-Mannose is the activated donor for mannosylation of various biomolecules. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase and mannose-1-phosphate guanylyltransferase. This CD covers the N-terminal GDP-mannose-1-phosphate guanylyltransferase domain, whereas the isomerase function is located at the C-terminal half. GDP-MP is a member of the nucleotidyltransferase family of enzymes. Probab=99.94 E-value=1.4e-27 Score=193.17 Aligned_cols=226 Identities=20% Similarity=0.307 Sum_probs=148.8 Q ss_pred CEEEEECCCCCCCCCHHH-CCCCCCEEEEC-CEEHHHHHHHHHHH-CCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 907992488874553344-38982222478-37479999999997-8986899997776564311122222222111122 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLT-DLLSKQMLPIY-NKPMIYYPVSTLMD-AGIREILIISTPRDLPVLKEFLGSGEKWGVQFSY 77 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T-~~~pK~Llpi~-gkplI~~~l~~l~~-~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y 77 (292) |.+||||||.||||+|++ +..|||++++. +++|++.+++.+.. .+.++|+|+|+..+...+++.+.+ ...+... T Consensus 1 i~~VIlaGG~GtRLWPlSr~~~PKQf~~~~~~~sLlq~T~~R~~~~~~~~~i~IvTn~~~~~~v~~ql~~---~~~~~~i 77 (274) T cd02509 1 IYPVILAGGSGTRLWPLSRESYPKQFLKLFGDKSLLQQTLDRLKGLVPPDRILVVTNEEYRFLVREQLPE---GLPEENI 77 (274) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHHHH---CCCCCCE T ss_conf 9899947986044687558889977521799998999999987468986780999572179999999884---3986648 Q ss_pred CCCCCCCCCCCCCCCCCCCCC---CCCCCEEECCCCCC-CC-CCCCHHHHHCCC--HHHHHCCCCCCCCC-CCCCCCCCC Q ss_conf 233332244685210111233---56752132144321-11-100001110000--01110012222100-012221222 Q gi|254780922|r 78 IEQLVPAGLAQSYILGAEFIG---DSSSVLILGDNVFY-GS-DISDIFHKARAR--RNSATVVGCHVQNP-QRYGVVEVD 149 (292) Q Consensus 78 ~~q~~p~Gta~Ai~~a~~~i~---~~~~~lilgDni~~-~~-~l~~~l~~~~~~--~~~~ti~~~~v~~p-~~yGvv~~d 149 (292) +.|+.+++||.|+..|..++. .+..+++++.|++. +. .|.+.++.+.+. .....+++.+...| +.||.++.+ T Consensus 78 i~EP~~rnTApAI~laa~~~~~~~~d~~l~v~PsDH~I~d~~~F~~~i~~a~~~a~~~~ivt~GI~P~~P~TgYGYI~~~ 157 (274) T cd02509 78 ILEPEGRNTAPAIALAALYLAKRDPDAVLLVLPSDHLIEDVEAFLKAVKKAVEAAEEGYLVTFGIKPTRPETGYGYIEAG 157 (274) T ss_pred EECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCCEEEEEC T ss_conf 96768889799999999999975999779998474020088999999999999973398899742137789886289857 Q ss_pred CC-----CCHHHCCCCCCCCC-------CC-CCHHHHHHHHHHHH-H-----------HHHHCCCCC--CCCCCC----- Q ss_conf 21-----00010013333322-------22-22024554100022-4-----------322004675--333333----- Q gi|254780922|r 150 SS-----NQAISIEEKPNNPK-------SS-FAVTGIYFYDQEVV-N-----------IARNIRPSA--RGELEI----- 197 (292) Q Consensus 150 ~~-----~~i~~i~EKP~~~~-------sn-~a~~GiY~f~~~i~-~-----------~~~~l~~s~--rgE~ei----- 197 (292) +. .+|.+|+|||.... .| +=|+|+|+|+.+.+ + .+.+...+. .++.++ T Consensus 158 ~~~~~~~~~V~~F~EKP~~~~A~~~i~sg~y~WNsGiFv~~~~~~l~~~~~~~p~i~~~~~~~~~~~~~~~~~~~~~~~f 237 (274) T cd02509 158 EKLGGGVYRVKRFVEKPDLETAKEYLESGNYLWNSGIFLFRAKTFLEELKKHAPDIYEALEKALAAAGTDDFLRLLEEAF 237 (274) T ss_pred CCCCCCCEEEEEEEECCCHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 53577742578876479889999998759903300232200999999999879999999999997567873020568898 Q ss_pred ---CC-CCC-HHCCCCCEEEEECCCCEEEECCCCHHHH Q ss_conf ---34-320-0000132046313541256327845779 Q gi|254780922|r 198 ---TD-VNS-YYLDKGLLAVEFLREGSAWFDAGTPESL 230 (292) Q Consensus 198 ---tD-~~~-~~l~~~~~~~~~~~~g~~W~D~Gt~e~l 230 (292) .. -+. ..|++- -.+..+.-.+.|.|+||.+++ T Consensus 238 ~~~~~iSIDyavmEk~-~~~~vv~~~~~WsDlGsW~sl 274 (274) T cd02509 238 AKIPSISIDYAVMEKT-KKVAVVPADFGWSDLGSWDAL 274 (274) T ss_pred HHCCCCCCCHHHHEEC-CCCEEEEECCCCCCCCCCCCC T ss_conf 5287756754985407-884599838985876786679 No 46 >COG4750 LicC CTP:phosphocholine cytidylyltransferase involved in choline phosphorylation for cell surface LPS epitopes [Cell envelope biogenesis, outer membrane] Probab=99.87 E-value=6.3e-24 Score=170.28 Aligned_cols=170 Identities=22% Similarity=0.398 Sum_probs=120.5 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |+|||||||.|||+.|+|..+||+|+.|.|+|||++.++.|..+||.+|+||++|.+ ++|+. +. .++++.+.|-.. T Consensus 1 ~nAIIlAAG~gsR~~plT~~tpK~LlkV~g~plIErqI~~L~e~gI~dI~IVvGYlk-E~FeY-Lk--dKy~vtLvyN~k 76 (231) T COG4750 1 MNAIILAAGLGSRFVPLTQSTPKSLLKVNGEPLIERQIEQLREAGIDDITIVVGYLK-EQFEY-LK--DKYDVTLVYNPK 76 (231) T ss_pred CCEEEEECCCCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHCCCCEEEEEEEEHH-HHHHH-HH--HHCCEEEEECCH T ss_conf 961899425566533131028767887358550999999999779861899963148-99999-98--715749995740 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCC-CCCCCCCCCCCCCCHHHCCC Q ss_conf 33224468521011123356752132144321111000011100000111001222210-00122212222100010013 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQN-PQRYGVVEVDSSNQAISIEE 159 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~-p~~yGvv~~d~~~~i~~i~E 159 (292) -..--.--++..|++++++. -++-+||.+.. + ++..+..+.. -+++..+. ...| .+..+.+++|+.++ T Consensus 77 Y~~yNn~ySlyla~d~l~nt--YiidsDnyl~k-N---if~~~~~~S~---Yfav~~~~~tnEw-~l~~~~~~ki~~v~- 145 (231) T COG4750 77 YREYNNIYSLYLARDFLNNT--YIIDSDNYLTK-N---IFLTKESHSK---YFAVYRSGKTNEW-LLIYNSDGKITRVD- 145 (231) T ss_pred HHHHHHHHHHHHHHHHHCCC--EEECCCHHHHH-H---HHHCCCCCCE---EEEEEECCCCCEE-EEEECCCCCEEEEE- T ss_conf 77663099999999985256--79505047666-5---5525735212---7899963877425-79976998489999- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 3333222222024554100022432200 Q gi|254780922|r 160 KPNNPKSSFAVTGIYFYDQEVVNIARNI 187 (292) Q Consensus 160 KP~~~~sn~a~~GiY~f~~~i~~~~~~l 187 (292) -...++++.+|+-+|+...-+.++++ T Consensus 146 --Igg~~~~imsG~sff~~~~~~ki~~l 171 (231) T COG4750 146 --IGGLNGYIMSGISFFDAQFSNKIKKL 171 (231) T ss_pred --ECCCCCCEEEEEEEECCHHHHHHHHH T ss_conf --66855625751144234247999999 No 47 >COG0836 {ManC} Mannose-1-phosphate guanylyltransferase [Cell envelope biogenesis, outer membrane] Probab=99.84 E-value=9.5e-22 Score=156.62 Aligned_cols=232 Identities=22% Similarity=0.304 Sum_probs=145.1 Q ss_pred CEEEEECCCCCCCCCHHHCC-CCCCEEEEC-CEEHHHHHHHHHHH-CCCCEEEEEECCCCCCCCCCCCCCCCCCCCCC-- Q ss_conf 90799248887455334438-982222478-37479999999997-89868999977765643111222222221111-- Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDL-LSKQMLPIY-NKPMIYYPVSTLMD-AGIREILIISTPRDLPVLKEFLGSGEKWGVQF-- 75 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~-~pK~Llpi~-gkplI~~~l~~l~~-~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i-- 75 (292) |+.||||||.||||+|++.. .|||++++. |+.|++..+..+.. .++.++++|||..+.....+-+ .+.+.+. T Consensus 2 ~~pvIlaGG~GsRLWPLSR~~~PKQFl~L~~~~Sllq~T~~R~~~l~~~~~~~vVtne~~~f~v~eql---~e~~~~~~~ 78 (333) T COG0836 2 MIPVILAGGSGSRLWPLSRKDYPKQFLKLFGDLSLLQQTVKRLAFLGDIEEPLVVTNEKYRFIVKEQL---PEIDIENAA 78 (333) T ss_pred CEEEEEECCCCCCCCCCCCCCCCCCCEEECCCCCHHHHHHHHHHHCCCCCCEEEEECHHHHHHHHHHH---HHHHHCCCC T ss_conf 12599707876536876754377230341799718999999875147856609995779999999986---641012566 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCC---CCCCCEEECCCCCCCC--CCCCHHHHHCCCHH--HHHCCCCCCCCC-CCCCCCC Q ss_conf 22233332244685210111233---5675213214432111--10000111000001--110012222100-0122212 Q gi|254780922|r 76 SYIEQLVPAGLAQSYILGAEFIG---DSSSVLILGDNVFYGS--DISDIFHKARARRN--SATVVGCHVQNP-QRYGVVE 147 (292) Q Consensus 76 ~y~~q~~p~Gta~Ai~~a~~~i~---~~~~~lilgDni~~~~--~l~~~l~~~~~~~~--~~ti~~~~v~~p-~~yGvv~ 147 (292) ..+.|+.-+-||-|+..|.-.+. .+..+++++.|+.... .|.+.+....+... ....|+.....| +.||.++ T Consensus 79 ~illEP~gRnTApAIA~aa~~~~~~~~d~~~lVlpsDH~I~d~~af~~av~~A~~~A~~g~lVTfGI~Pt~PeTGYGYIe 158 (333) T COG0836 79 GIILEPEGRNTAPAIALAALSATAEGGDALVLVLPSDHVIADEEAFLNAVKKAEKAAEEGGIVTFGIPPTRPETGYGYIE 158 (333) T ss_pred CEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCEECCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCEEE T ss_conf 06854578774999999999998758983899944755655689999999999999971987998327898866741463 Q ss_pred CCC------CCCHHHCCCCCCC--CCC------CCCHHHHHHHHHHHH-HHHHHCCCC-----------C---------C Q ss_conf 222------1000100133333--222------222024554100022-432200467-----------5---------3 Q gi|254780922|r 148 VDS------SNQAISIEEKPNN--PKS------SFAVTGIYFYDQEVV-NIARNIRPS-----------A---------R 192 (292) Q Consensus 148 ~d~------~~~i~~i~EKP~~--~~s------n~a~~GiY~f~~~i~-~~~~~l~~s-----------~---------r 192 (292) .-+ -.+|..|+|||.. +++ -+=|.|+++|....+ +-+++..|+ . . T Consensus 159 ~G~~~~~~~~~~V~~FvEKPd~etA~~yv~sG~y~WNSGmF~Fra~~~l~e~~~~~P~i~~~~~~~~~~~~d~~~~~l~~ 238 (333) T COG0836 159 TGESIAENGVYKVDRFVEKPDLETAKKYVESGEYLWNSGMFLFRASVFLEELKKHQPDIYCAAEKAFEAAVDENSVRLDN 238 (333) T ss_pred CCCCCCCCCCEEEEEEEECCCHHHHHHHHHCCCEEEECCCEEEEHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCHHHCCH T ss_conf 28624567825755531388789999999739657632316888899999998609099999999874403342210048 Q ss_pred CCCCC-CC-CCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHH Q ss_conf 33333-34-320000013204631354125632784577999999 Q gi|254780922|r 193 GELEI-TD-VNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAV 235 (292) Q Consensus 193 gE~ei-tD-~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~ 235 (292) .+|+- .+ -+...+-+..-.+....-.+.|-|+|+..++++.+. T Consensus 239 e~f~~~p~iSIDYAiMEkt~~~aVVp~~f~WsDlGsW~Al~~~~~ 283 (333) T COG0836 239 EAYEEIPAISIDYAIMEKTSKAAVVPADFGWSDLGSWHALWEVLD 283 (333) T ss_pred HHHHHCCCCCCHHHHHHHHCCEEEEECCCCCCCCCCHHHHHHHHH T ss_conf 887448312203787741166489956788565567899998751 No 48 >TIGR02092 glgD glucose-1-phosphate adenylyltransferase, GlgD subunit; InterPro: IPR011832 This family is GlgD, an apparent regulatory protein that appears in an alpha2/beta2 heterotetramer with GlgC (glucose-1-phosphate adenylyltransferase, IPR011831 from INTERPRO) in a subset of bacteria that use GlgC for glycogen biosynthesis.; GO: 0005978 glycogen biosynthetic process. Probab=99.82 E-value=3.9e-21 Score=152.76 Aligned_cols=240 Identities=19% Similarity=0.311 Sum_probs=176.1 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCE-EHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCC----- Q ss_conf 9079924888745533443898222247837-4799999999978986899997776564311122222222111----- Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNK-PMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQ----- 74 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gk-plI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~----- 74 (292) |=|||=-.-.-..|.|||..+|=++||++|| +||||+||++++|||+.|.+.+........-+|||.|++|+++ T Consensus 3 ~~aIi~l~~~~~~l~~LT~~Rp~A~~PfgGrYRlIDF~LSn~~NAgI~~v~~~~~~~~~~Sl~DHLg~G~eWdL~r~~~g 82 (383) T TIGR02092 3 MSAIINLTESKKNLSPLTKVRPVASLPFGGRYRLIDFPLSNMVNAGIRNVAVFFKNKERRSLFDHLGSGREWDLNRKRDG 82 (383) T ss_pred EEEEEECCCCCHHCCCHHHHCCCEEECCCCEEEEEECCHHCCCCCCEEEEEEECCCCCCCHHHHHCCCCCCCCCCCCCCC T ss_conf 58997357882011314550870332358647788301002310360022465266665315221348688361102575 Q ss_pred -CCCC-CCCCC--CCCC-------CCCCCCCCCCCC--CCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCC- Q ss_conf -1222-33332--2446-------852101112335--67521321443211110000111000001110012222100- Q gi|254780922|r 75 -FSYI-EQLVP--AGLA-------QSYILGAEFIGD--SSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNP- 140 (292) Q Consensus 75 -i~y~-~q~~p--~Gta-------~Ai~~a~~~i~~--~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p- 140 (292) +-|. .+..+ .-.. +-+....+|+.. .+.++++|...+++.||..+++.|.+.+.++|++-.+|++. T Consensus 83 lF~~pP~~~~~~~~~~~~kaall~~~~~~~l~fl~rS~~~y~V~~~s~~v~NiDl~~~l~~H~~~~~~iT~VYKkv~~~~ 162 (383) T TIGR02092 83 LFLFPPKYNDRDDVSEGGKAALLKRYFSNNLEFLKRSTSEYVVVLNSHMVCNIDLKAVLKRHEETGKDITVVYKKVKKAD 162 (383) T ss_pred EEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEECCCEEEECCHHHHHHHHHHCCCCEEEEECCCCCCC T ss_conf 27706876864323456337788999988767875089971898547457830289999998744896689973478467 Q ss_pred -C-CC-CCCCCCCCCCHHHCC----C--CCCCCCCCCCHHHHHHHHHHHH-HHHHHCCCCCCCCCC-CCCCCCHHCCCCC Q ss_conf -0-12-221222210001001----3--3333222222024554100022-432200467533333-3343200000132 Q gi|254780922|r 141 -Q-RY-GVVEVDSSNQAISIE----E--KPNNPKSSFAVTGIYFYDQEVV-NIARNIRPSARGELE-ITDVNSYYLDKGL 209 (292) Q Consensus 141 -~-~y-Gvv~~d~~~~i~~i~----E--KP~~~~sn~a~~GiY~f~~~i~-~~~~~l~~s~rgE~e-itD~~~~~l~~~~ 209 (292) + .+ -++++|+.|+|.++. - ++.+..+| .++-+|+.+.+.+ +++... ..+|+.. +.++++.++.+-. T Consensus 163 ~~~~~c~~L~~De~g~v~s~~Pms~~~~~~~~e~~n-~Sl~~yi~~t~lLiell~~~--~~~g~~~~l~~~~~~~l~~~~ 239 (383) T TIGR02092 163 LSEEDCTILKFDEKGEVKSIGPMSVAKNLNKEEEEN-LSLDIYILSTELLIELLYEC--IQKGKKTSLEELIRENLKELN 239 (383) T ss_pred CCCCCCCEEEECCCCCEEEEECEEEECCCCCHHHHH-HHHHHEEECHHHHHHHHHHH--HHCCCHHHHHHHHHHHHHHCC T ss_conf 762357278863678568863200002676134443-20100242668999999987--631556799999999998647 Q ss_pred E-EEEECCCCEEEECCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 0-463135412563278457799999999998888098 Q gi|254780922|r 210 L-AVEFLREGSAWFDAGTPESLLDTAVFVRNIENRLGL 246 (292) Q Consensus 210 ~-~~~~~~~g~~W~D~Gt~e~l~~As~~i~~~e~~~g~ 246 (292) . .++| .| |--=+-+.++|++|||.+..-+..+.+ T Consensus 240 ~~aYEY--~G-Yl~~I~s~~sYY~aNM~lL~~~~f~~L 274 (383) T TIGR02092 240 INAYEY--KG-YLANINSVKSYYKANMDLLEPKIFQSL 274 (383) T ss_pred CCCCEE--EE-EEEEECCHHHHHHHHHHHHHHHHHHHH T ss_conf 543303--42-211110167898872886316876887 No 49 >KOG1462 consensus Probab=99.79 E-value=6.1e-20 Score=145.28 Aligned_cols=246 Identities=24% Similarity=0.343 Sum_probs=156.3 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |+||++|||-||||--+|...||+||||+++|||.|+|..|..+|+++|.|++.......++.-+++....-.+-.|+.- T Consensus 10 fqavV~a~~ggt~~p~~~~~~pKaLLPIgn~PMi~YpL~~L~~~gfteiiVv~~e~e~~~i~~al~~~~~l~~~~~~v~i 89 (433) T KOG1462 10 FQAVVLAGGGGTRMPEVTSRLPKALLPIGNKPMILYPLNSLEQAGFTEIIVVVNEDEKLDIESALGSNIDLKKRPDYVEI 89 (433) T ss_pred HHHHEEECCCCEECHHHHHHCCHHHCCCCCCCEEEEEHHHHHHCCCEEEEEEECHHHHHHHHHHHHCCCCCCCCCCEEEE T ss_conf 65532525896045444523632112458964364126689756973899995378899999987507754321327983 Q ss_pred CC----CCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCC----CCCCC-------CCCC Q ss_conf 33----224468521011123356752132144321111000011100000111001222----21000-------1222 Q gi|254780922|r 81 LV----PAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCH----VQNPQ-------RYGV 145 (292) Q Consensus 81 ~~----p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~----v~~p~-------~yGv 145 (292) +. -.|||+++..--..++.++| |+++.|++.+..+..++.++...++...++... +..|. .+.+ T Consensus 90 p~~~~~d~gtadsLr~Iy~kikS~Df-lvlsCD~Vtdv~l~~lvd~FR~~d~slamli~~~~s~~~~pgqk~k~k~~~d~ 168 (433) T KOG1462 90 PTDDNSDFGTADSLRYIYSKIKSEDF-LVLSCDFVTDVPLQPLVDKFRATDASLAMLIGNALSEVPIPGQKGKKKQARDV 168 (433) T ss_pred ECCCCCCCCCHHHHHHHHHHHCCCCE-EEEECCCCCCCCCHHHHHHHHCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCE T ss_conf 04563344778888654554146877-99965302688808889887524765867752645445366734455440213 Q ss_pred CCCCCCCCHHHC------CCCCC---------CC----CCCCCHHHHHHHHHHHHHHHHHC--CCCCCCCCCCCCCCCHH Q ss_conf 122221000100------13333---------32----22222024554100022432200--46753333333432000 Q gi|254780922|r 146 VEVDSSNQAISI------EEKPN---------NP----KSSFAVTGIYFYDQEVVNIARNI--RPSARGELEITDVNSYY 204 (292) Q Consensus 146 v~~d~~~~i~~i------~EKP~---------~~----~sn~a~~GiY~f~~~i~~~~~~l--~~s~rgE~eitD~~~~~ 204 (292) +-++++..=.-+ .|+|- .| -+++...-+|+|...+++++..- ..|-+++| +.++++.. T Consensus 169 igi~e~t~rl~y~~~~~d~~~~l~i~~slL~~~prltl~t~L~dahiY~~k~~v~d~l~~~~sisSfk~~f-~P~lvkkQ 247 (433) T KOG1462 169 IGINEDTERLAYSSDSADEEEPLVIRKSLLWNHPRLTLTTKLVDAHIYVFKHWVIDLLSEKESISSFKADF-LPYLVKKQ 247 (433) T ss_pred EEECCCCCEEEEEECCCCCCCCEEHHHHHHHCCCCEEEECCCCCEEEEEEHHHHHHHHHCCCCCEEECCCC-CCHHHHHH T ss_conf 44415663368840577677743211345521886488615431466423899999973178620200145-53666656 Q ss_pred C------C---------------CC--------CEEEEECCCCEEEECCCCHHHHHHHH--HHHHHHHHHCCCEE Q ss_conf 0------0---------------13--------20463135412563278457799999--99999888809866 Q gi|254780922|r 205 L------D---------------KG--------LLAVEFLREGSAWFDAGTPESLLDTA--VFVRNIENRLGLYV 248 (292) Q Consensus 205 l------~---------------~~--------~~~~~~~~~g~~W~D~Gt~e~l~~As--~~i~~~e~~~g~~i 248 (292) - + .+ +..+....++....-+.|.-.|+++| .++.++...++... T Consensus 248 ~q~~~~~~~~~~~~l~t~~~~~~d~~~~~~d~ik~y~~~~p~e~~~~raNtL~~y~eiN~~k~~~~l~~e~~~~k 322 (433) T KOG1462 248 FQKNPPLKKNETSILPTPNLNNPDGIHSPDDRIKCYAYILPTESLFVRANTLLSYMEINRDKKLKKLCSEAKFVK 322 (433) T ss_pred HHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCEEEECCHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 530777565300136775556853234744511356899047653587533677776407889987364456641 No 50 >pfam01128 IspD Uncharacterized protein family UPF0007. Probab=99.62 E-value=7.7e-16 Score=119.57 Aligned_cols=210 Identities=17% Similarity=0.265 Sum_probs=126.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) |||||||.||||. ..+||++++++|||+++|.|+.+.+.. +++|+|++++...+.++.+..+ .++.++.-. T Consensus 3 aIIlAaG~G~R~~---~~~pKQf~~l~gkpil~~sl~~f~~~~~i~~Ivvv~~~~~~~~~~~~~~~-----~~i~~v~GG 74 (221) T pfam01128 3 AVIPAAGSGKRMG---AGVPKQFLQLLGQPLLEHTVDAFLASPVVDRIVVAVSPDDTPEFRQLLGD-----PSIQLVAGG 74 (221) T ss_pred EEEECCEECCCCC---CCCCCCCCEECCEEHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHCCC-----CCEEEECCC T ss_conf 9993581061269---58986536689888999999999638888979999677889999986079-----987996799 Q ss_pred CCCCCCCCCCCCCCCCCC-CCCCEEE-CCCCCCCCC-CCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCC Q ss_conf 322446852101112335-6752132-144321111-0000111000001110012222100012221222210001001 Q gi|254780922|r 82 VPAGLAQSYILGAEFIGD-SSSVLIL-GDNVFYGSD-ISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIE 158 (292) Q Consensus 82 ~p~Gta~Ai~~a~~~i~~-~~~~lil-gDni~~~~~-l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~ 158 (292) . .-.++++.|.+.+.. .++++|. +.--|...+ +.+++... ....++.+.+.++.|+-+ ..++++.+.+.. T Consensus 75 ~--tR~~SV~ngL~~l~~~~~~VlIHDaaRP~v~~~~i~~li~~~-~~~~~~~i~~~p~~DTik----~v~~~~~i~~t~ 147 (221) T pfam01128 75 D--TRQDSVLNGLKALAGTAKFVLVHDGARPCLPHADLARLLAAL-ETGTQGAILALPVTDTIK----RVEADGVVAGTP 147 (221) T ss_pred C--CHHHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHHH-HHHCCCCEEEECCCCCEE----EECCCCCEECCC T ss_conf 6--389999999998358999899983656679999999999998-600486135412466435----753786044035 Q ss_pred CCCCCCCCCCCHHHHH-HHHHHHHHH--H-HHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHH Q ss_conf 3333322222202455-410002243--2-20046753333333432000001320463135412563278457799999 Q gi|254780922|r 159 EKPNNPKSSFAVTGIY-FYDQEVVNI--A-RNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTA 234 (292) Q Consensus 159 EKP~~~~sn~a~~GiY-~f~~~i~~~--~-~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As 234 (292) + +++ +| .-+|++|.+ + +.........++.||-...+...|.....+.++.. =+=+-||+||.-|. T Consensus 148 d-----R~~-----l~~~QTPQ~F~~~~L~~a~~~~~~~~~~~TDd~sl~~~~g~~v~~v~G~~~-N~KIT~~eDl~~ae 216 (221) T pfam01128 148 D-----RSG-----LWAAQTPQGFRVDLLLAAHQRGDQPGAEITDDASLVEHAGGSVQVVPGRPD-NLKITTPEDLALAE 216 (221) T ss_pred C-----CHH-----HEECCCCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCEEEEECCCC-CCCCCCHHHHHHHH T ss_conf 6-----421-----011007620069999999998886299976299999977997799965842-38878999999999 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780922|r 235 VFVR 238 (292) Q Consensus 235 ~~i~ 238 (292) .+++ T Consensus 217 ~lL~ 220 (221) T pfam01128 217 AILR 220 (221) T ss_pred HHHC T ss_conf 9967 No 51 >TIGR03310 matur_ygfJ molybdenum hydroxylase accessory protein, YgfJ family. Members of this protein family are probable accessory proteins for the biosynthesis of enzymes related to xanthine dehydrogenase. Comparative genomics suggests a role in the maturation of selenium-dependent molybdenum hydroxylases, although a tenuous alternative hypothesis is a role for this protein (with a requirement for SelD, the selenium donor protein in the selenocysteine and selenouridine biosynthesis pathways) metabolizing a selenium-containing substrate such as selenate. Probab=99.58 E-value=4.1e-15 Score=115.00 Aligned_cols=184 Identities=18% Similarity=0.266 Sum_probs=112.2 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC- Q ss_conf 0799248887455334438982222478374799999999978986899997776564311122222222111122233- Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ- 80 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q- 80 (292) .|||||||.++||. .||.|+|++|+||+.|+++.+..+++.++++|+++...+.+..+. ...++.+ +.. T Consensus 1 dAIILAaG~S~RmG-----~~K~Ll~~~g~~ll~~~i~~~~~~~~~~vivv~~~~~~~~~~~~~---~~~~~~~--v~n~ 70 (188) T TIGR03310 1 DAIILAAGLSSRMG-----QNKLLLPYKGKTILEHVVDNALRLFFDEVILVLGHEADELVALLA---NHSNITL--VHNP 70 (188) T ss_pred CEEEECCCCCCCCC-----CCCCCCEECCCCHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHC---CCCCCCE--ECCC T ss_conf 96998898766598-----890068889832999999999856999669982783789999862---6689626--5486 Q ss_pred CCCCCCCCCCCCCCCCCCCCC-CCEEECCCCCCCC-CCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCC Q ss_conf 332244685210111233567-5213214432111-10000111000001110012222100012221222210001001 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSS-SVLILGDNVFYGS-DISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIE 158 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~-~~lilgDni~~~~-~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~ 158 (292) ....|.++++..+.....+.+ ++++++|--+... .+..+++.+..+...+.+. .| +| T Consensus 71 ~~~~G~~~si~~gl~~~~~~~~~li~~~D~P~l~~~~i~~L~~~~~~~~~~iv~p--------~~-------~g------ 129 (188) T TIGR03310 71 QYAEGQSSSIKLGLELPVQSDGYLFLLGDQPFVTPDIIQLLLEAFALKNDEIVVP--------LY-------KG------ 129 (188) T ss_pred CCCCCHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCCEEE--------CC-------CC------ T ss_conf 5567808999999970657885999718878899999999999998579961734--------55-------99------ Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHH Q ss_conf 33333222222024554100022432200467533333334320000013204631354125632784577999 Q gi|254780922|r 159 EKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLD 232 (292) Q Consensus 159 EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~ 232 (292) ++-+|. +|+.+.++.+.++. |+.....+++.+.. ....+..-.. ...+|+.|||||.+ T Consensus 130 -~~g~Pv---------lf~~~~~~~L~~l~----gd~G~r~ll~~~~~-~~~~v~~~d~-~~~~DiDTpeD~~~ 187 (188) T TIGR03310 130 -KRGHPV---------LFPRKLFPELLALT----GDTGGRQILRELPH-EVKYVEVKDP-GILFDIDTPEDYQA 187 (188) T ss_pred -CCCCCE---------EECHHHHHHHHHCC----CCCCHHHHHHHCCC-CEEEEECCCC-CCCCCCCCHHHHHC T ss_conf -536766---------87799999998443----88038999984806-7699972898-90418999999961 No 52 >PRK00155 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Reviewed Probab=99.54 E-value=1.5e-14 Score=111.43 Aligned_cols=211 Identities=20% Similarity=0.280 Sum_probs=123.0 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) |||||||.|+||. ..+||++++++|+|+|+|.|+.+.... |++|+|+++....+.++..+.. +...+.++. T Consensus 7 aIIlAaG~G~R~g---~~~pKQf~~l~gkpil~~sl~~f~~~~~i~~IvvV~~~~~~~~~~~~~~~---~~~~i~~v~-- 78 (228) T PRK00155 7 AIIPAAGIGSRMG---ADRPKQYLPLGGKPILEHTLEAFLAHPAIDEIIVVVPPDDTEYAKLLLAK---FDKRIKVVA-- 78 (228) T ss_pred EEEECCCCCCCCC---CCCCCCCCEECCEEHHHHHHHHHHCCCCCCEEEEEECCHHHHHHHHHHHH---CCCCEEEEC-- T ss_conf 9993772051378---67996556689569999999999759998979999576269999999874---498579976-- Q ss_pred CCCCC--CCCCCCCCCCCCCCCCCEEE-CCCCCCCCC-CCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHC Q ss_conf 32244--68521011123356752132-144321111-000011100000111001222210001222122221000100 Q gi|254780922|r 82 VPAGL--AQSYILGAEFIGDSSSVLIL-GDNVFYGSD-ISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISI 157 (292) Q Consensus 82 ~p~Gt--a~Ai~~a~~~i~~~~~~lil-gDni~~~~~-l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i 157 (292) -|. .++++.|..++.++++++|. +.--|.+.+ +.+++.... ..++.+.+.++.|. +...++++.+.+. T Consensus 79 --GG~tR~~Sv~ngL~~l~~~~~VlIHDaARP~is~~~i~~li~~~~--~~~~~ip~~p~~DT----ik~~~~~~~i~~t 150 (228) T PRK00155 79 --GGAERQDSVLNGLQALPDDDWVLVHDAARPFLTPDDIDRLIEAAE--EYGAAILAVPVRDT----IKRVDAGGFIVDT 150 (228) T ss_pred --CCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHH--HCCCCEEEEECCCC----EEEECCCCCEEEE T ss_conf --970399999999998403997999706676899999999999998--46984698760241----7896589815530 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHH--H-HCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHH Q ss_conf 133333222222024554100022432--2-0046753333333432000001320463135412563278457799999 Q gi|254780922|r 158 EEKPNNPKSSFAVTGIYFYDQEVVNIA--R-NIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTA 234 (292) Q Consensus 158 ~EKP~~~~sn~a~~GiY~f~~~i~~~~--~-~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As 234 (292) +. ++++.. .-+|+.|.+- . .........++.||=-..+...|.....+.+... =+=+-||+||.-|. T Consensus 151 l~-----R~~l~~----~QTPQ~F~~~~l~~a~~~~~~~~~~~TDd~sl~~~~g~~v~~v~G~~~-N~KIT~~eDL~~ae 220 (228) T PRK00155 151 PD-----RSGLWA----AQTPQGFRVELLKEALARALAEGFTITDDASAVEWLGKPVRLVEGRYD-NIKVTTPEDLALAE 220 (228) T ss_pred CC-----CHHEEE----EECCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCEEEEECCCC-CCCCCCHHHHHHHH T ss_conf 35-----144366----528853259999999999987599877899999986997699966854-37728999999999 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780922|r 235 VFVRN 239 (292) Q Consensus 235 ~~i~~ 239 (292) .+++. T Consensus 221 ~ll~~ 225 (228) T PRK00155 221 AILKR 225 (228) T ss_pred HHHHH T ss_conf 99985 No 53 >TIGR00453 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; InterPro: IPR001228 The bacterial ispD protein catalyzes the third step of the deoxyxylulose-5-phosphate pathway (DXP) of isoprenoid biosynthesis; the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate.; GO: 0008299 isoprenoid biosynthetic process. Probab=99.53 E-value=1.1e-14 Score=112.42 Aligned_cols=209 Identities=18% Similarity=0.269 Sum_probs=136.7 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 799248887455334438982222478374799999999978-9868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDA-GIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~-Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) |||||||.|+||. ...||+.++++|+|+|.|+|+.+..+ .|++|+||+.+.+.+.++..+.....+ ..+..+. T Consensus 2 avi~AAG~G~R~~---~~~pKQy~~l~G~Pll~h~~~~~~~~~~~~~vvVV~~~~~~~~~~~~~~~~~~~-~~~~~v~-- 75 (226) T TIGR00453 2 AVIPAAGRGTRMG---SGVPKQYLELAGRPLLEHTLDAFLAHPAIDEVVVVVSPDDTEFFQKALAARAKF-KVVKIVA-- 75 (226) T ss_pred EEEEECCCCEECC---CCCCEEEEEECCEEEHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHCCCCCC-CCCEEEC-- T ss_conf 6873167111336---798702665188022177899986323206468983674158888863256578-6115846-- Q ss_pred CCCC--CCCCCCCCCCCC---CCCCCCEEECC--CCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 3224--468521011123---35675213214--4321111000011100000111001222210001222122221000 Q gi|254780922|r 82 VPAG--LAQSYILGAEFI---GDSSSVLILGD--NVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQA 154 (292) Q Consensus 82 ~p~G--ta~Ai~~a~~~i---~~~~~~lilgD--ni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i 154 (292) -| =.+++..+.+.+ .+.+++||.=. =++....+.++++... ....+.+.+.++.|+ +...|.+|.+ T Consensus 76 --GG~~Rq~SV~~GL~a~~~~~~~~~VlvHDaARPf~~~~~~~~l~~~~~-~~~~a~~~A~Pv~DT----lK~~~~~G~~ 148 (226) T TIGR00453 76 --GGDTRQDSVRNGLKALPERADAEIVLVHDAARPFVPKELIDRLLEALE-KGAGAAILALPVADT----LKRVDADGFV 148 (226) T ss_pred --CCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHH-HCCCCEEEEEECCCE----EEEECCCCCE T ss_conf --987468999999998763589882898477345889879999999986-079834874205431----5774488862 Q ss_pred HHCCCCCCCCCCCCCHHHHH-HHHHHHHHH--HHH-CCC-CCCCCCCCCCCCCHHCCC-CCEEEEECCCCEEEECCCCHH Q ss_conf 10013333322222202455-410002243--220-046-753333333432000001-320463135412563278457 Q gi|254780922|r 155 ISIEEKPNNPKSSFAVTGIY-FYDQEVVNI--ARN-IRP-SARGELEITDVNSYYLDK-GLLAVEFLREGSAWFDAGTPE 228 (292) Q Consensus 155 ~~i~EKP~~~~sn~a~~GiY-~f~~~i~~~--~~~-l~~-s~rgE~eitD~~~~~l~~-~~~~~~~~~~g~~W~D~Gt~e 228 (292) .+=++- .++| +.+|..|+. +.+ ... -...-+++||=-..+-.- |....-+-++- .=+=+-||+ T Consensus 149 ~~T~dR----------~~Lw~~QTPQ~F~~~~L~~Ah~~~A~~~g~~~TDDA~~~E~~~G~~v~LVeG~~-~n~KiT~P~ 217 (226) T TIGR00453 149 VETVDR----------EGLWAAQTPQAFRRELLLKAHARAAKAEGFEITDDASAVERLFGGKVALVEGDA-DNFKITTPE 217 (226) T ss_pred ECCCCC----------CCCCEEECCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCEEEEECCC-CCEECCCHH T ss_conf 026662----------145255768876888999999999984698513589999996389728872761-011036727 Q ss_pred HHHHHHH Q ss_conf 7999999 Q gi|254780922|r 229 SLLDTAV 235 (292) Q Consensus 229 ~l~~As~ 235 (292) ||.-|.. T Consensus 218 DL~~Ae~ 224 (226) T TIGR00453 218 DLALAEA 224 (226) T ss_pred HHHHHHH T ss_conf 8999875 No 54 >KOG1461 consensus Probab=99.52 E-value=1.1e-14 Score=112.30 Aligned_cols=132 Identities=23% Similarity=0.366 Sum_probs=97.8 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCC----CCCC Q ss_conf 079924888745533443898222247837479999999997898689999777656431112222222211----1122 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGV----QFSY 77 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~----~i~y 77 (292) +||++|--+-||++|+|...|+.|+|+.+.|||+|+|+.|.++|+.++++.|+.+..+ +..|+.. +.|.. .+.. T Consensus 26 qAIllaDsf~trF~Plt~~~p~~LLPlaNVpmIdYtL~~L~~agV~eVfvfc~~~~~q-i~e~i~~-sew~~~~~~~v~t 103 (673) T KOG1461 26 QAILLADSFETRFRPLTLEKPRVLLPLANVPMIDYTLEWLERAGVEEVFVFCSAHAAQ-IIEYIEK-SEWYLPMSFIVVT 103 (673) T ss_pred EEEEEECCCHHCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHCCCEEEEEEECCCHHH-HHHHHHH-CCCCCCCCCEEEE T ss_conf 7999721410022313357874475644716799999999865961899995145799-9999863-5203665602799 Q ss_pred CCCCCCCCCCCCCC--CCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCC-----HHHHHCCCCCC Q ss_conf 23333224468521--01112335675213214432111100001110000-----01110012222 Q gi|254780922|r 78 IEQLVPAGLAQSYI--LGAEFIGDSSSVLILGDNVFYGSDISDIFHKARAR-----RNSATVVGCHV 137 (292) Q Consensus 78 ~~q~~p~Gta~Ai~--~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~-----~~~~ti~~~~v 137 (292) +.-.+.+-+|+|+. .++..+. .+|+|+.||. +.+.+|.+++++|.++ ++-+|++..+. T Consensus 104 i~s~~~~S~GDamR~id~k~lit-gDFiLVsgd~-vsN~pl~~~l~eHr~r~k~Dk~~iMTmv~k~~ 168 (673) T KOG1461 104 ICSGESRSVGDAMRDIDEKQLIT-GDFILVSGDT-VSNMPLRNVLEEHRKRRKEDKDAIMTMVFKES 168 (673) T ss_pred ECCCCCCCHHHHHHHHHHCCEEE-CCEEEEECCE-EECCCHHHHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 80788672888988877426010-6469983773-42475699999999876517243578998224 No 55 >cd04182 GT_2_like_f GT_2_like_f is a subfamily of the glycosyltransferase family 2 (GT-2) with unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=99.51 E-value=3.2e-14 Score=109.39 Aligned_cols=182 Identities=18% Similarity=0.307 Sum_probs=105.6 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) +-|||||||+|+||. .+|+|++++|+|||+|+++.+..+++.+|++|++++...... .+ ...++.+.. .+ T Consensus 1 i~~iILAgG~s~RmG-----~~K~l~~~~g~pli~~~i~~~~~~~~~~i~vv~~~~~~~~~~-~~---~~~~~~~v~-~~ 70 (186) T cd04182 1 IAAIILAAGRSSRMG-----GNKLLLPLDGKPLLRHALDAALAAGLSRVIVVLGAEADAVRA-AL---AGLPVVVVI-NP 70 (186) T ss_pred CEEEEECCCCCCCCC-----CCEECCEECCCCHHHHHHHHHHHCCCCCEEECCCCCHHHHHH-HH---CCCCCEEEE-CC T ss_conf 969998997653599-----891255899731999999999974998325237740789999-85---479936751-78 Q ss_pred CCCCCCCCCCCCCCCCCCCC-C-CCEEECCCCCCCCC-CCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHC Q ss_conf 33224468521011123356-7-52132144321111-000011100000111001222210001222122221000100 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDS-S-SVLILGDNVFYGSD-ISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISI 157 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~-~-~~lilgDni~~~~~-l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i 157 (292) ....|.++++..+...+..+ + ++++.+|.-|...+ +..+++.+......+.+.. | +| T Consensus 71 ~~~~G~~~si~~gl~~~~~~~~~~lv~~~D~P~i~~~~i~~L~~~~~~~~~~iv~~~--------~-------~g----- 130 (186) T cd04182 71 DWEEGMSSSLAAGLEALPADADAVLILLADQPLVTAETLRALIDAFREDGAGIVAPV--------Y-------QG----- 130 (186) T ss_pred CCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEE--------C-------CC----- T ss_conf 756785378999999733488768984377787899999999999975899589996--------1-------79----- Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHH Q ss_conf 1333332222220245541000224322004675333333343200000132046313541256327845779 Q gi|254780922|r 158 EEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESL 230 (292) Q Consensus 158 ~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l 230 (292) ++.+| + +|+.+.++.+.++.- .+|. ..++.... ....+..-..+ .-+|+.||+|| T Consensus 131 --~~g~P----v-----l~~~~~~~~l~~l~g-d~G~---r~~l~~~~--~~~~~~~~~~~-~~~didTpeDl 185 (186) T cd04182 131 --RRGHP----V-----LFPRSLFPELLALSG-DKGA---RSLLRAHP--DRVVVEVDDPG-VLIDIDTPEDL 185 (186) T ss_pred --CCCCC----E-----EECHHHHHHHHHCCC-CCCH---HHHHHHCC--CCEEEECCCCC-CCCCCCCHHHC T ss_conf --75686----4-----777999999984027-7037---99998698--86899639988-26689895880 No 56 >cd02516 CDP-ME_synthetase CDP-ME synthetase is involved in mevalonate-independent isoprenoid production. 4-diphosphocytidyl-2-methyl-D-erythritol synthase (CDP-ME), also called 2C-methyl-d-erythritol 4-phosphate cytidylyltransferase catalyzes the third step in the alternative (non-mevalonate) pathway of Isopentenyl diphosphate (IPP) biosynthesis: the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate. This mevalonate independent pathway that utilizes pyruvate and glyceraldehydes 3-phosphate as starting materials for production of IPP occurs in a variety of bacteria, archaea and plant cells, but is absent in mammals. Thus, CDP-ME synthetase is an attractive targets for the structure-based design of selective antibacterial, herbicidal and antimalarial drugs. Probab=99.47 E-value=7.1e-14 Score=107.23 Aligned_cols=206 Identities=21% Similarity=0.287 Sum_probs=116.4 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) |||||||.|+||. ..+||++++++|+|+|+|.|+.+.... |++|+|+++....+.++.+..... ...+.++. T Consensus 3 aIIlAaG~G~R~~---~~~PKQf~~i~gk~ii~~sl~~f~~~~~i~~Iivv~~~~~~~~~~~~~~~~~--~~~i~~v~-- 75 (218) T cd02516 3 AIILAAGSGSRMG---ADIPKQFLELGGKPVLEHTLEAFLAHPAIDEIVVVVPPDDIDLAKELAKYGL--SKVVKIVE-- 75 (218) T ss_pred EEEECCCCCCCCC---CCCCCCEEEECCEEHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHHHHCC--CCCEEEEC-- T ss_conf 9992683471488---5798523358967999999999975899897999968378999999886447--98769989-- Q ss_pred CCCCC--CCCCCCCCCCCC--CCCCCEEE-CCCCCCCC-CCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 32244--685210111233--56752132-14432111-10000111000001110012222100012221222210001 Q gi|254780922|r 82 VPAGL--AQSYILGAEFIG--DSSSVLIL-GDNVFYGS-DISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAI 155 (292) Q Consensus 82 ~p~Gt--a~Ai~~a~~~i~--~~~~~lil-gDni~~~~-~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~ 155 (292) -|. .++++.|...+. +.++++|. +.--|... .+.+++.... ..++.+.+.++.|. +...++++.+. T Consensus 76 --GG~tR~~SV~ngl~~l~~~~~~~VlIHDaaRP~i~~~~i~~li~~~~--~~~~~i~~~p~~DT----ik~~~~~~~i~ 147 (218) T cd02516 76 --GGATRQDSVLNGLKALPDADPDIVLIHDAARPFVSPELIDRLIDALK--EYGAAIPAVPVTDT----IKRVDDDGVVV 147 (218) T ss_pred --CCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHHHHHHHHH--HCCCEEEEECCHHC----EEEECCCCEEE T ss_conf --98409999998987401269998999257677899999999999997--37871898425101----79963897798 Q ss_pred HCCCCCCCCCCCCCHHHHHHHHHHHHHHH--HH-CCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHH Q ss_conf 00133333222222024554100022432--20-0467533333334320000013204631354125632784577999 Q gi|254780922|r 156 SIEEKPNNPKSSFAVTGIYFYDQEVVNIA--RN-IRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLD 232 (292) Q Consensus 156 ~i~EKP~~~~sn~a~~GiY~f~~~i~~~~--~~-l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~ 232 (292) +... ++++.. .-+|++|.+- .+ .....+...+.||-...+...|.....+.+... =+=+-||+||.- T Consensus 148 ~t~d-----R~~l~~----~QTPQ~F~~~~l~~a~~~~~~~~~~~TDd~sl~~~~g~~v~~v~G~~~-N~KIT~~eDl~~ 217 (218) T cd02516 148 ETLD-----REKLWA----AQTPQAFRLDLLLKAHRQASEEGEEFTDDASLVEAAGGKVALVEGSED-NIKITTPEDLAL 217 (218) T ss_pred EECC-----CCCEEE----EECCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCEEEEECCCC-CCCCCCHHHHHH T ss_conf 6237-----613798----748754259999999999986499978589999986997699975875-585998999824 Q ss_pred H Q ss_conf 9 Q gi|254780922|r 233 T 233 (292) Q Consensus 233 A 233 (292) | T Consensus 218 A 218 (218) T cd02516 218 A 218 (218) T ss_pred C T ss_conf 9 No 57 >COG2266 GTP:adenosylcobinamide-phosphate guanylyltransferase [Coenzyme metabolism] Probab=99.44 E-value=2.6e-13 Score=103.68 Aligned_cols=175 Identities=26% Similarity=0.379 Sum_probs=104.1 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |.+||||||+||||. ..-|||++|.|||||+|.++.|.+ .+++|++.++++. ...+.++. .+|+++. . T Consensus 1 m~~iiMAGGrGtRmg----~~EKPlleV~GkpLI~~v~~al~~-~~d~i~v~isp~t-p~t~~~~~---~~gv~vi---~ 68 (177) T COG2266 1 MMAIIMAGGRGTRMG----RPEKPLLEVCGKPLIDRVLEALRK-IVDEIIVAISPHT-PKTKEYLE---SVGVKVI---E 68 (177) T ss_pred CCEEEECCCCCCCCC----CCCCCCHHHCCCCHHHHHHHHHHH-HCCCEEEEECCCC-HHHHHHHH---HCCCEEE---E T ss_conf 935896288544468----876752020781389999999972-2183899967998-76999997---3593699---7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCCC Q ss_conf 33224468521011123356752132144321111000011100000111001222210001222122221000100133 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEEK 160 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~EK 160 (292) ..-.|--+-+..+.++++. |++++.+|=.|......+.+..... +. ..+... +-..|. T Consensus 69 tpG~GYv~Dl~~al~~l~~-P~lvvsaDLp~l~~~~i~~vi~~~~-----~~-~~p~~~--------~~~~G~------- 126 (177) T COG2266 69 TPGEGYVEDLRFALESLGT-PILVVSADLPFLNPSIIDSVIDAAA-----SV-EVPIVT--------VVKAGR------- 126 (177) T ss_pred CCCCCHHHHHHHHHHHCCC-CEEEEECCCCCCCHHHHHHHHHHHH-----HC-CCCEEE--------EECCCC------- T ss_conf 5998708999999974499-4599865534178899999999985-----24-675057--------311586------- Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHHH Q ss_conf 33322222202455410002243220046753333333432000001320463135412563278457799999999998 Q gi|254780922|r 161 PNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFVRNI 240 (292) Q Consensus 161 P~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i~~~ 240 (292) +-+|+-+|+. .++| ++.+ ....+ .-+++.|++||..|....... T Consensus 127 --------v~~Glni~~~------------~~~~-~~~~--------------i~~~~-la~NVNT~eDl~~a~~ll~~~ 170 (177) T COG2266 127 --------VPVGLNIVGG------------KQEE-EILE--------------IDNPE-LAVNVNTPEDLKKAERLLRTS 170 (177) T ss_pred --------CCEEEEEECC------------CCCC-EEEE--------------EECCC-EEEECCCHHHHHHHHHHHHHC T ss_conf --------4401686248------------7762-1687--------------50666-057649999999999997212 Q ss_pred HHHCC Q ss_conf 88809 Q gi|254780922|r 241 ENRLG 245 (292) Q Consensus 241 e~~~g 245 (292) +.+.| T Consensus 171 ~~~~~ 175 (177) T COG2266 171 ERLEG 175 (177) T ss_pred CCCCC T ss_conf 33547 No 58 >PRK13385 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Provisional Probab=99.42 E-value=2.4e-13 Score=103.97 Aligned_cols=217 Identities=17% Similarity=0.146 Sum_probs=115.3 Q ss_pred CE--EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90--7992488874553344389822224783747999999999789-86899997776564311122222222111122 Q gi|254780922|r 1 MK--GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSY 77 (292) Q Consensus 1 Mk--aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y 77 (292) |+ +||||||.|+||. ..+||++++++|+|++.|.|+.+.... |++|+|++.....+.+++++........++.+ T Consensus 1 M~~~~IilAaG~G~R~~---~~~pKQf~~l~gkpil~~sl~~f~~~~~i~~Ivvv~~~~~~~~~~~~~~~~~~~~~~v~i 77 (238) T PRK13385 1 MNYELIFLAAGQGKMGN---VPLPKQFLDIDNKPILIHTIEKFILVSEFNEIIIATPAQWISHTQDILKKYNITDQRVKV 77 (238) T ss_pred CCEEEEEECCCCCCCCC---CCCCCCEEEECCEEHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHCCCCCCCEEE T ss_conf 95699997773461488---689964327997889999999985587678799967788899999999861544476899 Q ss_pred CCCCCCCCCCCCCCCCCCCC------CCCCCCEEECC-CCCCCCC-CCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 23333224468521011123------35675213214-4321111-0000111000001110012222100012221222 Q gi|254780922|r 78 IEQLVPAGLAQSYILGAEFI------GDSSSVLILGD-NVFYGSD-ISDIFHKARARRNSATVVGCHVQNPQRYGVVEVD 149 (292) Q Consensus 78 ~~q~~p~Gta~Ai~~a~~~i------~~~~~~lilgD-ni~~~~~-l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d 149 (292) +.=... -.+++..|..++ .++++++|.=. --|.+.+ +.+.+....+... +.+.+.++.+. +... T Consensus 78 v~GG~t--R~~SV~ngl~~l~~~~~~~~~~~VlIHDaARP~vs~~~i~~~i~~~~~~~~-~~~~~~~~~dt-----i~~~ 149 (238) T PRK13385 78 VAGGTD--RNETIMNIIDHIRNVNGINNDDVIVTHDAVRPFLTQRIIKENIEVAVKYGA-VDTVIEAIDTI-----VMSK 149 (238) T ss_pred ECCCCC--HHHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCC-EEEEEEECCCE-----EEEE T ss_conf 328986--489999999998741147876889995466778999999999999872197-37999613445-----9992 Q ss_pred CCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHH--HH-CCC-CCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCC Q ss_conf 21000100133333222222024554100022432--20-046-753333333432000001320463135412563278 Q gi|254780922|r 150 SSNQAISIEEKPNNPKSSFAVTGIYFYDQEVVNIA--RN-IRP-SARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAG 225 (292) Q Consensus 150 ~~~~i~~i~EKP~~~~sn~a~~GiY~f~~~i~~~~--~~-l~~-s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~G 225 (292) +++.+.+.. +++++.. +-+|++|++- .+ ... +..+....||--..+...|.....+.+.. .=+=+- T Consensus 150 ~~~~i~~t~-----dR~~l~~----~QTPQ~F~~~~L~~a~~~~~~~~~~~~TDdasl~e~~g~~v~~v~G~~-~NiKIT 219 (238) T PRK13385 150 DKQNIHSIP-----VRNEMYQ----GQTPQSFNIKLLQDSYRALSSEQKEILSDACKIIVESGHPVKLVRGEL-YNIKVT 219 (238) T ss_pred CCCCEEECC-----CHHHHHH----HCCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHCCCCEEEEECCC-CCCCCC T ss_conf 699524126-----8788200----029961359999999999986289988889999987799669996696-347799 Q ss_pred CHHHHHHHHHHHH Q ss_conf 4577999999999 Q gi|254780922|r 226 TPESLLDTAVFVR 238 (292) Q Consensus 226 t~e~l~~As~~i~ 238 (292) ||+||.-|...++ T Consensus 220 ~~eDL~~a~~ll~ 232 (238) T PRK13385 220 TPYDLKVANAIIQ 232 (238) T ss_pred CHHHHHHHHHHHC T ss_conf 8999999999975 No 59 >PRK13368 3-deoxy-manno-octulosonate cytidylyltransferase; Provisional Probab=99.41 E-value=4.2e-13 Score=102.37 Aligned_cols=221 Identities=19% Similarity=0.250 Sum_probs=137.4 Q ss_pred CE--EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90--7992488874553344389822224783747999999999789-86899997776564311122222222111122 Q gi|254780922|r 1 MK--GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSY 77 (292) Q Consensus 1 Mk--aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y 77 (292) || +||+|=+-.|||- -|+|.+|+|+|||+|+.+.+..+. +++++|.|. +.+ +.++. +.+|.+... T Consensus 1 Mk~~~iIPAR~~S~R~p------~K~L~~i~g~pmi~~v~~~a~~s~~~d~v~VaTd--d~e-I~~~~---k~~g~~~i~ 68 (238) T PRK13368 1 MKVVVVIPARYGSSRLP------GKPLLDILGKPMIQHVYERAAQAAGVEEVYVATD--DQR-IEDAV---EAFGGKVVM 68 (238) T ss_pred CCEEEEECCCCCCCCCC------CCCCHHHCCCCHHHHHHHHHHHCCCCCEEEEECC--CHH-HHHHH---HCCCCEEEE T ss_conf 97899984687787899------8701252896999999999984689882999568--424-87775---004632785 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCC-CCCCHHHHHCCCHHH-HHCCCCCCCCCCC---CCC--CCCCC Q ss_conf 2333322446852101112335675213214432111-100001110000011-1001222210001---222--12222 Q gi|254780922|r 78 IEQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGS-DISDIFHKARARRNS-ATVVGCHVQNPQR---YGV--VEVDS 150 (292) Q Consensus 78 ~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~-~l~~~l~~~~~~~~~-~ti~~~~v~~p~~---yGv--v~~d~ 150 (292) ...+...|| +.+..|.+.++.+-++.+-||.-+... .+.++++.+.+.... +.....+..++.. --+ +.+|. T Consensus 69 ts~~~~~GT-dRi~Ea~~~~~~d~ivnvQgDePli~p~~I~~~i~~~~~~~~~~v~t~~~~~~~~~~~~n~n~vKvv~~~ 147 (238) T PRK13368 69 TSDDHLSGT-DRLAEVMLKIEADIYINVQGDEPLIRPRDIDTLIQPMLDDPSINVATLCAPISTEEEFESPNVVKVVVDK 147 (238) T ss_pred CCCCCCCCH-HHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHCCCCCEEEEECC T ss_conf 156557714-7899998515987899953886778999999999999837765276787416888884389852999899 Q ss_pred CCCHHHCCCCCC----C--CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECC Q ss_conf 100010013333----3--2222220245541000224322004675333333343200000132046313541256327 Q gi|254780922|r 151 SNQAISIEEKPN----N--PKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDA 224 (292) Q Consensus 151 ~~~i~~i~EKP~----~--~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~ 224 (292) ++++.-|--.|- + ...-+-..|+|.|+++++....+++++..-..|--+.++ ++..|.....+...+.. +.+ T Consensus 148 ~~~~lyfSR~~ip~~~~~~~~~~~khiGiy~f~~~~L~~f~~l~~s~lE~~E~lEqLR-~leng~~I~~~~~~~~~-~~V 225 (238) T PRK13368 148 NGDALYFSRSPIPSRRDGESARYLKHVGIYAFRRDVLQQFSQLPETPLEQIESLEQLR-ALEHGYKIRMVEVEATS-IGV 225 (238) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHEEEEEEEECHHHHHHHHHCCCCHHHHHHHHHHHH-HHHCCCCEEEEEECCCC-CCC T ss_conf 7870114525675112454201220301122119999998708998567777089999-99869926799959999-998 Q ss_pred CCHHHHHHHHHH Q ss_conf 845779999999 Q gi|254780922|r 225 GTPESLLDTAVF 236 (292) Q Consensus 225 Gt~e~l~~As~~ 236 (292) .||+||..|... T Consensus 226 Dt~eDl~~v~ki 237 (238) T PRK13368 226 DTPEDLERVRAI 237 (238) T ss_pred CCHHHHHHHHHH T ss_conf 799999999985 No 60 >TIGR00454 TIGR00454 conserved hypothetical protein TIGR00454; InterPro: IPR005245 This entry describes a family of conserved hypothetical proteins with no known function. . Probab=99.38 E-value=3.8e-13 Score=102.63 Aligned_cols=117 Identities=20% Similarity=0.321 Sum_probs=76.9 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCC-CCCCCCCCCCC Q ss_conf 907992488874553344389822224783747999999999789-8689999777656431112222-22221111222 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGS-GEKWGVQFSYI 78 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~-g~~~g~~i~y~ 78 (292) |-|+|+|||+||||. .+.=|||+.|.|++||||.|+.|..+. ++.|+++|+++-... ++|+.. +..+. +|. + T Consensus 1 M~aL~MaGGkGTRlg---~d~EKPL~evcGr~lIDhvl~~l~~~~GV~~ii~~TSPhTP~T-e~y~~~~~~~~~-~iv-v 74 (204) T TIGR00454 1 MLALVMAGGKGTRLG---RDVEKPLLEVCGRKLIDHVLEALKKAKGVDNIIVVTSPHTPKT-EEYVAEKYKEYK-RIV-V 74 (204) T ss_pred CCCEEECCCCCCCCC---CCCCCCHHHHCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCC-HHHHHHCCCEEE-EEE-E T ss_conf 950240588731226---5556533756585045777786640478753799837779763-788730585045-788-9 Q ss_pred CCCCCCCC----CCCCCCCCCCCCCCCCCEEECCCCCCCCCCCC-HHHHHC Q ss_conf 33332244----68521011123356752132144321111000-011100 Q gi|254780922|r 79 EQLVPAGL----AQSYILGAEFIGDSSSVLILGDNVFYGSDISD-IFHKAR 124 (292) Q Consensus 79 ~q~~p~Gt----a~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~-~l~~~~ 124 (292) +...=+|- -+++..-+..+ +|||+++.+|=++......+ .+..+. T Consensus 75 idasGkGYiEDl~E~~~hlE~~~-~EP~lV~ssDl~~~r~~~id~Ivd~y~ 124 (204) T TIGR00454 75 IDASGKGYIEDLREVLSHLELAF-SEPLLVVSSDLVLVRDKIIDSIVDAYY 124 (204) T ss_pred EECCCCCCHHHHHHHHHHHHHHH-CCCEEEEECCHHHHHHHHHHHHHHHHH T ss_conf 97699960252789999877663-398578743602433677877764321 No 61 >COG1211 IspD 4-diphosphocytidyl-2-methyl-D-erithritol synthase [Lipid metabolism] Probab=99.37 E-value=1.2e-12 Score=99.50 Aligned_cols=212 Identities=20% Similarity=0.305 Sum_probs=124.8 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 07992488874553344389822224783747999999999789-86899997776564311122222222111122233 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) =|||||||.||||.. ..||+.+++.|+||+.|.|+.+.... |++|+|+++......+.++.. ...+-.+.++.- T Consensus 6 ~~vilAaG~G~R~~~---~~pKq~l~l~g~pll~~tl~~f~~~~~i~~ivvv~~~~~~~~~~~~~~--~~~~~~v~~v~G 80 (230) T COG1211 6 SAVILAAGFGSRMGN---PVPKQYLELGGRPLLEHTLEAFLESPAIDEIVVVVSPEDDPYFEKLPK--LSADKRVEVVKG 80 (230) T ss_pred EEEEECCCCCCCCCC---CCCCEEEEECCEEEHHHHHHHHHHCCCCCEEEEEECHHHHHHHHHHHH--HCCCCEEEEECC T ss_conf 999975763433579---998458899998856999999974767676999978466489998632--124870799348 Q ss_pred CCCCCCCCCCCCCCCCCC--CCCCCEEECCC-CCCCC-CCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 332244685210111233--56752132144-32111-100001110000011100122221000122212222100010 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIG--DSSSVLILGDN-VFYGS-DISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAIS 156 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~--~~~~~lilgDn-i~~~~-~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~ 156 (292) ... -.++++.+...+. +++++|+.=.. -|... .+.+++. .....++.+.+.++.|+-+. .|.++.+.+ T Consensus 81 G~~--R~~SV~~gL~~~~~~~~~~VlvHDaaRPf~~~~~i~~li~--~~~~~~aai~alpv~DTik~----~~~~~~i~~ 152 (230) T COG1211 81 GAT--RQESVYNGLQALSKYDSDWVLVHDAARPFLTPKLIKRLIE--LADKYGAAILALPVTDTLKR----VDADGNIVE 152 (230) T ss_pred CCC--HHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHH--HHCCCCCEEEEEECCCCEEE----ECCCCCEEE T ss_conf 751--7999999999721158988999646667899999999998--62238817997504673788----658987234 Q ss_pred CCCCCCCCCCCCCHHHHH-HHHHHHHHHH--HH-CC-CCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHH Q ss_conf 013333322222202455-4100022432--20-04-6753333333432000001320463135412563278457799 Q gi|254780922|r 157 IEEKPNNPKSSFAVTGIY-FYDQEVVNIA--RN-IR-PSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLL 231 (292) Q Consensus 157 i~EKP~~~~sn~a~~GiY-~f~~~i~~~~--~~-l~-~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~ 231 (292) ... . .++| .-+|..|.+- .+ .. -...+ .++||--..+...|....-+-++- .=+-+-||+||. T Consensus 153 t~~-----R-----~~l~~~QTPQ~F~~~~L~~a~~~a~~~~-~~~tDdas~~e~~G~~v~lV~G~~-~n~KiTtpeDL~ 220 (230) T COG1211 153 TVD-----R-----SGLWAAQTPQAFRLELLKQALARAFAEG-REITDDASAIEKAGGPVSLVEGSA-DNFKITTPEDLE 220 (230) T ss_pred CCC-----H-----HHHHHHHCCCCCCHHHHHHHHHHHHHCC-CCCCCHHHHHHHCCCCEEEEECCC-CEEEECCHHHHH T ss_conf 437-----5-----5423433876331999999999998628-776886999987699819982683-305762888999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254780922|r 232 DTAVFVR 238 (292) Q Consensus 232 ~As~~i~ 238 (292) -|...+. T Consensus 221 ~a~~il~ 227 (230) T COG1211 221 IAEAILR 227 (230) T ss_pred HHHHHHC T ss_conf 9998752 No 62 >TIGR03202 pucB xanthine dehydrogenase accessory protein pucB. In Bacillus subtilis the expression of this protein, located in an operon with the structural subunits of xanthine dehydrogenase, has been found to be essential for XDH activity. Some members of this family appear to have a distant relationship to the MobA protein involved in molybdopterin biosynthesis, although this may be coincidental. Probab=99.35 E-value=9.7e-13 Score=100.10 Aligned_cols=184 Identities=17% Similarity=0.230 Sum_probs=107.9 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCC-CCCCCCCCCCCCCC Q ss_conf 07992488874553344389822224783747999999999789868999977765643111222-22222111122233 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLG-SGEKWGVQFSYIEQ 80 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~-~g~~~g~~i~y~~q 80 (292) -|||||||.++||. .+|.|+|+.|+||+.+.+..+..+++.+++||+++...+ ...+.. .....++.+.+ .. T Consensus 2 ~aiILAAG~S~RmG-----~~KlLl~~~g~~ll~~~~~~~~~~~~~~vvVV~g~~~~~-~~~~~~~~~~~~~~~~v~-n~ 74 (190) T TIGR03202 2 VAIYLAAGQSRRMG-----ENKLALPLGETTLGSASLKTALSSRLSKVIVVIGEKYAH-LSWLDPYLLADERIMLVC-CR 74 (190) T ss_pred EEEEECCCCCCCCC-----CCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCHHHH-HHHHHHHHHCCCCEEEEE-CC T ss_conf 79997998766899-----884238509911999999999866998199981871888-876678763348749996-89 Q ss_pred CCCCCCCCCCCCCCCCCCC---CCCCEEECCCCCCCC-CCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 3322446852101112335---675213214432111-100001110000011100122221000122212222100010 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGD---SSSVLILGDNVFYGS-DISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAIS 156 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~---~~~~lilgDni~~~~-~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~ 156 (292) +...|.+.++..+...... +-++++++|--+... .+..++..+......+.+ | .| T Consensus 75 ~~~~G~~sSi~~Gl~~~~~~~~d~~li~l~D~P~l~~~~i~~L~~~~~~~~~~iv~-------~-~~------------- 133 (190) T TIGR03202 75 DACEGQAHSLKCGLRKAEAMGADAVVILLADQPFLTADVINALLALAKRRPDDYVA-------A-SF------------- 133 (190) T ss_pred CCCCCCHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEE-------E-EE------------- T ss_conf 86546079999999974366889699964897788999999999999858990899-------8-62------------- Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHH Q ss_conf 01333332222220245541000224322004675333333343200000132046313541256327845779 Q gi|254780922|r 157 IEEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESL 230 (292) Q Consensus 157 i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l 230 (292) ..++-+|. +|+...|+.+..++ |+..-..+++... ....+.....+ .-+|+.||||| T Consensus 134 -~g~~G~Pv---------lf~~~~f~~L~~l~----GD~Gar~ll~~~~--~~~~~~~~d~~-~~~DIDTpeDy 190 (190) T TIGR03202 134 -KGKPRPPI---------LFSKSLFPKLKALK----GDEGARALLRKDK--SGLALPVADAS-AFFDIDTKEDY 190 (190) T ss_pred -CCCCCCCE---------EECHHHHHHHHHCC----CCHHHHHHHHCCC--CCEEEEECCCC-CCCCCCCHHHC T ss_conf -89636988---------97799999988378----8745999984177--65699838989-16079884549 No 63 >PRK05450 3-deoxy-manno-octulosonate cytidylyltransferase; Provisional Probab=99.34 E-value=1.6e-12 Score=98.66 Aligned_cols=224 Identities=20% Similarity=0.269 Sum_probs=133.4 Q ss_pred CE--EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90--799248887455334438982222478374799999999978-986899997776564311122222222111122 Q gi|254780922|r 1 MK--GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDA-GIREILIISTPRDLPVLKEFLGSGEKWGVQFSY 77 (292) Q Consensus 1 Mk--aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~-Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y 77 (292) || +||+|=...|||- -|||.+|+|+|||.|+.+.+..+ ++++++|.|. +.++. ++. +.+|.+... T Consensus 1 Mk~~~iIPAR~~SsR~P------gKpL~~I~GkpmI~~v~~~a~~~~~~~~V~VATd--d~~I~-~~~---~~~g~~~im 68 (248) T PRK05450 1 MKFVVIIPARYASTRLP------GKPLADIGGKPMIVRVYERASKASGADRVVVATD--DERIA-DAV---EAFGGEVVM 68 (248) T ss_pred CCEEEEEECCCCCCCCC------CCCCHHHCCCCHHHHHHHHHHHCCCCCEEEEEEC--CHHHH-HHH---HCCCCCEEE T ss_conf 97799980797787899------8620241896989999999996689882999708--85244-353---156862340 Q ss_pred CCCCCCCCCCCCCCCCCCCCC--CCCCCE-EECCCCCCCC-CCCCHHHHHCCCHH-HHHCCCCCCCCCCCC---C--CCC Q ss_conf 233332244685210111233--567521-3214432111-10000111000001-110012222100012---2--212 Q gi|254780922|r 78 IEQLVPAGLAQSYILGAEFIG--DSSSVL-ILGDNVFYGS-DISDIFHKARARRN-SATVVGCHVQNPQRY---G--VVE 147 (292) Q Consensus 78 ~~q~~p~Gta~Ai~~a~~~i~--~~~~~l-ilgDni~~~~-~l~~~l~~~~~~~~-~~ti~~~~v~~p~~y---G--vv~ 147 (292) ...+.+.||- -+..|.+.++ +.++++ +-||.-+... .+..++........ .+..+..+..++..+ . -|. T Consensus 69 Ts~~h~~GTd-Ri~Ea~~~l~~~~~d~IInvQGDEPli~p~~I~~li~~~~~~~~~~i~tl~~~~~~~~~~~d~n~VKvv 147 (248) T PRK05450 69 TSADHPSGTD-RIAEAAAKLGLSDDDIVVNVQGDEPLIPPELIDQVAEPLAAHPEADMATLAVPIDDEEELFNPNVVKVV 147 (248) T ss_pred CCCCCCCCHH-HHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCHHHHHCCCCEEEE T ss_conf 4645477038-999999850877786899935986778999999999999858866399999862888996088834899 Q ss_pred CCCCCCHHHCCCCCC---------CCCC-CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCC Q ss_conf 222100010013333---------3222-222024554100022432200467533333334320000013204631354 Q gi|254780922|r 148 VDSSNQAISIEEKPN---------NPKS-SFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLRE 217 (292) Q Consensus 148 ~d~~~~i~~i~EKP~---------~~~s-n~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~ 217 (292) +|.++++.-|---|- ...+ -+--.|+|.|+++++....+++++..-..|--+.++ .+..|.....+... T Consensus 148 ~~~~~~alyfSRs~IP~~~~~~~~~~~~~~~~hiGiy~f~~~~L~~f~~l~~s~lE~~E~lEqLR-~leng~~I~~~~~~ 226 (248) T PRK05450 148 LDKNGYALYFSRAPIPWGRDAADPTAPTPVYRHIGIYAYRRGFLRRFVSLPPSPLEQIESLEQLR-ALENGYRIHVAVVD 226 (248) T ss_pred ECCCCCEEEEECCCCCCCCCCCCCCCCCCEEEEEEEEEEHHHHHHHHHHCCCCHHHHHHHHHHHH-HHHCCCCEEEEEEC T ss_conf 89999840554267875444321235555168999886009999998746998146565269999-99859957999967 Q ss_pred CEEEECCCCHHHHHHHHHHHH Q ss_conf 125632784577999999999 Q gi|254780922|r 218 GSAWFDAGTPESLLDTAVFVR 238 (292) Q Consensus 218 g~~W~D~Gt~e~l~~As~~i~ 238 (292) ...=+-+.||+||.+|...++ T Consensus 227 ~~~~~~VDT~eDl~~v~~~lk 247 (248) T PRK05450 227 EAPSPGVDTPEDLERVRALLA 247 (248) T ss_pred CCCCCCCCCHHHHHHHHHHHC T ss_conf 899999779999999999866 No 64 >PRK00317 mobA molybdopterin-guanine dinucleotide biosynthesis protein A; Reviewed Probab=99.34 E-value=3.4e-12 Score=96.67 Aligned_cols=186 Identities=19% Similarity=0.334 Sum_probs=98.3 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |.|||||||++|||. ..+|+|++++|+|+|+|+++.|. ..+.+++|+++.+. +.+.. ++..+ +.. T Consensus 4 i~gvILAGG~SsRmg----G~dK~ll~~~G~~li~~~~~~l~-~~~~~v~i~~~~~~-~~~~~-------~~~~v--i~D 68 (193) T PRK00317 4 ITGVILAGGRARRMG----GVDKGLQELNGKPLIQHVIDRLA-PQVDEIVINANRNL-ARYAA-------FGLPV--IPD 68 (193) T ss_pred CEEEEECCCCCCCCC----CCCCCCCEECCEEHHHHHHHHHH-HHCCEEEEECCCCH-HHHHH-------CCCEE--EEC T ss_conf 569998876767899----99975065899749999999877-54576899768788-88864-------49969--963 Q ss_pred CCC--CCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCC-CCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHC Q ss_conf 332--244685210111233567521321443211110-00011100000111001222210001222122221000100 Q gi|254780922|r 81 LVP--AGLAQSYILGAEFIGDSSSVLILGDNVFYGSDI-SDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISI 157 (292) Q Consensus 81 ~~p--~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l-~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i 157 (292) ..| .|.-.++..|......+.++++..|--|...+. ..+++...+....+.+ ...+++. T Consensus 69 ~~~~~~GPL~Gi~saL~~~~~~~vlv~~~DmP~i~~~~i~~L~~~~~~~~~~iv~---------------~~~~~~~--- 130 (193) T PRK00317 69 ELADFPGPLAGILAGLQQAETEWVLFVPCDTPFLPRDLVARLAQALIEEDADIAV---------------AHDGERD--- 130 (193) T ss_pred CCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEE---------------EEECCCC--- T ss_conf 7877667589999998626877599962677889899999999998747998389---------------8648954--- Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHH Q ss_conf 1333332222220245541000224322004675333333343200000132046313541256327845779999999 Q gi|254780922|r 158 EEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVF 236 (292) Q Consensus 158 ~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~ 236 (292) +| ..|+ |+.++++.+...-. .|+..+.++++. -+...+.+-.....++++.||+||.+|.+. T Consensus 131 -----~P-----L~al--y~~~~l~~l~~~l~--~g~~~l~~~l~~---~~~~~v~~~~~~~~f~NiNTpeDl~~~~~~ 192 (193) T PRK00317 131 -----HP-----TFAL--YSVALLPDLEAALA--AGERKVMAFYAR---HGTVAVDFSDPKDAFFNINTPEDLAQLEQL 192 (193) T ss_pred -----CC-----CCCC--CCHHHHHHHHHHHH--CCCCCHHHHHHH---CCCEEEEECCCCCCCCCCCCHHHHHHHHHH T ss_conf -----65-----3110--47889999999999--099659999987---895899807986623689999999999961 No 65 >cd02517 CMP-KDO-Synthetase CMP-KDO synthetase catalyzes the activation of KDO which is an essential component of the lipopolysaccharide. CMP-KDO Synthetase: 3-Deoxy-D-manno-octulosonate cytidylyltransferase (CMP-KDO synthetase) catalyzes the conversion of CTP and 3-deoxy-D-manno-octulosonate into CMP-3-deoxy-D-manno-octulosonate (CMP-KDO) and pyrophosphate. KDO is an essential component of the lipopolysaccharide found in the outer surface of gram-negative eubacteria. It is also a constituent of the capsular polysaccharides of some gram-negative eubacteria. Its presence in the cell wall polysaccharides of green algae and plant were also discovered. However, they have not been found in yeast and animals. The absence of the enzyme in mammalian cells makes it an attractive target molecule for drug design. Probab=99.34 E-value=2.1e-12 Score=98.00 Aligned_cols=218 Identities=18% Similarity=0.237 Sum_probs=133.0 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) +||+|=+-.|||- -|||.+|+|+|||+|+.+.+..+. +++|+|.|. +.++. ++. +.+|.+......+ T Consensus 4 ~iIPAR~~S~Rlp------~K~L~~I~G~pmI~~v~~~a~~s~~~d~V~VATD--d~eI~-~~~---~~~g~~~imT~~~ 71 (239) T cd02517 4 VVIPARYASSRLP------GKPLADIAGKPMIQHVYERAKKAKGLDEVVVATD--DERIA-DAV---ESFGGKVVMTSPD 71 (239) T ss_pred EEECCCCCCCCCC------CCCCHHHCCCCHHHHHHHHHHHCCCCCEEEEECC--CHHHH-HHH---CCCCCCCCCCCCC T ss_conf 9975887687899------9730453995989999999996699884999648--26555-332---1479740104643 Q ss_pred CCCCCCCCCCCCCCCCCCC-CC-CEEECCCCCCCC-CCCCHHHHHCCCH-HHHHCCCCCCCCCC---CCCCC--CCCCCC Q ss_conf 3224468521011123356-75-213214432111-1000011100000-11100122221000---12221--222210 Q gi|254780922|r 82 VPAGLAQSYILGAEFIGDS-SS-VLILGDNVFYGS-DISDIFHKARARR-NSATVVGCHVQNPQ---RYGVV--EVDSSN 152 (292) Q Consensus 82 ~p~Gta~Ai~~a~~~i~~~-~~-~lilgDni~~~~-~l~~~l~~~~~~~-~~~ti~~~~v~~p~---~yGvv--~~d~~~ 152 (292) .+.|| +.+..|.+.++.+ ++ +.+-||.-+... .+.+++....+.. ..++.+..+..++. .-.+| .+|+++ T Consensus 72 h~~GT-dRi~ea~~~l~~~~D~iinvQGDePli~p~~I~~~i~~~~~~~~~~v~t~~~~i~~~~~~~~~n~VKvv~~~~~ 150 (239) T cd02517 72 HPSGT-DRIAEVAEKLDADDDIVVNVQGDEPLIPPEMIDQVVAALKDDPGVDMATLATPISDEEELFNPNVVKVVLDKDG 150 (239) T ss_pred CCCCC-HHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCHHHHCCCCCEEEEECCCC T ss_conf 35621-59999999719988989995187567899999999999852887559874023699889418996399978988 Q ss_pred CHHHCCCCCC-------CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCC Q ss_conf 0010013333-------322222202455410002243220046753333333432000001320463135412563278 Q gi|254780922|r 153 QAISIEEKPN-------NPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAG 225 (292) Q Consensus 153 ~i~~i~EKP~-------~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~G 225 (292) ++.-|---|- ....-+--.|+|.|++++++...+++++..-..|--+.+ .++..|............ +++. T Consensus 151 ~alyfSRs~IP~~~~~~~~~~~~khiGIy~f~~~~L~~f~~~~~s~lE~~E~lEqL-R~leng~~I~~~~~~~~~-~~VD 228 (239) T cd02517 151 YALYFSRSPIPYPRDSSEDFPYYKHIGIYAYRRDFLLRFAALPPSPLEQIESLEQL-RALENGYKIKVVETDHES-IGVD 228 (239) T ss_pred CCHHCCCCCCCCCCCCCCCCCEEEEEEEEEECHHHHHHHHCCCCCHHHHHHHHHHH-HHHHCCCEEEEEEECCCC-CCCC T ss_conf 51103347886434565550410377655633999999873799825777608999-999839806899969999-8988 Q ss_pred CHHHHHHHHH Q ss_conf 4577999999 Q gi|254780922|r 226 TPESLLDTAV 235 (292) Q Consensus 226 t~e~l~~As~ 235 (292) ||+||.+|.. T Consensus 229 t~eDl~~vek 238 (239) T cd02517 229 TPEDLERVEA 238 (239) T ss_pred CHHHHHHHHC T ss_conf 9999999970 No 66 >COG2068 Uncharacterized MobA-related protein [General function prediction only] Probab=99.29 E-value=1.1e-11 Score=93.48 Aligned_cols=187 Identities=24% Similarity=0.399 Sum_probs=115.7 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431112222222211112223333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLV 82 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~ 82 (292) |||||||+||||. .||.|+|+.|+|++.+.+++...++.+++++|+++.-.+...... ...+..+.+ .++- T Consensus 8 ~VvLAAGrssRmG-----~~KlLap~~g~plv~~~~~~a~~a~~~~vivV~g~~~~~~~~a~~---~~~~~~~v~-npd~ 78 (199) T COG2068 8 AVVLAAGRSSRMG-----QPKLLAPLDGKPLVRASAETALSAGLDRVIVVTGHRVAEAVEALL---AQLGVTVVV-NPDY 78 (199) T ss_pred EEEECCCCCCCCC-----CCCEECCCCCCCHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHH---CCCCEEEEE-CCCH T ss_conf 8887166666678-----751654149982899999999726887099992761136788653---369808996-9552 Q ss_pred CCCCCCCCCCCCCCCCCC--CCCEEECCCCC-CCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCC Q ss_conf 224468521011123356--75213214432-111100001110000011100122221000122212222100010013 Q gi|254780922|r 83 PAGLAQSYILGAEFIGDS--SSVLILGDNVF-YGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEE 159 (292) Q Consensus 83 p~Gta~Ai~~a~~~i~~~--~~~lilgDni~-~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~E 159 (292) ..|.+.++..+......+ -.++.+||--+ ...++..++...... . . +. .|. |+ .+.| T Consensus 79 ~~Gls~Sl~ag~~a~~~~~~~v~~~lgDmP~V~~~t~~rl~~~~~~~-~-~--~v----~p~-~~----g~rG------- 138 (199) T COG2068 79 AQGLSTSLKAGLRAADAEGDGVVLMLGDMPQVTPATVRRLIAAFRAR-G-A--AV----RPV-YG----GARG------- 138 (199) T ss_pred HHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHCCCC-C-C--EE----EEE-CC----CCCC------- T ss_conf 22376999999873556797699996888878789999999844346-7-3--44----222-06----8767------- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCE-EEEECCCCEEEECCCCHHHHHHHHHHHH Q ss_conf 333322222202455410002243220046753333333432000001320-4631354125632784577999999999 Q gi|254780922|r 160 KPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLL-AVEFLREGSAWFDAGTPESLLDTAVFVR 238 (292) Q Consensus 160 KP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~-~~~~~~~g~~W~D~Gt~e~l~~As~~i~ 238 (292) +|. .|+++.|..+..+..+ .| .+.++.++.+ .+.+..+...-.|+.||++|.+|+..++ T Consensus 139 ---~Pv---------~~~~~~~~~l~~l~GD-~G-------~r~ll~~~~~~~~~V~~~~g~llDVDTped~~~a~~~~~ 198 (199) T COG2068 139 ---HPV---------LLSKDLFPALARLSGD-VG-------ARQLLEEGGLPLVEVEVDAGVLLDVDTPEDLARAQDLLR 198 (199) T ss_pred ---CCE---------EECHHHHHHHHHCCCC-HH-------HHHHHHHCCCCEEEECCCCCEEECCCCHHHHHHHHHHHC T ss_conf ---964---------5656678998616886-22-------899997538634765268851754799899999987624 No 67 >PRK09382 ispDF bifunctional 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase/2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase protein; Provisional Probab=99.15 E-value=1.2e-10 Score=86.97 Aligned_cols=196 Identities=17% Similarity=0.222 Sum_probs=111.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) +||||||.|+||.. ...+||++++++|+|+++|.|+.+.... |++|+|+++....+.++++.. ++.++. T Consensus 9 aIIlAAG~G~Rfg~-~~~ipKQf~~l~gkpil~~sl~~f~~~~~i~~Ivvvv~~~~~~~~~~~~~-------~i~~V~-- 78 (382) T PRK09382 9 LVIVAAGRSTRFGQ-SAEVKKQWLRIGGKPLWLHVLENFSSAPDFKEIVVVIHPDDIAYMKRAVP-------EIKFVT-- 78 (382) T ss_pred EEEECCCCCCCCCC-CCCCCCCEEEECCEEHHHHHHHHHHCCCCCCEEEEEECHHHHHHHHHHCC-------CCEEEC-- T ss_conf 99968722504899-87999741067977899999999966899898999938889999997457-------957937-- Q ss_pred CCCCC--CCCCCCCCCCCCCCCCCEEEC-CCCCCCCC-CCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHC Q ss_conf 32244--685210111233567521321-44321111-000011100000111001222210001222122221000100 Q gi|254780922|r 82 VPAGL--AQSYILGAEFIGDSSSVLILG-DNVFYGSD-ISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISI 157 (292) Q Consensus 82 ~p~Gt--a~Ai~~a~~~i~~~~~~lilg-Dni~~~~~-l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i 157 (292) -|. .++++.|.+.+.+ ++++|.= ---|.+.+ +.++++... ..++.+.+.++.|+-++. ++ T Consensus 79 --GG~tRq~SV~ngL~~i~~-~~VlIHDaARP~vs~~lI~~li~~~~--~~~a~ip~lpv~DTik~~----~~------- 142 (382) T PRK09382 79 --GGATRQESVRNALEALDS-EYVLISDAARPFVPKELIDRLIEALD--KADCVLPALPVADTLKRA----NE------- 142 (382) T ss_pred --CCHHHHHHHHHHHHHCCC-CEEEEECCCCCCCCHHHHHHHHHHHC--CCCCCEEEEECCCCEEEC----CC------- T ss_conf --850599999999975678-88999554257899999999998651--477615875243350104----66------- Q ss_pred CCCCCCCCCCCCHHHHH-HHHHHHHHH------HHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHH Q ss_conf 13333322222202455-410002243------22004675333333343200000132046313541256327845779 Q gi|254780922|r 158 EEKPNNPKSSFAVTGIY-FYDQEVVNI------ARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESL 230 (292) Q Consensus 158 ~EKP~~~~sn~a~~GiY-~f~~~i~~~------~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l 230 (292) - -++++ +| +=+|+.|.+ .++.. ....+.||=-..+...|.....+.++. .=+=+-||+|| T Consensus 143 --t--vdR~~-----l~~~QTPQ~F~~~~L~~a~~~a~---~~~~~~TDDasl~e~~G~~V~~V~G~~-~N~KITtpeDL 209 (382) T PRK09382 143 --T--VDREG-----LKLIQTPQLSRTKTLLAAHRKAL---SGRTDFTDDSSIAEAAGGKVALVEGSE-DLHKLTFKEDL 209 (382) T ss_pred --C--CCHHH-----CEEECCCCCCCHHHHHHHHHHHH---HCCCCCCCHHHHHHHCCCCEEEEECCC-CCCCCCCHHHH T ss_conf --4--68678-----45622775355999999999998---639986740999998699279995675-48538999999 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254780922|r 231 LDTAVFV 237 (292) Q Consensus 231 ~~As~~i 237 (292) ..+..+. T Consensus 210 ~~~~~~~ 216 (382) T PRK09382 210 KALDRLS 216 (382) T ss_pred HHHHHHH T ss_conf 9998644 No 68 >cd02503 MobA MobA catalyzes the formation of molybdopterin guanine dinucleotide. The prokaryotic enzyme molybdopterin-guanine dinucleotide biosynthesis protein A (MobA). All mononuclear molybdoenzymes bind molybdenum in complex with an organic cofactor termed molybdopterin (MPT). In many bacteria, including Escherichia coli, molybdopterin can be further modified by attachment of a GMP group to the terminal phosphate of molybdopterin to form molybdopterin guanine dinucleotide (MGD). This GMP attachment step is catalyzed by MobA, by linking a guanosine 5'-phosphate to MPT forming molybdopterin guanine dinucleotide. This reaction requires GTP, MgCl2, and the MPT form of the cofactor. It is a reaction unique to prokaryotes, and therefore may represent a potential drug target. Probab=99.09 E-value=1.4e-10 Score=86.50 Aligned_cols=105 Identities=17% Similarity=0.270 Sum_probs=67.7 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) +-|||||||.+|||. .+|.|++++|+|+|+|+++.+... ..+++|++++... . + ...++.+.. .+ T Consensus 1 i~aiILAgG~S~RmG-----~~K~ll~~~g~~li~~~~~~l~~~-~~~v~vv~~~~~~-~---~----~~~~~~~i~-d~ 65 (181) T cd02503 1 ITGVILAGGKSRRMG-----GDKALLELGGKPLLEHVLERLKPL-VDEVVISANRDQE-R---Y----ALLGVPVIP-DE 65 (181) T ss_pred CEEEEECCCCCCCCC-----CCCCCCEECCCCHHHHHHHHHHHH-CCCEEEEECCCHH-H---H----HCCCCEEEE-CC T ss_conf 969997897734189-----985177989840999999999864-6807999177277-6---6----317986994-79 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHH Q ss_conf 3322446852101112335675213214432111100001 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIF 120 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l 120 (292) ..-.|...++..+......+.++++.+|--+...+....+ T Consensus 66 ~~~~GPl~gi~~~l~~~~~~~~lv~~~D~P~i~~~~i~~L 105 (181) T cd02503 66 PPGKGPLAGILAALRAAPADWVLVLACDMPFLPPELLERL 105 (181) T ss_pred CCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHH T ss_conf 9787848999999862787759996479898889999999 No 69 >PRK00560 molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional Probab=99.02 E-value=1.3e-09 Score=80.56 Aligned_cols=181 Identities=13% Similarity=0.175 Sum_probs=97.9 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEE-HHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC- Q ss_conf 799248887455334438982222478374-799999999978986899997776564311122222222111122233- Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKP-MIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ- 80 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkp-lI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q- 80 (292) |||||||+++||. .+|+++++.|+| ||+|+++.|... +++|+|.++... .. + +. .++.. T Consensus 11 ~vILAGGkSsRmG-----~dKall~~~g~~~Lie~~~~~l~~~-~~~i~is~~~~~-~~---~-------~~--~~i~d~ 71 (196) T PRK00560 11 CVILAGGKSSRMG-----ENKALLPFGSYSSLLEYQYTRLLKL-FKKVYISTKDKK-FE---F-------NA--PFLLEK 71 (196) T ss_pred EEEECCCCCCCCC-----CCCCCCEECCCCHHHHHHHHHHHHH-CCEEEEECCCHH-HH---C-------CC--CEEECC T ss_conf 8997378722399-----9862527699555999999998863-676899648136-64---5-------99--888657 Q ss_pred -CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCC Q ss_conf -3322446852101112335675213214432111100001110000011100122221000122212222100010013 Q gi|254780922|r 81 -LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEE 159 (292) Q Consensus 81 -~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~E 159 (292) ....|.-.++..|......+.++++-.|.-|...++...+.. .... .++....+++. T Consensus 72 ~~~~~GPL~Gi~saL~~~~~~~~~v~~cD~Pfl~~~~i~~L~~--~~~~---------------~~v~~~~~~~~----- 129 (196) T PRK00560 72 ESDLFSPLFGIHNAFLTLQTPEIFFISVDTPFVSFESIKKLCG--KENF---------------SVTYAKSPTKE----- 129 (196) T ss_pred CCCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHH--CCCC---------------CEEEEEECCCC----- T ss_conf 8898880999999998577786899957888889999999972--2699---------------98999548901----- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHHH Q ss_conf 33332222220245541000224322004675333333343200000132046313541256327845779999999999 Q gi|254780922|r 160 KPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFVRN 239 (292) Q Consensus 160 KP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i~~ 239 (292) +| ..|+| +++++..++..-. .|++-+.++++.. +...+.+-... ..+.+.||+||.+|...+++ T Consensus 130 ---~P-----L~aly--~~~~l~~l~~~l~--~g~~~l~~~l~~~---~~~~v~~~~~~-~F~NiNTpeD~~~A~~~lk~ 193 (196) T PRK00560 130 ---HY-----LISLW--HRSLLNALNYALK--TQNYRLSDLVKNS---SSQAVHFEDEE-EFLNLNTLKDYELALQILKS 193 (196) T ss_pred ---CC-----EEEEE--CHHHHHHHHHHHH--CCCCHHHHHHHHC---CCEEEECCCCC-CCCCCCCHHHHHHHHHHHHH T ss_conf ---30-----46785--2889999999998--2995199999878---96897459987-42468999999999999872 Q ss_pred H Q ss_conf 8 Q gi|254780922|r 240 I 240 (292) Q Consensus 240 ~ 240 (292) . T Consensus 194 ~ 194 (196) T PRK00560 194 R 194 (196) T ss_pred C T ss_conf 3 No 70 >COG0746 MobA Molybdopterin-guanine dinucleotide biosynthesis protein A [Coenzyme metabolism] Probab=99.01 E-value=6.7e-10 Score=82.28 Aligned_cols=105 Identities=21% Similarity=0.415 Sum_probs=71.7 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887455334438982222478374799999999978986899997776564311122222222111122233 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q 80 (292) |.+||||||+|+|| .+|+|++++|+|||+|+++.|....- .+ +|+...+.+. + ..+|+ ..+.. T Consensus 5 ~~~vILAGG~srRm------~dK~l~~~~g~~lie~v~~~L~~~~~-~v-vi~~~~~~~~---~----~~~g~--~vv~D 67 (192) T COG0746 5 MTGVILAGGKSRRM------RDKALLPLNGRPLIEHVIDRLRPQVD-VV-VISANRNQGR---Y----AEFGL--PVVPD 67 (192) T ss_pred CEEEEECCCCCCCC------CCCEEEEECCEEHHHHHHHHHHCCCC-EE-EEECCCCHHH---H----HCCCC--CEEEC T ss_conf 36999778753567------88745787982899999998740188-79-9976873424---4----31698--57547 Q ss_pred CCCC-CCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCCCHHHH Q ss_conf 3322-44685210111233567521321443211110000111 Q gi|254780922|r 81 LVPA-GLAQSYILGAEFIGDSSSVLILGDNVFYGSDISDIFHK 122 (292) Q Consensus 81 ~~p~-Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~~~l~~ 122 (292) ..+. |.-.++..|....+.+.++++-+|.-|...++-+.+.. T Consensus 68 ~~~~~GPL~Gi~~al~~~~~~~~~v~~~D~P~i~~~lv~~l~~ 110 (192) T COG0746 68 ELPGFGPLAGILAALRHFGTEWVLVLPCDMPFIPPELVERLLS 110 (192) T ss_pred CCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHH T ss_conf 8888887899999998579875999816778789999999998 No 71 >pfam02348 CTP_transf_3 Cytidylyltransferase. This family consists of two main Cytidylyltransferase activities: 1) 3-deoxy-manno-octulosonate cytidylyltransferase,, EC:2.7.7.38 catalysing the reaction:- CTP + 3-deoxy-D-manno-octulosonate <= diphosphate + CMP-3-deoxy-D-manno-octulosonate, 2) acylneuraminate cytidylyltransferase EC:2.7.7.43,, catalysing the reaction:- CTP + N-acylneuraminate <= diphosphate + CMP-N-acylneuraminate. NeuAc cytydilyltransferase of Mannheimia haemolytica has been characterized describing kinetics and regulation by substrate charge, energetic charge and amino-sugar demand. Probab=98.96 E-value=7.4e-10 Score=82.03 Aligned_cols=176 Identities=19% Similarity=0.264 Sum_probs=108.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) +||+|=+-.|||- .|+|.+|+|+|||+|+++.+..++ +++|+|.|. +.++ .+.. +.+|.+......+ T Consensus 2 ~iIpAR~~S~Rlp------~K~L~~i~gkpmI~~v~~~a~~s~~~~~viVaTd--~~~I-~~~~---~~~~~~~i~t~~~ 69 (197) T pfam02348 2 AIIPARSGSKRLP------GKNLLKLGGKPLIARVIEAALQSKLFDKVVVATD--SEEI-ADIA---EKFGGGVVVRRGS 69 (197) T ss_pred EEECCCCCCCCCC------CCCCHHHCCCCHHHHHHHHHHHCCCCCEEEEECC--CCEE-EEEC---CCCCCCEEEEEEC T ss_conf 8984588887899------9620352990689999999997589872899548--3014-4201---2323313576402 Q ss_pred CCCCCCCCCCCCCCCCCCCC-CCEEECCCCCCCCC-CCCHHHHHCCCHHHH--HCCCC-----CCCCCCCCCCCCCCCCC Q ss_conf 32244685210111233567-52132144321111-000011100000111--00122-----22100012221222210 Q gi|254780922|r 82 VPAGLAQSYILGAEFIGDSS-SVLILGDNVFYGSD-ISDIFHKARARRNSA--TVVGC-----HVQNPQRYGVVEVDSSN 152 (292) Q Consensus 82 ~p~Gta~Ai~~a~~~i~~~~-~~lilgDni~~~~~-l~~~l~~~~~~~~~~--ti~~~-----~v~~p~~yGvv~~d~~~ 152 (292) .+.||.-....++.+..+.+ ++-+.||..|.... +.+++..+.+..... ++... ...+|+.-= +.+|.++ T Consensus 70 ~~~gt~R~~ea~~~~~~~~d~ivnvqgD~Pli~p~~i~~~i~~~~~~~~~~~~t~~~~~~~~~~~~~~n~vK-vv~~~~~ 148 (197) T pfam02348 70 LASGTDRFIEAVKAFLADEEIIVNLQGDEPLLQPTSILRAIEHLREAGEDYVSTLVKPVVSSEEILNPNPLK-VVLDDAG 148 (197) T ss_pred CCCCCCHHHHHHHHHCCHHEEEEECCCCEEECCHHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHCCCCCCE-EEECCCC T ss_conf 356521199999864341009996589668758899999999998589872898204737888933998749-9988899 Q ss_pred CHHHCCCCCC--------CCCCCCCHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 0010013333--------32222220245541000224322004675 Q gi|254780922|r 153 QAISIEEKPN--------NPKSSFAVTGIYFYDQEVVNIARNIRPSA 191 (292) Q Consensus 153 ~i~~i~EKP~--------~~~sn~a~~GiY~f~~~i~~~~~~l~~s~ 191 (292) +..-|...|. .+...+-..|+|.|+.+.+.....+.+++ T Consensus 149 ~alyfsR~~iP~~~~~~~~~~~~~~h~gIY~f~~~~L~~f~~l~~s~ 195 (197) T pfam02348 149 YALYFSRSPIPYIRDHPAPLQLYLRHKGIYAFRKEHLIRYVIDTPSD 195 (197) T ss_pred CEEECCCCCCCCCCCCCCCCCCCCEEEEEEEECHHHHHHHHCCCCCC T ss_conf 88241378897431232557875489999898899999987689999 No 72 >PRK02726 molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional Probab=98.92 E-value=1.4e-09 Score=80.22 Aligned_cols=99 Identities=19% Similarity=0.322 Sum_probs=63.5 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431112222222211112223333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLV 82 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~ 82 (292) |||||||+++||. .+|+|+++.|+|+|+|+++.+...+ ++|+|+++ . .+.+..++.. + +.++.... T Consensus 10 ~vILAGG~S~RMG-----~dK~ll~~~G~~ll~~~~~~l~~~~-~~v~i~~~-~-~~~~~~~~~~----~--~~~i~D~~ 75 (200) T PRK02726 10 ALILAGGKSSRMG-----QDKALITWQGVPLLQRVCRIAAACA-DEVYIITP-W-PERYQSLLPP----G--CHWLREPP 75 (200) T ss_pred EEEECCCCHHHCC-----CCCCCCEECCEEHHHHHHHHHHHHC-CEEEEECC-C-HHHHHHHCCC----C--CEEEECCC T ss_conf 9998576540279-----9843387898759999999987418-86999679-9-7887752588----8--57901699 Q ss_pred C-CCCCCCCCCCCCCCCCCCCCEEECCC-CCCCCCC Q ss_conf 2-24468521011123356752132144-3211110 Q gi|254780922|r 83 P-AGLAQSYILGAEFIGDSSSVLILGDN-VFYGSDI 116 (292) Q Consensus 83 p-~Gta~Ai~~a~~~i~~~~~~lilgDn-i~~~~~l 116 (292) + .|.-.++..|...... +.+++++.| -|...++ T Consensus 76 ~~~GPL~Gi~saL~~~~~-~~~lv~~cD~P~l~~~~ 110 (200) T PRK02726 76 PSGGPLVAFAQGLPQIKT-EWVLLLACDLPKLRVGV 110 (200) T ss_pred CCCCCHHHHHHHHHHCCC-CCEEEECCCCCCCCHHH T ss_conf 889859999999985688-83899438878889999 No 73 >COG1212 KdsB CMP-2-keto-3-deoxyoctulosonic acid synthetase [Cell envelope biogenesis, outer membrane] Probab=98.80 E-value=1.1e-08 Score=74.62 Aligned_cols=222 Identities=18% Similarity=0.265 Sum_probs=131.3 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431112222222211112223333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLV 82 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~ 82 (292) +||+|-=..|||- -|||-.|+|||||.+..++...+|..+++|.|. +.++++.. ..+|.+.....-+. T Consensus 6 viIPAR~~STRLp------gKPLadI~GkpmI~rV~e~a~~s~~~rvvVATD--de~I~~av----~~~G~~avmT~~~h 73 (247) T COG1212 6 VIIPARLASTRLP------GKPLADIGGKPMIVRVAERALKSGADRVVVATD--DERIAEAV----QAFGGEAVMTSKDH 73 (247) T ss_pred EEEECCHHCCCCC------CCCHHHHCCCHHHHHHHHHHHHCCCCEEEEECC--CHHHHHHH----HHHCCEEEECCCCC T ss_conf 9985420026689------973577379607899999998737872899749--88999999----97097899617888 Q ss_pred CCCCCCCCCCCCCC-CCCCCCCE-EECCCCCCCCC-CCCHHHHHCCCHHHHHCCCCCCCCCCC-C--CC--CCCCCCCCH Q ss_conf 22446852101112-33567521-32144321111-000011100000111001222210001-2--22--122221000 Q gi|254780922|r 83 PAGLAQSYILGAEF-IGDSSSVL-ILGDNVFYGSD-ISDIFHKARARRNSATVVGCHVQNPQR-Y--GV--VEVDSSNQA 154 (292) Q Consensus 83 p~Gta~Ai~~a~~~-i~~~~~~l-ilgDni~~~~~-l~~~l~~~~~~~~~~ti~~~~v~~p~~-y--Gv--v~~d~~~~i 154 (292) +.||--..-.++.. +.++..++ +-||.-|.... +...++...+...+..-.++++.+++. + -+ +.+|++|+. T Consensus 74 ~SGTdR~~Ev~~~l~~~~~~iIVNvQGDeP~i~p~~I~~~~~~L~~~~~~~aTl~~~i~~~ee~~nPN~VKvV~d~~g~A 153 (247) T COG1212 74 QSGTDRLAEVVEKLGLPDDEIIVNVQGDEPFIEPEVIRAVAENLENSNADMATLAVKITDEEEAFNPNVVKVVLDKEGYA 153 (247) T ss_pred CCCCHHHHHHHHHCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHCCCCCEEEEECCCCCE T ss_conf 98008999999965999642999865887777979999999998737766045534337888845998279997599828 Q ss_pred HHCCCCCC----C---CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCH--HCCCCCEEEEECCCCEEEECCC Q ss_conf 10013333----3---2222220245541000224322004675333333343200--0001320463135412563278 Q gi|254780922|r 155 ISIEEKPN----N---PKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSY--YLDKGLLAVEFLREGSAWFDAG 225 (292) Q Consensus 155 ~~i~EKP~----~---~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~--~l~~~~~~~~~~~~g~~W~D~G 225 (292) .=|---|- + +..-+--.|||-|...+++-.-.++||. +|-+.-+++ .+..|........+...-.-+. T Consensus 154 LYFSRs~iP~~rd~~~~~p~l~HIGIYayr~~~L~~f~~~~ps~---LE~~E~LEQLR~Le~G~kI~v~i~~~~p~~gVD 230 (247) T COG1212 154 LYFSRAPIPYGRDNFGGTPFLRHIGIYAYRAGFLERFVALKPSP---LEKIESLEQLRVLENGEKIHVEIVKEVPSIGVD 230 (247) T ss_pred EEEECCCCCCCCCCCCCCCHHHEEEHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCC T ss_conf 99876878775434677643642205886799999997069960---567877899999876975789984168888989 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 45779999999999 Q gi|254780922|r 226 TPESLLDTAVFVRN 239 (292) Q Consensus 226 t~e~l~~As~~i~~ 239 (292) |||||.+|-.++.+ T Consensus 231 T~EDLe~v~~~~~~ 244 (247) T COG1212 231 TPEDLERVRKILSN 244 (247) T ss_pred CHHHHHHHHHHHHH T ss_conf 88999999999875 No 74 >TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase; InterPro: IPR006375 This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (5.3.1.8 from EC) (PMI) and mannose-1-phosphate guanylyltransferase (2.7.7.22 from EC) in Pseudomonas aeruginosa, Xanthomonas campestris, and Acetobacter xylinus. The literature on the enzyme from Escherichia coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain lies at the C-terminal. ; GO: 0008928 mannose-1-phosphate guanylyltransferase (GDP) activity, 0009103 lipopolysaccharide biosynthetic process. Probab=98.79 E-value=3.4e-09 Score=77.87 Aligned_cols=232 Identities=19% Similarity=0.283 Sum_probs=132.3 Q ss_pred EEEEECCCCCCCCCHHHCC-CCCCEEEECC-E-EHHHHHHHHHHHC-CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC- Q ss_conf 0799248887455334438-9822224783-7-4799999999978-98689999777656431112222222211112- Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDL-LSKQMLPIYN-K-PMIYYPVSTLMDA-GIREILIISTPRDLPVLKEFLGSGEKWGVQFS- 76 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~-~pK~Llpi~g-k-plI~~~l~~l~~~-Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~- 76 (292) -.+|+|||.||||+|++.. .||+++++.| . +++.-.+..+... ....-+++|+..+.-...+-+. ..+.... T Consensus 2 ~p~~~~gg~g~~lwp~~~~~~p~~f~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~n~~~~f~~~~~~~---~~~~~~~~ 78 (478) T TIGR01479 2 LPVILAGGSGTRLWPLSRELYPKQFLALVGDLYTLLQETLKRLAGLPDCSSPLVICNEEHRFLVAEQLR---ELGLLASN 78 (478) T ss_pred CCEEEECCCCCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCEEEECCHHHHHHHHHHH---HHCCCCCC T ss_conf 522640787750121211220034432045204678999986226642236525524203468888898---60652222 Q ss_pred CCCCCCCCCCCCCCCCCC-----CCCCCCCCCEEECCC-CCCCC-CCCCHHHHHCCCHHH--HHCCCCCCCCC-CCCCCC Q ss_conf 223333224468521011-----123356752132144-32111-100001110000011--10012222100-012221 Q gi|254780922|r 77 YIEQLVPAGLAQSYILGA-----EFIGDSSSVLILGDN-VFYGS-DISDIFHKARARRNS--ATVVGCHVQNP-QRYGVV 146 (292) Q Consensus 77 y~~q~~p~Gta~Ai~~a~-----~~i~~~~~~lilgDn-i~~~~-~l~~~l~~~~~~~~~--~ti~~~~v~~p-~~yGvv 146 (292) .+.++..+.|+-|+..+. ..-+.++.+++++.| ++.+. .+...+......... ...|+.....| +.||.+ T Consensus 79 ~~l~p~g~~~~p~~~~~~~~~~~~~~~~~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~g~~~~fg~~p~~p~~gygy~ 158 (478) T TIGR01479 79 ILLEPVGRNTAPAIALAALLALRRAPGEDPLLLVLAADHVITDEDAFRAAVKLALPAAAEGKLVTFGIVPTTPETGYGYI 158 (478) T ss_pred EEECCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCHH T ss_conf 14302666415789999998764057877478986155111003678999876432433175478502357765453111 Q ss_pred CCC--C----------CCCHHHCCCCCCCCC-C-------CCCHHHHHHHHH------------HHHHHHHHCCCCCCCC Q ss_conf 222--2----------100010013333322-2-------222024554100------------0224322004675333 Q gi|254780922|r 147 EVD--S----------SNQAISIEEKPNNPK-S-------SFAVTGIYFYDQ------------EVVNIARNIRPSARGE 194 (292) Q Consensus 147 ~~d--~----------~~~i~~i~EKP~~~~-s-------n~a~~GiY~f~~------------~i~~~~~~l~~s~rgE 194 (292) ... . -..+..|+|||.... . .+-+.|.++|.. ++++.+.........+ T Consensus 159 ~~g~~~~~~~~~~~~~~~~~~~f~~~p~~~~~~~~~~~g~~~wn~g~f~~~~~~~~~~l~~~~p~~~~~~~~~~~~~~~d 238 (478) T TIGR01479 159 RRGLDAPLAGGDDGDDVYAVERFVEKPDLATAQEYLESGDYYWNSGMFLFRASRYLAELKKHAPDIYDACEAALEASEPD 238 (478) T ss_pred HCCCCCCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHHCCCCEECCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 01443102442112214666665405455678999751762001430000135789988763058899999987410234 Q ss_pred CCCC------------CCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHH Q ss_conf 3333------------43200000132046313541256327845779999999 Q gi|254780922|r 195 LEIT------------DVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVF 236 (292) Q Consensus 195 ~eit------------D~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~ 236 (292) .+.. +.+...+-+..........+..|-|+|+...+++.+.. T Consensus 239 ~~~~~~~~~~f~~~p~~~~d~~~~~~~~~~~~~p~~~~w~d~g~w~~~w~~~~~ 292 (478) T TIGR01479 239 LDFIRLDKEAFEACPSESIDYAVMEKTSDAVVVPLDAGWSDVGSWSALWEISDK 292 (478) T ss_pred CCHHCCCHHHHHCCCCCCCCHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHCCC T ss_conf 201000457874062100002322214570687235665433431344320035 No 75 >COG1083 NeuA CMP-N-acetylneuraminic acid synthetase [Cell envelope biogenesis, outer membrane] Probab=98.61 E-value=3.9e-08 Score=71.20 Aligned_cols=202 Identities=15% Similarity=0.214 Sum_probs=123.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) |||+|-|-..|.. .|-+.+++|||||.|+++.+.+++ +++| +|++.. . .++..+..+|.+.-+.... T Consensus 6 AiIpAR~gSKgI~------~KNi~~~~gkpLi~~~I~aA~ns~~fd~V-viSsDs-~----~Il~~A~~ygak~~~~Rp~ 73 (228) T COG1083 6 AIIPARGGSKGIK------NKNIRKFGGKPLIGYTIEAALNSKLFDKV-VISSDS-E----EILEEAKKYGAKVFLKRPK 73 (228) T ss_pred EEEECCCCCCCCC------CCCHHHHCCCCHHHHHHHHHHCCCCCCEE-EECCCC-H----HHHHHHHHHCCCCCCCCCH T ss_conf 9976267887687------63357757961589999998658852269-984795-8----9999998727611024874 Q ss_pred CC-CCCC---CCCCCCCCCCCC-CC-CCEEECCC-CCCCCCCCCHHHHHCCCHHHHHCCCCCCCCC-CC-CCCCCCCCCC Q ss_conf 32-2446---852101112335-67-52132144-3211110000111000001110012222100-01-2221222210 Q gi|254780922|r 82 VP-AGLA---QSYILGAEFIGD-SS-SVLILGDN-VFYGSDISDIFHKARARRNSATVVGCHVQNP-QR-YGVVEVDSSN 152 (292) Q Consensus 82 ~p-~Gta---~Ai~~a~~~i~~-~~-~~lilgDn-i~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p-~~-yGvv~~d~~~ 152 (292) +. .+++ ++++.+..+.+. ++ .++..+-. +..+..+.+.++...++..+..+.+++.+.. -+ | .+.+| T Consensus 74 ~LA~D~ast~~~~lh~le~~~~~~~~~~lLq~TsPLl~~~~ik~A~e~f~~~~~~sl~sa~e~e~~p~k~f----~~~~~ 149 (228) T COG1083 74 ELASDRASTIDAALHALESFNIDEDTLILLQPTSPLLTSLHIKEAFEKFLNNQYDSLFSAVECEHHPYKAF----SLNNG 149 (228) T ss_pred HHCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHH----HHCCC T ss_conf 54167216789999999874424570688406850001457999999996588763588753223468888----75078 Q ss_pred CHHHCCCCCC-------CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCC Q ss_conf 0010013333-------322222202455410002243220046753333333432000001320463135412563278 Q gi|254780922|r 153 QAISIEEKPN-------NPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAG 225 (292) Q Consensus 153 ~i~~i~EKP~-------~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~G 225 (292) ++..+.|-|. -|+.-..+..+|+++.+.|.--+ .... +...+..+.+... +|+. T Consensus 150 ~~~~~~~~~~~~~rrQ~Lpk~Y~~NgaiYi~~~~~l~e~~-----------------~~f~-~~~~~y~m~~~~~-~DID 210 (228) T COG1083 150 EVKPVNEDPDFETRRQDLPKAYRENGAIYINKKDALLEND-----------------CFFI-PNTILYEMPEDES-IDID 210 (228) T ss_pred CEEECCCCCCCCCCCCCCHHHHHHCCCEEEEHHHHHHHCC-----------------CEEC-CCCEEEECCCCCC-CCCC T ss_conf 4410114776554211051656536767975279874157-----------------5415-8966898171200-0566 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 45779999999999 Q gi|254780922|r 226 TPESLLDTAVFVRN 239 (292) Q Consensus 226 t~e~l~~As~~i~~ 239 (292) |+.||..|.+.+.. T Consensus 211 ~~~Dl~iae~l~~~ 224 (228) T COG1083 211 TELDLEIAENLIFL 224 (228) T ss_pred CHHHHHHHHHHHHH T ss_conf 08769999998656 No 76 >cd04193 UDPGlcNAc_PPase UDPGlcNAc pyrophosphorylase catalayzes the synthesis of UDPGlcNAc. UDP-N-acetylglucosamine (UDPGlcNAc) pyrophosphorylase (UAP) (also named GlcNAc1P uridyltransferase), catalyzes the reversible conversion of UTP and GlcNAc1 to PPi and UDPGlcNAc. UDP-N-acetylglucosamine (UDPGlcNAc), the activated form of GlcNAc, is a key precursor of N- and O-linked glycosylations. It is essential for the synthesis of chitin (a major component of the fungal cell wall) and of the glycosylphosphatidylinositol (GPI) linker which anchors a variety of cell surface proteins to the plasma membrane. In bacteria, UDPGlcNAc represents an essential precursor for both peptidoglycan and lipopolysaccharide biosynthesis. Human UAP has two isoforms, resulting from alternative splicing of a single gene and differing by the presence or absence of 17 amino acids. UDPGlcNAc pyrophosphorylase shares significant sequence and structure conservation with UDPglucose pyrophosphorylase. Probab=98.60 E-value=6.2e-08 Score=69.95 Aligned_cols=179 Identities=20% Similarity=0.275 Sum_probs=102.1 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC---CEEHHHHHHHHHHH----CC------CC-EEEEEECCCCCCCCCCCCCCC Q ss_conf 799248887455334438982222478---37479999999997----89------86-899997776564311122222 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIY---NKPMIYYPVSTLMD----AG------IR-EILIISTPRDLPVLKEFLGSG 68 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~---gkplI~~~l~~l~~----~G------i~-~I~ii~~~~~~~~~~~~l~~g 68 (292) .|+||||.||||. ...||-|+||. ++++++...+.+.. +| +. -++|.++....+....++... T Consensus 18 vv~laGG~GTrlG---~~~pKg~~~i~~~~~~sl~~l~~e~i~~l~~~~~~~~~~~~~ip~~iMtS~~T~~~T~~~l~~~ 94 (323) T cd04193 18 VLLLAGGQGTRLG---FDGPKGMFPVGLPSKKSLFQLQAERILKLQELAGEASGKKVPIPWYIMTSEATHEETRKFFKEN 94 (323) T ss_pred EEEECCCCCCCCC---CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHC T ss_conf 9995788745568---9997162572688998099999999999999987743888851699978853458999999867 Q ss_pred CCCCCC---CCCCCCC---------------------CCCCCCCCCCC--CC---C-CC--CCCCCCEEECCCCCCCCCC Q ss_conf 222111---1222333---------------------32244685210--11---1-23--3567521321443211110 Q gi|254780922|r 69 EKWGVQ---FSYIEQL---------------------VPAGLAQSYIL--GA---E-FI--GDSSSVLILGDNVFYGSDI 116 (292) Q Consensus 69 ~~~g~~---i~y~~q~---------------------~p~Gta~Ai~~--a~---~-~i--~~~~~~lilgDni~~~~~l 116 (292) ..+|++ +.+..|. .|.|.|+-+.. .. + +. +-+-..+...||+....-- T Consensus 95 ~~fGl~~~~i~~f~Q~~~P~~~~~g~~~l~~~~~~~~~P~GhG~i~~aL~~sG~l~~l~~~Gi~yi~v~~vDN~L~~~~D 174 (323) T cd04193 95 NYFGLDPEQVHFFQQGMLPCVDFDGKILLEEKGKIAMAPNGNGGLYKALQTAGILEDMKKRGIKYIHVYSVDNILVKVAD 174 (323) T ss_pred CCCCCCHHHEEEEECCCCEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCHHHHHHCCCEEEEEEECCCCCCCCCC T ss_conf 64688855647871687314868987156899751427999717999999878488999669889999965744323248 Q ss_pred CCHHHHHCCCHHHHHCCCCCCCCC-CCCCCCC-CCCCCCHHHCCCCCCCC----------CCCCCHHHHHHHHHHHHHHH Q ss_conf 000111000001110012222100-0122212-22210001001333332----------22222024554100022432 Q gi|254780922|r 117 SDIFHKARARRNSATVVGCHVQNP-QRYGVVE-VDSSNQAISIEEKPNNP----------KSSFAVTGIYFYDQEVVNIA 184 (292) Q Consensus 117 ~~~l~~~~~~~~~~ti~~~~v~~p-~~yGvv~-~d~~~~i~~i~EKP~~~----------~sn~a~~GiY~f~~~i~~~~ 184 (292) ..++-.+..++.++.+-+++..+| ++-|++- .|..-.+++.-|=|... .-+.++++.++|+-+.++.+ T Consensus 175 p~~lG~~~~~~~~~~~kvv~K~~~~ekvG~l~~~~gk~~vvEYsel~~~~~~~~~~~g~l~f~~gNi~~~~fsl~fl~~~ 254 (323) T cd04193 175 PVFIGFCISKGADVGAKVVRKRYPTEKVGVVVLVDGKPQVVEYSEISDELAEKRDADGELQYNAGNIANHFFSLDFLEKA 254 (323) T ss_pred HHHHHHHHHCCCCEEEEEEECCCCCCCEEEEEEECCEEEEEEECCCCHHHHHHHCCCCCEEECCCCHHHHHEEHHHHHHH T ss_conf 89999998749963789988888998310899989906999842589889840187661651455677753139999999 No 77 >TIGR02665 molyb_mobA molybdopterin-guanine dinucleotide biosynthesis protein A; InterPro: IPR013482 In many molybdopterin-containing enzymes, including nitrate reductase and dimethylsulphoxide reductase, the cofactor is molybdopterin-guanine dinucleotide. This entry describes proteobacterial MobA (molybdopterin-guanine dinucleotide biosynthesis protein A). MobA synthesises molybdopterin-guanine dinucleotide from molybdopterin and GTP. ; GO: 0003824 catalytic activity, 0006777 Mo-molybdopterin cofactor biosynthetic process. Probab=98.56 E-value=1.9e-08 Score=73.11 Aligned_cols=117 Identities=19% Similarity=0.273 Sum_probs=75.6 Q ss_pred EEEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 07992488874553344389822224783747999999999789868999977765643111222222221111222333 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) -|||||||.|+||. ...|.|+++.|+|||+|+++.|. --+.+|+|.+|.+. +.+. +.+ +|..+ ..+ T Consensus 2 ~gviLAGG~arRMG----G~DKGL~~L~g~PL~~hv~~rL~-PQv~~~~IsANRn~-~~Y~-~~g----~Gl~V---~~D 67 (202) T TIGR02665 2 SGVILAGGRARRMG----GRDKGLVELGGKPLIEHVLARLR-PQVSDLAISANRNP-ERYA-QAG----FGLPV---VPD 67 (202) T ss_pred CEEEECCCCCCCCC----CCCCCCEECCCCCHHHHHHHHHC-CCHHHHHHHCCCCH-HHHH-HHC----CCCEE---CCC T ss_conf 73676587400268----88866101286528999999843-50766674138897-7898-860----89731---278 Q ss_pred C---C---CCCCCCCCCCCCCC----CCCCCCEEECCCCCCCCCCCCHHHHHCCC-HHHHHC Q ss_conf 3---2---24468521011123----35675213214432111100001110000-011100 Q gi|254780922|r 82 V---P---AGLAQSYILGAEFI----GDSSSVLILGDNVFYGSDISDIFHKARAR-RNSATV 132 (292) Q Consensus 82 ~---p---~Gta~Ai~~a~~~i----~~~~~~lilgDni~~~~~l~~~l~~~~~~-~~~~ti 132 (292) + - .|---.|..+.++. ..+-++.+=.|+=|...+|-.-|..+... ...+.+ T Consensus 68 ~~DA~~~F~GPLAGilagL~~a~~~~~~~~vl~~PCD~P~lP~dLv~RL~~a~~~~~a~iav 129 (202) T TIGR02665 68 DVDALADFPGPLAGILAGLRWAVAGTGTDWVLTVPCDTPFLPEDLVARLAAALEAQDADIAV 129 (202) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEE T ss_conf 53435788887689999999998446998288822888998878999999997528997899 No 78 >cd02518 GT2_SpsF SpsF is a glycosyltrnasferase implicated in the synthesis of the spore coat. Spore coat polysaccharide biosynthesis protein F (spsF) is a glycosyltransferase implicated in the synthesis of the spore coat in a variety of bacteria challenged by stress as starvation. The spsF gene is expressed in the late stage of coat development responsible for a terminal step in coat formation that involves the glycosylation of the coat. SpsF gene mutation resulted in spores that appeared normal. But, the spores tended to aggregate and had abnormal adsorption properties, indicating a surface alteration. Probab=98.55 E-value=5.8e-07 Score=63.87 Aligned_cols=221 Identities=18% Similarity=0.189 Sum_probs=118.1 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCC-CCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-8689999777656-4311122222222111122233 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDL-PVLKEFLGSGEKWGVQFSYIEQ 80 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~-~~~~~~l~~g~~~g~~i~y~~q 80 (292) |||.|-.-.|||- -|+|.+++|+|||+|+++.+..+. +++|++.|+.... +.+.++. ...|+++ ..- T Consensus 2 ~iI~aR~~S~RLp------~K~L~~i~~~~~i~~~i~r~k~~~~~~~IivaTs~~~~Dd~l~~~~---~~~~i~~--frG 70 (233) T cd02518 2 AIIQARMGSTRLP------GKVLKPLGGKPLLEHLLDRLKRSKLIDEIVIATSTNEEDDPLEALA---KKLGVKV--FRG 70 (233) T ss_pred EEEECCCCCCCCC------CCCHHHHCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHH---HHCCEEE--EEC T ss_conf 8996077883789------8600230995799999999996689884999817865226999999---7484899--957 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCC-CCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHCCC Q ss_conf 33224468521011123356752132144321111-00001110000011100122221000122212222100010013 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSD-ISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQAISIEE 159 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~-l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i~~i~E 159 (292) .+...+ +-+..|.+..+-+.++-+.||+-|.+.. +..++..+.+++.+.+-...+..-|...++-.+..+ -+.+... T Consensus 71 s~~dvl-~R~~~a~~~~~~d~ivri~gD~P~idp~~id~~i~~~~~~~~Dyvsn~~~~~~P~G~~~ei~s~~-~L~~~~~ 148 (233) T cd02518 71 SEEDVL-GRYYQAAEEYNADVVVRITGDCPLIDPEIIDAVIRLFLKSGADYTSNTLPRTYPDGLDVEVFTRD-ALERAAA 148 (233) T ss_pred CCCHHH-HHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCEEECCCCCCCCCCEEEEEEHHH-HHHHHHH T ss_conf 830588-99999987157888999768777589899999999985079998944789988881699988199-9999997 Q ss_pred CCCCCCCCCCHHHHHHHHH-HHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHHHHHHHHHHH Q ss_conf 3333222222024554100-022432200467533333334320000013204631354125632784577999999999 Q gi|254780922|r 160 KPNNPKSSFAVTGIYFYDQ-EVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPESLLDTAVFVR 238 (292) Q Consensus 160 KP~~~~sn~a~~GiY~f~~-~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As~~i~ 238 (292) +.+++ ...--++.|++++ +.|... .+.+ .. ... .. ..+.+.|++||.-+....+ T Consensus 149 ~~~~~-~~rEHvt~y~~~n~~~fki~-~~~~------------------~~---~~~-~~-~rltlD~~eD~~~~~~I~~ 203 (233) T cd02518 149 EADDP-YEREHVTPYIRRHPELFRIG-YLEA------------------PP---DRL-SD-LRLTVDTPEDFELIKEIYE 203 (233) T ss_pred HCCCC-CCCCCCHHHHHHCCHHCEEE-EECC------------------CC---CCC-CC-CEEEECCHHHHHHHHHHHH T ss_conf 44784-42100039988492022488-7448------------------74---346-55-5698368999999999999 Q ss_pred HHHHHCCCEEEEHHHHHHHCCCCCHHHHHHHHHHCCC Q ss_conf 9888809866209999988689898999998864258 Q gi|254780922|r 239 NIENRLGLYVACPEEIAYRHDFINESQFFQLIDHFGN 275 (292) Q Consensus 239 ~~e~~~g~~i~~~ee~a~~~~~i~~~~l~~~~~~~~~ 275 (292) ..- ..+..++-+++.+.+...|+ T Consensus 204 ~l~--------------~~~~~~s~~dIi~~l~knp~ 226 (233) T cd02518 204 ALY--------------PKNPDFSLEDIIELLDKNPE 226 (233) T ss_pred HHC--------------CCCCCCCHHHHHHHHHHCCC T ss_conf 855--------------77899899999999997966 No 79 >cd02513 CMP-NeuAc_Synthase CMP-NeuAc_Synthase activates N-acetylneuraminic acid by adding CMP moiety. CMP-N-acetylneuraminic acid synthetase (CMP-NeuAc synthetase) or acylneuraminate cytidylyltransferase catalyzes the transfer the CMP moiety of CTP to the anomeric hydroxyl group of NeuAc in the presence of Mg++. It is the second to last step in the sialylation of the oligosaccharide component of glycoconjugates by providing the activated sugar-nucleotide cytidine 5'-monophosphate N-acetylneuraminic acid (CMP-Neu5Ac), the substrate for sialyltransferases. Eukaryotic CMP-NeuAc synthetases are predominantly located in the nucleus. The activated CMP-Neu5Ac diffuses from the nucleus into the cytoplasm. Probab=98.51 E-value=1.4e-07 Score=67.70 Aligned_cols=200 Identities=15% Similarity=0.229 Sum_probs=105.4 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) |||+|=|--+|+. .|.+.+++|||||+|+++.+..++ +++|+| ++ .+.++. ++ .+.+|....+..-. T Consensus 4 aiIpAR~gSkri~------~KN~~~i~gkpLi~~~i~~a~ks~~~d~Iiv-ST-Ds~~i~-~~---~~k~~~~~~~~Rpk 71 (223) T cd02513 4 AIIPARGGSKGIP------GKNIRPLGGKPLIAWTIEAALESKLFDRVVV-ST-DDEEIA-EV---ARKYGAEVPFLRPA 71 (223) T ss_pred EEECCCCCCCCCC------CHHHHHHCCCCHHHHHHHHHHHCCCCCEEEE-EC-CHHHHH-HH---HCCCCCEEEECCCH T ss_conf 9986787787899------7004463895969999999996699550789-55-878975-42---12456417973846 Q ss_pred CC----CCCCCCCCCCCCCCCC-----CCCCEEECCCCCCC-CCCCCHHHHHCCCHHHHHCCCCCCCC-CCCCCCCCCCC Q ss_conf 32----2446852101112335-----67521321443211-11000011100000111001222210-00122212222 Q gi|254780922|r 82 VP----AGLAQSYILGAEFIGD-----SSSVLILGDNVFYG-SDISDIFHKARARRNSATVVGCHVQN-PQRYGVVEVDS 150 (292) Q Consensus 82 ~p----~Gta~Ai~~a~~~i~~-----~~~~lilgDni~~~-~~l~~~l~~~~~~~~~~ti~~~~v~~-p~~yGvv~~d~ 150 (292) +- ..+-+.+..+.++++. +-++++.+..-|.. .++.+.++....++.+..+.+.+..+ |.+... .+. T Consensus 72 ~ls~d~~~~~dvi~~~l~~~~~~~~~~d~iv~l~pTsP~r~~~~I~~~i~~~~~~k~dsl~sv~~~~~~~~~~~~--~~~ 149 (223) T cd02513 72 ELATDTASSIDVILHALDQLEELGRDFDIVVLLQPTSPLRSAEDIDEAIELLLSEGADSVFSVTEFHRFPWRALG--LDD 149 (223) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHEE--ECC T ss_conf 876886780677778998899868998999996279762789999999999985799999999854768188567--626 Q ss_pred CCCHHHCCCC-CCC------CCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEEC Q ss_conf 1000100133-333------222222024554100022432200467533333334320000013204631354125632 Q gi|254780922|r 151 SNQAISIEEK-PNN------PKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFD 223 (292) Q Consensus 151 ~~~i~~i~EK-P~~------~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D 223 (292) ++ ...+..+ ... ++.-..++++|+++.+.+..-+. ....+ .....+.. ..-+| T Consensus 150 ~~-~~~~~~~~~~~~~rQ~~~~~y~~~g~~yi~~~~~~~k~~~-----------------~~~~k-~~~~~~~~-~~siD 209 (223) T cd02513 150 NG-LEPVNYPEDKRTRRQDLPPAYHENGAIYIAKREALLESNS-----------------FFGGK-TGPYEMPR-ERSID 209 (223) T ss_pred CC-CCCCCCCHHHCCCCCCCCEEEEECCEEEEEEHHHHHHCCC-----------------CCCCC-EEEEEECC-CCEEC T ss_conf 87-0213686110676446760478604899999999985498-----------------03897-79999488-71684 Q ss_pred CCCHHHHHHHHHH Q ss_conf 7845779999999 Q gi|254780922|r 224 AGTPESLLDTAVF 236 (292) Q Consensus 224 ~Gt~e~l~~As~~ 236 (292) +.|++||.-|... T Consensus 210 ID~~~Dl~~Ae~l 222 (223) T cd02513 210 IDTEEDFELAEAL 222 (223) T ss_pred CCCHHHHHHHHHH T ss_conf 8999999999986 No 80 >PTZ00339 UDP-N-acetylglucosamine pyrophosphorylase; Provisional Probab=98.44 E-value=2.9e-07 Score=65.75 Aligned_cols=178 Identities=17% Similarity=0.255 Sum_probs=98.6 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC---CEEHHHHHHHHHHH--------C--C---CCEEEEEECCCCCCCCCCCCC Q ss_conf 799248887455334438982222478---37479999999997--------8--9---868999977765643111222 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIY---NKPMIYYPVSTLMD--------A--G---IREILIISTPRDLPVLKEFLG 66 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~---gkplI~~~l~~l~~--------~--G---i~~I~ii~~~~~~~~~~~~l~ 66 (292) +|+||||.||||. .+.||-|.||+ +|++.+...+.+.+ . + .=-++|.|+....+....+|. T Consensus 112 vlllAGGqGTRLG---~~~PKG~~~igl~s~ksLfql~aeri~~lq~la~~~~~~~~~~~ip~yIMTS~~t~~~T~~ff~ 188 (499) T PTZ00339 112 VLILAGGLGTRLG---SDKPKGLLECTPLKKKSLFQFHCEKIRRLEEMAAAASGGGDDPTIYILVLTSSFNHDQTRQFLQ 188 (499) T ss_pred EEEECCCCCCCCC---CCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCHHHHHHHHHH T ss_conf 9995078746679---9999711115899987199999999999999988750678888603999778504699999998 Q ss_pred CCCCCCCC---CCCCCCC----------------------CCCCCCCCCCCCCC-------CCCC------CCCCEEECC Q ss_conf 22222111---1222333----------------------32244685210111-------2335------675213214 Q gi|254780922|r 67 SGEKWGVQ---FSYIEQL----------------------VPAGLAQSYILGAE-------FIGD------SSSVLILGD 108 (292) Q Consensus 67 ~g~~~g~~---i~y~~q~----------------------~p~Gta~Ai~~a~~-------~i~~------~~~~lilgD 108 (292) .-..||++ +.|..|. .|.|.|+-+ .|.. .+.+ .-.-+..=| T Consensus 189 ~~~~FGl~~~~V~fF~Q~~lP~~d~~~Gkille~~~~i~~aPdGNGG~y-~AL~~sg~~~~iL~dm~~rGI~yi~~~~VD 267 (499) T PTZ00339 189 ENNFFGLDKEQVIFFKQSSLPCYDENTGRVLMESRGSLCTAPGGNGDVF-KALAKCSEFMSILDKLEKLGIKYVQIISID 267 (499) T ss_pred HCCCCCCCHHHEEEEECCCEEEEECCCCEEEECCCCCEEECCCCCHHHH-HHHHHCCCHHHHHHHHHHCCCEEEEEEEEC T ss_conf 6777788757689997387336854799687558880687789963899-998862431127999997498699999865 Q ss_pred CCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCC-CCCCCC-CCCCCCCHHHCCCCCCC------CC-----CCCCHHHHHH Q ss_conf 43211110000111000001110012222100-012221-22221000100133333------22-----2222024554 Q gi|254780922|r 109 NVFYGSDISDIFHKARARRNSATVVGCHVQNP-QRYGVV-EVDSSNQAISIEEKPNN------PK-----SSFAVTGIYF 175 (292) Q Consensus 109 ni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p-~~yGvv-~~d~~~~i~~i~EKP~~------~~-----sn~a~~GiY~ 175 (292) |++...-=..++=.+.....++..-++.-.+| ++-||+ ..|..-.|++.-|-++. +. -+.++++..+ T Consensus 268 N~L~kvaDP~FiG~~~~~~~~~~~Kvv~K~~p~E~VGvv~~~~g~~~VvEYSEi~~~~ae~~~~~~~l~~f~agnI~~H~ 347 (499) T PTZ00339 268 NILAKVADPEFIGLASSFPAHVVLNKCPKRSDDESVGVFCLKDDEWQVVEYTEINERILENKDATTNLKQFNCGNICSHI 347 (499) T ss_pred CCCCCCCCCHHHHHHHHCCCCHHCEEEECCCCCCCCCEEEEECCEEEEEEECCCCHHHHHCCCCCCCEEECCCCCHHHEE T ss_conf 33324678066348776486100027872689876543898889268997036687776277887760102434432136 Q ss_pred HHHHHHHHH Q ss_conf 100022432 Q gi|254780922|r 176 YDQEVVNIA 184 (292) Q Consensus 176 f~~~i~~~~ 184 (292) |+-++++.+ T Consensus 348 fs~~FL~~v 356 (499) T PTZ00339 348 FSLDFLKKV 356 (499) T ss_pred ECHHHHHHH T ss_conf 319999988 No 81 >TIGR03584 PseF pseudaminic acid CMP-transferase. The sequences in this family include the pfam02348 (cytidyltransferase) domain and are homologous to the NeuA protein responsible for the transfer of CMP to neuraminic acid. According to, this gene is responsible for the transfer of CMP to the structurally related sugar, pseudaminic acid which is observed as a component of sugar modifications of flagellin in Campylobacter species. This gene is commonly observed in apparent operons with other genes responsible for the biosynthesis of pseudaminic acid and as a component of flagellar and exopolysaccharide biosynthesis loci. Probab=98.30 E-value=7e-07 Score=63.36 Aligned_cols=202 Identities=15% Similarity=0.212 Sum_probs=106.6 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7992488874553344389822224783747999999999789-868999977765643111222222221111222333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL 81 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~ 81 (292) |||+|=|--+|+. .|-+.+++|+|||+|.++.+..++ +++|+ |++ .+.++. + -++.+|.++.|..-. T Consensus 2 aiIPaR~gSkri~------~KN~~~~~g~pLi~~~i~~~~~s~~~d~i~-vST-D~~~i~-~---ia~~~~~~~~~~Rp~ 69 (222) T TIGR03584 2 AIIPARGGSKRIP------RKNIKPFCGKPMIAYSIEAALNSGLFDEVV-VST-DDEEIA-E---VAKSYGASVPFMRPA 69 (222) T ss_pred EEECCCCCCCCCC------CCCHHHHCCCCHHHHHHHHHHHCCCCCEEE-ECC-CHHHHH-H---HHHCCCCCCEEECCH T ss_conf 8984377887678------601256589285999999998579965388-727-889987-5---552169864251865 Q ss_pred CCC----CCCCCCCCCCCCCCC---CC-CCEEECCCCCC-CCCCCCHHHHHCCCHHHHHCCCCCCC-CCCCCCCCCCCCC Q ss_conf 322----446852101112335---67-52132144321-11100001110000011100122221-0001222122221 Q gi|254780922|r 82 VPA----GLAQSYILGAEFIGD---SS-SVLILGDNVFY-GSDISDIFHKARARRNSATVVGCHVQ-NPQRYGVVEVDSS 151 (292) Q Consensus 82 ~p~----Gta~Ai~~a~~~i~~---~~-~~lilgDni~~-~~~l~~~l~~~~~~~~~~ti~~~~v~-~p~~yGvv~~d~~ 151 (292) +-. .+.+++..+.+++.. .+ ++++.+-.-|. ..++.+.++...+++.+..+.+.+.. .|.+- ..++++ T Consensus 70 ~ls~d~~~~~~vi~~~l~~~~~~~~~d~i~~l~pTsP~r~~~~I~~ai~~~~~~~~ds~~sv~~~~~~~~~~--~~~~~~ 147 (222) T TIGR03584 70 ELSDDFTGTAPVVAHAIERLQLQKQPDHACCIYATAPLLQAKILKEAFELLKQPNKHFVFTVTEFAFPIQRA--FSLKEN 147 (222) T ss_pred HHCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCCCHHHH--EEECCC T ss_conf 555886214599999999998658999899954897615899999999999726998550653377685893--798679 Q ss_pred CCHHHCCCCCCCC-----CCCC-CHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCC Q ss_conf 0001001333332-----2222-202455410002243220046753333333432000001320463135412563278 Q gi|254780922|r 152 NQAISIEEKPNNP-----KSSF-AVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAG 225 (292) Q Consensus 152 ~~i~~i~EKP~~~-----~sn~-a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~G 225 (292) +.+.-+..+...+ +..+ .+.++|+++.+.+..-+. ....+ .....+... ..+|+. T Consensus 148 ~~~~~~~~~~~~~r~Qdl~~~y~~~G~~y~~~~~~~~~~~~-----------------~~~~~-~~~~~~~~~-~siDID 208 (222) T TIGR03584 148 GGVTMFFPEHYNTRSQDLEEAYHDAGQFYWGKSQAWLESGP-----------------IFSPH-SIPIILPRH-LVQDID 208 (222) T ss_pred CCEEECCHHHHHHHHHHCCCCEEECCEEEEEEHHHHHHCCC-----------------CCCCC-EEEEEECCC-CCCCCC T ss_conf 94633474552024442443375447899988999984497-----------------02898-799998998-628999 Q ss_pred CHHHHHHHHHHH Q ss_conf 457799999999 Q gi|254780922|r 226 TPESLLDTAVFV 237 (292) Q Consensus 226 t~e~l~~As~~i 237 (292) |++||.-|.... T Consensus 209 ~~~D~~~Ae~l~ 220 (222) T TIGR03584 209 TLEDWERAELLY 220 (222) T ss_pred CHHHHHHHHHHH T ss_conf 999999999986 No 82 >cd04180 UGPase_euk_like Eukaryotic UGPase-like includes UDPase and UDPGlcNAc pyrophosphorylase enzymes. This family includes UDP-Glucose Pyrophosphorylase (UDPase) and UDPGlcNAc pyrophosphorylase enzymes. The two enzymes share significant sequence and structure similarity. UDP-Glucose Pyrophosphorylase catalyzes a reversible production of UDP-Glucose and pyrophosphate (PPi) from Glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids , glycoproteins , and proteoglycans . UDP-N-acetylglucosamine (UDPGlcNAc) pyrophosphorylase (UAP) (also named GlcNAc1P uridyltransferase), catalyzes the reversible conversion of UTP and GlcNAc1P from PPi and UDPGlcNAc, which is a key precursor of N- and O-linked glycosylations and is essential for the synthesis of chitin (a major component of the fungal cell wall) and of the glycosylphosphatidylinositol (GPI) linker anchoring a variety o Probab=98.21 E-value=8.3e-07 Score=62.89 Aligned_cols=157 Identities=13% Similarity=0.068 Sum_probs=83.5 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC---CEEHHHHHHHHHHH------CCCC-EEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 799248887455334438982222478---37479999999997------8986-8999977765643111222222221 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIY---NKPMIYYPVSTLMD------AGIR-EILIISTPRDLPVLKEFLGSGEKWG 72 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~---gkplI~~~l~~l~~------~Gi~-~I~ii~~~~~~~~~~~~l~~g~~~g 72 (292) +|+||||.||||. ...||-|+||. ++++++..++.+.. .|.. -.+|.++..-.+...+++..-.... T Consensus 3 vllLaGGlGTrLG---~~~pKg~~~v~~~~~~t~~~l~~~~i~~l~~~~~~~~~iPl~iMtS~~T~~~T~~~~~~~~~~~ 79 (266) T cd04180 3 VVLLAGGLGTRLG---KDGPKSSTDVGLPSGQCFLQLIGEKILTLQEIDLYSCKIPEQLMNSKYTHEKTQCYFEKINQKN 79 (266) T ss_pred EEEECCCCCCCCC---CCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHCCCCC T ss_conf 9996687623469---9997370672589998599999999999999861799831999668004479999999839996 Q ss_pred CCCCCCCCC---------------------CCCCCCCCCCC--CCC----CC--CCCCCCEEECCCCCCCCCCCCHHHHH Q ss_conf 111222333---------------------32244685210--111----23--35675213214432111100001110 Q gi|254780922|r 73 VQFSYIEQL---------------------VPAGLAQSYIL--GAE----FI--GDSSSVLILGDNVFYGSDISDIFHKA 123 (292) Q Consensus 73 ~~i~y~~q~---------------------~p~Gta~Ai~~--a~~----~i--~~~~~~lilgDni~~~~~l~~~l~~~ 123 (292) .++.+..|. .|.|-|+-... ... +. +-+-..+...||+....--..++-.+ T Consensus 80 ~~v~~f~Q~~~P~~~~~g~~~~~~~~~~~~~P~GhGdi~~aL~~sG~Ld~l~~~Gieyi~v~~vDN~la~v~DP~flG~~ 159 (266) T cd04180 80 SYVITFMQGKLPLKNDDDARDPHNKTKCHLFPCGHGDVVLALIHSGHLNKLLEKGYRYIHFIGVDNLLVKVADPLFIGIA 159 (266) T ss_pred CCEEEEECCCCCEECCCCCCCCCCCCCEEECCCCCCHHHHHHHHCCCHHHHHHCCCEEEEEEECCCCCCHHHHHHHHHHH T ss_conf 64499952886148578871679999424777996279999997780999996698899999668666002479999999 Q ss_pred CCCHHHHHCCCCCCCCCC-CCCCCCCCCCC--CHHHCCCCCC Q ss_conf 000011100122221000-12221222210--0010013333 Q gi|254780922|r 124 RARRNSATVVGCHVQNPQ-RYGVVEVDSSN--QAISIEEKPN 162 (292) Q Consensus 124 ~~~~~~~ti~~~~v~~p~-~yGvv~~d~~~--~i~~i~EKP~ 162 (292) .++..++.+-+++-.+|. +=|++....+| +|++.-|=|+ T Consensus 160 ~~~~~~~~~kvv~Kt~~dek~G~l~~~~~g~~~vvEyse~~~ 201 (266) T cd04180 160 IQNRKAINQKVVPKTRNEESGGYRIANINGRVQLLEYDQIKK 201 (266) T ss_pred HHHCCCEEEEEEECCCCCCEEEEEEEEECCCEEEEEEECCCH T ss_conf 971997589997899898604599998089279999734999 No 83 >TIGR00466 kdsB 3-deoxy-D-manno-octulosonate cytidylyltransferase; InterPro: IPR004528 3-Deoxy-D-manno-octulosonate cytidylyltransferase (2.7.7.38 from EC) activates KDO, a required 8-carbon sugar, for incorporation into bacterial lipopolysaccharide in Gram negative bacteria. It acts as a homodimer and catalyses the conversion of CTP and 3-deoxy-D-manno-octulosonate into CMP-3-deoxy-D-manno-octulosonate and pyrophosphate.; GO: 0008690 3-deoxy-manno-octulosonate cytidylyltransferase activity, 0009103 lipopolysaccharide biosynthetic process, 0005737 cytoplasm. Probab=98.07 E-value=5.5e-06 Score=57.72 Aligned_cols=214 Identities=19% Similarity=0.238 Sum_probs=126.5 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 79924888745533443898222247837479999999997898689999777656431112222222211112223333 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLV 82 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~ 82 (292) .||||==..|||- .|||.-|.|+|||.|.+++...+|+++++|.|. +.++.+ . .+.+|.+.--..... T Consensus 5 vIIPAR~~SsRl~------~K~L~DI~G~PMi~~v~~~A~~s~~~~~ivA~D--~E~v~~-~---~~~~~~~~~~T~~~H 72 (246) T TIGR00466 5 VIIPARLASSRLP------GKPLEDIKGKPMIVHVLEKANESGADRVIVATD--DESVAQ-T---VKKFGIEVCLTSKDH 72 (246) T ss_pred EEECCCCCCCCCC------CCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECC--CHHHHH-H---HHHCCCEEEEECCCC T ss_conf 9853653113589------872035578936889999975236356899748--726999-9---997297189851667 Q ss_pred CCCCCCCCCCCCC-CCCCCCCCE-EECCCCCCCCCCCC-HHHHHCCCHHHHHCCCCCCCC------CCCCCCCCCCCCCC Q ss_conf 2244685210111-233567521-32144321111000-011100000111001222210------00122212222100 Q gi|254780922|r 83 PAGLAQSYILGAE-FIGDSSSVL-ILGDNVFYGSDISD-IFHKARARRNSATVVGCHVQN------PQRYGVVEVDSSNQ 153 (292) Q Consensus 83 p~Gta~Ai~~a~~-~i~~~~~~l-ilgDni~~~~~l~~-~l~~~~~~~~~~ti~~~~v~~------p~~yGvv~~d~~~~ 153 (292) ..||--..-.++. -+.++..++ +-||.-+.....-. +++.-.++.....-.+.++.| |..-= |.+|++|. T Consensus 73 ~SGTERl~E~~~~la~~~d~~i~N~QGDEP~~~~~ii~~~~~nLA~~~~~~~~~a~ki~d~E~~~~PN~VK-VvlD~~~~ 151 (246) T TIGR00466 73 NSGTERLAEVVEKLALKDDEVIVNLQGDEPLIPKEIIRQVADNLAKKNVPMAALAVKIHDAEEAFNPNAVK-VVLDSEGY 151 (246) T ss_pred CCCHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCEEE-EEEECCCC T ss_conf 85046899999984789885799820898765389999999988630401121122222121205997479-99707886 Q ss_pred HHHCCC-----------CCCCC----CC--CCCHHHHHHHH-HHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEEC Q ss_conf 010013-----------33332----22--22202455410-00224322004675333333343200000132046313 Q gi|254780922|r 154 AISIEE-----------KPNNP----KS--SFAVTGIYFYD-QEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFL 215 (292) Q Consensus 154 i~~i~E-----------KP~~~----~s--n~a~~GiY~f~-~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~ 215 (292) +.=|-- +-.++ -| -+--.|||-|. .++++-.-+++|+.--++|.-.=++.+-...+..+... T Consensus 152 ~LYFSRs~IP~~RD~~~~~~~~~~~~~~D~~L~HiGIYgy~n~~~~~~~v~~~P~~LE~iE~LEQLR~Lyy~~kI~V~~~ 231 (246) T TIGR00466 152 ALYFSRSLIPFDRDRFAKRQEVQKVQLSDALLRHIGIYGYRNAGFIKEYVAWKPSVLEEIEKLEQLRVLYYGEKIHVKIA 231 (246) T ss_pred EEEEECCCCCCCHHHHHHHCCCCEEEHHHHHHHHHCHHHCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 78864021542001244305510020113445442211100247999997228874024566788888760563689875 Q ss_pred CCCEEEECCCCHHHH Q ss_conf 541256327845779 Q gi|254780922|r 216 REGSAWFDAGTPESL 230 (292) Q Consensus 216 ~~g~~W~D~Gt~e~l 230 (292) ...- -.-+.|+||| T Consensus 232 ~~~P-~vGvDT~EDL 245 (246) T TIGR00466 232 QEVP-SVGVDTQEDL 245 (246) T ss_pred ECCC-CCCCCCHHHC T ss_conf 0378-8677862314 No 84 >cd06424 UGGPase UGGPase catalyzes the synthesis of UDP-Glucose/UDP-Galactose. UGGPase: UDP-Galactose/Glucose Pyrophosphorylase catalyzes the reversible production of UDP-Glucose/UDP-Galactose and pyrophosphate (PPi) from Glucose-1-phosphate/Galactose-1-phosphate and UTP. Its dual substrate specificity distinguishes it from the single substrate enzyme UDP-glucose pyrophosphorylase. It may play a key role in the galactose metabolism in raffinose oligosaccharide (RFO) metabolizing plants. RFO raffinose is a major photoassimilate and is a galactosylderivative of sucrose (Suc) containing a galactose (Gal) moiety. Upon arriving at the sink tissue, the Gal moieties of the RFOs are initially removed by alpha-galactosidase and then are phosphorylated to Gal-1-P. Gal-1-P is converted to UDP-Gal. The UDP-Gal is further metabolized to UDP-Glc via an epimerase reaction. The UDP-Glc can be directly utilized in cell wall metabolism or in Suc synthesis. However, for the Suc synthesis UDP-Glc must be f Probab=98.04 E-value=6.7e-06 Score=57.19 Aligned_cols=75 Identities=23% Similarity=0.317 Sum_probs=55.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC---CEEHHHHHHHHHHH----CC------CCEEEEEECCCCCCCCCCCCCCCC Q ss_conf 799248887455334438982222478---37479999999997----89------868999977765643111222222 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIY---NKPMIYYPVSTLMD----AG------IREILIISTPRDLPVLKEFLGSGE 69 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~---gkplI~~~l~~l~~----~G------i~~I~ii~~~~~~~~~~~~l~~g~ 69 (292) -|+||||.||||. ..-||-++||. +++++++.++.+.. ++ | -++|.++....+...+++..-. T Consensus 3 vvllaGG~GTRLG---~~~pK~~l~v~~~~~~tl~~l~~~~i~~~~~~~~~~~~~~i-P~~iMtS~~t~~~T~~~~~~~~ 78 (315) T cd06424 3 FVLVAGGLGERLG---YSGIKIGLPVELTTNTTYLQYYLNYIRAFQEASKKGEKMEI-PFVIMTSDDTHSKTLKLLEENN 78 (315) T ss_pred EEEECCCCCCCCC---CCCCCCCEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCC-CEEEECCCCCHHHHHHHHHHCC T ss_conf 9997787843248---99885560127889980999999999999998605789886-5799789873699999998663 Q ss_pred CCCC---CCCCCCCC Q ss_conf 2211---11222333 Q gi|254780922|r 70 KWGV---QFSYIEQL 81 (292) Q Consensus 70 ~~g~---~i~y~~q~ 81 (292) .+|. ++.+..|. T Consensus 79 ~FGl~~~~v~~f~Q~ 93 (315) T cd06424 79 YFGLEKDQVHILKQE 93 (315) T ss_pred CCCCCCCCEEEEECC T ss_conf 036885455998438 No 85 >cd00761 Glyco_tranf_GTA_type Glycosyltransferase family A (GT-A) includes diverse families of glycosyl transferases with a common GT-A type structural fold. Glycosyltransferases (GTs) are enzymes that synthesize oligosaccharides, polysaccharides, and glycoconjugates by transferring the sugar moiety from an activated nucleotide-sugar donor to an acceptor molecule, which may be a growing oligosaccharide, a lipid, or a protein. Based on the stereochemistry of the donor and acceptor molecules, GTs are classified as either retaining or inverting enzymes. To date, all GT structures adopt one of two possible folds, termed GT-A fold and GT-B fold. This hierarchy includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. The majority of the proteins in this superfamily are Glycosyltransferase family 2 (GT-2) proteins. But it als Probab=97.73 E-value=2.9e-05 Score=53.23 Aligned_cols=93 Identities=14% Similarity=0.166 Sum_probs=62.1 Q ss_pred EEECC-EEHHHHHHHHHHHCC--CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 24783-747999999999789--868999977765643111222222221111222333322446852101112335675 Q gi|254780922|r 26 LPIYN-KPMIYYPVSTLMDAG--IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDSSS 102 (292) Q Consensus 26 lpi~g-kplI~~~l~~l~~~G--i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~~ 102 (292) +|.+| .|+|.+.|+.+.+.+ ..+|+|+++....+..+.+. ........+.+..+....|+++++..+.+....+.+ T Consensus 3 i~~~n~~~~l~~~l~si~~~~~~~~eiiiid~~s~~~~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~~n~~~~~~~~~~v 81 (156) T cd00761 3 IPAYNEEPYLERCLESLLAQTYPNFEVIVVDDGSTDGTLEILE-EYAKKDPRVIRVINEENQGLAAARNAGLKAARGEYI 81 (156) T ss_pred EEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHH-HHCCCCCEEEEECCCCCCCHHHHHHHHHHHCCCCEE T ss_conf 9988988999999999982899897999998999800221000-001456638995056787846887788874899999 Q ss_pred CEEECCCCCCCCCCCCH Q ss_conf 21321443211110000 Q gi|254780922|r 103 VLILGDNVFYGSDISDI 119 (292) Q Consensus 103 ~lilgDni~~~~~l~~~ 119 (292) +++-+|+++....+..+ T Consensus 82 ~~~d~D~~~~~~~~~~~ 98 (156) T cd00761 82 LFLDADDLLLPDWLERL 98 (156) T ss_pred EEECCCCCCCHHHHHHH T ss_conf 99999985787499999 No 86 >COG1861 SpsF Spore coat polysaccharide biosynthesis protein F, CMP-KDO synthetase homolog [Cell envelope biogenesis, outer membrane] Probab=97.72 E-value=1.6e-05 Score=54.86 Aligned_cols=119 Identities=18% Similarity=0.305 Sum_probs=77.0 Q ss_pred CEEEEECCCCC-CCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887-4553344389822224783747999999999789-868999977765643111222222221111222 Q gi|254780922|r 1 MKGIVLAGGSG-TRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYI 78 (292) Q Consensus 1 MkaiIlAaG~G-tRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~ 78 (292) |-++|+.|-.| |||. -|.|+|++++|||++.|+++..+. +.+|+|.|+-...+. .+.+.....|+. + T Consensus 3 ~I~~IiQARmgStRLp------gKvLlpL~~~pmI~~~lervrks~~~d~ivvATS~~~~d~--~l~~~~~~~G~~---v 71 (241) T COG1861 3 MILVIIQARMGSTRLP------GKVLLPLGGEPMIEYQLERVRKSKDLDKIVVATSDKEEDD--ALEEVCRSHGFY---V 71 (241) T ss_pred CEEEEEEECCCCCCCC------CCHHHHCCCCCHHHHHHHHHHCCCCCCCEEEEECCCCCHH--HHHHHHHHCCEE---E T ss_conf 4798853002676578------6133330787419999999860433243589734772045--799999970824---7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCCC-CHHHHHCCCHHHH Q ss_conf 333322446852101112335675213214432111100-0011100000111 Q gi|254780922|r 79 EQLVPAGLAQSYILGAEFIGDSSSVLILGDNVFYGSDIS-DIFHKARARRNSA 130 (292) Q Consensus 79 ~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDni~~~~~l~-~~l~~~~~~~~~~ 130 (292) .+..+.-.-+-+..|.+..+.+-++=+-||+-|.+..+. ..+..|.+++.+- T Consensus 72 frGs~~dVL~Rf~~a~~a~~~~~VVRvTGD~P~~dp~l~d~~v~~~l~~gaDY 124 (241) T COG1861 72 FRGSEEDVLQRFIIAIKAYSADVVVRVTGDNPFLDPELVDAAVDRHLEKGADY 124 (241) T ss_pred ECCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 63888999999999997358876999608998778899999999998659860 No 87 >COG4284 UDP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=97.70 E-value=8.3e-05 Score=50.33 Aligned_cols=173 Identities=15% Similarity=0.183 Sum_probs=92.5 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC-CEEHHHHHHHHHH----HCC--CCEEEEEECCCCCCCCCCCCCCCCCCCC-- Q ss_conf 799248887455334438982222478-3747999999999----789--8689999777656431112222222211-- Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIY-NKPMIYYPVSTLM----DAG--IREILIISTPRDLPVLKEFLGSGEKWGV-- 73 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~-gkplI~~~l~~l~----~~G--i~~I~ii~~~~~~~~~~~~l~~g~~~g~-- 73 (292) .|.||||.||||. ..-||.|++|. |+++++.+.+... ..+ +-- +|.++....+. ..++..-.-.|. T Consensus 108 vl~LaGGqGtrlG---~~gPKgl~~V~~gks~~dl~~~qIk~ln~~~~~~vP~-~iMtS~nt~~t-~s~f~~~~Y~~~~k 182 (472) T COG4284 108 VLKLAGGQGTRLG---CDGPKGLFEVKDGKSLFDLQAEQIKYLNRQYNVDVPL-YIMTSLNTEET-DSYFKSNDYFGLDK 182 (472) T ss_pred EEEECCCCCCCCC---CCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCE-EEEECCCCHHH-HHHHHHHHHCCCCH T ss_conf 9994488665335---6787526894589849999999999999973899777-98826782777-77776622037777 Q ss_pred -CCCCCCCC-----------------------CCCCCCCC---CCC---CCCCC--CCCCCCEEECCCCCCCCCCCCHHH Q ss_conf -11222333-----------------------32244685---210---11123--356752132144321111000011 Q gi|254780922|r 74 -QFSYIEQL-----------------------VPAGLAQS---YIL---GAEFI--GDSSSVLILGDNVFYGSDISDIFH 121 (292) Q Consensus 74 -~i~y~~q~-----------------------~p~Gta~A---i~~---a~~~i--~~~~~~lilgDni~~~~~l~~~l~ 121 (292) .|.+..|. -|-|.|+- +.. -.+++ +-+-..+..-||+-...|+ .++. T Consensus 183 ~~I~fF~Q~~~P~~~~~sg~~~~~~~~~~~~~~P~GnG~lf~aL~~SG~le~l~~~G~e~lfV~nIDNL~~~vD~-~~lg 261 (472) T COG4284 183 EDIFFFVQSLFPRLLSDSGLPFLESDDSNLAWYPPGNGDLFKALKSSGILEKLIAQGIEYLFVSNIDNLGATVDL-KFLG 261 (472) T ss_pred HHEEEEECCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCHHHHHHHCCCEEEEEECCCCCCCCCCH-HHHH T ss_conf 783777247744356566852214589631017899860899987513699998568369999514312344589-9999 Q ss_pred HHCCCHHHHHCCCCC-CCCCCCCCCCC-CCCCCCHHHCCCCCCCCC----------CCCCH-HHHHHHHHHHH Q ss_conf 100000111001222-21000122212-222100010013333322----------22220-24554100022 Q gi|254780922|r 122 KARARRNSATVVGCH-VQNPQRYGVVE-VDSSNQAISIEEKPNNPK----------SSFAV-TGIYFYDQEVV 181 (292) Q Consensus 122 ~~~~~~~~~ti~~~~-v~~p~~yGvv~-~d~~~~i~~i~EKP~~~~----------sn~a~-~GiY~f~~~i~ 181 (292) .+...+.+.++-++. .+..++=|++. .|+.-++++.-|=|.+.. -..++ .++|++.-+.+ T Consensus 262 ~~~~~~~e~~~e~t~Kt~a~ekvG~Lv~~~g~~rllEysev~~~~~~~~~s~~~~~~~n~Nni~l~~~~~~~l 334 (472) T COG4284 262 FMAETNYEYLMETTDKTKADEKVGILVTYDGKLRLLEYSEVPNEHREEFTSDGKLKYFNTNNIWLHLFSVKFL 334 (472) T ss_pred HHHHCCCCEEEEEEECCCCCCCCEEEEEECCCEEEEEEECCCHHHHHHHCCCCCEEEECCCCCEEEHHHHHHH T ss_conf 9986275325787522456664117998679567899840784674320462212354264222026677888 No 88 >cd00897 UGPase_euk Eukaryotic UGPase catalyses the synthesis of UDP-Glucose. UGPase (UDP-Glucose Pyrophosphorylase) catalyzes the reversible production of UDP-Glucose and pyrophosphate (PPi) from Glucose-1-phosphate and UTP. UDP-glucose plays pivotal roles in galactose utilization, in glycogen synthesis, and in the synthesis of the carbohydrate moieties of glycolipids, glycoproteins, and proteoglycans. UGPase is found in both prokaryotes and eukaryotes. Interestingly, while the prokaryotic and eukaryotic forms of UGPase catalyze the same reaction, they share low sequence similarity. This family consists of mainly eukaryotic UTP-glucose-1-phosphate uridylyltransferases. Probab=97.34 E-value=0.00035 Score=46.44 Aligned_cols=156 Identities=15% Similarity=0.196 Sum_probs=78.2 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC-CEEHHHHHHHHH----HHCCCCE-EEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 799248887455334438982222478-374799999999----9789868-9999777656431112222222211112 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIY-NKPMIYYPVSTL----MDAGIRE-ILIISTPRDLPVLKEFLGSGEKWGVQFS 76 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~-gkplI~~~l~~l----~~~Gi~~-I~ii~~~~~~~~~~~~l~~g~~~g~~i~ 76 (292) .|.||||.||||. ...||.|+||- |+++++..++.+ ...|..= .+|.++..-.+.-.+++........++. T Consensus 6 vlkLnGGlGTrlG---~~~pK~~i~V~~~~tfldl~~~qi~~l~~~yg~~vPl~iMtS~~T~~~T~~~l~ky~~~~~~i~ 82 (300) T cd00897 6 VLKLNGGLGTSMG---CTGPKSLIEVRDGKTFLDLTVQQIEHLNKTYGVDVPLVLMNSFNTDEDTKKILKKYAGVNVDIH 82 (300) T ss_pred EEEECCCCCCCCC---CCCCCEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHHHHCCCCCCCEE T ss_conf 9992698745568---8998177882899809999999999999971998318997886477899999997488788769 Q ss_pred CCCCC------------------------CCCCCCCCCCC--CC----CCC--CCCCCCEEECCCCCCCCCCCCHHHHHC Q ss_conf 22333------------------------32244685210--11----123--356752132144321111000011100 Q gi|254780922|r 77 YIEQL------------------------VPAGLAQSYIL--GA----EFI--GDSSSVLILGDNVFYGSDISDIFHKAR 124 (292) Q Consensus 77 y~~q~------------------------~p~Gta~Ai~~--a~----~~i--~~~~~~lilgDni~~~~~l~~~l~~~~ 124 (292) +..|. .|.|-|+-... .. +++ +-+-.++...||+---.|. .++..+. T Consensus 83 ~F~Q~~~P~l~~~~~~p~~~~~~~~~~~w~P~GhGdiy~aL~~sG~Ld~l~~~G~eyifv~nvDNLga~~Dp-~~lg~~~ 161 (300) T cd00897 83 TFNQSRYPRISKETLLPVPSWADSPDEEWYPPGHGDIFESLYNSGLLDTLLAQGKEYLFVSNIDNLGATVDL-RILNHMV 161 (300) T ss_pred EEECCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCHHHHHHHCCCEEEEEEECCCCCCCCCH-HHHHHHH T ss_conf 984377312725889653557899720246699718999988778499998679769999825663110698-9999987 Q ss_pred CCHHHHHCCCCCCCCC-CCCCCC-CCCCCCCHHHCCCCCC Q ss_conf 0001110012222100-012221-2222100010013333 Q gi|254780922|r 125 ARRNSATVVGCHVQNP-QRYGVV-EVDSSNQAISIEEKPN 162 (292) Q Consensus 125 ~~~~~~ti~~~~v~~p-~~yGvv-~~d~~~~i~~i~EKP~ 162 (292) .++....+-+++-..| ++=|++ ..|..-.+.++-|=|+ T Consensus 162 ~~~~~~~~eV~~Kt~~d~KgG~l~~~~g~~~l~E~aq~p~ 201 (300) T cd00897 162 DNKAEYIMEVTDKTRADVKGGTLIQYEGKLRLLEIAQVPK 201 (300) T ss_pred HCCCCEEEEEEECCCCCCCCCEEEEECCEEEEEEHHCCCH T ss_conf 4399759996255778876438999999888866100996 No 89 >KOG2388 consensus Probab=96.52 E-value=0.0028 Score=40.76 Aligned_cols=75 Identities=25% Similarity=0.427 Sum_probs=46.6 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC---CEEHHHHHHHHHHH----------CCCCEE--EEEECCCCCCCCCCCCCC Q ss_conf 799248887455334438982222478---37479999999997----------898689--999777656431112222 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIY---NKPMIYYPVSTLMD----------AGIREI--LIISTPRDLPVLKEFLGS 67 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~---gkplI~~~l~~l~~----------~Gi~~I--~ii~~~~~~~~~~~~l~~ 67 (292) ++++|||.||||. ...||.+.|++ ++.+.++..+.+.. .| .+| +|.|+..-.+.-.+++.. T Consensus 100 ~~llaGgqgtRLg---~~~pkg~~~~G~~~~~slf~~qae~il~lq~~a~~~~~~~-~~I~w~ImtS~~T~e~T~~~f~~ 175 (477) T KOG2388 100 VVLLAGGQGTRLG---SSGPKGCYPIGLPSGKSLFQIQAERILKLQELASMAVSDG-VDIPWYIMTSAFTHEATLEYFES 175 (477) T ss_pred EEEECCCCEEEEC---CCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCC-CCEEEEEECCCCCCHHHHHHHHH T ss_conf 8995167403113---5787533664775654065534999999999766540468-86489995368861777767753 Q ss_pred CCCCCCC---CCCCCCC Q ss_conf 2222111---1222333 Q gi|254780922|r 68 GEKWGVQ---FSYIEQL 81 (292) Q Consensus 68 g~~~g~~---i~y~~q~ 81 (292) -..+|++ +++..|. T Consensus 176 ~~~FGl~~~qv~~f~Q~ 192 (477) T KOG2388 176 HKYFGLKPEQVTFFQQG 192 (477) T ss_pred CCCCCCCHHHEEEEECC T ss_conf 58877774672421026 No 90 >pfam01704 UDPGP UTP--glucose-1-phosphate uridylyltransferase. This family consists of UTP--glucose-1-phosphate uridylyltransferases, EC:2.7.7.9. Also known as UDP-glucose pyrophosphorylase (UDPGP) and Glucose-1-phosphate uridylyltransferase. UTP--glucose-1-phosphate uridylyltransferase catalyses the interconversion of MgUTP + glucose-1-phosphate and UDP-glucose + MgPPi. UDP-glucose is an important intermediate in mammalian carbohydrate interconversion involved in various metabolic roles depending on tissue type. In Dictyostelium (slime mold) mutants in this enzyme abort the development cycle. Also within the family is UDP-N-acetylglucosamine or AGX1 and two hypothetical proteins from Borrelia burgdorferi the lyme disease spirochaete. Probab=95.58 E-value=0.027 Score=34.53 Aligned_cols=51 Identities=18% Similarity=0.234 Sum_probs=35.5 Q ss_pred EEEECCCCCCCCCHHHCCCCCCEEEEC-CEEHHHHHHHHHH----HCCCCE-EEEEECCC Q ss_conf 799248887455334438982222478-3747999999999----789868-99997776 Q gi|254780922|r 3 GIVLAGGSGTRLRPLTDLLSKQMLPIY-NKPMIYYPVSTLM----DAGIRE-ILIISTPR 56 (292) Q Consensus 3 aiIlAaG~GtRl~P~T~~~pK~Llpi~-gkplI~~~l~~l~----~~Gi~~-I~ii~~~~ 56 (292) .|-|+||.||+|. -..||.|++|. |+.++|.++.... ..|+.= .++.++.. T Consensus 56 VlKLNGGLGTsMG---~~~pKSli~Vr~g~TFLDl~v~Qi~~ln~~y~~~vPl~lMnSf~ 112 (416) T pfam01704 56 VLKLNGGLGTSMG---CKGPKSMIEVRNGNTFLDLIVQQIEQLNKRYNCDVPLLLMNSFN 112 (416) T ss_pred EEECCCCCCCCCC---CCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCC T ss_conf 9982698777668---88885577978998189999999999987549986669966866 No 91 >cd04184 GT2_RfbC_Mx_like Myxococcus xanthus RfbC like proteins are required for O-antigen biosynthesis. The rfbC gene encodes a predicted protein of 1,276 amino acids, which is required for O-antigen biosynthesis in Myxococcus xanthus. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyl transferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=95.16 E-value=0.033 Score=34.05 Aligned_cols=103 Identities=12% Similarity=0.156 Sum_probs=60.7 Q ss_pred EEEECCE--EHHHHHHHHHHHCCCC--EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837--4799999999978986--89999777656431112222222211112223333224468521011123356 Q gi|254780922|r 25 MLPIYNK--PMIYYPVSTLMDAGIR--EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDS 100 (292) Q Consensus 25 Llpi~gk--plI~~~l~~l~~~Gi~--~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~ 100 (292) .+|+++. ..|...|+.+.+.-.+ ||+||-+..........+.......-++.++..++..|.+.|...|.+.-.++ T Consensus 6 iip~yN~~~~~l~~~l~Si~~Qt~~~~EiIvvDd~StD~~~~~~~~~~~~~~~~i~~~~~~~n~G~~~a~N~gi~~a~ge 85 (202) T cd04184 6 VMPVYNTPEKYLREAIESVRAQTYPNWELCIADDASTDPEVKRVLKKYAAQDPRIKVVFREENGGISAATNSALELATGE 85 (202) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCCCC T ss_conf 99748896699999999998378998799999899894789999999885188520012678878999988645136876 Q ss_pred CCCEEECCCCCCCCCCCCHHHHHCCCH Q ss_conf 752132144321111000011100000 Q gi|254780922|r 101 SSVLILGDNVFYGSDISDIFHKARARR 127 (292) Q Consensus 101 ~~~lilgDni~~~~~l~~~l~~~~~~~ 127 (292) -++.+-+||++....+..++....+.. T Consensus 86 yi~flD~DD~~~p~~l~~~~~~~~~~~ 112 (202) T cd04184 86 FVALLDHDDELAPHALYEVVKALNEHP 112 (202) T ss_pred EEEECCCCCEECHHHHHHHHHHHHHCC T ss_conf 677327785528439999999998689 No 92 >pfam00535 Glycos_transf_2 Glycosyl transferase family 2. Diverse family, transferring sugar from UDP-glucose, UDP-N-acetyl- galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids. Probab=94.66 E-value=0.037 Score=33.72 Aligned_cols=106 Identities=12% Similarity=0.110 Sum_probs=62.8 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--CEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-479999999997898--6899997776564311122222222111122233332244685210111233567 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGI--REILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDSS 101 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi--~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~ 101 (292) .+|.++. ..|.-.|+.+.+... -+|+|+-+.. .+.-.+.+........++.++..+...|.++|+..+......+- T Consensus 3 iip~yN~~~~l~~~l~sl~~q~~~~~eiiivDd~S-~d~t~~~~~~~~~~~~~v~~~~~~~~~g~~~a~n~g~~~a~~~~ 81 (168) T pfam00535 3 IIPTYNEEKYLEECLESLLNQTYPNFEIIVVDDGS-TDGTVEIAEEYAKKDPRIRVIRLEENLGKAAARNAGLKLATGDY 81 (168) T ss_pred EEEECCCHHHHHHHHHHHHCCCCCCEEEEEEECCC-CCHHHCEEEEECCCCCCCEEEEEECCCCCHHHHHHHHHHCCCCE T ss_conf 99961888999999999973879985999998999-81010012100134676002442024574688889999679985 Q ss_pred CCEEECCCCCCCCCCCCHHHHHCCCHHHHH Q ss_conf 521321443211110000111000001110 Q gi|254780922|r 102 SVLILGDNVFYGSDISDIFHKARARRNSAT 131 (292) Q Consensus 102 ~~lilgDni~~~~~l~~~l~~~~~~~~~~t 131 (292) ++++-+|+++....+.+++....+...++. T Consensus 82 v~~lD~D~~~~~~~l~~~~~~~~~~~~~~~ 111 (168) T pfam00535 82 ILFLDADDEVAPDWLEKLVELLEKNGADIV 111 (168) T ss_pred EEEECCCCCCCHHHHHHHHHHHHCCCCEEE T ss_conf 999858998687699999999983997699 No 93 >cd02526 GT2_RfbF_like RfbF is a putative dTDP-rhamnosyl transferase. Shigella flexneri RfbF protein is a putative dTDP-rhamnosyl transferase. dTDP rhamnosyl transferases of Shigella flexneri add rhamnose sugars to N-acetyl-glucosamine in the O-antigen tetrasaccharide repeat. Lipopolysaccharide O antigens are important virulence determinants for many bacteria. The variations of sugar composition, the sequence of the sugars and the linkages in the O antigen provide structural diversity of the O antigen. Probab=94.05 E-value=0.11 Score=30.83 Aligned_cols=189 Identities=11% Similarity=0.109 Sum_probs=84.1 Q ss_pred EEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCEEECC Q ss_conf 747999999999789868999977765643111222222221111222333322446852101112335--675213214 Q gi|254780922|r 31 KPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGD--SSSVLILGD 108 (292) Q Consensus 31 kplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~--~~~~lilgD 108 (292) ...+.-.|+.+.. .+.+|+||=|...... ...+ +...-++.++...+-.|-|.|...+...... .++++++.| T Consensus 10 ~~~l~~~L~sl~~-q~~eIiVVDN~S~d~~-~~~~---~~~~~~v~~i~~~~N~G~a~g~N~Gi~~a~~~~~d~i~~ln~ 84 (237) T cd02526 10 LSKLKELLAALAE-QVDKVVVVDNSSGNDI-ELRL---RLNSEKIELIHLGENLGIAKALNIGIKAALENGADYVLLFDQ 84 (237) T ss_pred HHHHHHHHHHHHC-CCCEEEEEECCCCHHH-HHHH---HHCCCCEEEEECCCCCCHHHHCCCCCHHHHCCCCCEEEEECC T ss_conf 9999999997663-7998999969889028-9999---861998699989997785887274303341179778999357 Q ss_pred CCCCCCC-CCCHHHHHCC--CHHHHHCCCCCCCCC---CCCCCCCCCCCCCHHHCCCCCCCC-CCCCCH-HHHHHHHHHH Q ss_conf 4321111-0000111000--001110012222100---012221222210001001333332-222220-2455410002 Q gi|254780922|r 109 NVFYGSD-ISDIFHKARA--RRNSATVVGCHVQNP---QRYGVVEVDSSNQAISIEEKPNNP-KSSFAV-TGIYFYDQEV 180 (292) Q Consensus 109 ni~~~~~-l~~~l~~~~~--~~~~~ti~~~~v~~p---~~yGvv~~d~~~~i~~i~EKP~~~-~sn~a~-~GiY~f~~~i 180 (292) |.+...+ +..++..... ....+.+++-.+.+. ..+........ ....-...+..+ +..++. +| .++..++ T Consensus 85 D~~~~~~~l~~l~~~~~~~~~~~~vg~~~p~~~d~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~sg-~lir~~~ 162 (237) T cd02526 85 DSVPPPDMVEKLLAYKILSDKNSNIGAVGPRIIDRRTGENSPGVRKSGY-KLRIQKEGEEGLKEVDFLITSG-SLISLEA 162 (237) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCCEEEEEEEEECCCCCCCCCCCCCCCC-CCCCCCCCCCCCEECCEEEECC-EEEEHHH T ss_conf 5655905999999999852227988999717986899800454211253-3223544247844604303014-3778999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCH Q ss_conf 24322004675333333343200000132046313541256327845 Q gi|254780922|r 181 VNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTP 227 (292) Q Consensus 181 ~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~ 227 (292) |+.+--..++---.+|=+|+--.+.+.|.. +.+...-..|+..|+. T Consensus 163 ~~~vG~fde~fF~y~ED~Dl~~R~~~~G~~-i~~~p~a~v~H~~G~~ 208 (237) T cd02526 163 LEKVGGFDEDLFIDYVDTEWCLRARSKGYK-IYVVPDAVLKHELGDK 208 (237) T ss_pred HHHHCCCCHHHCCCCCHHHHHHHHHHCCCC-EEEECCEEEEECCCCC T ss_conf 998579968977937199999999985997-9998988999789984 No 94 >cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=93.90 E-value=0.067 Score=32.12 Aligned_cols=101 Identities=10% Similarity=0.183 Sum_probs=57.6 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC--EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-4799999999978986--899997776564311122222222111122233332244685210111233567 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGIR--EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDSS 101 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi~--~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~ 101 (292) ++|+++. ..|.-.|+.+.+.-.. ||+||-+...... .+.+.. +.-.+.+....+-.|.++|...|.....++ T Consensus 3 iip~yN~~~~l~~~l~Si~~Q~~~~~EiIvVDd~S~D~t-~~~~~~---~~~~~~~~~~~~~~G~~~a~N~g~~~a~g~- 77 (202) T cd06433 3 ITPTYNQAETLEETIDSVLSQTYPNIEYIVIDGGSTDGT-VDIIKK---YEDKITYWISEPDKGIYDAMNKGIALATGD- 77 (202) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCH-HHHHHH---CCCCEEEEEECCCCCHHHHHHHHHHHCCCC- T ss_conf 999769889999999999837899979999979988423-445331---134125999888888899987327754855- Q ss_pred CCEEE-CCCCCCCCCCCCHHHHHCCCHHHH Q ss_conf 52132-144321111000011100000111 Q gi|254780922|r 102 SVLIL-GDNVFYGSDISDIFHKARARRNSA 130 (292) Q Consensus 102 ~~lil-gDni~~~~~l~~~l~~~~~~~~~~ 130 (292) +++.+ +||++....+..++....+..... T Consensus 78 yi~~ld~DD~~~~~~~~~~~~~~~~~~~~~ 107 (202) T cd06433 78 IIGFLNSDDTLLPGALLAVVAAFAEHPEVD 107 (202) T ss_pred EEECCCCCCEECCHHHHHHHHHHHHCCCCC T ss_conf 532248886268449999999998789940 No 95 >cd06442 DPM1_like DPM1_like represents putative enzymes similar to eukaryotic DPM1. Proteins similar to eukaryotic DPM1, including enzymes from bacteria and archaea; DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, Probab=92.75 E-value=0.12 Score=30.57 Aligned_cols=186 Identities=17% Similarity=0.152 Sum_probs=84.4 Q ss_pred EEEECCE-EHHHHHHHHHHHC--CCC-EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-4799999999978--986-89999777656431112222222211112223333224468521011123356 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDA--GIR-EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDS 100 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~--Gi~-~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~ 100 (292) .+|+++. .-|.-.++.+... +.+ +|+||-+ .+.+--.+.+.......-++..+....-.|-|.|+..+.+....+ T Consensus 2 IIP~yNEe~~i~~~l~~i~~~~~~~~~eIIvVdD-gStD~T~~i~~~l~~~~~~~~~i~~~~n~G~g~Ai~~G~~~a~~~ 80 (224) T cd06442 2 IIPTYNERENIPELIERLDAALKGIDYEIIVVDD-NSPDGTAEIVRELAKEYPRVRLIVRPGKRGLGSAYIEGFKAARGD 80 (224) T ss_pred EEEECCCHHHHHHHHHHHHHHHCCCCEEEEEEEC-CCCCCCHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHCCC T ss_conf 9995487888999999999860689949999989-697371788865422477247863366787279999999860178 Q ss_pred CCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCC--CCCCCCCCCCCCHHHCCCCCC--CCCCCCCHHHHHHH Q ss_conf 75213214432111100001110000011100122221000--122212222100010013333--32222220245541 Q gi|254780922|r 101 SSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQ--RYGVVEVDSSNQAISIEEKPN--NPKSSFAVTGIYFY 176 (292) Q Consensus 101 ~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~--~yGvv~~d~~~~i~~i~EKP~--~~~sn~a~~GiY~f 176 (292) -.+++-+|.-+...++.++++...+.+.++.+........+ .++.... =-.++..+.-+-- .+-++ ..+|.-+| T Consensus 81 ~i~~~DaD~q~~p~~i~~li~~~~~~~~d~V~GsR~~~~~~~~~~~~~r~-~~s~~~~~i~~~l~~~~i~D-~~~gfr~~ 158 (224) T cd06442 81 VIVVMDADLSHPPEYIPELLEAQLEGGADLVIGSRYVEGGGVEGWGLKRK-LISRGANLLARLLLGRKVSD-PTSGFRAY 158 (224) T ss_pred CEEEECCCCCCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCHHHHH-HHHHHHHHHHHHCCCCEECC-CCCCEEEE T ss_conf 46886599976277789999998718872479887627986357706667-88888887632305842431-79963998 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEE Q ss_conf 00022432200467533333334320000013204631 Q gi|254780922|r 177 DQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEF 214 (292) Q Consensus 177 ~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~ 214 (292) +.++++.+.....+.+-+++ +++.-...+.|....++ T Consensus 159 ~~~~~~~i~~~~~~~~f~~~-~ei~~~~~~~~~ki~ev 195 (224) T cd06442 159 RREVLEKLIDSLVSKGYKFQ-LELLVRARRLGYRIVEV 195 (224) T ss_pred EHHHHHHHHHHCCCCCCCCH-HHHHHHHHHCCCEEEEE T ss_conf 89999988775456663018-99999999889989999 No 96 >PRK00576 molybdopterin-guanine dinucleotide biosynthesis protein A; Provisional Probab=92.61 E-value=0.18 Score=29.48 Aligned_cols=36 Identities=14% Similarity=0.242 Sum_probs=28.8 Q ss_pred CCCCCEEEECC--EEHHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 89822224783--74799999999978986899997776 Q gi|254780922|r 20 LLSKQMLPIYN--KPMIYYPVSTLMDAGIREILIISTPR 56 (292) Q Consensus 20 ~~pK~Llpi~g--kplI~~~l~~l~~~Gi~~I~ii~~~~ 56 (292) ...|.++|+.| +|||+|+++.|... +.+|+|+++.. T Consensus 2 G~DKAll~~~G~~~tLlerv~~~l~~~-~~~V~vv~~~~ 39 (178) T PRK00576 2 GRDKATLPLPGGTTTLVEHVVGIVGQR-CAPVFVMAAPG 39 (178) T ss_pred CCCCEEEEECCCCCCHHHHHHHHHHHH-CCEEEEECCCC T ss_conf 988774276799825999999999865-89899988998 No 97 >cd04188 DPG_synthase DPG_synthase is involved in protein N-linked glycosylation. UDP-glucose:dolichyl-phosphate glucosyltransferase (DPG_synthase) is a transmembrane-bound enzyme of the endoplasmic reticulum involved in protein N-linked glycosylation. This enzyme catalyzes the transfer of glucose from UDP-glucose to dolichyl phosphate. Probab=92.34 E-value=0.1 Score=30.96 Aligned_cols=190 Identities=10% Similarity=0.077 Sum_probs=87.6 Q ss_pred EEEECCE-----EHHHHHHHHHHHCC-CC-EEEEEECCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-----47999999999789-86-89999777656431112222-22221111222333322446852101112 Q gi|254780922|r 25 MLPIYNK-----PMIYYPVSTLMDAG-IR-EILIISTPRDLPVLKEFLGS-GEKWGVQFSYIEQLVPAGLAQSYILGAEF 96 (292) Q Consensus 25 Llpi~gk-----plI~~~l~~l~~~G-i~-~I~ii~~~~~~~~~~~~l~~-g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~ 96 (292) .+|+++. ++|+-.++.+.... .+ ||+||-+...... .+.+.. .......+.++...+..|.|+|+..|... T Consensus 2 iIP~yNE~~~I~~~i~~i~~~~~~~~~~~~eIivVddgS~D~T-~~i~~~~~~~~~~~~~vi~~~~n~G~g~A~~~G~~~ 80 (211) T cd04188 2 VIPAYNEEKRLPPTLEEAVEYLEERPSFSYEIIVVDDGSKDGT-AEVARKLARKNPALIRVLTLPKNRGKGGAVRAGMLA 80 (211) T ss_pred EEEECCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCH-HHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHH T ss_conf 9992478888999999999998633699838999989997187-999999974178659999658888804899999996 Q ss_pred CCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCC---CCCCC--CCC-CCCHHHCCCCCCCCCCCCCH Q ss_conf 3356752132144321111000011100000111001222210001---22212--222-10001001333332222220 Q gi|254780922|r 97 IGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQR---YGVVE--VDS-SNQAISIEEKPNNPKSSFAV 170 (292) Q Consensus 97 i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~---yGvv~--~d~-~~~i~~i~EKP~~~~sn~a~ 170 (292) ...+-++++-+|..+...++.++++.......++.+.......... ..... ... -..+..+.=+ ..-++ .. T Consensus 81 a~~d~i~~~DaD~~~~~~~i~~l~~~~~~~~~d~V~GsR~~~~~~~~~~~~~~r~~~s~~~~~~~~~l~~--~~i~D-~~ 157 (211) T cd04188 81 ARGDYILFADADLATPFEELEKLEEALKTSGYDIAIGSRAHLASAAVVKRSWLRNLLGRGFNFLVRLLLG--LGIKD-TQ 157 (211) T ss_pred HHCCEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHC--CCCCC-CC T ss_conf 0048556776897407889999999998489819999755679986335638899999999999999838--98788-88 Q ss_pred HHHHHHHHHHHHHH-HHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCCEEEEC Q ss_conf 24554100022432-200467533333334320000013204631354125632 Q gi|254780922|r 171 TGIYFYDQEVVNIA-RNIRPSARGELEITDVNSYYLDKGLLAVEFLREGSAWFD 223 (292) Q Consensus 171 ~GiY~f~~~i~~~~-~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g~~W~D 223 (292) +|+-+|+.++++.+ ..+. +.+-+++ ++++-.+.+.|....++.. .|.| T Consensus 158 ~gfr~~~~~~l~~i~~~~~-~~~f~~~-~El~~~~~~~g~ki~EvPi---~~~~ 206 (211) T cd04188 158 CGFKLFTRDAARRLFPRLH-LERWAFD-VELLVLARRLGYPIEEVPV---RWVE 206 (211) T ss_pred CCEEEEEHHHHHHHHHHCC-CCCCCCH-HHHHHHHHHCCCCEEEEEE---EEEE T ss_conf 6746577999999875564-6885015-9999999986991999900---8998 No 98 >cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Probab=91.21 E-value=0.16 Score=29.78 Aligned_cols=101 Identities=11% Similarity=0.181 Sum_probs=59.7 Q ss_pred EEEECCE--E-HHHHHHHHHHHCCCC--EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837--4-799999999978986--8999977765643111222222221111222333322446852101112335 Q gi|254780922|r 25 MLPIYNK--P-MIYYPVSTLMDAGIR--EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGD 99 (292) Q Consensus 25 Llpi~gk--p-lI~~~l~~l~~~Gi~--~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~ 99 (292) ++|++++ | .|.-+|+.+.+.-.. ||+||......+.....+..... ..++.++..++-.|.+.|...|.....+ T Consensus 3 iip~YN~e~~~~l~~~l~Svl~Qt~~~~EiIiVdDgSs~d~~~~i~~~~~~-~~~i~~i~~~~N~G~~~a~N~gi~~a~g 81 (201) T cd04195 3 LMSVYIKEKPEFLREALESILKQTLPPDEVVLVKDGPVTQSLNEVLEEFKR-KLPLKVVPLEKNRGLGKALNEGLKHCTY 81 (201) T ss_pred EEECCCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHC-CCCEEEEECCCCCCHHHHHHHHHHHCCC T ss_conf 998884897899999999999579998189999899996543999998614-7998999878878989997763642676 Q ss_pred CCCCEEECCCCCCCCCCCCHHHHHCCC Q ss_conf 675213214432111100001110000 Q gi|254780922|r 100 SSSVLILGDNVFYGSDISDIFHKARAR 126 (292) Q Consensus 100 ~~~~lilgDni~~~~~l~~~l~~~~~~ 126 (292) +-++.+-+||++....+..++....++ T Consensus 82 ~yI~~lD~DD~~~p~~l~~~~~~l~~~ 108 (201) T cd04195 82 DWVARMDTDDISLPDRFEKQLDFIEKN 108 (201) T ss_pred CEEEECCCCCCCCHHHHHHHHHHHHHC T ss_conf 699981889846716999999999878 No 99 >cd04187 DPM1_like_bac Bacterial DPM1_like enzymes are related to eukaryotic DPM1. A family of bacterial enzymes related to eukaryotic DPM1; Although the mechanism of eukaryotic enzyme is well studied, the mechanism of the bacterial enzymes is not well understood. The eukaryotic DPM1 is the catalytic subunit of eukaryotic Dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. The enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. This protein family belongs to Glycosyltransferase 2 superfamily. Probab=90.31 E-value=0.22 Score=28.91 Aligned_cols=157 Identities=13% Similarity=0.091 Sum_probs=72.6 Q ss_pred EEEECCE-----EHHHHHHHHHHHCCCC-EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-----4799999999978986-899997776564311122222222111122233332244685210111233 Q gi|254780922|r 25 MLPIYNK-----PMIYYPVSTLMDAGIR-EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIG 98 (292) Q Consensus 25 Llpi~gk-----plI~~~l~~l~~~Gi~-~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~ 98 (292) .+|+++. .++.-..+.+.....+ +|+||-. .+.+.-.+.+.....-.-++.++...+..|-+.|+..+..... T Consensus 2 IIP~yNEe~~I~~~i~~i~~~~~~~~~~~eIivVDD-~StD~T~~i~~~l~~~~~~v~~i~~~~N~G~g~Ai~~g~~~a~ 80 (181) T cd04187 2 VVPVYNEEENLPELYERLKAVLESLGYDYEIIFVDD-GSTDRTLEILRELAARDPRVKVIRLSRNFGQQAALLAGLDHAR 80 (181) T ss_pred EEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEEEEC-CCCCCCHHHHHCCCCCCCCEEEEECCCCCCHHHHHHHHHHHCC T ss_conf 996167888799999999999865599879999989-9975624542012123332045314557644678898998556 Q ss_pred CCCCCEE-ECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCC-CCCHHHCCCCCCCCCCCCCHHHHHHH Q ss_conf 5675213-214432111100001110000011100122221000122212222-10001001333332222220245541 Q gi|254780922|r 99 DSSSVLI-LGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDS-SNQAISIEEKPNNPKSSFAVTGIYFY 176 (292) Q Consensus 99 ~~~~~li-lgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~-~~~i~~i~EKP~~~~sn~a~~GiY~f 176 (292) .+ ++++ -+|...-...+.+++.. .+.+.+.. .+........+-..-... -..+..+.=+. +-.-..+|+.+| T Consensus 81 ~d-~i~~~D~D~q~~p~~i~~l~~~-~~~~~D~V-~gsR~~r~~~~~r~~~s~~~~~l~~~l~~~---~~~D~~~gfr~~ 154 (181) T cd04187 81 GD-AVITMDADLQDPPELIPEMLAK-WEEGYDVV-YGVRKNRKESWLKRLTSKLFYRLINKLSGV---DIPDNGGDFRLM 154 (181) T ss_pred CC-EEEECCCCCCCCHHHHHHHHHH-HHCCCEEE-EEEEECCCCCHHHHHHHHHHHHHHHHHHCC---CCCCCCCCEEEE T ss_conf 97-4775189998299999999999-97599199-999707887689999999999999998799---900789286998 Q ss_pred HHHHHHHHHHCC Q ss_conf 000224322004 Q gi|254780922|r 177 DQEVVNIARNIR 188 (292) Q Consensus 177 ~~~i~~~~~~l~ 188 (292) ++++++.++... T Consensus 155 ~r~~~~~i~~~~ 166 (181) T cd04187 155 DRKVVDALLLLP 166 (181) T ss_pred EHHHHHHHHHCC T ss_conf 899999987489 No 100 >PRK10073 predicted glycosyl transferase; Provisional Probab=89.87 E-value=0.18 Score=29.46 Aligned_cols=104 Identities=14% Similarity=0.171 Sum_probs=66.5 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC--EEEEEECCCCC----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-4799999999978986--89999777656----431112222222211112223333224468521011123 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGIR--EILIISTPRDL----PVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFI 97 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi~--~I~ii~~~~~~----~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i 97 (292) .+|+++. +-|.-.|+.+...-.+ ||+ +++..+. ++.+++.. -.-++.++.| +-.|.+.|-..+.+.- T Consensus 11 iiP~YN~e~yl~~cl~Si~~Qt~~~~EiI-iVdDgStD~s~~i~~~~~~----~~~~i~vi~~-~N~G~s~ARN~gl~~a 84 (329) T PRK10073 11 IVPLYNAGKDFRACMESLIAQTWTALEII-IVNDGSTDNSVEIAKHYAE----NYPHVRLLHQ-ANAGASVARNTGLAVA 84 (329) T ss_pred EEECCCCHHHHHHHHHHHHCCCCCCEEEE-EEECCCCCCHHHHHHHHHH----HCCCEEEEEC-CCCCHHHHHHHHHHHC T ss_conf 99288988999999999980899997999-9989998258999999981----2998999966-8886189999999974 Q ss_pred CCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCC Q ss_conf 3567521321443211110000111000001110012 Q gi|254780922|r 98 GDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVG 134 (292) Q Consensus 98 ~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~ 134 (292) .++-++.+=+||.+...-+..++....+...+..++. T Consensus 85 ~G~yi~f~DsDD~l~~~~le~l~~~a~~~~~DIv~~~ 121 (329) T PRK10073 85 TGKYVAFVDADDEVYPTMYETLMTMALEDDLDVAQCN 121 (329) T ss_pred CCCEEEEECCCCCCCCCHHHHHHHHHHHCCCCEEEEC T ss_conf 8888999757727588769999999974799999987 No 101 >cd04186 GT_2_like_c Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=89.67 E-value=0.44 Score=26.97 Aligned_cols=93 Identities=13% Similarity=0.176 Sum_probs=53.8 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--CEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-479999999997898--6899997776564311122222222111122233332244685210111233567 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGI--REILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDSS 101 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi--~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~ 101 (292) .+|.++. ..|.-.|+.+.+.-. -+|+||-+... +.-.+.+ +++.-++.++..++..|.+.|...+.+.... + T Consensus 2 IIptyN~~~~l~~~l~Sl~~q~~~~~eiivVD~~S~-d~t~~~~---~~~~~~i~~i~~~~n~G~~~a~N~g~~~a~g-~ 76 (166) T cd04186 2 IIVNYNSLEYLKACLDSLLAQTYPDFEVIVVDNAST-DGSVELL---RELFPEVRLIRNGENLGFGAGNNQGIREAKG-D 76 (166) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCC-CHHHHHH---HHCCCCEEEEECCCCCCHHHHHHHHHHHHCC-C T ss_conf 999889889999999999837799809999968998-3156777---7428988999898987767876168987165-7 Q ss_pred CCEEECCCCCCCCC-CCCHHHH Q ss_conf 52132144321111-0000111 Q gi|254780922|r 102 SVLILGDNVFYGSD-ISDIFHK 122 (292) Q Consensus 102 ~~lilgDni~~~~~-l~~~l~~ 122 (292) +++.+.||.+...+ +..++.. T Consensus 77 ~i~~lD~D~~~~~~~l~~~~~~ 98 (166) T cd04186 77 YVLLLNPDTVVEPGALLELLDA 98 (166) T ss_pred CEEEECCCEEECCCHHHHHHHH T ss_conf 2699879938883999999999 No 102 >TIGR03111 glyc2_xrt_Gpos1 putative glycosyltransferase TIGR03111. Members of this protein family probable glycosyltransferases of family 2, whose genes are near those for Gram-positive proteins (TIGR03110) related to the proposed exosortase (TIGR02602). Probab=88.97 E-value=0.23 Score=28.79 Aligned_cols=102 Identities=10% Similarity=0.113 Sum_probs=60.5 Q ss_pred CEEEECC-EEHHHHHHHHHHHCCC--C--EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2224783-7479999999997898--6--899997776564311122222222111122233332244685210111233 Q gi|254780922|r 24 QMLPIYN-KPMIYYPVSTLMDAGI--R--EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIG 98 (292) Q Consensus 24 ~Llpi~g-kplI~~~l~~l~~~Gi--~--~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~ 98 (292) -++|.++ ...|...++.+.+... + ||++|.........+ .+.......-++.....+++.|-++|+..|..... T Consensus 53 ViIPayNEe~~I~~~i~Sl~~~dYP~~~lEIIvVdDgStD~T~e-i~~~~~~~~p~~~v~~~~~n~GKa~ALN~gl~~A~ 131 (439) T TIGR03111 53 IIIPVYNSEDTLFNCIESIYNQTYPIELIDIILANNQSTDDSFQ-VFCRAQNEFPGLSLRYMNSDQGKAKALNAAIYNSI 131 (439) T ss_pred EEEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHH-HHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHCC T ss_conf 99935873889999999999679999855999997899517999-99998865898399976898985999999999778 Q ss_pred CCCCCEEECCCCCCCCCCCCHHHHHCCC Q ss_conf 5675213214432111100001110000 Q gi|254780922|r 99 DSSSVLILGDNVFYGSDISDIFHKARAR 126 (292) Q Consensus 99 ~~~~~lilgDni~~~~~l~~~l~~~~~~ 126 (292) ++-++.+=+|+++..+-+.+++....+. T Consensus 132 Ge~iv~~DADt~le~daL~~lv~~F~~d 159 (439) T TIGR03111 132 GKYIIHIDSDGKLHKDAIKNMVTRFENN 159 (439) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHC T ss_conf 8989998089886856999999987209 No 103 >cd06434 GT2_HAS Hyaluronan synthases catalyze polymerization of hyaluronan. Hyaluronan synthases (HASs) are bi-functional glycosyltransferases that catalyze polymerization of hyaluronan. HASs transfer both GlcUA and GlcNAc in beta-(1,3) and beta-(1,4) linkages, respectively to the hyaluronan chain using UDP-GlcNAc and UDP-GlcUA as substrates. HA is made as a free glycan, not attached to a protein or lipid. HASs do not need a primer for HA synthesis; they initiate HA biosynthesis de novo with only UDP-GlcNAc, UDP-GlcUA, and Mg2+. Hyaluronan (HA) is a linear heteropolysaccharide composed of (1-3)-linked beta-D-GlcUA-beta-D-GlcNAc disaccharide repeats. It can be found in vertebrates and a few microbes and is typically on the cell surface or in the extracellular space, but is also found inside mammalian cells. Hyaluronan has several physiochemical and biological functions such as space filling, lubrication, and providing a hydrated matrix through which cells can migrate. Probab=87.31 E-value=0.46 Score=26.83 Aligned_cols=96 Identities=15% Similarity=0.320 Sum_probs=53.1 Q ss_pred CEEEECC-EE-HHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2224783-74-799999999978986899997776564311122222222111122233332244685210111233567 Q gi|254780922|r 24 QMLPIYN-KP-MIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDSS 101 (292) Q Consensus 24 ~Llpi~g-kp-lI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~ 101 (292) -++|.++ .| +|.-.|+.+....--+|+||......... ..+..... ...+. +...+..|-++|+..|.+.... + T Consensus 4 viIPayNE~~~il~~~l~s~~~~~~~eiivV~D~s~d~~~-~~~~~~~~-~~~~~-v~~~~~~GK~~Aln~~l~~a~~-d 79 (235) T cd06434 4 VIIPVYDEDPDVFRECLRSILRQKPLEIIVVTDGDDEPYL-SILSQTVK-YGGIF-VITVPHPGKRRALAEGIRHVTT-D 79 (235) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCHH-HHHHHHCC-CCCEE-EEECCCCCHHHHHHHHHHHCCC-C T ss_conf 9995226976899999999980899989999889997289-99998634-89789-9988989989999999997577-8 Q ss_pred CCEEECCCCCCCCC-CCCHHHHH Q ss_conf 52132144321111-00001110 Q gi|254780922|r 102 SVLILGDNVFYGSD-ISDIFHKA 123 (292) Q Consensus 102 ~~lilgDni~~~~~-l~~~l~~~ 123 (292) +++++-+|...+.+ +.+++... T Consensus 80 ~v~~~DaD~~~~~~~l~~l~~~f 102 (235) T cd06434 80 IVVLLDSDTVWPPNALPEMLKPF 102 (235) T ss_pred EEEEECCCCCCCHHHHHHHHHHH T ss_conf 89998588535888999999964 No 104 >cd04179 DPM_DPG-synthase_like DPM_DPG-synthase_like is a member of the Glycosyltransferase 2 superfamily. DPM1 is the catalytic subunit of eukaryotic dolichol-phosphate mannose (DPM) synthase. DPM synthase is required for synthesis of the glycosylphosphatidylinositol (GPI) anchor, N-glycan precursor, protein O-mannose, and C-mannose. In higher eukaryotes,the enzyme has three subunits, DPM1, DPM2 and DPM3. DPM is synthesized from dolichol phosphate and GDP-Man on the cytosolic surface of the ER membrane by DPM synthase and then is flipped onto the luminal side and used as a donor substrate. In lower eukaryotes, such as Saccharomyces cerevisiae and Trypanosoma brucei, DPM synthase consists of a single component (Dpm1p and TbDpm1, respectively) that possesses one predicted transmembrane region near the C terminus for anchoring to the ER membrane. In contrast, the Dpm1 homologues of higher eukaryotes, namely fission yeast, fungi, and animals, have no transmembrane region, suggesting the ex Probab=87.19 E-value=0.35 Score=27.62 Aligned_cols=107 Identities=17% Similarity=0.119 Sum_probs=58.3 Q ss_pred EEEECCE-EHHHHHHHHHHHC---C-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-4799999999978---9-868999977765643111222222221111222333322446852101112335 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDA---G-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGD 99 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~---G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~ 99 (292) .+|+++. .-|.-.+..+... . .-+|++|-+...... .+.+.....-.-++..+...+-.|-+.|+..|.++... T Consensus 2 iIP~yNE~~~i~~~l~~i~~~~~~~~~~eiiiVddgS~D~T-~~i~~~~~~~~~~~~~i~~~~n~G~g~ai~~G~~~a~~ 80 (185) T cd04179 2 VIPAYNEEENIPELVERLLAVLEEGYDYEIIVVDDGSTDGT-AEIARELAARVPRVRVIRLSRNFGKGAAVRAGFKAARG 80 (185) T ss_pred EEEECCCHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCCH-HHHHHHHHHCCEEEEEEECCCCCCCHHHHHHHHHHCCC T ss_conf 99844878899999999998753579879999989996207-88888764233049986324577705899988875258 Q ss_pred CCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHC Q ss_conf 675213214432111100001110000011100 Q gi|254780922|r 100 SSSVLILGDNVFYGSDISDIFHKARARRNSATV 132 (292) Q Consensus 100 ~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti 132 (292) +-++.+-+|.-+...++.++++...+.+.++.+ T Consensus 81 d~i~~~D~D~~~~~~~i~~~i~~~~~~~~d~V~ 113 (185) T cd04179 81 DIVVTMDADLQHPPEDIPKLLEKLLEGGADVVI 113 (185) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEE T ss_conf 889992799870999999999999978985999 No 105 >PRK10018 predicted glycosyl transferase; Provisional Probab=86.66 E-value=0.33 Score=27.79 Aligned_cols=99 Identities=11% Similarity=0.172 Sum_probs=61.8 Q ss_pred EEEECC-EEHHHHHHHHHHHCCCCEE-EEEECCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 224783-7479999999997898689-999777656--431112222222211112223333224468521011123356 Q gi|254780922|r 25 MLPIYN-KPMIYYPVSTLMDAGIREI-LIISTPRDL--PVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDS 100 (292) Q Consensus 25 Llpi~g-kplI~~~l~~l~~~Gi~~I-~ii~~~~~~--~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~ 100 (292) .+|+++ ...|..+|+.+.+...+++ +||+..... +..+.+.... -..+|.|+.++.-.|.+.|...|...-.++ T Consensus 10 IIP~yN~~~~l~~aI~SVl~Qty~n~EiIIVDD~Std~~~~~~~~~~~--~d~RI~~i~~~~N~G~~~aRN~gi~~A~Ge 87 (279) T PRK10018 10 YMPTWNRQQLAIRAIKSVLRQDYSNWEMIIVDDCSTSWEQLQQYVTAL--NDPRITYIHNDINSGACAVRNQAIMLAQGE 87 (279) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHC--CCCCEEEEECCCCCCHHHHHHHHHHHHCCC T ss_conf 994799779999999999957998989999989999879999999975--899889998789788999999999995699 Q ss_pred CCCEEECCCCCCCCCCCCHHHHHCC Q ss_conf 7521321443211110000111000 Q gi|254780922|r 101 SSVLILGDNVFYGSDISDIFHKARA 125 (292) Q Consensus 101 ~~~lilgDni~~~~~l~~~l~~~~~ 125 (292) -.+.+=+||.+....+..++....+ T Consensus 88 yIafLDsDD~~~PnkLE~ql~~~~~ 112 (279) T PRK10018 88 YITGIDDDDEWTPNRLSVFLAHKQQ 112 (279) T ss_pred EEEEECCCCCCCCCHHHHHHHHHHH T ss_conf 8999998768785669999961442 No 106 >cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified. Probab=86.48 E-value=0.81 Score=25.32 Aligned_cols=103 Identities=14% Similarity=0.147 Sum_probs=59.3 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC---EEEEEECCCCC----CCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCCCC Q ss_conf 2247837-4799999999978986---89999777656----431112222222211112223--333224468521011 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGIR---EILIISTPRDL----PVLKEFLGSGEKWGVQFSYIE--QLVPAGLAQSYILGA 94 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi~---~I~ii~~~~~~----~~~~~~l~~g~~~g~~i~y~~--q~~p~Gta~Ai~~a~ 94 (292) .+|+++. ..|.-.|+.+.+...+ || ||+...+. ++++++........+.+.... ...|+|.+.|...|. T Consensus 2 IiP~YN~~~~l~~~l~Svl~Qty~~~~Ei-IiVDDgStD~t~~ii~~~~~~~~~~~~~~~~~~~~~~~~~G~~~arN~gi 80 (219) T cd06913 2 ILPVHNGEQWLDECLESVLQQDFEGTLEL-SVFNDASTDKSAEIIEKWRKKLEDSGVIVLVGSHNSPSPKGVGYAKNQAI 80 (219) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCCEEE-EEEECCCCCCHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCHHHHHHHHH T ss_conf 88678988999999999984889998999-99989998156999999998666777699982367766467899999999 Q ss_pred CCCCCCCCCEEECCCCCCCCCCCCHHHHHCCCHH Q ss_conf 1233567521321443211110000111000001 Q gi|254780922|r 95 EFIGDSSSVLILGDNVFYGSDISDIFHKARARRN 128 (292) Q Consensus 95 ~~i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~ 128 (292) +.-.++-++.+-+||++....+..++....+... T Consensus 81 ~~A~GeyI~flDsDD~~~p~~l~~~~~~~~~~~~ 114 (219) T cd06913 81 AQSSGRYLCFLDSDDVMMPQRIRLQYEAALQHPN 114 (219) T ss_pred HHCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCC T ss_conf 9778887886189764276599999999985999 No 107 >cd06435 CESA_NdvC_like NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. NdvC_like proteins in this family are putative bacterial beta-(1,6)-glucosyltransferase. Bradyrhizobium japonicum synthesizes periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans during growth under hypoosmotic conditions. Two genes (ndvB, ndvC) are involved in the beta-(1, 3), beta-(1,6)-glucan synthesis. The ndvC mutant strain resulted in synthesis of altered cyclic beta-glucans composed almost entirely of beta-(1, 3)-glycosyl linkages. The periplasmic cyclic beta-(1,3),beta-(1,6)-D-glucans function for osmoregulation. The ndvC mutation also affects the ability of the bacteria to establish a successful symbiotic interaction with host plant. Thus, the beta-glucans may function as suppressors of a host defense response. Probab=85.70 E-value=0.71 Score=25.68 Aligned_cols=97 Identities=15% Similarity=0.249 Sum_probs=54.0 Q ss_pred EEEECCEE--HHHHHHHHHHHCCC--CEEEEEECCCCCCCCCCCCC-CCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCC Q ss_conf 22478374--79999999997898--68999977765643111222-2222211112223333-2244685210111233 Q gi|254780922|r 25 MLPIYNKP--MIYYPVSTLMDAGI--REILIISTPRDLPVLKEFLG-SGEKWGVQFSYIEQLV-PAGLAQSYILGAEFIG 98 (292) Q Consensus 25 Llpi~gkp--lI~~~l~~l~~~Gi--~~I~ii~~~~~~~~~~~~l~-~g~~~g~~i~y~~q~~-p~Gta~Ai~~a~~~i~ 98 (292) ++|.++.+ +|.-.|+.|...-. -+|+||.+....+...+.+. -...++.++.++.... +.|-++|+..|.+... T Consensus 3 iIPayNE~~~vi~~~l~sl~~~~Yp~~eIiVvdd~stD~t~~~~v~~~~~~~~~~~~~~~~~~~~g~K~~alN~~l~~~~ 82 (236) T cd06435 3 HVPCYEEPPEMVKETLDSLAALDYPNFEVIVIDNNTKDEALWKPVEAHCAQLGERFRFFHVEPLPGAKAGALNYALERTA 82 (236) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHCC T ss_conf 98358992899999999999579999879999892998158999999998729967999948889970999999999647 Q ss_pred CC-CCCEEE-CCCCCCCCC-CCCHHHH Q ss_conf 56-752132-144321111-0000111 Q gi|254780922|r 99 DS-SSVLIL-GDNVFYGSD-ISDIFHK 122 (292) Q Consensus 99 ~~-~~~lil-gDni~~~~~-l~~~l~~ 122 (292) .+ ++++++ +|++ ...+ +..++.. T Consensus 83 ~~~e~i~~~DaD~~-~~pd~L~~~v~~ 108 (236) T cd06435 83 PDAEIIAVIDADYQ-VEPDWLKRLVPI 108 (236) T ss_pred CCCEEEEEECCCCC-CCHHHHHHHHHH T ss_conf 98549999768877-786899999986 No 108 >cd06423 CESA_like CESA_like is the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis protein catalyzes the Probab=85.64 E-value=0.72 Score=25.64 Aligned_cols=100 Identities=15% Similarity=0.252 Sum_probs=51.7 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC--EEEEEECCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-4799999999978986--89999777656---4311122222222111122233332244685210111233 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGIR--EILIISTPRDL---PVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIG 98 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi~--~I~ii~~~~~~---~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~ 98 (292) .+|.++. -.|.-.|+.+.+.-.+ +|+|+-+.... +.++++... ...........+..|.++|+..+..... T Consensus 2 iIp~~N~~~~l~~~l~sl~~Q~~~~~eIivvdd~S~d~t~~~~~~~~~~---~~~~~~~~~~~~~~g~~~a~N~g~~~a~ 78 (180) T cd06423 2 IVPAYNEEAVIERTIESLLALDYPKLEVIVVDDGSTDDTLEILEELAAL---YIRRVLVVRDKENGGKAGALNAGLRHAK 78 (180) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHC---CCCEEEEECCCCCCCHHHHHHHCCCCCC T ss_conf 9997797899999999998189998189999899975789999998621---7971898447776779999752510267 Q ss_pred CCCCCEEE-CCCCCCCCCCCCHHHHHCCCHH Q ss_conf 56752132-1443211110000111000001 Q gi|254780922|r 99 DSSSVLIL-GDNVFYGSDISDIFHKARARRN 128 (292) Q Consensus 99 ~~~~~lil-gDni~~~~~l~~~l~~~~~~~~ 128 (292) . ++++.+ +|+++...-+..++....+... T Consensus 79 ~-d~i~~lD~D~~~~~~~l~~~~~~~~~~~~ 108 (180) T cd06423 79 G-DIVVVLDADTILEPDALKRLVVPFFADPK 108 (180) T ss_pred C-CEEEEECCCCCCCHHHHHHHHHHHHHCCC T ss_conf 5-57987438720085699999999987959 No 109 >PRK11204 N-glycosyltransferase PgaC; Provisional Probab=84.74 E-value=0.36 Score=27.49 Aligned_cols=97 Identities=15% Similarity=0.127 Sum_probs=55.5 Q ss_pred EEEECC-EEHHHHHHHHHHHCCC--CEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 224783-7479999999997898--6899997776564311122222222111122233332244685210111233567 Q gi|254780922|r 25 MLPIYN-KPMIYYPVSTLMDAGI--REILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDSS 101 (292) Q Consensus 25 Llpi~g-kplI~~~l~~l~~~Gi--~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~ 101 (292) ++|-++ ...|.-.|+.+.+... -||++|. ..+.+.-.+.+.....-.-++..+......|-|+|+..|......| T Consensus 59 lIPayNEe~~I~~tI~sll~~~YP~~eIiVVd-DgStD~T~~i~~~~~~~~p~~~vi~~~~n~GKa~ALN~gl~~a~ge- 136 (421) T PRK11204 59 LVPCYNEGENVEETISAALALRYPNYEVIAIN-DGSSDNTGEILDRLAAEDPRLRVIHLAENQGKAIALNTGAAAARSE- 136 (421) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCCEEEEEE-CCCCHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHCCCC- T ss_conf 98479978999999999995899986699998-9997408999999970689689997699988999999999843789- Q ss_pred CCEE-ECCCCCCCCCCCCHHHHH Q ss_conf 5213-214432111100001110 Q gi|254780922|r 102 SVLI-LGDNVFYGSDISDIFHKA 123 (292) Q Consensus 102 ~~li-lgDni~~~~~l~~~l~~~ 123 (292) ++++ =+|++...+-+..++... T Consensus 137 ~Vv~~DAD~~~~~d~L~~~v~~f 159 (421) T PRK11204 137 YLVCIDGDALLDPDAAAYMVEHF 159 (421) T ss_pred EEEEECCCCCCCHHHHHHHHHHH T ss_conf 89998998714867999999999 No 110 >cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=84.57 E-value=0.59 Score=26.16 Aligned_cols=102 Identities=9% Similarity=0.107 Sum_probs=53.1 Q ss_pred EEEECCE-EHHHHHHHHHHHCCCC--EEEEEECCCCCCCCCCCCCCC-CCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-4799999999978986--899997776564311122222-222111122233-3322446852101112335 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGIR--EILIISTPRDLPVLKEFLGSG-EKWGVQFSYIEQ-LVPAGLAQSYILGAEFIGD 99 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi~--~I~ii~~~~~~~~~~~~l~~g-~~~g~~i~y~~q-~~p~Gta~Ai~~a~~~i~~ 99 (292) .+|.++. +.|.-.|+.+.+.-.. ||+|| .....+.-.+.+... .+.. ++.++.+ ..-.|.+.++..|.....+ T Consensus 3 iip~yN~~~~l~~~i~Sil~Qt~~~~Eiivv-DDgStD~t~~ii~~~~~~~~-~~~~~~~~~~n~G~~~n~n~gi~~a~g 80 (214) T cd04196 3 LMATYNGEKYLREQLDSILAQTYKNDELIIS-DDGSTDGTVEIIKEYIDKDP-FIIILIRNGKNLGVARNFESLLQAADG 80 (214) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEE-ECCCCCCHHHHHHHHHHHCC-CCCEEEECCCCCHHHHHHHHHHHHCCC T ss_conf 9991798889999999998379988699999-89997237999999997588-651366658880599999999986478 Q ss_pred CCCCEEECCCCCCCCCCCCHHHHHCCCHH Q ss_conf 67521321443211110000111000001 Q gi|254780922|r 100 SSSVLILGDNVFYGSDISDIFHKARARRN 128 (292) Q Consensus 100 ~~~~lilgDni~~~~~l~~~l~~~~~~~~ 128 (292) +-++++-+||++....+..++....+... T Consensus 81 eyi~~lD~DD~~~p~~l~~~~~~~~~~~~ 109 (214) T cd04196 81 DYVFFCDQDDIWLPDKLERLLKAFLKDDK 109 (214) T ss_pred CEEECCCCCCCCCHHHHHHHHHHHHHCCC T ss_conf 75741377656286599999999985899 No 111 >cd06421 CESA_CelA_like CESA_CelA_like are involved in the elongation of the glucan chain of cellulose. Family of proteins related to Agrobacterium tumefaciens CelA and Gluconacetobacter xylinus BscA. These proteins are involved in the elongation of the glucan chain of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues. They are putative catalytic subunit of cellulose synthase, which is a glycosyltransferase using UDP-glucose as the substrate. The catalytic subunit is an integral membrane protein with 6 transmembrane segments and it is postulated that the protein is anchored in the membrane at the N-terminal end. Probab=84.13 E-value=0.75 Score=25.51 Aligned_cols=98 Identities=14% Similarity=0.226 Sum_probs=52.0 Q ss_pred CEEEECCEE--HHHHHHHHHHHCCC--C--EEEEEECCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCC-CCCCCCCCCC Q ss_conf 222478374--79999999997898--6--89999777656431112222-222211112223333224-4685210111 Q gi|254780922|r 24 QMLPIYNKP--MIYYPVSTLMDAGI--R--EILIISTPRDLPVLKEFLGS-GEKWGVQFSYIEQLVPAG-LAQSYILGAE 95 (292) Q Consensus 24 ~Llpi~gkp--lI~~~l~~l~~~Gi--~--~I~ii~~~~~~~~~~~~l~~-g~~~g~~i~y~~q~~p~G-ta~Ai~~a~~ 95 (292) -++|.++.+ +|.-.|+.+.+.-. . +|+||-+ ...+...+.+.. +... .+.++.+..+.| -++|+..|.. T Consensus 5 ViIPayNE~~~~i~~~l~sl~~q~YP~~~~eIiVvdD-~std~t~~~~~~~~~~~--~~~~~~~~~~~g~Ka~alN~g~~ 81 (234) T cd06421 5 VFIPTYNEPLEIVRKTLRAALAIDYPHDKLRVYVLDD-GRRPELRALAAELGVEY--GYRYLTRPDNRHAKAGNLNNALA 81 (234) T ss_pred EEEECCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEC-CCCHHHHHHHHHHCCCC--EEEEEECCCCCCCHHHHHHHHHH T ss_conf 9998389978999999999996799998289999989-89878898888845664--36888748899976999999999 Q ss_pred CCCCCCCCEEE-CCCCCCCCCCCCHHHHHCC Q ss_conf 23356752132-1443211110000111000 Q gi|254780922|r 96 FIGDSSSVLIL-GDNVFYGSDISDIFHKARA 125 (292) Q Consensus 96 ~i~~~~~~lil-gDni~~~~~l~~~l~~~~~ 125 (292) ....+ +++++ +|++....-+..++..... T Consensus 82 ~a~gd-~v~~~DaD~~~~~~~L~~~~~~~~~ 111 (234) T cd06421 82 HTTGD-FVAILDADHVPTPDFLRRTLGYFLD 111 (234) T ss_pred HCCCC-EEEEECCCCCCCHHHHHHHHHHHHH T ss_conf 77899-9999889877087699999999986 No 112 >cd06427 CESA_like_2 CESA_like_2 is a member of the cellulose synthase superfamily. The cellulose synthase (CESA) superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members include cellulose synthase catalytic subunit, chitin synthase, Glucan Biosynthesis protein and other families of CESA-like proteins. Cellulose synthase catalyzes the polymerization reaction of cellulose, an aggregate of unbranched polymers of beta-1,4-linked glucose residues in plants, most algae, some bacteria and fungi, and even some animals. In bacteria, algae and lower eukaryotes, there is a second unrelated type of cellulose synthase (Type II), which produces acylated cellulose, a derivative of cellulose. Chitin synthase catalyzes the incorporation of GlcNAc from substrate UDP-GlcNAc into chitin, which is a linear homopolymer of beta-(1,4)-linked GlcNAc residues and Glucan Biosynthesis prot Probab=81.75 E-value=1.1 Score=24.50 Aligned_cols=98 Identities=9% Similarity=0.132 Sum_probs=55.3 Q ss_pred CEEEECCE-EHHHHHHHHHHHCCC--C--EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22247837-479999999997898--6--899997776564311122222222111122233332244685210111233 Q gi|254780922|r 24 QMLPIYNK-PMIYYPVSTLMDAGI--R--EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIG 98 (292) Q Consensus 24 ~Llpi~gk-plI~~~l~~l~~~Gi--~--~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~ 98 (292) -++|.++. ..|...|+.+..... . +|++|....+....+....-.......+..+....+.|-|+|+..|..+.. T Consensus 5 ViIPa~NE~~vI~~ti~sl~~~~YP~~~~evivv~d~s~~~t~~~~~~~~~~~~~~~~~~~~~~~~gK~~alN~al~~a~ 84 (241) T cd06427 5 ILVPLYKEAEVLPQLIASLSALDYPRSKLDVKLLLEEDDEETIAAARALRLPSIFRVVVVPPSQPRTKPKACNYALAFAR 84 (241) T ss_pred EEEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHCC T ss_conf 99966898899999999999679997617999998899968999999857876708999638988746999999999615 Q ss_pred CCCCCEEE-CCCCCCCCC-CCCHHHHH Q ss_conf 56752132-144321111-00001110 Q gi|254780922|r 99 DSSSVLIL-GDNVFYGSD-ISDIFHKA 123 (292) Q Consensus 99 ~~~~~lil-gDni~~~~~-l~~~l~~~ 123 (292) . ++++++ +|++ ...+ +..++... T Consensus 85 g-d~v~~~DAD~~-~~p~~L~~~v~~f 109 (241) T cd06427 85 G-EYVVIYDAEDA-PDPDQLKKAVAAF 109 (241) T ss_pred C-CEEEEECCCCC-CCHHHHHHHHHHH T ss_conf 8-85999786556-4977999999999 No 113 >cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=81.67 E-value=1.2 Score=24.25 Aligned_cols=102 Identities=9% Similarity=0.056 Sum_probs=51.5 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--C--EEEEEECCCCCCCCCCCCCC-CCCCCCCCCCCCCCC--CCCCCCCCCCCCCC Q ss_conf 2247837-479999999997898--6--89999777656431112222-222211112223333--22446852101112 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGI--R--EILIISTPRDLPVLKEFLGS-GEKWGVQFSYIEQLV--PAGLAQSYILGAEF 96 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi--~--~I~ii~~~~~~~~~~~~l~~-g~~~g~~i~y~~q~~--p~Gta~Ai~~a~~~ 96 (292) ++|.++. ..|...|+.+.+... + ||+||-+ .+.+--.+.+.. ......++..+.... -.|-++|+..+... T Consensus 2 iIP~~Ne~~~i~~~l~sl~~q~yp~~~~EVivvdd-~StD~T~~i~~~~~~~~~~~~~~~~~~~~~~~gk~~aln~g~~~ 80 (229) T cd04192 2 VIAARNEAENLPRLLQSLSALDYPKEKFEVILVDD-HSTDGTVQILEFAAAKPNFQLKILNNSRVSISGKKNALTTAIKA 80 (229) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCCCEEEEEEEC-CCCCCHHHHHHHHHHCCCCCEEECCCCCCCCCHHHHHHHHHHHH T ss_conf 99988988999999999995789988689999989-79716799999997056974553024677772599999999986 Q ss_pred CCCCCCCEEE-CCCCCCCCCCCCHHHHHCCCHH Q ss_conf 3356752132-1443211110000111000001 Q gi|254780922|r 97 IGDSSSVLIL-GDNVFYGSDISDIFHKARARRN 128 (292) Q Consensus 97 i~~~~~~lil-gDni~~~~~l~~~l~~~~~~~~ 128 (292) ...+ +++++ +|++....-+..++..+..... T Consensus 81 a~ge-~i~~lDaD~~~~~~~l~~~~~~~~~~~~ 112 (229) T cd04192 81 AKGD-WIVTTDADCVVPSNWLLTFVAFIQKEQI 112 (229) T ss_pred CCCC-EEEEECCCCCCCHHHHHHHHHHHHCCCC T ss_conf 4677-6998567656587699999999748994 No 114 >cd06439 CESA_like_1 CESA_like_1 is a member of the cellulose synthase (CESA) superfamily. This is a subfamily of cellulose synthase (CESA) superfamily. CESA superfamily includes a wide variety of glycosyltransferase family 2 enzymes that share the common characteristic of catalyzing the elongation of polysaccharide chains. The members of the superfamily include cellulose synthase catalytic subunit, chitin synthase, glucan biosynthesis protein and other families of CESA-like proteins. Probab=79.59 E-value=1.1 Score=24.48 Aligned_cols=95 Identities=14% Similarity=0.162 Sum_probs=53.9 Q ss_pred EEEECC-EEHHHHHHHHHHHCCC--C--EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 224783-7479999999997898--6--8999977765643111222222221111222333322446852101112335 Q gi|254780922|r 25 MLPIYN-KPMIYYPVSTLMDAGI--R--EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGD 99 (292) Q Consensus 25 Llpi~g-kplI~~~l~~l~~~Gi--~--~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~ 99 (292) ++|.++ .+.|.-.|+.+..... + ||+||.+ ...+--.+.+.....- .+....+..-.|-++|+..+.....+ T Consensus 34 iIP~yNE~~~i~~~l~sl~~q~Yp~~~~eVIvvdD-~StD~T~ei~~~~~~~--~~~v~~~~~n~Gk~~AlN~gi~~a~g 110 (251) T cd06439 34 IIPAYNEEAVIEAKLENLLALDYPRDRLEIIVVSD-GSTDGTAEIAREYADK--GVKLLRFPERRGKAAALNRALALATG 110 (251) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCCCEEEEEEEC-CCCHHHHHHHHHHHCC--CCEEECCCCCCCHHHHHHHHHHHCCC T ss_conf 99738988999999999996699988789999979-9981499999997455--85043167776799999999987678 Q ss_pred CCCCEEECCCCCCCCC-CCCHHHHH Q ss_conf 6752132144321111-00001110 Q gi|254780922|r 100 SSSVLILGDNVFYGSD-ISDIFHKA 123 (292) Q Consensus 100 ~~~~lilgDni~~~~~-l~~~l~~~ 123 (292) +-++++-+|+++ ..+ +..++... T Consensus 111 d~i~~lDaD~~~-~~~~l~~l~~~f 134 (251) T cd06439 111 EIVVFTDANALL-DPDALRLLVRHF 134 (251) T ss_pred CEEEECCCCCCC-CHHHHHHHHHHH T ss_conf 989833787651-876999999986 No 115 >cd06437 CESA_CaSu_A2 Cellulose synthase catalytic subunit A2 (CESA2) is a catalytic subunit or a catalytic subunit substitute of the cellulose synthase complex. Cellulose synthase (CESA) catalyzes the polymerization reaction of cellulose using UDP-glucose as the substrate. Cellulose is an aggregate of unbranched polymers of beta-1,4-linked glucose residues, which is an abundant polysaccharide produced by plants and in varying degrees by several other organisms including algae, bacteria, fungi, and even some animals. Genomes from higher plants harbor multiple CESA genes. There are ten in Arabidopsis. At least three different CESA proteins are required to form a functional complex. In Arabidopsis, CESA1, 3 and 6 and CESA4, 7 and 8, are required for cellulose biosynthesis during primary and secondary cell wall formation. CESA2 is very closely related to CESA6 and is viewed as a prime substitute for CESA6. They functionally compensate each other. The cesa2 and cesa6 double mutant plants we Probab=79.31 E-value=1.2 Score=24.21 Aligned_cols=99 Identities=17% Similarity=0.165 Sum_probs=55.3 Q ss_pred CEEEECCE-EHHHHHHHHHHHCCC--C--EEEEEECCCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCC Q ss_conf 22247837-479999999997898--6--899997776564---31112222222211112223333224-468521011 Q gi|254780922|r 24 QMLPIYNK-PMIYYPVSTLMDAGI--R--EILIISTPRDLP---VLKEFLGSGEKWGVQFSYIEQLVPAG-LAQSYILGA 94 (292) Q Consensus 24 ~Llpi~gk-plI~~~l~~l~~~Gi--~--~I~ii~~~~~~~---~~~~~l~~g~~~g~~i~y~~q~~p~G-ta~Ai~~a~ 94 (292) -++|+++. ..|.-.|+.+...-. + +|+|+-+ ...+ ....+.......+.++.++....+.| -++|+..+. T Consensus 5 ViIPa~NE~~~I~~~l~sl~~q~YP~~~~~I~VvDd-stD~t~~~~~~~~~~~~~~~~~~~~~~~~~~~g~Ka~aln~gl 83 (232) T cd06437 5 VQLPVFNEKYVVERLIEAACALDYPKDRLEIQVLDD-STDETVRLAREIVEEYAAQGVNIKHVRRADRTGYKAGALAEGM 83 (232) T ss_pred EEEECCCCHHHHHHHHHHHHHCCCCCCEEEEEEEEC-CCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHH T ss_conf 999558988999999999996799998089999979-9966999999999766641996899836988886699999999 Q ss_pred CCCCCCCCCEEE-CCCCCCCCCCCCHHHHHCC Q ss_conf 123356752132-1443211110000111000 Q gi|254780922|r 95 EFIGDSSSVLIL-GDNVFYGSDISDIFHKARA 125 (292) Q Consensus 95 ~~i~~~~~~lil-gDni~~~~~l~~~l~~~~~ 125 (292) +.... ++++++ +|++ ...++...+..+.. T Consensus 84 ~~a~g-d~i~~~DaD~~-~~~d~L~~~~~~f~ 113 (232) T cd06437 84 KVAKG-EYVAIFDADFV-PPPDFLQKTPPYFA 113 (232) T ss_pred HHCCC-CEEEEECCCCC-CCHHHHHHHHHHHC T ss_conf 97789-88999776447-38479999999831 No 116 >cd04185 GT_2_like_b Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=78.23 E-value=2.3 Score=22.41 Aligned_cols=95 Identities=14% Similarity=0.156 Sum_probs=53.3 Q ss_pred EEEC-CEEHHHHHHHHHHHCC--CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CC Q ss_conf 2478-3747999999999789--86899997776564311122222222111122233332244685210111233--56 Q gi|254780922|r 26 LPIY-NKPMIYYPVSTLMDAG--IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIG--DS 100 (292) Q Consensus 26 lpi~-gkplI~~~l~~l~~~G--i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~--~~ 100 (292) +|.+ ....+.-.|+.|...- ..+|+||=+.. .+.-.+.+..... ..++.++...+-.|.|+|...+.+... +. T Consensus 3 I~t~N~~~~l~~~L~Sl~~q~~~~~eIiVVDn~S-tD~t~~~l~~~~~-~~~v~~i~~~~N~G~a~~~N~Gi~~a~~~~~ 80 (202) T cd04185 3 VVTYNRLDLLKECLDALLAQTRPPDHIIVIDNAS-TDGTAEWLTSLGD-LDNIVYLRLPENLGGAGGFYEGVRRAYELGY 80 (202) T ss_pred EEECCCHHHHHHHHHHHHHCCCCCCEEEEEECCC-CCCHHHHHHHHCC-CCCEEEEECCCCCCHHHHHHHHHHHHHHCCC T ss_conf 9002888999999999982779998899997949-8761665300035-6668998568887327899999998875598 Q ss_pred CCCEEECCCCCCCCC-CCCHHHH Q ss_conf 752132144321111-0000111 Q gi|254780922|r 101 SSVLILGDNVFYGSD-ISDIFHK 122 (292) Q Consensus 101 ~~~lilgDni~~~~~-l~~~l~~ 122 (292) ++++.+.||.+...+ +..++.. T Consensus 81 d~v~~ld~D~~~~~~~l~~l~~~ 103 (202) T cd04185 81 DWIWLMDDDAIPDPDALEKLLAY 103 (202) T ss_pred EEEEEECCCCCCCCCHHHHHHHH T ss_conf 18999899875492299999998 No 117 >COG1216 Predicted glycosyltransferases [General function prediction only] Probab=76.11 E-value=3.2 Score=21.59 Aligned_cols=200 Identities=17% Similarity=0.163 Sum_probs=91.9 Q ss_pred CCCEEEECCEEHHHHHHHHHHHCCCCEEEEE--ECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8222247837479999999997898689999--77765643111222222221111222333322446852101112335 Q gi|254780922|r 22 SKQMLPIYNKPMIYYPVSTLMDAGIREILII--STPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGD 99 (292) Q Consensus 22 pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii--~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~ 99 (292) .--++-.+...-+.-.|..+.+.......++ .+....... ..+... ...++.++.-.+=+|-|++...+...... T Consensus 6 ~~iiv~yn~~~~l~~~l~~l~~~~~~~~~iv~vDn~s~d~~~-~~~~~~--~~~~v~~i~~~~NlG~agg~n~g~~~a~~ 82 (305) T COG1216 6 SIIIVTYNRGEDLVECLASLAAQTYPDDVIVVVDNGSTDGSL-EALKAR--FFPNVRLIENGENLGFAGGFNRGIKYALA 82 (305) T ss_pred EEEEEEECCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHH-HHHHHH--CCCCEEEEECCCCCEEHHHHHHHHHHHHC T ss_conf 999999688889999999997176876359980799973228-887760--58868999889886708888899999862 Q ss_pred --CCCCEEECCCCCCCCC-CCCHHHHHCCCHHHHHCCCCCCCCC------CCCCCCCCC---CCCCHHHCCCCCCCCCCC Q ss_conf --6752132144321111-0000111000001110012222100------012221222---210001001333332222 Q gi|254780922|r 100 --SSSVLILGDNVFYGSD-ISDIFHKARARRNSATVVGCHVQNP------QRYGVVEVD---SSNQAISIEEKPNNPKSS 167 (292) Q Consensus 100 --~~~~lilgDni~~~~~-l~~~l~~~~~~~~~~ti~~~~v~~p------~~yGvv~~d---~~~~i~~i~EKP~~~~sn 167 (292) +++++++++|.+.+.+ +.++++.+.+.... .+++..+.++ +.-| ...+ .........+.+....+. T Consensus 83 ~~~~~~l~LN~D~~~~~~~l~~ll~~~~~~~~~-~~~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (305) T COG1216 83 KGDDYVLLLNPDTVVEPDLLEELLKAAEEDPAA-GVVGPLIRNYDESLYIDRRG-GESDGLTGGWRASPLLEIAPDLSSY 160 (305) T ss_pred CCCCEEEEEECCCCCCHHHHHHHHHHHHHCCCC-EEEEEEECCCCCCCCCCCCC-CCCCCCCCCHHCCCCCCCCCCCCCC T ss_conf 898489998388687906999999999868987-79634654887532344345-6444332200024333444433342 Q ss_pred -----CCHHHHHHHHHHHHHHHHHCCCCCCC--CCCCCCCCCHHCCCCCEEEEECCCCEEEECCCCHHH Q ss_conf -----22024554100022432200467533--333334320000013204631354125632784577 Q gi|254780922|r 168 -----FAVTGIYFYDQEVVNIARNIRPSARG--ELEITDVNSYYLDKGLLAVEFLREGSAWFDAGTPES 229 (292) Q Consensus 168 -----~a~~GiY~f~~~i~~~~~~l~~s~rg--E~eitD~~~~~l~~~~~~~~~~~~g~~W~D~Gt~e~ 229 (292) +++.+.-+++.++|+.+--. +.+- =+|=+|.--.+.+.|.. +.+......|+..|.-.. T Consensus 161 ~~~~~~~~G~~~li~~~~~~~vG~~--de~~F~y~eD~D~~~R~~~~G~~-i~~~p~a~i~H~~g~s~~ 226 (305) T COG1216 161 LEVVASLSGACLLIRREAFEKVGGF--DERFFIYYEDVDLCLRARKAGYK-IYYVPDAIIYHKIGSSKG 226 (305) T ss_pred CHHHHHCCCCEEEEEHHHHHHHCCC--CHHHHHHHHHHHHHHHHHHCCCC-EEEEECEEEEECCCCCCC T ss_conf 0133322561799749999984798--76872100687999999975996-998004599973777777 No 118 >cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus. Probab=72.86 E-value=4.1 Score=20.92 Aligned_cols=101 Identities=12% Similarity=0.171 Sum_probs=50.0 Q ss_pred EEEECC-EEHHHHHHHHHHHCCC----CEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 224783-7479999999997898----68999977765643111222222221111222333322446852101112335 Q gi|254780922|r 25 MLPIYN-KPMIYYPVSTLMDAGI----REILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGD 99 (292) Q Consensus 25 Llpi~g-kplI~~~l~~l~~~Gi----~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~ 99 (292) .+|.++ ...|...|+.+.+.-. -||+||-+. +.+.-.+.+.........+.++ +.+..|.+.|...|.+.... T Consensus 5 iIp~yN~~~~l~~~l~Sl~~q~yp~~~~EVIvVDd~-S~D~t~~~~~~~~~~~~~i~~~-~~~~~~~~~a~N~gi~~a~g 82 (249) T cd02525 5 IIPVRNEEKYIEELLESLLNQSYPKDLIEIIVVDGG-STDGTREIVQEYAAKDPRIRLI-DNPKRIQSAGLNIGIRNSRG 82 (249) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCCCCEEEEEEECC-CCCCHHHHHHHHHHCCCCEEEC-CCCCCCHHHHHHHHHHHCCC T ss_conf 993569789999999999846899998899999895-9615699999997407713211-35556589999898996687 Q ss_pred CCCCEEECCCCCCCCC-CCCHHHHHCCCHH Q ss_conf 6752132144321111-0000111000001 Q gi|254780922|r 100 SSSVLILGDNVFYGSD-ISDIFHKARARRN 128 (292) Q Consensus 100 ~~~~lilgDni~~~~~-l~~~l~~~~~~~~ 128 (292) ++++.+-+|.....+ +..++....+... T Consensus 83 -d~i~~lD~D~~~~~~~l~~~~~~~~~~~~ 111 (249) T cd02525 83 -DIIIRVDAHAVYPKDYILELVEALKRTGA 111 (249) T ss_pred -CEEEEECCCCCCCHHHHHHHHHHHHCCCC T ss_conf -76884147765695699999999877897 No 119 >COG1215 Glycosyltransferases, probably involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane] Probab=67.45 E-value=4.5 Score=20.63 Aligned_cols=105 Identities=17% Similarity=0.263 Sum_probs=58.9 Q ss_pred CEEEECCE-E-HHHHHHHHHHHCCCC--EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCC Q ss_conf 22247837-4-799999999978986--899997776564311122222222111122233-332244685210111233 Q gi|254780922|r 24 QMLPIYNK-P-MIYYPVSTLMDAGIR--EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQ-LVPAGLAQSYILGAEFIG 98 (292) Q Consensus 24 ~Llpi~gk-p-lI~~~l~~l~~~Gi~--~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q-~~p~Gta~Ai~~a~~~i~ 98 (292) -++|.++. | .++-.++.+.+.... ||+++......+..+-....+.+++.++....+ .+..|-++|+..+..... T Consensus 58 viiP~ynE~~~~~~~~l~s~~~~dyp~~evivv~d~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~gK~~al~~~l~~~~ 137 (439) T COG1215 58 VIIPAYNEEPEVLEETLESLLSQDYPRYEVIVVDDGSTDETYEILEELGAEYGPNFRVIYPEKKNGGKAGALNNGLKRAK 137 (439) T ss_pred EEEECCCCCHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHCC T ss_conf 99836999867899999999827898715999968998309999999974437857999624455524899998875057 Q ss_pred CCCCCEEE-CCCCCCCCCCCCHHHHHCCCHHH Q ss_conf 56752132-14432111100001110000011 Q gi|254780922|r 99 DSSSVLIL-GDNVFYGSDISDIFHKARARRNS 129 (292) Q Consensus 99 ~~~~~lil-gDni~~~~~l~~~l~~~~~~~~~ 129 (292) . ++++++ +|++....-+.+++......... T Consensus 138 ~-d~V~~~DaD~~~~~d~l~~~~~~f~~~~~~ 168 (439) T COG1215 138 G-DVVVILDADTVPEPDALRELVSPFEDPPVG 168 (439) T ss_pred C-CEEEEECCCCCCCCHHHHHHHHHHCCCCEE T ss_conf 8-889998388778711999999973258804 No 120 >PRK10063 predicted glycosyl transferase; Provisional Probab=65.48 E-value=4.2 Score=20.83 Aligned_cols=80 Identities=13% Similarity=0.176 Sum_probs=45.0 Q ss_pred CEEHHHHHHHHHHH---C---CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 37479999999997---8---986899997776564311122222222111122-2333322446852101112335675 Q gi|254780922|r 30 NKPMIYYPVSTLMD---A---GIREILIISTPRDLPVLKEFLGSGEKWGVQFSY-IEQLVPAGLAQSYILGAEFIGDSSS 102 (292) Q Consensus 30 gkplI~~~l~~l~~---~---Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y-~~q~~p~Gta~Ai~~a~~~i~~~~~ 102 (292) +.+-|...|+.+.+ . .+ |.+||-+... +--.+++. ++..++.+ .+.++..|+.+|...+...-.++-. T Consensus 12 ~~~~l~~Tl~Sv~~~~~~~~~~~-E~IIIDGgSt-DgT~~ii~---~~~~~~~~~~iSEpD~GIYdAmNKGI~~A~Gd~I 86 (248) T PRK10063 12 NLEGIVKTHASLRHLAQDPGISF-EWIVVDGGSN-DGTREYLE---NLNGIFNLRFVSEPDNGIYDAMNKGIAMAQGKFA 86 (248) T ss_pred CHHHHHHHHHHHHHHHHCCCCCE-EEEEEECCCC-CCHHHHHH---HCCCCCCEEEEEECCCCCHHHHHHHHHHCCCCEE T ss_conf 98999999999999861878986-9999979396-11799999---7066666799993898808898851987788889 Q ss_pred CEEECCCCCCCC Q ss_conf 213214432111 Q gi|254780922|r 103 VLILGDNVFYGS 114 (292) Q Consensus 103 ~lilgDni~~~~ 114 (292) ..+.+||.|+.. T Consensus 87 ~FLNsdD~~~~~ 98 (248) T PRK10063 87 LFLNSGDIFHQD 98 (248) T ss_pred EEECCCCCCCCC T ss_conf 998388664847 No 121 >TIGR03552 F420_cofC 2-phospho-L-lactate guanylyltransferase CofC. Members of this protein family are the CofC enzyme of coenzyme F420 biosynthesis. Probab=62.79 E-value=4.9 Score=20.41 Aligned_cols=109 Identities=17% Similarity=0.277 Sum_probs=59.7 Q ss_pred EEEEECCCCC---CCCCHH-HCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 0799248887---455334-438982222478374799999999978986899997776564311122222222111122 Q gi|254780922|r 2 KGIVLAGGSG---TRLRPL-TDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSY 77 (292) Q Consensus 2 kaiIlAaG~G---tRl~P~-T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y 77 (292) +|||+.=.+. |||.|. +..---.| -+-|+..+++.+..+++.+|++++... .. ..+ ...+|+.+ T Consensus 1 ~aviPvK~~~~aKtRLs~~L~~~eR~~L----~~~ml~dtl~~l~~~~~~~v~vvt~d~--~~-~~~---a~~~g~~~-- 68 (195) T TIGR03552 1 RAVIPVKRLANAKSRLSPVLSPEEREEL----ALAMLRDVITALRGAGAGAVLVVSPDP--AL-LEA---ARNLGAPV-- 68 (195) T ss_pred CEEEECCCCCCCCCCCCCCCCHHHHHHH----HHHHHHHHHHHHHHCCCCCEEEEECCH--HH-HHH---HHHCCCEE-- T ss_conf 9178368887452004530799999999----999999999999858997189990987--89-999---98719979-- Q ss_pred CCCCCCCCCCCCCCCCCCCCCC-CCCCEEECCCC--CCCCCCCCHHHHH Q ss_conf 2333322446852101112335-67521321443--2111100001110 Q gi|254780922|r 78 IEQLVPAGLAQSYILGAEFIGD-SSSVLILGDNV--FYGSDISDIFHKA 123 (292) Q Consensus 78 ~~q~~p~Gta~Ai~~a~~~i~~-~~~~lilgDni--~~~~~l~~~l~~~ 123 (292) +.|.. .|+..|+..+...... ...+++++.|+ +...++..++... T Consensus 69 i~~~~-~gLn~al~~a~~~~~~~~~~vlii~aDlP~l~~~~l~~~l~~~ 116 (195) T TIGR03552 69 LRDPG-PGLNNALNAALAEAREPGGAVLILMADLPLLTPRELKRLLAAA 116 (195) T ss_pred EECCC-CCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHC T ss_conf 82699-8889999999999873899389965887889999999999844 No 122 >PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional Probab=62.35 E-value=10 Score=18.35 Aligned_cols=157 Identities=11% Similarity=0.182 Sum_probs=70.6 Q ss_pred EEEECCE-----EHHHHHHHHHHHCCCC-EEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-----4799999999978986-899997776564311122222222111122233332244685210111233 Q gi|254780922|r 25 MLPIYNK-----PMIYYPVSTLMDAGIR-EILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIG 98 (292) Q Consensus 25 Llpi~gk-----plI~~~l~~l~~~Gi~-~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~ 98 (292) .+|+++- ++++.....+...+.+ ||++|-+.......+.+..-.+..+.++..+.=..--|-..|+..+.++-. T Consensus 11 ViP~yNEe~~i~~~~~~~~~~l~~~~~~~EiI~VDDgS~D~T~~~l~~l~~~~~~~v~~i~lsRNfG~~~Ai~aGl~~a~ 90 (324) T PRK10714 11 VIPVYNEQESLPELIRRTTAACESLGKEYEILLIDDGSSDNSAEMLVEASQAEDSHIISILLNRNYGQHSAIMAGFSHVT 90 (324) T ss_pred EEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCC T ss_conf 98017778779999999999998679998999998999867799999986305996899989889885689999987347 Q ss_pred CCCCCEEECCCCCCCCC-CCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCH-HHCCCCCCC-CCCCCCHHHHHH Q ss_conf 56752132144321111-000011100000111001222210001222122221000-100133333-222222024554 Q gi|254780922|r 99 DSSSVLILGDNVFYGSD-ISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQA-ISIEEKPNN-PKSSFAVTGIYF 175 (292) Q Consensus 99 ~~~~~lilgDni~~~~~-l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i-~~i~EKP~~-~~sn~a~~GiY~ 175 (292) + ++++++-+|+=+... +.++++. .+.+.+. |.++..+....+ .. .--.++ ..+..+... +.+++ .++.-. T Consensus 91 G-d~vi~mD~DLQdpPe~Ip~li~~-~~~G~Dv-V~~~r~~r~~~~--~r-~~~s~l~~~l~~~~~~~~~~d~-~~~frl 163 (324) T PRK10714 91 G-DLIITLDADLQNPPEEIPRLVAK-ADEGYDV-VGTVRQNRQDSW--FR-KTASKMINRLIQRTTGKAMGDY-GCMLRA 163 (324) T ss_pred C-CEEEEECCCCCCCHHHHHHHHHH-HHCCCCE-EEEEECCCCCCH--HH-HHHHHHHHHHHHHHCCCCCCCC-CHHHHH T ss_conf 9-98999868887598899999998-7048868-999888887757--88-9999999999998539987875-312441 Q ss_pred HHHHHHHHHHHCC Q ss_conf 1000224322004 Q gi|254780922|r 176 YDQEVVNIARNIR 188 (292) Q Consensus 176 f~~~i~~~~~~l~ 188 (292) ++.++.+.+.+.. T Consensus 164 ~~r~vv~~l~~~~ 176 (324) T PRK10714 164 YRRHIVDAMLHCH 176 (324) T ss_pred HHHHHHHHHHHCC T ss_conf 3799999999637 No 123 >cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier. Probab=61.03 E-value=8.8 Score=18.80 Aligned_cols=94 Identities=13% Similarity=0.174 Sum_probs=49.6 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC----CEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 2247837-479999999997898----68999977765643111222222221111222333322446852101112335 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGI----REILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGD 99 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi----~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~ 99 (292) ++|.++. ..|.-.|..+.+... -+|++|.. ...+.-.+.. +.++.+..........|-++|+..|..+... T Consensus 2 lIPa~NEe~vI~~ti~~l~~~~YP~~~~eIivvdD-~stD~T~~~a---~~~~~~v~~~~~~~~~gK~~aln~~~~~~~~ 77 (183) T cd06438 2 LIPAHNEEAVIGNTVRSLKAQDYPRELYRIFVVAD-NCTDDTAQVA---RAAGATVLERHDPERRGKGYALDFGFRHLLN 77 (183) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCCCCEEEEEEEC-CCCCHHHHHH---HHCCCEEEEEECCCCCCCHHHHHHHHHHHHC T ss_conf 88207809999999999983689978569999848-9862089999---9839969997377777628899988999842 Q ss_pred ----CCCCEEE-CCCCCCCCCCCCHHHHH Q ss_conf ----6752132-14432111100001110 Q gi|254780922|r 100 ----SSSVLIL-GDNVFYGSDISDIFHKA 123 (292) Q Consensus 100 ----~~~~lil-gDni~~~~~l~~~l~~~ 123 (292) .++++++ +|++ ...++...+..+ T Consensus 78 ~~~~~d~v~i~DAD~~-~~~d~l~~~~~~ 105 (183) T cd06438 78 LADDPDAVVVFDADNL-VDPNALEELNAR 105 (183) T ss_pred CCCCCCEEEEECCCCC-CCHHHHHHHHHH T ss_conf 4788668999556667-797899999999 No 124 >cd02522 GT_2_like_a GT_2_like_a represents a glycosyltransferase family-2 subfamily with unknown function. Glycosyltransferase family 2 (GT-2) subfamily of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=58.49 E-value=7 Score=19.44 Aligned_cols=93 Identities=20% Similarity=0.199 Sum_probs=48.1 Q ss_pred EEEECC-EEHHHHHHHHHHHCC--CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 224783-747999999999789--86899997776564311122222222111122233332244685210111233567 Q gi|254780922|r 25 MLPIYN-KPMIYYPVSTLMDAG--IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGDSS 101 (292) Q Consensus 25 Llpi~g-kplI~~~l~~l~~~G--i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~~~ 101 (292) .+|.++ ...|.-.|+.+.+.- --||+||-+... +--...+ +.++..+ . ..+.|.|.|...+...-.++ T Consensus 4 IIP~yNe~~~l~~~L~Sl~~q~~~~~EvIVVDdgSt-D~T~~i~---~~~~~~~---~-~~~~G~a~a~N~G~~~A~Ge- 74 (221) T cd02522 4 IIPTLNEAENLPRLLASLRRLNPLPLEIIVVDGGST-DGTVAIA---RSAGVVV---I-SSPKGRARQMNAGAAAARGD- 74 (221) T ss_pred EEEECCCHHHHHHHHHHHHCCCCCCEEEEEEECCCC-CCHHHHH---HHHCCEE---E-ECCCCHHHHHHHHHHHCCCC- T ss_conf 996078888999999999757899838999989698-7649999---9725225---5-26978799999999867888- Q ss_pred CCEEE-CCCCCCCCC-CCCHHHHHCCCH Q ss_conf 52132-144321111-000011100000 Q gi|254780922|r 102 SVLIL-GDNVFYGSD-ISDIFHKARARR 127 (292) Q Consensus 102 ~~lil-gDni~~~~~-l~~~l~~~~~~~ 127 (292) +++.+ +|+. ...+ +..++....... T Consensus 75 ~i~flDaD~~-~~~~~l~~l~~~~~~~~ 101 (221) T cd02522 75 WLLFLHADTR-LPPDWDAAIIETLRADG 101 (221) T ss_pred EEEEECCCCC-CCCCHHHHHHHHHHCCC T ss_conf 7998775236-89069999999997399 No 125 >pfam01983 CofC Guanylyl transferase CofC like. Coenzyme F420 is a hydride carrier cofactor that functions during methanogenesis. This family of proteins represents CofC, a nucleotidyl transferase that is involved in coenzyme F420 biosynthesis. CofC has been shown to catalyse the formation of lactyl-2-diphospho-5'-guanosine from 2-phospho-L-lactate and GTP. Probab=52.18 E-value=7.2 Score=19.36 Aligned_cols=107 Identities=15% Similarity=0.172 Sum_probs=55.8 Q ss_pred CEEEEECCCCC---CCCCHHHCCC-CCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 90799248887---4553344389-8222247837479999999997898689999777656431112222222211112 Q gi|254780922|r 1 MKGIVLAGGSG---TRLRPLTDLL-SKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFS 76 (292) Q Consensus 1 MkaiIlAaG~G---tRl~P~T~~~-pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~ 76 (292) |++||+.-.+. |||.+.-..- -..| =+-|+..+|..+.. +. +++|+... .+ ..+.. ...++.+ T Consensus 1 M~vvIPvK~f~~aKtRLs~~Ls~eeR~~L----a~~Ml~dVl~al~~--~~-v~vvs~d~--~v-~~~a~--~~~g~~v- 67 (217) T pfam01983 1 MRIIIPVSANNHPKTRLSSILSEEERKEL----LRLMLLDVIDALKP--VD-VLVFSEDE--VV-LPSAL--DVLGVEV- 67 (217) T ss_pred CEEEEECCCCCCCCHHCCCCCCHHHHHHH----HHHHHHHHHHHHHC--CC-EEEEECCH--HH-HHHHH--HCCCCEE- T ss_conf 97999716887540002310799999999----99999999998633--86-69993758--87-78888--6179548- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCC-CCCCCCCCCHHHH Q ss_conf 223333224468521011123356752132144-3211110000111 Q gi|254780922|r 77 YIEQLVPAGLAQSYILGAEFIGDSSSVLILGDN-VFYGSDISDIFHK 122 (292) Q Consensus 77 y~~q~~p~Gta~Ai~~a~~~i~~~~~~lilgDn-i~~~~~l~~~l~~ 122 (292) +. +...|+-.|+..+....+.++.+++.+|= ++...++..++.. T Consensus 68 -~~-~~~~gLN~Av~~a~~~~~~~~v~Iv~aDLPli~~~~l~~ll~~ 112 (217) T pfam01983 68 -VV-ETESDLNTAVNQAFMAPEEAPVIIIPSDIPLISKEVLKRFLET 112 (217) T ss_pred -EE-CCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHH T ss_conf -73-4874678999999972899868997076678998999999983 No 126 >cd02520 Glucosylceramide_synthase Glucosylceramide synthase catalyzes the first glycosylation step of glycosphingolipid synthesis. UDP-glucose:N-acylsphingosine D-glucosyltransferase (glucosylceramide synthase or ceramide glucosyltransferase) catalyzes the first glycosylation step of glycosphingolipid synthesis. Its product, glucosylceramide, serves as the core of more than 300 glycosphingolipids (GSL). GSLs are a group of membrane components that have the lipid portion embedded in the outer plasma membrane leaflet and the sugar chains extended to the outer environment. Several lines of evidence suggest the importance of GSLs in various cellular processes such as differentiation, adhesion, proliferation, and cell-cell recognition. In pathogenic fungus Cryptococcus neoformans, glucosylceramide serves as an antigen that elicits an antibody response in patients and it is essential for fungal growth in host extracellular environment. Probab=51.86 E-value=12 Score=17.92 Aligned_cols=95 Identities=17% Similarity=0.274 Sum_probs=51.4 Q ss_pred CEEEECC-EEHHHHHHHHHHHCCCC--EEEEEECCCCC---CCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCC Q ss_conf 2224783-74799999999978986--89999777656---4311122222222111122233332244---68521011 Q gi|254780922|r 24 QMLPIYN-KPMIYYPVSTLMDAGIR--EILIISTPRDL---PVLKEFLGSGEKWGVQFSYIEQLVPAGL---AQSYILGA 94 (292) Q Consensus 24 ~Llpi~g-kplI~~~l~~l~~~Gi~--~I~ii~~~~~~---~~~~~~l~~g~~~g~~i~y~~q~~p~Gt---a~Ai~~a~ 94 (292) -++|++| .+.|...|+.+.+.... ||+|+...... ++.+++..... +.++..+.+..+.|. ++++..+. T Consensus 5 IivP~~nee~~i~~~l~sll~qdYp~~Eiivv~d~s~D~t~~i~~~~~~~~p--~~~~~~~~~~~~~g~~~K~~~l~~g~ 82 (196) T cd02520 5 ILKPLCGVDPNLYENLESFFQQDYPKYEILFCVQDEDDPAIPVVRKLIAKYP--NVDARLLIGGEKVGINPKVNNLIKGY 82 (196) T ss_pred EEEECCCCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHCC--CCCEEEEECCCCCCCCHHHHHHHHHH T ss_conf 9996899927799999999966899808999978999978999999874287--76659997688878448999999999 Q ss_pred CCCCCCCCCEEECCCCCCCCC-CCCHHH Q ss_conf 123356752132144321111-000011 Q gi|254780922|r 95 EFIGDSSSVLILGDNVFYGSD-ISDIFH 121 (292) Q Consensus 95 ~~i~~~~~~lilgDni~~~~~-l~~~l~ 121 (292) ....+ +++++.-.|.+...+ +..++. T Consensus 83 ~~a~g-dii~~~DaD~~~~~~~L~~lv~ 109 (196) T cd02520 83 EEARY-DILVISDSDISVPPDYLRRMVA 109 (196) T ss_pred HHCCC-CEEEEECCCCCCCHHHHHHHHH T ss_conf 96458-9999988997729559999999 No 127 >KOG2148 consensus Probab=50.44 E-value=20 Score=16.61 Aligned_cols=61 Identities=23% Similarity=0.441 Sum_probs=44.8 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 78457799999999998888098662099999886898989999988642588489999999864 Q gi|254780922|r 224 AGTPESLLDTAVFVRNIENRLGLYVACPEEIAYRHDFINESQFFQLIDHFGNSPYGLYLRQVVEK 288 (292) Q Consensus 224 ~Gt~e~l~~As~~i~~~e~~~g~~i~~~ee~a~~~~~i~~~~l~~~~~~~~~~~y~~~l~~~~~~ 288 (292) +|.| |.+-+.|+.+++++.---| .||||+|+.-| ++-++++-|..||.++-.+-|-++.++ T Consensus 744 ~GrP--Lekln~Ffe~v~~~vaqGi-rpeeiSYQ~aF-sk~elrkvi~qypgkEVkKglenlYKK 804 (867) T KOG2148 744 MGRP--LEKLNTFFEGVNNKVAQGI-RPEEISYQLAF-SKQELRKVIKQYPGKEVKKGLENLYKK 804 (867) T ss_pred HCCC--HHHHHHHHHHHHHHHHCCC-CHHHHHHHHHH-HHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 0681--7889999986667765389-78774788876-277799999977647788799999999 No 128 >cd06436 GlcNAc-1-P_transferase N-acetyl-glucosamine transferase is involved in the synthesis of Poly-beta-1,6-N-acetyl-D-glucosamine. N-acetyl-glucosamine transferase is responsible for the synthesis of bacteria Poly-beta-1,6-N-acetyl-D-glucosamine (PGA). Poly-beta-1,6-N-acetyl-D-glucosamine is a homopolymer that serves as an adhesion for the maintenance of biofilm structural stability in diverse eubacteria. N-acetyl-glucosamine transferase is the product of gene pgaC. Genetic analysis indicated that all four genes of the pgaABCD locus were required for the PGA production, pgaC being a glycosyltransferase. Probab=49.18 E-value=12 Score=18.06 Aligned_cols=93 Identities=10% Similarity=0.151 Sum_probs=45.4 Q ss_pred EEEECC-EEHHHHHHHHHHHCC-CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCC Q ss_conf 224783-747999999999789-868999977765643111222222221111222333---322446852101112335 Q gi|254780922|r 25 MLPIYN-KPMIYYPVSTLMDAG-IREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQL---VPAGLAQSYILGAEFIGD 99 (292) Q Consensus 25 Llpi~g-kplI~~~l~~l~~~G-i~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~---~p~Gta~Ai~~a~~~i~~ 99 (292) |+|-++ ...|...|+.|.+.- -.+|+|| ...+.+.-...+.... ..-++..+.+. ...|-++|+..|..++.. T Consensus 2 lIPa~NEe~~I~~ti~sl~~~~~~~eIivv-dDgS~D~T~~~~~~~~-~~~~~~vi~~~~~~~~~GK~~ALN~al~~~~~ 79 (191) T cd06436 2 LVPCLNEEAVIQRTLASLLRNKPNFLVLVI-DDASDDDTAGIVRLAI-TDSRVHLLRRHLPNARTGKGDALNAAYDQIRQ 79 (191) T ss_pred EEEECCCHHHHHHHHHHHHCCCCCCEEEEE-ECCCCCCHHHHHHHHH-HCCCEEEEEECCCCCCCCCHHHHHHHHHHHHH T ss_conf 873078789999999999738999689999-7999920999999983-08987999954776678515788889998664 Q ss_pred ----------CCCCEEE-CCCCCCCCC-CCCHH Q ss_conf ----------6752132-144321111-00001 Q gi|254780922|r 100 ----------SSSVLIL-GDNVFYGSD-ISDIF 120 (292) Q Consensus 100 ----------~~~~lil-gDni~~~~~-l~~~l 120 (292) +.+++++ +|+++ ..+ +..++ T Consensus 80 ~~~~~~~~~~~eii~v~DAD~~~-~~d~L~~~~ 111 (191) T cd06436 80 ILIEEGADPERVIIAVIDADGRL-DPNALEAVA 111 (191) T ss_pred HHHHCCCCCCCEEEEEECCCCCC-CHHHHHHHH T ss_conf 32000135664189995378732-889999999 No 129 >cd06420 GT2_Chondriotin_Pol_N N-terminal domain of Chondroitin polymerase functions as a GalNAc transferase. Chondroitin polymerase is a two domain, bi-functional protein. The N-terminal domain functions as a GalNAc transferase. The bacterial chondroitin polymerase catalyzes elongation of the chondroitin chain by alternatively transferring the GlcUA and GalNAc moiety from UDP-GlcUA and UDP-GalNAc to the non-reducing ends of the chondroitin chain. The enzyme consists of N-terminal and C-terminal domains in which the two active sites catalyze the addition of GalNAc and GlcUA, respectively. Chondroitin chains range from 40 to over 100 repeating units of the disaccharide. Sulfated chondroitins are involved in the regulation of various biological functions such as central nervous system development, wound repair, infection, growth factor signaling, and morphogenesis, in addition to its conventional structural roles. In Caenorhabditis elegans, chondroitin is an essential factor for the worm Probab=47.98 E-value=16 Score=17.15 Aligned_cols=89 Identities=12% Similarity=0.203 Sum_probs=46.0 Q ss_pred EEEECCE-EHHHHHHHHHHHCCC--CEEEEEECCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCC--CCCCCCCCCCCCC Q ss_conf 2247837-479999999997898--689999777656431112222-222211112223333224--4685210111233 Q gi|254780922|r 25 MLPIYNK-PMIYYPVSTLMDAGI--REILIISTPRDLPVLKEFLGS-GEKWGVQFSYIEQLVPAG--LAQSYILGAEFIG 98 (292) Q Consensus 25 Llpi~gk-plI~~~l~~l~~~Gi--~~I~ii~~~~~~~~~~~~l~~-g~~~g~~i~y~~q~~p~G--ta~Ai~~a~~~i~ 98 (292) .+|.++. ..|.-.|+.+.+... .||+||-+ .+.+.-.+++.. .....+++.++.++. .| .+.|...|...-. T Consensus 2 IIptyN~~~~L~~~L~Sl~~Qt~~~~EIIvvDd-~StD~t~~~~~~~~~~~~~~i~~~~~~~-~g~~~~~arN~gi~~a~ 79 (182) T cd06420 2 IITTYNRPEALELVLKSVLNQSILPFEVIIADD-GSTEETKELIEEFKSQFPIPIKHVWQED-EGFRKAKIRNKAIAAAK 79 (182) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCCEEEEEEC-CCCHHHHHHHHHHHHHCCCCCCEEEECC-CCHHHHHHHHHHHHHCC T ss_conf 998779889999999999837999958999979-9961489999999851887410001005-67268999999998558 Q ss_pred CCCCCEEECCCCCCCCCC Q ss_conf 567521321443211110 Q gi|254780922|r 99 DSSSVLILGDNVFYGSDI 116 (292) Q Consensus 99 ~~~~~lilgDni~~~~~l 116 (292) . ++++.+-||.+...++ T Consensus 80 g-~~i~flD~D~~~~~~~ 96 (182) T cd06420 80 G-DYLIFIDGDCIPHPDF 96 (182) T ss_pred C-CEEEEECCCCCCCCHH T ss_conf 8-7288878983279109 No 130 >COG2179 Predicted hydrolase of the HAD superfamily [General function prediction only] Probab=47.13 E-value=12 Score=17.88 Aligned_cols=106 Identities=22% Similarity=0.361 Sum_probs=57.5 Q ss_pred CCEEEECC---EEHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 22224783---747999999999789868999977765643111222222221111222333322446852101112335 Q gi|254780922|r 23 KQMLPIYN---KPMIYYPVSTLMDAGIREILIISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIGD 99 (292) Q Consensus 23 K~Llpi~g---kplI~~~l~~l~~~Gi~~I~ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~~ 99 (292) -.|+|..+ -|=+.-++..+..+|++= +|++| ....-+... ..++|++..| .-.+|.+- |+..|..-.+- T Consensus 37 NTLv~wd~~~~tpe~~~W~~e~k~~gi~v-~vvSN-n~e~RV~~~---~~~l~v~fi~-~A~KP~~~--~fr~Al~~m~l 108 (175) T COG2179 37 NTLVPWDNPDATPELRAWLAELKEAGIKV-VVVSN-NKESRVARA---AEKLGVPFIY-RAKKPFGR--AFRRALKEMNL 108 (175) T ss_pred CCEECCCCCCCCHHHHHHHHHHHHCCCEE-EEEEC-CCHHHHHHH---HHHCCCCEEE-CCCCCCHH--HHHHHHHHCCC T ss_conf 75110469989999999999998659779-99818-978888766---6525972340-22596279--99999998099 Q ss_pred -CCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCC Q ss_conf -67521321443211110000111000001110012222100012 Q gi|254780922|r 100 -SSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRY 143 (292) Q Consensus 100 -~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~y 143 (292) .+-+++.||-+|++. +-- +...--||++.++.+|..+ T Consensus 109 ~~~~vvmVGDqL~TDV-----lgg--nr~G~~tIlV~Pl~~~d~~ 146 (175) T COG2179 109 PPEEVVMVGDQLFTDV-----LGG--NRAGMRTILVEPLVAPDGW 146 (175) T ss_pred CHHHEEEECCHHHHHH-----HCC--CCCCCEEEEEEEECCCCCH T ss_conf 8368799851255666-----413--4147279999871166405 No 131 >cd05133 RasGAP_IQGAP1 IQGAP1 is a homodimeric protein that is widely expressed among vertebrate cell types from early embryogenesis. Mammalian IQGAP1 protein is the best characterized member of the IQGAP family, and contains several protein-interacting domains. Human IQGAP1 is most similar to mouse Iqgap1 (94% identity) and has 62% identity to human IQGAP2. IQGAP1 binds and cross-links actin filaments in vitro and has been implicated in Ca2+/calmodulin signaling, E-cadherin-dependent cell adhesion, cell motility, and invasion. Yeast IQGAP homologues have a role in the recruitment of actin filaments, are components of the spindle pole body, and are required for actomyosin ring assembly and cytokinesis. Furthermore, IQGAP1 over-expression has also been detected in gastric and colorectal carcinomas and gastric cancer cell lines. Probab=45.75 E-value=6.8 Score=19.52 Aligned_cols=111 Identities=7% Similarity=0.136 Sum_probs=54.3 Q ss_pred CCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCC Q ss_conf 00012221222210001001333332222220245541000224322004675333333343200000132046313541 Q gi|254780922|r 139 NPQRYGVVEVDSSNQAISIEEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREG 218 (292) Q Consensus 139 ~p~~yGvv~~d~~~~i~~i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g 218 (292) .|..||+++....+.+.. ..+.|++... +++..+...++-...+-.+.. ++.|+.+....+..+-.. T Consensus 190 sPe~f~ivd~~~~~~ls~------~~RrNL~~Ia------KvLq~~as~k~F~~~~~~l~p-LN~fI~~~~~~~~~fl~~ 256 (360) T cd05133 190 APDAFDIIDLSAGGQLTT------DQRRNLGSIA------KMLQHAASNKMFLGDNAHLSI-INEYLSQSYQKFRRFFQS 256 (360) T ss_pred CCHHCCCCCCCCCCCCCH------HHHHHHHHHH------HHHHHHHCCCCCCCCCHHHHH-HHHHHHHHHHHHHHHHHH T ss_conf 841128112676666798------9998799999------999999769988898517777-899999999999999998 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHCCCCCHH Q ss_conf 2563278457799999999998888098662099999886898989 Q gi|254780922|r 219 SAWFDAGTPESLLDTAVFVRNIENRLGLYVACPEEIAYRHDFINES 264 (292) Q Consensus 219 ~~W~D~Gt~e~l~~As~~i~~~e~~~g~~i~~~ee~a~~~~~i~~~ 264 (292) -.||..|+++++...|-..+....-..--++.||..-+..+.+. T Consensus 257 --~~~V~d~e~~f~idey~dl~~~~kP~i~It~~eI~~~H~Ll~~~ 300 (360) T cd05133 257 --ACEVPELQDKFNVDEYSDLVTLTKPVIYISIGEIINTHTLLLDH 300 (360) T ss_pred --HCCCCCHHHHCCHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHH T ss_conf --46889999973811588998653984897699999999999987 No 132 >TIGR01326 OAH_OAS_sulfhy O-acetylhomoserine aminocarboxypropyltransferase/cysteine synthase; InterPro: IPR006235 This family of sequences is a distinct clade of the Cys/Met metabolism pyridoxal phosphate-dependent enzyme superfamily. Members include examples of OAH/OAS sulfhydrylase, an enzyme with activity both as O-acetylhomoserine sulfhydrylase (OAH, 2.5.1.49 from EC) and O-acetylserine sulfhydrylase (OAS, 2.5.1.47 from EC). An alternate name for OAH sulfhydrylase is homocysteine synthase.; GO: 0016765 transferase activity transferring alkyl or aryl (other than methyl) groups, 0006520 amino acid metabolic process. Probab=41.92 E-value=16 Score=17.13 Aligned_cols=56 Identities=20% Similarity=0.380 Sum_probs=38.4 Q ss_pred CEEHHHHHHHHHHHCCCCEEEEEECCC-C---CCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCCCC Q ss_conf 374799999999978986899997776-5---6431112222222211112223333-224468521 Q gi|254780922|r 30 NKPMIYYPVSTLMDAGIREILIISTPR-D---LPVLKEFLGSGEKWGVQFSYIEQLV-PAGLAQSYI 91 (292) Q Consensus 30 gkplI~~~l~~l~~~Gi~~I~ii~~~~-~---~~~~~~~l~~g~~~g~~i~y~~q~~-p~Gta~Ai~ 91 (292) |..-+-+.|-+|..|| ++| |++.. - ..+|.+ .-+++||+.+|+.++. |.-..-||. T Consensus 81 G~AA~~~Ailnla~AG-DnI--VsS~~LYGGTynLF~~---TlkrlGIevrFvd~dd~pe~~~k~id 141 (434) T TIGR01326 81 GQAAITYAILNLAQAG-DNI--VSSSYLYGGTYNLFKH---TLKRLGIEVRFVDGDDDPEELEKAID 141 (434) T ss_pred HHHHHHHHHHHHHHCC-CCE--EECCCCCCHHHHHHHH---HHHHCCEEEEEECCCCCHHHHHHHCC T ss_conf 5799999999997269-826--9806444225789999---55544814887278888789997606 No 133 >PRK09860 putative alcohol dehydrogenase; Provisional Probab=41.91 E-value=18 Score=16.79 Aligned_cols=93 Identities=14% Similarity=0.191 Sum_probs=44.6 Q ss_pred CCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 5334438982222478374799999999978986899997776--56431112222222211112223333224468521 Q gi|254780922|r 14 LRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPR--DLPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYI 91 (292) Q Consensus 14 l~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~--~~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~ 91 (292) |..+|...|...+ .|+--++..-+.+...|.++++|+++.. ....+++....-+..|+.......-+|.=+-..+. T Consensus 1 M~~ftF~~Pt~i~--fG~g~~~~l~~~~~~~G~k~~lvvt~~~~~~~g~~~~v~~~L~~~gi~~~~f~~V~~nP~~~~v~ 78 (383) T PRK09860 1 MAASTFFIPSVNV--IGADSLTDAMNMMADYGFTRTLIVTDNMLTKLGMAGDVQKALEERNIFSVIYDGTQPNPTTENVA 78 (383) T ss_pred CCCCEEECCCCEE--ECCCHHHHHHHHHHHCCCCEEEEECCCCHHHCCHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHH T ss_conf 9974895898149--88389999999999829987999828456657469999999987699589968952796999999 Q ss_pred CCCCCCC--CCCCCEEECC Q ss_conf 0111233--5675213214 Q gi|254780922|r 92 LGAEFIG--DSSSVLILGD 108 (292) Q Consensus 92 ~a~~~i~--~~~~~lilgD 108 (292) .+.+..+ +-++++-+|+ T Consensus 79 ~~~~~~r~~~~D~ivavGG 97 (383) T PRK09860 79 AGLKLLKENNCDSVISLGG 97 (383) T ss_pred HHHHHHHHCCCCEEEEECC T ss_conf 9999998739999999389 No 134 >PRK04015 chromatin protein; Provisional Probab=41.58 E-value=27 Score=15.75 Aligned_cols=31 Identities=19% Similarity=0.472 Sum_probs=23.7 Q ss_pred CEEEECCEEHHHHHHHHHHH--CCCCEEEEEEC Q ss_conf 22247837479999999997--89868999977 Q gi|254780922|r 24 QMLPIYNKPMIYYPVSTLMD--AGIREILIIST 54 (292) Q Consensus 24 ~Llpi~gkplI~~~l~~l~~--~Gi~~I~ii~~ 54 (292) ..+-|+.||+..|.+..+.. .|.++|.+-.. T Consensus 5 ~~I~IG~KP~m~YV~avvtqf~~g~~eV~iKAR 37 (91) T PRK04015 5 NTVLVGKKPVMNYVLAVVTQFNEGAKEVVIKAR 37 (91) T ss_pred CEEEECCCCHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 889991885799999999998779977999961 No 135 >cd03409 Chelatase_Class_II Class II Chelatase: a family of ATP-independent monomeric or homodimeric enzymes that catalyze the insertion of metal into protoporphyrin rings. This family includes protoporphyrin IX ferrochelatase (HemH), sirohydrochlorin ferrochelatase (SirB) and the cobaltochelatases, CbiK and CbiX. HemH and SirB are involved in heme and siroheme biosynthesis, respectively, while the cobaltochelatases are associated with cobalamin biosynthesis. Excluded from this family are the ATP-dependent heterotrimeric chelatases (class I) and the multifunctional homodimeric enzymes with dehydrogenase and chelatase activities (class III). Probab=40.24 E-value=28 Score=15.62 Aligned_cols=48 Identities=23% Similarity=0.438 Sum_probs=29.9 Q ss_pred EECCCCCCCCC-HHHCCC---------CCCEEEEC------CEEHHHHHHHHHHHCCCCEEEEE Q ss_conf 92488874553-344389---------82222478------37479999999997898689999 Q gi|254780922|r 5 VLAGGSGTRLR-PLTDLL---------SKQMLPIY------NKPMIYYPVSTLMDAGIREILII 52 (292) Q Consensus 5 IlAaG~GtRl~-P~T~~~---------pK~Llpi~------gkplI~~~l~~l~~~Gi~~I~ii 52 (292) ||.=|.|||+. |.-+.+ -.|..|+. ..|-++-.|+.|...|+++|+++ T Consensus 2 iL~v~hGs~~~dpy~~~ie~~a~~i~~~~~~~~v~~~f~e~~~P~i~eai~~l~~~G~~~ivvv 65 (101) T cd03409 2 LLVVGHGSPYKDPYKKDIEAQAHNLAESLPDFPYYVGFQSGLGPDTEEAIRELAEEGYQRVVIV 65 (101) T ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCEEHHHHHHCCCCHHHHHHHHHHCCCCEEEEE T ss_conf 6999669997666089999999999988799824143797719999999999997698679998 No 136 >cd05131 RasGAP_IQGAP2 IQGAP2 is a member of the IQGAP family that contains a calponin-homology (CH) domain which binds F-actin, IQGAP-specific repeat, a single WW domain, four IQ motifs which mediate interactions with calmodulin, and a Ras-GTPase-activating protein (GAP)-related domain that binds Rho family GTPases. IQGAP2 and IQGAP3 play important roles in the regulation of the cytoskeleton for axon outgrowth in hippocampal neurons and are thought to stay in a common regulatory pathway. The results of RNA interference studies indicated that IQGAP3 partially compensates functions of IQGAP2, but has lesser ability than IQGAP2 to promote axon outgrowth in hippocampal neuron. Morevover, IQGAP2 is required for the cadherin-mediated cell-to-cell adhesion in Xenopus laevis embryos. Probab=39.48 E-value=9.7 Score=18.55 Aligned_cols=111 Identities=14% Similarity=0.202 Sum_probs=53.1 Q ss_pred CCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHCCCCCEEEEECCCC Q ss_conf 00012221222210001001333332222220245541000224322004675333333343200000132046313541 Q gi|254780922|r 139 NPQRYGVVEVDSSNQAISIEEKPNNPKSSFAVTGIYFYDQEVVNIARNIRPSARGELEITDVNSYYLDKGLLAVEFLREG 218 (292) Q Consensus 139 ~p~~yGvv~~d~~~~i~~i~EKP~~~~sn~a~~GiY~f~~~i~~~~~~l~~s~rgE~eitD~~~~~l~~~~~~~~~~~~g 218 (292) .|..||+++....+.+.. ..+.|++... +++..+...++....+-.+.. ++.|+.+....+..+-.. T Consensus 190 sPe~f~iid~~~~~~l~~------~qRrNL~~Ia------KvLq~~as~k~F~~~~~~l~p-LN~fI~~~~~~~~~fl~~ 256 (339) T cd05131 190 APDGFDIIDMTAGGQIHS------DQRRNLGSVA------KVLQHAASNKLFEGENDHLSS-MNSYLSQTYQKFRKFFQA 256 (339) T ss_pred CCHHCCCCCCCCCCCCCH------HHHHHHHHHH------HHHHHHHCCCCCCCCCHHHHH-HHHHHHHHHHHHHHHHHH T ss_conf 841128302576666798------9997699999------999999769978886336667-899999999999999998 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHCCCCCHH Q ss_conf 2563278457799999999998888098662099999886898989 Q gi|254780922|r 219 SAWFDAGTPESLLDTAVFVRNIENRLGLYVACPEEIAYRHDFINES 264 (292) Q Consensus 219 ~~W~D~Gt~e~l~~As~~i~~~e~~~g~~i~~~ee~a~~~~~i~~~ 264 (292) -.||..++++++...|-..+....-..--++.||..-+..+.+. T Consensus 257 --l~~V~d~e~~f~id~y~dl~~~~kP~i~It~~eI~~~H~Ll~~~ 300 (339) T cd05131 257 --ACDVPEPEEKFNVDEYSDLVTLSKPVIYISIEEIINTHSLLLEH 300 (339) T ss_pred --HCCCCCHHHHCCHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHH T ss_conf --63889999972831488998732984896699999999999984 No 137 >pfam11931 DUF3449 Domain of unknown function (DUF3449). This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is typically between 181 to 207 amino acids in length. This domain has two conserved sequence motifs: PIP and CEICG. Probab=38.76 E-value=14 Score=17.50 Aligned_cols=39 Identities=15% Similarity=0.113 Sum_probs=18.4 Q ss_pred CEEHHHHHHHHHHHCCCCEEEEEEC---CCCCCCCCCCCCCCC Q ss_conf 3747999999999789868999977---765643111222222 Q gi|254780922|r 30 NKPMIYYPVSTLMDAGIREILIIST---PRDLPVLKEFLGSGE 69 (292) Q Consensus 30 gkplI~~~l~~l~~~Gi~~I~ii~~---~~~~~~~~~~l~~g~ 69 (292) |||| =|+|-.|...|+.=-+=||+ |.....|++||..++ T Consensus 76 GkPI-PyWLYKLhGL~~ey~CEICGN~~Y~GrkaFekHF~E~r 117 (187) T pfam11931 76 GKPI-PYWLYKLHGLGKEFKCEICGNTSYKGRKAFEKHFSEWR 117 (187) T ss_pred CCCC-CHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHCCCHH T ss_conf 9855-28999970799716657568851015789998707013 No 138 >KOG2638 consensus Probab=38.69 E-value=30 Score=15.47 Aligned_cols=59 Identities=15% Similarity=0.191 Sum_probs=36.2 Q ss_pred EEECCCCCCCCCHHHCCCCCCEEEEC-CEEHHHHHH---HHHH-HCCCCEE-EEEECCCCCCCCCCCC Q ss_conf 99248887455334438982222478-374799999---9999-7898689-9997776564311122 Q gi|254780922|r 4 IVLAGGSGTRLRPLTDLLSKQMLPIY-NKPMIYYPV---STLM-DAGIREI-LIISTPRDLPVLKEFL 65 (292) Q Consensus 4 iIlAaG~GtRl~P~T~~~pK~Llpi~-gkplI~~~l---~~l~-~~Gi~~I-~ii~~~~~~~~~~~~l 65 (292) +=|-||.||-|. -.-||.+++|. |...+|.++ ++|- ..+++-- ++..++.-.+.-++++ T Consensus 107 lKLNGGlGttmG---c~gPKS~ieVR~g~tFLDL~V~QIe~LN~~Y~~dVPlvLMNSfnTdedT~kil 171 (498) T KOG2638 107 LKLNGGLGTTMG---CKGPKSVIEVRDGLTFLDLTVRQIENLNKTYNVDVPLVLMNSFNTDEDTQKIL 171 (498) T ss_pred EEECCCCCCCCC---CCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHH T ss_conf 994588677545---67873069972897215789999998776328997779850556617799999 No 139 >COG1788 AtoD Acyl CoA:acetate/3-ketoacid CoA transferase, alpha subunit [Lipid metabolism] Probab=37.85 E-value=31 Score=15.39 Aligned_cols=35 Identities=26% Similarity=0.474 Sum_probs=25.6 Q ss_pred EEECCCCCCCCCHHHCCCCCCEEEECCEEHHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 99248887455334438982222478374799999999978986899997776 Q gi|254780922|r 4 IVLAGGSGTRLRPLTDLLSKQMLPIYNKPMIYYPVSTLMDAGIREILIISTPR 56 (292) Q Consensus 4 iIlAaG~GtRl~P~T~~~pK~Llpi~gkplI~~~l~~l~~~Gi~~I~ii~~~~ 56 (292) -|+.||+|+..-|. ..+..+++.|++|..+|++.- T Consensus 20 ti~~gGFg~~g~P~------------------alI~ali~~GvkdLt~I~n~~ 54 (220) T COG1788 20 TIMIGGFGTCGIPE------------------ALIHALIRQGVKDLTVISNNA 54 (220) T ss_pred EEEECCCCCCCCHH------------------HHHHHHHHCCCCCEEEEECCC T ss_conf 99986545668969------------------999999974986359994688 No 140 >TIGR02429 pcaI_scoA_fam 3-oxoacid CoA-transferase, A subunit; InterPro: IPR012792 CoA-transferases catalyse the reversible transfer of of coenzyme A from CoA-thioesters to free acids, and can be divided into three families . Family I includes transferases for 3-oxoacids (2.8.3.5 from EC, 2.8.3.6 from EC), short-chain fatty acids (2.8.3.8 from EC, 2.8.3.9 from EC) and glutaconate (2.8.3.12 from EC). Most of the family I enzymes use acetyl-CoA or succinyl-CoA as CoA donors, and are composed of two separate polypeptides, subunits A and B, which generally aggregate as heterodimers or heterotetramers. The eukaryotic enzymes, however, are generally composed of a single two-domain polypetide representing a fusion of the A and B subunits. The transfer of CoA from one substrate to another occurs via a ping pong mechanism which involves the formation of thioester bond between CoA and a conserved glutamate residue at the active site of the enzyme . This entry represents the CoA-binding A subunit of family I CoA-transferases. This domain forms a three-layer alpha-beta-alpha sandwich where the central layer is an all parallel beta-sheet, against which helices pack from both sides , . The active site is thought to be located at the interface of the A and B subunits and formed by loops from both subunits.; GO: 0008410 CoA-transferase activity. Probab=36.14 E-value=31 Score=15.40 Aligned_cols=76 Identities=21% Similarity=0.273 Sum_probs=39.9 Q ss_pred EEEECCCCCCCCCHHH------CCCCCCEEEECCE-EHHHHHHHHHHHCC-CCEEEEEEC-CCCCCCCCCCCCCCCCC-C Q ss_conf 7992488874553344------3898222247837-47999999999789-868999977-76564311122222222-1 Q gi|254780922|r 3 GIVLAGGSGTRLRPLT------DLLSKQMLPIYNK-PMIYYPVSTLMDAG-IREILIIST-PRDLPVLKEFLGSGEKW-G 72 (292) Q Consensus 3 aiIlAaG~GtRl~P~T------~~~pK~Llpi~gk-plI~~~l~~l~~~G-i~~I~ii~~-~~~~~~~~~~l~~g~~~-g 72 (292) |=||-+|+||-=.|.- ..-+|.|.=|.+- -==+.=|..|..+| ++++ ||+ +++.+. +.+.+..+- . T Consensus 20 aTi~igGFG~~G~P~eLi~aLi~~g~k~LTivsNNAG~G~~GLa~Ll~ag~V~kl--icSFP~~~ds--~~F~~~Y~aG~ 95 (222) T TIGR02429 20 ATILIGGFGTAGLPEELIDALIETGAKDLTIVSNNAGNGEIGLAALLKAGQVRKL--ICSFPRQSDS--YVFDELYRAGK 95 (222) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHCCCEEEE--EEECCCCCCC--HHHHHHHHCCC T ss_conf 6698627798887589999999706777778863778803689999834970278--8636889982--35788852496 Q ss_pred CCCCCCCCCC Q ss_conf 1112223333 Q gi|254780922|r 73 VQFSYIEQLV 82 (292) Q Consensus 73 ~~i~y~~q~~ 82 (292) ++++.+.|.+ T Consensus 96 ieLElVPQG~ 105 (222) T TIGR02429 96 IELELVPQGT 105 (222) T ss_pred EEEEEECCCC T ss_conf 1688607864 No 141 >pfam04007 DUF354 Protein of unknown function (DUF354). Members of this family are around 350 amino acids in length. They are found in archaebacteria and have no known function. Probab=34.44 E-value=35 Score=15.07 Aligned_cols=12 Identities=25% Similarity=0.163 Sum_probs=6.7 Q ss_pred HHHHHHHHHCCC Q ss_conf 999999997898 Q gi|254780922|r 35 YYPVSTLMDAGI 46 (292) Q Consensus 35 ~~~l~~l~~~Gi 46 (292) -..+..|.+.|. T Consensus 17 k~iI~eL~k~Gh 28 (335) T pfam04007 17 KPIISELEKEGY 28 (335) T ss_pred HHHHHHHHHCCC T ss_conf 999999986898 No 142 >TIGR00677 fadh2_euk methylenetetrahydrofolate reductase; InterPro: IPR004621 The enzyme activities methylenetetrahydrofolate reductase (1.5.1.20 from EC) and 5,10-methylenetetrahydrofolate reductase (FADH) (1.7.99.5 from EC) differ in that the former (assigned in many eukaryotes) is defined to use NADP+ as an acceptor, while the latter (assigned in many bacteria) is flexible with respect to the acceptor. Both convert 5-methyltetrahydrofolate to 5,10-methylenetetrahydrofolate. From a larger set of proteins assigned as one or the other, this family describes the subset of proteins found in eukaryotes, and currently designated methylenetetrahydrofolate reductase (1.5.1.20 from EC). This protein is an FAD-containing flavoprotein.; GO: 0004489 methylenetetrahydrofolate reductase (NADPH) activity, 0006555 methionine metabolic process. Probab=34.18 E-value=35 Score=15.04 Aligned_cols=25 Identities=24% Similarity=0.212 Sum_probs=19.9 Q ss_pred EEHHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 74799999999978986899997776 Q gi|254780922|r 31 KPMIYYPVSTLMDAGIREILIISTPR 56 (292) Q Consensus 31 kplI~~~l~~l~~~Gi~~I~ii~~~~ 56 (292) ..+|+++|+++..+||+.|+ ..... T Consensus 79 ~e~ID~AL~~~~~~G~~NiL-ALRGD 103 (312) T TIGR00677 79 IEMIDDALERAKSNGIQNIL-ALRGD 103 (312) T ss_pred HHHHHHHHHHHHHCCCCEEE-CCCCC T ss_conf 78899999999865653340-03761 No 143 >pfam07959 Fucokinase L-fucokinase. In the salvage pathway of GDP-L-fucose, free cytosolic fucose is phosphorylated by L-fucokinase to form L-fucose-L-phosphate, which is then further converted to GDP-L-fucose in the reaction catalysed by GDP-L-fucose pyrophosphorylase. Probab=31.69 E-value=39 Score=14.77 Aligned_cols=29 Identities=31% Similarity=0.416 Sum_probs=16.5 Q ss_pred CCCCCEEEEECCCCEEEECCCCHHHHHHHH Q ss_conf 001320463135412563278457799999 Q gi|254780922|r 205 LDKGLLAVEFLREGSAWFDAGTPESLLDTA 234 (292) Q Consensus 205 l~~~~~~~~~~~~g~~W~D~Gt~e~l~~As 234 (292) ++.-.+.+..+..+.+ +-+||-..|+..- T Consensus 224 L~~~~l~v~~l~~~~F-~H~GTs~E~L~~~ 252 (414) T pfam07959 224 LKGTSLNVVVLPDGGF-YHFGTSAEYLDHL 252 (414) T ss_pred HCCCCEEEEEECCCEE-EECCCCHHHHHHH T ss_conf 6088337999089778-7705889999863 No 144 >PRK05802 hypothetical protein; Provisional Probab=30.07 E-value=41 Score=14.60 Aligned_cols=20 Identities=15% Similarity=0.134 Sum_probs=11.3 Q ss_pred CCCCCHHCCCCCEEEEECCC Q ss_conf 34320000013204631354 Q gi|254780922|r 198 TDVNSYYLDKGLLAVEFLRE 217 (292) Q Consensus 198 tD~~~~~l~~~~~~~~~~~~ 217 (292) ..+++.++++|......+.. T Consensus 189 vpVikkL~~n~NkV~vIid~ 208 (328) T PRK05802 189 VPVIKKLYSNGNKVIVILDK 208 (328) T ss_pred HHHHHHHHHCCCEEEEEECC T ss_conf 59999998689979999858 No 145 >COG1581 Ssh10b Archaeal DNA-binding protein [Transcription] Probab=29.60 E-value=42 Score=14.55 Aligned_cols=33 Identities=21% Similarity=0.517 Sum_probs=26.1 Q ss_pred CCCEEEECCEEHHHHHHHHH--HHCCCCEEEEEEC Q ss_conf 82222478374799999999--9789868999977 Q gi|254780922|r 22 SKQMLPIYNKPMIYYPVSTL--MDAGIREILIIST 54 (292) Q Consensus 22 pK~Llpi~gkplI~~~l~~l--~~~Gi~~I~ii~~ 54 (292) +++-+-++.||...|.|..+ .+.|.+++.+-.. T Consensus 3 ~envV~vG~KPvmNYVlAvlt~fn~g~~eViiKAR 37 (91) T COG1581 3 EENVVLVGKKPVMNYVLAVLTQFNEGADEVIIKAR 37 (91) T ss_pred CCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 75379985764499999999998768978999823 No 146 >KOG2961 consensus Probab=29.43 E-value=40 Score=14.69 Aligned_cols=23 Identities=22% Similarity=0.303 Sum_probs=9.4 Q ss_pred EEEECCCCHHHHHHHHHHHHHHH Q ss_conf 25632784577999999999988 Q gi|254780922|r 219 SAWFDAGTPESLLDTAVFVRNIE 241 (292) Q Consensus 219 ~~W~D~Gt~e~l~~As~~i~~~e 241 (292) +.|..-|.-.+-.-.+.+++.++ T Consensus 163 gVw~~~gv~~~~n~i~~~~~~l~ 185 (190) T KOG2961 163 GVWTEPGVRAEENFIVRQVRRLE 185 (190) T ss_pred EEEECCCCCCCCHHHHHHHHHHH T ss_conf 68821564556408999999999 No 147 >TIGR02529 EutJ ethanolamine utilization protein EutJ family protein; InterPro: IPR013366 Salmonella typhimurium is capable of growth on ethanolamine as a sole source of carbon nitrogen and energy . During growth on this compound the cells form a multimolecular structure known as a metabolosome, which is similar to the carboxysome used by some photosynthetic bacteria to fix CO2, and is thought to contain the enzymes needed to metabolise this compound to acetyl-CoA. The metabolosome is not directly involved in the biochemistry of ethanolamine utilization - instead its role is thought to be to concentrate the enzymes involved in this process, while also protecting the cell from the build-up of toxic intermediates . The genes involved in growth on ethanolamine are encoded in a 17-gene operon known as the ethanolamine utilization (eut) operon. EutJ shows similarity to chaperonins and may play a role in assembly of the metabolosme , though it is not necessary for growth on this compound.. Probab=28.43 E-value=44 Score=14.42 Aligned_cols=29 Identities=24% Similarity=0.343 Sum_probs=19.2 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 22202455410002243220046753333333 Q gi|254780922|r 167 SFAVTGIYFYDQEVVNIARNIRPSARGELEIT 198 (292) Q Consensus 167 n~a~~GiY~f~~~i~~~~~~l~~s~rgE~eit 198 (292) .|+..|-|-.+ |+-+++.|..+++|-|+. T Consensus 144 ~LV~AG~ygi~---~EEAEe~K~~~k~e~E~F 172 (240) T TIGR02529 144 SLVLAGAYGIS---FEEAEEIKRDKKKEEEVF 172 (240) T ss_pred EEEEEECCCCC---HHHHHHHHHCCCCCEEEE T ss_conf 67886126888---889887863058843788 No 148 >COG3981 Predicted acetyltransferase [General function prediction only] Probab=27.85 E-value=45 Score=14.36 Aligned_cols=28 Identities=11% Similarity=0.158 Sum_probs=10.9 Q ss_pred CCCCCCCCEEECCCCCCCC-CCCCHHHHH Q ss_conf 2335675213214432111-100001110 Q gi|254780922|r 96 FIGDSSSVLILGDNVFYGS-DISDIFHKA 123 (292) Q Consensus 96 ~i~~~~~~lilgDni~~~~-~l~~~l~~~ 123 (292) ++....++.+-.|+-+.|. ++...+..+ T Consensus 65 ~V~~~~y~~v~~d~~ivG~i~lRh~Ln~~ 93 (174) T COG3981 65 WVPASTYWAVDEDGQIVGFINLRHQLNDF 93 (174) T ss_pred CEECEEEEEEECCCCEEEEEEEEEECCHH T ss_conf 03033689993388089999862100568 No 149 >cd03413 CbiK_C Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), C-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases, and is a homomeric enzyme that does not require ATP for its enzymatic activity. Probab=27.83 E-value=45 Score=14.36 Aligned_cols=26 Identities=19% Similarity=0.277 Sum_probs=17.0 Q ss_pred EEECCEEHHHHHHHHHHHCCCCEEEE Q ss_conf 24783747999999999789868999 Q gi|254780922|r 26 LPIYNKPMIYYPVSTLMDAGIREILI 51 (292) Q Consensus 26 lpi~gkplI~~~l~~l~~~Gi~~I~i 51 (292) --|-|.|-|+.++..|...|+++|.+ T Consensus 36 gtvEg~P~~e~vl~~L~~~g~k~V~L 61 (103) T cd03413 36 GTVEGYPGLDDVLAKLKKAGIKKVTL 61 (103) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEE T ss_conf 99578589999999999769986999 No 150 >PRK05282 peptidase E; Validated Probab=27.67 E-value=41 Score=14.59 Aligned_cols=87 Identities=14% Similarity=0.283 Sum_probs=45.8 Q ss_pred ECCEEHHHHHHHHHHHC--CCCEEEEEECCCCCCCCCCCCCC----CCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCC Q ss_conf 78374799999999978--98689999777656431112222----222211112223333-224468521011123356 Q gi|254780922|r 28 IYNKPMIYYPVSTLMDA--GIREILIISTPRDLPVLKEFLGS----GEKWGVQFSYIEQLV-PAGLAQSYILGAEFIGDS 100 (292) Q Consensus 28 i~gkplI~~~l~~l~~~--Gi~~I~ii~~~~~~~~~~~~l~~----g~~~g~~i~y~~q~~-p~Gta~Ai~~a~~~i~~~ 100 (292) +.|.+-++|.++.+... |+++|++|--..-....+.|... ...+|+++.-+.+.+ |. .| |.+. T Consensus 11 ~~g~~yL~~~~~~i~~~~~~~k~ilFIPyA~~~~~~d~Yt~~v~~af~~lg~~v~gih~~~dp~---~A-------I~~A 80 (233) T PRK05282 11 LPGTAYLEHALPLIAELLAGRRKAVFIPYAGVTQSWDDYTAKVAEVLAPLGIEVTGIHRVEDPV---AA-------IENA 80 (233) T ss_pred CCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCHH---HH-------HHHC T ss_conf 7998579999999999984796599975888879889999999999986698288762458989---99-------9749 Q ss_pred CCCEEECCCCC------CCCCCCCHHHHHC Q ss_conf 75213214432------1111000011100 Q gi|254780922|r 101 SSVLILGDNVF------YGSDISDIFHKAR 124 (292) Q Consensus 101 ~~~lilgDni~------~~~~l~~~l~~~~ 124 (292) +.+.+-|+|.| +..++.+.+.+.. T Consensus 81 d~I~vgGGNTF~Ll~~L~~~gl~~~ir~~V 110 (233) T PRK05282 81 DGIIVGGGNTFQLLKQLYERGLLAPIREAV 110 (233) T ss_pred CEEEECCCCHHHHHHHHHHCCCHHHHHHHH T ss_conf 979986973999999999858299999999 No 151 >COG0299 PurN Folate-dependent phosphoribosylglycinamide formyltransferase PurN [Nucleotide transport and metabolism] Probab=27.46 E-value=44 Score=14.39 Aligned_cols=50 Identities=28% Similarity=0.404 Sum_probs=34.9 Q ss_pred CEEEEECCCCCCCCCHHHCCCCCCE------EEECCEEHHHHHHHHHHHCCCCEEEE Q ss_conf 9079924888745533443898222------24783747999999999789868999 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLLSKQM------LPIYNKPMIYYPVSTLMDAGIREILI 51 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~pK~L------lpi~gkplI~~~l~~l~~~Gi~~I~i 51 (292) ||-+|||-|.||-|.-+-..++..- .-+.|+| =-|.|+....+||...++ T Consensus 1 ~ki~VlaSG~GSNlqaiida~~~~~~~a~i~~Visd~~-~A~~lerA~~~gIpt~~~ 56 (200) T COG0299 1 KKIAVLASGNGSNLQAIIDAIKGGKLDAEIVAVISDKA-DAYALERAAKAGIPTVVL 56 (200) T ss_pred CEEEEEEECCCCCHHHHHHHHHCCCCCCEEEEEEECCC-CCHHHHHHHHCCCCEEEE T ss_conf 94999994796559999998764898817999995797-778999999849988974 No 152 >KOG2978 consensus Probab=27.20 E-value=46 Score=14.29 Aligned_cols=103 Identities=21% Similarity=0.353 Sum_probs=52.8 Q ss_pred EEEECC----EEHHHHHHHHHHH-CCCC-EEEEEECCCCCC----CCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCC Q ss_conf 224783----7479999999997-8986-899997776564----3111222222221-111222333322446852101 Q gi|254780922|r 25 MLPIYN----KPMIYYPVSTLMD-AGIR-EILIISTPRDLP----VLKEFLGSGEKWG-VQFSYIEQLVPAGLAQSYILG 93 (292) Q Consensus 25 Llpi~g----kplI~~~l~~l~~-~Gi~-~I~ii~~~~~~~----~~~~~l~~g~~~g-~~i~y~~q~~p~Gta~Ai~~a 93 (292) ++|-++ -|++-|.+..... .|.+ +| |++...+.+ ..+.+. +.+| -++....-..-+|++-|...+ T Consensus 8 ilPtYnEk~Nlpi~~~li~~~~~e~~~~~ei-IivDD~SpDGt~~~a~~L~---k~yg~d~i~l~pR~~klGLgtAy~hg 83 (238) T KOG2978 8 ILPTYNEKENLPIITRLIAKYMSEEGKKYEI-IIVDDASPDGTQEVAKALQ---KIYGEDNILLKPRTKKLGLGTAYIHG 83 (238) T ss_pred EECCCCCCCCCEEEHHHHHHHHHHHCCCEEE-EEEECCCCCCCHHHHHHHH---HHHCCCCEEEEECCCCCCCHHHHHHH T ss_conf 8414567778716599987543553686479-9983899976099999999---98589818998636766501787756 Q ss_pred CCCCCCCCCCEEECCCCCCCCC-CCCHHHHHCCCHHHHHC Q ss_conf 1123356752132144321111-00001110000011100 Q gi|254780922|r 94 AEFIGDSSSVLILGDNVFYGSD-ISDIFHKARARRNSATV 132 (292) Q Consensus 94 ~~~i~~~~~~lilgDni~~~~~-l~~~l~~~~~~~~~~ti 132 (292) ..+..+ +|++++-.|+-+... +.++++...+..-+.+. T Consensus 84 l~~a~g-~fiviMDaDlsHhPk~ipe~i~lq~~~~~div~ 122 (238) T KOG2978 84 LKHATG-DFIVIMDADLSHHPKFIPEFIRLQKEGNYDIVL 122 (238) T ss_pred HHHCCC-CEEEEEECCCCCCCHHHHHHHHHHHCCCCCEEE T ss_conf 654368-769998076678915679999986505752566 No 153 >PRK06263 sdhA succinate dehydrogenase flavoprotein subunit; Reviewed Probab=26.85 E-value=42 Score=14.55 Aligned_cols=19 Identities=37% Similarity=0.560 Sum_probs=13.7 Q ss_pred EEEEECCCCCCCCCHHHCC Q ss_conf 0799248887455334438 Q gi|254780922|r 2 KGIVLAGGSGTRLRPLTDL 20 (292) Q Consensus 2 kaiIlAaG~GtRl~P~T~~ 20 (292) |+||||.|=..+++|.|.+ T Consensus 187 kaVILATGG~~~ly~~~tn 205 (539) T PRK06263 187 KSTILATGGAGQLYPITSN 205 (539) T ss_pred CEEEECCCCCCCCCCCCCC T ss_conf 5699898987656877679 No 154 >PRK12305 thrS threonyl-tRNA synthetase; Reviewed Probab=26.42 E-value=48 Score=14.20 Aligned_cols=52 Identities=19% Similarity=0.163 Sum_probs=28.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCEEEEHHHHHHHCCCCCHHHHHHHHHHCC-CCHHHHHHHHHHHHH Q ss_conf 2784577999999999988880986620999998868989899999886425-884899999998641 Q gi|254780922|r 223 DAGTPESLLDTAVFVRNIENRLGLYVACPEEIAYRHDFINESQFFQLIDHFG-NSPYGLYLRQVVEKK 289 (292) Q Consensus 223 D~Gt~e~l~~As~~i~~~e~~~g~~i~~~ee~a~~~~~i~~~~l~~~~~~~~-~~~y~~~l~~~~~~~ 289 (292) =.|+.|+++-.- +|.-.|- | -=|++..|..-+.-.-. +.+|.+.+.+.+.+. T Consensus 456 ilGS~ERfigiL-----iEh~~G~---------~-P~WLaP~Qv~IipV~e~~~~~ya~~v~~~L~~~ 508 (576) T PRK12305 456 LIGTYERFIAIL-----LEQTKGN---------L-PFWLSPKQVIIIPVNENLNTDYAKELNKKLFKL 508 (576) T ss_pred CCCCHHHHHHHH-----HHHCCCC---------C-CCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHC T ss_conf 547489999999-----9970799---------9-836273037999538603599999999999978 No 155 >PRK11253 ldcA L,D-carboxypeptidase A; Provisional Probab=25.51 E-value=50 Score=14.09 Aligned_cols=14 Identities=29% Similarity=0.366 Sum_probs=5.9 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 99999999978986 Q gi|254780922|r 34 IYYPVSTLMDAGIR 47 (292) Q Consensus 34 I~~~l~~l~~~Gi~ 47 (292) ++-.++.|...|.+ T Consensus 19 ~~~ai~~Le~~G~~ 32 (305) T PRK11253 19 ALRGIQRLTDAGHQ 32 (305) T ss_pred HHHHHHHHHHCCCE T ss_conf 99999999968998 No 156 >TIGR01186 proV glycine betaine/L-proline transport ATP binding subunit; InterPro: IPR005892 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family comprises the glycine betaine/L-proline ATP binding subunit in bacteria and its equivalents in archaea. This transport system belong to the larger ATP-Binding Cassette (ABC) transporter superfamily. The characteristic feature of these transporters is the obligatory coupling of ATP hydrolysis to substrate translocation. Functionally, this transport system is involved in osmoregulation. Under conditions of stress, the organism recruits this transport system to accumulate glycine betaine and other solutes which offer osmo-protection. It has been demonstrated that glycine betaine uptake is accompanied by symport with sodium ions. The locus has been named variously as proU or opuA. A gene library from L. lactis functionally complements an Escherichia coli proU mutant. The complementing locus is similar to a opuA locus in Bacillus subtlis. This clarifies the differences in nomenclature.; GO: 0005524 ATP binding, 0015171 amino acid transmembrane transporter activity, 0006865 amino acid transport, 0016020 membrane. Probab=24.75 E-value=36 Score=15.00 Aligned_cols=28 Identities=11% Similarity=0.151 Sum_probs=13.9 Q ss_pred CCCE-EEEHHHHHHHCCCCCHHHHHHHHH Q ss_conf 0986-620999998868989899999886 Q gi|254780922|r 244 LGLY-VACPEEIAYRHDFINESQFFQLID 271 (292) Q Consensus 244 ~g~~-i~~~ee~a~~~~~i~~~~l~~~~~ 271 (292) ++.. |+-.+|----.|-|++..|...+. T Consensus 334 ~~~~YvpVVDE~~~~~GIv~r~~L~~~l~ 362 (372) T TIGR01186 334 AGIKYVPVVDEDQRLVGIVTRASLVDALY 362 (372) T ss_pred CCCCEEEEECCCCCEEEEEEHHHHHHHHH T ss_conf 79956666614564586764778999871 No 157 >TIGR01819 F420_cofD LPPG:Fo 2-phospho-L-lactate transferase; InterPro: IPR010115 This entry represents LPPG:Fo 2-phospho-L-lactate transferase (CofD), which catalyses the fourth step in the biosynthesis of coenzyme F420, a flavin derivative found in methanogens, Mycobacteria, and several other lineages. This enzyme is characterised so far in Methanococcus jannaschii but appears restricted to F420-containing species and is predicted to carry out the same function in these other species. . Probab=24.34 E-value=33 Score=15.18 Aligned_cols=12 Identities=67% Similarity=1.041 Sum_probs=9.4 Q ss_pred EEECCCCCC-CCC Q ss_conf 992488874-553 Q gi|254780922|r 4 IVLAGGSGT-RLR 15 (292) Q Consensus 4 iIlAaG~Gt-Rl~ 15 (292) +|||||.|| ||. T Consensus 2 ~vLaGGtGtpkLL 14 (359) T TIGR01819 2 TVLAGGTGTPKLL 14 (359) T ss_pred EEEECCCCCHHHH T ss_conf 7863586707899 No 158 >TIGR03029 EpsG chain length determinant protein tyrosine kinase EpsG. The proteins in this family are homologs of the EpsG protein found in Methylobacillus strain 12S and are generally found in operons with other Eps homologs. The protein is believed to function as the protein tyrosine kinase component of the chain length regulator (along with the transmembrane component EpsF). Probab=23.25 E-value=55 Score=13.82 Aligned_cols=16 Identities=6% Similarity=0.015 Sum_probs=7.7 Q ss_pred ECCCCHHHHHHHHHHH Q ss_conf 3278457799999999 Q gi|254780922|r 222 FDAGTPESLLDTAVFV 237 (292) Q Consensus 222 ~D~Gt~e~l~~As~~i 237 (292) .+.-+..++.+|-..+ T Consensus 245 ~~~T~~~~l~~a~~~L 260 (274) T TIGR03029 245 VNETRLHELTSLKEHL 260 (274) T ss_pred CCCCCHHHHHHHHHHH T ss_conf 8988899999999999 No 159 >PRK00413 thrS threonyl-tRNA synthetase; Reviewed Probab=22.55 E-value=56 Score=13.74 Aligned_cols=28 Identities=11% Similarity=0.184 Sum_probs=12.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCEEECC Q ss_conf 3322446852101112335675213214 Q gi|254780922|r 81 LVPAGLAQSYILGAEFIGDSSSVLILGD 108 (292) Q Consensus 81 ~~p~Gta~Ai~~a~~~i~~~~~~lilgD 108 (292) .++.|.-+.+...+.|.-+|-.+.+..| T Consensus 362 ~E~SG~L~GL~RvR~FtqdDaHIFc~~d 389 (639) T PRK00413 362 YEKSGALHGLMRVRGFTQDDAHIFCTPE 389 (639) T ss_pred CCCCCCCCCEEEEEEEECCCCEEECCHH T ss_conf 6778744564233000015841324789 No 160 >PRK10624 L-1,2-propanediol oxidoreductase; Provisional Probab=22.23 E-value=56 Score=13.74 Aligned_cols=79 Identities=11% Similarity=0.167 Sum_probs=37.7 Q ss_pred CEEHHHHHHHHHHHCCCCEEEEEECCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCEE Q ss_conf 3747999999999789868999977765--64311122222222111122233332244685210111233--5675213 Q gi|254780922|r 30 NKPMIYYPVSTLMDAGIREILIISTPRD--LPVLKEFLGSGEKWGVQFSYIEQLVPAGLAQSYILGAEFIG--DSSSVLI 105 (292) Q Consensus 30 gkplI~~~l~~l~~~Gi~~I~ii~~~~~--~~~~~~~l~~g~~~g~~i~y~~q~~p~Gta~Ai~~a~~~i~--~~~~~li 105 (292) |+=-++..-+.+.+.|.++++||++..- ...+++....-+..++.......-+|.=+-+.+..+.+..+ +-++++. T Consensus 13 G~g~l~~l~~~~~~~G~k~~lvvtd~~~~k~g~~~~v~~~L~~~~i~~~vf~~v~pnP~~~~V~~~~~~~r~~~~D~Iia 92 (381) T PRK10624 13 GRGAVGALTDEVKRRGYHKALIVTDKTLVQCGVVAKVTDKLDAAGLAYEIYDGVKPNPTISVVKEGLEVFQASGADYLIA 92 (381) T ss_pred CCCHHHHHHHHHHHCCCCEEEEEECCCHHHCCHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 97699999999997499879999696545563699999999876983999889258989999999999998649998998 Q ss_pred ECC Q ss_conf 214 Q gi|254780922|r 106 LGD 108 (292) Q Consensus 106 lgD 108 (292) .|+ T Consensus 93 vGG 95 (381) T PRK10624 93 IGG 95 (381) T ss_pred ECC T ss_conf 089 No 161 >cd02514 GT13_GLCNAC-TI GT13_GLCNAC-TI is involved in an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. Alpha-1,3-mannosyl-glycoprotein beta-1,2-N-acetylglucosaminyltransferase (GLCNAC-T I , GNT-I) transfers N-acetyl-D-glucosamine from UDP to high-mannose glycoprotein N-oligosaccharide, an essential step in the synthesis of complex or hybrid-type N-linked oligosaccharides. The enzyme is an integral membrane protein localized to the Golgi apparatus. The catalytic domain is located at the C-terminus. These proteins are members of the glycosy transferase family 13. Probab=22.22 E-value=57 Score=13.70 Aligned_cols=118 Identities=14% Similarity=0.232 Sum_probs=58.8 Q ss_pred EEEECCEE-HHHHHHHHHHHC--CCCEEE-EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC---CCC-----CCCCCC Q ss_conf 22478374-799999999978--986899-997776564311122222222111122233332---244-----685210 Q gi|254780922|r 25 MLPIYNKP-MIYYPVSTLMDA--GIREIL-IISTPRDLPVLKEFLGSGEKWGVQFSYIEQLVP---AGL-----AQSYIL 92 (292) Q Consensus 25 Llpi~gkp-lI~~~l~~l~~~--Gi~~I~-ii~~~~~~~~~~~~l~~g~~~g~~i~y~~q~~p---~Gt-----a~Ai~~ 92 (292) ++--+++| -+...|+.|.+. +++..- +|......+...+.+ +.+|.+++++.|... ..+ -+.-.+ T Consensus 5 lV~a~NRp~~l~r~L~sLl~~~p~~~~~pIiVsqDg~~~~~~~v~---~~yg~~i~~i~~~~~~~~~~~~~~k~~~Y~~I 81 (334) T cd02514 5 LVIACNRPDYLRRMLDSLLSYRPSAEKFPIIVSQDGGYEEVADVA---KSFGDGVTHIQHPPISIKNVNPPHKFQGYYRI 81 (334) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCHHHHHHH---HHHCCEEEEEECCCCCCCCCCCHHHHHHHHHH T ss_conf 999558779999999999851866787448999479987999999---97251027896788665566752555668999 Q ss_pred CC------C--C-CCCCCCCEEECCCCCCCCCCCCHHHHHCCCHHHHHCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 11------1--2-3356752132144321111000011100000111001222210001222122221000 Q gi|254780922|r 93 GA------E--F-IGDSSSVLILGDNVFYGSDISDIFHKARARRNSATVVGCHVQNPQRYGVVEVDSSNQA 154 (292) Q Consensus 93 a~------~--~-i~~~~~~lilgDni~~~~~l~~~l~~~~~~~~~~ti~~~~v~~p~~yGvv~~d~~~~i 154 (292) |. + | ..+-+.++|+-||+-...+|.+.+..... ++ ..||+-|+|=--+.+|+- T Consensus 82 a~HYk~aL~~~F~~~~~~~vIIlEDDL~vspDFf~YF~~t~~------LL---~~D~sl~cISAWNDnG~~ 143 (334) T cd02514 82 ARHYKWALTQTFNLFGYSFVIILEDDLDIAPDFFSYFQATLP------LL---EEDPSLWCISAWNDNGKE 143 (334) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHH------HH---CCCCCEEEEEEECCCCCC T ss_conf 999999999986305887699994563335759999997568------76---159966899733167865 No 162 >TIGR00639 PurN phosphoribosylglycinamide formyltransferase; InterPro: IPR004607 This family describes phosphoribosylglycinamide formyltransferase (GAR transformylase), one of several proteins found in formyltransferase. This enzyme uses formyl tetrahydrofolate as a formyl group donor to produce 5'-phosphoribosyl-N-formylglycinamide. PurT, a different GAR transformylase, uses ATP and formate rather than formyl tetrahydrofolate. Experimental proof includes complementation of Escherichia coli purN mutants by orthologs from vertebrates (where it is a domain of a multifunctional protein), Bacillus subtilis, and Arabidopsis thaliana. In phylogenetic analyses, the member from Saccharomyces cerevisiae shows a long branch length but membership in the family, while the formyltetrahydrofolate deformylases form a closely related outgroup. ; GO: 0004644 phosphoribosylglycinamide formyltransferase activity, 0006189 'de novo' IMP biosynthetic process. Probab=21.69 E-value=35 Score=15.03 Aligned_cols=48 Identities=33% Similarity=0.500 Sum_probs=22.9 Q ss_pred CEEEEECCCCCCCCCHHHCCC--------CCCEEEECCEEHHHHHHHHHHHCCCCEE Q ss_conf 907992488874553344389--------8222247837479999999997898689 Q gi|254780922|r 1 MKGIVLAGGSGTRLRPLTDLL--------SKQMLPIYNKPMIYYPVSTLMDAGIREI 49 (292) Q Consensus 1 MkaiIlAaG~GtRl~P~T~~~--------pK~Llpi~gkplI~~~l~~l~~~Gi~~I 49 (292) ||-+||+=|.||=|+-+-..+ .+--+=|.|||= -|.++.+..++|-.. T Consensus 1 ~~i~VLiSG~GSNLqAiiDA~~~~~~~~~A~v~lVisn~p~-A~g~~RA~~~~iP~~ 56 (215) T TIGR00639 1 KRIVVLISGSGSNLQAIIDAIKEGQGKIDASVVLVISNKPK-AYGLERAAQAGIPTK 56 (215) T ss_pred CEEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCC-CHHHHHHHHCCCCEE T ss_conf 93689984287179999999852127867368999865866-266777887589467 No 163 >pfam02829 3H 3H domain. This domain is predicted to be a small molecule binding domain, based on its occurrence with other domains. The domain is named after its three conserved histidine residues. Probab=21.36 E-value=60 Score=13.59 Aligned_cols=18 Identities=39% Similarity=0.636 Sum_probs=7.8 Q ss_pred HHHHCCCCCCCCCCCCCCC Q ss_conf 1110012222100012221 Q gi|254780922|r 128 NSATVVGCHVQNPQRYGVV 146 (292) Q Consensus 128 ~~~ti~~~~v~~p~~yGvv 146 (292) .+++|.-+-|+.| -||.+ T Consensus 20 ~Gg~V~DV~veHp-~YG~i 37 (98) T pfam02829 20 NGGTVKDVIVEHP-VYGEI 37 (98) T ss_pred CCCEEEEEEEECC-CCCEE T ss_conf 8999999999468-98268 No 164 >TIGR01236 D1pyr5carbox1 1-pyrroline-5-carboxylate dehydrogenase; InterPro: IPR005931 The delta(1)-pyrroline-5-carboxylate synthetase (1.5.1.12 from EC) a mitochondrial inner membrane, ATP- and NADPH-dependent, bifunctional enzyme, catalyzes the reduction of glutamate to delta1-pyrroline-5-carboxylate, a critical step in the de novo biosynthesis of proline and ornithine. It is the rate-limiting enzyme in proline biosynthesis and is subject to feedback inhibition by proline.1-pyrroline-5-carboxylate + NAD+ + H2O = L-glutamate + NADH This model represents one of several related branches of delta-1-pyrroline-5-carboxylate dehydrogenase.; GO: 0003842 1-pyrroline-5-carboxylate dehydrogenase activity, 0006561 proline biosynthetic process, 0005759 mitochondrial matrix. Probab=20.97 E-value=44 Score=14.42 Aligned_cols=42 Identities=17% Similarity=0.330 Sum_probs=19.2 Q ss_pred CCCCCCCCCCC--CCHHHCCCCCCCCCCCCC--HHHH------HHHHHHHHH Q ss_conf 01222122221--000100133333222222--0245------541000224 Q gi|254780922|r 141 QRYGVVEVDSS--NQAISIEEKPNNPKSSFA--VTGI------YFYDQEVVN 182 (292) Q Consensus 141 ~~yGvv~~d~~--~~i~~i~EKP~~~~sn~a--~~Gi------Y~f~~~i~~ 182 (292) +.|+-..+|+. .|+++.+|.-+..++++- ..|= ||..|.|++ T Consensus 369 ~~FmgAvIde~sF~K~vk~IE~AK~~p~~~~I~~GGkyDdS~GYFV~PTv~~ 420 (551) T TIGR01236 369 SSFMGAVIDEKSFAKIVKYIERAKKDPEELEILAGGKYDDSKGYFVEPTVVE 420 (551) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCEEECCCCEEEECEEEC T ss_conf 4556641257788889999987415813507877336616743485135861 No 165 >COG3315 O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=20.55 E-value=53 Score=13.89 Aligned_cols=12 Identities=17% Similarity=0.603 Sum_probs=4.5 Q ss_pred CCHHHH-HHHHHH Q ss_conf 220245-541000 Q gi|254780922|r 168 FAVTGI-YFYDQE 179 (292) Q Consensus 168 ~a~~Gi-Y~f~~~ 179 (292) ++.-|+ ++|+++ T Consensus 174 ~iaEGLl~YL~~~ 186 (297) T COG3315 174 WIAEGLLMYLPEE 186 (297) T ss_pred EEECCCCCCCCHH T ss_conf 5524350149989 No 166 >KOG1447 consensus Probab=20.01 E-value=50 Score=14.09 Aligned_cols=14 Identities=43% Similarity=0.795 Sum_probs=10.7 Q ss_pred CEEEEECCCCCCCC Q ss_conf 90799248887455 Q gi|254780922|r 1 MKGIVLAGGSGTRL 14 (292) Q Consensus 1 MkaiIlAaG~GtRl 14 (292) .||-|||||+|.-- T Consensus 64 vKAQILAGGRGKG~ 77 (412) T KOG1447 64 VKAQILAGGRGKGV 77 (412) T ss_pred EEEEEEECCCCCCE T ss_conf 76443106765542 Done!