Query gi|255764481|ref|YP_003065182.2| UDP-N-acetylglucosamine acyltransferase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 271 No_of_seqs 249 out of 6765 Neff 6.6 Searched_HMMs 39220 Date Sun May 29 21:17:44 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 255764481.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01852 lipid_A_lpxA acyl-[a 100.0 0 0 708.1 15.5 255 10-264 1-257 (257) 2 COG1043 LpxA Acyl-[acyl carrie 100.0 0 0 629.5 15.7 259 6-264 2-260 (260) 3 PRK05289 UDP-N-acetylglucosami 100.0 0 0 592.6 20.1 260 6-267 1-260 (261) 4 cd03351 LbH_UDP-GlcNAc_AT UDP- 100.0 0 0 591.1 19.9 254 9-263 1-254 (254) 5 PRK12461 UDP-N-acetylglucosami 100.0 0 0 590.1 19.4 255 9-265 2-256 (256) 6 cd03352 LbH_LpxD UDP-3-O-acyl- 100.0 0 0 344.9 12.6 193 2-197 2-201 (205) 7 PRK00892 lpxD UDP-3-O-[3-hydro 100.0 0 0 340.3 12.7 193 2-197 111-311 (343) 8 TIGR01853 lipid_A_lpxD UDP-3-O 100.0 1.4E-45 0 304.9 11.5 193 2-197 114-316 (336) 9 cd03353 LbH_GlmU_C N-acetyl-gl 100.0 2.6E-39 6.6E-44 266.3 11.2 180 8-196 3-193 (193) 10 COG1044 LpxD UDP-3-O-[3-hydrox 100.0 8.4E-39 2.1E-43 263.2 13.7 219 2-241 112-337 (338) 11 PRK00892 lpxD UDP-3-O-[3-hydro 100.0 9.7E-35 2.5E-39 237.7 12.5 225 6-243 97-339 (343) 12 TIGR01173 glmU UDP-N-acetylglu 100.0 1.2E-34 3.1E-39 237.1 8.7 181 3-207 273-458 (461) 13 TIGR01853 lipid_A_lpxD UDP-3-O 100.0 2.8E-32 7.2E-37 222.3 9.6 202 7-221 101-323 (336) 14 PRK09451 glmU bifunctional N-a 100.0 8.5E-32 2.2E-36 219.3 10.1 181 3-207 267-452 (456) 15 TIGR03308 phn_thr-fam phosphon 100.0 1.8E-29 4.7E-34 204.7 13.4 172 2-220 3-174 (204) 16 COG1044 LpxD UDP-3-O-[3-hydrox 100.0 1.6E-28 4.2E-33 198.8 11.7 186 6-203 98-301 (338) 17 COG1207 GlmU N-acetylglucosami 100.0 6.1E-29 1.6E-33 201.5 9.0 181 3-207 270-455 (460) 18 cd04745 LbH_paaY_like paaY-lik 99.9 6.1E-28 1.6E-32 195.2 8.8 105 85-220 39-145 (155) 19 cd04650 LbH_FBP Ferripyochelin 99.9 5.8E-28 1.5E-32 195.3 8.5 108 85-223 39-148 (154) 20 cd04645 LbH_gamma_CA_like Gamm 99.9 7.3E-28 1.9E-32 194.7 8.8 143 9-221 1-145 (153) 21 TIGR03532 DapD_Ac 2,3,4,5-tetr 99.9 4.9E-25 1.3E-29 177.0 10.4 127 6-197 85-211 (231) 22 PRK13627 carnitine operon prot 99.9 2.3E-25 5.8E-30 179.1 7.6 146 5-220 8-155 (196) 23 TIGR03570 NeuD_NnaD sugar O-ac 99.9 2.6E-25 6.7E-30 178.8 7.8 102 86-194 100-201 (201) 24 cd03358 LbH_WxcM_N_like WcxM-l 99.9 1.4E-24 3.5E-29 174.2 8.7 119 22-198 1-119 (119) 25 cd03352 LbH_LpxD UDP-3-O-acyl- 99.9 2.8E-24 7E-29 172.4 9.9 160 19-190 1-178 (205) 26 PRK05289 UDP-N-acetylglucosami 99.9 7E-24 1.8E-28 169.8 10.7 163 1-177 2-188 (261) 27 cd05636 LbH_G1P_TT_C_like Puta 99.9 6.5E-24 1.7E-28 170.0 10.3 152 6-180 4-163 (163) 28 cd03350 LbH_THP_succinylT 2,3, 99.9 9.3E-24 2.4E-28 169.1 10.2 119 8-191 2-120 (139) 29 cd03360 LbH_AT_putative Putati 99.9 2E-24 5E-29 173.3 6.5 84 110-193 114-197 (197) 30 cd03351 LbH_UDP-GlcNAc_AT UDP- 99.9 1E-23 2.6E-28 168.8 10.0 162 2-177 12-185 (254) 31 COG0663 PaaY Carbonic anhydras 99.9 6.1E-24 1.5E-28 170.2 8.0 159 5-224 9-170 (176) 32 PRK12461 UDP-N-acetylglucosami 99.9 1.6E-23 4E-28 167.7 9.2 161 2-177 13-185 (256) 33 cd03353 LbH_GlmU_C N-acetyl-gl 99.9 6.3E-23 1.6E-27 163.8 10.1 159 2-176 16-190 (193) 34 cd04646 LbH_Dynactin_6 Dynacti 99.9 3.5E-23 9E-28 165.4 6.5 94 85-197 38-136 (164) 35 cd03359 LbH_Dynactin_5 Dynacti 99.9 2.5E-22 6.3E-27 160.1 9.0 150 43-224 7-160 (161) 36 cd00710 LbH_gamma_CA Gamma car 99.9 4.4E-22 1.1E-26 158.6 6.9 133 49-220 14-154 (167) 37 TIGR01852 lipid_A_lpxA acyl-[a 99.9 9.3E-22 2.4E-26 156.5 8.2 227 2-253 11-249 (257) 38 cd03349 LbH_XAT Xenobiotic acy 99.9 3.2E-21 8.2E-26 153.2 9.1 68 145-220 72-139 (145) 39 cd04646 LbH_Dynactin_6 Dynacti 99.8 1.3E-21 3.3E-26 155.7 6.4 157 9-187 1-160 (164) 40 TIGR01173 glmU UDP-N-acetylglu 99.8 1.8E-20 4.7E-25 148.4 8.5 146 9-182 260-418 (461) 41 cd03357 LbH_MAT_GAT Maltose O- 99.8 9.9E-21 2.5E-25 150.1 6.2 56 142-197 114-169 (169) 42 PRK10502 putative colanic acid 99.8 4.2E-20 1.1E-24 146.2 9.3 122 19-198 52-177 (179) 43 PRK09527 lacA galactoside O-ac 99.8 2.2E-20 5.7E-25 147.9 7.3 124 21-197 57-182 (203) 44 PRK09451 glmU bifunctional N-a 99.8 4.9E-20 1.2E-24 145.8 9.0 143 9-179 254-409 (456) 45 cd04647 LbH_MAT_like Maltose O 99.8 6.1E-21 1.5E-25 151.5 3.3 85 113-197 24-109 (109) 46 PRK09677 putative lipopolysacc 99.8 2.9E-20 7.5E-25 147.2 6.6 63 143-212 127-189 (192) 47 COG1043 LpxA Acyl-[acyl carrie 99.8 6.3E-20 1.6E-24 145.1 7.8 162 2-177 16-189 (260) 48 cd04645 LbH_gamma_CA_like Gamm 99.8 1.4E-19 3.6E-24 142.9 9.6 134 3-188 1-138 (153) 49 cd05636 LbH_G1P_TT_C_like Puta 99.8 1.3E-19 3.4E-24 143.1 8.0 155 2-161 6-162 (163) 50 TIGR02287 PaaY phenylacetic ac 99.8 3E-20 7.6E-25 147.1 4.3 151 5-225 6-161 (193) 51 cd05825 LbH_wcaF_like wcaF-lik 99.8 1.4E-19 3.6E-24 142.9 4.3 55 143-197 53-107 (107) 52 cd04745 LbH_paaY_like paaY-lik 99.8 1.1E-18 2.9E-23 137.3 8.7 62 127-188 77-139 (155) 53 cd00710 LbH_gamma_CA Gamma car 99.8 1.2E-18 3.2E-23 137.0 8.3 160 8-222 3-166 (167) 54 cd04650 LbH_FBP Ferripyochelin 99.8 3.2E-18 8E-23 134.5 9.4 134 3-188 2-139 (154) 55 PRK10092 maltose O-acetyltrans 99.7 5.9E-19 1.5E-23 139.0 3.5 56 142-197 125-180 (183) 56 COG1207 GlmU N-acetylglucosami 99.7 2.3E-17 5.8E-22 129.1 8.7 152 8-182 256-415 (460) 57 cd03354 LbH_SAT Serine acetylt 99.7 3.6E-18 9.1E-23 134.1 4.1 49 145-193 53-101 (101) 58 PRK13627 carnitine operon prot 99.7 3.7E-17 9.5E-22 127.8 7.6 166 2-232 11-177 (196) 59 COG0110 WbbJ Acetyltransferase 99.7 4.3E-17 1.1E-21 127.3 7.4 120 85-217 67-187 (190) 60 KOG4750 consensus 99.7 2E-17 5.2E-22 129.4 3.6 106 86-198 135-252 (269) 61 cd03359 LbH_Dynactin_5 Dynacti 99.7 3E-16 7.6E-21 122.1 9.3 125 37-187 21-149 (161) 62 PRK11132 cysE serine acetyltra 99.6 5.3E-16 1.3E-20 120.6 6.1 54 146-199 193-246 (273) 63 TIGR01172 cysE serine O-acetyl 99.6 3.7E-16 9.3E-21 121.6 5.2 50 146-195 114-163 (163) 64 COG1045 CysE Serine acetyltran 99.6 1.2E-15 3.1E-20 118.3 6.5 53 146-198 119-171 (194) 65 PRK10191 putative colanic acid 99.6 1.5E-15 3.9E-20 117.6 3.8 53 145-197 91-143 (146) 66 COG0663 PaaY Carbonic anhydras 99.6 9.8E-15 2.5E-19 112.6 7.6 133 2-183 12-145 (176) 67 COG2171 DapD Tetrahydrodipicol 99.4 5.2E-13 1.3E-17 101.8 7.6 119 8-191 109-227 (271) 68 TIGR03570 NeuD_NnaD sugar O-ac 99.4 5.1E-13 1.3E-17 101.9 7.2 82 7-100 87-168 (201) 69 cd05824 LbH_M1P_guanylylT_C Ma 99.4 4E-13 1E-17 102.6 6.4 77 16-94 2-78 (80) 70 cd03360 LbH_AT_putative Putati 99.4 1E-12 2.6E-17 100.0 5.7 103 7-128 84-186 (197) 71 cd00208 LbetaH Left-handed par 99.3 3.8E-12 9.8E-17 96.4 6.9 35 146-180 44-78 (78) 72 PRK11830 dapD 2,3,4,5-tetrahyd 99.3 1E-11 2.6E-16 93.8 6.7 62 10-72 99-160 (265) 73 cd03350 LbH_THP_succinylT 2,3, 99.3 1.9E-11 4.9E-16 92.0 8.1 40 147-189 94-133 (139) 74 cd03358 LbH_WxcM_N_like WcxM-l 99.2 1.8E-11 4.6E-16 92.2 6.7 54 4-57 1-54 (119) 75 KOG1461 consensus 99.2 1E-11 2.6E-16 93.7 4.4 100 22-143 318-417 (673) 76 cd00208 LbetaH Left-handed par 99.2 7E-11 1.8E-15 88.5 7.0 34 39-72 2-35 (78) 77 TIGR03308 phn_thr-fam phosphon 99.1 2.8E-10 7.1E-15 84.8 8.5 181 2-237 20-202 (204) 78 TIGR03532 DapD_Ac 2,3,4,5-tetr 99.1 2.8E-10 7.2E-15 84.8 7.6 72 112-183 100-179 (231) 79 PRK09677 putative lipopolysacc 99.1 3.2E-10 8.1E-15 84.4 7.8 133 19-181 43-182 (192) 80 cd05635 LbH_unknown Uncharacte 99.1 3.5E-10 8.8E-15 84.2 7.0 54 12-65 1-57 (101) 81 cd05824 LbH_M1P_guanylylT_C Ma 99.1 3.9E-10 1E-14 83.8 6.8 48 22-70 2-49 (80) 82 PRK11830 dapD 2,3,4,5-tetrahyd 99.1 2.1E-10 5.4E-15 85.5 4.9 81 112-192 127-231 (265) 83 COG1208 GCD1 Nucleoside-diphos 99.1 6.9E-10 1.8E-14 82.3 7.5 57 14-71 256-312 (358) 84 KOG1461 consensus 99.0 4.2E-10 1.1E-14 83.6 6.4 104 6-125 320-423 (673) 85 PRK05293 glgC glucose-1-phosph 99.0 7.2E-10 1.8E-14 82.2 7.1 12 11-22 280-291 (381) 86 PRK10502 putative colanic acid 99.0 1.1E-09 2.8E-14 81.0 7.1 55 2-56 53-111 (179) 87 PRK05293 glgC glucose-1-phosph 99.0 1.7E-09 4.4E-14 79.8 7.8 16 55-70 308-323 (381) 88 KOG4042 consensus 99.0 4.2E-10 1.1E-14 83.7 4.1 164 8-231 9-181 (190) 89 cd03354 LbH_SAT Serine acetylt 99.0 1.8E-09 4.6E-14 79.7 7.2 43 131-175 57-99 (101) 90 cd04647 LbH_MAT_like Maltose O 99.0 1.2E-09 3.2E-14 80.7 6.2 46 128-175 58-103 (109) 91 KOG4042 consensus 99.0 8.1E-10 2.1E-14 81.9 4.6 76 40-124 11-92 (190) 92 KOG3121 consensus 98.9 1E-09 2.6E-14 81.2 5.0 139 36-223 32-175 (184) 93 TIGR02287 PaaY phenylacetic ac 98.9 2.4E-09 6.1E-14 78.9 6.8 130 4-182 11-141 (193) 94 KOG1460 consensus 98.9 1.6E-09 4.1E-14 80.0 5.8 69 4-73 285-358 (407) 95 cd05635 LbH_unknown Uncharacte 98.9 3.8E-09 9.7E-14 77.7 7.3 85 35-160 9-93 (101) 96 PRK10092 maltose O-acetyltrans 98.9 2.2E-09 5.6E-14 79.2 5.7 113 25-176 59-175 (183) 97 cd05787 LbH_eIF2B_epsilon eIF- 98.9 7E-09 1.8E-13 76.0 7.9 47 22-70 2-48 (79) 98 PRK00844 glgC glucose-1-phosph 98.9 6.3E-09 1.6E-13 76.3 7.2 55 128-184 331-385 (409) 99 COG1208 GCD1 Nucleoside-diphos 98.9 3.8E-09 9.7E-14 77.7 6.0 105 28-179 253-357 (358) 100 cd05787 LbH_eIF2B_epsilon eIF- 98.8 8.4E-09 2.1E-13 75.5 6.2 64 4-70 2-65 (79) 101 TIGR01172 cysE serine O-acetyl 98.8 6.8E-09 1.7E-13 76.1 4.9 71 49-165 81-151 (163) 102 TIGR02353 NRPS_term_dom non-ri 98.8 4.2E-09 1.1E-13 77.4 3.0 72 2-73 119-203 (719) 103 PRK03282 consensus 98.8 2.8E-08 7.2E-13 72.2 7.0 54 140-195 329-384 (406) 104 PRK09527 lacA galactoside O-ac 98.7 2.3E-08 5.9E-13 72.8 6.3 46 138-185 141-187 (203) 105 cd03357 LbH_MAT_GAT Maltose O- 98.7 3.6E-08 9.1E-13 71.6 6.9 117 21-176 44-164 (169) 106 TIGR00965 dapD 2,3,4,5-tetrahy 98.7 9.2E-09 2.3E-13 75.3 3.3 114 10-153 106-219 (275) 107 cd03356 LbH_G1P_AT_C_like Left 98.7 6.1E-08 1.6E-12 70.1 7.5 47 22-70 2-48 (79) 108 cd03349 LbH_XAT Xenobiotic acy 98.7 6.3E-08 1.6E-12 70.1 7.3 51 129-181 74-125 (145) 109 cd05825 LbH_wcaF_like wcaF-lik 98.7 2.6E-08 6.6E-13 72.5 5.3 47 128-176 56-102 (107) 110 cd03356 LbH_G1P_AT_C_like Left 98.7 3.3E-08 8.5E-13 71.8 5.7 30 4-34 2-31 (79) 111 PRK10191 putative colanic acid 98.7 4.5E-08 1.1E-12 71.0 6.1 43 131-175 95-137 (146) 112 TIGR02353 NRPS_term_dom non-ri 98.7 2.1E-08 5.3E-13 73.1 4.4 84 111-194 622-719 (719) 113 cd04652 LbH_eIF2B_gamma_C eIF- 98.7 1.2E-07 3E-12 68.4 7.7 29 23-52 3-31 (81) 114 PRK04928 consensus 98.6 1.1E-07 2.7E-12 68.6 7.0 13 167-179 364-376 (405) 115 PRK02862 glgC glucose-1-phosph 98.6 1.9E-07 4.8E-12 67.1 7.6 33 37-71 308-340 (429) 116 cd04652 LbH_eIF2B_gamma_C eIF- 98.6 9.5E-08 2.4E-12 68.9 5.9 30 4-34 2-31 (81) 117 cd04651 LbH_G1P_AT_C Glucose-1 98.5 2.2E-07 5.7E-12 66.6 6.5 69 2-74 13-81 (104) 118 COG1045 CysE Serine acetyltran 98.5 1.6E-07 4.2E-12 67.4 5.8 38 130-167 121-158 (194) 119 cd04649 LbH_THP_succinylT_puta 98.5 2.3E-07 5.9E-12 66.5 6.2 63 8-72 2-64 (147) 120 PRK02862 glgC glucose-1-phosph 98.4 6.7E-07 1.7E-11 63.6 6.9 16 126-141 381-396 (429) 121 PRK03282 consensus 98.4 1.3E-06 3.3E-11 61.8 7.8 48 130-179 331-378 (406) 122 PRK00844 glgC glucose-1-phosph 98.4 1.4E-06 3.5E-11 61.7 7.8 38 143-182 334-371 (409) 123 PRK03701 consensus 98.4 5.1E-07 1.3E-11 64.4 5.1 45 142-188 349-395 (431) 124 KOG1460 consensus 98.3 8.8E-07 2.2E-11 62.9 5.5 57 7-70 282-338 (407) 125 PRK01884 consensus 98.3 1.4E-06 3.5E-11 61.7 5.9 59 128-188 329-393 (435) 126 cd04651 LbH_G1P_AT_C Glucose-1 98.3 2.1E-06 5.3E-11 60.6 6.7 57 10-70 4-60 (104) 127 PRK00725 glgC glucose-1-phosph 98.3 1.1E-06 2.8E-11 62.3 5.0 40 141-182 348-387 (431) 128 PRK04928 consensus 98.3 4.9E-06 1.3E-10 58.2 7.7 12 162-173 365-376 (405) 129 KOG1322 consensus 98.3 1.4E-06 3.6E-11 61.6 4.9 46 3-48 254-299 (371) 130 COG0110 WbbJ Acetyltransferase 98.3 4.5E-06 1.1E-10 58.5 7.4 140 24-182 30-177 (190) 131 COG2171 DapD Tetrahydrodipicol 98.2 1.9E-06 4.8E-11 60.8 5.2 67 114-180 124-198 (271) 132 TIGR00965 dapD 2,3,4,5-tetrahy 98.2 1.6E-06 4E-11 61.3 4.3 114 43-194 103-241 (275) 133 KOG1462 consensus 98.2 1.6E-06 4E-11 61.4 4.2 16 165-180 386-401 (433) 134 PRK11132 cysE serine acetyltra 98.2 3.5E-06 9E-11 59.1 5.9 44 130-175 195-238 (273) 135 KOG1322 consensus 98.0 7.9E-06 2E-10 56.9 5.2 12 128-139 334-345 (371) 136 PRK01884 consensus 98.0 1.5E-05 3.7E-10 55.3 6.5 52 129-182 346-397 (435) 137 cd04649 LbH_THP_succinylT_puta 97.9 4.3E-05 1.1E-09 52.3 6.8 34 147-182 74-107 (147) 138 KOG1462 consensus 97.8 3.2E-05 8.2E-10 53.1 5.4 44 116-179 368-411 (433) 139 KOG4750 consensus 97.8 1.6E-05 4.1E-10 55.0 3.8 35 130-164 202-236 (269) 140 PRK03701 consensus 97.8 4.9E-05 1.3E-09 52.0 5.7 55 129-185 348-402 (431) 141 PRK00725 glgC glucose-1-phosph 97.8 5.6E-05 1.4E-09 51.6 5.5 70 112-185 333-402 (431) 142 TIGR01208 rmlA_long glucose-1- 97.7 6.3E-05 1.6E-09 51.3 5.2 51 32-95 255-306 (361) 143 TIGR01208 rmlA_long glucose-1- 97.7 4.9E-05 1.2E-09 52.0 4.5 34 14-47 255-288 (361) 144 TIGR02091 glgC glucose-1-phosp 97.6 0.00011 2.9E-09 49.7 4.8 12 7-18 315-326 (421) 145 KOG3121 consensus 97.6 0.00011 2.7E-09 49.9 4.4 145 49-230 33-178 (184) 146 COG0448 GlgC ADP-glucose pyrop 97.4 0.00075 1.9E-08 44.6 7.3 12 37-48 296-307 (393) 147 COG4801 Predicted acyltransfer 97.4 0.00033 8.5E-09 46.8 5.0 49 42-96 55-103 (277) 148 COG4801 Predicted acyltransfer 97.2 0.0014 3.5E-08 42.9 6.9 22 147-168 167-188 (277) 149 TIGR03536 DapD_gpp 2,3,4,5-tet 96.9 0.0017 4.2E-08 42.4 4.7 33 147-181 251-283 (341) 150 TIGR03535 DapD_actino 2,3,4,5- 96.8 0.0021 5.4E-08 41.7 4.8 23 36-58 152-174 (319) 151 TIGR03536 DapD_gpp 2,3,4,5-tet 96.8 0.0031 7.9E-08 40.7 5.6 86 34-124 175-264 (341) 152 TIGR03535 DapD_actino 2,3,4,5- 96.7 0.0049 1.2E-07 39.5 5.7 33 147-181 226-258 (319) 153 TIGR02092 glgD glucose-1-phosp 96.3 0.0039 1E-07 40.1 3.3 10 61-70 335-344 (383) 154 TIGR02092 glgD glucose-1-phosp 95.0 0.067 1.7E-06 32.4 5.5 15 117-131 336-350 (383) 155 PRK13412 fkp bifunctional fuco 74.8 5.2 0.00013 20.6 4.1 48 6-54 324-371 (974) 156 pfam03961 DUF342 Protein of un 74.5 7.3 0.00019 19.6 8.1 27 44-70 175-201 (450) 157 pfam10302 DUF2407 Putative mem 71.7 8.3 0.00021 19.3 4.5 38 204-241 125-167 (252) 158 pfam00427 PBS_linker_poly Phyc 70.5 9.1 0.00023 19.0 4.7 53 207-259 3-58 (131) 159 cd04765 HTH_MlrA-like_sg2 Heli 63.4 13 0.00032 18.2 6.3 38 206-243 36-73 (99) 160 pfam07637 PSD5 Protein of unkn 63.3 13 0.00032 18.1 4.7 42 206-247 16-57 (64) 161 cd01999 Argininosuccinate_Synt 58.3 8.2 0.00021 19.3 2.4 61 201-261 260-331 (385) 162 smart00335 ANX Annexin repeats 55.1 17 0.00044 17.3 4.0 27 209-237 14-40 (53) 163 PRK00509 argininosuccinate syn 55.0 16 0.00042 17.4 3.5 61 201-261 263-334 (398) 164 COG5126 FRQ1 Ca2+-binding prot 52.7 19 0.00048 17.0 5.6 52 204-255 9-65 (160) 165 cd01105 HTH_GlnR-like Helix-Tu 50.4 21 0.00053 16.8 5.6 46 206-252 37-82 (88) 166 TIGR00423 TIGR00423 conserved 46.9 17 0.00043 17.3 2.5 50 207-266 96-149 (331) 167 pfam00764 Arginosuc_synth Argi 44.7 25 0.00064 16.3 3.7 61 201-261 257-328 (389) 168 TIGR00032 argG argininosuccina 44.5 25 0.00065 16.2 4.3 60 202-261 283-355 (420) 169 PRK04527 argininosuccinate syn 44.4 24 0.00062 16.4 3.0 61 201-261 264-335 (397) 170 PRK13820 argininosuccinate syn 43.0 27 0.00068 16.1 4.1 61 201-261 261-332 (395) 171 cd07316 terB_like_DjlA N-termi 42.9 27 0.00069 16.1 6.5 54 204-257 28-82 (106) 172 PRK13340 alanine racemase; Rev 42.1 18 0.00046 17.1 2.1 35 190-224 153-188 (404) 173 TIGR02044 CueR Cu(I)-responsiv 40.2 30 0.00076 15.8 3.2 56 199-254 47-104 (127) 174 pfam03586 Herpes_UL36 Herpesvi 40.1 29 0.00073 15.9 2.8 71 184-258 96-166 (253) 175 PRK09430 djlA Dna-J like membr 39.4 31 0.00078 15.7 6.9 13 214-226 216-228 (269) 176 TIGR02120 GspF general secreti 39.2 31 0.00078 15.7 6.1 55 203-258 67-121 (414) 177 cd04396 RhoGAP_fSAC7_BAG7 RhoG 37.6 20 0.0005 16.9 1.7 49 175-223 24-72 (225) 178 PRK04156 gltX glutamyl-tRNA sy 37.5 22 0.00055 16.7 1.8 50 164-217 325-379 (566) 179 pfam00191 Annexin Annexin. Thi 36.8 34 0.00086 15.5 4.0 27 209-237 27-53 (66) 180 pfam03556 DUF298 Domain of unk 36.2 34 0.00087 15.4 5.9 49 216-264 21-78 (117) 181 pfam01244 Peptidase_M19 Membra 34.7 35 0.0009 15.3 2.6 23 202-224 290-314 (316) 182 PTZ00184 calmodulin; Provision 34.5 36 0.00093 15.3 3.3 47 215-261 84-135 (149) 183 pfam07319 DnaI_N Primosomal pr 34.4 37 0.00093 15.2 4.4 36 217-253 4-39 (93) 184 cd04781 HTH_MerR-like_sg6 Heli 34.2 20 0.0005 16.9 1.2 52 204-255 52-103 (120) 185 TIGR02589 cas_Csd2 CRISPR-asso 33.3 38 0.00097 15.1 3.0 43 180-227 190-233 (307) 186 PRK05347 glutaminyl-tRNA synth 32.8 26 0.00067 16.1 1.7 15 200-214 295-309 (556) 187 TIGR01270 Trp_5_monoox tryptop 32.7 30 0.00078 15.7 2.0 42 229-270 238-286 (499) 188 TIGR02225 recomb_XerD tyrosine 30.8 42 0.0011 14.9 4.0 61 202-262 53-127 (305) 189 COG3945 Uncharacterized conser 30.3 42 0.0011 14.9 2.4 17 209-225 92-108 (189) 190 KOG0819 consensus 29.8 44 0.0011 14.8 3.5 17 210-226 119-135 (321) 191 TIGR00463 gltX_arch glutamyl-t 29.5 29 0.00074 15.9 1.5 60 158-219 326-385 (600) 192 cd04379 RhoGAP_SYD1 RhoGAP_SYD 29.5 28 0.00071 16.0 1.3 43 180-222 15-57 (207) 193 TIGR02568 LcrE type III secret 29.5 44 0.0011 14.7 5.7 65 192-257 147-219 (265) 194 PRK05665 amidotransferase; Pro 28.2 47 0.0012 14.6 3.3 58 200-258 178-236 (240) 195 TIGR02160 PA_CoA_Oxy5 phenylac 27.8 34 0.00086 15.5 1.5 56 160-225 115-182 (371) 196 COG1422 Predicted membrane pro 27.4 48 0.0012 14.5 3.1 40 200-241 67-106 (201) 197 TIGR02137 HSK-PSP phosphoserin 26.7 27 0.00069 16.1 0.9 39 176-221 29-67 (203) 198 COG0137 ArgG Argininosuccinate 26.6 50 0.0013 14.4 4.2 60 201-260 265-336 (403) 199 pfam05476 PET122 PET122. The n 25.8 51 0.0013 14.3 5.7 57 198-257 206-262 (267) 200 cd06825 PLPDE_III_VanT Type II 25.6 51 0.0013 14.3 2.1 24 198-222 120-143 (368) 201 pfam12098 DUF3574 Protein of u 25.4 52 0.0013 14.3 2.2 19 210-228 73-91 (104) 202 cd04382 RhoGAP_MgcRacGAP RhoGA 25.2 34 0.00087 15.4 1.2 25 197-221 31-55 (193) 203 KOG1148 consensus 24.8 44 0.0011 14.8 1.7 44 176-219 482-528 (764) 204 pfam04282 DUF438 Family of unk 24.3 55 0.0014 14.2 2.8 46 210-255 11-58 (79) 205 COG0484 DnaJ DnaJ-class molecu 23.8 56 0.0014 14.1 2.8 16 209-224 336-351 (371) 206 TIGR00243 Dxr 1-deoxy-D-xylulo 23.7 37 0.00094 15.2 1.1 82 152-236 249-342 (406) 207 TIGR02143 trmA_only tRNA (urac 23.1 22 0.00056 16.6 -0.1 36 206-241 297-334 (361) 208 cd03557 L-arabinose_isomerase 23.0 30 0.00076 15.8 0.5 95 147-241 152-250 (484) 209 cd07313 terB_like_2 tellurium 22.8 58 0.0015 14.0 4.6 49 209-257 16-65 (104) 210 PTZ00183 centrin; Provisional 22.5 59 0.0015 13.9 5.3 19 206-224 18-36 (168) 211 cd04769 HTH_MerR2 Helix-Turn-H 22.2 60 0.0015 13.9 2.7 19 203-221 51-69 (116) 212 CHL00091 apcE phycobillisome l 22.2 60 0.0015 13.9 5.2 50 209-258 721-775 (886) 213 TIGR00635 ruvB Holliday juncti 22.1 60 0.0015 13.9 5.7 76 165-247 112-196 (305) 214 TIGR01926 peroxid_rel uncharac 21.5 56 0.0014 14.1 1.7 37 190-227 128-164 (179) 215 cd04408 RhoGAP_GMIP RhoGAP_GMI 21.4 48 0.0012 14.5 1.3 26 196-221 29-54 (200) 216 pfam09675 Chlamy_scaf Chlamydi 20.0 67 0.0017 13.6 2.2 44 214-259 30-73 (107) No 1 >TIGR01852 lipid_A_lpxA acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase; InterPro: IPR010137 This entry describes LpxA, an enzyme for the biosynthesis of lipid A, a component of lipopolysaccharide (LPS) in the outer membrane outer leaflet of most Gram-negative bacteria. Some differences are found between lipid A of different species, but this protein represents the first step (from UDP-N-acetyl-D-glucosamine) and appears to be conserved in function. Proteins from this family contain many copies of the bacterial transferase hexapeptide repeat.; GO: 0008780 acyl-[acyl-carrier-protein]-UDP-N-acetylglucosamine O-acyltransferase activity, 0008610 lipid biosynthetic process. Probab=100.00 E-value=0 Score=708.06 Aligned_cols=255 Identities=50% Similarity=0.839 Sum_probs=252.8 Q ss_pred ECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCC Q ss_conf 65510888985998898995655988998998898986686891684686553114342232102333122222233222 Q gi|255764481|r 10 IHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVG 89 (271) Q Consensus 10 IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG 89 (271) |||||+|+|+|+||+||+|||||+|+++|+||+||+|+|||+|.|.|+||+||+|||+|+||++|||+||+||.|+|+|| T Consensus 1 IHPTA~ie~gA~iG~~V~iGPf~~v~~~V~ig~g~~l~shV~i~G~T~iG~~~~i~p~A~iG~~PQdlky~GE~t~l~IG 80 (257) T TIGR01852 1 IHPTAIIEPGAEIGEDVEIGPFCIVGPGVKIGDGVELKSHVVIEGHTTIGEGTKIFPGAVIGGEPQDLKYKGERTELIIG 80 (257) T ss_pred CCCCEEECCCCEECCCCEEEEEEEECCCCEECCCCEECCCEEEECCEEECCCCEEECCEEECCCCCCCEECCCCCEEEEC T ss_conf 99621878866828985780188878975886885875736896770878998880760767988565014870179988 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEEC Q ss_conf 22221224432111122222212233200001665543310012344322345245532148606523444336506404 Q gi|255764481|r 90 KKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGK 169 (271) Q Consensus 90 ~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~ 169 (271) |||+|||+||||+||.++++.|+||||||||+|+||||||.|||+|+|+|+|+|||||+|||+|+|||++.||||||||+ T Consensus 81 ~~n~IRE~VTi~~GT~~g~g~T~iG~~nllMAysHvAHDC~vGn~vv~aN~a~LAGHV~vgD~a~iGG~~avHQFvRIG~ 160 (257) T TIGR01852 81 DNNTIREFVTINRGTKSGGGVTSIGNNNLLMAYSHVAHDCVVGNHVVLANNATLAGHVEVGDYAIIGGLVAVHQFVRIGR 160 (257) T ss_pred CCCEEEEEEEECCCCCCCCCEEEECCCCEECCCCEEEECCEECCEEEEECCCEECEEEEECCEEEECCCCCEEEEEEEHH T ss_conf 96569755785066258888899888765720535612436888169813631222799888999778972110133000 Q ss_pred CCEEECCCCCCCCCCCCCHHHHC-CCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH-CCCCHH Q ss_conf 74350132112233331000107-4532334420144428889999999999999972799899999999964-258989 Q gi|255764481|r 170 YAFIGGMTGVVHDVIPYGILNGN-PGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQ-NVSCPE 247 (271) Q Consensus 170 ~a~ig~gs~V~kdVpp~~~~~G~-pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~-~~~~~~ 247 (271) ||||||+|.|.||||||+++.|+ ||+++|||++||||+||++|+|..|++|||+|||+.++++|+++++.|+ ..++|+ T Consensus 161 ~aMigG~s~v~~DvpPY~~~~G~~~a~l~GlN~vGLrR~Gf~~~~i~~i~~ayr~lfr~~~~~~~~~~~v~e~P~~~~~~ 240 (257) T TIGR01852 161 YAMIGGLSAVSKDVPPYGLVEGNSRAVLRGLNIVGLRRRGFDREEITAIKKAYRLLFRSGLPLQEALQQVAEEPYEDNPE 240 (257) T ss_pred HHHHHHHCCCCCCCCCCEEECCCCCCEEEEEEEEEEECCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCHH T ss_conf 23221200246787761786378753378870132213787989999999998876179989899999998351337757 Q ss_pred HHHHHHHHHHCCCCCCC Q ss_conf 99999998615888737 Q gi|255764481|r 248 VSDIINFIFADRKRPLS 264 (271) Q Consensus 248 v~~~~~Fi~~s~rRgi~ 264 (271) +++|++||+++++|||| T Consensus 241 v~~~~~FI~~dS~RG~~ 257 (257) T TIGR01852 241 VKEIVDFIREDSKRGLC 257 (257) T ss_pred HHHHHHHHCCCCCCCCC T ss_conf 99999986076668889 No 2 >COG1043 LpxA Acyl-[acyl carrier protein] Probab=100.00 E-value=0 Score=629.48 Aligned_cols=259 Identities=47% Similarity=0.774 Sum_probs=255.4 Q ss_pred CCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCC Q ss_conf 99756551088898599889899565598899899889898668689168468655311434223210233312222223 Q gi|255764481|r 6 NNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTE 85 (271) Q Consensus 6 ~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~ 85 (271) +++.|||||+|+|+|+||+||+|||||+|+++|+||++|+|++||+|.|.|+||++|+|||+|+||.+|||+||+|++|+ T Consensus 2 ~~~~IHPTAiIe~gA~ig~~V~IGpf~iIg~~V~ig~~t~l~shvvv~G~T~IG~~n~I~~~A~iG~~pQdlKykge~T~ 81 (260) T COG1043 2 KMAKIHPTAIIEPGAEIGEDVKIGPFCIIGPNVEIGDGTVLKSHVVVEGHTTIGRNNRIFPFASIGEDPQDLKYKGEPTR 81 (260) T ss_pred CCCCCCCCEEECCCCCCCCCCEECCEEEECCCCEECCCCEECCCEEEECCCEECCCCEEECCCCCCCCCCCCCCCCCCEE T ss_conf 75645762152787882899888833798897198899588343799677077789879111042898754502798427 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCE Q ss_conf 32222222122443211112222221223320000166554331001234432234524553214860652344433650 Q gi|255764481|r 86 LLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFT 165 (271) Q Consensus 86 v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v 165 (271) ++||+||+|||+||||+||.++++.|+||||||||+|+||||||.|||+|+++|+++|+|||+|||+|+||+.+.||||| T Consensus 82 l~IG~~n~IRE~vTi~~GT~~g~g~T~IGdnnl~May~HVAHDC~iGn~~ilaNnatLAGHV~igD~aiiGG~saVHQF~ 161 (260) T COG1043 82 LIIGDNNTIREFVTIHRGTVQGGGVTRIGDNNLIMAYAHVAHDCVIGNNCILANNATLAGHVEVGDYAIIGGLSAVHQFV 161 (260) T ss_pred EEECCCCEEEEEEEEECCCCCCCEEEEECCCCEEEEEEEEECCCEECCCEEEECCCEEECEEEECCEEEECCCCEEEEEE T ss_conf 99889975866889861544786159977887898731031344456717996686685407887789974752377788 Q ss_pred EEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCC Q ss_conf 64047435013211223333100010745323344201444288899999999999999727998999999999642589 Q gi|255764481|r 166 RIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSC 245 (271) Q Consensus 166 ~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~ 245 (271) |||++||+|++|.|.+|||||+++.||||+++|||++||||+||++|+|.+|++|||+|||++++++|++++++++..++ T Consensus 162 rIG~~amiGg~S~v~~DVpPy~~~~Gn~a~l~GlN~vGlkRrgf~~e~i~alr~ayk~lfr~~~~~~e~~~~i~~~~~~~ 241 (260) T COG1043 162 RIGAHAMIGGLSAVSQDVPPYVIASGNHARLRGLNIVGLKRRGFSREEIHALRKAYKLLFRSGLTLREALEEIAEEYADN 241 (260) T ss_pred EECCHHEECCCHHHCCCCCCEEEECCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHEECCCCHHHHHHHHHHHHCCC T ss_conf 97401123441010367798388528704421213220110699989999999999998018988899999999884278 Q ss_pred HHHHHHHHHHHHCCCCCCC Q ss_conf 8999999998615888737 Q gi|255764481|r 246 PEVSDIINFIFADRKRPLS 264 (271) Q Consensus 246 ~~v~~~~~Fi~~s~rRgi~ 264 (271) |++++|+|||++|+||.+| T Consensus 242 ~~v~~~~dFi~~s~r~~~r 260 (260) T COG1043 242 PEVKEFIDFIASSSRGIIR 260 (260) T ss_pred HHHHHHHHHHHHCCCCCCC T ss_conf 6799999998642466669 No 3 >PRK05289 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=100.00 E-value=0 Score=592.57 Aligned_cols=260 Identities=47% Similarity=0.779 Sum_probs=253.2 Q ss_pred CCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCC Q ss_conf 99756551088898599889899565598899899889898668689168468655311434223210233312222223 Q gi|255764481|r 6 NNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTE 85 (271) Q Consensus 6 ~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~ 85 (271) +++.|||||+|+|+|+||+||+|||||+|+++|+||++|+|++|++|.++++||++|.|+|+|+||++|||++|.+++++ T Consensus 1 ~~~~IHptAiI~~~A~IG~~v~Igp~~vIg~~v~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~ 80 (261) T PRK05289 1 SMAKIHPTAIVEPGAEIGENVEIGPFCVIGPDVVIGDGTEIGSHVVIDGHTTIGKNNRIFPFASIGEEPQDLKYKGEPTR 80 (261) T ss_pred CCCCCCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCCCCCCCCCCCCCCCCCCCE T ss_conf 95740898898998999998999999999999999999999799689189888988345247533778766660698536 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCE Q ss_conf 32222222122443211112222221223320000166554331001234432234524553214860652344433650 Q gi|255764481|r 86 LLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFT 165 (271) Q Consensus 86 v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v 165 (271) |+|||+|.|||+|||||||..+++.|+||++|+||+++||+|||+||++|+++|++.|+|||+|||+|+||++++||||+ T Consensus 81 v~IGd~~~Ire~vtI~rgT~~~~g~T~IG~~~~im~~vhIaHd~~IG~~~iian~~~laG~v~Igd~v~iGg~~~v~q~v 160 (261) T PRK05289 81 LVIGDNNTIREFVTINRGTVQGGGVTRIGDNNLLMAYVHVAHDCVVGNNVILANNATLAGHVEVGDYAIIGGLSGVHQFV 160 (261) T ss_pred EEECCCCEECCCCEECCCCCCCCCCEEECCCCEEECCCCCCCCCCCCCCCEECCCEEECCCEEECCCEEECCCCEECCCC T ss_conf 99899778985414436610588644988885440036434455127871654305664607988970997615664883 Q ss_pred EEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCC Q ss_conf 64047435013211223333100010745323344201444288899999999999999727998999999999642589 Q gi|255764481|r 166 RIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSC 245 (271) Q Consensus 166 ~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~ 245 (271) +||++|||||+|+|+||||||+++.|+|||++|||++||||+|||+|+|++||+|||+||+++++++|+++++++ ..++ T Consensus 161 ~IG~~a~vgagS~V~kDVpp~~~v~G~PAr~~glN~vGL~R~g~s~~~i~~i~~ayr~l~~~~~~~~~~~~~~~~-~~~~ 239 (261) T PRK05289 161 RIGAHAMVGGMSAVVQDVPPYVLAEGNPARLRGLNIVGLKRRGFSREEIHALRRAYKLLYRSGLTLEEALEELAE-LADS 239 (261) T ss_pred EECCCEEECCCCEECCCCCCCCEEECCCCEEEEEEHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH-CCCC T ss_conf 989871999763654247998678468356886128888677999999999999999997279989999999985-3799 Q ss_pred HHHHHHHHHHHHCCCCCCCCCC Q ss_conf 8999999998615888737876 Q gi|255764481|r 246 PEVSDIINFIFADRKRPLSNWG 267 (271) Q Consensus 246 ~~v~~~~~Fi~~s~rRgi~~~~ 267 (271) ||+++|++||++| +||+|.|. T Consensus 240 ~~v~~~~~Fi~~s-~RGi~~~~ 260 (261) T PRK05289 240 PEVKEILDFIEAS-SRGIIRPR 260 (261) T ss_pred HHHHHHHHHHHCC-CCCCCCCC T ss_conf 8999999999618-88522789 No 4 >cd03351 LbH_UDP-GlcNAc_AT UDP-N-acetylglucosamine O-acyltransferase (UDP-GlcNAc acyltransferase): Proteins in this family catalyze the transfer of (R)-3-hydroxymyristic acid from its acyl carrier protein thioester to UDP-GlcNAc. It is the first enzyme in the lipid A biosynthetic pathway and is also referred to as LpxA. Lipid A is essential for the growth of Escherichia coli and related bacteria. It is also essential for maintaining the integrity of the outer membrane. UDP-GlcNAc acyltransferase is a homotrimer of left-handed parallel beta helix (LbH) subunits. Each subunit contains an N-terminal LbH region with 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal alpha-helical region. Probab=100.00 E-value=0 Score=591.14 Aligned_cols=254 Identities=44% Similarity=0.776 Sum_probs=249.7 Q ss_pred EECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCC Q ss_conf 56551088898599889899565598899899889898668689168468655311434223210233312222223322 Q gi|255764481|r 9 IIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLV 88 (271) Q Consensus 9 ~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~I 88 (271) ||||||+|+|+|+||+||+|||||+|+++|+||++|+|++||+|.++++||++|.|+|+|+||++|||++|.+++++++| T Consensus 1 ~IHptAiI~~~A~IG~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~I 80 (254) T cd03351 1 MIHPTAIVDPGAKIGENVEIGPFCVIGPNVEIGDGTVIGSHVVIDGPTTIGKNNRIFPFASIGEAPQDLKYKGEPTRLEI 80 (254) T ss_pred CCCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEECCCCCCCCCCCCCCCCCCCCEEEE T ss_conf 91897796998999997999999899999999999999799689199797765256057534778767673798676997 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEE Q ss_conf 22222122443211112222221223320000166554331001234432234524553214860652344433650640 Q gi|255764481|r 89 GKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIG 168 (271) Q Consensus 89 G~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG 168 (271) ||+|.|||+|+|||||..+++.|+||++|+||+++||+|||.||++|+++|++.|+|||+|||+||||++++||||++|| T Consensus 81 G~~~~Ire~vtI~rgt~~~~~~T~IG~~~~im~~~HIaHdc~IG~~~iian~~~laGhv~Igd~a~IGg~~~v~q~v~IG 160 (254) T cd03351 81 GDNNTIREFVTIHRGTAQGGGVTRIGNNNLLMAYVHVAHDCVIGNNVILANNATLAGHVEIGDYAIIGGLSAVHQFCRIG 160 (254) T ss_pred CCCCEECCCCCCCCCCCCCCCCEEECCCCEEEECCCCCCCCCCCCCEEEECCCCCCCCEEECCCCEECCCCEECCCCEEC T ss_conf 99778995330236743788755978885782025445677337975993344556856989984898836775981999 Q ss_pred CCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHH Q ss_conf 47435013211223333100010745323344201444288899999999999999727998999999999642589899 Q gi|255764481|r 169 KYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEV 248 (271) Q Consensus 169 ~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v 248 (271) ++|||||+|+|+||||||+++.|+|||++|||++||||+||++|+|+.|++|||+||+++++++|++++++++..++||+ T Consensus 161 ~~a~Igags~V~kDVpp~~~v~G~PA~~~glN~vGl~R~g~~~~~i~~i~~ayr~l~~~~~~~~~~~~~~~~~~~~~~~~ 240 (254) T cd03351 161 RHAMVGGGSGVVQDVPPYVIAAGNRARLRGLNLVGLKRRGFSREEIRALKRAYRILYRSGLTLEEALEELEEEAPDSPEV 240 (254) T ss_pred CCEEECCCCEEECCCCCCCEECCCCCEEEEEEHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCHHH T ss_conf 66199767456113698726517965787214898867799999999999999999617998999999998764699899 Q ss_pred HHHHHHHHHCCCCCC Q ss_conf 999999861588873 Q gi|255764481|r 249 SDIINFIFADRKRPL 263 (271) Q Consensus 249 ~~~~~Fi~~s~rRgi 263 (271) ++|++||++| +||+ T Consensus 241 ~~~~~Fi~~s-~RGi 254 (254) T cd03351 241 EELVDFIRSS-KRGI 254 (254) T ss_pred HHHHHHHHHC-CCCC T ss_conf 9999999736-7899 No 5 >PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=100.00 E-value=0 Score=590.06 Aligned_cols=255 Identities=40% Similarity=0.718 Sum_probs=249.7 Q ss_pred EECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCC Q ss_conf 56551088898599889899565598899899889898668689168468655311434223210233312222223322 Q gi|255764481|r 9 IIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLV 88 (271) Q Consensus 9 ~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~I 88 (271) ||||||+|+|+|+||+||+|||||+|+++|+||++|+|++|++|.++++||++|.|+|+|+||.+|||++|++++++++| T Consensus 2 ~IHptAiI~~~A~Ig~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~I 81 (256) T PRK12461 2 MIHPTAVIDPSAKLGEGVEIGPFAVIGADVEIGDGTWIMPHAVILGWTRIGKNNKIHQGAVVGDEPQDFGYKGEESHLII 81 (256) T ss_pred CCCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEEEECCCCCCCCCCCCCCCCCCEEEE T ss_conf 29998798998999999999999999999999999999498189199899974399754311763334441586206998 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEE Q ss_conf 22222122443211112222221223320000166554331001234432234524553214860652344433650640 Q gi|255764481|r 89 GKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIG 168 (271) Q Consensus 89 G~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG 168 (271) ||+|.|||++||+|||. +++.|+||++|+||+++||+|||.||++|+++|++.|+|||+|||+|+||+++++|||++|| T Consensus 82 Gd~~~Ire~~tI~rgt~-~~~~T~IG~~~~im~~vhIaHd~~IG~~~iian~~~laG~v~Igd~v~iGg~~~v~q~v~IG 160 (256) T PRK12461 82 GDNNVIREGVTIHRGTK-GGTVTRIGNDNLLMANSHIAHDCQIGNNVILVNGALLAGHVTVGDRAIISGNCLVHQFCRIG 160 (256) T ss_pred CCCCEECCCCCCCCCCC-CCCCEEECCCCEEECCCCCCCCCCCCCCEEEECCEEECCCEEECCCCEECCCCEECCCCEEC T ss_conf 99878798755678614-89866988785874364334643136770672220232425999972995734771797999 Q ss_pred CCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHH Q ss_conf 47435013211223333100010745323344201444288899999999999999727998999999999642589899 Q gi|255764481|r 169 KYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEV 248 (271) Q Consensus 169 ~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v 248 (271) ++|||||+|+|+||||||+++.|+|||++|||++||||+||++++|++||+|||+|||++++++|++++++++..++||+ T Consensus 161 ~~a~Ig~gS~V~kDVPp~~~v~G~pA~~~glN~vGl~R~gf~~~~i~~i~~ay~~l~~~~~~~~~~~~~~~~~~~~~~~v 240 (256) T PRK12461 161 RLAMMAGGSRISKDVPPYCMMAGNTTNVHGLNAVGLRRNGFSSRAIRALKRAYKIIYRSGLSLQDAVQELEDQFPMSPEI 240 (256) T ss_pred CCCEECCCCEEEEECCCCEEEECCCCEEEECHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCHHH T ss_conf 99699878577753799838826877654010440452898999999999999999708998999999998765499999 Q ss_pred HHHHHHHHHCCCCCCCC Q ss_conf 99999986158887378 Q gi|255764481|r 249 SDIINFIFADRKRPLSN 265 (271) Q Consensus 249 ~~~~~Fi~~s~rRgi~~ 265 (271) ++|++||++| +||||. T Consensus 241 ~~~~~Fi~~s-~RGi~r 256 (256) T PRK12461 241 RELIDFVKAS-KRGIVR 256 (256) T ss_pred HHHHHHHHHC-CCCCCC T ss_conf 9999998627-887529 No 6 >cd03352 LbH_LpxD UDP-3-O-acyl-glucosamine N-acyltransferase (LpxD): The enzyme catalyzes the transfer of 3-hydroxymyristic acid or 3-hydroxy-arachidic acid, depending on the organism, from the acyl carrier protein (ACP) to UDP-3-O-acyl-glucosamine to produce UDP-2,3-diacyl-GlcNAc. This constitutes the third step in the lipid A biosynthetic pathway in Gram-negative bacteria. LpxD is a homotrimer, with each subunit consisting of a novel combination of an N-terminal uridine-binding domain, a core lipid-binding left-handed parallel beta helix (LbH) domain, and a C-terminal alpha-helical extension. The LbH domain contains 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=100.00 E-value=0 Score=344.86 Aligned_cols=193 Identities=28% Similarity=0.453 Sum_probs=181.9 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCC Q ss_conf 50579975655108889859988989956559889989988989866868916846865531143422321023331222 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNF 81 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g 81 (271) ++|++++.|||+|+|+++|+||+|++|+|+|+|+++|+||++|+|+++++|.++++||++|.|+++++||.+|++..... T Consensus 2 a~ig~~v~I~~~avI~~~v~Ig~n~~I~~~~~I~~~v~IG~~~~I~~~~~I~~~~~IG~~~~I~~~~vIg~~~f~~~~~~ 81 (205) T cd03352 2 AKIGENVSIGPNAVIGEGVVIGDGVVIGPGVVIGDGVVIGDDCVIHPNVTIYEGCIIGDRVIIHSGAVIGSDGFGFAPDG 81 (205) T ss_pred CEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCEEEECCCEEECCCCCCCCCCC T ss_conf 69199999999879999999999999989889999968898836600344727868754477715348621353231479 Q ss_pred C-------CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEE Q ss_conf 2-------223322222221224432111122222212233200001665543310012344322345245532148606 Q gi|255764481|r 82 V-------GTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVV 154 (271) Q Consensus 82 ~-------~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~ 154 (271) . ...+.|||++.|+++++|+|++. ..|+||+++++|.++||+|||.||+++++++++.++||++|+|+|| T Consensus 82 ~~~~~~~~~g~v~Ig~~v~Ig~~~~I~~g~~---~~T~IG~~~~i~~~v~igh~~~iG~~~~i~~~~~~~g~~~Igd~~~ 158 (205) T cd03352 82 GGWVKIPQLGGVIIGDDVEIGANTTIDRGAL---GDTVIGDGTKIDNLVQIAHNVRIGENCLIAAQVGIAGSTTIGDNVI 158 (205) T ss_pred CCCEECCCCCCEEECCCEEECCCCEECCCCC---CCCCCCCCCEECCCCCCCCCCCCCCCEEECCCCCCCCCEEECCCCE T ss_conf 8404514213189848849888627636642---6774787768645452556404158707503200026524974353 Q ss_pred ECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCC Q ss_conf 5234443365064047435013211223333100010745323 Q gi|255764481|r 155 FGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALR 197 (271) Q Consensus 155 IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~ 197 (271) ||+++.++|+++||++|||||+|+|+||||||+++.|+|||.. T Consensus 159 iG~~s~i~~gv~IG~~a~VgagSvVtkdVp~~~iv~G~PAr~i 201 (205) T cd03352 159 IGGQVGIAGHLTIGDGVVIGAGSGVTSIVPPGEYVSGTPAQPH 201 (205) T ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCCCCCEEEEEECCHH T ss_conf 9896488299399999899868879648699969997628435 No 7 >PRK00892 lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional Probab=100.00 E-value=0 Score=340.29 Aligned_cols=193 Identities=23% Similarity=0.381 Sum_probs=152.9 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCC--- Q ss_conf 50579975655108889859988989956559889989988989866868916846865531143422321023331--- Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKY--- 78 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~--- 78 (271) ++|++++.|.|.++|+++++||+||.|+|+|+|+++|+||+||+|+|+|+|..++.||++|.|+++++||.+...+. T Consensus 111 a~Ig~~v~Igp~~vIg~~v~IG~~~~I~~~~~Ig~~v~IG~~~~I~~nv~I~~~~~IG~~~~I~~g~vIG~dGFGf~~~~ 190 (343) T PRK00892 111 AKIGKSVSIGPNAVIGAGVEIGDGVIIGAGAVIGAGVKIGDDCRLHANVTIYHRVQIGNRVLIHSGAVIGQDGFGFAPDR 190 (343) T ss_pred CEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCEEECCCCEECCCCEECCCCCCCCCCC T ss_conf 78799988999989999989999999929819958818889958811856625558888756468862236675413679 Q ss_pred --CC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEE Q ss_conf --22--22223322222221224432111122222212233200001665543310012344322345245532148606 Q gi|255764481|r 79 --HN--FVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVV 154 (271) Q Consensus 79 --~~--g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~ 154 (271) +. ..-..|+|||++.|++++||+||+.. .|+||++++||.++||||||+||++|++++++.|+||++|||+|+ T Consensus 191 ~~~~kip~~G~V~I~d~veIgan~tIdrg~~~---~T~Ig~~~kidn~vhIaHn~~IG~~~iia~~~giaGs~~igd~v~ 267 (343) T PRK00892 191 GGWVKIPQLGRVIIGDDVEIGANTTIDRGALD---DTVIGEGVKIDNLVQIAHNVRIGRHCAIAAQVGIAGSTKIGRYCM 267 (343) T ss_pred CCEEECCCCCEEEECCCCEECCCCEECCCCCC---CCEECCCCEEEEEEEECCCCEECCCCEEEECCEEEEEEEECCCEE T ss_conf 94046464324998898688887327236677---978898965921117616747878838852114700579999149 Q ss_pred ECCCCCCCCCEEEECCCEEECCCCCCCCCCC-CCHHHHCCCCCC Q ss_conf 5234443365064047435013211223333-100010745323 Q gi|255764481|r 155 FGGGSAVHQFTRIGKYAFIGGMTGVVHDVIP-YGILNGNPGALR 197 (271) Q Consensus 155 IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp-~~~~~G~pa~~~ 197 (271) ||+++.|+|+++||+++|++|+|.|++|||+ ...+.|.||+.. T Consensus 268 igG~~gi~~h~~Ig~~~~i~~~sgv~~di~~~g~~~~G~Pa~~~ 311 (343) T PRK00892 268 IGGQVGIAGHLEIGDGVQITAMSGVMKSITEPGEYSSGIPAQPN 311 (343) T ss_pred ECCCCCCCCCCEECCCCEEEECCEECCCCCCCCEEEEEECCCHH T ss_conf 94413234886989997990065143646999808975346008 No 8 >TIGR01853 lipid_A_lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; InterPro: IPR007691 UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase (2.3.1 from EC) catalyses an early step in lipid A biosynthesis :UDP-3-O-(3-hydroxytetradecanoyl)glucosamine + (R)-3-hydroxytetradecanoyl- [acyl carrier protein] = UDP-2,3-bis(3-hydroxytetradecanoyl)glucosamine + [acyl carrier protein] Members of this family also contain a hexapeptide repeat (IPR001451 from INTERPRO). This entry represents the non-repeating region of LPXD proteins.; GO: 0016747 transferase activity transferring groups other than amino-acyl groups, 0009245 lipid A biosynthetic process. Probab=100.00 E-value=1.4e-45 Score=304.88 Aligned_cols=193 Identities=27% Similarity=0.492 Sum_probs=175.5 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECC------C Q ss_conf 5057997565510888985998898995655988998998898986686891684686553114342232102------3 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDT------Q 75 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~------q 75 (271) ++|++++.|-|.|+|..++.||+||.|+|+|+|++||+||++|+|+|+|+|+.++.||+||.|+|+|+||+|= + T Consensus 114 a~~~~~v~I~pnvvIG~~v~IG~~~~I~pg~~iG~~v~IG~~~~i~~~v~i~~~~~lG~nV~I~~GaVIG~DGFGya~~~ 193 (336) T TIGR01853 114 AKIGEGVTIGPNVVIGAGVEIGENVIIGPGVVIGDDVVIGDGSVIYPNVVIYERVQLGKNVIIHAGAVIGSDGFGYAHTA 193 (336) T ss_pred CCCCCCCEECCEEEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEEEECCEECCEEEECCCCEECCCCCCEEECC T ss_conf 31078436875417888868778636357768768868799677806708972415647158768878867873203578 Q ss_pred CCCC-C-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCE Q ss_conf 3312-2-2222332222222122443211112222221223320000166554331001234432234524553214860 Q gi|255764481|r 76 SKYH-N-FVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRV 153 (271) Q Consensus 76 d~~~-~-g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~ 153 (271) +..+ + ..-..|+|+|++.|+.|+||.||+.. .|.|++++.|...++|||+|+||+||+|+.++.|+|+++||+|| T Consensus 194 ~g~~~KI~q~G~ViI~D~VEIGANTtIDRg~~~---dT~i~~gtKIDNLVQIAHNc~IG~n~~ivaqvGiaGSt~iG~~V 270 (336) T TIGR01853 194 NGGHVKIPQIGRVIIEDDVEIGANTTIDRGAFD---DTVIGEGTKIDNLVQIAHNCRIGENCIIVAQVGIAGSTKIGRNV 270 (336) T ss_pred CCCEEEEEEECEEEEECEEEECCCCHHHCCCCC---CCEEECCCEECCEEEEEEEEEECCCCEEEEEEECCEEEEECCEE T ss_conf 897154413024896023387468702110214---71872384662315741334626642789642011346755806 Q ss_pred EECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHH--CCCCCC Q ss_conf 65234443365064047435013211223333100010--745323 Q gi|255764481|r 154 VFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNG--NPGALR 197 (271) Q Consensus 154 ~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G--~pa~~~ 197 (271) .|||.+++-.+.+||+.++|+|.|.|++||||...+.| -||+.. T Consensus 271 ~~GGqvGv~GHl~IgD~v~i~A~sgV~~~~~~~~~ygggi~Pa~~~ 316 (336) T TIGR01853 271 ILGGQVGVAGHLEIGDNVTIGAKSGVTKSVPPNGRYGGGIIPAMPQ 316 (336) T ss_pred EECCEEEEECEEEECCCCEEEEECCCCCCCCCCCCCCEEECCCCHH T ss_conf 9626026511089438348845122067779886010011141202 No 9 >cd03353 LbH_GlmU_C N-acetyl-glucosamine-1-phosphate uridyltransferase (GlmU), C-terminal left-handed beta-helix (LbH) acetyltransferase domain: GlmU is also known as UDP-N-acetylglucosamine pyrophosphorylase. It is a bifunctional bacterial enzyme that catalyzes two consecutive steps in the formation of UDP-N-acetylglucosamine (UDP-GlcNAc), an important precursor in bacterial cell wall formation. The two enzymatic activities, uridyltransferase and acetyltransferase, are carried out by two independent domains. The C-terminal LbH domain possesses the acetyltransferase activity. It catalyzes the CoA-dependent acetylation of GlcN-1-phosphate to GlcNAc-1-phosphate. The LbH domain contains 10 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X. The acetyltransferase active site is located at the interface between two subunits of the active LbH trimer. Probab=100.00 E-value=2.6e-39 Score=266.32 Aligned_cols=180 Identities=21% Similarity=0.347 Sum_probs=139.5 Q ss_pred CEECC-CEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCC Q ss_conf 75655-10888985998898995655988998998898986686891684686553114342232102333122222233 Q gi|255764481|r 8 PIIHP-LALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTEL 86 (271) Q Consensus 8 ~~Ihp-sA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v 86 (271) .+||| +++|+|+|+||+||+|+|+|+|+++|+||++|+|+|+++| +++.||++|+|++++++.... ....+ T Consensus 3 ~~i~p~s~~I~~~v~IG~~v~I~p~~~I~~~v~IG~~~~Ig~~~~I-~~~~IG~~~~I~~~~vi~~~~-------ig~~~ 74 (193) T cd03353 3 TLIDPETTYIDGDVEIGVDVVIDPGVILEGKTVIGEDCVIGPNCVI-KDSTIGDGVVIKASSVIEGAV-------IGNGA 74 (193) T ss_pred EEECCCEEEECCCCEECCCCEECCCEEEECCCEECCCCEECCCEEE-CCCEECCCCEECCCEECCCCE-------ECCCC T ss_conf 7958980699998299999999898299298599999899573457-786888874002100113525-------17652 Q ss_pred CCCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCHHCC-----CCCCCCCCCCCCCCCCCCCCCCCCC-CEECCCEEEC Q ss_conf 2222222122443211112222----2212233200001-----6655433100123443223452455-3214860652 Q gi|255764481|r 87 LVGKKCVIREGVTINRGTVEYG----GKTIVGDNNFFLA-----NSHVAHDCKLGNGIVLSNNVMIAGH-VIVDDRVVFG 156 (271) Q Consensus 87 ~IG~~~~Ire~v~I~~gt~~~~----~~T~IG~~~~i~~-----~~hI~Hd~~IG~~vii~n~~~l~G~-v~Igd~~~IG 156 (271) .||+++.||++++|..++..+. ..|.||+++.+.. +++|+|+|.||.+++++|.+.+++| ++|||+|||| T Consensus 75 ~IG~~~~i~~~~~I~~~~~iG~~v~i~~~~ig~~~~i~h~~~igd~~Ig~~v~Ig~~~i~~n~~g~~~~~t~IGd~v~IG 154 (193) T cd03353 75 TVGPFAHLRPGTVLGEGVHIGNFVEIKKSTIGEGSKANHLSYLGDAEIGEGVNIGAGTITCNYDGVNKHRTVIGDNVFIG 154 (193) T ss_pred EECCCEEECCCCEECCEEEECCCEEEECCCCCCCCEECCEEECCCCEECCCCEECCCEEEECCCCCCCCCCEECCCCEEC T ss_conf 66685076587688751674253576444336884877244618856777888866758873434335623433561898 Q ss_pred CCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCC Q ss_conf 3444336506404743501321122333310001074532 Q gi|255764481|r 157 GGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGAL 196 (271) Q Consensus 157 ~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~ 196 (271) ++++++|+++||++|||||+|.|+||||||+|+.| +||. T Consensus 155 ~~~~i~~~v~IG~~a~Igags~V~kdVpp~~lvvg-~~~~ 193 (193) T cd03353 155 SNSQLVAPVTIGDGATIAAGSTITKDVPPGALAIA-RARQ 193 (193) T ss_pred CCCEEECCEEECCCCEECCCCEECCCCCCCCEEEE-ECCC T ss_conf 89879898399999899869888842399978996-2489 No 10 >COG1044 LpxD UDP-3-O-[3-hydroxymyristoyl] Probab=100.00 E-value=8.4e-39 Score=263.15 Aligned_cols=219 Identities=24% Similarity=0.392 Sum_probs=184.6 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCC--- Q ss_conf 50579975655108889859988989956559889989988989866868916846865531143422321023331--- Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKY--- 78 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~--- 78 (271) ++++.++.|.|.++|.++|.||+||.|+|.|+|+++|+||++|+|+++|+|..++.||++|.|+++++||.+---+. T Consensus 112 A~i~~~~~ig~~vvI~~~v~IG~~~~I~~~avIg~~~~IG~~~~i~~~v~I~~~~~IG~~v~I~~GavIG~dgFg~a~~~ 191 (338) T COG1044 112 ATIGKNVSIGPNVVIGAGVVIGENVVIGAGAVIGENVKIGDGTVIHPNVTIYHNVVIGNNVIIHSGAVIGADGFGYAGTA 191 (338) T ss_pred CCCCCCCCCCCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCEEECCCCEECCCCCCCCCCC T ss_conf 70377873288749888978789858988978888968888858847878952757878228889988756852122466 Q ss_pred ----CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEE Q ss_conf ----2222223322222221224432111122222212233200001665543310012344322345245532148606 Q gi|255764481|r 79 ----HNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVV 154 (271) Q Consensus 79 ----~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~ 154 (271) .-....+|.|||++.|+.+++|.|+... .|.||+++.|...++|||+|.||.+|+++.++.++|+++||++|. T Consensus 192 ~g~~Ki~q~g~V~Igd~VeIGanT~Idrga~~---dTvIg~~~kIdN~vqIaHnv~IG~~~~I~~~vgIaGs~~IG~~v~ 268 (338) T COG1044 192 IGWVKIPQIGRVIIGDDVEIGANTTIDRGALD---DTVIGEGVKIDNLVQIGHNVRIGEHCIIAGQVGIAGSVKIGKYVI 268 (338) T ss_pred CCCEECCEECEEEECCCEEECCCCEECCCCCC---CCEECCCCEECCEEEECCCCEECCCCEEECCCEEECCCEECCEEE T ss_conf 88567126065997982487355242565556---726537848764307815527888837702631613535778079 Q ss_pred ECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 52344433650640474350132112233331000107453233442014442888999999999999997279989999 Q gi|255764481|r 155 FGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKN 234 (271) Q Consensus 155 IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~ 234 (271) ||+.+.+..+.+||++++|++++.|.++||+...+.|.|+++. ..++| .+ .++++...+.++ T Consensus 269 igg~vgI~gh~~IgD~~~I~~~~~v~~~i~~~~~~gg~P~~p~---k~w~k-------------~~--a~~~~l~~~~~r 330 (338) T COG1044 269 IGGQVGIAGHLEIGDGVTIGARSGVMASITEPGYSGGIPAQPI---KEWLK-------------TA--ALIRRLPEMRKR 330 (338) T ss_pred ECCCEEECCCEEECCCCEEECCCCCCCCCCCCCEECCCCCCHH---HHHHH-------------HH--HHHHHCHHHHHH T ss_conf 8761346484187578788045243123578866336777539---99999-------------99--998527778999 Q ss_pred HHHHHHH Q ss_conf 9999964 Q gi|255764481|r 235 AGAIREQ 241 (271) Q Consensus 235 l~~l~~~ 241 (271) +++|+++ T Consensus 331 ~k~lek~ 337 (338) T COG1044 331 LKALEKQ 337 (338) T ss_pred HHHHHCC T ss_conf 8776405 No 11 >PRK00892 lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional Probab=100.00 E-value=9.7e-35 Score=237.72 Aligned_cols=225 Identities=24% Similarity=0.395 Sum_probs=188.3 Q ss_pred CCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCC Q ss_conf 99756551088898599889899565598899899889898668689168468655311434223210233312222223 Q gi|255764481|r 6 NNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTE 85 (271) Q Consensus 6 ~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~ 85 (271) ..+.|||+|+|+|+|+|+++|.|||||+|+++|+||+||+|+|+|+|..+++||+||.|+|+++|.. . T Consensus 97 ~~~gIh~sA~I~~~a~Ig~~v~Igp~~vIg~~v~IG~~~~I~~~~~Ig~~v~IG~~~~I~~nv~I~~------------~ 164 (343) T PRK00892 97 PAPGIHPSAVIDPTAKIGKSVSIGPNAVIGAGVEIGDGVIIGAGAVIGAGVKIGDDCRLHANVTIYH------------R 164 (343) T ss_pred CCCCCCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECC------------C T ss_conf 5688797536188778799988999989999989999999929819958818889958811856625------------5 Q ss_pred CCCCCCCCCCCCCCCCCC-----CCC-------CCCCCCCCCCCHHCCCCCCC----CCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 322222221224432111-----122-------22221223320000166554----33100123443223452455321 Q gi|255764481|r 86 LLVGKKCVIREGVTINRG-----TVE-------YGGKTIVGDNNFFLANSHVA----HDCKLGNGIVLSNNVMIAGHVIV 149 (271) Q Consensus 86 v~IG~~~~Ire~v~I~~g-----t~~-------~~~~T~IG~~~~i~~~~hI~----Hd~~IG~~vii~n~~~l~G~v~I 149 (271) +.||++|+|..++.|..- ... .-+...|||++.|++++.|. .+++||++|-+.|.+.++.+|+| T Consensus 165 ~~IG~~~~I~~g~vIG~dGFGf~~~~~~~~kip~~G~V~I~d~veIgan~tIdrg~~~~T~Ig~~~kidn~vhIaHn~~I 244 (343) T PRK00892 165 VQIGNRVLIHSGAVIGQDGFGFAPDRGGWVKIPQLGRVIIGDDVEIGANTTIDRGALDDTVIGEGVKIDNLVQIAHNVRI 244 (343) T ss_pred EEECCCCEECCCCEECCCCCCCCCCCCCEEECCCCCEEEECCCCEECCCCEECCCCCCCCEECCCCEEEEEEEECCCCEE T ss_conf 58888756468862236675413679940464643249988986888873272366779788989659211176167478 Q ss_pred CCCEEECCCCCCCCCEEEECCCEEECCCCCCCC--CCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHC Q ss_conf 486065234443365064047435013211223--333100010745323344201444288899999999999999727 Q gi|255764481|r 150 DDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD--VIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQ 227 (271) Q Consensus 150 gd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd--Vpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~ 227 (271) |++++|.+++++...++||+++++|+.+.|... |-.++++.|.-.-...+...|....|+...+.++-++.+ .+||+ T Consensus 245 G~~~iia~~~giaGs~~igd~v~igG~~gi~~h~~Ig~~~~i~~~sgv~~di~~~g~~~~G~Pa~~~~~~~r~~-a~~~~ 323 (343) T PRK00892 245 GRHCAIAAQVGIAGSTKIGRYCMIGGQVGIAGHLEIGDGVQITAMSGVMKSITEPGEYSSGIPAQPNKEWRKTA-ALVRR 323 (343) T ss_pred CCCCEEEECCEEEEEEEECCCEEECCCCCCCCCCEECCCCEEEECCEECCCCCCCCEEEEEECCCHHHHHHHHH-HHHHH T ss_conf 78838852114700579999149944132348869899979900651436469998089753460089999999-88860 Q ss_pred CCCHHHHHHHHHHHCC Q ss_conf 9989999999996425 Q gi|255764481|r 228 GDSIYKNAGAIREQNV 243 (271) Q Consensus 228 ~~~l~e~l~~l~~~~~ 243 (271) ...+.+.+++|++... T Consensus 324 Lp~l~k~~~~lek~~~ 339 (343) T PRK00892 324 LDKMRKRLKALEKKVE 339 (343) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 7999999999999998 No 12 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=100.00 E-value=1.2e-34 Score=237.09 Aligned_cols=181 Identities=24% Similarity=0.373 Sum_probs=149.0 Q ss_pred CCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCC Q ss_conf 05799756551088898599889899565598899899889898668689168468655311434223210233312222 Q gi|255764481|r 3 RMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFV 82 (271) Q Consensus 3 ~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~ 82 (271) .|++++.|+|..++++.++||+||.|||+|.| .|+.|++++.|.|++++. ++.||++|.|+|||.|.+ T Consensus 273 ~~G~Dv~I~~~v~leG~v~iG~~v~IGp~~~I-~ns~I~~~~~I~~~s~~e-~~~ig~~~~vGPFArLRP---------- 340 (461) T TIGR01173 273 EIGQDVVIDPNVILEGKVQIGDDVVIGPGCVI-KNSVIGSNAVIKPYSVLE-GSEIGEGCDVGPFARLRP---------- 340 (461) T ss_pred EECCEEEECCCCEECCEEEECCCEEECCCCEE-EEEEECCCCEEEEEEECC-CCEECCCCEECCCCCCCC---------- T ss_conf 98771697558388340798787078898589-972886885588888414-678615640277300186---------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCC-CEECCCEEECC Q ss_conf 22332222222122443211112----22222122332000016655433100123443223452455-32148606523 Q gi|255764481|r 83 GTELLVGKKCVIREGVTINRGTV----EYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGH-VIVDDRVVFGG 157 (271) Q Consensus 83 ~~~v~IG~~~~Ire~v~I~~gt~----~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~-v~Igd~~~IG~ 157 (271) ...|++++.|+-||-+-.++. +-++.|+||| +.||.+|.||-++|.+|.-..--| +.|||+||||+ T Consensus 341 --~~~L~~~~hiGNFVE~Kna~iG~gsKA~HLsYlGD-------AeiG~~vNiGAGtITcNYDG~nK~~T~IGd~VFiGS 411 (461) T TIGR01173 341 --GSVLGAGVHIGNFVEVKNARIGEGSKAGHLSYLGD-------AEIGSNVNIGAGTITCNYDGVNKHKTIIGDGVFIGS 411 (461) T ss_pred --CCHHHCCCEEEEEEEEECCEECCCCCCCCCCEEEE-------EEECCCCEECCEEEEEECCCCCCCCCEECCCCEECC T ss_conf --43211778160258652476188644231433320-------250796003231689932796220348648878776 Q ss_pred CCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHC Q ss_conf 44433650640474350132112233331000107453233442014442 Q gi|255764481|r 158 GSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRA 207 (271) Q Consensus 158 ~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~ 207 (271) +|.+-.+++||++|+|||||+||+|||+++|+.. |.+..|+.|-.|+ T Consensus 412 nt~LVAPV~iG~gA~iaAGstvt~DVp~g~La~~---R~~Q~~iegW~~~ 458 (461) T TIGR01173 412 NTQLVAPVKIGDGATIAAGSTVTKDVPEGALAIA---RARQRNIEGWVRP 458 (461) T ss_pred CCEEECCEEECCCCEECCCCEEECCCCCCCCEEE---CCCCEECCCCCCC T ss_conf 7604554394381187124148023488852662---4556552276326 No 13 >TIGR01853 lipid_A_lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; InterPro: IPR007691 UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase (2.3.1 from EC) catalyses an early step in lipid A biosynthesis :UDP-3-O-(3-hydroxytetradecanoyl)glucosamine + (R)-3-hydroxytetradecanoyl- [acyl carrier protein] = UDP-2,3-bis(3-hydroxytetradecanoyl)glucosamine + [acyl carrier protein] Members of this family also contain a hexapeptide repeat (IPR001451 from INTERPRO). This entry represents the non-repeating region of LPXD proteins.; GO: 0016747 transferase activity transferring groups other than amino-acyl groups, 0009245 lipid A biosynthetic process. Probab=99.97 E-value=2.8e-32 Score=222.32 Aligned_cols=202 Identities=26% Similarity=0.380 Sum_probs=169.5 Q ss_pred CCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCC Q ss_conf 97565510888985998898995655988998998898986686891684686553114342232102333122222233 Q gi|255764481|r 7 NPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTEL 86 (271) Q Consensus 7 ~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v 86 (271) .+-|||+|+++|+|+|+++|.|+||++|+.+|.||+||+|+|+|+|..+++||++|.|||+++|.. .+ T Consensus 101 ~~gi~p~a~~~~~a~~~~~v~I~pnvvIG~~v~IG~~~~I~pg~~iG~~v~IG~~~~i~~~v~i~~------------~~ 168 (336) T TIGR01853 101 EAGIHPTAVVEPSAKIGEGVTIGPNVVIGAGVEIGENVIIGPGVVIGDDVVIGDGSVIYPNVVIYE------------RV 168 (336) T ss_pred CCCCCCCCCCCCCCCCCCCCEECCEEEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEEEE------------CC T ss_conf 035676873056531078436875417888868778636357768768868799677806708972------------41 Q ss_pred CCCCCCCCCCCCCCCCC-----CCC--------CCCCCCCCCCCHHCCCCCCCC----CCCCCCCCCCCCCCCCCCCCEE Q ss_conf 22222221224432111-----122--------222212233200001665543----3100123443223452455321 Q gi|255764481|r 87 LVGKKCVIREGVTINRG-----TVE--------YGGKTIVGDNNFFLANSHVAH----DCKLGNGIVLSNNVMIAGHVIV 149 (271) Q Consensus 87 ~IG~~~~Ire~v~I~~g-----t~~--------~~~~T~IG~~~~i~~~~hI~H----d~~IG~~vii~n~~~l~G~v~I 149 (271) .||+||.|+.++.|-.. ... .-+..+|+|++-|++|++|.. |+.|++++-|-|.+++|.+|+| T Consensus 169 ~lG~nV~I~~GaVIG~DGFGya~~~~g~~~KI~q~G~ViI~D~VEIGANTtIDRg~~~dT~i~~gtKIDNLVQIAHNc~I 248 (336) T TIGR01853 169 QLGKNVIIHAGAVIGSDGFGYAHTANGGHVKIPQIGRVIIEDDVEIGANTTIDRGAFDDTVIGEGTKIDNLVQIAHNCRI 248 (336) T ss_pred EECCEEEECCCCEECCCCCCEEECCCCCEEEEEEECEEEEECEEEECCCCHHHCCCCCCCEEECCCEECCEEEEEEEEEE T ss_conf 56471587688788678732035788971544130248960233874687021102147187238466231574133462 Q ss_pred CCCEEECCCCCCCCCEEEECCCEEECCCCCCCC--CCCCCHHHHCCCCCCCCCCCHHHHCC--CCHHHHHHHHHHH Q ss_conf 486065234443365064047435013211223--33310001074532334420144428--8899999999999 Q gi|255764481|r 150 DDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD--VIPYGILNGNPGALRGVNVVAMRRAG--FSRDTIHLIRAVY 221 (271) Q Consensus 150 gd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd--Vpp~~~~~G~pa~~~~~N~~glkR~g--~s~e~i~~l~~ay 221 (271) |+|+.|-+.+++-.-++||+|+|+||.+.|... |-++.++.|--+-...+-.- -+.-| |..-+.++.+|.+ T Consensus 249 G~n~~ivaqvGiaGSt~iG~~V~~GGqvGv~GHl~IgD~v~i~A~sgV~~~~~~~-~~ygggi~Pa~~~~~w~k~~ 323 (336) T TIGR01853 249 GENCIIVAQVGIAGSTKIGRNVILGGQVGVAGHLEIGDNVTIGAKSGVTKSVPPN-GRYGGGIIPAMPQKEWLKIA 323 (336) T ss_pred CCCCEEEEEEECCEEEEECCEEEECCEEEEECEEEECCCCEEEEECCCCCCCCCC-CCCCEEECCCCHHHHHHHHH T ss_conf 6642789642011346755806962602651108943834884512206777988-60100111412024789999 No 14 >PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional Probab=99.97 E-value=8.5e-32 Score=219.32 Aligned_cols=181 Identities=21% Similarity=0.303 Sum_probs=128.8 Q ss_pred CCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCC Q ss_conf 05799756551088898599889899565598899899889898668689168468655311434223210233312222 Q gi|255764481|r 3 RMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFV 82 (271) Q Consensus 3 ~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~ 82 (271) +++.++.|.|.++|++.+.||+||+|||+|+| .|+.||++|.|.++++|. ++.||++|.|+|||.|.. T Consensus 267 ~ig~dv~I~~nvvi~G~v~IG~~v~Ig~g~~I-~ns~Ig~~~~I~~~S~Ie-~s~Ig~~~~IGPfA~lRp---------- 334 (456) T PRK09451 267 THGRDVEIDTNVIIEGNVTLGHRVKIGAGCVL-KNCVIGDDCEISPYSVVE-DANLGAACTIGPFARLRP---------- 334 (456) T ss_pred EECCCEEECCCEEEECCEEECCCEEECCCEEE-ECCEECCCCEEECCEEEC-CCEECCCCEECCCCCCCC---------- T ss_conf 96575077577088442475797598240798-576984687894225650-663436716888643488---------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCC-CEECCCEEECC Q ss_conf 2233222222212244321111222----222122332000016655433100123443223452455-32148606523 Q gi|255764481|r 83 GTELLVGKKCVIREGVTINRGTVEY----GGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGH-VIVDDRVVFGG 157 (271) Q Consensus 83 ~~~v~IG~~~~Ire~v~I~~gt~~~----~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~-v~Igd~~~IG~ 157 (271) ...|++++.|+.||-+...+... ...+.||| +.||.+|.||-+++.+|.....-| +.|||+|+||. T Consensus 335 --~t~i~~~~~iGnfvEiK~s~i~~g~k~~HlsYiGD-------a~iG~~~NiGAGtit~NyDG~~K~~t~igd~~fiGs 405 (456) T PRK09451 335 --GAELLEGAHVGNFVEMKKARLGKGSKAGHLTYLGD-------AEIGDNVNIGAGTITCNYDGANKFKTIIGDDVFVGS 405 (456) T ss_pred --CCEECCCCEEEEEEEEECCEECCCCEECCEEEECC-------CEECCCCEECCCEEEEECCCCCCCCCEECCCCEECC T ss_conf --76236788882289982459758977042336614-------476588688887699722487665648789829987 Q ss_pred CCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHC Q ss_conf 44433650640474350132112233331000107453233442014442 Q gi|255764481|r 158 GSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRA 207 (271) Q Consensus 158 ~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~ 207 (271) ++.+.++++||++|+||+||+|++|||+.+++.+ |.+..|+.|-+|+ T Consensus 406 n~~lvapv~iG~~a~i~aGs~it~dVp~~~l~i~---r~~q~~~~~~~~~ 452 (456) T PRK09451 406 DTQLVAPVTVGKGATIGAGTTVTRDVAENELVIS---RVPQRHIQGWQRP 452 (456) T ss_pred CCEEECCEEECCCCEECCCCEECCCCCCCCEEEC---CCCCCCCCCCCCC T ss_conf 8448358498899799989878850699977770---4662335555674 No 15 >TIGR03308 phn_thr-fam phosphonate metabolim protein, transferase hexapeptide repeat family. This family of proteins contains copies of the Bacterial transferase hexapeptide repeat family (pfam00132) and is only found in operons encoding the phosphonate C-P lyase system (GenProp0232). Many C-P lyase operons, however, lack a homolog of this protein. Probab=99.97 E-value=1.8e-29 Score=204.75 Aligned_cols=172 Identities=25% Similarity=0.334 Sum_probs=129.4 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCC Q ss_conf 50579975655108889859988989956559889989988989866868916846865531143422321023331222 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNF 81 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g 81 (271) ..++..+.|||+|.|. ++++|++|+||++|.| .++.|||+++|++++.|. +++||+.|.|.+++.|+... T Consensus 3 ~~l~~~p~Ihp~a~i~-~~~lG~~v~Ig~~~~i-~ns~IGDySyI~~~~~I~-n~~IGkfcsIa~~v~I~~~n------- 72 (204) T TIGR03308 3 VLLSPEPTLHPTAELT-ESKLGRYTEIGERTRL-REVALGDYSYVMRDCDII-YTTIGKFCSIAAMVRINATN------- 72 (204) T ss_pred CCCCCCCEECCCCEEC-CCCCCCCEEECCCCEE-ECCEECCCCEECCCCEEE-CCEECCCCEECCCCEECCCC------- T ss_conf 4347997689998683-2783783399999588-337999985888997598-88699984888987989877------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCC Q ss_conf 22233222222212244321111222222122332000016655433100123443223452455321486065234443 Q gi|255764481|r 82 VGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAV 161 (271) Q Consensus 82 ~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v 161 (271) ..-+.++.|.-+.. ++. ++. +... +...+.. .-...++||+++|||.+++| T Consensus 73 -----------Hp~~~~s~~~f~y~-------~~~-~~~-------~~~~-~~~~~~~--~~~~~~~Ig~dvwiG~~~~i 123 (204) T TIGR03308 73 -----------HPMERPTLHHFTYR-------AAM-YFD-------DASD-DADFFAW--RRAKRVTIGHDVWIGHGAVI 123 (204) T ss_pred -----------CCCCCCEECCEEEE-------CCC-CCC-------CCCC-CHHHHCC--CCCCCEEECCCCEECCCCEE T ss_conf -----------87667267777553-------753-346-------6666-1223213--34787098997688799899 Q ss_pred CCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHH Q ss_conf 36506404743501321122333310001074532334420144428889999999999 Q gi|255764481|r 162 HQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAV 220 (271) Q Consensus 162 ~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~a 220 (271) .++++||++|+|||||+|+||||||++++|+|||+. |+.|++++|..|.+. T Consensus 124 ~~gv~IG~gavigagsvVtkdv~~~~iv~G~PAk~i--------r~RF~~~~i~~L~~~ 174 (204) T TIGR03308 124 LPGVTIGNGAVIAAGAVVTKDVAPYTIVAGVPAKLI--------RRRFPPEIAARIEAL 174 (204) T ss_pred CCCCEECCCCEECCCCEECCCCCCCEEEEECCCEEE--------ECCCCHHHHHHHHHC T ss_conf 199799999899779989951799849984188566--------310999999999877 No 16 >COG1044 LpxD UDP-3-O-[3-hydroxymyristoyl] Probab=99.96 E-value=1.6e-28 Score=198.79 Aligned_cols=186 Identities=27% Similarity=0.451 Sum_probs=157.2 Q ss_pred CCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCC Q ss_conf 99756551088898599889899565598899899889898668689168468655311434223210233312222223 Q gi|255764481|r 6 NNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTE 85 (271) Q Consensus 6 ~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~ 85 (271) ..+.|||+|+|+|+|.++++|.|||+++|+++|+||+|++|+++++|..++.||++|.|+++++|+. . T Consensus 98 ~~~~I~~~A~i~p~A~i~~~~~ig~~vvI~~~v~IG~~~~I~~~avIg~~~~IG~~~~i~~~v~I~~------------~ 165 (338) T COG1044 98 PAAGIHPTAVIDPTATIGKNVSIGPNVVIGAGVVIGENVVIGAGAVIGENVKIGDGTVIHPNVTIYH------------N 165 (338) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEC------------C T ss_conf 5355476553177670377873288749888978789858988978888968888858847878952------------7 Q ss_pred CCCCCCCCCCCCCCCCCC------CCCC------CCCCCCCCCCHHCCCCCCCCC----CCCCCCCCCCCCCCCCCCCEE Q ss_conf 322222221224432111------1222------222122332000016655433----100123443223452455321 Q gi|255764481|r 86 LLVGKKCVIREGVTINRG------TVEY------GGKTIVGDNNFFLANSHVAHD----CKLGNGIVLSNNVMIAGHVIV 149 (271) Q Consensus 86 v~IG~~~~Ire~v~I~~g------t~~~------~~~T~IG~~~~i~~~~hI~Hd----~~IG~~vii~n~~~l~G~v~I 149 (271) +.||++|.|.+++.|... +..+ .+..+|||++.|+++++|.+. +.||+++.|.|.++++.+|+| T Consensus 166 ~~IG~~v~I~~GavIG~dgFg~a~~~~g~~Ki~q~g~V~Igd~VeIGanT~Idrga~~dTvIg~~~kIdN~vqIaHnv~I 245 (338) T COG1044 166 VVIGNNVIIHSGAVIGADGFGYAGTAIGWVKIPQIGRVIIGDDVEIGANTTIDRGALDDTVIGEGVKIDNLVQIGHNVRI 245 (338) T ss_pred CEECCCEEECCCCEECCCCCCCCCCCCCCEECCEECEEEECCCEEECCCCEECCCCCCCCEECCCCEECCEEEECCCCEE T ss_conf 57878228889988756852122466885671260659979824873552425655567265378487643078155278 Q ss_pred CCCEEECCCCCCCCCEEEECCCEEECCCCCCCC--CCCCCHHHHCCCCCCCCCCCH Q ss_conf 486065234443365064047435013211223--333100010745323344201 Q gi|255764481|r 150 DDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD--VIPYGILNGNPGALRGVNVVA 203 (271) Q Consensus 150 gd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd--Vpp~~~~~G~pa~~~~~N~~g 203 (271) |+++.|.+++.+-+.++||+|+++|+.+.+... |-.++...+.-+....+-..+ T Consensus 246 G~~~~I~~~vgIaGs~~IG~~v~igg~vgI~gh~~IgD~~~I~~~~~v~~~i~~~~ 301 (338) T COG1044 246 GEHCIIAGQVGIAGSVKIGKYVIIGGQVGIAGHLEIGDGVTIGARSGVMASITEPG 301 (338) T ss_pred CCCCEEECCCEEECCCEECCEEEECCCEEECCCEEECCCCEEECCCCCCCCCCCCC T ss_conf 88837702631613535778079876134648418757878804524312357886 No 17 >COG1207 GlmU N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane] Probab=99.96 E-value=6.1e-29 Score=201.47 Aligned_cols=181 Identities=24% Similarity=0.352 Sum_probs=119.6 Q ss_pred CCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCC Q ss_conf 05799756551088898599889899565598899899889898668689168468655311434223210233312222 Q gi|255764481|r 3 RMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFV 82 (271) Q Consensus 3 ~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~ 82 (271) .|++++.|+|..++....+||++|.|||+|+| .|+.||+|+.|.++++|. +++||++|.|+|||.|.. T Consensus 270 ~ig~DvvI~p~v~l~G~t~ig~~v~iGpg~~i-~ds~I~~~a~I~~~S~ie-~s~vg~~~~VGPfA~LRP---------- 337 (460) T COG1207 270 EIGRDVVIEPNVILEGNTVIGDNVVIGPGSVI-KDSVIGDNAVIKAYSVIE-GSTVGEGATVGPFARLRP---------- 337 (460) T ss_pred EECCCEEEECCCEEEEEEEECCCEEECCCCEE-EEEEECCCCEEEECCEEE-CCEECCCCCCCCCCCCCC---------- T ss_conf 98781499148489415896796499999678-753876998897510430-558547963378310179---------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCC----CCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCC-CEECCCEEECC Q ss_conf 223322222221224432111122----2222122332000016655433100123443223452455-32148606523 Q gi|255764481|r 83 GTELLVGKKCVIREGVTINRGTVE----YGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGH-VIVDDRVVFGG 157 (271) Q Consensus 83 ~~~v~IG~~~~Ire~v~I~~gt~~----~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~-v~Igd~~~IG~ 157 (271) ...+|+++.|+.||-+...+.. -+..|.||| +.||.+|.||-++|.+|.-..--| ..|||++|||+ T Consensus 338 --g~~L~~~~hIGNFVEvK~a~ig~gsKa~HLtYlGD-------A~iG~~~NiGAGtItcNYDG~nK~~T~IGd~vFiGS 408 (460) T COG1207 338 --GAVLGADVHIGNFVEVKKATIGKGSKAGHLTYLGD-------AEIGENVNIGAGTITCNYDGKNKFKTIIGDNVFIGS 408 (460) T ss_pred --CCCCCCCCEEEEEEEEECCCCCCCCCCCCEEEECC-------CEECCCCEECCCEEEECCCCCCCCEEEECCCCEECC T ss_conf --68526797674259984356068863353234213-------320687421442599847886332006468857766 Q ss_pred CCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHC Q ss_conf 44433650640474350132112233331000107453233442014442 Q gi|255764481|r 158 GSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRA 207 (271) Q Consensus 158 ~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~ 207 (271) ++.+..+++||++++|||||+||+|||+.+++.+ |.+..|+.|-.|+ T Consensus 409 ns~LVAPV~IGd~a~iaAGStIT~DVp~~aLai~---RarQ~~~egw~~~ 455 (460) T COG1207 409 NSQLVAPVTIGDGATIAAGSTITKDVPEGALAIS---RARQTNKEGWVRK 455 (460) T ss_pred CCCEEEEEEECCCCEECCCCEECCCCCCCCEEEE---ECCEEECCCCCCC T ss_conf 8718864896698488146368365799844674---0441432442145 No 18 >cd04745 LbH_paaY_like paaY-like: This group is composed by uncharacterized proteins with similarity to the protein product of the E. coli paaY gene, which is part of the paa gene cluster responsible for phenylacetic acid degradation. Proteins in this group are expected to adopt the left-handed parallel beta-helix (LbH) structure. They contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Similarity to gamma carbonic anhydrase and Ferripyochelin Binding Protein (FBP) may suggest metal binding capacity. Probab=99.95 E-value=6.1e-28 Score=195.20 Aligned_cols=105 Identities=14% Similarity=0.271 Sum_probs=82.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCC Q ss_conf 33222222212244321111222222122332000016655433100123443223452455321486065234443365 Q gi|255764481|r 85 ELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQF 164 (271) Q Consensus 85 ~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~ 164 (271) .+.||+++.|+++|+||. +.+|+|.||+++.++.++.+.| ++|||++|||.+++|+++ T Consensus 39 ~i~IG~~~~I~~n~~Ih~---------------------~~~~~~~Ig~~~~ig~~~~~~g-~~Ig~~~~IG~~a~I~~G 96 (155) T cd04745 39 RIVIRDGANVQDNCVIHG---------------------FPGQDTVLEENGHIGHGAILHG-CTIGRNALVGMNAVVMDG 96 (155) T ss_pred CEEECCCCEECCCCEECC---------------------CCCCCCEECCCCEECCCCEECC-CEEEEEEEECCCCEEECC T ss_conf 789999979787844013---------------------5999869899978998748715-177432497687599597 Q ss_pred EEEECCCEEECCCCCCC--CCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHH Q ss_conf 06404743501321122--333310001074532334420144428889999999999 Q gi|255764481|r 165 TRIGKYAFIGGMTGVVH--DVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAV 220 (271) Q Consensus 165 v~IG~~a~ig~gs~V~k--dVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~a 220 (271) ++||++|+||+||+|++ ||||++++.|+|||+. +.+++|+++.+++. T Consensus 97 ~~IG~~siIgagSvV~~~~~VP~~~l~~GnPAk~i---------r~l~~~e~~~~~~~ 145 (155) T cd04745 97 AVIGEESIVGAMAFVKAGTVIPPRSLIAGSPAKVI---------RELSDEEVAWKTRG 145 (155) T ss_pred CEECCCCEEEECCEECCCCCCCCCEEEEEECCEEE---------ECCCHHHHHHHHHH T ss_conf 79899979941878769968379989997286797---------43999999999999 No 19 >cd04650 LbH_FBP Ferripyochelin Binding Protein (FBP): FBP is an outer membrane protein which plays a role in iron acquisition. It binds iron when it is complexed with pyochelin. It adopts the left-handed parallel beta-helix (LbH) structure, and contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Acyltransferase activity has not been observed in this group. Probab=99.95 E-value=5.8e-28 Score=195.35 Aligned_cols=108 Identities=20% Similarity=0.401 Sum_probs=82.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCC Q ss_conf 33222222212244321111222222122332000016655433100123443223452455321486065234443365 Q gi|255764481|r 85 ELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQF 164 (271) Q Consensus 85 ~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~ 164 (271) .+.||++|.|+++++||.. -+|++.||++|.++.++.+.+ ++|||++|||.+++|.++ T Consensus 39 ~i~IG~~~~I~~n~~I~~~---------------------~~~~~~ig~~v~ig~~~~i~g-~~Ig~~v~IG~~a~I~~G 96 (154) T cd04650 39 SIYIGKYSNVQENVSIHTD---------------------HGYPTEIGDYVTIGHNAVVHG-AKVGNYVIVGMGAILLNG 96 (154) T ss_pred CEEECCCEEECCCCEEEEC---------------------CCCCCEECCCCEECCCEEEEC-CEECCEEEECCCEEEECC T ss_conf 6599993398999889846---------------------999868799768899528823-022013598872299628 Q ss_pred EEEECCCEEECCCCCC--CCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHH Q ss_conf 0640474350132112--2333310001074532334420144428889999999999999 Q gi|255764481|r 165 TRIGKYAFIGGMTGVV--HDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQ 223 (271) Q Consensus 165 v~IG~~a~ig~gs~V~--kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~ 223 (271) ++||++|+|||||+|+ +|||||++++|+|||+. +.++++++..+++.-+. T Consensus 97 v~IG~~svIgaGsVV~~~~~Vp~~~lv~G~PAr~i---------r~l~~~~~~~~~~~~~~ 148 (154) T cd04650 97 AKIGDHVIIGAGAVVTPGKEIPDYSLVLGVPAKVV---------RKLTEEEIEWIKKNAEE 148 (154) T ss_pred CEECCCCEECCCCEECCCCEECCCCEEEECCEEEE---------EECCHHHHHHHHHHHHH T ss_conf 79899979988988879979199849996181897---------72999999999999999 No 20 >cd04645 LbH_gamma_CA_like Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E. coli paaY protein, and similar proteins. CAs are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionary distinct groups - alpha, beta and gamma carbonic anhydrases - which show no significant sequence identity or structural similarity. Gamma CAs are trimeric enzymes with left-handed parallel beta helix (LbH) structural domain. Probab=99.95 E-value=7.3e-28 Score=194.74 Aligned_cols=143 Identities=17% Similarity=0.335 Sum_probs=98.4 Q ss_pred EECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCC Q ss_conf 56551088898599889899565598899899889898668689168468655311434223210233312222223322 Q gi|255764481|r 9 IIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLV 88 (271) Q Consensus 9 ~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~I 88 (271) .|||+|+|+|+|.|..+|+|| ++|+|+++++|+++ ...+.| T Consensus 1 ~I~~~a~I~p~A~I~G~V~IG------------~~~~I~~~~~I~~d---------------------------~~~i~I 41 (153) T cd04645 1 EIDPSAFIAPNATVIGDVTLG------------EGSSVWFGAVLRGD---------------------------VNPIRI 41 (153) T ss_pred CCCCCEEECCCCEEECCEEEC------------CCCEECCCCEEECC---------------------------CCCEEE T ss_conf 918981999999997726998------------99999658384246---------------------------663599 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEE Q ss_conf 22222122443211112222221223320000166554331001234432234524553214860652344433650640 Q gi|255764481|r 89 GKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIG 168 (271) Q Consensus 89 G~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG 168 (271) |+++.|+++++||... .|++.||++|.++.++.+.| ++|||++|||.++.|+++++|| T Consensus 42 G~~~~I~~~~~Ih~~~---------------------~~~~~IG~~v~Ig~~~~i~g-~~Ig~~~~IG~~a~I~~gv~IG 99 (153) T cd04645 42 GERTNIQDGSVLHVDP---------------------GYPTIIGDNVTVGHGAVLHG-CTIGDNCLIGMGAIILDGAVIG 99 (153) T ss_pred CCCCEECCCCCCCCCC---------------------CCCCEECCCCEECCCCEEEC-EEEEEEEEECCCCEECCCCEEC T ss_conf 9998989685435259---------------------99979899989999959924-1873004770543985699988 Q ss_pred CCCEEECCCCCC--CCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHH Q ss_conf 474350132112--23333100010745323344201444288899999999999 Q gi|255764481|r 169 KYAFIGGMTGVV--HDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVY 221 (271) Q Consensus 169 ~~a~ig~gs~V~--kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ay 221 (271) ++|+||+||+|+ +|||||+++.|+|||+. +-++++++..+++.- T Consensus 100 ~~~vIgagsvV~~~~~vp~~~v~~G~PAr~i---------r~~~~~e~~~~~~~~ 145 (153) T cd04645 100 KGSIVAAGSLVPPGKVIPPGSLVAGSPAKVV---------RELTDEEIAELRESA 145 (153) T ss_pred CCCEECCCCEECCCCEECCCCEEEECCCEEE---------CCCCHHHHHHHHHHH T ss_conf 9979944989748979699859996283797---------159999999999999 No 21 >TIGR03532 DapD_Ac 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase. Alternate name: tetrahydrodipicolinate N-acetyltransferase. Note that IUBMB lists this alternate name as the accepted name. Unfortunately, the related succinyl transferase acting on the same substrate (EC:2.3.1.117, TIGR00695) uses the opposite standard. We have decided to give these two enzymes names which more clearly indicated that they act on the same substrate. Probab=99.92 E-value=4.9e-25 Score=177.03 Aligned_cols=127 Identities=28% Similarity=0.415 Sum_probs=96.6 Q ss_pred CCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCC Q ss_conf 99756551088898599889899565598899899889898668689168468655311434223210233312222223 Q gi|255764481|r 6 NNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTE 85 (271) Q Consensus 6 ~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~ 85 (271) -++.|+|.|+|.+.+.||+|+.|.+.|+|..++.||++|.|..++++.+.+.||++|+|+++++|++- T Consensus 85 ~~aRIepgaiir~~v~IG~~aVImmga~INiGA~IGegtmId~~Avvg~~a~IGk~~HisaGa~laGV------------ 152 (231) T TIGR03532 85 INARIEPGAIIRDQVIIGDNAVIMMGAVINIGAEIGEGTMIDMNAVLGGRATVGKNVHIGAGAVLAGV------------ 152 (231) T ss_pred CCCEECCCCEEECCEEECCCEEEECCCEEECCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECC------------ T ss_conf 78357788588425298798399138887048598789458058564454688797058898643002------------ Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCE Q ss_conf 32222222122443211112222221223320000166554331001234432234524553214860652344433650 Q gi|255764481|r 86 LLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFT 165 (271) Q Consensus 86 v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v 165 (271) +.+. + +..|.|+|+||||+++.|.+++ T Consensus 153 --------------leP~-----~----------------------------------a~PViIeDnV~IGAnAvIl~GV 179 (231) T TIGR03532 153 --------------IEPP-----S----------------------------------AKPVVIEDNVLIGANAVILEGV 179 (231) T ss_pred --------------CCCC-----C----------------------------------CCCEEECCCEEECCCCEECCCC T ss_conf --------------2576-----6----------------------------------7997987885987997991998 Q ss_pred EEECCCEEECCCCCCCCCCCCCHHHHCCCCCC Q ss_conf 64047435013211223333100010745323 Q gi|255764481|r 166 RIGKYAFIGGMTGVVHDVIPYGILNGNPGALR 197 (271) Q Consensus 166 ~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~ 197 (271) +||++|+||+||+|++|||||++++|+|||+. T Consensus 180 ~IG~gsVIgAGsVVtkdVP~~sVVaG~PAKII 211 (231) T TIGR03532 180 RVGKGAVVAAGAIVTEDVPPNTVVAGVPAKVI 211 (231) T ss_pred EECCCCEECCCCEECCCCCCCCEEEECCEEEE T ss_conf 99999999849889565799979985481999 No 22 >PRK13627 carnitine operon protein CaiE; Provisional Probab=99.92 E-value=2.3e-25 Score=179.13 Aligned_cols=146 Identities=23% Similarity=0.338 Sum_probs=104.8 Q ss_pred CCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCC Q ss_conf 79975655108889859988989956559889989988989866868916846865531143422321023331222222 Q gi|255764481|r 5 GNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGT 84 (271) Q Consensus 5 ~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~ 84 (271) +..+.|||+|+|+|+| +|-|+++||++|.||++|+|-+ +.. T Consensus 8 g~~P~i~~~afIap~A------------------------------~viGdV~ig~~~sIw~~aVlRG---------D~~ 48 (196) T PRK13627 8 GLIPVVHPTAFVHPSA------------------------------VLIGDVIVGAGVYIGPLASLRG---------DYG 48 (196) T ss_pred CCCCCCCCCEEECCCC------------------------------EEEEEEEECCCCEECCCEEEEC---------CCC T ss_conf 9587229983889999------------------------------8998499999988977708956---------887 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCC Q ss_conf 33222222212244321111222222122332000016655433100123443223452455321486065234443365 Q gi|255764481|r 85 ELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQF 164 (271) Q Consensus 85 ~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~ 164 (271) .++||+++.|+++++||... +..|.|| ++|.++.++.|.| |+|+|+++||.+++|+.+ T Consensus 49 ~I~IG~~tNIQD~~viH~~~---~~~~~IG------------------~~vtIGH~ailhg-c~Igd~~lIGmgavVldg 106 (196) T PRK13627 49 RLIVQAGANLQDGCIMHGYC---DTDTIVG------------------ENGHIGHGAILHG-CVIGRDALVGMNSVIMDG 106 (196) T ss_pred CEEECCCCEECCCCEEECCC---CCCCEEC------------------CCEEECCCEEEEE-EEEECCCEECCCCEEECC T ss_conf 17989986767983781489---9985888------------------9805878729963-499489788469799288 Q ss_pred EEEECCCEEECCCCCCCC--CCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHH Q ss_conf 064047435013211223--33310001074532334420144428889999999999 Q gi|255764481|r 165 TRIGKYAFIGGMTGVVHD--VIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAV 220 (271) Q Consensus 165 v~IG~~a~ig~gs~V~kd--Vpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~a 220 (271) ++||++|||||||+|+.. +||++|+.|+|||+. |-++++++..++.- T Consensus 107 a~Ig~~~iI~AgslV~~g~~ip~~~L~~G~PAk~v---------R~lt~~ei~~~~~~ 155 (196) T PRK13627 107 AVIGEESIVAAMSFVKAGFQGEKRQLLMGTPARAV---------RSVSDQELHWKRLN 155 (196) T ss_pred CEECCCCEECCCCEECCCCCCCCCCEEEECCEEEE---------ECCCHHHHHHHHHH T ss_conf 89999989976864579949389819987781681---------14999999998876 No 23 >TIGR03570 NeuD_NnaD sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family. These proteins contain repeats of the bacterial transferase hexapeptide (pfam00132), although often these do not register above the trusted cutoff. Probab=99.92 E-value=2.6e-25 Score=178.76 Aligned_cols=102 Identities=29% Similarity=0.507 Sum_probs=85.4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCE Q ss_conf 32222222122443211112222221223320000166554331001234432234524553214860652344433650 Q gi|255764481|r 86 LLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFT 165 (271) Q Consensus 86 v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v 165 (271) +.||+++.|.++++|. ..++||++|++..+++|+|||.||++|.++.++.++|+|+||++++||.+++|.|.+ T Consensus 100 ~~ig~G~~I~~~~~i~-------~~~~IG~~~~i~~~~~I~Hd~~ig~~~~i~~~~~i~G~v~Ig~~~~iG~~~~i~~~i 172 (201) T TIGR03570 100 ASIGEGTVIMAGAVIN-------PDVRIGDNVIINTGAIVEHDCVIGDYVHIAPGVTLSGGVVIGEGVFIGAGATIIQGV 172 (201) T ss_pred CEECCCCEEECCCEEC-------CCCEEEEEEEECCCCCCCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCC T ss_conf 6787998996286987-------664780267980423214425477876899985998988797773598988988998 Q ss_pred EEECCCEEECCCCCCCCCCCCCHHHHCCC Q ss_conf 64047435013211223333100010745 Q gi|255764481|r 166 RIGKYAFIGGMTGVVHDVIPYGILNGNPG 194 (271) Q Consensus 166 ~IG~~a~ig~gs~V~kdVpp~~~~~G~pa 194 (271) +||++|+|||||+|+||||+.+++.|+|| T Consensus 173 ~Ig~~~~Igags~V~~~v~~~~~~~G~PA 201 (201) T TIGR03570 173 TIGAGAIVGAGAVVTKDIPDGGVVVGVPA 201 (201) T ss_pred EECCCCEECCCCEEECCCCCCCEEEEECC T ss_conf 99999999979889043799979997587 No 24 >cd03358 LbH_WxcM_N_like WcxM-like, Left-handed parallel beta-Helix (LbH) N-terminal domain: This group is composed of Xanthomonas campestris WcxM and proteins with similarity to the WcxM N-terminal domain. WcxM is thought to be bifunctional, catalyzing both the isomerization and transacetylation reactions of keto-hexoses. It contains an N-terminal LbH domain responsible for the transacetylation function and a C-terminal isomerase domain. The LbH domain contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), typical of enzymes with acyltransferase activity. Probab=99.91 E-value=1.4e-24 Score=174.22 Aligned_cols=119 Identities=26% Similarity=0.395 Sum_probs=98.7 Q ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98898995655988998998898986686891684686553114342232102333122222233222222212244321 Q gi|255764481|r 22 IGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTIN 101 (271) Q Consensus 22 Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~ 101 (271) ||+||.|+|+|+|++++.||++|+|++++.|..+++||++|.|++++++..++.. +. . T Consensus 1 IG~~~~Ig~~~~I~~~~~IG~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~~~~~~~~------------~~--------~-- 58 (119) T cd03358 1 IGDNCIIGTNVFIENDVKIGDNVKIQSNVSIYEGVTIEDDVFIGPNVVFTNDLYP------------RS--------K-- 58 (119) T ss_pred CCCCCEECCCCEECCCCEECCCEEECCCCCCCCCCEEEEEEEECCCCCCCCCCCC------------CC--------C-- T ss_conf 9899899999899999999999699998636789998437797788633044466------------76--------4-- Q ss_pred CCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCC Q ss_conf 11122222212233200001665543310012344322345245532148606523444336506404743501321122 Q gi|255764481|r 102 RGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVH 181 (271) Q Consensus 102 ~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~k 181 (271) ..+ . ...++++|+|++|||.+++|+|+++||++|+|||+|+|+| T Consensus 59 -----------~~~------------~-------------~~~~~v~Ig~~v~IG~~~~I~~gv~IG~~~~IgagSvV~~ 102 (119) T cd03358 59 -----------IYR------------K-------------WELKGTTVKRGASIGANATILPGVTIGEYALVGAGAVVTK 102 (119) T ss_pred -----------CCC------------C-------------CCCCCCEECCCEEECCCCEEECCEEECCCCEECCCCEECC T ss_conf -----------346------------7-------------6324839878838488747789949899989977998903 Q ss_pred CCCCCCHHHHCCCCCCC Q ss_conf 33331000107453233 Q gi|255764481|r 182 DVIPYGILNGNPGALRG 198 (271) Q Consensus 182 dVpp~~~~~G~pa~~~~ 198 (271) |||||++++|+|||+.| T Consensus 103 dv~~~~i~~G~PAk~IG 119 (119) T cd03358 103 DVPPYALVVGNPARIIG 119 (119) T ss_pred CCCCCCEEEEECCEEEC T ss_conf 65999799974979919 No 25 >cd03352 LbH_LpxD UDP-3-O-acyl-glucosamine N-acyltransferase (LpxD): The enzyme catalyzes the transfer of 3-hydroxymyristic acid or 3-hydroxy-arachidic acid, depending on the organism, from the acyl carrier protein (ACP) to UDP-3-O-acyl-glucosamine to produce UDP-2,3-diacyl-GlcNAc. This constitutes the third step in the lipid A biosynthetic pathway in Gram-negative bacteria. LpxD is a homotrimer, with each subunit consisting of a novel combination of an N-terminal uridine-binding domain, a core lipid-binding left-handed parallel beta helix (LbH) domain, and a C-terminal alpha-helical extension. The LbH domain contains 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=99.91 E-value=2.8e-24 Score=172.35 Aligned_cols=160 Identities=27% Similarity=0.419 Sum_probs=136.8 Q ss_pred CCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 85998898995655988998998898986686891684686553114342232102333122222233222222212244 Q gi|255764481|r 19 GAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGV 98 (271) Q Consensus 19 ~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v 98 (271) +|+|+++|.|+|+++|+++|+||++++|+|+++|..+++||++|.|+++++|+. .+.||++|.|++++ T Consensus 1 ~a~ig~~v~I~~~avI~~~v~Ig~n~~I~~~~~I~~~v~IG~~~~I~~~~~I~~------------~~~IG~~~~I~~~~ 68 (205) T cd03352 1 SAKIGENVSIGPNAVIGEGVVIGDGVVIGPGVVIGDGVVIGDDCVIHPNVTIYE------------GCIIGDRVIIHSGA 68 (205) T ss_pred CCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECC------------CCEECCEEEECCCE T ss_conf 969199999999879999999999999989889999968898836600344727------------86875447771534 Q ss_pred CCCCCCC----C--------CCCCCCCCCCCHHCCCCCCC----CCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCC Q ss_conf 3211112----2--------22221223320000166554----331001234432234524553214860652344433 Q gi|255764481|r 99 TINRGTV----E--------YGGKTIVGDNNFFLANSHVA----HDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVH 162 (271) Q Consensus 99 ~I~~gt~----~--------~~~~T~IG~~~~i~~~~hI~----Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~ 162 (271) .|....+ . ..+.+.||+++.|.+++.|. .++.||+++.+.+.+.++.+|.||++++|..++.+. T Consensus 69 vIg~~~f~~~~~~~~~~~~~~~g~v~Ig~~v~Ig~~~~I~~g~~~~T~IG~~~~i~~~v~igh~~~iG~~~~i~~~~~~~ 148 (205) T cd03352 69 VIGSDGFGFAPDGGGWVKIPQLGGVIIGDDVEIGANTTIDRGALGDTVIGDGTKIDNLVQIAHNVRIGENCLIAAQVGIA 148 (205) T ss_pred EECCCCCCCCCCCCCCEECCCCCCEEECCCEEECCCCEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCEEECCCCCCC T ss_conf 86213532314798404514213189848849888627636642677478776864545255640415870750320002 Q ss_pred CCEEEECCCEEECCCCCCCC--CCCCCHHH Q ss_conf 65064047435013211223--33310001 Q gi|255764481|r 163 QFTRIGKYAFIGGMTGVVHD--VIPYGILN 190 (271) Q Consensus 163 ~~v~IG~~a~ig~gs~V~kd--Vpp~~~~~ 190 (271) ..++||++|++|++|.+... |-+++++. T Consensus 149 g~~~Igd~~~iG~~s~i~~gv~IG~~a~Vg 178 (205) T cd03352 149 GSTTIGDNVIIGGQVGIAGHLTIGDGVVIG 178 (205) T ss_pred CCEEECCCCEECCCCEECCCCEECCCCEEC T ss_conf 652497435398964882993999998998 No 26 >PRK05289 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=99.91 E-value=7e-24 Score=169.82 Aligned_cols=163 Identities=20% Similarity=0.323 Sum_probs=127.9 Q ss_pred CCCCCCCCEECCCE------------EECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCC------------C Q ss_conf 95057997565510------------88898599889899565598899899889898668689168------------4 Q gi|255764481|r 1 MSRMGNNPIIHPLA------------LVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGK------------T 56 (271) Q Consensus 1 ~~~i~~~~~IhpsA------------~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~------------t 56 (271) |++|..++.|||+| +|.++++||+++.|+|+++|.++++||++|+|+|+++|.+. + T Consensus 2 ~~~IHptAiI~~~A~IG~~v~Igp~~vIg~~v~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v 81 (261) T PRK05289 2 MAKIHPTAIVEPGAEIGENVEIGPFCVIGPDVVIGDGTEIGSHVVIDGHTTIGKNNRIFPFASIGEEPQDLKYKGEPTRL 81 (261) T ss_pred CCCCCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCCCCCCCCCCCCCCCCCCCEE T ss_conf 57408988989989999989999999999999999999997996891898889883452475337787666606985369 Q ss_pred EEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCC Q ss_conf 68655311434223210233312222223322222221224432111122222212233200001665543310012344 Q gi|255764481|r 57 KIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIV 136 (271) Q Consensus 57 ~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vi 136 (271) .||++|.|.++++|.... ..+.....||++|.|-.++.|- ....||++|.+.+++.++..|.||++++ T Consensus 82 ~IGd~~~Ire~vtI~rgT-----~~~~g~T~IG~~~~im~~vhIa-------Hd~~IG~~~iian~~~laG~v~Igd~v~ 149 (261) T PRK05289 82 VIGDNNTIREFVTINRGT-----VQGGGVTRIGDNNLLMAYVHVA-------HDCVVGNNVILANNATLAGHVEVGDYAI 149 (261) T ss_pred EECCCCEECCCCEECCCC-----CCCCCCEEECCCCEEECCCCCC-------CCCCCCCCCEECCCEEECCCEEECCCEE T ss_conf 989977898541443661-----0588644988885440036434-------4551278716543056646079889709 Q ss_pred CCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCC Q ss_conf 32234524553214860652344433650640474350132 Q gi|255764481|r 137 LSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMT 177 (271) Q Consensus 137 i~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs 177 (271) |+.++.+..+++||+++.+|+++.|.+. |-.++++.+.- T Consensus 150 iGg~~~v~q~v~IG~~a~vgagS~V~kD--Vpp~~~v~G~P 188 (261) T PRK05289 150 IGGLSGVHQFVRIGAHAMVGGMSAVVQD--VPPYVLAEGNP 188 (261) T ss_pred ECCCCEECCCCEECCCEEECCCCEECCC--CCCCCEEECCC T ss_conf 9761566488398987199976365424--79986784683 No 27 >cd05636 LbH_G1P_TT_C_like Putative glucose-1-phosphate thymidylyltransferase, C-terminal Left-handed parallel beta-Helix (LbH) domain: Proteins in this family show simlarity to glucose-1-phosphate adenylyltransferases in that they contain N-terminal catalytic domains that resemble a dinucleotide-binding Rossmann fold and C-terminal LbH fold domains. Members in this family are predicted to be glucose-1-phosphate thymidylyltransferases, which are involved in the dTDP-L-rhamnose biosynthetic pathway. Glucose-1-phosphate thymidylyltransferase catalyzes the synthesis of deoxy-thymidine di-phosphate (dTDP)-L-rhamnose, an important component of the cell wall of many microorganisms. The C-terminal LbH domain contains multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.91 E-value=6.5e-24 Score=170.03 Aligned_cols=152 Identities=16% Similarity=0.141 Sum_probs=114.0 Q ss_pred CCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCC Q ss_conf 99756551088898599889899565598899899889898668689168468655311434223210233312222223 Q gi|255764481|r 6 NNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTE 85 (271) Q Consensus 6 ~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~ 85 (271) ....|||+|.|++.+.|++++.|+|+|+|+++|.||++|+|++++.|++++.||++|+|++++.|.. T Consensus 4 ~~~~I~p~a~I~~~v~Ig~~a~I~~~a~I~~~v~IG~~~~Ig~~~~I~~~~~IG~~~~IG~~~~I~~------------- 70 (163) T cd05636 4 IEGTVEEGVTIKGPVWIGEGAIVRSGAYIEGPVIIGKGCEIGPNAYIRGYTVLGDGCVVGNSVEVKN------------- 70 (163) T ss_pred EEEEECCCCEECCCEEECCCCEECCCCEEECCEEECCCCEEEECCCCCCCEEECCEEEECCCEEECC------------- T ss_conf 2679999999989819999999999999979759989989936412269858613012055124212------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCC--------CCCCCCCCCCCCCCCCCCEECCCEEECC Q ss_conf 32222222122443211112222221223320000166554331--------0012344322345245532148606523 Q gi|255764481|r 86 LLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDC--------KLGNGIVLSNNVMIAGHVIVDDRVVFGG 157 (271) Q Consensus 86 v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~--------~IG~~vii~n~~~l~G~v~Igd~~~IG~ 157 (271) ..|++++.+..++.+ +.+.||++++|.+++.++-.. .+.+. .+....... .+.|||+++||+ T Consensus 71 ~vi~~~~~i~~~~~i--------g~siIG~~v~IGagtvi~n~~~~~~~~~v~~~~~-~~~~~~~~~-G~vIGd~~~IG~ 140 (163) T cd05636 71 SIIMDGTKVPHLNYV--------GDSVLGENVNLGAGTITANLRFDDKPVKVRLKGE-RVDTGRRKL-GAIIGDGVKTGI 140 (163) T ss_pred CCCCCCCCCCCCCEE--------CCCEECCCCEECCCEEEECCCCCCCCEEEEECCE-EECCCCEEC-CCEECCCCEECC T ss_conf 401478432775687--------7729929969889837817767886347840770-623677040-779779979998 Q ss_pred CCCCCCCEEEECCCEEECCCCCC Q ss_conf 44433650640474350132112 Q gi|255764481|r 158 GSAVHQFTRIGKYAFIGGMTGVV 180 (271) Q Consensus 158 ~~~v~~~v~IG~~a~ig~gs~V~ 180 (271) ++++.++++||++|+||+||+|. T Consensus 141 ns~I~~G~~IG~~a~IgaGsvVr 163 (163) T cd05636 141 NVSLNPGVKIGPGSWVYPGCVVR 163 (163) T ss_pred CCEECCCEEECCCCEECCCCEEC T ss_conf 98986995999998999997969 No 28 >cd03350 LbH_THP_succinylT 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (also called THP succinyltransferase): THDP N-succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is homotrimeric and each subunit contains an N-terminal region with alpha helices and hairpin loops, as well as a C-terminal region with a left-handed parallel alpha-helix (LbH) structural motif encoded by hexapeptide repeat motifs. Probab=99.90 E-value=9.3e-24 Score=169.05 Aligned_cols=119 Identities=25% Similarity=0.297 Sum_probs=105.4 Q ss_pred CEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCC Q ss_conf 75655108889859988989956559889989988989866868916846865531143422321023331222222332 Q gi|255764481|r 8 PIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELL 87 (271) Q Consensus 8 ~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~ 87 (271) ..|||+|+|+++|+||+++.|+|+++|+.++.||++|.|.+++.|..++.||++|.|+|+++|++.. T Consensus 2 ~rI~P~A~I~~ga~IG~g~vI~~~a~I~~g~~IG~~~~I~~~~~Ig~~~~IG~~v~I~~g~~i~g~~------------- 68 (139) T cd03350 2 RRVPPGAIIRDGAFIGPGAVLMMPSYVNIGAYVDEGTMVDSWATVGSCAQIGKNVHLSAGAVIGGVL------------- 68 (139) T ss_pred EEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCC------------- T ss_conf 0749982988999999998998899984398989872786666026866999932898886784442------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEE Q ss_conf 22222212244321111222222122332000016655433100123443223452455321486065234443365064 Q gi|255764481|r 88 VGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRI 167 (271) Q Consensus 88 IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~I 167 (271) .+.. .-++.|+|++|||+++.|.+.++| T Consensus 69 ------------------~p~~----------------------------------~~~v~IeD~~~IGa~a~V~~Gv~I 96 (139) T cd03350 69 ------------------EPLQ----------------------------------ATPVIIEDDVFIGANCEVVEGVIV 96 (139) T ss_pred ------------------CCCC----------------------------------CCCCEECCCEEEEECCEEECCCEE T ss_conf ------------------5764----------------------------------688688799599309989468799 Q ss_pred ECCCEEECCCCCCCCCCCCCHHHH Q ss_conf 047435013211223333100010 Q gi|255764481|r 168 GKYAFIGGMTGVVHDVIPYGILNG 191 (271) Q Consensus 168 G~~a~ig~gs~V~kdVpp~~~~~G 191 (271) |++|+|||||+||||+|||.++.| T Consensus 97 G~~avIGAGsVVtkdvp~y~~~~g 120 (139) T cd03350 97 GKGAVLAAGVVLTQSTPIYDRETG 120 (139) T ss_pred CCCCEECCCCEEECCCCCEEEECC T ss_conf 999899979889189884682378 No 29 >cd03360 LbH_AT_putative Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain. The alignment contains 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. A few members are identified as NeuD, a sialic acid (Sia) O-acetyltransferase that is required for Sia synthesis and surface polysaccharide sialylation. Probab=99.90 E-value=2e-24 Score=173.26 Aligned_cols=84 Identities=29% Similarity=0.500 Sum_probs=78.1 Q ss_pred CCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHH Q ss_conf 21223320000166554331001234432234524553214860652344433650640474350132112233331000 Q gi|255764481|r 110 KTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGIL 189 (271) Q Consensus 110 ~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~ 189 (271) .++||++|.+..+++|+|||.||++|.++.++.++|+|+|||+++||.+++|.|.++||++++|||||+|+||||+.+++ T Consensus 114 ~~~IG~~~~i~~~~~i~Hd~~ig~~~~i~~~~~i~g~v~Ig~~~~iG~~~~i~~~i~Ig~~~~igags~v~~~i~~~~~~ 193 (197) T cd03360 114 DARIGDNVIINTGAVIGHDCVIGDFVHIAPGVVLSGGVTIGEGAFIGAGATIIQGVTIGAGAIIGAGAVVTKDVPDGSVV 193 (197) T ss_pred CCCCCCEEEECCCHHCCCCCCCCCCCEECCCEEECCCEEEEEECCCCCCCEECCCCEECCCCEECCCCEEEEECCCCEEE T ss_conf 65304368987700105315013101066723677830891216298998987998999999999799891256999099 Q ss_pred HHCC Q ss_conf 1074 Q gi|255764481|r 190 NGNP 193 (271) Q Consensus 190 ~G~p 193 (271) .|+| T Consensus 194 ~G~P 197 (197) T cd03360 194 VGNP 197 (197) T ss_pred EECC T ss_conf 9309 No 30 >cd03351 LbH_UDP-GlcNAc_AT UDP-N-acetylglucosamine O-acyltransferase (UDP-GlcNAc acyltransferase): Proteins in this family catalyze the transfer of (R)-3-hydroxymyristic acid from its acyl carrier protein thioester to UDP-GlcNAc. It is the first enzyme in the lipid A biosynthetic pathway and is also referred to as LpxA. Lipid A is essential for the growth of Escherichia coli and related bacteria. It is also essential for maintaining the integrity of the outer membrane. UDP-GlcNAc acyltransferase is a homotrimer of left-handed parallel beta helix (LbH) subunits. Each subunit contains an N-terminal LbH region with 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal alpha-helical region. Probab=99.90 E-value=1e-23 Score=168.80 Aligned_cols=162 Identities=19% Similarity=0.357 Sum_probs=137.9 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECC------------CCEEEEEEECCCCCE Q ss_conf 50579975655108889859988989956559889989988989866868916------------846865531143422 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAG------------KTKIGDFTKVFPMAV 69 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g------------~t~IG~~~~I~~~a~ 69 (271) ++|++++.|.|.|+|.++++||+++.|+|+++|.++++||++|+|+++++|.+ .+.||++|.|.++++ T Consensus 12 A~IG~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~IG~~~~Ire~vt 91 (254) T cd03351 12 AKIGENVEIGPFCVIGPNVEIGDGTVIGSHVVIDGPTTIGKNNRIFPFASIGEAPQDLKYKGEPTRLEIGDNNTIREFVT 91 (254) T ss_pred CEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEECCCCCCCCCCCCCCCCCCCCEEEECCCCEECCCCC T ss_conf 99999799999989999999999999979968919979776525605753477876767379867699799778995330 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 32102333122222233222222212244321111222222122332000016655433100123443223452455321 Q gi|255764481|r 70 LGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIV 149 (271) Q Consensus 70 Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~I 149 (271) |.-.. ........||++|.|-.++.|. ....||+++.+..++.++..|.||++++|+.++.+..+++| T Consensus 92 I~rgt-----~~~~~~T~IG~~~~im~~~HIa-------Hdc~IG~~~iian~~~laGhv~Igd~a~IGg~~~v~q~v~I 159 (254) T cd03351 92 IHRGT-----AQGGGVTRIGNNNLLMAYVHVA-------HDCVIGNNVILANNATLAGHVEIGDYAIIGGLSAVHQFCRI 159 (254) T ss_pred CCCCC-----CCCCCCEEECCCCEEEECCCCC-------CCCCCCCCEEEECCCCCCCCEEECCCCEECCCCEECCCCEE T ss_conf 23674-----3788755978885782025445-------67733797599334455685698998489883677598199 Q ss_pred CCCEEECCCCCCCCCEEEECCCEEECCC Q ss_conf 4860652344433650640474350132 Q gi|255764481|r 150 DDRVVFGGGSAVHQFTRIGKYAFIGGMT 177 (271) Q Consensus 150 gd~~~IG~~~~v~~~v~IG~~a~ig~gs 177 (271) |++++||+++.|.+. |-.++++.+.- T Consensus 160 G~~a~Igags~V~kD--Vpp~~~v~G~P 185 (254) T cd03351 160 GRHAMVGGGSGVVQD--VPPYVIAAGNR 185 (254) T ss_pred CCCEEECCCCEEECC--CCCCCEECCCC T ss_conf 966199767456113--69872651796 No 31 >COG0663 PaaY Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [General function prediction only] Probab=99.90 E-value=6.1e-24 Score=170.22 Aligned_cols=159 Identities=23% Similarity=0.390 Sum_probs=115.0 Q ss_pred CCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCC Q ss_conf 79975655108889859988989956559889989988989866868916846865531143422321023331222222 Q gi|255764481|r 5 GNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGT 84 (271) Q Consensus 5 ~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~ 84 (271) +..+.|||+|+|+|+| +|.|+++||+++.|||+|+|.+| .+ T Consensus 9 ~~~P~i~~~a~Va~~A------------------------------~viGdV~Ig~~vsIw~~aVlRgD---------~~ 49 (176) T COG0663 9 GLSPKIDPTAFVAPSA------------------------------TVIGDVRIGAGVSIWPGAVLRGD---------VE 49 (176) T ss_pred CCCCCCCCCEEECCCC------------------------------EEEEEEEECCCCEECCCEEEECC---------CC T ss_conf 8798789862877999------------------------------79985999999788776389746---------77 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCC Q ss_conf 33222222212244321111222222122332000016655433100123443223452455321486065234443365 Q gi|255764481|r 85 ELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQF 164 (271) Q Consensus 85 ~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~ 164 (271) .+.||+++.|+++|++|... + +.+.||++|.++.++.|.| |+|+|+++||.+++|+.+ T Consensus 50 ~I~IG~~tNIQDg~ViH~~~---~------------------~p~~IG~~vtIGH~aivHG-c~Ig~~~lIGmgA~vldg 107 (176) T COG0663 50 PIRIGARTNIQDGVVIHADP---G------------------YPVTIGDDVTIGHGAVVHG-CTIGDNVLIGMGATVLDG 107 (176) T ss_pred CEEECCCCEECCCEEEECCC---C------------------CCEEECCCCEECCCCEEEE-EEECCCCEEECCCEEECC T ss_conf 56987885603783895279---9------------------9929779958857658988-598788589138667578 Q ss_pred EEEECCCEEECCCCCCC--CCCCCCHHHHCCCCCCC-CCCCHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 06404743501321122--33331000107453233-44201444288899999999999999 Q gi|255764481|r 165 TRIGKYAFIGGMTGVVH--DVIPYGILNGNPGALRG-VNVVAMRRAGFSRDTIHLIRAVYKQI 224 (271) Q Consensus 165 v~IG~~a~ig~gs~V~k--dVpp~~~~~G~pa~~~~-~N~~glkR~g~s~e~i~~l~~ayr~l 224 (271) ++||++|+|||||+|+. .+|+++++.|.|||... ++..-++....+.+.+..+++.|+.- T Consensus 108 a~IG~~~iVgAgalV~~~k~~p~~~L~~G~Pak~~r~l~~~~~~~~~~~a~~Yv~~~~~~~~~ 170 (176) T COG0663 108 AVIGDGSIVGAGALVTPGKEIPGGSLVVGSPAKVVRPLDDEELAWLRENAENYVKLADRYLAG 170 (176) T ss_pred CEECCCCEECCCCCCCCCCCCCCCEEEECCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 698898698668700588377997596237401440288667666667489999988877521 No 32 >PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=99.90 E-value=1.6e-23 Score=167.66 Aligned_cols=161 Identities=20% Similarity=0.330 Sum_probs=135.0 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECC------------CCEEEEEEECCCCCE Q ss_conf 50579975655108889859988989956559889989988989866868916------------846865531143422 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAG------------KTKIGDFTKVFPMAV 69 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g------------~t~IG~~~~I~~~a~ 69 (271) ++|++++.|.|.++|.++++||++++|+|+++|+++++||++|+|+++++|.. .+.||+++.|.++++ T Consensus 13 A~Ig~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~IGd~~~Ire~~t 92 (256) T PRK12461 13 AKLGEGVEIGPFAVIGADVEIGDGTWIMPHAVILGWTRIGKNNKIHQGAVVGDEPQDFGYKGEESHLIIGDNNVIREGVT 92 (256) T ss_pred CEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEEEECCCCCCCCCCCCCCCCCCEEEECCCCEECCCCC T ss_conf 99999999999999999999999999949818919989997439975431176333444158620699899878798755 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 32102333122222233222222212244321111222222122332000016655433100123443223452455321 Q gi|255764481|r 70 LGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIV 149 (271) Q Consensus 70 Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~I 149 (271) |--.. ...+...||++|.|-.++.| .....||++|.+.+++.++-.|.|||+++|+.++.+.++++| T Consensus 93 I~rgt------~~~~~T~IG~~~~im~~vhI-------aHd~~IG~~~iian~~~laG~v~Igd~v~iGg~~~v~q~v~I 159 (256) T PRK12461 93 IHRGT------KGGTVTRIGNDNLLMANSHI-------AHDCQIGNNVILVNGALLAGHVTVGDRAIISGNCLVHQFCRI 159 (256) T ss_pred CCCCC------CCCCCEEECCCCEEECCCCC-------CCCCCCCCCEEEECCEEECCCEEECCCCEECCCCEECCCCEE T ss_conf 67861------48986698878587436433-------464313677067222023242599997299573477179799 Q ss_pred CCCEEECCCCCCCCCEEEECCCEEECCC Q ss_conf 4860652344433650640474350132 Q gi|255764481|r 150 DDRVVFGGGSAVHQFTRIGKYAFIGGMT 177 (271) Q Consensus 150 gd~~~IG~~~~v~~~v~IG~~a~ig~gs 177 (271) |++|.||+++.|.+. |-.|+++.+.- T Consensus 160 G~~a~Ig~gS~V~kD--VPp~~~v~G~p 185 (256) T PRK12461 160 GRLAMMAGGSRISKD--VPPYCMMAGNT 185 (256) T ss_pred CCCCEECCCCEEEEE--CCCCEEEECCC T ss_conf 999699878577753--79983882687 No 33 >cd03353 LbH_GlmU_C N-acetyl-glucosamine-1-phosphate uridyltransferase (GlmU), C-terminal left-handed beta-helix (LbH) acetyltransferase domain: GlmU is also known as UDP-N-acetylglucosamine pyrophosphorylase. It is a bifunctional bacterial enzyme that catalyzes two consecutive steps in the formation of UDP-N-acetylglucosamine (UDP-GlcNAc), an important precursor in bacterial cell wall formation. The two enzymatic activities, uridyltransferase and acetyltransferase, are carried out by two independent domains. The C-terminal LbH domain possesses the acetyltransferase activity. It catalyzes the CoA-dependent acetylation of GlcN-1-phosphate to GlcNAc-1-phosphate. The LbH domain contains 10 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X. The acetyltransferase active site is located at the interface between two subunits of the active LbH trimer. Probab=99.89 E-value=6.3e-23 Score=163.84 Aligned_cols=159 Identities=24% Similarity=0.387 Sum_probs=122.2 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCC Q ss_conf 50579975655108889859988989956559889989988989866868916846865531143422321023331222 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNF 81 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g 81 (271) ++|++++.|+|.|+|+++++||++|.|+|+|+| .+++||++|+|+++++|. ++.||++|.|++++.|... T Consensus 16 v~IG~~v~I~p~~~I~~~v~IG~~~~Ig~~~~I-~~~~IG~~~~I~~~~vi~-~~~ig~~~~IG~~~~i~~~-------- 85 (193) T cd03353 16 VEIGVDVVIDPGVILEGKTVIGEDCVIGPNCVI-KDSTIGDGVVIKASSVIE-GAVIGNGATVGPFAHLRPG-------- 85 (193) T ss_pred CEECCCCEECCCEEEECCCEECCCCEECCCEEE-CCCEECCCCEECCCEECC-CCEECCCCEECCCEEECCC-------- T ss_conf 299999999898299298599999899573457-786888874002100113-5251765266685076587-------- Q ss_pred CCCCCCCCCCCCCCCCCC-----CCCCCCCCC----CCCCCCCCCHHCCCCCCC-------CCCCCCCCCCCCCCCCCCC Q ss_conf 222332222222122443-----211112222----221223320000166554-------3310012344322345245 Q gi|255764481|r 82 VGTELLVGKKCVIREGVT-----INRGTVEYG----GKTIVGDNNFFLANSHVA-------HDCKLGNGIVLSNNVMIAG 145 (271) Q Consensus 82 ~~~~v~IG~~~~Ire~v~-----I~~gt~~~~----~~T~IG~~~~i~~~~hI~-------Hd~~IG~~vii~n~~~l~G 145 (271) +.|++++.|+.++. +.+++..+. +.+.||++|.+++++-++ |.+.|||+|+|+.++.+.+ T Consensus 86 ----~~I~~~~~iG~~v~i~~~~ig~~~~i~h~~~igd~~Ig~~v~Ig~~~i~~n~~g~~~~~t~IGd~v~IG~~~~i~~ 161 (193) T cd03353 86 ----TVLGEGVHIGNFVEIKKSTIGEGSKANHLSYLGDAEIGEGVNIGAGTITCNYDGVNKHRTVIGDNVFIGSNSQLVA 161 (193) T ss_pred ----CEECCEEEECCCEEEECCCCCCCCEECCEEECCCCEECCCCEECCCEEEECCCCCCCCCCEECCCCEECCCCEEEC T ss_conf ----6887516742535764443368848772446188567778888667588734343356234335618988987989 Q ss_pred CCEECCCEEECCCCCCCCCEEEECCCEEECC Q ss_conf 5321486065234443365064047435013 Q gi|255764481|r 146 HVIVDDRVVFGGGSAVHQFTRIGKYAFIGGM 176 (271) Q Consensus 146 ~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~g 176 (271) +++|||+++||+++.|.+.+ -.++|+=+. T Consensus 162 ~v~IG~~a~Igags~V~kdV--pp~~lvvg~ 190 (193) T cd03353 162 PVTIGDGATIAAGSTITKDV--PPGALAIAR 190 (193) T ss_pred CEEECCCCEECCCCEECCCC--CCCCEEEEE T ss_conf 83999998998698888423--999789962 No 34 >cd04646 LbH_Dynactin_6 Dynactin 6 (or subunit p27): Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p27 is part of the pointed-end subcomplex in dynactin that also includes p25, p26, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain the imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.88 E-value=3.5e-23 Score=165.44 Aligned_cols=94 Identities=22% Similarity=0.276 Sum_probs=63.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCC Q ss_conf 3322222221224432111122---2222122332000016655433100123443223452455321486065234443 Q gi|255764481|r 85 ELLVGKKCVIREGVTINRGTVE---YGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAV 161 (271) Q Consensus 85 ~v~IG~~~~Ire~v~I~~gt~~---~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v 161 (271) .++||++|.|+|+++|+..... .+..+.||+++ +++|+|.|. .++|||+++||.++.| T Consensus 38 ~I~IG~~~nIqd~~~I~~~~~~~~~~g~~~~IG~~~------~i~~g~~ih-------------~~~IG~~~lIg~~a~V 98 (164) T cd04646 38 PIIIGENNIIEEQVTIVNKKPKDPAEPKPMIIGSNN------VFEVGCKCE-------------ALKIGNNNVFESKSFV 98 (164) T ss_pred CEEECCCCEECCCEEEECCCCCCCCCCCCEEECCCC------EECCCEEEE-------------CCEECCCCEECCCCEE T ss_conf 889899966669859941787777899887988998------886781995-------------5597678889689899 Q ss_pred CCCEEEECCCEEECCCCCCCC--CCCCCHHHHCCCCCC Q ss_conf 365064047435013211223--333100010745323 Q gi|255764481|r 162 HQFTRIGKYAFIGGMTGVVHD--VIPYGILNGNPGALR 197 (271) Q Consensus 162 ~~~v~IG~~a~ig~gs~V~kd--Vpp~~~~~G~pa~~~ 197 (271) +.+++||++|+|||+|.|+.+ +||++++.|+||+.+ T Consensus 99 ~~ga~Ig~~ciIgags~V~~~~~ip~~slv~G~pa~~~ 136 (164) T cd04646 99 GKNVIITDGCIIGAGCKLPSSEILPENTVIYGADCLRR 136 (164) T ss_pred CCCCEECCCCEECCCCCCCCCCEECCCEEEECCCEEEE T ss_conf 99989999989998989289979299859978874781 No 35 >cd03359 LbH_Dynactin_5 Dynactin 5 (or subunit p25); Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p25 is part of the pointed-end subcomplex in dynactin that also includes p26, p27, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.88 E-value=2.5e-22 Score=160.14 Aligned_cols=150 Identities=19% Similarity=0.325 Sum_probs=96.2 Q ss_pred CCEECCCCEECC--CCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHC Q ss_conf 989866868916--846865531143422321023331222222332222222122443211112222221223320000 Q gi|255764481|r 43 GVELISHCVVAG--KTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFL 120 (271) Q Consensus 43 ~~~I~~~v~I~g--~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~ 120 (271) +.+|..++.|.| +++||++|.||++|+|.++ ...+.||++|.|+++++|+....... .+. T Consensus 7 gn~I~k~a~I~G~~nI~IG~~ssI~~~avirGD---------~~~I~IG~~~~I~d~~~I~~~~~~~~-~~~-------- 68 (161) T cd03359 7 GNKVSRKSVICGSQNIVLNGKTIIQSDVIIRGD---------LATVSIGRYCILSEGCVIRPPFKKFS-KGV-------- 68 (161) T ss_pred CCEECCCCEEECCCCEEECCCCEECCCCEEECC---------CCCEEECCCCEECCCCEEECCCCCCC-CCC-------- T ss_conf 995877629978877599998789999689678---------86549999858889989924644357-788-------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCC--CCCCCHHHHCCCCCCC Q ss_conf 16655433100123443223452455321486065234443365064047435013211223--3331000107453233 Q gi|255764481|r 121 ANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD--VIPYGILNGNPGALRG 198 (271) Q Consensus 121 ~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd--Vpp~~~~~G~pa~~~~ 198 (271) ..+++.||++|+++.++.+.+ ++|+++++||.++.+.+++.||++|+|+++|+|..+ ||||++++|+|||+. T Consensus 69 ----~~~p~~IGd~v~Ig~~~~i~~-~~Ig~~v~IG~~~~I~~g~~Ig~~~~I~~gsvV~~~~~ip~~~~~~G~Pak~i- 142 (161) T cd03359 69 ----AFFPLHIGDYVFIGENCVVNA-AQIGSYVHIGKNCVIGRRCIIKDCVKILDGTVVPPDTVIPPYSVVSGRPARFI- 142 (161) T ss_pred ----CCCCCEECCCEEECCCEEEEC-CEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEECCEEEE- T ss_conf ----887728489879899828848-89667715579819949979999989988989889989899819985180895- Q ss_pred CCCCHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 44201444288899999999999999 Q gi|255764481|r 199 VNVVAMRRAGFSRDTIHLIRAVYKQI 224 (271) Q Consensus 199 ~N~~glkR~g~s~e~i~~l~~ayr~l 224 (271) +-+++.+...|++.-+.+ T Consensus 143 --------~~l~e~~~~~~~~~~~~y 160 (161) T cd03359 143 --------GELPECTQELMEEETKEY 160 (161) T ss_pred --------EECCHHHHHHHHHHHHHC T ss_conf --------769999999999989862 No 36 >cd00710 LbH_gamma_CA Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three distinct groups of carbonic anhydrases - alpha, beta and gamma - which show no significant sequence identity or structural similarity. Gamma CAs are homotrimeric enzymes, with each subunit containing a left-handed parallel beta helix (LbH) structural domain. Probab=99.86 E-value=4.4e-22 Score=158.58 Aligned_cols=133 Identities=22% Similarity=0.326 Sum_probs=88.4 Q ss_pred CCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCC Q ss_conf 86891684686553114342232102333122222233222222212244321111222222122332000016655433 Q gi|255764481|r 49 HCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHD 128 (271) Q Consensus 49 ~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd 128 (271) +++|.|+++||++|.|||+|+|.++ +...+.||+++.|+++++||.. . ++. T Consensus 14 ~A~viGdV~ig~~~sIw~~aviRgD--------~~~~I~IG~~~nIqD~~viH~~---~------------------~~~ 64 (167) T cd00710 14 TAVVIGDVIIGDNVFVGPGASIRAD--------EGTPIIIGANVNIQDGVVIHAL---E------------------GYS 64 (167) T ss_pred CCEEEEEEEECCCCEECCCCEEECC--------CCCCEEECCCCCCCCCEEEECC---C------------------CCC T ss_conf 9989970899999999888789478--------8887698887434777078225---7------------------667 Q ss_pred CCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCC-CCCCCCHH-------HHCCCCCCCCC Q ss_conf 10012344322345245532148606523444336506404743501321122-33331000-------10745323344 Q gi|255764481|r 129 CKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVH-DVIPYGIL-------NGNPGALRGVN 200 (271) Q Consensus 129 ~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~k-dVpp~~~~-------~G~pa~~~~~N 200 (271) +.||++|.+++++.|.|+|+|||+++||.+++++. ++||++|+||++|+|.. .+||..++ .|.||+. T Consensus 65 ~~IG~~vtIgh~a~ihG~c~Igd~~~IG~gaiv~~-a~Ig~~~vIgags~V~g~~i~~g~~v~~G~~v~~~~pa~~---- 139 (167) T cd00710 65 VWIGKNVSIAHGAIVHGPAYIGDNCFIGFRSVVFN-AKVGDNCVIGHNAVVDGVEIPPGRYVPAGAVITSQTQADA---- 139 (167) T ss_pred EEECCCCEECCCCEECCEEEECCCCEECCCCEEEC-CEECCCCEECCCCEEECCCCCCCCEECCCCEEECCCCHHH---- T ss_conf 49779828989848812049839988988989982-5976998998898884778399989558828836996697---- Q ss_pred CCHHHHCCCCHHHHHHHHHH Q ss_conf 20144428889999999999 Q gi|255764481|r 201 VVAMRRAGFSRDTIHLIRAV 220 (271) Q Consensus 201 ~~glkR~g~s~e~i~~l~~a 220 (271) -+.++++++..+++. T Consensus 140 -----~~~lt~~~~~~~~~~ 154 (167) T cd00710 140 -----LPDVTDSAREFNEKV 154 (167) T ss_pred -----CCCCCHHHHHHHHHH T ss_conf -----337999999999999 No 37 >TIGR01852 lipid_A_lpxA acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase; InterPro: IPR010137 This entry describes LpxA, an enzyme for the biosynthesis of lipid A, a component of lipopolysaccharide (LPS) in the outer membrane outer leaflet of most Gram-negative bacteria. Some differences are found between lipid A of different species, but this protein represents the first step (from UDP-N-acetyl-D-glucosamine) and appears to be conserved in function. Proteins from this family contain many copies of the bacterial transferase hexapeptide repeat.; GO: 0008780 acyl-[acyl-carrier-protein]-UDP-N-acetylglucosamine O-acyltransferase activity, 0008610 lipid biosynthetic process. Probab=99.86 E-value=9.3e-22 Score=156.54 Aligned_cols=227 Identities=19% Similarity=0.326 Sum_probs=151.1 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECC------------CCEEEEEEECCCCCE Q ss_conf 50579975655108889859988989956559889989988989866868916------------846865531143422 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAG------------KTKIGDFTKVFPMAV 69 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g------------~t~IG~~~~I~~~a~ 69 (271) |+|+++..|-|.|+|.++|+||+||+|.++++|...++||+||+|.|+++|.+ .+.||++|+|-.|++ T Consensus 11 A~iG~~V~iGPf~~v~~~V~ig~g~~l~shV~i~G~T~iG~~~~i~p~A~iG~~PQdlky~GE~t~l~IG~~n~IRE~VT 90 (257) T TIGR01852 11 AEIGEDVEIGPFCIVGPGVKIGDGVELKSHVVIEGHTTIGEGTKIFPGAVIGGEPQDLKYKGERTELIIGDNNTIREFVT 90 (257) T ss_pred CEECCCCEEEEEEEECCCCEECCCCEECCCEEEECCEEECCCCEEECCEEECCCCCCCEECCCCCEEEECCCCEEEEEEE T ss_conf 68289857801888789758868858757368967708789988807607679885650148701799889656975578 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 32102333122222233222222212244321111222222122332000016655433100123443223452455321 Q gi|255764481|r 70 LGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIV 149 (271) Q Consensus 70 Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~I 149 (271) |-... ......+.|||||.|-.|+.|- ..=+|||+|-|-.++.+|-.+.|||+++|+-.+.+..-|+| T Consensus 91 i~~GT-----~~g~g~T~iG~~nllMAysHvA-------HDC~vGn~vv~aN~a~LAGHV~vgD~a~iGG~~avHQFvRI 158 (257) T TIGR01852 91 INRGT-----KSGGGVTSIGNNNLLMAYSHVA-------HDCVVGNHVVLANNATLAGHVEVGDYAIIGGLVAVHQFVRI 158 (257) T ss_pred ECCCC-----CCCCCEEEECCCCEECCCCEEE-------ECCEECCEEEEECCCEECEEEEECCEEEECCCCCEEEEEEE T ss_conf 50662-----5888889988876572053561-------24368881698136312227998889997789721101330 Q ss_pred CCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 48606523444336506404743501321122333310001074532334420144428889999999999999972799 Q gi|255764481|r 150 DDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGD 229 (271) Q Consensus 150 gd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~ 229 (271) |.+|.|||.|.|-+- |=-|+|+.+.|. ....| =-+.||.+-|+.|.-. ..-.+++|..||---.-.. T Consensus 159 G~~aMigG~s~v~~D--vpPY~~~~G~~~--------a~l~G--lN~vGLrR~Gf~~~~i-~~i~~ayr~lfr~~~~~~~ 225 (257) T TIGR01852 159 GRYAMIGGLSAVSKD--VPPYGLVEGNSR--------AVLRG--LNIVGLRRRGFDREEI-TAIKKAYRLLFRSGLPLQE 225 (257) T ss_pred HHHHHHHHHCCCCCC--CCCCEEECCCCC--------CEEEE--EEEEEEECCCCCHHHH-HHHHHHHHHHHCCCCCHHH T ss_conf 002322120024678--776178637875--------33788--7013221378798999-9999998876179989899 Q ss_pred CHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 899999999964258989999999 Q gi|255764481|r 230 SIYKNAGAIREQNVSCPEVSDIIN 253 (271) Q Consensus 230 ~l~e~l~~l~~~~~~~~~v~~~~~ 253 (271) .+++.+|+-.++.++..++=+|+. T Consensus 226 ~~~~v~e~P~~~~~~v~~~~~FI~ 249 (257) T TIGR01852 226 ALQQVAEEPYEDNPEVKEIVDFIR 249 (257) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHC T ss_conf 999998351337757999999860 No 38 >cd03349 LbH_XAT Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. Members of this class of enzymes include Enterococcus faecium streptogramin A acetyltransferase and Pseudomonas aeruginosa chloramphenicol acetyltransferase. They contain repeated copies of a six-residue hexapeptide repeat sequence motif (X-[STAV]-X-[LIV]-[GAED]-X) and adopt a left-handed parallel beta helix (LbH) structure. The active enzyme is a trimer with CoA and substrate binding sites at the interface of two separate LbH subunits. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Probab=99.85 E-value=3.2e-21 Score=153.17 Aligned_cols=68 Identities=35% Similarity=0.537 Sum_probs=62.9 Q ss_pred CCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHH Q ss_conf 5532148606523444336506404743501321122333310001074532334420144428889999999999 Q gi|255764481|r 145 GHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAV 220 (271) Q Consensus 145 G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~a 220 (271) +.++|||+||||++++|+++++||++|+|||+|+|+||||||++++|+|||+. |..|+++.+..|.+. T Consensus 72 ~~i~Ig~~vwIG~n~~Il~GV~IG~gavIgAgSvVtkdvp~~~i~~G~PAk~i--------k~Rf~~~~~~~l~~~ 139 (145) T cd03349 72 GDVIIGNDVWIGHGATILPGVTIGDGAVIAAGAVVTKDVPPYAIVGGNPAKVI--------RYRFDEETIERLLAL 139 (145) T ss_pred CCEEECCCEEECCCCEEECCCEECCCCEEECCCEECCCCCCCCEEEEECCEEE--------ECCCCHHHHHHHHHC T ss_conf 99798899699998889588096788689289789704699949983388788--------155999999999876 No 39 >cd04646 LbH_Dynactin_6 Dynactin 6 (or subunit p27): Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p27 is part of the pointed-end subcomplex in dynactin that also includes p25, p26, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain the imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.85 E-value=1.3e-21 Score=155.69 Aligned_cols=157 Identities=16% Similarity=0.214 Sum_probs=105.0 Q ss_pred EECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEC---CCCEEEEEEECCCCCEEEECCCCCCCCCCCCC Q ss_conf 565510888985998898995655988998998898986686891---68468655311434223210233312222223 Q gi|255764481|r 9 IIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVA---GKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTE 85 (271) Q Consensus 9 ~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~---g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~ 85 (271) .|||+|+|+|.|.|-.+|+ ||++|.|+|+++|+ +..+||++|.|..+++|...++ .+.+++.. T Consensus 1 kI~~~a~I~~~A~iiGdV~------------IG~~~sI~p~avIrgd~g~I~IG~~~nIqd~~~I~~~~~--~~~~~g~~ 66 (164) T cd04646 1 KIAPGAVVCQESEIRGDVT------------IGPGTVVHPRATIIAEAGPIIIGENNIIEEQVTIVNKKP--KDPAEPKP 66 (164) T ss_pred CCCCCCEECCCCEEEEEEE------------ECCCCEECCCCEEEECCCCEEECCCCEECCCEEEECCCC--CCCCCCCC T ss_conf 9489879999999982179------------999998995759981437889899966669859941787--77789988 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCE Q ss_conf 32222222122443211112222221223320000166554331001234432234524553214860652344433650 Q gi|255764481|r 86 LLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFT 165 (271) Q Consensus 86 v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v 165 (271) +.||++++|+++|++|. ..||++|+|+.+++|.++|+||++|++++++.+.++.++.|++.+.|.....+.. T Consensus 67 ~~IG~~~~i~~g~~ih~--------~~IG~~~lIg~~a~V~~ga~Ig~~ciIgags~V~~~~~ip~~slv~G~pa~~~~~ 138 (164) T cd04646 67 MIIGSNNVFEVGCKCEA--------LKIGNNNVFESKSFVGKNVIITDGCIIGAGCKLPSSEILPENTVIYGADCLRRTQ 138 (164) T ss_pred EEECCCCEECCCEEEEC--------CEECCCCEECCCCEECCCCEECCCCEECCCCCCCCCCEECCCEEEECCCEEEEEC T ss_conf 79889988867819955--------5976788896898999998999998999898928997929985997887478121 Q ss_pred EEECCCEEECCCCCCCCCCCCC Q ss_conf 6404743501321122333310 Q gi|255764481|r 166 RIGKYAFIGGMTGVVHDVIPYG 187 (271) Q Consensus 166 ~IG~~a~ig~gs~V~kdVpp~~ 187 (271) .--..++-.-...+.|.+|-|. T Consensus 139 ~~~~~~~~~~~~~l~~~~p~y~ 160 (164) T cd04646 139 TDRPKPQTLQLDFLRKILPNYH 160 (164) T ss_pred CCCCCCHHHHHHHHHHHCHHHH T ss_conf 5676312689999987640122 No 40 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=99.83 E-value=1.8e-20 Score=148.44 Aligned_cols=146 Identities=27% Similarity=0.418 Sum_probs=117.2 Q ss_pred EECCC-EEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCC Q ss_conf 56551-08889859988989956559889989988989866868916846865531143422321023331222222332 Q gi|255764481|r 9 IIHPL-ALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELL 87 (271) Q Consensus 9 ~Ihps-A~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~ 87 (271) +.+|. -+|+++++||.||+|.|+++|+++|+||+||+|||+|+|. |+.||+++.|.|+|++.. .. T Consensus 260 ~~dP~~~~i~~~v~~G~Dv~I~~~v~leG~v~iG~~v~IGp~~~I~-ns~I~~~~~I~~~s~~e~-------------~~ 325 (461) T TIGR01173 260 LIDPARTDIRGTVEIGQDVVIDPNVILEGKVQIGDDVVIGPGCVIK-NSVIGSNAVIKPYSVLEG-------------SE 325 (461) T ss_pred EECCCEEEECCCEEECCEEEECCCCEECCEEEECCCEEECCCCEEE-EEEECCCCEEEEEEECCC-------------CE T ss_conf 9868337873547987716975583883407987870788985899-728868855888884146-------------78 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCC-----CCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCC Q ss_conf 2222221224432111122222212233-----20000166554331001234432234524553214860652344433 Q gi|255764481|r 88 VGKKCVIREGVTINRGTVEYGGKTIVGD-----NNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVH 162 (271) Q Consensus 88 IG~~~~Ire~v~I~~gt~~~~~~T~IG~-----~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~ 162 (271) ||++|.+++|+.+-++|... ..++||| +..|+.+|...|=++|| .++||.++.||++|..- T Consensus 326 ig~~~~vGPFArLRP~~~L~-~~~hiGNFVE~Kna~iG~gsKA~HLsYlG-------------DAeiG~~vNiGAGtITc 391 (461) T TIGR01173 326 IGEGCDVGPFARLRPGSVLG-AGVHIGNFVEVKNARIGEGSKAGHLSYLG-------------DAEIGSNVNIGAGTITC 391 (461) T ss_pred ECCCCEECCCCCCCCCCHHH-CCCEEEEEEEEECCEECCCCCCCCCCEEE-------------EEEECCCCEECCEEEEE T ss_conf 61564027730018643211-77816025865247618864423143332-------------02507960032316899 Q ss_pred CC-------EEEECCCEEECCCCCCCC Q ss_conf 65-------064047435013211223 Q gi|255764481|r 163 QF-------TRIGKYAFIGGMTGVVHD 182 (271) Q Consensus 163 ~~-------v~IG~~a~ig~gs~V~kd 182 (271) .+ +.||+++|||..|...-- T Consensus 392 NYDG~nK~~T~IGd~VFiGSnt~LVAP 418 (461) T TIGR01173 392 NYDGVNKHKTIIGDGVFIGSNTQLVAP 418 (461) T ss_pred ECCCCCCCCCEECCCCEECCCCEEECC T ss_conf 327962203486488787767604554 No 41 >cd03357 LbH_MAT_GAT Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively at the C6 position of the nonreducing end glucosyl moiety. GAT specifically acetylates galactopyranosides. Furthermore, MAT shows higher affinity toward artificial substrates containing an alkyl or hydrophobic chain as well as a glucosyl unit. Active MAT and GAT are homotrimers, with each subunit consisting of an N-terminal alpha-helical region and a C-terminal left-handed parallel alpha-helix (LbH) subdomain with 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=99.82 E-value=9.9e-21 Score=150.13 Aligned_cols=56 Identities=27% Similarity=0.382 Sum_probs=52.1 Q ss_pred CCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCC Q ss_conf 52455321486065234443365064047435013211223333100010745323 Q gi|255764481|r 142 MIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALR 197 (271) Q Consensus 142 ~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~ 197 (271) ..+..++|||+||||++++|.++++||++|+|||||+|+|||||+++++|+|||+. T Consensus 114 ~~~~PI~Igd~vWIG~~~~IlpGVtIG~~~vIgAGsVVtkdip~~~v~~GnPArvi 169 (169) T cd03357 114 EYAKPITIGDNVWIGGGVIILPGVTIGDNSVIGAGSVVTKDIPANVVAAGNPARVI 169 (169) T ss_pred CCCCCEEECCCCEECCCCEEECCCEECCCCEECCCCEECCCCCCCEEEEEECCEEC T ss_conf 52455698974362899789099799999999889879746599919997586989 No 42 >PRK10502 putative colanic acid biosynthesis acetyltransferase WcaF; Provisional Probab=99.82 E-value=4.2e-20 Score=146.17 Aligned_cols=122 Identities=20% Similarity=0.256 Sum_probs=81.5 Q ss_pred CCEECCCCEECCCCEE--CCCCEECCCCEECCCCEEC--CCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8599889899565598--8998998898986686891--68468655311434223210233312222223322222221 Q gi|255764481|r 19 GAVIGPNSLIGPFCCV--GSEVEIGAGVELISHCVVA--GKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVI 94 (271) Q Consensus 19 ~a~Ig~~v~Igp~~~I--~~~v~IG~~~~I~~~v~I~--g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~I 94 (271) +|+||++|.|.|.+.| .-+.+|||++.|+++|.|. +..+||++|.|.+++.|. T Consensus 52 GAkIG~~v~I~p~v~I~~PwnL~IGd~~~Ig~~v~i~~~~~I~IG~~v~Isq~v~l~----------------------- 108 (179) T PRK10502 52 GAKIGKGVVIRPSVRITYPWKLTIGDYAWIGDDVVLYTLGEITIGAHCVISQKSYLC----------------------- 108 (179) T ss_pred CCEECCCCEECCCCEEEECCCEEECCCCEECCCCEEEECCCCEECCCEEECCCCEEE----------------------- T ss_conf 886799988999657950430798896578898189848961798841277971896----------------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEE Q ss_conf 22443211112222221223320000166554331001234432234524553214860652344433650640474350 Q gi|255764481|r 95 REGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIG 174 (271) Q Consensus 95 re~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig 174 (271) .++. |-.-.+ --.....++|||+||||+++.|.++++||++|+|| T Consensus 109 -------t~sH----------------------d~~~~~------~~l~~~pI~Igd~~wIga~a~I~pGv~Ig~gavig 153 (179) T PRK10502 109 -------TGSH----------------------DYSSPH------FDLNAAPIVIGEGCWLATDVFVAPGVTIGDGAVVG 153 (179) T ss_pred -------CCCC----------------------CCCCCC------CCCCCCCCEECCCEEECCCCEEECCCEECCCCEEC T ss_conf -------3889----------------------976766------67324885858996999996990889999997995 Q ss_pred CCCCCCCCCCCCCHHHHCCCCCCC Q ss_conf 132112233331000107453233 Q gi|255764481|r 175 GMTGVVHDVIPYGILNGNPGALRG 198 (271) Q Consensus 175 ~gs~V~kdVpp~~~~~G~pa~~~~ 198 (271) |+|+|+||+||++++.|+|||..+ T Consensus 154 A~SvV~kdv~~~~i~~G~PA~~ik 177 (179) T PRK10502 154 ARSSVFKSLPANTVCRGNPAVVIR 177 (179) T ss_pred CCCEEEEECCCCCEEEECCCEEEE T ss_conf 499892177999699931847871 No 43 >PRK09527 lacA galactoside O-acetyltransferase; Reviewed Probab=99.82 E-value=2.2e-20 Score=147.93 Aligned_cols=124 Identities=19% Similarity=0.307 Sum_probs=81.8 Q ss_pred EECCCCEECC--CCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9988989956--55988998998898986686891684686553114342232102333122222233222222212244 Q gi|255764481|r 21 VIGPNSLIGP--FCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGV 98 (271) Q Consensus 21 ~Ig~~v~Igp--~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v 98 (271) .+|+++.|.| +|--|-|..||+++.+..+++|... ..+.|||++.|+.+| T Consensus 57 ~~g~~~~IepPF~cdYG~NI~iG~~~fiN~n~~ilD~----------------------------~~I~IGd~v~iGPnV 108 (203) T PRK09527 57 TVGENAWVEPPVYFSYGSNIHIGRNFYANFNLTIVDD----------------------------YTVTIGDNVLIAPNV 108 (203) T ss_pred CCCCCCEEECCEEEECCCCEEECCCEEECCCCEEEEC----------------------------CCEEECCCEEECCCC T ss_conf 2599978969978725479697798798688389957----------------------------637998972886997 Q ss_pred CCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCC Q ss_conf 32111122222212233200001665543310012344322345245532148606523444336506404743501321 Q gi|255764481|r 99 TINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTG 178 (271) Q Consensus 99 ~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~ 178 (271) +|...+...... -+ ..+...+..++|||+||||++++|.++++||++|+|||||+ T Consensus 109 ~i~t~~Hp~~~~------------------~R-------~~~~~~~~pi~Ig~~vwIG~~~~I~pGv~IG~~~vigAgsv 163 (203) T PRK09527 109 TLSVTGHPVHHE------------------LR-------KNGEMYSFPITIGNNVWIGSHVVINPGVTIGDNSVIGAGSV 163 (203) T ss_pred EEECCCCCCCHH------------------HH-------HCCCCCCCCEEECCEEEECCCCEECCCCEECCCCEECCCCE T ss_conf 893698988988------------------97-------33876688859998689899999909929999979914988 Q ss_pred CCCCCCCCCHHHHCCCCCC Q ss_conf 1223333100010745323 Q gi|255764481|r 179 VVHDVIPYGILNGNPGALR 197 (271) Q Consensus 179 V~kdVpp~~~~~G~pa~~~ 197 (271) |+|||||+++++|+|||+. T Consensus 164 Vtkdip~~~v~~G~Parvi 182 (203) T PRK09527 164 VTKDIPPNVVAAGVPCRVI 182 (203) T ss_pred ECCCCCCCCEEEEECCEEE T ss_conf 8530699829996282889 No 44 >PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional Probab=99.82 E-value=4.9e-20 Score=145.79 Aligned_cols=143 Identities=23% Similarity=0.319 Sum_probs=109.6 Q ss_pred EECC-CEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCC Q ss_conf 5655-108889859988989956559889989988989866868916846865531143422321023331222222332 Q gi|255764481|r 9 IIHP-LALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELL 87 (271) Q Consensus 9 ~Ihp-sA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~ 87 (271) +++| +.+|++.+++|++|+|+|+++|.++|+||+||+|+++++|. ++.||++|.|.++++|.+ .. T Consensus 254 i~dP~~~~I~~~v~ig~dv~I~~nvvi~G~v~IG~~v~Ig~g~~I~-ns~Ig~~~~I~~~S~Ie~-------------s~ 319 (456) T PRK09451 254 LRDPARFDLRGTLTHGRDVEIDTNVIIEGNVTLGHRVKIGAGCVLK-NCVIGDDCEISPYSVVED-------------AN 319 (456) T ss_pred EECCCEEEEECCEEECCCEEECCCEEEECCEEECCCEEECCCEEEE-CCEECCCCEEECCEEECC-------------CE T ss_conf 8158807770328965750775770884424757975982407985-769846878942256506-------------63 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCC-----CCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCC Q ss_conf 2222221224432111122222212233-----20000166554331001234432234524553214860652344433 Q gi|255764481|r 88 VGKKCVIREGVTINRGTVEYGGKTIVGD-----NNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVH 162 (271) Q Consensus 88 IG~~~~Ire~v~I~~gt~~~~~~T~IG~-----~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~ 162 (271) ||++|.|+.|+.+-++|... ..++||+ +..||.++.+.|=..|| ..+||+++.||+++..- T Consensus 320 Ig~~~~IGPfA~lRp~t~i~-~~~~iGnfvEiK~s~i~~g~k~~HlsYiG-------------Da~iG~~~NiGAGtit~ 385 (456) T PRK09451 320 LGAACTIGPFARLRPGAELL-EGAHVGNFVEMKKARLGKGSKAGHLTYLG-------------DAEIGDNVNIGAGTITC 385 (456) T ss_pred ECCCCEECCCCCCCCCCEEC-CCCEEEEEEEEECCEECCCCEECCEEEEC-------------CCEECCCCEECCCEEEE T ss_conf 43671688864348876236-78888228998245975897704233661-------------44765886888876997 Q ss_pred C-------CEEEECCCEEECCCCC Q ss_conf 6-------5064047435013211 Q gi|255764481|r 163 Q-------FTRIGKYAFIGGMTGV 179 (271) Q Consensus 163 ~-------~v~IG~~a~ig~gs~V 179 (271) . .+.||++|+||.++.+ T Consensus 386 NyDG~~K~~t~igd~~fiGsn~~l 409 (456) T PRK09451 386 NYDGANKFKTIIGDDVFVGSDTQL 409 (456) T ss_pred ECCCCCCCCCEECCCCEECCCCEE T ss_conf 224876656487898299878448 No 45 >cd04647 LbH_MAT_like Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins. MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively while GAT specifically acetylates galactopyranosides. XAT catalyzes the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Members of this family contain a a left-handed parallel beta-helix (LbH) domain with at least 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). They are trimeric in their active form. Probab=99.81 E-value=6.1e-21 Score=151.46 Aligned_cols=85 Identities=24% Similarity=0.368 Sum_probs=66.4 Q ss_pred CCCCCHHCCCCCC-CCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHH Q ss_conf 2332000016655-433100123443223452455321486065234443365064047435013211223333100010 Q gi|255764481|r 113 VGDNNFFLANSHV-AHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNG 191 (271) Q Consensus 113 IG~~~~i~~~~hI-~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G 191 (271) ||++|+|+.+++| +|+..+++......+..+.++++|||++|||.++.|+++++||++|+|||+|+|+||+||++++.| T Consensus 24 IG~~~~I~~~~~i~~~~~~~~~~~~~~~~~~~~~~v~Ig~~v~IG~~~~I~~gv~Ig~~~iIga~SvV~k~i~~~~i~~G 103 (109) T cd04647 24 IGDNVLIGPNVTIYDHNHDIDDPERPIEQGVTSAPIVIGDDVWIGANVVILPGVTIGDGAVVGAGSVVTKDVPPNSIVAG 103 (109) T ss_pred ECCCCEECCEEEEEEEEEECCCCCEECCCCCEECCEEEECCCEECCCEEEEECCEECCCCEECCCCEECCCCCCCCEEEE T ss_conf 99984993508983300132887525267637189799423578898399285130898699379889148399989997 Q ss_pred CCCCCC Q ss_conf 745323 Q gi|255764481|r 192 NPGALR 197 (271) Q Consensus 192 ~pa~~~ 197 (271) +|||+. T Consensus 104 ~PAk~i 109 (109) T cd04647 104 NPAKVI 109 (109) T ss_pred ECCEEC T ss_conf 586999 No 46 >PRK09677 putative lipopolysaccharide biosynthesis O-acetyl transferase WbbJ; Provisional Probab=99.81 E-value=2.9e-20 Score=147.18 Aligned_cols=63 Identities=17% Similarity=0.328 Sum_probs=56.2 Q ss_pred CCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHH Q ss_conf 2455321486065234443365064047435013211223333100010745323344201444288899 Q gi|255764481|r 143 IAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRD 212 (271) Q Consensus 143 l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e 212 (271) -...++|||+||||++++|.++++||++|+|||+|+|++|+||+++++|+|||+. |+..|+.+ T Consensus 127 ~~~pv~Ig~~~wIg~~~~Il~Gv~Ig~~~vvgA~SvV~~dvp~~~i~~G~PAk~I-------k~~~~~~~ 189 (192) T PRK09677 127 ESSAVVIGQRVWIGENVTILPGVSIGNGCIVGANSVVTKSIPENTVIAGNPAKII-------KKYNHETK 189 (192) T ss_pred CCCCEEECCCCEECCCCEEECCCEECCCCEECCCCEECCCCCCCEEEEEECCEEE-------EECCCCCC T ss_conf 7599898999899999999199699999899169999858599919997083889-------71275223 No 47 >COG1043 LpxA Acyl-[acyl carrier protein] Probab=99.81 E-value=6.3e-20 Score=145.09 Aligned_cols=162 Identities=18% Similarity=0.304 Sum_probs=94.1 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECC------------CCEEEEEEECCCCCE Q ss_conf 50579975655108889859988989956559889989988989866868916------------846865531143422 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAG------------KTKIGDFTKVFPMAV 69 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g------------~t~IG~~~~I~~~a~ 69 (271) ++|+.+..|-|.++|.|+++|+++++|+++++|...++||++++|.|++.|.+ .+.||++|.|-.+++ T Consensus 16 A~ig~~V~IGpf~iIg~~V~ig~~t~l~shvvv~G~T~IG~~n~I~~~A~iG~~pQdlKykge~T~l~IG~~n~IRE~vT 95 (260) T COG1043 16 AEIGEDVKIGPFCIIGPNVEIGDGTVLKSHVVVEGHTTIGRNNRIFPFASIGEDPQDLKYKGEPTRLIIGDNNTIREFVT 95 (260) T ss_pred CCCCCCCEECCEEEECCCCEECCCCEECCCEEEECCCEECCCCEEECCCCCCCCCCCCCCCCCCEEEEECCCCEEEEEEE T ss_conf 88289988883379889719889958834379967707778987911104289875450279842799889975866889 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 32102333122222233222222212244321111222222122332000016655433100123443223452455321 Q gi|255764481|r 70 LGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIV 149 (271) Q Consensus 70 Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~I 149 (271) |-... ........|||||.|-.++.|- ..-+|||+|.|-.++.+|-.+.|||+++++--+.+..-|+| T Consensus 96 i~~GT-----~~g~g~T~IGdnnl~May~HVA-------HDC~iGn~~ilaNnatLAGHV~igD~aiiGG~saVHQF~rI 163 (260) T COG1043 96 IHRGT-----VQGGGVTRIGDNNLIMAYAHVA-------HDCVIGNNCILANNATLAGHVEVGDYAIIGGLSAVHQFVRI 163 (260) T ss_pred EECCC-----CCCCEEEEECCCCEEEEEEEEE-------CCCEECCCEEEECCCEEECEEEECCEEEECCCCEEEEEEEE T ss_conf 86154-----4786159977887898731031-------34445671799668668540788778997475237778897 Q ss_pred CCCEEECCCCCCCCCEEEECCCEEECCC Q ss_conf 4860652344433650640474350132 Q gi|255764481|r 150 DDRVVFGGGSAVHQFTRIGKYAFIGGMT 177 (271) Q Consensus 150 gd~~~IG~~~~v~~~v~IG~~a~ig~gs 177 (271) |++++||+.+.|.|.+ =.|+|+.++- T Consensus 164 G~~amiGg~S~v~~DV--pPy~~~~Gn~ 189 (260) T COG1043 164 GAHAMIGGLSAVSQDV--PPYVIASGNH 189 (260) T ss_pred CCHHEECCCHHHCCCC--CCEEEECCCC T ss_conf 4011234410103677--9838852870 No 48 >cd04645 LbH_gamma_CA_like Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E. coli paaY protein, and similar proteins. CAs are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionary distinct groups - alpha, beta and gamma carbonic anhydrases - which show no significant sequence identity or structural similarity. Gamma CAs are trimeric enzymes with left-handed parallel beta helix (LbH) structural domain. Probab=99.81 E-value=1.4e-19 Score=142.89 Aligned_cols=134 Identities=19% Similarity=0.323 Sum_probs=99.9 Q ss_pred CCCCCCEECCCEEECCCCEECCCCEECCCCEEC---CCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCC Q ss_conf 057997565510888985998898995655988---99899889898668689168468655311434223210233312 Q gi|255764481|r 3 RMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVG---SEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYH 79 (271) Q Consensus 3 ~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~---~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~ 79 (271) +|+.+++|||+|.|..+++||+++.|+|+|+|. ..+.||++|.|+++++|.. . T Consensus 1 ~I~~~a~I~p~A~I~G~V~IG~~~~I~~~~~I~~d~~~i~IG~~~~I~~~~~Ih~------------------~------ 56 (153) T cd04645 1 EIDPSAFIAPNATVIGDVTLGEGSSVWFGAVLRGDVNPIRIGERTNIQDGSVLHV------------------D------ 56 (153) T ss_pred CCCCCEEECCCCEEECCEEECCCCEECCCCEEECCCCCEEECCCCEECCCCCCCC------------------C------ T ss_conf 9189819999999977269989999965838424666359999989896854352------------------5------ Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCC Q ss_conf 22222332222222122443211112222221223320000166554331001234432234524553214860652344 Q gi|255764481|r 80 NFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGS 159 (271) Q Consensus 80 ~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~ 159 (271) ....+.||+++.|..++.+ |+|.||++|+++.++.+...++||++++||+++ T Consensus 57 --~~~~~~IG~~v~Ig~~~~i--------------------------~g~~Ig~~~~IG~~a~I~~gv~IG~~~vIgags 108 (153) T cd04645 57 --PGYPTIIGDNVTVGHGAVL--------------------------HGCTIGDNCLIGMGAIILDGAVIGKGSIVAAGS 108 (153) T ss_pred --CCCCCEECCCCEECCCCEE--------------------------ECEEEEEEEEECCCCEECCCCEECCCCEECCCC T ss_conf --9999798999899999599--------------------------241873004770543985699988997994498 Q ss_pred CCCCCEEEECCCEEECC-CCCCCCCCCCCH Q ss_conf 43365064047435013-211223333100 Q gi|255764481|r 160 AVHQFTRIGKYAFIGGM-TGVVHDVIPYGI 188 (271) Q Consensus 160 ~v~~~v~IG~~a~ig~g-s~V~kdVpp~~~ 188 (271) .|.++..|.+++++++. |.+.|+++|.-+ T Consensus 109 vV~~~~~vp~~~v~~G~PAr~ir~~~~~e~ 138 (153) T cd04645 109 LVPPGKVIPPGSLVAGSPAKVVRELTDEEI 138 (153) T ss_pred EECCCCEECCCCEEEECCCEEECCCCHHHH T ss_conf 974897969985999628379715999999 No 49 >cd05636 LbH_G1P_TT_C_like Putative glucose-1-phosphate thymidylyltransferase, C-terminal Left-handed parallel beta-Helix (LbH) domain: Proteins in this family show simlarity to glucose-1-phosphate adenylyltransferases in that they contain N-terminal catalytic domains that resemble a dinucleotide-binding Rossmann fold and C-terminal LbH fold domains. Members in this family are predicted to be glucose-1-phosphate thymidylyltransferases, which are involved in the dTDP-L-rhamnose biosynthetic pathway. Glucose-1-phosphate thymidylyltransferase catalyzes the synthesis of deoxy-thymidine di-phosphate (dTDP)-L-rhamnose, an important component of the cell wall of many microorganisms. The C-terminal LbH domain contains multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.80 E-value=1.3e-19 Score=143.09 Aligned_cols=155 Identities=15% Similarity=0.153 Sum_probs=84.0 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCC--CCCC Q ss_conf 50579975655108889859988989956559889989988989866868916846865531143422321023--3312 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQ--SKYH 79 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~q--d~~~ 79 (271) +.|..++.|++..+|+++|+|++++.|++.++|+++++||++|.|.+++.|..+++||.++.|. .++|+.... +..| T Consensus 6 ~~I~p~a~I~~~v~Ig~~a~I~~~a~I~~~v~IG~~~~Ig~~~~I~~~~~IG~~~~IG~~~~I~-~~vi~~~~~i~~~~~ 84 (163) T cd05636 6 GTVEEGVTIKGPVWIGEGAIVRSGAYIEGPVIIGKGCEIGPNAYIRGYTVLGDGCVVGNSVEVK-NSIIMDGTKVPHLNY 84 (163) T ss_pred EEECCCCEECCCEEECCCCEECCCCEEECCEEECCCCEEEECCCCCCCEEECCEEEECCCEEEC-CCCCCCCCCCCCCCE T ss_conf 7999999998981999999999999997975998998993641226985861301205512421-240147843277568 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCC Q ss_conf 22222332222222122443211112222221223320000166554331001234432234524553214860652344 Q gi|255764481|r 80 NFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGS 159 (271) Q Consensus 80 ~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~ 159 (271) -+ ...||+++.|+.++++...... .....+.-+..+....+..+.+.|||+|.++.++.+...++|+++++||+++ T Consensus 85 ig---~siIG~~v~IGagtvi~n~~~~-~~~~~v~~~~~~~~~~~~~~G~vIGd~~~IG~ns~I~~G~~IG~~a~IgaGs 160 (163) T cd05636 85 VG---DSVLGENVNLGAGTITANLRFD-DKPVKVRLKGERVDTGRRKLGAIIGDGVKTGINVSLNPGVKIGPGSWVYPGC 160 (163) T ss_pred EC---CCEECCCCEECCCEEEECCCCC-CCCEEEEECCEEECCCCEECCCEECCCCEECCCCEECCCEEECCCCEECCCC T ss_conf 77---7299299698898378177678-8634784077062367704077977997999898986995999998999997 Q ss_pred CC Q ss_conf 43 Q gi|255764481|r 160 AV 161 (271) Q Consensus 160 ~v 161 (271) +| T Consensus 161 vV 162 (163) T cd05636 161 VV 162 (163) T ss_pred EE T ss_conf 96 No 50 >TIGR02287 PaaY phenylacetic acid degradation protein PaaY; InterPro: IPR011974 Members of this family are located next to other genes organised into apparent operons for phenylacetic acid degradation . PaaY is located near the end of these gene clusters and often next to PaaX, a transcriptional regulator.. Probab=99.80 E-value=3e-20 Score=147.12 Aligned_cols=151 Identities=22% Similarity=0.327 Sum_probs=113.3 Q ss_pred CCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCC Q ss_conf 79975655108889859988989956559889989988989866868916846865531143422321023331222222 Q gi|255764481|r 5 GNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGT 84 (271) Q Consensus 5 ~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~ 84 (271) |-.+.+||+|||||+|.| -|++.||++|.|+|.|+| +|+.. T Consensus 6 GLtPVVhp~ayVHP~AVL------------------------------iGDVi~G~rcYvgPlASL---------RGDFG 46 (193) T TIGR02287 6 GLTPVVHPEAYVHPDAVL------------------------------IGDVIVGKRCYVGPLASL---------RGDFG 46 (193) T ss_pred CCCCCCCCCCCCCCCEEE------------------------------EEEEEECCCCEECCCCCC---------CCCCC T ss_conf 884656987734686378------------------------------621788795246766313---------57757 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCC Q ss_conf 33222222212244321111222222122332000016655433100123443223452455321486065234443365 Q gi|255764481|r 85 ELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQF 164 (271) Q Consensus 85 ~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~ 164 (271) ++++-+...|+++|..| | -++..|.++.| -||||..++. .|.|+.|+.||.|++|+.. T Consensus 47 rIvl~eGAN~QD~CVMH-G--FPg~DTvVeen------GHvGHgAiLH-------------gC~vgrnaLvGMNAVVMDg 104 (193) T TIGR02287 47 RIVLKEGANVQDTCVMH-G--FPGQDTVVEEN------GHVGHGAILH-------------GCRVGRNALVGMNAVVMDG 104 (193) T ss_pred CEEEECCCCCCCCCEEC-C--CCCCCEEEECC------CCCCCCEEEC-------------CCEECCCCEECCCEEEECC T ss_conf 25753488614642436-7--29988676068------8233212351-------------5463252311165066166 Q ss_pred EEEECCCEEECCCCCC--CCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHH---HHHHHH Q ss_conf 0640474350132112--233331000107453233442014442888999999999---999997 Q gi|255764481|r 165 TRIGKYAFIGGMTGVV--HDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRA---VYKQIF 225 (271) Q Consensus 165 v~IG~~a~ig~gs~V~--kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~---ayr~lf 225 (271) ..||+.|+|||.|.|. -.+||..|+.|+|||+. |-+|++|+.+-++ -|..|- T Consensus 105 Avige~sIVaA~aFVKAg~E~paq~Lv~GsPAkv~---------R~LseQElaWK~~GT~eYq~La 161 (193) T TIGR02287 105 AVIGERSIVAASAFVKAGAEIPAQVLVVGSPAKVV---------RELSEQELAWKKRGTREYQDLA 161 (193) T ss_pred CEECCCCCEEHHHHHHCCCCCCCCCEEECCCCEEE---------ECCHHHHHHHHHCCCHHHHHHH T ss_conf 46677412002033434734784612662873055---------3311467887413425689999 No 51 >cd05825 LbH_wcaF_like wcaF-like: This group is composed of the protein product of the E. coli wcaF gene and similar proteins. WcaF is part of the gene cluster responsible for the biosynthesis of the extracellular polysaccharide colanic acid. The wcaF protein is predicted to contain a left-handed parallel beta-helix (LbH) domain encoded by imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Many are trimeric in their active forms. Probab=99.78 E-value=1.4e-19 Score=142.87 Aligned_cols=55 Identities=24% Similarity=0.358 Sum_probs=51.8 Q ss_pred CCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCC Q ss_conf 2455321486065234443365064047435013211223333100010745323 Q gi|255764481|r 143 IAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALR 197 (271) Q Consensus 143 l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~ 197 (271) ..+.++|+|++|||++++|+++++||++|+|||+|+|+||+|||+++.|+|||.. T Consensus 53 ~~~~v~Ig~~v~IG~~~~I~~gv~Ig~~svVgagSvV~kdvp~~~i~~G~PAk~I 107 (107) T cd05825 53 ITAPIVIGDGAWVAAEAFVGPGVTIGEGAVVGARSVVVRDLPAWTVYAGNPAVPV 107 (107) T ss_pred ECCCEEECCCCEECCCCEEECCCEECCCCEEECCCEECCCCCCCCEEEEECCEEC T ss_conf 6399799388587498789699191899799489889038599979981475899 No 52 >cd04745 LbH_paaY_like paaY-like: This group is composed by uncharacterized proteins with similarity to the protein product of the E. coli paaY gene, which is part of the paa gene cluster responsible for phenylacetic acid degradation. Proteins in this group are expected to adopt the left-handed parallel beta-helix (LbH) structure. They contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Similarity to gamma carbonic anhydrase and Ferripyochelin Binding Protein (FBP) may suggest metal binding capacity. Probab=99.77 E-value=1.1e-18 Score=137.25 Aligned_cols=62 Identities=18% Similarity=0.375 Sum_probs=53.6 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECC-CCCCCCCCCCCH Q ss_conf 33100123443223452455321486065234443365064047435013-211223333100 Q gi|255764481|r 127 HDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGM-TGVVHDVIPYGI 188 (271) Q Consensus 127 Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~g-s~V~kdVpp~~~ 188 (271) |+|.||++|.++.++.+...++||++++||+++.|.+...|-+++++++. +.+.|++.+--+ T Consensus 77 ~g~~Ig~~~~IG~~a~I~~G~~IG~~siIgagSvV~~~~~VP~~~l~~GnPAk~ir~l~~~e~ 139 (155) T cd04745 77 HGCTIGRNALVGMNAVVMDGAVIGEESIVGAMAFVKAGTVIPPRSLIAGSPAKVIRELSDEEV 139 (155) T ss_pred CCCEEEEEEEECCCCEEECCCEECCCCEEEECCEECCCCCCCCCEEEEEECCEEEECCCHHHH T ss_conf 151774324976875995977989997994187876996837998999728679743999999 No 53 >cd00710 LbH_gamma_CA Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three distinct groups of carbonic anhydrases - alpha, beta and gamma - which show no significant sequence identity or structural similarity. Gamma CAs are homotrimeric enzymes, with each subunit containing a left-handed parallel beta helix (LbH) structural domain. Probab=99.77 E-value=1.2e-18 Score=137.00 Aligned_cols=160 Identities=18% Similarity=0.203 Sum_probs=95.9 Q ss_pred CEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCC----CEEEEEEECCCCCEEEECCCCCCCCCCC Q ss_conf 756551088898599889899565598899899889898668689168----4686553114342232102333122222 Q gi|255764481|r 8 PIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGK----TKIGDFTKVFPMAVLGGDTQSKYHNFVG 83 (271) Q Consensus 8 ~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~----t~IG~~~~I~~~a~Ig~~~qd~~~~g~~ 83 (271) +.|||+|+|+|+|.+ -++|+||+++.|.++++|+++ .+||++|.|..+++|... .. T Consensus 3 P~i~~~afIap~A~v------------iGdV~ig~~~sIw~~aviRgD~~~~I~IG~~~nIqD~~viH~~--------~~ 62 (167) T cd00710 3 PVIDPSAYVHPTAVV------------IGDVIIGDNVFVGPGASIRADEGTPIIIGANVNIQDGVVIHAL--------EG 62 (167) T ss_pred CCCCCCEEECCCCEE------------EEEEEECCCCEECCCCEEECCCCCCEEECCCCCCCCCEEEECC--------CC T ss_conf 706999398999989------------9708999999998887894788887698887434777078225--------76 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCC Q ss_conf 23322222221224432111122222212233200001665543310012344322345245532148606523444336 Q gi|255764481|r 84 TELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQ 163 (271) Q Consensus 84 ~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~ 163 (271) ..+.||++++|+.++++| ..|.||++|.++.++.+.. ++|||+++||+++.| + T Consensus 63 ~~~~IG~~vtIgh~a~ih-------------------------G~c~Igd~~~IG~gaiv~~-a~Ig~~~vIgags~V-~ 115 (167) T cd00710 63 YSVWIGKNVSIAHGAIVH-------------------------GPAYIGDNCFIGFRSVVFN-AKVGDNCVIGHNAVV-D 115 (167) T ss_pred CCEEECCCCEECCCCEEC-------------------------CEEEECCCCEECCCCEEEC-CEECCCCEECCCCEE-E T ss_conf 674977982898984881-------------------------2049839988988989982-597699899889888-4 Q ss_pred CEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHH Q ss_conf 50640474350132112233331000107453233442014442888999999999999 Q gi|255764481|r 164 FTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYK 222 (271) Q Consensus 164 ~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr 222 (271) .++|..+.++.+|+.|++..|+..+-.-.+... +...-+.+.+.+|.+.|| T Consensus 116 g~~i~~g~~v~~G~~v~~~~pa~~~~~lt~~~~--------~~~~~~~~~y~el~~~y~ 166 (167) T cd00710 116 GVEIPPGRYVPAGAVITSQTQADALPDVTDSAR--------EFNEKVITVNNELAEGYK 166 (167) T ss_pred CCCCCCCCEECCCCEEECCCCHHHCCCCCHHHH--------HHHHHHHHHHHHHHHHHC T ss_conf 778399989558828836996697337999999--------999999999999998743 No 54 >cd04650 LbH_FBP Ferripyochelin Binding Protein (FBP): FBP is an outer membrane protein which plays a role in iron acquisition. It binds iron when it is complexed with pyochelin. It adopts the left-handed parallel beta-helix (LbH) structure, and contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Acyltransferase activity has not been observed in this group. Probab=99.76 E-value=3.2e-18 Score=134.47 Aligned_cols=134 Identities=21% Similarity=0.385 Sum_probs=91.1 Q ss_pred CCCCCCEECCCEEECCCCEECCCCEECCCCEECCC---CEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCC Q ss_conf 05799756551088898599889899565598899---899889898668689168468655311434223210233312 Q gi|255764481|r 3 RMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSE---VEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYH 79 (271) Q Consensus 3 ~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~---v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~ 79 (271) +|+++++|||+|+|..++.||+++.|+|+|+|..+ ++||++|.|++++.|. ... T Consensus 2 ~I~~~~~I~p~a~i~G~v~IG~~~~I~~~~~Ir~d~~~i~IG~~~~I~~n~~I~------------------~~~----- 58 (154) T cd04650 2 RISPKAYVHPTSYVIGDVVIGELTSVWHYAVIRGDNDSIYIGKYSNVQENVSIH------------------TDH----- 58 (154) T ss_pred CCCCCEEECCCCEEECEEEECCCCEECCCCEEECCCCCEEECCCEEECCCCEEE------------------ECC----- T ss_conf 469987999999997605999999998753895588765999933989998898------------------469----- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCC Q ss_conf 22222332222222122443211112222221223320000166554331001234432234524553214860652344 Q gi|255764481|r 80 NFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGS 159 (271) Q Consensus 80 ~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~ 159 (271) ...+.||+++.|..++.+ |.|.||++|.++.++.+...++||++++||+++ T Consensus 59 ---~~~~~ig~~v~ig~~~~i--------------------------~g~~Ig~~v~IG~~a~I~~Gv~IG~~svIgaGs 109 (154) T cd04650 59 ---GYPTEIGDYVTIGHNAVV--------------------------HGAKVGNYVIVGMGAILLNGAKIGDHVIIGAGA 109 (154) T ss_pred ---CCCCEECCCCEECCCEEE--------------------------ECCEECCEEEECCCEEEECCCEECCCCEECCCC T ss_conf ---998687997688995288--------------------------230220135988722996287989997998898 Q ss_pred CCCCCEEEECCCEEECC-CCCCCCCCCCCH Q ss_conf 43365064047435013-211223333100 Q gi|255764481|r 160 AVHQFTRIGKYAFIGGM-TGVVHDVIPYGI 188 (271) Q Consensus 160 ~v~~~v~IG~~a~ig~g-s~V~kdVpp~~~ 188 (271) +|-+...+-+++++++. +.+.|++++..+ T Consensus 110 VV~~~~~Vp~~~lv~G~PAr~ir~l~~~~~ 139 (154) T cd04650 110 VVTPGKEIPDYSLVLGVPAKVVRKLTEEEI 139 (154) T ss_pred EECCCCEECCCCEEEECCEEEEEECCHHHH T ss_conf 887997919984999618189772999999 No 55 >PRK10092 maltose O-acetyltransferase; Provisional Probab=99.75 E-value=5.9e-19 Score=139.04 Aligned_cols=56 Identities=21% Similarity=0.340 Sum_probs=52.8 Q ss_pred CCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCC Q ss_conf 52455321486065234443365064047435013211223333100010745323 Q gi|255764481|r 142 MIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALR 197 (271) Q Consensus 142 ~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~ 197 (271) ..+..++|||++|||++++|.++++||++|.|||||+|+|||||+++++|+|||+. T Consensus 125 ~~~~pi~Ig~~vwig~~~~i~pGv~IG~~~vigagsvV~~dip~~~v~~G~Par~i 180 (183) T PRK10092 125 ELGKPVTIGNNVWIGGRAVINPGVTIGDNVVVASGAVVTKDVPDNVVVGGNPARII 180 (183) T ss_pred EECCCEEECCCCEECCCCEEECCCEECCCCEECCCCEEEEECCCCCEEEEECCEEE T ss_conf 31687798998688899899079899999899759889227799929997475788 No 56 >COG1207 GlmU N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane] Probab=99.72 E-value=2.3e-17 Score=129.09 Aligned_cols=152 Identities=22% Similarity=0.351 Sum_probs=113.8 Q ss_pred CEECCC-EEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCC Q ss_conf 756551-0888985998898995655988998998898986686891684686553114342232102333122222233 Q gi|255764481|r 8 PIIHPL-ALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTEL 86 (271) Q Consensus 8 ~~Ihps-A~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v 86 (271) .+|+|. -+|+++++||.+|.|.|++++.++++||++|+|+|+++|. ++.||+++.|.+|++|.+ . T Consensus 256 tl~dP~t~~i~~dv~ig~DvvI~p~v~l~G~t~ig~~v~iGpg~~i~-ds~I~~~a~I~~~S~ie~-------------s 321 (460) T COG1207 256 TLIDPATTYIRGDVEIGRDVVIEPNVILEGNTVIGDNVVIGPGSVIK-DSVIGDNAVIKAYSVIEG-------------S 321 (460) T ss_pred EEECCCEEEECCCEEECCCEEEECCCEEEEEEEECCCEEECCCCEEE-EEEECCCCEEEECCEEEC-------------C T ss_conf 99678848873717987814991484894158967964999996787-538769988975104305-------------5 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCC--- Q ss_conf 22222221224432111122222212233200001665543310012344322345245532148606523444336--- Q gi|255764481|r 87 LVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQ--- 163 (271) Q Consensus 87 ~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~--- 163 (271) .||+++.++.|+.+-+++..+ ..++||+- +-+ .+..||++.-..+-+. -|.++||.++.||+++..-. T Consensus 322 ~vg~~~~VGPfA~LRPg~~L~-~~~hIGNF------VEv-K~a~ig~gsKa~HLtY-lGDA~iG~~~NiGAGtItcNYDG 392 (460) T COG1207 322 TVGEGATVGPFARLRPGAVLG-ADVHIGNF------VEV-KKATIGKGSKAGHLTY-LGDAEIGENVNIGAGTITCNYDG 392 (460) T ss_pred EECCCCCCCCCCCCCCCCCCC-CCCEEEEE------EEE-ECCCCCCCCCCCCEEE-ECCCEECCCCEECCCEEEECCCC T ss_conf 854796337831017968526-79767425------998-4356068863353234-21332068742144259984788 Q ss_pred ----CEEEECCCEEECCCCCCCC Q ss_conf ----5064047435013211223 Q gi|255764481|r 164 ----FTRIGKYAFIGGMTGVVHD 182 (271) Q Consensus 164 ----~v~IG~~a~ig~gs~V~kd 182 (271) .+.||+++|||.+|...-. T Consensus 393 ~nK~~T~IGd~vFiGSns~LVAP 415 (460) T COG1207 393 KNKFKTIIGDNVFIGSNSQLVAP 415 (460) T ss_pred CCCCEEEECCCCEECCCCCEEEE T ss_conf 63320064688577668718864 No 57 >cd03354 LbH_SAT Serine acetyltransferase (SAT): SAT catalyzes the CoA-dependent acetylation of the side chain hydroxyl group of L-serine to form O-acetylserine, as the first step of a two-step biosynthetic pathway in bacteria and plants leading to the formation of L-cysteine. This reaction represents a key metabolic point of regulation for the cysteine biosynthetic pathway due to its feedback inhibition by cysteine. The enzyme is a 175 kDa homohexamer, composed of a dimer of homotrimers. Each subunit contains an N-terminal alpha helical region and a C-terminal left-handed beta-helix (LbH) subdomain with 5 turns, each containing a hexapeptide repeat motif characteristic of the acyltransferase superfamily of enzymes. The trimer interface mainly involves the C-terminal LbH subdomain while the dimer (of trimers) interface is mediated by the N-terminal alpha helical subdomain. Probab=99.72 E-value=3.6e-18 Score=134.12 Aligned_cols=49 Identities=31% Similarity=0.422 Sum_probs=47.2 Q ss_pred CCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCC Q ss_conf 5532148606523444336506404743501321122333310001074 Q gi|255764481|r 145 GHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNP 193 (271) Q Consensus 145 G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~p 193 (271) +++.|+|++|||.++.+.++++||++|+|||+|+|+||+|+++++.|+| T Consensus 53 ~~~~Ig~~v~IG~~~~I~~gv~IG~~~vIgagsvV~~dv~~~si~~G~P 101 (101) T cd03354 53 RHPTIGDNVVIGAGAKILGNITIGDNVKIGANAVVTKDVPANSTVVGVP 101 (101) T ss_pred CCCCCCCCEEECCCCEEECCCEECCCCEECCCCEECCCCCCCCEEEECC T ss_conf 7761487586358988929879999879987988996859998998039 No 58 >PRK13627 carnitine operon protein CaiE; Provisional Probab=99.70 E-value=3.7e-17 Score=127.77 Aligned_cols=166 Identities=15% Similarity=0.279 Sum_probs=124.6 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCC Q ss_conf 50579975655108889859988989956559889989988989866868916846865531143422321023331222 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNF 81 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g 81 (271) -+|+.+++|+|+|.|-.++.||++++|||+|+|..++ +..+||++|.|..+|+|...+ T Consensus 11 P~i~~~afIap~A~viGdV~ig~~~sIw~~aVlRGD~---------------~~I~IG~~tNIQD~~viH~~~------- 68 (196) T PRK13627 11 PVVHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDY---------------GRLIVQAGANLQDGCIMHGYC------- 68 (196) T ss_pred CCCCCCEEECCCCEEEEEEEECCCCEECCCEEEECCC---------------CCEEECCCCEECCCCEEECCC------- T ss_conf 7229983889999899849999998897770895688---------------717989986767983781489------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCC Q ss_conf 22233222222212244321111222222122332000016655433100123443223452455321486065234443 Q gi|255764481|r 82 VGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAV 161 (271) Q Consensus 82 ~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v 161 (271) ...+.||++++|+.++.+ |+|.||++|.++.++.+...++||++++||+++.| T Consensus 69 -~~~~~IG~~vtIGH~ail--------------------------hgc~Igd~~lIGmgavVldga~Ig~~~iI~AgslV 121 (196) T PRK13627 69 -DTDTIVGENGHIGHGAIL--------------------------HGCVIGRDALVGMNSVIMDGAVIGEESIVAAMSFV 121 (196) T ss_pred -CCCCEECCCEEECCCEEE--------------------------EEEEEECCCEECCCCEEECCCEECCCCEECCCCEE T ss_conf -998588898058787299--------------------------63499489788469799288899999899768645 Q ss_pred CCCEEEECCCEEECC-CCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 365064047435013-21122333310001074532334420144428889999999999999972799899 Q gi|255764481|r 162 HQFTRIGKYAFIGGM-TGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIY 232 (271) Q Consensus 162 ~~~v~IG~~a~ig~g-s~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~ 232 (271) .++.+|-.+.++.+. +.+.+++-+-.+ ++...+.+.+.+|.+.|+.-.+.-.+|. T Consensus 122 ~~g~~ip~~~L~~G~PAk~vR~lt~~ei----------------~~~~~~a~~Y~~la~r~~~~l~~~~Pl~ 177 (196) T PRK13627 122 KAGFQGEKRQLLMGTPARAVRSVSDQEL----------------HWKRLNTKEYQDLAGRCHASLHETQPLT 177 (196) T ss_pred CCCCCCCCCCEEEECCEEEEECCCHHHH----------------HHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 7994938981998778168114999999----------------9988769999999999985485468864 No 59 >COG0110 WbbJ Acetyltransferase (isoleucine patch superfamily) [General function prediction only] Probab=99.70 E-value=4.3e-17 Score=127.35 Aligned_cols=120 Identities=29% Similarity=0.421 Sum_probs=76.3 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCC-CCCCCCCEECCCEEECCCCCCCC Q ss_conf 33222222212244321111222222122332000016655433100123443223-45245532148606523444336 Q gi|255764481|r 85 ELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNN-VMIAGHVIVDDRVVFGGGSAVHQ 163 (271) Q Consensus 85 ~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~-~~l~G~v~Igd~~~IG~~~~v~~ 163 (271) .+.+|+++.+..++.+-. .....||++++++.++++-.....++....... ....+.++||++||||+++.|.+ T Consensus 67 ~~~iG~~~~i~~~~~~~~-----~~~i~ig~~~~i~~~v~i~~~~h~~~~~~~~~~~~~~~~~v~IG~~vwIG~~a~Ilp 141 (190) T COG0110 67 NLTIGDLCFIGVNVVILV-----GEGITIGDNVVVGPNVTIYTNSHPGDFVTANIGALVGAGPVTIGEDVWIGAGAVILP 141 (190) T ss_pred CEEECCEEEEECCCEEEE-----CCCCEECCCCEECCCEEECCCCCCCCCCCCCCCCCCCCCCCEECCCCEECCCCEECC T ss_conf 106463467702544983-----488588998589599498268765652200014323578978899858778978979 Q ss_pred CEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHH Q ss_conf 506404743501321122333310001074532334420144428889999999 Q gi|255764481|r 164 FTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLI 217 (271) Q Consensus 164 ~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l 217 (271) +++||++|+||++|+|+||+|||+++.|+|||+. |..+..+.+..+ T Consensus 142 GV~IG~gavigagsVVtkdvp~~~iv~G~Pa~vi--------r~~~~~~~~~~~ 187 (190) T COG0110 142 GVTIGEGAVIGAGSVVTKDVPPYGIVAGNPARVI--------RKRDVVAKIGVL 187 (190) T ss_pred CEEECCCEEEECCEEEECCCCCCEEEECCCCEEE--------EECCHHHHHHHH T ss_conf 8598998199788589577799828979998898--------741605555653 No 60 >KOG4750 consensus Probab=99.68 E-value=2e-17 Score=129.40 Aligned_cols=106 Identities=24% Similarity=0.381 Sum_probs=76.8 Q ss_pred CCCCCCCCCCCCC--CCCCCCCCCCCCCCCCCCCHHCC--CCCCCCCCCCCCCCCCCCCCCCC--------CCCEECCCE Q ss_conf 3222222212244--32111122222212233200001--66554331001234432234524--------553214860 Q gi|255764481|r 86 LLVGKKCVIREGV--TINRGTVEYGGKTIVGDNNFFLA--NSHVAHDCKLGNGIVLSNNVMIA--------GHVIVDDRV 153 (271) Q Consensus 86 v~IG~~~~Ire~v--~I~~gt~~~~~~T~IG~~~~i~~--~~hI~Hd~~IG~~vii~n~~~l~--------G~v~Igd~~ 153 (271) +.++.++.+.+.. -||++. +||+.-++.. ++-|+.-.+||++|.+..++.|+ .|-+|+|+| T Consensus 135 lal~~q~ris~~~gvdihpaa-------~ig~gilldhatgvvigeTAvvg~~vSilH~Vtlggtgk~~gdrhP~Igd~v 207 (269) T KOG4750 135 LALGLQVRISPNFGVDIHPAA-------KIGKGILLDHATGVVIGETAVVGDNVSILHPVTLGGTGKGSGDRHPKIGDNV 207 (269) T ss_pred EEEEECCEECCCCCCCCCCHH-------HCCCCEEECCCCCEEECCEEEECCCEEEECCEEECCCCCCCCCCCCCCCCCE T ss_conf 887641233242021246201-------0265312213332353102474264466444311465466555688556770 Q ss_pred EECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCC Q ss_conf 652344433650640474350132112233331000107453233 Q gi|255764481|r 154 VFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRG 198 (271) Q Consensus 154 ~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~ 198 (271) |||++++|..+++||++|+|||||+|+|||||..++.|||||+.+ T Consensus 208 liGaGvtILgnV~IGegavIaAGsvV~kDVP~~~~AvGnPAklIg 252 (269) T KOG4750 208 LIGAGVTILGNVTIGEGAVIAAGSVVLKDVPPNTLAVGNPAKLIG 252 (269) T ss_pred EECCCCEEECCEEECCCCEEECCCEEEECCCCCCEECCCCHHHCC T ss_conf 875541785781677886773050697216987530488233234 No 61 >cd03359 LbH_Dynactin_5 Dynactin 5 (or subunit p25); Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p25 is part of the pointed-end subcomplex in dynactin that also includes p26, p27, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.67 E-value=3e-16 Score=122.11 Aligned_cols=125 Identities=18% Similarity=0.239 Sum_probs=82.2 Q ss_pred CCEECCCCEECCCCEECCC---CEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9899889898668689168---4686553114342232102333122222233222222212244321111222222122 Q gi|255764481|r 37 EVEIGAGVELISHCVVAGK---TKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIV 113 (271) Q Consensus 37 ~v~IG~~~~I~~~v~I~g~---t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~I 113 (271) |+.||++|.|.++++|+++ .+||++|.|.++++|........+.... .++.| T Consensus 21 nI~IG~~ssI~~~avirGD~~~I~IG~~~~I~d~~~I~~~~~~~~~~~~~-------------------------~p~~I 75 (161) T cd03359 21 NIVLNGKTIIQSDVIIRGDLATVSIGRYCILSEGCVIRPPFKKFSKGVAF-------------------------FPLHI 75 (161) T ss_pred CEEECCCCEECCCCEEECCCCCEEECCCCEECCCCEEECCCCCCCCCCCC-------------------------CCCEE T ss_conf 75999987899996896788654999985888998992464435778888-------------------------77284 Q ss_pred CCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECC-CCCCCCCCCCC Q ss_conf 332000016655433100123443223452455321486065234443365064047435013-21122333310 Q gi|255764481|r 114 GDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGM-TGVVHDVIPYG 187 (271) Q Consensus 114 G~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~g-s~V~kdVpp~~ 187 (271) |++++++.++.+ |.|.||++|.++.++.++..+.|+|+++|+.++.|.+..+|-.++++++. +.+.+++|+-. T Consensus 76 Gd~v~Ig~~~~i-~~~~Ig~~v~IG~~~~I~~g~~Ig~~~~I~~gsvV~~~~~ip~~~~~~G~Pak~i~~l~e~~ 149 (161) T cd03359 76 GDYVFIGENCVV-NAAQIGSYVHIGKNCVIGRRCIIKDCVKILDGTVVPPDTVIPPYSVVSGRPARFIGELPECT 149 (161) T ss_pred CCCEEECCCEEE-ECCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEECCEEEEEECCHHH T ss_conf 898798998288-48896677155798199499799999899889898899898998199851808957699999 No 62 >PRK11132 cysE serine acetyltransferase; Provisional Probab=99.63 E-value=5.3e-16 Score=120.56 Aligned_cols=54 Identities=26% Similarity=0.393 Sum_probs=50.6 Q ss_pred CCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCC Q ss_conf 532148606523444336506404743501321122333310001074532334 Q gi|255764481|r 146 HVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGV 199 (271) Q Consensus 146 ~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~ 199 (271) |-+|+|+|.||+++.|..+++||++|.|||||+|++||||++++.|+|||+.|- T Consensus 193 HP~i~~~v~igaga~iLG~i~iG~~a~igA~svVl~dvp~~~tv~GvPa~~vg~ 246 (273) T PRK11132 193 HPKIREGVMIGAGAKILGNIEVGRGAKIGAGSVVLQPVPPHTTAAGVPARIVGK 246 (273) T ss_pred CCCCCCCEEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEECCCCEEECC T ss_conf 980488848754888971858899988867866864379998796667203088 No 63 >TIGR01172 cysE serine O-acetyltransferase; InterPro: IPR005881 The biosynthesis of L-cysteine is the predominant way by which inorganic sulphur is incorporated into organic compounds. In this process, the most abundant utilizable source of sulphur, inorganic sulphate, is taken up and reduced to sulphide. Sulphide is used to produce L-cysteine, which serves for protein synthesis or the production of other sulphur-containing organic compounds. Two routes for cysteine biosynthesis in nature have been documented. Serine transacetylase (2.3.1.30 from EC) catalyzes steps in pathway I, the activation of L-serine by acetyl-coenzyme A, yielding O-acetyl-L-serine.; GO: 0009001 serine O-acetyltransferase activity, 0006535 cysteine biosynthetic process from serine, 0005737 cytoplasm. Probab=99.62 E-value=3.7e-16 Score=121.56 Aligned_cols=50 Identities=30% Similarity=0.419 Sum_probs=48.0 Q ss_pred CCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCC Q ss_conf 53214860652344433650640474350132112233331000107453 Q gi|255764481|r 146 HVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGA 195 (271) Q Consensus 146 ~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~ 195 (271) |-+|+|+|.||+++-|+.+.+||++|-|||+|+|++||||+.+|.|.||| T Consensus 114 HPt~~~gV~iGAGAKvLG~I~vG~~AkiGAnsVVl~dVP~~~TVVGvPar 163 (163) T TIGR01172 114 HPTIGEGVVIGAGAKVLGNIEVGENAKIGANSVVLKDVPAGATVVGVPAR 163 (163) T ss_pred CCCCCCCEEEEECCEEECCEEECCCCEECCCEEECCCCCCCCEEEEECCC T ss_conf 88256871784078240376677997772332673537898658743589 No 64 >COG1045 CysE Serine acetyltransferase [Amino acid transport and metabolism] Probab=99.61 E-value=1.2e-15 Score=118.27 Aligned_cols=53 Identities=30% Similarity=0.454 Sum_probs=49.5 Q ss_pred CCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCC Q ss_conf 53214860652344433650640474350132112233331000107453233 Q gi|255764481|r 146 HVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRG 198 (271) Q Consensus 146 ~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~ 198 (271) |.+|+|+|.||+++-|...++||+++.|||+|+|.|||||++++.|.|||+.. T Consensus 119 hPtIg~~V~IGagAkILG~I~IGd~a~IGA~sVVlkdVP~~~tvvGvPArii~ 171 (194) T COG1045 119 HPTIGNGVYIGAGAKILGNIEIGDNAKIGAGSVVLKDVPPNATVVGVPARVIG 171 (194) T ss_pred CCCCCCCEEECCCCEEECCEEECCCCEECCCCEECCCCCCCCEEECCCCEEEC T ss_conf 98317885989998897166887897888786681578999668667646853 No 65 >PRK10191 putative colanic acid biosynthesis acetyltransferase WcaB; Provisional Probab=99.57 E-value=1.5e-15 Score=117.64 Aligned_cols=53 Identities=13% Similarity=0.218 Sum_probs=49.6 Q ss_pred CCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCC Q ss_conf 55321486065234443365064047435013211223333100010745323 Q gi|255764481|r 145 GHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALR 197 (271) Q Consensus 145 G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~ 197 (271) ++-+|||+++||+++.+...++||++++|||+|+|++||||++++.|+|||++ T Consensus 91 ~~P~Ig~~V~Igaga~IlG~i~IG~~~~IGA~svV~~dvp~~~~v~G~Pari~ 143 (146) T PRK10191 91 ACPHIGNGVELGANVIILGDITLGNNVTVGAGSVVLDSVPDNALVVGEKARVK 143 (146) T ss_pred CCCEECCCEEEECCCEEECCCEECCCCEECCCCEECCCCCCCCEEEEECCEEE T ss_conf 98987999599289999648599999999989589703599999997680999 No 66 >COG0663 PaaY Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [General function prediction only] Probab=99.56 E-value=9.8e-15 Score=112.62 Aligned_cols=133 Identities=20% Similarity=0.360 Sum_probs=103.2 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCC Q ss_conf 50579975655108889859988989956559889989988989866868916846865531143422321023331222 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNF 81 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g 81 (271) -+|+.+++|||+|.|=.+++||+++.|||+++|..++ +..+||+++.|..+++|-.++. T Consensus 12 P~i~~~a~Va~~A~viGdV~Ig~~vsIw~~aVlRgD~---------------~~I~IG~~tNIQDg~ViH~~~~------ 70 (176) T COG0663 12 PKIDPTAFVAPSATVIGDVRIGAGVSIWPGAVLRGDV---------------EPIRIGARTNIQDGVVIHADPG------ 70 (176) T ss_pred CCCCCCEEECCCCEEEEEEEECCCCEECCCEEEECCC---------------CCEEECCCCEECCCEEEECCCC------ T ss_conf 8789862877999799859999997887763897467---------------7569878856037838952799------ Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCC Q ss_conf 22233222222212244321111222222122332000016655433100123443223452455321486065234443 Q gi|255764481|r 82 VGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAV 161 (271) Q Consensus 82 ~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v 161 (271) ..+.||++++|+..+.+ |.|.||++|.++.++.+.-.++||++++||+++.| T Consensus 71 --~p~~IG~~vtIGH~aiv--------------------------HGc~Ig~~~lIGmgA~vldga~IG~~~iVgAgalV 122 (176) T COG0663 71 --YPVTIGDDVTIGHGAVV--------------------------HGCTIGDNVLIGMGATVLDGAVIGDGSIVGAGALV 122 (176) T ss_pred --CCEEECCCCEECCCCEE--------------------------EEEEECCCCEEECCCEEECCCEECCCCEECCCCCC T ss_conf --99297799588576589--------------------------88598788589138667578698898698668700 Q ss_pred CCCEEEECCCEEEC-CCCCCCCC Q ss_conf 36506404743501-32112233 Q gi|255764481|r 162 HQFTRIGKYAFIGG-MTGVVHDV 183 (271) Q Consensus 162 ~~~v~IG~~a~ig~-gs~V~kdV 183 (271) .+.-++..+.++-+ -+.+.+.+ T Consensus 123 ~~~k~~p~~~L~~G~Pak~~r~l 145 (176) T COG0663 123 TPGKEIPGGSLVVGSPAKVVRPL 145 (176) T ss_pred CCCCCCCCCEEEECCCCEEEECC T ss_conf 58837799759623740144028 No 67 >COG2171 DapD Tetrahydrodipicolinate N-succinyltransferase [Amino acid transport and metabolism] Probab=99.42 E-value=5.2e-13 Score=101.82 Aligned_cols=119 Identities=23% Similarity=0.263 Sum_probs=86.0 Q ss_pred CEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCC Q ss_conf 75655108889859988989956559889989988989866868916846865531143422321023331222222332 Q gi|255764481|r 8 PIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELL 87 (271) Q Consensus 8 ~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~ 87 (271) ..|-|+|+|-.+|.|++|+++.|-++|.=++.++++|.|.-++++.+.+.||+||.|+-+|+|++- ++ T Consensus 109 ~RI~p~a~VR~ga~i~~gtvvM~~sfVNigA~~~~gtMVd~~as~G~~a~VGkn~higgGa~I~GV------------Le 176 (271) T COG2171 109 VRIVPGAIVRLGAYIAKGTVVMPESFVNIGAGTGEGTMVDGRASVGSCAQVGKNSHIGGGASIGGV------------LE 176 (271) T ss_pred EEECCCCEEEECCEECCCCEECCCCEEEECCCCCCCEEEEEEEEEECCEEECCCCCCCCCCEEEEE------------EC T ss_conf 363676477605287799387555668777602752589615655131797998452786367677------------34 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEE Q ss_conf 22222212244321111222222122332000016655433100123443223452455321486065234443365064 Q gi|255764481|r 88 VGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRI 167 (271) Q Consensus 88 IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~I 167 (271) ++ ...+ +.|||+|+||+++.+.-++.+ T Consensus 177 --------------p~---~a~P------------------------------------v~IgdncliGAns~~veGV~v 203 (271) T COG2171 177 --------------PL---QANP------------------------------------VIIGDNCLIGANSEVVEGVIV 203 (271) T ss_pred --------------CC---CCCC------------------------------------EEECCCCEECCCCCEEEEEEE T ss_conf --------------88---8788------------------------------------698776186245334861696 Q ss_pred ECCCEEECCCCCCCCCCCCCHHHH Q ss_conf 047435013211223333100010 Q gi|255764481|r 168 GKYAFIGGMTGVVHDVIPYGILNG 191 (271) Q Consensus 168 G~~a~ig~gs~V~kdVpp~~~~~G 191 (271) |++|.|+||..|++|+|++..+.| T Consensus 204 Gdg~VV~aGv~I~~~tki~~~~~g 227 (271) T COG2171 204 GDGCVVAAGVFITQDTKIYDRVAG 227 (271) T ss_pred CCCCEEECCEEEECCCCEEEEECC T ss_conf 789688545089379514774346 No 68 >TIGR03570 NeuD_NnaD sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family. These proteins contain repeats of the bacterial transferase hexapeptide (pfam00132), although often these do not register above the trusted cutoff. Probab=99.42 E-value=5.1e-13 Score=101.88 Aligned_cols=82 Identities=30% Similarity=0.528 Sum_probs=68.0 Q ss_pred CCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCC Q ss_conf 97565510888985998898995655988998998898986686891684686553114342232102333122222233 Q gi|255764481|r 7 NPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTEL 86 (271) Q Consensus 7 ~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v 86 (271) -..|||+|+|+++++||+++.|+|+|+|.++++||++|.|.+++.|.-+++||++|.+.|++.+++ .+ T Consensus 87 ~~lIhp~a~i~~~~~ig~G~~I~~~~~i~~~~~IG~~~~i~~~~~I~Hd~~ig~~~~i~~~~~i~G------------~v 154 (201) T TIGR03570 87 ATLIHPSAIVSPSASIGEGTVIMAGAVINPDVRIGDNVIINTGAIVEHDCVIGDYVHIAPGVTLSG------------GV 154 (201) T ss_pred EEEECCCCEECCCCEECCCCEEECCCEECCCCEEEEEEEECCCCCCCCCCEECCCCEECCCCEECC------------CC T ss_conf 899999809889867879989962869876647802679804232144254778768999859989------------88 Q ss_pred CCCCCCCCCCCCCC Q ss_conf 22222221224432 Q gi|255764481|r 87 LVGKKCVIREGVTI 100 (271) Q Consensus 87 ~IG~~~~Ire~v~I 100 (271) .||+++.|+.+++| T Consensus 155 ~Ig~~~~iG~~~~i 168 (201) T TIGR03570 155 VIGEGVFIGAGATI 168 (201) T ss_pred EECCCCEECCCCEE T ss_conf 79777359898898 No 69 >cd05824 LbH_M1P_guanylylT_C Mannose-1-phosphate guanylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Mannose-1-phosphate guanylyltransferase is also known as GDP-mannose pyrophosphorylase. It catalyzes the synthesis of GDP-mannose from GTP and mannose-1-phosphate, and is involved in the maintenance of cell wall integrity and glycosylation. Similar to ADP-glucose pyrophosphorylase, it contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain, presumably with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.41 E-value=4e-13 Score=102.57 Aligned_cols=77 Identities=26% Similarity=0.478 Sum_probs=40.1 Q ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8898599889899565598899899889898668689168468655311434223210233312222223322222221 Q gi|255764481|r 16 VEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVI 94 (271) Q Consensus 16 I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~I 94 (271) |||+|+|+++|+|||+++|+++|+||++|+|. +++|..+++|++++.+. +++||..++..+|......+.|||++.| T Consensus 2 Idpsa~Ig~~~~IGp~vvIG~nv~IG~gv~I~-~~vI~~~~~I~~~s~i~-~siIG~~~~iG~~~ri~~~~viGd~v~I 78 (80) T cd05824 2 IDPSAKIGKTAKIGPNVVIGPNVTIGDGVRLQ-RCVILSNSTVRDHSWVK-SSIVGWNSTVGRWTRLENVTVLGDDVTI 78 (80) T ss_pred CCCCCEECCCCEECCCCEECCCCEECCCCEEE-EEEEECCCEECCCCEEE-CCEECCCCCCCCCEEECCCCEECCCCEE T ss_conf 37878989999999985899999999997997-30895789998986997-8998579704894036586298898698 No 70 >cd03360 LbH_AT_putative Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain. The alignment contains 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. A few members are identified as NeuD, a sialic acid (Sia) O-acetyltransferase that is required for Sia synthesis and surface polysaccharide sialylation. Probab=99.36 E-value=1e-12 Score=100.05 Aligned_cols=103 Identities=31% Similarity=0.501 Sum_probs=79.1 Q ss_pred CCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCC Q ss_conf 97565510888985998898995655988998998898986686891684686553114342232102333122222233 Q gi|255764481|r 7 NPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTEL 86 (271) Q Consensus 7 ~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v 86 (271) -..|||+|+|++.++||+++.|+|+|+|+++++||++|.|.+++.|.-++.||++|.+.|+++|++ .+ T Consensus 84 ~niIhp~a~i~~~~~ig~G~~I~~~~~i~~~~~IG~~~~i~~~~~i~Hd~~ig~~~~i~~~~~i~g------------~v 151 (197) T cd03360 84 ATLIHPSAVVSPSAVIGEGCVIMAGAVINPDARIGDNVIINTGAVIGHDCVIGDFVHIAPGVVLSG------------GV 151 (197) T ss_pred EEEECCCCEECCCCEECCCCEEECCCEECCCCCCCCEEEECCCHHCCCCCCCCCCCEECCCEEECC------------CE T ss_conf 899999959877747999429931759865653043689877001053150131010667236778------------30 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCC Q ss_conf 222222212244321111222222122332000016655433 Q gi|255764481|r 87 LVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHD 128 (271) Q Consensus 87 ~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd 128 (271) .||+++.|+.+++|..+ ..||+++.+++++.|-+| T Consensus 152 ~Ig~~~~iG~~~~i~~~-------i~Ig~~~~igags~v~~~ 186 (197) T cd03360 152 TIGEGAFIGAGATIIQG-------VTIGAGAIIGAGAVVTKD 186 (197) T ss_pred EEEEECCCCCCCEECCC-------CEECCCCEECCCCEEEEE T ss_conf 89121629899898799-------899999999979989125 No 71 >cd00208 LbetaH Left-handed parallel beta-Helix (LbetaH or LbH) domain: The alignment contains 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity, however, some subfamilies in this hierarchy also show activities related to ion transport or translation initiation. Many are trimeric in their active forms. Probab=99.32 E-value=3.8e-12 Score=96.41 Aligned_cols=35 Identities=31% Similarity=0.462 Sum_probs=32.0 Q ss_pred CCEECCCEEECCCCCCCCCEEEECCCEEECCCCCC Q ss_conf 53214860652344433650640474350132112 Q gi|255764481|r 146 HVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVV 180 (271) Q Consensus 146 ~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~ 180 (271) +++|||++|||+++.++++++||++|+|||+|+|| T Consensus 44 ~~~Ig~~v~Ig~~a~I~~gv~IGdn~~IgagavVt 78 (78) T cd00208 44 PTIIGDNVEIGANAVIHGGVKIGDNAVIGAGAVVT 78 (78) T ss_pred CCEECCCEEECCCCEEECCEEECCCCEECCCCEEC T ss_conf 98999996998689897981999999999885699 No 72 >PRK11830 dapD 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional Probab=99.27 E-value=1e-11 Score=93.78 Aligned_cols=62 Identities=24% Similarity=0.325 Sum_probs=32.7 Q ss_pred ECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEE Q ss_conf 655108889859988989956559889989988989866868916846865531143422321 Q gi|255764481|r 10 IHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGG 72 (271) Q Consensus 10 IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~ 72 (271) +-|.|++-.+|.|+++|.+.| ++|.=++.+|++|.|..++++...+.||+||++..++.||+ T Consensus 99 vVP~a~vR~Gayi~~~vVlMP-s~VNiGAyv~~gtmiDtwatvGScAqiGknvhls~g~gigG 160 (265) T PRK11830 99 VVPGAVVRRGAYIAKNVVLMP-SYVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSGGVGIGG 160 (265) T ss_pred ECCCCEEEECEEECCCCEEEE-EEEEEEEEECCCCEEECCCCEECHHEECCCEEECCCCEEEE T ss_conf 878853760305459958864-17776039848954804762411532378336878741426 No 73 >cd03350 LbH_THP_succinylT 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (also called THP succinyltransferase): THDP N-succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is homotrimeric and each subunit contains an N-terminal region with alpha helices and hairpin loops, as well as a C-terminal region with a left-handed parallel alpha-helix (LbH) structural motif encoded by hexapeptide repeat motifs. Probab=99.27 E-value=1.9e-11 Score=92.02 Aligned_cols=40 Identities=28% Similarity=0.256 Sum_probs=24.2 Q ss_pred CEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHH Q ss_conf 3214860652344433650640474350132112233331000 Q gi|255764481|r 147 VIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGIL 189 (271) Q Consensus 147 v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~ 189 (271) ++||++++||+++.|-+.+.+ |. +-++.....++|||+++ T Consensus 94 v~IG~~avIGAGsVVtkdvp~--y~-~~~g~~~~~~~p~~~~~ 133 (139) T cd03350 94 VIVGKGAVLAAGVVLTQSTPI--YD-RETGEIYYGRVPPGSVV 133 (139) T ss_pred CEECCCCEECCCCEEECCCCC--EE-EECCCEEECCCCCCCEE T ss_conf 799999899979889189884--68-23780970116998787 No 74 >cd03358 LbH_WxcM_N_like WcxM-like, Left-handed parallel beta-Helix (LbH) N-terminal domain: This group is composed of Xanthomonas campestris WcxM and proteins with similarity to the WcxM N-terminal domain. WcxM is thought to be bifunctional, catalyzing both the isomerization and transacetylation reactions of keto-hexoses. It contains an N-terminal LbH domain responsible for the transacetylation function and a C-terminal isomerase domain. The LbH domain contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), typical of enzymes with acyltransferase activity. Probab=99.24 E-value=1.8e-11 Score=92.22 Aligned_cols=54 Identities=24% Similarity=0.253 Sum_probs=49.1 Q ss_pred CCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCE Q ss_conf 579975655108889859988989956559889989988989866868916846 Q gi|255764481|r 4 MGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTK 57 (271) Q Consensus 4 i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~ 57 (271) |++++.|+|.++|+++++||+++.|+++|+|..+++||++|.|++++.+..+.. T Consensus 1 IG~~~~Ig~~~~I~~~~~IG~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~~~~~~~ 54 (119) T cd03358 1 IGDNCIIGTNVFIENDVKIGDNVKIQSNVSIYEGVTIEDDVFIGPNVVFTNDLY 54 (119) T ss_pred CCCCCEECCCCEECCCCEECCCEEECCCCCCCCCCEEEEEEEECCCCCCCCCCC T ss_conf 989989999989999999999969999863678999843779778863304446 No 75 >KOG1461 consensus Probab=99.22 E-value=1e-11 Score=93.74 Aligned_cols=100 Identities=20% Similarity=0.321 Sum_probs=57.2 Q ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98898995655988998998898986686891684686553114342232102333122222233222222212244321 Q gi|255764481|r 22 IGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTIN 101 (271) Q Consensus 22 Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~ 101 (271) -++++++.+-|.++.++.||.+|.|++++.|. |+.||.||.|+.+++|-+ ..|+++|+|+++|.|+ T Consensus 318 k~~dv~~~~~~~v~~~~~ig~gT~Ig~g~~I~-NSVIG~~c~IgsN~~I~~-------------S~iw~~v~Igdnc~I~ 383 (673) T KOG1461 318 KSPDVVLSHSVIVGANVVIGAGTKIGSGSKIS-NSVIGANCRIGSNVRIKN-------------SFIWNNVTIGDNCRID 383 (673) T ss_pred CCCCCEEHHHCCCCCEEEECCCCCCCCCCEEE-CCEECCCCEECCCEEEEE-------------EEEECCCEECCCCEEE T ss_conf 47401000120124407951454126798555-226658977457458851-------------3662583887886573 Q ss_pred CCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 111222222122332000016655433100123443223452 Q gi|255764481|r 102 RGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMI 143 (271) Q Consensus 102 ~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l 143 (271) +. .|++++.+.+++++-+.|++|.+|+++.+-.+ T Consensus 384 ~a--------ii~d~v~i~~~~~l~~g~vl~~~VVv~~~~~l 417 (673) T KOG1461 384 HA--------IICDDVKIGEGAILKPGSVLGFGVVVGRNFVL 417 (673) T ss_pred EE--------EEECCCEECCCCCCCCCCEEEEEEEECCCCCC T ss_conf 46--------86058376788440788577521686798312 No 76 >cd00208 LbetaH Left-handed parallel beta-Helix (LbetaH or LbH) domain: The alignment contains 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity, however, some subfamilies in this hierarchy also show activities related to ion transport or translation initiation. Many are trimeric in their active forms. Probab=99.18 E-value=7e-11 Score=88.54 Aligned_cols=34 Identities=38% Similarity=0.598 Sum_probs=14.5 Q ss_pred EECCCCEECCCCEECCCCEEEEEEECCCCCEEEE Q ss_conf 9988989866868916846865531143422321 Q gi|255764481|r 39 EIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGG 72 (271) Q Consensus 39 ~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~ 72 (271) .||++|+|+++++|.+++.||++|.|+|+++|+. T Consensus 2 ~IG~~~~I~~~~~I~~~v~IG~~~~I~~~~~I~~ 35 (78) T cd00208 2 FIGEGVKIHPKAVIRGPVVIGDNVNIGPGAVIGA 35 (78) T ss_pred EECCCCEECCCCEECCCEEECCCCEECCCCEEEE T ss_conf 9999849999849988269999998999989963 No 77 >TIGR03308 phn_thr-fam phosphonate metabolim protein, transferase hexapeptide repeat family. This family of proteins contains copies of the Bacterial transferase hexapeptide repeat family (pfam00132) and is only found in operons encoding the phosphonate C-P lyase system (GenProp0232). Many C-P lyase operons, however, lack a homolog of this protein. Probab=99.13 E-value=2.8e-10 Score=84.80 Aligned_cols=181 Identities=12% Similarity=0.145 Sum_probs=90.4 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCC Q ss_conf 50579975655108889859988989956559889989988989866868916846865531143422321023331222 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNF 81 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g 81 (271) +++|+++.|-+.+.+. ++.||+...|+++|.|. +++||..|.|.++|.|.+...--+....+++...+.. T Consensus 20 ~~lG~~v~Ig~~~~i~-ns~IGDySyI~~~~~I~-n~~IGkfcsIa~~v~I~~~nHp~~~~s~~~f~y~~~~-------- 89 (204) T TIGR03308 20 SKLGRYTEIGERTRLR-EVALGDYSYVMRDCDII-YTTIGKFCSIAAMVRINATNHPMERPTLHHFTYRAAM-------- 89 (204) T ss_pred CCCCCCEEECCCCEEE-CCEECCCCEECCCCEEE-CCEECCCCEECCCCEECCCCCCCCCCEECCEEEECCC-------- T ss_conf 7837833999995883-37999985888997598-8869998488898798987787667267777553753-------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCC Q ss_conf 22233222222212244321111222222122332000016655433100123443223452455321486065234443 Q gi|255764481|r 82 VGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAV 161 (271) Q Consensus 82 ~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v 161 (271) -.-..++...+.++ .-.|.+.||++|.++.++.+...++||++++||++++| T Consensus 90 --~~~~~~~~~~~~~~--------------------------~~~~~~~Ig~dvwiG~~~~i~~gv~IG~gavigagsvV 141 (204) T TIGR03308 90 --YFDDASDDADFFAW--------------------------RRAKRVTIGHDVWIGHGAVILPGVTIGNGAVIAAGAVV 141 (204) T ss_pred --CCCCCCCCHHHHCC--------------------------CCCCCEEECCCCEECCCCEECCCCEECCCCEECCCCEE T ss_conf --34666661223213--------------------------34787098997688799899199799999899779989 Q ss_pred CCCEEEECCCEEECC-CCCCCC-CCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 365064047435013-211223-3331000107453233442014442888999999999999997279989999999 Q gi|255764481|r 162 HQFTRIGKYAFIGGM-TGVVHD-VIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGA 237 (271) Q Consensus 162 ~~~v~IG~~a~ig~g-s~V~kd-Vpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~ 237 (271) -. .|..|++++|. |.+.|. -++- +.. + +.-+++=..+.|+|++--.. ||+ ++.+.-+++ T Consensus 142 tk--dv~~~~iv~G~PAk~ir~RF~~~-~i~----~-----L~~~~WWdw~~e~i~~~l~~----f~~-~~~~~f~~~ 202 (204) T TIGR03308 142 TK--DVAPYTIVAGVPAKLIRRRFPPE-IAA----R-----IEALAWWDWDHETLREALPD----FRD-LPAEAFLDK 202 (204) T ss_pred CC--CCCCCEEEEECCCEEEECCCCHH-HHH----H-----HHHCCCCCCCHHHHHHHHHH----HHC-CCHHHHHHH T ss_conf 95--17998499841885663109999-999----9-----98773648999999998998----872-899999874 No 78 >TIGR03532 DapD_Ac 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase. Alternate name: tetrahydrodipicolinate N-acetyltransferase. Note that IUBMB lists this alternate name as the accepted name. Unfortunately, the related succinyl transferase acting on the same substrate (EC:2.3.1.117, TIGR00695) uses the opposite standard. We have decided to give these two enzymes names which more clearly indicated that they act on the same substrate. Probab=99.11 E-value=2.8e-10 Score=84.75 Aligned_cols=72 Identities=22% Similarity=0.449 Sum_probs=45.0 Q ss_pred CCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCC--------CEEEECCCEEECCCCCCCCC Q ss_conf 2233200001665543310012344322345245532148606523444336--------50640474350132112233 Q gi|255764481|r 112 IVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQ--------FTRIGKYAFIGGMTGVVHDV 183 (271) Q Consensus 112 ~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~--------~v~IG~~a~ig~gs~V~kdV 183 (271) .||++..+|.++.|..++.||+++++..++.+++.++||++|.||+++++.. +|.||++++||++|+|...| T Consensus 100 ~IG~~aVImmga~INiGA~IGegtmId~~Avvg~~a~IGk~~HisaGa~laGVleP~~a~PViIeDnV~IGAnAvIl~GV 179 (231) T TIGR03532 100 IIGDNAVIMMGAVINIGAEIGEGTMIDMNAVLGGRATVGKNVHIGAGAVLAGVIEPPSAKPVVIEDNVLIGANAVILEGV 179 (231) T ss_pred EECCCEEEECCCEEECCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCCCCCCCCEEECCCEEECCCCEECCCC T ss_conf 98798399138887048598789458058564454688797058898643002257667997987885987997991998 No 79 >PRK09677 putative lipopolysaccharide biosynthesis O-acetyl transferase WbbJ; Provisional Probab=99.11 E-value=3.2e-10 Score=84.42 Aligned_cols=133 Identities=16% Similarity=0.161 Sum_probs=69.0 Q ss_pred CCEECCCCEECCCCEEC----CCCEECCCCEECCCCEEC--CCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 85998898995655988----998998898986686891--684686553114342232102333122222233222222 Q gi|255764481|r 19 GAVIGPNSLIGPFCCVG----SEVEIGAGVELISHCVVA--GKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKC 92 (271) Q Consensus 19 ~a~Ig~~v~Igp~~~I~----~~v~IG~~~~I~~~v~I~--g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~ 92 (271) .-++|+++.+|+.|.++ +...||+||.|++++.|. +..+||+||.|.+++.|.+.. |..........+.. T Consensus 43 ~I~~g~~~~~g~~~rl~~~~~gkl~IGdNv~Ig~~~~I~~~~~I~IG~nVlIa~~V~I~d~n----H~~~~~~~~~~~~~ 118 (192) T PRK09677 43 SINFGEGFTSGVGLRLDAFGRGKLFFGDNVQVNDYVHIACIESITIGRDTLIASKVFITDHN----HGSFKHSDDFSSPN 118 (192) T ss_pred CEEECCCEEECCCEEEEEECCCEEEECCCCEECCCCEEEECCCEEECCCCEECCCEEEECCC----CCCCCCCCCCCCCC T ss_conf 56878961966877999706750899998489998589867767989986887988996788----87655545434567 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCE Q ss_conf 21224432111122222212233200001665543310012344322345245532148606523444336506404743 Q gi|255764481|r 93 VIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAF 172 (271) Q Consensus 93 ~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ 172 (271) .. ...++. ...+ +.|||+|.++.++.+...++||+++++|++++|.. .+-++++ T Consensus 119 ~~----~~~~~~--~~~p------------------v~Ig~~~wIg~~~~Il~Gv~Ig~~~vvgA~SvV~~--dvp~~~i 172 (192) T PRK09677 119 LP----PDMRTL--ESSA------------------VVIGQRVWIGENVTILPGVSIGNGCIVGANSVVTK--SIPENTV 172 (192) T ss_pred CC----CCCCCC--CCCC------------------EEECCCCEECCCCEEECCCEECCCCEECCCCEECC--CCCCCEE T ss_conf 68----212667--7599------------------89899989999999919969999989916999985--8599919 Q ss_pred EECC-CCCCC Q ss_conf 5013-21122 Q gi|255764481|r 173 IGGM-TGVVH 181 (271) Q Consensus 173 ig~g-s~V~k 181 (271) +++. |.+.| T Consensus 173 ~~G~PAk~Ik 182 (192) T PRK09677 173 IAGNPAKIIK 182 (192) T ss_pred EEEECCEEEE T ss_conf 9970838897 No 80 >cd05635 LbH_unknown Uncharacterized proteins, Left-handed parallel beta-Helix (LbH) domain: Members in this group are uncharacterized bacterial proteins containing a LbH domain with multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.08 E-value=3.5e-10 Score=84.20 Aligned_cols=54 Identities=30% Similarity=0.466 Sum_probs=44.8 Q ss_pred CCEEE---CCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECC Q ss_conf 51088---8985998898995655988998998898986686891684686553114 Q gi|255764481|r 12 PLALV---EEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVF 65 (271) Q Consensus 12 psA~I---~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~ 65 (271) |.+++ +..+.|+++++|.|||+|+..+.||++|.|+|++.|++++.||++|+|+ T Consensus 1 p~~~~d~~~GpV~I~~~~~I~~~~~i~Gp~~IG~~~~I~p~a~Ir~~t~Ig~~~~IG 57 (101) T cd05635 1 PGAVLDAEDGPIYIGKDAVIEPFAVIEGPVYIGPGSRVKMGARIYGNTTIGPTCKIG 57 (101) T ss_pred CCCEEECCCCCEEECCCCEECCCCEEECCEEECCCCEECCCCEEECCCEECCCCEEC T ss_conf 971787898999987999999999997666999999999997863887999997761 No 81 >cd05824 LbH_M1P_guanylylT_C Mannose-1-phosphate guanylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Mannose-1-phosphate guanylyltransferase is also known as GDP-mannose pyrophosphorylase. It catalyzes the synthesis of GDP-mannose from GTP and mannose-1-phosphate, and is involved in the maintenance of cell wall integrity and glycosylation. Similar to ADP-glucose pyrophosphorylase, it contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain, presumably with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.07 E-value=3.9e-10 Score=83.83 Aligned_cols=48 Identities=23% Similarity=0.271 Sum_probs=19.8 Q ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEE Q ss_conf 9889899565598899899889898668689168468655311434223 Q gi|255764481|r 22 IGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVL 70 (271) Q Consensus 22 Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~I 70 (271) |++.++||+.|.|+++|.||+||+|+++|.|. ++.|.+++.|.+++.| T Consensus 2 Idpsa~Ig~~~~IGp~vvIG~nv~IG~gv~I~-~~vI~~~~~I~~~s~i 49 (80) T cd05824 2 IDPSAKIGKTAKIGPNVVIGPNVTIGDGVRLQ-RCVILSNSTVRDHSWV 49 (80) T ss_pred CCCCCEECCCCEECCCCEECCCCEECCCCEEE-EEEEECCCEECCCCEE T ss_conf 37878989999999985899999999997997-3089578999898699 No 82 >PRK11830 dapD 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional Probab=99.05 E-value=2.1e-10 Score=85.55 Aligned_cols=81 Identities=23% Similarity=0.269 Sum_probs=67.9 Q ss_pred CCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCC--------CCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCC-- Q ss_conf 2233200001665543310012344322345245--------532148606523444336506404743501321122-- Q gi|255764481|r 112 IVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAG--------HVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVH-- 181 (271) Q Consensus 112 ~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G--------~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~k-- 181 (271) .||+.+++...++||--++||.||.|+.++.++| .+.|+|+|+||+++.|..++.|+++|.+++|..+++ T Consensus 127 yv~~gtmiDtwatvGScAqiGknvhls~g~gigGvleP~~a~p~iied~~fiGa~~~v~eGv~v~~~avl~~gv~l~~st 206 (265) T PRK11830 127 YVDEGTMVDTWATVGSCAQIGKNVHLSGGVGIGGVLEPLQANPVIIEDNCFIGARSEVVEGVIVEEGAVIGMGVFLGQST 206 (265) T ss_pred EECCCCEEECCCCEECHHEECCCEEECCCCEEEEEECCCCCCCEEEECCCEECCCCEEEEEEEEECCCEECCEEEECCCC T ss_conf 98489548047624115323783368787414265247667987983682873574796139994586972104760671 Q ss_pred -------------CCCCCCHH-HHC Q ss_conf -------------33331000-107 Q gi|255764481|r 182 -------------DVIPYGIL-NGN 192 (271) Q Consensus 182 -------------dVpp~~~~-~G~ 192 (271) +||||+++ .|+ T Consensus 207 ~I~D~~~~ev~~g~vP~~svvv~Gs 231 (265) T PRK11830 207 KIYDRETGEVHYGRVPAGSVVVPGS 231 (265) T ss_pred EEEECCCCCEEECCCCCCCEEECCC T ss_conf 6787247828743479998895574 No 83 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=99.05 E-value=6.9e-10 Score=82.31 Aligned_cols=57 Identities=28% Similarity=0.360 Sum_probs=34.5 Q ss_pred EEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEE Q ss_conf 0888985998898995655988998998898986686891684686553114342232 Q gi|255764481|r 14 ALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLG 71 (271) Q Consensus 14 A~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig 71 (271) +.+.+.+.|++++.|++.+.|++++.||+||+|+.++.|. ++.|.++|.|++++.|+ T Consensus 256 ~~i~gp~~ig~~~~i~~~~~i~~~~~ig~~~~I~~~~~i~-~Sii~~~~~i~~~~~i~ 312 (358) T COG1208 256 AYIIGPVVIGPGAKIGPGALIGPYTVIGEGVTIGNGVEIK-NSIIMDNVVIGHGSYIG 312 (358) T ss_pred EEEECCEEECCCCEECCCCEECCCCEECCCCEECCCCEEE-EEEEECCCEECCCCEEC T ss_conf 3885894982697898884868987899998889987787-54887787988888882 No 84 >KOG1461 consensus Probab=99.05 E-value=4.2e-10 Score=83.64 Aligned_cols=104 Identities=22% Similarity=0.253 Sum_probs=77.0 Q ss_pred CCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCC Q ss_conf 99756551088898599889899565598899899889898668689168468655311434223210233312222223 Q gi|255764481|r 6 NNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTE 85 (271) Q Consensus 6 ~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~ 85 (271) .+..+.+++.+..+..||.+.+||..+.| .|.+||.||.|+++++|. ++.|++||+|+.+|.|. . T Consensus 320 ~dv~~~~~~~v~~~~~ig~gT~Ig~g~~I-~NSVIG~~c~IgsN~~I~-~S~iw~~v~Igdnc~I~-------------~ 384 (673) T KOG1461 320 PDVVLSHSVIVGANVVIGAGTKIGSGSKI-SNSVIGANCRIGSNVRIK-NSFIWNNVTIGDNCRID-------------H 384 (673) T ss_pred CCCEEHHHCCCCCEEEECCCCCCCCCCEE-ECCEECCCCEECCCEEEE-EEEEECCCEECCCCEEE-------------E T ss_conf 40100012012440795145412679855-522665897745745885-13662583887886573-------------4 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCC Q ss_conf 3222222212244321111222222122332000016655 Q gi|255764481|r 86 LLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHV 125 (271) Q Consensus 86 v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI 125 (271) ..|+|++.|++++.+.+|...+ ...++|++-.+-.|+.| T Consensus 385 aii~d~v~i~~~~~l~~g~vl~-~~VVv~~~~~l~~ns~~ 423 (673) T KOG1461 385 AIICDDVKIGEGAILKPGSVLG-FGVVVGRNFVLPKNSKV 423 (673) T ss_pred EEEECCCEECCCCCCCCCCEEE-EEEEECCCCCCCCCCCC T ss_conf 6860583767884407885775-21686798312454433 No 85 >PRK05293 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=99.04 E-value=7.2e-10 Score=82.20 Aligned_cols=12 Identities=25% Similarity=0.393 Sum_probs=3.9 Q ss_pred CCCEEECCCCEE Q ss_conf 551088898599 Q gi|255764481|r 11 HPLALVEEGAVI 22 (271) Q Consensus 11 hpsA~I~~~a~I 22 (271) .|.+++.++|+| T Consensus 280 ~~p~~i~~~~~I 291 (381) T PRK05293 280 LPPQYIAENAKV 291 (381) T ss_pred CCCEEECCCCEE T ss_conf 899298799789 No 86 >PRK10502 putative colanic acid biosynthesis acetyltransferase WcaF; Provisional Probab=99.01 E-value=1.1e-09 Score=81.01 Aligned_cols=55 Identities=20% Similarity=0.311 Sum_probs=45.6 Q ss_pred CCCCCCCEECCCEEEC--CCCEECCCCEECCCCEEC--CCCEECCCCEECCCCEECCCC Q ss_conf 5057997565510888--985998898995655988--998998898986686891684 Q gi|255764481|r 2 SRMGNNPIIHPLALVE--EGAVIGPNSLIGPFCCVG--SEVEIGAGVELISHCVVAGKT 56 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~--~~a~Ig~~v~Igp~~~I~--~~v~IG~~~~I~~~v~I~g~t 56 (271) ++|++++.|+|++.|. -..+||+++.||++|.|. +.++||++|.|.+++.|...+ T Consensus 53 AkIG~~v~I~p~v~I~~PwnL~IGd~~~Ig~~v~i~~~~~I~IG~~v~Isq~v~l~t~s 111 (179) T PRK10502 53 AKIGKGVVIRPSVRITYPWKLTIGDYAWIGDDVVLYTLGEITIGAHCVISQKSYLCTGS 111 (179) T ss_pred CEECCCCEECCCCEEEECCCEEECCCCEECCCCEEEECCCCEECCCEEECCCCEEECCC T ss_conf 86799988999657950430798896578898189848961798841277971896388 No 87 >PRK05293 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=99.00 E-value=1.7e-09 Score=79.83 Aligned_cols=16 Identities=6% Similarity=-0.093 Sum_probs=6.3 Q ss_pred CCEEEEEEECCCCCEE Q ss_conf 8468655311434223 Q gi|255764481|r 55 KTKIGDFTKVFPMAVL 70 (271) Q Consensus 55 ~t~IG~~~~I~~~a~I 70 (271) ++.||++|.|+++|.| T Consensus 308 nSvIg~~v~Ig~ga~I 323 (381) T PRK05293 308 HSVLFQGVQVGEGSIV 323 (381) T ss_pred CCEECCCCEECCCCEE T ss_conf 4898899899999999 No 88 >KOG4042 consensus Probab=98.98 E-value=4.2e-10 Score=83.69 Aligned_cols=164 Identities=18% Similarity=0.285 Sum_probs=85.6 Q ss_pred CEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEC---CCCEEEEEEECCCCCEEEECCCC-CCCCCCC Q ss_conf 7565510888985998898995655988998998898986686891---68468655311434223210233-3122222 Q gi|255764481|r 8 PIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVA---GKTKIGDFTKVFPMAVLGGDTQS-KYHNFVG 83 (271) Q Consensus 8 ~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~---g~t~IG~~~~I~~~a~Ig~~~qd-~~~~g~~ 83 (271) ..|-|+|++--.+ -|.++|.|+++|+++|.+++. |...||+||.|..+|+|-..-.+ .-+..+. T Consensus 9 vkIap~AvVCvEs------------~irGdvti~~gcVvHP~a~~iA~aGPI~iGEnniiEEyA~i~n~~~~~~~~d~~~ 76 (190) T KOG4042 9 VKIAPSAVVCVES------------DIRGDVTIKEGCVVHPFAVFIATAGPIYIGENNIIEEYAVIRNRLEPGAVWDSDG 76 (190) T ss_pred EEECCCEEEEEEC------------CCCCCEEECCCCEECCEEEEECCCCCEEECCCCHHHHHHHHHHHCCCCCCCCCCC T ss_conf 4645751899860------------2133148658838533079970569889866712566788770068887547899 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCC Q ss_conf 23322222221224432111122222212233200001665543310012344322345245532148606523444336 Q gi|255764481|r 84 TELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQ 163 (271) Q Consensus 84 ~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~ 163 (271) ..+.||++|+|.-+|... .-++||+|.|...+++ |++| T Consensus 77 ~pmiIGt~NvFeVgc~s~--------A~kvGd~NVieskayv------g~gv---------------------------- 114 (190) T KOG4042 77 QPMIIGTWNVFEVGCKSS--------AKKVGDRNVIESKAYV------GDGV---------------------------- 114 (190) T ss_pred CEEEEECCCEEEEECHHH--------HHHHCCCCEEEEEEEE------CCCC---------------------------- T ss_conf 767872363588612334--------5551676547666686------4994---------------------------- Q ss_pred CEEEECCCEEECCCCC--CCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHH---HHHHHHCCCCH Q ss_conf 5064047435013211--22333310001074532334420144428889999999999---99997279989 Q gi|255764481|r 164 FTRIGKYAFIGGMTGV--VHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAV---YKQIFQQGDSI 231 (271) Q Consensus 164 ~v~IG~~a~ig~gs~V--~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~a---yr~lfr~~~~l 231 (271) .+-++|.|||+..+ .+.+|+...+.|.. +|-++-..|.+-..-+|..|++. |--||+..... T Consensus 115 --~vssgC~vGA~c~v~~~q~lpent~vYga~----~L~R~~~~~~~~qtlQidFLrKiLPnYHHL~k~~~~~ 181 (190) T KOG4042 115 --SVSSGCSVGAKCTVFSHQNLPENTSVYGAT----NLSRTTKTPNMTQTLQIDFLRKILPNYHHLYKKKKAV 181 (190) T ss_pred --EECCCCEECCCEEEECCCCCCCCCEEECCC----CCCCEECCCCCCCCCHHHHHHHHCCCHHHHHCCCCCC T ss_conf --872796541405771103469751687213----4010002799986220789998761145552565553 No 89 >cd03354 LbH_SAT Serine acetyltransferase (SAT): SAT catalyzes the CoA-dependent acetylation of the side chain hydroxyl group of L-serine to form O-acetylserine, as the first step of a two-step biosynthetic pathway in bacteria and plants leading to the formation of L-cysteine. This reaction represents a key metabolic point of regulation for the cysteine biosynthetic pathway due to its feedback inhibition by cysteine. The enzyme is a 175 kDa homohexamer, composed of a dimer of homotrimers. Each subunit contains an N-terminal alpha helical region and a C-terminal left-handed beta-helix (LbH) subdomain with 5 turns, each containing a hexapeptide repeat motif characteristic of the acyltransferase superfamily of enzymes. The trimer interface mainly involves the C-terminal LbH subdomain while the dimer (of trimers) interface is mediated by the N-terminal alpha helical subdomain. Probab=98.98 E-value=1.8e-09 Score=79.72 Aligned_cols=43 Identities=21% Similarity=0.485 Sum_probs=23.2 Q ss_pred CCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEEC Q ss_conf 012344322345245532148606523444336506404743501 Q gi|255764481|r 131 LGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGG 175 (271) Q Consensus 131 IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~ 175 (271) ||++|+|+.++.+.+.++|+|+++||+++.|.. .+-+++++.+ T Consensus 57 Ig~~v~IG~~~~I~~gv~IG~~~vIgagsvV~~--dv~~~si~~G 99 (101) T cd03354 57 IGDNVVIGAGAKILGNITIGDNVKIGANAVVTK--DVPANSTVVG 99 (101) T ss_pred CCCCEEECCCCEEECCCEECCCCEECCCCEECC--CCCCCCEEEE T ss_conf 487586358988929879999879987988996--8599989980 No 90 >cd04647 LbH_MAT_like Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins. MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively while GAT specifically acetylates galactopyranosides. XAT catalyzes the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Members of this family contain a a left-handed parallel beta-helix (LbH) domain with at least 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). They are trimeric in their active form. Probab=98.97 E-value=1.2e-09 Score=80.73 Aligned_cols=46 Identities=26% Similarity=0.446 Sum_probs=27.3 Q ss_pred CCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEEC Q ss_conf 310012344322345245532148606523444336506404743501 Q gi|255764481|r 128 DCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGG 175 (271) Q Consensus 128 d~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~ 175 (271) .+.||++|.++.++.+...++|+++++||+++.|... +-+++++.+ T Consensus 58 ~v~Ig~~v~IG~~~~I~~gv~Ig~~~iIga~SvV~k~--i~~~~i~~G 103 (109) T cd04647 58 PIVIGDDVWIGANVVILPGVTIGDGAVVGAGSVVTKD--VPPNSIVAG 103 (109) T ss_pred CEEEECCCEECCCEEEEECCEECCCCEECCCCEECCC--CCCCCEEEE T ss_conf 9799423578898399285130898699379889148--399989997 No 91 >KOG4042 consensus Probab=98.95 E-value=8.1e-10 Score=81.87 Aligned_cols=76 Identities=24% Similarity=0.353 Sum_probs=49.5 Q ss_pred ECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC----CCC--CCCCCCCC Q ss_conf 988989866868916846865531143422321023331222222332222222122443211----112--22222122 Q gi|255764481|r 40 IGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINR----GTV--EYGGKTIV 113 (271) Q Consensus 40 IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~----gt~--~~~~~T~I 113 (271) |-...++---+.|+|+++|+++|.++|++++-.+ .....||++|.|.|+++|-. |.. .++..-.| T Consensus 11 Iap~AvVCvEs~irGdvti~~gcVvHP~a~~iA~---------aGPI~iGEnniiEEyA~i~n~~~~~~~~d~~~~pmiI 81 (190) T KOG4042 11 IAPSAVVCVESDIRGDVTIKEGCVVHPFAVFIAT---------AGPIYIGENNIIEEYAVIRNRLEPGAVWDSDGQPMII 81 (190) T ss_pred ECCCEEEEEECCCCCCEEECCCCEECCEEEEECC---------CCCEEECCCCHHHHHHHHHHHCCCCCCCCCCCCEEEE T ss_conf 4575189986021331486588385330799705---------6988986671256678877006888754789976787 Q ss_pred CCCCHHCCCCC Q ss_conf 33200001665 Q gi|255764481|r 114 GDNNFFLANSH 124 (271) Q Consensus 114 G~~~~i~~~~h 124 (271) |.+|.|..+++ T Consensus 82 Gt~NvFeVgc~ 92 (190) T KOG4042 82 GTWNVFEVGCK 92 (190) T ss_pred ECCCEEEEECH T ss_conf 23635886123 No 92 >KOG3121 consensus Probab=98.95 E-value=1e-09 Score=81.22 Aligned_cols=139 Identities=17% Similarity=0.282 Sum_probs=84.2 Q ss_pred CCCEECCCCEECCCCEECC---CCEEEEEEECCCCCEEEECCCCCCCC-C-CCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9989988989866868916---84686553114342232102333122-2-22233222222212244321111222222 Q gi|255764481|r 36 SEVEIGAGVELISHCVVAG---KTKIGDFTKVFPMAVLGGDTQSKYHN-F-VGTELLVGKKCVIREGVTINRGTVEYGGK 110 (271) Q Consensus 36 ~~v~IG~~~~I~~~v~I~g---~t~IG~~~~I~~~a~Ig~~~qd~~~~-g-~~~~v~IG~~~~Ire~v~I~~gt~~~~~~ 110 (271) .|..|.--+.+.+.++|+| ++++|+.|.+...++|... -..|. + ..-.+.|||++.|.|.|.++.++ T Consensus 32 QNI~lnGKtIv~~g~iIRGDLAnVr~GryCV~ksrsvIRPp--~K~FSKg~affp~hiGdhVFieE~cVVnAAq------ 103 (184) T KOG3121 32 QNILLNGKTIVEEGVIIRGDLANVRIGRYCVLKSRSVIRPP--MKIFSKGPAFFPVHIGDHVFIEEECVVNAAQ------ 103 (184) T ss_pred CEEEECCCEEEEECCEEECCCCCCEECCEEEECCCCCCCCC--HHHHCCCCEEEEEEECCEEEEECCEEEEHHH------ T ss_conf 32798372798608589512010247336896466543770--6775678624566524568870441862666------ Q ss_pred CCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHH Q ss_conf 12233200001665543310012344322345245532148606523444336506404743501321122333310001 Q gi|255764481|r 111 TIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILN 190 (271) Q Consensus 111 T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~ 190 (271) | ++|+|+|.+|.||++|++-.. +.|.|++++-..+. +|||+... T Consensus 104 --I------gsyVh~GknaviGrrCVlkdC------c~ild~tVlPpet~----------------------vppy~~~~ 147 (184) T KOG3121 104 --I------GSYVHLGKNAVIGRRCVLKDC------CRILDDTVLPPETL----------------------VPPYSTIG 147 (184) T ss_pred --H------EEEEEECCCEEECCCEEHHHH------EECCCCCCCCCCCC----------------------CCCCEEEC T ss_conf --1------203685464067375083341------04158952582013----------------------69716773 Q ss_pred HCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHH Q ss_conf 074532334420144428889999999999999 Q gi|255764481|r 191 GNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQ 223 (271) Q Consensus 191 G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~ 223 (271) |+|+...| ...+-...+-|...+.-|.. T Consensus 148 g~p~~~~G-----~~P~ctq~lMi~~tksyY~~ 175 (184) T KOG3121 148 GNPAQVVG-----TEPRCTQNLMIEATKSYYDN 175 (184) T ss_pred CCCCEEEC-----CCCHHHHHHHHHHHHHHHHH T ss_conf 88723531-----57302689999999999983 No 93 >TIGR02287 PaaY phenylacetic acid degradation protein PaaY; InterPro: IPR011974 Members of this family are located next to other genes organised into apparent operons for phenylacetic acid degradation . PaaY is located near the end of these gene clusters and often next to PaaX, a transcriptional regulator.. Probab=98.94 E-value=2.4e-09 Score=78.93 Aligned_cols=130 Identities=18% Similarity=0.375 Sum_probs=80.1 Q ss_pred CCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCC Q ss_conf 57997565510888985998898995655988998998898986686891684686553114342232102333122222 Q gi|255764481|r 4 MGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVG 83 (271) Q Consensus 4 i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~ 83 (271) ...++++||+|+.=.++.||+.|+|||.+-+.+|- |...+-++..|...|+.-+- .+.. T Consensus 11 Vhp~ayVHP~AVLiGDVi~G~rcYvgPlASLRGDF---------------GrIvl~eGAN~QD~CVMHGF------Pg~D 69 (193) T TIGR02287 11 VHPEAYVHPDAVLIGDVIVGKRCYVGPLASLRGDF---------------GRIVLKEGANVQDTCVMHGF------PGQD 69 (193) T ss_pred CCCCCCCCCCEEEEEEEEECCCCEECCCCCCCCCC---------------CCEEEECCCCCCCCCEECCC------CCCC T ss_conf 69877346863786217887952467663135775---------------72575348861464243672------9988 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCC Q ss_conf 23322222221224432111122222212233200001665543310012344322345245532148606523444336 Q gi|255764481|r 84 TELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQ 163 (271) Q Consensus 84 ~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~ 163 (271) | .+++|= .||... |=|.|.||.|..++-|+.+--...|++++++++.+.|.. T Consensus 70 T--vVeenG-------------------HvGHgA-------iLHgC~vgrnaLvGMNAVVMDgAvige~sIVaA~aFVKA 121 (193) T TIGR02287 70 T--VVEENG-------------------HVGHGA-------ILHGCRVGRNALVGMNAVVMDGAVIGERSIVAASAFVKA 121 (193) T ss_pred E--EEECCC-------------------CCCCCE-------EECCCEECCCCEECCCEEEECCCEECCCCCEEHHHHHHC T ss_conf 6--760688-------------------233212-------351546325231116506616646677412002033434 Q ss_pred CEEEECCCEEECC-CCCCCC Q ss_conf 5064047435013-211223 Q gi|255764481|r 164 FTRIGKYAFIGGM-TGVVHD 182 (271) Q Consensus 164 ~v~IG~~a~ig~g-s~V~kd 182 (271) ...+-...+|.|- |.|.+. T Consensus 122 g~E~paq~Lv~GsPAkv~R~ 141 (193) T TIGR02287 122 GAEIPAQVLVVGSPAKVVRE 141 (193) T ss_pred CCCCCCCCEEECCCCEEEEC T ss_conf 73478461266287305533 No 94 >KOG1460 consensus Probab=98.94 E-value=1.6e-09 Score=80.01 Aligned_cols=69 Identities=32% Similarity=0.347 Sum_probs=48.5 Q ss_pred CCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEEC-----CCCEECCCCEECCCCEEEEEEECCCCCEEEEC Q ss_conf 57997565510888985998898995655988998998-----89898668689168468655311434223210 Q gi|255764481|r 4 MGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIG-----AGVELISHCVVAGKTKIGDFTKVFPMAVLGGD 73 (271) Q Consensus 4 i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG-----~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~ 73 (271) |=-+.+|||||.+||+|+||+||.||+.+.|+++|+|- +++.|..|+++. ++.||-.|.|+.-+.+.+. T Consensus 285 IigdVyIhPsakvhptAkiGPNVSIga~vrvg~GvRl~~sIIl~d~ei~enavVl-~sIigw~s~iGrWaRVe~~ 358 (407) T KOG1460 285 IIGDVYIHPSAKVHPTAKIGPNVSIGANVRVGPGVRLRESIILDDAEIEENAVVL-HSIIGWKSSIGRWARVEGI 358 (407) T ss_pred EEEEEEECCCCEECCCCCCCCCCEECCCCEECCCCEEEEEEECCCCEEECCCEEE-EEEECCCCCCCCEEEECCC T ss_conf 8742477675404776620897144688254688266545602673762163677-6541144322542430146 No 95 >cd05635 LbH_unknown Uncharacterized proteins, Left-handed parallel beta-Helix (LbH) domain: Members in this group are uncharacterized bacterial proteins containing a LbH domain with multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=98.93 E-value=3.8e-09 Score=77.67 Aligned_cols=85 Identities=24% Similarity=0.348 Sum_probs=55.3 Q ss_pred CCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 89989988989866868916846865531143422321023331222222332222222122443211112222221223 Q gi|255764481|r 35 GSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVG 114 (271) Q Consensus 35 ~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG 114 (271) ...|.|+++++|.|+++|++++.||++|.|.|+|.|.+ .+.||++|.|+. - | T Consensus 9 ~GpV~I~~~~~I~~~~~i~Gp~~IG~~~~I~p~a~Ir~------------~t~Ig~~~~IG~--E-------------i- 60 (101) T cd05635 9 DGPIYIGKDAVIEPFAVIEGPVYIGPGSRVKMGARIYG------------NTTIGPTCKIGG--E-------------V- 60 (101) T ss_pred CCCEEECCCCEECCCCEEECCEEECCCCEECCCCEEEC------------CCEECCCCEECC--E-------------E- T ss_conf 89999879999999999976669999999999978638------------879999977614--0-------------8- Q ss_pred CCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCC Q ss_conf 3200001665543310012344322345245532148606523444 Q gi|255764481|r 115 DNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSA 160 (271) Q Consensus 115 ~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~ 160 (271) .++.|+.++++.|...||| ..||++|.||+++. T Consensus 61 k~S~i~~~s~~~H~~YlGD-------------S~iG~~vNiGAGT~ 93 (101) T cd05635 61 EDSIIEGYSNKQHDGFLGH-------------SYLGSWCNLGAGTN 93 (101) T ss_pred EEEEEECCCCCCCCCEEEE-------------EEECCCCEECCCCE T ss_conf 7618857875687758761-------------17838757877857 No 96 >PRK10092 maltose O-acetyltransferase; Provisional Probab=98.91 E-value=2.2e-09 Score=79.16 Aligned_cols=113 Identities=20% Similarity=0.307 Sum_probs=64.5 Q ss_pred CCEECC--CCEECCCCEECCCCEECCCCEEC--CCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 989956--55988998998898986686891--68468655311434223210233312222223322222221224432 Q gi|255764481|r 25 NSLIGP--FCCVGSEVEIGAGVELISHCVVA--GKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTI 100 (271) Q Consensus 25 ~v~Igp--~~~I~~~v~IG~~~~I~~~v~I~--g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I 100 (271) ++.|.| +|--|-|..||+++.+..+++|. +.++||+++.|+|++.|-.....+... T Consensus 59 ~~~I~ppF~cdYG~ni~iG~~~~iN~~~~ilD~~~I~IG~~v~iGpnv~i~t~~Hp~d~~-------------------- 118 (183) T PRK10092 59 EAYIEPTFRCDYGYNIFLGNNFYANFDCVMLDVCPIHIGDNCMLAPGVHIYTATHPLDPV-------------------- 118 (183) T ss_pred CCEEECCEEEECCCCEEECCCEEEECCEEEEECCCEEECCCEEECCCCEEECCCCCCCHH-------------------- T ss_conf 837958888406778786787699278499947873999980584997898599988988-------------------- Q ss_pred CCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECC Q ss_conf 1111222222122332000016655433100123443223452455321486065234443365064047435013 Q gi|255764481|r 101 NRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGM 176 (271) Q Consensus 101 ~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~g 176 (271) .+- .+...+.-..||++|+++.++.+...|+|||+++||++++|... |-+++++++. T Consensus 119 ----------~R~-------~~~~~~~pi~Ig~~vwig~~~~i~pGv~IG~~~vigagsvV~~d--ip~~~v~~G~ 175 (183) T PRK10092 119 ----------ARN-------SGAELGKPVTIGNNVWIGGRAVINPGVTIGDNVVVASGAVVTKD--VPDNVVVGGN 175 (183) T ss_pred ----------HHH-------CCCEECCCEEECCCCEECCCCEEECCCEECCCCEECCCCEEEEE--CCCCCEEEEE T ss_conf ----------952-------58631687798998688899899079899999899759889227--7999299974 No 97 >cd05787 LbH_eIF2B_epsilon eIF-2B epsilon subunit, central Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B epsilon subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold, a central LbH domain containing 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal domain of unknown function that is present in eIF-4 gamma, eIF-5, and eIF-2B epsilon. The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=98.90 E-value=7e-09 Score=76.00 Aligned_cols=47 Identities=19% Similarity=0.220 Sum_probs=20.2 Q ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEE Q ss_conf 9889899565598899899889898668689168468655311434223 Q gi|255764481|r 22 IGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVL 70 (271) Q Consensus 22 Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~I 70 (271) ||++++|++++.| ++..||++|+|++++.|. ++.|.+++.|++++.| T Consensus 2 IG~~t~Ig~~~~I-~~svIG~nc~Ig~~~~I~-~svi~d~~~Ig~~~~i 48 (79) T cd05787 2 IGRGTSIGEGTTI-KNSVIGRNCKIGKNVVID-NSYIWDDVTIEDGCTI 48 (79) T ss_pred CCCCCEECCCCEE-ECCEECCCCEECCCCEEE-CCEEECCCEECCCCEE T ss_conf 9998999999999-599989999999998894-8999197899999999 No 98 >PRK00844 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.88 E-value=6.3e-09 Score=76.30 Aligned_cols=55 Identities=16% Similarity=0.228 Sum_probs=29.3 Q ss_pred CCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCC Q ss_conf 310012344322345245532148606523444336506404743501321122333 Q gi|255764481|r 128 DCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVI 184 (271) Q Consensus 128 d~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVp 184 (271) +++||.+|.+++++.+. ++.|.+++.||.++.|. .+-|++++.||.|++|..|.- T Consensus 331 nSvIg~~v~Ig~ga~I~-nSII~~~~~Ig~~a~I~-~sIidk~v~Ig~gt~I~~~~~ 385 (409) T PRK00844 331 NSVLSPNVRVDSGAIVE-GSVLMDGVRIGRGAVVR-RAILDKNVVVPPGTQIGVDLE 385 (409) T ss_pred CCEECCCCEECCCCEEE-CCEEECCCEECCCCEEE-EEEECCCCEECCCCEECCCCH T ss_conf 88977998999998995-18992949999999998-019899989899989897804 No 99 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=98.88 E-value=3.8e-09 Score=77.68 Aligned_cols=105 Identities=20% Similarity=0.241 Sum_probs=63.4 Q ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 95655988998998898986686891684686553114342232102333122222233222222212244321111222 Q gi|255764481|r 28 IGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEY 107 (271) Q Consensus 28 Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~ 107 (271) ++. ++|.+.+.||++|.|++++.|.+++.||++|.|++.+.|.. ..|.+++.|..++.+ T Consensus 253 ~~~-~~i~gp~~ig~~~~i~~~~~i~~~~~ig~~~~I~~~~~i~~-------------Sii~~~~~i~~~~~i------- 311 (358) T COG1208 253 IRS-AYIIGPVVIGPGAKIGPGALIGPYTVIGEGVTIGNGVEIKN-------------SIIMDNVVIGHGSYI------- 311 (358) T ss_pred CCC-EEEECCEEECCCCEECCCCEECCCCEECCCCEECCCCEEEE-------------EEEECCCEECCCCEE------- T ss_conf 662-38858949826978988848689878999988899877875-------------488778798888888------- Q ss_pred CCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCC Q ss_conf 222122332000016655433100123443223452455321486065234443365064047435013211 Q gi|255764481|r 108 GGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGV 179 (271) Q Consensus 108 ~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V 179 (271) . +++||++|.++++ + .++| +.+|.++.+.+.++++..+.+..+..+ T Consensus 312 ------------------~-~sIi~~~~~ig~~--~----~i~d-~~~g~~~~i~~g~~~~~~~~~~~~~~~ 357 (358) T COG1208 312 ------------------G-DSIIGENCKIGAS--L----IIGD-VVIGINSEILPGVVVGPGSVVESGEIE 357 (358) T ss_pred ------------------C-CEEECCCCEECCC--C----EEEE-EEECCCEEECCCEEECCCCCCCCCCCC T ss_conf ------------------2-0698188599986--1----5701-586366086276184786061576303 No 100 >cd05787 LbH_eIF2B_epsilon eIF-2B epsilon subunit, central Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B epsilon subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold, a central LbH domain containing 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal domain of unknown function that is present in eIF-4 gamma, eIF-5, and eIF-2B epsilon. The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=98.83 E-value=8.4e-09 Score=75.53 Aligned_cols=64 Identities=19% Similarity=0.255 Sum_probs=29.3 Q ss_pred CCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEE Q ss_conf 5799756551088898599889899565598899899889898668689168468655311434223 Q gi|255764481|r 4 MGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVL 70 (271) Q Consensus 4 i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~I 70 (271) |++++.|.+.+.|. ...||+||+||+.|.| .+..|.++++|++++.|. ++.|++++.|++++.| T Consensus 2 IG~~t~Ig~~~~I~-~svIG~nc~Ig~~~~I-~~svi~d~~~Ig~~~~i~-~siI~~~~~Ig~~~~i 65 (79) T cd05787 2 IGRGTSIGEGTTIK-NSVIGRNCKIGKNVVI-DNSYIWDDVTIEDGCTIH-HSIVADGAVIGKGCTI 65 (79) T ss_pred CCCCCEECCCCEEE-CCEECCCCEECCCCEE-ECCEEECCCEECCCCEEC-CCEECCCCEECCCCEE T ss_conf 99989999999995-9998999999999889-489991978999999990-8999798999999999 No 101 >TIGR01172 cysE serine O-acetyltransferase; InterPro: IPR005881 The biosynthesis of L-cysteine is the predominant way by which inorganic sulphur is incorporated into organic compounds. In this process, the most abundant utilizable source of sulphur, inorganic sulphate, is taken up and reduced to sulphide. Sulphide is used to produce L-cysteine, which serves for protein synthesis or the production of other sulphur-containing organic compounds. Two routes for cysteine biosynthesis in nature have been documented. Serine transacetylase (2.3.1.30 from EC) catalyzes steps in pathway I, the activation of L-serine by acetyl-coenzyme A, yielding O-acetyl-L-serine.; GO: 0009001 serine O-acetyltransferase activity, 0006535 cysteine biosynthetic process from serine, 0005737 cytoplasm. Probab=98.80 E-value=6.8e-09 Score=76.12 Aligned_cols=71 Identities=24% Similarity=0.399 Sum_probs=51.8 Q ss_pred CCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCC Q ss_conf 86891684686553114342232102333122222233222222212244321111222222122332000016655433 Q gi|255764481|r 49 HCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHD 128 (271) Q Consensus 49 ~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd 128 (271) +++|.+-++||+|+.|+++++||+- .-+=| + ...+| T Consensus 81 GvVIGETa~iGddv~iyhGVTLGGt-----------gk~~G--------------~--kRHPt----------------- 116 (163) T TIGR01172 81 GVVIGETAEIGDDVTIYHGVTLGGT-----------GKEKG--------------V--KRHPT----------------- 116 (163) T ss_pred CEEEEEEEEECCCEEEEECEEECCC-----------CCCCC--------------C--CCCCC----------------- T ss_conf 3685003687388378632120678-----------88878--------------8--98882----------------- Q ss_pred CCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCE Q ss_conf 1001234432234524553214860652344433650 Q gi|255764481|r 129 CKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFT 165 (271) Q Consensus 129 ~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v 165 (271) |+++|+|+.+|.+-|+.+||++|.||++|+|.+.| T Consensus 117 --~~~gV~iGAGAKvLG~I~vG~~AkiGAnsVVl~dV 151 (163) T TIGR01172 117 --IGEGVVIGAGAKVLGNIEVGENAKIGANSVVLKDV 151 (163) T ss_pred --CCCCEEEEECCEEECCEEECCCCEECCCEEECCCC T ss_conf --56871784078240376677997772332673537 No 102 >TIGR02353 NRPS_term_dom non-ribosomal peptide synthetase terminal domain of unknown function; InterPro: IPR012728 This domain is found exclusively in non-ribosomal peptide synthetases and always as the final domain in the polypeptide. This domain is roughly 700 amino acids in size and is found in polypeptides roughly twice that size.. Probab=98.76 E-value=4.2e-09 Score=77.44 Aligned_cols=72 Identities=21% Similarity=0.225 Sum_probs=55.8 Q ss_pred CCCCCCCEECC-CEE-ECCCCEECCCCEECCCCEEC-----------CCCEECCCCEECCCCEECCCCEEEEEEECCCCC Q ss_conf 50579975655-108-88985998898995655988-----------998998898986686891684686553114342 Q gi|255764481|r 2 SRMGNNPIIHP-LAL-VEEGAVIGPNSLIGPFCCVG-----------SEVEIGAGVELISHCVVAGKTKIGDFTKVFPMA 68 (271) Q Consensus 2 ~~i~~~~~Ihp-sA~-I~~~a~Ig~~v~Igp~~~I~-----------~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a 68 (271) +|||+++.||. +.. +++=.+||+|+.|..++.+. +.|+||++..||..+++..||.||+...+.+.+ T Consensus 119 akiG~~v~i~s~~~p~~tDLl~iG~~~~v~~~v~l~g~r~~~g~l~~G~v~~g~~A~vG~~s~l~~~t~iG~~a~L~~~S 198 (719) T TIGR02353 119 AKIGKGVDIGSKLPPVCTDLLTIGDGVIVEKEVMLLGYRVERGRLRIGPVRIGDDAFVGTRSVLDIDTAIGDRAQLGHGS 198 (719) T ss_pred CCCCCCEEECCCCCEEEEEEEECCCCEEEECCCCEEEEEEECCEEEEEEEEECCCEEEEEEEEECCCCCCCCHHCCCCCC T ss_conf 00068537546555366300450786364034101100021584686328983716872013747887236502035546 Q ss_pred EEEEC Q ss_conf 23210 Q gi|255764481|r 69 VLGGD 73 (271) Q Consensus 69 ~Ig~~ 73 (271) .+-.+ T Consensus 199 al~~g 203 (719) T TIGR02353 199 ALAGG 203 (719) T ss_pred CCCCC T ss_conf 46688 No 103 >PRK03282 consensus Probab=98.76 E-value=2.8e-08 Score=72.23 Aligned_cols=54 Identities=24% Similarity=0.361 Sum_probs=27.6 Q ss_pred CCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCC--CCCCCHHHHCCCC Q ss_conf 3452455321486065234443365064047435013211223--3331000107453 Q gi|255764481|r 140 NVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD--VIPYGILNGNPGA 195 (271) Q Consensus 140 ~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd--Vpp~~~~~G~pa~ 195 (271) ++.|+..|+|+++|.| -+|.|+|.++||++|.+ -.|++-+| |++.+...+.|.+ T Consensus 329 nSvIg~~v~Ig~ga~I-~~SIIm~~~~Ig~~a~l-~~~Iidk~v~Ig~g~~Ig~~~~~ 384 (406) T PRK03282 329 NSVLSPNVVVDSGAIV-EGSVLMPGVRIGRGAVV-RHAILDKNVVVPPGAMVGVDPER 384 (406) T ss_pred CCEECCCCEECCCCEE-ECCEEECCCEECCCCEE-ECCEECCCCEECCCCEECCCCCC T ss_conf 9997899899999899-70889096899999999-73198999999989899988420 No 104 >PRK09527 lacA galactoside O-acetyltransferase; Reviewed Probab=98.74 E-value=2.3e-08 Score=72.79 Aligned_cols=46 Identities=26% Similarity=0.431 Sum_probs=28.1 Q ss_pred CCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECC-CCCCCCCCC Q ss_conf 223452455321486065234443365064047435013-211223333 Q gi|255764481|r 138 SNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGM-TGVVHDVIP 185 (271) Q Consensus 138 ~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~g-s~V~kdVpp 185 (271) +.++.+...|+||++++||++++|.. .|-++++++|. +.+.|.+.+ T Consensus 141 G~~~~I~pGv~IG~~~vigAgsvVtk--dip~~~v~~G~Parvir~i~~ 187 (203) T PRK09527 141 GSHVVINPGVTIGDNSVIGAGSVVTK--DIPPNVVAAGVPCRVIREIND 187 (203) T ss_pred CCCCEECCCCEECCCCEECCCCEECC--CCCCCCEEEEECCEEEEECCH T ss_conf 99999909929999979914988853--069982999628288986898 No 105 >cd03357 LbH_MAT_GAT Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively at the C6 position of the nonreducing end glucosyl moiety. GAT specifically acetylates galactopyranosides. Furthermore, MAT shows higher affinity toward artificial substrates containing an alkyl or hydrophobic chain as well as a glucosyl unit. Active MAT and GAT are homotrimers, with each subunit consisting of an N-terminal alpha-helical region and a C-terminal left-handed parallel alpha-helix (LbH) subdomain with 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=98.73 E-value=3.6e-08 Score=71.61 Aligned_cols=117 Identities=22% Similarity=0.314 Sum_probs=61.4 Q ss_pred EECCCCEECC--CCEECCCCEECCCCEECCCCEEC--CCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9988989956--55988998998898986686891--6846865531143422321023331222222332222222122 Q gi|255764481|r 21 VIGPNSLIGP--FCCVGSEVEIGAGVELISHCVVA--GKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIRE 96 (271) Q Consensus 21 ~Ig~~v~Igp--~~~I~~~v~IG~~~~I~~~v~I~--g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire 96 (271) .+|+++.|.| +|--+-+.+||+++.|..+++|. +..+||+++.|+|++.|-.....+. + T Consensus 44 ~~g~~~~I~~pf~~dyG~ni~iG~~~~in~n~~ild~~~I~IG~~v~igp~v~i~t~~H~~d----~------------- 106 (169) T cd03357 44 SVGENVYIEPPFHCDYGYNIHIGDNFYANFNCTILDVAPVTIGDNVLIGPNVQIYTAGHPLD----P------------- 106 (169) T ss_pred CCCCCCEEECCEEEECCCCEEECCEEEECCCCEEEECCCEEECCCEEECCCCEEECCCCCCC----H------------- T ss_conf 68999589788898343677988844887884897068879999807879858976899999----7------------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECC Q ss_conf 44321111222222122332000016655433100123443223452455321486065234443365064047435013 Q gi|255764481|r 97 GVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGM 176 (271) Q Consensus 97 ~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~g 176 (271) ..+- .+...+.-..|||+|+|+.++.+...|+||++++||++++|-.. |-+++++++. T Consensus 107 -------------~~r~-------~~~~~~~PI~Igd~vWIG~~~~IlpGVtIG~~~vIgAGsVVtkd--ip~~~v~~Gn 164 (169) T cd03357 107 -------------EERN-------RGLEYAKPITIGDNVWIGGGVIILPGVTIGDNSVIGAGSVVTKD--IPANVVAAGN 164 (169) T ss_pred -------------HHHH-------CCCCCCCCEEECCCCEECCCCEEECCCEECCCCEECCCCEECCC--CCCCEEEEEE T ss_conf -------------8963-------46652455698974362899789099799999999889879746--5999199975 No 106 >TIGR00965 dapD 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase; InterPro: IPR005664 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . The lysine/diaminopimelic acid branch of the aspartate pathway produces the essential amino acid lysine via the intermediate meso-diaminopimelic acid (meso-DAP), which is also a vital cell wall component in Gram-negative bacteria . The production of dihydropicolinate from aspartate-semialdehyde controls flux into the lysine/diaminopimelic acid pathway. Three variants of this pathway exist, differing in how tetrahydropicolinate (formed by reduction of dihydropicolinate) is metabolised to meso-DAP. One variant, the most commonly found one in archaea and bacteria, uses primarily succinyl intermediates, while a second variant, found only in Bacillus, utilises primarily acetyl intermediates. In the third variant, found in some Gram-positive bacteria, a dehydrogenase converts tetrahydropicolinate directly to meso-DAP. In all variants meso-DAP is subsequently converted to lysine by a decarboxylase, or, in Gram-negative bacteria, assimilated into the cell wall. Evidence exists that a fourth, currently unknown, variant of this pathway may function in plants . 2,3,4,5-tetrahydropyridine-2-carboxylate N-succinyltransferase (also known as tetrahydrodipicolinate N-succinyltransferase or DapD) is part of the succinyl route of of lysine/DAP biosynthesis. The DapD protein is a homotrimer is a trimeric enzyme with each monomer composed of three domain: an N-terminal helical domain, a distinctive left-handed parallel beta-helix (LBH) domain, and a predominantly beta C-terminal domain , . The LBH structure is encoded by an imperfect tandem-repeated hexapeptide sequence. Each trimer contains three independent active sites, always occuring at the boundary of two subunits, and formed by residues from one N-terminal domain, one C-terminal domain and two adjacent LBH domains.; GO: 0016757 transferase activity transferring glycosyl groups, 0009089 lysine biosynthetic process via diaminopimelate. Probab=98.71 E-value=9.2e-09 Score=75.28 Aligned_cols=114 Identities=25% Similarity=0.295 Sum_probs=94.3 Q ss_pred ECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCC Q ss_conf 65510888985998898995655988998998898986686891684686553114342232102333122222233222 Q gi|255764481|r 10 IHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVG 89 (271) Q Consensus 10 IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG 89 (271) .=|.|++-.++.|++|+.+.| +||.=++.|.+++.|..-++|..-+.||+||++.=++=||+--..| T Consensus 106 ~VPgA~vR~G~fiAkNvVlMP-SyvNIGAYvDEGtMvDTWATvGSCAQIGKNvHlSGG~GIGGVLEPL------------ 172 (275) T TIGR00965 106 KVPGAIVRRGTFIAKNVVLMP-SYVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSGGVGIGGVLEPL------------ 172 (275) T ss_pred CCCHHHEECCCEEEEEEEECC-CEEECCCEEECCCEECCHHHHCCHHHHCCCEEEECCCCCCCCCCCC------------ T ss_conf 487436005735775237755-5210136751686010012230032206625751783316524633------------ Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCE Q ss_conf 2222122443211112222221223320000166554331001234432234524553214860 Q gi|255764481|r 90 KKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRV 153 (271) Q Consensus 90 ~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~ 153 (271) .-.+|.|.||||+++-+-|..++++.++.+|+.++.++..++|-|+- T Consensus 173 -----------------QA~P~IIEDNCFiGARSEi~EGVIVEEGSVISMGVfIG~STkI~~R~ 219 (275) T TIGR00965 173 -----------------QAKPTIIEDNCFIGARSEIVEGVIVEEGSVISMGVFIGQSTKIVDRD 219 (275) T ss_pred -----------------CCCCEEEECCCEECCCEEEEEEEEECCCCEEEEEEEECCCCEEEECC T ss_conf -----------------57875882476004611356207971685888406855601688536 No 107 >cd03356 LbH_G1P_AT_C_like Left-handed parallel beta-Helix (LbH) domain of a group of proteins with similarity to glucose-1-phosphate adenylyltransferase: Included in this family are glucose-1-phosphate adenylyltransferase, mannose-1-phosphate guanylyltransferase, and the eukaryotic translation initiation factor eIF-2B subunits, epsilon and gamma. Most members of this family contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold, followed by a LbH fold domain with at least 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). eIF-2B epsilon contains an additional domain of unknown function at the C-terminus. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=98.71 E-value=6.1e-08 Score=70.13 Aligned_cols=47 Identities=19% Similarity=0.291 Sum_probs=20.5 Q ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEE Q ss_conf 9889899565598899899889898668689168468655311434223 Q gi|255764481|r 22 IGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVL 70 (271) Q Consensus 22 Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~I 70 (271) ||++++|++++.|+ +..||++|.|++++.|. ++.|.+++.|+++|.| T Consensus 2 IG~~~~Ig~~~~I~-~svIG~~c~Ig~~~~I~-~Sii~~~~~Ig~~~~i 48 (79) T cd03356 2 IGESTVIGENAIIK-NSVIGDNVRIGDGVTIT-NSILMDNVTIGANSVI 48 (79) T ss_pred CCCCCEECCCCEEE-CCEECCCCEECCCCEEE-EEEEECCCEECCCCEE T ss_conf 98979999999995-99999999999996996-1099797999999999 No 108 >cd03349 LbH_XAT Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. Members of this class of enzymes include Enterococcus faecium streptogramin A acetyltransferase and Pseudomonas aeruginosa chloramphenicol acetyltransferase. They contain repeated copies of a six-residue hexapeptide repeat sequence motif (X-[STAV]-X-[LIV]-[GAED]-X) and adopt a left-handed parallel beta helix (LbH) structure. The active enzyme is a trimer with CoA and substrate binding sites at the interface of two separate LbH subunits. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Probab=98.70 E-value=6.3e-08 Score=70.07 Aligned_cols=51 Identities=25% Similarity=0.470 Sum_probs=32.5 Q ss_pred CCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECC-CCCCC Q ss_conf 100123443223452455321486065234443365064047435013-21122 Q gi|255764481|r 129 CKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGM-TGVVH 181 (271) Q Consensus 129 ~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~g-s~V~k 181 (271) +.||++|.|+.++.+-..++||++++||++++|-.. +-.++++++. |.+.| T Consensus 74 i~Ig~~vwIG~n~~Il~GV~IG~gavIgAgSvVtkd--vp~~~i~~G~PAk~ik 125 (145) T cd03349 74 VIIGNDVWIGHGATILPGVTIGDGAVIAAGAVVTKD--VPPYAIVGGNPAKVIR 125 (145) T ss_pred EEECCCEEECCCCEEECCCEECCCCEEECCCEECCC--CCCCCEEEEECCEEEE T ss_conf 798899699998889588096788689289789704--6999499833887881 No 109 >cd05825 LbH_wcaF_like wcaF-like: This group is composed of the protein product of the E. coli wcaF gene and similar proteins. WcaF is part of the gene cluster responsible for the biosynthesis of the extracellular polysaccharide colanic acid. The wcaF protein is predicted to contain a left-handed parallel beta-helix (LbH) domain encoded by imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Many are trimeric in their active forms. Probab=98.69 E-value=2.6e-08 Score=72.45 Aligned_cols=47 Identities=17% Similarity=0.350 Sum_probs=30.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECC Q ss_conf 3100123443223452455321486065234443365064047435013 Q gi|255764481|r 128 DCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGM 176 (271) Q Consensus 128 d~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~g 176 (271) .+.||++|+++.++.+...++|+|++++|+++.|.. .+-+++++++. T Consensus 56 ~v~Ig~~v~IG~~~~I~~gv~Ig~~svVgagSvV~k--dvp~~~i~~G~ 102 (107) T cd05825 56 PIVIGDGAWVAAEAFVGPGVTIGEGAVVGARSVVVR--DLPAWTVYAGN 102 (107) T ss_pred CEEECCCCEECCCCEEECCCEECCCCEEECCCEECC--CCCCCCEEEEE T ss_conf 979938858749878969919189979948988903--85999799814 No 110 >cd03356 LbH_G1P_AT_C_like Left-handed parallel beta-Helix (LbH) domain of a group of proteins with similarity to glucose-1-phosphate adenylyltransferase: Included in this family are glucose-1-phosphate adenylyltransferase, mannose-1-phosphate guanylyltransferase, and the eukaryotic translation initiation factor eIF-2B subunits, epsilon and gamma. Most members of this family contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold, followed by a LbH fold domain with at least 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). eIF-2B epsilon contains an additional domain of unknown function at the C-terminus. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=98.69 E-value=3.3e-08 Score=71.80 Aligned_cols=30 Identities=30% Similarity=0.511 Sum_probs=11.8 Q ss_pred CCCCCEECCCEEECCCCEECCCCEECCCCEE Q ss_conf 5799756551088898599889899565598 Q gi|255764481|r 4 MGNNPIIHPLALVEEGAVIGPNSLIGPFCCV 34 (271) Q Consensus 4 i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I 34 (271) |++++.|.+.|.|.+ ..||++|.||+.+.| T Consensus 2 IG~~~~Ig~~~~I~~-svIG~~c~Ig~~~~I 31 (79) T cd03356 2 IGESTVIGENAIIKN-SVIGDNVRIGDGVTI 31 (79) T ss_pred CCCCCEECCCCEEEC-CEECCCCEECCCCEE T ss_conf 989799999999959-999999999999699 No 111 >PRK10191 putative colanic acid biosynthesis acetyltransferase WcaB; Provisional Probab=98.68 E-value=4.5e-08 Score=70.98 Aligned_cols=43 Identities=35% Similarity=0.656 Sum_probs=28.1 Q ss_pred CCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEEC Q ss_conf 012344322345245532148606523444336506404743501 Q gi|255764481|r 131 LGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGG 175 (271) Q Consensus 131 IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~ 175 (271) |||+|.++.++.+-|.++|||++.||+++.|.. .+-+++.+.+ T Consensus 95 Ig~~V~Igaga~IlG~i~IG~~~~IGA~svV~~--dvp~~~~v~G 137 (146) T PRK10191 95 IGNGVELGANVIILGDITLGNNVTVGAGSVVLD--SVPDNALVVG 137 (146) T ss_pred ECCCEEEECCCEEECCCEECCCCEECCCCEECC--CCCCCCEEEE T ss_conf 799959928999964859999999998958970--3599999997 No 112 >TIGR02353 NRPS_term_dom non-ribosomal peptide synthetase terminal domain of unknown function; InterPro: IPR012728 This domain is found exclusively in non-ribosomal peptide synthetases and always as the final domain in the polypeptide. This domain is roughly 700 amino acids in size and is found in polypeptides roughly twice that size.. Probab=98.68 E-value=2.1e-08 Score=73.05 Aligned_cols=84 Identities=21% Similarity=0.338 Sum_probs=66.9 Q ss_pred CCCCCCCHHCCCCCCCCC-CCCCCCCCCCCCCCCC-----------CCCEECCCEEECCCCCCCCCEEEECCCEEECCCC Q ss_conf 122332000016655433-1001234432234524-----------5532148606523444336506404743501321 Q gi|255764481|r 111 TIVGDNNFFLANSHVAHD-CKLGNGIVLSNNVMIA-----------GHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTG 178 (271) Q Consensus 111 T~IG~~~~i~~~~hI~Hd-~~IG~~vii~n~~~l~-----------G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~ 178 (271) .+||+++|+..+--=.|| |.||+.+.+-.++.+- .+|+|+|+|.+|..+.||..+.||+.|.+|-.|. T Consensus 622 ~kiGr~v~~d~~~ltE~dLv~IG~d~tlN~G~~~QtHlfeDRv~k~D~v~igdgatlG~~aiVlY~~~~G~~a~lgp~Sl 701 (719) T TIGR02353 622 VKIGRGVYIDTTDLTERDLVTIGDDSTLNEGSVIQTHLFEDRVMKSDTVTIGDGATLGPGAIVLYGAVVGEDAVLGPDSL 701 (719) T ss_pred CEECCCCEECCCCCCCHHHEEECCCCEECCCCCEECCCCCCCEEEECEEEEECEEEECCCCEECCCEEECCCCEECCCCE T ss_conf 24555322267567522203656861246875111140006756745289724038888635857315568643468432 Q ss_pred CCC--CCCCCCHHHHCCC Q ss_conf 122--3333100010745 Q gi|255764481|r 179 VVH--DVIPYGILNGNPG 194 (271) Q Consensus 179 V~k--dVpp~~~~~G~pa 194 (271) |.| .||+.+...|||| T Consensus 702 VMkGE~vP~~trW~GnPa 719 (719) T TIGR02353 702 VMKGEEVPAHTRWEGNPA 719 (719) T ss_pred EECCCCCCCCCCCCCCCC T ss_conf 503552888785888899 No 113 >cd04652 LbH_eIF2B_gamma_C eIF-2B gamma subunit, C-terminal Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B gamma subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH domain with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=98.66 E-value=1.2e-07 Score=68.35 Aligned_cols=29 Identities=21% Similarity=0.358 Sum_probs=9.4 Q ss_pred CCCCEECCCCEECCCCEECCCCEECCCCEE Q ss_conf 889899565598899899889898668689 Q gi|255764481|r 23 GPNSLIGPFCCVGSEVEIGAGVELISHCVV 52 (271) Q Consensus 23 g~~v~Igp~~~I~~~v~IG~~~~I~~~v~I 52 (271) |++++|++.+.| ++..||++|+|++++.| T Consensus 3 G~~~~Ig~~~~I-~~SvIG~nc~Ig~~~~i 31 (81) T cd04652 3 GENTQVGEKTSI-KRSVIGANCKIGKRVKI 31 (81) T ss_pred CCCCEECCCCEE-ECCEECCCCEECCCCEE T ss_conf 798899999989-67999799999999689 No 114 >PRK04928 consensus Probab=98.64 E-value=1.1e-07 Score=68.61 Aligned_cols=13 Identities=31% Similarity=0.123 Sum_probs=4.6 Q ss_pred EECCCEEECCCCC Q ss_conf 4047435013211 Q gi|255764481|r 167 IGKYAFIGGMTGV 179 (271) Q Consensus 167 IG~~a~ig~gs~V 179 (271) |++++-||.|++| T Consensus 364 idk~v~Ig~g~~I 376 (405) T PRK04928 364 IDKDVEIAPGTQI 376 (405) T ss_pred ECCCCEECCCCEE T ss_conf 8899999999999 No 115 >PRK02862 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.61 E-value=1.9e-07 Score=67.08 Aligned_cols=33 Identities=24% Similarity=0.362 Sum_probs=18.1 Q ss_pred CCEECCCCEECCCCEECCCCEEEEEEECCCCCEEE Q ss_conf 98998898986686891684686553114342232 Q gi|255764481|r 37 EVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLG 71 (271) Q Consensus 37 ~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig 71 (271) +..|++||+|. ++.|. ++.||.+++|+++|+|. T Consensus 308 ~SlI~~GciI~-g~~V~-nSVlg~~v~I~~ga~V~ 340 (429) T PRK02862 308 ESIISEGCILK-NCSIH-HSVLGVRSRIESDCVLE 340 (429) T ss_pred CCEECCCEEEE-CCEEE-CCEECCCCEECCCCEEE T ss_conf 45975976983-88887-67888985898998996 No 116 >cd04652 LbH_eIF2B_gamma_C eIF-2B gamma subunit, C-terminal Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B gamma subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH domain with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=98.60 E-value=9.5e-08 Score=68.95 Aligned_cols=30 Identities=27% Similarity=0.404 Sum_probs=10.3 Q ss_pred CCCCCEECCCEEECCCCEECCCCEECCCCEE Q ss_conf 5799756551088898599889899565598 Q gi|255764481|r 4 MGNNPIIHPLALVEEGAVIGPNSLIGPFCCV 34 (271) Q Consensus 4 i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I 34 (271) |++++.|.+.+.|. ...||+||+||+.+.| T Consensus 2 IG~~~~Ig~~~~I~-~SvIG~nc~Ig~~~~i 31 (81) T cd04652 2 VGENTQVGEKTSIK-RSVIGANCKIGKRVKI 31 (81) T ss_pred CCCCCEECCCCEEE-CCEECCCCEECCCCEE T ss_conf 87988999999896-7999799999999689 No 117 >cd04651 LbH_G1P_AT_C Glucose-1-phosphate adenylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Glucose-1-phosphate adenylyltransferase is also known as ADP-glucose synthase or ADP-glucose pyrophosphorylase. It catalyzes the first committed and rate-limiting step in starch biosynthesis in plants and glycogen biosynthesis in bacteria. It is the enzymatic site for regulation of storage polysaccharide accumulation in plants and bacteria. The enzyme is a homotetramer, with each subunit containing an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain with at 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The LbH domain is involved in cooperative allosteric regulation and oligomerization. Probab=98.54 E-value=2.2e-07 Score=66.61 Aligned_cols=69 Identities=23% Similarity=0.218 Sum_probs=41.3 Q ss_pred CCCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEECC Q ss_conf 5057997565510888985998898995655988998998898986686891684686553114342232102 Q gi|255764481|r 2 SRMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGGDT 74 (271) Q Consensus 2 ~~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~ 74 (271) |.|+..+.|+.. .|. ...|+.+|.|+++|.| .++.|.+++.|+.++.|. ++.|++++.|++++.||.++ T Consensus 13 S~Ig~Gc~I~~~-~V~-~SVIg~~~~Ig~~a~I-~~svi~~~~~Ig~~~~l~-~~Ii~~~~~I~~g~~Ig~~~ 81 (104) T cd04651 13 SLVSEGCIISGG-TVE-NSVLFRGVRVGSGSVV-EDSVIMPNVGIGRNAVIR-RAIIDKNVVIPDGVVIGGDP 81 (104) T ss_pred CEECCCCEEECC-EEE-CCEECCCCEECCCCEE-ECCEEECCCEECCCCEEE-EEEECCCCEECCCCEECCCC T ss_conf 998899799599-998-0897299799999899-600996556899999998-61999999999899999893 No 118 >COG1045 CysE Serine acetyltransferase [Amino acid transport and metabolism] Probab=98.54 E-value=1.6e-07 Score=67.45 Aligned_cols=38 Identities=26% Similarity=0.470 Sum_probs=30.8 Q ss_pred CCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEE Q ss_conf 00123443223452455321486065234443365064 Q gi|255764481|r 130 KLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRI 167 (271) Q Consensus 130 ~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~I 167 (271) .||++|.|+.++.+-|+.+|||++.||++++|.+.+-- T Consensus 121 tIg~~V~IGagAkILG~I~IGd~a~IGA~sVVlkdVP~ 158 (194) T COG1045 121 TIGNGVYIGAGAKILGNIEIGDNAKIGAGSVVLKDVPP 158 (194) T ss_pred CCCCCEEECCCCEEECCEEECCCCEECCCCEECCCCCC T ss_conf 31788598999889716688789788878668157899 No 119 >cd04649 LbH_THP_succinylT_putative Putative 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (THP succinyltransferase), C-terminal left-handed parallel alpha-helix (LbH) domain: This group is composed of mostly uncharacterized proteins containing an N-terminal domain of unknown function and a C-terminal LbH domain with similarity to THP succinyltransferase LbH. THP succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is trimeric and displays the left-handed parallel alpha-helix (LbH) structural motif encoded by the hexapeptide repeat motif. Probab=98.52 E-value=2.3e-07 Score=66.51 Aligned_cols=63 Identities=14% Similarity=-0.010 Sum_probs=50.4 Q ss_pred CEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEEEE Q ss_conf 75655108889859988989956559889989988989866868916846865531143422321 Q gi|255764481|r 8 PIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVLGG 72 (271) Q Consensus 8 ~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~ 72 (271) ..|-|.+.|--+|.|+++++|.|-++|.=|+-....+.|. ..|...+.||++|.++-+|.|++ T Consensus 2 vRi~~~~~VRlGAyl~~Gt~vM~egfVN~nAgt~g~~Mie--grissgv~VG~~~diggGasi~G 64 (147) T cd04649 2 VRIADADRVRLGAYLAEGTTVMHEGFVNFNAGTLGNCMVE--GRISSGVIVGKGSDVGGGASIMG 64 (147) T ss_pred EEECCCCEEEECCEECCCCEECCCCEEEECCCCCCCCEEE--EEEEECEEECCCCCCCCCCEEEE T ss_conf 3744786687341887995986256798766306661687--67851569899833278617878 No 120 >PRK02862 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.44 E-value=6.7e-07 Score=63.63 Aligned_cols=16 Identities=19% Similarity=0.540 Sum_probs=6.1 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 4331001234432234 Q gi|255764481|r 126 AHDCKLGNGIVLSNNV 141 (271) Q Consensus 126 ~Hd~~IG~~vii~n~~ 141 (271) +.+|.||++|.|.+.. T Consensus 381 dk~~~Ig~~~~I~~~~ 396 (429) T PRK02862 381 DKNARIGNNVTIVNKD 396 (429) T ss_pred CCCCEECCCCEECCCC T ss_conf 7898999999897898 No 121 >PRK03282 consensus Probab=98.42 E-value=1.3e-06 Score=61.84 Aligned_cols=48 Identities=17% Similarity=0.248 Sum_probs=17.8 Q ss_pred CCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCC Q ss_conf 00123443223452455321486065234443365064047435013211 Q gi|255764481|r 130 KLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGV 179 (271) Q Consensus 130 ~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V 179 (271) .||.+|.++.++.+- ++.|.+++.||.++.|. .+-|++++.||.|+.| T Consensus 331 vIg~~v~Ig~ga~I~-~SIIm~~~~Ig~~a~l~-~~Iidk~v~Ig~g~~I 378 (406) T PRK03282 331 VLSPNVVVDSGAIVE-GSVLMPGVRIGRGAVVR-HAILDKNVVVPPGAMV 378 (406) T ss_pred EECCCCEECCCCEEE-CCEEECCCEECCCCEEE-CCEECCCCEECCCCEE T ss_conf 978998999998997-08890968999999997-3198999999989899 No 122 >PRK00844 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.41 E-value=1.4e-06 Score=61.72 Aligned_cols=38 Identities=24% Similarity=0.303 Sum_probs=14.4 Q ss_pred CCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCC Q ss_conf 2455321486065234443365064047435013211223 Q gi|255764481|r 143 IAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD 182 (271) Q Consensus 143 l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd 182 (271) |+..|+||++|.|- ++.|++.++||++|.| -+|.+-++ T Consensus 334 Ig~~v~Ig~ga~I~-nSII~~~~~Ig~~a~I-~~sIidk~ 371 (409) T PRK00844 334 LSPNVRVDSGAIVE-GSVLMDGVRIGRGAVV-RRAILDKN 371 (409) T ss_pred ECCCCEECCCCEEE-CCEEECCCEECCCCEE-EEEEECCC T ss_conf 77998999998995-1899294999999999-80198999 No 123 >PRK03701 consensus Probab=98.39 E-value=5.1e-07 Score=64.39 Aligned_cols=45 Identities=7% Similarity=0.002 Sum_probs=21.1 Q ss_pred CCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCC--CCCCCH Q ss_conf 52455321486065234443365064047435013211223--333100 Q gi|255764481|r 142 MIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD--VIPYGI 188 (271) Q Consensus 142 ~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd--Vpp~~~ 188 (271) .|+-.|+|+++|.|- ++.|+|.++||++|.+ -.+.+.|| |||... T Consensus 349 Vl~r~v~I~~gA~V~-nSIIm~~~~Ig~ga~l-~~~IiDk~v~I~~g~v 395 (431) T PRK03701 349 VLFPRVRVNSFCNID-SAVLLPEVWVGRSCRL-RRCVIDRACVIPEGMV 395 (431) T ss_pred CCCCCCEECCCCEEE-CCEECCCCEECCCCEE-EEEEECCCCEECCCCE T ss_conf 811888999998997-2698596989999999-8379899999997999 No 124 >KOG1460 consensus Probab=98.34 E-value=8.8e-07 Score=62.90 Aligned_cols=57 Identities=28% Similarity=0.390 Sum_probs=37.7 Q ss_pred CCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEE Q ss_conf 9756551088898599889899565598899899889898668689168468655311434223 Q gi|255764481|r 7 NPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVL 70 (271) Q Consensus 7 ~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~I 70 (271) .+.|--..+|||+|++++.++|||++-|+++|++|++++|. .+.|=++|.|..+|++ T Consensus 282 ~a~IigdVyIhPsakvhptAkiGPNVSIga~vrvg~GvRl~-------~sIIl~d~ei~enavV 338 (407) T KOG1460 282 QAEIIGDVYIHPSAKVHPTAKIGPNVSIGANVRVGPGVRLR-------ESIILDDAEIEENAVV 338 (407) T ss_pred CCEEEEEEEECCCCEECCCCCCCCCCEECCCCEECCCCEEE-------EEEECCCCEEECCCEE T ss_conf 85487424776754047766208971446882546882665-------4560267376216367 No 125 >PRK01884 consensus Probab=98.31 E-value=1.4e-06 Score=61.68 Aligned_cols=59 Identities=15% Similarity=0.174 Sum_probs=35.1 Q ss_pred CCCCCCCCCCC----CCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCC--CCCCCH Q ss_conf 31001234432----23452455321486065234443365064047435013211223--333100 Q gi|255764481|r 128 DCKLGNGIVLS----NNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD--VIPYGI 188 (271) Q Consensus 128 d~~IG~~vii~----n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd--Vpp~~~ 188 (271) ++.|+++|++. .++.|+..|+|+++|.|- +|.|+|.++||++|.+- .+.+-|| |||... T Consensus 329 nSlIa~GcvI~~g~V~nSVL~r~v~I~~gA~V~-nSIIm~~~~IG~ga~l~-~~IiDk~v~I~~g~~ 393 (435) T PRK01884 329 NSLIAGGCVITDASISNSVLFDRIKVDEFSKID-HCVVLPQVKIGKNCVLK-RCIIDRHCVIPDGMQ 393 (435) T ss_pred CCEECCCEEEECCEEECCEEECCCEECCCCEEE-ECEEECCCEECCCCEEE-EEEECCCCEECCCCE T ss_conf 899819979939899719820896999999998-28880869899999996-269899999997999 No 126 >cd04651 LbH_G1P_AT_C Glucose-1-phosphate adenylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Glucose-1-phosphate adenylyltransferase is also known as ADP-glucose synthase or ADP-glucose pyrophosphorylase. It catalyzes the first committed and rate-limiting step in starch biosynthesis in plants and glycogen biosynthesis in bacteria. It is the enzymatic site for regulation of storage polysaccharide accumulation in plants and bacteria. The enzyme is a homotetramer, with each subunit containing an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain with at 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The LbH domain is involved in cooperative allosteric regulation and oligomerization. Probab=98.31 E-value=2.1e-06 Score=60.56 Aligned_cols=57 Identities=21% Similarity=0.127 Sum_probs=22.6 Q ss_pred ECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCCEE Q ss_conf 6551088898599889899565598899899889898668689168468655311434223 Q gi|255764481|r 10 IHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMAVL 70 (271) Q Consensus 10 IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a~I 70 (271) +-+.|.|. +..|+++|.|. .+.| .+..||++|+|++++.|. ++.|.+++.|+++|.| T Consensus 4 ~~~~~~v~-~S~Ig~Gc~I~-~~~V-~~SVIg~~~~Ig~~a~I~-~svi~~~~~Ig~~~~l 60 (104) T cd04651 4 IGRRGEVK-NSLVSEGCIIS-GGTV-ENSVLFRGVRVGSGSVVE-DSVIMPNVGIGRNAVI 60 (104) T ss_pred ECCCCEEE-ECEECCCCEEE-CCEE-ECCEECCCCEECCCCEEE-CCEEECCCEECCCCEE T ss_conf 88994898-49988997995-9999-808972997999998996-0099655689999999 No 127 >PRK00725 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.29 E-value=1.1e-06 Score=62.28 Aligned_cols=40 Identities=8% Similarity=0.026 Sum_probs=18.7 Q ss_pred CCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCC Q ss_conf 452455321486065234443365064047435013211223 Q gi|255764481|r 141 VMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD 182 (271) Q Consensus 141 ~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd 182 (271) +.|+..|+|+++|.|- ++.|+|.++||++|.+ -.+++-|| T Consensus 348 SVI~r~v~Ig~gA~V~-nSIIm~~~~IG~ga~l-~naIiDk~ 387 (431) T PRK00725 348 SVLFTRVRVNSFSNID-SAVLLPEVWVGRSCRL-RRCVIDRA 387 (431) T ss_pred CCCCCCCEECCCCEEE-EEEECCCCEECCCCEE-EEEEECCC T ss_conf 9825998999998996-1398898988999999-71398999 No 128 >PRK04928 consensus Probab=98.26 E-value=4.9e-06 Score=58.21 Aligned_cols=12 Identities=17% Similarity=-0.006 Sum_probs=4.3 Q ss_pred CCCEEEECCCEE Q ss_conf 365064047435 Q gi|255764481|r 162 HQFTRIGKYAFI 173 (271) Q Consensus 162 ~~~v~IG~~a~i 173 (271) -.+|+||+++.| T Consensus 365 dk~v~Ig~g~~I 376 (405) T PRK04928 365 DKDVEIAPGTQI 376 (405) T ss_pred CCCCEECCCCEE T ss_conf 899999999999 No 129 >KOG1322 consensus Probab=98.26 E-value=1.4e-06 Score=61.64 Aligned_cols=46 Identities=37% Similarity=0.456 Sum_probs=31.2 Q ss_pred CCCCCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECC Q ss_conf 0579975655108889859988989956559889989988989866 Q gi|255764481|r 3 RMGNNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELIS 48 (271) Q Consensus 3 ~i~~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~ 48 (271) |.-..+.|.-...+++.+.+|++|.|||+++||++++|++++.|.. T Consensus 254 r~~p~~~i~~nvlvd~~~~iG~~C~Ig~~vvIG~r~~i~~gV~l~~ 299 (371) T KOG1322 254 RLLPGSKIVGNVLVDSIASIGENCSIGPNVVIGPRVRIEDGVRLQD 299 (371) T ss_pred CCCCCCCCCCCEEECCCCCCCCCCEECCCCEECCCCEECCCEEEEE T ss_conf 5469755124376214001388637888736778728627628874 No 130 >COG0110 WbbJ Acetyltransferase (isoleucine patch superfamily) [General function prediction only] Probab=98.25 E-value=4.5e-06 Score=58.47 Aligned_cols=140 Identities=20% Similarity=0.296 Sum_probs=72.1 Q ss_pred CCCEECCCCEECCC---CEECCCCEECCCCEEC---CCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 89899565598899---8998898986686891---68468655311434223210233312222223322222221224 Q gi|255764481|r 24 PNSLIGPFCCVGSE---VEIGAGVELISHCVVA---GKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREG 97 (271) Q Consensus 24 ~~v~Igp~~~I~~~---v~IG~~~~I~~~v~I~---g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~ 97 (271) .+..++.+..+... ..+...+.+.+..... .+..+|++|.+..++.+- ....+.||+++.+..+ T Consensus 30 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~iG~~~~i~~~~~~~----------~~~~i~ig~~~~i~~~ 99 (190) T COG0110 30 KPVKLGRYAEILGRLVGIKIGEVAVIRPPVRIDLGEKNLTIGDLCFIGVNVVIL----------VGEGITIGDNVVVGPN 99 (190) T ss_pred CCCEECCCEEEEECCCCCCCCCEEEECCCCEEEECCCCEEECCEEEEECCCEEE----------ECCCCEECCCCEECCC T ss_conf 884013432787401443256305861350244143210646346770254498----------3488588998589599 Q ss_pred CCCCCCCCCCCCCC-CCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECC Q ss_conf 43211112222221-22332000016655433100123443223452455321486065234443365064047435013 Q gi|255764481|r 98 VTINRGTVEYGGKT-IVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGM 176 (271) Q Consensus 98 v~I~~gt~~~~~~T-~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~g 176 (271) |++...+......+ ..+. .....++.||++|.++.++.+...|+|||+++||+++++.. -+-.++.+++. T Consensus 100 v~i~~~~h~~~~~~~~~~~-------~~~~~~v~IG~~vwIG~~a~IlpGV~IG~gavigagsVVtk--dvp~~~iv~G~ 170 (190) T COG0110 100 VTIYTNSHPGDFVTANIGA-------LVGAGPVTIGEDVWIGAGAVILPGVTIGEGAVIGAGSVVTK--DVPPYGIVAGN 170 (190) T ss_pred EEECCCCCCCCCCCCCCCC-------CCCCCCCEECCCCEECCCCEECCCEEECCCEEEECCEEEEC--CCCCCEEEECC T ss_conf 4982687656522000143-------23578978899858778978979859899819978858957--77998289799 Q ss_pred -CCCCCC Q ss_conf -211223 Q gi|255764481|r 177 -TGVVHD 182 (271) Q Consensus 177 -s~V~kd 182 (271) +.|.+. T Consensus 171 Pa~vir~ 177 (190) T COG0110 171 PARVIRK 177 (190) T ss_pred CCEEEEE T ss_conf 9889874 No 131 >COG2171 DapD Tetrahydrodipicolinate N-succinyltransferase [Amino acid transport and metabolism] Probab=98.23 E-value=1.9e-06 Score=60.82 Aligned_cols=67 Identities=21% Similarity=0.373 Sum_probs=32.3 Q ss_pred CCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCC--------CCCEEEECCCEEECCCCCC Q ss_conf 332000016655433100123443223452455321486065234443--------3650640474350132112 Q gi|255764481|r 114 GDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAV--------HQFTRIGKYAFIGGMTGVV 180 (271) Q Consensus 114 G~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v--------~~~v~IG~~a~ig~gs~V~ 180 (271) |++..+|..+.|-=+..++..+.+--++.+++.++||+++.||+++.| .+.+.||++|++||+|.+. T Consensus 124 ~~gtvvM~~sfVNigA~~~~gtMVd~~as~G~~a~VGkn~higgGa~I~GVLep~~a~Pv~IgdncliGAns~~v 198 (271) T COG2171 124 AKGTVVMPESFVNIGAGTGEGTMVDGRASVGSCAQVGKNSHIGGGASIGGVLEPLQANPVIIGDNCLIGANSEVV 198 (271) T ss_pred CCCCEECCCCEEEECCCCCCCEEEEEEEEEECCEEECCCCCCCCCCEEEEEECCCCCCCEEECCCCEECCCCCEE T ss_conf 799387555668777602752589615655131797998452786367677348887886987761862453348 No 132 >TIGR00965 dapD 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase; InterPro: IPR005664 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . The lysine/diaminopimelic acid branch of the aspartate pathway produces the essential amino acid lysine via the intermediate meso-diaminopimelic acid (meso-DAP), which is also a vital cell wall component in Gram-negative bacteria . The production of dihydropicolinate from aspartate-semialdehyde controls flux into the lysine/diaminopimelic acid pathway. Three variants of this pathway exist, differing in how tetrahydropicolinate (formed by reduction of dihydropicolinate) is metabolised to meso-DAP. One variant, the most commonly found one in archaea and bacteria, uses primarily succinyl intermediates, while a second variant, found only in Bacillus, utilises primarily acetyl intermediates. In the third variant, found in some Gram-positive bacteria, a dehydrogenase converts tetrahydropicolinate directly to meso-DAP. In all variants meso-DAP is subsequently converted to lysine by a decarboxylase, or, in Gram-negative bacteria, assimilated into the cell wall. Evidence exists that a fourth, currently unknown, variant of this pathway may function in plants . 2,3,4,5-tetrahydropyridine-2-carboxylate N-succinyltransferase (also known as tetrahydrodipicolinate N-succinyltransferase or DapD) is part of the succinyl route of of lysine/DAP biosynthesis. The DapD protein is a homotrimer is a trimeric enzyme with each monomer composed of three domain: an N-terminal helical domain, a distinctive left-handed parallel beta-helix (LBH) domain, and a predominantly beta C-terminal domain , . The LBH structure is encoded by an imperfect tandem-repeated hexapeptide sequence. Each trimer contains three independent active sites, always occuring at the boundary of two subunits, and formed by residues from one N-terminal domain, one C-terminal domain and two adjacent LBH domains.; GO: 0016757 transferase activity transferring glycosyl groups, 0009089 lysine biosynthetic process via diaminopimelate. Probab=98.21 E-value=1.6e-06 Score=61.33 Aligned_cols=114 Identities=19% Similarity=0.253 Sum_probs=78.6 Q ss_pred CCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCC Q ss_conf 98986686891684686553114342232102333122222233222222212244321111222222122332000016 Q gi|255764481|r 43 GVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLAN 122 (271) Q Consensus 43 ~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~ 122 (271) +-+.=|.++++..+.|-+|..+.| +++.||.. |....++... T Consensus 103 g~R~VPgA~vR~G~fiAkNvVlMP-----------------SyvNIGAY---------------------vDEGtMvDTW 144 (275) T TIGR00965 103 GFRKVPGAIVRRGTFIAKNVVLMP-----------------SYVNIGAY---------------------VDEGTMVDTW 144 (275) T ss_pred CCCCCCHHHEECCCEEEEEEEECC-----------------CEEECCCE---------------------EECCCEECCH T ss_conf 665487436005735775237755-----------------52101367---------------------5168601001 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCC--------CCEECCCEEECCCCCCCCCEEEECCCEEECCCCCC-------------- Q ss_conf 65543310012344322345245--------53214860652344433650640474350132112-------------- Q gi|255764481|r 123 SHVAHDCKLGNGIVLSNNVMIAG--------HVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVV-------------- 180 (271) Q Consensus 123 ~hI~Hd~~IG~~vii~n~~~l~G--------~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~-------------- 180 (271) ++||--.+||.||.|+-++.++| .+.|+||||||+.+=|..++-+.+++.|.+|.-+- T Consensus 145 ATvGSCAQIGKNvHlSGG~GIGGVLEPLQA~P~IIEDNCFiGARSEi~EGVIVEEGSVISMGVfIG~STkI~~R~TGEI~ 224 (275) T TIGR00965 145 ATVGSCAQIGKNVHLSGGVGIGGVLEPLQAKPTIIEDNCFIGARSEIVEGVIVEEGSVISMGVFIGQSTKIVDRDTGEII 224 (275) T ss_pred HHHCCHHHHCCCEEEECCCCCCCCCCCCCCCCEEEECCCEECCCEEEEEEEEECCCCEEEEEEEECCCCEEEECCCCEEE T ss_conf 22300322066257517833165246335787588247600461135620797168588840685560168853667176 Q ss_pred -CCCCCCCH-HHHC-CC Q ss_conf -23333100-0107-45 Q gi|255764481|r 181 -HDVIPYGI-LNGN-PG 194 (271) Q Consensus 181 -kdVpp~~~-~~G~-pa 194 (271) .-||+|++ +.|+ |. T Consensus 225 ~GRvPA~SVVVsG~lPs 241 (275) T TIGR00965 225 YGRVPAGSVVVSGILPS 241 (275) T ss_pred ECCCCCEEEEECCCCCC T ss_conf 55135602797177887 No 133 >KOG1462 consensus Probab=98.20 E-value=1.6e-06 Score=61.35 Aligned_cols=16 Identities=31% Similarity=0.254 Sum_probs=6.7 Q ss_pred EEEECCCEEECCCCCC Q ss_conf 0640474350132112 Q gi|255764481|r 165 TRIGKYAFIGGMTGVV 180 (271) Q Consensus 165 v~IG~~a~ig~gs~V~ 180 (271) +.||.+|-||.+|.++ T Consensus 386 sIIg~gA~Ig~gs~L~ 401 (433) T KOG1462 386 SIIGMGAQIGSGSKLK 401 (433) T ss_pred CEECCCCEECCCCEEE T ss_conf 2662665654797033 No 134 >PRK11132 cysE serine acetyltransferase; Provisional Probab=98.20 E-value=3.5e-06 Score=59.12 Aligned_cols=44 Identities=25% Similarity=0.369 Sum_probs=34.0 Q ss_pred CCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEEC Q ss_conf 0012344322345245532148606523444336506404743501 Q gi|255764481|r 130 KLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGG 175 (271) Q Consensus 130 ~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~ 175 (271) .||++|+|+.++.+-|+++||+++.||++++|...+ -.++.+.+ T Consensus 195 ~i~~~v~igaga~iLG~i~iG~~a~igA~svVl~dv--p~~~tv~G 238 (273) T PRK11132 195 KIREGVMIGAGAKILGNIEVGRGAKIGAGSVVLQPV--PPHTTAAG 238 (273) T ss_pred CCCCCEEECCCCEEECCEEECCCCEECCCCEECCCC--CCCCEEEC T ss_conf 048884875488897185889998886786686437--99987966 No 135 >KOG1322 consensus Probab=98.05 E-value=7.9e-06 Score=56.93 Aligned_cols=12 Identities=17% Similarity=0.670 Sum_probs=4.2 Q ss_pred CCCCCCCCCCCC Q ss_conf 310012344322 Q gi|255764481|r 128 DCKLGNGIVLSN 139 (271) Q Consensus 128 d~~IG~~vii~n 139 (271) +|+||++|++.| T Consensus 334 ~a~lG~nV~V~d 345 (371) T KOG1322 334 NAVLGKNVIVAD 345 (371) T ss_pred CCEECCCEEEEC T ss_conf 537536259942 No 136 >PRK01884 consensus Probab=98.04 E-value=1.5e-05 Score=55.27 Aligned_cols=52 Identities=17% Similarity=0.093 Sum_probs=24.3 Q ss_pred CCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCC Q ss_conf 100123443223452455321486065234443365064047435013211223 Q gi|255764481|r 129 CKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD 182 (271) Q Consensus 129 ~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd 182 (271) ++|+.+|.+..++.+-. +.|.+++.||.++.+. .+-|++++.|+.|.+|--| T Consensus 346 SVL~r~v~I~~gA~V~n-SIIm~~~~IG~ga~l~-~~IiDk~v~I~~g~~IG~~ 397 (435) T PRK01884 346 SVLFDRIKVDEFSKIDH-CVVLPQVKIGKNCVLK-RCIIDRHCVIPDGMQIGVD 397 (435) T ss_pred CEEECCCEECCCCEEEE-CEEECCCEECCCCEEE-EEEECCCCEECCCCEECCC T ss_conf 98208969999999982-8880869899999996-2698999999979999858 No 137 >cd04649 LbH_THP_succinylT_putative Putative 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (THP succinyltransferase), C-terminal left-handed parallel alpha-helix (LbH) domain: This group is composed of mostly uncharacterized proteins containing an N-terminal domain of unknown function and a C-terminal LbH domain with similarity to THP succinyltransferase LbH. THP succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is trimeric and displays the left-handed parallel alpha-helix (LbH) structural motif encoded by the hexapeptide repeat motif. Probab=97.90 E-value=4.3e-05 Score=52.35 Aligned_cols=34 Identities=15% Similarity=0.271 Sum_probs=17.1 Q ss_pred CEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCC Q ss_conf 321486065234443365064047435013211223 Q gi|255764481|r 147 VIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHD 182 (271) Q Consensus 147 v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kd 182 (271) +.|||+|+||+++.+ .+.+|+.|.|.+|-.++.+ T Consensus 74 i~Ig~ncliGAnsgi--Gi~lGd~cvv~aGlyi~~g 107 (147) T cd04649 74 ISIGKRCLLGANSGI--GISLGDNCIVEAGLYVTAG 107 (147) T ss_pred EEECCCEEECCCCCC--CEEECCCCEEECCEEECCC T ss_conf 698875076146545--8486898688244598399 No 138 >KOG1462 consensus Probab=97.85 E-value=3.2e-05 Score=53.13 Aligned_cols=44 Identities=25% Similarity=0.339 Sum_probs=20.9 Q ss_pred CCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCC Q ss_conf 2000016655433100123443223452455321486065234443365064047435013211 Q gi|255764481|r 116 NNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGV 179 (271) Q Consensus 116 ~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V 179 (271) ++.+|.+++|+|+|.|. + +.|+++|.||.++.+ +.|.||.+-+| T Consensus 368 nSilm~nV~vg~G~~Ie-n------------sIIg~gA~Ig~gs~L-------~nC~Ig~~yvV 411 (433) T KOG1462 368 NSILMDNVVVGDGVNIE-N------------SIIGMGAQIGSGSKL-------KNCIIGPGYVV 411 (433) T ss_pred EEEEECCCEECCCCCEE-C------------CEECCCCEECCCCEE-------EEEEECCCCEE T ss_conf 11766583755884020-4------------266266565479703-------33485598277 No 139 >KOG4750 consensus Probab=97.84 E-value=1.6e-05 Score=54.99 Aligned_cols=35 Identities=29% Similarity=0.547 Sum_probs=28.9 Q ss_pred CCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCC Q ss_conf 00123443223452455321486065234443365 Q gi|255764481|r 130 KLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQF 164 (271) Q Consensus 130 ~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~ 164 (271) .|||+|.|+.++.+-|+++||.++.||+++.|... T Consensus 202 ~Igd~vliGaGvtILgnV~IGegavIaAGsvV~kD 236 (269) T KOG4750 202 KIGDNVLIGAGVTILGNVTIGEGAVIAAGSVVLKD 236 (269) T ss_pred CCCCCEEECCCCEEECCEEECCCCEEECCCEEEEC T ss_conf 55677087554178578167788677305069721 No 140 >PRK03701 consensus Probab=97.80 E-value=4.9e-05 Score=51.98 Aligned_cols=55 Identities=11% Similarity=0.010 Sum_probs=28.5 Q ss_pred CCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCC Q ss_conf 100123443223452455321486065234443365064047435013211223333 Q gi|255764481|r 129 CKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIP 185 (271) Q Consensus 129 ~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp 185 (271) ++|+.+|.++.++.+-. +.|.+++.||.++.+. .+-|+++|.|+.|.+|-.|... T Consensus 348 SVl~r~v~I~~gA~V~n-SIIm~~~~Ig~ga~l~-~~IiDk~v~I~~g~vIg~~~~~ 402 (431) T PRK03701 348 SVLFPRVRVNSFCNIDS-AVLLPEVWVGRSCRLR-RCVIDRACVIPEGMVIGENAEE 402 (431) T ss_pred CCCCCCCEECCCCEEEC-CEECCCCEECCCCEEE-EEEECCCCEECCCCEECCCHHH T ss_conf 98118889999989972-6985969899999998-3798999999979998968367 No 141 >PRK00725 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=97.76 E-value=5.6e-05 Score=51.62 Aligned_cols=70 Identities=13% Similarity=0.060 Sum_probs=36.5 Q ss_pred CCCCCCHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCC Q ss_conf 22332000016655433100123443223452455321486065234443365064047435013211223333 Q gi|255764481|r 112 IVGDNNFFLANSHVAHDCKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIP 185 (271) Q Consensus 112 ~IG~~~~i~~~~hI~Hd~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp 185 (271) .|++.|.|.. ++|. +++|+.+|.++.++.+-. +.|.+++.||.++.+. ++-|++++.|+.|++|..|..+ T Consensus 333 li~~GcvI~G-~~Ve-nSVI~r~v~Ig~gA~V~n-SIIm~~~~IG~ga~l~-naIiDk~v~I~~G~~Ig~~~~~ 402 (431) T PRK00725 333 LVSGGCIISG-SVVV-QSVLFTRVRVNSFSNIDS-AVLLPEVWVGRSCRLR-RCVIDRACVIPEGMVIGENAEE 402 (431) T ss_pred EEECCEEEEC-CEEE-ECCCCCCCEECCCCEEEE-EEECCCCEECCCCEEE-EEEECCCCEECCCCEECCCHHH T ss_conf 9839989969-9998-298259989999989961-3988989889999997-1398999898989998968578 No 142 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=97.72 E-value=6.3e-05 Score=51.30 Aligned_cols=51 Identities=25% Similarity=0.406 Sum_probs=28.0 Q ss_pred CEECCCCEECCCCEECCCCEECCCCEEEEEEECCCCC-EEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 5988998998898986686891684686553114342-232102333122222233222222212 Q gi|255764481|r 32 CCVGSEVEIGAGVELISHCVVAGKTKIGDFTKVFPMA-VLGGDTQSKYHNFVGTELLVGKKCVIR 95 (271) Q Consensus 32 ~~I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I~~~a-~Ig~~~qd~~~~g~~~~v~IG~~~~Ir 95 (271) +-|.+-|.||++++|.-+++|+|++.||+||.|. +| .||. ++.||++++|+ T Consensus 255 s~i~GRV~v~~gakid~ns~i~GPa~IG~d~~I~-nsGyiGP------------YtSig~~~~I~ 306 (361) T TIGR01208 255 SKIEGRVEVGEGAKIDVNSVIRGPAVIGEDCIIE-NSGYIGP------------YTSIGEGVVIR 306 (361) T ss_pred CEEEEEEEECCCCEECCCCEEECCEEECCCCEEE-CCCCCCC------------EEEECCCEEEE T ss_conf 5788407908978842377787060877952782-6861177------------07725753874 No 143 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=97.71 E-value=4.9e-05 Score=52.00 Aligned_cols=34 Identities=15% Similarity=0.181 Sum_probs=14.2 Q ss_pred EEECCCCEECCCCEECCCCEECCCCEECCCCEEC Q ss_conf 0888985998898995655988998998898986 Q gi|255764481|r 14 ALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELI 47 (271) Q Consensus 14 A~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~ 47 (271) +-|...++|+++.+|.-+++|.+-|.||+||.|. T Consensus 255 s~i~GRV~v~~gakid~ns~i~GPa~IG~d~~I~ 288 (361) T TIGR01208 255 SKIEGRVEVGEGAKIDVNSVIRGPAVIGEDCIIE 288 (361) T ss_pred CEEEEEEEECCCCEECCCCEEECCEEECCCCEEE T ss_conf 5788407908978842377787060877952782 No 144 >TIGR02091 glgC glucose-1-phosphate adenylyltransferase; InterPro: IPR011831 This enzyme, glucose-1-phosphate adenylyltransferase, is also called ADP-glucose pyrophosphorylase. The plant form is an alpha2, beta2 heterodimer, allosterically regulated in plants. Both subunits are homologous and included in this entry. In bacteria, both homomeric forms of GlgC and more active heterodimers of GlgC and GlgD have been described. This entry describes the GlgC subunit only. This enzyme appears in variants of glycogen synthesis pathways that use ADP-glucose, rather than UDP-glucose as in animals.; GO: 0008878 glucose-1-phosphate adenylyltransferase activity. Probab=97.58 E-value=0.00011 Score=49.70 Aligned_cols=12 Identities=25% Similarity=0.564 Sum_probs=7.2 Q ss_pred CCEECCCEEECC Q ss_conf 975655108889 Q gi|255764481|r 7 NPIIHPLALVEE 18 (271) Q Consensus 7 ~~~IhpsA~I~~ 18 (271) +.+.-|+.++++ T Consensus 315 ~~~~pPa~f~~~ 326 (421) T TIGR02091 315 NEFLPPAKFVDS 326 (421) T ss_pred CCCCCCCEEECC T ss_conf 888888604557 No 145 >KOG3121 consensus Probab=97.56 E-value=0.00011 Score=49.87 Aligned_cols=145 Identities=16% Similarity=0.203 Sum_probs=79.2 Q ss_pred CCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHCCCCCCCCC Q ss_conf 86891684686553114342232102333122222233222222212244321111222222122332000016655433 Q gi|255764481|r 49 HCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIREGVTINRGTVEYGGKTIVGDNNFFLANSHVAHD 128 (271) Q Consensus 49 ~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T~IG~~~~i~~~~hI~Hd 128 (271) +..|.|.+.+-++ +.|. |+-..+.||..|++...+.|.+-- + +=+.---|-.+|||.. T Consensus 33 NI~lnGKtIv~~g------~iIR---------GDLAnVr~GryCV~ksrsvIRPp~-K------~FSKg~affp~hiGdh 90 (184) T KOG3121 33 NILLNGKTIVEEG------VIIR---------GDLANVRIGRYCVLKSRSVIRPPM-K------IFSKGPAFFPVHIGDH 90 (184) T ss_pred EEEECCCEEEEEC------CEEE---------CCCCCCEECCEEEECCCCCCCCCH-H------HHCCCCEEEEEEECCE T ss_conf 2798372798608------5895---------120102473368964665437706-7------7567862456652456 Q ss_pred CCCCCCCCCCCCCCCCCCCEECCCEEECCCCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCC Q ss_conf 10012344322345245532148606523444336506404743501321122333310001074532334420144428 Q gi|255764481|r 129 CKLGNGIVLSNNVMIAGHVIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAG 208 (271) Q Consensus 129 ~~IG~~vii~n~~~l~G~v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g 208 (271) +.|++.|++ |.++++..|.+|.+++||-.+++..-++|-+++. +||..++- |-...+-| +-.--| T Consensus 91 VFieE~cVV-nAAqIgsyVh~GknaviGrrCVlkdCc~ild~tV----------lPpet~vp--py~~~~g~--p~~~~G 155 (184) T KOG3121 91 VFIEEECVV-NAAQIGSYVHLGKNAVIGRRCVLKDCCRILDDTV----------LPPETLVP--PYSTIGGN--PAQVVG 155 (184) T ss_pred EEEECCEEE-EHHHHEEEEEECCCEEECCCEEHHHHEECCCCCC----------CCCCCCCC--CCEEECCC--CCEEEC T ss_conf 887044186-2666120368546406737508334104158952----------58201369--71677388--723531 Q ss_pred CCHHHH-HHHHHHHHHHHHCCCC Q ss_conf 889999-9999999999727998 Q gi|255764481|r 209 FSRDTI-HLIRAVYKQIFQQGDS 230 (271) Q Consensus 209 ~s~e~i-~~l~~ayr~lfr~~~~ 230 (271) +.++-- ...-++-+..|..=.+ T Consensus 156 ~~P~ctq~lMi~~tksyY~~Fvp 178 (184) T KOG3121 156 TEPRCTQNLMIEATKSYYDNFVP 178 (184) T ss_pred CCCHHHHHHHHHHHHHHHHHCCC T ss_conf 57302689999999999983375 No 146 >COG0448 GlgC ADP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=97.42 E-value=0.00075 Score=44.55 Aligned_cols=12 Identities=8% Similarity=0.183 Sum_probs=4.5 Q ss_pred CCEECCCCEECC Q ss_conf 989988989866 Q gi|255764481|r 37 EVEIGAGVELIS 48 (271) Q Consensus 37 ~v~IG~~~~I~~ 48 (271) +..+++||.|.. T Consensus 296 nSLv~~GciI~G 307 (393) T COG0448 296 NSLVAGGCIISG 307 (393) T ss_pred EEEEECCEEEEE T ss_conf 226607718963 No 147 >COG4801 Predicted acyltransferase [General function prediction only] Probab=97.36 E-value=0.00033 Score=46.76 Aligned_cols=49 Identities=16% Similarity=0.255 Sum_probs=21.6 Q ss_pred CCCEECCCCEECCCCEEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8989866868916846865531143422321023331222222332222222122 Q gi|255764481|r 42 AGVELISHCVVAGKTKIGDFTKVFPMAVLGGDTQSKYHNFVGTELLVGKKCVIRE 96 (271) Q Consensus 42 ~~~~I~~~v~I~g~t~IG~~~~I~~~a~Ig~~~qd~~~~g~~~~v~IG~~~~Ire 96 (271) .+|+|+.+|.+.+|+..+.+..|+.|++|++. --....+.||+++.|.+ T Consensus 55 ~diridmw~kv~gNV~ve~dayiGE~~sI~gk------l~v~gdLdig~dV~Ieg 103 (277) T COG4801 55 KDIRIDMWCKVTGNVIVENDAYIGEFSSIKGK------LTVIGDLDIGADVIIEG 103 (277) T ss_pred CCEEEEEEEEEECCEEECCCEEEECCCEEEEE------EEEECCCCCCCCEEEEC T ss_conf 11366215675231898474077033024016------99953323565168756 No 148 >COG4801 Predicted acyltransferase [General function prediction only] Probab=97.23 E-value=0.0014 Score=42.90 Aligned_cols=22 Identities=18% Similarity=0.267 Sum_probs=10.1 Q ss_pred CEECCCEEECCCCCCCCCEEEE Q ss_conf 3214860652344433650640 Q gi|255764481|r 147 VIVDDRVVFGGGSAVHQFTRIG 168 (271) Q Consensus 147 v~Igd~~~IG~~~~v~~~v~IG 168 (271) .++++++++|..+.++.+++-+ T Consensus 167 i~vg~da~Vg~~c~l~Gnvka~ 188 (277) T COG4801 167 IKVGDDAFVGTVCRLEGNVKAG 188 (277) T ss_pred EEECCCCEECCEEEEECCEEEE T ss_conf 8706884613205997450321 No 149 >TIGR03536 DapD_gpp 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase (DapD) is involved in the succinylated branch of the "lysine biosynthesis via diaminopimelate (DAP)" pathway (GenProp0125). This model represents a clade of DapD sequences most closely related to the actinobacterial DapD family represented by the TIGR03535 model. All of the genes evaluated for the seed of this model are found in genomes where the downstream desuccinylase is present, but known DapD genes are absent. Additionally, many of the genes identified by this model are found proximal to genes involved in this lysine biosynthesis pathway. Probab=96.89 E-value=0.0017 Score=42.42 Aligned_cols=33 Identities=9% Similarity=0.189 Sum_probs=15.3 Q ss_pred CEECCCEEECCCCCCCCCEEEECCCEEECCCCCCC Q ss_conf 32148606523444336506404743501321122 Q gi|255764481|r 147 VIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVH 181 (271) Q Consensus 147 v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~k 181 (271) +.||++|.+|+++++ ++.+|+.|.|.+|-.||. T Consensus 251 isiG~~cLlGANsg~--GI~LGD~C~veaGlyvta 283 (341) T TIGR03536 251 ISVGEGCLLGANAGI--GIPLGDRCTVEAGLYITA 283 (341) T ss_pred EEECCCCCCCCCCCC--CCCCCCCCEEECCEEEEC T ss_conf 888732201444555--742489847812449927 No 150 >TIGR03535 DapD_actino 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. Alternate name: tetrahydrodipicolinate N-succinyltransferase. Probab=96.82 E-value=0.0021 Score=41.74 Aligned_cols=23 Identities=22% Similarity=0.222 Sum_probs=9.1 Q ss_pred CCCEECCCCEECCCCEECCCCEE Q ss_conf 99899889898668689168468 Q gi|255764481|r 36 SEVEIGAGVELISHCVVAGKTKI 58 (271) Q Consensus 36 ~~v~IG~~~~I~~~v~I~g~t~I 58 (271) .+|+|.|..++.=++.+...|++ T Consensus 152 ~gVRIada~rVRLGAyl~~GTtV 174 (319) T TIGR03535 152 TGVRIGDADRVRLGAHLAEGTTV 174 (319) T ss_pred CCCEECCCCCEEEEEEECCCCEE T ss_conf 88477045525540076598077 No 151 >TIGR03536 DapD_gpp 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase (DapD) is involved in the succinylated branch of the "lysine biosynthesis via diaminopimelate (DAP)" pathway (GenProp0125). This model represents a clade of DapD sequences most closely related to the actinobacterial DapD family represented by the TIGR03535 model. All of the genes evaluated for the seed of this model are found in genomes where the downstream desuccinylase is present, but known DapD genes are absent. Additionally, many of the genes identified by this model are found proximal to genes involved in this lysine biosynthesis pathway. Probab=96.82 E-value=0.0031 Score=40.72 Aligned_cols=86 Identities=22% Similarity=0.249 Sum_probs=37.6 Q ss_pred ECCCCEECCCCEECCCCEECCCCEEEEEEEC-CCCCEEEECCCCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8899899889898668689168468655311-43422321023331222-222332222222122443211112222221 Q gi|255764481|r 34 VGSEVEIGAGVELISHCVVAGKTKIGDFTKV-FPMAVLGGDTQSKYHNF-VGTELLVGKKCVIREGVTINRGTVEYGGKT 111 (271) Q Consensus 34 I~~~v~IG~~~~I~~~v~I~g~t~IG~~~~I-~~~a~Ig~~~qd~~~~g-~~~~v~IG~~~~Ire~v~I~~gt~~~~~~T 111 (271) +-.+|+|.|-.++.=++.+...|++..--.+ |+--++|..- -.| ....|++|+++-++-+++|- ||..+++.. T Consensus 175 vP~gvRIad~~rVRLGA~L~~GTtvmhegfvnfNAGTlG~~m----vEGRiS~GVvVg~gsDiGGgaSim-GtLSGGg~~ 249 (341) T TIGR03536 175 VPKGVRIADTARVRLGAYVGEGTTVMHEGFINFNAGTEGPSM----VEGRISAGVMVGKGSDLGGGCSTM-GTLSGGGNI 249 (341) T ss_pred CCCCCEECCCCCEEEEEEECCCEEEECCCEEECCCCCCCCCE----EEEEEECCEEECCCCCCCCCEEEE-EEECCCCCE T ss_conf 678837615663443007758817702214750477766424----530440346966787667750247-763589823 Q ss_pred --CCCCCCHHCCCCC Q ss_conf --2233200001665 Q gi|255764481|r 112 --IVGDNNFFLANSH 124 (271) Q Consensus 112 --~IG~~~~i~~~~h 124 (271) .||.+|++++++- T Consensus 250 ~isiG~~cLlGANsg 264 (341) T TIGR03536 250 VISVGEGCLLGANAG 264 (341) T ss_pred EEEECCCCCCCCCCC T ss_conf 688873220144455 No 152 >TIGR03535 DapD_actino 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. Alternate name: tetrahydrodipicolinate N-succinyltransferase. Probab=96.67 E-value=0.0049 Score=39.49 Aligned_cols=33 Identities=15% Similarity=0.277 Sum_probs=18.1 Q ss_pred CEECCCEEECCCCCCCCCEEEECCCEEECCCCCCC Q ss_conf 32148606523444336506404743501321122 Q gi|255764481|r 147 VIVDDRVVFGGGSAVHQFTRIGKYAFIGGMTGVVH 181 (271) Q Consensus 147 v~Igd~~~IG~~~~v~~~v~IG~~a~ig~gs~V~k 181 (271) +.||++|.+|+++++ ++.+|+.|.|-+|-.||. T Consensus 226 isiG~~cLlGANsGi--GI~LGD~C~VeaGlyita 258 (319) T TIGR03535 226 ISIGERCLLGANSGI--GISLGDDCVVEAGLYVTA 258 (319) T ss_pred EEECCCCEECCCCCC--CCCCCCCCEEECCEEECC T ss_conf 788653200454555--743589717811228817 No 153 >TIGR02092 glgD glucose-1-phosphate adenylyltransferase, GlgD subunit; InterPro: IPR011832 This family is GlgD, an apparent regulatory protein that appears in an alpha2/beta2 heterotetramer with GlgC (glucose-1-phosphate adenylyltransferase, IPR011831 from INTERPRO) in a subset of bacteria that use GlgC for glycogen biosynthesis.; GO: 0005978 glycogen biosynthetic process. Probab=96.28 E-value=0.0039 Score=40.07 Aligned_cols=10 Identities=10% Similarity=0.199 Sum_probs=3.1 Q ss_pred EEECCCCCEE Q ss_conf 5311434223 Q gi|255764481|r 61 FTKVFPMAVL 70 (271) Q Consensus 61 ~~~I~~~a~I 70 (271) ||.|++.|.| T Consensus 335 n~iimq~~~I 344 (383) T TIGR02092 335 NSIIMQRTVI 344 (383) T ss_pred EEEECCCCEE T ss_conf 5477588777 No 154 >TIGR02092 glgD glucose-1-phosphate adenylyltransferase, GlgD subunit; InterPro: IPR011832 This family is GlgD, an apparent regulatory protein that appears in an alpha2/beta2 heterotetramer with GlgC (glucose-1-phosphate adenylyltransferase, IPR011831 from INTERPRO) in a subset of bacteria that use GlgC for glycogen biosynthesis.; GO: 0005978 glycogen biosynthetic process. Probab=94.97 E-value=0.067 Score=32.36 Aligned_cols=15 Identities=7% Similarity=0.324 Sum_probs=5.5 Q ss_pred CHHCCCCCCCCCCCC Q ss_conf 000016655433100 Q gi|255764481|r 117 NFFLANSHVAHDCKL 131 (271) Q Consensus 117 ~~i~~~~hI~Hd~~I 131 (271) |.||..++|+.||.+ T Consensus 336 ~iimq~~~Ig~ga~L 350 (383) T TIGR02092 336 SIIMQRTVIGEGAHL 350 (383) T ss_pred EEECCCCEECCCCEE T ss_conf 477588777878656 No 155 >PRK13412 fkp bifunctional fucokinase/L-fucose-1-P-guanylyltransferase; Provisional Probab=74.81 E-value=5.2 Score=20.56 Aligned_cols=48 Identities=17% Similarity=0.172 Sum_probs=24.7 Q ss_pred CCCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECC Q ss_conf 9975655108889859988989956559889989988989866868916 Q gi|255764481|r 6 NNPIIHPLALVEEGAVIGPNSLIGPFCCVGSEVEIGAGVELISHCVVAG 54 (271) Q Consensus 6 ~~~~IhpsA~I~~~a~Ig~~v~Igp~~~I~~~v~IG~~~~I~~~v~I~g 54 (271) ..+.+||+++|. +++|...+.++..+..-+|+.|+.+..|+++++|.| T Consensus 324 ~~~~~~~a~~v~-ns~i~~~l~~~~~sl~iens~i~~~w~igs~~IltG 371 (974) T PRK13412 324 RKVKPNPAMFVQ-NAVLSGGLDAENATLWIENSHIGSHWKIASRSIITG 371 (974) T ss_pred CCCCCCCCEEEE-CCEECCCCCCCCCEEEEECCCCCCCEEECCCCEEEC T ss_conf 047878745888-366437503578708997533155638657846715 No 156 >pfam03961 DUF342 Protein of unknown function (DUF342). This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length. Probab=74.46 E-value=7.3 Score=19.61 Aligned_cols=27 Identities=11% Similarity=0.130 Sum_probs=14.1 Q ss_pred CEECCCCEECCCCEEEEEEECCCCCEE Q ss_conf 898668689168468655311434223 Q gi|255764481|r 44 VELISHCVVAGKTKIGDFTKVFPMAVL 70 (271) Q Consensus 44 ~~I~~~v~I~g~t~IG~~~~I~~~a~I 70 (271) ..+.+.-.|.+++.+..++.-|.++++ T Consensus 175 i~V~~vl~v~gdVd~~TGNI~F~G~V~ 201 (450) T pfam03961 175 ISVDDVLEIKGDVDLSTGNIDFKGSVI 201 (450) T ss_pred EEEEEEEEECCCCCCCCCCEEEEEEEE T ss_conf 999998997687787734467777799 No 157 >pfam10302 DUF2407 Putative membrane protein (DUF2407). This is a family of proteins found in fungi. The function is not known. There is a characteristic GFDRL sequence motif. Probab=71.65 E-value=8.3 Score=19.27 Aligned_cols=38 Identities=21% Similarity=0.407 Sum_probs=29.4 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHC-----CCCHHHHHHHHHHH Q ss_conf 444288899999999999999727-----99899999999964 Q gi|255764481|r 204 MRRAGFSRDTIHLIRAVYKQIFQQ-----GDSIYKNAGAIREQ 241 (271) Q Consensus 204 lkR~g~s~e~i~~l~~ayr~lfr~-----~~~l~e~l~~l~~~ 241 (271) |.-.|||++++..|+.-|+.+|-. .++-.+.+.+||+. T Consensus 125 LlsaGFs~~EV~~LR~QF~ai~~~~~tpd~~~s~~~lR~LEd~ 167 (252) T pfam10302 125 LLSAGFSPAEVSLLRSQFRAIYGSSDTPDTMPSPADLRQLEER 167 (252) T ss_pred HHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHH T ss_conf 9875899999999999999871566687668888999999999 No 158 >pfam00427 PBS_linker_poly Phycobilisome Linker polypeptide. Probab=70.47 E-value=9.1 Score=19.04 Aligned_cols=53 Identities=17% Similarity=0.241 Sum_probs=34.6 Q ss_pred CCCCHHHH-HHHHHHHHHHHHC-CCCHHHHHHHHHHHCCCCH-HHHHHHHHHHHCC Q ss_conf 28889999-9999999999727-9989999999996425898-9999999986158 Q gi|255764481|r 207 AGFSRDTI-HLIRAVYKQIFQQ-GDSIYKNAGAIREQNVSCP-EVSDIINFIFADR 259 (271) Q Consensus 207 ~g~s~e~i-~~l~~ayr~lfr~-~~~l~e~l~~l~~~~~~~~-~v~~~~~Fi~~s~ 259 (271) ...+.+++ ..|+.+||.+|-. ...-.|++..++.++.+-. .|++|+.-+.+|. T Consensus 3 ~~~s~~~~~~vI~AaYrQVfgn~~~~~~eR~~~lES~lrng~IsVreFVr~la~S~ 58 (131) T pfam00427 3 PNSSEEELEEVIRAAYRQVFGNDHVMESERLTSLESQLRNGEISVREFVRGLAKSE 58 (131) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCHHHHHCCHHHHHHHHCCCCCHHHHHHHHHCCH T ss_conf 99899999999999999997576402312023699998869982999999998269 No 159 >cd04765 HTH_MlrA-like_sg2 Helix-Turn-Helix DNA binding domain of putative MlrA-like transcription regulators. Putative helix-turn-helix (HTH) MlrA-like transcription regulators (subgroup 2), N-terminal domain. The MlrA protein, also known as YehV, has been shown to control cell-cell aggregation by co-regulating the expression of curli and extracellular matrix production in Escherichia coli and Salmonella typhimurium. These proteins belong to the MerR superfamily of transcription regulators that promote expression of several stress regulon genes by reconfiguring the spacer between the -35 and -10 promoter elements. Their conserved N-terminal domains contain predicted HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules. Probab=63.44 E-value=13 Score=18.17 Aligned_cols=38 Identities=16% Similarity=0.346 Sum_probs=32.6 Q ss_pred HCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCC Q ss_conf 42888999999999999997279989999999996425 Q gi|255764481|r 206 RAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNV 243 (271) Q Consensus 206 R~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~ 243 (271) +|-|+++++..|+....++...|.+++.+-+.|.+... T Consensus 36 ~R~Ys~~di~~l~~Ik~Ll~~~G~ti~~~k~~L~~~~~ 73 (99) T cd04765 36 RRYYRPKDVELLLLIKHLLYEKGYTIEGAKQALKEDGA 73 (99) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCC T ss_conf 85068999999999999999879879999999983886 No 160 >pfam07637 PSD5 Protein of unknown function (DUF1595). A family of proteins in Rhodopirellula baltica, associated with pfam07635, pfam07626, pfam07631, pfam07627, and pfam07624. Probab=63.25 E-value=13 Score=18.14 Aligned_cols=42 Identities=12% Similarity=0.200 Sum_probs=36.9 Q ss_pred HCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHH Q ss_conf 428889999999999999972799899999999964258989 Q gi|255764481|r 206 RAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPE 247 (271) Q Consensus 206 R~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~ 247 (271) |+-.+.+++..+.+.|+.....+.+..+++.........+|. T Consensus 16 RRp~t~~e~~~~~~~~~~~~~~g~~~~~a~~~~l~aiL~SP~ 57 (64) T pfam07637 16 RRPLTDAELDRLLALYDAAREAGKDFEEALKEALAAILCSPS 57 (64) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCC T ss_conf 689999999999999999987499999999999999985986 No 161 >cd01999 Argininosuccinate_Synthase Argininosuccinate synthase. The Argininosuccinate synthase is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterized by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity Probab=58.26 E-value=8.2 Score=19.32 Aligned_cols=61 Identities=8% Similarity=0.084 Sum_probs=39.0 Q ss_pred CCHHHHCC-CCHHHHHHHHHHHHHHHHCCCC---------HHHHHHHHHHH-CCCCHHHHHHHHHHHHCCCC Q ss_conf 20144428-8899999999999999727998---------99999999964-25898999999998615888 Q gi|255764481|r 201 VVAMRRAG-FSRDTIHLIRAVYKQIFQQGDS---------IYKNAGAIREQ-NVSCPEVSDIINFIFADRKR 261 (271) Q Consensus 201 ~~glkR~g-~s~e~i~~l~~ayr~lfr~~~~---------l~e~l~~l~~~-~~~~~~v~~~~~Fi~~s~rR 261 (271) .+|||-|| |..---..|..|+|.|-.--++ +.+...++.-+ ..-+|+.+.+-.|+..++++ T Consensus 260 ~vG~KsRgvYEaPg~tiL~~AHr~LE~~~Ldr~~~~~K~~l~~~~a~lvY~G~wfsPlr~~l~afi~~tq~~ 331 (385) T cd01999 260 VIGIKSREVYEAPGATILIKAHRDLESLTLDREVLHFKDIVDPKYAELVYNGLWFSPLREALEAFIDKTQER 331 (385) T ss_pred EECEEEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCE T ss_conf 001122005635538899999999999857799999999999999999854355788899999999986384 No 162 >smart00335 ANX Annexin repeats. Probab=55.06 E-value=17 Score=17.27 Aligned_cols=27 Identities=19% Similarity=0.411 Sum_probs=22.9 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 88999999999999997279989999999 Q gi|255764481|r 209 FSRDTIHLIRAVYKQIFQQGDSIYKNAGA 237 (271) Q Consensus 209 ~s~e~i~~l~~ayr~lfr~~~~l~e~l~~ 237 (271) .++.++..++++|+..| +.+|.++++. T Consensus 14 Rs~~ql~~I~~~Y~~~y--~~~L~~~i~~ 40 (53) T smart00335 14 RSNAQLQAIKQAYKKRY--GKDLEDDIKS 40 (53) T ss_pred CCHHHHHHHHHHHHHHH--CCCHHHHHHH T ss_conf 99999999999999883--9789999887 No 163 >PRK00509 argininosuccinate synthase; Provisional Probab=54.95 E-value=16 Score=17.42 Aligned_cols=61 Identities=7% Similarity=0.034 Sum_probs=39.5 Q ss_pred CCHHHHCC-CCHHHHHHHHHHHHHHHHCCC---------CHHHHHHHHHHH-CCCCHHHHHHHHHHHHCCCC Q ss_conf 20144428-889999999999999972799---------899999999964-25898999999998615888 Q gi|255764481|r 201 VVAMRRAG-FSRDTIHLIRAVYKQIFQQGD---------SIYKNAGAIREQ-NVSCPEVSDIINFIFADRKR 261 (271) Q Consensus 201 ~~glkR~g-~s~e~i~~l~~ayr~lfr~~~---------~l~e~l~~l~~~-~~~~~~v~~~~~Fi~~s~rR 261 (271) .+|||-|+ |..---..|.+|+|.|-.--+ .+.+...+|.-+ ..-+|+.+.+-.||..++++ T Consensus 263 ~vGiKsRgvYEaPaa~iL~~AHr~LE~~~Ldr~~~~~K~~l~~~~a~lvY~G~wfsP~~~~l~afi~~sQ~~ 334 (398) T PRK00509 263 LVGIKSREVYETPGGTILIKAHRALESLTLDREVAHFKDELEPKYAELVYNGLWFSPLREALQAFIDETQEH 334 (398) T ss_pred EEEEEEEEEEECCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC T ss_conf 232321004516548899999999999858799999999999999999863475788899999999985191 No 164 >COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only] Probab=52.70 E-value=19 Score=17.03 Aligned_cols=52 Identities=12% Similarity=0.149 Sum_probs=27.2 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHCCC---CHHHHHHHHHH Q ss_conf 44428889999999999999972--799899999999964258---98999999998 Q gi|255764481|r 204 MRRAGFSRDTIHLIRAVYKQIFQ--QGDSIYKNAGAIREQNVS---CPEVSDIINFI 255 (271) Q Consensus 204 lkR~g~s~e~i~~l~~ayr~lfr--~~~~l~e~l~~l~~~~~~---~~~v~~~~~Fi 255 (271) +...-|++++|+.+++||.++.+ .+.--...+..+...++. .+++.++++=+ T Consensus 9 ~~~~~~t~~qi~~lkeaF~l~D~d~~G~I~~~el~~ilr~lg~~~s~~ei~~l~~~~ 65 (160) T COG5126 9 LTFTQLTEEQIQELKEAFQLFDRDSDGLIDRNELGKILRSLGFNPSEAEINKLFEEI 65 (160) T ss_pred HHCCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHC T ss_conf 211558999999999999773767899873899999999838999689999999854 No 165 >cd01105 HTH_GlnR-like Helix-Turn-Helix DNA binding domain of GlnR-like transcription regulators. Helix-turn-helix (HTH) transcription regulator GlnR and related proteins, N-terminal domain. The GlnR and TnrA (also known as ScgR) proteins have been shown to regulate expression of glutamine synthetase as well as several genes involved in nitrogen metabolism. These proteins share the N-terminal DNA binding domain with other transcription regulators of the MerR superfamily that promote transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their conserved N-terminal domains contain predicted winged HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules. Probab=50.39 E-value=21 Score=16.80 Aligned_cols=46 Identities=9% Similarity=0.213 Sum_probs=35.7 Q ss_pred HCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 42888999999999999997279989999999996425898999999 Q gi|255764481|r 206 RAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEVSDII 252 (271) Q Consensus 206 R~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v~~~~ 252 (271) +|-|+.+++..|+.+-.++ ..|.++++..+.|+++..+.+..++|. T Consensus 37 ~R~Ys~~dl~~l~~I~~l~-~~G~sl~~i~~~L~~~~~~~e~~~~~~ 82 (88) T cd01105 37 QRKYSLADVDRLLVIKELL-DEGFTLAAAVEKLRRRRVQAEVRRRLM 82 (88) T ss_pred CEECCHHHHHHHHHHHHHH-HCCCCHHHHHHHHHHCCCHHHHHHHHH T ss_conf 2147999999999999999-979989999999998577099999999 No 166 >TIGR00423 TIGR00423 conserved hypothetical protein TIGR00423; InterPro: IPR005244 This entry describes a family of conserved hypothetical proteins with no known function. . Probab=46.95 E-value=17 Score=17.32 Aligned_cols=50 Identities=14% Similarity=0.210 Sum_probs=27.0 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCH-HHHHHHHHHHCCCC-H--HHHHHHHHHHHCCCCCCCCC Q ss_conf 2888999999999999997279989-99999999642589-8--99999999861588873787 Q gi|255764481|r 207 AGFSRDTIHLIRAVYKQIFQQGDSI-YKNAGAIREQNVSC-P--EVSDIINFIFADRKRPLSNW 266 (271) Q Consensus 207 ~g~s~e~i~~l~~ayr~lfr~~~~l-~e~l~~l~~~~~~~-~--~v~~~~~Fi~~s~rRgi~~~ 266 (271) .+||+.|+.-+-+- .++++ +|+|+.|++.-.++ | -++-|.|-+ ||-|||- T Consensus 96 HafSp~Ev~f~A~~------~~~~~e~EvL~~LK~aGL~SmPGtgAEILdD~v----Rr~IcP~ 149 (331) T TIGR00423 96 HAFSPMEVYFLAKN------EGLSIEKEVLKRLKKAGLDSMPGTGAEILDDSV----RRKICPN 149 (331) T ss_pred ECCCHHHHHHHHHH------CCCCHHHHHHHHHHHHCCCCCCCCCEEECCHHH----HHHHCCC T ss_conf 14686899999986------189788999999888503567776226530335----8754779 No 167 >pfam00764 Arginosuc_synth Arginosuccinate synthase. This family contains a PP-loop motif. Probab=44.73 E-value=25 Score=16.26 Aligned_cols=61 Identities=8% Similarity=0.086 Sum_probs=38.8 Q ss_pred CCHHHHCC-CCHHHHHHHHHHHHHHHHCCC---------CHHHHHHHHHHH-CCCCHHHHHHHHHHHHCCCC Q ss_conf 20144428-889999999999999972799---------899999999964-25898999999998615888 Q gi|255764481|r 201 VVAMRRAG-FSRDTIHLIRAVYKQIFQQGD---------SIYKNAGAIREQ-NVSCPEVSDIINFIFADRKR 261 (271) Q Consensus 201 ~~glkR~g-~s~e~i~~l~~ayr~lfr~~~---------~l~e~l~~l~~~-~~~~~~v~~~~~Fi~~s~rR 261 (271) .+|||-|+ |..---..|.+|+|.|-.--+ .+.+...++.-+ ..-+|+.+.+-.|+..++++ T Consensus 257 ~vGiKsRgvYEaPgatiL~~AHr~LE~l~Ldr~~~~~K~~l~~~~a~lvY~G~wfsP~r~~l~afi~~~q~~ 328 (389) T pfam00764 257 LVGLKSREIYEAPAATVLITAHRDLENLTLTREVLRFKRIVDQKYAELVYDGLWFSPLKEALDAFIDKTQER 328 (389) T ss_pred EEEEEECCEECCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC T ss_conf 000230303118169999999999999847698999999999999998755463788999999999996092 No 168 >TIGR00032 argG argininosuccinate synthase; InterPro: IPR001518 Argininosuccinate synthase (6.3.4.5 from EC) (AS) is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate , . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterised by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.; GO: 0004055 argininosuccinate synthase activity, 0005524 ATP binding, 0006526 arginine biosynthetic process. Probab=44.53 E-value=25 Score=16.24 Aligned_cols=60 Identities=12% Similarity=0.111 Sum_probs=42.9 Q ss_pred CHHHHCC-CCHHHHHHHHHHHHHHHH---------CCC-CHHHHHHHHHHH-CCCCHHHHHH-HHHHHHCCCC Q ss_conf 0144428-889999999999999972---------799-899999999964-2589899999-9998615888 Q gi|255764481|r 202 VAMRRAG-FSRDTIHLIRAVYKQIFQ---------QGD-SIYKNAGAIREQ-NVSCPEVSDI-INFIFADRKR 261 (271) Q Consensus 202 ~glkR~g-~s~e~i~~l~~ayr~lfr---------~~~-~l~e~l~~l~~~-~~~~~~v~~~-~~Fi~~s~rR 261 (271) +|||-|+ +..--...|.+|||.|-. ... ..++.+.+|--+ ++..|+...| =.||+.+++| T Consensus 283 ig~KSRe~YE~Pg~~~L~~AH~~Le~L~l~~~~~~f~~s~~~~~y~~LiY~GlwfdPl~e~LR~a~i~~~Q~~ 355 (420) T TIGR00032 283 IGLKSREIYEAPGATLLIKAHRDLEQLTLTRDVLRFKDSIVEEKYSELIYNGLWFDPLKEALRDAFIRKTQER 355 (420) T ss_pred ECCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 3132454010157999999999986111118999998864115789998620147807899999998887740 No 169 >PRK04527 argininosuccinate synthase; Provisional Probab=44.42 E-value=24 Score=16.35 Aligned_cols=61 Identities=10% Similarity=0.091 Sum_probs=38.7 Q ss_pred CCHHHHCC-CCHHHHHHHHHHHHHHHHCCC---------CHHHHHHHHHHH-CCCCHHHHHHHHHHHHCCCC Q ss_conf 20144428-889999999999999972799---------899999999964-25898999999998615888 Q gi|255764481|r 201 VVAMRRAG-FSRDTIHLIRAVYKQIFQQGD---------SIYKNAGAIREQ-NVSCPEVSDIINFIFADRKR 261 (271) Q Consensus 201 ~~glkR~g-~s~e~i~~l~~ayr~lfr~~~---------~l~e~l~~l~~~-~~~~~~v~~~~~Fi~~s~rR 261 (271) .+|||-|+ |..---..|..|+|.|-.--+ .+.+...+|.-+ ..-+|+.+.+-.||..++++ T Consensus 264 ~vG~KsRgvYEaPa~~iL~~AHr~LE~~~ld~~~~~~K~~l~~~~a~lvY~G~WfsPlr~~l~afi~~sq~~ 335 (397) T PRK04527 264 VIGLKGRIVFEAPGLVSLLTAHRALEDAVLTKQQNRFKPDVARKWVELVYEGFYHDPLKTDIEAFLKSSQAK 335 (397) T ss_pred EEEEECCEEEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 232341514338189999999999998717899999999988999886507750788999999999997396 No 170 >PRK13820 argininosuccinate synthase; Provisional Probab=43.04 E-value=27 Score=16.09 Aligned_cols=61 Identities=15% Similarity=0.145 Sum_probs=38.8 Q ss_pred CCHHHHCC-CCHHHHHHHHHHHHHHHHCCC---------CHHHHHHHHHHH-CCCCHHHHHHHHHHHHCCCC Q ss_conf 20144428-889999999999999972799---------899999999964-25898999999998615888 Q gi|255764481|r 201 VVAMRRAG-FSRDTIHLIRAVYKQIFQQGD---------SIYKNAGAIREQ-NVSCPEVSDIINFIFADRKR 261 (271) Q Consensus 201 ~~glkR~g-~s~e~i~~l~~ayr~lfr~~~---------~l~e~l~~l~~~-~~~~~~v~~~~~Fi~~s~rR 261 (271) .+|||-|+ |..---..|..|+|.|-.--+ .+.+...+|.-+ ..-+|+.+.+-.|+..++++ T Consensus 261 ~vGiKsR~vYEaPaatiL~~AHr~LE~~tLdr~~~~~K~~l~~~~a~lvY~G~wfsPlr~~l~afi~~sq~~ 332 (395) T PRK13820 261 VLGLKARENYEHPAATVLLTAHKALEQLVLTREELKFKEIVDSKWAELAYKGLVHEPLRDDLNAFIDKTQER 332 (395) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC T ss_conf 011322630058189999999999998627788999999999999999856763788999999999985092 No 171 >cd07316 terB_like_DjlA N-terminal tellurium resistance protein terB-like domain of heat shock DnaJ-like proteins. Tellurium resistance terB-like domain of the DnaJ-like DjlA proteins. This family represents the terB-like domain of DjlA-like proteins, a subgroup of heat shock DnaJ-like proteins. Escherichia coli DjlA is a type III membrane protein with a small N-terminal transmembrane region and DnaJ-like domain on the extreme C-terminus. Overproduction has been shown to activate the RcsC pathway, which regulates the production of the capsular exopolysaccharide colanic acid. The specific function of this domain is unknown. Probab=42.94 E-value=27 Score=16.09 Aligned_cols=54 Identities=13% Similarity=0.226 Sum_probs=40.2 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCH-HHHHHHHHHHH Q ss_conf 4442888999999999999997279989999999996425898-99999999861 Q gi|255764481|r 204 MRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCP-EVSDIINFIFA 257 (271) Q Consensus 204 lkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~-~v~~~~~Fi~~ 257 (271) |++.+++.++.+.+.+.|+..-....++++-+.++...+...| ....+++|+-. T Consensus 28 ~~~~~~~~~~~~~a~~~f~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~ll~~L~~ 82 (106) T cd07316 28 MDQMGLDAEARREAIRLFNEGKESDFGLEEYARQFRRACGGRPELLLQLLEFLFQ 82 (106) T ss_pred HHHCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 8753799999999999999875267789999999999876899999999999999 No 172 >PRK13340 alanine racemase; Reviewed Probab=42.13 E-value=18 Score=17.15 Aligned_cols=35 Identities=23% Similarity=0.310 Sum_probs=24.1 Q ss_pred HHCCCCCC-CCCCCHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 10745323-344201444288899999999999999 Q gi|255764481|r 190 NGNPGALR-GVNVVAMRRAGFSRDTIHLIRAVYKQI 224 (271) Q Consensus 190 ~G~pa~~~-~~N~~glkR~g~s~e~i~~l~~ayr~l 224 (271) .|.|.++. .+|+.||.|.||+.++...+.++.++. T Consensus 153 ~~~~~~vHlkiDTtGM~RlG~~~~~~~~~~~~~~i~ 188 (404) T PRK13340 153 NGKPIDIHLALNSGGMGRNGLDPSTFRGAWEAVRIA 188 (404) T ss_pred HCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 199547999981688666798810289999999996 No 173 >TIGR02044 CueR Cu(I)-responsive transcriptional regulator; InterPro: IPR011789 This entry represents the copper-, silver- and gold- (I) responsive transcriptional activator of the gamma proteobacterial copper efflux system . This protein is a member of the MerR family of transcriptional activators and contains a distinctive pattern of cysteine residues in its metal binding loop, Cys-X7-Cys. This family also lacks a conserved cysteine at the N-terminal end of the dimerization helix which is required for the binding of divalent metals such as zinc; here it is replaced by a serine residue .; GO: 0003677 DNA binding, 0005507 copper ion binding, 0016563 transcription activator activity, 0045941 positive regulation of transcription. Probab=40.15 E-value=30 Score=15.82 Aligned_cols=56 Identities=13% Similarity=0.250 Sum_probs=28.8 Q ss_pred CCCCH-HHHCCCCHHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 44201-4442888999999999999997279-98999999999642589899999999 Q gi|255764481|r 199 VNVVA-MRRAGFSRDTIHLIRAVYKQIFQQG-DSIYKNAGAIREQNVSCPEVSDIINF 254 (271) Q Consensus 199 ~N~~g-lkR~g~s~e~i~~l~~ayr~lfr~~-~~l~e~l~~l~~~~~~~~~v~~~~~F 254 (271) |+.+- -|+.|||-|++++|-..|.-=-|.- ..-+-+++++.|-.....+.+.|-+- T Consensus 47 L~li~raR~~GFSLeEc~~ll~L~ndp~R~SAdVK~rtl~k~~E~~~kI~eL~~m~~q 104 (127) T TIGR02044 47 LRLISRARSVGFSLEECKELLSLWNDPNRTSADVKARTLEKVAELERKISELQEMKDQ 104 (127) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8999999871898899999987522888988788999998899999999999889999 No 174 >pfam03586 Herpes_UL36 Herpesvirus UL36 tegument protein. The UL36 open reading frame (ORF) encodes the largest herpes simplex virus type 1 (HSV-1) protein, a 270-kDa polypeptide designated VP1/2, which is also a component of the virion tegument. A null mutation in the UL36 gene of herpes simplex virus type 1 results in accumulation of unenveloped DNA-filled capsids in the cytoplasm of infected cells. This family only covers a small central part of this large protein. Probab=40.10 E-value=29 Score=15.92 Aligned_cols=71 Identities=17% Similarity=0.132 Sum_probs=36.3 Q ss_pred CCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHC Q ss_conf 331000107453233442014442888999999999999997279989999999996425898999999998615 Q gi|255764481|r 184 IPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEVSDIINFIFAD 258 (271) Q Consensus 184 pp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v~~~~~Fi~~s 258 (271) +.|..++-.-+.+-|++..+|.|.--=.+.|...-.+ -.+.....+.+..+..+..+.|+..+++||.+++ T Consensus 96 daF~~aAp~y~~LFgvdve~L~~L~rIa~~iL~~a~a----~~G~idy~~~V~~l~~dL~~vP~L~kYVdFYrrg 166 (253) T pfam03586 96 DAFPAAAPYYETLFGVDVEPLMRLLRIAGGILEHAAA----GGGNIDYYDAVGTLSGDLLAVPELAKYVDFYRRG 166 (253) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH----CCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 7656667899999756589999999999999999870----3898869999999899986283698899999954 No 175 >PRK09430 djlA Dna-J like membrane chaperone protein; Provisional Probab=39.37 E-value=31 Score=15.74 Aligned_cols=13 Identities=15% Similarity=0.447 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999999972 Q gi|255764481|r 214 IHLIRAVYKQIFQ 226 (271) Q Consensus 214 i~~l~~ayr~lfr 226 (271) ..++|++||.|-+ T Consensus 216 ~~eik~ayr~l~~ 228 (269) T PRK09430 216 DQEIKRAYRKLMS 228 (269) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999998 No 176 >TIGR02120 GspF general secretion pathway protein F; InterPro: IPR011850 This membrane protein is a component of the terminal branch complex of the general secretion pathway (GSP), also known as the "Type II" secretion pathway. The GSP transports proteins (generally virulence-associated cell wall hydrolases) across the outer membrase of the bacterial cell. Transport across the inner membrane is often, but not exclusively handled by the Sec system. GspF nearly always clusters with other GSP subunits. However, some genes from Xylella and Xanthomonas strains are excluded from this entry due to excessive divergence.; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=39.24 E-value=31 Score=15.73 Aligned_cols=55 Identities=18% Similarity=0.260 Sum_probs=47.0 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHC Q ss_conf 14442888999999999999997279989999999996425898999999998615 Q gi|255764481|r 203 AMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEVSDIINFIFAD 258 (271) Q Consensus 203 glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v~~~~~Fi~~s 258 (271) -+.|++.|..+...+-+=--.|---+.+++|+|..+.++.. .|.+++++.-|++. T Consensus 67 ~~~~~~ls~~~la~~TRQLATLl~AglPLeeaL~~l~~Q~e-~~~~~~~l~~iR~~ 121 (414) T TIGR02120 67 KLLRRRLSRAELALFTRQLATLLGAGLPLEEALAALLEQAE-KPRLKSVLAAIRSR 121 (414) T ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CHHHHHHHHHHHHH T ss_conf 63168989999999999999999853409999999996259-57899999999999 No 177 >cd04396 RhoGAP_fSAC7_BAG7 RhoGAP_fSAC7_BAG7: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of fungal SAC7 and BAG7-like proteins. Both proteins are GTPase activating proteins of Rho1, but differ functionally in vivo: SAC7, but not BAG7, is involved in the control of Rho1-mediated activation of the PKC-MPK1 pathway. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=37.57 E-value=20 Score=16.94 Aligned_cols=49 Identities=14% Similarity=0.007 Sum_probs=33.8 Q ss_pred CCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHH Q ss_conf 1321122333310001074532334420144428889999999999999 Q gi|255764481|r 175 GMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQ 223 (271) Q Consensus 175 ~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~ 223 (271) |++.++.-||....--=.--+.+|++..|+-|..=+...++.|+++|.. T Consensus 24 ~~~~~~~~IP~iV~~Ci~~le~~GL~~EGIfRvsG~~~~I~~Lk~~fd~ 72 (225) T cd04396 24 GEQYVYGYIPVVVAKCGVYLKENATEVEGIFRVAGSSKRIRELQLIFST 72 (225) T ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEECCCCHHHHHHHHHHHCC T ss_conf 7657799889499999999997499988802689879999999999766 No 178 >PRK04156 gltX glutamyl-tRNA synthetase; Provisional Probab=37.46 E-value=22 Score=16.69 Aligned_cols=50 Identities=20% Similarity=0.282 Sum_probs=26.0 Q ss_pred CEEEECCCEEECCCCCCCC-----CCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHH Q ss_conf 5064047435013211223-----33310001074532334420144428889999999 Q gi|255764481|r 164 FTRIGKYAFIGGMTGVVHD-----VIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLI 217 (271) Q Consensus 164 ~v~IG~~a~ig~gs~V~kd-----Vpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l 217 (271) +..-|+..|-+. +.+|- |.. ..+.|- --++-..+-||||||+++|.|+.. T Consensus 325 ~~~fgRln~~~~--vlSKrkl~~lVe~-g~v~GW-DDPRlpTl~glrRRG~~peAir~f 379 (566) T PRK04156 325 YIHYGRLKIEGF--VLSTSKIRKGIES-GEYSGW-DDPRLPTLRALRRRGILPEAIREL 379 (566) T ss_pred EEEEEEEECCCE--EEEHHHHHHHHHC-CCCCCC-CCCCHHHHHHHHHCCCCHHHHHHH T ss_conf 577767751531--3108899999862-988888-788427789998679999999999 No 179 >pfam00191 Annexin Annexin. This family of annexins also includes giardin that has been shown to function as an annexin. Probab=36.76 E-value=34 Score=15.48 Aligned_cols=27 Identities=26% Similarity=0.535 Sum_probs=22.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 88999999999999997279989999999 Q gi|255764481|r 209 FSRDTIHLIRAVYKQIFQQGDSIYKNAGA 237 (271) Q Consensus 209 ~s~e~i~~l~~ayr~lfr~~~~l~e~l~~ 237 (271) .++.++..++++|+..| +.+|.++++. T Consensus 27 rs~~~l~~i~~~Y~~~y--g~~L~~~i~~ 53 (66) T pfam00191 27 RSNAQLQAIREAYKKLY--GKDLEKDIKS 53 (66) T ss_pred CCHHHHHHHHHHHHHHH--CCCHHHHHHH T ss_conf 89999999999999986--9789999987 No 180 >pfam03556 DUF298 Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif. This domain is implicated in some aspect of neddylation of the cullin 3 family and has a possible role in the regulation of the protein modifier Nedd8 E3 ligase. Neddylation is the process by which the C-terminal glycine of the ubiquitin-like protein Nedd8 is covalently linked to lysine residues in a protein through an isopeptide bond. Probab=36.17 E-value=34 Score=15.43 Aligned_cols=49 Identities=16% Similarity=0.231 Sum_probs=33.9 Q ss_pred HHHHHHHHHHH-------CCCCHHHHHHHHHHHCCC--CHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999972-------799899999999964258--98999999998615888737 Q gi|255764481|r 216 LIRAVYKQIFQ-------QGDSIYKNAGAIREQNVS--CPEVSDIINFIFADRKRPLS 264 (271) Q Consensus 216 ~l~~ayr~lfr-------~~~~l~e~l~~l~~~~~~--~~~v~~~~~Fi~~s~rRgi~ 264 (271) .-++.|+..|. +.+.++.|++-.+--+.. .|.+...++|++...+|+|. T Consensus 21 ~Fk~~Y~f~F~~~~~~~qk~l~~e~Ai~~W~lll~~~~~~~l~~W~~Fl~~~~~k~Is 78 (117) T pfam03556 21 KFKDLYRFTFDFAREEGQKSLDLETAIEYWKLLFGSRTFPLLDQWIEFLEEKHKKSIS 78 (117) T ss_pred HHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCC T ss_conf 9999999999997585658764999999999982888756899999999875677867 No 181 >pfam01244 Peptidase_M19 Membrane dipeptidase (Peptidase family M19). Probab=34.68 E-value=35 Score=15.34 Aligned_cols=23 Identities=22% Similarity=0.365 Sum_probs=15.7 Q ss_pred CHHHHCCCCHHHHHHH--HHHHHHH Q ss_conf 0144428889999999--9999999 Q gi|255764481|r 202 VAMRRAGFSRDTIHLI--RAVYKQI 224 (271) Q Consensus 202 ~glkR~g~s~e~i~~l--~~ayr~l 224 (271) ..|.|+||++++|+.+ .+++|+| T Consensus 290 ~~L~~rG~se~~i~ki~g~N~lRvl 314 (316) T pfam01244 290 AELLRRGYSEAEIEKILGGNWLRVL 314 (316) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 9999869899999999856399984 No 182 >PTZ00184 calmodulin; Provisional Probab=34.54 E-value=36 Score=15.26 Aligned_cols=47 Identities=13% Similarity=0.058 Sum_probs=20.2 Q ss_pred HHHHHHHHHHHH--CCCCHHHHHHHHHHHCC---CCHHHHHHHHHHHHCCCC Q ss_conf 999999999972--79989999999996425---898999999998615888 Q gi|255764481|r 215 HLIRAVYKQIFQ--QGDSIYKNAGAIREQNV---SCPEVSDIINFIFADRKR 261 (271) Q Consensus 215 ~~l~~ayr~lfr--~~~~l~e~l~~l~~~~~---~~~~v~~~~~Fi~~s~rR 261 (271) ..+++||+.+=+ +|.--.+.+.++..... ...++++++..+.....+ T Consensus 84 ~~l~~aF~~~D~d~~G~I~~~el~~~l~~lg~~~s~~ei~~l~~~~D~d~dG 135 (149) T PTZ00184 84 EEIKEAFRVFDKDGNGFISAAELRHVMTNLGEKLTDEEVDEMIREADVDGDG 135 (149) T ss_pred HHHHHHHHHHCCCCCCEECHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCC T ss_conf 9999998731689998684999999999858988699999999987889999 No 183 >pfam07319 DnaI_N Primosomal protein DnaI N-terminus. This family represents the N-terminus (approximately 120 residues) of bacterial primosomal DnaI proteins, although one family member appears to be of viral origin. DnaI is one of the components of the Bacillus subtilis replication restart primosome, and is required for the DnaB75-dependent loading of the DnaC helicase. Probab=34.37 E-value=37 Score=15.25 Aligned_cols=36 Identities=17% Similarity=0.353 Sum_probs=24.3 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 9999999972799899999999964258989999999 Q gi|255764481|r 217 IRAVYKQIFQQGDSIYKNAGAIREQNVSCPEVSDIIN 253 (271) Q Consensus 217 l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v~~~~~ 253 (271) +.++.+.+.. ...+.++.+++.++..+.|+|+.|+. T Consensus 4 I~~~l~~~~~-~~~~~~r~e~l~~~vl~dpdVqaFl~ 39 (93) T pfam07319 4 IGETLKKLMQ-NRDFQKRYEQLKKEVLKDPDVQAFLQ 39 (93) T ss_pred HHHHHHHHHH-CHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 7999999983-30077999999999973998999999 No 184 >cd04781 HTH_MerR-like_sg6 Helix-Turn-Helix DNA binding domain of putative transcription regulators from the MerR superfamily. Putative helix-turn-helix (HTH) MerR-like transcription regulators (subgroup 6) with at least two conserved cysteines present in the C-terminal portion of the protein. Based on sequence similarity, these proteins are predicted to function as transcription regulators that mediate responses to stress in eubacteria. They belong to the MerR superfamily of transcription regulators that promote transcription of various stress regulons by reconfiguring the operator sequence located between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, an Probab=34.18 E-value=20 Score=16.92 Aligned_cols=52 Identities=17% Similarity=0.196 Sum_probs=24.8 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 4442888999999999999997279989999999996425898999999998 Q gi|255764481|r 204 MRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEVSDIINFI 255 (271) Q Consensus 204 lkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v~~~~~Fi 255 (271) +|+.|||-++|+.+-.....--.....+++.+++++++..+-...+..++-+ T Consensus 52 ~r~lGfsL~EI~~ll~~~~~~~~~~~~l~~~~~~i~~~I~~L~~~r~~L~~~ 103 (120) T cd04781 52 GRAAGFSLDEIQAMLSHDGKPPIDRQLLKAKAAELDQQIQRLQAMRELLRHV 103 (120) T ss_pred HHHCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9995999999999884268983799999999999999999999999999998 No 185 >TIGR02589 cas_Csd2 CRISPR-associated protein, Csd2 family; InterPro: IPR013418 This entry represents one of two closely related subfamilies that belong to the larger family of CRISPR-associated protein TM1801. Members are the Csd2 proteins of the Dvulg subtype of the CRISPR/cas system. CRISPR stands for Clustered Regularly Interspaced Short Palindromic Repeats. A related entry is IPR013419 from INTERPRO, the Csh2 protein of the Hmari CRISPR subtype.. Probab=33.33 E-value=38 Score=15.14 Aligned_cols=43 Identities=21% Similarity=0.545 Sum_probs=31.9 Q ss_pred CCCCCCCCHHHHCCCCCCC-CCCCHHHHCCCCHHHHHHHHHHHHHHHHC Q ss_conf 2233331000107453233-44201444288899999999999999727 Q gi|255764481|r 180 VHDVIPYGILNGNPGALRG-VNVVAMRRAGFSRDTIHLIRAVYKQIFQQ 227 (271) Q Consensus 180 ~kdVpp~~~~~G~pa~~~~-~N~~glkR~g~s~e~i~~l~~ayr~lfr~ 227 (271) +|..-||.+|. .+| +|.-=-++-|||.|+...|+++.-.||.. T Consensus 190 ~Kh~V~YGlY~-----~~Gfisa~lA~~TGFs~eDl~~~~~~L~~lFE~ 233 (307) T TIGR02589 190 RKHTVDYGLYV-----AHGFISAQLAEKTGFSDEDLELIKEALVNLFEN 233 (307) T ss_pred CCCEECCCEEE-----EEECCCHHHHHHCCCCHHHHHHHHHHHHHHHCC T ss_conf 12220561478-----860107577742488888999999998634005 No 186 >PRK05347 glutaminyl-tRNA synthetase; Provisional Probab=32.77 E-value=26 Score=16.13 Aligned_cols=15 Identities=27% Similarity=0.782 Sum_probs=6.2 Q ss_pred CCCHHHHCCCCHHHH Q ss_conf 420144428889999 Q gi|255764481|r 200 NVVAMRRAGFSRDTI 214 (271) Q Consensus 200 N~~glkR~g~s~e~i 214 (271) .+-||||||++++.| T Consensus 295 Tl~glrRRG~~peAi 309 (556) T PRK05347 295 TISGLRRRGYTPESI 309 (556) T ss_pred HHHHHHHCCCCHHHH T ss_conf 668887569998999 No 187 >TIGR01270 Trp_5_monoox tryptophan 5-monooxygenase; InterPro: IPR005963 Tryptophan 5-monooxygenase (1.14.16.4 from EC) is a member of the family of tetrameric, biopterin-dependent aromatic amino acid hydroxylases found in metazoans. It is closely related to tetrameric phenylalanine-4-hydroxylase and tyrosine 3-monooxygenase, and more distantly related to the monomeric phenylalanine-4-hydroxylase found in some Gram-negative bacteria. It is the rate-limiting enzyme in the biosynthesis of serotonin in the central nervous system and catalyzes the first step of the synthesis of melatonin in the pineal gland .L-tryptophan + tetrahydropteridine + O2 = 5-hydroxy-L-tryptophan + dihydropteridine + H2O ; GO: 0004510 tryptophan 5-monooxygenase activity, 0042427 serotonin biosynthetic process. Probab=32.67 E-value=30 Score=15.75 Aligned_cols=42 Identities=19% Similarity=0.397 Sum_probs=29.2 Q ss_pred CCHHHHHHHHHHHC----CCCHHHHHHHHHHHHC---CCCCCCCCCCCC Q ss_conf 98999999999642----5898999999998615---888737876348 Q gi|255764481|r 229 DSIYKNAGAIREQN----VSCPEVSDIINFIFAD---RKRPLSNWGNSK 270 (271) Q Consensus 229 ~~l~e~l~~l~~~~----~~~~~v~~~~~Fi~~s---~rRgi~~~~~~~ 270 (271) ....+++..|+... .+.|.++++++|+++- +=|+++.|.++| T Consensus 238 ~efl~Nl~lL~~~cGY~E~niPQL~dvs~FLK~kTGF~~RPVAGYLSaR 286 (499) T TIGR01270 238 KEFLDNLPLLEKYCGYREDNIPQLEDVSKFLKAKTGFRLRPVAGYLSAR 286 (499) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCEECCCCCCCCHH T ss_conf 5665105778752688768775278999996322585425635656643 No 188 >TIGR02225 recomb_XerD tyrosine recombinase XerD; InterPro: IPR011932 Proteins in this entry are part of the wider so-called "phage" integrase fmaily which describes a number of recombinases with tyrosine active sites that transiently bind covalently to DNA. Many are associated with mobile DNA elements, including phage, transposons, and phase variation loci. This entry represents the chromosomally-encoded polypeptide XerD, which is closely related to, and interacts with, another chromosomally-encoded polypeptide, XerC (IPR011931 from INTERPRO). These proteins are site-specific tyrosine recombinases, which act by catalyzing the cutting and rejoining of the recombining DNA molecules . They form a cyclic heterotetrameric complex composed of two molecules of XerC and two molecules of XerD. The XerC-XerD complex is essential to convert dimers of the bacterial chromosome into monomers to permit their segregation at cell division. It also contributes to the segregational stability of plasmids. XerD is composed of two domains, an N-terminal alpha-helical domain, and a C-terminal domain composed mostly of alpha helices, with a three-stranded antiprallel beta sheet along one edge . The active site is found at the extreme C-terminal end of the polypeptide. Most of the C-terminal domain is structurally homologous to that of other "phage" integrases, but the active site regions of these enzymes are stucturally distinct.; GO: 0003677 DNA binding, 0006310 DNA recombination, 0007049 cell cycle, 0007059 chromosome segregation, 0015074 DNA integration, 0051301 cell division. Probab=30.80 E-value=42 Score=14.88 Aligned_cols=61 Identities=16% Similarity=0.252 Sum_probs=40.1 Q ss_pred CHHHHCCCCHH----HHHHHHHHHHHHHHCCCCHHHHHHH-HH-----HHCCC---CHHHHHHHHHH-HHCCCCC Q ss_conf 01444288899----9999999999997279989999999-99-----64258---98999999998-6158887 Q gi|255764481|r 202 VAMRRAGFSRD----TIHLIRAVYKQIFQQGDSIYKNAGA-IR-----EQNVS---CPEVSDIINFI-FADRKRP 262 (271) Q Consensus 202 ~glkR~g~s~e----~i~~l~~ayr~lfr~~~~l~e~l~~-l~-----~~~~~---~~~v~~~~~Fi-~~s~rRg 262 (271) ..|+++|+++. .|..|+..|+.|++.+..-.+=.+. |+ ...++ -+||..|++.- ..+..-| T Consensus 53 ~~l~~~g~~~~S~AR~lsalR~Fy~fL~~e~~~~~dP~~~~~~~PK~~~~LP~vLt~~EV~~LL~~p~~~~~~~g 127 (305) T TIGR02225 53 AELKEAGLSARSIARALSALRSFYRFLLREGLREDDPSALRIETPKVARKLPKVLTVEEVEALLAAPIDVDTPLG 127 (305) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCC T ss_conf 999866877002889999999999999868874557336642034224578883798999999850024677676 No 189 >COG3945 Uncharacterized conserved protein [Function unknown] Probab=30.31 E-value=42 Score=14.89 Aligned_cols=17 Identities=18% Similarity=0.231 Sum_probs=6.5 Q ss_pred CCHHHHHHHHHHHHHHH Q ss_conf 88999999999999997 Q gi|255764481|r 209 FSRDTIHLIRAVYKQIF 225 (271) Q Consensus 209 ~s~e~i~~l~~ayr~lf 225 (271) +-+.-+..+.++|..+- T Consensus 92 ~~R~i~r~lee~~~~~k 108 (189) T COG3945 92 EGRYIIRDLEEAYERLK 108 (189) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 68999999999999988 No 190 >KOG0819 consensus Probab=29.84 E-value=44 Score=14.78 Aligned_cols=17 Identities=18% Similarity=0.698 Sum_probs=8.3 Q ss_pred CHHHHHHHHHHHHHHHH Q ss_conf 89999999999999972 Q gi|255764481|r 210 SRDTIHLIRAVYKQIFQ 226 (271) Q Consensus 210 s~e~i~~l~~ayr~lfr 226 (271) |++++.+++++|...|. T Consensus 119 T~~el~~i~~aY~~~y~ 135 (321) T KOG0819 119 TNEELRAIRQAYQELYK 135 (321) T ss_pred CHHHHHHHHHHHHHHHC T ss_conf 98999999999999976 No 191 >TIGR00463 gltX_arch glutamyl-tRNA synthetase; InterPro: IPR004526 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . Glutamyl-tRNA synthetase (6.1.1.17 from EC) is a class Ic synthetase and shows several similarities with glutaminyl-tRNA synthetase concerning structure and catalytic properties. It is an alpha2 dimer. To date one crystal structure of a glutamyl-tRNA synthetase (Thermus thermophilus) has been solved. The molecule has the form of a bent cylinder and consists of four domains. The N-terminal half (domains 1 and 2) contains the 'Rossman fold' typical for class I synthetases and resembles the corresponding part of E. coli GlnRS, whereas the C-terminal half exhibits a GluRS-specific structure . ; GO: 0000166 nucleotide binding, 0004818 glutamate-tRNA ligase activity, 0005524 ATP binding, 0006412 translation, 0006424 glutamyl-tRNA aminoacylation, 0005737 cytoplasm. Probab=29.53 E-value=29 Score=15.87 Aligned_cols=60 Identities=15% Similarity=0.108 Sum_probs=30.4 Q ss_pred CCCCCCCEEEECCCEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHH Q ss_conf 44433650640474350132112233331000107453233442014442888999999999 Q gi|255764481|r 158 GSAVHQFTRIGKYAFIGGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRA 219 (271) Q Consensus 158 ~~~v~~~v~IG~~a~ig~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ 219 (271) -+..-.+++|-+--.. .-|...|++... .+.||===+|=-.+=++||||+.+|.|+.+-. T Consensus 326 ~~~hyg~~ki~~v~~L-sts~~~~~i~~g-~y~GnWDDpRLpTlRa~rRRG~~pEai~~~~~ 385 (600) T TIGR00463 326 EFIHYGRLKINDVRTL-STSSIKKGIVRG-DYSGNWDDPRLPTLRAVRRRGIKPEAIREFML 385 (600) T ss_pred CEEEEEEEEHHHHHHH-HHHHHHHHHHCC-CCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHH T ss_conf 0234411551133232-103454110126-51246568886302332117881578999999 No 192 >cd04379 RhoGAP_SYD1 RhoGAP_SYD1: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in SYD-1_like proteins. Syd-1, first identified and best studied in C.elegans, has been shown to play an important role in neuronal development by specifying axonal properties. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=29.52 E-value=28 Score=15.99 Aligned_cols=43 Identities=19% Similarity=0.093 Sum_probs=27.6 Q ss_pred CCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHH Q ss_conf 2233331000107453233442014442888999999999999 Q gi|255764481|r 180 VHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYK 222 (271) Q Consensus 180 ~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr 222 (271) .++||..-.-.-.--...|++.+|+-|..=+..+++.|++.|+ T Consensus 15 ~~~IP~iv~~ci~~le~~GL~~EGIFR~sGs~~~i~~L~~~~~ 57 (207) T cd04379 15 SRDVPIVLQKCVQEIERRGLDVIGLYRLCGSAAKKKELRDAFE 57 (207) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHH T ss_conf 9995889999999999858997863424787999999999995 No 193 >TIGR02568 LcrE type III secretion regulator YopN/LcrE/InvE/MxiC; InterPro: IPR013401 This protein is found in type III secretion operons and, in Yersinia, is localized to the cell surface and is involved in the Low-Calcium Response (LCR), possibly by sensing the calcium concentration . In Salmonella, the gene is known as InvE and is believed to perform an essential role in the secretion process and interacts with the proteins SipBCD and SicA . Negative regulation of type III secretion in Y. pestis is mediated in part by a multiprotein complex that has been proposed to act as a physical impediment to type III secretion by blocking the entrance to the secretion apparatus prior to contact with mammalian cells. This complex is composed of YopN, its heterodimeric secretion chaperone SycN-YscB, and TyeA ; GO: 0030254 protein secretion by the type III secretion system, 0050709 negative regulation of protein secretion, 0009986 cell surface. Probab=29.46 E-value=44 Score=14.74 Aligned_cols=65 Identities=18% Similarity=0.343 Sum_probs=47.4 Q ss_pred CCCCCCCCCCCHHHHCCC----CHHH----HHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 745323344201444288----8999----99999999999727998999999999642589899999999861 Q gi|255764481|r 192 NPGALRGVNVVAMRRAGF----SRDT----IHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEVSDIINFIFA 257 (271) Q Consensus 192 ~pa~~~~~N~~glkR~g~----s~e~----i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v~~~~~Fi~~ 257 (271) .|.-.-++|+- |+...| +.+. +..|+..||..-..-.++.+.++.+.+.+...+.+.-.++|+.. T Consensus 147 ~~~~~a~~~~~-l~~~~fa~~~~~~~~~~~l~~LR~lYr~~~~~~~~~~~~~~~~~er~~~~~d~~~~l~~l~~ 219 (265) T TIGR02568 147 GPEIRAGINTA-LAAAAFADSLDAKARVSDLSALRDLYRQAVSDQSSLSQLLEDLIERYGAQRDFSIALDFLIR 219 (265) T ss_pred CCCHHHHHHHH-HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 86347777677-88898742144120134699999998998404545899999999852554248999999999 No 194 >PRK05665 amidotransferase; Provisional Probab=28.15 E-value=47 Score=14.59 Aligned_cols=58 Identities=9% Similarity=0.165 Sum_probs=41.3 Q ss_pred CCCHHHH-CCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHC Q ss_conf 4201444-2888999999999999997279989999999996425898999999998615 Q gi|255764481|r 200 NVVAMRR-AGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEVSDIINFIFAD 258 (271) Q Consensus 200 N~~glkR-~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v~~~~~Fi~~s 258 (271) +..+++- .-|+++-.++|-+..|..+ .....+++++.|..........+-|++|++.. T Consensus 178 ~~~~~Q~HPEf~~~~~~~ll~~R~~~i-~~~~~~~a~~sl~~~~D~~~~a~~i~~Fl~~~ 236 (240) T PRK05665 178 QVLCFQGHPEFVHDYSRALLDLRQEHL-GEEVYSKGVASLAHDHQGTLVAEWMMRFVAQK 236 (240) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHC-CHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHC T ss_conf 689991696879999999999977636-99999999997068875899999999998428 No 195 >TIGR02160 PA_CoA_Oxy5 phenylacetate-CoA oxygenase/reductase, PaaK subunit; InterPro: IPR011884 Phenylacetate-CoA oxygenase is comprised of a five gene complex responsible for the hydroxylation of phenylacetate-CoA (PA-CoA) as the second catabolic step in phenylacetic acid (PA) degradation , . Although the exact function of this enzyme has not been determined, it has been shown to be required for phenylacetic acid degradation and has been proposed to function in a multicomponent oxygenase acting on phenylacetate-CoA . . Probab=27.80 E-value=34 Score=15.47 Aligned_cols=56 Identities=16% Similarity=0.309 Sum_probs=34.7 Q ss_pred CCCCCEEEECCCEEECCCCCC------------CCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 433650640474350132112------------233331000107453233442014442888999999999999997 Q gi|255764481|r 160 AVHQFTRIGKYAFIGGMTGVV------------HDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIF 225 (271) Q Consensus 160 ~v~~~v~IG~~a~ig~gs~V~------------kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lf 225 (271) +|.|-.-=|+|..|+|||.|| .+=--|.|++||+..- -=.-+|++..||..|=.-| T Consensus 115 tiaqeaG~~hyvavAAGSGITP~lai~~tvLa~~p~S~ftLvY~Nr~~~----------~vMFa~~L~DLKd~yp~Rf 182 (371) T TIGR02160 115 TIAQEAGAGHYVAVAAGSGITPILAIAKTVLAAEPKSTFTLVYGNRRSA----------SVMFAEELADLKDKYPQRF 182 (371) T ss_pred HHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCEEEEEECCCCCC----------CHHHHHHHHHHHCCHHHHH T ss_conf 8888606896899981787058999999874069952089996178887----------2778998987631204578 No 196 >COG1422 Predicted membrane protein [Function unknown] Probab=27.41 E-value=48 Score=14.51 Aligned_cols=40 Identities=10% Similarity=0.232 Sum_probs=27.7 Q ss_pred CCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 420144428889999999999999972799899999999964 Q gi|255764481|r 200 NVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQ 241 (271) Q Consensus 200 N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~ 241 (271) .+++-+|-+--.++.++.++.+|...+++. .+++++|+++ T Consensus 67 ~liD~ekm~~~qk~m~efq~e~~eA~~~~d--~~~lkkLq~~ 106 (201) T COG1422 67 LLIDQEKMKELQKMMKEFQKEFREAQESGD--MKKLKKLQEK 106 (201) T ss_pred HHCCHHHHHHHHHHHHHHHHHHHHHHHHCC--HHHHHHHHHH T ss_conf 960299999999999999999999998088--9999999999 No 197 >TIGR02137 HSK-PSP phosphoserine phosphatase/homoserine phosphotransferase bifunctional protein; InterPro: IPR011863 This protein is has been characterised as both a phosphoserine phosphatase and a phosphoserine:homoserine phosphotransferase . In Pseudomonas aeruginosa, where the characterization was done, a second phosphoserine phosphatase (SerB) and a second homoserine kinase (thrB) are found, but in Fibrobacter succinogenes neither are present. . Probab=26.69 E-value=27 Score=16.06 Aligned_cols=39 Identities=15% Similarity=0.229 Sum_probs=17.8 Q ss_pred CCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHH Q ss_conf 3211223333100010745323344201444288899999999999 Q gi|255764481|r 176 MTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVY 221 (271) Q Consensus 176 gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ay 221 (271) --.-|+|||.|-+..-.+-|+ |.|.++.-.+|+++-... T Consensus 29 L~~TTRDIPDYDVLMK~R~~I-------L~~~~~KL~DIQ~VIA~L 67 (203) T TIGR02137 29 LKLTTRDIPDYDVLMKQRLKI-------LDRENLKLSDIQEVIATL 67 (203) T ss_pred HHHHCCCCCCHHHHHHHHHHH-------HHHCCCCHHHHHHHHHHC T ss_conf 531037898434465556655-------421587124568887615 No 198 >COG0137 ArgG Argininosuccinate synthase [Amino acid transport and metabolism] Probab=26.57 E-value=50 Score=14.42 Aligned_cols=60 Identities=8% Similarity=0.094 Sum_probs=39.3 Q ss_pred CCHHHHCC-CCHHHHHHHHHHHHHHH---------HCCC-CHHHHHHHHHHH-CCCCHHHHHHHHHHHHCCC Q ss_conf 20144428-88999999999999997---------2799-899999999964-2589899999999861588 Q gi|255764481|r 201 VVAMRRAG-FSRDTIHLIRAVYKQIF---------QQGD-SIYKNAGAIREQ-NVSCPEVSDIINFIFADRK 260 (271) Q Consensus 201 ~~glkR~g-~s~e~i~~l~~ayr~lf---------r~~~-~l~e~l~~l~~~-~~~~~~v~~~~~Fi~~s~r 260 (271) .+|||-|+ +..---..|..|+|.|- +... .+.+...+|.-+ +..+|+...|-.|+..+++ T Consensus 265 ~vG~KSRevYE~Pg~tiL~~AHr~LE~ltl~r~~~~~k~~~~~~~~a~lvY~GlWf~Pl~~~L~a~~~~~q~ 336 (403) T COG0137 265 LVGIKSREVYEAPGATILLTAHRALESLTLDREEAHFKDIELGPKYAELVYNGLWFSPLREALRAFIDETQE 336 (403) T ss_pred EEEEECCHHCCCCHHHHHHHHHHHHHHHEECHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 011202100248219999999999976340687776443221789999986787348799999999999885 No 199 >pfam05476 PET122 PET122. The nuclear PET122 gene of S. cerevisiae encodes a mitochondrial-localized protein that activates initiation of translation of the mitochondrial mRNA from the COX3 gene, which encodes subunit III of cytochrome c oxidase. Probab=25.76 E-value=51 Score=14.33 Aligned_cols=57 Identities=12% Similarity=0.227 Sum_probs=40.9 Q ss_pred CCCCCHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 344201444288899999999999999727998999999999642589899999999861 Q gi|255764481|r 198 GVNVVAMRRAGFSRDTIHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEVSDIINFIFA 257 (271) Q Consensus 198 ~~N~~glkR~g~s~e~i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v~~~~~Fi~~ 257 (271) =||.+=|. ..++.+.--.+-+-|+..|+ ++.+.+.++-+-.+. +.-...++.+|++. T Consensus 206 LLNmiLlq-~~~~~d~KirlF~~F~~~~~-~l~l~DSi~IL~~~~-~~y~~~~L~~f~~~ 262 (267) T pfam05476 206 LLNMVLLQ-NEFPLDFRIRLFAEFRTSYR-GLPYLDSIEILIKRC-EGYRMNELLTFVRG 262 (267) T ss_pred HHHHHHCC-CCCCHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHC-CCHHHHHHHHHHHH T ss_conf 99999818-88883789999999998588-984677999999870-50879999999995 No 200 >cd06825 PLPDE_III_VanT Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzymes, VanT and similar proteins. This subfamily is composed of Enterococcus gallinarum VanT and similar proteins. VanT is a membrane-bound serine racemase (EC 5.1.1.18) that is essential for vancomycin resistance in Enterococcus gallinarum. It converts L-serine into its D-enantiomer (D-serine) for peptidoglycan synthesis. The C-terminal region of this protein contains a PLP-binding TIM-barrel domain followed by beta-sandwich domain, which is homologous to the fold type III PLP-dependent enzyme, bacterial alanine racemase (AR). AR exists as homodimers with active sites that lie at the interface between the TIM barrel domain of one subunit and the beta-sandwich domain of the other subunit. On the basis of this similarity, it has been suggested that dimer formation of VanT is required for its catalytic activity, and that it catalyzes the racemization of serine in a mechanistically similar manner to that of alanine by Probab=25.59 E-value=51 Score=14.33 Aligned_cols=24 Identities=38% Similarity=0.559 Sum_probs=18.3 Q ss_pred CCCCCHHHHCCCCHHHHHHHHHHHH Q ss_conf 3442014442888999999999999 Q gi|255764481|r 198 GVNVVAMRRAGFSRDTIHLIRAVYK 222 (271) Q Consensus 198 ~~N~~glkR~g~s~e~i~~l~~ayr 222 (271) .++ .||.|.||+++++..+.+.++ T Consensus 120 kiD-TGM~RlG~~~~e~~~~~~~~~ 143 (368) T cd06825 120 KVD-TGMHRLGESPEDIDSILAIYR 143 (368) T ss_pred EEC-CCCCCCCCCHHHHHHHHHHHH T ss_conf 970-799867888789999999986 No 201 >pfam12098 DUF3574 Protein of unknown function (DUF3574). This family of proteins is functionally uncharacterized. This protein is found in bacteria and viruses. Proteins in this family are typically between 144 to 163 amino acids in length. This protein has a conserved TPRF sequence motif. Probab=25.39 E-value=52 Score=14.28 Aligned_cols=19 Identities=47% Similarity=0.662 Sum_probs=16.0 Q ss_pred CHHHHHHHHHHHHHHHHCC Q ss_conf 8999999999999997279 Q gi|255764481|r 210 SRDTIHLIRAVYKQIFQQG 228 (271) Q Consensus 210 s~e~i~~l~~ayr~lfr~~ 228 (271) +...+.+|+.+||..|++. T Consensus 73 ~~~~i~~I~~aYK~~F~Q~ 91 (104) T pfam12098 73 KRADIEAIREAYKQRFHQQ 91 (104) T ss_pred HHHHHHHHHHHHHHHCCCC T ss_conf 7999999999999876675 No 202 >cd04382 RhoGAP_MgcRacGAP RhoGAP_MgcRacGAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in MgcRacGAP proteins. MgcRacGAP plays an important dual role in cytokinesis: i) it is part of centralspindlin-complex, together with the mitotic kinesin MKLP1, which is critical for the structure of the central spindle by promoting microtuble bundling. ii) after phosphorylation by aurora B MgcRacGAP becomes an effective regulator of RhoA and plays an important role in the assembly of the contractile ring and the initiation of cytokinesis. MgcRacGAP-like proteins contain a N-terminal C1-like domain, and a C-terminal RhoGAP domain. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway Probab=25.23 E-value=34 Score=15.43 Aligned_cols=25 Identities=16% Similarity=0.333 Sum_probs=12.5 Q ss_pred CCCCCCHHHHCCCCHHHHHHHHHHH Q ss_conf 3344201444288899999999999 Q gi|255764481|r 197 RGVNVVAMRRAGFSRDTIHLIRAVY 221 (271) Q Consensus 197 ~~~N~~glkR~g~s~e~i~~l~~ay 221 (271) +|+...|+-|..=+..+++++++.| T Consensus 31 ~Gl~~EGIFR~~G~~~~i~~L~~~~ 55 (193) T cd04382 31 RGLTEEGLYRVSGSEREVKALKEKF 55 (193) T ss_pred HCCCCCCCEECCCCHHHHHHHHHHH T ss_conf 1999898002688799999999998 No 203 >KOG1148 consensus Probab=24.75 E-value=44 Score=14.76 Aligned_cols=44 Identities=27% Similarity=0.245 Sum_probs=23.1 Q ss_pred CCCCCCCCCCCCHHHHC---CCCCCCCCCCHHHHCCCCHHHHHHHHH Q ss_conf 32112233331000107---453233442014442888999999999 Q gi|255764481|r 176 MTGVVHDVIPYGILNGN---PGALRGVNVVAMRRAGFSRDTIHLIRA 219 (271) Q Consensus 176 gs~V~kdVpp~~~~~G~---pa~~~~~N~~glkR~g~s~e~i~~l~~ 219 (271) +++++|--....+..|. =--++=..+.+||||||.++.|..--+ T Consensus 482 ~tv~SKRKi~~Lv~~~~V~~wDDPRLfTL~alRRRG~ppeAIn~F~~ 528 (764) T KOG1148 482 YTVVSKRKILKLVTEGYVRGWDDPRLFTLTALRRRGFPPEAINNFCA 528 (764) T ss_pred EEEEEHHHHHHHHHHCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHH T ss_conf 35520677898864164046887410114567755998799999999 No 204 >pfam04282 DUF438 Family of unknown function (DUF438). Probab=24.31 E-value=55 Score=14.16 Aligned_cols=46 Identities=7% Similarity=0.184 Sum_probs=28.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCC--HHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 899999999999999727998--9999999996425898999999998 Q gi|255764481|r 210 SRDTIHLIRAVYKQIFQQGDS--IYKNAGAIREQNVSCPEVSDIINFI 255 (271) Q Consensus 210 s~e~i~~l~~ayr~lfr~~~~--l~e~l~~l~~~~~~~~~v~~~~~Fi 255 (271) .-+....+|+-|+.+|.+-.. +..+=++|-++.....++++|||-= T Consensus 11 ~G~~~e~vk~~F~~~~~~Vs~~EI~~~EqeLi~eG~~~~eiq~LCdvH 58 (79) T pfam04282 11 NGGSPEEVKERFQELFKGVSPSEISAMEQELIAEGMPPEEIQKLCDVH 58 (79) T ss_pred CCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 899999999999999778999999999999999599999999996999 No 205 >COG0484 DnaJ DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones] Probab=23.81 E-value=56 Score=14.10 Aligned_cols=16 Identities=19% Similarity=0.339 Sum_probs=7.9 Q ss_pred CCHHHHHHHHHHHHHH Q ss_conf 8899999999999999 Q gi|255764481|r 209 FSRDTIHLIRAVYKQI 224 (271) Q Consensus 209 ~s~e~i~~l~~ayr~l 224 (271) +++++...|++.-+.+ T Consensus 336 ls~~q~~lL~~~~~~~ 351 (371) T COG0484 336 LSDEQKELLEEFAKSL 351 (371) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 9999999999998763 No 206 >TIGR00243 Dxr 1-deoxy-D-xylulose 5-phosphate reductoisomerase; InterPro: IPR003821 1-deoxy-D-xylulose 5-phosphate reductoisomerase synthesizes 2-C-methyl-D-erythritol 4-phosphate from 1-deoxy-D-xylulose 5-phosphate in a single step by intramolecular rearrangement and reduction and is responsible for terpenoid biosynthesis in some organisms . In Arabidopsis thaliana 1-deoxy-D-xylulose 5-phosphate reductoisomerase is the first committed enzyme of the non-mevalonate pathway for isoprenoid biosynthesis.; GO: 0003824 catalytic activity, 0008299 isoprenoid biosynthetic process. Probab=23.72 E-value=37 Score=15.23 Aligned_cols=82 Identities=13% Similarity=0.191 Sum_probs=46.6 Q ss_pred CEEECCCCCCCCCEEEECCCEEECCCCCCCCCC-CCCHHHHCCCCCC-----CCCCCHHHHCCCCH---HHHH--HHH-H Q ss_conf 606523444336506404743501321122333-3100010745323-----34420144428889---9999--999-9 Q gi|255764481|r 152 RVVFGGGSAVHQFTRIGKYAFIGGMTGVVHDVI-PYGILNGNPGALR-----GVNVVAMRRAGFSR---DTIH--LIR-A 219 (271) Q Consensus 152 ~~~IG~~~~v~~~v~IG~~a~ig~gs~V~kdVp-p~~~~~G~pa~~~-----~~N~~glkR~g~s~---e~i~--~l~-~ 219 (271) ++.|-..|.||.-|+.-|.++++--..=..-+| .|++. .|-|.. .|...-+.---|.+ +++. .+| = T Consensus 249 dv~ih~~SIiHSlVe~~D~SViAqLG~PDMr~Pi~yal~--~p~r~~~~~~kPLDl~~~~~L~F~~~d~dRY~lWClkdL 326 (406) T TIGR00243 249 DVVIHPQSIIHSLVEFQDGSVIAQLGSPDMRLPIAYALA--YPERLLSADVKPLDLYKLSALTFEEPDFDRYPLWCLKDL 326 (406) T ss_pred EEEECCCCEEEEEEEEECCCCEECCCCCCCCCHHHHHCC--CCHHHCCCCCCCHHHHHHCCCCCCCCCCCCCCHHHHHHH T ss_conf 164667662676678857600011288622003897568--722323789870355542103557887664404568988 Q ss_pred HHHHHHHCCCCHHHHHH Q ss_conf 99999727998999999 Q gi|255764481|r 220 VYKQIFQQGDSIYKNAG 236 (271) Q Consensus 220 ayr~lfr~~~~l~e~l~ 236 (271) || .=|+.|....-+|. T Consensus 327 a~-ea~~~G~~~~~vLN 342 (406) T TIGR00243 327 AL-EAGKAGGAMPTVLN 342 (406) T ss_pred HH-HHHHCCCHHHHHHH T ss_conf 75-30520215889996 No 207 >TIGR02143 trmA_only tRNA (uracil-5-)-methyltransferase; InterPro: IPR011869 This family consists exclusively of proteins believed to act as tRNA (uracil-5-)-methyltransferase. All members of this entry belong to the Proteobacteria. ; GO: 0009021 tRNA (uracil-5-)-methyltransferase activity, 0008033 tRNA processing. Probab=23.15 E-value=22 Score=16.64 Aligned_cols=36 Identities=22% Similarity=0.463 Sum_probs=31.5 Q ss_pred HCCCCHHHHHHHHHHHHHHHHC--CCCHHHHHHHHHHH Q ss_conf 4288899999999999999727--99899999999964 Q gi|255764481|r 206 RAGFSRDTIHLIRAVYKQIFQQ--GDSIYKNAGAIREQ 241 (271) Q Consensus 206 R~g~s~e~i~~l~~ayr~lfr~--~~~l~e~l~~l~~~ 241 (271) |+|++++.++.+++-=|+||=| ..||.+.|++|.+. T Consensus 297 RaGlD~~t~~Lv~~y~rIlYISCNP~TL~~NL~~L~~T 334 (361) T TIGR02143 297 RAGLDPDTVKLVQKYERILYISCNPETLKENLEQLSET 334 (361) T ss_pred CCCCCHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHC T ss_conf 88889899999962598799846968999999988606 No 208 >cd03557 L-arabinose_isomerase L-Arabinose isomerase (AI) catalyzes the isomerization of L-arabinose to L-ribulose, the first reaction in its conversion into D-xylulose-5-phosphate, an intermediate in the pentose phosphate pathway, which allows L-arabinose to be used as a carbon source. AI can also convert D-galactose to D-tagatose at elevated temperatures in the presence of divalent metal ions. D-tagatose, rarely found in nature, is of commercial interest as a low-calorie sugar substitute. Probab=23.03 E-value=30 Score=15.81 Aligned_cols=95 Identities=14% Similarity=0.163 Sum_probs=58.2 Q ss_pred CEECCCEEECCCCCCCCCEEEECCCEE-ECCCCCCCCCCCCCHHHHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 321486065234443365064047435-0132112233331000107453233442014442888999999999999997 Q gi|255764481|r 147 VIVDDRVVFGGGSAVHQFTRIGKYAFI-GGMTGVVHDVIPYGILNGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIF 225 (271) Q Consensus 147 v~Igd~~~IG~~~~v~~~v~IG~~a~i-g~gs~V~kdVpp~~~~~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lf 225 (271) .+|++++.......-.+..++++.--= ---++...|--.-.+..|...-..++|..--.....++++|.++-+-|+..| T Consensus 152 ~~I~~W~raA~~~~~~r~lkvarFGdnMr~VAVTeGDkveaq~~fG~~V~~~~vgdLv~~v~~Vsd~ei~al~~ey~~~Y 231 (484) T cd03557 152 EKIGDWMRAAAGWADSRHLKVARFGDNMRNVAVTEGDKVEAQIQFGWSVNGYGVGDLVARVDAVSDSDVDALVDEYEALY 231 (484) T ss_pred HHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCHHHHHHHHCEEEEEEECHHHHHHHHHCCHHHHHHHHHHHHHHC T ss_conf 99999999999999762476788566534622126508888786272685473038999998589999999999998604 Q ss_pred HCCCCHH---HHHHHHHHH Q ss_conf 2799899---999999964 Q gi|255764481|r 226 QQGDSIY---KNAGAIREQ 241 (271) Q Consensus 226 r~~~~l~---e~l~~l~~~ 241 (271) .-...+. +..+.++.. T Consensus 232 ~~~~~~~~~~~~~~~l~~a 250 (484) T cd03557 232 DLAPELKDGGERRASLREA 250 (484) T ss_pred CCCCCCCCCHHHHHHHHHH T ss_conf 6472224745789999999 No 209 >cd07313 terB_like_2 tellurium resistance terB-like protein, subgroup 2. This family includes several uncharacterized bacterial proteins. Protein sequence homology analysis shows they are similar to tellurium resistance protein terB, but the function of this family is unknown. Probab=22.81 E-value=58 Score=13.97 Aligned_cols=49 Identities=8% Similarity=0.171 Sum_probs=29.2 Q ss_pred CCHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 8899999999999999727-998999999999642589899999999861 Q gi|255764481|r 209 FSRDTIHLIRAVYKQIFQQ-GDSIYKNAGAIREQNVSCPEVSDIINFIFA 257 (271) Q Consensus 209 ~s~e~i~~l~~ayr~lfr~-~~~l~e~l~~l~~~~~~~~~v~~~~~Fi~~ 257 (271) +++.++..+++..+..|.- ....++-++..++...+++....|+.-|.. T Consensus 16 ~~~~E~~~i~~~l~~~f~l~~~~~~~l~~~a~~~~~~~~~l~~ft~~in~ 65 (104) T cd07313 16 YDEEERAAIDRLLAERFGLDAEEAAELLAEAEALEEEAPDLYEFTSLIKE 65 (104) T ss_pred CCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 79999999999999987857899999999999988657349999998733 No 210 >PTZ00183 centrin; Provisional Probab=22.47 E-value=59 Score=13.93 Aligned_cols=19 Identities=16% Similarity=0.303 Sum_probs=11.7 Q ss_pred HCCCCHHHHHHHHHHHHHH Q ss_conf 4288899999999999999 Q gi|255764481|r 206 RAGFSRDTIHLIRAVYKQI 224 (271) Q Consensus 206 R~g~s~e~i~~l~~ayr~l 224 (271) +..+|.++++.|+++|+.+ T Consensus 18 ~~~ls~eq~~elke~F~~~ 36 (168) T PTZ00183 18 RPELNEEQKLEIREAFDLF 36 (168) T ss_pred CCCCCHHHHHHHHHHHHHH T ss_conf 0269999999999999998 No 211 >cd04769 HTH_MerR2 Helix-Turn-Helix DNA binding domain of MerR2-like transcription regulators. Helix-turn-helix (HTH) transcription regulator MerR2 and related proteins. MerR2 in Bacillus cereus RC607 regulates resistance to organomercurials. The MerR family transcription regulators have been shown to mediate responses to stress including exposure to heavy metals, drugs, or oxygen radicals in eubacterial and some archaeal species. They regulate transcription by reconfiguring the spacer between the -35 and -10 promoter elements. A typical MerR regulator is comprised of two distinct domains that harbor the regulatory (effector-binding) site and the active (DNA-binding) site. Their N-terminal domains are homologous and contain a DNA-binding winged HTH motif, while the C-terminal domains are often dissimilar and bind specific coactivator molecules such as metal ions, drugs, and organic substrates. Probab=22.21 E-value=60 Score=13.90 Aligned_cols=19 Identities=26% Similarity=0.294 Sum_probs=13.0 Q ss_pred HHHHCCCCHHHHHHHHHHH Q ss_conf 1444288899999999999 Q gi|255764481|r 203 AMRRAGFSRDTIHLIRAVY 221 (271) Q Consensus 203 glkR~g~s~e~i~~l~~ay 221 (271) -+|+.|||-++|+.+-..| T Consensus 51 ~~r~~GfsL~eI~~ll~~~ 69 (116) T cd04769 51 EARQLGFTLAELKAIFAGH 69 (116) T ss_pred HHHHCCCCHHHHHHHHHHC T ss_conf 9999799999999999643 No 212 >CHL00091 apcE phycobillisome linker protein Probab=22.21 E-value=60 Score=13.90 Aligned_cols=50 Identities=16% Similarity=0.266 Sum_probs=35.2 Q ss_pred CCHHHH-HHHHHHHHHHHHCCC---CHHHHHHHHHHHCCCCH-HHHHHHHHHHHC Q ss_conf 889999-999999999972799---89999999996425898-999999998615 Q gi|255764481|r 209 FSRDTI-HLIRAVYKQIFQQGD---SIYKNAGAIREQNVSCP-EVSDIINFIFAD 258 (271) Q Consensus 209 ~s~e~i-~~l~~ayr~lfr~~~---~l~e~l~~l~~~~~~~~-~v~~~~~Fi~~s 258 (271) .+++++ ..|+.+||.+|.... -..+.+..+|..+.+.. .+++|++-+-.| T Consensus 721 ~~~~~lq~vIrAaYRQvfeRd~~~yi~~eef~~aESkl~ng~IsVkEFVe~Lg~S 775 (886) T CHL00091 721 SDSENLEQVLRAAYRQIFERDLNSFIAGGEFLDIESAFLNGQISVRELVEKLASS 775 (886) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCC T ss_conf 7799999999999999967897156665477899998755871199999996335 No 213 >TIGR00635 ruvB Holliday junction DNA helicase RuvB; InterPro: IPR004605 All proteins in this family for which functions are known are 5'-3' DNA helicases that, as part of a complex with RuvA homologs serve as a 5'-3' Holliday junction helicase. RuvA specifically binds Holliday junctions as a sandwich of two tetramers and maintains the configuration of the junction. It forms a complex with two hexameric rings of RuvB, the subunit that contains helicase activity. The complex drives ATP-dependent branch migration of the Holliday junction recombination intermediate. The endonuclease RuvC resolves junctions.; GO: 0003677 DNA binding, 0005524 ATP binding, 0009378 Holliday junction helicase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=22.08 E-value=60 Score=13.88 Aligned_cols=76 Identities=18% Similarity=0.215 Sum_probs=46.4 Q ss_pred EEEECCCEEECCC-CCCCCCCCCCHHHHCCCCCCCCCCCHHH-HCCC-------CHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 0640474350132-1122333310001074532334420144-4288-------89999999999999972799899999 Q gi|255764481|r 165 TRIGKYAFIGGMT-GVVHDVIPYGILNGNPGALRGVNVVAMR-RAGF-------SRDTIHLIRAVYKQIFQQGDSIYKNA 235 (271) Q Consensus 165 v~IG~~a~ig~gs-~V~kdVpp~~~~~G~pa~~~~~N~~glk-R~g~-------s~e~i~~l~~ayr~lfr~~~~l~e~l 235 (271) +.||++ .+| .|.=|+|||+++ |--.|.-.+-- -|+ |.|+ |.+++..+-+.+-.++.-.. -++++ T Consensus 112 i~IG~G----p~Ar~v~ldLpPFTLv-GATTR~G~lt~-PLrdRFG~~~rl~fY~~~EL~~Iv~R~A~~L~~ei-~~~~a 184 (305) T TIGR00635 112 IVIGKG----PSARSVRLDLPPFTLV-GATTRAGMLTS-PLRDRFGIILRLEFYTPEELAEIVSRSAGLLNIEI-EQEAA 184 (305) T ss_pred EEEECC----CCCCEEEECCCCCCCC-CCCCCCCCCCC-CHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCC-CHHHH T ss_conf 787128----9852576068694420-00034774103-13345447454026898789999875334414300-77899 Q ss_pred HHHHHHCCCCHH Q ss_conf 999964258989 Q gi|255764481|r 236 GAIREQNVSCPE 247 (271) Q Consensus 236 ~~l~~~~~~~~~ 247 (271) .++-...-.+|- T Consensus 185 ~~IArrSRGTPR 196 (305) T TIGR00635 185 LEIARRSRGTPR 196 (305) T ss_pred HHHHHHCCCCHH T ss_conf 999875478637 No 214 >TIGR01926 peroxid_rel uncharacterized peroxidase-related enzyme; InterPro: IPR010195 Members of this family are conserved hypothetical proteins of around 200 amino acids in length. Many of them contain an akylhydroperoxidase (AhpD) domain. . Probab=21.54 E-value=56 Score=14.09 Aligned_cols=37 Identities=27% Similarity=0.458 Sum_probs=28.2 Q ss_pred HHCCCCCCCCCCCHHHHCCCCHHHHHHHHHHHHHHHHC Q ss_conf 10745323344201444288899999999999999727 Q gi|255764481|r 190 NGNPGALRGVNVVAMRRAGFSRDTIHLIRAVYKQIFQQ 227 (271) Q Consensus 190 ~G~pa~~~~~N~~glkR~g~s~e~i~~l~~ayr~lfr~ 227 (271) .-.|+.+..--.--||..|||+++|-+|-... .+|-. T Consensus 128 t~~P~~v~~~d~~~Lr~~GfsD~~IlD~~~~~-AfF~~ 164 (179) T TIGR01926 128 TATPAEVNEADLAALRAAGFSDEEILDLIHVV-AFFNW 164 (179) T ss_pred HHCCCCCCHHHHHHHHHCCCCHHHHHHHHHHH-HHHHH T ss_conf 70851001898888875489958999999999-99857 No 215 >cd04408 RhoGAP_GMIP RhoGAP_GMIP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of GMIP (Gem interacting protein). GMIP plays important roles in neurite growth and axonal guidance, and interacts with Gem, a member of the RGK subfamily of the Ras small GTPase superfamily, through the N-terminal half of the protein. GMIP contains a C-terminal RhoGAP domain. GMIP inhibits RhoA function, but is inactive towards Rac1 and Cdc41. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude. Probab=21.38 E-value=48 Score=14.48 Aligned_cols=26 Identities=15% Similarity=0.232 Sum_probs=13.0 Q ss_pred CCCCCCCHHHHCCCCHHHHHHHHHHH Q ss_conf 23344201444288899999999999 Q gi|255764481|r 196 LRGVNVVAMRRAGFSRDTIHLIRAVY 221 (271) Q Consensus 196 ~~~~N~~glkR~g~s~e~i~~l~~ay 221 (271) ..|++.+|+-|..=+..+++.+++.| T Consensus 29 ~~gl~~EGIfR~~G~~~~i~~L~~~~ 54 (200) T cd04408 29 NRALGVQGIYRISGSKARVEKLCQAF 54 (200) T ss_pred HHCCCCCCEEECCCCHHHHHHHHHHH T ss_conf 85898475343068699999999999 No 216 >pfam09675 Chlamy_scaf Chlamydia-phage Chp2 scaffold (Chlamy_scaf). Members of this entry are encoded by genes in chlamydia-phage such as Chp2. These viruses have around eight genes and obligately infect intracellular bacterial pathogens of the genus Chlamydia. This protein is annotated as VP3 or structural protein (as if a protein of mature viral particles), however, it is displaced from procapsids as DNA is packaged, and therefore is more correctly described as a scaffolding protein. Probab=20.00 E-value=67 Score=13.61 Aligned_cols=44 Identities=11% Similarity=0.189 Sum_probs=26.6 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHCC Q ss_conf 9999999999972799899999999964258989999999986158 Q gi|255764481|r 214 IHLIRAVYKQIFQQGDSIYKNAGAIREQNVSCPEVSDIINFIFADR 259 (271) Q Consensus 214 i~~l~~ayr~lfr~~~~l~e~l~~l~~~~~~~~~v~~~~~Fi~~s~ 259 (271) ....+.|..++-+....+.+--+++++++.+.| ..|++|+...+ T Consensus 30 v~Dy~eAmn~V~~A~e~F~~LPa~vR~~F~NDP--~~FleF~~dp~ 73 (107) T pfam09675 30 PIDYQEALNAVIEAQEAFDSLPAKIRERFGNDP--EEMLEFLNDPE 73 (107) T ss_pred CCCHHHHHHHHHHHHHHHHHCCHHHHHHHCCCH--HHHHHHHCCCC T ss_conf 002999999999999999978899999877999--99999971955 Done!