Query gi|254781223|ref|YP_003065636.1| intrrupted gp229, phage associated protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 110 No_of_seqs 110 out of 3883 Neff 9.7 Searched_HMMs 39220 Date Mon May 30 07:40:17 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781223.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 cd04745 LbH_paaY_like paaY-lik 99.9 4.2E-22 1.1E-26 108.7 6.7 108 1-109 3-117 (155) 2 cd04645 LbH_gamma_CA_like Gamm 99.9 1.3E-21 3.3E-26 106.6 7.5 108 1-109 2-116 (153) 3 TIGR01853 lipid_A_lpxD UDP-3-O 99.8 1.2E-21 3.1E-26 106.8 6.3 67 1-67 116-182 (336) 4 cd00710 LbH_gamma_CA Gamma car 99.8 2.3E-21 6E-26 105.6 7.2 104 1-105 5-116 (167) 5 cd04646 LbH_Dynactin_6 Dynacti 99.8 4E-21 1E-25 104.6 7.1 108 1-109 2-122 (164) 6 PRK00892 lpxD UDP-3-O-[3-hydro 99.8 5.1E-21 1.3E-25 104.2 7.1 56 3-58 115-170 (343) 7 cd04650 LbH_FBP Ferripyochelin 99.8 9.6E-21 2.4E-25 103.0 8.0 108 1-109 3-117 (154) 8 TIGR03570 NeuD_NnaD sugar O-ac 99.8 5.8E-21 1.5E-25 103.9 6.7 103 5-107 88-190 (201) 9 PRK13627 carnitine operon prot 99.8 6.9E-21 1.8E-25 103.6 6.9 108 1-109 13-127 (196) 10 TIGR01852 lipid_A_lpxA acyl-[a 99.8 9.3E-21 2.4E-25 103.1 7.2 109 1-109 1-140 (257) 11 cd03360 LbH_AT_putative Putati 99.8 1.8E-20 4.5E-25 101.9 6.2 103 5-107 85-187 (197) 12 cd03351 LbH_UDP-GlcNAc_AT UDP- 99.8 3E-20 7.7E-25 100.9 6.8 53 1-53 2-54 (254) 13 PRK05289 UDP-N-acetylglucosami 99.8 4.9E-20 1.2E-24 100.1 6.7 56 1-56 5-60 (261) 14 PRK12461 UDP-N-acetylglucosami 99.8 3.9E-20 9.9E-25 100.5 6.1 58 1-58 3-60 (256) 15 cd03352 LbH_LpxD UDP-3-O-acyl- 99.8 7.9E-20 2E-24 99.2 7.6 61 1-61 4-64 (205) 16 cd03351 LbH_UDP-GlcNAc_AT UDP- 99.8 7.3E-20 1.9E-24 99.3 7.3 106 2-107 15-175 (254) 17 PRK05289 UDP-N-acetylglucosami 99.8 9E-20 2.3E-24 99.0 7.6 47 2-48 18-64 (261) 18 PRK12461 UDP-N-acetylglucosami 99.8 1E-19 2.6E-24 98.7 7.6 49 1-49 15-63 (256) 19 PRK00892 lpxD UDP-3-O-[3-hydro 99.8 7.3E-20 1.9E-24 99.3 6.6 61 2-62 102-162 (343) 20 cd03350 LbH_THP_succinylT 2,3, 99.8 1.5E-19 3.8E-24 98.1 7.2 104 5-108 2-113 (139) 21 COG1044 LpxD UDP-3-O-[3-hydrox 99.8 3.8E-19 9.8E-24 96.3 7.8 68 2-69 115-182 (338) 22 COG0663 PaaY Carbonic anhydras 99.8 4.3E-19 1.1E-23 96.1 6.7 107 1-108 14-127 (176) 23 TIGR03532 DapD_Ac 2,3,4,5-tetr 99.8 7.5E-19 1.9E-23 95.1 6.9 104 4-107 86-197 (231) 24 TIGR01852 lipid_A_lpxA acyl-[a 99.8 1.6E-18 4.1E-23 93.7 6.4 107 1-107 31-174 (257) 25 COG1044 LpxD UDP-3-O-[3-hydrox 99.7 7.9E-18 2E-22 90.9 8.0 43 4-46 135-177 (338) 26 cd05636 LbH_G1P_TT_C_like Puta 99.7 8.7E-18 2.2E-22 90.7 7.7 48 2-49 21-68 (163) 27 TIGR01173 glmU UDP-N-acetylglu 99.7 1.5E-17 3.8E-22 89.7 6.7 37 71-107 401-437 (461) 28 cd03359 LbH_Dynactin_5 Dynacti 99.7 3.5E-17 8.9E-22 88.2 7.6 104 5-109 8-128 (161) 29 cd05636 LbH_G1P_TT_C_like Puta 99.7 8E-17 2E-21 86.7 7.7 56 2-57 9-64 (163) 30 cd03358 LbH_WxcM_N_like WcxM-l 99.7 7.1E-17 1.8E-21 86.9 7.4 104 1-107 1-104 (119) 31 PRK09451 glmU bifunctional N-a 99.7 6.6E-17 1.7E-21 87.0 7.2 37 71-107 395-431 (456) 32 TIGR03570 NeuD_NnaD sugar O-ac 99.7 5.2E-17 1.3E-21 87.5 6.5 106 1-108 90-195 (201) 33 cd03353 LbH_GlmU_C N-acetyl-gl 99.7 2.8E-16 7E-21 84.4 7.6 43 2-45 19-61 (193) 34 cd03360 LbH_AT_putative Putati 99.7 1.4E-16 3.5E-21 85.7 5.8 106 1-108 87-192 (197) 35 cd03352 LbH_LpxD UDP-3-O-acyl- 99.7 2.7E-16 7E-21 84.4 6.9 107 1-107 22-187 (205) 36 cd03353 LbH_GlmU_C N-acetyl-gl 99.6 3.2E-16 8.1E-21 84.2 7.0 53 4-57 9-61 (193) 37 COG1043 LpxA Acyl-[acyl carrie 99.6 6.4E-16 1.6E-20 82.9 7.4 41 69-109 105-145 (260) 38 cd03350 LbH_THP_succinylT 2,3, 99.6 3.7E-16 9.4E-21 83.9 6.1 104 1-104 4-115 (139) 39 PRK09451 glmU bifunctional N-a 99.6 2.8E-15 7.2E-20 80.2 6.8 75 3-79 270-344 (456) 40 cd03359 LbH_Dynactin_5 Dynacti 99.6 4.1E-15 1.1E-19 79.5 7.6 107 2-109 25-134 (161) 41 cd04645 LbH_gamma_CA_like Gamm 99.6 3.8E-15 9.6E-20 79.7 7.0 107 1-108 8-121 (153) 42 TIGR02287 PaaY phenylacetic ac 99.6 1.3E-15 3.4E-20 81.6 4.5 102 1-103 11-119 (193) 43 cd04745 LbH_paaY_like paaY-lik 99.5 8.9E-15 2.3E-19 78.1 5.9 107 1-108 9-122 (155) 44 TIGR01173 glmU UDP-N-acetylglu 99.5 1.2E-14 3.1E-19 77.6 6.5 77 2-80 275-351 (461) 45 TIGR03308 phn_thr-fam phosphon 99.5 1.5E-14 3.7E-19 77.2 6.6 38 70-107 108-145 (204) 46 COG1043 LpxA Acyl-[acyl carrie 99.5 2.7E-14 6.9E-19 76.1 5.9 38 2-39 7-44 (260) 47 cd04650 LbH_FBP Ferripyochelin 99.5 5.8E-14 1.5E-18 74.7 7.3 107 1-108 9-122 (154) 48 TIGR03532 DapD_Ac 2,3,4,5-tetr 99.5 6.8E-14 1.7E-18 74.5 6.5 99 2-100 90-196 (231) 49 COG1207 GlmU N-acetylglucosami 99.5 1.4E-13 3.5E-18 73.2 7.6 91 17-107 321-434 (460) 50 cd04646 LbH_Dynactin_6 Dynacti 99.5 1.2E-13 3E-18 73.5 6.4 104 3-109 16-128 (164) 51 PRK10502 putative colanic acid 99.5 1.7E-13 4.4E-18 72.8 6.8 104 1-107 55-162 (179) 52 cd00710 LbH_gamma_CA Gamma car 99.4 2.2E-13 5.7E-18 72.3 6.2 84 18-103 44-131 (167) 53 COG2171 DapD Tetrahydrodipicol 99.4 4.2E-13 1.1E-17 71.1 7.3 106 4-109 108-221 (271) 54 cd04647 LbH_MAT_like Maltose O 99.4 2.7E-13 7E-18 71.9 6.0 91 17-107 2-95 (109) 55 cd03358 LbH_WxcM_N_like WcxM-l 99.4 2.9E-12 7.5E-17 67.6 7.9 101 7-110 1-101 (119) 56 cd00208 LbetaH Left-handed par 99.3 1.3E-12 3.4E-17 69.0 5.6 75 2-104 4-78 (78) 57 cd03354 LbH_SAT Serine acetylt 99.3 2E-12 5E-17 68.3 6.4 86 12-107 4-91 (101) 58 PRK11830 dapD 2,3,4,5-tetrahyd 99.3 2.4E-12 6.1E-17 68.0 6.4 104 5-109 97-208 (265) 59 PRK13627 carnitine operon prot 99.3 2.8E-12 7.3E-17 67.7 6.0 32 7-38 13-44 (196) 60 COG0663 PaaY Carbonic anhydras 99.3 4.1E-12 1E-16 67.0 6.1 97 6-108 13-109 (176) 61 PRK09677 putative lipopolysacc 99.3 9.1E-12 2.3E-16 65.6 7.8 106 2-107 47-167 (192) 62 cd05824 LbH_M1P_guanylylT_C Ma 99.3 2.2E-12 5.7E-17 68.1 4.5 31 7-37 2-32 (80) 63 cd03357 LbH_MAT_GAT Maltose O- 99.3 8.1E-12 2.1E-16 65.8 6.9 106 2-107 46-155 (169) 64 KOG1461 consensus 99.3 2.7E-12 6.9E-17 67.8 4.1 103 4-108 315-422 (673) 65 cd05825 LbH_wcaF_like wcaF-lik 99.3 5.6E-12 1.4E-16 66.4 5.7 88 17-107 4-93 (107) 66 cd05824 LbH_M1P_guanylylT_C Ma 99.2 8E-12 2E-16 65.8 5.2 31 13-43 2-32 (80) 67 COG1207 GlmU N-acetylglucosami 99.2 1.9E-11 4.9E-16 64.2 6.1 42 3-45 273-314 (460) 68 PRK09527 lacA galactoside O-ac 99.2 2.9E-11 7.4E-16 63.5 6.3 106 2-107 59-168 (203) 69 KOG1461 consensus 99.2 1.6E-11 4.1E-16 64.5 4.3 93 12-107 317-409 (673) 70 cd05787 LbH_eIF2B_epsilon eIF- 99.2 3.1E-11 8E-16 63.3 5.4 30 1-31 2-31 (79) 71 PRK10092 maltose O-acetyltrans 99.2 5.2E-11 1.3E-15 62.4 6.2 104 4-107 59-166 (183) 72 COG1208 GCD1 Nucleoside-diphos 99.1 2E-10 5.1E-15 60.0 7.5 102 5-109 256-357 (358) 73 cd03349 LbH_XAT Xenobiotic acy 99.1 2.5E-10 6.3E-15 59.6 7.6 104 1-107 4-110 (145) 74 cd00208 LbetaH Left-handed par 99.1 9.9E-11 2.5E-15 61.3 4.4 77 17-109 1-77 (78) 75 PRK05293 glgC glucose-1-phosph 99.1 2.3E-10 5.8E-15 59.8 6.2 69 37-108 311-379 (381) 76 cd05787 LbH_eIF2B_epsilon eIF- 99.1 9.4E-11 2.4E-15 61.4 4.0 46 14-61 3-48 (79) 77 COG2171 DapD Tetrahydrodipicol 99.0 2.9E-10 7.5E-15 59.3 5.5 102 8-109 106-209 (271) 78 PRK05293 glgC glucose-1-phosph 99.0 5.2E-10 1.3E-14 58.2 6.7 45 49-95 311-355 (381) 79 cd03354 LbH_SAT Serine acetylt 99.0 2.1E-10 5.5E-15 59.9 4.6 83 1-87 5-89 (101) 80 cd04647 LbH_MAT_like Maltose O 99.0 5.9E-10 1.5E-14 58.0 6.4 89 1-100 4-94 (109) 81 cd03356 LbH_G1P_AT_C_like Left 99.0 4.9E-10 1.2E-14 58.4 5.8 29 2-31 3-31 (79) 82 TIGR03308 phn_thr-fam phosphon 99.0 8.6E-10 2.2E-14 57.3 6.7 105 2-109 6-141 (204) 83 cd05635 LbH_unknown Uncharacte 99.0 9.4E-10 2.4E-14 57.2 6.4 32 71-103 63-94 (101) 84 cd04652 LbH_eIF2B_gamma_C eIF- 99.0 7.7E-10 2E-14 57.5 5.2 29 2-31 3-31 (81) 85 cd05635 LbH_unknown Uncharacte 99.0 1.2E-09 3.1E-14 56.7 6.0 85 5-99 12-96 (101) 86 cd03356 LbH_G1P_AT_C_like Left 98.9 6.4E-10 1.6E-14 57.9 4.1 29 14-43 3-31 (79) 87 cd04652 LbH_eIF2B_gamma_C eIF- 98.9 7E-10 1.8E-14 57.7 4.2 78 14-101 3-80 (81) 88 PRK11830 dapD 2,3,4,5-tetrahyd 98.9 1.1E-09 2.9E-14 56.9 4.7 96 11-107 97-194 (265) 89 TIGR02287 PaaY phenylacetic ac 98.9 8.2E-10 2.1E-14 57.4 3.8 90 19-109 11-107 (193) 90 PRK10502 putative colanic acid 98.9 4E-09 1E-13 54.6 7.3 100 6-109 54-158 (179) 91 TIGR01172 cysE serine O-acetyl 98.9 1.4E-09 3.6E-14 56.4 4.8 86 22-107 61-151 (163) 92 PRK10092 maltose O-acetyltrans 98.9 1.3E-09 3.4E-14 56.6 4.5 84 2-85 77-162 (183) 93 TIGR00965 dapD 2,3,4,5-tetrahy 98.9 1.2E-09 3E-14 56.8 3.8 100 9-109 108-215 (275) 94 COG1045 CysE Serine acetyltran 98.9 2.5E-09 6.4E-14 55.4 5.2 38 70-107 119-156 (194) 95 COG1208 GCD1 Nucleoside-diphos 98.9 3.9E-09 1E-13 54.6 6.2 91 11-104 256-346 (358) 96 PRK09677 putative lipopolysacc 98.8 7.3E-09 1.9E-13 53.5 6.5 30 79-108 133-162 (192) 97 COG0110 WbbJ Acetyltransferase 98.8 8.7E-09 2.2E-13 53.1 6.8 103 5-107 53-161 (190) 98 cd05825 LbH_wcaF_like wcaF-lik 98.8 6.6E-09 1.7E-13 53.7 4.8 85 1-100 6-92 (107) 99 cd03357 LbH_MAT_GAT Maltose O- 98.7 2.4E-08 6E-13 51.3 5.6 36 65-100 119-154 (169) 100 PRK10191 putative colanic acid 98.7 3.2E-08 8.1E-13 50.8 6.0 86 11-107 42-129 (146) 101 PRK09527 lacA galactoside O-ac 98.7 1.4E-08 3.7E-13 52.2 4.2 30 3-32 80-111 (203) 102 COG1045 CysE Serine acetyltran 98.6 5.7E-08 1.5E-12 49.7 5.9 85 12-100 69-155 (194) 103 TIGR01172 cysE serine O-acetyl 98.6 4.2E-08 1.1E-12 50.3 4.8 87 11-100 62-150 (163) 104 cd04649 LbH_THP_succinylT_puta 98.6 1.8E-07 4.6E-12 47.7 7.2 103 5-109 2-110 (147) 105 PRK11132 cysE serine acetyltra 98.6 6.1E-08 1.6E-12 49.6 4.8 38 70-107 193-230 (273) 106 PRK02862 glgC glucose-1-phosph 98.6 1E-07 2.6E-12 48.7 5.2 33 17-49 309-345 (429) 107 PRK02862 glgC glucose-1-phosph 98.5 1.7E-07 4.4E-12 47.8 4.8 13 95-107 415-427 (429) 108 KOG1322 consensus 98.5 1.8E-07 4.7E-12 47.6 4.9 91 6-104 260-350 (371) 109 KOG3121 consensus 98.5 2.2E-07 5.6E-12 47.3 5.0 99 4-103 39-140 (184) 110 TIGR00965 dapD 2,3,4,5-tetrahy 98.4 1.4E-07 3.5E-12 48.2 3.4 85 16-104 132-216 (275) 111 PRK10191 putative colanic acid 98.4 5.8E-07 1.5E-11 45.5 6.1 81 2-87 45-127 (146) 112 cd03349 LbH_XAT Xenobiotic acy 98.4 1.2E-06 3E-11 44.2 7.4 35 75-109 72-106 (145) 113 PRK03282 consensus 98.4 5.8E-07 1.5E-11 45.6 5.2 50 47-98 330-379 (406) 114 PRK00844 glgC glucose-1-phosph 98.3 7.5E-07 1.9E-11 45.1 4.8 30 74-104 352-381 (409) 115 KOG1462 consensus 98.3 6.5E-07 1.7E-11 45.3 4.5 84 11-98 329-412 (433) 116 cd04651 LbH_G1P_AT_C Glucose-1 98.3 7.1E-07 1.8E-11 45.2 4.2 20 6-26 3-22 (104) 117 cd04651 LbH_G1P_AT_C Glucose-1 98.3 1.1E-06 2.7E-11 44.4 5.0 50 48-99 30-79 (104) 118 KOG4750 consensus 98.3 8.7E-07 2.2E-11 44.8 4.5 104 4-107 126-237 (269) 119 KOG1460 consensus 98.3 1.8E-06 4.5E-11 43.5 5.6 36 3-38 287-322 (407) 120 KOG1460 consensus 98.2 2.6E-06 6.7E-11 42.8 5.9 32 1-32 291-322 (407) 121 PRK11132 cysE serine acetyltra 98.2 2.4E-06 6.1E-11 43.0 5.4 84 13-100 144-229 (273) 122 PRK00844 glgC glucose-1-phosph 98.2 3.1E-06 7.8E-11 42.5 5.0 52 48-101 333-384 (409) 123 PRK04928 consensus 98.1 4.3E-06 1.1E-10 41.9 4.8 33 65-98 345-377 (405) 124 PRK03282 consensus 98.1 6E-06 1.5E-10 41.3 5.3 51 52-104 329-379 (406) 125 cd04649 LbH_THP_succinylT_puta 98.1 9.9E-06 2.5E-10 40.4 6.2 37 12-48 3-39 (147) 126 KOG3121 consensus 98.0 4.3E-06 1.1E-10 41.9 3.9 95 15-109 32-134 (184) 127 COG0110 WbbJ Acetyltransferase 98.0 5.9E-06 1.5E-10 41.4 4.3 31 77-107 125-155 (190) 128 COG0448 GlgC ADP-glucose pyrop 98.0 9E-06 2.3E-10 40.6 5.3 51 48-100 313-363 (393) 129 PRK01884 consensus 98.0 6.4E-06 1.6E-10 41.2 4.5 34 71-105 363-396 (435) 130 COG4801 Predicted acyltransfer 98.0 1.2E-05 3.2E-10 40.0 5.3 87 12-100 18-104 (277) 131 TIGR01208 rmlA_long glucose-1- 98.0 8.4E-06 2.1E-10 40.7 3.9 91 8-100 252-342 (361) 132 PRK03701 consensus 98.0 7.5E-06 1.9E-10 40.9 3.7 49 54-104 349-397 (431) 133 KOG1322 consensus 97.9 8.2E-06 2.1E-10 40.7 3.8 52 2-54 262-313 (371) 134 KOG4042 consensus 97.9 5.1E-06 1.3E-10 41.6 2.5 44 5-48 9-61 (190) 135 PRK00725 glgC glucose-1-phosph 97.9 2.5E-05 6.3E-10 38.8 5.2 51 53-105 348-398 (431) 136 PRK01884 consensus 97.8 2.2E-05 5.6E-10 39.0 4.2 53 46-100 345-397 (435) 137 KOG1462 consensus 97.8 5.5E-05 1.4E-09 37.3 5.8 59 48-109 353-411 (433) 138 TIGR02091 glgC glucose-1-phosp 97.8 1.7E-05 4.2E-10 39.5 2.8 80 15-98 335-420 (421) 139 PRK03701 consensus 97.7 4.1E-05 1E-09 37.8 4.4 51 48-100 349-399 (431) 140 COG4801 Predicted acyltransfer 97.7 8.5E-05 2.2E-09 36.5 5.4 86 16-103 16-101 (277) 141 TIGR02353 NRPS_term_dom non-ri 97.6 4.2E-05 1.1E-09 37.8 2.8 30 72-101 169-198 (719) 142 COG0448 GlgC ADP-glucose pyrop 97.5 0.00017 4.3E-09 35.3 5.2 53 52-106 311-363 (393) 143 KOG4042 consensus 97.5 0.00014 3.7E-09 35.6 4.8 32 2-33 30-64 (190) 144 TIGR02091 glgC glucose-1-phosp 97.4 8.9E-05 2.3E-09 36.4 2.8 51 52-104 352-420 (421) 145 PRK00725 glgC glucose-1-phosph 97.4 0.00017 4.3E-09 35.3 3.6 50 47-98 348-397 (431) 146 TIGR01208 rmlA_long glucose-1- 97.4 0.0003 7.7E-09 34.2 4.7 52 5-56 255-306 (361) 147 TIGR02353 NRPS_term_dom non-ri 97.3 0.00031 8E-09 34.1 4.6 37 69-105 668-704 (719) 148 KOG4750 consensus 97.2 0.001 2.7E-08 32.0 5.8 91 4-98 140-234 (269) 149 TIGR03536 DapD_gpp 2,3,4,5-tet 96.9 0.0028 7.2E-08 30.2 5.7 98 6-103 174-287 (341) 150 TIGR02092 glgD glucose-1-phosp 96.8 0.0011 2.7E-08 31.9 3.2 52 52-105 318-369 (383) 151 TIGR03535 DapD_actino 2,3,4,5- 96.6 0.0059 1.5E-07 28.8 5.9 98 6-103 149-262 (319) 152 TIGR02092 glgD glucose-1-phosp 96.6 0.0019 4.9E-08 30.8 3.1 52 46-99 318-369 (383) 153 pfam04519 DUF583 Protein of un 26.6 36 0.00092 13.0 4.8 19 7-25 3-21 (95) No 1 >cd04745 LbH_paaY_like paaY-like: This group is composed by uncharacterized proteins with similarity to the protein product of the E. coli paaY gene, which is part of the paa gene cluster responsible for phenylacetic acid degradation. Proteins in this group are expected to adopt the left-handed parallel beta-helix (LbH) structure. They contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Similarity to gamma carbonic anhydrase and Ferripyochelin Binding Protein (FBP) may suggest metal binding capacity. Probab=99.86 E-value=4.2e-22 Score=108.67 Aligned_cols=108 Identities=23% Similarity=0.366 Sum_probs=95.4 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCC---CCCCCCCCEEEEEEE----ECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9687698897099288399798154575432311---222222210011355----202211112221101122222110 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSN---AEVSDNTYVRDNAKV----GGYAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~---~~i~~~~~i~~~~~i----~~~~~i~~~~~i~~~~~~~~~~~i 73 (110) |++++||+|+|.|.+++.||++|+|+++++|+++ ..+++++.+.+++.+ ...+.+..++.++.++.+.+ +.| T Consensus 3 i~~~a~I~p~A~I~G~V~IG~~~~I~~~~~Ir~d~g~i~IG~~~~I~~n~~Ih~~~~~~~~Ig~~~~ig~~~~~~g-~~I 81 (155) T cd04745 3 VDPSSFVHPTAVLIGDVIIGKNCYIGPHASLRGDFGRIVIRDGANVQDNCVIHGFPGQDTVLEENGHIGHGAILHG-CTI 81 (155) T ss_pred CCCCEEECCCCEEEEEEEECCCCEECCCEEEECCCCCEEECCCCEECCCCEECCCCCCCCEECCCCEECCCCEECC-CEE T ss_conf 7998599999999844799899799846079638777899999797878440135999869899978998748715-177 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 111111112432201397498499888899698097 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) +++++|+.++.|.++++|++++.|+.++.+..+..| T Consensus 82 g~~~~IG~~a~I~~G~~IG~~siIgagSvV~~~~~V 117 (155) T cd04745 82 GRNALVGMNAVVMDGAVIGEESIVGAMAFVKAGTVI 117 (155) T ss_pred EEEEEECCCCEEECCCEECCCCEEEECCEECCCCCC T ss_conf 432497687599597798999799418787699683 No 2 >cd04645 LbH_gamma_CA_like Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E. coli paaY protein, and similar proteins. CAs are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionary distinct groups - alpha, beta and gamma carbonic anhydrases - which show no significant sequence identity or structural similarity. Gamma CAs are trimeric enzymes with left-handed parallel beta helix (LbH) structural domain. Probab=99.85 E-value=1.3e-21 Score=106.62 Aligned_cols=108 Identities=20% Similarity=0.306 Sum_probs=94.2 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCC---CCCCCCCCEEEEEEE----ECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9687698897099288399798154575432311---222222210011355----202211112221101122222110 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSN---AEVSDNTYVRDNAKV----GGYAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~---~~i~~~~~i~~~~~i----~~~~~i~~~~~i~~~~~~~~~~~i 73 (110) |+++|||+|+|.|.+++.||++|.|++++.|+++ ..+++++.+..++.+ ...+.+++++.+..++.+.+ +.| T Consensus 2 I~~~a~I~p~A~I~G~V~IG~~~~I~~~~~I~~d~~~i~IG~~~~I~~~~~Ih~~~~~~~~IG~~v~Ig~~~~i~g-~~I 80 (153) T cd04645 2 IDPSAFIAPNATVIGDVTLGEGSSVWFGAVLRGDVNPIRIGERTNIQDGSVLHVDPGYPTIIGDNVTVGHGAVLHG-CTI 80 (153) T ss_pred CCCCEEECCCCEEECCEEECCCCEECCCCEEECCCCCEEECCCCEECCCCCCCCCCCCCCEECCCCEECCCCEEEC-EEE T ss_conf 1898199999999772699899999658384246663599999898968543525999979899989999959924-187 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 111111112432201397498499888899698097 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) +++++|+.++.|.++++|+++++|++++.+..+..| T Consensus 81 g~~~~IG~~a~I~~gv~IG~~~vIgagsvV~~~~~v 116 (153) T cd04645 81 GDNCLIGMGAIILDGAVIGKGSIVAAGSLVPPGKVI 116 (153) T ss_pred EEEEEECCCCEECCCCEECCCCEECCCCEECCCCEE T ss_conf 300477054398569998899799449897489796 No 3 >TIGR01853 lipid_A_lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; InterPro: IPR007691 UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase (2.3.1 from EC) catalyses an early step in lipid A biosynthesis :UDP-3-O-(3-hydroxytetradecanoyl)glucosamine + (R)-3-hydroxytetradecanoyl- [acyl carrier protein] = UDP-2,3-bis(3-hydroxytetradecanoyl)glucosamine + [acyl carrier protein] Members of this family also contain a hexapeptide repeat (IPR001451 from INTERPRO). This entry represents the non-repeating region of LPXD proteins.; GO: 0016747 transferase activity transferring groups other than amino-acyl groups, 0009245 lipid A biosynthetic process. Probab=99.85 E-value=1.2e-21 Score=106.77 Aligned_cols=67 Identities=7% Similarity=0.143 Sum_probs=45.5 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCC Q ss_conf 9687698897099288399798154575432311222222210011355202211112221101122 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIV 67 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~ 67 (110) +.+++.|.|+|.|+.++.||+||.|+|+++|++++.|+++|.++.++.++..+.++.++.+.+.+++ T Consensus 116 ~~~~v~I~pnvvIG~~v~IG~~~~I~pg~~iG~~v~IG~~~~i~~~v~i~~~~~lG~nV~I~~GaVI 182 (336) T TIGR01853 116 IGEGVTIGPNVVIGAGVEIGENVIIGPGVVIGDDVVIGDGSVIYPNVVIYERVQLGKNVIIHAGAVI 182 (336) T ss_pred CCCCCEECCEEEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEEEECCEECCEEEECCCCEE T ss_conf 0784368754178888687786363577687688687996778067089724156471587688788 No 4 >cd00710 LbH_gamma_CA Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three distinct groups of carbonic anhydrases - alpha, beta and gamma - which show no significant sequence identity or structural similarity. Gamma CAs are homotrimeric enzymes, with each subunit containing a left-handed parallel beta helix (LbH) structural domain. Probab=99.85 E-value=2.3e-21 Score=105.57 Aligned_cols=104 Identities=27% Similarity=0.375 Sum_probs=94.3 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCC----CCCCCCCCEEEEEEEEC----CCCCCCCCCCCCCCCCCCCCC Q ss_conf 9687698897099288399798154575432311----22222221001135520----221111222110112222211 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSN----AEVSDNTYVRDNAKVGG----YAKVSGNASVGGNAIVRDTAE 72 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~----~~i~~~~~i~~~~~i~~----~~~i~~~~~i~~~~~~~~~~~ 72 (110) |++++||.|+|.+.+++.|++++.||++++|++| ..|++++.+.+++.++. .+.+++++.++..+.+.+.|. T Consensus 5 i~~~afIap~A~viGdV~ig~~~sIw~~aviRgD~~~~I~IG~~~nIqD~~viH~~~~~~~~IG~~vtIgh~a~ihG~c~ 84 (167) T cd00710 5 IDPSAYVHPTAVVIGDVIIGDNVFVGPGASIRADEGTPIIIGANVNIQDGVVIHALEGYSVWIGKNVSIAHGAIVHGPAY 84 (167) T ss_pred CCCCEEECCCCEEEEEEEECCCCEECCCCEEECCCCCCEEECCCCCCCCCEEEECCCCCCEEECCCCEECCCCEECCEEE T ss_conf 69993989999899708999999998887894788887698887434777078225766749779828989848812049 Q ss_pred CCCCCCCCCCCCCCCCEEECCCEEECCCCEECC Q ss_conf 011111111243220139749849988889969 Q gi|254781223|r 73 VGGDAFVIGFTVISGNARVRGNAVVGGDTVVEG 105 (110) Q Consensus 73 i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~ 105 (110) |+++++|+-++.+.+ +.|++++.|+.++.+.+ T Consensus 85 Igd~~~IG~gaiv~~-a~Ig~~~vIgags~V~g 116 (167) T cd00710 85 IGDNCFIGFRSVVFN-AKVGDNCVIGHNAVVDG 116 (167) T ss_pred ECCCCEECCCCEEEC-CEECCCCEECCCCEEEC T ss_conf 839988988989982-59769989988988847 No 5 >cd04646 LbH_Dynactin_6 Dynactin 6 (or subunit p27): Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p27 is part of the pointed-end subcomplex in dynactin that also includes p25, p26, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain the imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.84 E-value=4e-21 Score=104.63 Aligned_cols=108 Identities=12% Similarity=0.209 Sum_probs=86.2 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCC---CCCCCCCEEEEEEEEC----------CCCCCCCCCCCCCCCC Q ss_conf 96876988970992883997981545754323112---2222221001135520----------2211112221101122 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNA---EVSDNTYVRDNAKVGG----------YAKVSGNASVGGNAIV 67 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~---~i~~~~~i~~~~~i~~----------~~~i~~~~~i~~~~~~ 67 (110) |+++++|.++|.|.+++.|+++|.|||+++|+++. .+++++.+.+++.+.. .+.++.++.+...+.+ T Consensus 2 I~~~a~I~~~A~iiGdV~IG~~~sI~p~avIrgd~g~I~IG~~~nIqd~~~I~~~~~~~~~~g~~~~IG~~~~i~~g~~i 81 (164) T cd04646 2 IAPGAVVCQESEIRGDVTIGPGTVVHPRATIIAEAGPIIIGENNIIEEQVTIVNKKPKDPAEPKPMIIGSNNVFEVGCKC 81 (164) T ss_pred CCCCCEECCCCEEEEEEEECCCCEECCCCEEEECCCCEEECCCCEECCCEEEECCCCCCCCCCCCEEECCCCEECCCEEE T ss_conf 48987999999998217999999899575998143788989996666985994178777789988798899888678199 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 222110111111112432201397498499888899698097 Q gi|254781223|r 68 RDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 68 ~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) . .+.|+++++|+-++.|.+++.|++++.|++++.+..+..+ T Consensus 82 h-~~~IG~~~lIg~~a~V~~ga~Ig~~ciIgags~V~~~~~i 122 (164) T cd04646 82 E-ALKIGNNNVFESKSFVGKNVIITDGCIIGAGCKLPSSEIL 122 (164) T ss_pred E-CCEECCCCEECCCCEECCCCEECCCCEECCCCCCCCCCEE T ss_conf 5-5597678889689899999899999899989892899792 No 6 >PRK00892 lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional Probab=99.84 E-value=5.1e-21 Score=104.18 Aligned_cols=56 Identities=7% Similarity=0.139 Sum_probs=24.7 Q ss_pred CCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCC Q ss_conf 87698897099288399798154575432311222222210011355202211112 Q gi|254781223|r 3 DNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGN 58 (110) Q Consensus 3 ~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~ 58 (110) +|++|+|++.|.++++|++++.|++++.|++++.+++++.++.++.++.++.++.+ T Consensus 115 ~~v~Igp~~vIg~~v~IG~~~~I~~~~~Ig~~v~IG~~~~I~~nv~I~~~~~IG~~ 170 (343) T PRK00892 115 KSVSIGPNAVIGAGVEIGDGVIIGAGAVIGAGVKIGDDCRLHANVTIYHRVQIGNR 170 (343) T ss_pred CCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCEEECCC T ss_conf 99889999899999899999999298199588188899588118566255588887 No 7 >cd04650 LbH_FBP Ferripyochelin Binding Protein (FBP): FBP is an outer membrane protein which plays a role in iron acquisition. It binds iron when it is complexed with pyochelin. It adopts the left-handed parallel beta-helix (LbH) structure, and contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Acyltransferase activity has not been observed in this group. Probab=99.83 E-value=9.6e-21 Score=103.02 Aligned_cols=108 Identities=20% Similarity=0.322 Sum_probs=94.2 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCC---CCCCCCCCEEEEEEEEC----CCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9687698897099288399798154575432311---22222221001135520----2211112221101122222110 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSN---AEVSDNTYVRDNAKVGG----YAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~---~~i~~~~~i~~~~~i~~----~~~i~~~~~i~~~~~~~~~~~i 73 (110) |.+++||+|+|.|.+++.||++++||+++.|+++ ..+++++.+.+++.+.. .+.+++++.++.++.+.+ +.| T Consensus 3 I~~~~~I~p~a~i~G~v~IG~~~~I~~~~~Ir~d~~~i~IG~~~~I~~n~~I~~~~~~~~~ig~~v~ig~~~~i~g-~~I 81 (154) T cd04650 3 ISPKAYVHPTSYVIGDVVIGELTSVWHYAVIRGDNDSIYIGKYSNVQENVSIHTDHGYPTEIGDYVTIGHNAVVHG-AKV 81 (154) T ss_pred CCCCEEECCCCEEECEEEECCCCEECCCCEEECCCCCEEECCCEEECCCCEEEECCCCCCEECCCCEECCCEEEEC-CEE T ss_conf 6998799999999760599999999875389558876599993398999889846999868799768899528823-022 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 111111112432201397498499888899698097 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) +++++|+.++.|.+++.|+++++|++++.+..+..| T Consensus 82 g~~v~IG~~a~I~~Gv~IG~~svIgaGsVV~~~~~V 117 (154) T cd04650 82 GNYVIVGMGAILLNGAKIGDHVIIGAGAVVTPGKEI 117 (154) T ss_pred CCEEEECCCEEEECCCEECCCCEECCCCEECCCCEE T ss_conf 013598872299628798999799889888799791 No 8 >TIGR03570 NeuD_NnaD sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family. These proteins contain repeats of the bacterial transferase hexapeptide (pfam00132), although often these do not register above the trusted cutoff. Probab=99.83 E-value=5.8e-21 Score=103.94 Aligned_cols=103 Identities=17% Similarity=0.242 Sum_probs=99.5 Q ss_pred EEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 69889709928839979815457543231122222221001135520221111222110112222211011111111243 Q gi|254781223|r 5 AVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTV 84 (110) Q Consensus 5 ~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~ 84 (110) ..|+|+|.|+++++|++++.|+|++.|.+++.+++++.+..++.+++++.+++++++.+++.+.+++.|+++++++.++. T Consensus 88 ~lIhp~a~i~~~~~ig~G~~I~~~~~i~~~~~IG~~~~i~~~~~I~Hd~~ig~~~~i~~~~~i~G~v~Ig~~~~iG~~~~ 167 (201) T TIGR03570 88 TLIHPSAIVSPSASIGEGTVIMAGAVINPDVRIGDNVIINTGAIVEHDCVIGDYVHIAPGVTLSGGVVIGEGVFIGAGAT 167 (201) T ss_pred EEECCCCEECCCCEECCCCEEECCCEECCCCEEEEEEEECCCCCCCCCCEECCCCEECCCCEECCCCEECCCCEECCCCE T ss_conf 99999809889867879989962869876647802679804232144254778768999859989887977735989889 Q ss_pred CCCCEEECCCEEECCCCEECCCE Q ss_conf 22013974984998888996980 Q gi|254781223|r 85 ISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 85 I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) |.++++|++++.|++++.|..|. T Consensus 168 i~~~i~Ig~~~~Igags~V~~~v 190 (201) T TIGR03570 168 IIQGVTIGAGAIVGAGAVVTKDI 190 (201) T ss_pred ECCCCEECCCCEECCCCEEECCC T ss_conf 88998999999999798890437 No 9 >PRK13627 carnitine operon protein CaiE; Provisional Probab=99.83 E-value=6.9e-21 Score=103.60 Aligned_cols=108 Identities=22% Similarity=0.278 Sum_probs=96.8 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCC---CCCCCCCCEEEEEEEEC----CCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9687698897099288399798154575432311---22222221001135520----2211112221101122222110 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSN---AEVSDNTYVRDNAKVGG----YAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~---~~i~~~~~i~~~~~i~~----~~~i~~~~~i~~~~~~~~~~~i 73 (110) |++++||.|+|.+.+++.|++++.||+++++++| ..+++++.+.+++.++. .+.++++++++..+.+++ |.| T Consensus 13 i~~~afIap~A~viGdV~ig~~~sIw~~aVlRGD~~~I~IG~~tNIQD~~viH~~~~~~~~IG~~vtIGH~ailhg-c~I 91 (196) T PRK13627 13 VHPTAFVHPSAVLIGDVIVGAGVYIGPLASLRGDYGRLIVQAGANLQDGCIMHGYCDTDTIVGENGHIGHGAILHG-CVI 91 (196) T ss_pred CCCCEEECCCCEEEEEEEECCCCEECCCEEEECCCCCEEECCCCEECCCCEEECCCCCCCEECCCEEECCCEEEEE-EEE T ss_conf 2998388999989984999999889777089568871798998676798378148999858889805878729963-499 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 111111112432201397498499888899698097 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) +++++|+-++.|.+++.|++++.|++++.+..+..+ T Consensus 92 gd~~lIGmgavVldga~Ig~~~iI~AgslV~~g~~i 127 (196) T PRK13627 92 GRDALVGMNSVIMDGAVIGEESIVAAMSFVKAGFQG 127 (196) T ss_pred ECCCEECCCCEEECCCEECCCCEECCCCEECCCCCC T ss_conf 489788469799288899999899768645799493 No 10 >TIGR01852 lipid_A_lpxA acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase; InterPro: IPR010137 This entry describes LpxA, an enzyme for the biosynthesis of lipid A, a component of lipopolysaccharide (LPS) in the outer membrane outer leaflet of most Gram-negative bacteria. Some differences are found between lipid A of different species, but this protein represents the first step (from UDP-N-acetyl-D-glucosamine) and appears to be conserved in function. Proteins from this family contain many copies of the bacterial transferase hexapeptide repeat.; GO: 0008780 acyl-[acyl-carrier-protein]-UDP-N-acetylglucosamine O-acyltransferase activity, 0008610 lipid biosynthetic process. Probab=99.83 E-value=9.3e-21 Score=103.07 Aligned_cols=109 Identities=9% Similarity=0.260 Sum_probs=69.2 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEE------------------------------ Q ss_conf 96876988970992883997981545754323112222222100113552------------------------------ Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVG------------------------------ 50 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~------------------------------ 50 (110) ||+.|+|.|.|+|.++|+||+-|.|.++++|+.++++.+++.+...+.++ T Consensus 1 IHPTA~ie~gA~iG~~V~iGPf~~v~~~V~ig~g~~l~shV~i~G~T~iG~~~~i~p~A~iG~~PQdlky~GE~t~l~IG 80 (257) T TIGR01852 1 IHPTAIIEPGAEIGEDVEIGPFCIVGPGVKIGDGVELKSHVVIEGHTTIGEGTKIFPGAVIGGEPQDLKYKGERTELIIG 80 (257) T ss_pred CCCCEEECCCCEECCCCEEEEEEEECCCCEECCCCEECCCEEEECCEEECCCCEEECCEEECCCCCCCEECCCCCEEEEC T ss_conf 99621878866828985780188878975886885875736896770878998880760767988565014870179988 Q ss_pred CCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 022111122211011222-22110111111112432201397498499888899698097 Q gi|254781223|r 51 GYAKVSGNASVGGNAIVR-DTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 51 ~~~~i~~~~~i~~~~~~~-~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) ++++|++.+++.....-. +.+.||++++++-+++|.+||.||+++.+.+++++.+.+.| T Consensus 81 ~~n~IRE~VTi~~GT~~g~g~T~iG~~nllMAysHvAHDC~vGn~vv~aN~a~LAGHV~v 140 (257) T TIGR01852 81 DNNTIREFVTINRGTKSGGGVTSIGNNNLLMAYSHVAHDCVVGNHVVLANNATLAGHVEV 140 (257) T ss_pred CCCEEEEEEEECCCCCCCCCEEEECCCCEECCCCEEEECCEECCEEEEECCCEECEEEEE T ss_conf 965697557850662588888998887657205356124368881698136312227998 No 11 >cd03360 LbH_AT_putative Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain. The alignment contains 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. A few members are identified as NeuD, a sialic acid (Sia) O-acetyltransferase that is required for Sia synthesis and surface polysaccharide sialylation. Probab=99.82 E-value=1.8e-20 Score=101.90 Aligned_cols=103 Identities=17% Similarity=0.290 Sum_probs=99.4 Q ss_pred EEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 69889709928839979815457543231122222221001135520221111222110112222211011111111243 Q gi|254781223|r 5 AVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTV 84 (110) Q Consensus 5 ~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~ 84 (110) ..|+|+|.+++++.+++++.|++++.|.+++.+++++.+..++.+++++.+++++++.+.+.+.+++.|+++++++.++. T Consensus 85 niIhp~a~i~~~~~ig~G~~I~~~~~i~~~~~IG~~~~i~~~~~i~Hd~~ig~~~~i~~~~~i~g~v~Ig~~~~iG~~~~ 164 (197) T cd03360 85 TLIHPSAVVSPSAVIGEGCVIMAGAVINPDARIGDNVIINTGAVIGHDCVIGDFVHIAPGVVLSGGVTIGEGAFIGAGAT 164 (197) T ss_pred EEECCCCEECCCCEECCCCEEECCCEECCCCCCCCEEEECCCHHCCCCCCCCCCCEECCCEEECCCEEEEEECCCCCCCE T ss_conf 99999959877747999429931759865653043689877001053150131010667236778308912162989989 Q ss_pred CCCCEEECCCEEECCCCEECCCE Q ss_conf 22013974984998888996980 Q gi|254781223|r 85 ISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 85 I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) |.++++|++++.|++++.|.+|. T Consensus 165 i~~~i~Ig~~~~igags~v~~~i 187 (197) T cd03360 165 IIQGVTIGAGAIIGAGAVVTKDV 187 (197) T ss_pred ECCCCEECCCCEECCCCEEEEEC T ss_conf 87998999999999799891256 No 12 >cd03351 LbH_UDP-GlcNAc_AT UDP-N-acetylglucosamine O-acyltransferase (UDP-GlcNAc acyltransferase): Proteins in this family catalyze the transfer of (R)-3-hydroxymyristic acid from its acyl carrier protein thioester to UDP-GlcNAc. It is the first enzyme in the lipid A biosynthetic pathway and is also referred to as LpxA. Lipid A is essential for the growth of Escherichia coli and related bacteria. It is also essential for maintaining the integrity of the outer membrane. UDP-GlcNAc acyltransferase is a homotrimer of left-handed parallel beta helix (LbH) subunits. Each subunit contains an N-terminal LbH region with 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal alpha-helical region. Probab=99.81 E-value=3e-20 Score=100.94 Aligned_cols=53 Identities=8% Similarity=0.174 Sum_probs=23.2 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCC Q ss_conf 96876988970992883997981545754323112222222100113552022 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYA 53 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~ 53 (110) ||+.|.|+|+|.|.+++.|++.|.|++++.|++++.+..++.+..++.++.++ T Consensus 2 IHptAiI~~~A~IG~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~ 54 (254) T cd03351 2 IHPTAIVDPGAKIGENVEIGPFCVIGPNVEIGDGTVIGSHVVIDGPTTIGKNN 54 (254) T ss_pred CCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCE T ss_conf 18977969989999979999998999999999999997996891997977652 No 13 >PRK05289 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=99.81 E-value=4.9e-20 Score=100.08 Aligned_cols=56 Identities=9% Similarity=0.204 Sum_probs=23.8 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCC Q ss_conf 96876988970992883997981545754323112222222100113552022111 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVS 56 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~ 56 (110) ||+.|.|.|+|+|.+++.|++.|.|++++.|++++.++.++.+..++.++.++.+. T Consensus 5 IHptAiI~~~A~IG~~v~Igp~~vIg~~v~IG~~~~I~~~v~I~~~t~IG~~~~I~ 60 (261) T PRK05289 5 IHPTAIVEPGAEIGENVEIGPFCVIGPDVVIGDGTEIGSHVVIDGHTTIGKNNRIF 60 (261) T ss_pred CCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEEC T ss_conf 08988989989999989999999999999999999997996891898889883452 No 14 >PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=99.81 E-value=3.9e-20 Score=100.49 Aligned_cols=58 Identities=10% Similarity=0.149 Sum_probs=28.1 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCC Q ss_conf 9687698897099288399798154575432311222222210011355202211112 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGN 58 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~ 58 (110) ||+.|.|+|+|+|.++++|++.|+|+++++|++++.++.++.+..++.++.++.+.++ T Consensus 3 IHptAiI~~~A~Ig~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~ 60 (256) T PRK12461 3 IHPTAVIDPSAKLGEGVEIGPFAVIGADVEIGDGTWIMPHAVILGWTRIGKNNKIHQG 60 (256) T ss_pred CCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEEEEC T ss_conf 9998798998999999999999999999999999999498189199899974399754 No 15 >cd03352 LbH_LpxD UDP-3-O-acyl-glucosamine N-acyltransferase (LpxD): The enzyme catalyzes the transfer of 3-hydroxymyristic acid or 3-hydroxy-arachidic acid, depending on the organism, from the acyl carrier protein (ACP) to UDP-3-O-acyl-glucosamine to produce UDP-2,3-diacyl-GlcNAc. This constitutes the third step in the lipid A biosynthetic pathway in Gram-negative bacteria. LpxD is a homotrimer, with each subunit consisting of a novel combination of an N-terminal uridine-binding domain, a core lipid-binding left-handed parallel beta helix (LbH) domain, and a C-terminal alpha-helical extension. The LbH domain contains 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=99.81 E-value=7.9e-20 Score=99.20 Aligned_cols=61 Identities=7% Similarity=0.109 Sum_probs=31.4 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCC Q ss_conf 9687698897099288399798154575432311222222210011355202211112221 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASV 61 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i 61 (110) |.++++|+++|.|.++++|++++.|+|++.|++++++++++.+..++.++..+.+++++.+ T Consensus 4 ig~~v~I~~~avI~~~v~Ig~n~~I~~~~~I~~~v~IG~~~~I~~~~~I~~~~~IG~~~~I 64 (205) T cd03352 4 IGENVSIGPNAVIGEGVVIGDGVVIGPGVVIGDGVVIGDDCVIHPNVTIYEGCIIGDRVII 64 (205) T ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCEEEE T ss_conf 1999999998799999999999999898899999688988366003447278687544777 No 16 >cd03351 LbH_UDP-GlcNAc_AT UDP-N-acetylglucosamine O-acyltransferase (UDP-GlcNAc acyltransferase): Proteins in this family catalyze the transfer of (R)-3-hydroxymyristic acid from its acyl carrier protein thioester to UDP-GlcNAc. It is the first enzyme in the lipid A biosynthetic pathway and is also referred to as LpxA. Lipid A is essential for the growth of Escherichia coli and related bacteria. It is also essential for maintaining the integrity of the outer membrane. UDP-GlcNAc acyltransferase is a homotrimer of left-handed parallel beta helix (LbH) subunits. Each subunit contains an N-terminal LbH region with 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal alpha-helical region. Probab=99.80 E-value=7.3e-20 Score=99.34 Aligned_cols=106 Identities=16% Similarity=0.264 Sum_probs=52.4 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEE-------------------------------- Q ss_conf 687698897099288399798154575432311222222210011355-------------------------------- Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKV-------------------------------- 49 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i-------------------------------- 49 (110) .+|+.|+|.+.|.++++|+++|.|++++.|.+++.+++++.++..+.+ T Consensus 15 G~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avIG~~pqd~~~~g~~~~v~IG~~~~Ire~vtI~r 94 (254) T cd03351 15 GENVEIGPFCVIGPNVEIGDGTVIGSHVVIDGPTTIGKNNRIFPFASIGEAPQDLKYKGEPTRLEIGDNNTIREFVTIHR 94 (254) T ss_pred CCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEECCCCCCCCCCCCCCCCCCCCEEEECCCCEECCCCCCCC T ss_conf 99799999989999999999999979968919979776525605753477876767379867699799778995330236 Q ss_pred -----------------------ECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCC Q ss_conf -----------------------202211112221101122222110111111112432201397498499888899698 Q gi|254781223|r 50 -----------------------GGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGD 106 (110) Q Consensus 50 -----------------------~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~ 106 (110) ++++.+++++.+...+.+.+++.+++++++++++.+...++||++++|++++.+.+| T Consensus 95 gt~~~~~~T~IG~~~~im~~~HIaHdc~IG~~~iian~~~laGhv~Igd~a~IGg~~~v~q~v~IG~~a~Igags~V~kD 174 (254) T cd03351 95 GTAQGGGVTRIGNNNLLMAYVHVAHDCVIGNNVILANNATLAGHVEIGDYAIIGGLSAVHQFCRIGRHAMVGGGSGVVQD 174 (254) T ss_pred CCCCCCCCEEECCCCEEEECCCCCCCCCCCCCEEEECCCCCCCCEEECCCCEECCCCEECCCCEECCCEEECCCCEEECC T ss_conf 74378875597888578202544567733797599334455685698998489883677598199966199767456113 Q ss_pred E Q ss_conf 0 Q gi|254781223|r 107 T 107 (110) Q Consensus 107 ~ 107 (110) . T Consensus 175 V 175 (254) T cd03351 175 V 175 (254) T ss_pred C T ss_conf 6 No 17 >PRK05289 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=99.80 E-value=9e-20 Score=98.96 Aligned_cols=47 Identities=9% Similarity=0.143 Sum_probs=23.2 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 68769889709928839979815457543231122222221001135 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAK 48 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~ 48 (110) .+|+.|+|.+.|.+++.||++|.|+++++|.+++.+++++.++..+. T Consensus 18 G~~v~Igp~~vIg~~v~IG~~~~I~~~v~I~~~t~IG~~~~I~~~av 64 (261) T PRK05289 18 GENVEIGPFCVIGPDVVIGDGTEIGSHVVIDGHTTIGKNNRIFPFAS 64 (261) T ss_pred CCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCCCC T ss_conf 99899999999999999999999979968918988898834524753 No 18 >PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional Probab=99.80 E-value=1e-19 Score=98.72 Aligned_cols=49 Identities=12% Similarity=0.163 Sum_probs=30.3 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 9687698897099288399798154575432311222222210011355 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKV 49 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i 49 (110) |.+|+.|.|.+.|.++++||++|.|+++++|.+++.+++++.++..+.+ T Consensus 15 Ig~~v~Igp~~vIg~~V~IG~~~~I~~~v~I~~~t~IG~~~~I~~~avI 63 (256) T PRK12461 15 LGEGVEIGPFAVIGADVEIGDGTWIMPHAVILGWTRIGKNNKIHQGAVV 63 (256) T ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCEEEEECCCC T ss_conf 9999999999999999999999999498189199899974399754311 No 19 >PRK00892 lpxD UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase; Provisional Probab=99.80 E-value=7.3e-20 Score=99.34 Aligned_cols=61 Identities=13% Similarity=0.219 Sum_probs=23.3 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCC Q ss_conf 6876988970992883997981545754323112222222100113552022111122211 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVG 62 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~ 62 (110) |+++.|+++|.|.+++.|+++++|++++.|++++.++.++.+..++.++.++.+.+++.+. T Consensus 102 h~sA~I~~~a~Ig~~v~Igp~~vIg~~v~IG~~~~I~~~~~Ig~~v~IG~~~~I~~nv~I~ 162 (343) T PRK00892 102 HPSAVIDPTAKIGKSVSIGPNAVIGAGVEIGDGVIIGAGAVIGAGVKIGDDCRLHANVTIY 162 (343) T ss_pred CCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEEC T ss_conf 9753618877879998899998999998999999992981995881888995881185662 No 20 >cd03350 LbH_THP_succinylT 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (also called THP succinyltransferase): THDP N-succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is homotrimeric and each subunit contains an N-terminal region with alpha helices and hairpin loops, as well as a C-terminal region with a left-handed parallel alpha-helix (LbH) structural motif encoded by hexapeptide repeat motifs. Probab=99.79 E-value=1.5e-19 Score=98.06 Aligned_cols=104 Identities=22% Similarity=0.217 Sum_probs=86.5 Q ss_pred EEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCC--------CCCCCC Q ss_conf 698897099288399798154575432311222222210011355202211112221101122222--------110111 Q gi|254781223|r 5 AVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDT--------AEVGGD 76 (110) Q Consensus 5 ~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~--------~~i~~~ 76 (110) ..|+|+|.|.++++|++++.|++++.|..++++++++.+..++.+++.+.++.++++..++.+.+. +.+.++ T Consensus 2 ~rI~P~A~I~~ga~IG~g~vI~~~a~I~~g~~IG~~~~I~~~~~Ig~~~~IG~~v~I~~g~~i~g~~~p~~~~~v~IeD~ 81 (139) T cd03350 2 RRVPPGAIIRDGAFIGPGAVLMMPSYVNIGAYVDEGTMVDSWATVGSCAQIGKNVHLSAGAVIGGVLEPLQATPVIIEDD 81 (139) T ss_pred EEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCCCCCCCCCEECCC T ss_conf 07499829889999999989988999843989898727866660268669999328988867844425764688688799 Q ss_pred CCCCCCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 11111243220139749849988889969809 Q gi|254781223|r 77 AFVIGFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 77 ~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) ++|+.++.|...+.|++++.|++++.+.+|.. T Consensus 82 ~~IGa~a~V~~Gv~IG~~avIGAGsVVtkdvp 113 (139) T cd03350 82 VFIGANCEVVEGVIVGKGAVLAAGVVLTQSTP 113 (139) T ss_pred EEEEECCEEECCCEECCCCEECCCCEEECCCC T ss_conf 59930998946879999989997988918988 No 21 >COG1044 LpxD UDP-3-O-[3-hydroxymyristoyl] Probab=99.78 E-value=3.8e-19 Score=96.34 Aligned_cols=68 Identities=13% Similarity=0.172 Sum_probs=37.7 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 68769889709928839979815457543231122222221001135520221111222110112222 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRD 69 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~ 69 (110) .+++.|++++.|++++.|++|+.|++++.|++++.|++++.+..++.++.++.++.++.+...+.+.. T Consensus 115 ~~~~~ig~~vvI~~~v~IG~~~~I~~~avIg~~~~IG~~~~i~~~v~I~~~~~IG~~v~I~~GavIG~ 182 (338) T COG1044 115 GKNVSIGPNVVIGAGVVIGENVVIGAGAVIGENVKIGDGTVIHPNVTIYHNVVIGNNVIIHSGAVIGA 182 (338) T ss_pred CCCCCCCCCEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCEEECCCCEECC T ss_conf 77873288749888978789858988978888968888858847878952757878228889988756 No 22 >COG0663 PaaY Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [General function prediction only] Probab=99.77 E-value=4.3e-19 Score=96.15 Aligned_cols=107 Identities=19% Similarity=0.311 Sum_probs=94.2 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCC---CCCCCCCEEEEEEEEC----CCCCCCCCCCCCCCCCCCCCCC Q ss_conf 96876988970992883997981545754323112---2222221001135520----2211112221101122222110 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNA---EVSDNTYVRDNAKVGG----YAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~---~i~~~~~i~~~~~i~~----~~~i~~~~~i~~~~~~~~~~~i 73 (110) |++++||.|+|.|.+++.|++++.||+++++++|. .++.++.+-+++.++. .+.++++++++..+.+++ |.| T Consensus 14 i~~~a~Va~~A~viGdV~Ig~~vsIw~~aVlRgD~~~I~IG~~tNIQDg~ViH~~~~~p~~IG~~vtIGH~aivHG-c~I 92 (176) T COG0663 14 IDPTAFVAPSATVIGDVRIGAGVSIWPGAVLRGDVEPIRIGARTNIQDGVVIHADPGYPVTIGDDVTIGHGAVVHG-CTI 92 (176) T ss_pred CCCCEEECCCCEEEEEEEECCCCEECCCEEEECCCCCEEECCCCEECCCEEEECCCCCCEEECCCCEECCCCEEEE-EEE T ss_conf 8986287799979985999999788776389746775698788560378389527999929779958857658988-598 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 11111111243220139749849988889969809 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) +++++|+=++.+.+++.|++++.|+.++.|..... T Consensus 93 g~~~lIGmgA~vldga~IG~~~iVgAgalV~~~k~ 127 (176) T COG0663 93 GDNVLIGMGATVLDGAVIGDGSIVGAGALVTPGKE 127 (176) T ss_pred CCCCEEECCCEEECCCEECCCCEECCCCCCCCCCC T ss_conf 78858913866757869889869866870058837 No 23 >TIGR03532 DapD_Ac 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase. Alternate name: tetrahydrodipicolinate N-acetyltransferase. Note that IUBMB lists this alternate name as the accepted name. Unfortunately, the related succinyl transferase acting on the same substrate (EC:2.3.1.117, TIGR00695) uses the opposite standard. We have decided to give these two enzymes names which more clearly indicated that they act on the same substrate. Probab=99.77 E-value=7.5e-19 Score=95.12 Aligned_cols=104 Identities=27% Similarity=0.356 Sum_probs=99.7 Q ss_pred CEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCC--------CCCCCC Q ss_conf 769889709928839979815457543231122222221001135520221111222110112222--------211011 Q gi|254781223|r 4 NAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRD--------TAEVGG 75 (110) Q Consensus 4 n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~--------~~~i~~ 75 (110) |+.|.|.|.|.+.+.||+|+.|++++.|.-++.+++++.+..++.+++.+.++.+++++..+.+.+ .+.|++ T Consensus 86 ~aRIepgaiir~~v~IG~~aVImmga~INiGA~IGegtmId~~Avvg~~a~IGk~~HisaGa~laGVleP~~a~PViIeD 165 (231) T TIGR03532 86 NARIEPGAIIRDQVIIGDNAVIMMGAVINIGAEIGEGTMIDMNAVLGGRATVGKNVHIGAGAVLAGVIEPPSAKPVVIED 165 (231) T ss_pred CCEECCCCEEECCEEECCCEEEECCCEEECCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCCCCCCCCEEECC T ss_conf 83577885884252987983991388870485987894580585644546887970588986430022576679979878 Q ss_pred CCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 11111124322013974984998888996980 Q gi|254781223|r 76 DAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 76 ~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +++|+.++.|.++++|++++.|+.++.+.+|. T Consensus 166 nV~IGAnAvIl~GV~IG~gsVIgAGsVVtkdV 197 (231) T TIGR03532 166 NVLIGANAVILEGVRVGKGAVVAAGAIVTEDV 197 (231) T ss_pred CEEECCCCEECCCCEECCCCEECCCCEECCCC T ss_conf 85987997991998999999998498895657 No 24 >TIGR01852 lipid_A_lpxA acyl-[acyl-carrier-protein]--UDP-N-acetylglucosamine O-acyltransferase; InterPro: IPR010137 This entry describes LpxA, an enzyme for the biosynthesis of lipid A, a component of lipopolysaccharide (LPS) in the outer membrane outer leaflet of most Gram-negative bacteria. Some differences are found between lipid A of different species, but this protein represents the first step (from UDP-N-acetyl-D-glucosamine) and appears to be conserved in function. Proteins from this family contain many copies of the bacterial transferase hexapeptide repeat.; GO: 0008780 acyl-[acyl-carrier-protein]-UDP-N-acetylglucosamine O-acyltransferase activity, 0008610 lipid biosynthetic process. Probab=99.75 E-value=1.6e-18 Score=93.73 Aligned_cols=107 Identities=17% Similarity=0.262 Sum_probs=61.6 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCC-------------------------------------CCCCCCE Q ss_conf 968769889709928839979815457543231122-------------------------------------2222210 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAE-------------------------------------VSDNTYV 43 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~-------------------------------------i~~~~~i 43 (110) |.+++.|.+++.|.+.++||+||.|+|+|.|+...| +++|+.+ T Consensus 31 ig~g~~l~shV~i~G~T~iG~~~~i~p~A~iG~~PQdlky~GE~t~l~IG~~n~IRE~VTi~~GT~~g~g~T~iG~~nll 110 (257) T TIGR01852 31 IGDGVELKSHVVIEGHTTIGEGTKIFPGAVIGGEPQDLKYKGERTELIIGDNNTIREFVTINRGTKSGGGVTSIGNNNLL 110 (257) T ss_pred ECCCCEECCCEEEECCEEECCCCEEECCEEECCCCCCCEECCCCCEEEECCCCEEEEEEEECCCCCCCCCEEEECCCCEE T ss_conf 86885875736896770878998880760767988565014870179988965697557850662588888998887657 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 0113552022111122211011222221101111111124322013974984998888996980 Q gi|254781223|r 44 RDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 44 ~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) ..++++.++|.+++++.+..++.+.+++.|++.+.|++.+-++..|+||..++|++.+-|..|. T Consensus 111 MAysHvAHDC~vGn~vv~aN~a~LAGHV~vgD~a~iGG~~avHQFvRIG~~aMigG~s~v~~Dv 174 (257) T TIGR01852 111 MAYSHVAHDCVVGNHVVLANNATLAGHVEVGDYAIIGGLVAVHQFVRIGRYAMIGGLSAVSKDV 174 (257) T ss_pred CCCCEEEECCEECCEEEEECCCEECEEEEECCEEEECCCCCEEEEEEEHHHHHHHHHCCCCCCC T ss_conf 2053561243688816981363122279988899977897211013300023221200246787 No 25 >COG1044 LpxD UDP-3-O-[3-hydroxymyristoyl] Probab=99.74 E-value=7.9e-18 Score=90.86 Aligned_cols=43 Identities=12% Similarity=0.270 Sum_probs=14.1 Q ss_pred CEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEE Q ss_conf 7698897099288399798154575432311222222210011 Q gi|254781223|r 4 NAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDN 46 (110) Q Consensus 4 n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~ 46 (110) |+.|.+++.|+++++||++++|++++.|+.++.+++++.+... T Consensus 135 ~~~I~~~avIg~~~~IG~~~~i~~~v~I~~~~~IG~~v~I~~G 177 (338) T COG1044 135 NVVIGAGAVIGENVKIGDGTVIHPNVTIYHNVVIGNNVIIHSG 177 (338) T ss_pred CCEECCCCEECCCCEECCCCEECCCCEEECCCEECCCEEECCC T ss_conf 8589889788889688888588478789527578782288899 No 26 >cd05636 LbH_G1P_TT_C_like Putative glucose-1-phosphate thymidylyltransferase, C-terminal Left-handed parallel beta-Helix (LbH) domain: Proteins in this family show simlarity to glucose-1-phosphate adenylyltransferases in that they contain N-terminal catalytic domains that resemble a dinucleotide-binding Rossmann fold and C-terminal LbH fold domains. Members in this family are predicted to be glucose-1-phosphate thymidylyltransferases, which are involved in the dTDP-L-rhamnose biosynthetic pathway. Glucose-1-phosphate thymidylyltransferase catalyzes the synthesis of deoxy-thymidine di-phosphate (dTDP)-L-rhamnose, an important component of the cell wall of many microorganisms. The C-terminal LbH domain contains multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.73 E-value=8.7e-18 Score=90.70 Aligned_cols=48 Identities=17% Similarity=0.226 Sum_probs=24.5 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 687698897099288399798154575432311222222210011355 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKV 49 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i 49 (110) ++++.|.+++.|.+++.|+++|.|++++.|++++.+++++.+..++.+ T Consensus 21 g~~a~I~~~a~I~~~v~IG~~~~Ig~~~~I~~~~~IG~~~~IG~~~~I 68 (163) T cd05636 21 GEGAIVRSGAYIEGPVIIGKGCEIGPNAYIRGYTVLGDGCVVGNSVEV 68 (163) T ss_pred CCCCEECCCCEEECCEEECCCCEEEECCCCCCCEEECCEEEECCCEEE T ss_conf 999999999999797599899899364122698586130120551242 No 27 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=99.71 E-value=1.5e-17 Score=89.72 Aligned_cols=37 Identities=16% Similarity=0.304 Sum_probs=25.6 Q ss_pred CCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 1101111111124322013974984998888996980 Q gi|254781223|r 71 AEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 71 ~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +.||++++||+|+.+=-.++||+++.|+++++|..|+ T Consensus 401 T~IGd~VFiGSnt~LVAPV~iG~gA~iaAGstvt~DV 437 (461) T TIGR01173 401 TIIGDGVFIGSNTQLVAPVKIGDGATIAAGSTVTKDV 437 (461) T ss_pred CEECCCCEECCCCEEECCEEECCCCEECCCCEEECCC T ss_conf 4864887877676045543943811871241480234 No 28 >cd03359 LbH_Dynactin_5 Dynactin 5 (or subunit p25); Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p25 is part of the pointed-end subcomplex in dynactin that also includes p26, p27, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.70 E-value=3.5e-17 Score=88.17 Aligned_cols=104 Identities=14% Similarity=0.222 Sum_probs=82.3 Q ss_pred EEECCCEEECC--CCEECCCCEECCCCCCCCC---CCCCCCCCEEEEEEEEC------------CCCCCCCCCCCCCCCC Q ss_conf 69889709928--8399798154575432311---22222221001135520------------2211112221101122 Q gi|254781223|r 5 AVVRDCATVID--DARVSGNASVSRFAQVKSN---AEVSDNTYVRDNAKVGG------------YAKVSGNASVGGNAIV 67 (110) Q Consensus 5 ~~I~~~a~I~~--~~~I~~n~~I~~~~~i~~~---~~i~~~~~i~~~~~i~~------------~~~i~~~~~i~~~~~~ 67 (110) -+|+.+|.|.+ |+.|++++.||++++|++| ..+++++.+.+++.+.. ...+++++.++.++.+ T Consensus 8 n~I~k~a~I~G~~nI~IG~~ssI~~~avirGD~~~I~IG~~~~I~d~~~I~~~~~~~~~~~~~~p~~IGd~v~Ig~~~~i 87 (161) T cd03359 8 NKVSRKSVICGSQNIVLNGKTIIQSDVIIRGDLATVSIGRYCILSEGCVIRPPFKKFSKGVAFFPLHIGDYVFIGENCVV 87 (161) T ss_pred CEECCCCEEECCCCEEECCCCEECCCCEEECCCCCEEECCCCEECCCCEEECCCCCCCCCCCCCCCEECCCEEECCCEEE T ss_conf 95877629978877599998789999689678865499998588899899246443577888877284898798998288 Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 222110111111112432201397498499888899698097 Q gi|254781223|r 68 RDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 68 ~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) . .+.++.+++|+.++.|++++.|++++.|++++.+..++.| T Consensus 88 ~-~~~Ig~~v~IG~~~~I~~g~~Ig~~~~I~~gsvV~~~~~i 128 (161) T cd03359 88 N-AAQIGSYVHIGKNCVIGRRCIIKDCVKILDGTVVPPDTVI 128 (161) T ss_pred E-CCEECCCEEECCCCEECCCCEECCCCEECCCCEECCCCEE T ss_conf 4-8896677155798199499799999899889898899898 No 29 >cd05636 LbH_G1P_TT_C_like Putative glucose-1-phosphate thymidylyltransferase, C-terminal Left-handed parallel beta-Helix (LbH) domain: Proteins in this family show simlarity to glucose-1-phosphate adenylyltransferases in that they contain N-terminal catalytic domains that resemble a dinucleotide-binding Rossmann fold and C-terminal LbH fold domains. Members in this family are predicted to be glucose-1-phosphate thymidylyltransferases, which are involved in the dTDP-L-rhamnose biosynthetic pathway. Glucose-1-phosphate thymidylyltransferase catalyzes the synthesis of deoxy-thymidine di-phosphate (dTDP)-L-rhamnose, an important component of the cell wall of many microorganisms. The C-terminal LbH domain contains multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.69 E-value=8e-17 Score=86.67 Aligned_cols=56 Identities=16% Similarity=0.222 Sum_probs=21.6 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCC Q ss_conf 68769889709928839979815457543231122222221001135520221111 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSG 57 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~ 57 (110) ++|+.|.+++.|.+++.|++++.|++++.|++++.+++++.++.++.++.++.++. T Consensus 9 ~p~a~I~~~v~Ig~~a~I~~~a~I~~~v~IG~~~~Ig~~~~I~~~~~IG~~~~IG~ 64 (163) T cd05636 9 EEGVTIKGPVWIGEGAIVRSGAYIEGPVIIGKGCEIGPNAYIRGYTVLGDGCVVGN 64 (163) T ss_pred CCCCEECCCEEECCCCEECCCCEEECCEEECCCCEEEECCCCCCCEEECCEEEECC T ss_conf 99999989819999999999999979759989989936412269858613012055 No 30 >cd03358 LbH_WxcM_N_like WcxM-like, Left-handed parallel beta-Helix (LbH) N-terminal domain: This group is composed of Xanthomonas campestris WcxM and proteins with similarity to the WcxM N-terminal domain. WcxM is thought to be bifunctional, catalyzing both the isomerization and transacetylation reactions of keto-hexoses. It contains an N-terminal LbH domain responsible for the transacetylation function and a C-terminal isomerase domain. The LbH domain contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), typical of enzymes with acyltransferase activity. Probab=99.69 E-value=7.1e-17 Score=86.89 Aligned_cols=104 Identities=14% Similarity=0.183 Sum_probs=84.3 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 96876988970992883997981545754323112222222100113552022111122211011222221101111111 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVI 80 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~ 80 (110) |.+|++|.+++.|.+++.|++++.|++++.|++++.+++++.+..++.+......... . ........+.++++++|+ T Consensus 1 IG~~~~Ig~~~~I~~~~~IG~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~~~~~~~~~~~--~-~~~~~~~~v~Ig~~v~IG 77 (119) T cd03358 1 IGDNCIIGTNVFIENDVKIGDNVKIQSNVSIYEGVTIEDDVFIGPNVVFTNDLYPRSK--I-YRKWELKGTTVKRGASIG 77 (119) T ss_pred CCCCCEECCCCEECCCCEECCCEEECCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCC--C-CCCCCCCCCEECCCEEEC T ss_conf 9899899999899999999999699998636789998437797788633044466764--3-467632483987883848 Q ss_pred CCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 124322013974984998888996980 Q gi|254781223|r 81 GFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 81 ~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) .++.|.++++|+++++|++++.+.+|. T Consensus 78 ~~~~I~~gv~IG~~~~IgagSvV~~dv 104 (119) T cd03358 78 ANATILPGVTIGEYALVGAGAVVTKDV 104 (119) T ss_pred CCCEEECCEEECCCCEECCCCEECCCC T ss_conf 874778994989998997799890365 No 31 >PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional Probab=99.69 E-value=6.6e-17 Score=87.02 Aligned_cols=37 Identities=30% Similarity=0.366 Sum_probs=23.9 Q ss_pred CCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 1101111111124322013974984998888996980 Q gi|254781223|r 71 AEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 71 ~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +.|++++++++|+.+-..++||+++.+++++.|.+|+ T Consensus 395 t~igd~~fiGsn~~lvapv~iG~~a~i~aGs~it~dV 431 (456) T PRK09451 395 TIIGDDVFVGSDTQLVAPVTVGKGATIGAGTTVTRDV 431 (456) T ss_pred CEECCCCEECCCCEEECCEEECCCCEECCCCEECCCC T ss_conf 4878982998784483584988997999898788506 No 32 >TIGR03570 NeuD_NnaD sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family. These proteins contain repeats of the bacterial transferase hexapeptide (pfam00132), although often these do not register above the trusted cutoff. Probab=99.68 E-value=5.2e-17 Score=87.46 Aligned_cols=106 Identities=19% Similarity=0.310 Sum_probs=96.9 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 96876988970992883997981545754323112222222100113552022111122211011222221101111111 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVI 80 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~ 80 (110) ||+++.|++++.|.+++.|+++|.|.+++.|+.++.+..++.+..++.++.++.+.+++.+.+++.+..++.++.++.+. T Consensus 90 Ihp~a~i~~~~~ig~G~~I~~~~~i~~~~~IG~~~~i~~~~~I~Hd~~ig~~~~i~~~~~i~G~v~Ig~~~~iG~~~~i~ 169 (201) T TIGR03570 90 IHPSAIVSPSASIGEGTVIMAGAVINPDVRIGDNVIINTGAIVEHDCVIGDYVHIAPGVTLSGGVVIGEGVFIGAGATII 169 (201) T ss_pred ECCCCEECCCCEECCCCEEECCCEECCCCEEEEEEEECCCCCCCCCCEECCCCEECCCCEECCCCEECCCCEECCCCEEC T ss_conf 99980988986787998996286987664780267980423214425477876899985998988797773598988988 Q ss_pred CCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 1243220139749849988889969809 Q gi|254781223|r 81 GFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 81 ~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) .+..|++++.|+.++.+-.+. ..+.+ T Consensus 170 ~~i~Ig~~~~Igags~V~~~v--~~~~~ 195 (201) T TIGR03570 170 QGVTIGAGAIVGAGAVVTKDI--PDGGV 195 (201) T ss_pred CCCEECCCCEECCCCEEECCC--CCCCE T ss_conf 998999999999798890437--99979 No 33 >cd03353 LbH_GlmU_C N-acetyl-glucosamine-1-phosphate uridyltransferase (GlmU), C-terminal left-handed beta-helix (LbH) acetyltransferase domain: GlmU is also known as UDP-N-acetylglucosamine pyrophosphorylase. It is a bifunctional bacterial enzyme that catalyzes two consecutive steps in the formation of UDP-N-acetylglucosamine (UDP-GlcNAc), an important precursor in bacterial cell wall formation. The two enzymatic activities, uridyltransferase and acetyltransferase, are carried out by two independent domains. The C-terminal LbH domain possesses the acetyltransferase activity. It catalyzes the CoA-dependent acetylation of GlcN-1-phosphate to GlcNAc-1-phosphate. The LbH domain contains 10 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X. The acetyltransferase active site is located at the interface between two subunits of the active LbH trimer. Probab=99.66 E-value=2.8e-16 Score=84.42 Aligned_cols=43 Identities=7% Similarity=0.173 Sum_probs=20.4 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEE Q ss_conf 68769889709928839979815457543231122222221001 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRD 45 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~ 45 (110) .+|+.|.|++.|.++++|+++|.|++++.|+ ++.+++++.+.. T Consensus 19 G~~v~I~p~~~I~~~v~IG~~~~Ig~~~~I~-~~~IG~~~~I~~ 61 (193) T cd03353 19 GVDVVIDPGVILEGKTVIGEDCVIGPNCVIK-DSTIGDGVVIKA 61 (193) T ss_pred CCCCEECCCEEEECCCEECCCCEECCCEEEC-CCEECCCCEECC T ss_conf 9999998982992985999998995734577-868888740021 No 34 >cd03360 LbH_AT_putative Putative Acyltransferase (AT), Left-handed parallel beta-Helix (LbH) domain; This group is composed of mostly uncharacterized proteins containing an N-terminal helical subdomain followed by a LbH domain. The alignment contains 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. A few members are identified as NeuD, a sialic acid (Sia) O-acetyltransferase that is required for Sia synthesis and surface polysaccharide sialylation. Probab=99.66 E-value=1.4e-16 Score=85.66 Aligned_cols=106 Identities=20% Similarity=0.306 Sum_probs=97.4 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 96876988970992883997981545754323112222222100113552022111122211011222221101111111 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVI 80 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~ 80 (110) ||+++.+++++.+..++.|++++.|.+++.|+.++.+..++.+..++.++.++++.+.+.+.+++.+..++.++.++.+. T Consensus 87 Ihp~a~i~~~~~ig~G~~I~~~~~i~~~~~IG~~~~i~~~~~i~Hd~~ig~~~~i~~~~~i~g~v~Ig~~~~iG~~~~i~ 166 (197) T cd03360 87 IHPSAVVSPSAVIGEGCVIMAGAVINPDARIGDNVIINTGAVIGHDCVIGDFVHIAPGVVLSGGVTIGEGAFIGAGATII 166 (197) T ss_pred ECCCCEECCCCEECCCCEEECCCEECCCCCCCCEEEECCCHHCCCCCCCCCCCEECCCEEECCCEEEEEECCCCCCCEEC T ss_conf 99995987774799942993175986565304368987700105315013101066723677830891216298998987 Q ss_pred CCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 1243220139749849988889969809 Q gi|254781223|r 81 GFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 81 ~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) .+..|++++.|+.++.+-.+ +..+++ T Consensus 167 ~~i~Ig~~~~igags~v~~~--i~~~~~ 192 (197) T cd03360 167 QGVTIGAGAIIGAGAVVTKD--VPDGSV 192 (197) T ss_pred CCCEECCCCEECCCCEEEEE--CCCCEE T ss_conf 99899999999979989125--699909 No 35 >cd03352 LbH_LpxD UDP-3-O-acyl-glucosamine N-acyltransferase (LpxD): The enzyme catalyzes the transfer of 3-hydroxymyristic acid or 3-hydroxy-arachidic acid, depending on the organism, from the acyl carrier protein (ACP) to UDP-3-O-acyl-glucosamine to produce UDP-2,3-diacyl-GlcNAc. This constitutes the third step in the lipid A biosynthetic pathway in Gram-negative bacteria. LpxD is a homotrimer, with each subunit consisting of a novel combination of an N-terminal uridine-binding domain, a core lipid-binding left-handed parallel beta helix (LbH) domain, and a C-terminal alpha-helical extension. The LbH domain contains 9 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=99.65 E-value=2.7e-16 Score=84.44 Aligned_cols=107 Identities=13% Similarity=0.269 Sum_probs=55.0 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECC-------------------CCCCCCCCC Q ss_conf 9687698897099288399798154575432311222222210011355202-------------------211112221 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGY-------------------AKVSGNASV 61 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~-------------------~~i~~~~~i 61 (110) |.+|++|+|++.|.++++||++|.|++++.|++++.+++++.+..++.++.. ..+++++.+ T Consensus 22 Ig~n~~I~~~~~I~~~v~IG~~~~I~~~~~I~~~~~IG~~~~I~~~~vIg~~~f~~~~~~~~~~~~~~~g~v~Ig~~v~I 101 (205) T cd03352 22 IGDGVVIGPGVVIGDGVVIGDDCVIHPNVTIYEGCIIGDRVIIHSGAVIGSDGFGFAPDGGGWVKIPQLGGVIIGDDVEI 101 (205) T ss_pred ECCCCEECCCCEECCCCEECCCCEEECCCEECCCCEECCEEEECCCEEECCCCCCCCCCCCCCEECCCCCCEEECCCEEE T ss_conf 99999998988999996889883660034472786875447771534862135323147984045142131898488498 Q ss_pred C----------------------------------------CCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCC Q ss_conf 1----------------------------------------011222221101111111124322013974984998888 Q gi|254781223|r 62 G----------------------------------------GNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDT 101 (110) Q Consensus 62 ~----------------------------------------~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~ 101 (110) + ..+.+.+.+.|+++++++.++.+.++++|+++++|++++ T Consensus 102 g~~~~I~~g~~~~T~IG~~~~i~~~v~igh~~~iG~~~~i~~~~~~~g~~~Igd~~~iG~~s~i~~gv~IG~~a~VgagS 181 (205) T cd03352 102 GANTTIDRGALGDTVIGDGTKIDNLVQIAHNVRIGENCLIAAQVGIAGSTTIGDNVIIGGQVGIAGHLTIGDGVVIGAGS 181 (205) T ss_pred CCCCEECCCCCCCCCCCCCCEECCCCCCCCCCCCCCCEEECCCCCCCCCEEECCCCEECCCCEECCCCEECCCCEECCCC T ss_conf 88627636642677478776864545255640415870750320002652497435398964882993999998998688 Q ss_pred EECCCE Q ss_conf 996980 Q gi|254781223|r 102 VVEGDT 107 (110) Q Consensus 102 ~I~~~~ 107 (110) .|.+|. T Consensus 182 vVtkdV 187 (205) T cd03352 182 GVTSIV 187 (205) T ss_pred EECCCC T ss_conf 796486 No 36 >cd03353 LbH_GlmU_C N-acetyl-glucosamine-1-phosphate uridyltransferase (GlmU), C-terminal left-handed beta-helix (LbH) acetyltransferase domain: GlmU is also known as UDP-N-acetylglucosamine pyrophosphorylase. It is a bifunctional bacterial enzyme that catalyzes two consecutive steps in the formation of UDP-N-acetylglucosamine (UDP-GlcNAc), an important precursor in bacterial cell wall formation. The two enzymatic activities, uridyltransferase and acetyltransferase, are carried out by two independent domains. The C-terminal LbH domain possesses the acetyltransferase activity. It catalyzes the CoA-dependent acetylation of GlcN-1-phosphate to GlcNAc-1-phosphate. The LbH domain contains 10 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X. The acetyltransferase active site is located at the interface between two subunits of the active LbH trimer. Probab=99.65 E-value=3.2e-16 Score=84.16 Aligned_cols=53 Identities=6% Similarity=0.133 Sum_probs=24.8 Q ss_pred CEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCC Q ss_conf 769889709928839979815457543231122222221001135520221111 Q gi|254781223|r 4 NAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSG 57 (110) Q Consensus 4 n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~ 57 (110) +++|.|+++|.+++.|++++.|.+++.|++++.++.++.+. +..++.++.+.+ T Consensus 9 s~~I~~~v~IG~~v~I~p~~~I~~~v~IG~~~~Ig~~~~I~-~~~IG~~~~I~~ 61 (193) T cd03353 9 TTYIDGDVEIGVDVVIDPGVILEGKTVIGEDCVIGPNCVIK-DSTIGDGVVIKA 61 (193) T ss_pred EEEECCCCEECCCCEECCCEEEECCCEECCCCEECCCEEEC-CCEECCCCEECC T ss_conf 06999982999999998982992985999998995734577-868888740021 No 37 >COG1043 LpxA Acyl-[acyl carrier protein] Probab=99.64 E-value=6.4e-16 Score=82.90 Aligned_cols=41 Identities=7% Similarity=0.426 Sum_probs=29.1 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 22110111111112432201397498499888899698097 Q gi|254781223|r 69 DTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 69 ~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) +.+.|+++.+++.++++..+|.|++++.+.+++.+.+.+.| T Consensus 105 g~T~IGdnnl~May~HVAHDC~iGn~~ilaNnatLAGHV~i 145 (260) T COG1043 105 GVTRIGDNNLIMAYAHVAHDCVIGNNCILANNATLAGHVEV 145 (260) T ss_pred EEEEECCCCEEEEEEEEECCCEECCCEEEECCCEEECEEEE T ss_conf 15997788789873103134445671799668668540788 No 38 >cd03350 LbH_THP_succinylT 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (also called THP succinyltransferase): THDP N-succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is homotrimeric and each subunit contains an N-terminal region with alpha helices and hairpin loops, as well as a C-terminal region with a left-handed parallel alpha-helix (LbH) structural motif encoded by hexapeptide repeat motifs. Probab=99.63 E-value=3.7e-16 Score=83.89 Aligned_cols=104 Identities=22% Similarity=0.234 Sum_probs=94.0 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCC--------CCCCCCCC Q ss_conf 9687698897099288399798154575432311222222210011355202211112221101--------12222211 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGN--------AIVRDTAE 72 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~--------~~~~~~~~ 72 (110) ||++++|.+.|.|.+++.|+++++|..++.|+.++.+..++.+..++.++.++++++++.+.+. ..+.+++. T Consensus 4 I~P~A~I~~ga~IG~g~vI~~~a~I~~g~~IG~~~~I~~~~~Ig~~~~IG~~v~I~~g~~i~g~~~p~~~~~v~IeD~~~ 83 (139) T cd03350 4 VPPGAIIRDGAFIGPGAVLMMPSYVNIGAYVDEGTMVDSWATVGSCAQIGKNVHLSAGAVIGGVLEPLQATPVIIEDDVF 83 (139) T ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCEECCCCCCCCCCCCEECCCEE T ss_conf 49982988999999998998899984398989872786666026866999932898886784442576468868879959 Q ss_pred CCCCCCCCCCCCCCCCEEECCCEEECCCCEEC Q ss_conf 01111111124322013974984998888996 Q gi|254781223|r 73 VGGDAFVIGFTVISGNARVRGNAVVGGDTVVE 104 (110) Q Consensus 73 i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~ 104 (110) ++.++.+...+.|++++.|+.++.+..+.-.+ T Consensus 84 IGa~a~V~~Gv~IG~~avIGAGsVVtkdvp~y 115 (139) T cd03350 84 IGANCEVVEGVIVGKGAVLAAGVVLTQSTPIY 115 (139) T ss_pred EEECCEEECCCEECCCCEECCCCEEECCCCCE T ss_conf 93099894687999998999798891898846 No 39 >PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional Probab=99.59 E-value=2.8e-15 Score=80.22 Aligned_cols=75 Identities=16% Similarity=0.199 Sum_probs=31.1 Q ss_pred CCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 87698897099288399798154575432311222222210011355202211112221101122222110111111 Q gi|254781223|r 3 DNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFV 79 (110) Q Consensus 3 ~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i 79 (110) .++.|.+++.+.+++.||++|.|++++.|. ++.+++++.+..++.+.+ +.++.++.+++.+.+..++.+++++.| T Consensus 270 ~dv~I~~nvvi~G~v~IG~~v~Ig~g~~I~-ns~Ig~~~~I~~~S~Ie~-s~Ig~~~~IGPfA~lRp~t~i~~~~~i 344 (456) T PRK09451 270 RDVEIDTNVIIEGNVTLGHRVKIGAGCVLK-NCVIGDDCEISPYSVVED-ANLGAACTIGPFARLRPGAELLEGAHV 344 (456) T ss_pred CCEEECCCEEEECCEEECCCEEECCCEEEE-CCEECCCCEEECCEEECC-CEECCCCEECCCCCCCCCCEECCCCEE T ss_conf 750775770884424757975982407985-769846878942256506-634367168886434887623678888 No 40 >cd03359 LbH_Dynactin_5 Dynactin 5 (or subunit p25); Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p25 is part of the pointed-end subcomplex in dynactin that also includes p26, p27, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.59 E-value=4.1e-15 Score=79.52 Aligned_cols=107 Identities=14% Similarity=0.183 Sum_probs=74.3 Q ss_pred CCCEEECCCEEECCC---CEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 687698897099288---39979815457543231122222221001135520221111222110112222211011111 Q gi|254781223|r 2 YDNAVVRDCATVIDD---ARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAF 78 (110) Q Consensus 2 ~~n~~I~~~a~I~~~---~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~ 78 (110) .+++.|.++|.+.++ ..||++|.|.+++.|++...............+++++.++.++.+. .+.+..++.|+.++. T Consensus 25 G~~ssI~~~avirGD~~~I~IG~~~~I~d~~~I~~~~~~~~~~~~~~p~~IGd~v~Ig~~~~i~-~~~Ig~~v~IG~~~~ 103 (161) T cd03359 25 NGKTIIQSDVIIRGDLATVSIGRYCILSEGCVIRPPFKKFSKGVAFFPLHIGDYVFIGENCVVN-AAQIGSYVHIGKNCV 103 (161) T ss_pred CCCCEECCCCEEECCCCCEEECCCCEECCCCEEECCCCCCCCCCCCCCCEECCCEEECCCEEEE-CCEECCCEEECCCCE T ss_conf 9987899996896788654999985888998992464435778888772848987989982884-889667715579819 Q ss_pred CCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 1112432201397498499888899698097 Q gi|254781223|r 79 VIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 79 i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) |++++.|++++.|++++++..++.|...+++ T Consensus 104 I~~g~~Ig~~~~I~~gsvV~~~~~ip~~~~~ 134 (161) T cd03359 104 IGRRCIIKDCVKILDGTVVPPDTVIPPYSVV 134 (161) T ss_pred ECCCCEECCCCEECCCCEECCCCEECCCCEE T ss_conf 9499799999899889898899898998199 No 41 >cd04645 LbH_gamma_CA_like Gamma carbonic anhydrase-like: This family is composed of gamma carbonic anhydrase (CA), Ferripyochelin Binding Protein (FBP), E. coli paaY protein, and similar proteins. CAs are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionary distinct groups - alpha, beta and gamma carbonic anhydrases - which show no significant sequence identity or structural similarity. Gamma CAs are trimeric enzymes with left-handed parallel beta helix (LbH) structural domain. Probab=99.58 E-value=3.8e-15 Score=79.69 Aligned_cols=107 Identities=18% Similarity=0.210 Sum_probs=53.6 Q ss_pred CCCCEEECCCEEECCCCEECCCCEE---CCCCCCCCCCCCCCCCCE----EEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9687698897099288399798154---575432311222222210----011355202211112221101122222110 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASV---SRFAQVKSNAEVSDNTYV----RDNAKVGGYAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I---~~~~~i~~~~~i~~~~~i----~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i 73 (110) |+++++|.+++.|.+++.|++++.| .....|+.++.+.+++.+ ...+.++.++.+..++.+.+ +.+.+++.| T Consensus 8 I~p~A~I~G~V~IG~~~~I~~~~~I~~d~~~i~IG~~~~I~~~~~Ih~~~~~~~~IG~~v~Ig~~~~i~g-~~Ig~~~~I 86 (153) T cd04645 8 IAPNATVIGDVTLGEGSSVWFGAVLRGDVNPIRIGERTNIQDGSVLHVDPGYPTIIGDNVTVGHGAVLHG-CTIGDNCLI 86 (153) T ss_pred ECCCCEEECCEEECCCCEECCCCEEECCCCCEEECCCCEECCCCCCCCCCCCCCEECCCCEECCCCEEEC-EEEEEEEEE T ss_conf 9999999772699899999658384246663599999898968543525999979899989999959924-187300477 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 11111111243220139749849988889969809 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) +.++.+..++.|++++.|+.++.+..+..+...++ T Consensus 87 G~~a~I~~gv~IG~~~vIgagsvV~~~~~vp~~~v 121 (153) T cd04645 87 GMGAIILDGAVIGKGSIVAAGSLVPPGKVIPPGSL 121 (153) T ss_pred CCCCEECCCCEECCCCEECCCCEECCCCEECCCCE T ss_conf 05439856999889979944989748979699859 No 42 >TIGR02287 PaaY phenylacetic acid degradation protein PaaY; InterPro: IPR011974 Members of this family are located next to other genes organised into apparent operons for phenylacetic acid degradation . PaaY is located near the end of these gene clusters and often next to PaaX, a transcriptional regulator.. Probab=99.58 E-value=1.3e-15 Score=81.57 Aligned_cols=102 Identities=24% Similarity=0.287 Sum_probs=72.3 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCCCC---CCCCCCCCEEEEEEEE----CCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 9687698897099288399798154575432311---2222222100113552----02211112221101122222110 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVKSN---AEVSDNTYVRDNAKVG----GYAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~---~~i~~~~~i~~~~~i~----~~~~i~~~~~i~~~~~~~~~~~i 73 (110) ||+.+|++|+|.+-+|+.||..|+|+|+|.+|+| ..+.+++.+-+++.-+ .++.+..|-+++..+.++ .|.| T Consensus 11 Vhp~ayVHP~AVLiGDVi~G~rcYvgPlASLRGDFGrIvl~eGAN~QD~CVMHGFPg~DTvVeenGHvGHgAiLH-gC~v 89 (193) T TIGR02287 11 VHPEAYVHPDAVLIGDVIVGKRCYVGPLASLRGDFGRIVLKEGANVQDTCVMHGFPGQDTVVEENGHVGHGAILH-GCRV 89 (193) T ss_pred CCCCCCCCCCEEEEEEEEECCCCEECCCCCCCCCCCCEEEECCCCCCCCCEECCCCCCCEEEECCCCCCCCEEEC-CCEE T ss_conf 698773468637862178879524676631357757257534886146424367299886760688233212351-5463 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEE Q ss_conf 111111112432201397498499888899 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVV 103 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I 103 (110) ++|+.||=|+.++|.+.|++++.+++-+.| T Consensus 90 grnaLvGMNAVVMDgAvige~sIVaA~aFV 119 (193) T TIGR02287 90 GRNALVGMNAVVMDGAVIGERSIVAASAFV 119 (193) T ss_pred CCCCEECCCEEEECCCEECCCCCEEHHHHH T ss_conf 252311165066166466774120020334 No 43 >cd04745 LbH_paaY_like paaY-like: This group is composed by uncharacterized proteins with similarity to the protein product of the E. coli paaY gene, which is part of the paa gene cluster responsible for phenylacetic acid degradation. Proteins in this group are expected to adopt the left-handed parallel beta-helix (LbH) structure. They contain imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Similarity to gamma carbonic anhydrase and Ferripyochelin Binding Protein (FBP) may suggest metal binding capacity. Probab=99.54 E-value=8.9e-15 Score=78.12 Aligned_cols=107 Identities=21% Similarity=0.236 Sum_probs=48.3 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECC---CCCCCCCCCCCCCCCE----EEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 968769889709928839979815457---5432311222222210----011355202211112221101122222110 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSR---FAQVKSNAEVSDNTYV----RDNAKVGGYAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~---~~~i~~~~~i~~~~~i----~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i 73 (110) |++++.|.+++.|.+++.|++++.|.. ...|++++.+.+++.+ ..+..++.++.+..++.+.+ +.+..++.+ T Consensus 9 I~p~A~I~G~V~IG~~~~I~~~~~Ir~d~g~i~IG~~~~I~~n~~Ih~~~~~~~~Ig~~~~ig~~~~~~g-~~Ig~~~~I 87 (155) T cd04745 9 VHPTAVLIGDVIIGKNCYIGPHASLRGDFGRIVIRDGANVQDNCVIHGFPGQDTVLEENGHIGHGAILHG-CTIGRNALV 87 (155) T ss_pred ECCCCEEEEEEEECCCCEECCCEEEECCCCCEEECCCCEECCCCEECCCCCCCCEECCCCEECCCCEECC-CEEEEEEEE T ss_conf 9999999844799899799846079638777899999797878440135999869899978998748715-177432497 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 11111111243220139749849988889969809 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) +.++.|..++.|++++.|+.++++-.+..|..+++ T Consensus 88 G~~a~I~~G~~IG~~siIgagSvV~~~~~VP~~~l 122 (155) T cd04745 88 GMNAVVMDGAVIGEESIVGAMAFVKAGTVIPPRSL 122 (155) T ss_pred CCCCEEECCCEECCCCEEEECCEECCCCCCCCCEE T ss_conf 68759959779899979941878769968379989 No 44 >TIGR01173 glmU UDP-N-acetylglucosamine pyrophosphorylase; InterPro: IPR005882 N-Acetylglucosamine-1-PO(4) uridyltransferase (GlmU, 2.7.7.23 from EC) is a trimeric bifunctional enzyme that catalyzes the last two sequential reactions in the de novo biosynthetic pathway for UDP-GlcNAc. The X-ray crystal structure of Escherichia coli GlmU in complex with UDP-GlcNAc and CoA has been determined to 2.1 A resolution and reveals a two-domain architecture that is responsible for these two reactions . The C-terminal domain is responsible for the CoA-dependent acetylation of Glc-1-PO(4) to GlcNAc-1-PO(4) and displays the longest left-handed parallel beta-helix observed to date. The acetyltransferase active site defined by the binding site for CoA makes use of residues from all three subunits and is positioned beneath an open cavity large enough to accommodate the Glc-1-PO(4) acetyl acceptor. The N-terminal domain catalyzes uridyl transfer from UTP to GlcNAc-1-PO(4) to form the final products UDP-GlcNAc and pyrophosphate. This domain is composed of a central seven-stranded beta-sheet surrounded by six alpha-helices in a Rossmann fold-like topology. ; GO: 0003977 UDP-N-acetylglucosamine diphosphorylase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=99.54 E-value=1.2e-14 Score=77.56 Aligned_cols=77 Identities=10% Similarity=0.183 Sum_probs=40.8 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 6876988970992883997981545754323112222222100113552022111122211011222221101111111 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVI 80 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~ 80 (110) ..+|.|.+++.+.++++||++|.|+|++.| .|+.|++++.+...+.+. .+.+..++.+++.+.+.+.+.+..+++|| T Consensus 275 G~Dv~I~~~v~leG~v~iG~~v~IGp~~~I-~ns~I~~~~~I~~~s~~e-~~~ig~~~~vGPFArLRP~~~L~~~~hiG 351 (461) T TIGR01173 275 GQDVVIDPNVILEGKVQIGDDVVIGPGCVI-KNSVIGSNAVIKPYSVLE-GSEIGEGCDVGPFARLRPGSVLGAGVHIG 351 (461) T ss_pred CCEEEECCCCEECCEEEECCCEEECCCCEE-EEEEECCCCEEEEEEECC-CCEECCCCEECCCCCCCCCCHHHCCCEEE T ss_conf 771697558388340798787078898589-972886885588888414-67861564027730018643211778160 No 45 >TIGR03308 phn_thr-fam phosphonate metabolim protein, transferase hexapeptide repeat family. This family of proteins contains copies of the Bacterial transferase hexapeptide repeat family (pfam00132) and is only found in operons encoding the phosphonate C-P lyase system (GenProp0232). Many C-P lyase operons, however, lack a homolog of this protein. Probab=99.54 E-value=1.5e-14 Score=77.23 Aligned_cols=38 Identities=24% Similarity=0.325 Sum_probs=32.9 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 21101111111124322013974984998888996980 Q gi|254781223|r 70 TAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 70 ~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) .+.|+++++|+.++.|..+++||+++.|+.++.|.+|. T Consensus 108 ~~~Ig~dvwiG~~~~i~~gv~IG~gavigagsvVtkdv 145 (204) T TIGR03308 108 RVTIGHDVWIGHGAVILPGVTIGNGAVIAAGAVVTKDV 145 (204) T ss_pred CEEECCCCEECCCCEECCCCEECCCCEECCCCEECCCC T ss_conf 70989976887998991997999998997799899517 No 46 >COG1043 LpxA Acyl-[acyl carrier protein] Probab=99.50 E-value=2.7e-14 Score=76.13 Aligned_cols=38 Identities=8% Similarity=0.158 Sum_probs=13.5 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCC Q ss_conf 68769889709928839979815457543231122222 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSD 39 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~ 39 (110) |+.+.|.|.|++.++++||+-|.|+++++|++++.+.+ T Consensus 7 HPTAiIe~gA~ig~~V~IGpf~iIg~~V~ig~~t~l~s 44 (260) T COG1043 7 HPTAIIEPGAEIGEDVKIGPFCIIGPNVEIGDGTVLKS 44 (260) T ss_pred CCCEEECCCCCCCCCCEECCEEEECCCCEECCCCEECC T ss_conf 76215278788289988883379889719889958834 No 47 >cd04650 LbH_FBP Ferripyochelin Binding Protein (FBP): FBP is an outer membrane protein which plays a role in iron acquisition. It binds iron when it is complexed with pyochelin. It adopts the left-handed parallel beta-helix (LbH) structure, and contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Acyltransferase activity has not been observed in this group. Probab=99.50 E-value=5.8e-14 Score=74.73 Aligned_cols=107 Identities=19% Similarity=0.207 Sum_probs=53.9 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECC---CCCCCCCCCCCCCCCEEEE----EEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 968769889709928839979815457---5432311222222210011----355202211112221101122222110 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSR---FAQVKSNAEVSDNTYVRDN----AKVGGYAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~---~~~i~~~~~i~~~~~i~~~----~~i~~~~~i~~~~~i~~~~~~~~~~~i 73 (110) |+++++|..++.|.+++.|+.+|.|.. ...|+.++.+.+++.+..+ ..++.++.+..++.+.+ +.+.+++.| T Consensus 9 I~p~a~i~G~v~IG~~~~I~~~~~Ir~d~~~i~IG~~~~I~~n~~I~~~~~~~~~ig~~v~ig~~~~i~g-~~Ig~~v~I 87 (154) T cd04650 9 VHPTSYVIGDVVIGELTSVWHYAVIRGDNDSIYIGKYSNVQENVSIHTDHGYPTEIGDYVTIGHNAVVHG-AKVGNYVIV 87 (154) T ss_pred ECCCCEEECEEEECCCCEECCCCEEECCCCCEEECCCEEECCCCEEEECCCCCCEECCCCEECCCEEEEC-CEECCEEEE T ss_conf 9999999760599999999875389558876599993398999889846999868799768899528823-022013598 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 11111111243220139749849988889969809 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) +.++.|..++.|++++.|+.++++-.+..+..+++ T Consensus 88 G~~a~I~~Gv~IG~~svIgaGsVV~~~~~Vp~~~l 122 (154) T cd04650 88 GMGAILLNGAKIGDHVIIGAGAVVTPGKEIPDYSL 122 (154) T ss_pred CCCEEEECCCEECCCCEECCCCEECCCCEECCCCE T ss_conf 87229962879899979988988879979199849 No 48 >TIGR03532 DapD_Ac 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-acetyltransferase. Alternate name: tetrahydrodipicolinate N-acetyltransferase. Note that IUBMB lists this alternate name as the accepted name. Unfortunately, the related succinyl transferase acting on the same substrate (EC:2.3.1.117, TIGR00695) uses the opposite standard. We have decided to give these two enzymes names which more clearly indicated that they act on the same substrate. Probab=99.48 E-value=6.8e-14 Score=74.46 Aligned_cols=99 Identities=21% Similarity=0.304 Sum_probs=56.3 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCC--------CCCCCCCCCC Q ss_conf 68769889709928839979815457543231122222221001135520221111222110--------1122222110 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGG--------NAIVRDTAEV 73 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~--------~~~~~~~~~i 73 (110) .+-|.|.+.+.|.+++.|..+++|..++.|++++-+..++.+...++++.++++++.+.+.+ .+.+.+++.| T Consensus 90 epgaiir~~v~IG~~aVImmga~INiGA~IGegtmId~~Avvg~~a~IGk~~HisaGa~laGVleP~~a~PViIeDnV~I 169 (231) T TIGR03532 90 EPGAIIRDQVIIGDNAVIMMGAVINIGAEIGEGTMIDMNAVLGGRATVGKNVHIGAGAVLAGVIEPPSAKPVVIEDNVLI 169 (231) T ss_pred CCCCEEECCEEECCCEEEECCCEEECCCEECCCCEECCCCEECCCCEECCCCEECCCCEEECCCCCCCCCCEEECCCEEE T ss_conf 78858842529879839913888704859878945805856445468879705889864300225766799798788598 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCC Q ss_conf 111111112432201397498499888 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGD 100 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~ 100 (110) +.++.|..++.|++++.|+.++.+..+ T Consensus 170 GAnAvIl~GV~IG~gsVIgAGsVVtkd 196 (231) T TIGR03532 170 GANAVILEGVRVGKGAVVAAGAIVTED 196 (231) T ss_pred CCCCEECCCCEECCCCEECCCCEECCC T ss_conf 799799199899999999849889565 No 49 >COG1207 GlmU N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane] Probab=99.47 E-value=1.4e-13 Score=73.20 Aligned_cols=91 Identities=18% Similarity=0.339 Sum_probs=49.8 Q ss_pred CEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEE----------------CCCCCCCCCCCCCCCC-------CCCCCCC Q ss_conf 3997981545754323112222222100113552----------------0221111222110112-------2222110 Q gi|254781223|r 17 ARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVG----------------GYAKVSGNASVGGNAI-------VRDTAEV 73 (110) Q Consensus 17 ~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~----------------~~~~i~~~~~i~~~~~-------~~~~~~i 73 (110) +.|+++|.|+|++.+|+++.++.++.+++.+.+. +++.++.++.++-.+. -...+.| T Consensus 321 s~vg~~~~VGPfA~LRPg~~L~~~~hIGNFVEvK~a~ig~gsKa~HLtYlGDA~iG~~~NiGAGtItcNYDG~nK~~T~I 400 (460) T COG1207 321 STVGEGATVGPFARLRPGAVLGADVHIGNFVEVKKATIGKGSKAGHLTYLGDAEIGENVNIGAGTITCNYDGKNKFKTII 400 (460) T ss_pred CEECCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCEEEECCCEECCCCEECCCEEEECCCCCCCCEEEE T ss_conf 58547963378310179685267976742599843560688633532342133206874214425998478863320064 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 1111111124322013974984998888996980 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +++++|++|+.+--.++||+++.|++++.|..|+ T Consensus 401 Gd~vFiGSns~LVAPV~IGd~a~iaAGStIT~DV 434 (460) T COG1207 401 GDNVFIGSNSQLVAPVTIGDGATIAAGSTITKDV 434 (460) T ss_pred CCCCEECCCCCEEEEEEECCCCEECCCCEECCCC T ss_conf 6885776687188648966984881463683657 No 50 >cd04646 LbH_Dynactin_6 Dynactin 6 (or subunit p27): Dynactin is a major component of the activator complex that stimulates dynein-mediated vesicle transport. Dynactin is a heterocomplex of at least eight subunits, including a 150,000-MW protein called Glued, the actin-capping protein Arp1, and dynamatin. In vitro binding experiments show that dynactin enhances dynein-dependent motility, possibly through interaction with microtubules and vesicles. Subunit p27 is part of the pointed-end subcomplex in dynactin that also includes p25, p26, and Arp11. This subcomplex interacts with membranous cargoes. p25 and p27 contain the imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), indicating a left-handed parallel beta helix (LbH) structural domain. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.46 E-value=1.2e-13 Score=73.48 Aligned_cols=104 Identities=9% Similarity=0.116 Sum_probs=49.4 Q ss_pred CCEEECCCEEECCCC---------EECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 876988970992883---------99798154575432311222222210011355202211112221101122222110 Q gi|254781223|r 3 DNAVVRDCATVIDDA---------RVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEV 73 (110) Q Consensus 3 ~n~~I~~~a~I~~~~---------~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i 73 (110) +++.|++++.|++++ .||+++.|.+++.|..+.. ........+.+++++.+...+.+. .+.+.+++.+ T Consensus 16 GdV~IG~~~sI~p~avIrgd~g~I~IG~~~nIqd~~~I~~~~~--~~~~~g~~~~IG~~~~i~~g~~ih-~~~IG~~~lI 92 (164) T cd04646 16 GDVTIGPGTVVHPRATIIAEAGPIIIGENNIIEEQVTIVNKKP--KDPAEPKPMIIGSNNVFEVGCKCE-ALKIGNNNVF 92 (164) T ss_pred EEEEECCCCEECCCCEEEECCCCEEECCCCEECCCEEEECCCC--CCCCCCCCEEECCCCEECCCEEEE-CCEECCCCEE T ss_conf 2179999998995759981437889899966669859941787--777899887988998886781995-5597678889 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 111111112432201397498499888899698097 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) +-++.+++++.|+++|.|+.++.+..+..+..++++ T Consensus 93 g~~a~V~~ga~Ig~~ciIgags~V~~~~~ip~~slv 128 (164) T cd04646 93 ESKSFVGKNVIITDGCIIGAGCKLPSSEILPENTVI 128 (164) T ss_pred CCCCEECCCCEECCCCEECCCCCCCCCCEECCCEEE T ss_conf 689899999899999899989892899792998599 No 51 >PRK10502 putative colanic acid biosynthesis acetyltransferase WcaF; Provisional Probab=99.45 E-value=1.7e-13 Score=72.75 Aligned_cols=104 Identities=9% Similarity=0.087 Sum_probs=82.0 Q ss_pred CCCCEEECCCEEEC--CCCEECCCCEECCCCCCC--CCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 96876988970992--883997981545754323--11222222210011355202211112221101122222110111 Q gi|254781223|r 1 MYDNAVVRDCATVI--DDARVSGNASVSRFAQVK--SNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGD 76 (110) Q Consensus 1 i~~n~~I~~~a~I~--~~~~I~~n~~I~~~~~i~--~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~ 76 (110) |.+++.|+|.+.|. .+..||+++.|++++.|. +...|++++.+..++.+....+-...-. .........|+++ T Consensus 55 IG~~v~I~p~v~I~~PwnL~IGd~~~Ig~~v~i~~~~~I~IG~~v~Isq~v~l~t~sHd~~~~~---~~l~~~pI~Igd~ 131 (179) T PRK10502 55 IGKGVVIRPSVRITYPWKLTIGDYAWIGDDVVLYTLGEITIGAHCVISQKSYLCTGSHDYSSPH---FDLNAAPIVIGEG 131 (179) T ss_pred ECCCCEECCCCEEEECCCEEECCCCEECCCCEEEECCCCEECCCEEECCCCEEECCCCCCCCCC---CCCCCCCCEECCC T ss_conf 7999889996579504307988965788981898489617988412779718963889976766---6732488585899 Q ss_pred CCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 1111124322013974984998888996980 Q gi|254781223|r 77 AFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 77 ~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) ++|+.++.|.++++|++++.|+.++.+.+|. T Consensus 132 ~wIga~a~I~pGv~Ig~gavigA~SvV~kdv 162 (179) T PRK10502 132 CWLATDVFVAPGVTIGDGAVVGARSSVFKSL 162 (179) T ss_pred EEECCCCEEECCCEECCCCEECCCCEEEEEC T ss_conf 6999996990889999997995499892177 No 52 >cd00710 LbH_gamma_CA Gamma carbonic anhydrases (CA): Carbonic anhydrases are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism, involving the nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three distinct groups of carbonic anhydrases - alpha, beta and gamma - which show no significant sequence identity or structural similarity. Gamma CAs are homotrimeric enzymes, with each subunit containing a left-handed parallel beta helix (LbH) structural domain. Probab=99.43 E-value=2.2e-13 Score=72.30 Aligned_cols=84 Identities=18% Similarity=0.254 Sum_probs=30.9 Q ss_pred EECCCCEECCCCCCCC----CCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECC Q ss_conf 9979815457543231----122222221001135520221111222110112222211011111111243220139749 Q gi|254781223|r 18 RVSGNASVSRFAQVKS----NAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRG 93 (110) Q Consensus 18 ~I~~n~~I~~~~~i~~----~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~ 93 (110) .||+++.|.+++.|.. .+.+++++.+..++.+++.+++++++.++.++.+.+ +.|++++.|+.++.+ .++.+.+ T Consensus 44 ~IG~~~nIqD~~viH~~~~~~~~IG~~vtIgh~a~ihG~c~Igd~~~IG~gaiv~~-a~Ig~~~vIgags~V-~g~~i~~ 121 (167) T cd00710 44 IIGANVNIQDGVVIHALEGYSVWIGKNVSIAHGAIVHGPAYIGDNCFIGFRSVVFN-AKVGDNCVIGHNAVV-DGVEIPP 121 (167) T ss_pred EECCCCCCCCCEEEECCCCCCEEECCCCEECCCCEECCEEEECCCCEECCCCEEEC-CEECCCCEECCCCEE-ECCCCCC T ss_conf 98887434777078225766749779828989848812049839988988989982-597699899889888-4778399 Q ss_pred CEEECCCCEE Q ss_conf 8499888899 Q gi|254781223|r 94 NAVVGGDTVV 103 (110) Q Consensus 94 ~~~I~~~~~I 103 (110) +..+..++.+ T Consensus 122 g~~v~~G~~v 131 (167) T cd00710 122 GRYVPAGAVI 131 (167) T ss_pred CCEECCCCEE T ss_conf 9895588288 No 53 >COG2171 DapD Tetrahydrodipicolinate N-succinyltransferase [Amino acid transport and metabolism] Probab=99.43 E-value=4.2e-13 Score=71.15 Aligned_cols=106 Identities=22% Similarity=0.256 Sum_probs=87.0 Q ss_pred CEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCC--------CCCCCC Q ss_conf 769889709928839979815457543231122222221001135520221111222110112222--------211011 Q gi|254781223|r 4 NAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRD--------TAEVGG 75 (110) Q Consensus 4 n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~--------~~~i~~ 75 (110) -+.|.|.+.+...+.|++|+.+++.+.+.-++.++..+.+..+..++..++++.++++++.+.+.+ .+.|++ T Consensus 108 g~RI~p~a~VR~ga~i~~gtvvM~~sfVNigA~~~~gtMVd~~as~G~~a~VGkn~higgGa~I~GVLep~~a~Pv~Igd 187 (271) T COG2171 108 GVRIVPGAIVRLGAYIAKGTVVMPESFVNIGAGTGEGTMVDGRASVGSCAQVGKNSHIGGGASIGGVLEPLQANPVIIGD 187 (271) T ss_pred CEEECCCCEEEECCEECCCCEECCCCEEEECCCCCCCEEEEEEEEEECCEEECCCCCCCCCCEEEEEECCCCCCCEEECC T ss_conf 63636764776052877993875556687776027525896156551317979984527863676773488878869877 Q ss_pred CCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 1111112432201397498499888899698097 Q gi|254781223|r 76 DAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 76 ~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) +|+|+.++.+...+.+++++.|.++..|..++.+ T Consensus 188 ncliGAns~~veGV~vGdg~VV~aGv~I~~~tki 221 (271) T COG2171 188 NCLIGANSEVVEGVIVGDGCVVAAGVFITQDTKI 221 (271) T ss_pred CCEECCCCCEEEEEEECCCCEEECCEEEECCCCE T ss_conf 6186245334861696789688545089379514 No 54 >cd04647 LbH_MAT_like Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins. MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively while GAT specifically acetylates galactopyranosides. XAT catalyzes the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Members of this family contain a a left-handed parallel beta-helix (LbH) domain with at least 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). They are trimeric in their active form. Probab=99.42 E-value=2.7e-13 Score=71.93 Aligned_cols=91 Identities=16% Similarity=0.238 Sum_probs=60.8 Q ss_pred CEECCCCEECCCCCCCC--CCCCCCCCCEEEEEEEECC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECC Q ss_conf 39979815457543231--1222222210011355202-21111222110112222211011111111243220139749 Q gi|254781223|r 17 ARVSGNASVSRFAQVKS--NAEVSDNTYVRDNAKVGGY-AKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRG 93 (110) Q Consensus 17 ~~I~~n~~I~~~~~i~~--~~~i~~~~~i~~~~~i~~~-~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~ 93 (110) +.||+++.|++++.|.. ...+++++.+..++.+..+ ..+.............+.+.|+++++|+.++.|.++++|++ T Consensus 2 v~IG~~~~Ig~~~~I~~~~~i~IG~~~~I~~~~~i~~~~~~~~~~~~~~~~~~~~~~v~Ig~~v~IG~~~~I~~gv~Ig~ 81 (109) T cd04647 2 ISIGDNVYIGPGCVISAGGGITIGDNVLIGPNVTIYDHNHDIDDPERPIEQGVTSAPIVIGDDVWIGANVVILPGVTIGD 81 (109) T ss_pred CEECCCCEECCCCEEECCCCEEECCCCEECCEEEEEEEEEECCCCCEECCCCCEECCEEEECCCEECCCEEEEECCEECC T ss_conf 19999989999999968988799998499350898330013288752526763718979942357889839928513089 Q ss_pred CEEECCCCEECCCE Q ss_conf 84998888996980 Q gi|254781223|r 94 NAVVGGDTVVEGDT 107 (110) Q Consensus 94 ~~~I~~~~~I~~~~ 107 (110) ++.|++++.+.++. T Consensus 82 ~~iIga~SvV~k~i 95 (109) T cd04647 82 GAVVGAGSVVTKDV 95 (109) T ss_pred CCEECCCCEECCCC T ss_conf 86993798891483 No 55 >cd03358 LbH_WxcM_N_like WcxM-like, Left-handed parallel beta-Helix (LbH) N-terminal domain: This group is composed of Xanthomonas campestris WcxM and proteins with similarity to the WcxM N-terminal domain. WcxM is thought to be bifunctional, catalyzing both the isomerization and transacetylation reactions of keto-hexoses. It contains an N-terminal LbH domain responsible for the transacetylation function and a C-terminal isomerase domain. The LbH domain contains imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), typical of enzymes with acyltransferase activity. Probab=99.36 E-value=2.9e-12 Score=67.63 Aligned_cols=101 Identities=12% Similarity=0.144 Sum_probs=53.8 Q ss_pred ECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 88970992883997981545754323112222222100113552022111122211011222221101111111124322 Q gi|254781223|r 7 VRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVIS 86 (110) Q Consensus 7 I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~ 86 (110) |++++.|.+++.|.+++.|++++.|..++.++.++.+.+++.++.++.+........ ... .........|++++.|+ T Consensus 1 IG~~~~Ig~~~~I~~~~~IG~~~~I~~~~~I~~~~~Ig~~~~Ig~~~~~~~~~~~~~--~~~-~~~~~~~v~Ig~~v~IG 77 (119) T cd03358 1 IGDNCIIGTNVFIENDVKIGDNVKIQSNVSIYEGVTIEDDVFIGPNVVFTNDLYPRS--KIY-RKWELKGTTVKRGASIG 77 (119) T ss_pred CCCCCEECCCCEECCCCEECCCEEECCCCCCCCCCEEEEEEEECCCCCCCCCCCCCC--CCC-CCCCCCCCEECCCEEEC T ss_conf 989989999989999999999969999863678999843779778863304446676--434-67632483987883848 Q ss_pred CCEEECCCEEECCCCEECCCEEEC Q ss_conf 013974984998888996980979 Q gi|254781223|r 87 GNARVRGNAVVGGDTVVEGDTVLE 110 (110) Q Consensus 87 ~~~~I~~~~~I~~~~~I~~~~~i~ 110 (110) .++.|.+++.|++++.|...++|. T Consensus 78 ~~~~I~~gv~IG~~~~IgagSvV~ 101 (119) T cd03358 78 ANATILPGVTIGEYALVGAGAVVT 101 (119) T ss_pred CCCEEECCEEECCCCEECCCCEEC T ss_conf 874778994989998997799890 No 56 >cd00208 LbetaH Left-handed parallel beta-Helix (LbetaH or LbH) domain: The alignment contains 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity, however, some subfamilies in this hierarchy also show activities related to ion transport or translation initiation. Many are trimeric in their active forms. Probab=99.35 E-value=1.3e-12 Score=69.05 Aligned_cols=75 Identities=17% Similarity=0.256 Sum_probs=46.3 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 68769889709928839979815457543231122222221001135520221111222110112222211011111111 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIG 81 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~ 81 (110) .++|+|.+++.|.++++|++++.|+++++|..... ........|+++++|+. T Consensus 4 G~~~~I~~~~~I~~~v~IG~~~~I~~~~~I~~~~~----------------------------~~~~~~~~Ig~~v~Ig~ 55 (78) T cd00208 4 GEGVKIHPKAVIRGPVVIGDNVNIGPGAVIGAATG----------------------------PNEKNPTIIGDNVEIGA 55 (78) T ss_pred CCCCEECCCCEECCCEEECCCCEECCCCEEEECCC----------------------------CCCCCCCEECCCEEECC T ss_conf 99849999849988269999998999989963798----------------------------86678989999969986 Q ss_pred CCCCCCCEEECCCEEECCCCEEC Q ss_conf 24322013974984998888996 Q gi|254781223|r 82 FTVISGNARVRGNAVVGGDTVVE 104 (110) Q Consensus 82 ~~~I~~~~~I~~~~~I~~~~~I~ 104 (110) ++.|.++++|++|+.|++++.++ T Consensus 56 ~a~I~~gv~IGdn~~IgagavVt 78 (78) T cd00208 56 NAVIHGGVKIGDNAVIGAGAVVT 78 (78) T ss_pred CCEEECCEEECCCCEECCCCEEC T ss_conf 89897981999999999885699 No 57 >cd03354 LbH_SAT Serine acetyltransferase (SAT): SAT catalyzes the CoA-dependent acetylation of the side chain hydroxyl group of L-serine to form O-acetylserine, as the first step of a two-step biosynthetic pathway in bacteria and plants leading to the formation of L-cysteine. This reaction represents a key metabolic point of regulation for the cysteine biosynthetic pathway due to its feedback inhibition by cysteine. The enzyme is a 175 kDa homohexamer, composed of a dimer of homotrimers. Each subunit contains an N-terminal alpha helical region and a C-terminal left-handed beta-helix (LbH) subdomain with 5 turns, each containing a hexapeptide repeat motif characteristic of the acyltransferase superfamily of enzymes. The trimer interface mainly involves the C-terminal LbH subdomain while the dimer (of trimers) interface is mediated by the N-terminal alpha helical subdomain. Probab=99.34 E-value=2e-12 Score=68.35 Aligned_cols=86 Identities=19% Similarity=0.280 Sum_probs=51.0 Q ss_pred EECCCCEECCCCEEC--CCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 992883997981545--754323112222222100113552022111122211011222221101111111124322013 Q gi|254781223|r 12 TVIDDARVSGNASVS--RFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNA 89 (110) Q Consensus 12 ~I~~~~~I~~n~~I~--~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~ 89 (110) .|.++++|+++++|. .+..|+.++.+++++.+..++.++...... ......++++++++.++.|.+++ T Consensus 4 ~I~~~~~IG~~~~I~~~~~v~Ig~~~~Ig~~~~I~~~~~ig~~~~~~----------~~~~~~Ig~~v~IG~~~~I~~gv 73 (101) T cd03354 4 DIHPGAKIGPGLFIDHGTGIVIGETAVIGDNCTIYQGVTLGGKGKGG----------GKRHPTIGDNVVIGAGAKILGNI 73 (101) T ss_pred EECCCCEECCCEEECCCCCEEECCCCEECCCCEECCCCEECCCCCCC----------CCCCCCCCCCEEECCCCEEECCC T ss_conf 98999999949299199877999998999998998887886631146----------87776148758635898892987 Q ss_pred EECCCEEECCCCEECCCE Q ss_conf 974984998888996980 Q gi|254781223|r 90 RVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 90 ~I~~~~~I~~~~~I~~~~ 107 (110) +|++++.|++++.+.+|. T Consensus 74 ~IG~~~vIgagsvV~~dv 91 (101) T cd03354 74 TIGDNVKIGANAVVTKDV 91 (101) T ss_pred EECCCCEECCCCEECCCC T ss_conf 999987998798899685 No 58 >PRK11830 dapD 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional Probab=99.34 E-value=2.4e-12 Score=68.01 Aligned_cols=104 Identities=21% Similarity=0.234 Sum_probs=70.9 Q ss_pred EEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCC--------CCCCCCCC Q ss_conf 6988970992883997981545754323112222222100113552022111122211011222--------22110111 Q gi|254781223|r 5 AVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVR--------DTAEVGGD 76 (110) Q Consensus 5 ~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~--------~~~~i~~~ 76 (110) ..+-|++.+...+.|++++.++| +.+.=++++++++.+...+.++..++++.++++++.+.+. ..+.|.++ T Consensus 97 vRvVP~a~vR~Gayi~~~vVlMP-s~VNiGAyv~~gtmiDtwatvGScAqiGknvhls~g~gigGvleP~~a~p~iied~ 175 (265) T PRK11830 97 VRVVPGAVVRRGAYIAKNVVLMP-SYVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSGGVGIGGVLEPLQANPVIIEDN 175 (265) T ss_pred EEECCCCEEEECEEECCCCEEEE-EEEEEEEEECCCCEEECCCCEECHHEECCCEEECCCCEEEEEECCCCCCCEEEECC T ss_conf 08878853760305459958864-17776039848954804762411532378336878741426524766798798368 Q ss_pred CCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 111112432201397498499888899698097 Q gi|254781223|r 77 AFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 77 ~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) |+|+.++.+.+.+.++++++++.++.++..+.| T Consensus 176 ~fiGa~~~v~eGv~v~~~avl~~gv~l~~st~I 208 (265) T PRK11830 176 CFIGARSEVVEGVIVEEGAVIGMGVFLGQSTKI 208 (265) T ss_pred CEECCCCEEEEEEEEECCCEECCEEEECCCCEE T ss_conf 287357479613999458697210476067167 No 59 >PRK13627 carnitine operon protein CaiE; Provisional Probab=99.32 E-value=2.8e-12 Score=67.69 Aligned_cols=32 Identities=22% Similarity=0.122 Sum_probs=13.6 Q ss_pred ECCCEEECCCCEECCCCEECCCCCCCCCCCCC Q ss_conf 88970992883997981545754323112222 Q gi|254781223|r 7 VRDCATVIDDARVSGNASVSRFAQVKSNAEVS 38 (110) Q Consensus 7 I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~ 38 (110) |++++.|.+++.|-+++.|++++.|+.++.+. T Consensus 13 i~~~afIap~A~viGdV~ig~~~sIw~~aVlR 44 (196) T PRK13627 13 VHPTAFVHPSAVLIGDVIVGAGVYIGPLASLR 44 (196) T ss_pred CCCCEEECCCCEEEEEEEECCCCEECCCEEEE T ss_conf 29983889999899849999998897770895 No 60 >COG0663 PaaY Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily [General function prediction only] Probab=99.30 E-value=4.1e-12 Score=67.02 Aligned_cols=97 Identities=15% Similarity=0.241 Sum_probs=47.3 Q ss_pred EECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98897099288399798154575432311222222210011355202211112221101122222110111111112432 Q gi|254781223|r 6 VVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVI 85 (110) Q Consensus 6 ~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I 85 (110) .|+++|.|.+++.|-++..|++++.|+.++.+..+. +...++.++-+.+++.+..... ..+.|++++.|+..+.+ T Consensus 13 ~i~~~a~Va~~A~viGdV~Ig~~vsIw~~aVlRgD~---~~I~IG~~tNIQDg~ViH~~~~--~p~~IG~~vtIGH~aiv 87 (176) T COG0663 13 KIDPTAFVAPSATVIGDVRIGAGVSIWPGAVLRGDV---EPIRIGARTNIQDGVVIHADPG--YPVTIGDDVTIGHGAVV 87 (176) T ss_pred CCCCCEEECCCCEEEEEEEECCCCEECCCEEEECCC---CCEEECCCCEECCCEEEECCCC--CCEEECCCCEECCCCEE T ss_conf 789862877999799859999997887763897467---7569878856037838952799--99297799588576589 Q ss_pred CCCEEECCCEEECCCCEECCCEE Q ss_conf 20139749849988889969809 Q gi|254781223|r 86 SGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 86 ~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) +. |+|+++++||.++.|.+.+. T Consensus 88 HG-c~Ig~~~lIGmgA~vldga~ 109 (176) T COG0663 88 HG-CTIGDNVLIGMGATVLDGAV 109 (176) T ss_pred EE-EEECCCCEEECCCEEECCCE T ss_conf 88-59878858913866757869 No 61 >PRK09677 putative lipopolysaccharide biosynthesis O-acetyl transferase WbbJ; Provisional Probab=99.30 E-value=9.1e-12 Score=65.58 Aligned_cols=106 Identities=9% Similarity=0.104 Sum_probs=74.5 Q ss_pred CCCEEECCCEEEC----CCCEECCCCEECCCCCCC--CCCCCCCCCCEEEEEEEECCCCCCCC---------CCCCCCCC Q ss_conf 6876988970992----883997981545754323--11222222210011355202211112---------22110112 Q gi|254781223|r 2 YDNAVVRDCATVI----DDARVSGNASVSRFAQVK--SNAEVSDNTYVRDNAKVGGYAKVSGN---------ASVGGNAI 66 (110) Q Consensus 2 ~~n~~I~~~a~I~----~~~~I~~n~~I~~~~~i~--~~~~i~~~~~i~~~~~i~~~~~i~~~---------~~i~~~~~ 66 (110) .++..+++.+++. +...||+|+.|++++.|. ....|++++.+..++.+.++.+-... ........ T Consensus 47 g~~~~~g~~~rl~~~~~gkl~IGdNv~Ig~~~~I~~~~~I~IG~nVlIa~~V~I~d~nH~~~~~~~~~~~~~~~~~~~~~ 126 (192) T PRK09677 47 GEGFTSGVGLRLDAFGRGKLFFGDNVQVNDYVHIACIESITIGRDTLIASKVFITDHNHGSFKHSDDFSSPNLPPDMRTL 126 (192) T ss_pred CCCEEECCCEEEEEECCCEEEECCCCEECCCCEEEECCCEEECCCCEECCCEEEECCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 89619668779997067508999984899985898677679899868879889967888765554543456768212667 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 22221101111111124322013974984998888996980 Q gi|254781223|r 67 VRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 67 ~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) ......|+++++|+.++.|.++++|++++.|++++.+.+|. T Consensus 127 ~~~pv~Ig~~~wIg~~~~Il~Gv~Ig~~~vvgA~SvV~~dv 167 (192) T PRK09677 127 ESSAVVIGQRVWIGENVTILPGVSIGNGCIVGANSVVTKSI 167 (192) T ss_pred CCCCEEECCCCEECCCCEEECCCEECCCCEECCCCEECCCC T ss_conf 75998989998999999991996999998991699998585 No 62 >cd05824 LbH_M1P_guanylylT_C Mannose-1-phosphate guanylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Mannose-1-phosphate guanylyltransferase is also known as GDP-mannose pyrophosphorylase. It catalyzes the synthesis of GDP-mannose from GTP and mannose-1-phosphate, and is involved in the maintenance of cell wall integrity and glycosylation. Similar to ADP-glucose pyrophosphorylase, it contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain, presumably with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.30 E-value=2.2e-12 Score=68.12 Aligned_cols=31 Identities=10% Similarity=0.201 Sum_probs=14.1 Q ss_pred ECCCEEECCCCEECCCCEECCCCCCCCCCCC Q ss_conf 8897099288399798154575432311222 Q gi|254781223|r 7 VRDCATVIDDARVSGNASVSRFAQVKSNAEV 37 (110) Q Consensus 7 I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i 37 (110) |.|+|.|.++++|++++.|+|+++|++++++ T Consensus 2 Idpsa~Ig~~~~IGp~vvIG~nv~IG~gv~I 32 (80) T cd05824 2 IDPSAKIGKTAKIGPNVVIGPNVTIGDGVRL 32 (80) T ss_pred CCCCCEECCCCEECCCCEECCCCEECCCCEE T ss_conf 3787898999999998589999999999799 No 63 >cd03357 LbH_MAT_GAT Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively at the C6 position of the nonreducing end glucosyl moiety. GAT specifically acetylates galactopyranosides. Furthermore, MAT shows higher affinity toward artificial substrates containing an alkyl or hydrophobic chain as well as a glucosyl unit. Active MAT and GAT are homotrimers, with each subunit consisting of an N-terminal alpha-helical region and a C-terminal left-handed parallel alpha-helix (LbH) subdomain with 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=99.29 E-value=8.1e-12 Score=65.80 Aligned_cols=106 Identities=13% Similarity=0.149 Sum_probs=70.4 Q ss_pred CCCEEECCCEEE--CCCCEECCCCEECCCCCCCC--CCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 687698897099--28839979815457543231--12222222100113552022111122211011222221101111 Q gi|254781223|r 2 YDNAVVRDCATV--IDDARVSGNASVSRFAQVKS--NAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDA 77 (110) Q Consensus 2 ~~n~~I~~~a~I--~~~~~I~~n~~I~~~~~i~~--~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~ 77 (110) .++++|.+.-.. ..+..||++++|..++++.+ ...|++++.++.++.+.-..+-.+...-...........|++++ T Consensus 46 g~~~~I~~pf~~dyG~ni~iG~~~~in~n~~ild~~~I~IG~~v~igp~v~i~t~~H~~d~~~r~~~~~~~~PI~Igd~v 125 (169) T cd03357 46 GENVYIEPPFHCDYGYNIHIGDNFYANFNCTILDVAPVTIGDNVLIGPNVQIYTAGHPLDPEERNRGLEYAKPITIGDNV 125 (169) T ss_pred CCCCEEECCEEEECCCCEEECCEEEECCCCEEEECCCEEECCCEEECCCCEEECCCCCCCHHHHHCCCCCCCCEEECCCC T ss_conf 99958978889834367798884488788489706887999980787985897689999978963466524556989743 Q ss_pred CCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 111124322013974984998888996980 Q gi|254781223|r 78 FVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 78 ~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +|+.++.|..+++||+++.|++++.|.+|. T Consensus 126 WIG~~~~IlpGVtIG~~~vIgAGsVVtkdi 155 (169) T cd03357 126 WIGGGVIILPGVTIGDNSVIGAGSVVTKDI 155 (169) T ss_pred EECCCCEEECCCEECCCCEECCCCEECCCC T ss_conf 628997890997999999998898797465 No 64 >KOG1461 consensus Probab=99.28 E-value=2.7e-12 Score=67.79 Aligned_cols=103 Identities=16% Similarity=0.224 Sum_probs=44.9 Q ss_pred CEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCC-----CCCCCC Q ss_conf 769889709928839979815457543231122222221001135520221111222110112222211-----011111 Q gi|254781223|r 4 NAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAE-----VGGDAF 78 (110) Q Consensus 4 n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~-----i~~~~~ 78 (110) |.+-++.+.+...+.++.+..|+.++.|+.++.|. |+.++.++.++.++.+. ++.++.++.+.++|. +.+++. T Consensus 315 ~IYk~~dv~~~~~~~v~~~~~ig~gT~Ig~g~~I~-NSVIG~~c~IgsN~~I~-~S~iw~~v~Igdnc~I~~aii~d~v~ 392 (673) T KOG1461 315 NIYKSPDVVLSHSVIVGANVVIGAGTKIGSGSKIS-NSVIGANCRIGSNVRIK-NSFIWNNVTIGDNCRIDHAIICDDVK 392 (673) T ss_pred CCCCCCCCEEHHHCCCCCEEEECCCCCCCCCCEEE-CCEECCCCEECCCEEEE-EEEEECCCEECCCCEEEEEEEECCCE T ss_conf 40147401000120124407951454126798555-22665897745745885-13662583887886573468605837 Q ss_pred CCCCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 111243220139749849988889969809 Q gi|254781223|r 79 VIGFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 79 i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) |+.++.+.+.+.++.+++++.+..+..++. T Consensus 393 i~~~~~l~~g~vl~~~VVv~~~~~l~~ns~ 422 (673) T KOG1461 393 IGEGAILKPGSVLGFGVVVGRNFVLPKNSK 422 (673) T ss_pred ECCCCCCCCCCEEEEEEEECCCCCCCCCCC T ss_conf 678844078857752168679831245443 No 65 >cd05825 LbH_wcaF_like wcaF-like: This group is composed of the protein product of the E. coli wcaF gene and similar proteins. WcaF is part of the gene cluster responsible for the biosynthesis of the extracellular polysaccharide colanic acid. The wcaF protein is predicted to contain a left-handed parallel beta-helix (LbH) domain encoded by imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Many are trimeric in their active forms. Probab=99.28 E-value=5.6e-12 Score=66.45 Aligned_cols=88 Identities=14% Similarity=0.189 Sum_probs=53.2 Q ss_pred CEECCCCEECCCCCC--CCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCC Q ss_conf 399798154575432--311222222210011355202211112221101122222110111111112432201397498 Q gi|254781223|r 17 ARVSGNASVSRFAQV--KSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGN 94 (110) Q Consensus 17 ~~I~~n~~I~~~~~i--~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~ 94 (110) +.||+++.|++++.| .+...+++++.+..++.+....+...... .....+...|+++++++.++.|.++++|+++ T Consensus 4 v~IG~~~~Ig~~~~I~~~~~i~IG~~~~i~~~v~I~~~~h~~~~~~---~~~~~~~v~Ig~~v~IG~~~~I~~gv~Ig~~ 80 (107) T cd05825 4 LTIGDNSWIGEGVWIYNLAPVTIGSDACISQGAYLCTGSHDYRSPA---FPLITAPIVIGDGAWVAAEAFVGPGVTIGEG 80 (107) T ss_pred EEECCCCEECCCCEEEECCCEEECCCCEECCCCEECCCCCCCCCCC---CCEECCCEEECCCCEECCCCEEECCCEECCC T ss_conf 7989986899998995388579999739959959707888876766---5316399799388587498789699191899 Q ss_pred EEECCCCEECCCE Q ss_conf 4998888996980 Q gi|254781223|r 95 AVVGGDTVVEGDT 107 (110) Q Consensus 95 ~~I~~~~~I~~~~ 107 (110) ++++.++.+.+|. T Consensus 81 svVgagSvV~kdv 93 (107) T cd05825 81 AVVGARSVVVRDL 93 (107) T ss_pred CEEECCCEECCCC T ss_conf 7994898890385 No 66 >cd05824 LbH_M1P_guanylylT_C Mannose-1-phosphate guanylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Mannose-1-phosphate guanylyltransferase is also known as GDP-mannose pyrophosphorylase. It catalyzes the synthesis of GDP-mannose from GTP and mannose-1-phosphate, and is involved in the maintenance of cell wall integrity and glycosylation. Similar to ADP-glucose pyrophosphorylase, it contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain, presumably with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.25 E-value=8e-12 Score=65.82 Aligned_cols=31 Identities=13% Similarity=0.244 Sum_probs=14.9 Q ss_pred ECCCCEECCCCEECCCCCCCCCCCCCCCCCE Q ss_conf 9288399798154575432311222222210 Q gi|254781223|r 13 VIDDARVSGNASVSRFAQVKSNAEVSDNTYV 43 (110) Q Consensus 13 I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i 43 (110) |-|.++|+++|.|+|+++|++++.+++++.+ T Consensus 2 Idpsa~Ig~~~~IGp~vvIG~nv~IG~gv~I 32 (80) T cd05824 2 IDPSAKIGKTAKIGPNVVIGPNVTIGDGVRL 32 (80) T ss_pred CCCCCEECCCCEECCCCEECCCCEECCCCEE T ss_conf 3787898999999998589999999999799 No 67 >COG1207 GlmU N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) [Cell envelope biogenesis, outer membrane] Probab=99.23 E-value=1.9e-11 Score=64.24 Aligned_cols=42 Identities=10% Similarity=0.248 Sum_probs=20.1 Q ss_pred CCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEE Q ss_conf 8769889709928839979815457543231122222221001 Q gi|254781223|r 3 DNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRD 45 (110) Q Consensus 3 ~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~ 45 (110) .++.|.|++.+.++++|+++|.|+|++.| .++.+++++.+.. T Consensus 273 ~DvvI~p~v~l~G~t~ig~~v~iGpg~~i-~ds~I~~~a~I~~ 314 (460) T COG1207 273 RDVVIEPNVILEGNTVIGDNVVIGPGSVI-KDSVIGDNAVIKA 314 (460) T ss_pred CCEEEECCCEEEEEEEECCCEEECCCCEE-EEEEECCCCEEEE T ss_conf 81499148489415896796499999678-7538769988975 No 68 >PRK09527 lacA galactoside O-acetyltransferase; Reviewed Probab=99.21 E-value=2.9e-11 Score=63.49 Aligned_cols=106 Identities=15% Similarity=0.180 Sum_probs=69.5 Q ss_pred CCCEEECCCEEE--CCCCEECCCCEECCCCCCCCC--CCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 687698897099--288399798154575432311--2222222100113552022111122211011222221101111 Q gi|254781223|r 2 YDNAVVRDCATV--IDDARVSGNASVSRFAQVKSN--AEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDA 77 (110) Q Consensus 2 ~~n~~I~~~a~I--~~~~~I~~n~~I~~~~~i~~~--~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~ 77 (110) .++++|.+.-.. +-|..||+++++-.++++-+. ..|++++.++.++++.-..+-.....-...........|++++ T Consensus 59 g~~~~IepPF~cdYG~NI~iG~~~fiN~n~~ilD~~~I~IGd~v~iGPnV~i~t~~Hp~~~~~R~~~~~~~~pi~Ig~~v 138 (203) T PRK09527 59 GENAWVEPPVYFSYGSNIHIGRNFYANFNLTIVDDYTVTIGDNVLIAPNVTLSVTGHPVHHELRKNGEMYSFPITIGNNV 138 (203) T ss_pred CCCCEEECCEEEECCCCEEECCCEEECCCCEEEECCCEEECCCEEECCCCEEECCCCCCCHHHHHCCCCCCCCEEECCEE T ss_conf 99978969978725479697798798688389957637998972886997893698988988973387668885999868 Q ss_pred CCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 111124322013974984998888996980 Q gi|254781223|r 78 FVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 78 ~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +|+.++.|..++.||+++.|++++.|.+|. T Consensus 139 wIG~~~~I~pGv~IG~~~vigAgsvVtkdi 168 (203) T PRK09527 139 WIGSHVVINPGVTIGDNSVIGAGSVVTKDI 168 (203) T ss_pred EECCCCEECCCCEECCCCEECCCCEECCCC T ss_conf 989999990992999997991498885306 No 69 >KOG1461 consensus Probab=99.19 E-value=1.6e-11 Score=64.55 Aligned_cols=93 Identities=12% Similarity=0.279 Sum_probs=49.1 Q ss_pred EECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 99288399798154575432311222222210011355202211112221101122222110111111112432201397 Q gi|254781223|r 12 TVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARV 91 (110) Q Consensus 12 ~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I 91 (110) .-.+++.+...|.++.++.|+..+.++.++.+ .|..|+.++.|+.++.+ .++.+..++.|+++|.| +++.|+++++| T Consensus 317 Yk~~dv~~~~~~~v~~~~~ig~gT~Ig~g~~I-~NSVIG~~c~IgsN~~I-~~S~iw~~v~Igdnc~I-~~aii~d~v~i 393 (673) T KOG1461 317 YKSPDVVLSHSVIVGANVVIGAGTKIGSGSKI-SNSVIGANCRIGSNVRI-KNSFIWNNVTIGDNCRI-DHAIICDDVKI 393 (673) T ss_pred CCCCCCEEHHHCCCCCEEEECCCCCCCCCCEE-ECCEECCCCEECCCEEE-EEEEEECCCEECCCCEE-EEEEEECCCEE T ss_conf 14740100012012440795145412679855-52266589774574588-51366258388788657-34686058376 Q ss_pred CCCEEECCCCEECCCE Q ss_conf 4984998888996980 Q gi|254781223|r 92 RGNAVVGGDTVVEGDT 107 (110) Q Consensus 92 ~~~~~I~~~~~I~~~~ 107 (110) +.++.+-.++.++-++ T Consensus 394 ~~~~~l~~g~vl~~~V 409 (673) T KOG1461 394 GEGAILKPGSVLGFGV 409 (673) T ss_pred CCCCCCCCCCEEEEEE T ss_conf 7884407885775216 No 70 >cd05787 LbH_eIF2B_epsilon eIF-2B epsilon subunit, central Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B epsilon subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold, a central LbH domain containing 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal domain of unknown function that is present in eIF-4 gamma, eIF-5, and eIF-2B epsilon. The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=99.18 E-value=3.1e-11 Score=63.34 Aligned_cols=30 Identities=7% Similarity=0.203 Sum_probs=15.9 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCC Q ss_conf 9687698897099288399798154575432 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQV 31 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i 31 (110) |.+|+.|.+++.|. ++.|++||.|++++.| T Consensus 2 IG~~t~Ig~~~~I~-~svIG~nc~Ig~~~~I 31 (79) T cd05787 2 IGRGTSIGEGTTIK-NSVIGRNCKIGKNVVI 31 (79) T ss_pred CCCCCEECCCCEEE-CCEECCCCEECCCCEE T ss_conf 99989999999995-9998999999999889 No 71 >PRK10092 maltose O-acetyltransferase; Provisional Probab=99.17 E-value=5.2e-11 Score=62.44 Aligned_cols=104 Identities=14% Similarity=0.141 Sum_probs=69.4 Q ss_pred CEEECCCEEE--CCCCEECCCCEECCCCCCCC--CCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 7698897099--28839979815457543231--1222222210011355202211112221101122222110111111 Q gi|254781223|r 4 NAVVRDCATV--IDDARVSGNASVSRFAQVKS--NAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFV 79 (110) Q Consensus 4 n~~I~~~a~I--~~~~~I~~n~~I~~~~~i~~--~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i 79 (110) +++|.|.-.. +.|..||+|+++..++++-+ ...|++++.++.++++.-..+-.+...-...........|+++++| T Consensus 59 ~~~I~ppF~cdYG~ni~iG~~~~iN~~~~ilD~~~I~IG~~v~iGpnv~i~t~~Hp~d~~~R~~~~~~~~pi~Ig~~vwi 138 (183) T PRK10092 59 EAYIEPTFRCDYGYNIFLGNNFYANFDCVMLDVCPIHIGDNCMLAPGVHIYTATHPLDPVARNSGAELGKPVTIGNNVWI 138 (183) T ss_pred CCEEECCEEEECCCCEEECCCEEEECCEEEEECCCEEECCCEEECCCCEEECCCCCCCHHHHHCCCEECCCEEECCCCEE T ss_conf 83795888840677878678769927849994787399998058499789859998898895258631687798998688 Q ss_pred CCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 1124322013974984998888996980 Q gi|254781223|r 80 IGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 80 ~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +.++.|..++.||+++.|++++.+.+|. T Consensus 139 g~~~~i~pGv~IG~~~vigagsvV~~di 166 (183) T PRK10092 139 GGRAVINPGVTIGDNVVVASGAVVTKDV 166 (183) T ss_pred CCCCEEECCCEECCCCEECCCCEEEEEC T ss_conf 8998990798999998997598892277 No 72 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=99.13 E-value=2e-10 Score=59.97 Aligned_cols=102 Identities=16% Similarity=0.291 Sum_probs=54.3 Q ss_pred EEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 69889709928839979815457543231122222221001135520221111222110112222211011111111243 Q gi|254781223|r 5 AVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTV 84 (110) Q Consensus 5 ~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~ 84 (110) +.+-+.+.+.+++.|++++.|.+++.|++++.++.++.+ .+..+..++.+...+.+ .++.+..++.++.+..+++ +. T Consensus 256 ~~i~gp~~ig~~~~i~~~~~i~~~~~ig~~~~I~~~~~i-~~Sii~~~~~i~~~~~i-~~sIi~~~~~ig~~~~i~d-~~ 332 (358) T COG1208 256 AYIIGPVVIGPGAKIGPGALIGPYTVIGEGVTIGNGVEI-KNSIIMDNVVIGHGSYI-GDSIIGENCKIGASLIIGD-VV 332 (358) T ss_pred EEEECCEEECCCCEECCCCEECCCCEECCCCEECCCCEE-EEEEEECCCEECCCCEE-CCEEECCCCEECCCCEEEE-EE T ss_conf 388589498269789888486898789999888998778-75488778798888888-2069818859998615701-58 Q ss_pred CCCCEEECCCEEECCCCEECCCEEE Q ss_conf 2201397498499888899698097 Q gi|254781223|r 85 ISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 85 I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) ++.++.+..++.++..+.+..+.++ T Consensus 333 ~g~~~~i~~g~~~~~~~~~~~~~~~ 357 (358) T COG1208 333 IGINSEILPGVVVGPGSVVESGEIE 357 (358) T ss_pred ECCCEEECCCEEECCCCCCCCCCCC T ss_conf 6366086276184786061576303 No 73 >cd03349 LbH_XAT Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. Members of this class of enzymes include Enterococcus faecium streptogramin A acetyltransferase and Pseudomonas aeruginosa chloramphenicol acetyltransferase. They contain repeated copies of a six-residue hexapeptide repeat sequence motif (X-[STAV]-X-[LIV]-[GAED]-X) and adopt a left-handed parallel beta helix (LbH) structure. The active enzyme is a trimer with CoA and substrate binding sites at the interface of two separate LbH subunits. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Probab=99.12 E-value=2.5e-10 Score=59.60 Aligned_cols=104 Identities=13% Similarity=0.062 Sum_probs=60.7 Q ss_pred CCCCEEECCCEEECC--CCEECCCCEECCCCCCCCCC-CCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 968769889709928--83997981545754323112-222222100113552022111122211011222221101111 Q gi|254781223|r 1 MYDNAVVRDCATVID--DARVSGNASVSRFAQVKSNA-EVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDA 77 (110) Q Consensus 1 i~~n~~I~~~a~I~~--~~~I~~n~~I~~~~~i~~~~-~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~ 77 (110) |.+++++++++.+.. ++.||++|.|.+++.|..+. +..+.............. . ...............|++++ T Consensus 4 IG~~syi~~~~~i~~~~~v~IG~~~~I~~~v~I~~~~~h~~~~~~~~p~~~~~~~~--~-~~~~~~~~~~~~~i~Ig~~v 80 (145) T cd03349 4 VGDYSYGSGPDCDVGGDKLSIGKFCSIAPGVKIGLGGNHPTDWVSTYPFYIFGGEW--E-DDAKFDDWPSKGDVIIGNDV 80 (145) T ss_pred ECCCEEECCCCEEECCCCCEECCCCEECCCCEEECCCCCCCCCEEECCEEEECCCC--C-CCCCCCCCCCCCCEEECCCE T ss_conf 89987888993696289728999978999999979998787560644207523666--7-62000265416997988996 Q ss_pred CCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 111124322013974984998888996980 Q gi|254781223|r 78 FVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 78 ~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +|+.++.|..+++||+++.|++++.|.+|. T Consensus 81 wIG~n~~Il~GV~IG~gavIgAgSvVtkdv 110 (145) T cd03349 81 WIGHGATILPGVTIGDGAVIAAGAVVTKDV 110 (145) T ss_pred EECCCCEEECCCEECCCCEEECCCEECCCC T ss_conf 999988895880967886892897897046 No 74 >cd00208 LbetaH Left-handed parallel beta-Helix (LbetaH or LbH) domain: The alignment contains 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity, however, some subfamilies in this hierarchy also show activities related to ion transport or translation initiation. Many are trimeric in their active forms. Probab=99.08 E-value=9.9e-11 Score=61.25 Aligned_cols=77 Identities=16% Similarity=0.266 Sum_probs=52.5 Q ss_pred CEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEE Q ss_conf 39979815457543231122222221001135520221111222110112222211011111111243220139749849 Q gi|254781223|r 17 ARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAV 96 (110) Q Consensus 17 ~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~ 96 (110) +.|+++|+|+++++|++.+.+++++.+..++.+.. ...-.......|++++.|+.++.|.+++. T Consensus 1 V~IG~~~~I~~~~~I~~~v~IG~~~~I~~~~~I~~----------------~~~~~~~~~~~Ig~~v~Ig~~a~I~~gv~ 64 (78) T cd00208 1 VFIGEGVKIHPKAVIRGPVVIGDNVNIGPGAVIGA----------------ATGPNEKNPTIIGDNVEIGANAVIHGGVK 64 (78) T ss_pred CEECCCCEECCCCEECCCEEECCCCEECCCCEEEE----------------CCCCCCCCCCEECCCEEECCCCEEECCEE T ss_conf 99999849999849988269999998999989963----------------79886678989999969986898979819 Q ss_pred ECCCCEECCCEEE Q ss_conf 9888899698097 Q gi|254781223|r 97 VGGDTVVEGDTVL 109 (110) Q Consensus 97 I~~~~~I~~~~~i 109 (110) |++++.|..+++| T Consensus 65 IGdn~~IgagavV 77 (78) T cd00208 65 IGDNAVIGAGAVV 77 (78) T ss_pred ECCCCEECCCCEE T ss_conf 9999999988569 No 75 >PRK05293 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=99.08 E-value=2.3e-10 Score=59.75 Aligned_cols=69 Identities=19% Similarity=0.226 Sum_probs=31.3 Q ss_pred CCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 222221001135520221111222110112222211011111111243220139749849988889969809 Q gi|254781223|r 37 VSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 37 i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) ++.++.+..++.+. ++.+++++.++.++.+. ++.+++++.|+.++.+ .+..+.+.++|+.++.|..|++ T Consensus 311 Ig~~v~Ig~ga~I~-nSiI~~~~~Ig~~~~I~-nsIi~~~~~Ig~~~~~-~~~~~~~~~vi~~~~~~~~~~~ 379 (381) T PRK05293 311 LFQGVQVGEGSIVK-DSVIMPGAKIGENVVIE-RAIIGENAVIGDGVIG-VISGIDEVVVIGEGEVVGVDKI 379 (381) T ss_pred ECCCCEECCCCEEE-CCEECCCCEECCCCEEE-EEEECCCCEECCCCEE-ECCEECCCEEECCCCEECCCCC T ss_conf 88998999999997-78985969999899990-1299597999979689-5774268889899948897636 No 76 >cd05787 LbH_eIF2B_epsilon eIF-2B epsilon subunit, central Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B epsilon subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold, a central LbH domain containing 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X), and a C-terminal domain of unknown function that is present in eIF-4 gamma, eIF-5, and eIF-2B epsilon. The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=99.08 E-value=9.4e-11 Score=61.36 Aligned_cols=46 Identities=7% Similarity=0.207 Sum_probs=18.4 Q ss_pred CCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCC Q ss_conf 288399798154575432311222222210011355202211112221 Q gi|254781223|r 14 IDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASV 61 (110) Q Consensus 14 ~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i 61 (110) .+++.|++++.| +++.|++++.+++++.+. ++.+.+++.+.+++.+ T Consensus 3 G~~t~Ig~~~~I-~~svIG~nc~Ig~~~~I~-~svi~d~~~Ig~~~~i 48 (79) T cd05787 3 GRGTSIGEGTTI-KNSVIGRNCKIGKNVVID-NSYIWDDVTIEDGCTI 48 (79) T ss_pred CCCCEECCCCEE-ECCEECCCCEECCCCEEE-CCEEECCCEECCCCEE T ss_conf 998999999999-599989999999998894-8999197899999999 No 77 >COG2171 DapD Tetrahydrodipicolinate N-succinyltransferase [Amino acid transport and metabolism] Probab=99.05 E-value=2.9e-10 Score=59.29 Aligned_cols=102 Identities=22% Similarity=0.222 Sum_probs=64.8 Q ss_pred CCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCC Q ss_conf 8970992883997981545754323112222222100113552022111122211011222221101111111--12432 Q gi|254781223|r 8 RDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVI--GFTVI 85 (110) Q Consensus 8 ~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~--~~~~I 85 (110) ...++|.|.+.+...++|.+++++.....+.-++.+...+.+...+..++.++++.++.+++.+.|+.--.-. ..+.| T Consensus 106 ~~g~RI~p~a~VR~ga~i~~gtvvM~~sfVNigA~~~~gtMVd~~as~G~~a~VGkn~higgGa~I~GVLep~~a~Pv~I 185 (271) T COG2171 106 PEGVRIVPGAIVRLGAYIAKGTVVMPESFVNIGAGTGEGTMVDGRASVGSCAQVGKNSHIGGGASIGGVLEPLQANPVII 185 (271) T ss_pred CCCEEECCCCEEEECCEECCCCEECCCCEEEECCCCCCCEEEEEEEEEECCEEECCCCCCCCCCEEEEEECCCCCCCEEE T ss_conf 57636367647760528779938755566877760275258961565513179799845278636767734888788698 Q ss_pred CCCEEECCCEEECCCCEECCCEEE Q ss_conf 201397498499888899698097 Q gi|254781223|r 86 SGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 86 ~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) ++||.||.|+++..+..+..+++| T Consensus 186 gdncliGAns~~veGV~vGdg~VV 209 (271) T COG2171 186 GDNCLIGANSEVVEGVIVGDGCVV 209 (271) T ss_pred CCCCEECCCCCEEEEEEECCCCEE T ss_conf 776186245334861696789688 No 78 >PRK05293 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=99.05 E-value=5.2e-10 Score=58.24 Aligned_cols=45 Identities=16% Similarity=0.242 Sum_probs=16.2 Q ss_pred EECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCE Q ss_conf 52022111122211011222221101111111124322013974984 Q gi|254781223|r 49 VGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNA 95 (110) Q Consensus 49 i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~ 95 (110) ++.++.|+.++.+ .++.+..++.|++++.| .++.++.++.|++|+ T Consensus 311 Ig~~v~Ig~ga~I-~nSiI~~~~~Ig~~~~I-~nsIi~~~~~Ig~~~ 355 (381) T PRK05293 311 LFQGVQVGEGSIV-KDSVIMPGAKIGENVVI-ERAIIGENAVIGDGV 355 (381) T ss_pred ECCCCEECCCCEE-ECCEECCCCEECCCCEE-EEEEECCCCEECCCC T ss_conf 8899899999999-77898596999989999-012995979999796 No 79 >cd03354 LbH_SAT Serine acetyltransferase (SAT): SAT catalyzes the CoA-dependent acetylation of the side chain hydroxyl group of L-serine to form O-acetylserine, as the first step of a two-step biosynthetic pathway in bacteria and plants leading to the formation of L-cysteine. This reaction represents a key metabolic point of regulation for the cysteine biosynthetic pathway due to its feedback inhibition by cysteine. The enzyme is a 175 kDa homohexamer, composed of a dimer of homotrimers. Each subunit contains an N-terminal alpha helical region and a C-terminal left-handed beta-helix (LbH) subdomain with 5 turns, each containing a hexapeptide repeat motif characteristic of the acyltransferase superfamily of enzymes. The trimer interface mainly involves the C-terminal LbH subdomain while the dimer (of trimers) interface is mediated by the N-terminal alpha helical subdomain. Probab=99.04 E-value=2.1e-10 Score=59.86 Aligned_cols=83 Identities=8% Similarity=0.091 Sum_probs=39.8 Q ss_pred CCCCEEECCCEEEC--CCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 96876988970992--8839979815457543231122222221001135520221111222110112222211011111 Q gi|254781223|r 1 MYDNAVVRDCATVI--DDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAF 78 (110) Q Consensus 1 i~~n~~I~~~a~I~--~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~ 78 (110) |+++++|.++++|. .++.|++++.|++++.|..++.++.+..... .....+.++++++.++.+.++..+++++. T Consensus 5 I~~~~~IG~~~~I~~~~~v~Ig~~~~Ig~~~~I~~~~~ig~~~~~~~----~~~~~Ig~~v~IG~~~~I~~gv~IG~~~v 80 (101) T cd03354 5 IHPGAKIGPGLFIDHGTGIVIGETAVIGDNCTIYQGVTLGGKGKGGG----KRHPTIGDNVVIGAGAKILGNITIGDNVK 80 (101) T ss_pred ECCCCEECCCEEECCCCCEEECCCCEECCCCEECCCCEECCCCCCCC----CCCCCCCCCEEECCCCEEECCCEECCCCE T ss_conf 89999999492991998779999989999989988878866311468----77761487586358988929879999879 Q ss_pred CCCCCCCCC Q ss_conf 111243220 Q gi|254781223|r 79 VIGFTVISG 87 (110) Q Consensus 79 i~~~~~I~~ 87 (110) ++.++.+.. T Consensus 81 IgagsvV~~ 89 (101) T cd03354 81 IGANAVVTK 89 (101) T ss_pred ECCCCEECC T ss_conf 987988996 No 80 >cd04647 LbH_MAT_like Maltose O-acyltransferase (MAT)-like: This family is composed of maltose O-acetyltransferase, galactoside O-acetyltransferase (GAT), xenobiotic acyltransferase (XAT) and similar proteins. MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively while GAT specifically acetylates galactopyranosides. XAT catalyzes the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Members of this family contain a a left-handed parallel beta-helix (LbH) domain with at least 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). They are trimeric in their active form. Probab=99.03 E-value=5.9e-10 Score=58.04 Aligned_cols=89 Identities=12% Similarity=0.180 Sum_probs=47.6 Q ss_pred CCCCEEECCCEEECC--CCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 968769889709928--839979815457543231122222221001135520221111222110112222211011111 Q gi|254781223|r 1 MYDNAVVRDCATVID--DARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAF 78 (110) Q Consensus 1 i~~n~~I~~~a~I~~--~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~ 78 (110) |.+|++|.+++.|.. .+.|+++|.|++++.|.... ..+.............+...+.+++.++.++. T Consensus 4 IG~~~~Ig~~~~I~~~~~i~IG~~~~I~~~~~i~~~~-----------~~~~~~~~~~~~~~~~~~v~Ig~~v~IG~~~~ 72 (109) T cd04647 4 IGDNVYIGPGCVISAGGGITIGDNVLIGPNVTIYDHN-----------HDIDDPERPIEQGVTSAPIVIGDDVWIGANVV 72 (109) T ss_pred ECCCCEECCCCEEECCCCEEECCCCEECCEEEEEEEE-----------EECCCCCEECCCCCEECCEEEECCCEECCCEE T ss_conf 9999899999999689887999984993508983300-----------13288752526763718979942357889839 Q ss_pred CCCCCCCCCCEEECCCEEECCC Q ss_conf 1112432201397498499888 Q gi|254781223|r 79 VIGFTVISGNARVRGNAVVGGD 100 (110) Q Consensus 79 i~~~~~I~~~~~I~~~~~I~~~ 100 (110) +..++.|++++.|+.++.+-.+ T Consensus 73 I~~gv~Ig~~~iIga~SvV~k~ 94 (109) T cd04647 73 ILPGVTIGDGAVVGAGSVVTKD 94 (109) T ss_pred EEECCEECCCCEECCCCEECCC T ss_conf 9285130898699379889148 No 81 >cd03356 LbH_G1P_AT_C_like Left-handed parallel beta-Helix (LbH) domain of a group of proteins with similarity to glucose-1-phosphate adenylyltransferase: Included in this family are glucose-1-phosphate adenylyltransferase, mannose-1-phosphate guanylyltransferase, and the eukaryotic translation initiation factor eIF-2B subunits, epsilon and gamma. Most members of this family contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold, followed by a LbH fold domain with at least 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). eIF-2B epsilon contains an additional domain of unknown function at the C-terminus. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.02 E-value=4.9e-10 Score=58.38 Aligned_cols=29 Identities=10% Similarity=0.223 Sum_probs=12.4 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCC Q ss_conf 687698897099288399798154575432 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQV 31 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i 31 (110) .+++.|.+++.|.+ +.|+++|.|++++.| T Consensus 3 G~~~~Ig~~~~I~~-svIG~~c~Ig~~~~I 31 (79) T cd03356 3 GESTVIGENAIIKN-SVIGDNVRIGDGVTI 31 (79) T ss_pred CCCCEECCCCEEEC-CEECCCCEECCCCEE T ss_conf 89799999999959-999999999999699 No 82 >TIGR03308 phn_thr-fam phosphonate metabolim protein, transferase hexapeptide repeat family. This family of proteins contains copies of the Bacterial transferase hexapeptide repeat family (pfam00132) and is only found in operons encoding the phosphonate C-P lyase system (GenProp0232). Many C-P lyase operons, however, lack a homolog of this protein. Probab=99.01 E-value=8.6e-10 Score=57.34 Aligned_cols=105 Identities=11% Similarity=0.209 Sum_probs=56.6 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCC-------- Q ss_conf 687698897099288399798154575432311222222210011355202211112221101122222110-------- Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEV-------- 73 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i-------- 73 (110) .++..|+|.|+|. ++++|+++.|++++.+. ++.+++++.+..++.+. ++.|+.++.+..++.+....+- T Consensus 6 ~~~p~Ihp~a~i~-~~~lG~~v~Ig~~~~i~-ns~IGDySyI~~~~~I~-n~~IGkfcsIa~~v~I~~~nHp~~~~s~~~ 82 (204) T TIGR03308 6 SPEPTLHPTAELT-ESKLGRYTEIGERTRLR-EVALGDYSYVMRDCDII-YTTIGKFCSIAAMVRINATNHPMERPTLHH 82 (204) T ss_pred CCCCEECCCCEEC-CCCCCCCEEECCCCEEE-CCEECCCCEECCCCEEE-CCEECCCCEECCCCEECCCCCCCCCCEECC T ss_conf 7997689998683-27837833999995883-37999985888997598-886999848889879898778766726777 Q ss_pred -----------------------CCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf -----------------------111111112432201397498499888899698097 Q gi|254781223|r 74 -----------------------GGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 74 -----------------------~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) .....|++++-|+.++.|-+++.||+++.|..+++| T Consensus 83 f~y~~~~~~~~~~~~~~~~~~~~~~~~~Ig~dvwiG~~~~i~~gv~IG~gavigagsvV 141 (204) T TIGR03308 83 FTYRAAMYFDDASDDADFFAWRRAKRVTIGHDVWIGHGAVILPGVTIGNGAVIAAGAVV 141 (204) T ss_pred EEEECCCCCCCCCCCHHHHCCCCCCCEEECCCCEECCCCEECCCCEECCCCEECCCCEE T ss_conf 75537533466666122321334787098997688799899199799999899779989 No 83 >cd05635 LbH_unknown Uncharacterized proteins, Left-handed parallel beta-Helix (LbH) domain: Members in this group are uncharacterized bacterial proteins containing a LbH domain with multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=99.00 E-value=9.4e-10 Score=57.19 Aligned_cols=32 Identities=9% Similarity=-0.011 Sum_probs=13.9 Q ss_pred CCCCCCCCCCCCCCCCCCEEECCCEEECCCCEE Q ss_conf 110111111112432201397498499888899 Q gi|254781223|r 71 AEVGGDAFVIGFTVISGNARVRGNAVVGGDTVV 103 (110) Q Consensus 71 ~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I 103 (110) +.++..+.+...+.|+ ++.|++++-+|+++.. T Consensus 63 S~i~~~s~~~H~~YlG-DS~iG~~vNiGAGT~t 94 (101) T cd05635 63 SIIEGYSNKQHDGFLG-HSYLGSWCNLGAGTNN 94 (101) T ss_pred EEEECCCCCCCCCEEE-EEEECCCCEECCCCEE T ss_conf 1885787568775876-1178387578778578 No 84 >cd04652 LbH_eIF2B_gamma_C eIF-2B gamma subunit, C-terminal Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B gamma subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH domain with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=98.97 E-value=7.7e-10 Score=57.54 Aligned_cols=29 Identities=10% Similarity=0.253 Sum_probs=12.5 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCC Q ss_conf 687698897099288399798154575432 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQV 31 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i 31 (110) .+|++|.+++.|. ++.|++||.|++++.+ T Consensus 3 G~~~~Ig~~~~I~-~SvIG~nc~Ig~~~~i 31 (81) T cd04652 3 GENTQVGEKTSIK-RSVIGANCKIGKRVKI 31 (81) T ss_pred CCCCEECCCCEEE-CCEECCCCEECCCCEE T ss_conf 7988999999896-7999799999999689 No 85 >cd05635 LbH_unknown Uncharacterized proteins, Left-handed parallel beta-Helix (LbH) domain: Members in this group are uncharacterized bacterial proteins containing a LbH domain with multiple turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=98.97 E-value=1.2e-09 Score=56.71 Aligned_cols=85 Identities=18% Similarity=0.236 Sum_probs=39.0 Q ss_pred EEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 69889709928839979815457543231122222221001135520221111222110112222211011111111243 Q gi|254781223|r 5 AVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTV 84 (110) Q Consensus 5 ~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~ 84 (110) ++|.+++.|.+.+.|.++|+|++++.|++++++..++.+..++.++. .-.++.+...+.+.+.+.+ +++. T Consensus 12 V~I~~~~~I~~~~~i~Gp~~IG~~~~I~p~a~Ir~~t~Ig~~~~IG~---------Eik~S~i~~~s~~~H~~Yl-GDS~ 81 (101) T cd05635 12 IYIGKDAVIEPFAVIEGPVYIGPGSRVKMGARIYGNTTIGPTCKIGG---------EVEDSIIEGYSNKQHDGFL-GHSY 81 (101) T ss_pred EEECCCCEECCCCEEECCEEECCCCEECCCCEEECCCEECCCCEECC---------EEEEEEEECCCCCCCCCEE-EEEE T ss_conf 99879999999999976669999999999978638879999977614---------0876188578756877587-6117 Q ss_pred CCCCEEECCCEEECC Q ss_conf 220139749849988 Q gi|254781223|r 85 ISGNARVRGNAVVGG 99 (110) Q Consensus 85 I~~~~~I~~~~~I~~ 99 (110) |+.++-+|.++...+ T Consensus 82 iG~~vNiGAGT~t~N 96 (101) T cd05635 82 LGSWCNLGAGTNNSD 96 (101) T ss_pred ECCCCEECCCCEECC T ss_conf 838757877857805 No 86 >cd03356 LbH_G1P_AT_C_like Left-handed parallel beta-Helix (LbH) domain of a group of proteins with similarity to glucose-1-phosphate adenylyltransferase: Included in this family are glucose-1-phosphate adenylyltransferase, mannose-1-phosphate guanylyltransferase, and the eukaryotic translation initiation factor eIF-2B subunits, epsilon and gamma. Most members of this family contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold, followed by a LbH fold domain with at least 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). eIF-2B epsilon contains an additional domain of unknown function at the C-terminus. Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Probab=98.95 E-value=6.4e-10 Score=57.89 Aligned_cols=29 Identities=14% Similarity=0.279 Sum_probs=10.7 Q ss_pred CCCCEECCCCEECCCCCCCCCCCCCCCCCE Q ss_conf 288399798154575432311222222210 Q gi|254781223|r 14 IDDARVSGNASVSRFAQVKSNAEVSDNTYV 43 (110) Q Consensus 14 ~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i 43 (110) .++++|++++.|.+ +.|++++.+++++.+ T Consensus 3 G~~~~Ig~~~~I~~-svIG~~c~Ig~~~~I 31 (79) T cd03356 3 GESTVIGENAIIKN-SVIGDNVRIGDGVTI 31 (79) T ss_pred CCCCEECCCCEEEC-CEECCCCEECCCCEE T ss_conf 89799999999959-999999999999699 No 87 >cd04652 LbH_eIF2B_gamma_C eIF-2B gamma subunit, C-terminal Left-handed parallel beta-Helix (LbH) domain: eIF-2B is a eukaryotic translation initiator, a guanine nucleotide exchange factor (GEF) composed of five different subunits (alpha, beta, gamma, delta and epsilon). eIF2B is important for regenerating GTP-bound eIF2 during the initiation process. This event is obligatory for eIF2 to bind initiator methionyl-tRNA, forming the ternary initiation complex. The eIF-2B gamma subunit contains an N-terminal domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH domain with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The epsilon and gamma subunits form the catalytic subcomplex of eIF-2B, which binds eIF2 and catalyzes guanine nucleotide exchange. Probab=98.95 E-value=7e-10 Score=57.71 Aligned_cols=78 Identities=13% Similarity=0.216 Sum_probs=31.1 Q ss_pred CCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECC Q ss_conf 28839979815457543231122222221001135520221111222110112222211011111111243220139749 Q gi|254781223|r 14 IDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRG 93 (110) Q Consensus 14 ~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~ 93 (110) .++++|++++.| +++.|+.++.+++++.+. ++.+. +++.+..++.+ .++.+++++.|+.++.+ .+|.|++ T Consensus 3 G~~~~Ig~~~~I-~~SvIG~nc~Ig~~~~i~-~sii~------~~v~I~~~~~i-~~sIIg~~~~Ig~~~~l-~~~vig~ 72 (81) T cd04652 3 GENTQVGEKTSI-KRSVIGANCKIGKRVKIT-NCVIM------DNVTIEDGCTL-ENCIIGNGAVIGEKCKL-KDCLVGS 72 (81) T ss_pred CCCCEECCCCEE-ECCEECCCCEECCCCEEE-CCEEE------CCCEECCCCEE-ECCEEECCCEECCCCEE-ECCEECC T ss_conf 798899999989-679997999999996896-89996------98798999899-74999097999999999-3739889 Q ss_pred CEEECCCC Q ss_conf 84998888 Q gi|254781223|r 94 NAVVGGDT 101 (110) Q Consensus 94 ~~~I~~~~ 101 (110) ++.|.+++ T Consensus 73 ~~~V~~gt 80 (81) T cd04652 73 GYRVEAGT 80 (81) T ss_pred CCEECCCC T ss_conf 98989898 No 88 >PRK11830 dapD 2,3,4,5-tetrahydropyridine-2,6-carboxylate N-succinyltransferase; Provisional Probab=98.93 E-value=1.1e-09 Score=56.86 Aligned_cols=96 Identities=21% Similarity=0.249 Sum_probs=35.4 Q ss_pred EEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC--CCCCCCCCCC Q ss_conf 09928839979815457543231122222221001135520221111222110112222211011111--1112432201 Q gi|254781223|r 11 ATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAF--VIGFTVISGN 88 (110) Q Consensus 11 a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~--i~~~~~I~~~ 88 (110) .++-|++.+...++|.+++++. .+++.-++.+...+.+...++++.+++++.++.+++.+.|++-.. -...+.|.|+ T Consensus 97 vRvVP~a~vR~Gayi~~~vVlM-Ps~VNiGAyv~~gtmiDtwatvGScAqiGknvhls~g~gigGvleP~~a~p~iied~ 175 (265) T PRK11830 97 VRVVPGAVVRRGAYIAKNVVLM-PSYVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSGGVGIGGVLEPLQANPVIIEDN 175 (265) T ss_pred EEECCCCEEEECEEECCCCEEE-EEEEEEEEEECCCCEEECCCCEECHHEECCCEEECCCCEEEEEECCCCCCCEEEECC T ss_conf 0887885376030545995886-417776039848954804762411532378336878741426524766798798368 Q ss_pred EEECCCEEECCCCEECCCE Q ss_conf 3974984998888996980 Q gi|254781223|r 89 ARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 89 ~~I~~~~~I~~~~~I~~~~ 107 (110) |.||.++.|..++.|+.++ T Consensus 176 ~fiGa~~~v~eGv~v~~~a 194 (265) T PRK11830 176 CFIGARSEVVEGVIVEEGA 194 (265) T ss_pred CEECCCCEEEEEEEEECCC T ss_conf 2873574796139994586 No 89 >TIGR02287 PaaY phenylacetic acid degradation protein PaaY; InterPro: IPR011974 Members of this family are located next to other genes organised into apparent operons for phenylacetic acid degradation . PaaY is located near the end of these gene clusters and often next to PaaX, a transcriptional regulator.. Probab=98.92 E-value=8.2e-10 Score=57.43 Aligned_cols=90 Identities=24% Similarity=0.254 Sum_probs=48.8 Q ss_pred ECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECC---CCCCCCCCCCCCCCCCC----CCCCCCCCCCCCCCCCCCCEEE Q ss_conf 9798154575432311222222210011355202---21111222110112222----2110111111112432201397 Q gi|254781223|r 19 VSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGY---AKVSGNASVGGNAIVRD----TAEVGGDAFVIGFTVISGNARV 91 (110) Q Consensus 19 I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~---~~i~~~~~i~~~~~~~~----~~~i~~~~~i~~~~~I~~~~~I 91 (110) +-+.++++|.|++=+|+.++..+.+..++.+.++ -.+.+.+.+-+++..++ .+.+++|-+||..+.++ .|+| T Consensus 11 Vhp~ayVHP~AVLiGDVi~G~rcYvgPlASLRGDFGrIvl~eGAN~QD~CVMHGFPg~DTvVeenGHvGHgAiLH-gC~v 89 (193) T TIGR02287 11 VHPEAYVHPDAVLIGDVIVGKRCYVGPLASLRGDFGRIVLKEGANVQDTCVMHGFPGQDTVVEENGHVGHGAILH-GCRV 89 (193) T ss_pred CCCCCCCCCCEEEEEEEEECCCCEECCCCCCCCCCCCEEEECCCCCCCCCEECCCCCCCEEEECCCCCCCCEEEC-CCEE T ss_conf 698773468637862178879524676631357757257534886146424367299886760688233212351-5463 Q ss_pred CCCEEECCCCEECCCEEE Q ss_conf 498499888899698097 Q gi|254781223|r 92 RGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 92 ~~~~~I~~~~~I~~~~~i 109 (110) +.|+.||.|+.|++.++| T Consensus 90 grnaLvGMNAVVMDgAvi 107 (193) T TIGR02287 90 GRNALVGMNAVVMDGAVI 107 (193) T ss_pred CCCCEECCCEEEECCCEE T ss_conf 252311165066166466 No 90 >PRK10502 putative colanic acid biosynthesis acetyltransferase WcaF; Provisional Probab=98.92 E-value=4e-09 Score=54.55 Aligned_cols=100 Identities=8% Similarity=0.088 Sum_probs=38.2 Q ss_pred EECCCEEECCCCEE--CCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCC---CCCCCCCCCC Q ss_conf 98897099288399--798154575432311222222210011355202211112221101122222---1101111111 Q gi|254781223|r 6 VVRDCATVIDDARV--SGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDT---AEVGGDAFVI 80 (110) Q Consensus 6 ~I~~~a~I~~~~~I--~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~---~~i~~~~~i~ 80 (110) .|++++.|++.+.| .-+..|++++.|++++.+... ....++.++.++.++.+.....-... -.+.....|+ T Consensus 54 kIG~~v~I~p~v~I~~PwnL~IGd~~~Ig~~v~i~~~----~~I~IG~~v~Isq~v~l~t~sHd~~~~~~~l~~~pI~Ig 129 (179) T PRK10502 54 KIGKGVVIRPSVRITYPWKLTIGDYAWIGDDVVLYTL----GEITIGAHCVISQKSYLCTGSHDYSSPHFDLNAAPIVIG 129 (179) T ss_pred EECCCCEECCCCEEEECCCEEECCCCEECCCCEEEEC----CCCEECCCEEECCCCEEECCCCCCCCCCCCCCCCCCEEC T ss_conf 6799988999657950430798896578898189848----961798841277971896388997676667324885858 Q ss_pred CCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 12432201397498499888899698097 Q gi|254781223|r 81 GFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 81 ~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) +++-|+.++.|.+++.|++++.|..+++| T Consensus 130 d~~wIga~a~I~pGv~Ig~gavigA~SvV 158 (179) T PRK10502 130 EGCWLATDVFVAPGVTIGDGAVVGARSSV 158 (179) T ss_pred CCEEECCCCEEECCCEECCCCEECCCCEE T ss_conf 99699999699088999999799549989 No 91 >TIGR01172 cysE serine O-acetyltransferase; InterPro: IPR005881 The biosynthesis of L-cysteine is the predominant way by which inorganic sulphur is incorporated into organic compounds. In this process, the most abundant utilizable source of sulphur, inorganic sulphate, is taken up and reduced to sulphide. Sulphide is used to produce L-cysteine, which serves for protein synthesis or the production of other sulphur-containing organic compounds. Two routes for cysteine biosynthesis in nature have been documented. Serine transacetylase (2.3.1.30 from EC) catalyzes steps in pathway I, the activation of L-serine by acetyl-coenzyme A, yielding O-acetyl-L-serine.; GO: 0009001 serine O-acetyltransferase activity, 0006535 cysteine biosynthetic process from serine, 0005737 cytoplasm. Probab=98.91 E-value=1.4e-09 Score=56.43 Aligned_cols=86 Identities=17% Similarity=0.295 Sum_probs=54.8 Q ss_pred CCEECCCCCCCCCCCCCC--CCCEEEEEEEECCCCCCCCCCCCCCCCCCC---CCCCCCCCCCCCCCCCCCCEEECCCEE Q ss_conf 815457543231122222--221001135520221111222110112222---211011111111243220139749849 Q gi|254781223|r 22 NASVSRFAQVKSNAEVSD--NTYVRDNAKVGGYAKVSGNASVGGNAIVRD---TAEVGGDAFVIGFTVISGNARVRGNAV 96 (110) Q Consensus 22 n~~I~~~~~i~~~~~i~~--~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~---~~~i~~~~~i~~~~~I~~~~~I~~~~~ 96 (110) ...|+|.|.|+....|.. ..++++-+.+++++.|+-.+++++.....+ +-+++++.+|+.+++|-+|.+||+|+. T Consensus 61 gidIHPgA~IG~g~fIDHg~GvVIGETa~iGddv~iyhGVTLGGtgk~~G~kRHPt~~~gV~iGAGAKvLG~I~vG~~Ak 140 (163) T TIGR01172 61 GIDIHPGARIGRGVFIDHGTGVVIGETAEIGDDVTIYHGVTLGGTGKEKGVKRHPTIGEGVVIGAGAKVLGNIEVGENAK 140 (163) T ss_pred CCCCCCCCCCCCEEEEECCCCEEEEEEEEECCCEEEEECEEECCCCCCCCCCCCCCCCCCEEEEECCEEECCEEECCCCE T ss_conf 23217643007730777560368500368738837863212067888878898882568717840782403766779977 Q ss_pred ECCCCEECCCE Q ss_conf 98888996980 Q gi|254781223|r 97 VGGDTVVEGDT 107 (110) Q Consensus 97 I~~~~~I~~~~ 107 (110) ||+|+.|-+|+ T Consensus 141 iGAnsVVl~dV 151 (163) T TIGR01172 141 IGANSVVLKDV 151 (163) T ss_pred ECCCEEECCCC T ss_conf 72332673537 No 92 >PRK10092 maltose O-acetyltransferase; Provisional Probab=98.91 E-value=1.3e-09 Score=56.56 Aligned_cols=84 Identities=15% Similarity=0.232 Sum_probs=33.6 Q ss_pred CCCEEECCCEEECC--CCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 68769889709928--8399798154575432311222222210011355202211112221101122222110111111 Q gi|254781223|r 2 YDNAVVRDCATVID--DARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFV 79 (110) Q Consensus 2 ~~n~~I~~~a~I~~--~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i 79 (110) .+|+++..++.+.+ .+.||+++.|+|++.|+...+-.+...-......+....++++++++.++.+...+.|++++.| T Consensus 77 G~~~~iN~~~~ilD~~~I~IG~~v~iGpnv~i~t~~Hp~d~~~R~~~~~~~~pi~Ig~~vwig~~~~i~pGv~IG~~~vi 156 (183) T PRK10092 77 GNNFYANFDCVMLDVCPIHIGDNCMLAPGVHIYTATHPLDPVARNSGAELGKPVTIGNNVWIGGRAVINPGVTIGDNVVV 156 (183) T ss_pred CCCEEEECCEEEEECCCEEECCCEEECCCCEEECCCCCCCHHHHHCCCEECCCEEECCCCEECCCCEEECCCEECCCCEE T ss_conf 78769927849994787399998058499789859998898895258631687798998688899899079899999899 Q ss_pred CCCCCC Q ss_conf 112432 Q gi|254781223|r 80 IGFTVI 85 (110) Q Consensus 80 ~~~~~I 85 (110) +.++.+ T Consensus 157 gagsvV 162 (183) T PRK10092 157 ASGAVV 162 (183) T ss_pred CCCCEE T ss_conf 759889 No 93 >TIGR00965 dapD 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase; InterPro: IPR005664 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . The lysine/diaminopimelic acid branch of the aspartate pathway produces the essential amino acid lysine via the intermediate meso-diaminopimelic acid (meso-DAP), which is also a vital cell wall component in Gram-negative bacteria . The production of dihydropicolinate from aspartate-semialdehyde controls flux into the lysine/diaminopimelic acid pathway. Three variants of this pathway exist, differing in how tetrahydropicolinate (formed by reduction of dihydropicolinate) is metabolised to meso-DAP. One variant, the most commonly found one in archaea and bacteria, uses primarily succinyl intermediates, while a second variant, found only in Bacillus, utilises primarily acetyl intermediates. In the third variant, found in some Gram-positive bacteria, a dehydrogenase converts tetrahydropicolinate directly to meso-DAP. In all variants meso-DAP is subsequently converted to lysine by a decarboxylase, or, in Gram-negative bacteria, assimilated into the cell wall. Evidence exists that a fourth, currently unknown, variant of this pathway may function in plants . 2,3,4,5-tetrahydropyridine-2-carboxylate N-succinyltransferase (also known as tetrahydrodipicolinate N-succinyltransferase or DapD) is part of the succinyl route of of lysine/DAP biosynthesis. The DapD protein is a homotrimer is a trimeric enzyme with each monomer composed of three domain: an N-terminal helical domain, a distinctive left-handed parallel beta-helix (LBH) domain, and a predominantly beta C-terminal domain , . The LBH structure is encoded by an imperfect tandem-repeated hexapeptide sequence. Each trimer contains three independent active sites, always occuring at the boundary of two subunits, and formed by residues from one N-terminal domain, one C-terminal domain and two adjacent LBH domains.; GO: 0016757 transferase activity transferring glycosyl groups, 0009089 lysine biosynthetic process via diaminopimelate. Probab=98.90 E-value=1.2e-09 Score=56.80 Aligned_cols=100 Identities=19% Similarity=0.198 Sum_probs=71.2 Q ss_pred CCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCC--------CCCCCCCCCCCC Q ss_conf 970992883997981545754323112222222100113552022111122211011222--------221101111111 Q gi|254781223|r 9 DCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVR--------DTAEVGGDAFVI 80 (110) Q Consensus 9 ~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~--------~~~~i~~~~~i~ 80 (110) |.|.+..++.|..|+.++|. .+.=++++.+++.+...+.++..+.++.|+++.+.+-+. ..+.|.+||+|+ T Consensus 108 PgA~vR~G~fiAkNvVlMPS-yvNIGAYvDEGtMvDTWATvGSCAQIGKNvHlSGG~GIGGVLEPLQA~P~IIEDNCFiG 186 (275) T TIGR00965 108 PGAIVRRGTFIAKNVVLMPS-YVNIGAYVDEGTMVDTWATVGSCAQIGKNVHLSGGVGIGGVLEPLQAKPTIIEDNCFIG 186 (275) T ss_pred CHHHEECCCEEEEEEEECCC-EEECCCEEECCCEECCHHHHCCHHHHCCCEEEECCCCCCCCCCCCCCCCEEEECCCEEC T ss_conf 74360057357752377555-21013675168601001223003220662575178331652463357875882476004 Q ss_pred CCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 12432201397498499888899698097 Q gi|254781223|r 81 GFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 81 ~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) .-+.|-+.+.+.++++|..+..|...|-| T Consensus 187 ARSEi~EGVIVEEGSVISMGVfIG~STkI 215 (275) T TIGR00965 187 ARSEIVEGVIVEEGSVISMGVFIGQSTKI 215 (275) T ss_pred CCEEEEEEEEECCCCEEEEEEEECCCCEE T ss_conf 61135620797168588840685560168 No 94 >COG1045 CysE Serine acetyltransferase [Amino acid transport and metabolism] Probab=98.89 E-value=2.5e-09 Score=55.40 Aligned_cols=38 Identities=26% Similarity=0.406 Sum_probs=26.2 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 21101111111124322013974984998888996980 Q gi|254781223|r 70 TAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 70 ~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) .-.|++++.|+.++.|-++.+||+|+.||+++.|-+|. T Consensus 119 hPtIg~~V~IGagAkILG~I~IGd~a~IGA~sVVlkdV 156 (194) T COG1045 119 HPTIGNGVYIGAGAKILGNIEIGDNAKIGAGSVVLKDV 156 (194) T ss_pred CCCCCCCEEECCCCEEECCEEECCCCEECCCCEECCCC T ss_conf 98317885989998897166887897888786681578 No 95 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=98.89 E-value=3.9e-09 Score=54.60 Aligned_cols=91 Identities=12% Similarity=0.168 Sum_probs=52.2 Q ss_pred EEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 09928839979815457543231122222221001135520221111222110112222211011111111243220139 Q gi|254781223|r 11 ATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNAR 90 (110) Q Consensus 11 a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~ 90 (110) +.+.+.+.++++|.|++++.|++++.+++++.+.+++.+ .++.+.+++.+...+.+. ++.++.++.|+.+..+++ +. T Consensus 256 ~~i~gp~~ig~~~~i~~~~~i~~~~~ig~~~~I~~~~~i-~~Sii~~~~~i~~~~~i~-~sIi~~~~~ig~~~~i~d-~~ 332 (358) T COG1208 256 AYIIGPVVIGPGAKIGPGALIGPYTVIGEGVTIGNGVEI-KNSIIMDNVVIGHGSYIG-DSIIGENCKIGASLIIGD-VV 332 (358) T ss_pred EEEECCEEECCCCEECCCCEECCCCEECCCCEECCCCEE-EEEEEECCCEECCCCEEC-CEEECCCCEECCCCEEEE-EE T ss_conf 388589498269789888486898789999888998778-754887787988888882-069818859998615701-58 Q ss_pred ECCCEEECCCCEEC Q ss_conf 74984998888996 Q gi|254781223|r 91 VRGNAVVGGDTVVE 104 (110) Q Consensus 91 I~~~~~I~~~~~I~ 104 (110) ++.++.+..+..+. T Consensus 333 ~g~~~~i~~g~~~~ 346 (358) T COG1208 333 IGINSEILPGVVVG 346 (358) T ss_pred ECCCEEECCCEEEC T ss_conf 63660862761847 No 96 >PRK09677 putative lipopolysaccharide biosynthesis O-acetyl transferase WbbJ; Provisional Probab=98.85 E-value=7.3e-09 Score=53.47 Aligned_cols=30 Identities=17% Similarity=0.234 Sum_probs=10.3 Q ss_pred CCCCCCCCCCEEECCCEEECCCCEECCCEE Q ss_conf 111243220139749849988889969809 Q gi|254781223|r 79 VIGFTVISGNARVRGNAVVGGDTVVEGDTV 108 (110) Q Consensus 79 i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~ 108 (110) |++++-|+.++.|.+++.|++++.|..+++ T Consensus 133 Ig~~~wIg~~~~Il~Gv~Ig~~~vvgA~Sv 162 (192) T PRK09677 133 IGQRVWIGENVTILPGVSIGNGCIVGANSV 162 (192) T ss_pred ECCCCEECCCCEEECCCEECCCCEECCCCE T ss_conf 899989999999919969999989916999 No 97 >COG0110 WbbJ Acetyltransferase (isoleucine patch superfamily) [General function prediction only] Probab=98.84 E-value=8.7e-09 Score=53.15 Aligned_cols=103 Identities=17% Similarity=0.192 Sum_probs=69.1 Q ss_pred EEECCCEEE--C-CCCEECCCCEECCCCCC--CCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCC-CCCCCCCCCCCCCC Q ss_conf 698897099--2-88399798154575432--311222222210011355202211112221101-12222211011111 Q gi|254781223|r 5 AVVRDCATV--I-DDARVSGNASVSRFAQV--KSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGN-AIVRDTAEVGGDAF 78 (110) Q Consensus 5 ~~I~~~a~I--~-~~~~I~~n~~I~~~~~i--~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~-~~~~~~~~i~~~~~ 78 (110) +.+.+.... . .++.+|.+|.++.++.+ .....+++++.+..++.+....+.......... ......+.|+++++ T Consensus 53 ~~i~~~~~~~~~~~~~~iG~~~~i~~~~~~~~~~~i~ig~~~~i~~~v~i~~~~h~~~~~~~~~~~~~~~~~v~IG~~vw 132 (190) T COG0110 53 AVIRPPVRIDLGEKNLTIGDLCFIGVNVVILVGEGITIGDNVVVGPNVTIYTNSHPGDFVTANIGALVGAGPVTIGEDVW 132 (190) T ss_pred EEECCCCEEEECCCCEEECCEEEEECCCEEEECCCCEECCCCEECCCEEECCCCCCCCCCCCCCCCCCCCCCCEECCCCE T ss_conf 58613502441432106463467702544983488588998589599498268765652200014323578978899858 Q ss_pred CCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 11124322013974984998888996980 Q gi|254781223|r 79 VIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 79 i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) |+.++.|..+++|++++.|+.++.+.+|. T Consensus 133 IG~~a~IlpGV~IG~gavigagsVVtkdv 161 (190) T COG0110 133 IGAGAVILPGVTIGEGAVIGAGSVVTKDV 161 (190) T ss_pred ECCCCEECCCEEECCCEEEECCEEEECCC T ss_conf 77897897985989981997885895777 No 98 >cd05825 LbH_wcaF_like wcaF-like: This group is composed of the protein product of the E. coli wcaF gene and similar proteins. WcaF is part of the gene cluster responsible for the biosynthesis of the extracellular polysaccharide colanic acid. The wcaF protein is predicted to contain a left-handed parallel beta-helix (LbH) domain encoded by imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity. Many are trimeric in their active forms. Probab=98.80 E-value=6.6e-09 Score=53.66 Aligned_cols=85 Identities=20% Similarity=0.173 Sum_probs=44.8 Q ss_pred CCCCEEECCCEEEC--CCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 96876988970992--8839979815457543231122222221001135520221111222110112222211011111 Q gi|254781223|r 1 MYDNAVVRDCATVI--DDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAF 78 (110) Q Consensus 1 i~~n~~I~~~a~I~--~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~ 78 (110) |.+|++|.+++.|. ..++|+++|.|++++.|....+....... ....+...+.+++.++.++. T Consensus 6 IG~~~~Ig~~~~I~~~~~i~IG~~~~i~~~v~I~~~~h~~~~~~~---------------~~~~~~v~Ig~~v~IG~~~~ 70 (107) T cd05825 6 IGDNSWIGEGVWIYNLAPVTIGSDACISQGAYLCTGSHDYRSPAF---------------PLITAPIVIGDGAWVAAEAF 70 (107) T ss_pred ECCCCEECCCCEEEECCCEEECCCCEECCCCEECCCCCCCCCCCC---------------CEECCCEEECCCCEECCCCE T ss_conf 899868999989953885799997399599597078888767665---------------31639979938858749878 Q ss_pred CCCCCCCCCCEEECCCEEECCC Q ss_conf 1112432201397498499888 Q gi|254781223|r 79 VIGFTVISGNARVRGNAVVGGD 100 (110) Q Consensus 79 i~~~~~I~~~~~I~~~~~I~~~ 100 (110) |..+++|++++.|+.++.+..+ T Consensus 71 I~~gv~Ig~~svVgagSvV~kd 92 (107) T cd05825 71 VGPGVTIGEGAVVGARSVVVRD 92 (107) T ss_pred EECCCEECCCCEEECCCEECCC T ss_conf 9699191899799489889038 No 99 >cd03357 LbH_MAT_GAT Maltose O-acetyltransferase (MAT) and Galactoside O-acetyltransferase (GAT): MAT and GAT catalyze the CoA-dependent acetylation of the 6-hydroxyl group of their respective sugar substrates. MAT acetylates maltose and glucose exclusively at the C6 position of the nonreducing end glucosyl moiety. GAT specifically acetylates galactopyranosides. Furthermore, MAT shows higher affinity toward artificial substrates containing an alkyl or hydrophobic chain as well as a glucosyl unit. Active MAT and GAT are homotrimers, with each subunit consisting of an N-terminal alpha-helical region and a C-terminal left-handed parallel alpha-helix (LbH) subdomain with 6 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Probab=98.72 E-value=2.4e-08 Score=51.33 Aligned_cols=36 Identities=22% Similarity=0.364 Sum_probs=15.8 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCC Q ss_conf 122222110111111112432201397498499888 Q gi|254781223|r 65 AIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGD 100 (110) Q Consensus 65 ~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~ 100 (110) ..+.+++.|+.++.|..+++|++++.|+.++++-.+ T Consensus 119 I~Igd~vWIG~~~~IlpGVtIG~~~vIgAGsVVtkd 154 (169) T cd03357 119 ITIGDNVWIGGGVIILPGVTIGDNSVIGAGSVVTKD 154 (169) T ss_pred EEECCCCEECCCCEEECCCEECCCCEECCCCEECCC T ss_conf 698974362899789099799999999889879746 No 100 >PRK10191 putative colanic acid biosynthesis acetyltransferase WcaB; Provisional Probab=98.71 E-value=3.2e-08 Score=50.81 Aligned_cols=86 Identities=16% Similarity=0.221 Sum_probs=51.7 Q ss_pred EEECCCCEECCCCEECCC--CCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 099288399798154575--432311222222210011355202211112221101122222110111111112432201 Q gi|254781223|r 11 ATVIDDARVSGNASVSRF--AQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGN 88 (110) Q Consensus 11 a~I~~~~~I~~n~~I~~~--~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~ 88 (110) ..|.+.++||++.+|... .+|++.+.+++++.+..++.++... ..-...-.++++++++.++.|-++ T Consensus 42 ieI~p~a~IG~gl~I~H~~GiVIg~~~~IG~n~~I~q~VTiG~~~-----------~~~~~~P~Ig~~V~Igaga~IlG~ 110 (146) T PRK10191 42 YEIQAAATIGRRFTIHHGYAVVINKNVVAGDDFTIRHGVTIGNRG-----------ADNMACPHIGNGVELGANVIILGD 110 (146) T ss_pred EEECCCCEECCCEEEECCEEEEECCCEEECCCEEEECCEEECCCC-----------CCCCCCCEECCCEEEECCCEEECC T ss_conf 950899887898798378258999971989980994887888867-----------877898987999599289999648 Q ss_pred EEECCCEEECCCCEECCCE Q ss_conf 3974984998888996980 Q gi|254781223|r 89 ARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 89 ~~I~~~~~I~~~~~I~~~~ 107 (110) +.||+|+.|++++.+.+|. T Consensus 111 i~IG~~~~IGA~svV~~dv 129 (146) T PRK10191 111 ITLGNNVTVGAGSVVLDSV 129 (146) T ss_pred CEECCCCEECCCCEECCCC T ss_conf 5999999999895897035 No 101 >PRK09527 lacA galactoside O-acetyltransferase; Reviewed Probab=98.70 E-value=1.4e-08 Score=52.25 Aligned_cols=30 Identities=13% Similarity=0.157 Sum_probs=13.0 Q ss_pred CCEEECCCEEECC--CCEECCCCEECCCCCCC Q ss_conf 8769889709928--83997981545754323 Q gi|254781223|r 3 DNAVVRDCATVID--DARVSGNASVSRFAQVK 32 (110) Q Consensus 3 ~n~~I~~~a~I~~--~~~I~~n~~I~~~~~i~ 32 (110) +|+++..++.+.+ .+.||+|+.|+|++.|. T Consensus 80 ~~~fiN~n~~ilD~~~I~IGd~v~iGPnV~i~ 111 (203) T PRK09527 80 RNFYANFNLTIVDDYTVTIGDNVLIAPNVTLS 111 (203) T ss_pred CCEEECCCCEEEECCCEEECCCEEECCCCEEE T ss_conf 98798688389957637998972886997893 No 102 >COG1045 CysE Serine acetyltransferase [Amino acid transport and metabolism] Probab=98.65 E-value=5.7e-08 Score=49.74 Aligned_cols=85 Identities=20% Similarity=0.310 Sum_probs=41.4 Q ss_pred EECCCCEECCCCEECCC--CCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 99288399798154575--4323112222222100113552022111122211011222221101111111124322013 Q gi|254781223|r 12 TVIDDARVSGNASVSRF--AQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNA 89 (110) Q Consensus 12 ~I~~~~~I~~n~~I~~~--~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~ 89 (110) .|-+.++||...+|... .+|++.+.+++++.+...+.+++...-.+ ..+..+..++.|+.++.|.++..|++++ T Consensus 69 eIhp~A~IG~g~fIdHg~GvVIgeta~IGddv~I~~gVTLGgtg~~~g----~RhPtIg~~V~IGagAkILG~I~IGd~a 144 (194) T COG1045 69 EIHPGAKIGRGLFIDHGTGVVIGETAVIGDDVTIYHGVTLGGTGKESG----KRHPTIGNGVYIGAGAKILGNIEIGDNA 144 (194) T ss_pred EECCCCEECCCEEECCCCEEEECCEEEECCCEEEEECEEECCCCCCCC----CCCCCCCCCEEECCCCEEECCEEECCCC T ss_conf 347777277856873775589764249879728970617668878778----7798317885989998897166887897 Q ss_pred EECCCEEECCC Q ss_conf 97498499888 Q gi|254781223|r 90 RVRGNAVVGGD 100 (110) Q Consensus 90 ~I~~~~~I~~~ 100 (110) +||.++++-.+ T Consensus 145 ~IGA~sVVlkd 155 (194) T COG1045 145 KIGAGSVVLKD 155 (194) T ss_pred EECCCCEECCC T ss_conf 88878668157 No 103 >TIGR01172 cysE serine O-acetyltransferase; InterPro: IPR005881 The biosynthesis of L-cysteine is the predominant way by which inorganic sulphur is incorporated into organic compounds. In this process, the most abundant utilizable source of sulphur, inorganic sulphate, is taken up and reduced to sulphide. Sulphide is used to produce L-cysteine, which serves for protein synthesis or the production of other sulphur-containing organic compounds. Two routes for cysteine biosynthesis in nature have been documented. Serine transacetylase (2.3.1.30 from EC) catalyzes steps in pathway I, the activation of L-serine by acetyl-coenzyme A, yielding O-acetyl-L-serine.; GO: 0009001 serine O-acetyltransferase activity, 0006535 cysteine biosynthetic process from serine, 0005737 cytoplasm. Probab=98.63 E-value=4.2e-08 Score=50.28 Aligned_cols=87 Identities=23% Similarity=0.351 Sum_probs=55.4 Q ss_pred EEECCCCEECCCCEECC--CCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 09928839979815457--5432311222222210011355202211112221101122222110111111112432201 Q gi|254781223|r 11 ATVIDDARVSGNASVSR--FAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGN 88 (110) Q Consensus 11 a~I~~~~~I~~n~~I~~--~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~ 88 (110) ..|=|.++||...+|.. +.+|++-+.+++++.++..+++++.....+ ...|..+.+++.|+-++.|-++..|++| T Consensus 62 idIHPgA~IG~g~fIDHg~GvVIGETa~iGddv~iyhGVTLGGtgk~~G---~kRHPt~~~gV~iGAGAKvLG~I~vG~~ 138 (163) T TIGR01172 62 IDIHPGARIGRGVFIDHGTGVVIGETAEIGDDVTIYHGVTLGGTGKEKG---VKRHPTIGEGVVIGAGAKVLGNIEVGEN 138 (163) T ss_pred CCCCCCCCCCCEEEEECCCCEEEEEEEEECCCEEEEECEEECCCCCCCC---CCCCCCCCCCEEEEECCEEECCEEECCC T ss_conf 3217643007730777560368500368738837863212067888878---8988825687178407824037667799 Q ss_pred EEECCCEEECCC Q ss_conf 397498499888 Q gi|254781223|r 89 ARVRGNAVVGGD 100 (110) Q Consensus 89 ~~I~~~~~I~~~ 100 (110) ++||.|+++-.+ T Consensus 139 AkiGAnsVVl~d 150 (163) T TIGR01172 139 AKIGANSVVLKD 150 (163) T ss_pred CEECCCEEECCC T ss_conf 777233267353 No 104 >cd04649 LbH_THP_succinylT_putative Putative 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (THP succinyltransferase), C-terminal left-handed parallel alpha-helix (LbH) domain: This group is composed of mostly uncharacterized proteins containing an N-terminal domain of unknown function and a C-terminal LbH domain with similarity to THP succinyltransferase LbH. THP succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is trimeric and displays the left-handed parallel alpha-helix (LbH) structural motif encoded by the hexapeptide repeat motif. Probab=98.60 E-value=1.8e-07 Score=47.68 Aligned_cols=103 Identities=20% Similarity=0.199 Sum_probs=62.0 Q ss_pred EEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCC----EEEEEEEECCCCCCCCCCCCCCCCC--CCCCCCCCCCC Q ss_conf 69889709928839979815457543231122222221----0011355202211112221101122--22211011111 Q gi|254781223|r 5 AVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTY----VRDNAKVGGYAKVSGNASVGGNAIV--RDTAEVGGDAF 78 (110) Q Consensus 5 ~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~----i~~~~~i~~~~~i~~~~~i~~~~~~--~~~~~i~~~~~ 78 (110) +.|.|.+++.-.+++++++.+++...+.-++.....+. +...++++.++++++.+.+.+-..- .....|+++|+ T Consensus 2 vRi~~~~~VRlGAyl~~Gt~vM~egfVN~nAgt~g~~Miegrissgv~VG~~~diggGasi~G~Ls~g~~~pi~Ig~ncl 81 (147) T cd04649 2 VRIADADRVRLGAYLAEGTTVMHEGFVNFNAGTLGNCMVEGRISSGVIVGKGSDVGGGASIMGTLSGGGNNVISIGKRCL 81 (147) T ss_pred EEECCCCEEEECCEECCCCEECCCCEEEECCCCCCCCEEEEEEEECEEECCCCCCCCCCEEEEECCCCCCCCEEECCCEE T ss_conf 37447866873418879959862567987663066616876785156989983327861787871678823569887507 Q ss_pred CCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 1112432201397498499888899698097 Q gi|254781223|r 79 VIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 79 i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) ++.++.+ ...+++++.|..+..|..++.| T Consensus 82 iGAnsgi--Gi~lGd~cvv~aGlyi~~gTki 110 (147) T cd04649 82 LGANSGI--GISLGDNCIVEAGLYVTAGTKV 110 (147) T ss_pred ECCCCCC--CEEECCCCEEECCEEECCCCEE T ss_conf 6146545--8486898688244598399789 No 105 >PRK11132 cysE serine acetyltransferase; Provisional Probab=98.59 E-value=6.1e-08 Score=49.62 Aligned_cols=38 Identities=21% Similarity=0.226 Sum_probs=28.8 Q ss_pred CCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 21101111111124322013974984998888996980 Q gi|254781223|r 70 TAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 70 ~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +-.+++++.|+.++.|-+|..||+++.|+.++.|..|+ T Consensus 193 HP~i~~~v~igaga~iLG~i~iG~~a~igA~svVl~dv 230 (273) T PRK11132 193 HPKIREGVMIGAGAKILGNIEVGRGAKIGAGSVVLQPV 230 (273) T ss_pred CCCCCCCEEECCCCEEECCEEECCCCEECCCCEECCCC T ss_conf 98048884875488897185889998886786686437 No 106 >PRK02862 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.56 E-value=1e-07 Score=48.69 Aligned_cols=33 Identities=6% Similarity=0.164 Sum_probs=15.0 Q ss_pred CEECCCCEECC----CCCCCCCCCCCCCCCEEEEEEE Q ss_conf 39979815457----5432311222222210011355 Q gi|254781223|r 17 ARVSGNASVSR----FAQVKSNAEVSDNTYVRDNAKV 49 (110) Q Consensus 17 ~~I~~n~~I~~----~~~i~~~~~i~~~~~i~~~~~i 49 (110) ..|+++|.|.. ++.|+.++.+++++.+.+.... T Consensus 309 SlI~~GciI~g~~V~nSVlg~~v~I~~ga~V~nSIim 345 (429) T PRK02862 309 SIISEGCILKNCSIHHSVLGVRSRIESDCVLEDTLVM 345 (429) T ss_pred CEECCCEEEECCEEECCEECCCCEECCCCEEEEEEEE T ss_conf 5975976983888876788898589899899631898 No 107 >PRK02862 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.49 E-value=1.7e-07 Score=47.75 Aligned_cols=13 Identities=23% Similarity=0.274 Sum_probs=4.6 Q ss_pred EEECCCCEECCCE Q ss_conf 4998888996980 Q gi|254781223|r 95 AVVGGDTVVEGDT 107 (110) Q Consensus 95 ~~I~~~~~I~~~~ 107 (110) ++|..++.|..++ T Consensus 415 vvv~k~~~ip~gt 427 (429) T PRK02862 415 VVVVKNATIPDGT 427 (429) T ss_pred EEECCCCCCCCCC T ss_conf 9989998989998 No 108 >KOG1322 consensus Probab=98.49 E-value=1.8e-07 Score=47.63 Aligned_cols=91 Identities=16% Similarity=0.181 Sum_probs=49.3 Q ss_pred EECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98897099288399798154575432311222222210011355202211112221101122222110111111112432 Q gi|254781223|r 6 VVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVI 85 (110) Q Consensus 6 ~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I 85 (110) .+..++.+-+-+.+|++|.|++|+.|+..+.+.+++.+.....++ .+.+..++.+.. +.++.++.|+.++.| T Consensus 260 ~i~~nvlvd~~~~iG~~C~Ig~~vvIG~r~~i~~gV~l~~s~il~-------~~~~~~~s~i~s-~ivg~~~~IG~~~~i 331 (371) T KOG1322 260 KIVGNVLVDSIASIGENCSIGPNVVIGPRVRIEDGVRLQDSTILG-------ADYYETHSEISS-SIVGWNVPIGIWARI 331 (371) T ss_pred CCCCCEEECCCCCCCCCCEECCCCEECCCCEECCCEEEEEEEEEC-------CCEECHHHHHHH-HHCCCCCCCCCCEEE T ss_conf 512437621400138863788873677872862762887427972-------512101688876-432565200574697 Q ss_pred CCCEEECCCEEECCCCEEC Q ss_conf 2013974984998888996 Q gi|254781223|r 86 SGNARVRGNAVVGGDTVVE 104 (110) Q Consensus 86 ~~~~~I~~~~~I~~~~~I~ 104 (110) ..++.+|+|+.|.+--.+. T Consensus 332 d~~a~lG~nV~V~d~~~vn 350 (371) T KOG1322 332 DKNAVLGKNVIVADEDYVN 350 (371) T ss_pred ECCCEECCCEEEECCCCCC T ss_conf 1253753625994234545 No 109 >KOG3121 consensus Probab=98.47 E-value=2.2e-07 Score=47.29 Aligned_cols=99 Identities=12% Similarity=0.186 Sum_probs=60.1 Q ss_pred CEEECCCEEECC---CCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 769889709928---83997981545754323112222222100113552022111122211011222221101111111 Q gi|254781223|r 4 NAVVRDCATVID---DARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVI 80 (110) Q Consensus 4 n~~I~~~a~I~~---~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~ 80 (110) ...+...+.|.+ ++++|..|.+...+.|++..++.+.-...-...++++..+.+.+.+ ..++++...+++.++.|+ T Consensus 39 KtIv~~g~iIRGDLAnVr~GryCV~ksrsvIRPp~K~FSKg~affp~hiGdhVFieE~cVV-nAAqIgsyVh~GknaviG 117 (184) T KOG3121 39 KTIVEEGVIIRGDLANVRIGRYCVLKSRSVIRPPMKIFSKGPAFFPVHIGDHVFIEEECVV-NAAQIGSYVHLGKNAVIG 117 (184) T ss_pred CEEEEECCEEECCCCCCEECCEEEECCCCCCCCCHHHHCCCCEEEEEEECCEEEEECCEEE-EHHHHEEEEEECCCEEEC T ss_conf 2798608589512010247336896466543770677567862456652456887044186-266612036854640673 Q ss_pred CCCCCCCCEEECCCEEECCCCEE Q ss_conf 12432201397498499888899 Q gi|254781223|r 81 GFTVISGNARVRGNAVVGGDTVV 103 (110) Q Consensus 81 ~~~~I~~~~~I~~~~~I~~~~~I 103 (110) ..+.+.+=|+|-+++++-..+.+ T Consensus 118 rrCVlkdCc~ild~tVlPpet~v 140 (184) T KOG3121 118 RRCVLKDCCRILDDTVLPPETLV 140 (184) T ss_pred CCEEHHHHEECCCCCCCCCCCCC T ss_conf 75083341041589525820136 No 110 >TIGR00965 dapD 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase; InterPro: IPR005664 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . The lysine/diaminopimelic acid branch of the aspartate pathway produces the essential amino acid lysine via the intermediate meso-diaminopimelic acid (meso-DAP), which is also a vital cell wall component in Gram-negative bacteria . The production of dihydropicolinate from aspartate-semialdehyde controls flux into the lysine/diaminopimelic acid pathway. Three variants of this pathway exist, differing in how tetrahydropicolinate (formed by reduction of dihydropicolinate) is metabolised to meso-DAP. One variant, the most commonly found one in archaea and bacteria, uses primarily succinyl intermediates, while a second variant, found only in Bacillus, utilises primarily acetyl intermediates. In the third variant, found in some Gram-positive bacteria, a dehydrogenase converts tetrahydropicolinate directly to meso-DAP. In all variants meso-DAP is subsequently converted to lysine by a decarboxylase, or, in Gram-negative bacteria, assimilated into the cell wall. Evidence exists that a fourth, currently unknown, variant of this pathway may function in plants . 2,3,4,5-tetrahydropyridine-2-carboxylate N-succinyltransferase (also known as tetrahydrodipicolinate N-succinyltransferase or DapD) is part of the succinyl route of of lysine/DAP biosynthesis. The DapD protein is a homotrimer is a trimeric enzyme with each monomer composed of three domain: an N-terminal helical domain, a distinctive left-handed parallel beta-helix (LBH) domain, and a predominantly beta C-terminal domain , . The LBH structure is encoded by an imperfect tandem-repeated hexapeptide sequence. Each trimer contains three independent active sites, always occuring at the boundary of two subunits, and formed by residues from one N-terminal domain, one C-terminal domain and two adjacent LBH domains.; GO: 0016757 transferase activity transferring glycosyl groups, 0009089 lysine biosynthetic process via diaminopimelate. Probab=98.45 E-value=1.4e-07 Score=48.17 Aligned_cols=85 Identities=18% Similarity=0.273 Sum_probs=41.1 Q ss_pred CCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCE Q ss_conf 83997981545754323112222222100113552022111122211011222221101111111124322013974984 Q gi|254781223|r 16 DARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNA 95 (110) Q Consensus 16 ~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~ 95 (110) ++.|-+.+.|.-.|+++..++|++|+.+.+.+-+++-.. . .+...+.+.+||.+|-.+.|-..+.+.+.+.|.-++ T Consensus 132 GAYvDEGtMvDTWATvGSCAQIGKNvHlSGG~GIGGVLE---P-LQA~P~IIEDNCFiGARSEi~EGVIVEEGSVISMGV 207 (275) T TIGR00965 132 GAYVDEGTMVDTWATVGSCAQIGKNVHLSGGVGIGGVLE---P-LQAKPTIIEDNCFIGARSEIVEGVIVEEGSVISMGV 207 (275) T ss_pred CCEEECCCEECCHHHHCCHHHHCCCEEEECCCCCCCCCC---C-CCCCCEEEECCCEECCCEEEEEEEEECCCCEEEEEE T ss_conf 367516860100122300322066257517833165246---3-357875882476004611356207971685888406 Q ss_pred EECCCCEEC Q ss_conf 998888996 Q gi|254781223|r 96 VVGGDTVVE 104 (110) Q Consensus 96 ~I~~~~~I~ 104 (110) .||.-+.|. T Consensus 208 fIG~STkI~ 216 (275) T TIGR00965 208 FIGQSTKIV 216 (275) T ss_pred EECCCCEEE T ss_conf 855601688 No 111 >PRK10191 putative colanic acid biosynthesis acetyltransferase WcaB; Provisional Probab=98.42 E-value=5.8e-07 Score=45.54 Aligned_cols=81 Identities=12% Similarity=0.144 Sum_probs=32.5 Q ss_pred CCCEEECCCEEEC--CCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 6876988970992--88399798154575432311222222210011355202211112221101122222110111111 Q gi|254781223|r 2 YDNAVVRDCATVI--DDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFV 79 (110) Q Consensus 2 ~~n~~I~~~a~I~--~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i 79 (110) +..+.|++...+. ....|++.+.|++|+.|+.++.++... ..-.....+++++.++.++.+.++..|++++.| T Consensus 45 ~p~a~IG~gl~I~H~~GiVIg~~~~IG~n~~I~q~VTiG~~~-----~~~~~~P~Ig~~V~Igaga~IlG~i~IG~~~~I 119 (146) T PRK10191 45 QAAATIGRRFTIHHGYAVVINKNVVAGDDFTIRHGVTIGNRG-----ADNMACPHIGNGVELGANVIILGDITLGNNVTV 119 (146) T ss_pred CCCCEECCCEEEECCEEEEECCCEEECCCEEEECCEEECCCC-----CCCCCCCEECCCEEEECCCEEECCCEECCCCEE T ss_conf 899887898798378258999971989980994887888867-----877898987999599289999648599999999 Q ss_pred CCCCCCCC Q ss_conf 11243220 Q gi|254781223|r 80 IGFTVISG 87 (110) Q Consensus 80 ~~~~~I~~ 87 (110) +.++.+-. T Consensus 120 GA~svV~~ 127 (146) T PRK10191 120 GAGSVVLD 127 (146) T ss_pred CCCCEECC T ss_conf 98958970 No 112 >cd03349 LbH_XAT Xenobiotic acyltransferase (XAT): The XAT class of hexapeptide acyltransferases is composed of a large number of microbial enzymes that catalyze the CoA-dependent acetylation of a variety of hydroxyl-bearing acceptors such as chloramphenicol and streptogramin, among others. Members of this class of enzymes include Enterococcus faecium streptogramin A acetyltransferase and Pseudomonas aeruginosa chloramphenicol acetyltransferase. They contain repeated copies of a six-residue hexapeptide repeat sequence motif (X-[STAV]-X-[LIV]-[GAED]-X) and adopt a left-handed parallel beta helix (LbH) structure. The active enzyme is a trimer with CoA and substrate binding sites at the interface of two separate LbH subunits. XATs are implicated in inactivating xenobiotics leading to xenobiotic resistance in patients. Probab=98.41 E-value=1.2e-06 Score=44.24 Aligned_cols=35 Identities=20% Similarity=0.317 Sum_probs=25.7 Q ss_pred CCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 11111112432201397498499888899698097 Q gi|254781223|r 75 GDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 75 ~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) ....|++++-|+.++.|-+++.|++++.|..+++| T Consensus 72 ~~i~Ig~~vwIG~n~~Il~GV~IG~gavIgAgSvV 106 (145) T cd03349 72 GDVIIGNDVWIGHGATILPGVTIGDGAVIAAGAVV 106 (145) T ss_pred CCEEECCCEEECCCCEEECCCEECCCCEEECCCEE T ss_conf 99798899699998889588096788689289789 No 113 >PRK03282 consensus Probab=98.38 E-value=5.8e-07 Score=45.55 Aligned_cols=50 Identities=22% Similarity=0.302 Sum_probs=20.5 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEC Q ss_conf 3552022111122211011222221101111111124322013974984998 Q gi|254781223|r 47 AKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVG 98 (110) Q Consensus 47 ~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~ 98 (110) +.++.++.+..++.+ .++.+.+++.|++++.+ .++.+..++.|++++.|| T Consensus 330 SvIg~~v~Ig~ga~I-~~SIIm~~~~Ig~~a~l-~~~Iidk~v~Ig~g~~Ig 379 (406) T PRK03282 330 SVLSPNVVVDSGAIV-EGSVLMPGVRIGRGAVV-RHAILDKNVVVPPGAMVG 379 (406) T ss_pred CEECCCCEECCCCEE-ECCEEECCCEECCCCEE-ECCEECCCCEECCCCEEC T ss_conf 997899899999899-70889096899999999-731989999999898999 No 114 >PRK00844 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.33 E-value=7.5e-07 Score=45.07 Aligned_cols=30 Identities=20% Similarity=0.052 Sum_probs=10.6 Q ss_pred CCCCCCCCCCCCCCCEEECCCEEECCCCEEC Q ss_conf 1111111124322013974984998888996 Q gi|254781223|r 74 GGDAFVIGFTVISGNARVRGNAVVGGDTVVE 104 (110) Q Consensus 74 ~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~ 104 (110) .+++.|+.++.|. ++.|++|+.|++++.|+ T Consensus 352 ~~~~~Ig~~a~I~-~sIidk~v~Ig~gt~I~ 381 (409) T PRK00844 352 MDGVRIGRGAVVR-RAILDKNVVVPPGTQIG 381 (409) T ss_pred ECCCEECCCCEEE-EEEECCCCEECCCCEEC T ss_conf 2949999999998-01989998989998989 No 115 >KOG1462 consensus Probab=98.33 E-value=6.5e-07 Score=45.34 Aligned_cols=84 Identities=19% Similarity=0.217 Sum_probs=35.1 Q ss_pred EEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 09928839979815457543231122222221001135520221111222110112222211011111111243220139 Q gi|254781223|r 11 ATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNAR 90 (110) Q Consensus 11 a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~ 90 (110) +.+.-+..+++++.|++++.| ..+.++++|.++..+.+. ++.+++++.++..+.+. ++.|+.++.|++++.+. ||. T Consensus 329 ~l~g~d~iv~~~t~i~~~s~i-k~SviG~nC~Ig~~~~v~-nSilm~nV~vg~G~~Ie-nsIIg~gA~Ig~gs~L~-nC~ 404 (433) T KOG1462 329 ALVGADSIVGDNTQIGENSNI-KRSVIGSNCDIGERVKVA-NSILMDNVVVGDGVNIE-NSIIGMGAQIGSGSKLK-NCI 404 (433) T ss_pred ECCCHHHCCCCCCEECCCCEE-EEEEECCCCCCCCCCEEE-EEEEECCCEECCCCCEE-CCEECCCCEECCCCEEE-EEE T ss_conf 102323101788552555402-200125776116872777-11766583755884020-42662665654797033-348 Q ss_pred ECCCEEEC Q ss_conf 74984998 Q gi|254781223|r 91 VRGNAVVG 98 (110) Q Consensus 91 I~~~~~I~ 98 (110) |+.+=++- T Consensus 405 Ig~~yvVe 412 (433) T KOG1462 405 IGPGYVVE 412 (433) T ss_pred ECCCCEEC T ss_conf 55982776 No 116 >cd04651 LbH_G1P_AT_C Glucose-1-phosphate adenylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Glucose-1-phosphate adenylyltransferase is also known as ADP-glucose synthase or ADP-glucose pyrophosphorylase. It catalyzes the first committed and rate-limiting step in starch biosynthesis in plants and glycogen biosynthesis in bacteria. It is the enzymatic site for regulation of storage polysaccharide accumulation in plants and bacteria. The enzyme is a homotetramer, with each subunit containing an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain with at 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The LbH domain is involved in cooperative allosteric regulation and oligomerization. Probab=98.30 E-value=7.1e-07 Score=45.18 Aligned_cols=20 Identities=20% Similarity=0.177 Sum_probs=10.7 Q ss_pred EECCCEEECCCCEECCCCEEC Q ss_conf 988970992883997981545 Q gi|254781223|r 6 VVRDCATVIDDARVSGNASVS 26 (110) Q Consensus 6 ~I~~~a~I~~~~~I~~n~~I~ 26 (110) ++.+.+.+. +..|+++|.|. T Consensus 3 ~~~~~~~v~-~S~Ig~Gc~I~ 22 (104) T cd04651 3 YIGRRGEVK-NSLVSEGCIIS 22 (104) T ss_pred EECCCCEEE-ECEECCCCEEE T ss_conf 888994898-49988997995 No 117 >cd04651 LbH_G1P_AT_C Glucose-1-phosphate adenylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Glucose-1-phosphate adenylyltransferase is also known as ADP-glucose synthase or ADP-glucose pyrophosphorylase. It catalyzes the first committed and rate-limiting step in starch biosynthesis in plants and glycogen biosynthesis in bacteria. It is the enzymatic site for regulation of storage polysaccharide accumulation in plants and bacteria. The enzyme is a homotetramer, with each subunit containing an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain with at 5 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). The LbH domain is involved in cooperative allosteric regulation and oligomerization. Probab=98.29 E-value=1.1e-06 Score=44.42 Aligned_cols=50 Identities=18% Similarity=0.339 Sum_probs=17.5 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECC Q ss_conf 5520221111222110112222211011111111243220139749849988 Q gi|254781223|r 48 KVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGG 99 (110) Q Consensus 48 ~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~ 99 (110) .++.++.+..++.+ .++.+.+++.|++++.+ .++.+..++.|++++.|+. T Consensus 30 VIg~~~~Ig~~a~I-~~svi~~~~~Ig~~~~l-~~~Ii~~~~~I~~g~~Ig~ 79 (104) T cd04651 30 VLFRGVRVGSGSVV-EDSVIMPNVGIGRNAVI-RRAIIDKNVVIPDGVVIGG 79 (104) T ss_pred EECCCCEECCCCEE-ECCEEECCCEECCCCEE-EEEEECCCCEECCCCEECC T ss_conf 97299799999899-60099655689999999-8619999999998999998 No 118 >KOG4750 consensus Probab=98.29 E-value=8.7e-07 Score=44.81 Aligned_cols=104 Identities=19% Similarity=0.255 Sum_probs=61.0 Q ss_pred CEEECCCEE--ECCCCEECCCC--EECCCCCCCCCCCCCCC--CCEEEEEEEECCCCCCCCCCCCCCCCCC--CCCCCCC Q ss_conf 769889709--92883997981--54575432311222222--2100113552022111122211011222--2211011 Q gi|254781223|r 4 NAVVRDCAT--VIDDARVSGNA--SVSRFAQVKSNAEVSDN--TYVRDNAKVGGYAKVSGNASVGGNAIVR--DTAEVGG 75 (110) Q Consensus 4 n~~I~~~a~--I~~~~~I~~n~--~I~~~~~i~~~~~i~~~--~~i~~~~~i~~~~~i~~~~~i~~~~~~~--~~~~i~~ 75 (110) |.|..+... +.-++.++++. -|+|.+.|+.+..+... .+++.-+.++.++.+..++.+++...-. ..-.|++ T Consensus 126 ~Lw~~~rk~lal~~q~ris~~~gvdihpaa~ig~gilldhatgvvigeTAvvg~~vSilH~Vtlggtgk~~gdrhP~Igd 205 (269) T KOG4750 126 NLWTQDRKILALGLQVRISPNFGVDIHPAAKIGKGILLDHATGVVIGETAVVGDNVSILHPVTLGGTGKGSGDRHPKIGD 205 (269) T ss_pred HHEECCCEEEEEEECCEECCCCCCCCCCHHHCCCCEEECCCCCEEECCEEEECCCEEEECCEEECCCCCCCCCCCCCCCC T ss_conf 11553771688764123324202124620102653122133323531024742644664443114654665556885567 Q ss_pred CCCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 11111124322013974984998888996980 Q gi|254781223|r 76 DAFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 76 ~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +++|+..+.|-+++.||++++|+.++.+..|. T Consensus 206 ~vliGaGvtILgnV~IGegavIaAGsvV~kDV 237 (269) T KOG4750 206 NVLIGAGVTILGNVTIGEGAVIAAGSVVLKDV 237 (269) T ss_pred CEEECCCCEEECCEEECCCCEEECCCEEEECC T ss_conf 70875541785781677886773050697216 No 119 >KOG1460 consensus Probab=98.27 E-value=1.8e-06 Score=43.54 Aligned_cols=36 Identities=17% Similarity=0.253 Sum_probs=20.3 Q ss_pred CCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCC Q ss_conf 876988970992883997981545754323112222 Q gi|254781223|r 3 DNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVS 38 (110) Q Consensus 3 ~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~ 38 (110) ++++|+|+|.+.+.++||+|+.|++++.+++++.+. T Consensus 287 gdVyIhPsakvhptAkiGPNVSIga~vrvg~GvRl~ 322 (407) T KOG1460 287 GDVYIHPSAKVHPTAKIGPNVSIGANVRVGPGVRLR 322 (407) T ss_pred EEEEECCCCEECCCCCCCCCCEECCCCEECCCCEEE T ss_conf 424776754047766208971446882546882665 No 120 >KOG1460 consensus Probab=98.23 E-value=2.6e-06 Score=42.81 Aligned_cols=32 Identities=16% Similarity=0.189 Sum_probs=17.9 Q ss_pred CCCCEEECCCEEECCCCEECCCCEECCCCCCC Q ss_conf 96876988970992883997981545754323 Q gi|254781223|r 1 MYDNAVVRDCATVIDDARVSGNASVSRFAQVK 32 (110) Q Consensus 1 i~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~ 32 (110) ||+++.++|.|.|++|+.||.++.|++++.++ T Consensus 291 IhPsakvhptAkiGPNVSIga~vrvg~GvRl~ 322 (407) T KOG1460 291 IHPSAKVHPTAKIGPNVSIGANVRVGPGVRLR 322 (407) T ss_pred ECCCCEECCCCCCCCCCEECCCCEECCCCEEE T ss_conf 76754047766208971446882546882665 No 121 >PRK11132 cysE serine acetyltransferase; Provisional Probab=98.22 E-value=2.4e-06 Score=42.99 Aligned_cols=84 Identities=18% Similarity=0.295 Sum_probs=39.2 Q ss_pred ECCCCEECCCCEECC--CCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEE Q ss_conf 928839979815457--543231122222221001135520221111222110112222211011111111243220139 Q gi|254781223|r 13 VIDDARVSGNASVSR--FAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNAR 90 (110) Q Consensus 13 I~~~~~I~~n~~I~~--~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~ 90 (110) |-|.++||...+|.. +.+|++-+.+++++.+..++.+++-....+ ..+..+..++.|+.++.|-++..|+++++ T Consensus 144 IHP~A~iG~gi~iDH~~GvVIGETavigd~vsi~~~VTLGgt~~~~g----~RHP~i~~~v~igaga~iLG~i~iG~~a~ 219 (273) T PRK11132 144 IHPAAKIGRGIMLDHATGIVVGETAVIENDVSILQSVTLGGTGKTSG----DRHPKIREGVMIGAGAKILGNIEVGRGAK 219 (273) T ss_pred CCCCCEECCCEEEECCCCEEECCEEEECCCEEEECCEEEECCCCCCC----CCCCCCCCCEEECCCCEEECCEEECCCCE T ss_conf 26665116856883476358756468668548951235403556688----98980488848754888971858899988 Q ss_pred ECCCEEECCC Q ss_conf 7498499888 Q gi|254781223|r 91 VRGNAVVGGD 100 (110) Q Consensus 91 I~~~~~I~~~ 100 (110) |+.++++-.+ T Consensus 220 igA~svVl~d 229 (273) T PRK11132 220 IGAGSVVLQP 229 (273) T ss_pred ECCCCEECCC T ss_conf 8678668643 No 122 >PRK00844 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=98.16 E-value=3.1e-06 Score=42.54 Aligned_cols=52 Identities=21% Similarity=0.293 Sum_probs=22.7 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCC Q ss_conf 552022111122211011222221101111111124322013974984998888 Q gi|254781223|r 48 KVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDT 101 (110) Q Consensus 48 ~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~ 101 (110) .++.+++|.+++.+ .++.+.+++.|++++.+ .++.|..++.|++++.|+.+. T Consensus 333 vIg~~v~Ig~ga~I-~nSII~~~~~Ig~~a~I-~~sIidk~v~Ig~gt~I~~~~ 384 (409) T PRK00844 333 VLSPNVRVDSGAIV-EGSVLMDGVRIGRGAVV-RRAILDKNVVVPPGTQIGVDL 384 (409) T ss_pred EECCCCEECCCCEE-ECCEEECCCEECCCCEE-EEEEECCCCEECCCCEECCCC T ss_conf 97799899999899-51899294999999999-801989998989998989780 No 123 >PRK04928 consensus Probab=98.11 E-value=4.3e-06 Score=41.92 Aligned_cols=33 Identities=9% Similarity=0.322 Sum_probs=10.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEC Q ss_conf 1222221101111111124322013974984998 Q gi|254781223|r 65 AIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVG 98 (110) Q Consensus 65 ~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~ 98 (110) +.+.+++.|++++.+ .++.|..++.|++|++|+ T Consensus 345 SII~~~~~Ig~ga~i-~~sIidk~v~Ig~g~~I~ 377 (405) T PRK04928 345 SVLLGDVKIGEGCVL-RRVIIDKDVEIAPGTQIG 377 (405) T ss_pred CEEECCCEECCCCEE-ECCEECCCCEECCCCEEC T ss_conf 899692999999999-654988999999999998 No 124 >PRK03282 consensus Probab=98.09 E-value=6e-06 Score=41.30 Aligned_cols=51 Identities=24% Similarity=0.152 Sum_probs=24.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEEC Q ss_conf 22111122211011222221101111111124322013974984998888996 Q gi|254781223|r 52 YAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVE 104 (110) Q Consensus 52 ~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~ 104 (110) ++.+++++.++.++.+. ++.|..++.|+.++.+. ++.+.+++.|++++.|+ T Consensus 329 nSvIg~~v~Ig~ga~I~-~SIIm~~~~Ig~~a~l~-~~Iidk~v~Ig~g~~Ig 379 (406) T PRK03282 329 NSVLSPNVVVDSGAIVE-GSVLMPGVRIGRGAVVR-HAILDKNVVVPPGAMVG 379 (406) T ss_pred CCEECCCCEECCCCEEE-CCEEECCCEECCCCEEE-CCEECCCCEECCCCEEC T ss_conf 99978998999998997-08890968999999997-31989999999898999 No 125 >cd04649 LbH_THP_succinylT_putative Putative 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate (THDP) N-succinyltransferase (THP succinyltransferase), C-terminal left-handed parallel alpha-helix (LbH) domain: This group is composed of mostly uncharacterized proteins containing an N-terminal domain of unknown function and a C-terminal LbH domain with similarity to THP succinyltransferase LbH. THP succinyltransferase catalyzes the conversion of tetrahydrodipicolinate and succinyl-CoA to N-succinyltetrahydrodipicolinate and CoA. It is the committed step in the succinylase pathway by which bacteria synthesize L-lysine and meso-diaminopimelate, a component of peptidoglycan. The enzyme is trimeric and displays the left-handed parallel alpha-helix (LbH) structural motif encoded by the hexapeptide repeat motif. Probab=98.08 E-value=9.9e-06 Score=40.40 Aligned_cols=37 Identities=22% Similarity=0.089 Sum_probs=13.7 Q ss_pred EECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 9928839979815457543231122222221001135 Q gi|254781223|r 12 TVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAK 48 (110) Q Consensus 12 ~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~ 48 (110) +|.+..++.-.+++.+++++.+..++.-++.....+. T Consensus 3 Ri~~~~~VRlGAyl~~Gt~vM~egfVN~nAgt~g~~M 39 (147) T cd04649 3 RIADADRVRLGAYLAEGTTVMHEGFVNFNAGTLGNCM 39 (147) T ss_pred EECCCCEEEECCEECCCCEECCCCEEEECCCCCCCCE T ss_conf 7447866873418879959862567987663066616 No 126 >KOG3121 consensus Probab=98.05 E-value=4.3e-06 Score=41.92 Aligned_cols=95 Identities=12% Similarity=0.133 Sum_probs=43.0 Q ss_pred CCCEECCCCEECCCCCCCCC---CCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCC-----CCCCCCCCC Q ss_conf 88399798154575432311---2222222100113552022111122211011222221101111-----111124322 Q gi|254781223|r 15 DDARVSGNASVSRFAQVKSN---AEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDA-----FVIGFTVIS 86 (110) Q Consensus 15 ~~~~I~~n~~I~~~~~i~~~---~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~-----~i~~~~~I~ 86 (110) .|..+-+...+..++.|++| ++++.+|.+..+..+...-.+...-.-.-...+.++..|+++| .|++.++++ T Consensus 32 QNI~lnGKtIv~~g~iIRGDLAnVr~GryCV~ksrsvIRPp~K~FSKg~affp~hiGdhVFieE~cVVnAAqIgsyVh~G 111 (184) T KOG3121 32 QNILLNGKTIVEEGVIIRGDLANVRIGRYCVLKSRSVIRPPMKIFSKGPAFFPVHIGDHVFIEEECVVNAAQIGSYVHLG 111 (184) T ss_pred CEEEECCCEEEEECCEEECCCCCCEECCEEEECCCCCCCCCHHHHCCCCEEEEEEECCEEEEECCEEEEHHHHEEEEEEC T ss_conf 32798372798608589512010247336896466543770677567862456652456887044186266612036854 Q ss_pred CCEEECCCEEECCCCEECCCEEE Q ss_conf 01397498499888899698097 Q gi|254781223|r 87 GNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 87 ~~~~I~~~~~I~~~~~I~~~~~i 109 (110) .++.||..+++.+.+.|..|+++ T Consensus 112 knaviGrrCVlkdCc~ild~tVl 134 (184) T KOG3121 112 KNAVIGRRCVLKDCCRILDDTVL 134 (184) T ss_pred CCEEECCCEEHHHHEECCCCCCC T ss_conf 64067375083341041589525 No 127 >COG0110 WbbJ Acetyltransferase (isoleucine patch superfamily) [General function prediction only] Probab=98.03 E-value=5.9e-06 Score=41.36 Aligned_cols=31 Identities=13% Similarity=0.126 Sum_probs=10.6 Q ss_pred CCCCCCCCCCCCEEECCCEEECCCCEECCCE Q ss_conf 1111124322013974984998888996980 Q gi|254781223|r 77 AFVIGFTVISGNARVRGNAVVGGDTVVEGDT 107 (110) Q Consensus 77 ~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~ 107 (110) +.|++++-|+.++.|-++++|++++.|...+ T Consensus 125 v~IG~~vwIG~~a~IlpGV~IG~gavigags 155 (190) T COG0110 125 VTIGEDVWIGAGAVILPGVTIGEGAVIGAGS 155 (190) T ss_pred CEECCCCEECCCCEECCCEEECCCEEEECCE T ss_conf 7889985877897897985989981997885 No 128 >COG0448 GlgC ADP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=98.03 E-value=9e-06 Score=40.57 Aligned_cols=51 Identities=20% Similarity=0.358 Sum_probs=20.5 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCC Q ss_conf 55202211112221101122222110111111112432201397498499888 Q gi|254781223|r 48 KVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGD 100 (110) Q Consensus 48 ~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~ 100 (110) .+..++++..++.+ .++++.+++.|+++|.+ .++.|..++.|+++++|++. T Consensus 313 VL~~~v~I~~gs~i-~~svim~~~~IG~~~~l-~~aIIDk~v~I~~g~~i~~~ 363 (393) T COG0448 313 VLFRGVRIGKGSVI-ENSVIMPDVEIGEGAVL-RRAIIDKNVVIGEGVVIGGD 363 (393) T ss_pred EEECCEEECCCCEE-EEEEEECCCEECCCCEE-EEEEECCCCEECCCCEECCC T ss_conf 87567197789888-75387278688899889-99896799686898288677 No 129 >PRK01884 consensus Probab=98.03 E-value=6.4e-06 Score=41.21 Aligned_cols=34 Identities=9% Similarity=-0.019 Sum_probs=14.7 Q ss_pred CCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECC Q ss_conf 11011111111243220139749849988889969 Q gi|254781223|r 71 AEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEG 105 (110) Q Consensus 71 ~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~ 105 (110) +.|..++.|+.++.|. ++.+.+++.|+++++|.- T Consensus 363 SIIm~~~~IG~ga~l~-~~IiDk~v~I~~g~~IG~ 396 (435) T PRK01884 363 CVVLPQVKIGKNCVLK-RCIIDRHCVIPDGMQIGV 396 (435) T ss_pred CEEECCCEECCCCEEE-EEEECCCCEECCCCEECC T ss_conf 8880869899999996-269899999997999985 No 130 >COG4801 Predicted acyltransferase [General function prediction only] Probab=97.99 E-value=1.2e-05 Score=39.99 Aligned_cols=87 Identities=17% Similarity=0.226 Sum_probs=38.2 Q ss_pred EECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEE Q ss_conf 99288399798154575432311222222210011355202211112221101122222110111111112432201397 Q gi|254781223|r 12 TVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARV 91 (110) Q Consensus 12 ~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I 91 (110) .+.+++.||+|+.+... +++.....++++.+..... ..+..+..++.+.+|+....++.+++.+.|.+...+..+..+ T Consensus 18 vv~gdViIG~nS~l~~~-V~g~~iivge~v~i~Gdiv-a~diridmw~kv~gNV~ve~dayiGE~~sI~gkl~v~gdLdi 95 (277) T COG4801 18 VVKGDVIIGKNSMLKYG-VVGEEIIVGERVRIYGDIV-AKDIRIDMWCKVTGNVIVENDAYIGEFSSIKGKLTVIGDLDI 95 (277) T ss_pred EEECCEEECCCCEEEEE-EEEEEEEECCCCEEEEEEE-ECCEEEEEEEEEECCEEECCCEEEECCCEEEEEEEEECCCCC T ss_conf 97546898566426443-2201588436707860687-311366215675231898474077033024016999533235 Q ss_pred CCCEEECCC Q ss_conf 498499888 Q gi|254781223|r 92 RGNAVVGGD 100 (110) Q Consensus 92 ~~~~~I~~~ 100 (110) ++++.|-++ T Consensus 96 g~dV~Iegg 104 (277) T COG4801 96 GADVIIEGG 104 (277) T ss_pred CCCEEEECC T ss_conf 651687567 No 131 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=97.96 E-value=8.4e-06 Score=40.70 Aligned_cols=91 Identities=12% Similarity=0.227 Sum_probs=52.6 Q ss_pred CCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 89709928839979815457543231122222221001135520221111222110112222211011111111243220 Q gi|254781223|r 8 RDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISG 87 (110) Q Consensus 8 ~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~ 87 (110) ...+.|.+-|.|++.+.|.-|+.|++.+.|++||.+.+.-.+++.+.++..+.+.. +.+.. +.+-+.+.|-..-.=-. T Consensus 252 ~~~s~i~GRV~v~~gakid~ns~i~GPa~IG~d~~I~nsGyiGPYtSig~~~~I~d-~~vE~-S~vldes~I~~v~~Ri~ 329 (361) T TIGR01208 252 DDESKIEGRVEVGEGAKIDVNSVIRGPAVIGEDCIIENSGYIGPYTSIGEGVVIRD-AEVEH-SIVLDESVIEGVEKRIV 329 (361) T ss_pred ECCCEEEEEEEECCCCEECCCCEEECCEEECCCCEEECCCCCCCEEEECCCEEEEC-CCCCC-EEEECCCEEEEHHHHHH T ss_conf 22657884079089788423777870608779527826861177077257538741-63020-35644211431032022 Q ss_pred CEEECCCEEECCC Q ss_conf 1397498499888 Q gi|254781223|r 88 NARVRGNAVVGGD 100 (110) Q Consensus 88 ~~~I~~~~~I~~~ 100 (110) ++.||..+.|-.+ T Consensus 330 dS~iG~~~~i~~~ 342 (361) T TIGR01208 330 DSVIGKKVRIKGN 342 (361) T ss_pred HEECCCCCEEECC T ss_conf 3011782078437 No 132 >PRK03701 consensus Probab=97.95 E-value=7.5e-06 Score=40.91 Aligned_cols=49 Identities=10% Similarity=0.050 Sum_probs=19.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEEC Q ss_conf 111122211011222221101111111124322013974984998888996 Q gi|254781223|r 54 KVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVE 104 (110) Q Consensus 54 ~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~ 104 (110) .++.++.++.++.+. ++.|..++.|+.++++. ++.|.+++.|++++.|. T Consensus 349 Vl~r~v~I~~gA~V~-nSIIm~~~~Ig~ga~l~-~~IiDk~v~I~~g~vIg 397 (431) T PRK03701 349 VLFPRVRVNSFCNID-SAVLLPEVWVGRSCRLR-RCVIDRACVIPEGMVIG 397 (431) T ss_pred CCCCCCEECCCCEEE-CCEECCCCEECCCCEEE-EEEECCCCEECCCCEEC T ss_conf 811888999998997-26985969899999998-37989999999799989 No 133 >KOG1322 consensus Probab=97.95 E-value=8.2e-06 Score=40.74 Aligned_cols=52 Identities=12% Similarity=0.217 Sum_probs=32.1 Q ss_pred CCCEEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCC Q ss_conf 68769889709928839979815457543231122222221001135520221 Q gi|254781223|r 2 YDNAVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAK 54 (110) Q Consensus 2 ~~n~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~ 54 (110) -+|..+.+-+.++.++.|++|+.|++++.|.+++.+.+.+.++.. .+..++- T Consensus 262 ~~nvlvd~~~~iG~~C~Ig~~vvIG~r~~i~~gV~l~~s~il~~~-~~~~~s~ 313 (371) T KOG1322 262 VGNVLVDSIASIGENCSIGPNVVIGPRVRIEDGVRLQDSTILGAD-YYETHSE 313 (371) T ss_pred CCCEEECCCCCCCCCCEECCCCEECCCCEECCCEEEEEEEEECCC-EECHHHH T ss_conf 243762140013886378887367787286276288742797251-2101688 No 134 >KOG4042 consensus Probab=97.93 E-value=5.1e-06 Score=41.60 Aligned_cols=44 Identities=16% Similarity=0.244 Sum_probs=23.6 Q ss_pred EEECCCEE------ECCCCEECCCCEECCCCCCCCC---CCCCCCCCEEEEEE Q ss_conf 69889709------9288399798154575432311---22222221001135 Q gi|254781223|r 5 AVVRDCAT------VIDDARVSGNASVSRFAQVKSN---AEVSDNTYVRDNAK 48 (110) Q Consensus 5 ~~I~~~a~------I~~~~~I~~n~~I~~~~~i~~~---~~i~~~~~i~~~~~ 48 (110) +.|.|.|. |.+++.+++.|.++|.+++-.. ..++++..+-..+. T Consensus 9 vkIap~AvVCvEs~irGdvti~~gcVvHP~a~~iA~aGPI~iGEnniiEEyA~ 61 (190) T KOG4042 9 VKIAPSAVVCVESDIRGDVTIKEGCVVHPFAVFIATAGPIYIGENNIIEEYAV 61 (190) T ss_pred EEECCCEEEEEECCCCCCEEECCCCEECCEEEEECCCCCEEECCCCHHHHHHH T ss_conf 46457518998602133148658838533079970569889866712566788 No 135 >PRK00725 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=97.88 E-value=2.5e-05 Score=38.75 Aligned_cols=51 Identities=10% Similarity=0.057 Sum_probs=23.3 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECC Q ss_conf 21111222110112222211011111111243220139749849988889969 Q gi|254781223|r 53 AKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEG 105 (110) Q Consensus 53 ~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~ 105 (110) +.++.++.++..+.+. ++.|..++.|+.++.+. ++.+.+++.|++++.|.. T Consensus 348 SVI~r~v~Ig~gA~V~-nSIIm~~~~IG~ga~l~-naIiDk~v~I~~G~~Ig~ 398 (431) T PRK00725 348 SVLFTRVRVNSFSNID-SAVLLPEVWVGRSCRLR-RCVIDRACVIPEGMVIGE 398 (431) T ss_pred CCCCCCCEECCCCEEE-EEEECCCCEECCCCEEE-EEEECCCCEECCCCEECC T ss_conf 9825998999998996-13988989889999997-139899989898999896 No 136 >PRK01884 consensus Probab=97.83 E-value=2.2e-05 Score=38.97 Aligned_cols=53 Identities=15% Similarity=0.240 Sum_probs=27.0 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCC Q ss_conf 1355202211112221101122222110111111112432201397498499888 Q gi|254781223|r 46 NAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGD 100 (110) Q Consensus 46 ~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~ 100 (110) ++.+..++++..++.+ .++.+..++.|++++.+ .++.+..++.|++++.||-+ T Consensus 345 nSVL~r~v~I~~gA~V-~nSIIm~~~~IG~ga~l-~~~IiDk~v~I~~g~~IG~~ 397 (435) T PRK01884 345 NSVLFDRIKVDEFSKI-DHCVVLPQVKIGKNCVL-KRCIIDRHCVIPDGMQIGVD 397 (435) T ss_pred CCEEECCCEECCCCEE-EECEEECCCEECCCCEE-EEEEECCCCEECCCCEECCC T ss_conf 1982089699999999-82888086989999999-62698999999979999858 No 137 >KOG1462 consensus Probab=97.79 E-value=5.5e-05 Score=37.29 Aligned_cols=59 Identities=17% Similarity=0.316 Sum_probs=21.5 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCCEEE Q ss_conf 55202211112221101122222110111111112432201397498499888899698097 Q gi|254781223|r 48 KVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGDTVL 109 (110) Q Consensus 48 ~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~~~i 109 (110) .++.++.|++.+.+ .++.+..++.+++.+.| .++.|+.+++|++++.+- ++.|+.+-+| T Consensus 353 viG~nC~Ig~~~~v-~nSilm~nV~vg~G~~I-ensIIg~gA~Ig~gs~L~-nC~Ig~~yvV 411 (433) T KOG1462 353 VIGSNCDIGERVKV-ANSILMDNVVVGDGVNI-ENSIIGMGAQIGSGSKLK-NCIIGPGYVV 411 (433) T ss_pred EECCCCCCCCCCEE-EEEEEECCCEECCCCCE-ECCEECCCCEECCCCEEE-EEEECCCCEE T ss_conf 12577611687277-71176658375588402-042662665654797033-3485598277 No 138 >TIGR02091 glgC glucose-1-phosphate adenylyltransferase; InterPro: IPR011831 This enzyme, glucose-1-phosphate adenylyltransferase, is also called ADP-glucose pyrophosphorylase. The plant form is an alpha2, beta2 heterodimer, allosterically regulated in plants. Both subunits are homologous and included in this entry. In bacteria, both homomeric forms of GlgC and more active heterodimers of GlgC and GlgD have been described. This entry describes the GlgC subunit only. This enzyme appears in variants of glycogen synthesis pathways that use ADP-glucose, rather than UDP-glucose as in animals.; GO: 0008878 glucose-1-phosphate adenylyltransferase activity. Probab=97.77 E-value=1.7e-05 Score=39.48 Aligned_cols=80 Identities=16% Similarity=0.284 Sum_probs=31.2 Q ss_pred CCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCC----CCCC--CCCCCCCCCCCCCCCCCCC Q ss_conf 883997981545754323112222222100113552022111122211011----2222--2110111111112432201 Q gi|254781223|r 15 DDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNA----IVRD--TAEVGGDAFVIGFTVISGN 88 (110) Q Consensus 15 ~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~----~~~~--~~~i~~~~~i~~~~~I~~~ 88 (110) .+..|+++|.|...+ | .++.++.++++..++.+.+ ++|+++.+..... ...+ .+.||++|.| .+|.|..| T Consensus 335 ~~Slv~~Gc~i~g~~-v-~~SvL~~~v~i~~~~~v~d-sVim~~Dfy~~~~E~~~l~~G~~a~~IG~~~~i-~~~IiD~~ 410 (421) T TIGR02091 335 SDSLVAEGCIISGAT-V-SHSVLGSRVRIGSGSTVED-SVIMGDDFYENESEIDALKDGKVAVGIGRGAVI-RNAIIDKN 410 (421) T ss_pred EEEEEECCCEECCCE-E-EEEEEECCCEECCCCEEEE-EEECCCCCCCCHHHHHHHHCCCCCEEECCCCEE-EEEECCCC T ss_conf 440775671880767-8-7228526868546557988-688798654785688888649752231788888-66380587 Q ss_pred EEECCCEEEC Q ss_conf 3974984998 Q gi|254781223|r 89 ARVRGNAVVG 98 (110) Q Consensus 89 ~~I~~~~~I~ 98 (110) |+|+++++|+ T Consensus 411 ~~Ig~g~~Ig 420 (421) T TIGR02091 411 VRIGEGVQIG 420 (421) T ss_pred CCCCCCCEEC T ss_conf 7848955573 No 139 >PRK03701 consensus Probab=97.74 E-value=4.1e-05 Score=37.84 Aligned_cols=51 Identities=16% Similarity=0.173 Sum_probs=22.2 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCC Q ss_conf 55202211112221101122222110111111112432201397498499888 Q gi|254781223|r 48 KVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGD 100 (110) Q Consensus 48 ~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~ 100 (110) .+..++.+..++.+ .++.+.+++.|++++.+ .++.+..++.|+++++||.+ T Consensus 349 Vl~r~v~I~~gA~V-~nSIIm~~~~Ig~ga~l-~~~IiDk~v~I~~g~vIg~~ 399 (431) T PRK03701 349 VLFPRVRVNSFCNI-DSAVLLPEVWVGRSCRL-RRCVIDRACVIPEGMVIGEN 399 (431) T ss_pred CCCCCCEECCCCEE-ECCEECCCCEECCCCEE-EEEEECCCCEECCCCEECCC T ss_conf 81188899999899-72698596989999999-83798999999979998968 No 140 >COG4801 Predicted acyltransferase [General function prediction only] Probab=97.68 E-value=8.5e-05 Score=36.52 Aligned_cols=86 Identities=20% Similarity=0.299 Sum_probs=28.1 Q ss_pred CCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCE Q ss_conf 83997981545754323112222222100113552022111122211011222221101111111124322013974984 Q gi|254781223|r 16 DARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNA 95 (110) Q Consensus 16 ~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~ 95 (110) ++.+..+..|++++.+... ..++...+..+..+++...- .+..+...+.+.+|.....++.++..++|.+...+..+. T Consensus 16 ~ivv~gdViIG~nS~l~~~-V~g~~iivge~v~i~Gdiva-~diridmw~kv~gNV~ve~dayiGE~~sI~gkl~v~gdL 93 (277) T COG4801 16 IIVVKGDVIIGKNSMLKYG-VVGEEIIVGERVRIYGDIVA-KDIRIDMWCKVTGNVIVENDAYIGEFSSIKGKLTVIGDL 93 (277) T ss_pred EEEEECCEEECCCCEEEEE-EEEEEEEECCCCEEEEEEEE-CCEEEEEEEEEECCEEECCCEEEECCCEEEEEEEEECCC T ss_conf 6897546898566426443-22015884367078606873-113662156752318984740770330240169995332 Q ss_pred EECCCCEE Q ss_conf 99888899 Q gi|254781223|r 96 VVGGDTVV 103 (110) Q Consensus 96 ~I~~~~~I 103 (110) .++.+..| T Consensus 94 dig~dV~I 101 (277) T COG4801 94 DIGADVII 101 (277) T ss_pred CCCCCEEE T ss_conf 35651687 No 141 >TIGR02353 NRPS_term_dom non-ribosomal peptide synthetase terminal domain of unknown function; InterPro: IPR012728 This domain is found exclusively in non-ribosomal peptide synthetases and always as the final domain in the polypeptide. This domain is roughly 700 amino acids in size and is found in polypeptides roughly twice that size.. Probab=97.60 E-value=4.2e-05 Score=37.80 Aligned_cols=30 Identities=27% Similarity=0.410 Sum_probs=9.6 Q ss_pred CCCCCCCCCCCCCCCCCEEECCCEEECCCC Q ss_conf 101111111124322013974984998888 Q gi|254781223|r 72 EVGGDAFVIGFTVISGNARVRGNAVVGGDT 101 (110) Q Consensus 72 ~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~ 101 (110) .++++++||..+.+..|+.||+.+.+.+.+ T Consensus 169 ~~g~~A~vG~~s~l~~~t~iG~~a~L~~~S 198 (719) T TIGR02353 169 RIGDDAFVGTRSVLDIDTAIGDRAQLGHGS 198 (719) T ss_pred EECCCEEEEEEEEECCCCCCCCHHCCCCCC T ss_conf 983716872013747887236502035546 No 142 >COG0448 GlgC ADP-glucose pyrophosphorylase [Carbohydrate transport and metabolism] Probab=97.54 E-value=0.00017 Score=35.29 Aligned_cols=53 Identities=19% Similarity=0.292 Sum_probs=26.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECCC Q ss_conf 2211112221101122222110111111112432201397498499888899698 Q gi|254781223|r 52 YAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEGD 106 (110) Q Consensus 52 ~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~~ 106 (110) ++.++.++.+...+.+. ++.+..++.|+.++++. ++.|-+|+.|++++.|.++ T Consensus 311 nSVL~~~v~I~~gs~i~-~svim~~~~IG~~~~l~-~aIIDk~v~I~~g~~i~~~ 363 (393) T COG0448 311 NSVLFRGVRIGKGSVIE-NSVIMPDVEIGEGAVLR-RAIIDKNVVIGEGVVIGGD 363 (393) T ss_pred EEEEECCEEECCCCEEE-EEEEECCCEECCCCEEE-EEEECCCCEECCCCEECCC T ss_conf 63875671977898887-53872786888998899-9896799686898288677 No 143 >KOG4042 consensus Probab=97.53 E-value=0.00014 Score=35.56 Aligned_cols=32 Identities=19% Similarity=0.330 Sum_probs=24.0 Q ss_pred CCCEEECCCEEEC---CCCEECCCCEECCCCCCCC Q ss_conf 6876988970992---8839979815457543231 Q gi|254781223|r 2 YDNAVVRDCATVI---DDARVSGNASVSRFAQVKS 33 (110) Q Consensus 2 ~~n~~I~~~a~I~---~~~~I~~n~~I~~~~~i~~ 33 (110) .+.|.++|.|.+. +-..||+|+.|..++.|+. T Consensus 30 ~~gcVvHP~a~~iA~aGPI~iGEnniiEEyA~i~n 64 (190) T KOG4042 30 KEGCVVHPFAVFIATAGPIYIGENNIIEEYAVIRN 64 (190) T ss_pred CCCCEECCEEEEECCCCCEEECCCCHHHHHHHHHH T ss_conf 58838533079970569889866712566788770 No 144 >TIGR02091 glgC glucose-1-phosphate adenylyltransferase; InterPro: IPR011831 This enzyme, glucose-1-phosphate adenylyltransferase, is also called ADP-glucose pyrophosphorylase. The plant form is an alpha2, beta2 heterodimer, allosterically regulated in plants. Both subunits are homologous and included in this entry. In bacteria, both homomeric forms of GlgC and more active heterodimers of GlgC and GlgD have been described. This entry describes the GlgC subunit only. This enzyme appears in variants of glycogen synthesis pathways that use ADP-glucose, rather than UDP-glucose as in animals.; GO: 0008878 glucose-1-phosphate adenylyltransferase activity. Probab=97.44 E-value=8.9e-05 Score=36.43 Aligned_cols=51 Identities=22% Similarity=0.276 Sum_probs=29.5 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCC------------------CCCCCCCCCCCCEEECCCEEECCCCEEC Q ss_conf 2211112221101122222110111------------------1111124322013974984998888996 Q gi|254781223|r 52 YAKVSGNASVGGNAIVRDTAEVGGD------------------AFVIGFTVISGNARVRGNAVVGGDTVVE 104 (110) Q Consensus 52 ~~~i~~~~~i~~~~~~~~~~~i~~~------------------~~i~~~~~I~~~~~I~~~~~I~~~~~I~ 104 (110) +.+++.++.+..++.+.+ +.|.++ +.||.+|+|. +|.|-+|++|++++.|. T Consensus 352 ~SvL~~~v~i~~~~~v~d-sVim~~Dfy~~~~E~~~l~~G~~a~~IG~~~~i~-~~IiD~~~~Ig~g~~Ig 420 (421) T TIGR02091 352 HSVLGSRVRIGSGSTVED-SVIMGDDFYENESEIDALKDGKVAVGIGRGAVIR-NAIIDKNVRIGEGVQIG 420 (421) T ss_pred EEEEECCCEECCCCEEEE-EEECCCCCCCCHHHHHHHHCCCCCEEECCCCEEE-EEECCCCCCCCCCCEEC T ss_conf 228526868546557988-6887986547856888886497522317888886-63805877848955573 No 145 >PRK00725 glgC glucose-1-phosphate adenylyltransferase; Provisional Probab=97.38 E-value=0.00017 Score=35.27 Aligned_cols=50 Identities=16% Similarity=0.181 Sum_probs=23.4 Q ss_pred EEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEEC Q ss_conf 3552022111122211011222221101111111124322013974984998 Q gi|254781223|r 47 AKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVG 98 (110) Q Consensus 47 ~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~ 98 (110) +.+...++|..++.+ .++.+..++.|++++.+ .++.+..++.|++++.|+ T Consensus 348 SVI~r~v~Ig~gA~V-~nSIIm~~~~IG~ga~l-~naIiDk~v~I~~G~~Ig 397 (431) T PRK00725 348 SVLFTRVRVNSFSNI-DSAVLLPEVWVGRSCRL-RRCVIDRACVIPEGMVIG 397 (431) T ss_pred CCCCCCCEECCCCEE-EEEEECCCCEECCCCEE-EEEEECCCCEECCCCEEC T ss_conf 982599899999899-61398898988999999-713989998989899989 No 146 >TIGR01208 rmlA_long glucose-1-phosphate thymidylyltransferase; InterPro: IPR005908 Synonym: dTDP-D-glucose synthase This group of proteins comprises a tightly conserved but broadly distributed subfamily of known and putative bacterial glucose-1-phosphate thymidylyltransferases (2.7.7.24 from EC). It is well characterised in several species as the first of four enzymes involved in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.dTTP + alpha-D-glucose 1-phosphate = diphosphate + dTDP-glucose The family of known and putative glucose-1-phosphate thymidyltransferases shows a deep split into a short form (see IPR005907 from INTERPRO) and a long form described by this model. The homotetrameric short form is found in numerous bacterial species that incorporate dTDP-L-rhamnose, which it helps synthesize, into the cell wall. It is subject to feedback inhibition. The long form, in contrast, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. . Probab=97.36 E-value=0.0003 Score=34.22 Aligned_cols=52 Identities=12% Similarity=0.213 Sum_probs=19.1 Q ss_pred EEECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEECCCCCC Q ss_conf 6988970992883997981545754323112222222100113552022111 Q gi|254781223|r 5 AVVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVGGYAKVS 56 (110) Q Consensus 5 ~~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~ 56 (110) ..|.+.+.|...++|.-|+.|...|.|+.|+.|.+.-.++..+.++..+.+. T Consensus 255 s~i~GRV~v~~gakid~ns~i~GPa~IG~d~~I~nsGyiGPYtSig~~~~I~ 306 (361) T TIGR01208 255 SKIEGRVEVGEGAKIDVNSVIRGPAVIGEDCIIENSGYIGPYTSIGEGVVIR 306 (361) T ss_pred CEEEEEEEECCCCEECCCCEEECCEEECCCCEEECCCCCCCEEEECCCEEEE T ss_conf 5788407908978842377787060877952782686117707725753874 No 147 >TIGR02353 NRPS_term_dom non-ribosomal peptide synthetase terminal domain of unknown function; InterPro: IPR012728 This domain is found exclusively in non-ribosomal peptide synthetases and always as the final domain in the polypeptide. This domain is roughly 700 amino acids in size and is found in polypeptides roughly twice that size.. Probab=97.34 E-value=0.00031 Score=34.14 Aligned_cols=37 Identities=30% Similarity=0.308 Sum_probs=15.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECC Q ss_conf 2211011111111243220139749849988889969 Q gi|254781223|r 69 DTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEG 105 (110) Q Consensus 69 ~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~ 105 (110) +...|++.|.+++.+.+...+.||+++.++..+.|++ T Consensus 668 D~v~igdgatlG~~aiVlY~~~~G~~a~lgp~SlVMk 704 (719) T TIGR02353 668 DTVTIGDGATLGPGAIVLYGAVVGEDAVLGPDSLVMK 704 (719) T ss_pred CEEEEECEEEECCCCEECCCEEECCCCEECCCCEEEC T ss_conf 5289724038888635857315568643468432503 No 148 >KOG4750 consensus Probab=97.18 E-value=0.001 Score=31.96 Aligned_cols=91 Identities=18% Similarity=0.220 Sum_probs=38.4 Q ss_pred CEEECCCE--EECCCCEECCCCEECCC--CCCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 76988970--99288399798154575--432311222222210011355202211112221101122222110111111 Q gi|254781223|r 4 NAVVRDCA--TVIDDARVSGNASVSRF--AQVKSNAEVSDNTYVRDNAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFV 79 (110) Q Consensus 4 n~~I~~~a--~I~~~~~I~~n~~I~~~--~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i 79 (110) +.+++++. -|-|.++||+.-.+... .+|++-..+++++.+..+..++.--.-. -..+..+.+++.++..+.| T Consensus 140 q~ris~~~gvdihpaa~ig~gilldhatgvvigeTAvvg~~vSilH~Vtlggtgk~~----gdrhP~Igd~vliGaGvtI 215 (269) T KOG4750 140 QVRISPNFGVDIHPAAKIGKGILLDHATGVVIGETAVVGDNVSILHPVTLGGTGKGS----GDRHPKIGDNVLIGAGVTI 215 (269) T ss_pred CCEECCCCCCCCCCHHHCCCCEEECCCCCEEECCEEEECCCEEEECCEEECCCCCCC----CCCCCCCCCCEEECCCCEE T ss_conf 123324202124620102653122133323531024742644664443114654665----5568855677087554178 Q ss_pred CCCCCCCCCEEECCCEEEC Q ss_conf 1124322013974984998 Q gi|254781223|r 80 IGFTVISGNARVRGNAVVG 98 (110) Q Consensus 80 ~~~~~I~~~~~I~~~~~I~ 98 (110) -+++.|+.++.|+.++++- T Consensus 216 LgnV~IGegavIaAGsvV~ 234 (269) T KOG4750 216 LGNVTIGEGAVIAAGSVVL 234 (269) T ss_pred ECCEEECCCCEEECCCEEE T ss_conf 5781677886773050697 No 149 >TIGR03536 DapD_gpp 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase (DapD) is involved in the succinylated branch of the "lysine biosynthesis via diaminopimelate (DAP)" pathway (GenProp0125). This model represents a clade of DapD sequences most closely related to the actinobacterial DapD family represented by the TIGR03535 model. All of the genes evaluated for the seed of this model are found in genomes where the downstream desuccinylase is present, but known DapD genes are absent. Additionally, many of the genes identified by this model are found proximal to genes involved in this lysine biosynthesis pathway. Probab=96.87 E-value=0.0028 Score=30.17 Aligned_cols=98 Identities=20% Similarity=0.168 Sum_probs=46.6 Q ss_pred EECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEE----CCCCCCCCCCCCCCCCCCC--------CCCC Q ss_conf 988970992883997981545754323112222222100113552----0221111222110112222--------2110 Q gi|254781223|r 6 VVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVG----GYAKVSGNASVGGNAIVRD--------TAEV 73 (110) Q Consensus 6 ~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~----~~~~i~~~~~i~~~~~~~~--------~~~i 73 (110) .+-++++|.+-.++.=.+++.+++++.....+..|+-......+. ..+.++..+.+++.+.+.+ ...| T Consensus 174 vvP~gvRIad~~rVRLGA~L~~GTtvmhegfvnfNAGTlG~~mvEGRiS~GVvVg~gsDiGGgaSimGtLSGGg~~~isi 253 (341) T TIGR03536 174 VVPKGVRIADTARVRLGAYVGEGTTVMHEGFINFNAGTEGPSMVEGRISAGVMVGKGSDLGGGCSTMGTLSGGGNIVISV 253 (341) T ss_pred ECCCCCEECCCCCEEEEEEECCCEEEECCCEEECCCCCCCCCEEEEEEECCEEECCCCCCCCCEEEEEEECCCCCEEEEE T ss_conf 56788376156634430077588177022147504777664245304403469667876677502477635898236888 Q ss_pred CCCCCCCCCCCC----CCCEEECCCEEECCCCEE Q ss_conf 111111112432----201397498499888899 Q gi|254781223|r 74 GGDAFVIGFTVI----SGNARVRGNAVVGGDTVV 103 (110) Q Consensus 74 ~~~~~i~~~~~I----~~~~~I~~~~~I~~~~~I 103 (110) +++|.++.|+-+ +++|.|..+..|-.++.| T Consensus 254 G~~cLlGANsg~GI~LGD~C~veaGlyvtagtkv 287 (341) T TIGR03536 254 GEGCLLGANAGIGIPLGDRCTVEAGLYITAGTKV 287 (341) T ss_pred CCCCCCCCCCCCCCCCCCCCEEECCEEEECCCEE T ss_conf 7322014445557424898478124499279679 No 150 >TIGR02092 glgD glucose-1-phosphate adenylyltransferase, GlgD subunit; InterPro: IPR011832 This family is GlgD, an apparent regulatory protein that appears in an alpha2/beta2 heterotetramer with GlgC (glucose-1-phosphate adenylyltransferase, IPR011831 from INTERPRO) in a subset of bacteria that use GlgC for glycogen biosynthesis.; GO: 0005978 glycogen biosynthetic process. Probab=96.81 E-value=0.0011 Score=31.95 Aligned_cols=52 Identities=13% Similarity=0.208 Sum_probs=23.7 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECCCCEECC Q ss_conf 221111222110112222211011111111243220139749849988889969 Q gi|254781223|r 52 YAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGGDTVVEG 105 (110) Q Consensus 52 ~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~~~~I~~ 105 (110) ++.+++.+.+..++.+. +|.|..+|.|+.+|++ +++.+=+++.|.++..+.+ T Consensus 318 NSil~R~V~v~kd~~~k-n~iimq~~~Ig~ga~L-~~vI~DK~~~I~~n~~~~G 369 (383) T TIGR02092 318 NSILSRGVYVGKDALIK-NSIIMQRTVIGEGAHL-ENVIIDKDVVIEPNEKIAG 369 (383) T ss_pred EEEEECCEEECCCCEEE-EEEECCCCEECCCCEE-EEEEECCCCEECCCEEEEE T ss_conf 57985541872786787-5477588777878656-1588657628788706852 No 151 >TIGR03535 DapD_actino 2,3,4,5-tetrahydropyridine-2,6-dicarboxylate N-succinyltransferase. Alternate name: tetrahydrodipicolinate N-succinyltransferase. Probab=96.63 E-value=0.0059 Score=28.83 Aligned_cols=98 Identities=20% Similarity=0.167 Sum_probs=47.9 Q ss_pred EECCCEEECCCCEECCCCEECCCCCCCCCCCCCCCCCEEEEEEEE----CCCCCCCCCCCCCCCCCCC--------CCCC Q ss_conf 988970992883997981545754323112222222100113552----0221111222110112222--------2110 Q gi|254781223|r 6 VVRDCATVIDDARVSGNASVSRFAQVKSNAEVSDNTYVRDNAKVG----GYAKVSGNASVGGNAIVRD--------TAEV 73 (110) Q Consensus 6 ~I~~~a~I~~~~~I~~n~~I~~~~~i~~~~~i~~~~~i~~~~~i~----~~~~i~~~~~i~~~~~~~~--------~~~i 73 (110) .+-++++|.+-.++.=.+++.+++++.....+..|+-......+. ..+.++.++.+++.+.+.+ ...| T Consensus 149 vvP~gVRIada~rVRLGAyl~~GTtVmheGfvnfNaGTlG~~mvEGRiS~gvvvg~gsDiGGgaS~mGtLSGGg~~~isi 228 (319) T TIGR03535 149 VVPTGVRIGDADRVRLGAHLAEGTTVMHEGFVNFNAGTLGASMVEGRISAGVVVGDGSDIGGGASIMGTLSGGGKEVISI 228 (319) T ss_pred ECCCCCEECCCCCEEEEEEECCCCEEEECCEEEECCCCCCCCEEEEEEECCEEECCCCCCCCCCEEEEEECCCCCEEEEE T ss_conf 56788477045525540076598077100137604767665346314403469667876677501145304898224788 Q ss_pred CCCCCCCCCCCC----CCCEEECCCEEECCCCEE Q ss_conf 111111112432----201397498499888899 Q gi|254781223|r 74 GGDAFVIGFTVI----SGNARVRGNAVVGGDTVV 103 (110) Q Consensus 74 ~~~~~i~~~~~I----~~~~~I~~~~~I~~~~~I 103 (110) +++|.++.|+-+ +++|.|..+..|-.++.| T Consensus 229 G~~cLlGANsGiGI~LGD~C~VeaGlyitaGTkV 262 (319) T TIGR03535 229 GERCLLGANSGIGISLGDDCVVEAGLYVTAGTKV 262 (319) T ss_pred CCCCEECCCCCCCCCCCCCCEEECCEEECCCCEE T ss_conf 6532004545557435897178112288179779 No 152 >TIGR02092 glgD glucose-1-phosphate adenylyltransferase, GlgD subunit; InterPro: IPR011832 This family is GlgD, an apparent regulatory protein that appears in an alpha2/beta2 heterotetramer with GlgC (glucose-1-phosphate adenylyltransferase, IPR011831 from INTERPRO) in a subset of bacteria that use GlgC for glycogen biosynthesis.; GO: 0005978 glycogen biosynthetic process. Probab=96.57 E-value=0.0019 Score=30.84 Aligned_cols=52 Identities=19% Similarity=0.291 Sum_probs=28.3 Q ss_pred EEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCEEECC Q ss_conf 135520221111222110112222211011111111243220139749849988 Q gi|254781223|r 46 NAKVGGYAKVSGNASVGGNAIVRDTAEVGGDAFVIGFTVISGNARVRGNAVVGG 99 (110) Q Consensus 46 ~~~i~~~~~i~~~~~i~~~~~~~~~~~i~~~~~i~~~~~I~~~~~I~~~~~I~~ 99 (110) |+++.-.+.|..++.+ .++.+...|.|+++|.+ +++.+..++.|++|..+.+ T Consensus 318 NSil~R~V~v~kd~~~-kn~iimq~~~Ig~ga~L-~~vI~DK~~~I~~n~~~~G 369 (383) T TIGR02092 318 NSILSRGVYVGKDALI-KNSIIMQRTVIGEGAHL-ENVIIDKDVVIEPNEKIAG 369 (383) T ss_pred EEEEECCEEECCCCEE-EEEEECCCCEECCCCEE-EEEEECCCCEECCCEEEEE T ss_conf 5798554187278678-75477588777878656-1588657628788706852 No 153 >pfam04519 DUF583 Protein of unknown function, DUF583. This family contains several uncharacterized hypothetical proteins. Probab=26.62 E-value=36 Score=13.03 Aligned_cols=19 Identities=21% Similarity=0.365 Sum_probs=7.7 Q ss_pred ECCCEEECCCCEECCCCEE Q ss_conf 8897099288399798154 Q gi|254781223|r 7 VRDCATVIDDARVSGNASV 25 (110) Q Consensus 7 I~~~a~I~~~~~I~~n~~I 25 (110) |++++.+.++..-.++..| T Consensus 3 I~~~~~i~G~i~~~g~l~I 21 (95) T pfam04519 3 IGKGTTIEGDITFSGDLRI 21 (95) T ss_pred ECCCCEEEEEEEECCEEEE T ss_conf 6899999999978873999 Done!