Query gi|254780777|ref|YP_003065190.1| uridylate kinase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 242 No_of_seqs 109 out of 4303 Neff 9.2 Searched_HMMs 39220 Date Sun May 29 19:38:01 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780777.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00358 pyrH uridylate kinase 100.0 0 0 343.5 18.2 231 7-240 1-231 (231) 2 cd04254 AAK_UMPK-PyrH-Ec UMP k 100.0 0 0 341.2 18.4 231 7-240 1-231 (231) 3 cd04239 AAK_UMPK-like AAK_UMPK 100.0 0 0 336.3 18.9 229 8-240 1-229 (229) 4 TIGR02075 pyrH_bact uridylate 100.0 0 0 337.0 15.5 232 6-240 1-236 (236) 5 TIGR01027 proB glutamate 5-kin 100.0 0 0 332.9 16.4 235 7-242 1-264 (379) 6 COG0528 PyrH Uridylate kinase 100.0 0 0 310.2 19.8 236 3-241 2-238 (238) 7 PRK12314 gamma-glutamyl kinase 100.0 0 0 309.8 18.5 235 6-242 9-264 (265) 8 PRK12443 uridylate kinase; Rev 100.0 0 0 306.1 19.3 235 5-242 3-239 (247) 9 PRK05429 gamma-glutamyl kinase 100.0 0 0 306.4 18.4 235 5-241 7-261 (372) 10 PRK13402 gamma-glutamyl kinase 100.0 0 0 304.8 18.0 235 4-242 3-255 (363) 11 COG0263 ProB Glutamate 5-kinas 100.0 4.2E-45 0 298.0 18.1 236 5-242 5-260 (369) 12 cd04242 AAK_G5K_ProB AAK_G5K_P 100.0 5.6E-45 0 296.9 18.3 231 8-240 1-251 (251) 13 cd04256 AAK_P5CS_ProBA AAK_P5C 100.0 7.1E-44 0 290.2 17.6 234 6-240 8-284 (284) 14 cd04253 AAK_UMPK-PyrH-Pf AAK_U 100.0 4.6E-42 0 279.0 17.2 212 8-240 1-221 (221) 15 TIGR00657 asp_kinases aspartat 100.0 2.8E-42 0 280.3 14.0 221 6-242 1-312 (504) 16 TIGR02076 pyrH_arch uridylate 100.0 9.2E-40 2.4E-44 264.8 16.2 215 9-240 1-232 (232) 17 PRK07431 aspartate kinase; Pro 100.0 4E-39 1E-43 260.9 15.6 219 6-241 1-244 (594) 18 cd04246 AAK_AK-DapG-like AAK_A 100.0 2.6E-38 6.7E-43 255.8 17.0 215 9-240 2-239 (239) 19 COG0527 LysC Aspartokinases [A 100.0 1.3E-38 3.3E-43 257.8 14.9 221 6-242 1-290 (447) 20 cd04261 AAK_AKii-LysC-BS AAK_A 100.0 5.9E-38 1.5E-42 253.7 17.7 215 9-240 2-239 (239) 21 PRK08841 aspartate kinase; Val 100.0 1.4E-37 3.6E-42 251.3 16.4 220 6-242 1-243 (392) 22 PRK06635 aspartate kinase; Rev 100.0 2.2E-37 5.6E-42 250.1 16.3 220 6-242 1-243 (402) 23 cd04260 AAK_AKi-DapG-BS AAK_AK 100.0 3.7E-37 9.3E-42 248.8 15.0 215 9-240 2-244 (244) 24 PRK08210 aspartate kinase I; R 100.0 1.1E-36 2.7E-41 245.9 16.2 219 6-241 1-247 (405) 25 cd04241 AAK_FomA-like AAK_FomA 100.0 5.5E-36 1.4E-40 241.5 15.8 226 9-239 2-252 (252) 26 PRK06291 aspartate kinase; Pro 100.0 9.2E-36 2.3E-40 240.1 16.6 219 8-242 2-304 (466) 27 KOG1154 consensus 100.0 4.3E-36 1.1E-40 242.2 13.9 239 2-241 4-275 (285) 28 cd02115 AAK Amino Acid Kinases 100.0 2.5E-35 6.3E-40 237.5 16.9 222 10-239 1-248 (248) 29 PRK08961 bifunctional aspartat 100.0 1.4E-34 3.6E-39 232.8 14.3 225 2-242 3-305 (865) 30 cd04244 AAK_AK-LysC-like AAK_A 100.0 4.3E-34 1.1E-38 229.9 16.4 216 8-240 1-298 (298) 31 cd04259 AAK_AK-DapDC AAK_AK-Da 100.0 8E-34 2E-38 228.2 17.2 216 9-240 2-295 (295) 32 PRK05925 aspartate kinase; Pro 100.0 4.6E-34 1.2E-38 229.7 15.7 216 6-241 1-280 (440) 33 cd04234 AAK_AK AAK_AK: Amino A 100.0 6.5E-34 1.7E-38 228.7 16.5 202 9-240 2-227 (227) 34 PRK09436 thrA bifunctional asp 100.0 7.3E-34 1.9E-38 228.4 15.4 216 10-242 3-298 (817) 35 PRK09466 metL bifunctional asp 100.0 5.8E-34 1.5E-38 229.1 14.3 220 5-241 9-300 (810) 36 cd04257 AAK_AK-HSDH AAK_AK-HSD 100.0 3.5E-33 8.9E-38 224.2 17.0 214 10-240 3-294 (294) 37 TIGR01092 P5CS delta l-pyrroli 100.0 1.1E-34 2.9E-39 233.4 8.8 237 2-242 2-294 (738) 38 cd04247 AAK_AK-Hom3 AAK_AK-Hom 100.0 6.1E-33 1.6E-37 222.7 15.8 216 9-242 3-306 (306) 39 cd04258 AAK_AKiii-LysC-EC AAK_ 100.0 1.2E-32 3E-37 221.0 17.1 213 9-240 2-292 (292) 40 PRK09084 aspartate kinase III; 100.0 8.1E-33 2.1E-37 222.0 16.1 215 9-242 2-290 (447) 41 PRK08373 aspartate kinase; Val 100.0 7.6E-33 1.9E-37 222.2 15.0 215 7-242 4-278 (359) 42 cd04243 AAK_AK-HSDH-like AAK_A 100.0 1.9E-32 4.9E-37 219.6 16.6 214 9-240 2-293 (293) 43 PRK09034 aspartate kinase; Rev 100.0 2.9E-33 7.3E-38 224.8 12.1 213 10-241 3-289 (450) 44 cd04245 AAK_AKiii-YclM-BS AAK_ 100.0 1.8E-32 4.7E-37 219.8 15.2 214 6-240 1-288 (288) 45 COG1608 Predicted archaeal kin 100.0 1.3E-31 3.2E-36 214.6 14.5 231 9-241 3-252 (252) 46 pfam00696 AA_kinase Amino acid 100.0 9.4E-31 2.4E-35 209.3 14.1 206 7-219 1-230 (230) 47 cd04255 AAK_UMPK-MosAB AAK_UMP 100.0 2.1E-29 5.5E-34 200.9 16.7 221 9-240 33-262 (262) 48 cd04248 AAK_AK-Ectoine AAK_AK- 100.0 8.8E-28 2.2E-32 190.9 13.7 93 138-240 207-304 (304) 49 PRK09181 aspartate kinase; Val 100.0 1.9E-27 4.9E-32 188.8 14.5 95 138-242 214-313 (476) 50 PRK00942 acetylglutamate kinas 99.9 1.2E-22 3E-27 159.4 14.2 219 8-240 1-262 (262) 51 TIGR02078 AspKin_pair Pyrococc 99.9 3.7E-23 9.4E-28 162.4 11.2 213 9-241 2-277 (341) 52 cd04240 AAK_UC AAK_UC: Unchara 99.9 1.4E-22 3.7E-27 158.8 14.1 199 10-239 1-203 (203) 53 cd04238 AAK_NAGK-like AAK_NAGK 99.9 2.3E-22 5.7E-27 157.6 13.6 217 9-239 1-256 (256) 54 KOG0456 consensus 99.9 4.8E-22 1.2E-26 155.5 12.9 217 8-242 80-376 (559) 55 cd04250 AAK_NAGK-C AAK_NAGK-C: 99.9 5.7E-21 1.5E-25 148.9 14.7 224 5-239 13-279 (279) 56 TIGR00656 asp_kin_monofn aspar 99.8 7.3E-20 1.9E-24 142.1 13.1 220 6-241 1-298 (480) 57 cd04249 AAK_NAGK-NC AAK_NAGK-N 99.8 1.9E-18 4.9E-23 133.4 15.2 217 9-239 1-252 (252) 58 cd04251 AAK_NAGK-UC AAK_NAGK-U 99.8 4.8E-18 1.2E-22 130.9 13.8 208 9-239 1-257 (257) 59 CHL00202 argB acetylglutamate 99.8 1.8E-17 4.6E-22 127.3 16.4 226 4-240 21-283 (284) 60 PRK12353 carbamate kinase; Rev 99.8 6.7E-18 1.7E-22 130.0 14.0 229 6-241 2-312 (312) 61 COG0548 ArgB Acetylglutamate k 99.8 3.4E-17 8.6E-22 125.7 17.6 224 6-241 2-265 (265) 62 cd04237 AAK_NAGS-ABP AAK_NAGS- 99.8 5.5E-17 1.4E-21 124.3 14.7 221 4-239 16-280 (280) 63 COG2054 Uncharacterized archae 99.7 9.1E-17 2.3E-21 123.0 11.8 206 10-241 3-210 (212) 64 PRK12454 carbamate kinase-like 99.7 3.4E-16 8.6E-21 119.5 14.2 228 7-241 3-313 (313) 65 PRK12686 carbamate kinase; Rev 99.7 3.2E-16 8.3E-21 119.6 14.0 228 6-240 3-312 (313) 66 PRK05279 N-acetylglutamate syn 99.7 1.3E-15 3.4E-20 115.8 14.7 221 6-242 25-292 (441) 67 cd04235 AAK_CK AAK_CK: Carbama 99.7 3.7E-15 9.3E-20 113.1 13.2 226 8-240 1-308 (308) 68 cd04252 AAK_NAGK-fArgBP AAK_NA 99.6 2.6E-14 6.6E-19 107.9 15.1 214 10-239 2-248 (248) 69 PRK09411 carbamate kinase; Rev 99.6 2.5E-14 6.4E-19 108.0 13.6 223 6-240 1-296 (297) 70 TIGR00761 argB acetylglutamate 99.6 1.1E-14 2.9E-19 110.1 11.6 199 9-217 2-253 (254) 71 PRK12352 putative carbamate ki 99.6 3.8E-14 9.8E-19 106.8 13.8 228 7-241 3-315 (316) 72 PRK12354 carbamate kinase; Rev 99.6 1.3E-13 3.4E-18 103.5 13.6 228 8-242 2-302 (302) 73 COG0549 ArcC Carbamate kinase 99.6 2.1E-13 5.3E-18 102.3 13.8 228 7-241 1-312 (312) 74 TIGR00746 arcC carbamate kinas 99.5 5.4E-13 1.4E-17 99.8 10.7 229 7-241 1-321 (321) 75 cd04236 AAK_NAGS-Urea AAK_NAGS 99.5 6E-12 1.5E-16 93.3 14.8 209 5-239 34-271 (271) 76 PRK04531 acetylglutamate kinas 99.2 2.7E-10 7E-15 83.1 12.4 217 5-240 19-271 (421) 77 TIGR01890 N-Ac-Glu-synth amino 98.9 2.7E-07 6.8E-12 64.6 14.8 221 7-241 18-288 (439) 78 TIGR01689 EcbF-BcbF capsule bi 91.6 0.4 1E-05 26.5 4.9 48 6-54 1-49 (126) 79 TIGR00705 SppA_67K signal pept 86.9 1.4 3.6E-05 23.1 4.8 38 15-52 100-139 (614) 80 PRK09484 3-deoxy-D-manno-octul 85.3 0.93 2.4E-05 24.3 3.2 18 38-55 60-77 (186) 81 PRK00192 mannosyl-3-phosphogly 83.8 2.6 6.7E-05 21.5 5.0 47 1-54 1-47 (275) 82 PRK10530 phosphotransferase; P 80.9 3.4 8.8E-05 20.8 4.6 46 4-55 1-46 (272) 83 KOG2436 consensus 73.4 6.9 0.00018 18.9 7.7 162 7-177 96-289 (520) 84 PRK10513 sugar phosphatase; Pr 71.3 7.7 0.0002 18.6 5.2 46 4-55 1-46 (270) 85 TIGR01524 ATPase-IIIB_Mg magne 68.4 8.9 0.00023 18.2 6.3 122 11-174 473-597 (892) 86 COG1778 Low specificity phosph 65.7 4 0.0001 20.4 1.9 21 37-57 43-63 (170) 87 TIGR00338 serB phosphoserine p 64.7 8.5 0.00022 18.3 3.4 26 34-59 91-116 (223) 88 pfam03686 UPF0146 Uncharacteri 55.6 11 0.00029 17.6 2.7 72 139-220 19-103 (127) 89 PRK01158 phosphoglycolate phos 55.0 16 0.0004 16.7 5.0 45 4-54 1-45 (226) 90 PRK10964 ADP-heptose:LPS hepto 54.6 16 0.0004 16.7 6.0 20 34-53 75-94 (322) 91 KOG1780 consensus 50.6 7.7 0.0002 18.6 1.2 38 7-47 15-54 (77) 92 TIGR02482 PFKA_ATP 6-phosphofr 50.6 18 0.00047 16.3 4.7 44 10-55 58-101 (302) 93 COG5663 Uncharacterized conser 50.3 7.4 0.00019 18.7 1.1 37 201-237 143-185 (194) 94 PRK08305 spoVFB dipicolinate s 50.2 19 0.00047 16.3 4.0 43 3-56 2-44 (195) 95 TIGR01670 YrbI-phosphatas 3-de 49.7 13 0.00033 17.3 2.3 23 203-225 35-57 (154) 96 TIGR01302 IMP_dehydrog inosine 49.4 19 0.00049 16.2 3.3 33 199-232 325-364 (476) 97 COG0560 SerB Phosphoserine pho 48.9 19 0.00049 16.1 4.3 31 143-173 101-133 (212) 98 pfam05141 DIT1_PvcA Pyoverdine 45.7 22 0.00055 15.9 2.9 45 12-58 28-72 (274) 99 TIGR01077 L13_A_E ribosomal pr 44.4 23 0.00058 15.7 3.2 24 30-53 9-32 (144) 100 TIGR01466 cobJ_cbiH precorrin- 42.9 18 0.00046 16.3 2.1 24 31-55 57-80 (254) 101 PRK10949 protease 4; Provision 42.0 25 0.00063 15.5 4.0 41 5-52 112-152 (618) 102 TIGR01490 HAD-SF-IB-hyp1 HAD-s 41.0 26 0.00066 15.4 3.4 18 38-55 98-115 (204) 103 TIGR01662 HAD-SF-IIIA hydrolas 40.7 25 0.00063 15.5 2.6 33 31-63 30-65 (171) 104 TIGR00099 Cof-subfamily Cof-li 37.2 29 0.00075 15.0 3.9 41 9-55 2-43 (270) 105 TIGR01978 sufC FeS assembly AT 36.9 30 0.00076 15.0 4.8 31 17-57 30-60 (248) 106 COG4598 HisP ABC-type histidin 36.6 30 0.00077 15.0 5.0 26 196-221 203-229 (256) 107 PRK06278 cobyrinic acid a,c-di 34.5 33 0.00083 14.8 3.1 27 202-228 346-372 (482) 108 COG1915 Uncharacterized conser 34.5 33 0.00083 14.8 4.8 43 10-55 168-210 (415) 109 TIGR00213 GmhB_yaeD D,D-heptos 32.6 35 0.00089 14.6 3.6 82 27-120 24-107 (178) 110 PRK11544 hycI hydrogenase 3 ma 32.3 35 0.0009 14.5 2.4 20 6-26 1-20 (156) 111 cd02877 GH18_hevamine_XipI_cla 31.9 36 0.00092 14.5 3.5 26 31-56 58-83 (280) 112 TIGR00300 TIGR00300 conserved 31.7 36 0.00092 14.5 4.1 43 10-55 175-222 (418) 113 CHL00189 infB translation init 31.5 37 0.00093 14.4 5.5 100 131-230 272-399 (770) 114 TIGR02440 FadJ fatty oxidation 30.6 38 0.00096 14.4 2.3 32 22-53 22-55 (732) 115 cd03802 GT1_AviGT4_like This f 30.4 38 0.00097 14.3 4.7 26 31-56 21-47 (335) 116 TIGR02177 PorB_KorB 2-oxoacid: 30.1 38 0.00098 14.3 3.1 30 27-56 9-38 (302) 117 COG0680 HyaD Ni,Fe-hydrogenase 28.9 40 0.001 14.2 5.2 47 5-58 1-47 (160) 118 pfam05116 S6PP Sucrose-6F-phos 28.0 42 0.0011 14.1 4.6 45 5-55 1-45 (247) 119 pfam10686 DUF2493 Protein of u 27.9 42 0.0011 14.1 2.8 16 40-55 26-41 (71) 120 TIGR01313 therm_gnt_kin carboh 27.4 43 0.0011 14.0 3.6 37 15-55 49-87 (175) 121 cd03522 MoeA_like MoeA_like. T 27.1 43 0.0011 14.0 3.1 17 35-51 29-45 (312) 122 TIGR01488 HAD-SF-IB HAD-superf 26.9 44 0.0011 14.0 3.0 17 144-160 113-129 (204) 123 PRK10976 putative sugar phosph 26.7 44 0.0011 13.9 5.0 44 6-55 2-45 (266) 124 PRK10886 DnaA initiator-associ 26.1 45 0.0012 13.9 3.5 30 202-231 126-155 (196) 125 PRK08674 bifunctional phosphog 25.9 46 0.0012 13.8 3.6 22 38-59 98-119 (328) 126 cd02871 GH18_chitinase_D-like 25.4 47 0.0012 13.8 3.7 26 31-56 59-84 (312) 127 COG1126 GlnQ ABC-type polar am 24.9 48 0.0012 13.7 5.0 18 145-162 142-159 (240) 128 PRK07313 phosphopantothenoylcy 24.7 48 0.0012 13.7 3.5 37 7-55 3-39 (180) 129 KOG2599 consensus 24.4 49 0.0012 13.7 5.4 134 23-156 88-237 (308) 130 PRK05920 aromatic acid decarbo 24.4 49 0.0012 13.7 4.2 40 5-56 3-42 (205) 131 pfam08645 PNK3P Polynucleotide 24.2 49 0.0013 13.7 3.0 45 9-53 3-53 (158) 132 TIGR01487 SPP-like SPP-like hy 24.0 50 0.0013 13.6 3.1 34 17-54 11-45 (223) 133 COG0396 sufC Cysteine desulfur 23.6 50 0.0013 13.6 5.0 30 17-56 34-63 (251) 134 PRK13982 bifunctional SbtC-lik 23.4 51 0.0013 13.6 8.0 41 3-55 68-108 (476) 135 pfam00162 PGK Phosphoglycerate 23.2 51 0.0013 13.5 5.7 51 4-55 7-57 (383) 136 COG1979 Uncharacterized oxidor 23.1 52 0.0013 13.5 5.6 58 3-60 25-100 (384) 137 PRK09982 universal stress prot 22.9 52 0.0013 13.5 2.4 48 4-62 1-48 (142) 138 TIGR01494 ATPase_P-type ATPase 22.8 52 0.0013 13.5 3.9 24 202-225 332-355 (478) 139 pfam06581 DUF1135 Protein of u 22.6 35 0.00091 14.5 0.9 21 144-164 148-168 (239) 140 COG0616 SppA Periplasmic serin 22.6 53 0.0013 13.5 2.4 14 8-21 61-74 (317) 141 PRK13936 phosphoheptose isomer 22.4 53 0.0014 13.4 3.5 30 202-231 128-157 (197) 142 PRK09552 mtnX 2-hydroxy-3-keto 21.5 56 0.0014 13.3 3.3 20 204-223 187-206 (219) 143 COG0561 Cof Predicted hydrolas 21.4 56 0.0014 13.3 5.2 45 5-55 2-46 (264) 144 PRK13962 bifunctional phosphog 21.2 56 0.0014 13.3 6.0 51 5-55 12-62 (653) 145 PRK13938 phosphoheptose isomer 21.2 57 0.0014 13.3 3.4 29 28-56 28-56 (196) 146 cd03239 ABC_SMC_head The struc 21.1 57 0.0014 13.3 5.0 19 15-34 24-42 (178) 147 TIGR00075 hypD hydrogenase exp 21.0 57 0.0015 13.3 3.1 31 198-229 211-242 (384) 148 PRK11133 serB phosphoserine ph 20.4 59 0.0015 13.2 4.9 31 143-173 205-237 (322) No 1 >PRK00358 pyrH uridylate kinase; Provisional Probab=100.00 E-value=0 Score=343.46 Aligned_cols=231 Identities=53% Similarity=0.830 Sum_probs=202.4 Q ss_pred EEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 49999940036853887770899999999999999976990999823654102334420377775402467888889899 Q gi|254780777|r 7 KRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTVI 86 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (242) ||||||||||+|++++++++|.++|+++|++|++++++|+||++|+|||..+++........ .....++++.+++.. T Consensus 1 kRIViKiG~s~L~~~~~~~l~~~~i~~la~~I~~l~~~G~evvlVsSGg~~a~~~~~~~~~~---~~~~~d~~g~~at~~ 77 (231) T PRK00358 1 KRVLLKLSGEALAGDEGFGIDPDVLDRIAQEIKELVELGVEVAVVIGGGNIFRGAAAAKAGM---DRVTADYMGMLATVM 77 (231) T ss_pred CEEEEEECHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHCCHHHHHCCC---CCCHHHHHHHHHHHH T ss_conf 95999965565038999983999999999999999977996999983873315264774588---654699999999999 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHHHCCC Q ss_conf 99999999976188641134001002456654321000011001111124655677762157888886416446651475 Q gi|254780777|r 87 NALALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEIGADVILKGTQ 166 (242) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~i~td 166 (242) +.++++..+...+.+....+.............+.....++.+.++++.+..+.++++||++|+++|.+++||+|+++|| T Consensus 78 n~~ll~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~v~Iv~~~~~~~~~~nD~laa~vA~~i~AD~liilTD 157 (231) T PRK00358 78 NALALQDALERAGVNTRVQSAIPMNQVAEPYIRRRAIRHLEKGRVVIFAAGTGNPFFTTDTAAALRAEEIGADVLLKATK 157 (231) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHCCEEEEEECCCCCCCCCCHHHHHHHHHHCCCCEEEEEEC T ss_conf 99999999973697326751223443234010899998753680799966756878771089999998659988988622 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 43512343223444444441159999855998140999999997499699951898547999977998782466 Q gi|254780777|r 167 VDGVYSADPRVHASSTRFDSLTYNQFIEKGLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 167 vdGiyt~dP~~~~~ak~i~~is~~e~l~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) |||+||+||+++|+|++|++++++|+++.|+++||++|+++|+++|||++|+||++|++|.++++||.+||+|+ T Consensus 158 VDGlYt~dP~~~~dAk~I~~Is~~e~~~~g~~~~~~~Aa~~a~~~Gi~~~I~ng~~~~~i~~il~Ge~vGTlI~ 231 (231) T PRK00358 158 VDGVYDADPKKDPDAKKYDTLTYDEVLDKGLKVMDPTAISLARDNKIPIIVFNGNKPGNLKRVVKGENIGTLVS 231 (231) T ss_pred CCCEECCCCCCCCCCEECCCCCHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHCCCCCCEEEC T ss_conf 37416679766998668254579999867754278999999998799899987999348999977999874749 No 2 >cd04254 AAK_UMPK-PyrH-Ec UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of E. coli (Ec) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial and chloroplast UMPKs (this CD) have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of this CD be Probab=100.00 E-value=0 Score=341.19 Aligned_cols=231 Identities=55% Similarity=0.871 Sum_probs=207.5 Q ss_pred EEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 49999940036853887770899999999999999976990999823654102334420377775402467888889899 Q gi|254780777|r 7 KRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTVI 86 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (242) ||||||||||+|++++++++|.++|+++|++|++++++|+||+||+|||+.++++...... ......++++++++.+ T Consensus 1 kRIViKiGGs~L~~~~~~~ld~~~I~~la~~I~~l~~~G~eVviVvggG~i~rg~~~~~~~---~~~~~~d~~g~lat~~ 77 (231) T cd04254 1 KRVLLKLSGEALAGENGFGIDPEVLNRIAREIKEVVDLGVEVAIVVGGGNIFRGASAAEAG---MDRATADYMGMLATVI 77 (231) T ss_pred CEEEEEECHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHCCHHHHHCC---CCCCHHHHHHHHHHHH T ss_conf 9599995166607899988599999999999999997798699998177301242245346---6635188999999999 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHHHCCC Q ss_conf 99999999976188641134001002456654321000011001111124655677762157888886416446651475 Q gi|254780777|r 87 NALALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEIGADVILKGTQ 166 (242) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~i~td 166 (242) +..+++..+...+.++.+.+..+.....+.+..+.....++.+.++++.+..+.++++||++|+++|.+++||.|+++|| T Consensus 78 n~~ll~~al~~~g~~~~~~t~~~~~~~~~~~~~~~~~~~L~~g~VvI~~g~~g~~~~ttD~~aa~~A~~~~ad~~~~~td 157 (231) T cd04254 78 NALALQDALESLGVKTRVMSAIPMQGVAEPYIRRRAIRHLEKGRVVIFAGGTGNPFFTTDTAAALRAIEINADVILKATK 157 (231) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHCCCCEEEEEEE T ss_conf 99999999986799879974223442223112899999973598899966757876551179999998729988999763 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 43512343223444444441159999855998140999999997499699951898547999977998782466 Q gi|254780777|r 167 VDGVYSADPRVHASSTRFDSLTYNQFIEKGLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 167 vdGiyt~dP~~~~~ak~i~~is~~e~l~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) |||+||+||+++|+|+++++++|+|++++|.++||++|+++|+++|||++|+||++|++|.++++||.+||+|| T Consensus 158 vdgvy~~dP~~~~~a~~~~~is~~e~~~~~~~v~d~~A~~~a~~~~i~~~v~~~~~~~~I~k~l~Ge~~GTlI~ 231 (231) T cd04254 158 VDGVYDADPKKNPNAKRYDHLTYDEVLSKGLKVMDATAFTLCRDNNLPIVVFNINEPGNLLKAVKGEGVGTLIS 231 (231) T ss_pred ECCCCCCCCCCCCCCEECCCCCHHHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCCCHHHHHHCCCCCCEEEC T ss_conf 16622689655755554030569999847766243999999998699889973999526999977998864759 No 3 >cd04239 AAK_UMPK-like AAK_UMPK-like: UMP kinase (UMPK)-like, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis. Regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinases of E. coli (Ec) and Pyrococcus furiosus (Pf) are known to function as homohexamers, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial UMPKs have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Als Probab=100.00 E-value=0 Score=336.32 Aligned_cols=229 Identities=50% Similarity=0.782 Sum_probs=205.6 Q ss_pred EEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 99999400368538877708999999999999999769909998236541023344203777754024678888898999 Q gi|254780777|r 8 RVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTVIN 87 (242) Q Consensus 8 rIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (242) |||||||||+|+++++ +++.++|+++|++|++++++|+||++|+|||..++++........ .....+.+..++..+ T Consensus 1 RIViKiG~s~lt~~~~-~l~~~~i~~la~~i~~l~~~g~evilV~s~G~~a~g~~~~~~~~~---~~~~~~~~~~a~~~~ 76 (229) T cd04239 1 RIVLKLSGEALAGEGG-GIDPEVLKEIAREIKEVVDLGVEVAIVVGGGNIARGYIAAARGMP---RATADYIGMLATVMN 76 (229) T ss_pred CEEEEECHHHHCCCCC-CCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHCCCC---CCHHHHHHHHHHHHH T ss_conf 9899976577018999-919999999999999999779979999899734466434415667---135999999999999 Q ss_pred HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHHHCCCC Q ss_conf 99999999761886411340010024566543210000110011111246556777621578888864164466514754 Q gi|254780777|r 88 ALALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEIGADVILKGTQV 167 (242) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~i~tdv 167 (242) ..+++..+..++.+.............+....+.....++.+.+|+..+..+++++|||++|+++|.+++||.|+++||| T Consensus 77 ~~ll~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~g~ipV~~~~~~~~~~dnD~lAa~vA~~v~AD~lIilTDV 156 (229) T cd04239 77 ALALQDALEKLGVKTRVMSAIPMQGVAEPYIRRRAIRHLEKGRIVIFGGGTGNPGFTTDTAAALRAEEIGADVLLKATNV 156 (229) T ss_pred HHHHHHHHHHCCCCEEEECHHHCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC T ss_conf 99999999867997898317760330102239999999853977998365578876500899999997299889861123 Q ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 3512343223444444441159999855998140999999997499699951898547999977998782466 Q gi|254780777|r 168 DGVYSADPRVHASSTRFDSLTYNQFIEKGLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 168 dGiyt~dP~~~~~ak~i~~is~~e~l~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) ||+||+||+++|+|++|++++++|++++|+|+||++|+++|+++||+++|+||++|++|.++++||.+||+|+ T Consensus 157 DGlY~~dP~~~~~Ak~I~~is~~e~~~~g~~~~k~~Aa~~a~~~Gi~~~I~ng~~~~~i~~~l~G~~vGTlI~ 229 (229) T cd04239 157 DGVYDADPKKNPDAKKYDRISYDELLKKGLKVMDATALTLCRRNKIPIIVFNGLKPGNLLRALKGEHVGTLIE 229 (229) T ss_pred CCCCCCCCCCCCCCEECCCCCHHHHHHCCCCEECHHHHHHHHHCCCCEEEECCCCCCHHHHHHCCCCCCEEEC T ss_conf 8617889887998748771289999875987444999999998799699977999338999977999981659 No 4 >TIGR02075 pyrH_bact uridylate kinase; InterPro: IPR015963 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the bacterial proteins.. Probab=100.00 E-value=0 Score=337.02 Aligned_cols=232 Identities=57% Similarity=0.893 Sum_probs=224.8 Q ss_pred CEEEEEEEEEEEEEC-CCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 249999940036853-8877708999999999999999769909998236541023344203777754024678888898 Q gi|254780777|r 6 YKRVLLKVSGEALAG-ISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLST 84 (242) Q Consensus 6 ~KrIViKiGGSsL~~-~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (242) ||||+|||.|.+|+. ++++++|..+|+++|+||+++++.|.||.||.|||+..|+.... ....++...+++|++++ T Consensus 1 ~kRvLlKLSGEaL~g~~~~f~ID~~~l~~iA~eik~~~~~G~evgIViGGGNIfRG~~~~---~~Gi~R~~aDymGMLAT 77 (236) T TIGR02075 1 YKRVLLKLSGEALAGKDSQFGIDPKRLNRIANEIKEVVKKGIEVGIVIGGGNIFRGKSAA---ELGIDRVSADYMGMLAT 77 (236) T ss_pred CCEEEEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHH---HCCCCEEECCCCCHHHH T ss_conf 964899961043247888864268999999999999986794899997782788778898---74896111432305899 Q ss_pred HHHHHHHHHHHHHHCCCCCCCC-CCHHH-HHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHH Q ss_conf 9999999999976188641134-00100-245665432100001100111112465567776215788888641644665 Q gi|254780777|r 85 VINALALDLALRKINVPTVILS-SIFMP-QVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEIGADVIL 162 (242) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~ 162 (242) .++..+++..+...++.+.+.+ +.+.. ...+.+..+.....++.+.++++.+.+|.||.++|+.|++.|.+++||.++ T Consensus 78 viNglaL~~~L~~~g~~~~V~Sfai~~~~~i~E~Y~~~~a~~~le~g~vVIF~gGtGnPfFTTDtaA~LRAiEi~aD~~L 157 (236) T TIGR02075 78 VINGLALKDALEKLGLKTRVLSFAISMPKQICESYIRRKAIKHLEKGKVVIFSGGTGNPFFTTDTAAALRAIEINADVIL 157 (236) T ss_pred HHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCCCHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCEEEE T ss_conf 99999999888736995689877753586413232278999985359789995589869632115888766643134799 Q ss_pred HCCC-CCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 1475-43512343223444444441159999855998140999999997499699951898547999977998782466 Q gi|254780777|r 163 KGTQ-VDGVYSADPRVHASSTRFDSLTYNQFIEKGLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 163 i~td-vdGiyt~dP~~~~~ak~i~~is~~e~l~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) ..|+ ||||||+||+++|+|++++.|||+|+|..+.+|||.+|.-+|++|++|++|||-++|++|.+++.||.+||+|+ T Consensus 158 kgt~GVDGVY~~DPkknk~A~~y~~itY~~~L~~~L~VMD~TA~~La~dnnlpi~VFnI~~~g~l~~vi~g~~~gTlv~ 236 (236) T TIGR02075 158 KGTNGVDGVYTADPKKNKDAKKYDTITYNEVLKKNLKVMDLTAFSLAKDNNLPIVVFNIDEPGALKKVILGKGIGTLVS 236 (236) T ss_pred EECCCCCCCCCCCCCCCCCCEECCCCCHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHCCCCCEEEEC T ss_conf 8104888023587885987523166798999861471135899999987789759966665301645322784457528 No 5 >TIGR01027 proB glutamate 5-kinase; InterPro: IPR005715 L-glutamate 5-phosphotransferase, (gamma-glutamyl kinase, proB, 2.7.2.11 from EC), catalyzes the first step in proline biosynthesisATP + L-glutamate = ADP + L-glutamate 5-phosphate. the product of which rapidly cyclises to 5-oxoproline and phosphate. Bacterial ProB proteins hits the full length of this model, but the ProB-like domain of delta 1-pyrroline-5-carboxylate synthetase does not hit the C-terminal 100 residues. ; GO: 0004349 glutamate 5-kinase activity, 0006561 proline biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=332.93 Aligned_cols=235 Identities=23% Similarity=0.299 Sum_probs=207.3 Q ss_pred EEEEEEEEEEEEECCCCC-CCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 499999400368538877-7089999999999999997699099982365410233442037777540246788888989 Q gi|254780777|r 7 KRVLLKVSGEALAGISGF-GVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTV 85 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~-~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (242) ||||+|||+|+||++++. .++..+|..|+++|++++++||||||||||+ .+.+....+....+.+....+++...++. T Consensus 1 ~riVvK~GsS~Lt~~~g~h~l~~~~i~~lv~~v~~L~~~Gh~vviVSSGA-~AaG~~~LG~~~rP~~la~KQAlAAVGQ~ 79 (379) T TIGR01027 1 KRIVVKVGSSSLTGSSGSHELDRSRIAELVEQVAALHAAGHEVVIVSSGA-VAAGFEALGLPERPKTLAEKQALAAVGQV 79 (379) T ss_pred CEEEEEEECCEEECCCCCCEECHHHHHHHHHHHHHHHHCCCEEEEEECCH-HHHHHHHCCCCCCCCCHHHHHHHHHCCHH T ss_conf 96899970321117888741377999999999999986599899981675-88623445569998625678878731456 Q ss_pred HHHHHHHHHHHHHCCCC--CCCCCCHHHHHH--HHHHHHHH-CCHHHHCCCCCCCCCC-----CCCCCCCHHHHHHHHHH Q ss_conf 99999999997618864--113400100245--66543210-0001100111112465-----56777621578888864 Q gi|254780777|r 86 INALALDLALRKINVPT--VILSSIFMPQVC--EVFSCRNA-VSYLSQGKVVIFSGGT-----GNAFLTTDSAAALRASE 155 (242) Q Consensus 86 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~--~~~~~~~~-~~~~~~~~~~~~~~~~-----~~~~~~~D~~a~~ia~~ 155 (242) .++..|...|..||.+. .+++..++..+. ++.|-+.+ ..+++.+.+|++++++ |+.|||||++||++|.+ T Consensus 80 ~Lm~~y~~~F~~Yg~~~aQiLLTr~D~~~R~eeRy~NAr~TL~~Ll~~gvvPIiNENDTVaveEi~fGDND~LSAlvA~L 159 (379) T TIGR01027 80 RLMQLYENLFSSYGLKVAQILLTRADFSKREEERYLNARNTLEALLELGVVPIINENDTVAVEEIKFGDNDTLSALVAIL 159 (379) T ss_pred HHHHHHHHHHHHCCCCHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCEEEEEEEECCCCHHHHHHHHH T ss_conf 89999999997548822243405798863378999999999999986594789867742244000546781699999999 Q ss_pred HCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHH--------------HHCCCCCCCHHHHHHHHHCCCCEEEEECC Q ss_conf 1644665147543512343223444444441159999--------------85599814099999999749969995189 Q gi|254780777|r 156 IGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQF--------------IEKGLKVMDCASVVLARDCSIPIIVFSIH 221 (242) Q Consensus 156 ~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~--------------l~~G~~v~~~~A~~~a~~~gI~v~I~ng~ 221 (242) ++||.|+++|||||+||+|||+||+|++|+++...+. .+.|+|-+|+.||++|.++|||++|+.|. T Consensus 160 v~AD~L~LLTD~dGLYd~dPR~nPdA~~I~~V~~~~~~~~~~aG~S~g~~~~GTGGM~sK~~AA~~A~~~Gv~v~i~~g~ 239 (379) T TIGR01027 160 VGADLLVLLTDVDGLYDADPRTNPDAKLIPVVEEITDLLLGVAGDSSGSSKVGTGGMRSKIEAARLATRAGVPVIIASGS 239 (379) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCEECCCCCCCCHHHHEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 97528977617553306874537851453412453323411114667788756646789999999998689958994488 Q ss_pred CCCHHHHHHC----CCCCCCEECCC Q ss_conf 8547999977----99878246689 Q gi|254780777|r 222 SPGGIWGGLS----GIGRSTIISGE 242 (242) Q Consensus 222 ~~~~i~~~l~----Ge~~GT~I~~~ 242 (242) +|++|.+++. |+.+||+|.++ T Consensus 240 ~P~~i~~~~~hhfyg~~~GT~F~a~ 264 (379) T TIGR01027 240 KPEKIADALEHHFYGAPVGTVFEAQ 264 (379) T ss_pred CHHHHHHHHHHHCCCCCCCCEECCC T ss_conf 9789999999740377877443357 No 6 >COG0528 PyrH Uridylate kinase [Nucleotide transport and metabolism] Probab=100.00 E-value=0 Score=310.22 Aligned_cols=236 Identities=52% Similarity=0.788 Sum_probs=226.9 Q ss_pred CCCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 88824999994003685388777089999999999999997699099982365410233442037777540246788888 Q gi|254780777|r 3 DFPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGML 82 (242) Q Consensus 3 ~~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (242) +.+|||||+||||+.|..++++++|.+++.++|++|+++.+.|.||.||+|||..+|++..... ..+...+++++++ T Consensus 2 ~~~~~rillkLsGe~l~g~~~~gid~~~i~~~a~~i~~~~~~g~eV~iVvGGGni~Rg~~~~~~---g~~r~~~D~mGml 78 (238) T COG0528 2 KPKYMRILLKLSGEALAGEQGFGIDPEVLDRIANEIKELVDLGVEVAVVVGGGNIARGYIGAAA---GMDRVTADYMGML 78 (238) T ss_pred CCCEEEEEEEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHC---CCCHHHHHHHHHH T ss_conf 8633799999421036478888879899999999999998669689999789789976789876---9851224688899 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHH Q ss_conf 98999999999997618864113400100245665432100001100111112465567776215788888641644665 Q gi|254780777|r 83 STVINALALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEIGADVIL 162 (242) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~ 162 (242) +++++..+++.++...++++.+.+........+....+.....++.+.++++.+.++.|+.++|+.|+++|.+++||.++ T Consensus 79 aTvmNal~L~~aL~~~~~~~~v~sai~~~~~~e~~~~~~A~~~l~~grVvIf~gGtg~P~fTTDt~AALrA~ei~ad~ll 158 (238) T COG0528 79 ATVMNALALQDALERLGVDTRVQSAIAMPQVAEPYSRREAIRHLEKGRVVIFGGGTGNPGFTTDTAAALRAEEIEADVLL 158 (238) T ss_pred HHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCEEE T ss_conf 99999999999986358761221311176666866799999999749989991888999873479999999983886899 Q ss_pred HCCC-CCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 1475-435123432234444444411599998559981409999999974996999518985479999779987824668 Q gi|254780777|r 163 KGTQ-VDGVYSADPRVHASSTRFDSLTYNQFIEKGLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 163 i~td-vdGiyt~dP~~~~~ak~i~~is~~e~l~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) ..|+ ||||||+||+++|+|++++++||+|+++.+..+||++|+.+|++++||+++||.++|++|.+++.||.+||+|++ T Consensus 159 ~atn~VDGVY~~DPkk~pdA~~~~~Lty~e~l~~~l~vmD~tA~~l~~~~~i~i~Vfn~~~~~~l~~~~~ge~~GT~V~~ 238 (238) T COG0528 159 KATNKVDGVYDADPKKDPDAKKYDTLTYDEVLKIGLKVMDPTAFSLARDNGIPIIVFNINKPGNLKRALKGEEVGTIVEP 238 (238) T ss_pred EECCCCCCEECCCCCCCCCCEECCCCCHHHHHHHCCEEECHHHHHHHHHCCCCEEEEECCCCCCHHHHHCCCCCCEEECC T ss_conf 84067780368999889882232237999999724624059999999975994899937887659999748967417069 No 7 >PRK12314 gamma-glutamyl kinase; Provisional Probab=100.00 E-value=0 Score=309.83 Aligned_cols=235 Identities=16% Similarity=0.210 Sum_probs=187.1 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 24999994003685388777089999999999999997699099982365410233442037777540246788888989 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTV 85 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (242) -||||||+|||+|+++++ +++.++++++|++|++++++|++++||+||+.. .+.........+.+..+.+++...++. T Consensus 9 ~krIViKiG~s~lt~~~g-~~~~~~i~~la~~I~~l~~~G~~vvlVsSGav~-~G~~~l~~~~~~~~~~~~qa~aavGq~ 86 (265) T PRK12314 9 AKRIVIKVGSSSLSYENG-KINLERIEQLVFVISDLMNKGKEVILVSSGAIG-AGLTKLKLDKRPTNLAEKQALAAVGQP 86 (265) T ss_pred CCEEEEEECHHHEECCCC-CCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHH-CCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 878999966112678999-859999999999999999789989998637443-150013887787624679999997279 Q ss_pred HHHHHHHHHHHHHCCCCCCC--CCCHHHHHHHHHH-HHHHCCHHHHCCCCCCCCCC-----CC-CCCCCHHHHHHHHHHH Q ss_conf 99999999997618864113--4001002456654-32100001100111112465-----56-7776215788888641 Q gi|254780777|r 86 INALALDLALRKINVPTVIL--SSIFMPQVCEVFS-CRNAVSYLSQGKVVIFSGGT-----GN-AFLTTDSAAALRASEI 156 (242) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~~-~~~~~D~~a~~ia~~~ 156 (242) .++.+|...|..++.++.+. +...........+ .+....+++.+.+|++++.+ ++ .|||||++|+++|.++ T Consensus 87 ~Lm~~y~~~f~~~~~~~aQiLlt~~d~~~~~~~~n~~~tl~~ll~~g~IPIvneND~Vat~ei~~fGdnD~lAA~vA~~i 166 (265) T PRK12314 87 ELMSLYSQFFARYGIVVGQILLTRDDFDSPKSRANVKNTFESLLELGILPIVNENDAVATDEIDKFGDNDRLSAIVAKLV 166 (265) T ss_pred HHHHHHHHHHHHCCCCEEEEECCHHHCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHC T ss_conf 99999998865449815888504432127999999999999998489566870788531465445457509999998755 Q ss_pred CHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCH--HHHHH----------CCCCCCCHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 644665147543512343223444444441159--99985----------599814099999999749969995189854 Q gi|254780777|r 157 GADVILKGTQVDGVYSADPRVHASSTRFDSLTY--NQFIE----------KGLKVMDCASVVLARDCSIPIIVFSIHSPG 224 (242) Q Consensus 157 ~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~--~e~l~----------~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~ 224 (242) +||+|+++||||||||+||+.+|+|++|+++++ ++.+. .|+|.+|+.|+++|.++|||++|+||++|+ T Consensus 167 ~AD~liilTdVDGvY~~dP~~~~dAk~I~~v~~~~~~~~~~~~~~~s~~gtGGM~sKl~Aa~~a~~~Gi~v~I~ng~~~~ 246 (265) T PRK12314 167 KADLLIILSDIDGLYDKNPRIYPDAKLLSEVTEITEEILALAGGAGSKFGTGGMVTKLKAAKFLMEAGIKMVLANGFNPS 246 (265) T ss_pred CCCEEEEEECCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 97669997356742279988799973840245679789975147888877686489999999999789919997089962 Q ss_pred HHHHHHCCCCCCCEECCC Q ss_conf 799997799878246689 Q gi|254780777|r 225 GIWGGLSGIGRSTIISGE 242 (242) Q Consensus 225 ~i~~~l~Ge~~GT~I~~~ 242 (242) +|.++++||.+||++.|+ T Consensus 247 ~i~~~l~ge~~GT~f~P~ 264 (265) T PRK12314 247 DILSFLEGESIGTLFAPK 264 (265) T ss_pred HHHHHHCCCCCCEEECCC T ss_conf 899997699871598068 No 8 >PRK12443 uridylate kinase; Reviewed Probab=100.00 E-value=0 Score=306.14 Aligned_cols=235 Identities=38% Similarity=0.616 Sum_probs=221.4 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 82499999400368538877708999999999999999769909998236541023344203777754024678888898 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLST 84 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (242) +||||++|++|.+|+.++++++|.++++.+|++|+++.+.|.|+.||.|||+++|++... ....++..++.++++++ T Consensus 3 ~ykRVllKlSGEaL~g~~~~g~d~~~l~~ia~eI~~~~~~G~evaiVvGGGNi~RG~~~~---~~gidr~~aD~iGMLaT 79 (247) T PRK12443 3 PYKRVLIKLSGGALADQTGNSFNSKRLEHIANEILSIVDLGIEVSIVIGGGNIFRGHLAE---EWGIDRVEADNIGTLGT 79 (247) T ss_pred CCCEEEEEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHHHH---HCCCCCCHHHHHHHHHH T ss_conf 867899997388828999888799999999999999997798799997688502322365---53987310368889999 Q ss_pred HHHHHHHHHHHHH-HCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHHH Q ss_conf 9999999999976-188641134001002456654321000011001111124655677762157888886416446651 Q gi|254780777|r 85 VINALALDLALRK-INVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEIGADVILK 163 (242) Q Consensus 85 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~i 163 (242) .++..+++..+.. .+.++.+.+........+.+..+.....++.+.++++.+.++.|+.++|+.|++.|.+++||.++. T Consensus 80 vmNal~L~~~l~~~~g~~~rv~sai~~~~v~e~y~~~rA~~~LekG~VVIfagGTGnP~fTTDtaAaLrA~Ei~Ad~lL~ 159 (247) T PRK12443 80 IINSLMLRGVLTSKTNKEVRVMTSIPFNAVAEPYIRLRAVHHLDNGYIVIFGGGNGQPFVTTDYPSVQRAIEMNSDAILV 159 (247) T ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHCCCEEEEECCCCCCCCCCCHHHHHHHHHHCCHHHHC T ss_conf 99999999999975599469973233674567777899999973697899978878875204399999998828613321 Q ss_pred CCC-CCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 475-4351234322344444444115999985599814099999999749969995189854799997799878246689 Q gi|254780777|r 164 GTQ-VDGVYSADPRVHASSTRFDSLTYNQFIEKGLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 164 ~td-vdGiyt~dP~~~~~ak~i~~is~~e~l~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) .|. ||||||+||+++|+|++++++||+|+++.+.++||.+|+.+|+++++|++|||.++|++|.+++.||.+||+|+.| T Consensus 160 at~~VDGVYd~DP~k~~dA~k~~~ls~~e~l~~~L~vMD~tA~~lc~~~~ipi~Vfn~~~~g~l~ka~~Ge~iGT~V~~~ 239 (247) T PRK12443 160 AKQGVDGVFTSDPKHNKSAKMYRKLNYNDVVRQNIQVMDQAALLLARDYNLPAHVFNFDEPGVMRRICLGEHVGTLINDD 239 (247) T ss_pred CCCCCCCCCCCCCCCCCCCEEEEECCHHHHHHCCCCCHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHCCCCCCEEECCC T ss_conf 05665776468988999754443337999998699442999999999739988998289986399996799863288188 No 9 >PRK05429 gamma-glutamyl kinase; Provisional Probab=100.00 E-value=0 Score=306.38 Aligned_cols=235 Identities=19% Similarity=0.231 Sum_probs=194.4 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 82499999400368538877708999999999999999769909998236541023344203777754024678888898 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLST 84 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (242) +.||||||+|+|+|+++++. ++.++|..+++||++++++|++|++||||+. +.+.........+.+..+.+++...++ T Consensus 7 ~~krIVIKiGSs~Lt~~~g~-l~~~~i~~l~~~I~~L~~~G~evilVSSGAI-a~G~~~L~~~~rp~~l~~~QA~AAvGQ 84 (372) T PRK05429 7 DARRIVVKVGSSLLTDDGGG-LDRARIAELARQIAALRAAGHEVVLVSSGAV-AAGRSRLGLPKRPKTLAEKQAAAAVGQ 84 (372) T ss_pred CCCEEEEECCCCEEECCCCC-CCHHHHHHHHHHHHHHHHCCCEEEEECCHHH-HHHHHHCCCCCCCCCHHHHHHHHHHCH T ss_conf 28789999174336898888-8899999999999999978999999884079-862776099988563778889987156 Q ss_pred HHHHHHHHHHHHHHCCCCCC--CCCCHHHHHHHHHHHH-HHCCHHHHCCCCCCCCCC-----CCCCCCCHHHHHHHHHHH Q ss_conf 99999999999761886411--3400100245665432-100001100111112465-----567776215788888641 Q gi|254780777|r 85 VINALALDLALRKINVPTVI--LSSIFMPQVCEVFSCR-NAVSYLSQGKVVIFSGGT-----GNAFLTTDSAAALRASEI 156 (242) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~-----~~~~~~~D~~a~~ia~~~ 156 (242) ..++..|...|..++..+.+ ++..++..+..+.+.+ ....+++.+.+|++++.+ ++.|||||++|+++|.++ T Consensus 85 ~~Lm~~Y~~~f~~~~~~~aQiLlT~~D~~~r~rylN~r~tl~~Ll~~gvIPIINENDtVat~Ei~fGDND~LaA~VA~li 164 (372) T PRK05429 85 SRLMQAYEELFARYGITVAQILLTRDDLSDRERYLNARATLRTLLELGVVPIINENDTVATDEIKFGDNDTLSALVANLV 164 (372) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCHHHHCCHHHHHHHHHHHHHHHHCCCEEECCCCCCCEECCCCCCCCCHHHHHHHHHH T ss_conf 99999999999863985578862665640458999999999999877960001477740005555578515882687773 Q ss_pred CHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHH--HHH----------HCCCCCCCHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 6446651475435123432234444444411599--998----------5599814099999999749969995189854 Q gi|254780777|r 157 GADVILKGTQVDGVYSADPRVHASSTRFDSLTYN--QFI----------EKGLKVMDCASVVLARDCSIPIIVFSIHSPG 224 (242) Q Consensus 157 ~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~--e~l----------~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~ 224 (242) +||+|+++|||||+||+||+.+|+|++|+++... +.. +.|+|.+|+.||++|.++||+++|+||++|+ T Consensus 165 ~ADlLiiLTdvdGly~~~P~~~~~A~~I~~v~~i~~~i~~~~~~~~s~~GtGGM~tKl~AA~~a~~~Gi~~iIa~G~~~~ 244 (372) T PRK05429 165 EADLLILLTDIDGLYTADPRKNPDAKLIPEVEAITDELEAMAGGAGSGLGTGGMATKLEAARIATRAGIPVVIASGRKPD 244 (372) T ss_pred CCCEEEEEECCCCCCCCCCCCCCCCCEEEEECCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCC T ss_conf 88779997456753358997686540441205787789987168877887784377999999999779989992499965 Q ss_pred HHHHHHCCCCCCCEECC Q ss_conf 79999779987824668 Q gi|254780777|r 225 GIWGGLSGIGRSTIISG 241 (242) Q Consensus 225 ~i~~~l~Ge~~GT~I~~ 241 (242) .|.++++|+.+||++.| T Consensus 245 ~l~~i~~g~~~GT~f~~ 261 (372) T PRK05429 245 VLLRLLAGEAVGTLFLP 261 (372) T ss_pred HHHHHHCCCCCEEEEEC T ss_conf 89999779972699943 No 10 >PRK13402 gamma-glutamyl kinase; Provisional Probab=100.00 E-value=0 Score=304.77 Aligned_cols=235 Identities=17% Similarity=0.161 Sum_probs=186.4 Q ss_pred CCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 88249999940036853887770899999999999999976990999823654102334420377775402467888889 Q gi|254780777|r 4 FPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLS 83 (242) Q Consensus 4 ~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (242) -.|||||||+|+|+|+++++ +++.++|.++|++|++++++|++|++|+||+.. .++........+ ...+.++....+ T Consensus 3 ~~~kRIVIKiGSs~lt~~~g-~l~~~~i~~l~~~I~~L~~~G~evilVSSGAia-~G~~~L~~~~rp-~l~~kQA~AAvG 79 (363) T PRK13402 3 SPWKRIVVKVGSALITPHKQ-GCSSHYLLGIAQFIVYLRAQGHQVVLVSSGSVA-AGYHKLGFIDRP-SVPEKKAMAAAG 79 (363) T ss_pred CCCCEEEEEECCCEEECCCC-CCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHH-HHHHHCCCCCCC-CHHHHHHHHHHH T ss_conf 98866999967010689999-878999999999999999789989998878799-647760997689-757888999853 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHH-HCCHHHHCCCCCCCCC-----CCCCCCCCHHHHHHHHHHHC Q ss_conf 89999999999976188641134001002456654321-0000110011111246-----55677762157888886416 Q gi|254780777|r 84 TVINALALDLALRKINVPTVILSSIFMPQVCEVFSCRN-AVSYLSQGKVVIFSGG-----TGNAFLTTDSAAALRASEIG 157 (242) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-----~~~~~~~~D~~a~~ia~~~~ 157 (242) +..++..|...| .....+.+++..++..+..+.+.+. ...+++.+.+|++++. .|+.|||||++|+++|.+++ T Consensus 80 Q~~Lm~~Y~~~F-~~~~AQiLLT~~D~~~r~rylN~r~Tl~~Ll~~gvIPIINENDtVat~Ei~fGDND~LaA~VA~li~ 158 (363) T PRK13402 80 QGDMMATWSKLF-DFPAAQLLLTHGDLRDRERYISIRNTINVLLEHGILPIINENDAVTTDKLKVGDNDNLSAMVAALAD 158 (363) T ss_pred HHHHHHHHHHHH-CCHHHHHHCCHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEEECCCCCHHHHHHHHHCC T ss_conf 999999999984-7357677188878616478899999999999689422115888501001211575079999998718 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHH--HHH----------HCCCCCCCHHHHHHHHHCCCCEEEEECCCCCH Q ss_conf 446651475435123432234444444411599--998----------55998140999999997499699951898547 Q gi|254780777|r 158 ADVILKGTQVDGVYSADPRVHASSTRFDSLTYN--QFI----------EKGLKVMDCASVVLARDCSIPIIVFSIHSPGG 225 (242) Q Consensus 158 ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~--e~l----------~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~ 225 (242) ||.|+++|||||+||+||+.+|+|++|++++.. +.. +.|+|.+|+.||++|.++||+++|+||++|++ T Consensus 159 ADlLilLTDvdGLYd~dP~~~~~AklI~~V~~i~~~i~~~a~~~~s~~GtGGM~TKl~AA~~a~~~Gi~~~Ia~G~~~~~ 238 (363) T PRK13402 159 ADTLIICSDVDGLYDQNPRTNPDAKLIKEVTEINAEIYAMAGGAGSEVGTGGMRTKIQAAKKAISHGIETFIINGFTADI 238 (363) T ss_pred CCEEEEECCCCCCCCCCCCCCCCCCEEEEECCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCCH T ss_conf 88899941668444799877876643678515667789871687788887834789999999997799699816999747 Q ss_pred HHHHHCCCCCCCEECCC Q ss_conf 99997799878246689 Q gi|254780777|r 226 IWGGLSGIGRSTIISGE 242 (242) Q Consensus 226 i~~~l~Ge~~GT~I~~~ 242 (242) |.++++|+.+||++.|. T Consensus 239 i~~i~~g~~~GT~F~p~ 255 (363) T PRK13402 239 FNQLLKGQNPGTLFTPD 255 (363) T ss_pred HHHHHCCCCCCEEEECC T ss_conf 99997089862499547 No 11 >COG0263 ProB Glutamate 5-kinase [Amino acid transport and metabolism] Probab=100.00 E-value=4.2e-45 Score=298.02 Aligned_cols=236 Identities=19% Similarity=0.284 Sum_probs=198.1 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 82499999400368538877708999999999999999769909998236541023344203777754024678888898 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLST 84 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (242) ++||||+|||+|+|+++.+ .++..+|..++++|++++++|+||+|||||+ .+.+....+....+......++....++ T Consensus 5 ~~~riVvKiGSs~Lt~~~g-~l~~~~l~~l~~~ia~L~~~G~eVilVSSGA-iaaG~~~Lg~~~rp~~l~~kQA~AAVGQ 82 (369) T COG0263 5 SARRIVVKIGSSSLTDGTG-GLDRSKLEELVRQVAALHKAGHEVVLVSSGA-IAAGRTRLGLPKRPKTLAEKQAAAAVGQ 82 (369) T ss_pred CCEEEEEEECCCEEECCCC-CCCHHHHHHHHHHHHHHHHCCCEEEEECCCH-HHHCHHHCCCCCCCCCHHHHHHHHHHCH T ss_conf 6537999977502357999-8298999999999999986898899983446-6517444199988761288899987379 Q ss_pred HHHHHHHHHHHHHHCCCC--CCCCCCHHHHHHHHHHHHHH-CCHHHHCCCCCCCCCC-----CCCCCCCHHHHHHHHHHH Q ss_conf 999999999997618864--11340010024566543210-0001100111112465-----567776215788888641 Q gi|254780777|r 85 VINALALDLALRKINVPT--VILSSIFMPQVCEVFSCRNA-VSYLSQGKVVIFSGGT-----GNAFLTTDSAAALRASEI 156 (242) Q Consensus 85 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-----~~~~~~~D~~a~~ia~~~ 156 (242) ..++..|...|..++++. .+++..+...+....+.+.+ ..+++.+.+|++++.+ |+.|||||++++++|.++ T Consensus 83 ~~Lm~~y~~~f~~~g~~v~QiLLTr~D~~~r~ry~Nar~Tl~~Ll~~gvVPIINENDtva~~EikfGDND~LsA~VA~lv 162 (369) T COG0263 83 VRLMQLYEELFARYGIKVGQILLTRDDFSDRRRYLNARNTLSALLELGVVPIINENDTVATEEIKFGDNDTLSALVAILV 162 (369) T ss_pred HHHHHHHHHHHHHCCCEEEEEEEEHHHHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCEEEEEEEECCCCHHHHHHHHHH T ss_conf 99999999999860974568986031200089999999999999978962246487860441036568746999999874 Q ss_pred CHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHH--HHHH----------CCCCCCCHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 6446651475435123432234444444411599--9985----------599814099999999749969995189854 Q gi|254780777|r 157 GADVILKGTQVDGVYSADPRVHASSTRFDSLTYN--QFIE----------KGLKVMDCASVVLARDCSIPIIVFSIHSPG 224 (242) Q Consensus 157 ~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~--e~l~----------~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~ 224 (242) +||.|+++||+||+||+||+++|+|++|+++... |... .|+|.+|+.|+++|.++||+++|+||.+|+ T Consensus 163 ~ADlLvlLsDiDGLyd~nPr~~pdAk~i~~V~~it~ei~~~aggsgs~~GTGGM~TKl~AA~iA~~aG~~~iI~~g~~~~ 242 (369) T COG0263 163 GADLLVLLSDIDGLYDANPRTNPDAKLIPEVEEITPEIEAMAGGSGSELGTGGMRTKLEAAKIATRAGVPVIIASGSKPD 242 (369) T ss_pred CCCEEEEEECCCCCCCCCCCCCCCCEEEHHHCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 87879998746855589988799874610213468889987467777777561888999999999769958994699831 Q ss_pred HHHHHHCCCCCCCEECCC Q ss_conf 799997799878246689 Q gi|254780777|r 225 GIWGGLSGIGRSTIISGE 242 (242) Q Consensus 225 ~i~~~l~Ge~~GT~I~~~ 242 (242) .|.+++.|+.+||++.++ T Consensus 243 ~i~~~~~~~~~GT~F~~~ 260 (369) T COG0263 243 VILDALEGEAVGTLFEPQ 260 (369) T ss_pred HHHHHHHCCCCCCEEECC T ss_conf 589997189885189358 No 12 >cd04242 AAK_G5K_ProB AAK_G5K_ProB: Glutamate-5-kinase (G5K) catalyzes glutamate-dependent ATP cleavage; G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, in the first and controlling step of proline (and, in mammals, ornithine) biosynthesis. G5K is subject to feedback allosteric inhibition by proline or ornithine. In microorganisms and plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia. Microbial G5K generally consists of two domains: a catalytic G5K domain and one PUA (pseudo uridine synthases and archaeosine-specific transglycosylases) domain, and some lack the PUA domain. G5K requires free Mg for activity, it is tetrameric, and it aggregates to higher forms in a proline-dependent way. G5K lacking the PUA domain remains tetrameric, active, and proline-inhibitable, but the Mg requir Probab=100.00 E-value=5.6e-45 Score=296.94 Aligned_cols=231 Identities=18% Similarity=0.267 Sum_probs=180.8 Q ss_pred EEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 99999400368538877708999999999999999769909998236541023344203777754024678888898999 Q gi|254780777|r 8 RVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTVIN 87 (242) Q Consensus 8 rIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (242) |||||+|||+|+++++ +++.++|+++|++|++++++|++|++||||+.. .+....+....+.+..+.++....++..+ T Consensus 1 RiViKiGss~lt~~~~-~~~~~~i~~la~~I~~l~~~G~evvlVsSGAva-~G~~~l~~~~~~~~~~~~qa~AavGQ~~L 78 (251) T cd04242 1 RIVVKVGSSLLTDEDG-GLDLGRLASLVEQIAELRNQGKEVILVSSGAVA-AGRQRLGLEKRPKTLPEKQALAAVGQSLL 78 (251) T ss_pred CEEEEECCCEEECCCC-CCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHH-HCHHHCCCCCCCCHHHHHHHHHHHCCHHH T ss_conf 9899967011689999-859999999999999999789979998247566-18765399878640577889998571999 Q ss_pred HHHHHHHHHHHCCCCCCC--CCCHHHHHHHHHHHH-HHCCHHHHCCCCCCCCC-----CCCCCCCCHHHHHHHHHHHCHH Q ss_conf 999999997618864113--400100245665432-10000110011111246-----5567776215788888641644 Q gi|254780777|r 88 ALALDLALRKINVPTVIL--SSIFMPQVCEVFSCR-NAVSYLSQGKVVIFSGG-----TGNAFLTTDSAAALRASEIGAD 159 (242) Q Consensus 88 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-----~~~~~~~~D~~a~~ia~~~~ad 159 (242) +.+|...|..++.++.+. +...+..+....+.+ ....+++.+.+|++++. .++.|+|||++|+++|.+++|| T Consensus 79 m~~y~~~f~~~g~~~aQiLlt~~d~~~~~~~~n~~~ti~~LL~~g~iPIiNEND~vat~ei~fGDnD~lAa~vA~~i~Ad 158 (251) T cd04242 79 MALYEQLFAQYGIKVAQILLTRDDFEDRKRYLNARNTLETLLELGVIPIINENDTVATEEIRFGDNDRLSALVAGLVNAD 158 (251) T ss_pred HHHHHHHHHHCCCCCEEEECCHHHCCCHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHHCCCC T ss_conf 99999998872985003523565402749999999999999978964675588855212233447747999998651888 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCCCCCCCCH--HHHHH----------CCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHH Q ss_conf 665147543512343223444444441159--99985----------599814099999999749969995189854799 Q gi|254780777|r 160 VILKGTQVDGVYSADPRVHASSTRFDSLTY--NQFIE----------KGLKVMDCASVVLARDCSIPIIVFSIHSPGGIW 227 (242) Q Consensus 160 ~l~i~tdvdGiyt~dP~~~~~ak~i~~is~--~e~l~----------~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~ 227 (242) .|+++|||||+||+||+++|+|++|++++. ++... .|+|..|+.|+++|.++|+|++|+||++|++|. T Consensus 159 ~LiilTdVDGvY~~dP~~~~~Ak~i~~v~~~~~~i~~~~~~~~s~~gtGGM~sKl~AA~~a~~~Gi~~~I~~g~~~~~i~ 238 (251) T cd04242 159 LLILLSDVDGLYDKNPRENPDAKLIPEVEEITDEIEAMAGGSGSSVGTGGMRTKLKAARIATEAGIPVVIANGRKPDVLL 238 (251) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCEECCCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHH T ss_conf 79996446732368987687666342635798799977077888887387389999999999889969998289977899 Q ss_pred HHHCCCCCCCEEC Q ss_conf 9977998782466 Q gi|254780777|r 228 GGLSGIGRSTIIS 240 (242) Q Consensus 228 ~~l~Ge~~GT~I~ 240 (242) ++++||.+||++. T Consensus 239 ~~l~g~~vGT~f~ 251 (251) T cd04242 239 DILAGEAVGTLFL 251 (251) T ss_pred HHHCCCCCCEEEC T ss_conf 9976999841669 No 13 >cd04256 AAK_P5CS_ProBA AAK_P5CS_ProBA: Glutamate-5-kinase (G5K) domain of the bifunctional delta 1-pyrroline-5-carboxylate synthetase (P5CS), composed of an N-terminal G5K (ProB) and a C-terminal glutamyl 5- phosphate reductase (G5PR, ProA), the first and second enzyme catalyzing proline (and, in mammals, ornithine) biosynthesis. G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, and is subject to feedback allosteric inhibition by proline or ornithine. In plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia. Probab=100.00 E-value=7.1e-44 Score=290.17 Aligned_cols=234 Identities=14% Similarity=0.108 Sum_probs=181.6 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC-HHHHHHH---------------CCCCC Q ss_conf 2499999400368538877708999999999999999769909998236541-0233442---------------03777 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNI-FRGSQVV---------------AENYL 69 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~-~~~~~~~---------------~~~~~ 69 (242) -||||||+|||+|+++++..++.++++++|++|++++++|++++|||||+.. ++..... ..... T Consensus 8 ~krIVIKiGss~lt~~d~~~~~~~~i~~l~~~I~~L~~~G~evvlVsSGAv~~G~~~l~~~~~~~~~~~~~~~~~~~~~~ 87 (284) T cd04256 8 AKRIVVKLGSAVVTREDECGLALGRLASIVEQVSELQSQGREVILVTSGAVAFGKQRLRHEILLSSSMRQTLKSGQLKDM 87 (284) T ss_pred CCEEEEEECHHHEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHCCCHHHCCHHCCCCHHHHHHCCCCCCCC T ss_conf 87899995702237999997589999999999999997899699990684653811421100022024433022345557 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC--CHHHHHHHHHH-HHHHCCHHHHCCCCCCCCCC-----C-- Q ss_conf 75402467888889899999999999761886411340--01002456654-32100001100111112465-----5-- Q gi|254780777|r 70 LCERSTVDSMGMLSTVINALALDLALRKINVPTVILSS--IFMPQVCEVFS-CRNAVSYLSQGKVVIFSGGT-----G-- 139 (242) Q Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~-----~-- 139 (242) +....+.+++...++..++..|...|..+++.+.+... .+...+....+ ......+++.+.+|++++.+ + T Consensus 88 ~~~~~~~qa~aavGQ~~Lm~~y~~~f~~~~~~~aQiLlt~~D~~~r~~~~n~~~tl~~Ll~~gvIPIvNEND~Va~~e~~ 167 (284) T cd04256 88 PQMELDGRACAAVGQSGLMALYEAMFTQYGITVAQVLVTKPDFYDEQTRRNLNGTLEELLRLNIIPIINTNDAVSPPPEP 167 (284) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHCCHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCC T ss_conf 52007899999820999999999999875981688727877730279999999999999738978576378610246556 Q ss_pred -------CCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHH--HH--------CCCCCCCH Q ss_conf -------67776215788888641644665147543512343223444444441159999--85--------59981409 Q gi|254780777|r 140 -------NAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQF--IE--------KGLKVMDC 202 (242) Q Consensus 140 -------~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~--l~--------~G~~v~~~ 202 (242) +.|+|||.+|+++|.+++||+|+++|||||+||+||+ +|+|++|+++...|. +. .|+|..|+ T Consensus 168 ~~d~~~~i~fgDND~Laa~vA~~i~Ad~LillTDVdGlyd~~P~-~~~akli~~i~~~d~~~~~~~~~s~~gtGGM~tKi 246 (284) T cd04256 168 DEDLQGVISIKDNDSLAARLAVELKADLLILLSDVDGLYDGPPG-SDDAKLIHTFYPGDQQSITFGTKSRVGTGGMEAKV 246 (284) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCC-CCCCCEEEEECCCCHHHHHCCCCCCCCCCCHHHHH T ss_conf 41123431135601899998876188889997257720269999-99874421455664888745788997617879999 Q ss_pred HHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 99999997499699951898547999977998782466 Q gi|254780777|r 203 ASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 203 ~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) .||++|.++||+++|+||++|++|.++++||.+||++| T Consensus 247 ~AA~~a~~~Gi~~~I~nG~~~~~i~~i~~g~~vGT~Ft 284 (284) T cd04256 247 KAALWALQGGTSVVITNGMAGDVITKILEGKKVGTFFT 284 (284) T ss_pred HHHHHHHHCCCCEEEECCCCCCHHHHHHCCCCCCCEEC T ss_conf 99999997899799950899668999976999882549 No 14 >cd04253 AAK_UMPK-PyrH-Pf AAK_UMPK-PyrH-Pf: UMP kinase (UMPK)-Pf, the mostly archaeal uridine monophosphate kinase (uridylate kinase) enzymes that catalyze UMP phosphorylation and play a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of Pyrococcus furiosus (Pf) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial UMPKs have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs (this CD) appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of thi Probab=100.00 E-value=4.6e-42 Score=279.00 Aligned_cols=212 Identities=30% Similarity=0.446 Sum_probs=169.5 Q ss_pred EEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 99999400368538877708999999999999999769909998236541023344203777754024678888898999 Q gi|254780777|r 8 RVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTVIN 87 (242) Q Consensus 8 rIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 87 (242) |||||||||+|+++. +.+++.+++++|++++ ++++++||+|||+++|.+....+.. ......+++++...++++ T Consensus 1 RIViKiGgs~l~~~~----~~~~i~~la~~i~~l~-~~~~~iiVvggG~~ar~~~~~~~~~-~~~~~~~~~~g~~~t~~n 74 (221) T cd04253 1 RIVISLGGSVLAPEK----DADFIKEYANVLRKIS-DGHKVAVVVGGGRLAREYISVARKL-GASEAFLDEIGIMATRLN 74 (221) T ss_pred CEEEEEEHHHCCCCC----CHHHHHHHHHHHHHHH-CCCEEEEEECCCHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHH T ss_conf 979996077707998----8799999999999997-5996999987777887699999985-978999999999999999 Q ss_pred HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHHHCCCC Q ss_conf 99999999761886411340010024566543210000110011111246556777621578888864164466514754 Q gi|254780777|r 88 ALALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEIGADVILKGTQV 167 (242) Q Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~i~tdv 167 (242) +.++...+........ ...+.....++.+.+++..+. .+..++|++|+++|.+++||.|+++||| T Consensus 75 a~~l~~~~~~~~~~~~-------------~~~~~~~~~l~~~~i~v~~g~--~p~~ttD~laa~vA~~i~Ad~li~ltdV 139 (221) T cd04253 75 ARLLIAALGDAYPPVP-------------TSYEEALEAMFTGKIVVMGGT--EPGQSTDAVAALLAERLGADLLINATNV 139 (221) T ss_pred HHHHHHHHHCCCCCCC-------------CCHHHHHHHHHCCCEEEEECC--CCCCCCCHHHHHHHHHHCCCEEEEEECC T ss_conf 9999997404587664-------------659999999974986999478--8988867999999998298747886435 Q ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHC---------CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCE Q ss_conf 3512343223444444441159999855---------9981409999999974996999518985479999779987824 Q gi|254780777|r 168 DGVYSADPRVHASSTRFDSLTYNQFIEK---------GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTI 238 (242) Q Consensus 168 dGiyt~dP~~~~~ak~i~~is~~e~l~~---------G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~ 238 (242) ||+||+||+++|+|++|++++++|++++ ++++++..|+++|+++|+|++|+||++|++|.++++||++||+ T Consensus 140 dGvY~~DP~~~~~Ak~i~~is~~el~~~~~~~~~~~g~~~~~d~~Aa~~a~~~gi~~~I~nG~~~~~i~~~l~Ge~vGTl 219 (221) T cd04253 140 DGVYSKDPRKDPDAKKFDRLSADELIDIVGKSSWKAGSNEPFDPLAAKIIERSGIKTIVVDGRDPENLERALKGEFVGTI 219 (221) T ss_pred CCCCCCCCCCCCCCEECCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHCCCCCCEE T ss_conf 87557898779987788658899999883567756898764799999999987998899869997589999779999716 Q ss_pred EC Q ss_conf 66 Q gi|254780777|r 239 IS 240 (242) Q Consensus 239 I~ 240 (242) |. T Consensus 220 I~ 221 (221) T cd04253 220 IE 221 (221) T ss_pred EC T ss_conf 69 No 15 >TIGR00657 asp_kinases aspartate kinase; InterPro: IPR001341 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . Aspartate kinase (2.7.2.4 from EC) (AK) catalyzes the first reaction in the aspartate pathway; the phosphorylation of aspartate. The product of this reaction can then be used in the biosynthesis of lysine or in the pathway leading to homoserine, which participates in the biosynthesis of threonine, isoleucine and methionine . In bacteria there are three different aspartate kinase isozymes which differ in sensitivity to repression and inhibition by Lys, Met and Thr. AK1 and AK2 are bifunctional enzymes which both consist of an N-terminal AK domain and a C-terminal homoserine dehydrogenase domain. AK1 is involved in threonine biosynthesis and AK2, in that of methionine. The third isozyme, AK3 is monofunctional and involved in lysine synthesis. In archaea and plants there may be a single isozyme of AK which in plants is multifunctional. This entry represents a region encoding aspartate kinase activity found in both the monofunctional and bifunctional enzymes. Synonym(s): Aspartokinase; GO: 0004072 aspartate kinase activity, 0008652 amino acid biosynthetic process. Probab=100.00 E-value=2.8e-42 Score=280.29 Aligned_cols=221 Identities=22% Similarity=0.271 Sum_probs=178.1 Q ss_pred CEEEEEEEEEEEEECCCCCCCCH--HHHHHHHHHHHHHHHCCCEEEEEECCCC--CHHHHHHHC---------------- Q ss_conf 24999994003685388777089--9999999999999976990999823654--102334420---------------- Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDI--DSVNRICADIAEVYAKGIEIGIVVGGGN--IFRGSQVVA---------------- 65 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~--~~i~~ia~~I~~~~~~g~~vviV~sg~~--~~~~~~~~~---------------- 65 (242) |.+||.||||+|+.+ . ++|+++|+.|.+..++|+++++|+|+.+ +-++..... T Consensus 1 m~~iV~KFGGTSVg~-------~PverI~~~A~~v~~~~~~g~~vvVVVSA~~g~Td~Lv~~~~~~~~~~~~~~~~~~~~ 73 (504) T TIGR00657 1 MMLIVQKFGGTSVGN-------APVERIRRVAKIVLKEKEKGNQVVVVVSAMAGVTDALVELAEKVSPGPSAAQEKEFLE 73 (504) T ss_pred CCEEEEEECCEEECC-------CCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 974799736757667-------3237899999988887535897389997377533689999986046443303679999 Q ss_pred -----------------------------------------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC--CC Q ss_conf -----------------------------------------37777540246788888989999999999976188--64 Q gi|254780777|r 66 -----------------------------------------ENYLLCERSTVDSMGMLSTVINALALDLALRKINV--PT 102 (242) Q Consensus 66 -----------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~ 102 (242) ........++.+...+.+..++..++..++..+|. +. T Consensus 74 ~Ir~~~~~~~~~~~~~~~~~~~~~~l~~~e~~~~~~~L~~~~~LGe~~~~~~D~ilS~GE~~S~~l~~~~l~~~G~K~~~ 153 (504) T TIGR00657 74 KIREKHLEILERLIIPQAIAEKLKRLLDAELVLLKKVLLGISYLGEVTPREMDRILSVGERLSIALLSAALEELGVKAKA 153 (504) T ss_pred HHHHHCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCCC T ss_conf 99984000477740773368999999998899999999987873468868744131203588899999999745785201 Q ss_pred CCCCCCHHHHH--------------HHHHHHHHHCCHHHHC-CCCCCCCCC---------CCCCCCCHHHHHHHHHHHCH Q ss_conf 11340010024--------------5665432100001100-111112465---------56777621578888864164 Q gi|254780777|r 103 VILSSIFMPQV--------------CEVFSCRNAVSYLSQG-KVVIFSGGT---------GNAFLTTDSAAALRASEIGA 158 (242) Q Consensus 103 ~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~-~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~a 158 (242) ..+........ ............++.+ .+|++-|+. .+.+|++|++|+++|.+++| T Consensus 154 ~~l~~~~~~I~T~~~~~~A~p~~~~~~~~~~~rL~~~L~~g~~ipvvaGF~G~~~~g~~TtLGRGGSD~tA~llA~aL~A 233 (504) T TIGR00657 154 VSLTGAQAGILTDSNFGRARPDLAVIIEILTERLEPLLEEGDIIPVVAGFQGATEKGEITTLGRGGSDYTAALLAAALKA 233 (504) T ss_pred CCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCEEEEEECCHHHHHHHHHHHHHCC T ss_conf 22114745455447667641202467776699999998469879998400410577507983068068999999986199 Q ss_pred HHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHH-CCCCEEEEECCCCCHHHHHHCCCC Q ss_conf 4665147543512343223444444441159999855---998140999999997-499699951898547999977998 Q gi|254780777|r 159 DVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARD-CSIPIIVFSIHSPGGIWGGLSGIG 234 (242) Q Consensus 159 d~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~-~gI~v~I~ng~~~~~i~~~l~Ge~ 234 (242) |.|.||||||||||+|||..|+||+|++|||+||+|+ |+||+||+++++|++ ++||++|.|.++| +. T Consensus 234 d~~~IyTDV~Gi~T~DPR~V~~Ar~i~~isy~Em~ELA~~GAKVLHPr~~ep~~~~~~ip~~v~st~~~---------~a 304 (504) T TIGR00657 234 DECEIYTDVDGIYTTDPRIVPDARRIDEISYEEMLELASFGAKVLHPRTLEPAMRDYKIPIVVKSTFNP---------EA 304 (504) T ss_pred CEEEEEECCCCCEECCCCCCCCCCCCCCCCHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCEEEECCCCC---------CC T ss_conf 689998727950327886376652045448789999985031653278899984679951899607788---------99 Q ss_pred CCCEECCC Q ss_conf 78246689 Q gi|254780777|r 235 RSTIISGE 242 (242) Q Consensus 235 ~GT~I~~~ 242 (242) .||+|.++ T Consensus 305 ~GT~I~~~ 312 (504) T TIGR00657 305 PGTLIVAS 312 (504) T ss_pred CCEEECCC T ss_conf 96687177 No 16 >TIGR02076 pyrH_arch uridylate kinase, putative; InterPro: IPR011818 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the archaeal and spirochete proteins.; GO: 0009041 uridylate kinase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=100.00 E-value=9.2e-40 Score=264.80 Aligned_cols=215 Identities=29% Similarity=0.391 Sum_probs=191.2 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCC--CEEEEEECCCCCHHHHHHHCCCC-CCCCHHHHHHHHHHHHH Q ss_conf 9999400368538877708999999999999999769--90999823654102334420377-77540246788888989 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKG--IEIGIVVGGGNIFRGSQVVAENY-LLCERSTVDSMGMLSTV 85 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g--~~vviV~sg~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 85 (242) |||+||||+|..++ +|.|++..+|+.|.++..+| |+|.||+|||..+|.|....+.. +..+...++.+|...++ T Consensus 1 iV~~LGGsV~~~~~---~d~e~~~~~A~~l~~~~~~Gek~~v~vVvGGG~~AR~YI~~aR~lnPG~~E~~lD~iGI~~TR 77 (232) T TIGR02076 1 IVISLGGSVLSPEN---IDAEKIKEYANILRKLSDEGEKHKVGVVVGGGKTAREYIGVARELNPGASETFLDEIGIDATR 77 (232) T ss_pred CEEECCCEEECCCC---CCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 97833451616888---887899999999999984499468998878876588999999860888880245677799999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHHHCC Q ss_conf 99999999997618864113400100245665432100001100111112465567776215788888641644665147 Q gi|254780777|r 86 INALALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEIGADVILKGT 165 (242) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~i~t 165 (242) ++++++..++.....+..+....+... ....+.+++..|++ |..++|..||++|..++||.|++.| T Consensus 78 LNA~LLi~aL~~~a~p~vP~~~~EA~~------------~~~~~~ivVmGGt~--PGhtTDAVAA~lAE~~~ad~L~~~T 143 (232) T TIGR02076 78 LNAMLLIAALGDDAYPKVPENFEEALE------------AMSLGKIVVMGGTE--PGHTTDAVAALLAEFLEADLLINAT 143 (232) T ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHH------------HHHCCCEEEECCCC--CCCHHHHHHHHHHHHHCCCCEEEEE T ss_conf 999999987163358989889899999------------98269879866878--9852899999997662687269982 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHHHHHC---------C-CCCCCHHHHHHHHHCCCCEEEEECCC--CCHHHH-H-HC Q ss_conf 543512343223444444441159999855---------9-98140999999997499699951898--547999-9-77 Q gi|254780777|r 166 QVDGVYSADPRVHASSTRFDSLTYNQFIEK---------G-LKVMDCASVVLARDCSIPIIVFSIHS--PGGIWG-G-LS 231 (242) Q Consensus 166 dvdGiyt~dP~~~~~ak~i~~is~~e~l~~---------G-~~v~~~~A~~~a~~~gI~v~I~ng~~--~~~i~~-~-l~ 231 (242) +||||||+||+++|+|+++++++|+|+.+. | ..++|+.|++++++++|.++|+|+++ |++|.+ + ++ T Consensus 144 nVDGVYd~DP~~~~~A~~~~~l~~~eL~~i~~G~~~~~AG~~~~~D~lA~kii~Rs~i~t~V~~~~dfr~~~l~~~~n~~ 223 (232) T TIGR02076 144 NVDGVYDKDPNKYPDAKKFDKLTPEELVEIVRGSSSLKAGSSEVVDPLAAKIIERSKIRTIVVNGRDFRPENLEKIVNLK 223 (232) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHC T ss_conf 26852177778887840002589889999860441127899752448899999747954899648545888999998725 Q ss_pred CCCCCCEEC Q ss_conf 998782466 Q gi|254780777|r 232 GIGRSTIIS 240 (242) Q Consensus 232 Ge~~GT~I~ 240 (242) ||+.||.|. T Consensus 224 Ge~~GT~I~ 232 (232) T TIGR02076 224 GEIVGTIIE 232 (232) T ss_pred CCCCCCEEC T ss_conf 886876209 No 17 >PRK07431 aspartate kinase; Provisional Probab=100.00 E-value=4e-39 Score=260.86 Aligned_cols=219 Identities=22% Similarity=0.300 Sum_probs=179.2 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC--CCHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 24999994003685388777089999999999999997699099982365--4102334420377775402467888889 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG--NIFRGSQVVAENYLLCERSTVDSMGMLS 83 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (242) |+.||.||||||+.+ .++++++++.|.+.+++|+++++|+|+. .+-++...........+.++++.....+ T Consensus 1 M~lIV~KFGGTSVa~-------~e~i~~va~~I~~~~~~g~~vvVVVSAmg~~Td~L~~la~~i~~~~~~re~D~lls~G 73 (594) T PRK07431 1 MALIVQKFGGTSVGS-------VERIQAVAQRIARTKEAGNDVVVVVSAMGKTTDELVKLAEEISDNPPRREMDMLLSTG 73 (594) T ss_pred CCEEEEEECCCCCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 974998428765798-------9999999999999987589999997499872999999999874799989999999775 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHH---------HHHHCCHHHHCCCCCCCCC-----------CCCCCC Q ss_conf 89999999999976188641134001002456654---------3210000110011111246-----------556777 Q gi|254780777|r 84 TVINALALDLALRKINVPTVILSSIFMPQVCEVFS---------CRNAVSYLSQGKVVIFSGG-----------TGNAFL 143 (242) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~-----------~~~~~~ 143 (242) .+.+..++...+...|.+...++..+........+ .......++.+.++++.++ +.+.+| T Consensus 74 E~~S~~Lla~~L~~~G~~a~sl~~~q~gi~Td~~~~~A~i~~i~~~~i~~~l~~~~v~Vv~GFqGi~~~~~~eiTTLGRG 153 (594) T PRK07431 74 EQVSIALLSMALQELGQPAISLTGAQVGIVTESEHGRARILEIKTERIQRHLDAGKVVVVAGFQGISLSSGLEITTLGRG 153 (594) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEHHHCCEECCCCCCCEECCCCCHHHHHHHHHCCCCEEEEEEECCCCCCCCCEEEECCC T ss_conf 99999999999996899719920001617767987867305068999999986299179960001257998615650688 Q ss_pred CCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEEC Q ss_conf 6215788888641644665147543512343223444444441159999855---9981409999999974996999518 Q gi|254780777|r 144 TTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSI 220 (242) Q Consensus 144 ~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng 220 (242) ++|++|+.+|.+++|+.|.+||||||+||+|||.+|+|++|++|||+|++|+ |++|+|++++++|+++|||++|.|. T Consensus 154 GSD~TA~~lAaaL~Ad~~eI~TDVdGV~TaDPriv~~Ar~i~~ISY~Em~ELa~~GAkVLhprsVe~A~k~~IPI~Vrst 233 (594) T PRK07431 154 GSDTSAVALAAALGADACEIYTDVPGVLTTDPRLVPEAQLMDEISCDEMLELASLGASVLHPRAVEIARNYGVPLVVRSS 233 (594) T ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCEECCCCCCCCCCCEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEECC T ss_conf 72679999998648989999635672364699868886285712799999999758774567789999973996599667 Q ss_pred CCCCHHHHHHCCCCCCCEECC Q ss_conf 985479999779987824668 Q gi|254780777|r 221 HSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 221 ~~~~~i~~~l~Ge~~GT~I~~ 241 (242) +.+ .+||+|+. T Consensus 234 f~~----------~~GT~I~~ 244 (594) T PRK07431 234 WSD----------APGTLLTS 244 (594) T ss_pred CCC----------CCCCEEEC T ss_conf 889----------99857844 No 18 >cd04246 AAK_AK-DapG-like AAK_AK-DapG-like: Amino Acid Kinase Superfamily (AAK), AK-DapG-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional enzymes found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species, as well as, the catalytic AK domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related isoenzymes. In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. The role of the AKI isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulati Probab=100.00 E-value=2.6e-38 Score=255.84 Aligned_cols=215 Identities=26% Similarity=0.352 Sum_probs=173.2 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC--CCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 9999400368538877708999999999999999769909998236--54102334420377775402467888889899 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGG--GNIFRGSQVVAENYLLCERSTVDSMGMLSTVI 86 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (242) ||+||||||+.+ .+++++++++|++..++|+++++|+|+ +.+.++.............++.+.....+.++ T Consensus 2 iV~KFGGtSv~~-------~e~i~~va~iI~~~~~~g~~viVVvSA~gg~Td~L~~l~~~~~~~~~~~~~d~ils~GE~l 74 (239) T cd04246 2 IVQKFGGTSVAD-------IERIKRVAERIKKAVKKGYQVVVVVSAMGGTTDELIGLAKEVSPRPSPRELDMLLSTGEQI 74 (239) T ss_pred EEEEECCCCCCC-------HHHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 899878100499-------9999999999999986798299998489865677999888764772588999999988999 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHH---------HHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHH Q ss_conf 99999999976188641134001002456---------65432100001100111112465---------5677762157 Q gi|254780777|r 87 NALALDLALRKINVPTVILSSIFMPQVCE---------VFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSA 148 (242) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~ 148 (242) +..++...+...|.+...+...+...... ..........++.+.++++.++. .+.+|++|++ T Consensus 75 s~~lla~~L~~~G~~a~~~~~~~~~i~t~~~~~~a~i~~~~~~~~~~~l~~~~v~Vv~GF~g~~~~G~~ttLGRGGSD~t 154 (239) T cd04246 75 SAALLAMALNRLGIKAISLTGWQAGILTDDHHGNARIIDIDPKRILEALEEGDVVVVAGFQGVNEDGEITTLGRGGSDTT 154 (239) T ss_pred HHHHHHHHHHHCCCCCEEEEHHHCCEECCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEECCCCHHHH T ss_conf 99999999997599808857323625304641022233530556665552454479733234489985688179964799 Q ss_pred HHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH---CCCCCCCHHHHHHHHHCCCCEEEEECCCCCH Q ss_conf 8888864164466514754351234322344444444115999985---5998140999999997499699951898547 Q gi|254780777|r 149 AALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE---KGLKVMDCASVVLARDCSIPIIVFSIHSPGG 225 (242) Q Consensus 149 a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~---~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~ 225 (242) |+++|.+++|+.+.+||||||+||+||+.+|+|++|+++||+|+++ .|++|+|++|+++++++|||++|.|.++| T Consensus 155 Aa~iA~~l~A~~v~iwTDVdGi~taDPr~v~~A~~i~~lsy~EA~ela~~GakVlhp~ti~p~~~~~Ipi~I~nt~~~-- 232 (239) T cd04246 155 AVALAAALKADRCEIYTDVDGVYTADPRIVPKARKLDVISYDEMLEMASLGAKVLHPRSVELAKKYNVPLRVRSSFSE-- 232 (239) T ss_pred HHHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCC-- T ss_conf 999999728888999718972573899968766572752799999998689842589999999986995899848989-- Q ss_pred HHHHHCCCCCCCEEC Q ss_conf 999977998782466 Q gi|254780777|r 226 IWGGLSGIGRSTIIS 240 (242) Q Consensus 226 i~~~l~Ge~~GT~I~ 240 (242) + .||+|| T Consensus 233 -------~-~GT~Is 239 (239) T cd04246 233 -------N-PGTLIT 239 (239) T ss_pred -------C-CCCEEC T ss_conf -------8-987759 No 19 >COG0527 LysC Aspartokinases [Amino acid transport and metabolism] Probab=100.00 E-value=1.3e-38 Score=257.76 Aligned_cols=221 Identities=24% Similarity=0.282 Sum_probs=177.2 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC--CHHHHHHHCC----------------- Q ss_conf 249999940036853887770899999999999999976990999823654--1023344203----------------- Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN--IFRGSQVVAE----------------- 66 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~--~~~~~~~~~~----------------- 66 (242) |+++|.||||||+.+ .+++.++++.|++..++|+++++|+|++. +..+...... T Consensus 1 m~~iV~KFGGTSva~-------~e~i~~va~iv~~~~~~g~~vVVVvSA~~~vTd~Lv~~a~~~~~~~~~~~~~~~~~~~ 73 (447) T COG0527 1 MRLIVQKFGGTSVAD-------AERILRVADIVKEDSEEGVKVVVVVSAMGGVTDLLVALAEGAESGRDAVAEQRHRDIA 73 (447) T ss_pred CCEEEEEECCCCCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 950899868755388-------8999999999985463397189997898774589999986331445205788899999 Q ss_pred --CCCC--------------------------CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH- Q ss_conf --7777--------------------------54024678888898999999999997618864113400100245665- Q gi|254780777|r 67 --NYLL--------------------------CERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVCEVF- 117 (242) Q Consensus 67 --~~~~--------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 117 (242) .... .+.+..+.....+...+..++...+...|++....+..+........ T Consensus 74 ~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~lls~GE~~Sa~lla~~L~~~Gv~A~~~~~~~~~i~t~~~~ 153 (447) T COG0527 74 SELILDPFIAARLAEVIAEFKKVLLGIALLGEVSPRERDELLSLGERLSAALLAAALNALGVDARSLDGRQAGIATDSNH 153 (447) T ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCHHHHHHHHHHHHHHCCCCEEEECHHHCEEEECCCC T ss_conf 98741510456676666666676666664147987899999850548899999999996799659975588425535766 Q ss_pred --------HHHH-HCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCC Q ss_conf --------4321-00001100111112465---------56777621578888864164466514754351234322344 Q gi|254780777|r 118 --------SCRN-AVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHA 179 (242) Q Consensus 118 --------~~~~-~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~ 179 (242) .... ....++.+.++++.|+. .+.+|++|++|+++|.+++||+|.||||||||||+|||.+| T Consensus 154 ~~a~i~~~~~~~~l~~~~~~~~v~Vv~GF~G~~~~G~~tTLGRGGSD~SA~~laa~l~Ad~~~I~TDVdGI~TaDPRiVp 233 (447) T COG0527 154 GNARILDEDSERRLLRLLEEGKVPVVAGFQGINEDGETTTLGRGGSDYSAAALAAALGADEVEIWTDVDGVYTADPRIVP 233 (447) T ss_pred CCCCCCHHHHHHHHHHHHCCCCEEEECCCEEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCC T ss_conf 65531035566667887446977995186651699988984798478899999997199889999878877568997788 Q ss_pred CCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 4444441159999855---99814099999999749969995189854799997799878246689 Q gi|254780777|r 180 SSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 180 ~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) +|++|+++||+|++++ |++|+|++|+++|++++||++|.|.++| +..||+|+++ T Consensus 234 ~Ar~i~~isyeEa~ELA~~GAkVLHprav~pa~~~~Ip~~i~~t~~p---------~~~GTlI~~~ 290 (447) T COG0527 234 DARLLPEISYEEALELAYLGAKVLHPRAVEPAMRSGIPLRIKNTFNP---------DAPGTLITAE 290 (447) T ss_pred CCEECCCCCHHHHHHHHHCCCHHCCHHHHHHHHHCCCCEEEEECCCC---------CCCCEEEECC T ss_conf 64576846999999999778521499888899876991899716999---------9995399567 No 20 >cd04261 AAK_AKii-LysC-BS AAK_AKii-LysC-BS: Amino Acid Kinase Superfamily (AAK), AKii; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive aspartokinase isoenzymes. The B. subtilis 168 AKII is induced by methionine, and repressed and inhibited by lysine. Although Corynebacterium glutamicum is known to contain a single aspartokinase isoenzyme type, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In this organism and other various Gram-positive bacteria, the DAP-lysine pathway is feedback regulated by the concerted action of lysine and Probab=100.00 E-value=5.9e-38 Score=253.66 Aligned_cols=215 Identities=26% Similarity=0.348 Sum_probs=172.0 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC--CCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 9999400368538877708999999999999999769909998236--54102334420377775402467888889899 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGG--GNIFRGSQVVAENYLLCERSTVDSMGMLSTVI 86 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (242) ||+||||||+.+ .++++++++.|++..++|+++++|+|+ +.+.++...........+.++++.....+.+. T Consensus 2 iV~KFGGtSv~~-------~e~i~~va~iI~~~~~~~~~vvVVvSA~gg~Td~Li~la~~~~~~~~~~e~d~ils~GE~l 74 (239) T cd04261 2 IVQKFGGTSVAS-------IERIKRVAERIKKRKKKGNQVVVVVSAMGGTTDELIELAKEISPRPPARELDVLLSTGEQV 74 (239) T ss_pred EEEEECCCCCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 899978201399-------9999999999998986699289997798753688999999870699989998887777999 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHH---------HHHHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHH Q ss_conf 999999999761886411340010024---------5665432100001100111112465---------5677762157 Q gi|254780777|r 87 NALALDLALRKINVPTVILSSIFMPQV---------CEVFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSA 148 (242) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~ 148 (242) +..++...+...|++.......+.... ............++...++++.++. .+.+|++|++ T Consensus 75 sa~ll~~~L~~~G~~a~~~~~~~~~i~t~~~~~~a~i~~~~~~~i~~~l~~~~v~Vv~GF~g~~~~g~~ttLGRGGSD~t 154 (239) T cd04261 75 SIALLAMALNRLGIKAISLTGWQAGILTDGHHGKARIIDIDPDRIRELLEEGDVVIVAGFQGINEDGDITTLGRGGSDTS 154 (239) T ss_pred HHHHHHHHHHHCCCCCEEEEHHHCCEECCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCHHHH T ss_conf 99999999997699757968666754315752045568899999986402455589734655478884688279946889 Q ss_pred HHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH---CCCCCCCHHHHHHHHHCCCCEEEEECCCCCH Q ss_conf 8888864164466514754351234322344444444115999985---5998140999999997499699951898547 Q gi|254780777|r 149 AALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE---KGLKVMDCASVVLARDCSIPIIVFSIHSPGG 225 (242) Q Consensus 149 a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~---~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~ 225 (242) |+++|.+++|+.+++||||||+||+|||.+|+|++|+++||+|+.+ .|++|+|++|+++++++|||++|.|.++| T Consensus 155 Aai~A~~l~A~~v~iwTDVdGi~taDPr~v~~A~~i~~lsy~EA~ela~~GakVlHp~ai~p~~~~~Ipi~i~nt~~p-- 232 (239) T cd04261 155 AVALAAALGADRCEIYTDVDGVYTADPRIVPKARKLDEISYDEMLEMASLGAKVLHPRSVELAKKYGVPLRVLSSFSE-- 232 (239) T ss_pred HHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCEECHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCC-- T ss_conf 999999669888999818983275899968876773770799999999689731589999999986994899708989-- Q ss_pred HHHHHCCCCCCCEEC Q ss_conf 999977998782466 Q gi|254780777|r 226 IWGGLSGIGRSTIIS 240 (242) Q Consensus 226 i~~~l~Ge~~GT~I~ 240 (242) + .||+|| T Consensus 233 -------~-~GT~Is 239 (239) T cd04261 233 -------E-PGTLIT 239 (239) T ss_pred -------C-CCCEEC T ss_conf -------7-988859 No 21 >PRK08841 aspartate kinase; Validated Probab=100.00 E-value=1.4e-37 Score=251.32 Aligned_cols=220 Identities=23% Similarity=0.288 Sum_probs=175.5 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC--CCHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 24999994003685388777089999999999999997699099982365--4102334420377775402467888889 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG--NIFRGSQVVAENYLLCERSTVDSMGMLS 83 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (242) |+.||.||||||+.+ .++++++++.|.+..++|+++++|+|+. .+-++.............++++.....+ T Consensus 1 M~~iV~KFGGTSv~~-------~e~i~~va~~i~~~~~~~~~vvVVvSAm~g~Td~Ll~la~~i~~~~~~relD~lls~G 73 (392) T PRK08841 1 MPLIVQKFGGTSVGS-------IERIQTVAEHIIKAKNDGNQVVVVVSAMAGETNRLLGLAKQVDSVPTARELDVLLSAG 73 (392) T ss_pred CCEEEEEECCCCCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 987999158635799-------9999999999998875699899996289996899999998632599989999999765 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH---------HHHHHCCHHHHCCCCCCCCCC---------CCCCCCC Q ss_conf 8999999999997618864113400100245665---------432100001100111112465---------5677762 Q gi|254780777|r 84 TVINALALDLALRKINVPTVILSSIFMPQVCEVF---------SCRNAVSYLSQGKVVIFSGGT---------GNAFLTT 145 (242) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~ 145 (242) .+.+..+++..+...|.+...+...+........ ........++...++++.++. .+.+|++ T Consensus 74 E~lS~~lla~~L~~~G~~a~~l~~~~~~i~t~~~~~~a~i~~~~~~~~~~~l~~~~v~Vv~GF~g~~~~G~~ttLGRGGS 153 (392) T PRK08841 74 EQVSMALLAMTLNKLGYKARSLTGAQANIVTDNQHNDATIKHIDTSTITELLEQDQIVIVAGFQGRNENGDITTLGRGGS 153 (392) T ss_pred HHHHHHHHHHHHHHCCCCCEEEECHHCCCCCCCCCCCEEEEHHCHHHHHHHHHCCCEEEECCEEECCCCCCEEECCCCCH T ss_conf 99999999999986799757987123351246777643520203387998873598699418201079998756279977 Q ss_pred HHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCC Q ss_conf 15788888641644665147543512343223444444441159999855---998140999999997499699951898 Q gi|254780777|r 146 DSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHS 222 (242) Q Consensus 146 D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~ 222 (242) |++|+++|.+++|+.|.+||||||+||+|||.+|+|++++++||+|++++ |++|+|++++++|++++||++|.|.++ T Consensus 154 D~sA~~lAa~l~A~~~eIwTDVdGv~taDPr~V~~A~~l~~isy~Ea~ELa~~GAkVlHp~tv~pa~~~~Ipi~v~ntf~ 233 (392) T PRK08841 154 DTTAVALAGALNADECQIFTDVDGVYTCDPRVVKNARKLDVIDFPSMEAMARKGAKVLHLPSVQHAWKHSVPLRVLSSFE 233 (392) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCEEEEEECCC T ss_conf 99999999980997899985677636379868768728264789999999976996368889999987597599983687 Q ss_pred CCHHHHHHCCCCCCCEECCC Q ss_conf 54799997799878246689 Q gi|254780777|r 223 PGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 223 ~~~i~~~l~Ge~~GT~I~~~ 242 (242) + ..||+|+++ T Consensus 234 ~----------~~GT~I~~~ 243 (392) T PRK08841 234 D----------GEGTLIKGE 243 (392) T ss_pred C----------CCCEEEECC T ss_conf 9----------998289667 No 22 >PRK06635 aspartate kinase; Reviewed Probab=100.00 E-value=2.2e-37 Score=250.15 Aligned_cols=220 Identities=22% Similarity=0.306 Sum_probs=175.5 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC--CCHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 24999994003685388777089999999999999997699099982365--4102334420377775402467888889 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG--NIFRGSQVVAENYLLCERSTVDSMGMLS 83 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (242) ||+||.||||||+.+ .+.++++++.|....++|+++++|+|+. .+-.+...........+.++.+.....+ T Consensus 1 M~~iV~KFGGtSv~~-------~~~~~~v~~ii~~~~~~~~~~vvVvSA~~~~Td~Li~la~~~~~~~~~~e~d~ils~G 73 (402) T PRK06635 1 MALIVQKFGGTSVGD-------VERIKNVAERVKAEVEAGHQVVVVVSAMGGTTDELLDLAKEVSPAPDPRELDMLLSTG 73 (402) T ss_pred CCEEEEEECCCCCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 977999578503699-------9999999999987887299989996499987899999999860799879999999871 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHH---------HHHHHHCCHHHHCCCCCCCCCC---------CCCCCCC Q ss_conf 899999999999761886411340010024566---------5432100001100111112465---------5677762 Q gi|254780777|r 84 TVINALALDLALRKINVPTVILSSIFMPQVCEV---------FSCRNAVSYLSQGKVVIFSGGT---------GNAFLTT 145 (242) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~ 145 (242) .+.+..++...+...|++...+...+....... .........+....++++.|+. .+.+|++ T Consensus 74 E~~S~~l~a~~L~~~g~~a~~~~~~~~~i~td~~~~~a~~~~~~~~~~~~~l~~~~v~Vv~GF~g~~~~G~~ttLgRGGS 153 (402) T PRK06635 74 EQVSVALLAMALQSLGVKARSFTGWQAPIITDSAHGKARITDIDPDRIREALDEGKVVVVAGFQGVDEDGEITTLGRGGS 153 (402) T ss_pred HHHHHHHHHHHHHHCCCCCEEECHHHCCEECCCCCCCCCCCHHHHHHHHHHHHCCCEEEECCCEEECCCCCEEEECCCCC T ss_conf 79999999999985799858914787855315654552411422788999984697799737312257896677358872 Q ss_pred HHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCC Q ss_conf 15788888641644665147543512343223444444441159999855---998140999999997499699951898 Q gi|254780777|r 146 DSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHS 222 (242) Q Consensus 146 D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~ 222 (242) |++|+++|.+++|+.+.+||||||+||+|||.+|+|++++++||+|+.++ |++|+||+++++|++++||++|.|.++ T Consensus 154 D~sA~~~a~~l~A~~~~IwtDV~Gi~taDPr~V~~A~~i~~isy~Ea~ELa~~GAkVlHp~ti~pa~~~~Ipi~v~nTf~ 233 (402) T PRK06635 154 DTTAVALAAALKADECEIYTDVDGVYTTDPRIVPKARKLDKISYEEMLELASLGAKVLHPRSVELAKKYNVPLRVRSSFS 233 (402) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCEECCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCEEEEEECCC T ss_conf 18999999860775899985356305158754677721572599999999976987567888999997498089985678 Q ss_pred CCHHHHHHCCCCCCCEECCC Q ss_conf 54799997799878246689 Q gi|254780777|r 223 PGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 223 ~~~i~~~l~Ge~~GT~I~~~ 242 (242) | + .||+|+++ T Consensus 234 p---------~-~GT~I~~~ 243 (402) T PRK06635 234 E---------E-PGTLITEE 243 (402) T ss_pred C---------C-CCEEEECC T ss_conf 9---------8-97399538 No 23 >cd04260 AAK_AKi-DapG-BS AAK_AKi-DapG-BS: Amino Acid Kinase Superfamily (AAK), AKi-DapG; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional class enzyme found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species. In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The B. subtilis AKI is tetrameric consisting of two alpha and two bet Probab=100.00 E-value=3.7e-37 Score=248.77 Aligned_cols=215 Identities=23% Similarity=0.292 Sum_probs=172.5 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC-------HHHHHHHCCCCCCCCHHHHHHHHH Q ss_conf 9999400368538877708999999999999999769909998236541-------023344203777754024678888 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNI-------FRGSQVVAENYLLCERSTVDSMGM 81 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~-------~~~~~~~~~~~~~~~~~~~~~~~~ 81 (242) ||+|||||||.+ .++++++++.|++..++|+++++|+|+... ..+.............++.+.... T Consensus 2 iV~KFGGtSv~~-------~~~i~~va~iI~~~~~~~~~~vVVvSAmg~~~~~~~td~L~~l~~~~~~~~~~~~~D~l~s 74 (244) T cd04260 2 IVQKFGGTSVST-------KERREQVAKKVKQAVDEGYKPVVVVSAMGRKGDPYATDTLINLVYAENSDISPRELDLLMS 74 (244) T ss_pred EEEEECCCCCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 898118523799-------9999999999988987799889997999999975899999999997530389899999997 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH---------HHHHHCCHHHHCCCCCCCCCCC---------CCCC Q ss_conf 898999999999997618864113400100245665---------4321000011001111124655---------6777 Q gi|254780777|r 82 LSTVINALALDLALRKINVPTVILSSIFMPQVCEVF---------SCRNAVSYLSQGKVVIFSGGTG---------NAFL 143 (242) Q Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~---------~~~~ 143 (242) .+.+.+..++...+...|++.......+........ ........++...++++.++.+ +.++ T Consensus 75 ~GE~lSa~lla~~L~~~Gi~a~~~~~~~~~i~t~~~~~~a~~~~~~~~~~~~~l~~~~v~Vv~GFig~~~~G~~ttLGRg 154 (244) T cd04260 75 CGEIISAVVLTSTLRAQGLKAVALTGAQAGILTDDNYSNAKIIKVNPKKILSALKEGDVVVVAGFQGVTEDGEVTTLGRG 154 (244) T ss_pred HCCHHHHHHHHHHHHHCCCCCEEEEHHHCCEECCCCCCCEEEEEECHHHHHHHHHHCCEEEECCCCCCCCCCCEEECCCC T ss_conf 43099999999999964998748658888353268777436765158999999863463574031024689966771898 Q ss_pred CCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEEC Q ss_conf 6215788888641644665147543512343223444444441159999855---9981409999999974996999518 Q gi|254780777|r 144 TTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSI 220 (242) Q Consensus 144 ~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng 220 (242) ++|++|+++|.+++|+.|.+||||||+||+||+.+|+|++|+++||+|++++ |++|+|+++++++++++||++|.|. T Consensus 155 GSD~TAa~lA~~L~A~~~~IwtDV~Gv~taDPr~v~~A~~i~~lsy~EA~ELa~~GAkVlHp~ti~pa~~~~Ipi~Irnt 234 (244) T cd04260 155 GSDTTAAALGAALNAEYVEIYTDVDGIMTADPRVVPNARILDVVSYNEVFQMAHQGAKVIHPRAVEIAMQANIPIRIRST 234 (244) T ss_pred CHHHHHHHHHHHCCCCEEEEEECCCEEEECCCCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEEC T ss_conf 46899999999609988999646871271799868887687731899999999779751689999999986981899707 Q ss_pred CCCCHHHHHHCCCCCCCEEC Q ss_conf 98547999977998782466 Q gi|254780777|r 221 HSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 221 ~~~~~i~~~l~Ge~~GT~I~ 240 (242) ++| +.||+|| T Consensus 235 ~~~----------~~GT~Is 244 (244) T cd04260 235 MSE----------NPGTLIT 244 (244) T ss_pred CCC----------CCCCEEC T ss_conf 988----------7987749 No 24 >PRK08210 aspartate kinase I; Reviewed Probab=100.00 E-value=1.1e-36 Score=245.91 Aligned_cols=219 Identities=21% Similarity=0.279 Sum_probs=174.0 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC-------HHHHHHHCCCCCCCCHHHHHH Q ss_conf 2499999400368538877708999999999999999769909998236541-------023344203777754024678 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNI-------FRGSQVVAENYLLCERSTVDS 78 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~-------~~~~~~~~~~~~~~~~~~~~~ 78 (242) ||.||.||||||+.+ .++++++++.|.+..++|+++++|+|+... ..+.............++.+. T Consensus 1 Mk~iV~KFGGtSv~~-------~~~i~~v~~ii~~~~~~~~~~vVVvSAmg~~~~~~~td~L~~l~~~~~~~~~~~~~D~ 73 (405) T PRK08210 1 MKIIVQKFGGTSVST-------EERRKMVVNKIKKALKEGYKVVVVVSAMGRKGDPYATDTLLSLVGEEFSDISKREQDL 73 (405) T ss_pred CCEEEEEECCCCCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 987988168502699-------9999999999987886699889997998989981149999999986502598788999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHH---------HHHHHHCCHHHHCCCCCCCCCC---------CC Q ss_conf 88889899999999999761886411340010024566---------5432100001100111112465---------56 Q gi|254780777|r 79 MGMLSTVINALALDLALRKINVPTVILSSIFMPQVCEV---------FSCRNAVSYLSQGKVVIFSGGT---------GN 140 (242) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~---------~~ 140 (242) ....+.+....++...+...|.+.......+....... .........++...++++.|+. .+ T Consensus 74 l~s~GE~~S~~l~a~~L~~~g~~a~~~~~~~~~i~t~~~~~~a~~~~~~~~~~~~~l~~~~v~vv~GF~g~~~~g~~ttL 153 (405) T PRK08210 74 LMSCGEIISSVVFSNMLNEAGIKAVALTGGQAGIITDDNFTNAKIIEVNPDRILEALEEGDVVVVAGFQGATENGDITTL 153 (405) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCEEEEHHHCCEECCCCCCCCEEECCCHHHHHHHHHCCCEEEECCCEEECCCCCEEEE T ss_conf 98775899999999999747998679774158478467545530001457999998745985996174654689956662 Q ss_pred CCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEE Q ss_conf 7776215788888641644665147543512343223444444441159999855---9981409999999974996999 Q gi|254780777|r 141 AFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIV 217 (242) Q Consensus 141 ~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I 217 (242) .+|++|++|+++|.+++|+.|.+||||||+||+||+.+|+|++|+++||+|++++ |++|+||+++++|++++||++| T Consensus 154 GRGGSD~TA~~la~~l~A~~~~iwtDV~Gv~taDPr~v~~a~~i~~lsy~Ea~ELa~~GakVlhp~tv~pa~~~~Ipi~I 233 (405) T PRK08210 154 GRGGSDTTAAALGVALKAEYIDIYTDVDGIMTADPRIVEDARLLDVVSYNEVFQMAYQGAKVIHPRAVEIAMQANIPLRI 233 (405) T ss_pred CCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCEEEE T ss_conf 58873279999998639988999856776563799827665111624999999999779863688999999973983999 Q ss_pred EECCCCCHHHHHHCCCCCCCEECC Q ss_conf 518985479999779987824668 Q gi|254780777|r 218 FSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 218 ~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) .|.+++ ..||+|++ T Consensus 234 ~nT~~~----------~~GT~I~~ 247 (405) T PRK08210 234 RSTYSD----------SKGTLITS 247 (405) T ss_pred ECCCCC----------CCCCEEEC T ss_conf 736899----------99867934 No 25 >cd04241 AAK_FomA-like AAK_FomA-like: This CD includes a fosfomycin biosynthetic gene product, FomA, and similar proteins found in a wide range of organisms. Together, the fomA and fomB genes in the fosfomycin biosynthetic gene cluster of Streptomyces wedmorensis confer high-level fosfomycin resistance. FomA and FomB proteins converted fosfomycin to fosfomycin monophosphate and fosfomycin diphosphate in the presence of ATP and a magnesium ion, indicating that FomA and FomB catalyzed phosphorylations of fosfomycin and fosfomycin monophosphate, respectively. FomA and related sequences in this CD are members of the Amino Acid Kinase Superfamily (AAK). Probab=100.00 E-value=5.5e-36 Score=241.51 Aligned_cols=226 Identities=17% Similarity=0.169 Sum_probs=166.8 Q ss_pred EEEEEEEEEEECCC-CCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCC--CCCCCHHHHHHHHHHHHH Q ss_conf 99994003685388-777089999999999999997699099982365410233442037--777540246788888989 Q gi|254780777|r 9 VLLKVSGEALAGIS-GFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAEN--YLLCERSTVDSMGMLSTV 85 (242) Q Consensus 9 IViKiGGSsL~~~~-~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 85 (242) ||||+|||+|++++ +..++.++|.+++++|+++. |++++||||||++++........ ................+. T Consensus 2 IVIKiGgs~lt~~~~~~~~~~~~i~~~~~~i~~~~--~~~iiiV~GgG~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (252) T cd04241 2 IILKLGGSVITDKDRPETIREENLERIARELAEAI--DEKLVLVHGGGSFGHPKAKEYGLPDGDGSFSAEGVAETHEAML 79 (252) T ss_pred EEEEECHHHEECCCCCCCCCHHHHHHHHHHHHHHC--CCCEEEEECCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHH T ss_conf 99996578704897656511999999999999867--9988999895487699998713446874122888999999999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH----HHHHHCCHHHHCCCCCCCCC----CCCCC--CCCHHHHHHHHHH Q ss_conf 99999999997618864113400100245665----43210000110011111246----55677--7621578888864 Q gi|254780777|r 86 INALALDLALRKINVPTVILSSIFMPQVCEVF----SCRNAVSYLSQGKVVIFSGG----TGNAF--LTTDSAAALRASE 155 (242) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~----~~~~~--~~~D~~a~~ia~~ 155 (242) ..+..+...+..++.++...+..+........ +......+++.+.+|++.+. .+..+ .++|++|+++|.+ T Consensus 80 ~l~~~~~~~l~~~g~~a~~l~~~~~~~~~~~~~~~~~~~~l~~ll~~g~IPIin~~~v~~~~~~~~i~~~D~lAa~vA~~ 159 (252) T cd04241 80 ELNSIVVDALLEAGVPAVSVPPSSFFVTENGRIVSFDLEVIKELLDRGFVPVLHGDVVLDEGGGITILSGDDIVVELAKA 159 (252) T ss_pred HHHHHHHHHHHHCCCCEEEECHHHHHHHHCCCEEECCHHHHHHHHHCCCEEEECCCCEECCCCCCCCCCHHHHHHHHHHH T ss_conf 99999999999769963520404444431784103178999999987964696687652477742224373999999998 Q ss_pred HCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH------------CCCCCCCHHHHHHHHHCCCCEEEEECCCC Q ss_conf 164466514754351234322344444444115999985------------59981409999999974996999518985 Q gi|254780777|r 156 IGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE------------KGLKVMDCASVVLARDCSIPIIVFSIHSP 223 (242) Q Consensus 156 ~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~------------~G~~v~~~~A~~~a~~~gI~v~I~ng~~~ 223 (242) ++||.|+++|||||+||+|| ++++++++++..+..+ .|+|..|+.||+.|.++|++++|+||++| T Consensus 160 l~Ad~LI~lTDVdGvy~~~p---~~a~~i~~i~~~~~~~~~~~~~~~~~~~tGGM~~Kl~aA~~a~~~Gi~v~I~~G~~~ 236 (252) T cd04241 160 LKPERVIFLTDVDGVYDKPP---PDAKLIPEIDVGSLEDILAALGSAGTDVTGGMAGKIEELLELARRGIEVYIFNGDKP 236 (252) T ss_pred HCCCEEEEEECCCEEECCCC---CCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 38786999956874646999---998386450732499998736877787627739999999999978994999899997 Q ss_pred CHHHHHHCCCCCCCEE Q ss_conf 4799997799878246 Q gi|254780777|r 224 GGIWGGLSGIGRSTII 239 (242) Q Consensus 224 ~~i~~~l~Ge~~GT~I 239 (242) ++|.+++.|+.+||+| T Consensus 237 ~~i~~~l~g~~vGT~I 252 (252) T cd04241 237 ENLYRALLGNFIGTRI 252 (252) T ss_pred CHHHHHHCCCCCCCCC T ss_conf 6799997799998999 No 26 >PRK06291 aspartate kinase; Provisional Probab=100.00 E-value=9.2e-36 Score=240.15 Aligned_cols=219 Identities=21% Similarity=0.240 Sum_probs=162.8 Q ss_pred EEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC--CHHHHHHHCC---CC-------------- Q ss_conf 9999940036853887770899999999999999976990999823654--1023344203---77-------------- Q gi|254780777|r 8 RVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN--IFRGSQVVAE---NY-------------- 68 (242) Q Consensus 8 rIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~--~~~~~~~~~~---~~-------------- 68 (242) |||.||||||+.+ .++++++++.|+...+++.++++|+|+.. +-++...... .. T Consensus 2 ~iV~KFGGTSva~-------~~~i~~v~~iI~~~~~~~~~~vVVvSA~~gvTd~L~~~~~~~~~~~~~~~~~~~~~~i~~ 74 (466) T PRK06291 2 RLVMKFGGTSVGD-------GERIRHVAKLVKKYREEGNEVVVVVSAMTGVTDRLLEIAEQALDVGDIAKVNDFIAELRE 74 (466) T ss_pred CEEEEECCCCCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 7899738423798-------999999999987677658989999679999789999999998747787889999999999 Q ss_pred ----------------------------------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf ----------------------------------------7754024678888898999999999997618864113400 Q gi|254780777|r 69 ----------------------------------------LLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSI 108 (242) Q Consensus 69 ----------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 108 (242) .....+..+.....+...+..++...+...|++....... T Consensus 75 ~h~~~~~~l~~~~~~~~e~~~~l~~~~~~l~~~l~~~~~~~~~s~~~~d~i~s~GE~~Sa~l~a~~L~~~Gi~a~~ld~~ 154 (466) T PRK06291 75 RHEIAIEEAIEAPDIRVEVSKTIDSRIEELEKALVGVSYLGELTPRSRDYILSFGERLSAPILSGALRDLGIPSLHLTGG 154 (466) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEHH T ss_conf 99999998502755689999999999999999876665530168177889986899999999999999679984998868 Q ss_pred HHHHH-------------HHHHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCC Q ss_conf 10024-------------5665432100001100111112465---------5677762157888886416446651475 Q gi|254780777|r 109 FMPQV-------------CEVFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQ 166 (242) Q Consensus 109 ~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~td 166 (242) +.... ............+....++++.|+. .+.+|++|++|+++|.+++|+.+.+||| T Consensus 155 ~~~i~t~~~~~~a~~~~~~~~~~~~~~~~~~~~~~v~vv~GFig~~~~G~~ttLGRGGSD~TAa~lA~~l~A~~~~IwTD 234 (466) T PRK06291 155 EAGIITDDNFGNARPLPKTYERIKERLGPLLKEGVIPVVTGFIGETEEGIITTLGRGGSDYSAAIIGAALDADEIWIWTD 234 (466) T ss_pred HCEEEECCCCCCEEECHHHHHHHHHHHHHHHCCCCEEEECCCCEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEEC T ss_conf 89177147876511553306889998877740483689736104889997787079961899999999578869999405 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 43512343223444444441159999855---99814099999999749969995189854799997799878246689 Q gi|254780777|r 167 VDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 167 vdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) |||+||+||+.+|+|++|+++||+|+.++ |++|+||+|++++++++||++|.|.++| +..||+|+++ T Consensus 235 V~Gi~taDPriV~~A~~i~~lsy~EA~ELa~~GakVlHp~ti~p~~~~~IPi~I~nt~~P---------~~~GT~I~~~ 304 (466) T PRK06291 235 VDGVMTADPRIVPEARTIPKISYIEAMELSYFGAKVLHPRTIEPAMEKGIPVRVKNTFNP---------EFPGTLITSE 304 (466) T ss_pred CCCEEECCCCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCC---------CCCCEEEECC T ss_conf 673471699878777543745999999999758764567789999980993899826898---------9997289647 No 27 >KOG1154 consensus Probab=100.00 E-value=4.3e-36 Score=242.19 Aligned_cols=239 Identities=17% Similarity=0.180 Sum_probs=181.0 Q ss_pred CCCCC-EEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCC--------- Q ss_conf 68882-49999940036853887770899999999999999976990999823654102334420377775--------- Q gi|254780777|r 2 SDFPY-KRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLC--------- 71 (242) Q Consensus 2 ~~~~~-KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~--------- 71 (242) +.+|+ ||||+|+|+++++++++.++...++..|++|++++++.|++|++|+|||................ T Consensus 4 ~~~kka~rIVVKLGSavit~e~~~~laLgrla~IVEqV~~L~~~G~evilVSSGaVA~G~qrLr~~~~~s~s~r~~l~~~ 83 (285) T KOG1154 4 AFLKKAYRIVVKLGSAVITREDTCGLALGRLASIVEQVSELQRMGREVILVSSGAVAFGRQRLRQELLPSSSMRQTLKPQ 83 (285) T ss_pred HHCCCCEEEEEEECCEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCHHHHHHHCCC T ss_conf 01020049999814468877987540478999999999999856855999923525550777510203435678752675 Q ss_pred -CHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHH---HHCCHHHHCCCCCCCCCC-----CCCC Q ss_conf -4024678888898999999999997618864113400100245665432---100001100111112465-----5677 Q gi|254780777|r 72 -ERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVCEVFSCR---NAVSYLSQGKVVIFSGGT-----GNAF 142 (242) Q Consensus 72 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~-----~~~~ 142 (242) +..+..+....++..++.+|...|.++++...+......+...+..+++ .....+..+.+|++++.+ +.+| T Consensus 84 ~~l~e~rA~AAvGQ~~Lmalye~lF~Qy~~~~AQvLvT~~Di~d~~~r~Nl~~Ti~eLL~m~viPIvNeNDavs~~~~~~ 163 (285) T KOG1154 84 SELAEKRACAAVGQSGLMALYETLFTQYGITIAQVLVTRNDILDEQQRKNLQNTISELLSMNVIPIVNENDAVSPREIPF 163 (285) T ss_pred CCHHHHHHHHHHCCCHHHHHHHHHHHHHCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCC T ss_conf 41146788887275218999999999837530035541643220899988999999998579244543788647763246 Q ss_pred CC---CHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHH-----------HHCCCCCCCHHHHHHH Q ss_conf 76---215788888641644665147543512343223444444441159999-----------8559981409999999 Q gi|254780777|r 143 LT---TDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQF-----------IEKGLKVMDCASVVLA 208 (242) Q Consensus 143 ~~---~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~-----------l~~G~~v~~~~A~~~a 208 (242) +| ||++|+++|.+++||+|+++|||||+||.+| ..+.+++++..+..+- .+.|+|-.|+.|+-+| T Consensus 164 ~D~~dNDsLsA~laaei~ADlLilLsDVdglYt~PP-d~~~~~li~~~~~~~~~v~~tfG~~S~vGtGGM~tKv~AA~~A 242 (285) T KOG1154 164 GDSSDNDSLAAILAAEIKADLLILLSDVDGLYTGPP-DADPSKLIHTFSPGDPQVSTTFGSKSKVGTGGMETKVKAAVNA 242 (285) T ss_pred CCCCCCCHHHHHHHHHHCCCEEEEEECCCCCCCCCC-CCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 888765179999998742677999852561335999-8675334421046778876446666765767611558888777 Q ss_pred HHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 974996999518985479999779987824668 Q gi|254780777|r 209 RDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 209 ~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) .+.|++++|.||..|.+|.+++.|..+||.++. T Consensus 243 ~~~Gv~viI~~g~~p~~I~~iv~g~kvgt~f~~ 275 (285) T KOG1154 243 LNAGVSVIITNGDAPENITDIVEGKKVGTFFEQ 275 (285) T ss_pred HCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHH T ss_conf 607914999479866778988645112014330 No 28 >cd02115 AAK Amino Acid Kinases (AAK) superfamily, catalytic domain; present in such enzymes like N-acetylglutamate kinase (NAGK), carbamate kinase (CK), aspartokinase (AK), glutamate-5-kinase (G5K) and UMP kinase (UMPK). The AAK superfamily includes kinases that phosphorylate a variety of amino acid substrates. These kinases catalyze the formation of phosphoric anhydrides, generally with a carboxylate, and use ATP as the source of the phosphoryl group; are involved in amino acid biosynthesis. Some of these kinases control the process via allosteric feed-back inhibition. Probab=100.00 E-value=2.5e-35 Score=237.51 Aligned_cols=222 Identities=24% Similarity=0.309 Sum_probs=170.1 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHH-HHHCC-----CCCCCCHHHHHHHHHHH Q ss_conf 9994003685388777089999999999999997699099982365410233-44203-----77775402467888889 Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGS-QVVAE-----NYLLCERSTVDSMGMLS 83 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~-~~~~~-----~~~~~~~~~~~~~~~~~ 83 (242) |+|||||+|++ .+.+++++++|+.+++.|+++++||||+...... ..... ..........+.....+ T Consensus 1 ViK~GGssl~~-------~~~~~~~~~~i~~l~~~g~~~vvV~sa~~~it~~l~~~~~~~~~~~~~~~t~~~~~~i~~~g 73 (248) T cd02115 1 VIKFGGSSVSS-------EERLRNLARILVKLASEGGRVVVVHGAGPQITDELLAHGELLGYARGLRITDRETDALAAMG 73 (248) T ss_pred CEEECHHHCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 99889136069-------99999999999999978999899969844532999986535433136789989999999999 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH---------HHHHHHHHCCHHHHCCCCCCCCC--------CCCCCCCCH Q ss_conf 8999999999997618864113400100245---------66543210000110011111246--------556777621 Q gi|254780777|r 84 TVINALALDLALRKINVPTVILSSIFMPQVC---------EVFSCRNAVSYLSQGKVVIFSGG--------TGNAFLTTD 146 (242) Q Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~~D 146 (242) ......++...+...+.........+..... ...........++.+.+|++.++ ..+.++++| T Consensus 74 ~~~~~~~l~~~L~~~g~~a~~l~~~~~~~~~~~~~~~~~v~~~~~~~l~~~l~~~~ipVi~g~~~~~~~~~~~l~~~~sD 153 (248) T cd02115 74 EGMSNLLIAAALEQHGIKAVPLDLTQAGFASPNQGHVGKITKVSTDRLKSLLENGILPILSGFGGTDEKETGTLGRGGSD 153 (248) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEEEECCEECCCCCCCCCCCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCCEECCCCCC T ss_conf 99999999999985699816986553233136644432102268999998774167668136345556777633078878 Q ss_pred HHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH---CCCCCCCHHHHHHHHHCCCCEEEEECCCC Q ss_conf 578888864164466514754351234322344444444115999985---59981409999999974996999518985 Q gi|254780777|r 147 SAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE---KGLKVMDCASVVLARDCSIPIIVFSIHSP 223 (242) Q Consensus 147 ~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~---~G~~v~~~~A~~~a~~~gI~v~I~ng~~~ 223 (242) ++|+.+|.+++|+.+++||||||+||+||+.+|+|+++++++++|+.+ .|+|++|++|+..|.++|||++|.|+++| T Consensus 154 ~~Aa~lA~~l~A~~lii~TDV~Gv~~~dp~~~~~a~~i~~ls~~ea~el~~~G~~v~~~~a~~~a~~~gi~v~I~n~~~~ 233 (248) T cd02115 154 STAALLAAALKADRLVILTDVDGVYTADPRKVPDAKLLSELTYEEAAELAYAGAMVLKPKAADPAARAGIPVRIANTENP 233 (248) T ss_pred HHHHHHHHHHCCCHHHHEECCCCCEECCCCCCCCCEECCCCCHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 99999999949665531045786161899868867588653899999998559967489999999987997999139997 Q ss_pred CHHHHHHCCCCCCCEE Q ss_conf 4799997799878246 Q gi|254780777|r 224 GGIWGGLSGIGRSTII 239 (242) Q Consensus 224 ~~i~~~l~Ge~~GT~I 239 (242) +.+ .++.++..||+| T Consensus 234 ~~l-elf~~~~~GTlI 248 (248) T cd02115 234 GAL-ALFTPDGGGTLI 248 (248) T ss_pred CHH-HHCCCCCCCCCC T ss_conf 533-411799996819 No 29 >PRK08961 bifunctional aspartate kinase/diaminopimelate decarboxylase protein; Provisional Probab=100.00 E-value=1.4e-34 Score=232.80 Aligned_cols=225 Identities=18% Similarity=0.135 Sum_probs=155.3 Q ss_pred CCCCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHH-HCCCEEEEEECCCCC--HHHHHHHCCC----------- Q ss_conf 68882499999400368538877708999999999999999-769909998236541--0233442037----------- Q gi|254780777|r 2 SDFPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVY-AKGIEIGIVVGGGNI--FRGSQVVAEN----------- 67 (242) Q Consensus 2 ~~~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~-~~g~~vviV~sg~~~--~~~~~~~~~~----------- 67 (242) +..-.|=||+||||||+.+ .++++++++.|.... +.|.++++|+|+.+. -.+....... T Consensus 3 ~~~~~~wVV~KFGGTSVas-------~~~~~~Va~iI~~~~~~~g~rvvVVvSAm~gvTn~L~~la~~a~~~~~~l~~i~ 75 (865) T PRK08961 3 SPSADRWVVLKFGGTSVSR-------RHRWDTIAKLARKRLNETGGRVLVVVSALSGVTNELTAIADGAADSASRVAALE 75 (865) T ss_pred CCCCCCEEEECCCCCCCCC-------HHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHCCHHHHHHHH T ss_conf 8888987998628466898-------999999999999999850998899985999863899999986421056788999 Q ss_pred ---------CCC------------------------CCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf ---------777------------------------54024678888898999999999997618864113400100245 Q gi|254780777|r 68 ---------YLL------------------------CERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVC 114 (242) Q Consensus 68 ---------~~~------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 114 (242) ... ...+..+.....+.+++..++...+...|++.......+..... T Consensus 76 ~~h~~i~~eL~~~~~~~l~~~l~~L~~ll~~i~~~~~s~~~~d~ilS~GE~lSt~l~a~~L~~~Gi~a~~lDar~~l~t~ 155 (865) T PRK08961 76 QRHREFLAELELDAQAVLANRLAALQALLTDARAATRALDWQAEVLGQGELLSSRLGAAYLHASGLDMGWLDAREWLTAL 155 (865) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCEEEHHHHHHHHHHHHHHHHCCCCCEEECHHHHHHCC T ss_conf 99999999741005678999999999998666640578356345861737999999999999679981785689960014 Q ss_pred ----------HH---HHHHH-----HCCHH-HHCCCCCCCCC---------CCCCCCCCHHHHHHHHHHHCHHHHHHCCC Q ss_conf ----------66---54321-----00001-10011111246---------55677762157888886416446651475 Q gi|254780777|r 115 ----------EV---FSCRN-----AVSYL-SQGKVVIFSGG---------TGNAFLTTDSAAALRASEIGADVILKGTQ 166 (242) Q Consensus 115 ----------~~---~~~~~-----~~~~~-~~~~~~~~~~~---------~~~~~~~~D~~a~~ia~~~~ad~l~i~td 166 (242) .. ..... ..... ....++++.|+ +.+.+|++|++|+++|.+++|+.+.|||| T Consensus 156 ~~~~~~~~~~~~l~~~~~~~~~~~l~~~~~~~~~~vvVtqGFiG~~~~G~tTTLGRGGSDySAallAaaL~A~~veIWTD 235 (865) T PRK08961 156 PPQPNQSEWSQRLSVSCQWQSDAGWRARFAAQPAQVLITQGFIARHADGGTAILGRGGSDTSAAYFGALLGAARVEIWTD 235 (865) T ss_pred CCCCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCCCEEEECCCEEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEEC T ss_conf 67676430344433333330257889887505982799518335169998777079855899999999749988999858 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 43512343223444444441159999855---99814099999999749969995189854799997799878246689 Q gi|254780777|r 167 VDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 167 vdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) |||+||+|||.+|+|++|+++||+|+.|+ |++|+||+++++|+++|||++|.|.++| |..||+|+.+ T Consensus 236 VdGI~TaDPRiVp~Ar~I~~LSy~EA~ELA~fGAKVLHP~Ti~Pa~~~~IPv~VknT~~P---------e~~GT~I~~~ 305 (865) T PRK08961 236 VPGMFSANPKEVPDARLLTRLDYYEAQEIATTGAKVLHPRSIKPCRDAGVPMAILDTERP---------DLSGTSIDGN 305 (865) T ss_pred CCCCCCCCCCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCC---------CCCCCEEECC T ss_conf 987866899878886587813899999999768752486667999976996798178998---------9997278489 No 30 >cd04244 AAK_AK-LysC-like AAK_AK-LysC-like: Amino Acid Kinase Superfamily (AAK), AK-LysC-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive AK isoenzyme found in higher plants. The lysine-sensitive AK isoenzyme is a monofunctional protein. It is involved in the overall regulation of the aspartate pathway and can be synergistically inhibited by S-adenosylmethionine. Also included in this CD is an uncharacterized LysC-like AK found in Euryarchaeota and some bacteria. AK catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. Probab=100.00 E-value=4.3e-34 Score=229.86 Aligned_cols=216 Identities=21% Similarity=0.255 Sum_probs=157.1 Q ss_pred EEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC--HHHHHHHCC---C--------------- Q ss_conf 99999400368538877708999999999999999769909998236541--023344203---7--------------- Q gi|254780777|r 8 RVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNI--FRGSQVVAE---N--------------- 67 (242) Q Consensus 8 rIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~--~~~~~~~~~---~--------------- 67 (242) |||+||||||+.+ .+.++++++.|.+.. +++++++|+|+... -.+...... + T Consensus 1 riV~KFGGtSva~-------~~~i~~v~~II~~~~-~~~~~vVVVSA~~gvTn~L~~~~~~a~~~~~~~~~~~~~~i~~~ 72 (298) T cd04244 1 RLVMKFGGTSVGS-------AERIRHVADLVGTYA-EGHEVVVVVSAMGGVTDRLLLAAEAAVSGRIAGVKDFIEILRLR 72 (298) T ss_pred CEEEEECCCCCCC-------HHHHHHHHHHHHHHC-CCCCEEEEECCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 9579849101189-------999999999985546-69998999779999889999999998526767899999999999 Q ss_pred -------------------------------------CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHH Q ss_conf -------------------------------------7775402467888889899999999999761886411340010 Q gi|254780777|r 68 -------------------------------------YLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFM 110 (242) Q Consensus 68 -------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 110 (242) ......+..+.....+...+..++...+...|++.......+. T Consensus 73 h~~~~~~l~~~~~~~~~~~~l~~~~~~l~~~l~~i~~~~e~s~~~~D~ils~GE~lSa~l~a~~L~~~Gi~a~~~d~~~~ 152 (298) T cd04244 73 HIKAAKEAISDEEIAEVESIIDSLLEELEKLLYGIAYLGELTPRSRDYIVSFGERLSAPIFSAALRSLGIKARALDGGEA 152 (298) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHC T ss_conf 99999985365768999999999999999999877653117868899999878899999999999967999799547987 Q ss_pred HHHHH-------------HHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCC Q ss_conf 02456-------------65432100001100111112465---------567776215788888641644665147543 Q gi|254780777|r 111 PQVCE-------------VFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQVD 168 (242) Q Consensus 111 ~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvd 168 (242) ..... ..........+....++++.|+. .+.++++|++|+++|.+++|+.+.|||||+ T Consensus 153 ~i~t~~~~~~a~~~~~~~~~~~~~~~~~~~~~~i~Vv~GFig~~~~G~~ttLGRgGSDyTAailA~~l~A~~~~IwtDV~ 232 (298) T cd04244 153 GIITDDNFGNARPLPATYERVRKRLLPMLEDGKIPVVTGFIGATEDGAITTLGRGGSDYSATIIGAALDADEIWIWKDVD 232 (298) T ss_pred CEECCCCCCCEEECHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 55047876540033521999999988875059289993825558987665558994789999999972887899943988 Q ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 512343223444444441159999855---998140999999997499699951898547999977998782466 Q gi|254780777|r 169 GVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 169 Giyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) |+||+||+.+|+|++|+++||+|+.++ |++|+||++++++++++||++|.|.++| +..||+|| T Consensus 233 Gi~taDPr~v~~a~~i~~lsy~EA~ELa~~GakVlHP~ti~p~~~~~IPi~Iknt~~p---------~~~GT~Is 298 (298) T cd04244 233 GVMTADPRIVPEARTIPRLSYAEAMELAYFGAKVLHPRTVEPAMEKGIPVRVKNTFNP---------EAPGTLIT 298 (298) T ss_pred CEEECCCCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCC---------CCCCEEEC T ss_conf 3260799768887574732899999999679740599999999987996899389998---------99981559 No 31 >cd04259 AAK_AK-DapDC AAK_AK-DapDC: Amino Acid Kinase Superfamily (AAK), AK-DapDC; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the bifunctional enzyme AK - DAP decarboxylase (DapDC) found in some bacteria. Aspartokinase is the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. DapDC, which is the lysA gene product, catalyzes the decarboxylation of DAP to lysine. Probab=100.00 E-value=8e-34 Score=228.18 Aligned_cols=216 Identities=18% Similarity=0.133 Sum_probs=157.1 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC--CHHHHHHHC--------------------- Q ss_conf 999940036853887770899999999999999976990999823654--102334420--------------------- Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN--IFRGSQVVA--------------------- 65 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~--~~~~~~~~~--------------------- 65 (242) ||.|||||||.+ .+.++++++.|....+.+.++++|+|+.. +-.+..... T Consensus 2 iV~KFGGTSv~~-------~~~~~~v~~ii~~~~~~~~~~vvVVSA~~gvTd~L~~~~~~~~~~~~~~~~~~i~~~h~~~ 74 (295) T cd04259 2 VVLKFGGTSVSS-------RARWDTIAKLAQKHLNTGGQPLIVCSALSGISNKLEALIDQALLDEHHSLFNAIQSRHLNL 74 (295) T ss_pred EEEEECHHCCCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 899768010587-------9999999999998874799989997999998368999999875364466999999999999 Q ss_pred --------------------------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHH---- Q ss_conf --------------------------37777540246788888989999999999976188641134001002456---- Q gi|254780777|r 66 --------------------------ENYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVCE---- 115 (242) Q Consensus 66 --------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 115 (242) ........+..+.....+.+.+..++...+...|+++......+...... T Consensus 75 ~~~l~~~~~~~l~~~~~~l~~~l~~i~~~~~~~~~~~d~i~s~GE~lSa~l~a~~L~~~Gi~a~~ldar~~i~~~~~~~~ 154 (295) T cd04259 75 AEQLEVDADALLANDLAQLQRWLTGISLLKQASPRTRAEVLALGELMSTRLGAAYLEAQGLKVKWLDARELLTATPTLGG 154 (295) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHEECCCCCCC T ss_conf 99852147789999999999999876640479888999999998999999999999967998379648992000467675 Q ss_pred -----------HHH--HHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCC Q ss_conf -----------654--32100001100111112465---------56777621578888864164466514754351234 Q gi|254780777|r 116 -----------VFS--CRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSA 173 (242) Q Consensus 116 -----------~~~--~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~ 173 (242) ... .+......+...++++.|+. .+.+|++|++|+++|.+++|+.+.|||||||+||+ T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~v~v~~GFig~~~~G~~tTLGRgGSDyTAa~ia~~l~A~~v~IwtDV~Gi~ta 234 (295) T cd04259 155 ETMNYLSARCESEYADALLQKRLADGAQLIITQGFIARNAHGETVLLGRGGSDTSAAYFAAKLQAARCEIWTDVPGLFTA 234 (295) T ss_pred CCHHHHHHHHHHHHCHHHHHHHHCCCCEEEEECCCCEECCCCCEEEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCC T ss_conf 31123443565641549899875149859996784345699987772589730589999997499879999777767568 Q ss_pred CCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 3223444444441159999855---998140999999997499699951898547999977998782466 Q gi|254780777|r 174 DPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 174 dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) ||+.+|+|++|+++||+|+.++ |++|+||++++++.+.+||++|.|.++| +..||+|| T Consensus 235 DPr~v~~A~~i~~lsy~EA~eLa~~GakVlHp~t~~p~~~~~Ipi~Irnt~~p---------~~~GT~Is 295 (295) T cd04259 235 NPHEVPHARLLKRLDYDEAQEIATMGAKVLHPRCIPPARRANIPMVVRSTERP---------ELSGTLIT 295 (295) T ss_pred CCCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCC---------CCCCEEEC T ss_conf 98768888465733899999999679841689999999984991899289999---------99982669 No 32 >PRK05925 aspartate kinase; Provisional Probab=100.00 E-value=4.6e-34 Score=229.66 Aligned_cols=216 Identities=18% Similarity=0.138 Sum_probs=150.4 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC--CCHHHHHHHC------------------ Q ss_conf 24999994003685388777089999999999999997699099982365--4102334420------------------ Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG--NIFRGSQVVA------------------ 65 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~--~~~~~~~~~~------------------ 65 (242) |+.||.||||||+.+ .++++++++.|.+ +.+.++|+|+. .+-.+..... T Consensus 1 M~~iV~KFGGtSv~~-------~e~i~~va~ii~~----~~~~~vVVSA~~gvTd~L~~~~~~~~~~~~~~~~~l~~~h~ 69 (440) T PRK05925 1 MAPLVYKFGGTSLGT-------AESIRRVCDIICK----EKPSFVVVSAVAGVTDLLEEFCRVSKGKREQITEKIREKHE 69 (440) T ss_pred CCEEEEEECCCCCCC-------HHHHHHHHHHHHH----CCCCEEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 986999379623389-------9999999999971----59978999699997089999987620107899999999999 Q ss_pred -----C-----------------CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHH-------H--- Q ss_conf -----3-----------------7777540246788888989999999999976188641134001002-------4--- Q gi|254780777|r 66 -----E-----------------NYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQ-------V--- 113 (242) Q Consensus 66 -----~-----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~--- 113 (242) . ..........+.....+...+..++...+...|.+.......+... . T Consensus 70 ~i~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~ils~GE~~Sa~lla~~l~~~g~~a~~~da~~~i~td~~~~~a~~~ 149 (440) T PRK05925 70 EIAKELGIEFSLSPWWERLEHFEDKEEISSEDQARILAIGEDISASLICAYCCTNVLPLEFLEARQVILTDDQYLRATPD 149 (440) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCEEEEEECCCCCCCCCC T ss_conf 99987254212468999999876404488477888988729999999999999679982884303289826844330335 Q ss_pred HHHHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 5665432100001100111112465---------5677762157888886416446651475435123432234444444 Q gi|254780777|r 114 CEVFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRF 184 (242) Q Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i 184 (242) ............+....++++.|+. .+.+|++|++|+++|.+++|+.|.|||||||+||+||+.+|+|++| T Consensus 150 ~~~~~~~~~~~~~~~~~v~Vv~GF~g~~~~G~~ttLGRGGSD~sA~~la~al~A~~~~IwTDV~Gv~taDPriV~~A~~i 229 (440) T PRK05925 150 LALMQTNWHELNLQEDAIYIMQGFIGANSSGKTTVLGRGGSDFSASLIAELCKAREVRIYTDVNGIYTMDPKIIKDAQLI 229 (440) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCEEEECCCCCCHHHHHHHHHCCCCEEEEECCCCCEECCCCCCCCCCCCC T ss_conf 88888889987522796899616300189997776169973089999999617849998316573374799868888566 Q ss_pred CCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 41159999855---9981409999999974996999518985479999779987824668 Q gi|254780777|r 185 DSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 185 ~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) +++||+|++++ |++|+||+++++|+++|||++|.|.++| ++.||+|++ T Consensus 230 ~~lsy~Ea~ELa~~GAkVlHprtv~pa~~~~Ipi~i~nt~~p---------~~~GT~I~~ 280 (440) T PRK05925 230 PELSFEEMQNLASFGAKVLHPPMLKPCVRAGIPIFVTSTFDV---------TKGGTWIYA 280 (440) T ss_pred CCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCC---------CCCCEEEEE T ss_conf 630899999999769640489999999983995899678999---------999538985 No 33 >cd04234 AAK_AK AAK_AK: Amino Acid Kinase Superfamily (AAK), Aspartokinase (AK); this CD includes the N-terminal catalytic domain of aspartokinase (4-L-aspartate-4-phosphotransferase;). AK is the first enzyme in the biosynthetic pathway of the aspartate family of amino acids (lysine, threonine, methionine, and isoleucine) and the bacterial cell wall component, meso-diaminopimelate. It also catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. One mechanism for the regulation of this pathway is by the production of several isoenzymes of aspartokinase with different repressors and allosteric inhibitors. Pairs of ACT domains are proposed to specifically bind amino acids leading to allosteric regulation of the enzyme. In Escherichia coli, three different aspartokinase isoenzymes are regulated specifically by lysine, methionine, and threonine. AK-HSDHI (ThrA) and AK-HSDHII (MetL) are bifunctional enzymes that consist of an N-terminal AK and a C-terminal homoserine dehyd Probab=100.00 E-value=6.5e-34 Score=228.73 Aligned_cols=202 Identities=26% Similarity=0.290 Sum_probs=155.4 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC--CHHHHHHHCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 999940036853887770899999999999999976990999823654--102334420377775402467888889899 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN--IFRGSQVVAENYLLCERSTVDSMGMLSTVI 86 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (242) +|+|||||||.+ .+.++++++.|.+. .+++++++|+|+.. +-++.. .......+.+. T Consensus 2 ~V~KFGGtSv~~-------~~~i~~v~~ii~~~-~~~~~~vVVvSA~~g~Td~L~~-------------~a~ils~GE~l 60 (227) T cd04234 2 VVQKFGGTSVAS-------AERIKRVADIIKAY-EKGNRVVVVVSAMGGVTDLLIE-------------LALLLSFGERL 60 (227) T ss_pred EEEEECCCCCCC-------HHHHHHHHHHHHHH-HCCCCEEEEECCCCCCHHHHHH-------------HHHHHHHHHHH T ss_conf 899678301799-------99999999999865-4389889997388432899999-------------99999299999 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHHH----------HHHHHHCCHHHHCCCCCCCCC---------CCCCCCCCHH Q ss_conf 999999999761886411340010024566----------543210000110011111246---------5567776215 Q gi|254780777|r 87 NALALDLALRKINVPTVILSSIFMPQVCEV----------FSCRNAVSYLSQGKVVIFSGG---------TGNAFLTTDS 147 (242) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~D~ 147 (242) +..++...+...|++.......+....... ...+....+.+...++++.++ +.+.+|++|+ T Consensus 61 Sa~lla~~L~~~Gi~a~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~v~Vv~GF~g~~~~G~~tTLGRGGSD~ 140 (227) T cd04234 61 SARLLAAALRDRGIKARSLDARQAGITTDDNHGAARIIEISYERLKELLAEIGKVPVVTGFIGRNEDGEITTLGRGGSDY 140 (227) T ss_pred HHHHHHHHHHHCCCCEEEECHHHCEEECCCCCCCEEEHHHCHHHHHHHHHHCCCEEEECCEEEECCCCCEEEECCCCHHH T ss_conf 99999999997899719964157206416876520645428999999997389679962604540588468838885678 Q ss_pred HHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 788888641644665147543512343223444444441159999855---99814099999999749969995189854 Q gi|254780777|r 148 AAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPG 224 (242) Q Consensus 148 ~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~ 224 (242) +|+++|.+++|+.|.+||||||+||+||+.+|+|++++++||+|++++ |++|+|++++++|++++||++|.|.++| T Consensus 141 TA~~lA~~l~A~~~~I~TDVdGi~taDPr~v~~A~~i~~lsy~Ea~ela~~GakVlHp~ti~pa~~~~Ipi~i~nt~~p- 219 (227) T cd04234 141 SAAALAAALGADEVEIWTDVDGIYTADPRIVPEARLIPEISYDEALELAYFGAKVLHPRAVEPARKANIPIRVKNTFNP- 219 (227) T ss_pred HHHHHHHHHCCHHHEEEECCCEEECCCCCCCCCCEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCEEEEECCCCC- T ss_conf 9999999818602222126870472799747886240441999999999779741699999999986983999289999- Q ss_pred HHHHHHCCCCCCCEEC Q ss_conf 7999977998782466 Q gi|254780777|r 225 GIWGGLSGIGRSTIIS 240 (242) Q Consensus 225 ~i~~~l~Ge~~GT~I~ 240 (242) +..||+|+ T Consensus 220 --------~~~GT~Is 227 (227) T cd04234 220 --------EAPGTLIT 227 (227) T ss_pred --------CCCCEEEC T ss_conf --------99981769 No 34 >PRK09436 thrA bifunctional aspartokinase I/homeserine dehydrogenase I; Provisional Probab=100.00 E-value=7.3e-34 Score=228.41 Aligned_cols=216 Identities=21% Similarity=0.259 Sum_probs=153.9 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC--CCHHHHHHHC---CCC---------------- Q ss_conf 9994003685388777089999999999999997699099982365--4102334420---377---------------- Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG--NIFRGSQVVA---ENY---------------- 68 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~--~~~~~~~~~~---~~~---------------- 68 (242) |+|||||||.+ .++++++++.|....+.+ ++++|+|+. .+-++..... .+. T Consensus 3 V~KFGGTSva~-------ae~i~~Va~II~~~~~~~-~vvVVVSA~ggvTd~Ll~~~~~a~~~~~~~~~l~~i~~~h~~~ 74 (817) T PRK09436 3 VLKFGGTSLAN-------AERFLRVADIIESNARQE-QVAVVLSAPAKVTNHLVAMAEKAAGGQDYLPNLSDAERIFAEL 74 (817) T ss_pred EEEECCCCCCC-------HHHHHHHHHHHHHHCCCC-CEEEEECCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 88757156789-------999999999997534389-9899983999868999999998854886789999999999999 Q ss_pred -------------------------------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHH Q ss_conf -------------------------------------7754024678888898999999999997618864113400100 Q gi|254780777|r 69 -------------------------------------LLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMP 111 (242) Q Consensus 69 -------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 111 (242) ...+.+..+.....+.+.+..++...+...|++.......+.. T Consensus 75 ~~~l~~~~~~~~~~~l~~~l~~~~~~L~~~l~gi~ll~e~s~~~~d~Ils~GE~lSa~lla~~L~~~Gi~a~~lDar~~i 154 (817) T PRK09436 75 LRGLAAALPGFDLAQLKAKLDQEFAELKDLLHGISLLGECPDKVNAAIISRGERLSIAIMSAVLEARGHDATVIDPRELL 154 (817) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHE T ss_conf 99986433420257899999999999999987665530388577789975869999999999999689980997748907 Q ss_pred HH----------HHHHHHHHHCCHHHHCCCCCCCCC---------CCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCC Q ss_conf 24----------566543210000110011111246---------55677762157888886416446651475435123 Q gi|254780777|r 112 QV----------CEVFSCRNAVSYLSQGKVVIFSGG---------TGNAFLTTDSAAALRASEIGADVILKGTQVDGVYS 172 (242) Q Consensus 112 ~~----------~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt 172 (242) .. ......+......+...++++.|+ +.+.+|++|++|+++|.+++|+.|.+||||||+|| T Consensus 155 ~td~~~~~a~vd~~~s~~~i~~~~~~~~~i~Vv~GFig~~~~G~~tTLGRGGSDyTAailAa~L~A~~~eIwTDVdGi~T 234 (817) T PRK09436 155 LADGHYLESTVDIAESTRRIAASFIPADHVVLMPGFTAGNEKGELVTLGRNGSDYSAAVLAACLDADCCEIWTDVDGVYT 234 (817) T ss_pred EECCCCCCEEECHHHHHHHHHHHHCCCCCEEEECCEEEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCEEE T ss_conf 83587776556667649999986345797899347112079998777069861789999999719866999846773270 Q ss_pred CCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 43223444444441159999855---99814099999999749969995189854799997799878246689 Q gi|254780777|r 173 ADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 173 ~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) +||+.+|+|++|+++||+|++++ |++|+||++++++.+++||++|.|.++| +..||+|+++ T Consensus 235 aDPr~V~~A~~I~~lSY~EA~ELA~fGAkVLHP~Ti~P~~~~~IPi~IkNTf~P---------~~~GT~I~~~ 298 (817) T PRK09436 235 ADPRVVPDARLLKSLSYQEAMELSYFGAKVLHPRTIAPIAQFQIPCLIKNTFNP---------QAPGTLIGAE 298 (817) T ss_pred CCCCCCCCCEEEEEECHHHHHHHHHCCCCEECHHHHHHHHHCCCEEEEECCCCC---------CCCCEEEECC T ss_conf 699877666575163699999998479950266778899874965999657788---------9997488425 No 35 >PRK09466 metL bifunctional aspartate kinase II/homoserine dehydrogenase II; Provisional Probab=100.00 E-value=5.8e-34 Score=229.06 Aligned_cols=220 Identities=19% Similarity=0.124 Sum_probs=152.0 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC-CCHHHHHHHCC----------------- Q ss_conf 824999994003685388777089999999999999997699099982365-41023344203----------------- Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG-NIFRGSQVVAE----------------- 66 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~-~~~~~~~~~~~----------------- 66 (242) -|.|.|.||||||+.+ .++++++++.|.+..+. .++|+||..+ .+-++...... T Consensus 9 ~m~r~V~KFGGTSVa~-------ae~i~~Va~II~~~~~~-~~vVVVSA~ggvTd~Li~~~~~a~~~~~~~~~~~~~l~~ 80 (810) T PRK09466 9 AMGRQLHKFGGSSLAD-------AKCYRRVAGILAEYSQP-DDLVVVSAAGKTTNQLIKWLKLSQTDRLSAHEVLQTLRR 80 (810) T ss_pred CCCCEEECCCCCCCCC-------HHHHHHHHHHHHHCCCC-CCEEEECCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 3226897518665689-------99999999999735789-939998499998189999999875386057899999999 Q ss_pred --------------------------------CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf --------------------------------777754024678888898999999999997618864113400100245 Q gi|254780777|r 67 --------------------------------NYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVC 114 (242) Q Consensus 67 --------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 114 (242) ..........+.....+.+.+..++...+...|++............. T Consensus 81 ~h~~li~~ll~~~~~~~l~~~l~~~l~~l~~~l~~~~~~~~~d~ils~GE~lSa~Lla~~L~~~Gi~a~~ldar~~i~~d 160 (810) T PRK09466 81 YQQDLIEGLLSAEQARSLLSRLISDLERLAALLDGGINDAQYAEVVGHGEVWSARLLAALLNQQGLAAAWLDARSFLRAE 160 (810) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHEEEC T ss_conf 99999997506234699999999999999997515688678767733608999999999999679984895679905616 Q ss_pred H--------HHHHHHHCCHHH--HCCCCCCCCC---------CCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCC Q ss_conf 6--------654321000011--0011111246---------55677762157888886416446651475435123432 Q gi|254780777|r 115 E--------VFSCRNAVSYLS--QGKVVIFSGG---------TGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADP 175 (242) Q Consensus 115 ~--------~~~~~~~~~~~~--~~~~~~~~~~---------~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP 175 (242) . ..........+. ...++++.|+ +.+.+|++|++|+++|.+++|+.|.|||||||+||+|| T Consensus 161 ~~~~~~vd~~~s~~~l~~~l~~~~~~i~VvtGFig~~~~G~~tTLGRGGSD~SAailAaaL~A~~~eIwTDVdGV~TADP 240 (810) T PRK09466 161 RAAQPQVDEGLSYPLLQQLLAQHPGKRLVVTGFISRNEAGETVLLGRNGSDYSATLIGALADVERVTIWSDVAGVYSADP 240 (810) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCEEEECCCCCEEEECCCCCCHHHHHHHHHHCCCEEEEEECCCCEECCCC T ss_conf 78653113666599999999736897699826677469997787058873069999999809977999735673153699 Q ss_pred CCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 23444444441159999855---9981409999999974996999518985479999779987824668 Q gi|254780777|r 176 RVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 176 ~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) |.+|+|++|+++||+|++|+ |++|+||++++++++++||++|.|.++| +..||+|++ T Consensus 241 riV~~A~~I~~iSY~EA~ELA~fGAkVLHPrti~Pa~~~~IPI~IknTf~P---------e~~GTlI~~ 300 (810) T PRK09466 241 RKVKDACLLPLLRLDEASELARLAAPVLHARTLQPVSGSDIDLQLRCSYTP---------EQGSTRIER 300 (810) T ss_pred CCCCCCEEECEECHHHHHHHHHCCCCCCCHHHHHHHHHCCCEEEEEECCCC---------CCCCEEEEE T ss_conf 767565111234599999998469986576789999864974999954578---------999659975 No 36 >cd04257 AAK_AK-HSDH AAK_AK-HSDH: Amino Acid Kinase Superfamily (AAK), AK-HSDH; this CD includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK - homoserine dehydrogenase (HSDH). These aspartokinases are found in bacteria (E. coli AKI-HSDHI, ThrA and E. coli AKII-HSDHII, MetL) and higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK-HSDH is an alanine-act Probab=100.00 E-value=3.5e-33 Score=224.24 Aligned_cols=214 Identities=24% Similarity=0.263 Sum_probs=153.9 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC--CHHHHHHHC---C------------------ Q ss_conf 99940036853887770899999999999999976990999823654--102334420---3------------------ Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN--IFRGSQVVA---E------------------ 66 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~--~~~~~~~~~---~------------------ 66 (242) |+||||||+.+ .+.++++++.|.+..++. ++++|+|+.. +-.+..... . T Consensus 3 V~KFGGtSv~~-------~~~i~~v~~iI~~~~~~~-~~vVVVSA~ggvTd~L~~~~~~a~~~~~~~~~~l~~i~~~h~~ 74 (294) T cd04257 3 VLKFGGTSLAN-------AERIRRVADIILNAAKQE-QVAVVVSAPGKVTDLLLELAELASSGDDAYEDILQELESKHLD 74 (294) T ss_pred EEEECCCCCCC-------HHHHHHHHHHHHHHCCCC-CEEEEECCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 99878111589-------999999999997532489-9899980999867999999999853862689999999999999 Q ss_pred ---------------------------------CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf ---------------------------------77775402467888889899999999999761886411340010024 Q gi|254780777|r 67 ---------------------------------NYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQV 113 (242) Q Consensus 67 ---------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 113 (242) ..........+.....+.+.+..++...+...|++.......+.... T Consensus 75 ~i~~l~~~~~~~~~~~~i~~~~~~l~~~l~~~~~~~~~~~~~~d~i~s~GE~lSa~l~a~~L~~~Gi~a~~ld~~~~i~t 154 (294) T cd04257 75 LITELLSGDAAAELLSALGNDLEELKDLLEGIYLLGELPDSIRAKVLSFGERLSARLLSALLNQQGLDAAWIDARELIVT 154 (294) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHEEC T ss_conf 99985350457899999999999999999777641569889999999989899999999999977998299533362112 Q ss_pred H----------HHHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCC Q ss_conf 5----------665432100001100111112465---------567776215788888641644665147543512343 Q gi|254780777|r 114 C----------EVFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSAD 174 (242) Q Consensus 114 ~----------~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~d 174 (242) . .....+......+...++++.|+. .+.+|++|++|+++|.+++|+.+.|||||+|+||+| T Consensus 155 ~~~~~~a~~~~~~t~~~~~~~~~~~~~v~Vv~GFig~~~~G~~ttLGRgGSD~TAa~ia~~L~A~~v~IwtDV~Gi~taD 234 (294) T cd04257 155 DGGYLNAVVDIELSKERIKAWFSSNGKVIVVTGFIASNPQGETTTLGRNGSDYSAAILAALLDADQVEIWTDVDGVYSAD 234 (294) T ss_pred CCCCCCEEECHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCC T ss_conf 68877535668651999999864179789976952567889751348994889999999972997899942888505279 Q ss_pred CCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 223444444441159999855---998140999999997499699951898547999977998782466 Q gi|254780777|r 175 PRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 175 P~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) |+.+|+|++|+++||+|+.++ |++|+||+++++|++++||++|.|.++| ++.||+|+ T Consensus 235 Pr~v~~a~~i~~lsy~EA~eLa~~GakVlHp~ti~p~~~~~Ipi~vrnt~~p---------~~~GT~IS 294 (294) T cd04257 235 PRKVKDARLLPSLSYQEAMELSYFGAKVLHPKTIQPVAKKNIPILIKNTFNP---------EAPGTLIS 294 (294) T ss_pred CCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCC---------CCCCEEEC T ss_conf 9768898671604899999999679831599999999987993899389998---------99981519 No 37 >TIGR01092 P5CS delta l-pyrroline-5-carboxylate synthetase; InterPro: IPR005766 delta l-pyrroline-5-carboxylate synthetase contains a glutamate 5-kinase (ProB, 2.7.2.11 from EC) region followed by a gamma-glutamyl phosphate reductase (ProA, 1.2.1.41 from EC) region and catalyses the first and second steps in proline biosynthesis.; GO: 0003824 catalytic activity, 0006561 proline biosynthetic process. Probab=100.00 E-value=1.1e-34 Score=233.41 Aligned_cols=237 Identities=19% Similarity=0.254 Sum_probs=177.4 Q ss_pred CCCC-CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC-CHHHHH----HHC---------- Q ss_conf 6888-249999940036853887770899999999999999976990999823654-102334----420---------- Q gi|254780777|r 2 SDFP-YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN-IFRGSQ----VVA---------- 65 (242) Q Consensus 2 ~~~~-~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~-~~~~~~----~~~---------- 65 (242) +.+| .||||+|+|++++|.+++.++...||..|++|+++|..+|+|+++|+||+. +++.+. ... T Consensus 2 a~~K~~KRiVVK~G~AVVTr~d~~GLALGRL~si~EQl~~L~~~G~E~~LVtSGAV~~G~QRLR~~~L~~~S~~~~~~~~ 81 (738) T TIGR01092 2 AFLKDVKRIVVKVGTAVVTRGDGRGLALGRLGSIVEQLSELNSDGYEVILVTSGAVALGRQRLRYRKLVNSSFADLQKPQ 81 (738) T ss_pred CCCCCCCEEEEEECCEEEECCCCCEEHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHCCCHHHHHHHHHHHHHHHHHCCH T ss_conf 64335547899825457877898401456789999999986028928999814412304046899999999999862420 Q ss_pred ------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHH---HHCCHHHHCCCCCCCC Q ss_conf ------3777754024678888898999999999997618864113400100245665432---1000011001111124 Q gi|254780777|r 66 ------ENYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVCEVFSCR---NAVSYLSQGKVVIFSG 136 (242) Q Consensus 66 ------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 136 (242) ......+...++ ..++..++.+|...|.++.+...+...+..+.+....+.+ .....+.+..+|+++. T Consensus 82 qnqlke~ai~~L~~~ACA---a~GQsgLMAlYe~~F~Q~~~~aAQ~LVT~~dF~D~~~R~~L~~T~~~lL~~~vvPv~N~ 158 (738) T TIGR01092 82 QNQLKEMAIVELDGKACA---AVGQSGLMALYETMFTQLDITAAQLLVTDLDFRDEQFRRQLNETVKELLRMKVVPVLNE 158 (738) T ss_pred HHHHHHHHHHHCCCCHHH---HCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEEC T ss_conf 233444212101331100---10413689999998658988888876315556686777779999999852303356517 Q ss_pred CC-------------C---CCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHH--------- Q ss_conf 65-------------5---6777621578888864164466514754351234322344444444115999--------- Q gi|254780777|r 137 GT-------------G---NAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQ--------- 191 (242) Q Consensus 137 ~~-------------~---~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e--------- 191 (242) .+ | +.+.|||.+||++|-++.+|+|++++||+|+|+.+| -.+++|+|+.--.++ T Consensus 159 NDAv~~~~~P~~D~~Ginvis~~DNDSLAA~LalElK~DLLIlLSDVeGLy~~PP-s~~~~KLI~~f~~~~~Q~~iTFG~ 237 (738) T TIGR01092 159 NDAVSTRKAPYSDSQGINVISFWDNDSLAALLALELKADLLILLSDVEGLYDGPP-SDDDSKLIDTFVKEKHQDEITFGE 237 (738) T ss_pred CCCCCCCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHEEEEECCCCCCCCCC-CCCCCCEEEEECCCCCCCCEECCC T ss_conf 8754775477543466058998336479999998774404020103022348511-378762688730675456010365 Q ss_pred --HHHCCCCCCCHHHHHHHHHCCCCEEEEECCCCCH----HHHHHCCCCCCCEECCC Q ss_conf --9855998140999999997499699951898547----99997799878246689 Q gi|254780777|r 192 --FIEKGLKVMDCASVVLARDCSIPIIVFSIHSPGG----IWGGLSGIGRSTIISGE 242 (242) Q Consensus 192 --~l~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~----i~~~l~Ge~~GT~I~~~ 242 (242) -+++|+|--|++||-+|...|+||+|+||..|.+ |.++++|+++||+++.+ T Consensus 238 KSrlG~GGM~AKVKAA~~A~~gG~~ViIa~G~~~~~sGhvI~~v~~G~~VGTlF~~~ 294 (738) T TIGR01092 238 KSRLGRGGMTAKVKAAVYAAYGGVPVIIASGTAAKNSGHVITKVVEGKKVGTLFHED 294 (738) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCEEEEECCCCCCCCCCCEEEEECCCCEEECCEECC T ss_conf 222368850278999999974891699816856533653322120573220010035 No 38 >cd04247 AAK_AK-Hom3 AAK_AK-Hom3: Amino Acid Kinase Superfamily (AAK), AK-Hom3; this CD includes the N-terminal catalytic domain of the aspartokinase HOM3, a monofunctional class enzyme found in Saccharomyces cerevisiae and other related AK domains. Aspartokinase, the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP, and in fungi, is responsible for the production of threonine, isoleucine and methionine. S. cerevisiae has a single aspartokinase isoenzyme type, which is regulated by feedback, allosteric inhibition by L-threonine. Recent studies show that the allosteric transition triggered by binding of threonine to AK involves a large change in the conformation of the native hexameric enzyme that is converted to an inactive one of different shape and substantially smaller hydrodynamic size. Probab=100.00 E-value=6.1e-33 Score=222.74 Aligned_cols=216 Identities=21% Similarity=0.230 Sum_probs=151.4 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC--------CHHHHHHHC---CC---------- Q ss_conf 999940036853887770899999999999999976990999823654--------102334420---37---------- Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN--------IFRGSQVVA---EN---------- 67 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~--------~~~~~~~~~---~~---------- 67 (242) ||.||||||+.+ ...++++.|.....+++++++|+|+.+ +.++..... .. T Consensus 3 iV~KFGGTSva~---------~~~~~~~~I~~~~~~~~~vvVVvSA~~~~~~~~g~T~~L~~~~~~a~~~~~~~~~~~~~ 73 (306) T cd04247 3 VVQKFGGTSVGK---------FPDNIADDIVKAYLKGNKVAVVCSARSTGTKAEGTTNRLLQAADEALDAQEKAFHDIVE 73 (306) T ss_pred EEEEECHHHHHH---------HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 999779140887---------99999999987762799889998899798775686999999999887445367999999 Q ss_pred -------------------------------------------CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf -------------------------------------------7775402467888889899999999999761886411 Q gi|254780777|r 68 -------------------------------------------YLLCERSTVDSMGMLSTVINALALDLALRKINVPTVI 104 (242) Q Consensus 68 -------------------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (242) ......+..+.....+.+.+..++...+...|++... T Consensus 74 ~i~~~h~~~~~~~~~~~~~~~~l~~~i~~~~~~L~~~l~~~~~l~e~~~~~~d~ils~GE~lSa~l~a~~L~~~Gi~a~~ 153 (306) T cd04247 74 DIRSDHLAAARKFIKNPELQAELEEEINKECELLRKYLEAAKILSEISPRTKDLVISTGEKLSCRFMAAVLRDRGVDAEY 153 (306) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 99999999999973087889999999999999999999988761448878888998212789999999999977999499 Q ss_pred CCCCHHHHHH------H----HHHHHHHCCHH--HHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHH Q ss_conf 3400100245------6----65432100001--100111112465---------5677762157888886416446651 Q gi|254780777|r 105 LSSIFMPQVC------E----VFSCRNAVSYL--SQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILK 163 (242) Q Consensus 105 ~~~~~~~~~~------~----~~~~~~~~~~~--~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i 163 (242) .......... . ..........+ ....++++.|+. .+.+|++|++|+++|.+++|+.+.| T Consensus 154 lDa~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~Vv~GFig~~~~G~~tTLGRgGSDySAaila~~l~A~~v~I 233 (306) T cd04247 154 VDLSHIVDLDFSIEALDQTFYDELAQVLGEKITACENRVPVVTGFFGNVPGGLLSQIGRGYTDLCAALCAVGLNADELQI 233 (306) T ss_pred ECHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCEEECCCCCEEECCCCCHHHHHHHHHHHCCCCEEEE T ss_conf 64899513667635455056899999999987741697489604125469998776068970799999999769778999 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 47543512343223444444441159999855---998140999999997499699951898547999977998782466 Q gi|254780777|r 164 GTQVDGVYSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 164 ~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) ||||||+||+||+.+|+|++|+++||+|+.++ |++|+||++++++++.+||++|.|.++| +..||+|+ T Consensus 234 wTDV~Gi~taDPr~V~~A~~i~~lsy~EA~eLa~~GakVlHP~ti~P~~~~~IPi~Irnt~~P---------~~~GT~I~ 304 (306) T cd04247 234 WKEVDGIFTADPRKVPTARLLPSITPEEAAELTYYGSEVIHPFTMEQVIKARIPIRIKNVENP---------RGEGTVIY 304 (306) T ss_pred EECCCCCCCCCCCCCCCCEECCEECHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEECCCCCC---------CCCCCEEC T ss_conf 977875454798757898365721899999999679731598999999987994898189998---------99973656 Q ss_pred CC Q ss_conf 89 Q gi|254780777|r 241 GE 242 (242) Q Consensus 241 ~~ 242 (242) +| T Consensus 305 ~d 306 (306) T cd04247 305 PD 306 (306) T ss_pred CC T ss_conf 89 No 39 >cd04258 AAK_AKiii-LysC-EC AAK_AKiii-LysC-EC: Amino Acid Kinase Superfamily (AAK), AKiii-LysC-EC: this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive aspartokinase isoenzyme AKIII. AKIII is a monofunctional class enzyme (LysC) found in some bacteria such as E. coli. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. In E. coli, LysC is reported to be a homodimer of 50 kD subunits. Probab=100.00 E-value=1.2e-32 Score=221.01 Aligned_cols=213 Identities=22% Similarity=0.219 Sum_probs=152.4 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC--CHHHHHHHCC-------------------- Q ss_conf 999940036853887770899999999999999976990999823654--1023344203-------------------- Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN--IFRGSQVVAE-------------------- 66 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~--~~~~~~~~~~-------------------- 66 (242) +|+||||||+.+ .+.+.++++.|.+ ++.+.++|+|+.. +-++...... T Consensus 2 ~V~KFGGtSv~~-------~~~i~~v~~ii~~---~~~~~vVVVSA~~gvTd~L~~~~~~~~~~~~~~~~~~~~~i~~~h 71 (292) T cd04258 2 VVAKFGGTSVAD-------YAAMLRCAAIVKS---DASVRLVVVSASAGVTNLLVALADAAESGEEIESIPQLHEIRAIH 71 (292) T ss_pred EEEEECCCCCCC-------HHHHHHHHHHHHC---CCCCEEEEEECCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 899578654488-------9999999999975---899809999489997499999999874477166899999999999 Q ss_pred --------------------------------CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf --------------------------------777754024678888898999999999997618864113400100245 Q gi|254780777|r 67 --------------------------------NYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVC 114 (242) Q Consensus 67 --------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 114 (242) .......+..+.....+...+..++...+...|++.......+..... T Consensus 72 ~~~~~~l~~~~~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~d~ils~GE~lSa~lla~~L~~~Gi~a~~~da~~~i~td 151 (292) T cd04258 72 FAILNRLGAPEELRAKLEELLEELTQLAEGAALLGELSPASRDELLSFGERMSSLLFSEALREQGVPAEWFDVRTVLRTD 151 (292) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHEEEC T ss_conf 99998724738899999999999999986554214688889999988888999999999999779985895657905746 Q ss_pred ------------HHHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCC Q ss_conf ------------665432100001100111112465---------56777621578888864164466514754351234 Q gi|254780777|r 115 ------------EVFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSA 173 (242) Q Consensus 115 ------------~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~ 173 (242) ...........++...++++.|+. .+.+|++|++|+++|.+++|+.+.+||||||+||+ T Consensus 152 ~~~~~a~v~~~~~~~~~~~~~~~~~~~~i~Vv~GFig~~~~G~~ttLGRGGSDyTA~~la~~l~A~~v~iwtDV~Gi~ta 231 (292) T cd04258 152 SRFGRAAPDLNALAELAAKLLKPLLAGTVVVTQGFIGSTEKGRTTTLGRGGSDYSAALLAEALHAEELQIWTDVAGIYTT 231 (292) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCEECCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCC T ss_conf 86653235388889999999865337989993580543799977753789616899999997699889999678878758 Q ss_pred CCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 3223444444441159999855---998140999999997499699951898547999977998782466 Q gi|254780777|r 174 DPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 174 dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) ||+.+|+|++|+++||+|+.++ |++|+|++|++++++++||++|.|.++| ++.||+|| T Consensus 232 DPr~V~~A~~i~~lsy~EA~eLa~~GakVlHp~ti~p~~~~~Ipi~I~nt~~p---------~~~GT~Is 292 (292) T cd04258 232 DPRICPAARAIKEISFAEAAEMATFGAKVLHPATLLPAIRKNIPVFVGSSKDP---------EAGGTLIT 292 (292) T ss_pred CCCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCC---------CCCCEEEC T ss_conf 99878987674836999999999779731699999999986994899378998---------99982549 No 40 >PRK09084 aspartate kinase III; Validated Probab=100.00 E-value=8.1e-33 Score=221.98 Aligned_cols=215 Identities=22% Similarity=0.225 Sum_probs=150.5 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC--CCHHHHHHHCCCC------------------ Q ss_conf 99994003685388777089999999999999997699099982365--4102334420377------------------ Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG--NIFRGSQVVAENY------------------ 68 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~--~~~~~~~~~~~~~------------------ 68 (242) ||.||||||+.+ .++++++++.|.+ +..+.++|+|+. .+-.+........ T Consensus 2 vV~KFGGtSv~~-------~~~i~~v~~ii~~---~~~~~vVVVSA~~gvTd~L~~~a~~~~~~~~~~~~~~~i~~~h~~ 71 (447) T PRK09084 2 VVAKFGGTSVAD-------FDAMNRSADIVLS---NPNTRLVVLSASSGVTNLLVALAEGAEPGDERFALLDEIRQIQYA 71 (447) T ss_pred EEEEECCCCCCC-------HHHHHHHHHHHHC---CCCCEEEEEECCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 899478556798-------9999999999972---799829999189996589999998754641389999999999999 Q ss_pred ------------------------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH---- Q ss_conf ------------------------------7754024678888898999999999997618864113400100245---- Q gi|254780777|r 69 ------------------------------LLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVC---- 114 (242) Q Consensus 69 ------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---- 114 (242) ...+....+.....+...+..++...+...|++.......+..... T Consensus 72 ~~~~l~~~~~~~~~i~~~~~~l~~l~~~~~~~~~~~~~D~lls~GE~lSa~lla~~L~~~G~~a~~ld~~~~i~td~~~~ 151 (447) T PRK09084 72 ILDRLRDPNVVREEIERLLENITVLAEAASLATSLALTDELVSHGELMSTLLFVEALRERGVQAEWFDVRKVMRTDDRFG 151 (447) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHEEECCCCC T ss_conf 99873361879999999999999999863204774189999974679999999999996799838817789467668666 Q ss_pred --------HHHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCC Q ss_conf --------665432100001100111112465---------567776215788888641644665147543512343223 Q gi|254780777|r 115 --------EVFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRV 177 (242) Q Consensus 115 --------~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~ 177 (242) .....+.....+....++++.|+. .+.+|++|++|+++|.+++|+.+.|||||||+||+|||. T Consensus 152 ~a~~~~~~~~~~~~~~~~~~~~~~v~Vv~GF~g~~~~G~~ttLGRGGSD~TAa~la~~l~A~~~~IwtDV~Gi~taDPr~ 231 (447) T PRK09084 152 RAEPDVAAIAELAQEQLLPLLEEGVVVTQGFIGSDEKGRTTTLGRGGSDYSAALLAEALNASRVEIWTDVPGIYTTDPRI 231 (447) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEECCCCC T ss_conf 51447899999999998774369868950045766999777617996068999999966984999961778336069876 Q ss_pred CCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 444444441159999855---99814099999999749969995189854799997799878246689 Q gi|254780777|r 178 HASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 178 ~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) +|+|++|+++||+|+.++ |++|+||+|++++.++|||++|.|.++| +..||+|+++ T Consensus 232 V~~A~~i~~lsy~Ea~ELa~~GAkVlHp~av~pa~~~~IPi~I~nt~~P---------~~~GT~I~~~ 290 (447) T PRK09084 232 VPAARRIDEISFEEAAEMATFGAKVLHPATLLPAVRSNIPVFVGSSKDP---------EAGGTWITND 290 (447) T ss_pred CCCCCCCCCCCHHHHHHHHHCCCHHCCHHHHHHHHHCCCEEEEECCCCC---------CCCCEEEECC T ss_conf 7665203634899999999777100175889999981975999557898---------9997089426 No 41 >PRK08373 aspartate kinase; Validated Probab=100.00 E-value=7.6e-33 Score=222.16 Aligned_cols=215 Identities=17% Similarity=0.187 Sum_probs=149.6 Q ss_pred EEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC--CHHHHHHHC------------------- Q ss_conf 49999940036853887770899999999999999976990999823654--102334420------------------- Q gi|254780777|r 7 KRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN--IFRGSQVVA------------------- 65 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~--~~~~~~~~~------------------- 65 (242) |-+|+||||||+.+. ++++++.|+...+ +.++++|+|+.. +-.+..... T Consensus 4 km~V~KFGGTSV~~a---------~~~v~~iv~~~~~-~~~vvVVVSAm~gvTd~Li~~a~~~~~~~~~I~~~h~~~a~~ 73 (359) T PRK08373 4 KMIVVKFGGSSVRYD---------FEEAVELVKSLSE-EKEVIVVVSALKGVTDCLLKYANTDKSAALEIEKIHEEFAKE 73 (359) T ss_pred CCEEEEECHHCCHHH---------HHHHHHHHHHHHC-CCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 879984281013328---------9999999998752-898899967999977999999976577789999999999998 Q ss_pred ----------------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH------------HHH Q ss_conf ----------------3777754024678888898999999999997618864113400100245------------665 Q gi|254780777|r 66 ----------------ENYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVC------------EVF 117 (242) Q Consensus 66 ----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~ 117 (242) ......+....+.....+.+.+..++...+...|++.......+..... ... T Consensus 74 l~~~~~~l~~~l~~l~~~~~~~~~~~~D~Ils~GE~lSa~Lla~~L~~~Gi~a~~lda~~ii~td~~~~~a~vd~~~s~~ 153 (359) T PRK08373 74 HGIDTEILSPYLKELFNRPDFPSEALRDYILSIGERLSAVIFAEGLKILGINGKVVDPWDILEAKGNFGNAFIDIKKSKR 153 (359) T ss_pred HCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHEEEECCCCCCEEECHHHHHH T ss_conf 36667788999999873113884576666776739999999999999779980997548958853876757754787676 Q ss_pred HHHHHCCHHHHCCCCCCCCCC--------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 432100001100111112465--------567776215788888641644665147543512343223444444441159 Q gi|254780777|r 118 SCRNAVSYLSQGKVVIFSGGT--------GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTY 189 (242) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~--------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~ 189 (242) ........++...++++.|+. .+.+|++|++|+++|.+++|+.+.|||||||+||+|||.+|+|++|+++|| T Consensus 154 ~~~~l~~~l~~~~i~Vv~GFiG~~~G~~TTLGRGGSDyTAailA~~L~A~~v~IwTDVdGI~TADPriV~~Ar~I~~lSY 233 (359) T PRK08373 154 NAKVVYEALERGKIPVVPGFYGNLNGYRVTLGRGGSDYSAVALGVLLNAKAVLIMSDVEGIYTADPKLVPSARLIPYVSY 233 (359) T ss_pred HHHHHHHHHCCCCEEEECCEEEECCCCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEECCCCCCCCCEECCCCCH T ss_conf 69999987327958993674670499264306997168999999974997899975898516089987898748882189 Q ss_pred HHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 999855---99814099999999749969995189854799997799878246689 Q gi|254780777|r 190 NQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 190 ~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) +|++++ |++|+||+++++|++ +||++|.|.++| ..||+||.| T Consensus 234 ~EA~ELA~~GAKVLHPrti~Pakk-~IPi~v~nT~~~----------~~GTLIt~e 278 (359) T PRK08373 234 DEIKIAAKLGMKALHWKAIDLVKG-YIPIIFGRTRDW----------RMGTLISNE 278 (359) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHH-CCCEEEECCCCC----------CCCEEEECC T ss_conf 999999977974168888999980-995799267978----------896478526 No 42 >cd04243 AAK_AK-HSDH-like AAK_AK-HSDH-like: Amino Acid Kinase Superfamily (AAK), AK-HSDH-like; this family includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK- homoserine dehydrogenase (HSDH). These aspartokinases are found in such bacteria as E. coli (AKI-HSDHI, ThrA and AKII-HSDHII, MetL) and in higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK- Probab=100.00 E-value=1.9e-32 Score=219.65 Aligned_cols=214 Identities=23% Similarity=0.250 Sum_probs=153.8 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC--CHHHHHHHC--------------------- Q ss_conf 999940036853887770899999999999999976990999823654--102334420--------------------- Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN--IFRGSQVVA--------------------- 65 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~--~~~~~~~~~--------------------- 65 (242) .|+|||||||.+ .+.++++++.|... .+.++++|+|+.. +-.+..... T Consensus 2 kV~KFGGtSv~~-------~~~i~~v~~iI~~~--~~~~~vVVVSA~~gvTd~L~~~~~~a~~~~~~~~~~l~~i~~~h~ 72 (293) T cd04243 2 KVLKFGGTSVAS-------AERIRRVADIIKSR--ASSPVLVVVSALGGVTNRLVALAELAASGDDAQAIVLQEIRERHL 72 (293) T ss_pred EEEEECCCCCCC-------HHHHHHHHHHHHHC--CCCCEEEEECCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 899878123699-------99999999999837--798959998199886899999999884377578999999999999 Q ss_pred ---------------------------------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHH Q ss_conf ---------------------------------37777540246788888989999999999976188641134001002 Q gi|254780777|r 66 ---------------------------------ENYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQ 112 (242) Q Consensus 66 ---------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 112 (242) ........+..+.....+...+..++...+...|++.......+... T Consensus 73 ~i~~~L~~~~~~~~~~~~i~~~~~~L~~~l~~~~~~~~~~~~~~d~ils~GE~lSa~l~a~~L~~~Gi~a~~ld~~~~i~ 152 (293) T cd04243 73 DLIKELLSGESAAELLAALDSLLERLKDLLEGIRLLGELSDKTRAEVLSFGELLSSRLMSAYLQEQGLPAAWLDARELLL 152 (293) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHHEE T ss_conf 99998527154799999999999999999876542257988999999988989999999999997899809975478420 Q ss_pred HH----------HHHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCC Q ss_conf 45----------665432100001100111112465---------56777621578888864164466514754351234 Q gi|254780777|r 113 VC----------EVFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSA 173 (242) Q Consensus 113 ~~----------~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~ 173 (242) .. .....+......+...++++.|+. .+.+|++|++|+++|.+++|+.+.+||||+|+||+ T Consensus 153 td~~~~~a~~~~~~s~~~~~~~~~~~~~v~Vv~GFig~~~~G~~ttLGRgGSD~sAaila~~l~A~~v~IwtDV~Gi~ta 232 (293) T cd04243 153 TDDGFLNAVVDLKLSKERLAQLLAEHGKVVVTQGFIASNEDGETTTLGRGGSDYSAALLAALLDAEEVEIWTDVDGVYTA 232 (293) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCCEEEEEECCCCEEEC T ss_conf 26887531007764199999986338959997496612799977872798289999999995699889998378840708 Q ss_pred CCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 3223444444441159999855---998140999999997499699951898547999977998782466 Q gi|254780777|r 174 DPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 174 dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) ||+.+|+|++|+++||+|+.++ |++|+||++++++++++||++|.|.++| +..||+|| T Consensus 233 DPr~v~~a~~i~~lsy~EA~ELa~~GakVlHp~ti~p~~~~~Ipi~i~nt~~p---------~~~GT~Is 293 (293) T cd04243 233 DPRKVPDARLLKELSYDEAMELAYFGAKVLHPRTIQPAIRKNIPIFIKNTFNP---------EAPGTLIS 293 (293) T ss_pred CCCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCEEEECCCCC---------CCCCEEEC T ss_conf 99878887564810899999999779620599999999986994899379998---------99982659 No 43 >PRK09034 aspartate kinase; Reviewed Probab=100.00 E-value=2.9e-33 Score=224.76 Aligned_cols=213 Identities=18% Similarity=0.181 Sum_probs=152.9 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC---------CHHHHHHHC--------------- Q ss_conf 99940036853887770899999999999999976990999823654---------102334420--------------- Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN---------IFRGSQVVA--------------- 65 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~---------~~~~~~~~~--------------- 65 (242) |+||||||+.+ .+.++++++.|.. +..+.+ +|+|+.. +-.+..... T Consensus 3 V~KFGGtSv~~-------~~~i~~v~~ii~~--~~~~~v-VVVSA~G~~~~~~~~vTd~L~~~~~~~~~~~~~~~~~~~i 72 (450) T PRK09034 3 VVKFGGSSLAS-------AEQFKKVLNIVKS--DPERKI-VVVSAPGKRFKEDTKVTDLLILYAEHVLNGEDYDDIFEAI 72 (450) T ss_pred EEEECCCCCCC-------HHHHHHHHHHHHC--CCCCEE-EEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 99878313899-------9999999999962--789839-9990798876775552699999999986487589999999 Q ss_pred -----------------------------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHH Q ss_conf -----------------------------377775402467888889899999999999761886411340010024566 Q gi|254780777|r 66 -----------------------------ENYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVCEV 116 (242) Q Consensus 66 -----------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 116 (242) ........+..+.....+.+.+..++...+...|++.......+....... T Consensus 73 ~~~h~~i~~~l~~~~~~~~~l~~~~~~l~~~~~~~~~~~~d~i~s~GE~~Sa~l~a~~L~~~Gi~a~~~d~~~~~~~~~~ 152 (450) T PRK09034 73 IARYAEIADELGLKDDILEKIAEILEHLANLDKEENGYLLDAFKARGEDLNAKLIAAYLNYEGIDARYVSPKEAGIIVTD 152 (450) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECHHHEEEEECC T ss_conf 99999999873566779999999999998602457467787885188999999999999968997399873450278526 Q ss_pred ---------HHHHHHCCHHHHCCCCCCCCC---------CCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCC Q ss_conf ---------543210000110011111246---------55677762157888886416446651475435123432234 Q gi|254780777|r 117 ---------FSCRNAVSYLSQGKVVIFSGG---------TGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVH 178 (242) Q Consensus 117 ---------~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~ 178 (242) .........+....++++.|+ +.+.+|++|++|+++|.+++|+.+.+||||||+||+|||.+ T Consensus 153 ~~~~a~~~~~~~~~l~~~~~~~~v~Vv~GFig~~~~G~~tTLGRGGSD~TAa~la~~l~A~~~~IwTDVdGi~taDPr~V 232 (450) T PRK09034 153 EPGGAQILPESYDKLKKLRDSDEKLVIPGFFGVTPDGQIKTFSRGGSDITGAILARGVKADLYENFTDVDGIYAANPNIV 232 (450) T ss_pred CCCCCEECHHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCCEEECCCCCC T ss_conf 77761773741999998871798799832788469997776168834899999999719988999854673160698767 Q ss_pred CCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 44444441159999855---9981409999999974996999518985479999779987824668 Q gi|254780777|r 179 ASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 179 ~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) |+|++|+++||+|+.++ |++|+||+++++++++|||++|.|.++| +..||+|++ T Consensus 233 ~~A~~i~~lsy~EA~ELa~~GAkVlHP~ti~p~~~~~IPi~I~nt~~P---------~~~GT~I~~ 289 (450) T PRK09034 233 KNPKPIEEITYREMRELSYAGFSVFHDEALIPAYRGGIPINIKNTNNP---------EDPGTLIVH 289 (450) T ss_pred CCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCHHHCCCEEEEEEECCCC---------CCCCEEEEE T ss_conf 876620527899999999758754571314521306548999727999---------999608962 No 44 >cd04245 AAK_AKiii-YclM-BS AAK_AKiii-YclM-BS: Amino Acid Kinase Superfamily (AAK), AKiii-YclM-BS; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in Bacilli (Bacillus subtilis YclM) and Clostridia species. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. In Bacillus subtilis (BS), YclM is reported to be a single polypeptide of 50 kD. The Bacillus subtilis 168 AKIII is induced by lysine and repressed by threonine, and it is synergistically inhibited by lysine and threonine. Probab=100.00 E-value=1.8e-32 Score=219.80 Aligned_cols=214 Identities=19% Similarity=0.193 Sum_probs=152.1 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC---------CHHHHHHHC----------- Q ss_conf 249999940036853887770899999999999999976990999823654---------102334420----------- Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN---------IFRGSQVVA----------- 65 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~---------~~~~~~~~~----------- 65 (242) || |+||||||+.+ .+.++++++.|.. +..+. ++|+|+.. +..+..... T Consensus 1 Mk--V~KFGGtSv~~-------~~~i~~v~~Ii~~--~~~~~-vVVVSA~g~~tn~~~gvTd~L~~~a~~~~~~~~~~~~ 68 (288) T cd04245 1 MK--VVKFGGSSLAS-------AEQFQKVKAIVKA--DPERK-IVVVSAPGKRFKDDTKVTDLLILYAEAVLAGEDTESI 68 (288) T ss_pred CE--EEEECCCCCCC-------HHHHHHHHHHHHC--CCCCE-EEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHH T ss_conf 98--99888113589-------9999999999971--78980-9999089888675214799999999987568459999 Q ss_pred ---------------------------------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHH Q ss_conf ---------------------------------37777540246788888989999999999976188641134001002 Q gi|254780777|r 66 ---------------------------------ENYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQ 112 (242) Q Consensus 66 ---------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 112 (242) ........+..+.....+.+.+..++...+...|++.......+... T Consensus 69 ~~~i~~~h~~~~~~l~~~~~~~~~l~~~l~~l~~~~~~~~~~~~d~i~s~GE~lSa~l~a~~L~~~Gi~a~~ld~~d~~i 148 (288) T cd04245 69 FEAIVDRYAEIADELGLPMSILEEIAEILENLANLDYANPDYLLDALKARGEYLNAQLMAAYLNYQGIDARYVIPKDAGL 148 (288) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCEE T ss_conf 99999999999987466477999999999999871446855777687309999999999999996899839950000217 Q ss_pred H---------HHHHHHHHHCCHHHHCCCCCCCCC---------CCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCC Q ss_conf 4---------566543210000110011111246---------5567776215788888641644665147543512343 Q gi|254780777|r 113 V---------CEVFSCRNAVSYLSQGKVVIFSGG---------TGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSAD 174 (242) Q Consensus 113 ~---------~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~d 174 (242) . .............+...++++.|+ +.+.++++|++|+++|.+++|+.+.+|||||||||+| T Consensus 149 ~~~~~~~~a~~~~~~~~~~~~~~~~~~v~Vv~GFig~~~~G~~ttLGRgGSDySAaiia~~l~A~~v~iwtDV~Gi~taD 228 (288) T cd04245 149 VVTDEPGNAQILPESYQKIKKLRDSDEKLVIPGFYGYSKNGDIKTFSRGGSDITGAILARGFQADLYENFTDVDGIYAAN 228 (288) T ss_pred EECCCCCCCEECHHHHHHHHHHHHCCCEEEEEEEEEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCEEEECC T ss_conf 84477653033275699999887059838982577753899764328983899999999975898899954798037379 Q ss_pred CCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 223444444441159999855---998140999999997499699951898547999977998782466 Q gi|254780777|r 175 PRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 175 P~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) ||.+|+|++++.+||+|+.++ |++|+||++++++.+.+||++|.|.++| +..||+|- T Consensus 229 Pr~V~~A~~i~~lsy~EA~eLa~~GakVlHP~ti~P~~~~~Ipi~irnt~~p---------~~~GT~I~ 288 (288) T cd04245 229 PRIVANPKPISEMTYREMRELSYAGFSVFHDEALIPAIEAGIPINIKNTNHP---------EAPGTLIV 288 (288) T ss_pred CCCCCCCEECCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCEEEEECCCCC---------CCCCCEEC T ss_conf 9857798687735999999999679851588999999985980899378999---------99985779 No 45 >COG1608 Predicted archaeal kinase [General function prediction only] Probab=99.98 E-value=1.3e-31 Score=214.64 Aligned_cols=231 Identities=20% Similarity=0.205 Sum_probs=178.3 Q ss_pred EEEEEEEEEEECCC-CCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCC--CCCCCHHHHHHHHHHHHH Q ss_conf 99994003685388-777089999999999999997699099982365410233442037--777540246788888989 Q gi|254780777|r 9 VLLKVSGEALAGIS-GFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAEN--YLLCERSTVDSMGMLSTV 85 (242) Q Consensus 9 IViKiGGSsL~~~~-~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~ 85 (242) +|+|||||+||+++ ++++++++++++|.+|++ ..-..+++|||||+|+|........ ......+...+....++. T Consensus 3 ~IlKlGGSvITdK~~p~t~r~~~l~ria~eI~~--~~~~~livVHGgGSFGHp~Ak~~~~~~~~~~~s~~G~~~~~~am~ 80 (252) T COG1608 3 IILKLGGSVITDKDKPRTVREDRLRRIAREISN--GKPEKLIVVHGGGSFGHPAAKEFGLEGLKNYLSPLGFSLTHLAML 80 (252) T ss_pred EEEEECCEEEECCCCCCHHHHHHHHHHHHHHHC--CCCCCEEEEECCCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHHHH T ss_conf 899955616525887321118999999999845--995438999458665678899707654446667434399999999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCHHHHH---HHHHHHHHHCCHHHHCCCCCCCCC------CCCCCCCCHHHHHHHHHHH Q ss_conf 9999999999761886411340010024---566543210000110011111246------5567776215788888641 Q gi|254780777|r 86 INALALDLALRKINVPTVILSSIFMPQV---CEVFSCRNAVSYLSQGKVVIFSGG------TGNAFLTTDSAAALRASEI 156 (242) Q Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~D~~a~~ia~~~ 156 (242) .++......+...|....+..+.+.... ............++.+.+|+..++ .+....+.|..+..+|.++ T Consensus 81 ~L~~~V~~~l~~~Gv~av~~~P~s~~~~~gr~~~~~l~~i~~~l~~gfvPvl~GDVv~d~~~g~~IiSGDdIv~~LA~~l 160 (252) T COG1608 81 ELNSIVVDALLDAGVRAVSVVPISFSTFNGRILYTYLEAIKDALEKGFVPVLYGDVVPDDDNGYEIISGDDIVLHLAKEL 160 (252) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEECCCCEEECHHHHHHHHHHCCCEEEEECCEEECCCCCEEEEECCHHHHHHHHHH T ss_conf 99999999998669853222686403317703220299999998769876233346874898548985118999999974 Q ss_pred CHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC-------CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHH Q ss_conf 644665147543512343223444444441159999855-------9981409999999974996999518985479999 Q gi|254780777|r 157 GADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIEK-------GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGG 229 (242) Q Consensus 157 ~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~-------G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~ 229 (242) .+|+++++|||||||+.||+++|+++.++++.....++. |+|.-|++++....+++-++||+||++|+||.++ T Consensus 161 ~pd~v~f~tdVdGVy~~~p~~~p~~~~l~~i~~~~~~~gs~~~DVTGGi~~Kl~~~~~~~~~~~~vyi~ng~~~~ni~~~ 240 (252) T COG1608 161 KPDRVIFLTDVDGVYDRDPGKVPDARLLSEIEGRVALGGSGGTDVTGGIAKKLEALLEIARYGKEVYIFNGNKPENIYRA 240 (252) T ss_pred CCCEEEEEECCCCEECCCCCCCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHH T ss_conf 98779998457823568987486620001443234306867621313399999999998735863899878777789999 Q ss_pred HCCCCCCCEECC Q ss_conf 779987824668 Q gi|254780777|r 230 LSGIGRSTIISG 241 (242) Q Consensus 230 l~Ge~~GT~I~~ 241 (242) |+|+.+||.|++ T Consensus 241 l~G~~vGT~I~~ 252 (252) T COG1608 241 LRGENVGTRIDG 252 (252) T ss_pred HCCCCCCEEECC T ss_conf 658877548469 No 46 >pfam00696 AA_kinase Amino acid kinase family. This family includes kinases that phosphorylate a variety of amino acid substrates, as well as uridylate kinase and carbamate kinase. This family includes: Aspartokinase EC:2.7.2.4. Acetylglutamate kinase EC:2.7.2.8. Glutamate 5-kinase EC:2.7.2.11. Uridylate kinase EC:2.7.4.-. Carbamate kinase EC:2.7.2.2. Probab=99.97 E-value=9.4e-31 Score=209.26 Aligned_cols=206 Identities=25% Similarity=0.263 Sum_probs=144.4 Q ss_pred EEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHH----HH Q ss_conf 49999940036853887770899999999999999976990999823654102334420377775402467888----88 Q gi|254780777|r 7 KRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMG----ML 82 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~----~~ 82 (242) ||||+|||||+|++ .+.+++++++|+++.+.|++++||||||.+.+.+............+...... .. T Consensus 1 k~iViK~GGs~l~~-------~~~i~~~~~~I~~l~~~g~~vvvV~ggg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (230) T pfam00696 1 KRIVIKLGGSSLTD-------EDAIKRIAEEIALLSELGIKVVVVSGGGGFTDKLLAAYGIAEKIGLRVTAGATGLIIEA 73 (230) T ss_pred CEEEEEECHHHCCC-------HHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHH T ss_conf 90999958567789-------89999999999999968994999979858887999981777657515669999999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH---HHHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHH Q ss_conf 98999999999997618864113400100245---665432100001100111112465---------567776215788 Q gi|254780777|r 83 STVINALALDLALRKINVPTVILSSIFMPQVC---EVFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAA 150 (242) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~ 150 (242) ........+...+...+.+............. ...........++...+|++.++. +..++++|++|+ T Consensus 74 ~~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~l~~~~ipVi~g~~~~~~~~~~~~l~~~~sD~~Aa 153 (230) T pfam00696 74 ALAGLLDIVVSAGERLGARAVALLLSDGGIGAVRLDANDTEAIKELLEEGVVPVITGFGGENDTGETTTLGRGSSDTLAA 153 (230) T ss_pred HHHHHHHHHHHHHHHCCCHHHHCCCCCCCCCEEEEEHHHHHHHHHHHHCCCCEEECCCCEECCCCCCCCCCCCCCHHHHH T ss_conf 99999999999999829730132565134551688452599999999769925524764247988723134877108899 Q ss_pred HHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC-------CCCC-CCHHHHHHHHHCCCCEEEEE Q ss_conf 888641644665147543512343223444444441159999855-------9981-40999999997499699951 Q gi|254780777|r 151 LRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIEK-------GLKV-MDCASVVLARDCSIPIIVFS 219 (242) Q Consensus 151 ~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~-------G~~v-~~~~A~~~a~~~gI~v~I~n 219 (242) ++|.+++|+.|+++|||+||||+||+.+|+|++|++++++|+.++ |+|. +++.|.+++++.++|++|+| T Consensus 154 ~lA~~l~A~~lii~tdV~Gvy~~dP~~~~~a~~i~~i~~~e~~~l~~~~~~~gGm~~k~~~a~~~~~~~~i~v~I~n 230 (230) T pfam00696 154 LLAEALGADKLIILTDVDGVYTADPRKVPDAKLIPELSYDEAEELLAAGDGTGGMKVKDPAALKAARRGGIPVVIVN 230 (230) T ss_pred HHHHHCCCCEEEEEECCCCEECCCCCCCCCCEECCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEEC T ss_conf 99985598678986157813637998799873855767999998876597849969999999999997898699949 No 47 >cd04255 AAK_UMPK-MosAB AAK_UMPK-MosAB: This CD includes the alpha and beta subunits of the Mo storage protein (MosA and MosB) which are related to uridine monophosphate kinase (UMPK) enzymes that catalyze the phosphorylation of UMP by ATP, yielding UDP, and playing a key role in pyrimidine nucleotide biosynthesis. The Mo storage protein from the nitrogen-fixing bacterium, Azotobacter vinelandii, is characterized as an alpha4-beta4 octamer containing a polynuclear molybdenum-oxide cluster which is ATP-dependent to bind Mo and pH-dependent to release Mo. These and related bacterial sequences in this CD are members of the Amino Acid Kinase Superfamily (AAK). Probab=99.97 E-value=2.1e-29 Score=200.88 Aligned_cols=221 Identities=19% Similarity=0.207 Sum_probs=166.7 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 99994003685388777089999999999999997699099982365410233442037777540246788888989999 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTVINA 88 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (242) -|+||||.|++|.. .+.+-.+.++|.+++++ |+++|++|||..+|.....+.... .+.-.+...+......+. T Consensus 33 ~VvKIGGqsi~DrG-----~~av~Plv~Ei~~~~~~-hk~li~TGgG~RaRHiy~igldLg-mPtGvLa~L~~~vs~QNa 105 (262) T cd04255 33 NVVKIGGQSIIDRG-----AEAVLPLVEEIVALRPE-HKLLILTGGGTRARHVYSIGLDLG-MPTGVLAKLGASVSEQNA 105 (262) T ss_pred EEEEECCEEEECCC-----HHHHHHHHHHHHHHHHH-CCEEEEECCCHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHH T ss_conf 28998780300373-----34243289999975131-528998367500334312341479-961156776422446479 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHHHHHHH-----HHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHHH Q ss_conf 999999976188641134001002456654-----321000011001111124655677762157888886416446651 Q gi|254780777|r 89 LALDLALRKINVPTVILSSIFMPQVCEVFS-----CRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEIGADVILK 163 (242) Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~i 163 (242) ..+...+..+|....-..... ....... ...-.+....+..+. +..-++...+|+.|.++|..++|+.|++ T Consensus 106 ~ml~~LLa~~G~~~i~~~~~~--~l~~~l~~~~~~v~~g~PPY~~~e~p~--~~g~IPphRTDtGAfLlAe~iGa~~li~ 181 (262) T cd04255 106 EMLATLLAKHGGSKVGHGDLL--QLPTFLKAGRAPVISGMPPYGLWEHPA--EEGRIPPHRTDVGAFLLAEVIGARNLIF 181 (262) T ss_pred HHHHHHHCCCCCEEECCCCHH--HHHHHHHCCCEEEEECCCCCCCCCCCC--CCCCCCCCCCCHHHHHHHHHHCCCEEEE T ss_conf 999998431387025787789--999998658710442799975454786--5687899754389999999848753799 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCC--CCCCCHHHHHHHHH--CCCCEEEEECCCCCHHHHHHCCCCCCCEE Q ss_conf 475435123432234444444411599998559--98140999999997--49969995189854799997799878246 Q gi|254780777|r 164 GTQVDGVYSADPRVHASSTRFDSLTYNQFIEKG--LKVMDCASVVLARD--CSIPIIVFSIHSPGGIWGGLSGIGRSTII 239 (242) Q Consensus 164 ~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~G--~~v~~~~A~~~a~~--~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I 239 (242) .+||||+||+||+++++|++++++|++|++.++ -++++..+.++.+. +--++.|+||.+|+||.++++||++||+| T Consensus 182 vkdVDGvYtaDPk~~~~A~~i~~iS~~Ell~~~L~dlv~er~~l~ll~~a~~i~~VqVvNgl~PGnitrAL~GEhVGTiI 261 (262) T cd04255 182 VKDEDGLYTADPKKNKKAEFIPEISAAELLKKDLDDLVLERPVLDLLQNARHVKEVQIVNGLVPGNLTRALRGEHVGTII 261 (262) T ss_pred EECCCCCCCCCCCCCCCCEECCCCCHHHHHHCCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHCCCCCCCEE T ss_conf 96577776799999976534253779999863476430059999999861135517998078853799987268664100 Q ss_pred C Q ss_conf 6 Q gi|254780777|r 240 S 240 (242) Q Consensus 240 ~ 240 (242) + T Consensus 262 r 262 (262) T cd04255 262 R 262 (262) T ss_pred C T ss_conf 9 No 48 >cd04248 AAK_AK-Ectoine AAK_AK-Ectoine: Amino Acid Kinase Superfamily (AAK), AK-Ectoine; this CD includes the N-terminal catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway found in Methylomicrobium alcaliphilum, Vibrio cholerae, and other various halotolerant or halophilic bacteria. Bacteria exposed to hyperosmotic stress accumulate organic solutes called 'compatible solutes' of which ectoine, a heterocyclic amino acid, is one. Apart from its osmotic function, ectoine also exhibits a protective effect on proteins, nucleic acids and membranes against a variety of stress factors. de novo synthesis of ectoine starts with the phosphorylation of L-aspartate and shares its first two enzymatic steps with the biosynthesis of amino acids of the aspartate family: aspartokinase and L-aspartate-semialdehyde dehydrogenase. The M. alcaliphilum and the V. cholerae aspartokinases are encoded on the ectABCask operon. Probab=99.95 E-value=8.8e-28 Score=190.93 Aligned_cols=93 Identities=17% Similarity=0.174 Sum_probs=87.9 Q ss_pred CCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCC--CCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCC Q ss_conf 5567776215788888641644665147543512343223444--444441159999855---99814099999999749 Q gi|254780777|r 138 TGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHAS--STRFDSLTYNQFIEK---GLKVMDCASVVLARDCS 212 (242) Q Consensus 138 ~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~--ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~g 212 (242) +.+.+|++|++|+++|.+++|+.+.+||||+ +||+|||.+|+ |++|+++||+|+.|+ |++|+||++++++.+++ T Consensus 207 tTLGRgGSDySAailA~~l~A~~~~IWtDV~-v~tADPr~V~~~~a~~i~~lSy~EA~ELayfGAkVlHP~Ti~P~~~~~ 285 (304) T cd04248 207 REFDRGYSEMTFSRIAVLTGASEAIIHKEFH-LSSADPKLVGEDKARPIGRTNYDVADQLANLGMEAIHPKAAKGLRQAG 285 (304) T ss_pred EEECCCCCHHHHHHHHHHCCCCEEEEEECCC-EECCCCCCCCCCCCEEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHCC T ss_conf 6507997178999999976999899985066-616897957877663606479999999997797215989999999869 Q ss_pred CCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 9699951898547999977998782466 Q gi|254780777|r 213 IPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 213 I~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) ||++|.|.++| +..||+|| T Consensus 286 IPi~IkNt~~P---------~~~GT~Is 304 (304) T cd04248 286 IPLRVKNTFEP---------DHPGTLIT 304 (304) T ss_pred CCEEEECCCCC---------CCCCCEEC T ss_conf 98898068998---------99983439 No 49 >PRK09181 aspartate kinase; Validated Probab=99.95 E-value=1.9e-27 Score=188.84 Aligned_cols=95 Identities=17% Similarity=0.178 Sum_probs=88.1 Q ss_pred CCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCC--CCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCC Q ss_conf 55677762157888886416446651475435123432234--44444441159999855---99814099999999749 Q gi|254780777|r 138 TGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVH--ASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCS 212 (242) Q Consensus 138 ~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~--~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~g 212 (242) +.+.+|++|++|+++|.+++|+.+.+||||+ +||+||+.. +++++|+++||+|+.++ |++|+||++++++++++ T Consensus 214 ~tlgRggSD~ta~~ia~~l~a~~~~i~~dv~-v~taDP~~v~~~~~~~i~~~sy~ea~ela~~Ga~vlhp~~~~p~~~~~ 292 (476) T PRK09181 214 RTFDRGYSEMTFSRIAVLTGADEAIIHKEYH-LSSADPKLVGEDKVVPIGRTNYDVADQLANLGMEAIHPKAAKGLRQAG 292 (476) T ss_pred EECCCCCHHHHHHHHHHHHCCCEEEEEEECC-EECCCCCCCCCCCCEECCCCCHHHHHHHHHCCCCCCCCHHHHHHHHCC T ss_conf 4358884599999999984898999996365-312798843654344735268999999986598640806789999759 Q ss_pred CCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 969995189854799997799878246689 Q gi|254780777|r 213 IPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 213 I~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) ||++|.|.++| +..||+|+++ T Consensus 293 Ipi~v~nt~~p---------~~~GT~I~~~ 313 (476) T PRK09181 293 IPLRIKNTFEP---------EHPGTLITKD 313 (476) T ss_pred CCEEEEECCCC---------CCCCEEEECC T ss_conf 96899706899---------9997599468 No 50 >PRK00942 acetylglutamate kinase; Provisional Probab=99.90 E-value=1.2e-22 Score=159.35 Aligned_cols=219 Identities=18% Similarity=0.193 Sum_probs=138.9 Q ss_pred EEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHH-HHHHCCCCC------CCCHHHHHHHH Q ss_conf 99999400368538877708999999999999999769909998236541023-344203777------75402467888 Q gi|254780777|r 8 RVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRG-SQVVAENYL------LCERSTVDSMG 80 (242) Q Consensus 8 rIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~-~~~~~~~~~------~~~~~~~~~~~ 80 (242) .|||||||++|+++ +.+++++++|+.+++.|.+++||||||..... ....+.... .-+..... .. T Consensus 1 TiVIkiGG~~l~~~-------~~~~~~~~dIa~L~~~G~~vVlVHGgg~qi~~~l~~~gi~~~~~~g~RvT~~~~l~-~~ 72 (262) T PRK00942 1 TIVIKYGGNAMTDE-------ELKEAFARDIVLLKQVGINPVVVHGGGPQIDELLKKLGIESEFVNGLRVTDKETME-VV 72 (262) T ss_pred CEEEEECHHHHCCH-------HHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHH-HH T ss_conf 99999885884498-------79999999999999889919999388299999999869972304775466189999-99 Q ss_pred HHHHH-HHHHHHHHHHHHHCCCCCCCCCC--------HHHHH-------------HHHHHHHHHCCHHHHCCCCCCCC-- Q ss_conf 88989-99999999997618864113400--------10024-------------56654321000011001111124-- Q gi|254780777|r 81 MLSTV-INALALDLALRKINVPTVILSSI--------FMPQV-------------CEVFSCRNAVSYLSQGKVVIFSG-- 136 (242) Q Consensus 81 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~-------------~~~~~~~~~~~~~~~~~~~~~~~-- 136 (242) ..... .....+...+...+......+.. ..... ....+.......++.+.+|+... T Consensus 73 ~~~l~g~i~~~l~~~l~~~g~~avglsg~d~~~v~a~~~~~~~~~~~~d~g~vG~v~~v~~~~I~~ll~~g~IpVispig 152 (262) T PRK00942 73 EMVLAGKVNKELVSLINKHGGKAVGLSGKDGGLVTARKLDENGKEKIIDLGYVGEVEEVNPALLETLLEAGYIPVISPIG 152 (262) T ss_pred HHHHHCHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEEECHHHHHHHHHCCCEEEECCCC T ss_conf 99981719999999998558742314566787589998644566545667620461666689999998479757962641 Q ss_pred ---CCCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH-------CCCCCCCHHHHH Q ss_conf ---6556777621578888864164466514754351234322344444444115999985-------599814099999 Q gi|254780777|r 137 ---GTGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE-------KGLKVMDCASVV 206 (242) Q Consensus 137 ---~~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~-------~G~~v~~~~A~~ 206 (242) ..+....+.|.+|+.+|..++||.|+++||++|+|+. +++++++++..|+.. .|+|..|+.|+. T Consensus 153 ~~~~G~~~ninaD~~Aa~iA~~l~AdkLi~ltdv~Gv~~~------~g~~i~~i~~~e~~~l~~~~~~~ggM~pKl~aa~ 226 (262) T PRK00942 153 VGEDGETYNVNADTAAGAIAAALGAEKLILLTDVPGVLDD------KGSLISELTAEEAEELIEDGVITGGMIPKVEAAL 226 (262) T ss_pred CCCCCCEECCCHHHHHHHHHHHHCCCEEEEEECCCCEECC------CCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 8898878557799999999998467949999588761578------8751431069999999865976896099999999 Q ss_pred HHHHCCCC-EEEEECCCCCH-HHHHHCCCCCCCEEC Q ss_conf 99974996-99951898547-999977998782466 Q gi|254780777|r 207 LARDCSIP-IIVFSIHSPGG-IWGGLSGIGRSTIIS 240 (242) Q Consensus 207 ~a~~~gI~-v~I~ng~~~~~-i~~~l~Ge~~GT~I~ 240 (242) .|.++|++ ++|.||+.|+. +...+.++..||+|. T Consensus 227 ~a~~~Gv~rv~I~~g~~~~~ll~elft~~G~GT~i~ 262 (262) T PRK00942 227 EAARSGVRSVHIIDGRVPHSLLLELFTDEGIGTMIV 262 (262) T ss_pred HHHHCCCCEEEECCCCCCCHHHHHHHCCCCCCCEEC T ss_conf 999819998999369998779999817999852569 No 51 >TIGR02078 AspKin_pair Pyrococcus aspartate kinase subunit, putative; InterPro: IPR011819 This family consists of proteins restricted to and found as paralogous pairs (typically close together) in species of Pyrococcus, a hyperthermophilic archaeal genus. Members are always found close to other genes of threonine biosynthesis and appear to represent the Pyrococcal form of aspartate kinase. Alignment to aspartokinase III from Escherichia coli shows that 300 N-terminal and 20 C-terminal amino acids are homologous, but the form in Pyrococcus lacks ~100 amino acids in between.. Probab=99.90 E-value=3.7e-23 Score=162.42 Aligned_cols=213 Identities=15% Similarity=0.211 Sum_probs=158.3 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC--CCHH--------------------------- Q ss_conf 99994003685388777089999999999999997699099982365--4102--------------------------- Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG--NIFR--------------------------- 59 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~--~~~~--------------------------- 59 (242) ||+||||||+-.+ .+.-.+-++.+.++ .++++|+|+- .+.. T Consensus 2 iV~KFGGsSvr~~---------F~eA~~L~~~L~E~-~~v~VVVSALkGiTD~L~~ya~t~D~~~~~~i~~~y~~f~~~~ 71 (341) T TIGR02078 2 IVVKFGGSSVRYD---------FEEALELVKSLSEE-KEVVVVVSALKGITDELIRYANTFDKKAAVKIEEIYVEFAKEL 71 (341) T ss_pred EEEEECCCCHHHH---------HHHHHHHHHHHCCC-CCEEEEEECCCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHC T ss_conf 6898568413666---------89999999985169-8689998534442579998731104688999999999877750 Q ss_pred ----------HHHHHCCCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHH------------HHHHHH Q ss_conf ----------3344203777754-02467888889899999999999761886411340010------------024566 Q gi|254780777|r 60 ----------GSQVVAENYLLCE-RSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFM------------PQVCEV 116 (242) Q Consensus 60 ----------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~ 116 (242) ........+..++ .+-.+...+.+.+.++.+.+..+...++......+.+. +..... T Consensus 72 gvd~~~l~~~l~~~~n~~~dlP~keAL~D~I~S~GE~lSA~ifaeGL~~L~i~g~~v~p~d~~~A~G~FGnafidik~Sk 151 (341) T TIGR02078 72 GVDLNILSPYLKELFNPKPDLPPKEALRDYILSLGELLSAVIFAEGLKLLGINGKVVDPWDLLVAKGEFGNAFIDIKKSK 151 (341) T ss_pred CCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCEEECCHHHEEEECCCCCEEEEHHHHH T ss_conf 88622334678876236778781122356677775789999997013531203227652666277436465477464655 Q ss_pred HHHHHHCCHHHHCCCCCCCCCCC--------CCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 54321000011001111124655--------6777621578888864164466514754351234322344444444115 Q gi|254780777|r 117 FSCRNAVSYLSQGKVVIFSGGTG--------NAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLT 188 (242) Q Consensus 117 ~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is 188 (242) .........++.+.+|+++|..+ ..+|++|++|..+|.++++.++.|++||+|+||+|||.+|-|++|+.+| T Consensus 152 ~~~k~~~~~l~~G~iPvipGF~gnlnGyrvTlGRGGSDYSAvalg~l~~s~~v~IMSdVEGIfTADPKLvPsArLIpY~S 231 (341) T TIGR02078 152 RNAKILYEVLESGKIPVIPGFYGNLNGYRVTLGRGGSDYSAVALGVLLNSKLVAIMSDVEGIFTADPKLVPSARLIPYLS 231 (341) T ss_pred HHHHHHHHHHHCCCEEEECCCCCCCCCEEEEECCCCCHHHHHHHHHHHCCEEEEEEECCCEEEECCCCCCCCHHCCCCCC T ss_conf 57778898886583022146206888448986478622799999987512268997345600435866375001033334 Q ss_pred HHHHH---HCCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 99998---559981409999999974996999518985479999779987824668 Q gi|254780777|r 189 YNQFI---EKGLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 189 ~~e~l---~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) |+|++ ..|-+.+|-+|++++++..||+..=.+++ =+-||+|+. T Consensus 232 Y~Ei~iAaK~GMkAl~wkA~~l~~e~~ip~lfGrt~d----------wr~GT~~s~ 277 (341) T TIGR02078 232 YEEILIAAKLGMKALQWKAADLAKEYKIPVLFGRTRD----------WRMGTVVSN 277 (341) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCC----------CEEEEEEEC T ss_conf 7799997452057776652000157543587415466----------401267762 No 52 >cd04240 AAK_UC AAK_UC: Uncharacterized (UC) amino acid kinase-like proteins found mainly in archaea and a few bacteria. Sequences in this CD are members of the Amino Acid Kinase (AAK) superfamily. Probab=99.90 E-value=1.4e-22 Score=158.79 Aligned_cols=199 Identities=21% Similarity=0.233 Sum_probs=136.9 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99940036853887770899999999999999976990999823654102334420377775402467888889899999 Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTVINAL 89 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (242) |+|||||.+.+ ..++.+.++.+ .+.+++||.|||.+++.+....+... .+...++++...+|.+..+ T Consensus 1 VvKiGGSL~~~----------~~~ll~~l~~~--~~~~v~IVpGGG~fAd~vR~~~~~~~-~~d~~aH~mAi~Am~~~a~ 67 (203) T cd04240 1 VVKIGGSLIRE----------AVRLLRWLKTL--SGGGVVIVPGGGPFADVVRRYQERKG-LSDAAAHWMAILAMEQYGY 67 (203) T ss_pred CEEEEEECCCC----------HHHHHHHHHHH--CCCCEEEEECCHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHH T ss_conf 97982056466----------89999999983--59978999798089999999999829-9878999999999999999 Q ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCC---CCCCHHHHHHHHHHHCHHHHHHCCC Q ss_conf 9999997618864113400100245665432100001100111112465567---7762157888886416446651475 Q gi|254780777|r 90 ALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNA---FLTTDSAAALRASEIGADVILKGTQ 166 (242) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~D~~a~~ia~~~~ad~l~i~td 166 (242) +++......- ...... ..+. ..+.....+....... ...+++ ..++|+.|+.+|..++|+.|++.|| T Consensus 68 ll~~~~~~~~----~~~~~~---~~~~-~~~~~~~v~lP~~~l~--~~~~lp~sWdvtsDs~a~~~A~~l~a~~li~~~~ 137 (203) T cd04240 68 LLADLEPRLV----ARTLAE---LTDV-LERGKIAILLPYRLLL--DTDPLPHSWEVTSDSIAAWLAKKLGAKRLVIVTD 137 (203) T ss_pred HHHHHCCHHH----CCCHHH---HHHH-HHCCCCEEEECHHHHC--CCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEC T ss_conf 9996013101----046477---9999-7338915871158751--5898887877661899999999759885899856 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCC-CCCCEE Q ss_conf 4351234322344444444115999985599814099999999749969995189854799997799-878246 Q gi|254780777|r 167 VDGVYSADPRVHASSTRFDSLTYNQFIEKGLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGI-GRSTII 239 (242) Q Consensus 167 vdGiyt~dP~~~~~ak~i~~is~~e~l~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge-~~GT~I 239 (242) |||+|++| +|++++++..|+. +..++|+.+.+++.+++++++|+||+.|+++.++|+|+ .+||+| T Consensus 138 vdGiy~~~------~kl~~~i~~~~l~--~~~~VD~~~~~~~~~~~i~~~VvnG~~p~~i~~~l~g~~~~GT~I 203 (203) T cd04240 138 VDGIYEKD------GKLVNEIAAAELL--GETSVDPAFPRLLTKYGIRCYVVNGDDPERVLAALRGREGVGTRI 203 (203) T ss_pred CCCCCCCC------HHHHHHCCHHHHC--CCCCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHCCCCCCCCCC T ss_conf 77551875------0554115898864--887410778999997696399987998799999986898798579 No 53 >cd04238 AAK_NAGK-like AAK_NAGK-like: N-Acetyl-L-glutamate kinase (NAGK)-like . Included in this CD are the Escherichia coli and Pseudomonas aeruginosa type NAGKs which catalyze the phosphorylation of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in bacteria and photosynthetic organisms using either the acetylated, noncyclic (NC), or non-acetylated, cyclic (C) route of ornithine biosynthesis. Also included in this CD is a distinct group of uncharacterized (UC) bacterial and archeal NAGKs. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK). Probab=99.89 E-value=2.3e-22 Score=157.58 Aligned_cols=217 Identities=18% Similarity=0.146 Sum_probs=139.1 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHH-HHCCCCC------CCCHHHHHHHHH Q ss_conf 999940036853887770899999999999999976990999823654102334-4203777------754024678888 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQ-VVAENYL------LCERSTVDSMGM 81 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~-~~~~~~~------~~~~~~~~~~~~ 81 (242) ||||+|||+|+++ +.+.+++++|+.++..|.+++|||||+....... ..+.... .-+..... ... T Consensus 1 IVIKlGG~~l~d~-------~~~~~~~~dIa~L~~~G~~vVlVHGgg~qi~~~l~~~gi~~~~~~g~RvTd~~~l~-~~~ 72 (256) T cd04238 1 VVIKYGGSAMKDE-------ELKEAFADDIVLLKQVGINPVIVHGGGPEINELLKRLGIESEFVNGLRVTDKETME-IVE 72 (256) T ss_pred CEEEECHHHHCCH-------HHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHCCCCCEECCCCCCCCHHHHH-HHH T ss_conf 9899786894598-------79999999999999889929999188299999999869983502673146187899-999 Q ss_pred HHH-HHHHHHHHHHHHHHCCCCCCCCCCH--------H-----HH----HHHHHHHHHHCCHHHHCCCCCCCCC-----C Q ss_conf 898-9999999999976188641134001--------0-----02----4566543210000110011111246-----5 Q gi|254780777|r 82 LST-VINALALDLALRKINVPTVILSSIF--------M-----PQ----VCEVFSCRNAVSYLSQGKVVIFSGG-----T 138 (242) Q Consensus 82 ~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--------~-----~~----~~~~~~~~~~~~~~~~~~~~~~~~~-----~ 138 (242) ... -.....+...+...+.+....+..+ . +. .....+.......++.+.+|+.... . T Consensus 73 ~~l~g~in~~l~~~l~~~g~~aiglsg~d~~~i~a~~~~~~~~d~g~vG~v~~v~~~~i~~lL~~g~IPVi~pi~~~~~G 152 (256) T cd04238 73 MVLAGKVNKELVSLLNRAGGKAVGLSGKDGGLIKAEKKEEKDIDLGFVGEVTEVNPELLETLLEAGYIPVIAPIAVDEDG 152 (256) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEEECEEECCCC T ss_conf 99853389999999997799721214655877999975564434342455443558999999847994476041356888 Q ss_pred CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH-------CCCCCCCHHHHHHHHHC Q ss_conf 56777621578888864164466514754351234322344444444115999985-------59981409999999974 Q gi|254780777|r 139 GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE-------KGLKVMDCASVVLARDC 211 (242) Q Consensus 139 ~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~-------~G~~v~~~~A~~~a~~~ 211 (242) +....+.|..|+.+|.+++||.|+++||++|+|+.+ ++++++++.+|+.+ .|+|..|+.|+..|.++ T Consensus 153 ~~~ninaD~~Aa~lA~~l~AdkLi~ltdv~Gv~~~~------g~~i~~lt~~e~~~l~~~~~~~gGM~~Kl~aa~~a~~~ 226 (256) T cd04238 153 ETYNVNADTAAGAIAAALKAEKLILLTDVPGVLDDP------GSLISELTPKEAEELIEDGVISGGMIPKVEAALEALEG 226 (256) T ss_pred CEEEECHHHHHHHHHHHHCCCCEEEECCCCCEECCC------CCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHC T ss_conf 689637999999999972801089852888615688------75234127999999997696689539999999999984 Q ss_pred CC-CEEEEECCCCCHHHHHHC-CCCCCCEE Q ss_conf 99-699951898547999977-99878246 Q gi|254780777|r 212 SI-PIIVFSIHSPGGIWGGLS-GIGRSTII 239 (242) Q Consensus 212 gI-~v~I~ng~~~~~i~~~l~-Ge~~GT~I 239 (242) |+ .++|.||+.|+.|...|- .+..||+| T Consensus 227 Gv~rv~Ii~g~~~~~Ll~eLft~~G~GT~I 256 (256) T cd04238 227 GVRKVHIIDGRVPHSLLLELFTDEGIGTMI 256 (256) T ss_pred CCCEEEEECCCCCCHHHHHHCCCCCCCCCC T ss_conf 999799937999877999973799985879 No 54 >KOG0456 consensus Probab=99.88 E-value=4.8e-22 Score=155.53 Aligned_cols=217 Identities=18% Similarity=0.190 Sum_probs=142.3 Q ss_pred EEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHH------HHH----------------- Q ss_conf 999994003685388777089999999999999997699099982365410233------442----------------- Q gi|254780777|r 8 RVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGS------QVV----------------- 64 (242) Q Consensus 8 rIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~------~~~----------------- 64 (242) .+|.||||||+..- ++.-.++ +..+.-...++++|.|+.+..... ... T Consensus 80 ~~V~KFGGsSV~s~-------~~~i~v~--~l~~~~~~e~~~vV~SA~sk~Tna~~ta~~~~~~c~va~~~sie~l~iIk 150 (559) T KOG0456 80 TCVMKFGGSSVGSA-------ERMIEVA--VLILYFPDERPVVVLSAMSKTTNALLTAGEKAVCCGVANVESIEELSIIK 150 (559) T ss_pred EEEEECCCCCCCCC-------CHHHHHH--HHHHHCCCCCEEEEEECCCCCHHHHHHHHHHEECCCCCCCCHHHHHHHHH T ss_conf 89994288411334-------0013356--78874687773899971344125453310020001656743478999999 Q ss_pred ------------------------------CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHH Q ss_conf ------------------------------03777754024678888898999999999997618864113400100245 Q gi|254780777|r 65 ------------------------------AENYLLCERSTVDSMGMLSTVINALALDLALRKINVPTVILSSIFMPQVC 114 (242) Q Consensus 65 ------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 114 (242) .........++.+.....+..++..+++..+.+.|.....+...+..... T Consensus 151 e~Hi~ta~e~~~d~~v~~~~le~leq~Lk~i~mm~Elt~RTrD~lvs~GE~lS~rf~aA~lnd~G~kar~~D~~~I~~~~ 230 (559) T KOG0456 151 ELHIRTAHELIVDPAVIAKLLEGLEQLLKGIAMMKELTLRTRDYLVSFGECLSTRFFAAYLNDIGHKARQYDAFEIGFIT 230 (559) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECHHHEECCC T ss_conf 98776698854488999999999999999988987615245667654346788999999987328532351143420332 Q ss_pred ----------HHHHHH----HHCCHHHHCCCCCCCCC----------CCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCC Q ss_conf ----------665432----10000110011111246----------556777621578888864164466514754351 Q gi|254780777|r 115 ----------EVFSCR----NAVSYLSQGKVVIFSGG----------TGNAFLTTDSAAALRASEIGADVILKGTQVDGV 170 (242) Q Consensus 115 ----------~~~~~~----~~~~~~~~~~~~~~~~~----------~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGi 170 (242) +..... ....-.....+|++.++ +.+.+|.+|..|+.++.++++|.+..|.||||+ T Consensus 231 ~d~~t~~d~~~a~~~av~k~~~~~~aken~VPVvTGf~Gk~~~tg~lt~lGRG~sDl~At~i~~al~~~EiQVWKdVDGv 310 (559) T KOG0456 231 TDDFTNDDILEATYPAVSKLLSGDWAKENAVPVVTGFLGKGWPTGALTTLGRGGSDLTATTIGKALGLDEIQVWKDVDGV 310 (559) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCCCCCCCCEECCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCE T ss_conf 43334324788878999875243101377663686003367656200035677216689999887181454213234752 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 2343223444444441159999855---99814099999999749969995189854799997799878246689 Q gi|254780777|r 171 YSADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 171 yt~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) +|+||+++|.|++++.++++|+.+. |+.|+||...+.+.+..||++|.|..+| ..+||+|+++ T Consensus 311 ~T~DP~~~p~Ar~vp~lT~dEAaELaYfGaqVlHP~sM~~~~~~~IPvRvKN~~NP---------~~~GTvI~~d 376 (559) T KOG0456 311 LTCDPRIYPGARLVPYLTFDEAAELAYFGAQVLHPFSMRPAREGRIPVRVKNSYNP---------TAPGTVITPD 376 (559) T ss_pred EECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHCCCCCEEEECCCCC---------CCCCEEECCC T ss_conf 74388647775006865877877877521121056333411306762586337799---------9994286362 No 55 >cd04250 AAK_NAGK-C AAK_NAGK-C: N-Acetyl-L-glutamate kinase - cyclic (NAGK-C) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in some bacteria and photosynthetic organisms using the non-acetylated, cyclic route of ornithine biosynthesis. In this pathway, glutamate is first N-acetylated and then phosphorylated by NAGK to give phosphoryl NAG, which is converted to NAG-ornithine. There are two variants of this pathway. In one, typified by the pathway in Thermotoga maritima and Pseudomonas aeruginosa, the acetyl group is recycled by reversible transacetylation from acetylornithine to glutamate. The phosphorylation of NAG by NAGK is feedback inhibited by arginine. In photosynthetic organisms, NAGK is the target of the nitrogen-signaling protein PII. Hexameric formation of NAGK domains appears to be essential to both arginine inhibition and NAGK-PII complex formation. NAGK-C are members of the Amino A Probab=99.87 E-value=5.7e-21 Score=148.93 Aligned_cols=224 Identities=16% Similarity=0.143 Sum_probs=140.6 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHH-HHHCCCC------CCCCHHHHH Q ss_conf 824999994003685388777089999999999999997699099982365410233-4420377------775402467 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGS-QVVAENY------LLCERSTVD 77 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~-~~~~~~~------~~~~~~~~~ 77 (242) +=|++|||+||+.|.++ +.+.+++++|+.++..|.+++||||||...... ...+... ..-+..... T Consensus 13 rgktiVIklGG~~l~d~-------~~~~~~~~dI~~L~~~G~~vVlVHGggpqi~~~l~~~gi~~~~~~G~RvT~~~~l~ 85 (279) T cd04250 13 RGKTVVIKYGGNAMKDE-------ELKESFARDIVLLKYVGINPVVVHGGGPEINEMLKKLGIESEFVNGLRVTDEETME 85 (279) T ss_pred CCCEEEEEECHHHHCCH-------HHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHH T ss_conf 89989999890784697-------79999999999999889929999699679999999869984223774357388899 Q ss_pred HHHHHHHH-HHHHHHHHHHHHHCCCCCCCCCCH--------HH---------H----HHHHHHHHHHCCHHHHCCCCCCC Q ss_conf 88888989-999999999976188641134001--------00---------2----45665432100001100111112 Q gi|254780777|r 78 SMGMLSTV-INALALDLALRKINVPTVILSSIF--------MP---------Q----VCEVFSCRNAVSYLSQGKVVIFS 135 (242) Q Consensus 78 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--------~~---------~----~~~~~~~~~~~~~~~~~~~~~~~ 135 (242) ...+... .....+...+...+.+..-.+..+ .. . .....+.......++.+.+|+.. T Consensus 86 -iv~~~l~G~vn~~lv~~l~~~g~~avglsg~dg~~i~a~~~~~~~~~~~~D~g~vG~v~~vd~~~i~~ll~~g~IpVIs 164 (279) T cd04250 86 -IVEMVLVGKVNKEIVSLINRAGGKAVGLSGKDGNLIKAKKKDATVIEEIIDLGFVGEVTEVNPELLETLLEAGYIPVIA 164 (279) T ss_pred -HHHHHHHCCHHHHHHHHHHHCCCCEEECCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCEEEEC T ss_conf -9999985614699999999739985462266686699997555555556656645665522789999998579748964 Q ss_pred CC----C-CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH-------CCCCCCCHH Q ss_conf 46----5-56777621578888864164466514754351234322344444444115999985-------599814099 Q gi|254780777|r 136 GG----T-GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE-------KGLKVMDCA 203 (242) Q Consensus 136 ~~----~-~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~-------~G~~v~~~~ 203 (242) .. . +....+.|..|+.+|.+++||.|+++|||+|+|+..+.. .+++++++.+|+.+ .|+|.-|++ T Consensus 165 plg~~~~G~~~NvnaD~~A~~lA~aL~AdkLi~ltdv~Gv~~~~~~~---~~~i~~lt~~e~~~li~~~~i~ggM~pKl~ 241 (279) T cd04250 165 PVGVGEDGETYNINADTAAGAIAAALKAEKLILLTDVAGVLDDPNDP---GSLISEISLKEAEELIADGIISGGMIPKVE 241 (279) T ss_pred CCCCCCCCCEECCCHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCC---CCCCCCCCHHHHHHHHHCCCCCCCCHHHHH T ss_conf 64168989786688999999999986449499995881110588887---540025899999999975976897499999 Q ss_pred HHHHHHHCCC-CEEEEECCCCCHHHH-HHCCCCCCCEE Q ss_conf 9999997499-699951898547999-97799878246 Q gi|254780777|r 204 SVVLARDCSI-PIIVFSIHSPGGIWG-GLSGIGRSTII 239 (242) Q Consensus 204 A~~~a~~~gI-~v~I~ng~~~~~i~~-~l~Ge~~GT~I 239 (242) |+.-|.++|+ .++|.||+.|+.|.. .+.++..||+| T Consensus 242 aa~~Al~~GV~rvhIidg~~~~~LL~ELft~~G~GT~I 279 (279) T cd04250 242 ACIEALEGGVKAAHIIDGRVPHSLLLEIFTDEGIGTMI 279 (279) T ss_pred HHHHHHHCCCCEEEEECCCCCCHHHHHHHCCCCCCCCC T ss_conf 99999984999799946989887999981799986609 No 56 >TIGR00656 asp_kin_monofn asparate kinase, monofunctional class; InterPro: IPR005260 Aspartate kinase catalyzes the first step in the biosynthesis of Lys (via its precursor diaminopimelate), Met, and Thr. In E. coli, a distinct isozyme is inhibited by each of these three amino acid end products . The Met-sensitive (I) and Thr-sensitive (II) forms are bifunctional enzymes fused to homoserine dehydrogenase, which catalyses the next step in the biosynthesis of these amino acids. Inn contrast, the Lys-sensitive form (III) is a monofunctional enzyme; homoserine dehydrogenase is not part of the Lys biosynthetic pathway. This entry represents a subclass of aspartate kinases that are mostly Lys-sensitive and are not fused to homoserine dehydrogenase. The E. coli enzyme is a homodimer, while the Bacillus and Corynebacterium enzymes are alpha 2/beta 2 heterotetramers, where the beta subunit is translated from an in-phase alternative initiator at Met-246 , . The protein slr0657 from Synechocystis PCC6803 is extended by a duplication of the C-terminal region corresponding to the beta chain. Incorporation of a second copy of the C-terminal domain may be quite common in this subgroup of aspartokinases. ; GO: 0004072 aspartate kinase activity, 0009058 biosynthetic process. Probab=99.84 E-value=7.3e-20 Score=142.11 Aligned_cols=220 Identities=20% Similarity=0.236 Sum_probs=161.5 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCH--HHHHH-------------------- Q ss_conf 24999994003685388777089999999999999997699099982365410--23344-------------------- Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIF--RGSQV-------------------- 63 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~--~~~~~-------------------- 63 (242) +..||.||||+++.+ .+.+.+.++.+.+...+|+++++|.|+.... ..... T Consensus 1 ~~~~v~kfGGt~~~~-------~~~~~~~~~~~~~~~~~g~~~~vv~sa~~~~td~l~~~~~~~~pg~~~~~~~~~~~~~ 73 (480) T TIGR00656 1 MELIVQKFGGTSVGS-------GERIKNAARIVLKEKEEGHKVVVVVSAMGGVTDALLSLAEGLDPGEGGAKGGDLLGIL 73 (480) T ss_pred CCEEEEECCCCCCCC-------HHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 944787426721254-------2678888878887764077148998615652024665543202322113567889999 Q ss_pred -----H----CCCC------------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC--CCHHHHHH Q ss_conf -----2----0377------------------77540246788888989999999999976188641134--00100245 Q gi|254780777|r 64 -----V----AENY------------------LLCERSTVDSMGMLSTVINALALDLALRKINVPTVILS--SIFMPQVC 114 (242) Q Consensus 64 -----~----~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~ 114 (242) . .... .....++.+.....+......++...+...|.....+. ........ T Consensus 74 ~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~s~Ge~~s~~~~~~~l~~~g~~~~~~~p~g~~~g~~t 153 (480) T TIGR00656 74 RELGLPLRYPEAISDEILELELNKLLLAAAALGEWTPRERDLLVSYGELLSSALLSGALRDLGVKAIWLDPFGGEAGILT 153 (480) T ss_pred HHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCEEE T ss_conf 98643331100135567887777788877886421755777776411257899999999852762144143456222232 Q ss_pred H--------------HHHHHHHCCHHHHCCCCCCCCCC---------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCC Q ss_conf 6--------------65432100001100111112465---------567776215788888641644665147543512 Q gi|254780777|r 115 E--------------VFSCRNAVSYLSQGKVVIFSGGT---------GNAFLTTDSAAALRASEIGADVILKGTQVDGVY 171 (242) Q Consensus 115 ~--------------~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiy 171 (242) + ..........++.+.++++.++. .+..|++|++|++++..++|+.+.+||||+|+| T Consensus 154 ~~~~~~a~~~~~~~~~~~~~~l~~~l~~g~~~~~~Gf~G~~~~g~~ttlGrGGsd~~a~~l~~~l~a~~~~~~tdv~G~~ 233 (480) T TIGR00656 154 DDNFGNAKIDELATEGLAAERLLPLLEEGIIPVVAGFQGATEEGDTTTLGRGGSDLTAALLGAALKADRVDIYTDVPGVY 233 (480) T ss_pred CCCCCCCEEEHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCC T ss_conf 45455520002224677888888887549758972333577676444304786237899987521776379974478501 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHHC---CCCCCCHHHHHHH-HHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 343223444444441159999855---9981409999999-974996999518985479999779987824668 Q gi|254780777|r 172 SADPRVHASSTRFDSLTYNQFIEK---GLKVMDCASVVLA-RDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 172 t~dP~~~~~ak~i~~is~~e~l~~---G~~v~~~~A~~~a-~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) +.||+..+.|++++.++|+|+.++ |++++|+++++++ .+.++|+.+.+...+ +..||+|.+ T Consensus 234 ~~dPr~~~~a~~~~~~~~~e~~~la~~Ga~~l~p~~~~~~d~~~~~~~~~~~~~~~---------~~~gt~~~~ 298 (480) T TIGR00656 234 TTDPRVVEGAKKLPKISYEEALELATLGAKVLHPRTLEPADKEGGVPIEVRSSFDP---------EAEGTLITN 298 (480) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHCCCHHCCCCCCCCCCCCCCCCEEEEECCCC---------CCCCCEECC T ss_conf 26864245643223323899999975250120321002011013663588624577---------767414304 No 57 >cd04249 AAK_NAGK-NC AAK_NAGK-NC: N-Acetyl-L-glutamate kinase - noncyclic (NAGK-NC) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis using the acetylated, noncyclic route of ornithine biosynthesis. There are two variants of this pathway. In one, typified by the pathway in Escherichia coli, glutamate is acetylated by acetyl-CoA and acetylornithine is deacylated hydrolytically. In this pathway, feedback inhibition by arginine occurs at the initial acetylation of glutamate and not at the phosphorylation of NAG by NAGK. Homodimeric NAGK-NC are members of the Amino Acid Kinase Superfamily (AAK). Probab=99.81 E-value=1.9e-18 Score=133.35 Aligned_cols=217 Identities=18% Similarity=0.127 Sum_probs=135.3 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHC-CCEEEEEECCCCCHHHH-HHHCC------CCCCCCHHHHHHHH Q ss_conf 999940036853887770899999999999999976-99099982365410233-44203------77775402467888 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAK-GIEIGIVVGGGNIFRGS-QVVAE------NYLLCERSTVDSMG 80 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~-g~~vviV~sg~~~~~~~-~~~~~------~~~~~~~~~~~~~~ 80 (242) +|||+||+.+.+ .+.+++++.+++.+.+. |+.+++|||||...... ...+- +...-+........ T Consensus 1 iVIK~GG~~l~~-------~~~~~~~~~~i~~l~~~~~~~vVlVHGggpqI~~~l~~~gi~~~f~~G~RvTd~~tl~vv~ 73 (252) T cd04249 1 LVIKLGGALLET-------EAALEQLFSALSEYQQQHNRQLVIVHGGGCVVDELLKKLNFPSEKKNGLRVTPKEQIPYIT 73 (252) T ss_pred CEEEECCHHCCC-------HHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCCCCCCCCEEECCHHHHHHHH T ss_conf 989978246178-------8899999999999998189876998799689999999769997237965658999999999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHH--------------HHHHHHHHHHCCHHHHCCCCCCCCC----CC-CC Q ss_conf 88989999999999976188641134001002--------------4566543210000110011111246----55-67 Q gi|254780777|r 81 MLSTVINALALDLALRKINVPTVILSSIFMPQ--------------VCEVFSCRNAVSYLSQGKVVIFSGG----TG-NA 141 (242) Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~----~~-~~ 141 (242) +......+..+...+...+.+..-.+..+... .....+.......++.+.+|+.... ++ .. T Consensus 74 ~vl~g~vn~~lv~~l~~~g~~a~gl~~~dg~~~~~~~~~~dlG~vG~v~~Vn~~~i~~ll~~g~IPVisplg~~~~G~~~ 153 (252) T cd04249 74 GALAGTANKQLMAQAIKAGLKPVGLSLADGGMTAVTQLDPELGAVGKATANDPSLLNDLLKAGFLPIISSIGADDQGQLM 153 (252) T ss_pred HHHCCCHHHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCCCCCCCEEEECHHHHHHHHCCCCCCEECCCEECCCCCEE T ss_conf 99846159999999973589862044567746873023666786555323557899998628987476353148999879 Q ss_pred CCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH-------CCCCCCCHHHHHHHHHCCC- Q ss_conf 77621578888864164466514754351234322344444444115999985-------5998140999999997499- Q gi|254780777|r 142 FLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE-------KGLKVMDCASVVLARDCSI- 213 (242) Q Consensus 142 ~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~-------~G~~v~~~~A~~~a~~~gI- 213 (242) ..+.|..|+.+|.+++|+ |+++|||+|+|+. +..++++++..|+.+ .|+|.-|++++.-|.++|+ T Consensus 154 NiNaD~~A~~iA~aL~A~-Li~ltdv~Gvld~------~g~li~~l~~~e~~~li~~~~i~gGMipKl~aa~~Al~~gv~ 226 (252) T cd04249 154 NVNADQAATAIAQLLNAD-LVLLSDVSGVLDA------DKQLISELNAKQAAELIEQGVITDGMIVKVNAALDAAQSLRR 226 (252) T ss_pred ECCHHHHHHHHHHHHCCC-EEEEECCCCEECC------CCCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCC T ss_conf 759999999999971598-9999688613889------998942189999999997595479739999999999981998 Q ss_pred CEEEEECCCCCHHHHHHCCCCCCCEE Q ss_conf 69995189854799997799878246 Q gi|254780777|r 214 PIIVFSIHSPGGIWGGLSGIGRSTII 239 (242) Q Consensus 214 ~v~I~ng~~~~~i~~~l~Ge~~GT~I 239 (242) .++|.+|+.|..|..++.++.+||+| T Consensus 227 ~v~I~~g~~~~~L~elft~~GiGT~I 252 (252) T cd04249 227 GIDIASWQYPEQLTALLAGEPVGTKI 252 (252) T ss_pred EEEEECCCCCHHHHHHHCCCCCCCCC T ss_conf 79994389873899997599987739 No 58 >cd04251 AAK_NAGK-UC AAK_NAGK-UC: N-Acetyl-L-glutamate kinase - uncharacterized (NAGK-UC). This domain is similar to Escherichia coli and Pseudomonas aeruginosa NAGKs which catalyze the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis. These uncharacterized domain sequences are found in some bacteria (Deinococci and Chloroflexi) and archea and belong to the Amino Acid Kinase Superfamily (AAK). Probab=99.79 E-value=4.8e-18 Score=130.91 Aligned_cols=208 Identities=18% Similarity=0.219 Sum_probs=128.4 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCC------------CCCCHHHH Q ss_conf 999940036853887770899999999999999976990999823654102334420377------------77540246 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENY------------LLCERSTV 76 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~------------~~~~~~~~ 76 (242) ||||+|||.+.+ +.++ ++.+.+.|.++++||||+............. ..-+.... T Consensus 1 iVIK~GG~~l~d----------~~~~---~~~l~~lG~~~VlVHGgg~qI~~~l~~~gi~~~fv~~~~Gl~~RvTd~~~l 67 (257) T cd04251 1 IVVKIGGSVVSD----------LDKV---IDDIANFGERLIVVHGGGNYVNEYLKRLGVEPKFVTSPSGIRSRYTDKETL 67 (257) T ss_pred CEEEECHHHHHC----------HHHH---HHHHHHCCCCEEEECCCCHHHHHHHHHCCCCCEEECCCCCCCEEECCHHHH T ss_conf 989989588769----------8999---999997599989997997799999998599967741789862021898999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHH--------------------------HHHHHHHHHCCHHHHCC Q ss_conf 7888889899999999999761886411340010024--------------------------56654321000011001 Q gi|254780777|r 77 DSMGMLSTVINALALDLALRKINVPTVILSSIFMPQV--------------------------CEVFSCRNAVSYLSQGK 130 (242) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------------~~~~~~~~~~~~~~~~~ 130 (242) . .........+..+...+...+.+....+..+.... ....+.......++.+. T Consensus 68 ~-~v~~~~g~vn~~lv~~l~~~g~~a~glsg~d~~~~~~~~~~~~~~~~~g~~~~id~g~vG~v~~V~~~~i~~ll~~g~ 146 (257) T cd04251 68 E-VFVMVMGLINKKIVARLHSLGVKAVGLTGLDGRLLEAKRKEIVRVNERGRKMIIRGGYTGKVEKVNSDLIEALLDAGY 146 (257) T ss_pred H-HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCEEECCCCCCCCCCCCCEEEEEEECHHHHHHHHHCCC T ss_conf 9-999999999999999998489986522266673689851252442467860104579767798987899999984899 Q ss_pred CCCCCC-----CCCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH-----CCCCCC Q ss_conf 111124-----6556777621578888864164466514754351234322344444444115999985-----599814 Q gi|254780777|r 131 VVIFSG-----GTGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE-----KGLKVM 200 (242) Q Consensus 131 ~~~~~~-----~~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~-----~G~~v~ 200 (242) +|+... ..+....+.|..|+.+|..++|+.|+++|||+|+|. | .+++++++..|+.+ .|+|.- T Consensus 147 iPVispig~~~~G~~~NvnaD~~A~~iA~aL~A~kLI~ltdv~Gvl~-d------g~li~~l~~~e~~~li~~i~gGM~p 219 (257) T cd04251 147 LPVVSPVAYSEEGEPLNVDGDRAAAAIAAALKAERLILLTDVEGLYL-D------GRVIERITVSDAESLLEKAGGGMKR 219 (257) T ss_pred CEEECCCCCCCCCCEECCCHHHHHHHHHHHCCCCEEEEEECCCEEEC-C------CEECCCCCHHHHHHHHHHHCCCHHH T ss_conf 45866766688760742877999999999809996999968840424-9------8453658999999999873485399 Q ss_pred CHHHHHHHHHCCCC-EEEEECCCCCHHHHHHCCCCCCCEE Q ss_conf 09999999974996-9995189854799997799878246 Q gi|254780777|r 201 DCASVVLARDCSIP-IIVFSIHSPGGIWGGLSGIGRSTII 239 (242) Q Consensus 201 ~~~A~~~a~~~gI~-v~I~ng~~~~~i~~~l~Ge~~GT~I 239 (242) |++|+.-|.++|++ ++|.||+.|+.|...|+| .||+| T Consensus 220 Kl~aa~~A~~~GV~~v~Ii~g~~~~~ll~el~G--~GT~I 257 (257) T cd04251 220 KLLAAAEAVEGGVREVVIGDARADSPISSALNG--GGTVI 257 (257) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCHHHHHHCC--CCCCC T ss_conf 999999999809997999079998779999658--98469 No 59 >CHL00202 argB acetylglutamate kinase; Provisional Probab=99.78 E-value=1.8e-17 Score=127.32 Aligned_cols=226 Identities=14% Similarity=0.125 Sum_probs=141.7 Q ss_pred CCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHH-HHHHCC------CCCCCCHHHH Q ss_conf 882499999400368538877708999999999999999769909998236541023-344203------7777540246 Q gi|254780777|r 4 FPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRG-SQVVAE------NYLLCERSTV 76 (242) Q Consensus 4 ~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~-~~~~~~------~~~~~~~~~~ 76 (242) +.=|.+|||+||+.+.++ ....++++.|+-++.-|.++|+||||+..... ....+. +...-+.... T Consensus 21 ~rgktfVIk~gG~~~~d~-------~l~~~~~~DialL~~lGik~VlVHGgg~qI~~~l~~~gi~~~f~~G~RvTd~~~l 93 (284) T CHL00202 21 FRGRIMVIKYGGAAMKNL-------ILKADIIKDILFLSCIGLKIVVVHGGGPEINFWLKQLNISPKFWNGIRVTDKVTM 93 (284) T ss_pred HCCCEEEEEECCEEEECH-------HHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHH T ss_conf 689989999896666375-------4899999999999988997999889966899999976998436288321887899 Q ss_pred HHHHHH-HHHHHHHHHHHHHHHHCCCCCCCCCCH--------H---HHH----HHHHHHHHHCCHHHHCCCCCCCCC--- Q ss_conf 788888-989999999999976188641134001--------0---024----566543210000110011111246--- Q gi|254780777|r 77 DSMGML-STVINALALDLALRKINVPTVILSSIF--------M---PQV----CEVFSCRNAVSYLSQGKVVIFSGG--- 137 (242) Q Consensus 77 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~---~~~----~~~~~~~~~~~~~~~~~~~~~~~~--- 137 (242) .-..+. ....+. .+...+...+.+.......+ . +.. ....+.......++.+.+|+.... T Consensus 94 ~vv~~vl~g~vn~-~lv~~l~~~g~~a~gl~g~dg~~i~A~~~~~~D~g~vG~V~~Vd~~~I~~lL~~g~IpVIspig~~ 172 (284) T CHL00202 94 EIVEMVLAGKVNK-DLVGSINANGGKAVGLCGKDANLIVARASDKKDLGLVGEIQQVDPQLIDMLLEKNYIPVIASVAAD 172 (284) T ss_pred HHHHHHHHHHHHH-HHHHHHHHCCCCCEEEECCCCCEEEEECCCCCCCCCCCCCEEECHHHHHHHHHCCCEEEEECCEEC T ss_conf 9999998778999-999999856995055312557669983589877444444123267999999708975998044067 Q ss_pred -C-CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH---C----CCCCCCHHHHHHH Q ss_conf -5-56777621578888864164466514754351234322344444444115999985---5----9981409999999 Q gi|254780777|r 138 -T-GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE---K----GLKVMDCASVVLA 208 (242) Q Consensus 138 -~-~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~---~----G~~v~~~~A~~~a 208 (242) + +....+.|..|+.+|..++|+.|+++||++|+|.+.+ ....++.+++..|+.+ . |+|.-|+.++--| T Consensus 173 ~~G~~~NvnaD~~A~~iA~aL~A~KLI~ltdv~Gvl~d~~---~~~~li~~l~~~e~~~l~~~~~i~~gm~pkl~aa~~A 249 (284) T CHL00202 173 HDGQTYNINADVVAGEIAAKLNAEKLILLTDTPGILADIN---DPNSLISTLNIKEARNLASTGIISGGMIPKVNCCIRA 249 (284) T ss_pred CCCCEEEECHHHHHHHHHHHHCCCCEEEEECCCCEECCCC---CCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 9999895099999999999864196999988967106788---7653244489999999987598787469999999999 Q ss_pred HHCCC-CEEEEECCCCCH-HHHHHCCCCCCCEEC Q ss_conf 97499-699951898547-999977998782466 Q gi|254780777|r 209 RDCSI-PIIVFSIHSPGG-IWGGLSGIGRSTIIS 240 (242) Q Consensus 209 ~~~gI-~v~I~ng~~~~~-i~~~l~Ge~~GT~I~ 240 (242) -++|+ .++|.||+.|+. |..++..+..||+|- T Consensus 250 l~~GV~rvhiidg~~~~aLL~ELfT~~G~GT~Iv 283 (284) T CHL00202 250 LAQGVEAAHIIDGKEKHALLLEILTEKGIGSMLV 283 (284) T ss_pred HHCCCCEEEEECCCCCCEEHHHHHCCCCCEEEEE T ss_conf 9859998999258899700056527999814884 No 60 >PRK12353 carbamate kinase; Reviewed Probab=99.78 E-value=6.7e-18 Score=129.99 Aligned_cols=229 Identities=21% Similarity=0.258 Sum_probs=136.3 Q ss_pred CEEEEEEEEEEEEECCCCC-CCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCH-HHHH-HHCCC---CC--CCCHHHHH Q ss_conf 2499999400368538877-7089999999999999997699099982365410-2334-42037---77--75402467 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGF-GVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIF-RGSQ-VVAEN---YL--LCERSTVD 77 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~-~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~-~~~~-~~~~~---~~--~~~~~~~~ 77 (242) +||||||+||+++.++++. .--.+.++..|++|+.+.+.|++++||||+|-.. .... ..... .. +.+..... T Consensus 2 ~k~IVIalGGNAl~~~~~t~~~Q~~~~~~~a~~ia~l~~~G~~~VitHGnGPQVG~lll~~~~~~~~~~~~~pld~~~a~ 81 (312) T PRK12353 2 MKKIVVALGGNALGSSDGSAEEQLEAVKKTAKSLVDLIEEGHEVVITHGNGPQVGNLLLQQAAADSEKNPAMPLDVCVAM 81 (312) T ss_pred CCEEEEEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCCCEECCHHHH T ss_conf 84499983725318999998999999999999999999889969998388448899999866542688898211014343 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCC--------------------------CCHHHHHHHHH-------------- Q ss_conf 88888989999999999976188641134--------------------------00100245665-------------- Q gi|254780777|r 78 SMGMLSTVINALALDLALRKINVPTVILS--------------------------SIFMPQVCEVF-------------- 117 (242) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------------~~~~~~~~~~~-------------- 117 (242) +.+..+.++.. .+...+...+......+ ..+........ T Consensus 82 tqG~IGy~l~q-~l~~~l~~~g~~~~v~tlvtqv~Vd~~Dpaf~~ptKpIG~~y~~~ea~~l~~~~g~~~~~d~g~G~RR 160 (312) T PRK12353 82 SQGYIGYWLQN-ALDNELLKRGINKPVATVVTQVEVDANDPAFKNPTKPIGPFYTEEEAEKLAKEKGWTFKEDAGRGYRR 160 (312) T ss_pred CCCHHHHHHHH-HHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEEECCCCCEEE T ss_conf 03069999999-99999984589986489998998878760002788765665689999999972797442438886277 Q ss_pred -----------HHHHHCCHHHHCCCCCCCCCCCCC-------------CCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCC Q ss_conf -----------432100001100111112465567-------------77621578888864164466514754351234 Q gi|254780777|r 118 -----------SCRNAVSYLSQGKVVIFSGGTGNA-------------FLTTDSAAALRASEIGADVILKGTQVDGVYSA 173 (242) Q Consensus 118 -----------~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~ 173 (242) .......+++.+.+++..+..+++ ..|-|..|+++|..++||.++++|||+|||-. T Consensus 161 VV~SP~P~~iie~~~I~~L~~~g~IvIa~GGGGIPV~~~~g~~~Gv~aVIdkDlaaa~LA~~l~Ad~lliLTdV~~V~~n 240 (312) T PRK12353 161 VVPSPKPVDIIEKEAIKTLVEAGVVVIAAGGGGIPVVREGGGLKGVEAVIDKDFASALLAELIDADLLIILTAVDKVYIN 240 (312) T ss_pred EECCCCCCCEECHHHHHHHHHCCCEEEECCCCCCCEECCCCCEECCEEEECCCHHHHHHHHHCCCCEEEEEECCHHHHHC T ss_conf 50699973022399999999789789964888776576799446613331533888999997199879997163467652 Q ss_pred CCCCCCCCCCCCCCCHHHHHH--------CCCCCCCHHHHHHHHHCCC-C-EEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 322344444444115999985--------5998140999999997499-6-999518985479999779987824668 Q gi|254780777|r 174 DPRVHASSTRFDSLTYNQFIE--------KGLKVMDCASVVLARDCSI-P-IIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 174 dP~~~~~ak~i~~is~~e~l~--------~G~~v~~~~A~~~a~~~gI-~-v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) -++ |+.++++++|.+|+.+ .|+|.=|..|+--+.++|- . .+| ..++.+..+|+| ..||+|++ T Consensus 241 ~g~--p~q~~l~~lt~~e~~~~~~~G~F~~GsM~PKVeAai~fv~~gg~~~aiI---t~l~~~~~al~G-~~GT~I~~ 312 (312) T PRK12353 241 FGK--PNQKKLDRVTVSELEKYIEEGQFAPGSMLPKVEAAISFVESGGKRKAII---TSLENAKEALEG-KAGTVIVK 312 (312) T ss_pred CCC--CCCCCCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCEEEE---CCHHHHHHHHCC-CCCCEEEC T ss_conf 899--8613321388999999997799888882699999999999489837998---989999999679-99848829 No 61 >COG0548 ArgB Acetylglutamate kinase [Amino acid transport and metabolism] Probab=99.78 E-value=3.4e-17 Score=125.68 Aligned_cols=224 Identities=17% Similarity=0.170 Sum_probs=141.7 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCH-HHHHHHC------CCCCCCCHHHHHH Q ss_conf 24999994003685388777089999999999999997699099982365410-2334420------3777754024678 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIF-RGSQVVA------ENYLLCERSTVDS 78 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~-~~~~~~~------~~~~~~~~~~~~~ 78 (242) +|.+|||+||+.+.++ +...++++.|+.++..|.+.++|||||-.. +.....+ .+...-+..+.. T Consensus 2 ~k~~VIK~GG~~~~~~-------~l~~~~~~di~lL~~~G~~~VvVHGggp~I~~~l~~~gie~~f~~glRvTd~~tle- 73 (265) T COG0548 2 GKTIVIKLGGSAMEDE-------NLLEAFASDIALLKSVGIRPVVVHGGGPQIDEMLAKLGIEPEFVKGLRVTDAETLE- 73 (265) T ss_pred CCEEEEEECCEEECCH-------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCEECCCHHHHH- T ss_conf 9669999894644170-------27899999999999779947999289568889999769997634977767989999- Q ss_pred HHHHHHH-HHHHHHHHHHHHHCCCCCCCCCCH--------HHHHH----------HHHHHHHHCCHHHHCCCCCCC---- Q ss_conf 8888989-999999999976188641134001--------00245----------665432100001100111112---- Q gi|254780777|r 79 MGMLSTV-INALALDLALRKINVPTVILSSIF--------MPQVC----------EVFSCRNAVSYLSQGKVVIFS---- 135 (242) Q Consensus 79 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~----------~~~~~~~~~~~~~~~~~~~~~---- 135 (242) ...+... ..+..+-..+..++....-.+..+ ..... ..-+.......+....+|+.. T Consensus 74 vv~mvl~G~vNk~iva~l~~~g~~avGlsg~Dg~li~A~~~~~~~~id~g~vG~i~~Vn~~~i~~ll~~~~IpViapia~ 153 (265) T COG0548 74 VVEMVLGGTVNKEIVARLSKHGGQAVGLSGVDGNLVTAKKLDVDDGVDLGYVGEIRKVNPELIERLLDNGAIPVIAPIAV 153 (265) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCEEEEEECCCCCCCCCCEEEEEEEECHHHHHHHHHCCCCEEEECCEE T ss_conf 99999988889999999997187414555337878999986641343364044589878999999975798169807427 Q ss_pred -CCCCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH-------CCCCCCCHHHHHH Q ss_conf -46556777621578888864164466514754351234322344444444115999985-------5998140999999 Q gi|254780777|r 136 -GGTGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE-------KGLKVMDCASVVL 207 (242) Q Consensus 136 -~~~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~-------~G~~v~~~~A~~~ 207 (242) ...+....+.|+.|..+|.+++|+.|+++|||+|+|+..|.. ..+++++.+|+.+ .|+|.-|++++.- T Consensus 154 ~~~G~~~NvnaD~~A~~iA~aLkAekLi~ltdv~Gvl~~~~~~----s~i~~~~~~~~~~li~~~~i~~GMi~Kv~~a~~ 229 (265) T COG0548 154 DEDGETLNVNADTAAGALAAALKAEKLILLTDVPGVLDDKGDP----SLISELDAEEAEELIEQGIITGGMIPKVEAALE 229 (265) T ss_pred CCCCCEEEECHHHHHHHHHHHCCCCEEEEEECCHHHCCCCCCC----EEECCCCHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 8999787048899999999973988299872770003489985----455357999999886249734860899999999 Q ss_pred HHHCCC-CEEEEECCCCCHHHH-HHCCCCCCCEECC Q ss_conf 997499-699951898547999-9779987824668 Q gi|254780777|r 208 ARDCSI-PIIVFSIHSPGGIWG-GLSGIGRSTIISG 241 (242) Q Consensus 208 a~~~gI-~v~I~ng~~~~~i~~-~l~Ge~~GT~I~~ 241 (242) |.++|+ +++|.||+.|+.|.. .+.++.+||.|.+ T Consensus 230 A~~~Gv~~v~ii~g~~~~~ll~eLFt~~giGT~i~~ 265 (265) T COG0548 230 ALESGVRRVHIISGRVPHSLLLELFTRDGIGTMIVR 265 (265) T ss_pred HHHHCCCEEEEECCCCCCHHHHHHHCCCCCCEEECC T ss_conf 998499738996377862699998457985068259 No 62 >cd04237 AAK_NAGS-ABP AAK_NAGS-ABP: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the arginine-biosynthesis pathway (ABP) found in gamma- and beta-proteobacteria and higher plant chloroplasts. Domain architecture of these NAGS consisted of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal NAG synthase, acetyltransferase (ArgA) domain. Both bacterial and plant sequences in this CD have a conserved N-terminal extension; a similar sequence in the NAG kinases of the cyclic arginine-biosynthesis pathway has been implicated in feedback inhibition sensing. Plant sequences also have an N-terminal chloroplast transit peptide and an insert (approx. 70 residues) in the C-terminal region of ArgB. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK). Probab=99.75 E-value=5.5e-17 Score=124.33 Aligned_cols=221 Identities=16% Similarity=0.143 Sum_probs=134.2 Q ss_pred CCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHH-HHHCC------CCCCCCHHHH Q ss_conf 8824999994003685388777089999999999999997699099982365410233-44203------7777540246 Q gi|254780777|r 4 FPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGS-QVVAE------NYLLCERSTV 76 (242) Q Consensus 4 ~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~-~~~~~------~~~~~~~~~~ 76 (242) ++-|.+|||+||+.+.++ .+.+++++|+-++..|.++++|||||...... ...+. +...-+.... T Consensus 16 f~~kt~VIK~GG~~~~~~--------~l~~~~~di~lL~~~Gi~~VlVHGGG~~I~~~l~~~gi~~~fv~G~RVTd~~tl 87 (280) T cd04237 16 HRGKTFVIAFGGEAVAHP--------NFDNIVHDIALLHSLGIRLVLVHGARPQIDQRLAERGLEPRYHRGLRITDAAAL 87 (280) T ss_pred HCCCEEEEEECCHHHCCH--------HHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCEEECCCCCCCCHHHH T ss_conf 689989999896330774--------499999999999988996999928986779999986997462178777999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHH----HCCCC---CCC-----CCCHH------HH----HHHHHHHHHHCCHHHHCCCCCC Q ss_conf 788888989999999999976----18864---113-----40010------02----4566543210000110011111 Q gi|254780777|r 77 DSMGMLSTVINALALDLALRK----INVPT---VIL-----SSIFM------PQ----VCEVFSCRNAVSYLSQGKVVIF 134 (242) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~----~~~~~---~~~-----~~~~~------~~----~~~~~~~~~~~~~~~~~~~~~~ 134 (242) ..........+..+ ...+.. .+... ... ..... +. .....+.......++.+.+|++ T Consensus 88 ~vv~~~~G~vn~~i-~a~ls~g~~~~g~~g~~~~~~~g~~i~a~p~g~~dgiD~G~vG~v~~Vn~~~l~~ll~~~~iPVI 166 (280) T cd04237 88 ECVKEAAGAVRLEI-EALLSMGLPNSPMAGARIRVVSGNFVTARPLGVVDGVDFGHTGEVRRIDADAIRRQLDQGSIVLL 166 (280) T ss_pred HHHHHHHHHHHHHH-HHHHHHCCCCCCCCCCCCEEECCCEEEEEECCCCCCCCCCCCCCCCEEEHHHHHHHHHCCCEEEE T ss_conf 99999999999999-99997053435653557234058658988865314744467983026649999868758965896 Q ss_pred CCC-----CCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH---C-----CCCCCC Q ss_conf 246-----556777621578888864164466514754351234322344444444115999985---5-----998140 Q gi|254780777|r 135 SGG-----TGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE---K-----GLKVMD 201 (242) Q Consensus 135 ~~~-----~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~---~-----G~~v~~ 201 (242) ... .+....+.|..|+.+|..++|+.++++|||+|+++. +.+++++++..|+.+ . |+|.-| T Consensus 167 spig~~~~G~~~NvNAD~~A~~iA~aL~A~kLi~ltdv~Gvl~~------~~~~i~~l~~~e~~~ll~~~~~i~~GMipK 240 (280) T cd04237 167 SPLGYSPTGEVFNLSMEDVATAVAIALKADKLIFLTDGPGLLDD------DGELIRELTAQEAEALLETGALLTNDTARL 240 (280) T ss_pred ECCEECCCCCEEEECHHHHHHHHHHHHHHCEEEEECCCCCEECC------CCCCCCCCCHHHHHHHHHHCCCCCCCHHHH T ss_conf 04318899988700899999999999744709987188832789------998865049999999998479888778999 Q ss_pred HHHHHHHHHCCCC-EEEEECCCCCHHH-HHHCCCCCCCEE Q ss_conf 9999999974996-9995189854799-997799878246 Q gi|254780777|r 202 CASVVLARDCSIP-IIVFSIHSPGGIW-GGLSGIGRSTII 239 (242) Q Consensus 202 ~~A~~~a~~~gI~-v~I~ng~~~~~i~-~~l~Ge~~GT~I 239 (242) ++++.-|.++|++ ++|.||+.|+.|. .++..+..||+| T Consensus 241 l~~a~~Al~~GV~~vhIidg~~~~aLL~ElfT~~G~GT~I 280 (280) T cd04237 241 LQAAIEACRGGVPRVHLISYAEDGALLLELFTRDGVGTLI 280 (280) T ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHCCCCCCCCC T ss_conf 9999999983999899905999971467760699986709 No 63 >COG2054 Uncharacterized archaeal kinase related to aspartokinases, uridylate kinases [General function prediction only] Probab=99.72 E-value=9.1e-17 Score=123.01 Aligned_cols=206 Identities=21% Similarity=0.183 Sum_probs=130.9 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99940036853887770899999999999999976990999823654102334420377775402467888889899999 Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTVINAL 89 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 89 (242) |+|.|||.... +.+.....+.+-+.++.+|.||+.|+.......+.....+ ....++...+|-.--. T Consensus 3 vVk~~Gs~~~~------------~~~~~~~ale~~~~~i~iVpGGg~FAd~VR~id~~~~lSd-sasHwmAI~~Md~~G~ 69 (212) T COG2054 3 VVKKGGSGVAE------------RAAAVKEALENLQRSILIVPGGGIFADLVRKIDEEFGLSD-SASHWMAITAMDQYGF 69 (212) T ss_pred EEEECCCCHHH------------HHHHHHHHHHHHCCEEEEECCCHHHHHHHHHHHHHCCCCC-CHHHHHHHHHHHHHHH T ss_conf 68862787088------------8899999998414608995184289999999987459875-0788999999999889 Q ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCC-CCCCCCHHHHHHHHHHHCHHHHHHCCCCC Q ss_conf 99999976188641134001002456654321000011001111124655-67776215788888641644665147543 Q gi|254780777|r 90 ALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTG-NAFLTTDSAAALRASEIGADVILKGTQVD 168 (242) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~D~~a~~ia~~~~ad~l~i~tdvd 168 (242) +++ +........+..+.... ..+.....+......-....-+ -=..++|.+|+.+|...++..+++.|||| T Consensus 70 ~la----d~~~~~~~~tv~ep~~~----i~~~~~aVLLPyrlLr~~DplpHSW~VTSDsis~~Ia~~~~~~~vv~aTDVd 141 (212) T COG2054 70 YLA----DLASRFVTDTVTEPEDG----IKPDAKAVLLPYRLLRKTDPLPHSWEVTSDSISVWIAAKAGATEVVKATDVD 141 (212) T ss_pred HHH----HHHCCCCCCEEECHHHC----CCCCCCEEEEEHHHHHCCCCCCCCEEECCCHHHHHHHHHCCCCEEEEEECCC T ss_conf 998----64300342023164442----6756606860067650278887652222318999999972982799995378 Q ss_pred CCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHHHHHHHCCCCEEEEECCCCCHHHHHHCC-CCCCCEECC Q ss_conf 5123432234444444411599998559981409999999974996999518985479999779-987824668 Q gi|254780777|r 169 GVYSADPRVHASSTRFDSLTYNQFIEKGLKVMDCASVVLARDCSIPIIVFSIHSPGGIWGGLSG-IGRSTIISG 241 (242) Q Consensus 169 Giyt~dP~~~~~ak~i~~is~~e~l~~G~~v~~~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~G-e~~GT~I~~ 241 (242) |+|+.+|+ ++++++|+..|..+ |-..+|+.+-+++.+++++++++||..|.++..+++| +.+||+|.+ T Consensus 142 GI~~~~~~----~kLv~eI~A~dl~~-~~t~vD~~~P~Ll~k~~m~~~Vvng~~pervi~~lrGk~~v~T~Ivg 210 (212) T COG2054 142 GIYEEDPK----GKLVREIRASDLKT-GETSVDPYLPKLLVKYKMNCRVVNGKEPERVILALRGKEVVGTLIVG 210 (212) T ss_pred CCCCCCCC----CHHHHHHHHHHCCC-CCCCCCCHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCCCEEEEEC T ss_conf 60115896----30566621755356-75315604068999718715998787889999987165654478757 No 64 >PRK12454 carbamate kinase-like carbamoyl phosphate synthetase; Reviewed Probab=99.72 E-value=3.4e-16 Score=119.50 Aligned_cols=228 Identities=20% Similarity=0.219 Sum_probs=138.1 Q ss_pred EEEEEEEEEEEEECCCCC-CC--CHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHH-HHHH--HCC----CCCCCCHHHH Q ss_conf 499999400368538877-70--899999999999999976990999823654102-3344--203----7777540246 Q gi|254780777|r 7 KRVLLKVSGEALAGISGF-GV--DIDSVNRICADIAEVYAKGIEIGIVVGGGNIFR-GSQV--VAE----NYLLCERSTV 76 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~-~~--~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~-~~~~--~~~----~~~~~~~~~~ 76 (242) |||||-+||++|..+... +. -.+.++.-|++|+++.++|++++|+||.|-... .... ... ...+.+...+ T Consensus 3 k~iVvALGGNAl~~~g~~~t~~~Q~~~v~~~a~~ia~li~~g~~vVitHGNGPQVG~i~l~~~~~~~~~~p~~PLd~~~A 82 (313) T PRK12454 3 KRIVIALGGNALQQRGEKGTAENMMKNVRRTAEQIARLIEEGYEVVITHGNGPQVGNLLLQMDAAKDVGIPPFPLDVAGA 82 (313) T ss_pred CEEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHH T ss_conf 55999808365089999999999999999999999999987997999818813999999999873246899866035523 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCC---CCCC-----------------------HHHHHHHHH-----------HH Q ss_conf 7888889899999999999761886411---3400-----------------------100245665-----------43 Q gi|254780777|r 77 DSMGMLSTVINALALDLALRKINVPTVI---LSSI-----------------------FMPQVCEVF-----------SC 119 (242) Q Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~-----------------------~~~~~~~~~-----------~~ 119 (242) .+.+..+.+++. .+...+...+....+ .+.. +........ ++ T Consensus 83 ~sQG~IGy~lqq-al~n~L~~~~~~~~v~tlvTqv~VD~~DpAF~~PtKpIGpfy~~eeA~~l~~~~gw~~~eD~grG~R 161 (313) T PRK12454 83 MTQGWIGYMIQQ-ALRNVLPKRGIERQVATIVTQTIVDKNDPAFQNPTKPVGPFYDEEEAKKLAKEKGWIVKEDAGRGYR 161 (313) T ss_pred CCCHHHHHHHHH-HHHHHHHHCCCCCCEEEEEEEEEECCCCHHHCCCCCCCCCCCCHHHHHHHHHHCCCEEEECCCCCEE T ss_conf 013489999999-9999998539998469999899987876322089886268879999999998559789764898558 Q ss_pred --------------HHHCCHHHHCCCCCCCCCCCCC-------------CCCCHHHHHHHHHHHCHHHHHHCCCCCCCCC Q ss_conf --------------2100001100111112465567-------------7762157888886416446651475435123 Q gi|254780777|r 120 --------------RNAVSYLSQGKVVIFSGGTGNA-------------FLTTDSAAALRASEIGADVILKGTQVDGVYS 172 (242) Q Consensus 120 --------------~~~~~~~~~~~~~~~~~~~~~~-------------~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt 172 (242) .....+++.+.+++..+..+++ ..|-|..|+++|..++||.|+++||||++|- T Consensus 162 RVVpSP~P~~IvE~~~I~~Lv~~g~iVIa~GGGGIPVv~~~g~l~GveAVIDKD~aaa~LA~~l~Ad~liILTdVd~V~~ 241 (313) T PRK12454 162 RVVPSPDPLGIVEIETIKMLVENGVIVIAAGGGGIPVIEEDGELKGVEAVIDKDLASELLAEELNADIFIILTDVEKVYL 241 (313) T ss_pred EECCCCCCCEEECHHHHHHHHHCCCEEEEECCCCCCEECCCCCEECCCEEECCCHHHHHHHHHHCCCEEEEEECCCHHEE T ss_conf 86179999427528999999978988999489856668559827565204445188999999709998999616650000 Q ss_pred CCCCCCCCCCCCCCCCHHHHHH--------CCCCCCCHHH-HHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 4322344444444115999985--------5998140999-9999974996999518985479999779987824668 Q gi|254780777|r 173 ADPRVHASSTRFDSLTYNQFIE--------KGLKVMDCAS-VVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 173 ~dP~~~~~ak~i~~is~~e~l~--------~G~~v~~~~A-~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) .- -.|+.+.++++|.+|+.. .|+|.=|.+| ++.+++.|-+++|.+ ++.+..+|+|+ .||.|.| T Consensus 242 n~--g~p~q~~l~~it~~e~~~~~~eg~F~~GsM~PKV~Aai~Fve~gg~~aiIts---l~~~~~al~g~-~GT~IvP 313 (313) T PRK12454 242 NY--GKPDQKALDKVTVDEAKKYYEEGHFPAGSMGPKILAAIRFVENGGRRAVIGD---LEKAVEALEGK-TGTRIIP 313 (313) T ss_pred CC--CCCCHHHHHCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEECC---HHHHHHHHCCC-CCCEECC T ss_conf 38--9964122403889999999875998888826999999999994797699998---89999996799-9988659 No 65 >PRK12686 carbamate kinase; Reviewed Probab=99.72 E-value=3.2e-16 Score=119.60 Aligned_cols=228 Identities=18% Similarity=0.220 Sum_probs=134.7 Q ss_pred CEEEEEEEEEEEEECCCCC-CCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHH-HHH-HHC---CCC--CCCCHHHHH Q ss_conf 2499999400368538877-70899999999999999976990999823654102-334-420---377--775402467 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGF-GVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFR-GSQ-VVA---ENY--LLCERSTVD 77 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~-~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~-~~~-~~~---~~~--~~~~~~~~~ 77 (242) .|||||++||++|..+++. .--...++..|++|+++.+.|++++|+||.|-... ... ... ... .+.+...+. T Consensus 3 ~k~iVIAlGGNAl~~~~~t~~~Q~~~~~~~a~~ia~l~~~g~~vVitHGNGPQVG~l~l~~~~~~~~~~p~~Pld~~~A~ 82 (313) T PRK12686 3 KEKIVIALGGNAILQTEASAEAQQTAVREAMQHLVDLIEAGADIVITHGNGPQVGSLLLQQAESNSNKVPAMPLDTCVAM 82 (313) T ss_pred CCEEEEEECCHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHC T ss_conf 77799982803128999999999999999999999999869979998188079899999987643688898764045430 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCC---CCC------C-----------------HHHHHHHHH-------------- Q ss_conf 888889899999999999761886411---340------0-----------------100245665-------------- Q gi|254780777|r 78 SMGMLSTVINALALDLALRKINVPTVI---LSS------I-----------------FMPQVCEVF-------------- 117 (242) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~------~-----------------~~~~~~~~~-------------- 117 (242) +.+..+.++.. .+...+...+..... .+. + +........ T Consensus 83 sQG~IGy~lqq-al~n~l~~~~~~~~v~tllTqv~VD~~DpAF~~PtKpIGp~y~~eeA~~l~~~~gw~~~~d~g~g~RR 161 (313) T PRK12686 83 SQGMIGYWLQT-ALNNELTERNIDKPVITLVTQVEVDKDDPAFANPTKPIGPFYTEEEAKQQAEQPGSTFKEDAGRGYRR 161 (313) T ss_pred CCHHHHHHHHH-HHHHHHHHCCCCCCEEEEEEEEEECCCCHHHHCCCCCCCCCCCHHHHHHHHHHCCCEEEECCCCCEEE T ss_conf 23379999999-99999986389986489998888888886775789860688799999999972498699748984489 Q ss_pred -----------HHHHHCCHHHHCCCCCCCCCCCCC-------------CCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCC Q ss_conf -----------432100001100111112465567-------------77621578888864164466514754351234 Q gi|254780777|r 118 -----------SCRNAVSYLSQGKVVIFSGGTGNA-------------FLTTDSAAALRASEIGADVILKGTQVDGVYSA 173 (242) Q Consensus 118 -----------~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~ 173 (242) .......+++.+.+++..+..+++ ..|-|..|+++|..++||.++|+||||+||-. T Consensus 162 VVpSP~P~~Ive~~~I~~Lv~~g~ivIa~GGGGIPV~~~~~~~~GveAVIDKDlaaa~LA~~l~AD~liILTdV~~V~~n 241 (313) T PRK12686 162 VVPSPKPKEIIEHDTIRTLVDGGNIVIACGGGGIPVIRDGNTLKGVEAVIDKDFASEKLAELIDADLLMILTGVENVFIN 241 (313) T ss_pred EECCCCCCCCCCHHHHHHHHHCCCEEEECCCCCCCEEEECCCEECEEEEECCCHHHHHHHHHCCCCEEEEEECCCEEEEC T ss_conf 83699984114699999999689889982798743797589676302242544788999986199989997266602321 Q ss_pred CCCCCCCCCCCCCCCHHHHHH--------CCCCCCCHHHH-HHHHHC-CCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 322344444444115999985--------59981409999-999974-99699951898547999977998782466 Q gi|254780777|r 174 DPRVHASSTRFDSLTYNQFIE--------KGLKVMDCASV-VLARDC-SIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 174 dP~~~~~ak~i~~is~~e~l~--------~G~~v~~~~A~-~~a~~~-gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) -.+ |+.+.++++|.+|+.. .|+|.=|+.|+ +.+++- |=..+| ..++.+..+|+|+ .||.|| T Consensus 242 ~g~--p~q~~l~~it~~e~~~~~~~g~F~~GSM~PKVeAai~Fve~g~g~~aiI---~~l~~~~~al~G~-~GT~It 312 (313) T PRK12686 242 FNK--PNQQKLDDISVAEAKQYIAEGQFAPGSMLPKVEAAIDFVESGKNKKAII---TSLEQAKEALAGN-AGTHIT 312 (313) T ss_pred CCC--CCHHHHHCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCEEEE---CCHHHHHHHHCCC-CCCEEE T ss_conf 899--7522352189999999997599988882799999999998099986998---9899999996799-987871 No 66 >PRK05279 N-acetylglutamate synthase; Validated Probab=99.69 E-value=1.3e-15 Score=115.83 Aligned_cols=221 Identities=17% Similarity=0.145 Sum_probs=135.0 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHH-HHHHHC------CCCCCCCHHHHHH Q ss_conf 249999940036853887770899999999999999976990999823654102-334420------3777754024678 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFR-GSQVVA------ENYLLCERSTVDS 78 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~-~~~~~~------~~~~~~~~~~~~~ 78 (242) =|.+|||+||+++.+++ +.++++.|+-++.-|.++||||||+.... .....+ .+...-+...... T Consensus 25 gktfVI~~gG~a~~~~~--------~~~~~~DIaLL~slGIr~VlVHGggpqI~~~L~~~gi~s~f~~G~RVTD~~tl~~ 96 (441) T PRK05279 25 GKTFVIMLGGEAIAHGN--------FSNIVNDIALLHSLGIRLVLVHGARPQIEEQLAAHGIEPRYHKGLRITDAAALEC 96 (441) T ss_pred CCEEEEEECCEECCCCC--------HHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHH T ss_conf 99899998972036835--------8999999999986897699987997788999997699855139913699999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC--------CCC-----CCCHHHHH----------HHHHHHHHHCCHHHHCCCCCCC Q ss_conf 888898999999999997618864--------113-----40010024----------5665432100001100111112 Q gi|254780777|r 79 MGMLSTVINALALDLALRKINVPT--------VIL-----SSIFMPQV----------CEVFSCRNAVSYLSQGKVVIFS 135 (242) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~--------~~~-----~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~ 135 (242) ........+..+ ...+.. +.+. ... ...+.... ....+.......++.+.+|+.. T Consensus 97 v~~v~G~v~~~I-~a~Ls~-gl~nspm~g~~i~v~sGn~i~A~p~Gv~dGvD~g~vG~V~~Vd~~~I~~~L~~g~Ipvis 174 (441) T PRK05279 97 VKQAAGELRLDI-EARLSM-GLPNTPMAGAHIRVVSGNFVTARPLGVDDGVDYCHTGEVRRIDTEAIRRQLDNGAIVLLS 174 (441) T ss_pred HHHHHHHHHHHH-HHHHHC-CCCCCCCCCCCEEEECCCEEEEEECCCCCCCCCCCCCEEEEECHHHHHHHHHCCCEEEEC T ss_conf 999999999999-999834-367875557724675486689876353137566754158797389999999789829986 Q ss_pred CC-----CCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH----------CCCCCC Q ss_conf 46-----556777621578888864164466514754351234322344444444115999985----------599814 Q gi|254780777|r 136 GG-----TGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE----------KGLKVM 200 (242) Q Consensus 136 ~~-----~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~----------~G~~v~ 200 (242) .. .+....+.|..|+.+|.+++|+.|+++||++|+++ .+.+++.+++..|+.+ .++|.- T Consensus 175 plG~s~~Ge~fNinad~vA~~iA~aL~A~KLI~ltd~~Gi~d------~~g~li~~lt~~ea~~ll~~~~~~~i~~g~~~ 248 (441) T PRK05279 175 PLGYSPTGESFNLTMEEVATQVAIALKADKLIFFTESQGILD------EDGELIRELSPNEAQALLERLEQGDLNSGTAR 248 (441) T ss_pred CEEECCCCCEECCCHHHHHHHHHHHCCCCEEEEECCCCCCCC------CCCCCCCCCCHHHHHHHHHHCCCCCCCCCHHH T ss_conf 646389999866699999999999769988998528876428------99987530799999999975317888845699 Q ss_pred CHHHHHHHHHCCC-CEEEEECCCCCHH-HHHHCCCCCCCEECCC Q ss_conf 0999999997499-6999518985479-9997799878246689 Q gi|254780777|r 201 DCASVVLARDCSI-PIIVFSIHSPGGI-WGGLSGIGRSTIISGE 242 (242) Q Consensus 201 ~~~A~~~a~~~gI-~v~I~ng~~~~~i-~~~l~Ge~~GT~I~~~ 242 (242) ++.++-.|-+.|+ .++|.||+.|+.| ..++.-+.+||+|+.+ T Consensus 249 kl~~a~~A~~~GV~RvHiIdg~~~gaLL~ELFT~~GvGTmI~~~ 292 (441) T PRK05279 249 FLRAAVKACRGGVPRSHLISYAEDGALLQELFTRDGIGTQIVME 292 (441) T ss_pred HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHCCCCEEEEEECC T ss_conf 99999999983988699756878841887762588718988547 No 67 >cd04235 AAK_CK AAK_CK: Carbamate kinase (CK) catalyzes both the ATP-phosphorylation of carbamate and carbamoyl phosphate (CP) utilization with the production of ATP from ADP and CP. Both CK (this CD) and nonhomologous CP synthetase synthesize carbamoyl phosphate, an essential precursor of arginine and pyrimidine bases, in the presence of ATP, bicarbonate, and ammonia. CK is a homodimer of 33 kDa subunits and is a member of the Amino Acid Kinase Superfamily (AAK). Probab=99.66 E-value=3.7e-15 Score=113.11 Aligned_cols=226 Identities=21% Similarity=0.239 Sum_probs=133.9 Q ss_pred EEEEEEEEEEEECCCCC-CC--CHHHHHHHHHHHHHHHHCCCEEEEEECCCCCH-HHHHHHC---CCCCCCCHHHH--HH Q ss_conf 99999400368538877-70--89999999999999997699099982365410-2334420---37777540246--78 Q gi|254780777|r 8 RVLLKVSGEALAGISGF-GV--DIDSVNRICADIAEVYAKGIEIGIVVGGGNIF-RGSQVVA---ENYLLCERSTV--DS 78 (242) Q Consensus 8 rIViKiGGSsL~~~~~~-~~--~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~-~~~~~~~---~~~~~~~~~~~--~~ 78 (242) ||||.+||++|..+... +. -.+.++.-|++|+++.+.|++++|+||.|-.. ....... ...+..+...+ .+ T Consensus 1 riViALGGNAl~~~g~~~t~~~Q~~~v~~~a~~ia~l~~~G~~vVitHGNGPQVG~ll~~~~~~~~~~p~~Pld~~~A~s 80 (308) T cd04235 1 RIVVALGGNALLRRGEPGTAEEQRENVKIAAKALADLIKNGHEVVITHGNGPQVGNLLLQNEAAAEKVPAYPLDVCGAMS 80 (308) T ss_pred CEEEEECHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 98999573650899999999999999999999999999869979998699518999999888624678998620464653 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCC--------------------------CCHHHHHHHHH-----------H--- Q ss_conf 8888989999999999976188641134--------------------------00100245665-----------4--- Q gi|254780777|r 79 MGMLSTVINALALDLALRKINVPTVILS--------------------------SIFMPQVCEVF-----------S--- 118 (242) Q Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------------------~~~~~~~~~~~-----------~--- 118 (242) .+..+.+++ ..+...+...+......+ ..+........ + T Consensus 81 QG~IGy~lq-qal~n~l~~~~~~~~v~t~vTqv~Vd~~DpAF~~PtKpIGpfy~~eeA~~l~~~~g~~~~eD~grg~RRV 159 (308) T cd04235 81 QGMIGYMLQ-QALDNELPKRGIDKPVVTLVTQVVVDANDPAFKNPTKPIGPFYSEEEAEELAAEKGWTFKEDAGRGYRRV 159 (308) T ss_pred HHHHHHHHH-HHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEECCCCCEEEE T ss_conf 059999999-9999999854899845999989998899832238888657775999999999845978974288640578 Q ss_pred -----------HHHHCCHHHHCCCCCCCCCCCCC-------------CCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCC Q ss_conf -----------32100001100111112465567-------------776215788888641644665147543512343 Q gi|254780777|r 119 -----------CRNAVSYLSQGKVVIFSGGTGNA-------------FLTTDSAAALRASEIGADVILKGTQVDGVYSAD 174 (242) Q Consensus 119 -----------~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~d 174 (242) ......+++.+.+++..+..+++ ..|-|..|+++|..++||.|+++|||+++|-.- T Consensus 160 VpSP~P~~IvE~~~I~~L~~~g~ivIa~GGGGIPVv~~~~~~~GveAVIDKD~aaa~LA~~l~Ad~liiLTdV~~V~~n~ 239 (308) T cd04235 160 VPSPKPKDIVEIEAIKTLVDNGVIVIAAGGGGIPVVREGGGLKGVEAVIDKDLASALLAEEINADLLVILTDVDNVYINF 239 (308) T ss_pred ECCCCCCEEECHHHHHHHHHCCCEEEEECCCCCCEECCCCCEECCEEEECCHHHHHHHHHHCCCCEEEEEECCHHHHHCC T ss_conf 66999814546999999997798899828986546824995443110213438889999863898899971641687628 Q ss_pred CCCCCCCCCCCCCCHHHHHH--------CCCCCCCHHHHHHHHHCCC-CEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 22344444444115999985--------5998140999999997499-699951898547999977998782466 Q gi|254780777|r 175 PRVHASSTRFDSLTYNQFIE--------KGLKVMDCASVVLARDCSI-PIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 175 P~~~~~ak~i~~is~~e~l~--------~G~~v~~~~A~~~a~~~gI-~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) . .|+.+.++++|.+|+.. .|+|.=|.+|+--..++|- +++| ..++.+..+|+| ..||.|. T Consensus 240 g--~p~q~~L~~it~~e~~~~~~~g~F~~GSM~PKv~Aai~fv~~gg~~a~I---tsl~~~~~al~G-~~GT~I~ 308 (308) T cd04235 240 G--KPNQKALEQVTVEELEKYIEEGQFAPGSMGPKVEAAIRFVESGGKKAII---TSLENAEAALEG-KAGTVIV 308 (308) T ss_pred C--CCCCHHHCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEE---CCHHHHHHHHCC-CCCCEEC T ss_conf 9--9860022148899999999769988898369999999999948987999---989999999679-9995879 No 68 >cd04252 AAK_NAGK-fArgBP AAK_NAGK-fArgBP: N-Acetyl-L-glutamate kinase (NAGK) of the fungal arginine-biosynthetic pathway (fArgBP). The nuclear-encoded, mitochondrial polyprotein precursor with an N-terminal NAGK (ArgB) domain (this CD), a central DUF619 domain, and a C-terminal reductase domain (ArgC, N-Acetylglutamate Phosphate Reductase, NAGPR). The precursor is cleaved in the mitochondria into two distinct enzymes (NAGK-DUF619 and NAGPR). Native molecular weights of these proteins indicate that the kinase is an octamer whereas the reductase is a dimer. This CD also includes some gamma-proteobacteria (Xanthomonas and Xylella) NAG kinases with an N-terminal NAGK (ArgB) domain (this CD) and a C-terminal DUF619 domain. The DUF619 domain is described as a putative distant homolog of the acetyltransferase, ArgA, predicted to function in NAG synthase association in fungi. Eukaryotic sequences have an N-terminal mitochondrial transit peptide. Members of this NAG kinase domain CD belong to th Probab=99.63 E-value=2.6e-14 Score=107.88 Aligned_cols=214 Identities=14% Similarity=0.125 Sum_probs=128.4 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHH-HHHHC------CCCCCCCHHHHHHHHHH Q ss_conf 999400368538877708999999999999999769909998236541023-34420------37777540246788888 Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRG-SQVVA------ENYLLCERSTVDSMGML 82 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~-~~~~~------~~~~~~~~~~~~~~~~~ 82 (242) |||+||+.+.+ .+.++++.|+-++..|.++++||||+..... ....+ .+...-+.......... T Consensus 2 VIk~GG~~~~d---------~~~~l~~di~lL~~lGi~~VlVHGgg~qI~~~l~~~gi~~~~~~G~RvTd~~~l~~v~~v 72 (248) T cd04252 2 VIKVGGAIIED---------DLDELAASLSFLQHVGLYPIVVHGAGPQLNEELEAAGVEPEYVDGLRVTDPETLAVARKV 72 (248) T ss_pred EEEECCHHHHH---------HHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 79999177652---------199999999999988995999918987888999986998531388446898999999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCH-----HH---HH----HHHHHHHHHCCHHHHCCCCCCCC-----CCCCCCCCC Q ss_conf 989999999999976188641134001-----00---24----56654321000011001111124-----655677762 Q gi|254780777|r 83 STVINALALDLALRKINVPTVILSSIF-----MP---QV----CEVFSCRNAVSYLSQGKVVIFSG-----GTGNAFLTT 145 (242) Q Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~---~~----~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~ 145 (242) ....+.. +...+...+.+....+... .. .. ....+.......+..+.+|+... ..+....+. T Consensus 73 l~~vn~~-lv~~l~~~g~~a~~i~g~~~~a~~~~~~~~g~vG~v~~Vd~~~i~~lL~~g~Ipvispig~~~~G~~~Nvna 151 (248) T cd04252 73 FLEENLK-LVEALERNGARARPITSGVFEAEYLDKDKYGLVGKITGVNKAPIEAAIRAGYLPILTSLAETPSGQLLNVNA 151 (248) T ss_pred HHHHHHH-HHHHHHHCCCCEEEECCCCEEEEECCCCCCCCCCCEEEECHHHHHHHHHCCCCEEEEEEEECCCCCEEEECH T ss_conf 9999999-999999659973860376368886676567843225898299999998589944885147889999898799 Q ss_pred HHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHH----HHH----CCCCCCCHHHHHHHHHCCCCEEE Q ss_conf 1578888864164466514754351234322344444444115999----985----59981409999999974996999 Q gi|254780777|r 146 DSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQ----FIE----KGLKVMDCASVVLARDCSIPIIV 217 (242) Q Consensus 146 D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e----~l~----~G~~v~~~~A~~~a~~~gI~v~I 217 (242) |..|+.+|..++|+.|+++||++|+++.+. +++.+++.++ +.+ .++|..++.+++-|.+.--.+++ T Consensus 152 D~~A~~iA~aL~A~kLi~ltdv~Gv~d~~g------~~i~~l~~~~~~~~l~~~~~i~~gm~~kl~a~~~~~~~v~r~h~ 225 (248) T cd04252 152 DVAAGELARVLEPLKIVFLNETGGLLDGTG------KKISAINLDEEYDDLMKQPWVKYGTKLKIKEIKELLDTLPRSSS 225 (248) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCCCCCCC------CCCCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCEE T ss_conf 999999999759998999958830179999------99742788999999987799778636479999999963997189 Q ss_pred EECCCCCHHH-HHHCCCCCCCEE Q ss_conf 5189854799-997799878246 Q gi|254780777|r 218 FSIHSPGGIW-GGLSGIGRSTII 239 (242) Q Consensus 218 ~ng~~~~~i~-~~l~Ge~~GT~I 239 (242) .++..|+.+. .++..+.+||+| T Consensus 226 i~~~~~~aLL~ELFT~~G~GT~I 248 (248) T cd04252 226 VSITSPDDLQKELFTHSGAGTLI 248 (248) T ss_pred EECCCCCHHHHHHHCCCCCCCCC T ss_conf 96027866577771599870639 No 69 >PRK09411 carbamate kinase; Reviewed Probab=99.61 E-value=2.5e-14 Score=107.95 Aligned_cols=223 Identities=22% Similarity=0.188 Sum_probs=131.3 Q ss_pred CEEEEEEEEEEEEECCCCC-CC--CHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHH-H--HHCCCCCCCCHHH--HH Q ss_conf 2499999400368538877-70--89999999999999997699099982365410233-4--4203777754024--67 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGF-GV--DIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGS-Q--VVAENYLLCERST--VD 77 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~-~~--~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~-~--~~~~~~~~~~~~~--~~ 77 (242) ||||||-+||++|..+... +. -...++.-++.|+.+.+ +++++|.||-|-..... . ......+..+... +. T Consensus 1 Mk~iViALGGNAl~~~g~~~t~e~Q~~~v~~aa~~ia~l~~-~~~vVITHGNGPQVG~l~lq~~a~~~~p~~PLd~~~A~ 79 (297) T PRK09411 1 MKTLVVALGGNALLQRGEALTAENQYRNIASAVPALARLAR-SYRLAIVHGNGPQVGLLALQNLAWKEVEPYPLDVLVAE 79 (297) T ss_pred CCEEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHH-CCCEEEEECCCHHHHHHHHHHHHHCCCCCCCCCHHHHH T ss_conf 97599976814217999998999999999999999999980-89399984781298999999886326999883034242 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCC---CCCC------CH-----------HHHH-----HHHHHHH------------ Q ss_conf 88888989999999999976188641---1340------01-----------0024-----5665432------------ Q gi|254780777|r 78 SMGMLSTVINALALDLALRKINVPTV---ILSS------IF-----------MPQV-----CEVFSCR------------ 120 (242) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~------~~-----------~~~~-----~~~~~~~------------ 120 (242) +.+..+. .++..+...+.... +.+. +. +... ......+ T Consensus 80 sQG~IGY-----~l~q~L~~~~~~~~V~tvlTQv~VD~~DpAF~~PtKpIGpfy~~eeA~~l~~~~gw~~~edgrg~RRV 154 (297) T PRK09411 80 SQGMIGY-----MLAQSLSAQPQMPPVTTVLTRIEVSPDDPAFLQPEKFIGPVYQPEEQEALEAAYGWQMKRDGKYLRRV 154 (297) T ss_pred HHHHHHH-----HHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEECCCCEEEEE T ss_conf 2069999-----99999984378998579988899889981213899851788799999999975596895269802687 Q ss_pred -------------HHCCHHHHCCCCCCCCCCCCC----------CCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCC Q ss_conf -------------100001100111112465567----------776215788888641644665147543512343223 Q gi|254780777|r 121 -------------NAVSYLSQGKVVIFSGGTGNA----------FLTTDSAAALRASEIGADVILKGTQVDGVYSADPRV 177 (242) Q Consensus 121 -------------~~~~~~~~~~~~~~~~~~~~~----------~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~ 177 (242) ....+++.+.+++..+..+++ ..|-|..++++|..++||.++|+||||++|-.-. T Consensus 155 VpSP~P~~IvE~~~Ik~Lv~~g~iVIa~GGGGIPV~~~~~GVeAVIDKD~asalLA~~l~AD~liILTdVd~V~~n~g-- 232 (297) T PRK09411 155 VASPQPRKILDSEAIELLLKEGHVVICSGGGGVPVTEDGAGSEAVIDKDLAAALLAEQINADGLVILTDADAVYENWG-- 232 (297) T ss_pred ECCCCCCCEECHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCEEEECCCHHHHHHHHHHCCCEEEEEECCCCHHHCCC-- T ss_conf 379998416369999999986987996598870558985352136634177899999729998999746650101389-- Q ss_pred CCCCCCCCCCCHHHHHH----CCCCCCCHHH-HHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEEC Q ss_conf 44444444115999985----5998140999-999997499699951898547999977998782466 Q gi|254780777|r 178 HASSTRFDSLTYNQFIE----KGLKVMDCAS-VVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIIS 240 (242) Q Consensus 178 ~~~ak~i~~is~~e~l~----~G~~v~~~~A-~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~ 240 (242) -|+.+.+++++.+|+.. .|+|.=|..| +++++..|=+++|. .++.+..+|+|+ .||.|| T Consensus 233 ~p~q~~L~~~t~~el~~~~~~~GSM~PKVeAai~Fv~~~G~~aiIt---sl~~~~~al~G~-~GT~It 296 (297) T PRK09411 233 TPQQRAIRHATPDELAPFAKADGAMGPKVTAVSGYVRSRGKPAWIG---ALSRIEETLAGE-AGTCIS 296 (297) T ss_pred CCCHHHHHCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCEEEEC---CHHHHHHHHCCC-CCCEEE T ss_conf 9243356218999998326798986489999999999579869989---799999996799-987872 No 70 >TIGR00761 argB acetylglutamate kinase; InterPro: IPR004662 N -Acetylglutamate (NAG) fulfils distinct biological roles in lower and higher organisms. In prokaryotes, lower eukaryotes and plants it is the first intermediate in the biosynthesis of arginine, whereas in ureotelic (excreting nitrogen mostly in the form of urea) vertebrates, it is an essential allosteric cofactor for carbamyl phosphate synthetase I (CPSI), the first enzyme of the urea cycle. The pathway that leads from glutamate to arginine in lower organisms employs eight steps, starting with the acetylation of glutamate to form NAG. In these species, NAG can be produced by two enzymatic reactions: one catalysed by NAG synthase (NAGS) and the other by ornithine acetyltransferase (OAT). In ureotelic species, NAG is produced exclusively by NAGS. In lower organisms, NAGS is feedback-inhibited by L-arginine, whereas mammalian NAGS activity is significantly enhanced by this amino acid. The NAGS genes of bacteria, fungi and mammals are more diverse than other arginine-biosynthesis and urea-cycle genes. The evolutionary relationship between the distinctly different roles of NAG and its metabolism in lower and higher organisms remains to be determined . The pathway from glutamate to arginine is: NAGS; N-acetylglutamate synthase (2.3.1.1 from EC) (glutamate to N-acetylglutamate) NAGK; N-acetylglutamate kinase (2.7.2.8 from EC) (N-acetylglutamate to N-acetylglutamate-5P) NAGSA; N-acetyl-gamma-glutamyl-phosphate reductase (1.2.1.38 from EC) (N-acetylglutamate-5P to N-acetylglumate semialdehyde) Acetylornithine aminotransferase (2.6.1.11 from EC) (N-acetylglumate semialdehyde to N-acetylornithine) Acetylornithine deacetylase (3.5.1.16 from EC) (N-acetylornithine to ornithine) Arginase (3.5.3.1 from EC) (ornithine to arginine) This entry describes N-acetylglutamate kinases (ArgB) of many prokaryotes and the N-acetylglutamate kinase domains of multifunctional proteins from yeasts as well as some, as yet, uncharacterised proteins.; GO: 0003991 acetylglutamate kinase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=99.61 E-value=1.1e-14 Score=110.05 Aligned_cols=199 Identities=18% Similarity=0.231 Sum_probs=123.6 Q ss_pred EEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCE-EEEEECCCCCHHHH-H----HHC--CCC-----CCCCHHH Q ss_conf 999940036853887770899999999999999976990-99982365410233-4----420--377-----7754024 Q gi|254780777|r 9 VLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIE-IGIVVGGGNIFRGS-Q----VVA--ENY-----LLCERST 75 (242) Q Consensus 9 IViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~-vviV~sg~~~~~~~-~----~~~--~~~-----~~~~~~~ 75 (242) ||||+||+.+.++ +..++++++++.++..|.+ ++||||||..--.. . ... .+. +.-+..+ T Consensus 2 ~ViK~GG~~~~~~-------~~~~~~~~~~~~l~~~g~~PpviVHGGGp~in~~~~~lgi~~~f~~Gl~G~~~RvTd~~t 74 (254) T TIGR00761 2 IVIKIGGSAISND-------ELLEEFFSDIAFLRAVGIKPPVIVHGGGPEINELLEALGIEPEFKNGLSGVVFRVTDKET 74 (254) T ss_pred EEEEECCEEECCC-------HHHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHCCCCCEEECCCCCEEEECCCHHH T ss_conf 7898447341041-------246999999999997298553898378526888998328983676688720375278899 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHH------------HHH--------------HHHHHHHHHCCHHHHC Q ss_conf 67888889899999999999761886411340010------------024--------------5665432100001100 Q gi|254780777|r 76 VDSMGMLSTVINALALDLALRKINVPTVILSSIFM------------PQV--------------CEVFSCRNAVSYLSQG 129 (242) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~--------------~~~~~~~~~~~~~~~~ 129 (242) +.-........-+..+-..+..++....-++..+. ... ...-+......++..+ T Consensus 75 l~v~~~vL~g~vN~~LV~~l~~~g~~A~GL~g~Dg~L~~a~rk~~~~~~~~~~vv~~~~G~VG~~~~Vn~~~~~~Ll~~G 154 (254) T TIGR00761 75 LEVVEMVLIGDVNKELVALLNKHGINAIGLTGLDGQLFTADRKTASLDKEVIKVVAIDLGYVGEIKKVNKALLEALLKAG 154 (254) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEEEEEEEEEEECCCEEEEEECCCEEEECCCCHHHHHHHHHCC T ss_conf 99989998364669999999967992863113577357999878988515768999974841231327889999999649 Q ss_pred CCCCCCCCC------CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCH-HHHHH-------C Q ss_conf 111112465------567776215788888641644665147543512343223444444441159-99985-------5 Q gi|254780777|r 130 KVVIFSGGT------GNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTY-NQFIE-------K 195 (242) Q Consensus 130 ~~~~~~~~~------~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~-~e~l~-------~ 195 (242) ..|+....- ..-..+-|..|..+|..++|+.|+++|||+|++...| ...++|++|+. +|... . T Consensus 155 ~~PVi~sla~~~~~g~~lNvNAD~aA~~lA~~L~A~kL~~LtDv~Gi~~~y~---d~~~~i~~l~~D~~~~~l~~~~~i~ 231 (254) T TIGR00761 155 IIPVIASLALTNAEGQALNVNADTAAGELAAALGAEKLVLLTDVPGILNGYP---DKQSLISELPLDEEIEQLIKQGVIK 231 (254) T ss_pred CCCEECCCCCCHHCCCEEECCHHHHHHHHHHHCCCCEEEEECCCHHHHCCCC---CCCCEECCCCHHHHHHHHHHHHHHC T ss_conf 8517716658623187770677799999998609953888417403225888---7461402268679999877522231 Q ss_pred CCCCCCHHHHHHHHHCCCCEEE Q ss_conf 9981409999999974996999 Q gi|254780777|r 196 GLKVMDCASVVLARDCSIPIIV 217 (242) Q Consensus 196 G~~v~~~~A~~~a~~~gI~v~I 217 (242) |+|.-|+.+|.-|.+.|++-.+ T Consensus 232 gGMipKV~~a~~A~~~Gv~~v~ 253 (254) T TIGR00761 232 GGMIPKVNAALEALESGVKSVI 253 (254) T ss_pred CCCHHHHHHHHHHHHCCCCEEC T ss_conf 5985789999999973888003 No 71 >PRK12352 putative carbamate kinase; Reviewed Probab=99.60 E-value=3.8e-14 Score=106.81 Aligned_cols=228 Identities=18% Similarity=0.220 Sum_probs=136.6 Q ss_pred EEEEEEEEEEEEECCCCC-CC--CHHHHHHHHHHHHHHHHCCCEEEEEECCCCC-HHHHHHH-----CCCCC--CCCHHH Q ss_conf 499999400368538877-70--8999999999999999769909998236541-0233442-----03777--754024 Q gi|254780777|r 7 KRVLLKVSGEALAGISGF-GV--DIDSVNRICADIAEVYAKGIEIGIVVGGGNI-FRGSQVV-----AENYL--LCERST 75 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~-~~--~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~-~~~~~~~-----~~~~~--~~~~~~ 75 (242) |||||-|||++|....+. +. -...++.-+++|+++.+.|++++|+||-|-. +...... ....+ +.+... T Consensus 3 k~iViALGGNAll~~g~~~t~~~Q~~~v~~~a~~ia~li~~g~~vvitHGNGPQVG~l~lq~e~a~~~~~~p~~PLd~~~ 82 (316) T PRK12352 3 ELVVVAIGGNSIIKDNASQSIEHQAEAVKAVADTVLEMLASDYDIVLTHGNGPQVGLDLRRAEIAHEREGLPLTPLANCV 82 (316) T ss_pred CEEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHCCCCCCCCCHHHHH T ss_conf 66999767343289999999999999999999999999976996999817816988999999875336789987545541 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCC--CCCC------C-----------------HHHHHHHHHHHH---------- Q ss_conf 6788888989999999999976188641--1340------0-----------------100245665432---------- Q gi|254780777|r 76 VDSMGMLSTVINALALDLALRKINVPTV--ILSS------I-----------------FMPQVCEVFSCR---------- 120 (242) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~------~-----------------~~~~~~~~~~~~---------- 120 (242) +.+.+..+.+++.. +...+...+.... +.+. + +.........-+ T Consensus 83 A~sQG~IGY~lqqa-l~n~l~~r~~~~vvtvlTqv~VD~~DpAF~~PtKpIGpfy~~eeA~~l~~~~~gw~~~eD~grG~ 161 (316) T PRK12352 83 ADTQGGIGYLIQQA-LNNRLARHGEKKAVTVVTQVEVDKNDPGFAHPTKPIGAFFSESQRDELQKANPDWRFVEDAGRGY 161 (316) T ss_pred HCCCHHHHHHHHHH-HHHHHHHCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEECCCCCE T ss_conf 21220899999999-99999763898868998789989898010079987478869999999998668987997489963 Q ss_pred ----------------HHCCHHHHCCCCCCCCCCCCC--------------CCCCHHHHHHHHHHHCHHHHHHCCCCCCC Q ss_conf ----------------100001100111112465567--------------77621578888864164466514754351 Q gi|254780777|r 121 ----------------NAVSYLSQGKVVIFSGGTGNA--------------FLTTDSAAALRASEIGADVILKGTQVDGV 170 (242) Q Consensus 121 ----------------~~~~~~~~~~~~~~~~~~~~~--------------~~~~D~~a~~ia~~~~ad~l~i~tdvdGi 170 (242) ....+++.+.+++..+..+++ ..|-|..|+++|..++||.|+|+||||++ T Consensus 162 RRVVpSP~P~~IvE~~~Ik~L~~~g~iVIa~GGGGIPV~~~~~g~l~GVeAVIDKD~asa~LA~~l~AD~liILTdVd~V 241 (316) T PRK12352 162 RRVVASPEPKRIVEAPAIKALIQQGFVVIGAGGGGIPVVRTDAGDYQSVDAVIDKDLSTALLAREIHADILVITTGVEKV 241 (316) T ss_pred EEEECCCCCCCEECHHHHHHHHHCCCEEEEECCCCCCEEECCCCCEECCEEEECCCHHHHHHHHHHCCCEEEEEECCCEE T ss_conf 78747999850405999999997898899847987533762898565431466352788999997099989997356603 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHH--------CCCCCCCHHH-HHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 234322344444444115999985--------5998140999-9999974996999518985479999779987824668 Q gi|254780777|r 171 YSADPRVHASSTRFDSLTYNQFIE--------KGLKVMDCAS-VVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 171 yt~dP~~~~~ak~i~~is~~e~l~--------~G~~v~~~~A-~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) |-.-.+ |+.+.+++++.+|+.. .|+|.=|..| ++..++.|-+++|.+ .+.+..+|+|+ .||.|+. T Consensus 242 ~~n~gk--p~q~~L~~vt~~e~~~y~~eg~F~~GSM~PKV~Aai~Fve~gg~~aiIts---l~~~~~al~G~-aGT~Ivk 315 (316) T PRK12352 242 CIHFGK--PQQQALDRVDIATMTRYMQEGHFPPGSMLPKIIASLTFLEQGGKEVIITT---PECLPAALRGE-TGTHIIK 315 (316) T ss_pred EEECCC--CCHHHHHHCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEECC---HHHHHHHHCCC-CCCEEEE T ss_conf 540899--65113530899999999876999888837999999999994798699897---89999996799-9867940 No 72 >PRK12354 carbamate kinase; Reviewed Probab=99.57 E-value=1.3e-13 Score=103.52 Aligned_cols=228 Identities=22% Similarity=0.205 Sum_probs=133.0 Q ss_pred EEEEEEEEEEEECCC-CCCC--CHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHH---HHCCCCCCCCHHH--HHHH Q ss_conf 999994003685388-7770--899999999999999976990999823654102334---4203777754024--6788 Q gi|254780777|r 8 RVLLKVSGEALAGIS-GFGV--DIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQ---VVAENYLLCERST--VDSM 79 (242) Q Consensus 8 rIViKiGGSsL~~~~-~~~~--~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~---~~~~~~~~~~~~~--~~~~ 79 (242) ||||-+||++|..+. +.+. -.+.++.-+++|+++.+ |++++|.||-|-...... .........+... +.+. T Consensus 2 riViALGGNAll~~g~~~t~~~Q~~~v~~aa~~ia~l~~-g~~vvitHGNGPQVG~i~l~~~~~~~~p~~PLd~~~A~SQ 80 (302) T PRK12354 2 RIVIALGGNALLRRGEPMTAENQRANIRIAAEQIAKIAP-GNEVVIVHGNGPQVGLLALQNAAYKAVPPYPLDVLGAETE 80 (302) T ss_pred EEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHC-CCEEEEEECCCHHHHHHHHHHHHCCCCCCCCHHHHHHHCC T ss_conf 499982735317999999999999999999999999975-9959998278168899999877416899988267645211 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC-----CCCC-----------------HHHHHHHHH---------HHH-------- Q ss_conf 8889899999999999761886411-----3400-----------------100245665---------432-------- Q gi|254780777|r 80 GMLSTVINALALDLALRKINVPTVI-----LSSI-----------------FMPQVCEVF---------SCR-------- 120 (242) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~-----------------~~~~~~~~~---------~~~-------- 120 (242) +..+.+++..+...........+.+ ...+ +........ ..| T Consensus 81 G~IGY~lqqaL~n~l~~~k~v~tvvTqv~VD~~DpAF~~PtKpIGpfy~~eeA~~l~~~~gw~~~edgrG~RRVVpSP~P 160 (302) T PRK12354 81 GMIGYMLEQELGNLLPFERPVATLLTQVEVDANDPAFANPTKPIGPVYSEEEAERLAAEKGWVIKPDGDKFRRVVPSPKP 160 (302) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEECCCCEEEEEECCCCC T ss_conf 69999999999987556998699986899889984534899860788699999999997498871158803686379998 Q ss_pred -------HHCCHHHHCCCCCCCCCCCCC--------------CCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCC Q ss_conf -------100001100111112465567--------------77621578888864164466514754351234322344 Q gi|254780777|r 121 -------NAVSYLSQGKVVIFSGGTGNA--------------FLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHA 179 (242) Q Consensus 121 -------~~~~~~~~~~~~~~~~~~~~~--------------~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~ 179 (242) ....+++.+.+++..+..+++ ..|-|..|+++|..++||.|+|+||||+||-.--+ | T Consensus 161 ~~IvE~~~Ik~L~~~g~iVIa~GGGGIPVv~~~~g~l~GveAVIDKD~aaa~LA~~l~AD~liILT~Vd~V~~n~gk--p 238 (302) T PRK12354 161 KRIVEIRAIKWLLEKGHIVICAGGGGIPVVYDADGKLHGVEAVIDKDLAAALLAEQLDADLLLILTDVDAVYLDWGK--P 238 (302) T ss_pred CEEECHHHHHHHHHCCCEEEEECCCCCCEEECCCCCEECCEEEECCCHHHHHHHHHHCCCEEEEEECCCHHHHCCCC--C T ss_conf 55564999999997899899847886013986899686504566340788999987099989998565422024899--6 Q ss_pred CCCCCCCCCHHHHHHC----CCCCCCHHH-HHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECCC Q ss_conf 4444441159999855----998140999-99999749969995189854799997799878246689 Q gi|254780777|r 180 SSTRFDSLTYNQFIEK----GLKVMDCAS-VVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISGE 242 (242) Q Consensus 180 ~ak~i~~is~~e~l~~----G~~v~~~~A-~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~~ 242 (242) +.+.++++|.+|+... |+|.=|..| ++.+...|=..+|. .++.+..+|+|+ .||.|+|| T Consensus 239 ~q~~l~~~t~~el~~~~F~~GSM~PKVeAai~Fv~~~g~~aiIt---sl~~~~~al~G~-~GT~I~p~ 302 (302) T PRK12354 239 TQRPLAQATPDELRELGFAAGSMGPKVEAACEFVEATGGIAGIG---SLADILAILAGE-AGTRISPE 302 (302) T ss_pred CHHHHHCCCHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCEEEEC---CHHHHHHHHCCC-CCCEECCC T ss_conf 60023148899997379998883699999999998279879989---799999996799-99686889 No 73 >COG0549 ArcC Carbamate kinase [Amino acid transport and metabolism] Probab=99.55 E-value=2.1e-13 Score=102.31 Aligned_cols=228 Identities=22% Similarity=0.284 Sum_probs=139.0 Q ss_pred EEEEEEEEEEEEECCCC---CCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHH--C----CCCCCCCHHHHH Q ss_conf 49999940036853887---77089999999999999997699099982365410233442--0----377775402467 Q gi|254780777|r 7 KRVLLKVSGEALAGISG---FGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVV--A----ENYLLCERSTVD 77 (242) Q Consensus 7 KrIViKiGGSsL~~~~~---~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~--~----~~~~~~~~~~~~ 77 (242) +||||-+||++|....+ ...-.+.+..-+++|+.+.++|++++|.||-|-.......- . ...+..+...+. T Consensus 1 ~~iVvALGGNAll~~g~~~tae~Q~~~v~~ta~~i~~l~~~g~e~VitHGNGPQVG~l~lq~~aa~~~~~~p~~PLd~~~ 80 (312) T COG0549 1 KRIVVALGGNALLQRGEPLTAEAQYEAVKITAEQIADLIASGYEVVITHGNGPQVGLLLLQNEAADSEKGVPAYPLDVLV 80 (312) T ss_pred CEEEEEECCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHH T ss_conf 91899935345347999989899999999999999999855885999718975677999876653254578886468886 Q ss_pred H--HHHHHHHHHHHHHHHHHHHHCCC---CCCCCC-----------------------CHHHHHHHH------------- Q ss_conf 8--88889899999999999761886---411340-----------------------010024566------------- Q gi|254780777|r 78 S--MGMLSTVINALALDLALRKINVP---TVILSS-----------------------IFMPQVCEV------------- 116 (242) Q Consensus 78 ~--~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~-----------------------~~~~~~~~~------------- 116 (242) + .+..+.+ +...+...+...+.. ..+.+. .+....... T Consensus 81 AmsQG~IGy~-l~qal~n~l~~~~~~~~v~tvvTqv~VD~nDPAF~nPtKpIGpfY~~eea~~l~~~~gw~~keD~~rG~ 159 (312) T COG0549 81 AMSQGMIGYM-LQQALRNELPRRGLEKPVVTVVTQVEVDANDPAFLNPTKPIGPFYSEEEAEELAKEYGWVFKEDAGRGY 159 (312) T ss_pred HHHHHHHHHH-HHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEECCCCCE T ss_conf 7301289999-999999776425877742699999997689844359998777876999999988663967865499873 Q ss_pred ------------HHHHHHCCHHHHCCCCCCCCCCCCC-------------CCCCHHHHHHHHHHHCHHHHHHCCCCCCCC Q ss_conf ------------5432100001100111112465567-------------776215788888641644665147543512 Q gi|254780777|r 117 ------------FSCRNAVSYLSQGKVVIFSGGTGNA-------------FLTTDSAAALRASEIGADVILKGTQVDGVY 171 (242) Q Consensus 117 ------------~~~~~~~~~~~~~~~~~~~~~~~~~-------------~~~~D~~a~~ia~~~~ad~l~i~tdvdGiy 171 (242) .-......+++.+.+++..+..+++ ..|-|..++++|..++||.|+|+||||++| T Consensus 160 RRVVpSP~P~~IvE~~~Ik~L~~~g~vVI~~GGGGIPVv~~~~~~~GVeAVIDKDlasalLA~~i~AD~liILTdVd~Vy 239 (312) T COG0549 160 RRVVPSPKPVRIVEAEAIKALLESGHVVIAAGGGGIPVVEEGAGLQGVEAVIDKDLASALLAEQIDADLLIILTDVDAVY 239 (312) T ss_pred EEECCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCCCEEECCCCCCEEEEEECCHHHHHHHHHHHCCCEEEEEECCCHHE T ss_conf 68149999712031788999986798899957887444861897310368981187789999871589899985465111 Q ss_pred CCCCCCCCCCCCCCCCCHHHHHH--------CCCCCCCHH-HHHHHHHCCCCEEEEECCCCCHHHHHHCCCCCCCEECC Q ss_conf 34322344444444115999985--------599814099-99999974996999518985479999779987824668 Q gi|254780777|r 172 SADPRVHASSTRFDSLTYNQFIE--------KGLKVMDCA-SVVLARDCSIPIIVFSIHSPGGIWGGLSGIGRSTIISG 241 (242) Q Consensus 172 t~dP~~~~~ak~i~~is~~e~l~--------~G~~v~~~~-A~~~a~~~gI~v~I~ng~~~~~i~~~l~Ge~~GT~I~~ 241 (242) -.=. -|+.+.+++++.+|+-. .|+|.=|+. |+.++++-|=+++|.+ .+++..+|+|+ .||.|.+ T Consensus 240 ~n~g--kp~q~~L~~v~~~e~~~yl~eg~Fa~GSM~PKVeAai~FV~~~gk~A~Its---Le~~~~~l~g~-~GT~I~~ 312 (312) T COG0549 240 VNFG--KPNQQALDRVTVDEMEKYLAEGQFAAGSMGPKVEAAISFVENTGKPAIITS---LENAEAALEGK-AGTVIVP 312 (312) T ss_pred ECCC--CCCCHHHCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCEEECC---HHHHHHHHCCC-CCCEECC T ss_conf 1279--955112224699999999864798888731889999999984798369775---98889986467-8858519 No 74 >TIGR00746 arcC carbamate kinase; InterPro: IPR003964 The arginine dihydrolase (AD) pathway is found in many prokaryotes and some primitive eukaryotes. The three- enzyme anaerobic pathway breaks down L-arginine to form 1 mol of ATP, carbon dioxide and ammonia. In simpler bacteria, the first enzyme, arginine deiminase, can account for up to 100f total cell protein . Carbamate kinase is involved in the last step of the AD pathway, converting carbamoyl phosphate and ADP into ammonia, carbon dioxide and ATP . The second step of the pathway involves the degradation of L-citrulline to carbamoyl phosphate and L-ornithine, using ornithine carbamoyltransferase . The crystal structure of Enterococcus faecium carbamate kinase has been determined to 2.8A resolution . The enzyme exists as a homodimer of two 33kDa subunits. The hallmark of the dimer is a 16-stranded beta-sheet, surrounded by alpha-helices. Each subunit contains an active site within a large crevice.; GO: 0008804 carbamate kinase activity, 0006525 arginine metabolic process. Probab=99.47 E-value=5.4e-13 Score=99.76 Aligned_cols=229 Identities=20% Similarity=0.215 Sum_probs=142.7 Q ss_pred EEEEEEEEEEEEEC-CCCCCC--CHHHHHHHHHHHHHHHHCCCEEEEEECCC-CCHHHHH----HHC---CCCCCCCHHH Q ss_conf 49999940036853-887770--89999999999999997699099982365-4102334----420---3777754024 Q gi|254780777|r 7 KRVLLKVSGEALAG-ISGFGV--DIDSVNRICADIAEVYAKGIEIGIVVGGG-NIFRGSQ----VVA---ENYLLCERST 75 (242) Q Consensus 7 KrIViKiGGSsL~~-~~~~~~--~~~~i~~ia~~I~~~~~~g~~vviV~sg~-~~~~~~~----~~~---~~~~~~~~~~ 75 (242) |||||=|||+.|-. .+.-+. =.+.++.=+.+|+++.++|+++||+||=| ..+.+.. ..+ ....+++-.- T Consensus 1 k~~VvALGGNAlLqRg~~~s~~~Q~~nv~~t~~~l~~~~~~G~~lvi~HGNGPQVG~LlLQ~~A~~~~~~~~~~PLDV~G 80 (321) T TIGR00746 1 KRVVVALGGNALLQRGEKGSAENQRKNVEQTAPQLVKLIKRGYELVITHGNGPQVGLLLLQNQAADSEKVVPAMPLDVLG 80 (321) T ss_pred CEEEEEECCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCCCCCCCEEC T ss_conf 94889737358863788379889999999999999999861897999847803688899999897603788988743202 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCC-----------------------CCCHHHHHH-----H------------ Q ss_conf 678888898999999999997618864113-----------------------400100245-----6------------ Q gi|254780777|r 76 VDSMGMLSTVINALALDLALRKINVPTVIL-----------------------SSIFMPQVC-----E------------ 115 (242) Q Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------------------~~~~~~~~~-----~------------ 115 (242) +.+.+.-+-++.. .+...+...|....+. +.++..... + T Consensus 81 AmSQG~IGYml~q-~l~~~L~~~g~~~~vaT~~TQt~VD~~DpAF~nPTKpiGPfY~e~EAk~~~a~~~g~~~ke~tdGD 159 (321) T TIGR00746 81 AMSQGMIGYMLQQ-ALNNELPKEGLEKPVATVLTQTIVDPKDPAFQNPTKPIGPFYDEEEAKKRLAAEKGWIVKEDTDGD 159 (321) T ss_pred CCCCCHHHHHHHH-HHHHHHHHCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEECCCCCCC T ss_conf 0335307899999-999999866899985389998888888888785497516888989999999861787004788876 Q ss_pred ----------------HHHHHHHCCHHHHCCCCCCCCCCCCC---------------CCCCHHHHHHHHHHHCHHHHHHC Q ss_conf ----------------65432100001100111112465567---------------77621578888864164466514 Q gi|254780777|r 116 ----------------VFSCRNAVSYLSQGKVVIFSGGTGNA---------------FLTTDSAAALRASEIGADVILKG 164 (242) Q Consensus 116 ----------------~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~D~~a~~ia~~~~ad~l~i~ 164 (242) ..-......+++.+.++++.+..|+| .+|=|.-+.+||..++||.|+|+ T Consensus 160 ~g~gwRrVVPSP~P~~I~E~~~Ik~L~e~g~iVI~~GGGGvPv~~dg~~kkl~GVeAVIDKDlA~~~LA~~~~AD~L~IL 239 (321) T TIGR00746 160 DGRGWRRVVPSPRPKDIVEAEVIKTLVENGVIVICSGGGGVPVVEDGAEKKLKGVEAVIDKDLASEKLAEEVNADILVIL 239 (321) T ss_pred CCCCCEEECCCCCCCCCEECHHHHHHHHCCEEEEEECCCCEEEEECCCCCEEEEEEEEECHHHHHHHHHHHHCCCEEEEE T ss_conf 77830376579999663120778998648827998578820356628943022011032366888988877438744310 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCHHHHHHC--------CCCCCCHH-HHHHHHHCCC-CEEEEECCCCCHHHHHHCCCC Q ss_conf 7543512343223444444441159999855--------99814099-9999997499-699951898547999977998 Q gi|254780777|r 165 TQVDGVYSADPRVHASSTRFDSLTYNQFIEK--------GLKVMDCA-SVVLARDCSI-PIIVFSIHSPGGIWGGLSGIG 234 (242) Q Consensus 165 tdvdGiyt~dP~~~~~ak~i~~is~~e~l~~--------G~~v~~~~-A~~~a~~~gI-~v~I~ng~~~~~i~~~l~Ge~ 234 (242) ||||.+|-.=- + |+-|.+.+++.+|+..+ |||.=|++ |++++...|= +.+|. .+.++..+|.|+- T Consensus 240 TDVd~vy~nyg-k-P~e~~L~~~~~~El~~~~~~G~Fa~GSMgPKV~AaI~Fv~~~G~~~AiIt---~L~~~~~~l~g~i 314 (321) T TIGR00746 240 TDVDAVYVNYG-K-PDEKKLREVTVEELEDYEKDGHFAAGSMGPKVEAAIEFVESRGKKRAIIT---SLETAVEALEGKI 314 (321) T ss_pred HHHHHHHHCCC-C-HHHHHHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCEEC---CHHHHHHHHCCCE T ss_conf 20124522077-8-24788724698899999855886688986038999899871888740005---8589998646973 Q ss_pred CCCEECC Q ss_conf 7824668 Q gi|254780777|r 235 RSTIISG 241 (242) Q Consensus 235 ~GT~I~~ 241 (242) .||.|++ T Consensus 315 aGT~v~~ 321 (321) T TIGR00746 315 AGTRVSK 321 (321) T ss_pred EEEEEEC T ss_conf 2214509 No 75 >cd04236 AAK_NAGS-Urea AAK_NAGS-Urea: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the urea cycle found in animals. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate; NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Ureogenic NAGS activity is dependent on the concentration of glutamate (substrate) and arginine (activator). Domain architecture of ureogenic NAGS consists of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal DUF619 domain. Members of this CD belong to the protein superfamily, the Amino Acid Kinase Family (AAKF). Probab=99.45 E-value=6e-12 Score=93.27 Aligned_cols=209 Identities=11% Similarity=0.014 Sum_probs=125.5 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 82499999400368538877708999999999999999769909998236541023344203777754024678888898 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLST 84 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 84 (242) .++=+|||+||++|.+ .+.++.++..++-|++-|-..+||||||... ..... ....... .... T Consensus 34 ~~~faVIKvGGaVl~d-------~~~l~~l~~~lsfL~~vGl~pIVVHG~g~~i------~~~~~---~~~~~~~-~~v~ 96 (271) T cd04236 34 WPAFAVLEVDHSVFRS-------LEMVQSLSFGLAFLQRMDMKLLVVMGLSAPD------GTNMS---DLELQAA-RSRL 96 (271) T ss_pred CCCEEEEEECHHHEEC-------HHHHHHHHHHHHHHHHCCCEEEEEECCCHHC------CCCCC---CHHHHHH-HHHH T ss_conf 7876999988131505-------8899999999999998899489993897011------55746---0122567-6663 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCHHHHHH-H-----------HHHHHHHCCHHHHCCCCCCCCCC-----CCCCCCCHH Q ss_conf 999999999997618864113400100245-6-----------65432100001100111112465-----567776215 Q gi|254780777|r 85 VINALALDLALRKINVPTVILSSIFMPQVC-E-----------VFSCRNAVSYLSQGKVVIFSGGT-----GNAFLTTDS 147 (242) Q Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-----------~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~D~ 147 (242) ......+-.++...|......+........ + ..+.......++.+.+|+..... ..-....|+ T Consensus 97 ~~~n~~lV~aL~~~G~~A~~i~s~~~~~~~~~~e~~~~vG~~~~v~~~~i~~~i~~g~IPVIaplG~~~~Gq~~NINADt 176 (271) T cd04236 97 VKDCKTLVEALQANSAAAHPLFSGESVLQAEEPEPGASKGPSVSVDTELLQWCLGSGHIPLVCPIGETSSGRSVSLDSSE 176 (271) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCEEEECCCCCCCCCCCCEEEEHHHHHHHHHCCCCCEECCCEECCCCCEEEECHHH T ss_conf 79999999999976996245416873377225301376687138607999999978997188252378999888775899 Q ss_pred HHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH----C----CCCCCCHHHHHHHH---HCCCCEE Q ss_conf 78888864164466514754351234322344444444115999985----5----99814099999999---7499699 Q gi|254780777|r 148 AAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE----K----GLKVMDCASVVLAR---DCSIPII 216 (242) Q Consensus 148 ~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~----~----G~~v~~~~A~~~a~---~~gI~v~ 216 (242) .|..+|..+.|+.++++||++|+++.+ .++|++++..+.++ . +++..+...++.+. .++..+. T Consensus 177 aA~aiA~aL~a~Kli~LTdv~Gvld~~------~~lI~~i~l~~~~~~l~~~~~i~~~~~~~i~~i~~~l~~l~~~~s~v 250 (271) T cd04236 177 VTTAIAKALQPIKVIFLNRSGGLRDQK------HKVLPQVHLPADLPSLSDAEWLSETEQNRIQDIATLLNALPSMSSAV 250 (271) T ss_pred HHHHHHHHCCCCEEEEEECCCCEECCC------CCEECEECHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCEEE T ss_conf 999999864876799980787407789------98822016188999987076512352674778999997177873699 Q ss_pred EEECCCCC-HHHHHHCCCCCCCEE Q ss_conf 95189854-799997799878246 Q gi|254780777|r 217 VFSIHSPG-GIWGGLSGIGRSTII 239 (242) Q Consensus 217 I~ng~~~~-~i~~~l~Ge~~GT~I 239 (242) |.+ |. -+..++.-+..||++ T Consensus 251 i~~---phsll~ELFTd~G~GTLf 271 (271) T cd04236 251 ITS---AETLLTELFSHKGSGTLF 271 (271) T ss_pred EEC---HHHHHHHHHCCCCCCCCC T ss_conf 837---568789986389877788 No 76 >PRK04531 acetylglutamate kinase; Provisional Probab=99.24 E-value=2.7e-10 Score=83.06 Aligned_cols=217 Identities=16% Similarity=0.169 Sum_probs=120.9 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHH-HC------CCCCCCCHHHHH Q ss_conf 82499999400368538877708999999999999999769909998236541023344-20------377775402467 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQV-VA------ENYLLCERSTVD 77 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~-~~------~~~~~~~~~~~~ 77 (242) .+|-.|||+||+++.+ .++++|..++-|+.-|...|||||||........ .+ .+...-+..... T Consensus 19 ~~~fAVIKvGGavi~e---------~l~~la~dLafL~~vGl~PVVVHGgGpqI~~~L~~~gIe~~fi~G~RVTd~etl~ 89 (421) T PRK04531 19 PERFAVIKVGGAVLRE---------DLEALTSSLKFLQEVGLTPVVVHGAGPQLDAELSAAGIEKDTVNGLRVTSPEALA 89 (421) T ss_pred CCEEEEEEECHHHHHH---------HHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHH T ss_conf 7618999988799997---------9999999999999879979999389877879999759997677963058888999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHH------------HHHHHHHHHCCHHHHCCCCCCCCCCC-----C Q ss_conf 888889899999999999761886411340010024------------56654321000011001111124655-----6 Q gi|254780777|r 78 SMGMLSTVINALALDLALRKINVPTVILSSIFMPQV------------CEVFSCRNAVSYLSQGKVVIFSGGTG-----N 140 (242) Q Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~~-----~ 140 (242) -........ ...+-..+...|......+....... ....+.......++.+.+|++..... . T Consensus 90 vv~~vl~~~-n~~lV~~L~~~G~~A~~i~g~~~~a~~~d~~~~G~VG~V~~Vd~~~I~~~L~~g~IPVIapig~~~~Gq~ 168 (421) T PRK04531 90 IVRKVFQAS-NLKLVEALQQNGARATSITGGVFEAEYLDRDTYGLVGEVKAVLLAAVRSSLRAGKIPVIASLGETPSGQI 168 (421) T ss_pred HHHHHHHHH-HHHHHHHHHHCCCCEEECCCCCEECCCCCCCCCCCCCCEEEECHHHHHHHHHCCCEEEECCCEECCCCCE T ss_conf 999999765-2999999996799757346654350336666676433347786899999997898269815315899978 Q ss_pred CCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH--------CCCCCCCHHHHHHHHHC- Q ss_conf 777621578888864164466514754351234322344444444115999985--------59981409999999974- Q gi|254780777|r 141 AFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE--------KGLKVMDCASVVLARDC- 211 (242) Q Consensus 141 ~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~--------~G~~v~~~~A~~~a~~~- 211 (242) -....|..|+.+|.++.|+.++++|+..|+++.+ .++|..|+.++..+ ..++-+|++-++...++ T Consensus 169 lNINAD~aA~aLA~aL~a~KlIfLt~~gGi~d~~------gk~Is~InL~~eyd~lm~~~w~~~g~klkl~~ik~LLd~L 242 (421) T PRK04531 169 LNINADVAANELVSILQPYKIIFLTGTGGLLDAD------GNKISSINLSTEFDQLMQQPSINGGMRLKLEQIKDLLDRL 242 (421) T ss_pred EEECHHHHHHHHHHHCCCCEEEEECCCCCEECCC------CCEEEEEECCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCC T ss_conf 8405999999999862864699971678606789------9886357544035545205103566899899999998519 Q ss_pred --CCCEEEEECCCCCHHHHHH-CCCCCCCEEC Q ss_conf --9969995189854799997-7998782466 Q gi|254780777|r 212 --SIPIIVFSIHSPGGIWGGL-SGIGRSTIIS 240 (242) Q Consensus 212 --gI~v~I~ng~~~~~i~~~l-~Ge~~GT~I~ 240 (242) .-.|.|. .|.+|.+=| .-..-||+|. T Consensus 243 P~~SSV~It---~p~~L~kELFT~~GsGTLir 271 (421) T PRK04531 243 PLESSVSIT---SPSDLAKELFTHKGSGTLVR 271 (421) T ss_pred CCCCEEEEE---CHHHHHHHHCCCCCCCEEEE T ss_conf 986607870---89998887626799960486 No 77 >TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase; InterPro: IPR010167 This entry represents a clade of amino-acid N-acetyltransferases, or N-acetylglutamate synthase, which is the product of the argA gene and the first enzyme in arginine biosynthesis. This enzyme displays more diversity between bacteria, fungi and mammals than other enzymes in arginine metabolism, and N-acetylglutamate itself can have different roles in different taxonomic groups: in prokaryotes, lower eukaryotes and plants it is the first intermediate in arginine biosynthesis, while in vertebrates it is an allosteric cofactor for the first enzyme in the urea cycle . In bacteria, arginine can regulate ornithine biosynthesis via a feedback inhibition mechanism . This enzyme may also act on aspartate.; GO: 0004042 amino-acid N-acetyltransferase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=98.86 E-value=2.7e-07 Score=64.61 Aligned_cols=221 Identities=16% Similarity=0.172 Sum_probs=132.1 Q ss_pred EEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHH-------HHCCCCCCCCHHHHHHH Q ss_conf 49999940036853887770899999999999999976990999823654102334-------42037777540246788 Q gi|254780777|r 7 KRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQ-------VVAENYLLCERSTVDSM 79 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~-------~~~~~~~~~~~~~~~~~ 79 (242) |+.|+=|||..+.+++ +.+++..|+=|+.-|.++|+|||+-.+..... ...++...-|...+... T Consensus 18 KT~Vvgl~g~~v~~~~--------l~~~v~Di~LLh~LGv~lVLvHGaRPq~~~~la~~~~~p~Y~~G~RvTD~asLe~~ 89 (439) T TIGR01890 18 KTFVVGLGGELVEDDN--------LGNIVADIALLHSLGVRLVLVHGARPQIEERLAARGRTPHYHRGLRVTDEASLELV 89 (439) T ss_pred CEEEEECCCCCCCCCC--------HHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCEEECCHHHHHHH T ss_conf 6699842840014444--------24599999999758918999708852689999962897870322571356567789 Q ss_pred HHHHHHHHHHHHHHHHHHH-CCCC-----CC-----CCCCHHHHH----------HHHHHHHHHCCHHHHCCCCCCC--- Q ss_conf 8889899999999999761-8864-----11-----340010024----------5665432100001100111112--- Q gi|254780777|r 80 GMLSTVINALALDLALRKI-NVPT-----VI-----LSSIFMPQV----------CEVFSCRNAVSYLSQGKVVIFS--- 135 (242) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~-~~~~-----~~-----~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~--- 135 (242) ...+-.+....-+...... +.|- .+ .+.-+.... .+..........+....++... T Consensus 90 ~~aaG~lr~~ieA~Ls~~~~~~pmaGsr~~vvSGNfvtArPiGv~~GvDy~h~G~vRk~D~~~i~~~Ld~g~IVLL~PLG 169 (439) T TIGR01890 90 KQAAGELRLAIEARLSMSLSNTPMAGSRLSVVSGNFVTARPIGVIEGVDYEHTGVVRKIDTEGIRRLLDAGSIVLLSPLG 169 (439) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCEEEECCCEEECCCCCEECCCEEHHHHHHHHHHHCCCCEEECCCCC T ss_conf 99876688888888752124678888531356243332033101058662032745110088898652078767517654 Q ss_pred --CCCCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHH----CCCCC---------- Q ss_conf --46556777621578888864164466514754351234322344444444115999985----59981---------- Q gi|254780777|r 136 --GGTGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLTYNQFIE----KGLKV---------- 199 (242) Q Consensus 136 --~~~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is~~e~l~----~G~~v---------- 199 (242) ...+.+.-+....|+.+|..++||.|+++|..+|+.+ ++.+++.++++.|+.. ++..- T Consensus 170 ~S~tG~~FnL~~e~vAt~~A~~L~AdKL~~~t~~~Gi~d------~~G~l~~eL~pq~~~~~~~~l~~~~aPdde~~~~~ 243 (439) T TIGR01890 170 YSPTGEIFNLDMEDVATSVAIELKADKLIYFTEEPGILD------ADGKLVAELSPQEVESLVERLGEETAPDDELSADT 243 (439) T ss_pred CCCCCHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCC------CCCHHHHHCCHHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 366400004406889999998625540454058887246------56524232487899999998505789733322558 Q ss_pred CC-HHHHHHHHHCCC-CEEEEECCCCCHHHH-HHCCCCCCCEECC Q ss_conf 40-999999997499-699951898547999-9779987824668 Q gi|254780777|r 200 MD-CASVVLARDCSI-PIIVFSIHSPGGIWG-GLSGIGRSTIISG 241 (242) Q Consensus 200 ~~-~~A~~~a~~~gI-~v~I~ng~~~~~i~~-~l~Ge~~GT~I~~ 241 (242) +. +.+|--|.+-|+ .++|.|+...|.|.. .+.=..+||.|+. T Consensus 244 ~~lL~~a~~A~~GGV~R~hlv~ya~DGsLL~ELF~~~G~GT~v~~ 288 (439) T TIGR01890 244 ARLLSAAVKACRGGVRRSHLVSYAEDGSLLQELFTRDGIGTLVSK 288 (439) T ss_pred HHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHCCCCCHHHHH T ss_conf 999999999986797514422623225899999861893037888 No 78 >TIGR01689 EcbF-BcbF capsule biosynthesis phosphatase; InterPro: IPR010039 This entry describes a small family of highly conserved proteins (>60 0dentity). Two of these, BcbF and EcbF of Pasteurella multocida are believed to be part of the capsule polysaccharide biosynthesis machinery because they are co-transcribed from a locus devoted to that purpose . In P. multocida there are six different variant capsules (A-F), and these proteins are found only in B and E. The other two species in which this gene is (currently) found are both also pathogenic. These proteins are also members of the IIIC subfamily of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. All of the characterised enzymes within subfamily III are phosphatases, and all of the active site residues characteristic of HAD-superfamily phosphatases are present in this subfamily.. Probab=91.64 E-value=0.4 Score=26.51 Aligned_cols=48 Identities=17% Similarity=0.311 Sum_probs=38.3 Q ss_pred CEEEEEEEEEEEEECCC-CCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 24999994003685388-77708999999999999999769909998236 Q gi|254780777|r 6 YKRVLLKVSGEALAGIS-GFGVDIDSVNRICADIAEVYAKGIEIGIVVGG 54 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~-~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg 54 (242) |||+|+-+=++.=..++ .| .+..=+..+.+++.+.++.|.+++|.+|- T Consensus 1 MK~lv~DLD~TIT~~~~~DY-~N~~P~~~Vi~~L~~YK~lGF~i~IsssR 49 (126) T TIGR01689 1 MKRLVMDLDNTITLTENGDY-ANVKPIKAVIEKLREYKALGFEIVISSSR 49 (126) T ss_pred CCEEEEECCCCEEECCCCCC-CCCCCHHHHHHHHHHHHHCCCEEEEECCC T ss_conf 94478743783631257873-23550689999999887468789886565 No 79 >TIGR00705 SppA_67K signal peptide peptidase SppA, 67K type; InterPro: IPR004634 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidases belong to MEROPS peptidase family S49 (protease IV family, clan S-). The predicted active site serine for members of this family occurs in a transmembrane domain. Signal peptides of secretory proteins seem to serve at least two important biological functions. First, they are required for protein targeting to and translocation across membranes, such as the eubacterial plasma membrane and the endoplasmic reticular membrane of eukaryotes. Second, in addition to their role as determinants for protein targeting and translocation, certain signal peptides have a signalling function. During or shortly after pre-protein translocation, the signal peptide is removed by signal peptidases. The integral membrane protein, SppA (protease IV), of Escherichia coli was shown experimentally to degrade signal peptides. The member of this family from Bacillus subtilis has only been shown to be required for efficient processing of pre-proteins under conditions of hyper-secretion . These enzymes have a molecular mass around 67 kDa and a duplication such that the N-terminal half shares extensive homology with the C-terminal half and was shown in E. coli to form homotetramers. E. coli SohB, which is most closely homologous to the C-terminal duplication of SppA, is predicted to perform a similar function of small peptide degradation, but in the periplasm. Many prokaryotes have a single SppA/SohB homolog that may perform the function of either or both. ; GO: 0009003 signal peptidase activity, 0006465 signal peptide processing, 0016021 integral to membrane. Probab=86.93 E-value=1.4 Score=23.14 Aligned_cols=38 Identities=18% Similarity=0.257 Sum_probs=25.5 Q ss_pred EEEEECCCCC-CCC-HHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 0368538877-708-9999999999999997699099982 Q gi|254780777|r 15 GEALAGISGF-GVD-IDSVNRICADIAEVYAKGIEIGIVV 52 (242) Q Consensus 15 GSsL~~~~~~-~~~-~~~i~~ia~~I~~~~~~g~~vviV~ 52 (242) .+.+.|-+.+ +.| ...+..+-+.++++++.|+.|+.+. T Consensus 100 ~~~~~DL~~~~g~dG~~~l~~~~~AL~~Fk~SgKpv~ay~ 139 (614) T TIGR00705 100 EGLVLDLKNFEGVDGSPALSEIGKALSEFKDSGKPVYAYG 139 (614) T ss_pred EEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEE T ss_conf 8999733323576677327988999999985699089985 No 80 >PRK09484 3-deoxy-D-manno-octulosonate 8-phosphate phosphatase; Provisional Probab=85.30 E-value=0.93 Score=24.27 Aligned_cols=18 Identities=33% Similarity=0.599 Sum_probs=11.6 Q ss_pred HHHHHHCCCEEEEEECCC Q ss_conf 999997699099982365 Q gi|254780777|r 38 IAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 38 I~~~~~~g~~vviV~sg~ 55 (242) |..+++.|.++.++||.. T Consensus 60 i~~l~~~Gi~v~iISg~~ 77 (186) T PRK09484 60 IRCLLTSGIEVAIITGRK 77 (186) T ss_pred HHHHHHCCCEEEEEECCC T ss_conf 999998799699994788 No 81 >PRK00192 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=83.79 E-value=2.6 Score=21.49 Aligned_cols=47 Identities=17% Similarity=0.204 Sum_probs=35.1 Q ss_pred CCCCCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 968882499999400368538877708999999999999999769909998236 Q gi|254780777|r 1 MSDFPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGG 54 (242) Q Consensus 1 m~~~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg 54 (242) |-+|++|-|+.-+=|+.|.++ +.+. ..-+.|.+++++|+.++++||- T Consensus 1 ~~~M~~~LI~tDlDGTLL~~~--~~~~-----~~~~ai~~l~~~Gi~vviaTGR 47 (275) T PRK00192 1 MIDMSKLLVFTDLDGTLLDHT--YDWE-----PAKPTLKALKEKGIPVIPCTSK 47 (275) T ss_pred CCCCCCEEEEEECCCCCCCCC--CCCC-----HHHHHHHHHHHCCCEEEEECCC T ss_conf 988884699994857784899--9836-----8999999999789999996898 No 82 >PRK10530 phosphotransferase; Provisional Probab=80.86 E-value=3.4 Score=20.78 Aligned_cols=46 Identities=22% Similarity=0.309 Sum_probs=34.9 Q ss_pred CCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 8824999994003685388777089999999999999997699099982365 Q gi|254780777|r 4 FPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 4 ~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) |+||-|+.=+=|+.|.++.. +.... .+.|.+++++|+++++.+|-. T Consensus 1 M~~KlIa~DlDGTLl~~~~~--i~~~~----~~ai~~l~~~Gi~~~iaTGR~ 46 (272) T PRK10530 1 MTYRVIALDLDGTLLTPKKT--ILPSS----LEALARAREAGYQLIIVTGRH 46 (272) T ss_pred CCCCEEEEECCCCCCCCCCC--CCHHH----HHHHHHHHHCCCEEEEECCCC T ss_conf 99529999087363589895--99999----999999997899999995998 No 83 >KOG2436 consensus Probab=73.44 E-value=6.9 Score=18.92 Aligned_cols=162 Identities=15% Similarity=0.114 Sum_probs=76.2 Q ss_pred EEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCC-------CCCCHHHHHHH Q ss_conf 49999940036853887770899999999999999976990999823654102334420377-------77540246788 Q gi|254780777|r 7 KRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENY-------LLCERSTVDSM 79 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~-------~~~~~~~~~~~ 79 (242) |-+|+|-|+.. .+ +....+++.++-++..|...++|++.+...-.+....... ...+...++.. T Consensus 96 ~fvV~~~g~~~-~t--------~~~~sl~s~lafl~h~gl~pIvv~g~~~qin~~l~~~~ie~~y~~~~RvTda~t~q~~ 166 (520) T KOG2436 96 KFVVIKSGEAI-ST--------SLLHSLASDLAFLHHVGLRPIVVPGTQPQINRLLAERGIEPEYVDGYRVTDAHTLQAA 166 (520) T ss_pred EEEEEECCCCC-CC--------CHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCEECCCHHHHHHH T ss_conf 07999435435-56--------3588998777887507736888438627777899972888300363102618888776 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHH---HHHH---H--------------HHHHHHCCHHHHCCCCCC--CCC Q ss_conf 88898999999999997618864113400100---2456---6--------------543210000110011111--246 Q gi|254780777|r 80 GMLSTVINALALDLALRKINVPTVILSSIFMP---QVCE---V--------------FSCRNAVSYLSQGKVVIF--SGG 137 (242) Q Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~---~--------------~~~~~~~~~~~~~~~~~~--~~~ 137 (242) ....+.....-+...+..+|......+..... ...+ . ..+.........+.++.. .+. T Consensus 167 ~~~~~~E~n~~lv~nL~~~g~~ar~~s~g~~v~~~f~a~~~~v~d~~~y~~~gei~~vd~d~i~~l~~~G~mp~L~sla~ 246 (520) T KOG2436 167 KESVSLEANLNLVINLSQLGTRARPSSSGVRVGNFFPADRNGVLDGEDYGLVGEIKKVDVDRIRHLLDAGSMPLLRSLAA 246 (520) T ss_pred HHCCHHHHHHHHHHHHHHHHCEECCCCCCCCCCCEEECCCCCCCCCCEEEEECCCCEECHHHHHHHHHCCCCHHEHHHCC T ss_conf 53130021567999999754010331256644304423523455400022003300311524235652798221110113 Q ss_pred C--C-CCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCC Q ss_conf 5--5-67776215788888641644665147543512343223 Q gi|254780777|r 138 T--G-NAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRV 177 (242) Q Consensus 138 ~--~-~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~ 177 (242) + + .-....|..|.-+|..+.++.+++++|..-+...+|.. T Consensus 247 TaSGqvlnvNa~~~a~elA~~L~~~kli~l~d~g~~l~e~ge~ 289 (520) T KOG2436 247 TASGQVLNVNADEVAGELALALGPDKLILLMDKGRILKENGED 289 (520) T ss_pred CCCCCEEEEEHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCC T ss_conf 6766168751787766887435840068750455424557555 No 84 >PRK10513 sugar phosphatase; Provisional Probab=71.32 E-value=7.7 Score=18.62 Aligned_cols=46 Identities=20% Similarity=0.281 Sum_probs=34.6 Q ss_pred CCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 8824999994003685388777089999999999999997699099982365 Q gi|254780777|r 4 FPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 4 ~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) |++|-|+.=+=|+.|.++.. +.. +..+.|.+++++|+.+++.+|-. T Consensus 1 M~iKli~~DlDGTLl~~~~~--i~~----~~~~ai~~l~~~Gi~~~iaTGR~ 46 (270) T PRK10513 1 MAIKLIAIDMDGTLLLPDHT--ISP----AVKNAIAAARAKGVNVVLTTGRP 46 (270) T ss_pred CCCCEEEEECCCCCCCCCCC--CCH----HHHHHHHHHHHCCCEEEEECCCC T ss_conf 98429999366252589796--899----99999999997899999997998 No 85 >TIGR01524 ATPase-IIIB_Mg magnesium-translocating P-type ATPase; InterPro: IPR006415 This group describes the magnesium translocating P-type ATPase found in a limited number of bacterial species and best described in Salmonella typhimurium, which contains two isoforms . These transporters are active in low external Mg2+ concentrations and pump the ion into the cytoplasm. The magnesium ATPases have been classified as type IIIB by a phylogenetic analysis .; GO: 0015444 magnesium-importing ATPase activity, 0015693 magnesium ion transport, 0016021 integral to membrane. Probab=68.40 E-value=8.9 Score=18.23 Aligned_cols=122 Identities=13% Similarity=0.138 Sum_probs=60.6 Q ss_pred EEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCC-CHHHHHHHHHHHHHHHHH Q ss_conf 9940036853887770899999999999999976990999823654102334420377775-402467888889899999 Q gi|254780777|r 11 LKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLC-ERSTVDSMGMLSTVINAL 89 (242) Q Consensus 11 iKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 89 (242) +..||.+.+ ++....+++-+...+++++|.+|+.|.-= +...+...+.. ..++.+ + T Consensus 473 ~r~gg~vv~------L~e~~~~~l~~~t~~~NrqG~RV~avAtk------~l~~~e~~Pt~~~~DE~~-l---------- 529 (892) T TIGR01524 473 VREGGAVVA------LDESEKERLKDVTEELNRQGLRVVAVATK------KLDASEADPTYSKEDEEE-L---------- 529 (892) T ss_pred CCCCCCEEC------CCHHHHHHHHHHHHHHCCCCCEEEEEEEE------ECCCCCCCCCCCEECHHH-H---------- T ss_conf 011880303------76257899999998743278578767763------046788886641307778-8---------- Q ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHH--CHHHHHHCCCC Q ss_conf 9999997618864113400100245665432100001100111112465567776215788888641--64466514754 Q gi|254780777|r 90 ALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEI--GADVILKGTQV 167 (242) Q Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~--~ad~l~i~tdv 167 (242) ....+..+-.|..-. -......+...+..+.+ --|||.-.++.+=.++ +|..+++.+|| T Consensus 530 -iieG~l~FlDPpKes------------t~~AiaaL~~ngi~VKv------LTGDne~VtariC~EVGida~d~l~G~di 590 (892) T TIGR01524 530 -IIEGYLAFLDPPKES------------TKKAIAALKKNGIAVKV------LTGDNEVVTARICLEVGIDANDVLLGADI 590 (892) T ss_pred -HHHHHHHCCCCCHHH------------HHHHHHHHHHCCCEEEE------EECCCHHHHHHHCCCCCCCCCCEEECCCH T ss_conf -899776406880101------------79999999857978999------61886578732110045576775753234 Q ss_pred CCCCCCC Q ss_conf 3512343 Q gi|254780777|r 168 DGVYSAD 174 (242) Q Consensus 168 dGiyt~d 174 (242) +++=|+. T Consensus 591 E~lSD~e 597 (892) T TIGR01524 591 EELSDEE 597 (892) T ss_pred HHCCHHH T ss_conf 3226799 No 86 >COG1778 Low specificity phosphatase (HAD superfamily) [General function prediction only] Probab=65.74 E-value=4 Score=20.36 Aligned_cols=21 Identities=24% Similarity=0.460 Sum_probs=16.4 Q ss_pred HHHHHHHCCCEEEEEECCCCC Q ss_conf 999999769909998236541 Q gi|254780777|r 37 DIAEVYAKGIEIGIVVGGGNI 57 (242) Q Consensus 37 ~I~~~~~~g~~vviV~sg~~~ 57 (242) -|+.+.+.|.++.|.+|--+. T Consensus 43 Gik~l~~~Gi~vAIITGr~s~ 63 (170) T COG1778 43 GIKLLLKSGIKVAIITGRDSP 63 (170) T ss_pred HHHHHHHCCCEEEEEECCCCH T ss_conf 999999859928999678787 No 87 >TIGR00338 serB phosphoserine phosphatase SerB; InterPro: IPR004469 Phosphoserine phosphatase (SerB), (3.1.3.3 from EC), also known as O-phosphoserine phosphohydrolase, is involved in both serine and glycine biosynthesis. It catalyzes the reaction 3-phospho-serine + H2O = L-serine + phosphate, which is the last step in the biosynthesis of serine from carbohydrates. The reaction proceeds via the formation of a phosphoryl-enzyme intermediate. It acts as a homodimer, and requires magnesium as a cofactor.; GO: 0004647 phosphoserine phosphatase activity, 0006564 L-serine biosynthetic process. Probab=64.69 E-value=8.5 Score=18.35 Aligned_cols=26 Identities=19% Similarity=0.424 Sum_probs=19.5 Q ss_pred HHHHHHHHHHCCCEEEEEECCCCCHH Q ss_conf 99999999976990999823654102 Q gi|254780777|r 34 ICADIAEVYAKGIEIGIVVGGGNIFR 59 (242) Q Consensus 34 ia~~I~~~~~~g~~vviV~sg~~~~~ 59 (242) +-+-|+.|+++|++|+++|||-..+. T Consensus 91 ~~Elv~~L~~~gykVav~SGGF~~~a 116 (223) T TIGR00338 91 AEELVKTLKEKGYKVAVISGGFDLFA 116 (223) T ss_pred HHHHHHHHHHCCEEEEEEECCHHHHH T ss_conf 78999999886907999857768889 No 88 >pfam03686 UPF0146 Uncharacterized protein family (UPF0146). The function of this family of proteins is unknown. Probab=55.58 E-value=11 Score=17.62 Aligned_cols=72 Identities=14% Similarity=0.098 Sum_probs=47.0 Q ss_pred CCCCCCCHHHHHHHHHHHCHHHHHHCCC-------------CCCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCCCCHHHH Q ss_conf 5677762157888886416446651475-------------435123432234444444411599998559981409999 Q gi|254780777|r 139 GNAFLTTDSAAALRASEIGADVILKGTQ-------------VDGVYSADPRVHASSTRFDSLTYNQFIEKGLKVMDCASV 205 (242) Q Consensus 139 ~~~~~~~D~~a~~ia~~~~ad~l~i~td-------------vdGiyt~dP~~~~~ak~i~~is~~e~l~~G~~v~~~~A~ 205 (242) |...|.+...|..+... + .-++.|| +|-+++-++..+.+|.+|..|.+-.. +++... T Consensus 19 EVGIG~~~~vA~~L~~~-g--~dv~~tDi~~~av~~gl~~v~DDif~P~~~lY~~A~lIYSIRPp~E-------l~~~i~ 88 (127) T pfam03686 19 EVGIGFFLDVAKRLAER-G--FDVLATDINEKAVPEGLRFVVDDITNPNISIYEGADLIYSIRPPPE-------LQSAIL 88 (127) T ss_pred EEECCCCHHHHHHHHHC-C--CCEEEEECCCCCCCCCCCEEECCCCCCCHHHHCCCCEEEECCCCHH-------HHHHHH T ss_conf 99136888999999985-9--9689997776576678887872588989889558788998189878-------889999 Q ss_pred HHHHHCCCCEEEEEC Q ss_conf 999974996999518 Q gi|254780777|r 206 VLARDCSIPIIVFSI 220 (242) Q Consensus 206 ~~a~~~gI~v~I~ng 220 (242) ++|++.|++++|.-- T Consensus 89 ~lA~~v~a~liI~PL 103 (127) T pfam03686 89 DVAKAVGAPLYIKPL 103 (127) T ss_pred HHHHHCCCCEEEECC T ss_conf 999981998999848 No 89 >PRK01158 phosphoglycolate phosphatase; Provisional Probab=55.02 E-value=16 Score=16.75 Aligned_cols=45 Identities=13% Similarity=0.237 Sum_probs=32.7 Q ss_pred CCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 882499999400368538877708999999999999999769909998236 Q gi|254780777|r 4 FPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGG 54 (242) Q Consensus 4 ~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg 54 (242) |.+|-|+.=+=|+.|.++. .+. .+..+.|.+++++|+.+++.+|- T Consensus 1 MmiKli~~DlDGTLl~~~~--~i~----~~~~~ai~~l~~~Gi~v~iaTGR 45 (226) T PRK01158 1 MMIKAIVIDIDGTITDPDR--RLH----LKAVEAIRKAEKLGVPVILVTGN 45 (226) T ss_pred CCCCEEEEECCCCCCCCCC--CCC----HHHHHHHHHHHHCCCEEEEECCC T ss_conf 9603899945777646919--719----99999999999889989998999 No 90 >PRK10964 ADP-heptose:LPS heptosyl transferase I; Provisional Probab=54.64 E-value=16 Score=16.71 Aligned_cols=20 Identities=15% Similarity=-0.155 Sum_probs=10.6 Q ss_pred HHHHHHHHHHCCCEEEEEEC Q ss_conf 99999999976990999823 Q gi|254780777|r 34 ICADIAEVYAKGIEIGIVVG 53 (242) Q Consensus 34 ia~~I~~~~~~g~~vviV~s 53 (242) ....+..++++.+.+++..- T Consensus 75 ~~~~~~~lr~~~yD~vidlq 94 (322) T PRK10964 75 RKAFRRALQAEQYDAVIDAQ 94 (322) T ss_pred HHHHHHHHHHCCCCEEEECC T ss_conf 99999998745897999885 No 91 >KOG1780 consensus Probab=50.60 E-value=7.7 Score=18.61 Aligned_cols=38 Identities=26% Similarity=0.483 Sum_probs=21.1 Q ss_pred EEEEEEEEEEE-EECC-CCCCCCHHHHHHHHHHHHHHHHCCCE Q ss_conf 49999940036-8538-87770899999999999999976990 Q gi|254780777|r 7 KRVLLKVSGEA-LAGI-SGFGVDIDSVNRICADIAEVYAKGIE 47 (242) Q Consensus 7 KrIViKiGGSs-L~~~-~~~~~~~~~i~~ia~~I~~~~~~g~~ 47 (242) |||++|++|+- ++.- .|| ..+++-+.++.-+....+.. T Consensus 15 Kki~lklnG~r~v~GiLrGy---D~FmNiVlde~vE~~~~~~~ 54 (77) T KOG1780 15 KKIVLKLNGGRKVTGILRGY---DPFMNIVLDETVEPNGDGDK 54 (77) T ss_pred HEEEEEECCCCEEEEEEECC---CHHHHHHHHHCEEECCCCCC T ss_conf 26899947983888887424---56774313420200476874 No 92 >TIGR02482 PFKA_ATP 6-phosphofructokinase; InterPro: IPR012828 6-phosphofructokinase (2.7.1.11 from EC) catalyses the addition of phosphate from ATP to fructose 6-phosphate to give fructose 1,6-bisphosphate. This represents a key control step in glycolysis. This entry contains bacterial ATP-dependent 6-phosphofructokinases, which lack a beta-hairpin loop present in IPR012829 from INTERPRO family members. IPR012829 from INTERPRO contains members that are ATP-dependent as well as members that are pyrophosphate-dependent. IPR011183 from INTERPRO represents the pyrophosphate-dependent phosphofructokinase, diphosphate--fructose-6-phosphate 1-phosphotransferase (2.7.1.90 from EC).; GO: 0003872 6-phosphofructokinase activity, 0005524 ATP binding, 0006002 fructose 6-phosphate metabolic process, 0006096 glycolysis. Probab=50.55 E-value=18 Score=16.31 Aligned_cols=44 Identities=20% Similarity=0.338 Sum_probs=34.5 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 9994003685388777089999999999999997699099982365 Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) .+..||+.|-+..=..++....++.| +..|++.|.+=.||.||- T Consensus 58 Ii~~GGT~L~tAR~~EFK~~evR~kA--~~nLK~~GI~~LVViGGD 101 (302) T TIGR02482 58 IISKGGTILGTARCPEFKTEEVREKA--VENLKKLGIEALVVIGGD 101 (302) T ss_pred CCCCCCCEEECCCCCCCCCHHHHHHH--HHHHHHHCCCEEEEECCC T ss_conf 00158833311478545687899999--999887488668998684 No 93 >COG5663 Uncharacterized conserved protein [Function unknown] Probab=50.31 E-value=7.4 Score=18.74 Aligned_cols=37 Identities=16% Similarity=0.188 Sum_probs=27.0 Q ss_pred CHHHHHHHHHCCCCEEEEEC---CCC--CHHHHHHCC-CCCCC Q ss_conf 09999999974996999518---985--479999779-98782 Q gi|254780777|r 201 DCASVVLARDCSIPIIVFSI---HSP--GGIWGGLSG-IGRST 237 (242) Q Consensus 201 ~~~A~~~a~~~gI~v~I~ng---~~~--~~i~~~l~G-e~~GT 237 (242) +..|.++|++.|+|++.+|. ++| .||.++.++ |..+| T Consensus 143 ~~na~~iAk~~~~~vilins~ynRkp~~~niiR~~~w~e~y~~ 185 (194) T COG5663 143 HDNAGQIAKNAGIPVILINSPYNRKPAAKNIIRANNWAEAYEW 185 (194) T ss_pred CCHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 7248999883597089823734566527889998718999999 No 94 >PRK08305 spoVFB dipicolinate synthase subunit B; Reviewed Probab=50.17 E-value=19 Score=16.27 Aligned_cols=43 Identities=19% Similarity=0.276 Sum_probs=33.1 Q ss_pred CCCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 888249999940036853887770899999999999999976990999823654 Q gi|254780777|r 3 DFPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN 56 (242) Q Consensus 3 ~~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~ 56 (242) .++=|||++=+.||.=+ .++..+++.+++++|-+|..|.|..+ T Consensus 2 ~L~gK~I~lgvTGSiaa-----------y~kv~~~ir~L~~~GA~V~~ImS~~a 44 (195) T PRK08305 2 SLKGKRIGFGLTGSHCT-----------YEEVMPQIEKLVAEGAEVRPIVSYTV 44 (195) T ss_pred CCCCCEEEEEECCHHHH-----------HHHHHHHHHHHHHCCCEEEEEECHHH T ss_conf 76898899996727998-----------99999999999987996999975447 No 95 >TIGR01670 YrbI-phosphatas 3-deoxy-D-manno-octulosonate 8-phosphate phosphatase, YrbI family; InterPro: IPR010023 This entry is a part of the haloacid dehalogenase (HAD) superfamily of hydrolases from beta, gamma and epsilon proteobacteria, Aquifex, Fusobacterium, Porphyromonas and Methanosarcina. All characterised members of the HAD-superfamily hydrolase, subfamily IIIA and most characterised members of the HAD superfamily are phosphatases. HAD superfamily phosphatases contain active site residues in several conserved catalytic motifs , all of which are found conserved in this family. Lipopolysaccharides of Gram-negative bacteria consist of a heteropolysaccharide component (O-antigen) and a hydrophobic component (lipid A). Linking the two portions is 3-Deoxy-D-manno-octulosonate (KDO), an 8-carbon sugar. Biosynthesis of KDO linked to lipid A proceeds via five steps, one of which involves removal of a phosphate from KDO 8-P via the action of KDO 8-P phosphatase (3.1.3.45 from EC; IPR008230 from INTERPRO) . One member of this family, the YrbI protein from Haemophilus influenzae has been cloned, expressed, purified and found to be an active phosphatase. Furthermore, its crystal structure has been determined . The sequence from Methanosarcina acetivorans, Q8TJL6 from SWISSPROT, is distinctive in that it is linked to an N-terminal cytidylyltransferase domain (IPR003329 from INTERPRO) and is annotated as acylneuraminate cytidylyltransferase. This may give some clue as the function of these phosphatases. Several eukaryotic sequences are also closely related to this family; one such sequence is the CMP-N-acetylneuraminic acid synthetase from mouse, but in this case as in the others the phosphatase domain is clearly inactive as many of the active site residues are not conserved.; GO: 0019143 3-deoxy-manno-octulosonate-8-phosphatase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=49.70 E-value=13 Score=17.26 Aligned_cols=23 Identities=9% Similarity=0.056 Sum_probs=11.5 Q ss_pred HHHHHHHHCCCCEEEEECCCCCH Q ss_conf 99999997499699951898547 Q gi|254780777|r 203 ASVVLARDCSIPIIVFSIHSPGG 225 (242) Q Consensus 203 ~A~~~a~~~gI~v~I~ng~~~~~ 225 (242) .+++.+.+.|+++-|.+||+... T Consensus 35 ~gI~~~lk~gi~~AiItGR~~~~ 57 (154) T TIGR01670 35 IGIKLLLKLGIKVAIITGRDAKL 57 (154) T ss_pred HHHHHHHHCCCEEEEEECCCCHH T ss_conf 89999997498698981798756 No 96 >TIGR01302 IMP_dehydrog inosine-5'-monophosphate dehydrogenase; InterPro: IPR005990 Synonyms: Inosine-5'-monophosphate dehydrogenase, Inosinic acid dehydrogenase IMP dehydrogenase (1.1.1.205 from EC,IMPDH) catalyzes the rate-limiting reaction of de novo GTP biosynthesis, the NAD-dependent reduction of IMP into XMP .Inosine 5-phosphate + NAD+ + H2O = xanthosine 5-phosphate + NADH IMP dehydrogenase is associated with cell proliferation and is a possible target for cancer chemotherapy. Mammalian and bacterial IMPDHs are tetramers of identical chains. There are two IMP dehydrogenase isozymes in humans . IMP dehydrogenase nearly always contains a long insertion that has two CBS domains within it and adopts a TIM barrel structure.; GO: 0003938 IMP dehydrogenase activity, 0006177 GMP biosynthetic process. Probab=49.41 E-value=19 Score=16.20 Aligned_cols=33 Identities=24% Similarity=0.322 Sum_probs=25.7 Q ss_pred CCCHHH----HHHHHHCCCCEEEEECC--CCCHHHHHHC-C Q ss_conf 140999----99999749969995189--8547999977-9 Q gi|254780777|r 199 VMDCAS----VVLARDCSIPIIVFSIH--SPGGIWGGLS-G 232 (242) Q Consensus 199 v~~~~A----~~~a~~~gI~v~I~ng~--~~~~i~~~l~-G 232 (242) +=|.+| ++.|.++|||| |++|- .-|.|.|||. | T Consensus 325 vPQ~TAv~~Va~~A~~~Gi~V-IADGGIr~SGDivKAlAaG 364 (476) T TIGR01302 325 VPQITAVYDVAEYAAQSGIPV-IADGGIRYSGDIVKALAAG 364 (476) T ss_pred CHHHHHHHHHHHHHHHCCCEE-EECCCCCCHHHHHHHHHHC T ss_conf 626889999999997279909-9837756255899999816 No 97 >COG0560 SerB Phosphoserine phosphatase [Amino acid transport and metabolism] Probab=48.91 E-value=19 Score=16.15 Aligned_cols=31 Identities=16% Similarity=0.064 Sum_probs=15.4 Q ss_pred CCCHHHHHHHHHHHCHHHHHH--CCCCCCCCCC Q ss_conf 762157888886416446651--4754351234 Q gi|254780777|r 143 LTTDSAAALRASEIGADVILK--GTQVDGVYSA 173 (242) Q Consensus 143 ~~~D~~a~~ia~~~~ad~l~i--~tdvdGiyt~ 173 (242) ++-+..+..++..++.|...- +...||.||. T Consensus 101 gg~~~lv~~ia~~lg~d~~~an~l~~~dG~ltG 133 (212) T COG0560 101 GGFTFLVEPIAERLGIDYVVANELEIDDGKLTG 133 (212) T ss_pred CCHHHHHHHHHHHHCCCHHHHHEEEECCCCCCC T ss_conf 886999999999949757650201431774136 No 98 >pfam05141 DIT1_PvcA Pyoverdine/dityrosine biosynthesis protein. This family includes DIT1 that is involved in synthesising dityrosine. Dityrosine is a sporulation-specific component of the yeast ascospore wall that is essential for the resistance of the spores to adverse environmental conditions. Pseudomonas aeruginosa PvcA is involved in the biosynthesis of pyoverdine. Probab=45.68 E-value=22 Score=15.87 Aligned_cols=45 Identities=31% Similarity=0.505 Sum_probs=34.5 Q ss_pred EEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCH Q ss_conf 94003685388777089999999999999997699099982365410 Q gi|254780777|r 12 KVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIF 58 (242) Q Consensus 12 KiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~ 58 (242) |.-| .|-|. +..+...+|+++|+.|.+.+.-|-++.|+|-|-.+. T Consensus 28 KVlG-~lPD~-aEelaL~~Ln~lc~~I~~vY~PGa~i~I~SDG~Vf~ 72 (274) T pfam05141 28 KVLG-SLPDK-AEELALERLNGLCERIQAVYPPGAKITIVSDGHVFS 72 (274) T ss_pred CCCC-CCCCH-HHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCEEC T ss_conf 4578-89977-899999999999999997588995899972686510 No 99 >TIGR01077 L13_A_E ribosomal protein L13; InterPro: IPR005755 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L13 is one of the proteins from the large ribosomal subunit . In Escherichia coli, L13 is known to be one of the early assembly proteins of the 50S ribosomal subunit. This model represents ribosomal protein of L13 from the Archaea and from the eukaryotic cytosol.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit. Probab=44.41 E-value=23 Score=15.72 Aligned_cols=24 Identities=17% Similarity=0.277 Sum_probs=21.0 Q ss_pred HHHHHHHHHHHHHHCCCEEEEEEC Q ss_conf 999999999999976990999823 Q gi|254780777|r 30 SVNRICADIAEVYAKGIEIGIVVG 53 (242) Q Consensus 30 ~i~~ia~~I~~~~~~g~~vviV~s 53 (242) .|-|||..||...=.|.+|+||=. T Consensus 9 ilGRLAS~VAK~lL~G~~vVvVnA 32 (144) T TIGR01077 9 ILGRLASVVAKRLLNGERVVVVNA 32 (144) T ss_pred HHHHHHHHHHHHHHCCCEEEEEEC T ss_conf 137689999998757986789722 No 100 >TIGR01466 cobJ_cbiH precorrin-3B C17-methyltransferase; InterPro: IPR006363 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobJ and CbiH precorrin-3B C(17)-methyltransferase (2.1.1.131 from EC), both as stand-alone enzymes and when CobJ forms part of a bifunctional enzyme. In the aerobic pathway, once CobG has generated precorrin-3b, CobJ catalyses the methylation of precorrin-3b at C-17 to form precorrin-4 (the extruded methylated C-20 fragment is left attached as an acyl group at C-1) . In the corresponding anaerobic pathway, CbiH carries out this ring contraction, using cobalt-precorrin-3b as a substrate to generate a tetramethylated delta-lactone. These proteins belong to the superfamily of tetrapyrrole (corrin/porphyrin) methylases (IPR000878 from INTERPRO), which includes methylases that use S-adenosylmethionine (S-AdoMet) in the methylation of diverse substrates. A number of other methylases in the cobalamin biosynthesis pathway also belong to this domain superfamily (precorrin-3 methylase, IPR012203 from INTERPRO, IPR012059 from INTERPRO, IPR012382 from INTERPRO, IPR012797 from INTERPRO and others), and a fusion of precorrin-3B C17-methyltransferases with precorrin isomerase is represented by IPR014422 from INTERPRO. Nomenclature note: precorrin-3B C17-methyltransferase is one of the two methyltransferases often referred to as precorrin-3 methylase (the other is precorrin-4 C11-methyltransferase, 2.1.1.133 from EC). ; GO: 0008168 methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=42.86 E-value=18 Score=16.34 Aligned_cols=24 Identities=25% Similarity=0.441 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 9999999999997699099982365 Q gi|254780777|r 31 VNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 31 i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) |.|.-..| ++.++|+.|.|||||- T Consensus 57 ~~Ra~~Ai-~~A~~G~~VAlvSSGD 80 (254) T TIGR01466 57 IERAELAI-ELAKEGKTVALVSSGD 80 (254) T ss_pred HHHHHHHH-HHHHCCCEEEEEECCC T ss_conf 89999999-9986099489993687 No 101 >PRK10949 protease 4; Provisional Probab=42.05 E-value=25 Score=15.49 Aligned_cols=41 Identities=22% Similarity=0.220 Sum_probs=29.3 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 824999994003685388777089999999999999997699099982 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVV 52 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~ 52 (242) +-|-||+++++-. +.....+..+.+.|.++++.|+.|+... T Consensus 112 rI~givLdl~~~~-------g~~~a~l~eI~~AL~~FK~SGK~ViAy~ 152 (618) T PRK10949 112 NITGIVLDLKNFA-------GADQPSMQYIGKALREFRDSGKPVYAVG 152 (618) T ss_pred CEEEEEEECCCCC-------CCCHHHHHHHHHHHHHHHHHCCEEEEEE T ss_conf 8259999777788-------8788999999999999998199399996 No 102 >TIGR01490 HAD-SF-IB-hyp1 HAD-superfamily subfamily IB hydrolase, TIGR01490; InterPro: IPR006385 This group of sequences belong to the IB subfamily of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. The sequences are all bacterial. The IB subfamily includes the enzyme phosphoserine phosphatase. Due to this relationship, several of these sequences have been annotated as "phosphoserine phosphatase related proteins," or "phosphoserine phosphatase-family enzymes." There is presently no evidence that any of the proteins in this family possess PSPase activity. . Probab=40.96 E-value=26 Score=15.39 Aligned_cols=18 Identities=28% Similarity=0.184 Sum_probs=11.4 Q ss_pred HHHHHHCCCEEEEEECCC Q ss_conf 999997699099982365 Q gi|254780777|r 38 IAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 38 I~~~~~~g~~vviV~sg~ 55 (242) +..++.+||.++|||++. T Consensus 98 ~~~H~~~G~~~vlvSaS~ 115 (204) T TIGR01490 98 IKAHKAEGHTIVLVSASL 115 (204) T ss_pred HHHHHHCCCEEEEECCCH T ss_conf 999998089389981678 No 103 >TIGR01662 HAD-SF-IIIA hydrolase, HAD-superfamily, subfamily IIIA; InterPro: IPR006549 This group of proteins is a part of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. The Class III subfamilies are characterised by the lack of any domains located either between the first and second conserved catalytic motifs (as in the Class I subfamilies) or between the second and third conserved catalytic motifs (as in the Class II subfamilies) of the superfamily domain , . The IIIA subfamily contains five major clades: histidinol-phosphatase , histidinol-phosphatase-related protein, DNA 3-phosphatase and sequences related to YqeG and YrbI.. Probab=40.74 E-value=25 Score=15.48 Aligned_cols=33 Identities=24% Similarity=0.256 Sum_probs=25.7 Q ss_pred HHHHHH--HHHHHH-HCCCEEEEEECCCCCHHHHHH Q ss_conf 999999--999999-769909998236541023344 Q gi|254780777|r 31 VNRICA--DIAEVY-AKGIEIGIVVGGGNIFRGSQV 63 (242) Q Consensus 31 i~~ia~--~I~~~~-~~g~~vviV~sg~~~~~~~~~ 63 (242) ...+.+ .+++++ .+|++|+|||.....+|++.. T Consensus 30 ~~~~~~Gl~~~~l~W~~G~~v~i~tN~~Gigrg~~~ 65 (171) T TIGR01662 30 YPGVIDGLALAELKWKAGYKVVIVTNQSGIGRGYFS 65 (171) T ss_pred CCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCC T ss_conf 688600189998887459769999788742111268 No 104 >TIGR00099 Cof-subfamily Cof-like hydrolase; InterPro: IPR000150 The Haloacid Dehydrogenase (HAD) superfamily includes phosphatases, phosphonatases, P-type ATPases, beta-phosphoglucomutases, phosphomannomutases, and dehalogenases, which are involved in a variety of cellular processes ranging from amino acid biosynthesis to detoxification . Proteins in this entry are mostly uncharacterised, though they form a distinct subgroup within the HAD superfamily. Members are found almost exclusively in bacteria and many species contain several paralogs, for example Escherichia coli contains a total of six proteins from this entry. Sequence similarities suggest that these enzymes are phosphatases which work on phosphorylated sugars. The YbiV (P75792 from SWISSPROT) protein from E. coli has been experimentally characterised . This enzyme catalyzes the hydrolysis of sugar phosphate to sugar and inorganic phosphate. It has a wide substrate specificity, catalyzing the hydrolysis of ribose-5-phosphate and glucose-6-phosphate most efficiently, but it is not known if these are the real substrates in vivo. The protein appears to be a monomer that contains two domains, an alpha-beta hydrolase domain that forms a Rossman fold, and an alpha-beta domain. The active site is found in a negatively charged cavity found at the interface between the two domains. ; GO: 0016787 hydrolase activity, 0008152 metabolic process. Probab=37.18 E-value=29 Score=15.02 Aligned_cols=41 Identities=17% Similarity=0.345 Sum_probs=28.4 Q ss_pred EEEEEEEEEEE-CCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 99994003685-388777089999999999999997699099982365 Q gi|254780777|r 9 VLLKVSGEALA-GISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 9 IViKiGGSsL~-~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) |++=+-|+.|. +.. .+. .+.-+.|++++.+|++++|++|-. T Consensus 2 i~~D~DGTLL~~~~~--~i~----~~~~~al~~l~~~G~~~~laTGR~ 43 (270) T TIGR00099 2 IFIDLDGTLLNKDDK--EIS----PSTKEALAKLREKGIKVVLATGRP 43 (270) T ss_pred EEEECCCEEECCCCC--CCC----HHHHHHHHHHHHCCCEEEEEECCC T ss_conf 889638503559987--518----889999999996698899981886 No 105 >TIGR01978 sufC FeS assembly ATPase SufC; InterPro: IPR010230 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry represents SufC, which acts as an ATPase in the SUF system. SufC belongs to the ATP-binding cassette transporter family (IPR003439 from INTERPRO) but is no longer thought to be part of a transporter. The complex is reported as cytosolic or associated with the membrane.; GO: 0005524 ATP binding, 0006810 transport. Probab=36.95 E-value=30 Score=15.00 Aligned_cols=31 Identities=29% Similarity=0.347 Sum_probs=16.3 Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCC Q ss_conf 68538877708999999999999999769909998236541 Q gi|254780777|r 17 ALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNI 57 (242) Q Consensus 17 sL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~ 57 (242) +|+.+||-|= +.++..|+ ||.-.-|++|-.. T Consensus 30 AiMGPNGsGK-----STL~~~ia-----Ghp~y~vt~G~I~ 60 (248) T TIGR01978 30 AIMGPNGSGK-----STLSKTIA-----GHPKYEVTSGTIL 60 (248) T ss_pred EEECCCCCCH-----HHHHHHHH-----CCCCEEEEEEEEE T ss_conf 9868899847-----88877761-----7993378420898 No 106 >COG4598 HisP ABC-type histidine transport system, ATPase component [Amino acid transport and metabolism] Probab=36.59 E-value=30 Score=14.96 Aligned_cols=26 Identities=23% Similarity=0.152 Sum_probs=9.4 Q ss_pred CCCCCCHHHHHHHHHCCCC-EEEEECC Q ss_conf 9981409999999974996-9995189 Q gi|254780777|r 196 GLKVMDCASVVLARDCSIP-IIVFSIH 221 (242) Q Consensus 196 G~~v~~~~A~~~a~~~gI~-v~I~ng~ 221 (242) -.|++=..-...|.+-.-. ++.-+|. T Consensus 203 rTMv~VTHEM~FAR~Vss~v~fLh~G~ 229 (256) T COG4598 203 RTMVVVTHEMGFARDVSSHVIFLHQGK 229 (256) T ss_pred CEEEEEEEEHHHHHHHHHHEEEEECCE T ss_conf 748999620467775300168755444 No 107 >PRK06278 cobyrinic acid a,c-diamide synthase; Validated Probab=34.53 E-value=33 Score=14.76 Aligned_cols=27 Identities=30% Similarity=0.496 Sum_probs=21.5 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHH Q ss_conf 999999997499699951898547999 Q gi|254780777|r 202 CASVVLARDCSIPIIVFSIHSPGGIWG 228 (242) Q Consensus 202 ~~A~~~a~~~gI~v~I~ng~~~~~i~~ 228 (242) -.+|++|+..|+|++|.++...+-|.. T Consensus 346 YS~aeIak~lg~PvyivS~cnksGIEg 372 (482) T PRK06278 346 YSGAEIAKALGFPVYIVSSCSKGGIEG 372 (482) T ss_pred CCHHHHHHHCCCCEEEEECCCCCCCHH T ss_conf 768999985599889994255566047 No 108 >COG1915 Uncharacterized conserved protein [Function unknown] Probab=34.52 E-value=33 Score=14.76 Aligned_cols=43 Identities=21% Similarity=0.288 Sum_probs=31.3 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 9994003685388777089999999999999997699099982365 Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) +.-|=+|-+.++.+. .-.++++|+++.+.+++|-++++|.|.+ T Consensus 168 ~F~FM~~~vSsErp~---e~~i~~IA~E~~ei~~kgGkIvvv~GPa 210 (415) T COG1915 168 EFEFMGSGVSSERPV---ETLIEQIAWEIREIRDKGGKIVVVAGPA 210 (415) T ss_pred EEEEECCCCCCCCCH---HHHHHHHHHHHHHHHHCCCCEEEEECCE T ss_conf 577612787767857---7999999999999986499289982673 No 109 >TIGR00213 GmhB_yaeD D,D-heptose 1,7-bisphosphate phosphatase; InterPro: IPR004446 In a number of species, including Escherichia coli, the histidine biosynthetic enzymes imidazole glycerol phosphate dehydratase and histidinol phosphatase are found together in the bifunctional protein HisB. This family represents a protein closely related to the histidinol phosphatase domain of HisB. The protein is found both in Helicobacter pylori, for which the histidine biosynthetic pathway appears to be absent, and in species that also have a bifunctional HisB protein.. Probab=32.62 E-value=35 Score=14.57 Aligned_cols=82 Identities=12% Similarity=0.096 Sum_probs=46.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHH-HHHHHHHHCCCCCCC Q ss_conf 8999999999999999769909998236541023344203777754024678888898999999-999997618864113 Q gi|254780777|r 27 DIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQVVAENYLLCERSTVDSMGMLSTVINALA-LDLALRKINVPTVIL 105 (242) Q Consensus 27 ~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~ 105 (242) |.+++..+.+.++++++.|+.+++|+-=...+|++-.... - ..+...+.-.+ ....+...|+-.+.+ T Consensus 24 ~F~F~~Gv~~~l~~~K~~Gy~L~lvTNQSGI~RG~ft~~~-------F-----~~LT~~~~~SLf~e~d~~LDGIYyC~H 91 (178) T TIGR00213 24 NFEFIDGVIDALRELKKLGYKLVLVTNQSGIARGYFTEKQ-------F-----EQLTEWLDWSLFAERDVDLDGIYYCPH 91 (178) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCHHH-------H-----HHHHHHHHHHHHHHCCCCCCCEEECCC T ss_conf 3215566999999987458278898745655435427788-------9-----999999988876524887363244578 Q ss_pred CCCH-HHHHHHHHHHH Q ss_conf 4001-00245665432 Q gi|254780777|r 106 SSIF-MPQVCEVFSCR 120 (242) Q Consensus 106 ~~~~-~~~~~~~~~~~ 120 (242) .+.. .....+...+| T Consensus 92 ~p~~~~~e~~~~C~CR 107 (178) T TIGR00213 92 HPEGSKEEYKENCDCR 107 (178) T ss_pred CCCCCHHHHHHCCCCC T ss_conf 9732013566226778 No 110 >PRK11544 hycI hydrogenase 3 maturation protease; Provisional Probab=32.32 E-value=35 Score=14.54 Aligned_cols=20 Identities=40% Similarity=0.579 Sum_probs=16.3 Q ss_pred CEEEEEEEEEEEEECCCCCCC Q ss_conf 249999940036853887770 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGV 26 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~ 26 (242) ||++|+=+ |+.|..+++.++ T Consensus 1 mk~lvlgi-GN~L~gDDG~G~ 20 (156) T PRK11544 1 MTDVLLCV-GNSMMGDDGAGP 20 (156) T ss_pred CCEEEEEE-CCCCCCCCCHHH T ss_conf 96489983-864305574759 No 111 >cd02877 GH18_hevamine_XipI_class_III This conserved domain family includes xylanase inhibitor Xip-I, and the class III plant chitinases such as hevamine, concanavalin B, and PPL2, all of which have a glycosyl hydrolase family 18 (GH18) domain. Hevamine is a class III endochitinase that hydrolyzes the linear polysaccharide chains of chitin and peptidoglycan and is important for defense against pathogenic bacteria and fungi. PPL2 (Parkia platycephala lectin 2) is a class III chitinase from Parkia platycephala seeds that hydrolyzes beta(1-4) glycosidic bonds linking 2-acetoamido-2-deoxy-beta-D-glucopyranose units in chitin. Probab=31.87 E-value=36 Score=14.49 Aligned_cols=26 Identities=27% Similarity=0.494 Sum_probs=22.2 Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 99999999999976990999823654 Q gi|254780777|r 31 VNRICADIAEVYAKGIEIGIVVGGGN 56 (242) Q Consensus 31 i~~ia~~I~~~~~~g~~vviV~sg~~ 56 (242) -..+..+|+.++.+|++|+|-.||+. T Consensus 58 C~~i~~dI~~CQ~~GkkVlLSlGG~~ 83 (280) T cd02877 58 CPQLGADIKHCQSKGKKVLLSIGGAG 83 (280) T ss_pred HHHHHHHHHHHHHCCCEEEEECCCCC T ss_conf 05689999999987998999736888 No 112 >TIGR00300 TIGR00300 conserved hypothetical protein TIGR00300; InterPro: IPR005239 This family of conserved hypothetical proteins has no known function.. Probab=31.67 E-value=36 Score=14.47 Aligned_cols=43 Identities=21% Similarity=0.343 Sum_probs=33.1 Q ss_pred EEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHC----C-CEEEEEECCC Q ss_conf 99940036853887770899999999999999976----9-9099982365 Q gi|254780777|r 10 LLKVSGEALAGISGFGVDIDSVNRICADIAEVYAK----G-IEIGIVVGGG 55 (242) Q Consensus 10 ViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~----g-~~vviV~sg~ 55 (242) |.-|=+|-+.++.+. ...+++||+++.+++++ | =++++|-|-+ T Consensus 175 ~FeFM~~~vSSE~p~---e~~i~~IA~E~~ei~~~yrk~GkG~I~vvaGPa 222 (418) T TIGR00300 175 VFEFMGSEVSSERPV---ETLIEKIAKEIYEIREKYRKTGKGKIVVVAGPA 222 (418) T ss_pred CCEECCCCCCCCCHH---HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCE T ss_conf 000015776544305---799999999999999986315883389985780 No 113 >CHL00189 infB translation initiation factor 2; Provisional Probab=31.46 E-value=37 Score=14.45 Aligned_cols=100 Identities=18% Similarity=0.135 Sum_probs=60.3 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC------HHHHHHCCCCCCC--- Q ss_conf 1111246556777621578888864164466514754351234322344444444115------9999855998140--- Q gi|254780777|r 131 VVIFSGGTGNAFLTTDSAAALRASEIGADVILKGTQVDGVYSADPRVHASSTRFDSLT------YNQFIEKGLKVMD--- 201 (242) Q Consensus 131 ~~~~~~~~~~~~~~~D~~a~~ia~~~~ad~l~i~tdvdGiyt~dP~~~~~ak~i~~is------~~e~l~~G~~v~~--- 201 (242) .|++........|-+..+-++.-.-+-+..----|.-.|-|.-+-....+.+.|..+. +..|..+|+.++| T Consensus 272 pPVVTIMGHVDHGKTsLLD~iR~t~Va~~EaGGITQhIGAy~V~~~~~~~~~~ITFlDTPGHeAFt~MRaRGA~vTDIvI 351 (770) T CHL00189 272 PPIVTILGHVDHGKTTLLDAIRKTNIAQKEAGGITQKIGAYEVEVPYKDQNQKIVFLDTPGHEAFSSMRSRGANVTDIAI 351 (770) T ss_pred CCEEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCCEECEEEEEECCCCCCCEEEEECCCCHHHHHHHHHCCCCCCCEEE T ss_conf 99899857725772037888852885134567655503529997515788975899559946889999862786666799 Q ss_pred -------------HHHHHHHHHCCCCEEEEE------CCCCCHHHHHH Q ss_conf -------------999999997499699951------89854799997 Q gi|254780777|r 202 -------------CASVVLARDCSIPIIVFS------IHSPGGIWGGL 230 (242) Q Consensus 202 -------------~~A~~~a~~~gI~v~I~n------g~~~~~i~~~l 230 (242) ..|+..|+.+|+|++++= +-+|+.+..-| T Consensus 352 LVVAADDGVmPQTiEAI~hakaA~VPiIVAINKiDkp~an~~rVk~eL 399 (770) T CHL00189 352 LIVAADDGVKPQTIEAINHIQAANVPIIVAINKIDKENANIDRVKQEL 399 (770) T ss_pred EEEECCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHH T ss_conf 999657885672799999998769988999877458998857899999 No 114 >TIGR02440 FadJ fatty oxidation complex, alpha subunit FadJ; InterPro: IPR012802 Members of this family represent the alpha subunit of the multifunctional enzyme complex of the fatty acid degradation cycle, which plays a minor role in aerobic beta-oxidation of fatty acids. The FadJI complex is necessary for anaerobic growth on short-chain acids with nitrate as an electron acceptor. Activities include: enoyl-CoA hydratase (4.2.1.17 from EC), 3-hydroxyacyl-CoA dehydrogenase (1.1.1.35 from EC), and 3-hydroxybutyryl-CoA epimerase (5.1.2.3 from EC). A representative is the Escherichia coli FadJ protein (also named YfcX) (P77399 from SWISSPROT). This entry excludes FadB of IPR012799 from INTERPRO.; GO: 0003857 3-hydroxyacyl-CoA dehydrogenase activity, 0004300 enoyl-CoA hydratase activity, 0008692 3-hydroxybutyryl-CoA epimerase activity, 0051287 NAD binding, 0006635 fatty acid beta-oxidation, 0016507 fatty acid beta-oxidation multienzyme complex. Probab=30.60 E-value=38 Score=14.36 Aligned_cols=32 Identities=9% Similarity=0.114 Sum_probs=22.8 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCC--EEEEEEC Q ss_conf 8777089999999999999997699--0999823 Q gi|254780777|r 22 SGFGVDIDSVNRICADIAEVYAKGI--EIGIVVG 53 (242) Q Consensus 22 ~~~~~~~~~i~~ia~~I~~~~~~g~--~vviV~s 53 (242) .-.++..++..++.+.++++++... =+|+||+ T Consensus 22 ~mNTLkaeF~~qv~~il~q~~~~~~l~GlV~~Sg 55 (732) T TIGR02440 22 KMNTLKAEFADQVSEILKQLKEKKELRGLVLVSG 55 (732) T ss_pred CCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 5202255446899999999972888735899852 No 115 >cd03802 GT1_AviGT4_like This family is most closely related to the GT1 family of glycosyltransferases. aviGT4 in Streptomyces viridochromogenes has been shown to be involved in biosynthesis of oligosaccharide antibiotic avilamycin A. Inactivation of aviGT4 resulted in a mutant that accumulated a novel avilamycin derivative lacking the terminal eurekanate residue. Probab=30.37 E-value=38 Score=14.34 Aligned_cols=26 Identities=23% Similarity=0.516 Sum_probs=18.4 Q ss_pred HHHHHHHHH-HHHHCCCEEEEEECCCC Q ss_conf 999999999-99976990999823654 Q gi|254780777|r 31 VNRICADIA-EVYAKGIEIGIVVGGGN 56 (242) Q Consensus 31 i~~ia~~I~-~~~~~g~~vviV~sg~~ 56 (242) +++.+..++ .|.++||+|.+++.+++ T Consensus 21 ~e~~~~~La~~L~~~Gh~V~v~~~~~~ 47 (335) T cd03802 21 TERVVAALTEGLVARGHEVTLFASGDS 47 (335) T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 999999999999976998999962898 No 116 >TIGR02177 PorB_KorB 2-oxoacid:acceptor oxidoreductase, beta subunit, pyruvate/2-ketoisovalerate family; InterPro: IPR011896 A number of anaerobic and microaerophilic species lack pyruvate dehydrogenase and have instead a four subunit, oxygen-sensitive pyruvate oxidoreductase, with either ferredoxins or flavodoxins used as the acceptor. Several related four-subunit enzymes may exist in the same species. This entry describes the beta subunits, representing mostly pyruvate and 2-ketoisovalerate specific enzymes.. Probab=30.13 E-value=38 Score=14.31 Aligned_cols=30 Identities=20% Similarity=0.267 Sum_probs=26.4 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 899999999999999976990999823654 Q gi|254780777|r 27 DIDSVNRICADIAEVYAKGIEIGIVVGGGN 56 (242) Q Consensus 27 ~~~~i~~ia~~I~~~~~~g~~vviV~sg~~ 56 (242) |...|+.+-+.+.||+=.=++|+||||-|- T Consensus 9 DFGIL~AL~~AL~EL~LdP~qvvVVSGIGC 38 (302) T TIGR02177 9 DFGILSALQKALAELNLDPEQVVVVSGIGC 38 (302) T ss_pred CHHHHHHHHHHHHHHCCCCCCEEEEECCCC T ss_conf 677899999999862889852799813131 No 117 >COG0680 HyaD Ni,Fe-hydrogenase maturation factor [Energy production and conversion] Probab=28.88 E-value=40 Score=14.18 Aligned_cols=47 Identities=28% Similarity=0.499 Sum_probs=30.8 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCH Q ss_conf 824999994003685388777089999999999999997699099982365410 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIF 58 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~ 58 (242) +|+.+|+=+ |+.|..++|+|+ ++|+.+++....-..+.++-+|.... T Consensus 1 ~~~ilIlG~-GN~L~~DDG~Gv------~vae~L~~~~~~~~~v~vid~Gt~~~ 47 (160) T COG0680 1 KMRILILGV-GNILMGDDGFGV------RVAEKLKKRYKPPENVEVIDGGTAGP 47 (160) T ss_pred CCEEEEEEE-CCCCCCCCCCCH------HHHHHHHHHCCCCCCEEEEECCCCCH T ss_conf 975899960-884416674039------99999998457777738997687708 No 118 >pfam05116 S6PP Sucrose-6F-phosphate phosphohydrolase. This family consists of Sucrose-6F-phosphate phosphohydrolase proteins found in plants and cyanobacteria. Sucrose-6(F)-phosphate phosphohydrolase catalyses the final step in the pathway of sucrose biosynthesis. Probab=27.98 E-value=42 Score=14.08 Aligned_cols=45 Identities=11% Similarity=0.125 Sum_probs=31.5 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 824999994003685388777089999999999999997699099982365 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) +.|-|+.-+=|+.|.++.. +. .+..+.|.+++++|+.+++++|-. T Consensus 1 P~~LIa~DlDGTLL~~~~~--i~----~~~~~al~~l~~~Gi~vviaTGR~ 45 (247) T pfam05116 1 PRLLLVSDLDNTLVDGDNE--AL----ARLNQLLEAQYRPDSLLVFATGRS 45 (247) T ss_pred CCEEEEEECCCCCCCCCCC--CC----HHHHHHHHHHHHCCCEEEEECCCC T ss_conf 9889999387878093495--88----999999999995898899988999 No 119 >pfam10686 DUF2493 Protein of unknown function (DUF2493). Members of this family are all Proteobacteria. The function is not known. Probab=27.92 E-value=42 Score=14.07 Aligned_cols=16 Identities=25% Similarity=0.453 Sum_probs=8.4 Q ss_pred HHHHCCCEEEEEECCC Q ss_conf 9997699099982365 Q gi|254780777|r 40 EVYAKGIEIGIVVGGG 55 (242) Q Consensus 40 ~~~~~g~~vviV~sg~ 55 (242) +++++-..+++||||+ T Consensus 26 ~~~~~~pd~vlihGG~ 41 (71) T pfam10686 26 KVHARHPDMVLLHGGA 41 (71) T ss_pred HHHHHCCCEEEEECCC T ss_conf 9998689879997798 No 120 >TIGR01313 therm_gnt_kin carbohydrate kinase, thermoresistant glucokinase family; InterPro: IPR006001 This family of proteins includes thermoresistant and thermosensitve isozymes of gluconate kinase (gluconokinase) in E. coli and other related proteins; members of this family are often named by similarity to the thermostable isozyme. These proteins show homology to shikimate kinases and adenylate kinases but not to gluconate kinases from the FGGY family of carbohydrate kinases (IPR000577 from INTERPRO).; GO: 0016301 kinase activity, 0005975 carbohydrate metabolic process. Probab=27.45 E-value=43 Score=14.02 Aligned_cols=37 Identities=16% Similarity=0.087 Sum_probs=28.6 Q ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCE--EEEEECCC Q ss_conf 036853887770899999999999999976990--99982365 Q gi|254780777|r 15 GEALAGISGFGVDIDSVNRICADIAEVYAKGIE--IGIVVGGG 55 (242) Q Consensus 15 GSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~--vviV~sg~ 55 (242) |-=|+|++. ++.|.+++++.....+.+.. ++|++|+= T Consensus 49 GiPL~DdDR----~pWL~~l~~~~~~~~~~~~~~~~~~~CSAL 87 (175) T TIGR01313 49 GIPLNDDDR----WPWLKNLADALAQAAAKNKVHLVIITCSAL 87 (175) T ss_pred CCCCCCCCC----HHHHHHHHHHHHHHHHCCCCCCEEEEEECC T ss_conf 888870120----437999999999998457745447884011 No 121 >cd03522 MoeA_like MoeA_like. This domain is similar to a domain found in a variety of proteins involved in biosynthesis of molybdopterin cofactor, like MoaB, MogA, and MoeA. There this domain is presumed to bind molybdopterin. The exact function of this subgroup is unknown. Probab=27.10 E-value=43 Score=13.98 Aligned_cols=17 Identities=41% Similarity=0.407 Sum_probs=12.2 Q ss_pred HHHHHHHHHCCCEEEEE Q ss_conf 99999999769909998 Q gi|254780777|r 35 CADIAEVYAKGIEIGIV 51 (242) Q Consensus 35 a~~I~~~~~~g~~vviV 51 (242) .+.|..|.+.|++-+.| T Consensus 29 ~eDI~~L~~~G~~~v~v 45 (312) T cd03522 29 AEDIAALLAAGKEHVYV 45 (312) T ss_pred HHHHHHHHHCCCCEEEE T ss_conf 99999999779967999 No 122 >TIGR01488 HAD-SF-IB HAD-superfamily hydrolase, subfamily IB (PSPase-like); InterPro: IPR006383 This group represents a subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. Subfamily IA, B, C and D are distinguished from the rest of the superfamily by the presence of a variable domain between the first and second conserved catalytic motifs. In subfamilies IA and IB, this domain consists of an alpha-helical bundle. It was necessary to model these two subfamilies separately, breaking them at an apparent phylogenetic bifurcation, so that the resulting model(s) are not so broadly defined that members of subfamily III (which lack the variable domain) are included. ; GO: 0016791 phosphoric monoester hydrolase activity, 0008152 metabolic process. Probab=26.95 E-value=44 Score=13.97 Aligned_cols=17 Identities=24% Similarity=0.198 Sum_probs=7.9 Q ss_pred CCHHHHHHHHHHHCHHH Q ss_conf 62157888886416446 Q gi|254780777|r 144 TTDSAAALRASEIGADV 160 (242) Q Consensus 144 ~~D~~a~~ia~~~~ad~ 160 (242) +.+....-+|..++.+. T Consensus 113 g~~~~~~~~~e~~g~~~ 129 (204) T TIGR01488 113 GLDFLVEPLAEKLGVDE 129 (204) T ss_pred CHHHHHHHHHHHCCHHH T ss_conf 86799999997528434 No 123 >PRK10976 putative sugar phosphatase; Provisional Probab=26.73 E-value=44 Score=13.94 Aligned_cols=44 Identities=18% Similarity=0.205 Sum_probs=30.7 Q ss_pred CEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 24999994003685388777089999999999999997699099982365 Q gi|254780777|r 6 YKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 6 ~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) ||-|+.=+=|+.|.++. .+.. +-.+.|.+++++|+.+++.+|-. T Consensus 2 iKlIa~DlDGTLl~~~~--~is~----~~~~ai~~l~~~Gi~~viATGR~ 45 (266) T PRK10976 2 YQVVASDLDGTLLSPDH--TLSP----YAKETLKLLTARGIHFVFATGRH 45 (266) T ss_pred EEEEEEECCCCCCCCCC--CCCH----HHHHHHHHHHHCCCEEEEECCCC T ss_conf 39999937646558879--7199----99999999997899999997999 No 124 >PRK10886 DnaA initiator-associating protein DiaA; Provisional Probab=26.11 E-value=45 Score=13.87 Aligned_cols=30 Identities=20% Similarity=0.189 Sum_probs=22.3 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHHHC Q ss_conf 999999997499699951898547999977 Q gi|254780777|r 202 CASVVLARDCSIPIIVFSIHSPGGIWGGLS 231 (242) Q Consensus 202 ~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~ 231 (242) +.|++.|++.|+.++-+.|++-+.+...+. T Consensus 126 i~Ai~~A~~~g~~~i~ltG~~gg~l~~l~~ 155 (196) T PRK10886 126 VKAVEAAVTRDMTIVALTGYDGGELAGLLG 155 (196) T ss_pred HHHHHHHHHCCCEEEEEECCCCHHHHHHCC T ss_conf 999999998899899997688623665157 No 125 >PRK08674 bifunctional phosphoglucose/phosphomannose isomerase; Validated Probab=25.85 E-value=46 Score=13.84 Aligned_cols=22 Identities=18% Similarity=0.443 Sum_probs=16.9 Q ss_pred HHHHHHCCCEEEEEECCCCCHH Q ss_conf 9999976990999823654102 Q gi|254780777|r 38 IAEVYAKGIEIGIVVGGGNIFR 59 (242) Q Consensus 38 I~~~~~~g~~vviV~sg~~~~~ 59 (242) +.+..++|-+++.|++||.-.+ T Consensus 98 ~~~A~~rga~vi~ItsGG~L~~ 119 (328) T PRK08674 98 VEQAKKRGAKIIAITSGGKLAE 119 (328) T ss_pred HHHHHHCCCCEEEEECCCCHHH T ss_conf 9999975995899948970678 No 126 >cd02871 GH18_chitinase_D-like GH18 domain of Chitinase D (ChiD). ChiD, a chitinase found in Bacillus circulans, hydrolyzes the 1,4-beta-linkages of N-acetylglucosamine in chitin and chitodextrins. The domain architecture of ChiD includes a catalytic glycosyl hydrolase family 18 (GH18) domain, a chitin-binding domain, and a fibronectin type III domain. The chitin-binding and fibronectin type III domains are located either N-terminal or C-terminal to the catalytic domain. This family includes exochitinase Chi36 from Bacillus cereus. Probab=25.36 E-value=47 Score=13.79 Aligned_cols=26 Identities=38% Similarity=0.525 Sum_probs=21.2 Q ss_pred HHHHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 99999999999976990999823654 Q gi|254780777|r 31 VNRICADIAEVYAKGIEIGIVVGGGN 56 (242) Q Consensus 31 i~~ia~~I~~~~~~g~~vviV~sg~~ 56 (242) -..+-.+|+.++.+|++|+|-.||+. T Consensus 59 ~~~f~~dI~~~qs~G~kVllSiGGa~ 84 (312) T cd02871 59 PAEFKADIKALQAKGKKVLISIGGAN 84 (312) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 89999999999978998999807877 No 127 >COG1126 GlnQ ABC-type polar amino acid transport system, ATPase component [Amino acid transport and metabolism] Probab=24.91 E-value=48 Score=13.73 Aligned_cols=18 Identities=28% Similarity=0.362 Sum_probs=7.5 Q ss_pred CHHHHHHHHHHHCHHHHH Q ss_conf 215788888641644665 Q gi|254780777|r 145 TDSAAALRASEIGADVIL 162 (242) Q Consensus 145 ~D~~a~~ia~~~~ad~l~ 162 (242) --+.|...|-+++.+.++ T Consensus 142 qQRVAIARALaM~P~vmL 159 (240) T COG1126 142 QQRVAIARALAMDPKVML 159 (240) T ss_pred HHHHHHHHHHCCCCCEEE T ss_conf 899999998717998886 No 128 >PRK07313 phosphopantothenoylcysteine decarboxylase; Validated Probab=24.74 E-value=48 Score=13.71 Aligned_cols=37 Identities=22% Similarity=0.383 Sum_probs=26.5 Q ss_pred EEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 4999994003685388777089999999999999997699099982365 Q gi|254780777|r 7 KRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 7 KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) |||++=+.||+=+ -+..+.+.++++.|.+|-+|.+-. T Consensus 3 K~IllgvtGsIAa------------yK~~~lir~L~k~g~~V~vi~T~~ 39 (180) T PRK07313 3 KKILLAVSGSIAA------------YKAADLTSQLTKIGYDVTVIMTKA 39 (180) T ss_pred CEEEEEEECHHHH------------HHHHHHHHHHHHCCCEEEEEECCC T ss_conf 7699999549999------------999999999998899599997746 No 129 >KOG2599 consensus Probab=24.45 E-value=49 Score=13.68 Aligned_cols=134 Identities=15% Similarity=0.006 Sum_probs=64.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC--CCCHHHHHHH-----CCCCCC-------CCHHHH--HHHHHHHHHH Q ss_conf 77708999999999999999769909998236--5410233442-----037777-------540246--7888889899 Q gi|254780777|r 23 GFGVDIDSVNRICADIAEVYAKGIEIGIVVGG--GNIFRGSQVV-----AENYLL-------CERSTV--DSMGMLSTVI 86 (242) Q Consensus 23 ~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg--~~~~~~~~~~-----~~~~~~-------~~~~~~--~~~~~~~~~~ 86 (242) ||..+...++.+++.+++++++..+++-|.-. |--+++|... .+.... +.+-++ ..-....+.. T Consensus 88 GY~~n~~~l~~i~~iv~~lk~~np~~~wv~DPVmGDnG~lYV~eelipvYr~~i~~ladiiTPNqFE~EiLtg~~I~t~e 167 (308) T KOG2599 88 GYLPNVSFLQKIADIVKKLKKKNPNLTWVCDPVMGDNGRLYVPEELIPVYRDLIIPLADIITPNQFEAEILTGMEIRTEE 167 (308) T ss_pred ECCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCEECCHHHHHHHHHHHCCHHHHCCCCCHHHHHHCCCEECCHH T ss_conf 05688269999999999998429984999676346785174458777999986302555107761111143287424499 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 9999999997618864113400100245665432100001100111112465567776215788888641 Q gi|254780777|r 87 NALALDLALRKINVPTVILSSIFMPQVCEVFSCRNAVSYLSQGKVVIFSGGTGNAFLTTDSAAALRASEI 156 (242) Q Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~a~~ia~~~ 156 (242) -.......+++.+.+..+.+..................-.......-++-..+.+.|++|.-++++..-+ T Consensus 168 da~~a~~~lhq~~v~~vVITS~~~~~~~g~~l~c~gs~~~~~~f~~~ipki~~~FtGTGDLfsaLLla~~ 237 (308) T KOG2599 168 DAKRAVEKLHQKGVKTVVITSFDLGEFTGETLRCIGSSCGSERFRYLIPKIDGVFTGTGDLFSALLLAWL 237 (308) T ss_pred HHHHHHHHHHHHCCCEEEEEEEEECCCCCCEEEEEEECCCCCEEEEEECCCCEEEECCCHHHHHHHHHHH T ss_conf 9999999999748987999744307888857999986168825899922555289605189999999997 No 130 >PRK05920 aromatic acid decarboxylase; Validated Probab=24.39 E-value=49 Score=13.67 Aligned_cols=40 Identities=23% Similarity=0.453 Sum_probs=25.0 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 8249999940036853887770899999999999999976990999823654 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN 56 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~ 56 (242) +|||||+=+.|.|= ..|+ .++ +..+.+.+.++-+|.|-++ T Consensus 3 ~mkrivvgITGASG---~~ya------~rl---l~~L~~~~~ev~lviS~~a 42 (205) T PRK05920 3 KMKRIVLAITGASG---AIYG------VRL---LECLLAADYEVHLVISKAA 42 (205) T ss_pred CCCEEEEEEECHHH---HHHH------HHH---HHHHHHCCCEEEEEECHHH T ss_conf 88759999865427---9999------999---9999867998999986789 No 131 >pfam08645 PNK3P Polynucleotide kinase 3 phosphatase. Polynucleotide kinase 3 phosphatases play a role in the repair of single breaks in DNA induced by DNA-damaging agents such as gamma radiation and camptothecin. Probab=24.19 E-value=49 Score=13.65 Aligned_cols=45 Identities=16% Similarity=0.214 Sum_probs=29.2 Q ss_pred EEEEEEEEEEECCCCC-----CCCHHHH-HHHHHHHHHHHHCCCEEEEEEC Q ss_conf 9999400368538877-----7089999-9999999999976990999823 Q gi|254780777|r 9 VLLKVSGEALAGISGF-----GVDIDSV-NRICADIAEVYAKGIEIGIVVG 53 (242) Q Consensus 9 IViKiGGSsL~~~~~~-----~~~~~~i-~~ia~~I~~~~~~g~~vviV~s 53 (242) ...-+=|+.+.++.+. .-|+..+ ..+.+.++.++++|+.++||+= T Consensus 3 a~fD~DgTLi~~ksg~~~~~~~~Dw~~~~p~V~~~L~~L~~~gY~iVIvTN 53 (158) T pfam08645 3 AAFDLDGTLIKTKSGKVFPKDADDWKWLYPSVPEKLKKLHEDGYKIVIFTN 53 (158) T ss_pred EEEECCCCEECCCCCCCCCCCCHHEEECCCCHHHHHHHHHHCCCEEEEEEC T ss_conf 999668952767889978999401188077889999999988987999947 No 132 >TIGR01487 SPP-like SPP-like hydrolase, Archaeal; InterPro: IPR006382 This group of Archaeal sequences is most closely related to the sucrose-phosphate phosphatases (SPP) from plants and cyanobacteria. If it should be shown that the proteins in this family possess sucrose-phosphatase activity then SPP would represent a broad family of functionally related proteins.. Probab=24.01 E-value=50 Score=13.63 Aligned_cols=34 Identities=24% Similarity=0.315 Sum_probs=22.8 Q ss_pred EEECCC-CCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC Q ss_conf 685388-77708999999999999999769909998236 Q gi|254780777|r 17 ALAGIS-GFGVDIDSVNRICADIAEVYAKGIEIGIVVGG 54 (242) Q Consensus 17 sL~~~~-~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg 54 (242) .||++. .+-++ .+..+.|.++.++|..|++|||= T Consensus 11 TLT~~~~~~~I~----~~A~e~iRk~~k~GI~V~LvtGN 45 (223) T TIGR01487 11 TLTDEEKNRLIS----LRAIEAIRKAEKKGIKVSLVTGN 45 (223) T ss_pred CCCCCCCCCCCC----HHHHHHHHHHHHCCCEEEEEECC T ss_conf 425444463108----88999998675278369998088 No 133 >COG0396 sufC Cysteine desulfurase activator ATPase [Posttranslational modification, protein turnover, chaperones] Probab=23.64 E-value=50 Score=13.58 Aligned_cols=30 Identities=27% Similarity=0.267 Sum_probs=14.6 Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 6853887770899999999999999976990999823654 Q gi|254780777|r 17 ALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGN 56 (242) Q Consensus 17 sL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~ 56 (242) +|..+||.+- +.++..|. |+.---|++|-. T Consensus 34 aiMGPNGsGK-----STLa~~i~-----G~p~Y~Vt~G~I 63 (251) T COG0396 34 AIMGPNGSGK-----STLAYTIM-----GHPKYEVTEGEI 63 (251) T ss_pred EEECCCCCCH-----HHHHHHHH-----CCCCCEEECCEE T ss_conf 9988998788-----99999972-----899746755569 No 134 >PRK13982 bifunctional SbtC-like/phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Provisional Probab=23.42 E-value=51 Score=13.56 Aligned_cols=41 Identities=22% Similarity=0.287 Sum_probs=29.8 Q ss_pred CCCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 88824999994003685388777089999999999999997699099982365 Q gi|254780777|r 3 DFPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 3 ~~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) .++=|||++=++||.=+ -.-++.+..|+++|.+|-+|.+-+ T Consensus 68 ~l~GKkIlLgVtGsIAA------------YKa~~LvR~L~k~Ga~V~vvmT~~ 108 (476) T PRK13982 68 SLGSKRITLIIGGGIAA------------YKALDLIRRLKERGAEVRCVLTKA 108 (476) T ss_pred CCCCCEEEEEECCHHHH------------HHHHHHHHHHHHCCCEEEEEECHH T ss_conf 88899799996748999------------999999999997899899997877 No 135 >pfam00162 PGK Phosphoglycerate kinase. Probab=23.18 E-value=51 Score=13.53 Aligned_cols=51 Identities=22% Similarity=0.207 Sum_probs=39.5 Q ss_pred CCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 8824999994003685388777089999999999999997699099982365 Q gi|254780777|r 4 FPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 4 ~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) ++-|||.+.+-=++=.. ++.-.|..||.+....|..+.++|-+++|.+.=| T Consensus 7 ~~gK~VllRvD~NvPi~-~g~I~Dd~RI~~~~pTI~~Ll~~gakvvi~SH~G 57 (383) T pfam00162 7 LKGKRVLVRVDFNVPLD-DGKITDDTRIRAALPTIKYLLEKGAKVVLLSHLG 57 (383) T ss_pred CCCCEEEEEECCCCCCC-CCEECCHHHHHHHHHHHHHHHHCCCEEEEEECCC T ss_conf 68999999932577754-9918976889999999999997899799995678 No 136 >COG1979 Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family [Energy production and conversion] Probab=23.10 E-value=52 Score=13.52 Aligned_cols=58 Identities=21% Similarity=0.438 Sum_probs=37.5 Q ss_pred CCC-CEEEEEEEEEEEEECCCCC----------------CC-CHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHH Q ss_conf 888-2499999400368538877----------------70-8999999999999999769909998236541023 Q gi|254780777|r 3 DFP-YKRVLLKVSGEALAGISGF----------------GV-DIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRG 60 (242) Q Consensus 3 ~~~-~KrIViKiGGSsL~~~~~~----------------~~-~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~ 60 (242) ++| ++||.+-.||-|+-...-| ++ -..+++.+-+-|.-+++++.++++-+|||+.... T Consensus 25 ei~~~~kVLi~YGGGSIKrnGvydqV~~~Lkg~~~~E~~GVEPNP~~~Tv~kaV~i~kee~idflLAVGGGSViD~ 100 (384) T COG1979 25 EIPKDAKVLIVYGGGSIKKNGVYDQVVEALKGIEVIEFGGVEPNPRLETLMKAVEICKEENIDFLLAVGGGSVIDG 100 (384) T ss_pred HCCCCCEEEEEECCCCCCCCCHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCHHHH T ss_conf 5545572999956862000251999999854866999448778954899999999999739639999568420011 No 137 >PRK09982 universal stress protein UspD; Provisional Probab=22.87 E-value=52 Score=13.49 Aligned_cols=48 Identities=10% Similarity=0.091 Sum_probs=23.4 Q ss_pred CCCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHH Q ss_conf 88249999940036853887770899999999999999976990999823654102334 Q gi|254780777|r 4 FPYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGGNIFRGSQ 62 (242) Q Consensus 4 ~~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~~~~~~~~ 62 (242) |.||.|.+-+.. +.+ ..++-+=|-.+++-.+. ++-+++=-......|. T Consensus 1 M~YkHILVAVdL---s~e------S~~Li~KAv~lAk~~~A--klSlIhvd~~~~elY~ 48 (142) T PRK09982 1 MAYKHIGVAISG---NEE------DALLVNKALELARHNDA--HLTLIHIDDGLSELYP 48 (142) T ss_pred CCCCEEEEEEEC---CHH------HHHHHHHHHHHHHHCCC--EEEEEEEECCCHHHCC T ss_conf 985179999855---841------69999999999876098--3999999088466511 No 138 >TIGR01494 ATPase_P-type ATPase, P-type (transporting), HAD superfamily, subfamily IC; InterPro: IPR001757 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles . P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H^+, Na^+, K^+, Mg^2+, Ca^2+, Ag^+ and Ag^2+, Zn^2+, Co^2+, Pb^2+, Ni^2+, Cd^2+, Cu^+ and Cu^2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the several classes of P-type ATPases, including those that transport K^+ (3.6.3.12 from EC), Mg^2+ (3.6.3.2 from EC), Cd^2+ (3.6.3.3 from EC), Cu ^2+ (3.6.3.4 from EC), Zn^2+ (3.6.3.5 from EC), Na^+ (3.6.3.7 from EC), Ca^2+ (3.6.3.8 from EC), Na^+/K^+ (3.6.3.9 from EC), and H^+/K^+ (3.6.3.10 from EC). These P-ATPases are found in both prokaryotes and eukaryotes. More information about this protein can be found at Protein of the Month: ATP Synthases . ; GO: 0005524 ATP binding, 0015662 ATPase activity coupled to transmembrane movement of ions phosphorylative mechanism, 0006810 transport, 0016020 membrane. Probab=22.76 E-value=52 Score=13.48 Aligned_cols=24 Identities=4% Similarity=0.032 Sum_probs=20.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCH Q ss_conf 999999997499699951898547 Q gi|254780777|r 202 CASVVLARDCSIPIIVFSIHSPGG 225 (242) Q Consensus 202 ~~A~~~a~~~gI~v~I~ng~~~~~ 225 (242) ..+++.+++.||.+|+..|.++++ T Consensus 332 ~~~i~~L~~agi~v~mlTGD~~~t 355 (478) T TIGR01494 332 KETIEELKRAGIKVWMLTGDNVET 355 (478) T ss_pred HHHHHHHHHCCCEEEEEECCCHHH T ss_conf 899999986599899995798799 No 139 >pfam06581 DUF1135 Protein of unknown function (DUF1135). This family consists of several hypothetical mammalian proteins of unknown function. Probab=22.64 E-value=35 Score=14.53 Aligned_cols=21 Identities=19% Similarity=-0.050 Sum_probs=8.7 Q ss_pred CCHHHHHHHHHHHCHHHHHHC Q ss_conf 621578888864164466514 Q gi|254780777|r 144 TTDSAAALRASEIGADVILKG 164 (242) Q Consensus 144 ~~D~~a~~ia~~~~ad~l~i~ 164 (242) ....+-.+.-.++-||.+-.+ T Consensus 148 t~~ClR~Lfr~lf~aD~fsel 168 (239) T pfam06581 148 TAACLRRLFRAIFLADPFSEL 168 (239) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 889999999999986245553 No 140 >COG0616 SppA Periplasmic serine proteases (ClpP class) [Posttranslational modification, protein turnover, chaperones / Intracellular trafficking and secretion] Probab=22.61 E-value=53 Score=13.46 Aligned_cols=14 Identities=29% Similarity=0.444 Sum_probs=6.3 Q ss_pred EEEEEEEEEEEECC Q ss_conf 99999400368538 Q gi|254780777|r 8 RVLLKVSGEALAGI 21 (242) Q Consensus 8 rIViKiGGSsL~~~ 21 (242) .-||-+-|+..... T Consensus 61 Iavi~~~G~I~~~~ 74 (317) T COG0616 61 IAVIHVEGAIVAGG 74 (317) T ss_pred EEEEEEEEEEECCC T ss_conf 99997454653487 No 141 >PRK13936 phosphoheptose isomerase; Provisional Probab=22.38 E-value=53 Score=13.43 Aligned_cols=30 Identities=10% Similarity=0.140 Sum_probs=22.3 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCHHHHHHC Q ss_conf 999999997499699951898547999977 Q gi|254780777|r 202 CASVVLARDCSIPIIVFSIHSPGGIWGGLS 231 (242) Q Consensus 202 ~~A~~~a~~~gI~v~I~ng~~~~~i~~~l~ 231 (242) +.|++.|++.|+.++-+-|++-+.+...++ T Consensus 128 i~A~~~A~~~g~~~i~ltG~dgg~l~~l~~ 157 (197) T PRK13936 128 IQAIQAAHEREMHVVALTGRDGGKMASLLL 157 (197) T ss_pred HHHHHHHHHCCCEEEEEECCCCHHHHHHCC T ss_conf 999999998599599998168535675256 No 142 >PRK09552 mtnX 2-hydroxy-3-keto-5-methylthiopentenyl-1-phosphate phosphatase; Reviewed Probab=21.49 E-value=56 Score=13.32 Aligned_cols=20 Identities=10% Similarity=0.125 Sum_probs=13.8 Q ss_pred HHHHHHHCCCCEEEEECCCC Q ss_conf 99999974996999518985 Q gi|254780777|r 204 SVVLARDCSIPIIVFSIHSP 223 (242) Q Consensus 204 A~~~a~~~gI~v~I~ng~~~ 223 (242) -++.+++.|||-.-++.+.. T Consensus 187 L~~~c~~~gip~~~F~~F~D 206 (219) T PRK09552 187 LITKCEENHISYTPFETFHD 206 (219) T ss_pred HHHHHHHCCCCCCCCCCHHH T ss_conf 99999985999634399999 No 143 >COG0561 Cof Predicted hydrolases of the HAD superfamily [General function prediction only] Probab=21.39 E-value=56 Score=13.31 Aligned_cols=45 Identities=16% Similarity=0.259 Sum_probs=33.0 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 824999994003685388777089999999999999997699099982365 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) .+|-|+.-+-|+.|.+... +. .+..+.|.+++++|+.+++++|-. T Consensus 2 ~~kli~~DlDGTLl~~~~~--i~----~~~~~al~~~~~~g~~v~iaTGR~ 46 (264) T COG0561 2 MIKLLAFDLDGTLLDSNKT--IS----PETKEALARLREKGVKVVLATGRP 46 (264) T ss_pred CCEEEEEECCCCCCCCCCC--CC----HHHHHHHHHHHHCCCEEEEECCCC T ss_conf 8569999476667489998--89----999999999997829899988998 No 144 >PRK13962 bifunctional phosphoglycerate kinase/triosephosphate isomerase; Provisional Probab=21.21 E-value=56 Score=13.28 Aligned_cols=51 Identities=16% Similarity=0.128 Sum_probs=40.7 Q ss_pred CCEEEEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC Q ss_conf 824999994003685388777089999999999999997699099982365 Q gi|254780777|r 5 PYKRVLLKVSGEALAGISGFGVDIDSVNRICADIAEVYAKGIEIGIVVGGG 55 (242) Q Consensus 5 ~~KrIViKiGGSsL~~~~~~~~~~~~i~~ia~~I~~~~~~g~~vviV~sg~ 55 (242) +=|||.+.+-=++=.++++.-.|..||.+....|..+.++|-+++|.+-=| T Consensus 12 ~gK~VllRvDlNVPi~~~g~I~Dd~RI~~~lpTI~~ll~~gakvvi~SHlG 62 (653) T PRK13962 12 KGKKVFVRVDFNVPLDEDGNITDDTRIRAALPTIKYALDRGAKVILLSHLG 62 (653) T ss_pred CCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC T ss_conf 899899992556875897908978899999999999997889799980688 No 145 >PRK13938 phosphoheptose isomerase; Provisional Probab=21.16 E-value=57 Score=13.28 Aligned_cols=29 Identities=17% Similarity=0.359 Sum_probs=20.4 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEEECCCC Q ss_conf 99999999999999976990999823654 Q gi|254780777|r 28 IDSVNRICADIAEVYAKGIEIGIVVGGGN 56 (242) Q Consensus 28 ~~~i~~ia~~I~~~~~~g~~vviV~sg~~ 56 (242) .+.++.+++.+.+..+.|.++.+.=-||+ T Consensus 28 ~~~i~~~~~~i~~~~~~g~kI~~cGNGGS 56 (196) T PRK13938 28 LEAARAIGDRLIAGYRAGARVFMCGNGGS 56 (196) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEEECCHH T ss_conf 99999999999999987998999968688 No 146 >cd03239 ABC_SMC_head The structural maintenance of chromosomes (SMC) proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and each has five distinct domains: amino- and carboxy-terminal globular domains, which contain sequences characteristic of ATPases, two coiled-coil regions separating the terminal domains , and a central flexible hinge. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair, and epigenetic silencing of gene expression. Probab=21.13 E-value=57 Score=13.27 Aligned_cols=19 Identities=32% Similarity=0.429 Sum_probs=8.7 Q ss_pred EEEEECCCCCCCCHHHHHHH Q ss_conf 03685388777089999999 Q gi|254780777|r 15 GEALAGISGFGVDIDSVNRI 34 (242) Q Consensus 15 GSsL~~~~~~~~~~~~i~~i 34 (242) =+++..++|.| +...++.+ T Consensus 24 ~~~ivG~nGsG-KSni~~ai 42 (178) T cd03239 24 FNAIVGPNGSG-KSNIVDAI 42 (178) T ss_pred EEEEECCCCCC-HHHHHHHH T ss_conf 17998999887-78999999 No 147 >TIGR00075 hypD hydrogenase expression/formation protein HypD; InterPro: IPR002780 HypD is involved in the hyp operon which is needed for the activity of the three hydrogenase isoenzymes in Escherichia coli. HypD is one of the genes needed for formation of these enzymes . This protein has been found in Gram-negative and Gram-positive bacteria and Archaea. HypD contains many possible metal binding residues, which may bind to nickel. Transposon Tn5 insertions into HypD resulted in Rhizobium leguminosarum mutants that lacked any hydrogenase activity in symbiosis with peas .; GO: 0046872 metal ion binding. Probab=21.04 E-value=57 Score=13.26 Aligned_cols=31 Identities=23% Similarity=0.328 Sum_probs=21.2 Q ss_pred CCCCHHH-HHHHHHCCCCEEEEECCCCCHHHHH Q ss_conf 8140999-9999974996999518985479999 Q gi|254780777|r 198 KVMDCAS-VVLARDCSIPIIVFSIHSPGGIWGG 229 (242) Q Consensus 198 ~v~~~~A-~~~a~~~gI~v~I~ng~~~~~i~~~ 229 (242) .++-.++ ..++.+++||.+|+ ||+|--|..+ T Consensus 211 ti~G~~~Y~~i~~~y~~P~Vva-GFEP~DiL~a 242 (384) T TIGR00075 211 TIIGVKPYEPIAEKYKIPIVVA-GFEPVDILQA 242 (384) T ss_pred EEECHHHHHHHHHHCCCCEEEE-CCCHHHHHHH T ss_conf 8515243488998568987996-8886799999 No 148 >PRK11133 serB phosphoserine phosphatase; Provisional Probab=20.38 E-value=59 Score=13.18 Aligned_cols=31 Identities=13% Similarity=0.025 Sum_probs=15.6 Q ss_pred CCCHHHHHHHHHHHCHHHHHH--CCCCCCCCCC Q ss_conf 762157888886416446651--4754351234 Q gi|254780777|r 143 LTTDSAAALRASEIGADVILK--GTQVDGVYSA 173 (242) Q Consensus 143 ~~~D~~a~~ia~~~~ad~l~i--~tdvdGiyt~ 173 (242) |+=...+..++..++-|...= +--.||..|. T Consensus 205 GGFt~fa~~l~~~Lg~D~~~AN~Lei~dg~LTG 237 (322) T PRK11133 205 GGFTYFADYLKDKLRLDAAVANELEIMDGKLTG 237 (322) T ss_pred CCHHHHHHHHHHHCCCCEEEEEEEEEECCEEEC T ss_conf 881799999999739886870267987899744 Done!