Query 537021.9.peg.394_1 Match_columns 142 No_of_seqs 115 out of 2586 Neff 7.1 Searched_HMMs 39220 Date Tue May 24 22:25:07 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_394.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 cd04237 AAK_NAGS-ABP AAK_NAGS- 100.0 0 0 291.1 13.5 140 1-140 140-280 (280) 2 CHL00202 argB acetylglutamate 100.0 0 0 290.5 13.3 141 1-141 140-283 (284) 3 PRK05279 N-acetylglutamate syn 100.0 0 0 285.7 13.7 141 1-141 147-290 (441) 4 cd04250 AAK_NAGK-C AAK_NAGK-C: 100.0 0 0 283.1 13.5 140 1-140 137-279 (279) 5 PRK00942 acetylglutamate kinas 100.0 0 0 281.6 14.6 141 1-141 122-262 (262) 6 cd04238 AAK_NAGK-like AAK_NAGK 100.0 0 0 279.3 13.2 140 1-140 117-256 (256) 7 cd04249 AAK_NAGK-NC AAK_NAGK-N 100.0 0 0 277.1 13.3 138 1-140 115-252 (252) 8 COG0548 ArgB Acetylglutamate k 100.0 0 0 275.6 13.5 142 1-142 122-265 (265) 9 cd04252 AAK_NAGK-fArgBP AAK_NA 100.0 8.4E-45 0 268.1 12.0 139 1-140 109-248 (248) 10 cd04251 AAK_NAGK-UC AAK_NAGK-U 100.0 5.2E-44 0 263.6 13.1 134 1-140 124-257 (257) 11 PRK04531 acetylglutamate kinas 100.0 2.6E-38 6.7E-43 232.1 10.1 140 1-141 131-271 (421) 12 cd04236 AAK_NAGS-Urea AAK_NAGS 100.0 3.6E-38 9.2E-43 231.4 10.7 136 1-140 132-271 (271) 13 TIGR00761 argB acetylglutamate 100.0 4.4E-38 1.1E-42 230.9 8.3 117 1-117 133-254 (254) 14 TIGR01890 N-Ac-Glu-synth amino 100.0 4.7E-37 1.2E-41 225.3 9.7 141 1-141 139-287 (439) 15 cd02115 AAK Amino Acid Kinases 100.0 1.7E-29 4.4E-34 183.5 11.9 133 1-140 107-248 (248) 16 cd04241 AAK_FomA-like AAK_FomA 100.0 4.3E-29 1.1E-33 181.3 12.1 137 2-140 107-252 (252) 17 PRK12353 carbamate kinase; Rev 99.9 5.3E-25 1.4E-29 158.7 11.0 130 9-142 170-312 (312) 18 cd04256 AAK_P5CS_ProBA AAK_P5C 99.9 4.7E-24 1.2E-28 153.5 10.6 130 10-141 137-284 (284) 19 PRK12314 gamma-glutamyl kinase 99.9 3.2E-22 8.2E-27 143.4 11.3 126 13-142 124-263 (265) 20 PRK12454 carbamate kinase-like 99.9 8E-22 2E-26 141.1 12.4 129 8-142 171-313 (313) 21 pfam00696 AA_kinase Amino acid 99.9 1.2E-21 3.1E-26 140.2 10.1 115 5-119 106-230 (230) 22 cd04235 AAK_CK AAK_CK: Carbama 99.9 1.9E-21 4.9E-26 139.0 10.9 126 10-141 169-308 (308) 23 cd04242 AAK_G5K_ProB AAK_G5K_P 99.9 5.1E-21 1.3E-25 136.7 11.6 127 10-141 110-251 (251) 24 TIGR01027 proB glutamate 5-kin 99.9 3.4E-21 8.7E-26 137.7 9.8 128 10-142 115-263 (379) 25 COG1608 Predicted archaeal kin 99.8 1.6E-20 4.2E-25 133.9 11.0 138 3-142 108-252 (252) 26 PRK05429 gamma-glutamyl kinase 99.8 2.3E-20 5.7E-25 133.2 10.3 128 10-142 119-261 (372) 27 PRK13402 gamma-glutamyl kinase 99.8 3E-20 7.6E-25 132.5 10.2 128 10-142 112-254 (363) 28 PRK12686 carbamate kinase; Rev 99.8 6.2E-20 1.6E-24 130.7 11.1 131 6-141 167-312 (313) 29 PRK12352 putative carbamate ki 99.8 1.4E-19 3.5E-24 128.8 11.5 127 10-141 174-314 (316) 30 KOG2436 consensus 99.8 5E-21 1.3E-25 136.8 2.6 141 1-142 215-359 (520) 31 TIGR00746 arcC carbamate kinas 99.8 1.1E-19 2.7E-24 129.4 8.0 129 9-142 176-321 (321) 32 PRK12354 carbamate kinase; Rev 99.8 4.5E-19 1.1E-23 126.0 10.6 125 10-142 164-301 (302) 33 COG0549 ArcC Carbamate kinase 99.8 1.6E-18 4E-23 123.0 10.3 131 7-142 168-312 (312) 34 COG0263 ProB Glutamate 5-kinas 99.8 4.8E-18 1.2E-22 120.3 10.8 128 10-142 117-259 (369) 35 cd04239 AAK_UMPK-like AAK_UMPK 99.8 4.2E-18 1.1E-22 120.6 9.6 120 8-141 104-229 (229) 36 cd04253 AAK_UMPK-PyrH-Pf AAK_U 99.7 1.5E-16 3.9E-21 112.0 11.8 129 4-141 85-221 (221) 37 PRK09411 carbamate kinase; Rev 99.7 8.7E-17 2.2E-21 113.3 10.1 124 10-141 164-296 (297) 38 PRK00358 pyrH uridylate kinase 99.7 2.4E-16 6E-21 111.0 11.6 119 9-141 107-231 (231) 39 TIGR01092 P5CS delta l-pyrroli 99.6 1.9E-15 4.8E-20 106.0 4.8 129 12-141 141-292 (738) 40 cd04254 AAK_UMPK-PyrH-Ec UMP k 99.5 5.6E-14 1.4E-18 97.9 10.3 120 7-140 105-230 (231) 41 cd04240 AAK_UC AAK_UC: Unchara 99.5 3.6E-14 9.2E-19 98.9 8.2 119 12-140 81-203 (203) 42 TIGR00657 asp_kinases aspartat 99.5 4.7E-14 1.2E-18 98.2 8.7 122 5-142 177-311 (504) 43 KOG1154 consensus 99.5 2.2E-14 5.7E-19 100.1 5.4 124 13-140 136-273 (285) 44 TIGR02076 pyrH_arch uridylate 99.3 1E-11 2.7E-16 85.3 9.0 119 16-141 103-232 (232) 45 PRK07431 aspartate kinase; Pro 99.2 5.8E-11 1.5E-15 81.2 9.4 120 6-141 112-243 (594) 46 cd04260 AAK_AKi-DapG-BS AAK_AK 99.2 1.3E-10 3.4E-15 79.2 10.9 119 6-140 115-243 (244) 47 PRK08841 aspartate kinase; Val 99.2 8.9E-11 2.3E-15 80.2 9.9 120 6-141 112-241 (392) 48 COG0527 LysC Aspartokinases [A 99.2 7.2E-11 1.8E-15 80.7 9.1 120 7-142 158-289 (447) 49 cd04255 AAK_UMPK-MosAB AAK_UMP 99.2 1.7E-10 4.4E-15 78.6 10.9 129 7-141 120-262 (262) 50 PRK08210 aspartate kinase I; R 99.2 1.2E-10 3.1E-15 79.4 9.6 120 6-141 117-246 (405) 51 cd04246 AAK_AK-DapG-like AAK_A 99.2 1.6E-10 4.1E-15 78.7 9.5 118 7-140 111-238 (239) 52 cd04261 AAK_AKii-LysC-BS AAK_A 99.2 1.3E-10 3.4E-15 79.2 8.3 118 7-140 111-238 (239) 53 PRK06635 aspartate kinase; Rev 99.2 2.7E-10 7E-15 77.5 9.8 121 6-142 112-242 (402) 54 cd04234 AAK_AK AAK_AK: Amino A 99.1 1.1E-09 2.7E-14 74.2 10.0 120 7-141 97-227 (227) 55 PRK09034 aspartate kinase; Rev 99.1 3.2E-10 8.1E-15 77.1 7.1 119 9-142 161-289 (450) 56 PRK06291 aspartate kinase; Pro 99.1 1.4E-09 3.4E-14 73.7 9.8 112 15-141 181-302 (466) 57 PRK05925 aspartate kinase; Pro 99.1 8.7E-10 2.2E-14 74.7 8.8 111 16-141 159-279 (440) 58 cd04245 AAK_AKiii-YclM-BS AAK_ 99.1 2E-09 5.1E-14 72.7 9.9 116 11-141 163-288 (288) 59 cd04244 AAK_AK-LysC-like AAK_A 99.0 3.4E-09 8.7E-14 71.4 9.8 112 14-140 176-297 (298) 60 cd04247 AAK_AK-Hom3 AAK_AK-Hom 99.0 3.4E-09 8.8E-14 71.4 9.5 107 21-142 189-305 (306) 61 cd04243 AAK_AK-HSDH-like AAK_A 99.0 4.3E-09 1.1E-13 70.8 9.7 109 17-140 174-292 (293) 62 cd04257 AAK_AK-HSDH AAK_AK-HSD 99.0 4.1E-09 1E-13 71.0 9.4 110 16-140 174-293 (294) 63 PRK12443 uridylate kinase; Rev 99.0 1.1E-08 2.8E-13 68.6 11.4 119 9-142 112-238 (247) 64 PRK08373 aspartate kinase; Val 99.0 4.1E-09 1.1E-13 71.0 8.4 112 12-141 155-276 (359) 65 PRK09466 metL bifunctional asp 99.0 5.7E-09 1.4E-13 70.2 9.1 115 13-142 174-300 (810) 66 TIGR02075 pyrH_bact uridylate 99.0 6.4E-09 1.6E-13 69.9 9.2 115 12-141 114-236 (236) 67 cd04258 AAK_AKiii-LysC-EC AAK_ 98.9 7.4E-09 1.9E-13 69.6 9.2 108 18-140 174-291 (292) 68 COG2054 Uncharacterized archae 98.9 2.1E-09 5.4E-14 72.6 5.8 117 17-142 88-210 (212) 69 COG0528 PyrH Uridylate kinase 98.9 2.7E-08 6.8E-13 66.5 11.4 119 9-142 112-238 (238) 70 PRK09084 aspartate kinase III; 98.9 9.4E-09 2.4E-13 69.0 9.0 106 21-141 173-288 (447) 71 PRK09436 thrA bifunctional asp 98.9 1.1E-08 2.8E-13 68.6 8.7 112 16-142 176-297 (817) 72 PRK08961 bifunctional aspartat 98.8 2.7E-08 6.8E-13 66.5 9.1 116 11-141 177-303 (865) 73 cd04259 AAK_AK-DapDC AAK_AK-Da 98.8 4E-08 1E-12 65.5 8.4 111 15-140 173-294 (295) 74 cd04248 AAK_AK-Ectoine AAK_AK- 98.5 1.4E-06 3.5E-11 57.1 8.8 110 15-140 183-303 (304) 75 PRK09181 aspartate kinase; Val 98.3 5.5E-06 1.4E-10 53.7 8.8 106 20-141 195-311 (476) 76 TIGR00656 asp_kin_monofn aspar 98.1 6.7E-06 1.7E-10 53.2 6.0 86 6-91 161-261 (480) 77 TIGR02078 AspKin_pair Pyrococc 98.1 1.2E-05 3E-10 51.9 7.2 106 11-124 153-268 (341) 78 KOG0456 consensus 96.5 0.012 3E-07 35.4 6.7 104 21-142 258-375 (559) 79 PRK02797 4-alpha-L-fucosyltran 62.3 11 0.00027 19.0 3.6 60 1-64 188-250 (358) 80 COG1778 Low specificity phosph 53.9 10 0.00026 19.1 2.3 21 61-81 11-38 (170) 81 TIGR01670 YrbI-phosphatas 3-de 53.6 18 0.00046 17.8 3.5 21 61-81 4-31 (154) 82 TIGR01225 hutH histidine ammon 50.7 6.7 0.00017 20.1 0.9 91 1-95 107-221 (529) 83 TIGR02902 spore_lonB ATP-depen 47.3 21 0.00053 17.4 3.0 33 94-130 481-513 (532) 84 PRK09484 3-deoxy-D-manno-octul 46.9 5.2 0.00013 20.7 -0.1 11 61-71 27-37 (186) 85 PRK09367 histidine ammonia-lya 46.2 13 0.00033 18.5 1.8 29 1-30 110-138 (504) 86 TIGR02405 trehalos_R_Ecol treh 46.0 13 0.00034 18.5 1.8 10 18-27 85-94 (311) 87 TIGR02478 6PF1K_euk 6-phosphof 44.3 17 0.00042 18.0 2.1 89 44-135 19-129 (777) 88 pfam00221 PAL Phenylalanine an 44.2 14 0.00036 18.3 1.8 28 2-30 108-135 (477) 89 COG2986 HutH Histidine ammonia 43.1 13 0.00032 18.6 1.4 26 2-28 111-136 (498) 90 COG0283 Cmk Cytidylate kinase 42.5 30 0.00077 16.5 5.6 70 58-142 67-136 (222) 91 pfam11135 DUF2888 Protein of u 41.9 7.9 0.0002 19.7 0.2 29 113-142 89-117 (146) 92 pfam07429 Fuc4NAc_transf 4-alp 38.2 36 0.00091 16.1 3.8 60 1-64 190-252 (361) 93 TIGR01772 MDH_euk_gproteo mala 38.0 29 0.00073 16.7 2.5 76 14-90 81-186 (379) 94 KOG1494 consensus 36.2 38 0.00098 15.9 3.3 52 12-64 87-146 (345) 95 PRK09802 DNA-binding transcrip 33.5 42 0.0011 15.8 2.8 20 36-55 107-126 (269) 96 cd00332 PAL-HAL Phenylalanine 33.3 24 0.00061 17.1 1.5 28 2-30 103-130 (444) 97 pfam06555 consensus 32.2 15 0.00037 18.3 0.3 67 17-87 80-152 (200) 98 PHA00451 protein kinase 31.8 15 0.00038 18.2 0.3 44 16-60 200-248 (364) 99 TIGR01226 phe_am_lyase phenyla 30.7 26 0.00067 16.8 1.4 40 2-42 157-197 (714) 100 cd04167 Snu114p Snu114p subfam 29.7 5.7 0.00015 20.5 -2.1 116 2-120 4-132 (213) 101 TIGR01188 drrA daunorubicin re 28.1 26 0.00067 16.8 1.0 27 91-119 127-153 (343) 102 PRK11377 dihydroxyacetone kina 26.3 57 0.0015 15.0 3.5 55 13-67 19-73 (472) 103 COG1658 Small primase-like pro 26.1 44 0.0011 15.6 1.9 55 11-79 20-74 (127) 104 KOG1447 consensus 26.0 52 0.0013 15.2 2.2 49 1-49 193-250 (412) 105 PRK13597 imidazole glycerol ph 25.5 59 0.0015 14.9 5.3 18 48-65 115-132 (252) 106 COG1634 Uncharacterized Rossma 25.5 37 0.00094 16.1 1.4 21 41-62 158-178 (232) 107 TIGR02364 dha_pts dihydroxyace 22.1 70 0.0018 14.5 3.7 70 12-91 17-88 (128) 108 cd01173 pyridoxal_pyridoxamine 22.0 70 0.0018 14.5 7.9 24 99-122 160-183 (254) 109 PRK06953 short chain dehydroge 22.0 70 0.0018 14.5 7.0 112 5-119 8-129 (222) 110 COG0149 TpiA Triosephosphate i 21.6 65 0.0017 14.7 2.0 40 13-52 112-154 (251) 111 TIGR00017 cmk cytidylate kinas 20.9 74 0.0019 14.4 6.0 84 42-142 55-141 (223) 112 pfam11322 DUF3124 Protein of u 20.2 7.7 0.0002 19.8 -3.0 53 23-80 8-60 (126) No 1 >cd04237 AAK_NAGS-ABP AAK_NAGS-ABP: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the arginine-biosynthesis pathway (ABP) found in gamma- and beta-proteobacteria and higher plant chloroplasts. Domain architecture of these NAGS consisted of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal NAG synthase, acetyltransferase (ArgA) domain. Both bacterial and plant sequences in this CD have a conserved N-terminal extension; a similar sequence in the NAG kinases of the cyclic arginine-biosynthesis pathway has been implicated in feedback inhibition sensing. Plant sequences also have an N-terminal chloroplast transit peptide and an insert (approx. 70 residues) in the C-terminal region of ArgB. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK). Probab=100.00 E-value=0 Score=291.06 Aligned_cols=140 Identities=29% Similarity=0.555 Sum_probs=137.4 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) +||||+|++||++.|+.+|++|+||||+|+|++++|+.||+|+|++|+++|.+|+|+||||+||++|+++++++++++++ T Consensus 140 ~G~vG~v~~Vn~~~l~~ll~~~~iPVIspig~~~~G~~~NvNAD~~A~~iA~aL~A~kLi~ltdv~Gvl~~~~~~i~~l~ 219 (280) T cd04237 140 FGHTGEVRRIDADAIRRQLDQGSIVLLSPLGYSPTGEVFNLSMEDVATAVAIALKADKLIFLTDGPGLLDDDGELIRELT 219 (280) T ss_pred CCCCCCCCEEEHHHHHHHHHCCCEEEEECCEECCCCCEEEECHHHHHHHHHHHHHHCEEEEECCCCCEECCCCCCCCCCC T ss_conf 46798302664999986875896589604318899988700899999999999744709987188832789998865049 Q ss_pred HHHHHHHHHC-CCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 8899999626-97678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 81 INEARTLIKD-GTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 81 ~~e~~~l~~~-~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) .+|+++++++ +++++||+||+++|++|+++||+++||+|++.||+||.||||++|+||+| T Consensus 220 ~~e~~~ll~~~~~i~~GMipKl~~a~~Al~~GV~~vhIidg~~~~aLL~ElfT~~G~GT~I 280 (280) T cd04237 220 AQEAEALLETGALLTNDTARLLQAAIEACRGGVPRVHLISYAEDGALLLELFTRDGVGTLI 280 (280) T ss_pred HHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHCCCCCCCCC T ss_conf 9999999984798887789999999999983999899905999971467760699986709 No 2 >CHL00202 argB acetylglutamate kinase; Provisional Probab=100.00 E-value=0 Score=290.50 Aligned_cols=141 Identities=50% Similarity=0.874 Sum_probs=137.6 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEE---ECCCCCCC Q ss_conf 985520135569999999977984998574317873200661479999999862511334322676335---34555556 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVL---DKNRQLIS 77 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~---d~~~~~i~ 77 (142) +||||+|++||++.|+.+|++|+||||+|+|++++|+.||+|+|++|+++|.+|+|+||||+||++|++ ++++++++ T Consensus 140 ~g~vG~V~~Vd~~~I~~lL~~g~IpVIspig~~~~G~~~NvnaD~~A~~iA~aL~A~KLI~ltdv~Gvl~d~~~~~~li~ 219 (284) T CHL00202 140 LGLVGEIQQVDPQLIDMLLEKNYIPVIASVAADHDGQTYNINADVVAGEIAAKLNAEKLILLTDTPGILADINDPNSLIS 219 (284) T ss_pred CCCCCCCEEECHHHHHHHHHCCCEEEEECCEECCCCCEEEECHHHHHHHHHHHHCCCCEEEEECCCCEECCCCCCCCCCC T ss_conf 44444412326799999970897599804406799998950999999999998641969999889671067887653244 Q ss_pred CCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 3388899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 78 KLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 78 ~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +++.+|++++++++++.+||+||+++|++|+++||+|+||+|++.|++||.||||++|+||+|| T Consensus 220 ~l~~~e~~~l~~~~~i~~gm~pkl~aa~~Al~~GV~rvhiidg~~~~aLL~ELfT~~G~GT~Iv 283 (284) T CHL00202 220 TLNIKEARNLASTGIISGGMIPKVNCCIRALAQGVEAAHIIDGKEKHALLLEILTEKGIGSMLV 283 (284) T ss_pred CCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCEEHHHHHCCCCCEEEEE T ss_conf 4899999999875987874699999999999859998999258899700056527999814884 No 3 >PRK05279 N-acetylglutamate synthase; Validated Probab=100.00 E-value=0 Score=285.70 Aligned_cols=141 Identities=26% Similarity=0.517 Sum_probs=137.4 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) +||||+|++||++.|+.+|++|+||||+|+|++++|++||+|+|++|+++|.+|+|+||||+||++|++|.++++|++++ T Consensus 147 ~g~vG~V~~Vd~~~I~~~L~~g~IpvisplG~s~~Ge~fNinad~vA~~iA~aL~A~KLI~ltd~~Gi~d~~g~li~~lt 226 (441) T PRK05279 147 YCHTGEVRRIDTEAIRRQLDNGAIVLLSPLGYSPTGESFNLTMEEVATQVAIALKADKLIFFTESQGILDEDGELIRELS 226 (441) T ss_pred CCCCCEEEEECHHHHHHHHHCCCEEEECCEEECCCCCEECCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCC T ss_conf 67541587973899999997898299866463899998666999999999997699889985288764289998753079 Q ss_pred HHHHHHHHHC---CCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 8899999626---976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 81 INEARTLIKD---GTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 81 ~~e~~~l~~~---~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) .+++++++++ +.+++||.+|+++|.+||++||+|+||+|++.||+||.||||++|+||||. T Consensus 227 ~~ea~~ll~~~~~~~i~~g~~~kl~~a~~A~~~GV~RvHiIdg~~~gaLL~ELFT~~GvGTmI~ 290 (441) T PRK05279 227 PNEAQALLERLEQGDLNSGTARFLRAAVKACRGGVPRSHLISYAEDGALLQELFTRDGIGTQIV 290 (441) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHCCCCEEEEEE T ss_conf 9999999975317888845699999999999839886997568788418877625887189885 No 4 >cd04250 AAK_NAGK-C AAK_NAGK-C: N-Acetyl-L-glutamate kinase - cyclic (NAGK-C) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in some bacteria and photosynthetic organisms using the non-acetylated, cyclic route of ornithine biosynthesis. In this pathway, glutamate is first N-acetylated and then phosphorylated by NAGK to give phosphoryl NAG, which is converted to NAG-ornithine. There are two variants of this pathway. In one, typified by the pathway in Thermotoga maritima and Pseudomonas aeruginosa, the acetyl group is recycled by reversible transacetylation from acetylornithine to glutamate. The phosphorylation of NAG by NAGK is feedback inhibited by arginine. In photosynthetic organisms, NAGK is the target of the nitrogen-signaling protein PII. Hexameric formation of NAGK domains appears to be essential to both arginine inhibition and NAGK-PII complex formation. NAGK-C are members of the Amino A Probab=100.00 E-value=0 Score=283.06 Aligned_cols=140 Identities=59% Similarity=0.955 Sum_probs=137.0 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEEC---CCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534---555556 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDK---NRQLIS 77 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~---~~~~i~ 77 (142) +||||+|++||.+.|+.+|++|+||||+|++++++|+.||+|+|++|+++|.+|+|+||||+||++|++++ ++++++ T Consensus 137 ~g~vG~v~~vd~~~i~~ll~~g~IpVIsplg~~~~G~~~NvnaD~~A~~lA~aL~AdkLi~ltdv~Gv~~~~~~~~~~i~ 216 (279) T cd04250 137 LGFVGEVTEVNPELLETLLEAGYIPVIAPVGVGEDGETYNINADTAAGAIAAALKAEKLILLTDVAGVLDDPNDPGSLIS 216 (279) T ss_pred CCCCCCCCCCCHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCC T ss_conf 66456655227899999985797489646416898978668899999999998644949999588111058888754002 Q ss_pred CCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 338889999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 78 KLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 78 ~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) +++.+|++++++++++++||+||+++|.+|+++||+||||+||+.|++||.||||++|+||+| T Consensus 217 ~lt~~e~~~li~~~~i~ggM~pKl~aa~~Al~~GV~rvhIidg~~~~~LL~ELft~~G~GT~I 279 (279) T cd04250 217 EISLKEAEELIADGIISGGMIPKVEACIEALEGGVKAAHIIDGRVPHSLLLEIFTDEGIGTMI 279 (279) T ss_pred CCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHCCCCCCCCC T ss_conf 589999999997597689749999999999984999799946989887999981799986609 No 5 >PRK00942 acetylglutamate kinase; Provisional Probab=100.00 E-value=0 Score=281.64 Aligned_cols=141 Identities=55% Similarity=0.953 Sum_probs=139.1 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) +||||+|++||.+.|+.+|++|+|||++|++++++|+.+|+|+|++|+++|.+|+|+||||+||++|+|++++++|++++ T Consensus 122 ~g~vG~v~~v~~~~I~~ll~~g~IpVispig~~~~G~~~ninaD~~Aa~iA~~l~AdkLi~ltdv~Gv~~~~g~~i~~i~ 201 (262) T PRK00942 122 LGYVGEVEEVNPALLETLLEAGYIPVISPIGVGEDGETYNVNADTAAGAIAAALGAEKLILLTDVPGVLDDKGSLISELT 201 (262) T ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCCCEEEEEECCCCEECCCCCCCCCCC T ss_conf 76204616666899999984797579626418898878557799999999998467949999588761578875143106 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 8899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 81 INEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 81 ~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) .+|++++++++++++||.||+++|.+|+++|++||||+||+.|++|+.||||++|+||+|+ T Consensus 202 ~~e~~~l~~~~~~~ggM~pKl~aa~~a~~~Gv~rv~I~~g~~~~~ll~elft~~G~GT~i~ 262 (262) T PRK00942 202 AEEAEELIEDGVITGGMIPKVEAALEAARSGVRSVHIIDGRVPHSLLLELFTDEGIGTMIV 262 (262) T ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHCCCCCCCEEC T ss_conf 9999999865976896099999999999819998999369998779999817999852569 No 6 >cd04238 AAK_NAGK-like AAK_NAGK-like: N-Acetyl-L-glutamate kinase (NAGK)-like . Included in this CD are the Escherichia coli and Pseudomonas aeruginosa type NAGKs which catalyze the phosphorylation of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in bacteria and photosynthetic organisms using either the acetylated, noncyclic (NC), or non-acetylated, cyclic (C) route of ornithine biosynthesis. Also included in this CD is a distinct group of uncharacterized (UC) bacterial and archeal NAGKs. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK). Probab=100.00 E-value=0 Score=279.30 Aligned_cols=140 Identities=58% Similarity=0.956 Sum_probs=138.1 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) +||||+|++||.+.|+.+|++|+|||++|++++++|+.+|+|+|++|+++|.+++|+|||||||++|+|+++++++++++ T Consensus 117 ~g~vG~v~~v~~~~i~~lL~~g~IPVi~pi~~~~~G~~~ninaD~~Aa~lA~~l~AdkLi~ltdv~Gv~~~~g~~i~~lt 196 (256) T cd04238 117 LGFVGEVTEVNPELLETLLEAGYIPVIAPIAVDEDGETYNVNADTAAGAIAAALKAEKLILLTDVPGVLDDPGSLISELT 196 (256) T ss_pred CCCCCCCCCCCHHHHHHHHHCCCCEEEECEEECCCCCEEEECHHHHHHHHHHHHCCCCEEEECCCCCEECCCCCCCCCCC T ss_conf 34245544355899999984799447604135688868963799999999997280108985288861568875234127 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 889999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 81 INEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 81 ~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) .+|++++++++++++||.+|+++|.+|+++|++||||+|++.|++|+.||||++|+||+| T Consensus 197 ~~e~~~l~~~~~~~gGM~~Kl~aa~~a~~~Gv~rv~Ii~g~~~~~Ll~eLft~~G~GT~I 256 (256) T cd04238 197 PKEAEELIEDGVISGGMIPKVEAALEALEGGVRKVHIIDGRVPHSLLLELFTDEGIGTMI 256 (256) T ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHCCCCCCCCCC T ss_conf 999999997696689539999999999984999799937999877999973799985879 No 7 >cd04249 AAK_NAGK-NC AAK_NAGK-NC: N-Acetyl-L-glutamate kinase - noncyclic (NAGK-NC) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis using the acetylated, noncyclic route of ornithine biosynthesis. There are two variants of this pathway. In one, typified by the pathway in Escherichia coli, glutamate is acetylated by acetyl-CoA and acetylornithine is deacylated hydrolytically. In this pathway, feedback inhibition by arginine occurs at the initial acetylation of glutamate and not at the phosphorylation of NAG by NAGK. Homodimeric NAGK-NC are members of the Amino Acid Kinase Superfamily (AAK). Probab=100.00 E-value=0 Score=277.11 Aligned_cols=138 Identities=36% Similarity=0.578 Sum_probs=134.9 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) +||||+|++||++.|+.+|++|+||||+|+|++++|+.||+|+|.+|+++|.+|+|+ |+|+||++|++|.++++|++++ T Consensus 115 lG~vG~v~~Vn~~~i~~ll~~g~IPVisplg~~~~G~~~NiNaD~~A~~iA~aL~A~-Li~ltdv~Gvld~~g~li~~l~ 193 (252) T cd04249 115 LGAVGKATANDPSLLNDLLKAGFLPIISSIGADDQGQLMNVNADQAATAIAQLLNAD-LVLLSDVSGVLDADKQLISELN 193 (252) T ss_pred CCCCCCEEEECHHHHHHHHCCCCCCEECCCEECCCCCEEECCHHHHHHHHHHHHCCC-EEEEECCCCEECCCCCCCCCCC T ss_conf 786555323557899998628987476353148999879759999999999971598-9999688613889998942189 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 889999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 81 INEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 81 ~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) .+|++++++++++++||+||+++|++|+++|++++||+||+.|++| .|+||++|+||+| T Consensus 194 ~~e~~~li~~~~i~gGMipKl~aa~~Al~~gv~~v~I~~g~~~~~L-~elft~~GiGT~I 252 (252) T cd04249 194 AKQAAELIEQGVITDGMIVKVNAALDAAQSLRRGIDIASWQYPEQL-TALLAGEPVGTKI 252 (252) T ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCHHH-HHHHCCCCCCCCC T ss_conf 9999999975954797399999999999819987999438987389-9997599987739 No 8 >COG0548 ArgB Acetylglutamate kinase [Amino acid transport and metabolism] Probab=100.00 E-value=0 Score=275.58 Aligned_cols=142 Identities=52% Similarity=0.910 Sum_probs=139.2 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC--CCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555--5563 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ--LISK 78 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~--~i~~ 78 (142) +||||+|++||++.|+.++++|+||||+|+|++++|+.||+|+|++|+++|.+|+|+||+||||++|++++.++ +|++ T Consensus 122 ~g~vG~i~~Vn~~~i~~ll~~~~IpViapia~~~~G~~~NvnaD~~A~~iA~aLkAekLi~ltdv~Gvl~~~~~~s~i~~ 201 (265) T COG0548 122 LGYVGEIRKVNPELIERLLDNGAIPVIAPIAVDEDGETLNVNADTAAGALAAALKAEKLILLTDVPGVLDDKGDPSLISE 201 (265) T ss_pred CCEEEEEEEECHHHHHHHHHCCCCEEEECCEECCCCCEEEECHHHHHHHHHHHCCCCEEEEEECCHHHCCCCCCCEEECC T ss_conf 64044589878999999975798169807427899978704889999999997398829987277000348998545535 Q ss_pred CCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 3888999996269767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 79 LSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 79 i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) ++.++++++++++++++||.+|+++|.+|+++|++++||+||+.|++++.||||++|+||+|+| T Consensus 202 ~~~~~~~~li~~~~i~~GMi~Kv~~a~~A~~~Gv~~v~ii~g~~~~~ll~eLFt~~giGT~i~~ 265 (265) T COG0548 202 LDAEEAEELIEQGIITGGMIPKVEAALEALESGVRRVHIISGRVPHSLLLELFTRDGIGTMIVR 265 (265) T ss_pred CCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHCCCCCCEEECC T ss_conf 7999999886249734860899999999998499738996377862699998457985068259 No 9 >cd04252 AAK_NAGK-fArgBP AAK_NAGK-fArgBP: N-Acetyl-L-glutamate kinase (NAGK) of the fungal arginine-biosynthetic pathway (fArgBP). The nuclear-encoded, mitochondrial polyprotein precursor with an N-terminal NAGK (ArgB) domain (this CD), a central DUF619 domain, and a C-terminal reductase domain (ArgC, N-Acetylglutamate Phosphate Reductase, NAGPR). The precursor is cleaved in the mitochondria into two distinct enzymes (NAGK-DUF619 and NAGPR). Native molecular weights of these proteins indicate that the kinase is an octamer whereas the reductase is a dimer. This CD also includes some gamma-proteobacteria (Xanthomonas and Xylella) NAG kinases with an N-terminal NAGK (ArgB) domain (this CD) and a C-terminal DUF619 domain. The DUF619 domain is described as a putative distant homolog of the acetyltransferase, ArgA, predicted to function in NAG synthase association in fungi. Eukaryotic sequences have an N-terminal mitochondrial transit peptide. Members of this NAG kinase domain CD belong to th Probab=100.00 E-value=8.4e-45 Score=268.14 Aligned_cols=139 Identities=29% Similarity=0.533 Sum_probs=133.2 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) +||||+|++||++.|+.+|++|+|||++|+|++++|+.||+|+|++|+++|.+|+|+||||+||++|++|++++++++++ T Consensus 109 ~g~vG~v~~Vd~~~i~~lL~~g~Ipvispig~~~~G~~~NvnaD~~A~~iA~aL~A~kLi~ltdv~Gv~d~~g~~i~~l~ 188 (248) T cd04252 109 YGLVGKITGVNKAPIEAAIRAGYLPILTSLAETPSGQLLNVNADVAAGELARVLEPLKIVFLNETGGLLDGTGKKISAIN 188 (248) T ss_pred CCCCCCEEEECHHHHHHHHHCCCCEEEEEEEECCCCCEEEECHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCC T ss_conf 78432258982999999985899448851478899998987999999999997599989999588301799999974278 Q ss_pred H-HHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 8-89999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 81 I-NEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 81 ~-~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) . ++++++++++++++||++|+++ .++|.++++|+|+++++.|++||.||||++|+||+| T Consensus 189 ~~~~~~~l~~~~~i~~gm~~kl~a-~~~~~~~v~r~h~i~~~~~~aLL~ELFT~~G~GT~I 248 (248) T cd04252 189 LDEEYDDLMKQPWVKYGTKLKIKE-IKELLDTLPRSSSVSITSPDDLQKELFTHSGAGTLI 248 (248) T ss_pred HHHHHHHHHHCCCCCCCCHHHHHH-HHHHHHCCCCCEEEECCCCCHHHHHHHCCCCCCCCC T ss_conf 899999998779977863647999-999996399718996027866577771599870639 No 10 >cd04251 AAK_NAGK-UC AAK_NAGK-UC: N-Acetyl-L-glutamate kinase - uncharacterized (NAGK-UC). This domain is similar to Escherichia coli and Pseudomonas aeruginosa NAGKs which catalyze the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis. These uncharacterized domain sequences are found in some bacteria (Deinococci and Chloroflexi) and archea and belong to the Amino Acid Kinase Superfamily (AAK). Probab=100.00 E-value=5.2e-44 Score=263.62 Aligned_cols=134 Identities=34% Similarity=0.638 Sum_probs=130.3 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) +||||+|++||++.|+.+|++|++||++|++++++|+.||+|+|++|+++|.+|+|+||||+||++|++ .++++|++++ T Consensus 124 ~g~vG~v~~V~~~~i~~ll~~g~iPVispig~~~~G~~~NvnaD~~A~~iA~aL~A~kLI~ltdv~Gvl-~dg~li~~l~ 202 (257) T cd04251 124 GGYTGKVEKVNSDLIEALLDAGYLPVVSPVAYSEEGEPLNVDGDRAAAAIAAALKAERLILLTDVEGLY-LDGRVIERIT 202 (257) T ss_pred CCCEEEEEEECHHHHHHHHHCCCCEEECCCCCCCCCCEECCCHHHHHHHHHHHCCCCEEEEEECCCEEE-CCCEECCCCC T ss_conf 797677989878999999848994586676668876074287799999999980999699996884042-4984536589 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 889999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 81 INEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 81 ~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) .+|+++++++ +++||+||+++|.+|+++||+||||+||+.|++|+.|| +|.||+| T Consensus 203 ~~e~~~li~~--i~gGM~pKl~aa~~A~~~GV~~v~Ii~g~~~~~ll~el---~G~GT~I 257 (257) T cd04251 203 VSDAESLLEK--AGGGMKRKLLAAAEAVEGGVREVVIGDARADSPISSAL---NGGGTVI 257 (257) T ss_pred HHHHHHHHHH--HCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHH---CCCCCCC T ss_conf 9999999987--34853999999999998099979990799987799996---5898469 No 11 >PRK04531 acetylglutamate kinase; Provisional Probab=100.00 E-value=2.6e-38 Score=232.14 Aligned_cols=140 Identities=31% Similarity=0.476 Sum_probs=133.0 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) +||||+|++||++.|+.+|++|+||||+|+|.+++|+.+|+|||.+|+++|.+|+|+|+||||+.+|++|.+|++|+.|+ T Consensus 131 ~G~VG~V~~Vd~~~I~~~L~~g~IPVIapig~~~~Gq~lNINAD~aA~aLA~aL~a~KlIfLt~~gGi~d~~gk~Is~In 210 (421) T PRK04531 131 YGLVGEVKAVLLAAVRSSLRAGKIPVIASLGETPSGQILNINADVAANELVSILQPYKIIFLTGTGGLLDADGNKISSIN 210 (421) T ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCEECCCCCEEEECHHHHHHHHHHHCCCCEEEEECCCCCEECCCCCEEEEEE T ss_conf 76433347786899999997898269815315899978840599999999986286469997167860678998863575 Q ss_pred HH-HHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 88-99999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 81 IN-EARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 81 ~~-e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) .+ |++.++++.|+..||+.|++...+.+.+- |+...+..+.|..|..||||+.|+||+|- T Consensus 211 L~~eyd~lm~~~w~~~g~klkl~~ik~LLd~L-P~~SSV~It~p~~L~kELFT~~GsGTLir 271 (421) T PRK04531 211 LSTEFDQLMQQPSINGGMRLKLEQIKDLLDRL-PLESSVSITSPSDLAKELFTHKGSGTLVR 271 (421) T ss_pred CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHCC-CCCCEEEEECHHHHHHHHCCCCCCCEEEE T ss_conf 44035545205103566899899999998519-98660787089998887626799960486 No 12 >cd04236 AAK_NAGS-Urea AAK_NAGS-Urea: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the urea cycle found in animals. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate; NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Ureogenic NAGS activity is dependent on the concentration of glutamate (substrate) and arginine (activator). Domain architecture of ureogenic NAGS consists of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal DUF619 domain. Members of this CD belong to the protein superfamily, the Amino Acid Kinase Family (AAKF). Probab=100.00 E-value=3.6e-38 Score=231.39 Aligned_cols=136 Identities=26% Similarity=0.464 Sum_probs=124.1 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) .||+|++++||.++|++.+++|+||||+|+|++++|++||+|+|.+|.++|.+|+|.||+||||++|++|.++++|++++ T Consensus 132 ~~~vG~~~~v~~~~i~~~i~~g~IPVIaplG~~~~Gq~~NINADtaA~aiA~aL~a~Kli~LTdv~Gvld~~~~lI~~i~ 211 (271) T cd04236 132 GASKGPSVSVDTELLQWCLGSGHIPLVCPIGETSSGRSVSLDSSEVTTAIAKALQPIKVIFLNRSGGLRDQKHKVLPQVH 211 (271) T ss_pred CCCCCCCEEEEHHHHHHHHHCCCCCEECCCEECCCCCEEEECHHHHHHHHHHHCCCCEEEEEECCCCEECCCCCEECEEC T ss_conf 37668713860799999997899718825237899988877589999999986487679998078740778998822016 Q ss_pred H-HHHHHHHHCCCCCC---CHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 8-89999962697678---678999999999984998899952578768999996799866345 Q 537021.9.peg.3 81 I-NEARTLIKDGTISG---GMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 81 ~-~e~~~l~~~~~~~~---gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) . ++++.+.++.|+++ +|.+|+..|++++.+++..+ |+ .|++|+.||||++|+||+| T Consensus 212 l~~~~~~l~~~~~i~~~~~~~i~~i~~~l~~l~~~~s~v-i~---~phsll~ELFTd~G~GTLf 271 (271) T cd04236 212 LPADLPSLSDAEWLSETEQNRIQDIATLLNALPSMSSAV-IT---SAETLLTELFSHKGSGTLF 271 (271) T ss_pred HHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCEEE-EE---CHHHHHHHHHCCCCCCCCC T ss_conf 188999987076512352674778999997177873699-83---7568789986389877788 No 13 >TIGR00761 argB acetylglutamate kinase; InterPro: IPR004662 N -Acetylglutamate (NAG) fulfils distinct biological roles in lower and higher organisms. In prokaryotes, lower eukaryotes and plants it is the first intermediate in the biosynthesis of arginine, whereas in ureotelic (excreting nitrogen mostly in the form of urea) vertebrates, it is an essential allosteric cofactor for carbamyl phosphate synthetase I (CPSI), the first enzyme of the urea cycle. The pathway that leads from glutamate to arginine in lower organisms employs eight steps, starting with the acetylation of glutamate to form NAG. In these species, NAG can be produced by two enzymatic reactions: one catalysed by NAG synthase (NAGS) and the other by ornithine acetyltransferase (OAT). In ureotelic species, NAG is produced exclusively by NAGS. In lower organisms, NAGS is feedback-inhibited by L-arginine, whereas mammalian NAGS activity is significantly enhanced by this amino acid. The NAGS genes of bacteria, fungi and mammals are more diverse than other arginine-biosynthesis and urea-cycle genes. The evolutionary relationship between the distinctly different roles of NAG and its metabolism in lower and higher organisms remains to be determined . The pathway from glutamate to arginine is: NAGS; N-acetylglutamate synthase (2.3.1.1 from EC) (glutamate to N-acetylglutamate) NAGK; N-acetylglutamate kinase (2.7.2.8 from EC) (N-acetylglutamate to N-acetylglutamate-5P) NAGSA; N-acetyl-gamma-glutamyl-phosphate reductase (1.2.1.38 from EC) (N-acetylglutamate-5P to N-acetylglumate semialdehyde) Acetylornithine aminotransferase (2.6.1.11 from EC) (N-acetylglumate semialdehyde to N-acetylornithine) Acetylornithine deacetylase (3.5.1.16 from EC) (N-acetylornithine to ornithine) Arginase (3.5.3.1 from EC) (ornithine to arginine) This entry describes N-acetylglutamate kinases (ArgB) of many prokaryotes and the N-acetylglutamate kinase domains of multifunctional proteins from yeasts as well as some, as yet, uncharacterised proteins.; GO: 0003991 acetylglutamate kinase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=4.4e-38 Score=230.91 Aligned_cols=117 Identities=51% Similarity=0.876 Sum_probs=113.4 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCC-CCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEEC--CC-CCC Q ss_conf 985520135569999999977984998574317-87320066147999999986251133432267633534--55-555 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPG-YDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDK--NR-QLI 76 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~-~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~--~~-~~i 76 (142) +||||++++||++.|+.||++|++|||+|+++| .+|+.+|+|||.+|+++|.+|+|+||+||||++|++.. ++ ++| T Consensus 133 ~G~VG~~~~Vn~~~~~~Ll~~G~~PVi~sla~~~~~g~~lNvNAD~aA~~lA~~L~A~kL~~LtDv~Gi~~~y~d~~~~i 212 (254) T TIGR00761 133 LGYVGEIKKVNKALLEALLKAGIIPVIASLALTNAEGQALNVNADTAAGELAAALGAEKLVLLTDVPGILNGYPDKQSLI 212 (254) T ss_pred CCCEEEECCCCHHHHHHHHHCCCCCEECCCCCCHHCCCEEECCHHHHHHHHHHHCCCCEEEEECCCHHHHCCCCCCCCEE T ss_conf 48412313278899999996498517716658623187770677799999998609953888417403225888746140 Q ss_pred CCCCH-HHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEE Q ss_conf 63388-899999626976786789999999999849988999 Q 537021.9.peg.3 77 SKLSI-NEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAI 117 (142) Q Consensus 77 ~~i~~-~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i 117 (142) ++|+. +|+++|+.+++++|||+|||++|.+|++.|+++||| T Consensus 213 ~~l~~D~~~~~l~~~~~i~gGMipKV~~a~~A~~~Gv~~v~I 254 (254) T TIGR00761 213 SELPLDEEIEQLIKQGVIKGGMIPKVNAALEALESGVKSVII 254 (254) T ss_pred CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCEECC T ss_conf 226867999987752223159857899999999738880038 No 14 >TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase; InterPro: IPR010167 This entry represents a clade of amino-acid N-acetyltransferases, or N-acetylglutamate synthase, which is the product of the argA gene and the first enzyme in arginine biosynthesis. This enzyme displays more diversity between bacteria, fungi and mammals than other enzymes in arginine metabolism, and N-acetylglutamate itself can have different roles in different taxonomic groups: in prokaryotes, lower eukaryotes and plants it is the first intermediate in arginine biosynthesis, while in vertebrates it is an allosteric cofactor for the first enzyme in the urea cycle . In bacteria, arginine can regulate ornithine biosynthesis via a feedback inhibition mechanism . This enzyme may also act on aspartate.; GO: 0004042 amino-acid N-acetyltransferase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=4.7e-37 Score=225.26 Aligned_cols=141 Identities=27% Similarity=0.504 Sum_probs=131.7 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) |.|+|.|++||.+.|+.+|+.|.|+.++|+|++.+|+.||++..++|+.+|+.|+||||||+|..||+.|.+|+++.+++ T Consensus 139 y~h~G~vRk~D~~~i~~~Ld~g~IVLL~PLG~S~tG~~FnL~~e~vAt~~A~~L~AdKL~~~t~~~Gi~d~~G~l~~eL~ 218 (439) T TIGR01890 139 YEHTGVVRKIDTEGIRRLLDAGSIVLLSPLGYSPTGEIFNLDMEDVATSVAIELKADKLIYFTEEPGILDADGKLVAELS 218 (439) T ss_pred CEECCCEEHHHHHHHHHHHCCCCEEECCCCCCCCCCHHCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHCC T ss_conf 20327451100888986520787675176543664000044068899999986255404540588872465652423248 Q ss_pred HHHHHHHHHC---CC-----CCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 8899999626---97-----6786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 81 INEARTLIKD---GT-----ISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 81 ~~e~~~l~~~---~~-----~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) ++|++.+++. .. ..-....-+++|.+||+.||+|+||+++..+|+||+||||.+|+||+|. T Consensus 219 pq~~~~~~~~l~~~~aPdde~~~~~~~lL~~a~~A~~GGV~R~hlv~ya~DGsLL~ELF~~~G~GT~v~ 287 (439) T TIGR01890 219 PQEVESLVERLGEETAPDDELSADTARLLSAAVKACRGGVRRSHLVSYAEDGSLLQELFTRDGIGTLVS 287 (439) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHCCCCCHHHH T ss_conf 789999999850578973332255899999999998679751442262322589999986189303788 No 15 >cd02115 AAK Amino Acid Kinases (AAK) superfamily, catalytic domain; present in such enzymes like N-acetylglutamate kinase (NAGK), carbamate kinase (CK), aspartokinase (AK), glutamate-5-kinase (G5K) and UMP kinase (UMPK). The AAK superfamily includes kinases that phosphorylate a variety of amino acid substrates. These kinases catalyze the formation of phosphoric anhydrides, generally with a carboxylate, and use ATP as the source of the phosphoryl group; are involved in amino acid biosynthesis. Some of these kinases control the process via allosteric feed-back inhibition. Probab=99.96 E-value=1.7e-29 Score=183.49 Aligned_cols=133 Identities=32% Similarity=0.519 Sum_probs=121.6 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCC---CCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC----- Q ss_conf 9855201355699999999779849985743178---73200661479999999862511334322676335345----- Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGY---DGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN----- 72 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~---~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~----- 72 (142) +|++|++++++.+.++.+++.+.+||++|++... .++.+|.++|..|+.+|.+|+|++|+|+|||+|+|+.+ T Consensus 107 ~~~~~~v~~~~~~~l~~~l~~~~ipVi~g~~~~~~~~~~~l~~~~sD~~Aa~lA~~l~A~~lii~TDV~Gv~~~dp~~~~ 186 (248) T cd02115 107 QGHVGKITKVSTDRLKSLLENGILPILSGFGGTDEKETGTLGRGGSDSTAALLAAALKADRLVILTDVDGVYTADPRKVP 186 (248) T ss_pred CCCCCCCCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCCEECCCCCCHHHHHHHHHHCCCHHHHEECCCCCEECCCCCCC T ss_conf 44432102268999998774167668136345556777633078878999999999496655310457861618998688 Q ss_pred -CCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf -55556338889999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 73 -RQLISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 73 -~~~i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) .+++++++.+|+.++.. .|+|.+|+++|..++++|++ +||.|++.|+.| |+|+++++||+| T Consensus 187 ~a~~i~~ls~~ea~el~~----~G~~v~~~~a~~~a~~~gi~-v~I~n~~~~~~l--elf~~~~~GTlI 248 (248) T cd02115 187 DAKLLSELTYEEAAELAY----AGAMVLKPKAADPAARAGIP-VRIANTENPGAL--ALFTPDGGGTLI 248 (248) T ss_pred CCEECCCCCHHHHHHHHH----CCCCHHHHHHHHHHHHCCCC-EEEECCCCCCHH--HHCCCCCCCCCC T ss_conf 675886538999999985----59967489999999987997-999139997533--411799996819 No 16 >cd04241 AAK_FomA-like AAK_FomA-like: This CD includes a fosfomycin biosynthetic gene product, FomA, and similar proteins found in a wide range of organisms. Together, the fomA and fomB genes in the fosfomycin biosynthetic gene cluster of Streptomyces wedmorensis confer high-level fosfomycin resistance. FomA and FomB proteins converted fosfomycin to fosfomycin monophosphate and fosfomycin diphosphate in the presence of ATP and a magnesium ion, indicating that FomA and FomB catalyzed phosphorylations of fosfomycin and fosfomycin monophosphate, respectively. FomA and related sequences in this CD are members of the Amino Acid Kinase Superfamily (AAK). Probab=99.96 E-value=4.3e-29 Score=181.30 Aligned_cols=137 Identities=26% Similarity=0.486 Sum_probs=119.7 Q ss_pred CCCCCEEEECHHHHHHHHHCCCEEEEC-CCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEEC---CCCCCC Q ss_conf 855201355699999999779849985-7431787320066147999999986251133432267633534---555556 Q 537021.9.peg.3 2 GFVGNVIKVNRTILDLLIKSGIIPVIA-PIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDK---NRQLIS 77 (142) Q Consensus 2 g~~G~v~~vd~~~i~~lL~~g~iPVi~-pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~---~~~~i~ 77 (142) ...|++.+++.+.++.+|+.|+|||++ +...+..+...++|+|.+|+.+|..++|++|||+||++|+|+. ++++|+ T Consensus 107 ~~~~~~~~~~~~~l~~ll~~g~IPIin~~~v~~~~~~~~i~~~D~lAa~vA~~l~Ad~LI~lTDVdGvy~~~p~~a~~i~ 186 (252) T cd04241 107 TENGRIVSFDLEVIKELLDRGFVPVLHGDVVLDEGGGITILSGDDIVVELAKALKPERVIFLTDVDGVYDKPPPDAKLIP 186 (252) T ss_pred HHCCCEEECCHHHHHHHHHCCCEEEECCCCEECCCCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCEEECCCCCCCCCCC T ss_conf 31784103178999999987964696687652477742224373999999998387869999568746469999983864 Q ss_pred CCCHHHHHHHHHCC-----CCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 33888999996269-----7678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 78 KLSINEARTLIKDG-----TISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 78 ~i~~~e~~~l~~~~-----~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) +++..+++++...+ ..+|||.+|+++|..|.++|++ ++|+||+.|+.++ ++++++++||.| T Consensus 187 ~i~~~~~~~~~~~~~~~~~~~tGGM~~Kl~aA~~a~~~Gi~-v~I~~G~~~~~i~-~~l~g~~vGT~I 252 (252) T cd04241 187 EIDVGSLEDILAALGSAGTDVTGGMAGKIEELLELARRGIE-VYIFNGDKPENLY-RALLGNFIGTRI 252 (252) T ss_pred CCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCC-EEEECCCCCCHHH-HHHCCCCCCCCC T ss_conf 50732499998736877787627739999999999978994-9998999976799-997799998999 No 17 >PRK12353 carbamate kinase; Reviewed Probab=99.92 E-value=5.3e-25 Score=158.70 Aligned_cols=130 Identities=24% Similarity=0.448 Sum_probs=108.6 Q ss_pred EECHHHHHHHHHCCCEEEEC-----CCCCCC---CCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEE----CCCCCC Q ss_conf 55699999999779849985-----743178---732006614799999998625113343226763353----455555 Q 537021.9.peg.3 9 KVNRTILDLLIKSGIIPVIA-----PIAPGY---DGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLD----KNRQLI 76 (142) Q Consensus 9 ~vd~~~i~~lL~~g~iPVi~-----pig~~~---~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d----~~~~~i 76 (142) -++.+.|+.|+++|+|||.+ |+..+. .|-..+||.|.+|+.||..|+|++|++||||+|||. .+.+.| T Consensus 170 iie~~~I~~L~~~g~IvIa~GGGGIPV~~~~g~~~Gv~aVIdkDlaaa~LA~~l~Ad~lliLTdV~~V~~n~g~p~q~~l 249 (312) T PRK12353 170 IIEKEAIKTLVEAGVVVIAAGGGGIPVVREGGGLKGVEAVIDKDFASALLAELIDADLLIILTAVDKVYINFGKPNQKKL 249 (312) T ss_pred EECHHHHHHHHHCCCEEEECCCCCCCEECCCCCEECCEEEECCCHHHHHHHHHCCCCEEEEEECCHHHHHCCCCCCCCCC T ss_conf 22399999999789789964888776576799446613331533888999997199879997163467652899861332 Q ss_pred CCCCHHHHHHHHHCCCCC-CCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 633888999996269767-867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 77 SKLSINEARTLIKDGTIS-GGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 77 ~~i~~~e~~~l~~~~~~~-~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) ++++.+|+++|++++.+. |+|.||+++|++|+++|.++..|| ...+.+...| ....||+|+| T Consensus 250 ~~lt~~e~~~~~~~G~F~~GsM~PKVeAai~fv~~gg~~~aiI--t~l~~~~~al--~G~~GT~I~~ 312 (312) T PRK12353 250 DRVTVSELEKYIEEGQFAPGSMLPKVEAAISFVESGGKRKAII--TSLENAKEAL--EGKAGTVIVK 312 (312) T ss_pred CCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCEEEE--CCHHHHHHHH--CCCCCCEEEC T ss_conf 1388999999997799888882699999999999489837998--9899999996--7999848829 No 18 >cd04256 AAK_P5CS_ProBA AAK_P5CS_ProBA: Glutamate-5-kinase (G5K) domain of the bifunctional delta 1-pyrroline-5-carboxylate synthetase (P5CS), composed of an N-terminal G5K (ProB) and a C-terminal glutamyl 5- phosphate reductase (G5PR, ProA), the first and second enzyme catalyzing proline (and, in mammals, ornithine) biosynthesis. G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, and is subject to feedback allosteric inhibition by proline or ornithine. In plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia. Probab=99.91 E-value=4.7e-24 Score=153.46 Aligned_cols=130 Identities=27% Similarity=0.447 Sum_probs=106.0 Q ss_pred ECH-HHHHHHHHCCCEEEEC---CCCCCC------CCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEEC-----CCC Q ss_conf 569-9999999779849985---743178------7320066147999999986251133432267633534-----555 Q 537021.9.peg.3 10 VNR-TILDLLIKSGIIPVIA---PIAPGY------DGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDK-----NRQ 74 (142) Q Consensus 10 vd~-~~i~~lL~~g~iPVi~---pig~~~------~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~-----~~~ 74 (142) .|. +.|+.+|+.|.|||+. ++..++ .+.....|+|++|+.+|..++|++||+||||+|+||. +.+ T Consensus 137 ~n~~~tl~~Ll~~gvIPIvNEND~Va~~e~~~~d~~~~i~fgDND~Laa~vA~~i~Ad~LillTDVdGlyd~~P~~~~ak 216 (284) T cd04256 137 RNLNGTLEELLRLNIIPIINTNDAVSPPPEPDEDLQGVISIKDNDSLAARLAVELKADLLILLSDVDGLYDGPPGSDDAK 216 (284) T ss_pred HHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCC T ss_conf 99999999997389785763786102465564112343113560189999887618888999725772026999999874 Q ss_pred CCCCCCHHHHHHHH---HCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 55633888999996---26976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 75 LISKLSINEARTLI---KDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 75 ~i~~i~~~e~~~l~---~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +|+++...+.+.+. .+...+|||.+|+++|..+.++|++ ++|+||+.|+.|+ ++|+++++||+|. T Consensus 217 li~~i~~~d~~~~~~~~~s~~gtGGM~tKi~AA~~a~~~Gi~-~~I~nG~~~~~i~-~i~~g~~vGT~Ft 284 (284) T cd04256 217 LIHTFYPGDQQSITFGTKSRVGTGGMEAKVKAALWALQGGTS-VVITNGMAGDVIT-KILEGKKVGTFFT 284 (284) T ss_pred EEEEECCCCHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCC-EEEECCCCCCHHH-HHHCCCCCCCEEC T ss_conf 421455664888745788997617879999999999978997-9995089966899-9976999882549 No 19 >PRK12314 gamma-glutamyl kinase; Provisional Probab=99.88 E-value=3.2e-22 Score=143.35 Aligned_cols=126 Identities=25% Similarity=0.507 Sum_probs=101.4 Q ss_pred HHHHHHHHCCCEEEEC---CCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC------CCCCCCCCH-- Q ss_conf 9999999779849985---74317873200661479999999862511334322676335345------555563388-- Q 537021.9.peg.3 13 TILDLLIKSGIIPVIA---PIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN------RQLISKLSI-- 81 (142) Q Consensus 13 ~~i~~lL~~g~iPVi~---pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~------~~~i~~i~~-- 81 (142) +.+..+|+.|.|||+. .++ +++...|. +.|.+|+.+|..++|+.|+++|||+|||+.+ .++|++++. T Consensus 124 ~tl~~ll~~g~IPIvneND~Va-t~ei~~fG-dnD~lAA~vA~~i~AD~liilTdVDGvY~~dP~~~~dAk~I~~v~~~~ 201 (265) T PRK12314 124 NTFESLLELGILPIVNENDAVA-TDEIDKFG-DNDRLSAIVAKLVKADLLIILSDIDGLYDKNPRIYPDAKLLSEVTEIT 201 (265) T ss_pred HHHHHHHHCCCEEEEECCCCCC-CCCCCCCC-CCCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCEEEEEECCCC T ss_conf 9999998489566870788531-46544545-750999999875597669997356742279988799973840245679 Q ss_pred HHHHHHHH---CCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 89999962---69767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 82 NEARTLIK---DGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 82 ~e~~~l~~---~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) .+..++.. +...+|||..|+++|..+.++|++ ++|+||+.|+.| ..++.++.+||+|+| T Consensus 202 ~~~~~~~~~~~s~~gtGGM~sKl~Aa~~a~~~Gi~-v~I~ng~~~~~i-~~~l~ge~~GT~f~P 263 (265) T PRK12314 202 EEILALAGGAGSKFGTGGMVTKLKAAKFLMEAGIK-MVLANGFNPSDI-LSFLEGESIGTLFAP 263 (265) T ss_pred HHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCC-EEEEECCCCCHH-HHHHCCCCCCEEECC T ss_conf 78997514788887768648999999999978991-999708996289-999769987159806 No 20 >PRK12454 carbamate kinase-like carbamoyl phosphate synthetase; Reviewed Probab=99.88 E-value=8e-22 Score=141.14 Aligned_cols=129 Identities=29% Similarity=0.462 Sum_probs=108.9 Q ss_pred EEECHHHHHHHHHCCCEEEEC-----CCCCCCCCCEECCCH----HHHHHHHHHHHCHHHHHHHCCCCCEEEC----CCC Q ss_conf 355699999999779849985-----743178732006614----7999999986251133432267633534----555 Q 537021.9.peg.3 8 IKVNRTILDLLIKSGIIPVIA-----PIAPGYDGATYNINA----DTFAGAIAEKLNVIRLLFLTDVPGVLDK----NRQ 74 (142) Q Consensus 8 ~~vd~~~i~~lL~~g~iPVi~-----pig~~~~G~~~nin~----D~~a~~lA~~l~A~kli~ltdv~Gv~d~----~~~ 74 (142) +-++.+.|+.|+++|++||.+ |+ ..++|...++++ |.+|+.||..++||.|++|||+++||.+ +.+ T Consensus 171 ~IvE~~~I~~Lv~~g~iVIa~GGGGIPV-v~~~g~l~GveAVIDKD~aaa~LA~~l~Ad~liILTdVd~V~~n~g~p~q~ 249 (313) T PRK12454 171 GIVEIETIKMLVENGVIVIAAGGGGIPV-IEEDGELKGVEAVIDKDLASELLAEELNADIFIILTDVEKVYLNYGKPDQK 249 (313) T ss_pred EEECHHHHHHHHHCCCEEEEECCCCCCE-ECCCCCEECCCEEECCCHHHHHHHHHHCCCEEEEEECCCHHEECCCCCCHH T ss_conf 2752899999997898899948985666-855982756520444518899999970999899961665000038996412 Q ss_pred CCCCCCHHHHHHHHHCCCCCC-CHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 556338889999962697678-67899999999998499889995257876899999679986634549 Q 537021.9.peg.3 75 LISKLSINEARTLIKDGTISG-GMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 75 ~i~~i~~~e~~~l~~~~~~~~-gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) .+++++.+|++++++++.+.. +|.||+++|++|+++|.++++|.+ .+.+...| ....||+|+| T Consensus 250 ~l~~it~~e~~~~~~eg~F~~GsM~PKV~Aai~Fve~gg~~aiIts---l~~~~~al--~g~~GT~IvP 313 (313) T PRK12454 250 ALDKVTVDEAKKYYEEGHFPAGSMGPKILAAIRFVENGGRRAVIGD---LEKAVEAL--EGKTGTRIIP 313 (313) T ss_pred HHHCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEECC---HHHHHHHH--CCCCCCEECC T ss_conf 2403889999999875998888826999999999994797699998---89999996--7999988659 No 21 >pfam00696 AA_kinase Amino acid kinase family. This family includes kinases that phosphorylate a variety of amino acid substrates, as well as uridylate kinase and carbamate kinase. This family includes: Aspartokinase EC:2.7.2.4. Acetylglutamate kinase EC:2.7.2.8. Glutamate 5-kinase EC:2.7.2.11. Uridylate kinase EC:2.7.4.-. Carbamate kinase EC:2.7.2.2. Probab=99.87 E-value=1.2e-21 Score=140.18 Aligned_cols=115 Identities=32% Similarity=0.455 Sum_probs=102.1 Q ss_pred CCEEEECHHHHHHHHHCCCEEEECCCC-CCCCCCE---ECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEEC------CCC Q ss_conf 201355699999999779849985743-1787320---066147999999986251133432267633534------555 Q 537021.9.peg.3 5 GNVIKVNRTILDLLIKSGIIPVIAPIA-PGYDGAT---YNINADTFAGAIAEKLNVIRLLFLTDVPGVLDK------NRQ 74 (142) Q Consensus 5 G~v~~vd~~~i~~lL~~g~iPVi~pig-~~~~G~~---~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~------~~~ 74 (142) ++...++.+.++.+|+++.|||+++.+ ++.+|+. .+.++|.+|+.+|..++|++|+|+|||+|||+. +++ T Consensus 106 ~~~~~~~~~~i~~~l~~~~ipVi~g~~~~~~~~~~~~l~~~~sD~~Aa~lA~~l~A~~lii~tdV~Gvy~~dP~~~~~a~ 185 (230) T pfam00696 106 VRLDANDTEAIKELLEEGVVPVITGFGGENDTGETTTLGRGSSDTLAALLAEALGADKLIILTDVDGVYTADPRKVPDAK 185 (230) T ss_pred EEEEHHHHHHHHHHHHCCCCEEECCCCEECCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCEECCCCCCCCCCE T ss_conf 68845259999999976992552476424798872313487710889999985598678986157813637998799873 Q ss_pred CCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 556338889999962697678678999999999984998899952 Q 537021.9.peg.3 75 LISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILD 119 (142) Q Consensus 75 ~i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~ 119 (142) +|++++.+|+.++..++.+++||++|+++|++++++|.-+++|.| T Consensus 186 ~i~~i~~~e~~~l~~~~~~~gGm~~k~~~a~~~~~~~~i~v~I~n 230 (230) T pfam00696 186 LIPELSYDEAEELLAAGDGTGGMKVKDPAALKAARRGGIPVVIVN 230 (230) T ss_pred ECCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEEC T ss_conf 855767999998876597849969999999999997898699949 No 22 >cd04235 AAK_CK AAK_CK: Carbamate kinase (CK) catalyzes both the ATP-phosphorylation of carbamate and carbamoyl phosphate (CP) utilization with the production of ATP from ADP and CP. Both CK (this CD) and nonhomologous CP synthetase synthesize carbamoyl phosphate, an essential precursor of arginine and pyrimidine bases, in the presence of ATP, bicarbonate, and ammonia. CK is a homodimer of 33 kDa subunits and is a member of the Amino Acid Kinase Superfamily (AAK). Probab=99.86 E-value=1.9e-21 Score=139.04 Aligned_cols=126 Identities=26% Similarity=0.456 Sum_probs=105.9 Q ss_pred ECHHHHHHHHHCCCEEEEC-----CCCCCCCCCEEC----CCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC----CCCC Q ss_conf 5699999999779849985-----743178732006----61479999999862511334322676335345----5555 Q 537021.9.peg.3 10 VNRTILDLLIKSGIIPVIA-----PIAPGYDGATYN----INADTFAGAIAEKLNVIRLLFLTDVPGVLDKN----RQLI 76 (142) Q Consensus 10 vd~~~i~~lL~~g~iPVi~-----pig~~~~G~~~n----in~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~----~~~i 76 (142) ++.+.|+.|+++|+|||.+ |+..+ +|...+ ||.|.+|+.||..|+||+|++|||+++||.+. .+.+ T Consensus 169 vE~~~I~~L~~~g~ivIa~GGGGIPVv~~-~~~~~GveAVIDKD~aaa~LA~~l~Ad~liiLTdV~~V~~n~g~p~q~~L 247 (308) T cd04235 169 VEIEAIKTLVDNGVIVIAAGGGGIPVVRE-GGGLKGVEAVIDKDLASALLAEEINADLLVILTDVDNVYINFGKPNQKAL 247 (308) T ss_pred ECHHHHHHHHHCCCEEEEECCCCCCEECC-CCCEECCEEEECCHHHHHHHHHHCCCCEEEEEECCHHHHHCCCCCCCHHH T ss_conf 46999999997798899828986546824-99544311021343888999986389889997164168762899860022 Q ss_pred CCCCHHHHHHHHHCCCCCCC-HHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 63388899999626976786-789999999999849988999525787689999967998663454 Q 537021.9.peg.3 77 SKLSINEARTLIKDGTISGG-MIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 77 ~~i~~~e~~~l~~~~~~~~g-m~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) ++++.+|+++|++++.+..| |.||+++|++|+++|.++++|.+ .+.+...| ....||+|+ T Consensus 248 ~~it~~e~~~~~~~g~F~~GSM~PKv~Aai~fv~~gg~~a~Its---l~~~~~al--~G~~GT~I~ 308 (308) T cd04235 248 EQVTVEELEKYIEEGQFAPGSMGPKVEAAIRFVESGGKKAIITS---LENAEAAL--EGKAGTVIV 308 (308) T ss_pred CCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEECC---HHHHHHHH--CCCCCCEEC T ss_conf 14889999999976998889836999999999994898799998---99999996--799995879 No 23 >cd04242 AAK_G5K_ProB AAK_G5K_ProB: Glutamate-5-kinase (G5K) catalyzes glutamate-dependent ATP cleavage; G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, in the first and controlling step of proline (and, in mammals, ornithine) biosynthesis. G5K is subject to feedback allosteric inhibition by proline or ornithine. In microorganisms and plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia. Microbial G5K generally consists of two domains: a catalytic G5K domain and one PUA (pseudo uridine synthases and archaeosine-specific transglycosylases) domain, and some lack the PUA domain. G5K requires free Mg for activity, it is tetrameric, and it aggregates to higher forms in a proline-dependent way. G5K lacking the PUA domain remains tetrameric, active, and proline-inhibitable, but the Mg requir Probab=99.86 E-value=5.1e-21 Score=136.72 Aligned_cols=127 Identities=30% Similarity=0.527 Sum_probs=100.9 Q ss_pred ECH-HHHHHHHHCCCEEEEC---CCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC------CCCCCCC Q ss_conf 569-9999999779849985---74317873200661479999999862511334322676335345------5555633 Q 537021.9.peg.3 10 VNR-TILDLLIKSGIIPVIA---PIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN------RQLISKL 79 (142) Q Consensus 10 vd~-~~i~~lL~~g~iPVi~---pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~------~~~i~~i 79 (142) .|. +.++.+|+.|.|||+. .++. .+..--+.|++|+.+|..++|+.|+++||+||+|+.+ .++|+++ T Consensus 110 ~n~~~ti~~LL~~g~iPIiNEND~vat---~ei~fGDnD~lAa~vA~~i~Ad~LiilTdVDGvY~~dP~~~~~Ak~i~~v 186 (251) T cd04242 110 LNARNTLETLLELGVIPIINENDTVAT---EEIRFGDNDRLSALVAGLVNADLLILLSDVDGLYDKNPRENPDAKLIPEV 186 (251) T ss_pred HHHHHHHHHHHHCCCEEEECCCCCCCC---CCCCCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCEE T ss_conf 999999999997896467558885521---22334477479999986518887999644673236898768766634263 Q ss_pred CH--HHHHHHHH---CCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 88--89999962---6976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 80 SI--NEARTLIK---DGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 80 ~~--~e~~~l~~---~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +. .++..+.. +..-+|||..|+++|..+.++|++ ++|+||+.|+.| ..++.++.+||+|+ T Consensus 187 ~~~~~~i~~~~~~~~s~~gtGGM~sKl~AA~~a~~~Gi~-~~I~~g~~~~~i-~~~l~g~~vGT~f~ 251 (251) T cd04242 187 EEITDEIEAMAGGSGSSVGTGGMRTKLKAARIATEAGIP-VVIANGRKPDVL-LDILAGEAVGTLFL 251 (251) T ss_pred CCCCHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCC-EEEEECCCCCHH-HHHHCCCCCCEEEC T ss_conf 579879997707788888738738999999999988996-999828997789-99976999841669 No 24 >TIGR01027 proB glutamate 5-kinase; InterPro: IPR005715 L-glutamate 5-phosphotransferase, (gamma-glutamyl kinase, proB, 2.7.2.11 from EC), catalyzes the first step in proline biosynthesisATP + L-glutamate = ADP + L-glutamate 5-phosphate. the product of which rapidly cyclises to 5-oxoproline and phosphate. Bacterial ProB proteins hits the full length of this model, but the ProB-like domain of delta 1-pyrroline-5-carboxylate synthetase does not hit the C-terminal 100 residues. ; GO: 0004349 glutamate 5-kinase activity, 0006561 proline biosynthetic process, 0005737 cytoplasm. Probab=99.85 E-value=3.4e-21 Score=137.66 Aligned_cols=128 Identities=28% Similarity=0.390 Sum_probs=102.1 Q ss_pred ECH-HHHHHHHHCCCEEEEC---CCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCC Q ss_conf 569-9999999779849985---743178732006614799999998625113343226763353455------555633 Q 537021.9.peg.3 10 VNR-TILDLLIKSGIIPVIA---PIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKL 79 (142) Q Consensus 10 vd~-~~i~~lL~~g~iPVi~---pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i 79 (142) -|. +.|..||+.|.||||. .++.+ |+--=|.|++++.+|...+||.|++|||++|+||.|+ ++|+.+ T Consensus 115 ~NAr~TL~~Ll~~gvvPIiNENDTVave---Ei~fGDND~LSAlvA~Lv~AD~L~LLTD~dGLYd~dPR~nPdA~~I~~V 191 (379) T TIGR01027 115 LNARNTLEALLELGVVPIINENDTVAVE---EIKFGDNDTLSALVAILVGADLLVLLTDVDGLYDADPRTNPDAKLIPVV 191 (379) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCEEEE---EEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEECCCC T ss_conf 9999999999865947898677422440---0054678169999999997528977617553306874537851453412 Q ss_pred CHH--HHHHHH--H--CC-CCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHC----CCCCCCEECC Q ss_conf 888--999996--2--69-76786789999999999849988999525787689999967----9986634549 Q 537021.9.peg.3 80 SIN--EARTLI--K--DG-TISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFT----KNGSGTLLIP 142 (142) Q Consensus 80 ~~~--e~~~l~--~--~~-~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft----~~g~GT~i~p 142 (142) ..- +...+. . +. +-||||..||.||.-|++.||| +.|.++..|+.| .++.. ++..||+|.| T Consensus 192 ~~~~~~~~~~aG~S~g~~~~GTGGM~sK~~AA~~A~~~Gv~-v~i~~g~~P~~i-~~~~~hhfyg~~~GT~F~a 263 (379) T TIGR01027 192 EEITDLLLGVAGDSSGSSKVGTGGMRSKIEAARLATRAGVP-VIIASGSKPEKI-ADALEHHFYGAPVGTVFEA 263 (379) T ss_pred CCCCHHHHEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC-EEEECCCCHHHH-HHHHHHHCCCCCCCCEECC T ss_conf 45332341111466778875664678999999999868995-899448897899-9999974037787744335 No 25 >COG1608 Predicted archaeal kinase [General function prediction only] Probab=99.84 E-value=1.6e-20 Score=133.92 Aligned_cols=138 Identities=26% Similarity=0.452 Sum_probs=112.9 Q ss_pred CCCCEEEECHHHHHHHHHCCCEEEE-CCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC- Q ss_conf 5520135569999999977984998-57431787320066147999999986251133432267633534555556338- Q 537021.9.peg.3 3 FVGNVIKVNRTILDLLIKSGIIPVI-APIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS- 80 (142) Q Consensus 3 ~~G~v~~vd~~~i~~lL~~g~iPVi-~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~- 80 (142) |-|++..-..+.++.+|+.|++||+ ..+.++++...--+++|+++..+|..++|++++|+||+||||+.++..++.+. T Consensus 108 ~~gr~~~~~l~~i~~~l~~gfvPvl~GDVv~d~~~g~~IiSGDdIv~~LA~~l~pd~v~f~tdVdGVy~~~p~~~p~~~~ 187 (252) T COG1608 108 FNGRILYTYLEAIKDALEKGFVPVLYGDVVPDDDNGYEIISGDDIVLHLAKELKPDRVIFLTDVDGVYDRDPGKVPDARL 187 (252) T ss_pred CCCCEEECHHHHHHHHHHCCCEEEEECCEEECCCCCEEEEECCHHHHHHHHHHCCCEEEEEECCCCEECCCCCCCCCCCC T ss_conf 17703220299999998769876233346874898548985118999999974987799984578235689874866200 Q ss_pred HHHHHH---HHHCC--CCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 889999---96269--767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 81 INEART---LIKDG--TISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 81 ~~e~~~---l~~~~--~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) .++++. +..++ .+||||..|++++++..+++.+ |+++|++.|+.|.. ...++.+||.|.+ T Consensus 188 l~~i~~~~~~~gs~~~DVTGGi~~Kl~~~~~~~~~~~~-vyi~ng~~~~ni~~-~l~G~~vGT~I~~ 252 (252) T COG1608 188 LSEIEGRVALGGSGGTDVTGGIAKKLEALLEIARYGKE-VYIFNGNKPENIYR-ALRGENVGTRIDG 252 (252) T ss_pred HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCE-EEEECCCCHHHHHH-HHCCCCCCEEECC T ss_conf 01443234306867621313399999999998735863-89987877778999-9658877548469 No 26 >PRK05429 gamma-glutamyl kinase; Provisional Probab=99.84 E-value=2.3e-20 Score=133.15 Aligned_cols=128 Identities=28% Similarity=0.523 Sum_probs=103.2 Q ss_pred ECH-HHHHHHHHCCCEEEEC---CCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC------CCCCCCC Q ss_conf 569-9999999779849985---74317873200661479999999862511334322676335345------5555633 Q 537021.9.peg.3 10 VNR-TILDLLIKSGIIPVIA---PIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN------RQLISKL 79 (142) Q Consensus 10 vd~-~~i~~lL~~g~iPVi~---pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~------~~~i~~i 79 (142) .|. ..++.||+.|.||||. .++-+ +..--|.|++|+.+|..++||.||+|||++|+|+.+ .++|+++ T Consensus 119 lN~r~tl~~Ll~~gvIPIINENDtVat~---Ei~fGDND~LaA~VA~li~ADlLiiLTdvdGly~~~P~~~~~A~~I~~v 195 (372) T PRK05429 119 LNARATLRTLLELGVVPIINENDTVATD---EIKFGDNDTLSALVANLVEADLLILLTDIDGLYTADPRKNPDAKLIPEV 195 (372) T ss_pred HHHHHHHHHHHHCCCEEECCCCCCCEEC---CCCCCCCCHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCEEEEE T ss_conf 9999999999877960001477740005---5555785158826877738877999745675335899768654044120 Q ss_pred CH--HHHHHHHH---CCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 88--89999962---69767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 80 SI--NEARTLIK---DGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 80 ~~--~e~~~l~~---~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) .. .++..+.. +..-+|||..|+++|..|.+.|++ ++|+||+.++. +..++.++.+||+|.| T Consensus 196 ~~i~~~i~~~~~~~~s~~GtGGM~tKl~AA~~a~~~Gi~-~iIa~G~~~~~-l~~i~~g~~~GT~f~~ 261 (372) T PRK05429 196 EAITDELEAMAGGAGSGLGTGGMATKLEAARIATRAGIP-VVIASGRKPDV-LLRLLAGEAVGTLFLP 261 (372) T ss_pred CCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCC-EEECCCCCCCH-HHHHHCCCCCEEEEEC T ss_conf 578778998716887788778437799999999977998-99924999658-9999779972699943 No 27 >PRK13402 gamma-glutamyl kinase; Provisional Probab=99.83 E-value=3e-20 Score=132.47 Aligned_cols=128 Identities=24% Similarity=0.452 Sum_probs=104.0 Q ss_pred ECH-HHHHHHHHCCCEEEEC---CCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC------CCCCCCC Q ss_conf 569-9999999779849985---74317873200661479999999862511334322676335345------5555633 Q 537021.9.peg.3 10 VNR-TILDLLIKSGIIPVIA---PIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN------RQLISKL 79 (142) Q Consensus 10 vd~-~~i~~lL~~g~iPVi~---pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~------~~~i~~i 79 (142) .|. +.+..||+.|.||||. .++.++ .--=|.|++|+.+|..++||.||+|||++|+||.+ .++|+++ T Consensus 112 lN~r~Tl~~Ll~~gvIPIINENDtVat~E---i~fGDND~LaA~VA~li~ADlLilLTDvdGLYd~dP~~~~~AklI~~V 188 (363) T PRK13402 112 ISIRNTINVLLEHGILPIINENDAVTTDK---LKVGDNDNLSAMVAALADADTLIICSDVDGLYDQNPRTNPDAKLIKEV 188 (363) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEEEE---EECCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCEEEEE T ss_conf 99999999999689422115888501001---211575079999998718888999416684447998778766436785 Q ss_pred CH--HHHHHHHH---CCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 88--89999962---69767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 80 SI--NEARTLIK---DGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 80 ~~--~e~~~l~~---~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) +. .++..+.. +..-+|||..|+++|..|.+.|++ ++|++|+.|+. +.++|.++.+||+|+| T Consensus 189 ~~i~~~i~~~a~~~~s~~GtGGM~TKl~AA~~a~~~Gi~-~~Ia~G~~~~~-i~~i~~g~~~GT~F~p 254 (363) T PRK13402 189 TEINAEIYAMAGGAGSEVGTGGMRTKIQAAKKAISHGIE-TFIINGFTADI-FNQLLKGQNPGTLFTP 254 (363) T ss_pred CCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCC-EEECCCCCCCH-HHHHHCCCCCCEEEEC T ss_conf 156677898716877888878347899999999977996-99816999747-9999708986249954 No 28 >PRK12686 carbamate kinase; Reviewed Probab=99.83 E-value=6.2e-20 Score=130.72 Aligned_cols=131 Identities=24% Similarity=0.392 Sum_probs=105.3 Q ss_pred CEEE-ECHHHHHHHHHCCCEEEEC-----CCCCCCC---CCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC---- Q ss_conf 0135-5699999999779849985-----7431787---3200661479999999862511334322676335345---- Q 537021.9.peg.3 6 NVIK-VNRTILDLLIKSGIIPVIA-----PIAPGYD---GATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN---- 72 (142) Q Consensus 6 ~v~~-vd~~~i~~lL~~g~iPVi~-----pig~~~~---G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~---- 72 (142) ++.+ ++.+.|+.|+++|+|||.+ |+..++. |-.-.||.|..|+.||..++||.|++||||++||-+. T Consensus 167 ~P~~Ive~~~I~~Lv~~g~ivIa~GGGGIPV~~~~~~~~GveAVIDKDlaaa~LA~~l~AD~liILTdV~~V~~n~g~p~ 246 (313) T PRK12686 167 KPKEIIEHDTIRTLVDGGNIVIACGGGGIPVIRDGNTLKGVEAVIDKDFASEKLAELIDADLLMILTGVENVFINFNKPN 246 (313) T ss_pred CCCCCCCHHHHHHHHHCCCEEEECCCCCCCEEEECCCEECEEEEECCCHHHHHHHHHCCCCEEEEEECCCEEEECCCCCC T ss_conf 98411469999999968988998279874379758967630224254478899998619998999726660232189975 Q ss_pred CCCCCCCCHHHHHHHHHCCCCCCC-HHHHHHHHHHHHHCC-CCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 555563388899999626976786-789999999999849-988999525787689999967998663454 Q 537021.9.peg.3 73 RQLISKLSINEARTLIKDGTISGG-MIPKIETSIKAIENG-VKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 73 ~~~i~~i~~~e~~~l~~~~~~~~g-m~~Ki~~a~~a~~~G-v~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) .+.+++++.+|++++++++.+..| |.||+++|++|+++| .+++.|. ..+.+...| ....||.|. T Consensus 247 q~~l~~it~~e~~~~~~~g~F~~GSM~PKVeAai~Fve~g~g~~aiI~---~l~~~~~al--~G~~GT~It 312 (313) T PRK12686 247 QQKLDDISVAEAKQYIAEGQFAPGSMLPKVEAAIDFVESGKNKKAIIT---SLEQAKEAL--AGNAGTHIT 312 (313) T ss_pred HHHHHCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEC---CHHHHHHHH--CCCCCCEEE T ss_conf 223521899999999975999888827999999999980999869989---899999996--799987871 No 29 >PRK12352 putative carbamate kinase; Reviewed Probab=99.82 E-value=1.4e-19 Score=128.84 Aligned_cols=127 Identities=24% Similarity=0.403 Sum_probs=107.0 Q ss_pred ECHHHHHHHHHCCCEEEEC-----CCCCCCCCCEECC----CHHHHHHHHHHHHCHHHHHHHCCCCCEEECC----CCCC Q ss_conf 5699999999779849985-----7431787320066----1479999999862511334322676335345----5555 Q 537021.9.peg.3 10 VNRTILDLLIKSGIIPVIA-----PIAPGYDGATYNI----NADTFAGAIAEKLNVIRLLFLTDVPGVLDKN----RQLI 76 (142) Q Consensus 10 vd~~~i~~lL~~g~iPVi~-----pig~~~~G~~~ni----n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~----~~~i 76 (142) ++.+.|+.|+++|.|||.+ |+..+++|....+ |-|.+++.||..++||.|++||||++||-+. .+.+ T Consensus 174 vE~~~Ik~L~~~g~iVIa~GGGGIPV~~~~~g~l~GVeAVIDKD~asa~LA~~l~AD~liILTdVd~V~~n~gkp~q~~L 253 (316) T PRK12352 174 VEAPAIKALIQQGFVVIGAGGGGIPVVRTDAGDYQSVDAVIDKDLSTALLAREIHADILVITTGVEKVCIHFGKPQQQAL 253 (316) T ss_pred ECHHHHHHHHHCCCEEEEECCCCCCEEECCCCCEECCEEEECCCHHHHHHHHHHCCCEEEEEECCCEEEEECCCCCHHHH T ss_conf 05999999997898899847987533762898565431466352788999997099989997356603540899651135 Q ss_pred CCCCHHHHHHHHHCCCCCCC-HHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 63388899999626976786-789999999999849988999525787689999967998663454 Q 537021.9.peg.3 77 SKLSINEARTLIKDGTISGG-MIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 77 ~~i~~~e~~~l~~~~~~~~g-m~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) ++++.+|++++++++.+..| |.||+++|++++++|.+++.|.+ .+.+...| ....||.|+ T Consensus 254 ~~vt~~e~~~y~~eg~F~~GSM~PKV~Aai~Fve~gg~~aiIts---l~~~~~al--~G~aGT~Iv 314 (316) T PRK12352 254 DRVDIATMTRYMQEGHFPPGSMLPKIIASLTFLEQGGKEVIITT---PECLPAAL--RGETGTHII 314 (316) T ss_pred HHCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEECC---HHHHHHHH--CCCCCCEEE T ss_conf 30899999999876999888837999999999994798699897---89999996--799986794 No 30 >KOG2436 consensus Probab=99.81 E-value=5e-21 Score=136.76 Aligned_cols=141 Identities=26% Similarity=0.424 Sum_probs=122.2 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 98552013556999999997798499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) +|++|+|.+||.+.|+.+++.|.+|++++++.+.+|+++|+|+|++|.++|.+|+|+|+++++|++-+++.+++.++.++ T Consensus 215 y~~~gei~~vd~d~i~~l~~~G~mp~L~sla~TaSGqvlnvNa~~~a~elA~~L~~~kli~l~d~g~~l~e~ge~~S~l~ 294 (520) T KOG2436 215 YGLVGEIKKVDVDRIRHLLDAGSMPLLRSLAATASGQVLNVNADEVAGELALALGPDKLILLMDKGRILKENGEDISSLI 294 (520) T ss_pred EEEECCCCEECHHHHHHHHHCCCCHHEHHHCCCCCCCEEEEEHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCC T ss_conf 22003300311524235652798221110113676616875178776688743584006875045542455755564021 Q ss_pred HHHHHHHHHC----CCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 8899999626----9767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 81 INEARTLIKD----GTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 81 ~~e~~~l~~~----~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) ..+...++.. ..++.+|.+++..+.....-+... ..+..+...+++.|+|+++|.||+|-| T Consensus 295 l~~e~~~l~k~~qq~~~a~~~v~aV~~~~~~~~p~~~s-~~i~~~t~~n~~~~~~te~G~~t~~~~ 359 (520) T KOG2436 295 LQEEDAGLRKPSQQKNIAANNVKAVKDGIDSSLPRPSS-YNIAITTQQNLIKELFTEKGAGTLISG 359 (520) T ss_pred CCHHHHHHHHHHHHCCCCCCCCHHHHHHEEECCCCCCC-CCCCEEECCCCCCEEECCCCCCCCCCC T ss_conf 66667655414653022000223554310111676777-774136402365401026898732148 No 31 >TIGR00746 arcC carbamate kinase; InterPro: IPR003964 The arginine dihydrolase (AD) pathway is found in many prokaryotes and some primitive eukaryotes. The three- enzyme anaerobic pathway breaks down L-arginine to form 1 mol of ATP, carbon dioxide and ammonia. In simpler bacteria, the first enzyme, arginine deiminase, can account for up to 100f total cell protein . Carbamate kinase is involved in the last step of the AD pathway, converting carbamoyl phosphate and ADP into ammonia, carbon dioxide and ATP . The second step of the pathway involves the degradation of L-citrulline to carbamoyl phosphate and L-ornithine, using ornithine carbamoyltransferase . The crystal structure of Enterococcus faecium carbamate kinase has been determined to 2.8A resolution . The enzyme exists as a homodimer of two 33kDa subunits. The hallmark of the dimer is a 16-stranded beta-sheet, surrounded by alpha-helices. Each subunit contains an active site within a large crevice.; GO: 0008804 carbamate kinase activity, 0006525 arginine metabolic process. Probab=99.80 E-value=1.1e-19 Score=129.42 Aligned_cols=129 Identities=25% Similarity=0.381 Sum_probs=106.6 Q ss_pred EECHHHHHHHHHCCCEEEEC-----CCCCCCCC-CEEC----CCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC----C Q ss_conf 55699999999779849985-----74317873-2006----614799999998625113343226763353455----5 Q 537021.9.peg.3 9 KVNRTILDLLIKSGIIPVIA-----PIAPGYDG-ATYN----INADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR----Q 74 (142) Q Consensus 9 ~vd~~~i~~lL~~g~iPVi~-----pig~~~~G-~~~n----in~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~----~ 74 (142) -++.+.|+.|++.|.|+|-| |+..+.++ .+-. ||-|.++..||..++||-|++||||+.+|-+.+ + T Consensus 176 I~E~~~Ik~L~e~g~iVI~~GGGGvPv~~dg~~kkl~GVeAVIDKDlA~~~LA~~~~AD~L~ILTDVd~vy~nygkP~e~ 255 (321) T TIGR00746 176 IVEAEVIKTLVENGVIVICSGGGGVPVVEDGAEKKLKGVEAVIDKDLASEKLAEEVNADILVILTDVDAVYVNYGKPDEK 255 (321) T ss_pred CEECHHHHHHHHCCEEEEEECCCCEEEEECCCCCEEEEEEEEECHHHHHHHHHHHHCCCEEEEEHHHHHHHHCCCCHHHH T ss_conf 31207789986488279985788203566289430220110323668889888774387443102012452207782478 Q ss_pred CCCCCCHHHHHHHHHCCCCCCC-HHHHHHHHHHHHHCCCC-EEEEEECCCCCHHHHHHHCCCC-CCCEECC Q ss_conf 5563388899999626976786-78999999999984998-8999525787689999967998-6634549 Q 537021.9.peg.3 75 LISKLSINEARTLIKDGTISGG-MIPKIETSIKAIENGVK-SVAILDGKKPHSILMEIFTKNG-SGTLLIP 142 (142) Q Consensus 75 ~i~~i~~~e~~~l~~~~~~~~g-m~~Ki~~a~~a~~~Gv~-~v~i~~~~~~~~ll~elft~~g-~GT~i~p 142 (142) .+++++.+|++.|.+.+.+..| |.|||+||+++++.+.+ +++|.+ .+.+ .+++ +.. .||.|+| T Consensus 256 ~L~~~~~~El~~~~~~G~Fa~GSMgPKV~AaI~Fv~~~G~~~AiIt~---L~~~-~~~l-~g~iaGT~v~~ 321 (321) T TIGR00746 256 KLREVTVEELEDYEKDGHFAAGSMGPKVEAAIEFVESRGKKRAIITS---LETA-VEAL-EGKIAGTRVSK 321 (321) T ss_pred HHHCCCHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCEECC---HHHH-HHHH-CCCEEEEEEEC T ss_conf 87246988999998558866889860389998998718887400058---5899-9864-69732214509 No 32 >PRK12354 carbamate kinase; Reviewed Probab=99.80 E-value=4.5e-19 Score=125.99 Aligned_cols=125 Identities=29% Similarity=0.374 Sum_probs=103.7 Q ss_pred ECHHHHHHHHHCCCEEEEC-----CCCCCCCCCEECC----CHHHHHHHHHHHHCHHHHHHHCCCCCEEECC----CCCC Q ss_conf 5699999999779849985-----7431787320066----1479999999862511334322676335345----5555 Q 537021.9.peg.3 10 VNRTILDLLIKSGIIPVIA-----PIAPGYDGATYNI----NADTFAGAIAEKLNVIRLLFLTDVPGVLDKN----RQLI 76 (142) Q Consensus 10 vd~~~i~~lL~~g~iPVi~-----pig~~~~G~~~ni----n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~----~~~i 76 (142) ++.+.|+.|+++|.++|.+ |+..+++|....+ |-|.+++.||..++||.|++||||++||-+. .+.+ T Consensus 164 vE~~~Ik~L~~~g~iVIa~GGGGIPVv~~~~g~l~GveAVIDKD~aaa~LA~~l~AD~liILT~Vd~V~~n~gkp~q~~l 243 (302) T PRK12354 164 VEIRAIKWLLEKGHIVICAGGGGIPVVYDADGKLHGVEAVIDKDLAAALLAEQLDADLLLILTDVDAVYLDWGKPTQRPL 243 (302) T ss_pred ECHHHHHHHHHCCCEEEEECCCCCCEEECCCCCEECCEEEECCCHHHHHHHHHHCCCEEEEEECCCHHHHCCCCCCHHHH T ss_conf 64999999997899899847886013986899686504566340788999987099989998565422024899660023 Q ss_pred CCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 633888999996269767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 77 SKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 77 ~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) ++++.+|++++ ....|.|.||+++|+++++++.+++.|.+ .+.+...| ....||.|+| T Consensus 244 ~~~t~~el~~~---~F~~GSM~PKVeAai~Fv~~~g~~aiIts---l~~~~~al--~G~~GT~I~p 301 (302) T PRK12354 244 AQATPDELREL---GFAAGSMGPKVEAACEFVEATGGIAGIGS---LADILAIL--AGEAGTRISP 301 (302) T ss_pred HCCCHHHHHHC---CCCCCCCHHHHHHHHHHHHCCCCEEEECC---HHHHHHHH--CCCCCCEECC T ss_conf 14889999737---99988836999999999982798799897---99999996--7999968688 No 33 >COG0549 ArcC Carbamate kinase [Amino acid transport and metabolism] Probab=99.78 E-value=1.6e-18 Score=123.00 Aligned_cols=131 Identities=25% Similarity=0.428 Sum_probs=107.6 Q ss_pred EEE-ECHHHHHHHHHCCCEEEEC-----CCCCCCCC---CEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC---- Q ss_conf 135-5699999999779849985-----74317873---2006614799999998625113343226763353455---- Q 537021.9.peg.3 7 VIK-VNRTILDLLIKSGIIPVIA-----PIAPGYDG---ATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR---- 73 (142) Q Consensus 7 v~~-vd~~~i~~lL~~g~iPVi~-----pig~~~~G---~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~---- 73 (142) +++ ++.+.|+.|+++|.++|.+ |+..+..| -.=-||-|.+++.||..++||-|++|||+++||-+.| T Consensus 168 P~~IvE~~~Ik~L~~~g~vVI~~GGGGIPVv~~~~~~~GVeAVIDKDlasalLA~~i~AD~liILTdVd~Vy~n~gkp~q 247 (312) T COG0549 168 PVRIVEAEAIKALLESGHVVIAAGGGGIPVVEEGAGLQGVEAVIDKDLASALLAEQIDADLLIILTDVDAVYVNFGKPNQ 247 (312) T ss_pred CCCCHHHHHHHHHHHCCCEEEEECCCCCCEEECCCCCCEEEEEECCHHHHHHHHHHHCCCEEEEEECCCHHEECCCCCCC T ss_conf 71203178899998679889995788744486189731036898118778999987158989998546511112799551 Q ss_pred CCCCCCCHHHHHHHHHCCCCCCC-HHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 55563388899999626976786-7899999999998499889995257876899999679986634549 Q 537021.9.peg.3 74 QLISKLSINEARTLIKDGTISGG-MIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 74 ~~i~~i~~~e~~~l~~~~~~~~g-m~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) +.+++++.+|++++++++.+..| |.||+++|+++++++.+++.|.+- +.+ .+.+ ....||.|+| T Consensus 248 ~~L~~v~~~e~~~yl~eg~Fa~GSM~PKVeAai~FV~~~gk~A~ItsL---e~~-~~~l-~g~~GT~I~~ 312 (312) T COG0549 248 QALDRVTVDEMEKYLAEGQFAAGSMGPKVEAAISFVENTGKPAIITSL---ENA-EAAL-EGKAGTVIVP 312 (312) T ss_pred HHHCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCEEECCH---HHH-HHHH-CCCCCCEECC T ss_conf 122246999999998647988887318899999999847983697759---888-9986-4678858519 No 34 >COG0263 ProB Glutamate 5-kinase [Amino acid transport and metabolism] Probab=99.77 E-value=4.8e-18 Score=120.32 Aligned_cols=128 Identities=30% Similarity=0.463 Sum_probs=102.0 Q ss_pred ECH-HHHHHHHHCCCEEEEC---CCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCC Q ss_conf 569-9999999779849985---743178732006614799999998625113343226763353455------555633 Q 537021.9.peg.3 10 VNR-TILDLLIKSGIIPVIA---PIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKL 79 (142) Q Consensus 10 vd~-~~i~~lL~~g~iPVi~---pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i 79 (142) .|. ..+..||+.|.||||. .++.+ ++--=|.|.+++.+|...+||.|++|||++|+||.|+ ++|+++ T Consensus 117 ~Nar~Tl~~Ll~~gvVPIINENDtva~~---EikfGDND~LsA~VA~lv~ADlLvlLsDiDGLyd~nPr~~pdAk~i~~V 193 (369) T COG0263 117 LNARNTLSALLELGVVPIINENDTVATE---EIKFGDNDTLSALVAILVGADLLVLLSDIDGLYDANPRTNPDAKLIPEV 193 (369) T ss_pred HHHHHHHHHHHHCCCEEEECCCCCEEEE---EEEECCCCHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCEEEHHH T ss_conf 9999999999978962246487860441---0365687469999998748787999874685558998879987461021 Q ss_pred CH--HHHHHHHH---CCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 88--89999962---69767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 80 SI--NEARTLIK---DGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 80 ~~--~e~~~l~~---~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) +. .+++.+.. +..-||||..|+++|.-|.+.|++ +.|.+|+.|+.++ ++..++..||+|.| T Consensus 194 ~~it~ei~~~aggsgs~~GTGGM~TKl~AA~iA~~aG~~-~iI~~g~~~~~i~-~~~~~~~~GT~F~~ 259 (369) T COG0263 194 EEITPEIEAMAGGSGSELGTGGMRTKLEAAKIATRAGVP-VIIASGSKPDVIL-DALEGEAVGTLFEP 259 (369) T ss_pred CCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHCCCC-EEEECCCCCCHHH-HHHHCCCCCCEEEC T ss_conf 346888998746777777756188899999999976995-8994699831589-99718988518935 No 35 >cd04239 AAK_UMPK-like AAK_UMPK-like: UMP kinase (UMPK)-like, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis. Regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinases of E. coli (Ec) and Pyrococcus furiosus (Pf) are known to function as homohexamers, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial UMPKs have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Als Probab=99.76 E-value=4.2e-18 Score=120.62 Aligned_cols=120 Identities=22% Similarity=0.239 Sum_probs=97.0 Q ss_pred EEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCCH Q ss_conf 355699999999779849985743178732006614799999998625113343226763353455------55563388 Q 537021.9.peg.3 8 IKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLSI 81 (142) Q Consensus 8 ~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~~ 81 (142) +..+.+.+..+|+.|.|||+..+ + +..+ .+.|.+|+.+|..++||.||++||++|+|+.|+ ++|++++. T Consensus 104 ~~~~~~~~~~~l~~g~ipV~~~~--~--~~~~-~dnD~lAa~vA~~v~AD~lIilTDVDGlY~~dP~~~~~Ak~I~~is~ 178 (229) T cd04239 104 EPYIRRRAIRHLEKGRIVIFGGG--T--GNPG-FTTDTAAALRAEEIGADVLLKATNVDGVYDADPKKNPDAKKYDRISY 178 (229) T ss_pred HHHHHHHHHHHHHCCCEEEEECC--C--CCCC-CCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCEECCCCCH T ss_conf 02239999999853977998365--5--7887-65008999999972998898611238617889887998748771289 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 82 NEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 82 ~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +|+.+ .++|..|+.+|..|.++|++ ++|+||+.|+.| .++|.++.+||+|. T Consensus 179 ~e~~~-------~g~~~~k~~Aa~~a~~~Gi~-~~I~ng~~~~~i-~~~l~G~~vGTlI~ 229 (229) T cd04239 179 DELLK-------KGLKVMDATALTLCRRNKIP-IIVFNGLKPGNL-LRALKGEHVGTLIE 229 (229) T ss_pred HHHHH-------CCCCEECHHHHHHHHHCCCC-EEEECCCCCCHH-HHHHCCCCCCEEEC T ss_conf 99987-------59874449999999987996-999779993389-99977999981659 No 36 >cd04253 AAK_UMPK-PyrH-Pf AAK_UMPK-PyrH-Pf: UMP kinase (UMPK)-Pf, the mostly archaeal uridine monophosphate kinase (uridylate kinase) enzymes that catalyze UMP phosphorylation and play a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of Pyrococcus furiosus (Pf) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial UMPKs have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs (this CD) appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of thi Probab=99.71 E-value=1.5e-16 Score=112.02 Aligned_cols=129 Identities=19% Similarity=0.232 Sum_probs=100.9 Q ss_pred CCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC------CCCCC Q ss_conf 520135569999999977984998574317873200661479999999862511334322676335345------55556 Q 537021.9.peg.3 4 VGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN------RQLIS 77 (142) Q Consensus 4 ~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~------~~~i~ 77 (142) .......+.+...+.++.|.+|++.-. ..+ .+.|.+|+.+|..++|+.|+++||++|+|+.| .++|+ T Consensus 85 ~~~~~~~~~~~~~~~l~~~~i~v~~g~---~p~----~ttD~laa~vA~~i~Ad~li~ltdVdGvY~~DP~~~~~Ak~i~ 157 (221) T cd04253 85 AYPPVPTSYEEALEAMFTGKIVVMGGT---EPG----QSTDAVAALLAERLGADLLINATNVDGVYSKDPRKDPDAKKFD 157 (221) T ss_pred CCCCCCCCHHHHHHHHHCCCEEEEECC---CCC----CCCCHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCEECC T ss_conf 587664659999999974986999478---898----8867999999998298747886435875578987799877886 Q ss_pred CCCHHHHHHHHHCCC--CCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 338889999962697--6786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 78 KLSINEARTLIKDGT--ISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 78 ~i~~~e~~~l~~~~~--~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +++.+|+.++..... ..+||..+..+|..+.++|++ ++|+||+.|+.| ..++.+|.+||+|. T Consensus 158 ~is~~el~~~~~~~~~~~g~~~~~d~~Aa~~a~~~gi~-~~I~nG~~~~~i-~~~l~Ge~vGTlI~ 221 (221) T cd04253 158 RLSADELIDIVGKSSWKAGSNEPFDPLAAKIIERSGIK-TIVVDGRDPENL-ERALKGEFVGTIIE 221 (221) T ss_pred CCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCC-EEEECCCCCCHH-HHHHCCCCCCEEEC T ss_conf 58899999883567756898764799999999987998-899869997589-99977999971669 No 37 >PRK09411 carbamate kinase; Reviewed Probab=99.71 E-value=8.7e-17 Score=113.34 Aligned_cols=124 Identities=23% Similarity=0.319 Sum_probs=100.6 Q ss_pred ECHHHHHHHHHCCCEEEEC-----CCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC----CCCCCCCC Q ss_conf 5699999999779849985-----74317873200661479999999862511334322676335345----55556338 Q 537021.9.peg.3 10 VNRTILDLLIKSGIIPVIA-----PIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN----RQLISKLS 80 (142) Q Consensus 10 vd~~~i~~lL~~g~iPVi~-----pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~----~~~i~~i~ 80 (142) ++.+.|+.|+++|.++|.+ |+.-+..|-.--||-|.+++.||..++||.|++||||++||-+. .+.+++++ T Consensus 164 vE~~~Ik~Lv~~g~iVIa~GGGGIPV~~~~~GVeAVIDKD~asalLA~~l~AD~liILTdVd~V~~n~g~p~q~~L~~~t 243 (297) T PRK09411 164 LDSEAIELLLKEGHVVICSGGGGVPVTEDGAGSEAVIDKDLAAALLAEQINADGLVILTDADAVYENWGTPQQRAIRHAT 243 (297) T ss_pred ECHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCEEEECCCHHHHHHHHHHCCCEEEEEECCCCHHHCCCCCCHHHHHCCC T ss_conf 36999999998698799659887055898535213663417789999972999899974665010138992433562189 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 8899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 81 INEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 81 ~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) .+|++++.. ..|.|.||+++|+++++++.+++.|.+ .+.+...| .+..||.|. T Consensus 244 ~~el~~~~~---~~GSM~PKVeAai~Fv~~~G~~aiIts---l~~~~~al--~G~~GT~It 296 (297) T PRK09411 244 PDELAPFAK---ADGAMGPKVTAVSGYVRSRGKPAWIGA---LSRIEETL--AGEAGTCIS 296 (297) T ss_pred HHHHHHHCC---CCCCCHHHHHHHHHHHHHCCCEEEECC---HHHHHHHH--CCCCCCEEE T ss_conf 999983267---989864899999999995798699897---99999996--799987872 No 38 >PRK00358 pyrH uridylate kinase; Provisional Probab=99.70 E-value=2.4e-16 Score=110.95 Aligned_cols=119 Identities=22% Similarity=0.280 Sum_probs=90.9 Q ss_pred EECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCCHH Q ss_conf 55699999999779849985743178732006614799999998625113343226763353455------555633888 Q 537021.9.peg.3 9 KVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLSIN 82 (142) Q Consensus 9 ~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~~~ 82 (142) ..+.+-....|+.|.|||+.-... ..+ .+.|++|+.+|..++|+.|+++||++|+|+.|+ ++|++++.. T Consensus 107 ~~~~~~~~~~l~~g~v~Iv~~~~~----~~~-~~nD~laa~vA~~i~AD~liilTDVDGlYt~dP~~~~dAk~I~~Is~~ 181 (231) T PRK00358 107 PYIRRRAIRHLEKGRVVIFAAGTG----NPF-FTTDTAAALRAEEIGADVLLKATKVDGVYDADPKKDPDAKKYDTLTYD 181 (231) T ss_pred CCCHHHHHHHHHCCEEEEEECCCC----CCC-CCCHHHHHHHHHHCCCCEEEEEECCCCEECCCCCCCCCCEECCCCCHH T ss_conf 010899998753680799966756----878-771089999998659988988622374166797669986682545799 Q ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 99999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 83 EARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 83 e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) |. +. .+.+.-|+.+|..|.++|++ ++|+||+.|+.| ..++.+|.+||+|. T Consensus 182 e~---~~----~g~~~~~~~Aa~~a~~~Gi~-~~I~ng~~~~~i-~~il~Ge~vGTlI~ 231 (231) T PRK00358 182 EV---LD----KGLKVMDPTAISLARDNKIP-IIVFNGNKPGNL-KRVVKGENIGTLVS 231 (231) T ss_pred HH---HH----CCCCHHHHHHHHHHHHCCCC-EEEECCCCCCHH-HHHHCCCCCCEEEC T ss_conf 99---86----77542789999999987998-999879993489-99977999874749 No 39 >TIGR01092 P5CS delta l-pyrroline-5-carboxylate synthetase; InterPro: IPR005766 delta l-pyrroline-5-carboxylate synthetase contains a glutamate 5-kinase (ProB, 2.7.2.11 from EC) region followed by a gamma-glutamyl phosphate reductase (ProA, 1.2.1.41 from EC) region and catalyses the first and second steps in proline biosynthesis.; GO: 0003824 catalytic activity, 0006561 proline biosynthetic process. Probab=99.57 E-value=1.9e-15 Score=105.98 Aligned_cols=129 Identities=25% Similarity=0.422 Sum_probs=100.6 Q ss_pred HHHHHHHHHCCCEEEEC-------CCCC--CCCCC-EEC-CCHHHHHHHHHHHHCHHHHHHHCCCCCEEEC-----CCCC Q ss_conf 99999999779849985-------7431--78732-006-6147999999986251133432267633534-----5555 Q 537021.9.peg.3 12 RTILDLLIKSGIIPVIA-------PIAP--GYDGA-TYN-INADTFAGAIAEKLNVIRLLFLTDVPGVLDK-----NRQL 75 (142) Q Consensus 12 ~~~i~~lL~~g~iPVi~-------pig~--~~~G~-~~n-in~D~~a~~lA~~l~A~kli~ltdv~Gv~d~-----~~~~ 75 (142) .+.++++|+...|||+. |-.+ |..|= +.. -|.|.+|+.||..|+||-||+|+||+|+|+. +.++ T Consensus 141 ~~T~~~lL~~~vvPv~N~NDAv~~~~~P~~D~~Ginvis~~DNDSLAA~LalElK~DLLIlLSDVeGLy~~PPs~~~~KL 220 (738) T TIGR01092 141 NETVKELLRMKVVPVLNENDAVSTRKAPYSDSQGINVISFWDNDSLAALLALELKADLLILLSDVEGLYDGPPSDDDSKL 220 (738) T ss_pred HHHHHHHHCCEEEEEEECCCCCCCCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHEEEEECCCCCCCCCCCCCCCCE T ss_conf 99999985230335651787547754775434660589983364799999987744040201030223485113787626 Q ss_pred CCCCCHHHHH---HHHH-CCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCH---HHHHHHCCCCCCCEEC Q ss_conf 5633888999---9962-697678678999999999984998899952578768---9999967998663454 Q 537021.9.peg.3 76 ISKLSINEAR---TLIK-DGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHS---ILMEIFTKNGSGTLLI 141 (142) Q Consensus 76 i~~i~~~e~~---~l~~-~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~---ll~elft~~g~GT~i~ 141 (142) |..+-..+-+ .+-. +..-.|||.-||+||+.|+..|++ |.|.+|..+.. ++..+..+..+||+|- T Consensus 221 I~~f~~~~~Q~~iTFG~KSrlG~GGM~AKVKAA~~A~~gG~~-ViIa~G~~~~~sGhvI~~v~~G~~VGTlF~ 292 (738) T TIGR01092 221 IDTFVKEKHQDEITFGEKSRLGRGGMTAKVKAAVYAAYGGVP-VIIASGTAAKNSGHVITKVVEGKKVGTLFH 292 (738) T ss_pred EEEECCCCCCCCEECCCCCCCCCCCCHHHHHHHHHHHHCCEE-EEECCCCCCCCCCCEEEEECCCCEEECCEE T ss_conf 887306754560103652223688502789999999748916-998168565336533221205732200100 No 40 >cd04254 AAK_UMPK-PyrH-Ec UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of E. coli (Ec) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial and chloroplast UMPKs (this CD) have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of this CD be Probab=99.55 E-value=5.6e-14 Score=97.85 Aligned_cols=120 Identities=22% Similarity=0.269 Sum_probs=91.9 Q ss_pred EEEECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCC Q ss_conf 1355699999999779849985743178732006614799999998625113343226763353455------5556338 Q 537021.9.peg.3 7 VIKVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLS 80 (142) Q Consensus 7 v~~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~ 80 (142) .++.+.+.+...|+.|.|||+.. ..|..+ .+.|.+|+.+|..++|+.++++|||+|+|+.++ +++++++ T Consensus 105 ~~~~~~~~~~~~L~~g~VvI~~g----~~g~~~-~ttD~~aa~~A~~~~ad~~~~~tdvdgvy~~dP~~~~~a~~~~~is 179 (231) T cd04254 105 AEPYIRRRAIRHLEKGRVVIFAG----GTGNPF-FTTDTAAALRAIEINADVILKATKVDGVYDADPKKNPNAKRYDHLT 179 (231) T ss_pred CCCCCHHHHHHHHHCCCEEEEEC----CCCCCC-CCCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCEECCCCC T ss_conf 23112899999973598899966----757876-5511799999987299889997631662268965575555403056 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 889999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 81 INEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 81 ~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) .+|+-+ .+ ...| +..++.-|.++|+| ++|+|++.|+.|.. +..+|..||+| T Consensus 180 ~~e~~~---~~--~~v~--d~~A~~~a~~~~i~-~~v~~~~~~~~I~k-~l~Ge~~GTlI 230 (231) T cd04254 180 YDEVLS---KG--LKVM--DATAFTLCRDNNLP-IVVFNINEPGNLLK-AVKGEGVGTLI 230 (231) T ss_pred HHHHHH---CC--CCCC--CHHHHHHHHHCCCC-EEEECCCCCCHHHH-HHCCCCCCEEE T ss_conf 999984---77--6624--39999999986998-89973999526999-97799886475 No 41 >cd04240 AAK_UC AAK_UC: Uncharacterized (UC) amino acid kinase-like proteins found mainly in archaea and a few bacteria. Sequences in this CD are members of the Amino Acid Kinase (AAK) superfamily. Probab=99.53 E-value=3.6e-14 Score=98.90 Aligned_cols=119 Identities=23% Similarity=0.441 Sum_probs=96.4 Q ss_pred HHHHHHHHHCCCEEEECCCCC----CCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCCHHHHHHH Q ss_conf 999999997798499857431----7873200661479999999862511334322676335345555563388899999 Q 537021.9.peg.3 12 RTILDLLIKSGIIPVIAPIAP----GYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLSINEARTL 87 (142) Q Consensus 12 ~~~i~~lL~~g~iPVi~pig~----~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~~~e~~~l 87 (142) ...++..++++.+||+-|-.+ ++--+.+++++|.+|+.+|..++|++||++||+||+|+.+++++++++.+++.. T Consensus 81 ~~~~~~~~~~~~~~v~lP~~~l~~~~~lp~sWdvtsDs~a~~~A~~l~a~~li~~~~vdGiy~~~~kl~~~i~~~~l~~- 159 (203) T cd04240 81 LAELTDVLERGKIAILLPYRLLLDTDPLPHSWEVTSDSIAAWLAKKLGAKRLVIVTDVDGIYEKDGKLVNEIAAAELLG- 159 (203) T ss_pred HHHHHHHHHCCCCEEEECHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHCCHHHHCC- T ss_conf 4779999733891587115875158988878776618999999997598858998567755187505541158988648- Q ss_pred HHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 62697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 88 IKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 88 ~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) ...-+.+.+|+ +.+++++ ++++||+.|+.++..|+..+..||.| T Consensus 160 ---~~~VD~~~~~~-----~~~~~i~-~~VvnG~~p~~i~~~l~g~~~~GT~I 203 (203) T cd04240 160 ---ETSVDPAFPRL-----LTKYGIR-CYVVNGDDPERVLAALRGREGVGTRI 203 (203) T ss_pred ---CCCHHHHHHHH-----HHHCCCE-EEEECCCCHHHHHHHHCCCCCCCCCC T ss_conf ---87410778999-----9976963-99987998799999986898798579 No 42 >TIGR00657 asp_kinases aspartate kinase; InterPro: IPR001341 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . Aspartate kinase (2.7.2.4 from EC) (AK) catalyzes the first reaction in the aspartate pathway; the phosphorylation of aspartate. The product of this reaction can then be used in the biosynthesis of lysine or in the pathway leading to homoserine, which participates in the biosynthesis of threonine, isoleucine and methionine . In bacteria there are three different aspartate kinase isozymes which differ in sensitivity to repression and inhibition by Lys, Met and Thr. AK1 and AK2 are bifunctional enzymes which both consist of an N-terminal AK domain and a C-terminal homoserine dehydrogenase domain. AK1 is involved in threonine biosynthesis and AK2, in that of methionine. The third isozyme, AK3 is monofunctional and involved in lysine synthesis. In archaea and plants there may be a single isozyme of AK which in plants is multifunctional. This entry represents a region encoding aspartate kinase activity found in both the monofunctional and bifunctional enzymes. Synonym(s): Aspartokinase; GO: 0004072 aspartate kinase activity, 0008652 amino acid biosynthetic process. Probab=99.53 E-value=4.7e-14 Score=98.24 Aligned_cols=122 Identities=23% Similarity=0.352 Sum_probs=97.0 Q ss_pred CCEEEECHHHHHHHHHCC-CEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------ Q ss_conf 201355699999999779-84998574-31787320066---14799999998625113343226763353455------ Q 537021.9.peg.3 5 GNVIKVNRTILDLLIKSG-IIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------ 73 (142) Q Consensus 5 G~v~~vd~~~i~~lL~~g-~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------ 73 (142) +.+..+.++.|..+|+.| +|||++-+ |.+++|++..+ =+|..|+.||.+|+|+.+.+.|||||||..|+ T Consensus 177 ~~~~~~~~~rL~~~L~~g~~ipvvaGF~G~~~~g~~TtLGRGGSD~tA~llA~aL~Ad~~~IyTDV~Gi~T~DPR~V~~A 256 (504) T TIGR00657 177 AVIIEILTERLEPLLEEGDIIPVVAGFQGATEKGEITTLGRGGSDYTAALLAAALKADECEIYTDVDGIYTTDPRIVPDA 256 (504) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCEEEEEECCHHHHHHHHHHHHHCCCEEEEEECCCCCEECCCCCCCCC T ss_conf 46777669999999846987999840041057750798306806899999998619968999872795032788637665 Q ss_pred CCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHH-HHHH-CCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 55563388899999626976786789999999-9998-499889995257876899999679986634549 Q 537021.9.peg.3 74 QLISKLSINEARTLIKDGTISGGMIPKIETSI-KAIE-NGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 74 ~~i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~-~a~~-~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) |+|++|+.+|+.+|...|. +.--..++ -+.+ .++| +++-|- |+.+.-||+|.| T Consensus 257 r~i~~isy~Em~ELA~~GA-----KVLHPr~~ep~~~~~~ip-~~v~st----------~~~~a~GT~I~~ 311 (504) T TIGR00657 257 RRIDEISYEEMLELASFGA-----KVLHPRTLEPAMRDYKIP-IVVKST----------FNPEAPGTLIVA 311 (504) T ss_pred CCCCCCCHHHHHHHHHHCH-----HHHHHHHHHHHHCCCCCC-EEEECC----------CCCCCCCEEECC T ss_conf 2045448789999985031-----653278899984679951-899607----------788999668717 No 43 >KOG1154 consensus Probab=99.50 E-value=2.2e-14 Score=100.06 Aligned_cols=124 Identities=26% Similarity=0.492 Sum_probs=93.0 Q ss_pred HHHHHHHHCCCEEEEC---CCCCC--CCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC-----CCCCCCCCHH Q ss_conf 9999999779849985---74317--873200661479999999862511334322676335345-----5555633888 Q 537021.9.peg.3 13 TILDLLIKSGIIPVIA---PIAPG--YDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN-----RQLISKLSIN 82 (142) Q Consensus 13 ~~i~~lL~~g~iPVi~---pig~~--~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~-----~~~i~~i~~~ 82 (142) ..|+++|+.|.|||+. .+... ..|+. -|.|.+|+.+|..++||-||+||||+|+|+.. .++|...+.. T Consensus 136 ~Ti~eLL~m~viPIvNeNDavs~~~~~~~D~--~dNDsLsA~laaei~ADlLilLsDVdglYt~PPd~~~~~li~~~~~~ 213 (285) T KOG1154 136 NTISELLSMNVIPIVNENDAVSPREIPFGDS--SDNDSLAAILAAEIKADLLILLSDVDGLYTGPPDADPSKLIHTFSPG 213 (285) T ss_pred HHHHHHHHCCCEEEECCCCCCCCCCCCCCCC--CCCCHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEEEECCC T ss_conf 9999998579244543788647763246888--76517999999874267799985256133599986753344210467 Q ss_pred HHH---HHH-HCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 999---996-2697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 83 EAR---TLI-KDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 83 e~~---~l~-~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) +-. .+- .+..-+|||..|+.+|..|+++|++ +.|.+|..|.+ +..++.+..+||.| T Consensus 214 ~~~v~~tfG~~S~vGtGGM~tKv~AA~~A~~~Gv~-viI~~g~~p~~-I~~iv~g~kvgt~f 273 (285) T KOG1154 214 DPQVSTTFGSKSKVGTGGMETKVKAAVNALNAGVS-VIITNGDAPEN-ITDIVEGKKVGTFF 273 (285) T ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCE-EEEECCCCHHH-HHHHHHHHHHHHHH T ss_conf 78876446666765767611558888777607914-99947986677-89886451120143 No 44 >TIGR02076 pyrH_arch uridylate kinase, putative; InterPro: IPR011818 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the archaeal and spirochete proteins.; GO: 0009041 uridylate kinase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=99.32 E-value=1e-11 Score=85.32 Aligned_cols=119 Identities=24% Similarity=0.329 Sum_probs=93.8 Q ss_pred HHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCCHHHHHHHHH Q ss_conf 9999779849985743178732006614799999998625113343226763353455------5556338889999962 Q 537021.9.peg.3 16 DLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLSINEARTLIK 89 (142) Q Consensus 16 ~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~~~e~~~l~~ 89 (142) .++...+.|||..- ..-|++ -|.+|+.||..++|+.||..|+||||||.|+ +.+++++++|+.++.. T Consensus 103 ~~~~~~~~ivVmGG---t~PGht----TDAVAA~lAE~~~ad~L~~~TnVDGVYd~DP~~~~~A~~~~~l~~~eL~~i~~ 175 (232) T TIGR02076 103 LEAMSLGKIVVMGG---TEPGHT----TDAVAALLAEFLEADLLINATNVDGVYDKDPNKYPDAKKFDKLTPEELVEIVR 175 (232) T ss_pred HHHHHCCCEEEECC---CCCCCH----HHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHC T ss_conf 99982698798668---789852----89999999766268726998226852177778887840002589889999860 Q ss_pred -CCCCCCCHHHHHH-HHHHHHHCCCCEEEEEECCC--CCHHHHHH-HCCCCCCCEEC Q ss_conf -6976786789999-99999984998899952578--76899999-67998663454 Q 537021.9.peg.3 90 -DGTISGGMIPKIE-TSIKAIENGVKSVAILDGKK--PHSILMEI-FTKNGSGTLLI 141 (142) Q Consensus 90 -~~~~~~gm~~Ki~-~a~~a~~~Gv~~v~i~~~~~--~~~ll~el-ft~~g~GT~i~ 141 (142) ++.+..|-..-++ .|.+.+++.-=++++++|++ |+.|..-+ ..++-.||.|. T Consensus 176 G~~~~~AG~~~~~D~lA~kii~Rs~i~t~V~~~~dfr~~~l~~~~n~~Ge~~GT~I~ 232 (232) T TIGR02076 176 GSSSLKAGSSEVVDPLAAKIIERSKIRTIVVNGRDFRPENLEKIVNLKGEIVGTIIE 232 (232) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCEEC T ss_conf 441127899752448899999747954899648545888999998725886876209 No 45 >PRK07431 aspartate kinase; Provisional Probab=99.24 E-value=5.8e-11 Score=81.20 Aligned_cols=120 Identities=23% Similarity=0.334 Sum_probs=90.0 Q ss_pred CEEEECHHHHHHHHHCCCEEEECCC-CCCCCC--CEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC----- Q ss_conf 0135569999999977984998574-317873--20066---147999999986251133432267633534555----- Q 537021.9.peg.3 6 NVIKVNRTILDLLIKSGIIPVIAPI-APGYDG--ATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ----- 74 (142) Q Consensus 6 ~v~~vd~~~i~~lL~~g~iPVi~pi-g~~~~G--~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~----- 74 (142) +|..++++.+++.|+.+.+||++-+ |.+.+| ++..+ -+|..|+.+|.+|+|+.+-++|||+|+|..|++ T Consensus 112 ~i~~i~~~~i~~~l~~~~v~Vv~GFqGi~~~~~~eiTTLGRGGSD~TA~~lAaaL~Ad~~eI~TDVdGV~TaDPriv~~A 191 (594) T PRK07431 112 RILEIKTERIQRHLDAGKVVVVAGFQGISLSSGLEITTLGRGGSDTSAVALAAALGADACEIYTDVPGVLTTDPRLVPEA 191 (594) T ss_pred ECCCCCHHHHHHHHHCCCCEEEEEEECCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCEECCCCCCCCCC T ss_conf 30506899999998629917996000125799861565068872679999998648989999635672364699868886 Q ss_pred -CCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf -5563388899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 75 -LISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 75 -~i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) .|++|+++|+.+|...|. .=+- -++..-|.+.++| +||-|- |+ +..||+|+ T Consensus 192 r~i~~ISY~Em~ELa~~GA--kVLh--prsVe~A~k~~IP-I~Vrst----------f~-~~~GT~I~ 243 (594) T PRK07431 192 QLMDEISCDEMLELASLGA--SVLH--PRAVEIARNYGVP-LVVRSS----------WS-DAPGTLLT 243 (594) T ss_pred CEECCCCHHHHHHHHHCCC--CCCC--HHHHHHHHHCCCC-EEEECC----------CC-CCCCCEEE T ss_conf 2857127999999997587--7456--7789999973996-599667----------88-99985784 No 46 >cd04260 AAK_AKi-DapG-BS AAK_AKi-DapG-BS: Amino Acid Kinase Superfamily (AAK), AKi-DapG; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional class enzyme found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species. In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The B. subtilis AKI is tetrameric consisting of two alpha and two bet Probab=99.24 E-value=1.3e-10 Score=79.20 Aligned_cols=119 Identities=25% Similarity=0.367 Sum_probs=91.2 Q ss_pred CEEEECHHHHHHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CC Q ss_conf 0135569999999977984998574-31787320066---14799999998625113343226763353455------55 Q 537021.9.peg.3 6 NVIKVNRTILDLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QL 75 (142) Q Consensus 6 ~v~~vd~~~i~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~ 75 (142) ++..++.+.+...|+.+.+||++-+ |.+.+|++..+ -+|..|+.+|..|+|+.+.+.|||+|+|..|+ ++ T Consensus 115 ~~~~~~~~~~~~~l~~~~v~Vv~GFig~~~~G~~ttLGRgGSD~TAa~lA~~L~A~~~~IwtDV~Gv~taDPr~v~~A~~ 194 (244) T cd04260 115 KIIKVNPKKILSALKEGDVVVVAGFQGVTEDGEVTTLGRGGSDTTAAALGAALNAEYVEIYTDVDGIMTADPRVVPNARI 194 (244) T ss_pred EEEEECHHHHHHHHHHCCEEEECCCCCCCCCCCEEECCCCCHHHHHHHHHHHCCCCEEEEEECCCEEEECCCCCCCCCEE T ss_conf 67651589999998634635740310246899667718984689999999960998899964687127179986888768 Q ss_pred CCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 56338889999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 76 ISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 76 i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) |++++.+|+.+|..-|. .=+-| ++..-+.++++| ++|-|-.. ...||+| T Consensus 195 i~~lsy~EA~ELa~~GA--kVlHp--~ti~pa~~~~Ip-i~Irnt~~-----------~~~GT~I 243 (244) T cd04260 195 LDVVSYNEVFQMAHQGA--KVIHP--RAVEIAMQANIP-IRIRSTMS-----------ENPGTLI 243 (244) T ss_pred CCCCCHHHHHHHHHCCC--CCCCH--HHHHHHHHCCCC-EEEEECCC-----------CCCCCEE T ss_conf 77318999999997797--51689--999999986981-89970798-----------8798774 No 47 >PRK08841 aspartate kinase; Validated Probab=99.23 E-value=8.9e-11 Score=80.18 Aligned_cols=120 Identities=21% Similarity=0.335 Sum_probs=92.8 Q ss_pred CEEEECHHHHHHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------C Q ss_conf 0135569999999977984998574-31787320066---147999999986251133432267633534555------5 Q 537021.9.peg.3 6 NVIKVNRTILDLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------L 75 (142) Q Consensus 6 ~v~~vd~~~i~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~ 75 (142) ++..++++.+..+++++.+||++-+ |.+++|++..+ -+|..|+.+|.+|+|+.+-+.|||+|+|..|++ + T Consensus 112 ~i~~~~~~~~~~~l~~~~v~Vv~GF~g~~~~G~~ttLGRGGSD~sA~~lAa~l~A~~~eIwTDVdGv~taDPr~V~~A~~ 191 (392) T PRK08841 112 TIKHIDTSTITELLEQDQIVIVAGFQGRNENGDITTLGRGGSDTTAVALAGALNADECQIFTDVDGVYTCDPRVVKNARK 191 (392) T ss_pred EEEHHCHHHHHHHHHCCCEEEECCEEECCCCCCEEECCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCE T ss_conf 52020338799887359869941820107999875627997799999999980997899985677636379868768728 Q ss_pred CCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 563388899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 76 ISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 76 i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +++++++|+.+|...|.- =+-| ++..-+.++++| ++|-|- |+ ++.||+|. T Consensus 192 l~~isy~Ea~ELa~~GAk--VlHp--~tv~pa~~~~Ip-i~v~nt----------f~-~~~GT~I~ 241 (392) T PRK08841 192 LDVIDFPSMEAMARKGAK--VLHL--PSVQHAWKHSVP-LRVLSS----------FE-DGEGTLIK 241 (392) T ss_pred ECCCCHHHHHHHHHCCCC--CCCH--HHHHHHHHCCCE-EEEEEC----------CC-CCCCEEEE T ss_conf 264789999999976996--3688--899999875975-999836----------87-99982896 No 48 >COG0527 LysC Aspartokinases [Amino acid transport and metabolism] Probab=99.23 E-value=7.2e-11 Score=80.69 Aligned_cols=120 Identities=27% Similarity=0.363 Sum_probs=91.2 Q ss_pred EEEECHHH-HHHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CC Q ss_conf 13556999-9999977984998574-31787320066---14799999998625113343226763353455------55 Q 537021.9.peg.3 7 VIKVNRTI-LDLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QL 75 (142) Q Consensus 7 v~~vd~~~-i~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~ 75 (142) +...+.+. +..+++.+.+||++-+ |.+++|+...+ -+|..|+.||.+|+|+.+.+.|||+|||..|+ ++ T Consensus 158 i~~~~~~~~l~~~~~~~~v~Vv~GF~G~~~~G~~tTLGRGGSD~SA~~laa~l~Ad~~~I~TDVdGI~TaDPRiVp~Ar~ 237 (447) T COG0527 158 ILDEDSERRLLRLLEEGKVPVVAGFQGINEDGETTTLGRGGSDYSAAALAAALGADEVEIWTDVDGVYTADPRIVPDARL 237 (447) T ss_pred CCHHHHHHHHHHHHCCCCEEEECCCEEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCEE T ss_conf 10355666678874469779951866516999889847984788999999971998899998788775689977886457 Q ss_pred CCCCCHHHHHHHHHCCCCCCCHHHH-HHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 5633888999996269767867899-999999998499889995257876899999679986634549 Q 537021.9.peg.3 76 ISKLSINEARTLIKDGTISGGMIPK-IETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 76 i~~i~~~e~~~l~~~~~~~~gm~~K-i~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) |++++.+|+.+|...| ++.- -++..-+.+.++| ++|-|-..| +..||+|.+ T Consensus 238 i~~isyeEa~ELA~~G-----AkVLHprav~pa~~~~Ip-~~i~~t~~p----------~~~GTlI~~ 289 (447) T COG0527 238 LPEISYEEALELAYLG-----AKVLHPRAVEPAMRSGIP-LRIKNTFNP----------DAPGTLITA 289 (447) T ss_pred CCCCCHHHHHHHHHCC-----CHHCCHHHHHHHHHCCCC-EEEEECCCC----------CCCCEEEEC T ss_conf 6846999999999778-----521499888899876991-899716999----------999539956 No 49 >cd04255 AAK_UMPK-MosAB AAK_UMPK-MosAB: This CD includes the alpha and beta subunits of the Mo storage protein (MosA and MosB) which are related to uridine monophosphate kinase (UMPK) enzymes that catalyze the phosphorylation of UMP by ATP, yielding UDP, and playing a key role in pyrimidine nucleotide biosynthesis. The Mo storage protein from the nitrogen-fixing bacterium, Azotobacter vinelandii, is characterized as an alpha4-beta4 octamer containing a polynuclear molybdenum-oxide cluster which is ATP-dependent to bind Mo and pH-dependent to release Mo. These and related bacterial sequences in this CD are members of the Amino Acid Kinase Superfamily (AAK). Probab=99.22 E-value=1.7e-10 Score=78.59 Aligned_cols=129 Identities=22% Similarity=0.272 Sum_probs=93.4 Q ss_pred EEEECHHHHHHHHHCCCEEEEC---CCCC----CCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECC------C Q ss_conf 1355699999999779849985---7431----7873200661479999999862511334322676335345------5 Q 537021.9.peg.3 7 VIKVNRTILDLLIKSGIIPVIA---PIAP----GYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKN------R 73 (142) Q Consensus 7 v~~vd~~~i~~lL~~g~iPVi~---pig~----~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~------~ 73 (142) |..-+..-+-.+|..+..||.+ |-++ .+.|..=.-..|.-|..+|..++|+-++|.+|++|+|++| . T Consensus 120 i~~~~~~~l~~~l~~~~~~v~~g~PPY~~~e~p~~~g~IPphRTDtGAfLlAe~iGa~~li~vkdVDGvYtaDPk~~~~A 199 (262) T cd04255 120 VGHGDLLQLPTFLKAGRAPVISGMPPYGLWEHPAEEGRIPPHRTDVGAFLLAEVIGARNLIFVKDEDGLYTADPKKNKKA 199 (262) T ss_pred ECCCCHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCC T ss_conf 57877899999986587104427999754547865687899754389999999848753799965777767999999765 Q ss_pred CCCCCCCHHHHHHHH-HCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 555633888999996-26976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 74 QLISKLSINEARTLI-KDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 74 ~~i~~i~~~e~~~l~-~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +++++++.+|+.+.- ++- -|-+++-...+..+ .+.++.|+|+..|+.+..++ .+|.+||+|- T Consensus 200 ~~i~~iS~~Ell~~~L~dl----v~er~~l~ll~~a~-~i~~VqVvNgl~PGnitrAL-~GEhVGTiIr 262 (262) T cd04255 200 EFIPEISAAELLKKDLDDL----VLERPVLDLLQNAR-HVKEVQIVNGLVPGNLTRAL-RGEHVGTIIR 262 (262) T ss_pred EECCCCCHHHHHHCCCCCC----CHHHHHHHHHHHHC-CCCEEEEEECCCCCHHHHHH-CCCCCCCEEC T ss_conf 3425377999986347643----00599999998611-35517998078853799987-2686641009 No 50 >PRK08210 aspartate kinase I; Reviewed Probab=99.21 E-value=1.2e-10 Score=79.40 Aligned_cols=120 Identities=23% Similarity=0.345 Sum_probs=91.6 Q ss_pred CEEEECHHHHHHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------C Q ss_conf 0135569999999977984998574-31787320066---147999999986251133432267633534555------5 Q 537021.9.peg.3 6 NVIKVNRTILDLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------L 75 (142) Q Consensus 6 ~v~~vd~~~i~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~ 75 (142) ++.+++.+.+...++.+.+||++-+ |.+++|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|++ + T Consensus 117 ~~~~~~~~~~~~~l~~~~v~vv~GF~g~~~~g~~ttLGRGGSD~TA~~la~~l~A~~~~iwtDV~Gv~taDPr~v~~a~~ 196 (405) T PRK08210 117 KIIEVNPDRILEALEEGDVVVVAGFQGATENGDITTLGRGGSDTTAAALGVALKAEYIDIYTDVDGIMTADPRIVEDARL 196 (405) T ss_pred EEECCCHHHHHHHHHCCCEEEECCCEEECCCCCEEEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCEE T ss_conf 00014579999987459859961746546899566625887327999999863998899985677656379982766511 Q ss_pred CCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 563388899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 76 ISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 76 i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) |++++.+|+.+|..-|.- =+-| ++..-+.+.++| ++|-|- |+ +..||+|. T Consensus 197 i~~lsy~Ea~ELa~~Gak--Vlhp--~tv~pa~~~~Ip-i~I~nT----------~~-~~~GT~I~ 246 (405) T PRK08210 197 LDVVSYNEVFQMAYQGAK--VIHP--RAVEIAMQANIP-LRIRST----------YS-DSKGTLIT 246 (405) T ss_pred CCCCCHHHHHHHHHCCCC--CCCH--HHHHHHHHCCCE-EEEECC----------CC-CCCCCEEE T ss_conf 162499999999977986--3688--999999973983-999736----------89-99986793 No 51 >cd04246 AAK_AK-DapG-like AAK_AK-DapG-like: Amino Acid Kinase Superfamily (AAK), AK-DapG-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional enzymes found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species, as well as, the catalytic AK domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related isoenzymes. In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. The role of the AKI isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulati Probab=99.19 E-value=1.6e-10 Score=78.74 Aligned_cols=118 Identities=24% Similarity=0.309 Sum_probs=87.4 Q ss_pred EEEECHHHHHHHHHCCCEEEECC-CCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCC Q ss_conf 13556999999997798499857-431787320066---14799999998625113343226763353455------555 Q 537021.9.peg.3 7 VIKVNRTILDLLIKSGIIPVIAP-IAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLI 76 (142) Q Consensus 7 v~~vd~~~i~~lL~~g~iPVi~p-ig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i 76 (142) +..++.+.+..+|+++.+||++- +|.+.+|+...+ -+|..|+.+|..|+|+++.+.|||+|+|+.|+ ++| T Consensus 111 i~~~~~~~~~~~l~~~~v~Vv~GF~g~~~~G~~ttLGRGGSD~tAa~iA~~l~A~~v~iwTDVdGi~taDPr~v~~A~~i 190 (239) T cd04246 111 IIDIDPKRILEALEEGDVVVVAGFQGVNEDGEITTLGRGGSDTTAVALAAALKADRCEIYTDVDGVYTADPRIVPKARKL 190 (239) T ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCEEECCCCCCCCCCCCC T ss_conf 33530556665552454479733234489985688179964799999999728888999718972573899968766572 Q ss_pred CCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 6338889999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 77 SKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 77 ~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) ++++.+|+.+|..-+. .=|-| ++..-+.++++| ++|-|- |.. ..||+| T Consensus 191 ~~lsy~EA~ela~~Ga--kVlhp--~ti~p~~~~~Ip-i~I~nt----------~~~-~~GT~I 238 (239) T cd04246 191 DVISYDEMLEMASLGA--KVLHP--RSVELAKKYNVP-LRVRSS----------FSE-NPGTLI 238 (239) T ss_pred CCCCHHHHHHHHHCCC--CCCCH--HHHHHHHHCCCC-EEEEEC----------CCC-CCCCEE T ss_conf 7527999999986898--42589--999999986995-899848----------989-898775 No 52 >cd04261 AAK_AKii-LysC-BS AAK_AKii-LysC-BS: Amino Acid Kinase Superfamily (AAK), AKii; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive aspartokinase isoenzymes. The B. subtilis 168 AKII is induced by methionine, and repressed and inhibited by lysine. Although Corynebacterium glutamicum is known to contain a single aspartokinase isoenzyme type, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In this organism and other various Gram-positive bacteria, the DAP-lysine pathway is feedback regulated by the concerted action of lysine and Probab=99.17 E-value=1.3e-10 Score=79.21 Aligned_cols=118 Identities=25% Similarity=0.371 Sum_probs=86.8 Q ss_pred EEEECHHHHHHHHHCCCEEEECC-CCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCC Q ss_conf 13556999999997798499857-431787320066---14799999998625113343226763353455------555 Q 537021.9.peg.3 7 VIKVNRTILDLLIKSGIIPVIAP-IAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLI 76 (142) Q Consensus 7 v~~vd~~~i~~lL~~g~iPVi~p-ig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i 76 (142) +..++...++.+|+.+.|||++- +|.+.+|+...+ .+|..|+.+|..|+|+.+.+.|||+|+|..|+ ++| T Consensus 111 i~~~~~~~i~~~l~~~~v~Vv~GF~g~~~~g~~ttLGRGGSD~tAai~A~~l~A~~v~iwTDVdGi~taDPr~v~~A~~i 190 (239) T cd04261 111 IIDIDPDRIRELLEEGDVVIVAGFQGINEDGDITTLGRGGSDTSAVALAAALGADRCEIYTDVDGVYTADPRIVPKARKL 190 (239) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCC T ss_conf 68899999986402455589734655478884688279946889999999669888999818983275899968876773 Q ss_pred CCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 6338889999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 77 SKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 77 ~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) ++++.+|+.+|..-+. .=+-| ++..-+.++++| ++|-|- |+.+ .||+| T Consensus 191 ~~lsy~EA~ela~~Ga--kVlHp--~ai~p~~~~~Ip-i~i~nt----------~~p~-~GT~I 238 (239) T cd04261 191 DEISYDEMLEMASLGA--KVLHP--RSVELAKKYGVP-LRVLSS----------FSEE-PGTLI 238 (239) T ss_pred CEECHHHHHHHHHCCC--CCCCH--HHHHHHHHCCCC-EEEEEC----------CCCC-CCCEE T ss_conf 7707999999996897--31589--999999986994-899708----------9897-98885 No 53 >PRK06635 aspartate kinase; Reviewed Probab=99.17 E-value=2.7e-10 Score=77.49 Aligned_cols=121 Identities=22% Similarity=0.278 Sum_probs=91.3 Q ss_pred CEEEECHHHHHHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------C Q ss_conf 0135569999999977984998574-31787320066---147999999986251133432267633534555------5 Q 537021.9.peg.3 6 NVIKVNRTILDLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------L 75 (142) Q Consensus 6 ~v~~vd~~~i~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~ 75 (142) ++..++.+.++..|+.+.+||++-+ |.+++|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|++ + T Consensus 112 ~~~~~~~~~~~~~l~~~~v~Vv~GF~g~~~~G~~ttLgRGGSD~sA~~~a~~l~A~~~~IwtDV~Gi~taDPr~V~~A~~ 191 (402) T PRK06635 112 RITDIDPDRIREALDEGKVVVVAGFQGVDEDGEITTLGRGGSDTTAVALAAALKADECEIYTDVDGVYTTDPRIVPKARK 191 (402) T ss_pred CCCHHHHHHHHHHHHCCCEEEECCCEEECCCCCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCEECCCCCCCCCCCC T ss_conf 41142278899998469779973731225789667735887218999999860775899985356305158754677721 Q ss_pred CCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 5633888999996269767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 76 ISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 76 i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) +++++.+|+.+|..-|. .=+-| ++..-+.++++| ++|-|-.. .+ .||+|.+ T Consensus 192 i~~isy~Ea~ELa~~GA--kVlHp--~ti~pa~~~~Ip-i~v~nTf~----------p~-~GT~I~~ 242 (402) T PRK06635 192 LDKISYEEMLELASLGA--KVLHP--RSVELAKKYNVP-LRVRSSFS----------EE-PGTLITE 242 (402) T ss_pred CCCCCHHHHHHHHHCCC--CCCCH--HHHHHHHHCCCE-EEEEECCC----------CC-CCEEEEC T ss_conf 57259999999997698--75678--889999974980-89985678----------98-9739953 No 54 >cd04234 AAK_AK AAK_AK: Amino Acid Kinase Superfamily (AAK), Aspartokinase (AK); this CD includes the N-terminal catalytic domain of aspartokinase (4-L-aspartate-4-phosphotransferase;). AK is the first enzyme in the biosynthetic pathway of the aspartate family of amino acids (lysine, threonine, methionine, and isoleucine) and the bacterial cell wall component, meso-diaminopimelate. It also catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. One mechanism for the regulation of this pathway is by the production of several isoenzymes of aspartokinase with different repressors and allosteric inhibitors. Pairs of ACT domains are proposed to specifically bind amino acids leading to allosteric regulation of the enzyme. In Escherichia coli, three different aspartokinase isoenzymes are regulated specifically by lysine, methionine, and threonine. AK-HSDHI (ThrA) and AK-HSDHII (MetL) are bifunctional enzymes that consist of an N-terminal AK and a C-terminal homoserine dehyd Probab=99.10 E-value=1.1e-09 Score=74.20 Aligned_cols=120 Identities=26% Similarity=0.392 Sum_probs=86.0 Q ss_pred EEEECHHHHHHHH-HCCCEEEECC-CCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CC Q ss_conf 1355699999999-7798499857-431787320066---14799999998625113343226763353455------55 Q 537021.9.peg.3 7 VIKVNRTILDLLI-KSGIIPVIAP-IAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QL 75 (142) Q Consensus 7 v~~vd~~~i~~lL-~~g~iPVi~p-ig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~ 75 (142) +.+...+.+.+.+ +.+.+||++- +|.+.+|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|+ ++ T Consensus 97 ~~~~~~~~l~~~l~~~~~v~Vv~GF~g~~~~G~~tTLGRGGSD~TA~~lA~~l~A~~~~I~TDVdGi~taDPr~v~~A~~ 176 (227) T cd04234 97 IIEISYERLKELLAEIGKVPVVTGFIGRNEDGEITTLGRGGSDYSAAALAAALGADEVEIWTDVDGIYTADPRIVPEARL 176 (227) T ss_pred EHHHCHHHHHHHHHHCCCEEEECCEEEECCCCCEEEECCCCHHHHHHHHHHHHCCHHHEEEECCCEEECCCCCCCCCCEE T ss_conf 45428999999997389679962604540588468838885678999999981860222212687047279974788624 Q ss_pred CCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 563388899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 76 ISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 76 i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) |++++.+|+.+|..-|. .-+-| ++..-+.++++| ++|-|- |+.+..||+|- T Consensus 177 i~~lsy~Ea~ela~~Ga--kVlHp--~ti~pa~~~~Ip-i~i~nt----------~~p~~~GT~Is 227 (227) T cd04234 177 IPEISYDEALELAYFGA--KVLHP--RAVEPARKANIP-IRVKNT----------FNPEAPGTLIT 227 (227) T ss_pred ECCCCHHHHHHHHHCCC--CCCCH--HHHHHHHHCCCE-EEEECC----------CCCCCCCEEEC T ss_conf 04419999999997797--41699--999999986983-999289----------99999981769 No 55 >PRK09034 aspartate kinase; Reviewed Probab=99.09 E-value=3.2e-10 Score=77.13 Aligned_cols=119 Identities=23% Similarity=0.229 Sum_probs=88.9 Q ss_pred EECHHHHHHHHHCCCEEEECC-CCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCC Q ss_conf 556999999997798499857-431787320066---14799999998625113343226763353455------55563 Q 537021.9.peg.3 9 KVNRTILDLLIKSGIIPVIAP-IAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISK 78 (142) Q Consensus 9 ~vd~~~i~~lL~~g~iPVi~p-ig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~ 78 (142) ....+.+..+++.+.+||++- +|.+.+|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|+ ++|++ T Consensus 161 ~~~~~~l~~~~~~~~v~Vv~GFig~~~~G~~tTLGRGGSD~TAa~la~~l~A~~~~IwTDVdGi~taDPr~V~~A~~i~~ 240 (450) T PRK09034 161 PESYDKLKKLRDSDEKLVIPGFFGVTPDGQIKTFSRGGSDITGAILARGVKADLYENFTDVDGIYAANPNIVKNPKPIEE 240 (450) T ss_pred HHHHHHHHHHHCCCCEEEEEEEEEECCCCCEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCCEEECCCCCCCCCCCCCC T ss_conf 74199999887179879983278846999777616883489999999971998899985467316069876787662052 Q ss_pred CCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 3888999996269767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 79 LSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 79 i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) ++.+|+.+|..-|.- -+-| ++..-+.++++| ++|-|- |+.+..||+|.+ T Consensus 241 lsy~EA~ELa~~GAk--VlHP--~ti~p~~~~~IP-i~I~nt----------~~P~~~GT~I~~ 289 (450) T PRK09034 241 ITYREMRELSYAGFS--VFHD--EALIPAYRGGIP-INIKNT----------NNPEDPGTLIVH 289 (450) T ss_pred CCHHHHHHHHHCCCC--CCCC--CCCCHHHCCCEE-EEEEEC----------CCCCCCCEEEEE T ss_conf 789999999975875--4571--314521306548-999727----------999999608962 No 56 >PRK06291 aspartate kinase; Provisional Probab=99.07 E-value=1.4e-09 Score=73.65 Aligned_cols=112 Identities=30% Similarity=0.414 Sum_probs=84.2 Q ss_pred HHHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------CCCCCCHHHH Q ss_conf 9999977984998574-31787320066---147999999986251133432267633534555------5563388899 Q 537021.9.peg.3 15 LDLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------LISKLSINEA 84 (142) Q Consensus 15 i~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~i~~i~~~e~ 84 (142) +..+++.+.+||++-+ |.+++|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|++ +|++++.+|+ T Consensus 181 ~~~~~~~~~v~vv~GFig~~~~G~~ttLGRGGSD~TAa~lA~~l~A~~~~IwTDV~Gi~taDPriV~~A~~i~~lsy~EA 260 (466) T PRK06291 181 LGPLLKEGVIPVVTGFIGETEEGIITTLGRGGSDYSAAIIGAALDADEIWIWTDVDGVMTADPRIVPEARTIPKISYIEA 260 (466) T ss_pred HHHHHCCCCEEEECCCCEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCEEECCCCCCCCCEECCCCCHHHH T ss_conf 87774048368973610488999778707996189999999957886999940567347169987877754374599999 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 85 RTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 85 ~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) .+|..-|.- -+-| ++..-+.++++| ++|-|-..| +..||+|. T Consensus 261 ~ELa~~Gak--VlHp--~ti~p~~~~~IP-i~I~nt~~P----------~~~GT~I~ 302 (466) T PRK06291 261 MELSYFGAK--VLHP--RTIEPAMEKGIP-VRVKNTFNP----------EFPGTLIT 302 (466) T ss_pred HHHHHCCCC--CCCH--HHHHHHHHCCCC-EEEEECCCC----------CCCCEEEE T ss_conf 999975876--4567--789999980993-899826898----------99972896 No 57 >PRK05925 aspartate kinase; Provisional Probab=99.07 E-value=8.7e-10 Score=74.71 Aligned_cols=111 Identities=23% Similarity=0.296 Sum_probs=83.3 Q ss_pred HHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCCHHHHH Q ss_conf 999977984998574-31787320066---14799999998625113343226763353455------555633888999 Q 537021.9.peg.3 16 DLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLSINEAR 85 (142) Q Consensus 16 ~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~~~e~~ 85 (142) ...++.+.+||++-+ |.+.+|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|+ ++|++++.+|+. T Consensus 159 ~~~~~~~~v~Vv~GF~g~~~~G~~ttLGRGGSD~sA~~la~al~A~~~~IwTDV~Gv~taDPriV~~A~~i~~lsy~Ea~ 238 (440) T PRK05925 159 ELNLQEDAIYIMQGFIGANSSGKTTVLGRGGSDFSASLIAELCKAREVRIYTDVNGIYTMDPKIIKDAQLIPELSFEEMQ 238 (440) T ss_pred HHHHCCCCEEEECCCCCCCCCCCEEEECCCCCCHHHHHHHHHCCCCEEEEECCCCCEECCCCCCCCCCCCCCCCCHHHHH T ss_conf 87522796899616300189997776169973089999999617849998316573374799868888566630899999 Q ss_pred HHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 99626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 86 TLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 86 ~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +|..-|. .-+-| ++..-+.++++| ++|-|-.. .+..||+|. T Consensus 239 ELa~~GA--kVlHp--rtv~pa~~~~Ip-i~i~nt~~----------p~~~GT~I~ 279 (440) T PRK05925 239 NLASFGA--KVLHP--PMLKPCVRAGIP-IFVTSTFD----------VTKGGTWIY 279 (440) T ss_pred HHHHCCC--CCCCH--HHHHHHHHCCCC-EEEECCCC----------CCCCCEEEE T ss_conf 9997696--40489--999999983995-89967899----------999953898 No 58 >cd04245 AAK_AKiii-YclM-BS AAK_AKiii-YclM-BS: Amino Acid Kinase Superfamily (AAK), AKiii-YclM-BS; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in Bacilli (Bacillus subtilis YclM) and Clostridia species. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. In Bacillus subtilis (BS), YclM is reported to be a single polypeptide of 50 kD. The Bacillus subtilis 168 AKIII is induced by lysine and repressed by threonine, and it is synergistically inhibited by lysine and threonine. Probab=99.05 E-value=2e-09 Score=72.71 Aligned_cols=116 Identities=24% Similarity=0.276 Sum_probs=88.6 Q ss_pred CHHHHHHHHHCCCEEEECC-CCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------CCCCCC Q ss_conf 6999999997798499857-431787320066---147999999986251133432267633534555------556338 Q 537021.9.peg.3 11 NRTILDLLIKSGIIPVIAP-IAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------LISKLS 80 (142) Q Consensus 11 d~~~i~~lL~~g~iPVi~p-ig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~i~~i~ 80 (142) ....+..+++.+.+||++- +|.+.+|++..+ -+|..|+.+|.+|+|+.+.+.|||+|||..|++ +|++++ T Consensus 163 ~~~~~~~~~~~~~v~Vv~GFig~~~~G~~ttLGRgGSDySAaiia~~l~A~~v~iwtDV~Gi~taDPr~V~~A~~i~~ls 242 (288) T cd04245 163 SYQKIKKLRDSDEKLVIPGFYGYSKNGDIKTFSRGGSDITGAILARGFQADLYENFTDVDGIYAANPRIVANPKPISEMT 242 (288) T ss_pred HHHHHHHHHHCCCEEEEEEEEEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCEEEECCCCCCCCCEECCCCC T ss_conf 69999988705983898257775389976432898389999999997589889995479803737998577986877359 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 8899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 81 INEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 81 ~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) .+|+.+|..-|. .-+-| ++..-+.+.++| ++|-|-. ..+..||+|| T Consensus 243 y~EA~eLa~~Ga--kVlHP--~ti~P~~~~~Ip-i~irnt~----------~p~~~GT~I~ 288 (288) T cd04245 243 YREMRELSYAGF--SVFHD--EALIPAIEAGIP-INIKNTN----------HPEAPGTLIV 288 (288) T ss_pred HHHHHHHHHCCC--CCCCH--HHHHHHHHCCCE-EEEECCC----------CCCCCCCEEC T ss_conf 999999996798--51588--999999985980-8993789----------9999985779 No 59 >cd04244 AAK_AK-LysC-like AAK_AK-LysC-like: Amino Acid Kinase Superfamily (AAK), AK-LysC-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive AK isoenzyme found in higher plants. The lysine-sensitive AK isoenzyme is a monofunctional protein. It is involved in the overall regulation of the aspartate pathway and can be synergistically inhibited by S-adenosylmethionine. Also included in this CD is an uncharacterized LysC-like AK found in Euryarchaeota and some bacteria. AK catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. Probab=99.01 E-value=3.4e-09 Score=71.43 Aligned_cols=112 Identities=29% Similarity=0.418 Sum_probs=84.0 Q ss_pred HHHHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------CCCCCCHHH Q ss_conf 99999977984998574-31787320066---147999999986251133432267633534555------556338889 Q 537021.9.peg.3 14 ILDLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------LISKLSINE 83 (142) Q Consensus 14 ~i~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~i~~i~~~e 83 (142) .+...++.+.|||++-+ |.+++|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|++ .|++++.+| T Consensus 176 ~~~~~~~~~~i~Vv~GFig~~~~G~~ttLGRgGSDyTAailA~~l~A~~~~IwtDV~Gi~taDPr~v~~a~~i~~lsy~E 255 (298) T cd04244 176 RLLPMLEDGKIPVVTGFIGATEDGAITTLGRGGSDYSATIIGAALDADEIWIWKDVDGVMTADPRIVPEARTIPRLSYAE 255 (298) T ss_pred HHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCCEEECCCCCCCCCEECCCCCHHH T ss_conf 98887505928999382555898766555899478999999997288789994398832607997688875747328999 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 84 ARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 84 ~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) +.+|..-|. .-+-| ++..-+.++++| ++|-|- |+.+..||+| T Consensus 256 A~ELa~~Ga--kVlHP--~ti~p~~~~~IP-i~Iknt----------~~p~~~GT~I 297 (298) T cd04244 256 AMELAYFGA--KVLHP--RTVEPAMEKGIP-VRVKNT----------FNPEAPGTLI 297 (298) T ss_pred HHHHHHCCC--CCCCH--HHHHHHHHCCCC-EEEECC----------CCCCCCCEEE T ss_conf 999996797--40599--999999987996-899389----------9989998155 No 60 >cd04247 AAK_AK-Hom3 AAK_AK-Hom3: Amino Acid Kinase Superfamily (AAK), AK-Hom3; this CD includes the N-terminal catalytic domain of the aspartokinase HOM3, a monofunctional class enzyme found in Saccharomyces cerevisiae and other related AK domains. Aspartokinase, the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP, and in fungi, is responsible for the production of threonine, isoleucine and methionine. S. cerevisiae has a single aspartokinase isoenzyme type, which is regulated by feedback, allosteric inhibition by L-threonine. Recent studies show that the allosteric transition triggered by binding of threonine to AK involves a large change in the conformation of the native hexameric enzyme that is converted to an inactive one of different shape and substantially smaller hydrodynamic size. Probab=99.00 E-value=3.4e-09 Score=71.41 Aligned_cols=107 Identities=23% Similarity=0.311 Sum_probs=81.7 Q ss_pred CCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCCHHHHHHHHHC Q ss_conf 7984998574-31787320066---14799999998625113343226763353455------55563388899999626 Q 537021.9.peg.3 21 SGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLSINEARTLIKD 90 (142) Q Consensus 21 ~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~~~e~~~l~~~ 90 (142) .+.|||++-+ |.+++|++..+ -+|..|+.+|.+|+|+++.+.|||+|+|..|+ ++|++++.+|+.+|..- T Consensus 189 ~~~v~Vv~GFig~~~~G~~tTLGRgGSDySAaila~~l~A~~v~IwTDV~Gi~taDPr~V~~A~~i~~lsy~EA~eLa~~ 268 (306) T cd04247 189 ENRVPVVTGFFGNVPGGLLSQIGRGYTDLCAALCAVGLNADELQIWKEVDGIFTADPRKVPTARLLPSITPEEAAELTYY 268 (306) T ss_pred CCCEEEEECCEEECCCCCEEECCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCEECCEECHHHHHHHHHC T ss_conf 69748960412546999877606897079999999976977899997787545479875789836572189999999967 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 9767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 91 GTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 91 ~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) |. .-+-| ++..-+.+.++| ++|-|- |..+..||+|.| T Consensus 269 Ga--kVlHP--~ti~P~~~~~IP-i~Irnt----------~~P~~~GT~I~~ 305 (306) T cd04247 269 GS--EVIHP--FTMEQVIKARIP-IRIKNV----------ENPRGEGTVIYP 305 (306) T ss_pred CC--CCCCH--HHHHHHHHCCCC-EEECCC----------CCCCCCCCEECC T ss_conf 97--31598--999999987994-898189----------998999736568 No 61 >cd04243 AAK_AK-HSDH-like AAK_AK-HSDH-like: Amino Acid Kinase Superfamily (AAK), AK-HSDH-like; this family includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK- homoserine dehydrogenase (HSDH). These aspartokinases are found in such bacteria as E. coli (AKI-HSDHI, ThrA and AKII-HSDHII, MetL) and in higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK- Probab=98.99 E-value=4.3e-09 Score=70.85 Aligned_cols=109 Identities=29% Similarity=0.364 Sum_probs=81.9 Q ss_pred HHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCCHHHHHH Q ss_conf 99977984998574-31787320066---14799999998625113343226763353455------5556338889999 Q 537021.9.peg.3 17 LLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLSINEART 86 (142) Q Consensus 17 ~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~~~e~~~ 86 (142) .+.+.+.|||++-+ |.+.+|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|+ +.|++++.+|+.+ T Consensus 174 ~~~~~~~v~Vv~GFig~~~~G~~ttLGRgGSD~sAaila~~l~A~~v~IwtDV~Gi~taDPr~v~~a~~i~~lsy~EA~E 253 (293) T cd04243 174 LLAEHGKVVVTQGFIASNEDGETTTLGRGGSDYSAALLAALLDAEEVEIWTDVDGVYTADPRKVPDARLLKELSYDEAME 253 (293) T ss_pred HHHCCCCEEEECCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCCEEEEEECCCCEEECCCCCCCCCEECCCCCHHHHHH T ss_conf 86338959997496612799977872798289999999995699889998378840708998788875648108999999 Q ss_pred HHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 87 LIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 87 l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) |..-|. .-+-| ++..-+.++++| ++|-|-.. .+..||+| T Consensus 254 La~~Ga--kVlHp--~ti~p~~~~~Ip-i~i~nt~~----------p~~~GT~I 292 (293) T cd04243 254 LAYFGA--KVLHP--RTIQPAIRKNIP-IFIKNTFN----------PEAPGTLI 292 (293) T ss_pred HHHCCC--CCCCH--HHHHHHHHCCCC-EEEECCCC----------CCCCCEEE T ss_conf 997796--20599--999999986994-89937999----------89998265 No 62 >cd04257 AAK_AK-HSDH AAK_AK-HSDH: Amino Acid Kinase Superfamily (AAK), AK-HSDH; this CD includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK - homoserine dehydrogenase (HSDH). These aspartokinases are found in bacteria (E. coli AKI-HSDHI, ThrA and E. coli AKII-HSDHII, MetL) and higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK-HSDH is an alanine-act Probab=98.99 E-value=4.1e-09 Score=71.02 Aligned_cols=110 Identities=25% Similarity=0.345 Sum_probs=81.6 Q ss_pred HHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCCHHHHH Q ss_conf 999977984998574-31787320066---14799999998625113343226763353455------555633888999 Q 537021.9.peg.3 16 DLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLSINEAR 85 (142) Q Consensus 16 ~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~~~e~~ 85 (142) +.+.+.+.+||++-+ |.+.+|++..+ -+|..|+.+|.+|+|+++.+.|||+|+|..|+ ++|++++.+|+. T Consensus 174 ~~~~~~~~v~Vv~GFig~~~~G~~ttLGRgGSD~TAa~ia~~L~A~~v~IwtDV~Gi~taDPr~v~~a~~i~~lsy~EA~ 253 (294) T cd04257 174 AWFSSNGKVIVVTGFIASNPQGETTTLGRNGSDYSAAILAALLDADQVEIWTDVDGVYSADPRKVKDARLLPSLSYQEAM 253 (294) T ss_pred HHHHCCCCEEEECCCCCCCCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCEECCCCCHHHHH T ss_conf 98641797899769525678897513489948899999999729978999428885052799768898671604899999 Q ss_pred HHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 9962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 86 TLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 86 ~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) +|..-|. .-+-| ++..-+.++++| ++|-|- |..+..||+| T Consensus 254 eLa~~Ga--kVlHp--~ti~p~~~~~Ip-i~vrnt----------~~p~~~GT~I 293 (294) T cd04257 254 ELSYFGA--KVLHP--KTIQPVAKKNIP-ILIKNT----------FNPEAPGTLI 293 (294) T ss_pred HHHHCCC--CCCCH--HHHHHHHHCCCC-EEEECC----------CCCCCCCEEE T ss_conf 9996798--31599--999999987993-899389----------9989998151 No 63 >PRK12443 uridylate kinase; Reviewed Probab=98.98 E-value=1.1e-08 Score=68.59 Aligned_cols=119 Identities=17% Similarity=0.209 Sum_probs=84.1 Q ss_pred EECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCC-CCCEEECCC------CCCCCCCH Q ss_conf 55699999999779849985743178732006614799999998625113343226-763353455------55563388 Q 537021.9.peg.3 9 KVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTD-VPGVLDKNR------QLISKLSI 81 (142) Q Consensus 9 ~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltd-v~Gv~d~~~------~~i~~i~~ 81 (142) ..+..--.+.|++|.|+|.+ ..+|..|-. .|.+|+..|..++|+.++.-|. ||||||.|+ +..++++. T Consensus 112 ~y~~~rA~~~LekG~VVIfa----gGTGnP~fT-TDtaAaLrA~Ei~Ad~lL~at~~VDGVYd~DP~k~~dA~k~~~ls~ 186 (247) T PRK12443 112 PYIRLRAVHHLDNGYIVIFG----GGNGQPFVT-TDYPSVQRAIEMNSDAILVAKQGVDGVFTSDPKHNKSAKMYRKLNY 186 (247) T ss_pred CCCHHHHHHHHHCCCEEEEE----CCCCCCCCC-CCHHHHHHHHHHCCHHHHCCCCCCCCCCCCCCCCCCCCEEEEECCH T ss_conf 77789999997369789997----887887520-4399999998828613321056657764689889997544433379 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHH-HCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 899999626976786789999999999-8499889995257876899999679986634549 Q 537021.9.peg.3 82 NEARTLIKDGTISGGMIPKIETSIKAI-ENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 82 ~e~~~l~~~~~~~~gm~~Ki~~a~~a~-~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) +|+-+ ++-.+ | .. .|+..| ++++| ++++|...|++|...+ .++.+||+|-+ T Consensus 187 ~e~l~--~~L~v---M--D~-tA~~lc~~~~ip-i~Vfn~~~~g~l~ka~-~Ge~iGT~V~~ 238 (247) T PRK12443 187 NDVVR--QNIQV---M--DQ-AALLLARDYNLP-AHVFNFDEPGVMRRIC-LGEHVGTLIND 238 (247) T ss_pred HHHHH--CCCCC---H--HH-HHHHHHHHCCCC-EEEECCCCCCHHHHHH-CCCCCCEEECC T ss_conf 99998--69944---2--99-999999973998-8998289986399996-79986328818 No 64 >PRK08373 aspartate kinase; Validated Probab=98.96 E-value=4.1e-09 Score=70.98 Aligned_cols=112 Identities=21% Similarity=0.274 Sum_probs=80.7 Q ss_pred HHHHHHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------CCCCCCH Q ss_conf 9999999977984998574-31787320066---147999999986251133432267633534555------5563388 Q 537021.9.peg.3 12 RTILDLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------LISKLSI 81 (142) Q Consensus 12 ~~~i~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~i~~i~~ 81 (142) .+.+...++.+.|||++-+ | +.+|....+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|++ +|++++. T Consensus 155 ~~~l~~~l~~~~i~Vv~GFiG-~~~G~~TTLGRGGSDyTAailA~~L~A~~v~IwTDVdGI~TADPriV~~Ar~I~~lSY 233 (359) T PRK08373 155 AKVVYEALERGKIPVVPGFYG-NLNGYRVTLGRGGSDYSAVALGVLLNAKAVLIMSDVEGIYTADPKLVPSARLIPYVSY 233 (359) T ss_pred HHHHHHHHCCCCEEEECCEEE-ECCCCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCEECCCCCCCCCEECCCCCH T ss_conf 999998732795899367467-0499264306997168999999974997899975898516089987898748882189 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 82 NEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 82 ~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +|+.+|..-|. +.-=-.+++-++..+| +++-|- | +...||+|. T Consensus 234 ~EA~ELA~~GA-----KVLHPrti~Pakk~IP-i~v~nT----------~-~~~~GTLIt 276 (359) T PRK08373 234 DEIKIAAKLGM-----KALHWKAIDLVKGYIP-IIFGRT----------R-DWRMGTLIS 276 (359) T ss_pred HHHHHHHHCCC-----CCCCHHHHHHHHHCCC-EEEECC----------C-CCCCCEEEE T ss_conf 99999997797-----4168888999980995-799267----------9-788964785 No 65 >PRK09466 metL bifunctional aspartate kinase II/homoserine dehydrogenase II; Provisional Probab=98.96 E-value=5.7e-09 Score=70.21 Aligned_cols=115 Identities=20% Similarity=0.246 Sum_probs=83.8 Q ss_pred HHHHHHHH--CCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------CCCCCC Q ss_conf 99999997--7984998574-31787320066---147999999986251133432267633534555------556338 Q 537021.9.peg.3 13 TILDLLIK--SGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------LISKLS 80 (142) Q Consensus 13 ~~i~~lL~--~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~i~~i~ 80 (142) +.++.++. .+.+||++-+ |.+.+|++..+ -+|..|+.+|.+|+|+++-+.|||+|+|..|++ +|++++ T Consensus 174 ~~l~~~l~~~~~~i~VvtGFig~~~~G~~tTLGRGGSD~SAailAaaL~A~~~eIwTDVdGV~TADPriV~~A~~I~~iS 253 (810) T PRK09466 174 PLLQQLLAQHPGKRLVVTGFISRNEAGETVLLGRNGSDYSATLIGALADVERVTIWSDVAGVYSADPRKVKDACLLPLLR 253 (810) T ss_pred HHHHHHHHHCCCCEEEECCEEEECCCCCEEEECCCCCCHHHHHHHHHHCCCEEEEEECCCCEECCCCCCCCCCEEECEEC T ss_conf 99999997368976998266774699977870588730699999998099779997356731536997675651112345 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 88999996269767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 81 INEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 81 ~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) .+|+.+|..-|. .=+-| ++..-+.+.++| ++|-|- |+.+..||+|.+ T Consensus 254 Y~EA~ELA~fGA--kVLHP--rti~Pa~~~~IP-I~IknT----------f~Pe~~GTlI~~ 300 (810) T PRK09466 254 LDEASELARLAA--PVLHA--RTLQPVSGSDID-LQLRCS----------YTPEQGSTRIER 300 (810) T ss_pred HHHHHHHHHCCC--CCCCH--HHHHHHHHCCCE-EEEEEC----------CCCCCCCEEEEE T ss_conf 999999984699--86576--789999864974-999954----------578999659975 No 66 >TIGR02075 pyrH_bact uridylate kinase; InterPro: IPR015963 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the bacterial proteins.. Probab=98.95 E-value=6.4e-09 Score=69.92 Aligned_cols=115 Identities=22% Similarity=0.280 Sum_probs=90.2 Q ss_pred HHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCC-CCCEEECCCC------CCCCCCHHHH Q ss_conf 99999999779849985743178732006614799999998625113343226-7633534555------5563388899 Q 537021.9.peg.3 12 RTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTD-VPGVLDKNRQ------LISKLSINEA 84 (142) Q Consensus 12 ~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltd-v~Gv~d~~~~------~i~~i~~~e~ 84 (142) ..-..+.|++|.|+|.+ ..+|.+|- .-|.+|+.=|..++||-++.-|+ |||||+.|++ +.+++|.+++ T Consensus 114 ~~~a~~~le~g~vVIF~----gGtGnPfF-TTDtaA~LRAiEi~aD~~Lkgt~GVDGVY~~DPkknk~A~~y~~itY~~~ 188 (236) T TIGR02075 114 RRKAIKHLEKGKVVIFS----GGTGNPFF-TTDTAAALRAIEINADVILKGTNGVDGVYTADPKKNKDAKKYDTITYNEV 188 (236) T ss_pred HHHHHHHHHCCCEEEEE----CCCCCCCC-HHHHHHHHHHHHHCCEEEEEECCCCCCCCCCCCCCCCCCEECCCCCHHHH T ss_conf 78999985359789995----58986963-21158887666431347998104888023587885987523166798999 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHH-HCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 999626976786789999999999-849988999525787689999967998663454 Q 537021.9.peg.3 85 RTLIKDGTISGGMIPKIETSIKAI-ENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 85 ~~l~~~~~~~~gm~~Ki~~a~~a~-~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) -+ ++=.+.+ .+|+-.| +|..| ++++|...|++|...+ ++++.||+|- T Consensus 189 L~--~~L~VMD------~TA~~La~dnnlp-i~VFnI~~~g~l~~vi-~g~~~gTlv~ 236 (236) T TIGR02075 189 LK--KNLKVMD------LTAFSLAKDNNLP-IVVFNIDEPGALKKVI-LGKGIGTLVS 236 (236) T ss_pred HH--CCCCHHH------HHHHHHHHHCCCC-EEEECCCCCCHHHHHH-CCCCCEEEEC T ss_conf 86--1471135------8999999877897-5996666530164532-2784457528 No 67 >cd04258 AAK_AKiii-LysC-EC AAK_AKiii-LysC-EC: Amino Acid Kinase Superfamily (AAK), AKiii-LysC-EC: this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive aspartokinase isoenzyme AKIII. AKIII is a monofunctional class enzyme (LysC) found in some bacteria such as E. coli. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. In E. coli, LysC is reported to be a homodimer of 50 kD subunits. Probab=98.94 E-value=7.4e-09 Score=69.56 Aligned_cols=108 Identities=25% Similarity=0.333 Sum_probs=81.2 Q ss_pred HHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------CCCCCCHHHHHHH Q ss_conf 9977984998574-31787320066---147999999986251133432267633534555------5563388899999 Q 537021.9.peg.3 18 LIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------LISKLSINEARTL 87 (142) Q Consensus 18 lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~i~~i~~~e~~~l 87 (142) .+..+.+||++-+ |.+.+|++..+ -+|..|+.+|..|+|+.+.+.|||+|+|..|++ +|++++.+|+.+| T Consensus 174 ~~~~~~i~Vv~GFig~~~~G~~ttLGRGGSDyTA~~la~~l~A~~v~iwtDV~Gi~taDPr~V~~A~~i~~lsy~EA~eL 253 (292) T cd04258 174 PLLAGTVVVTQGFIGSTEKGRTTTLGRGGSDYSAALLAEALHAEELQIWTDVAGIYTTDPRICPAARAIKEISFAEAAEM 253 (292) T ss_pred HHCCCCEEEECCCCEECCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCEECCCCCHHHHHHH T ss_conf 53379899935805437999777537896168999999976998899996788787589987898767483699999999 Q ss_pred HHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 62697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 88 IKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 88 ~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) ..-|.- =+-| ++..-+.++++| ++|-|- |+.+..||+| T Consensus 254 a~~Gak--VlHp--~ti~p~~~~~Ip-i~I~nt----------~~p~~~GT~I 291 (292) T cd04258 254 ATFGAK--VLHP--ATLLPAIRKNIP-VFVGSS----------KDPEAGGTLI 291 (292) T ss_pred HHCCCC--CCCH--HHHHHHHHCCCC-EEEECC----------CCCCCCCEEE T ss_conf 977973--1699--999999986994-899378----------9989998254 No 68 >COG2054 Uncharacterized archaeal kinase related to aspartokinases, uridylate kinases [General function prediction only] Probab=98.92 E-value=2.1e-09 Score=72.56 Aligned_cols=117 Identities=21% Similarity=0.365 Sum_probs=90.3 Q ss_pred HHHHCCCEEEECCCC----CCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC--CCCCCCCHHHHHHHHHC Q ss_conf 999779849985743----178732006614799999998625113343226763353455--55563388899999626 Q 537021.9.peg.3 17 LLIKSGIIPVIAPIA----PGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR--QLISKLSINEARTLIKD 90 (142) Q Consensus 17 ~lL~~g~iPVi~pig----~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~--~~i~~i~~~e~~~l~~~ 90 (142) ...+.+..||+=|-. .|+--+++-+.+|.++..+|..+++.++++.||++|+|++++ +++++|+.++++. . T Consensus 88 ~~i~~~~~aVLLPyrlLr~~DplpHSW~VTSDsis~~Ia~~~~~~~vv~aTDVdGI~~~~~~~kLv~eI~A~dl~~---~ 164 (212) T COG2054 88 DGIKPDAKAVLLPYRLLRKTDPLPHSWEVTSDSISVWIAAKAGATEVVKATDVDGIYEEDPKGKLVREIRASDLKT---G 164 (212) T ss_pred HCCCCCCCEEEEEHHHHHCCCCCCCCEEECCCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHCCC---C T ss_conf 4267566068600676502788876522223189999999729827999953786011589630566621755356---7 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 9767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 91 GTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 91 ~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) -..-++..|++. .+.+.+ +.++|++.|+.++..+.-.+-+||+|++ T Consensus 165 ~t~vD~~~P~Ll-----~k~~m~-~~Vvng~~pervi~~lrGk~~v~T~Ivg 210 (212) T COG2054 165 ETSVDPYLPKLL-----VKYKMN-CRVVNGKEPERVILALRGKEVVGTLIVG 210 (212) T ss_pred CCCCCCHHHHHH-----HHCCCC-EEEECCCCHHHHHHHHHCCCCCEEEEEC T ss_conf 531560406899-----971871-5998787889999987165654478757 No 69 >COG0528 PyrH Uridylate kinase [Nucleotide transport and metabolism] Probab=98.92 E-value=2.7e-08 Score=66.49 Aligned_cols=119 Identities=22% Similarity=0.294 Sum_probs=85.9 Q ss_pred EECHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCC-CCCEEECCC------CCCCCCCH Q ss_conf 55699999999779849985743178732006614799999998625113343226-763353455------55563388 Q 537021.9.peg.3 9 KVNRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTD-VPGVLDKNR------QLISKLSI 81 (142) Q Consensus 9 ~vd~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltd-v~Gv~d~~~------~~i~~i~~ 81 (142) ..+.+-..+.|+.|.|+|.+ ..+|..+ -.-|.+|+..|..++|+-++..|+ |||||+.++ +..++++. T Consensus 112 ~~~~~~A~~~l~~grVvIf~----gGtg~P~-fTTDt~AALrA~ei~ad~ll~atn~VDGVY~~DPkk~pdA~~~~~Lty 186 (238) T COG0528 112 PYSRREAIRHLEKGRVVIFG----GGTGNPG-FTTDTAAALRAEEIEADVLLKATNKVDGVYDADPKKDPDAKKYDTLTY 186 (238) T ss_pred CCCHHHHHHHHHCCCEEEEE----CCCCCCC-CCHHHHHHHHHHHHCCCEEEEECCCCCCEECCCCCCCCCCEECCCCCH T ss_conf 66799999999749989991----8889998-734799999999838868998406778036899988988223223799 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHH-HCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 899999626976786789999999999-8499889995257876899999679986634549 Q 537021.9.peg.3 82 NEARTLIKDGTISGGMIPKIETSIKAI-ENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 82 ~e~~~l~~~~~~~~gm~~Ki~~a~~a~-~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) .|+.+.. ..+-+ . .|+..| +++++ ++++|++.+++| ...+.++..||.|.| T Consensus 187 ~e~l~~~--l~vmD-----~-tA~~l~~~~~i~-i~Vfn~~~~~~l-~~~~~ge~~GT~V~~ 238 (238) T COG0528 187 DEVLKIG--LKVMD-----P-TAFSLARDNGIP-IIVFNINKPGNL-KRALKGEEVGTIVEP 238 (238) T ss_pred HHHHHHC--CEEEC-----H-HHHHHHHHCCCC-EEEEECCCCCCH-HHHHCCCCCCEEECC T ss_conf 9999724--62405-----9-999999975994-899937887659-999748967417069 No 70 >PRK09084 aspartate kinase III; Validated Probab=98.91 E-value=9.4e-09 Score=69.00 Aligned_cols=106 Identities=25% Similarity=0.329 Sum_probs=78.7 Q ss_pred CCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------CCCCCCHHHHHHHHHC Q ss_conf 7984998574-31787320066---147999999986251133432267633534555------5563388899999626 Q 537021.9.peg.3 21 SGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------LISKLSINEARTLIKD 90 (142) Q Consensus 21 ~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~i~~i~~~e~~~l~~~ 90 (142) .+.+||++-+ |.+.+|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|++ +|++++.+|+.+|..- T Consensus 173 ~~~v~Vv~GF~g~~~~G~~ttLGRGGSD~TAa~la~~l~A~~~~IwtDV~Gi~taDPr~V~~A~~i~~lsy~Ea~ELa~~ 252 (447) T PRK09084 173 EEGVVVTQGFIGSDEKGRTTTLGRGGSDYSAALLAEALNASRVEIWTDVPGIYTTDPRIVPAARRIDEISFEEAAEMATF 252 (447) T ss_pred CCCEEEEEEEEEECCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCCEEECCCCCCCCCCCCCCCCHHHHHHHHHC T ss_conf 69868950045766999777617996068999999966984999961778336069876766520363489999999977 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 91 GTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 91 ~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) |.- =+-| ++..-+.+.++| ++|-|- |+.+..||+|. T Consensus 253 GAk--VlHp--~av~pa~~~~IP-i~I~nt----------~~P~~~GT~I~ 288 (447) T PRK09084 253 GAK--VLHP--ATLLPAVRSNIP-VFVGSS----------KDPEAGGTWIT 288 (447) T ss_pred CCH--HCCH--HHHHHHHHCCCE-EEEECC----------CCCCCCCEEEE T ss_conf 710--0175--889999981975-999557----------89899970894 No 71 >PRK09436 thrA bifunctional aspartokinase I/homeserine dehydrogenase I; Provisional Probab=98.89 E-value=1.1e-08 Score=68.59 Aligned_cols=112 Identities=23% Similarity=0.250 Sum_probs=82.9 Q ss_pred HHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC------CCCCCCHHHHH Q ss_conf 999977984998574-31787320066---147999999986251133432267633534555------55633888999 Q 537021.9.peg.3 16 DLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ------LISKLSINEAR 85 (142) Q Consensus 16 ~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~------~i~~i~~~e~~ 85 (142) ..+++.+.+||++-+ |.+.+|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|++ +|++++.+|+. T Consensus 176 ~~~~~~~~i~Vv~GFig~~~~G~~tTLGRGGSDyTAailAa~L~A~~~eIwTDVdGi~TaDPr~V~~A~~I~~lSY~EA~ 255 (817) T PRK09436 176 ASFIPADHVVLMPGFTAGNEKGELVTLGRNGSDYSAAVLAACLDADCCEIWTDVDGVYTADPRVVPDARLLKSLSYQEAM 255 (817) T ss_pred HHHCCCCCEEEECCEEEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCEEECCCCCCCCCEEEEEECHHHHH T ss_conf 86345797899347112079998777069861789999999719866999846773270699877666575163699999 Q ss_pred HHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 996269767867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 86 TLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 86 ~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) +|..-|. .=+-| ++..-+.++++| ++|-|- |+.+..||+|.+ T Consensus 256 ELA~fGA--kVLHP--~Ti~P~~~~~IP-i~IkNT----------f~P~~~GT~I~~ 297 (817) T PRK09436 256 ELSYFGA--KVLHP--RTIAPIAQFQIP-CLIKNT----------FNPQAPGTLIGA 297 (817) T ss_pred HHHHCCC--CEECH--HHHHHHHHCCCE-EEEECC----------CCCCCCCEEEEC T ss_conf 9984799--50266--778899874965-999657----------788999748842 No 72 >PRK08961 bifunctional aspartate kinase/diaminopimelate decarboxylase protein; Provisional Probab=98.84 E-value=2.7e-08 Score=66.51 Aligned_cols=116 Identities=28% Similarity=0.361 Sum_probs=83.4 Q ss_pred CHHHHHHHHHC-CCEEEEC-CCCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCC Q ss_conf 69999999977-9849985-7431787320066---14799999998625113343226763353455------555633 Q 537021.9.peg.3 11 NRTILDLLIKS-GIIPVIA-PIAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKL 79 (142) Q Consensus 11 d~~~i~~lL~~-g~iPVi~-pig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i 79 (142) +...+..++.. +.++|+. =+|.+++|++..+ -+|..|+.+|.+|+|+.+-+.|||+|+|..|. ++|+++ T Consensus 177 ~~~l~~~~~~~~~~vvVtqGFiG~~~~G~tTTLGRGGSDySAallAaaL~A~~veIWTDVdGI~TaDPRiVp~Ar~I~~L 256 (865) T PRK08961 177 DAGWRARFAAQPAQVLITQGFIARHADGGTAILGRGGSDTSAAYFGALLGAARVEIWTDVPGMFSANPKEVPDARLLTRL 256 (865) T ss_pred CHHHHHHHHCCCCCEEEECCCEEECCCCCEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCEECCCC T ss_conf 57889887505982799518335169998777079855899999999749988999858987866899878886587813 Q ss_pred CHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 88899999626976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 80 SINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 80 ~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) +.+|+.+|..-|.- =+-| ++..-+.++++| ++|-|-..|+ --||+|- T Consensus 257 Sy~EA~ELA~fGAK--VLHP--~Ti~Pa~~~~IP-v~VknT~~Pe----------~~GT~I~ 303 (865) T PRK08961 257 DYYEAQEIATTGAK--VLHP--RSIKPCRDAGVP-MAILDTERPD----------LSGTSID 303 (865) T ss_pred CHHHHHHHHHCCCC--CCCH--HHHHHHHHCCCC-EEEECCCCCC----------CCCCEEE T ss_conf 89999999976875--2486--667999976996-7981789989----------9972784 No 73 >cd04259 AAK_AK-DapDC AAK_AK-DapDC: Amino Acid Kinase Superfamily (AAK), AK-DapDC; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the bifunctional enzyme AK - DAP decarboxylase (DapDC) found in some bacteria. Aspartokinase is the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. DapDC, which is the lysA gene product, catalyzes the decarboxylation of DAP to lysine. Probab=98.78 E-value=4e-08 Score=65.53 Aligned_cols=111 Identities=26% Similarity=0.322 Sum_probs=79.7 Q ss_pred HHHHH-HCCCEEEEC-CCCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCCHHH Q ss_conf 99999-779849985-7431787320066---14799999998625113343226763353455------5556338889 Q 537021.9.peg.3 15 LDLLI-KSGIIPVIA-PIAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLSINE 83 (142) Q Consensus 15 i~~lL-~~g~iPVi~-pig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~~~e 83 (142) ++..+ +...+||++ =+|.+.+|++..+ -+|..|+.+|.+|+|+.+.+.|||+|+|..|+ ++|++++.+| T Consensus 173 ~~~~l~~~~~v~v~~GFig~~~~G~~tTLGRgGSDyTAa~ia~~l~A~~v~IwtDV~Gi~taDPr~v~~A~~i~~lsy~E 252 (295) T cd04259 173 LQKRLADGAQLIITQGFIARNAHGETVLLGRGGSDTSAAYFAAKLQAARCEIWTDVPGLFTANPHEVPHARLLKRLDYDE 252 (295) T ss_pred HHHHHCCCCEEEEECCCCEECCCCCEEEECCCCCCHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCEECCCCCHHH T ss_conf 99875149859996784345699987772589730589999997499879999777767568987688884657338999 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 84 ARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 84 ~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) +.+|..-|. .=+-| ++..-+.++++| ++|-|- |+.+..||+| T Consensus 253 A~eLa~~Ga--kVlHp--~t~~p~~~~~Ip-i~Irnt----------~~p~~~GT~I 294 (295) T cd04259 253 AQEIATMGA--KVLHP--RCIPPARRANIP-MVVRST----------ERPELSGTLI 294 (295) T ss_pred HHHHHHCCC--CCCCH--HHHHHHHHCCCC-EEEECC----------CCCCCCCEEE T ss_conf 999996798--41689--999999984991-899289----------9999998266 No 74 >cd04248 AAK_AK-Ectoine AAK_AK-Ectoine: Amino Acid Kinase Superfamily (AAK), AK-Ectoine; this CD includes the N-terminal catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway found in Methylomicrobium alcaliphilum, Vibrio cholerae, and other various halotolerant or halophilic bacteria. Bacteria exposed to hyperosmotic stress accumulate organic solutes called 'compatible solutes' of which ectoine, a heterocyclic amino acid, is one. Apart from its osmotic function, ectoine also exhibits a protective effect on proteins, nucleic acids and membranes against a variety of stress factors. de novo synthesis of ectoine starts with the phosphorylation of L-aspartate and shares its first two enzymatic steps with the biosynthesis of amino acids of the aspartate family: aspartokinase and L-aspartate-semialdehyde dehydrogenase. The M. alcaliphilum and the V. cholerae aspartokinases are encoded on the ectABCask operon. Probab=98.46 E-value=1.4e-06 Score=57.07 Aligned_cols=110 Identities=14% Similarity=0.196 Sum_probs=78.6 Q ss_pred HHHHHHCCCEEEECCCCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCC--------CCCCCHHH Q ss_conf 999997798499857431787320066---1479999999862511334322676335345555--------56338889 Q 537021.9.peg.3 15 LDLLIKSGIIPVIAPIAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQL--------ISKLSINE 83 (142) Q Consensus 15 i~~lL~~g~iPVi~pig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~--------i~~i~~~e 83 (142) +..+...+.+||++-++-+.+|++..+ -+|..|+.+|..|+|+.+.+.|||+ +|..|+++ |++++.+| T Consensus 183 ~~~~~~~~~v~Vv~GF~~~~~G~~tTLGRgGSDySAailA~~l~A~~~~IWtDV~-v~tADPr~V~~~~a~~i~~lSy~E 261 (304) T cd04248 183 FRDIDPRDELPIVTGYAKCAEGLMREFDRGYSEMTFSRIAVLTGASEAIIHKEFH-LSSADPKLVGEDKARPIGRTNYDV 261 (304) T ss_pred HHHCCCCCCEEEECCCCCCCCCCEEEECCCCCHHHHHHHHHHCCCCEEEEEECCC-EECCCCCCCCCCCCEEECCCCHHH T ss_conf 9722657826885772678998356507997178999999976999899985066-616897957877663606479999 Q ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEE Q ss_conf 999962697678678999999999984998899952578768999996799866345 Q 537021.9.peg.3 84 ARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLL 140 (142) Q Consensus 84 ~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i 140 (142) +.+|..-|.- =.-| ++..-+.++++| ++|-|-.. .+..||+| T Consensus 262 A~ELayfGAk--VlHP--~Ti~P~~~~~IP-i~IkNt~~----------P~~~GT~I 303 (304) T cd04248 262 ADQLANLGME--AIHP--KAAKGLRQAGIP-LRVKNTFE----------PDHPGTLI 303 (304) T ss_pred HHHHHHCCCC--CCCH--HHHHHHHHCCCC-EEEECCCC----------CCCCCCEE T ss_conf 9999977972--1598--999999986998-89806899----------89998343 No 75 >PRK09181 aspartate kinase; Validated Probab=98.31 E-value=5.5e-06 Score=53.75 Aligned_cols=106 Identities=17% Similarity=0.227 Sum_probs=75.5 Q ss_pred HCCCEEEECCCCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCC--------CCCCCHHHHHHHH Q ss_conf 7798499857431787320066---1479999999862511334322676335345555--------5633888999996 Q 537021.9.peg.3 20 KSGIIPVIAPIAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQL--------ISKLSINEARTLI 88 (142) Q Consensus 20 ~~g~iPVi~pig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~--------i~~i~~~e~~~l~ 88 (142) ..+.+||++-++.+.+|.+..+ -+|..|+.+|..|+|+++.+.||++ +|..|+++ |++++.+|+.+|. T Consensus 195 ~~~~i~VvtGF~~~~~G~~~tlgRggSD~ta~~ia~~l~a~~~~i~~dv~-v~taDP~~v~~~~~~~i~~~sy~ea~ela 273 (476) T PRK09181 195 LSKELPIVTGYAKCKEGLMRTFDRGYSEMTFSRIAVLTGADEAIIHKEYH-LSSADPKLVGEDKVVPIGRTNYDVADQLA 273 (476) T ss_pred CCCCEEEECCCCCCCCCCEEECCCCCHHHHHHHHHHHHCCCEEEEEEECC-EECCCCCCCCCCCCEECCCCCHHHHHHHH T ss_conf 45745894781668998354358884599999999984898999996365-31279884365434473526899999998 Q ss_pred HCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEEC Q ss_conf 26976786789999999999849988999525787689999967998663454 Q 537021.9.peg.3 89 KDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLI 141 (142) Q Consensus 89 ~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~ 141 (142) .-|.- =+-| ++..-+.+.++| +||-|- |+.+..||+|. T Consensus 274 ~~Ga~--vlhp--~~~~p~~~~~Ip-i~v~nt----------~~p~~~GT~I~ 311 (476) T PRK09181 274 NLGME--AIHP--KAAKGLRQAGIP-LRIKNT----------FEPEHPGTLIT 311 (476) T ss_pred HCCCC--CCCC--HHHHHHHHCCCC-EEEEEC----------CCCCCCCEEEE T ss_conf 65986--4080--678999975996-899706----------89999975994 No 76 >TIGR00656 asp_kin_monofn asparate kinase, monofunctional class; InterPro: IPR005260 Aspartate kinase catalyzes the first step in the biosynthesis of Lys (via its precursor diaminopimelate), Met, and Thr. In E. coli, a distinct isozyme is inhibited by each of these three amino acid end products . The Met-sensitive (I) and Thr-sensitive (II) forms are bifunctional enzymes fused to homoserine dehydrogenase, which catalyses the next step in the biosynthesis of these amino acids. Inn contrast, the Lys-sensitive form (III) is a monofunctional enzyme; homoserine dehydrogenase is not part of the Lys biosynthetic pathway. This entry represents a subclass of aspartate kinases that are mostly Lys-sensitive and are not fused to homoserine dehydrogenase. The E. coli enzyme is a homodimer, while the Bacillus and Corynebacterium enzymes are alpha 2/beta 2 heterotetramers, where the beta subunit is translated from an in-phase alternative initiator at Met-246 , . The protein slr0657 from Synechocystis PCC6803 is extended by a duplication of the C-terminal region corresponding to the beta chain. Incorporation of a second copy of the C-terminal domain may be quite common in this subgroup of aspartokinases. ; GO: 0004072 aspartate kinase activity, 0009058 biosynthetic process. Probab=98.12 E-value=6.7e-06 Score=53.25 Aligned_cols=86 Identities=30% Similarity=0.422 Sum_probs=70.2 Q ss_pred CEEEECH-----HHHHHHHHCCCEEEECCC-CCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC--- Q ss_conf 0135569-----999999977984998574-31787320066---14799999998625113343226763353455--- Q 537021.9.peg.3 6 NVIKVNR-----TILDLLIKSGIIPVIAPI-APGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR--- 73 (142) Q Consensus 6 ~v~~vd~-----~~i~~lL~~g~iPVi~pi-g~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~--- 73 (142) ++...++ +.+..+|+.|++||++.+ |.+.+|+...+ .+|..|+.++.+++|+++.+.||++|+|..+. T Consensus 161 ~~~~~~~~~~~~~~l~~~l~~g~~~~~~Gf~G~~~~g~~ttlGrGGsd~~a~~l~~~l~a~~~~~~tdv~G~~~~dPr~~ 240 (480) T TIGR00656 161 KIDELATEGLAAERLLPLLEEGIIPVVAGFQGATEEGDTTTLGRGGSDLTAALLGAALKADRVDIYTDVPGVYTTDPRVV 240 (480) T ss_pred EEEHHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCC T ss_conf 00022246778888888875497589723335776764443047862378999875217763799744785012686424 Q ss_pred ---CCCCCCCHHHHHHHHHCC Q ss_conf ---555633888999996269 Q 537021.9.peg.3 74 ---QLISKLSINEARTLIKDG 91 (142) Q Consensus 74 ---~~i~~i~~~e~~~l~~~~ 91 (142) +.+..++..|+.++...+ T Consensus 241 ~~a~~~~~~~~~e~~~la~~G 261 (480) T TIGR00656 241 EGAKKLPKISYEEALELATLG 261 (480) T ss_pred CCCCCCCCCCHHHHHHHHHCC T ss_conf 564322332389999997525 No 77 >TIGR02078 AspKin_pair Pyrococcus aspartate kinase subunit, putative; InterPro: IPR011819 This family consists of proteins restricted to and found as paralogous pairs (typically close together) in species of Pyrococcus, a hyperthermophilic archaeal genus. Members are always found close to other genes of threonine biosynthesis and appear to represent the Pyrococcal form of aspartate kinase. Alignment to aspartokinase III from Escherichia coli shows that 300 N-terminal and 20 C-terminal amino acids are homologous, but the form in Pyrococcus lacks ~100 amino acids in between.. Probab=98.12 E-value=1.2e-05 Score=51.93 Aligned_cols=106 Identities=25% Similarity=0.325 Sum_probs=74.7 Q ss_pred CHHHHHHHHHCCCEEEECCCCCCCCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCC------CCCCCH Q ss_conf 6999999997798499857431787320066---1479999999862511334322676335345555------563388 Q 537021.9.peg.3 11 NRTILDLLIKSGIIPVIAPIAPGYDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQL------ISKLSI 81 (142) Q Consensus 11 d~~~i~~lL~~g~iPVi~pig~~~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~------i~~i~~ 81 (142) +...+.++|++|.||||+-..-+-+|--..+ =+|--|.++|+-+++..+.+++||.|+|..|+++ |+.++. T Consensus 153 ~~k~~~~~l~~G~iPvipGF~gnlnGyrvTlGRGGSDYSAvalg~l~~s~~v~IMSdVEGIfTADPKLvPsArLIpY~SY 232 (341) T TIGR02078 153 NAKILYEVLESGKIPVIPGFYGNLNGYRVTLGRGGSDYSAVALGVLLNSKLVAIMSDVEGIFTADPKLVPSARLIPYLSY 232 (341) T ss_pred HHHHHHHHHHCCCEEEECCCCCCCCCEEEEECCCCCHHHHHHHHHHHCCEEEEEEECCCEEEECCCCCCCCHHCCCCCCH T ss_conf 77788988865830221462068884489864786227999999875122689973456004358663750010333347 Q ss_pred HHHHHHHHCCCCCCCHH-HHHHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 89999962697678678-99999999998499889995257876 Q 537021.9.peg.3 82 NEARTLIKDGTISGGMI-PKIETSIKAIENGVKSVAILDGKKPH 124 (142) Q Consensus 82 ~e~~~l~~~~~~~~gm~-~Ki~~a~~a~~~Gv~~v~i~~~~~~~ 124 (142) +|+.- . .+-||+ -..+ |.+.++. .++-++=||..+ T Consensus 233 ~Ei~i--A---aK~GMkAl~wk-A~~l~~e--~~ip~lfGrt~d 268 (341) T TIGR02078 233 EEILI--A---AKLGMKALQWK-AADLAKE--YKIPVLFGRTRD 268 (341) T ss_pred HHHHH--H---HHHHHHHHHHH-HCCCCCC--CEEEEEECCCCC T ss_conf 79999--7---45205777665-2000157--543587415466 No 78 >KOG0456 consensus Probab=96.49 E-value=0.012 Score=35.37 Aligned_cols=104 Identities=26% Similarity=0.396 Sum_probs=67.6 Q ss_pred CCCEEEECCC-CCC-CCCCEECC---CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCC------CCCCCCCHHHHHHHHH Q ss_conf 7984998574-317-87320066---14799999998625113343226763353455------5556338889999962 Q 537021.9.peg.3 21 SGIIPVIAPI-APG-YDGATYNI---NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNR------QLISKLSINEARTLIK 89 (142) Q Consensus 21 ~g~iPVi~pi-g~~-~~G~~~ni---n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~------~~i~~i~~~e~~~l~~ 89 (142) .+.+||++-+ |.. +.|-.-.+ -.|..|+.++.+|+++.+-..-||||++..+. ++++.++.+|+.+|.. T Consensus 258 en~VPVvTGf~Gk~~~tg~lt~lGRG~sDl~At~i~~al~~~EiQVWKdVDGv~T~DP~~~p~Ar~vp~lT~dEAaELaY 337 (559) T KOG0456 258 ENAVPVVTGFLGKGWPTGALTTLGRGGSDLTATTIGKALGLDEIQVWKDVDGVLTCDPRIYPGARLVPYLTFDEAAELAY 337 (559) T ss_pred CCCCCEEEECCCCCCCCCCEECCCCCCHHHHHHHHHHHCCCHHHHHHHHCCCEEECCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 77663686003367656200035677216689999887181454213234752743886477750068658778778775 Q ss_pred CC--CCC-CCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 69--767-867899999999998499889995257876899999679986634549 Q 537021.9.peg.3 90 DG--TIS-GGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 90 ~~--~~~-~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) -+ .+- -.|.+-. +..+| +.|-|.. +..+.||.|-| T Consensus 338 fGaqVlHP~sM~~~~-------~~~IP-vRvKN~~----------NP~~~GTvI~~ 375 (559) T KOG0456 338 FGAQVLHPFSMRPAR-------EGRIP-VRVKNSY----------NPTAPGTVITP 375 (559) T ss_pred HHHHHCCCCCCCHHH-------CCCCC-EEEECCC----------CCCCCCEEECC T ss_conf 211210563334113-------06762-5863377----------99999428636 No 79 >PRK02797 4-alpha-L-fucosyltransferase; Provisional Probab=62.33 E-value=11 Score=19.01 Aligned_cols=60 Identities=17% Similarity=0.213 Sum_probs=36.6 Q ss_pred CCCCCCEEEECHHHH---HHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCC Q ss_conf 985520135569999---9999779849985743178732006614799999998625113343226 Q 537021.9.peg.3 1 LGFVGNVIKVNRTIL---DLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTD 64 (142) Q Consensus 1 ~g~~G~v~~vd~~~i---~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltd 64 (142) +|-+|++..=..+.+ +.....+ +-|++|++|....+.| .+++...-....+++++..|++ T Consensus 188 vGNSgD~sN~Hie~L~~l~~~~~~~-v~i~~Pl~Ypagn~~Y---i~~V~~~g~~lFg~~~~~~L~e 250 (358) T PRK02797 188 VGNSGDPSNRHIEALRALHQQFGDN-VKIIVPMGYPANNQAY---IEEVRQAGLALFGEENLQILTE 250 (358) T ss_pred EECCCCCCCCHHHHHHHHHHHHCCC-EEEEEECCCCCCCHHH---HHHHHHHHHHHCCCCCEEEHHH T ss_conf 7078985400899999999973688-6999977757765999---9999999998658666776442 No 80 >COG1778 Low specificity phosphatase (HAD superfamily) [General function prediction only] Probab=53.90 E-value=10 Score=19.13 Aligned_cols=21 Identities=33% Similarity=0.596 Sum_probs=11.8 Q ss_pred HHCCCCCEEE-------CCCCCCCCCCH Q ss_conf 3226763353-------45555563388 Q 537021.9.peg.3 61 FLTDVPGVLD-------KNRQLISKLSI 81 (142) Q Consensus 61 ~ltdv~Gv~d-------~~~~~i~~i~~ 81 (142) .++||||++. ++|+-++..+. T Consensus 11 li~DVDGvLTDG~ly~~~~Gee~KaFnv 38 (170) T COG1778 11 LILDVDGVLTDGKLYYDENGEEIKAFNV 38 (170) T ss_pred EEEECCCEEECCEEEECCCCCEEEEEEC T ss_conf 9994564056674888489756543303 No 81 >TIGR01670 YrbI-phosphatas 3-deoxy-D-manno-octulosonate 8-phosphate phosphatase, YrbI family; InterPro: IPR010023 This entry is a part of the haloacid dehalogenase (HAD) superfamily of hydrolases from beta, gamma and epsilon proteobacteria, Aquifex, Fusobacterium, Porphyromonas and Methanosarcina. All characterised members of the HAD-superfamily hydrolase, subfamily IIIA and most characterised members of the HAD superfamily are phosphatases. HAD superfamily phosphatases contain active site residues in several conserved catalytic motifs , all of which are found conserved in this family. Lipopolysaccharides of Gram-negative bacteria consist of a heteropolysaccharide component (O-antigen) and a hydrophobic component (lipid A). Linking the two portions is 3-Deoxy-D-manno-octulosonate (KDO), an 8-carbon sugar. Biosynthesis of KDO linked to lipid A proceeds via five steps, one of which involves removal of a phosphate from KDO 8-P via the action of KDO 8-P phosphatase (3.1.3.45 from EC; IPR008230 from INTERPRO) . One member of this family, the YrbI protein from Haemophilus influenzae has been cloned, expressed, purified and found to be an active phosphatase. Furthermore, its crystal structure has been determined . The sequence from Methanosarcina acetivorans, Q8TJL6 from SWISSPROT, is distinctive in that it is linked to an N-terminal cytidylyltransferase domain (IPR003329 from INTERPRO) and is annotated as acylneuraminate cytidylyltransferase. This may give some clue as the function of these phosphatases. Several eukaryotic sequences are also closely related to this family; one such sequence is the CMP-N-acetylneuraminic acid synthetase from mouse, but in this case as in the others the phosphatase domain is clearly inactive as many of the active site residues are not conserved.; GO: 0019143 3-deoxy-manno-octulosonate-8-phosphatase activity, 0009103 lipopolysaccharide biosynthetic process. Probab=53.63 E-value=18 Score=17.78 Aligned_cols=21 Identities=33% Similarity=0.596 Sum_probs=11.9 Q ss_pred HHCCCCCEEE-------CCCCCCCCCCH Q ss_conf 3226763353-------45555563388 Q 537021.9.peg.3 61 FLTDVPGVLD-------KNRQLISKLSI 81 (142) Q Consensus 61 ~ltdv~Gv~d-------~~~~~i~~i~~ 81 (142) ++.|+||++. ++|+.|+..|. T Consensus 4 lilDvDGvLtDGkiyyt~nGE~iK~FNV 31 (154) T TIGR01670 4 LILDVDGVLTDGKIYYTENGEEIKAFNV 31 (154) T ss_pred EEEECCCEEECCEEEECCCCCEEEEECC T ss_conf 8884272310670666388844312015 No 82 >TIGR01225 hutH histidine ammonia-lyase; InterPro: IPR005921 Histidine ammonia-lyase deaminates histidine to urocanic acid, the first step in histidine degradation. It is closely related to phenylalanine ammonia-lyase. ; GO: 0004397 histidine ammonia-lyase activity, 0006548 histidine catabolic process, 0005737 cytoplasm. Probab=50.72 E-value=6.7 Score=20.13 Aligned_cols=91 Identities=21% Similarity=0.387 Sum_probs=46.8 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEEC---CCCCC--------------CCCCEECCCH---HHHHHHHHHHHCHHHHH Q ss_conf 9855201355699999999779849985---74317--------------8732006614---79999999862511334 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIA---PIAPG--------------YDGATYNINA---DTFAGAIAEKLNVIRLL 60 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~---pig~~--------------~~G~~~nin~---D~~a~~lA~~l~A~kli 60 (142) -||+| |+.==.+.+-.+|+++.+|+|+ ++|-+ ..|+.+...+ +-..+.-+..|+| + T Consensus 107 ~G~SG-vR~ev~~~l~~~LN~~v~P~vP~~GSvGASGDLAPLaH~aL~L~GeG~a~~~~G~~~~A~~~L~~aGL~P---V 182 (529) T TIGR01225 107 KGYSG-VRAEVLELLVALLNAGVLPVVPEKGSVGASGDLAPLAHLALVLIGEGEAFSFEGERMPAAEALAAAGLKP---V 182 (529) T ss_pred CCCCH-HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEECCCCCCCCHHHHHHHHCCCC---E T ss_conf 56651-3589999999984899630306887635650045799999875234501057765063889999839998---0 Q ss_pred HHCCCCCEEECCCC-CCCC---CCHHHHHHHHHCCCCCC Q ss_conf 32267633534555-5563---38889999962697678 Q 537021.9.peg.3 61 FLTDVPGVLDKNRQ-LISK---LSINEARTLIKDGTISG 95 (142) Q Consensus 61 ~ltdv~Gv~d~~~~-~i~~---i~~~e~~~l~~~~~~~~ 95 (142) -|....|+-=-||. .... +-+-+++.++.+..+++ T Consensus 183 ~L~~KEGLALINGTQ~~t~~~~~Al~dA~~l~~~A~~~a 221 (529) T TIGR01225 183 TLKAKEGLALINGTQLMTGLAVLALFDAERLLRSADIAA 221 (529) T ss_pred EECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 205543303430677999999999999999999999987 No 83 >TIGR02902 spore_lonB ATP-dependent protease LonB; InterPro: IPR014251 This entry represents LonB, a paralog of the ATP-dependent protease La (LonA, IPR004815 from INTERPRO). LonB proteins are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ) and are found strictly, and almost universally, in endospore-forming bacteria. This protease was shown, in Bacillus subtilis, to be expressed specifically in the forespore during sporulation, under control of sigmaF . The lonB gene, despite being located immediately upstream of lonA, was shown to be monocistronic. LonB appears to be involved in the post-translation control of sigmaH, but lonB mutation did not produce an obvious sporulation defect under the conditions tested . Note that additional paralogs of LonA and LonB occur in the Clostridium lineage and these are excluded from this entry. . Probab=47.34 E-value=21 Score=17.41 Aligned_cols=33 Identities=30% Similarity=0.455 Sum_probs=23.5 Q ss_pred CCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHH Q ss_conf 7867899999999998499889995257876899999 Q 537021.9.peg.3 94 SGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEI 130 (142) Q Consensus 94 ~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~el 130 (142) -||...|++||. +.|+++| |+-+.+-......+ T Consensus 481 VGGV~~Ki~AA~---~AGak~V-iIP~eNwqe~~~~~ 513 (532) T TIGR02902 481 VGGVVSKIEAAK---KAGAKKV-IIPYENWQESFESI 513 (532) T ss_pred CCCCCHHHHHHH---HCCCCEE-ECCCCHHHHHHHHH T ss_conf 178612689999---7497265-34752078999855 No 84 >PRK09484 3-deoxy-D-manno-octulosonate 8-phosphate phosphatase; Provisional Probab=46.88 E-value=5.2 Score=20.73 Aligned_cols=11 Identities=36% Similarity=0.748 Sum_probs=8.4 Q ss_pred HHCCCCCEEEC Q ss_conf 32267633534 Q 537021.9.peg.3 61 FLTDVPGVLDK 71 (142) Q Consensus 61 ~ltdv~Gv~d~ 71 (142) |+.|+|||+.+ T Consensus 27 ~i~DvDGVLTD 37 (186) T PRK09484 27 LICDVDGVFSD 37 (186) T ss_pred EEECCEECEEC T ss_conf 99847164078 No 85 >PRK09367 histidine ammonia-lyase; Provisional Probab=46.17 E-value=13 Score=18.55 Aligned_cols=29 Identities=34% Similarity=0.434 Sum_probs=21.9 Q ss_pred CCCCCCEEEECHHHHHHHHHCCCEEEECCC Q ss_conf 985520135569999999977984998574 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKSGIIPVIAPI 30 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~g~iPVi~pi 30 (142) .||+| |+.-=.+.+..+|++|++|+|+.. T Consensus 110 ~G~SG-Vr~~vve~L~~~lN~~i~P~VP~~ 138 (504) T PRK09367 110 RGFSG-VRLEVIEALLALLNAGVYPVIPEK 138 (504) T ss_pred CCCCC-CCHHHHHHHHHHHHCCCCCCCCCC T ss_conf 68887-774699999999968981547788 No 86 >TIGR02405 trehalos_R_Ecol trehalose operon repressor; InterPro: IPR012771 Trehalose is a non-reducing disaccharide which can be used as both a carbon source and an osmoprotectant in bacteria. Trehalose uptake into the cytoplasm occurs via a trehalose-specific phosphotransferase system which phosphorylates trehalase to trehalose-6-phosphate (Tre6P) during transport into the cytoplasm, and a hydrolase which hydrolyses Tre6P to glucose and glucose-6-phophate. This entry represents LacI-type TreR, a transcriptional repressor of trehalose uptake found mainly within the gamma-proteobacteria. It does not include the GntR-type TreR's such as those found in Bacillus species. It is capable of binding both the inducer Tre6P and trehalose. Binding of trehalose does not affect the repressor's affinity for its DNA binding site, while binding Tre6P substantially reduces its affinity. The repression activity of TreR is therefore regulated by the ratio of trehalose to Tre6P within the cell . The protein is composed of two domains, an N-terminal DNA-binding helix-turn-helix domain, and a C-terminal effector-binding domain which is homologous to that of LacI. The effector-binding domain is composed of two subdomains, both of which form an alpha-beta-alpha sandwhich, with the effector binding site located at the interface of these subdomains . Tre6P and trehalose bind competitively to this site, with the affinty for trehalose substantially lower than that for Tre6P.; GO: 0003677 DNA binding, 0016566 specific transcriptional repressor activity, 0005991 trehalose metabolic process, 0016481 negative regulation of transcription. Probab=45.97 E-value=13 Score=18.51 Aligned_cols=10 Identities=30% Similarity=0.680 Sum_probs=4.4 Q ss_pred HHHCCCEEEE Q ss_conf 9977984998 Q 537021.9.peg.3 18 LIKSGIIPVI 27 (142) Q Consensus 18 lL~~g~iPVi 27 (142) +-++||=||| T Consensus 85 fY~~GYDpii 94 (311) T TIGR02405 85 FYEAGYDPII 94 (311) T ss_pred HHHCCCCEEE T ss_conf 9846998088 No 87 >TIGR02478 6PF1K_euk 6-phosphofructokinase; InterPro: IPR009161 Members of this family are eukaryotic (with one exception) ATP-dependent 6-phosphofructokinases (2.7.1.11 from EC) in which two tandem copies of the phosphofructokinase are found . Members are found, often including several isozymes, in animals and fungi and in the bacterium Propionibacterium acnes KPA171202 (a human skin commensal). Phosphofructokinase (PFK) catalyses the phosphorylation of fructose-6-phosphate to fructose-1,6-biphosphate, which then enters the Embden-Meyerhof pathway. PFK is a key regulatory enzyme in glycolysis. For additional information please see , , .; GO: 0003872 6-phosphofructokinase activity, 0006096 glycolysis, 0005737 cytoplasm. Probab=44.29 E-value=17 Score=17.95 Aligned_cols=89 Identities=16% Similarity=0.250 Sum_probs=49.5 Q ss_pred HHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCCHHHHHHHHHCC-CC----C-CCH---HHHHHHHHHHHHCCCCE Q ss_conf 799999998625113343226763353455555633888999996269-76----7-867---89999999999849988 Q 537021.9.peg.3 44 DTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLSINEARTLIKDG-TI----S-GGM---IPKIETSIKAIENGVKS 114 (142) Q Consensus 44 D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~~~e~~~l~~~~-~~----~-~gm---~~Ki~~a~~a~~~Gv~~ 114 (142) =++.+-.|...+++-..+.=.=.|+- +-|..|++..=++...++..| ++ . ..| .--|+||...+++|+.. T Consensus 19 VRAVVR~aI~~Gc~VY~irEGY~GLV-~GGD~I~~~~W~DV~g~l~~GGT~IGtARC~~FReR~GRLkAA~Nl~~~gId~ 97 (777) T TIGR02478 19 VRAVVRMAIYVGCRVYAIREGYEGLV-DGGDNIEELSWEDVRGILSLGGTLIGTARCKEFRERPGRLKAARNLIKRGIDA 97 (777) T ss_pred HHHHHHHHHHHCCEEEEEECCCCCCC-CCCCHHHHCCCCCCCCHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHHCCCCE T ss_conf 89998988731997999970764532-22420211144231223344873342032154458866899999988719945 Q ss_pred EEEEECCCCCH-------------HHHHHHCCCC Q ss_conf 99952578768-------------9999967998 Q 537021.9.peg.3 115 VAILDGKKPHS-------------ILMEIFTKNG 135 (142) Q Consensus 115 v~i~~~~~~~~-------------ll~elft~~g 135 (142) -.++.| +++ |+.||++++. T Consensus 98 LvViGG--DGSLTGAd~FR~EW~~L~~EL~~~g~ 129 (777) T TIGR02478 98 LVVIGG--DGSLTGADLFREEWPSLLEELVDTGK 129 (777) T ss_pred EEEECC--CHHHHHHHHHHHCCHHHHHHHHHCCC T ss_conf 999889--75688888776304158899731586 No 88 >pfam00221 PAL Phenylalanine and histidine ammonia-lyase. Probab=44.16 E-value=14 Score=18.34 Aligned_cols=28 Identities=29% Similarity=0.460 Sum_probs=21.3 Q ss_pred CCCCCEEEECHHHHHHHHHCCCEEEECCC Q ss_conf 85520135569999999977984998574 Q 537021.9.peg.3 2 GFVGNVIKVNRTILDLLIKSGIIPVIAPI 30 (142) Q Consensus 2 g~~G~v~~vd~~~i~~lL~~g~iPVi~pi 30 (142) ||+| |+.-=.+.+..+|++|++|+|+.. T Consensus 108 G~SG-v~~~vv~~L~~~lN~~i~P~VP~~ 135 (477) T pfam00221 108 GYSG-VRPEVLEALLALLNAGVTPVVPER 135 (477) T ss_pred CCCC-CCHHHHHHHHHHHHCCCCCCCCCC T ss_conf 9886-675699999999958981557788 No 89 >COG2986 HutH Histidine ammonia-lyase [Amino acid transport and metabolism] Probab=43.08 E-value=13 Score=18.62 Aligned_cols=26 Identities=38% Similarity=0.614 Sum_probs=20.4 Q ss_pred CCCCCEEEECHHHHHHHHHCCCEEEEC Q ss_conf 855201355699999999779849985 Q 537021.9.peg.3 2 GFVGNVIKVNRTILDLLIKSGIIPVIA 28 (142) Q Consensus 2 g~~G~v~~vd~~~i~~lL~~g~iPVi~ 28 (142) ||+| |+.-=.+.|..+|++|.+|+|+ T Consensus 111 G~SG-vr~~vi~~LlalLN~gv~P~iP 136 (498) T COG2986 111 GYSG-VRLEVIELLLALLNKGVIPVIP 136 (498) T ss_pred CCCC-CCHHHHHHHHHHHHCCCEECCC T ss_conf 7787-8899999999998589820688 No 90 >COG0283 Cmk Cytidylate kinase [Nucleotide transport and metabolism] Probab=42.49 E-value=30 Score=16.51 Aligned_cols=70 Identities=14% Similarity=0.255 Sum_probs=29.0 Q ss_pred HHHHHCCCCCEEECCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHCCCCCC Q ss_conf 33432267633534555556338889999962697678678999999999984998899952578768999996799866 Q 537021.9.peg.3 58 RLLFLTDVPGVLDKNRQLISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDGKKPHSILMEIFTKNGSG 137 (142) Q Consensus 58 kli~ltdv~Gv~d~~~~~i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~~~~~~ll~elft~~g~G 137 (142) ++-|..+ ..++-++..+-..+..++.......=..-...+..+....+..-...+.+ +++||. +| T Consensus 67 ~i~f~~~-~~v~l~gedvs~~ir~~~V~~~aS~vA~~p~VR~~l~~~Qr~~a~~~~~~-V~dGRD-------------iG 131 (222) T COG0283 67 DISFVND-DRVFLNGEDVSEEIRTEEVGNAASKVAAIPEVREALVKLQRAFAKNGPGI-VADGRD-------------IG 131 (222) T ss_pred CCEECCC-CEEEECCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCE-EEECCC-------------CC T ss_conf 8110467-15887782003665168999999999725999999999999998438988-986687-------------66 Q ss_pred CEECC Q ss_conf 34549 Q 537021.9.peg.3 138 TLLIP 142 (142) Q Consensus 138 T~i~p 142 (142) |.|.| T Consensus 132 TvV~P 136 (222) T COG0283 132 TVVFP 136 (222) T ss_pred CEECC T ss_conf 56778 No 91 >pfam11135 DUF2888 Protein of unknown function (DUF2888). Some members in this family of proteins with unknown function are annotated as immediate early protein ICP-18 however this cannot be confirmed. Probab=41.93 E-value=7.9 Score=19.73 Aligned_cols=29 Identities=21% Similarity=0.257 Sum_probs=21.7 Q ss_pred CEEEEEECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 889995257876899999679986634549 Q 537021.9.peg.3 113 KSVAILDGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 113 ~~v~i~~~~~~~~ll~elft~~g~GT~i~p 142 (142) +|.|-+++. ++.+-.|+-|..-+||+||| T Consensus 89 erlhpid~~-~g~~~~~letdgpvgtvivp 117 (146) T pfam11135 89 ERLHPIDTD-DGCDDEELETDGPVGTVIVP 117 (146) T ss_pred HHCCCCCCC-CCCCHHHEECCCCCEEEEEC T ss_conf 324765366-77630100027984059956 No 92 >pfam07429 Fuc4NAc_transf 4-alpha-L-fucosyltransferase (Fuc4NAc transferase). This family contains the bacterial enzyme 4-alpha-L-fucosyltransferase (Fuc4NAc transferase) (EC 2.4.1.-) (approximately 360 residues long). This catalyses the synthesis of Fuc4NAc-ManNAcA-GlcNAc-PP-Und (lipid III) as part of the biosynthetic pathway of enterobacterial common antigen (ECA), a polysaccharide comprised of the trisaccharide repeat unit Fuc4NAc-ManNAcA-GlcNAc. Probab=38.22 E-value=36 Score=16.12 Aligned_cols=60 Identities=17% Similarity=0.221 Sum_probs=34.9 Q ss_pred CCCCCCEEEECHHHHHH---HHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCC Q ss_conf 98552013556999999---99779849985743178732006614799999998625113343226 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDL---LIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTD 64 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~---lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltd 64 (142) +|-+|+...=..+.++. ....+ |-||.|+||-...+.| .+++..+-....+++++..|++ T Consensus 190 vGNSgD~sN~HieaL~~i~~~fg~~-vkiivPmgYPannq~Y---i~~V~~~g~~lF~~~~~~iL~e 252 (361) T pfam07429 190 VGNSGDRSNRHIAALKAIHQQFGDN-VRIIVPMGYPANNQAY---IEQVRQAGLALFPAENLQILTE 252 (361) T ss_pred EECCCCCCHHHHHHHHHHHHHHCCC-EEEEEECCCCCCCHHH---HHHHHHHHHHHCCCCCHHHHHH T ss_conf 7378884404999999999973798-4899978867876999---9999999997648421116653 No 93 >TIGR01772 MDH_euk_gproteo malate dehydrogenase, NAD-dependent; InterPro: IPR010097 This entry represents the NAD-dependent malate dehydrogenase found in eukaryotes and certain gamma proteobacteria. The enzyme is involved in the citric acid cycle as well as the glyoxalate cycle. Several isoforms exist in eukaryotes. In Saccharomyces cereviseae, for example, there are cytoplasmic, mitochondrial and peroxisomal forms. Although malate dehydrogenases have in some cases been mistaken for lactate dehydrogenases due to the similarity of these two substrates and the apparent ease with which evolution can toggle these activities, critical residues have been identified which can discriminate between the two activities.; GO: 0030060 L-malate dehydrogenase activity, 0006108 malate metabolic process. Probab=38.01 E-value=29 Score=16.65 Aligned_cols=76 Identities=28% Similarity=0.410 Sum_probs=50.4 Q ss_pred HHHHHHHCCCEEEEC-----CCCCCCCCCEECCCH---HHHHHHHHHHHCHHH--HHHHCC--------------CCCEE Q ss_conf 999999779849985-----743178732006614---799999998625113--343226--------------76335 Q 537021.9.peg.3 14 ILDLLIKSGIIPVIA-----PIAPGYDGATYNINA---DTFAGAIAEKLNVIR--LLFLTD--------------VPGVL 69 (142) Q Consensus 14 ~i~~lL~~g~iPVi~-----pig~~~~G~~~nin~---D~~a~~lA~~l~A~k--li~ltd--------------v~Gv~ 69 (142) .|+.+|+.-.|+||+ =.|.+.| +.||+|| -+++.++|...--.+ +.++|+ ..|+| T Consensus 81 ~l~~~L~~advVvIPAGVPRKPGMtRD-DLF~~NAgIv~~L~~a~A~~cP~A~vfi~vI~NPVNStVPI~a~~LKk~G~Y 159 (379) T TIGR01772 81 ALENALKGADVVVIPAGVPRKPGMTRD-DLFNVNAGIVKDLVAAVAESCPKAMVFILVISNPVNSTVPIAAEVLKKKGVY 159 (379) T ss_pred CHHHHHHCCCEEEEECCCCCCCCCCHH-HHHHHCHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCHHHHHHHHHHHCCCC T ss_conf 148886279589981787887556755-6755041789989999974087605899998668565179999999961886 Q ss_pred ECCC----CCCC--CCCHHHHHHHHHC Q ss_conf 3455----5556--3388899999626 Q 537021.9.peg.3 70 DKNR----QLIS--KLSINEARTLIKD 90 (142) Q Consensus 70 d~~~----~~i~--~i~~~e~~~l~~~ 90 (142) |.+. ++.. .++.=.+..++.+ T Consensus 160 ~pn~GierrlfGVT~LD~vRA~tF~~E 186 (379) T TIGR01772 160 DPNKGIERRLFGVTTLDIVRANTFVAE 186 (379) T ss_pred CCCCCCCEEECCCCHHHHHHHHHHHHH T ss_conf 888753024314112468999999999 No 94 >KOG1494 consensus Probab=36.25 E-value=38 Score=15.95 Aligned_cols=52 Identities=31% Similarity=0.340 Sum_probs=32.5 Q ss_pred HHHHHHHHHCCCEEEEC---C--CCCCCCCCEECCCH---HHHHHHHHHHHCHHHHHHHCC Q ss_conf 99999999779849985---7--43178732006614---799999998625113343226 Q 537021.9.peg.3 12 RTILDLLIKSGIIPVIA---P--IAPGYDGATYNINA---DTFAGAIAEKLNVIRLLFLTD 64 (142) Q Consensus 12 ~~~i~~lL~~g~iPVi~---p--ig~~~~G~~~nin~---D~~a~~lA~~l~A~kli~ltd 64 (142) .+-|...|+.-.++||+ | .|.+.+ +.||+|+ -+++.++|...--.++.++|+ T Consensus 87 ~~~L~~al~~advVvIPAGVPRKPGMTRD-DLFn~NAgIv~~l~~aia~~cP~A~i~vIsN 146 (345) T KOG1494 87 ADGLENALKGADVVVIPAGVPRKPGMTRD-DLFNINAGIVKTLAAAIAKCCPNALILVISN 146 (345) T ss_pred HHHHHHHHCCCCEEEECCCCCCCCCCCHH-HHHHCCHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 36789875389789963899899998577-7664144799999999986596121676607 No 95 >PRK09802 DNA-binding transcriptional regulator AgaR; Provisional Probab=33.50 E-value=42 Score=15.75 Aligned_cols=20 Identities=20% Similarity=0.192 Sum_probs=8.7 Q ss_pred CCEECCCHHHHHHHHHHHHC Q ss_conf 32006614799999998625 Q 537021.9.peg.3 36 GATYNINADTFAGAIAEKLN 55 (142) Q Consensus 36 G~~~nin~D~~a~~lA~~l~ 55 (142) |+++-+|+=..+.++|..|. T Consensus 107 gd~I~lD~GTT~~~la~~L~ 126 (269) T PRK09802 107 GHRVILDSGTTTFEIARLMR 126 (269) T ss_pred CCEEEECCCHHHHHHHHHHH T ss_conf 99999778679999997323 No 96 >cd00332 PAL-HAL Phenylalanine ammonia-lyase (PAL) and histidine ammonia-lyase (HAL) and. PAL and HAL are members of the Lyase class I_like superfamily of enzymes that, catalyze similar beta-elimination reactions and are active as homotetramers. The four active sites of the homotetrameric enzyme are each formed by residues from three different subunits. PAL, present in plants and fungi, catalyzes the conversion of L-phenylalanine to E-cinnamic acid. HAL, found in several bacteria and animals, catalyzes the conversion of L-histidine to E-urocanic acid . Both PAL and HAL contain the cofactor 3, 5-dihydro-5-methylidene-4H-imidazol-4-one (MIO) which is formed by autocatalytic excision/cyclization of the internal tripeptide, Ala-Ser-Gly. PAL is being explored as enzyme substitution therapy for Phenylketonuria (PKU), a disorder which involves an inability to metabolize phenylalanine. HAL failure in humans results in the disease histidinemia. Probab=33.28 E-value=24 Score=17.07 Aligned_cols=28 Identities=29% Similarity=0.393 Sum_probs=21.5 Q ss_pred CCCCCEEEECHHHHHHHHHCCCEEEECCC Q ss_conf 85520135569999999977984998574 Q 537021.9.peg.3 2 GFVGNVIKVNRTILDLLIKSGIIPVIAPI 30 (142) Q Consensus 2 g~~G~v~~vd~~~i~~lL~~g~iPVi~pi 30 (142) ||+| |+.-=.+.+..+|++|++|+|+.. T Consensus 103 G~SG-v~~~vv~~L~~~lN~~i~P~VP~~ 130 (444) T cd00332 103 GHSG-VRPEVLERLVALLNAGVTPVVPER 130 (444) T ss_pred CCCC-CCHHHHHHHHHHHHCCCCCCCCCC T ss_conf 8886-675799999999968981547788 No 97 >pfam06555 consensus Probab=32.23 E-value=15 Score=18.25 Aligned_cols=67 Identities=15% Similarity=0.213 Sum_probs=40.9 Q ss_pred HHHHCCCEEEECCCCCCCCCCEECCCH--HHHHHHHHHHHCHHHHHHHCCCCCEEECCC---CCCCCCC-HHHHHHH Q ss_conf 999779849985743178732006614--799999998625113343226763353455---5556338-8899999 Q 537021.9.peg.3 17 LLIKSGIIPVIAPIAPGYDGATYNINA--DTFAGAIAEKLNVIRLLFLTDVPGVLDKNR---QLISKLS-INEARTL 87 (142) Q Consensus 17 ~lL~~g~iPVi~pig~~~~G~~~nin~--D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~---~~i~~i~-~~e~~~l 87 (142) ..+..++ =+-|.=+.+.+|+.+|+|+ .++|+.+ .+|.|+|++....-|-.+.. ++++.+. +.+++.+ T Consensus 80 ~a~~aD~-fl~S~NAiT~dG~LvNiDg~GNRVaa~~---fGpk~Vi~V~G~NKIv~dle~A~~Rir~~AaP~Na~Rl 152 (200) T pfam06555 80 KALTADY-FLTSTNAITEDGELVNIDGNGNRVAAMI---FGPKNVIVVAGINKIVKDLEEAIKRIKTIAAPMNAKRL 152 (200) T ss_pred HHHCCCE-EEECHHHCCCCCEEEEECCCCCHHHHHH---CCCCEEEEEEECCCCCCCHHHHHHHHHHHCCCCCHHHC T ss_conf 9657999-9865204057875999727734555310---28971999971773569999999999874478234442 No 98 >PHA00451 protein kinase Probab=31.78 E-value=15 Score=18.20 Aligned_cols=44 Identities=14% Similarity=0.376 Sum_probs=30.6 Q ss_pred HHHHHCCCEEEEC-CCCCCCCC----CEECCCHHHHHHHHHHHHCHHHHH Q ss_conf 9999779849985-74317873----200661479999999862511334 Q 537021.9.peg.3 16 DLLIKSGIIPVIA-PIAPGYDG----ATYNINADTFAGAIAEKLNVIRLL 60 (142) Q Consensus 16 ~~lL~~g~iPVi~-pig~~~~G----~~~nin~D~~a~~lA~~l~A~kli 60 (142) .-+.+++-+|+|+ |+.|+.+. ..|.++.|.+-+++-. +..++.| T Consensus 200 NiMf~~~g~p~ITDPVSFS~Dr~re~G~F~ldPd~LiaEvEa-ia~~~~I 248 (364) T PHA00451 200 NIMFDKMGVPYITDPVSFSADRKREPGEFPLDPDELIAEVEA-IANQRAI 248 (364) T ss_pred CEEECCCCCEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHH-HHHHHHH T ss_conf 434678997455278555542234799788898999999999-9889999 No 99 >TIGR01226 phe_am_lyase phenylalanine ammonia-lyase; InterPro: IPR005922 The ubiquitous higher plant enzyme phenylalanine ammonia-lyase (PAL; 4.3.1.5 from EC) is a key biosynthetic catalyst in phenylpropanoid assembly. PAL catalyses the non-oxidative deamination of L-phenylalanine to trans-cinnamic acid. PAL contains a catalytic Ala-Ser-Gly triad that is post-translationally cyclised. PAL is structurally similar to the mechanistically related histidine ammonia lyase (HAL; 4.3.1.3 from EC), with PAL having an additional approximately 160 residues extending from the common fold . Catalysis in PAL may be governed by the dipole moments of seven alpha helices associated with the PAL active site. The cofactor 3,5-dihydro-5-methylidene-4H-imidazol-4-one (MIO) resides atop the positive poles of three helices, for increasing its electrophilicity. Plant and fungal PAL enzymes contain aa approximately 100-residue long C-terminal multi-helix domain, which might play a role in the rapid response of PAL in the regulation of phenylpropanoid biosynthesis by destabilising the enzyme . ; GO: 0016841 ammonia-lyase activity, 0006559 L-phenylalanine catabolic process, 0005737 cytoplasm. Probab=30.75 E-value=26 Score=16.84 Aligned_cols=40 Identities=15% Similarity=0.256 Sum_probs=28.1 Q ss_pred CCCCCEEEECHHHHHHHHHCCCEEEECCC-CCCCCCCEECCC Q ss_conf 85520135569999999977984998574-317873200661 Q 537021.9.peg.3 2 GFVGNVIKVNRTILDLLIKSGIIPVIAPI-APGYDGATYNIN 42 (142) Q Consensus 2 g~~G~v~~vd~~~i~~lL~~g~iPVi~pi-g~~~~G~~~nin 42 (142) ||+| |+-.=.|.+++||+++.+|+++== .++..|+...++ T Consensus 157 GySg-iR~~ilE~~~~lLn~~vtP~lPLRGtI~aSGDL~PLS 197 (714) T TIGR01226 157 GYSG-IRFEILEAITKLLNANVTPVLPLRGTITASGDLVPLS 197 (714) T ss_pred CCCH-HHHHHHHHHHHHHHCCCCEEECCCCCCCCCCCCCHHH T ss_conf 3303-6899999999997489832105775500434231477 No 100 >cd04167 Snu114p Snu114p subfamily. Snu114p is one of several proteins that make up the U5 small nuclear ribonucleoprotein (snRNP) particle. U5 is a component of the spliceosome, which catalyzes the splicing of pre-mRNA to remove introns. Snu114p is homologous to EF-2, but typically contains an additional N-terminal domain not found in Ef-2. This protein is part of the GTP translation factor family and the Ras superfamily, characterized by five G-box motifs. Probab=29.69 E-value=5.7 Score=20.50 Aligned_cols=116 Identities=18% Similarity=0.189 Sum_probs=53.9 Q ss_pred CCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCC--CEECCCHHHHHHHHHHHHCH-----------HHHHHHCCCCCE Q ss_conf 85520135569999999977984998574317873--20066147999999986251-----------133432267633 Q 537021.9.peg.3 2 GFVGNVIKVNRTILDLLIKSGIIPVIAPIAPGYDG--ATYNINADTFAGAIAEKLNV-----------IRLLFLTDVPGV 68 (142) Q Consensus 2 g~~G~v~~vd~~~i~~lL~~g~iPVi~pig~~~~G--~~~nin~D~~a~~lA~~l~A-----------~kli~ltdv~Gv 68 (142) +..|.+.+.-+.++..|+........ -+....+ +.+....++..-.+.....+ ..++-+.|.||= T Consensus 4 aiigHvdhGKTTL~d~Ll~~t~~~~~--~~~~~~~~~~~~D~~~~E~eRgiTI~s~~~sl~~~~~~~k~~~inlIDTPGH 81 (213) T cd04167 4 AIAGHLHHGKTSLLDMLIEQTHDLTP--SGKDGWKPLRYTDIRKDEQERGISIKSSPISLVLPDSKGKSYLFNIIDTPGH 81 (213) T ss_pred EEECCCCCCHHHHHHHHHHHHHHHHH--HCCCCCCEEEEECCCHHHHHHCCCEECCEEEEEEECCCCCEEEEEEEECCCC T ss_conf 99827898989999999997344555--4044421135751646654203558614599998256675057877889872 Q ss_pred EECCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 5345555563388899999626976786789999999999849988999525 Q 537021.9.peg.3 69 LDKNRQLISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILDG 120 (142) Q Consensus 69 ~d~~~~~i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~~ 120 (142) .|-.++..+.+...+..-+.-+. ..|=|..--+...+|.+.+++.+.++|- T Consensus 82 ~dF~~ev~~al~~~DgailVVDa-~eGv~~qT~~~l~~a~~~~l~~ilviNK 132 (213) T cd04167 82 VNFMDEVAAALRLSDGVVLVVDV-VEGVTSNTERLIRHAILEGLPIVLVINK 132 (213) T ss_pred HHHHHHHHHHHHHCCEEEEEEEC-CCCCCHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 41799999888637767999987-8887577999999999869998999988 No 101 >TIGR01188 drrA daunorubicin resistance ABC transporter, ATP-binding protein; InterPro: IPR005894 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family contains the daunorubicin resistance ABC transporter, ATP binding subunit in bacteria and archaea. This model is restricted in its scope to preferentially recognize the ATP binding subunit associated with effux of the drug, daunorubicin. In other words it functions as an ATP dependent antiporter. . Probab=28.05 E-value=26 Score=16.84 Aligned_cols=27 Identities=37% Similarity=0.512 Sum_probs=19.1 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 97678678999999999984998899952 Q 537021.9.peg.3 91 GTISGGMIPKIETSIKAIENGVKSVAILD 119 (142) Q Consensus 91 ~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~ 119 (142) +..+|||+++|+-|.-.+-+ |++-+.| T Consensus 127 ~tySGGMrRRL~iA~sli~~--P~vLFLD 153 (343) T TIGR01188 127 KTYSGGMRRRLEIAASLIHQ--PEVLFLD 153 (343) T ss_pred CCCCCCCEEHHHHHHHHHCC--CCEEECC T ss_conf 22677112144543111038--8256514 No 102 >PRK11377 dihydroxyacetone kinase subunit M; Provisional Probab=26.25 E-value=57 Score=14.97 Aligned_cols=55 Identities=11% Similarity=0.053 Sum_probs=40.0 Q ss_pred HHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCC Q ss_conf 9999999779849985743178732006614799999998625113343226763 Q 537021.9.peg.3 13 TILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPG 67 (142) Q Consensus 13 ~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~G 67 (142) +....+...+.+|+...=|.++.+..+..|++.+..++......+-++.|+|-.. T Consensus 19 ela~qm~~~~~~~i~~a~g~~d~~~~~Gtd~~~i~~ai~~~~~~~gv~vl~DlGS 73 (472) T PRK11377 19 ELARQMLMSDGCKLAIAAGIDDPQNPIGTDAVKVMEAIESVADADHVLVMMDMGS 73 (472) T ss_pred HHHHHHCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEEECCH T ss_conf 9999743489854887235579989878899999999998538997799981748 No 103 >COG1658 Small primase-like proteins (Toprim domain) [DNA replication, recombination, and repair] Probab=26.13 E-value=44 Score=15.60 Aligned_cols=55 Identities=24% Similarity=0.239 Sum_probs=36.2 Q ss_pred CHHHHHHHHHCCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCC Q ss_conf 699999999779849985743178732006614799999998625113343226763353455555633 Q 537021.9.peg.3 11 NRTILDLLIKSGIIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKL 79 (142) Q Consensus 11 d~~~i~~lL~~g~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i 79 (142) |...|+.+...+.+-+.+ . -++.......++.++.-..+|+||| .|..|+.|++. T Consensus 20 D~~~l~~~~~~~~i~~~g--------~--~i~~~~~ie~i~~~~~~k~VIILTD----~D~~Ge~Irk~ 74 (127) T COG1658 20 DTASLKRLGDAGVIITNG--------S--AINSLETIELIKKAQKYKGVIILTD----PDRKGERIRKK 74 (127) T ss_pred HHHHHHHHCCCCEEEECC--------C--CCCHHHHHHHHHHHHCCCCEEEEEC----CCCCHHHHHHH T ss_conf 799999854686389758--------6--5427889999998642587799868----98562899999 No 104 >KOG1447 consensus Probab=26.04 E-value=52 Score=15.21 Aligned_cols=49 Identities=24% Similarity=0.310 Sum_probs=34.8 Q ss_pred CCCCCCEEEECHHHHHHHHHC----CCEE-EECCCCCCCCCCEECCCH----HHHHHH Q ss_conf 985520135569999999977----9849-985743178732006614----799999 Q 537021.9.peg.3 1 LGFVGNVIKVNRTILDLLIKS----GIIP-VIAPIAPGYDGATYNINA----DTFAGA 49 (142) Q Consensus 1 ~g~~G~v~~vd~~~i~~lL~~----g~iP-Vi~pig~~~~G~~~nin~----D~~a~~ 49 (142) |||+|+..+.-.+-|+.+-+. +-+- =|.|++.+++|+++-+++ |+-|.. T Consensus 193 L~F~G~l~~~aA~eI~kLY~LF~avDAtQvEiNPl~ET~~G~V~cvDAK~NFDDnA~f 250 (412) T KOG1447 193 LGFVGPLKSQAADEITKLYNLFLAVDATQVEINPLGETPEGQVVCVDAKINFDDNAEF 250 (412) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCEEEEEEEECCCCCHHHH T ss_conf 4666707778999999999999651212787325665898608998501067841766 No 105 >PRK13597 imidazole glycerol phosphate synthase subunit HisF; Provisional Probab=25.50 E-value=59 Score=14.89 Aligned_cols=18 Identities=11% Similarity=0.300 Sum_probs=8.3 Q ss_pred HHHHHHHCHHHHHHHCCC Q ss_conf 999986251133432267 Q 537021.9.peg.3 48 GAIAEKLNVIRLLFLTDV 65 (142) Q Consensus 48 ~~lA~~l~A~kli~ltdv 65 (142) .+++...++++++.--|+ T Consensus 115 ~~~~~~fG~q~Iv~~iD~ 132 (252) T PRK13597 115 RELADHFGAQAVVLAIDA 132 (252) T ss_pred HHHHHHCCCCCEEEEEEE T ss_conf 999987499652999988 No 106 >COG1634 Uncharacterized Rossmann fold enzyme [General function prediction only] Probab=25.46 E-value=37 Score=16.05 Aligned_cols=21 Identities=14% Similarity=0.189 Sum_probs=16.7 Q ss_pred CCHHHHHHHHHHHHCHHHHHHH Q ss_conf 6147999999986251133432 Q 537021.9.peg.3 41 INADTFAGAIAEKLNVIRLLFL 62 (142) Q Consensus 41 in~D~~a~~lA~~l~A~kli~l 62 (142) .|+|++ +.+|..|+|.++++. T Consensus 158 TDGDRA-a~LA~~lgA~~I~l~ 178 (232) T COG1634 158 TDGDRA-AFLAYYLGAEKIRLV 178 (232) T ss_pred CCCHHH-HHHHHHHCCCEEEEE T ss_conf 873099-999998497769995 No 107 >TIGR02364 dha_pts dihydroxyacetone kinase, phosphotransfer subunit; InterPro: IPR012844 In Escherichia coli and many other bacteria, unlike the yeasts and a few bacteria such as Citrobacter freundii, the dihydroxyacetone kinase (also called glycerone kinase) transfers a phosphate from a phosphoprotein rather than from ATP and contains multiple subunits.. Probab=22.09 E-value=70 Score=14.51 Aligned_cols=70 Identities=20% Similarity=0.290 Sum_probs=51.4 Q ss_pred HHHHHHHHHCCCEEEECCCCCC-CCCCEECCCHHHHHHHHHHHHC-HHHHHHHCCCCCEEECCCCCCCCCCHHHHHHHHH Q ss_conf 9999999977984998574317-8732006614799999998625-1133432267633534555556338889999962 Q 537021.9.peg.3 12 RTILDLLIKSGIIPVIAPIAPG-YDGATYNINADTFAGAIAEKLN-VIRLLFLTDVPGVLDKNRQLISKLSINEARTLIK 89 (142) Q Consensus 12 ~~~i~~lL~~g~iPVi~pig~~-~~G~~~nin~D~~a~~lA~~l~-A~kli~ltdv~Gv~d~~~~~i~~i~~~e~~~l~~ 89 (142) .+.++++...+.++|+..=|-+ ++| .+..|.+.+..++....+ +++++.|+|-+. ..+|.+-+-+|++ T Consensus 17 ~~L~~qM~~S~dV~I~~aGGtdd~dg-~lGTs~~~I~eaI~~~~~~~~gv~~f~DlGS---------A~mn~E~A~El~e 86 (128) T TIGR02364 17 KELVKQMAGSDDVKIISAGGTDDDDG-RLGTSADKIKEAIEKAANEAEGVLIFYDLGS---------AVMNAEMAVELLE 86 (128) T ss_pred HHHHHHHHCCCCEEEEEECCCCCCCC-CCCCCHHHHHHHHHHHHCCCCCEEEEEECCC---------HHHHHHHHHHHCC T ss_conf 99999853679546887106428888-8078989999998653027894699862661---------2565889986338 Q ss_pred CC Q ss_conf 69 Q 537021.9.peg.3 90 DG 91 (142) Q Consensus 90 ~~ 91 (142) .. T Consensus 87 ~~ 88 (128) T TIGR02364 87 DE 88 (128) T ss_pred CC T ss_conf 50 No 108 >cd01173 pyridoxal_pyridoxamine_kinase Pyridoxal kinase plays a key role in the synthesis of the active coenzyme pyridoxal-5'-phosphate (PLP), by catalyzing the phosphorylation of the precursor vitamin B6 in the presence of Zn2+ and ATP. Mammals are unable to synthesize PLP de novo and require its precursors in the form of vitamin B6 (pyridoxal, pyridoxine, and pyridoxamine) from their diet. Pyridoxal kinase encoding genes are also found in many other species including yeast and bacteria. Probab=21.98 E-value=70 Score=14.50 Aligned_cols=24 Identities=13% Similarity=0.061 Sum_probs=15.0 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 999999999984998899952578 Q 537021.9.peg.3 99 PKIETSIKAIENGVKSVAILDGKK 122 (142) Q Consensus 99 ~Ki~~a~~a~~~Gv~~v~i~~~~~ 122 (142) .-.++|.+..+.|.+.|.|-++.. T Consensus 160 ~~~~aa~~L~~~g~~~Vvitg~~~ 183 (254) T cd01173 160 DAKAAARALHAKGPKTVVVTSVEL 183 (254) T ss_pred HHHHHHHHHHHHCCCEEEECCCCC T ss_conf 999999999970899999998657 No 109 >PRK06953 short chain dehydrogenase; Provisional Probab=21.96 E-value=70 Score=14.50 Aligned_cols=112 Identities=11% Similarity=0.173 Sum_probs=59.2 Q ss_pred CCEEEECHHHHHHHHHCCCEEEECCCCC-------CCCCCEE--CC-CHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCC Q ss_conf 2013556999999997798499857431-------7873200--66-147999999986251133432267633534555 Q 537021.9.peg.3 5 GNVIKVNRTILDLLIKSGIIPVIAPIAP-------GYDGATY--NI-NADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQ 74 (142) Q Consensus 5 G~v~~vd~~~i~~lL~~g~iPVi~pig~-------~~~G~~~--ni-n~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~ 74 (142) |-=+.|-....+.++++|.-++++.--. ....+.+ .+ +.+.+ ..++..+...++=.+....|++..... T Consensus 8 Gas~GIG~a~a~~la~~G~~V~~~~R~~~~l~~l~~~~~~~~~~Dv~d~~~v-~~~~~~~~~~~ldili~nAGi~~~~~~ 86 (222) T PRK06953 8 GASRGIGLEFVRQYRADGWRVIATARDAAGLAALRALGAEALALDVADPESI-AGLGWKLDGEALDAAVYVAGVYGPRTE 86 (222) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHCCCCEEEEECCCHHHH-HHHHHHHCCCCCCEEEEECCCCCCCCC T ss_conf 7572999999999998889999996888889998842151777405899999-999986236776789981665567876 Q ss_pred CCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 556338889999962697678678999999999984998899952 Q 537021.9.peg.3 75 LISKLSINEARTLIKDGTISGGMIPKIETSIKAIENGVKSVAILD 119 (142) Q Consensus 75 ~i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~a~~~Gv~~v~i~~ 119 (142) .+..++.++.+..++-+.. |...-.++++-.++..-.++.+++ T Consensus 87 ~~~~~~~~~~~~~~~vN~~--g~~~l~~~~lP~l~~~~g~ii~iS 129 (222) T PRK06953 87 GVEPITREDFDAVMHTNVL--GPMQLLPILLPLVEAAGGVLAVLS 129 (222) T ss_pred CCCCCCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHCCCCCCEEC T ss_conf 5466899999999987119--999999999999985799852456 No 110 >COG0149 TpiA Triosephosphate isomerase [Carbohydrate transport and metabolism] Probab=21.59 E-value=65 Score=14.68 Aligned_cols=40 Identities=18% Similarity=0.341 Sum_probs=25.3 Q ss_pred HHHHHHHHCCCEEEEC---CCCCCCCCCEECCCHHHHHHHHHH Q ss_conf 9999999779849985---743178732006614799999998 Q 537021.9.peg.3 13 TILDLLIKSGIIPVIA---PIAPGYDGATYNINADTFAGAIAE 52 (142) Q Consensus 13 ~~i~~lL~~g~iPVi~---pig~~~~G~~~nin~D~~a~~lA~ 52 (142) .-++..++.|+.||+| ++..-+.|+...+-..++...++. T Consensus 112 ~K~~aa~~~Gl~pIlCvGEtl~~reag~t~~v~~~Ql~~~l~~ 154 (251) T COG0149 112 KKVKAAKEAGLTPILCVGETLEEREAGKTLEVLKRQLAAALAA 154 (251) T ss_pred HHHHHHHHCCCEEEEECCCCHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 9999999889968998589777775556688999999999874 No 111 >TIGR00017 cmk cytidylate kinase; InterPro: IPR003136 This family consists of cytidylate kinase (2.7.4.14 from EC), which catalyzes the phosphorylation of cytidine 5-monophosphate (dCMP) to cytidine 5 -diphosphate (dCDP) in the presence of ATP or GTP. UMP and dCMP can also act as acceptors .; GO: 0004127 cytidylate kinase activity, 0005524 ATP binding, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=20.95 E-value=74 Score=14.38 Aligned_cols=84 Identities=13% Similarity=0.187 Sum_probs=38.2 Q ss_pred CHHHHHHHHHHHHCHHHHHHHCCC--CCEEECCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHH-HHHCCCCEEEEE Q ss_conf 147999999986251133432267--63353455555633888999996269767867899999999-998499889995 Q 537021.9.peg.3 42 NADTFAGAIAEKLNVIRLLFLTDV--PGVLDKNRQLISKLSINEARTLIKDGTISGGMIPKIETSIK-AIENGVKSVAIL 118 (142) Q Consensus 42 n~D~~a~~lA~~l~A~kli~ltdv--~Gv~d~~~~~i~~i~~~e~~~l~~~~~~~~gm~~Ki~~a~~-a~~~Gv~~v~i~ 118 (142) +.++.-+.|+..++- -|.+.. -+|+-++..+-..|..+++..+...-......+.++---.+ ..++.-. =.|+ T Consensus 55 ~~E~~L~~L~~~~di---~f~~~~~~~~v~~nG~DV~~~I~t~~v~~~aS~~A~~~~vR~~~~~~Qq~~~~~sd~-g~v~ 130 (223) T TIGR00017 55 TDEDALLELISQLDI---RFIPTAGEVKVFLNGEDVSEAIRTQEVANLASKVAAFPKVREILLKRQQKLAKNSDD-GIVA 130 (223) T ss_pred CCHHHHHHHHHHCCC---EEECCCCCEEEEECCCCHHHHCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCC-CEEE T ss_conf 684889999863260---442157750688707771322068358999998723777999999999998751589-7588 Q ss_pred ECCCCCHHHHHHHCCCCCCCEECC Q ss_conf 257876899999679986634549 Q 537021.9.peg.3 119 DGKKPHSILMEIFTKNGSGTLLIP 142 (142) Q Consensus 119 ~~~~~~~ll~elft~~g~GT~i~p 142 (142) +||. +||+|.| T Consensus 131 dGRD-------------iGTvVfP 141 (223) T TIGR00017 131 DGRD-------------IGTVVFP 141 (223) T ss_pred ECCC-------------CCCEECC T ss_conf 4552-------------3744678 No 112 >pfam11322 DUF3124 Protein of unknown function (DUF3124). This bacterial family of proteins has no known function. Probab=20.21 E-value=7.7 Score=19.79 Aligned_cols=53 Identities=17% Similarity=0.323 Sum_probs=37.6 Q ss_pred CEEEECCCCCCCCCCEECCCHHHHHHHHHHHHCHHHHHHHCCCCCEEECCCCCCCCCC Q ss_conf 8499857431787320066147999999986251133432267633534555556338 Q 537021.9.peg.3 23 IIPVIAPIAPGYDGATYNINADTFAGAIAEKLNVIRLLFLTDVPGVLDKNRQLISKLS 80 (142) Q Consensus 23 ~iPVi~pig~~~~G~~~nin~D~~a~~lA~~l~A~kli~ltdv~Gv~d~~~~~i~~i~ 80 (142) ++|+-|.+-....+..+++.+ ..--+..++.+=|+++.++. ||.+|++++..- T Consensus 8 YVPvYS~Iy~~~~~~~~~Lt~----tLSiRNtd~~~pi~i~~v~Y-ydt~Gklvr~yl 60 (126) T pfam11322 8 YVPVYSHIYSGPEHRPFNLTA----TLSIRNTDPSQPIYITSIDY-YDTDGKLVRTYL 60 (126) T ss_pred EEEEEEEEEECCCCCEEEEEE----EEEEECCCCCCCEEEEEEEE-ECCCCCCHHHHH T ss_conf 998677999869986086899----99997499999889999888-869982878752 Done!