Query 537021.9.peg.393_1 Match_columns 150 No_of_seqs 118 out of 1509 Neff 5.8 Searched_HMMs 39220 Date Wed May 25 05:33:15 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i peg_393.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 CHL00202 argB acetylglutamate 100.0 0 0 322.1 15.0 143 7-149 5-148 (284) 2 cd04250 AAK_NAGK-C AAK_NAGK-C: 100.0 6.8E-43 0 296.1 15.9 138 12-149 1-145 (279) 3 PRK05279 N-acetylglutamate syn 100.0 8E-42 0 289.2 14.5 142 6-149 6-155 (441) 4 cd04237 AAK_NAGS-ABP AAK_NAGS- 100.0 4.9E-41 1.4E-45 284.1 15.5 138 10-149 3-148 (280) 5 COG0548 ArgB Acetylglutamate k 100.0 3.2E-39 8.1E-44 272.5 13.7 126 24-149 1-130 (265) 6 TIGR00761 argB acetylglutamate 100.0 1.3E-37 3.3E-42 262.2 12.0 123 27-149 1-141 (254) 7 TIGR01890 N-Ac-Glu-synth amino 100.0 7.4E-37 1.9E-41 257.3 11.2 139 10-150 2-148 (439) 8 cd04249 AAK_NAGK-NC AAK_NAGK-N 100.0 2.4E-34 6.2E-39 241.2 11.5 122 28-149 1-123 (252) 9 PRK00942 acetylglutamate kinas 100.0 2.3E-33 6E-38 234.9 13.1 123 27-149 1-130 (262) 10 PRK04531 acetylglutamate kinas 100.0 6.1E-33 1.6E-37 232.2 13.7 131 13-149 4-139 (421) 11 cd04238 AAK_NAGK-like AAK_NAGK 100.0 7.9E-32 2E-36 225.1 13.3 122 28-149 1-125 (256) 12 cd04252 AAK_NAGK-fArgBP AAK_NA 100.0 4.9E-30 1.2E-34 213.6 13.1 115 29-149 2-117 (248) 13 cd04251 AAK_NAGK-UC AAK_NAGK-U 100.0 2.6E-28 6.8E-33 202.5 11.4 115 28-149 1-132 (257) 14 KOG2436 consensus 99.9 2.6E-24 6.6E-29 176.9 10.9 141 8-149 77-223 (520) 15 cd04236 AAK_NAGS-Urea AAK_NAGS 99.9 3E-21 7.6E-26 157.3 11.8 128 3-149 6-140 (271) 16 cd02115 AAK Amino Acid Kinases 99.8 4.5E-18 1.2E-22 136.9 11.6 114 29-148 1-114 (248) 17 pfam00696 AA_kinase Amino acid 99.6 1.6E-14 4.1E-19 114.2 11.3 109 26-137 1-109 (230) 18 PRK12353 carbamate kinase; Rev 99.3 2.2E-11 5.5E-16 94.1 11.7 94 25-120 2-104 (312) 19 cd04241 AAK_FomA-like AAK_FomA 99.0 5.7E-09 1.5E-13 78.5 10.9 99 27-132 1-107 (252) 20 PRK12686 carbamate kinase; Rev 98.7 3.1E-07 7.8E-12 67.4 11.6 96 24-121 2-106 (313) 21 cd04235 AAK_CK AAK_CK: Carbama 98.7 4.9E-07 1.2E-11 66.2 11.4 93 27-125 1-107 (308) 22 PRK12454 carbamate kinase-like 98.3 4.6E-06 1.2E-10 59.9 7.7 92 25-121 2-107 (313) 23 cd04254 AAK_UMPK-PyrH-Ec UMP k 98.1 4E-05 1E-09 53.9 10.0 92 26-129 1-101 (231) 24 PRK00358 pyrH uridylate kinase 98.1 7.1E-05 1.8E-09 52.3 11.1 90 26-127 1-99 (231) 25 cd04256 AAK_P5CS_ProBA AAK_P5C 98.0 3E-05 7.7E-10 54.6 8.1 47 24-70 7-60 (284) 26 cd04239 AAK_UMPK-like AAK_UMPK 98.0 0.00011 2.8E-09 51.1 10.3 88 28-126 2-96 (229) 27 PRK12314 gamma-glutamyl kinase 97.8 0.00017 4.4E-09 49.8 9.1 46 24-69 8-59 (265) 28 COG1608 Predicted archaeal kin 97.8 7.8E-05 2E-09 52.0 7.0 100 28-134 3-115 (252) 29 PRK12352 putative carbamate ki 97.7 0.00018 4.5E-09 49.7 7.6 87 25-116 2-99 (316) 30 cd04242 AAK_G5K_ProB AAK_G5K_P 97.7 0.00021 5.3E-09 49.3 7.8 51 28-80 2-58 (251) 31 PRK09411 carbamate kinase; Rev 97.7 0.00075 1.9E-08 45.7 10.1 87 26-121 2-99 (297) 32 TIGR00746 arcC carbamate kinas 97.6 0.00017 4.3E-09 49.9 6.2 94 26-125 1-110 (321) 33 COG0549 ArcC Carbamate kinase 97.5 0.0012 3E-08 44.5 9.3 86 26-116 1-97 (312) 34 cd04261 AAK_AKii-LysC-BS AAK_A 97.5 0.0019 4.9E-08 43.1 9.8 98 27-132 1-101 (239) 35 PRK12354 carbamate kinase; Rev 97.2 0.0021 5.2E-08 42.9 7.9 82 27-117 2-94 (302) 36 cd04246 AAK_AK-DapG-like AAK_A 97.2 0.0084 2.1E-07 39.0 10.3 96 27-128 1-97 (239) 37 cd04253 AAK_UMPK-PyrH-Pf AAK_U 97.2 0.0015 3.9E-08 43.7 6.5 44 28-72 2-48 (221) 38 COG0528 PyrH Uridylate kinase 97.0 0.014 3.5E-07 37.6 10.2 97 23-130 3-107 (238) 39 PRK08841 aspartate kinase; Val 96.7 0.024 6.1E-07 36.1 9.6 99 26-131 2-102 (392) 40 PRK07431 aspartate kinase; Pro 96.4 0.05 1.3E-06 34.0 9.6 102 26-133 2-104 (594) 41 PRK08210 aspartate kinase I; R 96.4 0.08 2.1E-06 32.7 10.7 100 26-132 2-108 (405) 42 cd04240 AAK_UC AAK_UC: Unchara 96.4 0.0038 9.8E-08 41.2 3.6 41 29-74 1-41 (203) 43 cd04260 AAK_AKi-DapG-BS AAK_AK 96.4 0.068 1.7E-06 33.2 9.9 99 27-132 1-106 (244) 44 TIGR02076 pyrH_arch uridylate 96.2 0.0059 1.5E-07 40.0 3.7 75 28-114 1-86 (232) 45 PRK05429 gamma-glutamyl kinase 96.1 0.11 2.7E-06 31.9 10.0 56 24-81 7-68 (372) 46 PRK13402 gamma-glutamyl kinase 96.1 0.019 4.8E-07 36.7 6.0 55 25-81 5-65 (363) 47 COG0263 ProB Glutamate 5-kinas 96.0 0.064 1.6E-06 33.3 8.5 89 25-123 6-102 (369) 48 PRK12443 uridylate kinase; Rev 95.9 0.024 6.1E-07 36.1 5.9 91 26-128 5-105 (247) 49 TIGR01027 proB glutamate 5-kin 95.9 0.07 1.8E-06 33.1 8.2 86 26-121 1-96 (379) 50 PRK06635 aspartate kinase; Rev 95.9 0.1 2.6E-06 32.0 8.9 100 26-132 2-103 (402) 51 TIGR02075 pyrH_bact uridylate 95.8 0.2 5.2E-06 30.1 11.1 100 26-137 2-112 (236) 52 COG0527 LysC Aspartokinases [A 95.6 0.18 4.7E-06 30.4 9.5 107 27-134 3-150 (447) 53 COG5630 ARG2 Acetylglutamate s 95.3 0.013 3.2E-07 37.9 2.7 124 14-148 125-252 (495) 54 cd04234 AAK_AK AAK_AK: Amino A 95.3 0.22 5.5E-06 29.9 9.0 39 27-66 1-39 (227) 55 cd07041 STAS_RsbR_RsbS_like Su 95.0 0.18 4.5E-06 30.5 7.7 77 5-82 20-97 (109) 56 KOG1154 consensus 94.2 0.078 2E-06 32.8 4.3 98 24-124 8-118 (285) 57 PRK06291 aspartate kinase; Pro 92.6 1 2.5E-05 25.7 8.9 50 27-76 2-51 (466) 58 cd04259 AAK_AK-DapDC AAK_AK-Da 89.7 1.1 2.7E-05 25.5 5.6 47 28-74 2-48 (295) 59 cd06844 STAS Sulphate Transpor 87.9 2.5 6.5E-05 23.1 7.5 74 7-81 20-94 (100) 60 cd04244 AAK_AK-LysC-like AAK_A 87.6 1.6 4E-05 24.4 5.3 48 28-76 2-49 (298) 61 pfam00752 XPG_N XPG N-terminal 87.3 1.7 4.2E-05 24.3 5.3 52 20-71 20-85 (100) 62 PRK08961 bifunctional aspartat 86.7 1.9 5E-05 23.8 5.4 54 23-76 5-59 (865) 63 COG1366 SpoIIAA Anti-anti-sigm 84.0 4 0.0001 21.8 8.2 76 7-83 25-101 (117) 64 PRK00192 mannosyl-3-phosphogly 82.5 3.4 8.7E-05 22.2 5.1 67 24-90 5-74 (275) 65 smart00485 XPGN Xeroderma pigm 81.9 4.2 0.00011 21.7 5.4 52 20-71 19-84 (99) 66 TIGR00657 asp_kinases aspartat 79.8 4.1 0.0001 21.8 4.7 52 26-77 2-55 (504) 67 cd04243 AAK_AK-HSDH-like AAK_A 77.9 5.9 0.00015 20.7 5.1 47 28-76 2-48 (293) 68 cd04257 AAK_AK-HSDH AAK_AK-HSD 77.7 6.2 0.00016 20.6 5.1 48 28-76 2-49 (294) 69 TIGR01283 nifE nitrogenase MoF 77.5 6.6 0.00017 20.4 5.7 72 9-91 321-392 (470) 70 PRK08373 aspartate kinase; Val 75.5 7.4 0.00019 20.1 5.1 100 24-127 2-132 (359) 71 pfam01740 STAS STAS domain. Th 75.3 7.5 0.00019 20.1 7.3 73 10-83 24-97 (106) 72 cd04258 AAK_AKiii-LysC-EC AAK_ 75.0 6.7 0.00017 20.4 4.7 45 27-74 1-45 (292) 73 cd04247 AAK_AK-Hom3 AAK_AK-Hom 72.3 8.9 0.00023 19.6 4.9 38 27-66 2-39 (306) 74 COG0474 MgtA Cation transport 71.8 9.1 0.00023 19.5 7.5 100 29-130 536-649 (917) 75 pfam10686 DUF2493 Protein of u 70.5 9.7 0.00025 19.4 5.5 52 26-80 3-59 (71) 76 COG0561 Cof Predicted hydrolas 68.8 11 0.00027 19.1 5.6 102 25-128 3-109 (264) 77 TIGR00268 TIGR00268 conserved 68.7 9.6 0.00025 19.4 4.3 109 14-136 3-131 (263) 78 PRK05925 aspartate kinase; Pro 67.5 11 0.00028 19.0 9.5 104 26-136 2-148 (440) 79 cd04255 AAK_UMPK-MosAB AAK_UMP 67.2 7.4 0.00019 20.1 3.5 85 26-123 31-121 (262) 80 PRK04596 minC septum formation 67.0 11 0.00029 18.9 6.5 59 21-81 46-104 (248) 81 TIGR01092 P5CS delta l-pyrroli 64.6 10 0.00027 19.2 3.9 45 25-69 7-58 (738) 82 PTZ00217 flap endonuclease-1; 64.5 13 0.00033 18.6 5.2 50 20-69 23-92 (394) 83 TIGR01116 ATPase-IIA1_Ca calci 63.5 5.9 0.00015 20.8 2.4 95 4-99 418-538 (800) 84 cd04245 AAK_AKiii-YclM-BS AAK_ 63.1 13 0.00032 18.6 4.1 35 28-64 2-36 (288) 85 PRK00009 phosphoenolpyruvate c 63.1 14 0.00035 18.4 4.8 65 26-93 568-647 (916) 86 KOG2518 consensus 60.0 9.7 0.00025 19.3 3.0 49 19-69 18-82 (556) 87 PRK03868 glucose-6-phosphate i 60.0 15 0.00039 18.1 8.9 89 10-102 42-135 (409) 88 PRK10444 UMP phosphatase; Prov 58.4 15 0.00038 18.1 3.8 55 26-80 2-60 (248) 89 cd01979 Pchlide_reductase_N Pc 58.3 16 0.00042 17.9 5.0 77 7-95 258-334 (396) 90 TIGR00600 rad2 DNA excision re 57.8 11 0.00027 19.1 2.9 62 33-98 46-113 (1127) 91 pfam06953 ArsD Arsenical resis 57.4 15 0.00038 18.2 3.6 67 38-137 22-88 (119) 92 PRK10530 phosphotransferase; P 57.4 17 0.00043 17.8 4.3 97 26-122 4-105 (272) 93 cd00128 XPG Xeroderma pigmento 57.1 17 0.00044 17.8 5.4 50 20-69 19-82 (316) 94 PRK00973 glucose-6-phosphate i 57.0 17 0.00044 17.7 6.8 87 12-99 61-154 (454) 95 TIGR01292 TRX_reduct thioredox 56.3 18 0.00045 17.7 4.1 36 21-65 147-182 (321) 96 cd03141 GATase1_Hsp31_like Typ 56.2 18 0.00045 17.7 6.8 54 82-135 146-205 (221) 97 TIGR00014 arsC arsenate reduct 55.5 6.5 0.00017 20.4 1.5 10 59-68 92-101 (114) 98 TIGR01465 cobM_cbiF precorrin- 55.1 18 0.00047 17.6 11.7 126 5-135 49-203 (252) 99 pfam08282 Hydrolase_3 haloacid 54.5 19 0.00048 17.5 4.1 26 10-35 39-64 (254) 100 PRK13385 2-C-methyl-D-erythrit 53.9 18 0.00046 17.6 3.6 59 9-72 54-119 (238) 101 cd01976 Nitrogenase_MoFe_alpha 53.7 19 0.0005 17.4 7.2 67 5-80 280-349 (421) 102 PRK09444 pntB pyridine nucleot 50.2 22 0.00056 17.1 4.2 39 45-83 291-339 (459) 103 COG2054 Uncharacterized archae 49.7 21 0.00053 17.2 3.3 73 29-113 3-77 (212) 104 TIGR00453 ispD 2-C-methyl-D-er 48.6 19 0.00049 17.5 3.0 34 27-65 70-104 (226) 105 PTZ00114 Heat shock protein 60 47.9 24 0.00061 16.8 7.8 113 18-134 397-519 (580) 106 PRK00694 4-hydroxy-3-methylbut 47.6 23 0.00059 16.9 3.3 53 44-96 45-104 (606) 107 pfam02233 PNTB NAD(P) transhyd 47.3 24 0.00062 16.8 4.2 43 45-87 296-352 (464) 108 pfam01973 MAF_flag10 Protein o 46.6 25 0.00064 16.7 3.8 27 10-36 33-59 (169) 109 PTZ00063 histone deacetylase; 46.4 25 0.00064 16.7 6.1 118 11-133 237-390 (439) 110 TIGR00656 asp_kin_monofn aspar 45.4 26 0.00067 16.6 4.5 54 25-78 1-54 (480) 111 cd01974 Nitrogenase_MoFe_beta 45.2 26 0.00067 16.6 7.9 65 3-76 281-347 (435) 112 TIGR00705 SppA_67K signal pept 45.2 15 0.00038 18.2 2.0 75 33-126 371-454 (614) 113 PRK08605 D-lactate dehydrogena 45.0 27 0.00068 16.5 3.5 36 51-86 63-100 (332) 114 TIGR02078 AspKin_pair Pyrococc 44.0 27 0.0007 16.4 3.8 46 27-77 1-48 (341) 115 PRK10637 cysG siroheme synthas 42.7 29 0.00073 16.3 5.4 69 15-86 254-329 (457) 116 TIGR02472 sucr_P_syn_N sucrose 42.4 15 0.00039 18.0 1.8 78 35-122 348-427 (445) 117 COG1211 IspD 4-diphosphocytidy 42.1 29 0.00075 16.3 3.4 20 55-74 70-89 (230) 118 cd03134 GATase1_PfpI_like A ty 41.9 30 0.00075 16.2 5.7 67 47-136 84-151 (165) 119 COG2378 Predicted transcriptio 41.0 25 0.00065 16.7 2.7 80 8-100 5-87 (311) 120 PRK10017 putative pyruvyl tran 40.9 30 0.00077 16.2 3.0 51 19-71 111-163 (426) 121 pfam00850 Hist_deacetyl Histon 40.4 31 0.0008 16.1 7.4 62 12-73 218-290 (302) 122 PRK10513 sugar phosphatase; Pr 40.0 32 0.00081 16.1 5.2 65 26-90 4-75 (270) 123 PRK09034 aspartate kinase; Rev 39.9 32 0.00081 16.0 7.6 34 28-63 2-35 (450) 124 cd03147 GATase1_Ydr533c_like T 37.9 34 0.00087 15.9 5.6 102 29-135 98-215 (231) 125 cd01427 HAD_like Haloacid deha 36.9 33 0.00083 16.0 2.7 43 41-83 24-67 (139) 126 cd05015 SIS_PGI_1 Phosphogluco 36.1 36 0.00093 15.7 9.4 111 13-131 4-122 (158) 127 PRK11440 hypothetical protein; 35.6 17 0.00043 17.8 1.1 93 38-131 29-133 (188) 128 TIGR03674 fen_arch flap struct 35.5 37 0.00095 15.6 5.7 28 41-69 57-84 (338) 129 KOG0339 consensus 34.8 38 0.00097 15.5 6.2 73 21-101 292-364 (731) 130 PRK00155 ispD 2-C-methyl-D-ery 34.6 38 0.00098 15.5 3.6 14 59-72 98-112 (228) 131 TIGR01508 rib_reduct_arch diam 34.4 39 0.00099 15.5 4.0 63 18-87 116-186 (224) 132 PRK05301 pyrroloquinoline quin 34.0 39 0.001 15.5 7.8 67 14-80 105-185 (375) 133 TIGR01245 trpD anthranilate ph 33.9 39 0.001 15.4 3.4 63 39-106 192-279 (331) 134 PRK05991 precorrin-3B C17-meth 33.9 39 0.001 15.4 5.0 71 9-86 28-113 (254) 135 PRK00339 minC septum formation 33.3 40 0.001 15.4 6.2 61 20-82 47-107 (249) 136 PRK13639 cbiO cobalt transport 33.2 40 0.001 15.4 4.9 13 46-58 224-236 (275) 137 TIGR02090 LEU1_arch isopropylm 32.6 17 0.00043 17.8 0.7 14 66-79 321-334 (371) 138 PRK06136 uroporphyrin-III C-me 32.3 42 0.0011 15.3 6.5 92 24-120 79-190 (253) 139 PRK09084 aspartate kinase III; 32.3 42 0.0011 15.3 5.2 37 27-66 1-37 (447) 140 KOG4701 consensus 32.2 42 0.0011 15.3 3.1 51 17-68 58-114 (568) 141 PRK10517 magnesium-transportin 31.1 44 0.0011 15.2 7.9 89 37-127 545-643 (900) 142 COG0547 TrpD Anthranilate phos 30.9 44 0.0011 15.1 3.6 65 38-105 197-284 (338) 143 cd01987 USP_OKCHK USP domain i 30.8 44 0.0011 15.1 8.4 103 6-114 9-115 (124) 144 PRK09436 thrA bifunctional asp 30.1 45 0.0012 15.0 5.1 47 28-75 2-48 (817) 145 PRK10976 putative sugar phosph 29.9 46 0.0012 15.0 5.4 66 26-91 3-72 (266) 146 pfam11869 DUF3389 Protein of u 29.9 46 0.0012 15.0 4.6 35 28-68 17-51 (75) 147 KOG0456 consensus 29.7 46 0.0012 15.0 5.7 47 25-73 78-124 (559) 148 TIGR01369 CPSaseII_lrg carbamo 29.6 46 0.0012 15.0 3.7 63 19-86 948-1013(1089) 149 pfam01872 RibD_C RibD C-termin 29.3 47 0.0012 15.0 3.2 31 46-76 119-149 (196) 150 TIGR00329 gcp metalloendopepti 28.8 48 0.0012 14.9 3.0 62 34-99 159-225 (337) 151 PRK06859 consensus 28.2 49 0.0012 14.8 6.4 47 70-119 106-152 (253) 152 KOG3517 consensus 27.8 50 0.0013 14.8 2.4 71 31-101 10-90 (334) 153 PRK06278 cobyrinic acid a,c-di 27.8 34 0.00087 15.9 1.5 17 20-36 266-282 (482) 154 pfam07521 RMMBL RNA-metabolisi 27.6 50 0.0013 14.8 3.1 33 27-66 9-41 (43) 155 cd02871 GH18_chitinase_D-like 27.4 51 0.0013 14.8 3.4 24 44-67 60-83 (312) 156 PRK12852 groEL chaperonin GroE 27.1 51 0.0013 14.7 6.7 109 23-133 372-488 (542) 157 KOG0202 consensus 27.1 51 0.0013 14.7 4.1 112 10-126 557-684 (972) 158 PRK09451 glmU bifunctional N-a 26.8 52 0.0013 14.7 2.9 38 9-48 82-119 (456) 159 PRK11391 etp phosphotyrosine-p 26.7 52 0.0013 14.7 2.9 69 26-95 3-74 (144) 160 TIGR01737 FGAM_synth_I phospho 26.6 52 0.0013 14.7 3.0 84 26-125 2-100 (264) 161 TIGR00099 Cof-subfamily Cof-li 26.4 53 0.0013 14.6 7.4 112 28-139 2-126 (270) 162 TIGR03590 PseG pseudaminic aci 26.3 53 0.0013 14.6 6.2 99 25-130 171-273 (280) 163 COG3956 Protein containing tet 25.7 54 0.0014 14.5 6.3 72 20-97 77-148 (488) 164 TIGR01249 pro_imino_pep_1 prol 25.6 54 0.0014 14.5 2.6 55 45-101 242-305 (310) 165 cd07022 S49_Sppa_36K_type Sign 25.6 55 0.0014 14.5 5.0 78 67-146 115-202 (214) 166 KOG1533 consensus 25.1 56 0.0014 14.5 4.3 107 11-123 86-231 (290) 167 TIGR02824 quinone_pig3 putativ 24.9 43 0.0011 15.2 1.6 28 51-78 163-190 (334) 168 TIGR01326 OAH_OAS_sulfhy O-ace 24.9 42 0.0011 15.3 1.6 51 44-97 83-138 (434) 169 cd01013 isochorismatase Isocho 24.8 56 0.0014 14.4 3.9 91 40-131 53-154 (203) 170 PRK12851 groEL chaperonin GroE 24.5 57 0.0015 14.4 6.9 109 21-133 370-487 (541) 171 cd02516 CDP-ME_synthetase CDP- 24.3 58 0.0015 14.4 5.5 48 18-70 61-109 (218) 172 PRK05802 hypothetical protein; 24.2 58 0.0015 14.4 6.1 73 9-87 155-233 (328) 173 PRK09181 aspartate kinase; Val 23.9 59 0.0015 14.3 3.8 39 26-67 3-45 (476) 174 pfam02579 Nitro_FeMo-Co Dinitr 23.8 59 0.0015 14.3 3.5 32 52-83 48-79 (94) 175 KOG0356 consensus 23.7 59 0.0015 14.3 5.7 82 18-103 381-463 (550) 176 PRK13561 putative phosphodiest 23.4 60 0.0015 14.3 7.6 63 10-74 500-565 (651) 177 TIGR03343 biphenyl_bphD 2-hydr 23.3 30 0.00078 16.2 0.6 65 11-76 48-120 (282) 178 PTZ00286 6-phospho-1-fructokin 23.2 61 0.0015 14.2 3.4 82 50-135 180-280 (486) 179 pfam01075 Glyco_transf_9 Glyco 23.0 61 0.0016 14.2 7.8 114 12-128 94-214 (249) 180 PRK05569 flavodoxin; Provision 22.9 61 0.0016 14.2 5.5 70 26-100 50-134 (141) 181 TIGR01250 pro_imino_pep_2 prol 22.7 44 0.0011 15.1 1.4 30 57-87 25-63 (302) 182 TIGR00963 secA preprotein tran 22.6 62 0.0016 14.2 4.6 45 40-85 418-466 (904) 183 cd03466 Nitrogenase_NifN_2 Nit 22.5 62 0.0016 14.2 7.8 105 3-122 278-394 (429) 184 KOG1342 consensus 22.5 62 0.0016 14.2 5.2 57 16-72 244-306 (425) 185 cd03148 GATase1_EcHsp31_like T 22.3 63 0.0016 14.1 5.2 100 29-135 100-215 (232) 186 pfam02593 DUF166 Uncharacteriz 22.3 63 0.0016 14.1 5.1 106 11-135 35-145 (215) 187 cd01425 RPS2 Ribosomal protein 22.1 63 0.0016 14.1 5.4 116 10-130 13-162 (193) 188 cd04248 AAK_AK-Ectoine AAK_AK- 22.1 64 0.0016 14.1 3.5 15 28-42 2-16 (304) 189 COG2217 ZntA Cation transport 22.1 64 0.0016 14.1 6.4 80 18-100 507-594 (713) 190 PRK01158 phosphoglycolate phos 22.1 64 0.0016 14.1 5.7 67 26-92 4-74 (226) 191 cd07042 STAS_SulP_like_sulfate 21.1 67 0.0017 14.0 7.5 74 9-83 23-98 (107) 192 COG0648 Nfo Endonuclease IV [D 21.0 67 0.0017 14.0 3.7 28 41-68 84-111 (280) 193 cd04895 ACT_ACR_1 ACT domain-c 20.9 67 0.0017 14.0 2.0 19 85-103 51-69 (72) 194 PRK01096 deoxyguanosinetriphos 20.6 68 0.0017 13.9 2.7 35 44-79 100-134 (442) 195 COG4148 ModC ABC-type molybdat 20.5 61 0.0015 14.2 1.7 50 1-54 152-203 (352) 196 PRK10528 multifunctional acyl- 20.1 70 0.0018 13.9 6.0 54 10-63 57-114 (191) No 1 >CHL00202 argB acetylglutamate kinase; Provisional Probab=100.00 E-value=0 Score=322.08 Aligned_cols=143 Identities=40% Similarity=0.724 Sum_probs=139.4 Q ss_pred HHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECC Q ss_conf 22062289999999987598899986756724746899999999998518971799845724578899970898115578 Q 537021.9.peg.3 7 QFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENG 86 (150) Q Consensus 7 ~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G 86 (150) .+++.|||||+|||++||||||||||||++|.|++++.+|++||++|+++|+|+||||||||||+++|+++|++|+|++| T Consensus 5 ~e~v~~lrea~pYi~~~rgktfVIk~gG~~~~d~~l~~~~~~DialL~~lGik~VlVHGgg~qI~~~l~~~gi~~~f~~G 84 (284) T CHL00202 5 DERVQVLSEALPYIQKFRGRIMVIKYGGAAMKNLILKADIIKDILFLSCIGLKIVVVHGGGPEINFWLKQLNISPKFWNG 84 (284) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEEEECCEEEECHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCCCCCC T ss_conf 89999999977999996899899998966663754899999999999988997999889966899999976998436288 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECC-CCCCCCCCCCC Q ss_conf 3466124899999997416689999999828997687786558989999747-52111799510 Q 537021.9.peg.3 87 LRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKAR-HSLRLSPNTKK 149 (150) Q Consensus 87 ~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~-~~lg~vg~~~k 149 (150) +||||+++|++++|+|+|++|++|++.|+++|.+|+|++|.|||+++|+|.+ .++|++|++++ T Consensus 85 ~RvTd~~~l~vv~~vl~g~vn~~lv~~l~~~g~~a~gl~g~dg~~i~A~~~~~~D~g~vG~V~~ 148 (284) T CHL00202 85 IRVTDKVTMEIVEMVLAGKVNKDLVGSINANGGKAVGLCGKDANLIVARASDKKDLGLVGEIQQ 148 (284) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEECCCCCEEEEECCCCCCCCCCCCCEE T ss_conf 3218878999999998778999999999856995055312557669983589877444444123 No 2 >cd04250 AAK_NAGK-C AAK_NAGK-C: N-Acetyl-L-glutamate kinase - cyclic (NAGK-C) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in some bacteria and photosynthetic organisms using the non-acetylated, cyclic route of ornithine biosynthesis. In this pathway, glutamate is first N-acetylated and then phosphorylated by NAGK to give phosphoryl NAG, which is converted to NAG-ornithine. There are two variants of this pathway. In one, typified by the pathway in Thermotoga maritima and Pseudomonas aeruginosa, the acetyl group is recycled by reversible transacetylation from acetylornithine to glutamate. The phosphorylation of NAG by NAGK is feedback inhibited by arginine. In photosynthetic organisms, NAGK is the target of the nitrogen-signaling protein PII. Hexameric formation of NAGK domains appears to be essential to both arginine inhibition and NAGK-PII complex formation. NAGK-C are members of the Amino A Probab=100.00 E-value=6.8e-43 Score=296.07 Aligned_cols=138 Identities=49% Similarity=0.869 Sum_probs=133.6 Q ss_pred HHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCC Q ss_conf 28999999998759889998675672474689999999999851897179984572457889997089811557834661 Q 537021.9.peg.3 12 ILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITD 91 (150) Q Consensus 12 ~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd 91 (150) +||||+|||++|||||||||+||+++.|++++.+|++||++|+.+|+++||||||||||+++|+++|++|+|++|+|||| T Consensus 1 ~l~ea~PYi~~~rgktiVIklGG~~l~d~~~~~~~~~dI~~L~~~G~~vVlVHGggpqi~~~l~~~gi~~~~~~G~RvT~ 80 (279) T cd04250 1 VLIEALPYIQKFRGKTVVIKYGGNAMKDEELKESFARDIVLLKYVGINPVVVHGGGPEINEMLKKLGIESEFVNGLRVTD 80 (279) T ss_pred CHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf 90466678998589989999890784697799999999999998899299996996799999998699842237743573 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECC-------CCCCCCCCCCC Q ss_conf 24899999997416689999999828997687786558989999747-------52111799510 Q 537021.9.peg.3 92 QQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKAR-------HSLRLSPNTKK 149 (150) Q Consensus 92 ~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~-------~~lg~vg~~~k 149 (150) +++|++++|+|+|++|++|++.|+++|++|+|+||.|+|+++|+|+. .++|++|++++ T Consensus 81 ~~~l~iv~~~l~G~vn~~lv~~l~~~g~~avglsg~dg~~i~a~~~~~~~~~~~~D~g~vG~v~~ 145 (279) T cd04250 81 EETMEIVEMVLVGKVNKEIVSLINRAGGKAVGLSGKDGNLIKAKKKDATVIEEIIDLGFVGEVTE 145 (279) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHCCCCEEECCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCC T ss_conf 88899999998561469999999973998546226668669999755555555665664566552 No 3 >PRK05279 N-acetylglutamate synthase; Validated Probab=100.00 E-value=8e-42 Score=289.21 Aligned_cols=142 Identities=27% Similarity=0.406 Sum_probs=129.2 Q ss_pred HHHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEEC Q ss_conf 32206228999999998759889998675672474689999999999851897179984572457889997089811557 Q 537021.9.peg.3 6 YQFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFEN 85 (150) Q Consensus 6 ~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~ 85 (150) ..+=..|||||+|||++||||||||||||++|.|+++ .+|++||++|+++|+++||||||||||+++|++.|++|+|++ T Consensus 6 ~~~fv~~fRea~PYI~~frgktfVI~~gG~a~~~~~~-~~~~~DIaLL~slGIr~VlVHGggpqI~~~L~~~gi~s~f~~ 84 (441) T PRK05279 6 TTEFVDWFRHSAPYINAHRGKTFVIMLGGEAIAHGNF-SNIVNDIALLHSLGIRLVLVHGARPQIEEQLAAHGIEPRYHK 84 (441) T ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEEECCEECCCCCH-HHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCEEEC T ss_conf 1689999985137789858998999989720368358-999999999986897699987997788999997699855139 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC----CEEEEEEEECCEEEEEECC----CCCCCCCCCCC Q ss_conf 8346612489999999741668999999982899----7687786558989999747----52111799510 Q 537021.9.peg.3 86 GLRITDQQTAEVVEMVLAGSINKKIVSLINQTGT----QAIGICGKDGNMVFAEKAR----HSLRLSPNTKK 149 (150) Q Consensus 86 G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~----~avglsg~dg~li~a~~~~----~~lg~vg~~~k 149 (150) |+||||++||++++|+ +|++|.+|++.|++... .+.++++++||+++|+|++ .++|++|++++ T Consensus 85 G~RVTD~~tl~~v~~v-~G~v~~~I~a~Ls~gl~nspm~g~~i~v~sGn~i~A~p~Gv~dGvD~g~vG~V~~ 155 (441) T PRK05279 85 GLRITDAAALECVKQA-AGELRLDIEARLSMGLPNTPMAGAHIRVVSGNFVTARPLGVDDGVDYCHTGEVRR 155 (441) T ss_pred CCCCCCHHHHHHHHHH-HHHHHHHHHHHHHCCCCCCCCCCCCEEEECCCEEEEEECCCCCCCCCCCCCEEEE T ss_conf 9136999999999999-9999999999983436787555772467548668987635313756675415879 No 4 >cd04237 AAK_NAGS-ABP AAK_NAGS-ABP: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the arginine-biosynthesis pathway (ABP) found in gamma- and beta-proteobacteria and higher plant chloroplasts. Domain architecture of these NAGS consisted of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal NAG synthase, acetyltransferase (ArgA) domain. Both bacterial and plant sequences in this CD have a conserved N-terminal extension; a similar sequence in the NAG kinases of the cyclic arginine-biosynthesis pathway has been implicated in feedback inhibition sensing. Plant sequences also have an N-terminal chloroplast transit peptide and an insert (approx. 70 residues) in the C-terminal region of ArgB. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK). Probab=100.00 E-value=4.9e-41 Score=284.15 Aligned_cols=138 Identities=28% Similarity=0.471 Sum_probs=126.0 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCC Q ss_conf 62289999999987598899986756724746899999999998518971799845724578899970898115578346 Q 537021.9.peg.3 10 AEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRI 89 (150) Q Consensus 10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RV 89 (150) ..|||||+|||++||||||||||||++|+|++ +++|++||++|+++|++|||||||||||+++|+++|++|+|++|+|| T Consensus 3 v~~~rea~pYi~~f~~kt~VIK~GG~~~~~~~-l~~~~~di~lL~~~Gi~~VlVHGGG~~I~~~l~~~gi~~~fv~G~RV 81 (280) T cd04237 3 VDWFREAAPYINAHRGKTFVIAFGGEAVAHPN-FDNIVHDIALLHSLGIRLVLVHGARPQIDQRLAERGLEPRYHRGLRI 81 (280) T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCHHHCCHH-HHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCEEECCCCCC T ss_conf 78888867999986899899998963307744-99999999999988996999928986779999986997462178777 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHC----CCCEEEEEEEECCEEEEEECC----CCCCCCCCCCC Q ss_conf 6124899999997416689999999828----997687786558989999747----52111799510 Q 537021.9.peg.3 90 TDQQTAEVVEMVLAGSINKKIVSLINQT----GTQAIGICGKDGNMVFAEKAR----HSLRLSPNTKK 149 (150) Q Consensus 90 Td~~tl~iv~~vL~G~vn~~lv~~L~~~----g~~avglsg~dg~li~a~~~~----~~lg~vg~~~k 149 (150) ||+++|++++++ .|.+|++|++.|++. |..+.++++.+||++.|+|++ .++|+||++++ T Consensus 82 Td~~tl~vv~~~-~G~vn~~i~a~ls~g~~~~g~~g~~~~~~~g~~i~a~p~g~~dgiD~G~vG~v~~ 148 (280) T cd04237 82 TDAAALECVKEA-AGAVRLEIEALLSMGLPNSPMAGARIRVVSGNFVTARPLGVVDGVDFGHTGEVRR 148 (280) T ss_pred CCHHHHHHHHHH-HHHHHHHHHHHHHHCCCCCCCCCCCCEEECCCEEEEEECCCCCCCCCCCCCCCCE T ss_conf 999999999999-9999999999997053435653557234058658988865314744467983026 No 5 >COG0548 ArgB Acetylglutamate kinase [Amino acid transport and metabolism] Probab=100.00 E-value=3.2e-39 Score=272.53 Aligned_cols=126 Identities=49% Similarity=0.802 Sum_probs=123.4 Q ss_pred CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHH Q ss_conf 59889998675672474689999999999851897179984572457889997089811557834661248999999974 Q 537021.9.peg.3 24 ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLA 103 (150) Q Consensus 24 r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~ 103 (150) ++||+||||||++|.|++++++|++||++|+..|+++|+|||||||||++|+++|++|+|++|+||||++||++++|||+ T Consensus 1 ~~k~~VIK~GG~~~~~~~l~~~~~~di~lL~~~G~~~VvVHGggp~I~~~l~~~gie~~f~~glRvTd~~tlevv~mvl~ 80 (265) T COG0548 1 RGKTIVIKLGGSAMEDENLLEAFASDIALLKSVGIRPVVVHGGGPQIDEMLAKLGIEPEFVKGLRVTDAETLEVVEMVLG 80 (265) T ss_pred CCCEEEEEECCEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCEECCCHHHHHHHHHHHH T ss_conf 99669999894644170278999999999997799479992895688899997699976349777679899999999998 Q ss_pred HHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECCCC----CCCCCCCCC Q ss_conf 1668999999982899768778655898999974752----111799510 Q 537021.9.peg.3 104 GSINKKIVSLINQTGTQAIGICGKDGNMVFAEKARHS----LRLSPNTKK 149 (150) Q Consensus 104 G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~~~----lg~vg~~~k 149 (150) |++|++|++.|+++|++|+|+||.|||+++|+|++++ +|+||++++ T Consensus 81 G~vNk~iva~l~~~g~~avGlsg~Dg~li~A~~~~~~~~id~g~vG~i~~ 130 (265) T COG0548 81 GTVNKEIVARLSKHGGQAVGLSGVDGNLVTAKKLDVDDGVDLGYVGEIRK 130 (265) T ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCEEEEEECCCCCCCCCCEEEEEEE T ss_conf 88899999999971874145553378789999866413433640445898 No 6 >TIGR00761 argB acetylglutamate kinase; InterPro: IPR004662 N -Acetylglutamate (NAG) fulfils distinct biological roles in lower and higher organisms. In prokaryotes, lower eukaryotes and plants it is the first intermediate in the biosynthesis of arginine, whereas in ureotelic (excreting nitrogen mostly in the form of urea) vertebrates, it is an essential allosteric cofactor for carbamyl phosphate synthetase I (CPSI), the first enzyme of the urea cycle. The pathway that leads from glutamate to arginine in lower organisms employs eight steps, starting with the acetylation of glutamate to form NAG. In these species, NAG can be produced by two enzymatic reactions: one catalysed by NAG synthase (NAGS) and the other by ornithine acetyltransferase (OAT). In ureotelic species, NAG is produced exclusively by NAGS. In lower organisms, NAGS is feedback-inhibited by L-arginine, whereas mammalian NAGS activity is significantly enhanced by this amino acid. The NAGS genes of bacteria, fungi and mammals are more diverse than other arginine-biosynthesis and urea-cycle genes. The evolutionary relationship between the distinctly different roles of NAG and its metabolism in lower and higher organisms remains to be determined . The pathway from glutamate to arginine is: NAGS; N-acetylglutamate synthase (2.3.1.1 from EC) (glutamate to N-acetylglutamate) NAGK; N-acetylglutamate kinase (2.7.2.8 from EC) (N-acetylglutamate to N-acetylglutamate-5P) NAGSA; N-acetyl-gamma-glutamyl-phosphate reductase (1.2.1.38 from EC) (N-acetylglutamate-5P to N-acetylglumate semialdehyde) Acetylornithine aminotransferase (2.6.1.11 from EC) (N-acetylglumate semialdehyde to N-acetylornithine) Acetylornithine deacetylase (3.5.1.16 from EC) (N-acetylornithine to ornithine) Arginase (3.5.3.1 from EC) (ornithine to arginine) This entry describes N-acetylglutamate kinases (ArgB) of many prokaryotes and the N-acetylglutamate kinase domains of multifunctional proteins from yeasts as well as some, as yet, uncharacterised proteins.; GO: 0003991 acetylglutamate kinase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=1.3e-37 Score=262.17 Aligned_cols=123 Identities=48% Similarity=0.834 Sum_probs=118.9 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCC-EEEEECCCHHHHHHHHHCCCCCEEECCC-----CCCCHHHHHHHHH Q ss_conf 899986756724746899999999998518971-7998457245788999708981155783-----4661248999999 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNIT-PVIVHGGGPQIGAVLEKMGIKSKFENGL-----RITDQQTAEVVEM 100 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k-~ViVHGgg~qI~~~l~~~gi~~~~v~G~-----RVTd~~tl~iv~~ 100 (150) ++|||+||+++.|+++.++|++|++.|+..|++ |||||||||+||++|+++||||+|++|+ ||||++||+|++| T Consensus 1 ~~ViK~GG~~~~~~~~~~~~~~~~~~l~~~g~~PpviVHGGGp~in~~~~~lgi~~~f~~Gl~G~~~RvTd~~tl~v~~~ 80 (254) T TIGR00761 1 TIVIKIGGSAISNDELLEEFFSDIAFLRAVGIKPPVIVHGGGPEINELLEALGIEPEFKNGLSGVVFRVTDKETLEVVEM 80 (254) T ss_pred CEEEEECCEEECCCHHHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHCCCCCEEECCCCCEEEECCCHHHHHHHHH T ss_conf 97898447341041246999999999997298553898378526888998328983676688720375278899999899 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEE----ECCCC--------CCCCCCCCC Q ss_conf 97416689999999828997687786558989999----74752--------111799510 Q 537021.9.peg.3 101 VLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAE----KARHS--------LRLSPNTKK 149 (150) Q Consensus 101 vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~----~~~~~--------lg~vg~~~k 149 (150) ||+|++|+.|++.|+++|++|+||||.|+.+++|+ .++.+ +||||++++ T Consensus 81 vL~g~vN~~LV~~l~~~g~~A~GL~g~Dg~L~~a~rk~~~~~~~~~~vv~~~~G~VG~~~~ 141 (254) T TIGR00761 81 VLIGDVNKELVALLNKHGINAIGLTGLDGQLFTADRKTASLDKEVIKVVAIDLGYVGEIKK 141 (254) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEECCCCCCEEEEEEEEEEEEECCCEEEEEECCCEEEECC T ss_conf 9836466999999996799286311357735799987898851576899997484123132 No 7 >TIGR01890 N-Ac-Glu-synth amino-acid N-acetyltransferase; InterPro: IPR010167 This entry represents a clade of amino-acid N-acetyltransferases, or N-acetylglutamate synthase, which is the product of the argA gene and the first enzyme in arginine biosynthesis. This enzyme displays more diversity between bacteria, fungi and mammals than other enzymes in arginine metabolism, and N-acetylglutamate itself can have different roles in different taxonomic groups: in prokaryotes, lower eukaryotes and plants it is the first intermediate in arginine biosynthesis, while in vertebrates it is an allosteric cofactor for the first enzyme in the urea cycle . In bacteria, arginine can regulate ornithine biosynthesis via a feedback inhibition mechanism . This enzyme may also act on aspartate.; GO: 0004042 amino-acid N-acetyltransferase activity, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=7.4e-37 Score=257.33 Aligned_cols=139 Identities=27% Similarity=0.413 Sum_probs=125.3 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCC Q ss_conf 62289999999987598899986756724746899999999998518971799845724578899970898115578346 Q 537021.9.peg.3 10 AEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRI 89 (150) Q Consensus 10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RV 89 (150) ..|||+|+|||++||||||||.+||.++.+++ +.+|+.||+|||++|+|+|||||.+|||++.|++++..|+|+.|+|| T Consensus 2 V~~lR~aAPYi~a~RdKT~Vvgl~g~~v~~~~-l~~~v~Di~LLh~LGv~lVLvHGaRPq~~~~la~~~~~p~Y~~G~Rv 80 (439) T TIGR01890 2 VAWLREAAPYINAHRDKTFVVGLGGELVEDDN-LGNIVADIALLHSLGVRLVLVHGARPQIEERLAARGRTPHYHRGLRV 80 (439) T ss_pred CCCHHHCCCHHHHHCCCEEEEECCCCCCCCCC-HHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCEEE T ss_conf 10033214035541586699842840014444-24599999999758918999708852689999962897870322571 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHC---CCCEE-EEEEEECCEEEEEECCC----CCCCCCCCCCC Q ss_conf 6124899999997416689999999828---99768-77865589899997475----21117995109 Q 537021.9.peg.3 90 TDQQTAEVVEMVLAGSINKKIVSLINQT---GTQAI-GICGKDGNMVFAEKARH----SLRLSPNTKKT 150 (150) Q Consensus 90 Td~~tl~iv~~vL~G~vn~~lv~~L~~~---g~~av-glsg~dg~li~a~~~~~----~lg~vg~~~k~ 150 (150) ||+.+|+.++++ +|+++..|.+.|+.. -..|- -++.+|||||+|+|.++ |+-+.|.++|+ T Consensus 81 TD~asLe~~~~a-aG~lr~~ieA~Ls~~~~~~pmaGsr~~vvSGNfvtArPiGv~~GvDy~h~G~vRk~ 148 (439) T TIGR01890 81 TDEASLELVKQA-AGELRLAIEARLSMSLSNTPMAGSRLSVVSGNFVTARPIGVIEGVDYEHTGVVRKI 148 (439) T ss_pred CCHHHHHHHHHH-HHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCEEEECCCEEECCCCCEECCCEEHH T ss_conf 356567789998-76688888888752124678888531356243332033101058662032745110 No 8 >cd04249 AAK_NAGK-NC AAK_NAGK-NC: N-Acetyl-L-glutamate kinase - noncyclic (NAGK-NC) catalyzes the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis using the acetylated, noncyclic route of ornithine biosynthesis. There are two variants of this pathway. In one, typified by the pathway in Escherichia coli, glutamate is acetylated by acetyl-CoA and acetylornithine is deacylated hydrolytically. In this pathway, feedback inhibition by arginine occurs at the initial acetylation of glutamate and not at the phosphorylation of NAG by NAGK. Homodimeric NAGK-NC are members of the Amino Acid Kinase Superfamily (AAK). Probab=100.00 E-value=2.4e-34 Score=241.21 Aligned_cols=122 Identities=25% Similarity=0.442 Sum_probs=116.6 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHH-HCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 999867567247468999999999985-1897179984572457889997089811557834661248999999974166 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLK-KSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSI 106 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~-~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~v 106 (150) |||||||++|.|++++++|++|+++|+ ..|.++||||||||||+++|+++|++++|++|+||||+++|++++|+|+|++ T Consensus 1 iVIK~GG~~l~~~~~~~~~~~~i~~l~~~~~~~vVlVHGggpqI~~~l~~~gi~~~f~~G~RvTd~~tl~vv~~vl~g~v 80 (252) T cd04249 1 LVIKLGGALLETEAALEQLFSALSEYQQQHNRQLVIVHGGGCVVDELLKKLNFPSEKKNGLRVTPKEQIPYITGALAGTA 80 (252) T ss_pred CEEEECCHHCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCCCCCCCCEEECCHHHHHHHHHHHCCCH T ss_conf 98997824617888999999999999981898769987996899999997699972379656589999999999984615 Q ss_pred HHHHHHHHHHCCCCEEEEEEEECCEEEEEECCCCCCCCCCCCC Q ss_conf 8999999982899768778655898999974752111799510 Q 537021.9.peg.3 107 NKKIVSLINQTGTQAIGICGKDGNMVFAEKARHSLRLSPNTKK 149 (150) Q Consensus 107 n~~lv~~L~~~g~~avglsg~dg~li~a~~~~~~lg~vg~~~k 149 (150) |+.|++.++++|.+|+|+|+.|++++.|++.++++|+||++++ T Consensus 81 n~~lv~~l~~~g~~a~gl~~~dg~~~~~~~~~~dlG~vG~v~~ 123 (252) T cd04249 81 NKQLMAQAIKAGLKPVGLSLADGGMTAVTQLDPELGAVGKATA 123 (252) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCCCCCCCEEE T ss_conf 9999999973589862044567746873023666786555323 No 9 >PRK00942 acetylglutamate kinase; Provisional Probab=100.00 E-value=2.3e-33 Score=234.91 Aligned_cols=123 Identities=54% Similarity=0.887 Sum_probs=118.9 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 89998675672474689999999999851897179984572457889997089811557834661248999999974166 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSI 106 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~v 106 (150) |||||+||+++.|++++++|++||++|+.+|+++||||||||||+++|+++|++|+|++|+||||+++|++++|+|+|++ T Consensus 1 TiVIkiGG~~l~~~~~~~~~~~dIa~L~~~G~~vVlVHGgg~qi~~~l~~~gi~~~~~~g~RvT~~~~l~~~~~~l~g~i 80 (262) T PRK00942 1 TIVIKYGGNAMTDEELKEAFARDIVLLKQVGINPVVVHGGGPQIDELLKKLGIESEFVNGLRVTDKETMEVVEMVLAGKV 80 (262) T ss_pred CEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHCHH T ss_conf 99999885884498799999999999998899199993882999999998699723047754661899999999981719 Q ss_pred HHHHHHHHHHCCCCEEEEEEEECCEEEEEECC-------CCCCCCCCCCC Q ss_conf 89999999828997687786558989999747-------52111799510 Q 537021.9.peg.3 107 NKKIVSLINQTGTQAIGICGKDGNMVFAEKAR-------HSLRLSPNTKK 149 (150) Q Consensus 107 n~~lv~~L~~~g~~avglsg~dg~li~a~~~~-------~~lg~vg~~~k 149 (150) |+.|++.|+++|++++|+||.|+|+++|+|+. .++|++|++++ T Consensus 81 ~~~l~~~l~~~g~~avglsg~d~~~v~a~~~~~~~~~~~~d~g~vG~v~~ 130 (262) T PRK00942 81 NKELVSLINKHGGKAVGLSGKDGGLVTARKLDENGKEKIIDLGYVGEVEE 130 (262) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCEEE T ss_conf 99999999855874231456678758999864456654566762046166 No 10 >PRK04531 acetylglutamate kinase; Provisional Probab=100.00 E-value=6.1e-33 Score=232.24 Aligned_cols=131 Identities=29% Similarity=0.501 Sum_probs=121.6 Q ss_pred HHHHHHHHHHHCC----CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCC Q ss_conf 8999999998759----889998675672474689999999999851897179984572457889997089811557834 Q 537021.9.peg.3 13 LEQVLPFVQFYEN----ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLR 88 (150) Q Consensus 13 l~ea~pYi~~~r~----ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~R 88 (150) =||+..|+++|++ ++.|||+||++|.+. +++|++|+++|+.+|++|||||||||||+++|+++|++++|++|+| T Consensus 4 ~kev~~yL~~Fss~~~~~fAVIKvGGavi~e~--l~~la~dLafL~~vGl~PVVVHGgGpqI~~~L~~~gIe~~fi~G~R 81 (421) T PRK04531 4 AKEISQYLKRFSSLDPERFAVIKVGGAVLRED--LEALTSSLKFLQEVGLTPVVVHGAGPQLDAELSAAGIEKDTVNGLR 81 (421) T ss_pred HHHHHHHHHHHCCCCCCEEEEEEECHHHHHHH--HHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCCC T ss_conf 79999999973588876189999887999979--9999999999998799799993898778799997599976779630 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECC-CCCCCCCCCCC Q ss_conf 66124899999997416689999999828997687786558989999747-52111799510 Q 537021.9.peg.3 89 ITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKAR-HSLRLSPNTKK 149 (150) Q Consensus 89 VTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~-~~lg~vg~~~k 149 (150) |||+++|++++|+|. ++|.+|++.|+++|.+|++++| +++.|+..+ +++|+||++++ T Consensus 82 VTd~etl~vv~~vl~-~~n~~lV~~L~~~G~~A~~i~g---~~~~a~~~d~~~~G~VG~V~~ 139 (421) T PRK04531 82 VTSPEALAIVRKVFQ-ASNLKLVEALQQNGARATSITG---GVFEAEYLDRDTYGLVGEVKA 139 (421) T ss_pred CCCHHHHHHHHHHHH-HHHHHHHHHHHHCCCCEEECCC---CCEECCCCCCCCCCCCCCEEE T ss_conf 588889999999997-6529999999967997573466---543503366666764333477 No 11 >cd04238 AAK_NAGK-like AAK_NAGK-like: N-Acetyl-L-glutamate kinase (NAGK)-like . Included in this CD are the Escherichia coli and Pseudomonas aeruginosa type NAGKs which catalyze the phosphorylation of N-acetyl-L-glutamate (NAG) by ATP in the second step of arginine biosynthesis found in bacteria and photosynthetic organisms using either the acetylated, noncyclic (NC), or non-acetylated, cyclic (C) route of ornithine biosynthesis. Also included in this CD is a distinct group of uncharacterized (UC) bacterial and archeal NAGKs. Members of this CD belong to the Amino Acid Kinase Superfamily (AAK). Probab=99.98 E-value=7.9e-32 Score=225.12 Aligned_cols=122 Identities=50% Similarity=0.894 Sum_probs=117.8 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 99986756724746899999999998518971799845724578899970898115578346612489999999741668 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSIN 107 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn 107 (150) +|||+||+++.|++++++|++||++|+++|+++||||||||||++.|+++|++++|++|+||||+++|++++|+++|++| T Consensus 1 IVIKlGG~~l~d~~~~~~~~~dIa~L~~~G~~vVlVHGgg~qi~~~l~~~gi~~~~~~g~RvTd~~~l~~~~~~l~g~in 80 (256) T cd04238 1 VVIKYGGSAMKDEELKEAFADDIVLLKQVGINPVIVHGGGPEINELLKRLGIESEFVNGLRVTDKETMEIVEMVLAGKVN 80 (256) T ss_pred CEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHCCCCCEECCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 98997868945987999999999999988992999918829999999986998350267314618789999999853389 Q ss_pred HHHHHHHHHCCCCEEEEEEEECCEEEEEEC---CCCCCCCCCCCC Q ss_conf 999999982899768778655898999974---752111799510 Q 537021.9.peg.3 108 KKIVSLINQTGTQAIGICGKDGNMVFAEKA---RHSLRLSPNTKK 149 (150) Q Consensus 108 ~~lv~~L~~~g~~avglsg~dg~li~a~~~---~~~lg~vg~~~k 149 (150) ++|++.|+++|++|+|+|+.|+|+++|+|. ..++|++|++++ T Consensus 81 ~~l~~~l~~~g~~aiglsg~d~~~i~a~~~~~~~~d~g~vG~v~~ 125 (256) T cd04238 81 KELVSLLNRAGGKAVGLSGKDGGLIKAEKKEEKDIDLGFVGEVTE 125 (256) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCC T ss_conf 999999997799721214655877999975564434342455443 No 12 >cd04252 AAK_NAGK-fArgBP AAK_NAGK-fArgBP: N-Acetyl-L-glutamate kinase (NAGK) of the fungal arginine-biosynthetic pathway (fArgBP). The nuclear-encoded, mitochondrial polyprotein precursor with an N-terminal NAGK (ArgB) domain (this CD), a central DUF619 domain, and a C-terminal reductase domain (ArgC, N-Acetylglutamate Phosphate Reductase, NAGPR). The precursor is cleaved in the mitochondria into two distinct enzymes (NAGK-DUF619 and NAGPR). Native molecular weights of these proteins indicate that the kinase is an octamer whereas the reductase is a dimer. This CD also includes some gamma-proteobacteria (Xanthomonas and Xylella) NAG kinases with an N-terminal NAGK (ArgB) domain (this CD) and a C-terminal DUF619 domain. The DUF619 domain is described as a putative distant homolog of the acetyltransferase, ArgA, predicted to function in NAG synthase association in fungi. Eukaryotic sequences have an N-terminal mitochondrial transit peptide. Members of this NAG kinase domain CD belong to th Probab=99.97 E-value=4.9e-30 Score=213.63 Aligned_cols=115 Identities=28% Similarity=0.503 Sum_probs=107.7 Q ss_pred EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99867567247468999999999985189717998457245788999708981155783466124899999997416689 Q 537021.9.peg.3 29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINK 108 (150) Q Consensus 29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~ 108 (150) ||||||+++.|+ +++|++||++|+++|+++||||||||||+++|+++|++|+|++|+||||+++|++++|+++ ++|+ T Consensus 2 VIk~GG~~~~d~--~~~l~~di~lL~~lGi~~VlVHGgg~qI~~~l~~~gi~~~~~~G~RvTd~~~l~~v~~vl~-~vn~ 78 (248) T cd04252 2 VIKVGGAIIEDD--LDELAASLSFLQHVGLYPIVVHGAGPQLNEELEAAGVEPEYVDGLRVTDPETLAVARKVFL-EENL 78 (248) T ss_pred EEEECCHHHHHH--HHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHH-HHHH T ss_conf 799991776521--9999999999998899599991898788899998699853138844689899999999999-9999 Q ss_pred HHHHHHHHCCCCEEEEEEEECCEEEEEECCC-CCCCCCCCCC Q ss_conf 9999998289976877865589899997475-2111799510 Q 537021.9.peg.3 109 KIVSLINQTGTQAIGICGKDGNMVFAEKARH-SLRLSPNTKK 149 (150) Q Consensus 109 ~lv~~L~~~g~~avglsg~dg~li~a~~~~~-~lg~vg~~~k 149 (150) +|++.|+++|.++.+++| +++.|++.+. ++|++|++++ T Consensus 79 ~lv~~l~~~g~~a~~i~g---~~~~a~~~~~~~~g~vG~v~~ 117 (248) T cd04252 79 KLVEALERNGARARPITS---GVFEAEYLDKDKYGLVGKITG 117 (248) T ss_pred HHHHHHHHCCCCEEEECC---CCEEEEECCCCCCCCCCCEEE T ss_conf 999999965997386037---636888667656784322589 No 13 >cd04251 AAK_NAGK-UC AAK_NAGK-UC: N-Acetyl-L-glutamate kinase - uncharacterized (NAGK-UC). This domain is similar to Escherichia coli and Pseudomonas aeruginosa NAGKs which catalyze the phosphorylation of the gamma-COOH group of N-acetyl-L-glutamate (NAG) by ATP in the second step of microbial arginine biosynthesis. These uncharacterized domain sequences are found in some bacteria (Deinococci and Chloroflexi) and archea and belong to the Amino Acid Kinase Superfamily (AAK). Probab=99.96 E-value=2.6e-28 Score=202.50 Aligned_cols=115 Identities=37% Similarity=0.632 Sum_probs=103.3 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEE---CCC--CCCCHHHHHHHHHHH Q ss_conf 999867567247468999999999985189717998457245788999708981155---783--466124899999997 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFE---NGL--RITDQQTAEVVEMVL 102 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v---~G~--RVTd~~tl~iv~~vL 102 (150) +||||||+++.|++ +++.|+ .++|.++||||||||||+++|+++|++|+|+ +|+ ||||+++|++++|++ T Consensus 1 iVIK~GG~~l~d~~---~~~~~l---~~lG~~~VlVHGgg~qI~~~l~~~gi~~~fv~~~~Gl~~RvTd~~~l~~v~~~~ 74 (257) T cd04251 1 IVVKIGGSVVSDLD---KVIDDI---ANFGERLIVVHGGGNYVNEYLKRLGVEPKFVTSPSGIRSRYTDKETLEVFVMVM 74 (257) T ss_pred CEEEECHHHHHCHH---HHHHHH---HHCCCCEEEECCCCHHHHHHHHHCCCCCEEECCCCCCCEEECCHHHHHHHHHHH T ss_conf 98998958876989---999999---975999899979977999999985999677417898620218989999999999 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECCC------------CCCCCCCCCC Q ss_conf 4166899999998289976877865589899997475------------2111799510 Q 537021.9.peg.3 103 AGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKARH------------SLRLSPNTKK 149 (150) Q Consensus 103 ~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~~------------~lg~vg~~~k 149 (150) |++|++|++.|+++|++|+|+||.|++++.|++++. ++|++|++++ T Consensus 75 -g~vn~~lv~~l~~~g~~a~glsg~d~~~~~~~~~~~~~~~~~g~~~~id~g~vG~v~~ 132 (257) T cd04251 75 -GLINKKIVARLHSLGVKAVGLTGLDGRLLEAKRKEIVRVNERGRKMIIRGGYTGKVEK 132 (257) T ss_pred -HHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEEECCCEEECCCCCCCCCCCCCEEEEEE T ss_conf -9999999999984899865222666736898512524424678601045797677989 No 14 >KOG2436 consensus Probab=99.91 E-value=2.6e-24 Score=176.91 Aligned_cols=141 Identities=27% Similarity=0.455 Sum_probs=129.9 Q ss_pred HCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCC Q ss_conf 20622899999999875988999867567247468999999999985189717998457245788999708981155783 Q 537021.9.peg.3 8 FQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGL 87 (150) Q Consensus 8 ~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~ 87 (150) ..-.+++|++|||..|++++||+..+|.++ ...++..+++|+++|+.+|++||+|||++||||+.|...|++++|.+|+ T Consensus 77 ~~v~~~~e~l~yi~~~~~q~fvV~~~g~~~-~t~~~~sl~s~lafl~h~gl~pIvv~g~~~qin~~l~~~~ie~~y~~~~ 155 (520) T KOG2436 77 TTVRILRESLPYITSFRDQKFVVIKSGEAI-STSLLHSLASDLAFLHHVGLRPIVVPGTQPQINRLLAERGIEPEYVDGY 155 (520) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEEECCCCC-CCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCCCCE T ss_conf 089999988789987337507999435435-5635889987778875077368884386277778999728883003631 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEE--ECCEEEEEEC----CCCCCCCCCCCC Q ss_conf 4661248999999974166899999998289976877865--5898999974----752111799510 Q 537021.9.peg.3 88 RITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGK--DGNMVFAEKA----RHSLRLSPNTKK 149 (150) Q Consensus 88 RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~--dg~li~a~~~----~~~lg~vg~~~k 149 (150) |+||+.+|+++.+...++.|..++..|+++|..+.+.++. .+|++.|++. +..+|++|+..+ T Consensus 156 RvTda~t~q~~~~~~~~E~n~~lv~nL~~~g~~ar~~s~g~~v~~~f~a~~~~v~d~~~y~~~gei~~ 223 (520) T KOG2436 156 RVTDAHTLQAAKESVSLEANLNLVINLSQLGTRARPSSSGVRVGNFFPADRNGVLDGEDYGLVGEIKK 223 (520) T ss_pred ECCCHHHHHHHHHCCHHHHHHHHHHHHHHHHCEECCCCCCCCCCCEEECCCCCCCCCCEEEEECCCCE T ss_conf 02618888776531300215679999997540103312566443044235234554000220033003 No 15 >cd04236 AAK_NAGS-Urea AAK_NAGS-Urea: N-acetylglutamate (NAG) kinase-like domain of the NAG Synthase (NAGS) of the urea cycle found in animals. Ureogenic NAGS is a mitochondrial enzyme catalyzing the formation of NAG from acetylcoenzyme A and L-glutamate; NAG is an essential allosteric activator of carbamylphosphate synthase I, the first and rate limiting enzyme of the urea cycle. Ureogenic NAGS activity is dependent on the concentration of glutamate (substrate) and arginine (activator). Domain architecture of ureogenic NAGS consists of an N-terminal NAG kinase-like (ArgB) domain (this CD) and a C-terminal DUF619 domain. Members of this CD belong to the protein superfamily, the Amino Acid Kinase Family (AAKF). Probab=99.86 E-value=3e-21 Score=157.28 Aligned_cols=128 Identities=10% Similarity=0.131 Sum_probs=110.0 Q ss_pred CHHHHHCCCHHHHHHHHHHHHC-------CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH Q ss_conf 0023220622899999999875-------988999867567247468999999999985189717998457245788999 Q 537021.9.peg.3 3 EKIYQFQAEILEQVLPFVQFYE-------NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLE 75 (150) Q Consensus 3 ~~~~~~~a~~l~ea~pYi~~~r-------~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~ 75 (150) ++++|++.+..+|+..|+++|+ .+.+|||+||+++.|.+.+++|++|+++|+.+|+.||+||||||++...++ T Consensus 6 k~~l~~~G~s~kEvr~YL~~F~~~~~~~~~~faVIKvGGaVl~d~~~l~~l~~~lsfL~~vGl~pIVVHG~g~~i~~~~~ 85 (271) T cd04236 6 KAFLHQKGGDPREARYWLTQFQIAMPNDWPAFAVLEVDHSVFRSLEMVQSLSFGLAFLQRMDMKLLVVMGLSAPDGTNMS 85 (271) T ss_pred HHHHHHCCCCHHHHHHHHHHHCCCCCCCCCCEEEEEECHHHEECHHHHHHHHHHHHHHHHCCCEEEEEECCCHHCCCCCC T ss_conf 99866509998999999997335677777876999988131505889999999999999889948999389701155746 Q ss_pred HCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECCCCCCCCCCCCC Q ss_conf 70898115578346612489999999741668999999982899768778655898999974752111799510 Q 537021.9.peg.3 76 KMGIKSKFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKARHSLRLSPNTKK 149 (150) Q Consensus 76 ~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~~~lg~vg~~~k 149 (150) +.+++.+++++. +-|+.|+++|++.|++|.++++.+. .+.+. ...++++||++.+ T Consensus 86 ----------------~~~~~~~~~v~~-~~n~~lV~aL~~~G~~A~~i~s~~~-~~~~~-~~e~~~~vG~~~~ 140 (271) T cd04236 86 ----------------DLELQAARSRLV-KDCKTLVEALQANSAAAHPLFSGES-VLQAE-EPEPGASKGPSVS 140 (271) T ss_pred ----------------CHHHHHHHHHHH-HHHHHHHHHHHHCCCCEEEEECCCC-EEEEC-CCCCCCCCCCCEE T ss_conf ----------------012256766637-9999999999976996245416873-37722-5301376687138 No 16 >cd02115 AAK Amino Acid Kinases (AAK) superfamily, catalytic domain; present in such enzymes like N-acetylglutamate kinase (NAGK), carbamate kinase (CK), aspartokinase (AK), glutamate-5-kinase (G5K) and UMP kinase (UMPK). The AAK superfamily includes kinases that phosphorylate a variety of amino acid substrates. These kinases catalyze the formation of phosphoric anhydrides, generally with a carboxylate, and use ATP as the source of the phosphoryl group; are involved in amino acid biosynthesis. Some of these kinases control the process via allosteric feed-back inhibition. Probab=99.77 E-value=4.5e-18 Score=136.90 Aligned_cols=114 Identities=25% Similarity=0.355 Sum_probs=104.3 Q ss_pred EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 99867567247468999999999985189717998457245788999708981155783466124899999997416689 Q 537021.9.peg.3 29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINK 108 (150) Q Consensus 29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~ 108 (150) ||||||+.+.|++.+.+++++|+.++..|.++|+||||+++++..|.+++++++|.+|+|+|++++++++.+.. +..+. T Consensus 1 ViK~GGssl~~~~~~~~~~~~i~~l~~~g~~~vvV~sa~~~it~~l~~~~~~~~~~~~~~~t~~~~~~i~~~g~-~~~~~ 79 (248) T cd02115 1 VIKFGGSSVSSEERLRNLARILVKLASEGGRVVVVHGAGPQITDELLAHGELLGYARGLRITDRETDALAAMGE-GMSNL 79 (248) T ss_pred CEEECHHHCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH-HHHHH T ss_conf 99889136069999999999999999789998999698445329999865354331367899899999999999-99999 Q ss_pred HHHHHHHHCCCCEEEEEEEECCEEEEEECCCCCCCCCCCC Q ss_conf 9999998289976877865589899997475211179951 Q 537021.9.peg.3 109 KIVSLINQTGTQAIGICGKDGNMVFAEKARHSLRLSPNTK 148 (150) Q Consensus 109 ~lv~~L~~~g~~avglsg~dg~li~a~~~~~~lg~vg~~~ 148 (150) .+++.|++.|+++.++++.+..++.. +++++|+.. T Consensus 80 ~l~~~L~~~g~~a~~l~~~~~~~~~~-----~~~~~~~v~ 114 (248) T cd02115 80 LIAAALEQHGIKAVPLDLTQAGFASP-----NQGHVGKIT 114 (248) T ss_pred HHHHHHHHCCCCEEEEEEEECCEECC-----CCCCCCCCC T ss_conf 99999985699816986553233136-----644432102 No 17 >pfam00696 AA_kinase Amino acid kinase family. This family includes kinases that phosphorylate a variety of amino acid substrates, as well as uridylate kinase and carbamate kinase. This family includes: Aspartokinase EC:2.7.2.4. Acetylglutamate kinase EC:2.7.2.8. Glutamate 5-kinase EC:2.7.2.11. Uridylate kinase EC:2.7.4.-. Carbamate kinase EC:2.7.2.2. Probab=99.60 E-value=1.6e-14 Score=114.15 Aligned_cols=109 Identities=32% Similarity=0.457 Sum_probs=101.6 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 88999867567247468999999999985189717998457245788999708981155783466124899999997416 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGS 105 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~ 105 (150) |++|||+||+++.|++.+..++++|+.++..|.++|||||||++.+.++++.+.+++ .+.|+|+.+++.+.++++. . T Consensus 1 k~iViK~GGs~l~~~~~i~~~~~~I~~l~~~g~~vvvV~ggg~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~-~ 77 (230) T pfam00696 1 KRIVIKLGGSSLTDEDAIKRIAEEIALLSELGIKVVVVSGGGGFTDKLLAAYGIAEK--IGLRVTAGATGLIIEAALA-G 77 (230) T ss_pred CEEEEEECHHHCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHCCCCC--CCHHHHHHHHHHHHHHHHH-H T ss_conf 909999585677898999999999999996899499997985888799998177765--7515669999999999999-9 Q ss_pred HHHHHHHHHHHCCCCEEEEEEEECCEEEEEEC Q ss_conf 68999999982899768778655898999974 Q 537021.9.peg.3 106 INKKIVSLINQTGTQAIGICGKDGNMVFAEKA 137 (150) Q Consensus 106 vn~~lv~~L~~~g~~avglsg~dg~li~a~~~ 137 (150) ++..+++.+...+.+++++...+.++..++.. T Consensus 78 l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (230) T pfam00696 78 LLDIVVSAGERLGARAVALLLSDGGIGAVRLD 109 (230) T ss_pred HHHHHHHHHHHCCCHHHHCCCCCCCCCEEEEE T ss_conf 99999999998297301325651345516884 No 18 >PRK12353 carbamate kinase; Reviewed Probab=99.34 E-value=2.2e-11 Score=94.05 Aligned_cols=94 Identities=19% Similarity=0.300 Sum_probs=65.4 Q ss_pred CCEEEEEECHHHHCCH--------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH-HHHCCCCCEEECCCCCCCHHHH Q ss_conf 9889998675672474--------689999999999851897179984572457889-9970898115578346612489 Q 537021.9.peg.3 25 NETIVVKYGGHVMNCT--------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV-LEKMGIKSKFENGLRITDQQTA 95 (150) Q Consensus 25 ~ktiVIK~GG~~l~d~--------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~-l~~~gi~~~~v~G~RVTd~~tl 95 (150) .|++||+|||++|.++ +....++++|+.|...|.++|||||+||||... |++....+.+..+++.+...++ T Consensus 2 ~k~IVIalGGNAl~~~~~t~~~Q~~~~~~~a~~ia~l~~~G~~~VitHGnGPQVG~lll~~~~~~~~~~~~~pld~~~a~ 81 (312) T PRK12353 2 MKKIVVALGGNALGSSDGSAEEQLEAVKKTAKSLVDLIEEGHEVVITHGNGPQVGNLLLQQAAADSEKNPAMPLDVCVAM 81 (312) T ss_pred CCEEEEEECHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCCCEECCHHHH T ss_conf 84499983725318999998999999999999999999889969998388448899999866542688898211014343 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999997416689999999828997 Q 537021.9.peg.3 96 EVVEMVLAGSINKKIVSLINQTGTQ 120 (150) Q Consensus 96 ~iv~~vL~G~vn~~lv~~L~~~g~~ 120 (150) .- .-+.-.+...|...|...|.+ T Consensus 82 tq--G~IGy~l~q~l~~~l~~~g~~ 104 (312) T PRK12353 82 SQ--GYIGYWLQNALDNELLKRGIN 104 (312) T ss_pred CC--CHHHHHHHHHHHHHHHHCCCC T ss_conf 03--069999999999999845899 No 19 >cd04241 AAK_FomA-like AAK_FomA-like: This CD includes a fosfomycin biosynthetic gene product, FomA, and similar proteins found in a wide range of organisms. Together, the fomA and fomB genes in the fosfomycin biosynthetic gene cluster of Streptomyces wedmorensis confer high-level fosfomycin resistance. FomA and FomB proteins converted fosfomycin to fosfomycin monophosphate and fosfomycin diphosphate in the presence of ATP and a magnesium ion, indicating that FomA and FomB catalyzed phosphorylations of fosfomycin and fosfomycin monophosphate, respectively. FomA and related sequences in this CD are members of the Amino Acid Kinase Superfamily (AAK). Probab=99.01 E-value=5.7e-09 Score=78.53 Aligned_cols=99 Identities=19% Similarity=0.277 Sum_probs=74.3 Q ss_pred EEEEEECHHHHCCHH--------HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHH Q ss_conf 899986756724746--------899999999998518971799845724578899970898115578346612489999 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTD--------LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVV 98 (150) Q Consensus 27 tiVIK~GG~~l~d~~--------l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv 98 (150) .||||+||+++.+++ ...+++++|+. ..|.++|||||||.....++++.+.. .+.+.++++.+... T Consensus 1 mIVIKiGgs~lt~~~~~~~~~~~~i~~~~~~i~~--~~~~~iiiV~GgG~~g~~~~~~~~~~----~~~~~~~~~~~~~~ 74 (252) T cd04241 1 MIILKLGGSVITDKDRPETIREENLERIARELAE--AIDEKLVLVHGGGSFGHPKAKEYGLP----DGDGSFSAEGVAET 74 (252) T ss_pred CEEEEECHHHEECCCCCCCCCHHHHHHHHHHHHH--HCCCCEEEEECCHHHHHHHHHHHCCC----CCCCCCCHHHHHHH T ss_conf 9999965787048976565119999999999998--67998899989548769999871344----68741228889999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEE Q ss_conf 9997416689999999828997687786558989 Q 537021.9.peg.3 99 EMVLAGSINKKIVSLINQTGTQAIGICGKDGNMV 132 (150) Q Consensus 99 ~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li 132 (150) .+.+. ++|..+++.|.++|.+++++++.+..+. T Consensus 75 ~~~~~-~l~~~~~~~l~~~g~~a~~l~~~~~~~~ 107 (252) T cd04241 75 HEAML-ELNSIVVDALLEAGVPAVSVPPSSFFVT 107 (252) T ss_pred HHHHH-HHHHHHHHHHHHCCCCEEEECHHHHHHH T ss_conf 99999-9999999999976996352040444443 No 20 >PRK12686 carbamate kinase; Reviewed Probab=98.73 E-value=3.1e-07 Score=67.43 Aligned_cols=96 Identities=18% Similarity=0.288 Sum_probs=65.0 Q ss_pred CCCEEEEEECHHHHCCH-----HH---HHHHHHHHHHHHHCCCCEEEEECCCHHHHH-HHHHCCCCCEEECCCCCCCHHH Q ss_conf 59889998675672474-----68---999999999985189717998457245788-9997089811557834661248 Q 537021.9.peg.3 24 ENETIVVKYGGHVMNCT-----DL---SKDFVNDIALLKKSNITPVIVHGGGPQIGA-VLEKMGIKSKFENGLRITDQQT 94 (150) Q Consensus 24 r~ktiVIK~GG~~l~d~-----~l---~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~-~l~~~gi~~~~v~G~RVTd~~t 94 (150) -.|++||.+||++|..+ +. ....+++|+.|...|.++||+||.|||+.. .|++....+..+..++...-.+ T Consensus 2 m~k~iVIAlGGNAl~~~~~t~~~Q~~~~~~~a~~ia~l~~~g~~vVitHGNGPQVG~l~l~~~~~~~~~~p~~Pld~~~A 81 (313) T PRK12686 2 MKEKIVIALGGNAILQTEASAEAQQTAVREAMQHLVDLIEAGADIVITHGNGPQVGSLLLQQAESNSNKVPAMPLDTCVA 81 (313) T ss_pred CCCEEEEEECCHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCCCCCCCCCCHHHH T ss_conf 97779998280312899999999999999999999999986997999818807989999998764368889876404543 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCE Q ss_conf 999999974166899999998289976 Q 537021.9.peg.3 95 AEVVEMVLAGSINKKIVSLINQTGTQA 121 (150) Q Consensus 95 l~iv~~vL~G~vn~~lv~~L~~~g~~a 121 (150) +.-- -+.-.+...|-+.|.+++.+. T Consensus 82 ~sQG--~IGy~lqqal~n~l~~~~~~~ 106 (313) T PRK12686 82 MSQG--MIGYWLQTALNNELTERNIDK 106 (313) T ss_pred CCCH--HHHHHHHHHHHHHHHHCCCCC T ss_conf 0233--799999999999998638998 No 21 >cd04235 AAK_CK AAK_CK: Carbamate kinase (CK) catalyzes both the ATP-phosphorylation of carbamate and carbamoyl phosphate (CP) utilization with the production of ATP from ADP and CP. Both CK (this CD) and nonhomologous CP synthetase synthesize carbamoyl phosphate, an essential precursor of arginine and pyrimidine bases, in the presence of ATP, bicarbonate, and ammonia. CK is a homodimer of 33 kDa subunits and is a member of the Amino Acid Kinase Superfamily (AAK). Probab=98.68 E-value=4.9e-07 Score=66.16 Aligned_cols=93 Identities=23% Similarity=0.280 Sum_probs=64.3 Q ss_pred EEEEEECHHHHCCH-------H---HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHH Q ss_conf 89998675672474-------6---8999999999985189717998457245788999708981155783466124899 Q 537021.9.peg.3 27 TIVVKYGGHVMNCT-------D---LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAE 96 (150) Q Consensus 27 tiVIK~GG~~l~d~-------~---l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~ 96 (150) .+||.+||+++..+ + .....+++|+.|...|.++||+||.|||+...+.++...+++...+ .|+ T Consensus 1 riViALGGNAl~~~g~~~t~~~Q~~~v~~~a~~ia~l~~~G~~vVitHGNGPQVG~ll~~~~~~~~~~p~~------Pld 74 (308) T cd04235 1 RIVVALGGNALLRRGEPGTAEEQRENVKIAAKALADLIKNGHEVVITHGNGPQVGNLLLQNEAAAEKVPAY------PLD 74 (308) T ss_pred CEEEEECHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCCCC------CCH T ss_conf 98999573650899999999999999999999999999869979998699518999999888624678998------620 Q ss_pred H-HHHH---HHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 9-9999---741668999999982899768778 Q 537021.9.peg.3 97 V-VEMV---LAGSINKKIVSLINQTGTQAIGIC 125 (150) Q Consensus 97 i-v~~v---L~G~vn~~lv~~L~~~g~~avgls 125 (150) + +.|. +.-.++..|...|.+++.+..-.+ T Consensus 75 ~~~A~sQG~IGy~lqqal~n~l~~~~~~~~v~t 107 (308) T cd04235 75 VCGAMSQGMIGYMLQQALDNELPKRGIDKPVVT 107 (308) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEE T ss_conf 464653059999999999999985489984599 No 22 >PRK12454 carbamate kinase-like carbamoyl phosphate synthetase; Reviewed Probab=98.27 E-value=4.6e-06 Score=59.92 Aligned_cols=92 Identities=24% Similarity=0.367 Sum_probs=62.9 Q ss_pred CCEEEEEECHHHHCCH----------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHH Q ss_conf 9889998675672474----------689999999999851897179984572457889997089811557834661248 Q 537021.9.peg.3 25 NETIVVKYGGHVMNCT----------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQT 94 (150) Q Consensus 25 ~ktiVIK~GG~~l~d~----------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~t 94 (150) .|++||.+||+++..+ +.....+++|+.|...|.++||+||.|||+...+.+...-... . +++-- T Consensus 2 ~k~iVvALGGNAl~~~g~~~t~~~Q~~~v~~~a~~ia~li~~g~~vVitHGNGPQVG~i~l~~~~~~~~----~-~p~~P 76 (313) T PRK12454 2 KKRIVIALGGNALQQRGEKGTAENMMKNVRRTAEQIARLIEEGYEVVITHGNGPQVGNLLLQMDAAKDV----G-IPPFP 76 (313) T ss_pred CCEEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCC----C-CCCCC T ss_conf 955999808365089999999999999999999999999987997999818813999999999873246----8-99866 Q ss_pred HHHHHHHHHHHHHHHHHHH----HHHCCCCE Q ss_conf 9999999741668999999----98289976 Q 537021.9.peg.3 95 AEVVEMVLAGSINKKIVSL----INQTGTQA 121 (150) Q Consensus 95 l~iv~~vL~G~vn~~lv~~----L~~~g~~a 121 (150) |++.-..--|.+...|..+ |.++|... T Consensus 77 Ld~~~A~sQG~IGy~lqqal~n~L~~~~~~~ 107 (313) T PRK12454 77 LDVAGAMTQGWIGYMIQQALRNVLPKRGIER 107 (313) T ss_pred CHHHHHCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 0355230134899999999999998539998 No 23 >cd04254 AAK_UMPK-PyrH-Ec UMP kinase (UMPK)-Ec, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of E. coli (Ec) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial and chloroplast UMPKs (this CD) have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of this CD be Probab=98.12 E-value=4e-05 Score=53.89 Aligned_cols=92 Identities=18% Similarity=0.284 Sum_probs=61.2 Q ss_pred CEEEEEECHHHHC-------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH-HHHHHCCCCCEEECCCCCCCHHHHHH Q ss_conf 8899986756724-------746899999999998518971799845724578-89997089811557834661248999 Q 537021.9.peg.3 26 ETIVVKYGGHVMN-------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG-AVLEKMGIKSKFENGLRITDQQTAEV 97 (150) Q Consensus 26 ktiVIK~GG~~l~-------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~-~~l~~~gi~~~~v~G~RVTd~~tl~i 97 (150) |.+|||+||+++. |.+.+.+++++|..++..|.+++||+|||...- ......++. | ...+.+. T Consensus 1 kRIViKiGGs~L~~~~~~~ld~~~I~~la~~I~~l~~~G~eVviVvggG~i~rg~~~~~~~~~-------~-~~~d~~g- 71 (231) T cd04254 1 KRVLLKLSGEALAGENGFGIDPEVLNRIAREIKEVVDLGVEVAIVVGGGNIFRGASAAEAGMD-------R-ATADYMG- 71 (231) T ss_pred CEEEEEECHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHCCHHHHHCCCC-------C-CHHHHHH- T ss_conf 959999516660789998859999999999999999779869999817730124224534666-------3-5188999- Q ss_pred HHHHHHHHHHHH-HHHHHHHCCCCEEEEEEEEC Q ss_conf 999974166899-99999828997687786558 Q 537021.9.peg.3 98 VEMVLAGSINKK-IVSLINQTGTQAIGICGKDG 129 (150) Q Consensus 98 v~~vL~G~vn~~-lv~~L~~~g~~avglsg~dg 129 (150) | ++-.+|.. +..+|...|.++.-++..+- T Consensus 72 --~-lat~~n~~ll~~al~~~g~~~~~~t~~~~ 101 (231) T cd04254 72 --M-LATVINALALQDALESLGVKTRVMSAIPM 101 (231) T ss_pred --H-HHHHHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf --9-99999999999999867998799742234 No 24 >PRK00358 pyrH uridylate kinase; Provisional Probab=98.11 E-value=7.1e-05 Score=52.27 Aligned_cols=90 Identities=22% Similarity=0.331 Sum_probs=56.5 Q ss_pred CEEEEEECHHHHC-------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHH-HHHHHHHCCCCCEEECCCCCCCHHHHHH Q ss_conf 8899986756724-------7468999999999985189717998457245-7889997089811557834661248999 Q 537021.9.peg.3 26 ETIVVKYGGHVMN-------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQ-IGAVLEKMGIKSKFENGLRITDQQTAEV 97 (150) Q Consensus 26 ktiVIK~GG~~l~-------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q-I~~~l~~~gi~~~~v~G~RVTd~~tl~i 97 (150) |-+|||+||+++. |.+...+++++|+.++..|.++|||+|||.. ....+.+.+++.. .++. T Consensus 1 kRIViKiG~s~L~~~~~~~l~~~~i~~la~~I~~l~~~G~evvlVsSGg~~a~~~~~~~~~~~~~-----------~~d~ 69 (231) T PRK00358 1 KRVLLKLSGEALAGDEGFGIDPDVLDRIAQEIKELVELGVEVAVVIGGGNIFRGAAAAKAGMDRV-----------TADY 69 (231) T ss_pred CEEEEEECHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHCCHHHHHCCCCCC-----------HHHH T ss_conf 95999965565038999983999999999999999977996999983873315264774588654-----------6999 Q ss_pred HHHHHHHHHHHHH-HHHHHHCCCCEEEEEEE Q ss_conf 9999741668999-99998289976877865 Q 537021.9.peg.3 98 VEMVLAGSINKKI-VSLINQTGTQAIGICGK 127 (150) Q Consensus 98 v~~vL~G~vn~~l-v~~L~~~g~~avglsg~ 127 (150) ..+ ++-..|..+ ...|...+..+...+.. T Consensus 70 ~g~-~at~~n~~ll~~~l~~~~~~~~~~~~~ 99 (231) T PRK00358 70 MGM-LATVMNALALQDALERAGVNTRVQSAI 99 (231) T ss_pred HHH-HHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 999-999999999999997369732675122 No 25 >cd04256 AAK_P5CS_ProBA AAK_P5CS_ProBA: Glutamate-5-kinase (G5K) domain of the bifunctional delta 1-pyrroline-5-carboxylate synthetase (P5CS), composed of an N-terminal G5K (ProB) and a C-terminal glutamyl 5- phosphate reductase (G5PR, ProA), the first and second enzyme catalyzing proline (and, in mammals, ornithine) biosynthesis. G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, and is subject to feedback allosteric inhibition by proline or ornithine. In plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia. Probab=98.04 E-value=3e-05 Score=54.64 Aligned_cols=47 Identities=23% Similarity=0.356 Sum_probs=39.1 Q ss_pred CCCEEEEEECHHHHCCHH-------HHHHHHHHHHHHHHCCCCEEEEECCCHHH Q ss_conf 598899986756724746-------89999999999851897179984572457 Q 537021.9.peg.3 24 ENETIVVKYGGHVMNCTD-------LSKDFVNDIALLKKSNITPVIVHGGGPQI 70 (150) Q Consensus 24 r~ktiVIK~GG~~l~d~~-------l~~~~~~dI~ll~~~G~k~ViVHGgg~qI 70 (150) +-|.+|||+|++++.+++ .+.+++++|+.|+..|..+||||+|+-.. T Consensus 7 ~~krIVIKiGss~lt~~d~~~~~~~~i~~l~~~I~~L~~~G~evvlVsSGAv~~ 60 (284) T cd04256 7 HAKRIVVKLGSAVVTREDECGLALGRLASIVEQVSELQSQGREVILVTSGAVAF 60 (284) T ss_pred CCCEEEEEECHHHEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHC T ss_conf 487899995702237999997589999999999999997899699990684653 No 26 >cd04239 AAK_UMPK-like AAK_UMPK-like: UMP kinase (UMPK)-like, the microbial/chloroplast uridine monophosphate kinase (uridylate kinase) enzyme that catalyzes UMP phosphorylation and plays a key role in pyrimidine nucleotide biosynthesis. Regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinases of E. coli (Ec) and Pyrococcus furiosus (Pf) are known to function as homohexamers, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial UMPKs have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Als Probab=98.00 E-value=0.00011 Score=51.06 Aligned_cols=88 Identities=19% Similarity=0.287 Sum_probs=56.4 Q ss_pred EEEEECHHHHC------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHH Q ss_conf 99986756724------746899999999998518971799845724578899970898115578346612489999999 Q 537021.9.peg.3 28 IVVKYGGHVMN------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMV 101 (150) Q Consensus 28 iVIK~GG~~l~------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~v 101 (150) +|||+||+.+. |.+.+.+++++|+.|+..|.++|||+|||...... ...+.. . ++..++...+. T Consensus 2 IViKiG~s~lt~~~~~l~~~~i~~la~~i~~l~~~g~evilV~s~G~~a~g~-~~~~~~------~---~~~~~~~~~~~ 71 (229) T cd04239 2 IVLKLSGEALAGEGGGIDPEVLKEIAREIKEVVDLGVEVAIVVGGGNIARGY-IAAARG------M---PRATADYIGML 71 (229) T ss_pred EEEEECHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCH-HHHHCC------C---CCCHHHHHHHH T ss_conf 8999765770189999199999999999999997799799998997344664-344156------6---71359999999 Q ss_pred HHHHHHHHH-HHHHHHCCCCEEEEEE Q ss_conf 741668999-9999828997687786 Q 537021.9.peg.3 102 LAGSINKKI-VSLINQTGTQAIGICG 126 (150) Q Consensus 102 L~G~vn~~l-v~~L~~~g~~avglsg 126 (150) +-..|..+ ...|.+++..+.-.+. T Consensus 72 -a~~~~~~ll~~~l~~~~~~~~~~~~ 96 (229) T cd04239 72 -ATVMNALALQDALEKLGVKTRVMSA 96 (229) T ss_pred -HHHHHHHHHHHHHHHCCCCEEEECH T ss_conf -9999999999999867997898317 No 27 >PRK12314 gamma-glutamyl kinase; Provisional Probab=97.85 E-value=0.00017 Score=49.80 Aligned_cols=46 Identities=20% Similarity=0.249 Sum_probs=39.8 Q ss_pred CCCEEEEEECHHHHC------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf 598899986756724------7468999999999985189717998457245 Q 537021.9.peg.3 24 ENETIVVKYGGHVMN------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQ 69 (150) Q Consensus 24 r~ktiVIK~GG~~l~------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q 69 (150) .-|.+|||+||+++. |.+...+++++|+.++..|.++|||++|.-. T Consensus 8 ~~krIViKiG~s~lt~~~g~~~~~~i~~la~~I~~l~~~G~~vvlVsSGav~ 59 (265) T PRK12314 8 SAKRIVIKVGSSSLSYENGKINLERIEQLVFVISDLMNKGKEVILVSSGAIG 59 (265) T ss_pred CCCEEEEEECHHHEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHH T ss_conf 0878999966112678999859999999999999999789989998637443 No 28 >COG1608 Predicted archaeal kinase [General function prediction only] Probab=97.82 E-value=7.8e-05 Score=52.00 Aligned_cols=100 Identities=20% Similarity=0.407 Sum_probs=67.5 Q ss_pred EEEEECHHHHCCHHH--------HHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHH Q ss_conf 999867567247468--------999999999985189717998457245788999708981155783466124899999 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDL--------SKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVE 99 (150) Q Consensus 28 iVIK~GG~~l~d~~l--------~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~ 99 (150) +|+|+|||++.|.+. ...++..|+- .--.+.|+|||||..=--..++.++ +.-.+ +-++.-...+. T Consensus 3 ~IlKlGGSvITdK~~p~t~r~~~l~ria~eI~~--~~~~~livVHGgGSFGHp~Ak~~~~-~~~~~---~~s~~G~~~~~ 76 (252) T COG1608 3 IILKLGGSVITDKDKPRTVREDRLRRIAREISN--GKPEKLIVVHGGGSFGHPAAKEFGL-EGLKN---YLSPLGFSLTH 76 (252) T ss_pred EEEEECCEEEECCCCCCHHHHHHHHHHHHHHHC--CCCCCEEEEECCCCCCCHHHHHHCC-CCCCC---CCCCCCHHHHH T ss_conf 899955616525887321118999999999845--9954389994586656788997076-54446---66743439999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEE---EE--EEECCEEEE Q ss_conf 997416689999999828997687---78--655898999 Q 537021.9.peg.3 100 MVLAGSINKKIVSLINQTGTQAIG---IC--GKDGNMVFA 134 (150) Q Consensus 100 ~vL~G~vn~~lv~~L~~~g~~avg---ls--g~dg~li~a 134 (150) .++. +.|..++..|.+.|+.|+. +| -.+|++..+ T Consensus 77 ~am~-~L~~~V~~~l~~~Gv~av~~~P~s~~~~~gr~~~~ 115 (252) T COG1608 77 LAML-ELNSIVVDALLDAGVRAVSVVPISFSTFNGRILYT 115 (252) T ss_pred HHHH-HHHHHHHHHHHHCCCCCCCCCCCCEEECCCCEEEC T ss_conf 9999-99999999998669853222686403317703220 No 29 >PRK12352 putative carbamate kinase; Reviewed Probab=97.73 E-value=0.00018 Score=49.74 Aligned_cols=87 Identities=24% Similarity=0.316 Sum_probs=56.8 Q ss_pred CCEEEEEECHHHHCC-------HHHH---HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHH Q ss_conf 988999867567247-------4689---999999999851897179984572457889997089811557834661248 Q 537021.9.peg.3 25 NETIVVKYGGHVMNC-------TDLS---KDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQT 94 (150) Q Consensus 25 ~ktiVIK~GG~~l~d-------~~l~---~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~t 94 (150) .+++||-+||+++.. ++.+ ..-++.|+-|-..|.++||+||-|||+...+.+...-... + .+++-- T Consensus 2 ~k~iViALGGNAll~~g~~~t~~~Q~~~v~~~a~~ia~li~~g~~vvitHGNGPQVG~l~lq~e~a~~~-~---~~p~~P 77 (316) T PRK12352 2 KELVVVAIGGNSIIKDNASQSIEHQAEAVKAVADTVLEMLASDYDIVLTHGNGPQVGLDLRRAEIAHER-E---GLPLTP 77 (316) T ss_pred CCEEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHCC-C---CCCCCC T ss_conf 966999767343289999999999999999999999999976996999817816988999999875336-7---899875 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHH Q ss_conf 999-9999741668999999982 Q 537021.9.peg.3 95 AEV-VEMVLAGSINKKIVSLINQ 116 (150) Q Consensus 95 l~i-v~~vL~G~vn~~lv~~L~~ 116 (150) |++ +.|. -|.+...|..+|.. T Consensus 78 Ld~~~A~s-QG~IGY~lqqal~n 99 (316) T PRK12352 78 LANCVADT-QGGIGYLIQQALNN 99 (316) T ss_pred HHHHHHCC-CHHHHHHHHHHHHH T ss_conf 45541212-20899999999999 No 30 >cd04242 AAK_G5K_ProB AAK_G5K_ProB: Glutamate-5-kinase (G5K) catalyzes glutamate-dependent ATP cleavage; G5K transfers the terminal phosphoryl group of ATP to the gamma-carboxyl group of glutamate, in the first and controlling step of proline (and, in mammals, ornithine) biosynthesis. G5K is subject to feedback allosteric inhibition by proline or ornithine. In microorganisms and plants, proline plays an important role as an osmoprotectant and, in mammals, ornithine biosynthesis is crucial for proper ammonia detoxification, since a G5K mutation has been shown to cause human hyperammonaemia. Microbial G5K generally consists of two domains: a catalytic G5K domain and one PUA (pseudo uridine synthases and archaeosine-specific transglycosylases) domain, and some lack the PUA domain. G5K requires free Mg for activity, it is tetrameric, and it aggregates to higher forms in a proline-dependent way. G5K lacking the PUA domain remains tetrameric, active, and proline-inhibitable, but the Mg requir Probab=97.72 E-value=0.00021 Score=49.30 Aligned_cols=51 Identities=25% Similarity=0.440 Sum_probs=41.1 Q ss_pred EEEEECHHHHCCH------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCC Q ss_conf 9998675672474------6899999999998518971799845724578899970898 Q 537021.9.peg.3 28 IVVKYGGHVMNCT------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIK 80 (150) Q Consensus 28 iVIK~GG~~l~d~------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~ 80 (150) +|||+|++++.++ +...+++++|+.|+..|.++||||+|. |..-...+|+. T Consensus 2 iViKiGss~lt~~~~~~~~~~i~~la~~I~~l~~~G~evvlVsSGA--va~G~~~l~~~ 58 (251) T cd04242 2 IVVKVGSSLLTDEDGGLDLGRLASLVEQIAELRNQGKEVILVSSGA--VAAGRQRLGLE 58 (251) T ss_pred EEEEECCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCH--HHHCHHHCCCC T ss_conf 8999670116899998599999999999999997899799982475--66187653998 No 31 >PRK09411 carbamate kinase; Reviewed Probab=97.67 E-value=0.00075 Score=45.71 Aligned_cols=87 Identities=24% Similarity=0.336 Sum_probs=59.3 Q ss_pred CEEEEEECHHHHCC-------HHHH---HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHH Q ss_conf 88999867567247-------4689---9999999998518971799845724578899970898115578346612489 Q 537021.9.peg.3 26 ETIVVKYGGHVMNC-------TDLS---KDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTA 95 (150) Q Consensus 26 ktiVIK~GG~~l~d-------~~l~---~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl 95 (150) |++||-+||+++.. ++.+ ...++.|+.|.. +.++||.||-|||+...+-+. ...+ -+++-.| T Consensus 2 k~iViALGGNAl~~~g~~~t~e~Q~~~v~~aa~~ia~l~~-~~~vVITHGNGPQVG~l~lq~-~a~~------~~p~~PL 73 (297) T PRK09411 2 KTLVVALGGNALLQRGEALTAENQYRNIASAVPALARLAR-SYRLAIVHGNGPQVGLLALQN-LAWK------EVEPYPL 73 (297) T ss_pred CEEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHH-CCCEEEEECCCHHHHHHHHHH-HHHC------CCCCCCC T ss_conf 7599976814217999998999999999999999999980-893999847812989999998-8632------6999883 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHCCCCE Q ss_conf 99-999974166899999998289976 Q 537021.9.peg.3 96 EV-VEMVLAGSINKKIVSLINQTGTQA 121 (150) Q Consensus 96 ~i-v~~vL~G~vn~~lv~~L~~~g~~a 121 (150) ++ ..|. -|.+...|...|...+... T Consensus 74 d~~~A~s-QG~IGY~l~q~L~~~~~~~ 99 (297) T PRK09411 74 DVLVAES-QGMIGYMLAQSLSAQPQMP 99 (297) T ss_pred CHHHHHH-HHHHHHHHHHHHHHCCCCC T ss_conf 0342422-0699999999998437899 No 32 >TIGR00746 arcC carbamate kinase; InterPro: IPR003964 The arginine dihydrolase (AD) pathway is found in many prokaryotes and some primitive eukaryotes. The three- enzyme anaerobic pathway breaks down L-arginine to form 1 mol of ATP, carbon dioxide and ammonia. In simpler bacteria, the first enzyme, arginine deiminase, can account for up to 100f total cell protein . Carbamate kinase is involved in the last step of the AD pathway, converting carbamoyl phosphate and ADP into ammonia, carbon dioxide and ATP . The second step of the pathway involves the degradation of L-citrulline to carbamoyl phosphate and L-ornithine, using ornithine carbamoyltransferase . The crystal structure of Enterococcus faecium carbamate kinase has been determined to 2.8A resolution . The enzyme exists as a homodimer of two 33kDa subunits. The hallmark of the dimer is a 16-stranded beta-sheet, surrounded by alpha-helices. Each subunit contains an active site within a large crevice.; GO: 0008804 carbamate kinase activity, 0006525 arginine metabolic process. Probab=97.63 E-value=0.00017 Score=49.90 Aligned_cols=94 Identities=23% Similarity=0.357 Sum_probs=63.3 Q ss_pred CEEEEEECHHHHC--CH-----HHH---HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC-CCCCEEECCCCCCCHHH Q ss_conf 8899986756724--74-----689---9999999998518971799845724578899970-89811557834661248 Q 537021.9.peg.3 26 ETIVVKYGGHVMN--CT-----DLS---KDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM-GIKSKFENGLRITDQQT 94 (150) Q Consensus 26 ktiVIK~GG~~l~--d~-----~l~---~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~-gi~~~~v~G~RVTd~~t 94 (150) |++||-+||+||. .+ +.+ +.-+..|+.+...|.++||.||=|||+..++=+. -.+++ .++.+.= T Consensus 1 k~~VvALGGNAlLqRg~~~s~~~Q~~nv~~t~~~l~~~~~~G~~lvi~HGNGPQVG~LlLQ~~A~~~~-----~~~~~~P 75 (321) T TIGR00746 1 KRVVVALGGNALLQRGEKGSAENQRKNVEQTAPQLVKLIKRGYELVITHGNGPQVGLLLLQNQAADSE-----KVVPAMP 75 (321) T ss_pred CEEEEEECCHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHHHC-----CCCCCCC T ss_conf 94889737358863788379889999999999999999861897999847803688899999897603-----7889887 Q ss_pred HHH-HHHHHHHHHHHHHHHHHH----HCCCCEEEEE Q ss_conf 999-999974166899999998----2899768778 Q 537021.9.peg.3 95 AEV-VEMVLAGSINKKIVSLIN----QTGTQAIGIC 125 (150) Q Consensus 95 l~i-v~~vL~G~vn~~lv~~L~----~~g~~avgls 125 (150) |++ .-|. =|.|...|...|+ +.|...--.| T Consensus 76 LDV~GAmS-QG~IGYml~q~l~~~L~~~g~~~~vaT 110 (321) T TIGR00746 76 LDVLGAMS-QGMIGYMLQQALNNELPKEGLEKPVAT 110 (321) T ss_pred CCEECCCC-CCHHHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 43202033-530789999999999986689998538 No 33 >COG0549 ArcC Carbamate kinase [Amino acid transport and metabolism] Probab=97.52 E-value=0.0012 Score=44.47 Aligned_cols=86 Identities=29% Similarity=0.382 Sum_probs=58.4 Q ss_pred CEEEEEECHHHHCCH----------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHH Q ss_conf 889998675672474----------6899999999998518971799845724578899970898115578346612489 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCT----------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTA 95 (150) Q Consensus 26 ktiVIK~GG~~l~d~----------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl 95 (150) +.+||-+||+++... +....-++.|+.|..-|.++||.||-|||+...+.+.-..-+.. +. .+--| T Consensus 1 ~~iVvALGGNAll~~g~~~tae~Q~~~v~~ta~~i~~l~~~g~e~VitHGNGPQVG~l~lq~~aa~~~~-~~---p~~PL 76 (312) T COG0549 1 KRIVVALGGNALLQRGEPLTAEAQYEAVKITAEQIADLIASGYEVVITHGNGPQVGLLLLQNEAADSEK-GV---PAYPL 76 (312) T ss_pred CEEEEEECCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHCCCC-CC---CCCCH T ss_conf 918999353453479999898999999999999999998558859997189756779998766532545-78---88646 Q ss_pred HH-HHHHHHHHHHHHHHHHHHH Q ss_conf 99-9999741668999999982 Q 537021.9.peg.3 96 EV-VEMVLAGSINKKIVSLINQ 116 (150) Q Consensus 96 ~i-v~~vL~G~vn~~lv~~L~~ 116 (150) ++ +.|. -|.+...|..+|.. T Consensus 77 d~~~Ams-QG~IGy~l~qal~n 97 (312) T COG0549 77 DVLVAMS-QGMIGYMLQQALRN 97 (312) T ss_pred HHHHHHH-HHHHHHHHHHHHHH T ss_conf 8886730-12899999999997 No 34 >cd04261 AAK_AKii-LysC-BS AAK_AKii-LysC-BS: Amino Acid Kinase Superfamily (AAK), AKii; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, and the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related sequences. In B. subtilis 168, the regulation of the diaminopimelate (Dap)-lysine biosynthetic pathway involves dual control by Dap and lysine, effected through separate Dap- and lysine-sensitive aspartokinase isoenzymes. The B. subtilis 168 AKII is induced by methionine, and repressed and inhibited by lysine. Although Corynebacterium glutamicum is known to contain a single aspartokinase isoenzyme type, both the succinylase and dehydrogenase variant pathways of DAP-lysine synthesis operate simultaneously in this organism. In this organism and other various Gram-positive bacteria, the DAP-lysine pathway is feedback regulated by the concerted action of lysine and Probab=97.46 E-value=0.0019 Score=43.10 Aligned_cols=98 Identities=20% Similarity=0.299 Sum_probs=63.9 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC-CHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 8999867567247468999999999985189717998457-245788999708981155783466124899999997416 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG-GPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGS 105 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg-g~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~ 105 (150) .+|.||||+-+.|++.+..+++-|......|.++++|.++ |.--|++++...- ..+ --++..++.+.. .|+ T Consensus 1 ~iV~KFGGtSv~~~e~i~~va~iI~~~~~~~~~vvVVvSA~gg~Td~Li~la~~---~~~---~~~~~e~d~ils--~GE 72 (239) T cd04261 1 LIVQKFGGTSVASIERIKRVAERIKKRKKKGNQVVVVVSAMGGTTDELIELAKE---ISP---RPPARELDVLLS--TGE 72 (239) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHH---HCC---CCCHHHHHHHHH--HHH T ss_conf 989997820139999999999999989866992899977987536889999998---706---999899988877--779 Q ss_pred -HHHH-HHHHHHHCCCCEEEEEEEECCEE Q ss_conf -6899-99999828997687786558989 Q 537021.9.peg.3 106 -INKK-IVSLINQTGTQAIGICGKDGNMV 132 (150) Q Consensus 106 -vn~~-lv~~L~~~g~~avglsg~dg~li 132 (150) .+.. +...|++.|.++..+++.+-.++ T Consensus 73 ~lsa~ll~~~L~~~G~~a~~~~~~~~~i~ 101 (239) T cd04261 73 QVSIALLAMALNRLGIKAISLTGWQAGIL 101 (239) T ss_pred HHHHHHHHHHHHHCCCCCEEEEHHHCCEE T ss_conf 99999999999976997579686667543 No 35 >PRK12354 carbamate kinase; Reviewed Probab=97.25 E-value=0.0021 Score=42.91 Aligned_cols=82 Identities=23% Similarity=0.291 Sum_probs=54.6 Q ss_pred EEEEEECHHHHCC-------HH---HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHH Q ss_conf 8999867567247-------46---8999999999985189717998457245788999708981155783466124899 Q 537021.9.peg.3 27 TIVVKYGGHVMNC-------TD---LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAE 96 (150) Q Consensus 27 tiVIK~GG~~l~d-------~~---l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~ 96 (150) .+||-+||+++.. ++ ....-++.|+.|.. |.++||+||-|||+...+.+.. ... + +++--|+ T Consensus 2 riViALGGNAll~~g~~~t~~~Q~~~v~~aa~~ia~l~~-g~~vvitHGNGPQVG~i~l~~~-~~~--~----~p~~PLd 73 (302) T PRK12354 2 RIVIALGGNALLRRGEPMTAENQRANIRIAAEQIAKIAP-GNEVVIVHGNGPQVGLLALQNA-AYK--A----VPPYPLD 73 (302) T ss_pred EEEEECCHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHC-CCEEEEEECCCHHHHHHHHHHH-HCC--C----CCCCCHH T ss_conf 499982735317999999999999999999999999975-9959998278168899999877-416--8----9998826 Q ss_pred H-HHHHHHHHHHHHHHHHHHHC Q ss_conf 9-99997416689999999828 Q 537021.9.peg.3 97 V-VEMVLAGSINKKIVSLINQT 117 (150) Q Consensus 97 i-v~~vL~G~vn~~lv~~L~~~ 117 (150) + +.|. -|.+...|..+|... T Consensus 74 ~~~A~S-QG~IGY~lqqaL~n~ 94 (302) T PRK12354 74 VLGAET-EGMIGYMLEQELGNL 94 (302) T ss_pred HHHHHC-CHHHHHHHHHHHHHH T ss_conf 764521-169999999999987 No 36 >cd04246 AAK_AK-DapG-like AAK_AK-DapG-like: Amino Acid Kinase Superfamily (AAK), AK-DapG-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional enzymes found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species, as well as, the catalytic AK domain of the lysine-sensitive aspartokinase isoenzyme AKII of Bacillus subtilis 168, the lysine plus threonine-sensitive aspartokinase of Corynebacterium glutamicum, and related isoenzymes. In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. The role of the AKI isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulati Probab=97.16 E-value=0.0084 Score=38.99 Aligned_cols=96 Identities=18% Similarity=0.199 Sum_probs=61.3 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHH Q ss_conf 89998675672474689999999999851897179984572457889997089811557834661248999999974166 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSI 106 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~v 106 (150) .+|.||||+-+.|++.++++++-|......|.++++|.++=--++..|-++.-+. . .-.++..++.+... .-.. T Consensus 1 ~iV~KFGGtSv~~~e~i~~va~iI~~~~~~g~~viVVvSA~gg~Td~L~~l~~~~--~---~~~~~~~~d~ils~-GE~l 74 (239) T cd04246 1 IIVQKFGGTSVADIERIKRVAERIKKAVKKGYQVVVVVSAMGGTTDELIGLAKEV--S---PRPSPRELDMLLST-GEQI 74 (239) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHH--C---CCCCHHHHHHHHHH-HHHH T ss_conf 9899878100499999999999999998679829999848986567799988876--4---77258899999998-8999 Q ss_pred HHH-HHHHHHHCCCCEEEEEEEE Q ss_conf 899-9999982899768778655 Q 537021.9.peg.3 107 NKK-IVSLINQTGTQAIGICGKD 128 (150) Q Consensus 107 n~~-lv~~L~~~g~~avglsg~d 128 (150) +.. +...|++.|+++.-+.+.+ T Consensus 75 s~~lla~~L~~~G~~a~~~~~~~ 97 (239) T cd04246 75 SAALLAMALNRLGIKAISLTGWQ 97 (239) T ss_pred HHHHHHHHHHHCCCCCEEEEHHH T ss_conf 99999999997599808857323 No 37 >cd04253 AAK_UMPK-PyrH-Pf AAK_UMPK-PyrH-Pf: UMP kinase (UMPK)-Pf, the mostly archaeal uridine monophosphate kinase (uridylate kinase) enzymes that catalyze UMP phosphorylation and play a key role in pyrimidine nucleotide biosynthesis; regulation of this process is via feed-back control and via gene repression of carbamoyl phosphate synthetase (the first enzyme of the pyrimidine biosynthesis pathway). The UMP kinase of Pyrococcus furiosus (Pf) is known to function as a homohexamer, with GTP and UTP being allosteric effectors. Like other related enzymes (carbamate kinase, aspartokinase, and N-acetylglutamate kinase) the E. coli and most bacterial UMPKs have a conserved, N-terminal, lysine residue proposed to function in the catalysis of the phosphoryl group transfer, whereas most archaeal UMPKs (this CD) appear to lack this residue and the Pyrococcus furiosus structure has an additional Mg ion bound to the ATP molecule which is proposed to function as the catalysis instead. Members of thi Probab=97.16 E-value=0.0015 Score=43.73 Aligned_cols=44 Identities=27% Similarity=0.486 Sum_probs=36.0 Q ss_pred EEEEECHHHHCC---HHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHH Q ss_conf 999867567247---468999999999985189717998457245788 Q 537021.9.peg.3 28 IVVKYGGHVMNC---TDLSKDFVNDIALLKKSNITPVIVHGGGPQIGA 72 (150) Q Consensus 28 iVIK~GG~~l~d---~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~ 72 (150) +|||+||+++.+ .+...++++.|+.++. |.++|+|+|||..... T Consensus 2 IViKiGgs~l~~~~~~~~i~~la~~i~~l~~-~~~~iiVvggG~~ar~ 48 (221) T cd04253 2 IVISLGGSVLAPEKDADFIKEYANVLRKISD-GHKVAVVVGGGRLARE 48 (221) T ss_pred EEEEEEHHHCCCCCCHHHHHHHHHHHHHHHC-CCEEEEEECCCHHHHH T ss_conf 7999607770799887999999999999975-9969999877778876 No 38 >COG0528 PyrH Uridylate kinase [Nucleotide transport and metabolism] Probab=97.00 E-value=0.014 Score=37.60 Aligned_cols=97 Identities=19% Similarity=0.257 Sum_probs=69.9 Q ss_pred HCCCEEEEEECHHHHCC-------HHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHH Q ss_conf 75988999867567247-------46899999999998518971799845724578899970898115578346612489 Q 537021.9.peg.3 23 YENETIVVKYGGHVMNC-------TDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTA 95 (150) Q Consensus 23 ~r~ktiVIK~GG~~l~d-------~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl 95 (150) .+-+-+|+|+||+++.. ++....++++|..+...|.++-+|-|||.-.-....... |.--+ +. T Consensus 3 ~~~~rillkLsGe~l~g~~~~gid~~~i~~~a~~i~~~~~~g~eV~iVvGGGni~Rg~~~~~~-------g~~r~---~~ 72 (238) T COG0528 3 PKYMRILLKLSGEALAGEQGFGIDPEVLDRIANEIKELVDLGVEVAVVVGGGNIARGYIGAAA-------GMDRV---TA 72 (238) T ss_pred CCEEEEEEEEECCEECCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHC-------CCCHH---HH T ss_conf 633799999421036478888879899999999999998669689999789789976789876-------98512---24 Q ss_pred HHHHHHHHHHHHH-HHHHHHHHCCCCEEEEEEEECC Q ss_conf 9999997416689-9999998289976877865589 Q 537021.9.peg.3 96 EVVEMVLAGSINK-KIVSLINQTGTQAIGICGKDGN 130 (150) Q Consensus 96 ~iv~~vL~G~vn~-~lv~~L~~~g~~avglsg~dg~ 130 (150) +..=| |+-.+|. -|-..|.+.|+++.-+|..... T Consensus 73 D~mGm-laTvmNal~L~~aL~~~~~~~~v~sai~~~ 107 (238) T COG0528 73 DYMGM-LATVMNALALQDALERLGVDTRVQSAIAMP 107 (238) T ss_pred HHHHH-HHHHHHHHHHHHHHHHCCCCCEECCCCCCC T ss_conf 68889-999999999999986358761221311176 No 39 >PRK08841 aspartate kinase; Validated Probab=96.73 E-value=0.024 Score=36.09 Aligned_cols=99 Identities=16% Similarity=0.253 Sum_probs=66.3 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC-CHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHH Q ss_conf 88999867567247468999999999985189717998457-24578899970898115578346612489999999741 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG-GPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAG 104 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg-g~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G 104 (150) +.+|.||||+-+.+++.++++++-|..-...|.++|+|.-+ +..-|++++.... ++ .+-+++.|+.+... .- T Consensus 2 ~~iV~KFGGTSv~~~e~i~~va~~i~~~~~~~~~vvVVvSAm~g~Td~Ll~la~~----i~--~~~~~relD~lls~-GE 74 (392) T PRK08841 2 PLIVQKFGGTSVGSIERIQTVAEHIIKAKNDGNQVVVVVSAMAGETNRLLGLAKQ----VD--SVPTARELDVLLSA-GE 74 (392) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHH----HC--CCCCHHHHHHHHHH-HH T ss_conf 8799915863579999999999999988756998999962899968999999986----32--59998999999976-59 Q ss_pred HHHHHH-HHHHHHCCCCEEEEEEEECCE Q ss_conf 668999-999982899768778655898 Q 537021.9.peg.3 105 SINKKI-VSLINQTGTQAIGICGKDGNM 131 (150) Q Consensus 105 ~vn~~l-v~~L~~~g~~avglsg~dg~l 131 (150) ++...| ...|++.|.+|..+++.+..+ T Consensus 75 ~lS~~lla~~L~~~G~~a~~l~~~~~~i 102 (392) T PRK08841 75 QVSMALLAMTLNKLGYKARSLTGAQANI 102 (392) T ss_pred HHHHHHHHHHHHHCCCCCEEEECHHCCC T ss_conf 9999999999986799757987123351 No 40 >PRK07431 aspartate kinase; Provisional Probab=96.43 E-value=0.05 Score=34.01 Aligned_cols=102 Identities=17% Similarity=0.246 Sum_probs=69.5 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 88999867567247468999999999985189717998457245788999708981155783466124899999997416 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGS 105 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~ 105 (150) +.+|-||||+-+.|.+.++++++-|...+..|.++|+|--+--..+..|-++.-+ ..+ -.+++.|+.+... .-+ T Consensus 2 ~lIV~KFGGTSVa~~e~i~~va~~I~~~~~~g~~vvVVVSAmg~~Td~L~~la~~--i~~---~~~~re~D~lls~-GE~ 75 (594) T PRK07431 2 ALIVQKFGGTSVGSVERIQAVAQRIARTKEAGNDVVVVVSAMGKTTDELVKLAEE--ISD---NPPRREMDMLLST-GEQ 75 (594) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHH--HCC---CCCHHHHHHHHHH-HHH T ss_conf 7499842876579899999999999999875899999974998729999999998--747---9998999999977-599 Q ss_pred HHHHHH-HHHHHCCCCEEEEEEEECCEEE Q ss_conf 689999-9998289976877865589899 Q 537021.9.peg.3 106 INKKIV-SLINQTGTQAIGICGKDGNMVF 133 (150) Q Consensus 106 vn~~lv-~~L~~~g~~avglsg~dg~li~ 133 (150) +...|. ..|+..|.+|+.++|...++++ T Consensus 76 ~S~~Lla~~L~~~G~~a~sl~~~q~gi~T 104 (594) T PRK07431 76 VSIALLSMALQELGQPAISLTGAQVGIVT 104 (594) T ss_pred HHHHHHHHHHHHCCCCCEEEEHHHCCEEC T ss_conf 99999999999689971992000161776 No 41 >PRK08210 aspartate kinase I; Reviewed Probab=96.43 E-value=0.08 Score=32.69 Aligned_cols=100 Identities=19% Similarity=0.326 Sum_probs=63.0 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH------HHHHHHHHCCCCCEEECCCCCCCHHHHHHHH Q ss_conf 8899986756724746899999999998518971799845724------5788999708981155783466124899999 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP------QIGAVLEKMGIKSKFENGLRITDQQTAEVVE 99 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~------qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~ 99 (150) |++|.||||+-+.|++.+..+++-|..-...|.++|+|.-+=- ..|.+++...-. .+--++..++.+. T Consensus 2 k~iV~KFGGtSv~~~~~i~~v~~ii~~~~~~~~~~vVVvSAmg~~~~~~~td~L~~l~~~~------~~~~~~~~~D~l~ 75 (405) T PRK08210 2 KIIVQKFGGTSVSTEERRKMVVNKIKKALKEGYKVVVVVSAMGRKGDPYATDTLLSLVGEE------FSDISKREQDLLM 75 (405) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHH------CCCCCHHHHHHHH T ss_conf 8798816850269999999999999878866998899979989899811499999999865------0259878899998 Q ss_pred HHHHHHHHHHH-HHHHHHCCCCEEEEEEEECCEE Q ss_conf 99741668999-9999828997687786558989 Q 537021.9.peg.3 100 MVLAGSINKKI-VSLINQTGTQAIGICGKDGNMV 132 (150) Q Consensus 100 ~vL~G~vn~~l-v~~L~~~g~~avglsg~dg~li 132 (150) . ..-.+...| ...|++.|++|.-+.+.+..++ T Consensus 76 s-~GE~~S~~l~a~~L~~~g~~a~~~~~~~~~i~ 108 (405) T PRK08210 76 S-CGEIISSVVFSNMLNEAGIKAVALTGGQAGII 108 (405) T ss_pred H-HHHHHHHHHHHHHHHHCCCCCEEEEHHHCCEE T ss_conf 7-75899999999999747998679774158478 No 42 >cd04240 AAK_UC AAK_UC: Uncharacterized (UC) amino acid kinase-like proteins found mainly in archaea and a few bacteria. Sequences in this CD are members of the Amino Acid Kinase (AAK) superfamily. Probab=96.36 E-value=0.0038 Score=41.17 Aligned_cols=41 Identities=37% Similarity=0.463 Sum_probs=29.9 Q ss_pred EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH Q ss_conf 9986756724746899999999998518971799845724578899 Q 537021.9.peg.3 29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVL 74 (150) Q Consensus 29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l 74 (150) |||+|||.+.+. .++++.+..+ .|.+++||.|||+..|..= T Consensus 1 VvKiGGSL~~~~---~~ll~~l~~~--~~~~v~IVpGGG~fAd~vR 41 (203) T cd04240 1 VVKIGGSLIREA---VRLLRWLKTL--SGGGVVIVPGGGPFADVVR 41 (203) T ss_pred CEEEEEECCCCH---HHHHHHHHHH--CCCCEEEEECCHHHHHHHH T ss_conf 979820564668---9999999983--5997899979808999999 No 43 >cd04260 AAK_AKi-DapG-BS AAK_AKi-DapG-BS: Amino Acid Kinase Superfamily (AAK), AKi-DapG; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the diaminopimelate-sensitive aspartokinase isoenzyme AKI (DapG), a monofunctional class enzyme found in Bacilli (Bacillus subtilis 168), Clostridia, and Actinobacteria bacterial species. In Bacillus subtilis, the regulation of the diaminopimelate-lysine biosynthetic pathway involves dual control by diaminopimelate and lysine, effected through separate diaminopimelate- and lysine-sensitive aspartokinase isoenzymes. AKI activity is invariant during the exponential and stationary phases of growth and is not altered by addition of amino acids to the growth medium. The role of this isoenzyme is most likely to provide a constant level of aspartyl-beta-phosphate for the biosynthesis of diaminopimelate for peptidoglycan synthesis and dipicolinate during sporulation. The B. subtilis AKI is tetrameric consisting of two alpha and two bet Probab=96.35 E-value=0.068 Score=33.18 Aligned_cols=99 Identities=13% Similarity=0.260 Sum_probs=62.2 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCC-H-----HHHHHHHHCCCCCEEECCCCCCCHHHHHHHHH Q ss_conf 89998675672474689999999999851897179984572-4-----57889997089811557834661248999999 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGG-P-----QIGAVLEKMGIKSKFENGLRITDQQTAEVVEM 100 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg-~-----qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~ 100 (150) .+|.||||+-+.|++-+.++++-|..-...|.++++|..+= . ..|++++...-.. +--+++.++.+-. T Consensus 1 iiV~KFGGtSv~~~~~i~~va~iI~~~~~~~~~~vVVvSAmg~~~~~~~td~L~~l~~~~~------~~~~~~~~D~l~s 74 (244) T cd04260 1 IIVQKFGGTSVSTKERREQVAKKVKQAVDEGYKPVVVVSAMGRKGDPYATDTLINLVYAEN------SDISPRELDLLMS 74 (244) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHC------CCCCHHHHHHHHH T ss_conf 9898118523799999999999998898779988999799999997589999999999753------0389899999997 Q ss_pred HHHHHHHHHH-HHHHHHCCCCEEEEEEEECCEE Q ss_conf 9741668999-9999828997687786558989 Q 537021.9.peg.3 101 VLAGSINKKI-VSLINQTGTQAIGICGKDGNMV 132 (150) Q Consensus 101 vL~G~vn~~l-v~~L~~~g~~avglsg~dg~li 132 (150) +.-.+...| ...|++.|+++.-+.+.+-.++ T Consensus 75 -~GE~lSa~lla~~L~~~Gi~a~~~~~~~~~i~ 106 (244) T cd04260 75 -CGEIISAVVLTSTLRAQGLKAVALTGAQAGIL 106 (244) T ss_pred -HCCHHHHHHHHHHHHHCCCCCEEEEHHHCCEE T ss_conf -43099999999999964998748658888353 No 44 >TIGR02076 pyrH_arch uridylate kinase, putative; InterPro: IPR011818 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the archaeal and spirochete proteins.; GO: 0009041 uridylate kinase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=96.17 E-value=0.0059 Score=39.98 Aligned_cols=75 Identities=21% Similarity=0.325 Sum_probs=55.7 Q ss_pred EEEEECHHHHCCH----HHHHHHHHHHHHHHHCC--CCEEEEECCCHHHHHH---HHHC--CCCCEEECCCCCCCHHHHH Q ss_conf 9998675672474----68999999999985189--7179984572457889---9970--8981155783466124899 Q 537021.9.peg.3 28 IVVKYGGHVMNCT----DLSKDFVNDIALLKKSN--ITPVIVHGGGPQIGAV---LEKM--GIKSKFENGLRITDQQTAE 96 (150) Q Consensus 28 iVIK~GG~~l~d~----~l~~~~~~dI~ll~~~G--~k~ViVHGgg~qI~~~---l~~~--gi~~~~v~G~RVTd~~tl~ 96 (150) +||++||+++.++ +.+.+|++-|..+..-| .++.+|-|||+-.-++ .+++ |..-.|.|= | T Consensus 1 iV~~LGGsV~~~~~~d~e~~~~~A~~l~~~~~~Gek~~v~vVvGGG~~AR~YI~~aR~lnPG~~E~~lD~--------i- 71 (232) T TIGR02076 1 IVISLGGSVLSPENIDAEKIKEYANILRKLSDEGEKHKVGVVVGGGKTAREYIGVARELNPGASETFLDE--------I- 71 (232) T ss_pred CEEECCCEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHHCCCCCCHHHHH--------H- T ss_conf 9783345161688888789999999999998449946899887887658899999986088888024567--------7- Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999997416689999999 Q 537021.9.peg.3 97 VVEMVLAGSINKKIVSLI 114 (150) Q Consensus 97 iv~~vL~G~vn~~lv~~L 114 (150) -+.+-++|..|...+ T Consensus 72 ---GI~~TRLNA~LLi~a 86 (232) T TIGR02076 72 ---GIDATRLNAMLLIAA 86 (232) T ss_pred ---HHHHHHHHHHHHHHH T ss_conf ---799999999999987 No 45 >PRK05429 gamma-glutamyl kinase; Provisional Probab=96.15 E-value=0.11 Score=31.92 Aligned_cols=56 Identities=23% Similarity=0.396 Sum_probs=45.8 Q ss_pred CCCEEEEEECHHHHCCH------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCC Q ss_conf 59889998675672474------68999999999985189717998457245788999708981 Q 537021.9.peg.3 24 ENETIVVKYGGHVMNCT------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKS 81 (150) Q Consensus 24 r~ktiVIK~GG~~l~d~------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~ 81 (150) .-|.+|||+|-+.+.++ ....+++++|+.|+..|.++|||--|- |..-+..++++. T Consensus 7 ~~krIVIKiGSs~Lt~~~g~l~~~~i~~l~~~I~~L~~~G~evilVSSGA--Ia~G~~~L~~~~ 68 (372) T PRK05429 7 DARRIVVKVGSSLLTDDGGGLDRARIAELARQIAALRAAGHEVVLVSSGA--VAAGRSRLGLPK 68 (372) T ss_pred CCCEEEEECCCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCHH--HHHHHHHCCCCC T ss_conf 28789999174336898888889999999999999997899999988407--986277609998 No 46 >PRK13402 gamma-glutamyl kinase; Provisional Probab=96.10 E-value=0.019 Score=36.72 Aligned_cols=55 Identities=22% Similarity=0.265 Sum_probs=44.2 Q ss_pred CCEEEEEECHHHHCC------HHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCC Q ss_conf 988999867567247------468999999999985189717998457245788999708981 Q 537021.9.peg.3 25 NETIVVKYGGHVMNC------TDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKS 81 (150) Q Consensus 25 ~ktiVIK~GG~~l~d------~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~ 81 (150) -|.+|||+|-+.+.+ ...+.+++++|+.|+..|.++|||-.|.-.. -+..++.+. T Consensus 5 ~kRIVIKiGSs~lt~~~g~l~~~~i~~l~~~I~~L~~~G~evilVSSGAia~--G~~~L~~~~ 65 (363) T PRK13402 5 WKRIVVKVGSALITPHKQGCSSHYLLGIAQFIVYLRAQGHQVVLVSSGSVAA--GYHKLGFID 65 (363) T ss_pred CCEEEEEECCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH--HHHHCCCCC T ss_conf 8669999670106899998789999999999999997899899988787996--477609976 No 47 >COG0263 ProB Glutamate 5-kinase [Amino acid transport and metabolism] Probab=96.05 E-value=0.064 Score=33.34 Aligned_cols=89 Identities=22% Similarity=0.356 Sum_probs=59.3 Q ss_pred CCEEEEEECHHHHCCH------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHH Q ss_conf 9889998675672474------6899999999998518971799845724578899970898115578346612489999 Q 537021.9.peg.3 25 NETIVVKYGGHVMNCT------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVV 98 (150) Q Consensus 25 ~ktiVIK~GG~~l~d~------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv 98 (150) -+++|||+|-+.+.++ ..+..++++++.|+..|.++|||--|. |-.=+..+|++.+-.+ +..++.+.. T Consensus 6 ~~riVvKiGSs~Lt~~~g~l~~~~l~~l~~~ia~L~~~G~eVilVSSGA--iaaG~~~Lg~~~rp~~---l~~kQA~AA- 79 (369) T COG0263 6 ARRIVVKIGSSSLTDGTGGLDRSKLEELVRQVAALHKAGHEVVLVSSGA--IAAGRTRLGLPKRPKT---LAEKQAAAA- 79 (369) T ss_pred CEEEEEEECCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCCH--HHHCHHHCCCCCCCCC---HHHHHHHHH- T ss_conf 5379999775023579998298999999999999986898899983446--6517444199988761---288899987- Q ss_pred HHHHHHHHH--HHHHHHHHHCCCCEEE Q ss_conf 999741668--9999999828997687 Q 537021.9.peg.3 99 EMVLAGSIN--KKIVSLINQTGTQAIG 123 (150) Q Consensus 99 ~~vL~G~vn--~~lv~~L~~~g~~avg 123 (150) .||+. ..-.+.+.++|...-- T Consensus 80 ----VGQ~~Lm~~y~~~f~~~g~~v~Q 102 (369) T COG0263 80 ----VGQVRLMQLYEELFARYGIKVGQ 102 (369) T ss_pred ----HCHHHHHHHHHHHHHHCCCEEEE T ss_conf ----37999999999999860974568 No 48 >PRK12443 uridylate kinase; Reviewed Probab=95.94 E-value=0.024 Score=36.07 Aligned_cols=91 Identities=20% Similarity=0.311 Sum_probs=57.8 Q ss_pred CEEEEEECHHHHC-------CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH-HHHCCCCCEEECCCCCCCHHHHHH Q ss_conf 8899986756724-------74689999999999851897179984572457889-997089811557834661248999 Q 537021.9.peg.3 26 ETIVVKYGGHVMN-------CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV-LEKMGIKSKFENGLRITDQQTAEV 97 (150) Q Consensus 26 ktiVIK~GG~~l~-------d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~-l~~~gi~~~~v~G~RVTd~~tl~i 97 (150) |.+++|++|+++. |++....++++|..++..|.++-||-|||.-.--. .+..|++ |+| .+.| T Consensus 5 kRVllKlSGEaL~g~~~~g~d~~~l~~ia~eI~~~~~~G~evaiVvGGGNi~RG~~~~~~gid-------r~~-aD~i-- 74 (247) T PRK12443 5 KRVLIKLSGGALADQTGNSFNSKRLEHIANEILSIVDLGIEVSIVIGGGNIFRGHLAEEWGID-------RVE-ADNI-- 74 (247) T ss_pred CEEEEEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHCCCC-------CCH-HHHH-- T ss_conf 789999738882899988879999999999999999779879999768850232236553987-------310-3688-- Q ss_pred HHHHHHHHHHHHHH-HHHHH-CCCCEEEEEEEE Q ss_conf 99997416689999-99982-899768778655 Q 537021.9.peg.3 98 VEMVLAGSINKKIV-SLINQ-TGTQAIGICGKD 128 (150) Q Consensus 98 v~~vL~G~vn~~lv-~~L~~-~g~~avglsg~d 128 (150) =| |+-.+|..+. +.|.+ .|+.+.-+|... T Consensus 75 -GM-LaTvmNal~L~~~l~~~~g~~~rv~sai~ 105 (247) T PRK12443 75 -GT-LGTIINSLMLRGVLTSKTNKEVRVMTSIP 105 (247) T ss_pred -HH-HHHHHHHHHHHHHHHHHCCCCEEEEECCC T ss_conf -89-99999999999999975599469973233 No 49 >TIGR01027 proB glutamate 5-kinase; InterPro: IPR005715 L-glutamate 5-phosphotransferase, (gamma-glutamyl kinase, proB, 2.7.2.11 from EC), catalyzes the first step in proline biosynthesisATP + L-glutamate = ADP + L-glutamate 5-phosphate. the product of which rapidly cyclises to 5-oxoproline and phosphate. Bacterial ProB proteins hits the full length of this model, but the ProB-like domain of delta 1-pyrroline-5-carboxylate synthetase does not hit the C-terminal 100 residues. ; GO: 0004349 glutamate 5-kinase activity, 0006561 proline biosynthetic process, 0005737 cytoplasm. Probab=95.92 E-value=0.07 Score=33.10 Aligned_cols=86 Identities=22% Similarity=0.382 Sum_probs=59.8 Q ss_pred CEEEEEECHHHHCCH--------HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHH Q ss_conf 889998675672474--------689999999999851897179984572457889997089811557834661248999 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCT--------DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEV 97 (150) Q Consensus 26 ktiVIK~GG~~l~d~--------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~i 97 (150) +++|||+|=|.|.++ .-+..++++|+.|+..|.++|||=-|. |-.=++.+|++.+=.+ .+.++.+.. T Consensus 1 ~riVvK~GsS~Lt~~~g~h~l~~~~i~~lv~~v~~L~~~Gh~vviVSSGA--~AaG~~~LG~~~rP~~---la~KQAlAA 75 (379) T TIGR01027 1 KRIVVKVGSSSLTGSSGSHELDRSRIAELVEQVAALHAAGHEVVIVSSGA--VAAGFEALGLPERPKT---LAEKQALAA 75 (379) T ss_pred CEEEEEEECCEEECCCCCCEECHHHHHHHHHHHHHHHHCCCEEEEEECCH--HHHHHHHCCCCCCCCC---HHHHHHHHH T ss_conf 96899970321117888741377999999999999986599899981675--8862344556999862---567887873 Q ss_pred HHHHHHHHHHH--HHHHHHHHCCCCE Q ss_conf 99997416689--9999998289976 Q 537021.9.peg.3 98 VEMVLAGSINK--KIVSLINQTGTQA 121 (150) Q Consensus 98 v~~vL~G~vn~--~lv~~L~~~g~~a 121 (150) .||+.- .=...++.+|.+. T Consensus 76 -----VGQ~~Lm~~y~~~F~~Yg~~~ 96 (379) T TIGR01027 76 -----VGQVRLMQLYENLFSSYGLKV 96 (379) T ss_pred -----CCHHHHHHHHHHHHHHCCCCH T ss_conf -----145689999999997548822 No 50 >PRK06635 aspartate kinase; Reviewed Probab=95.87 E-value=0.1 Score=32.04 Aligned_cols=100 Identities=14% Similarity=0.211 Sum_probs=61.6 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 88999867567247468999999999985189717998457245788999708981155783466124899999997416 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGS 105 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~ 105 (150) +.+|.||||+-+.|++.++.+++-|..-...|.++++|.-+---++..|-.+.-+ +.+ --+++.++.+.. .|+ T Consensus 2 ~~iV~KFGGtSv~~~~~~~~v~~ii~~~~~~~~~~vvVvSA~~~~Td~Li~la~~--~~~---~~~~~e~d~ils--~GE 74 (402) T PRK06635 2 ALIVQKFGGTSVGDVERIKNVAERVKAEVEAGHQVVVVVSAMGGTTDELLDLAKE--VSP---APDPRELDMLLS--TGE 74 (402) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHH--HCC---CCCHHHHHHHHH--HHH T ss_conf 7799957850369999999999999878872999899964999878999999998--607---998799999998--717 Q ss_pred -HHHHH-HHHHHHCCCCEEEEEEEECCEE Q ss_conf -68999-9999828997687786558989 Q 537021.9.peg.3 106 -INKKI-VSLINQTGTQAIGICGKDGNMV 132 (150) Q Consensus 106 -vn~~l-v~~L~~~g~~avglsg~dg~li 132 (150) +...| ...|++.|++|.-+.+.+..++ T Consensus 75 ~~S~~l~a~~L~~~g~~a~~~~~~~~~i~ 103 (402) T PRK06635 75 QVSVALLAMALQSLGVKARSFTGWQAPII 103 (402) T ss_pred HHHHHHHHHHHHHCCCCCEEECHHHCCEE T ss_conf 99999999999857998589147878553 No 51 >TIGR02075 pyrH_bact uridylate kinase; InterPro: IPR015963 Uridylate kinases (also known as UMP kinases) are key enzymes in the synthesis of nucleoside triphosphates. They catalyse the reversible transfer of the gamma-phosphoryl group from an ATP donor to UMP, yielding UDP, which is the starting point for the synthesis of all other pyrimidine nucleotides. The eukaryotic enzyme has a dual specificity, phosphorylating both UMP and CMP, while the bacterial enzyme is specific to UMP. The bacterial enzyme shows no sequence similarity to the eukaryotic enzyme or other nucleoside monophosphate kinases, but rather appears to be part of the amino acid kinase family. It is dependent on magnesium for activity and is activated by GTP and repressed by UTP , . In many bacterial genomes, the gene tends to be located immediately downstream of elongation factor T and upstream of ribosome recycling factor. A related protein family, believed to be equivalent in function is found in the archaea and in spirochetes. Structurally, the bacterial and archaeal proteins are homohexamers centred around a hollow nucleus and organised as a trimer of dimers , . Each monomer within the protein forms the amino acid kinase fold and can be divided into an N-terminal region which binds UMP and mediates intersubunit interactions within the dimer, and a C-terminal region which binds ATP and contains a mobile loop covering the active site. Inhibition of enzyme activity by UTP appears to be due to competition for the binding site for UMP, not allosteric inhibition as was previously suspected. This entry represents the bacterial proteins.. Probab=95.76 E-value=0.2 Score=30.13 Aligned_cols=100 Identities=20% Similarity=0.285 Sum_probs=73.6 Q ss_pred CEEEEEECHHHHCC-H-------HHHHHHHHHHHHHHHCCCCEEEEECCCHHH-HHHHHHCCCCCEEECCCCCCCHHHHH Q ss_conf 88999867567247-4-------689999999999851897179984572457-88999708981155783466124899 Q 537021.9.peg.3 26 ETIVVKYGGHVMNC-T-------DLSKDFVNDIALLKKSNITPVIVHGGGPQI-GAVLEKMGIKSKFENGLRITDQQTAE 96 (150) Q Consensus 26 ktiVIK~GG~~l~d-~-------~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI-~~~l~~~gi~~~~v~G~RVTd~~tl~ 96 (150) |-+++|++|++|.+ + ..++.++++|..++..|+.+=||=|||-=. ...+++.|++ | ...+-|- T Consensus 2 kRvLlKLSGEaL~g~~~~f~ID~~~l~~iA~eik~~~~~G~evgIViGGGNIfRG~~~~~~Gi~-------R-~~aDymG 73 (236) T TIGR02075 2 KRVLLKLSGEALAGKDSQFGIDPKRLNRIANEIKEVVKKGIEVGIVIGGGNIFRGKSAAELGID-------R-VSADYMG 73 (236) T ss_pred CEEEEEECCHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHCCCC-------E-EECCCCC T ss_conf 6489996104324788886426899999999999998679489999778278877889874896-------1-1143230 Q ss_pred HHHHHHHHHHHH-HHHHHHHHCCCCEEEEE-EEECCEEEEEEC Q ss_conf 999997416689-99999982899768778-655898999974 Q 537021.9.peg.3 97 VVEMVLAGSINK-KIVSLINQTGTQAIGIC-GKDGNMVFAEKA 137 (150) Q Consensus 97 iv~~vL~G~vn~-~lv~~L~~~g~~avgls-g~dg~li~a~~~ 137 (150) .. +--+|- -|.++|.+.|+++.=+| ...-+--.|++. T Consensus 74 ML----ATviNglaL~~~L~~~g~~~~V~Sfai~~~~~i~E~Y 112 (236) T TIGR02075 74 ML----ATVINGLALKDALEKLGLKTRVLSFAISMPKQICESY 112 (236) T ss_pred HH----HHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCC T ss_conf 58----9999999999888736995689877753586413232 No 52 >COG0527 LysC Aspartokinases [Amino acid transport and metabolism] Probab=95.64 E-value=0.18 Score=30.41 Aligned_cols=107 Identities=12% Similarity=0.153 Sum_probs=69.6 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC-CC-CC---------------EEECCC-- Q ss_conf 899986756724746899999999998518971799845724578899970-89-81---------------155783-- Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM-GI-KS---------------KFENGL-- 87 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~-gi-~~---------------~~v~G~-- 87 (150) .+|.||||+-+.|.+-+...++=+.....-|.++|+|--+..-++..|-++ .- .+ ++.... T Consensus 3 ~iV~KFGGTSva~~e~i~~va~iv~~~~~~g~~vVVVvSA~~~vTd~Lv~~a~~~~~~~~~~~~~~~~~~~~el~~~~~~ 82 (447) T COG0527 3 LIVQKFGGTSVADAERILRVADIVKEDSEEGVKVVVVVSAMGGVTDLLVALAEGAESGRDAVAEQRHRDIASELILDPFI 82 (447) T ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCHH T ss_conf 08998687553888999999999985463397189997898774589999986331445205788899999987415104 Q ss_pred ---------------------CCCCHHHHHHHHHHHHHHHHHHH-HHHHHHCCCCEEEEEEEECCEEEE Q ss_conf ---------------------46612489999999741668999-999982899768778655898999 Q 537021.9.peg.3 88 ---------------------RITDQQTAEVVEMVLAGSINKKI-VSLINQTGTQAIGICGKDGNMVFA 134 (150) Q Consensus 88 ---------------------RVTd~~tl~iv~~vL~G~vn~~l-v~~L~~~g~~avglsg~dg~li~a 134 (150) +-.++..++.+.-. .-+++..| .+.|+..|++|.-+++.+..+++. T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~lls~-GE~~Sa~lla~~L~~~Gv~A~~~~~~~~~i~t~ 150 (447) T COG0527 83 AARLAEVIAEFKKVLLGIALLGEVSPRERDELLSL-GERLSAALLAAALNALGVDARSLDGRQAGIATD 150 (447) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH-CHHHHHHHHHHHHHHCCCCEEEECHHHCEEEEC T ss_conf 56676666666676666664147987899999850-548899999999996799659975588425535 No 53 >COG5630 ARG2 Acetylglutamate synthase [Amino acid transport and metabolism] Probab=95.35 E-value=0.013 Score=37.85 Aligned_cols=124 Identities=9% Similarity=-0.069 Sum_probs=91.1 Q ss_pred HHHHHHHHHHCC----CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCC Q ss_conf 999999998759----8899986756724746899999999998518971799845724578899970898115578346 Q 537021.9.peg.3 14 EQVLPFVQFYEN----ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRI 89 (150) Q Consensus 14 ~ea~pYi~~~r~----ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RV 89 (150) |+.+-|-..++. ..-+.++++-+..++- ....+.+..+...+.-+++-|||+++-+..+-..|.+ ..+| |+ T Consensus 125 r~i~~~~vfs~~~~~~r~~~s~v~~~v~~~~~--~e~t~s~f~~~~v~~t~ivl~~~~~p~i~e~~i~gn~--i~~G-r~ 199 (495) T COG5630 125 RTGLYHSVFSQESRFFDIGNSNFIPIVKPYVY--NEETASEFMTKDVVKFMDCLCQGNIPHIDKFFILNNA--IPSG-RN 199 (495) T ss_pred HHHHHHHHHHHHCCCCCCCCCCEECCCCCCCC--HHHHHHHHHHHHHHHEEEEECCCCCCCCEEEEECCCC--CCCC-CC T ss_conf 32233110232200024432312035588750--3566778999764001354247888653025424650--3456-55 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEEECCCCCCCCCCCC Q ss_conf 61248999999974166899999998289976877865589899997475211179951 Q 537021.9.peg.3 90 TDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAEKARHSLRLSPNTK 148 (150) Q Consensus 90 Td~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~~~~~~lg~vg~~~ 148 (150) |.++++. +..++ ..++..+...++.++..+-+.++ .++.+.- +.+.+.+++++ T Consensus 200 ~~~h~~~-~~~~~-~~~~~~l~~~~~~~~kR~t~s~~---~l~~~~~-~~de~~~~e~~ 252 (495) T COG5630 200 DNAHVFI-LSQEL-EHLSSSLSHNISTLTKREPRSQN---LLHRMEV-KKDEISSLECE 252 (495) T ss_pred CCCCHHH-HHHHH-HHHHHHHHHCCCCCCCCCCCCCC---HHHHHHH-HHCCCCCCCCC T ss_conf 6621012-78888-88788876246635455766433---0233232-21100004665 No 54 >cd04234 AAK_AK AAK_AK: Amino Acid Kinase Superfamily (AAK), Aspartokinase (AK); this CD includes the N-terminal catalytic domain of aspartokinase (4-L-aspartate-4-phosphotransferase;). AK is the first enzyme in the biosynthetic pathway of the aspartate family of amino acids (lysine, threonine, methionine, and isoleucine) and the bacterial cell wall component, meso-diaminopimelate. It also catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. One mechanism for the regulation of this pathway is by the production of several isoenzymes of aspartokinase with different repressors and allosteric inhibitors. Pairs of ACT domains are proposed to specifically bind amino acids leading to allosteric regulation of the enzyme. In Escherichia coli, three different aspartokinase isoenzymes are regulated specifically by lysine, methionine, and threonine. AK-HSDHI (ThrA) and AK-HSDHII (MetL) are bifunctional enzymes that consist of an N-terminal AK and a C-terminal homoserine dehyd Probab=95.34 E-value=0.22 Score=29.93 Aligned_cols=39 Identities=21% Similarity=0.300 Sum_probs=29.5 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 8999867567247468999999999985189717998457 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG 66 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg 66 (150) .+|.||||+-+.|++.++.+++-|... .-+.++|+|.-+ T Consensus 1 ~~V~KFGGtSv~~~~~i~~v~~ii~~~-~~~~~~vVVvSA 39 (227) T cd04234 1 MVVQKFGGTSVASAERIKRVADIIKAY-EKGNRVVVVVSA 39 (227) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHH-HCCCCEEEEECC T ss_conf 989967830179999999999999865-438988999738 No 55 >cd07041 STAS_RsbR_RsbS_like Sulphate Transporter and Anti-Sigma factor antagonist domain of the "stressosome" complex proteins RsbS and RsbR, regulators of the bacterial stress activated alternative sigma factor sigma-B by phosphorylation. The STAS (Sulphate Transporter and Anti-Sigma factor antagonist) domain of proteins related to RsbS and RsbR which are part of the "stressosome" complex that plays an important role in the regulation of the bacterial stress activated alternative sigma factor sigma-B. During stress conditions RsbS and RsbR are phosphorylated which leads to the release of RsbT, an activator of of the RsbU phosphatase, which in turn activates RsbV which leads to the release and activation of sigma factor B. RsbS is a single domain protein (STAS domain), while RsbR-like proteins have a well-conserved C-terminal STATS domain and a variable N-terminal domain. The STAS domain is also found in the C- terminal region of sulphate transporters and anti-anti-sigma factors. Probab=95.01 E-value=0.18 Score=30.52 Aligned_cols=77 Identities=14% Similarity=0.259 Sum_probs=65.7 Q ss_pred HHHHCCCHHHH-HHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCE Q ss_conf 23220622899-9999998759889998675672474689999999999851897179984572457889997089811 Q 537021.9.peg.3 5 IYQFQAEILEQ-VLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSK 82 (150) Q Consensus 5 ~~~~~a~~l~e-a~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~ 82 (150) +...+++.+.| .+.++.+.+-+++++-+.|-...|+.....+.+-...++.+|.+++++ |-+|++-+.+-+.|++.. T Consensus 20 ld~~r~~~l~e~ll~~i~~~~~~~viiDlsGv~~iD~~~~~~L~~l~~~~~LlG~~~il~-Gi~P~vA~~lv~~g~~l~ 97 (109) T cd07041 20 LDDERAEQLQERLLEAISRRRARGVIIDLTGVPVIDSAVARHLLRLARALRLLGARTILT-GIRPEVAQTLVELGIDLS 97 (109) T ss_pred ECHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEE-ECCHHHHHHHHHHCCCCC T ss_conf 859999999999999999729889999745886377899999999999999639869999-439999999999299803 No 56 >KOG1154 consensus Probab=94.20 E-value=0.078 Score=32.79 Aligned_cols=98 Identities=20% Similarity=0.260 Sum_probs=60.0 Q ss_pred CCCEEEEEECHHHHCCHH-------HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCH---H Q ss_conf 598899986756724746-------899999999998518971799845724578899970898115578346612---4 Q 537021.9.peg.3 24 ENETIVVKYGGHVMNCTD-------LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQ---Q 93 (150) Q Consensus 24 r~ktiVIK~GG~~l~d~~-------l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~---~ 93 (150) +-+++|||+|-+++.++. -+..++++++.|+..|.+++||--|+-... ++++.++-..-.-.|-|-+ + T Consensus 8 ka~rIVVKLGSavit~e~~~~laLgrla~IVEqV~~L~~~G~evilVSSGaVA~G--~qrLr~~~~~s~s~r~~l~~~~~ 85 (285) T KOG1154 8 KAYRIVVKLGSAVITREDTCGLALGRLASIVEQVSELQRMGREVILVSSGAVAFG--RQRLRQELLPSSSMRQTLKPQSE 85 (285) T ss_pred CCEEEEEEECCEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHH--HHHHHHHHCCCHHHHHHHCCCCC T ss_conf 0049999814468877987540478999999999999856855999923525550--77751020343567875267541 Q ss_pred HHHH-HHHHHHHH--HHHHHHHHHHHCCCCEEEE Q ss_conf 8999-99997416--6899999998289976877 Q 537021.9.peg.3 94 TAEV-VEMVLAGS--INKKIVSLINQTGTQAIGI 124 (150) Q Consensus 94 tl~i-v~~vL~G~--vn~~lv~~L~~~g~~avgl 124 (150) +.+. +..+ .|+ +-..-...+.+.|+.+..+ T Consensus 86 l~e~rA~AA-vGQ~~Lmalye~lF~Qy~~~~AQv 118 (285) T KOG1154 86 LAEKRACAA-VGQSGLMALYETLFTQYGITIAQV 118 (285) T ss_pred HHHHHHHHH-HCCCHHHHHHHHHHHHHCCEEEEE T ss_conf 146788887-275218999999999837530035 No 57 >PRK06291 aspartate kinase; Provisional Probab=92.59 E-value=1 Score=25.68 Aligned_cols=50 Identities=14% Similarity=0.210 Sum_probs=36.0 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 89998675672474689999999999851897179984572457889997 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK 76 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~ 76 (150) .+|.||||+-+.|.+.+..+++=|..-..-|.++|+|.-+=.-++..|-+ T Consensus 2 ~iV~KFGGTSva~~~~i~~v~~iI~~~~~~~~~~vVVvSA~~gvTd~L~~ 51 (466) T PRK06291 2 RLVMKFGGTSVGDGERIRHVAKLVKKYREEGNEVVVVVSAMTGVTDRLLE 51 (466) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHH T ss_conf 78997384237989999999999876776589899996799997899999 No 58 >cd04259 AAK_AK-DapDC AAK_AK-DapDC: Amino Acid Kinase Superfamily (AAK), AK-DapDC; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the bifunctional enzyme AK - DAP decarboxylase (DapDC) found in some bacteria. Aspartokinase is the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. DapDC, which is the lysA gene product, catalyzes the decarboxylation of DAP to lysine. Probab=89.65 E-value=1.1 Score=25.54 Aligned_cols=47 Identities=19% Similarity=0.232 Sum_probs=32.5 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH Q ss_conf 99986756724746899999999998518971799845724578899 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVL 74 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l 74 (150) +|.||||+-+.|.+.++.+++=|..-...|.++|+|--+-.-++..| T Consensus 2 iV~KFGGTSv~~~~~~~~v~~ii~~~~~~~~~~vvVVSA~~gvTd~L 48 (295) T cd04259 2 VVLKFGGTSVSSRARWDTIAKLAQKHLNTGGQPLIVCSALSGISNKL 48 (295) T ss_pred EEEEECHHCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHH T ss_conf 89976801058799999999999988747999899979999983689 No 59 >cd06844 STAS Sulphate Transporter and Anti-Sigma factor antagonist domain found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors. The STAS (Sulphate Transporter and Anti-Sigma factor antagonist) domain is found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors, like anti-anti-sigma factors and "stressosome" components. The sigma factor regulators are involved in protein-protein interaction which is regulated by phosphorylation. Probab=87.85 E-value=2.5 Score=23.07 Aligned_cols=74 Identities=15% Similarity=0.160 Sum_probs=62.0 Q ss_pred HHCCCHHHHHH-HHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCC Q ss_conf 22062289999-9999875988999867567247468999999999985189717998457245788999708981 Q 537021.9.peg.3 7 QFQAEILEQVL-PFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKS 81 (150) Q Consensus 7 ~~~a~~l~ea~-pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~ 81 (150) -..|+-|++.+ +++.+...+++|+-+.|-...|..-...++.-...++..|.++++ .|-.|++.+.++..|+.- T Consensus 20 ~~ta~~lr~~l~~~i~~~~~~~iVvDls~v~f~DSsGl~~Ll~~~k~~~~~Gg~l~l-~~~~p~V~~vf~~tGldk 94 (100) T cd06844 20 HHSVEQFKEELLHNITNVAGKTIVIDISALEFMDSSGTGVLLERSRLAEAVGGQFVL-TGISPAVRITLTESGLDK 94 (100) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECCCCEEECHHHHHHHHHHHHHHHHCCCEEEE-ECCCHHHHHHHHHHCCCC T ss_conf 888999999999999738997899987688188478999999999999966998999-839989999999948685 No 60 >cd04244 AAK_AK-LysC-like AAK_AK-LysC-like: Amino Acid Kinase Superfamily (AAK), AK-LysC-like; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive AK isoenzyme found in higher plants. The lysine-sensitive AK isoenzyme is a monofunctional protein. It is involved in the overall regulation of the aspartate pathway and can be synergistically inhibited by S-adenosylmethionine. Also included in this CD is an uncharacterized LysC-like AK found in Euryarchaeota and some bacteria. AK catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. Probab=87.61 E-value=1.6 Score=24.44 Aligned_cols=48 Identities=15% Similarity=0.194 Sum_probs=32.1 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 9998675672474689999999999851897179984572457889997 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK 76 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~ 76 (150) +|.||||+-+.|.+.+.++++ |..-..-+.++|+|.-+=--++..|-+ T Consensus 2 iV~KFGGtSva~~~~i~~v~~-II~~~~~~~~~vVVVSA~~gvTn~L~~ 49 (298) T cd04244 2 LVMKFGGTSVGSAERIRHVAD-LVGTYAEGHEVVVVVSAMGGVTDRLLL 49 (298) T ss_pred EEEEECCCCCCCHHHHHHHHH-HHHHHCCCCCEEEEECCCCCCHHHHHH T ss_conf 579849101189999999999-985546699989997799998899999 No 61 >pfam00752 XPG_N XPG N-terminal domain. Probab=87.30 E-value=1.7 Score=24.26 Aligned_cols=52 Identities=23% Similarity=0.373 Sum_probs=36.4 Q ss_pred HHHHCCCEEEEEECH--------------HHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH Q ss_conf 998759889998675--------------6724746899999999998518971799845724578 Q 537021.9.peg.3 20 VQFYENETIVVKYGG--------------HVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG 71 (150) Q Consensus 20 i~~~r~ktiVIK~GG--------------~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~ 71 (150) +..|+||++-|-..- ..+.+......|+..+..|...|++||.|+-|.|--. T Consensus 20 l~~~~Gk~vAID~s~wl~q~~~~~~~~~~~~~~~~~~l~~~~~ri~~l~~~~I~PvfVFDG~~p~~ 85 (100) T pfam00752 20 IEALEGKTLAIDASIWLYQFLKAVRDQLGNALQNTSHLMGFFSRLCRLKDFGIKPIFVFDGGPPPL 85 (100) T ss_pred HHHHCCCEEEECCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCHH T ss_conf 999689879977199999999997023137888828999999999999988998999878999663 No 62 >PRK08961 bifunctional aspartate kinase/diaminopimelate decarboxylase protein; Provisional Probab=86.72 E-value=1.9 Score=23.82 Aligned_cols=54 Identities=13% Similarity=0.140 Sum_probs=40.1 Q ss_pred HCCCEEEEEECHHHHCCHHHHHHHHHHHH-HHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 75988999867567247468999999999-9851897179984572457889997 Q 537021.9.peg.3 23 YENETIVVKYGGHVMNCTDLSKDFVNDIA-LLKKSNITPVIVHGGGPQIGAVLEK 76 (150) Q Consensus 23 ~r~ktiVIK~GG~~l~d~~l~~~~~~dI~-ll~~~G~k~ViVHGgg~qI~~~l~~ 76 (150) +-.+.+|.||||+-+.+.+-++++++-|. .+...|.++|+|.-+---++..|.+ T Consensus 5 ~~~~wVV~KFGGTSVas~~~~~~Va~iI~~~~~~~g~rvvVVvSAm~gvTn~L~~ 59 (865) T PRK08961 5 SADRWVVLKFGGTSVSRRHRWDTIAKLARKRLNETGGRVLVVVSALSGVTNELTA 59 (865) T ss_pred CCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHH T ss_conf 8898799862846689899999999999999985099889998599986389999 No 63 >COG1366 SpoIIAA Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) [Signal transduction mechanisms] Probab=83.95 E-value=4 Score=21.84 Aligned_cols=76 Identities=18% Similarity=0.226 Sum_probs=63.5 Q ss_pred HHCCCHHHHHHH-HHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE Q ss_conf 220622899999-99987598899986756724746899999999998518971799845724578899970898115 Q 537021.9.peg.3 7 QFQAEILEQVLP-FVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF 83 (150) Q Consensus 7 ~~~a~~l~ea~p-Yi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~ 83 (150) -..|+.|+|... ++..-..+.+||-+.+-...|..-...++.-...++..|.+++++ |-.|++.+.++..|+...| T Consensus 25 ~~~a~~~~e~~~~~~~~~~~~~ivIDls~v~~~dS~gl~~L~~~~~~~~~~g~~~~l~-~i~p~v~~~~~~~gl~~~~ 101 (117) T COG1366 25 AARAPALKETLLEVIAASGARGLVIDLSGVDFMDSAGLGVLVALLKSARLRGVELVLV-GIQPEVARTLELTGLDKSF 101 (117) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECCCCCEECCHHHHHHHHHHHHHHHCCCCEEEE-ECCHHHHHHHHHHCCCCEE T ss_conf 8889999999999997469977999888887745458999999999998669948999-2999999999993974125 No 64 >PRK00192 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=82.52 E-value=3.4 Score=22.24 Aligned_cols=67 Identities=15% Similarity=0.215 Sum_probs=45.8 Q ss_pred CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHH-HHHHHHHCCCCCEEE--CCCCCC Q ss_conf 5988999867567247468999999999985189717998457245-788999708981155--783466 Q 537021.9.peg.3 24 ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQ-IGAVLEKMGIKSKFE--NGLRIT 90 (150) Q Consensus 24 r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q-I~~~l~~~gi~~~~v--~G~RVT 90 (150) .-+.|+.=+.|..+.+........+.|..|+..|+++|+.-|=.+. +...++++|+...|+ ||-+|- T Consensus 5 ~~~LI~tDlDGTLL~~~~~~~~~~~ai~~l~~~Gi~vviaTGR~~~~~~~~~~~l~l~~~~I~~NGa~I~ 74 (275) T PRK00192 5 SKLLVFTDLDGTLLDHTYDWEPAKPTLKALKEKGIPVIPCTSKTAAEVEVLRAELGLEDPFIVENGAAIY 74 (275) T ss_pred CCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCEEE T ss_conf 8469999485778489998368999999999789999996898899999999995899749996795899 No 65 >smart00485 XPGN Xeroderma pigmentosum G N-region. domain in nucleases Probab=81.90 E-value=4.2 Score=21.68 Aligned_cols=52 Identities=13% Similarity=0.179 Sum_probs=35.7 Q ss_pred HHHHCCCEEEEEECHHH--------------HCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH Q ss_conf 99875988999867567--------------24746899999999998518971799845724578 Q 537021.9.peg.3 20 VQFYENETIVVKYGGHV--------------MNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG 71 (150) Q Consensus 20 i~~~r~ktiVIK~GG~~--------------l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~ 71 (150) +..|+|+++.|=...=. ..+......|+.-+..|...|+.||.|+.|.|--. T Consensus 19 l~~l~gk~vAIDas~wl~k~~~~~~~~~~~~~~~~~~l~~~~~ri~~l~~~~I~PIfVFDG~~~~~ 84 (99) T smart00485 19 LEALRGKTLAIDASIWLYQFLTACREKLGTPLPNSKHLMGLFYRTCRLLEFGIKPIFVFDGKPPPL 84 (99) T ss_pred HHHHCCCEEEEECHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCHH T ss_conf 999699879991589999999997100157777719999999999999987998999888999765 No 66 >TIGR00657 asp_kinases aspartate kinase; InterPro: IPR001341 Bacteria, plants and fungi metabolise aspartic acid to produce four amino acids - lysine, threonine, methionine and isoleucine - in a series of reactions known as the aspartate pathway. Additionally, several important metabolic intermediates are produced by these reactions, such as diaminopimelic acid, an essential component of bacterial cell wall biosynthesis, and dipicolinic acid, which is involved in sporulation in Gram-positive bacteria. Members of the animal kingdom do not posses this pathway and must therefore acquire these essential amino acids through their diet. Research into improving the metabolic flux through this pathway has the potential to increase the yield of the essential amino acids in important crops, thus improving their nutritional value. Additionally, since the enzymes are not present in animals, inhibitors of them are promising targets for the development of novel antibiotics and herbicides. For more information see . Aspartate kinase (2.7.2.4 from EC) (AK) catalyzes the first reaction in the aspartate pathway; the phosphorylation of aspartate. The product of this reaction can then be used in the biosynthesis of lysine or in the pathway leading to homoserine, which participates in the biosynthesis of threonine, isoleucine and methionine . In bacteria there are three different aspartate kinase isozymes which differ in sensitivity to repression and inhibition by Lys, Met and Thr. AK1 and AK2 are bifunctional enzymes which both consist of an N-terminal AK domain and a C-terminal homoserine dehydrogenase domain. AK1 is involved in threonine biosynthesis and AK2, in that of methionine. The third isozyme, AK3 is monofunctional and involved in lysine synthesis. In archaea and plants there may be a single isozyme of AK which in plants is multifunctional. This entry represents a region encoding aspartate kinase activity found in both the monofunctional and bifunctional enzymes. Synonym(s): Aspartokinase; GO: 0004072 aspartate kinase activity, 0008652 amino acid biosynthetic process. Probab=79.81 E-value=4.1 Score=21.75 Aligned_cols=52 Identities=17% Similarity=0.205 Sum_probs=45.4 Q ss_pred CEEEEEECHHHHCCH--HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC Q ss_conf 889998675672474--6899999999998518971799845724578899970 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCT--DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM 77 (150) Q Consensus 26 ktiVIK~GG~~l~d~--~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~ 77 (150) ..+|.||||.-+.+. +-+.+.++=|..-...|.++|+|-=+-.-++.+|-++ T Consensus 2 ~~iV~KFGGTSVg~~PverI~~~A~~v~~~~~~g~~vvVVVSA~~g~Td~Lv~~ 55 (504) T TIGR00657 2 MLIVQKFGGTSVGNAPVERIRRVAKIVLKEKEKGNQVVVVVSAMAGVTDALVEL 55 (504) T ss_pred CEEEEEECCEEECCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHH T ss_conf 747997367576673237899999988887535897389997377533689999 No 67 >cd04243 AAK_AK-HSDH-like AAK_AK-HSDH-like: Amino Acid Kinase Superfamily (AAK), AK-HSDH-like; this family includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK- homoserine dehydrogenase (HSDH). These aspartokinases are found in such bacteria as E. coli (AKI-HSDHI, ThrA and AKII-HSDHII, MetL) and in higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK- Probab=77.88 E-value=5.9 Score=20.72 Aligned_cols=47 Identities=15% Similarity=0.234 Sum_probs=30.4 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 9998675672474689999999999851897179984572457889997 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK 76 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~ 76 (150) .|.||||+-+.|++.+.++++ |.. +..+.++|+|.-+=--++..|.+ T Consensus 2 kV~KFGGtSv~~~~~i~~v~~-iI~-~~~~~~~vVVVSA~~gvTd~L~~ 48 (293) T cd04243 2 KVLKFGGTSVASAERIRRVAD-IIK-SRASSPVLVVVSALGGVTNRLVA 48 (293) T ss_pred EEEEECCCCCCCHHHHHHHHH-HHH-HCCCCCEEEEECCCCCCHHHHHH T ss_conf 899878123699999999999-998-37798959998199886899999 No 68 >cd04257 AAK_AK-HSDH AAK_AK-HSDH: Amino Acid Kinase Superfamily (AAK), AK-HSDH; this CD includes the N-terminal catalytic domain of aspartokinase (AK) of the bifunctional enzyme AK - homoserine dehydrogenase (HSDH). These aspartokinases are found in bacteria (E. coli AKI-HSDHI, ThrA and E. coli AKII-HSDHII, MetL) and higher plants (Z. mays AK-HSDH). AK and HSDH are the first and third enzymes in the biosynthetic pathway of the aspartate family of amino acids. AK catalyzes the phosphorylation of Asp to P-aspartyl phosphate. HSDH catalyzes the NADPH-dependent conversion of Asp 3-semialdehyde to homoserine. ThrA and MetL are involved in threonine and methionine biosynthesis, respectively. In E. coli, ThrA is subject to allosteric regulation by the end product L-threonine and the native enzyme is reported to be tetrameric. As with bacteria, plant AK and HSDH are feedback inhibited by pathway end products. Maize AK-HSDH is a Thr-sensitive 180-kD enzyme. Arabidopsis AK-HSDH is an alanine-act Probab=77.66 E-value=6.2 Score=20.58 Aligned_cols=48 Identities=17% Similarity=0.262 Sum_probs=28.9 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 9998675672474689999999999851897179984572457889997 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK 76 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~ 76 (150) -|.||||+-+.|.+.+.++++ |..-+.-+-++|+|--+=--++..|.+ T Consensus 2 kV~KFGGtSv~~~~~i~~v~~-iI~~~~~~~~~vVVVSA~ggvTd~L~~ 49 (294) T cd04257 2 KVLKFGGTSLANAERIRRVAD-IILNAAKQEQVAVVVSAPGKVTDLLLE 49 (294) T ss_pred EEEEECCCCCCCHHHHHHHHH-HHHHHCCCCCEEEEECCCCCCHHHHHH T ss_conf 899878111589999999999-997532489989998099986799999 No 69 >TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=77.47 E-value=6.6 Score=20.44 Aligned_cols=72 Identities=19% Similarity=0.302 Sum_probs=50.5 Q ss_pred CCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCC Q ss_conf 06228999999998759889998675672474689999999999851897179984572457889997089811557834 Q 537021.9.peg.3 9 QAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLR 88 (150) Q Consensus 9 ~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~R 88 (150) -+.+=.+-.||=.+.+||..+|-.||.+ +.+ ++ |+.|+.+|+.+|++ |.|=+..-++..|....-+|.. T Consensus 321 ~~~~~~~l~~yR~~L~GKkaaiY~GGa~----Ksw-Sl---v~Al~dLGMeVV~~---GTqkg~~EDy~~I~e~~~~~~~ 389 (470) T TIGR01283 321 EAKIRPELEPYRERLKGKKAAIYTGGAV----KSW-SL---VSALQDLGMEVVAT---GTQKGTEEDYERIRELMGEGTV 389 (470) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCHH----HHH-HH---HHHHHHCCCEEEEE---EECCCCHHHHHHHHHHHCCCCE T ss_conf 9999999999899734988998658647----899-99---98884579179998---3007988899999997079967 Q ss_pred CCC Q ss_conf 661 Q 537021.9.peg.3 89 ITD 91 (150) Q Consensus 89 VTd 91 (150) +=| T Consensus 390 m~D 392 (470) T TIGR01283 390 MLD 392 (470) T ss_pred EEE T ss_conf 862 No 70 >PRK08373 aspartate kinase; Validated Probab=75.53 E-value=7.4 Score=20.10 Aligned_cols=100 Identities=14% Similarity=0.176 Sum_probs=51.0 Q ss_pred CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCC----------------------CCC Q ss_conf 5988999867567247468999999999985189717998457245788999708----------------------981 Q 537021.9.peg.3 24 ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMG----------------------IKS 81 (150) Q Consensus 24 r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~g----------------------i~~ 81 (150) ..|.+|.||||+-+.+. ++..+ +|..-..-+.++|+|.-+-.-++..|-++. +++ T Consensus 2 ~~km~V~KFGGTSV~~a--~~~v~-~iv~~~~~~~~vvVVVSAm~gvTd~Li~~a~~~~~~~~~I~~~h~~~a~~l~~~~ 78 (359) T PRK08373 2 VGKMIVVKFGGSSVRYD--FEEAV-ELVKSLSEEKEVIVVVSALKGVTDCLLKYANTDKSAALEIEKIHEEFAKEHGIDT 78 (359) T ss_pred CCCCEEEEECHHCCHHH--HHHHH-HHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf 88879984281013328--99999-9999875289889996799997799999997657778999999999999836667 Q ss_pred EE--------ECCCCCCCHHHHHHHHHHHHHHHHHHH-HHHHHHCCCCEEEEEEE Q ss_conf 15--------578346612489999999741668999-99998289976877865 Q 537021.9.peg.3 82 KF--------ENGLRITDQQTAEVVEMVLAGSINKKI-VSLINQTGTQAIGICGK 127 (150) Q Consensus 82 ~~--------v~G~RVTd~~tl~iv~~vL~G~vn~~l-v~~L~~~g~~avglsg~ 127 (150) +. .+.....++...+.+ ..+.-.+...| .+.|++.|++|.-+.+. T Consensus 79 ~~l~~~l~~l~~~~~~~~~~~~D~I-ls~GE~lSa~Lla~~L~~~Gi~a~~lda~ 132 (359) T PRK08373 79 EILSPYLKELFNRPDFPSEALRDYI-LSIGERLSAVIFAEGLKILGINGKVVDPW 132 (359) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHH-HHHHHHHHHHHHHHHHHHCCCCCEEECHH T ss_conf 7889999998731138845766667-76739999999999999779980997548 No 71 >pfam01740 STAS STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP binding function. Probab=75.27 E-value=7.5 Score=20.06 Aligned_cols=73 Identities=19% Similarity=0.274 Sum_probs=54.4 Q ss_pred CCHHHHHH-HHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE Q ss_conf 62289999-999987598899986756724746899999999998518971799845724578899970898115 Q 537021.9.peg.3 10 AEILEQVL-PFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF 83 (150) Q Consensus 10 a~~l~ea~-pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~ 83 (150) |+-|++.. ..+..-.-+.+|+-+.+--..|......+..-...++..|.++.++ |-.|++.+.++..|+...| T Consensus 24 a~~~~~~l~~~~~~~~~~~vvlD~~~v~~iDssgl~~l~~~~~~~~~~g~~l~l~-~~~~~v~~~l~~~gl~~~~ 97 (106) T pfam01740 24 AEAFRERLLRAIEEGEIRHVILDLSGVPFIDSSGLGALLELYKELRRRGVELVLV-GPSPEVRRTLEKTGLDDII 97 (106) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCEEEEE-CCCHHHHHHHHHHCCCCEE T ss_conf 9999999999986589978999970898889029999999999999769909997-6998999999980998664 No 72 >cd04258 AAK_AKiii-LysC-EC AAK_AKiii-LysC-EC: Amino Acid Kinase Superfamily (AAK), AKiii-LysC-EC: this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine-sensitive aspartokinase isoenzyme AKIII. AKIII is a monofunctional class enzyme (LysC) found in some bacteria such as E. coli. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. In E. coli, LysC is reported to be a homodimer of 50 kD subunits. Probab=75.04 E-value=6.7 Score=20.36 Aligned_cols=45 Identities=16% Similarity=0.228 Sum_probs=28.3 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH Q ss_conf 899986756724746899999999998518971799845724578899 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVL 74 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l 74 (150) .+|.||||+-+.|.+.+.++++=| ..-+.++|+|--+-.-++..| T Consensus 1 m~V~KFGGtSv~~~~~i~~v~~ii---~~~~~~~vVVVSA~~gvTd~L 45 (292) T cd04258 1 MVVAKFGGTSVADYAAMLRCAAIV---KSDASVRLVVVSASAGVTNLL 45 (292) T ss_pred CEEEEECCCCCCCHHHHHHHHHHH---HCCCCCEEEEEECCCCCHHHH T ss_conf 989957865448899999999999---758998099994899974999 No 73 >cd04247 AAK_AK-Hom3 AAK_AK-Hom3: Amino Acid Kinase Superfamily (AAK), AK-Hom3; this CD includes the N-terminal catalytic domain of the aspartokinase HOM3, a monofunctional class enzyme found in Saccharomyces cerevisiae and other related AK domains. Aspartokinase, the first enzyme in the aspartate metabolic pathway, catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP, and in fungi, is responsible for the production of threonine, isoleucine and methionine. S. cerevisiae has a single aspartokinase isoenzyme type, which is regulated by feedback, allosteric inhibition by L-threonine. Recent studies show that the allosteric transition triggered by binding of threonine to AK involves a large change in the conformation of the native hexameric enzyme that is converted to an inactive one of different shape and substantially smaller hydrodynamic size. Probab=72.30 E-value=8.9 Score=19.60 Aligned_cols=38 Identities=18% Similarity=0.253 Sum_probs=26.7 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 8999867567247468999999999985189717998457 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG 66 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg 66 (150) -+|.||||+-+.+. ....+.+|..-...+.++|+|.-+ T Consensus 2 ~iV~KFGGTSva~~--~~~~~~~I~~~~~~~~~vvVVvSA 39 (306) T cd04247 2 WVVQKFGGTSVGKF--PDNIADDIVKAYLKGNKVAVVCSA 39 (306) T ss_pred CEEEEECHHHHHHH--HHHHHHHHHHHHHCCCCEEEEECC T ss_conf 89997791408879--999999998776279988999889 No 74 >COG0474 MgtA Cation transport ATPase [Inorganic ion transport and metabolism] Probab=71.82 E-value=9.1 Score=19.53 Aligned_cols=100 Identities=19% Similarity=0.340 Sum_probs=69.6 Q ss_pred EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH-HHHCCCCCE-----EECCC---CCCCHHHHHHHH Q ss_conf 998675672474689999999999851897179984572457889-997089811-----55783---466124899999 Q 537021.9.peg.3 29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV-LEKMGIKSK-----FENGL---RITDQQTAEVVE 99 (150) Q Consensus 29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~-l~~~gi~~~-----~v~G~---RVTd~~tl~iv~ 99 (150) .+-+|.-.|.||- ++...+.|..++..|++++++-|=.+..-.. .+++|+..+ .++|. ..+|++-.+.++ T Consensus 536 l~~lGl~g~~Dpp-r~~v~~aI~~l~~AGI~v~MiTGD~~~TA~aIa~~~Gi~~~~~~~~vi~G~el~~l~~~el~~~~~ 614 (917) T COG0474 536 LVFLGLTGIEDPP-REDVKEAIEELREAGIKVWMITGDHVETAIAIAKECGIEAEAESALVIDGAELDALSDEELAELVE 614 (917) T ss_pred CEEEEEEEECCCC-CCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHCCCCCCCCCEEEEHHHHHHCCHHHHHHHHH T ss_conf 0402001212899-711899999999889947998799989999999981998676642565159987368899999887 Q ss_pred HH-HHHHH----HHHHHHHHHHCCCCEEEEEEEECC Q ss_conf 99-74166----899999998289976877865589 Q 537021.9.peg.3 100 MV-LAGSI----NKKIVSLINQTGTQAIGICGKDGN 130 (150) Q Consensus 100 ~v-L~G~v----n~~lv~~L~~~g~~avglsg~dg~ 130 (150) .. ...++ -..|+.+|+++ .+-++++|--.| T Consensus 615 ~~~VfARvsP~qK~~IV~~lq~~-g~vVamtGDGvN 649 (917) T COG0474 615 ELSVFARVSPEQKARIVEALQKS-GHVVAMTGDGVN 649 (917) T ss_pred HCCEEEEECHHHHHHHHHHHHHC-CCEEEEECCCHH T ss_conf 58989992789999999999977-999999679716 No 75 >pfam10686 DUF2493 Protein of unknown function (DUF2493). Members of this family are all Proteobacteria. The function is not known. Probab=70.54 E-value=9.7 Score=19.35 Aligned_cols=52 Identities=25% Similarity=0.355 Sum_probs=34.0 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH-H----HHHHHHHCCCC Q ss_conf 8899986756724746899999999998518971799845724-5----78899970898 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP-Q----IGAVLEKMGIK 80 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~-q----I~~~l~~~gi~ 80 (150) -+-|+-.||--..|-.+....+..+ +..--..||||||.+ - ..++.+++|++ T Consensus 3 g~~V~~~Ggrd~~D~~~i~~~Ld~~---~~~~pd~vlihGG~~kGad~lA~~WA~~~gv~ 59 (71) T pfam10686 3 GTRVAFSGGRDFNDHRLIWDALDKV---HARHPDMVLLHGGAPKGAERIAARWARRRGVP 59 (71) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHH---HHHCCCEEEEECCCCCCHHHHHHHHHHHCCCC T ss_conf 9989998089865099999999999---98689879997798633799999999986997 No 76 >COG0561 Cof Predicted hydrolases of the HAD superfamily [General function prediction only] Probab=68.79 E-value=11 Score=19.12 Aligned_cols=102 Identities=17% Similarity=0.311 Sum_probs=66.6 Q ss_pred CCEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE--CCCCCCCHHHHH-HHH Q ss_conf 98899986756724746-899999999998518971799845724-5788999708981155--783466124899-999 Q 537021.9.peg.3 25 NETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE--NGLRITDQQTAE-VVE 99 (150) Q Consensus 25 ~ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v--~G~RVTd~~tl~-iv~ 99 (150) -|.++.=+.|..+.+.. ..+...+-|..++..|++++|+-|-.. .+-..+++++++..++ ||-.+-++ -+ +.. T Consensus 3 ~kli~~DlDGTLl~~~~~i~~~~~~al~~~~~~g~~v~iaTGR~~~~~~~~~~~l~~~~~~I~~NGa~i~~~--~~~i~~ 80 (264) T COG0561 3 IKLLAFDLDGTLLDSNKTISPETKEALARLREKGVKVVLATGRPLPDVLSILEELGLDGPLITFNGALIYNG--GELLFQ 80 (264) T ss_pred CEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCCCEEECCCCEEEEC--CEEEEE T ss_conf 569999476667489998899999999999978298999889987999999997299984998689599989--988882 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEEEEE Q ss_conf 99741668999999982899768778655 Q 537021.9.peg.3 100 MVLAGSINKKIVSLINQTGTQAIGICGKD 128 (150) Q Consensus 100 ~vL~G~vn~~lv~~L~~~g~~avglsg~d 128 (150) ..+....-..+...+........-+...+ T Consensus 81 ~~l~~~~~~~i~~~~~~~~~~~~~~~~~~ 109 (264) T COG0561 81 KPLSREDVEELLELLEDFQGIALVLYTDD 109 (264) T ss_pred CCCCHHHHHHHHHHHHHCCCCCEEEEECC T ss_conf 46999999999999986678628999326 No 77 >TIGR00268 TIGR00268 conserved hypothetical protein TIGR00268; InterPro: IPR005232 This family of conserved hypothetical proteins has no known function but is predicted to be part of the wider PP-loop superfamily .. Probab=68.69 E-value=9.6 Score=19.37 Aligned_cols=109 Identities=17% Similarity=0.298 Sum_probs=64.8 Q ss_pred HHHHHHHHH-HCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHH--H-------HHHCCCCCEE Q ss_conf 999999998-75988999867567247468999999999985189717998457245788--9-------9970898115 Q 537021.9.peg.3 14 EQVLPFVQF-YENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGA--V-------LEKMGIKSKF 83 (150) Q Consensus 14 ~ea~pYi~~-~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~--~-------l~~~gi~~~~ 83 (150) ..--.+++. |.+|-++|-|+|.+ |..++..++.|++-... +=|.=-.|+++. . .++.|+..+| T Consensus 3 ~~L~~fl~~kf~nk~~~IAySGGv--DS~lla~v~~~v~g~~~-----lait~~sP~~sp~~el~~A~~~A~~~g~~he~ 75 (263) T TIGR00268 3 ENLRNFLKEKFKNKKVLIAYSGGV--DSSLLAAVCSDVAGTEV-----LAITVVSPSISPRRELEDAKAIAKEIGVKHEL 75 (263) T ss_pred HHHHHHHHHHHHCCCEEEEEECCH--HHHHHHHHHHHHHHHCE-----EEEEEECCCCCCHHHHHHHHHHHHHHCCCEEE T ss_conf 688889876764081699951746--58999999987531011-----35787627757354489999999880830121 Q ss_pred ECCCCCC--------CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE--EEEEEECCEEEEEE Q ss_conf 5783466--------12489999999741668999999982899768--77865589899997 Q 537021.9.peg.3 84 ENGLRIT--------DQQTAEVVEMVLAGSINKKIVSLINQTGTQAI--GICGKDGNMVFAEK 136 (150) Q Consensus 84 v~G~RVT--------d~~tl~iv~~vL~G~vn~~lv~~L~~~g~~av--glsg~dg~li~a~~ 136 (150) ++==..+ +..+-=+=+ ++-..|+..-.+.|.+.| |.++.| |-.-+| T Consensus 76 ~~~d~~~n~~f~~N~~~RCY~CK~-----~~~~~L~~~a~~~gy~~V~dGtN~dD--L~~~RP 131 (263) T TIGR00268 76 VKIDKMANPNFRANVEERCYFCKK-----KVLSILVKLAEKRGYDVVVDGTNADD--LEDHRP 131 (263) T ss_pred ECCCCCCCCCCCCCCCCCCHHHHH-----HHHHHHHHHHHHCCCCEEEECCCCCC--CCCCCC T ss_conf 100123685001688544415488-----89999899998639957982346200--023675 No 78 >PRK05925 aspartate kinase; Provisional Probab=67.54 E-value=11 Score=18.96 Aligned_cols=104 Identities=18% Similarity=0.302 Sum_probs=54.8 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH--------------------------CCC Q ss_conf 889998675672474689999999999851897179984572457889997--------------------------089 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK--------------------------MGI 79 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~--------------------------~gi 79 (150) +.+|-||||+-+.+++.++++++=|. . +.+.++|.-+=.-++..|.+ +++ T Consensus 2 ~~iV~KFGGtSv~~~e~i~~va~ii~---~-~~~~~vVVSA~~gvTd~L~~~~~~~~~~~~~~~~~l~~~h~~i~~~l~~ 77 (440) T PRK05925 2 APLVYKFGGTSLGTAESIRRVCDIIC---K-EKPSFVVVSAVAGVTDLLEEFCRVSKGKREQITEKIREKHEEIAKELGI 77 (440) T ss_pred CEEEEEECCCCCCCHHHHHHHHHHHH---H-CCCCEEEEECCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 86999379623389999999999997---1-5997899969999708999998762010789999999999999987254 Q ss_pred CCE----------EECCCCCCCHHHHHHHHHHHHHH-HHHHH-HHHHHHCCCCEEEEEEE-----ECCEEEEEE Q ss_conf 811----------55783466124899999997416-68999-99998289976877865-----589899997 Q 537021.9.peg.3 80 KSK----------FENGLRITDQQTAEVVEMVLAGS-INKKI-VSLINQTGTQAIGICGK-----DGNMVFAEK 136 (150) Q Consensus 80 ~~~----------~v~G~RVTd~~tl~iv~~vL~G~-vn~~l-v~~L~~~g~~avglsg~-----dg~li~a~~ 136 (150) +.. +.+..+.|+.+.-.++ . .|+ +...| ...|+..|.++..+.+. |+++-.|+| T Consensus 78 ~~~~~~~~~~l~~~~~~~~~~~~~~~~il--s-~GE~~Sa~lla~~l~~~g~~a~~~da~~~i~td~~~~~a~~ 148 (440) T PRK05925 78 EFSLSPWWERLEHFEDKEEISSEDQARIL--A-IGEDISASLICAYCCTNVLPLEFLEARQVILTDDQYLRATP 148 (440) T ss_pred CHHHHHHHHHHHHHHHCCCCCHHHHHHHH--H-HHHHHHHHHHHHHHHHCCCCCEEECCEEEEEECCCCCCCCC T ss_conf 21246899999987640448847788898--8-72999999999999967998288430328982684433033 No 79 >cd04255 AAK_UMPK-MosAB AAK_UMPK-MosAB: This CD includes the alpha and beta subunits of the Mo storage protein (MosA and MosB) which are related to uridine monophosphate kinase (UMPK) enzymes that catalyze the phosphorylation of UMP by ATP, yielding UDP, and playing a key role in pyrimidine nucleotide biosynthesis. The Mo storage protein from the nitrogen-fixing bacterium, Azotobacter vinelandii, is characterized as an alpha4-beta4 octamer containing a polynuclear molybdenum-oxide cluster which is ATP-dependent to bind Mo and pH-dependent to release Mo. These and related bacterial sequences in this CD are members of the Amino Acid Kinase Superfamily (AAK). Probab=67.15 E-value=7.4 Score=20.09 Aligned_cols=85 Identities=22% Similarity=0.408 Sum_probs=50.9 Q ss_pred CEEEEEECHHHHCCH--HHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH---HCCCCCEEECCCCCCCHHHHHHHHH Q ss_conf 889998675672474--68999999999985189717998457245788999---7089811557834661248999999 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCT--DLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLE---KMGIKSKFENGLRITDQQTAEVVEM 100 (150) Q Consensus 26 ktiVIK~GG~~l~d~--~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~---~~gi~~~~v~G~RVTd~~tl~iv~~ 100 (150) ..-|||+||.-+.|- +..-.+++.|..+..-. |.+|+-|||-..-..++ .+|+++--. -+.... T Consensus 31 ~~~VvKIGGqsi~DrG~~av~Plv~Ei~~~~~~h-k~li~TGgG~RaRHiy~igldLgmPtGvL----------a~L~~~ 99 (262) T cd04255 31 DLNVVKIGGQSIIDRGAEAVLPLVEEIVALRPEH-KLLILTGGGTRARHVYSIGLDLGMPTGVL----------AKLGAS 99 (262) T ss_pred CCEEEEECCEEEECCCHHHHHHHHHHHHHHHHHC-CEEEEECCCHHHHHHHHHHHCCCCCHHHH----------HHHHHH T ss_conf 7228998780300373342432899999751315-28998367500334312341479961156----------776422 Q ss_pred HHHHHHHHHHHHH-HHHCCCCEEE Q ss_conf 9741668999999-9828997687 Q 537021.9.peg.3 101 VLAGSINKKIVSL-INQTGTQAIG 123 (150) Q Consensus 101 vL~G~vn~~lv~~-L~~~g~~avg 123 (150) + ++| |..+.+. |+.+|+..++ T Consensus 100 v-s~Q-Na~ml~~LLa~~G~~~i~ 121 (262) T cd04255 100 V-SEQ-NAEMLATLLAKHGGSKVG 121 (262) T ss_pred H-HHH-HHHHHHHHHCCCCCEEEC T ss_conf 4-464-799999984313870257 No 80 >PRK04596 minC septum formation inhibitor; Reviewed Probab=67.00 E-value=11 Score=18.89 Aligned_cols=59 Identities=17% Similarity=0.406 Sum_probs=44.4 Q ss_pred HHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCC Q ss_conf 9875988999867567247468999999999985189717998457245788999708981 Q 537021.9.peg.3 21 QFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKS 81 (150) Q Consensus 21 ~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~ 81 (150) ..|++.-+||-+..-. .++++ ..+-.=+..|+..|+.+|=|.|+.++...+.+.+|++. T Consensus 46 ~FF~naPVVIDls~L~-~~~~~-~~~~aLv~~Lr~~gl~pVgVrg~~~e~~~~A~~~GL~i 104 (248) T PRK04596 46 KLFGRAAVILDFGGLS-QVPDL-ATAKALLDGLRSAGVLPVALAYGTSEIDLLSQQLGLPL 104 (248) T ss_pred HHHCCCCEEEECHHHC-CCCCH-HHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHCCCCC T ss_conf 8666994999815715-77567-89999999999879889998279989999999769864 No 81 >TIGR01092 P5CS delta l-pyrroline-5-carboxylate synthetase; InterPro: IPR005766 delta l-pyrroline-5-carboxylate synthetase contains a glutamate 5-kinase (ProB, 2.7.2.11 from EC) region followed by a gamma-glutamyl phosphate reductase (ProA, 1.2.1.41 from EC) region and catalyses the first and second steps in proline biosynthesis.; GO: 0003824 catalytic activity, 0006561 proline biosynthetic process. Probab=64.65 E-value=10 Score=19.15 Aligned_cols=45 Identities=24% Similarity=0.323 Sum_probs=36.1 Q ss_pred CCEEEEEECHHHHCCHH-------HHHHHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf 98899986756724746-------8999999999985189717998457245 Q 537021.9.peg.3 25 NETIVVKYGGHVMNCTD-------LSKDFVNDIALLKKSNITPVIVHGGGPQ 69 (150) Q Consensus 25 ~ktiVIK~GG~~l~d~~-------l~~~~~~dI~ll~~~G~k~ViVHGgg~q 69 (150) -|-||||+|-.++.-.+ -+.+++++++.|++-|..++||--|--. T Consensus 7 ~KRiVVK~G~AVVTr~d~~GLALGRL~si~EQl~~L~~~G~E~~LVtSGAV~ 58 (738) T TIGR01092 7 VKRIVVKVGTAVVTRGDGRGLALGRLGSIVEQLSELNSDGYEVILVTSGAVA 58 (738) T ss_pred CCEEEEEECCEEEECCCCCEEHHHHHHHHHHHHHHHCCCCCEEEEECCCHHH T ss_conf 5478998254578778984014567899999999860289289998144123 No 82 >PTZ00217 flap endonuclease-1; Provisional Probab=64.50 E-value=13 Score=18.58 Aligned_cols=50 Identities=16% Similarity=0.236 Sum_probs=31.4 Q ss_pred HHHHCCCEEEEEECHHH---------------HCCH-----HHHHHHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf 99875988999867567---------------2474-----68999999999985189717998457245 Q 537021.9.peg.3 20 VQFYENETIVVKYGGHV---------------MNCT-----DLSKDFVNDIALLKKSNITPVIVHGGGPQ 69 (150) Q Consensus 20 i~~~r~ktiVIK~GG~~---------------l~d~-----~l~~~~~~dI~ll~~~G~k~ViVHGgg~q 69 (150) |..|+|+++.|=..-.+ +.+. .-...++..+..|...|++||.|+.|.|- T Consensus 23 L~~l~GK~VAIDas~~LYqfl~air~~~~~~~l~n~~G~~tshl~gl~~r~~~ll~~gIkPVfVFDGkpp 92 (394) T PTZ00217 23 LKNLFGRRIAIDASMALYQFIIAIRSSGQGSELTNESGEVTSHLNGLFNRTIRLLEEGIKPIYVFDGKPP 92 (394) T ss_pred HHHCCCCEEEEECHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 8990998899961499999999864442344210467870299999999999999879979999789994 No 83 >TIGR01116 ATPase-IIA1_Ca calcium-translocating P-type ATPase, SERCA-type; InterPro: IPR005782 This model describes Ca2+ ATPases (3.6.3.8 from EC) from eukaryotes and includes significant representatives from plants. Plasma membrane Ca2+ ATPases are primarily responsible for extrusion of calcium from cytoplasmic milieu to outside, coupled to the hydrolyis of ATP. Besides Ca2+ ATPases, Na+/Ca2+ exchangers are also involved in the maintenance of Ca2+ homestasis in the cell. ; GO: 0005388 calcium-transporting ATPase activity, 0006816 calcium ion transport, 0016021 integral to membrane. Probab=63.53 E-value=5.9 Score=20.75 Aligned_cols=95 Identities=15% Similarity=0.273 Sum_probs=64.9 Q ss_pred HHHHHCCCHHHHHHHHHHHH------------CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH Q ss_conf 02322062289999999987------------598899986756724746899999999998518971799845724578 Q 537021.9.peg.3 4 KIYQFQAEILEQVLPFVQFY------------ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG 71 (150) Q Consensus 4 ~~~~~~a~~l~ea~pYi~~~------------r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~ 71 (150) .+.|.+.+.=+..+--|+++ |.++=.+=+|=-.|.||. ++...+.|...|..|||+++|-|=.+.-- T Consensus 418 ~~VPL~~~~k~~il~~~~~~~~~aLRCLA~A~~~Es~Ltf~G~vGl~DPP-R~EV~~ai~~CR~AGIrVImITGD~KeTA 496 (800) T TIGR01116 418 SAVPLTEKMKNTILASIKEMSSEALRCLALAFKEESKLTFIGVVGLLDPP-RPEVADAIEKCREAGIRVIMITGDNKETA 496 (800) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCCEEEEEEECCCCC-CHHHHHHHHHHHHCCCEEEEEECCCCHHH T ss_conf 01347846789999999987641324110012146786788885068948-16899999988737978999847982468 Q ss_pred HHHH-HCCCCCEEE----------CCC---CCCCHHHHHHHH Q ss_conf 8999-708981155----------783---466124899999 Q 537021.9.peg.3 72 AVLE-KMGIKSKFE----------NGL---RITDQQTAEVVE 99 (150) Q Consensus 72 ~~l~-~~gi~~~~v----------~G~---RVTd~~tl~iv~ 99 (150) +.-- +-|+=+.|- .|+ |+=+..-+++|+ T Consensus 497 ~AicR~IG~lsSfTG~EF~~m~~~~~~~FSR~ePsHK~~lV~ 538 (800) T TIGR01116 497 EAICRRIGILSSFTGREFDEMGPKKALVFSRVEPSHKLRLVE 538 (800) T ss_pred HHHHHHHHHCCCCCCCHHHHCCCCCCEEEEECCCHHHHHHHH T ss_conf 999767501024200036500601466876058255787653 No 84 >cd04245 AAK_AKiii-YclM-BS AAK_AKiii-YclM-BS: Amino Acid Kinase Superfamily (AAK), AKiii-YclM-BS; this CD includes the N-terminal catalytic aspartokinase (AK) domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in Bacilli (Bacillus subtilis YclM) and Clostridia species. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. In Bacillus subtilis (BS), YclM is reported to be a single polypeptide of 50 kD. The Bacillus subtilis 168 AKIII is induced by lysine and repressed by threonine, and it is synergistically inhibited by lysine and threonine. Probab=63.09 E-value=13 Score=18.61 Aligned_cols=35 Identities=23% Similarity=0.325 Sum_probs=23.2 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 9998675672474689999999999851897179984 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVH 64 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVH 64 (150) -|.||||+-+.|.+-++++++ |..- ....++|+|- T Consensus 2 kV~KFGGtSv~~~~~i~~v~~-Ii~~-~~~~~vVVVS 36 (288) T cd04245 2 KVVKFGGSSLASAEQFQKVKA-IVKA-DPERKIVVVS 36 (288) T ss_pred EEEEECCCCCCCHHHHHHHHH-HHHC-CCCCEEEEEC T ss_conf 899888113589999999999-9971-7898099990 No 85 >PRK00009 phosphoenolpyruvate carboxylase; Reviewed Probab=63.06 E-value=14 Score=18.41 Aligned_cols=65 Identities=25% Similarity=0.451 Sum_probs=42.8 Q ss_pred CEEEEEEC------HHHHCCHHHHHHHHHHHHHHHHCCCCEEEEE--------CCCHHHHHHHHHCCCCCEEECC-CCCC Q ss_conf 88999867------5672474689999999999851897179984--------5724578899970898115578-3466 Q 537021.9.peg.3 26 ETIVVKYG------GHVMNCTDLSKDFVNDIALLKKSNITPVIVH--------GGGPQIGAVLEKMGIKSKFENG-LRIT 90 (150) Q Consensus 26 ktiVIK~G------G~~l~d~~l~~~~~~dI~ll~~~G~k~ViVH--------Ggg~qI~~~l~~~gi~~~~v~G-~RVT 90 (150) ..+-+-|. |-...+..+.++--+=.......|+++.+-| ||||.-.+.+ ..|+.-++| +|+| T Consensus 568 QeVMlGYSDS~KDgG~l~s~W~ly~Aq~~L~~v~~~~gv~l~~FHGRGGsvgRGGGP~~~ai~---aqP~gsv~g~ir~T 644 (916) T PRK00009 568 QEVMLGYSDSNKDGGFLASNWALYRAQEALVELAEKHGVRLTLFHGRGGTVGRGGGPAYAAIL---SQPPGSVKGRIRVT 644 (916) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHH---CCCCCCCCCCEEEE T ss_conf 999732566654233425568999999999999997699379850798766778860899986---39986606843665 Q ss_pred CHH Q ss_conf 124 Q 537021.9.peg.3 91 DQQ 93 (150) Q Consensus 91 d~~ 93 (150) -+- T Consensus 645 EQG 647 (916) T PRK00009 645 EQG 647 (916) T ss_pred ECC T ss_conf 346 No 86 >KOG2518 consensus Probab=60.00 E-value=9.7 Score=19.34 Aligned_cols=49 Identities=24% Similarity=0.440 Sum_probs=34.5 Q ss_pred HHHHHCCCEEEEEECHHHHCC----------------HHHHHHHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf 999875988999867567247----------------468999999999985189717998457245 Q 537021.9.peg.3 19 FVQFYENETIVVKYGGHVMNC----------------TDLSKDFVNDIALLKKSNITPVIVHGGGPQ 69 (150) Q Consensus 19 Yi~~~r~ktiVIK~GG~~l~d----------------~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q 69 (150) -+..|+|+|+-| .|.++.. ..-.+=|++-+.+|...|++||||+-|++- T Consensus 18 hi~~~~g~tvav--D~y~WLhrg~~~Ca~el~~~~pT~ryi~y~ik~v~lL~~~gikPilVFDG~~L 82 (556) T KOG2518 18 HISEYKGKTVAV--DGYCWLHRGALACAEKLAKGKPTDRYIQFFIKRVKLLLSYGIKPILVFDGDPL 82 (556) T ss_pred HHHHHCCCEEEE--EHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 688755836887--32558864577679998628973899999999999998569748999669976 No 87 >PRK03868 glucose-6-phosphate isomerase; Provisional Probab=59.98 E-value=15 Score=18.07 Aligned_cols=89 Identities=15% Similarity=0.250 Sum_probs=57.7 Q ss_pred CCHHHHHHHHHHHHC--CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE-- Q ss_conf 622899999999875--98899986756724746899999999998518971799845724-5788999708981155-- Q 537021.9.peg.3 10 AEILEQVLPFVQFYE--NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE-- 84 (150) Q Consensus 10 a~~l~ea~pYi~~~r--~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v-- 84 (150) ...+.+...+++.-+ ..++||-+|||.+- +.+...++ .-....+.++..|-.--| .+++.++.+..+.... T Consensus 42 ~~~~~~~~~~~~~~~~~~~vV~IGIGGS~LG-~~ai~~aL---~~~~~~~~~i~F~~N~Dp~~~~~~l~~l~~~~TlfiV 117 (409) T PRK03868 42 TNLIEESLKYLNDKQAIKNIVVIGIGGSSLG-VKAIYSML---KNKKQLKRELHFLDNTDPISINKTLKKINLEETLFIV 117 (409) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEEECCHHHH-HHHHHHHH---CCCCCCCCEEEEEECCCHHHHHHHHHHCCCCCEEEEE T ss_conf 5799999999864524987999825528899-99999984---1204589749998488989999999608935179999 Q ss_pred CCCCCCCHHHHHHHHHHH Q ss_conf 783466124899999997 Q 537021.9.peg.3 85 NGLRITDQQTAEVVEMVL 102 (150) Q Consensus 85 ~G~RVTd~~tl~iv~~vL 102 (150) --==-|+.||+...+.++ T Consensus 118 iSKSgtT~ET~~~~~~~~ 135 (409) T PRK03868 118 ISKSGTTIETISIFKYIL 135 (409) T ss_pred ECCCCCCHHHHHHHHHHH T ss_conf 859998389999999999 No 88 >PRK10444 UMP phosphatase; Provisional Probab=58.40 E-value=15 Score=18.13 Aligned_cols=55 Identities=7% Similarity=0.149 Sum_probs=28.0 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCC----HHHHHHHHHCCCC Q ss_conf 889998675672474689999999999851897179984572----4578899970898 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGG----PQIGAVLEKMGIK 80 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg----~qI~~~l~~~gi~ 80 (150) |++++=+.|.+....++++.-.+-|..|+..|+++++|-=.. .++-+.|.++|++ T Consensus 2 k~il~DlDGvl~~g~~~ipga~e~l~~L~~~g~~~~fvTNnss~s~~~~~~kL~~~Gl~ 60 (248) T PRK10444 2 KNVICDIDGVLMHDNVAVPGAAEFLHRILEKGLPLVLLTNYPSQTGQDLANRFATAGVD 60 (248) T ss_pred CEEEECCCCEEEECCEECCCHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHCCCC T ss_conf 89999075613889984869999999999879929999389999999999999977999 No 89 >cd01979 Pchlide_reductase_N Pchlide_reductase_N: N protein of the NB protein complex of Protochlorophyllide (Pchlide)_reductase. Pchlide reductase catalyzes the reductive formation of chlorophyllide (chlide) from protochlorophyllide (pchlide) during biosynthesis of chlorophylls and bacteriochlorophylls. This group contains both the light-independent Pchlide reductase (DPOR) and light-dependent Pchlide reductase (LPOR). Angiosperms contain only LPOR, cyanobacteria, algae and gymnosperms contain both DPOR and LPOR, primitive anoxygenic photosynthetic bacteria contain only DPOR. NB is structurally similar to the FeMo protein of nitrogenase, forming an N2B2 heterotetramer. N and B are homologous to the FeMo alpha and beta subunits respectively. Also in common with nitrogenase in vitro DPOR activity requires ATP hydrolysis and dithoionite or ferredoxin as electron donor. The NB protein complex may serve as a catalytic site for Pchlide reduction similar to MoFe for nitrogen reduction. Probab=58.34 E-value=16 Score=17.89 Aligned_cols=77 Identities=13% Similarity=0.315 Sum_probs=42.4 Q ss_pred HHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECC Q ss_conf 22062289999999987598899986756724746899999999998518971799845724578899970898115578 Q 537021.9.peg.3 7 QFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENG 86 (150) Q Consensus 7 ~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G 86 (150) +-.+..-+...+|....+||. |+-+|++-++=| +++ +|+.+|+.+|-| |-|++++.+...-++.- -.| T Consensus 258 ~~~~ra~~~le~yr~~L~GK~-vfF~pDsqLEIp-----LAR---FL~rcGm~~vEV--Gtpy~~~~~~~~el~ll-~~~ 325 (396) T cd01979 258 EREARAWRALEPYLDLLRGKS-IFFMGDNLLEIP-----LAR---FLTRCGMIVVEV--GTPYLDKRFQAAELELL-PPM 325 (396) T ss_pred HHHHHHHHHHHHHHHHHCCCE-EEECCCCCHHHH-----HHH---HHHHCCCEEEEE--CCCCCCHHHHHHHHHHC-CCC T ss_conf 899999999999999838974-998188602404-----899---999879878995--56654876678899746-999 Q ss_pred CCCCCHHHH Q ss_conf 346612489 Q 537021.9.peg.3 87 LRITDQQTA 95 (150) Q Consensus 87 ~RVTd~~tl 95 (150) .|+...... T Consensus 326 ~~i~E~~d~ 334 (396) T cd01979 326 VRIVEKPDN 334 (396) T ss_pred CCCCCCCCH T ss_conf 844307977 No 90 >TIGR00600 rad2 DNA excision repair protein (rad2); InterPro: IPR001044 Xeroderma pigmentosum (XP) is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. People's skin cells with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-G is one of the most rare and phenotypically heterogeneous of XP, showing anything from slight to extreme dysfunction in DNA excision repair , . XP-G can be corrected by a 133 Kd nuclear protein, XPGC . XPGC is an acidic protein that confers normal UV resistance in expressing cells . It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms , . XPGC cleaves one strand of the duplex at the border with the single-stranded region . XPG belongs to a family of proteins that includes RAD2 from Saccharomyces cerevisiae (Baker's yeast) and rad13 from Schizosaccharomyces pombe (Fission yeast), which are single-stranded DNA endonucleases , ; mouse and human FEN-1, a structure-specific endonuclease; RAD2 from fission yeast and RAD27 from budding yeast; fission yeast exo1, a 5'-3' double-stranded DNA exonuclease that may act in a pathway that corrects mismatched base pairs; yeast DHS1, and yeast DIN7. Sequence alignment of this family of proteins reveals that similarities are largely confined to two regions. The first is located at the N-terminal extremity (N-region) and corresponds to the first 95 to 105 amino acids. The second region is internal (I-region) and found towards the C-terminus; it spans about 140 residues and contains a highly conserved core of 27 amino acids that includes a conserved pentapeptide (E-A-[DE]-A-[QS]). It is possible that the conserved acidic residues are involved in the catalytic mechanism of DNA excision repair in XPG. The amino acids linking the N- and I-regions are not conserved. This entry represents XPGC, an acidic protein that confers normal UV resistance in expressing cells, can correct XP-G. It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms. XPGC cleaves one strand of the duplex at the border with the single-stranded region , .; GO: 0003697 single-stranded DNA binding, 0004519 endonuclease activity, 0006289 nucleotide-excision repair, 0005634 nucleus. Probab=57.82 E-value=11 Score=19.07 Aligned_cols=62 Identities=21% Similarity=0.346 Sum_probs=42.1 Q ss_pred CHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCC------CEEECCCCCCCHHHHHHH Q ss_conf 756724746899999999998518971799845724578899970898------115578346612489999 Q 537021.9.peg.3 33 GGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIK------SKFENGLRITDQQTAEVV 98 (150) Q Consensus 33 GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~------~~~v~G~RVTd~~tl~iv 98 (150) -|+++.+..+. .|++-|..|=-.|||||-|+-||.=+ |++.-|. -.....=|.|...-|... T Consensus 46 ~GN~i~n~Hl~-tlF~RlCKLLFFrIrPvFVFDG~aP~---LKrqTl~kR~~R~~~~~~~a~~T~~KLL~~f 113 (1127) T TIGR00600 46 EGNAIKNSHLL-TLFRRLCKLLFFRIRPVFVFDGGAPL---LKRQTLAKRRQRRDKASEDARKTAEKLLALF 113 (1127) T ss_pred CCCEEECCHHH-HHHHHHHHHHHCCCCEEEEECCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 68723050667-78999998861687316884088872---4689999998765323320488999999999 No 91 >pfam06953 ArsD Arsenical resistance operon trans-acting repressor ArsD. This family consists of several bacterial arsenical resistance operon trans-acting repressor ArsD proteins. ArsD is a trans-acting repressor of the arsRDABC operon that confers resistance to arsenicals and antimonials in Escherichia coli. It possesses two-pairs of vicinal cysteine residues, Cys(12)-Cys(13) and Cys(112)-Cys(113), that potentially form separate binding sites for the metalloids that trigger dissociation of ArsD from the operon. However, as a homodimer it has four vicinal cysteine pairs. Probab=57.42 E-value=15 Score=18.16 Aligned_cols=67 Identities=21% Similarity=0.341 Sum_probs=44.0 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 47468999999999985189717998457245788999708981155783466124899999997416689999999828 Q 537021.9.peg.3 38 NCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQT 117 (150) Q Consensus 38 ~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~ 117 (150) .|++|. .|..|+..|+. .|++.+..| .| ---++++ -|+.+...|... T Consensus 22 VD~~L~-~~aa~~~~lk~---------------------~gv~v~R~N---L~------~~P~~F~--~N~~V~~~L~~~ 68 (119) T pfam06953 22 VDPELV-RFAADLDWLKQ---------------------NGVEVERYN---LA------QQPMAFA--ENAVVKAFLETS 68 (119) T ss_pred CCHHHH-HHHHHHHHHHH---------------------CCCEEEECC---CC------CCHHHHH--HCHHHHHHHHHC T ss_conf 798999-99999999996---------------------794799716---63------2879997--279999999974 Q ss_pred CCCEEEEEEEECCEEEEEEC Q ss_conf 99768778655898999974 Q 537021.9.peg.3 118 GTQAIGICGKDGNMVFAEKA 137 (150) Q Consensus 118 g~~avglsg~dg~li~a~~~ 137 (150) |..+..++-+||.++.+.+. T Consensus 69 G~e~LPitlVDGeIv~~G~Y 88 (119) T pfam06953 69 GAEGLPLTLVDGEIVKTGRY 88 (119) T ss_pred CCCCCCEEEECCEEEEECCC T ss_conf 96548889998999881569 No 92 >PRK10530 phosphotransferase; Provisional Probab=57.36 E-value=17 Score=17.79 Aligned_cols=97 Identities=14% Similarity=0.251 Sum_probs=57.8 Q ss_pred CEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE--CCCCCCCHHHHHHH-HH Q ss_conf 8899986756724746-899999999998518971799845724-5788999708981155--78346612489999-99 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE--NGLRITDQQTAEVV-EM 100 (150) Q Consensus 26 ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v--~G~RVTd~~tl~iv-~~ 100 (150) |.|+.=+.|.++.+.. +.+...+.|..++..|+++++.-|=.+ .+...++++++...++ ||-.+-|...=+++ .. T Consensus 4 KlIa~DlDGTLl~~~~~i~~~~~~ai~~l~~~Gi~~~iaTGR~~~~~~~~~~~l~~~~~~I~~NGa~i~d~~~~~~l~~~ 83 (272) T PRK10530 4 RVIALDLDGTLLTPKKTILPSSLEALARAREAGYQLIIVTGRHHVAIHPFYQALALDTPAICCNGTYLYDYQAKTVLEAD 83 (272) T ss_pred CEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCEEEECCEEEEECCCCEEEEEC T ss_conf 29999087363589895999999999999978999999959987888999998099986897688599977998799954 Q ss_pred HHHHHHHHHHHHHHHHCCCCEE Q ss_conf 9741668999999982899768 Q 537021.9.peg.3 101 VLAGSINKKIVSLINQTGTQAI 122 (150) Q Consensus 101 vL~G~vn~~lv~~L~~~g~~av 122 (150) .+.-+.-..+...+.+.+++.. T Consensus 84 ~l~~~~~~~i~~~~~~~~~~~~ 105 (272) T PRK10530 84 PMPVQQALQLIELLNEHQIHGL 105 (272) T ss_pred CCCHHHHHHHHHHHHHCCCCEE T ss_conf 7999999999999998499579 No 93 >cd00128 XPG Xeroderma pigmentosum G N- and I-regions (XPGN, XPGI); contains the HhH2 motif; domain in nucleases. XPG is a eukaryotic enzyme that functions in nucleotide-excision repair and transcription-coupled repair of oxidative DNA damage. Functionally/structurally related to FEN-1; divalent metal ion-dependent exo- and endonuclease, and bacterial and bacteriophage 5'3' exonucleases. Probab=57.12 E-value=17 Score=17.76 Aligned_cols=50 Identities=18% Similarity=0.252 Sum_probs=33.5 Q ss_pred HHHHCCCEEEEEECHHH--------------HCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf 99875988999867567--------------247468999999999985189717998457245 Q 537021.9.peg.3 20 VQFYENETIVVKYGGHV--------------MNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQ 69 (150) Q Consensus 20 i~~~r~ktiVIK~GG~~--------------l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q 69 (150) +..|+||++.|=-.--+ -........|+.-+..|...|++||.|+-|.|- T Consensus 19 l~~l~gk~iAIDas~~Lyq~l~~~~~~~~~~~~~~~~l~g~~~ri~~Ll~~~I~PifVFDG~~p 82 (316) T cd00128 19 LEELRGKKVAIDASIWLYQFLKACRQELGSGGETTSHLQGFFYRTCRLLELGIKPVFVFDGKPP 82 (316) T ss_pred HHHCCCCEEEEECHHHHHHHHHHCCHHHCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 8990999899971899999998537233148986679999999999999779959999899898 No 94 >PRK00973 glucose-6-phosphate isomerase; Provisional Probab=56.96 E-value=17 Score=17.75 Aligned_cols=87 Identities=13% Similarity=0.232 Sum_probs=52.8 Q ss_pred HHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHH----HHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEEC- Q ss_conf 2899999999875988999867567247468999999----99998518971799845724-57889997089811557- Q 537021.9.peg.3 12 ILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVN----DIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFEN- 85 (150) Q Consensus 12 ~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~----dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v~- 85 (150) -+.++..+++.|.+ .+||-+|||.+--..+.+.+.. ....-..-+.++..|..-.| .+.+.|+.+..+...+. T Consensus 61 ~~~~~~~~~k~~~~-vV~IGIGGS~LG~~a~~~aL~~~~~n~~~~~~~~~~~i~f~~NvDp~~l~~~l~~ld~~~TlfiV 139 (454) T PRK00973 61 SYLELKEWIKNFEN-VVVLGIGGSALGNLALHTALLPLNYNELSKEERNGYRIFVLDNVDPEKTASILDVIDPKKTLFNV 139 (454) T ss_pred HHHHHHHHHCCCCE-EEEECCCHHHHHHHHHHHHHHHHCCCCCCHHCCCCCEEEEEECCCHHHHHHHHHHCCCCCEEEEE T ss_conf 99999997478987-99954353678999999995001123100111689759998089989999999747933079999 Q ss_pred -CCCCCCHHHHHHHH Q ss_conf -83466124899999 Q 537021.9.peg.3 86 -GLRITDQQTAEVVE 99 (150) Q Consensus 86 -G~RVTd~~tl~iv~ 99 (150) -=--|+.|||.-.. T Consensus 140 iSKSgtT~ETl~n~~ 154 (454) T PRK00973 140 ISKSGNTAETLANYL 154 (454) T ss_pred EECCCCCHHHHHHHH T ss_conf 848979689999999 No 95 >TIGR01292 TRX_reduct thioredoxin-disulfide reductase; InterPro: IPR005982 Reactive oxygen species (ROS) are known mediators of intracellular signaling cascades. Excessive production of ROS may, however, lead to oxidative stress, loss of cell function, and ultimately apoptosis or necrosis. A balance between oxidant and antioxidant intracellular systems is hence vital for cell function, regulation, and adaptation to diverse growth conditions. Thioredoxin reductase in conjunction with thioredoxin is a ubiquitous oxidoreductase system with antioxidant and redox regulatory roles. Thioredoxin reductase (1.8.1.9 from EC) reduces oxidised thioredoxin in the presence of NADPH. Reduced thioredoxin serves as an electron donor for thioredoxin peroxidase which consequently reduces H_2O_2 to H_2O. In mammals, extracellular forms of Trx also have cytokine-like effects. Mammalian TrxR has a highly reactive active site selenocysteine residue resulting in a profound reductive capacity, reducing several substrates in addition to Trx .; GO: 0004791 thioredoxin-disulfide reductase activity, 0019430 removal of superoxide radicals, 0005737 cytoplasm. Probab=56.33 E-value=18 Score=17.68 Aligned_cols=36 Identities=22% Similarity=0.426 Sum_probs=23.5 Q ss_pred HHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 987598899986756724746899999999998518971799845 Q 537021.9.peg.3 21 QFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHG 65 (150) Q Consensus 21 ~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHG 65 (150) ..||||.++ -+||. |..+ ++--.|+..+-+|.|||= T Consensus 147 ~ffk~K~V~-VvGGG---dsA~-----eEA~yL~~~a~kV~lvHR 182 (321) T TIGR01292 147 PFFKNKEVA-VVGGG---DSAL-----EEALYLTRIAKKVTLVHR 182 (321) T ss_pred HHHCCCEEE-EECCC---CHHH-----HHHHHHHHCCCEEEEEEC T ss_conf 120598899-98798---2488-----888999853876799977 No 96 >cd03141 GATase1_Hsp31_like Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein (EcHsp31). This group includes EcHsp31 and Saccharomyces cerevisiae Ydr533c protein. EcHsp31 has chaperone activity. Ydr533c is upregulated in response to various stress conditions along with the heat shock family. EcHsp31 coordinates a metal ion using a 2-His-1-carboxylate motif present in various ions that use iron as a cofactor such as Carboxypeptidase A. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1 domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For EcHsp31, this Cys together with a different His and, an Asp (rather than a Glu) residue form a different Probab=56.22 E-value=18 Score=17.67 Aligned_cols=54 Identities=13% Similarity=0.163 Sum_probs=39.8 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE------EEEEEECCEEEEE Q ss_conf 15578346612489999999741668999999982899768------7786558989999 Q 537021.9.peg.3 82 KFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAI------GICGKDGNMVFAE 135 (150) Q Consensus 82 ~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~av------glsg~dg~li~a~ 135 (150) -.+.|+|||.-..-+-...-+.+.+-..|...|.+.|.+=. .-..+||+||+.+ T Consensus 146 ~l~~Gk~vT~fs~~EE~~~~~~~~~Pf~le~~l~~~Ga~~~~~~~~~~~vvvDg~LITGq 205 (221) T cd03141 146 SLVAGKTVTGFTNEEEEAAGLKKVVPFLLEDELKELGANYVKAEPWAEFVVVDGRLITGQ 205 (221) T ss_pred EEECCCEEEECCCHHHHHCCCCCCCCCCHHHHHHHCCCEEEECCCCCCCEEEECCEEECC T ss_conf 155673785159799874154566874789999986998860689987389829977687 No 97 >TIGR00014 arsC arsenate reductase; InterPro: IPR006659 This entry describes a distinct clade, including ArsC itself, of the broader family of ArsC and related proteins. This clade is almost completely restricted to the proteobacteria. An anion-translocating ATPase has been identified as the product of the arsenical resistance operon of resistance plasmid R773 . When expressed in Escherichia coli this ATP-driven oxyanion pump catalyses extrusion of the oxyanions arsenite, antimonite and arsenate. The pump is composed of two polypeptides, the products of the arsA and arsB genes. The pump alone produces resistance to arsenite and antimonite. This protein, ArsC, catalyzes the reduction of arsenate to arsenite, and thus extends resistance to include arsenate. ; GO: 0008794 arsenate reductase (glutaredoxin) activity, 0006118 electron transport. Probab=55.49 E-value=6.5 Score=20.44 Aligned_cols=10 Identities=30% Similarity=0.737 Sum_probs=4.0 Q ss_pred CEEEEECCCH Q ss_conf 1799845724 Q 537021.9.peg.3 59 TPVIVHGGGP 68 (150) Q Consensus 59 k~ViVHGgg~ 68 (150) |||+|||.+. T Consensus 92 RPiVv~~~~A 101 (114) T TIGR00014 92 RPIVVAGDKA 101 (114) T ss_pred CCEEECCCCC T ss_conf 9857659964 No 98 >TIGR01465 cobM_cbiF precorrin-4 C11-methyltransferase; InterPro: IPR006362 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobM and CibF precorrin-4 C11-methyltransferase (2.1.1.133 from EC), both as stand-alone enzymes and when CobJ forms part of a bifunctional enzyme. In the aerobic pathway, CobM catalyses the methylation of precorrin-4 at C-11 to yield precorrin-5. The extruded acyl group is then removed in the subsequent step catalysed by CobF. In the anaerobic pathway, CibF catalyses the methylation of cobalt-precorrin-4 to cobalt-precoriin-5 . Nomenclature note: occasionally precorrin-4 C11-methyltransferase is one of two methyltransferases often referred to as precorrin-3 methylase (the other is precorrin-3 methylase (the other is precorrin-4 C11-methyltransferase, 2.1.1.131 from EC). ; GO: 0046026 precorrin-4 C11-methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=55.12 E-value=18 Score=17.55 Aligned_cols=126 Identities=17% Similarity=0.223 Sum_probs=86.2 Q ss_pred HHHHCCCHHHHHHHHHHHH--CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCC-- Q ss_conf 2322062289999999987--598899986756724746899999999998518971799845724578899970898-- Q 537021.9.peg.3 5 IYQFQAEILEQVLPFVQFY--ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIK-- 80 (150) Q Consensus 5 ~~~~~a~~l~ea~pYi~~~--r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~-- 80 (150) +.-..+..|.|-.|.+.++ +||.+|=-=.| ||.+.-++.++...|..+|+..-+|.|-.-..- ....++.| T Consensus 49 ~~~sA~m~L~ei~~~m~~a~~~GK~VvRLHsG----DPsIYGA~~EQ~~~L~~~gI~~e~vPGvSsf~A-AAA~l~~ELT 123 (252) T TIGR01465 49 VVNSAAMSLEEIVDIMVDAVREGKLVVRLHSG----DPSIYGAIAEQMQLLEALGIPYEVVPGVSSFFA-AAAALGAELT 123 (252) T ss_pred EEECCCCCHHHHHHHHHHHHHCCCEEEEEECC----CHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHH-HHHHHHCCCC T ss_conf 86050269889999999998669849998508----755776699999999867897798688738989-9997300147 Q ss_pred ----CE-----EECCCCCCCHHHHHHHHHH---------HHHHHHHHHHHHHHHCCCCE---EE----EEEEECCEEEEE Q ss_conf ----11-----5578346612489999999---------74166899999998289976---87----786558989999 Q 537021.9.peg.3 81 ----SK-----FENGLRITDQQTAEVVEMV---------LAGSINKKIVSLINQTGTQA---IG----ICGKDGNMVFAE 135 (150) Q Consensus 81 ----~~-----~v~G~RVTd~~tl~iv~~v---------L~G~vn~~lv~~L~~~g~~a---vg----lsg~dg~li~a~ 135 (150) ++ ...|.|.-=||-=++...+ |+..++.+++..|-.+|=.. +. .|.=|-.++.++ T Consensus 124 ~P~vsQtvilTR~eG~RtPmPe~E~l~~lA~hgaTm~IfLs~~~~~~vv~~L~~~GY~~DTPV~vVyratWPDE~I~Rgt 203 (252) T TIGR01465 124 VPEVSQTVILTRAEGRRTPMPEGEKLADLAKHGATMAIFLSAHIIDKVVKELIEGGYSEDTPVAVVYRATWPDEKIVRGT 203 (252) T ss_pred CCCCEEEEEEECCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCEEEECC T ss_conf 88403424467543435457766789988741231356778999999999998377688887898985148756589715 No 99 >pfam08282 Hydrolase_3 haloacid dehalogenase-like hydrolase. This family contains haloacid dehalogenase-like hydrolase enzymes. Probab=54.49 E-value=19 Score=17.49 Aligned_cols=26 Identities=15% Similarity=0.247 Sum_probs=10.9 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEECHH Q ss_conf 62289999999987598899986756 Q 537021.9.peg.3 10 AEILEQVLPFVQFYENETIVVKYGGH 35 (150) Q Consensus 10 a~~l~ea~pYi~~~r~ktiVIK~GG~ 35 (150) +..+.++.|+++++.-...+|-..|. T Consensus 39 GR~~~~~~~~~~~l~~~~~~I~~NGa 64 (254) T pfam08282 39 GRPYRGALPVLEELGLDLPVICFNGA 64 (254) T ss_pred CCCHHHHHHHHHHCCCCCCEEECCCC T ss_conf 98799999999980999759977971 No 100 >PRK13385 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Provisional Probab=53.91 E-value=18 Score=17.64 Aligned_cols=59 Identities=17% Similarity=0.300 Sum_probs=27.3 Q ss_pred CCCHHHHHHHHHHHHC--CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHC----CCCEEEEE-CCCHHHHH Q ss_conf 0622899999999875--98899986756724746899999999998518----97179984-57245788 Q 537021.9.peg.3 9 QAEILEQVLPFVQFYE--NETIVVKYGGHVMNCTDLSKDFVNDIALLKKS----NITPVIVH-GGGPQIGA 72 (150) Q Consensus 9 ~a~~l~ea~pYi~~~r--~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~----G~k~ViVH-Ggg~qI~~ 72 (150) ...++.+...++.+|. ++.+-+-.||.. . .+++..-+..+..+ .-..|+|| +++|.++. T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~v~iv~GG~t----R-~~SV~ngl~~l~~~~~~~~~~~VlIHDaARP~vs~ 119 (238) T PRK13385 54 PAQWISHTQDILKKYNITDQRVKVVAGGTD----R-NETIMNIIDHIRNVNGINNDDVIVTHDAVRPFLTQ 119 (238) T ss_pred CHHHHHHHHHHHHHCCCCCCCEEEECCCCC----H-HHHHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCH T ss_conf 788899999999861544476899328986----4-89999999998741147876889995466778999 No 101 >cd01976 Nitrogenase_MoFe_alpha Nitrogenase_MoFe_alpha_II: Nitrogenase MoFe protein, beta subunit. A group of proteins similar to the alpha subunit of the MoFe protein of the molybdenum (Mo-) nitrogenase. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. The Mo-nitrogenase is the most widespread and best characterized of these systems. Mo-nitrogenase consists of the MoFe protein (component 1) and the Fe protein (component 2). MoFe is an alpha2beta2 tetramer. Each alphabeta pair of MoFe contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha subunit. The Fe protein contains a single [4Fe-4S] cluster. Electrons are transferred from the [4Fe-4S] cluster of the Fe protein to the P-cluster of the MoFe and in turn to FeMoCo, the site of substrate reduction. Probab=53.70 E-value=19 Score=17.41 Aligned_cols=67 Identities=9% Similarity=0.096 Sum_probs=40.1 Q ss_pred HHHHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE---ECCCHHHHHHHHHCCCC Q ss_conf 23220622899999999875988999867567247468999999999985189717998---45724578899970898 Q 537021.9.peg.3 5 IYQFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV---HGGGPQIGAVLEKMGIK 80 (150) Q Consensus 5 ~~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV---HGgg~qI~~~l~~~gi~ 80 (150) |.+..+.+..+..+|-.+++||...|-.||. ... .++ .+++.+|+++|.+ .|-..-..+..+..+.. T Consensus 280 I~~e~~~~~~~l~~yr~~L~GKrv~i~~Gg~---~~~---~~i---~~~~eLGmevV~~g~~~~~~~Dye~~~~~~~~~ 349 (421) T cd01976 280 IAEYKPAMEAVIAKYRPRLEGKTVMLYVGGL---RPR---HYI---GAYEDLGMEVVGTGYEFAHRDDYERTEVIPKEG 349 (421) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEECCCC---CHH---HHH---HHHHHCCCEEEEEEEECCCHHHHHHHHHHCCCC T ss_conf 9999999999999999870898899989987---468---999---999987988999731227476799998508988 No 102 >PRK09444 pntB pyridine nucleotide transhydrogenase; Provisional Probab=50.24 E-value=22 Score=17.06 Aligned_cols=39 Identities=26% Similarity=0.390 Sum_probs=30.2 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCH----------HHHHHHHHCCCCCEE Q ss_conf 999999998518971799845724----------578899970898115 Q 537021.9.peg.3 45 DFVNDIALLKKSNITPVIVHGGGP----------QIGAVLEKMGIKSKF 83 (150) Q Consensus 45 ~~~~dI~ll~~~G~k~ViVHGgg~----------qI~~~l~~~gi~~~~ 83 (150) .=.+|.+.+-.--.++|||.|-|= ++.+.|++.|++.+| T Consensus 291 ~~~~d~a~~l~~A~~ViIVPGYGmAVAqAQh~v~el~~~L~~~g~~V~f 339 (459) T PRK09444 291 TTAEEVAEMLKNSHSVIITPGYGMAVAQAQYPVAEITEKLRARGINVRF 339 (459) T ss_pred CCHHHHHHHHHCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 3789999999709949996680899998889999999999977985899 No 103 >COG2054 Uncharacterized archaeal kinase related to aspartokinases, uridylate kinases [General function prediction only] Probab=49.66 E-value=21 Score=17.21 Aligned_cols=73 Identities=23% Similarity=0.239 Sum_probs=38.0 Q ss_pred EEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHH--HHHHHHHHHHHHH Q ss_conf 99867567247468999999999985189717998457245788999708981155783466124--8999999974166 Q 537021.9.peg.3 29 VVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQ--TAEVVEMVLAGSI 106 (150) Q Consensus 29 VIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~--tl~iv~~vL~G~v 106 (150) |||.||+...+ . ...+.+ .+..++.++++|.|||-..|---+ +.- -.+..|.. -|.|..|-..|.. T Consensus 3 vVk~~Gs~~~~-~-~~~~~~---ale~~~~~i~iVpGGg~FAd~VR~---id~----~~~lSdsasHwmAI~~Md~~G~~ 70 (212) T COG2054 3 VVKKGGSGVAE-R-AAAVKE---ALENLQRSILIVPGGGIFADLVRK---IDE----EFGLSDSASHWMAITAMDQYGFY 70 (212) T ss_pred EEEECCCCHHH-H-HHHHHH---HHHHHCCEEEEECCCHHHHHHHHH---HHH----HCCCCCCHHHHHHHHHHHHHHHH T ss_conf 68862787088-8-899999---998414608995184289999999---987----45987507889999999998899 Q ss_pred HHHHHHH Q ss_conf 8999999 Q 537021.9.peg.3 107 NKKIVSL 113 (150) Q Consensus 107 n~~lv~~ 113 (150) -.++... T Consensus 71 lad~~~~ 77 (212) T COG2054 71 LADLASR 77 (212) T ss_pred HHHHHCC T ss_conf 9864300 No 104 >TIGR00453 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; InterPro: IPR001228 The bacterial ispD protein catalyzes the third step of the deoxyxylulose-5-phosphate pathway (DXP) of isoprenoid biosynthesis; the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate.; GO: 0008299 isoprenoid biosynthetic process. Probab=48.60 E-value=19 Score=17.46 Aligned_cols=34 Identities=21% Similarity=0.378 Sum_probs=16.2 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHH-CCCCEEEEEC Q ss_conf 89998675672474689999999999851-8971799845 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKK-SNITPVIVHG 65 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~-~G~k~ViVHG 65 (150) -+.+.-||+. . .+++..=+..+.. -+.+.||||- T Consensus 70 ~~~~v~GG~~----R-q~SV~~GL~a~~~~~~~~~VlvHD 104 (226) T TIGR00453 70 VVKIVAGGDT----R-QDSVRNGLKALPERADAEIVLVHD 104 (226) T ss_pred CCEEECCCCC----H-HHHHHHHHHHHHHCCCCCEEEEEC T ss_conf 1158469874----6-899999999876358988289847 No 105 >PTZ00114 Heat shock protein 60; Provisional Probab=47.93 E-value=24 Score=16.83 Aligned_cols=113 Identities=16% Similarity=0.174 Sum_probs=54.0 Q ss_pred HHHHHHCCCEEEEEECHHHH--CCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECC-----CCCC Q ss_conf 99998759889998675672--4746899999999998518971799845724578899970898115578-----3466 Q 537021.9.peg.3 18 PFVQFYENETIVVKYGGHVM--NCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENG-----LRIT 90 (150) Q Consensus 18 pYi~~~r~ktiVIK~GG~~l--~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G-----~RVT 90 (150) .++.++.+++.+|++||+.= .++. .+++-..+...+..=..- +|.|||...-+....+.- .+-.++ .++- T Consensus 397 erla~lsg~va~I~vGG~te~el~E~-errl~DAL~avraAveeG-IVpGGGaall~as~~L~~-~~~~~~~eq~gi~~~ 473 (580) T PTZ00114 397 ERLAKITGGVALIKVGGISEVEVNEI-KDRIQDALCATKAAVEEG-IVPGGGSALLFASKELDS-VQTDNYDQRVGVNII 473 (580) T ss_pred HHHHCCCCCEEEEEECCCCHHHHHHH-HHHHHHHHHHHHHHHHCC-EECCCCHHHHHHHHHHHH-HHCCCHHHHHHHHHH T ss_conf 77511477579999688728889999-999987899999997658-216860999999999876-225888799999999 Q ss_pred CHHHHHHHHHHH---HHHHHHHHHHHHHHCCCCEEEEEEEECCEEEE Q ss_conf 124899999997---41668999999982899768778655898999 Q 537021.9.peg.3 91 DQQTAEVVEMVL---AGSINKKIVSLINQTGTQAIGICGKDGNMVFA 134 (150) Q Consensus 91 d~~tl~iv~~vL---~G~vn~~lv~~L~~~g~~avglsg~dg~li~a 134 (150) .++|++.-..| +|.=-..++..|.+.+-...|+....|.+.-. T Consensus 474 -a~ALe~ppr~IaeNAG~d~~~vl~~l~~~~~~~~G~D~~tG~~~Dm 519 (580) T PTZ00114 474 -KDACKAPIKQIAENAGHEGSVVAGNILKEKNKNIGFNAQEGKYVDM 519 (580) T ss_pred -HHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCEEEECCCCEEEEH T ss_conf -9999999999999769978999999996069998486789907356 No 106 >PRK00694 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; Validated Probab=47.57 E-value=23 Score=16.92 Aligned_cols=53 Identities=21% Similarity=0.202 Sum_probs=20.8 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCC-------HHHHHHHHHCCCCCEEECCCCCCCHHHHH Q ss_conf 999999999851897179984572-------45788999708981155783466124899 Q 537021.9.peg.3 44 KDFVNDIALLKKSNITPVIVHGGG-------PQIGAVLEKMGIKSKFENGLRITDQQTAE 96 (150) Q Consensus 44 ~~~~~dI~ll~~~G~k~ViVHGgg-------~qI~~~l~~~gi~~~~v~G~RVTd~~tl~ 96 (150) ++-+++|..|...|..+|=|-=-+ +.|.+.|.+.|++...+..+-+....++. T Consensus 45 ~atv~Qi~~l~~aGceiVRvtvp~~~~A~al~~I~~~l~~~~~~iPlvADIHF~~~~a~~ 104 (606) T PRK00694 45 DGTVRQICALQEHGCDIVRVTVQGIKEAQACEHIKERLIQQGISIPLVADIHFFPQAAMH 104 (606) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHH T ss_conf 999999999998599989987799999986999999986369999988666888899999 No 107 >pfam02233 PNTB NAD(P) transhydrogenase beta subunit. This family corresponds to the beta subunit of NADP transhydrogenase in prokaryotes, and either the protein N- or C terminal in eukaryotes. The domain is often found in conjunction with pfam01262. Pyridine nucleotide transhydrogenase catalyses the reduction of NAD+ to NADPH. A complete loss of activity occurs upon mutation of Gly314 in E. coli. Probab=47.27 E-value=24 Score=16.77 Aligned_cols=43 Identities=26% Similarity=0.342 Sum_probs=32.8 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCH----------HHHHHHHHCCCCCEE----ECCC Q ss_conf 999999998518971799845724----------578899970898115----5783 Q 537021.9.peg.3 45 DFVNDIALLKKSNITPVIVHGGGP----------QIGAVLEKMGIKSKF----ENGL 87 (150) Q Consensus 45 ~~~~dI~ll~~~G~k~ViVHGgg~----------qI~~~l~~~gi~~~~----v~G~ 87 (150) .=.+|.+.+-.--.++|||.|-|= ++.+.|+++|++.+| +.|| T Consensus 296 ~~~~d~a~~l~~A~~ViIVPGYGmAvAqAQ~~v~el~~~L~~~g~~V~faIHPVAGR 352 (464) T pfam02233 296 ITAEEAADLLLNASSVIIVPGYGMAVAQAQHPVADLAKLLEERGVNVRFAIHPVAGR 352 (464) T ss_pred CCHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf 489999999971993999778089999878999999999997898699985766888 No 108 >pfam01973 MAF_flag10 Protein of unknown function DUF115. This family of archaebacterial proteins has no known function. Probab=46.58 E-value=25 Score=16.70 Aligned_cols=27 Identities=11% Similarity=0.362 Sum_probs=16.9 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEECHHH Q ss_conf 622899999999875988999867567 Q 537021.9.peg.3 10 AEILEQVLPFVQFYENETIVVKYGGHV 36 (150) Q Consensus 10 a~~l~ea~pYi~~~r~ktiVIK~GG~~ 36 (150) +|.|.+..+||++++++-++|-.+.+. T Consensus 33 GPSL~~~i~~Lk~~~~~~~iia~~~a~ 59 (169) T pfam01973 33 GPSLDKHLPLLKKNRDKAVIIAADSAL 59 (169) T ss_pred CCCHHHHHHHHHHHCCCEEEEEECHHH T ss_conf 899899999999716983999966799 No 109 >PTZ00063 histone deacetylase; Provisional Probab=46.36 E-value=25 Score=16.68 Aligned_cols=118 Identities=16% Similarity=0.290 Sum_probs=69.8 Q ss_pred CHHHHH-HHHHHHHCCCEEEEEECHHHHCCHH------HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH-------- Q ss_conf 228999-9999987598899986756724746------8999999999985189717998457245788999-------- Q 537021.9.peg.3 11 EILEQV-LPFVQFYENETIVVKYGGHVMNCTD------LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLE-------- 75 (150) Q Consensus 11 ~~l~ea-~pYi~~~r~ktiVIK~GG~~l~d~~------l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~-------- 75 (150) .+|+.+ .|-+..|+=+.||+-+|.+.+..+. ....-.+-+..+++.++.++++=|||--+..... T Consensus 237 ~iF~~vi~~v~~~f~P~~IVlQcGaDsl~gD~Lg~fnls~~gh~~cv~~~~~~~~p~l~lGGGGY~~~nvarcW~yeT~~ 316 (439) T PTZ00063 237 DLFKVVIGKCVEVYRPGAIVLQCGADSLTGDRLGRFNLTIKGHARCVEFVRSLNIPLLVLGGGGYTIRNVARCWAYETGV 316 (439) T ss_pred HHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 99999999999986999799946765446784565324658799999999960998899767887851299999999999 Q ss_pred HCC----C----C---------CEEE---CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCEEEEEEEECCEEE Q ss_conf 708----9----8---------1155---78346612489999999741668999999982-89976877865589899 Q 537021.9.peg.3 76 KMG----I----K---------SKFE---NGLRITDQQTAEVVEMVLAGSINKKIVSLINQ-TGTQAIGICGKDGNMVF 133 (150) Q Consensus 76 ~~g----i----~---------~~~v---~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~-~g~~avglsg~dg~li~ 133 (150) .+| | + |.|. .--...+..+-+..+ ++...+...|+. .+.++|.+.-+--.++. T Consensus 317 ~l~~~~~i~~~iP~~~y~~~~~pd~~L~~~~~~~~n~Nt~~~l~-----~i~~~~~e~l~~~~~apsvq~~~~p~d~~~ 390 (439) T PTZ00063 317 ILNKHHEMSDQISLNDYYDYYAPDFQLHLQPSNMPNYNSPEHLD-----KIKMKIIDNLRYVEHAPGVQFAYVPPDFFD 390 (439) T ss_pred HHCCCCCCCCCCCCCCCHHHHCCCEEEECCCCCCCCCCCHHHHH-----HHHHHHHHHHHCCCCCCCCCCCCCCCCCCC T ss_conf 94886668898998735355289736306987776789899999-----999999999716899897663548897567 No 110 >TIGR00656 asp_kin_monofn asparate kinase, monofunctional class; InterPro: IPR005260 Aspartate kinase catalyzes the first step in the biosynthesis of Lys (via its precursor diaminopimelate), Met, and Thr. In E. coli, a distinct isozyme is inhibited by each of these three amino acid end products . The Met-sensitive (I) and Thr-sensitive (II) forms are bifunctional enzymes fused to homoserine dehydrogenase, which catalyses the next step in the biosynthesis of these amino acids. Inn contrast, the Lys-sensitive form (III) is a monofunctional enzyme; homoserine dehydrogenase is not part of the Lys biosynthetic pathway. This entry represents a subclass of aspartate kinases that are mostly Lys-sensitive and are not fused to homoserine dehydrogenase. The E. coli enzyme is a homodimer, while the Bacillus and Corynebacterium enzymes are alpha 2/beta 2 heterotetramers, where the beta subunit is translated from an in-phase alternative initiator at Met-246 , . The protein slr0657 from Synechocystis PCC6803 is extended by a duplication of the C-terminal region corresponding to the beta chain. Incorporation of a second copy of the C-terminal domain may be quite common in this subgroup of aspartokinases. ; GO: 0004072 aspartate kinase activity, 0009058 biosynthetic process. Probab=45.35 E-value=26 Score=16.58 Aligned_cols=54 Identities=20% Similarity=0.204 Sum_probs=40.7 Q ss_pred CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCC Q ss_conf 988999867567247468999999999985189717998457245788999708 Q 537021.9.peg.3 25 NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMG 78 (150) Q Consensus 25 ~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~g 78 (150) .+++|.|+||..+.+.+-+.+.++-+......|.++|+|--+-...+..+-... T Consensus 1 ~~~~v~kfGGt~~~~~~~~~~~~~~~~~~~~~g~~~~vv~sa~~~~td~l~~~~ 54 (480) T TIGR00656 1 MELIVQKFGGTSVGSGERIKNAARIVLKEKEEGHKVVVVVSAMGGVTDALLSLA 54 (480) T ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHHHH T ss_conf 944787426721254267888887888776407714899861565202466554 No 111 >cd01974 Nitrogenase_MoFe_beta Nitrogenase_MoFe_beta: Nitrogenase MoFe protein, beta subunit. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. The Molybdenum (Mo-) nitrogenase is the most widespread and best characterized of these systems. Mo-nitrogenase consists of the MoFe protein (component 1) and the Fe protein (component 2). MoFe is an alpha2beta2 tetramer. This group contains the beta subunit of the MoFe protein. Each alphabeta pair of MoFe contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha subunit. The Fe protein contains a single [4Fe-4S] cluster. Electrons are transferred from the [4Fe-4S] cluster of the Fe protein to the P-cluster of the MoFe and in turn to FeMoCo, the site of substrate reduction. Probab=45.21 E-value=26 Score=16.57 Aligned_cols=65 Identities=15% Similarity=0.320 Sum_probs=38.1 Q ss_pred CHHHHHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE--ECCCHHHHHHHHH Q ss_conf 0023220622899999999875988999867567247468999999999985189717998--4572457889997 Q 537021.9.peg.3 3 EKIYQFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV--HGGGPQIGAVLEK 76 (150) Q Consensus 3 ~~~~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV--HGgg~qI~~~l~~ 76 (150) +++.++++.+......|-..+.||.+.|- |+. ...-.+. .+|..+|+.++.| ..+.+...+.+++ T Consensus 281 ~~i~~er~r~~d~~~d~~~~l~gkrvai~-g~~-----~~~~~l~---~~l~elG~~~~~vv~~~~~~~~~~~i~~ 347 (435) T cd01974 281 EELEEERGRLVDAMTDSHQYLHGKKFALY-GDP-----DFLIGLT---SFLLELGMEPVHVLTGNGGKRFEKEMQA 347 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCEEEEE-CCH-----HHHHHHH---HHHHHCCCEEEEEEECCCCHHHHHHHHH T ss_conf 99999999999999999887429679998-882-----8999999---9999889978999979997789999999 No 112 >TIGR00705 SppA_67K signal peptide peptidase SppA, 67K type; InterPro: IPR004634 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidases belong to MEROPS peptidase family S49 (protease IV family, clan S-). The predicted active site serine for members of this family occurs in a transmembrane domain. Signal peptides of secretory proteins seem to serve at least two important biological functions. First, they are required for protein targeting to and translocation across membranes, such as the eubacterial plasma membrane and the endoplasmic reticular membrane of eukaryotes. Second, in addition to their role as determinants for protein targeting and translocation, certain signal peptides have a signalling function. During or shortly after pre-protein translocation, the signal peptide is removed by signal peptidases. The integral membrane protein, SppA (protease IV), of Escherichia coli was shown experimentally to degrade signal peptides. The member of this family from Bacillus subtilis has only been shown to be required for efficient processing of pre-proteins under conditions of hyper-secretion . These enzymes have a molecular mass around 67 kDa and a duplication such that the N-terminal half shares extensive homology with the C-terminal half and was shown in E. coli to form homotetramers. E. coli SohB, which is most closely homologous to the C-terminal duplication of SppA, is predicted to perform a similar function of small peptide degradation, but in the periplasm. Many prokaryotes have a single SppA/SohB homolog that may perform the function of either or both. ; GO: 0009003 signal peptidase activity, 0006465 signal peptide processing, 0016021 integral to membrane. Probab=45.17 E-value=15 Score=18.17 Aligned_cols=75 Identities=23% Similarity=0.316 Sum_probs=37.2 Q ss_pred CHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCC--------CHHHHHHHHHHHHH Q ss_conf 7567247468999999999985189717998457245788999708981155783466--------12489999999741 Q 537021.9.peg.3 33 GGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRIT--------DQQTAEVVEMVLAG 104 (150) Q Consensus 33 GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVT--------d~~tl~iv~~vL~G 104 (150) ||++..++- +=+.++.++..|.|||||==|+=..+ .|+.|. +|.|+-=-.-++ T Consensus 371 GGsv~Ase~----IR~e~~~~~~~GkKPVivSMG~~AAS-------------GgYWiasaA~yIvA~p~TiTGSIGvf-- 431 (614) T TIGR00705 371 GGSVFASEI----IRRELERLQARGKKPVIVSMGAMAAS-------------GGYWIASAADYIVADPNTITGSIGVF-- 431 (614) T ss_pred CCCEEHHHH----HHHHHHHHHHCCCCCEEEECCHHHHC-------------CCCHHCCCCCEEEECCCCCCCCHHHH-- T ss_conf 863428789----99999999826899789843502320-------------53002045571334787431001445-- Q ss_pred HHHHHHHHHHHH-CCCCEEEEEE Q ss_conf 668999999982-8997687786 Q 537021.9.peg.3 105 SINKKIVSLINQ-TGTQAIGICG 126 (150) Q Consensus 105 ~vn~~lv~~L~~-~g~~avglsg 126 (150) .|=+..+..+.+ .|++.=+++. T Consensus 432 svl~t~En~~~~y~Gv~~D~V~t 454 (614) T TIGR00705 432 SVLPTVENSLDRYIGVHVDGVST 454 (614) T ss_pred HHHHHHHHHHHCEECEEECCEEE T ss_conf 32522554231000223163342 No 113 >PRK08605 D-lactate dehydrogenase; Validated Probab=44.96 E-value=27 Score=16.54 Aligned_cols=36 Identities=17% Similarity=0.158 Sum_probs=24.5 Q ss_pred HHHHHCCCCEEEEECCCHH-HH-HHHHHCCCCCEEECC Q ss_conf 9985189717998457245-78-899970898115578 Q 537021.9.peg.3 51 ALLKKSNITPVIVHGGGPQ-IG-AVLEKMGIKSKFENG 86 (150) Q Consensus 51 ~ll~~~G~k~ViVHGgg~q-I~-~~l~~~gi~~~~v~G 86 (150) ..+..+|+|+|--+|.|-- || ..++++||.....-| T Consensus 63 ~~~~~l~iK~I~r~gvG~DnIDl~~a~~~gI~V~n~P~ 100 (332) T PRK08605 63 KLLNELGIKQIAQRSAGFDTYDLELANKYNIIISNVPS 100 (332) T ss_pred HHHHHCCCEEEEECCCCCCHHHHHHHHHCCCEEEECCC T ss_conf 96663697799977721745469999979998995998 No 114 >TIGR02078 AspKin_pair Pyrococcus aspartate kinase subunit, putative; InterPro: IPR011819 This family consists of proteins restricted to and found as paralogous pairs (typically close together) in species of Pyrococcus, a hyperthermophilic archaeal genus. Members are always found close to other genes of threonine biosynthesis and appear to represent the Pyrococcal form of aspartate kinase. Alignment to aspartokinase III from Escherichia coli shows that 300 N-terminal and 20 C-terminal amino acids are homologous, but the form in Pyrococcus lacks ~100 amino acids in between.. Probab=43.97 E-value=27 Score=16.45 Aligned_cols=46 Identities=30% Similarity=0.398 Sum_probs=27.4 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHC--CCCEEEEECCCHHHHHHHHHC Q ss_conf 899986756724746899999999998518--971799845724578899970 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKS--NITPVIVHGGGPQIGAVLEKM 77 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~--G~k~ViVHGgg~qI~~~l~~~ 77 (150) .+|+|+||+-+. .+|-+.+.|...+ ..++|+|--+=.=|+-.|-+. T Consensus 1 ~iV~KFGGsSvr-----~~F~eA~~L~~~L~E~~~v~VVVSALkGiTD~L~~y 48 (341) T TIGR02078 1 MIVVKFGGSSVR-----YDFEEALELVKSLSEEKEVVVVVSALKGITDELIRY 48 (341) T ss_pred CEEEEECCCCHH-----HHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHH T ss_conf 968985684136-----668999999998516986899985344425799987 No 115 >PRK10637 cysG siroheme synthase; Provisional Probab=42.73 E-value=29 Score=16.33 Aligned_cols=69 Identities=19% Similarity=0.172 Sum_probs=29.4 Q ss_pred HHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH-------HHHHHCCCCCEEECC Q ss_conf 999999987598899986756724746899999999998518971799845724578-------899970898115578 Q 537021.9.peg.3 15 QVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG-------AVLEKMGIKSKFENG 86 (150) Q Consensus 15 ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~-------~~l~~~gi~~~~v~G 86 (150) +.+.|+.+...++.|=|-.|..-.. .+.+.+-+......|.++|=.|||-|.|= +.|.+.||+.+.+-| T Consensus 254 ~il~~~~~~ae~i~vgK~~g~~~~~---q~~i~~llv~~a~~G~~VvRLk~GDP~ifGr~~EE~~~l~~~Gi~~eVvPG 329 (457) T PRK10637 254 DIMNLVRRDADRVFVGKRAGYHCVP---QEEINQILLREAQKGKRVVRLKGGDPFIFGRGGEELETLCNAGIPFSVVPG 329 (457) T ss_pred HHHHHCCCCCEEEEEECCCCCCCCC---HHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHCCCCEEEECC T ss_conf 9996447898899960678888878---999999999998589979998489986434389999999978998899899 No 116 >TIGR02472 sucr_P_syn_N sucrose-phosphate synthase, putative, glycosyltransferase domain; InterPro: IPR012822 This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (IPR012821 from INTERPRO), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant sucrose phosphate synthase (SPS). At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase.. Probab=42.43 E-value=15 Score=18.05 Aligned_cols=78 Identities=17% Similarity=0.217 Sum_probs=63.2 Q ss_pred HHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCC--CCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 67247468999999999985189717998457245788999708981155783--4661248999999974166899999 Q 537021.9.peg.3 35 HVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGL--RITDQQTAEVVEMVLAGSINKKIVS 112 (150) Q Consensus 35 ~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~--RVTd~~tl~iv~~vL~G~vn~~lv~ 112 (150) .+..+|.|.+.|==.+-.-.+.|..+|=---|||+ .|-.+..||+ =+||.|.++.+-|- +. -+.+=.. T Consensus 348 GiFVNPALTEPFGLTLlEAAAcGLPivAT~DGGP~--------dI~~~C~NGLLvd~ld~e~i~~AL~~-al-sd~~QW~ 417 (445) T TIGR02472 348 GIFVNPALTEPFGLTLLEAAACGLPIVATEDGGPR--------DIIANCRNGLLVDVLDLEAIASALEQ-AL-SDSSQWQ 417 (445) T ss_pred CEEECCCCCCCCHHHHHHHHHCCCCEEECCCCCCH--------HHHHHCCCCCEECCCCHHHHHHHHHH-HC-CCHHHHH T ss_conf 86762721253016899999769972107864866--------88842888750057898999999997-33-8906678 Q ss_pred HHHHCCCCEE Q ss_conf 9982899768 Q 537021.9.peg.3 113 LINQTGTQAI 122 (150) Q Consensus 113 ~L~~~g~~av 122 (150) .-+++|+.+| T Consensus 418 ~Ws~NGi~gV 427 (445) T TIGR02472 418 TWSDNGIEGV 427 (445) T ss_pred HHHHHHHHHH T ss_conf 9998557764 No 117 >COG1211 IspD 4-diphosphocytidyl-2-methyl-D-erithritol synthase [Lipid metabolism] Probab=42.08 E-value=29 Score=16.26 Aligned_cols=20 Identities=20% Similarity=0.361 Sum_probs=8.8 Q ss_pred HCCCCEEEEECCCHHHHHHH Q ss_conf 18971799845724578899 Q 537021.9.peg.3 55 KSNITPVIVHGGGPQIGAVL 74 (150) Q Consensus 55 ~~G~k~ViVHGgg~qI~~~l 74 (150) ..+.++.+|+||....+..+ T Consensus 70 ~~~~~v~~v~GG~~R~~SV~ 89 (230) T COG1211 70 SADKRVEVVKGGATRQESVY 89 (230) T ss_pred CCCCEEEEECCCCCHHHHHH T ss_conf 24870799348751799999 No 118 >cd03134 GATase1_PfpI_like A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. This group includes proteins similar to PfpI from P. furiosus. and PH1704 from Pyrococcus horikoshii. These enzymes are ATP-independent intracellular proteases and may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For PH1704, it is believed that this Cys together with a different His in one monomer and Glu (from an adjacent monomer) forms a different catalytic triad from the typical GATase1domain. PfpI is homooligomeric. Protease activity is only found for oligomeric forms of PH1704. Probab=41.88 E-value=30 Score=16.24 Aligned_cols=67 Identities=19% Similarity=0.291 Sum_probs=37.0 Q ss_pred HHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEE-EE Q ss_conf 99999985189717998457245788999708981155783466124899999997416689999999828997687-78 Q 537021.9.peg.3 47 VNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIG-IC 125 (150) Q Consensus 47 ~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avg-ls 125 (150) .+=+...+..| ++|---..||.+ |.+.| ..+|+|+|.-..+ ...|...|.+-+. -- T Consensus 84 ~~~i~~~~~~~-k~iaaIC~gp~~---L~~~g----ll~g~k~T~~~~~---------------~~~l~~~G~~~~~~~V 140 (165) T cd03134 84 VAFVRAFAEAG-KPVAAICHGPWV---LISAG----VVRGRKLTSYPSI---------------KDDLINAGANWVDEEV 140 (165) T ss_pred HHHHHHHHHHC-CCEEEECCHHHH---HHHCC----CCCCCEEEECCCH---------------HHHHHHCCCEEECCCE T ss_conf 99999999719-929999875787---74578----2589889867748---------------9999987997946998 Q ss_pred EEECCEEEEEE Q ss_conf 65589899997 Q 537021.9.peg.3 126 GKDGNMVFAEK 136 (150) Q Consensus 126 g~dg~li~a~~ 136 (150) .+|||+|+++= T Consensus 141 vvDgnlIT~~g 151 (165) T cd03134 141 VVDGNLITSRN 151 (165) T ss_pred EEECCEEECCC T ss_conf 99699997958 No 119 >COG2378 Predicted transcriptional regulator [Transcription] Probab=41.01 E-value=25 Score=16.66 Aligned_cols=80 Identities=19% Similarity=0.253 Sum_probs=53.9 Q ss_pred HCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEEC--CCHHHHHHHHHCCCCCEEE- Q ss_conf 2062289999999987598899986756724746899999999998518971799845--7245788999708981155- Q 537021.9.peg.3 8 FQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHG--GGPQIGAVLEKMGIKSKFE- 84 (150) Q Consensus 8 ~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHG--gg~qI~~~l~~~gi~~~~v- 84 (150) .+|.-|-+..+|++.++. +-+=.+-...= -..+.+.+||+.|+..|+.+.--.| ||..+- +.|. T Consensus 5 ~ka~RL~~ii~~L~~~~~-vta~~lA~~~~---VS~RTi~RDi~~L~~~gvPI~~e~G~~~gy~~~---------~~~~L 71 (311) T COG2378 5 RKAERLLQIIQILRAKET-VTAAELADEFE---VSVRTIYRDIATLRAAGVPIEGERGKGGGYRLR---------PGFKL 71 (311) T ss_pred HHHHHHHHHHHHHHHCCC-CHHHHHHHHHC---CCHHHHHHHHHHHHHCCCCEEECCCCCCCEEEC---------CCCCC T ss_conf 489999999999985760-45999998729---889999999999997699765325886318960---------57778 Q ss_pred CCCCCCCHHHHHHHHH Q ss_conf 7834661248999999 Q 537021.9.peg.3 85 NGLRITDQQTAEVVEM 100 (150) Q Consensus 85 ~G~RVTd~~tl~iv~~ 100 (150) ..+-+|.+|...++-. T Consensus 72 ~pl~ft~~E~~Al~~~ 87 (311) T COG2378 72 PPLMFTEEEAEALLLA 87 (311) T ss_pred CCCCCCHHHHHHHHHH T ss_conf 8666898799999999 No 120 >PRK10017 putative pyruvyl transferase; Provisional Probab=40.89 E-value=30 Score=16.19 Aligned_cols=51 Identities=22% Similarity=0.308 Sum_probs=33.2 Q ss_pred HHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE--ECCCHHHH Q ss_conf 999875988999867567247468999999999985189717998--45724578 Q 537021.9.peg.3 19 FVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV--HGGGPQIG 71 (150) Q Consensus 19 Yi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV--HGgg~qI~ 71 (150) +++..+..-.||-.||+.+.|......|.- +++..+..|+|+. +|-||.-+ T Consensus 111 ~~~~L~~~D~vIs~GGs~~~D~yg~~~~~~--~L~a~l~kKpv~~~aQgIGP~~~ 163 (426) T PRK10017 111 FVRLLSGYDAIIQVGGSFFVDLYGVPQFEH--ALCTFMAKKPLFMIGHSVGPFQD 163 (426) T ss_pred HHHHHHHCCEEEEECCCCCCCCCCCCHHHH--HHHHHHCCCCEEEEECCCCCCCC T ss_conf 999986547899717762014768521689--99999739968999044688087 No 121 >pfam00850 Hist_deacetyl Histone deacetylase domain. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related to other proteins. Probab=40.37 E-value=31 Score=16.09 Aligned_cols=62 Identities=19% Similarity=0.239 Sum_probs=37.0 Q ss_pred HHHHH-HHHHHHHCCCEEEEEECHHHHCCH---------HHHHHHHHHHHHH-HHCCCCEEEEECCCHHHHHH Q ss_conf 28999-999998759889998675672474---------6899999999998-51897179984572457889 Q 537021.9.peg.3 12 ILEQV-LPFVQFYENETIVVKYGGHVMNCT---------DLSKDFVNDIALL-KKSNITPVIVHGGGPQIGAV 73 (150) Q Consensus 12 ~l~ea-~pYi~~~r~ktiVIK~GG~~l~d~---------~l~~~~~~dI~ll-~~~G~k~ViVHGgg~qI~~~ 73 (150) +|+++ .|.+++|+-+.+||-.|-++..++ +-+..+.+.+..+ ...+.++++|-|||-..+.. T Consensus 218 ~~~~~l~~~~~~f~Pd~iv~saG~D~~~~Dplg~~~ls~~~~~~~~~~v~~~a~~~~~pv~~vleGGY~~~~l 290 (302) T pfam00850 218 AFEEILLPLLREFQPDLILVSAGFDAHAGDPLGGLNLTTEGYAERTRLLLELADEYGGPVVSVLEGGYNLDAL 290 (302) T ss_pred HHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCEEECHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHH T ss_conf 9999999999720988899945634557887767206899999999999999987599989997787883689 No 122 >PRK10513 sugar phosphatase; Provisional Probab=39.96 E-value=32 Score=16.05 Aligned_cols=65 Identities=17% Similarity=0.290 Sum_probs=47.1 Q ss_pred CEEEEEECHHHHCCH-HHHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCE--E---ECCCCCC Q ss_conf 889998675672474-6899999999998518971799845724-57889997089811--5---5783466 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCT-DLSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSK--F---ENGLRIT 90 (150) Q Consensus 26 ktiVIK~GG~~l~d~-~l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~--~---v~G~RVT 90 (150) |.++.=+.|..+.+. .+.+...+.|..|+..|+++++.-|=.+ .+...+++++++.. | .||-.|. T Consensus 4 Kli~~DlDGTLl~~~~~i~~~~~~ai~~l~~~Gi~~~iaTGR~~~~~~~~~~~l~~~~~~~~~I~~NGa~i~ 75 (270) T PRK10513 4 KLIAIDMDGTLLLPDHTISPAVKNAIAAARAKGVNVVLTTGRPYAGVHRYLKELHMEQPGDYCITYNGALVQ 75 (270) T ss_pred CEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCCCEEEECCCEEEE T ss_conf 299993662525897968999999999999789999999799868789999983888899889985985999 No 123 >PRK09034 aspartate kinase; Reviewed Probab=39.92 E-value=32 Score=16.05 Aligned_cols=34 Identities=29% Similarity=0.419 Sum_probs=22.1 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 999867567247468999999999985189717998 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV 63 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV 63 (150) .|.||||+-+.+.+.++.+++ |..- ....++|+| T Consensus 2 ~V~KFGGtSv~~~~~i~~v~~-ii~~-~~~~~vVVV 35 (450) T PRK09034 2 KVVKFGGSSLASAEQFKKVLN-IVKS-DPERKIVVV 35 (450) T ss_pred EEEEECCCCCCCHHHHHHHHH-HHHC-CCCCEEEEE T ss_conf 899878313899999999999-9962-789839999 No 124 >cd03147 GATase1_Ydr533c_like Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein. This group includes proteins similar to S. cerevisiae Ydr533c. Ydr533c is upregulated in response to various stress conditions along with the heat shock family. The catalytic triad typical of GATase1domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. This Cys together with a different His and Glu residue form a different catalytic triad from the typical GATase1domain. Ydr533c protein is a homodimer. Probab=37.93 E-value=34 Score=15.85 Aligned_cols=102 Identities=19% Similarity=0.206 Sum_probs=49.8 Q ss_pred EEEECHHH-HC----CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHH Q ss_conf 99867567-24----74689999999999851897179984572457889997089811557834661248999999974 Q 537021.9.peg.3 29 VVKYGGHV-MN----CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLA 103 (150) Q Consensus 29 VIK~GG~~-l~----d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~ 103 (150) |.--||+. |- |++ ...++++ .+..| ++|-.-.=||..=..++...=....+.|+|+|.-..-+-...-+. T Consensus 98 Vf~pGGhG~m~Dl~~~~~-l~~ll~~---~~~~g-k~vaaVCHGpaaLl~~~~~~dG~~lv~G~~vTgFsn~EE~~~~~~ 172 (231) T cd03147 98 FFVAGGHGTLFDFPHATN-LQKIAQQ---IYANG-GVVAAVCHGPAILANLKDPKTGKPLIKGKTVTGFTDKGEEIMGVM 172 (231) T ss_pred EEECCCCCCCCCCCCCHH-HHHHHHH---HHHCC-CEEEEECCHHHHHHHCCCCCCCCEEECCCEEEECCCHHHHHHHHH T ss_conf 998898853032423999-9999999---99759-979998068899761636789963556867872185898875256 Q ss_pred HHHH----HHHHHHHHHCCCCEE-------EEEEEECCEEEEE Q ss_conf 1668----999999982899768-------7786558989999 Q 537021.9.peg.3 104 GSIN----KKIVSLINQTGTQAI-------GICGKDGNMVFAE 135 (150) Q Consensus 104 G~vn----~~lv~~L~~~g~~av-------glsg~dg~li~a~ 135 (150) ..+. ..+...|.+.|..=. +-..+|++||+.+ T Consensus 173 ~~~~~~~~~~le~~L~~~Ga~~~~~~~~~~~~vv~Dg~LITGQ 215 (231) T cd03147 173 EILKKRNLESIEDIAERAGANFIRPPGPWDDFTVVDGRIVTGS 215 (231) T ss_pred HHCCCCCCCCHHHHHHHCCCEEECCCCCCCCCEEEECCEEECC T ss_conf 5436656658899999759989736889988689709867688 No 125 >cd01427 HAD_like Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases. Probab=36.93 E-value=33 Score=15.97 Aligned_cols=43 Identities=19% Similarity=0.333 Sum_probs=31.7 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECC-CHHHHHHHHHCCCCCEE Q ss_conf 68999999999985189717998457-24578899970898115 Q 537021.9.peg.3 41 DLSKDFVNDIALLKKSNITPVIVHGG-GPQIGAVLEKMGIKSKF 83 (150) Q Consensus 41 ~l~~~~~~dI~ll~~~G~k~ViVHGg-g~qI~~~l~~~gi~~~~ 83 (150) .+.+...+-|..|+..|+++++|-|. ...+...++..++...| T Consensus 24 ~~~pg~~e~l~~L~~~G~~v~ivTn~~~~~~~~~~~~~~~~~~~ 67 (139) T cd01427 24 ELYPGVKEALKELKEKGIKLALATNKSRREVLELLEELGLDDYF 67 (139) T ss_pred EECCCHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHCCCHHHC T ss_conf 79858999999999879979999298899999999985997775 No 126 >cd05015 SIS_PGI_1 Phosphoglucose isomerase (PGI) contains two SIS (Sugar ISomerase) domains. This classification is based on the alignment of the first SIS domain. PGI is a multifunctional enzyme which as an intracellular dimer catalyzes the reversible isomerization of glucose 6-phosphate to fructose 6-phosphate. As an extracellular protein, PGI also has functions equivalent to neuroleukin (NLK), autocrine motility factor (AMF), and maturation factor (MF). Evidence suggests that PGI, NLK, AMF, and MF are closely related or identical. NLK is a neurotrophic growth factor that promotes regeneration and survival of neurons. The dimeric form of NLK has isomerase function, whereas its monomeric form carries out neurotrophic activity. AMF is a cytokine that stimulates cell migration and metastasis. MF mediates the differentiation of human myeloid leukemic HL-60 cells to terminal monocytic cells. Probab=36.10 E-value=36 Score=15.67 Aligned_cols=111 Identities=13% Similarity=0.168 Sum_probs=63.5 Q ss_pred HHHHHHHHHHHCC-----CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHH-HHHHHHHCCCCCEEE-- Q ss_conf 8999999998759-----88999867567247468999999999985189717998457245-788999708981155-- Q 537021.9.peg.3 13 LEQVLPFVQFYEN-----ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQ-IGAVLEKMGIKSKFE-- 84 (150) Q Consensus 13 l~ea~pYi~~~r~-----ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~q-I~~~l~~~gi~~~~v-- 84 (150) +.+...+.++.|. .+++|-+|||.+-...+. ..+ .-....+.++-.+-...|. +++.+++++.+.... T Consensus 4 ~~~i~~f~~~i~~g~~~~~iv~iGiGGS~LG~k~l~-~al---~~~~~~~~~i~f~dn~Dp~~~~~~l~~l~~~~Tlfiv 79 (158) T cd05015 4 LERIKEFAEKVRSGKKITDVVVIGIGGSDLGPRAVY-EAL---KPYFKGGLRLHFVSNVDPDDLAELLKKLDPETTLFIV 79 (158) T ss_pred HHHHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHH-HHH---HHHCCCCCEEEEECCCCHHHHHHHHHHCCCHHEEEEE T ss_conf 999999999997599888699981433389999999-997---6245489569995799989999999727930149999 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCE Q ss_conf 78346612489999999741668999999982899768778655898 Q 537021.9.peg.3 85 NGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNM 131 (150) Q Consensus 85 ~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~l 131 (150) -----|+.|||.....+. ..........+ +-+-+.++..+.++ T Consensus 80 iSKSg~T~ETl~~~~~~~-~~l~~~~~~~~---~~~~vaIT~~~s~l 122 (158) T cd05015 80 ISKSGTTLETLANARLAR-EWLEEAGGDDL---AKHFVAITDNGSGL 122 (158) T ss_pred ECCCCCCHHHHHHHHHHH-HHHHHHCCHHH---HHHEEEEECCCHHH T ss_conf 759998689999999999-99998320355---66078980596699 No 127 >PRK11440 hypothetical protein; Provisional Probab=35.62 E-value=17 Score=17.81 Aligned_cols=93 Identities=17% Similarity=0.249 Sum_probs=53.3 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEE-CCCHHHHHHHHHC--------CCCC---EEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 474689999999999851897179984-5724578899970--------8981---155783466124899999997416 Q 537021.9.peg.3 38 NCTDLSKDFVNDIALLKKSNITPVIVH-GGGPQIGAVLEKM--------GIKS---KFENGLRITDQQTAEVVEMVLAGS 105 (150) Q Consensus 38 ~d~~l~~~~~~dI~ll~~~G~k~ViVH-Ggg~qI~~~l~~~--------gi~~---~~v~G~RVTd~~tl~iv~~vL~G~ 105 (150) ..++...+..+=+...+..|+.+|.|+ +..|.-...+.+. .++. ++...+...+.+ .-+.|.-.++. T Consensus 29 ~~~~~i~~~~~L~~~aR~~g~pVi~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~d-~vv~K~~~saF 107 (188) T PRK11440 29 TADEVVARAARLAAKFRASGSPVVLVRVGWSADYAEALKQPVDAPSPAKALPENWWQHPAALGKTDSD-IEVTKRQWGAF 107 (188) T ss_pred CHHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-EEEECCCCCCC T ss_conf 79999999999999999879909999863178710100286668886556787311356032789999-89867845877 Q ss_pred HHHHHHHHHHHCCCCEEEEEEEECCE Q ss_conf 68999999982899768778655898 Q 537021.9.peg.3 106 INKKIVSLINQTGTQAIGICGKDGNM 131 (150) Q Consensus 106 vn~~lv~~L~~~g~~avglsg~dg~l 131 (150) .+++|...|.+.|+..+=++|...|. T Consensus 108 ~~T~L~~~Lr~~gi~~lvi~G~~T~~ 133 (188) T PRK11440 108 YGTDLELQLRRRGIDTIVLCGISTNI 133 (188) T ss_pred CCCCHHHHHHHCCCCEEEEEEECCCH T ss_conf 78988999987799989993221165 No 128 >TIGR03674 fen_arch flap structure-specific endonuclease. Endonuclease that cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Has 5'-endo-/exonuclease and 5'-pseudo-Y-endonuclease activities. Cleaves the junction between single and double-stranded regions of flap DNA Probab=35.55 E-value=37 Score=15.61 Aligned_cols=28 Identities=43% Similarity=0.627 Sum_probs=19.8 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCHH Q ss_conf 68999999999985189717998457245 Q 537021.9.peg.3 41 DLSKDFVNDIALLKKSNITPVIVHGGGPQ 69 (150) Q Consensus 41 ~l~~~~~~dI~ll~~~G~k~ViVHGgg~q 69 (150) .+..-|.+-+.++. .|++||.|+-|.|- T Consensus 57 hl~g~f~R~~~Ll~-~gI~PIfVFDGkpp 84 (338) T TIGR03674 57 HLSGLFYRTINLLE-NGIKPVYVFDGKPP 84 (338) T ss_pred HHHHHHHHHHHHHH-CCCEEEEEECCCCC T ss_conf 99999999999997-79989999799985 No 129 >KOG0339 consensus Probab=34.81 E-value=38 Score=15.54 Aligned_cols=73 Identities=18% Similarity=0.138 Sum_probs=45.2 Q ss_pred HHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHHHHH Q ss_conf 98759889998675672474689999999999851897179984572457889997089811557834661248999999 Q 537021.9.peg.3 21 QFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEVVEM 100 (150) Q Consensus 21 ~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~ 100 (150) +.=.|-+-||.++---+. .+-|.+.=++-+..|+++|-|||||+--.+. +.+...++++= -|+...+++|+| T Consensus 292 ~~g~gPi~vilvPTrela----~Qi~~eaKkf~K~ygl~~v~~ygGgsk~eQ~-k~Lk~g~EivV---aTPgRlid~Vkm 363 (731) T KOG0339 292 KPGEGPIGVILVPTRELA----SQIFSEAKKFGKAYGLRVVAVYGGGSKWEQS-KELKEGAEIVV---ATPGRLIDMVKM 363 (731) T ss_pred CCCCCCEEEEEECCHHHH----HHHHHHHHHHHHHCCCEEEEEECCCCHHHHH-HHHHCCCEEEE---ECHHHHHHHHHH T ss_conf 689997699980638999----9999999986311264278863687488877-76502772899---662888999886 Q ss_pred H Q ss_conf 9 Q 537021.9.peg.3 101 V 101 (150) Q Consensus 101 v 101 (150) - T Consensus 364 K 364 (731) T KOG0339 364 K 364 (731) T ss_pred H T ss_conf 0 No 130 >PRK00155 ispD 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase; Reviewed Probab=34.57 E-value=38 Score=15.51 Aligned_cols=14 Identities=29% Similarity=0.420 Sum_probs=6.3 Q ss_pred CEEEEE-CCCHHHHH Q ss_conf 179984-57245788 Q 537021.9.peg.3 59 TPVIVH-GGGPQIGA 72 (150) Q Consensus 59 k~ViVH-Ggg~qI~~ 72 (150) ..|+|| +++|.+++ T Consensus 98 ~~VlIHDaARP~is~ 112 (228) T PRK00155 98 DWVLVHDAARPFLTP 112 (228) T ss_pred CEEEEECCCCCCCCH T ss_conf 979997066768999 No 131 >TIGR01508 rib_reduct_arch diaminohydroxyphosphoribosylaminopyrimidine reductase; InterPro: IPR006401 These sequences represent a specific reductase of riboflavin biosynthesis in the archaea, diaminohydroxyphosphoribosylaminopyrimidine reductase. It should not be confused with bacterial 5-amino-6-(5-phosphoribosylamino)uracil reductase. The intermediate 2,5-diamino-6-hydroxy-4-(5-phosphoribosylamino)pyrimidine in riboflavin biosynthesis is reduced first and then deaminated in both archaea and fungi, opposite to the order in bacteria. The subsequent deaminase is not presently known and is not closely homologous to the deaminase domain fused to the reductase domain, which is similar to this protein but found in most bacteria. . Probab=34.35 E-value=39 Score=15.49 Aligned_cols=63 Identities=25% Similarity=0.411 Sum_probs=42.8 Q ss_pred HHHHHHCC-CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC-------CCCCEEECCC Q ss_conf 99998759-8899986756724746899999999998518971799845724578899970-------8981155783 Q 537021.9.peg.3 18 PFVQFYEN-ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM-------GIKSKFENGL 87 (150) Q Consensus 18 pYi~~~r~-ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~-------gi~~~~v~G~ 87 (150) .-++.+.. -..|.|+|-.- -+ +..+++ .|+.-|++=++|=|||.-|-.+.++. =+-|++..|+ T Consensus 116 kk~~~l~~~gv~V~~~G~~~---v~-L~~lld---~L~~kgv~rLmvEGGG~Li~~l~~~~LvDEi~vy~aP~~~GGr 186 (224) T TIGR01508 116 KKVEELEKKGVEVVKFGEER---VD-LEKLLD---ILEDKGVRRLMVEGGGTLIWSLIKENLVDEIRVYIAPKIIGGR 186 (224) T ss_pred HHHHHHHHCCCEEEEECCCC---CC-HHHHHH---HHHHCCCEEEEEECCCEEEHHHHHCCCEEEEEEEECCEEECCC T ss_conf 78887774974899924785---34-689999---9874797099982485101022302843589998814676277 No 132 >PRK05301 pyrroloquinoline quinone biosynthesis protein PqqE; Provisional Probab=33.99 E-value=39 Score=15.45 Aligned_cols=67 Identities=12% Similarity=0.071 Sum_probs=33.4 Q ss_pred HHHHHHHHHHCCCEEEEEECHH-H-H----CC-HHHHHHHHHHHHHHHHCCCCEEEEEC-------CCHHHHHHHHHCCC Q ss_conf 9999999987598899986756-7-2----47-46899999999998518971799845-------72457889997089 Q 537021.9.peg.3 14 EQVLPFVQFYENETIVVKYGGH-V-M----NC-TDLSKDFVNDIALLKKSNITPVIVHG-------GGPQIGAVLEKMGI 79 (150) Q Consensus 14 ~ea~pYi~~~r~ktiVIK~GG~-~-l----~d-~~l~~~~~~dI~ll~~~G~k~ViVHG-------gg~qI~~~l~~~gi 79 (150) .+.+..++.+.=..+-|-+.|. . . .. +..|+..++-|..+...|+++.+-.= -=+.|-++++++|+ T Consensus 105 ~~~a~~L~~~gl~~v~vSlDg~~~e~hD~~rG~~G~f~~~~~~i~~l~~~Gi~v~i~~ti~r~N~~~l~~i~~la~~lGv 184 (375) T PRK05301 105 EARLAALKAAGLDHIQLSFQDSDPELADRIAGTRGAFAQKLEVARLVKAHGYPLTLNAVIHRHNIDQIPRIIELAVELGA 184 (375) T ss_pred HHHHHHHHHHCCCEEEEECCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHCCC T ss_conf 99999998509988999567798778777637886299999999999974981699987230568889999999997299 Q ss_pred C Q ss_conf 8 Q 537021.9.peg.3 80 K 80 (150) Q Consensus 80 ~ 80 (150) + T Consensus 185 ~ 185 (375) T PRK05301 185 D 185 (375) T ss_pred C T ss_conf 8 No 133 >TIGR01245 trpD anthranilate phosphoribosyltransferase; InterPro: IPR005940 In many widely different species, including Escherichia coli, Thermotoga maritima, and Archaeoglobus fulgidus, this enzyme is a C-terminal part of a multifunctional protein together with glutamine amidotransferase; the fusion protein is designated anthranilate synthase component II (4.1.3.27 from EC).; GO: 0004048 anthranilate phosphoribosyltransferase activity, 0000162 tryptophan biosynthetic process. Probab=33.91 E-value=39 Score=15.45 Aligned_cols=63 Identities=27% Similarity=0.429 Sum_probs=38.9 Q ss_pred CHHHHHHHHHHHHHHHHCCCC-EEEEEC-CCHHHHHH----------------------HHHCCCCCEEECCCCCCCH-H Q ss_conf 746899999999998518971-799845-72457889----------------------9970898115578346612-4 Q 537021.9.peg.3 39 CTDLSKDFVNDIALLKKSNIT-PVIVHG-GGPQIGAV----------------------LEKMGIKSKFENGLRITDQ-Q 93 (150) Q Consensus 39 d~~l~~~~~~dI~ll~~~G~k-~ViVHG-gg~qI~~~----------------------l~~~gi~~~~v~G~RVTd~-~ 93 (150) |+++..-+++ .|+.+|.+ ..+||| +| +|+. =+..|+++...+-+|+-|+ | T Consensus 192 ~~~L~~~~A~---aL~~LG~krAlVVhG~~G--lDE~s~~g~t~v~el~~g~I~~~~~~PedFG~~~~~l~~l~g~~p~E 266 (331) T TIGR01245 192 DPDLVEVMAE---ALKNLGVKRALVVHGEDG--LDEISLTGPTKVAELKDGEIREYTLDPEDFGLKRAPLEELRGGSPEE 266 (331) T ss_pred CHHHHHHHHH---HHHHHCCCEEEEEECCCC--CCEEECCCCCEEEEEECCCEEEEEECCHHCCCCCCCCHHCCCCCHHH T ss_conf 6888999999---998617670899857889--61453167716998748931466641110479755301104689899 Q ss_pred HHHHHHHHHHHHH Q ss_conf 8999999974166 Q 537021.9.peg.3 94 TAEVVEMVLAGSI 106 (150) Q Consensus 94 tl~iv~~vL~G~v 106 (150) -.+++..+|.|+- T Consensus 267 na~~~~~~l~G~~ 279 (331) T TIGR01245 267 NAEILREILEGKG 279 (331) T ss_pred HHHHHHHHHCCCC T ss_conf 9999999971888 No 134 >PRK05991 precorrin-3B C17-methyltransferase; Provisional Probab=33.91 E-value=39 Score=15.45 Aligned_cols=71 Identities=13% Similarity=0.123 Sum_probs=39.8 Q ss_pred CCCHHHHHHHHHHHH---CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHH-------HHHHHC- Q ss_conf 062289999999987---598899986756724746899999999998518971799845724578-------899970- Q 537021.9.peg.3 9 QAEILEQVLPFVQFY---ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIG-------AVLEKM- 77 (150) Q Consensus 9 ~a~~l~ea~pYi~~~---r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~-------~~l~~~- 77 (150) .|.++....||+... .++.. ++.. +.++. .-++....+...|.++++||||-|.|= +.+++. T Consensus 28 ~Advvvgy~~yld~l~~~~~~~~---~~~~-~~~e~---~r~~~al~~A~~Gk~Va~l~sGDP~IyGmag~~~E~l~~~~ 100 (254) T PRK05991 28 AATDFFGYGPYLDRLQLRPDQLR---HASD-NREEL---DRAGAALAMAAAGANVCVVSGGDPGVFAMAAAVCEAIENGP 100 (254) T ss_pred CCCEEEECHHHHHHHHCCCCCEE---ECCC-CHHHH---HHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHCC T ss_conf 29999965266665513899579---7588-87588---99999999998799579986788046653499999998350 Q ss_pred ----CCCCEEECC Q ss_conf ----898115578 Q 537021.9.peg.3 78 ----GIKSKFENG 86 (150) Q Consensus 78 ----gi~~~~v~G 86 (150) +++.+.+-| T Consensus 101 ~~~~~i~~eVVPG 113 (254) T PRK05991 101 AAWRAVDLTVVPG 113 (254) T ss_pred CCCCCCCEEEECC T ss_conf 0027976899588 No 135 >PRK00339 minC septum formation inhibitor; Reviewed Probab=33.27 E-value=40 Score=15.38 Aligned_cols=61 Identities=8% Similarity=0.090 Sum_probs=45.9 Q ss_pred HHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCE Q ss_conf 998759889998675672474689999999999851897179984572457889997089811 Q 537021.9.peg.3 20 VQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSK 82 (150) Q Consensus 20 i~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~ 82 (150) =..|++.-+||-+..- .+.....+|..=+.+|+..|+.+|=|.|+.++..+.....|+..- T Consensus 47 P~FF~~~PvVldl~~l--~~~~~~~dl~~L~~~l~~~gl~~vGv~g~~~~~~~~a~~~gL~il 107 (249) T PRK00339 47 PNFFSNTPLVLALDKL--PEGEGELDLPGLMRICRRHGLRTLAIRASRIEDIAAAIAADLPVL 107 (249) T ss_pred HHHHCCCCEEEECHHC--CCCCCCCCHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHCCCCCC T ss_conf 5765698599983342--676554579999999998899899996699899999997799704 No 136 >PRK13639 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=33.19 E-value=40 Score=15.37 Aligned_cols=13 Identities=15% Similarity=0.434 Sum_probs=4.4 Q ss_pred HHHHHHHHHHCCC Q ss_conf 9999999851897 Q 537021.9.peg.3 46 FVNDIALLKKSNI 58 (150) Q Consensus 46 ~~~dI~ll~~~G~ 58 (150) +++|--.|+..|+ T Consensus 224 vf~~~~~l~~~~l 236 (275) T PRK13639 224 IFSDSETIRSANL 236 (275) T ss_pred HHCCHHHHHHCCC T ss_conf 8749999997799 No 137 >TIGR02090 LEU1_arch isopropylmalate/citramalate/homocitrate synthases; InterPro: IPR011830 Methanogenic archaea contain three closely related homologs of the 2-isopropylmalate synthases (LeuA) represented by IPR005671 from INTERPRO. Two of these in Methanococcus janaschii (MJ1392 - CimA ; MJ0503 - AksA ) have been characterised as catalyzing alternative reactions leaving the third (MJ1195) as the presumptive LeuA enzyme. CimA is citramalate (2-methylmalate) synthase, which condenses acetyl-CoA with pyruvate. This enzyme is believed to be involved in the biosynthesis of isoleucine in methanogens and possibly other species lacking threonine dehydratase. AksA is a homocitrate synthase which also produces (homo)2-citrate and (homo)3-citrate in the biosynthesis of Coenzyme B which is restricted solely to methanogenic archaea. Methanogens, then should and apparently do contain all three of these enzymes. Unfortunately, phylogenetic trees do not resolve into three unambiguous clades, making assignment of function to particular genes problematic. Other archaea, which lack a threonine dehydratase (mainly Euryarchaeota), should contain both CimA and LeuA genes. This is true for archaeoglobus fulgidis, but not for the Pyrococci which have none in this clade, but one in IPR005671 from INTERPRO and one in IPR005675 from INTERPRO which may fulfil these roles. Proteins from other species, which have only one hit to this entry and lack threonine dehydratase, are very likely to be LeuA enzymes.; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=32.60 E-value=17 Score=17.82 Aligned_cols=14 Identities=36% Similarity=0.639 Sum_probs=4.8 Q ss_pred CCHHHHHHHHHCCC Q ss_conf 72457889997089 Q 537021.9.peg.3 66 GGPQIGAVLEKMGI 79 (150) Q Consensus 66 gg~qI~~~l~~~gi 79 (150) |...+...|+.+|| T Consensus 321 G~~a~~~kL~e~GI 334 (371) T TIGR02090 321 GRHAVKAKLKELGI 334 (371) T ss_pred HHHHHHHHHHHCCC T ss_conf 79999999987392 No 138 >PRK06136 uroporphyrin-III C-methyltransferase; Reviewed Probab=32.34 E-value=42 Score=15.28 Aligned_cols=92 Identities=15% Similarity=0.117 Sum_probs=54.7 Q ss_pred CCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCE---------EECCCCCCCHH- Q ss_conf 59889998675672474689999999999851897179984572457889997089811---------55783466124- Q 537021.9.peg.3 24 ENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSK---------FENGLRITDQQ- 93 (150) Q Consensus 24 r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~---------~v~G~RVTd~~- 93 (150) +|+.+|.--+| ||-++....+.+..+...|+.+-+|.|-. .........|++.. |+.|...+++. T Consensus 79 ~G~~VvrL~~G----DP~ifg~~~eei~~l~~~gi~~evvPGVs-a~~Aaaa~~GiPLT~r~~s~sv~~it~~~~~~~~~ 153 (253) T PRK06136 79 KGKVVVRLKGG----DPFVFGRGGEELEALEAAGIPVEVVPGIT-AAIAAAAAAGIPLTHRGVARSVTFVTGHEAAGKLE 153 (253) T ss_pred CCCEEEEEECC----CCCCCCCHHHHHHHHHHCCCCEEEECCHH-HHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCCC T ss_conf 79979999589----98766405999999997899679956805-99999998398852489741699982677799875 Q ss_pred -HHHHHHHH---------HHHHHHHHHHHHHHHCCCC Q ss_conf -89999999---------7416689999999828997 Q 537021.9.peg.3 94 -TAEVVEMV---------LAGSINKKIVSLINQTGTQ 120 (150) Q Consensus 94 -tl~iv~~v---------L~G~vn~~lv~~L~~~g~~ 120 (150) ..+....+ ++..--..++..|..+|.. T Consensus 154 ~~~~~~~la~~~~TlViym~~~~l~~i~~~L~~~G~~ 190 (253) T PRK06136 154 PEVNWAALADGADTLVIYMGVRNLPYIAEQLIAAGRS 190 (253) T ss_pred CHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHCCCC T ss_conf 1455998736697599986541299999999975999 No 139 >PRK09084 aspartate kinase III; Validated Probab=32.27 E-value=42 Score=15.28 Aligned_cols=37 Identities=24% Similarity=0.341 Sum_probs=23.5 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 8999867567247468999999999985189717998457 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG 66 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg 66 (150) .+|.||||+-+.+.+.++++++ |.. .. ..++|+|--+ T Consensus 1 mvV~KFGGtSv~~~~~i~~v~~-ii~-~~-~~~~vVVVSA 37 (447) T PRK09084 1 LVVAKFGGTSVADFDAMNRSAD-IVL-SN-PNTRLVVLSA 37 (447) T ss_pred CEEEEECCCCCCCHHHHHHHHH-HHH-CC-CCCEEEEEEC T ss_conf 9899478556798999999999-997-27-9982999918 No 140 >KOG4701 consensus Probab=32.18 E-value=42 Score=15.27 Aligned_cols=51 Identities=20% Similarity=0.338 Sum_probs=38.2 Q ss_pred HHHHHHHC-CCEEEEEECHHHHCCHHH-----HHHHHHHHHHHHHCCCCEEEEECCCH Q ss_conf 99999875-988999867567247468-----99999999998518971799845724 Q 537021.9.peg.3 17 LPFVQFYE-NETIVVKYGGHVMNCTDL-----SKDFVNDIALLKKSNITPVIVHGGGP 68 (150) Q Consensus 17 ~pYi~~~r-~ktiVIK~GG~~l~d~~l-----~~~~~~dI~ll~~~G~k~ViVHGgg~ 68 (150) +.|+.+|. |.|-+..+-|.. .|... -..|.+||...+..|+|+.|-.||+- T Consensus 58 lsFL~~F~~~~Tp~LNfAn~C-sd~~~~~l~~CTqi~~di~~CQS~GiKVlLSLGG~~ 114 (568) T KOG4701 58 LSFLIDFNVDGTPVLNFANLC-SDSDTFSLKKCTQIETDIQVCQSNGIKVLLSLGGYN 114 (568) T ss_pred HHHHHHCCCCCCCCEEHHCCC-CCCCCCCCHHHHHHHHHHHHHHHCCEEEEEECCCCC T ss_conf 324664478897421000126-852234202222256478888736808998516755 No 141 >PRK10517 magnesium-transporting ATPase MgtA; Provisional Probab=31.12 E-value=44 Score=15.15 Aligned_cols=89 Identities=22% Similarity=0.360 Sum_probs=56.3 Q ss_pred HCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH-HHCCCCCEE-ECCC---CCCCHHHHHHHHHH--HH---HHH Q ss_conf 24746899999999998518971799845724578899-970898115-5783---46612489999999--74---166 Q 537021.9.peg.3 37 MNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVL-EKMGIKSKF-ENGL---RITDQQTAEVVEMV--LA---GSI 106 (150) Q Consensus 37 l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l-~~~gi~~~~-v~G~---RVTd~~tl~iv~~v--L~---G~v 106 (150) +.||- ++...+.|..++..|++++++-|-.+..-+.. ++.||+... +.|- ..+|++--+.++.. ++ =+= T Consensus 545 ~~DPp-R~e~~~aI~~l~~aGI~V~MITGD~~~TA~aIA~~lGI~~~~v~tG~el~~lsd~el~~~~~~~~VfAR~sPe~ 623 (900) T PRK10517 545 FLDPP-KETTAPALKALKASGITVKILTGDSELVAAKVCHEVGLDAGEVVIGSDIETLSDDELANLAERTTLFARLTPMH 623 (900) T ss_pred ECCCC-CCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCHHHHHCCHHHHHHHHHHCCEEEEECHHH T ss_conf 30789-71099999999977993899899998999999998199954440225344099999999986477897518999 Q ss_pred HHHHHHHHHHCCCCEEEEEEE Q ss_conf 899999998289976877865 Q 537021.9.peg.3 107 NKKIVSLINQTGTQAIGICGK 127 (150) Q Consensus 107 n~~lv~~L~~~g~~avglsg~ 127 (150) ...|+..|++.| +-++++|- T Consensus 624 K~rIV~aLq~~G-~vVAmtGD 643 (900) T PRK10517 624 KERIVTLLKREG-HVVGFMGD 643 (900) T ss_pred HHHHHHHHHHCC-CEEEEECC T ss_conf 999999999779-97999738 No 142 >COG0547 TrpD Anthranilate phosphoribosyltransferase [Amino acid transport and metabolism] Probab=30.88 E-value=44 Score=15.13 Aligned_cols=65 Identities=22% Similarity=0.276 Sum_probs=32.4 Q ss_pred CCHHHHHHHHHHHHHHHHCC-CCEEEEECC--CHHHHH-------------------HHHHCCCCCEEECCCCCC-CHHH Q ss_conf 47468999999999985189-717998457--245788-------------------999708981155783466-1248 Q 537021.9.peg.3 38 NCTDLSKDFVNDIALLKKSN-ITPVIVHGG--GPQIGA-------------------VLEKMGIKSKFENGLRIT-DQQT 94 (150) Q Consensus 38 ~d~~l~~~~~~dI~ll~~~G-~k~ViVHGg--g~qI~~-------------------~l~~~gi~~~~v~G~RVT-d~~t 94 (150) .+|++.+.+++ .|+.+| .+.++|||. -.+++- .-+..|+++...+.++.. .++. T Consensus 197 ~~p~~~~~~A~---~l~~LG~~ralvV~G~~GlDE~~~~~~t~v~~l~~g~i~~~~l~pe~~Gl~~~~~~~l~~~~~~en 273 (338) T COG0547 197 YHPELVELLAE---ALRLLGVERALVVHGLEGLDEVTPTGTTLVAELKDGEIREYTLTPEDFGLERAPLEDLPGGDPEEN 273 (338) T ss_pred ECHHHHHHHHH---HHHHHCCCEEEEEECCCCCCCCCCCCCCEEEEECCCCEEEEEECHHHCCCCCCCHHHCCCCCHHHH T ss_conf 57888999999---999828661899977899631347887448997589568999677762898785432578999999 Q ss_pred HHHHHHHHHHH Q ss_conf 99999997416 Q 537021.9.peg.3 95 AEVVEMVLAGS 105 (150) Q Consensus 95 l~iv~~vL~G~ 105 (150) .++.+.+|.|+ T Consensus 274 a~~~~~vL~G~ 284 (338) T COG0547 274 AEILRAVLAGE 284 (338) T ss_pred HHHHHHHHCCC T ss_conf 99999997799 No 143 >cd01987 USP_OKCHK USP domain is located between the N-terminal sensor domain and C-terminal catalytic domain of this Osmosensitive K+ channel histidine kinase family. The family of KdpD sensor kinase proteins regulates the kdpFABC operon responsible for potassium transport. The USP domain is homologous to the universal stress protein Usp Usp is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. Usp enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. Probab=30.85 E-value=44 Score=15.13 Aligned_cols=103 Identities=9% Similarity=0.059 Sum_probs=66.2 Q ss_pred HHHCCCHHHHHHHHHHHHCCCEEEEEECHH--HHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH--HHHHHHHHCCCCC Q ss_conf 322062289999999987598899986756--724746899999999998518971799845724--5788999708981 Q 537021.9.peg.3 6 YQFQAEILEQVLPFVQFYENETIVVKYGGH--VMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP--QIGAVLEKMGIKS 81 (150) Q Consensus 6 ~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~--~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~--qI~~~l~~~gi~~ 81 (150) .+.-...+|.+..--++++.+.+|+.+--. .-.++.....+-+...+..++|.+++.+.|+.+ +|-+..++.++.- T Consensus 9 s~~s~~lir~a~rlA~~~~a~l~vl~V~~~~~~~~~~~~~~~l~~~~~la~~lga~~~~~~~~d~~~~I~~~A~~~~~t~ 88 (124) T cd01987 9 GPNAERLIRRAARLADRLKAPWYVVYVETPRLNRLSEAERRRLAEALRLAEELGAEVVTLPGDDVAEAIVEFAREHNVTQ 88 (124) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHCCCCE T ss_conf 85079999999999996499899999955975658979999999999999985998999947998999999999849989 Q ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 155783466124899999997416689999999 Q 537021.9.peg.3 82 KFENGLRITDQQTAEVVEMVLAGSINKKIVSLI 114 (150) Q Consensus 82 ~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L 114 (150) .+=|.+--+. + +..+.|.+-..+.... T Consensus 89 -IVlG~~~~~~--~---~~~~~gSv~~~vl~~a 115 (124) T cd01987 89 -IVVGKSRRSR--W---RELFRGSLVDRLLRRA 115 (124) T ss_pred -EEECCCCCCH--H---HHHHCCCHHHHHHHHC T ss_conf -9976898854--6---8872388999999749 No 144 >PRK09436 thrA bifunctional aspartokinase I/homeserine dehydrogenase I; Provisional Probab=30.09 E-value=45 Score=15.04 Aligned_cols=47 Identities=15% Similarity=0.191 Sum_probs=27.3 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHH Q ss_conf 999867567247468999999999985189717998457245788999 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLE 75 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~ 75 (150) -|.||||+-+.+++.++.+++=|..-..-+ ++++|.-+---++..|- T Consensus 2 kV~KFGGTSva~ae~i~~Va~II~~~~~~~-~vvVVVSA~ggvTd~Ll 48 (817) T PRK09436 2 KVLKFGGTSLANAERFLRVADIIESNARQE-QVAVVLSAPAKVTNHLV 48 (817) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHCCCC-CEEEEECCCCCCHHHHH T ss_conf 788757156789999999999997534389-98999839998689999 No 145 >PRK10976 putative sugar phosphatase; Provisional Probab=29.88 E-value=46 Score=15.02 Aligned_cols=66 Identities=24% Similarity=0.381 Sum_probs=43.8 Q ss_pred CEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE--CCCCCCC Q ss_conf 8899986756724746-899999999998518971799845724-5788999708981155--7834661 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE--NGLRITD 91 (150) Q Consensus 26 ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v--~G~RVTd 91 (150) |.|+.=+.|..+.+.. +.+...+.|..++..|+++++.-|=.+ .+....+.+++...++ ||-.|-| T Consensus 3 KlIa~DlDGTLl~~~~~is~~~~~ai~~l~~~Gi~~viATGR~~~~~~~~~~~l~~~~~~I~~NGa~i~d 72 (266) T PRK10976 3 QVVASDLDGTLLSPDHTLSPYAKETLKLLTARGIHFVFATGRHHVDVGQIRDNLEIKSYMITSNGARVHD 72 (266) T ss_pred EEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCCEEEECCCEEEEE T ss_conf 9999937646558879719999999999997899999997999799999999739974189628779991 No 146 >pfam11869 DUF3389 Protein of unknown function (DUF3389). This family of proteins are functionally uncharacterized. This protein is found in bacteria. Proteins in this family are about 80 amino acids in length. Probab=29.88 E-value=46 Score=15.02 Aligned_cols=35 Identities=34% Similarity=0.562 Sum_probs=25.3 Q ss_pred EEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH Q ss_conf 99986756724746899999999998518971799845724 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP 68 (150) Q Consensus 28 iVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~ 68 (150) +||++.++.+. +.. -.+||.++. |-++++-+||+. T Consensus 17 v~vRL~~~~~~---l~a-~~D~I~l~~--~anvi~A~g~~v 51 (75) T pfam11869 17 VVVRLNGANVV---LLA-AADDISLLG--GANVIIANGGGV 51 (75) T ss_pred EEEEECCCCEE---HHH-HHHHHHHCC--CCCEEEECCCCE T ss_conf 99998696221---898-775665217--873999717621 No 147 >KOG0456 consensus Probab=29.74 E-value=46 Score=15.01 Aligned_cols=47 Identities=19% Similarity=0.265 Sum_probs=30.2 Q ss_pred CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH Q ss_conf 9889998675672474689999999999851897179984572457889 Q 537021.9.peg.3 25 NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV 73 (150) Q Consensus 25 ~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~ 73 (150) +-+.|-|+||+-..+-+. .+.-|+..+..-..+|++|.-+..--++. T Consensus 78 ~~~~V~KFGGsSV~s~~~--~i~v~~l~~~~~~e~~~vV~SA~sk~Tna 124 (559) T KOG0456 78 GLTCVMKFGGSSVGSAER--MIEVAVLILYFPDERPVVVLSAMSKTTNA 124 (559) T ss_pred CEEEEEECCCCCCCCCCH--HHHHHHHHHHCCCCCEEEEEECCCCCHHH T ss_conf 418999428841133400--13356788746877738999713441254 No 148 >TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=29.61 E-value=46 Score=14.99 Aligned_cols=63 Identities=17% Similarity=0.188 Sum_probs=40.6 Q ss_pred HHHHHCCCEEEEEECHHHHC---CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECC Q ss_conf 99987598899986756724---746899999999998518971799845724578899970898115578 Q 537021.9.peg.3 19 FVQFYENETIVVKYGGHVMN---CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENG 86 (150) Q Consensus 19 Yi~~~r~ktiVIK~GG~~l~---d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G 86 (150) |.+.+...-++++..|+++. |.+ .+.+...+..|...|.+++=-+|-..+ |.+.||+++.+.- T Consensus 948 ~~Ka~~~~~~~~p~kg~~~~~~~d~d-k~~~~~~a~~L~~~G~~~yat~gT~~~----L~e~gi~~~~v~~ 1013 (1089) T TIGR01369 948 FLKAMLSSGNRIPKKGSVLLSVRDKD-KEELLDLAKKLAEKGYKIYATEGTAKF----LEEAGIKAEVVLK 1013 (1089) T ss_pred HHHHHHHCCCCCCCCCEEEEEECCCC-HHHHHHHHHHHHHCCCEEEEECCHHHH----HHHCCCCHHHEEE T ss_conf 99999981870578877999803443-457999999998637378850202676----7651876332000 No 149 >pfam01872 RibD_C RibD C-terminal domain. The function of this domain is not known, but it is thought to be involved in riboflavin biosynthesis. This domain is found in the C terminus of RibD/RibG, in combination with pfam00383, as well as in isolation in some archaebacterial proteins. This family appears to be related to pfam00186. Probab=29.26 E-value=47 Score=14.95 Aligned_cols=31 Identities=26% Similarity=0.460 Sum_probs=18.5 Q ss_pred HHHHHHHHHHCCCCEEEEECCCHHHHHHHHH Q ss_conf 9999999851897179984572457889997 Q 537021.9.peg.3 46 FVNDIALLKKSNITPVIVHGGGPQIGAVLEK 76 (150) Q Consensus 46 ~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~ 76 (150) +..-+..|...|.+-|+|.||+.-+.+.+++ T Consensus 119 ~~~~l~~L~~~g~~~ilveGG~~l~~~fl~~ 149 (196) T pfam01872 119 LKELLRELKERGIRSLLVEGGATLAGSLLRA 149 (196) T ss_pred HHHHHHHHHHCCCCEEEEECHHHHHHHHHHC T ss_conf 9999999986799879998579999999768 No 150 >TIGR00329 gcp metalloendopeptidase, putative, glycoprotease family; InterPro: IPR000905 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M22 (clan MK). The type example being O-sialoglycoprotein endopeptidase from Pasteurella haemolytica (Mannheimia haemolytica). O-Sialoglycoprotein endopeptidase is secreted by the bacterium P. haemolytica, and digests only proteins that are heavily sialylated, in particular those with sialylated serine and threonine residues . Substrate proteins include glycophorin A and leukocyte surface antigens CD34, CD43, CD44 and CD45 , . Removal of glycosylation, by treatment with neuraminidase, completely negates susceptibility to O-sialoglycoprotein endopeptidase digestion , . Sequence similarity searches have revealed other members of the M22 family, from yeast, Mycobacterium, Haemophilus influenzae and the cyanobacterium Synechocystis . The zinc-binding and catalytic residues of this family have not been determined, although the motif HMEGH may be a zinc-binding region .; GO: 0008450 O-sialoglycoprotein endopeptidase activity, 0006508 proteolysis. Probab=28.82 E-value=48 Score=14.91 Aligned_cols=62 Identities=21% Similarity=0.305 Sum_probs=37.5 Q ss_pred HHHHCCHHHHHHHHHHHHHHHHCCCC-EEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHH----HHHHH Q ss_conf 56724746899999999998518971-79984572457889997089811557834661248----99999 Q 537021.9.peg.3 34 GHVMNCTDLSKDFVNDIALLKKSNIT-PVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQT----AEVVE 99 (150) Q Consensus 34 G~~l~d~~l~~~~~~dI~ll~~~G~k-~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~t----l~iv~ 99 (150) |+.++|. .-+.++.++ +.+|+. ..+-|=|||.|++.|.+.|-+-+|.==+-.|+... ++.-- T Consensus 159 geT~DdA--~Ge~fDK~a--r~~gl~Gt~iayPGGp~i~~~l~~~g~~~~f~~p~P~~~~~~~~~~lDfSF 225 (337) T TIGR00329 159 GETLDDA--VGEAFDKVA--RLLGLGGTMIAYPGGPAIEEELAKKGDEKTFYFPLPLTKKPMKRNMLDFSF 225 (337) T ss_pred EEEHHHH--HHHHHHHHH--HHHCCCCCEEECCCCCHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCHHHH T ss_conf 5202245--424467777--761889626506897426899854788775542666213688763403337 No 151 >PRK06859 consensus Probab=28.23 E-value=49 Score=14.84 Aligned_cols=47 Identities=19% Similarity=0.321 Sum_probs=18.9 Q ss_pred HHHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 78899970898115578346612489999999741668999999982899 Q 537021.9.peg.3 70 IGAVLEKMGIKSKFENGLRITDQQTAEVVEMVLAGSINKKIVSLINQTGT 119 (150) Q Consensus 70 I~~~l~~~gi~~~~v~G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~ 119 (150) ++..|+++|.++.-.+--+|+| ||.+++.-.-|+-| .|-+...+.++ T Consensus 106 Ln~El~r~g~~~~~~~~~~viD--TL~lAR~~~Pg~~~-sLdaLc~r~~i 152 (253) T PRK06859 106 MDYEFRKLNLNVKTDDICLVTD--TLQMARQMYPGKRN-NLDALCDRLGI 152 (253) T ss_pred HHHHHHHCCCCCCCCCCCCEEE--HHHHHHHHCCCCCC-CHHHHHHHCCC T ss_conf 9999997499965322001133--99999987879977-99999997399 No 152 >KOG3517 consensus Probab=27.83 E-value=50 Score=14.80 Aligned_cols=71 Identities=24% Similarity=0.327 Sum_probs=43.3 Q ss_pred EECHHHHCCHHHHHHHHHHHHHHHHCCCCEE-------EEECCCHHHHHHHHHCC-CCCEEECCC--CCCCHHHHHHHHH Q ss_conf 8675672474689999999999851897179-------98457245788999708-981155783--4661248999999 Q 537021.9.peg.3 31 KYGGHVMNCTDLSKDFVNDIALLKKSNITPV-------IVHGGGPQIGAVLEKMG-IKSKFENGL--RITDQQTAEVVEM 100 (150) Q Consensus 31 K~GG~~l~d~~l~~~~~~dI~ll~~~G~k~V-------iVHGgg~qI~~~l~~~g-i~~~~v~G~--RVTd~~tl~iv~~ 100 (150) -+||..+....|-..+--.|..|..+|+++- +-||.=..|-++..+.| |-|--+.|- |||++...+.++. T Consensus 10 QLGGVFVNGRPLPna~RlrIVELarlGiRPCDISRQLrvSHGCVSKILaRy~EtGsIlPGaIGGSkPRVTTP~VV~~IR~ 89 (334) T KOG3517 10 QLGGVFVNGRPLPNAIRLRIVELARLGIRPCDISRQLRVSHGCVSKILARYNETGSILPGAIGGSKPRVTTPKVVKYIRS 89 (334) T ss_pred HCCCEEECCCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 12515676854861255668889871887553344542310259999987625775456644787886677359999987 Q ss_pred H Q ss_conf 9 Q 537021.9.peg.3 101 V 101 (150) Q Consensus 101 v 101 (150) - T Consensus 90 ~ 90 (334) T KOG3517 90 L 90 (334) T ss_pred H T ss_conf 6 No 153 >PRK06278 cobyrinic acid a,c-diamide synthase; Validated Probab=27.83 E-value=34 Score=15.86 Aligned_cols=17 Identities=35% Similarity=0.384 Sum_probs=14.0 Q ss_pred HHHHCCCEEEEEECHHH Q ss_conf 99875988999867567 Q 537021.9.peg.3 20 VQFYENETIVVKYGGHV 36 (150) Q Consensus 20 i~~~r~ktiVIK~GG~~ 36 (150) .-+.|+++||.|+|+++ T Consensus 266 ~~kL~~kvfv~KiGpDv 282 (482) T PRK06278 266 VSKLRGKTFVSKIGPDV 282 (482) T ss_pred HHHHCCCEEEEECCCCH T ss_conf 88750648998608866 No 154 >pfam07521 RMMBL RNA-metabolising metallo-beta-lactamase. The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in pfam00753 and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in RNA metabolism. Probab=27.62 E-value=50 Score=14.77 Aligned_cols=33 Identities=18% Similarity=0.369 Sum_probs=20.5 Q ss_pred EEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 8999867567247468999999999985189717998457 Q 537021.9.peg.3 27 TIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG 66 (150) Q Consensus 27 tiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg 66 (150) +.-+.+.|.+ |.+-+..|++.+ .. .++++|||- T Consensus 9 v~~~~fSgHA--d~~eL~~~i~~~---~p--~~v~~vHGe 41 (43) T pfam07521 9 VETLHFSGHA--DQEELLELLKLL---KP--KKVILVHGE 41 (43) T ss_pred EEEEEEECCC--CHHHHHHHHHHC---CC--CEEEEECCC T ss_conf 9999887368--989999999966---99--989994399 No 155 >cd02871 GH18_chitinase_D-like GH18 domain of Chitinase D (ChiD). ChiD, a chitinase found in Bacillus circulans, hydrolyzes the 1,4-beta-linkages of N-acetylglucosamine in chitin and chitodextrins. The domain architecture of ChiD includes a catalytic glycosyl hydrolase family 18 (GH18) domain, a chitin-binding domain, and a fibronectin type III domain. The chitin-binding and fibronectin type III domains are located either N-terminal or C-terminal to the catalytic domain. This family includes exochitinase Chi36 from Bacillus cereus. Probab=27.43 E-value=51 Score=14.75 Aligned_cols=24 Identities=29% Similarity=0.452 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 999999999851897179984572 Q 537021.9.peg.3 44 KDFVNDIALLKKSNITPVIVHGGG 67 (150) Q Consensus 44 ~~~~~dI~ll~~~G~k~ViVHGgg 67 (150) ..|.+||..+++.|.+++|=-||+ T Consensus 60 ~~f~~dI~~~qs~G~kVllSiGGa 83 (312) T cd02871 60 AEFKADIKALQAKGKKVLISIGGA 83 (312) T ss_pred HHHHHHHHHHHHCCCEEEEEECCC T ss_conf 999999999997899899980787 No 156 >PRK12852 groEL chaperonin GroEL; Reviewed Probab=27.09 E-value=51 Score=14.71 Aligned_cols=109 Identities=16% Similarity=0.188 Sum_probs=53.2 Q ss_pred HCCCEEEEEECHHHHC-CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCC----CCCCHHHHHH Q ss_conf 7598899986756724-7468999999999985189717998457245788999708981155783----4661248999 Q 537021.9.peg.3 23 YENETIVVKYGGHVMN-CTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGL----RITDQQTAEV 97 (150) Q Consensus 23 ~r~ktiVIK~GG~~l~-d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~----RVTd~~tl~i 97 (150) ..+.+.+|.+||+.-. =.+..+++-.-+...+..=..- +|.|||...-+....+.--.....+. +.- .++|++ T Consensus 372 ~~g~~~tI~vgG~t~~~~~E~er~l~DAl~avr~a~~~g-vvpGGGa~e~~~~~~l~~~~~~~~~~q~~i~a~-a~ALe~ 449 (542) T PRK12852 372 LAGGVAVIRVGGATEVEVKEKKDRVEDALNATRAAVQEG-IVPGGGVALLRAKKAVGRINNDNADVQAGINIV-LKALEA 449 (542) T ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCC-EEECCHHHHHHHHHHHHHHHCCCHHHHHHHHHH-HHHHHH T ss_conf 378627999667744589999999999999999872279-561557999999999998735586289999999-999999 Q ss_pred HHHHHH---HHHHHHHHHHHHHCCCCEEEEEEEECCEEE Q ss_conf 999974---166899999998289976877865589899 Q 537021.9.peg.3 98 VEMVLA---GSINKKIVSLINQTGTQAIGICGKDGNMVF 133 (150) Q Consensus 98 v~~vL~---G~vn~~lv~~L~~~g~~avglsg~dg~li~ 133 (150) +-..|+ |.=-.+++..|.+...+..|+....|.+.- T Consensus 450 ip~tLaeNAG~d~~~vv~~l~~~~~~~~GiD~~~G~~~D 488 (542) T PRK12852 450 PIRQIAENAGVEGSIVVGKILENKSETFGFDAQTEEYVD 488 (542) T ss_pred HHHHHHHHCCCCHHHHHHHHHHCCCCCEEEECCCCCEEE T ss_conf 999999877998899999998448998848778992703 No 157 >KOG0202 consensus Probab=27.05 E-value=51 Score=14.71 Aligned_cols=112 Identities=18% Similarity=0.257 Sum_probs=70.8 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHH-HHHCCCCCEE--ECC Q ss_conf 6228999999998759889998675672474689999999999851897179984572457889-9970898115--578 Q 537021.9.peg.3 10 AEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAV-LEKMGIKSKF--ENG 86 (150) Q Consensus 10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~-l~~~gi~~~~--v~G 86 (150) -+-+.+.++|.+.-.+=|||=-+| |.||. +......|..++..|++++++-|-.+.--.. ..+.|+.+.- +.+ T Consensus 557 ~~~l~~~s~~~~~E~~LtFvGlVG---i~DPP-R~ev~~ai~~c~~aGIrV~mITGD~~~TA~AI~r~iGi~~~~ed~~~ 632 (972) T KOG0202 557 DQDLNDTSNRATAESDLTFVGLVG---ILDPP-RPEVADAIELCRQAGIRVIMITGDNKETAEAIAREIGIFSEDEDVSS 632 (972) T ss_pred HHHHCCCCCCCCCCCCEEEEEEEE---CCCCC-CHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCCCC T ss_conf 343125432111125438998742---46899-65689999999973977999838987899999998398767742112 Q ss_pred CCCCCHHHHHH--------HHHHH-----HHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 34661248999--------99997-----416689999999828997687786 Q 537021.9.peg.3 87 LRITDQQTAEV--------VEMVL-----AGSINKKIVSLINQTGTQAIGICG 126 (150) Q Consensus 87 ~RVTd~~tl~i--------v~~vL-----~G~vn~~lv~~L~~~g~~avglsg 126 (150) .-.|-.|-.+. +..++ .-+--.+|+.+|++.|- -+.++| T Consensus 633 ~~~TG~efD~ls~~~~~~~~~~~~vFaR~~P~HK~kIVeaLq~~ge-ivAMTG 684 (972) T KOG0202 633 MALTGSEFDDLSDEELDDAVRRVLVFARAEPQHKLKIVEALQSRGE-VVAMTG 684 (972) T ss_pred CCCCHHHHCCCCHHHHHHHHHCCEEEEECCCHHHHHHHHHHHHCCC-EEEECC T ss_conf 3223444404998999877642359996383658999999986498-898327 No 158 >PRK09451 glmU bifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase; Provisional Probab=26.84 E-value=52 Score=14.68 Aligned_cols=38 Identities=18% Similarity=0.359 Sum_probs=29.3 Q ss_pred CCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHH Q ss_conf 0622899999999875988999867567247468999999 Q 537021.9.peg.3 9 QAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVN 48 (150) Q Consensus 9 ~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~ 48 (150) .|..++.|+||++. ++.|+|-+|..-+.+++.+.+|++ T Consensus 82 TghAV~~A~~~l~~--~~~vLVl~GD~PLi~~~tl~~l~~ 119 (456) T PRK09451 82 TGHAMQQAAPFFAD--DEDILMLYGDVPLISVETLQRLRD 119 (456) T ss_pred CHHHHHHHHHHCCC--CCCEEEEECCCCCCCHHHHHHHHH T ss_conf 29999998886066--885899959801569999999986 No 159 >PRK11391 etp phosphotyrosine-protein phosphatase; Provisional Probab=26.72 E-value=52 Score=14.67 Aligned_cols=69 Identities=13% Similarity=0.216 Sum_probs=46.4 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHH---HHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHH Q ss_conf 88999867567247468999999999---98518971799845724578899970898115578346612489 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIA---LLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTA 95 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~---ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl 95 (150) +.|..-+-|++..+|-. +.++++.. ...+.|.....=++.-|..-+.+++.|+...-+...++|.+..- T Consensus 3 ~~ILfVC~GNiCRSPmA-Eai~r~~~~~i~v~SaG~~~~~g~~~d~~a~~~l~~~gid~~~h~ar~i~~~~~~ 74 (144) T PRK11391 3 NSILVVCTGNICRSPIG-ERLLRKRLPGVKVKSAGVHGLVKHPADATAADVAANHGVSLEGHAGRKLTAEMAR 74 (144) T ss_pred CEEEEEECCHHHHHHHH-HHHHHHHCCCCCEEEEEEECCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHH T ss_conf 87999959725288999-9999950778815766653578989799999999981999300002247887986 No 160 >TIGR01737 FGAM_synth_I phosphoribosylformylglycinamidine synthase I; InterPro: IPR010075 In some species, phosphoribosylformylglycinamidine synthase is composed of a single polypeptide chain. This entry describes the PurQ protein of Bacillus subtilis (where PurL, PurQ, and PurS are required for phosphoribosylformylglycinamidine synthase activity) and functionally equivalent proteins from other bacteria and archaea.; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=26.64 E-value=52 Score=14.66 Aligned_cols=84 Identities=17% Similarity=0.331 Sum_probs=49.8 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHH-HCCCCEEEEECCCHHHHHHHHHCC-CCCE-----E--E---CCCCCCCHH Q ss_conf 88999867567247468999999999985-189717998457245788999708-9811-----5--5---783466124 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLK-KSNITPVIVHGGGPQIGAVLEKMG-IKSK-----F--E---NGLRITDQQ 93 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~-~~G~k~ViVHGgg~qI~~~l~~~g-i~~~-----~--v---~G~RVTd~~ 93 (150) |.-||+++|+-.+.+.. ..+. .+|.++++|| |.+|-+... ++-+ | | .| ++=-+ T Consensus 2 kvAV~~FPGtNCd~D~~--------~A~~k~~Gv~~~~Vh-----ikqw~~D~~r~~~~rsvfD~D~VvlPGG--FSyGD 66 (264) T TIGR01737 2 KVAVIRFPGTNCDRDTV--------YALEKLLGVDAEIVH-----IKQWYEDKSRLSDQRSVFDYDAVVLPGG--FSYGD 66 (264) T ss_pred EEEEEECCCCCCHHHHH--------HHHHHHCCCCEEEEE-----EEECHHHHCCCCCCCEEEEEEEEEECCC--CCHHH T ss_conf 68999779998408999--------999971499837998-----6521000047876641787334886687--77235 Q ss_pred HHHHHHHHHHHHHHHHHH---HHHHHCCCCEEEEE Q ss_conf 899999997416689999---99982899768778 Q 537021.9.peg.3 94 TAEVVEMVLAGSINKKIV---SLINQTGTQAIGIC 125 (150) Q Consensus 94 tl~iv~~vL~G~vn~~lv---~~L~~~g~~avgls 125 (150) =|..-.++-. +|...+. ..+.+.|.+-.|+| T Consensus 67 YLRaGAIAa~-rvk~p~~~~V~~~A~~G~pVLGiC 100 (264) T TIGR01737 67 YLRAGAIAAL-RVKSPIMQEVREFAEKGKPVLGIC 100 (264) T ss_pred HHHHHHHHHH-HCCHHHHHHHHHHHHCCCEEEEEE T ss_conf 7779999987-503088999999995598289976 No 161 >TIGR00099 Cof-subfamily Cof-like hydrolase; InterPro: IPR000150 The Haloacid Dehydrogenase (HAD) superfamily includes phosphatases, phosphonatases, P-type ATPases, beta-phosphoglucomutases, phosphomannomutases, and dehalogenases, which are involved in a variety of cellular processes ranging from amino acid biosynthesis to detoxification . Proteins in this entry are mostly uncharacterised, though they form a distinct subgroup within the HAD superfamily. Members are found almost exclusively in bacteria and many species contain several paralogs, for example Escherichia coli contains a total of six proteins from this entry. Sequence similarities suggest that these enzymes are phosphatases which work on phosphorylated sugars. The YbiV (P75792 from SWISSPROT) protein from E. coli has been experimentally characterised . This enzyme catalyzes the hydrolysis of sugar phosphate to sugar and inorganic phosphate. It has a wide substrate specificity, catalyzing the hydrolysis of ribose-5-phosphate and glucose-6-phosphate most efficiently, but it is not known if these are the real substrates in vivo. The protein appears to be a monomer that contains two domains, an alpha-beta hydrolase domain that forms a Rossman fold, and an alpha-beta domain. The active site is found in a negatively charged cavity found at the interface between the two domains. ; GO: 0016787 hydrolase activity, 0008152 metabolic process. Probab=26.39 E-value=53 Score=14.63 Aligned_cols=112 Identities=13% Similarity=0.303 Sum_probs=76.0 Q ss_pred EEEEECHHHHC-CH-HHHHHHHHHHHHHHHCCCCEEEEECCC-----HHHHHHHHHCCCCC-EEE---CCCCCCCHHH-H Q ss_conf 99986756724-74-689999999999851897179984572-----45788999708981-155---7834661248-9 Q 537021.9.peg.3 28 IVVKYGGHVMN-CT-DLSKDFVNDIALLKKSNITPVIVHGGG-----PQIGAVLEKMGIKS-KFE---NGLRITDQQT-A 95 (150) Q Consensus 28 iVIK~GG~~l~-d~-~l~~~~~~dI~ll~~~G~k~ViVHGgg-----~qI~~~l~~~gi~~-~~v---~G~RVTd~~t-l 95 (150) +++=++|..+. +. ...+...+-|..++.-|+++||.=|=. .++.....+.+++. .++ +|..+.+... = T Consensus 2 i~~D~DGTLL~~~~~~i~~~~~~al~~l~~~G~~~~laTGR~~~g~~~~~~~~~~~~~~~~~~~i~~nGg~~~~~~~~~~ 81 (270) T TIGR00099 2 IFIDLDGTLLNKDDKEISPSTKEALAKLREKGIKVVLATGRPLAGVYKELKEILKELGLDTDPYISANGGAVIDEGGGNE 81 (270) T ss_pred EEEECCCEEECCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHCCCCCEEEEECCCEEEEECCCCE T ss_conf 88963850355998751888999999999669889998188643478999998885067855687617848998468700 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCC-CEEEEEEEECCEEEEEECCC Q ss_conf 999999741668999999982899-76877865589899997475 Q 537021.9.peg.3 96 EVVEMVLAGSINKKIVSLINQTGT-QAIGICGKDGNMVFAEKARH 139 (150) Q Consensus 96 ~iv~~vL~G~vn~~lv~~L~~~g~-~avglsg~dg~li~a~~~~~ 139 (150) .+.+..|...+-..|...+++++. -.+-+.+....+...++.++ T Consensus 82 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 126 (270) T TIGR00099 82 ILYKKPLDLDLVEELLNFLKKHGQELDVILYSDSDSIYALKNDPE 126 (270) T ss_pred EEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCEEEEECCCCC T ss_conf 563024788899999999973489257999817762565237875 No 162 >TIGR03590 PseG pseudaminic acid biosynthesis-associated protein PseG. This protein is found in association with enzymes involved in the biosynthesis of pseudaminic acid, a component of polysaccharide in certain Pseudomonas strains as well as a modification of flagellin in Campylobacter and Hellicobacter. The role of this protein is unclear, although it may participate in N-acetylation in conjunction with, or in the absence of PseH (TIGR03585) as it often scores above the trusted cutoff to pfam00583 representing a family of acetyltransferases. Probab=26.34 E-value=53 Score=14.63 Aligned_cols=99 Identities=18% Similarity=0.225 Sum_probs=50.3 Q ss_pred CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHC-CCCCEEECCCCCCC-HHHHHHHHHH- Q ss_conf 98899986756724746899999999998518971799845724578899970-89811557834661-2489999999- Q 537021.9.peg.3 25 NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKM-GIKSKFENGLRITD-QQTAEVVEMV- 101 (150) Q Consensus 25 ~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~-gi~~~~v~G~RVTd-~~tl~iv~~v- 101 (150) -+.++|-+||+ +..++...+++-+... ....++.+|.|.+..-.+.+.+. .-.+.+. -..+ ++..+....+ T Consensus 171 ~~~Ili~~GGs--D~~~lt~~il~~l~~~-~~~~~i~vvig~~~~~~~~i~~~~~~~~~~~---~~~~~~~m~~~m~~aD 244 (280) T TIGR03590 171 LRRVLVSFGGA--DPDNLTLKLLSALAES-QINISITLVTGSSNPNLDELKKFAKEYPNII---LFIDVENMAELMNEAD 244 (280) T ss_pred CCEEEEEECCC--CHHHHHHHHHHHHHHH-CCCCCEEEEECCCCCCHHHHHHHHHHCCCEE---EECCHHHHHHHHHHCC T ss_conf 53289997787--7000899999999851-6685679998679876699999997289969---9659889999999779 Q ss_pred H-HHHHHHHHHHHHHHCCCCEEEEEEEECC Q ss_conf 7-4166899999998289976877865589 Q 537021.9.peg.3 102 L-AGSINKKIVSLINQTGTQAIGICGKDGN 130 (150) Q Consensus 102 L-~G~vn~~lv~~L~~~g~~avglsg~dg~ 130 (150) + .+.-.. -.-.+...|++++.++-.|.. T Consensus 245 laI~agG~-t~~El~~~gvP~i~i~~adNQ 273 (280) T TIGR03590 245 LAIGAAGS-TSWERCCLGLPSLSIVLAENQ 273 (280) T ss_pred EEEECCCH-HHHHHHHHCCCEEEEEECCCH T ss_conf 99985965-899999949998999970256 No 163 >COG3956 Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain [General function prediction only] Probab=25.65 E-value=54 Score=14.55 Aligned_cols=72 Identities=21% Similarity=0.253 Sum_probs=52.2 Q ss_pred HHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHHHHH Q ss_conf 998759889998675672474689999999999851897179984572457889997089811557834661248999 Q 537021.9.peg.3 20 VQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQTAEV 97 (150) Q Consensus 20 i~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~tl~i 97 (150) +.+-+.|-+|..++|+-+..+... +=+.....-+---|=|-||-..||..++.+.|.| ++|..+-|.-+|.- T Consensus 77 veaAkekdIvYAVPGHP~VAEktV----qlL~e~~ek~ni~Vkilgg~SFiD~~fealkiDP--veG~q~vDa~~l~~ 148 (488) T COG3956 77 VEAAKEKDIVYAVPGHPLVAEKTV----QLLIEACEKENIKVKILGGQSFIDALFEALKIDP--VEGFQIVDATDLSN 148 (488) T ss_pred HHHHCCCCEEEECCCCCHHHHHHH----HHHHHHHHCCCCEEEEECCCHHHHHHHHHHCCCC--CCCCEEECCCHHHH T ss_conf 986223563896699951688899----9999987625852797575026889998845785--56736741000157 No 164 >TIGR01249 pro_imino_pep_1 proline iminopeptidase; InterPro: IPR005944 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members. ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm. Probab=25.63 E-value=54 Score=14.54 Aligned_cols=55 Identities=22% Similarity=0.344 Sum_probs=31.3 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE---------ECCCCCCCHHHHHHHHHH Q ss_conf 999999998518971799845724578899970898115---------578346612489999999 Q 537021.9.peg.3 45 DFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF---------ENGLRITDQQTAEVVEMV 101 (150) Q Consensus 45 ~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~---------v~G~RVTd~~tl~iv~~v 101 (150) .+++||..++ .++.+||||==.-|.=+++...+.--| -.|-|=.|+..++-...+ T Consensus 242 ~lL~ni~~i~--~i~~~iv~GRyDl~cPl~~awaL~kafPea~L~v~~~AGHsa~dp~~~a~lv~a 305 (310) T TIGR01249 242 FLLDNISKIR--NIPTVIVHGRYDLICPLQSAWALHKAFPEAELKVVNNAGHSAFDPNNLAALVVA 305 (310) T ss_pred HHHHHHHHHC--CCCEEEEECCEEHCCHHCCHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHH T ss_conf 8886677640--687379844600002003544675218551466745788665765479999999 No 165 >cd07022 S49_Sppa_36K_type Signal peptide peptidase A (SppA) 36K type, a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV) 36K type: SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. Members in this subfamily are all bacterial and include sohB peptidase and protein C. These are sometimes referred to as 36K type since they contain only one domain, unlike E. coli SppA that also contains an amino-terminal domain. Site-directed mutagenesis and sequence analysis have shown these SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad and not the usual Ser-His-Asp catalytic triad found in the majority of serine proteases. Probab=25.56 E-value=55 Score=14.54 Aligned_cols=78 Identities=14% Similarity=0.215 Sum_probs=45.3 Q ss_pred CHHHHHHHHHCCCCCEEE-CC--------CCCCCHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCEEEEEEEECCEEEEEE Q ss_conf 245788999708981155-78--------346612489999999741668999999982-89976877865589899997 Q 537021.9.peg.3 67 GPQIGAVLEKMGIKSKFE-NG--------LRITDQQTAEVVEMVLAGSINKKIVSLINQ-TGTQAIGICGKDGNMVFAEK 136 (150) Q Consensus 67 g~qI~~~l~~~gi~~~~v-~G--------~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~-~g~~avglsg~dg~li~a~~ 136 (150) .+.+..+|++.|++.+.+ .| .|=.+++..+..+..+ ..+.........+ ++...--+....|.++.++. T Consensus 115 ~~~~~~ll~k~Gi~~~~~~sg~~K~~~~p~~~~s~e~~~~~q~~l-d~~~~~F~~~Va~~R~~~~~~~~~~~G~v~~g~~ 193 (214) T cd07022 115 HVDQSKALEKAGLKVTLIFAGAHKVDGNPDEPLSDEARARLQAEV-DALYAMFVAAVARNRGLSAAAVRATEGGVFRGQE 193 (214) T ss_pred CCCHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHCCCCHHHHHHHCCCEEEHHH T ss_conf 177688898679547874057633236875656889999999999-9999999999998579997788871698988999 Q ss_pred CCCCCCCCCC Q ss_conf 4752111799 Q 537021.9.peg.3 137 ARHSLRLSPN 146 (150) Q Consensus 137 ~~~~lg~vg~ 146 (150) -.++|++-+ T Consensus 194 -A~~~GLVD~ 202 (214) T cd07022 194 -AVAAGLADA 202 (214) T ss_pred -HHHCCCCCC T ss_conf -998599754 No 166 >KOG1533 consensus Probab=25.06 E-value=56 Score=14.48 Aligned_cols=107 Identities=19% Similarity=0.195 Sum_probs=61.3 Q ss_pred CHHHHHHHHHHHHCCCEEEEEECHHH--HCCHHHHHHHHHHHHHHHHCCCCEEEEECC--------CHHHHHHHHHC--- Q ss_conf 22899999999875988999867567--247468999999999985189717998457--------24578899970--- Q 537021.9.peg.3 11 EILEQVLPFVQFYENETIVVKYGGHV--MNCTDLSKDFVNDIALLKKSNITPVIVHGG--------GPQIGAVLEKM--- 77 (150) Q Consensus 11 ~~l~ea~pYi~~~r~ktiVIK~GG~~--l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg--------g~qI~~~l~~~--- 77 (150) .||++. |+.++++.++.-++|-+ ..+.+...++++ .|...+.+++.||=. +..|+.+|-.+ T Consensus 86 dwl~~~---l~~~~~~Y~lFDcPGQVELft~h~~l~~I~~---~Lek~~~rl~~V~LiDs~ycs~p~~~iS~lL~sl~tM 159 (290) T KOG1533 86 DWLLEK---LKPLTDHYVLFDCPGQVELFTHHDSLNKIFR---KLEKLDYRLVAVNLIDSHYCSDPSKFISSLLVSLATM 159 (290) T ss_pred HHHHHH---HHHCCCCEEEEECCCCEEEEECCCHHHHHHH---HHHHCCCEEEEEEEEECEEECCHHHHHHHHHHHHHHH T ss_conf 999997---4523474899957982798742560999999---9997695379997330214378489999999999999 Q ss_pred -CCCCEEE-----------CC-----C-CCCCHHHHHHHHHHH--------HHHHHHHHHHHHHHCCCCEEE Q ss_conf -8981155-----------78-----3-466124899999997--------416689999999828997687 Q 537021.9.peg.3 78 -GIKSKFE-----------NG-----L-RITDQQTAEVVEMVL--------AGSINKKIVSLINQTGTQAIG 123 (150) Q Consensus 78 -gi~~~~v-----------~G-----~-RVTd~~tl~iv~~vL--------~G~vn~~lv~~L~~~g~~avg 123 (150) .++..-+ .| + =.|+-+.+...+.-| -++.|..|++.++..+.-+.. T Consensus 160 l~melphVNvlSK~Dl~~~ygkl~f~ld~yt~v~Dl~yL~~~ld~dp~~~kYrkLne~ic~~IeD~~LVSF~ 231 (290) T KOG1533 160 LHMELPHVNVLSKADLLKKYGKLPFNLDFYTEVQDLSYLEDLLDVDPRLRKYRKLNEAICELIEDFNLVSFE 231 (290) T ss_pred HHHCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 862365001425767787616665453123206667777887502735667888899999997605730468 No 167 >TIGR02824 quinone_pig3 putative NAD(P)H quinone oxidoreductase, PIG3 family; InterPro: IPR014189 Members of this family are putative quinone oxidoreductases that belong to the broader superfamily of zinc-dependent alcohol (of medium chain length) dehydrogenases and quinone oxidoreductases. An alignment shows no motif of conserved Cys residues as are found in zinc-binding members of the superfamily, consequently members of this entry are likely to be quinone oxidoreductases. A member of this family in Homo sapiens (Human), PIG3, is induced by p53 but is otherwise uncharacterised.. Probab=24.94 E-value=43 Score=15.23 Aligned_cols=28 Identities=18% Similarity=0.219 Sum_probs=11.1 Q ss_pred HHHHHCCCCEEEEECCCHHHHHHHHHCC Q ss_conf 9985189717998457245788999708 Q 537021.9.peg.3 51 ALLKKSNITPVIVHGGGPQIGAVLEKMG 78 (150) Q Consensus 51 ~ll~~~G~k~ViVHGgg~qI~~~l~~~g 78 (150) .+-+..|.++..-=|.--......+++| T Consensus 163 qLAKA~Ga~V~~TaGS~eK~~~a~~~LG 190 (334) T TIGR02824 163 QLAKAFGARVFTTAGSDEKCATACEALG 190 (334) T ss_pred HHHHHCCCEEEEEECCHHHHHHHHHHCC T ss_conf 9998569759998289899999998609 No 168 >TIGR01326 OAH_OAS_sulfhy O-acetylhomoserine aminocarboxypropyltransferase/cysteine synthase; InterPro: IPR006235 This family of sequences is a distinct clade of the Cys/Met metabolism pyridoxal phosphate-dependent enzyme superfamily. Members include examples of OAH/OAS sulfhydrylase, an enzyme with activity both as O-acetylhomoserine sulfhydrylase (OAH, 2.5.1.49 from EC) and O-acetylserine sulfhydrylase (OAS, 2.5.1.47 from EC). An alternate name for OAH sulfhydrylase is homocysteine synthase.; GO: 0016765 transferase activity transferring alkyl or aryl (other than methyl) groups, 0006520 amino acid metabolic process. Probab=24.87 E-value=42 Score=15.27 Aligned_cols=51 Identities=22% Similarity=0.266 Sum_probs=35.1 Q ss_pred HHHHHHHHHHHHCCCCEEEE---E-CCCHHHHHHHHHCCCCCEEECCCCCCC-HHHHHH Q ss_conf 99999999985189717998---4-572457889997089811557834661-248999 Q 537021.9.peg.3 44 KDFVNDIALLKKSNITPVIV---H-GGGPQIGAVLEKMGIKSKFENGLRITD-QQTAEV 97 (150) Q Consensus 44 ~~~~~dI~ll~~~G~k~ViV---H-Ggg~qI~~~l~~~gi~~~~v~G~RVTd-~~tl~i 97 (150) .+..-.|.-|-..|-++|=. . |.--++.-.|+++|||.+|+|+ -| +|.++- T Consensus 83 AA~~~Ailnla~AGDnIVsS~~LYGGTynLF~~TlkrlGIevrFvd~---dd~pe~~~k 138 (434) T TIGR01326 83 AAITYAILNLAQAGDNIVSSSYLYGGTYNLFKHTLKRLGIEVRFVDG---DDDPEELEK 138 (434) T ss_pred HHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHCCEEEEEECC---CCCHHHHHH T ss_conf 99999999997269826980644422578999955544814887278---888789997 No 169 >cd01013 isochorismatase Isochorismatase, also known as 2,3 dihydro-2,3 dihydroxybenzoate synthase, catalyses the conversion of isochorismate, in the presence of water, to 2,3-dihydroxybenzoate and pyruvate, via the hydrolysis of a vinyl ether, an uncommon reaction in biological systems. Isochorismatase is part of the phenazine biosynthesis pathway. Phenazines are antimicrobial compounds that provide the competitive advantage for certain bacteria. Probab=24.80 E-value=56 Score=14.45 Aligned_cols=91 Identities=14% Similarity=0.233 Sum_probs=53.9 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEE-E--CCCHHHHHHHHHC---CC-----CCEEECCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 468999999999985189717998-4--5724578899970---89-----81155783466124899999997416689 Q 537021.9.peg.3 40 TDLSKDFVNDIALLKKSNITPVIV-H--GGGPQIGAVLEKM---GI-----KSKFENGLRITDQQTAEVVEMVLAGSINK 108 (150) Q Consensus 40 ~~l~~~~~~dI~ll~~~G~k~ViV-H--Ggg~qI~~~l~~~---gi-----~~~~v~G~RVTd~~tl~iv~~vL~G~vn~ 108 (150) +++..++.+=+...+..|+.++-+ | ...+.-...+... |. +.+.++.+...+.+. -+.+.-.++..++ T Consensus 53 ~~vv~ni~rL~~aaR~~GipVi~t~~~~~~~~~~r~~~~~~~g~~~~~g~~g~eiv~eL~P~~~d~-vi~K~~~saF~~T 131 (203) T cd01013 53 PQLIANIARLRDWCRQAGIPVVYTAQPGNQTPEQRALLNDFWGPGLTASPEETKIVTELAPQPDDT-VLTKWRYSAFKRS 131 (203) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEECCCCCCCCCC T ss_conf 889999999999999879969998347879966730011023788768986453371438999998-9854867886799 Q ss_pred HHHHHHHHCCCCEEEEEEEECCE Q ss_conf 99999982899768778655898 Q 537021.9.peg.3 109 KIVSLINQTGTQAIGICGKDGNM 131 (150) Q Consensus 109 ~lv~~L~~~g~~avglsg~dg~l 131 (150) +|...|++.|+..+-++|...|. T Consensus 132 ~L~~~Lr~~gi~~Lvv~Gv~T~~ 154 (203) T cd01013 132 PLLERLKESGRDQLIITGVYAHI 154 (203) T ss_pred CHHHHHHHCCCCEEEEEEEECCC T ss_conf 98999997699989998654251 No 170 >PRK12851 groEL chaperonin GroEL; Reviewed Probab=24.55 E-value=57 Score=14.42 Aligned_cols=109 Identities=14% Similarity=0.153 Sum_probs=55.5 Q ss_pred HHHCCCEEEEEECHHHH--CCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCC----CCHHH Q ss_conf 98759889998675672--4746899999999998518971799845724578899970898115578346----61248 Q 537021.9.peg.3 21 QFYENETIVVKYGGHVM--NCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRI----TDQQT 94 (150) Q Consensus 21 ~~~r~ktiVIK~GG~~l--~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RV----Td~~t 94 (150) .+..+.+.+|.+||+.- .+ +..+++-.-+...+.+=... +|.|||..--+....+.- .+-+.|..- .=.++ T Consensus 370 ~~l~g~~~~I~vgG~t~~~l~-E~er~l~DAl~~vr~~i~~g-vVpGGGa~e~~~a~~l~~-~~~~~g~eqlav~afa~A 446 (541) T PRK12851 370 AKLAGGVAVIRVGASTEVEVK-EKKDRVDDALHATRAAVEEG-IVPGGGVALLRAVKALDK-LTTANGDQRTGVEIVRRA 446 (541) T ss_pred CCCCCCEEEEEEECCCHHHHH-HHHHHHHHHHHHHHHHHHCC-EECCCHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHH T ss_conf 014786389996068688999-99999998899999974388-760777999999999986-136876899999999999 Q ss_pred HHHHHHHHH---HHHHHHHHHHHHHCCCCEEEEEEEECCEEE Q ss_conf 999999974---166899999998289976877865589899 Q 537021.9.peg.3 95 AEVVEMVLA---GSINKKIVSLINQTGTQAIGICGKDGNMVF 133 (150) Q Consensus 95 l~iv~~vL~---G~vn~~lv~~L~~~g~~avglsg~dg~li~ 133 (150) |+++-.+|+ |.=-.+++..|.. +-...|+....|.+.- T Consensus 447 Le~ip~tLaeNaG~d~~~vl~~L~~-~~~~~G~d~~~g~~~d 487 (541) T PRK12851 447 LEAPARQIAENAGAEGSVVVGKVRE-KPESYGFNAATGEYGD 487 (541) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHH-CCCCEEEECCCCCEEE T ss_conf 9999999999769998999999982-7999848678995711 No 171 >cd02516 CDP-ME_synthetase CDP-ME synthetase is involved in mevalonate-independent isoprenoid production. 4-diphosphocytidyl-2-methyl-D-erythritol synthase (CDP-ME), also called 2C-methyl-d-erythritol 4-phosphate cytidylyltransferase catalyzes the third step in the alternative (non-mevalonate) pathway of Isopentenyl diphosphate (IPP) biosynthesis: the formation of 4-diphosphocytidyl-2C-methyl-D-erythritol from CTP and 2C-methyl-D-erythritol 4-phosphate. This mevalonate independent pathway that utilizes pyruvate and glyceraldehydes 3-phosphate as starting materials for production of IPP occurs in a variety of bacteria, archaea and plant cells, but is absent in mammals. Thus, CDP-ME synthetase is an attractive targets for the structure-based design of selective antibacterial, herbicidal and antimalarial drugs. Probab=24.34 E-value=58 Score=14.39 Aligned_cols=48 Identities=15% Similarity=0.341 Sum_probs=21.7 Q ss_pred HHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECC-CHHH Q ss_conf 9999875988999867567247468999999999985189717998457-2457 Q 537021.9.peg.3 18 PFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGG-GPQI 70 (150) Q Consensus 18 pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGg-g~qI 70 (150) .+.+.+..+.+.+-.||.. . .+++..-+..+....-..|+||-+ +|.+ T Consensus 61 ~~~~~~~~~~i~~v~GG~t----R-~~SV~ngl~~l~~~~~~~VlIHDaaRP~i 109 (218) T cd02516 61 ELAKYGLSKVVKIVEGGAT----R-QDSVLNGLKALPDADPDIVLIHDAARPFV 109 (218) T ss_pred HHHHHCCCCCEEEECCCCC----H-HHHHHHHHHHHCCCCCCEEEEECCCCCCC T ss_conf 9886447987699899840----9-99999898740126999899925767789 No 172 >PRK05802 hypothetical protein; Provisional Probab=24.21 E-value=58 Score=14.37 Aligned_cols=73 Identities=21% Similarity=0.113 Sum_probs=50.2 Q ss_pred CCCHHHH--HHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH----HHHHHHHHCCCCCE Q ss_conf 0622899--99999987598899986756724746899999999998518971799845724----57889997089811 Q 537021.9.peg.3 9 QAEILEQ--VLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP----QIGAVLEKMGIKSK 82 (150) Q Consensus 9 ~a~~l~e--a~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~----qI~~~l~~~gi~~~ 82 (150) |+|.... -++||..-++...+|-..|=.+. | .+.-|..|.+-|-++.++-..|| .|++.|++.+++.- T Consensus 155 RGPywNGifGlk~I~~~kn~k~LvIaRGIgqA-P-----~vpVikkL~~n~NkV~vIid~~~~~~~fI~eyl~~~~~~~i 228 (328) T PRK05802 155 RGPYWNGVFGLKNIKSTKNGKSLVIARGIGQA-P-----AVPVIKKLYSNGNKVIVILDKGPFENNFIKEYLEEYNIEVI 228 (328) T ss_pred ECCCCCCCCCHHHHHHCCCCEEEEEECCCCCC-C-----CHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHCCCEEE T ss_conf 67742440370567505797399993562325-5-----55999999868997999985885531659999985783699 Q ss_pred EECCC Q ss_conf 55783 Q 537021.9.peg.3 83 FENGL 87 (150) Q Consensus 83 ~v~G~ 87 (150) -.+=+ T Consensus 229 ~~~~l 233 (328) T PRK05802 229 ECNLL 233 (328) T ss_pred EECCC T ss_conf 83142 No 173 >PRK09181 aspartate kinase; Validated Probab=23.91 E-value=59 Score=14.34 Aligned_cols=39 Identities=26% Similarity=0.356 Sum_probs=21.4 Q ss_pred CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCC---CC-EEEEECCC Q ss_conf 88999867567247468999999999985189---71-79984572 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSN---IT-PVIVHGGG 67 (150) Q Consensus 26 ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G---~k-~ViVHGgg 67 (150) ...|-||||+-+.|.+ .+.+.|..-...+ .+ +|+|=--| T Consensus 3 m~~V~KFGGTSVa~~e---~i~~~iii~~~~~~~~~~~vVVVSAmg 45 (476) T PRK09181 3 MHTVEKIGGTSMSAFD---AVLDNIILRPRKGKDLYNRIFVVSAYG 45 (476) T ss_pred CEEEEEECCCCCCCHH---HHHHHHHHHHCCCCCCCCEEEEECCCC T ss_conf 5199954866302299---999998654201356898899988999 No 174 >pfam02579 Nitro_FeMo-Co Dinitrogenase iron-molybdenum cofactor. This family contains several NIF (B, Y and X) proteins which are iron-molybdenum cofactors (FeMo-co) in the dinitrogenase enzyme which catalyses the reduction of dinitrogen to ammonium. Dinitrogenase is a hetero-tetrameric (alpha(2)beta(2)) enzyme which contains the iron-molybdenum cofactor (FeMo-co) at its active site. Probab=23.77 E-value=59 Score=14.32 Aligned_cols=32 Identities=44% Similarity=0.473 Sum_probs=27.3 Q ss_pred HHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE Q ss_conf 98518971799845724578899970898115 Q 537021.9.peg.3 52 LLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF 83 (150) Q Consensus 52 ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~ 83 (150) .|...|..+|++.+-|+..-+.|++.||++-. T Consensus 48 ~l~~~~~~~vi~~~iG~~a~~~L~~~gI~v~~ 79 (94) T pfam02579 48 LLALEGVDAVIVGGIGPNALAKLEAAGIKVYK 79 (94) T ss_pred HHHHCCCCEEEECCCCHHHHHHHHHCCCEEEE T ss_conf 99976998999687598899999988999999 No 175 >KOG0356 consensus Probab=23.71 E-value=59 Score=14.31 Aligned_cols=82 Identities=23% Similarity=0.253 Sum_probs=45.1 Q ss_pred HHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEEECCCCCCCHHH-HH Q ss_conf 99998759889998675672474689999999999851897179984572457889997089811557834661248-99 Q 537021.9.peg.3 18 PFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQQT-AE 96 (150) Q Consensus 18 pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~~t-l~ 96 (150) ..+.++.+-+.||++||+--..-+-..+=++|-----..++..=+|.|||...=.....+.--+. .+-++... .+ T Consensus 381 er~a~ls~gvavi~vGg~se~Ev~ekk~rv~dalnat~aaveeGivpGGG~all~~~~~l~~lk~----~~~~~~~~G~~ 456 (550) T KOG0356 381 ERLAKLSGGVAVIKVGGHSEVEVNEKKDRVEDALNATRAAVEEGIVPGGGTALLRAIPVLDELKA----TNNFDQKVGVE 456 (550) T ss_pred HHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCC----CCCHHHHHHHH T ss_conf 88987527817999547541025443325889888888775158556760788987655510466----65067665099 Q ss_pred HHHHHHH Q ss_conf 9999974 Q 537021.9.peg.3 97 VVEMVLA 103 (150) Q Consensus 97 iv~~vL~ 103 (150) +++.+|. T Consensus 457 iv~~Al~ 463 (550) T KOG0356 457 IVKKALR 463 (550) T ss_pred HHHHHHH T ss_conf 9999986 No 176 >PRK13561 putative phosphodiesterase; Provisional Probab=23.42 E-value=60 Score=14.28 Aligned_cols=63 Identities=13% Similarity=0.128 Sum_probs=29.0 Q ss_pred CCHHHHHHHHHHHHC--CCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEE-CCCHHHHHHH Q ss_conf 622899999999875--9889998675672474689999999999851897179984-5724578899 Q 537021.9.peg.3 10 AEILEQVLPFVQFYE--NETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVH-GGGPQIGAVL 74 (150) Q Consensus 10 a~~l~ea~pYi~~~r--~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVH-Ggg~qI~~~l 74 (150) ..++......+++|+ -..+++-+.-+++.+. .+....-+..|+.+|+++-|=+ |.|..-=..| T Consensus 500 ~~f~~~l~~~l~~~~~~p~~L~lEItEs~~~~~--~~~~~~~l~~Lr~~Gv~ialDDFGtGySSL~yL 565 (651) T PRK13561 500 PNMVADMLELLTRYRIQPGTLILEVTESRRIDD--PHAAVAILRPLRNAGVRIALDDFGMGYAGLRQL 565 (651) T ss_pred CCHHHHHHHHHHHHCCCHHHEEEEECCCHHHCC--HHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHH T ss_conf 719999999999849684539999779324349--999999999999789999998999964209998 No 177 >TIGR03343 biphenyl_bphD 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase. Members of this family are 2-hydroxy-6-oxo-6-phenylhexa-2,4-dienoate hydrolase, or HOPD hydrolase, the BphD protein of biphenyl degradation. BphD acts on the product of ring meta-cleavage by BphC. Many species carrying bphC and bphD are capable of degrading polychlorinated biphenyls as well as biphenyl itself. Probab=23.29 E-value=30 Score=16.16 Aligned_cols=65 Identities=12% Similarity=0.106 Sum_probs=31.4 Q ss_pred CHHHHHHHHHHHHCCCEEEEEECHHHHCCH-----HHHHHHHHHHH-HHHHCCCCEEEE--ECCCHHHHHHHHH Q ss_conf 228999999998759889998675672474-----68999999999-985189717998--4572457889997 Q 537021.9.peg.3 11 EILEQVLPFVQFYENETIVVKYGGHVMNCT-----DLSKDFVNDIA-LLKKSNITPVIV--HGGGPQIGAVLEK 76 (150) Q Consensus 11 ~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~-----~l~~~~~~dI~-ll~~~G~k~ViV--HGgg~qI~~~l~~ 76 (150) .++++..|+..+ +-..+++=+.|.=-.+. ......++|+. ++..+|+.-+.+ |--|-.|-..+.. T Consensus 48 ~~~~~l~~~~~~-~yrVIa~Dl~G~G~S~~~~~~~~~~~~~a~~~~~lld~L~i~~~~lvG~S~GG~iA~~~A~ 120 (282) T TIGR03343 48 NYYRNIGPFVDA-GYRVILKDSPGFNKSDAVVMDEQRGLVNARAVKGLMDALDIEKAHLVGNSMGGATALNFAL 120 (282) T ss_pred HHHHHHHHHHHC-CCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEEECCHHHHHHHHHH T ss_conf 999977999978-9859997189988899876555324799999999999749983799973737999999999 No 178 >PTZ00286 6-phospho-1-fructokinase; Provisional Probab=23.17 E-value=61 Score=14.25 Aligned_cols=82 Identities=16% Similarity=0.273 Sum_probs=47.2 Q ss_pred HHHHHHCCCCEEEEECCCH------HHHHHHHHCCCC------CEEE-CCCCCCCH-----HHHHHHHHHHHHHHHHHHH Q ss_conf 9998518971799845724------578899970898------1155-78346612-----4899999997416689999 Q 537021.9.peg.3 50 IALLKKSNITPVIVHGGGP------QIGAVLEKMGIK------SKFE-NGLRITDQ-----QTAEVVEMVLAGSINKKIV 111 (150) Q Consensus 50 I~ll~~~G~k~ViVHGgg~------qI~~~l~~~gi~------~~~v-~G~RVTd~-----~tl~iv~~vL~G~vn~~lv 111 (150) +..|...|+..+++-||-- .|.+..++.|+. |+.+ |.+-+||. .+.+.+..+ ++..=+ T Consensus 180 Vd~L~~~gId~LfvIGGDGTlrgA~~I~~e~~~Rgl~I~VVGIPKTIDNDI~~~D~TFGFdTAVe~AteA----I~~ah~ 255 (486) T PTZ00286 180 VDTLVRMKINILFTVGGDGTQRGALKIYEEAKRRGENIAVFGVPKTIDNDLAFSHRTFGFQTAVEQAVNA----VRAAYA 255 (486) T ss_pred HHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHCCCCHHHHHHHHHHH----HHHHHH T ss_conf 9999985999899978846688999999999971788506646854357641001002402399999999----999988 Q ss_pred HHHH-HCCCCEEEEEEEECCEEEEE Q ss_conf 9998-28997687786558989999 Q 537021.9.peg.3 112 SLIN-QTGTQAIGICGKDGNMVFAE 135 (150) Q Consensus 112 ~~L~-~~g~~avglsg~dg~li~a~ 135 (150) ++-+ .+|+--+-+-|.+.+||.+. T Consensus 256 EA~sa~nGIGlVkLMGR~SGfIA~~ 280 (486) T PTZ00286 256 EAVSLNYGVGIVKLMGRESGFIAAQ 280 (486) T ss_pred HHHCCCCCEEEEEECCCCCCHHHHH T ss_conf 7523568518999725551289988 No 179 >pfam01075 Glyco_transf_9 Glycosyltransferase family 9 (heptosyltransferase). Members of this family belong to glycosyltransferase family 9. Lipopolysaccharide is a major component of the outer leaflet of the outer membrane in Gram-negative bacteria. It is composed of three domains; lipid A, Core oligosaccharide and the O-antigen. All of these enzymes transfer heptose to the lipopolysaccharide core. Probab=22.96 E-value=61 Score=14.22 Aligned_cols=114 Identities=17% Similarity=0.201 Sum_probs=59.1 Q ss_pred HHHHHHHHHHHHCCCEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHH-C--CCCCEEECCC Q ss_conf 289999999987598899986756724746-89999999999851897179984572457889997-0--8981155783 Q 537021.9.peg.3 12 ILEQVLPFVQFYENETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEK-M--GIKSKFENGL 87 (150) Q Consensus 12 ~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~-~--gi~~~~v~G~ 87 (150) ....+..+......+.+||..|++- .... -.+.|++=+..|..-|.++|+ .||.++.++...+ . +.+.+.++=. T Consensus 94 ~~~~~~~~~~~~~~~~i~i~pga~~-~~K~Wp~e~f~~L~~~l~~~~~~vvl-~gg~~~~e~~~~~~i~~~~~~~~~~l~ 171 (249) T pfam01075 94 ERAAALDKAGNGGRPYIAFCPGASR-PAKRWPAEHFAELAQALQERGWQVVL-FGGAEDREEEIAKRLAAGLRTPCVNLL 171 (249) T ss_pred HHHHHHHHHHCCCCCEEEEECCCCC-CCCCCCHHHHHHHHHHHHHCCCCEEE-EECCCHHHHHHHHHHHHHCCCCEEECC T ss_conf 9999998620259998999738788-56779999999999999966995699-738678999999999863899868626 Q ss_pred -CCCCHHHHHHHHHH--HHHHHHHHHHHHHHHCCCCEEEEEEEE Q ss_conf -46612489999999--741668999999982899768778655 Q 537021.9.peg.3 88 -RITDQQTAEVVEMV--LAGSINKKIVSLINQTGTQAIGICGKD 128 (150) Q Consensus 88 -RVTd~~tl~iv~~v--L~G~vn~~lv~~L~~~g~~avglsg~d 128 (150) +.+=.|...++..+ +.| .-......-...|.+.++|=|.. T Consensus 172 g~~sL~el~ali~~a~l~I~-nDSGp~HiAaA~g~Pti~ifGpT 214 (249) T pfam01075 172 GKTSLEQAAALLAGADLVVG-NDSGLMHLAAALDRPVIGLYGPT 214 (249) T ss_pred CCCCHHHHHHHHHHCCEEEE-CCCHHHHHHHHCCCCEEEEECCC T ss_conf 99999999999985066885-79859999998399889997889 No 180 >PRK05569 flavodoxin; Provisional Probab=22.93 E-value=61 Score=14.22 Aligned_cols=70 Identities=14% Similarity=0.332 Sum_probs=42.6 Q ss_pred CEEEEEECHHHHCCH----HHHHHHHHHHHHHHHCCCCEEEEEC-----CCH---HHHHHHHHCCCCCEEECCCCC---C Q ss_conf 889998675672474----6899999999998518971799845-----724---578899970898115578346---6 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCT----DLSKDFVNDIALLKKSNITPVIVHG-----GGP---QIGAVLEKMGIKSKFENGLRI---T 90 (150) Q Consensus 26 ktiVIK~GG~~l~d~----~l~~~~~~dI~ll~~~G~k~ViVHG-----gg~---qI~~~l~~~gi~~~~v~G~RV---T 90 (150) +.++ +|-.++.+. .-+..|++++..+..-| |++.++| +|. ++.+.|+.+|++. ++.+++ - T Consensus 50 d~i~--~GspT~~~~~~~~~~~~~~l~~l~~~~~~~-K~~a~FGSyGW~~Ge~~~~~~~~l~~~g~~v--v~~~~~~~~P 124 (141) T PRK05569 50 DAVA--FGSPSMDNNNIEQEEMAPFLDQFKLTPNEN-KKCILFGSYGWDNGEFMKLWKDRMKDYGFNV--IGDLAVNESP 124 (141) T ss_pred CEEE--EECCCCCCCCCCHHHHHHHHHHHHCCCCCC-CEEEEEECCCCCCCHHHHHHHHHHHHCCCEE--ECEEEEEECC T ss_conf 9059--905455888777176999999985345589-8899996677886469999999999789848--2405896188 Q ss_pred CHHHHHHHHH Q ss_conf 1248999999 Q 537021.9.peg.3 91 DQQTAEVVEM 100 (150) Q Consensus 91 d~~tl~iv~~ 100 (150) |++.++-.+. T Consensus 125 ~~e~l~~c~e 134 (141) T PRK05569 125 NKEELNSAKE 134 (141) T ss_pred CHHHHHHHHH T ss_conf 9899999999 No 181 >TIGR01250 pro_imino_pep_2 proline-specific peptidases; InterPro: IPR005945 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidase belong to MEROPS peptidase family S33 (clan SC). They are proline iminopeptidase (Prolyl aminopeptidase, 3.4.11.5 from EC), which catalyzes the removal of the N-terminal proline from peptides. This family represents one of two related families of proline iminopeptidase containing the alpha/beta fold. The fine specificities of the various members, including both the range of short peptides from which proline can be removed and whether other amino acids such as alanine can be also removed, may vary among members. One of the members of this family is the tricorn protease (TRI) interacting factor 1 from Thermoplasma acidophilum. Factor 1 (F1) is a 33.5 kDa serine peptidase of the alpha/beta-hydrolase family. Tricorn generates small peptides, which are cleaved by F1 to yield single amino acids , . ; GO: 0016804 prolyl aminopeptidase activity, 0005737 cytoplasm. Probab=22.71 E-value=44 Score=15.11 Aligned_cols=30 Identities=27% Similarity=0.603 Sum_probs=19.3 Q ss_pred CCCEEEEECCCHH--------HHHHHHH-CCCCCEEECCC Q ss_conf 9717998457245--------7889997-08981155783 Q 537021.9.peg.3 57 NITPVIVHGGGPQ--------IGAVLEK-MGIKSKFENGL 87 (150) Q Consensus 57 G~k~ViVHGgg~q--------I~~~l~~-~gi~~~~v~G~ 87 (150) +++++++||| |- ..+.+++ .|++.-+.|-+ T Consensus 25 ~~~Ll~lHGG-PG~~H~Yl~~l~~~~~~~~G~~V~~YDQL 63 (302) T TIGR01250 25 KIKLLVLHGG-PGSSHEYLENLKELLKEDLGREVIMYDQL 63 (302) T ss_pred CCEEEEEECC-CCCCHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 9218997088-98874237999999764078279998324 No 182 >TIGR00963 secA preprotein translocase, SecA subunit; InterPro: IPR000185 Secretion across the inner membrane in some Gram-negative bacteria occurs via the preprotein translocase pathway. Proteins are produced in the cytoplasm as precursors, and require a chaperone subunit to direct them to the translocase component. . From there, the mature proteins are either targeted to the outer membrane, or remain as periplasmic proteins. The translocase protein subunits are encoded on the bacterial chromosome. The translocase itself comprises 7 proteins, including a chaperone protein (SecB), an ATPase (SecA), an integral membrane complex (SecCY, SecE and SecG), and two additional membrane proteins that promote the release of the mature peptide into the periplasm (SecD and SecF) . The chaperone protein SecB is a highly acidic homotetrameric protein that exists as a "dimer of dimers" in the bacterial cytoplasm. SecB maintains preproteins in an unfolded state after translation, and targets these to the peripheral membrane protein ATPase SecA for secretion . SecA is a cytoplasmic protein of 800 to 960 amino acid residues. Homologs of secA are also encoded in the chloroplast genome of some algae as well as in the nuclear genome of plants . It could be involved in the intraorganellar protein transport into thylakoids.; GO: 0005524 ATP binding, 0006605 protein targeting, 0006886 intracellular protein transport. Probab=22.62 E-value=62 Score=14.18 Aligned_cols=45 Identities=22% Similarity=0.330 Sum_probs=34.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCH----HHHHHHHHCCCCCEEEC Q ss_conf 46899999999998518971799845724----57889997089811557 Q 537021.9.peg.3 40 TDLSKDFVNDIALLKKSNITPVIVHGGGP----QIGAVLEKMGIKSKFEN 85 (150) Q Consensus 40 ~~l~~~~~~dI~ll~~~G~k~ViVHGgg~----qI~~~l~~~gi~~~~v~ 85 (150) ..-+.+++.||..+|..| +||||==..= -++++|++.||+..-.| T Consensus 418 ~~Kw~Av~~e~~~~h~~G-qPvLvGT~svE~SE~LS~lL~~~~I~H~VLN 466 (904) T TIGR00963 418 EEKWKAVVDEIKEIHAKG-QPVLVGTTSVEKSELLSKLLKERGIPHNVLN 466 (904) T ss_pred HHHHHHHHHHHHHHHHCC-CCEEEEECCHHHHHHHHHHHHHCCCCCCCCC T ss_conf 889999999999987468-9877752217778999999985789812247 No 183 >cd03466 Nitrogenase_NifN_2 Nitrogenase_nifN_2: A subgroup of the NifN subunit of the NifEN complex: NifN forms an alpha2beta2 tetramer with NifE. NifN and nifE are structurally homologous to nitrogenase MoFe protein beta and alpha subunits respectively. NifEN participates in the synthesis of the iron-molybdenum cofactor (FeMoco) of the MoFe protein. NifB-co (an iron and sulfur containing precursor of the FeMoco) from NifB is transferred to the NifEN complex where it is further processed to FeMoco. The nifEN bound precursor of FeMoco has been identified as a molybdenum-free, iron- and sulfur- containing analog of FeMoco. It has been suggested that this nifEN bound precursor also acts as a cofactor precursor in nitrogenase systems which require a cofactor other than FeMoco: i.e. iron-vanadium cofactor (FeVco) or iron only cofactor (FeFeco). This group also contains the Clostidium fused NifN-NifB protein. Probab=22.55 E-value=62 Score=14.17 Aligned_cols=105 Identities=14% Similarity=0.233 Sum_probs=49.3 Q ss_pred CHHHHHCCCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEE-ECCC-----HHHHHHHHH Q ss_conf 0023220622899999999875988999867567247468999999999985189717998-4572-----457889997 Q 537021.9.peg.3 3 EKIYQFQAEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIV-HGGG-----PQIGAVLEK 76 (150) Q Consensus 3 ~~~~~~~a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViV-HGgg-----~qI~~~l~~ 76 (150) +++.++++.++...++|-+...||.++|--+++. .-.+. .+|..+|+.|++| -|.. ..+.+.++. T Consensus 278 e~i~~er~r~~d~~~d~~~~~~gkr~ai~~~~~~------~~~~~---~~l~ElG~~~~~v~t~~~~~~~~~~~~~~~~~ 348 (429) T cd03466 278 EKYTRERGRLLDAMIDAHKYNFGRKAAIYGEPDF------VVAIT---RFVLENGMVPVLIATGSESKKLKEKLEEDLKE 348 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCHH------HHHHH---HHHHHCCCEEEEEEECCCCHHHHHHHHHHHHH T ss_conf 9999999999999999865035867999957366------99999---99998799789998389867789999999974 Q ss_pred CCCCCEEECCCCCCCHHHHH-HHHHH-----HHHHHHHHHHHHHHHCCCCEE Q ss_conf 08981155783466124899-99999-----741668999999982899768 Q 537021.9.peg.3 77 MGIKSKFENGLRITDQQTAE-VVEMV-----LAGSINKKIVSLINQTGTQAI 122 (150) Q Consensus 77 ~gi~~~~v~G~RVTd~~tl~-iv~~v-----L~G~vn~~lv~~L~~~g~~av 122 (150) .+.++.-+++ +|...++ .++.. +.+.-.+.+...+ |++-+ T Consensus 349 ~~~~~~v~~~---~d~~el~~~i~~~~~dliig~s~~k~iA~kl---giP~l 394 (429) T cd03466 349 YVEKCVILDG---ADFFDIESYAKELKIDVLIGNSYGRRIAEKL---GIPLI 394 (429) T ss_pred CCCCCEEEEC---CCHHHHHHHHHHCCCCEEEECCCHHHHHHHH---CCCEE T ss_conf 5898489828---9999999999765999999785268999980---99979 No 184 >KOG1342 consensus Probab=22.48 E-value=62 Score=14.16 Aligned_cols=57 Identities=19% Similarity=0.385 Sum_probs=43.7 Q ss_pred HHHHHHHHCCCEEEEEECHHHHCCHHH------HHHHHHHHHHHHHCCCCEEEEECCCHHHHH Q ss_conf 999999875988999867567247468------999999999985189717998457245788 Q 537021.9.peg.3 16 VLPFVQFYENETIVVKYGGHVMNCTDL------SKDFVNDIALLKKSNITPVIVHGGGPQIGA 72 (150) Q Consensus 16 a~pYi~~~r~ktiVIK~GG~~l~d~~l------~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~ 72 (150) ..|-+..|+-+-+|+-+|++-+..+.+ .....+-+...++.++.+.+|=|||--+-. T Consensus 244 i~~v~e~f~P~AiVLQCGaDSL~gDRlgcFnLsi~Gh~~Cv~fvksfn~pllvlGGGGYT~rN 306 (425) T KOG1342 244 ISKVMERFQPEAIVLQCGADSLAGDRLGCFNLSIKGHAECVKFVKSFNLPLLVLGGGGYTLRN 306 (425) T ss_pred HHHHHHHHCCCEEEEECCCCCCCCCCCCCEEECCHHHHHHHHHHHHCCCCEEEECCCCCCCHH T ss_conf 999998749764999737765567755500303164899999999739867997478655134 No 185 >cd03148 GATase1_EcHsp31_like Type 1 glutamine amidotransferase (GATase1)-like domain found in Escherichia coli Hsp31 protein (EcHsp31). Type 1 glutamine amidotransferase (GATase1)-like domain found in Escherichia coli Hsp31 protein (EcHsp31). This group includes proteins similar to EcHsp31. EcHsp31 has chaperone activity. EcHsp31 coordinates a metal ion using a 2-His-1-carboxylate motif present in various ions that use iron as a cofactor such as Carboxypeptidase A. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. This Cys together with a different His and, an Asp (rather than a Glu) residue form a different catalytic triad from the typical GATase1 domain. EcHsp31 is a homodimer. Probab=22.29 E-value=63 Score=14.14 Aligned_cols=100 Identities=18% Similarity=0.196 Sum_probs=53.5 Q ss_pred EEEECHHH-HC----CHHHHHHHHHHHHHHHHCCCC-EEEEECCCHHHHHHHHHCCCCCEEECCCCCCCH-----HHHHH Q ss_conf 99867567-24----746899999999998518971-799845724578899970898115578346612-----48999 Q 537021.9.peg.3 29 VVKYGGHV-MN----CTDLSKDFVNDIALLKKSNIT-PVIVHGGGPQIGAVLEKMGIKSKFENGLRITDQ-----QTAEV 97 (150) Q Consensus 29 VIK~GG~~-l~----d~~l~~~~~~dI~ll~~~G~k-~ViVHGgg~qI~~~l~~~gi~~~~v~G~RVTd~-----~tl~i 97 (150) |.--||+. |- |+++ ..++++ .+..|.- --+-||-.--++..+ .+ -...++|.|+|-= +...+ T Consensus 100 vf~pGGHG~m~Dl~~~~~l-~~ll~~---~~~~gk~vaaVCHGpaaLl~~~~--~~-G~~l~~Gk~vTgFtd~eE~~~~~ 172 (232) T cd03148 100 VFIPGGHGALIGIPESQDV-AAALQW---AIKNDRFVITLCHGPAAFLAARH--GG-GKNPLEGYSVCVFPDSLDEGANI 172 (232) T ss_pred EEECCCCCCCCCCCCCHHH-HHHHHH---HHHCCCEEEEECCHHHHHHCCCC--CC-CCCCCCCCEEEECCCHHHHHHHC T ss_conf 9967997530577579999-999999---99859979998674798740206--79-97121796775028889875301 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEE-----EEEEEECCEEEEE Q ss_conf 9999741668999999982899768-----7786558989999 Q 537021.9.peg.3 98 VEMVLAGSINKKIVSLINQTGTQAI-----GICGKDGNMVFAE 135 (150) Q Consensus 98 v~~vL~G~vn~~lv~~L~~~g~~av-----glsg~dg~li~a~ 135 (150) -...+.+++-..+...|.+.|.+-. +-..+|++||+.+ T Consensus 173 ~~~~~~~~~p~~le~~l~~~Ga~~~~~~~~~~vv~Dg~LiTGQ 215 (232) T cd03148 173 EIGYMPGHLTWLVGEELKKMGMNIINDDITGRVHKDRKLLTGD 215 (232) T ss_pred CCCCCCCCCCHHHHHHHHHCCCCEECCCCCCCEEECCCEEECC T ss_conf 1033753230469999998798162588987668769887798 No 186 >pfam02593 DUF166 Uncharacterized ArCR, COG1810. Probab=22.27 E-value=63 Score=14.13 Aligned_cols=106 Identities=20% Similarity=0.326 Sum_probs=64.3 Q ss_pred CHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCH-----HHHHHHHHCCCCCEEEC Q ss_conf 2289999999987598899986756724746899999999998518971799845724-----57889997089811557 Q 537021.9.peg.3 11 EILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGP-----QIGAVLEKMGIKSKFEN 85 (150) Q Consensus 11 ~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~-----qI~~~l~~~gi~~~~v~ 85 (150) .++.+...|+....+-=++|-|+ .+|++...+++ .++..|.+.|||-+..| |+.+.+++.|++-.+-. T Consensus 35 e~Ie~pee~Lp~~~~~Dl~i~y~----lhPDl~~~l~~---~~~e~g~kalIvp~~~~~g~~~~lk~~~e~~g~~~~~P~ 107 (215) T pfam02593 35 EFIEDPEEYLPEIPEADLVIAYG----LHPDLTLELAE---IAAETGIKALIVPAEAPKGLRKGLKEQLEEFGVEVEFPE 107 (215) T ss_pred CCCCCHHHHCCCCCCCCEEEEEC----CCCCHHHHHHH---HHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCEEECCC T ss_conf 32368788656677876899943----59507999999---998649988997468862128999999997597387787 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEEECCEEEEE Q ss_conf 83466124899999997416689999999828997687786558989999 Q 537021.9.peg.3 86 GLRITDQQTAEVVEMVLAGSINKKIVSLINQTGTQAIGICGKDGNMVFAE 135 (150) Q Consensus 86 G~RVTd~~tl~iv~~vL~G~vn~~lv~~L~~~g~~avglsg~dg~li~a~ 135 (150) = |= .|--.=|+.+-+-+...|-+-+-+...++.+..++ T Consensus 108 ~--------~C----sL~~~~~p~i~~F~~~FG~P~~ev~v~~~~I~~V~ 145 (215) T pfam02593 108 P--------FC----SLEPVGNPVIDEFAERFGRPELEVEVEDGKIKDVR 145 (215) T ss_pred C--------CC----CCCCCCCHHHHHHHHHHCCCEEEEEECCCEEEEEE T ss_conf 6--------55----78898886699999981996699997499499998 No 187 >cd01425 RPS2 Ribosomal protein S2 (RPS2), involved in formation of the translation initiation complex, where it might contact the messenger RNA and several components of the ribosome. It has been shown that in Escherichia coli RPS2 is essential for the binding of ribosomal protein S1 to the 30s ribosomal subunit. In humans, most likely in all vertebrates, and perhaps in all metazoans, the protein also functions as the 67 kDa laminin receptor (LAMR1 or 67LR), which is formed from a 37 kDa precursor, and is overexpressed in many tumors. 67LR is a cell surface receptor which interacts with a variety of ligands, laminin-1 and others. It is assumed that the ligand interactions are mediated via the conserved C-terminus, which becomes extracellular as the protein undergoes conformational changes which are not well understood. Specifically, a conserved palindromic motif, LMWWML, may participate in the interactions. 67LR plays essential roles in the adhesion of cells to the basement membrane an Probab=22.13 E-value=63 Score=14.12 Aligned_cols=116 Identities=16% Similarity=0.182 Sum_probs=69.8 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCC--CCEEECCC Q ss_conf 6228999999998759889998675672474689999999999851897179984572457889997089--81155783 Q 537021.9.peg.3 10 AEILEQVLPFVQFYENETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGI--KSKFENGL 87 (150) Q Consensus 10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi--~~~~v~G~ 87 (150) ..|=-...|||-..|+.+-+|-+.=.. ..+...+.=+..+..-| +-|+.-|..++.....++... ...|++++ T Consensus 13 ~~wnp~m~~yI~g~r~g~~IidL~~T~----~~L~~A~~~i~~~~~~~-g~ILfVgtk~~~~~~v~~~a~~~~~~yv~~r 87 (193) T cd01425 13 RRWNPKMKPYIYGERNGIHIIDLEKTL----EKLRLALNFIANIAAKG-GKILFVGTKPQAQRAVKKFAERTGSFYVNGR 87 (193) T ss_pred CCCCCCCCCHHHCCCCCCEEEEHHHHH----HHHHHHHHHHHHHHHCC-CEEEEEECCHHHHHHHHHHHHHCCCEEECCE T ss_conf 757987200222451981997799999----99999999999999759-9599997578999999999998399454485 Q ss_pred C----CCCHHHHHHHHH----------------------------HHHHHHHHHHHHHHHHCCCCEEEEEEEECC Q ss_conf 4----661248999999----------------------------974166899999998289976877865589 Q 537021.9.peg.3 88 R----ITDQQTAEVVEM----------------------------VLAGSINKKIVSLINQTGTQAIGICGKDGN 130 (150) Q Consensus 88 R----VTd~~tl~iv~~----------------------------vL~G~vn~~lv~~L~~~g~~avglsg~dg~ 130 (150) + .|...++.--.. ++.-..++..+...+..|++.+|+...|.+ T Consensus 88 WigG~LTN~~~~~~~i~~~~~~~~~k~~~~~~g~~~~~~~Pdliiv~d~~~~~~ai~Ea~~l~IPvI~i~Dtn~~ 162 (193) T cd01425 88 WLGGTLTNWKTIRKSIKRLKKLEKEKLEKNLGGIKDMFRLPDLVIVLDPRKEHQAIREASKLGIPVIAIVDTNCD 162 (193) T ss_pred ECCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCEEEEECCCCCHHHHHHHHHHCCCEEEEECCCCC T ss_conf 758772488888668998887542256652034010166999899988984168999998618755788508999 No 188 >cd04248 AAK_AK-Ectoine AAK_AK-Ectoine: Amino Acid Kinase Superfamily (AAK), AK-Ectoine; this CD includes the N-terminal catalytic domain of the aspartokinase of the ectoine (1,4,5,6-tetrahydro-2-methyl pyrimidine-4-carboxylate) biosynthetic pathway found in Methylomicrobium alcaliphilum, Vibrio cholerae, and other various halotolerant or halophilic bacteria. Bacteria exposed to hyperosmotic stress accumulate organic solutes called 'compatible solutes' of which ectoine, a heterocyclic amino acid, is one. Apart from its osmotic function, ectoine also exhibits a protective effect on proteins, nucleic acids and membranes against a variety of stress factors. de novo synthesis of ectoine starts with the phosphorylation of L-aspartate and shares its first two enzymatic steps with the biosynthesis of amino acids of the aspartate family: aspartokinase and L-aspartate-semialdehyde dehydrogenase. The M. alcaliphilum and the V. cholerae aspartokinases are encoded on the ectABCask operon. Probab=22.11 E-value=64 Score=14.11 Aligned_cols=15 Identities=33% Similarity=0.395 Sum_probs=10.5 Q ss_pred EEEEECHHHHCCHHH Q ss_conf 999867567247468 Q 537021.9.peg.3 28 IVVKYGGHVMNCTDL 42 (150) Q Consensus 28 iVIK~GG~~l~d~~l 42 (150) -|.||||+-+.|.+. T Consensus 2 kV~KFGGtSva~~~~ 16 (304) T cd04248 2 TVEKIGGTSMSAFGA 16 (304) T ss_pred EEEEECCCCCCCHHH T ss_conf 798677333258999 No 189 >COG2217 ZntA Cation transport ATPase [Inorganic ion transport and metabolism] Probab=22.11 E-value=64 Score=14.11 Aligned_cols=80 Identities=23% Similarity=0.358 Sum_probs=52.4 Q ss_pred HHHHHH--CCCE-EEEEECHHH----HCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHH-HHHHCCCCCEEECCCCC Q ss_conf 999987--5988-999867567----247468999999999985189717998457245788-99970898115578346 Q 537021.9.peg.3 18 PFVQFY--ENET-IVVKYGGHV----MNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGA-VLEKMGIKSKFENGLRI 89 (150) Q Consensus 18 pYi~~~--r~kt-iVIK~GG~~----l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~-~l~~~gi~~~~v~G~RV 89 (150) +.+..+ .|+| +.+-++|.. -..+.+++.-.+-|..|+..|+++++.-|=.+..-+ ..+++|| .+++.+ + T Consensus 507 ~~~~~~~~~G~t~v~va~dg~~~g~i~~~D~~R~~a~~aI~~L~~~Gi~~~mLTGDn~~~A~~iA~~lGI-d~v~A~--l 583 (713) T COG2217 507 ERIEALESEGKTVVFVAVDGKLVGVIALADELRPDAKEAIAALKALGIKVVMLTGDNRRTAEAIAKELGI-DEVRAE--L 583 (713) T ss_pred HHHHHHHHCCCEEEEEEECCEEEEEEEECCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCH-HHEECC--C T ss_conf 4578988549769999999999999997078883499999999987993999859998999999997392-623507--9 Q ss_pred CCHHHHHHHHH Q ss_conf 61248999999 Q 537021.9.peg.3 90 TDQQTAEVVEM 100 (150) Q Consensus 90 Td~~tl~iv~~ 100 (150) .+++=.++++. T Consensus 584 lPedK~~~V~~ 594 (713) T COG2217 584 LPEDKAEIVRE 594 (713) T ss_pred CCHHHHHHHHH T ss_conf 91889999999 No 190 >PRK01158 phosphoglycolate phosphatase; Provisional Probab=22.10 E-value=64 Score=14.11 Aligned_cols=67 Identities=22% Similarity=0.219 Sum_probs=44.6 Q ss_pred CEEEEEECHHHHCCHH-HHHHHHHHHHHHHHCCCCEEEEECCCH-HHHHHHHHCCCCCEEE--CCCCCCCH Q ss_conf 8899986756724746-899999999998518971799845724-5788999708981155--78346612 Q 537021.9.peg.3 26 ETIVVKYGGHVMNCTD-LSKDFVNDIALLKKSNITPVIVHGGGP-QIGAVLEKMGIKSKFE--NGLRITDQ 92 (150) Q Consensus 26 ktiVIK~GG~~l~d~~-l~~~~~~dI~ll~~~G~k~ViVHGgg~-qI~~~l~~~gi~~~~v--~G~RVTd~ 92 (150) |.++.=+.|..+.+.. ..+...+.|..++..|+++++.-|-.. .+.......+....++ ||-.+.+. T Consensus 4 Kli~~DlDGTLl~~~~~i~~~~~~ai~~l~~~Gi~v~iaTGR~~~~~~~~~~~~~~~~~~I~~nG~~i~~~ 74 (226) T PRK01158 4 KAIVIDIDGTITDPDRRLHLKAVEAIRKAEKLGVPVILVTGNILCFARAAAILIGISGPVIAENGGVVTFG 74 (226) T ss_pred CEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCCEEEEECCEEEEEC T ss_conf 38999457776469197199999999999988998999899977889999997099972998388599977 No 191 >cd07042 STAS_SulP_like_sulfate_transporter Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function. Probab=21.05 E-value=67 Score=13.98 Aligned_cols=74 Identities=15% Similarity=0.229 Sum_probs=52.2 Q ss_pred CCCHHHHHHHHHHHHCC--CEEEEEECHHHHCCHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCEE Q ss_conf 06228999999998759--8899986756724746899999999998518971799845724578899970898115 Q 537021.9.peg.3 9 QAEILEQVLPFVQFYEN--ETIVVKYGGHVMNCTDLSKDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGIKSKF 83 (150) Q Consensus 9 ~a~~l~ea~pYi~~~r~--ktiVIK~GG~~l~d~~l~~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi~~~~ 83 (150) -|..|++...-...... +.+|+-+.+-...|....+.|.+=+..++..|+++++. |-.+++-+.|++.|+.... T Consensus 23 ~~~~~~~~i~~~~~~~~~~~~vvld~~~v~~iD~s~~~~L~~~~~~~~~~g~~l~~~-~~~~~~~~~l~~~g~~~~~ 98 (107) T cd07042 23 NAEYFKDRLLRLVDEDPPLKVVILDLSAVNFIDSTAAEALEELVKDLRKRGVELYLA-GLNPQVRELLERAGLLDEI 98 (107) T ss_pred EHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEE-CCCHHHHHHHHHCCCHHHC T ss_conf 799999999998715899729999984998547599999999999999779999998-0987999999986997564 No 192 >COG0648 Nfo Endonuclease IV [DNA replication, recombination, and repair] Probab=20.98 E-value=67 Score=13.97 Aligned_cols=28 Identities=11% Similarity=0.226 Sum_probs=23.3 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCH Q ss_conf 6899999999998518971799845724 Q 537021.9.peg.3 41 DLSKDFVNDIALLKKSNITPVIVHGGGP 68 (150) Q Consensus 41 ~l~~~~~~dI~ll~~~G~k~ViVHGgg~ 68 (150) ..++.|.+++.....+|.+.+++|.|-. T Consensus 84 kS~~~l~~e~~r~~~lG~~~lv~HpG~~ 111 (280) T COG0648 84 KSIERLIDEIDRCEQLGAKLLVFHPGSY 111 (280) T ss_pred HHHHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 9999999999999974996899777501 No 193 >cd04895 ACT_ACR_1 ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein). This CD includes the N-terminal ACT domain, of a novel type of ACT domain-containing protein which is composed almost entirely of four ACT domain repeats (the "ACR" protein). ACR proteins, found only in Arabidopsis and Oryza, as yet, are proposed to function as novel regulatory or sensor proteins in plants. Nine ACR gene products have been described (ACR1-8 in Arabidopsis and OsARC1-9 in Oryza) and are represented in this CD. Members of this CD belong to the superfamily of ACT regulatory domains. Probab=20.93 E-value=67 Score=13.96 Aligned_cols=19 Identities=26% Similarity=0.352 Sum_probs=17.1 Q ss_pred CCCCCCCHHHHHHHHHHHH Q ss_conf 7834661248999999974 Q 537021.9.peg.3 85 NGLRITDQQTAEVVEMVLA 103 (150) Q Consensus 85 ~G~RVTd~~tl~iv~~vL~ 103 (150) .|..+||+..++.++..|. T Consensus 51 ~G~Ki~d~~~i~~I~~~L~ 69 (72) T cd04895 51 LGNKLTDDSLIAYIEKSLG 69 (72) T ss_pred CCCCCCCHHHHHHHHHHHC T ss_conf 9999558899999999866 No 194 >PRK01096 deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional Probab=20.58 E-value=68 Score=13.91 Aligned_cols=35 Identities=17% Similarity=0.203 Sum_probs=30.2 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCC Q ss_conf 999999999851897179984572457889997089 Q 537021.9.peg.3 44 KDFVNDIALLKKSNITPVIVHGGGPQIGAVLEKMGI 79 (150) Q Consensus 44 ~~~~~dI~ll~~~G~k~ViVHGgg~qI~~~l~~~gi 79 (150) ..+++.++|.|.+| +|=-=|+|-..|++++++.|- T Consensus 100 ~~lvea~aLaHDlG-HPPFGH~GE~aL~~~~~~~~g 134 (442) T PRK01096 100 GAIVQAACLAHDIG-NPPFGHAGEEAIREWFQDAAG 134 (442) T ss_pred HHHHHHHHHHHHCC-CCCCCCHHHHHHHHHHHHCCC T ss_conf 78999999886158-997762289999999997257 No 195 >COG4148 ModC ABC-type molybdate transport system, ATPase component [Inorganic ion transport and metabolism] Probab=20.50 E-value=61 Score=14.25 Aligned_cols=50 Identities=12% Similarity=0.266 Sum_probs=29.8 Q ss_pred CCCHHHHHCCCHHHHHHHHHHHHCCCEE--EEEECHHHHCCHHHHHHHHHHHHHHH Q ss_conf 9600232206228999999998759889--99867567247468999999999985 Q 537021.9.peg.3 1 MTEKIYQFQAEILEQVLPFVQFYENETI--VVKYGGHVMNCTDLSKDFVNDIALLK 54 (150) Q Consensus 1 ~~~~~~~~~a~~l~ea~pYi~~~r~kti--VIK~GG~~l~d~~l~~~~~~dI~ll~ 54 (150) |+|-++-.+-+-=+|.+||+.+.++++= ++-+.-|. |+ ...+++.+.+|. T Consensus 152 mDEPLaSLD~~RK~EilpylERL~~e~~IPIlYVSHS~--~E--v~RLAd~vV~le 203 (352) T COG4148 152 MDEPLASLDLPRKREILPYLERLRDEINIPILYVSHSL--DE--VLRLADRVVVLE 203 (352) T ss_pred ECCCHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCH--HH--HHHHHHEEEEEC T ss_conf 06840323651046778999998762188789993689--99--996542389820 No 196 >PRK10528 multifunctional acyl-CoA thioesterase I and protease I and lysophospholipase L1; Provisional Probab=20.11 E-value=70 Score=13.85 Aligned_cols=54 Identities=15% Similarity=0.202 Sum_probs=36.1 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEECHHHHC----CHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 62289999999987598899986756724----7468999999999985189717998 Q 537021.9.peg.3 10 AEILEQVLPFVQFYENETIVVKYGGHVMN----CTDLSKDFVNDIALLKKSNITPVIV 63 (150) Q Consensus 10 a~~l~ea~pYi~~~r~ktiVIK~GG~~l~----d~~l~~~~~~dI~ll~~~G~k~ViV 63 (150) +.-|+.--.-+.+|+-..++|.+||+=+. -+...+|+-+=|..++..|++++|+ T Consensus 57 ~~gL~Rl~~~L~~~~P~~Vii~lGgND~LRg~~~~~~~~NL~~ii~~~~~~g~~vlL~ 114 (191) T PRK10528 57 QQGLARLPALLKQHQPRWVLVELGGNDGLRGFPPQQTEQTLRQIIQDVKAANAQPLLM 114 (191) T ss_pred HHHHHHHHHHHHCCCCCEEEEEECCCCHHCCCCHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 4699998988740799989998468434308998999999999999999889978997 Done!