Query gi|254780971|ref|YP_003065384.1| phosphoribosylformylglycinamidine synthase subunit I [Candidatus Liberibacter asiaticus str. psy62] Match_columns 219 No_of_seqs 117 out of 1998 Neff 5.5 Searched_HMMs 39220 Date Mon May 30 02:40:42 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780971.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01737 FGAM_synth_I phospho 100.0 0 0 681.5 17.7 219 1-219 1-262 (264) 2 PRK03619 phosphoribosylformylg 100.0 0 0 621.5 19.2 217 1-218 1-217 (223) 3 COG0047 PurL Phosphoribosylfor 100.0 0 0 597.4 19.7 218 1-219 3-228 (231) 4 PRK01175 phosphoribosylformylg 100.0 0 0 573.0 19.6 215 2-218 5-249 (255) 5 cd01740 GATase1_FGAR_AT Type 1 100.0 0 0 542.8 18.6 214 3-218 1-238 (238) 6 TIGR01735 FGAM_synt phosphorib 100.0 0 0 520.0 14.2 214 2-217 1129-1397(1401) 7 PRK05297 phosphoribosylformylg 100.0 0 0 507.7 18.7 215 2-218 1040-1290(1294) 8 KOG1907 consensus 100.0 0 0 462.3 14.7 215 2-218 1060-1316(1320) 9 TIGR01857 FGAM-synthase phosph 100.0 0 0 458.7 15.2 217 2-218 1018-1276(1279) 10 TIGR01739 tegu_FGAM_synt herpe 100.0 1.6E-32 4E-37 228.8 10.0 200 2-207 1026-1285(1313) 11 PRK13143 hisH imidazole glycer 99.8 1.4E-19 3.5E-24 146.1 12.2 183 2-218 3-197 (201) 12 PRK13153 consensus 99.8 1.3E-19 3.3E-24 146.2 10.7 184 3-219 2-202 (203) 13 PRK13175 consensus 99.8 2.7E-19 6.8E-24 144.2 11.6 187 2-219 3-202 (206) 14 PRK13525 glutamine amidotransf 99.8 2.1E-19 5.4E-24 144.9 10.4 173 1-218 2-184 (191) 15 COG0118 HisH Glutamine amidotr 99.8 1.3E-18 3.2E-23 139.9 11.8 184 1-218 2-200 (204) 16 PRK13147 consensus 99.8 2E-18 5E-23 138.7 12.4 189 1-219 1-208 (211) 17 PRK13180 consensus 99.8 2.1E-18 5.4E-23 138.5 12.2 190 2-218 3-206 (209) 18 PRK13146 hisH imidazole glycer 99.8 1.9E-18 4.8E-23 138.8 10.9 187 1-219 2-205 (208) 19 PRK13527 glutamine amidotransf 99.8 2.4E-18 6.1E-23 138.2 9.5 175 1-218 1-190 (196) 20 PRK13181 hisH imidazole glycer 99.8 3.3E-18 8.4E-23 137.2 10.1 185 3-219 2-198 (199) 21 TIGR01855 IMP_synth_hisH imida 99.8 2.7E-18 6.8E-23 137.9 9.1 172 15-219 16-210 (211) 22 PRK13141 hisH imidazole glycer 99.8 1.1E-17 2.7E-22 134.0 11.9 187 1-219 1-201 (204) 23 PRK13170 hisH imidazole glycer 99.8 1.1E-17 2.8E-22 133.9 11.3 181 1-219 1-195 (196) 24 PRK13148 consensus 99.8 6.5E-18 1.7E-22 135.4 10.1 185 1-219 2-222 (225) 25 PRK13172 consensus 99.8 6.9E-18 1.8E-22 135.2 9.9 187 1-219 2-212 (213) 26 PRK13173 consensus 99.7 2.3E-17 5.8E-22 131.9 10.7 184 1-219 2-208 (211) 27 PRK13178 consensus 99.7 1.2E-17 3E-22 133.7 9.2 183 3-218 2-203 (213) 28 cd01748 GATase1_IGP_Synthase T 99.7 2.7E-17 6.9E-22 131.4 10.9 183 3-218 1-198 (198) 29 PRK13174 consensus 99.7 6E-17 1.5E-21 129.2 12.7 185 2-219 3-207 (212) 30 CHL00188 hisH imidazole glycer 99.7 2.5E-17 6.3E-22 131.7 9.8 188 1-218 2-207 (210) 31 PRK13152 hisH imidazole glycer 99.7 7.2E-17 1.8E-21 128.7 10.9 182 3-219 2-200 (201) 32 PRK13151 consensus 99.7 9.3E-17 2.4E-21 128.0 11.0 176 1-218 1-191 (195) 33 pfam01174 SNO SNO glutamine am 99.7 7.5E-17 1.9E-21 128.6 9.1 177 5-218 1-187 (188) 34 PRK13171 consensus 99.7 2.5E-16 6.5E-21 125.2 11.4 181 1-219 2-196 (200) 35 cd01749 GATase1_PB Glutamine A 99.7 6.9E-17 1.8E-21 128.8 8.5 172 3-218 1-183 (183) 36 PRK13179 consensus 99.7 2.7E-16 6.9E-21 125.0 11.5 183 3-218 2-204 (207) 37 PRK13177 consensus 99.7 1.2E-16 3E-21 127.4 9.4 184 2-219 4-204 (207) 38 PRK13144 consensus 99.7 1.8E-16 4.5E-21 126.2 9.1 176 1-219 1-186 (190) 39 PRK13176 consensus 99.7 4.7E-16 1.2E-20 123.5 11.1 191 1-219 1-213 (216) 40 PRK13142 hisH imidazole glycer 99.6 7.5E-15 1.9E-19 115.8 7.6 171 3-218 2-185 (192) 41 cd01742 GATase1_GMP_Synthase T 99.5 1.2E-13 3.1E-18 108.0 11.4 177 3-218 1-181 (181) 42 PRK00074 guaA GMP synthase; Re 99.5 8.2E-13 2.1E-17 102.8 11.1 179 2-218 6-187 (513) 43 PRK00784 cobyric acid synthase 99.4 6.5E-12 1.7E-16 97.0 11.1 163 1-197 254-435 (492) 44 PRK07765 para-aminobenzoate sy 99.4 9.1E-12 2.3E-16 96.1 11.8 183 1-219 1-190 (221) 45 cd01743 GATase1_Anthranilate_S 99.4 1.7E-11 4.3E-16 94.4 12.9 178 4-218 2-184 (184) 46 PRK06895 para-aminobenzoate sy 99.4 1.4E-11 3.6E-16 94.9 12.4 183 1-219 3-188 (191) 47 TIGR00888 guaA_Nterm GMP synth 99.4 1.2E-12 3.1E-17 101.7 6.9 165 18-219 15-189 (195) 48 pfam00117 GATase Glutamine ami 99.4 1.7E-11 4.2E-16 94.4 11.5 178 5-218 2-184 (187) 49 PRK13526 glutamine amidotransf 99.3 1.5E-11 3.8E-16 94.7 9.0 162 1-202 3-169 (179) 50 PRK00758 GMP synthase subunit 99.3 7.3E-11 1.9E-15 90.3 11.9 175 2-219 1-179 (184) 51 COG0311 PDX2 Predicted glutami 99.2 6.1E-11 1.6E-15 90.8 9.7 168 1-202 1-179 (194) 52 PRK08857 para-aminobenzoate sy 99.2 1.6E-10 4E-15 88.2 10.9 174 15-219 13-191 (192) 53 cd01744 GATase1_CPSase Small c 99.2 1.6E-10 4E-15 88.1 10.7 174 3-218 1-178 (178) 54 cd01750 GATase1_CobQ Type 1 gl 99.2 3.1E-10 8E-15 86.3 11.8 86 3-94 1-87 (194) 55 PRK05670 anthranilate synthase 99.2 2.6E-10 6.5E-15 86.8 11.2 180 3-219 2-186 (192) 56 COG0518 GuaA GMP synthase - Gl 99.2 1.8E-10 4.5E-15 87.8 10.1 183 1-218 2-190 (198) 57 cd01745 GATase1_2 Subgroup of 99.2 1.3E-10 3.2E-15 88.8 8.7 151 16-218 23-189 (189) 58 PRK01077 cobyrinic acid a,c-di 99.2 7E-10 1.8E-14 84.0 11.7 172 2-198 247-434 (451) 59 PRK06278 cobyrinic acid a,c-di 99.2 4.9E-11 1.3E-15 91.4 5.2 167 1-197 2-186 (482) 60 CHL00101 trpG anthranilate syn 99.1 1.5E-09 3.8E-14 82.0 11.6 180 3-219 2-187 (190) 61 PRK11366 puuD gamma-glutamyl-g 99.1 9.6E-10 2.4E-14 83.2 10.3 181 18-219 32-242 (254) 62 PRK08007 para-aminobenzoate sy 99.1 2.5E-09 6.3E-14 80.5 11.4 169 15-219 13-186 (187) 63 PRK06774 para-aminobenzoate sy 99.1 1.9E-09 4.8E-14 81.3 10.3 172 15-218 13-189 (191) 64 PRK08250 glutamine amidotransf 99.1 2.4E-09 6.1E-14 80.6 10.5 174 1-200 1-183 (235) 65 cd01746 GATase1_CTP_Synthase T 99.0 7.7E-09 2E-13 77.4 12.7 174 18-218 20-235 (235) 66 COG0504 PyrG CTP synthase (UTP 99.0 5.1E-09 1.3E-13 78.5 11.8 176 18-218 308-523 (533) 67 TIGR00313 cobQ cobyric acid sy 99.0 4.4E-10 1.1E-14 85.3 6.1 175 1-197 249-447 (502) 68 PRK07649 para-aminobenzoate/an 99.0 3.7E-09 9.3E-14 79.4 10.6 168 15-218 13-185 (195) 69 cd01741 GATase1_1 Subgroup of 99.0 3.6E-09 9.2E-14 79.5 10.3 172 2-199 1-181 (188) 70 pfam07722 Peptidase_C26 Peptid 99.0 1.3E-08 3.2E-13 76.0 12.2 160 19-198 31-219 (219) 71 cd03130 GATase1_CobB Type 1 gl 99.0 5.3E-09 1.4E-13 78.4 9.7 159 15-197 14-185 (198) 72 COG2071 Predicted glutamine am 99.0 1.9E-08 4.8E-13 74.9 12.3 177 17-218 31-235 (243) 73 PRK05665 amidotransferase; Pro 99.0 2.1E-08 5.4E-13 74.6 12.5 169 1-198 3-187 (240) 74 PRK06490 glutamine amidotransf 98.9 1.2E-08 3E-13 76.2 10.0 172 2-204 13-193 (243) 75 PRK09065 glutamine amidotransf 98.9 3.6E-08 9.2E-13 73.1 12.2 173 1-200 4-191 (238) 76 PRK09522 bifunctional anthrani 98.9 7.5E-08 1.9E-12 71.1 12.6 178 1-219 2-188 (531) 77 KOG0623 consensus 98.8 2.5E-08 6.3E-13 74.1 9.2 185 5-218 6-205 (541) 78 TIGR00337 PyrG CTP synthase; I 98.8 2.3E-08 5.9E-13 74.3 8.7 157 40-218 372-569 (571) 79 PRK06186 hypothetical protein; 98.8 1.3E-07 3.4E-12 69.5 12.5 178 17-218 20-223 (229) 80 PRK05380 pyrG CTP synthetase; 98.8 3.7E-08 9.5E-13 73.0 9.3 178 17-218 307-524 (534) 81 pfam07685 GATase_3 CobB/CobQ-l 98.8 1.9E-08 4.9E-13 74.9 7.7 54 38-94 4-57 (158) 82 PRK07567 glutamine amidotransf 98.8 5.5E-08 1.4E-12 71.9 9.9 159 18-199 20-192 (242) 83 PRK05637 anthranilate synthase 98.8 1.8E-07 4.5E-12 68.7 12.2 184 2-219 3-203 (208) 84 KOG1622 consensus 98.8 1.2E-08 3E-13 76.2 5.3 143 40-218 58-201 (552) 85 PRK07053 glutamine amidotransf 98.7 1.8E-07 4.5E-12 68.7 10.1 171 2-200 4-182 (235) 86 cd01747 GATase1_Glutamyl_Hydro 98.7 1.5E-07 3.7E-12 69.2 8.2 190 3-206 1-221 (273) 87 COG1492 CobQ Cobyric acid synt 98.5 2.3E-07 6E-12 67.9 6.0 88 1-95 252-341 (486) 88 COG0512 PabA Anthranilate/para 98.5 3.3E-06 8.3E-11 60.6 11.7 168 15-219 15-189 (191) 89 TIGR01382 PfpI intracellular p 98.5 1.4E-07 3.6E-12 69.3 3.9 60 36-100 71-130 (189) 90 PRK13896 cobyrinic acid a,c-di 98.5 7.6E-06 1.9E-10 58.2 12.4 171 2-197 234-418 (432) 91 cd03134 GATase1_PfpI_like A ty 98.4 1.5E-06 3.8E-11 62.8 8.0 92 2-101 1-116 (165) 92 cd03135 GATase1_DJ-1 Type 1 gl 98.4 1.4E-06 3.6E-11 62.9 7.7 92 3-101 1-115 (163) 93 TIGR01368 CPSaseIIsmall carbam 98.4 4.3E-06 1.1E-10 59.8 9.3 175 2-218 198-378 (383) 94 PRK12564 carbamoyl phosphate s 98.4 9.2E-06 2.4E-10 57.7 11.0 122 66-218 229-351 (355) 95 cd03169 GATase1_PfpI_1 Type 1 98.3 3.6E-06 9.2E-11 60.3 7.9 92 2-101 1-130 (180) 96 KOG1224 consensus 98.3 8.5E-06 2.2E-10 57.9 9.7 171 16-219 29-215 (767) 97 PRK12838 carbamoyl phosphate s 98.3 1.8E-05 4.7E-10 55.8 11.3 174 2-218 169-347 (356) 98 cd03140 GATase1_PfpI_3 Type 1 98.2 3.5E-06 9E-11 60.4 6.5 91 3-101 1-113 (170) 99 CHL00197 carA carbamoyl-phosph 98.2 4.3E-05 1.1E-09 53.5 12.0 146 41-218 216-374 (383) 100 PRK11574 hypothetical protein; 98.1 1.3E-05 3.2E-10 56.8 7.9 92 2-100 4-121 (196) 101 cd03137 GATase1_AraC_1 AraC tr 98.0 1.1E-05 2.9E-10 57.1 5.5 92 3-100 1-117 (187) 102 cd03128 GAT_1 Type 1 glutamine 98.0 3.1E-05 7.9E-10 54.3 7.6 84 3-92 1-92 (92) 103 cd03139 GATase1_PfpI_2 Type 1 98.0 2.6E-05 6.7E-10 54.8 6.7 57 38-100 59-115 (183) 104 KOG3210 consensus 97.9 4.1E-05 1E-09 53.6 7.2 174 3-200 14-205 (226) 105 TIGR00566 trpG_papA glutamine 97.9 4.5E-05 1.1E-09 53.3 7.0 163 15-219 13-211 (212) 106 cd01653 GATase1 Type 1 glutami 97.9 5.1E-05 1.3E-09 52.9 6.9 84 3-92 1-92 (115) 107 COG1797 CobB Cobyrinic acid a, 97.9 4.7E-05 1.2E-09 53.2 6.6 170 2-197 247-434 (451) 108 COG0693 ThiJ Putative intracel 97.9 9.4E-05 2.4E-09 51.3 7.9 94 1-101 3-122 (188) 109 PRK13566 anthranilate synthase 97.8 0.00072 1.8E-08 45.6 12.2 162 17-218 539-712 (724) 110 pfam01965 DJ-1_PfpI DJ-1/PfpI 97.8 1.2E-05 3.1E-10 57.0 3.0 61 35-100 28-88 (141) 111 KOG2387 consensus 97.7 3.1E-05 7.9E-10 54.3 3.4 150 39-207 361-537 (585) 112 PRK09393 ftrA transcriptional 97.7 6.9E-05 1.8E-09 52.1 4.8 110 2-119 9-147 (320) 113 pfam09825 BPL_N Biotin-protein 97.6 0.0069 1.8E-07 39.3 14.9 188 1-208 1-224 (364) 114 COG0505 CarA Carbamoylphosphat 97.6 0.0017 4.4E-08 43.2 10.9 163 15-218 191-359 (368) 115 TIGR01815 TrpE-clade3 anthrani 97.6 0.0002 5.1E-09 49.1 5.9 163 17-218 540-714 (726) 116 cd03136 GATase1_AraC_ArgR_like 97.5 0.0003 7.8E-09 48.0 6.5 92 3-101 1-117 (185) 117 cd03132 GATase1_catalase Type 97.1 0.0022 5.5E-08 42.6 7.2 91 2-98 3-114 (142) 118 cd03138 GATase1_AraC_2 AraC tr 97.1 0.00014 3.7E-09 50.1 1.1 61 38-101 66-126 (195) 119 TIGR01383 not_thiJ DJ-1 family 97.0 0.00071 1.8E-08 45.6 3.4 56 38-98 63-118 (186) 120 COG3442 Predicted glutamine am 96.9 0.0017 4.3E-08 43.3 4.6 147 26-197 34-198 (250) 121 COG4285 Uncharacterized conser 96.8 0.049 1.3E-06 33.9 12.5 175 1-201 1-208 (253) 122 KOG1559 consensus 96.8 0.0064 1.6E-07 39.5 7.0 189 3-206 55-277 (340) 123 TIGR01823 PabB-fungal para-ami 96.8 0.00055 1.4E-08 46.4 1.5 49 38-94 57-111 (851) 124 TIGR00379 cobB cobyrinic acid 96.5 0.01 2.7E-07 38.2 6.6 122 1-129 252-389 (464) 125 KOG3179 consensus 96.4 0.0058 1.5E-07 39.8 4.5 67 36-108 54-120 (245) 126 KOG0370 consensus 96.1 0.019 4.9E-07 36.5 6.0 71 16-94 185-255 (1435) 127 cd03141 GATase1_Hsp31_like Typ 95.5 0.013 3.4E-07 37.5 3.1 56 37-98 86-142 (221) 128 cd03133 GATase1_ES1 Type 1 glu 95.1 0.015 3.9E-07 37.1 2.5 58 37-94 78-140 (213) 129 PRK11780 isoprenoid biosynthes 94.7 0.025 6.4E-07 35.7 2.8 57 38-94 82-143 (217) 130 KOG2764 consensus 94.5 0.045 1.1E-06 34.1 3.7 82 40-129 66-157 (247) 131 COG4977 Transcriptional regula 94.4 0.04 1E-06 34.5 3.2 92 2-100 12-129 (328) 132 cd03144 GATase1_ScBLP_like Typ 94.4 0.11 2.8E-06 31.6 5.4 83 2-91 1-89 (114) 133 pfam04204 HTS Homoserine O-suc 94.4 0.17 4.4E-06 30.4 6.4 135 39-200 96-238 (298) 134 PRK05368 homoserine O-succinyl 94.2 0.17 4.3E-06 30.5 6.0 135 38-199 96-238 (302) 135 PRK05282 peptidase E; Validate 93.4 0.34 8.6E-06 28.5 6.4 87 2-93 33-127 (233) 136 cd03146 GAT1_Peptidase_E Type 93.3 0.24 6.2E-06 29.4 5.5 75 17-95 51-130 (212) 137 cd03145 GAT1_cyanophycinase Ty 92.6 0.49 1.3E-05 27.5 6.3 94 2-99 31-137 (217) 138 cd03147 GATase1_Ydr533c_like T 92.4 0.091 2.3E-06 32.2 2.3 53 35-95 88-143 (231) 139 COG1348 NifH Nitrogenase subun 90.4 0.44 1.1E-05 27.8 4.2 88 8-99 91-196 (278) 140 KOG0026 consensus 90.3 0.92 2.3E-05 25.8 5.7 58 28-95 44-108 (223) 141 cd03148 GATase1_EcHsp31_like T 89.8 0.37 9.5E-06 28.3 3.4 53 37-97 92-147 (232) 142 cd03129 GAT1_Peptidase_E_like 89.0 2.1 5.3E-05 23.5 6.7 89 2-95 31-130 (210) 143 PRK04155 chaperone protein Hch 88.0 0.48 1.2E-05 27.6 2.9 56 40-101 147-203 (288) 144 COG4242 CphB Cyanophycinase an 87.1 0.84 2.1E-05 26.0 3.7 73 18-94 73-155 (293) 145 PRK13235 nifH nitrogenase redu 84.8 0.77 2E-05 26.2 2.6 21 148-168 210-230 (274) 146 TIGR02069 cyanophycinase cyano 83.9 3.1 8E-05 22.4 5.4 53 38-94 99-157 (297) 147 PRK06108 aspartate aminotransf 83.8 0.69 1.8E-05 26.5 2.0 58 132-189 129-192 (382) 148 pfam08532 Glyco_hydro_42M Beta 83.4 2.4 6E-05 23.1 4.6 75 17-104 35-113 (207) 149 pfam03575 Peptidase_S51 Peptid 82.9 4.3 0.00011 21.5 5.9 76 16-96 4-86 (154) 150 PRK03708 ppnK inorganic polyph 80.5 5.1 0.00013 21.0 5.4 96 1-111 1-115 (278) 151 PRK05957 aspartate aminotransf 78.6 2.1 5.3E-05 23.5 2.9 57 132-191 134-196 (389) 152 PRK01911 ppnK inorganic polyph 78.3 6.1 0.00016 20.5 5.7 73 1-88 1-95 (290) 153 TIGR01287 nifH nitrogenase iro 78.2 1.5 3.8E-05 24.4 2.1 91 42-138 124-230 (278) 154 PRK13233 nifH nitrogenase redu 77.2 1.6 4E-05 24.3 2.0 18 150-167 213-230 (275) 155 PRK04885 ppnK inorganic polyph 77.1 6 0.00015 20.6 4.9 65 1-88 1-70 (265) 156 PRK07337 aminotransferase; Val 77.1 3.6 9.2E-05 21.9 3.8 157 18-191 22-199 (388) 157 PRK08960 hypothetical protein; 77.0 2 5E-05 23.7 2.4 64 126-191 132-201 (387) 158 PRK11249 katE hydroperoxidase 76.7 6.7 0.00017 20.2 7.3 90 2-97 572-682 (725) 159 TIGR02086 IPMI_arch 3-isopropy 75.5 1.2 3.1E-05 25.0 1.0 44 55-100 66-116 (431) 160 PRK08362 consensus 74.9 1.9 4.9E-05 23.7 1.9 59 133-192 133-197 (389) 161 PRK06575 consensus 74.7 1.6 4.1E-05 24.2 1.5 59 133-192 137-201 (399) 162 TIGR03537 DapC succinyldiamino 73.8 3.5 8.8E-05 22.1 3.0 11 167-177 142-152 (350) 163 pfam06283 ThuA Trehalose utili 73.6 8.1 0.00021 19.7 5.9 140 37-202 46-199 (216) 164 cd02033 BchX Chlorophyllide re 73.5 2.6 6.7E-05 22.8 2.3 55 34-88 147-216 (329) 165 PRK06107 aspartate aminotransf 72.5 3.2 8.1E-05 22.3 2.5 155 17-192 24-203 (402) 166 PRK06242 flavodoxin; Provision 71.8 8.9 0.00023 19.5 5.9 50 1-50 1-52 (150) 167 TIGR01424 gluta_reduc_2 glutat 71.4 3 7.7E-05 22.5 2.2 22 41-65 2-23 (478) 168 PRK08361 aspartate aminotransf 70.6 6.9 0.00018 20.2 3.9 59 131-190 136-200 (390) 169 PRK07310 consensus 70.5 4.9 0.00013 21.1 3.2 59 133-192 137-201 (395) 170 PRK07366 succinyldiaminopimela 69.7 3.3 8.4E-05 22.2 2.1 57 132-189 137-199 (388) 171 TIGR03540 DapC_direct LL-diami 69.4 9.7 0.00025 19.2 4.5 58 133-191 137-200 (383) 172 PRK06348 aspartate aminotransf 69.2 4.2 0.00011 21.5 2.6 58 133-191 135-198 (383) 173 PRK05794 consensus 68.3 4.3 0.00011 21.4 2.5 59 133-192 136-200 (397) 174 PRK09271 flavodoxin; Provision 68.3 9.8 0.00025 19.2 4.3 75 1-81 1-87 (160) 175 TIGR01701 Fdhalpha-like oxidor 67.4 5.5 0.00014 20.8 2.9 110 5-131 174-299 (824) 176 PRK02155 ppnK inorganic polyph 66.8 11 0.00029 18.8 6.7 73 2-88 7-96 (291) 177 COG3155 ElbB Uncharacterized p 66.7 3.6 9.1E-05 22.0 1.8 56 38-93 82-142 (217) 178 PRK08443 consensus 65.5 6.2 0.00016 20.5 2.8 58 133-191 135-198 (388) 179 PRK07682 hypothetical protein; 65.3 3 7.5E-05 22.5 1.2 58 133-191 127-190 (378) 180 PRK07367 consensus 63.9 5.3 0.00013 20.9 2.2 59 132-191 133-197 (385) 181 COG1168 MalY Bifunctional PLP- 63.8 9.1 0.00023 19.4 3.4 65 124-188 118-191 (388) 182 PRK07324 transaminase; Validat 62.8 6.7 0.00017 20.3 2.6 55 134-189 127-187 (373) 183 PRK07777 aminotransferase; Val 62.5 10 0.00025 19.1 3.4 47 133-179 131-177 (386) 184 PRK00561 ppnK inorganic polyph 62.4 14 0.00035 18.3 6.0 70 1-94 1-73 (259) 185 PRK06207 aspartate aminotransf 62.3 12 0.0003 18.7 3.8 57 132-188 147-212 (406) 186 PRK01231 ppnK inorganic polyph 60.8 15 0.00037 18.1 5.5 73 2-88 6-95 (296) 187 PRK09276 aspartate aminotransf 60.6 7.4 0.00019 20.0 2.5 58 132-190 138-201 (385) 188 PRK07681 aspartate aminotransf 60.6 6 0.00015 20.5 2.0 55 133-188 139-199 (399) 189 PRK13234 nifH nitrogenase redu 60.6 12 0.0003 18.7 3.5 16 152-167 216-231 (293) 190 PRK07550 hypothetical protein; 60.1 8.3 0.00021 19.6 2.7 59 133-192 137-201 (387) 191 PRK03957 V-type ATP synthase s 59.8 6.6 0.00017 20.3 2.1 25 1-32 1-26 (100) 192 PRK06836 aspartate aminotransf 59.7 14 0.00035 18.2 3.8 53 132-186 142-200 (396) 193 CHL00072 chlL photochlorophyll 59.3 9.7 0.00025 19.2 2.9 14 154-167 209-222 (271) 194 cd05297 GH4_alpha_glucosidase_ 58.7 3.6 9.3E-05 21.9 0.7 16 79-94 160-175 (423) 195 PRK09265 aminotransferase AlaT 58.2 7.1 0.00018 20.1 2.1 37 143-179 150-186 (404) 196 COG4090 Uncharacterized protei 58.0 9.2 0.00023 19.4 2.6 51 37-98 81-134 (154) 197 KOG3093 consensus 57.5 4.5 0.00012 21.3 1.0 52 42-94 129-183 (200) 198 TIGR03375 type_I_sec_LssB type 57.4 16 0.00041 17.8 3.8 21 108-128 462-482 (694) 199 PRK06507 consensus 57.3 6.3 0.00016 20.4 1.7 60 132-192 136-201 (400) 200 PRK05764 aspartate aminotransf 57.3 14 0.00035 18.2 3.4 157 18-191 23-200 (389) 201 PRK07778 consensus 57.0 12 0.0003 18.7 3.0 54 133-188 136-195 (386) 202 pfam01513 NAD_kinase ATP-NAD k 55.7 18 0.00045 17.5 5.2 68 2-88 1-68 (243) 203 COG1897 MetA Homoserine trans- 55.6 9.2 0.00023 19.4 2.3 137 38-201 96-240 (307) 204 PRK03372 ppnK inorganic polyph 55.6 18 0.00046 17.5 7.1 73 2-88 6-102 (303) 205 PRK04539 ppnK inorganic polyph 55.5 18 0.00046 17.5 6.0 73 2-88 7-101 (296) 206 PRK05942 aspartate aminotransf 55.1 9.1 0.00023 19.4 2.2 55 132-187 142-202 (394) 207 PRK06220 consensus 54.8 18 0.00046 17.5 3.7 46 133-179 137-182 (384) 208 PRK02649 ppnK inorganic polyph 54.8 18 0.00047 17.4 5.3 73 1-88 2-101 (305) 209 PRK08068 transaminase; Reviewe 54.5 10 0.00025 19.1 2.3 55 133-188 140-200 (389) 210 PRK13185 chlL protochlorophyll 54.1 16 0.00042 17.7 3.4 19 150-168 207-225 (269) 211 PRK07683 aminotransferase A; V 52.7 9.1 0.00023 19.4 1.9 55 133-189 135-195 (387) 212 pfam01812 5-FTHF_cyc-lig 5-for 52.1 14 0.00035 18.3 2.7 50 41-94 115-166 (182) 213 PRK10680 molybdopterin biosynt 51.8 20 0.00052 17.1 6.0 81 12-92 203-295 (411) 214 COG1486 CelF Alpha-galactosida 50.9 5.5 0.00014 20.8 0.5 23 69-92 153-175 (442) 215 pfam09897 DUF2124 Uncharacteri 50.1 16 0.0004 17.9 2.8 52 38-99 77-130 (147) 216 cd05296 GH4_P_beta_glucosidase 48.2 6.8 0.00017 20.2 0.7 23 70-94 152-174 (419) 217 PRK09147 aminotransferase; Pro 48.2 9.4 0.00024 19.3 1.4 58 133-191 139-202 (397) 218 PRK06290 aspartate aminotransf 47.4 24 0.00061 16.7 3.7 56 132-188 151-212 (411) 219 PRK06056 consensus 46.2 18 0.00045 17.5 2.5 56 133-189 141-202 (402) 220 PRK13232 nifH nitrogenase redu 46.1 17 0.00044 17.6 2.5 21 148-168 208-228 (273) 221 pfam09508 Lact_bio_phlase Lact 46.0 25 0.00064 16.6 6.5 95 1-97 436-560 (716) 222 cd03131 GATase1_HTS Type 1 glu 45.7 9.5 0.00024 19.3 1.1 98 38-154 59-164 (175) 223 PRK05395 3-dehydroquinate dehy 45.4 26 0.00066 16.5 4.2 54 1-54 2-81 (143) 224 KOG0259 consensus 45.4 21 0.00053 17.1 2.8 36 144-179 182-217 (447) 225 PRK07543 consensus 45.1 26 0.00066 16.5 3.6 60 132-192 136-201 (400) 226 PRK10333 putative ligase; Prov 44.9 16 0.00041 17.8 2.1 12 83-94 165-176 (198) 227 cd07023 S49_Sppa_N_C Signal pe 44.0 27 0.00069 16.4 4.6 66 2-89 1-76 (208) 228 PRK08363 alanine aminotransfer 44.0 9.5 0.00024 19.3 0.9 33 147-179 152-184 (398) 229 PRK13236 nitrogenase reductase 43.9 12 0.0003 18.7 1.3 33 66-98 165-199 (295) 230 PRK08069 consensus 43.9 19 0.00047 17.4 2.3 152 17-191 21-199 (390) 231 PRK01395 V-type ATP synthase s 43.0 20 0.0005 17.2 2.4 28 2-36 5-32 (104) 232 PRK03767 TrpR binding protein 42.8 28 0.00072 16.3 4.5 59 1-65 2-87 (200) 233 cd03143 A4_beta-galactosidase_ 42.7 28 0.00072 16.2 4.8 54 18-84 32-85 (154) 234 pfam00155 Aminotran_1_2 Aminot 42.6 28 0.00072 16.2 3.2 45 134-179 110-154 (351) 235 PRK09054 phosphogluconate dehy 42.4 14 0.00036 18.2 1.5 31 70-100 205-243 (603) 236 PRK13402 gamma-glutamyl kinase 42.0 29 0.00074 16.2 5.0 110 57-192 206-322 (363) 237 PRK08912 hypothetical protein; 40.5 16 0.00041 17.8 1.6 31 147-177 145-175 (387) 238 COG1591 Holliday junction reso 39.8 31 0.0008 16.0 4.8 72 9-85 5-86 (137) 239 cd05298 GH4_GlvA_pagL_like Gly 39.1 21 0.00054 17.0 2.0 17 78-94 158-174 (437) 240 COG3340 PepE Peptidase E [Amin 38.5 33 0.00084 15.8 6.3 89 2-95 34-134 (224) 241 PRK10446 ribosomal protein S6 37.6 34 0.00086 15.7 7.2 85 1-97 1-112 (300) 242 KOG4716 consensus 37.3 16 0.00041 17.8 1.2 35 39-76 17-51 (503) 243 PRK04761 ppnK inorganic polyph 37.1 32 0.00083 15.9 2.7 41 34-88 18-58 (246) 244 PRK06375 consensus 37.0 31 0.00079 16.0 2.6 155 15-189 18-193 (381) 245 pfam01220 DHquinase_II Dehydro 36.9 35 0.00088 15.7 4.4 53 1-53 1-79 (140) 246 PRK02645 ppnK inorganic polyph 35.9 36 0.00092 15.6 5.5 79 2-94 5-98 (304) 247 PRK08175 aminotransferase; Val 33.8 32 0.00083 15.9 2.3 55 132-188 136-197 (395) 248 PRK03378 ppnK inorganic polyph 33.6 39 0.001 15.3 5.0 72 2-88 7-96 (292) 249 TIGR02922 TIGR02922 conserved 33.0 19 0.00047 17.4 0.9 22 75-96 42-64 (69) 250 TIGR03538 DapC_gpp succinyldia 32.9 39 0.001 15.3 2.6 57 133-190 139-201 (394) 251 PRK01185 ppnK inorganic polyph 32.5 41 0.001 15.2 6.8 71 1-88 1-82 (272) 252 PRK06358 threonine-phosphate d 32.2 41 0.0011 15.2 3.4 10 169-178 149-158 (354) 253 TIGR01351 adk adenylate kinase 32.2 15 0.00039 18.0 0.4 83 52-142 29-124 (232) 254 TIGR01438 TGR thioredoxin and 31.7 42 0.0011 15.1 2.6 17 153-169 123-139 (513) 255 TIGR02817 adh_fam_1 zinc-bindi 31.7 6.1 0.00016 20.5 -1.7 52 4-74 92-143 (338) 256 PRK12448 dihydroxy-acid dehydr 31.5 36 0.00092 15.6 2.2 31 70-100 178-216 (615) 257 KOG4649 consensus 31.1 41 0.001 15.2 2.4 37 163-199 261-303 (354) 258 TIGR02692 tRNA_CCA_actino tRNA 30.8 19 0.00047 17.4 0.6 76 43-118 155-263 (472) 259 cd07565 aliphatic_amidase alip 29.8 34 0.00088 15.7 1.9 75 1-80 1-83 (291) 260 PRK09148 aminotransferase; Val 28.8 29 0.00074 16.2 1.3 57 133-190 139-201 (406) 261 TIGR02280 PaaB1 phenylacetate 28.7 47 0.0012 14.8 2.6 61 40-100 42-140 (259) 262 cd03252 ABCC_Hemolysin The ABC 28.0 49 0.0012 14.7 4.3 12 187-198 211-222 (237) 263 pfam03486 HI0933_like HI0933-l 28.0 33 0.00085 15.8 1.5 18 7-24 100-117 (405) 264 PRK02259 aspartoacylase; Provi 27.9 5.8 0.00015 20.7 -2.4 42 162-217 174-219 (263) 265 cd07576 R-amidase_like Pseudom 27.2 50 0.0013 14.7 11.6 184 2-199 1-227 (254) 266 COG3044 Predicted ATPase of th 26.9 51 0.0013 14.6 2.4 55 145-199 172-243 (554) 267 PRK12359 flavodoxin FldB; Prov 26.8 51 0.0013 14.6 4.3 79 1-84 1-86 (169) 268 pfam04170 NlpE Uncharacterized 26.6 37 0.00095 15.5 1.6 61 101-162 16-81 (86) 269 pfam08877 MepB MepB protein. M 26.1 52 0.0013 14.5 3.9 65 86-156 14-80 (123) 270 KOG3967 consensus 25.9 52 0.0013 14.6 2.2 40 31-73 89-129 (297) 271 TIGR02214 spoVD_pbp stage V sp 25.9 49 0.0013 14.7 2.1 117 68-207 148-279 (660) 272 cd06909 M14_ASPA Aspartoacylas 25.6 9.4 0.00024 19.3 -1.7 30 187-217 218-247 (282) 273 PRK07212 consensus 25.4 54 0.0014 14.4 3.9 57 132-191 136-198 (378) 274 TIGR01531 glyc_debranch glycog 25.0 37 0.00093 15.5 1.3 119 94-213 1175-1327(1584) 275 TIGR00705 SppA_67K signal pept 24.8 55 0.0014 14.4 2.3 20 64-83 376-396 (614) 276 TIGR01959 nuoF_fam NADH-quinon 24.8 38 0.00096 15.4 1.3 35 125-163 154-191 (420) 277 PRK00402 3-isopropylmalate deh 24.6 23 0.00059 16.8 0.2 127 68-202 78-236 (419) 278 COG1436 NtpG Archaeal/vacuolar 24.6 37 0.00094 15.5 1.2 29 1-36 3-31 (104) 279 PRK13231 nitrogenase reductase 24.3 57 0.0014 14.3 2.2 14 85-98 126-139 (264) 280 COG2360 Aat Leu/Phe-tRNA-prote 23.6 59 0.0015 14.2 2.9 15 40-54 51-65 (221) 281 KOG0257 consensus 22.8 38 0.00098 15.4 1.1 11 168-178 179-189 (420) 282 TIGR01135 glmS glucosamine--fr 22.8 61 0.0015 14.1 5.2 58 19-87 325-389 (628) 283 PRK11423 methylmalonyl-CoA dec 22.6 61 0.0016 14.1 3.7 80 40-119 46-138 (261) 284 pfam09558 DUF2375 Protein of u 22.6 40 0.001 15.3 1.1 22 75-96 42-64 (71) 285 PRK07392 threonine-phosphate d 22.5 61 0.0016 14.1 3.4 16 43-59 98-113 (358) 286 PRK04020 rps2P 30S ribosomal p 22.4 49 0.0013 14.7 1.5 10 95-104 97-106 (204) 287 TIGR02707 butyr_kinase butyrat 22.0 41 0.001 15.2 1.0 18 41-58 294-312 (353) 288 PRK07854 enoyl-CoA hydratase; 21.3 65 0.0017 13.9 3.2 80 3-91 10-98 (243) 289 COG5246 PRP11 Splicing factor 21.2 55 0.0014 14.4 1.6 38 101-140 45-82 (222) 290 COG2054 Uncharacterized archae 21.2 65 0.0017 13.9 2.6 17 43-59 28-44 (212) 291 COG2910 Putative NADH-flavin r 21.2 65 0.0017 13.9 6.0 92 1-99 1-124 (211) 292 COG1197 Mfd Transcription-repa 21.2 41 0.001 15.2 0.9 18 130-148 471-488 (1139) 293 KOG3212 consensus 20.6 67 0.0017 13.8 3.7 38 16-54 73-110 (208) 294 PRK09559 putative global regul 20.6 67 0.0017 13.8 2.3 18 180-197 280-299 (327) 295 PTZ00254 40S ribosomal protein 20.4 56 0.0014 14.3 1.5 10 95-104 101-110 (242) 296 TIGR00088 trmD tRNA (guanine-N 20.4 48 0.0012 14.8 1.1 50 17-75 22-74 (247) 297 KOG1739 consensus 20.2 49 0.0013 14.7 1.2 19 180-198 213-231 (611) No 1 >TIGR01737 FGAM_synth_I phosphoribosylformylglycinamidine synthase I; InterPro: IPR010075 In some species, phosphoribosylformylglycinamidine synthase is composed of a single polypeptide chain. This entry describes the PurQ protein of Bacillus subtilis (where PurL, PurQ, and PurS are required for phosphoribosylformylglycinamidine synthase activity) and functionally equivalent proteins from other bacteria and archaea.; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=100.00 E-value=0 Score=681.52 Aligned_cols=219 Identities=53% Similarity=0.955 Sum_probs=214.9 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEE----ECCCCC-CCC------CCEEEECCCCCCCCCCCHHHHHHH---H Q ss_conf 96899844970647999999998549963998----215464-456------478997287345431140677520---0 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILV----WQSDTD-IPD------VDLIVIPGGFSYGDYLRCGAIAAR---T 66 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v----~~~~~~-l~~------~d~lvipGGFSygD~l~aG~i~~~---~ 66 (219) ||||||||||||||+||.+||.+.+|.++.+| ||+|.. +++ ||+||||||||||||||||+|||. + T Consensus 1 mkvAV~~FPGtNCd~D~~~A~~k~~Gv~~~~Vhikqw~~D~~r~~~~rsvfD~D~VvlPGGFSyGDYLRaGAIAa~rvk~ 80 (264) T TIGR01737 1 MKVAVIRFPGTNCDRDTVYALEKLLGVDAEIVHIKQWYEDKSRLSDQRSVFDYDAVVLPGGFSYGDYLRAGAIAALRVKS 80 (264) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEEEEECHHHHCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHHCCH T ss_conf 96899977999840899999997149983799865210000478766417873348866877723577799999875030 Q ss_pred HHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHH-------------HHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCC Q ss_conf 2121223322059717860640310100000101-------------101244332454225767525235776413799 Q gi|254780971|r 67 PVMQAIKKKAQQGIKVMGICNGFQILVELNLLPG-------------ILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMN 133 (219) Q Consensus 67 ~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg-------------~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~ 133 (219) |++++|++++++|+||||||||||||+|+||||| ||++|.|.||+|||++|+|++++|+||+.|++| T Consensus 81 p~~~~V~~~A~~G~pVLGiCNGFQIL~E~GLLpGfDeDkPlaekPemAL~~N~~~rFisr~v~LrV~n~~T~FT~~y~kG 160 (264) T TIGR01737 81 PIMQEVREFAEKGKPVLGICNGFQILVEAGLLPGFDEDKPLAEKPEMALLPNASLRFISRWVYLRVENADTAFTKKYKKG 160 (264) T ss_pred HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHCCCCCCCCCCCCCCHHHHHCCCCCCCEEEEEEEEEEECCCHHHHCCCCC T ss_conf 88999999995598289976647999997624898878874357122442058897453203689985678366357988 Q ss_pred CEEEEEEECCCEEEEEC----HHHHHHHHHCCEEEEECCCC-------CCCCCCCCCEEEEECCCCCEEEECCCCHHHCC Q ss_conf 68998620133023308----67741154225036531688-------88898811266888699989998488434322 Q gi|254780971|r 134 QIIKCPVAHHDGNYFID----AKGLAEIEKNNQIVFRYASG-------TNPNGSLHDIAGVINRRGNVLGMMPHPENIIE 202 (219) Q Consensus 134 ~~l~~piaHgEGrf~~~----~~~l~~l~~~~~i~~~Y~d~-------~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~ 202 (219) +||.+||||||||||++ ++.|++|++|+||+|||||+ +|||||..|||||||++||||||||||||+.+ T Consensus 161 ev~~~PiAHgEGrYy~~sGke~e~~~~l~~n~qivFrY~d~~G~~~e~~NPNGS~~nIAGi~N~~gnV~GmMPHPERA~~ 240 (264) T TIGR01737 161 EVIRVPIAHGEGRYYADSGKEDETLAELESNDQIVFRYCDEDGDVSEEVNPNGSVDNIAGIVNEEGNVLGMMPHPERASE 240 (264) T ss_pred CEEEEEEEECCCCEECCCCCCHHHHHHHHHCCEEEEEEECCCCCEECCCCCCCCCCCEEEEECCCCCEEEECCCHHHHHH T ss_conf 78876544258726626899757999985479189997679985324547897646710578089877871898078764 Q ss_pred CCCCC-----CCHHHHHHHHCC Q ss_conf 10067-----762899986429 Q gi|254780971|r 203 KFHGG-----IDGRGLFASLLT 219 (219) Q Consensus 203 ~~~~~-----~dG~~~f~~~~s 219 (219) +++++ .||++||+++++ T Consensus 241 ~~lg~r~eG~~DG~~Lf~~l~~ 262 (264) T TIGR01737 241 KLLGSRREGADDGLKLFESLVE 262 (264) T ss_pred HHHCCCCCCCHHHHHHHHHHHH T ss_conf 5406887563779999999997 No 2 >PRK03619 phosphoribosylformylglycinamidine synthase subunit I; Provisional Probab=100.00 E-value=0 Score=621.49 Aligned_cols=217 Identities=56% Similarity=0.997 Sum_probs=213.5 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 96899844970647999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) ||||||||||||||+||++||+ .+|+++.+||+++..|++||++|||||||||||||||++|+++++++++++|+++|+ T Consensus 1 mkvaIl~~pGsNcd~e~~~Af~-~~G~~~~~v~~~d~~L~~~~~lv~pGGFSyGD~L~aGaia~~~~i~~~i~~f~~~g~ 79 (223) T PRK03619 1 MKIAVITFPGTNCDVDMAHAIR-LAGAEPVYVWHTDTDLDGVDAVVLPGGFSYGDYLRCGAIAAFAPVMQAVVEAAEKGK 79 (223) T ss_pred CEEEEEECCCCCCHHHHHHHHH-HCCCCEEEEEECCCCCCCCCEEEECCCCCCCCCCCCCHHHHHCHHHHHHHHHHHCCC T ss_conf 9699997176494999999999-869956999844798764538998350056534553347652789999999986799 Q ss_pred EEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHC Q ss_conf 17860640310100000101101244332454225767525235776413799689986201330233086774115422 Q gi|254780971|r 81 KVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKN 160 (219) Q Consensus 81 ~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~ 160 (219) ++||||||||+|+|+|||||+|++|+|+||+|||+.++|++++|+||+++++++++++|||||||||+++++.|++|+++ T Consensus 80 ~vLGICNGfQiL~e~gLlPgaL~~N~s~rF~~r~v~lkv~~~~s~f~~~~~~g~~l~ipVAHgEGr~~~~~~~l~~l~~~ 159 (223) T PRK03619 80 PVLGICNGFQILLEAGLLPGALTRNENLKFICRDVKLRVENNDTAFTSRYEKGEVIRIPIAHGEGNYYADEETLKELEGN 159 (223) T ss_pred CEEEECHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEEECCCCCCHHCCCCCCCEEEEEEECCCCCEECCHHHHHHHHHC T ss_conf 77998727898986477886210269998798656788658984010378999889887244875335599999999978 Q ss_pred CEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 5036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 161 NQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 161 ~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) +||++||||+.|||||.++||||||++||||||||||||++++++++.||++||+|++ T Consensus 160 ~qi~~~Y~d~~NPNGS~~~IAGI~~~~GrvlgmMPHPERa~~~~~g~~dG~~~f~s~v 217 (223) T PRK03619 160 GQVVFRYVDETNPNGSVNDIAGICNENGNVLGLMPHPERAVEALLGSDDGLKLFKSAL 217 (223) T ss_pred CCEEEEECCCCCCCCCHHHHHEEECCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 9789998999999889646528587999998888996446355447861499999999 No 3 >COG0047 PurL Phosphoribosylformylglycinamidine (FGAM) synthase, glutamine amidotransferase domain [Nucleotide transport and metabolism] Probab=100.00 E-value=0 Score=597.38 Aligned_cols=218 Identities=53% Similarity=0.870 Sum_probs=212.7 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCC-CCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9689984497064799999999854996399821546445-647899728734543114067752002121223322059 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIP-DVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~-~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) ||||||||||||||+||++||+ .+|+++.+||++|..+. +||++|+|||||||||||+|+|++.+++++++++|+++| T Consensus 3 ~kvaVi~fpGtN~d~d~~~A~~-~aG~~~~~V~~~d~~~~~~~d~vv~pGGFSyGDyLr~Gaiaa~~~v~~~v~~~a~~g 81 (231) T COG0047 3 PKVAVLRFPGTNCDYDMAAAFE-RAGFEAEDVWHSDLLLGRDFDGVVLPGGFSYGDYLRAGAIAAIAPVMDEVREFAEKG 81 (231) T ss_pred CEEEEEEECCCCCHHHHHHHHH-HCCCCCEEEEEEECCCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 5699997078684599999999-849982499855035677746899748877643357326775578999999998689 Q ss_pred CEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHH Q ss_conf 71786064031010000010110124433245422576752523577641379968998620133023308677411542 Q gi|254780971|r 80 IKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEK 159 (219) Q Consensus 80 ~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~ 159 (219) +|+||||||||+|+|.|||||+|++|+|.||+|||+.++|++++|+||+++++++++++|||||||||++++++|++|++ T Consensus 82 ~~vLGICNGfQiL~e~gLlPGal~~N~s~~F~cr~v~l~V~~~~t~ft~~~~~g~~i~ipVAHgEGr~~~~~~~l~~l~~ 161 (231) T COG0047 82 KPVLGICNGFQILSEAGLLPGALTRNESLRFECRWVYLRVENNNTPFTSGYEGGEVIPIPVAHGEGRYYADDETLAELEE 161 (231) T ss_pred CEEEEECCHHHHHHHCCCCCCCEECCCCCCEEEEEEEEEEECCCCHHHHHCCCCCEEEEEEEECCEEEECCHHHHHHHHH T ss_conf 71999853508989727577610247898458898999993589877873578855987777056247716788998754 Q ss_pred CCEEEEECCCC-------CCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 25036531688-------8889881126688869998999848843432210067762899986429 Q gi|254780971|r 160 NNQIVFRYASG-------TNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 160 ~~~i~~~Y~d~-------~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) |+||+|||+|. +|||||.++||||||++|||+||||||||++++|+++.||+++|+|++. T Consensus 162 ngqvvfrY~d~~G~~~~~~NPNGS~~~IaGI~n~~G~V~gmMPHPERa~~~~~g~~Dg~~lF~s~~~ 228 (231) T COG0047 162 NGQVVFRYVDNNGETEEYANPNGSVNGIAGITNEDGNVLGMMPHPERASESLLGGEDGLRLFRSARK 228 (231) T ss_pred CCEEEEEEECCCCCEEEEECCCCCHHHCEEEECCCCCEEEECCCCHHHHHCCCCCCHHHHHHHHHHH T ss_conf 8979999936998540334798996662057769997898648944433202687138999999998 No 4 >PRK01175 phosphoribosylformylglycinamidine synthase subunit I; Provisional Probab=100.00 E-value=0 Score=572.98 Aligned_cols=215 Identities=38% Similarity=0.718 Sum_probs=198.9 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC-----CCCCCCCEEEECCCCCCCCCCCHHHHHH---HHHHHHHHH Q ss_conf 68998449706479999999985499639982154-----6445647899728734543114067752---002121223 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD-----TDIPDVDLIVIPGGFSYGDYLRCGAIAA---RTPVMQAIK 73 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~-----~~l~~~d~lvipGGFSygD~l~aG~i~~---~~~~~~~i~ 73 (219) |||||||||||||+||++||++ +|+++.+||++| ..|+++|+||||||||||||||||++|+ ++.+.++++ T Consensus 5 kvaIl~fpGsNcd~e~~~Af~~-aG~~~~~v~~~dl~~~~~~L~~~~~lv~pGGFSyGD~l~aG~~~a~~~~~~~~~~i~ 83 (255) T PRK01175 5 KVGVLRMEGTNNETEVFKSVRR-SGGEPEYVHINDLAAGRKSVSDYDCLIIPGGFSAGDYIRAGAIFAARLKAVLGKDIR 83 (255) T ss_pred EEEEEECCCCCHHHHHHHHHHH-CCCCEEEEEEEECCCCCCCCCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 6999965864819999999998-599638998652336766655554899737767643357518899986799999999 Q ss_pred HHHHCCCEEEEECCCHHHHEECCCHHH-----HHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEE Q ss_conf 322059717860640310100000101-----101244332454225767525235776413799689986201330233 Q gi|254780971|r 74 KKAQQGIKVMGICNGFQILVELNLLPG-----ILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYF 148 (219) Q Consensus 74 ~~~~~g~~vLGICNGfQiL~elGLlPg-----~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~ 148 (219) +|+++|++|||||||||+|+|+||||| +|++|+|+||+|||+.++|++++|+||++++ +.++.+|||||||||+ T Consensus 84 ~f~~~g~~vLGICNGfQiL~e~gLLPg~~~~~~L~~N~s~rF~~r~v~l~v~~~~~~f~~~~~-~~~l~ipvAHgEGrf~ 162 (255) T PRK01175 84 EFIDSGRPIIGICNGFQVLMELGLIPAERPEIALTNNESNRFECRYTYIKMTSEKCIFVKNLG-KGVFQVPVAHAEGRVA 162 (255) T ss_pred HHHHCCCCEEEECCHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCHHHHHCCC-CCEEEEEEECCCCCEE T ss_conf 999679967998016899997587789998802024799987867879999448346664148-9779987553768557 Q ss_pred EC-HHHHHHHHHCCEEEEECCCC--------CCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCC--------CCCHH Q ss_conf 08-67741154225036531688--------888988112668886999899984884343221006--------77628 Q gi|254780971|r 149 ID-AKGLAEIEKNNQIVFRYASG--------TNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHG--------GIDGR 211 (219) Q Consensus 149 ~~-~~~l~~l~~~~~i~~~Y~d~--------~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~--------~~dG~ 211 (219) ++ ++.|++|++++||++||||+ .|||||.++||||||++||||||||||||++.+|+. ..||+ T Consensus 163 ~~~~~~l~~l~~n~qi~~rY~d~~G~~~~yp~NPNGS~~~IAGI~n~~GrvlgmMPHPER~~~~~q~~~~~~~~~~g~g~ 242 (255) T PRK01175 163 VSEDSVLKRLYENDQIVFRYVDENGVYDGYPWNPNGSIDNIAGITNETGNVIGLMPHPERIYYGYQAMYNERRKDYGTGK 242 (255) T ss_pred ECCHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCHHHHCEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCHH T ss_conf 58989999998589658999669986457888998797672086789999988789963256465588786667778699 Q ss_pred HHHHHHC Q ss_conf 9998642 Q gi|254780971|r 212 GLFASLL 218 (219) Q Consensus 212 ~~f~~~~ 218 (219) +||+|++ T Consensus 243 ~~F~nav 249 (255) T PRK01175 243 IFFRSLV 249 (255) T ss_pred HHHHHHH T ss_conf 9999999 No 5 >cd01740 GATase1_FGAR_AT Type 1 glutamine amidotransferase (GATase1)-like domain found in Formylglycinamide ribonucleotide amidotransferase. Type 1 glutamine amidotransferase (GATase1)-like domain found in Formylglycinamide ribonucleotide amidotransferase (FGAR-AT). FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, Pi, and glutamate in the fourth step of the purine biosynthetic pathway. FGAR-AT is a glutamine amidotransferase. Glutamine amidotransferase activity catalyses the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. FGAR-AT belongs to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site Probab=100.00 E-value=0 Score=542.81 Aligned_cols=214 Identities=51% Similarity=0.907 Sum_probs=199.8 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCC-----CCCCCCEEEECCCCCCCCCCCHHHHHHHHH-HHHHHHHHH Q ss_conf 89984497064799999999854996399821546-----445647899728734543114067752002-121223322 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDT-----DIPDVDLIVIPGGFSYGDYLRCGAIAARTP-VMQAIKKKA 76 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~-----~l~~~d~lvipGGFSygD~l~aG~i~~~~~-~~~~i~~~~ 76 (219) ||||||||||||+||++||++ +|+++.+||++|. .|++||+|+||||||||||||||++|+.++ +++++++|+ T Consensus 1 VaVl~~pGsNcd~e~~~Af~~-aG~~~~~V~~~dl~~~~~~L~~~~~lv~pGGFSyGD~l~ag~~~a~s~~~~~~l~~f~ 79 (238) T cd01740 1 VAVLRFPGSNCDRDMAYAFEL-AGFEAEDVWHNDLLAGRKDLDDYDGVVLPGGFSYGDYLRAGAIAAASPLLMEEVKEFA 79 (238) T ss_pred CEEEECCCCCCHHHHHHHHHH-CCCCEEEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHCCCHHHHHHHHH T ss_conf 989991884988999999998-6992699981016566466110758998686677666555678874811308999975 Q ss_pred HCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEE----EEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHH Q ss_conf 05971786064031010000010110124433245422----57675252357764137996899862013302330867 Q gi|254780971|r 77 QQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQ----VLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAK 152 (219) Q Consensus 77 ~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~----~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~ 152 (219) ++|+++||||||||+|+|+||||+++.+|.+.+|.|+| +.++|++++|+||+++++++++++|||||||||+++++ T Consensus 80 ~~g~~vLGICNGfQiL~~lGllpg~~~~n~~~~~~~~~~~r~v~l~v~~~~s~f~~~~~~g~~l~ipvaHgEGr~~~~~~ 159 (238) T cd01740 80 ERGGLVLGICNGFQILVELGLLPGALIRNKGLKFICRWQNRFVTLRVENNDSPFTKGYMEGEVLRIPVAHGEGRFYADDE 159 (238) T ss_pred CCCCEEEEECHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCHHHHCCCCCCEEEEEEECCCEEEECCHH T ss_conf 48974898530567888706778500148776546756005799997899986771657898888887537235971828 Q ss_pred HHHHHHHCCEEEEECCC---------CCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCC-----CCCCHHHHHHHHC Q ss_conf 74115422503653168---------888898811266888699989998488434322100-----6776289998642 Q gi|254780971|r 153 GLAEIEKNNQIVFRYAS---------GTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFH-----GGIDGRGLFASLL 218 (219) Q Consensus 153 ~l~~l~~~~~i~~~Y~d---------~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~-----~~~dG~~~f~~~~ 218 (219) .++++.+++|++ +|+| |.|||||.++||||||++||||||||||||++++|+ +..||++||+|++ T Consensus 160 ~l~~l~~~~~i~-~~~~~~~~~~~~yP~NPNGS~~~IAGI~s~~GrvlgmMPHPER~~~~~q~~~~~~~~~g~~~F~nav 238 (238) T cd01740 160 TLAELEENGQIA-QYVDDDGNVTERYPANPNGSLDGIAGICNEDGRVLGMMPHPERAVEPWQWERLLGGSDGLKLFRNAV 238 (238) T ss_pred HHHHHHHCCCEE-EEEECCCCCCCCCCCCCCCCHHCEEEEECCCCCEEEECCCHHHHCCCCCCCCCCCCCCHHHHHHHCC T ss_conf 789998679569-9995799802268889988820212788799999997478577306555656788981899997539 No 6 >TIGR01735 FGAM_synt phosphoribosylformylglycinamidine synthase; InterPro: IPR010073 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis. This form is found mostly in eukaryotes and proteobacteria. In Bacillus subtilis PurL (FGAM synthase II) and PurQ (FGAM synthase I) perform the equivalent function; the unrelated small protein PurS is also required and may be a third subunit .; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process. Probab=100.00 E-value=0 Score=520.04 Aligned_cols=214 Identities=33% Similarity=0.493 Sum_probs=194.8 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC-----CCCCCCCEEEECCCCCCCCCCCHHHHHHHHH-----HHHH Q ss_conf 68998449706479999999985499639982154-----6445647899728734543114067752002-----1212 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD-----TDIPDVDLIVIPGGFSYGDYLRCGAIAARTP-----VMQA 71 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~-----~~l~~~d~lvipGGFSygD~l~aG~i~~~~~-----~~~~ 71 (219) ||||||++|+|+|+||+.||++ +||++++||++| ..|++|.+||++|||||||+|+||+.||+|. +.++ T Consensus 1129 kvAILREqG~NG~~EMAaAF~r-AGF~~~DVhmsDL~~Grv~Ld~F~glvacGGFSYgDVLGag~GWA~SIlF~~~~~~q 1207 (1401) T TIGR01735 1129 KVAILREQGVNGDIEMAAAFDR-AGFEAIDVHMSDLLAGRVHLDEFRGLVACGGFSYGDVLGAGKGWAKSILFNPRLRDQ 1207 (1401) T ss_pred CEEEEEECCCCCCHHHHHHHHH-CCCCEEEEEEEEHCCCCCCHHHCCEEEEECCCCHHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 1788850686700689999974-589578523100100566522455589971753021301124467766405778999 Q ss_pred HHHHH-HCCCEEEEECCCHHHHEEC-CCHHHH-------------------HCCCC-CCCEEEEEEEEEECCCHHHHHHH Q ss_conf 23322-0597178606403101000-001011-------------------01244-33245422576752523577641 Q gi|254780971|r 72 IKKKA-QQGIKVMGICNGFQILVEL-NLLPGI-------------------LMRNC-SLKFVCKQVLLEVVNSNTAFTKS 129 (219) Q Consensus 72 i~~~~-~~g~~vLGICNGfQiL~el-GLlPg~-------------------l~~N~-s~rf~~r~~~~~v~~~~s~~~~~ 129 (219) +.+|. +.+++.||||||||||++| +||||+ |.+|+ |+||||||..++|.++.|+|+++ T Consensus 1208 F~~Ff~rpdtf~LGVCNGCQm~snL~~~iPG~~~wp~~e~GPD~~~~RPGlFvrn~LSerFEaR~a~V~v~~s~Si~L~G 1287 (1401) T TIGR01735 1208 FAAFFKRPDTFSLGVCNGCQMLSNLLEIIPGTENWPHLEMGPDSSQARPGLFVRNRLSERFEARVALVEVKESPSIMLKG 1287 (1401) T ss_pred HHHHCCCCCCEEECCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCCCEEEEEEEEEECCCCEEEECC T ss_conf 99860588842303210789999764347878888841227888888875133202352013212654655665001137 Q ss_pred CCCCCEEEEEEECCCEEEEE-CHHHHHHHHHCCEEEEECCC---------CCCCCCCCCCEEEEECCCCCEEEECCCCHH Q ss_conf 37996899862013302330-86774115422503653168---------888898811266888699989998488434 Q gi|254780971|r 130 YKMNQIIKCPVAHHDGNYFI-DAKGLAEIEKNNQIVFRYAS---------GTNPNGSLHDIAGVINRRGNVLGMMPHPEN 199 (219) Q Consensus 130 ~~~~~~l~~piaHgEGrf~~-~~~~l~~l~~~~~i~~~Y~d---------~~NPNGS~~~IAgi~s~~G~vlgmMPHPER 199 (219) |+ |+.|++|||||||+-.+ +++.+..++.++.|++||+| |.|||||+.+||||||.||||+.||||||| T Consensus 1288 M~-GS~LPv~~aHGEG~a~F~~~~~~~~~~~~gl~aLry~D~~G~~Te~YPlNPNGSP~GIagits~DGR~~imMPHPER 1366 (1401) T TIGR01735 1288 MA-GSRLPVAVAHGEGRAEFESEELLAEADASGLVALRYVDDDGNVTEAYPLNPNGSPEGIAGITSADGRVTIMMPHPER 1366 (1401) T ss_pred CC-CCCCCEEEEECCCEEECCCHHHHHHHHHHCCEEEEEECCCCCHHHCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCC T ss_conf 78-88377377506625641887898877551955778675755733118688887744302011888758966688972 Q ss_pred HCCCCC-------CCC-CH-----HHHHHHH Q ss_conf 322100-------677-62-----8999864 Q gi|254780971|r 200 IIEKFH-------GGI-DG-----RGLFASL 217 (219) Q Consensus 200 ~~~~~~-------~~~-dG-----~~~f~~~ 217 (219) ++..|. |.+ |+ ++||+|+ T Consensus 1367 ~~R~~q~sW~P~~W~e~~~GPSpWlrLF~NA 1397 (1401) T TIGR01735 1367 VFRAVQNSWRPEDWDELDTGPSPWLRLFRNA 1397 (1401) T ss_pred CCCCCCCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 1124668778778863457787678987887 No 7 >PRK05297 phosphoribosylformylglycinamidine synthase; Provisional Probab=100.00 E-value=0 Score=507.71 Aligned_cols=215 Identities=33% Similarity=0.505 Sum_probs=195.2 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCC-----CCCCCCEEEECCCCCCCCCCCHHHHHHHH-----HHHHH Q ss_conf 689984497064799999999854996399821546-----44564789972873454311406775200-----21212 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDT-----DIPDVDLIVIPGGFSYGDYLRCGAIAART-----PVMQA 71 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~-----~l~~~d~lvipGGFSygD~l~aG~i~~~~-----~~~~~ 71 (219) |||||||||+||++||++||++ +||++++|+++|. .|.+|++|++||||||||+|+||+.||++ .+.++ T Consensus 1040 kVAIlreqG~Nge~EmAaAF~~-AGF~~~DVhmsDL~~g~~~L~~f~glva~GGFSyGDvlGsgkGwA~sil~N~~~r~~ 1118 (1294) T PRK05297 1040 KVAILREQGVNSHVEMAAAFDR-AGFDAIDVHMSDLLAGRVTLEDFKGLVACGGFSYGDVLGAGEGWAKSILFNPRLRDQ 1118 (1294) T ss_pred CEEEECCCCCCCHHHHHHHHHH-CCCCEEEEEHHHCCCCCCCHHHCCEEEECCCCCCCCCCCHHHHHHHHHHCCHHHHHH T ss_conf 3899768987967999999998-699348865231003611787791899878768767775678999999729999999 Q ss_pred HHHHHH-CCCEEEEECCCHHHHEECCCH-HH-----HHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCC Q ss_conf 233220-597178606403101000001-01-----10124433245422576752523577641379968998620133 Q gi|254780971|r 72 IKKKAQ-QGIKVMGICNGFQILVELNLL-PG-----ILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHD 144 (219) Q Consensus 72 i~~~~~-~g~~vLGICNGfQiL~elGLl-Pg-----~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgE 144 (219) +.+|.+ +++++||||||||+|++||+| || +|++|.|+||||||++++|.+++|+|+++|+ |++++||||||| T Consensus 1119 f~~Ff~r~DtfsLGICNGcQ~L~~Lg~i~pG~e~~P~~~~N~S~rfEsR~~~v~I~~s~Si~l~~m~-Gs~lpi~VaHGE 1197 (1294) T PRK05297 1119 FAAFFARPDTFALGVCNGCQMMSNLKEIIPGAEHWPRFVRNRSEQFEARFSLVEVQESPSIFLQGMA-GSRLPIAVAHGE 1197 (1294) T ss_pred HHHHHCCCCCEEEEEEHHHHHHHHCCCCCCCCCCCCCEECCCCCCEEEEEEEEEECCCCCHHHCCCC-CCEEEEEEECCC T ss_conf 9999739994599953889999977997888778972222454787898678898898986666578-987548887251 Q ss_pred EEE-EECHHHHHHHHHCCEEEEECCC---------CCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCC------- Q ss_conf 023-3086774115422503653168---------88889881126688869998999848843432210067------- Q gi|254780971|r 145 GNY-FIDAKGLAEIEKNNQIVFRYAS---------GTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGG------- 207 (219) Q Consensus 145 Grf-~~~~~~l~~l~~~~~i~~~Y~d---------~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~------- 207 (219) ||| +.+++.+++|.+++||++||+| |.|||||+.+||||||+|||||+|||||||++.+|++. T Consensus 1198 Gr~~f~~~~~~~~l~~~gqia~~YvD~~G~pt~~YP~NPNGS~~~IaGitS~DGR~l~mMpHpER~~~~~q~s~~P~~~~ 1277 (1294) T PRK05297 1198 GRAEFPDAEDLAAAEASGLVALRYVDNHGQVTETYPANPNGSPNGITGLTTADGRVTIMMPHPERVFRTVQNSWHPEDWG 1277 (1294) T ss_pred CEEECCCHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCHHHEEEEECCCCCEEEECCCHHHCCCCCCCCCCCCCCC T ss_conf 12662888999999977966899978999981448889988821035886698888657378677030454764698668 Q ss_pred CCH--HHHHHHHC Q ss_conf 762--89998642 Q gi|254780971|r 208 IDG--RGLFASLL 218 (219) Q Consensus 208 ~dG--~~~f~~~~ 218 (219) .++ +++|+|+- T Consensus 1278 ~~sPW~~~F~nAr 1290 (1294) T PRK05297 1278 EDSPWMRMFRNAR 1290 (1294) T ss_pred CCCCHHHHHHHHH T ss_conf 8981999999999 No 8 >KOG1907 consensus Probab=100.00 E-value=0 Score=462.34 Aligned_cols=215 Identities=27% Similarity=0.418 Sum_probs=192.3 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCC-----CCCCCCEEEECCCCCCCCCCCHHHHHHHH-----HHHHH Q ss_conf 689984497064799999999854996399821546-----44564789972873454311406775200-----21212 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDT-----DIPDVDLIVIPGGFSYGDYLRCGAIAART-----PVMQA 71 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~-----~l~~~d~lvipGGFSygD~l~aG~i~~~~-----~~~~~ 71 (219) ||||||++|+|+|+||++||. .+||++++|+++|. .|++|.+|++||||||+|+|+||+.||.+ .+..+ T Consensus 1060 kVAilREeGvNg~rEMa~af~-~AgF~~~DVtmtDlL~G~~~ld~frGlaf~GGFSYaDvLgSakGWAasil~ne~v~~Q 1138 (1320) T KOG1907 1060 KVAILREEGVNGDREMAAAFY-AAGFETVDVTMTDLLAGRHHLDDFRGLAFCGGFSYADVLGSAKGWAASILFNESVRSQ 1138 (1320) T ss_pred CEEEEECCCCCCHHHHHHHHH-HCCCCEEEEEEEHHHCCCEEHHHHCCEEEECCCCHHHHHCCCCCHHHHEEECHHHHHH T ss_conf 068861105562799999999-8588136446205435712586705446525743676514543324430107337799 Q ss_pred HHHHH-HCCCEEEEECCCHHHHEECCCHHH--------HHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEEC Q ss_conf 23322-059717860640310100000101--------101244332454225767525235776413799689986201 Q gi|254780971|r 72 IKKKA-QQGIKVMGICNGFQILVELNLLPG--------ILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAH 142 (219) Q Consensus 72 i~~~~-~~g~~vLGICNGfQiL~elGLlPg--------~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaH 142 (219) ..+|. +++++.||||||||+|+.+|+++. +|.+|+|+||||||+.++|..+.|+|+++|+ +.+|++|+|| T Consensus 1139 F~~F~~R~DtFslGiCNGCQlms~Lg~i~p~~~~~p~~~l~~Nes~rfE~r~~~vkI~~~~SIml~gM~-gs~LgvwvAH 1217 (1320) T KOG1907 1139 FEAFFNRQDTFSLGICNGCQLMSRLGWIGPEVGKWPDVFLDHNESGRFECRFGMVKIESNVSIMLSGMA-GSVLGVWVAH 1217 (1320) T ss_pred HHHHHCCCCCEEEECCCHHHHHHHHCCCCCCCCCCCCEEEECCCCCCEEEEEEEEEECCCCHHHHCCCC-CCCEEEEEEE T ss_conf 999853787312202417599998645574225787534421544322545788986787302120246-7722258870 Q ss_pred CCEEEEE-CHHHHHHHHHCCEEEEECCCC---------CCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCC---- Q ss_conf 3302330-867741154225036531688---------88898811266888699989998488434322100677---- Q gi|254780971|r 143 HDGNYFI-DAKGLAEIEKNNQIVFRYASG---------TNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGI---- 208 (219) Q Consensus 143 gEGrf~~-~~~~l~~l~~~~~i~~~Y~d~---------~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~---- 208 (219) |||||++ +++.++.++.++++.+||||+ .|||||+++||||||+|||||+|||||||++..|.+.. T Consensus 1218 GEGRa~f~~e~~~e~~~~~gl~~iryvdd~g~~te~yPfNpNGS~~gIAgicSpdGRhLAMMPHpER~~l~~q~~~yP~s 1297 (1320) T KOG1907 1218 GEGRATFRSEQNLEHLKKEGLVCIRYVDDYGNVTELYPFNPNGSPDGIAGICSPDGRHLAMMPHPERVFLMWQWSYYPWS 1297 (1320) T ss_pred CCCCEECCCHHHHHHHHHCCEEEEEEECCCCCEEEECCCCCCCCCCCCEEEECCCCCEEECCCCCHHEEEEEECCCCCCC T ss_conf 56621438578898775458347998638898745246589988010013417998753026882350454305657853 Q ss_pred -------CH--HHHHHHHC Q ss_conf -------62--89998642 Q gi|254780971|r 209 -------DG--RGLFASLL 218 (219) Q Consensus 209 -------dG--~~~f~~~~ 218 (219) .| .++|+|+- T Consensus 1298 ~~~e~~~~sPWirmF~nAr 1316 (1320) T KOG1907 1298 KRVEWGGLSPWIRMFRNAR 1316 (1320) T ss_pred CCCCCCCCCHHHHHHHHHH T ss_conf 2225568883999999777 No 9 >TIGR01857 FGAM-synthase phosphoribosylformylglycinamidine synthase; InterPro: IPR010141 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis, which represent a second clade of the enzymes found in Clostridia, Bifidobacteria and Streptococcus species. This enzyme performs the fourth step in IMP biosynthesis (the precursor of all purines) from PRPP.. Probab=100.00 E-value=0 Score=458.75 Aligned_cols=217 Identities=32% Similarity=0.502 Sum_probs=201.7 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC-------------CCCCCCCEEEECCCCCCCCCC-CHHHHHH--- Q ss_conf 68998449706479999999985499639982154-------------644564789972873454311-4067752--- Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD-------------TDIPDVDLIVIPGGFSYGDYL-RCGAIAA--- 64 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~-------------~~l~~~d~lvipGGFSygD~l-~aG~i~~--- 64 (219) ||-|-.|||||||||+++||++.|+....++.++. .+|++.++++||||||-||.. ||+++.+ T Consensus 1018 ~VvIpvFPGtNsEYD~~~aFe~~GAev~~~~~~nL~~~a~~~Sv~~l~~~i~ksqI~~lpGGFSAgDEPDGSAKFi~~~l 1097 (1279) T TIGR01857 1018 RVVIPVFPGTNSEYDSAKAFEKEGAEVKLVILRNLTEEALVESVEELVDEIDKSQILMLPGGFSAGDEPDGSAKFIAAIL 1097 (1279) T ss_pred EEEEECCCCCCCHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHHHHHHH T ss_conf 67981358898488888898545895899600038888999999999874046408995587877778885289999976 Q ss_pred -HHHHHHHHHHHHH-CCCEEEEECCCHHHHEECCCHH-----------HHHCCCCCCCEEEEEEEEEECCCHHHHHHHCC Q ss_conf -0021212233220-5971786064031010000010-----------11012443324542257675252357764137 Q gi|254780971|r 65 -RTPVMQAIKKKAQ-QGIKVMGICNGFQILVELNLLP-----------GILMRNCSLKFVCKQVLLEVVNSNTAFTKSYK 131 (219) Q Consensus 65 -~~~~~~~i~~~~~-~g~~vLGICNGfQiL~elGLlP-----------g~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~ 131 (219) +..++++|++|++ +|+++||||||||.|+++|||| .||+-|+-+|+.|+-|.+||.+++|||+.... T Consensus 1098 ~npkv~~~v~~lL~~RDGLiLGiCNGFQALvK~GLlPYG~i~~~~~~~pTLtyN~inrH~s~~V~Tki~~~~SPWL~~~~ 1177 (1279) T TIGR01857 1098 RNPKVKEAVKKLLKERDGLILGICNGFQALVKLGLLPYGEIVDVEEDSPTLTYNKINRHVSKIVRTKIASKNSPWLSEVS 1177 (1279) T ss_pred HCHHHHHHHHHHHHHCCCEEEEEEHHHHHHHHHCCCCCCCEEECCCCCCCCCCCCHHHHHEEEEEEEECCCCCCCCCCEE T ss_conf 08147999999996079849862005678888247888642400457863120201012010123565047788402103 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCC---------CCCCCCCCCEEEEECCCCCEEEECCCCHHHCC Q ss_conf 996899862013302330867741154225036531688---------88898811266888699989998488434322 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASG---------TNPNGSLHDIAGVINRRGNVLGMMPHPENIIE 202 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~---------~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~ 202 (219) .|++..|||+|||||||++++.|++|.+|+||+.||+|- .|||||..+|-||+|+||||||-|-|.||+-+ T Consensus 1178 vGdih~ipvSHGEGrFva~~~~l~~L~~NGQIatQYVD~~G~~~m~~~yNPNGS~~AIEGI~S~dG~i~GKMGHSER~~~ 1257 (1279) T TIGR01857 1178 VGDIHAIPVSHGEGRFVASDEVLKELRENGQIATQYVDFEGKPAMDLKYNPNGSSLAIEGIISPDGRIFGKMGHSERYGD 1257 (1279) T ss_pred ECCEEEEEEECCCCCEECCHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCCEEECCEECCCCEEEEECCCEEECCC T ss_conf 24277533444898323077899898637963789872678635788669785302463445689727520452024478 Q ss_pred CCCCCCCH---HHHHHHHC Q ss_conf 10067762---89998642 Q gi|254780971|r 203 KFHGGIDG---RGLFASLL 218 (219) Q Consensus 203 ~~~~~~dG---~~~f~~~~ 218 (219) .++.|.|| +.||+|.| T Consensus 1258 ~lf~NipGnk~Q~Lf~~gv 1276 (1279) T TIGR01857 1258 GLFKNIPGNKDQKLFKSGV 1276 (1279) T ss_pred CCEECCCCCCCHHHHHHHC T ss_conf 8546078765718898622 No 10 >TIGR01739 tegu_FGAM_synt herpesvirus tegument protein/v-FGAM-synthase; InterPro: IPR010077 This entry describes a family of large proteins of herpes virus. The protein is described variably as tegument protein or phosphoribosylformylglycinamidine synthase (FGAM-synthase). The protein shows homology to eukaryotic FGAM-synthase. ; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0005198 structural molecule activity, 0019033 viral tegument. Probab=99.98 E-value=1.6e-32 Score=228.78 Aligned_cols=200 Identities=19% Similarity=0.326 Sum_probs=169.8 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC-----C-----CCCCCCEEEECCCCCCCCCCCHHHHHH-----HH Q ss_conf 68998449706479999999985499639982154-----6-----445647899728734543114067752-----00 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD-----T-----DIPDVDLIVIPGGFSYGDYLRCGAIAA-----RT 66 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~-----~-----~l~~~d~lvipGGFSygD~l~aG~i~~-----~~ 66 (219) ||+|+.+||.+.+.-++.||+. +||++..|...+ . .+++|-|+||.|.--+-|...++.+++ +. T Consensus 1026 ~V~V~~~PG~~~~~~Ll~Al~~-aGF~~~~v~~~e~~~~~~~~Dtls~~~~~G~~i~G~~g~~~~~~g~~~~~~~l~~~~ 1104 (1313) T TIGR01739 1026 RVAVLLLPGQSKPESLLAALTN-AGFDPRIVSITELKKTDFSYDTLSLDTFSGLIIGGASGTLDSEVGARALAAALLRNQ 1104 (1313) T ss_pred EEEEEECCCCCCCHHHHHHHHH-CCCCEEEEEEEECCCCCCCCCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCC T ss_conf 1899987957884699999983-798448988862145888621113587579998798786326689999999997277 Q ss_pred ---HHHHHHHHHH-HCCCEEEEECC-CHHHHEECCCH-----------------------HHHHCCCCCCCEEEEEEEEE Q ss_conf ---2121223322-05971786064-03101000001-----------------------01101244332454225767 Q gi|254780971|r 67 ---PVMQAIKKKA-QQGIKVMGICN-GFQILVELNLL-----------------------PGILMRNCSLKFVCKQVLLE 118 (219) Q Consensus 67 ---~~~~~i~~~~-~~g~~vLGICN-GfQiL~elGLl-----------------------Pg~l~~N~s~rf~~r~~~~~ 118 (219) .+.+++.+|. +.|+|+||+-. |||||..++++ |..|..|+|++|||||.++. T Consensus 1105 Gds~~~~~L~~F~~r~dTFslg~Gelg~qlL~~l~~~~~~~~~~~~~~~~~~~~~mPlf~~~~L~~N~S~~yESrWLN~~ 1184 (1313) T TIGR01739 1105 GDSAFLRALLTFLNRPDTFSLGFGELGTQLLLALNIVGYTQSSPVISVETEVQESMPLFSPVALEKNASGLYESRWLNIY 1184 (1313) T ss_pred CCHHHHHHHHHHHHCCCCEEEEECHHHHHHHHHCCEEEECCCCCCCCEECHHHHCCCCCCCCCCCCCCCCCCCCCCCCEE T ss_conf 85589999999850698349931278889887424023107886431011043415333545455565532000112256 Q ss_pred ECC-CHHHHHHHCCCCCEEEEEEECCCEEE----EECHHHHHHHHHCCEEEEECC------------CCCCCCCCCCCEE Q ss_conf 525-23577641379968998620133023----308677411542250365316------------8888898811266 Q gi|254780971|r 119 VVN-SNTAFTKSYKMNQIIKCPVAHHDGNY----FIDAKGLAEIEKNNQIVFRYA------------SGTNPNGSLHDIA 181 (219) Q Consensus 119 v~~-~~s~~~~~~~~~~~l~~piaHgEGrf----~~~~~~l~~l~~~~~i~~~Y~------------d~~NPNGS~~~IA 181 (219) |++ ++|+|++.++ |.++++|+ .|-+ +.++...+.|.+.+||+..|+ ||.||+| -.+|| T Consensus 1185 IP~~t~Sv~l~~l~-G~VlPcW~---QGthLG~~~~~~~~~~~l~~~g~Ia~~Fhg~~~~~~~~A~~YPRNPsg-~~~vA 1259 (1313) T TIGR01739 1185 IPETTKSVFLRPLR-GSVLPCWA---QGTHLGLSYKSDGVEEELENSGQIAATFHGNSPSSGLPATNYPRNPSG-GSNVA 1259 (1313) T ss_pred ECCCCCCEEEECCC-CCCCCCEE---ECCCCCCCCCCCCHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCC-CCCEE T ss_conf 46887311330556-87401100---113266022786434555205547877426888887510048688888-87223 Q ss_pred EEECCCCCEEEECCCCHHHCCCCCCC Q ss_conf 88869998999848843432210067 Q gi|254780971|r 182 GVINRRGNVLGMMPHPENIIEKFHGG 207 (219) Q Consensus 182 gi~s~~G~vlgmMPHPER~~~~~~~~ 207 (219) ||||+||||+||.-.|=-++++|.|- T Consensus 1260 GlCS~DGRhlA~L~DPslsf~~wQWQ 1285 (1313) T TIGR01739 1260 GLCSEDGRHLALLIDPSLSFFPWQWQ 1285 (1313) T ss_pred EECCCHHHHHHHHHCHHHHCCCCCCC T ss_conf 67473156897761756716776875 No 11 >PRK13143 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.82 E-value=1.4e-19 Score=146.08 Aligned_cols=183 Identities=25% Similarity=0.390 Sum_probs=123.5 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 68998449706479999999985499639982154644564789972873454311406775200212122332205971 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIK 81 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~ 81 (219) ||+||.|-.-| -...++|+++ .+..+.++ .+..++.++|+|||||..||++... ....+.+.+.++++.++| T Consensus 3 ~I~Iid~G~~n-~~si~~~l~~-lg~~~~i~-~~~~~l~~~d~iILPGvG~~~~~~~-----~l~~~~~~i~~~~~~~~P 74 (201) T PRK13143 3 MIVIIDYGLGN-LRSVSKGLER-AGADVEIT-SDPEEILSADGIVLPGVGAFGDVME-----NLGPLKETINEAVDDGKP 74 (201) T ss_pred EEEEEECCCCH-HHHHHHHHHH-CCCEEEEC-CCHHHHHHCCEEEECCCCCHHHHHH-----HHHHHHHHHHHHHHCCCC T ss_conf 89999889719-9999999998-69859992-8999984379679738881889998-----765157999999986998 Q ss_pred EEEECCCHHHHEE----------CCCHHHHHCCCCCCCE--EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEE Q ss_conf 7860640310100----------0001011012443324--542257675252357764137996899862013302330 Q gi|254780971|r 82 VMGICNGFQILVE----------LNLLPGILMRNCSLKF--VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFI 149 (219) Q Consensus 82 vLGICNGfQiL~e----------lGLlPg~l~~N~s~rf--~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~ 149 (219) +||||.|+|+|.+ |||++|...+...... .-.|..+++.+ ++++++++..+ ..+|..|+ |++ T Consensus 75 iLGIClGmQll~~~s~E~~~~~GLg~~~g~v~~~~~~~~ip~~G~~~i~~~~-~~~l~~~i~~~--~~~yf~Hs---~~~ 148 (201) T PRK13143 75 FLGICLGMQMLFESSEEGGLTKGLGLIPGRVVRFPPGVKVPHMGWNTVNIVK-DCPLFEGIKDG--EYFYFVHS---YYA 148 (201) T ss_pred EEEEEHHHHHHHHHCCCCCCCCCCEEECCEEEECCCCCCCCCCCCEEEEEEC-CCHHHCCCCCC--CEEEEEEE---EEE T ss_conf 7998089865534140267644536863389973654446634426876532-68012286655--47999714---489 Q ss_pred CHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 867741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 150 DAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 150 ~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) ... ..+.+. .+++. |. .++++.. ++|++|.++|||++ +..|+.+++|++ T Consensus 149 ~~~------~~~~~~-~~t~~----~~--~~~a~i~-~~ni~GvQFHPEks------~~~G~~ilknF~ 197 (201) T PRK13143 149 DPD------DPDVIV-ATTDY----GI--EFPAVVC-KGNVFGTQFHPEKS------GETGLKILENFV 197 (201) T ss_pred EEC------CCCCEE-EEECC----CC--EEEEEEE-CCCEEEEECCCCCC------CCCHHHHHHHHH T ss_conf 827------986268-99619----98--0899993-89899991898236------725599999999 No 12 >PRK13153 consensus Probab=99.82 E-value=1.3e-19 Score=146.21 Aligned_cols=184 Identities=24% Similarity=0.375 Sum_probs=124.7 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 89984497064799999999854996399821546445647899728734543114067752002121223322059717 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKV 82 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~v 82 (219) |+||.+ |+.-=....+|+++ .+.++.++ .+..++.++|.||+||..||++.. ....+..+.+.+++++++++|+ T Consensus 2 I~IiDy-g~gNi~Sv~~al~~-lg~~~~ii-~~~~~i~~~d~lIlPGVGsf~~~m---~~L~~~~l~~~i~~~~~~g~pi 75 (203) T PRK13153 2 IGIIDY-NAGNLASVINAFEK-IGAKARLE-SDPSKLKEYDKLILPGVGAFGDAM---EHLKSNGMDEAIKEFAKSGKPF 75 (203) T ss_pred EEEEEC-CCCHHHHHHHHHHH-CCCCEEEE-CCHHHHHHCCEEEECCCCCHHHHH---HHHHHCCCHHHHHHHHHCCCCE T ss_conf 999966-93289999999998-69988997-699999658938987888348899---9777537578899998759968 Q ss_pred EEECCCHHHHEE----------CCCHHHHHCCCCCCCEE-------EEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCE Q ss_conf 860640310100----------00010110124433245-------4225767525235776413799689986201330 Q gi|254780971|r 83 MGICNGFQILVE----------LNLLPGILMRNCSLKFV-------CKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDG 145 (219) Q Consensus 83 LGICNGfQiL~e----------lGLlPg~l~~N~s~rf~-------~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEG 145 (219) ||||.|+|+|.+ |||++|...+-...+|. --|..++.. ++++++++++.+ ...+-.|. T Consensus 76 LGIClGmQlL~~~s~E~g~~~GLgli~G~V~~l~~~~~~~~~kiPhiGWn~i~~~-~~~~~~~~~~~~--~~~YF~HS-- 150 (203) T PRK13153 76 LGICLGMQLLFEKSYEFGEHEGLGLIEGEVVKFDESKFDEPLKVPHMGWNRLEFK-KETPLFRGLPES--FYLYFVHS-- 150 (203) T ss_pred EEEEHHHHHCCCCCEECCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCHHHEECC-CCCHHHCCCCCC--CEEEEEEE-- T ss_conf 9985553420047590675688848877899867222355667873435541034-568666077768--73999910-- Q ss_pred EEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 23308677411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 146 NYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 146 rf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) |++. ..++..++... + |. .=+|+|. .+|++|+++|||++-+ .|++|++|++. T Consensus 151 -y~~~------~~~~~~~~~~~-~-----g~-~~~aai~--~~Ni~G~QFHPEKS~~------~Gl~llknFl~ 202 (203) T PRK13153 151 -YHAV------CDDKDVLGKTT-Y-----GY-EFVSAVQ--HDNVFGFQPHPEKSHD------NGLKILKNFVE 202 (203) T ss_pred -EECC------CCCCCEEEEEC-C-----CC-EEEEEEE--ECCEEEEECCCCCCCH------HHHHHHHHHHC T ss_conf -3425------68768999955-9-----98-6899998--2999999289440778------59999999961 No 13 >PRK13175 consensus Probab=99.81 E-value=2.7e-19 Score=144.21 Aligned_cols=187 Identities=24% Similarity=0.363 Sum_probs=126.4 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 68998449706479999999985499639982154644564789972873454311406775200212122332205971 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIK 81 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~ 81 (219) ||+||.+ |+.-=...++|+++ .+.++.++ .+..++.++|+|||||+-||+... .-..+..+.+.++++.++++| T Consensus 3 kI~IiD~-g~gn~~si~~al~~-lg~~~~i~-~~~~~i~~~d~lILPGvGsf~~~~---~~l~~~~l~~~i~~~~~~g~P 76 (206) T PRK13175 3 KIAVIDY-GMGNLHSVCKALER-LGAEPILT-SDPADLLAADALILPGVGAFDPAM---QNLRSRGLIPPIKDAIASGKP 76 (206) T ss_pred EEEEEEC-CCCHHHHHHHHHHH-CCCCEEEE-CCHHHHHCCCEEEECCCCCCHHHH---HHHHHCCCHHHHHHHHHCCCC T ss_conf 7999979-97689999999998-79989997-998998328928968998526788---755531871689999856997 Q ss_pred EEEECCCHHHHEE---------CCCHHHHHCCCCCCCE----EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEE Q ss_conf 7860640310100---------0001011012443324----54225767525235776413799689986201330233 Q gi|254780971|r 82 VMGICNGFQILVE---------LNLLPGILMRNCSLKF----VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYF 148 (219) Q Consensus 82 vLGICNGfQiL~e---------lGLlPg~l~~N~s~rf----~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~ 148 (219) +||||-|+|+|.+ |||++|...+..+... .--|..+...+ ++++++++... ...|..|+ |+ T Consensus 77 iLGIClGmQlL~~~seeg~~~GLgli~g~V~k~~~~~~~~iPh~Gw~~i~~~~-~~~l~~~~~~~--~~~yf~HS---y~ 150 (206) T PRK13175 77 FLGICLGLQLLFESSEEGTEPGLGILPGKVQRFPSEPGLRIPHMGWNQLLTQP-DCPLWENLPPN--PWVYFVHS---YY 150 (206) T ss_pred EEEEEHHHHHHHHCCCCCCCCCCCEECEEEEECCCCCCCCCCCCCCEEEEECC-CCCCCCCCCCC--CCEEEEEC---CE T ss_conf 79986786676132534564776352107986265555337610450578517-88420112678--82389750---07 Q ss_pred ECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 08677411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 149 IDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 149 ~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) +.. .+...++..+.+ |+..=+|++- .+|++|++.|||++ +..|+.|++|+++ T Consensus 151 ~~~------~~~~~~~~~~~~-----~~~~~~a~v~--~~ni~G~QFHPEkS------~~~G~~llknFl~ 202 (206) T PRK13175 151 AVP------LDPSDTAATVTH-----GSQKVTAAIA--RDNLMAVQFHPEKS------GTAGLKLLRNFVS 202 (206) T ss_pred EEE------CCCCCEEEEEEC-----CCCCEEEEEE--CCCEEEEECCCCCC------CCCHHHHHHHHHH T ss_conf 840------687626999960-----8983899997--89999994898246------8437899999999 No 14 >PRK13525 glutamine amidotransferase subunit PdxT; Provisional Probab=99.81 E-value=2.1e-19 Score=144.89 Aligned_cols=173 Identities=22% Similarity=0.368 Sum_probs=115.9 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 96899844970647999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) |||+||-.+|. -++..+|+++ .|.++.+ ..++.++.++|.||||||-| ....+.+.+..+.+.|++++++|+ T Consensus 2 ~~IGvl~~qGn--~rs~~~aL~~-lG~~~~~-v~~~~di~~ad~lILPGG~s----~am~~ll~~~gl~~~i~~~i~~gk 73 (191) T PRK13525 2 MKIGVLALQGA--VREHIAALEA-LGAEAVE-VRRPEDLDEIDGLILPGGES----TTMGKLLRDFGLLEPLREFIASGL 73 (191) T ss_pred CEEEEEECCCC--HHHHHHHHHH-CCCCEEE-ECCHHHHHHCCEEEECCCCH----HHHHHHHHHCCCHHHHHHHHHCCC T ss_conf 37999964788--9999999998-7997899-89999995399899789765----899999998686899999998499 Q ss_pred EEEEECCCHHHHEE---------CCCHHHHHCCCCCCCEEEEEEEEEECC-CHHHHHHHCCCCCEEEEEEECCCEEEEEC Q ss_conf 17860640310100---------000101101244332454225767525-23577641379968998620133023308 Q gi|254780971|r 81 KVMGICNGFQILVE---------LNLLPGILMRNCSLKFVCKQVLLEVVN-SNTAFTKSYKMNQIIKCPVAHHDGNYFID 150 (219) Q Consensus 81 ~vLGICNGfQiL~e---------lGLlPg~l~~N~s~rf~~r~~~~~v~~-~~s~~~~~~~~~~~l~~piaHgEGrf~~~ 150 (219) |+||||-|+|+|.+ |||+|+...+|.-+|- +.. ..++-+.++..+ .....-|+ |+ T Consensus 74 P~LGIClGmqLL~~~~E~~~~~gLGli~g~V~Rn~~grq--------~~s~~~~~~~~~~~~~--~~~yFvhs---p~-- 138 (191) T PRK13525 74 PVFGTCAGMILLAKEIEGPEQEHLGLLDITVRRNAFGRQ--------VDSFEAELDIKGLGDP--FPAVFIRA---PY-- 138 (191) T ss_pred CEEEECHHHHEEEECCCCCCCCCCCCCCCEEEECCCCCC--------CCCCCCCCCCCCCCCC--CCEEEEEC---CE-- T ss_conf 857622212111202335897776778837863464751--------4444267636688998--45899708---88-- Q ss_pred HHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 67741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 151 AKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 151 ~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .....++-.+..+|-+ .++++ ..||++|...|||++- +.+|.+|++ T Consensus 139 ---~~~~~~~~~~~a~~g~---------~~vaV--~~~Nv~g~QFHPEkSg--------~~~i~~~Fl 184 (191) T PRK13525 139 ---IERVGPGVEVLARVGG---------RIVAV--RQGNILATSFHPELTD--------DRRVHRYFL 184 (191) T ss_pred ---EEECCCCCEEEEEECC---------EEEEE--EECCEEEEEECCCCCC--------CCHHHHHHH T ss_conf ---8044999489999899---------99999--9899999980993059--------837999999 No 15 >COG0118 HisH Glutamine amidotransferase [Amino acid transport and metabolism] Probab=99.79 E-value=1.3e-18 Score=139.94 Aligned_cols=184 Identities=27% Similarity=0.426 Sum_probs=123.1 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9689984497064799999999854996399821546445647899728-734543114067752002121223322059 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) |+|+||.+ |+---+...+||+++ |.++ .+..+...+..+|.||+|| | +|+|.. +-+.+..+.+.+++.++.+ T Consensus 2 ~~i~IIDy-g~GNL~Sv~~Aler~-G~~~-~vs~d~~~i~~AD~liLPGVG-af~~am---~~L~~~gl~~~i~~~~~~~ 74 (204) T COG0118 2 MMVAIIDY-GSGNLRSVKKALERL-GAEV-VVSRDPEEILKADKLILPGVG-AFGAAM---ANLRERGLIEAIKEAVESG 74 (204) T ss_pred CEEEEEEC-CCCHHHHHHHHHHHC-CCEE-EEECCHHHHHHCCEEEECCCC-CHHHHH---HHHHHCCHHHHHHHHHHCC T ss_conf 78999972-863599999999975-9806-981698897108887866978-889999---9888636699999997559 Q ss_pred CEEEEECCCHHHHEE----------CCCHHHHHCCCCCC--C-EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEE Q ss_conf 717860640310100----------00010110124433--2-4542257675252357764137996899862013302 Q gi|254780971|r 80 IKVMGICNGFQILVE----------LNLLPGILMRNCSL--K-FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGN 146 (219) Q Consensus 80 ~~vLGICNGfQiL~e----------lGLlPg~l~~N~s~--r-f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGr 146 (219) +|+||||-|+|+|.| |||+||...+=+.. + =.--|-.+.+. ..++++.+++.+.. ..--|. T Consensus 75 kP~LGIClGMQlLfe~SeE~~~~~GLg~i~G~V~r~~~~~~kvPHMGWN~l~~~-~~~~l~~gi~~~~~--~YFVHS--- 148 (204) T COG0118 75 KPFLGICLGMQLLFERSEEGGGVKGLGLIPGKVVRFPAEDLKVPHMGWNQVEFV-RGHPLFKGIPDGAY--FYFVHS--- 148 (204) T ss_pred CCEEEEEHHHHHHHHCCCCCCCCCCCCEECCEEEECCCCCCCCCCCCCCEEECC-CCCHHHCCCCCCCE--EEEEEE--- T ss_conf 977998175777664154368988702130168876888887886665223126-78744328998877--999978--- Q ss_pred EEECHHHHHHHHHCCEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 330867741154225036-531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 147 YFIDAKGLAEIEKNNQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 147 f~~~~~~l~~l~~~~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) |+... ...+..++ ..| |.. =.|.|. ++|++|.+.||||+-.- |+.|++|++ T Consensus 149 Y~~~~-----~~~~~v~~~~~Y-------G~~-f~AaV~--k~N~~g~QFHPEKSg~~------Gl~lL~NFl 200 (204) T COG0118 149 YYVPP-----GNPETVVATTDY-------GEP-FPAAVA--KDNVFGTQFHPEKSGKA------GLKLLKNFL 200 (204) T ss_pred EEECC-----CCCCEEEEECCC-------CCE-EEEEEE--ECCEEEEECCCCCCHHH------HHHHHHHHH T ss_conf 76157-----998618886468-------983-589997--08899992386311387------899999998 No 16 >PRK13147 consensus Probab=99.79 E-value=2e-18 Score=138.69 Aligned_cols=189 Identities=24% Similarity=0.383 Sum_probs=120.4 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHC---CCCE-EEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9689984497064799999999854---9963-99821546445647899728-73454311406775200212122332 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKII---GQSP-ILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKK 75 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~---~~~~-~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~ 75 (219) |||+||.+.-.| =+...+|+++++ +.+. +.+..++.++.++|.||||| | ||++... .+..+..+.+.++++ T Consensus 1 MkI~IIDyg~GN-i~Sv~~al~~~~~~~gl~~~~~i~~~~~~i~~~d~lILPGVG-sF~~~m~--~L~~~~~l~~~i~~~ 76 (211) T PRK13147 1 MRVVVIDYNGGN-LASAARALEKAATRKGIDATVVISADPTDVRNADRIVLPGQG-AFADCAQ--GLAAIPGLRNAIETA 76 (211) T ss_pred CEEEEECCCCHH-HHHHHHHHHHHHHHCCCCCEEEEECCHHHHHHCCEEEECCCC-CHHHHHH--HHHHCCCHHHHHHHH T ss_conf 989999379436-999999999887535888439996999999649978989866-4899999--988635859999999 Q ss_pred HHCCCEEEEECCCHHHHEE----------CCCHHHHHCC--CCCCC-EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEEC Q ss_conf 2059717860640310100----------0001011012--44332-454225767525235776413799689986201 Q gi|254780971|r 76 AQQGIKVMGICNGFQILVE----------LNLLPGILMR--NCSLK-FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAH 142 (219) Q Consensus 76 ~~~g~~vLGICNGfQiL~e----------lGLlPg~l~~--N~s~r-f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaH 142 (219) .++++|+||||-|+|+|.+ |||++|...+ +...+ =---|..+.+.++..++++++..+.. ..-.| T Consensus 77 ~~~~~piLGICLGMQlL~~~s~E~g~~~GLgli~G~v~kl~~~~~kvPhiGWn~i~~~~~~~~~~~~~~~~~~--fYFvH 154 (211) T PRK13147 77 TAAGTPFLGICVGMQLMAERGLEHERTPGFGWISGEIARMDAPGLRLPHMGWNTLDFTPGAHPLTDGLIPGDH--GYFVH 154 (211) T ss_pred HHCCCCEEEECHHHHHHCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCEECEEEEECCCCCCHHHCCCCCCCE--EEEEC T ss_conf 9769987998367575216465379877400770289989876788876252688705786643247676886--99961 Q ss_pred CCEEEEECHHHHHHHHHCCEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 3302330867741154225036-5316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 143 HDGNYFIDAKGLAEIEKNNQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 143 gEGrf~~~~~~l~~l~~~~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) + |++.+. ..+..++ ..| |.. =+|.|. .+|++|.+.|||++-.. |+++++|++. T Consensus 155 S---Y~~~~~-----~~~~~~~~~~y-------g~~-~~s~v~--k~Ni~G~QFHPEKS~~~------Gl~ll~nFlk 208 (211) T PRK13147 155 S---YALRDG-----ADSDLVATTQY-------GGQ-VPAIVA--RGNRAGTQFHVEKSQEV------GLTILGNFLR 208 (211) T ss_pred C---CCCCCC-----CCCCEEEEEEC-------CCE-EEEEEE--ECCEEEEECCCHHCCHH------HHHHHHHHHC T ss_conf 2---340688-----88768999989-------998-999999--89999996880014686------8999999867 No 17 >PRK13180 consensus Probab=99.79 E-value=2.1e-18 Score=138.48 Aligned_cols=190 Identities=18% Similarity=0.235 Sum_probs=121.6 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 689984497064799999999854996399821546445647899728-7345431140677520021212233220597 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) ||+||.+ |+.-=+...+|+++ .+.++.+ ..+..++.++|.||||| | ||++.. +-..+..+.+.+.++.+.|+ T Consensus 3 kI~IiDy-g~gNi~Sv~~al~~-~g~~~~i-~~~~~~~~~~d~lIlPGVG-sf~~~m---~~L~~~~~~~~i~~~~~~gk 75 (209) T PRK13180 3 RVVVLDY-GSGNLRSAQRALER-VGAEVEV-TADPDAALNADGLVVPGVG-AFAACM---AGLRAVGGDRIIGERLAAGR 75 (209) T ss_pred EEEEEEC-CCCHHHHHHHHHHH-CCCCEEE-ECCHHHHHHCCEEEECCCC-CHHHHH---HHHHHCCCHHHHHHHHHCCC T ss_conf 7999978-94389999999998-6998999-7999999538968999988-557777---76553691999999997699 Q ss_pred EEEEECCCHHHHEE-----------CCCHHHHHCCCCCCC-EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEE Q ss_conf 17860640310100-----------000101101244332-454225767525235776413799689986201330233 Q gi|254780971|r 81 KVMGICNGFQILVE-----------LNLLPGILMRNCSLK-FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYF 148 (219) Q Consensus 81 ~vLGICNGfQiL~e-----------lGLlPg~l~~N~s~r-f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~ 148 (219) |+||||-|+|+|.+ |||+||...+-++.+ -.--|..++..+ +++.+++++.+. ..+..|+..-.. T Consensus 76 piLGIClGMQlL~~~s~E~g~~~~GLg~i~G~v~~l~~~~vPh~GWn~v~~~~-~~~l~~~i~~~~--~fYFvHSy~v~~ 152 (209) T PRK13180 76 PVLGICVGMQILFERGVEHGVETEGCGEWPGTVERLDAPVLPHMGWNTVEAPA-GSVLFAGLDADE--RFYFVHSYAAQR 152 (209) T ss_pred CEEEEEEHEEEEEECEECCCCCCCCCCEECCEEEECCCCCCCCCCCEEEECCC-CCHHHHCCCCCC--CEEEECEEEECC T ss_conf 87998412088870311167677881365508997788878735655777067-763540888589--588968526634 Q ss_pred ECHHHHHHHHHCCEEEE-ECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 08677411542250365-31688888988112668886999899984884343221006776289998642 Q gi|254780971|r 149 IDAKGLAEIEKNNQIVF-RYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 149 ~~~~~l~~l~~~~~i~~-~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .+.+ .........++. .| | ..=||++. ++|++|.+.|||++-.. |++|++|++ T Consensus 153 ~~~~-~~~~~~~~~~~~~~~-------g-~~~~aav~--~~ni~G~QFHPEKS~~~------Gl~ll~nFl 206 (209) T PRK13180 153 WEGS-PDPRTAPPLVTWATH-------G-APFVAAVE--NGPLSATQFHPEKSGDA------GAALLRNWV 206 (209) T ss_pred CCCC-CCCCCCCCEEEEEEC-------C-CEEEEEEE--CCCEEEEECCCCCCCHH------HHHHHHHHH T ss_conf 5655-565665535899847-------9-82699998--29999993996326773------899999998 No 18 >PRK13146 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.78 E-value=1.9e-18 Score=138.80 Aligned_cols=187 Identities=21% Similarity=0.341 Sum_probs=122.9 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9689984497064799999999854996399821546445647899728-734543114067752002121223322059 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) ||++||.+.-.|- ....+|+++++....++++.+..++.++|.||||| | ||+... .-..+..+.+.+.++++++ T Consensus 2 ~kI~IiDyg~gNi-~Sv~~al~~~~~~~~i~i~~~~~~i~~~d~lIlPGVG-sf~~~m---~~L~~~~~~~~i~~~~~~~ 76 (208) T PRK13146 2 MSVAIIDYGSGNL-RSAARALERAAPGADVTVTADPDAVRAADRLVLPGVG-AFADCM---RGLRAVGGLREAVEEAAAG 76 (208) T ss_pred CEEEEEECCCCHH-HHHHHHHHHHCCCCEEEEECCHHHHHHCCEEEECCCC-CHHHHH---HHHHHCCCHHHHHHHHHCC T ss_conf 8899993795589-9999999985799739996999998428878983889-779999---9876407689999999839 Q ss_pred CEEEEECCCHHHHEE----------CCCHHHHHCC--CCCCCE---EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCC Q ss_conf 717860640310100----------0001011012--443324---5422576752523577641379968998620133 Q gi|254780971|r 80 IKVMGICNGFQILVE----------LNLLPGILMR--NCSLKF---VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHD 144 (219) Q Consensus 80 ~~vLGICNGfQiL~e----------lGLlPg~l~~--N~s~rf---~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgE 144 (219) +|+||||-|+|+|.+ |||++|...+ +...++ ---|..+++.+ ++++++++..+ -..+-.|. T Consensus 77 kpiLGICLGmQlL~~~s~E~g~~~GLgli~G~V~kl~~~~~~~kiPhiGWn~i~~~~-~~~l~~~i~~~--~~fYFvHS- 152 (208) T PRK13146 77 RPFLGICVGMQLLFERGEEHGDTPGLGWIPGDVVRLQPDGPALKVPHMGWNTVDQPR-DHPLFDGIPDG--ARFYFVHS- 152 (208) T ss_pred CCEEEEEEEEEEEECCCEECCCCCCCEEECCEEEECCCCCCCCCCCCCCEEEEEECC-CCCCCCCCCCC--CEEEEEEE- T ss_conf 987988854641211240068657011874189999878866657442527788568-98200478878--87999731- Q ss_pred EEEEECHHHHHHHHHCCEEEE-ECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 023308677411542250365-316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 145 GNYFIDAKGLAEIEKNNQIVF-RYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 145 Grf~~~~~~l~~l~~~~~i~~-~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) |++.... ....++. .| |. .=+|.|. .+|++|++.|||++-.. |++|++|++. T Consensus 153 --Y~~~~~~-----~~~~~a~t~y-------g~-~~~a~v~--~~ni~G~QFHPEKS~~~------Gl~llknFl~ 205 (208) T PRK13146 153 --YAAAPAN-----PADVLAWTDY-------GG-PFTAAVA--RGNLFATQFHPEKSGDA------GLRLLRNFLA 205 (208) T ss_pred --EEEECCC-----CHHEEEEEEC-------CC-EEEEEEE--CCCEEEEECCCCCCCHH------HHHHHHHHHC T ss_conf --3753188-----2100899638-------98-7999998--39999998993005683------8999999863 No 19 >PRK13527 glutamine amidotransferase subunit PdxT; Provisional Probab=99.77 E-value=2.4e-18 Score=138.16 Aligned_cols=175 Identities=22% Similarity=0.353 Sum_probs=112.3 Q ss_pred CEEEEEECCCCCCHHH----HHHHHHHHCCCC-EEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9689984497064799----999999854996-39982154644564789972873454311406775200212122332 Q gi|254780971|r 1 MKTAIVQIPGLNRDND----MIKAITKIIGQS-PILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKK 75 (219) Q Consensus 1 mkvaVl~~pGsNcd~e----~~~A~~~~~~~~-~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~ 75 (219) |||+||-.+|- =+| ..+|+++ .+.. ......++.+|.++|.||||||- |++ +.+.+.+..+.+.++++ T Consensus 1 ~~igvlalqG~--~~~hi~sl~kAl~~-lg~~~e~~~v~~p~~l~~~d~LILPGG~-f~~---~m~~L~~~gl~~~i~~~ 73 (196) T PRK13527 1 MKIGVLALQGD--VEEHIDALKRALDE-LGINGEVVRVRRPEDLKDCDALIIPGGE-STT---IGRLMKRYGILDEIKEK 73 (196) T ss_pred CEEEEEEECCC--HHHHHHHHHHHHHH-CCCCCEEEEECCHHHHHCCCEEEECCCC-HHH---HHHHHHHCCCHHHHHHH T ss_conf 95999972479--99999999999998-2999059996998998319979989966-799---99998886888999999 Q ss_pred HHCCCEEEEECCCHHHHEE----------CCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCE Q ss_conf 2059717860640310100----------000101101244332454225767525235776413799689986201330 Q gi|254780971|r 76 AQQGIKVMGICNGFQILVE----------LNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDG 145 (219) Q Consensus 76 ~~~g~~vLGICNGfQiL~e----------lGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEG 145 (219) +++|+|+||||-|+|+|.+ |||+|+...+|.-+|-...+ + ....+.++.. ....+.-|+ T Consensus 74 i~~g~PiLGIClGmQLL~~~s~e~~~~~GLG~i~g~V~rn~~g~q~~s~-----~--~~~~~~~~~~--~~~~yFVhs-- 142 (196) T PRK13527 74 IEEGLPILGTCAGLILLAKEVDDEVVQPLLGLMDVTVNRNAFGRQRDSF-----E--AELDLEGLDG--PFHAVFIRA-- 142 (196) T ss_pred HHCCCCEEEECHHHHHHHHHCCCCCCCCCCCCCCEEEEECCCCCCCCCC-----C--CCEEECCCCC--CEEEEEEEE-- T ss_conf 9769977997087999866315677767767575489965788745555-----1--6644147899--968999974-- Q ss_pred EEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 2330867741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 146 NYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 146 rf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) |++ .+..++-.+..+|.+ .|+++ ..||++|...|||.+-+ .++.++++ T Consensus 143 -~~~-----~~~~~~~~v~a~~~~---------~~~aV--~k~Ni~g~QFHPEkSgd--------~~i~~~Fl 190 (196) T PRK13527 143 -PAI-----DEAGGDVEVLAKLDD---------RIVAV--EQGNVLATAFHPELTDD--------TRIHEYFL 190 (196) T ss_pred -EEE-----ECCCCCEEEEEEECC---------EEEEE--EECCEEEEECCCCCCCC--------CHHHHHHH T ss_conf -235-----027998799999799---------99999--98999999559722387--------55999999 No 20 >PRK13181 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.77 E-value=3.3e-18 Score=137.24 Aligned_cols=185 Identities=22% Similarity=0.343 Sum_probs=125.0 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 89984497064799999999854996399821546445647899728734543114067752002121223322059717 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKV 82 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~v 82 (219) |+||.+-..|-. ..++|+++ .+..+.++ .+..++.++|.||+||--||+... .-..+..+.+.++++.+.++|+ T Consensus 2 I~IiD~G~~n~~-si~r~l~~-lg~~~~i~-~~~~~l~~~d~lIlPGvG~~~~~~---~~l~~~~~~~~i~~~~~~~~Pi 75 (199) T PRK13181 2 IAIIDYGAGNLR-SVANALKR-LGVEAVVS-SDPEEIEGADKVILPGVGAFGQAM---AALRESGLDEAIKEHVEKKQPV 75 (199) T ss_pred EEEEECCCHHHH-HHHHHHHH-CCCEEEEE-CCHHHHHHHHCEECCCCCCCCHHH---HHHHHCCCHHHHHHHHHCCCCE T ss_conf 999988960999-99999998-69809997-999997421100457986533025---6665638679999999769988 Q ss_pred EEECCCHHHHEE---------CCCHHHHHCCCCCCCEE---EEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEEC Q ss_conf 860640310100---------00010110124433245---422576752523577641379968998620133023308 Q gi|254780971|r 83 MGICNGFQILVE---------LNLLPGILMRNCSLKFV---CKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFID 150 (219) Q Consensus 83 LGICNGfQiL~e---------lGLlPg~l~~N~s~rf~---~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~ 150 (219) ||||-|+|+|.+ |||++|...+..+.+.. --|..+++.+ ++++++++.. ...+|..|+ |+++ T Consensus 76 LGIClGmQlL~~~~~eg~~~GLgl~~g~v~~~~~~~~~~phiGw~~i~~~~-~~~lf~~~~~--~~~~yf~Hs---~~v~ 149 (199) T PRK13181 76 LGICLGMQLLCESSEEGNVKGLGLIPGDVKRFRSGKDKVPQMGWNSVKPLK-ESPLFKGIEE--NSRFYFVHS---YYVP 149 (199) T ss_pred EEEEEEEEEEEECCCCCCCCCCCEECCEEEECCCCCCCCCCCCCEEEEEEC-CCHHHCCCCC--CCEEEEEEE---EEEE T ss_conf 998042676541000365466666212899827765555777745887621-6844537655--560488850---3786 Q ss_pred HHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 677411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 151 AKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 151 ~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) .. .....++.. + .|.. -+|++ +.+|++|+++|||++- ..|++|++|++. T Consensus 150 ~~-----~~~~~~a~~--~----~g~~-~~~~v--~~~ni~GvQFHPE~s~------~~G~~il~nFl~ 198 (199) T PRK13181 150 PE-----DPEDVLATT--N----YGVP-FCSAV--EKDNIYAVQFHPEKSG------KAGLKLLKNFAE 198 (199) T ss_pred CC-----CCCCEEEEE--C----CCCE-EEEEE--ECCCEEEEECCCCCCC------HHHHHHHHHHHC T ss_conf 17-----876327996--3----8984-99999--8999999908978178------609999999866 No 21 >TIGR01855 IMP_synth_hisH imidazole glycerol phosphate synthase, glutamine amidotransferase subunit; InterPro: IPR010139 Imidazole glycerol phosphate synthetase (IGPS) is a key metabolic enzyme, which links amino acid and nucleotide biosynthesis: it catalyses the closure of the imidazole ring within histidine biosynthesis (fifth step), and provides the substrate for de novo purine biosynthesis. IGPS consists of two different subunits: HisH, a glutamine amidotransferase (glutaminase), and HisF, a synthase (cyclase). HisH functions to provide a source of nitrogen, which is required for the synthesis of histidine and purines. In the HisH glutaminase reaction, the hydrolysis of glutamine yields ammonia, which is then used by HisF in the subsequent synthase reaction. The X-ray structure of the HisH/HisF heterodimer of IGPS reveals a putative tunnel for the transfer of ammonia from HisH to HisF via a (beta-alpha)8 barrel fold within HisF that abuts HisH . Ammonia tunnels connect the glutaminase and synthase active sites. HisH belongs to a large group of enzymes found in diverse and fundamental anabolic pathways. These enzymes share a common domain, referred to as the type-I glutamine amidotransferase (GATase) domain, which can occur either as single polypeptides (as with HisH) or as domains of large multifunctional proteins.; GO: 0016763 transferase activity transferring pentosyl groups, 0000105 histidine biosynthetic process, 0005737 cytoplasm. Probab=99.76 E-value=2.7e-18 Score=137.85 Aligned_cols=172 Identities=22% Similarity=0.391 Sum_probs=116.4 Q ss_pred HHHHHHHHHHCCCCEEEEECCC-CCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 9999999985499639982154-6445647899728-7345431140677520021212233220597178606403101 Q gi|254780971|r 15 NDMIKAITKIIGQSPILVWQSD-TDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQIL 92 (219) Q Consensus 15 ~e~~~A~~~~~~~~~~~v~~~~-~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL 92 (219) |-..+||++ .|+...+ ..++ ..+..+|.||+|| | ||.|.- +-+.+..+.+.+....+.++|+||||-|+||| T Consensus 16 rGV~~Al~~-~G~~~~v-~~~~y~~~~~aD~l~LPGVG-~F~~aM---~~l~~~~~~~~~~~~~~~~~P~lGiCLGMQLL 89 (211) T TIGR01855 16 RGVKRALKR-VGAEPVV-VKDSYKELEKADKLILPGVG-AFKAAM---ARLRELGLLDLLELVVKKKKPVLGICLGMQLL 89 (211) T ss_pred HHHHHHHHH-CCCEEEE-ECCCCHHHHCCCEEEECCCC-CHHHHH---HHHHHHHHHHHHHHHHHCCCCEEEEEHHHHHH T ss_conf 999999984-4981799-62860343106867857858-889999---99885005878899996189689998023430 Q ss_pred EE-----------CCCHHHH---HCCCCCCC----EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHH Q ss_conf 00-----------0001011---01244332----454225767525235776413799689986201330233086774 Q gi|254780971|r 93 VE-----------LNLLPGI---LMRNCSLK----FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGL 154 (219) Q Consensus 93 ~e-----------lGLlPg~---l~~N~s~r----f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l 154 (219) .| |||+||. |..+.+.+ =-=-|-.+.+....|+++++++++. +||+=.+.. T Consensus 90 fe~SeE~g~~~~GLG~i~G~V~k~~~~~~~~~lk~PHmGWN~v~~~~~~C~ll~g~~~~~-----------~~YFVHSY~ 158 (211) T TIGR01855 90 FERSEEGGGEVPGLGLIKGKVVKLEARVSLSKLKVPHMGWNEVEPVKESCPLLKGIDKGA-----------YFYFVHSYY 158 (211) T ss_pred HCCCCCCCCCCCCCEECCCEEEEECCCCCCCCCCCCCCCCCEEEEHHHCCCHHCCCCCCC-----------CEEEEEEEE T ss_conf 015624378866531127613420567788725457565642312311551130776678-----------155688667 Q ss_pred HHHHHC--CEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 115422--5036-5316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 155 AEIEKN--NQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 155 ~~l~~~--~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) .+..++ ..++ .+|-.+. .|.+ +++|++|.++||||+-.. |++|++|++. T Consensus 159 ~~~~~~S~~v~a~~~YG~~F--------~AaV--~~~n~~G~QFHPEKS~~~------G~~lL~nf~~ 210 (211) T TIGR01855 159 AVCEEESEAVLAKADYGEKF--------TAAV--QKDNIFGTQFHPEKSGKT------GLKLLKNFLE 210 (211) T ss_pred EECCCCHHHEEEEHHCCCCE--------EEEE--ECCCEEEEEECCHHHHHH------HHHHHHHHHC T ss_conf 76168811501120018630--------4466--548789997285688889------9999999852 No 22 >PRK13141 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.76 E-value=1.1e-17 Score=133.97 Aligned_cols=187 Identities=22% Similarity=0.320 Sum_probs=127.0 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 96899844970647999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) |.++||.+-..|- ....+|+++ .|.++.++ .+..++.++|+||+||.=||++.. +-.....+.+.++++.++++ T Consensus 1 ~mI~Iid~g~gNi-~si~~al~~-lG~~~~ii-~~~~~l~~~d~lIlPGvGsf~~~~---~~L~~~~l~~~i~~~~~~~~ 74 (204) T PRK13141 1 TMIAIIDYGMGNL-HSVSKALER-LGAEAIIT-SDPEEILAADAVILPGVGAFPDAM---QNLRERGLVEVIKEAVASGK 74 (204) T ss_pred CEEEEEECCCCHH-HHHHHHHHH-CCCCEEEE-CCHHHHHHCCEEEECCCCCCHHHH---HHHHHCCCHHHHHHHHHCCC T ss_conf 9899997897589-999999998-69989997-999998408989986887505687---76465392999999986599 Q ss_pred EEEEECCCHHHHEE----------CCCHHHHHCCCCCCCE----EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEE Q ss_conf 17860640310100----------0001011012443324----542257675252357764137996899862013302 Q gi|254780971|r 81 KVMGICNGFQILVE----------LNLLPGILMRNCSLKF----VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGN 146 (219) Q Consensus 81 ~vLGICNGfQiL~e----------lGLlPg~l~~N~s~rf----~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGr 146 (219) |+||||-|+|+|.+ |||++|...+..+.+. .--|..+...+ +++.++++..+ ..+|..|+ T Consensus 75 PiLGIClGmQlL~~~s~E~g~~~GLgl~~g~v~~~~~~~~~~vPhiGw~~i~~~~-~~~l~~~~~~~--~~~yf~HS--- 148 (204) T PRK13141 75 PLLGICLGMQLLFESSEEFGDTEGLGLLPGKVRRFPPGEGLKVPHMGWNQLEFTR-PSPLLAGLPEG--AYVYFVHS--- 148 (204) T ss_pred CEEEEHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCEEEEECC-CCHHHHCCCCC--CCEEEEEC--- T ss_conf 6897549899973013104664665436689997788888856510305776113-78054167777--72378624--- Q ss_pred EEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 3308677411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 147 YFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 147 f~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) |+++. .+...+. .+++. +... +|.+.. +|++|.+.|||++- +.|+.|++|++. T Consensus 149 y~v~~------~~~~~~~-a~~~~---g~~i--~a~i~~--~ni~G~QFHPEkS~------~~G~~ilknFl~ 201 (204) T PRK13141 149 YYADP------PDEEVVA-ATCDY---GQKF--PAAVAK--DNLFGVQFHPEKSG------DIGLKILKNFLE 201 (204) T ss_pred CEEEC------CCCCCEE-EEECC---CCEE--EEEEEC--CCEEEEECCCCCCC------HHHHHHHHHHHH T ss_conf 27833------8975289-99559---9889--999971--99999988985148------527999999999 No 23 >PRK13170 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.76 E-value=1.1e-17 Score=133.90 Aligned_cols=181 Identities=19% Similarity=0.281 Sum_probs=116.5 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 96899844970647999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) |||+||.+-..| =....+|+++ .+.++.++ .+..++.++|.||+||==||++.. .-..+..+.+.+++ .++ T Consensus 1 MkI~IiDyG~GN-i~Sv~~al~~-lg~~~~i~-~~~~~i~~~d~lILPGVGsF~~am---~~L~~~~l~~~i~~---~~k 71 (196) T PRK13170 1 MNVVIIDTGCAN-LSSVKFAFER-LGYNPVVS-RDPDVILAADKLFLPGVGTAQAAM---DQLRERELIELIKA---CTQ 71 (196) T ss_pred CEEEEECCCCCH-HHHHHHHHHH-CCCCEEEE-CCHHHHHHCCEEEECCCCCHHHHH---HHHHHHHHHHHHHH---CCC T ss_conf 989999279558-9999999998-79979996-798998438979971899889999---99987356899997---089 Q ss_pred EEEEECCCHHHHEE----------CCCHHHHHCCCCCCCE---EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE Q ss_conf 17860640310100----------0001011012443324---5422576752523577641379968998620133023 Q gi|254780971|r 81 KVMGICNGFQILVE----------LNLLPGILMRNCSLKF---VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY 147 (219) Q Consensus 81 ~vLGICNGfQiL~e----------lGLlPg~l~~N~s~rf---~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf 147 (219) |+||||-|+|+|.+ |||+||...+=.+..+ ---|-.+++.+ +++++++++.+. ..+-.|. | T Consensus 72 piLGICLGMQlL~~~S~E~g~~~GLgli~G~V~kl~~~~~~vPhiGWn~i~~~~-~~~l~~~i~~~~--~fYFvHS---Y 145 (196) T PRK13170 72 PVLGICLGMQLLGRRSEESGGVDCLGIIDGDVPKLTDFGLPLPHMGWNQVTPQA-GHPLFQGIEDGS--YFYFVHS---Y 145 (196) T ss_pred CEEEEEEEEEEEEECCCCCCCCCCCEEEEEEEEECCCCCCCCCEEEEEEEEECC-CCHHHCCCCCCC--EEEEEEE---E T ss_conf 748888003134635536898660005422999977678877442148887358-987664988588--7999877---8 Q ss_pred EECHHHHHHHHHCCEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 30867741154225036-5316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 148 FIDAKGLAEIEKNNQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 148 ~~~~~~l~~l~~~~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) +++.+ +..++ .+|.. .. +|.| ..+|++|.+.|||++-.. |++|++|++. T Consensus 146 ~~~~~-------~~~ia~t~yg~------~f--~s~v--~~~Ni~G~QFHPEKS~~~------Gl~lLknFl~ 195 (196) T PRK13170 146 AMPVN-------PYTIAQCNYGE------PF--SAAI--QKDNFYGVQFHPERSGAA------GAQLLKNFLE 195 (196) T ss_pred EECCC-------CCEEEEEECCC------EE--EEEE--EECCEEEEECCCCCCCHH------HHHHHHHHHC T ss_conf 70688-------88899997899------89--9999--719999996892006784------8999999961 No 24 >PRK13148 consensus Probab=99.76 E-value=6.5e-18 Score=135.35 Aligned_cols=185 Identities=21% Similarity=0.347 Sum_probs=115.4 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9689984497064799999999854996399821546445647899728-734543114067752002121223322059 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) .+|+||.+ |+.-=+...+|+++++....+.+..++.++.++|.||||| | ||++.. +-.....+.+.+.+ ..++ T Consensus 2 ~~I~IIDy-g~GNl~Sv~~al~~~g~~~~~~i~~~~~~i~~~d~lILPGVG-sF~~am---~~L~~~~l~~~i~~-~~~~ 75 (225) T PRK13148 2 STIAIVDY-GMGNFHSVARALQHAAPDADIRICNRPEQIDAADRVVFPGQG-AMPDCM---RTLNESGLRAAVER-AAAS 75 (225) T ss_pred CEEEEEEC-CCHHHHHHHHHHHHHCCCCCEEEECCHHHHHHCCEEEECCCC-CHHHHH---HHHHHCCHHHHHHH-HHCC T ss_conf 87999979-944999999999985789739998999999549979989978-879999---99886487999999-8638 Q ss_pred CEEEEECCCHHHHEE---------CCCHHHHHCCCC---------------------CCCE---EEEEEEEEECCCHHHH Q ss_conf 717860640310100---------000101101244---------------------3324---5422576752523577 Q gi|254780971|r 80 IKVMGICNGFQILVE---------LNLLPGILMRNC---------------------SLKF---VCKQVLLEVVNSNTAF 126 (219) Q Consensus 80 ~~vLGICNGfQiL~e---------lGLlPg~l~~N~---------------------s~rf---~~r~~~~~v~~~~s~~ 126 (219) +|+||||-|+|+|.+ |||+||...+=. ..++ ---|-.++..+ ++++ T Consensus 76 kpiLGICLGMQlLf~~SeEg~~~GLgli~G~V~kl~~~~~~~~~~~~~~~~~~~~~~~~~lkiPh~GWn~v~~~~-~~~l 154 (225) T PRK13148 76 KPLMGVCVGEQMLFERSEEGDTPCLGIFPGEVRRFAGPQFADPVAADQAAAAPPAAARERLKVPHMGWNQVRQTR-SHAL 154 (225) T ss_pred CCEEEEEHHHHHHHHCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCEEECCC-CCCH T ss_conf 986998277887662452269775655878999814765456421123334555443548888888612037789-9712 Q ss_pred HHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEE-E-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCC Q ss_conf 6413799689986201330233086774115422503-6-5316888889881126688869998999848843432210 Q gi|254780971|r 127 TKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQI-V-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKF 204 (219) Q Consensus 127 ~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i-~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~ 204 (219) ++++..++ ...-.|. |++.+ .+...+ + ..| | ..=+|.|. .+|++|.++|||++-.. T Consensus 155 ~~~i~~~~--~fYFvHS---Y~~~~------~~~~~~~a~t~y-------g-~~f~s~v~--~~Ni~G~QFHPEKS~~~- 212 (225) T PRK13148 155 WEGIPDGT--HFYFVHS---YYAAP------SDPALTTGVTDY-------G-VAFTCAVA--AANIFAVQFHPEKSAEH- 212 (225) T ss_pred HCCCCCCC--EEEEEEE---EEECC------CCCCCEEEEEEC-------C-CEEEEEEE--ECCEEEEECCHHHHCHH- T ss_conf 12667787--0688754---56316------881558999858-------9-98999999--69999990881231577- Q ss_pred CCCCCHHHHHHHHCC Q ss_conf 067762899986429 Q gi|254780971|r 205 HGGIDGRGLFASLLT 219 (219) Q Consensus 205 ~~~~dG~~~f~~~~s 219 (219) |++|++|++. T Consensus 213 -----Gl~lLknFl~ 222 (225) T PRK13148 213 -----GLRLYRNFVD 222 (225) T ss_pred -----HHHHHHHHHC T ss_conf -----9999999876 No 25 >PRK13172 consensus Probab=99.75 E-value=6.9e-18 Score=135.18 Aligned_cols=187 Identities=24% Similarity=0.358 Sum_probs=119.9 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9689984497064799999999854996399821546445647899728-734543114067752002121223322059 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) ++|+||.+ |+.-=+...+|+++ .+.++.++ .+..++.++|.||||| | ||++.. .-.....+.+.++++.+++ T Consensus 2 ~~I~IIDy-G~GNi~Sv~~al~~-lg~~~~i~-~~~~~i~~~d~lIlPGVG-sF~~~m---~~L~~~~l~~~i~~~~~~~ 74 (213) T PRK13172 2 SSVSIVDY-GVGNLLSVARAFQY-FDASVNLV-STPEEIMSADRLVLPGVG-AFEDGM---KGLTTLNFIEPIKQFARSG 74 (213) T ss_pred CEEEEEEC-CCHHHHHHHHHHHH-CCCCEEEE-CCHHHHHHCCEEEECCCC-CHHHHH---HHHHHCCCHHHHHHHHHCC T ss_conf 87999978-94299999999998-69988997-799998338978976878-469999---9887238637899987659 Q ss_pred CEEEEECCCHHHHEE----------CCCHHHHHCCCCCCCE--------EEEEEEEEECCCH----HHHHHHCCCCCEEE Q ss_conf 717860640310100----------0001011012443324--------5422576752523----57764137996899 Q gi|254780971|r 80 IKVMGICNGFQILVE----------LNLLPGILMRNCSLKF--------VCKQVLLEVVNSN----TAFTKSYKMNQIIK 137 (219) Q Consensus 80 ~~vLGICNGfQiL~e----------lGLlPg~l~~N~s~rf--------~~r~~~~~v~~~~----s~~~~~~~~~~~l~ 137 (219) +|+||||-|+|+|.+ |||+||....-.+... ---|..+...+.. .+++++...+.. T Consensus 75 kpiLGICLGmQlLf~~s~E~g~~~GLgli~G~v~kl~~~~~~~~~~kvPh~GWn~i~~~~~~~~~~~~l~~~~~~~~~-- 152 (213) T PRK13172 75 KPFLGICLGMQMMLSKSTEFGQHEGLGLIAGEVVSVPSHGVDGQLHKIPHIGWNELVSTSEGEDWCHTILKNIPLNSS-- 152 (213) T ss_pred CCEEEEEHHHHHHHHCCCCCCCCCCCCEECCEEEECCCCCCCCCCCCCCEECHHHEECCCCCCCCCCHHHCCCCCCCE-- T ss_conf 957998576662653864687647665755259877755666645627620144533245543334366618977861-- Q ss_pred EEEECCCEEEEECHHHHHHHHHCCEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHH Q ss_conf 862013302330867741154225036-5316888889881126688869998999848843432210067762899986 Q gi|254780971|r 138 CPVAHHDGNYFIDAKGLAEIEKNNQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFAS 216 (219) Q Consensus 138 ~piaHgEGrf~~~~~~l~~l~~~~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~ 216 (219) ..-.|. |++.+. ..+..++ ..|. |. .++++- .++|++|++.|||++-.. |+.|++| T Consensus 153 fYFvHS---Y~~~~~-----~~~~~~a~~~y~------g~--~~~a~i-~~~Ni~G~QFHPEKS~~~------Gl~lL~n 209 (213) T PRK13172 153 VYFVHS---FMAMPS-----NPKKRLADTLYD------GQ--AISAVI-KDENMYGCQFHPEKSGEV------GLSIIQQ 209 (213) T ss_pred EEEEEE---EEEECC-----CCCCEEEEEEEC------CE--EEEEEE-ECCCEEEEECCHHHCCHH------HHHHHHH T ss_conf 799987---788538-----887389999989------99--999999-889999980880223686------9999999 Q ss_pred HCC Q ss_conf 429 Q gi|254780971|r 217 LLT 219 (219) Q Consensus 217 ~~s 219 (219) +++ T Consensus 210 Fl~ 212 (213) T PRK13172 210 FLQ 212 (213) T ss_pred HHC T ss_conf 972 No 26 >PRK13173 consensus Probab=99.74 E-value=2.3e-17 Score=131.90 Aligned_cols=184 Identities=21% Similarity=0.316 Sum_probs=116.0 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9689984497064799999999854996399821546445647899728-734543114067752002121223322059 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) .||+||.+--.|- ....+|+++ .+.++.++ .+..++.++|.||+|| | ||++.. +......+.+.+++ +..+ T Consensus 2 ~kI~IiDyg~gNi-~Sv~~al~~-lg~~~~i~-~~~~~i~~~d~lILPGVG-sF~~~m---~~L~~~~l~~~i~~-~~~~ 73 (211) T PRK13173 2 TKIALLDYGMGNL-HSASKALSA-VGAEVSIT-NDPKVIAAADKIVFPGVG-AMRDCM---AGMHEAGIDEVVRQ-AIFN 73 (211) T ss_pred CEEEEEECCCHHH-HHHHHHHHH-CCCCEEEE-CCHHHHHHCCEEEECCCC-CHHHHH---HHHHHCCHHHHHHH-HHCC T ss_conf 8799996894499-999999998-69988996-799998438968975778-689999---99886297999998-8629 Q ss_pred CEEEEECCCHHHHEE----------CCCHHHHHCC---C----CCCCE---EEEEEEEEECCCHHHHHHHCCCCCEEEEE Q ss_conf 717860640310100----------0001011012---4----43324---54225767525235776413799689986 Q gi|254780971|r 80 IKVMGICNGFQILVE----------LNLLPGILMR---N----CSLKF---VCKQVLLEVVNSNTAFTKSYKMNQIIKCP 139 (219) Q Consensus 80 ~~vLGICNGfQiL~e----------lGLlPg~l~~---N----~s~rf---~~r~~~~~v~~~~s~~~~~~~~~~~l~~p 139 (219) +|+||||-|+|+|.+ |||++|...+ + +..++ ---|-.+...+...+++++++.++ ... T Consensus 74 kpiLGICLGMQlLf~~s~E~g~~~GLgli~G~V~~l~~~~~~~~~~~~kvPhiGWn~v~~~~~~~~l~~~i~~~~--~fY 151 (211) T PRK13173 74 KPVMAICVGMQALFEQSAENGGTPCLSILDGTVKAFDPSWKDPKGVQIKVPHMGWNTISGMDFEHPLWNGIDDKA--HFY 151 (211) T ss_pred CCEEEEEHHHHHHHCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEECCCCCCHHHCCCCCCC--EEE T ss_conf 998998188898506898899967130660069876744467555543466000345433576536444998786--079 Q ss_pred EECCCEEEEECHHHHHHHHHCCEEEE--ECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHH Q ss_conf 20133023308677411542250365--3168888898811266888699989998488434322100677628999864 Q gi|254780971|r 140 VAHHDGNYFIDAKGLAEIEKNNQIVF--RYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASL 217 (219) Q Consensus 140 iaHgEGrf~~~~~~l~~l~~~~~i~~--~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~ 217 (219) -.|. |+++. .++..++. .|-. .=+|.|. .+|++|++.|||++-.. |++|++|+ T Consensus 152 FvHS---Y~~~~------~~~~~~~~~~~yg~--------~f~a~v~--k~Ni~G~QFHPEKS~~~------Gl~lLknF 206 (211) T PRK13173 152 FVHS---YYCEP------ADSSQVAAICDYGQ--------PFCASIL--KDNLFATQFHPEKSHTA------GLQLLKNF 206 (211) T ss_pred EEEE---EEEEC------CCCCEEEEEEECCC--------EEEEEEE--ECCEEEEECCHHHCCHH------HHHHHHHH T ss_conf 9866---78703------78250789983898--------8999999--69999990880124586------89999999 Q ss_pred CC Q ss_conf 29 Q gi|254780971|r 218 LT 219 (219) Q Consensus 218 ~s 219 (219) +. T Consensus 207 l~ 208 (211) T PRK13173 207 VE 208 (211) T ss_pred HC T ss_conf 73 No 27 >PRK13178 consensus Probab=99.74 E-value=1.2e-17 Score=133.67 Aligned_cols=183 Identities=18% Similarity=0.226 Sum_probs=121.7 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 89984497064799999999854996399821546445647899728734543114067752002121223322059717 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKV 82 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~v 82 (219) |+||.+-- .-=+...+|+++ .+.++.+ ..+..++.++|.||+||.-||+... .-.....+.+.+.++.++++|+ T Consensus 2 I~IiDyg~-GNi~Sv~~al~~-lg~~~~i-~~~~~~i~~~d~lIlPGVGsf~~~m---~~L~~~~l~~~i~~~~~~gkpi 75 (213) T PRK13178 2 LAILDYKA-GNQTSVRRALNH-LGIPCVI-TADPEVIQGAAGIIFPGVGAAGQAM---NELRTTGLDEVLREQVQAGKPL 75 (213) T ss_pred EEEEECCC-CHHHHHHHHHHH-CCCCEEE-ECCHHHHHHCCEEEECCCCCHHHHH---HHHHHCCHHHHHHHHHHCCCCE T ss_conf 99996593-089999999998-6999899-7799999439989989988569999---9875006289999874369957 Q ss_pred EEECCCHHHHEE---------CCCHHHHHCC---C----CCCCE---EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECC Q ss_conf 860640310100---------0001011012---4----43324---542257675252357764137996899862013 Q gi|254780971|r 83 MGICNGFQILVE---------LNLLPGILMR---N----CSLKF---VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHH 143 (219) Q Consensus 83 LGICNGfQiL~e---------lGLlPg~l~~---N----~s~rf---~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHg 143 (219) ||||-|+|+|.+ |||++|...+ + ++.++ ---|..+... ++++.++++..+. ..+-.|. T Consensus 76 LGIClGmQlLf~~seE~~~~GLgli~G~v~~l~~~~~~~~~~~~kvPhiGWn~i~~~-~~~~l~~~~~~~~--~fYFvHS 152 (213) T PRK13178 76 LGICVGCQIMLDYSQENDTKCLGIIPGECRLFNPALTDEDGAPIRVPHMGWNSVVQR-RPCPLFKGIEPEA--EFYFVHS 152 (213) T ss_pred EEEEHHHEEEECCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEEC-CCCCCCCCCCCCC--EEEEEEE T ss_conf 998424555541442454224675610699888543443477756772122127717-9985224567778--7999868 Q ss_pred CEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 302330867741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 144 DGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 144 EGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) |+++.+ ++..++..+. |.. =+|.+ ..+|++|++.|||++-.. |+.|++|++ T Consensus 153 ---Y~~~~~------~~~~ia~t~y------g~~-f~a~v--~k~Ni~G~QFHPEKSg~~------Gl~lLknF~ 203 (213) T PRK13178 153 ---YFPAPP------EEYVIATTTY------GIE-FCSAH--GGPGLWAVQFHPEKSGRP------GLRLLANFH 203 (213) T ss_pred ---EEECCC------CCCEEEEEEC------CCE-EEEEE--EECCEEEEECCCCCCCHH------HHHHHHHHH T ss_conf ---985389------8648999978------997-99999--809999991995316886------999999999 No 28 >cd01748 GATase1_IGP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). IGPS incorporates ammonia derived from glutamine into N1-[(5'-phosphoribulosyl)-formimino]-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to form 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR) and imidazole glycerol phosphate (IGP). The glutamine amidotransferase domain generates the ammonia nucleophile which is channeled from the glutaminase active site to the PRFAR active site. IGPS belong to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=99.74 E-value=2.7e-17 Score=131.40 Aligned_cols=183 Identities=25% Similarity=0.399 Sum_probs=118.6 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 89984497064799999999854996399821546445647899728-73454311406775200212122332205971 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQGIK 81 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~ 81 (219) |+||.+-. .--...++|+++ .+..+.++ .++.++.++|+||||| | ||++.. +...+..+.+.++++.++++| T Consensus 1 I~IiD~G~-~ni~si~~~l~~-lg~~~~ii-~~~~~l~~~d~iIlPGvG-~f~~~~---~~l~~~~~~~~i~~~~~~~~P 73 (198) T cd01748 1 IAIIDYGM-GNLRSVANALER-LGAEVIIT-SDPEEILSADKLILPGVG-AFGDAM---ANLRERGLIEALKEAIASGKP 73 (198) T ss_pred CEEEECCC-CHHHHHHHHHHH-CCCCEEEE-CCHHHHHHCCEEEECCCC-CCHHHH---HHHHHCCHHHHHHHHHHCCCC T ss_conf 99997697-789999999998-79929997-999998308989987887-403444---441003819999999876996 Q ss_pred EEEECCCHHHHEE----------CCCHHHHHCCCCCCCE----EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE Q ss_conf 7860640310100----------0001011012443324----5422576752523577641379968998620133023 Q gi|254780971|r 82 VMGICNGFQILVE----------LNLLPGILMRNCSLKF----VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY 147 (219) Q Consensus 82 vLGICNGfQiL~e----------lGLlPg~l~~N~s~rf----~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf 147 (219) +||||-|+|+|.+ ||+++|...+-.+... .--|..+.+.+ +++.+.++..+ ..+|..|+ | T Consensus 74 iLGIClG~QlL~~~s~E~~~~~gLg~~~g~V~~~~~~~~~~iP~iGwn~i~~~~-~~~~~~~~~~~--~~~yF~HS---~ 147 (198) T cd01748 74 FLGICLGMQLLFESSEEGGGTKGLGLIPGKVVRFPASEGLKVPHMGWNQLEITK-ESPLFKGIPDG--SYFYFVHS---Y 147 (198) T ss_pred EEEEEEEEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCCCCCCEEEEECCC-CCCHHCCCCCC--CEEEEEEC---C T ss_conf 899981145466676506863553655409998877778866411224675033-68032077767--53588624---1 Q ss_pred EECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 30867741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 148 FIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 148 ~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) +++. .+...+.. +++. |. .-+|++ +++|++|++.|||.+- ..|++|++|+| T Consensus 148 ~v~~------~~~~~~~a-~~~~----~~-~~~a~i--~~~ni~GvQFHPEkS~------~~G~~ll~nFl 198 (198) T cd01748 148 YAPP------DDPDYILA-TTDY----GG-KFPAAV--EKDNIFGTQFHPEKSG------KAGLKLLKNFL 198 (198) T ss_pred EEEC------CCCCEEEE-EECC----CC-EEEEEE--ECCCEEEEECCCCCCC------HHHHHHHHHHC T ss_conf 7752------89750688-9759----98-399999--8799999978985247------60899999539 No 29 >PRK13174 consensus Probab=99.74 E-value=6e-17 Score=129.21 Aligned_cols=185 Identities=21% Similarity=0.356 Sum_probs=115.3 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 689984497064799999999854996399821546445647899728-7345431140677520021212233220597 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) +|+||.+ |+.-=....+|+++ .+.+..++..+..++.++|.||||| | ||+... +-..+.. ++++.+...+++ T Consensus 3 ~I~IiDy-G~gNi~Sv~~al~~-~g~~~~~i~~~~~~i~~~d~lIlPGVG-sf~~~m---~~L~~~~-~~~~i~~~~~~k 75 (212) T PRK13174 3 TVAVIDY-GMGNLHSVAKALEH-VGAGRVLVTSDAAVIREADRVVFPGVG-AIRDCM---AEIRRLG-FDSLVREVSQDR 75 (212) T ss_pred EEEEEEC-CCHHHHHHHHHHHH-CCCCEEEECCCHHHHHHCCEEEECCCC-CHHHHH---HHHHHCC-HHHHHHHHHCCC T ss_conf 8999958-81079999999997-499879983898999638979965888-689999---9988769-699999997499 Q ss_pred EEEEECCCHHHHEE----------CCCHHHHHC---CCCC---CCE---EEEEEEEEECCCHHHHHHHCCCCCEEEEEEE Q ss_conf 17860640310100----------000101101---2443---324---5422576752523577641379968998620 Q gi|254780971|r 81 KVMGICNGFQILVE----------LNLLPGILM---RNCS---LKF---VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVA 141 (219) Q Consensus 81 ~vLGICNGfQiL~e----------lGLlPg~l~---~N~s---~rf---~~r~~~~~v~~~~s~~~~~~~~~~~l~~pia 141 (219) |+||||-|+|+|.+ |||+||... .+.+ .+. ---|..+...+ ++++++++..+. ..+-. T Consensus 76 piLGICLGMQlL~~~s~E~g~~~GLgli~G~V~~~~~~~~~~~~~~kiPhiGWn~i~~~~-~~~l~~~~~~~~--~fYFv 152 (212) T PRK13174 76 PFLGICVGMQALLERSEENGGVDCIGLFPGQVRFFGKDLHEDGEHLKVPHMGWNQVSQAV-DHPLWHDIPDLA--RFYFV 152 (212) T ss_pred CEEEEEHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCEECCEEEEECC-CCCHHCCCCCCC--EEEEE T ss_conf 879980889987617547898775336350799867555655545567550012476068-881210677657--59987 Q ss_pred CCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 133023308677411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 142 HHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 142 HgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) |. |++... .+...++..+. |.. ++++.. .+|++|++.|||++-.. |+.|++|+++ T Consensus 153 HS---Y~~~~~-----~~~~~~a~t~~------g~~--f~a~v~-k~Ni~G~QFHPEKS~~~------Gl~lLknFi~ 207 (212) T PRK13174 153 HS---YYIEAG-----NPRQVVGRGHY------GVD--FAAALA-DGSRFAVQFHPEKSHTH------GLQLLQNFAA 207 (212) T ss_pred EE---EEECCC-----CHHCEEEEEEC------CCE--EEEEEE-ECCEEEECCCCCCCCHH------HHHHHHHHHH T ss_conf 56---675269-----82038999989------988--999998-49999990891005775------8999999983 No 30 >CHL00188 hisH imidazole glycerol phosphate synthase subunit hisH; Provisional Probab=99.73 E-value=2.5e-17 Score=131.66 Aligned_cols=188 Identities=24% Similarity=0.322 Sum_probs=115.0 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 96899844970647999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) |||+||.+.-.| =+...+||++ .+.++.++ .++.++.++|.|||||.=||++.. .-..+..+.+.++++.++|+ T Consensus 2 ~~I~Iid~g~GN-i~Sv~~al~~-~g~~~~ii-~~~~~i~~~d~lILPGVGsf~~~m---~~L~~~~l~~~i~~~~~~g~ 75 (210) T CHL00188 2 MKIGIIDYSMGN-LHSVSRAIQQ-AGQQPCII-NSESELAQVHALVLPGVGSFDLAM---KKLEKKGLITPIKKWIAEGN 75 (210) T ss_pred CEEEEEECCCHH-HHHHHHHHHH-CCCCEEEE-CCHHHHHHCCEEEECCCCCHHHHH---HHHHHCCCHHHHHHHHHCCC T ss_conf 779999588478-9999999998-69987997-999999648968988867389999---98863051699999997699 Q ss_pred EEEEECCCHHHHEE---------CCCHHHHHCCCCCCCE----EEEEEEEEECCCHH-----HHHHHCCCCCEEEEEEEC Q ss_conf 17860640310100---------0001011012443324----54225767525235-----776413799689986201 Q gi|254780971|r 81 KVMGICNGFQILVE---------LNLLPGILMRNCSLKF----VCKQVLLEVVNSNT-----AFTKSYKMNQIIKCPVAH 142 (219) Q Consensus 81 ~vLGICNGfQiL~e---------lGLlPg~l~~N~s~rf----~~r~~~~~v~~~~s-----~~~~~~~~~~~l~~piaH 142 (219) |+||||-|+|+|.+ |||+||...+-..... ---|-.+++.+.++ +...++..+. ...-.| T Consensus 76 piLGICLGmQlLf~~seEg~~~GLgli~G~v~k~~~~~~~kvPh~GWn~i~~~~~~~~~~~~~~~~~~~~~~--~~YFvH 153 (210) T CHL00188 76 PFIGICLGLHLLFETSEEGKEEGLGIYKGQVKRLKHSPVKVIPHMGWNRLECQNSECQNSEWVNWKAWPLNP--WAYFVH 153 (210) T ss_pred CEEEEEHHEEEEEHHCCCCCCCCCCEECCEEEECCCCCCCCCCCCCHHHEECCCCCCCCCCCCHHHCCCCCC--CEEEEE T ss_conf 869984210221022115676773456668999378866467320542324457643344451121577677--079974 Q ss_pred CCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 3302330867741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 143 HDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 143 gEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) . |++ ...+...+...+.+ |...=+|+| ..+|++|++.|||++-.. |+.+++|++ T Consensus 154 S---Y~~------~~~~~~~~~~~~~y-----g~~~~~a~v--~k~Ni~G~QFHPEKS~~~------Gl~lL~nFl 207 (210) T CHL00188 154 S---YGV------MPKSQACATTTTFY-----GKQQMVAAI--EYDNIFAMQFHPEKSGEF------GLWLLREFM 207 (210) T ss_pred E---EEE------CCCCCCEEEEEEEE-----CCEEEEEEE--EECCEEEEECCCCCCCHH------HHHHHHHHH T ss_conf 1---355------27984379999963-----998999999--719999994880007687------999999997 No 31 >PRK13152 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.72 E-value=7.2e-17 Score=128.67 Aligned_cols=182 Identities=21% Similarity=0.370 Sum_probs=120.1 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHH-HCCC Q ss_conf 89984497064799999999854996399821546445647899728-734543114067752002121223322-0597 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKA-QQGI 80 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~-~~g~ 80 (219) |+||.+-..|- ...++|+++ .+.++.++ .+..++.++|.||||| | ||++. .+-..+..+.+.++++. ++++ T Consensus 2 I~IiDyg~gNi-~Sv~~al~~-lg~~~~i~-~~~~~i~~~d~lIlPGVG-sf~~~---m~~L~~~~l~~~i~~~~~~~~~ 74 (201) T PRK13152 2 IALIDYKAGNL-NSVAKAFEK-IGAINFIA-KNPKDLQKADKLLLPGVG-SFKEA---MKNLKELGFIEALKEQVLVQKK 74 (201) T ss_pred EEEEECCCCHH-HHHHHHHHH-CCCCEEEE-CCHHHHHHCCEEEECCCC-CHHHH---HHHHHHCCCHHHHHHHHHHCCC T ss_conf 99995594279-999999998-69988997-899999658958982888-49999---9877751858899999987699 Q ss_pred EEEEECCCHHHHEE----------CCCHHHHHCC---CCCCC-EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEE Q ss_conf 17860640310100----------0001011012---44332-4542257675252357764137996899862013302 Q gi|254780971|r 81 KVMGICNGFQILVE----------LNLLPGILMR---NCSLK-FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGN 146 (219) Q Consensus 81 ~vLGICNGfQiL~e----------lGLlPg~l~~---N~s~r-f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGr 146 (219) |+||||-|+|+|.+ |||++|...+ +...+ -.--|..+++.+ +++.++++..+ -..+-.|+. T Consensus 75 piLGIClGmQlL~~~s~E~~~~~GLg~i~G~v~kl~~~~~~kvPhiGWn~v~~~~-~~~l~~~~~~~--~~fYF~HSy-- 149 (201) T PRK13152 75 PILGICLGMQLFLERGYEGGVCEGLGFIEGEVVKFEEDLNLKIPHMGWNELEILK-QSPLYQGIPNK--SDFYFVHSF-- 149 (201) T ss_pred CEEEEEHHHEEEEECCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEECC-CCHHHCCCCCC--CEEEEEEEE-- T ss_conf 6699852134621042437863655899769998677767666632258986034-79756388766--719997200-- Q ss_pred EEECHHHHHHHHHCCEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 330867741154225036-5316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 147 YFIDAKGLAEIEKNNQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 147 f~~~~~~l~~l~~~~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) +++. .++...+ .+|.. .=+|+| +.+|++|++.|||++-.. |++|++|++. T Consensus 150 -~v~~------~~~~~~~~~~yg~--------~~~aav--~~~Ni~G~QFHPEKS~~~------Gl~lLknFl~ 200 (201) T PRK13152 150 -YVKC------KDEFVSAKAQYGH--------KFVASL--QKDHIFATQFHPEKSQNL------GLKLLENFLR 200 (201) T ss_pred -EECC------CCCCEEEEEECCC--------EEEEEE--EECCEEEEECCCCCCCHH------HHHHHHHHHC T ss_conf -6236------8874899998499--------899999--849999991894116775------8999999974 No 32 >PRK13151 consensus Probab=99.71 E-value=9.3e-17 Score=127.99 Aligned_cols=176 Identities=21% Similarity=0.331 Sum_probs=107.1 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9689984497064799999999854996399821546445647899728-734543114067752002121223322059 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) |||+||.+--.| =+...+||++ .+.++.+ ..+..++.++|.||+|| | ||++... -.....+.+.+ .+.+ T Consensus 1 MkI~IiDyg~GN-i~Sv~~al~~-lg~~~~i-~~~~~~i~~~d~lIlPGVG-sf~~~m~---~L~~~~l~~~i---~~~~ 70 (195) T PRK13151 1 MKIIIIDTACAN-LASLKFCLDR-LGFNATI-SRDLKELESADKLFLPGVG-TAKEAMK---NLEQFNLIDFI---QNTK 70 (195) T ss_pred CEEEEEECCCHH-HHHHHHHHHH-CCCCEEE-ECCHHHHHHCCEEEECCCC-CHHHHHH---HHHHCCCHHHH---HHCC T ss_conf 989999479609-9999999998-5998799-6799999448969988889-9999999---99874778889---8639 Q ss_pred CEEEEECCCHHHHEE---------CCCHHHHHCCC---CCCCE-EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEE Q ss_conf 717860640310100---------00010110124---43324-542257675252357764137996899862013302 Q gi|254780971|r 80 IKVMGICNGFQILVE---------LNLLPGILMRN---CSLKF-VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGN 146 (219) Q Consensus 80 ~~vLGICNGfQiL~e---------lGLlPg~l~~N---~s~rf-~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGr 146 (219) +|+||||-|+|+|.+ |||++|...+- ++.+. ---|..++ +++.++++++ +. ...-.|. T Consensus 71 kpiLGICLGMQlL~~~seE~~~~gLgli~g~v~k~~~~~~~~vPhiGWn~v~---~~~~l~~~i~-~~--~fYFvHS--- 141 (195) T PRK13151 71 KPLLGICLGMQILGNFSEELNQETLKLIDFTTQKFKAKEGFTFPHMGWNQVH---SSHALFKGLE-GA--YFYFVHS--- 141 (195) T ss_pred CCEEEEEEEEEEEECCCCCCCCCCCCEEEEEEEEECCCCCCCCCEEEHHHCC---CCCHHHCCCC-CC--EEEEEEE--- T ss_conf 9668877647876514757675762642026888646557888674112256---6770121666-76--2788730--- Q ss_pred EEECHHHHHHHHHCCEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 330867741154225036-531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 147 YFIDAKGLAEIEKNNQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 147 f~~~~~~l~~l~~~~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) |+++. .+ ..++ ..|.. .. +|.| ..+|++|++.|||++-.. |+++++|++ T Consensus 142 Y~~~~------~~-~~ia~~~y~~------~f--~a~v--~k~Ni~G~QFHPEKS~~~------Gl~lL~nFl 191 (195) T PRK13151 142 YCVGL------GE-YTIADCEYSQ------KF--SASV--MKDNFYGVQFHPERSSEA------GEILISNFI 191 (195) T ss_pred EECCC------CC-CEEEEEECCC------EE--EEEE--EECCEEEEECCCHHCCHH------HHHHHHHHH T ss_conf 45267------65-6378973798------89--9999--919999992880030587------999999998 No 33 >pfam01174 SNO SNO glutamine amidotransferase family. This family and its amidotransferase domain was first described in. It is predicted that members of this family are involved in the pyridoxine biosynthetic pathway, based on the proximity and co-regulation of the corresponding genes and physical interaction between the members of pfam01174 and pfam01680. Probab=99.70 E-value=7.5e-17 Score=128.59 Aligned_cols=177 Identities=21% Similarity=0.313 Sum_probs=107.1 Q ss_pred EEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC-CCEEE Q ss_conf 98449706479999999985499639982154644564789972873454311406775200212122332205-97178 Q gi|254780971|r 5 IVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ-GIKVM 83 (219) Q Consensus 5 Vl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~-g~~vL 83 (219) ||..+|. -++..+|+++ .|.++.++ .++.+|.++|.||||||+|. ...+.+.++.+.+.+++++.+ ++|+| T Consensus 1 vlalqGN--l~sv~nal~~-lG~e~~~v-~~p~di~~ad~LILPGg~s~----~~~~ll~~~gl~~~l~e~i~~~~kPiL 72 (188) T pfam01174 1 VLALQGA--VEEHEEAIKK-CGAENKTV-KRPEDLAQCDALIIPGGEST----AMSLLAKRYGFYEPLYEFVHNPNKPIW 72 (188) T ss_pred CCCCCCC--HHHHHHHHHH-CCCCEEEE-CCHHHHHCCCEEEECCCCHH----HHHHHHHHCCCHHHHHHHHHHCCCCEE T ss_conf 9832358--9999999998-69987996-89999940898998898879----999999986888999999980799663 Q ss_pred EECCCHHHHEE---------CCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHH Q ss_conf 60640310100---------000101101244332454225767525235776413799689986201330233086774 Q gi|254780971|r 84 GICNGFQILVE---------LNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGL 154 (219) Q Consensus 84 GICNGfQiL~e---------lGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l 154 (219) |||-|+|+|.+ |||+++...+|..++- +...+++..| .++.... ...--|+ | .+ T Consensus 73 GIClGMqLL~~~~e~g~~~gLGli~~~V~r~~~g~~------~~~~~~~~~~-~~~~~~~--~~yFvhs---~-----~~ 135 (188) T pfam01174 73 GTCAGLILLSKQLGNELVKTLGLLKVTVKRNAFGRQ------VDSFEKECDF-KNLIPKF--PGVFIRA---P-----VI 135 (188) T ss_pred EHHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCC------CCCCCCCCCC-CCCCCCC--EEEEEEC---C-----EE T ss_conf 331446421102477864424653226876576754------6554478633-2689994--3999817---7-----45 Q ss_pred HHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 1154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 155 AEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 155 ~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .+..++..++..|.++ + .++++ ..||++|..+|||.+-+.. .-+..+.++++ T Consensus 136 ~~~~~~~~v~~~~~~~----~---~~v~v--~~~Ni~g~QFHPEKS~d~~---~~~~~fl~~~~ 187 (188) T pfam01174 136 EEILDPEVVVVLYELD----G---KIVVA--KQGNILATSFHPELAEDDY---RVHDWFVENFV 187 (188) T ss_pred CCCCCCCEEEEEEECC----C---EEEEE--EECCEEEEECCCEECCCCC---HHHHHHHHHHH T ss_conf 1049985299999579----9---99999--9899999951984579978---89999999963 No 34 >PRK13171 consensus Probab=99.70 E-value=2.5e-16 Score=125.19 Aligned_cols=181 Identities=20% Similarity=0.347 Sum_probs=112.5 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9689984497064799999999854996399821546445647899728-734543114067752002121223322059 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) +||+||.+-..| =....+|+++ .+.++.++ .+..++.++|.||+|| | ||+... +-.....+.+.+++ .+ T Consensus 2 ~~I~IiDyg~gN-i~Sv~~al~~-lg~~~~ii-~~~~~i~~~~~lIlPGVG-sf~~~m---~~L~~~~~~~~i~~---~~ 71 (200) T PRK13171 2 TDVALIDAGGAN-LGSVRYALER-LGVEARVV-RDAAGLQGAQRVILPGVG-AAPEAM---ARLRAQGLIEPLRE---LQ 71 (200) T ss_pred CEEEEEECCCCH-HHHHHHHHHH-CCCCEEEE-CCHHHHHCCCEEEECCCC-CHHHHH---HHHHHCCCHHHHHH---CC T ss_conf 879999689517-9999999998-49987997-899999529989989988-789999---99987212678986---49 Q ss_pred CEEEEECCCHHHHEE---------CCCHHHHHCC---CCCCC-EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEE Q ss_conf 717860640310100---------0001011012---44332-4542257675252357764137996899862013302 Q gi|254780971|r 80 IKVMGICNGFQILVE---------LNLLPGILMR---NCSLK-FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGN 146 (219) Q Consensus 80 ~~vLGICNGfQiL~e---------lGLlPg~l~~---N~s~r-f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGr 146 (219) +|+||||-|+|+|.+ |||+||...+ +...+ -.--|..+...+ +++.++++..++ ..+-.|+ T Consensus 72 ~piLGIClGmQlLf~~SeEg~~~GLgli~G~V~kl~~~~~~~vPh~GWn~i~~~~-~~~~~~~~~~~~--~fYFvHS--- 145 (200) T PRK13171 72 VPLIGICLGMQLLFEHSEEGDVECLGLLPGIVRHMTPALGIRVPHMGWNRLVPMR-DSALLAGLPERA--SAYFVHG--- 145 (200) T ss_pred CCEEEEEHHHHHHHCCCCCCCCCCCCEECCEEEECCCCCCCCCCEEEEEEEECCC-CCHHHCCCCCCC--CEEEECC--- T ss_conf 9735560765754035437997661677018998677767868364243453367-887672998798--1999829--- Q ss_pred EEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 3308677411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 147 YFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 147 f~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) |.++ ..++...+..|. .. =+|.+ .++|++|.+.|||++-+. |+++++|++. T Consensus 146 Y~~~------~~~~~i~~~~~~-------~~-f~a~v--~~~ni~G~QFHPEKS~~~------Gl~ll~nF~~ 196 (200) T PRK13171 146 YAAP------VTADTVAACDHG-------GL-FTAIV--QNGLRCGAQFHPERSADT------GARILRNFLE 196 (200) T ss_pred CCCC------CCCCEEEEECCC-------CE-EEEEE--ECCCEEEECCCCCCCCHH------HHHHHHHHHH T ss_conf 6117------888789994279-------84-99999--819989980894326675------7999999985 No 35 >cd01749 GATase1_PB Glutamine Amidotransferase (GATase_I) involved in pyridoxine biosynthesis. Glutamine Amidotransferase (GATase_I) involved in pyridoxine biosynthesis. Glutamine amidotransferase (GATase) activity involves the removal of the ammonia group from a glutamate molecule and its subsequent transfer to a specific substrate, thus creating a new carbon-nitrogen group on the substrate. This group contains proteins like Bacillus subtilus YaaE and Plasmodium falciparum Pdx2 which are members of the triad glutamine aminotransferase family and function in a pathway for the biosynthesis of vitamin B6. Probab=99.70 E-value=6.9e-17 Score=128.81 Aligned_cols=172 Identities=26% Similarity=0.431 Sum_probs=107.7 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 89984497064799999999854996399821546445647899728734543114067752002121223322059717 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKV 82 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~v 82 (219) |+||.-+|. -++..+||++ .|.++.++ .++.++.++|.||+|||=| ++ +.+.+.+..+.+.+++++++|+|+ T Consensus 1 iGvl~lqGn--~~s~~~Al~~-lG~~~~~~-~~p~~i~~ad~LILPGg~s-~~---~~~~L~~~gl~~~i~~~i~~gkP~ 72 (183) T cd01749 1 IGVLALQGD--FREHIRALER-LGVEVIEV-RTPEDLEGIDGLIIPGGES-TT---IGKLLRRTGLLDPLREFIRAGKPV 72 (183) T ss_pred CEEEEECCC--HHHHHHHHHH-CCCCEEEE-CCHHHHHHCCEEEECCCCH-HH---HHHHHHHCCCHHHHHHHHHCCCCE T ss_conf 979983478--9999999998-79979998-9999983189899889408-99---999998878889999999849984 Q ss_pred EEECCCHHHHEE----------CCCHHHHHCCCCCCCEEE-EEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECH Q ss_conf 860640310100----------000101101244332454-225767525235776413799689986201330233086 Q gi|254780971|r 83 MGICNGFQILVE----------LNLLPGILMRNCSLKFVC-KQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDA 151 (219) Q Consensus 83 LGICNGfQiL~e----------lGLlPg~l~~N~s~rf~~-r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~ 151 (219) ||||-|+|+|.+ |||+||...+|.-++-.. -+..+++.. +..+.. .. ++.+. T Consensus 73 LGIClGMQlL~~~s~E~g~~~GLGli~g~V~rn~~g~q~~sf~~~l~ip~--------~g~n~~-~~--------~Fv~s 135 (183) T cd01749 73 FGTCAGLILLAKEVEDQGGQPLLGLLDITVRRNAFGRQVDSFEADLDIPG--------LGLGPF-PA--------VFIRA 135 (183) T ss_pred EEHHHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCC--------CCCCCC-CE--------EEECC T ss_conf 75107788876523467886763655519963563676665525752786--------788876-66--------99448 Q ss_pred HHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 7741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 152 KGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 152 ~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .......+.-.++..|-+ .|+++ .+||++|.++|||++-+ .+|.|+++ T Consensus 136 p~~~~~~~~~~~~~~~~~---------~~~aV--~~~Ni~g~QFHPEKSgd--------~~i~~~fl 183 (183) T cd01749 136 PVIEEVGPGVEVLAEYDG---------KIVAV--RQGNVLATSFHPELTDD--------TRIHEYFL 183 (183) T ss_pred CEEEECCCCEEEEEEECC---------EEEEE--EECCEEEEECCCCCCCC--------CCHHHHHC T ss_conf 646005998099999899---------99999--98999999479801598--------64888639 No 36 >PRK13179 consensus Probab=99.70 E-value=2.7e-16 Score=124.99 Aligned_cols=183 Identities=20% Similarity=0.349 Sum_probs=116.3 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 89984497064799999999854996399821546445647899728-73454311406775200212122332205971 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQGIK 81 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~ 81 (219) |+||.+ |..-=....+|+.+ .+.++. +..+..++.++|.||+|| | ||++.. .-..+..+.+.++++.++++| T Consensus 2 I~IiDy-g~gNi~Sv~~al~~-~g~~~~-i~~~~~~i~~~d~lILPGVG-sF~~~m---~~L~~~~l~~~i~~~~~~~kp 74 (207) T PRK13179 2 IAIIDY-GMGNLRSVQKGFEK-VGFEAV-VTADPKVVLEAEKVVLPGVG-AFRDCM---RNLEQGGFVEPILRVIRDGRP 74 (207) T ss_pred EEEEEC-CCHHHHHHHHHHHH-CCCCEE-EECCHHHHHHCCEEEECCCC-CHHHHH---HHHHHCCCHHHHHHHHHCCCC T ss_conf 999956-93589999999998-599879-96899999439989989978-679999---998853761689999865996 Q ss_pred EEEECCCHHHHEE----------CCCHHHHHCCC------CCCCE---EEEEEEEEECCCHHHHHHHCCCCCEEEEEEEC Q ss_conf 7860640310100----------00010110124------43324---54225767525235776413799689986201 Q gi|254780971|r 82 VMGICNGFQILVE----------LNLLPGILMRN------CSLKF---VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAH 142 (219) Q Consensus 82 vLGICNGfQiL~e----------lGLlPg~l~~N------~s~rf---~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaH 142 (219) +||||-|+|+|.+ |||+||...+- .+.++ ---|..+.+.+ +++.+.+++.+. ...-.| T Consensus 75 iLGICLGmQlLf~~S~E~g~~~GLgli~G~V~kl~~~~~~~~~~~kiPhiGWn~i~~~~-~~~~~~~~~~~~--~fYFvH 151 (207) T PRK13179 75 FLGICVGMQLLFTDSEEFGLYQGLNVIPGHVLRFPEGMREGGEELKVPHMGWNQLSIRR-RPPAFQGIEDGT--NVYFVH 151 (207) T ss_pred EEEEEHHHHHCCCCCCCCCCCCCCCEECEEEEECCCCCCCCCCCCCCCEEEEEEEECCC-CCCHHCCCCCCC--EEEEEE T ss_conf 89980779963798806995577878340799876443444654668751125664367-883101445677--079984 Q ss_pred CCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 3302330867741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 143 HDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 143 gEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) . |++. ..+...++....+ |.. =+|+|. .+|++|++.|||++-.. |++|++|++ T Consensus 152 S---Y~~~------~~~~~~i~~~~~y-----g~~-~~a~v~--k~Ni~G~QFHPEKS~~~------Gl~lLknF~ 204 (207) T PRK13179 152 S---YYVK------PDDESVVAATTGY-----GIE-FCAAVW--KDNIVATQFHPEKSQAV------GLSILKNFG 204 (207) T ss_pred C---CEEC------CCCCCCEEEEEEC-----CCE-EEEEEE--ECCEEEEECCHHHCCHH------HHHHHHHHH T ss_conf 3---1641------5784518999848-----997-999999--69999995880125586------899999997 No 37 >PRK13177 consensus Probab=99.70 E-value=1.2e-16 Score=127.36 Aligned_cols=184 Identities=20% Similarity=0.345 Sum_probs=110.4 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHH-HHCC Q ss_conf 689984497064799999999854996399821546445647899728-73454311406775200212122332-2059 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKK-AQQG 79 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~-~~~g 79 (219) +|+||.+ |+.-=+...+|+++ .+.+...+..+..++.++|.||+|| | ||++... .+.....+.+.+.+. .+++ T Consensus 4 ~I~IiDy-g~GNi~Sv~~al~~-~g~~~~~i~~~~~~i~~~d~lIlPGVG-sF~~~m~--~Lk~~~~~~~~i~~~~~~~~ 78 (207) T PRK13177 4 TIALIDY-GAGNLRSVHNALKA-AGAEGVVVTADPDEVRRADRVVLPGVG-AFAACMG--GLRAIPGMVEAMEERVLEGG 78 (207) T ss_pred EEEEECC-CCHHHHHHHHHHHH-CCCCCEEEECCHHHHHCCCEEEECCCC-CHHHHHH--HHHHCCCHHHHHHHHHHHCC T ss_conf 8999918-93279999999998-299828995899998359979989978-7899997--76410444899999999769 Q ss_pred CEEEEECCCHHHHEE----------CCCHHHHHCCC--CCCCE---EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCC Q ss_conf 717860640310100----------00010110124--43324---5422576752523577641379968998620133 Q gi|254780971|r 80 IKVMGICNGFQILVE----------LNLLPGILMRN--CSLKF---VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHD 144 (219) Q Consensus 80 ~~vLGICNGfQiL~e----------lGLlPg~l~~N--~s~rf---~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgE 144 (219) +|+||||-|+|+|.+ |||++|....= ....+ ---|-.+...++...++. .+. .+..|+ T Consensus 79 ~piLGICLGmQlLf~~s~E~g~~~GLgli~G~V~~l~~~~~~~kiPh~GWn~v~~~~~~~~~~~---~~~---~YFvHS- 151 (207) T PRK13177 79 APFLGICVGMQLLADRGEEHGVTPGLGWIGGEVRLIEPADPSCKVPHMGWNDVAPTTGSHPLIR---PGE---AYFLHS- 151 (207) T ss_pred CCEEEECHHHHHHHHCCCCCCCCCCCCEECCEEEEECCCCCCCCCCEEEEHEEEECCCCCCCCC---CCC---EEEEEE- T ss_conf 9569981878857657607887775455453689816888777587644010254466776677---784---578988- Q ss_pred EEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 023308677411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 145 GNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 145 Grf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) |++... .++..++..+. |. .=+|.|. .+|++|.++|||++-.. |+++++|++. T Consensus 152 --Y~~~~~-----~~~~i~~~t~y------~~-~~~a~v~--k~Ni~G~QFHPEKS~~~------Gl~lLknFl~ 204 (207) T PRK13177 152 --YHFVPD-----DGAHVLATTDH------GG-GITAAVA--RDNIVGVQFHPEKSQAY------GLALLARFLD 204 (207) T ss_pred --EEEECC-----CCCCEEEEECC------CC-EEEEEEE--ECCEEEEECCCCCCCHH------HHHHHHHHHC T ss_conf --898538-----97416899737------99-8999999--59999993992006786------8999999877 No 38 >PRK13144 consensus Probab=99.68 E-value=1.8e-16 Score=126.17 Aligned_cols=176 Identities=23% Similarity=0.357 Sum_probs=103.6 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 96899844970647999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) |||+||.+---|- +...+|+++ .+.++. +..+..++.++|.||+||-=||+.. + ...+.+++.+. .+ T Consensus 1 mkI~IiDyg~GNi-~Sv~~al~~-~g~~~~-i~~~~~~i~~~d~lIlPGVGsf~~a-----~----~~~~~~~~~i~-~k 67 (190) T PRK13144 1 VRVGVVDYTVGNI-GSVLAALKR-AGAEPV-VVKEPEEANRVDALVLPGVGTYEAA-----Y----ALARSFKEVIL-EK 67 (190) T ss_pred CEEEEECCCCHHH-HHHHHHHHH-CCCCEE-EECCHHHHHHCCEEEECCCCCHHHH-----H----HHHHHHHHHHH-CC T ss_conf 9899992793689-999999998-499869-9789999954897897488859999-----9----99999999974-79 Q ss_pred EEEEECCCHHHHEE---------CCCHHHHHCCCCCCC-EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEEC Q ss_conf 17860640310100---------000101101244332-45422576752523577641379968998620133023308 Q gi|254780971|r 81 KVMGICNGFQILVE---------LNLLPGILMRNCSLK-FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFID 150 (219) Q Consensus 81 ~vLGICNGfQiL~e---------lGLlPg~l~~N~s~r-f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~ 150 (219) |+||||-|+|+|.+ |||+||...+=++.+ ----|..++..++ ++++. +++||+. T Consensus 68 piLGIClGmQlL~~~seE~~~~GLgli~G~v~kl~~~kvPh~GWn~v~~~~~-~~~~~---------------~~~fYFv 131 (190) T PRK13144 68 PTLAICLGMQLLFESSEEGGGRGLGIFRGRVERIKARKVPHIGWSYTRVVKP-LPFVE---------------EGYYYYL 131 (190) T ss_pred CEEEEEHHHHHHHCCCCCCCCCCEEEEEEEEEECCCCCCCEECCCEEECCCC-CCCCC---------------CCEEEEE T ss_conf 8489876588664456334787630797699987888887636600034678-87644---------------5559999 Q ss_pred HHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 677411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 151 AKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 151 ~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) .+..-...++ .....|++- |.. =+|.|. .+|++|.+.|||++-+. |++|++|++| T Consensus 132 HSY~~~~~~~-~~~~~~~~~----g~~-~~s~v~--~~Ni~G~QFHPEKS~~~------Gl~lLknFl~ 186 (190) T PRK13144 132 HSYGVRWDED-EAHVAYVEL----GRR-YVAAVE--KGHILGVQFHPERSGRA------GLELIRRFLA 186 (190) T ss_pred EEEEECCCCC-CCEEEEEEC----CCE-EEEEEE--ECCEEEEECCCHHCCHH------HHHHHHHHHH T ss_conf 9887226887-648999975----997-899999--59899997880021586------8999999999 No 39 >PRK13176 consensus Probab=99.68 E-value=4.7e-16 Score=123.50 Aligned_cols=191 Identities=23% Similarity=0.334 Sum_probs=115.9 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCC---C-CEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 96899844970647999999998549---9-6399821546445647899728734543114067752002121223322 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIG---Q-SPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKA 76 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~---~-~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~ 76 (219) |||+||.+. +.-=....+|+.+++. . ..+++..+..++.++|.|||||-=||++... .+.+...+.+.+.+.+ T Consensus 1 MkI~IIDyg-~GNi~Sv~~al~~~g~~~~i~~~i~~~~~~~~~~~~~~lILPGVGsf~~~m~--~L~~~~g~~~~i~~~~ 77 (216) T PRK13176 1 MRVAIIDYG-SGNLHSATKAFERAAREAGIAAEIDVTSDPDRVARADRIVLPGVGAFADCRR--GLDAVDGMVEALTEAV 77 (216) T ss_pred CEEEEEECC-CCHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHHCCCEEEECCCCCHHHHHH--HHHHCCCHHHHHHHHH T ss_conf 989999069-8379999999999876438896099978999983799799899877899999--8776277289999999 Q ss_pred -HCCCEEEEECCCHHHHEE----------CCCHHHHHC---CCC-CCCE-EEEEEEEEECCCHHHHHHHCCCCC-EEEEE Q ss_conf -059717860640310100----------000101101---244-3324-542257675252357764137996-89986 Q gi|254780971|r 77 -QQGIKVMGICNGFQILVE----------LNLLPGILM---RNC-SLKF-VCKQVLLEVVNSNTAFTKSYKMNQ-IIKCP 139 (219) Q Consensus 77 -~~g~~vLGICNGfQiL~e----------lGLlPg~l~---~N~-s~rf-~~r~~~~~v~~~~s~~~~~~~~~~-~l~~p 139 (219) ++++|+||||-|+|+|.+ |||++|... ..+ +.+- ---|..+++. +++++++++..++ -...+ T Consensus 78 ~~~~~piLGIClGmQlL~~~s~E~~~~~GLgli~G~V~kl~~~~~~~kvPh~GWn~v~~~-~~~~~~~~i~~~~~~~~fY 156 (216) T PRK13176 78 EKKARPFLGICVGMQLMATRGLEKTVTDGLGWIAGDVVEITPSDPSLKIPHMGWNTLDVK-RPHPVFDGIPTGDKGLHAY 156 (216) T ss_pred HHCCCCEEEEEHHHEEHHHCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCEECHHHEEEC-CCCHHHCCCCCCCCCCEEE T ss_conf 976995799853121265368678886733067249999478886764864167786867-8867773877788873589 Q ss_pred EECCCEEEEECHHHHHHHHHCCEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 2013302330867741154225036-531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 140 VAHHDGNYFIDAKGLAEIEKNNQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 140 iaHgEGrf~~~~~~l~~l~~~~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) -.|+ |++... .++..++ ..|. ... +|.|. .+|++|.+.|||++-.. |+.|++|++ T Consensus 157 FvHS---Y~~~~~-----~~~~~~a~t~yg------~~f--~a~v~--k~Ni~G~QFHPEKS~~~------Gl~ll~nFl 212 (216) T PRK13176 157 FVHS---YHLAAA-----NEADVLAHADYG------GPV--TAIVG--RDTMVGTQFHPEKSQRL------GLALIANFL 212 (216) T ss_pred EEEE---EEEECC-----CCCCEEEEEECC------CEE--EEEEE--ECCEEEECCCCCCCCHH------HHHHHHHHH T ss_conf 9975---676048-----847799999759------989--99998--39999990993102784------999999985 Q ss_pred C Q ss_conf 9 Q gi|254780971|r 219 T 219 (219) Q Consensus 219 s 219 (219) . T Consensus 213 ~ 213 (216) T PRK13176 213 K 213 (216) T ss_pred C T ss_conf 6 No 40 >PRK13142 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.57 E-value=7.5e-15 Score=115.78 Aligned_cols=171 Identities=25% Similarity=0.377 Sum_probs=99.3 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC-CCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 89984497064799999999854996399821546445647899728-73454311406775200212122332205971 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG-GFSYGDYLRCGAIAARTPVMQAIKKKAQQGIK 81 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG-GFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~ 81 (219) |+||.+ |+.-=....+|+++ .+.++. +..+..++.++|.||||| | ||++.. .-..+..+.+.+++. .++| T Consensus 2 I~IIDy-G~GNi~Sv~~al~~-lg~~~~-i~~~~~~i~~~d~lILPGVG-sF~~am---~~L~~~~l~~~i~~~--~~kp 72 (192) T PRK13142 2 IVIVDY-GLGNISNVKRAIEH-LGYEVV-VSNTSKIIDQAETIILPGVG-HFKDAM---SEIKRLNLNAILAKN--TDKK 72 (192) T ss_pred EEEEEC-CCHHHHHHHHHHHH-CCCCEE-EECCHHHHHHCCEEEECCCC-CHHHHH---HHHHHCCHHHHHHHH--CCCC T ss_conf 899976-94589999999998-599879-96899998437958975768-789999---998875859999986--2898 Q ss_pred EEEECCCHHHHEE---------CCCHHHHHCCCCCCC--EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEEC Q ss_conf 7860640310100---------000101101244332--45422576752523577641379968998620133023308 Q gi|254780971|r 82 VMGICNGFQILVE---------LNLLPGILMRNCSLK--FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFID 150 (219) Q Consensus 82 vLGICNGfQiL~e---------lGLlPg~l~~N~s~r--f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~ 150 (219) +||||-|+|+|.+ |||+||...+=.+.. ----|..++ .+.+.+. .. +.-.|. |.++ T Consensus 73 iLGICLGMQlL~~~SeEg~~~GLgli~G~V~kl~~~~~vPhmGWn~v~---~~~~~~~----~~---fYFvHS---Y~~~ 139 (192) T PRK13142 73 MIGICLGMQLMYEHSDEGDASGLGFIPGNISRIQTEYPVPHLGWNNLV---SKHPMLN----QD---VYFVHS---YQAP 139 (192) T ss_pred EEEEECCEEEEECCCCCCCCCCCCEEEEEEEEECCCCCCCEECCHHHC---CCCCCCC----CC---EEEEEE---EECC T ss_conf 589650101431145556767757786899983689978665815625---3385423----64---479987---6527 Q ss_pred HHHHHHHHHCCEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 67741154225036-531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 151 AKGLAEIEKNNQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 151 ~~~l~~l~~~~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .. +..++ ..|-. .. +|.| ..+|++|++.|||++-.. |++|++|++ T Consensus 140 ------~~-~~~~a~t~yg~------~f--~s~v--~~~Ni~G~QFHPEKS~~~------Gl~lLknFl 185 (192) T PRK13142 140 ------MS-ENVIAYAQYGA------DI--PAIV--QFNNYIGIQFHPEKSGTY------GLQILRQAI 185 (192) T ss_pred ------CC-CCEEEEEECCC------EE--EEEE--EECCEEEEECCHHHCCHH------HHHHHHHHH T ss_conf ------87-57899998899------69--9999--959999995881235585------899999996 No 41 >cd01742 GATase1_GMP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in GMP synthetase. Type 1 glutamine amidotransferase (GATase1) domain found in GMP synthetase. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. Glutamine amidotransferase (GATase) activity catalyse the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. GMP synthetase catalyses the amination of the nucleotide precursor xanthosine 5'-monophospahte to form GMP. GMP synthetase belongs to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=99.54 E-value=1.2e-13 Score=108.03 Aligned_cols=177 Identities=15% Similarity=0.296 Sum_probs=111.8 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC----CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 8998449706479999999985499639982154----644564789972873454311406775200212122332205 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD----TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ 78 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~----~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~ 78 (219) |+||.+--+|- +-.++++++ .|.++.++-.+. ....++|+++|+||- ++.. ........++..+. T Consensus 1 I~iiD~g~~~~-~si~~~l~~-~G~~~~vv~~~~~~~~~~~~~~dgvils~GP--~~~~-------~~~~~~~~~~i~~~ 69 (181) T cd01742 1 ILILDFGSQYT-HLIARRVRE-LGVYSEILPNTTPLEEIKLKNPKGIILSGGP--SSVY-------EEDAPRVDPEIFEL 69 (181) T ss_pred CEEEECCCHHH-HHHHHHHHH-CCCEEEEEECCCCHHHHHHCCCCEEEECCCC--CCCC-------CCCCCHHHHHHHHC T ss_conf 79999986389-999999997-7994999969999899973498989989999--9855-------57983330999847 Q ss_pred CCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHH Q ss_conf 97178606403101000001011012443324542257675252357764137996899862013302330867741154 Q gi|254780971|r 79 GIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIE 158 (219) Q Consensus 79 g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~ 158 (219) ++|+||||-|+|+|++. +.|.+.+.....+ .|..+.+.+ ++++++++. ....++..|++- +.++. T Consensus 70 ~~PilGIClG~Qll~~a--~Gg~v~~~~~~~~--G~~~~~~~~-~~~l~~~~~--~~~~~~~~H~~~--------v~~~p 134 (181) T cd01742 70 GVPVLGICYGMQLIAKA--LGGKVERGDKREY--GKAEIEIDD-SSPLFEGLP--DEQTVWMSHGDE--------VVKLP 134 (181) T ss_pred CCCEEEEHHHHHHHHHH--CCCCEEECCCCEE--CCEEEEEEC-CCCCCCCCC--CCEEEEEECCEE--------EEECC T ss_conf 99999985999999987--5976021898553--644789812-621005889--856788563629--------98458 Q ss_pred HCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 159 KNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 159 ~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) ++-.+..+. + ++ .++|+..++.+++|.+.|||-.. +..|.+|++|++ T Consensus 135 ~~~~~~a~s-~----~~---~v~~~~~~~~~i~gvQfHPEs~~-----s~~G~~il~NFl 181 (181) T cd01742 135 EGFKVIASS-D----NC---PVAAIANEEKKIYGVQFHPEVTH-----TEKGKEILKNFL 181 (181) T ss_pred CCEEEEEEC-C----CC---EEEEEEECCCCEEEEEECCCCCC-----CCCHHHHHHHHC T ss_conf 730799995-9----94---09999989999899995788899-----847379998429 No 42 >PRK00074 guaA GMP synthase; Reviewed Probab=99.47 E-value=8.2e-13 Score=102.77 Aligned_cols=179 Identities=21% Similarity=0.305 Sum_probs=112.4 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC---CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 68998449706479999999985499639982154---644564789972873454311406775200212122332205 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD---TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ 78 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~---~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~ 78 (219) ||+||.| ||--..=.++.+..++-.+.++-+... ..-.+..++||-||=+ | .-...+|-. -....+- T Consensus 6 ~IlIlDF-GSQytqLIaRRvRElgVysEI~p~~~~~~~i~~~~pkgIILSGGP~------S-V~~~~aP~~--d~~if~l 75 (513) T PRK00074 6 KILILDF-GSQYTQLIARRVRELGVYSEIVPYDISAEEIRAFNPKGIILSGGPA------S-VYEEGAPRA--DPEIFEL 75 (513) T ss_pred EEEEEEC-CCHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHCCCCEEEECCCCC------C-CCCCCCCCC--CHHHHHC T ss_conf 6999988-9649999999998238359996699998998504997899889996------3-689999889--9899826 Q ss_pred CCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHH Q ss_conf 97178606403101000001011012443324542257675252357764137996899862013302330867741154 Q gi|254780971|r 79 GIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIE 158 (219) Q Consensus 79 g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~ 158 (219) +.|+||||-|+|+|++. +.|...+-... |--...+.+ .++|++++++. ....+|.+|+. ...++. T Consensus 76 ~IPILGICYG~Qlia~~--~GG~V~~s~~r--EyG~a~l~i-~~~s~Lf~gl~--~~~~VWMSHgD--------~V~~lP 140 (513) T PRK00074 76 GVPVLGICYGMQLMAHQ--LGGKVERAGKR--EYGRAELEV-DGDSALFKGIP--EEQDVWMSHGD--------KVTELP 140 (513) T ss_pred CCCEEEECHHHHHHHHH--HCCEEEECCCC--EECCEEEEE-ECCCCCCCCCC--CCCEEEEECCC--------CEEECC T ss_conf 99869975889999998--59999856960--112148999-15873336899--86368875243--------014379 Q ss_pred HCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 159 KNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 159 ~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) +.-.+..+- .|+ .||+|.+++.+++|++.|||=.. +++|+.|++|++ T Consensus 141 ~gF~viA~s-----~n~---~iAai~~~~~~iyGvQFHPEV~H-----T~~G~~iL~NFl 187 (513) T PRK00074 141 EGFKVIAST-----ENC---PIAAIANEERKFYGVQFHPEVTH-----TEQGKKLLENFV 187 (513) T ss_pred CCCEEEEEC-----CCC---CEEEEEECCCCEEEEECCCEEEE-----CCCCHHHHHHHH T ss_conf 984799954-----997---36789806786788702863453-----177579999999 No 43 >PRK00784 cobyric acid synthase; Provisional Probab=99.38 E-value=6.5e-12 Score=97.00 Aligned_cols=163 Identities=24% Similarity=0.348 Sum_probs=92.5 Q ss_pred CEEEEEECCC-CC-CHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9689984497-06-479999999985499639982154644564789972873454311406775200212122332205 Q gi|254780971|r 1 MKTAIVQIPG-LN-RDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ 78 (219) Q Consensus 1 mkvaVl~~pG-sN-cd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~ 78 (219) ++|||++.|- || .|.| +|...-+....++ ....+|.+.|++||||.=+--.+| .++..+.+.+++++++++ T Consensus 254 l~IaVi~lP~iSNftDfd---pL~~e~~V~l~~v-~~~~~l~~~D~vIlPGsK~Ti~DL---~~Lr~~Gl~~~i~~~~~~ 326 (492) T PRK00784 254 LRVAVPRLPRISNFTDFD---PLRAEPGVDVRYV-RPGEPLPDADLVILPGSKNTIADL---AWLRESGWDEAIRAHARR 326 (492) T ss_pred EEEEEEECCCCCCCCCCH---HHHHCCCEEEEEE-CCCCCCCCCCEEEECCCCCCHHHH---HHHHHCCHHHHHHHHHHC T ss_conf 589998067556865443---5640787289997-897656899999979986648999---999986949999999975 Q ss_pred CCEEEEECCCHHHHEE-----------------CCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEE Q ss_conf 9717860640310100-----------------00010110124433245422576752523577641379968998620 Q gi|254780971|r 79 GIKVMGICNGFQILVE-----------------LNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVA 141 (219) Q Consensus 79 g~~vLGICNGfQiL~e-----------------lGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~pia 141 (219) |++|+|||.|||||-+ |||||- .-.|+......++... ..+..+. .+-+.=|- T Consensus 327 G~~V~GICGGyQMLG~~I~DP~gvE~~~~~~~GLGLLp~------~T~f~~~K~~~~~~~~-~~~~~g~---~v~GYEIH 396 (492) T PRK00784 327 GGPVLGICGGYQMLGRTIADPDGVEGAPGEVEGLGLLDV------ETVFTPEKTLRQVSGL-LLLGSGA---PVSGYEIH 396 (492) T ss_pred CCCEEEECCHHHHHCCEECCCCCCCCCCCCCCCCCCEEE------EEEECCCEEEEEEEEE-ECCCCCC---CEEEEEEE T ss_conf 997899870288630200386566678775465231347------9997587089988999-8788998---22257876 Q ss_pred CCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCC Q ss_conf 13302330867741154225036531688888988112668886999899984884 Q gi|254780971|r 142 HHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHP 197 (219) Q Consensus 142 HgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHP 197 (219) ||.=+ ..+.....++..+... --|.++++|||+|.-=|= T Consensus 397 ~G~T~----------~~~~~~p~~~~~~~~~-------~dG~~~~dG~V~GTYlHG 435 (492) T PRK00784 397 MGRTT----------GPALARPFLRLDDGGR-------PDGAVSADGRVFGTYLHG 435 (492) T ss_pred CCEEC----------CCCCCCCCEEECCCCC-------CCCEECCCCCEEEEECHH T ss_conf 65651----------6866676244457888-------671287899899997321 No 44 >PRK07765 para-aminobenzoate synthase component II; Provisional Probab=99.38 E-value=9.1e-12 Score=96.08 Aligned_cols=183 Identities=18% Similarity=0.200 Sum_probs=111.5 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCC-------CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9689984497064799999999854996399821546-------445647899728734543114067752002121223 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDT-------DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIK 73 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~-------~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~ 73 (219) ||+.||-+-.|=.-. .++.+.. .+..+.++..++. ...++|+|||-||=+ +.-.+| ...+.++ T Consensus 1 MrILiIDn~DSFT~n-i~~~lr~-lg~~v~V~~~d~~~~~~~~~~~~~~dgIILSpGPg--~P~~~~------~~~~~i~ 70 (221) T PRK07765 1 MRILVVDNYDSFVFN-LVQYLGQ-LGVEAEVWRNDDVRLADEAAVAAGFDGVLLSPGPG--TPERAG------ASIDMVR 70 (221) T ss_pred CEEEEEECCCCHHHH-HHHHHHH-CCCEEEEEECCCCCHHHHHHHHCCCCEEEECCCCC--CHHHCC------CCHHHHH T ss_conf 979999388933999-9999987-79918999899899889987634899899969999--801167------2488998 Q ss_pred HHHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHH Q ss_conf 32205971786064031010000010110124433245422576752523577641379968998620133023308677 Q gi|254780971|r 74 KKAQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKG 153 (219) Q Consensus 74 ~~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~ 153 (219) ++.+.+.|+||||-|+|+|++. +.|.+.+-....|- .+..-.. ..+..+.++. ..+..+..|+..- .. T Consensus 71 ~~~~~~iPILGIClG~Q~ia~~--~Gg~v~~~~~~~hG--~~~~i~~-~~~~lf~gl~--~~~~~~ryHS~~v---~~-- 138 (221) T PRK07765 71 ACAAAGTPLLGVCLGHQAIGVA--FGATVDRAPELLHG--KTSSVHH-TNVGVLQGLP--DPFTATRYHSLTI---LP-- 138 (221) T ss_pred HHHCCCCCEEEEEHHHHHHHHH--HCCEEEECCCCCEE--EEEEEEE-CCCCCCCCCC--CCEEEEEEEEEEE---EE-- T ss_conf 7532599889870999999998--39789888955365--6879998-9987615899--9747887421899---76-- Q ss_pred HHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 154 LAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 154 l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) +.+.+ +..+.-.. .|| -|+|+-.++.+++|.+.|||=.. ++.|..|++|+++ T Consensus 139 -~~lP~-~~~v~A~s----~~~---~ima~~h~~~~i~GVQFHPEs~~-----T~~G~~ll~NFl~ 190 (221) T PRK07765 139 -ETLPA-ELEVTART----DSG---VIMAVRHRELPIHGVQFHPESIL-----TEGGHRMLANWLT 190 (221) T ss_pred -CCCCC-CCEEEEEC----CCC---CEEEEEECCCCEEEEEECCCCCC-----CCCHHHHHHHHHH T ss_conf -15899-62899984----998---68658707897899997885357-----8166999999999 No 45 >cd01743 GATase1_Anthranilate_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in Anthranilate synthase. Type 1 glutamine amidotransferase (GATase1) domain found in Anthranilate synthase (ASase). This group contains proteins similar to para-aminobenzoate (PABA) synthase and ASase. These enzymes catalyze similar reactions and produce similar products, PABA and ortho-aminobenzoate (anthranilate). Each enzyme is composed of non-identical subunits: a glutamine amidotransferase subunit (component II) and a subunit that produces an aminobenzoate products (component I). ASase catalyses the synthesis of anthranilate from chorismate and glutamine and is a tetrameric protein comprising two copies each of components I and II. Component II of ASase belongs to the family of triad GTases which hydrolyze glutamine and transfer nascent ammonia between the active sites. In some bacteria, such as Escherichia coli, component II can be much larger than in other organisms, due to the prese Probab=99.38 E-value=1.7e-11 Score=94.37 Aligned_cols=178 Identities=19% Similarity=0.245 Sum_probs=105.1 Q ss_pred EEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCC-----CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9984497064799999999854996399821546-----44564789972873454311406775200212122332205 Q gi|254780971|r 4 AIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDT-----DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ 78 (219) Q Consensus 4 aVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~-----~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~ 78 (219) .||.+.++ =..-.++.+.+ .+..+.++-.++. +..++|++||-||= |+.- .......+.+..++ T Consensus 2 LiiD~~ds-ft~~i~r~lr~-~g~~~~v~p~~~~~~~~~~~~~~~giiLS~GP--g~p~-------~~~~~~~~~~~l~~ 70 (184) T cd01743 2 LLIDNYDS-FTYNLVQYLRE-LGAEVVVVRNDEITLEELELLNPDAIVISPGP--GHPE-------DAGISLEIIRALAG 70 (184) T ss_pred EEEECCCC-HHHHHHHHHHH-CCCEEEEEECCCCCHHHHHHCCCCEEEECCCC--CCHH-------HCCCHHHHHHHHHC T ss_conf 99937981-79999999986-79959999899999999973297999999999--8925-------61460999999846 Q ss_pred CCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHH Q ss_conf 97178606403101000001011012443324542257675252357764137996899862013302330867741154 Q gi|254780971|r 79 GIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIE 158 (219) Q Consensus 79 g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~ 158 (219) ++|+||||-|+|+|.+. +.|.+.+.+..++ ..+..+... ++..+.++. ....++..|+.. .+...+ T Consensus 71 ~iPiLGIC~G~Qll~~~--~Gg~v~~~~~~~~-G~~~~i~~~--~~~~~~~~~--~~~~v~~~Hs~~---~~~~~~---- 136 (184) T cd01743 71 KVPILGVCLGHQAIAEA--FGGKVVRAPEPMH-GKTSEIHHD--GSGLFKGLP--QPFTVGRYHSLV---VDPDPL---- 136 (184) T ss_pred CCCEEEECHHHHHHHHH--CCCEEEECCCCEE-CEEEEEEEC--CCCCEECCC--CCCEEEEEEEEE---EECCCC---- T ss_conf 99899980999999998--3984874575140-545899977--976364278--888898861468---981357---- Q ss_pred HCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 159 KNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 159 ~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .....+..+. +++ .|+++..++.+++|.|.||| +.. +.+|+.|++|++ T Consensus 137 ~~~~~~~a~~----~~~---~i~ai~~~~~~i~GvQFHPE-s~~----t~~G~~il~nFl 184 (184) T cd01743 137 PDLLEVTAST----EDG---VIMALRHRDLPIYGVQFHPE-SIL----TEYGLRLLENFL 184 (184) T ss_pred CCCCEEEEEC----CCC---CEEEEEECCCCEEEEEECCC-CCC----CCCHHHHHHHHC T ss_conf 8750799988----998---49999989999899983687-689----827389998439 No 46 >PRK06895 para-aminobenzoate synthase component II; Provisional Probab=99.38 E-value=1.4e-11 Score=94.87 Aligned_cols=183 Identities=17% Similarity=0.235 Sum_probs=111.4 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC---CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 968998449706479999999985499639982154---64456478997287345431140677520021212233220 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD---TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQ 77 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~---~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~ 77 (219) .||.||.+-.|= .+-.++.+.+ .+..+.++-.++ ..+.+++++||-||= |+.-.++ .+.+ +.+... T Consensus 3 ~kILiiD~~dsf-T~ni~~~lr~-lg~~~~v~~~~~~~~~~~~~~~~IIlSpGP--g~p~~~~------~~~~-~i~~~~ 71 (191) T PRK06895 3 TNLLIINNHDSF-TFNLVDLIRK-LGVPMKVVNVEDLDLDEVENFSHILISPGP--DVPRAYP------QLFA-MLERYY 71 (191) T ss_pred CCEEEEECCCHH-HHHHHHHHHH-CCCEEEEEECCCCCHHHHHCCCEEEEECCC--CCCCCCC------HHHH-HHHHHC T ss_conf 869999698848-9999999987-799699984784899898558969990899--9801052------2499-999860 Q ss_pred CCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHH Q ss_conf 59717860640310100000101101244332454225767525235776413799689986201330233086774115 Q gi|254780971|r 78 QGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEI 157 (219) Q Consensus 78 ~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l 157 (219) .++|+||||-|+|+|++. +.|.+.+-+..++ -+...+++ .+++++++++. ..+.+|..|+.. .+++ .+ T Consensus 72 ~~~PILGIClG~Q~i~~~--~Gg~v~~~~~~~h-G~~~~i~~-~~~~~lf~g~~--~~~~v~~yHs~~---v~~~---~l 139 (191) T PRK06895 72 QQKSILGVCLGHQTLCEF--FGGTLYNLENVRH-GQQRTLKV-RSNSPLFDGLP--EEFNIGLYHSWA---VSQE---NF 139 (191) T ss_pred CCCCEEEECHHHHHHHHH--HCCEEEECCCCEE-EEEEEEEE-CCCCCCCCCCC--CCEEEEEEEEEE---EECC---CC T ss_conf 689878982999999998--2957996887455-22789999-18983465899--974876667778---7605---78 Q ss_pred HHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 42250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 158 EKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 158 ~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) . .+..+.-+++ ++ .|+|+-.++.+++|.+.|||=.. +++|+.|++|+++ T Consensus 140 p-~~~~v~a~~~----~~---~i~a~~h~~~~i~GvQFHPEs~~-----T~~G~~il~NFl~ 188 (191) T PRK06895 140 P-TPLEITAVCD----EE---VVMAFQHKHLPIYGVQFHPESYI-----SEFGEQILRNWLA 188 (191) T ss_pred C-CCEEEEEECC----CC---EEEEEEECCCCEEEEEECCCCCC-----CCCHHHHHHHHHC T ss_conf 9-7469999759----98---09988848998999982884688-----8785999999865 No 47 >TIGR00888 guaA_Nterm GMP synthase, N-terminal domain; InterPro: IPR004739 The N-terminal region of GMP synthase 6.3.5.2 from EC is recognised by this signature. GMP synthase catalyzes the synthesis of GMP from XMP , .ATP + xanthosine 5'-phosphate + L-glutamine + H(2)O = AMP + diphosphate + GMP + L-glutamate ; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006177 GMP biosynthetic process. Probab=99.37 E-value=1.2e-12 Score=101.66 Aligned_cols=165 Identities=21% Similarity=0.324 Sum_probs=106.2 Q ss_pred HHHHHHHCCCCEEEEECCC--CCCCC--CCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHE Q ss_conf 9999985499639982154--64456--4789972873454311406775200212122332205971786064031010 Q gi|254780971|r 18 IKAITKIIGQSPILVWQSD--TDIPD--VDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILV 93 (219) Q Consensus 18 ~~A~~~~~~~~~~~v~~~~--~~l~~--~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~ 93 (219) +|-+. ..+....++-++. ++|.+ -.||||=||=+ .+| + ....+...+-.+-|.||||||-|+|+|+ T Consensus 15 aRRlR-E~gVysE~~p~t~~~eeI~~~~P~GiILSGGP~-sv~------~--~n~~~~~~~if~LgVPvLGICYG~Ql~A 84 (195) T TIGR00888 15 ARRLR-ELGVYSEIVPNTTPLEEIKEKNPKGIILSGGPA-SVY------A--ENAPEADEKIFELGVPVLGICYGMQLMA 84 (195) T ss_pred EEEEE-ECCCEEEEEECCCCHHHHHHCCCCEEEEECCCC-CCC------C--CCCHHHHHHHHHCCCCEEEEEHHHHHHH T ss_conf 12201-014135784077877998612986799727887-646------7--8850224888627996887624578999 Q ss_pred ECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCC------EEEEEEECCCEEEEECHHHHHHHHHCCEEEEEC Q ss_conf 00001011012443324542257675252357764137996------899862013302330867741154225036531 Q gi|254780971|r 94 ELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQ------IIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRY 167 (219) Q Consensus 94 elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~------~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y 167 (219) +. +.|...+ +.+-|=-.+.++|.+++ ..++++. . .+.+|.+|+ |+ .++|-+.=-|..+ T Consensus 85 ~~--lGG~V~~--~~~~EYG~a~l~i~~~~-~LF~glp--~~LTA~P~~~VWMSH~-------D~-V~~lP~gF~vlA~- 148 (195) T TIGR00888 85 KQ--LGGEVGR--AEKREYGKAELEILDED-DLFEGLP--KALTAKPESTVWMSHG-------DK-VKELPEGFKVLAT- 148 (195) T ss_pred HH--HCCEEEE--CCCCCCCEEEEEEECCC-HHHHCCC--HHHCCCCCCCCCCCCC-------CE-EEECCCCCEEEEE- T ss_conf 98--2987765--68887313899984288-1230780--3534688887112346-------24-2008699768875- Q ss_pred CCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 6888889881126688869998999848843432210067762899986429 Q gi|254780971|r 168 ASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 168 ~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) -+|-|+ +|++-+++.|++|.+.|||= .+ +..|..|++|++. T Consensus 149 ----s~~hcP--~aAm~~~~K~~YGVQFHPEV-~H----t~~G~~~L~NF~~ 189 (195) T TIGR00888 149 ----SDNHCP--VAAMEHEDKPIYGVQFHPEV-TH----TEYGKELLKNFVK 189 (195) T ss_pred ----CCCCCC--CCEEECCCCCEEEEEECCCE-EE----CHHHHHHHHHHHH T ss_conf ----289881--00000489966666517831-21----5345789998887 No 48 >pfam00117 GATase Glutamine amidotransferase class-I. Probab=99.35 E-value=1.7e-11 Score=94.41 Aligned_cols=178 Identities=19% Similarity=0.289 Sum_probs=103.3 Q ss_pred EEECCCCCCHHHHHHHHHHHCCCCEEEEECCCC-----CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 984497064799999999854996399821546-----445647899728734543114067752002121223322059 Q gi|254780971|r 5 IVQIPGLNRDNDMIKAITKIIGQSPILVWQSDT-----DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 5 Vl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~-----~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) ||-+-.|+- +-.++++++ .+..+.++..+.. ++.++|++||.||. |+.-.+ ....+.++++.+.+ T Consensus 2 iiD~~dsft-~~i~~~l~~-~g~~~~v~~~~~~~~~~~~~~~~d~iilspGp--g~p~~~------~~~~~~i~~~~~~~ 71 (187) T pfam00117 2 LIDNGDSFT-YNLARALRE-LGVEVEVVPNDTPAEEILELLNPDGIIISPGP--GSPGDA------GGAIEAIKELRENK 71 (187) T ss_pred EEECCCCHH-HHHHHHHHH-CCCEEEEEECCCCHHHHHHHCCCCEEEECCCC--CCHHHC------CCHHHHHHHHHHCC T ss_conf 893786389-999999986-89979999899986999852599989991998--611013------45899999999779 Q ss_pred CEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHH Q ss_conf 71786064031010000010110124433245422576752523577641379968998620133023308677411542 Q gi|254780971|r 80 IKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEK 159 (219) Q Consensus 80 ~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~ 159 (219) +|+||||-|+|+|... +.|.+.+.+.... .|. ..+.. ++..+.+.. .....+-.|+. .++.+ .+. T Consensus 72 ~PiLGIClG~Qll~~~--~Gg~v~~~~~~~~--G~~-~~i~~-~~~~~~~~~--~~~~~~~~Hs~---~v~~~---~lp- 136 (187) T pfam00117 72 IPILGICLGHQLLALA--FGGKVIKAKEPHH--GKN-SPVGH-DKGLFYGLP--NVFIVRRYHSY---AVDPD---TLP- 136 (187) T ss_pred CCEEEEEHHHHHHHHH--CCCCCCCCCEEEC--CEE-EEECC-CHHHHCCCC--CCCEEEEEEEE---EEEEC---CCC- T ss_conf 9899998889999997--1998644310105--704-89732-116442799--77377760348---99862---578- Q ss_pred CCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 25036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 160 NNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 160 ~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .+..+.-+. .++. -|.|+-..+.+++|+|.|||-+. +++|..+++|++ T Consensus 137 ~~~~i~a~s----~~~~--~i~a~~~~~~~~~gvQfHPE~~~-----t~~G~~ll~nfl 184 (187) T pfam00117 137 DGLEVTAAS----DNGG--EIMAIRHKENPIFGVQFHPESSL-----TPGGPELLFNFL 184 (187) T ss_pred CCEEEEEEC----CCCC--CEEEEEECCCCEEEEEECCCCCC-----CCCCHHHHHHHH T ss_conf 862999986----8887--18999988998999995997799-----977048998777 No 49 >PRK13526 glutamine amidotransferase subunit PdxT; Provisional Probab=99.30 E-value=1.5e-11 Score=94.72 Aligned_cols=162 Identities=25% Similarity=0.342 Sum_probs=109.7 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 96899844970647999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) |||+||-.+|. -.|-.+++++ .+.++..| ....+|++.|+||||||=| ..-++.+.+..+.+.++++++. + T Consensus 3 ~kIGVLaLQG~--~~eH~~~l~~-lg~~~~~V-r~~~dL~~idgLIiPGGES----Tti~~ll~~~~L~~~l~~~~~~-~ 73 (179) T PRK13526 3 QKVGVLAIQGG--YQKHADMFKS-LGVEVKLV-KFNNDFDSIDRLVIPGGES----TTLLNLLNKHQIFDKLYNFCSS-K 73 (179) T ss_pred EEEEEEEECCC--HHHHHHHHHH-CCCCEEEE-CCHHHHHCCCEEEECCCCH----HHHHHHHHHCCCHHHHHHHHHC-C T ss_conf 38999996288--7999999998-79968997-1788860077589759977----8999997675549999999858-9 Q ss_pred EEEEECCCHHHHEE----CCCHHHHHCCCCCCCEEEEEE-EEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHH Q ss_conf 17860640310100----000101101244332454225-7675252357764137996899862013302330867741 Q gi|254780971|r 81 KVMGICNGFQILVE----LNLLPGILMRNCSLKFVCKQV-LLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLA 155 (219) Q Consensus 81 ~vLGICNGfQiL~e----lGLlPg~l~~N~s~rf~~r~~-~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~ 155 (219) |++|.|-|.-+|.+ +|+++-+..+|.-+|=..-+. .+.+ . +..+ .+-|.. ..... T Consensus 74 Pv~GTCAGlILLak~~~~lg~ldi~V~RNayGrQ~~SFe~~~~~-----------~-~~~~-------~avFIR-AP~I~ 133 (179) T PRK13526 74 PVFGTCAGSIILSKGEGYLNLLDLEVQRNAYGRQVDSFVADISF-----------N-DKNI-------TGVFIR-APKFI 133 (179) T ss_pred CEEEECCEEEEEECCCCCCCCEEEEEEECCCCCCCCEEEEECCC-----------C-CCCE-------EEEEEC-CCCEE T ss_conf 90886120689826987667135899825545743205871365-----------8-9953-------699982-75188 Q ss_pred HHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCC Q ss_conf 15422503653168888898811266888699989998488434322 Q gi|254780971|r 156 EIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIE 202 (219) Q Consensus 156 ~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~ 202 (219) ++-++-.|..+|.+ +| +| -..||++|.=.|||=+-+ T Consensus 134 ~vg~~V~vLa~~~~--~~------V~---vrqgn~la~sFHPELt~D 169 (179) T PRK13526 134 VVGNQVDILSKYQN--SP------VL---LRQANILVSSFHPELTQD 169 (179) T ss_pred ECCCCCEEEEEECC--EE------EE---EEECCEEEEEECCCCCCC T ss_conf 80899589999899--88------99---998999999828875487 No 50 >PRK00758 GMP synthase subunit A; Validated Probab=99.29 E-value=7.3e-11 Score=90.30 Aligned_cols=175 Identities=17% Similarity=0.305 Sum_probs=104.9 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCC----CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 6899844970647999999998549963998215464----456478997287345431140677520021212233220 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTD----IPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQ 77 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~----l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~ 77 (219) ||+||-+--|.. +-.++++.+ .+..+.++-+ +.. +.++|+|+|.+|=+ ...++ ...+.++ + T Consensus 1 kIlviD~~dsft-~ni~~~l~~-~g~~v~v~~~-~~~~~~i~~~~dgiilS~GP~---~~~~~------~~~~~i~---~ 65 (184) T PRK00758 1 KILVINNGGQYN-HLIHRTLRY-LGVDAKIIPN-TTPLEEIKAEPDGIILSGGPE---IERAG------NCEEYLL---E 65 (184) T ss_pred CEEEEECCCHHH-HHHHHHHHH-CCCEEEEEEC-CCCHHHHHHCCCEEEECCCCC---CCCCC------CHHHHHH---H T ss_conf 999998986389-999999997-8992899959-898999963899899889998---33322------0699998---6 Q ss_pred CCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHH Q ss_conf 59717860640310100000101101244332454225767525235776413799689986201330233086774115 Q gi|254780971|r 78 QGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEI 157 (219) Q Consensus 78 ~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l 157 (219) .+.|+||||-|+|+|.+. +.|.+.+.+...+ .+..+.+.+ ++..++++. +.+..|..|++- .+.+ T Consensus 66 ~~iPiLGIClG~Q~l~~~--~Gg~v~~~~~~~~--G~~~i~~~~-~~~lf~~~~--~~~~v~~~H~~~--------v~~~ 130 (184) T PRK00758 66 LDVPILGICLGHQLIAKA--FGGEVGRAEKGEY--ALVEVEILD-EDDILRGLP--PEITVWASHADE--------VKEL 130 (184) T ss_pred CCCCEEEEEHHHHHHHHH--CCCCCCCCCCEEE--CCEEEEEEC-CCCCCCCCC--CCCEEEEEEEEE--------EEEC T ss_conf 599889971776665876--6993302762154--336899973-887511158--863787640058--------9965 Q ss_pred HHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 42250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 158 EKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 158 ~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) .+ +..++.+.+ ++ .|.|+-+++.+++|.+.|||-. .+++|+.+|+|++. T Consensus 131 p~-~~~v~a~s~----~~---~v~~i~~~~~~i~gvQFHPEs~-----~t~~G~~ll~nFl~ 179 (184) T PRK00758 131 PD-GFEILARSD----IC---EVEAMKHKDKPIYGVQFHPEVS-----HTEKGEEIFKNFIK 179 (184) T ss_pred CC-CEEEEEECC----CC---CEEEEEECCCCEEEEEECCCCC-----CCCCHHHHHHHHHH T ss_conf 87-159999849----99---8899998999999999188479-----99278999999999 No 51 >COG0311 PDX2 Predicted glutamine amidotransferase involved in pyridoxine biosynthesis [Coenzyme metabolism] Probab=99.25 E-value=6.1e-11 Score=90.80 Aligned_cols=168 Identities=24% Similarity=0.403 Sum_probs=107.3 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 96899844970647999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) |||+||-.+|- -+|-..+++++++.++..+ ....+|++.|+||||||=| ..-++++.+..+++.+++++++|+ T Consensus 1 m~IGVLalQG~--v~EH~~~l~~~~~~e~~~V-k~~~dL~~~d~LIiPGGES----TTi~rL~~~~gl~e~l~~~~~~G~ 73 (194) T COG0311 1 MKIGVLALQGA--VEEHLEALEKAGGAEVVEV-KRPEDLEGVDGLIIPGGES----TTIGRLLKRYGLLEPLREFIADGL 73 (194) T ss_pred CEEEEEEECCC--HHHHHHHHHHHCCCCEEEE-CCHHHHCCCCEEEECCCCH----HHHHHHHHHCCCHHHHHHHHHCCC T ss_conf 94799985046--8999999986058735997-1778855376789549507----899999987384899999997699 Q ss_pred EEEEECCCHHHHEE----------CCCHHHHHCCCCCCCEE-EEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEE Q ss_conf 17860640310100----------00010110124433245-42257675252357764137996899862013302330 Q gi|254780971|r 81 KVMGICNGFQILVE----------LNLLPGILMRNCSLKFV-CKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFI 149 (219) Q Consensus 81 ~vLGICNGfQiL~e----------lGLlPg~l~~N~s~rf~-~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~ 149 (219) |++|.|-|.-+|.+ ||++--+..+|.-+|=. |=...+.+..... ..+ ...-|.. T Consensus 74 Pv~GTCAGlIlLakei~~~~~~~~Lg~mdi~V~RNAfGRQ~dSFe~~~di~~~~~------------~~~---~~avFIR 138 (194) T COG0311 74 PVFGTCAGLILLAKEILDGPEQPLLGLLDVTVRRNAFGRQVDSFETELDIEGFGL------------PFP---FPAVFIR 138 (194) T ss_pred CEEEECHHHHHHHHHHCCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEEECCCC------------CCC---CEEEEEE T ss_conf 6477514256534655089877521227899981365543123010678641367------------776---2399998 Q ss_pred CHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCC Q ss_conf 86774115422503653168888898811266888699989998488434322 Q gi|254780971|r 150 DAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIE 202 (219) Q Consensus 150 ~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~ 202 (219) .+ ...+.-+.-++..++.+ .|.+. +.||+||+-.|||=+-+ T Consensus 139 AP-~I~~vg~~V~vLa~l~~---------~iVav--~qgn~LatsFHPELT~D 179 (194) T COG0311 139 AP-VIEEVGDGVEVLATLDG---------RIVAV--KQGNILATSFHPELTDD 179 (194) T ss_pred CC-EEEHHCCCCEEEEECCC---------EEEEE--EECCEEEEECCCCCCCC T ss_conf 64-02000676067666088---------79999--86788998517543587 No 52 >PRK08857 para-aminobenzoate synthase component II; Provisional Probab=99.23 E-value=1.6e-10 Score=88.21 Aligned_cols=174 Identities=17% Similarity=0.281 Sum_probs=102.2 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCC-----CCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 9999999985499639982154644-----56478997287345431140677520021212233220597178606403 Q gi|254780971|r 15 NDMIKAITKIIGQSPILVWQSDTDI-----PDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGF 89 (219) Q Consensus 15 ~e~~~A~~~~~~~~~~~v~~~~~~l-----~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGf 89 (219) +-.++.+.. .+..+.++..++.++ .+.++|||-||= |+.-.++ ....+.+....+.|+||||-|+ T Consensus 13 ~ni~~~l~~-lg~~v~vv~~d~~~~~~i~~~~p~~IilS~GP--g~p~~~~-------~~~~~~~~~~~~iPILGIClG~ 82 (192) T PRK08857 13 YNLYQYFCE-LGAEVKVVRNDEIDIAGIEALNPSHLVISPGP--CTPNEAG-------ISLQAIEHFAGKLPILGVCLGH 82 (192) T ss_pred HHHHHHHHH-CCCEEEEEECCCCCHHHHHHCCCCEEEECCCC--CCHHHCC-------CCHHHHHHHCCCCCEEEECHHH T ss_conf 999999987-79928999699899999984297959998999--9967828-------6146699735799989987999 Q ss_pred HHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCC Q ss_conf 10100000101101244332454225767525235776413799689986201330233086774115422503653168 Q gi|254780971|r 90 QILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYAS 169 (219) Q Consensus 90 QiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d 169 (219) |+|++. +.|.+.+-+...|- +-..+ ..+.+..++++. ..+.+|..|+.. .+.+ .+.+.-.+.. ..+ T Consensus 83 Q~ia~~--~Gg~v~~~~~~~hG-~~~~i--~~~~~~lf~~~~--~~~~v~~~Hs~~---v~~~---~lp~~~~v~a-~t~ 148 (192) T PRK08857 83 QAIAQV--FGGDVVRARQVMHG-KTSPI--RHTGRSVFKGLN--NPLTVTRYHSLV---VKND---TLPECFELTA-WTE 148 (192) T ss_pred HHHHHH--HCCEEEECCCCEEE-EEEEE--EECCCEEEECCC--CCEEEEECCCEE---EECC---CCCCCEEEEE-EEE T ss_conf 999998--39829988843251-15799--989930473699--887989625527---8957---4899829999-972 Q ss_pred CCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 88889881126688869998999848843432210067762899986429 Q gi|254780971|r 170 GTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 170 ~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) +-+|+...|+|+-+++.+++|.+.|||= +. +.+|..|++|++| T Consensus 149 --~~~~~~~~i~a~~h~~~~i~GVQFHPEs-~~----T~~G~~il~NFl~ 191 (192) T PRK08857 149 --LEDGSMDEIMGFQHKTLPIEAVQFHPES-IK----TEQGHQLLANFLA 191 (192) T ss_pred --CCCCCCCCEEEEEECCCCEEEEEECCCC-CC----CCCHHHHHHHHHC T ss_conf --6889977053888389998999877977-99----8474899999726 No 53 >cd01744 GATase1_CPSase Small chain of the glutamine-dependent form of carbamoyl phosphate synthase, CPSase II. This group of sequences represents the small chain of the glutamine-dependent form of carbamoyl phosphate synthase, CPSase II. CPSase II catalyzes the production of carbomyl phosphate (CP) from bicarbonate, glutamine and two molecules of MgATP. The reaction is believed to proceed by a series of four biochemical reactions involving a minimum of three discrete highly reactive intermediates. The synthesis of CP is critical for the initiation of two separate biosynthetic pathways. In one CP is coupled to aspartate, its carbon and nitrogen nuclei ultimately incorporated into the aromatic moieties of pyrimidine nucleotides. In the second pathway CP is condensed with ornithine at the start of the urea cycle and is utilized for the detoxification of ammonia and biosynthesis of arginine. CPSases may be encoded by one or by several genes, depending on the species. The E.coli enzyme is Probab=99.22 E-value=1.6e-10 Score=88.15 Aligned_cols=174 Identities=19% Similarity=0.222 Sum_probs=96.9 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC----CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 8998449706479999999985499639982154----644564789972873454311406775200212122332205 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD----TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ 78 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~----~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~ 78 (219) |+||.|-. .+-.++.+.++ +..+.++-.+. ..-.++++|||-||= |+.. ......+.+++..+. T Consensus 1 i~ilDfGs---qy~i~r~lrel-g~~~~v~~~~~~~~~i~~~~p~giIlS~GP--g~p~------~~~~~~~~~~~~~~~ 68 (178) T cd01744 1 VVVIDFGV---KHNILRELLKR-GCEVTVVPYNTDAEEILKLDPDGIFLSNGP--GDPA------LLDEAIKTVRKLLGK 68 (178) T ss_pred CEEEECCC---HHHHHHHHHHC-CCEEEEEECCCCHHHHHHCCCCEEEECCCC--CCHH------HHCCHHHHHHHHHHC T ss_conf 99998874---99999999987-997999939899999984199979989999--9957------734449999999746 Q ss_pred CCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHH Q ss_conf 97178606403101000001011012443324542257675252357764137996899862013302330867741154 Q gi|254780971|r 79 GIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIE 158 (219) Q Consensus 79 g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~ 158 (219) +.|+||||-|+|+|++. +.|...+-+.+++-. +.+.... ..+.+...+..|+. ..+.+ .+. T Consensus 69 ~iPILGIClG~Q~ia~~--~Gg~v~k~~~g~~G~----------n~pv~~~-~~~~~~~~~~~H~~---~v~~~---~lp 129 (178) T cd01744 69 KIPIFGICLGHQLLALA--LGAKTYKMKFGHRGS----------NHPVKDL-ITGRVYITSQNHGY---AVDPD---SLP 129 (178) T ss_pred CCCEEEECHHHHHHHHH--CCCCCCCCCCCCCCC----------CCCCEEC-CCCCEEEEECCCCE---EEEEC---CCC T ss_conf 99889981217889988--198300156672478----------7121676-88838886447554---87624---589 Q ss_pred HCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 159 KNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 159 ~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) ++-.+.. + +.| ..-|+||.+++.+++|.+.|||-+ ++..||..||+|++ T Consensus 130 ~~~~v~a-~----s~~--d~~i~ai~h~~~~i~GVQFHPEs~----~~~~~~~~if~nFl 178 (178) T cd01744 130 GGLEVTH-V----NLN--DGTVEGIRHKDLPVFSVQFHPEAS----PGPHDTEYLFDEFL 178 (178) T ss_pred CCCEEEE-E----CCC--CCCEEEEEECCCCEEEEEECCCCC----CCCCCCHHHHHHHC T ss_conf 9857999-6----489--985799998999999998599799----99977489897239 No 54 >cd01750 GATase1_CobQ Type 1 glutamine amidotransferase (GATase1) domain found in Cobyric Acid Synthase (CobQ). Type 1 glutamine amidotransferase (GATase1) domain found in Cobyric Acid Synthase (CobQ). CobQ plays a role in cobalamin biosythesis. CobQ catalyses amidations at positions B, D, E, and G on adenosylcobyrinic A,C-diamide in the biosynthesis of cobalamin. CobQ belongs to the triad family of amidotransferases. Two of the three residues of the catalytic triad that are involved in glutamine binding, hydrolysis and transfer of the resulting ammonia to the acceptor substrate in other triad aminodotransferases are conserved in CobQ. Probab=99.21 E-value=3.1e-10 Score=86.25 Aligned_cols=86 Identities=29% Similarity=0.429 Sum_probs=61.5 Q ss_pred EEEEECCC-CCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 89984497-06479999999985499639982154644564789972873454311406775200212122332205971 Q gi|254780971|r 3 TAIVQIPG-LNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIK 81 (219) Q Consensus 3 vaVl~~pG-sNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~ 81 (219) +||++.|- ||=+ -..++...-+....++.. ...+.++|+|||||+=+.-..| .+...+.+.+++++++++|+| T Consensus 1 iavi~lp~isNft--Df~~l~~~~~v~v~~v~~-~~~l~~~D~iIlPGsK~t~~DL---~~l~~~Gl~~~i~~~~~~g~~ 74 (194) T cd01750 1 IAVIRYPDISNFT--DLDPLAREPGVDVRYVEV-PEGLGDADLIILPGSKDTIQDL---AWLRKRGLAEAIKNYARAGGP 74 (194) T ss_pred CEEEECCCCCCHH--HHHHHHCCCCCEEEEECC-CCCCCCCCEEEECCCCCHHHHH---HHHHHCCHHHHHHHHHHCCCC T ss_conf 9788269746645--547760069938999679-7764669999989988389999---999874979999999975997 Q ss_pred EEEECCCHHHHEE Q ss_conf 7860640310100 Q gi|254780971|r 82 VMGICNGFQILVE 94 (219) Q Consensus 82 vLGICNGfQiL~e 94 (219) +||||.|+|||-+ T Consensus 75 vlgICGGyQmLG~ 87 (194) T cd01750 75 VLGICGGYQMLGK 87 (194) T ss_pred EEEECHHHHHHHC T ss_conf 8997365673000 No 55 >PRK05670 anthranilate synthase component II; Provisional Probab=99.21 E-value=2.6e-10 Score=86.82 Aligned_cols=180 Identities=17% Similarity=0.295 Sum_probs=107.1 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCC-----CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 89984497064799999999854996399821546-----4456478997287345431140677520021212233220 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDT-----DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQ 77 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~-----~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~ 77 (219) |.||.+.+| =.+-.++.+.+ .+..+.++-.++. ...+.|+|||-||= ||.-.++ ....+.+... T Consensus 2 iLiiD~~ds-fT~nI~~~lr~-~g~~v~V~~~d~~~~~~i~~~~pdgiiLS~GP--g~P~~~~-------~~~~~i~~~~ 70 (192) T PRK05670 2 ILLIDNYDS-FTYNLVQYLGE-LGAEVVVYRNDEITLEEIEALAPDAIVLSPGP--GTPAEAG-------ISLELIREFA 70 (192) T ss_pred EEEEECCCC-HHHHHHHHHHH-CCCEEEEEECCCCCHHHHHHCCCCEEEECCCC--CCHHHCC-------CHHHHHHHHC T ss_conf 999968986-89999999986-89969999899899999985098989999999--9936605-------5499999734 Q ss_pred CCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHH Q ss_conf 59717860640310100000101101244332454225767525235776413799689986201330233086774115 Q gi|254780971|r 78 QGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEI 157 (219) Q Consensus 78 ~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l 157 (219) .+.|+||||-|+|+|.+. +.|.+.+.....+ .++..+. .+++..++++. ..+.+|..|+. ..+.. .+ T Consensus 71 ~~iPiLGIClG~Qlia~~--~Gg~v~~~~~~~h-G~~~~i~--~~~~~lf~~~~--~~~~v~~~Hs~---~v~~~---~l 137 (192) T PRK05670 71 GKVPILGVCLGHQAIGEA--FGGKVVRAKEIMH-GKTSPIE--HDGKGIFAGLP--NPFTVTRYHSL---VVERE---SL 137 (192) T ss_pred CCCCEEEEEHHHHHHHHH--CCCEEEECCCCCC-CEEEEEE--ECCCCCCCCCC--CCCEEEECCEE---EEEEC---CC T ss_conf 699789984778999987--1973888886363-7048999--89998715888--87468611207---99933---47 Q ss_pred HHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 42250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 158 EKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 158 ~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) . ++..+.-+.+ +| -|+|+..++.+++|.+.||| +. .+.+|..||+|+++ T Consensus 138 p-~~~~v~a~s~----d~---~i~a~~~~~~~~~gvQFHPE-s~----~t~~G~~il~nFl~ 186 (192) T PRK05670 138 P-DCLEVTAWTD----DG---EIMGVRHKELPIYGVQFHPE-SI----LTEHGHKLLENFLE 186 (192) T ss_pred C-CCEEEEEEEC----CC---CEEEEEECCCCEEEEEECCC-CC----CCCCHHHHHHHHHH T ss_conf 9-9679999979----97---19999989999899967998-89----98176999999999 No 56 >COG0518 GuaA GMP synthase - Glutamine amidotransferase domain [Nucleotide transport and metabolism] Probab=99.20 E-value=1.8e-10 Score=87.84 Aligned_cols=183 Identities=20% Similarity=0.225 Sum_probs=115.4 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCC-EEEEECCC---CCCCCCCEEEECCCCC--CCCCCCHHHHHHHHHHHHHHHH Q ss_conf 9689984497064799999999854996-39982154---6445647899728734--5431140677520021212233 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQS-PILVWQSD---TDIPDVDLIVIPGGFS--YGDYLRCGAIAARTPVMQAIKK 74 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~-~~~v~~~~---~~l~~~d~lvipGGFS--ygD~l~aG~i~~~~~~~~~i~~ 74 (219) .++.|+-|++--... .++.+...+... .+..+..+ ....+.|++|+.||=+ |.|.- |+. ...+.|.+ T Consensus 2 ~~ilIld~g~q~~~l-i~r~~re~g~v~~e~~~~~~~~~~~~~~~~~giIlsGgp~sv~~~~~-----w~~-~~~~~i~~ 74 (198) T COG0518 2 RKILILDFGGQYLGL-IARRLRELGYVYSEIVPYTGDAEELPLDSPDGIIISGGPMSVYDEDP-----WLP-REKDLIKD 74 (198) T ss_pred CEEEEEECCCCHHHH-HHHHHHHCCCCEEEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCC-----CCH-HHHHHHHH T ss_conf 579999579707679-99999873982699985788801156668888998589887764433-----305-57899997 Q ss_pred HHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHH Q ss_conf 22059717860640310100000101101244332454225767525235776413799689986201330233086774 Q gi|254780971|r 75 KAQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGL 154 (219) Q Consensus 75 ~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l 154 (219) ....++|+||||-|.|+|++. +.|+..++.. .|--|..+++...++++++++..... .+|..|++- . T Consensus 75 ~~~p~~pvLGIC~G~Ql~A~~--lGg~V~~~~~--~E~G~~~v~~~~~~~~l~~gl~~~~~-~v~~sH~D~--------v 141 (198) T COG0518 75 AGVPGKPVLGICLGHQLLAKA--LGGKVERGPK--REIGWTPVELTEGDDPLFAGLPDLFT-TVFMSHGDT--------V 141 (198) T ss_pred HCCCCCCEEEECHHHHHHHHH--HCCEEECCCC--CCCCEEEEEEECCCCCCCCCCCCCCC-CCCCCHHCC--------C T ss_conf 188888789987618999998--4988964688--76515999972586500048854337-106613074--------4 Q ss_pred HHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 1154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 155 AEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 155 ~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .+|-+.-.+..+ .+.-| ++++-.. .++.|++.|||=.. ..|..+++|+. T Consensus 142 ~~lP~g~~vlA~--s~~cp------~qa~~~~-~~~~gvQFHpEv~~------~~~~~~l~nf~ 190 (198) T COG0518 142 VELPEGAVVLAS--SETCP------NQAFRYG-KRAYGVQFHPEVTH------EYGEALLENFA 190 (198) T ss_pred CCCCCCCEEEEC--CCCCH------HHHEECC-CCEEEEEEEEEEEH------HHHHHHHHHHH T ss_conf 348999899865--89980------4205667-85799942018738------99999999865 No 57 >cd01745 GATase1_2 Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. This group contains a subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. GATase activity catalyses the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. Glutamine amidotransferases (GATase) includes the triad family of amidotransferases which have a conserved Cys-His-Glu catalytic triad in the glutaminase active site. In this subgroup this triad is conserved. GATase activity can be found in a range of biosynthetic enzymes, including: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase , anthranilate synthase component II, glutamine-dependent carbamoyl phosphate synthase, cytidine triphosphate synthetase, gamma-glutamyl hydrolase, imidazole glycerol phosphate synthase and, cobyric acid synthase. Glutamine amidotransferase (GATase) domains can occur either as single polypeptides, as in glutamine Probab=99.19 E-value=1.3e-10 Score=88.76 Aligned_cols=151 Identities=26% Similarity=0.462 Sum_probs=89.3 Q ss_pred HHHHHHHHHCCCCEEEEECCC------CCCCCCCEEEECCCCC-----CCCCCC--HHHHH-HHHHH-HHHHHHHHHCCC Q ss_conf 999999985499639982154------6445647899728734-----543114--06775-20021-212233220597 Q gi|254780971|r 16 DMIKAITKIIGQSPILVWQSD------TDIPDVDLIVIPGGFS-----YGDYLR--CGAIA-ARTPV-MQAIKKKAQQGI 80 (219) Q Consensus 16 e~~~A~~~~~~~~~~~v~~~~------~~l~~~d~lvipGGFS-----ygD~l~--aG~i~-~~~~~-~~~i~~~~~~g~ 80 (219) .-..++++ .|..++.+-... ..+..+|+++++||-. ||.... .+.+- .+... ..-++.+.++++ T Consensus 23 ~y~~~i~~-~G~~Pi~iP~~~~~~~~~~~l~~lDGliLtGG~Di~p~~yg~~~~~~~~~~~~~RD~~E~~l~~~a~~~~~ 101 (189) T cd01745 23 YYVDAVRK-AGGLPVLLPPVDDEEDLEQYLELLDGLLLTGGGDVDPPLYGEEPHPELGPIDPERDAFELALLRAALERGK 101 (189) T ss_pred HHHHHHHH-CCCEEEEECCCCCHHHHHHHHHHCCEEEECCCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 99999998-79999996899987999999987799997288778810038877765678886689999999999998098 Q ss_pred EEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHC Q ss_conf 17860640310100000101101244332454225767525235776413799689986201330233086774115422 Q gi|254780971|r 81 KVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKN 160 (219) Q Consensus 81 ~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~ 160 (219) |+||||-|+|+|... ..|+|.+. +.+.+.|..| .+++- . T Consensus 102 PilGICRG~Q~inv~--~GGtL~q~------------------------------i~VNS~H~~~--------i~~l~-~ 140 (189) T cd01745 102 PILGICRGMQLLNVA--LGGTLYQD------------------------------IRVNSLHHQA--------IKRLA-D 140 (189) T ss_pred CCEEEHHHHHHHHHH--HCCEEEEC------------------------------CCCCCCCHHH--------HHCCC-C T ss_conf 760350549999999--59857503------------------------------4057641006--------76078-8 Q ss_pred CEEEEECCCCCCCCCCCCCEEEEECCCCC-EEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 50365316888889881126688869998-99984884343221006776289998642 Q gi|254780971|r 161 NQIVFRYASGTNPNGSLHDIAGVINRRGN-VLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 161 ~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~-vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) +.++.-+.. .| .|.|+..++.. ++|.|-||||.... +...++||+.++ T Consensus 141 ~l~~~a~~~----dg---~iEa~~~~~~~~~~gvqwHPE~~~~~---~~~~~~lF~~fv 189 (189) T cd01745 141 GLRVEARAP----DG---VIEAIESPDRPFVLGVQWHPEWLADT---DPDSLKLFEAFV 189 (189) T ss_pred CCEEEEECC----CC---CEEEEEECCCCCEEEEECCCCCCCCC---CHHHHHHHHHHC T ss_conf 868999969----99---87999968999089997488678989---857889999869 No 58 >PRK01077 cobyrinic acid a,c-diamide synthase; Validated Probab=99.16 E-value=7e-10 Score=84.02 Aligned_cols=172 Identities=21% Similarity=0.285 Sum_probs=100.3 Q ss_pred EEEEEECCCCC-CHHHHHHHHHHHCCCCEEEE-ECCCCCCC-CCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 68998449706-47999999998549963998-21546445-64789972873454311406775200212122332205 Q gi|254780971|r 2 KTAIVQIPGLN-RDNDMIKAITKIIGQSPILV-WQSDTDIP-DVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ 78 (219) Q Consensus 2 kvaVl~~pGsN-cd~e~~~A~~~~~~~~~~~v-~~~~~~l~-~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~ 78 (219) ||||.+-+--| .-.|...++++ .|.+.+.. -.+|..++ +.|+|+|||||. .+-+..++.+..+.++|++++++ T Consensus 247 rIava~D~AF~FyY~enl~~L~~-~Gael~~fSPl~d~~lP~~~D~l~lpGGyP---El~a~~L~~n~~l~~~I~~~~~~ 322 (451) T PRK01077 247 RIAVARDAAFSFYYPDNLELLRA-AGAELVFFSPLADEALPDDCDALYLGGGYP---ELFAAELAANTSMRASIRAAAAA 322 (451) T ss_pred EEEEECCCCCCCCCHHHHHHHHH-CCCEEEEECCCCCCCCCCCCCEEEECCCCH---HHHHHHHHHHHHHHHHHHHHHHC T ss_conf 69994342247557667999997-898899957767888986678899579850---24399998708999999999986 Q ss_pred CCEEEEECCCHHHHEE------------CCCHHHHHCCCC-CCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCE Q ss_conf 9717860640310100------------000101101244-332454225767525235776413799689986201330 Q gi|254780971|r 79 GIKVMGICNGFQILVE------------LNLLPGILMRNC-SLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDG 145 (219) Q Consensus 79 g~~vLGICNGfQiL~e------------lGLlPg~l~~N~-s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEG 145 (219) |+|++|+|.|+|+|.+ +||||+.-...+ -.+|- .+.++.. ..++|.+ .|+.+.-.=-| .+ T Consensus 323 G~pi~aeCGG~~~Lg~~i~d~~G~~~~m~Gllp~~t~~~~~~~~~G--Y~~~~~~-~~~~~~~---~g~~i~GHEfH-ys 395 (451) T PRK01077 323 GKPIYAECGGLMYLGESLEDADGERHPMVGLLPGEASMTKRLAALG--YREAELL-ADTLLGA---AGERLRGHEFH-YS 395 (451) T ss_pred CCCEEEECHHHHHHHHHCCCCCCCEEECCCCCCEEEEECCCCCCCC--CEEEEEC-CCCCCCC---CCCEEEEEEEC-CE T ss_conf 9978998711788754327789988524354013779867855225--4699954-8975667---99789898542-64 Q ss_pred EEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCH Q ss_conf 23308677411542250365316888889881126688869998999848843 Q gi|254780971|r 146 NYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPE 198 (219) Q Consensus 146 rf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPE 198 (219) +.. ..+...+++..+..+..+.. +|.. +||++|.-+|-. T Consensus 396 ~~~---------~~~~~~~~~~~~~~g~~~~~---~G~~--~~nv~asy~Hlh 434 (451) T PRK01077 396 TLE---------ETPLEPLYRVRDADGGPVGE---GGYR--RGNVLASYLHLH 434 (451) T ss_pred ECC---------CCCCCEEEEEECCCCCCCCC---CEEE--ECCEEEEEEEEC T ss_conf 641---------79987008998689998885---8799--799999963752 No 59 >PRK06278 cobyrinic acid a,c-diamide synthase; Validated Probab=99.15 E-value=4.9e-11 Score=91.39 Aligned_cols=167 Identities=25% Similarity=0.393 Sum_probs=97.2 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECC-CCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCC Q ss_conf 96899844970647999999998549963998215-46445647899728734543114067752002121223322059 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQS-DTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQG 79 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~-~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g 79 (219) |++++|.-.|| .-.|+..+.-.+.++..+ -..+.+.|++++|||- -+-+|.+ ...+-+++.+| + T Consensus 2 meIGildIkgs------lP~FE~FG~LPTkii~enn~k~ikdLD~lIIPGGS----lvEs~sl--~~elk~eI~~f---~ 66 (482) T PRK06278 2 MEIGLLDIKGT------LPCFENFGNLPTKIIKENNIKAIKDLDMLIIPGGS----IVESGSL--TDELKKEILDF---D 66 (482) T ss_pred CEEEEEEECCC------CCHHHHCCCCCHHHHCCCCHHHHHCCCEEEECCCC----EEECCCC--CHHHHHHHHHC---C T ss_conf 35789996266------41132216786366443672235307579965973----5733775--77899999862---7 Q ss_pred CEEEEECCCHHHHEE----------------CCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECC Q ss_conf 717860640310100----------------0001011012443324542257675252357764137996899862013 Q gi|254780971|r 80 IKVMGICNGFQILVE----------------LNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHH 143 (219) Q Consensus 80 ~~vLGICNGfQiL~e----------------lGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHg 143 (219) +++||||.|||+|.+ ||||.- |.|----..+|..++.++ .|+ +.+.+..-+.+ -|- T Consensus 67 ~~IiGICsGFQiL~~kiDIGRKS~vPIikeGLGLLdV----~fsPlictDrV~f~i~~~--~~~-~~~~~~~~GFH-CHT 138 (482) T PRK06278 67 NYILGICSGFQILAEKIDIGRKSPVPIIKEGLGLLNV----EFSPLICTDRVKFNIEND--SFF-GEKNESGTGFH-CHT 138 (482) T ss_pred CEEEEECHHHHHHHHHCCCCCCCCCCEEECCCCCEEE----EEEEEEECCEEEEEEECC--CCC-CCCCCCEEEEE-EEC T ss_conf 8589961167887764376667999741004552014----661136515156897514--003-55577331255-310 Q ss_pred CEEEE-ECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCC Q ss_conf 30233-0867741154225036531688888988112668886999899984884 Q gi|254780971|r 144 DGNYF-IDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHP 197 (219) Q Consensus 144 EGrf~-~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHP 197 (219) .|+.- ++.+ -+.|....-.-+.|..- |- ..=+-|.| +|+|+|.|-|- T Consensus 139 YG~Iei~~~~-tk~lt~S~i~klNYk~~--~~--~~IlSG~~--kgkV~GTmVHn 186 (482) T PRK06278 139 YGNIEITNNS-TKKLTYSKIQKLNYKMV--GK--QNILSGVF--KGNVYGTMVHN 186 (482) T ss_pred CCCEEECCCC-CCEEEHHHHHHHCCCCC--CC--CEEEEEEC--CCCCHHHHHHH T ss_conf 1437842788-75224655434023436--76--40552001--68461454555 No 60 >CHL00101 trpG anthranilate synthase component 2 Probab=99.12 E-value=1.5e-09 Score=81.95 Aligned_cols=180 Identities=14% Similarity=0.244 Sum_probs=101.6 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCC---CCC--CCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 89984497064799999999854996399821546---445--6478997287345431140677520021212233220 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDT---DIP--DVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQ 77 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~---~l~--~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~ 77 (219) |.||.+-.+= -+-.++.+.+ .+..+.++-.++. ++. +.|+|||-||= ||... ......+.+... T Consensus 2 ILiiD~~dsf-t~~i~r~lre-lg~~~~v~~~d~~~~~~i~~~~p~gIILS~GP--g~p~~-------~~~~~~~~~~~~ 70 (190) T CHL00101 2 ILIIDNYDSF-TYNLVQSLGE-LNSDLLVCRNDEIDLSKIKNLNPRHIIISPGP--GHPRD-------SGISLDVISSYA 70 (190) T ss_pred EEEEECCCHH-HHHHHHHHHH-CCCEEEEECCCCCCHHHHHHCCCCEEEECCCC--CCHHH-------CCCCHHHHHHHH T ss_conf 9999788808-9999999986-89989998699899999970797989997999--99578-------576168999971 Q ss_pred CCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHH Q ss_conf 59717860640310100000101101244332454225767525235776413799689986201330233086774115 Q gi|254780971|r 78 QGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEI 157 (219) Q Consensus 78 ~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l 157 (219) .+.|+||||-|+|+|.+. +.|.+.+.+...|- ...+. ..+.+..++++. ..+..|..|+. ..+.+ .+ T Consensus 71 ~~iPILGIClG~Q~ia~~--~Gg~v~~~~~~~~G--~~~~i-~~~~~~lf~g~~--~~~~~~~~hs~---~v~~~---~l 137 (190) T CHL00101 71 PTIPILGVCLGHQSIGYV--YGGKIIKASKPMHG--KTSLI-YHNHDDLFQGLP--NPFIATRYHSL---IIDRE---NL 137 (190) T ss_pred CCCCEEEECHHHHHHHHH--CCCEEEECCCEEEC--CEEEE-EECCCCCCCCCC--CCCEEEECCEE---EEEEC---CC T ss_conf 499878973988999887--58649978820436--24799-978987543799--88489870779---99804---57 Q ss_pred HHCCEEEEECCCCCCCCCCCCCEEEEECCCCC-EEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 42250365316888889881126688869998-999848843432210067762899986429 Q gi|254780971|r 158 EKNNQIVFRYASGTNPNGSLHDIAGVINRRGN-VLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 158 ~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~-vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) .+ +..+.-.. .+| .|+|+-.++.+ ++|.+.|||=.. +.+|+.|++|+++ T Consensus 138 p~-~~~v~a~s----~~~---~i~ai~h~~~~~i~GVQFHPEs~~-----T~~G~~il~NFl~ 187 (190) T CHL00101 138 PS-CLEITAWT----EDG---LIMACRHKKYPMLRGIQFHPESIL-----TEHGQQLLRNFLS 187 (190) T ss_pred CC-CEEEEEEC----CCC---CEEEEEECCCCCEEEEEECCCCCC-----CCCHHHHHHHHHH T ss_conf 89-87999997----999---689999789998799997486699-----9486999999998 No 61 >PRK11366 puuD gamma-glutamyl-gamma-aminobutyrate hydrolase; Provisional Probab=99.11 E-value=9.6e-10 Score=83.16 Aligned_cols=181 Identities=22% Similarity=0.249 Sum_probs=97.7 Q ss_pred HHHHHHHCCCCEEEEEC--CCC-----CCCCCCEEEECCCC------CCCCCCCHHHH-HHHHH-HHHHHHHHHHCCCEE Q ss_conf 99999854996399821--546-----44564789972873------45431140677-52002-121223322059717 Q gi|254780971|r 18 IKAITKIIGQSPILVWQ--SDT-----DIPDVDLIVIPGGF------SYGDYLRCGAI-AARTP-VMQAIKKKAQQGIKV 82 (219) Q Consensus 18 ~~A~~~~~~~~~~~v~~--~~~-----~l~~~d~lvipGGF------SygD~l~aG~i-~~~~~-~~~~i~~~~~~g~~v 82 (219) ..|..+++|. ++++-. .+. -++..|+|+++||- -||........ -.+.. -+.-++.+.++++|+ T Consensus 32 ~~ai~~aGgi-PvllP~~~~~~~~~~~~l~~lDGllltGG~~Di~P~~Yge~~~~~~~dp~RD~~E~~L~~~A~~~~~Pi 110 (254) T PRK11366 32 LNAIIHAGGL-PIALPHALAEPSLLEQLLPKLDGIYLPGSPSNVQPHLYGENGDEPDADPGRDLLSMALINAALERRIPI 110 (254) T ss_pred HHHHHHCCCE-EEECCCCCCCHHHHHHHHHHCCEEEECCCCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCE T ss_conf 9999982985-898488878878999999878989968998988942368898888888421299999999999859997 Q ss_pred EEECCCHHHHEE-CC-C-------HHHHHCCCCCC--CEEEEE---EEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEE Q ss_conf 860640310100-00-0-------10110124433--245422---5767525235776413799689986201330233 Q gi|254780971|r 83 MGICNGFQILVE-LN-L-------LPGILMRNCSL--KFVCKQ---VLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYF 148 (219) Q Consensus 83 LGICNGfQiL~e-lG-L-------lPg~l~~N~s~--rf~~r~---~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~ 148 (219) ||||-|+|+|-- +| - .|+.+.+.... ...-++ -.+.+. ..|.+-+-+.....+.+.+.|.++ T Consensus 111 LGICRG~Q~lNVa~GGtL~qdl~~~~~~~~H~~~~~~~~~~~~~~~H~V~i~-~~s~L~~i~~~~~~~~VNS~HhQa--- 186 (254) T PRK11366 111 FAICRGLQELVVATGGSLHRKLCEQPELLEHREDPELPVEQQYAPSHEVQVE-EGGLLSALLPECSNFWVNSLHGQG--- 186 (254) T ss_pred EEECHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEEC-CCCCHHHHHCCCCCEEEECHHHHH--- T ss_conf 9860355899987388644220037652344568889854332564118986-898699974898705874213543--- Q ss_pred ECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCC-EEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 08677411542250365316888889881126688869998-999848843432210067762899986429 Q gi|254780971|r 149 IDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGN-VLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 149 ~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~-vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) ++++-+.=+|..+- |.|- |.+|..++.. ++|.+-|||+..+ .....++||++++. T Consensus 187 -----v~~lg~gl~v~A~a-----~DG~---IEAie~~~~~f~lGVQWHPE~~~~---~~~~~~~LF~aFv~ 242 (254) T PRK11366 187 -----AKVVSPRLRVEARS-----PDGL---VEAVSVINHPFALGVQWHPEWNSS---EYALSRILFEGFIT 242 (254) T ss_pred -----HHCCCCCCEEEEEE-----CCCC---EEEEEECCCCEEEEEECCCCCCCC---CCHHHHHHHHHHHH T ss_conf -----31337773899990-----8997---799995799858999708877876---78678999999999 No 62 >PRK08007 para-aminobenzoate synthase component II; Provisional Probab=99.08 E-value=2.5e-09 Score=80.53 Aligned_cols=169 Identities=16% Similarity=0.306 Sum_probs=98.5 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCC---CC--CCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 999999998549963998215464---45--6478997287345431140677520021212233220597178606403 Q gi|254780971|r 15 NDMIKAITKIIGQSPILVWQSDTD---IP--DVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGF 89 (219) Q Consensus 15 ~e~~~A~~~~~~~~~~~v~~~~~~---l~--~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGf 89 (219) +-.++.+.+ .+.++.++..++.. +. +.|+|||-+|= ||...++ ....+.+......|+||||-|+ T Consensus 13 ~ni~~~l~~-~g~~v~V~~~d~~~~~~i~~~~p~~IvlSpGP--g~P~~~~-------~~~~~i~~~~~~iPiLGIClG~ 82 (187) T PRK08007 13 WNLYQYFCE-LGADVLVKRNDALTLADIDALKPQKIVISPGP--CTPDEAG-------ISLDVIRHYAGRLPILGVCLGH 82 (187) T ss_pred HHHHHHHHH-CCCEEEEEECCCCCHHHHHHCCCCEEEECCCC--CCHHHCC-------CCHHHHHHHCCCCCEEEECHHH T ss_conf 999999987-79979999389899999984298979999999--9958908-------4046899753899889987999 Q ss_pred HHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCC Q ss_conf 10100000101101244332454225767525235776413799689986201330233086774115422503653168 Q gi|254780971|r 90 QILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYAS 169 (219) Q Consensus 90 QiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d 169 (219) |+|++. +.|.+.+-....+ -+ ... +..+.+..++++. ..+.+|-.|+. ..+.. .+.+.-.+.. .. T Consensus 83 Q~ia~~--~Gg~v~~~~~~~h-G~-~~~-i~~~~~~lf~~~~--~~~~v~~~hs~---~v~~~---~lp~~~~v~a-~~- 147 (187) T PRK08007 83 QAMAQA--FGGKVVRAAKVMH-GK-TSP-ITHNGEGVFRGLA--NPLTVTRYHSL---VVEPD---SLPECFEVTA-WS- 147 (187) T ss_pred HHHHHH--CCCEEEECCCCCC-CC-CEE-EEECCCCCCCCCC--CCEEEEEEEEE---EEECC---CCCCCEEEEE-EC- T ss_conf 999998--0980734783201-52-069-9989986267899--87378975578---88647---6898159999-92- Q ss_pred CCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 88889881126688869998999848843432210067762899986429 Q gi|254780971|r 170 GTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 170 ~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) .++ .|+|+-+++.++.|.+.|||-.. +++|..|++|++. T Consensus 148 ---~~~---~v~ai~~~~~~~~GVQFHPEs~~-----T~~G~~il~NFl~ 186 (187) T PRK08007 148 ---ETR---EIMGIRHRQWDLEGVQFHPESIL-----SEQGHQLLANFLH 186 (187) T ss_pred ---CCC---CEEEEEECCCCEEEEEECCCCCC-----CCCHHHHHHHHHC T ss_conf ---998---68999989999899995897799-----9677899999857 No 63 >PRK06774 para-aminobenzoate synthase component II; Provisional Probab=99.07 E-value=1.9e-09 Score=81.26 Aligned_cols=172 Identities=17% Similarity=0.287 Sum_probs=99.8 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCC---CC--CCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 999999998549963998215464---45--6478997287345431140677520021212233220597178606403 Q gi|254780971|r 15 NDMIKAITKIIGQSPILVWQSDTD---IP--DVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGF 89 (219) Q Consensus 15 ~e~~~A~~~~~~~~~~~v~~~~~~---l~--~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGf 89 (219) +-.++.+.+ .|.++.++..++.. +. +.|+|||-+|= |+.-.+| ....+.+......|+||||-|+ T Consensus 13 ~nl~~~l~~-~g~~v~V~~~d~~~~~~i~~~~p~~ivLSpGP--G~P~~~~-------~~~~~i~~~~~~iPILGIClG~ 82 (191) T PRK06774 13 YNLYQYFCE-LGTEVMVKRNDELQLTDIEQLAPSHVVISPGP--CTPNEAG-------ISLAVIRHFADKLPILGVCLGH 82 (191) T ss_pred HHHHHHHHH-CCCEEEEECCCCCCHHHHHHCCCCEEEECCCC--CCHHHHC-------CHHHHHHHHCCCCCEEEEHHHH T ss_conf 999999987-79959998699889999985399969999999--8967907-------3068999744699788616879 Q ss_pred HHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCC Q ss_conf 10100000101101244332454225767525235776413799689986201330233086774115422503653168 Q gi|254780971|r 90 QILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYAS 169 (219) Q Consensus 90 QiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d 169 (219) |+|.+. +.|.+.+-+...+ -+-..+ ..+.+..++++. +.+.+|..|+.. .+. +.+.+.=.+.. ..+ T Consensus 83 Q~ia~~--~Gg~v~~~~~~~h-G~~~~i--~~~~~~lf~g~~--~~~~v~~~Hs~~---v~~---~~lp~~~~v~a-~s~ 148 (191) T PRK06774 83 QALGQA--FGARVVRARQVMH-GKTSAI--CHSGQGVFRGLN--QPLTVTRYHSLV---IAT---DSLPGCFELTA-WTE 148 (191) T ss_pred HHHHHH--CCCEEEECCCCEE-CEEEEE--EECCCCCCCCCC--CCCEEEECCCEE---EEE---CCCCCCEEEEE-EEC T ss_conf 998656--0984928997475-703689--989987606899--986885046507---883---44899879999-979 Q ss_pred CCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 8888988112668886999899984884343221006776289998642 Q gi|254780971|r 170 GTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 170 ~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .+|...-|+|+-.++..++|.+.|||= +. +++|..|++|++ T Consensus 149 ---~~~~~~~i~ai~~~~~~i~GVQFHPEs-~~----T~~G~~il~NFl 189 (191) T PRK06774 149 ---RGGEMDEIMGIRHRTLPLEGVQFHPES-IL----SEQGHQLLDNFL 189 (191) T ss_pred ---CCCCCCEEEEEEECCCCEEEEEECCCC-CC----CCCHHHHHHHHH T ss_conf ---999988688888389998998768988-99----958889999984 No 64 >PRK08250 glutamine amidotransferase; Provisional Probab=99.06 E-value=2.4e-09 Score=80.60 Aligned_cols=174 Identities=14% Similarity=0.175 Sum_probs=94.8 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEE--ECCC---CCCCCCCEEEECCC-CC-CCCCCCHHHHHHHHHHHHHHH Q ss_conf 96899844970647999999998549963998--2154---64456478997287-34-543114067752002121223 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILV--WQSD---TDIPDVDLIVIPGG-FS-YGDYLRCGAIAARTPVMQAIK 73 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v--~~~~---~~l~~~d~lvipGG-FS-ygD~l~aG~i~~~~~~~~~i~ 73 (219) |||.||+-.=.-.---....+.. .|.+..++ +..+ ..+++||+||+-|| .| ..|+-.. -+.....-.+-++ T Consensus 1 MrilviqH~~~E~pG~i~~~l~~-~G~~~~~~~~~~g~~lP~~~~~~d~liVlGGpms~~~~~e~~-p~l~l~~E~~lir 78 (235) T PRK08250 1 MRVHFIIHESFESPGAYLKWAED-RGYTISWSRVYAGEALPENADGFDMLIVLGGPQSPRTTREEC-PYFDSKAEQHLIN 78 (235) T ss_pred CEEEEEECCCCCCCCHHHHHHHH-CCCEEEEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCC-CCCCHHHHHHHHH T ss_conf 97999981899897078999997-899699996689998999810258899978998777732127-8747099999999 Q ss_pred HHHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECC--CHHHHHHHCCCCCEEEEEEECCCEEEEECH Q ss_conf 322059717860640310100000101101244332454225767525--235776413799689986201330233086 Q gi|254780971|r 74 KKAQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVN--SNTAFTKSYKMNQIIKCPVAHHDGNYFIDA 151 (219) Q Consensus 74 ~~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~--~~s~~~~~~~~~~~l~~piaHgEGrf~~~~ 151 (219) +..+.++|+||||-|.|+|.+. +.|...+|... |--|..+++.. ...+++.++.. .+.+.--|+| .|..++ T Consensus 79 ~a~~~~~PvlGIClG~QliA~A--lGg~V~~~~~~--EiG~~~i~~t~~~~~d~~~~~~~~--~~~v~~wH~d-~~~LP~ 151 (235) T PRK08250 79 QAITARKAVVGVCLGSQLIGEA--LGAKYCQSPEK--EIGHFPITLTEAGLRDPLLKHFGS--TLTVGHWHND-MPGLTD 151 (235) T ss_pred HHHHCCCCEEEEEHHHHHHHHH--CCCEEECCCCC--EEEEEEEEEECCCCCCHHHHCCCC--CCEEEEEEEE-EEECCC T ss_conf 9998699889981677999998--09877548998--244897163014445846652898--7657898523-620699 Q ss_pred HHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHH Q ss_conf 7741154225036531688888988112668886999899984884343 Q gi|254780971|r 152 KGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENI 200 (219) Q Consensus 152 ~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~ 200 (219) . -.. +-.. +.-|| .+..- ..+++|++.|||=. T Consensus 152 g--------A~l-LAss-~~c~~------Qaf~~-g~~~~g~QfHpE~t 183 (235) T PRK08250 152 Q--------ATV-LATS-EGCPR------QIVQY-GNLVYGFQCHMEFT 183 (235) T ss_pred C--------CEE-EECC-CCCHH------HHEEE-CCCEEEEECCEECC T ss_conf 9--------679-9739-99848------95882-69589997103789 No 65 >cd01746 GATase1_CTP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in Cytidine Triphosphate Synthetase. Type 1 glutamine amidotransferase (GATase1) domain found in Cytidine Triphosphate Synthetase (CTP). CTP is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. CTPs produce CTP from UTP and glutamine and regulate intracellular CTP levels through interactions with four ribonucleotide triphosphates. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. CTP is derived form UTP in three separate steps involving two active sites. In one active site, the UTP O4 oxygen is activated by Mg-ATP-dependent phosphorylation, followed by displacement of the resulting 4-phosphate moiety by ammonia. At a separate site, ammonia is generated via rate limiting glutamine hydrolysis (glutaminase) activity. A gated channel that spans between the glutamine hydrolysis and amidoligase active sites provides a path for ammonia diffusion. CTPs belong to th Probab=99.05 E-value=7.7e-09 Score=77.38 Aligned_cols=174 Identities=25% Similarity=0.393 Sum_probs=98.5 Q ss_pred HHHHHHH---CCCCEEEEECCCC---------CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 9999985---4996399821546---------445647899728734543114067752002121223322059717860 Q gi|254780971|r 18 IKAITKI---IGQSPILVWQSDT---------DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGI 85 (219) Q Consensus 18 ~~A~~~~---~~~~~~~v~~~~~---------~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGI 85 (219) ..|+..+ .+....+.|.+.+ .|.++|++++||||.+ +-..|++.| ++-.-++..|.||| T Consensus 20 ~eAL~ha~~~~~~~v~i~wi~s~~l~~~~~~~~L~~~dgilvpgGFG~--rG~eGki~A-------i~yARen~iPfLGI 90 (235) T cd01746 20 LEALKHAGIALGVKLEIKWIDSEDLEEENAEEALKGADGILVPGGFGI--RGVEGKILA-------IKYARENNIPFLGI 90 (235) T ss_pred HHHHHHHHHHCCCEEEEEEEEHHHCCCCCHHHHHHCCCCEEECCCCCC--CCHHHHHHH-------HHHHHHCCCCEEEE T ss_conf 999999899749905899985565476688999730697895788887--764588999-------99999739972544 Q ss_pred CCCHHHHE-E-----CCCHHHH----HCC-----------------CCCC--CEEEEEEEEEECCCHHHHHHHCCCCCEE Q ss_conf 64031010-0-----0001011----012-----------------4433--2454225767525235776413799689 Q gi|254780971|r 86 CNGFQILV-E-----LNLLPGI----LMR-----------------NCSL--KFVCKQVLLEVVNSNTAFTKSYKMNQII 136 (219) Q Consensus 86 CNGfQiL~-e-----lGLlPg~----l~~-----------------N~s~--rf~~r~~~~~v~~~~s~~~~~~~~~~~l 136 (219) |-|||+++ | +|+= .+ +.. +.-+ |.-+..+.++ .+|...+-+...++ T Consensus 91 ClGmQ~aviE~ARnvlg~~-~A~s~E~~~~~~~~vi~~~~~~~~~~~~GgTmRLG~~~~~l~---~~s~~~~~y~~~~i- 165 (235) T cd01746 91 CLGMQLAVIEFARNVLGLP-DANSTEFDPDTPHPVVDLMPEQKGVKDLGGTMRLGAYPVILK---PGTLAHKYYGKDEV- 165 (235) T ss_pred ECCCCHHHHHHHHHHCCCC-CCCCCCCCCCCCCCEEEECHHHCCCHHCCCCEEEEEEEEEEC---CCHHHHHHCCCCEE- T ss_conf 1034322689998752997-767422479999877996756616400134134202577622---43066675188323- Q ss_pred EEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCC-CCEEEECCCCHHHCCCCCCCCCHHHHHH Q ss_conf 986201330233086774115422503653168888898811266888699-9899984884343221006776289998 Q gi|254780971|r 137 KCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRR-GNVLGMMPHPENIIEKFHGGIDGRGLFA 215 (219) Q Consensus 137 ~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~-G~vlgmMPHPER~~~~~~~~~dG~~~f~ 215 (219) .=.| --||-++++..+.|++++++..=+ +|+|..-.|.=+ ++ -=.+|.+.|||....+.. --++|. T Consensus 166 --~ERH-RHRYevN~~y~~~le~~Gl~~sG~----~~~~~~veiiEl--~~Hpffvg~QfHPEf~Srp~~----phPlF~ 232 (235) T cd01746 166 --EERH-RHRYEVNPEYVDELEEAGLRFSGT----DPDGGLVEIVEL--PDHPFFVGTQFHPEFKSRPLK----PHPLFV 232 (235) T ss_pred --EEEE-CCCEEECHHHHHHHHHCCCEEEEE----CCCCCEEEEEEC--CCCCCEEEECCCCCCCCCCCC----CCCCHH T ss_conf --1230-235567899999998689689999----899997999986--899847887577644689999----993836 Q ss_pred HHC Q ss_conf 642 Q gi|254780971|r 216 SLL 218 (219) Q Consensus 216 ~~~ 218 (219) ++| T Consensus 233 ~fi 235 (235) T cd01746 233 GFV 235 (235) T ss_pred HHC T ss_conf 439 No 66 >COG0504 PyrG CTP synthase (UTP-ammonia lyase) [Nucleotide transport and metabolism] Probab=99.05 E-value=5.1e-09 Score=78.52 Aligned_cols=176 Identities=28% Similarity=0.426 Sum_probs=100.7 Q ss_pred HHHHHHH---CCCCEEEEECCCCCCC--------C-CCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 9999985---4996399821546445--------6-47899728734543114067752002121223322059717860 Q gi|254780971|r 18 IKAITKI---IGQSPILVWQSDTDIP--------D-VDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGI 85 (219) Q Consensus 18 ~~A~~~~---~~~~~~~v~~~~~~l~--------~-~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGI 85 (219) ..|+... .+..+.+.|.+..++. . +|++++|||| |++-..|+++| ++-+-+++.|.||| T Consensus 308 ~EAL~hag~~~~~~v~i~wIdse~le~~~~~~~~~~~dgIlVPGGF--G~RG~eGkI~A-------i~yAREn~iP~lGI 378 (533) T COG0504 308 IEALKHAGIALGVKVNIKWIDSEDLEEENAAELEKLVDGILVPGGF--GYRGVEGKIAA-------IRYARENNIPFLGI 378 (533) T ss_pred HHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHCCCEEEECCCC--CCCCHHHHHHH-------HHHHHHCCCCEEEE T ss_conf 9999950764198402698751236654445566318879968987--76755889999-------99998669986888 Q ss_pred CCCHHHHE-E-----CCCHHHH--------------HCCC-C-------CCCEEEEEEEEEECCCHHHHHHHCCCCCEEE Q ss_conf 64031010-0-----0001011--------------0124-4-------3324542257675252357764137996899 Q gi|254780971|r 86 CNGFQILV-E-----LNLLPGI--------------LMRN-C-------SLKFVCKQVLLEVVNSNTAFTKSYKMNQIIK 137 (219) Q Consensus 86 CNGfQiL~-e-----lGLlPg~--------------l~~N-~-------s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~ 137 (219) |-|+|+.+ | +||-..- |++. + .-|.-...+.+ ...|..-+-+... . T Consensus 379 ClGmQ~aviE~ARnv~Gl~~AnS~Efdp~t~~pVv~l~~eq~~~~~lGGTmRLG~y~~~l---~~gT~a~~lY~~~---~ 452 (533) T COG0504 379 CLGMQLAVIEFARNVLGLEGANSTEFDPDTKYPVVDLMPEQKDVVDLGGTMRLGAYPCRL---KPGTLAAKLYGKD---E 452 (533) T ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCCEEECCCEEEEC---CCCCHHHHHHCCC---C T ss_conf 566789999999985197657643548899986588335423677677500416610112---7886899873877---1 Q ss_pred EEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHH Q ss_conf 86201330233086774115422503653168888898811266888699989998488434322100677628999864 Q gi|254780971|r 138 CPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASL 217 (219) Q Consensus 138 ~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~ 217 (219) +.-.| --||-++.+..+++++.+.+.- ...|+|..-.|--+-+ .--.+|++.|||-...|.. .-++|.++ T Consensus 453 v~ERH-RHRYEvN~~y~~~le~~Gl~~s----g~s~d~~lvEivE~~~-hpfFv~~QfHPEf~SrP~~----phPlf~~f 522 (533) T COG0504 453 IYERH-RHRYEVNNDYRDQLEKAGLVFS----GTSPDGGLVEIVELPD-HPFFVATQFHPEFKSRPLR----PHPLFVGF 522 (533) T ss_pred EEEEC-CCHHHCCHHHHHHHHHCCEEEE----EECCCCCEEEEEECCC-CCEEEEECCCCCCCCCCCC----CCCCHHHH T ss_conf 40021-5313308899999986896999----9868998689998389-9648987466313478999----98347999 Q ss_pred C Q ss_conf 2 Q gi|254780971|r 218 L 218 (219) Q Consensus 218 ~ 218 (219) | T Consensus 523 v 523 (533) T COG0504 523 V 523 (533) T ss_pred H T ss_conf 9 No 67 >TIGR00313 cobQ cobyric acid synthase CobQ; InterPro: IPR004459 Cobyric acid synthase (CobQ) catalyses amidations at positions B, D, E, and G on adenosylcobyrinic A,C-diamide in the biosynthesis of cobalamin. NH(2) groups are provided by glutamine, and one molecule of ATP is hydrogenolysed for each amidation .; GO: 0003824 catalytic activity, 0009236 cobalamin biosynthetic process. Probab=99.04 E-value=4.4e-10 Score=85.31 Aligned_cols=175 Identities=19% Similarity=0.270 Sum_probs=101.1 Q ss_pred CEEEEEECCC-CC-CHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHH----HHHHHHHHHHHHHH Q ss_conf 9689984497-06-479999999985499639982154644564789972873454311406----77520021212233 Q gi|254780971|r 1 MKTAIVQIPG-LN-RDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCG----AIAARTPVMQAIKK 74 (219) Q Consensus 1 mkvaVl~~pG-sN-cd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG----~i~~~~~~~~~i~~ 74 (219) +||.|+|+|- || .|-|= |+.-.+..+.++ .-+.+|.+.|.|||||- +.- .....+.+-+++.+ T Consensus 249 irv~v~RLpriSNFTD~eP---L~~~~~V~v~fl-~l~~~L~~~d~vIiPGs-------k~ti~dl~~L~~~~~~~~i~~ 317 (502) T TIGR00313 249 IRVGVVRLPRISNFTDFEP---LRYEADVAVKFL-ELEDSLDGLDAVIIPGS-------KDTIADLKALKESGFAEEILD 317 (502) T ss_pred EEEEEEECCCCCCCCCCCH---HHCCCCCEEEEE-CCCCCCCCCEEEEECCC-------CCHHHHHHHHHHCCCHHHHHH T ss_conf 4899985376356677522---111487289983-26887788448996799-------657899999974686679999 Q ss_pred HHHCCCEEEEECCCHHHHEE-----------------CCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEE Q ss_conf 22059717860640310100-----------------0001011012443324542257675252357764137996899 Q gi|254780971|r 75 KAQQGIKVMGICNGFQILVE-----------------LNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIK 137 (219) Q Consensus 75 ~~~~g~~vLGICNGfQiL~e-----------------lGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~ 137 (219) +++++..|+|||.|||||.+ |||||-.=.-+...|-. +.+.-. .+-+.+.+.+++.-+|-+ T Consensus 318 ~~~~ig~V~GICGGYQ~LG~~l~D~~~~E~~~~~~~GLgLLD~~T~f~~~~K~~-~q~~~~-~~~~~vIl~~~~G~~V~G 395 (502) T TIGR00313 318 LAKEIGIVIGICGGYQMLGKELVDKEKVESDVGDVEGLGLLDAKTEFREEEKIT-KQSQGR-VEGNAVILDRIRGETVKG 395 (502) T ss_pred HHHCCCEEEEECCCHHHHHHHHCCCCCCCCCCCEECCCCCCCCEEEECCCCCEE-EEECCE-EECCCEEEECCCCCEEEE T ss_conf 960698899834760210031036322267710002221254145671771378-511015-625834661541230344 Q ss_pred EEEECCCEEEEEC-HHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCC Q ss_conf 8620133023308-67741154225036531688888988112668886999899984884 Q gi|254780971|r 138 CPVAHHDGNYFID-AKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHP 197 (219) Q Consensus 138 ~piaHgEGrf~~~-~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHP 197 (219) -=| | ||-=+.. .+..+.|.+.++=.= -| .+--.||+.... ||++.|.-.|= T Consensus 396 YEi-H-~G~t~~~~Sqd~~Pl~~~drGfG-~~-~~G~~Gsi~~~~-----dg~~~GtYlHG 447 (502) T TIGR00313 396 YEI-H-EGLTRRKASQDVKPLFEVDRGFG-NV-SEGFDGSIDKAV-----DGNVWGTYLHG 447 (502) T ss_pred EEE-E-CCCCCCCCCCCCCCHHHHCCCCC-CC-CCCCCCCCHHHC-----CCEEEECHHCC T ss_conf 772-0-45001355300255144313621-23-678763201022-----41031210033 No 68 >PRK07649 para-aminobenzoate/anthranilate synthase glutamine amidotransferase component II; Validated Probab=99.03 E-value=3.7e-09 Score=79.44 Aligned_cols=168 Identities=16% Similarity=0.285 Sum_probs=95.8 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCC---CC--CCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCH Q ss_conf 999999998549963998215464---45--6478997287345431140677520021212233220597178606403 Q gi|254780971|r 15 NDMIKAITKIIGQSPILVWQSDTD---IP--DVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGF 89 (219) Q Consensus 15 ~e~~~A~~~~~~~~~~~v~~~~~~---l~--~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGf 89 (219) +-.++.+.. .|.++.++.+++.. +. +.|+|||-+|= |+.-.+ .....+.+......|+||||-|+ T Consensus 13 ~ni~~~l~~-~g~~~~v~~~d~~~~~~i~~~~pd~IiLSpGP--g~p~~~-------~~~~~~~~~~~~~iPILGIClG~ 82 (195) T PRK07649 13 YNLVQYLGE-LGQELVVKRNDEVTISDIENMKPDFLMISPGP--CSPNEA-------GISMEVIRYFAGKIPIFGVCLGH 82 (195) T ss_pred HHHHHHHHH-CCCEEEEECCCCCCHHHHHHCCCCEEEECCCC--CCHHHC-------CCCHHHHHHHCCCCCEEEECHHH T ss_conf 999999997-89979998899689999984198989988999--995784-------76146799752899784302999 Q ss_pred HHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCC Q ss_conf 10100000101101244332454225767525235776413799689986201330233086774115422503653168 Q gi|254780971|r 90 QILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYAS 169 (219) Q Consensus 90 QiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d 169 (219) |+|.+. ..|.+.+.+...+ .+-..+. . ..+..++++. ..+..|..|+. ..+.+ .+.+ +..+.-+. T Consensus 83 Q~ia~~--~Gg~v~~~~~~~h-G~~~~i~-~-~~~~lf~g~~--~~~~v~~yHs~---~v~~~---~lp~-~~~v~a~s- 147 (195) T PRK07649 83 QSIAQV--FGGEVVRAERLMH-GKTSLMH-H-DGKTIFSDIP--NPFTATRYHSL---IVKKE---TLPD-CFEVTAWT- 147 (195) T ss_pred HHHHHH--CCCCEEECCCEEE-CEEEEEE-E-CCCCCCCCCC--CCEEEEECCCE---EEEEC---CCCC-CEEEEEEC- T ss_conf 999987--3972833796076-4065799-8-8976343888--87589861615---89623---5787-42899994- Q ss_pred CCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 8888988112668886999899984884343221006776289998642 Q gi|254780971|r 170 GTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 170 ~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .+|. |+||-.++.+++|.+.|||- + .+++|+.||+|++ T Consensus 148 ---~~~~---i~ai~h~~~pi~GVQFHPEs-~----~T~~G~~il~NFl 185 (195) T PRK07649 148 ---EEGE---IMAIRHKTLPIEGVQFHPES-I----MTSHGKELLQNFI 185 (195) T ss_pred ---CCCC---EEEEEECCCCEEEEEECCCC-C----CCCCHHHHHHHHH T ss_conf ---8997---89999389988999868988-8----9977689999999 No 69 >cd01741 GATase1_1 Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. This group contains a subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. GATase activity catalyses the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. Glutamine amidotransferases (GATase) includes the triad family of amidotransferases which have a conserved Cys-His-Glu catalytic triad in the glutaminase active site. In this subgroup this triad is conserved. GATase activity can be found in a range of biosynthetic enzymes, including: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase , anthranilate synthase component II, glutamine-dependent carbamoyl phosphate synthase, cytidine triphosphate synthetase, gamma-glutamyl hydrolase, imidazole glycerol phosphate synthase and, cobyric acid synthase. Glutamine amidotransferase (GATase) domains can occur either as single polypeptides, as in glutamine Probab=99.02 E-value=3.6e-09 Score=79.48 Aligned_cols=172 Identities=20% Similarity=0.216 Sum_probs=98.3 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCC--CEEEEE---CCC--CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 68998449706479999999985499--639982---154--64456478997287345431140677520021212233 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQ--SPILVW---QSD--TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKK 74 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~--~~~~v~---~~~--~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~ 74 (219) |++||..--... .++...+-+..+. ..+.+. ..+ .++.+||++|+-||-.--++-.. .+. ..+.+.+++ T Consensus 1 ~i~il~~~~~~~-~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~dgvii~Gg~~~~~~~~~-pwi--~~~~~~i~~ 76 (188) T cd01741 1 RILILQHDTPEG-PGLFEDLLREAGAETIEIDVVDVYAGELLPDLDDYDGLVILGGPMSVDEDDY-PWL--KKLKELIRQ 76 (188) T ss_pred CEEEEECCCCCC-CHHHHHHHHHCCCCCEEEEEEECCCCCCCCCHHHCCEEEEECCCCCCCCCCC-HHH--HHHHHHHHH T ss_conf 989997799999-7489999996699846899996666989978444088999699877786788-669--999999999 Q ss_pred HHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECC--CHHHHHHHCCCCCEEEEEEECCCEEEEECHH Q ss_conf 22059717860640310100000101101244332454225767525--2357764137996899862013302330867 Q gi|254780971|r 75 KAQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVN--SNTAFTKSYKMNQIIKCPVAHHDGNYFIDAK 152 (219) Q Consensus 75 ~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~--~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~ 152 (219) ..++++|+||||-|.|+|.+. +.|+..++..+ .+-.|..+++.+ .+.+++.++. ..+.++..|++- + T Consensus 77 ~~~~~~PilGIC~G~Qlla~a--lGg~V~~~~~~-~e~G~~~i~lt~~~~~~~~~~~~~--~~~~~~~~H~d~-v----- 145 (188) T cd01741 77 ALAAGKPVLGICLGHQLLARA--LGGKVGRNPKG-WEIGWFPVTLTEAGKADPLFAGLP--DEFPVFHWHGDT-V----- 145 (188) T ss_pred HHHCCCCEEEECHHHHHHHHH--CCCEEEECCCC-CCEEEEEEEEECCCCCCCCCCCCC--CCCCHHHHHHEE-E----- T ss_conf 998799999984689999998--39999878987-521579999954766542214799--765434021027-2----- Q ss_pred HHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHH Q ss_conf 74115422503653168888898811266888699989998488434 Q gi|254780971|r 153 GLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPEN 199 (219) Q Consensus 153 ~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER 199 (219) ..|-++- ..+-. |+ .+.++++.-. .|++|.+.|||. T Consensus 146 --~~lP~~~-~~la~----s~---~~~~qa~~~~-~~~~g~QfHPE~ 181 (188) T cd01741 146 --VELPPGA-VLLAS----SE---ACPNQAFRYG-DRALGLQFHPEE 181 (188) T ss_pred --EECCCCC-EEEEE----CC---CCCEEEEEEC-CEEEEEECCCCH T ss_conf --4389998-99998----89---9978899979-958999576127 No 70 >pfam07722 Peptidase_C26 Peptidase C26. These peptidases have gamma-glutamyl hydrolase activity; that is they catalyse the cleavage of the gamma-glutamyl bond in poly-gamma-glutamyl substrates. They are structurally related to pfam00117, but contain extensions in four loops and at the C terminus. Probab=99.00 E-value=1.3e-08 Score=76.02 Aligned_cols=160 Identities=24% Similarity=0.407 Sum_probs=86.3 Q ss_pred HHHHHHCCCCEEEEECCC--C----CCCCCCEEEECCCCC------CCCCC--CHHHH-HHHHHH-HHHHHHHHHCCCEE Q ss_conf 999985499639982154--6----445647899728734------54311--40677-520021-21223322059717 Q gi|254780971|r 19 KAITKIIGQSPILVWQSD--T----DIPDVDLIVIPGGFS------YGDYL--RCGAI-AARTPV-MQAIKKKAQQGIKV 82 (219) Q Consensus 19 ~A~~~~~~~~~~~v~~~~--~----~l~~~d~lvipGGFS------ygD~l--~aG~i-~~~~~~-~~~i~~~~~~g~~v 82 (219) +|+++ +|..++.+.... . -++.+|+|+||||=+ ||... ..+.+ ..+... ...++...++++|+ T Consensus 31 ~~i~~-aGg~pv~iP~~~~~~~~~~~l~~~dGlll~GG~~di~P~~yg~~~~~~~~~~~~~Rd~~e~~l~~~a~~~~~Pi 109 (219) T pfam07722 31 KAVEG-AGGLPVLLPILEDPEDLEALLDLVDGLLLTGGQSNVDPSLYGEEPSPSGGYYDPARDAFELALIRAALERGKPI 109 (219) T ss_pred HHHHH-CCCEEEEECCCCCHHHHHHHHHHCCEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCE T ss_conf 99998-59999997899998999999976798998598876782214677686678757888999999999998769987 Q ss_pred EEECCCHHHHEE-CC--C---HHHH----HCCCC-CCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECH Q ss_conf 860640310100-00--0---1011----01244-332454225767525235776413799689986201330233086 Q gi|254780971|r 83 MGICNGFQILVE-LN--L---LPGI----LMRNC-SLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDA 151 (219) Q Consensus 83 LGICNGfQiL~e-lG--L---lPg~----l~~N~-s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~ 151 (219) ||||-|||+|.- +| | ++.. ..+.. ..++ ...-.+++. ..|.+-+-+. .+.+.+...|.++ T Consensus 110 lGICrG~Qllnv~~GGtL~q~i~~~~~~~~~~~~~~~~~-~~~h~v~i~-~~s~l~~~~~-~~~~~vnS~Hhq~------ 180 (219) T pfam07722 110 LGICRGMQLLNVALGGTLYQDIPEHPGNSDHHHQLAVQY-APSHAVSLE-PGSLLARLLG-AEEILVNSLHHQA------ 180 (219) T ss_pred EEECHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC-CCCEEEEEC-CCCHHHHHHC-CCCEEEECCHHHH------ T ss_conf 997650899999967830002301257544566887642-560389977-8973998729-9845797450657------ Q ss_pred HHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCC--EEEECCCCH Q ss_conf 77411542250365316888889881126688869998--999848843 Q gi|254780971|r 152 KGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGN--VLGMMPHPE 198 (219) Q Consensus 152 ~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~--vlgmMPHPE 198 (219) +++|-+ +..+.-. +|.|. |.+|..++.+ ++|.|-||| T Consensus 181 --I~~l~~-~l~v~A~----a~Dg~---iEaie~~~~~~f~~GVQwHPE 219 (219) T pfam07722 181 --IKRLAP-GLRVEAT----APDGT---IEAIESPNAPYFVLGVQWHPE 219 (219) T ss_pred --HHCCCC-CCEEEEE----ECCCC---EEEEEECCCCCCEEEEEECCC T ss_conf --740489-8589999----48997---899996699986799972889 No 71 >cd03130 GATase1_CobB Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. CobB plays a role in cobalamin biosythesis catalyzing the conversion of cobyrinic acid to cobyrinic acid a,c-diamide. CobB belongs to the triad family of amidotransferases. Two of the three residues of the catalytic triad that are involved in glutamine binding, hydrolysis and transfer of the resulting ammonia to the acceptor substrate in other triad aminodotransferases are conserved in CobB. Probab=98.98 E-value=5.3e-09 Score=78.40 Aligned_cols=159 Identities=20% Similarity=0.283 Sum_probs=96.6 Q ss_pred HHHHHHHHHHCCCCEE-EEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHE Q ss_conf 9999999985499639-982154644564789972873454311406775200212122332205971786064031010 Q gi|254780971|r 15 NDMIKAITKIIGQSPI-LVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILV 93 (219) Q Consensus 15 ~e~~~A~~~~~~~~~~-~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~ 93 (219) .|+...++++ |.+.+ .--..|..|++.|+|.||||| -.+-+..++++..++++|++++++|+|+++.|.|++.|+ T Consensus 14 ~enl~~L~~~-Gaelv~FSPl~d~~lP~~d~lylgGGy---PE~~a~~Ls~N~~~~~~ir~~~~~G~PiyAECGGlMyL~ 89 (198) T cd03130 14 PENLELLEAA-GAELVPFSPLKDEELPDADGLYLGGGY---PELFAEELSANQSMRESIRAFAESGGPIYAECGGLMYLG 89 (198) T ss_pred HHHHHHHHHC-CCEEEEECCCCCCCCCCCCEEECCCCC---HHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECHHHHHHH T ss_conf 9999999988-997898777678889987689827998---758899997679999999999984997799820488874 Q ss_pred E------------CCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCC Q ss_conf 0------------0001011012443324542257675252357764137996899862013302330867741154225 Q gi|254780971|r 94 E------------LNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNN 161 (219) Q Consensus 94 e------------lGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~ 161 (219) + .|+||+.-...+..++ -.+.+++.. .++|.. .++.+.-.=-|- .+...++ .. T Consensus 90 ~~i~~~~G~~~~M~Gllp~~~~m~~rl~l--GY~~~~~~~-~~~~~~---~g~~lrGHEFHy-S~~~~~~--------~~ 154 (198) T cd03130 90 ESLDDEEGQSYPMAGVLPGDARMTKRLGL--GYREAEALG-DTLLGK---KGTTLRGHEFHY-SRLEPPP--------EP 154 (198) T ss_pred HHCCCCCCCEEEEEEEEEEEEEECCCCCC--CEEEEEECC-CCCCCC---CCCEEEEEEEEE-EEEECCC--------CC T ss_conf 22055899897502115168897776325--638999827-862237---996888777551-0403189--------87 Q ss_pred EEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCC Q ss_conf 036531688888988112668886999899984884 Q gi|254780971|r 162 QIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHP 197 (219) Q Consensus 162 ~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHP 197 (219) .-+++-.......+.. .|.+ .+|++|.-+|- T Consensus 155 ~~~~~~~~~~g~~~~~---~G~~--~~n~~asY~H~ 185 (198) T cd03130 155 DFAATVRRGRGIDGGE---DGYV--YGNVLASYLHL 185 (198) T ss_pred CCEEEEEECCCCCCCC---CEEE--ECCEEEEEEEE T ss_conf 5207998156998882---7899--89999998786 No 72 >COG2071 Predicted glutamine amidotransferases [General function prediction only] Probab=98.97 E-value=1.9e-08 Score=74.87 Aligned_cols=177 Identities=24% Similarity=0.348 Sum_probs=97.8 Q ss_pred HHHHHHHHCCCCEEEEECC-C-----CCCCCCCEEEECCCCCCCC------CCCHHHHH---HH-HHHHHHHHHHHHCCC Q ss_conf 9999998549963998215-4-----6445647899728734543------11406775---20-021212233220597 Q gi|254780971|r 17 MIKAITKIIGQSPILVWQS-D-----TDIPDVDLIVIPGGFSYGD------YLRCGAIA---AR-TPVMQAIKKKAQQGI 80 (219) Q Consensus 17 ~~~A~~~~~~~~~~~v~~~-~-----~~l~~~d~lvipGGFSygD------~l~aG~i~---~~-~~~~~~i~~~~~~g~ 80 (219) -..|..++++. +..+-.- + .-++..|+|+++|| +.=| ....-... .+ +--+.-++...++++ T Consensus 31 yv~ai~~aGg~-pillP~~~d~~~~~~~l~~iDgliltGg-~nV~P~~YGee~~~~~~~~~p~RD~~E~aLi~~ALe~~i 108 (243) T COG2071 31 YVDAIIKAGGI-PILLPALEDPEDARQYLDLIDGLILTGG-SNVDPSLYGEEPSEKDGPYDPERDAFELALIRAALERGI 108 (243) T ss_pred HHHHHHHCCCC-EEEECCCCCHHHHHHHHHHCCEEEECCC-CCCCHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCC T ss_conf 99999976995-0885488877789999865367895289-868987839888855677884300778999999997699 Q ss_pred EEEEECCCHHHHE-ECC--------CHHHHHCCCCCC--CEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEE Q ss_conf 1786064031010-000--------010110124433--24542257675252357764137996899862013302330 Q gi|254780971|r 81 KVMGICNGFQILV-ELN--------LLPGILMRNCSL--KFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFI 149 (219) Q Consensus 81 ~vLGICNGfQiL~-elG--------LlPg~l~~N~s~--rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~ 149 (219) |+||||-|+|+|- -+| -.|+.+-|.... .+++..++++ ..|..-+-+.+.. ..+.+-|. T Consensus 109 PILgICRG~QllNVa~GGtL~q~i~~~~~~~~H~~~~~~~~~~H~V~i~---~~s~La~i~g~~~-~~VNS~Hh------ 178 (243) T COG2071 109 PILGICRGLQLLNVALGGTLYQDISEQPGHIDHRQPNPVHIESHEVHIE---PGSKLAKILGESE-FMVNSFHH------ 178 (243) T ss_pred CEEEECCCHHHHHHHHCCEEEHHHHCCCCCCCCCCCCCCCCCEEEEEEC---CCCCHHHHCCCCC-EEECCHHH------ T ss_conf 8899840157887875675302123054500256999865431579954---7841777537652-26315177------ Q ss_pred CHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECC-CCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 8677411542250365316888889881126688869-99899984884343221006776289998642 Q gi|254780971|r 150 DAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINR-RGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 150 ~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~-~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) +.+++|-..=.|..+=.| |- |.+|.+. +..++|.|=|||..++.. ..-++||+.++ T Consensus 179 --QaIk~La~~L~V~A~a~D-----G~---VEAie~~~~~fvlGVQWHPE~~~~~~---~~~~~LFe~F~ 235 (243) T COG2071 179 --QAIKKLAPGLVVEARAPD-----GT---VEAVEVKNDAFVLGVQWHPEYLVDTN---PLSLALFEAFV 235 (243) T ss_pred --HHHHHHCCCCEEEEECCC-----CC---EEEEEECCCCEEEEEECCHHHHCCCC---HHHHHHHHHHH T ss_conf --899976788679999799-----95---89898538850799861755522678---68899999999 No 73 >PRK05665 amidotransferase; Provisional Probab=98.97 E-value=2.1e-08 Score=74.57 Aligned_cols=169 Identities=20% Similarity=0.259 Sum_probs=98.8 Q ss_pred CEEEEEECCCCC-------CHH-HHHHHHHHHCCCCE----EEEECCC--CCCCCCCEEEECCCC--CCCCCCCHHHHHH Q ss_conf 968998449706-------479-99999998549963----9982154--644564789972873--4543114067752 Q gi|254780971|r 1 MKTAIVQIPGLN-------RDN-DMIKAITKIIGQSP----ILVWQSD--TDIPDVDLIVIPGGF--SYGDYLRCGAIAA 64 (219) Q Consensus 1 mkvaVl~~pGsN-------cd~-e~~~A~~~~~~~~~----~~v~~~~--~~l~~~d~lvipGGF--SygD~l~aG~i~~ 64 (219) ||+|||..--.+ +|+ +|...+.+..+.+. +.|...+ .++.+||+++|.|+= .|.|. .|. T Consensus 3 mki~IL~~d~~~~~~~~~~g~y~~mF~~ll~~~~~~~~~~~~~v~~ge~P~~~~~~dg~vITGS~~~vyd~~----pWi- 77 (240) T PRK05665 3 LRICILETDVLRPELVAQYQGYGRMFEQLFARQPIAAEFVVYNVVQGDYPADDEKFDAYLVTGSKADSFGTD----PWI- 77 (240) T ss_pred EEEEEEECCCCCHHHHHHCCCHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCHHHCCEEEEECCCCCCCCCC----HHH- T ss_conf 089999789998789977499889999999618998559999700687998977878999958987468996----769- Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCC Q ss_conf 00212122332205971786064031010000010110124433245422576752523577641379968998620133 Q gi|254780971|r 65 RTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHD 144 (219) Q Consensus 65 ~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgE 144 (219) .++.+-+++..++++|+||||=|.|+|.+. |.|...+|..+ ++.....+++. ...+|+.... +.+.++..|.+ T Consensus 78 -~~L~~~i~~~~~~~~pilGICfGHQliA~A--LGG~V~~~~~G-w~vG~~~~~~~-~~~~~~~~~~--~~~~l~~~H~D 150 (240) T PRK05665 78 -QTLKTYLLKLYERGDKLLGVCFGHQLLALL--LGGKAERASQG-WGVGIHRYQLA-AHAPWMSPAV--TELTLLISHQD 150 (240) T ss_pred -HHHHHHHHHHHHCCCCEEEEEHHHHHHHHH--HCCEEEECCCC-CEEEEEEEEEC-CCCCCCCCCC--CCCEEEEEECC T ss_conf -999999999987699989994436888886--09836127997-36667998875-8774335875--64079977224 Q ss_pred EEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCH Q ss_conf 023308677411542250365316888889881126688869998999848843 Q gi|254780971|r 145 GNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPE 198 (219) Q Consensus 145 Grf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPE 198 (219) -=. ++-+.-.+.. . .+.-| ++++. -.+|+++++.||| T Consensus 151 ~V~--------~lP~ga~~La-s-s~~cp------~q~~~-~~~~~~~~Q~HPE 187 (240) T PRK05665 151 QVT--------ALPEGATVIA-S-SDFCP------NAAYH-IRDQVLCFQGHPE 187 (240) T ss_pred EEE--------ECCCCEEEEE-E-CCCCC------HHHEE-CCCCEEEEECCCC T ss_conf 278--------5799848998-6-79983------33426-2786899916968 No 74 >PRK06490 glutamine amidotransferase; Provisional Probab=98.93 E-value=1.2e-08 Score=76.20 Aligned_cols=172 Identities=17% Similarity=0.181 Sum_probs=96.9 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEE--CCC---CCCCCCCEEEECCC-CCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 68998449706479999999985499639982--154---64456478997287-3454311406775200212122332 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVW--QSD---TDIPDVDLIVIPGG-FSYGDYLRCGAIAARTPVMQAIKKK 75 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~--~~~---~~l~~~d~lvipGG-FSygD~l~aG~i~~~~~~~~~i~~~ 75 (219) ||.||+-.-.-..--....+.+ .|.+..++. ..| ..+++||++|+-|| .|--|.. .+.. ...+-|++. T Consensus 13 ~vLviqH~~~e~pG~i~~~L~~-~G~~~~i~r~~~gd~lP~~l~~~dglvVlGGpmsa~D~~---p~l~--~e~~~I~~~ 86 (243) T PRK06490 13 PILIVLHQERSTPGRVGQLLRE-RGYPLDIRRPRLGDPLPETLEDHAGAVIFGGPMSANDPD---DFIR--REIDWISVP 86 (243) T ss_pred CEEEEECCCCCCCHHHHHHHHH-CCCEEEEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCC---CHHH--HHHHHHHHH T ss_conf 5799966899997299999997-898589996789998999754605899928999989987---1799--999999999 Q ss_pred HHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHH Q ss_conf 20597178606403101000001011012443324542257675252357764137996899862013302330867741 Q gi|254780971|r 76 AQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLA 155 (219) Q Consensus 76 ~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~ 155 (219) .++++|+||||-|.|+|.+. +.|...+|..++.|--|..++.......++. .. ..+ .--|+|+ |..++. T Consensus 87 l~~~~P~LGICLGaQLlA~a--lGg~V~~~~~g~~EiG~~~i~~t~~g~~~~~-~p-~~v---~~wH~e~-f~LP~g--- 155 (243) T PRK06490 87 LKENKPFLGICLGAQMLARH--LGARVAPHPDGRVEIGYYPLRPTEAGRALMP-WP-EMV---YHWHREG-FDLPKG--- 155 (243) T ss_pred HHCCCCEEEECHHHHHHHHH--HCCEEECCCCCCCCCCCEEEEECCCCCCCCC-CC-CEE---EEEECCC-CCCCCC--- T ss_conf 98699889988849999998--3988844888877578523567745323458-97-836---7862432-006999--- Q ss_pred HHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHH---CCCC Q ss_conf 154225036531688888988112668886999899984884343---2210 Q gi|254780971|r 156 EIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENI---IEKF 204 (219) Q Consensus 156 ~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~---~~~~ 204 (219) -..+-.. +.-|| .++.- ..|++|++.|||=. ++.| T Consensus 156 ------A~lLA~s-~~c~~------QAFr~-G~~a~g~QFHpEvt~~~i~~W 193 (243) T PRK06490 156 ------AELLATG-DDFPN------QAFRY-GDNAWGLQFHPEVTRAMMHRW 193 (243) T ss_pred ------CEEEECC-CCCCH------HHEEE-CCCEEEEECCEECCHHHHHHH T ss_conf ------5898789-99836------82897-694899954847878999999 No 75 >PRK09065 glutamine amidotransferase; Provisional Probab=98.92 E-value=3.6e-08 Score=73.10 Aligned_cols=173 Identities=17% Similarity=0.230 Sum_probs=101.3 Q ss_pred CEEEEEECCCCC-------CHHH-HHHHHHHHCCCCEE--EEECCC--CCCCCCCEEEECCCCC-CCCCCCHHHHHHHHH Q ss_conf 968998449706-------4799-99999985499639--982154--6445647899728734-543114067752002 Q gi|254780971|r 1 MKTAIVQIPGLN-------RDND-MIKAITKIIGQSPI--LVWQSD--TDIPDVDLIVIPGGFS-YGDYLRCGAIAARTP 67 (219) Q Consensus 1 mkvaVl~~pGsN-------cd~e-~~~A~~~~~~~~~~--~v~~~~--~~l~~~d~lvipGGFS-ygD~l~aG~i~~~~~ 67 (219) +|++||..--.. +|++ |...+.+..+.... .+...+ -++++||++|+-|+=+ --|.. -|. .. T Consensus 4 ~ki~IL~~g~~~~~~~~~~G~y~~~f~~ll~~~~~~~~~~~v~~ge~~p~~~~~dg~iitGS~~~v~d~~---pWi--~~ 78 (238) T PRK09065 4 LPLLIIETGTPPPSIRARYGDFPHWIRVALGLAEQPVVVVDVFAGEPLPAPEDFAGVLITGSHAMVTDRA---PWS--ER 78 (238) T ss_pred CEEEEEECCCCCHHHHHHCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHCCEEEEECCCCCCCCCC---HHH--HH T ss_conf 1699998898987999873998999999872369857999672798998957848899979852057998---889--99 Q ss_pred HHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECC--CHHHHHHHCCCCCEEEEEEECCCE Q ss_conf 121223322059717860640310100000101101244332454225767525--235776413799689986201330 Q gi|254780971|r 68 VMQAIKKKAQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVN--SNTAFTKSYKMNQIIKCPVAHHDG 145 (219) Q Consensus 68 ~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~--~~s~~~~~~~~~~~l~~piaHgEG 145 (219) +.+.+++..++++|+||||-|.|+|.+. +.|...+|..+. |--|..++... .+.+++.++. ..+.++..|.+- T Consensus 79 l~~~ir~~~~~~~PilGICfGhQllA~A--lGG~V~~~p~G~-eiG~~~v~lt~~g~~dplf~~~p--~~~~~~~~H~D~ 153 (238) T PRK09065 79 TADWLRQAAAAGMPLLGICYGHQLLAHA--LGGEVGYNPAGR-ESGTVALELHPAAAQDPLFAGLP--ATFPAHATHLQT 153 (238) T ss_pred HHHHHHHHHHCCCCEEEEEHHHHHHHHH--HCCEEEECCCCC-CCCEEEEEECHHCCCCCHHCCCC--CCCEEEEEEECE T ss_conf 9999999987799999981778999998--498797679976-36512568751004571111898--761799995142 Q ss_pred EEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHH Q ss_conf 2330867741154225036531688888988112668886999899984884343 Q gi|254780971|r 146 NYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENI 200 (219) Q Consensus 146 rf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~ 200 (219) =+. |-+.- ..+=.. +..| ++++.- ..|+++++.|||=. T Consensus 154 V~~--------lP~ga-~~LAss-~~c~------~qa~~~-g~~~~gvQfHPEft 191 (238) T PRK09065 154 VLR--------LPPGA-VVLARS-AHDP------HQAFRY-GPSAWGVQFHPEFT 191 (238) T ss_pred EEE--------CCCCE-EEEEEC-CCCH------HHHEEE-CCCEEEEECCCCCC T ss_conf 896--------49983-899809-9990------571785-89489995686289 No 76 >PRK09522 bifunctional anthranilate synthase/anthranilate phosphoribosyltransferase; Provisional Probab=98.88 E-value=7.5e-08 Score=71.06 Aligned_cols=178 Identities=16% Similarity=0.234 Sum_probs=101.3 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCC-------CC--CCCCEEEECCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 9689984497064799999999854996399821546-------44--56478997287345431140677520021212 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDT-------DI--PDVDLIVIPGGFSYGDYLRCGAIAARTPVMQA 71 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~-------~l--~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~ 71 (219) +||.+|.---|= -+-.++.+.. .+.++.+ .++|. .+ .+.+.|||-.|=+.- ..+....+ T Consensus 2 ~~ILlIDNyDSF-TyNL~~~l~~-~g~~v~V-~rnd~~~~~~~e~~~~~~p~~IVlSPGPG~P---------~dag~~~~ 69 (531) T PRK09522 2 ADILLLDNIDSF-TYNLADQLRS-NGHNVVI-YRNHIPAQTLIERLATMSNPVLMLSPGPGVP---------SEAGCMPE 69 (531) T ss_pred CEEEEEECCCCH-HHHHHHHHHH-CCCCEEE-EECCCCCCCCHHHHHHCCCCEEEECCCCCCC---------HHCCCHHH T ss_conf 709998288862-8899999986-6995699-9699831301999972495969989999890---------23657399 Q ss_pred HHHHHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECH Q ss_conf 23322059717860640310100000101101244332454225767525235776413799689986201330233086 Q gi|254780971|r 72 IKKKAQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDA 151 (219) Q Consensus 72 i~~~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~ 151 (219) +.+......|+||||-|.|.|.+. ..|.+.+-..-.| -+-.. +..+.+..++++. +.+.+|-.|+- + T Consensus 70 ii~~~~~~iPILGICLGhQ~ia~~--~Gg~v~~a~~~~H-Gk~s~--i~~~~~~lF~glp--~~~~v~rYHSl---~--- 136 (531) T PRK09522 70 LLTRLRGKLPIIGICLGHQAIVEA--YGGYVGQAGEILH-GKASS--IEHDGQAMFAGLT--NPLPVARYHSL---V--- 136 (531) T ss_pred HHHHHCCCCCEEEECHHHHHHHHH--CCCEEEECCCCCC-CEEEE--EEECCCCCCCCCC--CCCEEEEECEE---C--- T ss_conf 999865999989987877899997--2988967798432-33656--7989852028999--99779964433---0--- Q ss_pred HHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 77411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 152 KGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 152 ~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) .+.+.+.-.|.. .. +| -|.|+-.++..+.|.+.|||-.. +++|++|++|++. T Consensus 137 --~~~lP~~l~v~A-~~-----~g---~iMai~h~~~pi~GVQFHPESi~-----T~~G~~il~NFLn 188 (531) T PRK09522 137 --GSNIPAGLTINA-HF-----NG---MVMAVRHDADRVCGFQFHPESIL-----TTQGARLLEQTLA 188 (531) T ss_pred --CCCCCCCEEEEE-EE-----CC---EEEEEEECCCCEEEEEECCCCCC-----CCCHHHHHHHHHC T ss_conf --447999879997-16-----99---99999989998699820875468-----8777999999856 No 77 >KOG0623 consensus Probab=98.85 E-value=2.5e-08 Score=74.13 Aligned_cols=185 Identities=26% Similarity=0.396 Sum_probs=113.9 Q ss_pred EEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 98449706479999999985499639982154644564789972873454311406775200212122332205971786 Q gi|254780971|r 5 IVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMG 84 (219) Q Consensus 5 Vl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLG 84 (219) +|.+ |+---+....|+++ .|++...+ +++.++.+.+-||+||==.||-.. -...+....+.+.++++.|+|.+| T Consensus 6 ~ld~-~agn~~si~nal~h-lg~~i~~v-~~P~DI~~a~rLIfPGVGnfg~~~---D~L~~~Gf~eplr~YiesgkPfmg 79 (541) T KOG0623 6 LLDY-GAGNVRSIRNALRH-LGFSIKDV-QTPGDILNADRLIFPGVGNFGPAM---DVLNRTGFAEPLRKYIESGKPFMG 79 (541) T ss_pred EEEC-CCCCHHHHHHHHHH-CCCEEEEC-CCCHHHCCCCEEEECCCCCCHHHH---HHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 9841-78647889999986-38365542-581220367437603866412788---887643247899999866997476 Q ss_pred ECCCHHHHEE----------CCCHHHHHCCCC-CCC--EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECH Q ss_conf 0640310100----------000101101244-332--454225767525235776413799689986201330233086 Q gi|254780971|r 85 ICNGFQILVE----------LNLLPGILMRNC-SLK--FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDA 151 (219) Q Consensus 85 ICNGfQiL~e----------lGLlPg~l~~N~-s~r--f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~ 151 (219) ||-|.|.|.+ ||.+||.+-+-+ |.+ =.-.|-.+.|.+ .+-|+-.- .... ...-|. |.+. T Consensus 80 icvGlQaLF~gSvE~p~skGLgvipg~v~RFD~s~k~VPhIGWNsc~v~s-d~effg~~-p~~~--~YFVHS----yl~~ 151 (541) T KOG0623 80 ICVGLQALFDGSVENPPSKGLGVIPGIVGRFDASAKIVPHIGWNSCQVGS-DSEFFGDV-PNRH--VYFVHS----YLNR 151 (541) T ss_pred EHHHHHHHHCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCC-CCCCCCCC-CCCE--EEEEEE----ECCC T ss_conf 31457998636443788676435444011035777857745631154577-52002567-8734--999853----1133 Q ss_pred HHHHHHHHCCE-EE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 77411542250-36-531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 152 KGLAEIEKNNQ-IV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 152 ~~l~~l~~~~~-i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) +.-+.++..+- |+ .+| ||..=|++| ..+|+++.+.|||.+-+. |+..+++++ T Consensus 152 ek~~~len~~wkiat~kY-------G~E~Fi~ai--~knN~~AtQFHPEKSG~a------GL~vl~~FL 205 (541) T KOG0623 152 EKPKSLENKDWKIATCKY-------GSESFISAI--RKNNVHATQFHPEKSGEA------GLSVLRRFL 205 (541) T ss_pred CCCCCCCCCCCEEEEECC-------CCHHHHHHH--HCCCEEEEECCCCCCCCH------HHHHHHHHH T ss_conf 345678987736764046-------708899998--607654575265445501------189999998 No 78 >TIGR00337 PyrG CTP synthase; InterPro: IPR004468 CTP synthase is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. The enzyme catalyzes the reaction L-glutamine + H2O + UTP + ATP = CTP + phosphate + ADP + L-glutamate. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. This gene has been found circa 500 bp 5 upstream of enolase in both beta (Nitrosomonas europaea) and gamma (Escherichia coli) subdivisions of Proteobacterium .; GO: 0003883 CTP synthase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=98.83 E-value=2.3e-08 Score=74.34 Aligned_cols=157 Identities=28% Similarity=0.382 Sum_probs=100.1 Q ss_pred CCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHE-E-----CCCH-------------H- Q ss_conf 564789972873454311406775200212122332205971786064031010-0-----0001-------------0- Q gi|254780971|r 40 PDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILV-E-----LNLL-------------P- 99 (219) Q Consensus 40 ~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~-e-----lGLl-------------P- 99 (219) .++|+|.+|||| |++--.|+|.| ++-+-++..|-||||-|+|+.+ | +||- | T Consensus 372 ~~~DGILVPGGF--G~RG~EGKI~A-------i~yAREN~iPFLGICLGmQ~A~IEFARNV~Gl~~AnStEf~~~t~~p~ 442 (571) T TIGR00337 372 SEVDGILVPGGF--GERGVEGKIRA-------IKYARENNIPFLGICLGMQLAVIEFARNVLGLKGANSTEFDPETKYPK 442 (571) T ss_pred HCCCEEEECCCC--CCCCCHHHHHH-------HHHHHHCCCCEEEHHHHHHHHHHHHHHHHCCCCCCCHHHCCCCCCCCC T ss_conf 306806836888--97660247899-------999876369733021777888999976534798887412464557773 Q ss_pred --H-HHCCC-------CCC-----------CEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHH Q ss_conf --1-10124-------433-----------24542257675252357764137996899862013302330867741154 Q gi|254780971|r 100 --G-ILMRN-------CSL-----------KFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIE 158 (219) Q Consensus 100 --g-~l~~N-------~s~-----------rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~ 158 (219) - .|++. +.- |.=.+.+.++-....|...+-|....|.+=+ .| ||-+|++..++++ T Consensus 443 ~~Vv~LlpEq~d~~~~~~VRt~~~~lGGTMRLG~ypc~l~~~~eWt~a~klYg~~~v~ERH-RH---RYE~Nn~~~~~~E 518 (571) T TIGR00337 443 SPVVDLLPEQKDQNGNKEVRTEIEDLGGTMRLGLYPCILKPGSEWTLAFKLYGKEEVYERH-RH---RYEVNNEYREELE 518 (571) T ss_pred CCCEECCCCCCCCCCCEECCEEECCCCCEEECCCCCEEECCCCCHHHHHHHCCCCEEEEEC-CC---CCCCCHHHHHHHH T ss_conf 3111425322244676001301013872430587221771787232675533786788712-44---4122757898998 Q ss_pred HCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 159 KNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 159 ~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) ++|+++-=+ -|+|-..-|.=|- ..-=.+|-+.|||=...+ .++.++|..|| T Consensus 519 ~~GL~vsG~----S~dg~l~EiiE~~-~HP~FvA~QFHPEF~Srp----~~p~plF~Gfv 569 (571) T TIGR00337 519 NKGLIVSGT----SPDGRLVEIIELP-DHPFFVACQFHPEFTSRP----NRPHPLFLGFV 569 (571) T ss_pred HCCCEEEEE----CCCCCEEEEEEEC-CCCEEEEEECCCCCCCCC----CCCCCCCHHHH T ss_conf 579589998----6998389999847-978688862267511677----88887505532 No 79 >PRK06186 hypothetical protein; Validated Probab=98.83 E-value=1.3e-07 Score=69.49 Aligned_cols=178 Identities=19% Similarity=0.228 Sum_probs=99.7 Q ss_pred HHHHHHHH---CCCCEEEEECCC------CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECC Q ss_conf 99999985---499639982154------644564789972873454311406775200212122332205971786064 Q gi|254780971|r 17 MIKAITKI---IGQSPILVWQSD------TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICN 87 (219) Q Consensus 17 ~~~A~~~~---~~~~~~~v~~~~------~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICN 87 (219) ...|+... ......+.|.+. ..|.++|++++|||| |.+-.-|+|.+ ++-.-++..|-||||- T Consensus 20 v~eAL~ha~~~~~~~v~i~wI~se~~~~~~~L~~~dgilv~pGf--G~RG~eGki~A-------i~yARen~iP~LGICL 90 (229) T PRK06186 20 IPLALDLAAEVLGLPVDYEWLPTDDIHDPEDLAGFDGIWCVPGS--PYRNEDGALTA-------IRFARENGIPFLGTCG 90 (229) T ss_pred HHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHCCCCEEEECCCC--CCCCHHHHHHH-------HHHHHHCCCCEEEEHH T ss_conf 99999987997099068999720116771022259989989987--77656389999-------9999876998786427 Q ss_pred CHHHHE-E-----CCCHHH---HHCCCCCCC--------EEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEEC Q ss_conf 031010-0-----000101---101244332--------45422576752523577641379968998620133023308 Q gi|254780971|r 88 GFQILV-E-----LNLLPG---ILMRNCSLK--------FVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFID 150 (219) Q Consensus 88 GfQiL~-e-----lGLlPg---~l~~N~s~r--------f~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~ 150 (219) |||+++ | +||--. .+-.+.... ..-....+++ ..+|...+-+.... +.=.|- -||-++ T Consensus 91 GmQ~avIEfARnvlgl~dAnS~Efd~~~~~pvi~~l~~~~~~~~~~~~l-~~~s~~~~iYg~~~---i~ERHR-HRYEvN 165 (229) T PRK06186 91 GFQHALLEYARNVLGWADAEHAETDPEASRPVIAPLSCSLVEATAAIRL-RPGSLIARAYGTDE---IEEGYH-CRYGVN 165 (229) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEECCCCCHHCCCCCEEE-CCCCHHHHHHCCCH---HHHHHC-CCCCCC T ss_conf 7899999999986599888764557899971893688732038751574-89983999869653---013323-455469 Q ss_pred HHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 67741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 151 AKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 151 ~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .+..++|++.|.+..-+ +|.|.+. |.=+- ..--.+|.+.|||-...+... -++|.++| T Consensus 166 ~~y~~~le~~Gl~~sG~----~~~~~ve-iiEl~-~Hpffvg~QfHPEf~Srp~~p----hPlF~~Fi 223 (229) T PRK06186 166 PEFAAALESGALRVSGW----DEEGDVR-AVELP-GHPFFVATLFQPERAALAGRL----PPLVRAFL 223 (229) T ss_pred HHHHHHHHHCCCEEEEE----CCCCCEE-EEEEC-CCCCEEEECCCCCCCCCCCCC----CCCHHHHH T ss_conf 89999999789899998----8999889-99928-998589965875456899998----81599999 No 80 >PRK05380 pyrG CTP synthetase; Validated Probab=98.82 E-value=3.7e-08 Score=73.01 Aligned_cols=178 Identities=27% Similarity=0.405 Sum_probs=102.3 Q ss_pred HHHHHHHH---CCCCEEEEECCCC---------CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 99999985---4996399821546---------44564789972873454311406775200212122332205971786 Q gi|254780971|r 17 MIKAITKI---IGQSPILVWQSDT---------DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMG 84 (219) Q Consensus 17 ~~~A~~~~---~~~~~~~v~~~~~---------~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLG 84 (219) ...||..+ .+..+.+.|.+.+ .|.++|++++|||| |++---|+|.| ++-+-+...|-|| T Consensus 307 v~EAL~Hag~~~~~kv~i~wIdse~le~~~~~~~L~~~dGIlVPGGF--G~RGiEGKI~A-------i~yARen~IPfLG 377 (534) T PRK05380 307 VIEALKHAGIANDVKVNIKWIDSEDLEEENVAELLKDVDGILVPGGF--GERGIEGKILA-------IRYARENNIPFLG 377 (534) T ss_pred HHHHHHHHHHHCCCCCEEEEEEHHHCCCCCHHHHHHCCCEEEECCCC--CCCCHHHHHHH-------HHHHHHCCCCCEE T ss_conf 99999988886387422799885763664677776318829957876--65431568999-------9999981998110 Q ss_pred ECCCHHHHE-E-----CCCHH--------HH------HCCC---C-----CCCEEEEEEEEEECCCHHHHHHHCCCCCEE Q ss_conf 064031010-0-----00010--------11------0124---4-----332454225767525235776413799689 Q gi|254780971|r 85 ICNGFQILV-E-----LNLLP--------GI------LMRN---C-----SLKFVCKQVLLEVVNSNTAFTKSYKMNQII 136 (219) Q Consensus 85 ICNGfQiL~-e-----lGLlP--------g~------l~~N---~-----s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l 136 (219) ||-|+|+.+ | +||=- .+ |++. . +-|.-+..+.++ ..|...+-+...+. T Consensus 378 ICLGMQ~avIEfARNVlgl~dAnStEfd~~t~~pVI~lm~eq~~~~~~GGTMRLG~y~~~l~---~gS~a~~~Yg~~~~- 453 (534) T PRK05380 378 ICLGMQLAVIEFARNVLGLEDANSTEFDPDTPHPVIDLMPEQKDVSDLGGTMRLGAYPCKLK---PGTLAAEIYGKTEV- 453 (534) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCCCEEEECHHHCCCCCCCCEEECCCEEEEEC---CCCHHHHHHCCCCE- T ss_conf 11223788999999845899973253589999986998823137765666045164456756---88899998589981- Q ss_pred EEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHH Q ss_conf 98620133023308677411542250365316888889881126688869998999848843432210067762899986 Q gi|254780971|r 137 KCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFAS 216 (219) Q Consensus 137 ~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~ 216 (219) +.=.| -=||-++.+..++|++.|.+.-- .+|.|..--|--+-+ .--.+|-+.|||=...|.. .-++|.+ T Consensus 454 -I~ERH-RHRYEvN~~y~~~le~~Gl~~sG----~s~d~~lvEiiEl~~-HpfFvg~QfHPEfkSrP~~----PhPlF~~ 522 (534) T PRK05380 454 -IYERH-RHRYEVNNKYREQLEKAGLVFSG----TSPDGRLVEIIELPD-HPWFVGVQFHPEFKSRPRR----PHPLFAG 522 (534) T ss_pred -ECCCC-CCCCCCCHHHHHHHHHCCCEEEE----ECCCCCEEEEEECCC-CCEEEEECCCCCCCCCCCC----CCCCHHH T ss_conf -66645-64200189999999978979999----989998789998689-9808996577665789999----9836999 Q ss_pred HC Q ss_conf 42 Q gi|254780971|r 217 LL 218 (219) Q Consensus 217 ~~ 218 (219) +| T Consensus 523 fI 524 (534) T PRK05380 523 FV 524 (534) T ss_pred HH T ss_conf 99 No 81 >pfam07685 GATase_3 CobB/CobQ-like glutamine amidotransferase domain. Probab=98.81 E-value=1.9e-08 Score=74.86 Aligned_cols=54 Identities=31% Similarity=0.592 Sum_probs=45.8 Q ss_pred CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEE Q ss_conf 445647899728734543114067752002121223322059717860640310100 Q gi|254780971|r 38 DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVE 94 (219) Q Consensus 38 ~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~e 94 (219) .+.++|+|+||||+.. +.+-.++.+..+.++|++++++|+||+|||.|+|+|.+ T Consensus 4 ~p~~~D~lilpGg~~~---~~~~~~~~~~~~~~~i~~~~~~G~pi~giCGG~q~Lg~ 57 (158) T pfam07685 4 LPGDADLVILPGGKPT---IQDLALLRNSGLDEALREHAEAGGPILGICGGYQMLGE 57 (158) T ss_pred CCCCCCEEEECCCCCH---HHHHHHHHHCCHHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 9999879998995409---99999987755799999999779969998758898765 No 82 >PRK07567 glutamine amidotransferase; Provisional Probab=98.81 E-value=5.5e-08 Score=71.93 Aligned_cols=159 Identities=16% Similarity=0.217 Sum_probs=92.8 Q ss_pred HHHHHHHCCCCEE-----EEECC---CCCCCCCCEEEECCC-CCCCCCC---CHHHHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 9999985499639-----98215---464456478997287-3454311---4067752002121223322059717860 Q gi|254780971|r 18 IKAITKIIGQSPI-----LVWQS---DTDIPDVDLIVIPGG-FSYGDYL---RCGAIAARTPVMQAIKKKAQQGIKVMGI 85 (219) Q Consensus 18 ~~A~~~~~~~~~~-----~v~~~---~~~l~~~d~lvipGG-FSygD~l---~aG~i~~~~~~~~~i~~~~~~g~~vLGI 85 (219) ..+|-+.+|-+.. .+... +.+|.+||++++.|| |+..|.. +.-.......+..-+++..+.++|+||| T Consensus 20 ~~~f~~~~gl~~~q~~~~rld~~~~p~~dL~~ydgvivgGsp~~vsd~~~~~~~~q~~~e~~l~~Ll~~~~~~~~P~LGi 99 (242) T PRK07567 20 YAAFLRYGGLKPAELRRIRLDHEELPDLDLDDFSGVIVGGSPFNVSDPAYSKSPTQRRVEAELSGLLDEVVARDFPFLGC 99 (242) T ss_pred HHHHHHHHCCCHHHEEEEEECCCCCCCCCHHHCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 99999974998899799981488999878300078999488975678544564688899999999999998469988998 Q ss_pred CCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECC--CHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEE Q ss_conf 640310100000101101244332454225767525--235776413799689986201330233086774115422503 Q gi|254780971|r 86 CNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVN--SNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQI 163 (219) Q Consensus 86 CNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~--~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i 163 (219) |-|.|+|... +.|...++ ..-+.-++.+.+.. ...|++.++. ..+..+..|+|--.-. -+ +-+ T Consensus 100 C~G~Qlla~a--lGG~V~~~--~~~e~G~~~i~lt~~g~~Dpl~~~lp--~~f~a~~~H~d~V~~L--------P~-ga~ 164 (242) T PRK07567 100 CYGVGTLGHH--LGGVVDRT--HGEPVGATTVTLTDAGRRDPLTADLP--DTFTAFTGHKEAVVAL--------PP-GAV 164 (242) T ss_pred CHHHHHHHHH--CCCEEECC--CCCEEEEEEEEECCCCCCCCCCCCCC--CCEEEEEECCCEEEEC--------CC-CCE T ss_conf 5638999997--29888279--98707799999814567791101798--4417998616413768--------99-848 Q ss_pred EEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHH Q ss_conf 653168888898811266888699989998488434 Q gi|254780971|r 164 VFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPEN 199 (219) Q Consensus 164 ~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER 199 (219) .+-+.+ .-| +.++. -..++.|.+.|||= T Consensus 165 lLA~s~-~cp------vqafr-~g~~~y~~QFHPE~ 192 (242) T PRK07567 165 LLATGP-TCP------VQMFR-AGENVYATQFHPEL 192 (242) T ss_pred EEECCC-CCC------HHEEE-ECCCEEEEECCCCC T ss_conf 998499-982------36189-37978999248407 No 83 >PRK05637 anthranilate synthase component II; Provisional Probab=98.80 E-value=1.8e-07 Score=68.65 Aligned_cols=184 Identities=16% Similarity=0.233 Sum_probs=95.0 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCC-----CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 6899844970647999999998549963998215464-----45647899728734543114067752002121223322 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTD-----IPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKA 76 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~-----l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~ 76 (219) ||.+|-.-.|= .+-.++.+. ..|.++.++-+ +.. -.+.|+|||-.|= |.= .......++.+.. T Consensus 3 ~ILlIDnyDSF-T~Nl~~~l~-~~g~~v~V~rn-~~~~~~~~~~~pd~ivlSPGP--------g~P-~d~g~~~~~~~~~ 70 (208) T PRK05637 3 HVVLIDNHDSF-VYNLVDAFA-VAGYKCTVFRN-TVPVETILAANPDLICLSPGP--------GYP-ADAGNMMALIERT 70 (208) T ss_pred EEEEEECCCCH-HHHHHHHHH-HCCCEEEEEEC-CCCHHHHHHCCCCEEEECCCC--------CCH-HHCCCHHHHHHHH T ss_conf 79999488975-999999998-67990799969-999999985199959999999--------995-7775749999997 Q ss_pred HCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECC--CHHHHHHHCCCC----------CEEEEEEECCC Q ss_conf 059717860640310100000101101244332454225767525--235776413799----------68998620133 Q gi|254780971|r 77 QQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVN--SNTAFTKSYKMN----------QIIKCPVAHHD 144 (219) Q Consensus 77 ~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~--~~s~~~~~~~~~----------~~l~~piaHgE 144 (219) ....|+||||-|+|.|++. ..|.+.+ .. ..--+-..+...+ ..++.+.++..+ +.++++-.|.- T Consensus 71 ~~~iPILGICLGhQ~i~~~--~Gg~v~~-~~-~~HG~~s~i~~~~~g~~~~lf~gl~~~~~~~~~~~~~~~~~v~rYHSl 146 (208) T PRK05637 71 LGQIPLLGICLGYQALIEY--HGGKVEP-CG-PVHGTTDNMILTDAGVQSPVFAGLATDVEPDHPEIPGRKVPIGRYHSL 146 (208) T ss_pred HCCCCEEEHHHHHHHHHHH--CCCEEEC-CC-CCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCC T ss_conf 3569822114778999997--0978830-67-765506678987888877000688765544553456775357710601 Q ss_pred EEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 023308677411542250365316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 145 GNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 145 Grf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) .+ ..+. ++......++.. .+ .-|.|+..++..+.|.+.|||-.. +++|+.|++|++. T Consensus 147 ---~~-----~~~p-~~~~~~~~~~~~--~~--~viMa~~h~~~p~~GVQFHPESi~-----T~~G~~il~nfv~ 203 (208) T PRK05637 147 ---GC-----VVAP-DGIESLGTCSSE--IG--DVIMAARTTDGKAIGLQFHPESVL-----SPTGPVILSRCVE 203 (208) T ss_pred ---EE-----EECC-CCCEEEEEEECC--CC--CEEEEEEECCCCEEEEEECCCCCC-----CCCHHHHHHHHHH T ss_conf ---67-----1079-997488999759--99--999999978999799866975368-----8777999999999 No 84 >KOG1622 consensus Probab=98.77 E-value=1.2e-08 Score=76.23 Aligned_cols=143 Identities=24% Similarity=0.429 Sum_probs=87.2 Q ss_pred CCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEE Q ss_conf 56478997287345431140677520021212233220597178606403101000001011012443324542257675 Q gi|254780971|r 40 PDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEV 119 (219) Q Consensus 40 ~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v 119 (219) ..+.++||-|| -|+=| +-..|-++ -..++-+.||||||-|+|++.++ ..|+..+-.. -|.-...+.+ T Consensus 58 ~~~rgiIiSGG-P~SVy------a~dAP~~d--p~if~~~vpvLGICYGmQ~i~~~--~Gg~V~~~~~--RE~G~~eI~v 124 (552) T KOG1622 58 YGPRGIIISGG-PNSVY------AEDAPSFD--PAIFELGVPVLGICYGMQLINKL--NGGTVVKGMV--REDGEDEIEV 124 (552) T ss_pred CCCEEEEEECC-CCCCC------CCCCCCCC--HHHHCCCCCCEEEHHHHHHHHHH--HCCCCCCCCC--CCCCCCEEEC T ss_conf 79429998189-87322------67688898--36713277430201679999998--3875024623--5778734775 Q ss_pred CCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC-EEEEECCCCCEEEECCCCH Q ss_conf 252357764137996899862013302330867741154225036531688888988112-6688869998999848843 Q gi|254780971|r 120 VNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD-IAGVINRRGNVLGMMPHPE 198 (219) Q Consensus 120 ~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~-IAgi~s~~G~vlgmMPHPE 198 (219) ..+ +..++++...+...+|++||. -+.++-+.--+. .. |.+. .|||+++..+++|+.|||| T Consensus 125 ~~~-~~lF~~~~~~~~~~VlltHgd--------sl~~v~~g~kv~-a~--------s~n~~va~i~~e~kkiyglqfhpE 186 (552) T KOG1622 125 DDS-VDLFSGLHKTEFMTVLLTHGD--------SLSKVPEGFKVV-AF--------SGNKPVAGILNELKKIYGLQFHPE 186 (552) T ss_pred CCH-HHHHHHHCCCCEEEEEECCCC--------CHHHCCCCCEEE-EE--------ECCCCEEEEHHHHHHHHCCCCCCC T ss_conf 861-566543232410366100562--------064444542467-76--------168612211011003206788883 Q ss_pred HHCCCCCCCCCHHHHHHHHC Q ss_conf 43221006776289998642 Q gi|254780971|r 199 NIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 199 R~~~~~~~~~dG~~~f~~~~ 218 (219) -.. ...|+.+++|++ T Consensus 187 V~~-----t~~g~~ll~nFl 201 (552) T KOG1622 187 VTL-----TPNGKELLKNFL 201 (552) T ss_pred CCC-----CCCHHHHHHHHH T ss_conf 103-----675668999899 No 85 >PRK07053 glutamine amidotransferase; Provisional Probab=98.72 E-value=1.8e-07 Score=68.69 Aligned_cols=171 Identities=16% Similarity=0.181 Sum_probs=92.5 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEE--CCC---CCCCCCCEEEECCC-CCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 68998449706479999999985499639982--154---64456478997287-3454311406775200212122332 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVW--QSD---TDIPDVDLIVIPGG-FSYGDYLRCGAIAARTPVMQAIKKK 75 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~--~~~---~~l~~~d~lvipGG-FSygD~l~aG~i~~~~~~~~~i~~~ 75 (219) ||.||+-.-.-.---....+++ .|....++. ..+ .++.+||++|+-|| .|--|.-. --+. ....+-|+++ T Consensus 4 ~ilvlqH~~~E~pG~i~~~l~~-~g~~~~~~~~~~~~~~p~~~~~~d~livlGGpms~~d~~~-~Pwl--~~e~~lIr~a 79 (235) T PRK07053 4 TAVAIRHVAFEDLGSFEQVLGE-RGYRVRYVDVGVDDLEVLDALEPDLLVVLGGPIGVYDDAL-YPFL--APEIALLRQR 79 (235) T ss_pred EEEEEECCCCCCCHHHHHHHHH-CCCEEEEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCCC-CCCH--HHHHHHHHHH T ss_conf 3999968998898699999997-8995999967889889999666799999389989887766-8558--9999999999 Q ss_pred HHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECC--CHHHHHHHCCCCCEEEEEEECCCEEEEECHHH Q ss_conf 2059717860640310100000101101244332454225767525--23577641379968998620133023308677 Q gi|254780971|r 76 AQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVN--SNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKG 153 (219) Q Consensus 76 ~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~--~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~ 153 (219) .+.++|+||||-|.|+|.+. +.|...++ .+.|--|..+.... .+.+++..... .+.+..-|++. |..++ T Consensus 80 ~~~~~PvLGIClG~QLlA~A--lGG~V~~~--~~~EiG~~~v~~t~~~~~dpl~~~~~~--~~~v~~wH~D~-~~LP~-- 150 (235) T PRK07053 80 LAAGLPTLGICLGAQLIARA--LGARVYPG--AQKEIGWAPLTLTDAGRASPLRHLDGA--DTPVLHWHGDT-FDLPE-- 150 (235) T ss_pred HHCCCCEEEECHHHHHHHHH--HCCEEECC--CCCCEEEEEEEECCCCCCCCHHCCCCC--CCEEEEEECEE-EECCC-- T ss_conf 98699889984738999997--09778608--996005887785512357700014876--77279995408-53499-- Q ss_pred HHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHH Q ss_conf 41154225036531688888988112668886999899984884343 Q gi|254780971|r 154 LAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENI 200 (219) Q Consensus 154 l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~ 200 (219) +-..+-+.+ .-| +.++.- ..+++|++.|||=. T Consensus 151 -------ga~~LAss~-~c~------~Qaf~~-g~~~~g~QFHpE~t 182 (235) T PRK07053 151 -------GAILLASTP-ACR------NQAFAW-GNHVLALQFHPEIR 182 (235) T ss_pred -------CCEEEECCC-CCH------HHHEEE-CCCEEEEECCEECC T ss_conf -------838998289-985------896784-79689996584688 No 86 >cd01747 GATase1_Glutamyl_Hydrolase Type 1 glutamine amidotransferase (GATase1) domain found in gamma-Glutamyl Hydrolase. Type 1 glutamine amidotransferase (GATase1) domain found in gamma-Glutamyl Hydrolase. gamma-Glutamyl Hydrolase catalyzes the cleavage of the gamma-glutamyl chain of folylpoly-gamma-glutamyl substrates and is a central enzyme in folyl and antifolyl poly-gamma-glutamate metabolism. GATase activity involves the removal of the ammonia group from a glutamate molecule and its subsequent transfer to a specific substrate, thus creating a new carbon-nitrogen group on the substrate. gamma-Glutamyl hydrolases belong to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=98.66 E-value=1.5e-07 Score=69.21 Aligned_cols=190 Identities=17% Similarity=0.250 Sum_probs=97.1 Q ss_pred EEEEECCCCCCHHH----------HHHHHHHHCCCCEEEEECCCC------CCCCCCEEEECCCCCC---CCCCCHHHHH Q ss_conf 89984497064799----------999999854996399821546------4456478997287345---4311406775 Q gi|254780971|r 3 TAIVQIPGLNRDND----------MIKAITKIIGQSPILVWQSDT------DIPDVDLIVIPGGFSY---GDYLRCGAIA 63 (219) Q Consensus 3 vaVl~~pGsNcd~e----------~~~A~~~~~~~~~~~v~~~~~------~l~~~d~lvipGGFSy---gD~l~aG~i~ 63 (219) ++||.-|+..++.. =++-++ .+|..++-+|.+.. -+....||++|||=.+ ++|.++++ T Consensus 1 IGIltqp~~~~~~~~~~~sYIaASYVK~lE-saGArVVPI~~~~~~~~~~~l~~~INGvLfpGG~~~~~~s~y~~~~~-- 77 (273) T cd01747 1 IGILTQPVDGAGSNKTGHSYIAASYVKFLE-SAGARVVPIWINESEEYYDKLFKSINGILFPGGAVDIDTSGYARTAK-- 77 (273) T ss_pred CEEEECCCCCCCCCCCCCEEEEHHHHHHHH-HCCCEEEEEECCCCHHHHHHHHHHCCEEEECCCCCCCCCCCHHHHHH-- T ss_conf 948840167534566787476089899999-78986998746999899999987546799678662056670899999-- Q ss_pred HHHHHHHHHHHHHHCCC--EEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCC-------C Q ss_conf 20021212233220597--17860640310100000101101244332454225767525235776413799-------6 Q gi|254780971|r 64 ARTPVMQAIKKKAQQGI--KVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMN-------Q 134 (219) Q Consensus 64 ~~~~~~~~i~~~~~~g~--~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~-------~ 134 (219) .+++..++..++|. ||.|+|.|||+|+-+--=.-.+.......-.+..+++.-...+|.+++.+... + T Consensus 78 ---~i~~~A~e~Nd~G~yFPiwGtCLGFelL~~~~s~~~~~l~~~~s~n~~~pL~ft~~~~~Srlf~~~~~~l~~~l~~~ 154 (273) T cd01747 78 ---IIYNLALERNDAGDYFPVWGTCLGFELLTYLTSGETLLLEATEATNSALPLNFTEDALQSRLFKRFPPDLLKSLATE 154 (273) T ss_pred ---HHHHHHHHHHCCCCCCCEEEEHHHHHHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHC T ss_conf ---99999999850899563333303099999997399612310267773301014467644637651999999998507 Q ss_pred EEEEEEECCCEEEEECHHHH---HHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCC Q ss_conf 89986201330233086774---1154225036531688888988112668886999899984884343221006 Q gi|254780971|r 135 IIKCPVAHHDGNYFIDAKGL---AEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHG 206 (219) Q Consensus 135 ~l~~piaHgEGrf~~~~~~l---~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~ 206 (219) .+.+. .|.-| +.++.+ +.|.+.-+|...=.| .||-. =|+-|=..+=-+.|.+-|||+....|.. T Consensus 155 ~~t~n-~H~~g---itp~~f~~~~~L~~~f~ilstn~D---~~g~e-FVStiE~k~YP~yg~QfHPEK~~FEw~~ 221 (273) T cd01747 155 PLTMN-NHRYG---ISPENFTENGLLSDFFNVLTTNDD---WNGVE-FISTVEAYKYPIYGVQWHPEKNAFEWKK 221 (273) T ss_pred CCCHH-HHCCC---CCHHHHHHHHHHHHHHEEEEEEEC---CCCCE-EEEEEECCCCCEEEEEECCCCCCCCCCC T ss_conf 60043-30541---678887456888744265687765---99988-9886303247646674068757655577 No 87 >COG1492 CobQ Cobyric acid synthase [Coenzyme metabolism] Probab=98.51 E-value=2.3e-07 Score=67.90 Aligned_cols=88 Identities=28% Similarity=0.469 Sum_probs=60.1 Q ss_pred CEEEEEECCC-CC-CHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 9689984497-06-479999999985499639982154644564789972873454311406775200212122332205 Q gi|254780971|r 1 MKTAIVQIPG-LN-RDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ 78 (219) Q Consensus 1 mkvaVl~~pG-sN-cd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~ 78 (219) ++|||++.|- || -|.|...+. -+..+.++ ....+|.+.|+++|||==+-=..| ++..+..+-+++++++++ T Consensus 252 i~Iav~~lp~isNFtD~dpL~~~---~~v~v~~v-~~~~~l~~~dlvIlPGsk~t~~DL---~~lr~~g~d~~i~~~~~~ 324 (486) T COG1492 252 IRIAVIRLPRISNFTDFDPLRAE---PDVRVRFV-KPGSDLRDADLVILPGSKNTIADL---KILREGGMDEKILEYARK 324 (486) T ss_pred EEEEEECCCCCCCCCCCHHHHCC---CCEEEEEE-CCCCCCCCCCEEEECCCCCCHHHH---HHHHHCCHHHHHHHHHHC T ss_conf 49999657875564452146527---98079996-167778998889947985608899---999976979999999747 Q ss_pred CCEEEEECCCHHHHEEC Q ss_conf 97178606403101000 Q gi|254780971|r 79 GIKVMGICNGFQILVEL 95 (219) Q Consensus 79 g~~vLGICNGfQiL~el 95 (219) ++++||||.|||||.+. T Consensus 325 ~~~viGICGG~QmLG~~ 341 (486) T COG1492 325 GGDVIGICGGYQMLGRR 341 (486) T ss_pred CCCEEEECCHHHHHHHH T ss_conf 99799986357764334 No 88 >COG0512 PabA Anthranilate/para-aminobenzoate synthases component II [Amino acid transport and metabolism / Coenzyme metabolism] Probab=98.50 E-value=3.3e-06 Score=60.58 Aligned_cols=168 Identities=18% Similarity=0.311 Sum_probs=89.0 Q ss_pred HHHHHHHHHHCCCCEEEEECCC-CC-----CCCCCEEEECCCCCCCCCCCHHHHHHHHH-HHHHHHHHHHCCCEEEEECC Q ss_conf 9999999985499639982154-64-----45647899728734543114067752002-12122332205971786064 Q gi|254780971|r 15 NDMIKAITKIIGQSPILVWQSD-TD-----IPDVDLIVIPGGFSYGDYLRCGAIAARTP-VMQAIKKKAQQGIKVMGICN 87 (219) Q Consensus 15 ~e~~~A~~~~~~~~~~~v~~~~-~~-----l~~~d~lvipGGFSygD~l~aG~i~~~~~-~~~~i~~~~~~g~~vLGICN 87 (219) +-.+..|.. .+.++ .|+++| .+ -.++|.||+-=| .|.=. ... ..+.++++ ....|+||||- T Consensus 15 yNLv~yl~~-lg~~v-~V~rnd~~~~~~~~~~~pd~iviSPG--------PG~P~-d~G~~~~~i~~~-~~~~PiLGVCL 82 (191) T COG0512 15 YNLVQYLRE-LGAEV-TVVRNDDISLELIEALKPDAIVISPG--------PGTPK-DAGISLELIRRF-AGRIPILGVCL 82 (191) T ss_pred HHHHHHHHH-CCCCE-EEEECCCCCHHHHHHCCCCEEEECCC--------CCCHH-HCCHHHHHHHHH-CCCCCEEEECC T ss_conf 999999997-39934-99988965877886049998998489--------97937-736189999986-08998899874 Q ss_pred CHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEEC Q ss_conf 03101000001011012443324542257675252357764137996899862013302330867741154225036531 Q gi|254780971|r 88 GFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRY 167 (219) Q Consensus 88 GfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y 167 (219) |.|.+.+. ..|.+.+-..-. --.++ .+..+.+..++++... +.+-=.|. .+++++. +.+.=.|.... T Consensus 83 GHQai~~~--fGg~V~~a~~~~--HGK~s-~i~h~g~~iF~glp~~--f~v~RYHS---Lvv~~~~---lP~~l~vtA~~ 149 (191) T COG0512 83 GHQAIAEA--FGGKVVRAKEPM--HGKTS-IITHDGSGLFAGLPNP--FTVTRYHS---LVVDPET---LPEELEVTAES 149 (191) T ss_pred CHHHHHHH--HCCEEEECCCCC--CCEEE-EEECCCCCCCCCCCCC--CEEEEEEE---EEECCCC---CCCCEEEEEEE T ss_conf 37899997--397887559776--78125-5402996001579998--88886278---9844787---99833899996 Q ss_pred CCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 6888889881126688869998999848843432210067762899986429 Q gi|254780971|r 168 ASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 168 ~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) .|+ .-|-|+-.++..+.|++.|||-.. ++.|.++++|++. T Consensus 150 ~d~-------~~IMai~h~~~pi~gvQFHPESil-----T~~G~~il~Nfl~ 189 (191) T COG0512 150 EDG-------GVIMAVRHKKLPIYGVQFHPESIL-----TEYGHRILENFLR 189 (191) T ss_pred CCC-------CEEEEEEECCCCEEEEECCCCCCC-----CCCHHHHHHHHHH T ss_conf 899-------879888608998788820871036-----6214899999984 No 89 >TIGR01382 PfpI intracellular protease, PfpI family; InterPro: IPR006286 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad . This group of cysteine peptidases belong to MEROPS peptidase family C56 (Pfp1 endopeptidase, clan PC(C)). The member of this family from Pyrococcus horikoshii has been solved to 2 Angstrom resolution. It is an ATP-independent intracellular protease that crystallises as a hexameric ring. Cys-101 is proposed as the active site residue in a catalytic triad with the adjacent His-102 and a Glu residue from an adjacent monomer. A member of this family from Bacillus subtilis, GSP18, has been shown to be expressed in response to several forms of stress. A role in the degradation of small peptides has been suggested. This family is contained in a larger one of the thiamine biosynthesis protein ThiJ and its homologs.; GO: 0016798 hydrolase activity acting on glycosyl bonds. Probab=98.47 E-value=1.4e-07 Score=69.33 Aligned_cols=60 Identities=37% Similarity=0.662 Sum_probs=53.1 Q ss_pred CCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHH Q ss_conf 46445647899728734543114067752002121223322059717860640310100000101 Q gi|254780971|r 36 DTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPG 100 (219) Q Consensus 36 ~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg 100 (219) +.+-.+||+|+||||-+-.|||| .++.+..-+++|.++||+|-.||.|-|+|+..|.|-| T Consensus 71 ~v~~~~YDal~ipGG~a~~e~LR-----~d~~~~~lvR~f~e~gK~vaaIChgp~lLi~A~VlrG 130 (189) T TIGR01382 71 DVDPEDYDALVIPGGRASPEYLR-----LDEKVIRLVREFVEKGKPVAAICHGPQLLISAGVLRG 130 (189) T ss_pred CCCHHHCCEEEEECCCCCCCCCC-----CCHHHHHHHHHHHHCCCEEEEEECCCEEEEECCEECC T ss_conf 47852564789717735600014-----8768999999984178819998453000002551037 No 90 >PRK13896 cobyrinic acid a,c-diamide synthase; Provisional Probab=98.45 E-value=7.6e-06 Score=58.23 Aligned_cols=171 Identities=19% Similarity=0.322 Sum_probs=98.3 Q ss_pred EEEEEECCCCC-CHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 68998449706-47999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 2 KTAIVQIPGLN-RDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 2 kvaVl~~pGsN-cd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) ||||-+-+-=+ .=.|....++. ++.-+.+--..|..|++.|+|.||||| -.+-+...+.+.. ++.+++++++|. T Consensus 234 rIavA~D~AF~FyY~dnl~~L~~-~aelv~FSPl~D~~LP~~dglylgGGy---PEl~a~~Ls~n~~-l~si~~~a~~G~ 308 (432) T PRK13896 234 TVAVADDSAFCFRYPATRERLRE-RADVVTFSPVAGDPLPDCDGVYLPGGY---PELHGEALADGPA-LDELGARAADGL 308 (432) T ss_pred EEEEEECCCCCCCCHHHHHHHHH-CCCEEEECCCCCCCCCCCCEEEECCCC---CHHHHHHHHCCHH-HHHHHHHHHCCC T ss_conf 69999758627017899999984-788898766357989988989847998---0378999860878-999999998699 Q ss_pred EEEEECCCHHHHEE------------CCCHHHHHCC-CCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE Q ss_conf 17860640310100------------0001011012-4433245422576752523577641379968998620133023 Q gi|254780971|r 81 KVMGICNGFQILVE------------LNLLPGILMR-NCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY 147 (219) Q Consensus 81 ~vLGICNGfQiL~e------------lGLlPg~l~~-N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf 147 (219) |+++-|.|+..|++ .|+||+.-.. ++-.+| -++.+++.. .++|.+ .|+.+.-+=-| | T Consensus 309 PiyAECGGlMyL~~~l~~~~g~~~~M~Gvlp~~~~m~~r~~~l--GY~~~~~~~-~~~~~~---~G~~~rGHEFH----Y 378 (432) T PRK13896 309 PVLGECGGLMALAESLTTTDGDTYEMAGVLPADVRMQDRYQAL--DHVELRATG-DTLTAG---SGAHLRGHEFH----Y 378 (432) T ss_pred CEEEECCCHHHHHHHHCCCCCCEEEEEEEECEEEEECCCCCCC--CCEEEEECC-CCCCCC---CCCEEEEECCC----C T ss_conf 7899873089984351657998877251354288977785544--755999637-862468---99889885752----7 Q ss_pred EECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCC Q ss_conf 30867741154225036531688888988112668886999899984884 Q gi|254780971|r 148 FIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHP 197 (219) Q Consensus 148 ~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHP 197 (219) .. . ........+++......-++. -.|++ .+|++|.-.|- T Consensus 379 S~----~-~~~~~~~~~~~~~rg~g~~~~---~dG~~--~~n~~asY~Hl 418 (432) T PRK13896 379 SA----A-TVASDARFAFAVERGDGIDGD---HDGLT--EYRTLGTYAHV 418 (432) T ss_pred CC----C-CCCCCCCEEEEEECCCCCCCC---CCEEE--ECCEEEEEEEC T ss_conf 41----7-558988546898616788888---57687--38999986332 No 91 >cd03134 GATase1_PfpI_like A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. This group includes proteins similar to PfpI from P. furiosus. and PH1704 from Pyrococcus horikoshii. These enzymes are ATP-independent intracellular proteases and may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For PH1704, it is believed that this Cys together with a different His in one monomer and Glu (from an adjacent monomer) forms a different catalytic triad from the typical GATase1domain. PfpI is homooligomeric. Protease activity is only found for oligomeric forms of PH1704. Probab=98.41 E-value=1.5e-06 Score=62.77 Aligned_cols=92 Identities=25% Similarity=0.449 Sum_probs=66.8 Q ss_pred EEEEEECCCCCCHHHH---HHHHHHHCCCCEEEEECC---------------------CCCCCCCCEEEECCCCCCCCCC Q ss_conf 6899844970647999---999998549963998215---------------------4644564789972873454311 Q gi|254780971|r 2 KTAIVQIPGLNRDNDM---IKAITKIIGQSPILVWQS---------------------DTDIPDVDLIVIPGGFSYGDYL 57 (219) Q Consensus 2 kvaVl~~pGsNcd~e~---~~A~~~~~~~~~~~v~~~---------------------~~~l~~~d~lvipGGFSygD~l 57 (219) ||+|+.++|.. |.|. ...|.+ +|.++.++..+ +.+..+||+|++|||.+ T Consensus 1 KV~il~~~Gfe-e~E~~~p~~~l~~-aG~~V~~~s~~~~~~v~~~~g~~~v~~d~~l~~~~~~~yD~ivlPGG~g----- 73 (165) T cd03134 1 KVAILAADGFE-DVELTYPLYRLRE-AGAEVVVAGPEAGGEIQGKHGYDTVTVDLTIADVDADDYDALVIPGGTN----- 73 (165) T ss_pred CEEEEECCCCH-HHHHHHHHHHHHH-CCCEEEEEEECCCCCEEECCCCCEEECCCCHHHCCHHHCCEEEECCCCC----- T ss_conf 99999269820-9999999999997-8997999997499856832897268678657998962288899789858----- Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHH Q ss_conf 40677520021212233220597178606403101000001011 Q gi|254780971|r 58 RCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGI 101 (219) Q Consensus 58 ~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~ 101 (219) +-.+.....+.+.++++.++++++-.||.|-++|.+.|||.|. T Consensus 74 -~~~l~~~~~~~~~i~~~~~~~k~iaaIC~gp~~L~~~gll~g~ 116 (165) T cd03134 74 -PDKLRRDPDAVAFVRAFAEAGKPVAAICHGPWVLISAGVVRGR 116 (165) T ss_pred -HHHHHHCHHHHHHHHHHHHHCCCEEEECCHHHHHHHCCCCCCC T ss_conf -8888529899999999997199299998757877457825898 No 92 >cd03135 GATase1_DJ-1 Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1. Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1. DJ-1 is involved in multiple physiological processes including cancer, Parkinson's disease and male fertility. It is unclear how DJ-1 functions in these. DJ-1 has been shown to possess chaperone activity. DJ-1 is preferentially expressed in the testis and moderately in other tissues; it is induced together with genes involved in oxidative stress response. The Drosophila homologue (DJ-1A) plays an essential role in oxidative stress response and neuronal maintenance. Inhibition of DJ-1A function through RNAi, results in the cellular accumulation of reactive oxygen species, organismal hypersensitivity to oxidative stress, and dysfunction and degeneration of dopaminergic and photoreceptor neurons. DJ-1 has lacks enzymatic activity and the catalytic triad of typical GATase1 domains, however it does contain the highly Probab=98.40 E-value=1.4e-06 Score=62.93 Aligned_cols=92 Identities=23% Similarity=0.421 Sum_probs=67.0 Q ss_pred EEEEECCCCCCHHHHH---HHHHHHCCCCEEEEECC--------------------CCCCCCCCEEEECCCCCCCCCCCH Q ss_conf 8998449706479999---99998549963998215--------------------464456478997287345431140 Q gi|254780971|r 3 TAIVQIPGLNRDNDMI---KAITKIIGQSPILVWQS--------------------DTDIPDVDLIVIPGGFSYGDYLRC 59 (219) Q Consensus 3 vaVl~~pGsNcd~e~~---~A~~~~~~~~~~~v~~~--------------------~~~l~~~d~lvipGGFSygD~l~a 59 (219) |+||.++|. -|.|.. .+|.+ ++..+.++... +.+.++||.+++|||...-+.+ T Consensus 1 V~il~~dgf-~~~e~~~p~~vl~~-ag~~v~~~s~~~~~~v~~~~g~~i~~d~~l~~~~~~~~D~liipGG~~~~~~l-- 76 (163) T cd03135 1 VLVILADGF-EEIEAVTPVDVLRR-AGIEVTTASLEKKLAVGSSHGIKVKADKTLSDVNLDDYDAIVIPGGLPGAQNL-- 76 (163) T ss_pred CEEEECCCC-CHHHHHHHHHHHHH-CCCEEEEEEECCCCEEECCCCCEEECCCCHHHCCCCCCCEEEECCCCCHHHHH-- T ss_conf 999966995-77999999999997-99979999947995287379978946876655782017989986898217677-- Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHH Q ss_conf 677520021212233220597178606403101000001011 Q gi|254780971|r 60 GAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGI 101 (219) Q Consensus 60 G~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~ 101 (219) .....+.+.++++.++++++-+||.|-++|.+.|||.|. T Consensus 77 ---~~~~~~~~~i~~~~~~~k~i~aiC~G~~~La~agll~gr 115 (163) T cd03135 77 ---ADNEKLIKLLKEFNAKGKLIAAICAAPAVLAKAGLLKGK 115 (163) T ss_pred ---HHCHHHHHHHHHHHHHCCEEEEECHHHHHHHHCCCCCCC T ss_conf ---669999999999998299479864548999986998998 No 93 >TIGR01368 CPSaseIIsmall carbamoyl-phosphate synthase, small subunit; InterPro: IPR006274 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents the small subunit of the glutamine-dependent form (6.3.5.5 from EC) of carbamoyl phosphate synthase, CPSase II. The small subunit catalyses the hydrolysis of glutamine to ammonia, which in turn used by the large chain to synthesize carbamoyl phosphate. The C-terminal domain of the small subunit of CPSase has glutamine amidotransferase activity. Note that the sequence from the mammalian urea cycle form has lost the active site Cys, resulting in an ammonia-dependent form, CPSase I (6.3.4.16 from EC). CPSases of pyrimidine biosynthesis, arginine biosynthesis, and the urea cycle may be encoded by one or by several genes, depending on the species. ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=98.36 E-value=4.3e-06 Score=59.83 Aligned_cols=175 Identities=22% Similarity=0.354 Sum_probs=104.7 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEEC-CCC-CCC-CC-CEEEECCCCCCCCCCCHHHH-HHHHHHHHHHHHHH Q ss_conf 689984497064799999999854996399821-546-445-64-78997287345431140677-52002121223322 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQ-SDT-DIP-DV-DLIVIPGGFSYGDYLRCGAI-AARTPVMQAIKKKA 76 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~-~~~-~l~-~~-d~lvipGGFSygD~l~aG~i-~~~~~~~~~i~~~~ 76 (219) ||+|+-| |. =+...+.|.+ -|+++++|=. .+. ++. -. |||.|=.| .|-= ++-....+.+++.+ T Consensus 198 ~Vv~~Df-G~--K~nIlr~L~~-rG~~v~vVP~~~~~~~i~~~~PDGiflSNG--------PGDPH~~~~~~i~~~~~l~ 265 (383) T TIGR01368 198 RVVVIDF-GV--KQNILRRLVK-RGCEVTVVPYDTDAEEIKKYNPDGIFLSNG--------PGDPHAAVEPAIETVRELL 265 (383) T ss_pred EEEEEEC-CC--CHHHHHHHHH-CCCEEEEECCCCCHHHHHHHCCCEEEEECC--------CCCCHHHHHHHHHHHHHHH T ss_conf 9999855-88--7558989976-698799967899889998657888999087--------9880567899999999998 Q ss_pred HCCCEEEEECCCHHHHEEC-CCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHH Q ss_conf 0597178606403101000-001011012443324542257675252357764137996899862013302330867741 Q gi|254780971|r 77 QQGIKVMGICNGFQILVEL-NLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLA 155 (219) Q Consensus 77 ~~g~~vLGICNGfQiL~el-GLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~ 155 (219) ++.+|++|||-|.|||... | +=+ -..||=-|=.|-=|. ++..| .+.|=. +..| |.+|++. T Consensus 266 ~~~~P~fGICLGHQllALA~G----a~T--yKlKFGHRG~NhPV~--------~~~~g-~v~ITs-QNHG-yAVD~~s-- 326 (383) T TIGR01368 266 EAKIPIFGICLGHQLLALAFG----AKT--YKLKFGHRGGNHPVK--------DLRTG-RVEITS-QNHG-YAVDEES-- 326 (383) T ss_pred HCCCCCCCCCHHHHHHHHHCC----CCE--EECCCCCCCCCCCEE--------ECCCC-EEEEEE-ECCC-EECCHHH-- T ss_conf 448971002666899999738----870--004656788751314--------42798-799997-0687-1257535-- Q ss_pred HHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 156 EIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 156 ~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) |..+...+... |.| ...++||..++..++.-+-|||-+= |--|-..||..++ T Consensus 327 -l~~~~~~~th~----nLN--Dgt~EG~~h~~~P~fSVQyHPEA~P----GPhD~~~lFd~F~ 378 (383) T TIGR01368 327 -LLAGDLEVTHV----NLN--DGTVEGIRHKDLPVFSVQYHPEASP----GPHDTEYLFDEFI 378 (383) T ss_pred -CCCCCEEEEEE----ECC--CCCCCCCCCCCCCEEEEECCCCCCC----CCHHHHHHHHHHH T ss_conf -57885389998----359--7933153005878667700688888----7002379999999 No 94 >PRK12564 carbamoyl phosphate synthase small subunit; Reviewed Probab=98.36 E-value=9.2e-06 Score=57.70 Aligned_cols=122 Identities=21% Similarity=0.333 Sum_probs=69.7 Q ss_pred HHHHHHHHHHHHCCCEEEEECCCHHHHEEC-CCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCC Q ss_conf 021212233220597178606403101000-0010110124433245422576752523577641379968998620133 Q gi|254780971|r 66 TPVMQAIKKKAQQGIKVMGICNGFQILVEL-NLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHD 144 (219) Q Consensus 66 ~~~~~~i~~~~~~g~~vLGICNGfQiL~el-GLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgE 144 (219) ....+.+++.++...|++|||-|-|+|... |.=- ...+|--|=.+--|. .+..+++ .+- ++.. T Consensus 229 ~~~i~~ik~li~~~~PifGICLGHQllalA~Ga~t------~Km~fGHrG~NhPV~--------~l~t~~v-~iT-SQNH 292 (355) T PRK12564 229 DYAIEMIKELLEGKIPIFGICLGHQLLALALGAKT------YKMKFGHRGANHPVK--------DLETGKV-EIT-SQNH 292 (355) T ss_pred HHHHHHHHHHHCCCCCEEEECHHHHHHHHHCCCEE------CCCCCCCCCCCCCCE--------ECCCCEE-EEE-ECCC T ss_conf 99999999985269878998887999999719844------205556578997733--------5578739-997-1674 Q ss_pred EEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 02330867741154225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 145 GNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 145 Grf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) | |.++++.+ .+ + +-..|. |.| ...|.||...+..++..+.|||-+- |..|...||..++ T Consensus 293 G-yaVd~~sl---~~-~-~~vt~~---nln--DgtvEGi~h~~~p~fSVQfHPEa~p----GP~D~~~lFd~F~ 351 (355) T PRK12564 293 G-FAVDEDSL---PE-N-LEVTHV---NLN--DGTVEGLRHKDLPAFSVQYHPEASP----GPHDAAYLFDEFV 351 (355) T ss_pred C-EEECCCCC---CC-C-EEEEEE---ECC--CCCCCCEEECCCCEEEEEECCCCCC----CCCCCHHHHHHHH T ss_conf 2-28666658---98-7-399999---389--9967217888998789721889998----9975388999999 No 95 >cd03169 GATase1_PfpI_1 Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. PfpI is an ATP-independent intracellular proteases which may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=98.31 E-value=3.6e-06 Score=60.31 Aligned_cols=92 Identities=28% Similarity=0.450 Sum_probs=66.9 Q ss_pred EEEEEECCCCCCHHHH---HHHHHHHCCCCEEEEEC-----------------------------------CCCCCCCCC Q ss_conf 6899844970647999---99999854996399821-----------------------------------546445647 Q gi|254780971|r 2 KTAIVQIPGLNRDNDM---IKAITKIIGQSPILVWQ-----------------------------------SDTDIPDVD 43 (219) Q Consensus 2 kvaVl~~pGsNcd~e~---~~A~~~~~~~~~~~v~~-----------------------------------~~~~l~~~d 43 (219) ||+|+...|. =|.|. ..+|.+ +|.++.++-- ++.+..+|| T Consensus 1 kvlil~adg~-Ee~E~~~P~~~l~~-aG~eV~vaSp~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~dv~~~~yD 78 (180) T cd03169 1 KILILTGDFV-EDYEVMVPFQALQE-VGHEVDVVAPGKKKGDTVVTAIHDFPGWQTYTEKPGHRFAVTADFDEVDPDDYD 78 (180) T ss_pred CEEEECCCCC-HHHHHHHHHHHHHH-CCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCEEECCCCHHHCCHHHCC T ss_conf 9899947883-07899999999998-899899995688678744432234566320002478254267876789861099 Q ss_pred EEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHH Q ss_conf 8997287345431140677520021212233220597178606403101000001011 Q gi|254780971|r 44 LIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGI 101 (219) Q Consensus 44 ~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~ 101 (219) +|++|||-+ .|+ +.....+.+-++++.++|+++=.||-|-++|...|||.|. T Consensus 79 ai~iPGG~g-~~~-----l~~d~~~~~li~~~~~~gK~iaaIChgp~~L~~agll~Gk 130 (180) T cd03169 79 ALVIPGGRA-PEY-----LRLDEKVLAIVRHFAEANKPVAAICHGPQILAAAGVLKGR 130 (180) T ss_pred EEEECCCCC-HHH-----HHCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHCCCCCCC T ss_conf 999899822-877-----7119799999999998199699778168999876965898 No 96 >KOG1224 consensus Probab=98.30 E-value=8.5e-06 Score=57.91 Aligned_cols=171 Identities=20% Similarity=0.333 Sum_probs=87.5 Q ss_pred HHHHHHHHHCCCCEEEEECCCC----------CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 9999999854996399821546----------445647899728734543114067752002121223322059717860 Q gi|254780971|r 16 DMIKAITKIIGQSPILVWQSDT----------DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGI 85 (219) Q Consensus 16 e~~~A~~~~~~~~~~~v~~~~~----------~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGI 85 (219) ....++++..+...+++.|.+. ++.-||.||+.-| .++-..|+...+...+... .++.|+||| T Consensus 29 Niy~ll~~~~~vp~V~~vh~~~~~~d~~~~l~q~~~FDaIVVgPG------PG~P~~a~d~gI~~rl~~~-~~~iPilGI 101 (767) T KOG1224 29 NIYQLLSTINGVPPVVIVHDEWTWEDAYHYLYQDVAFDAIVVGPG------PGSPMCAADIGICLRLLLE-CRDIPILGI 101 (767) T ss_pred HHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHCCCCCEEEECCC------CCCCCCHHHHHHHHHHHHH-CCCCCEEEE T ss_conf 399999874179967998343237789998864002244896689------9798747887899999986-578864654 Q ss_pred CCCHHHHEECCCHHHHH--CCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE---EECHHHHHHHHHC Q ss_conf 64031010000010110--124433245422576752523577641379968998620133023---3086774115422 Q gi|254780971|r 86 CNGFQILVELNLLPGIL--MRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY---FIDAKGLAEIEKN 160 (219) Q Consensus 86 CNGfQiL~elGLlPg~l--~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf---~~~~~~l~~l~~~ 160 (219) |-|||.| +|..|+- ..|.-.+ -|-.. +.-+.+..+.++..|. |.+-.-=|| +++....+.| T Consensus 102 CLGfQal---~l~hGA~v~~~n~p~H--Grvs~--i~~~~~~~f~gi~sg~----~~~fK~~RYHSL~in~~pid~l--- 167 (767) T KOG1224 102 CLGFQAL---GLVHGAHVVHANEPVH--GRVSG--IEHDGNILFSGIPSGR----NSDFKVVRYHSLIINSLPIDLL--- 167 (767) T ss_pred EHHHHHH---HHHCCCCEECCCCCCC--CEEEE--EEECCCEEECCCCCCC----CCCCEEEEEEEEEECCCCHHHH--- T ss_conf 3214767---6531661103787643--24666--7726847871577788----6644047767787257761121--- Q ss_pred CEEE-EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHCC Q ss_conf 5036-5316888889881126688869998999848843432210067762899986429 Q gi|254780971|r 161 NQIV-FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLLT 219 (219) Q Consensus 161 ~~i~-~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~s 219 (219) +|+ ..|-| ||-+ .-+|...+--|+|+.-|||-.. ++-|..+|+|+++ T Consensus 168 -~il~t~~dd----ng~i--lMsi~~~~fPhfG~qyHPES~~-----s~~g~~lfkNFl~ 215 (767) T KOG1224 168 -PILWTIYDD----NGHI--LMSIMHSSFPHFGLQYHPESIA-----STYGSQLFKNFLD 215 (767) T ss_pred -CCEEEEECC----CCEE--EEEEECCCCCCCCEEECHHHHH-----HHHHHHHHHHHHH T ss_conf -201376068----8628--9875315788534233707753-----1345799987777 No 97 >PRK12838 carbamoyl phosphate synthase small subunit; Reviewed Probab=98.29 E-value=1.8e-05 Score=55.77 Aligned_cols=174 Identities=22% Similarity=0.376 Sum_probs=86.1 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECC-C---CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 6899844970647999999998549963998215-4---64456478997287345431140677520021212233220 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQS-D---TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQ 77 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~-~---~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~ 77 (219) ||+++-| | .-+-..+.|.+. +.++.++=.+ + ..-.+.|+|+|-.|= ||. ++-....+.+++.+. T Consensus 169 ~V~viD~-G--vK~nIlr~L~~r-g~~V~VvP~~~~~~~i~~~~pDGiflSNGP--GDP------~~~~~~i~~vr~l~~ 236 (356) T PRK12838 169 HVALIDF-G--LKHSILRSLSKR-GCNVTVLPYNASLEDIKNLNPDGIVLSNGP--GDP------KELQPYLPTIKDLAS 236 (356) T ss_pred EEEEEEC-C--CHHHHHHHHHHC-CCEEEEECCCCCCCHHHCCCCCEEEECCCC--CCH------HHHHHHHHHHHHHHC T ss_conf 7999958-8--369999999978-988999899887540232497489943899--896------887889999999974 Q ss_pred CCCEEEEECCCHHHHEEC-CCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHH Q ss_conf 597178606403101000-0010110124433245422576752523577641379968998620133023308677411 Q gi|254780971|r 78 QGIKVMGICNGFQILVEL-NLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAE 156 (219) Q Consensus 78 ~g~~vLGICNGfQiL~el-GLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~ 156 (219) ..|++|||-|-|||.-. |.=- ...+|--|=.+-=|. .+..+++. +- ++..| |.++++.| T Consensus 237 -~~PifGICLGHQllalA~Gakt------~KmkfGHrG~NhPV~--------~~~tgkv~-IT-SQNHG-yaVd~~sl-- 296 (356) T PRK12838 237 -SYPILGICLGHQLIALALGAET------YKLPFGHRGANHPVI--------DLATGRVW-MT-SQNHG-YVVDEDSI-- 296 (356) T ss_pred -CCCEEEECHHHHHHHHHHCCEE------EECCCCCCCCCCCCE--------ECCCCCEE-EE-ECCCC-CEECCCCC-- T ss_conf -9888997488999999709748------641556688782647--------76899489-97-26876-16776768-- Q ss_pred HHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 54225036531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 157 IEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 157 l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .+.+ +-..+. |.| ...|+||..++..++..+.|||-+- |-.|...||+.++ T Consensus 297 -~~~~-~~vt~~---nln--D~tveGi~~~~~p~fsVQfHPEa~P----GP~D~~~lF~~F~ 347 (356) T PRK12838 297 -DEKQ-LSVTFF---NLN--DGSIEGLRHKKLPVLSVQFHPEAHP----GPHDAEYIFDEFL 347 (356) T ss_pred -CCCC-EEEEEE---CCC--CCCEEEEEECCCCEEEEEECCCCCC----CCCCCHHHHHHHH T ss_conf -9886-599999---089--9966107988998688712889998----9874388999999 No 98 >cd03140 GATase1_PfpI_3 Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. PfpI is an ATP-independent intracellular proteases which may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=98.23 E-value=3.5e-06 Score=60.37 Aligned_cols=91 Identities=21% Similarity=0.284 Sum_probs=63.9 Q ss_pred EEEEECCCCCCHHHHH---HHHHHHCCCCEEEEECCC-------------------CCCCCCCEEEECCCCCCCCCCCHH Q ss_conf 8998449706479999---999985499639982154-------------------644564789972873454311406 Q gi|254780971|r 3 TAIVQIPGLNRDNDMI---KAITKIIGQSPILVWQSD-------------------TDIPDVDLIVIPGGFSYGDYLRCG 60 (219) Q Consensus 3 vaVl~~pGsNcd~e~~---~A~~~~~~~~~~~v~~~~-------------------~~l~~~d~lvipGGFSygD~l~aG 60 (219) |+|+-.+|. .|.|.. ..+.+..++++..+-.+. ....+||.+++|||.++.. T Consensus 1 V~v~l~dGf-Ee~E~~~~~~~Lrr~~~~~v~~~s~~~~~V~~~~G~~i~~D~~l~~v~~~~~D~lvlPGG~~~~~----- 74 (170) T cd03140 1 IAVFLTDEF-ADWEGAYLAALLNSYEGFEVRTVSPTGEPVTSIGGLRVVPDYSLDDLPPEDYDLLILPGGDSWDN----- 74 (170) T ss_pred CEEEECCCH-HHHHHHHHHHHHHCCCCCEEEEEECCCCEEEECCCCEEECCCCHHHCCCCCCCEEEECCCCCCCC----- T ss_conf 989976975-08999999999963699889999789995886799479567586888902177899868875100----- Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHH Q ss_conf 77520021212233220597178606403101000001011 Q gi|254780971|r 61 AIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGI 101 (219) Q Consensus 61 ~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~ 101 (219) .....+.+.++++.++|+++-+||.|-++|.+.|||.|. T Consensus 75 --~~~~~l~~~l~~~~~~gk~iaaIC~g~~vLa~aGlL~g~ 113 (170) T cd03140 75 --PEAPDLAGLVRQALKQGKPVAAICGATLALARAGLLNNR 113 (170) T ss_pred --CCCHHHHHHHHHHHHHCCCEEEECHHHHHHHHCCCCCCC T ss_conf --249999999999987499099986558999985986896 No 99 >CHL00197 carA carbamoyl-phosphate synthase arginine-specific small subunit; Provisional Probab=98.23 E-value=4.3e-05 Score=53.45 Aligned_cols=146 Identities=23% Similarity=0.354 Sum_probs=77.6 Q ss_pred CCCEEEECCCCCCCCCCC-----------HHHHHHHHHHHHHHHHHHHCC-CEEEEECCCHHHHEEC-CCHHHHHCCCCC Q ss_conf 647899728734543114-----------067752002121223322059-7178606403101000-001011012443 Q gi|254780971|r 41 DVDLIVIPGGFSYGDYLR-----------CGAIAARTPVMQAIKKKAQQG-IKVMGICNGFQILVEL-NLLPGILMRNCS 107 (219) Q Consensus 41 ~~d~lvipGGFSygD~l~-----------aG~i~~~~~~~~~i~~~~~~g-~~vLGICNGfQiL~el-GLlPg~l~~N~s 107 (219) +++..|+|--.++.+.+. .|-=+.-....+.+++.++++ .|++|||-|-|||.-. |.=- .. T Consensus 216 g~~v~VvP~~~~~~~I~~~~PDGIfLSNGPGDP~~~~~~i~~i~~li~~~~iPifGICLGHQlLalA~GakT------~K 289 (383) T CHL00197 216 GCEITIVPATSTYQDIMAYQPDGILLSNGPGDPSTVHYGIKTVKKLIKKANIPIFGICMGHQILSLALEAKT------FK 289 (383) T ss_pred CCEEEEECCCCCCCHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECHHHHHHHHHCCCEE------EE T ss_conf 998999899985113425688889978999997898999999999996489978995588999999739837------56 Q ss_pred CCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCC Q ss_conf 32454225767525235776413799689986201330233086774115422503653168888898811266888699 Q gi|254780971|r 108 LKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRR 187 (219) Q Consensus 108 ~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~ 187 (219) .+|--|=.+- |-. ..+++ .+ .++..| |.++++.| .++. +-..|. |.| ...|+||...+ T Consensus 290 mkfGHrG~Nh-------PV~---~t~kv-~I-TSQNHG-yaVd~~sl---~~~~-~~vt~~---nln--D~tvEG~~~~~ 347 (383) T CHL00197 290 LKFGHRGLNH-------PSG---LNQQV-EI-TSQNHG-FAVNAESL---AKNK-VYVTHF---NLN--DGTVAGISHSP 347 (383) T ss_pred CCCCCCCCCC-------CCC---CCCCE-EE-EECCCC-EEECCCCC---CCCC-CEEEEE---CCC--CCCCCCEEECC T ss_conf 6888779886-------601---58757-99-627876-17674447---8873-189999---388--89614869889 Q ss_pred CCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 9899984884343221006776289998642 Q gi|254780971|r 188 GNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 188 G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) -.++..+.|||-+- |-.|...||.-++ T Consensus 348 ~p~fsvQfHPEa~p----GP~Ds~~lF~~F~ 374 (383) T CHL00197 348 KPYFSVQYHPEASP----GPHDADYLFEYFI 374 (383) T ss_pred CCEEEECCCCCCCC----CCCCCHHHHHHHH T ss_conf 98788774889898----9873088999999 No 100 >PRK11574 hypothetical protein; Provisional Probab=98.15 E-value=1.3e-05 Score=56.84 Aligned_cols=92 Identities=21% Similarity=0.359 Sum_probs=65.6 Q ss_pred EEEEEECCCCCCHHHH---HHHHHHHCCCCEEEEEC-C---------------------CCCCCCCCEEEECCCCCCCCC Q ss_conf 6899844970647999---99999854996399821-5---------------------464456478997287345431 Q gi|254780971|r 2 KTAIVQIPGLNRDNDM---IKAITKIIGQSPILVWQ-S---------------------DTDIPDVDLIVIPGGFSYGDY 56 (219) Q Consensus 2 kvaVl~~pGsNcd~e~---~~A~~~~~~~~~~~v~~-~---------------------~~~l~~~d~lvipGGFSygD~ 56 (219) ||.|+..+|.- |.|. .-.|.| +|.++.++-. + +.+..+||+|+||||+.- T Consensus 4 k~LV~lA~GfE-EiEa~~~vDvLRR-agi~V~~asv~~~~~~~v~gs~gi~i~aD~~l~~v~~~~yD~ivlPGG~~g--- 78 (196) T PRK11574 4 SALVCLAPGSE-ETEAVTTIDLLVR-GGIKVTTASVASDGNLTITCSRGVKLLADAPLVEVADGEYDVIVLPGGIKG--- 78 (196) T ss_pred EEEEECCCCCC-HHHHHHHHHHHHH-CCCEEEEEEECCCCCCEEEECCCCEEECCCCHHHCCCCCCCEEECCCCCHH--- T ss_conf 69999169850-2125189999977-798499998538997379947997895786777778255888991788037--- Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHH-HHEECCCHHH Q ss_conf 1406775200212122332205971786064031-0100000101 Q gi|254780971|r 57 LRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQ-ILVELNLLPG 100 (219) Q Consensus 57 l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQ-iL~elGLlPg 100 (219) +-.+.....+.+.+++|.++|+++-.||-+=| +|.+.|||++ T Consensus 79 --a~~L~~~~~v~~~lk~~~~~gK~iAAICAaP~~vL~~~gll~~ 121 (196) T PRK11574 79 --AECFRDSPLLVETVKQFHRSGRIVAAICAAPATVLVPHDIFPI 121 (196) T ss_pred --HHHHHHCHHHHHHHHHHHHCCCEEEEECHHHHHHHHHCCCCCC T ss_conf --9998739899999999998799199982406999987886579 No 101 >cd03137 GATase1_AraC_1 AraC transcriptional regulators having a Type 1 glutamine amidotransferase (GATase1)-like domain. A subgroup of AraC transcriptional regulators having a Type 1 glutamine amidotransferase (GATase1)-like domain. AraC regulators are defined by a AraC-type helix-turn-helix DNA binding domain at their C-terminal. AraC family transcriptional regulators are widespread among bacteria and are involved in regulating diverse and important biological functions, including carbon metabolism, stress responses and virulence in different microorganisms. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with typical GATase1domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=98.01 E-value=1.1e-05 Score=57.11 Aligned_cols=92 Identities=21% Similarity=0.478 Sum_probs=60.7 Q ss_pred EEEEECCCCC-----CHHHHHHHHHHHCCCCEE---EEECC-----------------CCCCCCCCEEEECCCCCCCCCC Q ss_conf 8998449706-----479999999985499639---98215-----------------4644564789972873454311 Q gi|254780971|r 3 TAIVQIPGLN-----RDNDMIKAITKIIGQSPI---LVWQS-----------------DTDIPDVDLIVIPGGFSYGDYL 57 (219) Q Consensus 3 vaVl~~pGsN-----cd~e~~~A~~~~~~~~~~---~v~~~-----------------~~~l~~~d~lvipGGFSygD~l 57 (219) |+|+-|||.+ .-.|+.+.-.+..+...+ ++..+ ..+.+++|.|++|||..- T Consensus 1 I~ill~~gf~~~~~~~~~evl~~An~~~~~~~~~~~~vs~~g~~V~ss~G~~i~~d~~~~~~~~~D~livpgg~~~---- 76 (187) T cd03137 1 VAVLVFPGVSLLDLSGPAEVFGEANRALGPPAYELRVCSPEGGPVRSSSGLSLVADAGLDALAAADTVIVPGGPDV---- 76 (187) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCCCEEECCCCEEECCCCHHHCCCCCEEEECCCCCH---- T ss_conf 9999979972999999999999989855999879999956999158479978953765334564699998898861---- Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHH Q ss_conf 4067752002121223322059717860640310100000101 Q gi|254780971|r 58 RCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPG 100 (219) Q Consensus 58 ~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg 100 (219) -.......+.+.+++..+++..+.|||+|..+|.+.|||-| T Consensus 77 --~~~~~~~~l~~~L~~~~~~~~~l~~ictG~~~LA~aGLL~g 117 (187) T cd03137 77 --DGRPPPPALLAALRRAAARGARVASVCTGAFVLAEAGLLDG 117 (187) T ss_pred --HHCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHCCCCCC T ss_conf --10259999999999997538668875314098874566589 No 102 >cd03128 GAT_1 Type 1 glutamine amidotransferase (GATase1)-like domain. Type 1 glutamine amidotransferase (GATase1)-like domain. This group contains proteins similar to Class I glutamine amidotransferases, the intracellular PH1704 from Pyrococcus horikoshii, the C-terminal of the large catalase: Escherichia coli HP-II, Sinorhizobium meliloti Rm1021 ThuA, the A4 beta-galactosidase middle domain and peptidase E. The majority of proteins in this group have a reactive Cys found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For Class I glutamine amidotransferases proteins which transfer ammonia from the amide side chain of glutamine to an acceptor substrate, this Cys forms a Cys-His-Glu catalytic triad in the active site. Glutamine amidotransferases activity can be found in a range of biosynthetic enzymes included in this cd: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase, anthranilate synthase component II, glutamin Probab=98.01 E-value=3.1e-05 Score=54.34 Aligned_cols=84 Identities=26% Similarity=0.418 Sum_probs=60.4 Q ss_pred EEEEECCCCCCH--HHHHHHHHHHCCCCEEEEECCCC------CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 899844970647--99999999854996399821546------4456478997287345431140677520021212233 Q gi|254780971|r 3 TAIVQIPGLNRD--NDMIKAITKIIGQSPILVWQSDT------DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKK 74 (219) Q Consensus 3 vaVl~~pGsNcd--~e~~~A~~~~~~~~~~~v~~~~~------~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~ 74 (219) ++++..++.+.+ .....++++ .+....++...+. .+.++|++++|||.+...... ......+.+++ T Consensus 1 i~i~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~d~vii~gg~~~~~~~~-----~~~~~~~~i~~ 74 (92) T cd03128 1 VAVLLFGGSEELELASPLDALRE-AGAEVDVVSPDGGPVESDVDLDDYDGLILPGGPGTPDDLA-----WDEALLALLRE 74 (92) T ss_pred CEEEECCCCCHHHHHHHHHHHHH-CCCEEEEEECCCCCCCCCCCHHHCCEEEECCCCCCCCCCH-----HCHHHHHHHHH T ss_conf 98995699458999999999997-7997999953899644536864588999789986642101-----09899999999 Q ss_pred HHHCCCEEEEECCCHHHH Q ss_conf 220597178606403101 Q gi|254780971|r 75 KAQQGIKVMGICNGFQIL 92 (219) Q Consensus 75 ~~~~g~~vLGICNGfQiL 92 (219) +.+++++++|+|.|.|+| T Consensus 75 ~~~~~~~i~~~c~G~~~l 92 (92) T cd03128 75 AAAAGKPVLGICLGAQLL 92 (92) T ss_pred HHHCCCEEEEECCHHHHC T ss_conf 998799799999047539 No 103 >cd03139 GATase1_PfpI_2 Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. PfpI is an ATP-independent intracellular proteases which may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=97.97 E-value=2.6e-05 Score=54.79 Aligned_cols=57 Identities=30% Similarity=0.615 Sum_probs=45.5 Q ss_pred CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHH Q ss_conf 445647899728734543114067752002121223322059717860640310100000101 Q gi|254780971|r 38 DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPG 100 (219) Q Consensus 38 ~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg 100 (219) +++++|.|++|||.. .........+.+.++++.++++.+.+||+|--+|.+.|||.| T Consensus 59 ~~~~~D~lvvpGg~~------~~~~~~~~~l~~~Lr~~~~~g~~i~siCtGa~~LA~aGlL~g 115 (183) T cd03139 59 DPPDLDVLLVPGGGG------TRALVNDPALLDFIRRQAARAKYVTSVCTGALLLAAAGLLDG 115 (183) T ss_pred HCCCCCEEEECCCCC------HHHHCCCHHHHHHHHHHHHCCCEEEEECHHHHHHHHCCCCCC T ss_conf 667589999879987------453048989999999987619838987376799997477699 No 104 >KOG3210 consensus Probab=97.94 E-value=4.1e-05 Score=53.59 Aligned_cols=174 Identities=23% Similarity=0.298 Sum_probs=95.9 Q ss_pred EEEEECCCCCCHHHHHHHHHHHCCCCEEEE------ECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 899844970647999999998549963998------21546445647899728734543114067752002121223322 Q gi|254780971|r 3 TAIVQIPGLNRDNDMIKAITKIIGQSPILV------WQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKA 76 (219) Q Consensus 3 vaVl~~pGsNcd~e~~~A~~~~~~~~~~~v------~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~ 76 (219) ++||..+|- =.|-..-+.+......+.+ .....++.+.|+|+||||-| ..-+.++-++.+++.+-.|. T Consensus 14 IGVLALQGA--FiEH~N~~~~c~~en~y~Ik~~~~tVKT~~D~aq~DaLIIPGGES----T~mslia~~tgL~d~L~~fV 87 (226) T KOG3210 14 IGVLALQGA--FIEHVNHVEKCIVENRYEIKLSVMTVKTKNDLAQCDALIIPGGES----TAMSLIAERTGLYDDLYAFV 87 (226) T ss_pred EEEEEHHHH--HHHHHHHHHHHHCCCCCEEEEEEEEECCHHHHHHCCEEEECCCCC----HHHHHHHHHHHHHHHHHHHH T ss_conf 653102157--899887887753058626788888615777786577778339850----26788874410177799984 Q ss_pred HCC-CEEEEECCCHHHHEEC-----------CCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCC Q ss_conf 059-7178606403101000-----------0010110124433245422576752523577641379968998620133 Q gi|254780971|r 77 QQG-IKVMGICNGFQILVEL-----------NLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHD 144 (219) Q Consensus 77 ~~g-~~vLGICNGfQiL~el-----------GLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgE 144 (219) .++ +++-|.|.|+-.|++- +++.-...+|.-+|-. . .||+...-. -.+|-+... T Consensus 88 hn~~k~~WGTCAGmI~LS~ql~nek~~~~tL~~lkv~V~RN~FG~Qa-----------q-SFT~~~~~s--nfi~~~~~F 153 (226) T KOG3210 88 HNPSKVTWGTCAGMIYLSQQLSNEKKLVKTLNLLKVKVKRNAFGRQA-----------Q-SFTRICDFS--NFIPHCNDF 153 (226) T ss_pred CCCCCCEEEECHHHHHHHHHHCCCCCCHHHHHCEEEEEEECCCCCHH-----------H-CCEEHHCCC--CCCCCCCCC T ss_conf 48886403410225310565528852044330046777602136010-----------0-211110110--346575668 Q ss_pred EEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHH Q ss_conf 02330867741154225036531688888988112668886999899984884343 Q gi|254780971|r 145 GNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENI 200 (219) Q Consensus 145 Grf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~ 200 (219) ---++.....+++.+.=.|...|..+- ||-..=+|. ..++++|+.-.|||-+ T Consensus 154 pATFIRAPVie~ILD~I~V~~l~~~~~--nG~~~iVAa--~Q~~~iL~TSFHPELa 205 (226) T KOG3210 154 PATFIRAPVIEEILDPIHVQVLYKLDG--NGQELIVAA--KQKNNILATSFHPELA 205 (226) T ss_pred CHHHEECHHHHHHCCCHHEEEEEEECC--CCCEEEEEE--ECCCCEEEEECCHHHH T ss_conf 344311156887546222147899448--995789998--3269776760584450 No 105 >TIGR00566 trpG_papA glutamine amidotransferase of anthranilate synthase or para-aminobenzoate synthase; InterPro: IPR006221 This entry presents the glutamine amidotransferase domain or peptide of the tryptophan-biosynthetic pathway enzyme anthranilate synthase or of the folate biosynthetic pathway enzyme para-aminobenzoate synthase. In at least one case, a single polypeptide from Bacillus subtilis was shown to have both functions. ; GO: 0004049 anthranilate synthase activity, 0008152 metabolic process. Probab=97.91 E-value=4.5e-05 Score=53.30 Aligned_cols=163 Identities=24% Similarity=0.354 Sum_probs=88.6 Q ss_pred HHHHHHHHHHC----CCCEEEEECCCCC-----------CCCCCEEEECCCCCCCCCCCHHHHHHHH-HHHHHHHHHHHC Q ss_conf 99999999854----9963998215464-----------4564789972873454311406775200-212122332205 Q gi|254780971|r 15 NDMIKAITKII----GQSPILVWQSDTD-----------IPDVDLIVIPGGFSYGDYLRCGAIAART-PVMQAIKKKAQQ 78 (219) Q Consensus 15 ~e~~~A~~~~~----~~~~~~v~~~~~~-----------l~~~d~lvipGGFSygD~l~aG~i~~~~-~~~~~i~~~~~~ 78 (219) |-+.+.|...+ |.+.++|++++.. .+=||.|++-=|=-+=| ..+-.+ .+.+.++.++-+ T Consensus 13 YNlv~~~~~~gdisPGv~~V~Vkrnd~~~~~~ie~L~P~~~lfdaivisPGPC~P~-----eAa~~~Gii~~~i~h~aGk 87 (212) T TIGR00566 13 YNLVQYFEELGDISPGVAEVVVKRNDSLTLQEIEALLPLLPLFDAIVISPGPCTPN-----EAAISMGIILEAIRHFAGK 87 (212) T ss_pred HHHHHHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCEEEECCCCCCCC-----HHHCCHHHHHHHHHHHCCC T ss_conf 89999864217767361057772067310345876311356334567658656832-----1220411799998742376 Q ss_pred --CCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEEC-CCEEEEECHHHHH Q ss_conf --9717860640310100000101101244332454225767525235776413799689986201-3302330867741 Q gi|254780971|r 79 --GIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAH-HDGNYFIDAKGLA 155 (219) Q Consensus 79 --g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaH-gEGrf~~~~~~l~ 155 (219) ..|+||||-|+|.|...- |+.. +..+.++-|++-.+ .| |+|+|- -|. T Consensus 88 ldevPILGvCLG~QA~~qA~---Gg~V---------------------~~a~~~~HGk~s~i--~hng~~~f~----GL~ 137 (212) T TIGR00566 88 LDEVPILGVCLGHQALAQAF---GGDV---------------------VRANTVMHGKTSEI--EHNGAGIFR----GLF 137 (212) T ss_pred CCCCCCCCCCHHHHHHHHHC---CCCE---------------------EEECCCCCCCCCCE--EECCCEEEC----CCC T ss_conf 22676020105568788742---8722---------------------54211368730012--477710124----655 Q ss_pred H-HHHCCEEEEECCC----------------CCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 1-5422503653168----------------8888988112668886999899984884343221006776289998642 Q gi|254780971|r 156 E-IEKNNQIVFRYAS----------------GTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 156 ~-l~~~~~i~~~Y~d----------------~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) . +-.+-+.+.||+. -+.-+|. -|=||-.++--+.|-+.|||=.. +.-|..|.+|++ T Consensus 138 ~GyPP~~L~~TRYHSL~V~p~~l~~~~~~~~~~~~~G~--eiM~irh~~~p~eGVQFHPESil-----Se~G~~ll~Nfl 210 (212) T TIGR00566 138 NGYPPDTLTATRYHSLVVEPETLPTCFQVTALEEENGI--EIMAIRHRDLPLEGVQFHPESIL-----SELGHQLLANFL 210 (212) T ss_pred CCCCCCCCCCCCEEEEEECCCCCCCCCCEEEEECCCCC--EEEHHHHCCCCCCCEEECCHHHH-----HHHHHHHHHHHH T ss_conf 77686644357505566744577622010100016870--13101334785022010713553-----422789999872 Q ss_pred C Q ss_conf 9 Q gi|254780971|r 219 T 219 (219) Q Consensus 219 s 219 (219) . T Consensus 211 ~ 211 (212) T TIGR00566 211 H 211 (212) T ss_pred C T ss_conf 3 No 106 >cd01653 GATase1 Type 1 glutamine amidotransferase (GATase1)-like domain. Type 1 glutamine amidotransferase (GATase1)-like domain. This group includes proteins similar to Class I glutamine amidotransferases, the intracellular PH1704 from Pyrococcus horikoshii, the C-terminal of the large catalase: Escherichia coli HP-II, Sinorhizobium meliloti Rm1021 ThuA. and, the A4 beta-galactosidase middle domain. The majority of proteins in this group have a reactive Cys found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For Class I glutamine amidotransferases proteins which transfer ammonia from the amide side chain of glutamine to an acceptor substrate, this Cys forms a Cys-His-Glu catalytic triad in the active site. Glutamine amidotransferases activity can be found in a range of biosynthetic enzymes included in this cd: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase, anthranilate synthase component II, glutamine-depende Probab=97.89 E-value=5.1e-05 Score=52.95 Aligned_cols=84 Identities=26% Similarity=0.481 Sum_probs=59.2 Q ss_pred EEEEECCCCCCH--HHHHHHHHHHCCCCEEEEECCCC------CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 899844970647--99999999854996399821546------4456478997287345431140677520021212233 Q gi|254780971|r 3 TAIVQIPGLNRD--NDMIKAITKIIGQSPILVWQSDT------DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKK 74 (219) Q Consensus 3 vaVl~~pGsNcd--~e~~~A~~~~~~~~~~~v~~~~~------~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~ 74 (219) ++|+..++.+.. .....++.+ .+....++..... ++.++|++++|||++-.+... ......+.+++ T Consensus 1 v~i~~~~~~~~~~~~~~~~~l~~-~~~~~~~i~~~~~~~~~~~~~~~~d~liipgg~~~~~~~~-----~~~~~~~~~~~ 74 (115) T cd01653 1 VAVLLFPGFEELELASPLDALRE-AGAEVDVVSPDGGPVESDVDLDDYDGLILPGGPGTPDDLA-----RDEALLALLRE 74 (115) T ss_pred CEEEECCCCCHHHHHHHHHHHHH-CCCEEEEEECCCCCCCCCCCHHCCCEEEECCCCCCHHHHH-----HCHHHHHHHHH T ss_conf 98994599528999999999997-7997999965899866657620089999899863299986-----39999999999 Q ss_pred HHHCCCEEEEECCCHHHH Q ss_conf 220597178606403101 Q gi|254780971|r 75 KAQQGIKVMGICNGFQIL 92 (219) Q Consensus 75 ~~~~g~~vLGICNGfQiL 92 (219) +..++++++|+|.|.|+| T Consensus 75 ~~~~~~~i~~~c~g~~~l 92 (115) T cd01653 75 AAAAGKPILGICLGAQLL 92 (115) T ss_pred HHHCCCEEEEECHHHHHH T ss_conf 998699899988588999 No 107 >COG1797 CobB Cobyrinic acid a,c-diamide synthase [Coenzyme metabolism] Probab=97.88 E-value=4.7e-05 Score=53.19 Aligned_cols=170 Identities=23% Similarity=0.376 Sum_probs=97.4 Q ss_pred EEEEEECCCCC-CHHHHHHHHHHHCCCCEEE-EECCCCCCC-CCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC Q ss_conf 68998449706-4799999999854996399-821546445-64789972873454311406775200212122332205 Q gi|254780971|r 2 KTAIVQIPGLN-RDNDMIKAITKIIGQSPIL-VWQSDTDIP-DVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ 78 (219) Q Consensus 2 kvaVl~~pGsN-cd~e~~~A~~~~~~~~~~~-v~~~~~~l~-~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~ 78 (219) |+||.+-+-=| .=.|....+++ .|.+.+. -=.+|..++ +.|++-|||||- .+-+++..+++...+++++++++ T Consensus 247 rIAVA~D~AF~FyY~~nl~~Lr~-~GAelv~FSPL~D~~lP~~~D~vYlgGGYP---ElfA~~L~~n~~~~~~i~~~~~~ 322 (451) T COG1797 247 RIAVARDAAFNFYYPENLELLRE-AGAELVFFSPLADEELPPDVDAVYLGGGYP---ELFAEELSANESMRRAIKAFAAA 322 (451) T ss_pred EEEEEECCHHCCCCHHHHHHHHH-CCCEEEEECCCCCCCCCCCCCEEEECCCCH---HHHHHHHHHCHHHHHHHHHHHHC T ss_conf 69998545100032999999997-797799857867777997788899689774---88999984479999999999876 Q ss_pred CCEEEEECCCHHHHEE------------CCCHHHHH-CCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCE Q ss_conf 9717860640310100------------00010110-1244332454225767525235776413799689986201330 Q gi|254780971|r 79 GIKVMGICNGFQILVE------------LNLLPGIL-MRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDG 145 (219) Q Consensus 79 g~~vLGICNGfQiL~e------------lGLlPg~l-~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEG 145 (219) |+|++|=|.|+--|++ .|+||+.- +.+.-..+--++... . ..|++.+. |+.+.-+=-| T Consensus 323 G~piyaECGGlMYL~~~le~~~G~~~~M~Gvlp~~~~m~~Rl~~lGY~~~~~--~-~d~~~~~~---G~~irGHEFH--- 393 (451) T COG1797 323 GKPIYAECGGLMYLGESLEDADGDTYEMVGVLPGSTRMTKRLQALGYREAEA--V-DDTLLLRA---GEKIRGHEFH--- 393 (451) T ss_pred CCCEEEECCCCEEEHHHEECCCCCEEEEEEEECCCHHHHHHHHCCCEEEEEE--C-CCCCCCCC---CCEEEEEEEE--- T ss_conf 9945885155112101204268966543300033013543343166069984--3-77634568---8636632103--- Q ss_pred EEEECHHHHHHHH--HCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCC Q ss_conf 2330867741154--225036531688888988112668886999899984884 Q gi|254780971|r 146 NYFIDAKGLAEIE--KNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHP 197 (219) Q Consensus 146 rf~~~~~~l~~l~--~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHP 197 (219) | .++. ...+.+++-.+..-.. +.-+|+++ ||++|--.|- T Consensus 394 -y-------S~~~~~~~~~~a~~~~~g~g~~---~~~~G~~~--gnv~asY~H~ 434 (451) T COG1797 394 -Y-------SRLITEEDAEPAFRVRRGDGID---NGRDGYRS--GNVLASYLHL 434 (451) T ss_pred -E-------EECCCCCCCCEEEEEECCCCCC---CCCCCEEE--CCEEEEEEEE T ss_conf -0-------0123577675103553255766---66540366--7767998755 No 108 >COG0693 ThiJ Putative intracellular protease/amidase [General function prediction only] Probab=97.86 E-value=9.4e-05 Score=51.26 Aligned_cols=94 Identities=28% Similarity=0.440 Sum_probs=65.1 Q ss_pred CEEEEEECCCCCCHHHH---HHHHHHHCCCCEEEEECCC----------------------CCCCCCCEEEECCCCCCCC Q ss_conf 96899844970647999---9999985499639982154----------------------6445647899728734543 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDM---IKAITKIIGQSPILVWQSD----------------------TDIPDVDLIVIPGGFSYGD 55 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~---~~A~~~~~~~~~~~v~~~~----------------------~~l~~~d~lvipGGFSygD 55 (219) +|++|+.++|.. +.|. ...|.+ ++..+.++..+. .+..+||++++|||..-.+ T Consensus 3 ~~i~i~~~~g~e-~~E~~~p~~~l~~-ag~~v~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ydal~ipGG~~~~~ 80 (188) T COG0693 3 KKIAILLADGFE-DLELIVPYDVLRR-AGFEVDVASPEGKGKSVTSKRGGLVVADDKAFDDADAADYDALVIPGGDHGPE 80 (188) T ss_pred CEEEEEECCCCC-HHHHHHHHHHHHH-CCCEEEEEEECCCCCCEEECCCCEEEECCHHCCCCCCCCCCEEEECCCCCCHH T ss_conf 327998348953-5443029999997-79707999752676541644786078321211657813598999889975232 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECC-CHHHH Q ss_conf 11406775200212122332205971786064031010000-01011 Q gi|254780971|r 56 YLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELN-LLPGI 101 (219) Q Consensus 56 ~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elG-LlPg~ 101 (219) +++ .+..+.+.+++|.+.|++|-.||.|-|+|...| ++.|. T Consensus 81 ~~~-----~~~~~~~~v~~~~~~~k~vaaIC~g~~~L~~ag~ll~g~ 122 (188) T COG0693 81 YLR-----PDPDLLAFVRDFYANGKPVAAICHGPAVLAAAGLLLKGR 122 (188) T ss_pred HCC-----CCHHHHHHHHHHHHCCCEEEEECCHHHHHHHCCCCCCCC T ss_conf 213-----517899999999976998999846889987476831484 No 109 >PRK13566 anthranilate synthase; Provisional Probab=97.84 E-value=0.00072 Score=45.59 Aligned_cols=162 Identities=21% Similarity=0.320 Sum_probs=90.8 Q ss_pred HHHHHHHHCCCCEEEEECCCC----CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 999999854996399821546----4456478997287345431140677520021212233220597178606403101 Q gi|254780971|r 17 MIKAITKIIGQSPILVWQSDT----DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQIL 92 (219) Q Consensus 17 ~~~A~~~~~~~~~~~v~~~~~----~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL 92 (219) .+..|.+ .|.++.++-+... +-.++|++||-=| .|.= +...+...+.+.++.+.|+||||-|-|.+ T Consensus 539 La~Ylrq-lGAeV~vvR~d~~~~~l~~~~pD~vvlSPG--------PG~P-~d~g~~~~i~~~~~~~iPilGVCLGhQaI 608 (724) T PRK13566 539 LANYFRQ-TGAEVTTVRYPVAEEILDRVNPDLVVLSPG--------PGRP-EDFDCKATIDAALARNLPIFGVCLGLQAI 608 (724) T ss_pred HHHHHHH-HCCEEEEEECCCCHHHHHHHCCCEEEECCC--------CCCC-HHCCCHHHHHHHHHCCCCEEEECCCHHHH T ss_conf 9999998-198799981898767776318898998799--------9790-00558599999985799879977127699 Q ss_pred EEC-C----CHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE---EECHHHHHHHHHCCEEE Q ss_conf 000-0----010110124433245422576752523577641379968998620133023---30867741154225036 Q gi|254780971|r 93 VEL-N----LLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY---FIDAKGLAEIEKNNQIV 164 (219) Q Consensus 93 ~el-G----LlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf---~~~~~~l~~l~~~~~i~ 164 (219) .|. | -+| ..+|- +-..+++.. .+..++++.. .+.+ ||| +.+.+. +.+ .+.+ T Consensus 609 ~eafGg~l~~~~-~p~HG-------K~S~I~~~~-~~~~F~glp~--~f~v------~RYHSL~v~~~~---lP~-~L~V 667 (724) T PRK13566 609 VEAFGGELGQLA-YPMHG-------KPSRIRVRG-PGLVFSGLPN--EFTV------GRYHSLFADPER---LPD-ELLV 667 (724) T ss_pred HHHHCCEEEECC-CCCCC-------CEEEEEEEC-CCCCCCCCCC--CEEE------EEEEEEEECCCC---CCC-CEEE T ss_conf 897493885458-57889-------813579848-9741078999--6078------841379862677---998-3389 Q ss_pred EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 165 FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 165 ~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) .-..+ +|- |-||-.+.-.+.|.+.|||-.+- +++..|+.|.+|++ T Consensus 668 TA~te----dGv---iMai~H~~lpi~gVQFHPESIlT--~~g~~G~~ii~N~~ 712 (724) T PRK13566 668 TAETE----DGV---IMAIEHKTLPVAAVQFHPESIMT--LGGEVGLRIIENVV 712 (724) T ss_pred EEECC----CCE---EEEEEECCCCCEEEEECCHHHCC--CCCCCCHHHHHHHH T ss_conf 99858----982---89885178761256107678622--88744399999999 No 110 >pfam01965 DJ-1_PfpI DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. Probab=97.83 E-value=1.2e-05 Score=56.97 Aligned_cols=61 Identities=30% Similarity=0.527 Sum_probs=48.9 Q ss_pred CCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHH Q ss_conf 546445647899728734543114067752002121223322059717860640310100000101 Q gi|254780971|r 35 SDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPG 100 (219) Q Consensus 35 ~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg 100 (219) ++.+..+||.+++|||+.-.+.+ .....+.+.++++.++++++.+||.|-.+|.+.|||.| T Consensus 28 ~~~~~~~~D~lvipGG~~~~~~l-----~~~~~~~~~l~~~~~~~~~i~aiC~g~~~La~~gll~g 88 (141) T pfam01965 28 DDVNADDYDALVIPGGHAAAEDL-----RDDEKLVKFVKEFYEAGKPIAAICHGPVLLAAAGLLKG 88 (141) T ss_pred HHCCHHHCCEEEECCCCCHHHHH-----HHCHHHHHHHHHHHHHCCEEEEECHHHHHHHHCCCCCC T ss_conf 88895249999996994067664-----01999999999999839989998766899998698899 No 111 >KOG2387 consensus Probab=97.70 E-value=3.1e-05 Score=54.35 Aligned_cols=150 Identities=23% Similarity=0.359 Sum_probs=85.1 Q ss_pred CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHE-E-----CCCH--------HHH--- Q ss_conf 4564789972873454311406775200212122332205971786064031010-0-----0001--------011--- Q gi|254780971|r 39 IPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILV-E-----LNLL--------PGI--- 101 (219) Q Consensus 39 l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~-e-----lGLl--------Pg~--- 101 (219) |.+.|++++|||| ||+--.|+|.|- +-+-+++.|.||||-|+|+-+ | ||+- |.+ T Consensus 361 l~~adGilvPGGF--G~RGveG~i~Aa-------k~ARen~iP~LGiCLGmQ~AvIEfaRnvLg~~dAnStEF~p~~~~~ 431 (585) T KOG2387 361 LKSADGILVPGGF--GDRGVEGKILAA-------KWARENKIPFLGICLGMQLAVIEFARNVLGLKDANSTEFDPETKNP 431 (585) T ss_pred HCCCCEEEECCCC--CCCCHHHHHHHH-------HHHHHCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 5248828967866--544305789999-------9987649975763111269999999986177778732248788986 Q ss_pred ---HCCCCC-------CCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCC Q ss_conf ---012443-------3245422576752523577641379968998620133023308677411542250365316888 Q gi|254780971|r 102 ---LMRNCS-------LKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGT 171 (219) Q Consensus 102 ---l~~N~s-------~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~ 171 (219) +++..+ -|.-+|-+.++..+ |..-+-+ |.+-.+-=.| --||-.+++..++|++++....-. |. T Consensus 432 vVi~MPE~~~~~mGgtMRLG~R~t~f~~~~--s~~~kLY--G~~~~V~ERH-RHRyEVNP~~v~~le~~Gl~FvGk-d~- 504 (585) T KOG2387 432 VVIFMPEHNKTHMGGTMRLGSRRTVFQDKD--SKLRKLY--GNVEFVDERH-RHRYEVNPEMVKQLEQAGLSFVGK-DV- 504 (585) T ss_pred EEEECCCCCCCCCCCEEEECCCCEEEECCC--HHHHHHH--CCCHHHHHHH-HCCEECCHHHHHHHHHCCCEEEEE-CC- T ss_conf 799876887444464466246411650571--5789875--7950465654-110114889999998669689851-68- Q ss_pred CCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCC Q ss_conf 889881126688869998999848843432210067 Q gi|254780971|r 172 NPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGG 207 (219) Q Consensus 172 NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~ 207 (219) .|-...|--+-+ .--..|.+.|||-...+...+ T Consensus 505 --~g~rmeI~El~~-HP~fVg~QfHPE~~srp~kps 537 (585) T KOG2387 505 --TGKRMEIIELES-HPFFVGVQFHPEFKSRPDKPS 537 (585) T ss_pred --CCCEEEEEECCC-CCCEEEECCCHHHHCCCCCCC T ss_conf --885788998079-872565313777745899999 No 112 >PRK09393 ftrA transcriptional activator FtrA; Provisional Probab=97.67 E-value=6.9e-05 Score=52.10 Aligned_cols=110 Identities=19% Similarity=0.305 Sum_probs=67.5 Q ss_pred EEEEEECCCCC-----CHHHHHHHHHHHCCCCEEEEEC---C---------------C---CCCCCCCEEEECCCCCCCC Q ss_conf 68998449706-----4799999999854996399821---5---------------4---6445647899728734543 Q gi|254780971|r 2 KTAIVQIPGLN-----RDNDMIKAITKIIGQSPILVWQ---S---------------D---TDIPDVDLIVIPGGFSYGD 55 (219) Q Consensus 2 kvaVl~~pGsN-----cd~e~~~A~~~~~~~~~~~v~~---~---------------~---~~l~~~d~lvipGGFSygD 55 (219) ||+||.|+|-. +-.|..++-....+.+. +-|+ . | ...+..|.|++|||..-.. T Consensus 9 rV~ill~dgf~~l~~a~~~evf~~An~~~g~~~-y~~~~~S~dg~~V~ss~G~~v~~d~~~~~~~~~D~lvV~gg~~~~~ 87 (320) T PRK09393 9 LVVALAYDGLCTFEFGCAVEIFGLPRPELGVDW-YRFAVCAVEPGPLRAAGGITVAADGGLELLDRADTIVIPGWRDPDE 87 (320) T ss_pred EEEEEEECCCCEECHHHHHHHHHHHHHHCCCCC-EEEEEEECCCCEEEECCCEEEECCCCHHHCCCCCEEEECCCCCCCC T ss_conf 899999799875013069999863044429986-6999993799805647991796787765657688899878777244 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHH---HCCCCCCCEEEEEEEEEE Q ss_conf 1140677520021212233220597178606403101000001011---012443324542257675 Q gi|254780971|r 56 YLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGI---LMRNCSLKFVCKQVLLEV 119 (219) Q Consensus 56 ~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~---l~~N~s~rf~~r~~~~~v 119 (219) .....+.+.+++...+|..+.|||.|.-+|.+.|||.|. ..+-....|..++-.+++ T Consensus 88 -------~~~~~l~~wLr~~~~~g~~l~~vctGaf~LA~aGLLdg~~aTtHW~~~~~f~~~fP~v~v 147 (320) T PRK09393 88 -------PVPEPLLKALRAAHARGARLCSICSGVFVLAAAGLLDGRRATTHWRYAERLAARYPAIRV 147 (320) T ss_pred -------CCCHHHHHHHHHHHHCCCEEEEECHHHHHHHHCCCCCCCEEEECHHHHHHHHHHCCCCEE T ss_conf -------699899999999987599899982999999973887897368859999999976897378 No 113 >pfam09825 BPL_N Biotin-protein ligase, N terminal. The function of this structural domain is unknown. It is found to the N terminus of the biotin protein ligase catalytic domain. Probab=97.64 E-value=0.0069 Score=39.32 Aligned_cols=188 Identities=23% Similarity=0.332 Sum_probs=98.9 Q ss_pred CEEEEEECCCCC--CHHHHHHHHHHHCCC--CEEEEECCC----CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 968998449706--479999999985499--639982154----644564789972873454311406775200212122 Q gi|254780971|r 1 MKTAIVQIPGLN--RDNDMIKAITKIIGQ--SPILVWQSD----TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAI 72 (219) Q Consensus 1 mkvaVl~~pGsN--cd~e~~~A~~~~~~~--~~~~v~~~~----~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i 72 (219) |-|.|-.-||+. +=+-+...+.++..- .+..|..+. -=..+..+||+||| .|-.-+-.. +..-...| T Consensus 1 MNVLVYsG~G~t~~sv~h~l~sLr~~l~p~YaV~~v~~~~l~~ePW~~~~aLLV~PGG---aDlpy~~~l--~g~g~~~I 75 (364) T pfam09825 1 MNVLVYNGPGTTPESVKHCLESLRLLLEPYYAVSPVTASVLENEPWMSKTALLVFPGG---ADLPYCRDL--NGPGNRKI 75 (364) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEEECCHHHHHCCCCCCCEEEEEECCC---CCCHHHHHH--CCCCCHHH T ss_conf 9289976899898999999999999747872688646888744963116369992488---675677761--86111779 Q ss_pred HHHHHCCCEEEEECCCHHHHE------------------ECCCHHHH--------HCCCCCCCEEEEEEEEEECCCHHHH Q ss_conf 332205971786064031010------------------00001011--------0124433245422576752523577 Q gi|254780971|r 73 KKKAQQGIKVMGICNGFQILV------------------ELNLLPGI--------LMRNCSLKFVCKQVLLEVVNSNTAF 126 (219) Q Consensus 73 ~~~~~~g~~vLGICNGfQiL~------------------elGLlPg~--------l~~N~s~rf~~r~~~~~v~~~~s~~ 126 (219) ++|.++|+.-||+|-|---.+ ||+..||+ +..|... -.|-+.+++..+. T Consensus 76 ~~fV~~GG~yLG~CAG~Yy~s~r~eF~~g~~~leV~G~ReL~FfpG~~rG~af~gf~Y~se~--Garaa~l~v~~~~--- 150 (364) T pfam09825 76 RDFVRQGGAYLGFCAGGYFGSSRVEFEVGDPTMEVVGPRELGFFPGTCRGPAFKGFQYNSEA--GARAVKLNVNKAQ--- 150 (364) T ss_pred HHHHHCCCEEEEEECCCCCCCCEEEECCCCCCCEEECCCEECCCCCCCCCCCCCCCEECCCC--CCEEEEEEECCCC--- T ss_conf 99997398589970233444543798058866345115211135776558766662004667--8422024511467--- Q ss_pred HHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCC-CCCC-CCCEEEEECCCCCEEEECCCCHHHCCCC Q ss_conf 6413799689986201330233086774115422503653168888-8988-1126688869998999848843432210 Q gi|254780971|r 127 TKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTN-PNGS-LHDIAGVINRRGNVLGMMPHPENIIEKF 204 (219) Q Consensus 127 ~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~N-PNGS-~~~IAgi~s~~G~vlgmMPHPER~~~~~ 204 (219) . ..+.+.+. -.|-|-|+ |.+.+ ++-.|..+|.++.. |+|. .-++--..--+|+++=.=||||=+-..+ T Consensus 151 --~--~~~~~~~y-yNGGg~Fv-da~~~----~~VevLA~Y~e~~dv~~~~~~AAVV~ckVGkG~aiLTGpHPEF~~~~l 220 (364) T pfam09825 151 --S--LPSHFYSY-YNGGGVFV-DAEKY----SNVEILARYAEDLDVPSGDGKAAVVYCKVGKGKAVLTGPHPEFNPSSL 220 (364) T ss_pred --C--CCCCEEEE-ECCCEEEE-CCCCC----CCCEEEEEECCCCCCCCCCCEEEEEEEECCCCCEEEECCCCCCCCCCC T ss_conf --8--87644899-87983895-42004----893799873344678999860579999707851898658855560115 Q ss_pred CCCC Q ss_conf 0677 Q gi|254780971|r 205 HGGI 208 (219) Q Consensus 205 ~~~~ 208 (219) .... T Consensus 221 ~~~~ 224 (364) T pfam09825 221 KKSD 224 (364) T ss_pred CCCC T ss_conf 7666 No 114 >COG0505 CarA Carbamoylphosphate synthase small subunit [Amino acid transport and metabolism / Nucleotide transport and metabolism] Probab=97.58 E-value=0.0017 Score=43.15 Aligned_cols=163 Identities=23% Similarity=0.339 Sum_probs=88.0 Q ss_pred HHHHHHHHHHCCCCEEEEECC-C-CC--CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHH Q ss_conf 999999998549963998215-4-64--4564789972873454311406775200212122332205971786064031 Q gi|254780971|r 15 NDMIKAITKIIGQSPILVWQS-D-TD--IPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQ 90 (219) Q Consensus 15 ~e~~~A~~~~~~~~~~~v~~~-~-~~--l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQ 90 (219) +...+.|.+ -|....+|=.+ + .+ --+.|+|.|-.|= ||. +.-....+.+++++...+|++|||-|-| T Consensus 191 ~nIlr~L~~-rg~~vtVVP~~t~~eeIl~~~pDGiflSNGP--GDP------~~~~~~i~~ik~l~~~~iPifGICLGHQ 261 (368) T COG0505 191 RNILRELVK-RGCRVTVVPADTSAEEILALNPDGIFLSNGP--GDP------APLDYAIETIKELLGTKIPIFGICLGHQ 261 (368) T ss_pred HHHHHHHHH-CCCEEEEECCCCCHHHHHHHCCCEEEEECCC--CCH------HHHHHHHHHHHHHHCCCCCEEEECHHHH T ss_conf 779999997-7984999769899999985189979974899--880------6778999999998646898378868889 Q ss_pred HHEEC-CCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCC Q ss_conf 01000-00101101244332454225767525235776413799689986201330233086774115422503653168 Q gi|254780971|r 91 ILVEL-NLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYAS 169 (219) Q Consensus 91 iL~el-GLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d 169 (219) ||.-. |-= -..-||--|=.+--| +++..+.+. +- +|..| |-.|++ .+.+.+ .+... T Consensus 262 llalA~Ga~------T~KmkFGHrG~NhPV--------~dl~tgrv~-IT-SQNHG-yaVd~~---s~~~~~-~vth~-- 318 (368) T COG0505 262 LLALALGAK------TYKMKFGHRGANHPV--------KDLDTGRVY-IT-SQNHG-YAVDED---SLVETL-KVTHV-- 318 (368) T ss_pred HHHHHCCCC------EEECCCCCCCCCCCC--------CCCCCCEEE-EE-ECCCC-EECCHH---HCCCCC-EEEEE-- T ss_conf 999866786------432036887789676--------316588089-99-46886-013822---327776-05898-- Q ss_pred CCCCC-CCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 88889-88112668886999899984884343221006776289998642 Q gi|254780971|r 170 GTNPN-GSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 170 ~~NPN-GS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) |.| || +.||--.+..++.-+.|||.+-- -.|-..||..++ T Consensus 319 --nlnDgT---vEGi~h~~~P~fSVQ~HPEAsPG----PhDt~ylFd~Fi 359 (368) T COG0505 319 --NLNDGT---VEGIRHKDLPAFSVQYHPEASPG----PHDTRYLFDEFI 359 (368) T ss_pred --ECCCCC---CCCEECCCCCEEEECCCCCCCCC----CCCCHHHHHHHH T ss_conf --179997---00245279863787148999989----864477999999 No 115 >TIGR01815 TrpE-clade3 anthranilate synthase; InterPro: IPR010112 This entry represents a small clade of anthranilate synthases from alpha proteobacteria and Nostoc (a cyanobacterium). This enzyme is the first step in the pathway for the biosynthesis of tryprophan from chorismate.; GO: 0004049 anthranilate synthase activity, 0000162 tryptophan biosynthetic process. Probab=97.56 E-value=0.0002 Score=49.15 Aligned_cols=163 Identities=22% Similarity=0.346 Sum_probs=98.1 Q ss_pred HHHHHHHHCCCCEEEEECC--CC--CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 9999998549963998215--46--4456478997287345431140677520021212233220597178606403101 Q gi|254780971|r 17 MIKAITKIIGQSPILVWQS--DT--DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQIL 92 (219) Q Consensus 17 ~~~A~~~~~~~~~~~v~~~--~~--~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL 92 (219) +|.+|.+ .|..+..+-|. +. +-..=|++||- .|.|.=.- =.+..-|++..+.+.|+.|||-|-|-| T Consensus 540 LAnY~Rq-TGAsVTTlRh~~ae~~fd~~rPDLVVLS--------PGPGrP~d-Fdv~~Ti~aa~ar~lP~FGVCLGLQg~ 609 (726) T TIGR01815 540 LANYLRQ-TGASVTTLRHSFAEELFDEERPDLVVLS--------PGPGRPKD-FDVKETIKAALARDLPVFGVCLGLQGL 609 (726) T ss_pred HHHHHHH-CCCEEEECCCHHHHHHHHCCCCCEEEEC--------CCCCCCCC-CCHHHHHHHHHHCCCCEEEEHHHHHHH T ss_conf 7888763-4875530430478999732799889868--------73123875-447889999997289857741346899 Q ss_pred EEC-----CCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE---EECHHHHHHHHHCCEEE Q ss_conf 000-----0010110124433245422576752523577641379968998620133023---30867741154225036 Q gi|254780971|r 93 VEL-----NLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY---FIDAKGLAEIEKNNQIV 164 (219) Q Consensus 93 ~el-----GLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf---~~~~~~l~~l~~~~~i~ 164 (219) +|. +.||- =.++ |-..++|..-.|++++++.+. +. =||| +.+.+ +|-++=.|. T Consensus 610 vEafGG~L~vL~~----P~HG----K~srirVl~p~s~~F~gLpe~--~T------VGRYHSlfA~~~---rLP~el~vT 670 (726) T TIGR01815 610 VEAFGGELDVLAI----PVHG----KASRIRVLEPDSVLFAGLPEE--LT------VGRYHSLFARSD---RLPKELKVT 670 (726) T ss_pred HHHHCCCCCCCCC----CCCC----CCCEEEEECCCEEEECCCCCC--EE------EEEEEEEEECCC---CCCCCCEEE T ss_conf 9874672135788----8788----633688836875787278970--34------300376755645---687423787 Q ss_pred EECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHHC Q ss_conf 531688888988112668886999899984884343221006776289998642 Q gi|254780971|r 165 FRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASLL 218 (219) Q Consensus 165 ~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~~ 218 (219) ..=.|.. |=||=-.+-=+-|.+.|||=.+ . +.+.-|+.+++|.| T Consensus 671 A~s~Dg~--------iMaiEH~~lP~AAVQFHPESIm-t-L~G~aGl~mi~NVv 714 (726) T TIGR01815 671 AESEDGV--------IMAIEHRKLPVAAVQFHPESIM-T-LDGDAGLAMIENVV 714 (726) T ss_pred EEECCCE--------EEEEECCCCCCCEEECCCCHHH-C-CCCHHHHHHHHHHH T ss_conf 8506871--------7320247687112511761100-0-56046899999999 No 116 >cd03136 GATase1_AraC_ArgR_like AraC transcriptional regulators having an N-terminal Type 1 glutamine amidotransferase (GATase1)-like domain. A subgroup of AraC transcriptional regulators having an N-terminal Type 1 glutamine amidotransferase (GATase1)-like domain. This group contains proteins similar to the Pseudomonas aeruginosa ArgR regulator. ArgR functions in the control of expression of certain genes of arginine biosynthesis and catabolism. AraC regulators are defined by a AraC-type helix-turn-helix DNA binding domain at their C-terminal. AraC family transcriptional regulators are widespread among bacteria and are involved in regulating diverse and important biological functions, including carbon metabolism, stress responses and virulence in different microorganisms. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with typical GATase1domains a reactive cys residue is found in some sequences in the sharp turn betwee Probab=97.53 E-value=0.0003 Score=48.00 Aligned_cols=92 Identities=21% Similarity=0.416 Sum_probs=60.4 Q ss_pred EEEEECCCCC-----CHHHHHHHHHHHCCCCEE---EEECC-----------------CCCCCCCCEEEECCCCCCCCCC Q ss_conf 8998449706-----479999999985499639---98215-----------------4644564789972873454311 Q gi|254780971|r 3 TAIVQIPGLN-----RDNDMIKAITKIIGQSPI---LVWQS-----------------DTDIPDVDLIVIPGGFSYGDYL 57 (219) Q Consensus 3 vaVl~~pGsN-----cd~e~~~A~~~~~~~~~~---~v~~~-----------------~~~l~~~d~lvipGGFSygD~l 57 (219) +++|-|||.. .-.|+.++-.++.+...+ ++..+ ..+.+++|.|++|||+.-.. T Consensus 1 igill~~g~~~l~~ag~~evf~~An~~~~~~~f~v~~vs~~~~~V~~~~G~~i~~~~~~~~~~~~D~liV~Gg~~~~~-- 78 (185) T cd03136 1 FGFLLLPGFSLLALASAIEPLRAANRLAGRELYRWRVLSLDGAPVTSSNGLRVAPDAALEDAPPLDYLFVVGGLGARR-- 78 (185) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCEEEECCCCEEECCCCHHHCCCCCEEEECCCCCCCC-- T ss_conf 999996998488899999999998852599857999996599804737970896676700167678999878888021-- Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHH Q ss_conf 40677520021212233220597178606403101000001011 Q gi|254780971|r 58 RCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGI 101 (219) Q Consensus 58 ~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~ 101 (219) .....+.+.+++..+++..+.++|+|..+|.+.|||.|. T Consensus 79 -----~~~~~~~~~L~~~~~~g~~v~svctGa~~LA~aGlL~g~ 117 (185) T cd03136 79 -----AVTPALLAWLRRAARRGVALGGIDTGAFLLARAGLLDGR 117 (185) T ss_pred -----CCCHHHHHHHHHHHHCCCEEEEECHHHHHHHHCCCCCCC T ss_conf -----479999999999987099899864799999981887999 No 117 >cd03132 GATase1_catalase Type 1 glutamine amidotransferase (GATase1)-like domain found in at the C-terminal of several large catalases. Type 1 glutamine amidotransferase (GATase1)-like domain found in at the C-terminal of several large catalases. Catalase catalyzes the dismutation of hydrogen peroxide (H2O2) to water and oxygen. This group includes the large catalases: Neurospora crassa Catalase-1 and Catalase-3 and, Escherichia coli HP-II. This GATase1-like domain has an essential role in HP-II catalase activity. However, it lacks enzymatic activity and the catalytic triad typical of GATase1 domains. Catalase-1 and -3 are homotetrameric, HP-II is homohexameric. It has been proposed that this domain may facilitate the folding and oligomerization process. The interface between this GATase1-like domain of HP-II and the core of the subunit forms part of a channel which provides access to the deeply buried catalase active sites of HPII. Catalase-1 is associated with non-growing cells; C Probab=97.14 E-value=0.0022 Score=42.56 Aligned_cols=91 Identities=18% Similarity=0.241 Sum_probs=64.6 Q ss_pred EEEEEECCCCCCHH--HHHHHHHHHCCCCEEEEECCC-------------------CCCCCCCEEEECCCCCCCCCCCHH Q ss_conf 68998449706479--999999985499639982154-------------------644564789972873454311406 Q gi|254780971|r 2 KTAIVQIPGLNRDN--DMIKAITKIIGQSPILVWQSD-------------------TDIPDVDLIVIPGGFSYGDYLRCG 60 (219) Q Consensus 2 kvaVl~~pGsNcd~--e~~~A~~~~~~~~~~~v~~~~-------------------~~l~~~d~lvipGGFSygD~l~aG 60 (219) |||||..+|..-+. ....||++ .|..+.++--+. ..-..||.+++|||-.-.|.| T Consensus 3 kVaiLv~dg~~~~e~~~~~~al~~-aG~~v~via~~~g~v~~~~g~~~~vd~t~~~~~s~~fDAvvipgG~~~~~~L--- 78 (142) T cd03132 3 KVGILVADGVDAAELSALKAALKA-AGANVKVVAPTLGGVVDSDGKTLEVDQTYAGAPSVLFDAVVVPGGAEAAFAL--- 78 (142) T ss_pred EEEEEECCCCCHHHHHHHHHHHHH-CCCEEEEEECCCCCEECCCCCEEECCEEECCCCHHHCCEEEECCCCCCHHHH--- T ss_conf 899996699888999999999998-8998999957987077589988805443536896471368843887798886--- Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCH Q ss_conf 77520021212233220597178606403101000001 Q gi|254780971|r 61 AIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLL 98 (219) Q Consensus 61 ~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLl 98 (219) .....+.+.+.++.+.+|||-.||.|-++|..+|+. T Consensus 79 --~~~~~~~~fv~eay~h~KpI~a~~~~~~lL~~agi~ 114 (142) T cd03132 79 --APSGRALHFVTEAFKHGKPIGAVGEGSDLLEAAGIP 114 (142) T ss_pred --CCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHCCCC T ss_conf --448679999999997699799937729999976979 No 118 >cd03138 GATase1_AraC_2 AraC transcriptional regulators having a Type 1 glutamine amidotransferase (GATase1)-like domain. A subgroup of AraC transcriptional regulators having a Type 1 glutamine amidotransferase (GATase1)-like domain. AraC regulators are defined by a AraC-type helix-turn-helix DNA binding domain at their C-terminal. AraC family transcriptional regulators are widespread among bacteria and are involved in regulating diverse and important biological functions, including carbon metabolism, stress responses and virulence in different microorganisms. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with typical GATase1domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=97.14 E-value=0.00014 Score=50.06 Aligned_cols=61 Identities=30% Similarity=0.535 Sum_probs=44.4 Q ss_pred CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHH Q ss_conf 4456478997287345431140677520021212233220597178606403101000001011 Q gi|254780971|r 38 DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGI 101 (219) Q Consensus 38 ~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~ 101 (219) +.+++|.+++||+....+.. .......+.+-+++..+++..+.|||.|.-+|.+.|||.|. T Consensus 66 ~~~~~D~livpg~~~~~~~~---~~~~~~~l~~~L~~~~~~g~~i~sictGa~~LA~aGLL~g~ 126 (195) T cd03138 66 DVPAPDLVIVPGLGGDPDEL---LLADNPALIAWLRRQHANGATVAAACTGVFLLAEAGLLDGR 126 (195) T ss_pred HCCCCCEEEECCCCCCCCCC---CCCCCHHHHHHHHHHHHHCCEEECCCHHHHHHHHCCCCCCC T ss_conf 56656889975766786301---23369999999998665271784165899999973663799 No 119 >TIGR01383 not_thiJ DJ-1 family protein; InterPro: IPR006287 These sequences represent the DJ-1 clade of the so-called ThiJ/PfpI family of proteins. PfpI, represented by IPR006286 from INTERPRO, is a putative intracellular cysteine protease. DJ-1 is described as an oncogene that acts cooperatively with H-Ras . Many members of the DJ-1 clade are annotated (apparently incorrectly) as ThiJ (thiamine pyridinylase), a protein of thiamine biosynthesis. However, published reports of ThiJ activity and identification of a ThiJ/ThiD bifunctional protein describe an unrelated locus mapping near ThiM, rather than the DJ-1 homolog of Escherichia coli. The ThiJ designation for this family may be spurious; the cited paper refers to a locus near thiD and thiM in E. coli, unlike the gene represented here. Current public annotation reflects ThiJ/ThiD bifunctional activity, apparently a property of ThiD and not of this locus. This group of proteins are classified as either DJ-1 putative peptidases or non-peptidase homologs in MEROPS peptidase family C56 (clan PC(C)). . Probab=96.97 E-value=0.00071 Score=45.64 Aligned_cols=56 Identities=29% Similarity=0.485 Sum_probs=45.0 Q ss_pred CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCH Q ss_conf 4456478997287345431140677520021212233220597178606403101000001 Q gi|254780971|r 38 DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLL 98 (219) Q Consensus 38 ~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLl 98 (219) +...||+||||||+ .||-.+.-...+.+.|++|.++|+++=.||-.==+|-+.+.+ T Consensus 63 ~~~~fD~ivLPGG~-----pGa~nL~~S~~l~~~lk~~~~~gkl~AAICAaP~vll~~~~l 118 (186) T TIGR01383 63 DDEEFDLIVLPGGM-----PGAENLRDSKLLENLLKKQESKGKLVAAICAAPIVLLAAGVL 118 (186) T ss_pred CCCCCCEEEECCCC-----HHHHHHHCCHHHHHHHHHHHHCCCCEEEEHHHHHHHHHCCCC T ss_conf 77776778936982-----266642025789999999997499388420458986302100 No 120 >COG3442 Predicted glutamine amidotransferase [General function prediction only] Probab=96.87 E-value=0.0017 Score=43.26 Aligned_cols=147 Identities=24% Similarity=0.429 Sum_probs=78.5 Q ss_pred CCCEEE--EECCC-CCCCCCCEEEECCCCCCCCCCCHHHHHHHH--HHHHHHHHHHHCCCEEEEECCCHHHHEE------ Q ss_conf 996399--82154-644564789972873454311406775200--2121223322059717860640310100------ Q gi|254780971|r 26 GQSPIL--VWQSD-TDIPDVDLIVIPGGFSYGDYLRCGAIAART--PVMQAIKKKAQQGIKVMGICNGFQILVE------ 94 (219) Q Consensus 26 ~~~~~~--v~~~~-~~l~~~d~lvipGGFSygD~l~aG~i~~~~--~~~~~i~~~~~~g~~vLGICNGfQiL~e------ 94 (219) +..+.+ |...| ...+++|++++.||= || .-+++++. .....+++.+++|+|+|.||.|+|+|=+ T Consensus 34 gi~v~i~~vsl~d~~~~~~~Dl~~~GGgq---D~--eQ~i~t~d~~~k~~~l~~~i~~g~p~laiCgg~QlLG~yY~~a~ 108 (250) T COG3442 34 GIKVEIVEVSLTDTFPDDSYDLYFLGGGQ---DY--EQEIATRDLLTKKEGLKDAIENGKPVLAICGGYQLLGQYYETAS 108 (250) T ss_pred CCCEEEEEEECCCCCCCCCCCEEEECCCH---HH--HHHHHHHHHCCCCHHHHHHHHCCCCEEEECCCHHHCCCEEECCC T ss_conf 98169999634789874440279951720---78--88887521001037799998669958998255533040321689 Q ss_pred ------CCCHHHHHCCC-CCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEEC Q ss_conf ------00010110124-43324542257675252357764137996899862013302330867741154225036531 Q gi|254780971|r 95 ------LNLLPGILMRN-CSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRY 167 (219) Q Consensus 95 ------lGLlPg~l~~N-~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y 167 (219) +|||- .-+.| ...||+-. +.+.++-+. ..+ ++.+.=-=-| -||-+.+++ .+.-+.|++-| T Consensus 109 G~ri~GlGiLd-~~T~~~~~~R~IGd---iv~~~~~~~--e~~--~et~~GFENH-~GrT~L~~d----~~pLG~Vv~G~ 175 (250) T COG3442 109 GTRIDGLGILD-HYTENPQTKRFIGD---IVIENTLAG--EEF--GETLVGFENH-GGRTYLGPD----VKPLGKVVYGY 175 (250) T ss_pred CCEEECCCCEE-EEECCCCCCCEEEE---EEEECCCCH--HHH--CCEEEEEECC-CCCEECCCC----CCCCEEEEECC T ss_conf 84761322112-34126666504536---886214565--773--8804625437-881341898----76460078866 Q ss_pred CCCCCCCCCCCCEEEEECCCCCEEEECCCC Q ss_conf 688888988112668886999899984884 Q gi|254780971|r 168 ASGTNPNGSLHDIAGVINRRGNVLGMMPHP 197 (219) Q Consensus 168 ~d~~NPNGS~~~IAgi~s~~G~vlgmMPHP 197 (219) .+| ++. .-.|+ --+|++|.-.|- T Consensus 176 --GNn--~eD-~~eG~--~ykn~~aTY~HG 198 (250) T COG3442 176 --GNN--GED-GTEGA--HYKNVIATYFHG 198 (250) T ss_pred --CCC--CCC-CCCCE--EEEEEEEEEECC T ss_conf --777--554-66423--452047875117 No 121 >COG4285 Uncharacterized conserved protein [Function unknown] Probab=96.85 E-value=0.049 Score=33.89 Aligned_cols=175 Identities=22% Similarity=0.297 Sum_probs=92.2 Q ss_pred CEEEEEECCCCC--CHHHHHHHHHHHCC--CCEEEEECC----CCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 968998449706--47999999998549--963998215----4644564789972873454311406775200212122 Q gi|254780971|r 1 MKTAIVQIPGLN--RDNDMIKAITKIIG--QSPILVWQS----DTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAI 72 (219) Q Consensus 1 mkvaVl~~pGsN--cd~e~~~A~~~~~~--~~~~~v~~~----~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i 72 (219) |+|.|-.-+|+. |=+-+.+.+...+- .++..|..+ +-=.....+||+||| .|..-.- .-+......+ T Consensus 1 m~VlVYn~~GvSp~~lkhtv~sLr~~~~p~y~v~~V~~~~Li~EpW~~~T~lLV~pGG---aDlpY~~--~l~g~g~a~i 75 (253) T COG4285 1 MNVLVYNGLGVSPYSLKHTVRSLRLFAPPYYAVDRVDAQFLIKEPWEETTLLLVFPGG---ADLPYVQ--VLQGLGTARI 75 (253) T ss_pred CCEEEECCCCCCHHHHHHHHHHHHHHCCCHHEEEEEEEHEEECCCCHHCEEEEEECCC---CCCHHHH--HHCCHHHHHH T ss_conf 9069967899881899999999986443111477753010005855102079993288---7743877--7521225558 Q ss_pred HHHHHCCCEEEEECCCHHHHE-----------------ECCCHHHH--------HCCCCCCCEEEEEEEEEECCCHHHHH Q ss_conf 332205971786064031010-----------------00001011--------01244332454225767525235776 Q gi|254780971|r 73 KKKAQQGIKVMGICNGFQILV-----------------ELNLLPGI--------LMRNCSLKFVCKQVLLEVVNSNTAFT 127 (219) Q Consensus 73 ~~~~~~g~~vLGICNGfQiL~-----------------elGLlPg~--------l~~N~s~rf~~r~~~~~v~~~~s~~~ 127 (219) +.+.++|+--||||-|---=+ +|+..||+ +.-|.+.. .|-..+.+. T Consensus 76 ~~yvk~GG~fLGiCAG~YFg~~~veF~~p~~~~vvgkRdL~fFpGT~~GP~y~gF~Y~S~~G--aRaa~l~~~------- 146 (253) T COG4285 76 KNYVKEGGNFLGICAGGYFGSAYVEFAEPTGIEVVGKRDLGFFPGTARGPAYAGFSYNSESG--ARAAPLKFN------- 146 (253) T ss_pred HHHHHCCCEEEEEECCCCCCCEEEEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCC--CEEEEEEEC------- T ss_conf 78875587089983255435347998358996346501114257765787668701037665--133346417------- Q ss_pred HHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHC Q ss_conf 41379968998620133023308677411542250365316888889881126688869998999848843432 Q gi|254780971|r 128 KSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENII 201 (219) Q Consensus 128 ~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~ 201 (219) ++. + .+.+--.-|-++.+.|.+.. -.|..||.+-. |-+.+|--.+=.+|+|+=.=||||-.. T Consensus 147 -d~~-~---~~~~~FNGG~~F~~aE~~~~----v~I~ArY~e~~---~~pAAIV~~~vgkG~vvLsGpH~Ey~p 208 (253) T COG4285 147 -DFL-G---DCYAYFNGGGYFEDAENYPN----VEIEARYEELP---GKPAAIVSCTVGKGLVVLSGPHPEYLP 208 (253) T ss_pred -CCC-C---CEEEEECCCEEEECCCCCCC----CEEEEEHHCCC---CCCEEEEEEEECCCCEEEECCCHHHCH T ss_conf -876-2---05799748468852678998----27998630079---984069999956851897368811153 No 122 >KOG1559 consensus Probab=96.78 E-value=0.0064 Score=39.53 Aligned_cols=189 Identities=20% Similarity=0.315 Sum_probs=88.0 Q ss_pred EEEEECCCCCCHHHH---------HHH---HHHHCCCCEEEEECCCC------CCCCCCEEEECCCCCC-CCCCCHHHHH Q ss_conf 899844970647999---------999---99854996399821546------4456478997287345-4311406775 Q gi|254780971|r 3 TAIVQIPGLNRDNDM---------IKA---ITKIIGQSPILVWQSDT------DIPDVDLIVIPGGFSY-GDYLRCGAIA 63 (219) Q Consensus 3 vaVl~~pGsNcd~e~---------~~A---~~~~~~~~~~~v~~~~~------~l~~~d~lvipGGFSy-gD~l~aG~i~ 63 (219) ++||.-||-|-+--| +.. +-..+|..++-+..++. .+.-+.++++|||..- |||- .++ T Consensus 55 IGIL~hpg~g~~~rl~n~t~~~yIAASYVK~aEsgGARViPli~nepEe~lfqklelvNGviftGGwak~~dY~---~vv 131 (340) T KOG1559 55 IGILSHPGDGASGRLKNATGRSYIAASYVKLAESGGARVIPLIYNEPEEILFQKLELVNGVIFTGGWAKRGDYF---EVV 131 (340) T ss_pred EEEECCCCCCCCCEECCCCCCCHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCEEEECCCCCCCCCHH---HHH T ss_conf 68860578882320136667301288899998718955887754880899999998736057417432224099---999 Q ss_pred HHHHHHHHHHHHHH--CCCEEEEECCCHHHHEECCCH-HHHHCCCCCCCEEEEEEEEEECCCH--------HHHHHHCCC Q ss_conf 20021212233220--597178606403101000001-0110124433245422576752523--------577641379 Q gi|254780971|r 64 ARTPVMQAIKKKAQ--QGIKVMGICNGFQILVELNLL-PGILMRNCSLKFVCKQVLLEVVNSN--------TAFTKSYKM 132 (219) Q Consensus 64 ~~~~~~~~i~~~~~--~g~~vLGICNGfQiL~elGLl-Pg~l~~N~s~rf~~r~~~~~v~~~~--------s~~~~~~~~ 132 (219) ..+++.+.+..+ +-.||-|||-||.+|+-+=-- -..|++-++...-+....+.-.+.. .-.++.+.. T Consensus 132 --kkifnk~le~nDaGehFPvyg~CLGFE~lsmiISqnrdile~~d~vd~AssLqF~~nvn~~~t~FQrFPpELLkkL~~ 209 (340) T KOG1559 132 --KKIFNKVLERNDAGEHFPVYGICLGFELLSMIISQNRDILERFDAVDVASSLQFVGNVNIHGTMFQRFPPELLKKLST 209 (340) T ss_pred --HHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHCCCCCCCCEEEECCCCEEEHHHHHCCHHHHHHHCC T ss_conf --999988876067866362124345389999998557068775032021353023035540011575489999987352 Q ss_pred CCEEEEEEECCCE----EEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCC Q ss_conf 9689986201330----2330867741154225036531688888988112668886999899984884343221006 Q gi|254780971|r 133 NQIIKCPVAHHDG----NYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHG 206 (219) Q Consensus 133 ~~~l~~piaHgEG----rf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~ 206 (219) +.|.+. -|--| +|..++ .|+. -=.|+.+-.|.+ |..=|.-+-+..=-|.|.+=|||+.-..|-. T Consensus 210 -dcLvmq-~Hk~gisp~nF~~N~-~Ls~---FFnilTT~~D~~----~k~fvSTv~~~kYPvtgfQWHPEKnafEWgs 277 (340) T KOG1559 210 -DCLVMQ-NHKFGISPKNFQGNP-ALSS---FFNILTTCTDGN----SKTFVSTVESKKYPVTGFQWHPEKNAFEWGS 277 (340) T ss_pred -CHHEEE-CCCCCCCHHHCCCCH-HHHH---HHHHEEEECCCC----CEEEEEEECCEECCCEEEEECCCCCCCCCCC T ss_conf -311000-332344623045798-8997---876533301798----5079885203012410045667557432355 No 123 >TIGR01823 PabB-fungal para-aminobenzoate synthase; InterPro: IPR010117 This entry represents the fungal clade of para-aminobenzoate synthase, which acts on chorismate to form para-aminobenzoic acid (PABA), a precursor of folate.. Probab=96.77 E-value=0.00055 Score=46.38 Aligned_cols=49 Identities=29% Similarity=0.351 Sum_probs=35.9 Q ss_pred CCCCCCEEEECCCCCCCCCCCHH---HHHHHHHHHHHHHHHH---HCCCEEEEECCCHHHHEE Q ss_conf 44564789972873454311406---7752002121223322---059717860640310100 Q gi|254780971|r 38 DIPDVDLIVIPGGFSYGDYLRCG---AIAARTPVMQAIKKKA---QQGIKVMGICNGFQILVE 94 (219) Q Consensus 38 ~l~~~d~lvipGGFSygD~l~aG---~i~~~~~~~~~i~~~~---~~g~~vLGICNGfQiL~e 94 (219) .|..||.||+. .|.| ..|....+++.|=+.+ .+-.||||||-|||-|+= T Consensus 57 ~l~~FdAiVvG--------PGPG~PN~GA~d~gI~~~~w~l~~Gk~D~~Pv~GICLGFQslcL 111 (851) T TIGR01823 57 LLLLFDAIVVG--------PGPGDPNVGAKDVGILREVWELANGKLDEVPVLGICLGFQSLCL 111 (851) T ss_pred HHHHCCEEEEC--------CCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHH T ss_conf 87417858988--------87767765114446889999732895313551100156899998 No 124 >TIGR00379 cobB cobyrinic acid a,c-diamide synthase; InterPro: IPR004484 This family describes cobyrinic acid a,c-diamide synthase, the cobB (cbiA in Salmonella) protein of cobalamin biosynthesis. It is responsible for the amidation of cobyrinic acid to cobyrinic acid a,c-diamide via the intermediate formation of cobyrinic acid c-monoamide or hydrogenobrynic acid. NH(2) groups are provided by glutamine and one molecule of ATP hydrogenolyzed for each amidation .; GO: 0042242 cobyrinic acid ac-diamide synthase activity, 0009236 cobalamin biosynthetic process. Probab=96.53 E-value=0.01 Score=38.19 Aligned_cols=122 Identities=26% Similarity=0.369 Sum_probs=72.9 Q ss_pred CEEEEEECCCCCCH-HHHHHHHHHHCCCCEEE-EECCCCCCC-CCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 96899844970647-99999999854996399-821546445-6478997287345431140677520021212233220 Q gi|254780971|r 1 MKTAIVQIPGLNRD-NDMIKAITKIIGQSPIL-VWQSDTDIP-DVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQ 77 (219) Q Consensus 1 mkvaVl~~pGsNcd-~e~~~A~~~~~~~~~~~-v~~~~~~l~-~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~ 77 (219) .|+||.+-+-=|== .|-..+|++.+ ++.+. -=-.|..|| |+|+|=|.|||= .+=|...-.+..+.++|++|.. T Consensus 252 ~kiava~D~aFnFYY~dNld~L~~~~-a~l~ffspL~D~eLPPdvD~lYiGGGfP---ElFae~L~~N~~l~~~i~~f~~ 327 (464) T TIGR00379 252 VKIAVARDQAFNFYYQDNLDALEENA-AELVFFSPLEDKELPPDVDALYIGGGFP---ELFAEELEKNEKLRDSIKEFIQ 327 (464) T ss_pred EEEEEEEECCCCHHHHHHHHHHHHCC-CEEEEECCCCCCCCCCCCCEEEECCCCH---HHHHHHHHHHHHHHHHHHHHHH T ss_conf 37899973321403577899997478-2798517766877971215765078852---4536533444899999999973 Q ss_pred CCCEEEEECCCHHHHEE------------CCCHHHHHCCCCCCCEEEE-EEEEEECCCHHHHHHH Q ss_conf 59717860640310100------------0001011012443324542-2576752523577641 Q gi|254780971|r 78 QGIKVMGICNGFQILVE------------LNLLPGILMRNCSLKFVCK-QVLLEVVNSNTAFTKS 129 (219) Q Consensus 78 ~g~~vLGICNGfQiL~e------------lGLlPg~l~~N~s~rf~~r-~~~~~v~~~~s~~~~~ 129 (219) +|.|+-|=|.|-==|++ +|+||-+-. =.+|+++. ++.-++.+ ++.|++. T Consensus 328 ~G~P~YGECGGLMYL~~Sl~~~Eg~~~PM~g~LP~~~~--M~~~~Q~lgY~~a~~l~-~~~i~~~ 389 (464) T TIGR00379 328 QGRPIYGECGGLMYLSESLDNVEGEIFPMVGLLPLAAK--MTGRVQGLGYVVAEVLK-DCLIAKK 389 (464) T ss_pred CCCEEEEECCCHHHHCCCCCCCCCEEECCCCCCCCCEE--ECCEECCCHHHHHHHHH-HHHHHHH T ss_conf 78838985060443200101589705421131387314--14310001032232332-1256652 No 125 >KOG3179 consensus Probab=96.36 E-value=0.0058 Score=39.82 Aligned_cols=67 Identities=28% Similarity=0.300 Sum_probs=41.7 Q ss_pred CCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCC Q ss_conf 4644564789972873454311406775200212122332205971786064031010000010110124433 Q gi|254780971|r 36 DTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGILMRNCSL 108 (219) Q Consensus 36 ~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~ 108 (219) +.+|++|++++|-| |--|...-.-|.- .+...+++....-++|||||=|-||.+.. ..|...+|..+ T Consensus 54 ~~Dl~ky~gfvIsG--S~~dAf~d~dWI~--KLcs~~kkld~mkkkvlGICFGHQiiara--~Gg~Vgra~KG 120 (245) T KOG3179 54 EEDLEKYDGFVISG--SKHDAFSDADWIK--KLCSFVKKLDFMKKKVLGICFGHQIIARA--KGGKVGRAPKG 120 (245) T ss_pred HHHHHHHCEEEEEC--CCCCCCCCCHHHH--HHHHHHHHHHHHCCCEEEEECCHHHHHHH--HCCCCCCCCCC T ss_conf 44653415499957--7433335406999--99999999875156647882117999886--17811027888 No 126 >KOG0370 consensus Probab=96.08 E-value=0.019 Score=36.48 Aligned_cols=71 Identities=20% Similarity=0.389 Sum_probs=51.2 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEE Q ss_conf 9999999854996399821546445647899728734543114067752002121223322059717860640310100 Q gi|254780971|r 16 DMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVE 94 (219) Q Consensus 16 e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~e 94 (219) ...+-|.+.+..-+++-|.-+.+-.+||+|.|-+| .|-=.....+.+.+++..+.++|+.|||-|-|+|.. T Consensus 185 N~IRcL~~RGa~vtVvPw~~~i~~~~yDGlflSNG--------PGdPe~~~~~v~~vr~lL~~~~PvfGIClGHQllA~ 255 (1435) T KOG0370 185 NQIRCLVKRGAEVTVVPWDYPIAKEEYDGLFLSNG--------PGDPELCPLLVQNVRELLESNVPVFGICLGHQLLAL 255 (1435) T ss_pred HHHHHHHHHCCEEEEECCCCCCCCCCCCEEEEECC--------CCCCHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHH T ss_conf 99999998495699903776220125561797279--------999056589999999998279985897366699998 No 127 >cd03141 GATase1_Hsp31_like Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein (EcHsp31). This group includes EcHsp31 and Saccharomyces cerevisiae Ydr533c protein. EcHsp31 has chaperone activity. Ydr533c is upregulated in response to various stress conditions along with the heat shock family. EcHsp31 coordinates a metal ion using a 2-His-1-carboxylate motif present in various ions that use iron as a cofactor such as Carboxypeptidase A. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1 domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For EcHsp31, this Cys together with a different His and, an Asp (rather than a Glu) residue form a different Probab=95.46 E-value=0.013 Score=37.53 Aligned_cols=56 Identities=25% Similarity=0.417 Sum_probs=40.9 Q ss_pred CCCCCCCEEEECCCCCCCCCCCHHH-HHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCH Q ss_conf 6445647899728734543114067-7520021212233220597178606403101000001 Q gi|254780971|r 37 TDIPDVDLIVIPGGFSYGDYLRCGA-IAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLL 98 (219) Q Consensus 37 ~~l~~~d~lvipGGFSygD~l~aG~-i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLl 98 (219) .+.++||+|++|||- ++-. +.....+.+-++++.++|++|-.||.|-..|....+- T Consensus 86 v~~~~ydai~~pGG~------g~m~dl~~~~~l~~l~~~~~~~gk~vaaiCHGp~~L~~~~~~ 142 (221) T cd03141 86 VDPSDYDAIFIPGGH------GPMFDLPDNPDLQDLLREFYENGKVVAAVCHGPAALLNVKLS 142 (221) T ss_pred CCHHHCCEEEECCCC------CHHHHHHCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHHCCC T ss_conf 998788389989985------458774416999999999997498599976307988750056 No 128 >cd03133 GATase1_ES1 Type 1 glutamine amidotransferase (GATase1)-like domain found in zebrafish ES1. Type 1 glutamine amidotransferase (GATase1)-like domain found in zebrafish ES1. This group includes, proteins similar to ES1, Escherichia coli enhancing lycopene biosynthesis protein 2, Azospirillum brasilense iaaC and, human HES1. The catalytic triad typical of GATase1domains is not conserved in this GATase1-like domain. However, in common with GATase1domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Zebrafish ES1 is expressed specifically in adult photoreceptor cells and appears to be a cytoplasmic protein. A. brasilense iaaC is involved in controlling IAA biosynthesis. Probab=95.09 E-value=0.015 Score=37.13 Aligned_cols=58 Identities=24% Similarity=0.401 Sum_probs=37.9 Q ss_pred CCCCCCCEEEECCCCCCCCCCCH----HHHHH-HHHHHHHHHHHHHCCCEEEEECCCHHHHEE Q ss_conf 64456478997287345431140----67752-002121223322059717860640310100 Q gi|254780971|r 37 TDIPDVDLIVIPGGFSYGDYLRC----GAIAA-RTPVMQAIKKKAQQGIKVMGICNGFQILVE 94 (219) Q Consensus 37 ~~l~~~d~lvipGGFSygD~l~a----G~i~~-~~~~~~~i~~~~~~g~~vLGICNGfQiL~e 94 (219) .+-.|||.|||||||--.-.|.. |.-.. +..+.+.+++|.+.+||+=-||-.--++.+ T Consensus 78 ~~~~d~DaLiiPGGfGaAKNLs~fA~~g~~~~v~~~V~~~i~~f~~a~KPiG~iCIAP~l~a~ 140 (213) T cd03133 78 LKAADFDALIFPGGFGAAKNLSDFAVKGADCTVNPEVERLVREFHQAGKPIGAICIAPALAAK 140 (213) T ss_pred CCHHHCCEEEECCCHHHHHHHHHHHCCCCCCEECHHHHHHHHHHHHCCCCEEEECHHHHHHHH T ss_conf 998678989978851455531002105887569899999999999849999997578999999 No 129 >PRK11780 isoprenoid biosynthesis protein with amidotransferase-like domain; Provisional Probab=94.74 E-value=0.025 Score=35.74 Aligned_cols=57 Identities=26% Similarity=0.360 Sum_probs=36.5 Q ss_pred CCCCCCEEEECCCCCCCCCCCHHHH----HH-HHHHHHHHHHHHHCCCEEEEECCCHHHHEE Q ss_conf 4456478997287345431140677----52-002121223322059717860640310100 Q gi|254780971|r 38 DIPDVDLIVIPGGFSYGDYLRCGAI----AA-RTPVMQAIKKKAQQGIKVMGICNGFQILVE 94 (219) Q Consensus 38 ~l~~~d~lvipGGFSygD~l~aG~i----~~-~~~~~~~i~~~~~~g~~vLGICNGfQiL~e 94 (219) +..|||.|||||||--.-.|..-+. .. +..+.+.+++|.+.++|+=-||----++.+ T Consensus 82 ~~~d~DaLI~PGGfGaAKNLs~fA~~G~~~~V~p~V~~~i~~f~~a~KPiG~iCIaP~l~a~ 143 (217) T PRK11780 82 DAEDFDALIVPGGFGAAKNLSNFAFKGAECSVDPEVKALARAFHQAGKPIGFICIAPAMLPK 143 (217) T ss_pred CHHHCCEEEECCCCHHHHCCCCHHHCCCCCEECHHHHHHHHHHHHHCCCEEEECHHHHHHHH T ss_conf 97567879967862044311016534998648789999999999809998986076899999 No 130 >KOG2764 consensus Probab=94.53 E-value=0.045 Score=34.13 Aligned_cols=82 Identities=23% Similarity=0.332 Sum_probs=49.5 Q ss_pred CCCCEEEECCCCCCCCCCCHHHHHH-HHHHHHHHHHHHHCCCEEEEECCCHHHHEEC-CCHHHHHCCC--------CCCC Q ss_conf 5647899728734543114067752-0021212233220597178606403101000-0010110124--------4332 Q gi|254780971|r 40 PDVDLIVIPGGFSYGDYLRCGAIAA-RTPVMQAIKKKAQQGIKVMGICNGFQILVEL-NLLPGILMRN--------CSLK 109 (219) Q Consensus 40 ~~~d~lvipGGFSygD~l~aG~i~~-~~~~~~~i~~~~~~g~~vLGICNGfQiL~el-GLlPg~l~~N--------~s~r 109 (219) ..||++|||||- +.+.-.+ ...+.+-++++.++++++-.||-|=-++.-. |+..|.-..- ..+. T Consensus 66 ~~yDviilPGG~------~g~e~L~~~~~v~~lvK~q~~~gkLIaaICaap~~al~a~gl~~gkk~T~~ps~k~~L~~~g 139 (247) T KOG2764 66 SKYDVIILPGGL------PGAETLSECEKVVDLVKEQAESGKLIAAICAAPLTALAAHGLLGGKKCTAHPSVKPKLEEGG 139 (247) T ss_pred CCCCEEEECCCC------HHHHHHHHCHHHHHHHHHHHHCCCEEEEEECCHHHHHHHCCCCCCCEEEECCCHHHHHHHCC T ss_conf 566689946996------05666530688999999987529969986205698974165556614553410556676368 Q ss_pred EEEEEEEEEECCCHHHHHHH Q ss_conf 45422576752523577641 Q gi|254780971|r 110 FVCKQVLLEVVNSNTAFTKS 129 (219) Q Consensus 110 f~~r~~~~~v~~~~s~~~~~ 129 (219) | .|+.-+|+.+-++.|+. T Consensus 140 y--~yve~~vv~dG~liTSr 157 (247) T KOG2764 140 Y--KYVEPRVVKDGNLITSR 157 (247) T ss_pred C--EEECCCEEEECCEEECC T ss_conf 3--77438768707188636 No 131 >COG4977 Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain [Transcription] Probab=94.42 E-value=0.04 Score=34.45 Aligned_cols=92 Identities=24% Similarity=0.526 Sum_probs=56.3 Q ss_pred EEEEEECCC-CCCHHH----HHHHHHHHCCCCEEEEEC---CC------------------CCCCCCCEEEECCCCCCCC Q ss_conf 689984497-064799----999999854996399821---54------------------6445647899728734543 Q gi|254780971|r 2 KTAIVQIPG-LNRDND----MIKAITKIIGQSPILVWQ---SD------------------TDIPDVDLIVIPGGFSYGD 55 (219) Q Consensus 2 kvaVl~~pG-sNcd~e----~~~A~~~~~~~~~~~v~~---~~------------------~~l~~~d~lvipGGFSygD 55 (219) +++++.+|+ +.+... ..++-.+..+...+ .|. .+ ...+.+|.++.++|+ ++ T Consensus 12 ~~~~ll~p~f~l~~fa~~ve~lr~An~~~~~~~~-~w~~~s~~g~~V~ss~G~~i~~d~~~~~~~~~~~v~v~~g~--~~ 88 (328) T COG4977 12 RFGFLLLPNFSLMAFASAVEPLRAANRLAGRSLY-VWSIVSADGGPVRSSSGLSIAPDGGLEAAPPIDILPVCGGL--GP 88 (328) T ss_pred EEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCCC-CEEEEECCCCCCCCCCCCEEECCCCCCCCCCCEEEEEECCC--CC T ss_conf 7899982897325655317899986664146533-34886168887636898267168764556763299980687--75 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHH Q ss_conf 114067752002121223322059717860640310100000101 Q gi|254780971|r 56 YLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPG 100 (219) Q Consensus 56 ~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg 100 (219) -.....+.+...+++.++.|..+-|||.|.-+|.+.|||-| T Consensus 89 ----~~~~~~~~l~~~Lr~~~~~G~~l~gictGaf~LA~aGLLdG 129 (328) T COG4977 89 ----ERPVNAPALLAWLRRAARRGARLGGLCTGAFVLAEAGLLDG 129 (328) T ss_pred ----CCCCCHHHHHHHHHHHHHCCCEEEEEHHHHHHHHHHCCCCC T ss_conf ----44456499999999998469958886086999997115379 No 132 >cd03144 GATase1_ScBLP_like Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Saccharomyces cerevisiae biotin-apoprotein ligase (ScBLP). Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Saccharomyces cerevisiae biotin-apoprotein ligase (ScBLP). Biotin-apoprotein ligase modifies proteins by covalently attaching biotin. ScBLP is known to biotinylate acety-CoA carboxylase and pyruvate carboxylase. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, the Cys residue found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow in a typical GATase1 domain is conserved. Probab=94.37 E-value=0.11 Score=31.64 Aligned_cols=83 Identities=20% Similarity=0.323 Sum_probs=47.0 Q ss_pred EEEEEECCCCC--CHHHHHHHHHHHCCCCEEEEECCC----CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 68998449706--479999999985499639982154----644564789972873454311406775200212122332 Q gi|254780971|r 2 KTAIVQIPGLN--RDNDMIKAITKIIGQSPILVWQSD----TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKK 75 (219) Q Consensus 2 kvaVl~~pGsN--cd~e~~~A~~~~~~~~~~~v~~~~----~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~ 75 (219) +|.|-+-||+- |-+.+.+.+... -++..+..++ --....+++|+||| .|..-+-+ -...-...|++| T Consensus 1 nVLVY~g~G~s~~~vk~~~~~L~~~--~av~~v~~~~L~~~pw~~~~~llv~PGG---~d~~y~~~--l~~~g~~~i~~f 73 (114) T cd03144 1 NVLVYNGPGASPGSLKHLAELLRLY--LAVSTVTADELAVGPWESKTALLVVPGG---ADLPYCRA--LNGKGNRRIRNF 73 (114) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHH--CCEEECCHHHHHCCCCCCCEEEEEECCC---CCCHHHHH--HCCHHHHHHHHH T ss_conf 9799878998989999999999874--3645406999844954267359994487---67678888--453667999999 Q ss_pred HHCCCEEEEECCCHHH Q ss_conf 2059717860640310 Q gi|254780971|r 76 AQQGIKVMGICNGFQI 91 (219) Q Consensus 76 ~~~g~~vLGICNGfQi 91 (219) .++|+.-||||-|.-. T Consensus 74 V~~GG~ylG~CaGaY~ 89 (114) T cd03144 74 VRNGGNYLGICAGAYL 89 (114) T ss_pred HHCCCCEEEECCCCCC T ss_conf 9759947998553103 No 133 >pfam04204 HTS Homoserine O-succinyltransferase. Probab=94.36 E-value=0.17 Score=30.43 Aligned_cols=135 Identities=13% Similarity=0.185 Sum_probs=78.5 Q ss_pred CCCCCEEEECCC----CCCCCCCCHHHHHHHHHHHHHHHHHHHCCC-EEEEECCCHHHHEEC--CCHHHHHCCCCCCCEE Q ss_conf 456478997287----345431140677520021212233220597-178606403101000--0010110124433245 Q gi|254780971|r 39 IPDVDLIVIPGG----FSYGDYLRCGAIAARTPVMQAIKKKAQQGI-KVMGICNGFQILVEL--NLLPGILMRNCSLKFV 111 (219) Q Consensus 39 l~~~d~lvipGG----FSygD~l~aG~i~~~~~~~~~i~~~~~~g~-~vLGICNGfQiL~el--GLlPg~l~~N~s~rf~ 111 (219) -..|||+|+-|- ..|.|+- .+.. +.+|.+.++.-. ..|.||=|.|....- |+=.-.|-+-.+|-|. T Consensus 96 ~~~~DGlIITGAPvE~l~FeeV~------YW~E-l~~I~dWs~~~v~Stl~iCWaAqAaL~~~ygI~K~~l~~KlfGVf~ 168 (298) T pfam04204 96 DEKFDGLIITGAPVEHLPFEDVD------YWEE-LTEIMDWSKTNVTSTLHICWGAQAALYHFYGIPKHLLPEKLFGVFR 168 (298) T ss_pred HCCCCEEEECCCCCCCCCCCCCC------CHHH-HHHHHHHHHHCCCCEEEECHHHHHHHHHHHCCCCCCCCCCEEEEEC T ss_conf 58875699728987778855374------1999-9999999986254404574999999999819980017874588512 Q ss_pred EEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE-EECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCCE Q ss_conf 422576752523577641379968998620133023-3086774115422503653168888898811266888699989 Q gi|254780971|r 112 CKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY-FIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGNV 190 (219) Q Consensus 112 ~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf-~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~v 190 (219) .+ +.+.++|+++++. +.+.+|.+ || .++.+.++ ...++-++-+.+ ..++.-+.++++|. T Consensus 169 h~-----~~~~~~pLlrGfd--d~f~~PhS----R~t~i~~~~i~--~~~~L~iLa~s~-------~~G~~l~~~~d~r~ 228 (298) T pfam04204 169 HR-----VLDPHSPLLRGFD--DEFYAPHS----RHTEVRREDIE--AVPGLEILAESE-------EAGVYLVASKDGRQ 228 (298) T ss_pred CC-----CCCCCCCCCCCCC--CCCCCCCC----CCCCCCHHHHH--CCCCCEEEECCC-------CCCEEEEECCCCCE T ss_conf 34-----5789980104787--65306742----33469988971--378847986778-------77617998289886 Q ss_pred EEECCCCHHH Q ss_conf 9984884343 Q gi|254780971|r 191 LGMMPHPENI 200 (219) Q Consensus 191 lgmMPHPER~ 200 (219) +=++-|||-- T Consensus 229 vf~~GH~EYd 238 (298) T pfam04204 229 VFVTGHPEYD 238 (298) T ss_pred EEEECCCCCC T ss_conf 8882786667 No 134 >PRK05368 homoserine O-succinyltransferase; Provisional Probab=94.20 E-value=0.17 Score=30.46 Aligned_cols=135 Identities=13% Similarity=0.194 Sum_probs=78.6 Q ss_pred CCCCCCEEEECCC----CCCCCCCCHHHHHHHHHHHHHHHHHHHC-CCEEEEECCCHHHHEE--CCCHHHHHCCCCCCCE Q ss_conf 4456478997287----3454311406775200212122332205-9717860640310100--0001011012443324 Q gi|254780971|r 38 DIPDVDLIVIPGG----FSYGDYLRCGAIAARTPVMQAIKKKAQQ-GIKVMGICNGFQILVE--LNLLPGILMRNCSLKF 110 (219) Q Consensus 38 ~l~~~d~lvipGG----FSygD~l~aG~i~~~~~~~~~i~~~~~~-g~~vLGICNGfQiL~e--lGLlPg~l~~N~s~rf 110 (219) +-..|||+|+-|- ..|.|+ ..+.. +.+|.+.++. -...|.||=|.|.... -|+=.-.|-+-.+|-| T Consensus 96 k~~~~DGlIITGAPvE~l~Fe~V------~YW~E-l~~I~dws~~~v~stl~iCWaA~AaL~~~ygI~K~~l~~K~fGVf 168 (302) T PRK05368 96 KDQKFDGLIVTGAPVELLPFEDV------DYWDE-LREILDWAKTHVTSTLFICWAAQAALYHLYGIPKYTLPEKLSGVF 168 (302) T ss_pred HHCCCCEEEECCCCCCCCCCCCC------CCHHH-HHHHHHHHHHCCCCEEEECHHHHHHHHHHHCCCCEECCCCEEEEE T ss_conf 65877669972898777885537------41999-999999998626522567499999999970997100687358751 Q ss_pred EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE-EECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCC Q ss_conf 5422576752523577641379968998620133023-308677411542250365316888889881126688869998 Q gi|254780971|r 111 VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY-FIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGN 189 (219) Q Consensus 111 ~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf-~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~ 189 (219) ..+ +.+..+|+++++. +.+.+|.+ || .++.+.+++ ..++-++-..+.. | +.-+.++++| T Consensus 169 ~h~-----~~~~~~pL~rGfd--d~f~~PhS----R~tei~~~~i~~--~~~L~ILa~s~e~---G----~~l~~s~~~r 228 (302) T PRK05368 169 EHR-----VLVPHHPLLRGFD--DSFLVPHS----RYTEVREEDIRA--ATGLEILAESEEA---G----VYLFASKDKR 228 (302) T ss_pred CCC-----CCCCCCHHHCCCC--CCCCCCCC----CCCCCCHHHHHH--CCCCEEEECCCCC---C----CEEEECCCCC T ss_conf 223-----4689980013787--65306741----223689999950--8896798636777---7----1699828988 Q ss_pred EEEECCCCHH Q ss_conf 9998488434 Q gi|254780971|r 190 VLGMMPHPEN 199 (219) Q Consensus 190 vlgmMPHPER 199 (219) .+=++-|||- T Consensus 229 ~vfi~GH~EY 238 (302) T PRK05368 229 IVFVTGHPEY 238 (302) T ss_pred EEEEECCCCC T ss_conf 6888278666 No 135 >PRK05282 peptidase E; Validated Probab=93.40 E-value=0.34 Score=28.54 Aligned_cols=87 Identities=15% Similarity=0.322 Sum_probs=60.1 Q ss_pred EEEEEECCCCCCHHH-----HHHHHHHHCCCCEEEEECCC---CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 689984497064799-----99999985499639982154---6445647899728734543114067752002121223 Q gi|254780971|r 2 KTAIVQIPGLNRDND-----MIKAITKIIGQSPILVWQSD---TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIK 73 (219) Q Consensus 2 kvaVl~~pGsNcd~e-----~~~A~~~~~~~~~~~v~~~~---~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~ 73 (219) |+.-|-|-+...++| +..||.+ .+..+.-++..+ ..+.+.|+|.+.||=+| .-=+.+.+..+.+.++ T Consensus 33 ~ilFIPyA~~~~~~d~Yt~~v~~af~~-lg~~v~gih~~~dp~~AI~~Ad~I~vgGGNTF----~Ll~~L~~~gl~~~ir 107 (233) T PRK05282 33 KAVFIPYAGVTQSWDDYTAKVAEVLAP-LGIEVTGIHRVEDPVAAIENADGIIVGGGNTF----QLLKQLYERGLLAPIR 107 (233) T ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHH-CCCEEEEEECCCCHHHHHHHCCEEEECCCCHH----HHHHHHHHCCCHHHHH T ss_conf 599975888879889999999999986-69828876245898999974997998697399----9999999858299999 Q ss_pred HHHHCCCEEEEECCCHHHHE Q ss_conf 32205971786064031010 Q gi|254780971|r 74 KKAQQGIKVMGICNGFQILV 93 (219) Q Consensus 74 ~~~~~g~~vLGICNGfQiL~ 93 (219) +..++|+|-+|.--|.-|.+ T Consensus 108 ~~V~~G~pYiG~SAGsnia~ 127 (233) T PRK05282 108 EAVKNGTPYIGWSAGANVAC 127 (233) T ss_pred HHHHCCCCEEEECCHHHHCC T ss_conf 99984997796264154057 No 136 >cd03146 GAT1_Peptidase_E Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E. Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E. This group contains proteins similar to the aspartyl dipeptidases Salmonella typhimurium peptidase E and Xenopus laevis peptidase E. In bacteria peptidase E is believed to play a role in degrading peptides generated by intracellular protein breakdown or imported into the cell as nutrient sources. Peptidase E uniquely hydrolyses only Asp-X dipeptides (where X is any amino acid), and one tripeptide Asp-Gly-Gly. Peptidase E is believed to be a serine peptidase having a Ser-His-Glu catalytic triad which differs from the Cys-His-Glu catalytic triad typical of GATase1 domains by having a Ser in place of the reactive Cys at the nucleophile elbow. Xenopus PepE is developmentally regulated in response to thyroid hormone and, it is thought to play a role in apoptosis during tail reabsorption. Probab=93.27 E-value=0.24 Score=29.43 Aligned_cols=75 Identities=16% Similarity=0.194 Sum_probs=54.7 Q ss_pred HHHHHHHHCCCCEEEEECCC-----CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHH Q ss_conf 99999985499639982154-----6445647899728734543114067752002121223322059717860640310 Q gi|254780971|r 17 MIKAITKIIGQSPILVWQSD-----TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQI 91 (219) Q Consensus 17 ~~~A~~~~~~~~~~~v~~~~-----~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQi 91 (219) ...||.++.+.+...++.-+ ..|.+.|+|.+.|| +..+.-+.+....+.+.+++..++|.++.|.--|.-+ T Consensus 51 ~~~~f~~l~~~~~~~l~~~~~~d~~~~l~~ad~I~v~GG----nt~~ll~~~~~~~l~~~l~~~~~~G~~~~G~SAGa~~ 126 (212) T cd03146 51 FYAAFESLRGVEVSHLHLFDTEDPLDALLEADVIYVGGG----NTFNLLAQWREHGLDAILKAALERGVVYIGWSAGSNC 126 (212) T ss_pred HHHHHHCCCCCEEEEEECCCCCCHHHHHHHCCEEEECCC----CHHHHHHHHHHCCHHHHHHHHHHCCCEEEEECCCHHH T ss_conf 999861336962568752486788999974999998897----6999999999789899999999779889998615776 Q ss_pred HEEC Q ss_conf 1000 Q gi|254780971|r 92 LVEL 95 (219) Q Consensus 92 L~el 95 (219) ..+. T Consensus 127 ~~~~ 130 (212) T cd03146 127 WFPS 130 (212) T ss_pred HCCC T ss_conf 1677 No 137 >cd03145 GAT1_cyanophycinase Type 1 glutamine amidotransferase (GATase1)-like domain found in cyanophycinase. Type 1 glutamine amidotransferase (GATase1)-like domain found in cyanophycinase. This group contains proteins similar to the extracellular cyanophycinases from Pseudomonas anguilliseptica BI (CphE) and Synechocystis sp. PCC 6803 CphB. Cyanophycinases are intracellular exopeptidases which hydrolyze the polymer cyanophycin (multi L-arginyl-poly-L-aspartic acid) to the dipeptide beta-Asp-Arg. Cyanophycinase is believed to be a serine-type exopeptidase having a Ser-His-Glu catalytic triad which differs from the Cys-His-Glu catalytic triad typical of GATase1 domains by having a Ser in place of the reactive Cys at the nucleophile elbow. Probab=92.64 E-value=0.49 Score=27.49 Aligned_cols=94 Identities=16% Similarity=0.304 Sum_probs=63.8 Q ss_pred EEEEEECCCCC-CHH--HHHHHHHHHCCCCEEEEECCC----------CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHH Q ss_conf 68998449706-479--999999985499639982154----------64456478997287345431140677520021 Q gi|254780971|r 2 KTAIVQIPGLN-RDN--DMIKAITKIIGQSPILVWQSD----------TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPV 68 (219) Q Consensus 2 kvaVl~~pGsN-cd~--e~~~A~~~~~~~~~~~v~~~~----------~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~ 68 (219) |+.|+-....- +++ +...+|++++..++.++...+ ..|.+.|+|.+.|| |-.+.-..|..+++ T Consensus 31 ri~viptAs~~~~~~~~~~~~~f~~lG~~~v~~l~~~~r~~a~~~~~~~~l~~a~~i~~~GG----~q~~~~~~~~~t~~ 106 (217) T cd03145 31 RIVVIPAASEEPAEVGEEYRDVFERLGAREVEVLVIDSREAANDPEVVARLRDADGIFFTGG----DQLRITSALGGTPL 106 (217) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCHHHCCCHHHHHHHHCCCEEEECCC----CHHHHHHHHHCCCH T ss_conf 59998588788599999999999983999505982268354489899999964998999389----99999999963969 Q ss_pred HHHHHHHHHCCCEEEEECCCHHHHEECCCHH Q ss_conf 2122332205971786064031010000010 Q gi|254780971|r 69 MQAIKKKAQQGIKVMGICNGFQILVELNLLP 99 (219) Q Consensus 69 ~~~i~~~~~~g~~vLGICNGfQiL~elGLlP 99 (219) .+.+++..++|.++.|.--|.-+|.+.-+.. T Consensus 107 ~~~l~~~~~~G~vi~G~SAGA~~~~~~~i~~ 137 (217) T cd03145 107 LDALRKVYRGGVVIGGTSAGAAVMSDTMIAG 137 (217) T ss_pred HHHHHHHHHCCCEEEECCHHHHHHHHHCCCC T ss_conf 9999999986994884117888745421026 No 138 >cd03147 GATase1_Ydr533c_like Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein. This group includes proteins similar to S. cerevisiae Ydr533c. Ydr533c is upregulated in response to various stress conditions along with the heat shock family. The catalytic triad typical of GATase1domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. This Cys together with a different His and Glu residue form a different catalytic triad from the typical GATase1domain. Ydr533c protein is a homodimer. Probab=92.41 E-value=0.091 Score=32.18 Aligned_cols=53 Identities=23% Similarity=0.442 Sum_probs=38.6 Q ss_pred CCCCCCCCCEEEECCCCCCCCCCCHHHHH--H-HHHHHHHHHHHHHCCCEEEEECCCHHHHEEC Q ss_conf 54644564789972873454311406775--2-0021212233220597178606403101000 Q gi|254780971|r 35 SDTDIPDVDLIVIPGGFSYGDYLRCGAIA--A-RTPVMQAIKKKAQQGIKVMGICNGFQILVEL 95 (219) Q Consensus 35 ~~~~l~~~d~lvipGGFSygD~l~aG~i~--~-~~~~~~~i~~~~~~g~~vLGICNGfQiL~el 95 (219) ++.+.++||+|.+|||. |.++ . +..+-+-+.++.++|+++..||.|--.|+.+ T Consensus 88 ~~v~~~~YdaVf~pGGh--------G~m~Dl~~~~~l~~ll~~~~~~gk~vaaVCHGpaaLl~~ 143 (231) T cd03147 88 DEVNPDDYGIFFVAGGH--------GTLFDFPHATNLQKIAQQIYANGGVVAAVCHGPAILANL 143 (231) T ss_pred HHCCHHHCEEEEECCCC--------CCCCCCCCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHC T ss_conf 68898782599988988--------530324239999999999997599799980688997616 No 139 >COG1348 NifH Nitrogenase subunit NifH (ATPase) [Inorganic ion transport and metabolism] Probab=90.42 E-value=0.44 Score=27.80 Aligned_cols=88 Identities=28% Similarity=0.379 Sum_probs=52.3 Q ss_pred CCCCCCH-H---HHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCC------CCCCC---CHHHHH---HHHHHHHH Q ss_conf 4970647-9---9999999854996399821546445647899728734------54311---406775---20021212 Q gi|254780971|r 8 IPGLNRD-N---DMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFS------YGDYL---RCGAIA---ARTPVMQA 71 (219) Q Consensus 8 ~pGsNcd-~---e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFS------ygD~l---~aG~i~---~~~~~~~~ 71 (219) +||.-|- | .+..-+++++.++.. -|.-+-|+=+=|.+|||+ |.|+. .+|.+. |-..+.+. T Consensus 91 epGvGCAGRGVitai~~Le~lgaf~~~----~DvviyDVLGDVVCGGFAmPiReg~AdeiyIVtSge~MalYAANNI~kg 166 (278) T COG1348 91 EPGVGCAGRGVITAINLLEELGAFEED----LDVVIYDVLGDVVCGGFAMPIREGYADEIYIVTSGEMMALYAANNIAKG 166 (278) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCC----CCEEEEECCCCEEECCEEEEHHCCCCCEEEEEECCCHHHHHHHHHHHHH T ss_conf 999884651199999999981873012----8789995357734746000200266637999956835899998899999 Q ss_pred HHHHHHCCCEEEE--ECCCHHHHEECCCHH Q ss_conf 2332205971786--064031010000010 Q gi|254780971|r 72 IKKKAQQGIKVMG--ICNGFQILVELNLLP 99 (219) Q Consensus 72 i~~~~~~g~~vLG--ICNGfQiL~elGLlP 99 (219) ++++++.+..-|| |||.=+.-.|..+++ T Consensus 167 i~k~a~~~~~rLgGiIcNsR~~~~e~e~v~ 196 (278) T COG1348 167 IRKYAKTGGVRLGGIICNSRSVDRERELVE 196 (278) T ss_pred HHHHHHCCCCCEEEEEECCCCCCCHHHHHH T ss_conf 999873379631457745777643799999 No 140 >KOG0026 consensus Probab=90.33 E-value=0.92 Score=25.75 Aligned_cols=58 Identities=19% Similarity=0.331 Sum_probs=31.9 Q ss_pred CEEEEECCCC-------CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEEC Q ss_conf 6399821546-------4456478997287345431140677520021212233220597178606403101000 Q gi|254780971|r 28 SPILVWQSDT-------DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVEL 95 (219) Q Consensus 28 ~~~~v~~~~~-------~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~el 95 (219) ..+.|.++|. .+.--.+++-||- |--+ .+.+..+-...+.--.|++|||-|-|-+.|. T Consensus 44 ~~~~VyRNDeiTV~El~~~NP~~LliSPGP---------G~P~-DsGIs~~~i~~f~~~iP~fGvCMGlQCi~e~ 108 (223) T KOG0026 44 CHFEVYRNDELTVEELKRKNPRGLLISPGP---------GTPQ-DSGISLQTVLELGPLVPLFGVCMGLQCIGEA 108 (223) T ss_pred CCEEEEECCCCCHHHHHHCCCCCEEECCCC---------CCCC-CCCCHHHHHHHHCCCCCEEEEEECHHHHHHH T ss_conf 717998467311788862598807856999---------8976-4542489999857878556564003444565 No 141 >cd03148 GATase1_EcHsp31_like Type 1 glutamine amidotransferase (GATase1)-like domain found in Escherichia coli Hsp31 protein (EcHsp31). Type 1 glutamine amidotransferase (GATase1)-like domain found in Escherichia coli Hsp31 protein (EcHsp31). This group includes proteins similar to EcHsp31. EcHsp31 has chaperone activity. EcHsp31 coordinates a metal ion using a 2-His-1-carboxylate motif present in various ions that use iron as a cofactor such as Carboxypeptidase A. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. This Cys together with a different His and, an Asp (rather than a Glu) residue form a different catalytic triad from the typical GATase1 domain. EcHsp31 is a homodimer. Probab=89.82 E-value=0.37 Score=28.27 Aligned_cols=53 Identities=23% Similarity=0.408 Sum_probs=38.6 Q ss_pred CCCCCCCEEEECCCCCCCCCCCHHHHHH---HHHHHHHHHHHHHCCCEEEEECCCHHHHEECCC Q ss_conf 6445647899728734543114067752---002121223322059717860640310100000 Q gi|254780971|r 37 TDIPDVDLIVIPGGFSYGDYLRCGAIAA---RTPVMQAIKKKAQQGIKVMGICNGFQILVELNL 97 (219) Q Consensus 37 ~~l~~~d~lvipGGFSygD~l~aG~i~~---~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGL 97 (219) .+..+||+|.+|||- |.++- +..+-+-+.++.++|+++-.||.|--.|....+ T Consensus 92 ~d~~~Ydavf~pGGH--------G~m~Dl~~~~~l~~ll~~~~~~gk~vaaVCHGpaaLl~~~~ 147 (232) T cd03148 92 ADDSEYAAVFIPGGH--------GALIGIPESQDVAAALQWAIKNDRFVITLCHGPAAFLAARH 147 (232) T ss_pred CCCCCCEEEEECCCC--------CCCCCCCCCHHHHHHHHHHHHCCCEEEEECCHHHHHHCCCC T ss_conf 784222289967997--------53057757999999999999859979998674798740206 No 142 >cd03129 GAT1_Peptidase_E_like Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E_like proteins. Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E_like proteins. This group contains proteins similar to the aspartyl dipeptidases Salmonella typhimurium peptidase E and Xenopus laevis peptidase E and, extracellular cyanophycinases from Pseudomonas anguilliseptica BI (CphE) and Synechocystis sp. PCC 6803 CphB. In bacteria peptidase E is believed to play a role in degrading peptides generated by intracellular protein breakdown or imported into the cell as nutrient sources. Peptidase E uniquely hydrolyses only Asp-X dipeptides (where X is any amino acid), and one tripeptide Asp-Gly-Gly. Cyanophycinases are intracellular exopeptidases which hydrolyze the polymer cyanophycin (multi L-arginyl-poly-L-aspartic acid) to the dipeptide beta-Asp-Arg. Peptidase E and cyanophycinases are thought to have a Ser-His-Glu catalytic triad which differs from Probab=88.99 E-value=2.1 Score=23.50 Aligned_cols=89 Identities=20% Similarity=0.369 Sum_probs=61.5 Q ss_pred EEEEEECCCCC-CHH--HHHHHHHHHCCCCEEEEECCC--------CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 68998449706-479--999999985499639982154--------6445647899728734543114067752002121 Q gi|254780971|r 2 KTAIVQIPGLN-RDN--DMIKAITKIIGQSPILVWQSD--------TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQ 70 (219) Q Consensus 2 kvaVl~~pGsN-cd~--e~~~A~~~~~~~~~~~v~~~~--------~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~ 70 (219) |+.+|-..... .++ ....+|.++ +.+...++..+ ..+.+.|+|.+.|| |..+.-..|..+++.+ T Consensus 31 ~i~~iptAs~~~~~~~~~~~~~~~~l-G~~~~~l~~~~~a~~~~~~~~l~~ad~i~~~GG----~~~~~~~~~~~t~~~~ 105 (210) T cd03129 31 RVLFIPTASGDRDEYGEEYRAAFERL-GVEVVHLLLIDTANDPDVVARLLEADGIFVGGG----NQLRLLSVLRETPLLD 105 (210) T ss_pred EEEEEECCCCCCHHHHHHHHHHHHHC-CCEEEEEECCCCCCCHHHHHHHHHCCEEEECCC----CHHHHHHHHHHCCHHH T ss_conf 49999489888089999999999974-994887512256787169999971999998898----8999999997488999 Q ss_pred HHHHHHHCCCEEEEECCCHHHHEEC Q ss_conf 2233220597178606403101000 Q gi|254780971|r 71 AIKKKAQQGIKVMGICNGFQILVEL 95 (219) Q Consensus 71 ~i~~~~~~g~~vLGICNGfQiL~el 95 (219) .+++..++|.++.|.--|.-++.+. T Consensus 106 ~l~~~~~~G~v~~G~SAGa~~~~~~ 130 (210) T cd03129 106 AILKRVARGVVIGGTSAGAAVMGET 130 (210) T ss_pred HHHHHHHCCCEEEEECHHHHHCCCC T ss_conf 9999998499099735578862876 No 143 >PRK04155 chaperone protein HchA; Provisional Probab=87.97 E-value=0.48 Score=27.59 Aligned_cols=56 Identities=18% Similarity=0.236 Sum_probs=37.8 Q ss_pred CCCCEEEECCCCCCCCCCCHHHHHHHHH-HHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHH Q ss_conf 5647899728734543114067752002-1212233220597178606403101000001011 Q gi|254780971|r 40 PDVDLIVIPGGFSYGDYLRCGAIAARTP-VMQAIKKKAQQGIKVMGICNGFQILVELNLLPGI 101 (219) Q Consensus 40 ~~~d~lvipGGFSygD~l~aG~i~~~~~-~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~ 101 (219) .+|++|.||||- ++-.....++ +-+.++.+.++|++++-+|.|=--|....+=+|. T Consensus 147 sdY~AVFiPGGH------G~M~dLP~s~~l~~iL~~a~~~~k~v~alCHGPAALLaa~~~~g~ 203 (288) T PRK04155 147 SDYAAVFIPGGH------GALIGIPESEDVKAALQWALDNDRFIITLCHGPAALLAAGVDHED 203 (288) T ss_pred CCCEEEEECCCC------CCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCHHHHHHCCCCCCC T ss_conf 882489936897------554678889899999999997399899981787988712047998 No 144 >COG4242 CphB Cyanophycinase and related exopeptidases [Secondary metabolites biosynthesis, transport, and catabolism / Inorganic ion transport and metabolism] Probab=87.14 E-value=0.84 Score=26.00 Aligned_cols=73 Identities=25% Similarity=0.381 Sum_probs=54.3 Q ss_pred HHHHHHHCCCCEEEEECC---C-------CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECC Q ss_conf 999998549963998215---4-------644564789972873454311406775200212122332205971786064 Q gi|254780971|r 18 IKAITKIIGQSPILVWQS---D-------TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICN 87 (219) Q Consensus 18 ~~A~~~~~~~~~~~v~~~---~-------~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICN 87 (219) .++|+.++-.++.++..+ + .++.+.+++.+.|| |-+|--..++.+|+++.+....+.|..+=|.-- T Consensus 73 ~rife~~gv~~v~ildir~R~~a~~s~~~~~v~~a~gIfftGG----DQ~ri~~~lkdTpl~~~ir~r~r~G~avgGTSA 148 (293) T COG4242 73 IRIFEMMGVEEVQILDIRNREDASSSDIVAKVENATGIFFTGG----DQLRIIGSLKDTPLMAAIRQRVRRGIAVGGTSA 148 (293) T ss_pred HHHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHCCEEEEECC----CCEEEEEECCCCHHHHHHHHHHHCCCEECCCCC T ss_conf 5688873622168986422013653779999872866998258----602443101478789999999735845436652 Q ss_pred CHHHHEE Q ss_conf 0310100 Q gi|254780971|r 88 GFQILVE 94 (219) Q Consensus 88 GfQiL~e 94 (219) |.-+|.. T Consensus 149 GAavM~~ 155 (293) T COG4242 149 GAAVMSD 155 (293) T ss_pred CHHHCCC T ss_conf 3444278 No 145 >PRK13235 nifH nitrogenase reductase; Reviewed Probab=84.76 E-value=0.77 Score=26.25 Aligned_cols=21 Identities=19% Similarity=0.216 Sum_probs=15.4 Q ss_pred EECHHHHHHHHHCCEEEEECC Q ss_conf 308677411542250365316 Q gi|254780971|r 148 FIDAKGLAEIEKNNQIVFRYA 168 (219) Q Consensus 148 ~~~~~~l~~l~~~~~i~~~Y~ 168 (219) +-.+..+.+-+.++|.++.|. T Consensus 210 IPr~~~V~~Ae~~~~tVie~~ 230 (274) T PRK13235 210 VPRDNMVQRAEINRKTVIDYD 230 (274) T ss_pred CCCCHHHHHHHHCCCEEEEEC T ss_conf 589458899987397789978 No 146 >TIGR02069 cyanophycinase cyanophycinase; InterPro: IPR011811 This entry describes both cytosolic and extracellular cyanophycinases. The former are part of a system in many Cyanobacteria and a few other species of generating and later utilizing a storage polymer for nitrogen, carbon, and energy, called cyanophycin. The latter are found in species such as Pseudomonas anguilliseptica that can use external cyanophycin. The polymer has a backbone of L-aspartic acid, with most Asp side chain carboxyl groups attached to L-arginine.; GO: 0016787 hydrolase activity, 0044260 cellular macromolecule metabolic process. Probab=83.85 E-value=3.1 Score=22.35 Aligned_cols=53 Identities=25% Similarity=0.517 Sum_probs=48.6 Q ss_pred CCCCCCEEEECCCCCCCCCCCHHHHHH-----HHHHHHHHHHHHHCC-CEEEEECCCHHHHEE Q ss_conf 445647899728734543114067752-----002121223322059-717860640310100 Q gi|254780971|r 38 DIPDVDLIVIPGGFSYGDYLRCGAIAA-----RTPVMQAIKKKAQQG-IKVMGICNGFQILVE 94 (219) Q Consensus 38 ~l~~~d~lvipGGFSygD~l~aG~i~~-----~~~~~~~i~~~~~~g-~~vLGICNGfQiL~e 94 (219) .|.++++|.|.|| |=||--++.. .+|+++.|++..++| ..+.|.=-|.-+|.+ T Consensus 99 ~~~~a~gIFFtGG----DQlRits~l~tHqGGdTpl~~~lr~r~~~G~~~~~GTSAGAaVMs~ 157 (297) T TIGR02069 99 LLSNATGIFFTGG----DQLRITSLLGTHQGGDTPLLDRLRKRVEEGKIILGGTSAGAAVMSD 157 (297) T ss_pred HHHHCCEEEEECC----HHHHHHHHHHCCCCCCCHHHHHHHHHHHCCEEEEEECCCCHHHCCC T ss_conf 8741977998474----4899999862158898768999998863875899964730222889 No 147 >PRK06108 aspartate aminotransferase; Provisional Probab=83.85 E-value=0.69 Score=26.54 Aligned_cols=58 Identities=19% Similarity=0.236 Sum_probs=41.4 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCC Q ss_conf 996899862013302330867741154225036531688888988112------6688869998 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGN 189 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~ 189 (219) ...++.+|....+++|..|.+.+++......-++=+|.|+||.|.... |+.+|.+.+- T Consensus 129 g~~~~~vpl~~~~~~~~~d~~~le~~~~~~~k~iil~~P~NPtG~v~s~e~l~~l~~la~~~~v 192 (382) T PRK06108 129 GARVVCVPLQYGGGGWQLDVDRLLAAITPRTRALFINSPNNPTGWTASRDDQQAILAHCRRHGL 192 (382) T ss_pred CCEEEECCCCCCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCHHCCHHHHHHHHHHHHCCCC T ss_conf 8827853456667886679999996457566499988996996620447889999998762697 No 148 >pfam08532 Glyco_hydro_42M Beta-galactosidase trimerisation domain. This is non catalytic domain B of beta-galactosidase enzymes belong to the glycosyl hydrolase 42 family. This domain is related to glutamine amidotransferase enzymes, but the catalytic residues are replaced by non functional amino acids. This domain is involved in trimerisation. Probab=83.43 E-value=2.4 Score=23.14 Aligned_cols=75 Identities=20% Similarity=0.294 Sum_probs=47.4 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCE-EEEECCCHH---HH Q ss_conf 99999985499639982154644564789972873454311406775200212122332205971-786064031---01 Q gi|254780971|r 17 MIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIK-VMGICNGFQ---IL 92 (219) Q Consensus 17 ~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~-vLGICNGfQ---iL 92 (219) ..+||.+ .|..+.+|. .+.+|++|++||.|+=+ + ....+.+.+++|+++|+. ++|-=.|.- .. T Consensus 35 ~Y~aL~~-~gv~vDiv~-~~~dls~YklvvvP~l~-----~------~~~~~~~~L~~yv~~GG~lv~g~rsG~~de~~~ 101 (207) T pfam08532 35 WYRALRR-LGIPVDVVP-PDADLSGYKLVVAPMLY-----M------VSEELAKKLEAYVKNGGTLVTTYRSGIVDENDR 101 (207) T ss_pred HHHHHHH-CCCCEEEEC-CCCCCCCCCEEEEEEEE-----E------ECHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC T ss_conf 9999998-699677704-76881219599985578-----8------099999999999967998999536751979980 Q ss_pred EECCCHHHHHCC Q ss_conf 000001011012 Q gi|254780971|r 93 VELNLLPGILMR 104 (219) Q Consensus 93 ~elGLlPg~l~~ 104 (219) +-.++.||.|.. T Consensus 102 i~~~~~Pg~L~~ 113 (207) T pfam08532 102 IPLGGYPGPLRE 113 (207) T ss_pred CCCCCCCCCCHH T ss_conf 877989962054 No 149 >pfam03575 Peptidase_S51 Peptidase family S51. Probab=82.86 E-value=4.3 Score=21.47 Aligned_cols=76 Identities=18% Similarity=0.239 Sum_probs=55.2 Q ss_pred HHHHHHHHHCCCCEEEEECCC-------CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 999999985499639982154-------6445647899728734543114067752002121223322059717860640 Q gi|254780971|r 16 DMIKAITKIIGQSPILVWQSD-------TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 16 e~~~A~~~~~~~~~~~v~~~~-------~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNG 88 (219) ....||++ .+.++..++... ..+.+.|+|.+.|| +...-=+.+..+.+.+.+++..++|++..|.--| T Consensus 4 ~~~~~f~~-lg~~v~~l~~~~~~~~~~~~~i~~ad~I~v~GG----nt~~ll~~l~~~g~~~~l~~~~~~G~~~~G~SAG 78 (154) T pfam03575 4 KFREALEK-LGLEVSGLHLFTPSVEDIEAKILKADVIYVGGG----NTFHLLKLLRETGLDDIIREAVQAGLPYIGWSAG 78 (154) T ss_pred HHHHHHHH-CCCEEEEEECCCCCHHHHHHHHHHCCEEEECCC----HHHHHHHHHHHCCCHHHHHHHHHCCCEEEEECHH T ss_conf 99999997-699778986667974729999985999998987----5999999999868299999999859778850604 Q ss_pred HHHHEECC Q ss_conf 31010000 Q gi|254780971|r 89 FQILVELN 96 (219) Q Consensus 89 fQiL~elG 96 (219) .-++.... T Consensus 79 a~i~~~~i 86 (154) T pfam03575 79 ANVAGPSI 86 (154) T ss_pred HHHCCCCC T ss_conf 66434773 No 150 >PRK03708 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=80.47 E-value=5.1 Score=21.02 Aligned_cols=96 Identities=23% Similarity=0.270 Sum_probs=45.4 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHH---CCCCEEEEECC---------C--CCCC--CCCEEEECCCCCCCCCCCHHHHHH Q ss_conf 968998449706479999999985---49963998215---------4--6445--647899728734543114067752 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKI---IGQSPILVWQS---------D--TDIP--DVDLIVIPGGFSYGDYLRCGAIAA 64 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~---~~~~~~~v~~~---------~--~~l~--~~d~lvipGGFSygD~l~aG~i~~ 64 (219) |||+|+.=++.---.+.+..+.++ -|.++.+-... . ..+. +.|+++.-|| | |.+.. T Consensus 1 MKigIv~n~~k~~a~~~a~~l~~~L~~~g~~v~ld~~~~~~~~~~~~~~~~~~~~~~~Dlvi~lGG----D----GT~L~ 72 (278) T PRK03708 1 MRFGIVARRDKEEALKLAYRVYDFLKVSGYEVVVDSDTYEHLPQFSEEDVLPLEEFDVDFILAIGG----D----GTILR 72 (278) T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCCCCCCCCCCCCCCCCCCEEEEECC----C----HHHHH T ss_conf 999999218998999999999999998899899974786566655556767543578789999878----6----89999 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCC---HHHHEECCCHHHHHCCCCCCCEE Q ss_conf 002121223322059717860640---31010000010110124433245 Q gi|254780971|r 65 RTPVMQAIKKKAQQGIKVMGICNG---FQILVELNLLPGILMRNCSLKFV 111 (219) Q Consensus 65 ~~~~~~~i~~~~~~g~~vLGICNG---fQiL~elGLlPg~l~~N~s~rf~ 111 (219) + .+......|+|||--| |=.=++..=+..+|.+-..++|. T Consensus 73 ------a-~~~~~~~iPilGiN~G~lGFLt~~~~~~~~~~l~~i~~g~y~ 115 (278) T PRK03708 73 ------I-EHKTKKEIPILSINMGTLGFLTEVEPEDTFFALSRLLEGEYY 115 (278) T ss_pred ------H-HHHCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCE T ss_conf ------9-996478998898835875303556878999999999728974 No 151 >PRK05957 aspartate aminotransferase; Provisional Probab=78.62 E-value=2.1 Score=23.50 Aligned_cols=57 Identities=18% Similarity=0.218 Sum_probs=38.6 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 996899862013302330867741154225036531688888988112------668886999899 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) ...++.+|. +..|..+.+.+++......-++=|+.|+||.|+... |+.+|.+.+-++ T Consensus 134 g~~~v~vp~---d~~~~~~~~~l~~~i~~~~k~iil~~P~NPTG~v~s~~~l~~l~~la~~~~~~i 196 (389) T PRK05957 134 GCQPVLVPT---DENYQLRPEAIEAAITPKTRAIVTISPNNPTGVVYPEALLRAVNQICAEHGIYH 196 (389) T ss_pred CCCEEEEEC---CCCCCCCHHHHHHHCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCEE T ss_conf 993588657---998897999999845102627861899899898876778999998760678189 No 152 >PRK01911 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=78.33 E-value=6.1 Score=20.51 Aligned_cols=73 Identities=27% Similarity=0.434 Sum_probs=39.5 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHH---CCCCEEEEECC-------------------CCCCCCCCEEEECCCCCCCCCCC Q ss_conf 968998449706479999999985---49963998215-------------------46445647899728734543114 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKI---IGQSPILVWQS-------------------DTDIPDVDLIVIPGGFSYGDYLR 58 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~---~~~~~~~v~~~-------------------~~~l~~~d~lvipGGFSygD~l~ 58 (219) ||+||+--+..--..+.++.+.+. -+.+..+ ... +..-.++|+++.-|| | T Consensus 1 MKIaIigk~~~~~~~~~~~~Li~~L~~~g~~v~l-e~~~a~~l~~~~~~~~~~~~~~~~~~~~~Dlvi~iGG----D--- 72 (290) T PRK01911 1 MKIAIFGQTYQASKSPHIKRLFELLEEHGAEIYI-EEEFLNFLTQDLKFEPKYKGFFDGNNFDFDMVISIGG----D--- 72 (290) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEE-EHHHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEECC----C--- T ss_conf 9999988889847999999999999988998999-7689787765125664312321135777779999787----6--- Q ss_pred HHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 067752002121223322059717860640 Q gi|254780971|r 59 CGAIAARTPVMQAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 59 aG~i~~~~~~~~~i~~~~~~g~~vLGICNG 88 (219) |. +..+.+.+...+.|+|||--| T Consensus 73 -GT------~L~a~~~~~~~~iPilGiN~G 95 (290) T PRK01911 73 -GT------FLRAAARVGNSGIPILGINTG 95 (290) T ss_pred -HH------HHHHHHHHHHCCCCEEEEECC T ss_conf -89------999999861259968999448 No 153 >TIGR01287 nifH nitrogenase iron protein; InterPro: IPR005977 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene , . Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes . It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I .; GO: 0016163 nitrogenase activity, 0009399 nitrogen fixation, 0016612 molybdenum-iron nitrogenase complex. Probab=78.17 E-value=1.5 Score=24.40 Aligned_cols=91 Identities=24% Similarity=0.328 Sum_probs=57.2 Q ss_pred CCEE--EECCCCCCC------CC---CCHHHHHH---HHHHHHHHHHHHHCCCEEEE--ECCCHHHHEECCCHHHHHCCC Q ss_conf 4789--972873454------31---14067752---00212122332205971786--064031010000010110124 Q gi|254780971|r 42 VDLI--VIPGGFSYG------DY---LRCGAIAA---RTPVMQAIKKKAQQGIKVMG--ICNGFQILVELNLLPGILMRN 105 (219) Q Consensus 42 ~d~l--vipGGFSyg------D~---l~aG~i~~---~~~~~~~i~~~~~~g~~vLG--ICNGfQiL~elGLlPg~l~~N 105 (219) ||.| |.+|||+-- |. ..||.++| -..+.+=|++++++|+--|| |||.=|+.-|..|+. .|. T Consensus 124 YDVLGDVVCGGFAmP~R~g~A~eiYiVtSge~MAlYAANNI~kGI~kYa~~GGv~LGG~IcN~R~~~~e~El~~-~fA-- 200 (278) T TIGR01287 124 YDVLGDVVCGGFAMPLREGLAQEIYIVTSGEMMALYAANNIAKGILKYAKSGGVRLGGIICNSRNVDDEKELVD-EFA-- 200 (278) T ss_pred EEECCCEEECCCCCHHHCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCEECCEEECCCCCCCHHHHHH-HHH-- T ss_conf 84236556566036100588654888603406789999788777899975388222247871457621789999-999-- Q ss_pred CCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEE Q ss_conf 433245422576752523577641379968998 Q gi|254780971|r 106 CSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKC 138 (219) Q Consensus 106 ~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~ 138 (219) .|.-++-++ .|+.++.+--.-++.-+|++. T Consensus 201 --~~lgtq~i~-~VPrs~~V~~AEl~~~TVIE~ 230 (278) T TIGR01287 201 --KKLGTQLIH-FVPRSNIVQKAELEKKTVIEY 230 (278) T ss_pred --HHHCCEEEE-ECCCCCCCHHHHHCCCCEEEE T ss_conf --873770675-217885212787368845633 No 154 >PRK13233 nifH nitrogenase reductase; Reviewed Probab=77.23 E-value=1.6 Score=24.30 Aligned_cols=18 Identities=17% Similarity=0.233 Sum_probs=11.4 Q ss_pred CHHHHHHHHHCCEEEEEC Q ss_conf 867741154225036531 Q gi|254780971|r 150 DAKGLAEIEKNNQIVFRY 167 (219) Q Consensus 150 ~~~~l~~l~~~~~i~~~Y 167 (219) .+....+-+.++|.++.| T Consensus 213 r~~~V~~Ae~~~~tVie~ 230 (275) T PRK13233 213 RDNIVQKAEFNKKTVVEF 230 (275) T ss_pred CCHHHHHHHHCCCEEEEE T ss_conf 977888778739767998 No 155 >PRK04885 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=77.11 E-value=6 Score=20.55 Aligned_cols=65 Identities=26% Similarity=0.497 Sum_probs=37.4 Q ss_pred CEEEEEECCCCCCHH---HHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 968998449706479---99999998549963998215464456478997287345431140677520021212233220 Q gi|254780971|r 1 MKTAIVQIPGLNRDN---DMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQ 77 (219) Q Consensus 1 mkvaVl~~pGsNcd~---e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~ 77 (219) ||++|+ +|.+. +++..+.+......+ +. +..+.|+++.-|| | |.+ ..+++++.. T Consensus 1 mk~~iv----~~~~~~s~~~~~~~~~~l~~~~~---~~--~~~~~Dlvi~iGG----D----GT~------L~a~~~~~~ 57 (265) T PRK04885 1 MKVAII----SNGDEKSKRVASKLKKYLKDFGF---IL--DEKNPDIVISVGG----D----GTL------LSAFHRYEN 57 (265) T ss_pred CEEEEE----ECCCHHHHHHHHHHHHHHHHCCC---CC--CCCCCCEEEEECC----C----HHH------HHHHHHHHC T ss_conf 969999----69698999999999999987698---55--8778999999887----3----999------999998630 Q ss_pred C--CCEEEEECCC Q ss_conf 5--9717860640 Q gi|254780971|r 78 Q--GIKVMGICNG 88 (219) Q Consensus 78 ~--g~~vLGICNG 88 (219) . +.|+|||--| T Consensus 58 ~~~~vPilGIN~G 70 (265) T PRK04885 58 QLDKVRFVGVHTG 70 (265) T ss_pred CCCCCCEEEEECC T ss_conf 3679758987358 No 156 >PRK07337 aminotransferase; Validated Probab=77.10 E-value=3.6 Score=21.95 Aligned_cols=157 Identities=18% Similarity=0.214 Sum_probs=75.5 Q ss_pred HHHHHHHCCCCEEEEECCCCCCCCCCE-------EEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC--CCE----EEE Q ss_conf 999998549963998215464456478-------9972873454311406775200212122332205--971----786 Q gi|254780971|r 18 IKAITKIIGQSPILVWQSDTDIPDVDL-------IVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ--GIK----VMG 84 (219) Q Consensus 18 ~~A~~~~~~~~~~~v~~~~~~l~~~d~-------lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~--g~~----vLG 84 (219) ++.+++ .|.+.+..|..+-+++--.. -+--|=|.|+...+ ...+++++.++.++ |.. =+- T Consensus 22 a~~~~~-~G~dvi~l~~g~pdf~~p~~i~~a~~~~~~~~~~~Y~~~~G------~~~lreaia~~~~~~~g~~i~pe~I~ 94 (388) T PRK07337 22 AQALER-AGRDIIHMGIGEPDFTAPEPVVEAAARALRRGVTQYTSALG------LAPLREAIAAHYARRFGLDVAPERIV 94 (388) T ss_pred HHHHHH-CCCCEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCC------CHHHHHHHHHHHHHHHCCCCCHHHEE T ss_conf 999997-79980899799899987999999999998469998899988------79999999999999868999668879 Q ss_pred ECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCC--HHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCE Q ss_conf 06403101000001011012443324542257675252--3577641379968998620133023308677411542250 Q gi|254780971|r 85 ICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNS--NTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQ 162 (219) Q Consensus 85 ICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~--~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~ 162 (219) |++|.|--..+ +--+|+ |.. ..|-+.-+.- -..+++.. .+....+|. ..+.+|..|.+.++......- T Consensus 95 it~G~~~al~~--~~~~l~-~pG-----D~Vlv~~P~Yp~~~~~~~~~-g~~~~~vp~-~~~~~~~~d~~~l~~~~~~~t 164 (388) T PRK07337 95 VTAGASAALLL--ACLALV-ERG-----DEVLMPDPSYPCNRHFVAAA-EGRPVLVPS-GPAERFQLTADDVRTHWGERT 164 (388) T ss_pred ECCCHHHHHHH--HHHHHC-CCC-----CEEEECCCCCHHHHHHHHHC-CCEEEEEEC-CCCCCCCCCHHHHHHHCCCCC T ss_conf 84875999999--999966-999-----98998588736599999973-988999403-710067789799998578655 Q ss_pred EEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 36531688888988112------668886999899 Q gi|254780971|r 163 IVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 163 i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) -++-+|.|+||.|+... |+-+|.+.+-++ T Consensus 165 k~iil~nP~NPtG~v~s~~el~~i~~~a~~~~~~v 199 (388) T PRK07337 165 RGVLLASPSNPTGTSIEPDELRRIVEAVRARGGFT 199 (388) T ss_pred EEEEECCCCCCCCEEEEHHHHHHHHHHHHCCCEEE T ss_conf 39997899798885626777433554431458088 No 157 >PRK08960 hypothetical protein; Provisional Probab=77.02 E-value=2 Score=23.66 Aligned_cols=64 Identities=22% Similarity=0.283 Sum_probs=43.7 Q ss_pred HHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 764137996899862013302330867741154225036531688888988112------668886999899 Q gi|254780971|r 126 FTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 126 ~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) +++.. .+.+..+|+ ..|++|..+.+.++........++-+++|.||.|.... |+-+|.+.+-++ T Consensus 132 ~~~~~-~~~~~~vp~-~~~~~~~~~~~~le~~~~~~~~~iil~nP~NPTG~v~s~~~l~~l~~~a~~~~~~v 201 (387) T PRK08960 132 FLRLV-EGAAQLVPV-GPDTRYQLTPALVERHWDSDSVGALVASPANPTGTLLSRDELAALSQALKARGGHL 201 (387) T ss_pred HHHHH-CCEEEEEEC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEE T ss_conf 99985-374499975-75447788999999727878729997689898863146899999999876239779 No 158 >PRK11249 katE hydroperoxidase II; Provisional Probab=76.75 E-value=6.7 Score=20.22 Aligned_cols=90 Identities=20% Similarity=0.307 Sum_probs=40.5 Q ss_pred EEEEEECCCCCCHH--HHHHHHHHHCCCCEEEEECC-------C-------C-----CCCCCCEEEECCCCCCCCCCCHH Q ss_conf 68998449706479--99999998549963998215-------4-------6-----44564789972873454311406 Q gi|254780971|r 2 KTAIVQIPGLNRDN--DMIKAITKIIGQSPILVWQS-------D-------T-----DIPDVDLIVIPGGFSYGDYLRCG 60 (219) Q Consensus 2 kvaVl~~pGsNcd~--e~~~A~~~~~~~~~~~v~~~-------~-------~-----~l~~~d~lvipGGFSygD~l~aG 60 (219) |||||...|.+.+. .+..|++. .|..+.+|--. + . .---||.+++|||=. ..- T Consensus 572 kvaiL~~dg~d~~~~~~l~~~l~~-~g~~~~iv~p~~g~v~~~~g~~~~~~~~~~~~~Sv~fDAv~v~~g~~-----~~~ 645 (725) T PRK11249 572 KVAILLNDGVDAADLLAILKALKA-KGAHAKLLYPRMGEVTADDGTVLPIAATFAGAPSVTFDAVIVPGGDA-----NIA 645 (725) T ss_pred EEEEEECCCCCHHHHHHHHHHHHH-CCCEEEEEECCCCCEECCCCCEECCCEEECCCCCCEEEEEEECCCHH-----HHH T ss_conf 799997289898999999999996-89989999368650565899876155587689851242689658646-----689 Q ss_pred HHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCC Q ss_conf 7752002121223322059717860640310100000 Q gi|254780971|r 61 AIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNL 97 (219) Q Consensus 61 ~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGL 97 (219) ..+........+.+..+-.|+|-.+-.|-++|..+|+ T Consensus 646 ~l~~~g~a~~fv~eay~H~K~I~~~g~~~~ll~~~~~ 682 (725) T PRK11249 646 DLADNGDARHYLLEAYKHLKPIALAGDARQLLAALKL 682 (725) T ss_pred HHHHCCCHHHHHHHHHHHCCEEEECCCHHHHHHHCCC T ss_conf 8741632899999997504668875557999986499 No 159 >TIGR02086 IPMI_arch 3-isopropylmalate dehydratase, large subunit; InterPro: IPR011826 3-isopropylmalate dehydratase (or isopropylmalate isomerase; 4.2.1.33 from EC) catalyses the stereo-specific isomerisation of 2-isopropylmalate and 3-isopropylmalate, via the formation of 2-isopropylmaleate. This enzyme performs the second step in the biosynthesis of leucine, and is present in most prokaryotes and many fungal species. The prokaryotic enzyme is a heterodimer composed of a large (LeuC) and small (LeuD) subunit, while the fungal form is a monomeric enzyme. Both forms of isopropylmalate are related and are part of the larger aconitase family . Aconitases are mostly monomeric proteins which share four domains in common and contain a single, labile [4Fe-4S] cluster. Three structural domains (1, 2 and 3) are tightly packed around the iron-sulphur cluster, while a fourth domain (4) forms a deep active-site cleft. The prokaryotic enzyme is encoded by two adjacent genes, leuC and leuD, corresponding to aconitase domains 1-3 and 4 respectively , . LeuC does not bind an iron-sulphur cluster. It is thought that some prokaryotic isopropylamalate dehydrogenases can also function as homoaconitase 4.2.1.36 from EC, converting cis-homoaconitate to homoisocitric acid in lysine biosynthesis . Homoaconitase has been identified in higher fungi (mitochondria) and several archaea and one thermophilic species of bacteria, Thermus thermophilus . This entry represents the large subunit of 3-isopropylmalate dehydratase (LeuC), as well as homoaconitase enzymes, from prokaryotes. Homoaconitase, aconitase and 3-isopropylmalate dehydratase have similar overall structures and domain organisation . All are dehydratases that bind a [4Fe-4S]-cluster. Information about related proteins can be found at Protein of the Month: Aconitase .; GO: 0003861 3-isopropylmalate dehydratase activity, 0051539 4 iron 4 sulfur cluster binding, 0009098 leucine biosynthetic process. Probab=75.54 E-value=1.2 Score=25.02 Aligned_cols=44 Identities=36% Similarity=0.618 Sum_probs=29.6 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHC-CCEEE-----EECCCHHHHEECCCH-HH Q ss_conf 311406775200212122332205-97178-----606403101000001-01 Q gi|254780971|r 55 DYLRCGAIAARTPVMQAIKKKAQQ-GIKVM-----GICNGFQILVELNLL-PG 100 (219) Q Consensus 55 D~l~aG~i~~~~~~~~~i~~~~~~-g~~vL-----GICNGfQiL~elGLl-Pg 100 (219) |+.-...-..-+.+-+.+++|+++ |.+++ |||. |||+|.|+. || T Consensus 66 DH~~P~~~v~~A~~Q~~~REF~K~~gI~~f~~~g~GIcH--Q~L~E~Gya~PG 116 (431) T TIGR02086 66 DHVVPAPTVEAAELQKEIREFAKRHGIKVFFDVGEGICH--QVLVEKGYAEPG 116 (431) T ss_pred ECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCEEEE--EEEEECCCCCCC T ss_conf 268899656778877899988876687059987980277--886530624776 No 160 >PRK08362 consensus Probab=74.93 E-value=1.9 Score=23.68 Aligned_cols=59 Identities=15% Similarity=0.248 Sum_probs=40.2 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEEE Q ss_conf 96899862013302330867741154225036531688888988112------6688869998999 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVLG 192 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vlg 192 (219) .+++.+|. +.|+.|..|-+.+++...+.--++-+|.|+||.|.... |+-+|.+.+-++- T Consensus 133 ~~~v~vp~-~~~~~~~~d~~~l~~~i~~~~k~iil~~P~NPtG~v~s~~~l~~i~~~a~~~~~~ii 197 (389) T PRK08362 133 GKPVEVPT-YEENEFRLNVDDLKKYVTEKTRALIINSPNNPTGSVLTKKDLEEIADFAVEHDLMVI 197 (389) T ss_pred CEEEEEEC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCEEEE T ss_conf 98999854-577788769999997454468299984898984746319999999999875495798 No 161 >PRK06575 consensus Probab=74.71 E-value=1.6 Score=24.23 Aligned_cols=59 Identities=19% Similarity=0.200 Sum_probs=40.2 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEEE Q ss_conf 96899862013302330867741154225036531688888988112------6688869998999 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVLG 192 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vlg 192 (219) +..+.++. .-|.+|..|.+.+++...+.--++-+|.|.||.|.... |+.+|.+.++++- T Consensus 137 ~~~v~~~~-~~e~~~~~d~e~l~~~~~~~~k~iil~nP~NPTG~v~~~eel~~i~~~~~k~~~v~i 201 (399) T PRK06575 137 GTPVFVNC-GIENNFKLSAEALERSITDKTKWLIINSPSNPTGASYNFEELENIAKVLRKYPHVNV 201 (399) T ss_pred CEEEEECC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEE T ss_conf 85699415-755465558999986444476799989999998887889999999999874587378 No 162 >TIGR03537 DapC succinyldiaminopimelate transaminase. Note: the detailed information included in the EC:2.6.1.17 record includes the assertions that the enzyme uses the pyridoxal pyrophosphate cofactor, which is consistent with the pfam00155 family, and the assertion that the amino group donor is L-glutamate, which is undetermined for the sequences in this clade. Probab=73.75 E-value=3.5 Score=22.07 Aligned_cols=11 Identities=27% Similarity=0.504 Sum_probs=4.6 Q ss_pred CCCCCCCCCCC Q ss_conf 16888889881 Q gi|254780971|r 167 YASGTNPNGSL 177 (219) Q Consensus 167 Y~d~~NPNGS~ 177 (219) .|.|+||.|+. T Consensus 142 l~nP~NPTG~v 152 (350) T TIGR03537 142 INYPHNPTGAV 152 (350) T ss_pred ECCCCCCCCCC T ss_conf 89998983647 No 163 >pfam06283 ThuA Trehalose utilisation. This family consists of several bacterial ThuA like proteins. ThuA appears to be involved in utilisation of trehalose. The thuA and thuB genes form part of the trehalose/sucrose transport operon thuEFGKAB, which is located on the pSymB megaplasmid. The thuA and thuB genes are induced in vitro by trehalose but not by sucrose and the extent of its induction depends on the concentration of trehalose available in the medium. Probab=73.57 E-value=8.1 Score=19.71 Aligned_cols=140 Identities=19% Similarity=0.236 Sum_probs=74.0 Q ss_pred CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEE----CCCHHHHHCCCCCCCEEE Q ss_conf 6445647899728734543114067752002121223322059717860640310100----000101101244332454 Q gi|254780971|r 37 TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVE----LNLLPGILMRNCSLKFVC 112 (219) Q Consensus 37 ~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~e----lGLlPg~l~~N~s~rf~~ 112 (219) .+|.+||+||+-.. .||.+. ..-.+++.++.++|+-.+|+=.+.--.-+ ..|+.|.+.. .... T Consensus 46 ~~L~~yDvvv~~~t--~g~~~~-------~~q~~a~~~~v~~GgG~vg~H~a~~~~~~w~~y~~liGg~f~~--Hp~~-- 112 (216) T pfam06283 46 ENLAQYDVLVFNGT--TGDELS-------DEQEAALEKYVAEGGGLVGLHSATDAFRNWPEYNRLMGGSFKW--HPAG-- 112 (216) T ss_pred HHHHHCCEEEEECC--CCCCCC-------HHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHCCEECC--CCCC-- T ss_conf 76853899999789--998799-------9999999999985998798565434677888999985860325--7999-- Q ss_pred EEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEE-ECHHHHHHHH----HCCEEEEECCC-CCCCC--CCCCCEEEEE Q ss_conf 225767525235776413799689986201330233-0867741154----22503653168-88889--8811266888 Q gi|254780971|r 113 KQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYF-IDAKGLAEIE----KNNQIVFRYAS-GTNPN--GSLHDIAGVI 184 (219) Q Consensus 113 r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~-~~~~~l~~l~----~~~~i~~~Y~d-~~NPN--GS~~~IAgi~ 184 (219) ....++|++.+.|.++++.. +|. ..+|.|.... .+-+|.++-.. ...|+ |...-+|=.- T Consensus 113 ~~~~v~v~d~~HPi~~gl~~-------------~~~~~~DE~Y~~~~~~~~~~~~VL~~~~~~~~~~~~~~~~~Pv~W~~ 179 (216) T pfam06283 113 EPFTVWVVDPDHPITKGLPE-------------SFELPTDELYGEKDDNPRPDVHVLATADEGGYNGGGEGFRHPVAWTR 179 (216) T ss_pred CEEEEEECCCCCHHHHCCCC-------------CCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCEEEEEEEE T ss_conf 46999972899955607765-------------67465436622467788898589999823777889878736799999 Q ss_pred C-CCCCEEEE-CCCCHHHCC Q ss_conf 6-99989998-488434322 Q gi|254780971|r 185 N-RRGNVLGM-MPHPENIIE 202 (219) Q Consensus 185 s-~~G~vlgm-MPHPER~~~ 202 (219) . ..|||+.. |-|-+++++ T Consensus 180 ~~g~GRvfyt~lGH~~~~~~ 199 (216) T pfam06283 180 EYGKGRVFYTALGHEEYTYD 199 (216) T ss_pred ECCCCCEEEECCCCCHHHCC T ss_conf 62994589989998852306 No 164 >cd02033 BchX Chlorophyllide reductase converts chlorophylls into bacteriochlorophylls by reducing the chlorin B-ring. This family contains the X subunit of this three-subunit enzyme. Sequence and structure similarity between bchX, protochlorophyllide reductase L subunit (bchL and chlL) and nitrogenase Fe protein (nifH gene) suggest their functional similarity. Members of the BchX family serve as the unique electron donors to their respective catalytic subunits (bchN-bchB, bchY-bchZ and nitrogenase component 1). Mechanistically, they hydrolyze ATP and transfer electrons through a Fe4-S4 cluster. Probab=73.48 E-value=2.6 Score=22.83 Aligned_cols=55 Identities=22% Similarity=0.237 Sum_probs=33.1 Q ss_pred CCCCCCCCCCEEEECCCCCC------CCC---CCHHHHHH---HHHHHHHHHHHHHCCCE--EEE-ECCC Q ss_conf 15464456478997287345------431---14067752---00212122332205971--786-0640 Q gi|254780971|r 34 QSDTDIPDVDLIVIPGGFSY------GDY---LRCGAIAA---RTPVMQAIKKKAQQGIK--VMG-ICNG 88 (219) Q Consensus 34 ~~~~~l~~~d~lvipGGFSy------gD~---l~aG~i~~---~~~~~~~i~~~~~~g~~--vLG-ICNG 88 (219) .-|.-+-|+=+=|.+|||+- .|. +.||.+.| -..+.+.+.+|++.|+. +.| |||- T Consensus 147 d~D~V~yDVLGDVVCGGFAmPiR~g~A~evyIVtSgE~MalyAANNI~~~i~~~a~~gg~vrl~GlI~N~ 216 (329) T cd02033 147 DFDYVLLDFLGDVVCGGFGLPIARDMAQKVIVVGSNDLQSLYVANNVCNAVEYFRKLGGNVGVAGMVINK 216 (329) T ss_pred CCCEEEEECCCCEEECCCCCCCCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEC T ss_conf 8999999224535664633533568762899996780889998878999999998638971011598606 No 165 >PRK06107 aspartate aminotransferase; Provisional Probab=72.46 E-value=3.2 Score=22.32 Aligned_cols=155 Identities=15% Similarity=0.195 Sum_probs=76.4 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCCC-------CCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC--CCE----EE Q ss_conf 9999998549963998215464456-------4789972873454311406775200212122332205--971----78 Q gi|254780971|r 17 MIKAITKIIGQSPILVWQSDTDIPD-------VDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ--GIK----VM 83 (219) Q Consensus 17 ~~~A~~~~~~~~~~~v~~~~~~l~~-------~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~--g~~----vL 83 (219) .++.+++ .|.+.+..|..+-+++- ..--+--|=+.|++..+. ..+++++.++.++ |.. =+ T Consensus 24 ~a~~l~~-~G~dvi~L~iG~pdf~~p~~i~~a~~~al~~~~~~Y~~~~G~------~eLReAia~~~~~~~g~~v~p~~I 96 (402) T PRK06107 24 RARELRA-AGRSIVDLTVGEPDFDTPDHIKAAAVAAIERGETKYTLVNGT------PALRKAIIAKMERRNGLHYADNEI 96 (402) T ss_pred HHHHHHH-CCCCEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCC------HHHHHHHHHHHHHHHCCCCCCCEE T ss_conf 9999997-799727788989899978999999999985599999699888------999999999999986899982049 Q ss_pred EECCCHHHHEECC---CH-HHH--HCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHH Q ss_conf 6064031010000---01-011--01244332454225767525235776413799689986201330233086774115 Q gi|254780971|r 84 GICNGFQILVELN---LL-PGI--LMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEI 157 (219) Q Consensus 84 GICNGfQiL~elG---Ll-Pg~--l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l 157 (219) =|++|.|-...+- ++ ||- ++.+ ..+. .-. ..... ..+..+.+|... +..|..+.+.+++. T Consensus 97 ~vt~G~~~al~~~~~al~~pGD~Vlv~~--------P~Y~---~y~-~~~~~-~g~~~~~~p~~~-~~~~~~d~d~le~~ 162 (402) T PRK06107 97 CVGGGAKQAIFLALMATLEEGDEVIIPA--------PYWV---SYP-DMVLA-NDGTPVIVPCPE-EDGFKLTAEALEAA 162 (402) T ss_pred EECCCHHHHHHHHHHHHCCCCCEEEEEC--------CCCC---CHH-HHHHH-CCCCEEECCCCC-CCCCCCCHHHHHHH T ss_conf 9868889999999999579999899816--------8665---379-99998-399567368782-34875899999974 Q ss_pred HHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEEE Q ss_conf 4225036531688888988112------6688869998999 Q gi|254780971|r 158 EKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVLG 192 (219) Q Consensus 158 ~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vlg 192 (219) ...+--++=++.|+||.|+... |+.+|.+.+.++- T Consensus 163 ~~~~tk~iil~~P~NPTG~v~s~~~l~~i~~i~~~~~~i~i 203 (402) T PRK06107 163 ITPRTRWLILNAPSNPTGAAYSRDELRALADVLLRHPHVLV 203 (402) T ss_pred CCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEE T ss_conf 56476699988998989966534899999887752489328 No 166 >PRK06242 flavodoxin; Provisional Probab=71.82 E-value=8.9 Score=19.45 Aligned_cols=50 Identities=24% Similarity=0.316 Sum_probs=35.5 Q ss_pred CEEEEEECCC-CCCHHHHHHHHHHHCCCCEEEEECC-CCCCCCCCEEEECCC Q ss_conf 9689984497-0647999999998549963998215-464456478997287 Q gi|254780971|r 1 MKTAIVQIPG-LNRDNDMIKAITKIIGQSPILVWQS-DTDIPDVDLIVIPGG 50 (219) Q Consensus 1 mkvaVl~~pG-sNcd~e~~~A~~~~~~~~~~~v~~~-~~~l~~~d~lvipGG 50 (219) ||.+|+-+-- +-...-++.|+....+.+.+.+... +.++.+||+|++..| T Consensus 1 MK~~IvY~S~~tGNT~KvA~aiae~l~~~~~~~~~~~~~~l~~yDlI~~Gsg 52 (150) T PRK06242 1 MKALIVYVSIHHGNTEKIAKAMAEVLNADVIKPTDISPEDLNEYDLIGFGSG 52 (150) T ss_pred CCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCEEEECCC T ss_conf 9689999956889799999999976597399744588343021788998352 No 167 >TIGR01424 gluta_reduc_2 glutathione-disulfide reductase; InterPro: IPR006324 These sequences represent glutathione reductases of plants and some bacteria, including cyanobacteria.; GO: 0004362 glutathione-disulfide reductase activity, 0050660 FAD binding, 0050661 NADP binding, 0006749 glutathione metabolic process. Probab=71.37 E-value=3 Score=22.45 Aligned_cols=22 Identities=50% Similarity=0.681 Sum_probs=11.7 Q ss_pred CCCEEEECCCCCCCCCCCHHHHHHH Q ss_conf 6478997287345431140677520 Q gi|254780971|r 41 DVDLIVIPGGFSYGDYLRCGAIAAR 65 (219) Q Consensus 41 ~~d~lvipGGFSygD~l~aG~i~~~ 65 (219) |||+.||.+| |=| +||++.+|. T Consensus 2 DyDLFVIGAG-SGG--vrAar~AA~ 23 (478) T TIGR01424 2 DYDLFVIGAG-SGG--VRAARLAAA 23 (478) T ss_pred CCCEEEEECC-CHH--HHHHHHHHH T ss_conf 7630687067-166--899999987 No 168 >PRK08361 aspartate aminotransferase; Provisional Probab=70.56 E-value=6.9 Score=20.16 Aligned_cols=59 Identities=20% Similarity=0.286 Sum_probs=37.3 Q ss_pred CCCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCE Q ss_conf 7996899862013302330867741154225036531688888988112------66888699989 Q gi|254780971|r 131 KMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNV 190 (219) Q Consensus 131 ~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~v 190 (219) ....++.+|. +.|+.|..|.+.+++....+--++-++.|+||.|.... |+.+|.+.+-+ T Consensus 136 ~g~~~~~vpl-~~~~~~~~d~~~l~~~~~~~~k~ivl~~P~NPTG~v~s~e~l~~l~~la~~~~i~ 200 (390) T PRK08361 136 AEAKPIRIPL-REENNFQPDPDELLEAITKRTRMIVINYPNNPTGAVLDKETAKAIADIAEDYNIY 200 (390) T ss_pred CCCEEEEEEC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCE T ss_conf 2997999971-5878987799999973654675999889989757786678888999999863955 No 169 >PRK07310 consensus Probab=70.55 E-value=4.9 Score=21.10 Aligned_cols=59 Identities=12% Similarity=0.158 Sum_probs=38.5 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEEE Q ss_conf 96899862013302330867741154225036531688888988112------6688869998999 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVLG 192 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vlg 192 (219) +..+.++. ..|+.|..+-+.++......--++-+|.|+||.|+... |+.+|.+.+-++. T Consensus 137 ~~~v~~~~-~~~~~~~~d~~~l~~~~~~~~k~iil~~P~NPTG~v~s~~~l~~l~~~a~~~~i~vi 201 (395) T PRK07310 137 GVPVFVQA-KEENHFKVTVEQLEAARTSKTKVVLLNSPSNPTGMIYSAEELRAIGEWAVEHDILIL 201 (395) T ss_pred CEEEEEEC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCEEEE T ss_conf 98999756-744574504999997465897499989997985626635414677655652687999 No 170 >PRK07366 succinyldiaminopimelate transaminase; Validated Probab=69.70 E-value=3.3 Score=22.21 Aligned_cols=57 Identities=9% Similarity=0.117 Sum_probs=35.4 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCC Q ss_conf 996899862013302330867741154225036531688888988112------6688869998 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGN 189 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~ 189 (219) ..++..+|. +.|+.|..+-+.+++.....--++-.|.|.||.|+... |+.+|.+.+- T Consensus 137 G~~~~~vpl-~~~~~~~~d~~~l~~~i~~~tk~iil~~P~NPTG~v~s~~~l~~i~~~a~~~~i 199 (388) T PRK07366 137 GGQIYPMPL-RAENDFLPVFADIPTEVLAQARLMVLSYPHNPTTAIAPLSFFQEAVAFCQQHDL 199 (388) T ss_pred CCEEEEEEC-CCCCCCCCCHHHHHHHHHHCCCEEEECCCCCCCCEEEEHHHHHHHHHHHHHCCE T ss_conf 985799754-666798479999987404228299985898984668038888889998740550 No 171 >TIGR03540 DapC_direct LL-diaminopimelate aminotransferase. This clade of the pfam00155 superfamily of aminotransferases includes several which are adjacent to elements of the lysine biosynthesis via diaminopimelate pathway (GenProp0125). Every member of this clade is from a genome which possesses most of the lysine biosynthesis pathway but lacks any of the known aminotransferases, succinylases, desuccinylases, acetylases or deacetylases typical of the acylated versions of this pathway nor do they have the direct, NADPH-dependent enzyme (ddh). Although there is no experimental characterization of any of the sequences in this clade, a direct pathway is known in plants and Chlamydia, so it seems quite reasonable that these enzymes catalyze the same transformation. Probab=69.36 E-value=9.7 Score=19.22 Aligned_cols=58 Identities=10% Similarity=0.228 Sum_probs=32.8 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 96899862013302330867741154225036531688888988112------668886999899 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) ..++.+|. .-|+.|..|-+.+++.....--++-.|.|+||.|+... |+.+|.+.+-++ T Consensus 137 ~~~v~vpl-~~~~~~~~d~e~l~~~~~~~~k~iil~~P~NPTG~v~s~~~l~~i~~la~~~~i~i 200 (383) T TIGR03540 137 GEPYEMPL-KEENGFLPDFDAIPEDIAKKAKLMFINYPNNPTGAVAPLKFFKELVEFAKEYNIIV 200 (383) T ss_pred CEEEEEEC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCCHHHHCCCCCCCCCCCEE T ss_conf 85899613-35779857999998745237749998999897070121022220100124554057 No 172 >PRK06348 aspartate aminotransferase; Provisional Probab=69.25 E-value=4.2 Score=21.55 Aligned_cols=58 Identities=19% Similarity=0.253 Sum_probs=36.8 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 96899862013302330867741154225036531688888988112------668886999899 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) ..++.+|.. .+..|..|.+.++......--++-+|.|+||.|.... |+.+|.+.+-++ T Consensus 135 ~~~v~~~~~-~~~~~~~d~~~le~~i~~~tk~iil~~P~NPTG~v~~~e~l~~l~~la~~~~i~v 198 (383) T PRK06348 135 GKPIIFETY-EEDGFQINVDKLEALITSKTKAIILNSPNNPTGAVFSKETLEEIAKVAIENDLVI 198 (383) T ss_pred CCEEECCCC-CCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCEEE T ss_conf 862532445-6668657968989856878729998999798888777789999997653068699 No 173 >PRK05794 consensus Probab=68.35 E-value=4.3 Score=21.45 Aligned_cols=59 Identities=12% Similarity=0.139 Sum_probs=39.7 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEEE Q ss_conf 96899862013302330867741154225036531688888988112------6688869998999 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVLG 192 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vlg 192 (219) +..+.++. ..|+.|..+.+.+++....+--++-+|.|+||.|+... |+.+|-+.+-++- T Consensus 136 ~~~v~~~~-~~~~~~~~d~~~l~~~~~~~~k~iil~~P~NPtG~v~s~~~l~~l~~la~~~~i~vi 200 (397) T PRK05794 136 GVPVFVDT-KEENDFKYTIEELENAITSKTKAIIINSPNNPTGTVYSKEELEEIAKFAKEHDLFII 200 (397) T ss_pred CCEEEEEC-CCCCCCCCCHHHHHHHCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHCEEEEE T ss_conf 80299656-732377779999997533357189978998987824350679999976541036884 No 174 >PRK09271 flavodoxin; Provisional Probab=68.33 E-value=9.8 Score=19.20 Aligned_cols=75 Identities=11% Similarity=0.174 Sum_probs=39.7 Q ss_pred CEEEEEECCCCCCHHHHHHH----HHHHCCCCEEEEECC-------CCCCCCCCEEEECCCCCCCCCCCHHHHHH-HHHH Q ss_conf 96899844970647999999----998549963998215-------46445647899728734543114067752-0021 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKA----ITKIIGQSPILVWQS-------DTDIPDVDLIVIPGGFSYGDYLRCGAIAA-RTPV 68 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A----~~~~~~~~~~~v~~~-------~~~l~~~d~lvipGGFSygD~l~aG~i~~-~~~~ 68 (219) |||.|+---=|--..|++.. ++. .|.++..+-.. ..++.+||++++ |-..||| |..-. .... T Consensus 1 MKvlIvYaS~TGNTE~vA~~I~~~l~~-~G~eV~~~e~d~~~~~~~~~d~~~yDl~ll-G~yTwgd----G~lPdE~~DF 74 (160) T PRK09271 1 MRILLAYASLSGNTREVARMIAARCEE-AGHAVTWVETDLQTLAQAPLDPEEYDLFLL-GTWTDNA----GRTPPEMKRF 74 (160) T ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHH-CCCEEEEEEECCHHHHHCCCCCCCCCEEEE-ECCCCCC----CCCCHHHHHH T ss_conf 918999984887689999999999997-698237987010232203356111888999-6570489----9896678999 Q ss_pred HHHHHHHHHCCCE Q ss_conf 2122332205971 Q gi|254780971|r 69 MQAIKKKAQQGIK 81 (219) Q Consensus 69 ~~~i~~~~~~g~~ 81 (219) ++.+.+.--.|+. T Consensus 75 ~e~L~~~dl~gk~ 87 (160) T PRK09271 75 IAELVETIGKPPN 87 (160) T ss_pred HHHHHHHCCCCCE T ss_conf 9999860456876 No 175 >TIGR01701 Fdhalpha-like oxidoreductase alpha (molybdopterin) subunit; InterPro: IPR010046 This entry represents a well-defined clade of molybdopterin-dependent formate dehydrogenases, bacterial type , , , . It represents the alpha subunit, which contains a molybdopterin cofactor and generally associate with two other subunits which contain iron-sulphur clusters and cytochromes.; GO: 0008863 formate dehydrogenase activity, 0030151 molybdenum ion binding, 0051539 4 iron 4 sulfur cluster binding. Probab=67.38 E-value=5.5 Score=20.80 Aligned_cols=110 Identities=20% Similarity=0.272 Sum_probs=73.3 Q ss_pred EEECCCCC--------CHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHH-HHHHHHHH Q ss_conf 98449706--------4799999999854996399821546445647899728734543114067752002-12122332 Q gi|254780971|r 5 IVQIPGLN--------RDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTP-VMQAIKKK 75 (219) Q Consensus 5 Vl~~pGsN--------cd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~-~~~~i~~~ 75 (219) +.|+=||| |+.-...||.+..|...-.+... ++...|+||+-|= .-..+.| +++.+.+. T Consensus 174 faR~~GsNNlpDcS~mCH~pS~vaL~~SiG~G~g~v~l~--D~~~~D~~v~iG~----------n~gtN~PR~l~~L~~a 241 (824) T TIGR01701 174 FARSLGSNNLPDCSNMCHEPSSVALKKSIGIGKGSVLLE--DFEHTDLLVLIGS----------NAGTNHPRMLKELIKA 241 (824) T ss_pred HHHHHCCCCCHHHHHHCCCCCHHHHHHHCCCCEEEEEEE--ECCCCCEEEEECC----------CCCCCCCHHHHHHHHH T ss_conf 998727787133354316861123422023230367730--0005837999545----------4898884368899999 Q ss_pred HHCCCEEEEECCCHHHHEECCCH-------HHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCC Q ss_conf 20597178606403101000001-------011012443324542257675252357764137 Q gi|254780971|r 76 AQQGIKVMGICNGFQILVELNLL-------PGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYK 131 (219) Q Consensus 76 ~~~g~~vLGICNGfQiL~elGLl-------Pg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~ 131 (219) .++|..|+-| =.|.|.||+ |-..+.+.--+--+.++.+++ -.+-.++.|+. T Consensus 242 ~~rG~KiI~i----NPl~E~GL~rF~~P~~P~~~L~g~gt~I~s~y~Qv~~-GGD~Al~~G~~ 299 (824) T TIGR01701 242 KKRGAKIIAI----NPLRERGLERFASPQIPIKMLTGKGTQISSEYYQVRI-GGDIALLVGLM 299 (824) T ss_pred HHCCCEEEEE----CCCCCCHHHHCCCCCCCHHHHCCCCCCEEHEEECCCC-CHHHHHHHHHH T ss_conf 9639839997----5885501130167887602112787300010104777-42799999988 No 176 >PRK02155 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=66.81 E-value=11 Score=18.78 Aligned_cols=73 Identities=22% Similarity=0.341 Sum_probs=40.1 Q ss_pred EEEEEECCCCCCHHHHHHHHHHH---CCCCEEEEEC--CC-----------CCC-CCCCEEEECCCCCCCCCCCHHHHHH Q ss_conf 68998449706479999999985---4996399821--54-----------644-5647899728734543114067752 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKI---IGQSPILVWQ--SD-----------TDI-PDVDLIVIPGGFSYGDYLRCGAIAA 64 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~---~~~~~~~v~~--~~-----------~~l-~~~d~lvipGGFSygD~l~aG~i~~ 64 (219) ||+|+.=++..--.++++.+..+ -|.++.+-.. .. .++ .+.|++|.-|| | |. T Consensus 7 ~Vgiv~k~~~~~~~~~~~~l~~~L~~~g~~v~~e~~~a~~~~~~~~~~~~~~~~~~~~Dlvi~lGG----D----GT--- 75 (291) T PRK02155 7 TVALVGRYQTPGIAEPLEALAACIAKRGFEVVFEADTARNTGLTGYPALTPAEIGARADVAVVLGG----D----GT--- 75 (291) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCCCCCCCCCCHHHHCCCCCEEEEECC----C----HH--- T ss_conf 999991489868999999999999978899999555777639988774797994637678999767----8----89--- Q ss_pred HHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 002121223322059717860640 Q gi|254780971|r 65 RTPVMQAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 65 ~~~~~~~i~~~~~~g~~vLGICNG 88 (219) +..+.+.+...+.|+|||--| T Consensus 76 ---lL~~a~~~~~~~~PilGiN~G 96 (291) T PRK02155 76 ---MLGIGRQLAPYGTPLIGINHG 96 (291) T ss_pred ---HHHHHHHHHCCCCCEEEEECC T ss_conf ---999999871159968998547 No 177 >COG3155 ElbB Uncharacterized protein involved in an early stage of isoprenoid biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism] Probab=66.69 E-value=3.6 Score=21.98 Aligned_cols=56 Identities=21% Similarity=0.304 Sum_probs=32.7 Q ss_pred CCCCCCEEEECCCCCCCCCCCHHHHH----H-HHHHHHHHHHHHHCCCEEEEECCCHHHHE Q ss_conf 44564789972873454311406775----2-00212122332205971786064031010 Q gi|254780971|r 38 DIPDVDLIVIPGGFSYGDYLRCGAIA----A-RTPVMQAIKKKAQQGIKVMGICNGFQILV 93 (219) Q Consensus 38 ~l~~~d~lvipGGFSygD~l~aG~i~----~-~~~~~~~i~~~~~~g~~vLGICNGfQiL~ 93 (219) +..++|.+++||||.-.-.|.--++- + ..-+....+.+++.|+|+==||----||- T Consensus 82 ~~e~~DALivPGGFGAAKNLsdFA~kGaeC~v~pDv~al~~a~~~agKP~G~iCIaP~m~p 142 (217) T COG3155 82 DAEELDALIVPGGFGAAKNLSDFASKGAECSVDPDLKALAQAMHQAGKPLGFMCIAPAMLP 142 (217) T ss_pred CHHHCCEEECCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECHHHHH T ss_conf 9766342661586302320577744686020187699999999973898157885577778 No 178 >PRK08443 consensus Probab=65.49 E-value=6.2 Score=20.46 Aligned_cols=58 Identities=19% Similarity=0.224 Sum_probs=34.9 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 96899862013302330867741154225036531688888988112------668886999899 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) ...+.++.- .+..|..+.+.+++.....--++-+|.|+||.|+..+ |+.+|.+.+-++ T Consensus 135 ~~~v~v~~~-~~~~~~~d~~~l~~~i~~~~k~iil~~P~NPTG~v~s~~~l~~l~~~a~~~~~~i 198 (388) T PRK08443 135 GVPVFIETD-EENGFKITAEQLKKAITPKTKVLVLNTPSNPTGSVYSKEELEAIAKVLKGTDIWV 198 (388) T ss_pred CEEEEEEEC-CCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCEE T ss_conf 847998655-4348677999999747878739998689899884764478999999862357567 No 179 >PRK07682 hypothetical protein; Validated Probab=65.25 E-value=3 Score=22.52 Aligned_cols=58 Identities=10% Similarity=0.240 Sum_probs=38.0 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 96899862013302330867741154225036531688888988112------668886999899 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) ...++++. +.|..|..+.+.+++.....--++=+|.|+||.|.... |+.+|.+.+-++ T Consensus 127 ~~~v~~~~-~~~~~~~~d~~~le~~~~~~~k~iil~~P~NPtG~v~s~~el~~l~~la~~~~i~i 190 (378) T PRK07682 127 GVPVPVAT-SLENEFKVQPAQIEAAITAKTKAILLCSPNNPTGAVLNKSELEEIAVIVEKHNLIV 190 (378) T ss_pred CEEEEECC-CCCCCCCCCHHHHHHHCCCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCEEE T ss_conf 85699402-65458637999999736526838998289798888889999999999764459679 No 180 >PRK07367 consensus Probab=63.93 E-value=5.3 Score=20.90 Aligned_cols=59 Identities=17% Similarity=0.201 Sum_probs=36.4 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 996899862013302330867741154225036531688888988112------668886999899 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) ..+++.+|.. .|..|..+.+.++......--++-+++|.||.|+... |+.+|.+.+-++ T Consensus 133 g~~~v~v~~~-~~~~~~~d~~~l~~~~~~~~k~~il~~P~NPTG~v~s~~~l~~l~~~a~~~~~~i 197 (385) T PRK07367 133 GGTPVIVPTD-AATGFKITPEQLRQAITPKTKLLVLNSPSNPTGMVYTPEEIAALAEVIVEHDLYV 197 (385) T ss_pred CCEEEEECCC-CCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCEEE T ss_conf 9857992136-2127677999999737867749997899798785665788889988887558699 No 181 >COG1168 MalY Bifunctional PLP-dependent enzyme with beta-cystathionase and maltose regulon repressor activities [Amino acid transport and metabolism] Probab=63.85 E-value=9.1 Score=19.40 Aligned_cols=65 Identities=29% Similarity=0.511 Sum_probs=44.6 Q ss_pred HHHHHHCCCC--CEEEEEEECCCEEEEECHHHHHHHHHCCE-EEEECCCCCCCCCCCCC------EEEEECCCC Q ss_conf 5776413799--68998620133023308677411542250-36531688888988112------668886999 Q gi|254780971|r 124 TAFTKSYKMN--QIIKCPVAHHDGNYFIDAKGLAEIEKNNQ-IVFRYASGTNPNGSLHD------IAGVINRRG 188 (219) Q Consensus 124 s~~~~~~~~~--~~l~~piaHgEGrf~~~~~~l~~l~~~~~-i~~~Y~d~~NPNGS~~~------IAgi~s~~G 188 (219) .||.+..+.+ ++++.|..-..|||.+|=+.|++....+. ..+=-|.|.||-|+... |+-||-+-| T Consensus 118 ~PF~~~i~~n~R~~i~~pL~~~~~~y~iD~~~LE~~~~~~~vkl~iLCnPHNP~Grvwt~eeL~~i~elc~kh~ 191 (388) T COG1168 118 PPFYNAIKLNGRKVIENPLVEDDGRYEIDFDALEKAFVDERVKLFILCNPHNPTGRVWTKEELRKIAELCLRHG 191 (388) T ss_pred HHHHHHHHHCCCEEEECCCCCCCCCEEECHHHHHHHHHCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 18999875169679746643469847752999999984377608999379999986436999999999999839 No 182 >PRK07324 transaminase; Validated Probab=62.85 E-value=6.7 Score=20.26 Aligned_cols=55 Identities=16% Similarity=0.312 Sum_probs=27.4 Q ss_pred CEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCC Q ss_conf 6899862013302330867741154225036531688888988112------6688869998 Q gi|254780971|r 134 QIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGN 189 (219) Q Consensus 134 ~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~ 189 (219) ++..+|+ +-|+.|..|-+.++++...+--++-.+.|.||.|...+ |+.+|.+.+- T Consensus 127 ~v~~~~l-~~~~~~~~Dl~~l~~~i~~~tkliil~nP~NPTG~v~s~e~l~~l~~la~~~~i 187 (373) T PRK07324 127 EVDYWKL-REENGWLPDLDELKRLVRPNTKLICINNANNPTGALMDRAFLEEIVEIAKSVDA 187 (373) T ss_pred EEEECCC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCE T ss_conf 7654076-722598879999996188787699979997988977889999999987541785 No 183 >PRK07777 aminotransferase; Validated Probab=62.51 E-value=10 Score=19.14 Aligned_cols=47 Identities=15% Similarity=0.269 Sum_probs=27.1 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC Q ss_conf 96899862013302330867741154225036531688888988112 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD 179 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~ 179 (219) ...+.+|+...+.+|..|.+.+++.....--++-+++|+||.|.... T Consensus 131 ~~~v~vpl~~~~~~~~~d~~~l~~~~~~~tk~iil~~P~NPTG~v~s 177 (386) T PRK07777 131 AHRVAVPLVPDGRGFALDVDALRAAVTPRTRALIVNSPHNPTGTVLS 177 (386) T ss_pred CEEEECCCCCCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCC T ss_conf 92561343477888466969999746877769997999698884888 No 184 >PRK00561 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=62.42 E-value=14 Score=18.26 Aligned_cols=70 Identities=23% Similarity=0.361 Sum_probs=40.3 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 96899844970647999999998549963998215464456478997287345431140677520021212233220597 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) ||++|+.-.-... ..++..+.++... .....+|.|+++.-|| | |. +..+.+.+...+. T Consensus 1 mk~~i~~~~~~~s-~~~~~~l~~~~~~-------~~~~~d~~DlviviGG----D----GT------~L~a~~~~~~~~i 58 (259) T PRK00561 1 MKYKIFASTTPQT-EPVLPKLKKVLKK-------KLAVEDGADYLFVLGG----D----GF------FVSTAANYNCAGC 58 (259) T ss_pred CEEEEEECCCHHH-HHHHHHHHHHHHC-------CCCCCCCCCEEEEECC----H----HH------HHHHHHHHCCCCC T ss_conf 9699993888657-9999999999854-------7867889999999897----1----99------9999998554799 Q ss_pred EEEEECC---CHHHHEE Q ss_conf 1786064---0310100 Q gi|254780971|r 81 KVMGICN---GFQILVE 94 (219) Q Consensus 81 ~vLGICN---GfQiL~e 94 (219) |+|||-- || |++ T Consensus 59 PilGIN~G~lGF--Lt~ 73 (259) T PRK00561 59 KVVGINTGHLGF--YTS 73 (259) T ss_pred CEEEEECCCCEE--EEC T ss_conf 689996697336--415 No 185 >PRK06207 aspartate aminotransferase; Provisional Probab=62.32 E-value=12 Score=18.72 Aligned_cols=57 Identities=23% Similarity=0.316 Sum_probs=37.3 Q ss_pred CCCEEEEEEEC---CCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCC Q ss_conf 99689986201---3302330867741154225036531688888988112------668886999 Q gi|254780971|r 132 MNQIIKCPVAH---HDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRG 188 (219) Q Consensus 132 ~~~~l~~piaH---gEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G 188 (219) .++++++++.+ .+.+|-.|-+.++......--++-+++|+||.|+... |+.+|.+.+ T Consensus 147 g~~~vpv~l~~~~~~~~~~~~d~d~le~~i~~~tk~iil~nP~NPTG~v~s~e~l~~l~~la~~~~ 212 (406) T PRK06207 147 EGEIVPVQLDYVSVDETRAGLDLDQLEAAFKAGVRVFLFSNPNNPAGVVYSPEEIAQIAALARRYG 212 (406) T ss_pred CCEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCHHCCHHHHHHHHHHHHHCC T ss_conf 987999856776776344689999999745448769998899798572133999999999875579 No 186 >PRK01231 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=60.80 E-value=15 Score=18.08 Aligned_cols=73 Identities=21% Similarity=0.397 Sum_probs=38.3 Q ss_pred EEEEEECCCCCCHHHHHHHHHHH---CCCCEEEEECC-------C------CCC-CCCCEEEECCCCCCCCCCCHHHHHH Q ss_conf 68998449706479999999985---49963998215-------4------644-5647899728734543114067752 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKI---IGQSPILVWQS-------D------TDI-PDVDLIVIPGGFSYGDYLRCGAIAA 64 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~---~~~~~~~v~~~-------~------~~l-~~~d~lvipGGFSygD~l~aG~i~~ 64 (219) ||+|+.=|+.---.++++.+.++ -+.++.+-... . ..+ ...|+++.-|| | |. T Consensus 6 ~vgIv~k~~~~~a~~~~~~l~~~L~~~gi~v~ld~~~a~~~~~~~~~~~~~~~~~~~~Dlii~lGG----D----GT--- 74 (296) T PRK01231 6 NIGLIGRLGSSQVVETLRRLKRFLLDRHLHVILEEETAEVLPGHGLQTVSRKLLGEVCDLVIVVGG----D----GS--- 74 (296) T ss_pred EEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCCCCCCCCCCHHHCCCCEEEEEEECC----C----CH--- T ss_conf 999998489879999999999999878899999324776658788662453341653049999578----7----28--- Q ss_pred HHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 002121223322059717860640 Q gi|254780971|r 65 RTPVMQAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 65 ~~~~~~~i~~~~~~g~~vLGICNG 88 (219) +..+.+.+.....|+|||--| T Consensus 75 ---~L~~~~~~~~~~~PilGiN~G 95 (296) T PRK01231 75 ---LLGAARALARHNVPVLGINRG 95 (296) T ss_pred ---HHHHHHHHCCCCCCEEEEECC T ss_conf ---999999960369978988558 No 187 >PRK09276 aspartate aminotransferase; Provisional Probab=60.63 E-value=7.4 Score=19.97 Aligned_cols=58 Identities=12% Similarity=0.229 Sum_probs=33.5 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCE Q ss_conf 996899862013302330867741154225036531688888988112------66888699989 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNV 190 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~v 190 (219) ...+..+|. +.|+.|..|-+.+++....+--++-.+.|.||.|+... |+-+|.+.+-. T Consensus 138 g~~~v~vp~-~~~~~~~~d~~~l~~~~~~~~k~iil~~P~NPTG~v~s~~~l~~l~~la~~~~i~ 201 (385) T PRK09276 138 GGEPYFMPL-KEENGFLPDLDAIPEDVARKAKLMFINYPNNPTGAVADLEFFEKVVDFAKKYDII 201 (385) T ss_pred CCEEEECCC-CCCCCCCCCHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCE T ss_conf 997850686-6667985799999975101672999899989855111188999998763255756 No 188 >PRK07681 aspartate aminotransferase; Provisional Probab=60.57 E-value=6 Score=20.54 Aligned_cols=55 Identities=9% Similarity=0.179 Sum_probs=32.0 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCC Q ss_conf 96899862013302330867741154225036531688888988112------668886999 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRG 188 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G 188 (219) ..+..+|. ..|+.|..|-+.+++.....--++-++.|+||.|+... |+-+|.+.+ T Consensus 139 ~~~v~vpl-~~e~~~~~d~~~l~~~i~~~tk~iil~~P~NPTG~v~s~e~l~~l~~la~~~~ 199 (399) T PRK07681 139 ATSYYMPL-KKENDFLPDLELIPEEIADKAKMMILNFPGNPVPAMAHEDFFKEVIAFAKKHN 199 (399) T ss_pred CEEEEEEC-CCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 98999505-76667436899999853225559997699898887789999999999853136 No 189 >PRK13234 nifH nitrogenase reductase; Reviewed Probab=60.57 E-value=12 Score=18.72 Aligned_cols=16 Identities=19% Similarity=0.223 Sum_probs=7.1 Q ss_pred HHHHHHHHCCEEEEEC Q ss_conf 7741154225036531 Q gi|254780971|r 152 KGLAEIEKNNQIVFRY 167 (219) Q Consensus 152 ~~l~~l~~~~~i~~~Y 167 (219) ...++-+.++|-++.| T Consensus 216 ~~v~~aE~~~~TviE~ 231 (293) T PRK13234 216 NIVQHAELRRMTVIEY 231 (293) T ss_pred HHHHHHHHCCCEEEEE T ss_conf 8899999739777896 No 190 >PRK07550 hypothetical protein; Provisional Probab=60.12 E-value=8.3 Score=19.65 Aligned_cols=59 Identities=17% Similarity=0.229 Sum_probs=31.5 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEEE Q ss_conf 96899862013302330867741154225036531688888988112------6688869998999 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVLG 192 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vlg 192 (219) .....+|+... ..|..|.+.+++....+--++-++.|+||.|.... |+-+|.+.+-++- T Consensus 137 ~~~~~vp~~~~-~~~~~d~~~le~~~~~~tk~iil~~P~NPtG~v~s~e~l~~l~~la~~~~~~iI 201 (387) T PRK07550 137 IRAVLVPLDTG-PGLLPDPEAAAALITPRTRAIVLVTPNNPTGVVYPPELLHELYDLARRRGLALI 201 (387) T ss_pred CEEEECCCCCC-CCCCCCHHHHHHHCCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHCCCEEEEEE T ss_conf 98973477756-798779999997556578377038996998851077985346543033158999 No 191 >PRK03957 V-type ATP synthase subunit F; Provisional Probab=59.76 E-value=6.6 Score=20.27 Aligned_cols=25 Identities=32% Similarity=0.434 Sum_probs=19.9 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCC-CEEEE Q ss_conf 968998449706479999999985499-63998 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQ-SPILV 32 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~-~~~~v 32 (219) ||+||+ +|+|+.-.|. ++|. +.+.+ T Consensus 1 mkIaVi------GD~Dtv~GF~-LaGi~~~~~v 26 (100) T PRK03957 1 MKIAVV------GDSDTVVGFR-LAGLTEVYEV 26 (100) T ss_pred CEEEEE------CCHHHHHHHH-HHCCCCCCCC T ss_conf 979998------5798999999-8088867478 No 192 >PRK06836 aspartate aminotransferase; Provisional Probab=59.72 E-value=14 Score=18.25 Aligned_cols=53 Identities=13% Similarity=0.255 Sum_probs=33.0 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECC Q ss_conf 996899862013302330867741154225036531688888988112------6688869 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINR 186 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~ 186 (219) .+.++.+|. +.|+ |..|-+.+++.....--++-.++|+||.|+... |+.+|.+ T Consensus 142 g~~~v~v~~-~~e~-~~~d~~~l~~~i~~~tk~iilnsP~NPTG~v~s~e~l~~i~~l~~~ 200 (396) T PRK06836 142 GGKLVVVPP-DTED-FQPDLDALEAAITPKTKAVIINSPNNPTGVIYSEETLKALGALLEE 200 (396) T ss_pred CCEEEEECC-CCCC-CCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 997999316-7656-7679999997367566299986898989986799999999999998 No 193 >CHL00072 chlL photochlorophyllide reductase subunit L Probab=59.29 E-value=9.7 Score=19.22 Aligned_cols=14 Identities=7% Similarity=0.199 Sum_probs=5.5 Q ss_pred HHHHHHCCEEEEEC Q ss_conf 41154225036531 Q gi|254780971|r 154 LAEIEKNNQIVFRY 167 (219) Q Consensus 154 l~~l~~~~~i~~~Y 167 (219) ..+-+.+++.++.| T Consensus 209 V~~ae~~~~TviE~ 222 (271) T CHL00072 209 IRVSRVKGKTLFEM 222 (271) T ss_pred HHHHHHCCCCEEEC T ss_conf 68999748820422 No 194 >cd05297 GH4_alpha_glucosidase_galactosidase Glycoside Hydrolases Family 4; Alpha-glucosidases and alpha-galactosidases. linked to 3D####ucture Probab=58.75 E-value=3.6 Score=21.94 Aligned_cols=16 Identities=44% Similarity=0.762 Sum_probs=12.2 Q ss_pred CCEEEEECCCHHHHEE Q ss_conf 9717860640310100 Q gi|254780971|r 79 GIKVMGICNGFQILVE 94 (219) Q Consensus 79 g~~vLGICNGfQiL~e 94 (219) +..++|+|++-|-+.+ T Consensus 160 ~~k~iGlC~~p~~~~~ 175 (423) T cd05297 160 PIKTVGLCHGVQGTAE 175 (423) T ss_pred CCCEEEECCCHHHHHH T ss_conf 9978987977699999 No 195 >PRK09265 aminotransferase AlaT; Validated Probab=58.24 E-value=7.1 Score=20.08 Aligned_cols=37 Identities=14% Similarity=0.260 Sum_probs=16.6 Q ss_pred CCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC Q ss_conf 3302330867741154225036531688888988112 Q gi|254780971|r 143 HDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD 179 (219) Q Consensus 143 gEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~ 179 (219) -|..|..|-+.++.......-++-.+.|+||.|+..+ T Consensus 150 ~~~~f~~d~~~l~~~i~~~tk~iil~nP~NPTG~v~~ 186 (404) T PRK09265 150 EEAGWFPDLDDIRSKITPRTKAIVIINPNNPTGAVYS 186 (404) T ss_pred CCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCC T ss_conf 3237788999999765745559998689795120025 No 196 >COG4090 Uncharacterized protein conserved in archaea [Function unknown] Probab=57.99 E-value=9.2 Score=19.36 Aligned_cols=51 Identities=22% Similarity=0.461 Sum_probs=30.0 Q ss_pred CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC--CCEEEEECCCHHHHEE-CCCH Q ss_conf 644564789972873454311406775200212122332205--9717860640310100-0001 Q gi|254780971|r 37 TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ--GIKVMGICNGFQILVE-LNLL 98 (219) Q Consensus 37 ~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~--g~~vLGICNGfQiL~e-lGLl 98 (219) ..+...|.+|+-||-|.-- .| .-.+..+++.++ +++++|+| ||-|.+ .||- T Consensus 81 e~~n~aDvvVLlGGLaMP~---~g------v~~d~~kel~ee~~~kkliGvC--fm~mF~ragW~ 134 (154) T COG4090 81 EELNSADVVVLLGGLAMPK---IG------VTPDDAKELLEELGNKKLIGVC--FMNMFERAGWD 134 (154) T ss_pred CCCCCCCEEEEECCCCCCC---CC------CCHHHHHHHHHHCCCCCEEEEE--HHHHHHHCCCC T ss_conf 6666445899975644676---78------9979999999843898658751--89899874850 No 197 >KOG3093 consensus Probab=57.47 E-value=4.5 Score=21.32 Aligned_cols=52 Identities=31% Similarity=0.459 Sum_probs=27.1 Q ss_pred CCEEEECC-CCC-CCCCCCHHHHHHHHHHHHHHHHHHHCCC-EEEEECCCHHHHEE Q ss_conf 47899728-734-5431140677520021212233220597-17860640310100 Q gi|254780971|r 42 VDLIVIPG-GFS-YGDYLRCGAIAARTPVMQAIKKKAQQGI-KVMGICNGFQILVE 94 (219) Q Consensus 42 ~d~lvipG-GFS-ygD~l~aG~i~~~~~~~~~i~~~~~~g~-~vLGICNGfQiL~e 94 (219) .|+++.|| +|+ +|-++|-|+.....- ++.-+.++..-+ +.+|+|--=||+.+ T Consensus 129 lDLiivPGvAFd~~g~RlGhGkGYYD~f-lkry~~~~~~~kp~~vgL~l~EQI~~~ 183 (200) T KOG3093 129 LDLIIVPGVAFDRKGARLGHGKGYYDDF-LKRYQIHAPEQKPLLVGLCLKEQILSE 183 (200) T ss_pred CEEEEECCCCCCHHHHHCCCCCCHHHHH-HHHHHHHCCCCCCHHHHHHHHHHHCCC T ss_conf 0289836622133202126886518899-999998655568314544546765366 No 198 >TIGR03375 type_I_sec_LssB type I secretion system ATPase, LssB family. Type I protein secretion is a system in some Gram-negative bacteria to export proteins (often proteases) across both inner and outer membranes to the extracellular medium. This is one of three proteins of the type I secretion apparatus. Targeted proteins are not cleaved at the N-terminus, but rather carry signals located toward the extreme C-terminus to direct type I secretion. This model is related to models TIGR01842 and TIGR01846, and to bacteriocin ABC transporters that cleave their substrates during export. Probab=57.43 E-value=16 Score=17.81 Aligned_cols=21 Identities=10% Similarity=0.004 Sum_probs=8.4 Q ss_pred CCEEEEEEEEEECCCHHHHHH Q ss_conf 324542257675252357764 Q gi|254780971|r 108 LKFVCKQVLLEVVNSNTAFTK 128 (219) Q Consensus 108 ~rf~~r~~~~~v~~~~s~~~~ 128 (219) +.-+.+.+..+-.+...+.++ T Consensus 462 g~i~~~~vsf~y~~~~~~vl~ 482 (694) T TIGR03375 462 GEIEFRNVSFAYPGQETPALD 482 (694) T ss_pred CEEEEEEEEEECCCCCCHHHC T ss_conf 349999999987988922213 No 199 >PRK06507 consensus Probab=57.27 E-value=6.3 Score=20.40 Aligned_cols=60 Identities=17% Similarity=0.165 Sum_probs=35.4 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEEE Q ss_conf 996899862013302330867741154225036531688888988112------6688869998999 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVLG 192 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vlg 192 (219) ...++.+|.. .|..|..+.+.++......--++=.|.|+||.|.... |+.+|-+.++++- T Consensus 136 g~~~v~v~l~-~~~g~~~d~~~le~~~~~~tk~iil~~P~NPTG~v~s~~~l~~i~~~a~~~~~v~v 201 (400) T PRK06507 136 GGVPVAVPCR-EETGFKLRPEDLEAAITPRTKWLFLNFPNNPTGAACTRAEMAAIAEVMLRHPHVWI 201 (400) T ss_pred CCEEEEEECC-CCCCCCCCHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 9915997567-01176416999997467566499974992931288647799999986653168214 No 200 >PRK05764 aspartate aminotransferase; Provisional Probab=57.25 E-value=14 Score=18.25 Aligned_cols=157 Identities=16% Similarity=0.191 Sum_probs=73.5 Q ss_pred HHHHHHHCCCCEEEEECCCCCCCCCC-------EEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC--CCE----EEE Q ss_conf 99999854996399821546445647-------89972873454311406775200212122332205--971----786 Q gi|254780971|r 18 IKAITKIIGQSPILVWQSDTDIPDVD-------LIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ--GIK----VMG 84 (219) Q Consensus 18 ~~A~~~~~~~~~~~v~~~~~~l~~~d-------~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~--g~~----vLG 84 (219) ++..++ .|.+.+..|..+-+++--. -.+--|-|.|++..|- ..++++|.++.++ |.. =+= T Consensus 23 a~~~~~-~G~dvi~l~~g~pd~~~p~~i~~a~~~~~~~~~~~Y~~~~G~------~~LR~aia~~~~~~~g~~v~~d~I~ 95 (389) T PRK05764 23 AKELKA-AGRDVISLGAGEPDFDTPEHIKEAAIAALDEGKTKYTPAAGI------PELREAIAEKLKRDNGLDYEPDQII 95 (389) T ss_pred HHHHHH-CCCCEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCC------HHHHHHHHHHHHHHHCCCCCHHHEE T ss_conf 999997-799827888978999988999999999984799999899887------9999999999999868998579789 Q ss_pred ECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECC--CHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHHHHHHCCE Q ss_conf 0640310100000101101244332454225767525--23577641379968998620133023308677411542250 Q gi|254780971|r 85 ICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVN--SNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQ 162 (219) Q Consensus 85 ICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~--~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~ 162 (219) |.+|+|--+.+=+ -+|+ |... .+-+.-+. .-....+. ...+++.+|.- .|+.|..|.+.+++.....- T Consensus 96 it~G~~~al~~~~--~~l~-~pGD-----~Vlv~~P~Y~~y~~~~~~-~g~~~v~vp~~-~~~~~~~d~~~l~~~~~~~~ 165 (389) T PRK05764 96 VTTGAKQALYNAF--MALL-NPGD-----EVIIPAPYWVSYPEMVKL-AGGKPVFVPTG-EENGFKLTPEQLEAAITPKT 165 (389) T ss_pred ECCCHHHHHHHHH--HHHC-CCCC-----EEEECCCCCHHHHHHHHH-CCCEEEEECCC-HHHCCCCCHHHHHHHCCCCC T ss_conf 8888799999999--9958-9999-----899857862458999996-49865550436-22087689999998638466 Q ss_pred EEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 36531688888988112------668886999899 Q gi|254780971|r 163 IVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 163 i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) -++-++.|.||.|+... |+.+|.+.+-++ T Consensus 166 k~i~l~~P~NPtG~v~s~~~l~~l~~~a~~~~i~i 200 (389) T PRK05764 166 KALILNSPSNPTGAVYSKEELEAIADVAVEHDIWV 200 (389) T ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCEE T ss_conf 49998899899886757799999999988635327 No 201 >PRK07778 consensus Probab=56.96 E-value=12 Score=18.72 Aligned_cols=54 Identities=26% Similarity=0.328 Sum_probs=30.4 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCC Q ss_conf 96899862013302330867741154225036531688888988112------668886999 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRG 188 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G 188 (219) .+++.+|+ +.|++|..+.+.++.+..+-. ++-+++|+||.|.... |+.+|.+.+ T Consensus 136 ~~~v~vp~-~~~~~~~~~~~~l~~~~~~~k-~iil~~P~NPTG~v~s~~~l~~l~~la~~~~ 195 (386) T PRK07778 136 CEVVEIPC-GPDTRFQPTAAMLAELDPPVR-GVIVASPANPTGTVIAPEELAAIASWCEASG 195 (386) T ss_pred CEEEECCC-CCCCCCCCHHHHHHHCCCCCE-EEEECCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 82572367-744566723889862586643-9998999898877850799999999985378 No 202 >pfam01513 NAD_kinase ATP-NAD kinase. Members of this family include ATP-NAD kinases EC:2.7.1.23, which catalyses the phosphorylation of NAD to NADP utilising ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus. Also includes NADH kinases EC:2.7.1.86. Probab=55.70 E-value=18 Score=17.54 Aligned_cols=68 Identities=25% Similarity=0.457 Sum_probs=42.8 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 68998449706479999999985499639982154644564789972873454311406775200212122332205971 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIK 81 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~ 81 (219) ||+|+.=|...--.+++.-+.++..... .+..-.++|+++.-|| | |. +....+.+...+.| T Consensus 1 kvgiv~n~~~~~a~~~~~~l~~~L~~~~-----~~~~~~~~Dlii~lGG----D----GT------~L~~~~~~~~~~~P 61 (243) T pfam01513 1 KVGIIVNPDKVEAEERASELQRLLLDAT-----REMVEEGVDLIVVLGG----D----GT------ALDAARLLGDHDIP 61 (243) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHHHCC-----HHHHCCCCCEEEEECC----C----HH------HHHHHHHHCCCCCC T ss_conf 9899987996789999999999988647-----2130559889999898----7----89------99999984567995 Q ss_pred EEEECCC Q ss_conf 7860640 Q gi|254780971|r 82 VMGICNG 88 (219) Q Consensus 82 vLGICNG 88 (219) +|||--| T Consensus 62 ilGin~G 68 (243) T pfam01513 62 ILGINTG 68 (243) T ss_pred EEEEECC T ss_conf 8998569 No 203 >COG1897 MetA Homoserine trans-succinylase [Amino acid transport and metabolism] Probab=55.62 E-value=9.2 Score=19.37 Aligned_cols=137 Identities=14% Similarity=0.216 Sum_probs=75.6 Q ss_pred CCCCCCEEEECCC----CCCCCCCCHHHHHHHHHHHHHHHHHHHCC-CEEEEECCCHHHHEEC--CCHHHHHCCCCCCCE Q ss_conf 4456478997287----34543114067752002121223322059-7178606403101000--001011012443324 Q gi|254780971|r 38 DIPDVDLIVIPGG----FSYGDYLRCGAIAARTPVMQAIKKKAQQG-IKVMGICNGFQILVEL--NLLPGILMRNCSLKF 110 (219) Q Consensus 38 ~l~~~d~lvipGG----FSygD~l~aG~i~~~~~~~~~i~~~~~~g-~~vLGICNGfQiL~el--GLlPg~l~~N~s~rf 110 (219) +-..|||+|+-|- -.|.| .+.+.. ++++.+..+.- ...|-||=|.|.-... |+=.-.|..--++-| T Consensus 96 k~~~FDG~IiTGAPve~l~fee------V~YW~e-l~~I~eWskt~V~STl~ICWgaqAaly~~yGv~K~~l~~Kl~GVy 168 (307) T COG1897 96 KDQKFDGLIITGAPVELLPFEE------VAYWEE-LKQIFEWSKTHVTSTLHICWGAQAALYYFYGVPKYTLPEKLSGVY 168 (307) T ss_pred HHCCCCCEEEECCCCCCCCCHH------HHHHHH-HHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCEE T ss_conf 6454671598388534457004------306999-999999886420013566888888899980987343654340212 Q ss_pred EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE-EECHHHHHHHHHCCEEEEECCCCCCCCCCCCCEEEEECCCCC Q ss_conf 5422576752523577641379968998620133023-308677411542250365316888889881126688869998 Q gi|254780971|r 111 VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY-FIDAKGLAEIEKNNQIVFRYASGTNPNGSLHDIAGVINRRGN 189 (219) Q Consensus 111 ~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf-~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~IAgi~s~~G~ 189 (219) ..+ +....++.++|++ +.+-.|.+ |+ .++.+.+++..+-. |.. =.+ ..+..=+.+++|| T Consensus 169 ~h~-----~l~p~~~l~rGfd--d~f~~PhS----R~t~~~~e~i~~~~~Le-IL~-es~-------e~G~~l~a~k~~r 228 (307) T COG1897 169 KHD-----ILSPHSLLTRGFD--DSFLAPHS----RYTDVPKEDILAVPDLE-ILA-ESK-------EAGVYLLASKDGR 228 (307) T ss_pred ECC-----CCCCCCHHHCCCC--CCCCCCCC----CCCCCCHHHHHHCCCCE-EEE-CCC-------CCCEEEEECCCCC T ss_conf 212-----4586626442677--43237642----22457999984289851-343-266-------4536898627777 Q ss_pred EEEECCCCHHHC Q ss_conf 999848843432 Q gi|254780971|r 190 VLGMMPHPENII 201 (219) Q Consensus 190 vlgmMPHPER~~ 201 (219) .+=|+-|||--. T Consensus 229 ~ifv~gH~EYD~ 240 (307) T COG1897 229 NIFVTGHPEYDA 240 (307) T ss_pred EEEEECCCCHHH T ss_conf 678717741025 No 204 >PRK03372 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=55.58 E-value=18 Score=17.52 Aligned_cols=73 Identities=18% Similarity=0.341 Sum_probs=40.4 Q ss_pred EEEEEECCCCCCHHHHHHHHHHH---CCCCEEEEECCC---------------------CCCCCCCEEEECCCCCCCCCC Q ss_conf 68998449706479999999985---499639982154---------------------644564789972873454311 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKI---IGQSPILVWQSD---------------------TDIPDVDLIVIPGGFSYGDYL 57 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~---~~~~~~~v~~~~---------------------~~l~~~d~lvipGGFSygD~l 57 (219) +|+|+.=|+.---.|+++.+.+. .+.++.+..... ....+.|+++.-|| | T Consensus 6 ~V~IV~k~~~~~a~~~a~~l~~~L~~~g~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Dlvi~lGG----D-- 79 (303) T PRK03372 6 TVLLVAHTGRDEATESARRVVKQLGDAGIGVRVLAAEAADLPDDMRALGVEIEVVDADPDAADGCELVLVLGG----D-- 79 (303) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECC----C-- T ss_conf 8999970899899999999999999788989997040100244455556542234532233578558999778----7-- Q ss_pred CHHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 4067752002121223322059717860640 Q gi|254780971|r 58 RCGAIAARTPVMQAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 58 ~aG~i~~~~~~~~~i~~~~~~g~~vLGICNG 88 (219) |.+. .+.+.+...+.|+|||--| T Consensus 80 --GT~L------~aar~~~~~~iPilGiN~G 102 (303) T PRK03372 80 --GTFL------RAAELARNADVPVLGVNLG 102 (303) T ss_pred --HHHH------HHHHHHCCCCCCEEEEECC T ss_conf --8999------9999844479988987259 No 205 >PRK04539 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=55.52 E-value=18 Score=17.52 Aligned_cols=73 Identities=25% Similarity=0.423 Sum_probs=37.9 Q ss_pred EEEEEECCCCCCHHHHHHHHHHH---CCCCEEEEEC---------CC---------CCC-CCCCEEEECCCCCCCCCCCH Q ss_conf 68998449706479999999985---4996399821---------54---------644-56478997287345431140 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKI---IGQSPILVWQ---------SD---------TDI-PDVDLIVIPGGFSYGDYLRC 59 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~---~~~~~~~v~~---------~~---------~~l-~~~d~lvipGGFSygD~l~a 59 (219) +|+|+.=|+.---.+++.-+.+. -+..+++-.. .+ .++ ...|+++.-|| | T Consensus 7 ~Igiv~k~~~~~~~~~~~~l~~~L~~~g~~v~ld~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Dlii~lGG----D---- 78 (296) T PRK04539 7 NIGIVTRPNTPDIQDTAHTLITFLKQHGFTVYLDEVGVKEGCIYTQDTVGCHIVNKTELGQYCDLVAVLGG----D---- 78 (296) T ss_pred EEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEECCHHHHCCCCCCCCCCCCCCCHHHHCCCCCEEEEECC----C---- T ss_conf 89999708987999999999999997879999965411203546433456554686781646779999787----0---- Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 67752002121223322059717860640 Q gi|254780971|r 60 GAIAARTPVMQAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 60 G~i~~~~~~~~~i~~~~~~g~~vLGICNG 88 (219) |. +..+.+.+.....|+|||--| T Consensus 79 GT------lL~a~r~~~~~~~PilGiN~G 101 (296) T PRK04539 79 GT------FLSVAREIAPRAVPIIGINQG 101 (296) T ss_pred HH------HHHHHHHHHHCCCCEEEEECC T ss_conf 89------999999860059978998447 No 206 >PRK05942 aspartate aminotransferase; Provisional Probab=55.13 E-value=9.1 Score=19.41 Aligned_cols=55 Identities=15% Similarity=0.236 Sum_probs=31.4 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCC Q ss_conf 996899862013302330867741154225036531688888988112------66888699 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRR 187 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~ 187 (219) ..++..+|.. .|+.|..|-+.+++...+.--++-.+.|+||.|+... |+.+|.+. T Consensus 142 g~~~~~v~l~-~~~~~~~d~~~l~~~i~~~tk~iil~~P~NPTG~v~~~e~l~~i~~~a~~~ 202 (394) T PRK05942 142 GAQIYPIILK-PENDWLIDLSSIPEEVAQQAKILYFNYPSNPTTATAPREFFEEIVAFARKY 202 (394) T ss_pred CCEEEEEEEC-CCCCCCCCHHHHHHHHHCCCCEEEEECCCCCCCHHCCHHHHHHHHHHHHCC T ss_conf 9989999705-876887799999986112670899938988101220199999999988517 No 207 >PRK06220 consensus Probab=54.85 E-value=18 Score=17.49 Aligned_cols=46 Identities=13% Similarity=0.204 Sum_probs=27.9 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC Q ss_conf 96899862013302330867741154225036531688888988112 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD 179 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~ 179 (219) ..++.+|... +..|..+.+.+++.....--++=+|+|.||.|.... T Consensus 137 ~~~v~v~~~~-~~~~~~d~~~l~~~~~~~~k~i~l~~P~NPtG~v~s 182 (384) T PRK06220 137 GTPVFVPVYE-EDGFQYRPEAIRACLTPRTKAILINSPSNPTGHVLS 182 (384) T ss_pred CEEEEECCCC-CCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCC T ss_conf 8399922463-237677999999747888739997289998786771 No 208 >PRK02649 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=54.80 E-value=18 Score=17.44 Aligned_cols=73 Identities=23% Similarity=0.314 Sum_probs=39.3 Q ss_pred CEEEEEECCCCCCHHHHH----HHHHHHCCCCEEEEECC---------C--------------CCCCCCCEEEECCCCCC Q ss_conf 968998449706479999----99998549963998215---------4--------------64456478997287345 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMI----KAITKIIGQSPILVWQS---------D--------------TDIPDVDLIVIPGGFSY 53 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~----~A~~~~~~~~~~~v~~~---------~--------------~~l~~~d~lvipGGFSy 53 (219) .||+|+.=++.---.+++ ..|.+ -|.+++.-... + .-..++|+++.-|| T Consensus 2 ~kiGIi~~~~~~~a~~~a~~l~~~L~~-~gi~v~l~~~~a~~l~~~~~~~~~~~~~~~~~~~~~~~~~~Dl~IvlGG--- 77 (305) T PRK02649 2 PKAGIIYNDGKPLAVRTAEELQDKLEA-AGWEVVRASSSGGMLGYANPDQPVCHTGIDALVPEGFDSSMKFAIVLGG--- 77 (305) T ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHH-CCCEEEEECCHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECC--- T ss_conf 889999738998999999999999998-8999999744123228787553211244111263335777339999837--- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 43114067752002121223322059717860640 Q gi|254780971|r 54 GDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 54 gD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNG 88 (219) | |. +..+.+.....+.|+|||--| T Consensus 78 -D----GT------~L~aar~~~~~~iPilGIN~G 101 (305) T PRK02649 78 -D----GT------VLSAARQTAPCGIPLLTINTG 101 (305) T ss_pred -C----HH------HHHHHHHHCCCCCCEEEEECC T ss_conf -6----99------999999853369978989448 No 209 >PRK08068 transaminase; Reviewed Probab=54.50 E-value=10 Score=19.13 Aligned_cols=55 Identities=15% Similarity=0.274 Sum_probs=29.9 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCC Q ss_conf 96899862013302330867741154225036531688888988112------668886999 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRG 188 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G 188 (219) .+++.+|. ..|+.|..|-+.+++.....--++-.+.|+||.|+... |+.+|.+.+ T Consensus 140 ~~~~~vpl-~~~~~~~~d~e~l~~~i~~~tk~iil~~P~NPTG~v~~~~~l~~l~~la~~~~ 200 (389) T PRK08068 140 AQFETMPL-IAENNFLPDYEKIPESVAEKAKLMYLNYPNNPTGAVATKAFFEETVAFAKKHN 200 (389) T ss_pred CEEEEEEC-CCCCCCCCCHHHHHHHCHHCCEEEEECCCCCCCCHHCCHHHHHHHHHHHHCCC T ss_conf 75689963-66679868999998743104659998899798461205999999999871077 No 210 >PRK13185 chlL protochlorophyllide reductase iron-sulfur ATP-binding protein; Provisional Probab=54.15 E-value=16 Score=17.75 Aligned_cols=19 Identities=11% Similarity=0.260 Sum_probs=9.9 Q ss_pred CHHHHHHHHHCCEEEEECC Q ss_conf 8677411542250365316 Q gi|254780971|r 150 DAKGLAEIEKNNQIVFRYA 168 (219) Q Consensus 150 ~~~~l~~l~~~~~i~~~Y~ 168 (219) +++...+-+.+++.++.|. T Consensus 207 ~~~~V~~se~~g~TviE~~ 225 (269) T PRK13185 207 DLDAIRRSRLKGKTLFEME 225 (269) T ss_pred CCHHHHHHHHCCCEEEEEC T ss_conf 9788999987498678858 No 211 >PRK07683 aminotransferase A; Validated Probab=52.67 E-value=9.1 Score=19.41 Aligned_cols=55 Identities=15% Similarity=0.190 Sum_probs=33.1 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCC Q ss_conf 96899862013302330867741154225036531688888988112------6688869998 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGN 189 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~ 189 (219) ...+.++.. +..|..+.+.++.....+.-++-.|+|.||.|.... |+.+|.+.+- T Consensus 135 ~~~v~v~~~--~~~~~~d~~~le~~~~~~~k~iil~~P~NPtG~v~s~e~l~~i~~la~~~~i 195 (387) T PRK07683 135 ATPVFIDTR--ETGFRLTAEALENAITEKTRCVVLPYPSNPTGVTLSKEELKDIANVLKDKNI 195 (387) T ss_pred CEEEECCCC--CCCCCCCHHHHHHHCCCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCC T ss_conf 723536878--6788789999997355257578647995989788789999999997613694 No 212 >pfam01812 5-FTHF_cyc-lig 5-formyltetrahydrofolate cyclo-ligase family. 5-formyltetrahydrofolate cyclo-ligase or methenyl-THF synthetase EC:6.3.3.2 catalyses the interchange of 5-formyltetrahydrofolate (5-FTHF) to 5-10-methenyltetrahydrofolate, this requires ATP and Mg2+. 5-FTHF is used in chemotherapy where it is clinically known as Leucovorin. Probab=52.08 E-value=14 Score=18.26 Aligned_cols=50 Identities=22% Similarity=0.372 Sum_probs=23.6 Q ss_pred CCCEEEECC-CCC-CCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEE Q ss_conf 647899728-734-543114067752002121223322059717860640310100 Q gi|254780971|r 41 DVDLIVIPG-GFS-YGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVE 94 (219) Q Consensus 41 ~~d~lvipG-GFS-ygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~e 94 (219) ..|++++|| +|. .|-+||=|+.....- +.+....+.+.+|+|-=||++-+ T Consensus 115 ~lDlilVP~vafD~~G~RLG~GgGyYDR~----L~~~~~~~~~~Igva~~~Q~~~~ 166 (182) T pfam01812 115 QIDLVLVPGVAFDRQGYRLGRGGGYYDRY----LARLQGHGPLTVGLAYDEQLVER 166 (182) T ss_pred CCCEEEECCEEECCCCCEEECCCCCHHHH----HHHCCCCCCEEEEEEEHHEEECC T ss_conf 68889955135758994700687407789----88554569839999820225176 No 213 >PRK10680 molybdopterin biosynthesis protein MoeA; Provisional Probab=51.82 E-value=20 Score=17.14 Aligned_cols=81 Identities=15% Similarity=0.144 Sum_probs=42.8 Q ss_pred CCHHHHHHHHHHHCCCCEEEEEC-CCC----------CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 64799999999854996399821-546----------4456478997287345431140677520021212233220597 Q gi|254780971|r 12 NRDNDMIKAITKIIGQSPILVWQ-SDT----------DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGI 80 (219) Q Consensus 12 Ncd~e~~~A~~~~~~~~~~~v~~-~~~----------~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~ 80 (219) |+-.-|..|+-+..|.++...-. .|. .+.++|+||.-||-|.||+=-.-.......-...-+-..+-|+ T Consensus 203 dsN~~~l~a~l~~~G~~~~~~g~v~Dd~~~i~~~l~~a~~~~DlvIttGGvS~G~~D~v~~~l~~~G~~~f~~VaikPGk 282 (411) T PRK10680 203 DTNRLAVHLMLEQLGCEVINLGIIRDDPHALRAAFIEADSQADVVISSGGVSVGEADYTKTILEELGEIAFWKLAIKPGK 282 (411) T ss_pred ECCHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHCCCCCCCEEEECCCC T ss_conf 44699999999978988999687378899999999976512898998477658863249999997386420517877999 Q ss_pred EE-EEECCCHHHH Q ss_conf 17-8606403101 Q gi|254780971|r 81 KV-MGICNGFQIL 92 (219) Q Consensus 81 ~v-LGICNGfQiL 92 (219) |+ +|..++.-++ T Consensus 283 P~~~g~~~~~~v~ 295 (411) T PRK10680 283 PFAFGKLSNSWFC 295 (411) T ss_pred CEEEEEECCCEEE T ss_conf 6699996884599 No 214 >COG1486 CelF Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases [Carbohydrate transport and metabolism] Probab=50.89 E-value=5.5 Score=20.81 Aligned_cols=23 Identities=26% Similarity=0.574 Sum_probs=14.0 Q ss_pred HHHHHHHHHCCCEEEEECCCHHHH Q ss_conf 212233220597178606403101 Q gi|254780971|r 69 MQAIKKKAQQGIKVMGICNGFQIL 92 (219) Q Consensus 69 ~~~i~~~~~~g~~vLGICNGfQiL 92 (219) .+++.+... ...++|+|||-|-. T Consensus 153 TeAv~r~~~-~~K~VGlCh~~~g~ 175 (442) T COG1486 153 TEAVRRLYP-KIKIVGLCHGPIGI 175 (442) T ss_pred HHHHHHHCC-CCCEEEECCCHHHH T ss_conf 999998578-78789758845789 No 215 >pfam09897 DUF2124 Uncharacterized protein conserved in archaea (DUF2124). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=50.13 E-value=16 Score=17.88 Aligned_cols=52 Identities=19% Similarity=0.362 Sum_probs=28.4 Q ss_pred CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHH--CCCEEEEECCCHHHHEECCCHH Q ss_conf 4456478997287345431140677520021212233220--5971786064031010000010 Q gi|254780971|r 38 DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQ--QGIKVMGICNGFQILVELNLLP 99 (219) Q Consensus 38 ~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~--~g~~vLGICNGfQiL~elGLlP 99 (219) ...+.|.+|+=||-+--.. +. -.+++++.++ .++.++|||= +.++-+.||.. T Consensus 77 ~~~~~D~lVlmGGLAMP~~---~v------t~e~v~~li~k~~~~kviGvCF-ms~F~kagW~~ 130 (147) T pfam09897 77 TDLNPDVLVLLGGLAMPKS---GV------TPEDVKELIEKLNPKKVIGVCF-MSMFEKAGWDD 130 (147) T ss_pred CCCCCCEEEEECCCCCCCC---CC------CHHHHHHHHHHCCCCCEEEEEH-HHHHHHCCCCC T ss_conf 7899989999676557999---98------9999999999649987899974-99898769987 No 216 >cd05296 GH4_P_beta_glucosidase Glycoside Hydrolases Family 4; Phospho-beta-glucosidase. Some bacteria simultaneously translocate and phosphorylate disaccharides via the phosphoenolpyruvate-dependent phosphotransferase system (PEP-PTS). After translocation, these phospho-disaccharides may be hydrolyzed by the GH4 glycoside hydrolases such as the phospho-beta-glucosidases. Other organisms (such as archaea and Thermotoga maritima ) lack the PEP-PTS system, but have several enzymes normally associated with the PEP-PTS operon. The 6-phospho-beta-glucosidase from Thermotoga maritima hydrolylzes cellobiose 6-phosphate (6P) into glucose-6P and glucose, in an NAD+ and Mn2+ dependent fashion. The Escherichia coli 6-phospho-beta-glucosidase (also called celF) hydrolyzes a variety of phospho-beta-glucosides including cellobiose-6P, salicin-6P, arbutin-6P, and gentobiose-6P. Phospho-beta-glucosidases are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein Probab=48.24 E-value=6.8 Score=20.19 Aligned_cols=23 Identities=30% Similarity=0.501 Sum_probs=15.3 Q ss_pred HHHHHHHHCCCEEEEECCCHHHHEE Q ss_conf 1223322059717860640310100 Q gi|254780971|r 70 QAIKKKAQQGIKVMGICNGFQILVE 94 (219) Q Consensus 70 ~~i~~~~~~g~~vLGICNGfQiL~e 94 (219) .++.++. +..++|+|++-|-+.+ T Consensus 152 ~a~~r~~--~~k~vGlCh~~~~~~~ 174 (419) T cd05296 152 EAVLRHT--GDRVIGLCNVPIGLQR 174 (419) T ss_pred HHHHHCC--CCCEEEECCCHHHHHH T ss_conf 9999668--9988997977798999 No 217 >PRK09147 aminotransferase; Provisional Probab=48.17 E-value=9.4 Score=19.32 Aligned_cols=58 Identities=14% Similarity=0.129 Sum_probs=30.3 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 96899862013302330867741154225036531688888988112------668886999899 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) .++..+|. ..|..|..|-+.+++.....--++=.|.|.||.|+..+ |+.+|.+.+-++ T Consensus 139 ~~~v~vp~-~~e~~~~~d~~~l~~~i~~~tk~iil~sP~NPTG~v~s~e~l~~l~~la~~~~i~i 202 (397) T PRK09147 139 AEPYFLNC-DPANGFAPDFDAVPAEVWARTQLLFVCSPGNPTGAVLPLDDWKKLFALSDRYGFVI 202 (397) T ss_pred CEEEEECC-CCCCCCCCCHHHHHHHHHHCCCEEEECCCCCCCCCCCCHHHHHHHHHHHCCCEEEE T ss_conf 86899235-71227776978989873112748986899898899888899999999632472899 No 218 >PRK06290 aspartate aminotransferase; Provisional Probab=47.40 E-value=24 Score=16.71 Aligned_cols=56 Identities=11% Similarity=0.237 Sum_probs=26.2 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCC Q ss_conf 996899862013302330867741154225036531688888988112------668886999 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRG 188 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G 188 (219) .+++..+|. ..|..|..+-+.+++.....--++-.+.|+||.|+... |+-+|.+.+ T Consensus 151 G~~~v~vp~-~~e~~~~~d~~~l~~~i~~ktK~iilnsP~NPTG~v~s~e~l~~i~~~a~~~~ 212 (411) T PRK06290 151 GGEVYNLPL-LEENNFLPDLDAIPKDVLEKAKLLYLNYPNNPTGAVATKEFYKEVVDFAFEND 212 (411) T ss_pred CCEEEEEEC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCEEECHHHHHHHHHHHHHCC T ss_conf 998999865-66678765957719862557659998899999861424899999999986428 No 219 >PRK06056 consensus Probab=46.20 E-value=18 Score=17.55 Aligned_cols=56 Identities=18% Similarity=0.309 Sum_probs=33.5 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCC Q ss_conf 96899862013302330867741154225036531688888988112------6688869998 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGN 189 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~ 189 (219) +..+.++. ..|+.|..|-+.++......--++-|+.|+||.|.... |+.+|.+.+- T Consensus 141 ~~~v~v~~-~~~~~~~~d~~~le~~~~~~tk~iil~~P~NPtG~v~s~e~l~~l~~la~~~~i 202 (402) T PRK06056 141 GVPVEVVA-GADQGYLVTVEQLEAARTERTKVLLFVSPSNPTGAVYSPEQVRAIGRWAAEHGI 202 (402) T ss_pred CCEEEECC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 97078223-733377879999996366687399984897988887427889999999997146 No 220 >PRK13232 nifH nitrogenase reductase; Reviewed Probab=46.13 E-value=17 Score=17.59 Aligned_cols=21 Identities=14% Similarity=0.158 Sum_probs=14.5 Q ss_pred EECHHHHHHHHHCCEEEEECC Q ss_conf 308677411542250365316 Q gi|254780971|r 148 FIDAKGLAEIEKNNQIVFRYA 168 (219) Q Consensus 148 ~~~~~~l~~l~~~~~i~~~Y~ 168 (219) +-.+..+.+-..+++.++.|- T Consensus 208 IPr~~~v~~A~~~g~tvie~~ 228 (273) T PRK13232 208 VPRDNIVQRAEINRKTVIDFD 228 (273) T ss_pred CCCCHHHHHHHHCCCEEEEEC T ss_conf 899778876487298358979 No 221 >pfam09508 Lact_bio_phlase Lacto-N-biose phosphorylase. The gene which codes for this protein in gut-bacteria is located in a novel putative operon for galactose metabolism. The protein appears to be a carbohydrate-processing phosphorolytic enzyme (EC:2.4.1.211), unlike either glycoside hydrolases or glycoside lyase. Intestinal colonisation by bifidobacteria is important for human health, especially in pediatrics, because colonisation seems to prevent infection by some pathogenic bacteria that cause diarrhoea or other illnesses. The operon seems to be involved in intestinal colonisation by bifidobacteria mediated by metabolism of mucin sugars. In addition, it may also resolve the question of the nature of the bifidus factor in human milk as the lacto-N-biose structure found in milk oligosaccharides. Probab=46.01 E-value=25 Score=16.57 Aligned_cols=95 Identities=20% Similarity=0.309 Sum_probs=58.2 Q ss_pred CEEEEEECCCCCCHHH---HHH------------HHHHHCCCCEEE--EECCC----CCCCCCCEEEECCCCCCCCCCCH Q ss_conf 9689984497064799---999------------999854996399--82154----64456478997287345431140 Q gi|254780971|r 1 MKTAIVQIPGLNRDND---MIK------------AITKIIGQSPIL--VWQSD----TDIPDVDLIVIPGGFSYGDYLRC 59 (219) Q Consensus 1 mkvaVl~~pGsNcd~e---~~~------------A~~~~~~~~~~~--v~~~~----~~l~~~d~lvipGGFSygD~l~a 59 (219) .|||||..=|.-.-.- .++ .++.+.|....+ +-..| ..++|+|.||=.|. -+.--.. T Consensus 436 ~kVAVLn~WGklRsW~~~mv~Hal~ykq~ysY~GilEaLSG~p~dV~FIsFdDi~~~gi~~didViINaGd--a~TA~SG 513 (716) T pfam09508 436 PRVAVLNSWGKLRSWQTHMVAHALYYKQTYSYAGILESLSGLPFEVEFISFDDILENGILEDIDVIINVGD--AGTAWSG 513 (716) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCCEEEEEECHHHHHHCCCCCCCCEEEECCC--CCCCCCC T ss_conf 34999803001105765324441021011028779988559970479954898865299767858995676--6564667 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEECC-------C--HHHHEECCC Q ss_conf 6775200212122332205971786064-------0--310100000 Q gi|254780971|r 60 GAIAARTPVMQAIKKKAQQGIKVMGICN-------G--FQILVELNL 97 (219) Q Consensus 60 G~i~~~~~~~~~i~~~~~~g~~vLGICN-------G--fQiL~elGL 97 (219) |..|....+..+|++|..+|+-.+||+- | ||+---||+ T Consensus 514 G~~W~d~~i~~~ir~fV~~GGGfIGVGePsA~~~~G~~FQLadVLGV 560 (716) T pfam09508 514 GSVWEDPRVISALRRFVAEGGGLIGVGEPSAHQYQGRYFQLADVLGV 560 (716) T ss_pred CCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCCCEEEEEHHHCCC T ss_conf 40137759999999999828967870686434569737860453440 No 222 >cd03131 GATase1_HTS Type 1 glutamine amidotransferase (GATase1)-like domain found in homoserine trans-succinylase (HTS). Type 1 glutamine amidotransferase (GATase1)-like domain found in homoserine trans-succinylase (HTS). HTS, the first enzyme in methionine biosynthesis in Escherichia coli, transfers a succinyl group from succinyl-CoA to homoserine forming succinyl homoserine. It has been suggested that the succinyl group of succinyl-CoA is initially transferred to an enzyme nucleophile before subsequent transfer to homoserine. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with GATase1 domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. It has been proposed that this cys is in the active site of the molecule. However, as succinyl has been found bound to a conserved lysine residue, this conserved cys may play a role in dimer formation. HTS acti Probab=45.75 E-value=9.5 Score=19.28 Aligned_cols=98 Identities=10% Similarity=0.114 Sum_probs=53.8 Q ss_pred CCCCCCEEEECCC----CCCCCCCCHHHHHHHHHHHHHHHHHH-HCCCEEEEECCCHHHHEEC--CCHHHHHCCCCCCCE Q ss_conf 4456478997287----34543114067752002121223322-0597178606403101000--001011012443324 Q gi|254780971|r 38 DIPDVDLIVIPGG----FSYGDYLRCGAIAARTPVMQAIKKKA-QQGIKVMGICNGFQILVEL--NLLPGILMRNCSLKF 110 (219) Q Consensus 38 ~l~~~d~lvipGG----FSygD~l~aG~i~~~~~~~~~i~~~~-~~g~~vLGICNGfQiL~el--GLlPg~l~~N~s~rf 110 (219) +-..||++|+-|- ..|.|+ ..+..+.+ +.+.+ ..-...|.||=|.|....- |+=.-.|-+-.+|-| T Consensus 59 ~~~~~DglIITGAPve~l~fe~v------~YW~El~~-i~dwa~~~v~stl~iCWaA~Aal~~~ygI~k~~l~~K~fGVf 131 (175) T cd03131 59 RDAKFDGLIVTGAPVEHLPFEQV------DYWEELTE-ILDWAKTHVTSTLFSCWAAMAALYYFYGIKKHQLPEKIFGVF 131 (175) T ss_pred HHCCCCEEEEECCCCCCCCCCCC------CCHHHHHH-HHHHHHHHCCCEEEEHHHHHHHHHHHHCCCCEECCCCEEEEE T ss_conf 41777668971899887883438------77999999-999998736633456499999999983998400687348864 Q ss_pred EEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCCEEE-EECHHHH Q ss_conf 5422576752523577641379968998620133023-3086774 Q gi|254780971|r 111 VCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHDGNY-FIDAKGL 154 (219) Q Consensus 111 ~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgEGrf-~~~~~~l 154 (219) ..+- .. ++|.++++. +.+.+|.| || .++.+.+ T Consensus 132 ~h~~-----~~-~~pLl~g~~--d~f~~PhS----R~t~i~~~~i 164 (175) T cd03131 132 PHTI-----LE-PHPLLRGLD--DGFDVPHS----RYAEVDREDI 164 (175) T ss_pred CCCC-----CC-CCCCCCCCC--CCCCCCCC----CCCCCCHHHH T ss_conf 1346-----89-990406789--86267858----8889899998 No 223 >PRK05395 3-dehydroquinate dehydratase; Provisional Probab=45.40 E-value=26 Score=16.51 Aligned_cols=54 Identities=24% Similarity=0.408 Sum_probs=27.8 Q ss_pred CEEEEEECCCCC--CHHH-------HH----HHHHH---HCCCCEEEEECC-CC--------CCCCCCEEEE-CCCCCCC Q ss_conf 968998449706--4799-------99----99998---549963998215-46--------4456478997-2873454 Q gi|254780971|r 1 MKTAIVQIPGLN--RDND-------MI----KAITK---IIGQSPILVWQS-DT--------DIPDVDLIVI-PGGFSYG 54 (219) Q Consensus 1 mkvaVl~~pGsN--cd~e-------~~----~A~~~---~~~~~~~~v~~~-~~--------~l~~~d~lvi-pGGFSyg 54 (219) ||+.||.-|..| +.|| +. ..+.+ ..+.+....-.+ |. ...++|++|| ||||+.. T Consensus 2 ~kIliinGPNLnlLG~Rep~iYG~~tl~~i~~~~~~~~~~~~i~~~~~QSN~EgelI~~Iq~a~~~~dgiIiNpga~THt 81 (143) T PRK05395 2 MKILVLNGPNLNLLGTREPEIYGTTTLADIEALLKEEAAELGVELEFFQSNHEGELIDRIHEARDDVDGIIINPGAYTHT 81 (143) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEECCCHHHHH T ss_conf 77999808981005787887488568999999999999975975999851868999999999863375799736213324 No 224 >KOG0259 consensus Probab=45.37 E-value=21 Score=17.10 Aligned_cols=36 Identities=19% Similarity=0.424 Sum_probs=27.4 Q ss_pred CEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC Q ss_conf 302330867741154225036531688888988112 Q gi|254780971|r 144 DGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD 179 (219) Q Consensus 144 EGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~ 179 (219) |-.+-+|-+.++.|.+++-+|+--..|.||+|+.+. T Consensus 182 e~~weIDL~~veal~DENT~AivviNP~NPcGnVys 217 (447) T KOG0259 182 EKDWEIDLDGVEALADENTVAIVVINPNNPCGNVYS 217 (447) T ss_pred CCCCEECHHHHHHHHCCCEEEEEEECCCCCCCCCCC T ss_conf 656153168899764467069999679999865151 No 225 >PRK07543 consensus Probab=45.08 E-value=26 Score=16.48 Aligned_cols=60 Identities=13% Similarity=0.175 Sum_probs=36.2 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEEE Q ss_conf 996899862013302330867741154225036531688888988112------6688869998999 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVLG 192 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vlg 192 (219) .+..+.++.. .|..|..+.+.+++.....--++=++.|+||.|.... |+.+|.+..+++- T Consensus 136 g~~~v~v~~~-~e~~~~~d~~~le~~i~~~~k~iil~~P~NPTG~v~s~~~l~~l~~~~~~~~~v~i 201 (400) T PRK07543 136 GGTPVFVETT-AEFGFKLTPEALEAAITPKTKWFIFNSPSNPTGAAYTRAELKALTDVLVKHPHVWV 201 (400) T ss_pred CCEEEEEECC-CCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCEE T ss_conf 8835995246-33375438477897478667199987998988837641678999998887538603 No 226 >PRK10333 putative ligase; Provisional Probab=44.86 E-value=16 Score=17.85 Aligned_cols=12 Identities=17% Similarity=0.421 Sum_probs=6.0 Q ss_pred EEECCCHHHHEE Q ss_conf 860640310100 Q gi|254780971|r 83 MGICNGFQILVE 94 (219) Q Consensus 83 LGICNGfQiL~e 94 (219) +|+|--||+.-+ T Consensus 165 Igla~~~Q~~~~ 176 (198) T PRK10333 165 VGYAHDCQLVEK 176 (198) T ss_pred EEEEECCEEECC T ss_conf 999712276065 No 227 >cd07023 S49_Sppa_N_C Signal peptide peptidase A (SppA), a serine protease, has catalytic Ser-Lys dyad. Signal peptide peptidase A (SppA; Peptidase S49; Protease IV): SppA is found in all three domains of life and is involved in the cleavage of signal peptides after their removal from the precursor proteins by signal peptidases. This subfamily contains members with either a single domain (sometimes referred to as 36K type), such as sohB peptidase, protein C and archaeal signal peptide peptidase, or an amino-terminal domain in addition to the carboxyl-terminal protease domain that is conserved in all the S49 family members (sometimes referred to as 67K type), similar to E. coli and Arabidopsis thaliana SppA peptidases. Site-directed mutagenesis and sequence analysis have shown these SppAs to be serine proteases. The predicted active site serine for members in this family occurs in a transmembrane domain. Mutagenesis studies also suggest that the catalytic center comprises a Ser-Lys dyad Probab=43.97 E-value=27 Score=16.37 Aligned_cols=66 Identities=21% Similarity=0.414 Sum_probs=40.6 Q ss_pred EEEEEECCCCC------CHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEE----CCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 68998449706------47999999998549963998215464456478997----287345431140677520021212 Q gi|254780971|r 2 KTAIVQIPGLN------RDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVI----PGGFSYGDYLRCGAIAARTPVMQA 71 (219) Q Consensus 2 kvaVl~~pGsN------cd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvi----pGGFSygD~l~aG~i~~~~~~~~~ 71 (219) |||||...|+- .-.+...+++++.... +.++|++ ||| ...+...+.++ T Consensus 1 ~iavi~~~G~i~~~~~~~~~~i~~~l~~A~~d~------------~vk~ivL~idSpGG----------~~~~s~ei~~~ 58 (208) T cd07023 1 KIAVIDIEGTISDGGGIGADSLIEQLRKAREDD------------SVKAVVLRINSPGG----------SVVASEEIYRE 58 (208) T ss_pred CEEEEEEEEEECCCCCCCHHHHHHHHHHHHHCC------------CCCEEEEEEECCCC----------CHHHHHHHHHH T ss_conf 979999887888999869999999999995089------------97489999748996----------29999999999 Q ss_pred HHHHHHCCCEEEEECCCH Q ss_conf 233220597178606403 Q gi|254780971|r 72 IKKKAQQGIKVMGICNGF 89 (219) Q Consensus 72 i~~~~~~g~~vLGICNGf 89 (219) ++++.+.++||+..+.+. T Consensus 59 i~~~k~~~KpV~a~~~~~ 76 (208) T cd07023 59 IRRLRKAKKPVVASMGDV 76 (208) T ss_pred HHHHHCCCCEEEEEECCH T ss_conf 998751498599997771 No 228 >PRK08363 alanine aminotransferase; Validated Probab=43.97 E-value=9.5 Score=19.27 Aligned_cols=33 Identities=18% Similarity=0.266 Sum_probs=11.8 Q ss_pred EEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC Q ss_conf 330867741154225036531688888988112 Q gi|254780971|r 147 YFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD 179 (219) Q Consensus 147 f~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~ 179 (219) |..|-+.+++.....--++-.++|+||.|.... T Consensus 152 ~~~d~~~l~~~i~~~tk~i~l~nP~NPTG~v~s 184 (398) T PRK08363 152 WQPDIDDIRKKITEKTKAIAVINPNNPTGALYD 184 (398) T ss_pred CCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCC T ss_conf 968999999753113729998799998888654 No 229 >PRK13236 nitrogenase reductase; Reviewed Probab=43.94 E-value=12 Score=18.69 Aligned_cols=33 Identities=30% Similarity=0.474 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHCCCEEEE--ECCCHHHHEECCCH Q ss_conf 0212122332205971786--06403101000001 Q gi|254780971|r 66 TPVMQAIKKKAQQGIKVMG--ICNGFQILVELNLL 98 (219) Q Consensus 66 ~~~~~~i~~~~~~g~~vLG--ICNGfQiL~elGLl 98 (219) ..+.+.+++|++++..-|| |||.-..--|..++ T Consensus 165 NNI~~~i~~~a~~g~~rlgGiI~N~r~~~~e~~~v 199 (295) T PRK13236 165 NNIARGILKYAHTGGVRLGGLICNSRNVDREIELI 199 (295) T ss_pred HHHHHHHHHHHHCCCCCEEEEEEECCCCCCHHHHH T ss_conf 99999999997426970358996078887479999 No 230 >PRK08069 consensus Probab=43.93 E-value=19 Score=17.41 Aligned_cols=152 Identities=18% Similarity=0.197 Sum_probs=69.3 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCCC-------CCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC--CCE----EE Q ss_conf 9999998549963998215464456-------4789972873454311406775200212122332205--971----78 Q gi|254780971|r 17 MIKAITKIIGQSPILVWQSDTDIPD-------VDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ--GIK----VM 83 (219) Q Consensus 17 ~~~A~~~~~~~~~~~v~~~~~~l~~-------~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~--g~~----vL 83 (219) .++.++ ..|.+.+..|..+-+++- ..--+--|=+.|++.-|. ..+.+++.++.++ |.. =+ T Consensus 21 ~a~~~~-~~g~~vi~l~~G~pdf~~p~~i~~a~~~~~~~~~~~Y~~~~G~------~~lreaia~~~~~~~g~~~~p~~V 93 (390) T PRK08069 21 KAKELK-AEGHDVIGLGAGEPDFNTPEHIIDAAHKAMNEGHTKYTPSGGL------QALKQEIVKKFKRDQGLAYDPSEI 93 (390) T ss_pred HHHHHH-HCCCCEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCH------HHHHHHHHHHHHHHHCCCCCHHEE T ss_conf 999999-7699847899978999988999999999985489989899672------999999999999986899881118 Q ss_pred EECCCHHHHEEC---CCH-HHHHCCCCCCCEEEEEEEEEEC---C-CHHHHHHHCCCCCEEEEEEECCCEEEEECHHHHH Q ss_conf 606403101000---001-0110124433245422576752---5-2357764137996899862013302330867741 Q gi|254780971|r 84 GICNGFQILVEL---NLL-PGILMRNCSLKFVCKQVLLEVV---N-SNTAFTKSYKMNQIIKCPVAHHDGNYFIDAKGLA 155 (219) Q Consensus 84 GICNGfQiL~el---GLl-Pg~l~~N~s~rf~~r~~~~~v~---~-~~s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l~ 155 (219) -||+|.|--..+ -++ ||- .+-+--+ . ...+- +..+.++.+|. .-+.+|..+.+.++ T Consensus 94 ~it~G~~~al~~~~~~l~~pGD------------~Vlv~~P~Y~~y~~~~~---~~g~~~v~v~~-~~~~~~~~d~~~l~ 157 (390) T PRK08069 94 IVCTGAKHALYTLFQVLLDEGD------------EVIIPTPYWVSYPEQVK---LAGGKPVYVEG-LEDNDFKITAEQLE 157 (390) T ss_pred EECCCHHHHHHHHHHHHCCCCC------------EEEECCCCCCCHHHHHH---HHCCEEEEECC-CCCCCCCCCHHHHH T ss_conf 9878889999999997279989------------89980686642799999---81880487145-66678767999999 Q ss_pred HHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 154225036531688888988112------668886999899 Q gi|254780971|r 156 EIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 156 ~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) +.....--++-++.|+||.|+... |+-+|-+.+-++ T Consensus 158 ~~i~~~~k~iil~~P~NPTG~v~s~e~l~~i~~la~~~~~~v 199 (390) T PRK08069 158 EAITEKTKAVIINSPSNPTGMIYTKEELAALGEVCLEHDILI 199 (390) T ss_pred HHCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCEEE T ss_conf 854436858998899698681655068999999987563059 No 231 >PRK01395 V-type ATP synthase subunit F; Provisional Probab=43.01 E-value=20 Score=17.25 Aligned_cols=28 Identities=21% Similarity=0.337 Sum_probs=22.3 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 68998449706479999999985499639982154 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD 36 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~ 36 (219) |+||+ +|+|+...|. +.|.+++.+...+ T Consensus 5 KIaVv------Gd~DsvlgF~-~lG~dvfpv~~~e 32 (104) T PRK01395 5 KIGVI------GDKDSILPFK-ALGFDVFPVIEEQ 32 (104) T ss_pred EEEEE------CCHHHHHHHH-HCCEEEEECCCHH T ss_conf 79998------6857878888-6580478548879 No 232 >PRK03767 TrpR binding protein WrbA; Provisional Probab=42.83 E-value=28 Score=16.26 Aligned_cols=59 Identities=17% Similarity=0.287 Sum_probs=35.5 Q ss_pred CEEEEEECCCCCCHHHHHHHHHH----HCCCCEEEEECCC-----------------------CCCCCCCEEEECCCCCC Q ss_conf 96899844970647999999998----5499639982154-----------------------64456478997287345 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITK----IIGQSPILVWQSD-----------------------TDIPDVDLIVIPGGFSY 53 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~----~~~~~~~~v~~~~-----------------------~~l~~~d~lvipGGFSy 53 (219) .||+||=+-++-.-+.|+++..+ ..|.++.+....| .+|.++|+|+|. T Consensus 2 ~kI~IvyyS~~G~t~~lA~~ia~Ga~~~~G~ev~l~~v~e~~~~~~~~~~~~~~~~~~~~a~~~dL~~aDgiifG----- 76 (200) T PRK03767 2 AKVLVLYYSMYGHIETMAEAVAEGAREVAGAEVTIKRVPETVPEEVAKKAGGKTDQAAPVATPQELADYDAIIFG----- 76 (200) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHCCCCCCCCCCCCCHHHHHHCCEEEEC----- T ss_conf 708999938998899999999976655179779998555648888997436876667883789999737722632----- Q ss_pred CCCCCHHHHHHH Q ss_conf 431140677520 Q gi|254780971|r 54 GDYLRCGAIAAR 65 (219) Q Consensus 54 gD~l~aG~i~~~ 65 (219) -..+=|.+++. T Consensus 77 -sPT~~G~~aaq 87 (200) T PRK03767 77 -TPTRFGNMAGQ 87 (200) T ss_pred -CCCCCCCCHHH T ss_conf -77556775499 No 233 >cd03143 A4_beta-galactosidase_middle_domain A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. This group includes proteins similar to beta-galactosidase from Thermus thermophilus. Beta-Galactosidase hydrolyzes the beta-1,4-D-galactosidic linkage of lactose, as well as those of related chromogens, o-nitrophenyl-beta-D-galactopyranoside (ONP-Gal) and 5-bromo-4-chloro-3-indolyl-beta-D-galactoside (X-gal). This A4 beta-galactosidase middle domain lacks the catalytic triad of typical GATase1 domains. The reactive Cys residue found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow in typical GATase1 domains is not conserved in this group. Probab=42.69 E-value=28 Score=16.25 Aligned_cols=54 Identities=20% Similarity=0.383 Sum_probs=35.6 Q ss_pred HHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 9999985499639982154644564789972873454311406775200212122332205971786 Q gi|254780971|r 18 IKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMG 84 (219) Q Consensus 18 ~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLG 84 (219) .++|.+ .+..+.++. .+.++.+|++||+|.=..-.| ...+.+.++.++|+.++. T Consensus 32 y~~l~~-~~~~vD~v~-~~~~l~~Y~lvv~P~l~~~~~-----------~~~~~l~~~v~~GG~lv~ 85 (154) T cd03143 32 YRALRE-LGIPVDVVP-PDADLSGYKLVVLPDLYLLSD-----------ATAAALRAYVENGGTLVA 85 (154) T ss_pred HHHHHH-CCCCCCCCC-CCCCCCCCCEEEECCEEECCH-----------HHHHHHHHHHHCCCEEEE T ss_conf 999998-499645247-878823385787450004599-----------999999999976997999 No 234 >pfam00155 Aminotran_1_2 Aminotransferase class I and II. Probab=42.59 E-value=28 Score=16.24 Aligned_cols=45 Identities=20% Similarity=0.406 Sum_probs=18.0 Q ss_pred CEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC Q ss_conf 6899862013302330867741154225036531688888988112 Q gi|254780971|r 134 QIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD 179 (219) Q Consensus 134 ~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~ 179 (219) .+..+|.. .+..|..|-+.+++..++...++=.+.|+||.|...+ T Consensus 110 ~~~~~~~~-~~~~~~~d~~~l~~~~~~~~~~i~i~~P~NPTG~~~~ 154 (351) T pfam00155 110 EVVRYPLY-DSNDFHLDFDALEAALKEKPKVVLHESPHNPTGTVAP 154 (351) T ss_pred EEEEEECC-CCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCC T ss_conf 77898456-4347798999999754159869998899198777768 No 235 >PRK09054 phosphogluconate dehydratase; Validated Probab=42.35 E-value=14 Score=18.18 Aligned_cols=31 Identities=29% Similarity=0.515 Sum_probs=17.4 Q ss_pred HHHHHHHHCCCEEEEECCC------HHHHEEC-CC-HHH Q ss_conf 1223322059717860640------3101000-00-101 Q gi|254780971|r 70 QAIKKKAQQGIKVMGICNG------FQILVEL-NL-LPG 100 (219) Q Consensus 70 ~~i~~~~~~g~~vLGICNG------fQiL~el-GL-lPg 100 (219) +++.+....-.+.-|-|.| .|+|+|. || ||| T Consensus 205 eeL~e~E~~a~ps~GtC~~mgTANTM~~l~EaLGL~LPG 243 (603) T PRK09054 205 DELLEAESASYHSPGTCTFYGTANSNQMLMEVMGLHLPG 243 (603) T ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCCC T ss_conf 999999961589998656647399999999996588887 No 236 >PRK13402 gamma-glutamyl kinase; Provisional Probab=42.05 E-value=29 Score=16.18 Aligned_cols=110 Identities=22% Similarity=0.291 Sum_probs=49.6 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEE Q ss_conf 14067752002121223322059717860640310100000101101244332454225767525235776413799689 Q gi|254780971|r 57 LRCGAIAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQII 136 (219) Q Consensus 57 l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l 136 (219) ++.|.|.++ .++-+...+.|.+++ ||||-+-=+-..++-|. +..-.|.... +..+.+-. T Consensus 206 ~GtGGM~TK---l~AA~~a~~~Gi~~~-Ia~G~~~~~i~~i~~g~---~~GT~F~p~~---~~~~~rK~----------- 264 (363) T PRK13402 206 VGTGGMRTK---IQAAKKAISHGIETF-IINGFTADIFNQLLKGQ---NPGTLFTPDE---KPMQEHLH----------- 264 (363) T ss_pred CCCCCHHHH---HHHHHHHHHCCCCEE-ECCCCCCCHHHHHHCCC---CCCEEEECCC---CCCCHHHH----------- T ss_conf 887834789---999999997799699-81699974799997089---8624995477---77644899----------- Q ss_pred EEEEECC---CEEEEECHHHHHHHHHCCEEEE----ECCCCCCCCCCCCCEEEEECCCCCEEE Q ss_conf 9862013---3023308677411542250365----316888889881126688869998999 Q gi|254780971|r 137 KCPVAHH---DGNYFIDAKGLAEIEKNNQIVF----RYASGTNPNGSLHDIAGVINRRGNVLG 192 (219) Q Consensus 137 ~~piaHg---EGrf~~~~~~l~~l~~~~~i~~----~Y~d~~NPNGS~~~IAgi~s~~G~vlg 192 (219) |++|+ .|+.++|+...+.|.+++.=.+ .=+.. +=+..++--|++++|+.+| T Consensus 265 --Wi~~~~~~~G~i~iD~GA~~Al~~~g~SLLp~GI~~v~G---~F~~Gd~V~i~~~~g~~ia 322 (363) T PRK13402 265 --WMTHTSQAQGEVVVENDFDTALDDHSEQLTSDDVVAVKG---DFSVGDTILVRKGDGTKLA 322 (363) T ss_pred --HHHCCCCCCEEEEECCCHHHHHHHCCCEECCCCCEEEEC---EECCCCEEEEECCCCCEEE T ss_conf --984046777239978038999974898016012478978---4669987999999998999 No 237 >PRK08912 hypothetical protein; Provisional Probab=40.45 E-value=16 Score=17.79 Aligned_cols=31 Identities=16% Similarity=0.190 Sum_probs=11.8 Q ss_pred EEECHHHHHHHHHCCEEEEECCCCCCCCCCC Q ss_conf 3308677411542250365316888889881 Q gi|254780971|r 147 YFIDAKGLAEIEKNNQIVFRYASGTNPNGSL 177 (219) Q Consensus 147 f~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~ 177 (219) |..+.+.++......--++-++.|.||.|.. T Consensus 145 ~~~d~~~l~~~~~~~tk~iil~~P~NPTG~v 175 (387) T PRK08912 145 WRLPEAALAAAFSPRTKAVLLNNPLNPAGKV 175 (387) T ss_pred CCCCHHHHHHCCCCCCEEEEECCCCCCCCCC T ss_conf 7889999985278886699989998988856 No 238 >COG1591 Holliday junction resolvase - archaeal type [DNA replication, recombination, and repair] Probab=39.75 E-value=31 Score=15.96 Aligned_cols=72 Identities=18% Similarity=0.284 Sum_probs=43.3 Q ss_pred CCCCCHHHHHHHHHHHCCCCEEEEECCC-CCCCCCCEEEECCCCCCCCCC------CHH-HHHHHHHHHHHHHHHHHC-- Q ss_conf 9706479999999985499639982154-644564789972873454311------406-775200212122332205-- Q gi|254780971|r 9 PGLNRDNDMIKAITKIIGQSPILVWQSD-TDIPDVDLIVIPGGFSYGDYL------RCG-AIAARTPVMQAIKKKAQQ-- 78 (219) Q Consensus 9 pGsNcd~e~~~A~~~~~~~~~~~v~~~~-~~l~~~d~lvipGGFSygD~l------~aG-~i~~~~~~~~~i~~~~~~-- 78 (219) -|++-|||+++.|++ .|++++.+-.+- ..-+-.|++..-||- +| +++ .+--+.-=.+.+.+|++. T Consensus 5 kG~~~EReLv~~L~e-~GfAvvR~paSG~sk~p~pDivA~~g~~----~l~iE~K~~~~~kiYl~~e~ve~L~~FA~~fG 79 (137) T COG1591 5 KGSRFERELVRILWE-RGFAVVRAPASGGSKRPLPDIVAGNGGV----YLAIEVKSRRETKIYLDKEQVEKLVEFARRFG 79 (137) T ss_pred CCCHHHHHHHHHHHH-CCCEEEECCCCCCCCCCCCCEEECCCCE----EEEEEEEECCCCCEEECHHHHHHHHHHHHHCC T ss_conf 332688999999986-4854998346789999998788528978----99999986158847875999999999999719 Q ss_pred CCEEEEE Q ss_conf 9717860 Q gi|254780971|r 79 GIKVMGI 85 (219) Q Consensus 79 g~~vLGI 85 (219) |.|.|+| T Consensus 80 g~p~iav 86 (137) T COG1591 80 GEPYIAV 86 (137) T ss_pred CCEEEEE T ss_conf 9568999 No 239 >cd05298 GH4_GlvA_pagL_like Glycoside Hydrolases Family 4; GlvA- and pagL-like glycosidases. Bacillus subtilis GlvA and Clostridium acetobutylicum pagL are 6-phospho-alpha-glucosidase, catalyzing the hydrolysis of alpha-glucopyranoside bonds to release glucose from oligosaccharides. The substrate specificities of other members of this subgroup are unknown. Some bacteria simultaneously translocate and phosphorylate disaccharides via the phosphoenolpyruvate-dependent phosphotransferase system (PEP_PTS). After translocation, these phospho-disaccharides may be hydrolyzed by the GH4 glycoside hydrolases, which include 6-phospho-beta-glucosidases, 6-phospho-alpha-glucosidases, alpha-glucosidases/alpha-glucuronidases (only from Thermotoga), and alpha-galactosidases. Members of this subfamily are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families including the NAD(P)-binding domains of alcohol dehydrogenases, tyrosine-dependent oxidoreductas Probab=39.06 E-value=21 Score=17.03 Aligned_cols=17 Identities=12% Similarity=0.456 Sum_probs=13.1 Q ss_pred CCCEEEEECCCHHHHEE Q ss_conf 59717860640310100 Q gi|254780971|r 78 QGIKVMGICNGFQILVE 94 (219) Q Consensus 78 ~g~~vLGICNGfQiL~e 94 (219) .+..++|+|+|-|-+.+ T Consensus 158 ~~~k~vGLCh~~~~~~~ 174 (437) T cd05298 158 PNARILNICDMPIAIMD 174 (437) T ss_pred CCCCEEEECCCHHHHHH T ss_conf 99988988978899999 No 240 >COG3340 PepE Peptidase E [Amino acid transport and metabolism] Probab=38.54 E-value=33 Score=15.84 Aligned_cols=89 Identities=18% Similarity=0.325 Sum_probs=54.7 Q ss_pred EEEEEECCCCCCHH-----HHHHHHHHHCCCCEEEEECCC-------CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHH Q ss_conf 68998449706479-----999999985499639982154-------644564789972873454311406775200212 Q gi|254780971|r 2 KTAIVQIPGLNRDN-----DMIKAITKIIGQSPILVWQSD-------TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVM 69 (219) Q Consensus 2 kvaVl~~pGsNcd~-----e~~~A~~~~~~~~~~~v~~~~-------~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~ 69 (219) ||+-|-+-+.--+. ....||.+ .|..+.-++... ..|.+.|+|.+.||=.| +-| ..+....+. T Consensus 34 ~i~FIPtAs~~~~~~~Yv~k~~~~l~~-lg~~v~~L~l~~~~~~~Ie~~l~~~d~IyVgGGNTF-~LL---~~lke~gld 108 (224) T COG3340 34 TIAFIPTASVDSEDDFYVEKVRNALAK-LGLEVSELHLSKPPLAAIENKLMKADIIYVGGGNTF-NLL---QELKETGLD 108 (224) T ss_pred EEEEEECCCCCCCHHHHHHHHHHHHHH-CCCEEEEEECCCCCHHHHHHHHHHCCEEEECCCHHH-HHH---HHHHHHCCH T ss_conf 289970476650207899999999987-697155425467999999976511558998885189-999---999984758 Q ss_pred HHHHHHHHCCCEEEEECCCHHHHEEC Q ss_conf 12233220597178606403101000 Q gi|254780971|r 70 QAIKKKAQQGIKVMGICNGFQILVEL 95 (219) Q Consensus 70 ~~i~~~~~~g~~vLGICNGfQiL~el 95 (219) +.+++..++|.+.+|+--|.-+-.+. T Consensus 109 ~iIr~~vk~G~~YiG~SAGA~ia~p~ 134 (224) T COG3340 109 DIIRERVKAGTPYIGWSAGANIAGPT 134 (224) T ss_pred HHHHHHHHCCCCEEEECCCCEEECCC T ss_conf 99999997698368743672465475 No 241 >PRK10446 ribosomal protein S6 modification protein; Provisional Probab=37.56 E-value=34 Score=15.74 Aligned_cols=85 Identities=18% Similarity=0.207 Sum_probs=45.5 Q ss_pred CEEEEEECCCCCCH-HHHHHHHHHHCCCCEEEEE----------------CCCCCCCCCCEEEECCCCCCCCCCCHHHHH Q ss_conf 96899844970647-9999999985499639982----------------154644564789972873454311406775 Q gi|254780971|r 1 MKTAIVQIPGLNRD-NDMIKAITKIIGQSPILVW----------------QSDTDIPDVDLIVIPGGFSYGDYLRCGAIA 63 (219) Q Consensus 1 mkvaVl~~pGsNcd-~e~~~A~~~~~~~~~~~v~----------------~~~~~l~~~d~lvipGGFSygD~l~aG~i~ 63 (219) ||++||.-+++... +-+..|+++. |.+..++. +....|+++|.++.=-|-|..-| + T Consensus 1 mki~iLs~~~~~yst~RL~eaa~~r-Gh~v~vidp~~~~~~i~~~~~~v~~~g~~L~~~DavipR~g~~~t~~-~----- 73 (300) T PRK10446 1 MKIAILSRDGTLYSCKRLREAAIQR-GHLVEILDPLSCYMNINPAASSIHYKGRKLPHFDAVIPRIGTAITFY-G----- 73 (300) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHC-CCEEEEEEHHHCEEEECCCCCEEEECCEECCCCCEEEECCCCCCCHH-H----- T ss_conf 9799993699760799999999987-99599961489189824898649989808687888998267774558-9----- Q ss_pred HHHHHHHHHHHHHHCCCEEE------EECC----CHHHHEECCC Q ss_conf 20021212233220597178------6064----0310100000 Q gi|254780971|r 64 ARTPVMQAIKKKAQQGIKVM------GICN----GFQILVELNL 97 (219) Q Consensus 64 ~~~~~~~~i~~~~~~g~~vL------GICN----GfQiL~elGL 97 (219) ...+..+...|.+++ -.|. -.|+|.+.|+ T Consensus 74 -----~~vLr~lE~~Gv~viN~~~aI~~~~DKl~t~qlL~~~gi 112 (300) T PRK10446 74 -----TAALRQFEMLGSYPLNESVAIARARDKLRSMQLLARQGI 112 (300) T ss_pred -----HHHHHHHHHCCCCEECCHHHHHHHHHHHHHHHHHHHCCC T ss_conf -----999999998899186489999985769999999987597 No 242 >KOG4716 consensus Probab=37.29 E-value=16 Score=17.83 Aligned_cols=35 Identities=43% Similarity=0.409 Sum_probs=17.5 Q ss_pred CCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 45647899728734543114067752002121223322 Q gi|254780971|r 39 IPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKA 76 (219) Q Consensus 39 l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~ 76 (219) --|||+|||.|| |-| |.+.+-|+.-...-+..+|. T Consensus 17 sydyDLIviGgG-SgG--LacaKeAa~~G~kV~~lDfV 51 (503) T KOG4716 17 SYDYDLIVIGGG-SGG--LACAKEAADLGAKVACLDFV 51 (503) T ss_pred CCCCCEEEECCC-CCH--HHHHHHHHHCCCCEEEEEEC T ss_conf 477447998688-513--66779988608707999612 No 243 >PRK04761 ppnK inorganic polyphosphate/ATP-NAD kinase; Reviewed Probab=37.10 E-value=32 Score=15.86 Aligned_cols=41 Identities=34% Similarity=0.612 Sum_probs=29.4 Q ss_pred CCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 1546445647899728734543114067752002121223322059717860640 Q gi|254780971|r 34 QSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 34 ~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g~~vLGICNG 88 (219) +....+.+.|++|.-|| | |. +..+++++...+.|++||--| T Consensus 18 y~~~~~~eaDliIvlGG----D----GT------~L~a~r~~~~~~~PilGIN~G 58 (246) T PRK04761 18 YGDVPIEEADVIVALGG----D----GF------MLQTLHRYMNSGKPVYGMNRG 58 (246) T ss_pred HCCCCHHHCCEEEEECC----C----HH------HHHHHHHHHCCCCCEEEEECC T ss_conf 58999657999999898----7----99------999999874149937989779 No 244 >PRK06375 consensus Probab=36.95 E-value=31 Score=15.99 Aligned_cols=155 Identities=14% Similarity=0.138 Sum_probs=68.4 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCC-------CCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC--CCE---- Q ss_conf 999999998549963998215464456-------4789972873454311406775200212122332205--971---- Q gi|254780971|r 15 NDMIKAITKIIGQSPILVWQSDTDIPD-------VDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQ--GIK---- 81 (219) Q Consensus 15 ~e~~~A~~~~~~~~~~~v~~~~~~l~~-------~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~--g~~---- 81 (219) .|.++++++ -|.+++..|..+-+++- ..-.+-.|-+.|++.- |. ..+++++.++.++ |.. T Consensus 18 ~~~a~~l~~-~G~dvi~l~iG~pdf~~P~~i~~a~~~~~~~~~~~Y~~~~--G~----~~lR~aia~~~~~~~g~~~~~~ 90 (381) T PRK06375 18 SNRVEKLRK-QGKKLYNFGIGEPDFTTPDNIIKYAFEMAKSGKTHYTPSA--GI----PELREKIALKLKSRNNIDAVAK 90 (381) T ss_pred HHHHHHHHH-CCCCCEECCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCC--CC----HHHHHHHHHHHHHHCCCCCCCC T ss_conf 999999997-7997077389999998689999999999737998889999--86----9999999999998559999957 Q ss_pred EEEECCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHH--HCCCCCEEEEEEECCCEEEEECHHHHHHHHH Q ss_conf 78606403101000001011012443324542257675252357764--1379968998620133023308677411542 Q gi|254780971|r 82 VMGICNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTK--SYKMNQIIKCPVAHHDGNYFIDAKGLAEIEK 159 (219) Q Consensus 82 vLGICNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~--~~~~~~~l~~piaHgEGrf~~~~~~l~~l~~ 159 (219) -+=|+.|.|-+. ++--+++ |.. ..|-+.-+.- ..+.. .+...+. +++.. +..|..|-+.+++... T Consensus 91 ~i~vt~~~~~l~---~~~~~l~-~pG-----D~Vii~~P~Y-~~y~~~~~~~g~~~--v~~~~-~~~~~~d~~~l~~~i~ 157 (381) T PRK06375 91 NVLVTPTKFGIN---LAMMVIL-NPG-----DEVLIPSPYY-VSYPDIVKLAGGKP--VFVDT-DDDYNIDIDNMRKAVT 157 (381) T ss_pred EEEEECHHHHHH---HHHHHHC-CCC-----CEEEECCCCC-HHHHHHHHHHCCEE--EEEEC-CCCCCCCHHHHHHHCC T ss_conf 699854899999---9999866-999-----9899818887-61999999819977--64203-8877979999996378 Q ss_pred CCEEEEECCCCCCCCCCCCC------EEEEECCCCC Q ss_conf 25036531688888988112------6688869998 Q gi|254780971|r 160 NNQIVFRYASGTNPNGSLHD------IAGVINRRGN 189 (219) Q Consensus 160 ~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~ 189 (219) ..--++=+|.|+||.|+... |+.+|.+.+- T Consensus 158 ~~tk~i~l~nP~NPTG~v~s~e~l~~i~~la~~~~i 193 (381) T PRK06375 158 PKTRAIIINNPVNPTGKVLSEKEIRELVDFALENNL 193 (381) T ss_pred CCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCEE T ss_conf 677499989997988999999999999998875658 No 245 >pfam01220 DHquinase_II Dehydroquinase class II. Probab=36.93 E-value=35 Score=15.68 Aligned_cols=53 Identities=23% Similarity=0.350 Sum_probs=27.4 Q ss_pred CEEEEEECCCCC--CHHH-----------HHHHHH---HHCCCCEEEEECCC---------CCCCCCCEEEE-CCCCCC Q ss_conf 968998449706--4799-----------999999---85499639982154---------64456478997-287345 Q gi|254780971|r 1 MKTAIVQIPGLN--RDND-----------MIKAIT---KIIGQSPILVWQSD---------TDIPDVDLIVI-PGGFSY 53 (219) Q Consensus 1 mkvaVl~~pGsN--cd~e-----------~~~A~~---~~~~~~~~~v~~~~---------~~l~~~d~lvi-pGGFSy 53 (219) ||+.||.-|..| +.|| ....+. +-.+.+....-.+. ....++|++|| ||+|+. T Consensus 1 mkILiinGPNLnlLG~Rep~iYG~~tl~~i~~~~~~~a~~~g~~~~~~QSN~EgeiId~I~~a~~~~dgiIiNpga~TH 79 (140) T pfam01220 1 MKILVLNGPNLNLLGKREPEIYGSTTLADIEARLRELAAELGVELEFFQSNHEGELIDWIHEARGDVDGIIINPAAYTH 79 (140) T ss_pred CEEEEECCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHCCCCEEEECCCHHEE T ss_conf 9799984898201577788748736999999999999997299268996286899999999975447689985621011 No 246 >PRK02645 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=35.93 E-value=36 Score=15.58 Aligned_cols=79 Identities=20% Similarity=0.280 Sum_probs=42.7 Q ss_pred EEEEEECCCCCCHHHHHHHHHH---HCCCCEEEEECCC----------CCCCCCCEEEECCCCCCCCCCCHHHHHHHHHH Q ss_conf 6899844970647999999998---5499639982154----------64456478997287345431140677520021 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITK---IIGQSPILVWQSD----------TDIPDVDLIVIPGGFSYGDYLRCGAIAARTPV 68 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~---~~~~~~~~v~~~~----------~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~ 68 (219) ||+|+.=|++---.|+++.+.+ .-|.++....... ..-...|+++.-|| | |. + T Consensus 5 ~VlIV~k~~~~~A~~~a~~l~~~L~~rGi~v~~~~~~~~~~~~~~~~~~~~~~~Dlvi~lGG----D----GT------~ 70 (304) T PRK02645 5 LVIIAYKAGSPQAKEAAERCAKQLEARGIKVLMGPSGPKDNPYPVFLASAEELPDLAIVLGG----D----GT------V 70 (304) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCCCCCCEEEEECC----C----HH------H T ss_conf 99999858999999999999999998889999844434447776200144668889999786----8----89------9 Q ss_pred HHHHHHHHHCCCEEEEECC-C-HHHHEE Q ss_conf 2122332205971786064-0-310100 Q gi|254780971|r 69 MQAIKKKAQQGIKVMGICN-G-FQILVE 94 (219) Q Consensus 69 ~~~i~~~~~~g~~vLGICN-G-fQiL~e 94 (219) ..+.+.+...+.|+|||-- | -=-|+| T Consensus 71 L~aar~~~~~~iPilGiN~~G~lGFLte 98 (304) T PRK02645 71 LAAARHLAPHDIPILSFNVGGHLGFLTH 98 (304) T ss_pred HHHHHHHCCCCCCEEEEECCCEEEEECC T ss_conf 9999985426998899824864697167 No 247 >PRK08175 aminotransferase; Validated Probab=33.78 E-value=32 Score=15.87 Aligned_cols=55 Identities=13% Similarity=0.216 Sum_probs=25.4 Q ss_pred CCCEEEEEEECCCEEEEECHHH-HHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCC Q ss_conf 9968998620133023308677-41154225036531688888988112------668886999 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKG-LAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRG 188 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~-l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G 188 (219) ...+..+|+ +-|+.|..+-+. ++.... .--++-.+.|+||.|.... |+.+|.+.+ T Consensus 136 g~~~~~vpl-~~~~~~~~~le~~~~~~~~-ktk~iiln~P~NPTG~v~s~e~l~~i~~~a~~~~ 197 (395) T PRK08175 136 GAQVRSVPL-VEGVDFFNELERAIRESYP-KPKMMILGFPSNPTAQCVELEFFEKVVALAKRYD 197 (395) T ss_pred CCCEEEEEC-CCCCCCHHHHHHHHHHCCC-CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCC T ss_conf 784078504-5777604419999972687-7459998899998887579999999999888669 No 248 >PRK03378 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=33.56 E-value=39 Score=15.34 Aligned_cols=72 Identities=29% Similarity=0.361 Sum_probs=37.7 Q ss_pred EEEEEECCCCCCHHHHHHHHHHH---CCCCEEEEECC--------C---CC----CCCCCEEEECCCCCCCCCCCHHHHH Q ss_conf 68998449706479999999985---49963998215--------4---64----4564789972873454311406775 Q gi|254780971|r 2 KTAIVQIPGLNRDNDMIKAITKI---IGQSPILVWQS--------D---TD----IPDVDLIVIPGGFSYGDYLRCGAIA 63 (219) Q Consensus 2 kvaVl~~pGsNcd~e~~~A~~~~---~~~~~~~v~~~--------~---~~----l~~~d~lvipGGFSygD~l~aG~i~ 63 (219) +|+|+.=|..-.-.+++..+.++ -+.++.+ ... + .+ -.+.|++|.-|| | |. T Consensus 7 ~IgIi~k~~~~~a~~~~~~l~~~L~~~g~~v~l-d~~~a~~l~~~~~~~~~~~~~~~~~Dlvi~lGG----D----GT-- 75 (292) T PRK03378 7 CIGIVGHPRHPTALTTHEMLYRWLCTKGYEVIV-EQQIAHELQLKNVKTGTLAEIGQQADLAVVVGG----D----GN-- 75 (292) T ss_pred EEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEE-CHHHHHHCCCCCCCCCCHHHHCCCCCEEEEECC----C----HH-- T ss_conf 999995279858999999999999978998998-347876529986554786685305689999777----8----89-- Q ss_pred HHHHHHHHHHHHHHCCCEEEEECCC Q ss_conf 2002121223322059717860640 Q gi|254780971|r 64 ARTPVMQAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 64 ~~~~~~~~i~~~~~~g~~vLGICNG 88 (219) +..+.+.+...+.|+|||--| T Consensus 76 ----~L~aar~~~~~~~PilGIN~G 96 (292) T PRK03378 76 ----MLGAARTLARYDIKVIGINRG 96 (292) T ss_pred ----HHHHHHHHCCCCCCEEEEECC T ss_conf ----999999854369968988379 No 249 >TIGR02922 TIGR02922 conserved hypothetical protein TIGR02922; InterPro: IPR014271 Two members of this family are found in Colwellia psychrerythraea (strain 34H / ATCC BAA-681) and one each in various other species of Colwellia and Shewanella. One member from C. psychrerythraea is of special interest because it is preceded by the same cis-regulatory site as a number of genes that have the PEP-CTERM domain described by IPR013424 from INTERPRO.. Probab=33.04 E-value=19 Score=17.42 Aligned_cols=22 Identities=23% Similarity=0.665 Sum_probs=17.3 Q ss_pred HHHCCCEEEEECCC-HHHHEECC Q ss_conf 22059717860640-31010000 Q gi|254780971|r 75 KAQQGIKVMGICNG-FQILVELN 96 (219) Q Consensus 75 ~~~~g~~vLGICNG-fQiL~elG 96 (219) .+++||.++.+|.| |.+|=.+| T Consensus 42 eFk~GKsIiAV~EG~i~ilNsvG 64 (69) T TIGR02922 42 EFKKGKSIIAVCEGEITILNSVG 64 (69) T ss_pred CCCCCCEEEEEECCCEEEECCCC T ss_conf 56689868998248668850355 No 250 >TIGR03538 DapC_gpp succinyldiaminopimelate transaminase. This family of succinyldiaminopimelate transaminases (DapC) includes the experimentally characterized enzyme from Bordatella pertussis. The majority of genes in this family are proximal to genes encoding components of the lysine biosynthesis via diaminopimelate pathway (GenProp0125). Probab=32.91 E-value=39 Score=15.32 Aligned_cols=57 Identities=12% Similarity=0.164 Sum_probs=31.0 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCE Q ss_conf 96899862013302330867741154225036531688888988112------66888699989 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNV 190 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~v 190 (219) .++..+|. ..|+.|..|-+.+++.....--++-.+.|+||.|+... |+-+|.+.+-. T Consensus 139 ~~~~~v~~-~~e~~~~~d~~~l~~~~~~~tk~iil~sP~NPTG~v~s~~~l~~l~~~a~~~~i~ 201 (394) T TIGR03538 139 AEPYFLNC-TAENGFLPDFDAVPESVWRRCQLLFVCSPGNPTGAVLSLETLKKLIELADQYGFI 201 (394) T ss_pred CEEEECCC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEE T ss_conf 98997586-7345888899999874523665999889999989777879999999997326769 No 251 >PRK01185 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=32.45 E-value=41 Score=15.22 Aligned_cols=71 Identities=25% Similarity=0.447 Sum_probs=36.7 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC---------CCCC--CCCEEEECCCCCCCCCCCHHHHHHHHHHH Q ss_conf 968998449706479999999985499639982154---------6445--64789972873454311406775200212 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD---------TDIP--DVDLIVIPGGFSYGDYLRCGAIAARTPVM 69 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~---------~~l~--~~d~lvipGGFSygD~l~aG~i~~~~~~~ 69 (219) ||+||+.=+..---.++++.+.++......++..++ .++. +.|+++.-|| | |.+... T Consensus 1 MKi~iv~~~~~~~a~~~a~~l~~~L~~~~~v~~d~~~a~~l~~~~~~~~e~~~Dlvi~iGG----D----GTlLr~---- 68 (272) T PRK01185 1 MKVAFVIRKDCKRCANIARSIISLLPSDWEIIYDKEAAKFLKKPGLDINEISADIIIVIGG----D----GTVLRT---- 68 (272) T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCCCCCCCCCCCCCCEEEEECC----C----HHHHHH---- T ss_conf 9899999789989999999999999669919985688887099999855689889999778----3----999999---- Q ss_pred HHHHHHHHCCCEEEEECCC Q ss_conf 1223322059717860640 Q gi|254780971|r 70 QAIKKKAQQGIKVMGICNG 88 (219) Q Consensus 70 ~~i~~~~~~g~~vLGICNG 88 (219) .+ ...+|+|||--| T Consensus 69 --~~---~~~~PilGIN~G 82 (272) T PRK01185 69 --LQ---FAKGPVLGINMG 82 (272) T ss_pred --HH---HCCCCEEEECCC T ss_conf --98---779999998178 No 252 >PRK06358 threonine-phosphate decarboxylase; Provisional Probab=32.19 E-value=41 Score=15.20 Aligned_cols=10 Identities=40% Similarity=0.677 Sum_probs=4.5 Q ss_pred CCCCCCCCCC Q ss_conf 8888898811 Q gi|254780971|r 169 SGTNPNGSLH 178 (219) Q Consensus 169 d~~NPNGS~~ 178 (219) .|+||.|+.. T Consensus 149 nPnNPTG~~~ 158 (354) T PRK06358 149 NPNNPTGQLI 158 (354) T ss_pred CCCCCCCCCC T ss_conf 8999866406 No 253 >TIGR01351 adk adenylate kinases; InterPro: IPR006259 Most members of this family are known or believed to be adenylate kinase. Adenylate kinase (ADK) 2.7.4.3 from EC converts ATP + AMP to ADP + ADP, that is, it uses ATP as a phosphate donor for AMP. Two ADK isozymes have been identified in mammalian cells. These specifically bind AMP and favour binding to ATP over other nucleotide triphosphates (AK1 is cytosolic and AK2 is located in the mitochondria). However, some members accept other nucleotide triphosphates as donors, and may be unable to use ATP, and may fail to complement adenylate kinase mutants. An example of a nucleoside-triphosphate--adenylate kinase (2.7.4.10 from EC) is Q9UIJ7 from SWISSPROT, a GTP:AMP phosphotransferase that has been identified in bovine heart and human cells and derived from mitrochondrail GTP AMP that is specific for the phosphorylation of AMP, but can only use GTP or ITP as a substrate .This family is designated subfamily rather than equivalog for this reason. ADK has also been identified in different bacterial species and in yeast . Two further enzymes are known to be related to the ADK family, i.e. yeast uridine monophosphokinase and slime mold UMP-CMP kinase. Within the ADK family there are several conserved regions, including the ATP-binding domains. One of the most conserved areas includes an Arg residue, whose modification inactivates the enzyme, together with an Asp that resides in the catalytic cleft of the enzyme and participates in a salt bridge. ; GO: 0005524 ATP binding, 0016776 phosphotransferase activity phosphate group as acceptor, 0019201 nucleotide kinase activity, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=32.16 E-value=15 Score=17.98 Aligned_cols=83 Identities=24% Similarity=0.254 Sum_probs=45.6 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHH--HHHCCCEEEEECCCHHHHEECCCHHHHHCCCCCC---CEEEEEEEEEECCC--HH Q ss_conf 45431140677520021212233--2205971786064031010000010110124433---24542257675252--35 Q gi|254780971|r 52 SYGDYLRCGAIAARTPVMQAIKK--KAQQGIKVMGICNGFQILVELNLLPGILMRNCSL---KFVCKQVLLEVVNS--NT 124 (219) Q Consensus 52 SygD~l~aG~i~~~~~~~~~i~~--~~~~g~~vLGICNGfQiL~elGLlPg~l~~N~s~---rf~~r~~~~~v~~~--~s 124 (219) |=||-||+ ++++.+++-+.+++ +.++|.+|== ++. .+|+-..|..+... |.+.-|..-..+.| +. T Consensus 29 STGDllR~-~~~~~T~LG~~~k~y~y~~~G~LVPD-----~~v--~~lv~~rl~~~~~~~~~~~~~GfILDGfPRT~~QA 100 (232) T TIGR01351 29 STGDLLRA-AVKAGTPLGKKAKEYNYMDKGELVPD-----EIV--NQLVKERLQQNPDCVSLKSENGFILDGFPRTLSQA 100 (232) T ss_pred CCHHHHHH-HHHHCCHHHHHHHCCCCCCCCCCCCH-----HHH--HHHHHHHHHCCCCCEEEEECCCEEECCCCCCHHHH T ss_conf 02589999-98707977898732672003775778-----999--99999997456000110106862663888878999 Q ss_pred HHHHHC------CCCCEEEEEEEC Q ss_conf 776413------799689986201 Q gi|254780971|r 125 AFTKSY------KMNQIIKCPVAH 142 (219) Q Consensus 125 ~~~~~~------~~~~~l~~piaH 142 (219) -+|..+ +...|+.+-|.+ T Consensus 101 eaL~~~l~~~g~~~d~V~~L~vp~ 124 (232) T TIGR01351 101 EALDAMLKELGLPIDAVIELDVPD 124 (232) T ss_pred HHHHHHHHHCCCCCCEEEEEECCC T ss_conf 999999986189841788853587 No 254 >TIGR01438 TGR thioredoxin and glutathione reductase; InterPro: IPR006338 This homodimeric, FAD-containing member of the pyridine nucleotide disulphide oxidoreductase family contains a C-terminal motif Cys-SeCys-Gly, where SeCys is selenocysteine encoded by TGA (in some sequence reports interpreted as a stop codon). In some members of this subfamily, Cys-SeCys-Gly is replaced by Cys-Cys-Gly. The reach of the selenium atom at the C-terminal arm of the protein is proposed to allow broad substrate specificity , .; GO: 0016654 oxidoreductase activity acting on NADH or NADPH disulfide as acceptor, 0050660 FAD binding, 0050661 NADP binding, 0006118 electron transport. Probab=31.69 E-value=42 Score=15.14 Aligned_cols=17 Identities=24% Similarity=0.405 Sum_probs=10.9 Q ss_pred HHHHHHHCCEEEEECCC Q ss_conf 74115422503653168 Q gi|254780971|r 153 GLAEIEKNNQIVFRYAS 169 (219) Q Consensus 153 ~l~~l~~~~~i~~~Y~d 169 (219) .|.++.+.+.|..+|.+ T Consensus 123 ayaefvdk~ki~i~~t~ 139 (513) T TIGR01438 123 AYAEFVDKDKIVIKATD 139 (513) T ss_pred EEEEEECCCEEEEEEEE T ss_conf 00244278468999860 No 255 >TIGR02817 adh_fam_1 zinc-binding alcohol dehydrogenase family protein; InterPro: IPR014182 Members of this entry form a distinct subset of the larger family of oxidoreductases that includes zinc-binding alcohol dehydrogenases and NADPH:quinone reductases. Some current members of this entry carry designations as putative alginate lyase, however none have been characterised as such.. Probab=31.66 E-value=6.1 Score=20.49 Aligned_cols=52 Identities=15% Similarity=0.263 Sum_probs=29.8 Q ss_pred EEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 99844970647999999998549963998215464456478997287345431140677520021212233 Q gi|254780971|r 4 AIVQIPGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKK 74 (219) Q Consensus 4 aVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~ 74 (219) +=|.=||||||+-++ +..+|-+.+..|+=.+.=++| |. +|-|++-|+|-|.= T Consensus 92 G~i~RpGsNaEfhLV---------DERIVG~KP~sLsfa~AAALP--------LT--~ITAWElLFDRL~~ 143 (338) T TIGR02817 92 GDITRPGSNAEFHLV---------DERIVGHKPKSLSFAEAAALP--------LT--SITAWELLFDRLGI 143 (338) T ss_pred CCCCCCCHHHHHHHH---------HHHHHCCCCCCCHHHHHHHCC--------HH--HHHHHHHHHHHHCC T ss_conf 336888314688877---------899836898960089997310--------24--89999998866158 No 256 >PRK12448 dihydroxy-acid dehydratase; Provisional Probab=31.49 E-value=36 Score=15.57 Aligned_cols=31 Identities=32% Similarity=0.670 Sum_probs=15.9 Q ss_pred HHHHHHHHCCCEEEEECCC------HHHHEE-CCC-HHH Q ss_conf 1223322059717860640------310100-000-101 Q gi|254780971|r 70 QAIKKKAQQGIKVMGICNG------FQILVE-LNL-LPG 100 (219) Q Consensus 70 ~~i~~~~~~g~~vLGICNG------fQiL~e-lGL-lPg 100 (219) +++.+..+.-.|.-|.|.| +|+|+| ||| ||| T Consensus 178 eel~~~E~~a~p~~GsC~gmgTANTM~~l~EaLGl~LPG 216 (615) T PRK12448 178 EDVAQIERSACPTCGSCSGMFTANSMNCLTEALGLSLPG 216 (615) T ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 999999972389998555668699999999998578777 No 257 >KOG4649 consensus Probab=31.06 E-value=41 Score=15.21 Aligned_cols=37 Identities=19% Similarity=0.192 Sum_probs=29.1 Q ss_pred EEEECCCC------CCCCCCCCCEEEEECCCCCEEEECCCCHH Q ss_conf 36531688------88898811266888699989998488434 Q gi|254780971|r 163 IVFRYASG------TNPNGSLHDIAGVINRRGNVLGMMPHPEN 199 (219) Q Consensus 163 i~~~Y~d~------~NPNGS~~~IAgi~s~~G~vlgmMPHPER 199 (219) .+--|.+| .|||+|..-.-..|+.||+|.-||--|-- T Consensus 261 t~~vy~~p~l~F~~h~~~~S~~~ll~~~s~dgkv~il~~~~sl 303 (354) T KOG4649 261 TLHVYLSPKLRFDLHSPGISYPKLLRRSSGDGKVMILMTSKSL 303 (354) T ss_pred EEEEEECCCCCEECCCCCCCCHHHHHHHCCCCCEEEEEECCCC T ss_conf 8999957642100368788605664543479957999945411 No 258 >TIGR02692 tRNA_CCA_actino tRNA adenylyltransferase; InterPro: IPR014065 The enzyme tRNA adenylyltransferase, also called tRNA-nucleotidyltransferase and CCA-adding enzyme, can add or repair the required CCA triplet at the 3 -end of tRNA molecules. Genes encoding tRNA include the CCA tail in some but not all bacteria, and this enzyme may be required for viability. Members of this family represent a distinct clade within the larger family IPR002646 from INTERPRO (tRNA nucleotidyltransferase/poly(A) polymerase family protein). The example from Streptomyces coelicolor was shown to act as a CCA-adding enzyme and not as a poly(A) polymerase.. Probab=30.82 E-value=19 Score=17.42 Aligned_cols=76 Identities=22% Similarity=0.317 Sum_probs=41.8 Q ss_pred CEEEECC--CCCCCCC----CCHHHHHHH------HHHHHHHHH-------------HHHCCCEEEE--ECCCHHHHEEC Q ss_conf 7899728--7345431----140677520------021212233-------------2205971786--06403101000 Q gi|254780971|r 43 DLIVIPG--GFSYGDY----LRCGAIAAR------TPVMQAIKK-------------KAQQGIKVMG--ICNGFQILVEL 95 (219) Q Consensus 43 d~lvipG--GFSygD~----l~aG~i~~~------~~~~~~i~~-------------~~~~g~~vLG--ICNGfQiL~el 95 (219) ..|--|. --||+|+ |||.++.++ ..+..++.+ ..+=+|.++| --.|-.+|+++ T Consensus 155 ~~L~TP~~Pe~SF~DDPLRMLRAARf~sQLgF~v~~~v~~Am~~mA~~i~~isAERv~~EL~Kll~~~~P~~Gi~llV~T 234 (472) T TIGR02692 155 KVLDTPATPEQSFSDDPLRMLRAARFVSQLGFEVAPRVRAAMTEMADRIEIISAERVRDELDKLLLGDHPREGIDLLVET 234 (472) T ss_pred HHCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 84057887435678881689999998776076124789999999887743324788888874201477836678998740 Q ss_pred CCH----HH--HHCCCCCCCEEEEEEEEE Q ss_conf 001----01--101244332454225767 Q gi|254780971|r 96 NLL----PG--ILMRNCSLKFVCKQVLLE 118 (219) Q Consensus 96 GLl----Pg--~l~~N~s~rf~~r~~~~~ 118 (219) ||. |. +|--..-.+|.-|.|+-. T Consensus 235 Gla~~vlPE~~alrl~~DEHh~HKDVY~H 263 (472) T TIGR02692 235 GLADRVLPEIPALRLEIDEHHQHKDVYEH 263 (472) T ss_pred CCHHHCCCCCCHHHHCCCCCCCCCHHHHH T ss_conf 40111046762024132233567547889 No 259 >cd07565 aliphatic_amidase aliphatic amidases (class 2 nitrilases). Aliphatic amidases catalyze the hydrolysis of short-chain aliphatic amides to form ammonia and the corresponding organic acid. This group includes Pseudomonas aeruginosa (Pa) AmiE, the amidase from Geobacillus pallidus RAPc8 (RAPc8 amidase), and Helicobacter pylori (Hp) AmiE and AmiF. PaAimE and HpAmiE hydrolyze various very short aliphatic amides, including propionamide, acetamide and acrylamide. HpAmiF is a formamidase which specifically hydrolyzes formamide. These proteins belong to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 2. Members of this superfamily generally form homomeric complexes, the basic Probab=29.82 E-value=34 Score=15.70 Aligned_cols=75 Identities=21% Similarity=0.381 Sum_probs=42.7 Q ss_pred CEEEEEEC--CCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCC----CCCCCCCCHHHHHH--HHHHHHHH Q ss_conf 96899844--970647999999998549963998215464456478997287----34543114067752--00212122 Q gi|254780971|r 1 MKTAIVQI--PGLNRDNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGG----FSYGDYLRCGAIAA--RTPVMQAI 72 (219) Q Consensus 1 mkvaVl~~--pGsNcd~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGG----FSygD~l~aG~i~~--~~~~~~~i 72 (219) +|||++++ |..++-.|....++++..+ +.......+..|+||||== |++..... -..+. ..+..+.+ T Consensus 1 v~va~vQ~~~~~~~~~~dv~~n~~~i~~~----i~~a~a~~~GadLIVlPE~~~tG~~~~~~~~-~~~a~~~~Gp~~~~l 75 (291) T cd07565 1 VGVAVVQYKVPVLHTKEEVLENAERIADM----VEGTKRGLPGMDLIVFPEYSTQGLMYDKWTM-DETACTVPGPETDIF 75 (291) T ss_pred CEEEEEECCCCCCCCHHHHHHHHHHHHHH----HHHHHHCCCCCEEEECCCCCCCCCCCCHHHH-HHHHHCCCCHHHHHH T ss_conf 96999963488889989999999999999----9999734999869988886557899776788-875422998799999 Q ss_pred HHHHHCCC Q ss_conf 33220597 Q gi|254780971|r 73 KKKAQQGI 80 (219) Q Consensus 73 ~~~~~~g~ 80 (219) .+.+++-. T Consensus 76 ~~~A~~~~ 83 (291) T cd07565 76 AEACKEAK 83 (291) T ss_pred HHHHHHCC T ss_conf 99998669 No 260 >PRK09148 aminotransferase; Validated Probab=28.81 E-value=29 Score=16.17 Aligned_cols=57 Identities=14% Similarity=0.195 Sum_probs=27.6 Q ss_pred CCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCE Q ss_conf 96899862013302330867741154225036531688888988112------66888699989 Q gi|254780971|r 133 NQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNV 190 (219) Q Consensus 133 ~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~v 190 (219) ..+..+|.. .++.|..+-+.+.+.....--++-.+.|+||.|.... |+.+|.+.+-+ T Consensus 139 ~~~~~v~~~-~~~~~~~~le~~~~~~~~ktk~iil~~P~NPTG~v~~~~~l~~l~~~a~~~~i~ 201 (406) T PRK09148 139 GVIRSVPAE-PDEEFFPALERAVRHSIPKPIALIVNYPSNPTAYVADLDFYKDLVAFAKKHDIW 201 (406) T ss_pred CCEEECCCC-CCCCCCCCHHHHHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCEEE T ss_conf 915860577-643333029999974587644999778999888768999999999875405489 No 261 >TIGR02280 PaaB1 phenylacetate degradation probable enoyl-CoA hydratase paaB; InterPro: IPR011968 This family of proteins are found within apparent operons for the degradation of phenylacetic acid . These proteins contain the enoyl-CoA hydratase domain. This activity is consistent with current hypotheses for the degradation pathway which involve the ligation of phenylacetate with coenzyme A (paaF), hydroxylation (paaGHIJK), ring-opening (paaN) and degradation of the resulting fatty acid-like compound to a Krebs cycle intermediate (paaABCDE).. Probab=28.67 E-value=47 Score=14.82 Aligned_cols=61 Identities=25% Similarity=0.427 Sum_probs=39.6 Q ss_pred CCCCEEEECC---CCCCCCCC-----CHH-HH--H--HHHHHHHHHHHHHH-CCCEEEEECCC----------------- Q ss_conf 5647899728---73454311-----406-77--5--20021212233220-59717860640----------------- Q gi|254780971|r 40 PDVDLIVIPG---GFSYGDYL-----RCG-AI--A--ARTPVMQAIKKKAQ-QGIKVMGICNG----------------- 88 (219) Q Consensus 40 ~~~d~lvipG---GFSygD~l-----~aG-~i--~--~~~~~~~~i~~~~~-~g~~vLGICNG----------------- 88 (219) .+++.|++-| |||-|-+| +++ .. . .-+..++-+.+..+ =-+||+-=-|| T Consensus 42 ~~~R~l~LTGAGRGFcAGQDL~dr~v~~~G~~pDLG~~ve~fYnPLvRrL~~Lp~PVv~AVNGVAAGAGAnLALAcDIvl 121 (259) T TIGR02280 42 DDARALLLTGAGRGFCAGQDLSDRNVTPGGAAPDLGETVEKFYNPLVRRLRALPKPVVCAVNGVAAGAGANLALACDIVL 121 (259) T ss_pred CCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHCHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHH T ss_conf 75458887388874344655544678874225853221777507489999726997798535103447788887511999 Q ss_pred -------HHHHEECCCHHH Q ss_conf -------310100000101 Q gi|254780971|r 89 -------FQILVELNLLPG 100 (219) Q Consensus 89 -------fQiL~elGLlPg 100 (219) -|-.+++||||. T Consensus 122 Aa~sA~FiqAF~klGL~PD 140 (259) T TIGR02280 122 AARSAKFIQAFAKLGLIPD 140 (259) T ss_pred HHHHHHHHHHHHHHCCCCC T ss_conf 9875579988875246779 No 262 >cd03252 ABCC_Hemolysin The ABC-transporter hemolysin B is a central component of the secretion machinery that translocates the toxin, hemolysin A, in a Sec-independent fashion across both membranes of E. coli. The hemolysin A (HlyA) transport machinery is composed of the ATP-binding cassette (ABC) transporter HlyB located in the inner membrane, hemolysin D (HlyD), also anchored in the inner membrane, and TolC, which resides in the outer membrane. HlyD apparently forms a continuous channel that bridges the entire periplasm, interacting with TolC and HlyB. This arrangement prevents the appearance of periplasmic intermediates of HlyA during substrate transport. Little is known about the molecular details of HlyA transport, but it is evident that ATP-hydrolysis by the ABC-transporter HlyB is a necessary source of energy. Probab=28.04 E-value=49 Score=14.75 Aligned_cols=12 Identities=25% Similarity=0.515 Sum_probs=6.1 Q ss_pred CCCEEEECCCCH Q ss_conf 998999848843 Q gi|254780971|r 187 RGNVLGMMPHPE 198 (219) Q Consensus 187 ~G~vlgmMPHPE 198 (219) +|++...=+|-| T Consensus 211 ~G~iv~~G~~~e 222 (237) T cd03252 211 KGRIVEQGSHDE 222 (237) T ss_pred CCEEEEECCHHH T ss_conf 999999898799 No 263 >pfam03486 HI0933_like HI0933-like protein. Probab=27.97 E-value=33 Score=15.79 Aligned_cols=18 Identities=17% Similarity=0.370 Sum_probs=10.4 Q ss_pred ECCCCCCHHHHHHHHHHH Q ss_conf 449706479999999985 Q gi|254780971|r 7 QIPGLNRDNDMIKAITKI 24 (219) Q Consensus 7 ~~pGsNcd~e~~~A~~~~ 24 (219) .||-||--.+...||.+. T Consensus 100 vfP~s~~A~~Vl~~L~~~ 117 (405) T pfam03486 100 LFPVSDKASDIVDALLNE 117 (405) T ss_pred EECCCCCHHHHHHHHHHH T ss_conf 978989869999999999 No 264 >PRK02259 aspartoacylase; Provisional Probab=27.90 E-value=5.8 Score=20.66 Aligned_cols=42 Identities=26% Similarity=0.348 Sum_probs=26.7 Q ss_pred EEEEECCC----CCCCCCCCCCEEEEECCCCCEEEECCCCHHHCCCCCCCCCHHHHHHHH Q ss_conf 03653168----888898811266888699989998488434322100677628999864 Q gi|254780971|r 162 QIVFRYAS----GTNPNGSLHDIAGVINRRGNVLGMMPHPENIIEKFHGGIDGRGLFASL 217 (219) Q Consensus 162 ~i~~~Y~d----~~NPNGS~~~IAgi~s~~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~ 217 (219) ..++||.. |.|++|. |+| |=||++--.+|+.=..|.++|..+ T Consensus 174 ~~v~~~~~~vd~Pr~~~G~---~~a-----------mIHp~lq~rD~~pL~pGdPlF~tf 219 (263) T PRK02259 174 LVVHRHLGSIDYPRDENGQ---IDA-----------MIHPQLQGRDWQPLKPGDPLFLTF 219 (263) T ss_pred EEEEEEECEECCCCCCCCC---EEE-----------EECHHHCCCCCCCCCCCCCEEECC T ss_conf 7999993866278899999---708-----------967444189874168999637547 No 265 >cd07576 R-amidase_like Pseudomonas sp. MCI3434 R-amidase and related proteins (putative class 13 nitrilases). Pseudomonas sp. MCI3434 R-amidase hydrolyzes (R,S)-piperazine-2-tert-butylcarboxamide to form (R)-piperazine-2-carboxylic acid. It does so with strict R-stereoselectively. Its preferred substrates are carboxamide compounds which have the amino or imino group connected to their beta- or gamma-carbon. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), class 13 represents proteins that at the time were difficult to place in a distinct similarity group. It has been suggested that this subgroup represents a new class. Members of the nitrilase superfamily generally form homomeric compl Probab=27.24 E-value=50 Score=14.66 Aligned_cols=184 Identities=20% Similarity=0.249 Sum_probs=76.9 Q ss_pred EEEEEECCCCCCHHHH--HH---HHHHHCCCCEEEEECCCCCCCCCCEEEECC----CCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 6899844970647999--99---999854996399821546445647899728----73454311406775200212122 Q gi|254780971|r 2 KTAIVQIPGLNRDNDM--IK---AITKIIGQSPILVWQSDTDIPDVDLIVIPG----GFSYGDYLRCGAIAARTPVMQAI 72 (219) Q Consensus 2 kvaVl~~pGsNcd~e~--~~---A~~~~~~~~~~~v~~~~~~l~~~d~lvipG----GFSygD~l~aG~i~~~~~~~~~i 72 (219) |||++++.-+-+|.|- .+ ..+++.. .+.|+++||= ||..+|.+..-+.....+..+.+ T Consensus 1 kiA~~Q~~~~~gD~~~N~~~~~~~i~~A~~-------------~ga~lvvfPE~~l~Gy~~~~~~~~~a~~~~~~~~~~l 67 (254) T cd07576 1 RLALYQGPARDGDVAANLARLDEAAARAAA-------------AGADLLVFPELFLTGYNIGDAVARLAEPADGPALQAL 67 (254) T ss_pred CEEEEECCCCCCCHHHHHHHHHHHHHHHHH-------------CCCEEEECCCCCCCCCCCCCHHHHHHCCCCCHHHHHH T ss_conf 999998999889999999999999999998-------------8697999488863589866123443114786999999 Q ss_pred HHHHHC-CCEEE-EE--------CCCHHHHEECCCH---------HHHHC------CCCCCCEEEEEEEEEEC---CCH- Q ss_conf 332205-97178-60--------6403101000001---------01101------24433245422576752---523- Q gi|254780971|r 73 KKKAQQ-GIKVM-GI--------CNGFQILVELNLL---------PGILM------RNCSLKFVCKQVLLEVV---NSN- 123 (219) Q Consensus 73 ~~~~~~-g~~vL-GI--------CNGfQiL~elGLl---------Pg~l~------~N~s~rf~~r~~~~~v~---~~~- 123 (219) .+.+++ +..++ |+ .|-+-++-.-|-+ |+... -+...-|+.++..+.+. ... T Consensus 68 ~~~A~~~~i~iv~G~~e~~~~~~yNs~~~i~~~G~i~~~y~K~hL~~~~E~~~f~~G~~~~vf~~~~~~iG~~IC~D~~f 147 (254) T cd07576 68 RAIARRHGIAIVVGYPERAGGAVYNAAVLIDEDGTVLANYRKTHLFGDSERAAFTPGDRFPVVELRGLRVGLLICYDVEF 147 (254) T ss_pred HHHHHHCCCEEEEEEEEEECCEEEEEEEEECCCCEEEEEECCCCCCCCCCCEEECCCCCCEEEEECCEEEEEEEECCCCC T ss_conf 99999729869998999759978899999928983988571444688663058838986346688987999888615438 Q ss_pred --HHHHHHCCCCCEEEEEEECCCEEEEECHHHH-HHHHHCCEEEEECCCCC--CCCCCCCCEEEEECCCCCEEEECCCCH Q ss_conf --5776413799689986201330233086774-11542250365316888--889881126688869998999848843 Q gi|254780971|r 124 --TAFTKSYKMNQIIKCPVAHHDGNYFIDAKGL-AEIEKNNQIVFRYASGT--NPNGSLHDIAGVINRRGNVLGMMPHPE 198 (219) Q Consensus 124 --s~~~~~~~~~~~l~~piaHgEGrf~~~~~~l-~~l~~~~~i~~~Y~d~~--NPNGS~~~IAgi~s~~G~vlgmMPHPE 198 (219) ..-....+.-+++-+|.|-.+..-...+..+ .+-.+ +|+..-|++.. ..+....+-..|.+++|++++.++.-| T Consensus 148 pe~~r~la~~Ga~ii~~psa~~~~~~~~~~~~~~arA~e-n~~~vv~~n~~G~~~~~~~~G~S~ii~P~G~ila~~~~~e 226 (254) T cd07576 148 PELVRALALAGADLVLVPTALMEPYGFVARTLVPARAFE-NQIFVAYANRCGAEDGLTYVGLSSIAGPDGTVLARAGRGE 226 (254) T ss_pred CHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHH-CCCEEEEEECCCCCCCEEECCCEEEEECCCEEEEECCCCC T ss_conf 799999998699799988788885206999999999998-2994999966415898077287699918982987669988 Q ss_pred H Q ss_conf 4 Q gi|254780971|r 199 N 199 (219) Q Consensus 199 R 199 (219) - T Consensus 227 ~ 227 (254) T cd07576 227 A 227 (254) T ss_pred E T ss_conf 6 No 266 >COG3044 Predicted ATPase of the ABC class [General function prediction only] Probab=26.89 E-value=51 Score=14.62 Aligned_cols=55 Identities=13% Similarity=0.088 Sum_probs=26.1 Q ss_pred EEEEECHHH-HHHHHHCCEEEEECC-----------CC----CCCCCCCCC-EEEEECCCCCEEEECCCCHH Q ss_conf 023308677-411542250365316-----------88----888988112-66888699989998488434 Q gi|254780971|r 145 GNYFIDAKG-LAEIEKNNQIVFRYA-----------SG----TNPNGSLHD-IAGVINRRGNVLGMMPHPEN 199 (219) Q Consensus 145 Grf~~~~~~-l~~l~~~~~i~~~Y~-----------d~----~NPNGS~~~-IAgi~s~~G~vlgmMPHPER 199 (219) .+.+.|.+. -++|++-|+++|-=. ++ .=|=-++.+ =--|--++|..+-=||-|+- T Consensus 172 ck~v~DQe~lR~rLddlGLvafIangS~lpRk~gicgpp~e~aVPFe~Pes~pvEIElp~gk~v~gmgIp~g 243 (554) T COG3044 172 CKIVEDQEALRARLDDLGLVAFIANGSVLPRKAGICGPPMEEAVPFEAPESLPVEIELPNGKEVVGMGIPQG 243 (554) T ss_pred HHHHCCHHHHHHHHHHCCEEEEECCCCCCCHHHCCCCCCCCCCCCCCCCCCCEEEEECCCCCEEECCCCCCC T ss_conf 876425999998876355289971797560542478998556867658987616898278976633677753 No 267 >PRK12359 flavodoxin FldB; Provisional Probab=26.79 E-value=51 Score=14.61 Aligned_cols=79 Identities=18% Similarity=0.233 Sum_probs=40.7 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEE---C-CCCCCCCCCEEEECCCCCCCCCCCHHHHHH-HHHHHHHHHHH Q ss_conf 968998449706479999999985499639982---1-546445647899728734543114067752-00212122332 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVW---Q-SDTDIPDVDLIVIPGGFSYGDYLRCGAIAA-RTPVMQAIKKK 75 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~---~-~~~~l~~~d~lvipGGFSygD~l~aG~i~~-~~~~~~~i~~~ 75 (219) |||+|+=-.-|-...+.+..+.+..+.+.+.++ . ...++.+||.||+ |==.+++ |.+-. +....+.+.+. T Consensus 1 MKIgifYGS~TGnTe~vA~~I~~~lg~~~vd~~~v~~~~~~~~~~yd~lIl-G~pTW~~----GeLq~dW~~~~~~l~~~ 75 (169) T PRK12359 1 MKIGLFYGSSTCYTEMAAEKIRDIIGEELVDLHNLKDDPPKLMEQYDVLIL-GIPTWDF----GEIQEDWEAVWDQLDDL 75 (169) T ss_pred CEEEEEEECCCCHHHHHHHHHHHHHCCCCCCEEEHHCCCHHHHHHCCEEEE-ECCCCCC----CCCHHHHHHHHHHHHHC T ss_conf 969999979997099999999998561452313250299879962987999-7686677----86576799999888638 Q ss_pred HHCCCEE--EE Q ss_conf 2059717--86 Q gi|254780971|r 76 AQQGIKV--MG 84 (219) Q Consensus 76 ~~~g~~v--LG 84 (219) --.|+.| .| T Consensus 76 D~sGK~VAlFG 86 (169) T PRK12359 76 NLEGKIVALYG 86 (169) T ss_pred CCCCCEEEEEE T ss_conf 87898899984 No 268 >pfam04170 NlpE Uncharacterized lipoprotein NlpE involved in copper resistance. This family represents a bacterial outer membrane lipoprotein that is necessary for signalling by the Cpx pathway. This pathway responds to cell envelope disturbances and increases the expression of periplasmic protein folding and degradation factors. While the molecular function of the NlpE protein is unknown, it may be involved in detecting bacterial adhesion to abiotic surfaces. In Escherichia coli and Salmonella typhi, NlpE is also known to confer copper tolerance in copper-sensitive strains of Escherichia coli, and may be involved in copper efflux and delivery of copper to copper-dependent enzymes. Probab=26.63 E-value=37 Score=15.47 Aligned_cols=61 Identities=11% Similarity=0.113 Sum_probs=36.4 Q ss_pred HHCCCCCCCEEEEEEEEEECCCHHH----HHHHCCCCCEEEEEEECCCEEEE-ECHHHHHHHHHCCE Q ss_conf 1012443324542257675252357----76413799689986201330233-08677411542250 Q gi|254780971|r 101 ILMRNCSLKFVCKQVLLEVVNSNTA----FTKSYKMNQIIKCPVAHHDGNYF-IDAKGLAEIEKNNQ 162 (219) Q Consensus 101 ~l~~N~s~rf~~r~~~~~v~~~~s~----~~~~~~~~~~l~~piaHgEGrf~-~~~~~l~~l~~~~~ 162 (219) +|+-|.-+.|+-+.+++.....+.. |+.... +.+|.+--+.++.++| ..+..|..|..+++ T Consensus 16 tLtL~~Dgty~l~~~Ylg~~~~~~~~~g~~~W~~~-~~~i~L~~~~~~~~~y~v~~~~L~~LD~~G~ 81 (86) T pfam04170 16 TLTLNADGTYVLVETYLGKKEGPFTFAGYGTWNRT-GDKITLTDSEGEKTQYFVGENRLEMLDREGK 81 (86) T ss_pred EEEECCCCCEEEEEEECCCCCCCEEEEEEEEEECC-CCEEEEECCCCCEEEEEECCCEEEEECCCCC T ss_conf 99997999699999993788998068768999758-9999994389987799987998982446999 No 269 >pfam08877 MepB MepB protein. MepB is a functionally uncharacterized protein in the mepRAB gene cluster of Staphylococcus aureus. Probab=26.12 E-value=52 Score=14.53 Aligned_cols=65 Identities=18% Similarity=0.128 Sum_probs=45.0 Q ss_pred CCCHHHHEECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEEECCC--EEEEECHHHHHH Q ss_conf 64031010000010110124433245422576752523577641379968998620133--023308677411 Q gi|254780971|r 86 CNGFQILVELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPVAHHD--GNYFIDAKGLAE 156 (219) Q Consensus 86 CNGfQiL~elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~piaHgE--Grf~~~~~~l~~ 156 (219) .||.++...++ ..++++.+.|..-|-.- ..+.+.||...-. .+-|-|-|..+. |.|+++.+.|-+ T Consensus 14 l~~~~i~~R~A----K~TP~K~G~FVt~Wkr~-~~g~~~Pf~~~d~-~d~lvI~v~d~~~~G~FiFpk~~L~~ 80 (123) T pfam08877 14 LNGHTIRFRLA----KKTPTKPGYFVAFWEKD-ENGQNQPFDYDDA-PDLLIIVVIDDNRRGQFIFPKEVLIE 80 (123) T ss_pred ECCEEEEEEEC----CCCCCCCEEEEEEEEEC-CCCCCCCCCCCCC-CCEEEEEEECCCEEEEEEEEHHHHHH T ss_conf 99979999842----46899850699999878-9998478743347-89799999728906779903999976 No 270 >KOG3967 consensus Probab=25.90 E-value=52 Score=14.56 Aligned_cols=40 Identities=20% Similarity=0.429 Sum_probs=25.7 Q ss_pred EEECCCCCCCCCC-EEEECCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9821546445647-899728734543114067752002121223 Q gi|254780971|r 31 LVWQSDTDIPDVD-LIVIPGGFSYGDYLRCGAIAARTPVMQAIK 73 (219) Q Consensus 31 ~v~~~~~~l~~~d-~lvipGGFSygD~l~aG~i~~~~~~~~~i~ 73 (219) ++.+.+..+.+.+ ++|+--| +-++|||.||.+-.+.+.+. T Consensus 89 FiF~s~~~lt~~~kLlVLIHG---SGvVrAGQWARrLIIN~~Ld 129 (297) T KOG3967 89 FIFMSEDALTNPQKLLVLIHG---SGVVRAGQWARRLIINEDLD 129 (297) T ss_pred EEEECHHHHCCCCCEEEEEEC---CCEEECCHHHHHHHHCCCCC T ss_conf 488776574594215999926---82576045766652036634 No 271 >TIGR02214 spoVD_pbp stage V sporulation protein D; InterPro: IPR011927 This entry describes the SpoVD family of homologs of the cell division protein FtsI, a penicillin binding protein. This family is restricted to Bacillus subtilis and related Gram-positive species with known or suspected endospore formation capability. In these species, the functional equivalent of FtsI is desginated PBP-2B, a paralog of SpoVD.. Probab=25.87 E-value=49 Score=14.70 Aligned_cols=117 Identities=21% Similarity=0.257 Sum_probs=66.5 Q ss_pred HHHHHHHHHHCCCE---EEEECCC-HHHHEECCC--------HHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCE Q ss_conf 12122332205971---7860640-310100000--------10110124433245422576752523577641379968 Q gi|254780971|r 68 VMQAIKKKAQQGIK---VMGICNG-FQILVELNL--------LPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQI 135 (219) Q Consensus 68 ~~~~i~~~~~~g~~---vLGICNG-fQiL~elGL--------lPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~ 135 (219) +.++-+||-..|.+ +||.-|- -|=|.-|.| +||.+ ++|..--+.+.+++.+-|.+-.+ |.- T Consensus 148 v~edskRYYp~G~flshvLG~t~sD~~GL~GlEl~YdkeL~G~pG~~------~~e~d~~~~~~P~~~~~y~~P~d-G~~ 220 (660) T TIGR02214 148 VSEDSKRYYPNGDFLSHVLGFTNSDGEGLEGLELYYDKELKGLPGVR------VAEVDANSLELPYAEDEYKKPVD-GKD 220 (660) T ss_pred ECCCCCCCCCCHHHHHHHCCCCCCCCCCCCCHHHHHCHHHCCCCCEE------EEEEECCCCCCCCCCCCCCCCCC-CCC T ss_conf 74674222786002666525557760345231233141006899607------98874367778854123668868-873 Q ss_pred EEEEEECCCEEEEECHHHHHHHHHCC-EEEEECCCCCCCCCCCCCEEEEECCC-CCEEEECCCCHHH-CCCCCCC Q ss_conf 99862013302330867741154225-03653168888898811266888699-9899984884343-2210067 Q gi|254780971|r 136 IKCPVAHHDGNYFIDAKGLAEIEKNN-QIVFRYASGTNPNGSLHDIAGVINRR-GNVLGMMPHPENI-IEKFHGG 207 (219) Q Consensus 136 l~~piaHgEGrf~~~~~~l~~l~~~~-~i~~~Y~d~~NPNGS~~~IAgi~s~~-G~vlgmMPHPER~-~~~~~~~ 207 (219) +.+-| |+..+++.+.. +.|.+ ..||.|- ++-+.|++ |++|||-=.|.-- -++|..+ T Consensus 221 l~LTI----------D~~iqy~~Er~~~~a~~---~~~a~g~---~i~vMnpknGeILGM~nrP~ydpn~p~~~s 279 (660) T TIGR02214 221 LVLTI----------DENIQYIVEREADEALS---ERKADGV---SIIVMNPKNGEILGMSNRPSYDPNDPRDKS 279 (660) T ss_pred CEEEH----------HHHHHHHHHHHHHHHHH---HCCCCEE---EEEEECCCCCEEEEEECCCCCCCCCCCCCC T ss_conf 31000----------02346798899999975---2698758---999976999717652067887877743037 No 272 >cd06909 M14_ASPA Aspartoacylase (ASPA) belongs to the Succinylglutamate desuccinylase/aspartoacylase subfamily of the M14 family of metallocarboxypeptidases. ASPA (also known as aminoacylase 2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=25.56 E-value=9.4 Score=19.32 Aligned_cols=30 Identities=23% Similarity=0.372 Sum_probs=20.4 Q ss_pred CCCEEEECCCCHHHCCCCCCCCCHHHHHHHH Q ss_conf 9989998488434322100677628999864 Q gi|254780971|r 187 RGNVLGMMPHPENIIEKFHGGIDGRGLFASL 217 (219) Q Consensus 187 ~G~vlgmMPHPER~~~~~~~~~dG~~~f~~~ 217 (219) +|...| |=||++-..+|+.=..|.+||..+ T Consensus 218 ~G~~~a-mIHp~lq~rD~~pl~pGdPlF~tf 247 (282) T cd06909 218 NGEIAA-MIHPQLQDRDWQPLKPGDPLFLTF 247 (282) T ss_pred CCCEEE-EECHHHCCCCCCCCCCCCCEEECC T ss_conf 998759-956256278875889999615457 No 273 >PRK07212 consensus Probab=25.39 E-value=54 Score=14.44 Aligned_cols=57 Identities=18% Similarity=0.212 Sum_probs=31.9 Q ss_pred CCCEEEEEEECCCEEEEECHHHHHHHHHCCEEEEECCCCCCCCCCCCC------EEEEECCCCCEE Q ss_conf 996899862013302330867741154225036531688888988112------668886999899 Q gi|254780971|r 132 MNQIIKCPVAHHDGNYFIDAKGLAEIEKNNQIVFRYASGTNPNGSLHD------IAGVINRRGNVL 191 (219) Q Consensus 132 ~~~~l~~piaHgEGrf~~~~~~l~~l~~~~~i~~~Y~d~~NPNGS~~~------IAgi~s~~G~vl 191 (219) ..+++.+|.....| ..+++ +++...+.--++-+|.|+||.|.... |+.+|.+.+-++ T Consensus 136 g~~~v~vpl~~~~~--~~~~~-l~~~i~~~~k~i~l~~P~NPtG~v~s~~~l~~l~~la~~~~~~i 198 (378) T PRK07212 136 GGKPVWVPLNPEDF--QLPDD-LAEAITDKTKLIVVNSPSNPTGAVFGKESLQGVADLAVDHDLYV 198 (378) T ss_pred CCEEEEEECCCCCC--CCHHH-HHHHCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCEE T ss_conf 99699983555678--52899-99856657829998899799898847176889999777424501 No 274 >TIGR01531 glyc_debranch glycogen debranching enzymye; InterPro: IPR006421 Glycogen debranching enzyme possesses two different catalytic activities; oligo-1,4-->1,4-glucantransferase (2.4.1.25 from EC) and amylo-1,6-glucosidase (3.2.1.33 from EC). Site-directed mutagenesis studies in Saccharomyces cerevisiae indicate that the transferase and glucosidase activities are independent and located in different regions of the polypeptide chain. Proteins in this family belong to the larger alpha-amylase family. The family contains eukaryotic proteins.; GO: 0004134 4-alpha-glucanotransferase activity, 0004135 amylo-alpha-16-glucosidase activity, 0005978 glycogen biosynthetic process. Probab=25.02 E-value=37 Score=15.53 Aligned_cols=119 Identities=19% Similarity=0.276 Sum_probs=62.5 Q ss_pred ECCCHHHHHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCC-CEEEEE----------EECCCEEEEE-CHHHHHHHHHCC Q ss_conf 0000101101244332454225767525235776413799-689986----------2013302330-867741154225 Q gi|254780971|r 94 ELNLLPGILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMN-QIIKCP----------VAHHDGNYFI-DAKGLAEIEKNN 161 (219) Q Consensus 94 elGLlPg~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~-~~l~~p----------iaHgEGrf~~-~~~~l~~l~~~~ 161 (219) +=||||-.|-.=...||=||....-....---....-+.| .+|..+ ++|.+|++.- --++..+..+.. T Consensus 1175 rHGLIPNLL~~G~~aRYNcRDA~WfwL~~i~~Yv~~~p~G~~IL~~~v~R~yp~Dd~~~~~~G~~~q~l~~~i~~aL~~H 1254 (1584) T TIGR01531 1175 RHGLIPNLLDEGIKARYNCRDAVWFWLQCIQDYVEIVPNGLKILKDPVRRIYPDDDSIPVDDGELDQLLFELIIEALQKH 1254 (1584) T ss_pred HHCCCCCHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 41475233543577763516889999999998740588605441575211488876566888730244799999999986 Q ss_pred EEEEECCC-CCCCC--------CCCCCEEEEECCCCCEEE-----------ECCCCHHHCCCCCCCC--CHHHH Q ss_conf 03653168-88889--------881126688869998999-----------8488434322100677--62899 Q gi|254780971|r 162 QIVFRYAS-GTNPN--------GSLHDIAGVINRRGNVLG-----------MMPHPENIIEKFHGGI--DGRGL 213 (219) Q Consensus 162 ~i~~~Y~d-~~NPN--------GS~~~IAgi~s~~G~vlg-----------mMPHPER~~~~~~~~~--dG~~~ 213 (219) .--++|-+ +..|+ | -+.+|||-.++|-|+| -|---||+-..-..+| ||..+ T Consensus 1255 ~~Gi~fRErnAGP~iD~~M~D~G-Fnv~~Gv~~~TGfvyGGn~fNCGTWMDKMGsS~kAGn~G~PaTPRDGaAv 1327 (1584) T TIGR01531 1255 FQGIQFRERNAGPQIDRQMKDEG-FNVTAGVDESTGFVYGGNRFNCGTWMDKMGSSEKAGNKGIPATPRDGAAV 1327 (1584) T ss_pred HHHHHHHHHCCCCCCCCCCCCCC-CEEEEEECCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHH T ss_conf 42000223157744531046778-30465331677737688877757734557773104687877768861366 No 275 >TIGR00705 SppA_67K signal peptide peptidase SppA, 67K type; InterPro: IPR004634 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of serine peptidases belong to MEROPS peptidase family S49 (protease IV family, clan S-). The predicted active site serine for members of this family occurs in a transmembrane domain. Signal peptides of secretory proteins seem to serve at least two important biological functions. First, they are required for protein targeting to and translocation across membranes, such as the eubacterial plasma membrane and the endoplasmic reticular membrane of eukaryotes. Second, in addition to their role as determinants for protein targeting and translocation, certain signal peptides have a signalling function. During or shortly after pre-protein translocation, the signal peptide is removed by signal peptidases. The integral membrane protein, SppA (protease IV), of Escherichia coli was shown experimentally to degrade signal peptides. The member of this family from Bacillus subtilis has only been shown to be required for efficient processing of pre-proteins under conditions of hyper-secretion . These enzymes have a molecular mass around 67 kDa and a duplication such that the N-terminal half shares extensive homology with the C-terminal half and was shown in E. coli to form homotetramers. E. coli SohB, which is most closely homologous to the C-terminal duplication of SppA, is predicted to perform a similar function of small peptide degradation, but in the periplasm. Many prokaryotes have a single SppA/SohB homolog that may perform the function of either or both. ; GO: 0009003 signal peptidase activity, 0006465 signal peptide processing, 0016021 integral to membrane. Probab=24.82 E-value=55 Score=14.38 Aligned_cols=20 Identities=15% Similarity=0.300 Sum_probs=11.0 Q ss_pred HHHHHHHHHHHHHHCC-CEEE Q ss_conf 2002121223322059-7178 Q gi|254780971|r 64 ARTPVMQAIKKKAQQG-IKVM 83 (219) Q Consensus 64 ~~~~~~~~i~~~~~~g-~~vL 83 (219) |...++.++.+.-++| |||+ T Consensus 376 Ase~IR~e~~~~~~~GkKPVi 396 (614) T TIGR00705 376 ASEIIRRELERLQARGKKPVI 396 (614) T ss_pred HHHHHHHHHHHHHHCCCCCEE T ss_conf 878999999999826899789 No 276 >TIGR01959 nuoF_fam NADH-quinone oxidoreductase, F subunit; InterPro: IPR011537 This entry describes the F chain of complexes that resemble NADH-quinone oxidoreductases. The electron acceptor is a quinone, ubiquinone, in mitochondria and most bacteria, including Escherichia coli, where the recommended gene symbol is nuoF. This family does not have any members in chloroplast or cyanobacteria, where the quinone may be plastoquinone and NADH may be replaced by NADPH, nor in Methanosarcina, where NADH is replaced by F420H2. ; GO: 0010181 FMN binding, 0016651 oxidoreductase activity acting on NADH or NADPH, 0051287 NAD binding, 0051539 4 iron 4 sulfur cluster binding, 0006118 electron transport. Probab=24.77 E-value=38 Score=15.45 Aligned_cols=35 Identities=17% Similarity=0.284 Sum_probs=25.0 Q ss_pred HHHHHCCCCCEEEEEEECCCEEEEECHHH--HHHHHHC-CEE Q ss_conf 77641379968998620133023308677--4115422-503 Q gi|254780971|r 125 AFTKSYKMNQIIKCPVAHHDGNYFIDAKG--LAEIEKN-NQI 163 (219) Q Consensus 125 ~~~~~~~~~~~l~~piaHgEGrf~~~~~~--l~~l~~~-~~i 163 (219) +.=++++ ++|-+.||=|+|+|-+|+ |+.|+-. +|. T Consensus 154 IlGsGFD----f~l~~H~GAGAYICGEETALleSLEGkrGqP 191 (420) T TIGR01959 154 ILGSGFD----FELFVHRGAGAYICGEETALLESLEGKRGQP 191 (420) T ss_pred CCCCCCC----EEEEEECCCCCEECHHHHHHHHHCCCCCCCC T ss_conf 3788620----5763433888505047897453105766845 No 277 >PRK00402 3-isopropylmalate dehydratase large subunit; Reviewed Probab=24.61 E-value=23 Score=16.79 Aligned_cols=127 Identities=18% Similarity=0.313 Sum_probs=55.0 Q ss_pred HHHHHHHHHHC-CCEE------EEECCCHHHHEECCC-HHHHHC-CCCCCC-----EEEEEEEE---EE---CCCHHHHH Q ss_conf 12122332205-9717------860640310100000-101101-244332-----45422576---75---25235776 Q gi|254780971|r 68 VMQAIKKKAQQ-GIKV------MGICNGFQILVELNL-LPGILM-RNCSLK-----FVCKQVLL---EV---VNSNTAFT 127 (219) Q Consensus 68 ~~~~i~~~~~~-g~~v------LGICNGfQiL~elGL-lPg~l~-~N~s~r-----f~~r~~~~---~v---~~~~s~~~ 127 (219) ..+.+++|+++ |.+- -|||. |++.|.|+ +||.++ --+|.. +-+-=.-+ .+ .-+...|+ T Consensus 78 ~~~~~r~fa~~~gi~~~~d~~~~GI~H--qv~~E~g~~~PG~~ivg~DSHT~t~GalGala~GvG~td~~~~latg~~w~ 155 (419) T PRK00402 78 QQKILRDFAREHGIPHFFDVGRMGICH--ALLPEKGLVRPGDVVVGADSHTCTYGALGAFATGMGSTDLAAAMATGKTWF 155 (419) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEE--EEECCCCCCCCCCEEEECCCCCCCCCCCCEEEECCCHHHHHHHHCCCCEEE T ss_conf 999999999984976032489986887--860002744688279835888776675523663467799999980292677 Q ss_pred HHCCCCCEEEEEEECCCEE---EEECHHH----HHHHHHCCE--EEEECCCCCCCCCCCCC---EEEEECCCCCEEEECC Q ss_conf 4137996899862013302---3308677----411542250--36531688888988112---6688869998999848 Q gi|254780971|r 128 KSYKMNQIIKCPVAHHDGN---YFIDAKG----LAEIEKNNQ--IVFRYASGTNPNGSLHD---IAGVINRRGNVLGMMP 195 (219) Q Consensus 128 ~~~~~~~~l~~piaHgEGr---f~~~~~~----l~~l~~~~~--i~~~Y~d~~NPNGS~~~---IAgi~s~~G~vlgmMP 195 (219) + ..+++.+.+ +|+ .+...+. ++++..++- -++.|.-+.-.+=|... |+-.+-+-|--.|++| T Consensus 156 ~---vPe~v~v~l---~G~l~~gv~akDviL~i~~~l~~~g~~~~~vEf~G~~i~~Ls~~~R~Ti~NMa~E~GA~~gi~~ 229 (419) T PRK00402 156 R---VPETIKVVL---NGKLPPGVYAKDVILHLIGQIGVDGATYKAMEFTGEAIEALSMDERMTLANMAIEAGAKAGIFA 229 (419) T ss_pred C---CCCEEEEEE---ECCCCCCCCHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCHHHHHHHHCCCHHHCCEEEEEC T ss_conf 0---785789999---4748999779999999998738577787699993863034898896445402143344346666 Q ss_pred CCHHHCC Q ss_conf 8434322 Q gi|254780971|r 196 HPENIIE 202 (219) Q Consensus 196 HPER~~~ 202 (219) -=|+..+ T Consensus 230 ~D~~t~~ 236 (419) T PRK00402 230 PDEKTLE 236 (419) T ss_pred CCHHHHH T ss_conf 8688999 No 278 >COG1436 NtpG Archaeal/vacuolar-type H+-ATPase subunit F [Energy production and conversion] Probab=24.56 E-value=37 Score=15.52 Aligned_cols=29 Identities=24% Similarity=0.463 Sum_probs=22.3 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 968998449706479999999985499639982154 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQSD 36 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~~ 36 (219) ||++|+ +|+|++-.|. ++|.....+...+ T Consensus 3 ~~I~VI------Gd~dtv~GFr-LaGv~~~~v~~~~ 31 (104) T COG1436 3 MKIAVI------GDRDTVTGFR-LAGVRVVYVADDE 31 (104) T ss_pred EEEEEE------ECCCHHHCEE-EECCEEEEEECCH T ss_conf 189999------7601302046-6042367984580 No 279 >PRK13231 nitrogenase reductase-like protein; Reviewed Probab=24.34 E-value=57 Score=14.32 Aligned_cols=14 Identities=21% Similarity=0.714 Sum_probs=9.3 Q ss_pred ECCCHHHHEECCCH Q ss_conf 06403101000001 Q gi|254780971|r 85 ICNGFQILVELNLL 98 (219) Q Consensus 85 ICNGfQiL~elGLl 98 (219) +|+||.+-...|+- T Consensus 126 vcggfa~Pir~~~A 139 (264) T PRK13231 126 VCGGFSVPLREDYA 139 (264) T ss_pred CCCCEECCCCCCCC T ss_conf 05667045542669 No 280 >COG2360 Aat Leu/Phe-tRNA-protein transferase [Posttranslational modification, protein turnover, chaperones] Probab=23.59 E-value=59 Score=14.23 Aligned_cols=15 Identities=27% Similarity=0.569 Sum_probs=9.7 Q ss_pred CCCCEEEECCCCCCC Q ss_conf 564789972873454 Q gi|254780971|r 40 PDVDLIVIPGGFSYG 54 (219) Q Consensus 40 ~~~d~lvipGGFSyg 54 (219) ++-++|+.|..|-+. T Consensus 51 pdpR~vl~~d~fhi~ 65 (221) T COG2360 51 PDPRAVLPPDDFHIS 65 (221) T ss_pred CCCCEEECCCCCCCC T ss_conf 898766562125436 No 281 >KOG0257 consensus Probab=22.81 E-value=38 Score=15.40 Aligned_cols=11 Identities=27% Similarity=0.594 Sum_probs=4.5 Q ss_pred CCCCCCCCCCC Q ss_conf 68888898811 Q gi|254780971|r 168 ASGTNPNGSLH 178 (219) Q Consensus 168 ~d~~NPNGS~~ 178 (219) |.|.||-|... T Consensus 179 ntPhNPtGkvf 189 (420) T KOG0257 179 NTPHNPTGKVF 189 (420) T ss_pred ECCCCCCCCCC T ss_conf 48999867305 No 282 >TIGR01135 glmS glucosamine--fructose-6-phosphate aminotransferase, isomerizing; InterPro: IPR005855 Glucosamine:fructose-6-phosphate aminotransferase (2.6.1.16 from EC) catalyses the formation of glucosamine 6-phosphate and is the first and rate-limiting enzyme of the hexosamine biosynthetic pathway. The final product of the hexosamine pathway, UDP-N-acetyl glucosamine, is an active precursor of numerous macromolecules containing amino sugars. This family of sequences belong to the MEROPS peptidase family C44 (clan PB(C)), and are classified as non-peptidase homologs.; GO: 0004360 glutamine-fructose-6-phosphate transaminase (isomerizing) activity, 0016051 carbohydrate biosynthetic process, 0005737 cytoplasm. Probab=22.79 E-value=61 Score=14.13 Aligned_cols=58 Identities=22% Similarity=0.379 Sum_probs=33.5 Q ss_pred HHHHHHCCCCEEEEECCCC-----CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCC--CEEEEECC Q ss_conf 9999854996399821546-----445647899728734543114067752002121223322059--71786064 Q gi|254780971|r 19 KAITKIIGQSPILVWQSDT-----DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKKAQQG--IKVMGICN 87 (219) Q Consensus 19 ~A~~~~~~~~~~~v~~~~~-----~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~~~~g--~~vLGICN 87 (219) +.|+++++..+.+..-+|. -+++=+++|.- .-||.=|= -..+++...++| -.+||||| T Consensus 325 Y~iE~lA~iPv~Ve~ASEfRYr~~~~~~~~L~I~I--------SQSGETAD---TL~ALr~aK~~G~~~~~L~IcN 389 (628) T TIGR01135 325 YLIERLAGIPVEVEVASEFRYRKPVVDKKTLVIAI--------SQSGETAD---TLEALRLAKELGLYAKTLGICN 389 (628) T ss_pred HHHHHHCCCCCEEEEECCEEECCCCCCCCCEEEEE--------ECCCCHHH---HHHHHHHHHHCCCCEEEEEEEE T ss_conf 99998618972899841200136775887659999--------74861088---9999999984798515899983 No 283 >PRK11423 methylmalonyl-CoA decarboxylase; Provisional Probab=22.60 E-value=61 Score=14.11 Aligned_cols=80 Identities=15% Similarity=0.284 Sum_probs=37.7 Q ss_pred CCCCEEEEC---CC--CCCCCCCCHHH------HHHHHHHHHHHHHHHHCCCEEEEECCCHHHHEECCCH--HHHHCCCC Q ss_conf 564789972---87--34543114067------7520021212233220597178606403101000001--01101244 Q gi|254780971|r 40 PDVDLIVIP---GG--FSYGDYLRCGA------IAARTPVMQAIKKKAQQGIKVMGICNGFQILVELNLL--PGILMRNC 106 (219) Q Consensus 40 ~~~d~lvip---GG--FSygD~l~aG~------i~~~~~~~~~i~~~~~~g~~vLGICNGfQiL~elGLl--Pg~l~~N~ 106 (219) ++.+.+||- || ||-|-++..-. .....++.+.+....+-.+|++.-+||.=+=--++|. -....--+ T Consensus 46 ~~~r~vvl~~~~g~~~F~AG~Dl~e~~~~~~~~~~~~~~~~~~~~~i~~~~kPvIAav~G~a~GgG~~lal~cD~~ia~~ 125 (261) T PRK11423 46 PEIRVVILRAPSGSKVWSAGHDIHELPAGGRDPLSYDDPLRQILRMIQKFPKPVIAMVEGSVWGGAFDLVMSCDLIIAAS 125 (261) T ss_pred CCCEEEEEECCCCCCCCCCCCCHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHCHHHHHHHHHHHCCCC T ss_conf 99879999658887542378665453225766677877999999999948998999967733214089999875421011 Q ss_pred CCCEEEEEEEEEE Q ss_conf 3324542257675 Q gi|254780971|r 107 SLKFVCKQVLLEV 119 (219) Q Consensus 107 s~rf~~r~~~~~v 119 (219) +.+|-.-.+.+.+ T Consensus 126 ~a~f~~~~~~~Gl 138 (261) T PRK11423 126 TSTFAMTPANLGV 138 (261) T ss_pred CCEEECCCHHCCC T ss_conf 6269576200277 No 284 >pfam09558 DUF2375 Protein of unknown function (DUF2375). Two members of this family are found in Colwellia psychrerythraea (strain 34H / ATCC BAA-681) and one each in various other species of Colwellia and Shewanella. One member from C. psychrerythraea is of special interest because it is preceded by the same cis-regulatory site as a number of genes that have the PEP-CTERM domain described by PEP_anchor (IPR013424). Probab=22.58 E-value=40 Score=15.31 Aligned_cols=22 Identities=23% Similarity=0.498 Sum_probs=16.6 Q ss_pred HHHCCCEEEEECCC-HHHHEECC Q ss_conf 22059717860640-31010000 Q gi|254780971|r 75 KAQQGIKVMGICNG-FQILVELN 96 (219) Q Consensus 75 ~~~~g~~vLGICNG-fQiL~elG 96 (219) .+++|+.++.+|.| |.||-.+| T Consensus 42 eFrrGKsIIAV~EGe~~IlNslG 64 (71) T pfam09558 42 EFRRGKSIIAVLEGECKILNSLG 64 (71) T ss_pred HHCCCCEEEEEEECCCHHHHHHH T ss_conf 87189759999715401366676 No 285 >PRK07392 threonine-phosphate decarboxylase; Validated Probab=22.50 E-value=61 Score=14.09 Aligned_cols=16 Identities=25% Similarity=0.476 Sum_probs=6.0 Q ss_pred CEEEECCCCCCCCCCCH Q ss_conf 78997287345431140 Q gi|254780971|r 43 DLIVIPGGFSYGDYLRC 59 (219) Q Consensus 43 d~lvipGGFSygD~l~a 59 (219) |-+++|. =+|+.|-++ T Consensus 98 ~~v~i~~-Ptf~~Y~~~ 113 (358) T PRK07392 98 REVYLLT-PAFGDYRRA 113 (358) T ss_pred CEEEEEC-CCCHHHHHH T ss_conf 8699957-851899999 No 286 >PRK04020 rps2P 30S ribosomal protein S2; Provisional Probab=22.38 E-value=49 Score=14.70 Aligned_cols=10 Identities=30% Similarity=0.600 Sum_probs=6.9 Q ss_pred CCCHHHHHCC Q ss_conf 0001011012 Q gi|254780971|r 95 LNLLPGILMR 104 (219) Q Consensus 95 lGLlPg~l~~ 104 (219) -.|+||+|+- T Consensus 97 ~RWlgG~LTN 106 (204) T PRK04020 97 GRFIPGTLTN 106 (204) T ss_pred CCCCCCCCCC T ss_conf 6227986568 No 287 >TIGR02707 butyr_kinase butyrate kinase; InterPro: IPR011245 This group represents bacterial butyrate kinase, an enzyme that facilitates the formation of butyryl-CoA by phosphorylating butyrate in the presence of ATP to form butyryl phosphate . The final steps in butyrate synthesis by anaerobic bacteria can occur via butyrate kinase and phosphotransbutyrylase or via butyryl-CoA:acetate CoA-transferase, the latter providing the dominant route for butyrate formation in human colonic bacteria .; GO: 0005524 ATP binding, 0047761 butyrate kinase activity, 0016310 phosphorylation, 0005737 cytoplasm. Probab=21.97 E-value=41 Score=15.24 Aligned_cols=18 Identities=39% Similarity=0.929 Sum_probs=11.4 Q ss_pred CCCEEEECCCCCCCCC-CC Q ss_conf 6478997287345431-14 Q gi|254780971|r 41 DVDLIVIPGGFSYGDY-LR 58 (219) Q Consensus 41 ~~d~lvipGGFSygD~-l~ 58 (219) ++|.|||-||-+|+++ +- T Consensus 294 ~VDAI~LTGGlAys~~f~v 312 (353) T TIGR02707 294 KVDAIILTGGLAYSKYFFV 312 (353) T ss_pred CEEEEEECCCHHCCCCHHH T ss_conf 2668985460130620455 No 288 >PRK07854 enoyl-CoA hydratase; Provisional Probab=21.29 E-value=65 Score=13.94 Aligned_cols=80 Identities=18% Similarity=0.340 Sum_probs=43.3 Q ss_pred EEEEEC--CC-CCC-HHHHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECC---CCCCCCCCCHHHHHH--HHHHHHHHH Q ss_conf 899844--97-064-799999999854996399821546445647899728---734543114067752--002121223 Q gi|254780971|r 3 TAIVQI--PG-LNR-DNDMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPG---GFSYGDYLRCGAIAA--RTPVMQAIK 73 (219) Q Consensus 3 vaVl~~--pG-sNc-d~e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipG---GFSygD~l~aG~i~~--~~~~~~~i~ 73 (219) |++|++ |- -|. +.+|...+.... +. ..| ++.+.+||-| .||-|-+|....... ...+.+.+. T Consensus 10 Va~itlnrP~~~NAl~~~m~~~l~~al--~~----~~d---~~vr~vVltg~G~~FcaG~Dl~~~~~~~~~~~~~~~~~~ 80 (243) T PRK07854 10 VLTIELQRPERRNALNSELVEELREAV--RK----AGD---AGARAIVLTGQGTVFCAGADLSGDVYADDFPDALIELLH 80 (243) T ss_pred EEEEEECCCCCCCCCCHHHHHHHHHHH--HH----HCC---CCCEEEEEECCCCCEECCCCCCCCCCHHHHHHHHHHHHH T ss_conf 999997587767997999999999999--97----238---995599997889816689986552116677899999999 Q ss_pred HHHHCCCEEEEECCCHHH Q ss_conf 322059717860640310 Q gi|254780971|r 74 KKAQQGIKVMGICNGFQI 91 (219) Q Consensus 74 ~~~~~g~~vLGICNGfQi 91 (219) ...+-.+|++.-+||.=+ T Consensus 81 ~i~~~~kPvIAav~G~a~ 98 (243) T PRK07854 81 AIDAAPVPVIAAINGPAI 98 (243) T ss_pred HHHHCCCCEEEEECCEEC T ss_conf 998589988999758323 No 289 >COG5246 PRP11 Splicing factor 3a, subunit 2 [RNA processing and modification] Probab=21.24 E-value=55 Score=14.38 Aligned_cols=38 Identities=18% Similarity=0.156 Sum_probs=25.2 Q ss_pred HHCCCCCCCEEEEEEEEEECCCHHHHHHHCCCCCEEEEEE Q ss_conf 1012443324542257675252357764137996899862 Q gi|254780971|r 101 ILMRNCSLKFVCKQVLLEVVNSNTAFTKSYKMNQIIKCPV 140 (219) Q Consensus 101 ~l~~N~s~rf~~r~~~~~v~~~~s~~~~~~~~~~~l~~pi 140 (219) -|..|.+++|+|+.+..+-.+..|.. +... |......+ T Consensus 45 yl~knh~Gk~vC~LC~T~H~~e~Sy~-~H~~-GKKH~~n~ 82 (222) T COG5246 45 YLSKNHTGKYVCLLCKTKHLTEMSYV-KHRE-GKKHKENS 82 (222) T ss_pred CHHHCCCCCEEEEEECCCCCCHHHHH-HHHC-CCHHHHHH T ss_conf 11105777188521010222177777-7613-52023457 No 290 >COG2054 Uncharacterized archaeal kinase related to aspartokinases, uridylate kinases [General function prediction only] Probab=21.22 E-value=65 Score=13.93 Aligned_cols=17 Identities=29% Similarity=0.786 Sum_probs=14.0 Q ss_pred CEEEECCCCCCCCCCCH Q ss_conf 78997287345431140 Q gi|254780971|r 43 DLIVIPGGFSYGDYLRC 59 (219) Q Consensus 43 d~lvipGGFSygD~l~a 59 (219) ..+|+|||--|.|..|. T Consensus 28 ~i~iVpGGg~FAd~VR~ 44 (212) T COG2054 28 SILIVPGGGIFADLVRK 44 (212) T ss_pred EEEEECCCHHHHHHHHH T ss_conf 08995184289999999 No 291 >COG2910 Putative NADH-flavin reductase [General function prediction only] Probab=21.22 E-value=65 Score=13.93 Aligned_cols=92 Identities=22% Similarity=0.313 Sum_probs=49.1 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECC--------------C--------CCCCCCCEEEECCCCCCCCCCC Q ss_conf 96899844970647999999998549963998215--------------4--------6445647899728734543114 Q gi|254780971|r 1 MKTAIVQIPGLNRDNDMIKAITKIIGQSPILVWQS--------------D--------TDIPDVDLIVIPGGFSYGDYLR 58 (219) Q Consensus 1 mkvaVl~~pGsNcd~e~~~A~~~~~~~~~~~v~~~--------------~--------~~l~~~d~lvipGGFSygD~l~ 58 (219) ||+|||-.-|--+.+=...|+.| +..-+.++.+. | ..|.++|.+|--=||..+|..- T Consensus 1 mKIaiIgAsG~~Gs~i~~EA~~R-GHeVTAivRn~~K~~~~~~~~i~q~Difd~~~~a~~l~g~DaVIsA~~~~~~~~~~ 79 (211) T COG2910 1 MKIAIIGASGKAGSRILKEALKR-GHEVTAIVRNASKLAARQGVTILQKDIFDLTSLASDLAGHDAVISAFGAGASDNDE 79 (211) T ss_pred CEEEEEECCCHHHHHHHHHHHHC-CCEEEEEEECHHHCCCCCCCEEECCCCCCHHHHHHHHCCCCEEEEECCCCCCCHHH T ss_conf 90789953745679999999867-98048998076766522353020002227456676635876699721578887157 Q ss_pred HHHHHHHHHHHHHHHHHHHC-CCE---E------EEECCCHHHHEECCCHH Q ss_conf 06775200212122332205-971---7------86064031010000010 Q gi|254780971|r 59 CGAIAARTPVMQAIKKKAQQ-GIK---V------MGICNGFQILVELNLLP 99 (219) Q Consensus 59 aG~i~~~~~~~~~i~~~~~~-g~~---v------LGICNGfQiL~elGLlP 99 (219) ..+ ...+.+....+. +.+ | |=|--| |.|+-+...| T Consensus 80 ---~~~--k~~~~li~~l~~agv~RllVVGGAGSL~id~g-~rLvD~p~fP 124 (211) T COG2910 80 ---LHS--KSIEALIEALKGAGVPRLLVVGGAGSLEIDEG-TRLVDTPDFP 124 (211) T ss_pred ---HHH--HHHHHHHHHHHHCCCEEEEEECCCCCEEECCC-CEEECCCCCC T ss_conf ---788--89999999986159705999847420587688-4550589985 No 292 >COG1197 Mfd Transcription-repair coupling factor (superfamily II helicase) [DNA replication, recombination, and repair / Transcription] Probab=21.22 E-value=41 Score=15.23 Aligned_cols=18 Identities=17% Similarity=0.301 Sum_probs=11.0 Q ss_pred CCCCCEEEEEEECCCEEEE Q ss_conf 3799689986201330233 Q gi|254780971|r 130 YKMNQIIKCPVAHHDGNYF 148 (219) Q Consensus 130 ~~~~~~l~~piaHgEGrf~ 148 (219) ++.|+ +-+++-||=|||. T Consensus 471 L~~Gd-~VVH~~HGIGrfl 488 (1139) T COG1197 471 LKPGD-YVVHIDHGIGRFL 488 (1139) T ss_pred CCCCC-EEEECCCCCEEEE T ss_conf 78898-4775157713442 No 293 >KOG3212 consensus Probab=20.62 E-value=67 Score=13.85 Aligned_cols=38 Identities=18% Similarity=0.194 Sum_probs=26.9 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCCEEEECCCCCCC Q ss_conf 999999985499639982154644564789972873454 Q gi|254780971|r 16 DMIKAITKIIGQSPILVWQSDTDIPDVDLIVIPGGFSYG 54 (219) Q Consensus 16 e~~~A~~~~~~~~~~~v~~~~~~l~~~d~lvipGGFSyg 54 (219) ++..++.+ -..+.+.|...+..+.=+|-.||+-|||+- T Consensus 73 ~vv~lLrd-enadDVfVi~vpeem~y~dh~VIcSgrs~r 110 (208) T KOG3212 73 EVVKLLRD-ENADDVFVIPVPEEMFYADHTVICSGRSDR 110 (208) T ss_pred HHHHHHHH-CCCCCEEEEECCCCCEEEEEEEEEECCCHH T ss_conf 99999875-465735998336435333359997357658 No 294 >PRK09559 putative global regulator; Reviewed Probab=20.62 E-value=67 Score=13.85 Aligned_cols=18 Identities=28% Similarity=0.307 Sum_probs=9.1 Q ss_pred EEEEECCCCCEE--EECCCC Q ss_conf 668886999899--984884 Q gi|254780971|r 180 IAGVINRRGNVL--GMMPHP 197 (219) Q Consensus 180 IAgi~s~~G~vl--gmMPHP 197 (219) +++...++|++. +-+++. T Consensus 280 ~~a~~~~~g~~~~l~vl~~~ 299 (327) T PRK09559 280 LAAVQLEDGQVWVQVVMNND 299 (327) T ss_pred EEEEECCCCCEEEEEEECCC T ss_conf 99898589939999996677 No 295 >PTZ00254 40S ribosomal protein SA; Provisional Probab=20.36 E-value=56 Score=14.35 Aligned_cols=10 Identities=20% Similarity=0.398 Sum_probs=6.3 Q ss_pred CCCHHHHHCC Q ss_conf 0001011012 Q gi|254780971|r 95 LNLLPGILMR 104 (219) Q Consensus 95 lGLlPg~l~~ 104 (219) -.++||+|+- T Consensus 101 ~Rw~pGtlTN 110 (242) T PTZ00254 101 GRFTPGTFTN 110 (242) T ss_pred CCCCCCCCCC T ss_conf 5127866468 No 296 >TIGR00088 trmD tRNA (guanine-N1)-methyltransferase; InterPro: IPR002649 In transfer RNA many different modified nucleosides are found, especially in the anticodon region. tRNA (guanine-N1-)-methyltransferase 2.1.1.31 from EC is one of several nucleases operating together with the tRNA-modifying enzymes before the formation of the mature tRNA. It catalyses the reaction:S-adenosyl-L-methionine + tRNA -> S-adenosyl-L-homocysteine + tRNA containing N1-methylguanine methylating guanosine(G) to N1-methylguanine (1-methylguanosine (m1G)) at position 37 of tRNAs that read CUN (leucine), CCN(proline), and CGG (arginine) codons. The presence of m1G improves the cellular growth rate and the polypeptide steptime and also prevents the tRNA from shifting the reading frame . The mechanism of the trmD3-induced frameshift involving mutant tRNA(Pro) and tRNA(Leu) species has been investigated . It has been suggested that the conformation of the anticodon loop may be a major determining element for the formation of m1G37 in vivo . ; GO: 0003723 RNA binding, 0008175 tRNA methyltransferase activity, 0008033 tRNA processing. Probab=20.36 E-value=48 Score=14.76 Aligned_cols=50 Identities=24% Similarity=0.193 Sum_probs=26.4 Q ss_pred HHHHHHHHCCCCEEEEECCCC---CCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 999999854996399821546---44564789972873454311406775200212122332 Q gi|254780971|r 17 MIKAITKIIGQSPILVWQSDT---DIPDVDLIVIPGGFSYGDYLRCGAIAARTPVMQAIKKK 75 (219) Q Consensus 17 ~~~A~~~~~~~~~~~v~~~~~---~l~~~d~lvipGGFSygD~l~aG~i~~~~~~~~~i~~~ 75 (219) |.+|+++ .-....++--+|. +=..+|=....| |||.+..-.|+.++|... T Consensus 22 l~~A~~k-~l~~~e~~n~RdFs~dKH~tvDD~pYGG--------G~GMvLk~ePi~~Al~~~ 74 (247) T TIGR00088 22 LKKALEK-NLVSFEVVNPRDFSKDKHKTVDDRPYGG--------GAGMVLKPEPIFDALESV 74 (247) T ss_pred HHHHHHC-CEEEEEEECCCCCCCCCCCCCCCCCCCC--------CCCCCCCHHHHHHHHHHH T ss_conf 8999754-7056675322000356345256753378--------976504867899999852 No 297 >KOG1739 consensus Probab=20.17 E-value=49 Score=14.70 Aligned_cols=19 Identities=21% Similarity=0.420 Sum_probs=12.4 Q ss_pred EEEEECCCCCEEEECCCCH Q ss_conf 6688869998999848843 Q gi|254780971|r 180 IAGVINRRGNVLGMMPHPE 198 (219) Q Consensus 180 IAgi~s~~G~vlgmMPHPE 198 (219) =|-|.|-+++||++|||-- T Consensus 213 ~atilSvkathlal~~~~~ 231 (611) T KOG1739 213 GATILSVKATHLALFPHVT 231 (611) T ss_pred CCEEEEECCCCEEEECCCC T ss_conf 7456640220000002444 Done!