Query gi|254780420|ref|YP_003064833.1| carbamoyl phosphate synthase small subunit [Candidatus Liberibacter asiaticus str. psy62] Match_columns 396 No_of_seqs 184 out of 3591 Neff 5.7 Searched_HMMs 39220 Date Sun May 29 16:19:40 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780420.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01368 CPSaseIIsmall carbam 100.0 0 0 1236.3 29.1 370 14-391 1-383 (383) 2 CHL00197 carA carbamoyl-phosph 100.0 0 0 1131.7 33.2 376 10-395 4-383 (383) 3 PRK12564 carbamoyl phosphate s 100.0 0 0 1114.1 32.9 352 13-390 4-355 (355) 4 COG0505 CarA Carbamoylphosphat 100.0 0 0 1097.8 33.7 366 12-394 2-367 (368) 5 PRK12838 carbamoyl phosphate s 100.0 0 0 1097.0 31.1 353 13-394 2-355 (356) 6 KOG0370 consensus 100.0 0 0 893.9 26.1 348 14-395 3-358 (1435) 7 pfam00988 CPSase_sm_chain Carb 100.0 0 0 433.9 12.9 130 13-146 2-131 (131) 8 cd01744 GATase1_CPSase Small c 100.0 0 0 422.1 15.8 178 208-386 1-178 (178) 9 PRK05670 anthranilate synthase 100.0 1.3E-42 0 320.8 16.5 182 208-393 2-192 (192) 10 PRK00758 GMP synthase subunit 100.0 4.6E-41 1.4E-45 309.8 16.1 174 207-390 1-182 (184) 11 PRK07649 para-aminobenzoate/an 100.0 7.5E-41 1.4E-45 308.3 16.0 185 208-396 2-195 (195) 12 pfam00117 GATase Glutamine ami 100.0 2E-40 5.6E-45 305.2 13.5 177 209-386 1-185 (187) 13 CHL00101 trpG anthranilate syn 100.0 6.2E-40 1.5E-44 301.8 15.3 178 208-389 2-189 (190) 14 PRK08007 para-aminobenzoate sy 100.0 1E-39 2.7E-44 300.3 15.3 176 208-387 2-186 (187) 15 cd01742 GATase1_GMP_Synthase T 100.0 1.7E-39 4.3E-44 298.7 15.1 172 208-386 1-181 (181) 16 PRK06774 para-aminobenzoate sy 100.0 2.8E-39 7.1E-44 297.2 15.3 177 208-387 2-190 (191) 17 PRK08857 para-aminobenzoate sy 100.0 5.8E-39 1.5E-43 295.0 16.1 177 208-387 2-191 (192) 18 cd01743 GATase1_Anthranilate_S 100.0 5.5E-39 1.4E-43 295.1 14.4 175 208-386 1-184 (184) 19 PRK13566 anthranilate synthase 100.0 9.2E-37 2.3E-41 279.5 16.8 190 201-393 519-719 (724) 20 TIGR00888 guaA_Nterm GMP synth 100.0 2.7E-37 7E-42 283.2 12.3 177 208-392 1-194 (195) 21 PRK07765 para-aminobenzoate sy 100.0 2.9E-36 7.5E-41 275.9 15.8 180 206-388 1-191 (221) 22 PRK06895 para-aminobenzoate sy 100.0 3.1E-36 7.9E-41 275.8 14.6 177 206-388 3-189 (191) 23 COG0512 PabA Anthranilate/para 100.0 3.2E-35 8.1E-40 268.7 15.6 179 206-388 2-190 (191) 24 PRK09522 bifunctional anthrani 100.0 4.1E-34 1E-38 260.9 15.9 179 206-391 2-192 (531) 25 PRK05637 anthranilate synthase 100.0 7.4E-34 1.9E-38 259.0 16.0 180 206-391 2-207 (208) 26 PRK00074 guaA GMP synthase; Re 100.0 1.4E-32 3.4E-37 250.1 15.6 179 206-393 5-194 (513) 27 TIGR01815 TrpE-clade3 anthrani 100.0 2.6E-31 6.7E-36 241.1 14.7 257 125-392 447-720 (726) 28 TIGR00566 trpG_papA glutamine 100.0 1.1E-31 2.7E-36 243.8 8.8 170 208-387 2-211 (212) 29 KOG0026 consensus 99.9 7.3E-27 1.9E-31 209.8 12.0 183 205-391 18-216 (223) 30 TIGR01823 PabB-fungal para-ami 99.9 1.7E-25 4.4E-30 200.1 9.8 187 204-393 4-246 (851) 31 COG0518 GuaA GMP synthase - Gl 99.9 3.7E-24 9.4E-29 190.8 11.2 175 206-390 2-194 (198) 32 PRK13143 hisH imidazole glycer 99.9 6.6E-23 1.7E-27 182.0 13.6 170 206-390 2-201 (201) 33 cd01745 GATase1_2 Subgroup of 99.9 9.8E-23 2.5E-27 180.8 9.4 149 215-386 20-189 (189) 34 KOG1622 consensus 99.9 4.1E-22 1.1E-26 176.3 10.1 174 205-389 16-204 (552) 35 KOG1224 consensus 99.9 6.5E-22 1.7E-26 174.9 10.2 186 203-396 12-224 (767) 36 PRK13141 hisH imidazole glycer 99.9 7.4E-21 1.9E-25 167.5 14.3 168 207-389 2-203 (204) 37 PRK11366 puuD gamma-glutamyl-g 99.9 8E-21 2E-25 167.3 13.5 172 217-393 29-248 (254) 38 cd01748 GATase1_IGP_Synthase T 99.9 3E-21 7.6E-26 170.3 11.0 164 208-386 1-198 (198) 39 PRK13175 consensus 99.9 2.2E-20 5.6E-25 164.2 14.6 171 206-391 2-206 (206) 40 cd01741 GATase1_1 Subgroup of 99.9 7.5E-21 1.9E-25 167.5 11.8 166 207-386 1-188 (188) 41 pfam07722 Peptidase_C26 Peptid 99.9 5.3E-21 1.4E-25 168.5 10.8 149 218-370 28-219 (219) 42 PRK13181 hisH imidazole glycer 99.8 9.8E-21 2.5E-25 166.7 9.3 166 208-388 2-199 (199) 43 COG2071 Predicted glutamine am 99.8 1.3E-18 3.4E-23 151.7 11.2 167 220-392 32-241 (243) 44 PRK09065 glutamine amidotransf 99.8 6.6E-18 1.7E-22 146.8 11.6 162 206-373 4-192 (238) 45 PRK13180 consensus 99.8 2.2E-17 5.5E-22 143.1 12.5 172 206-389 2-209 (209) 46 PRK13144 consensus 99.7 3.7E-17 9.5E-22 141.5 12.7 165 207-389 2-188 (190) 47 PRK13152 hisH imidazole glycer 99.7 2.2E-17 5.6E-22 143.1 11.3 164 208-388 2-201 (201) 48 PRK07053 glutamine amidotransf 99.7 7.1E-18 1.8E-22 146.5 8.7 150 218-373 18-183 (235) 49 PRK13177 consensus 99.7 3.9E-17 1E-21 141.3 11.8 164 206-387 3-204 (207) 50 PRK06490 glutamine amidotransf 99.7 3.2E-17 8.1E-22 141.9 10.9 165 202-373 8-187 (243) 51 PRK13146 hisH imidazole glycer 99.7 6.8E-17 1.7E-21 139.6 11.9 167 206-387 2-205 (208) 52 PRK13153 consensus 99.7 5.2E-17 1.3E-21 140.5 10.8 164 208-388 2-203 (203) 53 PRK13170 hisH imidazole glycer 99.7 4.1E-17 1E-21 141.2 9.8 165 207-388 2-196 (196) 54 PRK05665 amidotransferase; Pro 99.7 1.7E-16 4.3E-21 136.9 12.9 122 246-373 57-190 (240) 55 PRK08250 glutamine amidotransf 99.7 3.9E-17 1E-21 141.3 8.8 150 218-374 16-185 (235) 56 PRK13172 consensus 99.7 2.5E-16 6.2E-21 135.7 12.7 166 206-388 2-213 (213) 57 PRK13176 consensus 99.7 1.2E-16 3.2E-21 137.8 11.1 167 207-387 2-213 (216) 58 PRK13171 consensus 99.7 1.9E-16 5E-21 136.4 11.8 167 206-389 2-198 (200) 59 PRK05380 pyrG CTP synthetase; 99.7 3.3E-16 8.4E-21 134.8 12.6 263 128-394 190-532 (534) 60 PRK07567 glutamine amidotransf 99.7 1.5E-16 3.9E-21 137.2 10.6 123 245-373 50-194 (242) 61 PRK13178 consensus 99.7 5.3E-16 1.3E-20 133.3 13.1 168 208-391 2-208 (213) 62 PRK13174 consensus 99.7 7.7E-16 2E-20 132.2 13.6 168 206-389 2-209 (212) 63 CHL00188 hisH imidazole glycer 99.7 5.9E-16 1.5E-20 133.0 11.8 163 206-387 2-208 (210) 64 PRK13525 glutamine amidotransf 99.7 1.8E-15 4.6E-20 129.6 13.1 166 206-392 2-190 (191) 65 PRK13179 consensus 99.7 1.5E-15 3.7E-20 130.3 12.6 164 208-388 2-206 (207) 66 PRK13173 consensus 99.7 2.2E-15 5.6E-20 129.0 13.2 167 206-388 2-209 (211) 67 PRK13148 consensus 99.7 2.9E-15 7.5E-20 128.1 13.3 165 206-387 2-222 (225) 68 PRK13151 consensus 99.7 6.4E-16 1.6E-20 132.8 9.9 165 207-389 2-194 (195) 69 COG0118 HisH Glutamine amidotr 99.7 3.1E-15 8E-20 127.9 13.2 168 206-389 2-203 (204) 70 COG0504 PyrG CTP synthase (UTP 99.7 3.3E-15 8.5E-20 127.7 12.9 265 127-395 189-532 (533) 71 TIGR01855 IMP_synth_hisH imida 99.6 5.1E-16 1.3E-20 133.5 8.1 167 208-388 1-211 (211) 72 cd01747 GATase1_Glutamyl_Hydro 99.6 5.7E-15 1.5E-19 126.1 11.5 177 217-393 23-250 (273) 73 PRK13147 consensus 99.6 1.1E-14 2.8E-19 124.1 12.5 164 207-387 2-208 (211) 74 PRK06186 hypothetical protein; 99.6 5.3E-14 1.4E-18 119.2 10.5 141 246-389 53-226 (229) 75 cd01746 GATase1_CTP_Synthase T 99.5 2.4E-14 6.1E-19 121.7 8.4 144 240-386 50-235 (235) 76 PRK13142 hisH imidazole glycer 99.5 6.9E-14 1.7E-18 118.5 10.7 161 208-387 2-186 (192) 77 pfam01174 SNO SNO glutamine am 99.5 1.2E-13 3.2E-18 116.7 11.5 157 216-389 8-186 (188) 78 PRK13527 glutamine amidotransf 99.5 2.2E-13 5.7E-18 114.9 11.8 154 221-390 21-194 (196) 79 cd01749 GATase1_PB Glutamine A 99.5 1E-12 2.6E-17 110.3 11.6 161 208-386 3-183 (183) 80 TIGR00337 PyrG CTP synthase; I 99.4 5.2E-13 1.3E-17 112.3 8.8 187 199-387 302-570 (571) 81 KOG3179 consensus 99.2 3.4E-11 8.8E-16 99.5 7.4 136 232-372 46-195 (245) 82 KOG2387 consensus 98.8 1.5E-08 3.8E-13 80.9 6.9 134 246-382 363-540 (585) 83 KOG1559 consensus 98.8 2.2E-08 5.6E-13 79.7 6.8 153 217-373 80-272 (340) 84 KOG0623 consensus 98.6 3.2E-07 8.1E-12 71.6 8.5 167 208-387 4-206 (541) 85 TIGR01737 FGAM_synth_I phospho 98.4 3.4E-06 8.8E-11 64.3 10.2 180 207-388 2-263 (264) 86 PRK01175 phosphoribosylformylg 98.4 7.8E-06 2E-10 61.8 11.3 186 205-391 3-254 (255) 87 COG0047 PurL Phosphoribosylfor 98.4 2E-05 5.1E-10 58.9 12.6 177 205-388 2-229 (231) 88 PRK03619 phosphoribosylformylg 98.3 2.4E-05 6.2E-10 58.3 12.0 121 266-391 67-222 (223) 89 cd01740 GATase1_FGAR_AT Type 1 98.2 5.2E-05 1.3E-09 56.0 12.2 164 218-384 14-236 (238) 90 PRK13526 glutamine amidotransf 98.0 1.5E-05 3.8E-10 59.8 6.4 153 207-388 4-179 (179) 91 PRK01077 cobyrinic acid a,c-di 97.9 0.00033 8.3E-09 50.4 11.9 88 202-294 242-340 (451) 92 TIGR01382 PfpI intracellular p 97.9 7.5E-06 1.9E-10 61.9 3.3 97 206-311 27-137 (189) 93 cd01750 GATase1_CobQ Type 1 gl 97.9 0.00018 4.7E-09 52.1 10.2 66 221-294 16-89 (194) 94 cd01653 GATase1 Type 1 glutami 97.4 0.00012 3.1E-09 53.4 3.5 73 218-290 16-92 (115) 95 PRK05368 homoserine O-succinyl 97.4 0.0033 8.4E-08 43.3 10.7 148 237-393 90-255 (302) 96 pfam04204 HTS Homoserine O-suc 97.3 0.0067 1.7E-07 41.1 11.2 148 237-393 89-254 (298) 97 COG0311 PDX2 Predicted glutami 97.1 0.0048 1.2E-07 42.1 8.5 151 220-392 16-193 (194) 98 PRK13896 cobyrinic acid a,c-di 97.1 0.027 6.9E-07 36.8 12.1 252 89-388 140-431 (432) 99 pfam07685 GATase_3 CobB/CobQ-l 97.1 0.0027 6.9E-08 43.9 6.9 48 245-294 6-59 (158) 100 cd03128 GAT_1 Type 1 glutamine 96.9 0.0012 3.1E-08 46.3 3.8 73 218-290 16-92 (92) 101 cd03169 GATase1_PfpI_1 Type 1 96.7 0.0024 6.1E-08 44.3 4.4 60 233-293 63-124 (180) 102 COG1492 CobQ Cobyric acid synt 96.3 0.021 5.4E-07 37.6 7.1 216 58-294 85-342 (486) 103 cd03135 GATase1_DJ-1 Type 1 gl 96.0 0.0076 1.9E-07 40.7 3.6 61 233-293 47-109 (163) 104 PRK00784 cobyric acid synthase 96.0 0.018 4.5E-07 38.1 5.4 156 117-294 158-344 (492) 105 cd03140 GATase1_PfpI_3 Type 1 95.8 0.013 3.4E-07 39.0 4.2 64 230-294 45-108 (170) 106 KOG3210 consensus 95.6 0.18 4.6E-06 31.0 9.2 75 218-297 28-111 (226) 107 cd03130 GATase1_CobB Type 1 gl 95.4 0.11 2.7E-06 32.6 7.5 78 212-295 10-93 (198) 108 pfam01965 DJ-1_PfpI DJ-1/PfpI 95.4 0.017 4.3E-07 38.3 3.4 63 231-293 19-83 (141) 109 TIGR01369 CPSaseII_lrg carbamo 94.8 0.13 3.3E-06 32.1 6.7 164 109-285 471-681 (1089) 110 cd03134 GATase1_PfpI_like A ty 94.3 0.044 1.1E-06 35.3 3.3 60 233-293 49-110 (165) 111 PRK11574 hypothetical protein; 94.1 0.074 1.9E-06 33.8 4.1 65 232-296 52-118 (196) 112 PRK06278 cobyrinic acid a,c-di 94.1 0.11 2.7E-06 32.6 4.9 13 279-291 68-80 (482) 113 TIGR01857 FGAM-synthase phosph 93.8 0.2 5E-06 30.8 5.8 161 202-372 1013-1255(1279) 114 COG3442 Predicted glutamine am 93.7 0.052 1.3E-06 34.9 2.7 164 221-393 28-219 (250) 115 KOG2764 consensus 92.7 0.14 3.5E-06 31.8 3.6 66 230-296 52-119 (247) 116 PRK02006 murD UDP-N-acetylmura 92.6 0.32 8.2E-06 29.3 5.3 34 93-127 72-105 (501) 117 cd03131 GATase1_HTS Type 1 glu 92.5 0.16 4.1E-06 31.4 3.7 63 238-302 54-124 (175) 118 TIGR00313 cobQ cobyric acid sy 92.2 0.11 2.7E-06 32.7 2.5 45 246-294 287-339 (502) 119 COG0693 ThiJ Putative intracel 91.8 0.2 5.1E-06 30.7 3.5 63 233-295 53-117 (188) 120 PRK05297 phosphoribosylformylg 91.1 1.5 3.8E-05 24.6 9.0 89 201-293 1034-1143(1294) 121 cd03141 GATase1_Hsp31_like Typ 90.7 0.22 5.5E-06 30.5 2.7 61 234-294 78-140 (221) 122 cd03132 GATase1_catalase Type 90.1 0.27 6.9E-06 29.8 2.8 61 234-294 50-112 (142) 123 cd06315 PBP1_ABC_sugar_binding 89.3 0.71 1.8E-05 26.8 4.5 64 214-283 17-87 (280) 124 cd05296 GH4_P_beta_glucosidase 88.8 0.33 8.4E-06 29.2 2.5 66 239-311 128-197 (419) 125 PRK02231 ppnK inorganic polyph 88.6 0.46 1.2E-05 28.2 3.1 31 109-139 34-66 (272) 126 KOG0538 consensus 88.0 2.5 6.3E-05 23.0 7.8 93 205-299 200-304 (363) 127 cd06312 PBP1_ABC_sugar_binding 88.0 1.4 3.6E-05 24.7 5.3 71 222-297 150-230 (271) 128 cd05298 GH4_GlvA_pagL_like Gly 87.9 0.44 1.1E-05 28.3 2.7 73 233-311 120-197 (437) 129 KOG0292 consensus 87.9 0.59 1.5E-05 27.4 3.3 15 280-294 457-471 (1202) 130 COG4607 CeuA ABC-type enteroch 87.8 0.93 2.4E-05 26.0 4.3 48 202-253 55-125 (320) 131 PRK02645 ppnK inorganic polyph 87.7 0.61 1.5E-05 27.3 3.3 19 106-124 19-37 (304) 132 PRK12815 carB carbamoyl phosph 86.7 2.9 7.4E-05 22.6 6.9 144 128-284 480-660 (1068) 133 cd06305 PBP1_methylthioribose_ 86.4 1.2 3.1E-05 25.2 4.3 48 240-292 173-225 (273) 134 PRK02155 ppnK inorganic polyph 86.3 1.1 2.7E-05 25.6 3.9 36 91-128 8-43 (291) 135 PRK01390 murD UDP-N-acetylmura 85.9 1.7 4.2E-05 24.3 4.7 24 117-140 113-141 (457) 136 cd03137 GATase1_AraC_1 AraC tr 85.3 1.3 3.2E-05 25.1 3.9 59 233-293 53-112 (187) 137 PRK00141 murD UDP-N-acetylmura 84.6 2.6 6.6E-05 22.9 5.2 30 93-127 79-108 (476) 138 TIGR01836 PHA_synth_III_C poly 84.5 1.6 4.1E-05 24.4 4.1 73 210-294 86-162 (367) 139 cd01538 PBP1_ABC_xylose_bindin 84.5 1.5 3.7E-05 24.7 3.9 41 246-291 190-231 (288) 140 PRK03369 murD UDP-N-acetylmura 84.1 3 7.8E-05 22.4 5.4 30 93-127 73-102 (487) 141 cd03139 GATase1_PfpI_2 Type 1 83.3 2 5.2E-05 23.6 4.2 58 234-293 52-110 (183) 142 PRK10936 periplasmic sensory p 82.7 2.8 7.2E-05 22.6 4.7 53 240-298 220-275 (340) 143 cd06317 PBP1_ABC_sugar_binding 82.7 1.9 4.8E-05 23.9 3.8 84 204-292 124-229 (275) 144 PRK03806 murD UDP-N-acetylmura 82.4 4 0.0001 21.6 5.4 24 117-140 104-132 (438) 145 PRK09701 D-allose transporter 82.3 1.7 4.5E-05 24.1 3.6 53 240-297 208-263 (311) 146 COG1797 CobB Cobyrinic acid a, 82.1 2.7 6.8E-05 22.8 4.5 90 206-301 246-350 (451) 147 PRK03372 ppnK inorganic polyph 81.9 1.8 4.6E-05 24.0 3.5 17 106-122 20-36 (303) 148 cd03133 GATase1_ES1 Type 1 glu 81.9 4.6 0.00012 21.1 6.0 86 214-305 55-152 (213) 149 cd06319 PBP1_ABC_sugar_binding 81.8 2.6 6.5E-05 22.9 4.3 68 219-291 147-226 (277) 150 PRK04308 murD UDP-N-acetylmura 81.7 3.5 8.9E-05 22.0 4.9 22 119-140 111-137 (445) 151 cd05297 GH4_alpha_glucosidase_ 81.6 1.3 3.3E-05 25.0 2.7 81 218-311 114-198 (423) 152 cd06318 PBP1_ABC_sugar_binding 81.2 2.2 5.6E-05 23.4 3.8 50 241-295 183-235 (282) 153 PRK01438 murD UDP-N-acetylmura 81.2 3.7 9.5E-05 21.8 4.9 30 93-127 80-109 (481) 154 PRK03378 ppnK inorganic polyph 80.8 2 5.1E-05 23.7 3.4 33 90-124 7-39 (292) 155 PRK11197 lldD L-lactate dehydr 80.5 5.1 0.00013 20.8 5.5 98 211-310 228-346 (381) 156 PRK02649 ppnK inorganic polyph 80.2 1.8 4.6E-05 24.0 3.0 36 91-128 4-39 (305) 157 cd03332 LMO_FMN L-Lactate 2-mo 80.1 5.3 0.00013 20.7 7.1 86 211-299 236-334 (383) 158 PRK10355 xylF D-xylose transpo 80.0 2.4 6.1E-05 23.1 3.6 47 240-291 202-252 (330) 159 PRK10710 DNA-binding transcrip 80.0 5.1 0.00013 20.9 5.3 94 202-299 7-107 (240) 160 PRK01231 ppnK inorganic polyph 79.8 2 5.2E-05 23.6 3.2 36 91-128 7-42 (296) 161 cd03136 GATase1_AraC_ArgR_like 79.7 3.1 8E-05 22.3 4.1 58 235-294 55-112 (185) 162 COG2403 Predicted GTPase [Gene 79.3 5.6 0.00014 20.6 6.2 29 113-141 119-157 (449) 163 PRK04539 ppnK inorganic polyph 79.1 2.9 7.5E-05 22.5 3.8 33 90-124 7-39 (296) 164 PRK03708 ppnK inorganic polyph 78.8 2.5 6.4E-05 23.0 3.4 37 92-130 4-40 (278) 165 cd06309 PBP1_YtfQ_like Peripla 78.6 3.5 8.8E-05 22.0 4.1 40 246-290 185-227 (273) 166 cd06310 PBP1_ABC_sugar_binding 78.4 3.2 8E-05 22.3 3.8 43 246-293 183-226 (273) 167 TIGR00329 gcp metalloendopepti 78.4 1.2 2.9E-05 25.4 1.6 88 246-349 74-164 (337) 168 cd02067 B12-binding B12 bindin 77.9 2.5 6.4E-05 23.0 3.2 84 212-298 13-105 (119) 169 PRK04759 consensus 77.8 3.3 8.3E-05 22.2 3.7 35 90-128 7-41 (294) 170 cd05197 GH4_glycoside_hydrolas 77.7 2.3 5.9E-05 23.3 2.9 82 218-311 112-197 (425) 171 PRK05294 carB carbamoyl phosph 77.4 6.3 0.00016 20.2 6.6 75 202-284 550-659 (1063) 172 cd04737 LOX_like_FMN L-Lactate 77.4 3.4 8.6E-05 22.1 3.7 82 216-299 209-302 (351) 173 cd06313 PBP1_ABC_sugar_binding 77.2 3.1 7.8E-05 22.4 3.5 52 240-296 175-228 (272) 174 cd06316 PBP1_ABC_sugar_binding 77.2 3 7.7E-05 22.4 3.4 89 204-297 125-233 (294) 175 PRK01911 ppnK inorganic polyph 77.0 2.7 6.9E-05 22.8 3.2 16 242-257 169-184 (290) 176 COG0771 MurD UDP-N-acetylmuram 76.7 6.4 0.00016 20.1 5.0 24 117-140 109-137 (448) 177 cd03147 GATase1_Ydr533c_like T 76.4 3.5 8.9E-05 22.0 3.6 57 236-292 84-142 (231) 178 PRK00683 murD UDP-N-acetylmura 76.1 4.9 0.00013 20.9 4.3 25 116-140 100-129 (418) 179 cd02072 Glm_B12_BD B12 binding 75.3 2.4 6.2E-05 23.1 2.5 45 212-257 13-61 (128) 180 TIGR01163 rpe ribulose-phospha 74.8 7.3 0.00019 19.7 4.9 96 126-234 93-197 (216) 181 cd02071 MM_CoA_mut_B12_BD meth 74.6 2.4 6.2E-05 23.1 2.4 45 212-257 13-61 (122) 182 PRK10955 DNA-binding transcrip 74.6 7.4 0.00019 19.7 4.9 89 207-299 3-97 (232) 183 cd06301 PBP1_rhizopine_binding 74.6 5 0.00013 20.9 4.0 43 246-293 183-227 (272) 184 PRK11411 fecB iron-dicitrate t 74.4 7.5 0.00019 19.7 5.1 17 237-253 222-238 (297) 185 PRK02472 murD UDP-N-acetylmura 73.8 7.7 0.0002 19.6 5.2 30 111-140 105-139 (450) 186 cd06302 PBP1_LsrB_Quorum_Sensi 73.7 4.5 0.00012 21.2 3.6 48 240-292 177-227 (298) 187 cd02809 alpha_hydroxyacid_oxid 73.4 4.8 0.00012 21.0 3.6 82 217-299 161-253 (299) 188 cd01139 TroA_f Periplasmic bin 73.0 7.8 0.0002 19.5 4.7 22 236-257 223-244 (342) 189 cd06322 PBP1_ABC_sugar_binding 72.8 6 0.00015 20.3 4.0 34 246-284 179-212 (267) 190 cd03138 GATase1_AraC_2 AraC tr 72.8 5.2 0.00013 20.8 3.7 57 235-293 60-120 (195) 191 PRK11083 DNA-binding response 72.5 8 0.0002 19.4 4.6 91 206-299 4-101 (229) 192 CHL00148 orf27 Ycf27; Reviewed 71.9 5.8 0.00015 20.5 3.8 93 203-299 4-103 (240) 193 PRK02261 methylaspartate mutas 71.9 3.3 8.5E-05 22.1 2.5 70 212-284 17-92 (137) 194 cd06289 PBP1_MalI_like Ligand- 71.4 8.1 0.00021 19.4 4.4 49 336-393 204-252 (268) 195 cd06314 PBP1_tmGBP Periplasmic 71.3 6.8 0.00017 20.0 4.0 53 240-297 171-226 (271) 196 PRK13856 two-component respons 70.7 9 0.00023 19.1 5.0 90 206-299 2-99 (241) 197 PRK11780 isoprenoid biosynthes 70.5 6.2 0.00016 20.2 3.7 86 214-305 58-155 (217) 198 PRK01368 murD UDP-N-acetylmura 70.2 9.2 0.00024 19.0 5.1 27 115-141 101-132 (450) 199 cd06282 PBP1_GntR_like_2 Ligan 70.1 9.3 0.00024 19.0 4.7 25 361-390 223-247 (266) 200 KOG1273 consensus 70.0 9.3 0.00024 19.0 5.0 52 97-151 26-77 (405) 201 cd02922 FCB2_FMN Flavocytochro 69.4 6.6 0.00017 20.0 3.6 86 211-299 196-297 (344) 202 cd06308 PBP1_sensor_kinase_lik 68.8 7.9 0.0002 19.5 3.9 34 246-284 182-216 (270) 203 cd06306 PBP1_TorT-like TorT-li 68.7 5.3 0.00013 20.7 3.0 17 275-291 208-224 (268) 204 cd01140 FatB Siderophore bindi 68.6 9.9 0.00025 18.8 4.8 20 236-255 193-212 (270) 205 cd06321 PBP1_ABC_sugar_binding 68.4 7.4 0.00019 19.7 3.7 34 246-284 181-214 (271) 206 TIGR03659 IsdE heme ABC transp 67.8 10 0.00026 18.7 5.5 65 66-143 73-139 (289) 207 PRK10653 D-ribose transporter 67.1 9.2 0.00023 19.0 4.0 34 246-284 207-240 (295) 208 cd01537 PBP1_Repressors_Sugar_ 67.0 9.6 0.00025 18.9 4.1 55 92-148 3-63 (264) 209 TIGR01383 not_thiJ DJ-1 family 66.9 5.2 0.00013 20.8 2.7 89 200-293 23-115 (186) 210 PRK09393 ftrA transcriptional 66.7 11 0.00028 18.5 4.3 57 236-294 65-121 (320) 211 COG1214 Inactive homolog of me 66.2 7.6 0.00019 19.6 3.4 68 246-322 58-126 (220) 212 PRK11625 Rho-binding antitermi 66.1 11 0.00027 18.6 4.1 55 14-74 26-81 (84) 213 PRK08227 aldolase; Validated 65.4 7.1 0.00018 19.8 3.1 16 238-253 126-141 (291) 214 TIGR01316 gltA glutamate synth 65.2 5.4 0.00014 20.6 2.5 48 215-262 190-245 (462) 215 cd06300 PBP1_ABC_sugar_binding 65.0 8.7 0.00022 19.2 3.5 23 119-141 62-87 (272) 216 cd01539 PBP1_GGBP Periplasmic 64.6 9.8 0.00025 18.8 3.7 88 200-292 134-248 (303) 217 cd01540 PBP1_arabinose_binding 64.5 11 0.00029 18.4 4.0 47 240-290 184-235 (289) 218 cd00958 DhnA Class I fructose- 64.4 8.2 0.00021 19.4 3.3 68 220-291 149-219 (235) 219 pfam09897 DUF2124 Uncharacteri 64.2 6.4 0.00016 20.1 2.7 58 240-303 74-134 (147) 220 PRK10957 iron-enterobactin tra 64.2 12 0.00031 18.2 5.5 22 237-258 106-127 (319) 221 cd06320 PBP1_allose_binding Pe 64.1 12 0.00029 18.3 4.0 45 241-290 175-222 (275) 222 cd04815 PA_M28_2 PA_M28_2: Pro 63.9 6.6 0.00017 20.0 2.7 111 26-137 7-126 (134) 223 pfam04848 Pox_A22 Poxvirus A22 63.3 6.7 0.00017 20.0 2.7 53 207-260 2-66 (143) 224 COG1486 CelF Alpha-galactosida 63.2 8.4 0.00022 19.3 3.2 162 103-312 13-202 (442) 225 pfam01070 FMN_dh FMN-dependent 63.1 11 0.00027 18.6 3.7 77 220-299 161-250 (301) 226 COG2185 Sbm Methylmalonyl-CoA 62.2 13 0.00033 18.0 4.0 54 204-258 11-75 (143) 227 COG0595 mRNA degradation ribon 61.8 8.3 0.00021 19.4 2.9 45 90-139 67-113 (555) 228 pfam02056 Glyco_hydro_4 Family 61.5 9.7 0.00025 18.9 3.2 49 239-293 128-176 (183) 229 smart00732 YqgFc Likely ribonu 61.4 13 0.00034 17.9 5.1 48 206-253 1-58 (99) 230 cd06292 PBP1_LacI_like_10 Liga 61.3 13 0.00034 17.9 4.7 10 220-229 114-123 (273) 231 cd05797 Ribosomal_L10 Ribosoma 60.5 14 0.00035 17.8 3.8 92 207-302 21-127 (157) 232 pfam01976 DUF116 Protein of un 60.3 12 0.0003 18.3 3.5 69 209-285 66-138 (158) 233 cd03465 URO-D_like The URO-D _ 59.2 13 0.00033 18.0 3.5 50 206-255 222-271 (330) 234 PRK13131 consensus 58.5 15 0.00038 17.5 5.5 36 208-243 119-157 (257) 235 PRK01710 murD UDP-N-acetylmura 58.1 15 0.00039 17.5 5.0 32 113-145 112-148 (458) 236 cd01141 TroA_d Periplasmic bin 57.8 15 0.00039 17.5 5.9 42 235-284 58-99 (186) 237 PRK13117 consensus 56.1 16 0.00042 17.3 7.7 33 208-240 126-161 (268) 238 TIGR01184 ntrCD nitrate ABC tr 55.6 16 0.00041 17.3 3.6 55 201-257 128-193 (230) 239 cd06344 PBP1_ABC_ligand_bindin 55.4 15 0.00038 17.6 3.3 163 93-284 42-224 (332) 240 cd01148 TroA_a Metal binding p 55.0 17 0.00043 17.2 6.8 18 236-253 213-230 (284) 241 PRK07226 fructose-bisphosphate 55.0 17 0.00043 17.2 3.8 68 220-291 166-236 (266) 242 TIGR03261 phnS2 putative 2-ami 54.7 8.8 0.00022 19.2 2.1 40 108-148 36-79 (334) 243 PRK04885 ppnK inorganic polyph 54.5 10 0.00027 18.6 2.4 19 240-258 141-159 (265) 244 pfam01513 NAD_kinase ATP-NAD k 54.2 12 0.0003 18.2 2.7 25 92-118 3-27 (243) 245 PRK04690 murD UDP-N-acetylmura 53.5 18 0.00046 17.0 5.0 26 119-145 116-146 (468) 246 PRK04663 murD UDP-N-acetylmura 53.1 18 0.00046 16.9 4.9 32 109-140 99-135 (438) 247 PRK02705 murD UDP-N-acetylmura 52.8 18 0.00047 16.9 3.8 25 115-139 106-135 (459) 248 cd06360 PBP1_alkylbenzenes_lik 52.6 18 0.00047 16.9 4.0 172 92-289 40-230 (336) 249 PRK03670 competence damage-ind 52.2 18 0.00046 17.0 3.3 11 247-257 61-71 (252) 250 PRK12778 putative bifunctional 51.8 19 0.00048 16.9 3.4 41 222-262 498-541 (760) 251 pfam02844 GARS_N Phosphoribosy 51.7 19 0.00049 16.8 3.7 38 238-282 49-90 (99) 252 cd01425 RPS2 Ribosomal protein 50.9 15 0.00038 17.6 2.7 35 243-285 124-158 (193) 253 cd03148 GATase1_EcHsp31_like T 50.7 12 0.0003 18.3 2.2 52 242-293 92-145 (232) 254 COG4977 Transcriptional regula 50.6 20 0.00051 16.7 3.7 57 236-294 68-125 (328) 255 KOG1210 consensus 50.1 12 0.00031 18.2 2.2 79 219-297 199-281 (331) 256 COG4090 Uncharacterized protei 49.2 19 0.00048 16.8 3.1 60 233-296 71-133 (154) 257 PRK10161 transcriptional regul 48.5 21 0.00054 16.5 5.2 91 207-299 4-102 (229) 258 PRK09605 O-sialoglycoprotein e 48.5 13 0.00034 17.9 2.2 12 378-389 501-512 (536) 259 pfam08660 Alg14 Oligosaccharid 48.3 10 0.00026 18.7 1.6 17 240-257 82-98 (166) 260 PRK00421 murC UDP-N-acetylmura 47.1 22 0.00057 16.3 5.2 31 111-141 101-136 (459) 261 pfam10281 Ish1 Putative stress 46.6 14 0.00035 17.8 2.0 32 107-138 5-37 (38) 262 PRK06395 phosphoribosylamine-- 46.5 23 0.00058 16.3 4.4 30 111-140 109-142 (435) 263 COG1879 RbsB ABC-type sugar tr 46.2 22 0.00057 16.3 3.0 81 60-145 34-122 (322) 264 cd06267 PBP1_LacI_sugar_bindin 45.9 23 0.00059 16.2 4.4 12 109-120 19-30 (264) 265 TIGR01892 AcOrn-deacetyl acety 45.8 20 0.0005 16.7 2.7 50 336-386 229-280 (386) 266 cd03144 GATase1_ScBLP_like Typ 44.6 16 0.0004 17.4 2.0 65 219-288 18-88 (114) 267 PRK12779 putative bifunctional 44.6 24 0.00062 16.1 3.2 42 221-262 364-408 (944) 268 PRK10576 iron-hydroxamate tran 44.3 25 0.00063 16.0 4.2 48 205-256 32-103 (292) 269 PRK04020 rps2P 30S ribosomal p 44.0 21 0.00053 16.5 2.6 33 245-285 113-145 (204) 270 COG0521 MoaB Molybdopterin bio 43.7 19 0.00049 16.8 2.4 72 219-294 32-111 (169) 271 PRK09468 ompR osmolarity respo 43.7 25 0.00064 16.0 3.9 48 206-253 6-56 (239) 272 cd06311 PBP1_ABC_sugar_binding 43.2 25 0.00065 15.9 3.6 47 93-141 37-87 (274) 273 COG1004 Ugd Predicted UDP-gluc 43.1 26 0.00065 15.9 3.2 37 245-284 162-198 (414) 274 KOG2278 consensus 43.1 18 0.00046 17.0 2.2 49 128-178 11-61 (207) 275 PRK08745 ribulose-phosphate 3- 42.9 26 0.00066 15.9 4.0 26 209-234 174-199 (223) 276 PRK00561 ppnK inorganic polyph 42.7 21 0.00053 16.5 2.4 18 240-257 130-147 (259) 277 cd04730 NPD_like 2-Nitropropan 42.6 26 0.00066 15.9 6.5 158 122-300 9-183 (236) 278 KOG1834 consensus 42.1 17 0.00044 17.1 1.9 44 198-241 212-255 (952) 279 PRK10416 cell division protein 42.0 27 0.00068 15.8 6.6 60 219-281 315-381 (499) 280 PRK05299 rpsB 30S ribosomal pr 41.9 25 0.00063 16.0 2.7 31 246-284 157-187 (255) 281 cd06296 PBP1_CatR_like Ligand- 41.8 27 0.00068 15.8 4.6 28 92-120 3-30 (270) 282 cd00886 MogA_MoaB MogA_MoaB fa 41.1 21 0.00052 16.6 2.2 70 219-294 25-103 (152) 283 TIGR01011 rpsB_bact ribosomal 41.0 27 0.00068 15.8 2.8 31 246-284 157-187 (227) 284 PRK10610 chemotaxis regulatory 40.6 28 0.00071 15.6 4.0 51 203-253 3-57 (129) 285 KOG4842 consensus 40.4 20 0.0005 16.7 2.0 22 118-139 33-54 (278) 286 PRK09722 allulose-6-phosphate 40.0 28 0.00072 15.6 4.2 93 127-234 94-195 (227) 287 TIGR01686 FkbH FkbH domain; In 39.8 14 0.00035 17.8 1.1 108 129-256 42-152 (337) 288 PRK08883 ribulose-phosphate 3- 39.6 29 0.00073 15.5 4.3 94 126-234 93-195 (220) 289 pfam02992 Transposase_21 Trans 39.5 16 0.0004 17.4 1.4 45 228-274 64-112 (226) 290 TIGR02320 PEP_mutase phosphoen 39.3 27 0.00068 15.8 2.6 57 88-145 34-105 (272) 291 PRK08261 fabG 3-ketoacyl-(acyl 39.2 29 0.00074 15.5 5.5 102 110-235 128-239 (447) 292 pfam02310 B12-binding B12 bind 38.7 30 0.00076 15.4 3.9 40 213-253 15-58 (121) 293 pfam03652 UPF0081 Uncharacteri 38.7 30 0.00076 15.4 4.0 52 207-262 2-63 (134) 294 COG0533 QRI7 Metal-dependent p 38.0 25 0.00064 16.0 2.3 49 246-294 71-121 (342) 295 PRK11173 two-component respons 37.7 31 0.00078 15.3 5.5 49 207-255 5-56 (237) 296 COG2247 LytB Putative cell wal 37.5 26 0.00066 15.9 2.3 22 125-146 61-82 (337) 297 cd06355 PBP1_FmdD_like Peripla 37.4 31 0.00079 15.3 5.2 172 92-292 42-232 (348) 298 PRK06375 consensus 37.2 31 0.0008 15.3 3.4 16 66-81 139-154 (381) 299 COG0061 nadF NAD kinase [Coenz 37.0 31 0.00079 15.3 2.6 12 243-254 162-173 (281) 300 COG0036 Rpe Pentose-5-phosphat 36.8 32 0.00081 15.2 4.2 94 125-234 95-197 (220) 301 KOG1467 consensus 36.6 32 0.00082 15.2 5.5 80 200-286 380-470 (556) 302 TIGR01812 sdhA_frdA_Gneg succi 36.2 12 0.00031 18.2 0.4 16 354-371 252-268 (636) 303 cd06843 PLPDE_III_PvsE_like Ty 36.1 32 0.00083 15.2 3.9 38 109-148 39-76 (377) 304 PRK06720 hypothetical protein; 36.1 29 0.00075 15.5 2.4 15 219-233 32-46 (169) 305 pfam01256 Carb_kinase Carbohyd 36.1 33 0.00083 15.2 3.3 49 246-294 166-219 (241) 306 pfam07073 ROF Modulator of Rho 36.0 33 0.00083 15.2 4.5 55 12-72 18-78 (80) 307 COG1118 CysA ABC-type sulfate/ 35.7 33 0.00084 15.1 3.3 55 206-262 156-225 (345) 308 PRK10816 DNA-binding transcrip 35.7 33 0.00084 15.1 5.1 79 218-299 15-98 (223) 309 pfam00814 Peptidase_M22 Glycop 35.2 15 0.00038 17.5 0.8 45 246-290 49-95 (225) 310 PRK06252 methylcobalamin:coenz 35.2 34 0.00085 15.1 3.6 52 208-260 233-284 (339) 311 cd03307 Mta_CmuA_like MtaA_Cmu 35.2 34 0.00086 15.1 3.5 53 207-260 224-276 (326) 312 pfam12257 DUF3608 Protein of u 35.0 24 0.0006 16.1 1.7 43 248-290 231-273 (281) 313 pfam03575 Peptidase_S51 Peptid 34.9 34 0.00086 15.0 3.0 46 92-140 13-58 (154) 314 PRK04761 ppnK inorganic polyph 34.9 29 0.00075 15.5 2.2 22 106-127 6-27 (246) 315 cd03146 GAT1_Peptidase_E Type 34.7 34 0.00087 15.0 3.5 68 222-292 54-129 (212) 316 cd04736 MDH_FMN Mandelate dehy 34.7 34 0.00087 15.0 3.8 86 212-299 220-315 (361) 317 PRK10693 response regulator of 34.7 33 0.00085 15.1 2.5 40 119-163 60-99 (337) 318 cd06331 PBP1_AmiC_like Type I 34.4 34 0.00088 15.0 3.2 82 206-291 133-230 (333) 319 PRK11070 ssDNA exonuclease Rec 34.4 29 0.00075 15.5 2.1 34 208-241 130-164 (574) 320 pfam06218 NPR2 Nitrogen permea 34.4 24 0.00061 16.1 1.7 58 51-115 44-109 (400) 321 CHL00067 rps2 ribosomal protei 34.4 35 0.00088 15.0 2.8 31 246-284 157-187 (227) 322 PRK13789 phosphoribosylamine-- 34.2 35 0.00089 15.0 4.4 28 112-139 113-143 (426) 323 cd06298 PBP1_CcpA_like Ligand- 34.0 35 0.00089 14.9 4.4 23 269-292 168-190 (268) 324 cd02146 NfsA_FRP This family c 33.9 16 0.00041 17.3 0.8 140 134-288 4-162 (229) 325 TIGR02633 xylG D-xylose ABC tr 33.8 34 0.00087 15.0 2.4 13 218-230 182-194 (501) 326 PRK00109 Holliday junction res 33.5 36 0.00091 14.9 3.8 53 206-262 4-66 (141) 327 pfam00853 Runt Runt domain. 33.2 14 0.00036 17.7 0.4 25 100-127 26-50 (134) 328 cd01149 HutB Hemin binding pro 32.9 36 0.00093 14.8 3.0 20 237-256 182-201 (235) 329 cd06283 PBP1_RegR_EndR_KdgR_li 32.8 36 0.00093 14.8 4.6 55 91-147 2-62 (267) 330 cd06285 PBP1_LacI_like_7 Ligan 32.8 36 0.00093 14.8 4.7 14 111-124 70-83 (265) 331 pfam00318 Ribosomal_S2 Ribosom 32.8 36 0.00093 14.8 2.6 32 246-285 137-168 (205) 332 PRK08005 ribulose-phosphate 3- 32.5 37 0.00094 14.8 4.6 104 112-234 74-191 (210) 333 cd01541 PBP1_AraR Ligand-bindi 32.2 37 0.00095 14.7 4.8 11 219-229 112-122 (273) 334 cd01144 BtuF Cobalamin binding 32.0 18 0.00045 17.0 0.7 21 236-256 176-196 (245) 335 COG1058 CinA Predicted nucleot 32.0 38 0.00096 14.7 3.1 57 219-277 26-89 (255) 336 TIGR01467 cobI_cbiL precorrin- 31.8 22 0.00056 16.4 1.2 42 204-245 99-148 (243) 337 pfam07090 DUF1355 Protein of u 31.8 32 0.00081 15.2 2.0 47 207-254 23-74 (183) 338 TIGR01328 met_gam_lyase methio 31.7 38 0.00097 14.7 2.3 47 213-260 108-157 (392) 339 cd06270 PBP1_GalS_like Ligand 31.6 38 0.00097 14.7 4.5 54 92-147 3-62 (268) 340 cd01542 PBP1_TreR_like Ligand- 31.6 38 0.00097 14.7 4.7 11 219-229 105-115 (259) 341 TIGR03099 dCO2ase_PEP1 pyridox 31.4 38 0.00098 14.7 5.4 38 109-148 62-99 (398) 342 COG3220 Uncharacterized protei 31.0 24 0.00061 16.1 1.2 40 315-354 171-213 (282) 343 cd06274 PBP1_FruR Ligand bindi 30.9 39 0.001 14.6 4.7 21 209-229 96-117 (264) 344 KOG2708 consensus 30.7 28 0.00072 15.6 1.6 50 245-294 69-120 (336) 345 PRK10765 nitroreductase A; Pro 30.7 20 0.00051 16.7 0.8 48 133-181 6-53 (240) 346 cd06324 PBP1_ABC_sugar_binding 30.4 40 0.001 14.5 3.9 16 241-256 195-212 (305) 347 PRK00124 hypothetical protein; 30.3 10 0.00026 18.7 -0.8 81 213-297 10-93 (149) 348 PRK05417 glutathione-dependent 30.3 35 0.0009 14.9 2.0 37 355-391 107-144 (191) 349 PRK11749 putative oxidoreducta 30.2 40 0.001 14.5 4.9 38 222-260 199-239 (460) 350 pfam00834 Ribul_P_3_epim Ribul 29.9 41 0.001 14.5 4.5 26 209-234 169-194 (201) 351 COG2243 CobF Precorrin-2 methy 29.8 23 0.0006 16.2 1.0 41 204-244 89-135 (234) 352 PRK11914 diacylglycerol kinase 29.7 41 0.001 14.5 3.9 42 110-152 28-74 (304) 353 COG2242 CobL Precorrin-6B meth 29.5 41 0.001 14.4 4.2 82 205-292 58-145 (187) 354 TIGR01982 UbiB 2-polyprenylphe 29.5 29 0.00073 15.6 1.4 79 206-294 309-394 (452) 355 PRK10701 DNA-binding transcrip 29.2 42 0.0011 14.4 4.9 89 207-299 3-98 (240) 356 KOG0196 consensus 29.2 29 0.00073 15.5 1.4 47 209-258 472-525 (996) 357 PRK03815 murD UDP-N-acetylmura 29.2 42 0.0011 14.4 3.9 25 118-142 89-118 (401) 358 PRK00099 rplJ 50S ribosomal pr 29.1 42 0.0011 14.4 3.9 83 207-293 24-121 (164) 359 cd06278 PBP1_LacI_like_2 Ligan 29.0 42 0.0011 14.4 4.3 11 219-229 106-116 (266) 360 TIGR02461 osmo_MPG_phos mannos 29.0 41 0.0011 14.4 2.2 40 206-245 88-142 (248) 361 cd01147 HemV-2 Metal binding p 28.8 42 0.0011 14.4 2.4 23 236-258 202-224 (262) 362 PRK00286 xseA exodeoxyribonucl 28.4 43 0.0011 14.3 3.6 86 202-288 132-235 (443) 363 cd03129 GAT1_Peptidase_E_like 28.4 43 0.0011 14.3 4.8 87 204-292 28-129 (210) 364 cd01982 Chlide_reductase_Z Chl 28.4 41 0.001 14.5 2.0 27 219-245 177-204 (412) 365 TIGR01469 cobA_cysG_Cterm urop 28.2 18 0.00047 16.9 0.2 22 257-279 87-108 (242) 366 TIGR00649 MG423 conserved hypo 28.2 43 0.0011 14.3 3.6 85 207-297 24-117 (593) 367 pfam01497 Peripla_BP_2 Peripla 28.1 43 0.0011 14.3 5.9 20 236-255 184-203 (236) 368 cd01545 PBP1_SalR Ligand-bindi 27.9 44 0.0011 14.3 4.3 28 92-120 3-30 (270) 369 PRK11205 tbpA thiamine transpo 27.9 34 0.00086 15.0 1.6 48 103-150 27-82 (326) 370 PRK10766 DNA-binding transcrip 27.9 44 0.0011 14.3 3.6 90 206-299 3-99 (224) 371 PRK09534 btuF corrinoid ABC tr 27.8 44 0.0011 14.2 3.0 112 109-254 131-257 (364) 372 cd01146 FhuD Fe3+-siderophore 27.7 44 0.0011 14.2 4.7 21 236-256 189-209 (256) 373 cd06831 PLPDE_III_ODC_like_AZI 27.5 44 0.0011 14.2 2.3 54 109-165 49-102 (394) 374 KOG1538 consensus 27.4 45 0.0011 14.2 3.9 19 14-32 147-165 (1081) 375 TIGR02541 flagell_FlgJ flagell 27.3 10 0.00026 18.7 -1.2 15 274-294 201-215 (332) 376 pfam02601 Exonuc_VII_L Exonucl 27.1 45 0.0012 14.2 3.5 17 58-74 41-57 (295) 377 TIGR00700 GABAtrnsam 4-aminobu 27.1 36 0.00092 14.8 1.6 92 208-304 42-150 (427) 378 TIGR02155 PA_CoA_ligase phenyl 26.9 28 0.00071 15.6 1.0 24 230-253 146-174 (431) 379 cd01143 YvrC Periplasmic bindi 26.9 45 0.0012 14.1 3.6 17 236-252 179-195 (195) 380 TIGR01681 HAD-SF-IIIC HAD-supe 26.9 40 0.001 14.5 1.8 38 219-256 54-99 (179) 381 COG0391 Uncharacterized conser 26.8 46 0.0012 14.1 3.6 43 240-284 182-228 (323) 382 TIGR03676 aRF1/eRF1 peptide ch 26.7 39 0.00098 14.6 1.7 48 246-295 208-261 (403) 383 pfam03262 Corona_6B_7B Coronav 26.7 24 0.00061 16.1 0.6 114 207-333 51-182 (209) 384 cd06333 PBP1_ABC-type_HAAT_lik 26.6 46 0.0012 14.1 4.0 161 93-283 42-220 (312) 385 cd06290 PBP1_LacI_like_9 Ligan 26.5 46 0.0012 14.1 4.3 29 92-121 3-31 (265) 386 COG0131 HisB Imidazoleglycerol 26.4 20 0.00052 16.6 0.2 23 281-304 72-94 (195) 387 cd01171 YXKO-related B.subtili 26.4 47 0.0012 14.1 3.5 30 109-138 69-103 (254) 388 pfam10563 CdCA1 Cadmium carbon 26.3 35 0.0009 14.9 1.4 35 218-252 5-39 (209) 389 cd00008 53EXOc 5'-3' exonuclea 26.1 25 0.00063 16.0 0.6 26 122-148 33-58 (240) 390 COG2235 ArcA Arginine deiminas 26.0 26 0.00066 15.9 0.7 11 208-218 166-176 (409) 391 cd00379 Ribosomal_L10_P0 Ribos 25.9 47 0.0012 14.0 3.7 82 207-292 19-115 (155) 392 PRK03803 murD UDP-N-acetylmura 25.8 48 0.0012 14.0 6.0 27 114-140 105-136 (448) 393 cd06273 PBP1_GntR_like_1 This 25.5 48 0.0012 14.0 4.7 12 109-120 19-30 (268) 394 PRK13235 nifH nitrogenase redu 25.5 48 0.0012 14.0 2.7 57 203-260 27-91 (274) 395 COG0052 RpsB Ribosomal protein 25.5 48 0.0012 14.0 2.5 31 246-284 156-186 (252) 396 PRK08818 prephenate dehydrogen 25.5 48 0.0012 14.0 2.0 29 34-65 100-128 (373) 397 COG3340 PepE Peptidase E [Amin 25.1 49 0.0012 13.9 2.8 47 91-140 61-107 (224) 398 cd03145 GAT1_cyanophycinase Ty 25.1 49 0.0012 13.9 5.0 89 204-294 28-134 (217) 399 PRK09482 xni exonuclease IX; P 25.1 49 0.0013 13.9 3.4 21 128-148 35-55 (256) 400 KOG3323 consensus 24.9 49 0.0013 13.9 2.2 24 366-391 92-115 (149) 401 COG1935 Uncharacterized conser 24.8 50 0.0013 13.9 2.9 41 213-253 7-48 (122) 402 TIGR01465 cobM_cbiF precorrin- 24.5 32 0.00081 15.3 0.9 85 206-293 72-171 (252) 403 cd06280 PBP1_LacI_like_4 Ligan 24.5 50 0.0013 13.8 3.2 27 92-119 3-29 (263) 404 COG1671 Uncharacterized protei 24.5 18 0.00046 17.0 -0.4 69 213-285 11-82 (150) 405 PRK00750 lysK lysyl-tRNA synth 24.4 50 0.0013 13.8 1.9 21 37-73 29-49 (513) 406 pfam04187 DUF399 Protein of un 24.4 50 0.0013 13.8 2.3 27 90-116 13-39 (213) 407 KOG4328 consensus 24.4 46 0.0012 14.1 1.7 54 200-253 204-261 (498) 408 PRK03967 histidinol-phosphate 24.2 51 0.0013 13.8 4.5 54 223-278 111-170 (337) 409 PRK05990 precorrin-2 C(20)-met 24.2 28 0.00071 15.6 0.6 51 205-256 98-161 (241) 410 PRK05948 precorrin-2 methyltra 24.1 27 0.00069 15.7 0.5 57 205-261 92-163 (238) 411 cd04865 LigD_Pol_like_2 LigD_P 24.0 49 0.0013 13.9 1.8 55 218-275 73-129 (228) 412 smart00475 53EXOc 5'-3' exonuc 23.9 33 0.00085 15.1 0.9 26 122-148 32-57 (259) 413 pfam01702 TGT Queuine tRNA-rib 23.8 52 0.0013 13.7 4.1 69 238-306 73-145 (238) 414 cd00532 MGS-like MGS-like doma 23.7 52 0.0013 13.7 3.9 66 217-282 32-104 (112) 415 PRK09615 ggt gamma-glutamyltra 23.7 47 0.0012 14.0 1.7 14 279-292 422-435 (581) 416 PRK03379 vitamin B12-transport 23.7 52 0.0013 13.7 2.4 78 40-142 44-125 (265) 417 cd06287 PBP1_LacI_like_8 Ligan 23.5 52 0.0013 13.7 4.2 25 268-292 167-191 (269) 418 COG3243 PhaC Poly(3-hydroxyalk 23.4 35 0.00089 15.0 0.9 81 208-296 120-201 (445) 419 COG4586 ABC-type uncharacteriz 23.3 53 0.0013 13.7 2.9 53 204-256 173-234 (325) 420 COG2875 CobM Precorrin-4 methy 23.3 47 0.0012 14.1 1.6 93 213-313 89-190 (254) 421 pfam06511 IpaD Invasion plasmi 23.3 53 0.0013 13.7 2.2 19 378-396 311-329 (337) 422 cd04861 LigD_Pol_like LigD_Pol 23.2 49 0.0012 13.9 1.6 55 218-275 72-128 (227) 423 pfam08918 PhoQ_Sensor PhoQ Sen 23.2 49 0.0013 13.9 1.7 76 112-225 102-177 (180) 424 PRK11248 tauB taurine transpor 23.0 41 0.001 14.5 1.2 52 206-259 147-209 (255) 425 COG4825 Uncharacterized membra 22.8 54 0.0014 13.6 3.1 186 11-256 102-302 (395) 426 pfam09998 DUF2239 Uncharacteri 22.8 54 0.0014 13.6 2.1 42 90-140 72-113 (173) 427 cd02070 corrinoid_protein_B12- 22.8 54 0.0014 13.6 2.4 84 212-299 96-188 (201) 428 PRK09604 putative DNA-binding/ 22.7 51 0.0013 13.8 1.7 48 246-293 72-121 (335) 429 PTZ00340 O-sialoglycoprotein e 22.6 54 0.0014 13.6 1.8 46 246-291 77-124 (348) 430 PRK13606 LPPG:FO 2-phospho-L-l 22.3 55 0.0014 13.6 3.4 38 246-285 186-226 (309) 431 cd02065 B12-binding_like B12 b 22.1 56 0.0014 13.5 2.8 39 218-256 18-60 (125) 432 TIGR01354 cyt_deam_tetra cytid 22.1 56 0.0014 13.5 2.9 72 204-281 19-109 (135) 433 PRK13984 putative oxidoreducta 21.9 56 0.0014 13.5 3.4 40 221-261 341-383 (604) 434 PRK07337 aminotransferase; Val 21.7 57 0.0014 13.5 3.7 13 247-260 164-176 (388) 435 cd01142 TroA_e Periplasmic bin 21.5 57 0.0015 13.4 1.8 21 236-256 209-229 (289) 436 pfam01921 tRNA-synt_1f tRNA sy 21.5 50 0.0013 13.8 1.4 21 37-73 19-39 (355) 437 PRK12775 putative trifunctiona 21.5 57 0.0015 13.4 3.2 42 221-262 490-535 (993) 438 PRK01213 phosphoribosylformylg 21.5 55 0.0014 13.6 1.6 40 20-76 149-188 (736) 439 KOG1907 consensus 21.5 44 0.0011 14.3 1.1 88 203-294 1056-1164(1320) 440 cd06275 PBP1_PurR Ligand-bindi 21.5 57 0.0015 13.4 3.6 12 379-390 238-249 (269) 441 PRK01917 hemerythrin family pr 21.4 57 0.0014 13.5 1.7 14 219-232 124-137 (142) 442 TIGR01922 purO_arch IMP cycloh 21.4 56 0.0014 13.5 1.7 71 221-311 31-113 (209) 443 cd00674 LysRS_core_class_I Thi 21.4 56 0.0014 13.5 1.7 22 36-73 18-39 (354) 444 PRK06425 histidinol-phosphate 21.4 58 0.0015 13.4 3.0 35 225-260 98-135 (332) 445 TIGR01735 FGAM_synt phosphorib 21.2 42 0.0011 14.3 1.0 166 119-294 1015-1234(1401) 446 COG0615 TagD Cytidylyltransfer 21.1 58 0.0015 13.4 2.6 23 234-256 74-96 (140) 447 PRK08574 cystathionine gamma-s 21.1 58 0.0015 13.4 4.4 16 374-389 366-381 (384) 448 cd07062 Peptidase_S66_mccF_lik 20.9 59 0.0015 13.4 2.2 59 49-125 59-122 (308) 449 COG0159 TrpA Tryptophan syntha 20.8 59 0.0015 13.4 3.8 36 208-243 126-164 (265) 450 PRK12809 putative oxidoreducta 20.8 59 0.0015 13.4 3.2 40 221-261 368-410 (639) 451 PRK06848 hypothetical protein; 20.8 59 0.0015 13.3 2.5 12 279-292 95-106 (139) 452 PTZ00170 D-ribulose-5-phosphat 20.7 59 0.0015 13.3 5.3 25 209-233 173-197 (224) 453 TIGR01087 murD UDP-N-acetylmur 20.7 59 0.0015 13.3 4.6 38 108-146 96-142 (476) 454 TIGR03351 PhnX-like phosphonat 20.6 59 0.0015 13.3 7.5 32 114-147 80-111 (220) 455 pfam07801 DUF1647 Protein of u 20.5 60 0.0015 13.3 3.6 48 206-253 90-142 (142) 456 PRK12769 putative oxidoreducta 20.4 60 0.0015 13.3 3.4 39 222-261 386-427 (654) 457 PTZ00254 40S ribosomal protein 20.4 60 0.0015 13.3 2.5 33 245-285 117-149 (242) 458 cd02069 methionine_synthase_B1 20.3 60 0.0015 13.3 2.7 41 212-254 102-147 (213) 459 COG2984 ABC-type uncharacteriz 20.3 60 0.0015 13.3 2.6 115 202-321 157-296 (322) 460 KOG1252 consensus 20.3 45 0.0012 14.2 1.0 60 221-280 144-215 (362) 461 pfam06520 consensus 20.2 57 0.0014 13.5 1.5 70 33-126 8-80 (186) 462 TIGR01992 PTS-IIBC-Tre PTS sys 20.2 25 0.00065 15.9 -0.3 34 276-314 198-237 (489) 463 PRK10953 cysJ sulfite reductas 20.2 61 0.0016 13.3 3.4 37 107-143 127-180 (599) 464 pfam03446 NAD_binding_2 NAD bi 20.1 36 0.00092 14.8 0.5 27 207-233 3-30 (163) 465 TIGR00551 nadB L-aspartate oxi 20.0 33 0.00085 15.1 0.3 30 355-384 269-300 (546) 466 PRK05568 flavodoxin; Provision 20.0 61 0.0016 13.2 3.0 33 218-252 21-54 (142) No 1 >TIGR01368 CPSaseIIsmall carbamoyl-phosphate synthase, small subunit; InterPro: IPR006274 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents the small subunit of the glutamine-dependent form (6.3.5.5 from EC) of carbamoyl phosphate synthase, CPSase II. The small subunit catalyses the hydrolysis of glutamine to ammonia, which in turn used by the large chain to synthesize carbamoyl phosphate. The C-terminal domain of the small subunit of CPSase has glutamine amidotransferase activity. Note that the sequence from the mammalian urea cycle form has lost the active site Cys, resulting in an ammonia-dependent form, CPSase I (6.3.4.16 from EC). CPSases of pyrimidine biosynthesis, arginine biosynthesis, and the urea cycle may be encoded by one or by several genes, depending on the species. ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=100.00 E-value=0 Score=1236.27 Aligned_cols=370 Identities=47% Similarity=0.784 Sum_probs=350.3 Q ss_pred EEEEECCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHH---------CCCCC Q ss_conf 689972898799997567624888898537888876014161425617321689736845882342---------04653 Q gi|254780420|r 14 AVLVLADGSVIEGMGCGATGSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVGVNSEDF---------ESISR 84 (396) Q Consensus 14 A~L~LedGt~f~G~~fG~~~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyGvn~~d~---------ES~~~ 84 (396) |+|+|||||+|.|+|||++++++||||||||||||||+||||||+|||||||||+|||||||++|+ || T Consensus 1 A~LvLeDGt~f~G~sFGa~~~~~GE~VFnTgMTGYQE~lTDPSY~GQIvv~TyP~IGNyG~n~~d~rDe~~LkYfES--- 77 (383) T TIGR01368 1 AYLVLEDGTVFEGYSFGAEGTVAGEVVFNTGMTGYQEILTDPSYKGQIVVFTYPLIGNYGVNDEDARDELGLKYFES--- 77 (383) T ss_pred CEEECCCCCEEEEEEECCCCCEEEEEEEECCCCCCCEEEECHHHCCHHHHHCCCCCCCCCCCHHHHCCCCCCEEEEE--- T ss_conf 97781378777888845666548889883689774203206200323776068752431478035200002303432--- Q ss_pred CC-CCEEEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHH Q ss_conf 33-21023666404654434552417989999742783211257689999998539842366327654589789888774 Q gi|254780420|r 85 KN-FKGAVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAK 163 (396) Q Consensus 85 ~~-~~~~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~ 163 (396) . +|+|+|+||||++..+|||||.+||++||++++||||+||||||||||||++|+|+|+|++++ .+.++++++++ T Consensus 78 -~~~i~v~G~vv~e~~~~~Snwra~~SL~~~L~~~~ipgI~GvDTRaLv~~iR~~G~m~g~I~~~~---~~~~~~~~~~~ 153 (383) T TIGR01368 78 -KGKIHVKGLVVRELSDRYSNWRATESLDQFLKEHGIPGIYGVDTRALVKKIREKGTMKGVISTED---SNDEELVEKAK 153 (383) T ss_pred -CCEEEEEEEEEECCCCCCCHHHHHCCHHHHHHHCCCEEEECCCHHHHHHHHHHCCCEEEEEEECC---CCHHHHHHHHH T ss_conf -88348999985412367871556504899998569847304577999999863585478996067---67035677865 Q ss_pred -HCCCCCCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHH Q ss_conf -1588743210110133332111686534454344346777771499996796410232222202454133046574464 Q gi|254780420|r 164 -NWSGLKGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDI 242 (396) Q Consensus 164 -~~~~~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i 242 (396) ..|.+.+.||+++|||+++|.|....|...........++.+++|||||||+|+||||+|.+|||+|+||||++++++| T Consensus 154 f~~p~~~~~nLv~~Vst~~~~~~~~~~~~~~~~~~~~~~g~~~~~Vv~~DfG~K~nIlr~L~~rG~~v~vVP~~~~~~~i 233 (383) T TIGR01368 154 FESPDIDGINLVAEVSTKEPYTWGQGTWNLWKFFVAKRNGKKPLRVVVIDFGVKQNILRRLVKRGCEVTVVPYDTDAEEI 233 (383) T ss_pred HHCCCCCCCCCCCCEECCCCEEECCCCCHHHHHHHHCCCCCCCEEEEEEECCCCHHHHHHHHHCCCEEEEECCCCCHHHH T ss_conf 42467577652010011465560688641664322113689735999985588755898997669879996789988999 Q ss_pred HHCCCCEEEECCCCCCC-CCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEE Q ss_conf 41357889985888884-10112110122210257863478610058878752854466776752101123477897099 Q gi|254780420|r 243 MSLQPDGVLLSNGPGDP-AVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVE 321 (396) Q Consensus 243 ~~~~pdgi~lSnGPGdP-~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~ 321 (396) ++++||||||||||||| +.+.. .++.++++++.++|||||||||||||||+||+|+||||||||.||||+|+.||||+ T Consensus 234 ~~~~PDGiflSNGPGDPH~~~~~-~i~~~~~l~~~~~P~fGICLGHQllALA~Ga~TyKlKFGHRG~NhPV~~~~~g~v~ 312 (383) T TIGR01368 234 KKYNPDGIFLSNGPGDPHAAVEP-AIETVRELLEAKIPIFGICLGHQLLALAFGAKTYKLKFGHRGGNHPVKDLRTGRVE 312 (383) T ss_pred HHHCCCEEEEECCCCCCHHHHHH-HHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCEEECCCCCCCCCCCEEECCCCEEE T ss_conf 86578889990879880567899-99999999844897100266689999973887000465678875131442798799 Q ss_pred EECCCCCEEEEHHHCC-CCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 9614576157034369-982899998899808726867997789834878877670016899999999997 Q gi|254780420|r 322 IVSMNHGFAIDASSLP-AGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHK 391 (396) Q Consensus 322 iTsqNHg~~v~~~slp-~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~ 391 (396) |||||||||||++||+ .+|+|||+|+||+|||||+|+++|+|||||||||+||||||.||||+|+++|++ T Consensus 313 ITsQNHGyAVD~~sl~~~~~~~th~nLNDgt~EG~~h~~~P~fSVQyHPEA~PGPhD~~~lFd~F~~~~~~ 383 (383) T TIGR01368 313 ITSQNHGYAVDEESLLAGDLEVTHVNLNDGTVEGIRHKDLPVFSVQYHPEASPGPHDTEYLFDEFIDLIKK 383 (383) T ss_pred EEEECCCEECCHHHCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 99706871257535578853899983597933153005878667700688888700237999999999819 No 2 >CHL00197 carA carbamoyl-phosphate synthase arginine-specific small subunit; Provisional Probab=100.00 E-value=0 Score=1131.69 Aligned_cols=376 Identities=43% Similarity=0.706 Sum_probs=351.4 Q ss_pred CCCCEEEEECCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHCCCCCCCCCE Q ss_conf 78966899728987999975676248888985378888760141614256173216897368458823420465333210 Q gi|254780420|r 10 KKPTAVLVLADGSVIEGMGCGATGSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVGVNSEDFESISRKNFKG 89 (396) Q Consensus 10 ~~~~A~L~LedGt~f~G~~fG~~~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyGvn~~d~ES~~~~~~~~ 89 (396) +-.+|+|+||||++|+|++||++++++||||||||||||||+||||||+|||||||||||||||+|.+|+|| .++| T Consensus 4 ~~~~A~LvLedG~~f~G~~fG~~~~~~GEvVFnT~MtGYqE~lTDPSY~gQIl~fTyP~IGNyGin~~d~ES----~~i~ 79 (383) T CHL00197 4 KMIPAILVLEDGTVYRGWSFSNPITSIGEVVFNTGMTGYQEIMTDPSYNEQIVTFTYPEIGNTGINLDDIES----VKIQ 79 (383) T ss_pred CCCCEEEEECCCCEEEEEECCCCCCEEEEEEEECCCCCCCCCCCCCCCCCHHHHCCCCCCEEECCCCHHCCC----CCCC T ss_conf 567689996699989879748777677899997997876820058452344022166773460799212035----6764 Q ss_pred EEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCC Q ss_conf 23666404654434552417989999742783211257689999998539842366327654589789888774158874 Q gi|254780420|r 90 AVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLK 169 (396) Q Consensus 90 ~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~ 169 (396) ++|+|||+++..||||||+.||++||++++||||+|||||||||+||++|+|+|+|+++ +.+.+++.++++++++++ T Consensus 80 ~~G~Ivre~~~~~Sn~rs~~sL~~wL~~~~IpgI~gIDTRaLTk~IR~~G~m~g~I~~~---~~~~~~l~~~~~~~~~~~ 156 (383) T CHL00197 80 IKGIIAKNISPSSSNWRSQESLVSYLKRHKIPFIFGIDTRALVRHLRRFGTMNGCISNQ---NLNLSYLEAKIKELPSME 156 (383) T ss_pred EEEEEECCCCCCCCCHHHHCCHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCEEEEECC---CCCHHHHHHHHHHCCCCC T ss_conf 68999456667888532115899999975986264787899999999649824999079---899899999987489965 Q ss_pred CCCHHHCCCCCCCEECCCCCCCCCC--CCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCC Q ss_conf 3210110133332111686534454--34434677777149999679641023222220245413304657446441357 Q gi|254780420|r 170 GAELAKHVTVSQRRDWSEKIWKWGE--ETSFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQP 247 (396) Q Consensus 170 ~~dL~~~Vs~~~~~~~~~~~~~~~~--~~~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~p 247 (396) +.||+++|||+++|.|.+....... .......++.++|||+||||+|+||||+|.+|||+|+||||++++++|++++| T Consensus 157 ~~dLv~~VSt~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~VvviD~GiK~nILr~L~~~g~~v~VvP~~~~~~~I~~~~P 236 (383) T CHL00197 157 GLDLAPSVTTSSYYEWDEPSHPKWFYLADNKRPHSSYPLRIIVIDFGVKYNILRRLKSFGCEITIVPATSTYQDIMAYQP 236 (383) T ss_pred CCCEEEEECCCCCEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHCCCEEEEECCCCCCCHHHCCCC T ss_conf 78542485168877706887654321343444588887459999677718899999978998999899985113425688 Q ss_pred CEEEECCCCCCCCCCCCCCHHHHHHHCC-CCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEEECCC Q ss_conf 8899858888841011211012221025-786347861005887875285446677675210112347789709996145 Q gi|254780420|r 248 DGVLLSNGPGDPAVTSVYSSPIICKLVD-SGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEIVSMN 326 (396) Q Consensus 248 dgi~lSnGPGdP~~~~~~~~~~~~~~~~-~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTsqN 326 (396) ||||||||||||+.+.. .++.++++++ +++|||||||||||||+|+||+|+||||||||+||||+ .|||||||||| T Consensus 237 DGIfLSNGPGDP~~~~~-~i~~i~~li~~~~iPifGICLGHQlLalA~GakT~KmkfGHrG~NhPV~--~t~kv~ITSQN 313 (383) T CHL00197 237 DGILLSNGPGDPSTVHY-GIKTVKKLIKKANIPIFGICMGHQILSLALEAKTFKLKFGHRGLNHPSG--LNQQVEITSQN 313 (383) T ss_pred CEEEECCCCCCHHHHHH-HHHHHHHHHHCCCCCEEEECHHHHHHHHHCCCEEEECCCCCCCCCCCCC--CCCCEEEEECC T ss_conf 88997899999789899-9999999996489978995588999999739837566888779886601--58757996278 Q ss_pred CCEEEEHHHCCCC-CEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 7615703436998-28999988998087268679977898348788776700168999999999973578 Q gi|254780420|r 327 HGFAIDASSLPAG-LEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRKTM 395 (396) Q Consensus 327 Hg~~v~~~slp~~-~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k~~ 395 (396) |||+||++|||.+ +++||+|+||+|||||+|+++|+|||||||||+|||||+.||||+|++||+++|+. T Consensus 314 HGyaVd~~sl~~~~~~vt~~nlnD~tvEG~~~~~~p~fsvQfHPEa~pGP~Ds~~lF~~F~~~~~~~k~~ 383 (383) T CHL00197 314 HGFAVNAESLAKNKVYVTHFNLNDGTVAGISHSPKPYFSVQYHPEASPGPHDADYLFEYFIEIMKHSKSS 383 (383) T ss_pred CCEEECCCCCCCCCCEEEEECCCCCCCCCEEECCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 7617674447887318999938889614869889987887748898989873088999999999976189 No 3 >PRK12564 carbamoyl phosphate synthase small subunit; Reviewed Probab=100.00 E-value=0 Score=1114.10 Aligned_cols=352 Identities=55% Similarity=0.932 Sum_probs=339.6 Q ss_pred CEEEEECCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHCCCCCCCCCEEEE Q ss_conf 66899728987999975676248888985378888760141614256173216897368458823420465333210236 Q gi|254780420|r 13 TAVLVLADGSVIEGMGCGATGSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVGVNSEDFESISRKNFKGAVG 92 (396) Q Consensus 13 ~A~L~LedGt~f~G~~fG~~~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyGvn~~d~ES~~~~~~~~~~g 92 (396) +|+|+||||++|+|++||++++++||||||||||||||+||||||+|||||||||||||||+|.+|+|| .++|++| T Consensus 4 ka~LiLedGt~f~G~~fG~~~~~~GEvVFnT~MtGYqE~lTDPSY~gQIl~fTyP~IGNyGv~~~d~ES----~~~~~~G 79 (355) T PRK12564 4 KAYLVLEDGTVFEGKAFGAEGETVGEVVFNTSMTGYQEILTDPSYAGQIVTFTYPLIGNYGVNREDFES----DRIHAKG 79 (355) T ss_pred CEEEEECCCCEEEEEEECCCCCEEEEEEEECCCCCCCCCCCCCCCCCHHHEECCCCCCCCCCCHHHHCC----CCCCEEE T ss_conf 379995699999889807776578899997998765700228776673000077675153889457155----6755106 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCC Q ss_conf 66404654434552417989999742783211257689999998539842366327654589789888774158874321 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGAE 172 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~d 172 (396) +|||+++..|||||+++||++||++++||||+|||||||||+||++|+|+|+|++++ .+++++++++++.+++.| T Consensus 80 ~Iv~e~~~~~s~~~s~~sL~~~L~~~~IpgI~gVDTRaLtr~iR~~G~m~g~I~~~~-----~~~~~~~~~~~~~~~~~d 154 (355) T PRK12564 80 LIVRELSRIPSNWRSEMSLDDYLKEHGIPGISGIDTRALTRKLREKGAMNGVIATGD-----LEDLLEKARAFPGLEGLD 154 (355) T ss_pred EEECCCCCCCCCCCCCCCHHHHHHHCCCCEEECCCHHHHHHHHHHCCCEEEEEECCC-----HHHHHHHHHHCCCCCCCC T ss_conf 884146778884011069999999779842852538999999986598059994598-----599999998278964566 Q ss_pred HHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEE Q ss_conf 01101333321116865344543443467777714999967964102322222024541330465744644135788998 Q gi|254780420|r 173 LAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLL 252 (396) Q Consensus 173 L~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~l 252 (396) |+++|||+++|.|. +.++|||+||||+|+||||+|.+|||+|+||||++++++|++++|||||| T Consensus 155 lv~~VS~~~~~~~~----------------~~~~~V~viD~GvK~nIlr~L~~rg~~v~VvP~~~~~~~I~~~~pDGIfl 218 (355) T PRK12564 155 LAKEVSTKEPYPWP----------------GGGYHVVAIDFGVKRNILRELAERGCRVTVVPATTTAEEILALNPDGVFL 218 (355) T ss_pred CHHHCCCCCCEECC----------------CCCEEEEEEECCCHHHHHHHHHHCCCEEEEECCCCCHHHHHCCCCCEEEE T ss_conf 55132767767727----------------99718999937873889999998798699968978765764148888996 Q ss_pred CCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEEECCCCCEEEE Q ss_conf 58888841011211012221025786347861005887875285446677675210112347789709996145761570 Q gi|254780420|r 253 SNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEIVSMNHGFAID 332 (396) Q Consensus 253 SnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTsqNHg~~v~ 332 (396) |||||||+.+. ..++.+++++++++||||||||||+||+|+||+|+||||||||+||||+|+.|+|||||||||||+|| T Consensus 219 SNGPGDP~~~~-~~i~~ik~li~~~~PifGICLGHQllalA~Ga~t~Km~fGHrG~NhPV~~l~t~~v~iTSQNHGyaVd 297 (355) T PRK12564 219 SNGPGDPAALD-YAIEMIKELLEGKIPIFGICLGHQLLALALGAKTYKMKFGHRGANHPVKDLETGKVEITSQNHGFAVD 297 (355) T ss_pred CCCCCCHHHHH-HHHHHHHHHHCCCCCEEEECHHHHHHHHHCCCEECCCCCCCCCCCCCCEECCCCEEEEEECCCCEEEC T ss_conf 79999968879-99999999852698789988879999997198442055565789977335578739997167422866 Q ss_pred HHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 3436998289999889980872686799778983487887767001689999999999 Q gi|254780420|r 333 ASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMH 390 (396) Q Consensus 333 ~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~ 390 (396) ++|||++|++||+|+||+|||||+|+++|+|||||||||+|||||+.||||+|++||+ T Consensus 298 ~~sl~~~~~vt~~nlnDgtvEGi~h~~~p~fSVQfHPEa~pGP~D~~~lFd~F~~~i~ 355 (355) T PRK12564 298 EDSLPENLEVTHVNLNDGTVEGLRHKDLPAFSVQYHPEASPGPHDAAYLFDEFVELME 355 (355) T ss_pred CCCCCCCEEEEEEECCCCCCCCEEECCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHC T ss_conf 6658987399999389996721788899878972188999899753889999999759 No 4 >COG0505 CarA Carbamoylphosphate synthase small subunit [Amino acid transport and metabolism / Nucleotide transport and metabolism] Probab=100.00 E-value=0 Score=1097.84 Aligned_cols=366 Identities=52% Similarity=0.860 Sum_probs=349.0 Q ss_pred CCEEEEECCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHCCCCCCCCCEEE Q ss_conf 96689972898799997567624888898537888876014161425617321689736845882342046533321023 Q gi|254780420|r 12 PTAVLVLADGSVIEGMGCGATGSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVGVNSEDFESISRKNFKGAV 91 (396) Q Consensus 12 ~~A~L~LedGt~f~G~~fG~~~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyGvn~~d~ES~~~~~~~~~~ 91 (396) .+|+|+||||++|+|++|||++++.||||||||||||||+||||||+|||||||||||||||+|.+|+|| .+++++ T Consensus 2 ~~a~L~LeDGtvf~G~~fGA~g~~~GEvVFnTsMTGYqE~LTDPSY~gQIv~fTyP~IGNyGvn~~d~Es----~~i~~~ 77 (368) T COG0505 2 MKAYLVLEDGTVFEGYSFGAEGTAVGEVVFNTSMTGYQEILTDPSYKGQIVTFTYPLIGNYGVNDEDFES----DRIHAA 77 (368) T ss_pred CCEEEEECCCCEEEEEECCCCCCEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCHHCCC----CCCEEE T ss_conf 8449996679888768668887577899996787765311568764855899714654302678211124----673288 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCC Q ss_conf 66640465443455241798999974278321125768999999853984236632765458978988877415887432 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGA 171 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~ 171 (396) |+|||+++..+||||+++||++||++++||||+|||||+|||+||++|+|+|+|++++ ..+.+.+..++++|+.+.+. T Consensus 78 G~vvre~~~~~Sn~ra~~sL~~~Lk~~gipgI~GIDTRaLtr~iR~~G~m~~~I~~~~--~~~~~~~~~~~~~~~~~~~~ 155 (368) T COG0505 78 GLVVRELSERPSNWRATESLDEYLKEEGIPGIAGIDTRALTRKIREKGAMKGVIATGP--ELDPAKLLERARAFPGILGT 155 (368) T ss_pred EEEECCCCCCCCCCCCCCCHHHHHHHCCCCCEECCCHHHHHHHHHHCCCCCEEEECCC--CCCHHHHHHHHHHCCCCCCC T ss_conf 9997236666575230058999999769977505518999999986697313761376--66868888887623898766 Q ss_pred CHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEE Q ss_conf 10110133332111686534454344346777771499996796410232222202454133046574464413578899 Q gi|254780420|r 172 ELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVL 251 (396) Q Consensus 172 dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~ 251 (396) ||+++||++++|.|.. +.. ....+++||++|||+|+||||+|.+|||+|+||||++++++|++++||||| T Consensus 156 dlv~~VSt~~~~~~~~--~~~--------~~~~~~~Vv~iD~GvK~nIlr~L~~rg~~vtVVP~~t~~eeIl~~~pDGif 225 (368) T COG0505 156 DLVKEVSTKEPYTWPG--LNG--------GGEPGKHVVVIDFGVKRNILRELVKRGCRVTVVPADTSAEEILALNPDGIF 225 (368) T ss_pred CCCCEEECCCCEECCC--CCC--------CCCCCCEEEEEECCCCHHHHHHHHHCCCEEEEECCCCCHHHHHHHCCCEEE T ss_conf 5400565588444366--566--------777776799997673177999999779849997698999999851899799 Q ss_pred ECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEEECCCCCEEE Q ss_conf 85888884101121101222102578634786100588787528544667767521011234778970999614576157 Q gi|254780420|r 252 LSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEIVSMNHGFAI 331 (396) Q Consensus 252 lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTsqNHg~~v 331 (396) ||||||||+.++ ..++.+++++..+|||||||||||+||||+||+|+||||||||+||||+|+.||||+|||||||||| T Consensus 226 lSNGPGDP~~~~-~~i~~ik~l~~~~iPifGICLGHQllalA~Ga~T~KmkFGHrG~NhPV~dl~tgrv~ITSQNHGyaV 304 (368) T COG0505 226 LSNGPGDPAPLD-YAIETIKELLGTKIPIFGICLGHQLLALALGAKTYKMKFGHRGANHPVKDLDTGRVYITSQNHGYAV 304 (368) T ss_pred EECCCCCHHHHH-HHHHHHHHHHCCCCCEEEECHHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCCEEEEEECCCCEEC T ss_conf 748998806778-9999999986468983788688899998667864320368877896763165880899946886013 Q ss_pred EHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 034369982899998899808726867997789834878877670016899999999997357 Q gi|254780420|r 332 DASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRKT 394 (396) Q Consensus 332 ~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k~ 394 (396) |+++++...++||+|+||+|+||++|+++|+|||||||||+|||||+.||||+|++||++.|. T Consensus 305 d~~s~~~~~~vth~nlnDgTvEGi~h~~~P~fSVQ~HPEAsPGPhDt~ylFd~Fi~~~~~~~~ 367 (368) T COG0505 305 DEDSLVETLKVTHVNLNDGTVEGIRHKDLPAFSVQYHPEASPGPHDTRYLFDEFIELMEAAKK 367 (368) T ss_pred CHHHCCCCCEEEEEECCCCCCCCEECCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 822327776058981799970024527986378714899998986447799999999987524 No 5 >PRK12838 carbamoyl phosphate synthase small subunit; Reviewed Probab=100.00 E-value=0 Score=1096.96 Aligned_cols=353 Identities=41% Similarity=0.716 Sum_probs=327.6 Q ss_pred CEEEEECCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHCCCCCCCCCEEEE Q ss_conf 66899728987999975676248888985378888760141614256173216897368458823420465333210236 Q gi|254780420|r 13 TAVLVLADGSVIEGMGCGATGSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVGVNSEDFESISRKNFKGAVG 92 (396) Q Consensus 13 ~A~L~LedGt~f~G~~fG~~~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyGvn~~d~ES~~~~~~~~~~g 92 (396) +|+|+||||++|+|++||++++++||||||||||||||+||||||+|||||||||||||||+|.+|+|| .++|++| T Consensus 2 ka~L~LeDGt~f~G~~fG~~~~~~GEvVFnT~MtGYqE~lTDPSY~gQIv~~TyP~IGNyGvn~~d~ES----~~i~~~G 77 (356) T PRK12838 2 KAYLILEDGTTFEGELFGAPIDVTGELVFNTGMTGYQEVLTDPSYKGQIIVFTYPLIGNYGINASDYES----KQPTVKG 77 (356) T ss_pred CEEEEECCCCEEEEEEECCCCCEEEEEEEECCCCCCCCCCCCCCCCCHHHHHCCCCCCCCCCCHHHCCC----CCCEEEE T ss_conf 279996799999879707777688999996997876710158651234233166675343889157055----5750428 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCC Q ss_conf 66404654434552417989999742783211257689999998539842366327654589789888774158874321 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGAE 172 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~d 172 (396) +|||+++..||||||++||++||++++||||+|||||||||+||++|+|+|+|++..+. ...+++. ......+ T Consensus 78 ~Ivre~~~~~Sn~rs~~sL~~~L~~~~IpgI~gIDTRaLTr~IR~~G~m~g~I~~~~~~-~~~~~~~------~~~~~~~ 150 (356) T PRK12838 78 VIVYEAAREGSHYRAKQSLDDYLKEKNIPGISGVDTRALVKHLREKGTMKASITTTDDE-HAFDQIQ------ALVLPFN 150 (356) T ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEEECCCHHHHHHHHHHCCCCEEEEECCCCC-CCHHHCC------CCCCCCC T ss_conf 99513578888555647999999976985274464899999998658814899358876-6500101------4112456 Q ss_pred HHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEE Q ss_conf 01101333321116865344543443467777714999967964102322222024541330465744644135788998 Q gi|254780420|r 173 LAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLL 252 (396) Q Consensus 173 L~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~l 252 (396) ++..||++++|.| ++.++|||+||||+|+||||+|.+|||+|+||||++++++|++++|||||| T Consensus 151 ~~~~vs~~~~~~~----------------~~~~~~V~viD~GvK~nIlr~L~~rg~~V~VvP~~~~~~~i~~~~pDGifl 214 (356) T PRK12838 151 VQAVVSTKEPYAA----------------GNGGKHVALIDFGLKHSILRSLSKRGCNVTVLPYNASLEDIKNLNPDGIVL 214 (356) T ss_pred CCEEEECCCCEEC----------------CCCCEEEEEEECCCHHHHHHHHHHCCCEEEEECCCCCCCHHHCCCCCEEEE T ss_conf 6348955764887----------------999757999958836999999997898899989988754023249748994 Q ss_pred CCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEEECCCCCEEEE Q ss_conf 58888841011211012221025786347861005887875285446677675210112347789709996145761570 Q gi|254780420|r 253 SNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEIVSMNHGFAID 332 (396) Q Consensus 253 SnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTsqNHg~~v~ 332 (396) |||||||+.+.. .++.+++++ +++||||||||||+||+|+||+|+||||||||+||||+|+.|||||||||||||+|| T Consensus 215 SNGPGDP~~~~~-~i~~vr~l~-~~~PifGICLGHQllalA~Gakt~KmkfGHrG~NhPV~~~~tgkv~ITSQNHGyaVd 292 (356) T PRK12838 215 SNGPGDPKELQP-YLPTIKDLA-SSYPILGICLGHQLIALALGAETYKLPFGHRGANHPVIDLATGRVWMTSQNHGYVVD 292 (356) T ss_pred CCCCCCHHHHHH-HHHHHHHHH-CCCCEEEECHHHHHHHHHHCCEEEECCCCCCCCCCCCEECCCCCEEEEECCCCCEEC T ss_conf 389989688788-999999997-498889974889999997097486415566887826477689948997268761677 Q ss_pred HHHCCC-CCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 343699-82899998899808726867997789834878877670016899999999997357 Q gi|254780420|r 333 ASSLPA-GLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRKT 394 (396) Q Consensus 333 ~~slp~-~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k~ 394 (396) ++|||. +|++||+|+||+|||||+|+++|+|||||||||+|||||+.||||+|++||+++|. T Consensus 293 ~~sl~~~~~~vt~~nlnD~tveGi~~~~~p~fsVQfHPEa~PGP~D~~~lF~~F~~~~~~~k~ 355 (356) T PRK12838 293 EDSIDEKQLSVTFFNLNDGSIEGLRHKKLPVLSVQFHPEAHPGPHDAEYIFDEFLEMMGEARR 355 (356) T ss_pred CCCCCCCCEEEEEECCCCCCEEEEEECCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 676898865999990899966107988998688712889998987438899999999998736 No 6 >KOG0370 consensus Probab=100.00 E-value=0 Score=893.92 Aligned_cols=348 Identities=39% Similarity=0.666 Sum_probs=327.6 Q ss_pred EEEEECCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCH--HH------HCCCCCC Q ss_conf 689972898799997567624888898537888876014161425617321689736845882--34------2046533 Q gi|254780420|r 14 AVLVLADGSVIEGMGCGATGSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVGVNS--ED------FESISRK 85 (396) Q Consensus 14 A~L~LedGt~f~G~~fG~~~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyGvn~--~d------~ES~~~~ 85 (396) +.|+||||++|.|++||+++.+.||+||||||+||+|+||||||+||||+||||+|||||++. .| +|| T Consensus 3 s~L~LeDGsv~~G~SFGA~~svaGE~VFqTgmvGYpEslTDPSY~gQiLv~T~PlIGNyGVP~~~~DE~l~~~fES---- 78 (1435) T KOG0370 3 STLVLEDGSVLPGYSFGAPKSVAGELVFQTGMVGYPESLTDPSYKGQILVFTYPLIGNYGVPPDARDEGLLKHFES---- 78 (1435) T ss_pred EEEEECCCCEECCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCC---- T ss_conf 2799625875035235888751357999627767751137925154389983244157899987433234343326---- Q ss_pred CCCEEEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHC Q ss_conf 32102366640465443455241798999974278321125768999999853984236632765458978988877415 Q gi|254780420|r 86 NFKGAVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNW 165 (396) Q Consensus 86 ~~~~~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~ 165 (396) .++|+.|+||++++..+|||++.+||.+||++++||||+|||||+|||+|||+|+|.|.|..+.+.. .| T Consensus 79 ~~I~vaglVV~~ys~~ysHW~a~~SL~eWlq~~gVp~i~gvDTRaLtk~lReqGSmLgkl~~e~~~~-----------~~ 147 (1435) T KOG0370 79 GQIHVAGLVVGEYSIEYSHWLATKSLGEWLQEEGVPGIYGVDTRALTKKLREQGSMLGKLSIEKSPV-----------LF 147 (1435) T ss_pred CCEEEEEEEHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCC-----------CC T ss_conf 9558999984240665124121402999997659984226548999999986476236898337887-----------41 Q ss_pred CCCCCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHC Q ss_conf 88743210110133332111686534454344346777771499996796410232222202454133046574464413 Q gi|254780420|r 166 SGLKGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSL 245 (396) Q Consensus 166 ~~~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~ 245 (396) .+++..+|+++||+++++.|. .++..+|+++|||+|+|++|+|.+|||+|+||||+.++++. T Consensus 148 vdpn~~nLvs~VS~Kep~~y~---------------~Gk~~~I~aiDcG~K~N~IRcL~~RGa~vtVvPw~~~i~~~--- 209 (1435) T KOG0370 148 VDPNKRNLVSQVSTKEPKVYG---------------DGKSLRILAIDCGLKYNQIRCLVKRGAEVTVVPWDYPIAKE--- 209 (1435) T ss_pred CCCCCCCCHHHHEECCCEEEC---------------CCCCCEEEECCCCCHHHHHHHHHHHCCEEEEECCCCCCCCC--- T ss_conf 287765420011122635870---------------78641799735674189999999849569990377622012--- Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEEECC Q ss_conf 57889985888884101121101222102578634786100588787528544667767521011234778970999614 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEIVSM 325 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTsq 325 (396) +|||||||||||||+.+.. ....++++++++.||||||||||++|+|.|++|+||||||||+|||+.+..+|+|+|||| T Consensus 210 ~yDGlflSNGPGdPe~~~~-~v~~vr~lL~~~~PvfGIClGHQllA~AaGakT~KmKyGNRGhNiP~~~~~tGrc~ITSQ 288 (1435) T KOG0370 210 EYDGLFLSNGPGDPELCPL-LVQNVRELLESNVPVFGICLGHQLLALAAGAKTYKMKYGNRGHNIPCTCRATGRCFITSQ 288 (1435) T ss_pred CCCEEEEECCCCCCHHHHH-HHHHHHHHHHCCCCEEEEEHHHHHHHHHHCCCEEEEECCCCCCCCCCEECCCCEEEEEEC T ss_conf 5561797279999056589-999999998279985897366699998608744886315567774622003765999815 Q ss_pred CCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 5761570343699828999988998087268679977898348788776700168999999999973578 Q gi|254780420|r 326 NHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRKTM 395 (396) Q Consensus 326 NHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k~~ 395 (396) ||||+||.++||.+|+++|.|+||++.|||.|..+|+|+||||||++|||+|+.|+||.|++++.+.++. T Consensus 289 NHGYAVD~~tLp~gWk~lFvN~NDgSNEGI~Hss~P~fSvQFHPEat~GP~DTeyLFDiFi~lvkk~kst 358 (1435) T KOG0370 289 NHGYAVDPATLPAGWKPLFVNANDGSNEGIMHSSKPFFSVQFHPEATPGPHDTEYLFDVFIELVKKSKST 358 (1435) T ss_pred CCCEEECCCCCCCCCCHHEEECCCCCCCEEECCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 7761534532457770320413567773376279872116307867899751399999999998887417 No 7 >pfam00988 CPSase_sm_chain Carbamoyl-phosphate synthase small chain, CPSase domain. The carbamoyl-phosphate synthase domain is in the amino terminus of protein. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosynthesis of arginine and/or pyrimidines. The carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes is a heterodimer of a small and large chain. The small chain promotes the hydrolysis of glutamine to ammonia, which is used by the large chain to synthesize carbamoyl phosphate. See pfam00289. The small chain has a GATase domain in the carboxyl terminus. See pfam00117. Probab=100.00 E-value=0 Score=433.86 Aligned_cols=130 Identities=54% Similarity=0.843 Sum_probs=127.6 Q ss_pred CEEEEECCCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHCCCCCCCCCEEEE Q ss_conf 66899728987999975676248888985378888760141614256173216897368458823420465333210236 Q gi|254780420|r 13 TAVLVLADGSVIEGMGCGATGSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVGVNSEDFESISRKNFKGAVG 92 (396) Q Consensus 13 ~A~L~LedGt~f~G~~fG~~~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyGvn~~d~ES~~~~~~~~~~g 92 (396) +|+|+||||++|+|++||+++.+.||||||||||||||+||||||+|||||||||||||||+|.+|+|| .++|++| T Consensus 2 ka~L~LedG~~f~G~~fG~~~~~~GEvvFnT~mtGYqE~lTDPSY~gQIl~fT~P~IGNyGin~~d~ES----~~~~~~G 77 (131) T pfam00988 2 KAYLVLEDGTVFEGESFGAEGEAVGEVVFNTGMTGYQEILTDPSYAGQIVVFTYPLIGNYGINPEDFES----DKPHAKG 77 (131) T ss_pred CEEEEECCCCEEEEEECCCCCCEEEEEEEECCCCCCCEECCCCCCCCEEEEECCCCCCCCCCCCCCCCC----CCCCCCE T ss_conf 379995799999879737776577799997887654403258765774899746640441779743200----4764057 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEE Q ss_conf 664046544345524179899997427832112576899999985398423663 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIA 146 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~ 146 (396) +|||+++..|||||++.+|++||++++||||+|||||+|||+||++|+|+|+|+ T Consensus 78 ~iv~~~~~~~S~~~s~~sL~~~L~~~~IpgI~gIDTRaLt~~iR~~G~~~~~I~ 131 (131) T pfam00988 78 LIVRELSDIPSNWRSEESLDEFLKEHGIPGISGVDTRALTRKLREKGAMKGVIS 131 (131) T ss_pred EEECCCCCCCCCHHHHCCHHHHHHHCCCCEEECCCHHHHHHHHHHCCCCCEEEC T ss_conf 985356657665323359999999779722706618999999986698516849 No 8 >cd01744 GATase1_CPSase Small chain of the glutamine-dependent form of carbamoyl phosphate synthase, CPSase II. This group of sequences represents the small chain of the glutamine-dependent form of carbamoyl phosphate synthase, CPSase II. CPSase II catalyzes the production of carbomyl phosphate (CP) from bicarbonate, glutamine and two molecules of MgATP. The reaction is believed to proceed by a series of four biochemical reactions involving a minimum of three discrete highly reactive intermediates. The synthesis of CP is critical for the initiation of two separate biosynthetic pathways. In one CP is coupled to aspartate, its carbon and nitrogen nuclei ultimately incorporated into the aromatic moieties of pyrimidine nucleotides. In the second pathway CP is condensed with ornithine at the start of the urea cycle and is utilized for the detoxification of ammonia and biosynthesis of arginine. CPSases may be encoded by one or by several genes, depending on the species. The E.coli enzyme is Probab=100.00 E-value=0 Score=422.11 Aligned_cols=178 Identities=54% Similarity=0.958 Sum_probs=170.7 Q ss_pred EEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHH Q ss_conf 99996796410232222202454133046574464413578899858888841011211012221025786347861005 Q gi|254780420|r 208 VVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGH 287 (396) Q Consensus 208 IvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGh 287 (396) |+++|||.+|||+|+|+++|++++|+|+++++++|.+++||||+||+|||+|...+. ..+.++++++.++||||||||| T Consensus 1 i~ilDfGsqy~i~r~lrelg~~~~v~~~~~~~~~i~~~~p~giIlS~GPg~p~~~~~-~~~~~~~~~~~~iPILGIClG~ 79 (178) T cd01744 1 VVVIDFGVKHNILRELLKRGCEVTVVPYNTDAEEILKLDPDGIFLSNGPGDPALLDE-AIKTVRKLLGKKIPIFGICLGH 79 (178) T ss_pred CEEEECCCHHHHHHHHHHCCCEEEEEECCCCHHHHHHCCCCEEEECCCCCCHHHHCC-HHHHHHHHHHCCCCEEEECHHH T ss_conf 999988749999999998799799993989999998419997998999999577344-4999999974699889981217 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECC Q ss_conf 88787528544667767521011234778970999614576157034369982899998899808726867997789834 Q gi|254780420|r 288 QLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQY 367 (396) Q Consensus 288 Q~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQf 367 (396) |+||.++||+++|+++||||.||||++..++++|+|+|||+|+|+++++|.+|++++++++|+++||++|+++|+||||| T Consensus 80 Q~ia~~~Gg~v~k~~~g~~G~n~pv~~~~~~~~~~~~~~H~~~v~~~~lp~~~~v~a~s~~d~~i~ai~h~~~~i~GVQF 159 (178) T cd01744 80 QLLALALGAKTYKMKFGHRGSNHPVKDLITGRVYITSQNHGYAVDPDSLPGGLEVTHVNLNDGTVEGIRHKDLPVFSVQF 159 (178) T ss_pred HHHHHHCCCCCCCCCCCCCCCCCCCEECCCCCEEEEECCCCEEEEECCCCCCCEEEEECCCCCCEEEEEECCCCEEEEEE T ss_conf 88998819830015667247871216768883888644755487624589985799964899857999989999999985 Q ss_pred CCCCCCCCCCHHHHHHHHH Q ss_conf 8788776700168999999 Q gi|254780420|r 368 HPESSPGPQDSYYLFSTFI 386 (396) Q Consensus 368 HPEa~PGP~D~~~~F~~F~ 386 (396) ||||+|||+|+++||++|+ T Consensus 160 HPEs~~~~~~~~~if~nFl 178 (178) T cd01744 160 HPEASPGPHDTEYLFDEFL 178 (178) T ss_pred CCCCCCCCCCCHHHHHHHC T ss_conf 9979999977489897239 No 9 >PRK05670 anthranilate synthase component II; Provisional Probab=100.00 E-value=1.3e-42 Score=320.77 Aligned_cols=182 Identities=31% Similarity=0.558 Sum_probs=155.8 Q ss_pred EEEEECC--CCCHHHHHHHHHCCCCEEEECCC-CHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 9999679--64102322222024541330465-74464413578899858888841011211012221025786347861 Q gi|254780420|r 208 VVCIDYG--IRSNLLRVLSSLGCRITIVGAET-SYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 208 IvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~~-~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) |++||++ --+||+|+|+++||+++|+|++. +++++.+.+|||||||||||+|.+... ..++++. +..++|||||| T Consensus 2 iLiiD~~dsfT~nI~~~lr~~g~~v~V~~~d~~~~~~i~~~~pdgiiLS~GPg~P~~~~~-~~~~i~~-~~~~iPiLGIC 79 (192) T PRK05670 2 ILLIDNYDSFTYNLVQYLGELGAEVVVYRNDEITLEEIEALAPDAIVLSPGPGTPAEAGI-SLELIRE-FAGKVPILGVC 79 (192) T ss_pred EEEEECCCCHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHCCCCEEEECCCCCCHHHCCC-HHHHHHH-HCCCCCEEEEE T ss_conf 999968986899999999868996999989989999998509898999999999366055-4999997-34699789984 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCC------EEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEEC Q ss_conf 0058878752854466776752101123477897------0999614576157034369982899998899808726867 Q gi|254780420|r 285 LGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNG------TVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLI 358 (396) Q Consensus 285 lGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~------kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~ 358 (396) ||||+||+++||+++|++.++||.++++....++ .-+.+.++|+++|+..++|++|++++.+ +|+++||++|+ T Consensus 80 lG~Qlia~~~Gg~v~~~~~~~hG~~~~i~~~~~~lf~~~~~~~~v~~~Hs~~v~~~~lp~~~~v~a~s-~d~~i~a~~~~ 158 (192) T PRK05670 80 LGHQAIGEAFGGKVVRAKEIMHGKTSPIEHDGKGIFAGLPNPFTVTRYHSLVVERESLPDCLEVTAWT-DDGEIMGVRHK 158 (192) T ss_pred HHHHHHHHHCCCEEEECCCCCCCEEEEEEECCCCCCCCCCCCCEEEECCEEEEEECCCCCCEEEEEEE-CCCCEEEEEEC T ss_conf 77899998719738888863637048999899987158888746861120799933479967999997-99719999989 Q ss_pred CCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 99778983487887767001689999999999735 Q gi|254780420|r 359 DQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRK 393 (396) Q Consensus 359 ~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k 393 (396) ++|+||||||||+. ++.++..+|++|++++...| T Consensus 159 ~~~~~gvQFHPEs~-~t~~G~~il~nFl~~~~g~~ 192 (192) T PRK05670 159 ELPIYGVQFHPESI-LTEHGHKLLENFLELALGAK 192 (192) T ss_pred CCCEEEEEECCCCC-CCCCHHHHHHHHHHHHCCCC T ss_conf 99989996799889-98176999999999751389 No 10 >PRK00758 GMP synthase subunit A; Validated Probab=100.00 E-value=4.6e-41 Score=309.78 Aligned_cols=174 Identities=27% Similarity=0.440 Sum_probs=141.6 Q ss_pred EEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 4999967964--10232222202454133046574464413578899858888841011211012221025786347861 Q gi|254780420|r 207 HVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 207 kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) ||++||+|-+ |||+|+|+++||+++|+|++++++++++ +|||||||||||.|.. . ...+. +.+.++|||||| T Consensus 1 kIlviD~~dsft~ni~~~l~~~g~~v~v~~~~~~~~~i~~-~~dgiilS~GP~~~~~-~-~~~~~---i~~~~iPiLGIC 74 (184) T PRK00758 1 KILVINNGGQYNHLIHRTLRYLGVDAKIIPNTTPLEEIKA-EPDGIILSGGPEIERA-G-NCEEY---LLELDVPILGIC 74 (184) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHH-CCCEEEECCCCCCCCC-C-CHHHH---HHHCCCCEEEEE T ss_conf 9999989863899999999978992899959898999963-8998998899983332-2-06999---986599889971 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCC------EEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEEC Q ss_conf 0058878752854466776752101123477897------0999614576157034369982899998899808726867 Q gi|254780420|r 285 LGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNG------TVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLI 358 (396) Q Consensus 285 lGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~------kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~ 358 (396) ||||+||.|+||+++|++++++|.+.....-.+. .-+...++|++.|. ++|.++++++.+ +|+.+||++|+ T Consensus 75 lG~Q~l~~~~Gg~v~~~~~~~~G~~~i~~~~~~~lf~~~~~~~~v~~~H~~~v~--~~p~~~~v~a~s-~~~~v~~i~~~ 151 (184) T PRK00758 75 LGHQLIAKAFGGEVGRAEKGEYALVEVEILDEDDILRGLPPEITVWASHADEVK--ELPDGFEILARS-DICEVEAMKHK 151 (184) T ss_pred HHHHHHHHHCCCCCCCCCCEEECCEEEEEECCCCCCCCCCCCCEEEEEEEEEEE--ECCCCEEEEEEC-CCCCEEEEEEC T ss_conf 776665876699330276215433689997388751115886378764005899--658715999984-99988999989 Q ss_pred CCCEEEECCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 99778983487887767001689999999999 Q gi|254780420|r 359 DQPVFSVQYHPESSPGPQDSYYLFSTFIDFMH 390 (396) Q Consensus 359 ~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~ 390 (396) ++|+|||||||||+++|+ +..+|++|+++.. T Consensus 152 ~~~i~gvQFHPEs~~t~~-G~~ll~nFl~icg 182 (184) T PRK00758 152 DKPIYGVQFHPEVSHTEK-GEEIFKNFIKICG 182 (184) T ss_pred CCCEEEEEECCCCCCCCC-HHHHHHHHHHHHC T ss_conf 999999991884799927-8999999999855 No 11 >PRK07649 para-aminobenzoate/anthranilate synthase glutamine amidotransferase component II; Validated Probab=100.00 E-value=7.5e-41 Score=308.25 Aligned_cols=185 Identities=25% Similarity=0.425 Sum_probs=160.1 Q ss_pred EEEEECC--CCCHHHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 9999679--6410232222202454133046-574464413578899858888841011211012221025786347861 Q gi|254780420|r 208 VVCIDYG--IRSNLLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 208 IvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) |++||++ --+||+|+|++.|++++|+|++ .+++++.+++||||+||||||+|.+... ..+.++. +.+++|||||| T Consensus 2 ILlIDn~DsFT~ni~~~l~~~g~~~~v~~~d~~~~~~i~~~~pd~IiLSpGPg~p~~~~~-~~~~~~~-~~~~iPILGIC 79 (195) T PRK07649 2 ILMIDNYDSFTYNLVQYLGELGQELVVKRNDEVTISDIENMKPDFLMISPGPCSPNEAGI-SMEVIRY-FAGKIPIFGVC 79 (195) T ss_pred EEEEECCCCHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHCCCCEEEECCCCCCHHHCCC-CHHHHHH-HCCCCCEEEEC T ss_conf 999958980799999999978997999889968999998419898998899999578476-1467997-52899784302 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCC------EEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEEC Q ss_conf 0058878752854466776752101123477897------0999614576157034369982899998899808726867 Q gi|254780420|r 285 LGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNG------TVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLI 358 (396) Q Consensus 285 lGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~------kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~ 358 (396) ||||+||+++||+++|++.++||.+.++.+-..+ .-+.+.++|++.|+.+++|++|++++.+ +|+.|||++|+ T Consensus 80 lG~Q~ia~~~Gg~v~~~~~~~hG~~~~i~~~~~~lf~g~~~~~~v~~yHs~~v~~~~lp~~~~v~a~s-~~~~i~ai~h~ 158 (195) T PRK07649 80 LGHQSIAQVFGGEVVRAERLMHGKTSLMHHDGKTIFSDIPNPFTATRYHSLIVKKETLPDCFEVTAWT-EEGEIMAIRHK 158 (195) T ss_pred HHHHHHHHHCCCCEEECCCEEECEEEEEEECCCCCCCCCCCCEEEEECCCEEEEECCCCCCEEEEEEC-CCCCEEEEEEC T ss_conf 99999998739728337960764065799889763438888758986161589623578742899994-89978999938 Q ss_pred CCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 99778983487887767001689999999999735788 Q gi|254780420|r 359 DQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRKTMC 396 (396) Q Consensus 359 ~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k~~~ 396 (396) ++|+|||||||||...++ +..||++|++-+.+..+.| T Consensus 159 ~~pi~GVQFHPEs~~T~~-G~~il~NFl~k~~~~~~~~ 195 (195) T PRK07649 159 TLPIEGVQFHPESIMTSH-GKELLQNFIRKYSPSVTSC 195 (195) T ss_pred CCCEEEEEECCCCCCCCC-HHHHHHHHHHHHCCCCCCC T ss_conf 998899986898889977-6899999998613575679 No 12 >pfam00117 GATase Glutamine amidotransferase class-I. Probab=100.00 E-value=2e-40 Score=305.23 Aligned_cols=177 Identities=40% Similarity=0.660 Sum_probs=154.8 Q ss_pred EEEECC--CCCHHHHHHHHHCCCCEEEECCCCHHHHHH-CCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEC Q ss_conf 999679--641023222220245413304657446441-35788998588888410112110122210257863478610 Q gi|254780420|r 209 VCIDYG--IRSNLLRVLSSLGCRITIVGAETSYKDIMS-LQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICL 285 (396) Q Consensus 209 vvIDfG--iK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~-~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGICl 285 (396) ++|||| ..+||+|.|+++|++++|+|++++++++.+ .+|||||||||||+|..... ..++++++++.++||||||| T Consensus 1 liiD~~dsft~~i~~~l~~~g~~~~v~~~~~~~~~~~~~~~~d~iilspGpg~p~~~~~-~~~~i~~~~~~~~PiLGICl 79 (187) T pfam00117 1 LLIDNGDSFTYNLARALRELGVEVEVVPNDTPAEEILELLNPDGIIISPGPGSPGDAGG-AIEAIKELRENKIPILGICL 79 (187) T ss_pred CEEECCCCHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHCCCCEEEECCCCCCHHHCCC-HHHHHHHHHHCCCCEEEEEH T ss_conf 98937863899999999868997999989998699985259998999199861101345-89999999977998999988 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCEECC-----CCEEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCC Q ss_conf 0588787528544667767521011234778-----97099961457615703436998289999889980872686799 Q gi|254780420|r 286 GHQLLGLALGAQTVKMYQGHHGVNHPVKNIS-----NGTVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQ 360 (396) Q Consensus 286 GhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~-----t~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~ 360 (396) |||+|+.++|+++.|++.+|+|.|+++.+-. ..+.....++|+|.|+++++|+++++++.+.+|+++|+++|+++ T Consensus 80 G~Qll~~~~Gg~v~~~~~~~~G~~~~i~~~~~~~~~~~~~~~~~~~Hs~~v~~~~lp~~~~i~a~s~~~~~i~a~~~~~~ 159 (187) T pfam00117 80 GHQLLALAFGGKVIKAKEPHHGKNSPVGHDKGLFYGLPNVFIVRRYHSYAVDPDTLPDGLEVTAASDNGGEIMAIRHKEN 159 (187) T ss_pred HHHHHHHHCCCCCCCCCEEECCEEEEECCCHHHHCCCCCCCEEEEEEEEEEEECCCCCCEEEEEECCCCCCEEEEEECCC T ss_conf 89999997199864431010570489732116442799773777603489986257886299998688871899998899 Q ss_pred CEEEECCCCCCCCCCCCHHHHHHHHH Q ss_conf 77898348788776700168999999 Q gi|254780420|r 361 PVFSVQYHPESSPGPQDSYYLFSTFI 386 (396) Q Consensus 361 ~~~~vQfHPEa~PGP~D~~~~F~~F~ 386 (396) |+||||||||++.+|++...|++-++ T Consensus 160 ~~~gvQfHPE~~~t~~G~~ll~nfl~ 185 (187) T pfam00117 160 PIFGVQFHPESSLTPGGPELLFNFLI 185 (187) T ss_pred CEEEEEECCCCCCCCCCHHHHHHHHC T ss_conf 89999959977999770489987774 No 13 >CHL00101 trpG anthranilate synthase component 2 Probab=100.00 E-value=6.2e-40 Score=301.80 Aligned_cols=178 Identities=26% Similarity=0.460 Sum_probs=151.1 Q ss_pred EEEEECC--CCCHHHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 9999679--6410232222202454133046-574464413578899858888841011211012221025786347861 Q gi|254780420|r 208 VVCIDYG--IRSNLLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 208 IvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) |++|||| --+||+|+|+++|++++|+|++ .+++++.+++|||||||||||+|.+... ....+ +.+..++|||||| T Consensus 2 ILiiD~~dsft~~i~r~lrelg~~~~v~~~d~~~~~~i~~~~p~gIILS~GPg~p~~~~~-~~~~~-~~~~~~iPILGIC 79 (190) T CHL00101 2 ILIIDNYDSFTYNLVQSLGELNSDLLVCRNDEIDLSKIKNLNPRHIIISPGPGHPRDSGI-SLDVI-SSYAPTIPILGVC 79 (190) T ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHCCCCEEEECCCCCCHHHCCC-CHHHH-HHHHCCCCEEEEC T ss_conf 999978880899999999868998999869989999997079798999799999578576-16899-9971499878973 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCC------EEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEEC Q ss_conf 0058878752854466776752101123477897------0999614576157034369982899998899808726867 Q gi|254780420|r 285 LGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNG------TVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLI 358 (396) Q Consensus 285 lGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~------kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~ 358 (396) ||||+||.++||++.|++.++||....+....++ ..+.++++|++.|+..++|++|++++.+ +|+++||++|+ T Consensus 80 lG~Q~ia~~~Gg~v~~~~~~~~G~~~~i~~~~~~lf~g~~~~~~~~~~hs~~v~~~~lp~~~~v~a~s-~~~~i~ai~h~ 158 (190) T CHL00101 80 LGHQSIGYVYGGKIIKASKPMHGKTSLIYHNHDDLFQGLPNPFIATRYHSLIIDRENLPSCLEITAWT-EDGLIMACRHK 158 (190) T ss_pred HHHHHHHHHCCCEEEECCCEEECCEEEEEECCCCCCCCCCCCCEEEECCEEEEEECCCCCCEEEEEEC-CCCCEEEEEEC T ss_conf 98899988758649978820436247999789875437998848987077999804578987999997-99968999978 Q ss_pred CCC-EEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 997-7898348788776700168999999999 Q gi|254780420|r 359 DQP-VFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 359 ~~~-~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) ++| +||||||||+.-- .++..++++|+++- T Consensus 159 ~~~~i~GVQFHPEs~~T-~~G~~il~NFl~la 189 (190) T CHL00101 159 KYPMLRGIQFHPESILT-EHGQQLLRNFLSLS 189 (190) T ss_pred CCCCEEEEEECCCCCCC-CCHHHHHHHHHHHC T ss_conf 99987999974866999-48699999999860 No 14 >PRK08007 para-aminobenzoate synthase component II; Provisional Probab=100.00 E-value=1e-39 Score=300.30 Aligned_cols=176 Identities=27% Similarity=0.466 Sum_probs=151.9 Q ss_pred EEEEECC--CCCHHHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 9999679--6410232222202454133046-574464413578899858888841011211012221025786347861 Q gi|254780420|r 208 VVCIDYG--IRSNLLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 208 IvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) |++||++ --+||+|+|++.|++++|+|++ .+.+++.+++||+||||||||+|.+... ....++ .+..++|||||| T Consensus 2 ILiIDn~DsFT~ni~~~l~~~g~~v~V~~~d~~~~~~i~~~~p~~IvlSpGPg~P~~~~~-~~~~i~-~~~~~iPiLGIC 79 (187) T PRK08007 2 ILLIDNYDSFTWNLYQYFCELGADVLVKRNDALTLADIDALKPQKIVISPGPCTPDEAGI-SLDVIR-HYAGRLPILGVC 79 (187) T ss_pred EEEEECCCCHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHCCCCEEEECCCCCCHHHCCC-CHHHHH-HHCCCCCEEEEC T ss_conf 999968981899999999877997999938989999998429897999999999589084-046899-753899889987 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCC------EEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEEC Q ss_conf 0058878752854466776752101123477897------0999614576157034369982899998899808726867 Q gi|254780420|r 285 LGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNG------TVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLI 358 (396) Q Consensus 285 lGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~------kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~ 358 (396) ||||+||.++||++++++.++||.++++.+..++ .-+..+++|++.|+..++|+++++++.+ +|+.+||++|+ T Consensus 80 lG~Q~ia~~~Gg~v~~~~~~~hG~~~~i~~~~~~lf~~~~~~~~v~~~hs~~v~~~~lp~~~~v~a~~-~~~~v~ai~~~ 158 (187) T PRK08007 80 LGHQAMAQAFGGKVVRAAKVMHGKTSPITHNGEGVFRGLANPLTVTRYHSLVVEPDSLPECFEVTAWS-ETREIMGIRHR 158 (187) T ss_pred HHHHHHHHHCCCEEEECCCCCCCCCEEEEECCCCCCCCCCCCEEEEEEEEEEEECCCCCCCEEEEEEC-CCCCEEEEEEC T ss_conf 99999999809807347832015206999899862678998737897557888647689815999992-99868999989 Q ss_pred CCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 99778983487887767001689999999 Q gi|254780420|r 359 DQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 359 ~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) ++|+|||||||||.--+ ++..++++|+. T Consensus 159 ~~~~~GVQFHPEs~~T~-~G~~il~NFl~ 186 (187) T PRK08007 159 QWDLEGVQFHPESILSE-QGHQLLANFLH 186 (187) T ss_pred CCCEEEEEECCCCCCCC-CHHHHHHHHHC T ss_conf 99989999589779996-77899999857 No 15 >cd01742 GATase1_GMP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in GMP synthetase. Type 1 glutamine amidotransferase (GATase1) domain found in GMP synthetase. GMP synthetase is a glutamine amidotransferase from the de novo purine biosynthetic pathway. Glutamine amidotransferase (GATase) activity catalyse the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. GMP synthetase catalyses the amination of the nucleotide precursor xanthosine 5'-monophospahte to form GMP. GMP synthetase belongs to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=100.00 E-value=1.7e-39 Score=298.72 Aligned_cols=172 Identities=23% Similarity=0.435 Sum_probs=145.0 Q ss_pred EEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEC Q ss_conf 999967964--102322222024541330465744644135788998588888410112110122210257863478610 Q gi|254780420|r 208 VVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICL 285 (396) Q Consensus 208 IvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGICl 285 (396) |++||||.+ +||+|.|+++|++++|+|++++++++..++|||||||||||+|.+.. ...+.+++++.++||||||| T Consensus 1 I~iiD~g~~~~~si~~~l~~~G~~~~vv~~~~~~~~~~~~~~dgvils~GP~~~~~~~--~~~~~~~i~~~~~PilGICl 78 (181) T cd01742 1 ILILDFGSQYTHLIARRVRELGVYSEILPNTTPLEEIKLKNPKGIILSGGPSSVYEED--APRVDPEIFELGVPVLGICY 78 (181) T ss_pred CEEEECCCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCC--CCHHHHHHHHCCCCEEEEHH T ss_conf 7999998638999999999779949999699998999734989899899999855579--83330999847999999859 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEE-------EEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEEC Q ss_conf 05887875285446677675210112347789709-------99614576157034369982899998899808726867 Q gi|254780420|r 286 GHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTV-------EIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLI 358 (396) Q Consensus 286 GhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv-------~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~ 358 (396) |||+||+|+||+++|++.+++|.+..... ..+.+ ...-++|++.|+ ++|+++++++.+. |+.+||++|+ T Consensus 79 G~Qll~~a~Gg~v~~~~~~~~G~~~~~~~-~~~~l~~~~~~~~~~~~~H~~~v~--~~p~~~~~~a~s~-~~~v~~~~~~ 154 (181) T cd01742 79 GMQLIAKALGGKVERGDKREYGKAEIEID-DSSPLFEGLPDEQTVWMSHGDEVV--KLPEGFKVIASSD-NCPVAAIANE 154 (181) T ss_pred HHHHHHHHCCCCEEECCCCEECCEEEEEE-CCCCCCCCCCCCEEEEEECCEEEE--ECCCCEEEEEECC-CCEEEEEEEC T ss_conf 99999987597602189855364478981-262100588985678856362998--4587307999959-9409999989 Q ss_pred CCCEEEECCCCCCCCCCCCHHHHHHHHH Q ss_conf 9977898348788776700168999999 Q gi|254780420|r 359 DQPVFSVQYHPESSPGPQDSYYLFSTFI 386 (396) Q Consensus 359 ~~~~~~vQfHPEa~PGP~D~~~~F~~F~ 386 (396) ++|+||||||||++++|+ +..|+++|+ T Consensus 155 ~~~i~gvQfHPEs~~s~~-G~~il~NFl 181 (181) T cd01742 155 EKKIYGVQFHPEVTHTEK-GKEILKNFL 181 (181) T ss_pred CCCEEEEEECCCCCCCCC-HHHHHHHHC T ss_conf 999899995788899847-379998429 No 16 >PRK06774 para-aminobenzoate synthase component II; Provisional Probab=100.00 E-value=2.8e-39 Score=297.22 Aligned_cols=177 Identities=31% Similarity=0.480 Sum_probs=152.1 Q ss_pred EEEEECC--CCCHHHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 9999679--6410232222202454133046-574464413578899858888841011211012221025786347861 Q gi|254780420|r 208 VVCIDYG--IRSNLLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 208 IvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) |++||++ --|||+|+|+++|++|+|+|+| .+++++.+++||+||||||||+|++... ....++. +..++|||||| T Consensus 2 ILiiDn~DsFT~nl~~~l~~~g~~v~V~~~d~~~~~~i~~~~p~~ivLSpGPG~P~~~~~-~~~~i~~-~~~~iPILGIC 79 (191) T PRK06774 2 LLLIDNYDSFTYNLYQYFCELGTEVMVKRNDELQLTDIEQLAPSHVVISPGPCTPNEAGI-SLAVIRH-FADKLPILGVC 79 (191) T ss_pred EEEEECCCCHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHCCCCEEEECCCCCCHHHHCC-HHHHHHH-HCCCCCEEEEH T ss_conf 999969997699999999877995999869988999998539996999999989679073-0689997-44699788616 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCE------EEEECCCCCEEEEHHHCCCCCEEEEEECCCC---EEEEE Q ss_conf 00588787528544667767521011234778970------9996145761570343699828999988998---08726 Q gi|254780420|r 285 LGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGT------VEIVSMNHGFAIDASSLPAGLEETHISLFDS---SNCGF 355 (396) Q Consensus 285 lGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~k------v~iTsqNHg~~v~~~slp~~~~v~~~~~nD~---tiegi 355 (396) ||||+||.++||++.+++.++||.+.++.+..++. -+...+.|++.|+.+++|++|++++++.+|+ .|||+ T Consensus 80 lG~Q~ia~~~Gg~v~~~~~~~hG~~~~i~~~~~~lf~g~~~~~~v~~~Hs~~v~~~~lp~~~~v~a~s~~~~~~~~i~ai 159 (191) T PRK06774 80 LGHQALGQAFGARVVRARQVMHGKTSAICHSGQGVFRGLNQPLTVTRYHSLVIATDSLPGCFELTAWTERGGEMDEIMGI 159 (191) T ss_pred HHHHHHHHHCCCEEEECCCCEECEEEEEEECCCCCCCCCCCCCEEEECCCEEEEECCCCCCEEEEEEECCCCCCCEEEEE T ss_conf 87999865609849289974757036899899876068999868850465078834489987999997999998868888 Q ss_pred EECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 86799778983487887767001689999999 Q gi|254780420|r 356 RLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 356 ~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) +|+++|+|||||||||...++ +..+|++|++ T Consensus 160 ~~~~~~i~GVQFHPEs~~T~~-G~~il~NFl~ 190 (191) T PRK06774 160 RHRTLPLEGVQFHPESILSEQ-GHQLLDNFLK 190 (191) T ss_pred EECCCCEEEEEECCCCCCCCC-HHHHHHHHHC T ss_conf 838999899876898899958-8899999844 No 17 >PRK08857 para-aminobenzoate synthase component II; Provisional Probab=100.00 E-value=5.8e-39 Score=294.98 Aligned_cols=177 Identities=27% Similarity=0.498 Sum_probs=149.0 Q ss_pred EEEEECC--CCCHHHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 9999679--6410232222202454133046-574464413578899858888841011211012221025786347861 Q gi|254780420|r 208 VVCIDYG--IRSNLLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 208 IvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) |++||++ --+||+|+|+++|++++|+|++ .+++++.+++||||+||||||+|.+... ....+ +.+..++|||||| T Consensus 2 iLiiDn~DsfT~ni~~~l~~lg~~v~vv~~d~~~~~~i~~~~p~~IilS~GPg~p~~~~~-~~~~~-~~~~~~iPILGIC 79 (192) T PRK08857 2 LLMIDNYDSFTYNLYQYFCELGAEVKVVRNDEIDIAGIEALNPSHLVISPGPCTPNEAGI-SLQAI-EHFAGKLPILGVC 79 (192) T ss_pred EEEEECCCCHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHCCCCEEEECCCCCCHHHCCC-CHHHH-HHHCCCCCEEEEC T ss_conf 999979997799999999877992899969989999998429795999899999678286-14669-9735799989987 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEE------EEECCCCCEEEEHHHCCCCCEEEEEECC-CC---EEEE Q ss_conf 005887875285446677675210112347789709------9961457615703436998289999889-98---0872 Q gi|254780420|r 285 LGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTV------EIVSMNHGFAIDASSLPAGLEETHISLF-DS---SNCG 354 (396) Q Consensus 285 lGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv------~iTsqNHg~~v~~~slp~~~~v~~~~~n-D~---tieg 354 (396) ||||+||.++||++.|++.++||.++++.+..++-. ....++|++.|+.+++|+++++++...+ |+ .||| T Consensus 80 lG~Q~ia~~~Gg~v~~~~~~~hG~~~~i~~~~~~lf~~~~~~~~v~~~Hs~~v~~~~lp~~~~v~a~t~~~~~~~~~i~a 159 (192) T PRK08857 80 LGHQAIAQVFGGDVVRARQVMHGKTSPIRHTGRSVFKGLNNPLTVTRYHSLVVKNDTLPECFELTAWTELEDGSMDEIMG 159 (192) T ss_pred HHHHHHHHHHCCEEEECCCCEEEEEEEEEECCCEEEECCCCCEEEEECCCEEEECCCCCCCEEEEEEEECCCCCCCCEEE T ss_conf 99999999839829988843251157999899304736998879896255278957489982999997268899770538 Q ss_pred EEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 686799778983487887767001689999999 Q gi|254780420|r 355 FRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 355 i~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) ++|+++|+||||||||+.-- .++..++++|+. T Consensus 160 ~~h~~~~i~GVQFHPEs~~T-~~G~~il~NFl~ 191 (192) T PRK08857 160 FQHKTLPIEAVQFHPESIKT-EQGHQLLANFLA 191 (192) T ss_pred EEECCCCEEEEEECCCCCCC-CCHHHHHHHHHC T ss_conf 88389998999877977998-474899999726 No 18 >cd01743 GATase1_Anthranilate_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in Anthranilate synthase. Type 1 glutamine amidotransferase (GATase1) domain found in Anthranilate synthase (ASase). This group contains proteins similar to para-aminobenzoate (PABA) synthase and ASase. These enzymes catalyze similar reactions and produce similar products, PABA and ortho-aminobenzoate (anthranilate). Each enzyme is composed of non-identical subunits: a glutamine amidotransferase subunit (component II) and a subunit that produces an aminobenzoate products (component I). ASase catalyses the synthesis of anthranilate from chorismate and glutamine and is a tetrameric protein comprising two copies each of components I and II. Component II of ASase belongs to the family of triad GTases which hydrolyze glutamine and transfer nascent ammonia between the active sites. In some bacteria, such as Escherichia coli, component II can be much larger than in other organisms, due to the prese Probab=100.00 E-value=5.5e-39 Score=295.12 Aligned_cols=175 Identities=29% Similarity=0.522 Sum_probs=146.6 Q ss_pred EEEEECC--CCCHHHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 9999679--6410232222202454133046-574464413578899858888841011211012221025786347861 Q gi|254780420|r 208 VVCIDYG--IRSNLLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 208 IvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) |++|||| -.+||+|+|+++|++++|+|++ .+.+++..++|||||||||||+|+.... ... +.+.++.++|||||| T Consensus 1 iLiiD~~dsft~~i~r~lr~~g~~~~v~p~~~~~~~~~~~~~~~giiLS~GPg~p~~~~~-~~~-~~~~l~~~iPiLGIC 78 (184) T cd01743 1 ILLIDNYDSFTYNLVQYLRELGAEVVVVRNDEITLEELELLNPDAIVISPGPGHPEDAGI-SLE-IIRALAGKVPILGVC 78 (184) T ss_pred CEEEECCCCHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHCCCCEEEECCCCCCHHHCCC-HHH-HHHHHHCCCCEEEEC T ss_conf 899937981799999999867995999989999999997329799999999989256146-099-999984699899980 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEE------EEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEEC Q ss_conf 005887875285446677675210112347789709------99614576157034369982899998899808726867 Q gi|254780420|r 285 LGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTV------EIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLI 358 (396) Q Consensus 285 lGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv------~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~ 358 (396) ||||+|+.++||+++|++.++||.++++....++.. .-..+.|+|.++..++|..+.+++. .+|++||+++|+ T Consensus 79 ~G~Qll~~~~Gg~v~~~~~~~~G~~~~i~~~~~~~~~~~~~~~~v~~~Hs~~~~~~~~~~~~~~~a~-~~~~~i~ai~~~ 157 (184) T cd01743 79 LGHQAIAEAFGGKVVRAPEPMHGKTSEIHHDGSGLFKGLPQPFTVGRYHSLVVDPDPLPDLLEVTAS-TEDGVIMALRHR 157 (184) T ss_pred HHHHHHHHHCCCEEEECCCCEECEEEEEEECCCCCEECCCCCCEEEEEEEEEEECCCCCCCCEEEEE-CCCCCEEEEEEC T ss_conf 9999999983984874575140545899977976364278888898861468981357875079998-899849999989 Q ss_pred CCCEEEECCCCCCCCCCCCHHHHHHHHH Q ss_conf 9977898348788776700168999999 Q gi|254780420|r 359 DQPVFSVQYHPESSPGPQDSYYLFSTFI 386 (396) Q Consensus 359 ~~~~~~vQfHPEa~PGP~D~~~~F~~F~ 386 (396) ++|+||||||||+..++ ++..++++|+ T Consensus 158 ~~~i~GvQFHPEs~~t~-~G~~il~nFl 184 (184) T cd01743 158 DLPIYGVQFHPESILTE-YGLRLLENFL 184 (184) T ss_pred CCCEEEEEECCCCCCCC-CHHHHHHHHC T ss_conf 99989998368768982-7389998439 No 19 >PRK13566 anthranilate synthase; Provisional Probab=100.00 E-value=9.2e-37 Score=279.49 Aligned_cols=190 Identities=24% Similarity=0.353 Sum_probs=159.2 Q ss_pred CCCCCCEEEEEECC--CCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCC Q ss_conf 77777149999679--6410232222202454133046574464413578899858888841011211012221025786 Q gi|254780420|r 201 SSDAKYHVVCIDYG--IRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGM 278 (396) Q Consensus 201 ~~~~~~kIvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~i 278 (396) ....+.||++||-= --+|+.++|++.|++|+|++++.+.+.+...+||+|+||||||.|++... ...+.+.+..++ T Consensus 519 ~~~~g~rVLlVDn~DSFvhtLa~YlrqlGAeV~vvR~d~~~~~l~~~~pD~vvlSPGPG~P~d~g~--~~~i~~~~~~~i 596 (724) T PRK13566 519 AVGSGKRVLLVDHEDSFVHTLANYFRQTGAEVTTVRYPVAEEILDRVNPDLVVLSPGPGRPEDFDC--KATIDAALARNL 596 (724) T ss_pred CCCCCCEEEEEECCCHHHHHHHHHHHHHCCEEEEEECCCCHHHHHHHCCCEEEECCCCCCCHHCCC--HHHHHHHHHCCC T ss_conf 678775799984653279999999998198799981898767776318898998799979000558--599999985799 Q ss_pred CEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEE------EE-CCCCCEEEEHHHCCCCCEEEEEECCCCE Q ss_conf 3478610058878752854466776752101123477897099------96-1457615703436998289999889980 Q gi|254780420|r 279 PMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVE------IV-SMNHGFAIDASSLPAGLEETHISLFDSS 351 (396) Q Consensus 279 PilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~------iT-sqNHg~~v~~~slp~~~~v~~~~~nD~t 351 (396) ||||+|||||.|+.||||++..++..-||--.+|++...+-++ +| .--|+..++++++|.+|+||+++ .||. T Consensus 597 PilGVCLGhQaI~eafGg~l~~~~~p~HGK~S~I~~~~~~~~F~glp~~f~v~RYHSL~v~~~~lP~~L~VTA~t-edGv 675 (724) T PRK13566 597 PIFGVCLGLQAIVEAFGGELGQLAYPMHGKPSRIRVRGPGLVFSGLPNEFTVGRYHSLFADPERLPDELLVTAET-EDGV 675 (724) T ss_pred CEEEECCCHHHHHHHHCCEEEECCCCCCCCEEEEEEECCCCCCCCCCCCEEEEEEEEEEECCCCCCCCEEEEEEC-CCCE T ss_conf 879977127699897493885458578898135798489741078999607884137986267799833899985-8982 Q ss_pred EEEEEECCCCEEEECCCCCC--CCCCCCHHHHHHHHHHHHHHHC Q ss_conf 87268679977898348788--7767001689999999999735 Q gi|254780420|r 352 NCGFRLIDQPVFSVQYHPES--SPGPQDSYYLFSTFIDFMHKRK 393 (396) Q Consensus 352 iegi~~~~~~~~~vQfHPEa--~PGP~D~~~~F~~F~~~i~~~k 393 (396) |||++|+.+|++|||||||| +-|..-+.-+..+++.+..++. T Consensus 676 iMai~H~~lpi~gVQFHPESIlT~~g~~G~~ii~N~~~~~~~~~ 719 (724) T PRK13566 676 IMAIEHKTLPVAAVQFHPESIMTLGGEVGLRIIENVVRLLARKA 719 (724) T ss_pred EEEEEECCCCCEEEEECCHHHCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 89885178761256107678622887443999999999986654 No 20 >TIGR00888 guaA_Nterm GMP synthase, N-terminal domain; InterPro: IPR004739 The N-terminal region of GMP synthase 6.3.5.2 from EC is recognised by this signature. GMP synthase catalyzes the synthesis of GMP from XMP , .ATP + xanthosine 5'-phosphate + L-glutamine + H(2)O = AMP + diphosphate + GMP + L-glutamate ; GO: 0003922 GMP synthase (glutamine-hydrolyzing) activity, 0005524 ATP binding, 0006177 GMP biosynthetic process. Probab=100.00 E-value=2.7e-37 Score=283.19 Aligned_cols=177 Identities=27% Similarity=0.477 Sum_probs=144.1 Q ss_pred EEEEECCCCCH--HHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEC Q ss_conf 99996796410--2322222024541330465744644135788998588888410112110122210257863478610 Q gi|254780420|r 208 VVCIDYGIRSN--LLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICL 285 (396) Q Consensus 208 IvvIDfGiK~n--Ilr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGICl 285 (396) |+|||||-.|| |-|+|+++|+..+++||++++++|.+.+|.|||||+||. ...+..+.+...++++.++||||||| T Consensus 1 ILvlDFGsQY~hLIaRRlRE~gVysE~~p~t~~~eeI~~~~P~GiILSGGP~--sv~~~n~~~~~~~if~LgVPvLGICY 78 (195) T TIGR00888 1 ILVLDFGSQYNHLIARRLRELGVYSEIVPNTTPLEEIKEKNPKGIILSGGPA--SVYAENAPEADEKIFELGVPVLGICY 78 (195) T ss_pred CEEEECCCCCEEEEEEEEEECCCEEEEEECCCCHHHHHHCCCCEEEEECCCC--CCCCCCCHHHHHHHHHCCCCEEEEEH T ss_conf 9788337651000112201014135784077877998612986799727887--64678850224888627996887624 Q ss_pred HHHHHHHHCCCCCCCCC---CC------------CCCCCCCCEECCCCEEEEECCCCCEEEEHHHCCCCCEEEEEECCCC Q ss_conf 05887875285446677---67------------5210112347789709996145761570343699828999988998 Q gi|254780420|r 286 GHQLLGLALGAQTVKMY---QG------------HHGVNHPVKNISNGTVEIVSMNHGFAIDASSLPAGLEETHISLFDS 350 (396) Q Consensus 286 GhQ~l~~a~G~~~~kl~---~G------------hrG~N~Pv~~~~t~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~ 350 (396) |||+||.+|||++.+-. || ++|.+.-..--..-+||++ |+..|.+ ||++|++.+.+.|.- T Consensus 79 G~Ql~A~~lGG~V~~~~~~EYG~a~l~i~~~~~LF~glp~~LTA~P~~~VWMS---H~D~V~~--lP~gF~vlA~s~~hc 153 (195) T TIGR00888 79 GMQLMAKQLGGEVGRAEKREYGKAELEILDEDDLFEGLPKALTAKPESTVWMS---HGDKVKE--LPEGFKVLATSDNHC 153 (195) T ss_pred HHHHHHHHHCCEEEECCCCCCCEEEEEEECCCHHHHCCCHHHCCCCCCCCCCC---CCCEEEE--CCCCCEEEEECCCCC T ss_conf 57899998298776568887313899984288123078035346888871123---4624200--869976887528988 Q ss_pred EEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 087268679977898348788776700168999999999973 Q gi|254780420|r 351 SNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKR 392 (396) Q Consensus 351 tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~ 392 (396) =+|+|+|.++|+||||||||-.= ...+..|++||+..+++. T Consensus 154 P~aAm~~~~K~~YGVQFHPEV~H-t~~G~~~L~NF~~~~C~~ 194 (195) T TIGR00888 154 PVAAMEHEDKPIYGVQFHPEVTH-TEYGKELLKNFVKDVCGC 194 (195) T ss_pred CCCEEECCCCCEEEEEECCCEEE-CHHHHHHHHHHHHHHCCC T ss_conf 10000048996666651783121-534578999888763158 No 21 >PRK07765 para-aminobenzoate synthase component II; Provisional Probab=100.00 E-value=2.9e-36 Score=275.93 Aligned_cols=180 Identities=27% Similarity=0.456 Sum_probs=146.8 Q ss_pred CEEEEEECC--CCCHHHHHHHHHCCCCEEEECCC-CH--HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCE Q ss_conf 149999679--64102322222024541330465-74--46441357889985888884101121101222102578634 Q gi|254780420|r 206 YHVVCIDYG--IRSNLLRVLSSLGCRITIVGAET-SY--KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPM 280 (396) Q Consensus 206 ~kIvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~~-~~--~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPi 280 (396) .||++||++ --|||+|+|++.|++++|++++. .. .+.+..+||||+||||||+|+.... ..++++.+...++|| T Consensus 1 MrILiIDn~DSFT~ni~~~lr~lg~~v~V~~~d~~~~~~~~~~~~~~dgIILSpGPg~P~~~~~-~~~~i~~~~~~~iPI 79 (221) T PRK07765 1 MRILVVDNYDSFVFNLVQYLGQLGVEAEVWRNDDVRLADEAAVAAGFDGVLLSPGPGTPERAGA-SIDMVRACAAAGTPL 79 (221) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHCCCCEEEECCCCCCHHHCCC-CHHHHHHHHCCCCCE T ss_conf 9799993889339999999987799189998998998899876348998999699998011672-488998753259988 Q ss_pred EEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCE------EEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEE Q ss_conf 786100588787528544667767521011234778970------99961457615703436998289999889980872 Q gi|254780420|r 281 FGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGT------VEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCG 354 (396) Q Consensus 281 lGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~k------v~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tieg 354 (396) ||||||||+|+.++||++.+++...||.-..+....++- -+...+.|+..|+++++|++|++++.+. |+.||+ T Consensus 80 LGIClG~Q~ia~~~Gg~v~~~~~~~hG~~~~i~~~~~~lf~gl~~~~~~~ryHS~~v~~~~lP~~~~v~A~s~-~~~ima 158 (221) T PRK07765 80 LGVCLGHQAIGVAFGATVDRAPELLHGKTSSVHHTNVGVLQGLPDPFTATRYHSLTILPETLPAELEVTARTD-SGVIMA 158 (221) T ss_pred EEEEHHHHHHHHHHCCEEEECCCCCEEEEEEEEECCCCCCCCCCCCEEEEEEEEEEEEECCCCCCCEEEEECC-CCCEEE T ss_conf 9870999999998397898889553656879998998761589997478874218997615899628999849-986865 Q ss_pred EEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 6867997789834878877670016899999999 Q gi|254780420|r 355 FRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 355 i~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) ++|+++|+|||||||||.-- ..+..|+.+|++. T Consensus 159 ~~h~~~~i~GVQFHPEs~~T-~~G~~ll~NFl~i 191 (221) T PRK07765 159 VRHRELPIHGVQFHPESILT-EGGHRMLANWLTY 191 (221) T ss_pred EEECCCCEEEEEECCCCCCC-CCHHHHHHHHHHH T ss_conf 87078978999978853578-1669999999998 No 22 >PRK06895 para-aminobenzoate synthase component II; Provisional Probab=100.00 E-value=3.1e-36 Score=275.78 Aligned_cols=177 Identities=22% Similarity=0.395 Sum_probs=145.6 Q ss_pred CEEEEEECC--CCCHHHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEE Q ss_conf 149999679--6410232222202454133046-5744644135788998588888410112110122210257863478 Q gi|254780420|r 206 YHVVCIDYG--IRSNLLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFG 282 (396) Q Consensus 206 ~kIvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilG 282 (396) .||++||+| --+||.|+|+++|++++|++++ .+.+++. ++|||+||+|||+|..... ...+++++ ..++|||| T Consensus 3 ~kILiiD~~dsfT~ni~~~lr~lg~~~~v~~~~~~~~~~~~--~~~~IIlSpGPg~p~~~~~-~~~~i~~~-~~~~PILG 78 (191) T PRK06895 3 TNLLIINNHDSFTFNLVDLIRKLGVPMKVVNVEDLDLDEVE--NFSHILISPGPDVPRAYPQ-LFAMLERY-YQQKSILG 78 (191) T ss_pred CCEEEEECCCHHHHHHHHHHHHCCCEEEEEECCCCCHHHHH--CCCEEEEECCCCCCCCCCH-HHHHHHHH-CCCCCEEE T ss_conf 86999969884899999999877996999847848998985--5896999089998010522-49999986-06898789 Q ss_pred EECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEE-------EEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEE Q ss_conf 61005887875285446677675210112347789709-------99614576157034369982899998899808726 Q gi|254780420|r 283 ICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTV-------EIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGF 355 (396) Q Consensus 283 IClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv-------~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi 355 (396) ||||||+||.++||++.+++...||...++.....+.+ +-..+.|+++|++.++|.+|++++++ +|+.+||+ T Consensus 79 IClG~Q~i~~~~Gg~v~~~~~~~hG~~~~i~~~~~~~lf~g~~~~~~v~~yHs~~v~~~~lp~~~~v~a~~-~~~~i~a~ 157 (191) T PRK06895 79 VCLGHQTLCEFFGGTLYNLENVRHGQQRTLKVRSNSPLFDGLPEEFNIGLYHSWAVSQENFPTPLEITAVC-DEEVVMAF 157 (191) T ss_pred ECHHHHHHHHHHCCEEEECCCCEEEEEEEEEECCCCCCCCCCCCCEEEEEEEEEEEECCCCCCCEEEEEEC-CCCEEEEE T ss_conf 82999999998295799688745522789999189834658999748766677787605789746999975-99809988 Q ss_pred EECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 867997789834878877670016899999999 Q gi|254780420|r 356 RLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 356 ~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) +|+++|+||||||||+.- ..++..++++|+++ T Consensus 158 ~h~~~~i~GvQFHPEs~~-T~~G~~il~NFl~i 189 (191) T PRK06895 158 QHKHLPIYGVQFHPESYI-SEFGEQILRNWLAI 189 (191) T ss_pred EECCCCEEEEEECCCCCC-CCCHHHHHHHHHCC T ss_conf 848998999982884688-87859999998656 No 23 >COG0512 PabA Anthranilate/para-aminobenzoate synthases component II [Amino acid transport and metabolism / Coenzyme metabolism] Probab=100.00 E-value=3.2e-35 Score=268.65 Aligned_cols=179 Identities=29% Similarity=0.509 Sum_probs=146.5 Q ss_pred CEEEEEECC--CCCHHHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEE Q ss_conf 149999679--6410232222202454133046-5744644135788998588888410112110122210257863478 Q gi|254780420|r 206 YHVVCIDYG--IRSNLLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFG 282 (396) Q Consensus 206 ~kIvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilG 282 (396) .+|++||-= --||++++|.+.|++|+|+.++ .+.+++.+++||+|+||||||.|++... ..++++++ .+++|||| T Consensus 2 ~~IL~IDNyDSFtyNLv~yl~~lg~~v~V~rnd~~~~~~~~~~~pd~iviSPGPG~P~d~G~-~~~~i~~~-~~~~PiLG 79 (191) T COG0512 2 MMILLIDNYDSFTYNLVQYLRELGAEVTVVRNDDISLELIEALKPDAIVISPGPGTPKDAGI-SLELIRRF-AGRIPILG 79 (191) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHCCCCEEEECCCCCCHHHCCH-HHHHHHHH-CCCCCEEE T ss_conf 56999977442499999999973993499988965877886049998998489979377361-89999986-08998899 Q ss_pred EECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEE-------EECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEE Q ss_conf 610058878752854466776752101123477897099-------9614576157034369982899998899808726 Q gi|254780420|r 283 ICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVE-------IVSMNHGFAIDASSLPAGLEETHISLFDSSNCGF 355 (396) Q Consensus 283 IClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~-------iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi 355 (396) ||||||.|+.||||++.+++..-||-..++.+-..+ ++ -..-=|+..|+++++|.+|++|+++..|+.|||+ T Consensus 80 VCLGHQai~~~fGg~V~~a~~~~HGK~s~i~h~g~~-iF~glp~~f~v~RYHSLvv~~~~lP~~l~vtA~~~d~~~IMai 158 (191) T COG0512 80 VCLGHQAIAEAFGGKVVRAKEPMHGKTSIITHDGSG-LFAGLPNPFTVTRYHSLVVDPETLPEELEVTAESEDGGVIMAV 158 (191) T ss_pred ECCCHHHHHHHHCCEEEECCCCCCCEEEEEECCCCC-CCCCCCCCCEEEEEEEEEECCCCCCCCEEEEEEECCCCEEEEE T ss_conf 874378999973978875597767812554029960-0157999888886278984478799833899996899879888 Q ss_pred EECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 867997789834878877670016899999999 Q gi|254780420|r 356 RLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 356 ~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) +|+++|+|||||||||- ....+..+..+|+++ T Consensus 159 ~h~~~pi~gvQFHPESi-lT~~G~~il~Nfl~~ 190 (191) T COG0512 159 RHKKLPIYGVQFHPESI-LTEYGHRILENFLRL 190 (191) T ss_pred EECCCCEEEEECCCCCC-CCCCHHHHHHHHHHH T ss_conf 60899878882087103-662148999999844 No 24 >PRK09522 bifunctional anthranilate synthase/anthranilate phosphoribosyltransferase; Provisional Probab=100.00 E-value=4.1e-34 Score=260.86 Aligned_cols=179 Identities=22% Similarity=0.329 Sum_probs=140.7 Q ss_pred CEEEEEEC--CCCCHHHHHHHHHCCCCEEEECCC----CHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCC Q ss_conf 14999967--964102322222024541330465----744644135788998588888410112110122210257863 Q gi|254780420|r 206 YHVVCIDY--GIRSNLLRVLSSLGCRITIVGAET----SYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMP 279 (396) Q Consensus 206 ~kIvvIDf--GiK~nIlr~L~~~g~~v~Vvp~~~----~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iP 279 (396) .+|++||- =--|||+++|...|++|+|++++. ..+++..++||+|+||||||.|++... ..++++. +.+++| T Consensus 2 ~~ILlIDNyDSFTyNL~~~l~~~g~~v~V~rnd~~~~~~~e~~~~~~p~~IVlSPGPG~P~dag~-~~~ii~~-~~~~iP 79 (531) T PRK09522 2 ADILLLDNIDSFTYNLADQLRSNGHNVVIYRNHIPAQTLIERLATMSNPVLMLSPGPGVPSEAGC-MPELLTR-LRGKLP 79 (531) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHCCCCEEEECCCCCCCHHCCC-HHHHHHH-HCCCCC T ss_conf 70999828886288999999866995699969983130199997249596998999989023657-3999998-659999 Q ss_pred EEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCE------EEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEE Q ss_conf 4786100588787528544667767521011234778970------9996145761570343699828999988998087 Q gi|254780420|r 280 MFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGT------VEIVSMNHGFAIDASSLPAGLEETHISLFDSSNC 353 (396) Q Consensus 280 ilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~k------v~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tie 353 (396) |||||||||+|+.+||+++.+++..-||.-.++.+-.++- -+...+.|++.+ +++|.+|+++..+ |++|| T Consensus 80 ILGICLGhQ~ia~~~Gg~v~~a~~~~HGk~s~i~~~~~~lF~glp~~~~v~rYHSl~~--~~lP~~l~v~A~~--~g~iM 155 (531) T PRK09522 80 IIGICLGHQAIVEAYGGYVGQAGEILHGKASSIEHDGQAMFAGLTNPLPVARYHSLVG--SNIPAGLTINAHF--NGMVM 155 (531) T ss_pred EEEECHHHHHHHHHCCCEEEECCCCCCCEEEEEEECCCCCCCCCCCCCEEEEECEECC--CCCCCCEEEEEEE--CCEEE T ss_conf 8998787789999729889677984323365679898520289999977996443304--4799987999716--99999 Q ss_pred EEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 26867997789834878877670016899999999997 Q gi|254780420|r 354 GFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHK 391 (396) Q Consensus 354 gi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~ 391 (396) |++|+++|+|||||||||- ...++..++.+|+.-... T Consensus 156 ai~h~~~pi~GVQFHPESi-~T~~G~~il~NFLn~~~~ 192 (531) T PRK09522 156 AVRHDADRVCGFQFHPESI-LTTQGARLLEQTLAWAQQ 192 (531) T ss_pred EEEECCCCEEEEEECCCCC-CCCCHHHHHHHHHCCCCC T ss_conf 9998999869982087546-887779999998565434 No 25 >PRK05637 anthranilate synthase component II; Provisional Probab=100.00 E-value=7.4e-34 Score=259.04 Aligned_cols=180 Identities=27% Similarity=0.383 Sum_probs=136.7 Q ss_pred CEEEEEECC--CCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE Q ss_conf 149999679--641023222220245413304657446441357889985888884101121101222102578634786 Q gi|254780420|r 206 YHVVCIDYG--IRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI 283 (396) Q Consensus 206 ~kIvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI 283 (396) .||++||.. --|||+++|...|++++|++++.+.+++.+++||+|+||||||.|++... ..++++.+ .+++||||| T Consensus 2 ~~ILlIDnyDSFT~Nl~~~l~~~g~~v~V~rn~~~~~~~~~~~pd~ivlSPGPg~P~d~g~-~~~~~~~~-~~~iPILGI 79 (208) T PRK05637 2 THVVLIDNHDSFVYNLVDAFAVAGYKCTVFRNTVPVETILAANPDLICLSPGPGYPADAGN-MMALIERT-LGQIPLLGI 79 (208) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCEEEEEECCCCHHHHHHCCCCEEEECCCCCCHHHCCC-HHHHHHHH-HCCCCEEEH T ss_conf 7799994889759999999986799079996999999998519995999999999577757-49999997-356982211 Q ss_pred ECHHHHHHHHCCCCCCCCCCCC----------CCCCCCCEE-CCC----------CE-EEEECCCCCEEEEHHHCCCCCE Q ss_conf 1005887875285446677675----------210112347-789----------70-9996145761570343699828 Q gi|254780420|r 284 CLGHQLLGLALGAQTVKMYQGH----------HGVNHPVKN-ISN----------GT-VEIVSMNHGFAIDASSLPAGLE 341 (396) Q Consensus 284 ClGhQ~l~~a~G~~~~kl~~Gh----------rG~N~Pv~~-~~t----------~k-v~iTsqNHg~~v~~~slp~~~~ 341 (396) |||||+||.++||++.+++--| .|..+|+.. +.+ ++ +. ...-|+.++. .+|.+++ T Consensus 80 CLGhQ~i~~~~Gg~v~~~~~~HG~~s~i~~~~~g~~~~lf~gl~~~~~~~~~~~~~~~~~-v~rYHSl~~~--~~p~~~~ 156 (208) T PRK05637 80 CLGYQALIEYHGGKVEPCGPVHGTTDNMILTDAGVQSPVFAGLATDVEPDHPEIPGRKVP-IGRYHSLGCV--VAPDGIE 156 (208) T ss_pred HHHHHHHHHHCCCEEECCCCCCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCE-EEEECCCEEE--ECCCCCE T ss_conf 477899999709788306776550667898788887700068876554455345677535-7710601671--0799974 Q ss_pred EE--EEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 99--998899808726867997789834878877670016899999999997 Q gi|254780420|r 342 ET--HISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHK 391 (396) Q Consensus 342 v~--~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~ 391 (396) .. ..+..++.+||++|+++|+|||||||||--- .++..|+.+|++.+.. T Consensus 157 ~~~~~~~~~~~viMa~~h~~~p~~GVQFHPESi~T-~~G~~il~nfv~~~~~ 207 (208) T PRK05637 157 SLGTCSSEIGDVIMAARTTDGKAIGLQFHPESVLS-PTGPVILSRCVEQLLA 207 (208) T ss_pred EEEEEECCCCCEEEEEEECCCCEEEEEECCCCCCC-CCHHHHHHHHHHHHHC T ss_conf 88999759999999999789997998669753688-7779999999999961 No 26 >PRK00074 guaA GMP synthase; Reviewed Probab=100.00 E-value=1.4e-32 Score=250.15 Aligned_cols=179 Identities=23% Similarity=0.410 Sum_probs=142.8 Q ss_pred CEEEEEECCCCCH--HHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE Q ss_conf 1499996796410--23222220245413304657446441357889985888884101121101222102578634786 Q gi|254780420|r 206 YHVVCIDYGIRSN--LLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI 283 (396) Q Consensus 206 ~kIvvIDfGiK~n--Ilr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI 283 (396) -+|++||||..|+ |.|+++++|+...++|++++.+++.+.+|.||+||+||....+. .....-..+++.++||||| T Consensus 5 ~~IlIlDFGSQytqLIaRRvRElgVysEI~p~~~~~~~i~~~~pkgIILSGGP~SV~~~--~aP~~d~~if~l~IPILGI 82 (513) T PRK00074 5 DKILILDFGSQYTQLIARRVRELGVYSEIVPYDISAEEIRAFNPKGIILSGGPASVYEE--GAPRADPEIFELGVPVLGI 82 (513) T ss_pred CEEEEEECCCHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHCCCCEEEECCCCCCCCCC--CCCCCCHHHHHCCCCEEEE T ss_conf 86999988964999999999823835999669999899850499789988999636899--9988998998269986997 Q ss_pred ECHHHHHHHHCCCCCCCCC---CCCCCC----CCCCE-ECC-CCEEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEE Q ss_conf 1005887875285446677---675210----11234-778-97099961457615703436998289999889980872 Q gi|254780420|r 284 CLGHQLLGLALGAQTVKMY---QGHHGV----NHPVK-NIS-NGTVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCG 354 (396) Q Consensus 284 ClGhQ~l~~a~G~~~~kl~---~GhrG~----N~Pv~-~~~-t~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tieg 354 (396) |||||+||..+||++.+.+ ||+--. +.|.. ++. +-.||++ ||..|. ++|++|++++.+. |..+++ T Consensus 83 CYG~Qlia~~~GG~V~~s~~rEyG~a~l~i~~~s~Lf~gl~~~~~VWMS---HgD~V~--~lP~gF~viA~s~-n~~iAa 156 (513) T PRK00074 83 CYGMQLMAHQLGGKVERAGKREYGRAELEVDGDSALFKGIPEEQDVWMS---HGDKVT--ELPEGFKVIASTE-NCPIAA 156 (513) T ss_pred CHHHHHHHHHHCCEEEECCCCEECCEEEEEECCCCCCCCCCCCCEEEEE---CCCCEE--ECCCCCEEEEECC-CCCEEE T ss_conf 5889999998599998569601121489991587333689986368875---243014--3799847999549-973678 Q ss_pred EEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 686799778983487887767001689999999999735 Q gi|254780420|r 355 FRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRK 393 (396) Q Consensus 355 i~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k 393 (396) ++|+++++||||||||..- ..++..++.+|+--|.+.+ T Consensus 157 i~~~~~~iyGvQFHPEV~H-T~~G~~iL~NFl~~Ic~~~ 194 (513) T PRK00074 157 IANEERKFYGVQFHPEVTH-TEQGKKLLENFVFDICGCK 194 (513) T ss_pred EEECCCCEEEEECCCEEEE-CCCCHHHHHHHHHHHHCCC T ss_conf 9806786788702863453-1775799999999987889 No 27 >TIGR01815 TrpE-clade3 anthranilate synthase; InterPro: IPR010112 This entry represents a small clade of anthranilate synthases from alpha proteobacteria and Nostoc (a cyanobacterium). This enzyme is the first step in the pathway for the biosynthesis of tryprophan from chorismate.; GO: 0004049 anthranilate synthase activity, 0000162 tryptophan biosynthetic process. Probab=99.98 E-value=2.6e-31 Score=241.06 Aligned_cols=257 Identities=21% Similarity=0.254 Sum_probs=182.9 Q ss_pred CCCHHHHHHHHHHCCC-----CCEEEECCCCCCCCHHHHHHHHHHCCCCCCCCHHHCCCCCCCEECCCCCCCCCCCCCCC Q ss_conf 2576899999985398-----42366327654589789888774158874321011013333211168653445434434 Q gi|254780420|r 125 GIDTRALTVWIRDYGV-----SNSVIAHHPDGQFDLEDLKERAKNWSGLKGAELAKHVTVSQRRDWSEKIWKWGEETSFL 199 (396) Q Consensus 125 GIDTRaLTk~IRe~G~-----~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~ 199 (396) +.||=-.-|-||-+-. .-|.+-++ .+-+.|++...+|+.--.+.-.-++.+-.... .....+-+++. T Consensus 447 ~mnTGL~LRTIR~~dgiAeVRaGATLLyD--S~P~aEeaEt~lKAaA~~~aIR~ak~a~~a~~------~~~~~~~gkyl 518 (726) T TIGR01815 447 DMNTGLTLRTIRLKDGIAEVRAGATLLYD--SDPEAEEAETELKAAAAIEAIRDAKAADAAKE------ESAVAKVGKYL 518 (726) T ss_pred CCCCCHHHHHHHHHCCCEEEEECEEECCC--CCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCC------CHHHHHHCCCC T ss_conf 87521254443242380167604033268--88514467889999999998752202574345------01265520247 Q ss_pred CCCCCCCEEEEEECC--CCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCC Q ss_conf 677777149999679--641023222220245413304657446441357889985888884101121101222102578 Q gi|254780420|r 200 KSSDAKYHVVCIDYG--IRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSG 277 (396) Q Consensus 200 ~~~~~~~kIvvIDfG--iK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~ 277 (396) +.+..+.+|.+||=- ==|.+.++|++-|+.||-++...+-+-+...+||-|+||||||.|++.+ +..++++.+... T Consensus 519 PdGq~G~~iLLvDHeDSFVHTLAnY~RqTGAsVTTlRh~~ae~~fd~~rPDLVVLSPGPGrP~dFd--v~~Ti~aa~ar~ 596 (726) T TIGR01815 519 PDGQEGKRILLVDHEDSFVHTLANYLRQTGASVTTLRHSFAEELFDEERPDLVVLSPGPGRPKDFD--VKETIKAALARD 596 (726) T ss_pred CCCCCCCEEEEEECCCCHHHHHHHHHHHCCCEEEECCCHHHHHHHHCCCCCEEEECCCCCCCCCCC--HHHHHHHHHHCC T ss_conf 886258779998547543777788876348755304304789997327998898687312387544--788999999728 Q ss_pred CCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEEEC--------CCCCEEEEHHHCCCCCEEEEEECCC Q ss_conf 63478610058878752854466776752101123477897099961--------4576157034369982899998899 Q gi|254780420|r 278 MPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEIVS--------MNHGFAIDASSLPAGLEETHISLFD 349 (396) Q Consensus 278 iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTs--------qNHg~~v~~~slp~~~~v~~~~~nD 349 (396) +||||+|||.|-|+-||||+...|.+.-||-.--|+-+.-+.|.+.. -=|+..-+++.||.+|.||+++. | T Consensus 597 lP~FGVCLGLQg~vEafGG~L~vL~~P~HGK~srirVl~p~s~~F~gLpe~~TVGRYHSlfA~~~rLP~el~vTA~s~-D 675 (726) T TIGR01815 597 LPVFGVCLGLQGLVEAFGGELDVLAIPVHGKASRIRVLEPDSVLFAGLPEELTVGRYHSLFARSDRLPKELKVTAESE-D 675 (726) T ss_pred CCEEEEHHHHHHHHHHHCCCCCCCCCCCCCCCCEEEEECCCEEEECCCCCCEEEEEEEEEEECCCCCCCCCEEEEEEC-C T ss_conf 985774134689998746721357888788633688836875787278970343003767556456874237878506-8 Q ss_pred CEEEEEEECCCCEEEECCCCCC--CCCCCCHHHHHHHHHHHHHHH Q ss_conf 8087268679977898348788--776700168999999999973 Q gi|254780420|r 350 SSNCGFRLIDQPVFSVQYHPES--SPGPQDSYYLFSTFIDFMHKR 392 (396) Q Consensus 350 ~tiegi~~~~~~~~~vQfHPEa--~PGP~D~~~~F~~F~~~i~~~ 392 (396) ++||||+|+.+|+-+||||||| +=+-.=+-.|.+|.++-.-++ T Consensus 676 g~iMaiEH~~lP~AAVQFHPESImtL~G~aGl~mi~NVv~~la~~ 720 (726) T TIGR01815 676 GVIMAIEHRKLPVAAVQFHPESIMTLDGDAGLAMIENVVDKLAKS 720 (726) T ss_pred CEEEEEECCCCCCCEEECCCCHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 717320247687112511761100056046899999999997331 No 28 >TIGR00566 trpG_papA glutamine amidotransferase of anthranilate synthase or para-aminobenzoate synthase; InterPro: IPR006221 This entry presents the glutamine amidotransferase domain or peptide of the tryptophan-biosynthetic pathway enzyme anthranilate synthase or of the folate biosynthetic pathway enzyme para-aminobenzoate synthase. In at least one case, a single polypeptide from Bacillus subtilis was shown to have both functions. ; GO: 0004049 anthranilate synthase activity, 0008152 metabolic process. Probab=99.97 E-value=1.1e-31 Score=243.84 Aligned_cols=170 Identities=28% Similarity=0.536 Sum_probs=133.3 Q ss_pred EEEEE----CCCCCHHHHHHHHHC------CCCEEEECC-CCHHHHHHCCC-----CEEEECCCCCCCCCCCCC---CHH Q ss_conf 99996----796410232222202------454133046-57446441357-----889985888884101121---101 Q gi|254780420|r 208 VVCID----YGIRSNLLRVLSSLG------CRITIVGAE-TSYKDIMSLQP-----DGVLLSNGPGDPAVTSVY---SSP 268 (396) Q Consensus 208 IvvID----fGiK~nIlr~L~~~g------~~v~Vvp~~-~~~~~i~~~~p-----dgi~lSnGPGdP~~~~~~---~~~ 268 (396) |.+|| | -||+++++.+.| .+|.|-+++ ..+++|.++.| |+|++||||+.|+..... +.+ T Consensus 2 vL~IDnYDSF--TYNlv~~~~~~gdisPGv~~V~Vkrnd~~~~~~ie~L~P~~~lfdaivisPGPC~P~eAa~~~Gii~~ 79 (212) T TIGR00566 2 VLLIDNYDSF--TYNLVQYFEELGDISPGVAEVVVKRNDSLTLQEIEALLPLLPLFDAIVISPGPCTPNEAAISMGIILE 79 (212) T ss_pred EEEEECCCHH--HHHHHHHHHHCCCCCCCCEEEEEECCCCCCHHHHHHHCCCCCCCCEEEECCCCCCCCHHHCCHHHHHH T ss_conf 2456567423--78999986421776736105777206731034587631135633456765865683212204117999 Q ss_pred HHHHHCCCCC---CEEEEECHHHHHHHHCCCCCCCC---CCC------CC------CC-C-CCCEECCCCEEEEECCCCC Q ss_conf 2221025786---34786100588787528544667---767------52------10-1-1234778970999614576 Q gi|254780420|r 269 IICKLVDSGM---PMFGICLGHQLLGLALGAQTVKM---YQG------HH------GV-N-HPVKNISNGTVEIVSMNHG 328 (396) Q Consensus 269 ~~~~~~~~~i---PilGIClGhQ~l~~a~G~~~~kl---~~G------hr------G~-N-~Pv~~~~t~kv~iTsqNHg 328 (396) +++. +++|+ ||||||||||.||+|+||++.++ ++| |. |. | .|-+-+...| | |+ T Consensus 80 ~i~h-~aGkldevPILGvCLG~QA~~qA~Gg~V~~a~~~~HGk~s~i~hng~~~f~GL~~GyPP~~L~~TR-Y-----HS 152 (212) T TIGR00566 80 AIRH-FAGKLDEVPILGVCLGHQALAQAFGGDVVRANTVMHGKTSEIEHNGAGIFRGLFNGYPPDTLTATR-Y-----HS 152 (212) T ss_pred HHHH-HCCCCCCCCCCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCEEECCCEEECCCCCCCCCCCCCCCC-E-----EE T ss_conf 9874-237622676020105568788742872254211368730012477710124655776866443575-0-----55 Q ss_pred EEEEHHHCCCCCEEEEEECCC-CEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 157034369982899998899-80872686799778983487887767001689999999 Q gi|254780420|r 329 FAIDASSLPAGLEETHISLFD-SSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 329 ~~v~~~slp~~~~v~~~~~nD-~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) ..|.+++||..+.+|.++..+ ..|||+|||++|++||||||||.=- .-+..|.++|++ T Consensus 153 L~V~p~~l~~~~~~~~~~~~~G~eiM~irh~~~p~eGVQFHPESilS-e~G~~ll~Nfl~ 211 (212) T TIGR00566 153 LVVEPETLPTCFQVTALEEENGIEIMAIRHRDLPLEGVQFHPESILS-ELGHQLLANFLH 211 (212) T ss_pred EEECCCCCCCCCCEEEEECCCCCEEEHHHHCCCCCCCEEECCHHHHH-HHHHHHHHHHHC T ss_conf 66744577622010100016870131013347850220107135534-227899998723 No 29 >KOG0026 consensus Probab=99.94 E-value=7.3e-27 Score=209.82 Aligned_cols=183 Identities=28% Similarity=0.480 Sum_probs=144.5 Q ss_pred CCEEEEEEC--CCCCHHHHHH-HHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCE Q ss_conf 714999967--9641023222-2202454133046-57446441357889985888884101121101222102578634 Q gi|254780420|r 205 KYHVVCIDY--GIRSNLLRVL-SSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPM 280 (396) Q Consensus 205 ~~kIvvIDf--GiK~nIlr~L-~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPi 280 (396) ...|++||= --.||...+| .+.||.+.|+++| .+.+|+.+++|++++||+|||.|++.. ...+.+.+ +.+++|+ T Consensus 18 n~piv~IDNYDSFT~Nv~qYL~~e~g~~~~VyRNDeiTV~El~~~NP~~LliSPGPG~P~DsG-Is~~~i~~-f~~~iP~ 95 (223) T KOG0026 18 NGPIIVIDNYDSFTYNLCQYLMGELGCHFEVYRNDELTVEELKRKNPRGLLISPGPGTPQDSG-ISLQTVLE-LGPLVPL 95 (223) T ss_pred CCCEEEEECCCCHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHCCCCCEEECCCCCCCCCCC-CHHHHHHH-HCCCCCE T ss_conf 687899845551348999986312587179984673117888625988078569998976454-24899998-5787855 Q ss_pred EEEECHHHHHHHHCCCCCCCCCCCC-CCCCCCCEECCCCEEE---------EECCCCCEEEEHHHCC-CCCEEEEEECCC Q ss_conf 7861005887875285446677675-2101123477897099---------9614576157034369-982899998899 Q gi|254780420|r 281 FGICLGHQLLGLALGAQTVKMYQGH-HGVNHPVKNISNGTVE---------IVSMNHGFAIDASSLP-AGLEETHISLFD 349 (396) Q Consensus 281 lGIClGhQ~l~~a~G~~~~kl~~Gh-rG~N~Pv~~~~t~kv~---------iTsqNHg~~v~~~slp-~~~~v~~~~~nD 349 (396) ||||+|.|+|..+||+++....|+- ||-..||.+-..|.+- +...-|+.+...+||| ..|+||++.. | T Consensus 96 fGvCMGlQCi~e~fGGkv~~a~~~i~HGK~S~i~~D~~~~~G~f~g~~q~~~V~RYHSLa~~~sSlP~d~L~VTawTE-n 174 (223) T KOG0026 96 FGVCMGLQCIGEAFGGKIVRSPFGVMHGKSSMVHYDEKGEEGLFSGLSNPFIVGRYHSLVIEKDSFPSDELEVTAWTE-D 174 (223) T ss_pred EEEEECHHHHHHHHCCEEECCCCCEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCCCCHHHEEEEEECC-C T ss_conf 656400344456528576305752462453332237754332102799872777654400013568710304566413-8 Q ss_pred CEEEEEEECCCC-EEEECCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 808726867997-789834878877670016899999999997 Q gi|254780420|r 350 SSNCGFRLIDQP-VFSVQYHPESSPGPQDSYYLFSTFIDFMHK 391 (396) Q Consensus 350 ~tiegi~~~~~~-~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~ 391 (396) |.|||.||+++. +-||||||||-= ..++.-+..+|+.+..+ T Consensus 175 G~iMgaRHkKY~~ieGVQfHPESIl-teeGk~~irNflni~~~ 216 (223) T KOG0026 175 GLVMAARHRKYKHIQGVQFHPESII-TTEGKTIVRNFIKIVEK 216 (223) T ss_pred CEEEEEECCCCEECCCEEECCHHHH-HHHHHHHHHHHHHHCCC T ss_conf 5788741233100232232607764-46658999999875336 No 30 >TIGR01823 PabB-fungal para-aminobenzoate synthase; InterPro: IPR010117 This entry represents the fungal clade of para-aminobenzoate synthase, which acts on chorismate to form para-aminobenzoic acid (PABA), a precursor of folate.. Probab=99.93 E-value=1.7e-25 Score=200.13 Aligned_cols=187 Identities=27% Similarity=0.392 Sum_probs=131.7 Q ss_pred CCCEEEEEECC--CCCHHHHHHHHH--------CCCCEEEECCCCH--HHHHHC--CCCEEEECCCCCCCCC--CCCCCH Q ss_conf 77149999679--641023222220--------2454133046574--464413--5788998588888410--112110 Q gi|254780420|r 204 AKYHVVCIDYG--IRSNLLRVLSSL--------GCRITIVGAETSY--KDIMSL--QPDGVLLSNGPGDPAV--TSVYSS 267 (396) Q Consensus 204 ~~~kIvvIDfG--iK~nIlr~L~~~--------g~~v~Vvp~~~~~--~~i~~~--~pdgi~lSnGPGdP~~--~~~~~~ 267 (396) .+.+|..|||= --||+++.|.+. .+.|++|-+|+.- ++++.+ .+|+||+.||||||.. .+..+. T Consensus 4 ~rl~vL~ID~YDSftyNvv~lle~~~d~~PGvnav~V~tvhsD~~~sm~~l~e~l~~FdAiVvGPGPG~PN~GA~d~gI~ 83 (851) T TIGR01823 4 ARLQVLFIDAYDSFTYNVVALLEKALDVRPGVNAVSVITVHSDKFDSMDDLLELLLLFDAIVVGPGPGDPNVGAKDVGIL 83 (851) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEEEECCCCCCHHHHHHHHHHCCEEEECCCCCCCCCCHHHHHHH T ss_conf 31441578701101689999999874158774468999970687235346888874178589888776776511444688 Q ss_pred HHHHHHCCC---CCCEEEEECHHHHHHHHCCCCCCCCC-CCCCCCCCCCEECCCCEEEE----EC---------CCCCEE Q ss_conf 122210257---86347861005887875285446677-67521011234778970999----61---------457615 Q gi|254780420|r 268 PIICKLVDS---GMPMFGICLGHQLLGLALGAQTVKMY-QGHHGVNHPVKNISNGTVEI----VS---------MNHGFA 330 (396) Q Consensus 268 ~~~~~~~~~---~iPilGIClGhQ~l~~a~G~~~~kl~-~GhrG~N~Pv~~~~t~kv~i----Ts---------qNHg~~ 330 (396) ..+..+-.+ .+||||||||+|.|||+.|+.+.+|+ +..||.=|-| .+.+..| .+ .=|+.= T Consensus 84 ~~~w~l~~Gk~D~~Pv~GICLGFQslcL~qGa~i~rlpn~p~HGqV~~~---~t~~~~if~g~~~GyPdtvk~trYHSLy 160 (851) T TIGR01823 84 REVWELANGKLDEVPVLGICLGFQSLCLYQGADIERLPNLPLHGQVYEV---KTSDAAIFEGVSSGYPDTVKSTRYHSLY 160 (851) T ss_pred HHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHCCCCCCCCEEEEE---EECCHHHHCCCCCCCCCCEEEEEEEEEE T ss_conf 9999732895313551100156899998558823561788657437888---7401155524114788632244557887 Q ss_pred EEHHHCC------------------CCC----EEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCC-HHHHHHHHHH Q ss_conf 7034369------------------982----8999988998087268679977898348788776700-1689999999 Q gi|254780420|r 331 IDASSLP------------------AGL----EETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQD-SYYLFSTFID 387 (396) Q Consensus 331 v~~~slp------------------~~~----~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D-~~~~F~~F~~ 387 (396) +..+++. +++ ..+..+-+|..+|.++|+++|+|||||||||+---.+ +..||.+|++ T Consensus 161 an~~~id~strGvdsGnPqadrfaskdldLL~~~l~ed~e~~i~Ms~~~~k~P~fGv~yHPESc~S~~~a~~kl~~nfl~ 240 (851) T TIGR01823 161 ANKESIDSSTRGVDSGNPQADRFASKDLDLLALILLEDEEDQILMSVKAKKKPIFGVQYHPESCKSEKEASVKLLKNFLK 240 (851) T ss_pred ECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEHHHCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 31788653325556788501233432367899886226888122003105887745650564334554578999999998 Q ss_pred HHHHHC Q ss_conf 999735 Q gi|254780420|r 388 FMHKRK 393 (396) Q Consensus 388 ~i~~~k 393 (396) |-..+- T Consensus 241 la~~~N 246 (851) T TIGR01823 241 LALINN 246 (851) T ss_pred HHHCCC T ss_conf 740167 No 31 >COG0518 GuaA GMP synthase - Glutamine amidotransferase domain [Nucleotide transport and metabolism] Probab=99.91 E-value=3.7e-24 Score=190.78 Aligned_cols=175 Identities=29% Similarity=0.406 Sum_probs=131.2 Q ss_pred CEEEEEECCCCCH--HHHHHHHHC-CCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCC---CCCHHHHHHHCCCCCC Q ss_conf 1499996796410--232222202-454133046574464413578899858888841011---2110122210257863 Q gi|254780420|r 206 YHVVCIDYGIRSN--LLRVLSSLG-CRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTS---VYSSPIICKLVDSGMP 279 (396) Q Consensus 206 ~kIvvIDfGiK~n--Ilr~L~~~g-~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~---~~~~~~~~~~~~~~iP 279 (396) .+|+++|||..++ |-|.++++| ...+++|++.+.+++...+||||+||.||-...... ......+..+....+| T Consensus 2 ~~ilIld~g~q~~~li~r~~re~g~v~~e~~~~~~~~~~~~~~~~~giIlsGgp~sv~~~~~w~~~~~~~i~~~~~p~~p 81 (198) T COG0518 2 RKILILDFGGQYLGLIARRLRELGYVYSEIVPYTGDAEELPLDSPDGIIISGGPMSVYDEDPWLPREKDLIKDAGVPGKP 81 (198) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCCCC T ss_conf 57999957970767999999873982699985788801156668888998589887764433305578999971888887 Q ss_pred EEEEECHHHHHHHHCCCCCCCCC---CCCCCCCCCCEEC------CCC--EEE-EECCCCCEEEEHHHCCCCCEEEEEEC Q ss_conf 47861005887875285446677---6752101123477------897--099-96145761570343699828999988 Q gi|254780420|r 280 MFGICLGHQLLGLALGAQTVKMY---QGHHGVNHPVKNI------SNG--TVE-IVSMNHGFAIDASSLPAGLEETHISL 347 (396) Q Consensus 280 ilGIClGhQ~l~~a~G~~~~kl~---~GhrG~N~Pv~~~------~t~--kv~-iTsqNHg~~v~~~slp~~~~v~~~~~ 347 (396) |||||||||+||.++|+++.+.+ ||. .||.-. ..+ ... ...|.|+..|++ ||.++++++.+. T Consensus 82 vLGIC~G~Ql~A~~lGg~V~~~~~~E~G~----~~v~~~~~~~~l~~gl~~~~~~v~~sH~D~v~~--lP~g~~vlA~s~ 155 (198) T COG0518 82 VLGICLGHQLLAKALGGKVERGPKREIGW----TPVELTEGDDPLFAGLPDLFTTVFMSHGDTVVE--LPEGAVVLASSE 155 (198) T ss_pred EEEECHHHHHHHHHHCCEEECCCCCCCCE----EEEEEECCCCCCCCCCCCCCCCCCCCHHCCCCC--CCCCCEEEECCC T ss_conf 89987618999998498896468876515----999972586500048854337106613074434--899989986589 Q ss_pred CCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 9980872686799778983487887767001689999999999 Q gi|254780420|r 348 FDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMH 390 (396) Q Consensus 348 nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~ 390 (396) + --++++++. +++||||||||.+. ....-++++|..-+. T Consensus 156 ~-cp~qa~~~~-~~~~gvQFHpEv~~--~~~~~~l~nf~~~i~ 194 (198) T COG0518 156 T-CPNQAFRYG-KRAYGVQFHPEVTH--EYGEALLENFAHEIC 194 (198) T ss_pred C-CHHHHEECC-CCEEEEEEEEEEEH--HHHHHHHHHHHHHHC T ss_conf 9-804205667-85799942018738--999999998656640 No 32 >PRK13143 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.90 E-value=6.6e-23 Score=181.96 Aligned_cols=170 Identities=26% Similarity=0.471 Sum_probs=112.5 Q ss_pred CEEEEEECCCCC--HHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCC---CHHHHHHHCCCCCCE Q ss_conf 149999679641--023222220245413304657446441357889985888884101121---101222102578634 Q gi|254780420|r 206 YHVVCIDYGIRS--NLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVY---SSPIICKLVDSGMPM 280 (396) Q Consensus 206 ~kIvvIDfGiK~--nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~---~~~~~~~~~~~~iPi 280 (396) .+|++||||.-+ +|.|.|+++|+++.++ .+.+++. ++|+|+|+ |.|.+...... ..+.+.+.+..++|| T Consensus 2 k~I~Iid~G~~n~~si~~~l~~lg~~~~i~---~~~~~l~--~~d~iILP-GvG~~~~~~~~l~~~~~~i~~~~~~~~Pi 75 (201) T PRK13143 2 KMIVIIDYGLGNLRSVSKGLERAGADVEIT---SDPEEIL--SADGIVLP-GVGAFGDVMENLGPLKETINEAVDDGKPF 75 (201) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCEEEEC---CCHHHHH--HCCEEEEC-CCCCHHHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 889999889719999999999869859992---8999984--37967973-88818899987651579999999869987 Q ss_pred EEEECHHHHHHHH------------CCCCCCCCC----CCCCCCC-------CCC-EECCCC-EEEEECCCCCEEEEHHH Q ss_conf 7861005887875------------285446677----6752101-------123-477897-09996145761570343 Q gi|254780420|r 281 FGICLGHQLLGLA------------LGAQTVKMY----QGHHGVN-------HPV-KNISNG-TVEIVSMNHGFAIDASS 335 (396) Q Consensus 281 lGIClGhQ~l~~a------------~G~~~~kl~----~GhrG~N-------~Pv-~~~~t~-kv~iTsqNHg~~v~~~s 335 (396) ||||||||+|+.. +++++.|++ ..|-|.| +|. .++.++ .+|+. |+|.+++ T Consensus 76 LGIClGmQll~~~s~E~~~~~GLg~~~g~v~~~~~~~~ip~~G~~~i~~~~~~~l~~~i~~~~~~yf~---Hs~~~~~-- 150 (201) T PRK13143 76 LGICLGMQMLFESSEEGGLTKGLGLIPGRVVRFPPGVKVPHMGWNTVNIVKDCPLFEGIKDGEYFYFV---HSYYADP-- 150 (201) T ss_pred EEEEHHHHHHHHHCCCCCCCCCCEEECCEEEECCCCCCCCCCCCEEEEEECCCHHHCCCCCCCEEEEE---EEEEEEE-- T ss_conf 99808986553414026764453686338997365444663442687653268012286655479997---1448982-- Q ss_pred CCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 6998289999889980872686799778983487887767001689999999999 Q gi|254780420|r 336 LPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMH 390 (396) Q Consensus 336 lp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~ 390 (396) .+.+..+...+- +......+++.++||||||||.+ + .++..++++|+++++ T Consensus 151 ~~~~~~~~~t~~--~~~~~a~i~~~ni~GvQFHPEks-~-~~G~~ilknF~~~~k 201 (201) T PRK13143 151 DDPDVIVATTDY--GIEFPAVVCKGNVFGTQFHPEKS-G-ETGLKILENFVELIK 201 (201) T ss_pred CCCCCEEEEECC--CCEEEEEEECCCEEEEECCCCCC-C-CCHHHHHHHHHHHHC T ss_conf 798626899619--98089999389899991898236-7-255999999999869 No 33 >cd01745 GATase1_2 Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. This group contains a subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. GATase activity catalyses the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. Glutamine amidotransferases (GATase) includes the triad family of amidotransferases which have a conserved Cys-His-Glu catalytic triad in the glutaminase active site. In this subgroup this triad is conserved. GATase activity can be found in a range of biosynthetic enzymes, including: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase , anthranilate synthase component II, glutamine-dependent carbamoyl phosphate synthase, cytidine triphosphate synthetase, gamma-glutamyl hydrolase, imidazole glycerol phosphate synthase and, cobyric acid synthase. Glutamine amidotransferase (GATase) domains can occur either as single polypeptides, as in glutamine Probab=99.88 E-value=9.8e-23 Score=180.76 Aligned_cols=149 Identities=29% Similarity=0.463 Sum_probs=115.2 Q ss_pred CCCHHHHHHHHHCCCCEEEECCCCHHHHHH--CCCCEEEECCCCCC--CC---------------CCCCCCHHHHHHHCC Q ss_conf 641023222220245413304657446441--35788998588888--41---------------011211012221025 Q gi|254780420|r 215 IRSNLLRVLSSLGCRITIVGAETSYKDIMS--LQPDGVLLSNGPGD--PA---------------VTSVYSSPIICKLVD 275 (396) Q Consensus 215 iK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~--~~pdgi~lSnGPGd--P~---------------~~~~~~~~~~~~~~~ 275 (396) +..+.++.+.+.|+...++|+..+..++.. .+.||++||+|+ | |. ..+.....+++.+++ T Consensus 20 l~~~y~~~i~~~G~~Pi~iP~~~~~~~~~~~l~~lDGliLtGG~-Di~p~~yg~~~~~~~~~~~~~RD~~E~~l~~~a~~ 98 (189) T cd01745 20 LNQYYVDAVRKAGGLPVLLPPVDDEEDLEQYLELLDGLLLTGGG-DVDPPLYGEEPHPELGPIDPERDAFELALLRAALE 98 (189) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCEEEECCCC-CCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 86999999998799999968999879999999877999972887-78810038877765678886689999999999998 Q ss_pred CCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEE Q ss_conf 78634786100588787528544667767521011234778970999614576157034369982899998899808726 Q gi|254780420|r 276 SGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGF 355 (396) Q Consensus 276 ~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi 355 (396) .++||||||.|+|+|+.++||+.+. ++. +.| .|+++|+ .|++++++++++. |++|||+ T Consensus 99 ~~~PilGICRG~Q~inv~~GGtL~q-------------~i~-----VNS-~H~~~i~--~l~~~l~~~a~~~-dg~iEa~ 156 (189) T cd01745 99 RGKPILGICRGMQLLNVALGGTLYQ-------------DIR-----VNS-LHHQAIK--RLADGLRVEARAP-DGVIEAI 156 (189) T ss_pred HCCCCEEEHHHHHHHHHHHCCEEEE-------------CCC-----CCC-CCHHHHH--CCCCCCEEEEECC-CCCEEEE T ss_conf 0987603505499999995985750-------------340-----576-4100676--0788868999969-9987999 Q ss_pred EECCCC-EEEECCCCCCCCC-CCCHHHHHHHHH Q ss_conf 867997-7898348788776-700168999999 Q gi|254780420|r 356 RLIDQP-VFSVQYHPESSPG-PQDSYYLFSTFI 386 (396) Q Consensus 356 ~~~~~~-~~~vQfHPEa~PG-P~D~~~~F~~F~ 386 (396) +|++.| ++|||||||-.+. ......||+.|+ T Consensus 157 ~~~~~~~~~gvqwHPE~~~~~~~~~~~lF~~fv 189 (189) T cd01745 157 ESPDRPFVLGVQWHPEWLADTDPDSLKLFEAFV 189 (189) T ss_pred EECCCCCEEEEECCCCCCCCCCHHHHHHHHHHC T ss_conf 968999089997488678989857889999869 No 34 >KOG1622 consensus Probab=99.88 E-value=4.1e-22 Score=176.34 Aligned_cols=174 Identities=24% Similarity=0.399 Sum_probs=130.5 Q ss_pred CCEEEEEECCCCCH--HHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEE Q ss_conf 71499996796410--2322222024541330465744644135788998588888410112110122210257863478 Q gi|254780420|r 205 KYHVVCIDYGIRSN--LLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFG 282 (396) Q Consensus 205 ~~kIvvIDfGiK~n--Ilr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilG 282 (396) .-+|+++|||..|+ |-|+++++.+...++|.++++..|.+..|.||+||+||-.-.+ ..+...--.+++.++|+|| T Consensus 16 ~d~i~iLD~GaQY~~~I~RrvRel~v~se~~p~~t~~~~i~~~~~rgiIiSGGP~SVya--~dAP~~dp~if~~~vpvLG 93 (552) T KOG1622 16 FDTILILDFGAQYGKVIDRRVRELNVQSEILPLTTPAKTITEYGPRGIIISGGPNSVYA--EDAPSFDPAIFELGVPVLG 93 (552) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCEEEEEECCCCCCCC--CCCCCCCHHHHCCCCCCEE T ss_conf 76389994441243788989988754331013777156664479429998189873226--7688898367132774302 Q ss_pred EECHHHHHHHHCCCCCCC-CC--CCCCCCCCCCEEC--------CC--CEEEEECCCCCEEEEHHHCCCCCEEEEEECCC Q ss_conf 610058878752854466-77--6752101123477--------89--70999614576157034369982899998899 Q gi|254780420|r 283 ICLGHQLLGLALGAQTVK-MY--QGHHGVNHPVKNI--------SN--GTVEIVSMNHGFAIDASSLPAGLEETHISLFD 349 (396) Q Consensus 283 IClGhQ~l~~a~G~~~~k-l~--~GhrG~N~Pv~~~--------~t--~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD 349 (396) ||||+|+|+.-+|+.+.| |. +|- .|.-|.+. .+ -.||+| ||..++. +|.+|+|-..+.| T Consensus 94 ICYGmQ~i~~~~Gg~V~~~~~RE~G~--~eI~v~~~~~lF~~~~~~~~~~Vllt---Hgdsl~~--v~~g~kv~a~s~n- 165 (552) T KOG1622 94 ICYGMQLINKLNGGTVVKGMVREDGE--DEIEVDDSVDLFSGLHKTEFMTVLLT---HGDSLSK--VPEGFKVVAFSGN- 165 (552) T ss_pred EHHHHHHHHHHHCCCCCCCCCCCCCC--CEEECCCHHHHHHHHCCCCEEEEEEC---CCCCHHH--CCCCCEEEEEECC- T ss_conf 01679999998387502462357787--34775861566543232410366100---5620644--4454246776168- Q ss_pred CEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 8087268679977898348788776700168999999999 Q gi|254780420|r 350 SSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 350 ~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) .-+.|+.|+.+++||+|||||..--| -...++.+|+--+ T Consensus 166 ~~va~i~~e~kkiyglqfhpEV~~t~-~g~~ll~nFl~~v 204 (552) T KOG1622 166 KPVAGILNELKKIYGLQFHPEVTLTP-NGKELLKNFLFDV 204 (552) T ss_pred CCEEEEHHHHHHHHCCCCCCCCCCCC-CHHHHHHHHHHHH T ss_conf 61221101100320678888310367-5668999899987 No 35 >KOG1224 consensus Probab=99.87 E-value=6.5e-22 Score=174.93 Aligned_cols=186 Identities=29% Similarity=0.459 Sum_probs=118.3 Q ss_pred CCCCEEEEEECC--CCCHHHHHHHHHC-CC-CEEEECCCCHHH----HHH-CCCCEEEECCCCCCCCCCCCCCHHHHHHH Q ss_conf 777149999679--6410232222202-45-413304657446----441-35788998588888410112110122210 Q gi|254780420|r 203 DAKYHVVCIDYG--IRSNLLRVLSSLG-CR-ITIVGAETSYKD----IMS-LQPDGVLLSNGPGDPAVTSVYSSPIICKL 273 (396) Q Consensus 203 ~~~~kIvvIDfG--iK~nIlr~L~~~g-~~-v~Vvp~~~~~~~----i~~-~~pdgi~lSnGPGdP~~~~~~~~~~~~~~ 273 (396) ..+.++.+||+= ..+||.|.|.... .- |.++-+++.-+| +.. .-+|+|++++|||.|. +..... ++.++ T Consensus 12 ~~rl~~LlID~YDSyTfNiy~ll~~~~~vp~V~~vh~~~~~~d~~~~l~q~~~FDaIVVgPGPG~P~-~a~d~g-I~~rl 89 (767) T KOG1224 12 LPRLRTLLIDNYDSYTFNIYQLLSTINGVPPVVIVHDEWTWEDAYHYLYQDVAFDAIVVGPGPGSPM-CAADIG-ICLRL 89 (767) T ss_pred HHHEEEEEEECCCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHCCCCCEEEECCCCCCCC-CHHHHH-HHHHH T ss_conf 3120288871245412139999987417996799834323778999886400224489668997987-478878-99999 Q ss_pred C--CCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEE----EC---------CCCCEEEEHHHCCC Q ss_conf 2--578634786100588787528544667767521011234778970999----61---------45761570343699 Q gi|254780420|r 274 V--DSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEI----VS---------MNHGFAIDASSLPA 338 (396) Q Consensus 274 ~--~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~i----Ts---------qNHg~~v~~~slp~ 338 (396) + ...|||||||||||.||++.||.+.-..-..||. |-.+.+++..+ .| .-|+.-|+ ++|. T Consensus 90 ~~~~~~iPilGICLGfQal~l~hGA~v~~~n~p~HGr---vs~i~~~~~~~f~gi~sg~~~~fK~~RYHSL~in--~~pi 164 (767) T KOG1224 90 LLECRDIPILGICLGFQALGLVHGAHVVHANEPVHGR---VSGIEHDGNILFSGIPSGRNSDFKVVRYHSLIIN--SLPI 164 (767) T ss_pred HHHCCCCCEEEEEHHHHHHHHHCCCCEECCCCCCCCE---EEEEEECCCEEECCCCCCCCCCCEEEEEEEEEEC--CCCH T ss_conf 9865788646543214767653166110378764324---6667726847871577788664404776778725--7761 Q ss_pred CC-EEEEEECCCC--EEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 82-8999988998--0872686799778983487887767001689999999999735788 Q gi|254780420|r 339 GL-EETHISLFDS--SNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRKTMC 396 (396) Q Consensus 339 ~~-~v~~~~~nD~--tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k~~~ 396 (396) ++ .+....-.|+ .+|+++|+++|.||+||||||.. ..-+..||.+|+++-..+-..| T Consensus 165 d~l~il~t~~ddng~ilMsi~~~~fPhfG~qyHPES~~-s~~g~~lfkNFl~lt~~~n~~c 224 (767) T KOG1224 165 DLLPILWTIYDDNGHILMSIMHSSFPHFGLQYHPESIA-STYGSQLFKNFLDLTVNYNSRC 224 (767) T ss_pred HHHCCEEEEECCCCEEEEEEECCCCCCCCEEECHHHHH-HHHHHHHHHHHHHHHCCCCCCC T ss_conf 12120137606886289875315788534233707753-1345799987777633675101 No 36 >PRK13141 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.87 E-value=7.4e-21 Score=167.52 Aligned_cols=168 Identities=27% Similarity=0.449 Sum_probs=109.8 Q ss_pred EEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC-----CCHHHHHHHCCCCCC Q ss_conf 4999967964--102322222024541330465744644135788998588888410112-----110122210257863 Q gi|254780420|r 207 HVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV-----YSSPIICKLVDSGMP 279 (396) Q Consensus 207 kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~-----~~~~~~~~~~~~~iP 279 (396) -|++||+|+- .+|.|.|.+.|+++.++. +.+++.+ .|+|+| ||.|....... ...+.+++.+..++| T Consensus 2 mI~Iid~g~gNi~si~~al~~lG~~~~ii~---~~~~l~~--~d~lIl-PGvGsf~~~~~~L~~~~l~~~i~~~~~~~~P 75 (204) T PRK13141 2 MIAIIDYGMGNLHSVSKALERLGAEAIITS---DPEEILA--ADAVIL-PGVGAFPDAMQNLRERGLVEVIKEAVASGKP 75 (204) T ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEC---CHHHHHH--CCEEEE-CCCCCCHHHHHHHHHCCCHHHHHHHHHCCCC T ss_conf 899997897589999999998699899979---9999840--898998-6887505687764653929999999865996 Q ss_pred EEEEECHHHHHHHH------------CCCCCCCCCCC------CCCCCC-------CC-EECCC-CEEEEECCCCCEEEE Q ss_conf 47861005887875------------28544667767------521011-------23-47789-709996145761570 Q gi|254780420|r 280 MFGICLGHQLLGLA------------LGAQTVKMYQG------HHGVNH-------PV-KNISN-GTVEIVSMNHGFAID 332 (396) Q Consensus 280 ilGIClGhQ~l~~a------------~G~~~~kl~~G------hrG~N~-------Pv-~~~~t-~kv~iTsqNHg~~v~ 332 (396) |||||||||+|+.. +.+++.|++.+ |-|.|. |. .++.. ..+|+. |+|.|+ T Consensus 76 iLGIClGmQlL~~~s~E~g~~~GLgl~~g~v~~~~~~~~~~vPhiGw~~i~~~~~~~l~~~~~~~~~~yf~---HSy~v~ 152 (204) T PRK13141 76 LLGICLGMQLLFESSEEFGDTEGLGLLPGKVRRFPPGEGLKVPHMGWNQLEFTRPSPLLAGLPEGAYVYFV---HSYYAD 152 (204) T ss_pred EEEEHHHHHHHHHCCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCEEEEECCCCHHHHCCCCCCCEEEE---ECCEEE T ss_conf 89754989997301310466466543668999778888885651030577611378054167777723786---242783 Q ss_pred HHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 343699828999988998087268679977898348788776700168999999999 Q gi|254780420|r 333 ASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 333 ~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) . .+.. .+...+..+..+.+..++ .+++|+|||||.| | ..+..++++|++|+ T Consensus 153 ~--~~~~-~~~a~~~~g~~i~a~i~~-~ni~G~QFHPEkS-~-~~G~~ilknFl~l~ 203 (204) T PRK13141 153 P--PDEE-VVAATCDYGQKFPAAVAK-DNLFGVQFHPEKS-G-DIGLKILKNFLEMV 203 (204) T ss_pred C--CCCC-CEEEEECCCCEEEEEEEC-CCEEEEECCCCCC-C-HHHHHHHHHHHHHH T ss_conf 3--8975-289995599889999971-9999998898514-8-52799999999974 No 37 >PRK11366 puuD gamma-glutamyl-gamma-aminobutyrate hydrolase; Provisional Probab=99.86 E-value=8e-21 Score=167.29 Aligned_cols=172 Identities=23% Similarity=0.355 Sum_probs=122.6 Q ss_pred CHHHHHHHHHCCCCEEEECC-CC---HHHHHHCCCCEEEECCCCCC--CCC-------------CCCCCHHHHHHHCCCC Q ss_conf 10232222202454133046-57---44644135788998588888--410-------------1121101222102578 Q gi|254780420|r 217 SNLLRVLSSLGCRITIVGAE-TS---YKDIMSLQPDGVLLSNGPGD--PAV-------------TSVYSSPIICKLVDSG 277 (396) Q Consensus 217 ~nIlr~L~~~g~~v~Vvp~~-~~---~~~i~~~~pdgi~lSnGPGd--P~~-------------~~~~~~~~~~~~~~~~ 277 (396) ..=++.+.+.|.-..++|+. .+ +++++. .-||++|++||+| |.- .+.....+++..++.+ T Consensus 29 ~~Y~~ai~~aGgiPvllP~~~~~~~~~~~~l~-~lDGllltGG~~Di~P~~Yge~~~~~~~dp~RD~~E~~L~~~A~~~~ 107 (254) T PRK11366 29 EKYLNAIIHAGGLPIALPHALAEPSLLEQLLP-KLDGIYLPGSPSNVQPHLYGENGDEPDADPGRDLLSMALINAALERR 107 (254) T ss_pred HHHHHHHHHCCCEEEECCCCCCCHHHHHHHHH-HCCEEEECCCCCCCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 99999999829858984888788789999998-78989968998988942368898888888421299999999999859 Q ss_pred CCEEEEECHHHHHHHHCCCCCCC-C---C--CCCCC-----------CCCCCEECC----------CCEEEEECCCCCEE Q ss_conf 63478610058878752854466-7---7--67521-----------011234778----------97099961457615 Q gi|254780420|r 278 MPMFGICLGHQLLGLALGAQTVK-M---Y--QGHHG-----------VNHPVKNIS----------NGTVEIVSMNHGFA 330 (396) Q Consensus 278 iPilGIClGhQ~l~~a~G~~~~k-l---~--~GhrG-----------~N~Pv~~~~----------t~kv~iTsqNHg~~ 330 (396) +||||||.|+|+|..|+||+.+. + + ..||- ..|+|.-.. ..++++.|. |..+ T Consensus 108 ~PiLGICRG~Q~lNVa~GGtL~qdl~~~~~~~~H~~~~~~~~~~~~~~~H~V~i~~~s~L~~i~~~~~~~~VNS~-HhQa 186 (254) T PRK11366 108 IPIFAICRGLQELVVATGGSLHRKLCEQPELLEHREDPELPVEQQYAPSHEVQVEEGGLLSALLPECSNFWVNSL-HGQG 186 (254) T ss_pred CCEEEECHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEEEECCCCCHHHHHCCCCCEEEECH-HHHH T ss_conf 997986035589998738864422003765234456888985433256411898689869997489870587421-3543 Q ss_pred EEHHHCCCCCEEEEEECCCCEEEEEEECCCCE-EEECCCCCCCCCC-CCHHHHHHHHHHHHHHHC Q ss_conf 70343699828999988998087268679977-8983487887767-001689999999999735 Q gi|254780420|r 331 IDASSLPAGLEETHISLFDSSNCGFRLIDQPV-FSVQYHPESSPGP-QDSYYLFSTFIDFMHKRK 393 (396) Q Consensus 331 v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~-~~vQfHPEa~PGP-~D~~~~F~~F~~~i~~~k 393 (396) |+ .|.++|.|++++. ||+||++++++.++ +|||||||-..-. -....||+.|++-.++++ T Consensus 187 v~--~lg~gl~v~A~a~-DG~IEAie~~~~~f~lGVQWHPE~~~~~~~~~~~LF~aFv~a~r~~~ 248 (254) T PRK11366 187 AK--VVSPRLRVEARSP-DGLVEAVSVINHPFALGVQWHPEWNSSEYALSRILFEGFITACQHHI 248 (254) T ss_pred HH--CCCCCCEEEEEEC-CCCEEEEEECCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 31--3377738999908-99779999579985899970887787678678999999999999999 No 38 >cd01748 GATase1_IGP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). IGPS incorporates ammonia derived from glutamine into N1-[(5'-phosphoribulosyl)-formimino]-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to form 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR) and imidazole glycerol phosphate (IGP). The glutamine amidotransferase domain generates the ammonia nucleophile which is channeled from the glutaminase active site to the PRFAR active site. IGPS belong to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=99.86 E-value=3e-21 Score=170.29 Aligned_cols=164 Identities=25% Similarity=0.416 Sum_probs=106.0 Q ss_pred EEEEECCCCC--HHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC-----CCHHHHHHHCCCCCCE Q ss_conf 9999679641--02322222024541330465744644135788998588888410112-----1101222102578634 Q gi|254780420|r 208 VVCIDYGIRS--NLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV-----YSSPIICKLVDSGMPM 280 (396) Q Consensus 208 IvvIDfGiK~--nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~-----~~~~~~~~~~~~~iPi 280 (396) |++||||.-+ +|.|.|++.|+++.++. +.+++. +.|+|+| ||+|....... ...+.+++.++.++|| T Consensus 1 I~IiD~G~~ni~si~~~l~~lg~~~~ii~---~~~~l~--~~d~iIl-PGvG~f~~~~~~l~~~~~~~~i~~~~~~~~Pi 74 (198) T cd01748 1 IAIIDYGMGNLRSVANALERLGAEVIITS---DPEEIL--SADKLIL-PGVGAFGDAMANLRERGLIEALKEAIASGKPF 74 (198) T ss_pred CEEEECCCCHHHHHHHHHHHCCCCEEEEC---CHHHHH--HCCEEEE-CCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCE T ss_conf 99997697789999999998799299979---999983--0898998-78874034444410038199999998769968 Q ss_pred EEEECHHHHHHHH------------CCCCCCCCCCC------CCCCCC-------CC-EECC-CCEEEEECCCCCEEEEH Q ss_conf 7861005887875------------28544667767------521011-------23-4778-97099961457615703 Q gi|254780420|r 281 FGICLGHQLLGLA------------LGAQTVKMYQG------HHGVNH-------PV-KNIS-NGTVEIVSMNHGFAIDA 333 (396) Q Consensus 281 lGIClGhQ~l~~a------------~G~~~~kl~~G------hrG~N~-------Pv-~~~~-t~kv~iTsqNHg~~v~~ 333 (396) ||||||||+|+.. +++++.|++.. |-|.|. |. .++. ..++|+ +|+|.|.. T Consensus 75 LGIClG~QlL~~~s~E~~~~~gLg~~~g~V~~~~~~~~~~iP~iGwn~i~~~~~~~~~~~~~~~~~~yF---~HS~~v~~ 151 (198) T cd01748 75 LGICLGMQLLFESSEEGGGTKGLGLIPGKVVRFPASEGLKVPHMGWNQLEITKESPLFKGIPDGSYFYF---VHSYYAPP 151 (198) T ss_pred EEEEEEEEEEEEEEECCCCCCCCCCCCEEEEECCCCCCCCCCCCCEEEEECCCCCCHHCCCCCCCEEEE---EECCEEEC T ss_conf 999811454666765068635536554099988777788664112246750336803207776753588---62417752 Q ss_pred HHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHH Q ss_conf 43699828999988998087268679977898348788776700168999999 Q gi|254780420|r 334 SSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFI 386 (396) Q Consensus 334 ~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~ 386 (396) +....+....-.+..+... +++.+++|||||||.| |. .+..|+++|+ T Consensus 152 ---~~~~~~~a~~~~~~~~~a~-i~~~ni~GvQFHPEkS-~~-~G~~ll~nFl 198 (198) T cd01748 152 ---DDPDYILATTDYGGKFPAA-VEKDNIFGTQFHPEKS-GK-AGLKLLKNFL 198 (198) T ss_pred ---CCCCEEEEEECCCCEEEEE-EECCCEEEEECCCCCC-CH-HHHHHHHHHC T ss_conf ---8975068897599839999-9879999997898524-76-0899999539 No 39 >PRK13175 consensus Probab=99.86 E-value=2.2e-20 Score=164.21 Aligned_cols=171 Identities=27% Similarity=0.447 Sum_probs=111.5 Q ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCC--CCC--CC-CCCHHHHHHHCCCCC Q ss_conf 14999967964--102322222024541330465744644135788998588888--410--11-211012221025786 Q gi|254780420|r 206 YHVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGD--PAV--TS-VYSSPIICKLVDSGM 278 (396) Q Consensus 206 ~kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGd--P~~--~~-~~~~~~~~~~~~~~i 278 (396) .+|++||||.- ++|.|.|++.|+++.++. +.+++. +.|+|+|+ |-|. ++. .. ....+.+++.++.++ T Consensus 2 ~kI~IiD~g~gn~~si~~al~~lg~~~~i~~---~~~~i~--~~d~lILP-GvGsf~~~~~~l~~~~l~~~i~~~~~~g~ 75 (206) T PRK13175 2 PKIAVIDYGMGNLHSVCKALERLGAEPILTS---DPADLL--AADALILP-GVGAFDPAMQNLRSRGLIPPIKDAIASGK 75 (206) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEC---CHHHHH--CCCEEEEC-CCCCCHHHHHHHHHCCCHHHHHHHHHCCC T ss_conf 8799997997689999999998799899979---989983--28928968-99852678875553187168999985699 Q ss_pred CEEEEECHHHHHHHH-----------CCCCCCCCC------CCCCCCCC-------CC-EECCC-CEEEEECCCCCEEEE Q ss_conf 347861005887875-----------285446677------67521011-------23-47789-709996145761570 Q gi|254780420|r 279 PMFGICLGHQLLGLA-----------LGAQTVKMY------QGHHGVNH-------PV-KNISN-GTVEIVSMNHGFAID 332 (396) Q Consensus 279 PilGIClGhQ~l~~a-----------~G~~~~kl~------~GhrG~N~-------Pv-~~~~t-~kv~iTsqNHg~~v~ 332 (396) ||||||||||+|+.. +++++.|++ ..|-|.|. |+ +++.. ..+|+. |+|.+. T Consensus 76 PiLGIClGmQlL~~~seeg~~~GLgli~g~V~k~~~~~~~~iPh~Gw~~i~~~~~~~l~~~~~~~~~~yf~---HSy~~~ 152 (206) T PRK13175 76 PFLGICLGLQLLFESSEEGTEPGLGILPGKVQRFPSEPGLRIPHMGWNQLLTQPDCPLWENLPPNPWVYFV---HSYYAV 152 (206) T ss_pred CEEEEEHHHHHHHHCCCCCCCCCCCEECEEEEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCEEEE---ECCEEE T ss_conf 77998678667613253456477635210798626555533761045057851788420112678823897---500784 Q ss_pred HHHCCCCCEEEEEECCC-CEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 34369982899998899-808726867997789834878877670016899999999997 Q gi|254780420|r 333 ASSLPAGLEETHISLFD-SSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHK 391 (396) Q Consensus 333 ~~slp~~~~v~~~~~nD-~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~ 391 (396) +. +..+.+....-.+ .-+.+++ ..+++|+|||||.| |+ .+..|+++|++++++ T Consensus 153 ~~--~~~~~~~~~~~~~~~~~a~v~--~~ni~G~QFHPEkS-~~-~G~~llknFl~~v~k 206 (206) T PRK13175 153 PL--DPSDTAATVTHGSQKVTAAIA--RDNLMAVQFHPEKS-GT-AGLKLLRNFVSQVQK 206 (206) T ss_pred EC--CCCCEEEEEECCCCCEEEEEE--CCCEEEEECCCCCC-CC-CHHHHHHHHHHHHHC T ss_conf 06--876269999608983899997--89999994898246-84-378999999999649 No 40 >cd01741 GATase1_1 Subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. This group contains a subgroup of proteins having the Type 1 glutamine amidotransferase (GATase1) domain. GATase activity catalyses the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. Glutamine amidotransferases (GATase) includes the triad family of amidotransferases which have a conserved Cys-His-Glu catalytic triad in the glutaminase active site. In this subgroup this triad is conserved. GATase activity can be found in a range of biosynthetic enzymes, including: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase , anthranilate synthase component II, glutamine-dependent carbamoyl phosphate synthase, cytidine triphosphate synthetase, gamma-glutamyl hydrolase, imidazole glycerol phosphate synthase and, cobyric acid synthase. Glutamine amidotransferase (GATase) domains can occur either as single polypeptides, as in glutamine Probab=99.86 E-value=7.5e-21 Score=167.48 Aligned_cols=166 Identities=27% Similarity=0.379 Sum_probs=107.3 Q ss_pred EEEEEECCCC---CHHHHHHHHHC---CCCEEEECCCC--HHHHHHCCCCEEEECCCCCCCCCCCC----CCHHHHHHHC Q ss_conf 4999967964---10232222202---45413304657--44644135788998588888410112----1101222102 Q gi|254780420|r 207 HVVCIDYGIR---SNLLRVLSSLG---CRITIVGAETS--YKDIMSLQPDGVLLSNGPGDPAVTSV----YSSPIICKLV 274 (396) Q Consensus 207 kIvvIDfGiK---~nIlr~L~~~g---~~v~Vvp~~~~--~~~i~~~~pdgi~lSnGPGdP~~~~~----~~~~~~~~~~ 274 (396) ||++|.+..- ......|++++ .++.++..... ..++ .++|||++|+||.++..... .....+++++ T Consensus 1 ~i~il~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~--~~~dgvii~Gg~~~~~~~~~pwi~~~~~~i~~~~ 78 (188) T cd01741 1 RILILQHDTPEGPGLFEDLLREAGAETIEIDVVDVYAGELLPDL--DDYDGLVILGGPMSVDEDDYPWLKKLKELIRQAL 78 (188) T ss_pred CEEEEECCCCCCCHHHHHHHHHCCCCCEEEEEEECCCCCCCCCH--HHCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 98999779999974899999966998468999966669899784--4408899969987778678866999999999999 Q ss_pred CCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCE----------EEEECCCCCEEEEHHHCCCCCEEEE Q ss_conf 578634786100588787528544667767521011234778970----------9996145761570343699828999 Q gi|254780420|r 275 DSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGT----------VEIVSMNHGFAIDASSLPAGLEETH 344 (396) Q Consensus 275 ~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~k----------v~iTsqNHg~~v~~~slp~~~~v~~ 344 (396) +.++||||||+|||+||.|+||++.+.+.|....-+++.-...++ -+..-|.|+..|.. ||.+.++++ T Consensus 79 ~~~~PilGIC~G~Qlla~alGg~V~~~~~~~e~G~~~i~lt~~~~~~~~~~~~~~~~~~~~~H~d~v~~--lP~~~~~la 156 (188) T cd01741 79 AAGKPVLGICLGHQLLARALGGKVGRNPKGWEIGWFPVTLTEAGKADPLFAGLPDEFPVFHWHGDTVVE--LPPGAVLLA 156 (188) T ss_pred HCCCCEEEECHHHHHHHHHCCCEEEECCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCHHHHHHEEEEE--CCCCCEEEE T ss_conf 879999998468999999839999878987521579999954766542214799765434021027243--899989999 Q ss_pred EECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHH Q ss_conf 988998087268679977898348788776700168999999 Q gi|254780420|r 345 ISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFI 386 (396) Q Consensus 345 ~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~ 386 (396) .|. +..++++++++ .+|||||||| ..++.+|+ T Consensus 157 ~s~-~~~~qa~~~~~-~~~g~QfHPE--------~~~~~~f~ 188 (188) T cd01741 157 SSE-ACPNQAFRYGD-RALGLQFHPE--------ERLLRNFL 188 (188) T ss_pred ECC-CCCEEEEEECC-EEEEEECCCC--------HHHHHHCC T ss_conf 889-99788999799-5899957612--------76676309 No 41 >pfam07722 Peptidase_C26 Peptidase C26. These peptidases have gamma-glutamyl hydrolase activity; that is they catalyse the cleavage of the gamma-glutamyl bond in poly-gamma-glutamyl substrates. They are structurally related to pfam00117, but contain extensions in four loops and at the C terminus. Probab=99.85 E-value=5.3e-21 Score=168.51 Aligned_cols=149 Identities=26% Similarity=0.414 Sum_probs=107.2 Q ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHH--CCCCEEEECCCCCC--CCC---------------CCCCCHHHHHHHCCCCC Q ss_conf 023222220245413304657446441--35788998588888--410---------------11211012221025786 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYKDIMS--LQPDGVLLSNGPGD--PAV---------------TSVYSSPIICKLVDSGM 278 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~~i~~--~~pdgi~lSnGPGd--P~~---------------~~~~~~~~~~~~~~~~i 278 (396) +-++.+.+.|+.+.++|++.+.+++.. ..-|||+|++|+-| |.. .+.....+++..++.++ T Consensus 28 ~Yv~~i~~aGg~pv~iP~~~~~~~~~~~l~~~dGlll~GG~~di~P~~yg~~~~~~~~~~~~~Rd~~e~~l~~~a~~~~~ 107 (219) T pfam07722 28 SYVKAVEGAGGLPVLLPILEDPEDLEALLDLVDGLLLTGGQSNVDPSLYGEEPSPSGGYYDPARDAFELALIRAALERGK 107 (219) T ss_pred HHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCEEEECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 99999998599999978999989999999767989985988767822146776866787578889999999999987699 Q ss_pred CEEEEECHHHHHHHHCCCCCCCCC---CC---C-------CCCCCCCEECCC---------CEEEEECCCCCEEEEHHHC Q ss_conf 347861005887875285446677---67---5-------210112347789---------7099961457615703436 Q gi|254780420|r 279 PMFGICLGHQLLGLALGAQTVKMY---QG---H-------HGVNHPVKNISN---------GTVEIVSMNHGFAIDASSL 336 (396) Q Consensus 279 PilGIClGhQ~l~~a~G~~~~kl~---~G---h-------rG~N~Pv~~~~t---------~kv~iTsqNHg~~v~~~sl 336 (396) ||||||+|||+|..++|++.++-- .+ | ...+|+|.-... .++. +...|.++|+ .| T Consensus 108 PilGICrG~Qllnv~~GGtL~q~i~~~~~~~~~~~~~~~~~~~~h~v~i~~~s~l~~~~~~~~~~-vnS~Hhq~I~--~l 184 (219) T pfam07722 108 PILGICRGMQLLNVALGGTLYQDIPEHPGNSDHHHQLAVQYAPSHAVSLEPGSLLARLLGAEEIL-VNSLHHQAIK--RL 184 (219) T ss_pred CEEEECHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEECCCCHHHHHHCCCCEE-EECCHHHHHH--CC T ss_conf 87997650899999967830002301257544566887642560389977897399872998457-9745065774--04 Q ss_pred CCCCEEEEEECCCCEEEEEEECCC--CEEEECCCCC Q ss_conf 998289999889980872686799--7789834878 Q gi|254780420|r 337 PAGLEETHISLFDSSNCGFRLIDQ--PVFSVQYHPE 370 (396) Q Consensus 337 p~~~~v~~~~~nD~tiegi~~~~~--~~~~vQfHPE 370 (396) +++|++++++. ||+||++++++. +++||||||| T Consensus 185 ~~~l~v~A~a~-Dg~iEaie~~~~~~f~~GVQwHPE 219 (219) T pfam07722 185 APGLRVEATAP-DGTIEAIESPNAPYFVLGVQWHPE 219 (219) T ss_pred CCCCEEEEEEC-CCCEEEEEECCCCCCEEEEEECCC T ss_conf 89858999948-997899996699986799972889 No 42 >PRK13181 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.84 E-value=9.8e-21 Score=166.65 Aligned_cols=166 Identities=25% Similarity=0.440 Sum_probs=106.4 Q ss_pred EEEEECCCCC--HHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCC----C-CCCHHHHHHHCCCCCCE Q ss_conf 9999679641--023222220245413304657446441357889985888884101----1-21101222102578634 Q gi|254780420|r 208 VVCIDYGIRS--NLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVT----S-VYSSPIICKLVDSGMPM 280 (396) Q Consensus 208 IvvIDfGiK~--nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~----~-~~~~~~~~~~~~~~iPi 280 (396) |++||||.-+ +|.|.|++.|+++.+++ +.+++. +.|+++|+ |.|-.... . ....+.+++.++.++|| T Consensus 2 I~IiD~G~~n~~si~r~l~~lg~~~~i~~---~~~~l~--~~d~lIlP-GvG~~~~~~~~l~~~~~~~~i~~~~~~~~Pi 75 (199) T PRK13181 2 IAIIDYGAGNLRSVANALKRLGVEAVVSS---DPEEIE--GADKVILP-GVGAFGQAMAALRESGLDEAIKEHVEKKQPV 75 (199) T ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEC---CHHHHH--HHHCEECC-CCCCCCHHHHHHHHCCCHHHHHHHHHCCCCE T ss_conf 99998896099999999998698099979---999974--21100457-9865330256665638679999999769988 Q ss_pred EEEECHHHHHHHH-----------CCCCCCCCCCC-----CCCCCC-------CC-EECCC-CEEEEECCCCCEEEEHHH Q ss_conf 7861005887875-----------28544667767-----521011-------23-47789-709996145761570343 Q gi|254780420|r 281 FGICLGHQLLGLA-----------LGAQTVKMYQG-----HHGVNH-------PV-KNISN-GTVEIVSMNHGFAIDASS 335 (396) Q Consensus 281 lGIClGhQ~l~~a-----------~G~~~~kl~~G-----hrG~N~-------Pv-~~~~t-~kv~iTsqNHg~~v~~~s 335 (396) ||||||||+|+.. +++++.|++.+ |-|.|. |. +++.. ..+|+ +|+|.|+.. T Consensus 76 LGIClGmQlL~~~~~eg~~~GLgl~~g~v~~~~~~~~~~phiGw~~i~~~~~~~lf~~~~~~~~~yf---~Hs~~v~~~- 151 (199) T PRK13181 76 LGICLGMQLLCESSEEGNVKGLGLIPGDVKRFRSGKDKVPQMGWNSVKPLKESPLFKGIEENSRFYF---VHSYYVPPE- 151 (199) T ss_pred EEEEEEEEEEEECCCCCCCCCCCEECCEEEECCCCCCCCCCCCCEEEEEECCCHHHCCCCCCCEEEE---EEEEEEECC- T ss_conf 9980426765410003654666662128998277655557777458876216844537655560488---850378617- Q ss_pred CCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 69982899998899808726867997789834878877670016899999999 Q gi|254780420|r 336 LPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 336 lp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) +.+. +....-.+..+. ...+..++||||||||.+ + ..+..++++|++| T Consensus 152 -~~~~-~~a~~~~g~~~~-~~v~~~ni~GvQFHPE~s-~-~~G~~il~nFl~i 199 (199) T PRK13181 152 -DPED-VLATTNYGVPFC-SAVEKDNIYAVQFHPEKS-G-KAGLKLLKNFAEL 199 (199) T ss_pred -CCCC-EEEEECCCCEEE-EEEECCCEEEEECCCCCC-C-HHHHHHHHHHHCC T ss_conf -8763-279963898499-999899999990897817-8-6099999998669 No 43 >COG2071 Predicted glutamine amidotransferases [General function prediction only] Probab=99.79 E-value=1.3e-18 Score=151.68 Aligned_cols=167 Identities=25% Similarity=0.395 Sum_probs=115.6 Q ss_pred HHHHHHHCCCCEEEECCCCH---HHHHHCCCCEEEECCCCCC--C---------C------CCCCCCHHHHHHHCCCCCC Q ss_conf 32222202454133046574---4644135788998588888--4---------1------0112110122210257863 Q gi|254780420|r 220 LRVLSSLGCRITIVGAETSY---KDIMSLQPDGVLLSNGPGD--P---------A------VTSVYSSPIICKLVDSGMP 279 (396) Q Consensus 220 lr~L~~~g~~v~Vvp~~~~~---~~i~~~~pdgi~lSnGPGd--P---------~------~~~~~~~~~~~~~~~~~iP 279 (396) ++...+.|--..++|.-.+. .+++. .-||++||+| .| | + ..+.+...++++.++.++| T Consensus 32 v~ai~~aGg~pillP~~~d~~~~~~~l~-~iDgliltGg-~nV~P~~YGee~~~~~~~~~p~RD~~E~aLi~~ALe~~iP 109 (243) T COG2071 32 VDAIIKAGGIPILLPALEDPEDARQYLD-LIDGLILTGG-SNVDPSLYGEEPSEKDGPYDPERDAFELALIRAALERGIP 109 (243) T ss_pred HHHHHHCCCCEEEECCCCCHHHHHHHHH-HCCEEEECCC-CCCCHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCC T ss_conf 9999976995088548887778999986-5367895289-8689878398888556778843007789999999976998 Q ss_pred EEEEECHHHHHHHHCCCCCCCCC---C---CCCCC------CCCCEECCCC--------E-EEEECCCCCEEEEHHHCCC Q ss_conf 47861005887875285446677---6---75210------1123477897--------0-9996145761570343699 Q gi|254780420|r 280 MFGICLGHQLLGLALGAQTVKMY---Q---GHHGV------NHPVKNISNG--------T-VEIVSMNHGFAIDASSLPA 338 (396) Q Consensus 280 ilGIClGhQ~l~~a~G~~~~kl~---~---GhrG~------N~Pv~~~~t~--------k-v~iTsqNHg~~v~~~slp~ 338 (396) |||||.|.|+|..|+||..++=- - .||-. .|+|.-.... . +.+-|- |.++|+ .|.. T Consensus 110 ILgICRG~QllNVa~GGtL~q~i~~~~~~~~H~~~~~~~~~~H~V~i~~~s~La~i~g~~~~~VNS~-HhQaIk--~La~ 186 (243) T COG2071 110 ILGICRGLQLLNVALGGTLYQDISEQPGHIDHRQPNPVHIESHEVHIEPGSKLAKILGESEFMVNSF-HHQAIK--KLAP 186 (243) T ss_pred EEEECCCHHHHHHHHCCEEEHHHHCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHCCCCCEEECCH-HHHHHH--HHCC T ss_conf 8998401578878756753021230545002569998654315799547841777537652263151-778999--7678 Q ss_pred CCEEEEEECCCCEEEEEEECC-CCEEEECCCCCCCCCCC-CHHHHHHHHHHHHHHH Q ss_conf 828999988998087268679-97789834878877670-0168999999999973 Q gi|254780420|r 339 GLEETHISLFDSSNCGFRLID-QPVFSVQYHPESSPGPQ-DSYYLFSTFIDFMHKR 392 (396) Q Consensus 339 ~~~v~~~~~nD~tiegi~~~~-~~~~~vQfHPEa~PGP~-D~~~~F~~F~~~i~~~ 392 (396) +|+|+++.. ||+||++++++ ..+.|||||||-...-. -...||+.|+.--+++ T Consensus 187 ~L~V~A~a~-DG~VEAie~~~~~fvlGVQWHPE~~~~~~~~~~~LFe~F~~~~~~~ 241 (243) T COG2071 187 GLVVEARAP-DGTVEAVEVKNDAFVLGVQWHPEYLVDTNPLSLALFEAFVNACKKH 241 (243) T ss_pred CCEEEEECC-CCCEEEEEECCCCEEEEEECCHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 867999979-9958989853885079986175552267868899999999998751 No 44 >PRK09065 glutamine amidotransferase; Provisional Probab=99.77 E-value=6.6e-18 Score=146.76 Aligned_cols=162 Identities=21% Similarity=0.265 Sum_probs=100.8 Q ss_pred CEEEEEECCCCCH------------HHHHHHHHCCCCEEEECC--CCHHHHHHCCCCEEEECCCCCCCCCCCC---CCHH Q ss_conf 1499996796410------------232222202454133046--5744644135788998588888410112---1101 Q gi|254780420|r 206 YHVVCIDYGIRSN------------LLRVLSSLGCRITIVGAE--TSYKDIMSLQPDGVLLSNGPGDPAVTSV---YSSP 268 (396) Q Consensus 206 ~kIvvIDfGiK~n------------Ilr~L~~~g~~v~Vvp~~--~~~~~i~~~~pdgi~lSnGPGdP~~~~~---~~~~ 268 (396) +||++|-||--.. +.+.|...+..++++... -.+.++ .++||++++++|-.-.+... ...+ T Consensus 4 ~ki~IL~~g~~~~~~~~~~G~y~~~f~~ll~~~~~~~~~~~v~~ge~~p~~--~~~dg~iitGS~~~v~d~~pWi~~l~~ 81 (238) T PRK09065 4 LPLLIIETGTPPPSIRARYGDFPHWIRVALGLAEQPVVVVDVFAGEPLPAP--EDFAGVLITGSHAMVTDRAPWSERTAD 81 (238) T ss_pred CEEEEEECCCCCHHHHHHCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCH--HHCCEEEEECCCCCCCCCCHHHHHHHH T ss_conf 169999889898799987399899999987236985799967279899895--784889997985205799888999999 Q ss_pred HHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCE----------EEEECCCCCEEEEHHHCCC Q ss_conf 222102578634786100588787528544667767521011234778970----------9996145761570343699 Q gi|254780420|r 269 IICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGT----------VEIVSMNHGFAIDASSLPA 338 (396) Q Consensus 269 ~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~k----------v~iTsqNHg~~v~~~slp~ 338 (396) .++.+++.++||||||+|||+||.|+||++.+-+.|..-.-++|.-...++ .+-.-|.|...|. .||+ T Consensus 82 ~ir~~~~~~~PilGICfGhQllA~AlGG~V~~~p~G~eiG~~~v~lt~~g~~dplf~~~p~~~~~~~~H~D~V~--~lP~ 159 (238) T PRK09065 82 WLRQAAAAGMPLLGICYGHQLLAHALGGEVGYNPAGRESGTVALELHPAAAQDPLFAGLPATFPAHATHLQTVL--RLPP 159 (238) T ss_pred HHHHHHHCCCCEEEEEHHHHHHHHHHCCEEEECCCCCCCCEEEEEECHHCCCCCHHCCCCCCCEEEEEEECEEE--ECCC T ss_conf 99999877999999817789999984987976799763651256875100457111189876179999514289--6499 Q ss_pred CCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCC Q ss_conf 82899998899808726867997789834878877 Q gi|254780420|r 339 GLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSP 373 (396) Q Consensus 339 ~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~P 373 (396) +.++.+.|.. .-+++++..+ .+||||||||-+| T Consensus 160 ga~~LAss~~-c~~qa~~~g~-~~~gvQfHPEft~ 192 (238) T PRK09065 160 GAVVLARSAH-DPHQAFRYGP-SAWGVQFHPEFTA 192 (238) T ss_pred CEEEEEECCC-CHHHHEEECC-CEEEEECCCCCCH T ss_conf 8389980999-9057178589-4899956862899 No 45 >PRK13180 consensus Probab=99.75 E-value=2.2e-17 Score=143.11 Aligned_cols=172 Identities=23% Similarity=0.359 Sum_probs=106.2 Q ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECC-CCCCCCC--C-CCCCHHHHHHHCCCCCC Q ss_conf 14999967964--10232222202454133046574464413578899858-8888410--1-12110122210257863 Q gi|254780420|r 206 YHVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSN-GPGDPAV--T-SVYSSPIICKLVDSGMP 279 (396) Q Consensus 206 ~kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSn-GPGdP~~--~-~~~~~~~~~~~~~~~iP 279 (396) .+|++||+|+= .++.|.|.++|+++.++ .+.+++. +.|+|+|.+ |.=+... . .....+.+++.++.++| T Consensus 2 kkI~IiDyg~gNi~Sv~~al~~~g~~~~i~---~~~~~~~--~~d~lIlPGVGsf~~~m~~L~~~~~~~~i~~~~~~gkp 76 (209) T PRK13180 2 KRVVVLDYGSGNLRSAQRALERVGAEVEVT---ADPDAAL--NADGLVVPGVGAFAACMAGLRAVGGDRIIGERLAAGRP 76 (209) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEECCCCCHHHHHHHHHHCCCHHHHHHHHHCCCC T ss_conf 779999789438999999999869989997---9999995--38968999988557777765536919999999976998 Q ss_pred EEEEECHHHHHHHH---CC----------CCCCCCC---CCCCCCCC-------CC-EECC-CCEEEEECCCCCEEEEHH Q ss_conf 47861005887875---28----------5446677---67521011-------23-4778-970999614576157034 Q gi|254780420|r 280 MFGICLGHQLLGLA---LG----------AQTVKMY---QGHHGVNH-------PV-KNIS-NGTVEIVSMNHGFAIDAS 334 (396) Q Consensus 280 ilGIClGhQ~l~~a---~G----------~~~~kl~---~GhrG~N~-------Pv-~~~~-t~kv~iTsqNHg~~v~~~ 334 (396) |||||||+|+|+.. .| +++.|++ ..|-|.|. +. ..+. ..+.|+. |+|+|... T Consensus 77 iLGIClGMQlL~~~s~E~g~~~~GLg~i~G~v~~l~~~~vPh~GWn~v~~~~~~~l~~~i~~~~~fYFv---HSy~v~~~ 153 (209) T PRK13180 77 VLGICVGMQILFERGVEHGVETEGCGEWPGTVERLDAPVLPHMGWNTVEAPAGSVLFAGLDADERFYFV---HSYAAQRW 153 (209) T ss_pred EEEEEEHEEEEEECEECCCCCCCCCCEECCEEEECCCCCCCCCCCEEEECCCCCHHHHCCCCCCCEEEE---CEEEECCC T ss_conf 799841208887031116767788136550899778887873565577706776354088858958896---85266345 Q ss_pred HCC-----CCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 369-----9828999988998087268679977898348788776700168999999999 Q gi|254780420|r 335 SLP-----AGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 335 slp-----~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) ..+ ....++...-...-+.+++ +.+++|||||||-|. ..+..|+++|++.+ T Consensus 154 ~~~~~~~~~~~~~~~~~~g~~~~aav~--~~ni~G~QFHPEKS~--~~Gl~ll~nFl~~~ 209 (209) T PRK13180 154 EGSPDPRTAPPLVTWATHGAPFVAAVE--NGPLSATQFHPEKSG--DAGAALLRNWVDTL 209 (209) T ss_pred CCCCCCCCCCCEEEEEECCCEEEEEEE--CCCEEEEECCCCCCC--HHHHHHHHHHHHHC T ss_conf 655565665535899847982699998--299999939963267--73899999998639 No 46 >PRK13144 consensus Probab=99.75 E-value=3.7e-17 Score=141.48 Aligned_cols=165 Identities=19% Similarity=0.294 Sum_probs=107.9 Q ss_pred EEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 4999967964--10232222202454133046574464413578899858888841011211012221025786347861 Q gi|254780420|r 207 HVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 207 kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) ||++||+|+= .|+.+.|...|+++.+. .+.+++. +.|+|+| ||.|....... ....+++++. +.|||||| T Consensus 2 kI~IiDyg~GNi~Sv~~al~~~g~~~~i~---~~~~~i~--~~d~lIl-PGVGsf~~a~~-~~~~~~~~i~-~kpiLGIC 73 (190) T PRK13144 2 RVGVVDYTVGNIGSVLAALKRAGAEPVVV---KEPEEAN--RVDALVL-PGVGTYEAAYA-LARSFKEVIL-EKPTLAIC 73 (190) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEE-CCCCCHHHHHH-HHHHHHHHHH-CCCEEEEE T ss_conf 89999279368999999999849986997---8999995--4897897-48885999999-9999999974-79848987 Q ss_pred CHHHHHHHHCC-----------CCCCCC---CCCCCCCCCCCEE------CCCCEEEEECCCCCEEEEHHHCCCCCEEEE Q ss_conf 00588787528-----------544667---7675210112347------789709996145761570343699828999 Q gi|254780420|r 285 LGHQLLGLALG-----------AQTVKM---YQGHHGVNHPVKN------ISNGTVEIVSMNHGFAIDASSLPAGLEETH 344 (396) Q Consensus 285 lGhQ~l~~a~G-----------~~~~kl---~~GhrG~N~Pv~~------~~t~kv~iTsqNHg~~v~~~slp~~~~v~~ 344 (396) ||+|+|....- +++.|+ +..|-|.|. |+. +..+..|.. |+|.+..+ +....+.. T Consensus 74 lGmQlL~~~seE~~~~GLgli~G~v~kl~~~kvPh~GWn~-v~~~~~~~~~~~~~fYFv---HSY~~~~~--~~~~~~~~ 147 (190) T PRK13144 74 LGMQLLFESSEEGGGRGLGIFRGRVERIKARKVPHIGWSY-TRVVKPLPFVEEGYYYYL---HSYGVRWD--EDEAHVAY 147 (190) T ss_pred HHHHHHHCCCCCCCCCCEEEEEEEEEECCCCCCCEECCCE-EECCCCCCCCCCCEEEEE---EEEEECCC--CCCCEEEE T ss_conf 6588664456334787630797699987888887636600-034678876445559999---98872268--87648999 Q ss_pred EECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 988998087268679977898348788776700168999999999 Q gi|254780420|r 345 ISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 345 ~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) .+....-+..+. ..+++|||||||-|. ..+..|.++|+.-- T Consensus 148 ~~~g~~~~s~v~--~~Ni~G~QFHPEKS~--~~Gl~lLknFl~~~ 188 (190) T PRK13144 148 VELGRRYVAAVE--KGHILGVQFHPERSG--RAGLELIRRFLAAW 188 (190) T ss_pred EECCCEEEEEEE--ECCEEEEECCCHHCC--HHHHHHHHHHHHHH T ss_conf 975997899999--598999978800215--86899999999995 No 47 >PRK13152 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.74 E-value=2.2e-17 Score=143.09 Aligned_cols=164 Identities=24% Similarity=0.383 Sum_probs=103.0 Q ss_pred EEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCC-----HHHHH-HHCCCCCC Q ss_conf 999967964--10232222202454133046574464413578899858888841011211-----01222-10257863 Q gi|254780420|r 208 VVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYS-----SPIIC-KLVDSGMP 279 (396) Q Consensus 208 IvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~-----~~~~~-~~~~~~iP 279 (396) |++||+|+= .++.|.|...|+++.++ .+.+++. +.|+|+| ||-|.-...-... .+.++ .++..++| T Consensus 2 I~IiDyg~gNi~Sv~~al~~lg~~~~i~---~~~~~i~--~~d~lIl-PGVGsf~~~m~~L~~~~l~~~i~~~~~~~~~p 75 (201) T PRK13152 2 IALIDYKAGNLNSVAKAFEKIGAINFIA---KNPKDLQ--KADKLLL-PGVGSFKEAMKNLKELGFIEALKEQVLVQKKP 75 (201) T ss_pred EEEEECCCCHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEE-CCCCCHHHHHHHHHHCCCHHHHHHHHHHCCCC T ss_conf 9999559427999999999869988997---8999996--5895898-28884999998777518588999999876996 Q ss_pred EEEEECHHHHHHHH-------CC-----CCCCCC------CCCCCCCCC-------CC-EECCC-CEEEEECCCCCEEEE Q ss_conf 47861005887875-------28-----544667------767521011-------23-47789-709996145761570 Q gi|254780420|r 280 MFGICLGHQLLGLA-------LG-----AQTVKM------YQGHHGVNH-------PV-KNISN-GTVEIVSMNHGFAID 332 (396) Q Consensus 280 ilGIClGhQ~l~~a-------~G-----~~~~kl------~~GhrG~N~-------Pv-~~~~t-~kv~iTsqNHg~~v~ 332 (396) |||||||||+|+.. -| +++.|+ +..|-|.|. |. .++.+ ...|+. |+|.+. T Consensus 76 iLGIClGmQlL~~~s~E~~~~~GLg~i~G~v~kl~~~~~~kvPhiGWn~v~~~~~~~l~~~~~~~~~fYF~---HSy~v~ 152 (201) T PRK13152 76 ILGICLGMQLFLERGYEGGVCEGLGFIEGEVVKFEEDLNLKIPHMGWNELEILKQSPLYQGIPNKSDFYFV---HSFYVK 152 (201) T ss_pred EEEEEHHHEEEEECCCCCCCCCCEEEEEEEEEECCCCCCCCCCCCEEEEEEECCCCHHHCCCCCCCEEEEE---EEEEEC T ss_conf 69985213462104243786365589976999867776766663225898603479756388766719997---200623 Q ss_pred HHHCCCCCEEEEEEC-CCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 343699828999988-99808726867997789834878877670016899999999 Q gi|254780420|r 333 ASSLPAGLEETHISL-FDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 333 ~~slp~~~~v~~~~~-nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) .. ++. ++...- .+.-+.+++ ..++||+|||||-|. ..+..|.++|++| T Consensus 153 ~~--~~~--~~~~~~yg~~~~aav~--~~Ni~G~QFHPEKS~--~~Gl~lLknFl~l 201 (201) T PRK13152 153 CK--DEF--VSAKAQYGHKFVASLQ--KDHIFATQFHPEKSQ--NLGLKLLENFLRL 201 (201) T ss_pred CC--CCC--EEEEEECCCEEEEEEE--ECCEEEEECCCCCCC--HHHHHHHHHHHCC T ss_conf 68--874--8999984998999998--499999918941167--7589999999749 No 48 >PRK07053 glutamine amidotransferase; Provisional Probab=99.74 E-value=7.1e-18 Score=146.51 Aligned_cols=150 Identities=26% Similarity=0.401 Sum_probs=102.3 Q ss_pred HHHHHHHHHCCCCEEEECCC-CHHHHHHCCCCEEEECCCCCCCCCCCC-----CCHHHHHHHCCCCCCEEEEECHHHHHH Q ss_conf 02322222024541330465-744644135788998588888410112-----110122210257863478610058878 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAET-SYKDIMSLQPDGVLLSNGPGDPAVTSV-----YSSPIICKLVDSGMPMFGICLGHQLLG 291 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~-~~~~i~~~~pdgi~lSnGPGdP~~~~~-----~~~~~~~~~~~~~iPilGIClGhQ~l~ 291 (396) .|.+.|.++|+.+.++.... ...--...++|++++-+||=++.+.+. .....++..++.++|+||||||||+|| T Consensus 18 ~i~~~l~~~g~~~~~~~~~~~~~~p~~~~~~d~livlGGpms~~d~~~~Pwl~~e~~lIr~a~~~~~PvLGIClG~QLlA 97 (235) T PRK07053 18 SFEQVLGERGYRVRYVDVGVDDLEVLDALEPDLLVVLGGPIGVYDDALYPFLAPEIALLRQRLAAGLPTLGICLGAQLIA 97 (235) T ss_pred HHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHH T ss_conf 99999997899599996788988999966679999938998988776685589999999999986998899847389999 Q ss_pred HHCCCCCCCCCCCCCCC----------CCCCEECCCCEEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCCC Q ss_conf 75285446677675210----------11234778970999614576157034369982899998899808726867997 Q gi|254780420|r 292 LALGAQTVKMYQGHHGV----------NHPVKNISNGTVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQP 361 (396) Q Consensus 292 ~a~G~~~~kl~~GhrG~----------N~Pv~~~~t~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~ 361 (396) .|+||++++.+....|. ..|+....... ...-|-|+.++ +||++.++.+.|- +-.+++++.. .. T Consensus 98 ~AlGG~V~~~~~~EiG~~~v~~t~~~~~dpl~~~~~~~-~~v~~wH~D~~---~LP~ga~~LAss~-~c~~Qaf~~g-~~ 171 (235) T PRK07053 98 RALGARVYPGAQKEIGWAPLTLTDAGRASPLRHLDGAD-TPVLHWHGDTF---DLPEGAILLASTP-ACRNQAFAWG-NH 171 (235) T ss_pred HHHCCEEECCCCCCEEEEEEEECCCCCCCCHHCCCCCC-CEEEEEECEEE---ECCCCCEEEECCC-CCHHHHEEEC-CC T ss_conf 97097786089960058877855123577000148767-72799954085---3499838998289-9858967847-96 Q ss_pred EEEECCCCCCCC Q ss_conf 789834878877 Q gi|254780420|r 362 VFSVQYHPESSP 373 (396) Q Consensus 362 ~~~vQfHPEa~P 373 (396) +||+|||||..| T Consensus 172 ~~g~QFHpE~t~ 183 (235) T PRK07053 172 VLALQFHPEIRA 183 (235) T ss_pred EEEEECCEECCH T ss_conf 899965846888 No 49 >PRK13177 consensus Probab=99.74 E-value=3.9e-17 Score=141.29 Aligned_cols=164 Identities=24% Similarity=0.405 Sum_probs=101.6 Q ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCC--C----C---CCCCHHHHHHHC Q ss_conf 14999967964--10232222202454133046574464413578899858888841--0----1---121101222102 Q gi|254780420|r 206 YHVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPA--V----T---SVYSSPIICKLV 274 (396) Q Consensus 206 ~kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~--~----~---~~~~~~~~~~~~ 274 (396) .+|++||+|+= .++.+.|...|++..++- .+.++|. +.|.|+| ||-=+ . . ......+.+..+ T Consensus 3 ~~I~IiDyg~GNi~Sv~~al~~~g~~~~~i~--~~~~~i~--~~d~lIl---PGVGsF~~~m~~Lk~~~~~~~~i~~~~~ 75 (207) T PRK13177 3 STIALIDYGAGNLRSVHNALKAAGAEGVVVT--ADPDEVR--RADRVVL---PGVGAFAACMGGLRAIPGMVEAMEERVL 75 (207) T ss_pred CEEEEECCCCHHHHHHHHHHHHCCCCCEEEE--CCHHHHH--CCCEEEE---CCCCCHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 9899991893279999999998299828995--8999983--5997998---9978789999776410444899999999 Q ss_pred CCCCCEEEEECHHHHHHHH------------CCCCCCCC-------CCCCCCCCCCCEE-------CCCCEEEEECCCCC Q ss_conf 5786347861005887875------------28544667-------7675210112347-------78970999614576 Q gi|254780420|r 275 DSGMPMFGICLGHQLLGLA------------LGAQTVKM-------YQGHHGVNHPVKN-------ISNGTVEIVSMNHG 328 (396) Q Consensus 275 ~~~iPilGIClGhQ~l~~a------------~G~~~~kl-------~~GhrG~N~Pv~~-------~~t~kv~iTsqNHg 328 (396) +.+.||||||||+|+|+.. +.|++.|+ +..|-|.|. |.. +..+..|+. |+ T Consensus 76 ~~~~piLGICLGmQlLf~~s~E~g~~~GLgli~G~V~~l~~~~~~~kiPh~GWn~-v~~~~~~~~~~~~~~~YFv---HS 151 (207) T PRK13177 76 EGGAPFLGICVGMQLLADRGEEHGVTPGLGWIGGEVRLIEPADPSCKVPHMGWND-VAPTTGSHPLIRPGEAYFL---HS 151 (207) T ss_pred HCCCCEEEECHHHHHHHHCCCCCCCCCCCCEECCEEEEECCCCCCCCCCEEEEHE-EEECCCCCCCCCCCCEEEE---EE T ss_conf 7699569981878857657607887775455453689816888777587644010-2544667766777845789---88 Q ss_pred EEEEHHHCCCCCEEEEEECCCCEE-EEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 157034369982899998899808-72686799778983487887767001689999999 Q gi|254780420|r 329 FAIDASSLPAGLEETHISLFDSSN-CGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 329 ~~v~~~slp~~~~v~~~~~nD~ti-egi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) |++.+.. . -.+.+..-.+..+ ..+. ...+||||||||-|. ..+..|+++|++ T Consensus 152 Y~~~~~~--~-~~i~~~t~y~~~~~a~v~--k~Ni~G~QFHPEKS~--~~Gl~lLknFl~ 204 (207) T PRK13177 152 YHFVPDD--G-AHVLATTDHGGGITAAVA--RDNIVGVQFHPEKSQ--AYGLALLARFLD 204 (207) T ss_pred EEEECCC--C-CCEEEEECCCCEEEEEEE--ECCEEEEECCCCCCC--HHHHHHHHHHHC T ss_conf 8985389--7-416899737998999999--599999939920067--868999999877 No 50 >PRK06490 glutamine amidotransferase; Provisional Probab=99.73 E-value=3.2e-17 Score=141.94 Aligned_cols=165 Identities=22% Similarity=0.353 Sum_probs=108.9 Q ss_pred CCCCCEEEEEEC---CCCCHHHHHHHHHCCCCEEEECCC--CHHHHHHCCCCEEEECCCCCCCCCCCCC---CHHHHHHH Q ss_conf 777714999967---964102322222024541330465--7446441357889985888884101121---10122210 Q gi|254780420|r 202 SDAKYHVVCIDY---GIRSNLLRVLSSLGCRITIVGAET--SYKDIMSLQPDGVLLSNGPGDPAVTSVY---SSPIICKL 273 (396) Q Consensus 202 ~~~~~kIvvIDf---GiK~nIlr~L~~~g~~v~Vvp~~~--~~~~i~~~~pdgi~lSnGPGdP~~~~~~---~~~~~~~~ 273 (396) ...+++|++|=- --=-.|-+.|.++|+++.++.... .+-+-+ .++||+++-+||=+..+.... ....++.. T Consensus 8 ~~~~k~vLviqH~~~e~pG~i~~~L~~~G~~~~i~r~~~gd~lP~~l-~~~dglvVlGGpmsa~D~~p~l~~e~~~I~~~ 86 (243) T PRK06490 8 ARDKRPILIVLHQERSTPGRVGQLLRERGYPLDIRRPRLGDPLPETL-EDHAGAVIFGGPMSANDPDDFIRREIDWISVP 86 (243) T ss_pred CCCCCCEEEEECCCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCCCC-CCCCEEEEECCCCCCCCCCCHHHHHHHHHHHH T ss_conf 37898579996689999729999999789858999678999899975-46058999289999899871799999999999 Q ss_pred CCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCC--CCCCCCEECCCCEEE-----EECCCCCEEEEHHHCCCCCEEEEEE Q ss_conf 257863478610058878752854466776752--101123477897099-----9614576157034369982899998 Q gi|254780420|r 274 VDSGMPMFGICLGHQLLGLALGAQTVKMYQGHH--GVNHPVKNISNGTVE-----IVSMNHGFAIDASSLPAGLEETHIS 346 (396) Q Consensus 274 ~~~~iPilGIClGhQ~l~~a~G~~~~kl~~Ghr--G~N~Pv~~~~t~kv~-----iTsqNHg~~v~~~slp~~~~v~~~~ 346 (396) ++.++|+||||||||+||.|+||++.+.+.|.. |. +||.-...++.. ..-|=|+-.+ +||++.++.+.+ T Consensus 87 l~~~~P~LGICLGaQLlA~alGg~V~~~~~g~~EiG~-~~i~~t~~g~~~~~~p~~v~~wH~e~f---~LP~gA~lLA~s 162 (243) T PRK06490 87 LKENKPFLGICLGAQMLARHLGARVAPHPDGRVEIGY-YPLRPTEAGRALMPWPEMVYHWHREGF---DLPKGAELLATG 162 (243) T ss_pred HHCCCCEEEECHHHHHHHHHHCCEEECCCCCCCCCCC-EEEEECCCCCCCCCCCCEEEEEECCCC---CCCCCCEEEECC T ss_conf 9869988998884999999839888448888775785-235677453234589783678624320---069995898789 Q ss_pred CCCCEEEEEEECCCCEEEECCCCCCCC Q ss_conf 899808726867997789834878877 Q gi|254780420|r 347 LFDSSNCGFRLIDQPVFSVQYHPESSP 373 (396) Q Consensus 347 ~nD~tiegi~~~~~~~~~vQfHPEa~P 373 (396) . +-.+++++. ...+||+|||||.++ T Consensus 163 ~-~c~~QAFr~-G~~a~g~QFHpEvt~ 187 (243) T PRK06490 163 D-DFPNQAFRY-GDNAWGLQFHPEVTR 187 (243) T ss_pred C-CCCHHHEEE-CCCEEEEECCEECCH T ss_conf 9-983682897-694899954847878 No 51 >PRK13146 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.73 E-value=6.8e-17 Score=139.62 Aligned_cols=167 Identities=23% Similarity=0.338 Sum_probs=98.3 Q ss_pred CEEEEEECCCCC--HHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCC------CCCC-CCHHHHHHHCCC Q ss_conf 149999679641--0232222202454133046574464413578899858888841------0112-110122210257 Q gi|254780420|r 206 YHVVCIDYGIRS--NLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPA------VTSV-YSSPIICKLVDS 276 (396) Q Consensus 206 ~kIvvIDfGiK~--nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~------~~~~-~~~~~~~~~~~~ 276 (396) .+|++||+|+=+ ++.+.|...++...++- ..+.+++. +.|.|+| ||--+ .... .....+.+.++. T Consensus 2 ~kI~IiDyg~gNi~Sv~~al~~~~~~~~i~i-~~~~~~i~--~~d~lIl---PGVGsf~~~m~~L~~~~~~~~i~~~~~~ 75 (208) T PRK13146 2 MSVAIIDYGSGNLRSAARALERAAPGADVTV-TADPDAVR--AADRLVL---PGVGAFADCMRGLRAVGGLREAVEEAAA 75 (208) T ss_pred CEEEEEECCCCHHHHHHHHHHHHCCCCEEEE-ECCHHHHH--HCCEEEE---CCCCCHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 8899993795589999999998579973999-69999984--2887898---3889779999987640768999999983 Q ss_pred CCCEEEEECHHHHHHHH------------CCCCCCCCC-------CCCCCCCC-------CC-EECC-CCEEEEECCCCC Q ss_conf 86347861005887875------------285446677-------67521011-------23-4778-970999614576 Q gi|254780420|r 277 GMPMFGICLGHQLLGLA------------LGAQTVKMY-------QGHHGVNH-------PV-KNIS-NGTVEIVSMNHG 328 (396) Q Consensus 277 ~iPilGIClGhQ~l~~a------------~G~~~~kl~-------~GhrG~N~-------Pv-~~~~-t~kv~iTsqNHg 328 (396) ++||||||||||+|+.. +.|++.|+. ..|-|.|. |. ..+. ..+.|.. |+ T Consensus 76 ~kpiLGICLGmQlL~~~s~E~g~~~GLgli~G~V~kl~~~~~~~kiPhiGWn~i~~~~~~~l~~~i~~~~~fYFv---HS 152 (208) T PRK13146 76 GRPFLGICVGMQLLFERGEEHGDTPGLGWIPGDVVRLQPDGPALKVPHMGWNTVDQPRDHPLFDGIPDGARFYFV---HS 152 (208) T ss_pred CCCEEEEEEEEEEEECCCEECCCCCCCEEECCEEEECCCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCEEEEE---EE T ss_conf 998798885464121124006865701187418999987886665744252778856898200478878879997---31 Q ss_pred EEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 15703436998289999889980872686799778983487887767001689999999 Q gi|254780420|r 329 FAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 329 ~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) |.+.+.. +...+...+ .+..+.++. ...++||+|||||-|. ..+..|+++|+. T Consensus 153 Y~~~~~~--~~~~~a~t~-yg~~~~a~v-~~~ni~G~QFHPEKS~--~~Gl~llknFl~ 205 (208) T PRK13146 153 YAAAPAN--PADVLAWTD-YGGPFTAAV-ARGNLFATQFHPEKSG--DAGLRLLRNFLA 205 (208) T ss_pred EEEECCC--CHHEEEEEE-CCCEEEEEE-ECCCEEEEECCCCCCC--HHHHHHHHHHHC T ss_conf 3753188--210089963-898799999-8399999989930056--838999999863 No 52 >PRK13153 consensus Probab=99.72 E-value=5.2e-17 Score=140.47 Aligned_cols=164 Identities=23% Similarity=0.400 Sum_probs=102.3 Q ss_pred EEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCC-CCC----CCHHHHHHHCCCCCCE Q ss_conf 999967964--102322222024541330465744644135788998588888410-112----1101222102578634 Q gi|254780420|r 208 VVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAV-TSV----YSSPIICKLVDSGMPM 280 (396) Q Consensus 208 IvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~-~~~----~~~~~~~~~~~~~iPi 280 (396) |++||+|+= .++.+.|.+.|+++.++ .+.+++.+ .|.|+| ||-|.-.. ... ...+.+++..+.+.|| T Consensus 2 I~IiDyg~gNi~Sv~~al~~lg~~~~ii---~~~~~i~~--~d~lIl-PGVGsf~~~m~~L~~~~l~~~i~~~~~~g~pi 75 (203) T PRK13153 2 IGIIDYNAGNLASVINAFEKIGAKARLE---SDPSKLKE--YDKLIL-PGVGAFGDAMEHLKSNGMDEAIKEFAKSGKPF 75 (203) T ss_pred EEEEECCCCHHHHHHHHHHHCCCCEEEE---CCHHHHHH--CCEEEE-CCCCCHHHHHHHHHHCCCHHHHHHHHHCCCCE T ss_conf 9999669328999999999869988997---69999965--893898-78883488999777537578899998759968 Q ss_pred EEEECHHHHHHHH---CC---------CCCCCCC---------CCCCCCCC-------CC-EECCCC-EEEEECCCCCEE Q ss_conf 7861005887875---28---------5446677---------67521011-------23-477897-099961457615 Q gi|254780420|r 281 FGICLGHQLLGLA---LG---------AQTVKMY---------QGHHGVNH-------PV-KNISNG-TVEIVSMNHGFA 330 (396) Q Consensus 281 lGIClGhQ~l~~a---~G---------~~~~kl~---------~GhrG~N~-------Pv-~~~~t~-kv~iTsqNHg~~ 330 (396) ||||||+|+|+.. .| |++.|+. ..|-|.|. |. +.+..+ ..|++ |+|. T Consensus 76 LGIClGmQlL~~~s~E~g~~~GLgli~G~V~~l~~~~~~~~~kiPhiGWn~i~~~~~~~~~~~~~~~~~~YF~---HSy~ 152 (203) T PRK13153 76 LGICLGMQLLFEKSYEFGEHEGLGLIEGEVVKFDESKFDEPLKVPHMGWNRLEFKKETPLFRGLPESFYLYFV---HSYH 152 (203) T ss_pred EEEEHHHHHCCCCCEECCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCHHHEECCCCCHHHCCCCCCCEEEEE---EEEE T ss_conf 9985553420047590675688848877899867222355667873435541034568666077768739999---1034 Q ss_pred EEHHHCCCCCEEEEEECCCC-EEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 70343699828999988998-08726867997789834878877670016899999999 Q gi|254780420|r 331 IDASSLPAGLEETHISLFDS-SNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 331 v~~~slp~~~~v~~~~~nD~-tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) +.. .+-.+.+.+..+. -+.+++ ..+++|+|||||-|- ..+..|+++|++| T Consensus 153 ~~~----~~~~~~~~~~~g~~~~aai~--~~Ni~G~QFHPEKS~--~~Gl~llknFl~l 203 (203) T PRK13153 153 AVC----DDKDVLGKTTYGYEFVSAVQ--HDNVFGFQPHPEKSH--DNGLKILKNFVEL 203 (203) T ss_pred CCC----CCCCEEEEECCCCEEEEEEE--ECCEEEEECCCCCCC--HHHHHHHHHHHCC T ss_conf 256----87689999559986899998--299999928944077--8599999999619 No 53 >PRK13170 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.72 E-value=4.1e-17 Score=141.18 Aligned_cols=165 Identities=26% Similarity=0.416 Sum_probs=103.8 Q ss_pred EEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCC--HHHHHHHCCCCCCEEE Q ss_conf 4999967964--10232222202454133046574464413578899858888841011211--0122210257863478 Q gi|254780420|r 207 HVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYS--SPIICKLVDSGMPMFG 282 (396) Q Consensus 207 kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~--~~~~~~~~~~~iPilG 282 (396) +|++||+|+= .|+.+.|...|+++.+. .+.+++. +.|.|+| ||-|.-..+-... ..+...+.+.+.|+|| T Consensus 2 kI~IiDyG~GNi~Sv~~al~~lg~~~~i~---~~~~~i~--~~d~lIL-PGVGsF~~am~~L~~~~l~~~i~~~~kpiLG 75 (196) T PRK13170 2 NVVIIDTGCANLSSVKFAFERLGYNPVVS---RDPDVIL--AADKLFL-PGVGTAQAAMDQLRERELIELIKACTQPVLG 75 (196) T ss_pred EEEEECCCCCHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEE-CCCCCHHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 89999279558999999999879979996---7989984--3897997-1899889999999873568999970897488 Q ss_pred EECHHHHHHHHC---------C---CCCCCC-----CCCCCCCCC-------CC-EECCCC-EEEEECCCCCEEEEHHHC Q ss_conf 610058878752---------8---544667-----767521011-------23-477897-099961457615703436 Q gi|254780420|r 283 ICLGHQLLGLAL---------G---AQTVKM-----YQGHHGVNH-------PV-KNISNG-TVEIVSMNHGFAIDASSL 336 (396) Q Consensus 283 IClGhQ~l~~a~---------G---~~~~kl-----~~GhrG~N~-------Pv-~~~~t~-kv~iTsqNHg~~v~~~sl 336 (396) ||||||+|+... | |++.|+ +..|-|.|. |. .++..+ ..|.. |+|.+.. T Consensus 76 ICLGMQlL~~~S~E~g~~~GLgli~G~V~kl~~~~~~vPhiGWn~i~~~~~~~l~~~i~~~~~fYFv---HSY~~~~--- 149 (196) T PRK13170 76 ICLGMQLLGRRSEESGGVDCLGIIDGDVPKLTDFGLPLPHMGWNQVTPQAGHPLFQGIEDGSYFYFV---HSYAMPV--- 149 (196) T ss_pred EEEEEEEEEECCCCCCCCCCCEEEEEEEEECCCCCCCCCEEEEEEEEECCCCHHHCCCCCCCEEEEE---EEEEECC--- T ss_conf 8800313463553689866000542299997767887744214888735898766498858879998---7787068--- Q ss_pred CCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 9982899998899808726867997789834878877670016899999999 Q gi|254780420|r 337 PAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 337 p~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) .+..+...+-...-+..+ +..++||+|||||-|. ..+..|.++|+++ T Consensus 150 -~~~~ia~t~yg~~f~s~v--~~~Ni~G~QFHPEKS~--~~Gl~lLknFl~l 196 (196) T PRK13170 150 -NPYTIAQCNYGEPFSAAI--QKDNFYGVQFHPERSG--AAGAQLLKNFLEM 196 (196) T ss_pred -CCCEEEEEECCCEEEEEE--EECCEEEEECCCCCCC--HHHHHHHHHHHCC T ss_conf -888899997899899999--7199999968920067--8489999999619 No 54 >PRK05665 amidotransferase; Provisional Probab=99.72 E-value=1.7e-16 Score=136.86 Aligned_cols=122 Identities=27% Similarity=0.433 Sum_probs=83.1 Q ss_pred CCCEEEECCCCCCCCCCCCC---CHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCC-CCCCCCCEECCCCE-- Q ss_conf 57889985888884101121---1012221025786347861005887875285446677675-21011234778970-- Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVY---SSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGH-HGVNHPVKNISNGT-- 319 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~---~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~Gh-rG~N~Pv~~~~t~k-- 319 (396) .+||++|+++|-+-.+...- ..+.++.+.+.++||||||+|||+||.|+||++.|.+.|- -|. +++.....+. T Consensus 57 ~~dg~vITGS~~~vyd~~pWi~~L~~~i~~~~~~~~pilGICfGHQliA~ALGG~V~~~~~Gw~vG~-~~~~~~~~~~~~ 135 (240) T PRK05665 57 KFDAYLVTGSKADSFGTDPWIQTLKTYLLKLYERGDKLLGVCFGHQLLALLLGGKAERASQGWGVGI-HRYQLAAHAPWM 135 (240) T ss_pred HCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHCCEEEECCCCCEEEE-EEEEECCCCCCC T ss_conf 8789999589874689967699999999999876999899944368888860983612799736667-998875877433 Q ss_pred -----EEEECCCCCEEEEHHHCCCCCEEEEEECCCCE-EEEEEECCCCEEEECCCCCCCC Q ss_conf -----99961457615703436998289999889980-8726867997789834878877 Q gi|254780420|r 320 -----VEIVSMNHGFAIDASSLPAGLEETHISLFDSS-NCGFRLIDQPVFSVQYHPESSP 373 (396) Q Consensus 320 -----v~iTsqNHg~~v~~~slp~~~~v~~~~~nD~t-iegi~~~~~~~~~vQfHPEa~P 373 (396) -.-.-+.|...|. .||++.++++.| |.+ +++++.. ..++++|||||=++ T Consensus 136 ~~~~~~~~l~~~H~D~V~--~lP~ga~~Lass--~~cp~q~~~~~-~~~~~~Q~HPEf~~ 190 (240) T PRK05665 136 SPAVTELTLLISHQDQVT--ALPEGATVIASS--DFCPNAAYHIR-DQVLCFQGHPEFVH 190 (240) T ss_pred CCCCCCCEEEEEECCEEE--ECCCCEEEEEEC--CCCCHHHEECC-CCEEEEECCCCCCH T ss_conf 587564079977224278--579984899867--99833342627-86899916968799 No 55 >PRK08250 glutamine amidotransferase; Provisional Probab=99.71 E-value=3.9e-17 Score=141.32 Aligned_cols=150 Identities=21% Similarity=0.216 Sum_probs=101.0 Q ss_pred HHHHHHHHHCCCCEEEECCCCHHHH--HHCCCCEEEECCCCCCCCCCCC--------CCHHHHHHHCCCCCCEEEEECHH Q ss_conf 0232222202454133046574464--4135788998588888410112--------11012221025786347861005 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYKDI--MSLQPDGVLLSNGPGDPAVTSV--------YSSPIICKLVDSGMPMFGICLGH 287 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~~i--~~~~pdgi~lSnGPGdP~~~~~--------~~~~~~~~~~~~~iPilGIClGh 287 (396) .|-..|.++|+++.++..... +.+ ...++|++++-+||-.|..... .....+++.++.++|+||||||| T Consensus 16 ~i~~~l~~~G~~~~~~~~~~g-~~lP~~~~~~d~liVlGGpms~~~~~e~~p~l~l~~E~~lir~a~~~~~PvlGIClG~ 94 (235) T PRK08250 16 AYLKWAEDRGYTISWSRVYAG-EALPENADGFDMLIVLGGPQSPRTTREECPYFDSKAEQHLINQAITARKAVVGVCLGS 94 (235) T ss_pred HHHHHHHHCCCEEEEEECCCC-CCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEEHHH T ss_conf 789999978996999966899-9899981025889997899877773212787470999999999998699889981677 Q ss_pred HHHHHHCCCCCCCCCCCCCCC----------CCCCEECCCCEEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEE Q ss_conf 887875285446677675210----------1123477897099961457615703436998289999889980872686 Q gi|254780420|r 288 QLLGLALGAQTVKMYQGHHGV----------NHPVKNISNGTVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRL 357 (396) Q Consensus 288 Q~l~~a~G~~~~kl~~GhrG~----------N~Pv~~~~t~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~ 357 (396) |+||.|+||++.|.+.+.-|. .+|........ ....|-|+.. -+||++-.+.+.|.. -.+++++. T Consensus 95 QliA~AlGg~V~~~~~~EiG~~~i~~t~~~~~d~~~~~~~~~-~~v~~wH~d~---~~LP~gA~lLAss~~-c~~Qaf~~ 169 (235) T PRK08250 95 QLIGEALGAKYCQSPEKEIGHFPITLTEAGLRDPLLKHFGST-LTVGHWHNDM---PGLTDQATVLATSEG-CPRQIVQY 169 (235) T ss_pred HHHHHHCCCEEECCCCCEEEEEEEEEECCCCCCHHHHCCCCC-CEEEEEEEEE---EECCCCCEEEECCCC-CHHHHEEE T ss_conf 999998098775489982448971630144458466528987-6578985236---206999679973999-84895882 Q ss_pred CCCCEEEECCCCCCCCC Q ss_conf 79977898348788776 Q gi|254780420|r 358 IDQPVFSVQYHPESSPG 374 (396) Q Consensus 358 ~~~~~~~vQfHPEa~PG 374 (396) .+. +||+|||||..|- T Consensus 170 g~~-~~g~QfHpE~t~~ 185 (235) T PRK08250 170 GNL-VYGFQCHMEFTVE 185 (235) T ss_pred CCC-EEEEECCEECCHH T ss_conf 695-8999710378999 No 56 >PRK13172 consensus Probab=99.71 E-value=2.5e-16 Score=135.70 Aligned_cols=166 Identities=28% Similarity=0.472 Sum_probs=103.6 Q ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCC--CC-C----CCCHHHHHHHCCC Q ss_conf 14999967964--10232222202454133046574464413578899858888841--01-1----2110122210257 Q gi|254780420|r 206 YHVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPA--VT-S----VYSSPIICKLVDS 276 (396) Q Consensus 206 ~kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~--~~-~----~~~~~~~~~~~~~ 276 (396) .+|++||+|+= .++.+.|...|+++.++ .+++++. +.|.|+| ||-=+ .+ . ....+.++...+. T Consensus 2 ~~I~IIDyG~GNi~Sv~~al~~lg~~~~i~---~~~~~i~--~~d~lIl---PGVGsF~~~m~~L~~~~l~~~i~~~~~~ 73 (213) T PRK13172 2 SSVSIVDYGVGNLLSVARAFQYFDASVNLV---STPEEIM--SADRLVL---PGVGAFEDGMKGLTTLNFIEPIKQFARS 73 (213) T ss_pred CEEEEEECCCHHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEE---CCCCCHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 879999789429999999999869988997---7999983--3897897---6878469999988723863789998765 Q ss_pred CCCEEEEECHHHHHHHHC-------C-----CCCCCC----------CCCCCCCCCCCEE-------------CC-CCEE Q ss_conf 863478610058878752-------8-----544667----------7675210112347-------------78-9709 Q gi|254780420|r 277 GMPMFGICLGHQLLGLAL-------G-----AQTVKM----------YQGHHGVNHPVKN-------------IS-NGTV 320 (396) Q Consensus 277 ~iPilGIClGhQ~l~~a~-------G-----~~~~kl----------~~GhrG~N~Pv~~-------------~~-t~kv 320 (396) ++|+||||||+|+|.... | |++.|+ +..|-|.|.-+.. +. ..+. T Consensus 74 ~kpiLGICLGmQlLf~~s~E~g~~~GLgli~G~v~kl~~~~~~~~~~kvPh~GWn~i~~~~~~~~~~~~l~~~~~~~~~f 153 (213) T PRK13172 74 GKPFLGICLGMQMMLSKSTEFGQHEGLGLIAGEVVSVPSHGVDGQLHKIPHIGWNELVSTSEGEDWCHTILKNIPLNSSV 153 (213) T ss_pred CCCEEEEEHHHHHHHHCCCCCCCCCCCCEECCEEEECCCCCCCCCCCCCCEECHHHEECCCCCCCCCCHHHCCCCCCCEE T ss_conf 99579985766626538646876476657552598777556666456276201445332455433343666189778617 Q ss_pred EEECCCCCEEEEHHHCCCCCEEEEEECCCCE-EEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 9961457615703436998289999889980-8726867997789834878877670016899999999 Q gi|254780420|r 321 EIVSMNHGFAIDASSLPAGLEETHISLFDSS-NCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 321 ~iTsqNHg~~v~~~slp~~~~v~~~~~nD~t-iegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) |.. |+|.+.+.. |.. +.+....++. +-++ .+..+++|+|||||=|. ..+..|+++|+++ T Consensus 154 YFv---HSY~~~~~~-~~~--~~a~~~y~g~~~~a~-i~~~Ni~G~QFHPEKS~--~~Gl~lL~nFl~l 213 (213) T PRK13172 154 YFV---HSFMAMPSN-PKK--RLADTLYDGQAISAV-IKDENMYGCQFHPEKSG--EVGLSIIQQFLQI 213 (213) T ss_pred EEE---EEEEEECCC-CCC--EEEEEEECCEEEEEE-EECCCEEEEECCHHHCC--HHHHHHHHHHHCC T ss_conf 999---877885388-873--899999899999999-98899999808802236--8699999999729 No 57 >PRK13176 consensus Probab=99.71 E-value=1.2e-16 Score=137.79 Aligned_cols=167 Identities=21% Similarity=0.318 Sum_probs=96.9 Q ss_pred EEEEEECCCC--CHHHHHHHHHCCCCEEE-E--CCCCHHHHHHCCCCEEEECCCCCCCCCCC---C-----CCHH-HHHH Q ss_conf 4999967964--10232222202454133-0--46574464413578899858888841011---2-----1101-2221 Q gi|254780420|r 207 HVVCIDYGIR--SNLLRVLSSLGCRITIV-G--AETSYKDIMSLQPDGVLLSNGPGDPAVTS---V-----YSSP-IICK 272 (396) Q Consensus 207 kIvvIDfGiK--~nIlr~L~~~g~~v~Vv-p--~~~~~~~i~~~~pdgi~lSnGPGdP~~~~---~-----~~~~-~~~~ 272 (396) ||++||+|+= .|+.+.|.++|.++.+- . ...+.+++. ++|.|+| ||-.+-.. . ...+ ++.. T Consensus 2 kI~IIDyg~GNi~Sv~~al~~~g~~~~i~~~i~~~~~~~~~~--~~~~lIL---PGVGsf~~~m~~L~~~~g~~~~i~~~ 76 (216) T PRK13176 2 RVAIIDYGSGNLHSATKAFERAAREAGIAAEIDVTSDPDRVA--RADRIVL---PGVGAFADCRRGLDAVDGMVEALTEA 76 (216) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHHCCCCCEEEEECCHHHHH--CCCEEEE---CCCCCHHHHHHHHHHCCCHHHHHHHH T ss_conf 899990698379999999999876438896099978999983--7997998---99877899999877627728999999 Q ss_pred HCCCCCCEEEEECHHHHHHHH------------CCCCCCCC-------CCCCCCCCCC-------C-EEC----CCCEEE Q ss_conf 025786347861005887875------------28544667-------7675210112-------3-477----897099 Q gi|254780420|r 273 LVDSGMPMFGICLGHQLLGLA------------LGAQTVKM-------YQGHHGVNHP-------V-KNI----SNGTVE 321 (396) Q Consensus 273 ~~~~~iPilGIClGhQ~l~~a------------~G~~~~kl-------~~GhrG~N~P-------v-~~~----~t~kv~ 321 (396) +...++||||||||+|+|+.. +.|++.|+ +..|-|.|.= . .++ .....| T Consensus 77 ~~~~~~piLGIClGmQlL~~~s~E~~~~~GLgli~G~V~kl~~~~~~~kvPh~GWn~v~~~~~~~~~~~i~~~~~~~~fY 156 (216) T PRK13176 77 VEKKARPFLGICVGMQLMATRGLEKTVTDGLGWIAGDVVEITPSDPSLKIPHMGWNTLDVKRPHPVFDGIPTGDKGLHAY 156 (216) T ss_pred HHHCCCCEEEEEHHHEEHHHCCCCCCCCCCEEEEEEEEEECCCCCCCCCCCEECHHHEEECCCCHHHCCCCCCCCCCEEE T ss_conf 99769957998531212653686788867330672499994788867648641677868678867773877788873589 Q ss_pred EECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 961457615703436998289999889980872686799778983487887767001689999999 Q gi|254780420|r 322 IVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 322 iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) .+ |+|.+.... +...+.+.+-...-+..+. ..++||+|||||-|.- -+..|+++|++ T Consensus 157 Fv---HSY~~~~~~--~~~~~a~t~yg~~f~a~v~--k~Ni~G~QFHPEKS~~--~Gl~ll~nFl~ 213 (216) T PRK13176 157 FV---HSYHLAAAN--EADVLAHADYGGPVTAIVG--RDTMVGTQFHPEKSQR--LGLALIANFLK 213 (216) T ss_pred EE---EEEEEECCC--CCCEEEEEECCCEEEEEEE--ECCEEEECCCCCCCCH--HHHHHHHHHHC T ss_conf 99---756760488--4779999975998999998--3999999099310278--49999999856 No 58 >PRK13171 consensus Probab=99.71 E-value=1.9e-16 Score=136.40 Aligned_cols=167 Identities=25% Similarity=0.336 Sum_probs=103.8 Q ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCC-CCCCCC-CCHHHHHHHCCCCCCEE Q ss_conf 14999967964--102322222024541330465744644135788998588888-410112-11012221025786347 Q gi|254780420|r 206 YHVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGD-PAVTSV-YSSPIICKLVDSGMPMF 281 (396) Q Consensus 206 ~kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGd-P~~~~~-~~~~~~~~~~~~~iPil 281 (396) .+|++||+|+= .++.+.|.+.|+++.++. +.+++. +.|+|+|. |=|. +..... .....+..+.+.++||| T Consensus 2 ~~I~IiDyg~gNi~Sv~~al~~lg~~~~ii~---~~~~i~--~~~~lIlP-GVGsf~~~m~~L~~~~~~~~i~~~~~piL 75 (200) T PRK13171 2 TDVALIDAGGANLGSVRYALERLGVEARVVR---DAAGLQ--GAQRVILP-GVGAAPEAMARLRAQGLIEPLRELQVPLI 75 (200) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEC---CHHHHH--CCCEEEEC-CCCCHHHHHHHHHHCCCHHHHHHCCCCEE T ss_conf 8799996895179999999998499879978---999995--29989989-98878999999987212678986499735 Q ss_pred EEECHHHHHHHH-----------CCCCCCCCC------CCCCCCCCCC--------EECCC-CEEEEECCCCCEEEEHHH Q ss_conf 861005887875-----------285446677------6752101123--------47789-709996145761570343 Q gi|254780420|r 282 GICLGHQLLGLA-----------LGAQTVKMY------QGHHGVNHPV--------KNISN-GTVEIVSMNHGFAIDASS 335 (396) Q Consensus 282 GIClGhQ~l~~a-----------~G~~~~kl~------~GhrG~N~Pv--------~~~~t-~kv~iTsqNHg~~v~~~s 335 (396) |||||+|+|+.. +-+++.|+. ..|-|.|.=. .++.+ ...|.. |+|++..+ T Consensus 76 GIClGmQlLf~~SeEg~~~GLgli~G~V~kl~~~~~~~vPh~GWn~i~~~~~~~~~~~~~~~~~fYFv---HSY~~~~~- 151 (200) T PRK13171 76 GICLGMQLLFEHSEEGDVECLGLLPGIVRHMTPALGIRVPHMGWNRLVPMRDSALLAGLPERASAYFV---HGYAAPVT- 151 (200) T ss_pred EEEHHHHHHHCCCCCCCCCCCCEECCEEEECCCCCCCCCCEEEEEEEECCCCCHHHCCCCCCCCEEEE---CCCCCCCC- T ss_conf 56076575403543799766167701899867776786836424345336788767299879819998---29611788- Q ss_pred CCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 699828999988998087268679977898348788776700168999999999 Q gi|254780420|r 336 LPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 336 lp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) .+ .+...+-...-+.. .++.+++|+|||||-|. ..+..|+++|++|- T Consensus 152 --~~-~i~~~~~~~~f~a~--v~~~ni~G~QFHPEKS~--~~Gl~ll~nF~~l~ 198 (200) T PRK13171 152 --AD-TVAACDHGGLFTAI--VQNGLRCGAQFHPERSA--DTGARILRNFLEMS 198 (200) T ss_pred --CC-EEEEECCCCEEEEE--EECCCEEEECCCCCCCC--HHHHHHHHHHHHHH T ss_conf --87-89994279849999--98199899808943266--75799999998540 No 59 >PRK05380 pyrG CTP synthetase; Validated Probab=99.70 E-value=3.3e-16 Score=134.79 Aligned_cols=263 Identities=18% Similarity=0.310 Sum_probs=166.8 Q ss_pred HHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCCHHHCCCCCCCEECCC-------------------- Q ss_conf 689999998539842366327654589789888774158874321011013333211168-------------------- Q gi|254780420|r 128 TRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGAELAKHVTVSQRRDWSE-------------------- 187 (396) Q Consensus 128 TRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~dL~~~Vs~~~~~~~~~-------------------- 187 (396) |-.=+|.||+-|-+.-+|..-.+..++ ++.++++..|........+....++..|...- T Consensus 190 TQhSVkeLr~~GIqPDilvcRse~~l~-~~~k~KIalfcnV~~~~VI~~~Dv~sIYeVPl~l~~qgl~~~I~~~L~L~~~ 268 (534) T PRK05380 190 TQHSVKELRSIGIQPDILVCRSERPLP-EEEKRKIALFCNVPEEAVISAPDVDSIYEVPLLLHEQGLDDIVLKRLGLEAP 268 (534) T ss_pred HHHHHHHHHHCCCCCCEEEEECCCCCC-HHHHHHHHHHCCCCHHHEEECCCCCHHHHHHHHHHHCCCHHHHHHHHCCCCC T ss_conf 188899999658998747862577689-8999988641588789968626852377768999985838999998678989 Q ss_pred --CCCCCCCCCCCCCCCCCCCEEEEEEC--C---CCCHHHHHHHH----HCCCCEEEECCC------CHHHHHHCCCCEE Q ss_conf --65344543443467777714999967--9---64102322222----024541330465------7446441357889 Q gi|254780420|r 188 --KIWKWGEETSFLKSSDAKYHVVCIDY--G---IRSNLLRVLSS----LGCRITIVGAET------SYKDIMSLQPDGV 250 (396) Q Consensus 188 --~~~~~~~~~~~~~~~~~~~kIvvIDf--G---iK~nIlr~L~~----~g~~v~Vvp~~~------~~~~i~~~~pdgi 250 (396) ....|..-.....++....+|+++-= . .=.||+..|.- .+++|.+.--++ +..+.++ +.||| T Consensus 269 ~~~l~~W~~~v~~~~~~~~~V~IaiVGKYv~L~DaY~Sv~EAL~Hag~~~~~kv~i~wIdse~le~~~~~~~L~-~~dGI 347 (534) T PRK05380 269 EPDLSEWEDLVERIKNPKGEVTIALVGKYVELPDAYKSVIEALKHAGIANDVKVNIKWIDSEDLEEENVAELLK-DVDGI 347 (534) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEEHHHCCCCCHHHHHH-CCCEE T ss_conf 97899999999987587655699986313678604999999999888863874227998857636646777763-18829 Q ss_pred EECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCC-----CCCCCCCEECCCCE------ Q ss_conf 9858888841011211012221025786347861005887875285446677675-----21011234778970------ Q gi|254780420|r 251 LLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGH-----HGVNHPVKNISNGT------ 319 (396) Q Consensus 251 ~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~Gh-----rG~N~Pv~~~~t~k------ 319 (396) ++-+|=|+=..- .-+..++.+-+.+||-||||||+|+....|--.+..++-.| ....|||.++.... T Consensus 348 lVPGGFG~RGiE--GKI~Ai~yARen~IPfLGICLGMQ~avIEfARNVlgl~dAnStEfd~~t~~pVI~lm~eq~~~~~~ 425 (534) T PRK05380 348 LVPGGFGERGIE--GKILAIRYARENNIPFLGICLGMQLAVIEFARNVLGLEDANSTEFDPDTPHPVIDLMPEQKDVSDL 425 (534) T ss_pred EECCCCCCCCHH--HHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHCCCCCCCEEECCCCCCCCEEEECHHHCCCCCC T ss_conf 957876654315--689999999981998110112237889999998458999732535899999869988231377656 Q ss_pred ----------EEEE-----------------CCCCCEEEEHHH---C-CCCCEEEEEECCCCEEEEEEECCCCEE-EECC Q ss_conf ----------9996-----------------145761570343---6-998289999889980872686799778-9834 Q gi|254780420|r 320 ----------VEIV-----------------SMNHGFAIDASS---L-PAGLEETHISLFDSSNCGFRLIDQPVF-SVQY 367 (396) Q Consensus 320 ----------v~iT-----------------sqNHg~~v~~~s---l-p~~~~v~~~~~nD~tiegi~~~~~~~~-~vQf 367 (396) |.+. -+-|-|.|.++= | ..++.++..|...+.+|-++.++.|+| |||| T Consensus 426 GGTMRLG~y~~~l~~gS~a~~~Yg~~~~I~ERHRHRYEvN~~y~~~le~~Gl~~sG~s~d~~lvEiiEl~~HpfFvg~Qf 505 (534) T PRK05380 426 GGTMRLGAYPCKLKPGTLAAEIYGKTEVIYERHRHRYEVNNKYREQLEKAGLVFSGTSPDGRLVEIIELPDHPWFVGVQF 505 (534) T ss_pred CCEEECCCEEEEECCCCHHHHHHCCCCEECCCCCCCCCCCHHHHHHHHHCCCEEEEECCCCCEEEEEECCCCCEEEEECC T ss_conf 66045164456756888999985899816664564200189999999978979999989998789998689980899657 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 878877670016899999999997357 Q gi|254780420|r 368 HPESSPGPQDSYYLFSTFIDFMHKRKT 394 (396) Q Consensus 368 HPEa~PGP~D~~~~F~~F~~~i~~~k~ 394 (396) |||=..-|.+.+-+|..|++...++|. T Consensus 506 HPEfkSrP~~PhPlF~~fI~Aal~~~~ 532 (534) T PRK05380 506 HPEFKSRPRRPHPLFAGFVKAALEYKK 532 (534) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 766578999998369999999998752 No 60 >PRK07567 glutamine amidotransferase; Provisional Probab=99.70 E-value=1.5e-16 Score=137.17 Aligned_cols=123 Identities=24% Similarity=0.320 Sum_probs=82.7 Q ss_pred CCCCEEEECCCCCCCCCCCC-----------CCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCC-C----- Q ss_conf 35788998588888410112-----------110122210257863478610058878752854466776752-1----- Q gi|254780420|r 245 LQPDGVLLSNGPGDPAVTSV-----------YSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHH-G----- 307 (396) Q Consensus 245 ~~pdgi~lSnGPGdP~~~~~-----------~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~Ghr-G----- 307 (396) .++||||+++||.+-.+... ....+++++.+.++|+||||||||+||.|+||++.+. +|.- | T Consensus 50 ~~ydgvivgGsp~~vsd~~~~~~~~q~~~e~~l~~Ll~~~~~~~~P~LGiC~G~Qlla~alGG~V~~~-~~~e~G~~~i~ 128 (242) T PRK07567 50 DDFSGVIVGGSPFNVSDPAYSKSPTQRRVEAELSGLLDEVVARDFPFLGCCYGVGTLGHHLGGVVDRT-HGEPVGATTVT 128 (242) T ss_pred HHCCEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHCCCEEECC-CCCEEEEEEEE T ss_conf 00078999488975678544564688899999999999998469988998563899999729888279-98707799999 Q ss_pred -----CCCCCEECCCCEEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCC Q ss_conf -----011234778970999614576157034369982899998899808726867997789834878877 Q gi|254780420|r 308 -----VNHPVKNISNGTVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSP 373 (396) Q Consensus 308 -----~N~Pv~~~~t~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~P 373 (396) ..+|+..-.-.+ +-.-+-|+..|. .||.+..+.+.|.. --+++++..+ ++|++|||||-.+ T Consensus 129 lt~~g~~Dpl~~~lp~~-f~a~~~H~d~V~--~LP~ga~lLA~s~~-cpvqafr~g~-~~y~~QFHPE~d~ 194 (242) T PRK07567 129 LTDAGRRDPLTADLPDT-FTAFTGHKEAVV--ALPPGAVLLATGPT-CPVQMFRAGE-NVYATQFHPELDA 194 (242) T ss_pred ECCCCCCCCCCCCCCCC-EEEEEECCCEEE--ECCCCCEEEECCCC-CCHHEEEECC-CEEEEECCCCCCH T ss_conf 81456779110179844-179986164137--68998489984999-8236189379-7899924840799 No 61 >PRK13178 consensus Probab=99.70 E-value=5.3e-16 Score=133.35 Aligned_cols=168 Identities=23% Similarity=0.397 Sum_probs=104.4 Q ss_pred EEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCC-CC----CCHHHHHHHCCCCCCE Q ss_conf 999967964--1023222220245413304657446441357889985888884101-12----1101222102578634 Q gi|254780420|r 208 VVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVT-SV----YSSPIICKLVDSGMPM 280 (396) Q Consensus 208 IvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~-~~----~~~~~~~~~~~~~iPi 280 (396) |++||+|+= +++.+.|...|+++.|. .+.+++. +.|+|+| ||=|.=... .. ...+.+.+..+.++|| T Consensus 2 I~IiDyg~GNi~Sv~~al~~lg~~~~i~---~~~~~i~--~~d~lIl-PGVGsf~~~m~~L~~~~l~~~i~~~~~~gkpi 75 (213) T PRK13178 2 LAILDYKAGNQTSVRRALNHLGIPCVIT---ADPEVIQ--GAAGIIF-PGVGAAGQAMNELRTTGLDEVLREQVQAGKPL 75 (213) T ss_pred EEEEECCCCHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEE-CCCCCHHHHHHHHHHCCHHHHHHHHHHCCCCE T ss_conf 9999659308999999999869998997---7999994--3998998-99885699999875006289999874369957 Q ss_pred EEEECHHHHHHHH--------CC---CCCCCC------------CCCCCCCCC-------CC-EECCC-CEEEEECCCCC Q ss_conf 7861005887875--------28---544667------------767521011-------23-47789-70999614576 Q gi|254780420|r 281 FGICLGHQLLGLA--------LG---AQTVKM------------YQGHHGVNH-------PV-KNISN-GTVEIVSMNHG 328 (396) Q Consensus 281 lGIClGhQ~l~~a--------~G---~~~~kl------------~~GhrG~N~-------Pv-~~~~t-~kv~iTsqNHg 328 (396) ||||||+|+|... +| +++.|+ +..|-|.|. |. .++.+ .+.|.. |+ T Consensus 76 LGIClGmQlLf~~seE~~~~GLgli~G~v~~l~~~~~~~~~~~~kvPhiGWn~i~~~~~~~l~~~~~~~~~fYFv---HS 152 (213) T PRK13178 76 LGICVGCQIMLDYSQENDTKCLGIIPGECRLFNPALTDEDGAPIRVPHMGWNSVVQRRPCPLFKGIEPEAEFYFV---HS 152 (213) T ss_pred EEEEHHHEEEECCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCEEEEE---EE T ss_conf 998424555541442454224675610699888543443477756772122127717998522456777879998---68 Q ss_pred EEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 157034369982899998899808726867997789834878877670016899999999997 Q gi|254780420|r 329 FAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHK 391 (396) Q Consensus 329 ~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~ 391 (396) |.++++ .+. +.+.+-.+...-+ .+...++||+|||||-|. ..+..|+++|+..-.+ T Consensus 153 Y~~~~~---~~~-~ia~t~yg~~f~a-~v~k~Ni~G~QFHPEKSg--~~Gl~lLknF~~~~~~ 208 (213) T PRK13178 153 YFPAPP---EEY-VIATTTYGIEFCS-AHGGPGLWAVQFHPEKSG--RPGLRLLANFHRYCTE 208 (213) T ss_pred EEECCC---CCC-EEEEEECCCEEEE-EEEECCEEEEECCCCCCC--HHHHHHHHHHHHHHHH T ss_conf 985389---864-8999978997999-998099999919953168--8699999999999876 No 62 >PRK13174 consensus Probab=99.69 E-value=7.7e-16 Score=132.21 Aligned_cols=168 Identities=24% Similarity=0.372 Sum_probs=102.8 Q ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC-----CCHHHHHHHCCCCC Q ss_conf 14999967964--102322222024541330465744644135788998588888410112-----11012221025786 Q gi|254780420|r 206 YHVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV-----YSSPIICKLVDSGM 278 (396) Q Consensus 206 ~kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~-----~~~~~~~~~~~~~i 278 (396) ++|++||+|+= .|+.+.|...|++..++. .+.+++. +.|.|+| ||-|.=...-. ...++++.+ ..+. T Consensus 2 k~I~IiDyG~gNi~Sv~~al~~~g~~~~~i~--~~~~~i~--~~d~lIl-PGVGsf~~~m~~L~~~~~~~~i~~~-~~~k 75 (212) T PRK13174 2 QTVAVIDYGMGNLHSVAKALEHVGAGRVLVT--SDAAVIR--EADRVVF-PGVGAIRDCMAEIRRLGFDSLVREV-SQDR 75 (212) T ss_pred CEEEEEECCCHHHHHHHHHHHHCCCCEEEEC--CCHHHHH--HCCEEEE-CCCCCHHHHHHHHHHCCHHHHHHHH-HCCC T ss_conf 8899995881079999999997499879983--8989996--3897996-5888689999998876969999999-7499 Q ss_pred CEEEEECHHHHHHHHC---C---------CCCCC-----------CCCCCCCCCC-------CC-EECCC-CEEEEECCC Q ss_conf 3478610058878752---8---------54466-----------7767521011-------23-47789-709996145 Q gi|254780420|r 279 PMFGICLGHQLLGLAL---G---------AQTVK-----------MYQGHHGVNH-------PV-KNISN-GTVEIVSMN 326 (396) Q Consensus 279 PilGIClGhQ~l~~a~---G---------~~~~k-----------l~~GhrG~N~-------Pv-~~~~t-~kv~iTsqN 326 (396) ||||||||+|+|+... | |++.| ++.+|-|.|. |. .++.+ .+.|.. T Consensus 76 piLGICLGMQlL~~~s~E~g~~~GLgli~G~V~~~~~~~~~~~~~~kiPhiGWn~i~~~~~~~l~~~~~~~~~fYFv--- 152 (212) T PRK13174 76 PFLGICVGMQALLERSEENGGVDCIGLFPGQVRFFGKDLHEDGEHLKVPHMGWNQVSQAVDHPLWHDIPDLARFYFV--- 152 (212) T ss_pred CEEEEEHHHHHHHHCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCEECCEEEEECCCCCHHCCCCCCCEEEEE--- T ss_conf 87998088998761754789877533635079986755565554556755001247606888121067765759987--- Q ss_pred CCEEEEHHHCCCCCEEEEE-ECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 7615703436998289999-88998087268679977898348788776700168999999999 Q gi|254780420|r 327 HGFAIDASSLPAGLEETHI-SLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 327 Hg~~v~~~slp~~~~v~~~-~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) |+|.+.... ...+++. +-...-+..++ ..++||+|||||-|. ..+..|+++|+..- T Consensus 153 HSY~~~~~~---~~~~~a~t~~g~~f~a~v~--k~Ni~G~QFHPEKS~--~~Gl~lLknFi~w~ 209 (212) T PRK13174 153 HSYYIEAGN---PRQVVGRGHYGVDFAAALA--DGSRFAVQFHPEKSH--THGLQLLQNFAAWD 209 (212) T ss_pred EEEEECCCC---HHCEEEEEECCCEEEEEEE--ECCEEEECCCCCCCC--HHHHHHHHHHHHHC T ss_conf 566752698---2038999989988999998--499999908910057--75899999998317 No 63 >CHL00188 hisH imidazole glycerol phosphate synthase subunit hisH; Provisional Probab=99.68 E-value=5.9e-16 Score=132.99 Aligned_cols=163 Identities=24% Similarity=0.361 Sum_probs=101.2 Q ss_pred CEEEEEECCC--CCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCC--CC-C----CCCHHHHHHHCCC Q ss_conf 1499996796--410232222202454133046574464413578899858888841--01-1----2110122210257 Q gi|254780420|r 206 YHVVCIDYGI--RSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPA--VT-S----VYSSPIICKLVDS 276 (396) Q Consensus 206 ~kIvvIDfGi--K~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~--~~-~----~~~~~~~~~~~~~ 276 (396) .+|++||+|+ =.|+.+.|.+.|+++.+. .+.+++. +.|.|+| ||-=+ .+ . ....+.+++..+. T Consensus 2 ~~I~Iid~g~GNi~Sv~~al~~~g~~~~ii---~~~~~i~--~~d~lIL---PGVGsf~~~m~~L~~~~l~~~i~~~~~~ 73 (210) T CHL00188 2 MKIGIIDYSMGNLHSVSRAIQQAGQQPCII---NSESELA--QVHALVL---PGVGSFDLAMKKLEKKGLITPIKKWIAE 73 (210) T ss_pred CEEEEEECCCHHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEE---CCCCCHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 779999588478999999999869987997---9999996--4896898---8867389999988630516999999976 Q ss_pred CCCEEEEECHHHHHHHHC-----------CCCCCCCC------CCCCCCCCCCEECC-----------------CCEEEE Q ss_conf 863478610058878752-----------85446677------67521011234778-----------------970999 Q gi|254780420|r 277 GMPMFGICLGHQLLGLAL-----------GAQTVKMY------QGHHGVNHPVKNIS-----------------NGTVEI 322 (396) Q Consensus 277 ~iPilGIClGhQ~l~~a~-----------G~~~~kl~------~GhrG~N~Pv~~~~-----------------t~kv~i 322 (396) ++||||||||+|+|+... .+++.|++ ..|-|.|. +. .. ....|. T Consensus 74 g~piLGICLGmQlLf~~seEg~~~GLgli~G~v~k~~~~~~~kvPh~GWn~-i~-~~~~~~~~~~~~~~~~~~~~~~~YF 151 (210) T CHL00188 74 GNPFIGICLGLHLLFETSEEGKEEGLGIYKGQVKRLKHSPVKVIPHMGWNR-LE-CQNSECQNSEWVNWKAWPLNPWAYF 151 (210) T ss_pred CCCEEEEEHHEEEEEHHCCCCCCCCCCEECCEEEECCCCCCCCCCCCCHHH-EE-CCCCCCCCCCCCHHHCCCCCCCEEE T ss_conf 998699842102210221156767734566689993788664673205423-24-4576433444511215776770799 Q ss_pred ECCCCCEEEEHHHCCCCCEEEEEECCC-CEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 614576157034369982899998899-80872686799778983487887767001689999999 Q gi|254780420|r 323 VSMNHGFAIDASSLPAGLEETHISLFD-SSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 323 TsqNHg~~v~~~slp~~~~v~~~~~nD-~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) . |+|.+.+.. +.......+-.+ .-+.+++ ..+++|+|||||=|. ..+..|+++|++ T Consensus 152 v---HSY~~~~~~--~~~~~~~~~yg~~~~~a~v~--k~Ni~G~QFHPEKS~--~~Gl~lL~nFl~ 208 (210) T CHL00188 152 V---HSYGVMPKS--QACATTTTFYGKQQMVAAIE--YDNIFAMQFHPEKSG--EFGLWLLREFMK 208 (210) T ss_pred E---EEEEECCCC--CCEEEEEEEECCEEEEEEEE--ECCEEEEECCCCCCC--HHHHHHHHHHHH T ss_conf 7---413552798--43799999639989999997--199999948800076--879999999975 No 64 >PRK13525 glutamine amidotransferase subunit PdxT; Provisional Probab=99.67 E-value=1.8e-15 Score=129.61 Aligned_cols=166 Identities=19% Similarity=0.352 Sum_probs=101.8 Q ss_pred CEEEEEEC-CCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC-----CCHHHHHHHCCCCCC Q ss_conf 14999967-964102322222024541330465744644135788998588888410112-----110122210257863 Q gi|254780420|r 206 YHVVCIDY-GIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV-----YSSPIICKLVDSGMP 279 (396) Q Consensus 206 ~kIvvIDf-GiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~-----~~~~~~~~~~~~~iP 279 (396) .||.|||+ |-=.+.++.|.+.|+++.++ .+.++|.+ .|+++|-+ |....+.. ...+.+++..+.++| T Consensus 2 ~~IGvl~~qGn~rs~~~aL~~lG~~~~~v---~~~~di~~--ad~lILPG--G~s~am~~ll~~~gl~~~i~~~i~~gkP 74 (191) T PRK13525 2 MKIGVLALQGAVREHIAALEALGAEAVEV---RRPEDLDE--IDGLILPG--GESTTMGKLLRDFGLLEPLREFIASGLP 74 (191) T ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEE---CCHHHHHH--CCEEEECC--CCHHHHHHHHHHCCCHHHHHHHHHCCCC T ss_conf 37999964788999999999879978998---99999953--99899789--7658999999986868999999984998 Q ss_pred EEEEECHHHHHHHHCC-----------CCCCCCCCCCCCCC-----CCCEECCCC-EEEEECCCCCEEEEHHHCCCCCEE Q ss_conf 4786100588787528-----------54466776752101-----123477897-099961457615703436998289 Q gi|254780420|r 280 MFGICLGHQLLGLALG-----------AQTVKMYQGHHGVN-----HPVKNISNG-TVEIVSMNHGFAIDASSLPAGLEE 342 (396) Q Consensus 280 ilGIClGhQ~l~~a~G-----------~~~~kl~~GhrG~N-----~Pv~~~~t~-kv~iTsqNHg~~v~~~slp~~~~v 342 (396) +||||||+|+|+...- +++.|..|| |-.+ ..+..+..+ +.|+. |+|-+.. ..++.++ T Consensus 75 ~LGIClGmqLL~~~~E~~~~~gLGli~g~V~Rn~~g-rq~~s~~~~~~~~~~~~~~~~yFv---hsp~~~~--~~~~~~~ 148 (191) T PRK13525 75 VFGTCAGMILLAKEIEGPEQEHLGLLDITVRRNAFG-RQVDSFEAELDIKGLGDPFPAVFI---RAPYIER--VGPGVEV 148 (191) T ss_pred EEEECHHHHEEEECCCCCCCCCCCCCCCEEEECCCC-CCCCCCCCCCCCCCCCCCCCEEEE---ECCEEEE--CCCCCEE T ss_conf 576222121112023358977767788378634647-514444267636688998458997---0888804--4999489 Q ss_pred EEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 99988998087268679977898348788776700168999999999973 Q gi|254780420|r 343 THISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKR 392 (396) Q Consensus 343 ~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~ 392 (396) +..- ++.+-+++ ..+++++|||||-|. | ..+.++|++|+++. T Consensus 149 ~a~~--g~~~vaV~--~~Nv~g~QFHPEkSg---~-~~i~~~Fl~~vk~~ 190 (191) T PRK13525 149 LARV--GGRIVAVR--QGNILATSFHPELTD---D-RRVHRYFLDMVKEA 190 (191) T ss_pred EEEE--CCEEEEEE--ECCEEEEEECCCCCC---C-CHHHHHHHHHHHHH T ss_conf 9998--99999999--899999980993059---8-37999999999851 No 65 >PRK13179 consensus Probab=99.67 E-value=1.5e-15 Score=130.25 Aligned_cols=164 Identities=21% Similarity=0.392 Sum_probs=101.0 Q ss_pred EEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCC-----CHHHHHHHCCCCCCE Q ss_conf 999967964--1023222220245413304657446441357889985888884101121-----101222102578634 Q gi|254780420|r 208 VVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVY-----SSPIICKLVDSGMPM 280 (396) Q Consensus 208 IvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~-----~~~~~~~~~~~~iPi 280 (396) |++||+|+= .|+.+.|.+.|+++.+. .+.+++. +.|.|+| ||=|.=...... ..+.+++..+.+.|| T Consensus 2 I~IiDyg~gNi~Sv~~al~~~g~~~~i~---~~~~~i~--~~d~lIL-PGVGsF~~~m~~L~~~~l~~~i~~~~~~~kpi 75 (207) T PRK13179 2 IAIIDYGMGNLRSVQKGFEKVGFEAVVT---ADPKVVL--EAEKVVL-PGVGAFRDCMRNLEQGGFVEPILRVIRDGRPF 75 (207) T ss_pred EEEEECCCHHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEE-CCCCCHHHHHHHHHHCCCHHHHHHHHHCCCCE T ss_conf 9999569358999999999859987996---8999994--3998998-99786799999988537616899998659968 Q ss_pred EEEECHHHHHHHHC-------C-----CCCCCC-----------CCCCCCCCCCCE---------ECCC-CEEEEECCCC Q ss_conf 78610058878752-------8-----544667-----------767521011234---------7789-7099961457 Q gi|254780420|r 281 FGICLGHQLLGLAL-------G-----AQTVKM-----------YQGHHGVNHPVK---------NISN-GTVEIVSMNH 327 (396) Q Consensus 281 lGIClGhQ~l~~a~-------G-----~~~~kl-----------~~GhrG~N~Pv~---------~~~t-~kv~iTsqNH 327 (396) ||||||+|+|.... | +++.|+ +-.|-|.|. |. ++.. ...|.. | T Consensus 76 LGICLGmQlLf~~S~E~g~~~GLgli~G~V~kl~~~~~~~~~~~kiPhiGWn~-i~~~~~~~~~~~~~~~~~fYFv---H 151 (207) T PRK13179 76 LGICVGMQLLFTDSEEFGLYQGLNVIPGHVLRFPEGMREGGEELKVPHMGWNQ-LSIRRRPPAFQGIEDGTNVYFV---H 151 (207) T ss_pred EEEEHHHHHCCCCCCCCCCCCCCCEECEEEEECCCCCCCCCCCCCCCEEEEEE-EECCCCCCHHCCCCCCCEEEEE---E T ss_conf 99807799637988069955778783407998764434446546687511256-6436788310144567707998---4 Q ss_pred CEEEEHHHCCCCCEEEEEECCCCE-EEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 615703436998289999889980-8726867997789834878877670016899999999 Q gi|254780420|r 328 GFAIDASSLPAGLEETHISLFDSS-NCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 328 g~~v~~~slp~~~~v~~~~~nD~t-iegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) +|.+.+.. ++. +++..-.+.. +..+. ..+++|+|||||=|. ..+..|+++|+++ T Consensus 152 SY~~~~~~--~~~-i~~~~~yg~~~~a~v~--k~Ni~G~QFHPEKS~--~~Gl~lLknF~el 206 (207) T PRK13179 152 SYYVKPDD--ESV-VAATTGYGIEFCAAVW--KDNIVATQFHPEKSQ--AVGLSILKNFGEM 206 (207) T ss_pred CCEECCCC--CCC-EEEEEECCCEEEEEEE--ECCEEEEECCHHHCC--HHHHHHHHHHHHC T ss_conf 31641578--451-8999848997999999--699999958801255--8689999999713 No 66 >PRK13173 consensus Probab=99.67 E-value=2.2e-15 Score=128.97 Aligned_cols=167 Identities=20% Similarity=0.342 Sum_probs=100.9 Q ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCC-CCC----CCCHHHHHHHCCCCC Q ss_conf 14999967964--10232222202454133046574464413578899858888841-011----211012221025786 Q gi|254780420|r 206 YHVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPA-VTS----VYSSPIICKLVDSGM 278 (396) Q Consensus 206 ~kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~-~~~----~~~~~~~~~~~~~~i 278 (396) .+|++||+|+= .|+.+.|...|+++.++ .+.+++. +.|.|+|- |=|.=. ... ....+.++... .+. T Consensus 2 ~kI~IiDyg~gNi~Sv~~al~~lg~~~~i~---~~~~~i~--~~d~lILP-GVGsF~~~m~~L~~~~l~~~i~~~~-~~k 74 (211) T PRK13173 2 TKIALLDYGMGNLHSASKALSAVGAEVSIT---NDPKVIA--AADKIVFP-GVGAMRDCMAGMHEAGIDEVVRQAI-FNK 74 (211) T ss_pred CEEEEEECCCHHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEEC-CCCCHHHHHHHHHHCCHHHHHHHHH-CCC T ss_conf 879999689449999999999869988996---7999984--38968975-7786899999988629799999886-299 Q ss_pred CEEEEECHHHHHHHHC---C---------CCCCCC------------CCCCCCCCCCC--------E-ECCC-CEEEEEC Q ss_conf 3478610058878752---8---------544667------------76752101123--------4-7789-7099961 Q gi|254780420|r 279 PMFGICLGHQLLGLAL---G---------AQTVKM------------YQGHHGVNHPV--------K-NISN-GTVEIVS 324 (396) Q Consensus 279 PilGIClGhQ~l~~a~---G---------~~~~kl------------~~GhrG~N~Pv--------~-~~~t-~kv~iTs 324 (396) ||||||||+|+|+... | |++.|+ +..|-|.|.=. . ++.. ...|.. T Consensus 75 piLGICLGMQlLf~~s~E~g~~~GLgli~G~V~~l~~~~~~~~~~~~kvPhiGWn~v~~~~~~~~l~~~i~~~~~fYFv- 153 (211) T PRK13173 75 PVMAICVGMQALFEQSAENGGTPCLSILDGTVKAFDPSWKDPKGVQIKVPHMGWNTISGMDFEHPLWNGIDDKAHFYFV- 153 (211) T ss_pred CEEEEEHHHHHHHCCCCCCCCCCCCEEEEEEEEECCCCCCCCCCCCCCCCCEEEEEEECCCCCCHHHCCCCCCCEEEEE- T ss_conf 9899818889850689889996713066006987674446755554346600034543357653644499878607998- Q ss_pred CCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 4576157034369982899998899808726867997789834878877670016899999999 Q gi|254780420|r 325 MNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 325 qNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) |+|.++++. .+..+...+-...-+..+. ..+++|+|||||=|. ..+..|+++|++. T Consensus 154 --HSY~~~~~~--~~~~~~~~~yg~~f~a~v~--k~Ni~G~QFHPEKS~--~~Gl~lLknFl~~ 209 (211) T PRK13173 154 --HSYYCEPAD--SSQVAAICDYGQPFCASIL--KDNLFATQFHPEKSH--TAGLQLLKNFVEW 209 (211) T ss_pred --EEEEEECCC--CCEEEEEEECCCEEEEEEE--ECCEEEEECCHHHCC--HHHHHHHHHHHCC T ss_conf --667870378--2507899838988999999--699999908801245--8689999999735 No 67 >PRK13148 consensus Probab=99.66 E-value=2.9e-15 Score=128.10 Aligned_cols=165 Identities=24% Similarity=0.399 Sum_probs=96.6 Q ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCC--C-CC----CCCHHHHHHHCCC Q ss_conf 14999967964--10232222202454133046574464413578899858888841--0-11----2110122210257 Q gi|254780420|r 206 YHVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPA--V-TS----VYSSPIICKLVDS 276 (396) Q Consensus 206 ~kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~--~-~~----~~~~~~~~~~~~~ 276 (396) .+|++||+|+= .|+.|.|...|.+..+.- ..+.++|. +.|.|+| ||-=+ . .. ....+.+++.. . T Consensus 2 ~~I~IIDyg~GNl~Sv~~al~~~g~~~~~~i-~~~~~~i~--~~d~lIL---PGVGsF~~am~~L~~~~l~~~i~~~~-~ 74 (225) T PRK13148 2 STIAIVDYGMGNFHSVARALQHAAPDADIRI-CNRPEQID--AADRVVF---PGQGAMPDCMRTLNESGLRAAVERAA-A 74 (225) T ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCCCEEE-ECCHHHHH--HCCEEEE---CCCCCHHHHHHHHHHCCHHHHHHHHH-C T ss_conf 8799997994499999999998578973999-89999995--4997998---99788799999988648799999986-3 Q ss_pred CCCEEEEECHHHHHHHHC--C---------CCCCC--------------------------CCCCCCCCCCCCEE----- Q ss_conf 863478610058878752--8---------54466--------------------------77675210112347----- Q gi|254780420|r 277 GMPMFGICLGHQLLGLAL--G---------AQTVK--------------------------MYQGHHGVNHPVKN----- 314 (396) Q Consensus 277 ~iPilGIClGhQ~l~~a~--G---------~~~~k--------------------------l~~GhrG~N~Pv~~----- 314 (396) +.||||||||+|+|.... | |++.| ++..|-|.|. |.. T Consensus 75 ~kpiLGICLGMQlLf~~SeEg~~~GLgli~G~V~kl~~~~~~~~~~~~~~~~~~~~~~~~~lkiPh~GWn~-v~~~~~~~ 153 (225) T PRK13148 75 SKPLMGVCVGEQMLFERSEEGDTPCLGIFPGEVRRFAGPQFADPVAADQAAAAPPAAARERLKVPHMGWNQ-VRQTRSHA 153 (225) T ss_pred CCCEEEEEHHHHHHHHCCCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEECCCE-EECCCCCC T ss_conf 89869982778876624522697756558789998147654564211233345554435488888886120-37789971 Q ss_pred ----CCC-CEEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf ----789-7099961457615703436998289999889980872686799778983487887767001689999999 Q gi|254780420|r 315 ----ISN-GTVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 315 ----~~t-~kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) +.. ...|.. |+|.+.+.. ....+...+-...-+..+. ..++||+|||||=|. ..+..|+++|++ T Consensus 154 l~~~i~~~~~fYFv---HSY~~~~~~--~~~~~a~t~yg~~f~s~v~--~~Ni~G~QFHPEKS~--~~Gl~lLknFl~ 222 (225) T PRK13148 154 LWEGIPDGTHFYFV---HSYYAAPSD--PALTTGVTDYGVAFTCAVA--AANIFAVQFHPEKSA--EHGLRLYRNFVD 222 (225) T ss_pred HHCCCCCCCEEEEE---EEEEECCCC--CCCEEEEEECCCEEEEEEE--ECCEEEEECCHHHHC--HHHHHHHHHHHC T ss_conf 21266778706887---545631688--1558999858998999999--699999908812315--779999999876 No 68 >PRK13151 consensus Probab=99.66 E-value=6.4e-16 Score=132.77 Aligned_cols=165 Identities=25% Similarity=0.370 Sum_probs=101.0 Q ss_pred EEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCC--HHHHHHHCCCCCCEEE Q ss_conf 4999967964--10232222202454133046574464413578899858888841011211--0122210257863478 Q gi|254780420|r 207 HVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYS--SPIICKLVDSGMPMFG 282 (396) Q Consensus 207 kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~--~~~~~~~~~~~iPilG 282 (396) ||++||+|+= .|+.+.|...|+++.+. .+.+++. +.|+|+| ||=|.=..+-... ..++..+...+.|||| T Consensus 2 kI~IiDyg~GNi~Sv~~al~~lg~~~~i~---~~~~~i~--~~d~lIl-PGVGsf~~~m~~L~~~~l~~~i~~~~kpiLG 75 (195) T PRK13151 2 KIIIIDTACANLASLKFCLDRLGFNATIS---RDLKELE--SADKLFL-PGVGTAKEAMKNLEQFNLIDFIQNTKKPLLG 75 (195) T ss_pred EEEEEECCCHHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEE-CCCCCHHHHHHHHHHCCCHHHHHHCCCCEEE T ss_conf 89999479609999999999859987996---7999994--4896998-8889999999999874778889863996688 Q ss_pred EECHHHHHHHHC-----------CCCCCCC------CCCCCCCCCCCE-------ECCCCEEEEECCCCCEEEEHHHCCC Q ss_conf 610058878752-----------8544667------767521011234-------7789709996145761570343699 Q gi|254780420|r 283 ICLGHQLLGLAL-----------GAQTVKM------YQGHHGVNHPVK-------NISNGTVEIVSMNHGFAIDASSLPA 338 (396) Q Consensus 283 IClGhQ~l~~a~-----------G~~~~kl------~~GhrG~N~Pv~-------~~~t~kv~iTsqNHg~~v~~~slp~ 338 (396) ||||+|+|+... .+.+.|+ +..|-|.|. |. ++.....|.. |+|+++.+ T Consensus 76 ICLGMQlL~~~seE~~~~gLgli~g~v~k~~~~~~~~vPhiGWn~-v~~~~~l~~~i~~~~fYFv---HSY~~~~~---- 147 (195) T PRK13151 76 ICLGMQILGNFSEELNQETLKLIDFTTQKFKAKEGFTFPHMGWNQ-VHSSHALFKGLEGAYFYFV---HSYCVGLG---- 147 (195) T ss_pred EEEEEEEEECCCCCCCCCCCCEEEEEEEEECCCCCCCCCEEEHHH-CCCCCHHHCCCCCCEEEEE---EEEECCCC---- T ss_conf 776478765147576757626420268886465578886741122-5667701216667627887---30452676---- Q ss_pred CCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 828999988998087268679977898348788776700168999999999 Q gi|254780420|r 339 GLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 339 ~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) +..+...+-...-+..++ ...++|+|||||=|. ..+..|.++|++-+ T Consensus 148 ~~~ia~~~y~~~f~a~v~--k~Ni~G~QFHPEKS~--~~Gl~lL~nFlk~i 194 (195) T PRK13151 148 EYTIADCEYSQKFSASVM--KDNFYGVQFHPERSS--EAGEILISNFIKDI 194 (195) T ss_pred CCEEEEEECCCEEEEEEE--ECCEEEEECCCHHCC--HHHHHHHHHHHHHC T ss_conf 563789737988999999--199999928800305--87999999998752 No 69 >COG0118 HisH Glutamine amidotransferase [Amino acid transport and metabolism] Probab=99.66 E-value=3.1e-15 Score=127.90 Aligned_cols=168 Identities=26% Similarity=0.450 Sum_probs=104.1 Q ss_pred CEEEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC-----CCHHHHHHHCCCCC Q ss_conf 14999967964--102322222024541330465744644135788998588888410112-----11012221025786 Q gi|254780420|r 206 YHVVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV-----YSSPIICKLVDSGM 278 (396) Q Consensus 206 ~kIvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~-----~~~~~~~~~~~~~i 278 (396) .+|++||+|.= +|+.+.|.+.|+++.|. .+.++|. +.|+||| ||=|.=.++-. ...+.+++....+. T Consensus 2 ~~i~IIDyg~GNL~Sv~~Aler~G~~~~vs---~d~~~i~--~AD~liL-PGVGaf~~am~~L~~~gl~~~i~~~~~~~k 75 (204) T COG0118 2 MMVAIIDYGSGNLRSVKKALERLGAEVVVS---RDPEEIL--KADKLIL-PGVGAFGAAMANLRERGLIEAIKEAVESGK 75 (204) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCEEEEE---CCHHHHH--HCCEEEE-CCCCCHHHHHHHHHHCCHHHHHHHHHHCCC T ss_conf 789999728635999999999759806981---6988971--0888786-697888999998886366999999975599 Q ss_pred CEEEEECHHHHHHHHC-------C-----CCCCC-----CCCCCCCCCC-------CCEE-CCC-CEEEEECCCCCEEEE Q ss_conf 3478610058878752-------8-----54466-----7767521011-------2347-789-709996145761570 Q gi|254780420|r 279 PMFGICLGHQLLGLAL-------G-----AQTVK-----MYQGHHGVNH-------PVKN-ISN-GTVEIVSMNHGFAID 332 (396) Q Consensus 279 PilGIClGhQ~l~~a~-------G-----~~~~k-----l~~GhrG~N~-------Pv~~-~~t-~kv~iTsqNHg~~v~ 332 (396) |+||||||+|+|.... | +++.| +|..|-|.|+ |... +.. .++|+. |+|.+. T Consensus 76 P~LGIClGMQlLfe~SeE~~~~~GLg~i~G~V~r~~~~~~kvPHMGWN~l~~~~~~~l~~gi~~~~~~YFV---HSY~~~ 152 (204) T COG0118 76 PFLGICLGMQLLFERSEEGGGVKGLGLIPGKVVRFPAEDLKVPHMGWNQVEFVRGHPLFKGIPDGAYFYFV---HSYYVP 152 (204) T ss_pred CEEEEEHHHHHHHHCCCCCCCCCCCCEECCEEEECCCCCCCCCCCCCCEEECCCCCHHHCCCCCCCEEEEE---EEEEEC T ss_conf 77998175777664154368988702130168876888887886665223126787443289988779999---787615 Q ss_pred HHHCCCCCEEEEEECCCC-EEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 343699828999988998-087268679977898348788776700168999999999 Q gi|254780420|r 333 ASSLPAGLEETHISLFDS-SNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 333 ~~slp~~~~v~~~~~nD~-tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) + ...+-.+...+ ... -...+. +.++|++|||||-|. .=+..|..+|++++ T Consensus 153 ~--~~~~~v~~~~~-YG~~f~AaV~--k~N~~g~QFHPEKSg--~~Gl~lL~NFl~~~ 203 (204) T COG0118 153 P--GNPETVVATTD-YGEPFPAAVA--KDNVFGTQFHPEKSG--KAGLKLLKNFLEWI 203 (204) T ss_pred C--CCCCEEEEECC-CCCEEEEEEE--ECCEEEEECCCCCCH--HHHHHHHHHHHHHC T ss_conf 7--99861888646-8983589997--088999923863113--87899999998444 No 70 >COG0504 PyrG CTP synthase (UTP-ammonia lyase) [Nucleotide transport and metabolism] Probab=99.65 E-value=3.3e-15 Score=127.72 Aligned_cols=265 Identities=18% Similarity=0.295 Sum_probs=164.6 Q ss_pred CHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCCHHHCCCCCCCEECCC------------------- Q ss_conf 7689999998539842366327654589789888774158874321011013333211168------------------- Q gi|254780420|r 127 DTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGAELAKHVTVSQRRDWSE------------------- 187 (396) Q Consensus 127 DTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~dL~~~Vs~~~~~~~~~------------------- 187 (396) -|-.=+|.||.-|.+.-+|..-.+...+ ++.++++.-+........++....+..|.... T Consensus 189 PTQhSVkeLR~iGI~PDiii~Rs~~~l~-~~~~~KIAlfc~V~~~~Vi~~~Dv~siY~vPl~l~~qgl~~~i~~~l~l~~ 267 (533) T COG0504 189 PTQHSVKELRSIGIQPDILICRSERPLP-EEERRKIALFCNVPEEAVISAPDVESIYEVPLLLEKQGLDDYILERLNLNA 267 (533) T ss_pred CCHHHHHHHHHCCCCCCEEEEECCCCCC-HHHHHHHHHHCCCCHHHEEECCCHHHHHHHHHHHHHCCHHHHHHHHHCCCC T ss_conf 8427789999658996659994588799-899999998638988885734667888771899987532789999858998 Q ss_pred ---CCCCCCCCCCCCCCCCCCCEEEEEECC-----CCCHHHHHHHH----HCCCCEEEECCC-C-----HHHHHHCCCCE Q ss_conf ---653445434434677777149999679-----64102322222----024541330465-7-----44644135788 Q gi|254780420|r 188 ---KIWKWGEETSFLKSSDAKYHVVCIDYG-----IRSNLLRVLSS----LGCRITIVGAET-S-----YKDIMSLQPDG 249 (396) Q Consensus 188 ---~~~~~~~~~~~~~~~~~~~kIvvIDfG-----iK~nIlr~L~~----~g~~v~Vvp~~~-~-----~~~i~~~~pdg 249 (396) ....|..-.....++..+.+|+++-== +=.+++..|.. .+++|.+.--++ + .+++..+ .|| T Consensus 268 ~~~dl~~W~~~v~~i~~~~~~v~IalVGKYv~l~DaY~Sv~EAL~hag~~~~~~v~i~wIdse~le~~~~~~~~~~-~dg 346 (533) T COG0504 268 PEPDLSEWKDLVDKIKNPKKEVTIALVGKYVELPDAYKSVIEALKHAGIALGVKVNIKWIDSEDLEEENAAELEKL-VDG 346 (533) T ss_pred CCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHC-CCE T ss_conf 8755677999999852877726999987675860679999999995076419840269875123665444556631-887 Q ss_pred EEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCC-----CCCCCCEECCCCEEE--- Q ss_conf 998588888410112110122210257863478610058878752854466776752-----101123477897099--- Q gi|254780420|r 250 VLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHH-----GVNHPVKNISNGTVE--- 321 (396) Q Consensus 250 i~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~Ghr-----G~N~Pv~~~~t~kv~--- 321 (396) |++-+|=|.-.... -+..++..-+.++|.||||||+|+....+--.+..|+-.|- -.-+||.++...... T Consensus 347 IlVPGGFG~RG~eG--kI~Ai~yAREn~iP~lGIClGmQ~aviE~ARnv~Gl~~AnS~Efdp~t~~pVv~l~~eq~~~~~ 424 (533) T COG0504 347 ILVPGGFGYRGVEG--KIAAIRYARENNIPFLGICLGMQLAVIEFARNVLGLEGANSTEFDPDTKYPVVDLMPEQKDVVD 424 (533) T ss_pred EEECCCCCCCCHHH--HHHHHHHHHHCCCCEEEECHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCC T ss_conf 99689877675588--9999999986699868885667899999999851976576435488999865883354236776 Q ss_pred -------------EE----------------CCCCCEEEEHHH---C-CCCCEEEEEECCCCEEEEEEECCCCEE-EECC Q ss_conf -------------96----------------145761570343---6-998289999889980872686799778-9834 Q gi|254780420|r 322 -------------IV----------------SMNHGFAIDASS---L-PAGLEETHISLFDSSNCGFRLIDQPVF-SVQY 367 (396) Q Consensus 322 -------------iT----------------sqNHg~~v~~~s---l-p~~~~v~~~~~nD~tiegi~~~~~~~~-~vQf 367 (396) +. -+.|-|.|.++- + ..+|.+..++.+.+.+|-++.++.|+| ++|| T Consensus 425 lGGTmRLG~y~~~l~~gT~a~~lY~~~~v~ERHRHRYEvN~~y~~~le~~Gl~~sg~s~d~~lvEivE~~~hpfFv~~Qf 504 (533) T COG0504 425 LGGTMRLGAYPCRLKPGTLAAKLYGKDEIYERHRHRYEVNNDYRDQLEKAGLVFSGTSPDGGLVEIVELPDHPFFVATQF 504 (533) T ss_pred CCCEEECCCEEEECCCCCHHHHHHCCCCEEEECCCHHHCCHHHHHHHHHCCEEEEEECCCCCEEEEEECCCCCEEEEECC T ss_conf 77500416610112788689987387714002153133088999999868969999868998689998389964898746 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 8788776700168999999999973578 Q gi|254780420|r 368 HPESSPGPQDSYYLFSTFIDFMHKRKTM 395 (396) Q Consensus 368 HPEa~PGP~D~~~~F~~F~~~i~~~k~~ 395 (396) |||=..-|.+.+.+|..|++...+++.. T Consensus 505 HPEf~SrP~~phPlf~~fv~Aa~~~~~~ 532 (533) T COG0504 505 HPEFKSRPLRPHPLFVGFVKAALEYKKD 532 (533) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHCC T ss_conf 6313478999983479999999975317 No 71 >TIGR01855 IMP_synth_hisH imidazole glycerol phosphate synthase, glutamine amidotransferase subunit; InterPro: IPR010139 Imidazole glycerol phosphate synthetase (IGPS) is a key metabolic enzyme, which links amino acid and nucleotide biosynthesis: it catalyses the closure of the imidazole ring within histidine biosynthesis (fifth step), and provides the substrate for de novo purine biosynthesis. IGPS consists of two different subunits: HisH, a glutamine amidotransferase (glutaminase), and HisF, a synthase (cyclase). HisH functions to provide a source of nitrogen, which is required for the synthesis of histidine and purines. In the HisH glutaminase reaction, the hydrolysis of glutamine yields ammonia, which is then used by HisF in the subsequent synthase reaction. The X-ray structure of the HisH/HisF heterodimer of IGPS reveals a putative tunnel for the transfer of ammonia from HisH to HisF via a (beta-alpha)8 barrel fold within HisF that abuts HisH . Ammonia tunnels connect the glutaminase and synthase active sites. HisH belongs to a large group of enzymes found in diverse and fundamental anabolic pathways. These enzymes share a common domain, referred to as the type-I glutamine amidotransferase (GATase) domain, which can occur either as single polypeptides (as with HisH) or as domains of large multifunctional proteins.; GO: 0016763 transferase activity transferring pentosyl groups, 0000105 histidine biosynthetic process, 0005737 cytoplasm. Probab=99.65 E-value=5.1e-16 Score=133.49 Aligned_cols=167 Identities=27% Similarity=0.455 Sum_probs=107.5 Q ss_pred EEEEECCCC--CH----HHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCC-CCCCCC---C-CHHHHHHHCCC Q ss_conf 999967964--10----2322222024541330465744644135788998588888-410112---1-10122210257 Q gi|254780420|r 208 VVCIDYGIR--SN----LLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGD-PAVTSV---Y-SSPIICKLVDS 276 (396) Q Consensus 208 IvvIDfGiK--~n----Ilr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGd-P~~~~~---~-~~~~~~~~~~~ 276 (396) ||+||+|+= .| +.+.|.+.|++..|+.-+ ++++. ..|.||| ||=|. ++..+. . ..+++...... T Consensus 1 i~viDYG~GNL~SlyrGV~~Al~~~G~~~~v~~~~--y~~~~--~aD~l~L-PGVG~F~~aM~~l~~~~~~~~~~~~~~~ 75 (211) T TIGR01855 1 IVVIDYGVGNLRSLYRGVKRALKRVGAEPVVVKDS--YKELE--KADKLIL-PGVGAFKAAMARLRELGLLDLLELVVKK 75 (211) T ss_pred CEEEECCCHHHHHHHHHHHHHHHHCCCEEEEECCC--CHHHH--CCCEEEE-CCCCCHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 97874371656889999999998449817996286--03431--0686785-7858889999998850058788999961 Q ss_pred CCCEEEEECHHHHHHHHC---CC----------CCC---------CCCCCCCCCC--CCCEEC---CC-----CEEEEEC Q ss_conf 863478610058878752---85----------446---------6776752101--123477---89-----7099961 Q gi|254780420|r 277 GMPMFGICLGHQLLGLAL---GA----------QTV---------KMYQGHHGVN--HPVKNI---SN-----GTVEIVS 324 (396) Q Consensus 277 ~iPilGIClGhQ~l~~a~---G~----------~~~---------kl~~GhrG~N--~Pv~~~---~t-----~kv~iTs 324 (396) ++|+||||||+|||.... |. ++. |+|-+|-|.| +|+++. .. .++|.. T Consensus 76 ~~P~lGiCLGMQLLfe~SeE~g~~~~GLG~i~G~V~k~~~~~~~~~lk~PHmGWN~v~~~~~~C~ll~g~~~~~~~YFV- 154 (211) T TIGR01855 76 KKPVLGICLGMQLLFERSEEGGGEVPGLGLIKGKVVKLEARVSLSKLKVPHMGWNEVEPVKESCPLLKGIDKGAYFYFV- 154 (211) T ss_pred CCCEEEEEHHHHHHHCCCCCCCCCCCCCEECCCEEEEECCCCCCCCCCCCCCCCCEEEEHHHCCCHHCCCCCCCCEEEE- T ss_conf 8968999802343001562437886653112761342056778872545756564231231155113077667815568- Q ss_pred CCCCEEEEHHHCCC-CCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 45761570343699-82899998899808726867997789834878877670016899999999 Q gi|254780420|r 325 MNHGFAIDASSLPA-GLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 325 qNHg~~v~~~slp~-~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) |+|.+...+ . +...+..+---.-.+.++ ...++|+|||||=|. +=+..|+++|+++ T Consensus 155 --HSY~~~~~~--~S~~v~a~~~YG~~F~AaV~--~~n~~G~QFHPEKS~--~~G~~lL~nf~~~ 211 (211) T TIGR01855 155 --HSYYAVCEE--ESEAVLAKADYGEKFTAAVQ--KDNIFGTQFHPEKSG--KTGLKLLKNFLEL 211 (211) T ss_pred --EEEEEECCC--CHHHEEEEHHCCCCEEEEEE--CCCEEEEEECCHHHH--HHHHHHHHHHHCC T ss_conf --866776168--81150112001863044665--487899972856888--8999999998529 No 72 >cd01747 GATase1_Glutamyl_Hydrolase Type 1 glutamine amidotransferase (GATase1) domain found in gamma-Glutamyl Hydrolase. Type 1 glutamine amidotransferase (GATase1) domain found in gamma-Glutamyl Hydrolase. gamma-Glutamyl Hydrolase catalyzes the cleavage of the gamma-glutamyl chain of folylpoly-gamma-glutamyl substrates and is a central enzyme in folyl and antifolyl poly-gamma-glutamate metabolism. GATase activity involves the removal of the ammonia group from a glutamate molecule and its subsequent transfer to a specific substrate, thus creating a new carbon-nitrogen group on the substrate. gamma-Glutamyl hydrolases belong to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site. Probab=99.63 E-value=5.7e-15 Score=126.06 Aligned_cols=177 Identities=19% Similarity=0.304 Sum_probs=108.5 Q ss_pred CHHHHHHHHHCCCCEEEECCCCHHHHHHC--CCCEEEECCCCCCC--CCCCCC---CHHHHHHHCC-CC-CCEEEEECHH Q ss_conf 10232222202454133046574464413--57889985888884--101121---1012221025-78-6347861005 Q gi|254780420|r 217 SNLLRVLSSLGCRITIVGAETSYKDIMSL--QPDGVLLSNGPGDP--AVTSVY---SSPIICKLVD-SG-MPMFGICLGH 287 (396) Q Consensus 217 ~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~--~pdgi~lSnGPGdP--~~~~~~---~~~~~~~~~~-~~-iPilGIClGh 287 (396) .+=++.|..-|++|+-+|++.+.+++..+ +-.||++-+|--|= ...... +.+...+.-+ +. -||+|+|||+ T Consensus 23 ASYVK~lEsaGArVVPI~~~~~~~~~~~l~~~INGvLfpGG~~~~~~s~y~~~~~~i~~~A~e~Nd~G~yFPiwGtCLGF 102 (273) T cd01747 23 ASYVKFLESAGARVVPIWINESEEYYDKLFKSINGILFPGGAVDIDTSGYARTAKIIYNLALERNDAGDYFPVWGTCLGF 102 (273) T ss_pred HHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHCCEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEHHHH T ss_conf 89899999789869987469998999999875467996786620566708999999999999985089956333330309 Q ss_pred HHHHHHCCCCCCCC-CCCCCCCCCCCEE---CCCCEE---------------EEECCCCCEEEEHHHC------CCCCEE Q ss_conf 88787528544667-7675210112347---789709---------------9961457615703436------998289 Q gi|254780420|r 288 QLLGLALGAQTVKM-YQGHHGVNHPVKN---ISNGTV---------------EIVSMNHGFAIDASSL------PAGLEE 342 (396) Q Consensus 288 Q~l~~a~G~~~~kl-~~GhrG~N~Pv~~---~~t~kv---------------~iTsqNHg~~v~~~sl------p~~~~v 342 (396) |+|.....++..-+ .+.......|..- ....|. -+|.+||.|.|.++++ .+-+++ T Consensus 103 elL~~~~s~~~~~l~~~~s~n~~~pL~ft~~~~~Srlf~~~~~~l~~~l~~~~~t~n~H~~gitp~~f~~~~~L~~~f~i 182 (273) T cd01747 103 ELLTYLTSGETLLLEATEATNSALPLNFTEDALQSRLFKRFPPDLLKSLATEPLTMNNHRYGISPENFTENGLLSDFFNV 182 (273) T ss_pred HHHHHHHCCCCCEECCCCCCCCCCCCCCCCCCCCCHHHHCCCHHHHHHHHHCCCCHHHHCCCCCHHHHHHHHHHHHHHEE T ss_conf 99999973996123102677733010144676446376519999999985076004330541678887456888744265 Q ss_pred EEEECC-CCE--EEEEEECCCCEEEECCCCCCCC--------CCCC------HHHHHHHHHHHHHHHC Q ss_conf 999889-980--8726867997789834878877--------6700------1689999999999735 Q gi|254780420|r 343 THISLF-DSS--NCGFRLIDQPVFSVQYHPESSP--------GPQD------SYYLFSTFIDFMHKRK 393 (396) Q Consensus 343 ~~~~~n-D~t--iegi~~~~~~~~~vQfHPEa~P--------GP~D------~~~~F~~F~~~i~~~k 393 (396) +..|.. +|. |--++|+++|+||||||||-.+ =||+ +.|+=+-|++.-.+.+ T Consensus 183 lstn~D~~g~eFVStiE~k~YP~yg~QfHPEK~~FEw~~~~~i~Hs~~ai~~sq~~a~fFV~eaRkn~ 250 (273) T cd01747 183 LTTNDDWNGVEFISTVEAYKYPIYGVQWHPEKNAFEWKKSSSIPHSEEAIRLTQYFANFFVNEARKSN 250 (273) T ss_pred EEEEECCCCCEEEEEEECCCCCEEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 68776599988988630324764667406875765557767899899999999999999999996389 No 73 >PRK13147 consensus Probab=99.62 E-value=1.1e-14 Score=124.09 Aligned_cols=164 Identities=22% Similarity=0.329 Sum_probs=96.5 Q ss_pred EEEEEECCCC--CHHHHHHHH------HCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCC--C----CCC--CCHHHH Q ss_conf 4999967964--102322222------02454133046574464413578899858888841--0----112--110122 Q gi|254780420|r 207 HVVCIDYGIR--SNLLRVLSS------LGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPA--V----TSV--YSSPII 270 (396) Q Consensus 207 kIvvIDfGiK--~nIlr~L~~------~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~--~----~~~--~~~~~~ 270 (396) ||++||+|+= .|+.|.|.. .++++.+. .+.+++. +.|.|+| ||-=+ . ... ...+.+ T Consensus 2 kI~IIDyg~GNi~Sv~~al~~~~~~~gl~~~~~i~---~~~~~i~--~~d~lIL---PGVGsF~~~m~~L~~~~~l~~~i 73 (211) T PRK13147 2 RVVVIDYNGGNLASAARALEKAATRKGIDATVVIS---ADPTDVR--NADRIVL---PGQGAFADCAQGLAAIPGLRNAI 73 (211) T ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEE---CCHHHHH--HCCEEEE---CCCCCHHHHHHHHHHCCCHHHHH T ss_conf 89999379436999999999887535888439996---9999996--4997898---98664899999988635859999 Q ss_pred HHHCCCCCCEEEEECHHHHHHHHC---C---------CCCCCC-----CCCCCCCCC--------CCEE-CC-CCEEEEE Q ss_conf 210257863478610058878752---8---------544667-----767521011--------2347-78-9709996 Q gi|254780420|r 271 CKLVDSGMPMFGICLGHQLLGLAL---G---------AQTVKM-----YQGHHGVNH--------PVKN-IS-NGTVEIV 323 (396) Q Consensus 271 ~~~~~~~iPilGIClGhQ~l~~a~---G---------~~~~kl-----~~GhrG~N~--------Pv~~-~~-t~kv~iT 323 (396) +..+..+.||||||||+|+|+... | +++.|+ +..|-|.|. |..+ +. ..+.|.. T Consensus 74 ~~~~~~~~piLGICLGMQlL~~~s~E~g~~~GLgli~G~v~kl~~~~~kvPhiGWn~i~~~~~~~~~~~~~~~~~~fYFv 153 (211) T PRK13147 74 ETATAAGTPFLGICVGMQLMAERGLEHERTPGFGWISGEIARMDAPGLRLPHMGWNTLDFTPGAHPLTDGLIPGDHGYFV 153 (211) T ss_pred HHHHHCCCCEEEECHHHHHHCCCCCCCCCCCCEEEEEEEEEECCCCCCCCCEECEEEEECCCCCCHHHCCCCCCCEEEEE T ss_conf 99997699879983675752164653798774007702899898767888762526887057866432476768869996 Q ss_pred CCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 1457615703436998289999889980872686799778983487887767001689999999 Q gi|254780420|r 324 SMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 324 sqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) |+|++.+.. +.. .+...+-.+.-+..+. ...++|+|||||=|. ..+..|+++|+. T Consensus 154 ---HSY~~~~~~-~~~-~~~~~~yg~~~~s~v~--k~Ni~G~QFHPEKS~--~~Gl~ll~nFlk 208 (211) T PRK13147 154 ---HSYALRDGA-DSD-LVATTQYGGQVPAIVA--RGNRAGTQFHVEKSQ--EVGLTILGNFLR 208 (211) T ss_pred ---CCCCCCCCC-CCC-EEEEEECCCEEEEEEE--ECCEEEEECCCHHCC--HHHHHHHHHHHC T ss_conf ---123406888-876-8999989998999999--899999968800146--868999999867 No 74 >PRK06186 hypothetical protein; Validated Probab=99.55 E-value=5.3e-14 Score=119.25 Aligned_cols=141 Identities=16% Similarity=0.196 Sum_probs=99.0 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCC-----CCCCCCCEECCC--- Q ss_conf 578899858888841011211012221025786347861005887875285446677675-----210112347789--- Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGH-----HGVNHPVKNISN--- 317 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~Gh-----rG~N~Pv~~~~t--- 317 (396) +.|||++-+|=|.=.. ...+..++..-+.+||.||||||+|+....+--.+..++-.+ --..|||..... T Consensus 53 ~~dgilv~pGfG~RG~--eGki~Ai~yARen~iP~LGICLGmQ~avIEfARnvlgl~dAnS~Efd~~~~~pvi~~l~~~~ 130 (229) T PRK06186 53 GFDGIWCVPGSPYRNE--DGALTAIRFARENGIPFLGTCGGFQHALLEYARNVLGWADAEHAETDPEASRPVIAPLSCSL 130 (229) T ss_pred CCCEEEECCCCCCCCH--HHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCEEEECCCCCH T ss_conf 5998998998777656--38999999998769987864277899999999986599888764557899971893688732 Q ss_pred ----CEEEEEC----------------CCCCEEEEHHH---C-CCCCEEEEEECCCCEEEEEEECCCCEE-EECCCCCCC Q ss_conf ----7099961----------------45761570343---6-998289999889980872686799778-983487887 Q gi|254780420|r 318 ----GTVEIVS----------------MNHGFAIDASS---L-PAGLEETHISLFDSSNCGFRLIDQPVF-SVQYHPESS 372 (396) Q Consensus 318 ----~kv~iTs----------------qNHg~~v~~~s---l-p~~~~v~~~~~nD~tiegi~~~~~~~~-~vQfHPEa~ 372 (396) .++.++. +.|-|.|.++= | ..+|.++.++.. +.+|-++.++.|+| |||||||=. T Consensus 131 ~~~~~~~~l~~~s~~~~iYg~~~i~ERHRHRYEvN~~y~~~le~~Gl~~sG~~~~-~~veiiEl~~Hpffvg~QfHPEf~ 209 (229) T PRK06186 131 VEATAAIRLRPGSLIARAYGTDEIEEGYHCRYGVNPEFAAALESGALRVSGWDEE-GDVRAVELPGHPFFVATLFQPERA 209 (229) T ss_pred HCCCCCEEECCCCHHHHHHCCCHHHHHHCCCCCCCHHHHHHHHHCCCEEEEECCC-CCEEEEEECCCCCEEEECCCCCCC T ss_conf 0387515748998399986965301332345546989999999789899998899-988999928998589965875456 Q ss_pred CCCCCHHHHHHHHHHHH Q ss_conf 76700168999999999 Q gi|254780420|r 373 PGPQDSYYLFSTFIDFM 389 (396) Q Consensus 373 PGP~D~~~~F~~F~~~i 389 (396) .=|...+.+|..|+.-- T Consensus 210 Srp~~phPlF~~Fi~AA 226 (229) T PRK06186 210 ALAGRLPPLVRAFLRAC 226 (229) T ss_pred CCCCCCCCCHHHHHHHH T ss_conf 89999881599999998 No 75 >cd01746 GATase1_CTP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in Cytidine Triphosphate Synthetase. Type 1 glutamine amidotransferase (GATase1) domain found in Cytidine Triphosphate Synthetase (CTP). CTP is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. CTPs produce CTP from UTP and glutamine and regulate intracellular CTP levels through interactions with four ribonucleotide triphosphates. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. CTP is derived form UTP in three separate steps involving two active sites. In one active site, the UTP O4 oxygen is activated by Mg-ATP-dependent phosphorylation, followed by displacement of the resulting 4-phosphate moiety by ammonia. At a separate site, ammonia is generated via rate limiting glutamine hydrolysis (glutaminase) activity. A gated channel that spans between the glutamine hydrolysis and amidoligase active sites provides a path for ammonia diffusion. CTPs belong to th Probab=99.55 E-value=2.4e-14 Score=121.70 Aligned_cols=144 Identities=25% Similarity=0.394 Sum_probs=101.0 Q ss_pred HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCC-----CCCCCCCEE Q ss_conf 464413578899858888841011211012221025786347861005887875285446677675-----210112347 Q gi|254780420|r 240 KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGH-----HGVNHPVKN 314 (396) Q Consensus 240 ~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~Gh-----rG~N~Pv~~ 314 (396) .+.++ +.|||++.+|=|.=.. ...+..++..-+.++|.||||||+|++...+--.+.+++-.+ --..|||.. T Consensus 50 ~~~L~-~~dgilvpgGFG~rG~--eGki~Ai~yARen~iPfLGIClGmQ~aviE~ARnvlg~~~A~s~E~~~~~~~~vi~ 126 (235) T cd01746 50 EEALK-GADGILVPGGFGIRGV--EGKILAIKYARENNIPFLGICLGMQLAVIEFARNVLGLPDANSTEFDPDTPHPVVD 126 (235) T ss_pred HHHHH-CCCCEEECCCCCCCCH--HHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCEEE T ss_conf 99973-0697895788887764--58899999999739972544103432268999875299776742247999987799 Q ss_pred CCCC----------------EEEEE----------------CCCCCEEEEHHH---C-CCCCEEEEEECCCCEEEEEEEC Q ss_conf 7897----------------09996----------------145761570343---6-9982899998899808726867 Q gi|254780420|r 315 ISNG----------------TVEIV----------------SMNHGFAIDASS---L-PAGLEETHISLFDSSNCGFRLI 358 (396) Q Consensus 315 ~~t~----------------kv~iT----------------sqNHg~~v~~~s---l-p~~~~v~~~~~nD~tiegi~~~ 358 (396) .... .+.+. -+.|-|.|.++= | ..++.++.++.+++.+|.++.+ T Consensus 127 ~~~~~~~~~~~GgTmRLG~~~~~l~~~s~~~~~y~~~~i~ERHRHRYevN~~y~~~le~~Gl~~sG~~~~~~~veiiEl~ 206 (235) T cd01746 127 LMPEQKGVKDLGGTMRLGAYPVILKPGTLAHKYYGKDEVEERHRHRYEVNPEYVDELEEAGLRFSGTDPDGGLVEIVELP 206 (235) T ss_pred ECHHHCCCHHCCCCEEEEEEEEEECCCHHHHHHCCCCEEEEEECCCEEECHHHHHHHHHCCCEEEEECCCCCEEEEEECC T ss_conf 67566164001341342025776224306667518832312302355678999999986896899998999979999868 Q ss_pred CCCEE-EECCCCCCCCCCCCHHHHHHHHH Q ss_conf 99778-98348788776700168999999 Q gi|254780420|r 359 DQPVF-SVQYHPESSPGPQDSYYLFSTFI 386 (396) Q Consensus 359 ~~~~~-~vQfHPEa~PGP~D~~~~F~~F~ 386 (396) +.|+| |||||||=..=|...+-||..|+ T Consensus 207 ~Hpffvg~QfHPEf~Srp~~phPlF~~fi 235 (235) T cd01746 207 DHPFFVGTQFHPEFKSRPLKPHPLFVGFV 235 (235) T ss_pred CCCCEEEECCCCCCCCCCCCCCCCHHHHC T ss_conf 99847887577644689999993836439 No 76 >PRK13142 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=99.54 E-value=6.9e-14 Score=118.48 Aligned_cols=161 Identities=25% Similarity=0.347 Sum_probs=97.0 Q ss_pred EEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCC-----CHHHHHHHCCCCCCE Q ss_conf 999967964--1023222220245413304657446441357889985888884101121-----101222102578634 Q gi|254780420|r 208 VVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVY-----SSPIICKLVDSGMPM 280 (396) Q Consensus 208 IvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~-----~~~~~~~~~~~~iPi 280 (396) |++||+|+= .|+.+.|...|+++.+. .+.+++. +.|.|+|- |=|.=..+-.. ....+++. .+.|| T Consensus 2 I~IIDyG~GNi~Sv~~al~~lg~~~~i~---~~~~~i~--~~d~lILP-GVGsF~~am~~L~~~~l~~~i~~~--~~kpi 73 (192) T PRK13142 2 IVIVDYGLGNISNVKRAIEHLGYEVVVS---NTSKIID--QAETIILP-GVGHFKDAMSEIKRLNLNAILAKN--TDKKM 73 (192) T ss_pred EEEEECCCHHHHHHHHHHHHCCCCEEEE---CCHHHHH--HCCEEEEC-CCCCHHHHHHHHHHCCHHHHHHHH--CCCCE T ss_conf 8999769458999999999859987996---8999984--37958975-768789999998875859999986--28985 Q ss_pred EEEECHHHHHHHHC------C-----CCCCCC----CCCCCCCCCCCEE--CCCCEEEEECCCCCEEEEHHHCCCCCEEE Q ss_conf 78610058878752------8-----544667----7675210112347--78970999614576157034369982899 Q gi|254780420|r 281 FGICLGHQLLGLAL------G-----AQTVKM----YQGHHGVNHPVKN--ISNGTVEIVSMNHGFAIDASSLPAGLEET 343 (396) Q Consensus 281 lGIClGhQ~l~~a~------G-----~~~~kl----~~GhrG~N~Pv~~--~~t~kv~iTsqNHg~~v~~~slp~~~~v~ 343 (396) ||||||+|+|.... | +++.|+ +..|-|.|.-... +.....|.. |+|.+..+ +-.+. T Consensus 74 LGICLGMQlL~~~SeEg~~~GLgli~G~V~kl~~~~~vPhmGWn~v~~~~~~~~~~fYFv---HSY~~~~~----~~~~a 146 (192) T PRK13142 74 IGICLGMQLMYEHSDEGDASGLGFIPGNISRIQTEYPVPHLGWNNLVSKHPMLNQDVYFV---HSYQAPMS----ENVIA 146 (192) T ss_pred EEEECCEEEEECCCCCCCCCCCCEEEEEEEEECCCCCCCEECCHHHCCCCCCCCCCEEEE---EEEECCCC----CCEEE T ss_conf 896501014311455567677577868999836899786658156253385423644799---87652787----57899 Q ss_pred EEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 99889980872686799778983487887767001689999999 Q gi|254780420|r 344 HISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 344 ~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) ..+- ...+-++. ...++||+|||||-|. ..+..|.++|++ T Consensus 147 ~t~y-g~~f~s~v-~~~Ni~G~QFHPEKS~--~~Gl~lLknFl~ 186 (192) T PRK13142 147 YAQY-GADIPAIV-QFNNYIGIQFHPEKSG--TYGLQILRQAIQ 186 (192) T ss_pred EEEC-CCEEEEEE-EECCEEEEECCHHHCC--HHHHHHHHHHHC T ss_conf 9988-99699999-9599999958812355--858999999963 No 77 >pfam01174 SNO SNO glutamine amidotransferase family. This family and its amidotransferase domain was first described in. It is predicted that members of this family are involved in the pyridoxine biosynthetic pathway, based on the proximity and co-regulation of the corresponding genes and physical interaction between the members of pfam01174 and pfam01680. Probab=99.54 E-value=1.2e-13 Score=116.66 Aligned_cols=157 Identities=17% Similarity=0.261 Sum_probs=87.6 Q ss_pred CCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCC-----CHHHHHHH-CCCCCCEEEEECHHHH Q ss_conf 41023222220245413304657446441357889985888884101121-----10122210-2578634786100588 Q gi|254780420|r 216 RSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVY-----SSPIICKL-VDSGMPMFGICLGHQL 289 (396) Q Consensus 216 K~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~-----~~~~~~~~-~~~~iPilGIClGhQ~ 289 (396) =++..+.|.+.|++++++ .+.++|.+ .|+|||- | |-...+... ..+.+++. .++++|+||||||+|+ T Consensus 8 l~sv~nal~~lG~e~~~v---~~p~di~~--ad~LILP-G-g~s~~~~~ll~~~gl~~~l~e~i~~~~kPiLGIClGMqL 80 (188) T pfam01174 8 VEEHEEAIKKCGAENKTV---KRPEDLAQ--CDALIIP-G-GESTAMSLLAKRYGFYEPLYEFVHNPNKPIWGTCAGLIL 80 (188) T ss_pred HHHHHHHHHHCCCCEEEE---CCHHHHHC--CCEEEEC-C-CCHHHHHHHHHHCCCHHHHHHHHHHCCCCEEEHHHHHHH T ss_conf 999999999869987996---89999940--8989988-9-887999999998688899999998079966333144642 Q ss_pred HHHHCCC-CC----------CCCCCCCCC----CCCCCEECCCC-EEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEE Q ss_conf 7875285-44----------667767521----01123477897-09996145761570343699828999988998087 Q gi|254780420|r 290 LGLALGA-QT----------VKMYQGHHG----VNHPVKNISNG-TVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNC 353 (396) Q Consensus 290 l~~a~G~-~~----------~kl~~GhrG----~N~Pv~~~~t~-kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tie 353 (396) |+..... .+ .|..+|..- .+.+..++... ..|.. |+|.+... +.+-.++...-.++.+. T Consensus 81 L~~~~e~g~~~gLGli~~~V~r~~~g~~~~~~~~~~~~~~~~~~~~~yFv---hs~~~~~~--~~~~~v~~~~~~~~~~v 155 (188) T pfam01174 81 LSKQLGNELVKTLGLLKVTVKRNAFGRQVDSFEKECDFKNLIPKFPGVFI---RAPVIEEI--LDPEVVVVLYELDGKIV 155 (188) T ss_pred HHCCCCCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEE---ECCEECCC--CCCCEEEEEEECCCEEE T ss_conf 11024778644246532268765767546554478633268999439998---17745104--99852999995799999 Q ss_pred EEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 268679977898348788776700168999999999 Q gi|254780420|r 354 GFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFM 389 (396) Q Consensus 354 gi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i 389 (396) +++ ..++||+|||||=|. |...+.+.|++-+ T Consensus 156 ~v~--~~Ni~g~QFHPEKS~---d~~~~~~~fl~~~ 186 (188) T pfam01174 156 VAK--QGNILATSFHPELAE---DDYRVHDWFVENF 186 (188) T ss_pred EEE--ECCEEEEECCCEECC---CCCHHHHHHHHHH T ss_conf 999--899999951984579---9788999999996 No 78 >PRK13527 glutamine amidotransferase subunit PdxT; Provisional Probab=99.52 E-value=2.2e-13 Score=114.86 Aligned_cols=154 Identities=18% Similarity=0.285 Sum_probs=88.7 Q ss_pred HHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCC---CCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCC-- Q ss_conf 22222024541330465744644135788998588888410---1121101222102578634786100588787528-- Q gi|254780420|r 221 RVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAV---TSVYSSPIICKLVDSGMPMFGICLGHQLLGLALG-- 295 (396) Q Consensus 221 r~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~---~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G-- 295 (396) +.|...|..+.++.-. +.++|. +.|+++|-+|--.... ......+.+++.++.++|+||||+|+|+|+.... T Consensus 21 kAl~~lg~~~e~~~v~-~p~~l~--~~d~LILPGG~f~~~m~~L~~~gl~~~i~~~i~~g~PiLGIClGmQLL~~~s~e~ 97 (196) T PRK13527 21 RALDELGINGEVVRVR-RPEDLK--DCDALIIPGGESTTIGRLMKRYGILDEIKEKIEEGLPILGTCAGLILLAKEVDDE 97 (196) T ss_pred HHHHHCCCCCEEEEEC-CHHHHH--CCCEEEECCCCHHHHHHHHHHCCCHHHHHHHHHCCCCEEEECHHHHHHHHHCCCC T ss_conf 9999829990599969-989983--1997998996679999998886888999999976997799708799986631567 Q ss_pred ----------CCCCCCCCCCCC--CCCCC--EECCCC-EEEEECCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCC Q ss_conf ----------544667767521--01123--477897-099961457615703436998289999889980872686799 Q gi|254780420|r 296 ----------AQTVKMYQGHHG--VNHPV--KNISNG-TVEIVSMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQ 360 (396) Q Consensus 296 ----------~~~~kl~~GhrG--~N~Pv--~~~~t~-kv~iTsqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~ 360 (396) +++.|..||..= ...|+ ..+... ..|.. |+|.++. .+++.++++.. ++.+.+++ .. T Consensus 98 ~~~~GLG~i~g~V~rn~~g~q~~s~~~~~~~~~~~~~~~~yFV---hs~~~~~--~~~~~~v~a~~--~~~~~aV~--k~ 168 (196) T PRK13527 98 VVQPLLGLMDVTVNRNAFGRQRDSFEAELDLEGLDGPFHAVFI---RAPAIDE--AGGDVEVLAKL--DDRIVAVE--QG 168 (196) T ss_pred CCCCCCCCCCEEEEECCCCCCCCCCCCCEEECCCCCCEEEEEE---EEEEEEC--CCCCEEEEEEE--CCEEEEEE--EC T ss_conf 7767767575489965788745555166441478999689999---7423502--79987999997--99999999--89 Q ss_pred CEEEECCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 778983487887767001689999999999 Q gi|254780420|r 361 PVFSVQYHPESSPGPQDSYYLFSTFIDFMH 390 (396) Q Consensus 361 ~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~ 390 (396) ++|++|||||=|. | ..+...|++++. T Consensus 169 Ni~g~QFHPEkSg---d-~~i~~~Fl~~v~ 194 (196) T PRK13527 169 NVLATAFHPELTD---D-TRIHEYFLKKVV 194 (196) T ss_pred CEEEEECCCCCCC---C-CHHHHHHHHHHC T ss_conf 9999955972238---7-559999999971 No 79 >cd01749 GATase1_PB Glutamine Amidotransferase (GATase_I) involved in pyridoxine biosynthesis. Glutamine Amidotransferase (GATase_I) involved in pyridoxine biosynthesis. Glutamine amidotransferase (GATase) activity involves the removal of the ammonia group from a glutamate molecule and its subsequent transfer to a specific substrate, thus creating a new carbon-nitrogen group on the substrate. This group contains proteins like Bacillus subtilus YaaE and Plasmodium falciparum Pdx2 which are members of the triad glutamine aminotransferase family and function in a pathway for the biosynthesis of vitamin B6. Probab=99.47 E-value=1e-12 Score=110.26 Aligned_cols=161 Identities=20% Similarity=0.346 Sum_probs=89.2 Q ss_pred EEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC----CCHHHHHHHCCCCCCEEEE Q ss_conf 999967964102322222024541330465744644135788998588888410112----1101222102578634786 Q gi|254780420|r 208 VVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV----YSSPIICKLVDSGMPMFGI 283 (396) Q Consensus 208 IvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~----~~~~~~~~~~~~~iPilGI 283 (396) |+.|- |-=++.+|.|.+.|+++.++ .+.++|.+ .|+++|-.| |-.+.... ...+.+++....++|+||| T Consensus 3 vl~lq-Gn~~s~~~Al~~lG~~~~~~---~~p~~i~~--ad~LILPGg-~s~~~~~~L~~~gl~~~i~~~i~~gkP~LGI 75 (183) T cd01749 3 VLALQ-GDFREHIRALERLGVEVIEV---RTPEDLEG--IDGLIIPGG-ESTTIGKLLRRTGLLDPLREFIRAGKPVFGT 75 (183) T ss_pred EEEEC-CCHHHHHHHHHHCCCCEEEE---CCHHHHHH--CCEEEECCC-CHHHHHHHHHHCCCHHHHHHHHHCCCCEEEH T ss_conf 99834-78999999999879979998---99999831--898998894-0899999998878889999999849984751 Q ss_pred ECHHHHHHHHC--C----------CCCCCCCCCC-CC---CCCCCEECCCCEEEEECCCCCEEEEHHHCCCCCEEEEEEC Q ss_conf 10058878752--8----------5446677675-21---0112347789709996145761570343699828999988 Q gi|254780420|r 284 CLGHQLLGLAL--G----------AQTVKMYQGH-HG---VNHPVKNISNGTVEIVSMNHGFAIDASSLPAGLEETHISL 347 (396) Q Consensus 284 ClGhQ~l~~a~--G----------~~~~kl~~Gh-rG---~N~Pv~~~~t~kv~iTsqNHg~~v~~~slp~~~~v~~~~~ 347 (396) |||+|+|+... + +++.|..||- .+ ...++.++.-+.++.. -.|+|.+.. ...+..++... T Consensus 76 ClGMQlL~~~s~E~g~~~GLGli~g~V~rn~~g~q~~sf~~~l~ip~~g~n~~~~~-Fv~sp~~~~--~~~~~~~~~~~- 151 (183) T cd01749 76 CAGLILLAKEVEDQGGQPLLGLLDITVRRNAFGRQVDSFEADLDIPGLGLGPFPAV-FIRAPVIEE--VGPGVEVLAEY- 151 (183) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCCCEE-EECCCEEEE--CCCCEEEEEEE- T ss_conf 07788876523467886763655519963563676665525752786788876669-944864600--59980999998- Q ss_pred CCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHH Q ss_conf 998087268679977898348788776700168999999 Q gi|254780420|r 348 FDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFI 386 (396) Q Consensus 348 nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~ 386 (396) ++.+.++ ...++|++|||||-|. | ..+.++|+ T Consensus 152 -~~~~~aV--~~~Ni~g~QFHPEKSg---d-~~i~~~fl 183 (183) T cd01749 152 -DGKIVAV--RQGNVLATSFHPELTD---D-TRIHEYFL 183 (183) T ss_pred -CCEEEEE--EECCEEEEECCCCCCC---C-CCHHHHHC T ss_conf -9999999--9899999947980159---8-64888639 No 80 >TIGR00337 PyrG CTP synthase; InterPro: IPR004468 CTP synthase is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. The enzyme catalyzes the reaction L-glutamine + H2O + UTP + ATP = CTP + phosphate + ADP + L-glutamate. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. This gene has been found circa 500 bp 5 upstream of enolase in both beta (Nitrosomonas europaea) and gamma (Escherichia coli) subdivisions of Proteobacterium .; GO: 0003883 CTP synthase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=99.44 E-value=5.2e-13 Score=112.29 Aligned_cols=187 Identities=21% Similarity=0.354 Sum_probs=124.9 Q ss_pred CCCCCCCCEEEEEE----CC-CCCHHHHHHHHH----CCC--CEEEEC-CCCHH-------HHHH-----CCCCEEEECC Q ss_conf 46777771499996----79-641023222220----245--413304-65744-------6441-----3578899858 Q gi|254780420|r 199 LKSSDAKYHVVCID----YG-IRSNLLRVLSSL----GCR--ITIVGA-ETSYK-------DIMS-----LQPDGVLLSN 254 (396) Q Consensus 199 ~~~~~~~~kIvvID----fG-iK~nIlr~L~~~----g~~--v~Vvp~-~~~~~-------~i~~-----~~pdgi~lSn 254 (396) ..++...-+|+++= += .=.||...|.-- ++. +.++.. |...+ ++++ ..-|||++-+ T Consensus 302 ~~~~~~~v~Ig~VGKY~eL~DaY~Sv~EAL~Hag~~~d~~v~i~Wi~s~d~e~~~~e~~k~~~l~~~~~~~~~DGILVPG 381 (571) T TIGR00337 302 LINPKEEVTIGLVGKYVELKDAYLSVIEALKHAGAKLDTKVNIKWIDSEDLEEEEQEVDKAELLDGWNELSEVDGILVPG 381 (571) T ss_pred HHCCCCCEEEEEEEECCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECHHCCCCCCCCHHHHHHHHHHHHHHCCCEEEECC T ss_conf 30552516798974002513567889999987677637425689862101320012001355532244543068068368 Q ss_pred CCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCC-----CCCCCC---CEECCCCE------- Q ss_conf 888841011211012221025786347861005887875285446677675-----210112---34778970------- Q gi|254780420|r 255 GPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGH-----HGVNHP---VKNISNGT------- 319 (396) Q Consensus 255 GPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~Gh-----rG~N~P---v~~~~t~k------- 319 (396) |=|+=-.-. -+.+++.+-+.+||.||||||+|+-..-|-=.+..|+-.| .-..+| |.++..-. T Consensus 382 GFG~RG~EG--KI~Ai~yAREN~iPFLGICLGmQ~A~IEFARNV~Gl~~AnStEf~~~t~~p~~~Vv~LlpEq~d~~~~~ 459 (571) T TIGR00337 382 GFGERGVEG--KIRAIKYARENNIPFLGICLGMQLAVIEFARNVLGLKGANSTEFDPETKYPKSPVVDLLPEQKDQNGNK 459 (571) T ss_pred CCCCCCCHH--HHHHHHHHHHCCCCEEEHHHHHHHHHHHHHHHHCCCCCCCHHHCCCCCCCCCCCCEECCCCCCCCCCCE T ss_conf 889766024--789999987636973302177788899997653479888741246455777331114253222446760 Q ss_pred ---EEEEC-----------------------------------CCCCEEEEHHH---C-CCCCEEEEEECCCCEEEEEEE Q ss_conf ---99961-----------------------------------45761570343---6-998289999889980872686 Q gi|254780420|r 320 ---VEIVS-----------------------------------MNHGFAIDASS---L-PAGLEETHISLFDSSNCGFRL 357 (396) Q Consensus 320 ---v~iTs-----------------------------------qNHg~~v~~~s---l-p~~~~v~~~~~nD~tiegi~~ 357 (396) .++.. +-|-|.|-++= + ..+|.++..+.++.-+|-++. T Consensus 460 ~VRt~~~~lGGTMRLG~ypc~l~~~~eWt~a~klYg~~~v~ERHRHRYE~Nn~~~~~~E~~GL~vsG~S~dg~l~EiiE~ 539 (571) T TIGR00337 460 EVRTEIEDLGGTMRLGLYPCILKPGSEWTLAFKLYGKEEVYERHRHRYEVNNEYREELENKGLIVSGTSPDGRLVEIIEL 539 (571) T ss_pred ECCEEECCCCCEEECCCCCEEECCCCCHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHCCCEEEEECCCCCEEEEEEE T ss_conf 01301013872430587221771787232675533786788712444122757898998579589998699838999984 Q ss_pred CCCCEE-EECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 799778-983487887767001689999999 Q gi|254780420|r 358 IDQPVF-SVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 358 ~~~~~~-~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) ++.|+| ++|||||=.-=|.+.+.||.-|+. T Consensus 540 ~~HP~FvA~QFHPEF~Srp~~p~plF~Gfv~ 570 (571) T TIGR00337 540 PDHPFFVACQFHPEFTSRPNRPHPLFLGFVK 570 (571) T ss_pred CCCCEEEEEECCCCCCCCCCCCCCCCHHHHC T ss_conf 7978688862267511677888875055324 No 81 >KOG3179 consensus Probab=99.23 E-value=3.4e-11 Score=99.47 Aligned_cols=136 Identities=23% Similarity=0.279 Sum_probs=89.9 Q ss_pred EEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCC---CHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCC---C Q ss_conf 3304657446441357889985888884101121---101222102578634786100588787528544667767---5 Q gi|254780420|r 232 IVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVY---SSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQG---H 305 (396) Q Consensus 232 Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~---~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~G---h 305 (396) |+....+-++-+. +++|++||+-+-|-..-..- ....++++...++||+|||+|||+||.|.|+++-+-+-| - T Consensus 46 V~~gefP~~~Dl~-ky~gfvIsGS~~dAf~d~dWI~KLcs~~kkld~mkkkvlGICFGHQiiara~Gg~Vgra~KG~~~~ 124 (245) T KOG3179 46 VIDGEFPQEEDLE-KYDGFVISGSKHDAFSDADWIKKLCSFVKKLDFMKKKVLGICFGHQIIARAKGGKVGRAPKGPDLG 124 (245) T ss_pred EECCCCCCHHHHH-HHCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHCCCCCCCCCCCCCC T ss_conf 8548788744653-415499957743333540699999999999875156647882117999886178110278887444 Q ss_pred CCCCCCCEECCCCEEEEE--------CCCCCEEEEHHHCCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCC Q ss_conf 210112347789709996--------1457615703436998289999889980872686799778983487887 Q gi|254780420|r 306 HGVNHPVKNISNGTVEIV--------SMNHGFAIDASSLPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESS 372 (396) Q Consensus 306 rG~N~Pv~~~~t~kv~iT--------sqNHg~~v~~~slp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~ 372 (396) -|.---|++...+.-|.- +|=|...|.+ +|++.++++.+- +--+|++. +...+++.|-|||=+ T Consensus 125 lg~itivk~~~~~~~yFG~~~~~l~IikcHqDevle--~PE~a~llasSe-~ceve~fs-~~~~~l~fQGHPEyn 195 (245) T KOG3179 125 LGSITIVKDAEKPEKYFGEIPKSLNIIKCHQDEVLE--LPEGAELLASSE-KCEVEMFS-IEDHLLCFQGHPEYN 195 (245) T ss_pred CCCEEEEEECCCCHHHCCCCHHHHHHHHHCCCCEEC--CCCHHHHHCCCC-CCCEEEEE-ECCEEEEECCCCHHH T ss_conf 342389985246164326434442187653101433--774454300001-15257898-345379962882343 No 82 >KOG2387 consensus Probab=98.80 E-value=1.5e-08 Score=80.91 Aligned_cols=134 Identities=24% Similarity=0.383 Sum_probs=81.0 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCC------CCCCCCCCCC-EECCCC Q ss_conf 578899858888841011211012221025786347861005887875285446677------6752101123-477897 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMY------QGHHGVNHPV-KNISNG 318 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~------~GhrG~N~Pv-~~~~t~ 318 (396) ..|||++-+|=|+-.... .+...+..-+.++|.||||||+|+-..-|--.+.-|+ |--.. ++|| +.+..+ T Consensus 363 ~adGilvPGGFG~RGveG--~i~Aak~ARen~iP~LGiCLGmQ~AvIEfaRnvLg~~dAnStEF~p~~-~~~vVi~MPE~ 439 (585) T KOG2387 363 SADGILVPGGFGDRGVEG--KILAAKWARENKIPFLGICLGMQLAVIEFARNVLGLKDANSTEFDPET-KNPVVIFMPEH 439 (585) T ss_pred CCCEEEECCCCCCCCHHH--HHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHCCCCCCCCCCCCCC-CCCEEEECCCC T ss_conf 488289678665443057--899999987649975763111269999999986177778732248788-98679987688 Q ss_pred --------------EEEE-E-----------------CCCCCEEEEHHHC----CCCCEEEEEECCCCEEEEEEECCCCE Q ss_conf --------------0999-6-----------------1457615703436----99828999988998087268679977 Q gi|254780420|r 319 --------------TVEI-V-----------------SMNHGFAIDASSL----PAGLEETHISLFDSSNCGFRLIDQPV 362 (396) Q Consensus 319 --------------kv~i-T-----------------sqNHg~~v~~~sl----p~~~~v~~~~~nD~tiegi~~~~~~~ 362 (396) +.+. + -+-|-|.|.++-. ..++.-...+..-.-.|-++.++.|+ T Consensus 440 ~~~~mGgtMRLG~R~t~f~~~~s~~~kLYG~~~~V~ERHRHRyEVNP~~v~~le~~Gl~FvGkd~~g~rmeI~El~~HP~ 519 (585) T KOG2387 440 NKTHMGGTMRLGSRRTVFQDKDSKLRKLYGNVEFVDERHRHRYEVNPEMVKQLEQAGLSFVGKDVTGKRMEIIELESHPF 519 (585) T ss_pred CCCCCCCEEEECCCCEEEECCCHHHHHHHCCCHHHHHHHHCCEECCHHHHHHHHHCCCEEEEECCCCCEEEEEECCCCCC T ss_conf 74444644662464116505715789875795046565411011488999999866968985168885788998079872 Q ss_pred E-EECCCCCCCCCCCCHHHHH Q ss_conf 8-9834878877670016899 Q gi|254780420|r 363 F-SVQYHPESSPGPQDSYYLF 382 (396) Q Consensus 363 ~-~vQfHPEa~PGP~D~~~~F 382 (396) | |+|||||=..-|...+-+| T Consensus 520 fVg~QfHPE~~srp~kpsp~f 540 (585) T KOG2387 520 FVGVQFHPEFKSRPDKPSPLF 540 (585) T ss_pred EEEECCCHHHHCCCCCCCCCH T ss_conf 565313777745899999652 No 83 >KOG1559 consensus Probab=98.77 E-value=2.2e-08 Score=79.70 Aligned_cols=153 Identities=20% Similarity=0.338 Sum_probs=88.9 Q ss_pred CHHHHHHHHHCCCCEEEECCCCHHHHHH--CCCCEEEECCCCCCCCCCCCCCHHHHHHH----CC----C-CCCEEEEEC Q ss_conf 1023222220245413304657446441--35788998588888410112110122210----25----7-863478610 Q gi|254780420|r 217 SNLLRVLSSLGCRITIVGAETSYKDIMS--LQPDGVLLSNGPGDPAVTSVYSSPIICKL----VD----S-GMPMFGICL 285 (396) Q Consensus 217 ~nIlr~L~~~g~~v~Vvp~~~~~~~i~~--~~pdgi~lSnGPGdP~~~~~~~~~~~~~~----~~----~-~iPilGICl 285 (396) .+-++.+..-|++|+-+.+|-+-+.... .--.||++.+| + +...++ .++++++ ++ + ..|+.|||| T Consensus 80 ASYVK~aEsgGARViPli~nepEe~lfqklelvNGviftGG--w-ak~~dY-~~vvkkifnk~le~nDaGehFPvyg~CL 155 (340) T KOG1559 80 ASYVKLAESGGARVIPLIYNEPEEILFQKLELVNGVIFTGG--W-AKRGDY-FEVVKKIFNKVLERNDAGEHFPVYGICL 155 (340) T ss_pred HHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCEEEECCC--C-CCCCCH-HHHHHHHHHHHHHCCCCCCCCCHHHHHH T ss_conf 88999987189558877548808999999987360574174--3-222409-9999999988876067866362124345 Q ss_pred HHHHHHHHCC-C-CC-CC---------CCCCCCCC-------CCCCEECCC-CEEEEECCCCCEEEEHHHCCCC------ Q ss_conf 0588787528-5-44-66---------77675210-------112347789-7099961457615703436998------ Q gi|254780420|r 286 GHQLLGLALG-A-QT-VK---------MYQGHHGV-------NHPVKNISN-GTVEIVSMNHGFAIDASSLPAG------ 339 (396) Q Consensus 286 GhQ~l~~a~G-~-~~-~k---------l~~GhrG~-------N~Pv~~~~t-~kv~iTsqNHg~~v~~~slp~~------ 339 (396) |+.+|.+-.- . .+ ++ |.|--.+. |.|-..+.. ..--.+.|||-|.+.+++...+ T Consensus 156 GFE~lsmiISqnrdile~~d~vd~AssLqF~~nvn~~~t~FQrFPpELLkkL~~dcLvmq~Hk~gisp~nF~~N~~Ls~F 235 (340) T KOG1559 156 GFELLSMIISQNRDILERFDAVDVASSLQFVGNVNIHGTMFQRFPPELLKKLSTDCLVMQNHKFGISPKNFQGNPALSSF 235 (340) T ss_pred HHHHHHHHHHCCHHHHHHHCCCCCCCCEEEECCCCEEEHHHHHCCHHHHHHHCCCHHEEECCCCCCCHHHCCCCHHHHHH T ss_conf 38999999855706877503202135302303554001157548999998735231100033234462304579889978 Q ss_pred CEEEEEECCCCEE---EEEEECCCCEEEECCCCCCCC Q ss_conf 2899998899808---726867997789834878877 Q gi|254780420|r 340 LEETHISLFDSSN---CGFRLIDQPVFSVQYHPESSP 373 (396) Q Consensus 340 ~~v~~~~~nD~ti---egi~~~~~~~~~vQfHPEa~P 373 (396) |.+.-.+..++++ --++.+.+|+.|+|||||-+| T Consensus 236 FnilTT~~D~~~k~fvSTv~~~kYPvtgfQWHPEKna 272 (340) T KOG1559 236 FNILTTCTDGNSKTFVSTVESKKYPVTGFQWHPEKNA 272 (340) T ss_pred HHHEEEECCCCCEEEEEEECCEECCCEEEEECCCCCC T ss_conf 7653330179850798852030124100456675574 No 84 >KOG0623 consensus Probab=98.59 E-value=3.2e-07 Score=71.57 Aligned_cols=167 Identities=26% Similarity=0.377 Sum_probs=96.8 Q ss_pred EEEEECCCC--CHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCC-CCC----CCHHHHHHHCCCCCCE Q ss_conf 999967964--102322222024541330465744644135788998588888410-112----1101222102578634 Q gi|254780420|r 208 VVCIDYGIR--SNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAV-TSV----YSSPIICKLVDSGMPM 280 (396) Q Consensus 208 IvvIDfGiK--~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~-~~~----~~~~~~~~~~~~~iPi 280 (396) |-+||+|.- .+|-+.|+.+|+++..+.. ..+|++ .|-+++ +|-|+-.. ++. ...+-+++.++...|+ T Consensus 4 v~~ld~~agn~~si~nal~hlg~~i~~v~~---P~DI~~--a~rLIf-PGVGnfg~~~D~L~~~Gf~eplr~YiesgkPf 77 (541) T KOG0623 4 VTLLDYGAGNVRSIRNALRHLGFSIKDVQT---PGDILN--ADRLIF-PGVGNFGPAMDVLNRTGFAEPLRKYIESGKPF 77 (541) T ss_pred EEEEECCCCCHHHHHHHHHHCCCEEEECCC---CHHHCC--CCEEEE-CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCE T ss_conf 799841786478899999863836554258---122036--743760-38664127888876432478999998669974 Q ss_pred EEEECHHHHHHHHC-------CCCCC-----C-----CCCCCCCCCCC-C-------EECCCCEEEEECCCCCEEEEH-- Q ss_conf 78610058878752-------85446-----6-----77675210112-3-------477897099961457615703-- Q gi|254780420|r 281 FGICLGHQLLGLAL-------GAQTV-----K-----MYQGHHGVNHP-V-------KNISNGTVEIVSMNHGFAIDA-- 333 (396) Q Consensus 281 lGIClGhQ~l~~a~-------G~~~~-----k-----l~~GhrG~N~P-v-------~~~~t~kv~iTsqNHg~~v~~-- 333 (396) +|||+|.|+|--.. |..+. + -...|-|.|.- | -+....++|+. |+|...+ T Consensus 78 mgicvGlQaLF~gSvE~p~skGLgvipg~v~RFD~s~k~VPhIGWNsc~v~sd~effg~~p~~~~YFV---HSyl~~ek~ 154 (541) T KOG0623 78 MGICVGLQALFDGSVENPPSKGLGVIPGIVGRFDASAKIVPHIGWNSCQVGSDSEFFGDVPNRHVYFV---HSYLNREKP 154 (541) T ss_pred EEEHHHHHHHHCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEE---EEECCCCCC T ss_conf 76314579986364437886764354440110357778577456311545775200256787349998---531133345 Q ss_pred HHCC-CCCEEEEEECC-CCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHH Q ss_conf 4369-98289999889-980872686799778983487887767001689999999 Q gi|254780420|r 334 SSLP-AGLEETHISLF-DSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFID 387 (396) Q Consensus 334 ~slp-~~~~v~~~~~n-D~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~ 387 (396) .+|+ ++|++.-.+-- ..-|..| ...+++++|||||-|. .-+.....+|+. T Consensus 155 ~~len~~wkiat~kYG~E~Fi~ai--~knN~~AtQFHPEKSG--~aGL~vl~~FL~ 206 (541) T KOG0623 155 KSLENKDWKIATCKYGSESFISAI--RKNNVHATQFHPEKSG--EAGLSVLRRFLH 206 (541) T ss_pred CCCCCCCCEEEEECCCCHHHHHHH--HCCCEEEEECCCCCCC--CHHHHHHHHHHH T ss_conf 678987736764046708899998--6076545752654455--011899999985 No 85 >TIGR01737 FGAM_synth_I phosphoribosylformylglycinamidine synthase I; InterPro: IPR010075 In some species, phosphoribosylformylglycinamidine synthase is composed of a single polypeptide chain. This entry describes the PurQ protein of Bacillus subtilis (where PurL, PurQ, and PurS are required for phosphoribosylformylglycinamidine synthase activity) and functionally equivalent proteins from other bacteria and archaea.; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process, 0005737 cytoplasm. Probab=98.43 E-value=3.4e-06 Score=64.27 Aligned_cols=180 Identities=27% Similarity=0.338 Sum_probs=105.3 Q ss_pred EEEEEECC---CCCHHHHHHH-HHCCCCEEE----ECC-CC-HHHH-HHCCCCEEEECCC--CCCC------CCC---CC Q ss_conf 49999679---6410232222-202454133----046-57-4464-4135788998588--8884------101---12 Q gi|254780420|r 207 HVVCIDYG---IRSNLLRVLS-SLGCRITIV----GAE-TS-YKDI-MSLQPDGVLLSNG--PGDP------AVT---SV 264 (396) Q Consensus 207 kIvvIDfG---iK~nIlr~L~-~~g~~v~Vv----p~~-~~-~~~i-~~~~pdgi~lSnG--PGdP------~~~---~~ 264 (396) |||||=|. -=+-.++.|. ..|.+..+| =|+ ++ +++- --.++|+|+|-.| -||- ++. + T Consensus 2 kvAV~~FPGtNCd~D~~~A~~k~~Gv~~~~Vhikqw~~D~~r~~~~rsvfD~D~VvlPGGFSyGDYLRaGAIAa~rvk~- 80 (264) T TIGR01737 2 KVAVIRFPGTNCDRDTVYALEKLLGVDAEIVHIKQWYEDKSRLSDQRSVFDYDAVVLPGGFSYGDYLRAGAIAALRVKS- 80 (264) T ss_pred EEEEEECCCCCCHHHHHHHHHHHCCCCEEEEEEEECHHHHCCCCCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHHCCH- T ss_conf 6899977999840899999997149983799865210000478766417873348866877723577799999875030- Q ss_pred CCHHHHHHHCCCCCCEEEEECHHHHHHHHC---C-------------CCCC--CCCCCCCCCCCCCE--------ECCCC Q ss_conf 110122210257863478610058878752---8-------------5446--67767521011234--------77897 Q gi|254780420|r 265 YSSPIICKLVDSGMPMFGICLGHQLLGLAL---G-------------AQTV--KMYQGHHGVNHPVK--------NISNG 318 (396) Q Consensus 265 ~~~~~~~~~~~~~iPilGIClGhQ~l~~a~---G-------------~~~~--kl~~GhrG~N~Pv~--------~~~t~ 318 (396) .+...++++.+.+.||||||=|+|+|.-+= | |=+. .++|==|=.+--|. .+..| T Consensus 81 p~~~~V~~~A~~G~pVLGiCNGFQIL~E~GLLpGfDeDkPlaekPemAL~~N~~~rFisr~v~LrV~n~~T~FT~~y~kG 160 (264) T TIGR01737 81 PIMQEVREFAEKGKPVLGICNGFQILVEAGLLPGFDEDKPLAEKPEMALLPNASLRFISRWVYLRVENADTAFTKKYKKG 160 (264) T ss_pred HHHHHHHHHHHCCCEEEEEECHHHHHHHHHHCCCCCCCCCCCCCCHHHHHCCCCCCCEEEEEEEEEEECCCHHHHCCCCC T ss_conf 88999999995598289976647999997624898878874357122442058897453203689985678366357988 Q ss_pred EEEEECCCCC---EEEE----HHHC---CC-CCEEE-E----------EECCCCE---EEEEEECCCCEEEECCCCCCC- Q ss_conf 0999614576---1570----3436---99-82899-9----------9889980---872686799778983487887- Q gi|254780420|r 319 TVEIVSMNHG---FAID----ASSL---PA-GLEET-H----------ISLFDSS---NCGFRLIDQPVFSVQYHPESS- 372 (396) Q Consensus 319 kv~iTsqNHg---~~v~----~~sl---p~-~~~v~-~----------~~~nD~t---iegi~~~~~~~~~vQfHPEa~- 372 (396) .|-.--==|| |-++ .+++ .+ +-.|. | .|.| || |+||..++.+++|..=|||=+ T Consensus 161 ev~~~PiAHgEGrYy~~sGke~e~~~~l~~n~qivFrY~d~~G~~~e~~NPN-GS~~nIAGi~N~~gnV~GmMPHPERA~ 239 (264) T TIGR01737 161 EVIRVPIAHGEGRYYADSGKEDETLAELESNDQIVFRYCDEDGDVSEEVNPN-GSVDNIAGIVNEEGNVLGMMPHPERAS 239 (264) T ss_pred CEEEEEEEECCCCEECCCCCCHHHHHHHHHCCEEEEEEECCCCCEECCCCCC-CCCCCEEEEECCCCCEEEECCCHHHHH T ss_conf 7887654425872662689975799998547918999767998532454789-764671057808987787189807876 Q ss_pred --------CCCCCHHHHHHHHHHH Q ss_conf --------7670016899999999 Q gi|254780420|r 373 --------PGPQDSYYLFSTFIDF 388 (396) Q Consensus 373 --------PGP~D~~~~F~~F~~~ 388 (396) -|-+|+..||.-|++- T Consensus 240 ~~~lg~r~eG~~DG~~Lf~~l~~~ 263 (264) T TIGR01737 240 EKLLGSRREGADDGLKLFESLVEW 263 (264) T ss_pred HHHHCCCCCCCHHHHHHHHHHHHH T ss_conf 454068875637799999999971 No 86 >PRK01175 phosphoribosylformylglycinamidine synthase subunit I; Provisional Probab=98.40 E-value=7.8e-06 Score=61.78 Aligned_cols=186 Identities=18% Similarity=0.246 Sum_probs=99.3 Q ss_pred CCEEEEEEC-CC--CCHHHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEECCCC--CCCCCCCCC--------CHHHH Q ss_conf 714999967-96--410232222202454133046-57446441357889985888--884101121--------10122 Q gi|254780420|r 205 KYHVVCIDY-GI--RSNLLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLSNGP--GDPAVTSVY--------SSPII 270 (396) Q Consensus 205 ~~kIvvIDf-Gi--K~nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lSnGP--GdP~~~~~~--------~~~~~ 270 (396) ++||++|-| |+ -+-..+.+.+.|+++..|-.+ .-..+-.-.++|+|+|-.|= ||--..... ..+.+ T Consensus 3 ~PkvaIl~fpGsNcd~e~~~Af~~aG~~~~~v~~~dl~~~~~~L~~~~~lv~pGGFSyGD~l~aG~~~a~~~~~~~~~~i 82 (255) T PRK01175 3 SPKVGVLRMEGTNNETEVFKSVRRSGGEPEYVHINDLAAGRKSVSDYDCLIIPGGFSAGDYIRAGAIFAARLKAVLGKDI 82 (255) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 98699996586481999999999859963899865233676665555489973776764335751889998679999999 Q ss_pred HHHCCCCCCEEEEECHHHHHHHHC---CCCCC-CCC------CCCCCCC------CCC--EECCCCEEEE-ECCCCC-EE Q ss_conf 210257863478610058878752---85446-677------6752101------123--4778970999-614576-15 Q gi|254780420|r 271 CKLVDSGMPMFGICLGHQLLGLAL---GAQTV-KMY------QGHHGVN------HPV--KNISNGTVEI-VSMNHG-FA 330 (396) Q Consensus 271 ~~~~~~~iPilGIClGhQ~l~~a~---G~~~~-kl~------~GhrG~N------~Pv--~~~~t~kv~i-TsqNHg-~~ 330 (396) +++.+.+.||||||=|+|+|..+- |++-. .|. |--|=++ .++ +.+..+.+.| .++.-| |. T Consensus 83 ~~f~~~g~~vLGICNGfQiL~e~gLLPg~~~~~~L~~N~s~rF~~r~v~l~v~~~~~~f~~~~~~~~l~ipvAHgEGrf~ 162 (255) T PRK01175 83 REFIDSGRPIIGICNGFQVLMELGLIPAERPEIALTNNESNRFECRYTYIKMTSEKCIFVKNLGKGVFQVPVAHAEGRVA 162 (255) T ss_pred HHHHHCCCCEEEECCHHHHHHHCCCCCCCCCCCCCCCCCCCCEEEEEEEEEEECCCHHHHHCCCCCEEEEEEECCCCCEE T ss_conf 99996799679980168999975877899988020247999878678799994483466641489779987553768557 Q ss_pred E-EHHHC----CCCCEE-EEEE-----------CCCC---EEEEEEECCCCEEEECCCCCCC------------CCCCCH Q ss_conf 7-03436----998289-9998-----------8998---0872686799778983487887------------767001 Q gi|254780420|r 331 I-DASSL----PAGLEE-THIS-----------LFDS---SNCGFRLIDQPVFSVQYHPESS------------PGPQDS 378 (396) Q Consensus 331 v-~~~sl----p~~~~v-~~~~-----------~nD~---tiegi~~~~~~~~~vQfHPEa~------------PGP~D~ 378 (396) + +.+.| ..+..+ .|.+ .| | .|+||..++..+++..=|||-+ .|.-|. T Consensus 163 ~~~~~~l~~l~~n~qi~~rY~d~~G~~~~yp~NPN-GS~~~IAGI~n~~GrvlgmMPHPER~~~~~q~~~~~~~~~~g~g 241 (255) T PRK01175 163 VSEDSVLKRLYENDQIVFRYVDENGVYDGYPWNPN-GSIDNIAGITNETGNVIGLMPHPERIYYGYQAMYNERRKDYGTG 241 (255) T ss_pred ECCHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCC-CCHHHHCEEECCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCCH T ss_conf 58989999998589658999669986457888998-79767208678999998878996325646558878666777869 Q ss_pred HHHHHHHHHHHHH Q ss_conf 6899999999997 Q gi|254780420|r 379 YYLFSTFIDFMHK 391 (396) Q Consensus 379 ~~~F~~F~~~i~~ 391 (396) ..+|.+.++-+++ T Consensus 242 ~~~F~nav~~~k~ 254 (255) T PRK01175 242 KIFFRSLVNYLEK 254 (255) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999964 No 87 >COG0047 PurL Phosphoribosylformylglycinamidine (FGAM) synthase, glutamine amidotransferase domain [Nucleotide transport and metabolism] Probab=98.36 E-value=2e-05 Score=58.92 Aligned_cols=177 Identities=24% Similarity=0.299 Sum_probs=99.2 Q ss_pred CCEEEEEECCCCCH---HHHHHHHHCCCCEEEECCCCHHHHHHC-CCCEEEECCCC--CCCCCCC-----CCCHHHHHHH Q ss_conf 71499996796410---232222202454133046574464413-57889985888--8841011-----2110122210 Q gi|254780420|r 205 KYHVVCIDYGIRSN---LLRVLSSLGCRITIVGAETSYKDIMSL-QPDGVLLSNGP--GDPAVTS-----VYSSPIICKL 273 (396) Q Consensus 205 ~~kIvvIDfGiK~n---Ilr~L~~~g~~v~Vvp~~~~~~~i~~~-~pdgi~lSnGP--GdP~~~~-----~~~~~~~~~~ 273 (396) +.||++|=|-=-+. ..+.+...|+++..|-++ |.... ++|+|++-.|= ||--.+. ..+.+.++++ T Consensus 2 ~~kvaVi~fpGtN~d~d~~~A~~~aG~~~~~V~~~----d~~~~~~~d~vv~pGGFSyGDyLr~Gaiaa~~~v~~~v~~~ 77 (231) T COG0047 2 RPKVAVLRFPGTNCDYDMAAAFERAGFEAEDVWHS----DLLLGRDFDGVVLPGGFSYGDYLRAGAIAAIAPVMDEVREF 77 (231) T ss_pred CCEEEEEEECCCCCHHHHHHHHHHCCCCCEEEEEE----ECCCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 95699997078684599999999849982499855----03567774689974887764335732677557899999999 Q ss_pred CCCCCCEEEEECHHHHHHHHC---CCCCC--CCCCCCCCCCCCCEECCCC----------EEEEECCCCC---EEEEHHH Q ss_conf 257863478610058878752---85446--6776752101123477897----------0999614576---1570343 Q gi|254780420|r 274 VDSGMPMFGICLGHQLLGLAL---GAQTV--KMYQGHHGVNHPVKNISNG----------TVEIVSMNHG---FAIDASS 335 (396) Q Consensus 274 ~~~~iPilGIClGhQ~l~~a~---G~~~~--kl~~GhrG~N~Pv~~~~t~----------kv~iTsqNHg---~~v~~~s 335 (396) .+...|+||||=|+|+|.++- |+=+. .++|--|=. .++-..+. .+.--.--|| |.++.+. T Consensus 78 a~~g~~vLGICNGfQiL~e~gLlPGal~~N~s~~F~cr~v--~l~V~~~~t~ft~~~~~g~~i~ipVAHgEGr~~~~~~~ 155 (231) T COG0047 78 AEKGKPVLGICNGFQILSEAGLLPGALTRNESLRFECRWV--YLRVENNNTPFTSGYEGGEVIPIPVAHGEGRYYADDET 155 (231) T ss_pred HHCCCEEEEECCHHHHHHHCCCCCCCEECCCCCCEEEEEE--EEEEECCCCHHHHHCCCCCEEEEEEEECCEEEECCHHH T ss_conf 8689719998535089897275776102478984588989--99993589877873578855987777056247716788 Q ss_pred C---CC-CCEEE-EE----------ECCCCE---EEEEEECCCCEEEECCCCCCC----CCCCCHHHHHHHHHHH Q ss_conf 6---99-82899-99----------889980---872686799778983487887----7670016899999999 Q gi|254780420|r 336 L---PA-GLEET-HI----------SLFDSS---NCGFRLIDQPVFSVQYHPESS----PGPQDSYYLFSTFIDF 388 (396) Q Consensus 336 l---p~-~~~v~-~~----------~~nD~t---iegi~~~~~~~~~vQfHPEa~----PGP~D~~~~F~~F~~~ 388 (396) | .. +..+. |. |.| || |+||..++-.+++..=|||-. .|-.|...||...++- T Consensus 156 l~~l~~ngqvvfrY~d~~G~~~~~~NPN-GS~~~IaGI~n~~G~V~gmMPHPERa~~~~~g~~Dg~~lF~s~~~~ 229 (231) T COG0047 156 LAELEENGQVVFRYVDNNGETEEYANPN-GSVNGIAGITNEDGNVLGMMPHPERASESLLGGEDGLRLFRSARKY 229 (231) T ss_pred HHHHHHCCEEEEEEECCCCCEEEEECCC-CCHHHCEEEECCCCCEEEECCCCHHHHHCCCCCCHHHHHHHHHHHH T ss_conf 9987548979999936998540334798-9966620577699978986489444332026871389999999986 No 88 >PRK03619 phosphoribosylformylglycinamidine synthase subunit I; Provisional Probab=98.30 E-value=2.4e-05 Score=58.28 Aligned_cols=121 Identities=24% Similarity=0.372 Sum_probs=71.1 Q ss_pred CHHHHHHHCCCCCCEEEEECHHHHHHHHCCC---CCCC---CCCCCCCC-------CCCC-EECCCCE---EEEECCCCC Q ss_conf 1012221025786347861005887875285---4466---77675210-------1123-4778970---999614576 Q gi|254780420|r 266 SSPIICKLVDSGMPMFGICLGHQLLGLALGA---QTVK---MYQGHHGV-------NHPV-KNISNGT---VEIVSMNHG 328 (396) Q Consensus 266 ~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~---~~~k---l~~GhrG~-------N~Pv-~~~~t~k---v~iTsqNHg 328 (396) +.+.++++.+.+.|+||||=|+|+|..+ |. ...+ .+|--|=. |-|- +++..+. ++|. || T Consensus 67 i~~~i~~f~~~g~~vLGICNGfQiL~e~-gLlPgaL~~N~s~rF~~r~v~lkv~~~~s~f~~~~~~g~~l~ipVA---Hg 142 (223) T PRK03619 67 VMQAVVEAAEKGKPVLGICNGFQILLEA-GLLPGALTRNENLKFICRDVKLRVENNDTAFTSRYEKGEVIRIPIA---HG 142 (223) T ss_pred HHHHHHHHHHCCCCEEEECHHHHHHHHC-CCCCCCCCCCCCCCEEEEEEEEEECCCCCCHHCCCCCCCEEEEEEE---CC T ss_conf 9999999986799779987278989864-7788621026999879865678865898401037899988988724---48 Q ss_pred ---EEEEHHHCC----CCCEEEEE----ECCCCE---EEEEEECCCCEEEECCCCCCCC----CCCCHHHHHHHHHHHHH Q ss_conf ---157034369----98289999----889980---8726867997789834878877----67001689999999999 Q gi|254780420|r 329 ---FAIDASSLP----AGLEETHI----SLFDSS---NCGFRLIDQPVFSVQYHPESSP----GPQDSYYLFSTFIDFMH 390 (396) Q Consensus 329 ---~~v~~~slp----~~~~v~~~----~~nD~t---iegi~~~~~~~~~vQfHPEa~P----GP~D~~~~F~~F~~~i~ 390 (396) |..+++.|. .+..+..+ |.| || |+||..++..+++..=|||-+- |..|+..+|...++-.+ T Consensus 143 EGr~~~~~~~l~~l~~~~qi~~~Y~d~~NPN-GS~~~IAGI~~~~GrvlgmMPHPERa~~~~~g~~dG~~~f~s~v~~~~ 221 (223) T PRK03619 143 EGNYYADEETLKELEGNGQVVFRYVDETNPN-GSVNDIAGICNENGNVLGLMPHPERAVEALLGSDDGLKLFKSALDAWR 221 (223) T ss_pred CCCEECCHHHHHHHHHCCCEEEEECCCCCCC-CCHHHHHEEECCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 7533559999999997897899989999998-896465285879999988889964463554478614999999999975 Q ss_pred H Q ss_conf 7 Q gi|254780420|r 391 K 391 (396) Q Consensus 391 ~ 391 (396) + T Consensus 222 ~ 222 (223) T PRK03619 222 E 222 (223) T ss_pred H T ss_conf 1 No 89 >cd01740 GATase1_FGAR_AT Type 1 glutamine amidotransferase (GATase1)-like domain found in Formylglycinamide ribonucleotide amidotransferase. Type 1 glutamine amidotransferase (GATase1)-like domain found in Formylglycinamide ribonucleotide amidotransferase (FGAR-AT). FGAR-AT catalyzes the ATP-dependent conversion of formylglycinamide ribonucleotide (FGAR) and glutamine to formylglycinamidine ribonucleotide (FGAM), ADP, Pi, and glutamate in the fourth step of the purine biosynthetic pathway. FGAR-AT is a glutamine amidotransferase. Glutamine amidotransferase activity catalyses the transfer of ammonia from the amide side chain of glutamine to an acceptor substrate. FGAR-AT belongs to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site Probab=98.22 E-value=5.2e-05 Score=55.97 Aligned_cols=164 Identities=23% Similarity=0.223 Sum_probs=87.9 Q ss_pred HHHHHHHHHCCCCEEEECCCC-HHHHHHCCCCEEEECCC--CCCCC------CCCCCCHHHHHHHCCCCCCEEEEECHHH Q ss_conf 023222220245413304657-44644135788998588--88841------0112110122210257863478610058 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETS-YKDIMSLQPDGVLLSNG--PGDPA------VTSVYSSPIICKLVDSGMPMFGICLGHQ 288 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~-~~~i~~~~pdgi~lSnG--PGdP~------~~~~~~~~~~~~~~~~~iPilGIClGhQ 288 (396) -..+.|...|+++..|-.+-- .....-.++|+|++.+| -||-- .......+.++.+.+...|+||||=|+| T Consensus 14 e~~~Af~~aG~~~~~V~~~dl~~~~~~L~~~~~lv~pGGFSyGD~l~ag~~~a~s~~~~~~l~~f~~~g~~vLGICNGfQ 93 (238) T cd01740 14 DMAYAFELAGFEAEDVWHNDLLAGRKDLDDYDGVVLPGGFSYGDYLRAGAIAAASPLLMEEVKEFAERGGLVLGICNGFQ 93 (238) T ss_pred HHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCHHHHHHHCCCHHHHHHHHHCCCCEEEEECHHHH T ss_conf 99999998699269998101656646611075899868667766655567887481130899997548974898530567 Q ss_pred HHHHHC---CCCCC--CCCCCCCCCC--CCCEECCC----------CE---EEEECCCCC---EEEEHHHC----CCCCE Q ss_conf 878752---85446--6776752101--12347789----------70---999614576---15703436----99828 Q gi|254780420|r 289 LLGLAL---GAQTV--KMYQGHHGVN--HPVKNISN----------GT---VEIVSMNHG---FAIDASSL----PAGLE 341 (396) Q Consensus 289 ~l~~a~---G~~~~--kl~~GhrG~N--~Pv~~~~t----------~k---v~iTsqNHg---~~v~~~sl----p~~~~ 341 (396) +|...- |+... ++.|--|-.+ ..++-..+ +. ++|. || |..+.+++ ..+.. T Consensus 94 iL~~lGllpg~~~~n~~~~~~~~~~~r~v~l~v~~~~s~f~~~~~~g~~l~ipva---HgEGr~~~~~~~l~~l~~~~~i 170 (238) T cd01740 94 ILVELGLLPGALIRNKGLKFICRWQNRFVTLRVENNDSPFTKGYMEGEVLRIPVA---HGEGRFYADDETLAELEENGQI 170 (238) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCHHHHCCCCCCEEEEEEE---CCCEEEECCHHHHHHHHHCCCE T ss_conf 8887067785001487765467560057999978999867716578988888875---3723597182878999867956 Q ss_pred EEE------------EECC--CCEEEEEEECCCCEEEECCCCCCC---------CCCCCHHHHHHH Q ss_conf 999------------9889--980872686799778983487887---------767001689999 Q gi|254780420|r 342 ETH------------ISLF--DSSNCGFRLIDQPVFSVQYHPESS---------PGPQDSYYLFST 384 (396) Q Consensus 342 v~~------------~~~n--D~tiegi~~~~~~~~~vQfHPEa~---------PGP~D~~~~F~~ 384 (396) +.+ .|.| ...|+||..++-.+++..-|||-+ ||..|...+|++ T Consensus 171 ~~~~~~~~~~~~~yP~NPNGS~~~IAGI~s~~GrvlgmMPHPER~~~~~q~~~~~~~~~g~~~F~n 236 (238) T cd01740 171 AQYVDDDGNVTERYPANPNGSLDGIAGICNEDGRVLGMMPHPERAVEPWQWERLLGGSDGLKLFRN 236 (238) T ss_pred EEEEECCCCCCCCCCCCCCCCHHCEEEEECCCCCEEEECCCHHHHCCCCCCCCCCCCCCHHHHHHH T ss_conf 999957998022688899888202127887999999974785773065556567889818999975 No 90 >PRK13526 glutamine amidotransferase subunit PdxT; Provisional Probab=98.04 E-value=1.5e-05 Score=59.81 Aligned_cols=153 Identities=18% Similarity=0.261 Sum_probs=83.8 Q ss_pred EEEEEEC-CCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC-----CCHHHHHHHCCCCCCE Q ss_conf 4999967-964102322222024541330465744644135788998588888410112-----1101222102578634 Q gi|254780420|r 207 HVVCIDY-GIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV-----YSSPIICKLVDSGMPM 280 (396) Q Consensus 207 kIvvIDf-GiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~-----~~~~~~~~~~~~~iPi 280 (396) ||-|+-+ |.=.-=++.|.+.|+++..|... +++.. -||++|-+|-.. .... ...+.++++..+ .|| T Consensus 4 kIGVLaLQG~~~eH~~~l~~lg~~~~~Vr~~---~dL~~--idgLIiPGGEST--ti~~ll~~~~L~~~l~~~~~~-~Pv 75 (179) T PRK13526 4 KVGVLAIQGGYQKHADMFKSLGVEVKLVKFN---NDFDS--IDRLVIPGGEST--TLLNLLNKHQIFDKLYNFCSS-KPV 75 (179) T ss_pred EEEEEEECCCHHHHHHHHHHCCCCEEEECCH---HHHHC--CCEEEECCCCHH--HHHHHHHHCCCHHHHHHHHHC-CCE T ss_conf 8999996288799999999879968997178---88600--775897599778--999997675549999999858-990 Q ss_pred EEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEEEECCCCCEEEEHH-----------------HCCCCCEEE Q ss_conf 786100588787528544667767521011234778970999614576157034-----------------369982899 Q gi|254780420|r 281 FGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISNGTVEIVSMNHGFAIDAS-----------------SLPAGLEET 343 (396) Q Consensus 281 lGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTsqNHg~~v~~~-----------------slp~~~~v~ 343 (396) ||.|-|.-+||. |-+ +-| +.|+.-.|-.+-.|..+|..+-+ ++.++.+|. T Consensus 76 ~GTCAGlILLak--~~~-------~lg----~ldi~V~RNayGrQ~~SFe~~~~~~~~~~~avFIRAP~I~~vg~~V~vL 142 (179) T PRK13526 76 FGTCAGSIILSK--GEG-------YLN----LLDLEVQRNAYGRQVDSFVADISFNDKNITGVFIRAPKFIVVGNQVDIL 142 (179) T ss_pred EEECCEEEEEEC--CCC-------CCC----CEEEEEEECCCCCCCCEEEEECCCCCCCEEEEEECCCCEEECCCCCEEE T ss_conf 886120689826--987-------667----1358998255457432058713658995369998275188808995899 Q ss_pred EEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 998899808726867997789834878877670016899999999 Q gi|254780420|r 344 HISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 344 ~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~ 388 (396) +.- |+.+-+ -+..++++.-||||-+. |+ .+-.-|++| T Consensus 143 a~~--~~~~V~--vrqgn~la~sFHPELt~---D~-r~H~yFl~m 179 (179) T PRK13526 143 SKY--QNSPVL--LRQANILVSSFHPELTQ---DP-TVHEYFLAM 179 (179) T ss_pred EEE--CCEEEE--EEECCEEEEEECCCCCC---CC-HHHHHHHCC T ss_conf 998--998899--99899999982887548---77-799997429 No 91 >PRK01077 cobyrinic acid a,c-diamide synthase; Validated Probab=97.94 E-value=0.00033 Score=50.36 Aligned_cols=88 Identities=18% Similarity=0.317 Sum_probs=55.6 Q ss_pred CCCCCEEEEE-----ECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC------CCHHHH Q ss_conf 7777149999-----67964102322222024541330465744644135788998588888410112------110122 Q gi|254780420|r 202 SDAKYHVVCI-----DYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV------YSSPII 270 (396) Q Consensus 202 ~~~~~kIvvI-----DfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~------~~~~~~ 270 (396) ...+.+|++- .|-...| ++.|.+.|++++-+..- .++-+-.+.|+|+|-.|. |+.... ...+.+ T Consensus 242 ~~~~~rIava~D~AF~FyY~en-l~~L~~~Gael~~fSPl--~d~~lP~~~D~l~lpGGy--PEl~a~~L~~n~~l~~~I 316 (451) T PRK01077 242 PPPGVRIAVARDAAFSFYYPDN-LELLRAAGAELVFFSPL--ADEALPDDCDALYLGGGY--PELFAAELAANTSMRASI 316 (451) T ss_pred CCCCCEEEEECCCCCCCCCHHH-HHHHHHCCCEEEEECCC--CCCCCCCCCCEEEECCCC--HHHHHHHHHHHHHHHHHH T ss_conf 6789769994342247557667-99999789889995776--788898667889957985--024399998708999999 Q ss_pred HHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 210257863478610058878752 Q gi|254780420|r 271 CKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 271 ~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) ++..+.+.||+|||=|.|.|+... T Consensus 317 ~~~~~~G~pi~aeCGG~~~Lg~~i 340 (451) T PRK01077 317 RAAAAAGKPIYAECGGLMYLGESL 340 (451) T ss_pred HHHHHCCCCEEEECHHHHHHHHHC T ss_conf 999986997899871178875432 No 92 >TIGR01382 PfpI intracellular protease, PfpI family; InterPro: IPR006286 Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad . This group of cysteine peptidases belong to MEROPS peptidase family C56 (Pfp1 endopeptidase, clan PC(C)). The member of this family from Pyrococcus horikoshii has been solved to 2 Angstrom resolution. It is an ATP-independent intracellular protease that crystallises as a hexameric ring. Cys-101 is proposed as the active site residue in a catalytic triad with the adjacent His-102 and a Glu residue from an adjacent monomer. A member of this family from Bacillus subtilis, GSP18, has been shown to be expressed in response to several forms of stress. A role in the degradation of small peptides has been suggested. This family is contained in a larger one of the thiamine biosynthesis protein ThiJ and its homologs.; GO: 0016798 hydrolase activity acting on glycosyl bonds. Probab=97.93 E-value=7.5e-06 Score=61.87 Aligned_cols=97 Identities=20% Similarity=0.323 Sum_probs=71.4 Q ss_pred CEEEEEE-CCCCCHHHH-----------HHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCC--CCCCCHHHHH Q ss_conf 1499996-796410232-----------2222024541330465744644135788998588888410--1121101222 Q gi|254780420|r 206 YHVVCID-YGIRSNLLR-----------VLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAV--TSVYSSPIIC 271 (396) Q Consensus 206 ~kIvvID-fGiK~nIlr-----------~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~--~~~~~~~~~~ 271 (396) ++|-++- -+-|.+++. +=.+.|..|+| |.++++|...+||+|+|.+|-++|.. .+.....++| T Consensus 27 ~~V~~v~~g~~~g~~v~~aihdfeGdqty~~~~g~~v~v---d~~~~~v~~~~YDal~ipGG~a~~e~LR~d~~~~~lvR 103 (189) T TIGR01382 27 HEVTVVSIGKEAGTIVGDAIHDFEGDQTYTEKHGYSVTV---DATIDDVDPEDYDALVIPGGRASPEYLRLDEKVIRLVR 103 (189) T ss_pred CEEEEEEECCEECCEEECEEECCCCCCCCCCCCCEEEEE---ECCEECCCHHHCCEEEEECCCCCCCCCCCCHHHHHHHH T ss_conf 889898316500232201101156763213478816887---44121478525647897177356000148768999999 Q ss_pred HHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 1025786347861005887875285446677675210112 Q gi|254780420|r 272 KLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHP 311 (396) Q Consensus 272 ~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~P 311 (396) ++.+.++||..||.|-|+|+.|- +.| ||+..-.| T Consensus 104 ~f~e~gK~vaaIChgp~lLi~A~---Vlr---Gkk~T~y~ 137 (189) T TIGR01382 104 EFVEKGKPVAAICHGPQLLISAG---VLR---GKKLTAYP 137 (189) T ss_pred HHHHCCCEEEEEECCCEEEEECC---EEC---CCEEEECC T ss_conf 98417881999845300000255---103---75375339 No 93 >cd01750 GATase1_CobQ Type 1 glutamine amidotransferase (GATase1) domain found in Cobyric Acid Synthase (CobQ). Type 1 glutamine amidotransferase (GATase1) domain found in Cobyric Acid Synthase (CobQ). CobQ plays a role in cobalamin biosythesis. CobQ catalyses amidations at positions B, D, E, and G on adenosylcobyrinic A,C-diamide in the biosynthesis of cobalamin. CobQ belongs to the triad family of amidotransferases. Two of the three residues of the catalytic triad that are involved in glutamine binding, hydrolysis and transfer of the resulting ammonia to the acceptor substrate in other triad aminodotransferases are conserved in CobQ. Probab=97.91 E-value=0.00018 Score=52.09 Aligned_cols=66 Identities=26% Similarity=0.429 Sum_probs=42.1 Q ss_pred HHHH-HHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCC-------CHHHHHHHCCCCCCEEEEECHHHHHHH Q ss_conf 2222-20245413304657446441357889985888884101121-------101222102578634786100588787 Q gi|254780420|r 221 RVLS-SLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVY-------SSPIICKLVDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 221 r~L~-~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~-------~~~~~~~~~~~~iPilGIClGhQ~l~~ 292 (396) +.|. +.++++.++... +++ .+||.||| ||......+. ..+.+++..+.+.||||||-|.|+|+. T Consensus 16 ~~l~~~~~v~v~~v~~~---~~l--~~~D~iIl---PGsK~t~~DL~~l~~~Gl~~~i~~~~~~g~~vlgICGGyQmLG~ 87 (194) T cd01750 16 DPLAREPGVDVRYVEVP---EGL--GDADLIIL---PGSKDTIQDLAWLRKRGLAEAIKNYARAGGPVLGICGGYQMLGK 87 (194) T ss_pred HHHHCCCCCEEEEECCC---CCC--CCCCEEEE---CCCCCHHHHHHHHHHCCHHHHHHHHHHCCCCEEEECHHHHHHHC T ss_conf 77600699389996797---764--66999998---99883899999998749799999999759978997365673000 Q ss_pred HC Q ss_conf 52 Q gi|254780420|r 293 AL 294 (396) Q Consensus 293 a~ 294 (396) .+ T Consensus 88 ~i 89 (194) T cd01750 88 YI 89 (194) T ss_pred EE T ss_conf 35 No 94 >cd01653 GATase1 Type 1 glutamine amidotransferase (GATase1)-like domain. Type 1 glutamine amidotransferase (GATase1)-like domain. This group includes proteins similar to Class I glutamine amidotransferases, the intracellular PH1704 from Pyrococcus horikoshii, the C-terminal of the large catalase: Escherichia coli HP-II, Sinorhizobium meliloti Rm1021 ThuA. and, the A4 beta-galactosidase middle domain. The majority of proteins in this group have a reactive Cys found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For Class I glutamine amidotransferases proteins which transfer ammonia from the amide side chain of glutamine to an acceptor substrate, this Cys forms a Cys-His-Glu catalytic triad in the active site. Glutamine amidotransferases activity can be found in a range of biosynthetic enzymes included in this cd: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase, anthranilate synthase component II, glutamine-depende Probab=97.45 E-value=0.00012 Score=53.39 Aligned_cols=73 Identities=30% Similarity=0.542 Sum_probs=50.2 Q ss_pred HHHHHHHHHCCCCEEEECCCCHH--HHHHCCCCEEEECCCCCCCCCCC--CCCHHHHHHHCCCCCCEEEEECHHHHH Q ss_conf 02322222024541330465744--64413578899858888841011--211012221025786347861005887 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYK--DIMSLQPDGVLLSNGPGDPAVTS--VYSSPIICKLVDSGMPMFGICLGHQLL 290 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~--~i~~~~pdgi~lSnGPGdP~~~~--~~~~~~~~~~~~~~iPilGIClGhQ~l 290 (396) .....|.+.+..+.+++...... .....++|+|++.+|+..+.... ......+++....+.|++|+|.|+|++ T Consensus 16 ~~~~~l~~~~~~~~~i~~~~~~~~~~~~~~~~d~liipgg~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~c~g~~~l 92 (115) T cd01653 16 SPLDALREAGAEVDVVSPDGGPVESDVDLDDYDGLILPGGPGTPDDLARDEALLALLREAAAAGKPILGICLGAQLL 92 (115) T ss_pred HHHHHHHHCCCEEEEEECCCCCCCCCCCHHCCCEEEECCCCCCHHHHHHCHHHHHHHHHHHHCCCEEEEECHHHHHH T ss_conf 99999997799799996589986665762008999989986329998639999999999998699899988588999 No 95 >PRK05368 homoserine O-succinyltransferase; Provisional Probab=97.43 E-value=0.0033 Score=43.27 Aligned_cols=148 Identities=18% Similarity=0.184 Sum_probs=89.0 Q ss_pred CCHHHHHHCCCCEEEECCCCCCCCCCCC----C---CHHHHHHHCCCCCCEEEEECHHHHHH-HHCCCCCCCCC------ Q ss_conf 5744644135788998588888410112----1---10122210257863478610058878-75285446677------ Q gi|254780420|r 237 TSYKDIMSLQPDGVLLSNGPGDPAVTSV----Y---SSPIICKLVDSGMPMFGICLGHQLLG-LALGAQTVKMY------ 302 (396) Q Consensus 237 ~~~~~i~~~~pdgi~lSnGPGdP~~~~~----~---~~~~~~~~~~~~iPilGIClGhQ~l~-~a~G~~~~kl~------ 302 (396) .++++|...++||+++.+-| ++..+- + ..+++..........|.||+|.|+.. ..+|-+-+.|+ T Consensus 90 ~~f~~ik~~~~DGlIITGAP--vE~l~Fe~V~YW~El~~I~dws~~~v~stl~iCWaA~AaL~~~ygI~K~~l~~K~fGV 167 (302) T PRK05368 90 RTFEDIKDQKFDGLIVTGAP--VELLPFEDVDYWDELREILDWAKTHVTSTLFICWAAQAALYHLYGIPKYTLPEKLSGV 167 (302) T ss_pred HHHHHHHHCCCCEEEECCCC--CCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHCCCCEECCCCEEEE T ss_conf 64999965877669972898--7778855374199999999999862652256749999999997099710068735875 Q ss_pred CCCCCC--CCCCEECCCCEEEEECCCCCEEEEHHHCC--CCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCH Q ss_conf 675210--11234778970999614576157034369--98289999889980872686799778983487887767001 Q gi|254780420|r 303 QGHHGV--NHPVKNISNGTVEIVSMNHGFAIDASSLP--AGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDS 378 (396) Q Consensus 303 ~GhrG~--N~Pv~~~~t~kv~iTsqNHg~~v~~~slp--~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~ 378 (396) |.|+-. +||...--+. .+..-|..--.|+++.+. .+++|...+...|.. -+..++....=+|-|||= |+ T Consensus 168 f~h~~~~~~~pL~rGfdd-~f~~PhSR~tei~~~~i~~~~~L~ILa~s~e~G~~-l~~s~~~r~vfi~GH~EY-----d~ 240 (302) T PRK05368 168 FEHRVLVPHHPLLRGFDD-SFLVPHSRYTEVREEDIRAATGLEILAESEEAGVY-LFASKDKRIVFVTGHPEY-----DA 240 (302) T ss_pred ECCCCCCCCCHHHCCCCC-CCCCCCCCCCCCCHHHHHHCCCCEEEECCCCCCCE-EEECCCCCEEEEECCCCC-----CH T ss_conf 122346899800137876-53067412236899999508896798636777716-998289886888278666-----75 Q ss_pred HHHHHHHHHHHHHHC Q ss_conf 689999999999735 Q gi|254780420|r 379 YYLFSTFIDFMHKRK 393 (396) Q Consensus 379 ~~~F~~F~~~i~~~k 393 (396) .-|-+++..-+.+.. T Consensus 241 ~TL~~EY~RD~~~g~ 255 (302) T PRK05368 241 DTLAREYFRDLGRGL 255 (302) T ss_pred HHHHHHHHHHHHCCC T ss_conf 489999999997699 No 96 >pfam04204 HTS Homoserine O-succinyltransferase. Probab=97.32 E-value=0.0067 Score=41.11 Aligned_cols=148 Identities=18% Similarity=0.137 Sum_probs=87.5 Q ss_pred CCHHHHHHCCCCEEEECCCCCCCCCCCC----C---CHHHHHHHCCCCCCEEEEECHHHH-HHHHCCCCCCCCC------ Q ss_conf 5744644135788998588888410112----1---101222102578634786100588-7875285446677------ Q gi|254780420|r 237 TSYKDIMSLQPDGVLLSNGPGDPAVTSV----Y---SSPIICKLVDSGMPMFGICLGHQL-LGLALGAQTVKMY------ 302 (396) Q Consensus 237 ~~~~~i~~~~pdgi~lSnGPGdP~~~~~----~---~~~~~~~~~~~~iPilGIClGhQ~-l~~a~G~~~~kl~------ 302 (396) .++++|...++||+++.+-|= +..+- + ..+++..........|.||+|.|+ |-..+|-+-..++ T Consensus 89 ~~f~~ik~~~~DGlIITGAPv--E~l~FeeV~YW~El~~I~dWs~~~v~Stl~iCWaAqAaL~~~ygI~K~~l~~KlfGV 166 (298) T pfam04204 89 RTFDEIKDEKFDGLIITGAPV--EHLPFEDVDYWEELTEIMDWSKTNVTSTLHICWGAQAALYHFYGIPKHLLPEKLFGV 166 (298) T ss_pred HHHHHHHHCCCCEEEECCCCC--CCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHCCCCCCCCCCEEEE T ss_conf 719999658875699728987--778855374199999999999862544045749999999998199800178745885 Q ss_pred CCCCCC--CCCCEECCCCEEEEECCCCCEEEEHHHC--CCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCH Q ss_conf 675210--1123477897099961457615703436--998289999889980872686799778983487887767001 Q gi|254780420|r 303 QGHHGV--NHPVKNISNGTVEIVSMNHGFAIDASSL--PAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDS 378 (396) Q Consensus 303 ~GhrG~--N~Pv~~~~t~kv~iTsqNHg~~v~~~sl--p~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~ 378 (396) |.|+-. +||...--+.. +..-|..--.|+.+.+ ..+|+|...+...|... +..++....=+|-|||= |+ T Consensus 167 f~h~~~~~~~pLlrGfdd~-f~~PhSR~t~i~~~~i~~~~~L~iLa~s~~~G~~l-~~~~d~r~vf~~GH~EY-----d~ 239 (298) T pfam04204 167 FRHRVLDPHSPLLRGFDDE-FYAPHSRHTEVRREDIEAVPGLEILAESEEAGVYL-VASKDGRQVFVTGHPEY-----DA 239 (298) T ss_pred ECCCCCCCCCCCCCCCCCC-CCCCCCCCCCCCHHHHHCCCCCEEEECCCCCCEEE-EECCCCCEEEEECCCCC-----CH T ss_conf 1234578998010478765-30674233469988971378847986778776179-98289886888278666-----75 Q ss_pred HHHHHHHHHHHHHHC Q ss_conf 689999999999735 Q gi|254780420|r 379 YYLFSTFIDFMHKRK 393 (396) Q Consensus 379 ~~~F~~F~~~i~~~k 393 (396) .-|-+++..-+.+.. T Consensus 240 ~TL~~EY~RD~~~g~ 254 (298) T pfam04204 240 DTLKDEYERDLAKGL 254 (298) T ss_pred HHHHHHHHHHHHCCC T ss_conf 489999999997699 No 97 >COG0311 PDX2 Predicted glutamine amidotransferase involved in pyridoxine biosynthesis [Coenzyme metabolism] Probab=97.09 E-value=0.0048 Score=42.12 Aligned_cols=151 Identities=20% Similarity=0.337 Sum_probs=74.8 Q ss_pred HHHHHHHC-CCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC-----CCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 32222202-4541330465744644135788998588888410112-----11012221025786347861005887875 Q gi|254780420|r 220 LRVLSSLG-CRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV-----YSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 220 lr~L~~~g-~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~-----~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) ++.|.+.+ .++..+.. .+++.. -||++|.+|-.. .... ...+.++++...++|+||-|-|+=+||.- T Consensus 16 ~~~l~~~~~~e~~~Vk~---~~dL~~--~d~LIiPGGEST--Ti~rL~~~~gl~e~l~~~~~~G~Pv~GTCAGlIlLake 88 (194) T COG0311 16 LEALEKAGGAEVVEVKR---PEDLEG--VDGLIIPGGEST--TIGRLLKRYGLLEPLREFIADGLPVFGTCAGLILLAKE 88 (194) T ss_pred HHHHHHHCCCCEEEECC---HHHHCC--CCEEEECCCCHH--HHHHHHHHCCCHHHHHHHHHCCCCEEEECHHHHHHHHH T ss_conf 99998605873599717---788553--767895495078--99999987384899999997699647751425653465 Q ss_pred CCCCCCCCCCCCCCCCCCCEECCCCEEEEECCCCCEEE--EHH-------------------HCCCCCEEEEEECCCCEE Q ss_conf 28544667767521011234778970999614576157--034-------------------369982899998899808 Q gi|254780420|r 294 LGAQTVKMYQGHHGVNHPVKNISNGTVEIVSMNHGFAI--DAS-------------------SLPAGLEETHISLFDSSN 352 (396) Q Consensus 294 ~G~~~~kl~~GhrG~N~Pv~~~~t~kv~iTsqNHg~~v--~~~-------------------slp~~~~v~~~~~nD~ti 352 (396) .-..... .+-| +.|+.-.|-.+-.|--+|.. +-+ ..-++.++.+. + |+-| T Consensus 89 i~~~~~~---~~Lg----~mdi~V~RNAfGRQ~dSFe~~~di~~~~~~~~~~avFIRAP~I~~vg~~V~vLa~-l-~~~i 159 (194) T COG0311 89 ILDGPEQ---PLLG----LLDVTVRRNAFGRQVDSFETELDIEGFGLPFPFPAVFIRAPVIEEVGDGVEVLAT-L-DGRI 159 (194) T ss_pred HCCCCCC---CCCC----EEEEEEECCCCCCCCCCCEEEEEEECCCCCCCCEEEEEECCEEEHHCCCCEEEEE-C-CCEE T ss_conf 5089877---5212----2789998136554312301067864136777623999986402000676067666-0-8879 Q ss_pred EEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 7268679977898348788776700168999999999973 Q gi|254780420|r 353 CGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKR 392 (396) Q Consensus 353 egi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~ 392 (396) - .-+..++.+.-||||-+ |..-+-+-|+++...+ T Consensus 160 V--av~qgn~LatsFHPELT----~D~r~Heyf~~~v~~~ 193 (194) T COG0311 160 V--AVKQGNILATSFHPELT----DDTRLHEYFLDMVLGA 193 (194) T ss_pred E--EEEECCEEEEECCCCCC----CCCCHHHHHHHHHHCC T ss_conf 9--99867889985175435----8740999999986516 No 98 >PRK13896 cobyrinic acid a,c-diamide synthase; Provisional Probab=97.06 E-value=0.027 Score=36.84 Aligned_cols=252 Identities=16% Similarity=0.178 Sum_probs=116.4 Q ss_pred EEEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCH-HHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCC Q ss_conf 0236664046544345524179899997427832112576-899999985398423663276545897898887741588 Q gi|254780420|r 89 GAVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDT-RALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSG 167 (396) Q Consensus 89 ~~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDT-RaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~ 167 (396) .+.|+|.+..... .| ..-|.+=|++ +||.+--|=- ..|..-=|.-|-..+ . + ....+++.+.+.+..+ T Consensus 140 ~iaGVIlN~v~s~-rh---~~~l~~al~~-~i~vlG~lPr~~~l~lp~RHLGL~~~----~-E-~~~~~~~~~~~~~~vD 208 (432) T PRK13896 140 DVAGVLAQRAHGG-RH---ADGIRDALPD-ELTYFGRVPPRSDLEIPDRHLGLHMG----S-E-APIDDDALDAAAAHID 208 (432) T ss_pred EEEEEEECCCCCH-HH---HHHHHHHHHC-CCCEEEEECCCCCCCCCCCCCCCCCC----C-C-CCHHHHHHHHHHHHCC T ss_conf 4766884267758-89---9999999870-89489884247778984102598733----3-4-4238999999986478 Q ss_pred CCCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEE-----ECCCCCHHHHHHHHHCCCCEEEECCCCHHHH Q ss_conf 74321011013333211168653445434434677777149999-----6796410232222202454133046574464 Q gi|254780420|r 168 LKGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCI-----DYGIRSNLLRVLSSLGCRITIVGAETSYKDI 242 (396) Q Consensus 168 ~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvI-----DfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i 242 (396) ++. +.+. ....+. .....+....+.+|++- .|=...| ++.|.+. .++ ++.+--.++- T Consensus 209 l~~--ll~~-a~~~~~-----------~~~~~~~~~~~~rIavA~D~AF~FyY~dn-l~~L~~~-ael--v~FSPl~D~~ 270 (432) T PRK13896 209 IER--IVDV-ARAPPR-----------PEAHDEAPDTDQTVAVADDSAFCFRYPAT-RERLRER-ADV--VTFSPVAGDP 270 (432) T ss_pred HHH--HHHH-HCCCCC-----------CCCCCCCCCCCCEEEEEECCCCCCCCHHH-HHHHHHC-CCE--EEECCCCCCC T ss_conf 999--9987-508888-----------87776668889769999758627017899-9999847-888--9876635798 Q ss_pred HHCCCCEEEECCCCCCCCCCCCC-----CHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCCEECCC Q ss_conf 41357889985888884101121-----1012221025786347861005887875285446677675210112347789 Q gi|254780420|r 243 MSLQPDGVLLSNGPGDPAVTSVY-----SSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPVKNISN 317 (396) Q Consensus 243 ~~~~pdgi~lSnGPGdP~~~~~~-----~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t 317 (396) +- +.|||+|.+| -|+..... ....+++....+.||++-|=|...|+..+-. ..|..||.-.+.. T Consensus 271 LP-~~dglylgGG--yPEl~a~~Ls~n~~l~si~~~a~~G~PiyAECGGlMyL~~~l~~--------~~g~~~~M~Gvlp 339 (432) T PRK13896 271 LP-DCDGVYLPGG--YPELHGEALADGPALDELGARAADGLPVLGECGGLMALAESLTT--------TDGDTYEMAGVLP 339 (432) T ss_pred CC-CCCEEEECCC--CCHHHHHHHHCCHHHHHHHHHHHCCCCEEEECCCHHHHHHHHCC--------CCCCEEEEEEEEC T ss_conf 99-8898984799--80378999860878999999998699789987308998435165--------7998877251354 Q ss_pred CEEEEEC--CCCCEEEE---HHHC--CCCCEE----EEEE------------------CCCCEEEEEEECCCCEEEECCC Q ss_conf 7099961--45761570---3436--998289----9998------------------8998087268679977898348 Q gi|254780420|r 318 GTVEIVS--MNHGFAID---ASSL--PAGLEE----THIS------------------LFDSSNCGFRLIDQPVFSVQYH 368 (396) Q Consensus 318 ~kv~iTs--qNHg~~v~---~~sl--p~~~~v----~~~~------------------~nD~tiegi~~~~~~~~~vQfH 368 (396) .++.+|. |.+||... .+++ +.+-.+ .|++ -.++.-+|+..++.-.-=+-+| T Consensus 340 ~~~~m~~r~~~lGY~~~~~~~~~~~~~~G~~~rGHEFHYS~~~~~~~~~~~~~~~rg~g~~~~~dG~~~~n~~asY~HlH 419 (432) T PRK13896 340 ADVRMQDRYQALDHVELRATGDTLTAGSGAHLRGHEFHYSAATVASDARFAFAVERGDGIDGDHDGLTEYRTLGTYAHVH 419 (432) T ss_pred EEEEECCCCCCCCCEEEEECCCCCCCCCCCEEEEECCCCCCCCCCCCCCEEEEEECCCCCCCCCCEEEECCEEEEEEECC T ss_conf 28897778554475599963786246899889885752741755898854689861678888857687389999863326 Q ss_pred CCCCCCCCCHHHHHHHHHHH Q ss_conf 78877670016899999999 Q gi|254780420|r 369 PESSPGPQDSYYLFSTFIDF 388 (396) Q Consensus 369 PEa~PGP~D~~~~F~~F~~~ 388 (396) |+|. -|++|++- T Consensus 420 ~~s~--------~~~rFv~~ 431 (432) T PRK13896 420 PESG--------AFDRFLEA 431 (432) T ss_pred CCCC--------HHHHHHHC T ss_conf 7852--------78888740 No 99 >pfam07685 GATase_3 CobB/CobQ-like glutamine amidotransferase domain. Probab=97.05 E-value=0.0027 Score=43.89 Aligned_cols=48 Identities=29% Similarity=0.527 Sum_probs=32.8 Q ss_pred CCCCEEEECCCCCCCCCCC------CCCHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 3578899858888841011------2110122210257863478610058878752 Q gi|254780420|r 245 LQPDGVLLSNGPGDPAVTS------VYSSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 245 ~~pdgi~lSnGPGdP~~~~------~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) .++|+|+|-.|- |.... ....+.+++..+...||+|||-|.|+|+..+ T Consensus 6 ~~~D~lilpGg~--~~~~~~~~~~~~~~~~~i~~~~~~G~pi~giCGG~q~Lg~~i 59 (158) T pfam07685 6 GDADLVILPGGK--PTIQDLALLRNSGLDEALREHAEAGGPILGICGGYQMLGETI 59 (158) T ss_pred CCCCEEEECCCC--CHHHHHHHHHHCCHHHHHHHHHHCCCCEEEECHHHHHHHHHH T ss_conf 998799989954--099999998775579999999977996999875889876550 No 100 >cd03128 GAT_1 Type 1 glutamine amidotransferase (GATase1)-like domain. Type 1 glutamine amidotransferase (GATase1)-like domain. This group contains proteins similar to Class I glutamine amidotransferases, the intracellular PH1704 from Pyrococcus horikoshii, the C-terminal of the large catalase: Escherichia coli HP-II, Sinorhizobium meliloti Rm1021 ThuA, the A4 beta-galactosidase middle domain and peptidase E. The majority of proteins in this group have a reactive Cys found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For Class I glutamine amidotransferases proteins which transfer ammonia from the amide side chain of glutamine to an acceptor substrate, this Cys forms a Cys-His-Glu catalytic triad in the active site. Glutamine amidotransferases activity can be found in a range of biosynthetic enzymes included in this cd: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase, anthranilate synthase component II, glutamin Probab=96.86 E-value=0.0012 Score=46.33 Aligned_cols=73 Identities=30% Similarity=0.538 Sum_probs=50.8 Q ss_pred HHHHHHHHHCCCCEEEECCCCHH--HHHHCCCCEEEECCCCCCCCCCC--CCCHHHHHHHCCCCCCEEEEECHHHHH Q ss_conf 02322222024541330465744--64413578899858888841011--211012221025786347861005887 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYK--DIMSLQPDGVLLSNGPGDPAVTS--VYSSPIICKLVDSGMPMFGICLGHQLL 290 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~--~i~~~~pdgi~lSnGPGdP~~~~--~~~~~~~~~~~~~~iPilGIClGhQ~l 290 (396) .....+.+.++.+.+++...... .....++|++++.+|+..+.... ......+++....+.||+|+|.|.|++ T Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vii~gg~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~c~G~~~l 92 (92) T cd03128 16 SPLDALREAGAEVDVVSPDGGPVESDVDLDDYDGLILPGGPGTPDDLAWDEALLALLREAAAAGKPVLGICLGAQLL 92 (92) T ss_pred HHHHHHHHCCCEEEEEECCCCCCCCCCCHHHCCEEEECCCCCCCCCCHHCHHHHHHHHHHHHCCCEEEEECCHHHHC T ss_conf 99999997799799995389964453686458899978998664210109899999999998799799999047539 No 101 >cd03169 GATase1_PfpI_1 Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. PfpI is an ATP-independent intracellular proteases which may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=96.72 E-value=0.0024 Score=44.27 Aligned_cols=60 Identities=18% Similarity=0.390 Sum_probs=46.6 Q ss_pred EECCCCHHHHHHCCCCEEEECCCCCCCCCC--CCCCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 304657446441357889985888884101--1211012221025786347861005887875 Q gi|254780420|r 233 VGAETSYKDIMSLQPDGVLLSNGPGDPAVT--SVYSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 233 vp~~~~~~~i~~~~pdgi~lSnGPGdP~~~--~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) +..+.+++++...++|+||+.+|+| |... +.....+++++.+.++||-.||-|-++|+-+ T Consensus 63 ~~~~~~~~dv~~~~yDai~iPGG~g-~~~l~~d~~~~~li~~~~~~gK~iaaIChgp~~L~~a 124 (180) T cd03169 63 FAVTADFDEVDPDDYDALVIPGGRA-PEYLRLDEKVLAIVRHFAEANKPVAAICHGPQILAAA 124 (180) T ss_pred EECCCCHHHCCHHHCCEEEECCCCC-HHHHHCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHC T ss_conf 4267876789861099999899822-8777119799999999998199699778168999876 No 102 >COG1492 CobQ Cobyric acid synthase [Coenzyme metabolism] Probab=96.30 E-value=0.021 Score=37.61 Aligned_cols=216 Identities=19% Similarity=0.226 Sum_probs=99.4 Q ss_pred CCCEEEECCCCCCCCCCCH--HH---------HCCCCCCCCCEEEEEEEECCCCCC--CCCCC----CCCHHHHHHHCCC Q ss_conf 5617321689736845882--34---------204653332102366640465443--45524----1798999974278 Q gi|254780420|r 58 LGQIVNFTFPHIGNVGVNS--ED---------FESISRKNFKGAVGLVIKAEITEP--SNYRA----NMHFDAWLKSCGI 120 (396) Q Consensus 58 ~gQIlvfT~P~IGNyGvn~--~d---------~ES~~~~~~~~~~g~Iv~~~~~~~--Sn~rs----~~sL~~wL~~~~I 120 (396) --|+|++--| ||+.--.. ++ .||.. .....-.+||.|-.-.| .|.|. .+.+.+.. +-.| T Consensus 85 ~sQVIv~G~~-~G~~s~~~yy~~~~~~l~~~v~~s~~--~l~~~~d~Vv~EGAGSpaEiNlr~~Di~Nm~~a~~~-dapv 160 (486) T COG1492 85 GSQVIVMGKD-IGRKSAVEYYQEGKGLLWVAVKESLE--RLDREYDVVVIEGAGSPAEINLRDRDIANMGVAEIA-DAPV 160 (486) T ss_pred CEEEEEECCC-CCCCCHHHHHHHHHHHHHHHHHHHHH--HHHHCCCEEEEECCCCHHHCCCCCCCCCCEEEEHHC-CCCE T ss_conf 4479994634-46657178899988999999999998--754105589996379823227663661100220233-8988 Q ss_pred CEECCCCH------HHHHHHH---HHCCCCCEEEECCCCCCCC-HHHHHHHHHHCCCCCCCCHHHCCCCCCCEECCCCCC Q ss_conf 32112576------8999999---8539842366327654589-789888774158874321011013333211168653 Q gi|254780420|r 121 IGLSGIDT------RALTVWI---RDYGVSNSVIAHHPDGQFD-LEDLKERAKNWSGLKGAELAKHVTVSQRRDWSEKIW 190 (396) Q Consensus 121 pgI~GIDT------RaLTk~I---Re~G~~~a~I~~~~~~~~d-~~~~~~~~~~~~~~~~~dL~~~Vs~~~~~~~~~~~~ 190 (396) .-..+||- +.=|..| .+.+-.+|.|.+.-.++.+ ++...+.+.+..+..-....+.. +... T Consensus 161 ILV~DIdRGGvfAsl~GT~~lL~~~~r~~VkG~iINkFRGd~~ll~~gik~Le~~tg~~vlGv~P~~---------~~~~ 231 (486) T COG1492 161 ILVGDIDRGGVFASLYGTLALLPESDRALVKGFIINKFRGDESLLDPGLKWLEELTGVPVLGVLPYL---------KDAL 231 (486) T ss_pred EEEEEECCCCEEEEEEEHHEECCHHHHCCEEEEEEEEECCCHHHHHHHHHHHHHHHCCEEEEECCCC---------CCCC T ss_conf 9999511564541730212236876760014799963179878874599999986197467565255---------5455 Q ss_pred CCCCCCC-----CCCCCCCCCEEEEEECC--CCCHHHHHHHH-HCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCC Q ss_conf 4454344-----34677777149999679--64102322222-0245413304657446441357889985888884101 Q gi|254780420|r 191 KWGEETS-----FLKSSDAKYHVVCIDYG--IRSNLLRVLSS-LGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVT 262 (396) Q Consensus 191 ~~~~~~~-----~~~~~~~~~kIvvIDfG--iK~nIlr~L~~-~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~ 262 (396) ...+.+- ....+....+|+++-+- .-+.-+.-|.. .+.+|.+++...+..+ ||.|+| ||.=+.. T Consensus 232 ~p~EDS~~~~~~~~~~~~~~i~Iav~~lp~isNFtD~dpL~~~~~v~v~~v~~~~~l~~-----~dlvIl---PGsk~t~ 303 (486) T COG1492 232 RPAEDSLSLEQPKAGGNKRAIRIAVIRLPRISNFTDFDPLRAEPDVRVRFVKPGSDLRD-----ADLVIL---PGSKNTI 303 (486) T ss_pred CCCCCCCCCHHHCCCCCCCCEEEEEECCCCCCCCCCCHHHHCCCCEEEEEECCCCCCCC-----CCEEEE---CCCCCCH T ss_conf 76420227000102589775499996578755644521465279807999616777899-----888994---7985608 Q ss_pred CCC-------CHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 121-------10122210257863478610058878752 Q gi|254780420|r 263 SVY-------SSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 263 ~~~-------~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) .+. ..+.+.+....+.||+|||=|+|.|+... T Consensus 304 ~DL~~lr~~g~d~~i~~~~~~~~~viGICGG~QmLG~~i 342 (486) T COG1492 304 ADLKILREGGMDEKILEYARKGGDVIGICGGYQMLGRRL 342 (486) T ss_pred HHHHHHHHCCHHHHHHHHHHCCCCEEEECCHHHHHHHHH T ss_conf 899999976979999999747997999863577643341 No 103 >cd03135 GATase1_DJ-1 Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1. Type 1 glutamine amidotransferase (GATase1)-like domain found in Human DJ-1. DJ-1 is involved in multiple physiological processes including cancer, Parkinson's disease and male fertility. It is unclear how DJ-1 functions in these. DJ-1 has been shown to possess chaperone activity. DJ-1 is preferentially expressed in the testis and moderately in other tissues; it is induced together with genes involved in oxidative stress response. The Drosophila homologue (DJ-1A) plays an essential role in oxidative stress response and neuronal maintenance. Inhibition of DJ-1A function through RNAi, results in the cellular accumulation of reactive oxygen species, organismal hypersensitivity to oxidative stress, and dysfunction and degeneration of dopaminergic and photoreceptor neurons. DJ-1 has lacks enzymatic activity and the catalytic triad of typical GATase1 domains, however it does contain the highly Probab=96.01 E-value=0.0076 Score=40.74 Aligned_cols=61 Identities=16% Similarity=0.216 Sum_probs=46.2 Q ss_pred EECCCCHHHHHHCCCCEEEECCCCCCCCCC--CCCCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 304657446441357889985888884101--1211012221025786347861005887875 Q gi|254780420|r 233 VGAETSYKDIMSLQPDGVLLSNGPGDPAVT--SVYSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 233 vp~~~~~~~i~~~~pdgi~lSnGPGdP~~~--~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) +..+.+++++...++|++++.+|++.+... +......++++.+.+.||.+||-|-++|+.| T Consensus 47 i~~d~~l~~~~~~~~D~liipGG~~~~~~l~~~~~~~~~i~~~~~~~k~i~aiC~G~~~La~a 109 (163) T cd03135 47 VKADKTLSDVNLDDYDAIVIPGGLPGAQNLADNEKLIKLLKEFNAKGKLIAAICAAPAVLAKA 109 (163) T ss_pred EECCCCHHHCCCCCCCEEEECCCCCHHHHHHHCHHHHHHHHHHHHHCCEEEEECHHHHHHHHC T ss_conf 946876655782017989986898217677669999999999998299479864548999986 No 104 >PRK00784 cobyric acid synthase; Provisional Probab=95.98 E-value=0.018 Score=38.14 Aligned_cols=156 Identities=22% Similarity=0.251 Sum_probs=77.8 Q ss_pred HCCCCEECCCC----------HHHHH-HHHHHCCCCCEEEECCCCCCCC-HHHHHHHHHHCCC---------CCCCCHHH Q ss_conf 42783211257----------68999-9998539842366327654589-7898887741588---------74321011 Q gi|254780420|r 117 SCGIIGLSGID----------TRALT-VWIRDYGVSNSVIAHHPDGQFD-LEDLKERAKNWSG---------LKGAELAK 175 (396) Q Consensus 117 ~~~IpgI~GID----------TRaLT-k~IRe~G~~~a~I~~~~~~~~d-~~~~~~~~~~~~~---------~~~~dL~~ 175 (396) ...|.-+.+|| |.+|- ..-|+ -.+|.|.+.-.++.+ ++...+.+++..+ +++..|.. T Consensus 158 ~apviLV~DIdRGGvfAsl~GT~~lL~~~eR~--li~G~IiNKFRGD~~ll~pG~~~le~~tg~PvlGviP~~~~l~l~~ 235 (492) T PRK00784 158 DAPVILVADIDRGGVFASLVGTLALLSPEERA--RVKGFVINKFRGDISLLEPGLDWLEELTGIPVLGVLPYLDDLRLPA 235 (492) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHCCHHHHH--HEEEEEEEEECCCHHHHCCHHHHHHHHHCCCCEEEECCCCCCCCCC T ss_conf 99889999756764268776388759998871--1589999764587466355999999986898068614656799881 Q ss_pred CCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHH--HHHHH-HHCCCCEEEECCCCHHHHHHCCCCEEEE Q ss_conf 01333321116865344543443467777714999967964102--32222-2024541330465744644135788998 Q gi|254780420|r 176 HVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCIDYGIRSNL--LRVLS-SLGCRITIVGAETSYKDIMSLQPDGVLL 252 (396) Q Consensus 176 ~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvIDfGiK~nI--lr~L~-~~g~~v~Vvp~~~~~~~i~~~~pdgi~l 252 (396) +=|... ...........+.+|+||.+=-=-|- +.-|. +-++++..++. .+++- +||.|+| T Consensus 236 EDs~~l------------~~~~~~~~~~~~l~IaVi~lP~iSNftDfdpL~~e~~V~l~~v~~---~~~l~--~~D~vIl 298 (492) T PRK00784 236 EDSLDL------------LLERAAGKGAGRLRVAVPRLPRISNFTDFDPLRAEPGVDVRYVRP---GEPLP--DADLVIL 298 (492) T ss_pred CCCCCC------------CCCCCCCCCCCCEEEEEEECCCCCCCCCCHHHHHCCCEEEEEECC---CCCCC--CCCEEEE T ss_conf 015754------------344556777773589998067556865443564078728999789---76568--9999997 Q ss_pred CCCCCCCCCCCCC-------CHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 5888884101121-------10122210257863478610058878752 Q gi|254780420|r 253 SNGPGDPAVTSVY-------SSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 253 SnGPGdP~~~~~~-------~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) ||.-....+. ....+++....+.||+|||=|.|.|+... T Consensus 299 ---PGsK~Ti~DL~~Lr~~Gl~~~i~~~~~~G~~V~GICGGyQMLG~~I 344 (492) T PRK00784 299 ---PGSKNTIADLAWLRESGWDEAIRAHARRGGPVLGICGGYQMLGRTI 344 (492) T ss_pred ---CCCCCCHHHHHHHHHCCHHHHHHHHHHCCCCEEEECCHHHHHCCEE T ss_conf ---9986648999999986949999999975997899870288630200 No 105 >cd03140 GATase1_PfpI_3 Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. PfpI is an ATP-independent intracellular proteases which may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=95.81 E-value=0.013 Score=39.02 Aligned_cols=64 Identities=19% Similarity=0.216 Sum_probs=47.1 Q ss_pred CEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 41330465744644135788998588888410112110122210257863478610058878752 Q gi|254780420|r 230 ITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 230 v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) +.+.| +.+++++...++|.|+|-+|+.-.........+.++++.+.+.+|.+||-|-++|+.|- T Consensus 45 ~~i~~-D~~l~~v~~~~~D~lvlPGG~~~~~~~~~~l~~~l~~~~~~gk~iaaIC~g~~vLa~aG 108 (170) T cd03140 45 LRVVP-DYSLDDLPPEDYDLLILPGGDSWDNPEAPDLAGLVRQALKQGKPVAAICGATLALARAG 108 (170) T ss_pred CEEEC-CCCHHHCCCCCCCEEEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEECHHHHHHHHCC T ss_conf 47956-75868889021778998688751002499999999999874990999865589999859 No 106 >KOG3210 consensus Probab=95.58 E-value=0.18 Score=31.04 Aligned_cols=75 Identities=17% Similarity=0.360 Sum_probs=42.7 Q ss_pred HHH-HHHHHH--CCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCH-----HHHHHHC-CCCCCEEEEECHHH Q ss_conf 023-222220--24541330465744644135788998588888410112110-----1222102-57863478610058 Q gi|254780420|r 218 NLL-RVLSSL--GCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSS-----PIICKLV-DSGMPMFGICLGHQ 288 (396) Q Consensus 218 nIl-r~L~~~--g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~-----~~~~~~~-~~~iPilGIClGhQ 288 (396) |++ |++.+- +..+.|.|..+ .+|.. +-||+++-+|-.. ..+..+. ..+-.+. +..+|++|-|-|+- T Consensus 28 N~~~~c~~en~y~Ik~~~~tVKT-~~D~a--q~DaLIIPGGEST--~mslia~~tgL~d~L~~fVhn~~k~~WGTCAGmI 102 (226) T KOG3210 28 NHVEKCIVENRYEIKLSVMTVKT-KNDLA--QCDALIIPGGEST--AMSLIAERTGLYDDLYAFVHNPSKVTWGTCAGMI 102 (226) T ss_pred HHHHHHHCCCCCEEEEEEEEECC-HHHHH--HCCEEEECCCCCH--HHHHHHHHHHHHHHHHHHHCCCCCCEEEECHHHH T ss_conf 78877530586267888886157-77786--5777783398502--6788874410177799984488864034102253 Q ss_pred HHHHHCCCC Q ss_conf 878752854 Q gi|254780420|r 289 LLGLALGAQ 297 (396) Q Consensus 289 ~l~~a~G~~ 297 (396) +|+..+-++ T Consensus 103 ~LS~ql~ne 111 (226) T KOG3210 103 YLSQQLSNE 111 (226) T ss_pred HHHHHHCCC T ss_conf 105655288 No 107 >cd03130 GATase1_CobB Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. CobB plays a role in cobalamin biosythesis catalyzing the conversion of cobyrinic acid to cobyrinic acid a,c-diamide. CobB belongs to the triad family of amidotransferases. Two of the three residues of the catalytic triad that are involved in glutamine binding, hydrolysis and transfer of the resulting ammonia to the acceptor substrate in other triad aminodotransferases are conserved in CobB. Probab=95.37 E-value=0.11 Score=32.63 Aligned_cols=78 Identities=23% Similarity=0.414 Sum_probs=51.0 Q ss_pred ECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCC------CHHHHHHHCCCCCCEEEEEC Q ss_conf 679641023222220245413304657446441357889985888884101121------10122210257863478610 Q gi|254780420|r 212 DYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVY------SSPIICKLVDSGMPMFGICL 285 (396) Q Consensus 212 DfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~------~~~~~~~~~~~~iPilGICl 285 (396) .|=-..| ++.|.+.|++++-+..-. ++-+- +-|||.|.. |-|+..... ....+++..+.+.||++.|= T Consensus 10 ~FyY~en-l~~L~~~Gaelv~FSPl~--d~~lP-~~d~lylgG--GyPE~~a~~Ls~N~~~~~~ir~~~~~G~PiyAECG 83 (198) T cd03130 10 NFYYPEN-LELLEAAGAELVPFSPLK--DEELP-DADGLYLGG--GYPELFAEELSANQSMRESIRAFAESGGPIYAECG 83 (198) T ss_pred CEECHHH-HHHHHHCCCEEEEECCCC--CCCCC-CCCEEECCC--CCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECH T ss_conf 5557999-999998899789877767--88899-876898279--98758899997679999999999984997799820 Q ss_pred HHHHHHHHCC Q ss_conf 0588787528 Q gi|254780420|r 286 GHQLLGLALG 295 (396) Q Consensus 286 GhQ~l~~a~G 295 (396) |++.|+..+- T Consensus 84 GlMyL~~~i~ 93 (198) T cd03130 84 GLMYLGESLD 93 (198) T ss_pred HHHHHHHHCC T ss_conf 4888742205 No 108 >pfam01965 DJ-1_PfpI DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators. Probab=95.35 E-value=0.017 Score=38.26 Aligned_cols=63 Identities=19% Similarity=0.326 Sum_probs=48.1 Q ss_pred EEEECCCCHHHHHHCCCCEEEECCCCCCCCCC--CCCCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 13304657446441357889985888884101--1211012221025786347861005887875 Q gi|254780420|r 231 TIVGAETSYKDIMSLQPDGVLLSNGPGDPAVT--SVYSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 231 ~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~--~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) ..+..+.+++++...++|.|++..|.+-+... +......++++.+.+.+|.+||-|-.+|+.+ T Consensus 19 ~~v~~d~~~~~~~~~~~D~lvipGG~~~~~~l~~~~~~~~~l~~~~~~~~~i~aiC~g~~~La~~ 83 (141) T pfam01965 19 VKLRADKTLDDVNADDYDALVIPGGHAAAEDLRDDEKLVKFVKEFYEAGKPIAAICHGPVLLAAA 83 (141) T ss_pred CEEECCCCHHHCCHHHCCEEEECCCCCHHHHHHHCHHHHHHHHHHHHHCCEEEEECHHHHHHHHC T ss_conf 78967701888895249999996994067664019999999999998399899987668999986 No 109 >TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=94.84 E-value=0.13 Score=32.06 Aligned_cols=164 Identities=18% Similarity=0.324 Sum_probs=103.0 Q ss_pred CCHHHHHHHC--CCCEECCCCHHHHHHHHHHCCCCCEEEECC-CCCCCC------HHHHHHHHHHCCCCCCCC-HHHCCC Q ss_conf 7989999742--783211257689999998539842366327-654589------789888774158874321-011013 Q gi|254780420|r 109 MHFDAWLKSC--GIIGLSGIDTRALTVWIRDYGVSNSVIAHH-PDGQFD------LEDLKERAKNWSGLKGAE-LAKHVT 178 (396) Q Consensus 109 ~sL~~wL~~~--~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~-~~~~~d------~~~~~~~~~~~~~~~~~d-L~~~Vs 178 (396) ..|.+-|+.. +..+|..+|. .+=+...+.|=.=+.|+.. ...+.+ +.++.+.+.-.|..+--| ++.+-- T Consensus 471 V~~e~~l~~~~n~~~~I~~~~~-e~L~~aK~lGFSD~~iA~~~~~~~~~~~~~~~vR~~R~~~gi~Pv~K~vDTcAAEF~ 549 (1089) T TIGR01369 471 VELEKELEELKNKLTGIEELDK-ELLRKAKKLGFSDAQIARLVNVTEAESIEELEVRKLRKELGIIPVYKRVDTCAAEFE 549 (1089) T ss_pred HHHHHHHHHHHCCCCCCCCCCH-HHHHHHHHCCCCHHHHHHHHCCCHHHCCCHHHHHHHHHHCCCEEEEEEEECCCCCCC T ss_conf 9999998753012445334898-899998641888789999868974442006899999984697477877523212020 Q ss_pred CCCCEECCCCCCCCC-CCCCCCCCCCCCCEEEEE-------------ECCCCCHHHHHHHHHCCCCEEEECC-----CCH Q ss_conf 333211168653445-434434677777149999-------------6796410232222202454133046-----574 Q gi|254780420|r 179 VSQRRDWSEKIWKWG-EETSFLKSSDAKYHVVCI-------------DYGIRSNLLRVLSSLGCRITIVGAE-----TSY 239 (396) Q Consensus 179 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~kIvvI-------------DfGiK~nIlr~L~~~g~~v~Vvp~~-----~~~ 239 (396) .+.+|-| .++... ++..+ ..+.|++|++| ||+.=+ -+..|++.|.+++++.|| |++ T Consensus 550 A~TpY~Y--STY~~e~~d~~~--~~~~Kk~~lvlGSGp~RIGqgvEFDYc~Vh-~~~aLr~~Gye~ImiN~NPETVSTDy 624 (1089) T TIGR01369 550 AKTPYLY--STYEGERDDVEF--TEKEKKKVLVLGSGPNRIGQGVEFDYCCVH-AVLALREAGYETIMINYNPETVSTDY 624 (1089) T ss_pred CCCCCCC--CCCCCCCCCCCC--CCCCCCEEEEECCCCEECCCCEEEHHHHHH-HHHHHHHCCCEEEEEECCCCCCCCCH T ss_conf 1568852--577666567666--657785689987845140663120567899-99999872995999977899743666 Q ss_pred HH-------------HH----HCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCC-CCEEEEEC Q ss_conf 46-------------44----1357889985888884101121101222102578-63478610 Q gi|254780420|r 240 KD-------------IM----SLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSG-MPMFGICL 285 (396) Q Consensus 240 ~~-------------i~----~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~-iPilGICl 285 (396) |. |+ ..+|+||++|=| | + +..++.+++-+.+ +||||-=. T Consensus 625 D~sDrLYFEplt~E~Vm~I~e~E~~~GVIVq~G-G---Q---tp~nlA~~L~~~GG~~iLGTS~ 681 (1089) T TIGR01369 625 DTSDRLYFEPLTFEDVMNIIELEKPEGVIVQFG-G---Q---TPLNLAKELEEAGGVPILGTSP 681 (1089) T ss_pred HHHHHCCCCCCCCCCCEEHEEECCCCEEEEECC-C---H---HHHHHHHHHHHCCCCEEECCCH T ss_conf 651011587635031001100058667999748-7---3---2678999999708931736885 No 110 >cd03134 GATase1_PfpI_like A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. This group includes proteins similar to PfpI from P. furiosus. and PH1704 from Pyrococcus horikoshii. These enzymes are ATP-independent intracellular proteases and may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For PH1704, it is believed that this Cys together with a different His in one monomer and Glu (from an adjacent monomer) forms a different catalytic triad from the typical GATase1domain. PfpI is homooligomeric. Protease activity is only found for oligomeric forms of PH1704. Probab=94.33 E-value=0.044 Score=35.34 Aligned_cols=60 Identities=20% Similarity=0.346 Sum_probs=45.5 Q ss_pred EECCCCHHHHHHCCCCEEEECCCCCCCCCC--CCCCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 304657446441357889985888884101--1211012221025786347861005887875 Q gi|254780420|r 233 VGAETSYKDIMSLQPDGVLLSNGPGDPAVT--SVYSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 233 vp~~~~~~~i~~~~pdgi~lSnGPGdP~~~--~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) +..+.+++++...++|+|+|-.|.| |... .......++++.+.+.||..||-|-++|+.+ T Consensus 49 v~~d~~l~~~~~~~yD~ivlPGG~g-~~~l~~~~~~~~~i~~~~~~~k~iaaIC~gp~~L~~~ 110 (165) T cd03134 49 VTVDLTIADVDADDYDALVIPGGTN-PDKLRRDPDAVAFVRAFAEAGKPVAAICHGPWVLISA 110 (165) T ss_pred EECCCCHHHCCHHHCCEEEECCCCC-HHHHHHCHHHHHHHHHHHHHCCCEEEECCHHHHHHHC T ss_conf 8678657998962288899789858-8888529899999999997199299998757877457 No 111 >PRK11574 hypothetical protein; Provisional Probab=94.12 E-value=0.074 Score=33.78 Aligned_cols=65 Identities=14% Similarity=0.119 Sum_probs=45.3 Q ss_pred EEECCCCHHHHHHCCCCEEEECCCCCCCCCC--CCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCC Q ss_conf 3304657446441357889985888884101--1211012221025786347861005887875285 Q gi|254780420|r 232 IVGAETSYKDIMSLQPDGVLLSNGPGDPAVT--SVYSSPIICKLVDSGMPMFGICLGHQLLGLALGA 296 (396) Q Consensus 232 Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~--~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~ 296 (396) .+..|..++++...++|+|+|-+|..-+... +....++++++.+.+.+|-.||-+=+++--..|. T Consensus 52 ~i~aD~~l~~v~~~~yD~ivlPGG~~ga~~L~~~~~v~~~lk~~~~~gK~iAAICAaP~~vL~~~gl 118 (196) T PRK11574 52 KLLADAPLVEVADGEYDVIVLPGGIKGAECFRDSPLLVETVKQFHRSGRIVAAICAAPATVLVPHDI 118 (196) T ss_pred EEECCCCHHHCCCCCCCEEECCCCCHHHHHHHHCHHHHHHHHHHHHCCCEEEEECHHHHHHHHHCCC T ss_conf 8957867777782558889917880379998739899999999998799199982406999987886 No 112 >PRK06278 cobyrinic acid a,c-diamide synthase; Validated Probab=94.10 E-value=0.11 Score=32.62 Aligned_cols=13 Identities=46% Similarity=0.971 Sum_probs=4.5 Q ss_pred CEEEEECHHHHHH Q ss_conf 3478610058878 Q gi|254780420|r 279 PMFGICLGHQLLG 291 (396) Q Consensus 279 PilGIClGhQ~l~ 291 (396) +|+|||-|+|+|+ T Consensus 68 ~IiGICsGFQiL~ 80 (482) T PRK06278 68 YILGICSGFQILA 80 (482) T ss_pred EEEEECHHHHHHH T ss_conf 5899611678877 No 113 >TIGR01857 FGAM-synthase phosphoribosylformylglycinamidine synthase; InterPro: IPR010141 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis, which represent a second clade of the enzymes found in Clostridia, Bifidobacteria and Streptococcus species. This enzyme performs the fourth step in IMP biosynthesis (the precursor of all purines) from PRPP.. Probab=93.80 E-value=0.2 Score=30.79 Aligned_cols=161 Identities=25% Similarity=0.351 Sum_probs=86.7 Q ss_pred CCCCCEEEEEEC-C--CCCHHHHHHHHHCCCCEEEECC-CCHHHHH--------HCCCCEEE-ECCC--CCC-CCCCCCC Q ss_conf 777714999967-9--6410232222202454133046-5744644--------13578899-8588--888-4101121 Q gi|254780420|r 202 SDAKYHVVCIDY-G--IRSNLLRVLSSLGCRITIVGAE-TSYKDIM--------SLQPDGVL-LSNG--PGD-PAVTSVY 265 (396) Q Consensus 202 ~~~~~kIvvIDf-G--iK~nIlr~L~~~g~~v~Vvp~~-~~~~~i~--------~~~pdgi~-lSnG--PGd-P~~~~~~ 265 (396) .-.|++|++==| | .=|-..+.+.+.|++|..++-+ -+.+.|. +++=.-|| |.+| -|| |.=..+. T Consensus 1013 ~v~KP~VvIpvFPGtNsEYD~~~aFe~~GAev~~~~~~nL~~~a~~~Sv~~l~~~i~ksqI~~lpGGFSAgDEPDGSAKF 1092 (1279) T TIGR01857 1013 KVEKPRVVIPVFPGTNSEYDSAKAFEKEGAEVKLVILRNLTEEALVESVEELVDEIDKSQILMLPGGFSAGDEPDGSAKF 1092 (1279) T ss_pred EEECCEEEEECCCCCCCHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCHHH T ss_conf 54078679813588984888888985458958996000388889999999998740464089955878777788852899 Q ss_pred ---------CHHHHHHHCC--CCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCC-------------C-CCCEECCCCEE Q ss_conf ---------1012221025--786347861005887875285446677675210-------------1-12347789709 Q gi|254780420|r 266 ---------SSPIICKLVD--SGMPMFGICLGHQLLGLALGAQTVKMYQGHHGV-------------N-HPVKNISNGTV 320 (396) Q Consensus 266 ---------~~~~~~~~~~--~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~-------------N-~Pv~~~~t~kv 320 (396) +.+.++.+++ .++ |||||=|+|+|- -+|. ||||.-.. | |=-+-+.| T Consensus 1093 i~~~l~npkv~~~v~~lL~~RDGL-iLGiCNGFQALv-K~GL----lPYG~i~~~~~~~pTLtyN~inrH~s~~V~T--- 1163 (1279) T TIGR01857 1093 IAAILRNPKVKEAVKKLLKERDGL-ILGICNGFQALV-KLGL----LPYGEIVDVEEDSPTLTYNKINRHVSKIVRT--- 1163 (1279) T ss_pred HHHHHHCHHHHHHHHHHHHHCCCE-EEEEEHHHHHHH-HHCC----CCCCCEEECCCCCCCCCCCCHHHHHEEEEEE--- T ss_conf 999760814799999999607984-986200567888-8247----8886424004578631202010120101235--- Q ss_pred EEECC------------------CCC---EEEEHHH---CCC-CCEEE-E------------EECCCCE---EEEEEECC Q ss_conf 99614------------------576---1570343---699-82899-9------------9889980---87268679 Q gi|254780420|r 321 EIVSM------------------NHG---FAIDASS---LPA-GLEET-H------------ISLFDSS---NCGFRLID 359 (396) Q Consensus 321 ~iTsq------------------NHg---~~v~~~s---lp~-~~~v~-~------------~~~nD~t---iegi~~~~ 359 (396) -|+|. .|| |.+...- |-. +=..| | +|.| || ||||..++ T Consensus 1164 ki~~~~SPWL~~~~vGdih~ipvSHGEGrFva~~~~l~~L~~NGQIatQYVD~~G~~~m~~~yNPN-GS~~AIEGI~S~d 1242 (1279) T TIGR01857 1164 KIASKNSPWLSEVSVGDIHAIPVSHGEGRFVASDEVLKELRENGQIATQYVDFEGKPAMDLKYNPN-GSSLAIEGIISPD 1242 (1279) T ss_pred EECCCCCCCCCCEEECCEEEEEEECCCCCEECCHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCC-CCCEEECCEECCC T ss_conf 650477884021032427753344489832307789989863796378987267863578866978-5302463445689 Q ss_pred CCEEEECCCCCCC Q ss_conf 9778983487887 Q gi|254780420|r 360 QPVFSVQYHPESS 372 (396) Q Consensus 360 ~~~~~vQfHPEa~ 372 (396) -.|||=.=|=|=. T Consensus 1243 G~i~GKMGHSER~ 1255 (1279) T TIGR01857 1243 GRIFGKMGHSERY 1255 (1279) T ss_pred CEEEEECCCEEEC T ss_conf 7275204520244 No 114 >COG3442 Predicted glutamine amidotransferase [General function prediction only] Probab=93.71 E-value=0.052 Score=34.87 Aligned_cols=164 Identities=19% Similarity=0.182 Sum_probs=80.0 Q ss_pred HHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCC-----CCCHHHHHHHCCCCCCEEEEECHHHHHHHHCC Q ss_conf 2222202454133046574464413578899858888841011-----21101222102578634786100588787528 Q gi|254780420|r 221 RVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTS-----VYSSPIICKLVDSGMPMFGICLGHQLLGLALG 295 (396) Q Consensus 221 r~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~-----~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G 295 (396) +...+||..|+|++-+-. +.+.....|-+|+..|- |-+..- ....+-++..++...|+|.||=|.|+|+..+- T Consensus 28 ~ra~~rgi~v~i~~vsl~-d~~~~~~~Dl~~~GGgq-D~eQ~i~t~d~~~k~~~l~~~i~~g~p~laiCgg~QlLG~yY~ 105 (250) T COG3442 28 QRAEKRGIKVEIVEVSLT-DTFPDDSYDLYFLGGGQ-DYEQEIATRDLLTKKEGLKDAIENGKPVLAICGGYQLLGQYYE 105 (250) T ss_pred HHHHHCCCCEEEEEEECC-CCCCCCCCCEEEECCCH-HHHHHHHHHHHCCCCHHHHHHHHCCCCEEEECCCHHHCCCEEE T ss_conf 789866981699996347-89874440279951720-7888887521001037799998669958998255533040321 Q ss_pred ----CCCCCCC-CCCCCCCCCCEECCCCEEEEE--------------CCCCCEE-E-EHHHCCCCCEEEEEECC--CCEE Q ss_conf ----5446677-675210112347789709996--------------1457615-7-03436998289999889--9808 Q gi|254780420|r 296 ----AQTVKMY-QGHHGVNHPVKNISNGTVEIV--------------SMNHGFA-I-DASSLPAGLEETHISLF--DSSN 352 (396) Q Consensus 296 ----~~~~kl~-~GhrG~N~Pv~~~~t~kv~iT--------------sqNHg~~-v-~~~slp~~~~v~~~~~n--D~ti 352 (396) -++.-+- .+|--.| |-.+-..|.+.|+ -.||+=. . .++.-|-+ +|.+=.-| ++-- T Consensus 106 ~a~G~ri~GlGiLd~~T~~-~~~~R~IGdiv~~~~~~~e~~~et~~GFENH~GrT~L~~d~~pLG-~Vv~G~GNn~eD~~ 183 (250) T COG3442 106 TASGTRIDGLGILDHYTEN-PQTKRFIGDIVIENTLAGEEFGETLVGFENHGGRTYLGPDVKPLG-KVVYGYGNNGEDGT 183 (250) T ss_pred CCCCCEEECCCCEEEEECC-CCCCCEEEEEEEECCCCHHHHCCEEEEEECCCCCEECCCCCCCCE-EEEECCCCCCCCCC T ss_conf 6898476132211234126-666504536886214565773880462543788134189876460-07886677755466 Q ss_pred EEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 72686799778983487887767001689999999999735 Q gi|254780420|r 353 CGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRK 393 (396) Q Consensus 353 egi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k 393 (396) ||.. .+++++.=||=-.-| -.-.|=|.++....+.| T Consensus 184 eG~~--ykn~~aTY~HGP~L~---rNp~LAd~Ll~tAl~~k 219 (250) T COG3442 184 EGAH--YKNVIATYFHGPILS---RNPELADRLLTTALEKK 219 (250) T ss_pred CCEE--EEEEEEEEECCCCCC---CCHHHHHHHHHHHHHHH T ss_conf 4234--520478751175446---88789999999999886 No 115 >KOG2764 consensus Probab=92.73 E-value=0.14 Score=31.85 Aligned_cols=66 Identities=18% Similarity=0.261 Sum_probs=44.5 Q ss_pred CEEEECCCCHHHHHHCCCCEEEECCC-CCCCC-CCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCC Q ss_conf 41330465744644135788998588-88841-011211012221025786347861005887875285 Q gi|254780420|r 230 ITIVGAETSYKDIMSLQPDGVLLSNG-PGDPA-VTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGA 296 (396) Q Consensus 230 v~Vvp~~~~~~~i~~~~pdgi~lSnG-PGdP~-~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~ 296 (396) ++.+|-+..++... .+||.|+|-+| ||--. .......++++++.+.+.+|+.||-|--+...+.|- T Consensus 52 v~~~~d~~l~D~~~-~~yDviilPGG~~g~e~L~~~~~v~~lvK~q~~~gkLIaaICaap~~al~a~gl 119 (247) T KOG2764 52 VHILPDNALFDVVD-SKYDVIILPGGLPGAETLSECEKVVDLVKEQAESGKLIAAICAAPLTALAAHGL 119 (247) T ss_pred EEECCCCCCHHHCC-CCCCEEEECCCCHHHHHHHHCHHHHHHHHHHHHCCCEEEEEECCHHHHHHHCCC T ss_conf 37625531012113-566689946996056665306889999999875299699862056989741655 No 116 >PRK02006 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=92.60 E-value=0.32 Score=29.28 Aligned_cols=34 Identities=18% Similarity=0.050 Sum_probs=13.4 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCC Q ss_conf 66404654434552417989999742783211257 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGID 127 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GID 127 (396) +||...-..|.|... ..+-...++.+||-|+.|| T Consensus 72 ~vV~SPGI~p~~p~~-~~~l~~A~~~gi~i~~eie 105 (501) T PRK02006 72 LVALSPGLSPLEPAL-AALLAAARERGIPVWGELE 105 (501) T ss_pred EEEECCEECCCCCCH-HHHHHHHHHCCCCEEEHHH T ss_conf 999899008888543-1999999987995876899 No 117 >cd03131 GATase1_HTS Type 1 glutamine amidotransferase (GATase1)-like domain found in homoserine trans-succinylase (HTS). Type 1 glutamine amidotransferase (GATase1)-like domain found in homoserine trans-succinylase (HTS). HTS, the first enzyme in methionine biosynthesis in Escherichia coli, transfers a succinyl group from succinyl-CoA to homoserine forming succinyl homoserine. It has been suggested that the succinyl group of succinyl-CoA is initially transferred to an enzyme nucleophile before subsequent transfer to homoserine. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with GATase1 domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. It has been proposed that this cys is in the active site of the molecule. However, as succinyl has been found bound to a conserved lysine residue, this conserved cys may play a role in dimer formation. HTS acti Probab=92.55 E-value=0.16 Score=31.43 Aligned_cols=63 Identities=14% Similarity=0.202 Sum_probs=41.8 Q ss_pred CHHHHHHCCCCEEEECCCCCCCCCCCC----C---CHHHHHHHCCCCCCEEEEECHHHHHHHHC-CCCCCCCC Q ss_conf 744644135788998588888410112----1---10122210257863478610058878752-85446677 Q gi|254780420|r 238 SYKDIMSLQPDGVLLSNGPGDPAVTSV----Y---SSPIICKLVDSGMPMFGICLGHQLLGLAL-GAQTVKMY 302 (396) Q Consensus 238 ~~~~i~~~~pdgi~lSnGPGdP~~~~~----~---~~~~~~~~~~~~iPilGIClGhQ~l~~a~-G~~~~kl~ 302 (396) ++++|...++||+++.+-| ++..+- + ..+++...-......|+||+|.|+....+ |-+-+.++ T Consensus 54 ~~~~i~~~~~DglIITGAP--ve~l~fe~v~YW~El~~i~dwa~~~v~stl~iCWaA~Aal~~~ygI~k~~l~ 124 (175) T cd03131 54 TFDDIRDAKFDGLIVTGAP--VEHLPFEQVDYWEELTEILDWAKTHVTSTLFSCWAAMAALYYFYGIKKHQLP 124 (175) T ss_pred HHHHHHHCCCCEEEEECCC--CCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEHHHHHHHHHHHHCCCCEECC T ss_conf 4998741777668971899--8878834387799999999999873663345649999999998399840068 No 118 >TIGR00313 cobQ cobyric acid synthase CobQ; InterPro: IPR004459 Cobyric acid synthase (CobQ) catalyses amidations at positions B, D, E, and G on adenosylcobyrinic A,C-diamide in the biosynthesis of cobalamin. NH(2) groups are provided by glutamine, and one molecule of ATP is hydrogenolysed for each amidation .; GO: 0003824 catalytic activity, 0009236 cobalamin biosynthetic process. Probab=92.24 E-value=0.11 Score=32.67 Aligned_cols=45 Identities=31% Similarity=0.497 Sum_probs=27.5 Q ss_pred CCCEEEECCCCCCCCCCCCC-------CHHHHHHHCCCCCC-EEEEECHHHHHHHHC Q ss_conf 57889985888884101121-------10122210257863-478610058878752 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVY-------SSPIICKLVDSGMP-MFGICLGHQLLGLAL 294 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~-------~~~~~~~~~~~~iP-ilGIClGhQ~l~~a~ 294 (396) ++|.|++ ||.=....+. ..+.+..+ .+.+| ++|||=|.|+|+..+ T Consensus 287 ~~d~vIi---PGsk~ti~dl~~L~~~~~~~~i~~~-~~~ig~V~GICGGYQ~LG~~l 339 (502) T TIGR00313 287 GLDAVII---PGSKDTIADLKALKESGFAEEILDL-AKEIGIVIGICGGYQMLGKEL 339 (502) T ss_pred CCEEEEE---CCCCCHHHHHHHHHHCCCHHHHHHH-HHCCCEEEEECCCHHHHHHHH T ss_conf 8448996---7996578999999746866799999-606988998347602100310 No 119 >COG0693 ThiJ Putative intracellular protease/amidase [General function prediction only] Probab=91.83 E-value=0.2 Score=30.71 Aligned_cols=63 Identities=19% Similarity=0.315 Sum_probs=43.7 Q ss_pred EECCCCHHHHHHCCCCEEEECCCCCCCCCCC--CCCHHHHHHHCCCCCCEEEEECHHHHHHHHCC Q ss_conf 3046574464413578899858888841011--21101222102578634786100588787528 Q gi|254780420|r 233 VGAETSYKDIMSLQPDGVLLSNGPGDPAVTS--VYSSPIICKLVDSGMPMFGICLGHQLLGLALG 295 (396) Q Consensus 233 vp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~--~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G 295 (396) +..+..++++...++|++++..|..-|.... ......++++.+.+.||..||=|-++|+-+-. T Consensus 53 ~~~~~~~~~~~~~~ydal~ipGG~~~~~~~~~~~~~~~~v~~~~~~~k~vaaIC~g~~~L~~ag~ 117 (188) T COG0693 53 VADDKAFDDADAADYDALVIPGGDHGPEYLRPDPDLLAFVRDFYANGKPVAAICHGPAVLAAAGL 117 (188) T ss_pred EECCHHCCCCCCCCCCEEEECCCCCCHHHCCCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHCCC T ss_conf 83212116578135989998899752322135178999999999769989998468899874768 No 120 >PRK05297 phosphoribosylformylglycinamidine synthase; Provisional Probab=91.06 E-value=1.5 Score=24.63 Aligned_cols=89 Identities=16% Similarity=0.249 Sum_probs=52.6 Q ss_pred CCCCCCEEEEEEC-CC--CCHHHHHHHHHCCCCEEEECCCCHHHHHH-----CCCCEEEECCC--CCCCCCCCC------ Q ss_conf 7777714999967-96--41023222220245413304657446441-----35788998588--888410112------ Q gi|254780420|r 201 SSDAKYHVVCIDY-GI--RSNLLRVLSSLGCRITIVGAETSYKDIMS-----LQPDGVLLSNG--PGDPAVTSV------ 264 (396) Q Consensus 201 ~~~~~~kIvvIDf-Gi--K~nIlr~L~~~g~~v~Vvp~~~~~~~i~~-----~~pdgi~lSnG--PGdP~~~~~------ 264 (396) ....++|||++=+ |+ -+-..+.+.+-|+++.-|-. .|+++ .++.||++.+| -||.--..+ T Consensus 1034 ~~~~kPkVAIlreqG~Nge~EmAaAF~~AGF~~~DVhm----sDL~~g~~~L~~f~glva~GGFSyGDvlGsgkGwA~si 1109 (1294) T PRK05297 1034 ATGARPKVAILREQGVNSHVEMAAAFDRAGFDAIDVHM----SDLLAGRVTLEDFKGLVACGGFSYGDVLGAGEGWAKSI 1109 (1294) T ss_pred CCCCCCCEEEECCCCCCCHHHHHHHHHHCCCCEEEEEH----HHCCCCCCCHHHCCEEEECCCCCCCCCCCHHHHHHHHH T ss_conf 47999838997689879679999999986993488652----31003611787791899878768767775678999999 Q ss_pred ----CCHHHHHHHCCC-CCCEEEEECHHHHHHHH Q ss_conf ----110122210257-86347861005887875 Q gi|254780420|r 265 ----YSSPIICKLVDS-GMPMFGICLGHQLLGLA 293 (396) Q Consensus 265 ----~~~~~~~~~~~~-~iPilGIClGhQ~l~~a 293 (396) .+...+..++.. .-=.||||=|.|+|+.- T Consensus 1110 l~N~~~r~~f~~Ff~r~DtfsLGICNGcQ~L~~L 1143 (1294) T PRK05297 1110 LFNPRLRDQFAAFFARPDTFALGVCNGCQMMSNL 1143 (1294) T ss_pred HCCHHHHHHHHHHHCCCCCEEEEEEHHHHHHHHC T ss_conf 7299999999999739994599953889999977 No 121 >cd03141 GATase1_Hsp31_like Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Escherichia coli Hsp31 protein (EcHsp31). This group includes EcHsp31 and Saccharomyces cerevisiae Ydr533c protein. EcHsp31 has chaperone activity. Ydr533c is upregulated in response to various stress conditions along with the heat shock family. EcHsp31 coordinates a metal ion using a 2-His-1-carboxylate motif present in various ions that use iron as a cofactor such as Carboxypeptidase A. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1 domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For EcHsp31, this Cys together with a different His and, an Asp (rather than a Glu) residue form a different Probab=90.66 E-value=0.22 Score=30.51 Aligned_cols=61 Identities=13% Similarity=0.194 Sum_probs=45.3 Q ss_pred ECCCCHHHHHHCCCCEEEECCCCCCCCCC--CCCCHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 04657446441357889985888884101--12110122210257863478610058878752 Q gi|254780420|r 234 GAETSYKDIMSLQPDGVLLSNGPGDPAVT--SVYSSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 234 p~~~~~~~i~~~~pdgi~lSnGPGdP~~~--~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) ....++++|...++|+||+.+|.|-..+. +.....+++.+.+.+.||-.||-|-..|..+- T Consensus 78 ~~~~~l~~v~~~~ydai~~pGG~g~m~dl~~~~~l~~l~~~~~~~gk~vaaiCHGp~~L~~~~ 140 (221) T cd03141 78 ANTKKLSDVDPSDYDAIFIPGGHGPMFDLPDNPDLQDLLREFYENGKVVAAVCHGPAALLNVK 140 (221) T ss_pred HCCCCHHHCCHHHCCEEEECCCCCHHHHHHCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHHC T ss_conf 587896789987883899899854587744169999999999974985999763079887500 No 122 >cd03132 GATase1_catalase Type 1 glutamine amidotransferase (GATase1)-like domain found in at the C-terminal of several large catalases. Type 1 glutamine amidotransferase (GATase1)-like domain found in at the C-terminal of several large catalases. Catalase catalyzes the dismutation of hydrogen peroxide (H2O2) to water and oxygen. This group includes the large catalases: Neurospora crassa Catalase-1 and Catalase-3 and, Escherichia coli HP-II. This GATase1-like domain has an essential role in HP-II catalase activity. However, it lacks enzymatic activity and the catalytic triad typical of GATase1 domains. Catalase-1 and -3 are homotetrameric, HP-II is homohexameric. It has been proposed that this domain may facilitate the folding and oligomerization process. The interface between this GATase1-like domain of HP-II and the core of the subunit forms part of a channel which provides access to the deeply buried catalase active sites of HPII. Catalase-1 is associated with non-growing cells; C Probab=90.07 E-value=0.27 Score=29.80 Aligned_cols=61 Identities=20% Similarity=0.235 Sum_probs=44.6 Q ss_pred ECCCCHHHHHHCCCCEEEECCCCCCCCC--CCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 0465744644135788998588888410--112110122210257863478610058878752 Q gi|254780420|r 234 GAETSYKDIMSLQPDGVLLSNGPGDPAV--TSVYSSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 234 p~~~~~~~i~~~~pdgi~lSnGPGdP~~--~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) +.+.++++..+..+|++++-.|+..+.. .+......+++.+...+||-.||-|-++|..+- T Consensus 50 ~vd~t~~~~~s~~fDAvvipgG~~~~~~L~~~~~~~~fv~eay~h~KpI~a~~~~~~lL~~ag 112 (142) T cd03132 50 EVDQTYAGAPSVLFDAVVVPGGAEAAFALAPSGRALHFVTEAFKHGKPIGAVGEGSDLLEAAG 112 (142) T ss_pred ECCEEECCCCHHHCCEEEECCCCCCHHHHCCCHHHHHHHHHHHHCCCEEEEECCHHHHHHHCC T ss_conf 054435368964713688438877988864486799999999976997999377299999769 No 123 >cd06315 PBP1_ABC_sugar_binding_like_6 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=89.29 E-value=0.71 Score=26.84 Aligned_cols=64 Identities=23% Similarity=0.455 Sum_probs=27.2 Q ss_pred CCCCHHHHHHHHHCCCCEEEECCCCH-------HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE Q ss_conf 96410232222202454133046574-------46441357889985888884101121101222102578634786 Q gi|254780420|r 214 GIRSNLLRVLSSLGCRITIVGAETSY-------KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI 283 (396) Q Consensus 214 GiK~nIlr~L~~~g~~v~Vvp~~~~~-------~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI 283 (396) |+++.+-..=..+|.+++++-..-+. ++....+||||++ +|-|+... ...++...+.+||+.|. T Consensus 17 ~v~~G~~eaA~~lGw~v~v~D~~g~~~~~~~~i~~ai~~k~D~Iii--~~~D~~~~----~~~l~~A~~agIPvv~~ 87 (280) T cd06315 17 GVGEGVREAAKAIGWNLRILDGRGSEAGQAAALNQAIALKPDGIVL--GGVDAAEL----QAELELAQKAGIPVVGW 87 (280) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCEEEE--CCCCHHHH----HHHHHHHHHCCCCEEEE T ss_conf 4988999999975987999889999999999999999639999999--98297887----89999999879978962 No 124 >cd05296 GH4_P_beta_glucosidase Glycoside Hydrolases Family 4; Phospho-beta-glucosidase. Some bacteria simultaneously translocate and phosphorylate disaccharides via the phosphoenolpyruvate-dependent phosphotransferase system (PEP-PTS). After translocation, these phospho-disaccharides may be hydrolyzed by the GH4 glycoside hydrolases such as the phospho-beta-glucosidases. Other organisms (such as archaea and Thermotoga maritima ) lack the PEP-PTS system, but have several enzymes normally associated with the PEP-PTS operon. The 6-phospho-beta-glucosidase from Thermotoga maritima hydrolylzes cellobiose 6-phosphate (6P) into glucose-6P and glucose, in an NAD+ and Mn2+ dependent fashion. The Escherichia coli 6-phospho-beta-glucosidase (also called celF) hydrolyzes a variety of phospho-beta-glucosides including cellobiose-6P, salicin-6P, arbutin-6P, and gentobiose-6P. Phospho-beta-glucosidases are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein Probab=88.82 E-value=0.33 Score=29.22 Aligned_cols=66 Identities=23% Similarity=0.370 Sum_probs=48.7 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHH----HHHHCCCCCCCCCCCCCCCCCC Q ss_conf 446441357889985888884101121101222102578634786100588----7875285446677675210112 Q gi|254780420|r 239 YKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQL----LGLALGAQTVKMYQGHHGVNHP 311 (396) Q Consensus 239 ~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~----l~~a~G~~~~kl~~GhrG~N~P 311 (396) +++|..+.||+++|- --+|... +...+.+. .++.++|+|=++|- ||..+|..-.++.+-.-|.||= T Consensus 128 a~~i~e~cP~AwliN--ytNP~~i---vt~a~~r~--~~~k~vGlCh~~~~~~~~la~~Lg~~~~~i~~~~~GlNH~ 197 (419) T cd05296 128 AEDVEELAPDAWLIN--FTNPAGI---VTEAVLRH--TGDRVIGLCNVPIGLQRRIAELLGVDPEDVFIDYAGLNHL 197 (419) T ss_pred HHHHHHHCCCCEEEE--CCCHHHH---HHHHHHHC--CCCCEEEECCCHHHHHHHHHHHHCCCHHHEEEEEEEECCH T ss_conf 999998699838997--3787999---99999966--8998899797779899999999589879907899865548 No 125 >PRK02231 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=88.63 E-value=0.46 Score=28.19 Aligned_cols=31 Identities=13% Similarity=0.129 Sum_probs=18.3 Q ss_pred CCHHHHHHHCCCCEECCCCHHHH--HHHHHHCC Q ss_conf 79899997427832112576899--99998539 Q gi|254780420|r 109 MHFDAWLKSCGIIGLSGIDTRAL--TVWIRDYG 139 (396) Q Consensus 109 ~sL~~wL~~~~IpgI~GIDTRaL--Tk~IRe~G 139 (396) .+++++.++-++.-.-|=|=-.| .|.+.... T Consensus 34 ~~~~e~~~~~Dlii~iGGDGT~L~a~r~~~~~~ 66 (272) T PRK02231 34 ASLEEIGQRAQLAIVIGGDGNMLGRARVLAKYD 66 (272) T ss_pred CCHHHHCCCCCEEEEECCCHHHHHHHHHHHCCC T ss_conf 798897017789999787589999999860059 No 126 >KOG0538 consensus Probab=87.96 E-value=2.5 Score=23.05 Aligned_cols=93 Identities=29% Similarity=0.419 Sum_probs=63.0 Q ss_pred CCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHH---HHCCCCEEEECCCCCC-----CCCCCCCCHHHHHHHCCC Q ss_conf 71499996796410232222202454133046574464---4135788998588888-----410112110122210257 Q gi|254780420|r 205 KYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDI---MSLQPDGVLLSNGPGD-----PAVTSVYSSPIICKLVDS 276 (396) Q Consensus 205 ~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i---~~~~pdgi~lSnGPGd-----P~~~~~~~~~~~~~~~~~ 276 (396) .+-...+|--.-|+=+..|+..---.+|+---.+.+|. .+..-+||++||--|. |+..+ ..+.+.+..++ T Consensus 200 ~~~~~~id~Sl~W~Di~wLr~~T~lPIvvKGilt~eDA~~Ave~G~~GIIVSNHGgRQlD~vpAtI~--~L~Evv~aV~~ 277 (363) T KOG0538 200 AYVSSQIDPSLSWKDIKWLRSITKLPIVVKGVLTGEDARKAVEAGVAGIIVSNHGGRQLDYVPATIE--ALPEVVKAVEG 277 (363) T ss_pred HHHHCCCCCCCCHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHH--HHHHHHHHHCC T ss_conf 6642378877774246999852758769983114387999998088659985787532576641188--79999998628 Q ss_pred CCCEE---EEECHHHHH-HHHCCCCCC Q ss_conf 86347---861005887-875285446 Q gi|254780420|r 277 GMPMF---GICLGHQLL-GLALGAQTV 299 (396) Q Consensus 277 ~iPil---GIClGhQ~l-~~a~G~~~~ 299 (396) ++|+| |+--|.-++ |+|+||+.. T Consensus 278 ri~V~lDGGVR~G~DVlKALALGAk~V 304 (363) T KOG0538 278 RIPVFLDGGVRRGTDVLKALALGAKGV 304 (363) T ss_pred CEEEEEECCCCCCHHHHHHHHCCCCEE T ss_conf 547997267335427999985167368 No 127 >cd06312 PBP1_ABC_sugar_binding_like_4 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=87.96 E-value=1.4 Score=24.74 Aligned_cols=71 Identities=13% Similarity=0.148 Sum_probs=38.1 Q ss_pred HHHHHCCCCEEEECCCCHH-------HHHHCC--CCEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHHH Q ss_conf 2222024541330465744-------644135--788998588888410112110122210-257863478610058878 Q gi|254780420|r 222 VLSSLGCRITIVGAETSYK-------DIMSLQ--PDGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLLG 291 (396) Q Consensus 222 ~L~~~g~~v~Vvp~~~~~~-------~i~~~~--pdgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l~ 291 (396) .|.+.+..+.++..+.+.. +++.-+ .|+||-.|.. .... +...++.. +.+++.|.|+-..-+.+. T Consensus 150 ~l~~~~~~~~~~~~~~d~~~a~~~~~~~L~~~pdi~~I~~~nd~---~a~g--a~~Al~~ag~~~~i~vvg~D~~~~~~~ 224 (271) T cd06312 150 GLGGAGITEEVIETGADPTEVASRIAAYLRANPDVDAVLTLGAP---SAAP--AAKALKQAGLKGKVKLGGFDLSPATLQ 224 (271) T ss_pred HHHHCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCCEEEECCCH---HHHH--HHHHHHHCCCCCCEEEEEECCCHHHHH T ss_conf 98755980799857999999999999999849898889988967---8999--999999669999859999889999999 Q ss_pred HHCCCC Q ss_conf 752854 Q gi|254780420|r 292 LALGAQ 297 (396) Q Consensus 292 ~a~G~~ 297 (396) .---+. T Consensus 225 ~i~~G~ 230 (271) T cd06312 225 AIKAGY 230 (271) T ss_pred HHHCCC T ss_conf 998699 No 128 >cd05298 GH4_GlvA_pagL_like Glycoside Hydrolases Family 4; GlvA- and pagL-like glycosidases. Bacillus subtilis GlvA and Clostridium acetobutylicum pagL are 6-phospho-alpha-glucosidase, catalyzing the hydrolysis of alpha-glucopyranoside bonds to release glucose from oligosaccharides. The substrate specificities of other members of this subgroup are unknown. Some bacteria simultaneously translocate and phosphorylate disaccharides via the phosphoenolpyruvate-dependent phosphotransferase system (PEP_PTS). After translocation, these phospho-disaccharides may be hydrolyzed by the GH4 glycoside hydrolases, which include 6-phospho-beta-glucosidases, 6-phospho-alpha-glucosidases, alpha-glucosidases/alpha-glucuronidases (only from Thermotoga), and alpha-galactosidases. Members of this subfamily are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families including the NAD(P)-binding domains of alcohol dehydrogenases, tyrosine-dependent oxidoreductas Probab=87.91 E-value=0.44 Score=28.34 Aligned_cols=73 Identities=22% Similarity=0.352 Sum_probs=49.7 Q ss_pred EECCCC-HHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHH----HHHHHCCCCCCCCCCCCCC Q ss_conf 304657-44644135788998588888410112110122210257863478610058----8787528544667767521 Q gi|254780420|r 233 VGAETS-YKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQ----LLGLALGAQTVKMYQGHHG 307 (396) Q Consensus 233 vp~~~~-~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ----~l~~a~G~~~~kl~~GhrG 307 (396) +|.-.. +++|.++.||+++|- --+|... +.+.+.+.. .++.++|+|=|+| .||..+|..-.++.+-.-| T Consensus 120 iPv~ldia~di~e~cP~AwliN--ytNP~~~---vt~al~~~~-~~~k~vGLCh~~~~~~~~la~~Lg~~~~~v~~~~aG 193 (437) T cd05298 120 IGPMIELIDDIEKYSPDAWILN--YSNPAAI---VAEALRRLF-PNARILNICDMPIAIMDSMAAILGLDRKDLEPDYFG 193 (437) T ss_pred HHHHHHHHHHHHHHCCCCEEEE--CCCHHHH---HHHHHHHHC-CCCCEEEECCCHHHHHHHHHHHHCCCHHHEEEEEEE T ss_conf 9999999999998799928997--5787899---999999748-999889889788999999999958988991788961 Q ss_pred CCCC Q ss_conf 0112 Q gi|254780420|r 308 VNHP 311 (396) Q Consensus 308 ~N~P 311 (396) .||= T Consensus 194 lNH~ 197 (437) T cd05298 194 LNHF 197 (437) T ss_pred CCHH T ss_conf 0429 No 129 >KOG0292 consensus Probab=87.89 E-value=0.59 Score=27.42 Aligned_cols=15 Identities=27% Similarity=0.197 Sum_probs=6.3 Q ss_pred EEEEECHHHHHHHHC Q ss_conf 478610058878752 Q gi|254780420|r 280 MFGICLGHQLLGLAL 294 (396) Q Consensus 280 ilGIClGhQ~l~~a~ 294 (396) ||--..||-++-.+- T Consensus 457 IF~ag~g~lll~~~~ 471 (1202) T KOG0292 457 IFYAGTGNLLLRSPD 471 (1202) T ss_pred EEECCCCCEEEECCC T ss_conf 366268518997498 No 130 >COG4607 CeuA ABC-type enterochelin transport system, periplasmic component [Inorganic ion transport and metabolism] Probab=87.80 E-value=0.93 Score=26.05 Aligned_cols=48 Identities=27% Similarity=0.520 Sum_probs=35.8 Q ss_pred CCCCCEEEEEECCCCCHHHHHHHHHCCCCEEE-ECC----------------------CCHHHHHHCCCCEEEEC Q ss_conf 77771499996796410232222202454133-046----------------------57446441357889985 Q gi|254780420|r 202 SDAKYHVVCIDYGIRSNLLRVLSSLGCRITIV-GAE----------------------TSYKDIMSLQPDGVLLS 253 (396) Q Consensus 202 ~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vv-p~~----------------------~~~~~i~~~~pdgi~lS 253 (396) +....|||++|+| .|..|.++|+++.++ |.. -++|.|-+.+||-||+| T Consensus 55 pknPekVvv~D~g----aLD~ld~lGve~~~v~~~~~~P~yL~~y~~dky~nvGtlfEPD~Eai~a~kPdLIIig 125 (320) T COG4607 55 PKNPEKVVVLDLG----ALDTLDALGVEVVAVGPGKNLPAYLQKYKDDKYANVGTLFEPDYEAIAAAKPDLIIIG 125 (320) T ss_pred CCCCCEEEEECCH----HHHHHHHHCCCCCCCCCCCCCCHHHHHHCCCCCCCCCCCCCCCHHHHHHCCCCEEEEC T ss_conf 6887558995411----5666887297202236877863789971467866667535878899974499889977 No 131 >PRK02645 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=87.65 E-value=0.61 Score=27.34 Aligned_cols=19 Identities=26% Similarity=0.156 Sum_probs=11.9 Q ss_pred CCCCCHHHHHHHCCCCEEC Q ss_conf 2417989999742783211 Q gi|254780420|r 106 RANMHFDAWLKSCGIIGLS 124 (396) Q Consensus 106 rs~~sL~~wL~~~~IpgI~ 124 (396) ..-..+.+||+++||-.+. T Consensus 19 ~~a~~l~~~L~~rGi~v~~ 37 (304) T PRK02645 19 EAAERCAKQLEARGIKVLM 37 (304) T ss_pred HHHHHHHHHHHHCCCEEEE T ss_conf 9999999999988899998 No 132 >PRK12815 carB carbamoyl phosphate synthase large subunit; Reviewed Probab=86.68 E-value=2.9 Score=22.56 Aligned_cols=144 Identities=18% Similarity=0.260 Sum_probs=69.8 Q ss_pred HHHHHHHHHHCCCCCEEEECCCC-CCCCHHHHHHHHHHCCCCCCCC-HHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 68999999853984236632765-4589789888774158874321-011013333211168653445434434677777 Q gi|254780420|r 128 TRALTVWIRDYGVSNSVIAHHPD-GQFDLEDLKERAKNWSGLKGAE-LAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAK 205 (396) Q Consensus 128 TRaLTk~IRe~G~~~a~I~~~~~-~~~d~~~~~~~~~~~~~~~~~d-L~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~ 205 (396) +..+-+.....|-....|+.--. ...++.++.++..-.|..+..| .+.+.....+|-| .++....+. ..+..+ T Consensus 480 ~~~~l~~aK~~GFSD~~Ia~l~~~~e~~vr~~R~~~~i~P~yK~VDTcA~EF~a~T~Y~Y--sTY~~e~e~---~~~~~~ 554 (1068) T PRK12815 480 SADLLRKAKEKGFSDRLIAELTGVTEEDVRELRKKLGIRPSFKMVDTCAAEFEAKTPYYY--STYFGESEA---KIPSEK 554 (1068) T ss_pred CHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEE--EECCCCCCC---CCCCCC T ss_conf 999999999849999999977099999999999976995589842774555467876377--634786777---688898 Q ss_pred CEEEEEECC-------------CCCHHHHHHHHHCCCCEEEECC-----CC-------------HHHHH----HCCCCEE Q ss_conf 149999679-------------6410232222202454133046-----57-------------44644----1357889 Q gi|254780420|r 206 YHVVCIDYG-------------IRSNLLRVLSSLGCRITIVGAE-----TS-------------YKDIM----SLQPDGV 250 (396) Q Consensus 206 ~kIvvIDfG-------------iK~nIlr~L~~~g~~v~Vvp~~-----~~-------------~~~i~----~~~pdgi 250 (396) ++|++|-.| .=+ -++.|++.|.+.+++.+| |+ .|+|+ ..+|+|| T Consensus 555 ~kvliLGsGP~RIGqgiEFDYc~vh-a~~aLr~~G~etImiN~NPETVSTDyD~sDrLYFEPlt~E~V~~I~~~E~p~gV 633 (1068) T PRK12815 555 KKVLILGSGPIRIGQGIEFDYMCVH-AAFALKKEGYETIMINNNPETVSTDYDTADRLYFEPITLEDILNVAEAENIKGV 633 (1068) T ss_pred CEEEEECCCCCEECCCCCCHHHHHH-HHHHHHHCCCEEEEECCCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHCCCEE T ss_conf 3499956787142244240025899-999999689748996278431236745577357536879999999998589979 Q ss_pred EECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 9858888841011211012221025786347861 Q gi|254780420|r 251 LLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 251 ~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) +++=|==.| .++...+.+.++||||-- T Consensus 634 ivqfGGQt~-------lnla~~L~~~gv~IlGts 660 (1068) T PRK12815 634 IVQFGGQTA-------INLAKPLEEAGLPILGTS 660 (1068) T ss_pred EEECCCCCH-------HHHHHHHHHCCCEEECCC T ss_conf 996488236-------799999997699498278 No 133 >cd06305 PBP1_methylthioribose_binding_like Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. The sugar-binding domain of the periplasmic proteins in this group is also homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR), DNA-binding transcriptional repressors such as LacI and GalR. Probab=86.45 E-value=1.2 Score=25.19 Aligned_cols=48 Identities=10% Similarity=0.232 Sum_probs=23.8 Q ss_pred HHHHHCCCC----EEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHHHH Q ss_conf 464413578----8998588888410112110122210-2578634786100588787 Q gi|254780420|r 240 KDIMSLQPD----GVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 240 ~~i~~~~pd----gi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l~~ 292 (396) ++++.-+|| +||-+| |..... ....++.. +.++++|.|+-..-+.+.. T Consensus 173 ~~~L~~~pd~~~~ai~~~n---d~~a~G--a~~Al~~~G~~~~v~v~G~D~~~~~~~~ 225 (273) T cd06305 173 EAVLKKYPKGGIDAIWAAW---DEFAKG--AKQALDEAGRTDEIKIYGVDISPEDLQL 225 (273) T ss_pred HHHHHHCCCCCEEEEEECC---CHHHHH--HHHHHHHCCCCCCCEEEEECCCHHHHHH T ss_conf 9999738887627999358---578999--9999997699999489997899899999 No 134 >PRK02155 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=86.28 E-value=1.1 Score=25.63 Aligned_cols=36 Identities=22% Similarity=0.154 Sum_probs=22.0 Q ss_pred EEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCH Q ss_conf 36664046544345524179899997427832112576 Q gi|254780420|r 91 VGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDT 128 (396) Q Consensus 91 ~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDT 128 (396) .|+|.+.. .+.-.+.-..|.+||.++++-.+-.-+| T Consensus 8 Vgiv~k~~--~~~~~~~~~~l~~~L~~~g~~v~~e~~~ 43 (291) T PRK02155 8 VALVGRYQ--TPGIAEPLEALAACIAKRGFEVVFEADT 43 (291) T ss_pred EEEEECCC--CHHHHHHHHHHHHHHHHCCCEEEEEHHH T ss_conf 99991489--8689999999999999788999995557 No 135 >PRK01390 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=85.94 E-value=1.7 Score=24.26 Aligned_cols=24 Identities=21% Similarity=0.412 Sum_probs=13.4 Q ss_pred HCCCCEECCCC-----HHHHHHHHHHCCC Q ss_conf 42783211257-----6899999985398 Q gi|254780420|r 117 SCGIIGLSGID-----TRALTVWIRDYGV 140 (396) Q Consensus 117 ~~~IpgI~GID-----TRaLTk~IRe~G~ 140 (396) +..|.||+|=. |..|...|+..|- T Consensus 113 ~~~iIaVTGTnGKTTTtsli~~iL~~~g~ 141 (457) T PRK01390 113 DAPFIAITGTNGKSTTTALIAHLLRQAGR 141 (457) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHCCC T ss_conf 89889992899649899999999997399 No 136 >cd03137 GATase1_AraC_1 AraC transcriptional regulators having a Type 1 glutamine amidotransferase (GATase1)-like domain. A subgroup of AraC transcriptional regulators having a Type 1 glutamine amidotransferase (GATase1)-like domain. AraC regulators are defined by a AraC-type helix-turn-helix DNA binding domain at their C-terminal. AraC family transcriptional regulators are widespread among bacteria and are involved in regulating diverse and important biological functions, including carbon metabolism, stress responses and virulence in different microorganisms. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with typical GATase1domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=85.34 E-value=1.3 Score=25.07 Aligned_cols=59 Identities=17% Similarity=0.280 Sum_probs=38.6 Q ss_pred EECCCCHHHHHHCCCCEEEECCCCCCCCCCC-CCCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 3046574464413578899858888841011-211012221025786347861005887875 Q gi|254780420|r 233 VGAETSYKDIMSLQPDGVLLSNGPGDPAVTS-VYSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 233 vp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~-~~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) +..+.++++.. ++|.+++..|++.+.... ......+++....+..|.|||-|-.+||.| T Consensus 53 i~~d~~~~~~~--~~D~livpgg~~~~~~~~~~~l~~~L~~~~~~~~~l~~ictG~~~LA~a 112 (187) T cd03137 53 LVADAGLDALA--AADTVIVPGGPDVDGRPPPPALLAALRRAAARGARVASVCTGAFVLAEA 112 (187) T ss_pred EECCCCHHHCC--CCCEEEECCCCCHHHCCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHC T ss_conf 95376533456--4699998898861102599999999999975386688753140988745 No 137 >PRK00141 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=84.61 E-value=2.6 Score=22.89 Aligned_cols=30 Identities=13% Similarity=0.188 Sum_probs=14.5 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCC Q ss_conf 66404654434552417989999742783211257 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGID 127 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GID 127 (396) +||...-..+.|. .-...++.+||-|+.|+ T Consensus 79 ~vV~SPGI~~~~p-----~l~~a~~~gi~viseie 108 (476) T PRK00141 79 IVVTSPGWRPDSP-----LLVDAQSAGLEVIGDVE 108 (476) T ss_pred EEEECCCCCCCCH-----HHHHHHHCCCCEEEHHH T ss_conf 9998997899799-----99999987995771999 No 138 >TIGR01836 PHA_synth_III_C poly(R)-hydroxyalkanoic acid synthase, class III, PhaC subunit; InterPro: IPR010125 This entry represents the PhaC subunit of a heterodimeric form of polyhydroxyalkanoic acid (PHA) synthase. Excepting the PhaC of Bacillus megaterium (which needs PhaR), all members require PhaE (IPR010123 from INTERPRO) for activity and are designated class III. This enzyme builds ester polymers for carbon and energy storage that accumulate in inclusions, and both this enzyme and the depolymerase associate with the inclusions. Class III enzymes polymerise short-chain-length hydroxyalkanoates.; GO: 0008415 acyltransferase activity, 0042619 poly-hydroxybutyrate biosynthetic process. Probab=84.54 E-value=1.6 Score=24.39 Aligned_cols=73 Identities=18% Similarity=0.293 Sum_probs=43.6 Q ss_pred EEECCCCCHHHHHHHHHCCCCEEEECCCCHHH--HHHC-CCCEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEEC Q ss_conf 99679641023222220245413304657446--4413-5788998588888410112110122210-257863478610 Q gi|254780420|r 210 CIDYGIRSNLLRVLSSLGCRITIVGAETSYKD--IMSL-QPDGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICL 285 (396) Q Consensus 210 vIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~--i~~~-~pdgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGICl 285 (396) .+|+=-=+|.+|.|.++|.+|-++-|-++-.. -+.+ +|=--+| +. +.+.+++. -.-.|-+||||. T Consensus 86 m~DLQ~drSlv~~LL~~G~DVYLiDWGyP~~~D~~ltLdDY~~~YI------~~-----cV~~i~~~~~~d~i~lLG~Cq 154 (367) T TIGR01836 86 MLDLQEDRSLVRGLLERGQDVYLIDWGYPDRADRYLTLDDYVNGYI------DK-----CVDYICRTSKLDQISLLGICQ 154 (367) T ss_pred CEEECCCCHHHHHHHHCCCCEEEEECCCCCHHHHCCCHHHHHHHHH------HH-----HHHHHHHHCCCCCCCEEEEEC T ss_conf 0232488168888885698489960689873331140466676778------99-----999998650789610431431 Q ss_pred HHHHHHHHC Q ss_conf 058878752 Q gi|254780420|r 286 GHQLLGLAL 294 (396) Q Consensus 286 GhQ~l~~a~ 294 (396) |= .+++-| T Consensus 155 GG-TfsL~y 162 (367) T TIGR01836 155 GG-TFSLCY 162 (367) T ss_pred CH-HHHHHH T ss_conf 04-789999 No 139 >cd01538 PBP1_ABC_xylose_binding Periplasmic xylose-binding component of the ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Periplasmic xylose-binding component of the ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic xylose-binding protein is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=84.52 E-value=1.5 Score=24.67 Aligned_cols=41 Identities=20% Similarity=0.203 Sum_probs=23.0 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHHH Q ss_conf 5788998588888410112110122210-257863478610058878 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLLG 291 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l~ 291 (396) ++|+||-.| |+.... +...++.. +.+++.|.|+.-.-+.+. T Consensus 190 ~i~aI~a~n---D~~A~G--a~~Al~~aG~~~~v~vvG~Dg~~~~l~ 231 (288) T cd01538 190 KVDGVLAAN---DGTAGG--AIAALKAAGLAGKPPVTGQDAELAAVQ 231 (288) T ss_pred CCCEEEECC---CHHHHH--HHHHHHHCCCCCCCEEEECCCCHHHHH T ss_conf 575898558---567799--999999749999873983599899999 No 140 >PRK03369 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=84.10 E-value=3 Score=22.41 Aligned_cols=30 Identities=17% Similarity=0.144 Sum_probs=13.6 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCC Q ss_conf 66404654434552417989999742783211257 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGID 127 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GID 127 (396) +||...-..++|. .-...++.+||-++.++ T Consensus 73 ~vV~SPGI~~~~p-----~l~~a~~~gi~i~~eie 102 (487) T PRK03369 73 LVVTSPGFPPTAP-----VLAAAAAAGVPIWGDVE 102 (487) T ss_pred EEEECCCCCCCCH-----HHHHHHHCCCCEEEHHH T ss_conf 8998995799899-----99999988990765999 No 141 >cd03139 GATase1_PfpI_2 Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. Type 1 glutamine amidotransferase (GATase1)-like domain found in a subgroup of proteins similar to PfpI from Pyrococcus furiosus. PfpI is an ATP-independent intracellular proteases which may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=83.26 E-value=2 Score=23.65 Aligned_cols=58 Identities=24% Similarity=0.286 Sum_probs=38.9 Q ss_pred ECCCCHHHHHHCCCCEEEECCCCCCCCCCC-CCCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 046574464413578899858888841011-211012221025786347861005887875 Q gi|254780420|r 234 GAETSYKDIMSLQPDGVLLSNGPGDPAVTS-VYSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 234 p~~~~~~~i~~~~pdgi~lSnGPGdP~~~~-~~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) ..+.++++.. ++|.|++..|+|...... ......+++....+..|.+||-|--+||.| T Consensus 52 ~~d~~~~~~~--~~D~lvvpGg~~~~~~~~~~~l~~~Lr~~~~~g~~i~siCtGa~~LA~a 110 (183) T cd03139 52 LPDTSFADPP--DLDVLLVPGGGGTRALVNDPALLDFIRRQAARAKYVTSVCTGALLLAAA 110 (183) T ss_pred ECCCCHHHCC--CCCEEEECCCCCHHHHCCCHHHHHHHHHHHHCCCEEEEECHHHHHHHHC T ss_conf 6058844667--5899998799874530489899999999876198389873767999974 No 142 >PRK10936 periplasmic sensory protein associated with the TorRS two-component regulatory system; Provisional Probab=82.71 E-value=2.8 Score=22.65 Aligned_cols=53 Identities=9% Similarity=0.139 Sum_probs=29.2 Q ss_pred HHHHHCCC--CEEEECCCCCCCCCCCCCCHHHHHH-HCCCCCCEEEEECHHHHHHHHCCCCC Q ss_conf 46441357--8899858888841011211012221-02578634786100588787528544 Q gi|254780420|r 240 KDIMSLQP--DGVLLSNGPGDPAVTSVYSSPIICK-LVDSGMPMFGICLGHQLLGLALGAQT 298 (396) Q Consensus 240 ~~i~~~~p--dgi~lSnGPGdP~~~~~~~~~~~~~-~~~~~iPilGIClGhQ~l~~a~G~~~ 298 (396) ++++.-+| |+|| .|.++ .. .+...++. -..+++-|.|.=+.++.+..-.-|++ T Consensus 220 e~iL~~~pdid~I~-g~~~~---a~--~Ai~al~~~g~~~~v~ivg~D~s~~~~~~Ik~G~i 275 (340) T PRK10936 220 QELLERHPEIDYIA-GSAVA---AE--AAIGELRGRNLTDPVKLVSFYLSHQVYRGLKRGKV 275 (340) T ss_pred HHHHHHCCCCCEEE-ECCHH---HH--HHHHHHHHCCCCCCEEEEEECCCHHHHHHHHCCCE T ss_conf 99998499967998-07566---99--99999996088887499992699999999984977 No 143 >cd06317 PBP1_ABC_sugar_binding_like_8 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Pperiplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=82.69 E-value=1.9 Score=23.88 Aligned_cols=84 Identities=13% Similarity=0.130 Sum_probs=39.0 Q ss_pred CCCEEEEEE--CCCCCH------HHHHHHHHCCCCEEE---ECCCCHH-------HHHH-C--CCCEEEECCCCCCCCCC Q ss_conf 771499996--796410------232222202454133---0465744-------6441-3--57889985888884101 Q gi|254780420|r 204 AKYHVVCID--YGIRSN------LLRVLSSLGCRITIV---GAETSYK-------DIMS-L--QPDGVLLSNGPGDPAVT 262 (396) Q Consensus 204 ~~~kIvvID--fGiK~n------Ilr~L~~~g~~v~Vv---p~~~~~~-------~i~~-~--~pdgi~lSnGPGdP~~~ 262 (396) .+.+|+++- -|.... ..+.|.+.+-.+.++ +.+.+.+ +++. + ++|+||-.|. .... T Consensus 124 ~~~~v~~i~g~~~~~~~~~R~~Gf~~~l~~~~~~i~iv~~~~~~~~~~~a~~~~~~lL~~~~~~i~~I~~~nd---~~a~ 200 (275) T cd06317 124 GKGQIVVIAGQPGNGTAIERQKGFEDELAEVCPGVEVLDTQPADWDREKAQVAMEALITKFGDDIDGVYAGDD---NMAR 200 (275) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCCCCEEECCCC---HHHH T ss_conf 9961799937999859999999999999985887569888741477899999999998734575334405895---7899 Q ss_pred CCCCHHHHHHH-CCCCCCEEEEECHHHHHHH Q ss_conf 12110122210-2578634786100588787 Q gi|254780420|r 263 SVYSSPIICKL-VDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 263 ~~~~~~~~~~~-~~~~iPilGIClGhQ~l~~ 292 (396) . +...+++. +.++++|.|+-..-+.+.. T Consensus 201 G--a~~Al~~~G~~~~v~vvg~d~~~~~~~~ 229 (275) T cd06317 201 G--ALNAAKEAGLAGGIVIVGANNFAVGYEA 229 (275) T ss_pred H--HHHHHHHCCCCCCCEEEEECCCHHHHHH T ss_conf 9--9999997499899879987980999999 No 144 >PRK03806 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=82.37 E-value=4 Score=21.56 Aligned_cols=24 Identities=21% Similarity=0.367 Sum_probs=13.7 Q ss_pred HCCCCEECCCC-----HHHHHHHHHHCCC Q ss_conf 42783211257-----6899999985398 Q gi|254780420|r 117 SCGIIGLSGID-----TRALTVWIRDYGV 140 (396) Q Consensus 117 ~~~IpgI~GID-----TRaLTk~IRe~G~ 140 (396) +..++||+|=. |..|..+|+..|- T Consensus 104 ~~~iIaVTGTnGKTTTt~li~~iL~~~g~ 132 (438) T PRK03806 104 QAPIVAITGSNGKSTVTTLVGEMAKAAGV 132 (438) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHCCC T ss_conf 99889994899848999999999986599 No 145 >PRK09701 D-allose transporter subunit; Provisional Probab=82.32 E-value=1.7 Score=24.11 Aligned_cols=53 Identities=8% Similarity=0.097 Sum_probs=29.4 Q ss_pred HHHHHCCCC--EEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHHHHHCCCC Q ss_conf 464413578--8998588888410112110122210-257863478610058878752854 Q gi|254780420|r 240 KDIMSLQPD--GVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLLGLALGAQ 297 (396) Q Consensus 240 ~~i~~~~pd--gi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l~~a~G~~ 297 (396) ++++.-+|| +||-.|.+. ... +.+.++.. ..+++.+.|.-..-+.+..--.|+ T Consensus 208 ~~~L~a~Pdi~~I~~~nd~~---a~G--a~~Al~~aG~~~~i~vvG~D~~~~~~~~I~~G~ 263 (311) T PRK09701 208 TNVLQRNPNIKAIYCANDTM---AMG--VAQAVANAGKTGKVLVVGTDGIPEARKMVEAGQ 263 (311) T ss_pred HHHHHCCCCCCEEEECCCHH---HHH--HHHHHHHCCCCCCEEEEEECCCHHHHHHHHCCC T ss_conf 99995499976899779726---899--999999669999839999789999999998598 No 146 >COG1797 CobB Cobyrinic acid a,c-diamide synthase [Coenzyme metabolism] Probab=82.13 E-value=2.7 Score=22.81 Aligned_cols=90 Identities=22% Similarity=0.379 Sum_probs=58.3 Q ss_pred CEEEEE-----ECCCCCHHHHHHHHHCCCCEEEECCCCHHH-HHHCCCCEEEECCCCCCCCCCCCC------CHHHHHHH Q ss_conf 149999-----679641023222220245413304657446-441357889985888884101121------10122210 Q gi|254780420|r 206 YHVVCI-----DYGIRSNLLRVLSSLGCRITIVGAETSYKD-IMSLQPDGVLLSNGPGDPAVTSVY------SSPIICKL 273 (396) Q Consensus 206 ~kIvvI-----DfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~-i~~~~pdgi~lSnGPGdP~~~~~~------~~~~~~~~ 273 (396) .||++. +|-...| ++.|++.|++++-+.. .++ -+--+-|+|.|-+ |-|+..... ..+.++++ T Consensus 246 ~rIAVA~D~AF~FyY~~n-l~~Lr~~GAelv~FSP---L~D~~lP~~~D~vYlgG--GYPElfA~~L~~n~~~~~~i~~~ 319 (451) T COG1797 246 VRIAVARDAAFNFYYPEN-LELLREAGAELVFFSP---LADEELPPDVDAVYLGG--GYPELFAEELSANESMRRAIKAF 319 (451) T ss_pred CEEEEEECCHHCCCCHHH-HHHHHHCCCEEEEECC---CCCCCCCCCCCEEEECC--CCHHHHHHHHHHCHHHHHHHHHH T ss_conf 369998545100032999-9999977977998578---67777997788899689--77488999984479999999999 Q ss_pred CCCCCCEEEEECHHHHHHHHC---CCCCCCC Q ss_conf 257863478610058878752---8544667 Q gi|254780420|r 274 VDSGMPMFGICLGHQLLGLAL---GAQTVKM 301 (396) Q Consensus 274 ~~~~iPilGIClGhQ~l~~a~---G~~~~kl 301 (396) .+.+.||+|=|=|.-.|+..+ .++++.| T Consensus 320 ~~~G~piyaECGGlMYL~~~le~~~G~~~~M 350 (451) T COG1797 320 AAAGKPIYAECGGLMYLGESLEDADGDTYEM 350 (451) T ss_pred HHCCCCEEEECCCCEEEHHHEECCCCCEEEE T ss_conf 8769945885155112101204268966543 No 147 >PRK03372 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=81.91 E-value=1.8 Score=24.02 Aligned_cols=17 Identities=18% Similarity=0.020 Sum_probs=9.2 Q ss_pred CCCCCHHHHHHHCCCCE Q ss_conf 24179899997427832 Q gi|254780420|r 106 RANMHFDAWLKSCGIIG 122 (396) Q Consensus 106 rs~~sL~~wL~~~~Ipg 122 (396) .....+.+||.++++-. T Consensus 20 ~~a~~l~~~L~~~g~~v 36 (303) T PRK03372 20 ESARRVVKQLGDAGIGV 36 (303) T ss_pred HHHHHHHHHHHHCCCEE T ss_conf 99999999999788989 No 148 >cd03133 GATase1_ES1 Type 1 glutamine amidotransferase (GATase1)-like domain found in zebrafish ES1. Type 1 glutamine amidotransferase (GATase1)-like domain found in zebrafish ES1. This group includes, proteins similar to ES1, Escherichia coli enhancing lycopene biosynthesis protein 2, Azospirillum brasilense iaaC and, human HES1. The catalytic triad typical of GATase1domains is not conserved in this GATase1-like domain. However, in common with GATase1domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Zebrafish ES1 is expressed specifically in adult photoreceptor cells and appears to be a cytoplasmic protein. A. brasilense iaaC is involved in controlling IAA biosynthesis. Probab=81.86 E-value=4.6 Score=21.15 Aligned_cols=86 Identities=20% Similarity=0.328 Sum_probs=55.3 Q ss_pred CCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCC------------CCCCCHHHHHHHCCCCCCEE Q ss_conf 964102322222024541330465744644135788998588888410------------11211012221025786347 Q gi|254780420|r 214 GIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAV------------TSVYSSPIICKLVDSGMPMF 281 (396) Q Consensus 214 GiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~------------~~~~~~~~~~~~~~~~iPil 281 (396) |-++|+|-+=.+. ++=.+ .++.++...++|++++-+|=|--.. ........++.+.+.++||= T Consensus 55 ~e~RNVLvESARI-aRG~I----~~l~~~~~~d~DaLiiPGGfGaAKNLs~fA~~g~~~~v~~~V~~~i~~f~~a~KPiG 129 (213) T cd03133 55 GESRNVLVESARI-ARGNI----KDLAKLKAADFDALIFPGGFGAAKNLSDFAVKGADCTVNPEVERLVREFHQAGKPIG 129 (213) T ss_pred CCHHHHHHHHHHH-HCCCC----CCHHHCCHHHCCEEEECCCHHHHHHHHHHHCCCCCCEECHHHHHHHHHHHHCCCCEE T ss_conf 6017789899887-62585----564569986789899788514555310021058875698999999999998499999 Q ss_pred EEECHHHHHHHHCCCCCCCCCCCC Q ss_conf 861005887875285446677675 Q gi|254780420|r 282 GICLGHQLLGLALGAQTVKMYQGH 305 (396) Q Consensus 282 GIClGhQ~l~~a~G~~~~kl~~Gh 305 (396) -||..--+++..+|- -.|+..|+ T Consensus 130 ~iCIAP~l~a~vl~~-~~~~TiG~ 152 (213) T cd03133 130 AICIAPALAAKILGE-GVEVTIGN 152 (213) T ss_pred EECHHHHHHHHHCCC-CCEEECCC T ss_conf 975789999998189-82896689 No 149 >cd06319 PBP1_ABC_sugar_binding_like_10 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=81.84 E-value=2.6 Score=22.94 Aligned_cols=68 Identities=10% Similarity=0.212 Sum_probs=32.3 Q ss_pred HHHHHHHHCCCCEEE--ECCCCHH-------HHHHC--CCCEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECH Q ss_conf 232222202454133--0465744-------64413--5788998588888410112110122210-2578634786100 Q gi|254780420|r 219 LLRVLSSLGCRITIV--GAETSYK-------DIMSL--QPDGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLG 286 (396) Q Consensus 219 Ilr~L~~~g~~v~Vv--p~~~~~~-------~i~~~--~pdgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClG 286 (396) ..+.|.+.|+.+.++ ..+.+.+ +++.. ++++||..|.. ... .+...+++. ...++.|.|.-.. T Consensus 147 f~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~p~i~ai~~~~d~---~a~--Ga~~a~~~~G~~~~i~vvg~D~~ 221 (277) T cd06319 147 FKEAMKEAGCDLAGIRQQKDFSYQETFDYTNDLLTANPDIRAIWLQGSD---RYQ--GALDAIATAGKTGKVLLICFDAE 221 (277) T ss_pred HHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCCCEEEECCCH---HHH--HHHHHHHHCCCCCCEEEEECCCC T ss_conf 9999987599767999608999999999999999738997689966988---999--99999997599998589963797 Q ss_pred HHHHH Q ss_conf 58878 Q gi|254780420|r 287 HQLLG 291 (396) Q Consensus 287 hQ~l~ 291 (396) -+.+. T Consensus 222 ~~~~~ 226 (277) T cd06319 222 PEFIE 226 (277) T ss_pred HHHHH T ss_conf 99999 No 150 >PRK04308 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=81.71 E-value=3.5 Score=22.00 Aligned_cols=22 Identities=18% Similarity=0.345 Sum_probs=11.4 Q ss_pred CCCEECCCC-----HHHHHHHHHHCCC Q ss_conf 783211257-----6899999985398 Q gi|254780420|r 119 GIIGLSGID-----TRALTVWIRDYGV 140 (396) Q Consensus 119 ~IpgI~GID-----TRaLTk~IRe~G~ 140 (396) .+.||+|=. |..|...++..|- T Consensus 111 ~~IaVTGTnGKTTTtsli~~iL~~~g~ 137 (445) T PRK04308 111 KVIAITGSNGKTTVTSLVGYLCIKCGL 137 (445) T ss_pred CEEEEECCCCCHHHHHHHHHHHHHCCC T ss_conf 579994899837799999999997599 No 151 >cd05297 GH4_alpha_glucosidase_galactosidase Glycoside Hydrolases Family 4; Alpha-glucosidases and alpha-galactosidases. linked to 3D####ucture Probab=81.62 E-value=1.3 Score=25.02 Aligned_cols=81 Identities=26% Similarity=0.386 Sum_probs=51.2 Q ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHH----HHHH Q ss_conf 023222220245413304657446441357889985888884101121101222102578634786100588----7875 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQL----LGLA 293 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~----l~~a 293 (396) .+.+.|+...+-+.+ +++|.++.||+++|. --+|+. .+.+.+.+.. ++.++|||=+.|- ||.. T Consensus 114 G~~~alRtIPv~lei------a~~i~e~~P~AwiiN--ytNP~~---ivteal~r~~--~~k~iGlC~~p~~~~~~la~~ 180 (423) T cd05297 114 GIFRALRTIPVLLDI------ARDIEELCPDAWLLN--YANPMA---ELTWALNRYT--PIKTVGLCHGVQGTAEQLAKL 180 (423) T ss_pred HHHHHHCCHHHHHHH------HHHHHHHCCCCEEEE--CCCHHH---HHHHHHHHHC--CCCEEEECCCHHHHHHHHHHH T ss_conf 899986228999999------999998799838997--678889---9999999738--997898797769999999998 Q ss_pred CCCCCCCCCCCCCCCCCC Q ss_conf 285446677675210112 Q gi|254780420|r 294 LGAQTVKMYQGHHGVNHP 311 (396) Q Consensus 294 ~G~~~~kl~~GhrG~N~P 311 (396) +|..-.++.+-.-|.||= T Consensus 181 l~~~~~~v~~~~~GLNH~ 198 (423) T cd05297 181 LGEPPEEVDYQVAGINHM 198 (423) T ss_pred HCCCHHHCEEEEECCCEE T ss_conf 299889918999505446 No 152 >cd06318 PBP1_ABC_sugar_binding_like_9 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=81.24 E-value=2.2 Score=23.39 Aligned_cols=50 Identities=12% Similarity=0.118 Sum_probs=26.8 Q ss_pred HHHHC--CCCEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHHHHHCC Q ss_conf 64413--5788998588888410112110122210-2578634786100588787528 Q gi|254780420|r 241 DIMSL--QPDGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLLGLALG 295 (396) Q Consensus 241 ~i~~~--~pdgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l~~a~G 295 (396) +++.- +.++||-.|.+ .... +...+++. ..++++|.|+.---+.+...-. T Consensus 183 ~~L~~~p~i~~i~~~nD~---~a~G--a~~Al~~~G~~~~v~vvG~D~~~~~~~~i~~ 235 (282) T cd06318 183 DLLVAHPDINVVYSENDD---MALG--AMRVLAEAGKTDDVKVAAADGQKEALALIKG 235 (282) T ss_pred HHHHHCCCCEEEEECCCH---HHHH--HHHHHHHCCCCCCEEEEEECCCHHHHHHHHC T ss_conf 999758997099988967---9999--9999996699998299998988999999983 No 153 >PRK01438 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=81.15 E-value=3.7 Score=21.79 Aligned_cols=30 Identities=20% Similarity=0.191 Sum_probs=17.5 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCC Q ss_conf 66404654434552417989999742783211257 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGID 127 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GID 127 (396) +||...-..++|. +-.+.++++||-|+.+| T Consensus 80 ~vV~SPGI~~~~p-----~~~~a~~~gi~i~~eie 109 (481) T PRK01438 80 LVVTSPGWRPTHP-----LLAAAAEAGIPVWGDVE 109 (481) T ss_pred EEEECCCCCCCCH-----HHHHHHHCCCCEECHHH T ss_conf 9998997899899-----99999986993861899 No 154 >PRK03378 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=80.81 E-value=2 Score=23.71 Aligned_cols=33 Identities=18% Similarity=0.356 Sum_probs=22.0 Q ss_pred EEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEEC Q ss_conf 23666404654434552417989999742783211 Q gi|254780420|r 90 AVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLS 124 (396) Q Consensus 90 ~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~ 124 (396) .-|+|.+.. .|.-+.....|.+||+++++-.+- T Consensus 7 ~IgIi~k~~--~~~a~~~~~~l~~~L~~~g~~v~l 39 (292) T PRK03378 7 CIGIVGHPR--HPTALTTHEMLYRWLCTKGYEVIV 39 (292) T ss_pred EEEEEEECC--CHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 999995279--858999999999999978998998 No 155 >PRK11197 lldD L-lactate dehydrogenase; Provisional Probab=80.49 E-value=5.1 Score=20.83 Aligned_cols=98 Identities=24% Similarity=0.335 Sum_probs=61.0 Q ss_pred EECCCCCHHHHHHHHHCCCCEEEECCCCHHHHH---HCCCCEEEECCCCCC-----CCCCCCCCHHHHHHHCCCCCCEE- Q ss_conf 967964102322222024541330465744644---135788998588888-----41011211012221025786347- Q gi|254780420|r 211 IDYGIRSNLLRVLSSLGCRITIVGAETSYKDIM---SLQPDGVLLSNGPGD-----PAVTSVYSSPIICKLVDSGMPMF- 281 (396) Q Consensus 211 IDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~---~~~pdgi~lSnGPGd-----P~~~~~~~~~~~~~~~~~~iPil- 281 (396) +|=.+-+.-++.|++.=---.|+---.+.+|-. ..--|||++||==|. |+..+ ..+.+.+...+++||+ T Consensus 228 ~d~~ltW~di~wlr~~w~~plvlKGIl~~eDA~~A~~~G~dgIiVSNHGGRQLD~apa~i~--~LpeI~~aV~~~~~V~~ 305 (381) T PRK11197 228 FDPSISWKDLEWIREFWDGPMVIKGILDPEDARDAVRFGADGIVVSNHGGRQLDGVLSSAR--ALPAIADAVKGDIAILA 305 (381) T ss_pred CCCCCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHH--HHHHHHHHHCCCCEEEE T ss_conf 5888999999999987299767852588999999996699889995776321567844899--99999998678973999 Q ss_pred --EEECHHHHH-HHHCCCCCCCC---------CCCCCCCCC Q ss_conf --861005887-87528544667---------767521011 Q gi|254780420|r 282 --GICLGHQLL-GLALGAQTVKM---------YQGHHGVNH 310 (396) Q Consensus 282 --GIClGhQ~l-~~a~G~~~~kl---------~~GhrG~N~ 310 (396) ||--|-.++ |+|+|++..=+ -.|..|..| T Consensus 306 DgGiRrG~DV~KALALGA~aV~vGRp~lygLaa~G~~GV~~ 346 (381) T PRK11197 306 DSGIRNGLDVVRMIALGADTVLLGRAFVYALAAAGQAGVAN 346 (381) T ss_pred ECCCCCHHHHHHHHHCCCCEEEECHHHHHHHHHCCHHHHHH T ss_conf 68978668999999769988976759999877133889999 No 156 >PRK02649 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=80.25 E-value=1.8 Score=24.04 Aligned_cols=36 Identities=11% Similarity=0.155 Sum_probs=23.4 Q ss_pred EEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCH Q ss_conf 36664046544345524179899997427832112576 Q gi|254780420|r 91 VGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDT 128 (396) Q Consensus 91 ~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDT 128 (396) -|+|.+.... ........+.+||+++|+..+-.=++ T Consensus 4 iGIi~~~~~~--~a~~~a~~l~~~L~~~gi~v~l~~~~ 39 (305) T PRK02649 4 AGIIYNDGKP--LAVRTAEELQDKLEAAGWEVVRASSS 39 (305) T ss_pred EEEEECCCCH--HHHHHHHHHHHHHHHCCCEEEEECCH T ss_conf 9999738998--99999999999999889999997441 No 157 >cd03332 LMO_FMN L-Lactate 2-monooxygenase (LMO) FMN-binding domain. LMO is a FMN-containing enzyme that catalyzes the conversion of L-lactate and oxygen to acetate, carbon dioxide, and water. LMO is a member of the family of alpha-hydroxy acid oxidases. It is thought to be a homooctamer with two- and four- fold axes in the center of the octamer. Probab=80.05 E-value=5.3 Score=20.74 Aligned_cols=86 Identities=24% Similarity=0.319 Sum_probs=55.9 Q ss_pred EECCCCCHHHHHHHHHCCC-CEEEECCCCHHHH---HHCCCCEEEECCCCCC-----CCCCCCCCHHHHHHHCCCCCCEE Q ss_conf 9679641023222220245-4133046574464---4135788998588888-----41011211012221025786347 Q gi|254780420|r 211 IDYGIRSNLLRVLSSLGCR-ITIVGAETSYKDI---MSLQPDGVLLSNGPGD-----PAVTSVYSSPIICKLVDSGMPMF 281 (396) Q Consensus 211 IDfGiK~nIlr~L~~~g~~-v~Vvp~~~~~~~i---~~~~pdgi~lSnGPGd-----P~~~~~~~~~~~~~~~~~~iPil 281 (396) .|-..-+.-++.|++. .. -.|+--=.+.+|- .+.--|||++||==|. |+..+ ..+.+.+...+++||+ T Consensus 236 ~d~~ltW~di~wlr~~-w~~plilKGI~~~eDA~~A~~~G~dgIiVSNHGGRQLD~apa~i~--~LpeI~~aV~~~~~V~ 312 (383) T cd03332 236 SGPSLTWEDLAFLREW-TDLPIVLKGILHPDDARRAVEAGVDGVVVSNHGGRQVDGSIAALD--ALPEIVEAVGDRLTVL 312 (383) T ss_pred CCCCCCHHHHHHHHHH-CCCCEEECCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHH--HHHHHHHHHCCCCEEE T ss_conf 7888998999999987-699853235689999999997599889980786344678832789--9999999847998499 Q ss_pred ---EEECHHHHH-HHHCCCCCC Q ss_conf ---861005887-875285446 Q gi|254780420|r 282 ---GICLGHQLL-GLALGAQTV 299 (396) Q Consensus 282 ---GIClGhQ~l-~~a~G~~~~ 299 (396) ||--|--++ |+|+||+.. T Consensus 313 ~DgGIRrG~DV~KAlALGA~~V 334 (383) T cd03332 313 FDSGVRTGADIMKALALGAKAV 334 (383) T ss_pred EECCCCCHHHHHHHHHCCCCEE T ss_conf 9799786799999997699989 No 158 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=80.04 E-value=2.4 Score=23.15 Aligned_cols=47 Identities=13% Similarity=0.193 Sum_probs=23.3 Q ss_pred HHHHHCCC---CEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHHH Q ss_conf 46441357---88998588888410112110122210-257863478610058878 Q gi|254780420|r 240 KDIMSLQP---DGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLLG 291 (396) Q Consensus 240 ~~i~~~~p---dgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l~ 291 (396) ++++.-+| |+||-.| |+.... +...++.. +.++++|.|+=---..+- T Consensus 202 e~~L~a~~~~id~V~a~N---D~mAlG--A~~Al~~aG~~~~v~V~G~Dg~~~al~ 252 (330) T PRK10355 202 ENALTANNNKIDAVVASN---DATAGG--AIQALSAQGLSGKVAISGQDADLAAIK 252 (330) T ss_pred HHHHHHCCCCEEEEEECC---CCHHHH--HHHHHHHCCCCCCCEEECCCCCHHHHH T ss_conf 999960998516999899---714899--999999669999976990279989999 No 159 >PRK10710 DNA-binding transcriptional regulator BaeR; Provisional Probab=80.02 E-value=5.1 Score=20.85 Aligned_cols=94 Identities=18% Similarity=0.277 Sum_probs=57.5 Q ss_pred CCCCCEEEEEE--CCCCCHHHHHHHHHCCCCEEEECCCCH-HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCC Q ss_conf 77771499996--796410232222202454133046574-464413578899858888841011211012221025786 Q gi|254780420|r 202 SDAKYHVVCID--YGIRSNLLRVLSSLGCRITIVGAETSY-KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGM 278 (396) Q Consensus 202 ~~~~~kIvvID--fGiK~nIlr~L~~~g~~v~Vvp~~~~~-~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~i 278 (396) .+.+.||.++| --+-..+...|...|+.|.+...-..+ +.+....||-|++-=.. |...+......+++. .++ T Consensus 7 ~~~~~kILlVEDd~~l~~~l~~~L~~~g~~V~~a~~~~eal~~~~~~~~DliilDi~l--p~~~Gl~l~~~lr~~--~~~ 82 (240) T PRK10710 7 DENTPRILIVEDEPKLGQLLIDYLRAASYAPTLISHGDEVLPYVRQTPPDLILLDLML--PGTDGLTLCREIRRF--SDI 82 (240) T ss_pred CCCCCEEEEEECCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHCCCCEEEECCCC--CCCCCCCCHHHHHCC--CCC T ss_conf 9998879999298999999999999889999998999999999973799899987999--888776321122115--764 Q ss_pred CEEEEECH----HHHHHHHCCCCCC Q ss_conf 34786100----5887875285446 Q gi|254780420|r 279 PMFGICLG----HQLLGLALGAQTV 299 (396) Q Consensus 279 PilGIClG----hQ~l~~a~G~~~~ 299 (396) ||+=+.-- .++.|+.+||.-| T Consensus 83 piI~lta~~~~~~~~~al~~GAddY 107 (240) T PRK10710 83 PIVMVTAKIEEIDRLLGLEIGADDY 107 (240) T ss_pred CEEEECCCCCHHHHHHHHHCCCCEE T ss_conf 6899816788899999998699876 No 160 >PRK01231 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=79.79 E-value=2 Score=23.62 Aligned_cols=36 Identities=11% Similarity=0.108 Sum_probs=22.9 Q ss_pred EEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCH Q ss_conf 36664046544345524179899997427832112576 Q gi|254780420|r 91 VGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDT 128 (396) Q Consensus 91 ~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDT 128 (396) .|+|.+.. .+.-+.....+.+||+++++-..-.=++ T Consensus 7 vgIv~k~~--~~~a~~~~~~l~~~L~~~gi~v~ld~~~ 42 (296) T PRK01231 7 IGLIGRLG--SSQVVETLRRLKRFLLDRHLHVILEEET 42 (296) T ss_pred EEEEEECC--CHHHHHHHHHHHHHHHHCCCEEEEEHHH T ss_conf 99998489--8799999999999998788999993247 No 161 >cd03136 GATase1_AraC_ArgR_like AraC transcriptional regulators having an N-terminal Type 1 glutamine amidotransferase (GATase1)-like domain. A subgroup of AraC transcriptional regulators having an N-terminal Type 1 glutamine amidotransferase (GATase1)-like domain. This group contains proteins similar to the Pseudomonas aeruginosa ArgR regulator. ArgR functions in the control of expression of certain genes of arginine biosynthesis and catabolism. AraC regulators are defined by a AraC-type helix-turn-helix DNA binding domain at their C-terminal. AraC family transcriptional regulators are widespread among bacteria and are involved in regulating diverse and important biological functions, including carbon metabolism, stress responses and virulence in different microorganisms. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with typical GATase1domains a reactive cys residue is found in some sequences in the sharp turn betwee Probab=79.72 E-value=3.1 Score=22.31 Aligned_cols=58 Identities=19% Similarity=0.192 Sum_probs=38.7 Q ss_pred CCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 465744644135788998588888410112110122210257863478610058878752 Q gi|254780420|r 235 AETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 235 ~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) .+.++++.. .+|.+++-+|++.+..........+++..+....|.++|-|-.+||.|- T Consensus 55 ~~~~~~~~~--~~D~liV~Gg~~~~~~~~~~~~~~L~~~~~~g~~v~svctGa~~LA~aG 112 (185) T cd03136 55 PDAALEDAP--PLDYLFVVGGLGARRAVTPALLAWLRRAARRGVALGGIDTGAFLLARAG 112 (185) T ss_pred CCCCHHHCC--CCCEEEECCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECHHHHHHHHCC T ss_conf 676700167--6789998788880214799999999999870998998647999999818 No 162 >COG2403 Predicted GTPase [General function prediction only] Probab=79.27 E-value=5.6 Score=20.57 Aligned_cols=29 Identities=14% Similarity=0.141 Sum_probs=15.0 Q ss_pred HHHHHCCCC--EECCCCH--------HHHHHHHHHCCCC Q ss_conf 999742783--2112576--------8999999853984 Q gi|254780420|r 113 AWLKSCGII--GLSGIDT--------RALTVWIRDYGVS 141 (396) Q Consensus 113 ~wL~~~~Ip--gI~GIDT--------RaLTk~IRe~G~~ 141 (396) +|+..-..| .|.|+-| |++.|.||+.|-- T Consensus 119 et~~~~ekPviaV~atrtg~GKsaVS~~v~r~l~ergyr 157 (449) T COG2403 119 ETMLKLEKPVIAVTATRTGVGKSAVSRYVARLLRERGYR 157 (449) T ss_pred HHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCCC T ss_conf 776651485599997236655678889999999866982 No 163 >PRK04539 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=79.12 E-value=2.9 Score=22.51 Aligned_cols=33 Identities=15% Similarity=0.233 Sum_probs=20.9 Q ss_pred EEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEEC Q ss_conf 23666404654434552417989999742783211 Q gi|254780420|r 90 AVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLS 124 (396) Q Consensus 90 ~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~ 124 (396) --|+|.+.-.. .-......+.+||+++++-.+- T Consensus 7 ~Igiv~k~~~~--~~~~~~~~l~~~L~~~g~~v~l 39 (296) T PRK04539 7 NIGIVTRPNTP--DIQDTAHTLITFLKQHGFTVYL 39 (296) T ss_pred EEEEEEECCCH--HHHHHHHHHHHHHHHCCCEEEE T ss_conf 89999708987--9999999999999978799999 No 164 >PRK03708 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=78.80 E-value=2.5 Score=23.02 Aligned_cols=37 Identities=19% Similarity=0.249 Sum_probs=21.9 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHH Q ss_conf 666404654434552417989999742783211257689 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRA 130 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRa 130 (396) |+|.+.... .-+..-..+.+||.++|+-.+-.-++.. T Consensus 4 gIv~n~~k~--~a~~~a~~l~~~L~~~g~~v~ld~~~~~ 40 (278) T PRK03708 4 GIVARRDKE--EALKLAYRVYDFLKVSGYEVVVDSDTYE 40 (278) T ss_pred EEEEECCCH--HHHHHHHHHHHHHHHCCCEEEEEHHHHH T ss_conf 999218998--9999999999999988998999747865 No 165 >cd06309 PBP1_YtfQ_like Periplasmic binding domain of ABC-type YtfQ-like transport systems. Periplasmic binding domain of ABC-type YtfQ-like transport systems. The YtfQ protein from Escherichia coli is up-regulated under glucose-limited conditions and shares homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes; however their ligand specificity is not determined experimentally. Probab=78.64 E-value=3.5 Score=22.02 Aligned_cols=40 Identities=13% Similarity=0.042 Sum_probs=16.6 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHC---CCCCCEEEEECHHHHH Q ss_conf 57889985888884101121101222102---5786347861005887 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLV---DSGMPMFGICLGHQLL 290 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~---~~~iPilGIClGhQ~l 290 (396) ++|+||-.|.. .... +...+++.= ..++.|.|+.-.-+.+ T Consensus 185 ~i~ai~~~nd~---~a~G--a~~al~~~G~~~~~di~ivg~D~~~~~~ 227 (273) T cd06309 185 DIDAVYAHNDE---MALG--AIQAIKAAGKKPGKDIKIVSIDGTKDAF 227 (273) T ss_pred CCCEEEECCCH---HHHH--HHHHHHHCCCCCCCCEEEEEECCCHHHH T ss_conf 88889867857---9999--9999998399999986999989999999 No 166 >cd06310 PBP1_ABC_sugar_binding_like_2 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=78.40 E-value=3.2 Score=22.30 Aligned_cols=43 Identities=7% Similarity=0.121 Sum_probs=21.2 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHHHHH Q ss_conf 5788998588888410112110122210-25786347861005887875 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l~~a 293 (396) ++++||-.|. ..... +...+++. +..++.|.|.-..-+.+..- T Consensus 183 di~ai~~~nd---~~a~g--a~~al~~~g~~~~i~vvg~D~~~~~~~~i 226 (273) T cd06310 183 DLKGIFGANE---GSAVG--AARAVRQAGKAGKVKVVGFDASPEQIDAL 226 (273) T ss_pred CCCEEEECCC---HHHHH--HHHHHHHCCCCCCEEEEEECCCHHHHHHH T ss_conf 8757984384---89999--99999977999993999989869999999 No 167 >TIGR00329 gcp metalloendopeptidase, putative, glycoprotease family; InterPro: IPR000905 Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site . The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This group of metallopeptidases belong to MEROPS peptidase family M22 (clan MK). The type example being O-sialoglycoprotein endopeptidase from Pasteurella haemolytica (Mannheimia haemolytica). O-Sialoglycoprotein endopeptidase is secreted by the bacterium P. haemolytica, and digests only proteins that are heavily sialylated, in particular those with sialylated serine and threonine residues . Substrate proteins include glycophorin A and leukocyte surface antigens CD34, CD43, CD44 and CD45 , . Removal of glycosylation, by treatment with neuraminidase, completely negates susceptibility to O-sialoglycoprotein endopeptidase digestion , . Sequence similarity searches have revealed other members of the M22 family, from yeast, Mycobacterium, Haemophilus influenzae and the cyanobacterium Synechocystis . The zinc-binding and catalytic residues of this family have not been determined, although the motif HMEGH may be a zinc-binding region .; GO: 0008450 O-sialoglycoprotein endopeptidase activity, 0006508 proteolysis. Probab=78.38 E-value=1.2 Score=25.37 Aligned_cols=88 Identities=20% Similarity=0.223 Sum_probs=50.4 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE--ECHHHHHHHHCCC-CCCCCCCCCCCCCCCCEECCCCEEEE Q ss_conf 57889985888884101121101222102578634786--1005887875285-44667767521011234778970999 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI--CLGHQLLGLALGA-QTVKMYQGHHGVNHPVKNISNGTVEI 322 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI--ClGhQ~l~~a~G~-~~~kl~~GhrG~N~Pv~~~~t~kv~i 322 (396) +.|.|=.|.|||=+-.....+.-..--.+..++|++|| |+||-.+++---. +..+++| |--+.+| T Consensus 74 ~I~~IA~t~~PGL~g~L~vGat~Ar~La~~l~kPligv~H~~GH~~~~~l~~~~~~~~~P~--------~~LlVSG---- 141 (337) T TIGR00329 74 EIDLIAVTSGPGLGGALIVGATFARSLALSLDKPLIGVNHLLGHIYAPLLDTNLNQLQFPF--------VSLLVSG---- 141 (337) T ss_pred CCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCCC--------EEEEEEC---- T ss_conf 0257898558798514689999999998987289551457899998874147774536673--------1677754---- Q ss_pred ECCCCCEEEEHHHCCCCCEEEEEECCC Q ss_conf 614576157034369982899998899 Q gi|254780420|r 323 VSMNHGFAIDASSLPAGLEETHISLFD 349 (396) Q Consensus 323 TsqNHg~~v~~~slp~~~~v~~~~~nD 349 (396) .|...+...+. .+.++...++.| T Consensus 142 ---GHT~~~~~~~~-~~~e~~geT~Dd 164 (337) T TIGR00329 142 ---GHTQIIAVKGI-GDYEILGETLDD 164 (337) T ss_pred ---CEEEEEEEECC-CCEEEEEEEHHH T ss_conf ---01153223046-757898520224 No 168 >cd02067 B12-binding B12 binding domain (B12-BD). This domain binds different cobalamid derivates, like B12 (adenosylcobamide) or methylcobalamin or methyl-Co(III) 5-hydroxybenzimidazolylcobamide, it is found in several enzymes, such as glutamate mutase, methionine synthase and methylmalonyl-CoA mutase. Cobalamin undergoes a conformational change on binding the protein; the dimethylbenzimidazole group, which is coordinated to the cobalt in the free cofactor, moves away from the corrin and is replaced by a histidine contributed by the protein. The sequence Asp-X-His-X-X-Gly, which contains this histidine ligand, is conserved in many cobalamin-binding proteins. Probab=77.95 E-value=2.5 Score=23.00 Aligned_cols=84 Identities=14% Similarity=0.086 Sum_probs=49.4 Q ss_pred ECCCCCHHHHHHHHHCCCCEEEECCCCHHHHH----HCCCCEEEEC--CCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEC Q ss_conf 67964102322222024541330465744644----1357889985--88888410112110122210257863478610 Q gi|254780420|r 212 DYGIRSNLLRVLSSLGCRITIVGAETSYKDIM----SLQPDGVLLS--NGPGDPAVTSVYSSPIICKLVDSGMPMFGICL 285 (396) Q Consensus 212 DfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~----~~~pdgi~lS--nGPGdP~~~~~~~~~~~~~~~~~~iPilGICl 285 (396) |.|.+ -+-+.|...|++|+-+..+.+.+++. +.+||.|.|| .+..-|. .. ..++.+++.-..++||+.=.. T Consensus 13 ~iG~~-iv~~~l~~~G~~V~~lG~~vp~e~~v~~a~~~~~d~I~lS~~~~~~~~~-~~-~~i~~l~~~g~~~i~v~vGG~ 89 (119) T cd02067 13 DIGKN-IVARALRDAGFEVIDLGVDVPPEEIVEAAKEEDADAIGLSGLLTTHMTL-MK-EVIEELKEAGLDDIPVLVGGA 89 (119) T ss_pred HHHHH-HHHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCCCHHH-HH-HHHHHHHHCCCCCCEEEEECC T ss_conf 78999-9999999789989989999999999999997099999996220242689-99-999999976999985999899 Q ss_pred H---HHHHHHHCCCCC Q ss_conf 0---588787528544 Q gi|254780420|r 286 G---HQLLGLALGAQT 298 (396) Q Consensus 286 G---hQ~l~~a~G~~~ 298 (396) - .+.++...|+.- T Consensus 90 ~~~~~~~~~~~~Gad~ 105 (119) T cd02067 90 IVTRDFKFLKEIGVDA 105 (119) T ss_pred CCCHHHHHHHHCCCCE T ss_conf 8974399999869979 No 169 >PRK04759 consensus Probab=77.78 E-value=3.3 Score=22.21 Aligned_cols=35 Identities=20% Similarity=0.412 Sum_probs=20.8 Q ss_pred EEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCH Q ss_conf 236664046544345524179899997427832112576 Q gi|254780420|r 90 AVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDT 128 (396) Q Consensus 90 ~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDT 128 (396) ..|+|.+-. .+.-......+.+||.++++-.. +|. T Consensus 7 ~I~Iv~k~~--~~~~~~~~~~l~~~L~~~g~~v~--vd~ 41 (294) T PRK04759 7 VIAIIGKPR--DQQAIQTHKELYHWLTSLGYTVF--IDD 41 (294) T ss_pred EEEEEEECC--CHHHHHHHHHHHHHHHHCCCEEE--ECH T ss_conf 999994179--97999999999999986899999--916 No 170 >cd05197 GH4_glycoside_hydrolases Glycoside Hydrases Family 4. Glycoside hydrolases cleave glycosidic bonds to release smaller sugars from oligo- or polysaccharides. Some bacteria simultaneously translocate and phosphorylate disaccharides via the phosphoenolpyruvate-dependent phosphotransferase system (PEP-PTS). After translocation, these phospho-disaccharides may be hydrolyzed by GH4 glycoside hydrolases. Other organisms (such as archaea and Thermotoga maritima) lack the PEP-PTS system, but have several enzymes normally associated with the PEP-PTS operon. GH4 family members include 6-phospho-beta-glucosidases, 6-phospho-alpha-glucosidases, alpha-glucosidases/alpha-glucuronidases (only from Thermotoga), and alpha-galactosidases. They require two cofactors, NAD+ and a divalent metal (Mn2+, Ni2+, Mg2+), for activity. Some also require reducing conditions. GH4 glycoside hydrolases are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families in Probab=77.66 E-value=2.3 Score=23.26 Aligned_cols=82 Identities=21% Similarity=0.267 Sum_probs=52.4 Q ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHH----HHHHH Q ss_conf 02322222024541330465744644135788998588888410112110122210257863478610058----87875 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQ----LLGLA 293 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ----~l~~a 293 (396) .|.|.|+..-.-+.+ +++|..+.||+++|- --+|... +.+.+.+.. .++.+.|+|=+.| .||.. T Consensus 112 G~~~alRtIPv~l~i------a~~i~e~cP~AwliN--ytNP~~i---vt~al~~~~-~~~k~vGLCh~~~~~~~~la~~ 179 (425) T cd05197 112 GTFSGLRQIPYVLDI------ARKXEKLSPDAWYLN--FTNPAGE---VTEAVRRYV-PPEKAVGLCNVPIGVMEIVAKL 179 (425) T ss_pred HHHHHHHHHHHHHHH------HHHHHHHCCCEEEEE--CCCHHHH---HHHHHHHHC-CCCCEEEECCCHHHHHHHHHHH T ss_conf 999998429999999------999998699808997--3787899---999999748-8884898586679999999999 Q ss_pred CCCCCCCCCCCCCCCCCC Q ss_conf 285446677675210112 Q gi|254780420|r 294 LGAQTVKMYQGHHGVNHP 311 (396) Q Consensus 294 ~G~~~~kl~~GhrG~N~P 311 (396) +|..-.++.+-.-|.||= T Consensus 180 lg~~~~~i~~~~~GlNH~ 197 (425) T cd05197 180 LGESEEKVDWQYAGLNHG 197 (425) T ss_pred HCCCHHHCEEEEEEECCH T ss_conf 499889967999976646 No 171 >PRK05294 carB carbamoyl phosphate synthase large subunit; Reviewed Probab=77.40 E-value=6.3 Score=20.20 Aligned_cols=75 Identities=20% Similarity=0.368 Sum_probs=47.1 Q ss_pred CCCCCEEEEEECC-------------CCCHHHHHHHHHCCCCEEEECC-----CC-------------HHHHH----HCC Q ss_conf 7777149999679-------------6410232222202454133046-----57-------------44644----135 Q gi|254780420|r 202 SDAKYHVVCIDYG-------------IRSNLLRVLSSLGCRITIVGAE-----TS-------------YKDIM----SLQ 246 (396) Q Consensus 202 ~~~~~kIvvIDfG-------------iK~nIlr~L~~~g~~v~Vvp~~-----~~-------------~~~i~----~~~ 246 (396) +..+.+|++|-.| .=+ -++.|++.|.+++++.+| |+ .|.|+ ..+ T Consensus 550 ~~~~~kvlvlGsGp~rIGqgiEFDY~~v~-a~~alk~~G~~~imIN~NPETVSTD~d~sDrLYFEplt~E~V~~I~~~E~ 628 (1063) T PRK05294 550 PSDRKKVMILGGGPNRIGQGIEFDYCCVH-AALALREAGYETIMVNCNPETVSTDYDTSDRLYFEPLTLEDVLEIIEKEK 628 (1063) T ss_pred CCCCCCEEEECCCCCCCCCCCCCHHHHHH-HHHHHHHCCCEEEEECCCCCCCCCCCCCCCEEEECCCCHHHHHHHHHHHC T ss_conf 77886379965777320444230067999-99999965981589537811111565667614543688999999999758 Q ss_pred CCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 78899858888841011211012221025786347861 Q gi|254780420|r 247 PDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 247 pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) |+||+++=|=-. ..++...+.+.+++|||-- T Consensus 629 p~gvi~qfGGQt-------~~nla~~L~~~g~~ilGts 659 (1063) T PRK05294 629 PKGVIVQFGGQT-------PLKLAKALEAAGVPILGTS 659 (1063) T ss_pred CCEEEEECCCCC-------HHHHHHHHHHCCCCEECCC T ss_conf 987999738836-------7899999998799477788 No 172 >cd04737 LOX_like_FMN L-Lactate oxidase (LOX) FMN-binding domain. LOX is a member of the family of FMN-containing alpha-hydroxyacid oxidases and catalyzes the oxidation of l-lactate using molecular oxygen to generate pyruvate and H2O2. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). Probab=77.37 E-value=3.4 Score=22.10 Aligned_cols=82 Identities=28% Similarity=0.424 Sum_probs=51.2 Q ss_pred CCHHHHHHHHHCCCCEEEECCCCHHHH---HHCCCCEEEECCCCCC-----CCCCCCCCHHHHHHHCCCCCCEE---EEE Q ss_conf 410232222202454133046574464---4135788998588888-----41011211012221025786347---861 Q gi|254780420|r 216 RSNLLRVLSSLGCRITIVGAETSYKDI---MSLQPDGVLLSNGPGD-----PAVTSVYSSPIICKLVDSGMPMF---GIC 284 (396) Q Consensus 216 K~nIlr~L~~~g~~v~Vvp~~~~~~~i---~~~~pdgi~lSnGPGd-----P~~~~~~~~~~~~~~~~~~iPil---GIC 284 (396) -+.-++.|++.----.|+--=.+.+|- ...--|||++||=-|. |+..+ ..+.+.+...+++||+ ||= T Consensus 209 ~w~di~~lr~~~~lplilKGI~~~eDA~~A~~~G~dgIvVSNHGGRQLD~~p~~i~--~LpeI~~av~~~~~V~~DgGIR 286 (351) T cd04737 209 SPADIEFIAKISGLPVIVKGIQSPEDADVAINAGADGIWVSNHGGRQLDGGPASFD--SLPEIAEAVNHRVPIIFDSGVR 286 (351) T ss_pred CHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHH--HHHHHHHHHCCCCEEEECCCCC T ss_conf 98999999986499853236677999999987499889977875123567604788--9999999866896499769867 Q ss_pred CHHHHH-HHHCCCCCC Q ss_conf 005887-875285446 Q gi|254780420|r 285 LGHQLL-GLALGAQTV 299 (396) Q Consensus 285 lGhQ~l-~~a~G~~~~ 299 (396) -|--++ |+|+||+-. T Consensus 287 ~G~DV~KALALGA~aV 302 (351) T cd04737 287 RGEHVFKALASGADAV 302 (351) T ss_pred CHHHHHHHHHCCCCEE T ss_conf 4689999997699889 No 173 >cd06313 PBP1_ABC_sugar_binding_like_5 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=77.21 E-value=3.1 Score=22.38 Aligned_cols=52 Identities=13% Similarity=0.109 Sum_probs=27.8 Q ss_pred HHHHHC--CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCC Q ss_conf 464413--578899858888841011211012221025786347861005887875285 Q gi|254780420|r 240 KDIMSL--QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGA 296 (396) Q Consensus 240 ~~i~~~--~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~ 296 (396) ++++.- +.|+||-.| |+.... +...++..=.+++.|.|+.-.-+.+..-..| T Consensus 175 ~~~L~~~pdid~I~~~n---D~~a~G--a~~Al~~aG~~~v~v~g~Dg~~~~l~~I~~G 228 (272) T cd06313 175 ETWLTKYPQLDGAFCHN---DSMALA--AYQIMKAAGRTKIVIGGVDGDPPAIQAVSDG 228 (272) T ss_pred HHHHHHCCCCCEEEECC---CHHHHH--HHHHHHHCCCCCCEEEEECCCHHHHHHHHCC T ss_conf 99996499987899888---477899--9999997699986899978999999999859 No 174 >cd06316 PBP1_ABC_sugar_binding_like_7 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=77.19 E-value=3 Score=22.43 Aligned_cols=89 Identities=19% Similarity=0.192 Sum_probs=46.3 Q ss_pred CCCEEEEEECCCCCHH--HH------HHHHHCCCCEEEECC--CC-------HHHHHHCCC--CEEEECCCCCCCCCCCC Q ss_conf 7714999967964102--32------222202454133046--57-------446441357--88998588888410112 Q gi|254780420|r 204 AKYHVVCIDYGIRSNL--LR------VLSSLGCRITIVGAE--TS-------YKDIMSLQP--DGVLLSNGPGDPAVTSV 264 (396) Q Consensus 204 ~~~kIvvIDfGiK~nI--lr------~L~~~g~~v~Vvp~~--~~-------~~~i~~~~p--dgi~lSnGPGdP~~~~~ 264 (396) .+-+|+++..+..... -| .|.+.+.+++|+... .. .++++.-+| |+||-.| |+.... T Consensus 125 ~~g~v~ii~~~~~~~~~~~R~~Gf~~~l~~~~p~i~iv~~~~~~~~~~a~~~~~~~L~~~Pdi~~I~~~n---d~~a~G- 200 (294) T cd06316 125 GKGKVGLIYHGADYFVTNQRDQGFKETIKKNYPDITIVAEKGIDGPSKAEDIANAMLTQNPDLKGIYAVW---DVPAEG- 200 (294) T ss_pred CCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCCCEEEECC---CHHHHH- T ss_conf 6880899975899768999999999999720897299984388888999999999997399988899868---636599- Q ss_pred CCHHHHHHHCCCCCCEEEEECHHHH-HHHHCCCC Q ss_conf 1101222102578634786100588-78752854 Q gi|254780420|r 265 YSSPIICKLVDSGMPMFGICLGHQL-LGLALGAQ 297 (396) Q Consensus 265 ~~~~~~~~~~~~~iPilGIClGhQ~-l~~a~G~~ 297 (396) +...++..=...++|.|+..|... +...-++. T Consensus 201 -a~~Al~~aG~~~v~ivg~d~g~~~~~~~i~~G~ 233 (294) T cd06316 201 -VIAALRAAGRDDIKVTTVDLGLNVALDMAKGGN 233 (294) T ss_pred -HHHHHHHCCCCCCEEEEEECCCHHHHHHHHCCC T ss_conf -999999819998379999589689999997799 No 175 >PRK01911 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=77.01 E-value=2.7 Score=22.75 Aligned_cols=16 Identities=25% Similarity=0.571 Sum_probs=9.0 Q ss_pred HHHCCCCEEEECCCCC Q ss_conf 4413578899858888 Q gi|254780420|r 242 IMSLQPDGVLLSNGPG 257 (396) Q Consensus 242 i~~~~pdgi~lSnGPG 257 (396) +..+.-||+++|---| T Consensus 169 ~~~~~~DGlIvsTPTG 184 (290) T PRK01911 169 LTSYWADGLIIATPTG 184 (290) T ss_pred EEEEECCEEEEECCCC T ss_conf 8999536799967885 No 176 >COG0771 MurD UDP-N-acetylmuramoylalanine-D-glutamate ligase [Cell envelope biogenesis, outer membrane] Probab=76.68 E-value=6.4 Score=20.14 Aligned_cols=24 Identities=17% Similarity=0.451 Sum_probs=13.3 Q ss_pred HCCCCEECCCC-----HHHHHHHHHHCCC Q ss_conf 42783211257-----6899999985398 Q gi|254780420|r 117 SCGIIGLSGID-----TRALTVWIRDYGV 140 (396) Q Consensus 117 ~~~IpgI~GID-----TRaLTk~IRe~G~ 140 (396) ...+.+|+|=+ |..+..+++..|- T Consensus 109 ~~p~vaITGTNGKTTTTsli~~~l~~~G~ 137 (448) T COG0771 109 EAPIVAITGTNGKTTTTSLIAHLLKAAGL 137 (448) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHCCC T ss_conf 99989997999628899999999985599 No 177 >cd03147 GATase1_Ydr533c_like Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein. Type 1 glutamine amidotransferase (GATase1)-like domain found in Saccharomyces cerevisiae Ydr533c protein. This group includes proteins similar to S. cerevisiae Ydr533c. Ydr533c is upregulated in response to various stress conditions along with the heat shock family. The catalytic triad typical of GATase1domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. This Cys together with a different His and Glu residue form a different catalytic triad from the typical GATase1domain. Ydr533c protein is a homodimer. Probab=76.41 E-value=3.5 Score=21.98 Aligned_cols=57 Identities=12% Similarity=0.132 Sum_probs=40.9 Q ss_pred CCCHHHHHHCCCCEEEECCCCCCCCCC--CCCCHHHHHHHCCCCCCEEEEECHHHHHHH Q ss_conf 657446441357889985888884101--121101222102578634786100588787 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNGPGDPAVT--SVYSSPIICKLVDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnGPGdP~~~--~~~~~~~~~~~~~~~iPilGIClGhQ~l~~ 292 (396) ..+++++...+||+||+-+|-|--.+. +.....++..+.+.+.||-.||-|-=+|.- T Consensus 84 t~~l~~v~~~~YdaVf~pGGhG~m~Dl~~~~~l~~ll~~~~~~gk~vaaVCHGpaaLl~ 142 (231) T cd03147 84 IKKADEVNPDDYGIFFVAGGHGTLFDFPHATNLQKIAQQIYANGGVVAAVCHGPAILAN 142 (231) T ss_pred CCCHHHCCHHHCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCHHHHHHH T ss_conf 68956889878259998898853032423999999999999759979998068899761 No 178 >PRK00683 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=76.11 E-value=4.9 Score=20.93 Aligned_cols=25 Identities=20% Similarity=0.391 Sum_probs=12.4 Q ss_pred HHCCCCEECCCC-----HHHHHHHHHHCCC Q ss_conf 742783211257-----6899999985398 Q gi|254780420|r 116 KSCGIIGLSGID-----TRALTVWIRDYGV 140 (396) Q Consensus 116 ~~~~IpgI~GID-----TRaLTk~IRe~G~ 140 (396) .+..+.||+|=. |-.|...++..|. T Consensus 100 ~~~~~IaITGTNGKTTTt~ll~~iL~~~g~ 129 (418) T PRK00683 100 QRYPSLGITGSTGKTTTILFLEHLLRTLGI 129 (418) T ss_pred CCCCEEEEECCCCCEEHHHHHHHHHHHCCC T ss_conf 479879997689866099999999986799 No 179 >cd02072 Glm_B12_BD B12 binding domain of glutamate mutase (Glm). Glutamate mutase catalysis the conversion of (S)-glutamate with (2S,3S)-3-methylaspartate. The rearrangement reaction is initiated by the extraction of a hydrogen from the protein-bound substrate by a 5'-desoxyadenosyl radical, which is generated by the homolytic cleavage of the organometallic bond of the cofactor B12. Glm is a heterotetrameric molecule consisting of two alpha and two epsilon polypeptide chains. Probab=75.30 E-value=2.4 Score=23.09 Aligned_cols=45 Identities=18% Similarity=0.312 Sum_probs=33.1 Q ss_pred ECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHH----CCCCEEEECCCCC Q ss_conf 679641023222220245413304657446441----3578899858888 Q gi|254780420|r 212 DYGIRSNLLRVLSSLGCRITIVGAETSYKDIMS----LQPDGVLLSNGPG 257 (396) Q Consensus 212 DfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~----~~pdgi~lSnGPG 257 (396) |.|.| =|-+.|...|++|.-...+.+.+++.+ .++|.|.+|.=-| T Consensus 13 d~G~~-iva~~l~~~GfeVi~lG~~~~~e~~~~~a~e~~ad~i~vSsl~g 61 (128) T cd02072 13 AVGNK-ILDHAFTEAGFNVVNLGVLSPQEEFIDAAIETDADAILVSSLYG 61 (128) T ss_pred HHHHH-HHHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCCEEEECCCCC T ss_conf 78899-99999997897298479889999999999873999999823202 No 180 >TIGR01163 rpe ribulose-phosphate 3-epimerase; InterPro: IPR000056 Ribulose-phosphate 3-epimerase (5.1.3.1 from EC) (also known as pentose-5-phosphate 3-epimerase or PPE) is the enzyme that converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. In Alcaligenes eutrophus two copies of the gene coding for PPE are known , one is chromosomally encoded P40117 from SWISSPROT, the other one is on a plasmid Q04539 from SWISSPROT. PPE has been found in a wide range of bacteria, archaebacteria, fungi and plants. All the proteins have from 209 to 241 amino acid residues. The enzyme has a TIM barrel structure.; GO: 0004750 ribulose-phosphate 3-epimerase activity, 0005975 carbohydrate metabolic process. Probab=74.83 E-value=7.3 Score=19.74 Aligned_cols=96 Identities=19% Similarity=0.251 Sum_probs=62.9 Q ss_pred CCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHH---------CCCCCCCCHHHCCCCCCCEECCCCCCCCCCCC Q ss_conf 576899999985398423663276545897898887741---------58874321011013333211168653445434 Q gi|254780420|r 126 IDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKN---------WSGLKGAELAKHVTVSQRRDWSEKIWKWGEET 196 (396) Q Consensus 126 IDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~---------~~~~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~ 196 (396) ...-++.+.||+.|. +|.|+-.|.. .++.+...+.. .|+..|+.....+.-| .+...+.+. T Consensus 93 ~h~~R~l~~Ik~~G~-~AG~v~NP~T--Pl~~~~~~L~~~D~VLlMSVnPGFgGQkFIP~~~~K-ir~~R~~id------ 162 (216) T TIGR01163 93 EHIHRLLQLIKELGA-KAGIVLNPAT--PLEALEYVLEDVDLVLLMSVNPGFGGQKFIPETLEK-IRELRKMID------ 162 (216) T ss_pred CCHHHHHHHHHHCCC-CEEEEECCCC--CHHHHHHHHHHCCEEEEEEEECCCCCCCCHHHHHHH-HHHHHHHHH------ T ss_conf 267999999997189-7068867999--987899898762989988760799884110578999-999999998------ Q ss_pred CCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEE Q ss_conf 43467777714999967964102322222024541330 Q gi|254780420|r 197 SFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVG 234 (396) Q Consensus 197 ~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp 234 (396) +...+..--.-+|=|+|.+=.+.+.+-|+++-|-- T Consensus 163 ---~~~~~~~~~ieVDGGv~~~ni~~~~~AGAD~~VaG 197 (216) T TIGR01163 163 ---KLELGLSILIEVDGGVNEDNIAEVAEAGADILVAG 197 (216) T ss_pred ---HHCCCCCEEEEECCCCCHHHHHHHHHCCCCEEEEE T ss_conf ---60279955899717989767999997589899983 No 181 >cd02071 MM_CoA_mut_B12_BD methylmalonyl CoA mutase B12 binding domain. This domain binds to B12 (adenosylcobamide), which initiates the conversion of succinyl CoA and methylmalonyl CoA by forming an adenosyl radical, which then undergoes a rearrangement exchanging a hydrogen atom with a group attached to a neighboring carbon atom. This family is present in both mammals and bacteria. Bacterial members are heterodimers and involved in the fermentation of pyruvate to propionate. Mammalian members are homodimers and responsible for the conversion of odd-chain fatty acids and branched-chain amino acids via propionyl CoA to succinyl CoA for further degradation. Probab=74.63 E-value=2.4 Score=23.07 Aligned_cols=45 Identities=24% Similarity=0.347 Sum_probs=34.5 Q ss_pred ECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHH----CCCCEEEECCCCC Q ss_conf 679641023222220245413304657446441----3578899858888 Q gi|254780420|r 212 DYGIRSNLLRVLSSLGCRITIVGAETSYKDIMS----LQPDGVLLSNGPG 257 (396) Q Consensus 212 DfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~----~~pdgi~lSnGPG 257 (396) |-|.| =+-+.|...|++|+-.+.+.+++++.+ .++|.|-+|.=-| T Consensus 13 d~G~~-iva~~l~d~GfeVi~lG~~~s~eeiv~~A~~e~ad~IglSsL~g 61 (122) T cd02071 13 DRGAK-VIARALRDAGFEVIYTGLRQTPEEIVEAAIQEDVDVIGLSSLSG 61 (122) T ss_pred HHHHH-HHHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEECCCC T ss_conf 87799-99999997897699679988999999999973998999964655 No 182 >PRK10955 DNA-binding transcriptional regulator CpxR; Provisional Probab=74.62 E-value=7.4 Score=19.70 Aligned_cols=89 Identities=19% Similarity=0.245 Sum_probs=42.9 Q ss_pred EEEEEE--CCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 499996--796410232222202454133046574464413578899858888841011211012221025786347861 Q gi|254780420|r 207 HVVCID--YGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 207 kIvvID--fGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) ||+++| --+-..+.+.|...|++|.....-..+-++++.+||-|+|--+- |...+......+++ ...+||+=+. T Consensus 3 kILlVEDd~~l~~~l~~~L~~~G~~V~~a~~g~~al~~l~~~~DlvilDi~L--P~~dG~~l~~~iR~--~~~~PII~lt 78 (232) T PRK10955 3 KILLVDDDRELTSLLKELLEMEGFNVIVAHDGEQALDLLDDSIDLLLLDVMM--PKKNGIDTLKALRQ--THQTPVIMLT 78 (232) T ss_pred EEEEEECCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCEEEEECCC--CCCCCCCCCCEEEC--CCCCCEEEEE T ss_conf 8999969899999999999888999999899999999964898999991899--98886723012440--7888789980 Q ss_pred ----CHHHHHHHHCCCCCC Q ss_conf ----005887875285446 Q gi|254780420|r 285 ----LGHQLLGLALGAQTV 299 (396) Q Consensus 285 ----lGhQ~l~~a~G~~~~ 299 (396) .-.++.|+..||.-| T Consensus 79 a~~~~~d~i~~l~~GADDY 97 (232) T PRK10955 79 ARGSELDRVLGLELGADDY 97 (232) T ss_pred CCCCHHHHHHHHHCCCCEE T ss_conf 5567688999997697576 No 183 >cd06301 PBP1_rhizopine_binding_like Periplasmic binding proteins specific to rhizopines. Periplasmic binding proteins specific to rhizopines, which are simple sugar-like compounds produced in the nodules induced by the symbiotic root nodule bacteria, such as Rhizobium and Sinorhizobium. Rhizopine-binding-like proteins from other bacteria are also included. Two inositol based rhizopine compounds are known to date: L-3-O-methly-scyllo-inosamine (3-O-MSI) and scyllo-inosamine. Bacterial strains that can metabolize rhizopine have a greater competitive advantage in nodulation and rhizopine synthesis is regulated by NifA/NtrA regulatory transcription activators which are maximally expressed at the onset of nitrogen fixation in bacteroids. The members of this group belong to the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily. Probab=74.57 E-value=5 Score=20.90 Aligned_cols=43 Identities=19% Similarity=0.170 Sum_probs=20.2 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHH--CCCCCCEEEEECHHHHHHHH Q ss_conf 5788998588888410112110122210--25786347861005887875 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKL--VDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~--~~~~iPilGIClGhQ~l~~a 293 (396) ++|+||-.|. ..... +...+++. ..+++.|.|...-.+.+..- T Consensus 183 ~~~aI~~~nd---~~a~G--a~~al~~~g~~~~~i~v~G~D~~~~~~~~i 227 (272) T cd06301 183 KIDAVVANND---EMALG--AIMALKAAGKSDKDVPVAGIDGTPDALAAV 227 (272) T ss_pred CCCEEEECCC---HHHHH--HHHHHHHCCCCCCCEEEEEECCCHHHHHHH T ss_conf 7787995582---78999--999999869899987999989859999998 No 184 >PRK11411 fecB iron-dicitrate transporter substrate-binding subunit; Provisional Probab=74.35 E-value=7.5 Score=19.66 Aligned_cols=17 Identities=29% Similarity=0.630 Sum_probs=11.3 Q ss_pred CCHHHHHHCCCCEEEEC Q ss_conf 57446441357889985 Q gi|254780420|r 237 TSYKDIMSLQPDGVLLS 253 (396) Q Consensus 237 ~~~~~i~~~~pdgi~lS 253 (396) .|.|.+...+||-||+. T Consensus 222 iS~E~l~~~dpD~I~v~ 238 (297) T PRK11411 222 ISLEQLLALNPDWLLVA 238 (297) T ss_pred CCHHHHHCCCCCEEEEE T ss_conf 07999840499989999 No 185 >PRK02472 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=73.78 E-value=7.7 Score=19.57 Aligned_cols=30 Identities=20% Similarity=0.359 Sum_probs=14.3 Q ss_pred HHHHHHHCCCCEECCCC-----HHHHHHHHHHCCC Q ss_conf 89999742783211257-----6899999985398 Q gi|254780420|r 111 FDAWLKSCGIIGLSGID-----TRALTVWIRDYGV 140 (396) Q Consensus 111 L~~wL~~~~IpgI~GID-----TRaLTk~IRe~G~ 140 (396) |...+.+..+.||+|=. |..|...|+..|- T Consensus 105 l~~~~~~~~~IaVTGTnGKTTTt~ll~~iL~~~g~ 139 (450) T PRK02472 105 LAYLISEAPIIGITGSNGKTTTTTLIAEMLNAGGQ 139 (450) T ss_pred HHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHCCC T ss_conf 88875159789982799973199999999997499 No 186 >cd06302 PBP1_LsrB_Quorum_Sensing Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs. Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs from other bacteria. The members of this group are homologous to a family of periplasmic pentose/hexose sugar-binding proteins that function as the primary receptors for chemotaxis and transporters of many sugar based solutes in bacteria and archaea and that are a member of the type I periplasmic binding protein superfamily. LsrB binds a chemically distinct form of the AI-2 signal that lacks boron, in contrast to the Vibrio harveyi AI-2 signaling molecule that has an unusual furanosyl borate diester. Hence, many bacteria coordinate their gene expression according to the local density of their population by producing species specific AI-2. This process of quorum sensing allows LsrB to function as a periplasmic AI-2 binding p Probab=73.70 E-value=4.5 Score=21.20 Aligned_cols=48 Identities=13% Similarity=0.336 Sum_probs=26.2 Q ss_pred HHHHHCCCC--EEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHHHH Q ss_conf 464413578--8998588888410112110122210-2578634786100588787 Q gi|254780420|r 240 KDIMSLQPD--GVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 240 ~~i~~~~pd--gi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l~~ 292 (396) ++++.-+|| +||-.|.+ .... +...++.. +.+++.|.|+..--+.+.. T Consensus 177 ~~~L~~~Pdi~~I~~~nd~---~a~G--a~~Al~~aG~~~~V~vvG~D~~~~~~~~ 227 (298) T cd06302 177 QELLKAYPDLKGIIGPTSV---GIPG--AARAVEEAGLKGKVAVTGLGLPNQMAPY 227 (298) T ss_pred HHHHHHCCCCCEEEECCCH---HHHH--HHHHHHHCCCCCCEEEEEECCCHHHHHH T ss_conf 9999759991299978981---6999--9999996799998599996887999999 No 187 >cd02809 alpha_hydroxyacid_oxid_FMN Family of homologous FMN-dependent alpha-hydroxyacid oxidizing enzymes. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). In green plants, glycolate oxidase is one of the key enzymes in photorespiration where it oxidizes glycolate to glyoxylate. LMO catalyzes the oxidation of L-lactate to acetate and carbon dioxide. MDH oxidizes (S)-mandelate to phenylglyoxalate. It is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. Probab=73.37 E-value=4.8 Score=21.05 Aligned_cols=82 Identities=29% Similarity=0.373 Sum_probs=49.1 Q ss_pred CHHHHHHHHHCCC-CEEEECCCCHHHHH---HCCCCEEEECCCCCC---CCCCCCCCHHHHHHHCCCCCCEE---EEECH Q ss_conf 1023222220245-41330465744644---135788998588888---41011211012221025786347---86100 Q gi|254780420|r 217 SNLLRVLSSLGCR-ITIVGAETSYKDIM---SLQPDGVLLSNGPGD---PAVTSVYSSPIICKLVDSGMPMF---GICLG 286 (396) Q Consensus 217 ~nIlr~L~~~g~~-v~Vvp~~~~~~~i~---~~~pdgi~lSnGPGd---P~~~~~~~~~~~~~~~~~~iPil---GIClG 286 (396) +.-++.|++. .. -.|+--=.+.+|.. +.--|||++||==|. -........+.+.+.+.+++||+ ||=-| T Consensus 161 w~~i~~l~~~-~~~p~i~KGi~~~~DA~~a~~~G~dgI~VSNHGGRqlD~~p~~i~~L~~i~~~v~~~~~v~~DgGvR~G 239 (299) T cd02809 161 WDDLAWLRSQ-WKGPLILKGILTPEDALRAVDAGADGIVVSNHGGRQLDGAPATIDALPEIVAAVGGRIEVLLDGGIRRG 239 (299) T ss_pred HHHHHHHHHH-CCCCEEEECCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCH T ss_conf 9999999986-699879972788999999998599889972887333688877899999999985467289971884753 Q ss_pred HHHH-HHHCCCCCC Q ss_conf 5887-875285446 Q gi|254780420|r 287 HQLL-GLALGAQTV 299 (396) Q Consensus 287 hQ~l-~~a~G~~~~ 299 (396) .-++ |+|+||+.. T Consensus 240 ~Dv~KAlaLGA~~V 253 (299) T cd02809 240 TDVLKALALGADAV 253 (299) T ss_pred HHHHHHHHCCCCEE T ss_conf 68999997699889 No 188 >cd01139 TroA_f Periplasmic binding protein TroA_f. These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains. Probab=72.96 E-value=7.8 Score=19.51 Aligned_cols=22 Identities=14% Similarity=0.300 Sum_probs=18.3 Q ss_pred CCCHHHHHHCCCCEEEECCCCC Q ss_conf 6574464413578899858888 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNGPG 257 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnGPG 257 (396) ..+.|.|++++||-||++++.+ T Consensus 223 ~Vs~Eqil~wnPDvIii~~~~~ 244 (342) T cd01139 223 ELNAEYVIAANPEIIIATGGNW 244 (342) T ss_pred CCCHHHHHHHCCCEEEEECCCC T ss_conf 4379998754888899967754 No 189 >cd06322 PBP1_ABC_sugar_binding_like_12 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consist of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=72.84 E-value=6 Score=20.33 Aligned_cols=34 Identities=15% Similarity=0.169 Sum_probs=16.4 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 578899858888841011211012221025786347861 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) ++|+||-+|. ..... +...+++.-...+.|.|.- T Consensus 179 ~~~aI~~~nD---~~A~G--a~~al~~~g~~di~vvGfD 212 (267) T cd06322 179 DLDGIFAFGD---DAALG--AVSAIKAAGRDNVKVIGFD 212 (267) T ss_pred CCCEEEECCC---HHHHH--HHHHHHHCCCCCCEEEEEC T ss_conf 7748998187---99999--9999997599998899994 No 190 >cd03138 GATase1_AraC_2 AraC transcriptional regulators having a Type 1 glutamine amidotransferase (GATase1)-like domain. A subgroup of AraC transcriptional regulators having a Type 1 glutamine amidotransferase (GATase1)-like domain. AraC regulators are defined by a AraC-type helix-turn-helix DNA binding domain at their C-terminal. AraC family transcriptional regulators are widespread among bacteria and are involved in regulating diverse and important biological functions, including carbon metabolism, stress responses and virulence in different microorganisms. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with typical GATase1domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Probab=72.79 E-value=5.2 Score=20.79 Aligned_cols=57 Identities=23% Similarity=0.315 Sum_probs=36.1 Q ss_pred CCCCHHHHHHCCCCEEEECCCCCCCCCCCC----CCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 465744644135788998588888410112----11012221025786347861005887875 Q gi|254780420|r 235 AETSYKDIMSLQPDGVLLSNGPGDPAVTSV----YSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 235 ~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~----~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) .+.++++.. ++|-+|+-++.|+|..... .....+++....+..|.|||-|--+||.| T Consensus 60 ~d~~~~~~~--~~D~livpg~~~~~~~~~~~~~~~l~~~L~~~~~~g~~i~sictGa~~LA~a 120 (195) T cd03138 60 PDATLADVP--APDLVIVPGLGGDPDELLLADNPALIAWLRRQHANGATVAAACTGVFLLAEA 120 (195) T ss_pred CCCCHHHCC--CCCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCEEECCCHHHHHHHHC T ss_conf 786844566--5688997576678630123369999999998665271784165899999973 No 191 >PRK11083 DNA-binding response regulator CreB; Provisional Probab=72.49 E-value=8 Score=19.44 Aligned_cols=91 Identities=19% Similarity=0.315 Sum_probs=40.4 Q ss_pred CEEEEEE--CCCCCHHHHHHHHHCCCCEEEECCCCH-HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEE Q ss_conf 1499996--796410232222202454133046574-4644135788998588888410112110122210257863478 Q gi|254780420|r 206 YHVVCID--YGIRSNLLRVLSSLGCRITIVGAETSY-KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFG 282 (396) Q Consensus 206 ~kIvvID--fGiK~nIlr~L~~~g~~v~Vvp~~~~~-~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilG 282 (396) .||+++| --+-..+...|...|++|..+..-..+ +.+...+||-|+|-=+- |...+......+++. ...+||+= T Consensus 4 ~kILiVEDd~~l~~~l~~~L~~~g~~v~~~~~~~~al~~~~~~~~DlvilDi~L--P~~~G~~l~~~iR~~-~~~~pII~ 80 (229) T PRK11083 4 PTILLVEDEQAIADTLVYALQSEGFTVEWFERGLPALDKLRQQPPDLVILDVGL--PDISGFELCRQLRAF-HPALPVIF 80 (229) T ss_pred CEEEEEECCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHCCCCEEEECCCC--CCCCHHHHHHHHHHH-CCCCEEEE T ss_conf 999999699999999999999889999998999999999971899899973889--998768899999970-89972999 Q ss_pred EECH----HHHHHHHCCCCCC Q ss_conf 6100----5887875285446 Q gi|254780420|r 283 ICLG----HQLLGLALGAQTV 299 (396) Q Consensus 283 IClG----hQ~l~~a~G~~~~ 299 (396) +.-- ..+.|+..||.-| T Consensus 81 lta~~~~~d~i~~l~~GADDY 101 (229) T PRK11083 81 LTARSDEVDRLVGLEIGADDY 101 (229) T ss_pred EECCCCHHHHHHHHHCCCCEE T ss_conf 836789899999997599877 No 192 >CHL00148 orf27 Ycf27; Reviewed Probab=71.87 E-value=5.8 Score=20.46 Aligned_cols=93 Identities=26% Similarity=0.330 Sum_probs=57.6 Q ss_pred CCCCEEEEEE--CCCCCHHHHHHHHHCCCCEEEECCCCH-HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCC Q ss_conf 7771499996--796410232222202454133046574-4644135788998588888410112110122210257863 Q gi|254780420|r 203 DAKYHVVCID--YGIRSNLLRVLSSLGCRITIVGAETSY-KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMP 279 (396) Q Consensus 203 ~~~~kIvvID--fGiK~nIlr~L~~~g~~v~Vvp~~~~~-~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iP 279 (396) +.|+||+++| --+-..|...|.+.|++|..+..-..+ +.+...+||-|+|-=+- |...+......+++ ...+| T Consensus 4 ~~k~kILiVEDd~~l~~~l~~~L~~~G~~v~~a~~g~~a~~~~~~~~~DlviLDi~L--P~~dG~~l~~~iR~--~~~~P 79 (240) T CHL00148 4 SSKEKILVVDDEASIRRILETRLSIIGYEVITASDGEEALKLFHQEQPDLVILDVMM--PKLDGYGVCQEIRK--ESDVP 79 (240) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHCCCCEEEEECCC--CCCCCCEEEECCCC--CCCCC T ss_conf 999829999398999999999999789999998999999999974799999997999--98886630541403--79954 Q ss_pred EEEEE----CHHHHHHHHCCCCCC Q ss_conf 47861----005887875285446 Q gi|254780420|r 280 MFGIC----LGHQLLGLALGAQTV 299 (396) Q Consensus 280 ilGIC----lGhQ~l~~a~G~~~~ 299 (396) |+=+. .-.++.|+.+||.-| T Consensus 80 II~LTa~~~~~d~v~gl~~GADDY 103 (240) T CHL00148 80 IIMLTALGDVSDRITGLELGADDY 103 (240) T ss_pred EEEEECCCCHHHHHHHHHCCCCEE T ss_conf 899816789899999997699789 No 193 >PRK02261 methylaspartate mutase subunit S; Provisional Probab=71.86 E-value=3.3 Score=22.14 Aligned_cols=70 Identities=19% Similarity=0.200 Sum_probs=43.2 Q ss_pred ECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHH----CCCCEEEECC--CCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 679641023222220245413304657446441----3578899858--888841011211012221025786347861 Q gi|254780420|r 212 DYGIRSNLLRVLSSLGCRITIVGAETSYKDIMS----LQPDGVLLSN--GPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 212 DfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~----~~pdgi~lSn--GPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) |-|.| =|-+.|...|++|+-...+.+.+++.+ .+.|.|.+|. |-... .+.. .....++.-...+|++.=. T Consensus 17 d~G~~-iva~~l~~~GfeVi~lG~~~~pe~~v~~A~~~~ad~igiSsl~G~~~~-~~~~-l~~~l~~~g~~di~vvvGG 92 (137) T PRK02261 17 AVGNK-ILDRALTEAGFEVINLGVMTSQEEFIDAAIETDADAILVSSLYGHGEI-DCRG-LREKCIEAGLGDILLYVGG 92 (137) T ss_pred HHHHH-HHHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCH-HHHH-HHHHHHHCCCCCCEEEECC T ss_conf 78899-999999978974984688799999999998739999997111126612-7999-9999996799999699836 No 194 >cd06289 PBP1_MalI_like Ligand-binding domain of MalI, a transcription regulator of the maltose system of Escherichia coli and its close homologs from other bacteria. This group includes the ligand-binding domain of MalI, a transcription regulator of the maltose system of Escherichia coli and its close homologs from other bacteria. They are members of the LacI-GalR family of repressor proteins which are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=71.39 E-value=8.1 Score=19.41 Aligned_cols=49 Identities=8% Similarity=0.037 Sum_probs=23.1 Q ss_pred CCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 6998289999889980872686799778983487887767001689999999999735 Q gi|254780420|r 336 LPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSPGPQDSYYLFSTFIDFMHKRK 393 (396) Q Consensus 336 lp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~~~k 393 (396) +|.++.+... |++..+ ..-.-++-+|+..|+.- +..-++-.+++|+..+ T Consensus 204 vp~disiigf---d~~~~~-~~~~p~ltti~~~~~~~-----g~~a~~~l~~~i~~~~ 252 (268) T cd06289 204 PGRDIAVVGF---DDVAEA-ALATPALTTVSTDPREI-----GRRAAELLLRRIADPD 252 (268) T ss_pred CCCCEEEEEE---CCHHHH-HHCCCCEEEEEECHHHH-----HHHHHHHHHHHHCCCC T ss_conf 9985499995---787899-72289716998499999-----9999999999976989 No 195 >cd06314 PBP1_tmGBP Periplasmic sugar-binding domain of Thermotoga maritima glucose-binding protein (tmGBP) and its close homologs. Periplasmic sugar-binding domain of Thermotoga maritima glucose-binding protein (tmGBP) and its close homologs from other bacteria. They are a member of the type I periplasmic binding protein superfamily which consists of two domains connected by a three-stranded hinge. TmGBP is specific for glucose and its binding pocket is buried at the interface of the two domains. TmGBP also exhibits high thermostability and the highest structural similarity to E. coli glucose binding protein (ecGBP). Probab=71.31 E-value=6.8 Score=19.96 Aligned_cols=53 Identities=11% Similarity=0.058 Sum_probs=29.7 Q ss_pred HHHHHCCC--CEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHHHHHCCCC Q ss_conf 46441357--88998588888410112110122210-257863478610058878752854 Q gi|254780420|r 240 KDIMSLQP--DGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLLGLALGAQ 297 (396) Q Consensus 240 ~~i~~~~p--dgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l~~a~G~~ 297 (396) ++++.-+| |+||-.|....+. +...++.. ..+++.|.|+.+..+.+..---+. T Consensus 171 ~~~L~~~pdi~~i~~~~~~~a~g-----a~~Al~~~g~~~~i~vvg~D~~~~~~~~i~~G~ 226 (271) T cd06314 171 EDALNAHPDLKCMFGLYAYNGPA-----IAEAVKAAGKLGKVKIVGFDEDPDTLQGVKEGT 226 (271) T ss_pred HHHHHHCCCCCEEEEECCCCHHH-----HHHHHHHHCCCCCCEEEEECCCHHHHHHHHCCC T ss_conf 99997589987899808851699-----999999847589967998789899999998299 No 196 >PRK13856 two-component response regulator VirG; Provisional Probab=70.69 E-value=9 Score=19.09 Aligned_cols=90 Identities=21% Similarity=0.308 Sum_probs=47.4 Q ss_pred CEEEEEEC--CCCCHHHHHHHHHCCCCEEEECCCCHHHH-HHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEE Q ss_conf 14999967--96410232222202454133046574464-4135788998588888410112110122210257863478 Q gi|254780420|r 206 YHVVCIDY--GIRSNLLRVLSSLGCRITIVGAETSYKDI-MSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFG 282 (396) Q Consensus 206 ~kIvvIDf--GiK~nIlr~L~~~g~~v~Vvp~~~~~~~i-~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilG 282 (396) +||+++|= -+-..+..+|...|++|.....-..+.+. ....||.++|--+..|- .+......++. ....|++= T Consensus 2 ~~ILlVEDD~~l~~~l~~~L~~~g~~v~~a~~~~~~~~~l~~~~~DlvIlDi~lp~~--~Gl~~~~~ir~--~~~~piii 77 (241) T PRK13856 2 KHVLVIDDDVAMRHLIVEYLTIHAFKVTAVADSQQFNRVLSSETVDVVVVDLNLGRE--DGLEIVRSLAT--KSDVPIII 77 (241) T ss_pred CEEEEEECCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCCEEEEECCCCCC--CCHHHHHHHCC--CCCCCEEE T ss_conf 989999698999999999998779999998999999999865999999996999876--61345556403--69973699 Q ss_pred EEC-----HHHHHHHHCCCCCC Q ss_conf 610-----05887875285446 Q gi|254780420|r 283 ICL-----GHQLLGLALGAQTV 299 (396) Q Consensus 283 ICl-----GhQ~l~~a~G~~~~ 299 (396) +.- ..++.|+.+||.-| T Consensus 78 lt~~~~~~~d~i~~l~~GADDY 99 (241) T PRK13856 78 ISGDRLEEADKVVALELGATDF 99 (241) T ss_pred EECCCCCHHHHHHHHHCCCCEE T ss_conf 9726787899999987699889 No 197 >PRK11780 isoprenoid biosynthesis protein with amidotransferase-like domain; Provisional Probab=70.51 E-value=6.2 Score=20.22 Aligned_cols=86 Identities=21% Similarity=0.265 Sum_probs=54.6 Q ss_pred CCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCC------------CCCHHHHHHHCCCCCCEE Q ss_conf 96410232222202454133046574464413578899858888841011------------211012221025786347 Q gi|254780420|r 214 GIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTS------------VYSSPIICKLVDSGMPMF 281 (396) Q Consensus 214 GiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~------------~~~~~~~~~~~~~~iPil 281 (396) +-++|+|-+=.+. ++=.| .++.++...++|++++-+|=|--.... ......++.+.+.++||= T Consensus 58 ~e~RNVLvESARI-aRG~I----~~l~~~~~~d~DaLI~PGGfGaAKNLs~fA~~G~~~~V~p~V~~~i~~f~~a~KPiG 132 (217) T PRK11780 58 AETRNVLVESARI-ARGEI----KDLAEADAEDFDALIVPGGFGAAKNLSNFAFKGAECSVDPEVKALARAFHQAGKPIG 132 (217) T ss_pred CHHHHHHHHHHHH-HCCCC----CCHHHCCHHHCCEEEECCCCHHHHCCCCHHHCCCCCEECHHHHHHHHHHHHHCCCEE T ss_conf 3247789999887-61586----674569975678799678620443110165349986487899999999998099989 Q ss_pred EEECHHHHHHHHCCCCCCCCCCCC Q ss_conf 861005887875285446677675 Q gi|254780420|r 282 GICLGHQLLGLALGAQTVKMYQGH 305 (396) Q Consensus 282 GIClGhQ~l~~a~G~~~~kl~~Gh 305 (396) -||.---+++..+|.. .|+..|+ T Consensus 133 ~iCIaP~l~a~vl~~~-~~~TiG~ 155 (217) T PRK11780 133 FICIAPAMLPKILGEG-VKLTIGN 155 (217) T ss_pred EECHHHHHHHHHCCCC-CEEECCC T ss_conf 8607689999972888-4896578 No 198 >PRK01368 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=70.19 E-value=9.2 Score=19.02 Aligned_cols=27 Identities=19% Similarity=0.368 Sum_probs=14.0 Q ss_pred HHHCCCCEECCCC-----HHHHHHHHHHCCCC Q ss_conf 9742783211257-----68999999853984 Q gi|254780420|r 115 LKSCGIIGLSGID-----TRALTVWIRDYGVS 141 (396) Q Consensus 115 L~~~~IpgI~GID-----TRaLTk~IRe~G~~ 141 (396) ++..++.||+|=. |..|..+|+..|-- T Consensus 101 ~~~~~~IaITGTnGKTTTt~li~~iL~~~g~~ 132 (450) T PRK01368 101 SKNLKFIAITGTNGKSTTTALISHILNSNGLD 132 (450) T ss_pred CCCCCEEEEECCCCCHHHHHHHHHHHHHCCCC T ss_conf 67997799968999748999999999975996 No 199 >cd06282 PBP1_GntR_like_2 Ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding Probab=70.12 E-value=9.3 Score=19.01 Aligned_cols=25 Identities=8% Similarity=0.082 Sum_probs=9.6 Q ss_pred CEEEECCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 778983487887767001689999999999 Q gi|254780420|r 361 PVFSVQYHPESSPGPQDSYYLFSTFIDFMH 390 (396) Q Consensus 361 ~~~~vQfHPEa~PGP~D~~~~F~~F~~~i~ 390 (396) ++-+|..+++.- +..-++..+++|+ T Consensus 223 ~lTti~q~~~~~-----g~~A~~~l~~~i~ 247 (266) T cd06282 223 PLATVAQPNRDI-----GRQAVQRLLARIA 247 (266) T ss_pred CCEEEECCHHHH-----HHHHHHHHHHHHC T ss_conf 967998699999-----9999999999977 No 200 >KOG1273 consensus Probab=70.04 E-value=9.3 Score=18.99 Aligned_cols=52 Identities=15% Similarity=0.132 Sum_probs=31.4 Q ss_pred CCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCC Q ss_conf 4654434552417989999742783211257689999998539842366327654 Q gi|254780420|r 97 AEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDG 151 (396) Q Consensus 97 ~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~ 151 (396) ..|...|.|..-..+. -.+|-..|++.|||.+.|.|-.+=-.-..++-.+++ T Consensus 26 a~~~~Fs~~G~~lAvG---c~nG~vvI~D~~T~~iar~lsaH~~pi~sl~WS~dg 77 (405) T KOG1273 26 AECCQFSRWGDYLAVG---CANGRVVIYDFDTFRIARMLSAHVRPITSLCWSRDG 77 (405) T ss_pred CCEEEECCCCCEEEEE---CCCCCEEEEECCCCCHHHHHHCCCCCEEEEEECCCC T ss_conf 3447763675536663---268728999711323666643033323677764778 No 201 >cd02922 FCB2_FMN Flavocytochrome b2 (FCB2) FMN-binding domain. FCB2 (AKA L-lactate:cytochrome c oxidoreductase) is a respiratory enzyme located in the intermembrane space of fungal mitochondria which catalyzes the oxidation of L-lactate to pyruvate. FCB2 also participates in a short electron-transport chain involving cytochrome c and cytochrome oxidase which ultimately directs the reducing equivalents gained from L-lactate oxidation to oxygen, yielding one molecule of ATP for every L-lactate molecule consumed. FCB2 is composed of 2 domains: a C-terminal flavin-binding domain, which includes the active site for lacate oxidation, and an N-terminal b2-cytochrome domain, required for efficient cytochrome c reduction. FCB2 is a homotetramer and contains two noncovalently bound cofactors, FMN and heme per subunit. Probab=69.42 E-value=6.6 Score=20.05 Aligned_cols=86 Identities=27% Similarity=0.383 Sum_probs=52.4 Q ss_pred EECCCCCHHHHHHHHHCCCC-EEEECCCCHHHHH---HCCCCEEEECCCCCC-----CCCCCCCCHHHHHH---HCCCCC Q ss_conf 96796410232222202454-1330465744644---135788998588888-----41011211012221---025786 Q gi|254780420|r 211 IDYGIRSNLLRVLSSLGCRI-TIVGAETSYKDIM---SLQPDGVLLSNGPGD-----PAVTSVYSSPIICK---LVDSGM 278 (396) Q Consensus 211 IDfGiK~nIlr~L~~~g~~v-~Vvp~~~~~~~i~---~~~pdgi~lSnGPGd-----P~~~~~~~~~~~~~---~~~~~i 278 (396) +|-..-+.-++.|++. ... .|+--=.+.+|-. ..--|||++||==|. |+..+ ..+.+.+ ...+++ T Consensus 196 ~~~~~tw~di~~lr~~-~~~plivKGIl~~~DA~~A~~~G~dgIiVSNHGGRqLD~~~~~i~--~Lp~I~~~~~av~~~~ 272 (344) T cd02922 196 IDPTLTWDDIKWLRKH-TKLPIVLKGVQTVEDAVLAAEYGVDGIVLSNHGGRQLDTAPAPIE--VLLEIRKHCPEVFDKI 272 (344) T ss_pred CCCCCCHHHHHHHHHH-CCCCEEECCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCCCCHHH--HHHHHHHHHHHHCCCE T ss_conf 4888999999999986-699701002577999999996599889971886212578831899--9899999889858870 Q ss_pred CEE---EEECHHHHH-HHHCCCCCC Q ss_conf 347---861005887-875285446 Q gi|254780420|r 279 PMF---GICLGHQLL-GLALGAQTV 299 (396) Q Consensus 279 Pil---GIClGhQ~l-~~a~G~~~~ 299 (396) ||+ ||--|.-++ |+|+|++-. T Consensus 273 ~v~~DgGiR~G~DV~KAlALGA~aV 297 (344) T cd02922 273 EVYVDGGVRRGTDVLKALCLGAKAV 297 (344) T ss_pred EEEECCCCCCHHHHHHHHHCCCCEE T ss_conf 8997188575789999997699989 No 202 >cd06308 PBP1_sensor_kinase_like Periplasmic binding domain of two-component sensor kinase signaling systems. Periplasmic binding domain of two-component sensor kinase signaling systems, some of which are fused with a C-terminal histidine kinase A domain (HisK) and/or a signal receiver domain (REC). Members of this group share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily and are predicted to be involved in sensing of environmental stimuli; their substrate specificities, however, are not known in detail. Probab=68.82 E-value=7.9 Score=19.50 Aligned_cols=34 Identities=21% Similarity=0.209 Sum_probs=14.7 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEE Q ss_conf 5788998588888410112110122210-25786347861 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGIC 284 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIC 284 (396) ++|+||-.| |..... +...+++. ..+++.|.|+. T Consensus 182 di~aI~a~n---D~~A~G--a~~Al~~aG~~~~i~vvg~D 216 (270) T cd06308 182 DIDLVYAHN---DPMALG--AYLAAKRAGREKEIKFIGID 216 (270) T ss_pred CCCEEEECC---CHHHHH--HHHHHHHCCCCCCCEEEEEC T ss_conf 888899778---658999--99999976999995899888 No 203 >cd06306 PBP1_TorT-like TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. The Tor respiratory system is consists of three proteins (TorC, TorA, and TorD) and is induced in the presence of TMAO. The TMAO control is tightly regulated by three proteins: TorS, TorT, and TorR. Thus, the disruption of any of these proteins can abolish the Tor respiratory induction. TorT shares homology with the sugar-binding domain of the type I periplasmic binding proteins. The members of TorT-like family bind TMAO or related compounds and are predicted to be involved in signal transduction and/or substrate transport. Probab=68.67 E-value=5.3 Score=20.72 Aligned_cols=17 Identities=12% Similarity=0.282 Sum_probs=7.9 Q ss_pred CCCCCEEEEECHHHHHH Q ss_conf 57863478610058878 Q gi|254780420|r 275 DSGMPMFGICLGHQLLG 291 (396) Q Consensus 275 ~~~iPilGIClGhQ~l~ 291 (396) .+++.|.|.-.--+.+. T Consensus 208 ~~~v~vvg~d~~~~~~~ 224 (268) T cd06306 208 TDQIKIVSTYLSHAVYR 224 (268) T ss_pred CCCEEEEEECCCHHHHH T ss_conf 99759999369989999 No 204 >cd01140 FatB Siderophore binding protein FatB. These proteins have been shown to function as ABC-type initial receptors in the siderophore-mediated iron uptake in some eubacterial species. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains. Probab=68.62 E-value=9.9 Score=18.79 Aligned_cols=20 Identities=20% Similarity=0.418 Sum_probs=16.5 Q ss_pred CCCHHHHHHCCCCEEEECCC Q ss_conf 65744644135788998588 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNG 255 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnG 255 (396) ..+.|.+.+++||-||+.+. T Consensus 193 ~is~E~l~~~dPDvI~v~~~ 212 (270) T cd01140 193 PVSFEYILEANPDWLFVIDR 212 (270) T ss_pred CCCHHHHHHHCCCEEEEEEC T ss_conf 14787898739999999606 No 205 >cd06321 PBP1_ABC_sugar_binding_like_11 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consist of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=68.40 E-value=7.4 Score=19.68 Aligned_cols=34 Identities=15% Similarity=0.198 Sum_probs=15.8 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 578899858888841011211012221025786347861 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) ++|+||-.|- +.... +...+++.-..++.+.|+. T Consensus 181 ~~~aifa~nD---~~A~G--a~~a~~~~G~~~i~ivg~d 214 (271) T cd06321 181 KLDGVFAIND---PTAIG--ADLAAKQAGRNDIKITSVD 214 (271) T ss_pred CCCEEEECCC---HHHHH--HHHHHHHCCCCCCEEEEEE T ss_conf 6778997786---89999--9999997597999899996 No 206 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=67.82 E-value=10 Score=18.69 Aligned_cols=65 Identities=17% Similarity=0.236 Sum_probs=33.3 Q ss_pred CCCCCCCCCCHHHHCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEEC-CCCH-HHHHHHHHHCCCCCE Q ss_conf 89736845882342046533321023666404654434552417989999742783211-2576-899999985398423 Q gi|254780420|r 66 FPHIGNVGVNSEDFESISRKNFKGAVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLS-GIDT-RALTVWIRDYGVSNS 143 (396) Q Consensus 66 ~P~IGNyGvn~~d~ES~~~~~~~~~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~-GIDT-RaLTk~IRe~G~~~a 143 (396) .|-+|++.-+ ++|..-. -+| -+|+... .....+.+-|++.+||... ..++ -.+-+.||.-|...+ T Consensus 73 v~~vG~~~~p--nlE~I~a-LkP---DLIi~~~-------~~~~~~~~~l~~~~i~v~~~~~~~~~~i~~~i~~lg~i~g 139 (289) T TIGR03659 73 VPEVGNPMSP--DMEKIKS-LKP---TVVLSVT-------TLEEDLGPKFKQLGVEATFLNLTSVDGMKKSITELGEKYG 139 (289) T ss_pred CCCCCCCCCC--CHHHHHH-CCC---CEEEECC-------CCHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHHHC T ss_conf 5721887899--9999973-699---7899557-------4258789999860975999837899999999999999838 No 207 >PRK10653 D-ribose transporter subunit RbsB; Provisional Probab=67.07 E-value=9.2 Score=19.03 Aligned_cols=34 Identities=15% Similarity=0.128 Sum_probs=15.5 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 578899858888841011211012221025786347861 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) +||+||-.| |..... +...++..-..++.|.|.- T Consensus 207 ~~~aIf~~n---D~~A~G--a~~al~~~G~~di~vvGfD 240 (295) T PRK10653 207 DVQAVFAQN---DEMALG--ALRALQTAGKSDVMVVGFD 240 (295) T ss_pred CCCEEEECC---CHHHHH--HHHHHHHCCCCCCEEEEEC T ss_conf 977899779---779999--9999998099983999289 No 208 >cd01537 PBP1_Repressors_Sugar_Binding_like Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems. Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems, all of which contain the type I periplasmic binding protein-like fold. Their specific ligands include lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor; in general the sugar binding domain in this family binds a sugar, which in turn changes the DNA binding activity of the repressor domain. The core structure of the periplasmic binding proteins is classified into two types and they differ in number and order of beta strands in each domain: type I, which has six beta strands, and type II, which has five beta strands. These two distinct structural arrangem Probab=67.02 E-value=9.6 Score=18.88 Aligned_cols=55 Identities=15% Similarity=0.096 Sum_probs=26.4 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCCCEEC---CCCH---HHHHHHHHHCCCCCEEEECC Q ss_conf 666404654434552417989999742783211---2576---89999998539842366327 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGIIGLS---GIDT---RALTVWIRDYGVSNSVIAHH 148 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~---GIDT---RaLTk~IRe~G~~~a~I~~~ 148 (396) |+|+.+. ..|..-+-...+.++++++|.-.+. .-|- +...+.+..++ .-|.|... T Consensus 3 gvi~p~~-~~~f~~~~~~gi~~~a~~~gy~~~~~~~~~~~~~~~~~i~~~~~~~-vDgiIi~~ 63 (264) T cd01537 3 GVLVPDL-DNPFFAQVLKGIEEAAKAAGYQVLLANSQNDAEKQLSALENLIARG-VDGIIIAP 63 (264) T ss_pred EEEECCC-CCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHCC-CCEEEEEC T ss_conf 9997889-7799999999999999986998999979999899999999999769-99999967 No 209 >TIGR01383 not_thiJ DJ-1 family protein; InterPro: IPR006287 These sequences represent the DJ-1 clade of the so-called ThiJ/PfpI family of proteins. PfpI, represented by IPR006286 from INTERPRO, is a putative intracellular cysteine protease. DJ-1 is described as an oncogene that acts cooperatively with H-Ras . Many members of the DJ-1 clade are annotated (apparently incorrectly) as ThiJ (thiamine pyridinylase), a protein of thiamine biosynthesis. However, published reports of ThiJ activity and identification of a ThiJ/ThiD bifunctional protein describe an unrelated locus mapping near ThiM, rather than the DJ-1 homolog of Escherichia coli. The ThiJ designation for this family may be spurious; the cited paper refers to a locus near thiD and thiM in E. coli, unlike the gene represented here. Current public annotation reflects ThiJ/ThiD bifunctional activity, apparently a property of ThiD and not of this locus. This group of proteins are classified as either DJ-1 putative peptidases or non-peptidase homologs in MEROPS peptidase family C56 (clan PC(C)). . Probab=66.89 E-value=5.2 Score=20.78 Aligned_cols=89 Identities=22% Similarity=0.241 Sum_probs=53.4 Q ss_pred CCC--CCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCC-CCCCC-CCCCCCHHHHHHHCC Q ss_conf 677--77714999967964102322222024541330465744644135788998588-88841-011211012221025 Q gi|254780420|r 200 KSS--DAKYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNG-PGDPA-VTSVYSSPIICKLVD 275 (396) Q Consensus 200 ~~~--~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnG-PGdP~-~~~~~~~~~~~~~~~ 275 (396) +++ +.+-.++++++=-+. |.=.+++=+.+-+|.+++++..-++|.|+|-+| ||--+ .-+....++++++.+ T Consensus 23 rRaelG~~Vt~~~v~~~~~~-----l~V~gs~gv~~~AD~~l~d~~~~~fD~ivLPGG~pGa~nL~~S~~l~~~lk~~~~ 97 (186) T TIGR01383 23 RRAELGIKVTVAIVGLNGSG-----LLVKGSRGVKILADASLEDVDDEEFDLIVLPGGMPGAENLRDSKLLENLLKKQES 97 (186) T ss_pred HHCCCCCEEEEEEEEECCCC-----CEEEEECCCEECCCCCHHHCCCCCCCEEEECCCCHHHHHHHCCHHHHHHHHHHHH T ss_conf 21135870799997425897-----6046535627734871412577776778936982266642025789999999997 Q ss_pred CCCCEEEEECHHHHHHHH Q ss_conf 786347861005887875 Q gi|254780420|r 276 SGMPMFGICLGHQLLGLA 293 (396) Q Consensus 276 ~~iPilGIClGhQ~l~~a 293 (396) .+.=|-.||=.==++..+ T Consensus 98 ~gkl~AAICAaP~vll~~ 115 (186) T TIGR01383 98 KGKLVAAICAAPIVLLAA 115 (186) T ss_pred CCCCEEEEHHHHHHHHHC T ss_conf 499388420458986302 No 210 >PRK09393 ftrA transcriptional activator FtrA; Provisional Probab=66.75 E-value=11 Score=18.54 Aligned_cols=57 Identities=12% Similarity=0.102 Sum_probs=35.2 Q ss_pred CCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 65744644135788998588888410112110122210257863478610058878752 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) +.++++.. ..|-||+..|.+-....+.....-+++.......|-|||-|--+||.|- T Consensus 65 d~~~~~~~--~~D~lvV~gg~~~~~~~~~~l~~wLr~~~~~g~~l~~vctGaf~LA~aG 121 (320) T PRK09393 65 DGGLELLD--RADTIVIPGWRDPDEPVPEPLLKALRAAHARGARLCSICSGVFVLAAAG 121 (320) T ss_pred CCCHHHCC--CCCEEEECCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECHHHHHHHHCC T ss_conf 87765657--6888998787772446998999999999875998999829999999738 No 211 >COG1214 Inactive homolog of metal-dependent proteases, putative molecular chaperone [Posttranslational modification, protein turnover, chaperones] Probab=66.23 E-value=7.6 Score=19.62 Aligned_cols=68 Identities=19% Similarity=0.199 Sum_probs=34.5 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCCCCCCCCCCC-EECCCCEEEE Q ss_conf 5788998588888410112110122210257863478610058878752854466776752101123-4778970999 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQGHHGVNHPV-KNISNGTVEI 322 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv-~~~~t~kv~i 322 (396) +-|+|..+.|||.=.-.-..+.-..--.+..++|++|||= |-++|.+... +. +..-+| .|-.-+++|. T Consensus 58 dld~Iav~~GPGSFTGlRIG~~~AkgLA~~l~iplvgvss---L~~~A~~~~~---~~---~~~v~v~idArr~~vY~ 126 (220) T COG1214 58 DLDAIAVAKGPGSFTGLRIGVAFAKGLALALNIPLVGVSS---LEALAQGGAE---KN---AGFVLVAIDARRGEVYW 126 (220) T ss_pred HCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEECH---HHHHHHHHHC---CC---CCCEEEEEECCCCEEEE T ss_conf 8488999308974223889999999998752986899576---9999863201---57---87437788426764785 No 212 >PRK11625 Rho-binding antiterminator; Provisional Probab=66.07 E-value=11 Score=18.57 Aligned_cols=55 Identities=25% Similarity=0.408 Sum_probs=33.5 Q ss_pred EEEEECCCCEEEEEEECCCCEEEEE-EEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCCC Q ss_conf 6899728987999975676248888-985378888760141614256173216897368458 Q gi|254780420|r 14 AVLVLADGSVIEGMGCGATGSIQAE-ICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVGV 74 (396) Q Consensus 14 A~L~LedGt~f~G~~fG~~~~~~GE-vVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyGv 74 (396) -.|.|.||+.+.|++.--....--| +|.- .--+-||.= ..+|+.||.|.||.+=| T Consensus 26 L~leL~dGe~~~gkA~dl~trkkvEyLvie-~~g~~q~lR-----LD~I~sfshpe~G~vvv 81 (84) T PRK11625 26 LTLELKDGEKLQAKASDLVSRKNVEYLVVE-AAGETRELR-----LDKIASFSHPEIGTVVV 81 (84) T ss_pred EEEEECCCCEEEEEEEEEEECCCCEEEEEE-ECCCEEEEE-----EEEEEECCCCCCCEEEE T ss_conf 999994898999997366861671799998-148644677-----21453127877755996 No 213 >PRK08227 aldolase; Validated Probab=65.39 E-value=7.1 Score=19.84 Aligned_cols=16 Identities=19% Similarity=0.173 Sum_probs=7.2 Q ss_pred CHHHHHHCCCCEEEEC Q ss_conf 7446441357889985 Q gi|254780420|r 238 SYKDIMSLQPDGVLLS 253 (396) Q Consensus 238 ~~~~i~~~~pdgi~lS 253 (396) +.+|-+.+.-|+|-.. T Consensus 126 sVeeAvrlGAdAVsv~ 141 (291) T PRK08227 126 DMEDAVRLNVCAVAAQ 141 (291) T ss_pred CHHHHHHCCCCEEEEE T ss_conf 4999986799789998 No 214 >TIGR01316 gltA glutamate synthase (NADPH), homotetrameric; InterPro: IPR006004 One pathway for the assimilation of ammonia and glutamate biosynthesis involves homotetrameric glutamate synthase (1.4.1.13 from EC) which transfers the amide group of glutamine to 2-oxoglutarate to yield two molecules of glutamate .2 L-glutamate + NADP^+ = L-glutamine + 2-oxoglutarate + NADPH + H^+. . Probab=65.22 E-value=5.4 Score=20.63 Aligned_cols=48 Identities=21% Similarity=0.504 Sum_probs=33.5 Q ss_pred CCCHHH----HHHHHHCCCC---EEEECCCCHHHHHHCC-CCEEEECCCCCCCCCC Q ss_conf 641023----2222202454---1330465744644135-7889985888884101 Q gi|254780420|r 215 IRSNLL----RVLSSLGCRI---TIVGAETSYKDIMSLQ-PDGVLLSNGPGDPAVT 262 (396) Q Consensus 215 iK~nIl----r~L~~~g~~v---~Vvp~~~~~~~i~~~~-pdgi~lSnGPGdP~~~ 262 (396) .-++|+ +.|.++|++. .++--+.+++++++.. +|+|||.-|-|.|.-. T Consensus 190 LpKei~~~E~k~LkklGv~fr~~~lvGkt~TL~eL~~~YGfDAVFIgtGAG~pkl~ 245 (462) T TIGR01316 190 LPKEIVETEVKKLKKLGVKFRTDYLVGKTVTLEELLEKYGFDAVFIGTGAGLPKLL 245 (462) T ss_pred CCHHHHHHHHHHHHHCCEEEEECCEECCCHHHHHHHHHCCCCEEEEEECCCCCEEC T ss_conf 87578898888763266379944375051128888875197079995068987001 No 215 >cd06300 PBP1_ABC_sugar_binding_like_1 Periplasmic sugar-binding component of uncharacterized ABC-type transport systems that are members of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily. Periplasmic sugar-binding component of uncharacterized ABC-type transport systems that are members of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes; however their substrate specificity is not known in detail. Probab=65.04 E-value=8.7 Score=19.19 Aligned_cols=23 Identities=17% Similarity=0.131 Sum_probs=13.3 Q ss_pred CCCEECCCCHHHHHH---HHHHCCCC Q ss_conf 783211257689999---99853984 Q gi|254780420|r 119 GIIGLSGIDTRALTV---WIRDYGVS 141 (396) Q Consensus 119 ~IpgI~GIDTRaLTk---~IRe~G~~ 141 (396) +...|+-+|+.+++. +.++.|.. T Consensus 62 D~Iiv~p~d~~a~~~~l~~A~~aGIp 87 (272) T cd06300 62 DAIIINPASPTALNPVIEEACEAGIP 87 (272) T ss_pred CEEEEECCCHHHHHHHHHHHHHCCCE T ss_conf 99999789767889999999985996 No 216 >cd01539 PBP1_GGBP Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. GGBP is a member of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic GGBP is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=64.63 E-value=9.8 Score=18.83 Aligned_cols=88 Identities=17% Similarity=0.232 Sum_probs=43.5 Q ss_pred CCCCCCCEEEEEE--CCCCCHHH------HHHHHHCCCCEEE---ECCCCH-------HHHHHC---CCCEEEECCCCCC Q ss_conf 6777771499996--79641023------2222202454133---046574-------464413---5788998588888 Q gi|254780420|r 200 KSSDAKYHVVCID--YGIRSNLL------RVLSSLGCRITIV---GAETSY-------KDIMSL---QPDGVLLSNGPGD 258 (396) Q Consensus 200 ~~~~~~~kIvvID--fGiK~nIl------r~L~~~g~~v~Vv---p~~~~~-------~~i~~~---~pdgi~lSnGPGd 258 (396) ++++++..+++|- -|....+. ..|.+.+.++.++ +.+++. ++++.- +.|+||-.| | T Consensus 134 ~~g~gki~~vil~G~~g~~~~~~R~~Gf~~~l~~~~~~~~iv~~~~~~w~~~~a~~~~e~~L~~~~d~id~I~a~N---D 210 (303) T cd01539 134 KNGDGIIQYVMLKGEPGHPDAIARTKYSIETLNDAGIKTEELASDTANWDRAQAKDKMDALLLKYGDKIEAVIANN---D 210 (303) T ss_pred HCCCCCEEEEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHHCCCCEEEEEECC---C T ss_conf 1688845799996899997899999869999985498015777871687889999999999985899735999789---4 Q ss_pred CCCCCCCCHHHHHHH------CCCCCCEEEEECHHHHHHH Q ss_conf 410112110122210------2578634786100588787 Q gi|254780420|r 259 PAVTSVYSSPIICKL------VDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 259 P~~~~~~~~~~~~~~------~~~~iPilGIClGhQ~l~~ 292 (396) +.... +...++.. ..++++|.|+=-.-+.+.. T Consensus 211 ~mAlG--A~~Al~~aG~~~~~~~~~I~VvG~Dg~~~~l~a 248 (303) T cd01539 211 AMALG--AIEALQKYGYNKGDKSKNIPVVGVDALPEALEL 248 (303) T ss_pred HHHHH--HHHHHHHCCCCCCCCCCCCEEEEECCCHHHHHH T ss_conf 77999--999999759876667888679986799999999 No 217 >cd01540 PBP1_arabinose_binding Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. ABP is only involved in transport contrary to other related sugar-binding proteins such as the glucose/galactose-binding protein (GGBP) and the ribose-binding protein (RBP), both of which are involved in chemotaxis as well as transport. The periplasmic ABP consists of two alpha/beta globular domains connected by a three-stranded hinge, a Venus flytrap-like domain, which undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, ABP is homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR) and DNA-binding transcriptional repressors such a Probab=64.55 E-value=11 Score=18.43 Aligned_cols=47 Identities=11% Similarity=0.082 Sum_probs=19.5 Q ss_pred HHHHHCCCCE---EEECCCCCCCCCCCCCCHHHHHHH--CCCCCCEEEEECHHHHH Q ss_conf 4644135788---998588888410112110122210--25786347861005887 Q gi|254780420|r 240 KDIMSLQPDG---VLLSNGPGDPAVTSVYSSPIICKL--VDSGMPMFGICLGHQLL 290 (396) Q Consensus 240 ~~i~~~~pdg---i~lSnGPGdP~~~~~~~~~~~~~~--~~~~iPilGIClGhQ~l 290 (396) ++++.-+||. +|++ -.|+.... +...++.. -..++-+.||+=--..+ T Consensus 184 ~~~L~a~pdi~~~~i~a--~ND~mAlG--A~~Al~~aGl~~~~v~vvgidg~~~~~ 235 (289) T cd01540 184 ASTLTKNPNVKNWIIYG--LNDETVLG--AVRATEQSGIAAADVIGVGINGSDAAD 235 (289) T ss_pred HHHHHHCCCCCCEEEEE--CCCHHHHH--HHHHHHHCCCCCCCEEEEEEECCCHHH T ss_conf 99997589988439997--79668899--999999749988976999972882599 No 218 >cd00958 DhnA Class I fructose-1,6-bisphosphate (FBP) aldolases of the archaeal type (DhnA homologs) found in bacteria and archaea. Catalysis of the enzymes proceeds via a Schiff-base mechanism like other class I aldolases, although this subfamily is clearly divergent based on sequence similarity to other class I and class II (metal dependent) aldolase subfamilies. Probab=64.41 E-value=8.2 Score=19.37 Aligned_cols=68 Identities=18% Similarity=0.368 Sum_probs=43.9 Q ss_pred HHHHHHHCCCCEEEECCCC---HHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHH Q ss_conf 3222220245413304657---44644135788998588888410112110122210257863478610058878 Q gi|254780420|r 220 LRVLSSLGCRITIVGAETS---YKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLG 291 (396) Q Consensus 220 lr~L~~~g~~v~Vvp~~~~---~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~ 291 (396) .|.=.++|++++=++|..+ +.++.+-.|-.|+++.||+..... .....+...++.+ --|+|+|-.+.- T Consensus 149 aRia~ELGADiVKv~y~g~~e~f~~vv~~~~vPVviaGG~k~~~~~--~~L~~v~~a~~aG--a~Gv~~GRNvfQ 219 (235) T cd00958 149 ARIGAELGADIVKTKYTGDAESFKEVVEGCPVPVVIAGGPKKDSEE--EFLKMVYDAMEAG--AAGVAVGRNIFQ 219 (235) T ss_pred HHHHHHHCCCEEEECCCCCHHHHHHHHHHCCCCEEECCCCCCCCHH--HHHHHHHHHHHCC--CCEEEECCHHHC T ss_conf 9999997899898239999899999997089989980899999999--9999999999769--956763641225 No 219 >pfam09897 DUF2124 Uncharacterized protein conserved in archaea (DUF2124). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=64.19 E-value=6.4 Score=20.13 Aligned_cols=58 Identities=22% Similarity=0.265 Sum_probs=32.1 Q ss_pred HHHHHCCCCEEEECCCCCCCC---CCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHCCCCCCCCCC Q ss_conf 464413578899858888841---0112110122210257863478610058878752854466776 Q gi|254780420|r 240 KDIMSLQPDGVLLSNGPGDPA---VTSVYSSPIICKLVDSGMPMFGICLGHQLLGLALGAQTVKMYQ 303 (396) Q Consensus 240 ~~i~~~~pdgi~lSnGPGdP~---~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~~ 303 (396) ++....+||.++|=+|=--|. ..+ .+.+++.++.. ..|.|||+ |-+-...|. ..|+.| T Consensus 74 ~~~~~~~~D~lVlmGGLAMP~~~vt~e-~v~~li~k~~~--~kviGvCF--ms~F~kagW-~~~IdF 134 (147) T pfam09897 74 GENTDLNPDVLVLLGGLAMPKSGVTPE-DVKELIEKLNP--KKVIGVCF--MSMFEKAGW-DDKIDF 134 (147) T ss_pred CCCCCCCCCEEEEECCCCCCCCCCCHH-HHHHHHHHCCC--CCEEEEEH--HHHHHHCCC-CCCCCC T ss_conf 776789998999967655799998999-99999996499--87899974--998987699-876675 No 220 >PRK10957 iron-enterobactin transporter periplasmic binding protein; Provisional Probab=64.19 E-value=12 Score=18.21 Aligned_cols=22 Identities=23% Similarity=0.400 Sum_probs=16.0 Q ss_pred CCHHHHHHCCCCEEEECCCCCC Q ss_conf 5744644135788998588888 Q gi|254780420|r 237 TSYKDIMSLQPDGVLLSNGPGD 258 (396) Q Consensus 237 ~~~~~i~~~~pdgi~lSnGPGd 258 (396) -++|.|.+++||-|+.|.-.+| T Consensus 106 pnlE~I~~l~PDLIi~~~~~~~ 127 (319) T PRK10957 106 PDAEAVAAQMPDLIVISATGGD 127 (319) T ss_pred CCHHHHHHCCCCEEEEECCCCC T ss_conf 9999996079999998176650 No 221 >cd06320 PBP1_allose_binding Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. The members of this group are belonging to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Like other periplasmic receptors of the ABC-type transport systems, the allose-binding protein consists of two alpha/beta domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Probab=64.07 E-value=12 Score=18.34 Aligned_cols=45 Identities=7% Similarity=-0.008 Sum_probs=21.4 Q ss_pred HHHHCC--CCEEEECCCCCCCCCCCCCCHHHHHHH-CCCCCCEEEEECHHHHH Q ss_conf 644135--788998588888410112110122210-25786347861005887 Q gi|254780420|r 241 DIMSLQ--PDGVLLSNGPGDPAVTSVYSSPIICKL-VDSGMPMFGICLGHQLL 290 (396) Q Consensus 241 ~i~~~~--pdgi~lSnGPGdP~~~~~~~~~~~~~~-~~~~iPilGIClGhQ~l 290 (396) +++..+ +++||..|.. .... +...+++. +.+++.|.|.=.-.+.+ T Consensus 175 ~~L~~~pd~~ai~~~nD~---~A~G--a~~al~~~g~~~di~vvG~D~~~~~~ 222 (275) T cd06320 175 TILQRNPDLKAIYCNNDT---MALG--VVEAVKNAGKQGKVLVVGTDGIPEAY 222 (275) T ss_pred HHHHHCCCCCEEEECCCH---HHHH--HHHHHHHCCCCCCEEEEEECCCHHHH T ss_conf 988628766779966988---9999--99999976999982999989769999 No 222 >cd04815 PA_M28_2 PA_M28_2: Protease-associated (PA) domain, peptidase family M28, subfamily-2. A subfamily of PA-domain containing proteins belonging to the peptidase family M28. Family M28 contains aminopeptidases and carboxypeptidases, and has co-catalytic zinc ions. The PA domain is an insert domain in a diverse fraction of proteases. The significance of the PA domain to many of the proteins in which it is inserted is undetermined. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate. Proteins into which the PA domain is inserted include the following members of the peptidase family M28: i) prostate-specific membrane antigen (PSMA), ii) yeast aminopeptidase Y, and ii) human TfR (transferrin receptor)1 and human TfR2. The proteins listed above belong to other subfamilies; relatively little is known a Probab=63.91 E-value=6.6 Score=20.04 Aligned_cols=111 Identities=19% Similarity=0.122 Sum_probs=63.4 Q ss_pred EEEECC-CCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCC-----CCCCCH-HHHCCCCCCCCCEEEEEEEECC Q ss_conf 997567-624888898537888876014161425617321689736-----845882-3420465333210236664046 Q gi|254780420|r 26 GMGCGA-TGSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIG-----NVGVNS-EDFESISRKNFKGAVGLVIKAE 98 (396) Q Consensus 26 G~~fG~-~~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IG-----NyGvn~-~d~ES~~~~~~~~~~g~Iv~~~ 98 (396) |.|.|- .+-..+|||.-.+.--. |.+.+-+-.|.|+.|..|+.- .||--- ........--+..+.|++||.. T Consensus 7 GgSv~Tp~~Gi~aeVv~v~s~~eL-~~~~~~~v~GKIVf~n~~m~~~~~g~~YG~~v~~R~~Ga~~Aak~GAva~liRSv 85 (134) T cd04815 7 GGSVATPPEGITAEVVVVKSFDEL-KAAPAGAVKGKIVFFNQPMVRTQTGSGYGPTVAYRRRGAVEAAKKGAVAVLIRSI 85 (134) T ss_pred CCCCCCCCCCCEEEEEEECCHHHH-HHCCHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEC T ss_conf 787778988848989998899999-6168523486189993785467774521467776525299998569779999603 Q ss_pred CCCCCCCCCCCCHHHHHHHCCCCE--ECCCCHHHHHHHHHH Q ss_conf 544345524179899997427832--112576899999985 Q gi|254780420|r 99 ITEPSNYRANMHFDAWLKSCGIIG--LSGIDTRALTVWIRD 137 (396) Q Consensus 99 ~~~~Sn~rs~~sL~~wL~~~~Ipg--I~GIDTRaLTk~IRe 137 (396) .....+.--..+..-=-...+||+ |+--|--.|-|.+.. T Consensus 86 ~t~~~~~PHTG~m~Y~~~v~kIPaaAiS~~dAd~L~rm~~~ 126 (134) T cd04815 86 GTDSHRSPHTGMMSYDDGVPKIPAAAISVEDADMLERLAAR 126 (134) T ss_pred CCCCCCCCCCCCEECCCCCCCCCEEECCHHHHHHHHHHHHC T ss_conf 67778989757612488999868696188759999999867 No 223 >pfam04848 Pox_A22 Poxvirus A22 protein. Probab=63.30 E-value=6.7 Score=20.00 Aligned_cols=53 Identities=23% Similarity=0.310 Sum_probs=34.7 Q ss_pred EEEEEECCCCCHHHHHHHHH------CCCCEEEECCCC-----HHHHHHCCCCEEEECCCC-CCCC Q ss_conf 49999679641023222220------245413304657-----446441357889985888-8841 Q gi|254780420|r 207 HVVCIDYGIRSNLLRVLSSL------GCRITIVGAETS-----YKDIMSLQPDGVLLSNGP-GDPA 260 (396) Q Consensus 207 kIvvIDfGiK~nIlr~L~~~------g~~v~Vvp~~~~-----~~~i~~~~pdgi~lSnGP-GdP~ 260 (396) .|+.+|-|+| |..+++.+. -+++..+|++.+ +.++.+.+||-|+|---| +.|. T Consensus 2 iIlSiDiGiK-NlA~c~~e~~~n~I~~~~v~~~d~~~d~~~~v~~d~~~~~~d~VLiEkQp~r~~~ 66 (143) T pfam04848 2 IICAIDIGTK-NPARTIIEIKDNKIKLIDISKLDWSSDWEYRVARDLYNYNYDVVLLERQGRRSPN 66 (143) T ss_pred EEEEEECCCC-CCEEEEEECCCCEEEEEEEECCCCCCCHHHHHHHHHHCCCCCEEEEEECCCCCCC T ss_conf 7999842777-7015899737984999986048988527999999974358998999607999984 No 224 >COG1486 CelF Alpha-galactosidases/6-phospho-beta-glucosidases, family 4 of glycosyl hydrolases [Carbohydrate transport and metabolism] Probab=63.16 E-value=8.4 Score=19.29 Aligned_cols=162 Identities=19% Similarity=0.267 Sum_probs=87.7 Q ss_pred CCCCCCCCHHHHHHHCCCCE----ECCCCHHHHH-------HHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCC Q ss_conf 45524179899997427832----1125768999-------999853984236632765458978988877415887432 Q gi|254780420|r 103 SNYRANMHFDAWLKSCGIIG----LSGIDTRALT-------VWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGA 171 (396) Q Consensus 103 Sn~rs~~sL~~wL~~~~Ipg----I~GIDTRaLT-------k~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~ 171 (396) |.+....=+.+-++....|+ +.+||-++|- |.+++.|+. ..|.. ..|.+++++ +. T Consensus 13 St~tp~~v~g~l~~~e~l~~~ei~L~Did~~r~~~i~~~~~~~v~~~g~~-~kv~~----ttd~~eAl~---------gA 78 (442) T COG1486 13 STYTPKLLLGDLARTEELPVRELALYDIDEERLKIIAILAKKLVEEAGAP-VKVEA----TTDRREALE---------GA 78 (442) T ss_pred CCCCHHHHHHHHHCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHHCCC-EEEEE----ECCHHHHHC---------CC T ss_conf 53147999989841755776669998387788999999999999963998-58999----458999844---------79 Q ss_pred CHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHH-----------HHHHHCCCCEEEECCCC-H Q ss_conf 10110133332111686534454344346777771499996796410232-----------22220245413304657-4 Q gi|254780420|r 172 ELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCIDYGIRSNLLR-----------VLSSLGCRITIVGAETS-Y 239 (396) Q Consensus 172 dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvIDfGiK~nIlr-----------~L~~~g~~v~Vvp~~~~-~ 239 (396) |++ .++-.. ++-+-++.-.--+.||.+++ .|+. +|.-.. + T Consensus 79 dfV---i~~~rv------------------G~l~~r~~De~IplkyG~~gqET~G~GGi~~glRt-------Ipvildi~ 130 (442) T COG1486 79 DFV---ITQIRV------------------GGLEAREKDERIPLKHGLYGQETNGPGGIFYGLRT-------IPVILDIA 130 (442) T ss_pred CEE---EEEEEE------------------CCCCCCHHHHCCCHHHCCCCCCCCCCCHHHHHCCC-------CHHHHHHH T ss_conf 889---998750------------------68430022311326417045446472078763024-------41999999 Q ss_pred HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHH----HHHHHCCCCC-CCCCCCCCCCCCCC Q ss_conf 4644135788998588888410112110122210257863478610058----8787528544-66776752101123 Q gi|254780420|r 240 KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQ----LLGLALGAQT-VKMYQGHHGVNHPV 312 (396) Q Consensus 240 ~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ----~l~~a~G~~~-~kl~~GhrG~N~Pv 312 (396) +++.++.||+.+|- --+|+. .+.+.+.+... ++=+.|+|=|-| .||..+|.+- ..+.+-.-|.||=+ T Consensus 131 ~~m~~~~P~Aw~lN--ytNPa~---~vTeAv~r~~~-~~K~VGlCh~~~g~~~~lAe~L~~~~~~~l~~~~aGlNH~~ 202 (442) T COG1486 131 KDMEKVCPNAWMLN--YTNPAA---IVTEAVRRLYP-KIKIVGLCHGPIGIAMELAEVLGLEPREDLRYRVAGLNHMV 202 (442) T ss_pred HHHHHHCCCCEEEE--CCCHHH---HHHHHHHHHCC-CCCEEEECCCHHHHHHHHHHHHCCCCHHCEEEEEECCCHHH T ss_conf 99998697747874--368699---99999998578-78789758845789999999958994321468995030244 No 225 >pfam01070 FMN_dh FMN-dependent dehydrogenase. Probab=63.10 E-value=11 Score=18.60 Aligned_cols=77 Identities=32% Similarity=0.459 Sum_probs=44.7 Q ss_pred HHHHHHHCCCC-EEEECCCCHHHH---HHCCCCEEEECCCCCC-----CCCCCCCCHHHHHHHCCCCCCEE---EEECHH Q ss_conf 32222202454-133046574464---4135788998588888-----41011211012221025786347---861005 Q gi|254780420|r 220 LRVLSSLGCRI-TIVGAETSYKDI---MSLQPDGVLLSNGPGD-----PAVTSVYSSPIICKLVDSGMPMF---GICLGH 287 (396) Q Consensus 220 lr~L~~~g~~v-~Vvp~~~~~~~i---~~~~pdgi~lSnGPGd-----P~~~~~~~~~~~~~~~~~~iPil---GIClGh 287 (396) ++.|++. ..+ .++---.+.++. .+.--|||++||==|. |...+ ..+.+.+....++||+ ||.-|. T Consensus 161 i~~l~~~-~~~PvivKGI~s~eDA~~a~~~Gv~~I~VSnHGGRqlD~~~~t~~--~L~eI~~~v~~~~~i~~DGGIR~G~ 237 (301) T pfam01070 161 LAWLRDQ-WKGPLVLKGILSPEDAKRAVEAGVDGIVVSNHGGRQLDGAPATID--ALPEIVAAVGGRIPVLVDGGIRRGT 237 (301) T ss_pred HHHHHHH-CCCCEEEECCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHH--HHHHHHHHHCCCEEEEECCCCCCCH T ss_conf 9999986-699889982899999999998599999964998544688867999--9999999856774899638747626 Q ss_pred HHH-HHHCCCCCC Q ss_conf 887-875285446 Q gi|254780420|r 288 QLL-GLALGAQTV 299 (396) Q Consensus 288 Q~l-~~a~G~~~~ 299 (396) -++ |+|+||+.. T Consensus 238 DV~KAlALGA~~V 250 (301) T pfam01070 238 DVLKALALGADAV 250 (301) T ss_pred HHHHHHHCCCCEE T ss_conf 8999998089866 No 226 >COG2185 Sbm Methylmalonyl-CoA mutase, C-terminal domain/subunit (cobalamin-binding) [Lipid metabolism] Probab=62.22 E-value=13 Score=18.01 Aligned_cols=54 Identities=19% Similarity=0.242 Sum_probs=34.3 Q ss_pred CCCEEEEEE-------CCCCCHHHHHHHHHCCCCEEEECCCCHHHHHH----CCCCEEEECCCCCC Q ss_conf 771499996-------79641023222220245413304657446441----35788998588888 Q gi|254780420|r 204 AKYHVVCID-------YGIRSNLLRVLSSLGCRITIVGAETSYKDIMS----LQPDGVLLSNGPGD 258 (396) Q Consensus 204 ~~~kIvvID-------fGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~----~~pdgi~lSnGPGd 258 (396) .++||++.- -|+|- |-|.|...|++|+..+...+.+|+.+ .+.|.|.+|.=-|. T Consensus 11 ~rprvlvak~GlDgHd~gakv-ia~~l~d~GfeVi~~g~~~tp~e~v~aA~~~dv~vIgvSsl~g~ 75 (143) T COG2185 11 ARPRVLVAKLGLDGHDRGAKV-IARALADAGFEVINLGLFQTPEEAVRAAVEEDVDVIGVSSLDGG 75 (143) T ss_pred CCCEEEEECCCCCCCCCCHHH-HHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCH T ss_conf 986699953676544313199-99999857937981587589999999998647988999734404 No 227 >COG0595 mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair] Probab=61.78 E-value=8.3 Score=19.36 Aligned_cols=45 Identities=16% Similarity=0.056 Sum_probs=22.7 Q ss_pred EEEEEEECCCCCCCCCCCCCCHHHHHHHCC-CCEECCCCHHHHHH-HHHHCC Q ss_conf 236664046544345524179899997427-83211257689999-998539 Q gi|254780420|r 90 AVGLVIKAEITEPSNYRANMHFDAWLKSCG-IIGLSGIDTRALTV-WIRDYG 139 (396) Q Consensus 90 ~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~-IpgI~GIDTRaLTk-~IRe~G 139 (396) ++|+++...+.. | ...+.-.|.+.+ +|-...-=|.+|.+ ++++++ T Consensus 67 vkgI~lTHgHeD--H---IGaip~ll~~~~~~piy~s~lt~~Li~~k~~~~~ 113 (555) T COG0595 67 VKGIFLTHGHED--H---IGALPYLLKQVLFAPIYASPLTAALIKEKLKEHG 113 (555) T ss_pred CEEEEECCCCHH--H---CCCHHHHHHCCCCCCEECCHHHHHHHHHHHHHHC T ss_conf 139994687444--3---2346789852776844447766999998988732 No 228 >pfam02056 Glyco_hydro_4 Family 4 glycosyl hydrolase. Probab=61.46 E-value=9.7 Score=18.88 Aligned_cols=49 Identities=22% Similarity=0.327 Sum_probs=33.4 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 4464413578899858888841011211012221025786347861005887875 Q gi|254780420|r 239 YKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 239 ~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) ++++.++.||+++|- -.+|... +.+.+.+.. .++.++|+|=++|-+... T Consensus 128 a~~i~e~~P~AwliN--ytNP~~~---vt~al~r~~-~~~k~vGlCh~~~~~~~~ 176 (183) T pfam02056 128 AKDIEELCPDAWVLN--YTNPAAM---VTEAVYRRY-PNIKAVGLCHSPIGIEER 176 (183) T ss_pred HHHHHHHCCCCEEEE--CCCHHHH---HHHHHHHHC-CCCCEEEECCCHHHHHHH T ss_conf 999999799838998--2788899---999999858-999889969877999999 No 229 >smart00732 YqgFc Likely ribonuclease with RNase H fold. YqgF proteins are likely to function as an alternative to RuvC in most bacteria, and could be the principal holliday junction resolvases in low-GC Gram-positive bacteria. In Spt6p orthologues, the catalytic residues are substituted indicating that they lack enzymatic functions. Probab=61.43 E-value=13 Score=17.88 Aligned_cols=48 Identities=15% Similarity=0.209 Sum_probs=28.0 Q ss_pred CEEEEEECCCCCHHH---HHHHHHCCCCEEEECCCC---HHHH----HHCCCCEEEEC Q ss_conf 149999679641023---222220245413304657---4464----41357889985 Q gi|254780420|r 206 YHVVCIDYGIRSNLL---RVLSSLGCRITIVGAETS---YKDI----MSLQPDGVLLS 253 (396) Q Consensus 206 ~kIvvIDfGiK~nIl---r~L~~~g~~v~Vvp~~~~---~~~i----~~~~pdgi~lS 253 (396) +||..||||.|.--+ +.+...-.-+.+++.... .+.| .+++|+.|++. T Consensus 1 krvLgiD~G~kriG~Ai~D~~~~~a~pl~~i~~~~~~~~~~~l~~~i~~~~~~~iviG 58 (99) T smart00732 1 KRVLGLDPGRKGIGVAVVDETGKLADPLEVIPRTNKEADAARLKKLIKKYQPDLIVIG 58 (99) T ss_pred CCEEEEEECCCCEEEEEEECCCCEECCCEEEECCCCHHHHHHHHHHHHHHCCCEEEEC T ss_conf 9799992178969999998999732131899859826999999999998499889974 No 230 >cd06292 PBP1_LacI_like_10 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=61.32 E-value=13 Score=17.87 Aligned_cols=10 Identities=50% Similarity=0.753 Sum_probs=3.8 Q ss_pred HHHHHHHCCC Q ss_conf 3222220245 Q gi|254780420|r 220 LRVLSSLGCR 229 (396) Q Consensus 220 lr~L~~~g~~ 229 (396) +++|.+.|++ T Consensus 114 ~~~L~~~G~~ 123 (273) T cd06292 114 VRHLVALGHR 123 (273) T ss_pred HHHHHHHCCC T ss_conf 9999985998 No 231 >cd05797 Ribosomal_L10 Ribosomal protein L10 family, L10 subfamily; composed of bacterial 50S ribosomal protein and eukaryotic mitochondrial 39S ribosomal protein, L10. L10 occupies the L7/L12 stalk of the ribosome. The N-terminal domain (NTD) of L10 interacts with L11 protein and forms the base of the L7/L12 stalk, while the extended C-terminal helix binds to two or three dimers of the NTD of L7/L12 (L7 and L12 are identical except for an acetylated N-terminus). The L7/L12 stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WASP-in Probab=60.49 E-value=14 Score=17.85 Aligned_cols=92 Identities=20% Similarity=0.302 Sum_probs=49.5 Q ss_pred EEEEEEC-CCCCHHH----HHHHHHCCCCEEEECCC--------CHHHHHH--CCCCEEEECCCCCCCCCCCCCCHHHHH Q ss_conf 4999967-9641023----22222024541330465--------7446441--357889985888884101121101222 Q gi|254780420|r 207 HVVCIDY-GIRSNLL----RVLSSLGCRITIVGAET--------SYKDIMS--LQPDGVLLSNGPGDPAVTSVYSSPIIC 271 (396) Q Consensus 207 kIvvIDf-GiK~nIl----r~L~~~g~~v~Vvp~~~--------~~~~i~~--~~pdgi~lSnGPGdP~~~~~~~~~~~~ 271 (396) -++++|| |+.-+.. +.|.+.|+++.|+.++. .++.+.. ..|-++++|+. ||....+.... +. T Consensus 21 ~i~~~~y~gl~v~~~~~lR~~l~~~~~~~kVvKNtL~~~Al~~t~~~~l~~~l~G~~a~~f~~e--d~~~~aK~i~~-f~ 97 (157) T cd05797 21 SVVVADYRGLTVAQLTELRKELREAGVKLKVVKNTLAKRALEGTGFEDLDDLLKGPTAIAFSEE--DPVAAAKVLKD-FA 97 (157) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCCHHHHCCCCEEEEEECC--CCCHHHHHHHH-HH T ss_conf 7999976899689999999999976986999614999999861585312553667769999368--85388999999-98 Q ss_pred HHCCCCCCEEEEECHHHHHHHHCCCCCCCCC Q ss_conf 1025786347861005887875285446677 Q gi|254780420|r 272 KLVDSGMPMFGICLGHQLLGLALGAQTVKMY 302 (396) Q Consensus 272 ~~~~~~iPilGIClGhQ~l~~a~G~~~~kl~ 302 (396) +-. .++=+.|-|+.-+++..+-=-..-||| T Consensus 98 K~~-~~~~~~gg~~eg~~l~~~~i~~lakLP 127 (157) T cd05797 98 KEN-KKLEIKGGVVEGKVLDAEEVKALAKLP 127 (157) T ss_pred HHC-CCCCEEEEEECCEECCHHHHHHHHCCC T ss_conf 752-552012333257132899999997599 No 232 >pfam01976 DUF116 Protein of unknown function DUF116. This archaebacterial protein has no known function. The protein contains seven conserved cysteines and may also be an integral membrane protein. Probab=60.34 E-value=12 Score=18.28 Aligned_cols=69 Identities=23% Similarity=0.374 Sum_probs=43.6 Q ss_pred EEEECCCCCH---HHHHHHHHCCCCEEEECCCCHHHH-HHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 9996796410---232222202454133046574464-413578899858888841011211012221025786347861 Q gi|254780420|r 209 VCIDYGIRSN---LLRVLSSLGCRITIVGAETSYKDI-MSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 209 vvIDfGiK~n---Ilr~L~~~g~~v~Vvp~~~~~~~i-~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) -.-.|| |=. +++.-.++|+++-|+|..+-+..+ ...+|+||+= -++.....+-+..+....+|.+||- T Consensus 66 ~C~~CG-~C~I~~l~~la~~~G~~v~i~~Ggt~~rkii~~~~p~~Iig-------VAC~~dL~~gi~~~~~~~ip~~gV~ 137 (158) T pfam01976 66 NCKECG-KCDIGDLKELAEEYGYKVYIVPGGTFAKKIIKEYRPKAIIG-------VACERDLNEGIQDLKKKGIPVQGVL 137 (158) T ss_pred CCCCCC-CCCHHHHHHHHHHCCCEEEEECCCHHHHHHHHHCCCCEEEE-------EECHHHHHHHHHHHHCCCCCEEEEE T ss_conf 287899-98442799999980987999648289999998729998999-------9204899999998650899869997 Q ss_pred C Q ss_conf 0 Q gi|254780420|r 285 L 285 (396) Q Consensus 285 l 285 (396) + T Consensus 138 l 138 (158) T pfam01976 138 L 138 (158) T ss_pred C T ss_conf 5 No 233 >cd03465 URO-D_like The URO-D _like protein superfamily includes bacterial and eukaryotic uroporphyrinogen decarboxylases (URO-D), coenzyme M methyltransferases and other putative bacterial methyltransferases. Uroporphyrinogen decarboxylase (URO-D) decarboxylates the four acetate side chains of uroporphyrinogen III (uro-III) to create coproporphyrinogen III, an important branching point of the tetrapyrrole biosynthetic pathway. The methyltransferases represented here are important for ability of methanogenic organisms to use other compounds than carbon dioxide for reduction to methane. Probab=59.20 E-value=13 Score=18.01 Aligned_cols=50 Identities=14% Similarity=0.202 Sum_probs=37.8 Q ss_pred CEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCC Q ss_conf 14999967964102322222024541330465744644135788998588 Q gi|254780420|r 206 YHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNG 255 (396) Q Consensus 206 ~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnG 255 (396) ..+.+.-||--.+++..+.+.|+++.-+-+..+..+..+.-++.+.|-++ T Consensus 222 ~~~ilh~~g~~~~~l~~~~~~~~~~~~~d~~~~l~~~~~~~~~~~~i~Gn 271 (330) T cd03465 222 GPVIHHNCGDTAPILELMADLGADVFSIDVTVDLAEAKKKVGDKACLMGN 271 (330) T ss_pred CCEEEECCCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHCCCCEEEEC T ss_conf 98367407862879999986588889858889999999980999258817 No 234 >PRK13131 consensus Probab=58.50 E-value=15 Score=17.54 Aligned_cols=36 Identities=19% Similarity=0.276 Sum_probs=16.5 Q ss_pred EEEEECCCCCH--HHHHHHHHCCC-CEEEECCCCHHHHH Q ss_conf 99996796410--23222220245-41330465744644 Q gi|254780420|r 208 VVCIDYGIRSN--LLRVLSSLGCR-ITIVGAETSYKDIM 243 (396) Q Consensus 208 IvvIDfGiK~n--Ilr~L~~~g~~-v~Vvp~~~~~~~i~ 243 (396) +.+.|+-...+ +.+.+.+.|.. +-.+..+++.+.+. T Consensus 119 vIipDLP~eE~~~~~~~~~~~~l~~I~lvaPtt~~~Ri~ 157 (257) T PRK13131 119 VLIADMPLIEKELVIKSAQKHQIKQIFIASPNASVKDLE 157 (257) T ss_pred EECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHH T ss_conf 655899967889999999977984799728999889999 No 235 >PRK01710 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=58.14 E-value=15 Score=17.50 Aligned_cols=32 Identities=16% Similarity=0.315 Sum_probs=20.5 Q ss_pred HHHHHCCCCEECCCC-----HHHHHHHHHHCCCCCEEE Q ss_conf 999742783211257-----689999998539842366 Q gi|254780420|r 113 AWLKSCGIIGLSGID-----TRALTVWIRDYGVSNSVI 145 (396) Q Consensus 113 ~wL~~~~IpgI~GID-----TRaLTk~IRe~G~~~a~I 145 (396) .++.+.++.||+|=. |-.|...|++.|- ...+ T Consensus 112 ~~~~~~~~IaVTGTnGKTTTtsmi~~iL~~~g~-~~~~ 148 (458) T PRK01710 112 IKYCPAKVFGITGSDGKTTTTTLIYEMLKEEGY-KTWV 148 (458) T ss_pred HHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCC-CEEE T ss_conf 863758779993689847899999999998499-7585 No 236 >cd01141 TroA_d Periplasmic binding protein TroA_d. These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains. Probab=57.75 E-value=15 Score=17.46 Aligned_cols=42 Identities=29% Similarity=0.417 Sum_probs=28.1 Q ss_pred CCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 46574464413578899858888841011211012221025786347861 Q gi|254780420|r 235 AETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 235 ~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) .+-++|.|++++||-||.+.+.. |.. .+.++-+.+||++-+. T Consensus 58 ~~pn~E~ilaL~PDlVi~~~~~~-~~~-------~~~~L~~~gI~v~~~~ 99 (186) T cd01141 58 GSLNVELIVALKPDLVILYGGFQ-AQT-------ILDKLEQLGIPVLYVN 99 (186) T ss_pred CCCCHHHHHHHCCCEEEEECCCC-CHH-------HHHHHHHCCCCEEEEE T ss_conf 89899999970999999958878-678-------9999996499579960 No 237 >PRK13117 consensus Probab=56.13 E-value=16 Score=17.27 Aligned_cols=33 Identities=15% Similarity=0.179 Sum_probs=11.3 Q ss_pred EEEEECCCCCH--HHHHHHHHCCC-CEEEECCCCHH Q ss_conf 99996796410--23222220245-41330465744 Q gi|254780420|r 208 VVCIDYGIRSN--LLRVLSSLGCR-ITIVGAETSYK 240 (396) Q Consensus 208 IvvIDfGiK~n--Ilr~L~~~g~~-v~Vvp~~~~~~ 240 (396) +.+.|+-.-.+ +...+.+.|.. +-+++.+++-+ T Consensus 126 vIipDLP~eE~~~~~~~~~~~gl~~I~lv~Ptt~~~ 161 (268) T PRK13117 126 VLIADVPVEESAPFRQAAKKHGIAPIFICPPNADDD 161 (268) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHH T ss_conf 985799978858999999867983799847999999 No 238 >TIGR01184 ntrCD nitrate ABC transporter, ATP-binding proteins C and D; InterPro: IPR005890 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This model describes the ATP binding subunits of nitrate transport in bacteria and archaea. This protein belongs to the ATP-binding cassette (ABC) superfamily. It is thought that the two subunits encoded by ntrC and ntrD form the binding surface for interaction with ATP. This model is restricted in identifying ATP binding subunit associated with the nitrate transport. Nitrate assimilation is aided by other proteins derived from the operon which among others include products of ntrA - a regulatory protein; ntrB - a hydropbobic transmembrane permease and narB - a reductase. ; GO: 0015112 nitrate transmembrane transporter activity, 0015706 nitrate transport, 0016020 membrane. Probab=55.57 E-value=16 Score=17.29 Aligned_cols=55 Identities=25% Similarity=0.391 Sum_probs=33.1 Q ss_pred CCCCCCEEEEEE--CC-----CCCHHHHHHHH----HCCCCEEEECCCCHHHHHHCCCCEEEECCCCC Q ss_conf 777771499996--79-----64102322222----02454133046574464413578899858888 Q gi|254780420|r 201 SSDAKYHVVCID--YG-----IRSNLLRVLSS----LGCRITIVGAETSYKDIMSLQPDGVLLSNGPG 257 (396) Q Consensus 201 ~~~~~~kIvvID--fG-----iK~nIlr~L~~----~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPG 257 (396) .-..++++.++| || .|.|+=..|.+ ..+.+..+-+|.+-.-+++. --|.|.|||. T Consensus 128 aLs~RP~~LlLDEPFGALDAlTr~~LQe~L~~I~~e~~~T~~MvTHDvDEAllLsD--R~VMlTnGP~ 193 (230) T TIGR01184 128 ALSIRPKVLLLDEPFGALDALTRGNLQEELLQIVEEARVTVVMVTHDVDEALLLSD--RVVMLTNGPA 193 (230) T ss_pred HHHHCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHC--CEEEEECCCC T ss_conf 65317601231087405667526889999999873169748998524457888735--1024307774 No 239 >cd06344 PBP1_ABC_ligand_binding_like_9 Type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in uptake of amino acids, peptides, or inorganic ions. This subgroup includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in uptake of amino acids, peptides, or inorganic ions. Members of this group are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters, such as leucine/isoleucine/valine binding protein (LIVBP); however their ligand specificity has not been determined experimentally. Probab=55.41 E-value=15 Score=17.56 Aligned_cols=163 Identities=15% Similarity=0.144 Sum_probs=72.4 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCC----HHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCC Q ss_conf 66404654434552417989999742783211257----68999999853984236632765458978988877415887 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGID----TRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGL 168 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GID----TRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~ 168 (396) +|+.+--..|.. +..-..++..+.+|.+|-|-- |-+......+.|... |+...... .+... T Consensus 42 lv~~Dd~~dp~~--a~~~a~kLv~~~~V~aviG~~~S~~~~a~~~i~~~~~ip~--is~~at~~----~lt~~------- 106 (332) T cd06344 42 VVIANDGNDPEI--AKKVADELVKDPEILGVVGHYSSDATLAALDIYQKAKLVL--ISPTSTSV----KLSNP------- 106 (332) T ss_pred EEEECCCCCHHH--HHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHHCCCEE--EECCCCCC----CCCCC------- T ss_conf 999679999899--9999999973599279997888655788899998518457--75575775----33567------- Q ss_pred CCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEE-E----CCC--CCHHHHHHHHHCCCCEEEECCC---- Q ss_conf 4321011013333211168653445434434677777149999-6----796--4102322222024541330465---- Q gi|254780420|r 169 KGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCI-D----YGI--RSNLLRVLSSLGCRITIVGAET---- 237 (396) Q Consensus 169 ~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvI-D----fGi--K~nIlr~L~~~g~~v~Vvp~~~---- 237 (396) .+..-++...... +...+ ..+........+++++ | ||. .......|.++|-.+.+-+++. T Consensus 107 --~~~~Fr~~~~~~~-~~~~l------a~~~~~~~~~k~vai~~~~~~~yg~~~~~~f~~~~~~~gg~~v~~~~~~~~~d 177 (332) T cd06344 107 --GPYFFRTVPSNAV-AARAL------AKYLKKKNKIKKVAIFYNSTSPYSQSLKQEFTSALLERGGGIVVTPCDLSSPD 177 (332) T ss_pred --CCEEEEECCCHHH-HHHHH------HHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEEEECCCCC T ss_conf --7718992488689-99999------99999854898899999478647789999999999982995999987169899 Q ss_pred -CH----HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf -74----464413578899858888841011211012221025786347861 Q gi|254780420|r 238 -SY----KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 238 -~~----~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) ++ .++.+.+||.||+++.+++ ... ..... +....++|++|-. T Consensus 178 ~d~~~~l~~i~~~~~d~i~~~~~~~~---~~~-~~~~~-~~~g~~~~l~g~~ 224 (332) T cd06344 178 FNANTAVSQAINNGATVLVLFPDTDT---LDK-ALEVA-KANKGRLTLLGGD 224 (332) T ss_pred CCHHHHHHHHHHCCCCEEEECCCHHH---HHH-HHHHH-HHCCCCCEEEECC T ss_conf 79999999986379989999086788---999-99999-9769995599458 No 240 >cd01148 TroA_a Metal binding protein TroA_a. These proteins are predicted to function as initial receptors in ABC transport of metal ions in eubacteria. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. Probab=55.04 E-value=17 Score=17.16 Aligned_cols=18 Identities=17% Similarity=0.510 Sum_probs=16.1 Q ss_pred CCCHHHHHHCCCCEEEEC Q ss_conf 657446441357889985 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLS 253 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lS 253 (396) ..+.|.|.+++||-||+. T Consensus 213 ~vs~E~i~~~nPDvIiv~ 230 (284) T cd01148 213 TVSWETVIARNPDVIVII 230 (284) T ss_pred CCCHHHHHHHCCCEEEEE T ss_conf 348888877189989998 No 241 >PRK07226 fructose-bisphosphate aldolase; Provisional Probab=55.03 E-value=17 Score=17.15 Aligned_cols=68 Identities=18% Similarity=0.348 Sum_probs=43.3 Q ss_pred HHHHHHHCCCCEEEECCCC---HHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHH Q ss_conf 3222220245413304657---44644135788998588888410112110122210257863478610058878 Q gi|254780420|r 220 LRVLSSLGCRITIVGAETS---YKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLG 291 (396) Q Consensus 220 lr~L~~~g~~v~Vvp~~~~---~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~ 291 (396) .|.=.++|++++=++|.-+ +.++.+-.|--|+++.||-.+... .....++..++.+ -.|||+|-++.- T Consensus 166 aRia~ELGADiVKv~y~gd~e~f~~vv~~~~vPVliaGG~k~~~~~--~~L~~v~~ai~aG--a~Gv~~GRNvfQ 236 (266) T PRK07226 166 ARVGAELGADIIKTNYTGDPESFREVVEGCPVPVVIAGGPKTGSDL--EFLQMVEDAMEAG--AAGVAIGRNIFQ 236 (266) T ss_pred HHHHHHHCCCEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCHH--HHHHHHHHHHHCC--CCEEEECCHHHC T ss_conf 9999996799898359999899999998479878995788889889--9999999999759--946652610235 No 242 >TIGR03261 phnS2 putative 2-aminoethylphosphonate ABC transport system, 1-aminoethylphosphonate-binding protein component. This ABC transporter extracellular solute-binding protein is found in a number of genomes in operon-like contexts strongly suggesting a substrate specificity for 2-aminoethylphosphonate (2-AEP). The characterized PhnSTUV system is absent in the genomes in which this system is found. These genomes encode systems for the catabolism of 2-AEP, making the need for a 2-AEP-specific transporter likely. Probab=54.68 E-value=8.8 Score=19.18 Aligned_cols=40 Identities=15% Similarity=-0.006 Sum_probs=22.3 Q ss_pred CCCHHHHHHHC-C--CCEECCCCHHHHHHHHH-HCCCCCEEEECC Q ss_conf 17989999742-7--83211257689999998-539842366327 Q gi|254780420|r 108 NMHFDAWLKSC-G--IIGLSGIDTRALTVWIR-DYGVSNSVIAHH 148 (396) Q Consensus 108 ~~sL~~wL~~~-~--IpgI~GIDTRaLTk~IR-e~G~~~a~I~~~ 148 (396) +.=+..|-+++ | |--+. -+|=.|..+|+ |++...+=+... T Consensus 36 ~~i~~~Fek~~tgIkV~~v~-~~tgel~~rl~aEk~~p~aDV~~g 79 (334) T TIGR03261 36 AKYKDAFEKVNPDIKINWVR-DSTGIITAKLLAEKNNPQADVVWG 79 (334) T ss_pred HHHHHHHHHHCCCCEEEEEE-CCHHHHHHHHHHCCCCCCCEEEEE T ss_conf 99999999777994899998-972999999996457998659984 No 243 >PRK04885 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=54.53 E-value=10 Score=18.65 Aligned_cols=19 Identities=21% Similarity=0.345 Sum_probs=12.5 Q ss_pred HHHHHCCCCEEEECCCCCC Q ss_conf 4644135788998588888 Q gi|254780420|r 240 KDIMSLQPDGVLLSNGPGD 258 (396) Q Consensus 240 ~~i~~~~pdgi~lSnGPGd 258 (396) +.+..+.-||+++|---|- T Consensus 141 ~~~~~~r~DGlIVSTPTGS 159 (265) T PRK04885 141 ELFERFRGDGLCVSTPTGS 159 (265) T ss_pred EEEEEEECCEEEEECCCCH T ss_conf 9989995677999669851 No 244 >pfam01513 NAD_kinase ATP-NAD kinase. Members of this family include ATP-NAD kinases EC:2.7.1.23, which catalyses the phosphorylation of NAD to NADP utilising ATP and other nucleoside triphosphates as well as inorganic polyphosphate as a source of phosphorus. Also includes NADH kinases EC:2.7.1.86. Probab=54.18 E-value=12 Score=18.23 Aligned_cols=25 Identities=8% Similarity=0.211 Sum_probs=14.0 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHC Q ss_conf 666404654434552417989999742 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSC 118 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~ 118 (396) |+|++-.. +.+-.....+..||.++ T Consensus 3 giv~n~~~--~~a~~~~~~l~~~L~~~ 27 (243) T pfam01513 3 GIIVNPDK--VEAEERASELQRLLLDA 27 (243) T ss_pred EEEECCCC--HHHHHHHHHHHHHHHHC T ss_conf 99987996--78999999999998864 No 245 >PRK04690 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=53.48 E-value=18 Score=16.99 Aligned_cols=26 Identities=23% Similarity=0.432 Sum_probs=14.3 Q ss_pred CCCEECCCC-----HHHHHHHHHHCCCCCEEE Q ss_conf 783211257-----689999998539842366 Q gi|254780420|r 119 GIIGLSGID-----TRALTVWIRDYGVSNSVI 145 (396) Q Consensus 119 ~IpgI~GID-----TRaLTk~IRe~G~~~a~I 145 (396) ++.||+|=. |-.|..+|+..|- ++.+ T Consensus 116 k~IaVTGTNGKTTTt~ll~~iL~~~g~-~~~~ 146 (468) T PRK04690 116 GAVCVTGTKGKSTTTALLAHLLRAAGH-RTAL 146 (468) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHCCC-CEEE T ss_conf 789996998589999999999985799-8289 No 246 >PRK04663 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=53.13 E-value=18 Score=16.95 Aligned_cols=32 Identities=25% Similarity=0.413 Sum_probs=19.9 Q ss_pred CCHHHHHHHCCCCEECCCC-----HHHHHHHHHHCCC Q ss_conf 7989999742783211257-----6899999985398 Q gi|254780420|r 109 MHFDAWLKSCGIIGLSGID-----TRALTVWIRDYGV 140 (396) Q Consensus 109 ~sL~~wL~~~~IpgI~GID-----TRaLTk~IRe~G~ 140 (396) ..|-.|..+..|.||+|=. |..|...|+..|. T Consensus 99 iel~~~~~~~~iIaVTGTnGKTTTtsli~~iL~~~g~ 135 (438) T PRK04663 99 IELFAWAVDKPVIAITGSNGKSTVTDLTGVMAKAAGV 135 (438) T ss_pred HHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHCCC T ss_conf 8999764589789994899828999999999982899 No 247 >PRK02705 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=52.76 E-value=18 Score=16.91 Aligned_cols=25 Identities=28% Similarity=0.478 Sum_probs=11.2 Q ss_pred HHHCCCCEECCCC-----HHHHHHHHHHCC Q ss_conf 9742783211257-----689999998539 Q gi|254780420|r 115 LKSCGIIGLSGID-----TRALTVWIRDYG 139 (396) Q Consensus 115 L~~~~IpgI~GID-----TRaLTk~IRe~G 139 (396) ++...+.||+|=. |-.|..+++..| T Consensus 106 ~~~~~~IaVTGTnGKTTTt~ll~~iL~~~g 135 (459) T PRK02705 106 LKHIPWVGITGTNGKTTVTHLLAHILQAAG 135 (459) T ss_pred HCCCCEEEEECCCCCHHHHHHHHHHHHHCC T ss_conf 504975777178972789999999999839 No 248 >cd06360 PBP1_alkylbenzenes_like Type I periplasmic binding component of active transport systems that are predicted be involved in anaerobic biodegradation of alkylbenzenes such as toluene and ethylbenzene. This group includes the type I periplasmic binding component of active transport systems that are predicted be involved in anaerobic biodegradation of alkylbenzenes such as toluene and ethylbenzene; their substrate specificity is not well characterized, however. Probab=52.65 E-value=18 Score=16.90 Aligned_cols=172 Identities=16% Similarity=0.201 Sum_probs=74.9 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHH----HHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCC Q ss_conf 6664046544345524179899997427832112576899----999985398423663276545897898887741588 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRAL----TVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSG 167 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaL----Tk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~ 167 (396) -+|+.+.-..|. ++.....+...+.+|.+|-|--+-.. ...+.+.|...-.-....+ . +. . T Consensus 40 elv~~D~~~~p~--~a~~~a~~Li~~d~V~aiiG~~~S~~~~a~~~~~~~~~vp~i~~~s~~~-~-----~~-------~ 104 (336) T cd06360 40 EFVVEDDEAKPD--VAVEKARKLIEQDKVDVVVGPVHSGEALAMVKVLREPGTPLINPNAGAD-D-----LT-------G 104 (336) T ss_pred EEEEECCCCCHH--HHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHCCCEEECCCCCCC-E-----EE-------C T ss_conf 999867999989--9999999998549963885576707655333999850953431255553-2-----41-------2 Q ss_pred CCCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEE----CCC--CCHHHHHHHHHCCCCEEE---EC-CC Q ss_conf 743210110133332111686534454344346777771499996----796--410232222202454133---04-65 Q gi|254780420|r 168 LKGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCID----YGI--RSNLLRVLSSLGCRITIV---GA-ET 237 (396) Q Consensus 168 ~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvID----fGi--K~nIlr~L~~~g~~v~Vv---p~-~~ 237 (396) ....+..-.++.... .+...... +.. .....+|+++. ||. -..+.+.|.+.|.+|.-. |. .+ T Consensus 105 ~~~~~~~Fr~~~~~~-~~~~~~~~------~~~-~~~~~~vaiv~~d~~~g~~~~~~~~~~~~~~G~~vv~~~~~~~~~~ 176 (336) T cd06360 105 RLCAPNFFRTSFSNA-QWAAPMGK------YAA-DDGYKKVVTVAWDYAFGYEVVEGFKEAFTEAGGKIVKELWVPFGTS 176 (336) T ss_pred CCCCCCEEEECCCHH-HHHHHHHH------HHH-HHCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCC T ss_conf 578776687059859-99999999------999-8489869994687446699999999999973996999996589985 Q ss_pred CHH----HHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE-ECHHHH Q ss_conf 744----6441357889985888884101121101222102578634786-100588 Q gi|254780420|r 238 SYK----DIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI-CLGHQL 289 (396) Q Consensus 238 ~~~----~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI-ClGhQ~ 289 (396) ++. .+.+.+||.|++...+.+-. .........=+..++|++|- ++.-+. T Consensus 177 Df~~~i~~~~~~~pd~v~~~~~~~~~~---~~~~q~~~~G~~~~~~~~g~~~~~~~~ 230 (336) T cd06360 177 DFASYLAQIPDDVPDAVFVFFAGGDAI---KFVKQYDAAGLKAKIPLIGSGFLTDGT 230 (336) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCHHH---HHHHHHHHCCCCCCEEEEEECCCCCHH T ss_conf 679999999876989999935660489---999999973998780699865447888 No 249 >PRK03670 competence damage-inducible protein A; Provisional Probab=52.17 E-value=18 Score=16.98 Aligned_cols=11 Identities=36% Similarity=0.845 Sum_probs=4.3 Q ss_pred CCEEEECCCCC Q ss_conf 78899858888 Q gi|254780420|r 247 PDGVLLSNGPG 257 (396) Q Consensus 247 pdgi~lSnGPG 257 (396) +|-||.|+|=| T Consensus 61 ~d~Vi~tGGLG 71 (252) T PRK03670 61 PEVLIISGGLG 71 (252) T ss_pred CCEEEECCCCC T ss_conf 99999838827 No 250 >PRK12778 putative bifunctional 2-polyprenylphenol hydroxylase/glutamate synthase subunit beta; Provisional Probab=51.83 E-value=19 Score=16.87 Aligned_cols=41 Identities=24% Similarity=0.445 Sum_probs=26.5 Q ss_pred HHHHHCCCCE---EEECCCCHHHHHHCCCCEEEECCCCCCCCCC Q ss_conf 2222024541---3304657446441357889985888884101 Q gi|254780420|r 222 VLSSLGCRIT---IVGAETSYKDIMSLQPDGVLLSNGPGDPAVT 262 (396) Q Consensus 222 ~L~~~g~~v~---Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~ 262 (396) .|.+.|+++. .+--+.+++++++..+|+|||.-|-+.|... T Consensus 498 ~l~~lGV~~~~n~~vGkdit~~eL~~egyDAVfla~Ga~~pr~l 541 (760) T PRK12778 498 NLVKMGVTFEKDCIVGKTISVEELEEEGFKGIFVASGAGLPNFM 541 (760) T ss_pred HHHHCCCEEECCCEECCCCCHHHHHHCCCCEEEEECCCCCCCCC T ss_conf 99864979989957788689999975889999994067877528 No 251 >pfam02844 GARS_N Phosphoribosylglycinamide synthetase, N domain. Phosphoribosylglycinamide synthetase catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by Phosphoribosylglycinamide synthetase is the ATP- dependent addition of 5-phosphoribosylamine to glycine to form 5'phosphoribosylglycinamide. This domain is related to the N-terminal domain of biotin carboxylase/carbamoyl phosphate synthetase (see pfam00289). Probab=51.74 E-value=19 Score=16.80 Aligned_cols=38 Identities=32% Similarity=0.632 Sum_probs=24.7 Q ss_pred CHHHHH----HCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEE Q ss_conf 744644----135788998588888410112110122210257863478 Q gi|254780420|r 238 SYKDIM----SLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFG 282 (396) Q Consensus 238 ~~~~i~----~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilG 282 (396) ++++|. +.++|-+|+ ||-+|-. .-+...+.+.++|+|| T Consensus 49 d~~~i~~~~~~~~idlvii--GPE~pL~-----~Gl~D~l~~~gi~vfG 90 (99) T pfam02844 49 DFEALADFAKEENIDLVVV--GPEAPLV-----AGIVDALRAAGIPVFG 90 (99) T ss_pred CHHHHHHHHHHCCCCEEEE--CCCHHHH-----HHHHHHHHHCCCEEEC T ss_conf 7999999999819749998--9606778-----7889999868991799 No 252 >cd01425 RPS2 Ribosomal protein S2 (RPS2), involved in formation of the translation initiation complex, where it might contact the messenger RNA and several components of the ribosome. It has been shown that in Escherichia coli RPS2 is essential for the binding of ribosomal protein S1 to the 30s ribosomal subunit. In humans, most likely in all vertebrates, and perhaps in all metazoans, the protein also functions as the 67 kDa laminin receptor (LAMR1 or 67LR), which is formed from a 37 kDa precursor, and is overexpressed in many tumors. 67LR is a cell surface receptor which interacts with a variety of ligands, laminin-1 and others. It is assumed that the ligand interactions are mediated via the conserved C-terminus, which becomes extracellular as the protein undergoes conformational changes which are not well understood. Specifically, a conserved palindromic motif, LMWWML, may participate in the interactions. 67LR plays essential roles in the adhesion of cells to the basement membrane an Probab=50.85 E-value=15 Score=17.58 Aligned_cols=35 Identities=26% Similarity=0.354 Sum_probs=24.1 Q ss_pred HHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEC Q ss_conf 4135788998588888410112110122210257863478610 Q gi|254780420|r 243 MSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICL 285 (396) Q Consensus 243 ~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGICl 285 (396) +...||.||+.+ |.. ...+++++...+||+.|||= T Consensus 124 ~~~~Pdliiv~d----~~~----~~~ai~Ea~~l~IPvI~i~D 158 (193) T cd01425 124 MFRLPDLVIVLD----PRK----EHQAIREASKLGIPVIAIVD 158 (193) T ss_pred CCCCCCEEEEEC----CCC----CHHHHHHHHHHCCCEEEEEC T ss_conf 166999899988----984----16899999861875578850 No 253 >cd03148 GATase1_EcHsp31_like Type 1 glutamine amidotransferase (GATase1)-like domain found in Escherichia coli Hsp31 protein (EcHsp31). Type 1 glutamine amidotransferase (GATase1)-like domain found in Escherichia coli Hsp31 protein (EcHsp31). This group includes proteins similar to EcHsp31. EcHsp31 has chaperone activity. EcHsp31 coordinates a metal ion using a 2-His-1-carboxylate motif present in various ions that use iron as a cofactor such as Carboxypeptidase A. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, in common with a typical GATase1domain, a reactive Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. This Cys together with a different His and, an Asp (rather than a Glu) residue form a different catalytic triad from the typical GATase1 domain. EcHsp31 is a homodimer. Probab=50.67 E-value=12 Score=18.26 Aligned_cols=52 Identities=13% Similarity=0.107 Sum_probs=35.1 Q ss_pred HHHCCCCEEEECCCCCCCCCC--CCCCHHHHHHHCCCCCCEEEEECHHHHHHHH Q ss_conf 441357889985888884101--1211012221025786347861005887875 Q gi|254780420|r 242 IMSLQPDGVLLSNGPGDPAVT--SVYSSPIICKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 242 i~~~~pdgi~lSnGPGdP~~~--~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a 293 (396) +..-++|+||+.+|=|-=.+. +.....++..+.+.+.||-.||=|--.|.-+ T Consensus 92 ~d~~~Ydavf~pGGHG~m~Dl~~~~~l~~ll~~~~~~gk~vaaVCHGpaaLl~~ 145 (232) T cd03148 92 ADDSEYAAVFIPGGHGALIGIPESQDVAAALQWAIKNDRFVITLCHGPAAFLAA 145 (232) T ss_pred CCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCHHHHHHCC T ss_conf 784222289967997530577579999999999998599799986747987402 No 254 >COG4977 Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain [Transcription] Probab=50.60 E-value=20 Score=16.68 Aligned_cols=57 Identities=16% Similarity=0.158 Sum_probs=35.0 Q ss_pred CCCHHHHHHCCCCEEEECCCCCCCCCCC-CCCHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 6574464413578899858888841011-2110122210257863478610058878752 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNGPGDPAVTS-VYSSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnGPGdP~~~~-~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) +..+++... +|-|+...|=++..... ......+++....+.+|-|||-|-=+||.|- T Consensus 68 d~~~~~~~~--~~~v~v~~g~~~~~~~~~~~l~~~Lr~~~~~G~~l~gictGaf~LA~aG 125 (328) T COG4977 68 DGGLEAAPP--IDILPVCGGLGPERPVNAPALLAWLRRAARRGARLGGLCTGAFVLAEAG 125 (328) T ss_pred CCCCCCCCC--CEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEHHHHHHHHHHC T ss_conf 876455676--3299980687754445649999999999846995888608699999711 No 255 >KOG1210 consensus Probab=50.09 E-value=12 Score=18.20 Aligned_cols=79 Identities=15% Similarity=0.161 Sum_probs=51.0 Q ss_pred HHHHHHHHCCCCEEE-ECCCC---HHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 232222202454133-04657---44644135788998588888410112110122210257863478610058878752 Q gi|254780420|r 219 LLRVLSSLGCRITIV-GAETS---YKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 219 Ilr~L~~~g~~v~Vv-p~~~~---~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) |-.++.++|++|+++ |.++. +++=...+|.-..+++|++++..++..+...++.+..++.-+.=--.|+-+-.+.. T Consensus 199 l~qE~i~~~v~Vt~~~P~~~~tpGfE~En~tkP~~t~ii~g~ss~~~~e~~a~~~~~~~~rg~f~~~~~~~g~l~s~~~~ 278 (331) T KOG1210 199 LRQELIKYGVHVTLYYPPDTLTPGFERENKTKPEETKIIEGGSSVIKCEEMAKAIVKGMKRGNFTVSLGFTGFLLSILSQ 278 (331) T ss_pred HHHHHHHCCEEEEEECCCCCCCCCCCCCCCCCCHHEEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECHHHHHHHHHHC T ss_conf 99987652669999728987897643102367421031007888768899999998677606758760469999987425 Q ss_pred CCC Q ss_conf 854 Q gi|254780420|r 295 GAQ 297 (396) Q Consensus 295 G~~ 297 (396) |.. T Consensus 279 ~~~ 281 (331) T KOG1210 279 GMS 281 (331) T ss_pred CCC T ss_conf 788 No 256 >COG4090 Uncharacterized protein conserved in archaea [Function unknown] Probab=49.17 E-value=19 Score=16.81 Aligned_cols=60 Identities=18% Similarity=0.253 Sum_probs=32.8 Q ss_pred EECCCCHHHHHHCC-CCEEEECCCCCCCCCCCCCCHHHHHHHCC--CCCCEEEEECHHHHHHHHCCC Q ss_conf 30465744644135-78899858888841011211012221025--786347861005887875285 Q gi|254780420|r 233 VGAETSYKDIMSLQ-PDGVLLSNGPGDPAVTSVYSSPIICKLVD--SGMPMFGICLGHQLLGLALGA 296 (396) Q Consensus 233 vp~~~~~~~i~~~~-pdgi~lSnGPGdP~~~~~~~~~~~~~~~~--~~iPilGIClGhQ~l~~a~G~ 296 (396) -++-...+|+..++ -|.|+|=+|-.=|.- +. ..+.++++++ +..|+.|+|+ |-|-.-.|. T Consensus 71 ~d~G~ql~e~e~~n~aDvvVLlGGLaMP~~-gv-~~d~~kel~ee~~~kkliGvCf--m~mF~ragW 133 (154) T COG4090 71 TDHGYQLGEREELNSADVVVLLGGLAMPKI-GV-TPDDAKELLEELGNKKLIGVCF--MNMFERAGW 133 (154) T ss_pred ECCCEECCCCCCCCCCCEEEEECCCCCCCC-CC-CHHHHHHHHHHCCCCCEEEEEH--HHHHHHCCC T ss_conf 124354277566664458999756446767-89-9799999998438986587518--989987485 No 257 >PRK10161 transcriptional regulator PhoB; Provisional Probab=48.54 E-value=21 Score=16.47 Aligned_cols=91 Identities=18% Similarity=0.188 Sum_probs=34.6 Q ss_pred EEEEEEC--CCCCHHHHHHHHHCCCCEEEECCCCH-HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHH-HCCCCCCEEE Q ss_conf 4999967--96410232222202454133046574-464413578899858888841011211012221-0257863478 Q gi|254780420|r 207 HVVCIDY--GIRSNLLRVLSSLGCRITIVGAETSY-KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICK-LVDSGMPMFG 282 (396) Q Consensus 207 kIvvIDf--GiK~nIlr~L~~~g~~v~Vvp~~~~~-~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~-~~~~~iPilG 282 (396) ||+++|= -+-..+...|...|++|.....-..+ +.+....||-|+|--+- |...+......+++ ....++||+= T Consensus 4 kILiVEDd~~l~~~l~~~L~~~g~~v~~a~~g~~a~~~l~~~~~DliilDi~l--P~~dG~~~~~~ir~~~~~~~~PII~ 81 (229) T PRK10161 4 RILVVEDEAPIREMVCFVLEQNGFQPVEAEDYDSAVNQLNEPWPDLILLDWML--PGGSGIQFIKHLKRESMTRDIPVVM 81 (229) T ss_pred CEEEEECCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCCEEEEECCC--CCCCCHHHHHHHHHCCCCCCCCEEE T ss_conf 19999599999999999999779999998999999999852899899997899--8876335878877502468975899 Q ss_pred EE----CHHHHHHHHCCCCCC Q ss_conf 61----005887875285446 Q gi|254780420|r 283 IC----LGHQLLGLALGAQTV 299 (396) Q Consensus 283 IC----lGhQ~l~~a~G~~~~ 299 (396) +. .-..+.|+.+||.-| T Consensus 82 lta~~~~~d~v~gl~~GADDY 102 (229) T PRK10161 82 LTARGEEEDRVRGLETGADDY 102 (229) T ss_pred EECCCCHHHHHHHHHCCCCEE T ss_conf 955666577999987698765 No 258 >PRK09605 O-sialoglycoprotein endopeptidase/protein kinase; Reviewed Probab=48.54 E-value=13 Score=17.91 Aligned_cols=12 Identities=17% Similarity=-0.022 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHH Q ss_conf 168999999999 Q gi|254780420|r 378 SYYLFSTFIDFM 389 (396) Q Consensus 378 ~~~~F~~F~~~i 389 (396) ...+|+.|++-- T Consensus 501 ~e~l~~~~l~gY 512 (536) T PRK09605 501 FEELIKAFEEGY 512 (536) T ss_pred HHHHHHHHHHHH T ss_conf 899999999999 No 259 >pfam08660 Alg14 Oligosaccharide biosynthesis protein Alg14 like. Alg14 is involved dolichol-linked oligosaccharide biosynthesis and anchors the catalytic subunit Alg13 to the ER membrane. Probab=48.33 E-value=10 Score=18.72 Aligned_cols=17 Identities=53% Similarity=0.901 Sum_probs=12.9 Q ss_pred HHHHHCCCCEEEECCCCC Q ss_conf 464413578899858888 Q gi|254780420|r 240 KDIMSLQPDGVLLSNGPG 257 (396) Q Consensus 240 ~~i~~~~pdgi~lSnGPG 257 (396) .-+.+.+|| +++||||| T Consensus 82 ~il~k~kPd-vii~tG~g 98 (166) T pfam08660 82 KLLRRERPD-VILCNGPG 98 (166) T ss_pred HHHHHHCCC-EEEECCCC T ss_conf 999853998-99977996 No 260 >PRK00421 murC UDP-N-acetylmuramate--L-alanine ligase; Provisional Probab=47.08 E-value=22 Score=16.31 Aligned_cols=31 Identities=16% Similarity=0.279 Sum_probs=20.6 Q ss_pred HHHHHHHCCCCEECCCC-----HHHHHHHHHHCCCC Q ss_conf 89999742783211257-----68999999853984 Q gi|254780420|r 111 FDAWLKSCGIIGLSGID-----TRALTVWIRDYGVS 141 (396) Q Consensus 111 L~~wL~~~~IpgI~GID-----TRaLTk~IRe~G~~ 141 (396) |.+|++...+.||+|=. |..|...+++.|-- T Consensus 101 l~~~~~~~~~IaVtGTnGKTTtT~ll~~il~~~g~~ 136 (459) T PRK00421 101 LAELMRLRTSIAVAGTHGKTTTTSLLAHVLAEAGLD 136 (459) T ss_pred HHHHHHCCCEEEEECCCCCCHHHHHHHHHHHHCCCC T ss_conf 999982596499977888611899999999976998 No 261 >pfam10281 Ish1 Putative stress-responsive nuclear envelope protein. This family of proteins found in fungi is a putative stress-responsive nuclear envelope protein Ish1. Probab=46.58 E-value=14 Score=17.77 Aligned_cols=32 Identities=28% Similarity=0.388 Sum_probs=24.1 Q ss_pred CCCCHHHHHHHCCCCEECCCCHH-HHHHHHHHC Q ss_conf 41798999974278321125768-999999853 Q gi|254780420|r 107 ANMHFDAWLKSCGIIGLSGIDTR-ALTVWIRDY 138 (396) Q Consensus 107 s~~sL~~wL~~~~IpgI~GIDTR-aLTk~IRe~ 138 (396) +...|.+||..+|||.=....|| .|....|++ T Consensus 5 s~~~Lk~wL~~~gi~~~~~~~~rd~Ll~~a~~~ 37 (38) T pfam10281 5 SDSDLKEWLDSHGIPVPQKAATRDELLALARKN 37 (38) T ss_pred CHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHC T ss_conf 488999999984997897136899999999861 No 262 >PRK06395 phosphoribosylamine--glycine ligase; Provisional Probab=46.50 E-value=23 Score=16.25 Aligned_cols=30 Identities=7% Similarity=-0.111 Sum_probs=19.6 Q ss_pred HHHHHHHCCCCE--ECC--CCHHHHHHHHHHCCC Q ss_conf 899997427832--112--576899999985398 Q gi|254780420|r 111 FDAWLKSCGIIG--LSG--IDTRALTVWIRDYGV 140 (396) Q Consensus 111 L~~wL~~~~Ipg--I~G--IDTRaLTk~IRe~G~ 140 (396) -.+||++++||+ =+. .|.....+.+.+.+. T Consensus 109 aK~fm~~~~IP~ta~~~~~~~~~ea~~~~~~~~~ 142 (435) T PRK06395 109 MRYLMERHNIPGNINFNACFSEKDAARDYITSMK 142 (435) T ss_pred HHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHCCC T ss_conf 9999998099856645323999999999997099 No 263 >COG1879 RbsB ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=46.20 E-value=22 Score=16.32 Aligned_cols=81 Identities=14% Similarity=0.228 Sum_probs=38.7 Q ss_pred CEEEECCCCCCCCCCCHH--HHCCCCCCCCCEEEEE--EEECCCCCCCCCCCCCCHHHHH-HHCCCCEECCCCHHHHHH- Q ss_conf 173216897368458823--4204653332102366--6404654434552417989999-742783211257689999- Q gi|254780420|r 60 QIVNFTFPHIGNVGVNSE--DFESISRKNFKGAVGL--VIKAEITEPSNYRANMHFDAWL-KSCGIIGLSGIDTRALTV- 133 (396) Q Consensus 60 QIlvfT~P~IGNyGvn~~--d~ES~~~~~~~~~~g~--Iv~~~~~~~Sn~rs~~sL~~wL-~~~~IpgI~GIDTRaLTk- 133 (396) -.+.|.++..+|.-.... -.|... . ..|. .+.......+.-.....+.+.+ +..+...|.-+|..+++. T Consensus 34 ~~i~~~~~~~~~~f~~~~~~g~~~~a--~---~~g~~~~~~~~~~~~d~~~Q~~~i~~~ia~~~daIiv~~~d~~a~~~~ 108 (322) T COG1879 34 KTIGVVVPTLGNPFFQAVRKGAEAAA--K---KLGVVVAVVIADAQNDVAKQIAQIEDLIAQGVDAIIINPVDPDALTPA 108 (322) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHH--H---HCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHH T ss_conf 87999947788818999999999999--8---649955897157888819999999999855998899928980758999 Q ss_pred --HHHHCCCCCEEE Q ss_conf --998539842366 Q gi|254780420|r 134 --WIRDYGVSNSVI 145 (396) Q Consensus 134 --~IRe~G~~~a~I 145 (396) +.++.|..--.+ T Consensus 109 v~~a~~aGIpVv~~ 122 (322) T COG1879 109 VKKAKAAGIPVVTV 122 (322) T ss_pred HHHHHHCCCEEEEE T ss_conf 99999779849996 No 264 >cd06267 PBP1_LacI_sugar_binding_like Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. In most cases, ligands are monosaccharide including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor. In this case, the domain sugar binding changes the DNA binding activity of the repressor domain. Probab=45.88 E-value=23 Score=16.19 Aligned_cols=12 Identities=8% Similarity=0.144 Sum_probs=4.7 Q ss_pred CCHHHHHHHCCC Q ss_conf 798999974278 Q gi|254780420|r 109 MHFDAWLKSCGI 120 (396) Q Consensus 109 ~sL~~wL~~~~I 120 (396) ..+.+.+++++. T Consensus 19 ~gi~~~a~~~gy 30 (264) T cd06267 19 RGIEEAAREAGY 30 (264) T ss_pred HHHHHHHHHCCC T ss_conf 999999998699 No 265 >TIGR01892 AcOrn-deacetyl acetylornithine deacetylase (ArgE); InterPro: IPR010169 This entry represents a clade of acetylornithine deacetylases (argE) from proteobacteria, which catalyse the final step in arginine biosynthesis. The enzyme is closely related to and may share a common origin with dapE (succinyl-diaminopimelate desuccinylase), cpg2 (carboxypeptidase G2) and ACY1 (aminoacylase I), all of which are members of the metal-dependent aminoacylase family, ArgE/dapE/ACY1/CPG2/yscS .; GO: 0008270 zinc ion binding, 0008777 acetylornithine deacetylase activity, 0050897 cobalt ion binding, 0006526 arginine biosynthetic process, 0005737 cytoplasm. Probab=45.77 E-value=20 Score=16.70 Aligned_cols=50 Identities=14% Similarity=0.106 Sum_probs=23.0 Q ss_pred CCCCCEEEEEECCCCEEEEEEECCCCEEEECCCCCCCC--CCCCHHHHHHHHH Q ss_conf 69982899998899808726867997789834878877--6700168999999 Q gi|254780420|r 336 LPAGLEETHISLFDSSNCGFRLIDQPVFSVQYHPESSP--GPQDSYYLFSTFI 386 (396) Q Consensus 336 lp~~~~v~~~~~nD~tiegi~~~~~~~~~vQfHPEa~P--GP~D~~~~F~~F~ 386 (396) +.+.|++-|-++|=|.|.|=.-++-=.-+-.||=|..| | .|-..|-..|- T Consensus 229 ~~~~F~ppy~tl~~G~v~GG~A~N~i~~~C~~~~d~R~~~G-~~p~~l~~~l~ 280 (386) T TIGR01892 229 LEEAFTPPYATLNIGVVKGGKAVNIIPAACEFVFDLRPIPG-MDPEELLALLE 280 (386) T ss_pred CCCCCCCCCCEEEEEEEECCCEEEEECCCCEEEEEECCCCC-CCHHHHHHHHH T ss_conf 77787598871442067468425550346789996068867-89899989999 No 266 >cd03144 GATase1_ScBLP_like Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Saccharomyces cerevisiae biotin-apoprotein ligase (ScBLP). Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Saccharomyces cerevisiae biotin-apoprotein ligase (ScBLP). Biotin-apoprotein ligase modifies proteins by covalently attaching biotin. ScBLP is known to biotinylate acety-CoA carboxylase and pyruvate carboxylase. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, the Cys residue found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow in a typical GATase1 domain is conserved. Probab=44.63 E-value=16 Score=17.43 Aligned_cols=65 Identities=17% Similarity=0.250 Sum_probs=35.4 Q ss_pred HHHHHHHHCCCCEEEECCCCHHHHHHC----CCCEEEECCCCCCCCCC--CCCCHHHHHHHCCCCCCEEEEECHHH Q ss_conf 232222202454133046574464413----57889985888884101--12110122210257863478610058 Q gi|254780420|r 219 LLRVLSSLGCRITIVGAETSYKDIMSL----QPDGVLLSNGPGDPAVT--SVYSSPIICKLVDSGMPMFGICLGHQ 288 (396) Q Consensus 219 Ilr~L~~~g~~v~Vvp~~~~~~~i~~~----~pdgi~lSnGPGdP~~~--~~~~~~~~~~~~~~~iPilGIClGhQ 288 (396) +++.|+.. . .|.+ .+.++|... +.+-+++-.|...|... .......++++.+.+=--||||.|-= T Consensus 18 ~~~~L~~~-~--av~~--v~~~~L~~~pw~~~~~llv~PGG~d~~y~~~l~~~g~~~i~~fV~~GG~ylG~CaGaY 88 (114) T cd03144 18 LAELLRLY-L--AVST--VTADELAVGPWESKTALLVVPGGADLPYCRALNGKGNRRIRNFVRNGGNYLGICAGAY 88 (114) T ss_pred HHHHHHHH-C--CEEE--CCHHHHHCCCCCCCEEEEEECCCCCCHHHHHHCCHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 99999874-3--6454--0699984495426735999448767678888453667999999975994799855310 No 267 >PRK12779 putative bifunctional glutamate synthase subunit beta/2-polyprenylphenol hydroxylase; Provisional Probab=44.62 E-value=24 Score=16.06 Aligned_cols=42 Identities=21% Similarity=0.365 Sum_probs=31.1 Q ss_pred HHHHHHCCCC---EEEECCCCHHHHHHCCCCEEEECCCCCCCCCC Q ss_conf 2222202454---13304657446441357889985888884101 Q gi|254780420|r 221 RVLSSLGCRI---TIVGAETSYKDIMSLQPDGVLLSNGPGDPAVT 262 (396) Q Consensus 221 r~L~~~g~~v---~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~ 262 (396) ..|.+.|+++ .++--+.+++++.+..+|+|||.-|-|.|... T Consensus 364 ~~l~~lGV~f~~n~~VGk~~tl~eL~~eGydAVfIg~GAg~p~~l 408 (944) T PRK12779 364 EKIKLLGGRFVKNFVVGKTATLEDLKAEGFWKIFVGTGAGLPTFM 408 (944) T ss_pred HHHHHCCEEEEECCEECCCCCHHHHHHCCCCEEEEEECCCCCCCC T ss_conf 999967839997856477688999976899989997478887568 No 268 >PRK10576 iron-hydroxamate transporter substrate-binding subunit; Provisional Probab=44.29 E-value=25 Score=16.03 Aligned_cols=48 Identities=15% Similarity=0.223 Sum_probs=31.3 Q ss_pred CCEEEEEECCCCCHHHHHHHHHCCCCEE---------------EECC---------CCHHHHHHCCCCEEEECCCC Q ss_conf 7149999679641023222220245413---------------3046---------57446441357889985888 Q gi|254780420|r 205 KYHVVCIDYGIRSNLLRVLSSLGCRITI---------------VGAE---------TSYKDIMSLQPDGVLLSNGP 256 (396) Q Consensus 205 ~~kIvvIDfGiK~nIlr~L~~~g~~v~V---------------vp~~---------~~~~~i~~~~pdgi~lSnGP 256 (396) ..|||++|++ +...|..+|+..+= +|.. -++|.|..++||-|+.|+.= T Consensus 32 P~RVv~LdW~----~aE~llaLGv~Pvgvad~~~Y~~Wv~~P~lp~~V~DVGtR~ePNLE~Ia~LkPDLIi~s~~~ 103 (292) T PRK10576 32 PNRIVALEWL----PVELLLALGITPYGVADTHNYRLWVSEPPLPDSVIDVGLRTEPNLELLTEMKPSFMVWSAGY 103 (292) T ss_pred CCEEEEECHH----HHHHHHHCCCCCCEEECCCCCHHHCCCCCCCCCCCCCCCCCCCCHHHHHHCCCCEEEECCCC T ss_conf 9869994419----89999977998524433100223226889998765676888989999974699889946765 No 269 >PRK04020 rps2P 30S ribosomal protein S2; Provisional Probab=44.03 E-value=21 Score=16.53 Aligned_cols=33 Identities=21% Similarity=0.631 Sum_probs=23.4 Q ss_pred CCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEC Q ss_conf 35788998588888410112110122210257863478610 Q gi|254780420|r 245 LQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICL 285 (396) Q Consensus 245 ~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGICl 285 (396) ..||.+|+. ||..- ...+++....+||+.|||= T Consensus 113 ~ePdlliV~----Dp~~e----~~AV~EA~~l~IPvIaivD 145 (204) T PRK04020 113 IEPDVLVVT----DPRGD----AQAVKEAIEIGIPVVALCD 145 (204) T ss_pred CCCCEEEEE----CCCCC----HHHHHHHHHCCCCEEEEEC T ss_conf 689989997----88630----4899999876999899827 No 270 >COG0521 MoaB Molybdopterin biosynthesis enzymes [Coenzyme metabolism] Probab=43.70 E-value=19 Score=16.79 Aligned_cols=72 Identities=26% Similarity=0.353 Sum_probs=43.6 Q ss_pred HHHHHHHHCCCC---EEEECCCC--HHHHHH--C-CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHH Q ss_conf 232222202454---13304657--446441--3-578899858888841011211012221025786347861005887 Q gi|254780420|r 219 LLRVLSSLGCRI---TIVGAETS--YKDIMS--L-QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLL 290 (396) Q Consensus 219 Ilr~L~~~g~~v---~Vvp~~~~--~~~i~~--~-~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l 290 (396) |...|.+.|.++ .++|-+.. ..++.+ . ..|.|+.++|-| +...+. -.+.+..++++.+|=||- =+..+ T Consensus 32 l~~~L~~ag~~~~~~~iV~D~~~~I~~~l~~~~~~~~DvvlttGGTG-~t~RDv-TpEA~~~l~dKeipGFgE--~fR~~ 107 (169) T COG0521 32 LVELLEEAGHNVAAYTIVPDDKEQIRATLIALIDEDVDVVLTTGGTG-ITPRDV-TPEATRPLFDKEIPGFGE--LFRRL 107 (169) T ss_pred HHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHHCCCCCEEEECCCCC-CCCCCC-CHHHHHHHHHCCCCCHHH--HHHHH T ss_conf 99999975976305999469789999999998648877999759836-898867-889999988531875999--99985 Q ss_pred HHHC Q ss_conf 8752 Q gi|254780420|r 291 GLAL 294 (396) Q Consensus 291 ~~a~ 294 (396) ++.. T Consensus 108 S~~~ 111 (169) T COG0521 108 SLEE 111 (169) T ss_pred HHHC T ss_conf 4431 No 271 >PRK09468 ompR osmolarity response regulator; Provisional Probab=43.66 E-value=25 Score=15.96 Aligned_cols=48 Identities=17% Similarity=0.322 Sum_probs=18.4 Q ss_pred CEEEEEE--CCCCCHHHHHHHHHCCCCEEEECCCCHHHHH-HCCCCEEEEC Q ss_conf 1499996--7964102322222024541330465744644-1357889985 Q gi|254780420|r 206 YHVVCID--YGIRSNLLRVLSSLGCRITIVGAETSYKDIM-SLQPDGVLLS 253 (396) Q Consensus 206 ~kIvvID--fGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~-~~~pdgi~lS 253 (396) +||+++| --+-..+-..|...|++|..+..-..+.+.+ ...||-|+|- T Consensus 6 ~kILiVEDd~~~~~~l~~~L~~~g~~v~~a~~~~~a~~~~~~~~~DlvilD 56 (239) T PRK09468 6 YKILVVDDDMRLRALLERYLTEQGFQVRSAANAEQMDRLLTRESFHLMVLD 56 (239) T ss_pred CEEEEEECCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHCCCCEEEEC T ss_conf 789999099999999999999889999998999999999975899899987 No 272 >cd06311 PBP1_ABC_sugar_binding_like_3 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=43.20 E-value=25 Score=15.91 Aligned_cols=47 Identities=11% Similarity=0.200 Sum_probs=21.8 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHC-CCCEECCCCHHHHHHHH---HHCCCC Q ss_conf 66404654434552417989999742-78321125768999999---853984 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSC-GIIGLSGIDTRALTVWI---RDYGVS 141 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~-~IpgI~GIDTRaLTk~I---Re~G~~ 141 (396) +++.+.-..+.. -...+++++.+. +...+.-+|+.+++..+ ++.|.. T Consensus 37 ~~~~da~~D~~~--Q~~~ie~lI~qgvDaIiv~p~d~~a~~~~v~~a~~aGIp 87 (274) T cd06311 37 FILVTASNDTEQ--QNAQQDLLINRKIDALVILPFESAPLTQPVAKAKKAGIF 87 (274) T ss_pred EEEECCCCCHHH--HHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCE T ss_conf 999718999999--999999999749999999579778889999999986997 No 273 >COG1004 Ugd Predicted UDP-glucose 6-dehydrogenase [Cell envelope biogenesis, outer membrane] Probab=43.07 E-value=26 Score=15.90 Aligned_cols=37 Identities=11% Similarity=0.242 Sum_probs=17.3 Q ss_pred CCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 3578899858888841011211012221025786347861 Q gi|254780420|r 245 LQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 245 ~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) ++||-|++. --+.. ......++...++..+.|+|=-- T Consensus 162 ~~PdRIViG--~~~~~-a~~~~~ely~~~~~~~~p~l~t~ 198 (414) T COG1004 162 LYPDRIVIG--VRSER-AAAVLRELYAPFLRQDVPILFTD 198 (414) T ss_pred CCCCEEEEC--CCCHH-HHHHHHHHHHHHHHCCCCEEEEC T ss_conf 589838992--57716-89999999753420499889956 No 274 >KOG2278 consensus Probab=43.07 E-value=18 Score=16.98 Aligned_cols=49 Identities=20% Similarity=0.302 Sum_probs=26.4 Q ss_pred HHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCC--CCCHHHCCC Q ss_conf 689999998539842366327654589789888774158874--321011013 Q gi|254780420|r 128 TRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLK--GAELAKHVT 178 (396) Q Consensus 128 TRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~--~~dL~~~Vs 178 (396) .|||++.||..-+-+..-+. .|+-..++++++ +..+...+ -.|+-+.|. T Consensus 11 skaLs~iLRHtA~~~~L~m~-~dGfvpv~~lL~-lnq~r~~~~t~ddi~riVk 61 (207) T KOG2278 11 SKALSRILRHTATELRLNMR-GDGFVPVEDLLN-LNQFRGANHTIDDIRRIVK 61 (207) T ss_pred HHHHHHHHHHHHHHHCCCCC-CCCCEEHHHHHC-CCHHCCCCCCHHHHHHHHH T ss_conf 99999999988887440115-888467899754-6311246775999999983 No 275 >PRK08745 ribulose-phosphate 3-epimerase; Provisional Probab=42.86 E-value=26 Score=15.88 Aligned_cols=26 Identities=19% Similarity=0.459 Sum_probs=18.4 Q ss_pred EEEECCCCCHHHHHHHHHCCCCEEEE Q ss_conf 99967964102322222024541330 Q gi|254780420|r 209 VCIDYGIRSNLLRVLSSLGCRITIVG 234 (396) Q Consensus 209 vvIDfGiK~nIlr~L~~~g~~v~Vvp 234 (396) ..+|=|++...++.|.+.|+++.|.- T Consensus 174 I~VDGGI~~~ti~~l~~aGad~~V~G 199 (223) T PRK08745 174 LEIDGGVKADNIGAIAAAGADTFVAG 199 (223) T ss_pred EEEECCCCHHHHHHHHHCCCCEEEEC T ss_conf 99978879899999998699999974 No 276 >PRK00561 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=42.70 E-value=21 Score=16.55 Aligned_cols=18 Identities=17% Similarity=0.257 Sum_probs=11.9 Q ss_pred HHHHHCCCCEEEECCCCC Q ss_conf 464413578899858888 Q gi|254780420|r 240 KDIMSLQPDGVLLSNGPG 257 (396) Q Consensus 240 ~~i~~~~pdgi~lSnGPG 257 (396) +.+..+.-||+++|-=-| T Consensus 130 ~~~~~~r~DGlIvSTPTG 147 (259) T PRK00561 130 EFWEKYRGSGLLIGPRTG 147 (259) T ss_pred EEEEEEECCEEEEECCCC T ss_conf 997888537899968986 No 277 >cd04730 NPD_like 2-Nitropropane dioxygenase (NPD), one of the nitroalkane oxidizing enzyme families, catalyzes oxidative denitrification of nitroalkanes to their corresponding carbonyl compounds and nitrites. NDP is a member of the NAD(P)H-dependent flavin oxidoreductase family that reduce a range of alternative electron acceptors. Most use FAD/FMN as a cofactor and NAD(P)H as electron donor. Some contain 4Fe-4S cluster to transfer electron from FAD to FMN. Probab=42.61 E-value=26 Score=15.85 Aligned_cols=158 Identities=21% Similarity=0.342 Sum_probs=82.9 Q ss_pred EECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCC----CCCHHHCCCCCCCEECCCCCCCCCCCCC Q ss_conf 211257689999998539842366327654589789888774158874----3210110133332111686534454344 Q gi|254780420|r 122 GLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLK----GAELAKHVTVSQRRDWSEKIWKWGEETS 197 (396) Q Consensus 122 gI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~----~~dL~~~Vs~~~~~~~~~~~~~~~~~~~ 197 (396) ++.++-+-.|+.-+-+.| ..|.|... ..+.+++.+.+++..... +.||....+. ..+.+ . T Consensus 9 ~M~~vs~~~LaaAvs~aG-glG~l~~~---~~~~~~l~~~i~~~~~~~~~pfgvnl~~~~~~---~~~~~---------~ 72 (236) T cd04730 9 PMAGVSTPELAAAVSNAG-GLGFIGAG---YLTPEALRAEIRKIRALTDKPFGVNLLVPSSN---PDFEA---------L 72 (236) T ss_pred CCCCCCCHHHHHHHHHCC-CEEEECCC---CCCHHHHHHHHHHHHHHCCCCEEECCCCCCCC---CCHHH---------H T ss_conf 877878699999999689-85585788---89999999999999974699724433246776---36899---------9 Q ss_pred CCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCH---HHHHHCCCCEEEECCCC---CCCCCC---CCCCHH Q ss_conf 346777771499996796410232222202454133046574---46441357889985888---884101---121101 Q gi|254780420|r 198 FLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSY---KDIMSLQPDGVLLSNGP---GDPAVT---SVYSSP 268 (396) Q Consensus 198 ~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~---~~i~~~~pdgi~lSnGP---GdP~~~---~~~~~~ 268 (396) ...--+.+.+++.+=+|.-..+++.|.+.|..| +.--++. ....+..+|+|+.- |+ |+.... .....+ T Consensus 73 ~~~~~~~~v~~v~~~~g~p~~~v~~l~~~g~~v--~~~v~s~~~A~~a~~~GaD~iv~q-G~eAGGH~g~~~~~~~~lv~ 149 (236) T cd04730 73 LEVALEEGVPVVSFSFGPPAEVVERLKAAGIKV--IPTVTSVEEARKAEAAGADALVAQ-GAEAGGHRGTFDIGTFALVP 149 (236) T ss_pred HHHHHHCCCCEEEECCCCCHHHHHHHHHCCCEE--EEECCCHHHHHHHHHCCCCEEEEE-CCCCCCCCCCCCCCHHHHHH T ss_conf 999997699999987989789999999829989--995898999999998189989997-77777788987555677999 Q ss_pred HHHHHCCCCCCEE---EEECHHHHHH-HHCCCCCCC Q ss_conf 2221025786347---8610058878-752854466 Q gi|254780420|r 269 IICKLVDSGMPMF---GICLGHQLLG-LALGAQTVK 300 (396) Q Consensus 269 ~~~~~~~~~iPil---GIClGhQ~l~-~a~G~~~~k 300 (396) .+++.. ++||+ ||.=|.++.+ +++|+.-.- T Consensus 150 ~v~~~~--~ipviaAGGI~~g~~i~aal~lGA~gV~ 183 (236) T cd04730 150 EVRDAV--DIPVIAAGGIADGRGIAAALALGADGVQ 183 (236) T ss_pred HHHHHH--CCCEEEECCCCCHHHHHHHHHHCCCEEE T ss_conf 999982--9868965462778999999980897999 No 278 >KOG1834 consensus Probab=42.07 E-value=17 Score=17.11 Aligned_cols=44 Identities=16% Similarity=0.178 Sum_probs=27.2 Q ss_pred CCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHH Q ss_conf 34677777149999679641023222220245413304657446 Q gi|254780420|r 198 FLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKD 241 (396) Q Consensus 198 ~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~ 241 (396) +.+...-++.|.++|||.|..--..|+...++-+-.|-...-++ T Consensus 212 y~ke~~Y~ltVtAyDCg~kraa~d~lV~v~Vkp~C~pgwq~wnk 255 (952) T KOG1834 212 YTKEHQYKLTVTAYDCGKKRAASDSLVTVHVKPTCKPGWQGWNK 255 (952) T ss_pred CCCCEEEEEEEEEEECCCCCCCCCCEEEEEECCCCCCCHHHHHH T ss_conf 66330589999997134665567626999716765633333443 No 279 >PRK10416 cell division protein FtsY; Provisional Probab=42.00 E-value=27 Score=15.79 Aligned_cols=60 Identities=13% Similarity=0.181 Sum_probs=33.6 Q ss_pred HHHHHHHHCCCCEEEECCC-CH---HHHHH--CCCCEEEECCCCC-CCCCCCCCCHHHHHHHCCCCCCEE Q ss_conf 2322222024541330465-74---46441--3578899858888-841011211012221025786347 Q gi|254780420|r 219 LLRVLSSLGCRITIVGAET-SY---KDIMS--LQPDGVLLSNGPG-DPAVTSVYSSPIICKLVDSGMPMF 281 (396) Q Consensus 219 Ilr~L~~~g~~v~Vvp~~~-~~---~~i~~--~~pdgi~lSnGPG-dP~~~~~~~~~~~~~~~~~~iPil 281 (396) +.++|.+.|.+|.+.-.|| .+ +.+.- .+-+-=+++...| ||+.+- ...+......++-++ T Consensus 315 LA~~~~~~gkkVllaA~DTfRaAAieQL~~w~~r~~v~vi~~~~g~Dpa~V~---~dai~~a~~~~~Dvv 381 (499) T PRK10416 315 LARQFEQQGKSVMLAAGDTFRAAAVEQLQVWGQRNNIPVIAQHTGADSASVI---FDAIQAAKARNVDVL 381 (499) T ss_pred HHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCCCCCEEEECCCCCCHHHHH---HHHHHHHHHCCCCEE T ss_conf 9999997799537884066756899999998424573698368999979999---999999997299989 No 280 >PRK05299 rpsB 30S ribosomal protein S2; Provisional Probab=41.91 E-value=25 Score=15.99 Aligned_cols=31 Identities=23% Similarity=0.456 Sum_probs=22.3 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 578899858888841011211012221025786347861 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) -||.||+. ||.. ....++++...+||+.||| T Consensus 157 lPd~viv~----d~~~----e~~AV~EA~kl~IPvI~iv 187 (255) T PRK05299 157 LPDALFVV----DPNK----EHIAVKEARKLGIPVVAIV 187 (255) T ss_pred CCCEEEEE----CCCC----CHHHHHHHHHCCCCEEEEE T ss_conf 99879984----7853----3799999997599888762 No 281 >cd06296 PBP1_CatR_like Ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation. This group includes the ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation. This group belongs to the the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=41.82 E-value=27 Score=15.77 Aligned_cols=28 Identities=14% Similarity=0.029 Sum_probs=11.0 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCC Q ss_conf 66640465443455241798999974278 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGI 120 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~I 120 (396) |+||.+...... -+-...+.+.++++|. T Consensus 3 Gvivp~i~npf~-~~l~~gi~~~~~~~gy 30 (270) T cd06296 3 GLVFPDLDSPWA-SEVLRGVEEAAAAAGY 30 (270) T ss_pred EEEECCCCCHHH-HHHHHHHHHHHHHCCC T ss_conf 999799853999-9999999999998499 No 282 >cd00886 MogA_MoaB MogA_MoaB family. Members of this family are involved in biosynthesis of the molybdenum cofactor (MoCF) an essential cofactor of a diverse group of redox enzymes. MoCF biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea, and eukaryotes. MoCF contains a tricyclic pyranopterin, termed molybdopterin (MPT). MogA, together with MoeA, is responsible for the metal incorporation into MPT, the third step in MoCF biosynthesis. The plant homolog Cnx1 is a MoeA-MogA fusion protein. The mammalian homolog gephyrin is a MogA-MoeA fusion protein, that plays a critical role in postsynaptic anchoring of inhibitory glycine receptors and major GABAa receptor subtypes. In contrast, MoaB shows high similarity to MogA, but little is known about its physiological role. All well studied members of this family form highly stable trimers. Probab=41.11 E-value=21 Score=16.58 Aligned_cols=70 Identities=23% Similarity=0.374 Sum_probs=42.3 Q ss_pred HHHHHHHHCCCC---EEEECCC-CH-HHHHH----CCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHH Q ss_conf 232222202454---1330465-74-46441----357889985888884101121101222102578634786100588 Q gi|254780420|r 219 LLRVLSSLGCRI---TIVGAET-SY-KDIMS----LQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQL 289 (396) Q Consensus 219 Ilr~L~~~g~~v---~Vvp~~~-~~-~~i~~----~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~ 289 (396) +.+.|.+.|++| .++|-+. .. +.+.+ .++|-|+.++|-| +...+ ...+.++.+++..+|= +|.++ T Consensus 25 l~~~l~~~G~~v~~~~ivpDd~~~I~~~l~~~~~~~~~DlIittGGTG-~~~rD-~TpEA~~~v~~k~~pG----~~e~~ 98 (152) T cd00886 25 LVELLEEAGHEVVAYEIVPDDKDEIREALIEWADEDGVDLILTTGGTG-LAPRD-VTPEATRPLLDKELPG----FGEAF 98 (152) T ss_pred HHHHHHHCCCEEEEEEEECCCHHHHHHHHHHHHHCCCCCEEEECCCCC-CCCCC-CCHHHHHHHHHHCCCC----HHHHH T ss_conf 999998749978899995999899999999998536988899668555-89988-8899999988516965----99999 Q ss_pred HHHHC Q ss_conf 78752 Q gi|254780420|r 290 LGLAL 294 (396) Q Consensus 290 l~~a~ 294 (396) ....+ T Consensus 99 r~~s~ 103 (152) T cd00886 99 RALSL 103 (152) T ss_pred HHHHH T ss_conf 99875 No 283 >TIGR01011 rpsB_bact ribosomal protein S2; InterPro: IPR005706 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the bacterial, mitochondrial and chloroplast forms of ribosomal protein S2.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=41.00 E-value=27 Score=15.78 Aligned_cols=31 Identities=23% Similarity=0.401 Sum_probs=21.9 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 578899858888841011211012221025786347861 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) -||.||+ =||.. ....+.++-.-+|||++|| T Consensus 157 lPd~l~v----iD~~~----E~IAv~EA~kLgIPvvA~v 187 (227) T TIGR01011 157 LPDLLFV----IDPRK----EKIAVAEARKLGIPVVAIV 187 (227) T ss_pred CCCEEEE----ECCCC----CHHHHHHHHHCCCCEEEEE T ss_conf 6970899----74821----0689999733799789870 No 284 >PRK10610 chemotaxis regulatory protein CheY; Provisional Probab=40.56 E-value=28 Score=15.64 Aligned_cols=51 Identities=14% Similarity=0.147 Sum_probs=23.1 Q ss_pred CCCCEEEEEECC--CCCHHHHHHHHHCCC-CEEEECCC-CHHHHHHCCCCEEEEC Q ss_conf 777149999679--641023222220245-41330465-7446441357889985 Q gi|254780420|r 203 DAKYHVVCIDYG--IRSNLLRVLSSLGCR-ITIVGAET-SYKDIMSLQPDGVLLS 253 (396) Q Consensus 203 ~~~~kIvvIDfG--iK~nIlr~L~~~g~~-v~Vvp~~~-~~~~i~~~~pdgi~lS 253 (396) +++.||+++|== +-.-+-+.|.+.|+. |.....-. -.+.+.+..||-|++- T Consensus 3 ~k~lrILivDD~~~~r~~l~~~L~~~g~~~v~~a~~g~~al~~~~~~~~Dlil~D 57 (129) T PRK10610 3 DKELKFLVVDDFSTMRRIVRNLLKELGFNNVEEAEDGVDALNKLQAGGFGFVISD 57 (129) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCCEEEEE T ss_conf 9998999997989999999999998699789998999999999985899999981 No 285 >KOG4842 consensus Probab=40.38 E-value=20 Score=16.71 Aligned_cols=22 Identities=9% Similarity=-0.061 Sum_probs=8.8 Q ss_pred CCCCEECCCCHHHHHHHHHHCC Q ss_conf 2783211257689999998539 Q gi|254780420|r 118 CGIIGLSGIDTRALTVWIRDYG 139 (396) Q Consensus 118 ~~IpgI~GIDTRaLTk~IRe~G 139 (396) +++.+++-.--|-|...+=+.+ T Consensus 33 ~K~~~vss~~~kll~~~llk~i 54 (278) T KOG4842 33 QKVAVVSSKPNKLLALNLLKEI 54 (278) T ss_pred CEEEEECCCHHHHHHHHHHHHH T ss_conf 1145531501787755465544 No 286 >PRK09722 allulose-6-phosphate 3-epimerase; Provisional Probab=39.99 E-value=28 Score=15.58 Aligned_cols=93 Identities=17% Similarity=0.218 Sum_probs=48.5 Q ss_pred CHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHC---------CCCCCCCHHHCCCCCCCEECCCCCCCCCCCCC Q ss_conf 768999999853984236632765458978988877415---------88743210110133332111686534454344 Q gi|254780420|r 127 DTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNW---------SGLKGAELAKHVTVSQRRDWSEKIWKWGEETS 197 (396) Q Consensus 127 DTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~---------~~~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~ 197 (396) +..++.+.||+.| +++.|+-.++. +++.+..-+... |+..|+...+.+--| .... .. T Consensus 94 ~~~~~i~~Ik~~g-~k~GlAlnP~T--pi~~i~~~l~~vD~VLvMsV~PGf~GQ~Fi~~~l~K-I~~l----------r~ 159 (227) T PRK09722 94 QAFRLIDEIRRAG-MKVGLVLNPET--PVEAIKYYIHLADKVTVMTVDPGFAGQPFIPEMLDK-IAEL----------KA 159 (227) T ss_pred CHHHHHHHHHHCC-CCEEEEECCCC--CHHHHHHHHHHCCEEEEEEECCCCCCCCCCHHHHHH-HHHH----------HH T ss_conf 5999999999869-97223338999--866887667437989999888999876566889999-9999----------99 Q ss_pred CCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEE Q ss_conf 3467777714999967964102322222024541330 Q gi|254780420|r 198 FLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVG 234 (396) Q Consensus 198 ~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp 234 (396) ..+..+... ..-+|=|++..-+..|.+.|+++.|+- T Consensus 160 ~~~~~~~~~-~I~VDGGI~~~~i~~~~~aGAd~~V~G 195 (227) T PRK09722 160 WREREGLEY-EIEVDGSCNQKTYEKLMAAGADVFIVG 195 (227) T ss_pred HHHHCCCCC-EEEEECCCCHHHHHHHHHCCCCEEEEC T ss_conf 998259982-699989888999999998699999977 No 287 >TIGR01686 FkbH FkbH domain; InterPro: IPR010037 This entry describes a domain of unknown function. One of proteins with this domain is a modular polyketide synthase, which is 4800 amino acids in length from Streptomyces avermilitis, where the domain is the C-terminal segment. By contrast, the FkbH protein from Streptomyces hygroscopicus (Q84G18 from SWISSPROT) apparently contains only this domain. The remaining members of the family all contain an additional N-terminal domain of between 200 and 275 amino acids, which show less than 20 0dentity to one another. It seems likely then that these proteins are involved in disparate functions, probably the biosynthesis of different natural products. For instance, the FkbH gene is found in a gene cluster believed to be responsible for the biosynthesis of unusual 'PKS extender units' in the ascomycin pathway . This domain is composed of two parts, the first of which is a member of subfamily IIIC of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases. All of the characterised enzymes within subfamily III are phosphatases, and all of the active site residues characteristic of HAD-superfamily phosphatases are present in this domain. The C-terminal portion of this domain is unique to this family (by BLAST).. Probab=39.82 E-value=14 Score=17.81 Aligned_cols=108 Identities=20% Similarity=0.190 Sum_probs=52.9 Q ss_pred HHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCC-CCCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 899999985398423663276545897898887741588-7432101101333321116865344543443467777714 Q gi|254780420|r 129 RALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSG-LKGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYH 207 (396) Q Consensus 129 RaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~-~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~k 207 (396) |.+...++..|...|+.+.. |-+.+++.+..-++ +-..+.-... ...|.++....-+-....+.+ .-. T Consensus 42 ~~~i~~l~~~G~LlA~aSkN-----d~~~~k~~l~~~~d~l~~~e~f~~~----~~~W~PKs~~~~~~A~~LNl~--~~s 110 (337) T TIGR01686 42 QEKIVTLKKQGVLLALASKN-----DEDDVKEVLERRKDFLLKAEDFSLL----SIAWGPKSESLRKIAKKLNLG--TDS 110 (337) T ss_pred HHHHHHHHHCCCEEEEEECC-----CHHHHHHHHHCCCCCCCCHHHHHHH----HHCCCCCCHHHHHHHHHHCCC--CCH T ss_conf 99998898778414220028-----7689999840383310013246778----643799716899999984885--231 Q ss_pred EEEEECCCCCHHH-HHHHHH-CCCCEEEECCCCHHHHHHCCCCEEEECCCC Q ss_conf 9999679641023-222220-245413304657446441357889985888 Q gi|254780420|r 208 VVCIDYGIRSNLL-RVLSSL-GCRITIVGAETSYKDIMSLQPDGVLLSNGP 256 (396) Q Consensus 208 IvvIDfGiK~nIl-r~L~~~-g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGP 256 (396) +++||= |++ +..++. +=+|+|++..-+..+. +.+.++.+| T Consensus 111 ~~F~DD----~p~E~~~~~~~LPEV~v~~~~~d~~~~-----~~~l~~~~P 152 (337) T TIGR01686 111 FLFIDD----NPAERAEVKIELPEVLVVTLECDPAEL-----AAILLLLNP 152 (337) T ss_pred HEEECC----CHHHHHHHHHHCCCEEEEECCCCHHHH-----HHHHCCCCC T ss_conf 100078----875487687518946898437883789-----997367871 No 288 >PRK08883 ribulose-phosphate 3-epimerase; Provisional Probab=39.60 E-value=29 Score=15.54 Aligned_cols=94 Identities=17% Similarity=0.287 Sum_probs=46.3 Q ss_pred CCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHC---------CCCCCCCHHHCCCCCCCEECCCCCCCCCCCC Q ss_conf 5768999999853984236632765458978988877415---------8874321011013333211168653445434 Q gi|254780420|r 126 IDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNW---------SGLKGAELAKHVTVSQRRDWSEKIWKWGEET 196 (396) Q Consensus 126 IDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~---------~~~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~ 196 (396) -+..++.+.||+.|. ++.|+-.++. +++.+..-+... |+..|+.+...+.-+-. .-. T Consensus 93 ~~~~~~i~~Ik~~g~-k~GlalnP~T--~~~~l~~~l~~~D~VLvMtV~PGf~GQ~f~~~~l~Ki~-----------~l~ 158 (220) T PRK08883 93 EHVDRTLQLIKEHGC-QAGVVLNPAT--PLAHLEYIMDKVDLILLMSVNPGFGGQSFIPHTLDKLR-----------AVR 158 (220) T ss_pred CCHHHHHHHHHHCCC-CEEEEECCCC--CHHHHHHHHHHCCEEEEEEECCCCCCCCCCHHHHHHHH-----------HHH T ss_conf 499999999998599-6688847999--87999999974697999874589887545577999999-----------999 Q ss_pred CCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEE Q ss_conf 43467777714999967964102322222024541330 Q gi|254780420|r 197 SFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVG 234 (396) Q Consensus 197 ~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp 234 (396) ......+.... ..+|=|++..-++.|.+.|+++.|.- T Consensus 159 ~~~~~~~~~~~-I~VDGGI~~~ti~~l~~aGad~~V~G 195 (220) T PRK08883 159 KMIDASGRDIR-LEIDGGVKVDNIREIAEAGADMFVAG 195 (220) T ss_pred HHHHHCCCCCE-EEEECCCCHHHHHHHHHCCCCEEEEC T ss_conf 98874499807-99989878999999998799999968 No 289 >pfam02992 Transposase_21 Transposase family tnp2. Probab=39.51 E-value=16 Score=17.36 Aligned_cols=45 Identities=24% Similarity=0.358 Sum_probs=30.2 Q ss_pred CCCEEEECCCCHHHHHHCCCCEEEEC---CCCCCC-CCCCCCCHHHHHHHC Q ss_conf 45413304657446441357889985---888884-101121101222102 Q gi|254780420|r 228 CRITIVGAETSYKDIMSLQPDGVLLS---NGPGDP-AVTSVYSSPIICKLV 274 (396) Q Consensus 228 ~~v~Vvp~~~~~~~i~~~~pdgi~lS---nGPGdP-~~~~~~~~~~~~~~~ 274 (396) .-|.++|||.+...-++ ..-|+|| +||-.| .+.+.+..+++.++. T Consensus 64 WPV~l~~YNLPp~~cmk--~~~~~lsllIpGP~~Pg~diDVyl~PLieeL~ 112 (226) T pfam02992 64 WPVILVPYNLPPWLCMK--RENIFLSLLIPGPKQPGNDIDVFLQPLIEELK 112 (226) T ss_pred CEEEEEECCCCHHHHHH--HHHHHEEEECCCCCCCCCCCCHHHHHHHHHHH T ss_conf 41688505698788631--34432024133788888663236788999999 No 290 >TIGR02320 PEP_mutase phosphoenolpyruvate phosphomutase; InterPro: IPR012698 Phosphonates are a class of organophosphorus compounds, characterised by a stable C-P bond, which are found in a variety of biologically produced molecules including antiobiotics, lipids, proteins and polysaccharides . The functions of these molecules include phosphorus storage, cell communication, host recognition and chemical warfare. Phosphoenolpyruvate (PEP) phosphomutase catalyses the reversible conversion of PEP to 3-phosphonopyruvate, forming a stable C-P bond, which is the entry point to all known phosphonate biosynthetic pathways . This entry consists of characterised and predicted PEP phosphomutases found in bacterial and eukayotic species. A closely related enzyme, phosphonopyruvate hydrolase from Variovorax sp. Pal2, is excluded from this entry, and the functional identification of some more distantly related sequences, such as from Bacteroides fragilis, Treponema denticola, and Clostridium tetani E88 is unknown. PEP phosphomutase forms a modified TIM barrel fold where the eighth alpha helix adopts a different conformation than in the classical TIM barrel fold . The substrate binds in the central channel of the barrel and is anchored to the active site by the Mg(2+) cofactor . In the absence of substrate the active site is acessible to the solvent, while substrate-binding causes a conformational change where a large loop shields the site from solvent . This shielding appears to be required for catalysis to occur.; GO: 0050188 phosphoenolpyruvate mutase activity. Probab=39.34 E-value=27 Score=15.77 Aligned_cols=57 Identities=23% Similarity=0.259 Sum_probs=42.7 Q ss_pred CEEEEEEEECC-----CCCCCCCCCCCCHHHHHHHC--CCCEECCCCH--------HHHHHHHHHCCCCCEEE Q ss_conf 10236664046-----54434552417989999742--7832112576--------89999998539842366 Q gi|254780420|r 88 KGAVGLVIKAE-----ITEPSNYRANMHFDAWLKSC--GIIGLSGIDT--------RALTVWIRDYGVSNSVI 145 (396) Q Consensus 88 ~~~~g~Iv~~~-----~~~~Sn~rs~~sL~~wL~~~--~IpgI~GIDT--------RaLTk~IRe~G~~~a~I 145 (396) .|++||=|... +.+ ..|.---++-++|.+- |||-+-+=|| |.|+|++|++|+--.|+ T Consensus 34 lWaSGLs~S~~lGvRDsnE-ASWt~lv~vve~~sDavdGvPvLvDgDtGyGNFNnARr~arkle~~GaaG~Cl 105 (272) T TIGR02320 34 LWASGLSVSAALGVRDSNE-ASWTQLVEVVETMSDAVDGVPVLVDGDTGYGNFNNARRLARKLEDRGAAGVCL 105 (272) T ss_pred EHHHHHHHHHHCCCCCCHH-HHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 0343677864147853203-20899999986431258880168537879871467999999997468632220 No 291 >PRK08261 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional Probab=39.19 E-value=29 Score=15.50 Aligned_cols=102 Identities=18% Similarity=0.262 Sum_probs=48.6 Q ss_pred CHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCC----CCHHHCCCC---CCC Q ss_conf 9899997427832112576899999985398423663276545897898887741588743----210110133---332 Q gi|254780420|r 110 HFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKG----AELAKHVTV---SQR 182 (396) Q Consensus 110 sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~----~dL~~~Vs~---~~~ 182 (396) +-.++-..+.+-|+ ||.|-|.||...+.+.+.... ..+..++ ..++=+-+.+. ...+ .|+. ..+ T Consensus 128 ~~~~~~a~~~legf----~rs~~ke~~~g~t~~l~~v~~-~~~~~~~---~~lrf~ls~~sa~v~gq~~-~v~~~~~~~~ 198 (447) T PRK08261 128 DPAQAIAQRALEGF----TRSLGKELRRGATAQLVYVAP-GAETGLE---STLRFFLSPRSAYVSGQVV-RVGAADAAPP 198 (447) T ss_pred CHHHHHHHHHHHHH----HHHHHHHHHCCCEEEEEEECC-CCHHHHH---HHHHHHHCCCCCCCCCEEE-EECCCCCCCC T ss_conf 99999999877788----998999972684367999788-8400115---5242440666331125389-9638988898 Q ss_pred EECCCCCCCCCCCCCCCCCCCCCCEEEEEE---CCCCCHHHHHHHHHCCCCEEEEC Q ss_conf 111686534454344346777771499996---79641023222220245413304 Q gi|254780420|r 183 RDWSEKIWKWGEETSFLKSSDAKYHVVCID---YGIRSNLLRVLSSLGCRITIVGA 235 (396) Q Consensus 183 ~~~~~~~~~~~~~~~~~~~~~~~~kIvvID---fGiK~nIlr~L~~~g~~v~Vvp~ 235 (396) .+|.++. . -|||++= =||=..|.+.|.+.|++|+++.. T Consensus 199 ~~~~~pL--------------~-GKVALVTGAArGIG~AIA~~LAreGA~VVi~Di 239 (447) T PRK08261 199 ADWDRPL--------------A-GKVALVTGAARGIGAAIAEVLARDGAHVVCLDV 239 (447) T ss_pred CCCCCCC--------------C-CCEEEEECCCCHHHHHHHHHHHHCCCEEEEEEC T ss_conf 8878889--------------9-998999172578999999999986999999827 No 292 >pfam02310 B12-binding B12 binding domain. This domain binds to B12 (adenosylcobamide), it is found in several enzymes, such as glutamate mutase, methionine synthase and methylmalonyl-CoA mutase. Probab=38.67 E-value=30 Score=15.44 Aligned_cols=40 Identities=30% Similarity=0.489 Sum_probs=22.4 Q ss_pred CCCCCHHHHHHHHHCCCCEEEECCCCHHHHH----HCCCCEEEEC Q ss_conf 7964102322222024541330465744644----1357889985 Q gi|254780420|r 213 YGIRSNLLRVLSSLGCRITIVGAETSYKDIM----SLQPDGVLLS 253 (396) Q Consensus 213 fGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~----~~~pdgi~lS 253 (396) +|+.+ |...|++.|++|.++..+.+.+++. +.+||-|-+| T Consensus 15 lGl~~-la~~l~~~G~~V~~~d~~~~~~~i~~~i~~~~pdiVgiS 58 (121) T pfam02310 15 LGLNY-VAAALRAAGFEVILLGADVPPEDIVEAIRAENPDVVGLS 58 (121) T ss_pred HHHHH-HHHHHHHCCCCCCCEECCCCHHHHHHHHHHCCCCEEEEE T ss_conf 89999-999999859950010247999999999998099999995 No 293 >pfam03652 UPF0081 Uncharacterized protein family (UPF0081). Probab=38.65 E-value=30 Score=15.44 Aligned_cols=52 Identities=19% Similarity=0.427 Sum_probs=28.6 Q ss_pred EEEEEECCCCCH---HHHHHHHHCCCCEEEECCC---CHHHHH----HCCCCEEEECCCCCCCCCC Q ss_conf 499996796410---2322222024541330465---744644----1357889985888884101 Q gi|254780420|r 207 HVVCIDYGIRSN---LLRVLSSLGCRITIVGAET---SYKDIM----SLQPDGVLLSNGPGDPAVT 262 (396) Q Consensus 207 kIvvIDfGiK~n---Ilr~L~~~g~~v~Vvp~~~---~~~~i~----~~~pdgi~lSnGPGdP~~~ 262 (396) +|+.||||.|+= |-..+...-.-+.+++... .+++|. .++|++|++ |-|... T Consensus 2 riLgiD~G~kriGvAisd~~~~~a~Pl~~i~~~~~~~~~~~i~~ii~e~~i~~iVv----GlP~~~ 63 (134) T pfam03652 2 RILGLDVGTKRIGVAISDPLGILASPLETIERKNGKPDLEELAELIKEWQPDGIVV----GLPLNM 63 (134) T ss_pred CEEEEECCCCEEEEEEECCCCCEEECEEEEECCCCHHHHHHHHHHHHHHCCCEEEE----ECCCCC T ss_conf 79999808997999993599982502488985885589999999999839998999----558999 No 294 >COG0533 QRI7 Metal-dependent proteases with possible chaperone activity [Posttranslational modification, protein turnover, chaperones] Probab=38.02 E-value=25 Score=15.96 Aligned_cols=49 Identities=16% Similarity=0.175 Sum_probs=31.6 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE--ECHHHHHHHHC Q ss_conf 57889985888884101121101222102578634786--10058878752 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI--CLGHQLLGLAL 294 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI--ClGhQ~l~~a~ 294 (396) +.|+|=.+.|||=.-..-....-...-.+..++|+.|| |.||-..+.-- T Consensus 71 dID~IA~T~gPGL~gaL~VG~~~Ak~LA~a~~kPli~VnH~~gHi~a~~l~ 121 (342) T COG0533 71 DIDAIAVTAGPGLGGALLVGATAAKALALALNKPLIPVNHLEGHIEAARLE 121 (342) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCEEECCHHHHHHHHHHHC T ss_conf 188899816999447789999999999998299972312279888788752 No 295 >PRK11173 two-component response regulator; Provisional Probab=37.66 E-value=31 Score=15.34 Aligned_cols=49 Identities=10% Similarity=0.199 Sum_probs=17.9 Q ss_pred EEEEEE--CCCCCHHHHHHHHHCCCCEEEECCCCHHH-HHHCCCCEEEECCC Q ss_conf 499996--79641023222220245413304657446-44135788998588 Q gi|254780420|r 207 HVVCID--YGIRSNLLRVLSSLGCRITIVGAETSYKD-IMSLQPDGVLLSNG 255 (396) Q Consensus 207 kIvvID--fGiK~nIlr~L~~~g~~v~Vvp~~~~~~~-i~~~~pdgi~lSnG 255 (396) ||+++| -.+-..+...|...|++|..+..-..+.+ +....||-|+|--+ T Consensus 5 ~ILiVEDD~~~~~~l~~~L~~~G~~V~~a~~~~ea~~~l~~~~~DlvilDi~ 56 (237) T PRK11173 5 HILIVEDELVTRNTLKSIFEAEGYDVFEATDGAEMHQILSENDINLVIMDIN 56 (237) T ss_pred EEEEEECCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHCCCCCEEEEECC T ss_conf 8999959899999999999988999999899999999986389989999389 No 296 >COG2247 LytB Putative cell wall-binding domain [Cell envelope biogenesis, outer membrane] Probab=37.52 E-value=26 Score=15.88 Aligned_cols=22 Identities=18% Similarity=0.116 Sum_probs=7.7 Q ss_pred CCCHHHHHHHHHHCCCCCEEEE Q ss_conf 2576899999985398423663 Q gi|254780420|r 125 GIDTRALTVWIRDYGVSNSVIA 146 (396) Q Consensus 125 GIDTRaLTk~IRe~G~~~a~I~ 146 (396) |+=|-++-..|-+.+.-+-.|. T Consensus 61 g~ynes~~~eI~~lnpd~VLII 82 (337) T COG2247 61 GIYNESVLDEIIELNPDLVLII 82 (337) T ss_pred CCCCHHHHHHHHHHCCCEEEEE T ss_conf 4213999999996198659997 No 297 >cd06355 PBP1_FmdD_like Periplasmic component (FmdD) of an active transport system for short-chain amides and urea (FmdDEF). This group includes the periplasmic component (FmdD) of an active transport system for short-chain amides and urea (FmdDEF), found in Methylophilus methylotrophus, and its homologs from other bacteria. FmdD, a type I periplasmic binding protein, is induced by short-chain amides and urea and repressed by excess ammonia, while FmdE and FmdF are hydrophobic transmembrane proteins. FmdDEF is predicted to be an ATP-dependent transporter and closely resembles the periplasmic binding protein and the two transmembrane proteins present in various hydrophobic amino acid-binding transport systems. Probab=37.42 E-value=31 Score=15.31 Aligned_cols=172 Identities=12% Similarity=0.085 Sum_probs=76.5 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCC----HHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCC Q ss_conf 666404654434552417989999742783211257----6899999985398423663276545897898887741588 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGID----TRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSG 167 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GID----TRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~ 167 (396) -+|+.+--..|. .+.....+.+.+.+|.+|-|-- +.+....+.+.+.+.-.-... ++. + ..+ T Consensus 42 elv~~Dd~~~p~--~a~~~a~~Li~~d~V~~iiG~~~S~~~~a~~pv~~~~~~~~~~p~~~-~g~----~------~~~- 107 (348) T cd06355 42 EAVVEDGASDWP--TFAEKARKLLTQDKVAAVFGCWTSASRKAVLPVFERHNGLLFYPVQY-EGL----E------QSP- 107 (348) T ss_pred EEEEECCCCCHH--HHHHHHHHHHHHCCCEEEECCCCCHHHHHCCHHHHHHCCEEEEECCC-CCC----C------CCC- T ss_conf 999857999989--99999999997189369987777176541344689839727851113-564----3------678- Q ss_pred CCCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEE----ECC--CCCHHHHHHHHHCCCCEEE---EC-CC Q ss_conf 74321011013333211168653445434434677777149999----679--6410232222202454133---04-65 Q gi|254780420|r 168 LKGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCI----DYG--IRSNLLRVLSSLGCRITIV---GA-ET 237 (396) Q Consensus 168 ~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvI----DfG--iK~nIlr~L~~~g~~v~Vv---p~-~~ 237 (396) +.. ... ..+. .....-..+.....+.++|++| +|| .....-..|.+.|.+|.-. |. .+ T Consensus 108 ----~~f-~~~-~~~~------~~~~~~~~~~~~~~g~k~vaii~~d~~~g~~~~~~~~~~~~~~G~~vv~~~~~~~g~~ 175 (348) T cd06355 108 ----NVF-YTG-AAPN------QQIIPAVDWLMSNKGGKRFYLVGSDYVYPRTANKILKAQLESLGGEVVGEEYLPLGHT 175 (348) T ss_pred ----CEE-EEC-CCCH------HHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCC T ss_conf ----889-933-7836------8999999999984799879999168513489999999999985998999981379975 Q ss_pred CH----HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHH-HCCCCCCEEEEECHHHHHHH Q ss_conf 74----464413578899858888841011211012221-02578634786100588787 Q gi|254780420|r 238 SY----KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICK-LVDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 238 ~~----~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~-~~~~~iPilGIClGhQ~l~~ 292 (396) ++ ..|.+.+||.||++...++-. ......... +....+|++|...+...+.. T Consensus 176 Dfs~~l~ki~~a~pD~v~~~~~g~~~~---~~~~q~~~~G~~~~~~~~~~~~~~~~~~~~ 232 (348) T cd06355 176 DFQSIINKIKAAKPDVVVSTVNGDSNV---AFFKQLKAAGITASKVPVLSFSVAEEELRG 232 (348) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCCHH---HHHHHHHHHCCCCCCCEEEEEECCHHHHHH T ss_conf 679999999976999999947651248---999999981787667358884057788753 No 298 >PRK06375 consensus Probab=37.23 E-value=31 Score=15.29 Aligned_cols=16 Identities=0% Similarity=0.156 Sum_probs=7.7 Q ss_pred CCCCCCCCCCHHHHCC Q ss_conf 8973684588234204 Q gi|254780420|r 66 FPHIGNVGVNSEDFES 81 (396) Q Consensus 66 ~P~IGNyGvn~~d~ES 81 (396) .|.-.+|+++-++.|. T Consensus 139 ~~~~~~~~~d~~~l~~ 154 (381) T PRK06375 139 VDTDDDYNIDIDNMRK 154 (381) T ss_pred EECCCCCCCCHHHHHH T ss_conf 2038877979999996 No 299 >COG0061 nadF NAD kinase [Coenzyme metabolism] Probab=36.96 E-value=31 Score=15.30 Aligned_cols=12 Identities=33% Similarity=0.625 Sum_probs=7.6 Q ss_pred HHCCCCEEEECC Q ss_conf 413578899858 Q gi|254780420|r 243 MSLQPDGVLLSN 254 (396) Q Consensus 243 ~~~~pdgi~lSn 254 (396) ....-||+++|- T Consensus 162 ~~~r~DGliVsT 173 (281) T COG0061 162 ESFRGDGLIVST 173 (281) T ss_pred EEEECCEEEEEC T ss_conf 999868999918 No 300 >COG0036 Rpe Pentose-5-phosphate-3-epimerase [Carbohydrate transport and metabolism] Probab=36.82 E-value=32 Score=15.25 Aligned_cols=94 Identities=18% Similarity=0.302 Sum_probs=51.5 Q ss_pred CCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHH---------CCCCCCCCHHHCCCCCCCEECCCCCCCCCCC Q ss_conf 2576899999985398423663276545897898887741---------5887432101101333321116865344543 Q gi|254780420|r 125 GIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKN---------WSGLKGAELAKHVTVSQRRDWSEKIWKWGEE 195 (396) Q Consensus 125 GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~---------~~~~~~~dL~~~Vs~~~~~~~~~~~~~~~~~ 195 (396) ..+.-.+.+.||+.|..-|+..+ |.. .++.+..-+.. -|+..|+.+...+.-|-... . T Consensus 95 ~~~~~r~i~~Ik~~G~kaGv~ln-P~T--p~~~i~~~l~~vD~VllMsVnPGfgGQ~Fi~~~l~Ki~~l-r--------- 161 (220) T COG0036 95 TEHIHRTIQLIKELGVKAGLVLN-PAT--PLEALEPVLDDVDLVLLMSVNPGFGGQKFIPEVLEKIREL-R--------- 161 (220) T ss_pred CCCHHHHHHHHHHCCCEEEEEEC-CCC--CHHHHHHHHHHCCEEEEEEECCCCCCCCCCHHHHHHHHHH-H--------- T ss_conf 76899999999975985779978-999--7789998986578999985779986631479999999999-9--------- Q ss_pred CCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEE Q ss_conf 443467777714999967964102322222024541330 Q gi|254780420|r 196 TSFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVG 234 (396) Q Consensus 196 ~~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp 234 (396) ..... .+....-+|=|++.+-++.+.+.|+++.|.- T Consensus 162 -~~~~~--~~~~~IeVDGGI~~~t~~~~~~AGad~~VaG 197 (220) T COG0036 162 -AMIDE--RLDILIEVDGGINLETIKQLAAAGADVFVAG 197 (220) T ss_pred -HHHCC--CCCEEEEEECCCCHHHHHHHHHCCCCEEEEE T ss_conf -97402--4775999968969888999997399999997 No 301 >KOG1467 consensus Probab=36.55 E-value=32 Score=15.22 Aligned_cols=80 Identities=29% Similarity=0.359 Sum_probs=46.1 Q ss_pred CCCCCCCEEEEEEC-----CCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEE------ECCCCCCCCCCCCCCHH Q ss_conf 67777714999967-----96410232222202454133046574464413578899------85888884101121101 Q gi|254780420|r 200 KSSDAKYHVVCIDY-----GIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVL------LSNGPGDPAVTSVYSSP 268 (396) Q Consensus 200 ~~~~~~~kIvvIDf-----GiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~------lSnGPGdP~~~~~~~~~ 268 (396) +.-.++++|+|+|= |-| .||.|..+|.+++-+-.+. +.-|+ ..-+-|| ||||-=- +..+....- T Consensus 380 ~~~~k~frVvVVDSRP~~EG~~--~lr~Lv~~GinctYv~I~a-~syim-~evtkvfLGahailsNG~vy-sR~GTa~va 454 (556) T KOG1467 380 KELGKKFRVVVVDSRPNLEGRK--LLRRLVDRGINCTYVLINA-ASYIM-LEVTKVFLGAHAILSNGAVY-SRVGTACVA 454 (556) T ss_pred HHHCCCEEEEEEECCCCCCHHH--HHHHHHHCCCCEEEEEEHH-HHHHH-HHHHEEEECHHHHHCCCCHH-HHCCHHHHH T ss_conf 9736664899980798753479--9999997598707998616-68898-75100253214563285102-200258999 Q ss_pred HHHHHCCCCCCEEEEECH Q ss_conf 222102578634786100 Q gi|254780420|r 269 IICKLVDSGMPMFGICLG 286 (396) Q Consensus 269 ~~~~~~~~~iPilGIClG 286 (396) ++. -..++|++-.|=- T Consensus 455 lvA--na~nVPVlVCCE~ 470 (556) T KOG1467 455 LVA--NAFNVPVLVCCEA 470 (556) T ss_pred HHH--CCCCCCEEEEECC T ss_conf 973--3368977998220 No 302 >TIGR01812 sdhA_frdA_Gneg succinate dehydrogenase or fumarate reductase, flavoprotein subunit; InterPro: IPR014006 Succinate:quinone oxidoreductase (1.3.5.1 from EC) refers collectively to succinate:quinone reductase (SQR, or Complex II) and quinol:fumarate reductase (QFR) . SQR is found in aerobic organisms, and catalyses the oxidation of succinate to fumarate in the citric acid cycle and donates the electrons to quinone in the membrane. QFR can be found in anaerobic cells respiring with fumarate as terminal electron acceptor. SQR and QFR are very similar in composition and structure, despite catalysing opposite reactions in vivo. They are thought to have evolved from a common ancestor, and in Escherichia coli they are capable of functionally replacing each other . Succinate:quinone oxidoreductases consist of a peripheral domain, exposed to the cytoplasm in bacteria and to the matrix in mitochondria, and a membrane-integral anchor domain that spans the membrane (Fig. 1). The peripheral part, which contains the dicarboxylate binding site, is composed of a flavoprotein subunit, with one covalently bound FAD, and an iron-sulphur protein subunit containing three iron-sulphur clusters. The membrane-integral domain functions to anchor the peripheral domain to the membrane and is required for quinone reduction and oxidation. The anchor domain shows the largest variability in composition and primary sequence, being composed either of one large subunit, or two smaller subunits, which may, or may not, contain protoheme groups. This entry represents the flavoprotein subunit found in both the SQR and QFR enzymes. This subunit contains an N-terminal domain which binds the FAD cofactor, a central catalytic domain with an unsual fold, and a C-terminal domain whose role is unclear , , . The dicarboxylate binding site is located between the FAD and catalytic domains.. Probab=36.21 E-value=12 Score=18.15 Aligned_cols=16 Identities=44% Similarity=0.675 Sum_probs=8.7 Q ss_pred EEE-ECCCCEEEECCCCCC Q ss_conf 268-679977898348788 Q gi|254780420|r 354 GFR-LIDQPVFSVQYHPES 371 (396) Q Consensus 354 gi~-~~~~~~~~vQfHPEa 371 (396) |+- .+| +-=|||||=+ T Consensus 252 G~PqL~D--MEFvQFHPTG 268 (636) T TIGR01812 252 GVPQLKD--MEFVQFHPTG 268 (636) T ss_pred CCCCCCC--CCCEECCCCC T ss_conf 4643168--4405036745 No 303 >cd06843 PLPDE_III_PvsE_like Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme PvsE. This subfamily is composed of PvsE from Vibrio parahaemolyticus and similar proteins. PvsE is a vibrioferrin biosynthesis protein which is homologous to eukaryotic ornithine decarboxylase (ODC) and diaminopimelate decarboxylase (DapDC). ODC and DapDC are fold type III PLP-dependent enzymes that contain an N-terminal PLP-binding TIM-barrel domain and a C-terminal beta-sandwich domain, similar to bacterial alanine racemases. It has been suggested that PvsE may be involved in the biosynthesis of the polycarboxylate siderophore vibrioferrin. It may catalyze the decarboxylation of serine to yield ethanolamine. PvsE may require homodimer formation and the presence of the PLP cofactor for activity. Probab=36.10 E-value=32 Score=15.17 Aligned_cols=38 Identities=11% Similarity=-0.071 Sum_probs=15.8 Q ss_pred CCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECC Q ss_conf 7989999742783211257689999998539842366327 Q gi|254780420|r 109 MHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHH 148 (396) Q Consensus 109 ~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~ 148 (396) ..+-.+|.+.+. |+.=.=--.| +.+|..|...-+|.+. T Consensus 39 ~~il~~l~~~G~-g~DvaS~~El-~~~~~~~~~~~Iif~g 76 (377) T cd06843 39 PPILRALAPHVD-GFEVASGGEI-AHVRAAVPDAPLIFGG 76 (377) T ss_pred HHHHHHHHHCCC-CEEEECHHHH-HHHHHCCCCCCEEECC T ss_conf 999999997699-6898299999-9998539996479889 No 304 >PRK06720 hypothetical protein; Provisional Probab=36.09 E-value=29 Score=15.47 Aligned_cols=15 Identities=13% Similarity=0.359 Sum_probs=5.5 Q ss_pred HHHHHHHHCCCCEEE Q ss_conf 232222202454133 Q gi|254780420|r 219 LLRVLSSLGCRITIV 233 (396) Q Consensus 219 Ilr~L~~~g~~v~Vv 233 (396) +.+.|.+.|++|.+. T Consensus 32 ~A~~la~~Ga~Vvi~ 46 (169) T PRK06720 32 TALLLAKQGAKVIVT 46 (169) T ss_pred HHHHHHHCCCEEEEE T ss_conf 999999869989995 No 305 >pfam01256 Carb_kinase Carbohydrate kinase. This family is related to pfam02110 and pfam00294 implying that it also is a carbohydrate kinase. (personal obs Yeats C). Probab=36.06 E-value=33 Score=15.17 Aligned_cols=49 Identities=24% Similarity=0.255 Sum_probs=20.9 Q ss_pred CCCEEEECCCCCCCCCCCCCC----HHHHHHHCCCCC-CEEEEECHHHHHHHHC Q ss_conf 578899858888841011211----012221025786-3478610058878752 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYS----SPIICKLVDSGM-PMFGICLGHQLLGLAL 294 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~----~~~~~~~~~~~i-PilGIClGhQ~l~~a~ 294 (396) .|++=+.-|--|+|....... .-++-.++..+. |.-..|+|--+-++|- T Consensus 166 ~p~g~~~~n~~Gn~~lAtaGsGDVLaGii~~llaqg~~~~~Aa~~av~lHg~Ag 219 (241) T pfam01256 166 APGGEVWINSTGNSALAKGGSGDVLAGLIGGLLAQNEDPYDAAIAAAWLHGAAS 219 (241) T ss_pred ECCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 489977870689988677875468899999999778998999999999999999 No 306 >pfam07073 ROF Modulator of Rho-dependent transcription termination (ROF). This family consists of several bacterial modulator of Rho-dependent transcription termination (ROF) proteins. ROF binds transcription termination factor Rho and inhibits Rho-dependent termination in vivo. Probab=36.02 E-value=33 Score=15.16 Aligned_cols=55 Identities=25% Similarity=0.323 Sum_probs=36.2 Q ss_pred CCEEEEECCCCEEEEEEECCCCE--EEEEEEEECCCCCCCCCCCCCCCCCCEEEEC----CCCCCCC Q ss_conf 96689972898799997567624--8888985378888760141614256173216----8973684 Q gi|254780420|r 12 PTAVLVLADGSVIEGMGCGATGS--IQAEICFNTSLTGYQEILTDPSYLGQIVNFT----FPHIGNV 72 (396) Q Consensus 12 ~~A~L~LedGt~f~G~~fG~~~~--~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT----~P~IGNy 72 (396) -+-.|.|.||+.++|.+..-... ..-=++..++ .|=|++..| +|..|+ .|++|.. T Consensus 18 y~l~l~LkdG~~i~G~A~dt~~~~~k~E~L~i~~~-~~~~~i~Ld-----~I~s~~~l~~np~f~~v 78 (80) T pfam07073 18 YPLKLELKDGEQVEGKALDTLTNADKEECLVIEQE-GETQLLRLD-----HIASITALTENPHFGRV 78 (80) T ss_pred CEEEEEECCCCEEEEEEEHHEECCCCEEEEEEECC-CCEEEEECC-----EEEEEEECCCCCEEEEE T ss_conf 87999996999999998622784764599999718-968999925-----15732776889825577 No 307 >COG1118 CysA ABC-type sulfate/molybdate transport systems, ATPase component [Inorganic ion transport and metabolism] Probab=35.71 E-value=33 Score=15.13 Aligned_cols=55 Identities=27% Similarity=0.466 Sum_probs=34.0 Q ss_pred CEEEEEE--CC-----CCCHHHHHHHH---H-CCCCEEEECCCCHHHHHHCCCCEEEECCCC----CCCCCC Q ss_conf 1499996--79-----64102322222---0-245413304657446441357889985888----884101 Q gi|254780420|r 206 YHVVCID--YG-----IRSNLLRVLSS---L-GCRITIVGAETSYKDIMSLQPDGVLLSNGP----GDPAVT 262 (396) Q Consensus 206 ~kIvvID--fG-----iK~nIlr~L~~---~-g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGP----GdP~~~ 262 (396) ++|.++| || +++.+-+.|++ + |+.+.-| +-+.+|.+.+-..-++|+||- |+|.+. T Consensus 156 P~vLLLDEPf~ALDa~vr~~lr~wLr~~~~~~~~ttvfV--THD~eea~~ladrvvvl~~G~Ieqvg~p~ev 225 (345) T COG1118 156 PKVLLLDEPFGALDAKVRKELRRWLRKLHDRLGVTTVFV--THDQEEALELADRVVVLNQGRIEQVGPPDEV 225 (345) T ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCEEEEE--ECCHHHHHHHCCEEEEECCCEEEEECCHHHH T ss_conf 986863587214519999999999999998609639999--5899999864165999528805530897887 No 308 >PRK10816 DNA-binding transcriptional regulator PhoP; Provisional Probab=35.69 E-value=33 Score=15.13 Aligned_cols=79 Identities=15% Similarity=0.137 Sum_probs=30.3 Q ss_pred HHHHHHHHHCCCCEEEECCCCH-HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE----CHHHHHHH Q ss_conf 0232222202454133046574-464413578899858888841011211012221025786347861----00588787 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSY-KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC----LGHQLLGL 292 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~-~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC----lGhQ~l~~ 292 (396) .+...|...|+.|.++..-..+ +.+....||-+++--...+.. +......++. ....+||+-+. ....+-|+ T Consensus 15 ~l~~~L~~~g~~V~~a~~~~~al~~~~~~~~dlvilD~~lp~~~--G~~l~~~ir~-~~~~~piI~lta~~~~~~~~~al 91 (223) T PRK10816 15 HLKVQLQDAGHQVDAAEDAKEADYYLNEHLPDIAIVDLGLPDED--GLSLIRRWRS-NDVSLPILVLTARESWQDKVEVL 91 (223) T ss_pred HHHHHHHHCCCEEEEECCHHHHHHHHHHCCCCEEEEECCCCCCC--CCCCCCCHHC-CCCCCCEEEEECCCCHHHHHHHH T ss_conf 99999997899999989999999999757998999979998988--6400120110-48987689994445677899999 Q ss_pred HCCCCCC Q ss_conf 5285446 Q gi|254780420|r 293 ALGAQTV 299 (396) Q Consensus 293 a~G~~~~ 299 (396) ..|+.-| T Consensus 92 ~~Gaddy 98 (223) T PRK10816 92 SAGADDY 98 (223) T ss_pred HCCCCEE T ss_conf 8699886 No 309 >pfam00814 Peptidase_M22 Glycoprotease family. The Peptidase M22 proteins are part of the HSP70-actin superfamily. The region represented here is an insert into the fold and is not found in the rest of the family (beyond the Peptidase M22 family). Included in this family are the Rhizobial NodU proteins and the HypF regulator. This region also contains the histidine dyad believed to coordinate the metal ion and hence provide catalytic activity. Interestingly the histidines are not well conserved, and there is a lack of experimental evidence to support peptidase activity as a general property of this family. There also appear to be instances of this domain outside of the HSP70-actin superfamily. Probab=35.25 E-value=15 Score=17.53 Aligned_cols=45 Identities=20% Similarity=0.226 Sum_probs=26.9 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE--CHHHHH Q ss_conf 578899858888841011211012221025786347861--005887 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC--LGHQLL 290 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC--lGhQ~l 290 (396) +.|+|-.+.|||-+........-...-....++|+.||+ -||-+. T Consensus 49 did~IAvt~GPGl~~~L~vG~~~Ak~La~~l~~Pli~VnHleaH~~~ 95 (225) T pfam00814 49 DLDAIAVTAGPGLFTGLRVGATFAKGLALALDKPLVGVNHLEAHALA 95 (225) T ss_pred HCCEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 68999980599818869999999999999956984024668999988 No 310 >PRK06252 methylcobalamin:coenzyme M methyltransferase; Validated Probab=35.21 E-value=34 Score=15.07 Aligned_cols=52 Identities=13% Similarity=0.054 Sum_probs=35.8 Q ss_pred EEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCC Q ss_conf 99996796410232222202454133046574464413578899858888841 Q gi|254780420|r 208 VVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPA 260 (396) Q Consensus 208 IvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~ 260 (396) +++.=||-...++..+.+.|+++.-+-+..+.++..+.-.+.+.|-++ =||. T Consensus 233 ~ilh~~g~~~~~~~~~~~~g~d~~s~D~~~~l~~a~~~~g~~~~lqGN-ldP~ 284 (339) T PRK06252 233 TILHICGDLTSILEELADCGFDGISIDEKVSVKQAKENVGNRARLIGN-VSTS 284 (339) T ss_pred EEEEECCCCHHHHHHHHHCCCCEEEECCCCCHHHHHHHHCCCEEEEEC-CCCH T ss_conf 188865861378999996499999716999999999981899179936-8977 No 311 >cd03307 Mta_CmuA_like MtaA_CmuA_like family. MtaA/CmuA, also MtsA, or methyltransferase 2 (MT2) MT2-A and MT2-M isozymes, are methylcobamide:Coenzyme M methyltransferases, which play a role in metabolic pathways of methane formation from various substrates, such as methylated amines and methanol. Coenzyme M, 2-mercaptoethylsulfonate or CoM, is methylated during methanogenesis in a reaction catalyzed by three proteins. A methyltransferase methylates the corrinoid cofactor, which is bound to a second polypeptide, a corrinoid protein. The methylated corrinoid protein then serves as a substrate for MT2-A and related enzymes, which methylate CoM. Probab=35.18 E-value=34 Score=15.07 Aligned_cols=53 Identities=13% Similarity=0.112 Sum_probs=38.7 Q ss_pred EEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCC Q ss_conf 499996796410232222202454133046574464413578899858888841 Q gi|254780420|r 207 HVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPA 260 (396) Q Consensus 207 kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~ 260 (396) .+.+.=||-...++..+.+.|+++.-+.++.+..+..+.-++.+.|-++ =||. T Consensus 224 ~~i~h~~G~~~~~~~~~~~~g~d~is~D~~~~l~~a~~~~~~~~~iqGN-ldP~ 276 (326) T cd03307 224 PTILHICGNTTPILEYIAQCGFDGISVDEKVDVKTAKEIVGGRAALIGN-VSPS 276 (326) T ss_pred CEEEEECCCHHHHHHHHHHCCCCEEECCCCCCHHHHHHHCCCCEEEEEC-CCCH T ss_conf 8278845875889999997199899668989999999980898169942-8848 No 312 >pfam12257 DUF3608 Protein of unknown function (DUF3608). This domain family is found in eukaryotes, and is approximately 280 amino acids in length. The family is found in association with pfam00610. Probab=34.97 E-value=24 Score=16.14 Aligned_cols=43 Identities=21% Similarity=0.209 Sum_probs=23.1 Q ss_pred CEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHHHH Q ss_conf 8899858888841011211012221025786347861005887 Q gi|254780420|r 248 DGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQLL 290 (396) Q Consensus 248 dgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ~l 290 (396) .-|++|+|+|==+........+-++++..++-+==||||-|=| T Consensus 231 ~ivvITpG~GvFeVd~~ll~~T~~r~~~~gig~DlIcL~~~PL 273 (281) T pfam12257 231 HVIIITPGTGLYDVDYDLLLETSKKLLSLEIALDIICLSQPPL 273 (281) T ss_pred EEEEEECCCCEEEECHHHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 7999918996388789999999999871775456898489987 No 313 >pfam03575 Peptidase_S51 Peptidase family S51. Probab=34.86 E-value=34 Score=15.04 Aligned_cols=46 Identities=20% Similarity=0.203 Sum_probs=33.7 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCC Q ss_conf 6664046544345524179899997427832112576899999985398 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGV 140 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~ 140 (396) |+-|..+.... ++...+.+++.+.++.-++|=+|-.|.+.+|+.|. T Consensus 13 g~~v~~l~~~~---~~~~~~~~~i~~ad~I~v~GGnt~~ll~~l~~~g~ 58 (154) T pfam03575 13 GLEVSGLHLFT---PSVEDIEAKILKADVIYVGGGNTFHLLKLLRETGL 58 (154) T ss_pred CCEEEEEECCC---CCHHHHHHHHHHCCEEEECCCHHHHHHHHHHHCCC T ss_conf 99778986667---97472999998599999898759999999998682 No 314 >PRK04761 ppnK inorganic polyphosphate/ATP-NAD kinase; Reviewed Probab=34.85 E-value=29 Score=15.47 Aligned_cols=22 Identities=18% Similarity=0.315 Sum_probs=12.1 Q ss_pred CCCCCHHHHHHHCCCCEECCCC Q ss_conf 2417989999742783211257 Q gi|254780420|r 106 RANMHFDAWLKSCGIIGLSGID 127 (396) Q Consensus 106 rs~~sL~~wL~~~~IpgI~GID 127 (396) .++..+.++.+.++-.-+...| T Consensus 6 ~a~~a~~~l~~~y~~~~~~eaD 27 (246) T PRK04761 6 EAQAALEELVKRYGDVPIEEAD 27 (246) T ss_pred HHHHHHHHHHHHHCCCCHHHCC T ss_conf 9999999999985899965799 No 315 >cd03146 GAT1_Peptidase_E Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E. Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E. This group contains proteins similar to the aspartyl dipeptidases Salmonella typhimurium peptidase E and Xenopus laevis peptidase E. In bacteria peptidase E is believed to play a role in degrading peptides generated by intracellular protein breakdown or imported into the cell as nutrient sources. Peptidase E uniquely hydrolyses only Asp-X dipeptides (where X is any amino acid), and one tripeptide Asp-Gly-Gly. Peptidase E is believed to be a serine peptidase having a Ser-His-Glu catalytic triad which differs from the Cys-His-Glu catalytic triad typical of GATase1 domains by having a Ser in place of the reactive Cys at the nucleophile elbow. Xenopus PepE is developmentally regulated in response to thyroid hormone and, it is thought to play a role in apoptosis during tail reabsorption. Probab=34.73 E-value=34 Score=15.02 Aligned_cols=68 Identities=15% Similarity=0.163 Sum_probs=31.2 Q ss_pred HHHHH-CCCCEEEECC--CCHHHHHHCCCCEEEECCCCCCCCCCCC-----CCHHHHHHHCCCCCCEEEEECHHHHHHH Q ss_conf 22220-2454133046--5744644135788998588888410112-----1101222102578634786100588787 Q gi|254780420|r 222 VLSSL-GCRITIVGAE--TSYKDIMSLQPDGVLLSNGPGDPAVTSV-----YSSPIICKLVDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 222 ~L~~~-g~~v~Vvp~~--~~~~~i~~~~pdgi~lSnGPGdP~~~~~-----~~~~~~~~~~~~~iPilGIClGhQ~l~~ 292 (396) .+.++ +|++..+..- .+..+.+. +-|+|+++.| |....-. .....+++..+.+.|+.|.--|--+.+. T Consensus 54 ~f~~l~~~~~~~l~~~~~~d~~~~l~-~ad~I~v~GG--nt~~ll~~~~~~~l~~~l~~~~~~G~~~~G~SAGa~~~~~ 129 (212) T cd03146 54 AFESLRGVEVSHLHLFDTEDPLDALL-EADVIYVGGG--NTFNLLAQWREHGLDAILKAALERGVVYIGWSAGSNCWFP 129 (212) T ss_pred HHHCCCCCEEEEEECCCCCCHHHHHH-HCCEEEECCC--CHHHHHHHHHHCCHHHHHHHHHHCCCEEEEECCCHHHHCC T ss_conf 86133696256875248678899997-4999998897--6999999999789899999999779889998615776167 No 316 >cd04736 MDH_FMN Mandelate dehydrogenase (MDH)-like FMN-binding domain. MDH is part of a widespread family of homologous FMN-dependent a-hydroxy acid oxidizing enzymes that oxidizes (S)-mandelate to phenylglyoxalate. MDH is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). Probab=34.72 E-value=34 Score=15.02 Aligned_cols=86 Identities=31% Similarity=0.375 Sum_probs=47.6 Q ss_pred ECCCCCHHHHHHHHHCCCCEEEECCCCHHHHH---HCCCCEEEECCCCCCC---CCCCCCCHHHHHHHCCCCCCEE---E Q ss_conf 67964102322222024541330465744644---1357889985888884---1011211012221025786347---8 Q gi|254780420|r 212 DYGIRSNLLRVLSSLGCRITIVGAETSYKDIM---SLQPDGVLLSNGPGDP---AVTSVYSSPIICKLVDSGMPMF---G 282 (396) Q Consensus 212 DfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~---~~~pdgi~lSnGPGdP---~~~~~~~~~~~~~~~~~~iPil---G 282 (396) |-..-+.-++.|++.=---.|+--=.+.+|-. ..--|||++||==|.- +.....+.+.+.+.. +.||+ | T Consensus 220 ~~~~tW~di~wlr~~w~~plilKGI~~~eDA~~A~~~G~dgIiVSNHGGRQLD~a~~~id~Lp~I~~av--~~~V~~DgG 297 (361) T cd04736 220 DASFNWQDLRWLRDLWPHKLLVKGIVTAEDAKRCIELGADGVILSNHGGRQLDDAIAPIEALAEIVAAT--YKPVLIDSG 297 (361) T ss_pred CCCCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHH--CCEEEEECC T ss_conf 889999999999986699745521489999999987699999975886333577741477799999971--994999489 Q ss_pred EECHHHHH-HHHCCCCCC Q ss_conf 61005887-875285446 Q gi|254780420|r 283 ICLGHQLL-GLALGAQTV 299 (396) Q Consensus 283 IClGhQ~l-~~a~G~~~~ 299 (396) |=-|-.++ |+|+||+.. T Consensus 298 IRrG~DV~KALALGA~aV 315 (361) T cd04736 298 IRRGSDIVKALALGANAV 315 (361) T ss_pred CCCHHHHHHHHHCCCCEE T ss_conf 887899999997799989 No 317 >PRK10693 response regulator of RpoS; Provisional Probab=34.69 E-value=33 Score=15.10 Aligned_cols=40 Identities=13% Similarity=0.323 Sum_probs=21.0 Q ss_pred CCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHH Q ss_conf 783211257689999998539842366327654589789888774 Q gi|254780420|r 119 GIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAK 163 (396) Q Consensus 119 ~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~ 163 (396) ..|++.|+ .+.++||+.+..--+|.-.. ..+.++..+.++ T Consensus 60 ~MP~mdGl---ell~~lr~~~~~~PVIvlTa--~~~~~d~v~al~ 99 (337) T PRK10693 60 AMPRMNGL---KFVEHLRNRGDQTPVLVISA--TENMADIAKALR 99 (337) T ss_pred CCCCCCHH---HHHHHHHHHCCCCEEEEEEC--CCCHHHHHHHHH T ss_conf 99999989---99999998589964999986--899999999997 No 318 >cd06331 PBP1_AmiC_like Type I periplasmic components of amide-binding protein (AmiC) and the active transport system for short-chain and urea (FmdDEF). This group includes the type I periplasmic components of amide-binding protein (AmiC) and the active transport system for short-chain and urea (FmdDEF), found in bacteria and Archaea. AmiC controls expression of the amidase operon by a ligand-triggered conformational switch. In the absence of ligand or presence of butyramide (repressor), AmiC (the ligand sensor and negative regulator) adopts an open conformation and inhibits the transcription antitermination function of AmiR by direct protein-protein interaction. In the presence of inducing ligands such as acetamide, AmiC adopts a closed conformation which disrupts a silencing AmiC-AmiR complex and the expression of amidase and other genes of the operon is induced. FmdDEF is predicted to be an ATP-dependent transporter and closely resembles the periplasmic binding protein and the two t Probab=34.43 E-value=34 Score=14.99 Aligned_cols=82 Identities=16% Similarity=0.203 Sum_probs=42.1 Q ss_pred CEEEEE----ECC--CCCHHHHHHHHHCCCCEE---EECC-CCH----HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHH Q ss_conf 149999----679--641023222220245413---3046-574----46441357889985888884101121101222 Q gi|254780420|r 206 YHVVCI----DYG--IRSNLLRVLSSLGCRITI---VGAE-TSY----KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIIC 271 (396) Q Consensus 206 ~kIvvI----DfG--iK~nIlr~L~~~g~~v~V---vp~~-~~~----~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~ 271 (396) +||+++ +|| .....-..+.+.|.+|+- +|.. +++ ..|.+.+||.||++.-+.|-. ..... .+ T Consensus 133 kkvai~~~d~~~G~~~~~~~~~~~~~~G~~vv~~~~~~~~~~Dfs~~l~~i~~~~pD~v~~~~~~~~~~---~~~~q-~~ 208 (333) T cd06331 133 KRFYLIGSDYVWPRESNRIARALLEELGGEVVGEEYLPLGTSDFGSVIEKIKAAGPDVVLSTLVGDSNV---AFYRQ-FA 208 (333) T ss_pred CEEEEEECCCCHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCHHH---HHHHH-HH T ss_conf 879999448617899999999999985994999997069972479999998765999999935660399---99999-99 Q ss_pred H--HCCCCCCEEEEECHHHHHH Q ss_conf 1--0257863478610058878 Q gi|254780420|r 272 K--LVDSGMPMFGICLGHQLLG 291 (396) Q Consensus 272 ~--~~~~~iPilGIClGhQ~l~ 291 (396) . +....+|+++..++.+.+. T Consensus 209 ~~G~~~~~~~~~~~~~~~~~~~ 230 (333) T cd06331 209 AAGLDADRIPILSLTLDENELA 230 (333) T ss_pred HCCCCCCCEEEEECCCCHHHHH T ss_conf 7599966618997545678898 No 319 >PRK11070 ssDNA exonuclease RecJ; Provisional Probab=34.43 E-value=29 Score=15.47 Aligned_cols=34 Identities=24% Similarity=0.311 Sum_probs=18.4 Q ss_pred EEEEECCCCCH-HHHHHHHHCCCCEEEECCCCHHH Q ss_conf 99996796410-23222220245413304657446 Q gi|254780420|r 208 VVCIDYGIRSN-LLRVLSSLGCRITIVGAETSYKD 241 (396) Q Consensus 208 IvvIDfGiK~n-Ilr~L~~~g~~v~Vvp~~~~~~~ 241 (396) |+-+|||+..- -+..+.++|++|+|.-+..+.++ T Consensus 130 iITvDcGi~a~e~i~~a~~~GidvIVtDHH~~~~~ 164 (574) T PRK11070 130 IVTVDNGISSHAGVAHAKSLGIPVIVTDHHLPGDT 164 (574) T ss_pred EEEECCCHHHHHHHHHHHHCCCCEEEECCCCCCCC T ss_conf 99957853029999999977999999789999765 No 320 >pfam06218 NPR2 Nitrogen permease regulator 2. This family of regulators are involved in post-translational control of nitrogen permease. Probab=34.37 E-value=24 Score=16.09 Aligned_cols=58 Identities=14% Similarity=0.261 Sum_probs=34.0 Q ss_pred CCCCCCCCCCEEEECCCCCCCCCC-------CHHHHCCCCCCCCCEEEEEEEECCCCCCCCC-CCCCCHHHHH Q ss_conf 141614256173216897368458-------8234204653332102366640465443455-2417989999 Q gi|254780420|r 51 ILTDPSYLGQIVNFTFPHIGNVGV-------NSEDFESISRKNFKGAVGLVIKAEITEPSNY-RANMHFDAWL 115 (396) Q Consensus 51 ~lTDPSY~gQIlvfT~P~IGNyGv-------n~~d~ES~~~~~~~~~~g~Iv~~~~~~~Sn~-rs~~sL~~wL 115 (396) +++.|..+|..|+++. +.|-| ....|+- -.+..--++|+.+.++. +.| +..+-|.++| T Consensus 44 IIPk~~Lc~klItv~~---~~yrilG~PV~I~~~kY~R---N~f~FN~cfV~~~~~~~-~~YepvV~kLa~~l 109 (400) T pfam06218 44 IIPKPQLCNKLITVKA---NKYRIIGYPVNIENSKYAR---NAFLFNFCFVFPYDADT-SPYEPAVKRLGKML 109 (400) T ss_pred ECCCHHHCCCEEEEEE---CCEEEEEEEEEECCCCCCC---CCEEEEEEEEECCCCCC-CCCHHHHHHHHHHH T ss_conf 7179599796699996---8889996215726886434---43689899997676664-11189999999999 No 321 >CHL00067 rps2 ribosomal protein S2 Probab=34.37 E-value=35 Score=14.98 Aligned_cols=31 Identities=23% Similarity=0.401 Sum_probs=21.8 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 578899858888841011211012221025786347861 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) -||.||+- ||... ..+++++...+||+.||| T Consensus 157 lPd~iii~----d~~~e----~~ai~Ea~kL~IPvIaiv 187 (227) T CHL00067 157 LPDIVIIV----DQQEE----YTALRECIKLGIPTISIV 187 (227) T ss_pred CCCEEEEE----CCCCC----HHHHHHHHHCCCCEEEEE T ss_conf 89769996----77533----799999987599989996 No 322 >PRK13789 phosphoribosylamine--glycine ligase; Provisional Probab=34.17 E-value=35 Score=14.96 Aligned_cols=28 Identities=11% Similarity=0.028 Sum_probs=13.8 Q ss_pred HHHHHHCCCCEECC---CCHHHHHHHHHHCC Q ss_conf 99997427832112---57689999998539 Q gi|254780420|r 112 DAWLKSCGIIGLSG---IDTRALTVWIRDYG 139 (396) Q Consensus 112 ~~wL~~~~IpgI~G---IDTRaLTk~IRe~G 139 (396) .+||++++||--.- -|....-+.|++.+ T Consensus 113 K~fm~~~~IPTa~~~~f~~~~~A~~~l~~~~ 143 (426) T PRK13789 113 KSLMKEAKIPTASYKTFTEYSSSLSYLESEM 143 (426) T ss_pred HHHHHHCCCCCCEEEEECCHHHHHHHHHHCC T ss_conf 9999972999760157368999999998579 No 323 >cd06298 PBP1_CcpA_like Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation. Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation (CCR), a process in which enzymes necessary for the metabolism of alternative sugars are inhibited in the presence of glucose. In gram-positive bacteria, CCR is controlled by HPr, a phosphoenolpyruvate:sugar phsophotrasnferase system (PTS) and a transcriptional regulator CcpA. Moreover, CcpA can regulate sporulation and antibiotic resistance as well as play a role in virulence development of certain pathogens such as the group A streptococcus. The ligand binding domain of CcpA is a member of the LacI-GalR family of bacterial transcription regulators. Probab=33.98 E-value=35 Score=14.94 Aligned_cols=23 Identities=13% Similarity=0.234 Sum_probs=9.3 Q ss_pred HHHHHCCCCCCEEEEECHHHHHHH Q ss_conf 222102578634786100588787 Q gi|254780420|r 269 IICKLVDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 269 ~~~~~~~~~iPilGIClGhQ~l~~ 292 (396) ....+++.+ |+=+|+-+.-.+|+ T Consensus 168 ~~~~l~~~~-~~~ai~~~nD~~A~ 190 (268) T cd06298 168 LAEELLEDG-KPTAAFVTDDELAI 190 (268) T ss_pred HHHHHHHCC-CCCEEEECCHHHHH T ss_conf 999998369-99789966879999 No 324 >cd02146 NfsA_FRP This family contains NADPH-dependent flavin reductase and oxygen-insensitive nitroreductase. These enzymes are homodimeric flavoproteins that contain one FMN per monomer as a cofactor. Flavin reductase catalyzes the reduction of flavin by using NADPH as an electron donor. Oxygen-insensitive nitroreductase, such as NfsA protein in Escherichia coli, catalyzes reduction of nitrocompounds using NADPH as electron donor. Probab=33.85 E-value=16 Score=17.28 Aligned_cols=140 Identities=16% Similarity=0.258 Sum_probs=55.7 Q ss_pred HHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEEC Q ss_conf 99853984236632765458978988877415887432101101333321116865344543443467777714999967 Q gi|254780420|r 134 WIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCIDY 213 (396) Q Consensus 134 ~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvIDf 213 (396) .|+.+-+.+.- ...+-.+..++.+.+.+...|.-.+....+-|-++.+.. .+.+........... .-..+=|.+.|+ T Consensus 4 ~i~~rRSvR~f-~~~~I~~e~l~~il~aa~~APSs~n~Q~~~~i~V~d~~~-r~~la~~~~~q~~v~-~Ap~~lvf~aD~ 80 (229) T cd02146 4 TLLNHRSIRKF-KDEPIPDETLETLIAAAQSAPTSSNLQAYSVIVVTDPEL-KAKLAELAGGQPYVA-QAPVFLVFCADF 80 (229) T ss_pred HHHHCCCCCCC-CCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEEECHHH-HHHHHHHCCCCHHHH-CCCEEEEEEEEC T ss_conf 89747852486-979899999999999998688846778728999948899-999998746740353-398589999751 Q ss_pred CCCCHHHHHHHHHCCCCEEEEC---------CCC--HHHH----HHCCCCEEEECCCC-CCCCCCCCCCHHHHHHHC--- Q ss_conf 9641023222220245413304---------657--4464----41357889985888-884101121101222102--- Q gi|254780420|r 214 GIRSNLLRVLSSLGCRITIVGA---------ETS--YKDI----MSLQPDGVLLSNGP-GDPAVTSVYSSPIICKLV--- 274 (396) Q Consensus 214 GiK~nIlr~L~~~g~~v~Vvp~---------~~~--~~~i----~~~~pdgi~lSnGP-GdP~~~~~~~~~~~~~~~--- 274 (396) +-..+.+..+|..-..... |+. ++.+ .++-.-.+.|+ |- .||. .+++++ T Consensus 81 ---~R~~~~~~~~~~~~~~~~~~~~~~~~~~Da~iAaqn~~lAAeslGLG~~~IG-~irn~~~--------~v~elL~LP 148 (229) T cd02146 81 ---HRLAKIAEKHGGAEAALDNLESLLVAAVDAGLAAQNAAVAAESLGLGIVYIG-GIRNDPE--------AVIELLNLP 148 (229) T ss_pred ---HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEH-HHHCCHH--------HHHHHHCCC T ss_conf ---8889999973897435573888999999999999999999998699858741-4540999--------999997889 Q ss_pred CCCCCEEEEECHHH Q ss_conf 57863478610058 Q gi|254780420|r 275 DSGMPMFGICLGHQ 288 (396) Q Consensus 275 ~~~iPilGIClGhQ 288 (396) +.-.|++|+|+|+= T Consensus 149 ~~~~Pv~gL~lGyP 162 (229) T cd02146 149 EYVFPVFGLAVGYP 162 (229) T ss_pred CCEEEEEEEEEECC T ss_conf 97168999653153 No 325 >TIGR02633 xylG D-xylose ABC transporter, ATP-binding protein; InterPro: IPR013455 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). Several bacterial species use the enzymes xylose isomerase and xylulokinase for xylose utilisation. This entry represents the ATP-binding cassette (ABC) subunit of the known, or predicted, high-affinity xylose ABC transporter used for xylose import. The genes encoding these proteins, which closely resemble other sugar transport ABC transporter genes, are typically found near xylose utilisation enzymes and regulatory proteins. . Probab=33.84 E-value=34 Score=15.03 Aligned_cols=13 Identities=23% Similarity=0.452 Sum_probs=5.0 Q ss_pred HHHHHHHHHCCCC Q ss_conf 0232222202454 Q gi|254780420|r 218 NLLRVLSSLGCRI 230 (396) Q Consensus 218 nIlr~L~~~g~~v 230 (396) +|+|-|.++|..+ T Consensus 182 diirDLk~~g~AC 194 (501) T TIGR02633 182 DIIRDLKAKGVAC 194 (501) T ss_pred HHHHHHHHCCCEE T ss_conf 9999997469359 No 326 >PRK00109 Holliday junction resolvase-like protein; Reviewed Probab=33.48 E-value=36 Score=14.89 Aligned_cols=53 Identities=17% Similarity=0.276 Sum_probs=28.6 Q ss_pred CEEEEEECCCCCH---HHHHHHHHCCCCEEEECCC---CHHHH----HHCCCCEEEECCCCCCCCCC Q ss_conf 1499996796410---2322222024541330465---74464----41357889985888884101 Q gi|254780420|r 206 YHVVCIDYGIRSN---LLRVLSSLGCRITIVGAET---SYKDI----MSLQPDGVLLSNGPGDPAVT 262 (396) Q Consensus 206 ~kIvvIDfGiK~n---Ilr~L~~~g~~v~Vvp~~~---~~~~i----~~~~pdgi~lSnGPGdP~~~ 262 (396) -||+.||||.|+- +-+.+...-.-+.+++... .+++| ..+++++|++ |-|-.. T Consensus 4 griLgiD~G~kriGlAisD~~~~~a~Pl~~i~~~~~~~~~~~l~~ii~e~~i~~iVi----GlP~~~ 66 (141) T PRK00109 4 GRILGFDVGTKRIGVAIGDPLGGTARPLETIKAQNGEPDWDALEKLLKEWQPDLLVV----GLPLNM 66 (141) T ss_pred CCEEEEEECCCEEEEEEECCCCCEEECCEEEECCCCHHHHHHHHHHHHHHCCCEEEE----ECCCCC T ss_conf 888999818997999994699984744143772770689999999999839998999----447999 No 327 >pfam00853 Runt Runt domain. Probab=33.15 E-value=14 Score=17.69 Aligned_cols=25 Identities=20% Similarity=0.448 Sum_probs=18.6 Q ss_pred CCCCCCCCCCCHHHHHHHCCCCEECCCC Q ss_conf 4434552417989999742783211257 Q gi|254780420|r 100 TEPSNYRANMHFDAWLKSCGIIGLSGID 127 (396) Q Consensus 100 ~~~Sn~rs~~sL~~wL~~~~IpgI~GID 127 (396) .-|+||||.++|-. ..+|.++..|- T Consensus 26 ~LP~HWRsNKtLp~---~fkVv~L~~v~ 50 (134) T pfam00853 26 VLPSHWRSNKTLPV---AFKVVALGEVP 50 (134) T ss_pred CCCCCCCCCCCCCC---CEEEEEEEECC T ss_conf 18531124776754---16899964218 No 328 >cd01149 HutB Hemin binding protein HutB. These proteins have been shown to function as initial receptors in ABC transport of hemin and hemoproteins in many eubacterial species. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. Probab=32.90 E-value=36 Score=14.82 Aligned_cols=20 Identities=25% Similarity=0.373 Sum_probs=17.1 Q ss_pred CCHHHHHHCCCCEEEECCCC Q ss_conf 57446441357889985888 Q gi|254780420|r 237 TSYKDIMSLQPDGVLLSNGP 256 (396) Q Consensus 237 ~~~~~i~~~~pdgi~lSnGP 256 (396) .+.|.|++.+||-||++... T Consensus 182 vs~E~i~~~nPDvI~v~~~~ 201 (235) T cd01149 182 LSAEALIAAQPDVILVMSRG 201 (235) T ss_pred CCHHHHHHHCCCEEEEECCC T ss_conf 79999998699999995898 No 329 >cd06283 PBP1_RegR_EndR_KdgR_like Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR. Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR, all of which are members of the LacI-GalR family of bacterial transcription regulators. RegR regulates bacterial competence and the expression of virulence factors, including hyaluronidase. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA b Probab=32.84 E-value=36 Score=14.82 Aligned_cols=55 Identities=11% Similarity=0.120 Sum_probs=25.6 Q ss_pred EEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCC---CH---HHHHHHHHHCCCCCEEEEC Q ss_conf 366640465443455241798999974278321125---76---8999999853984236632 Q gi|254780420|r 91 VGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGI---DT---RALTVWIRDYGVSNSVIAH 147 (396) Q Consensus 91 ~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GI---DT---RaLTk~IRe~G~~~a~I~~ 147 (396) -|+||.++... .--+-...+.+.++++|.-.+-.. |. +...+.+++++ .-|+|.. T Consensus 2 Igvivp~l~np-f~~~i~~gi~~~a~~~gy~l~i~~s~~~~~~e~~~i~~l~~~~-vDgiIl~ 62 (267) T cd06283 2 IGVIVADITNP-FSSLVLKGIEDVCRAHGYQVLVCNSDNDPEKEKEYLESLLAYQ-VDGLIVN 62 (267) T ss_pred EEEEECCCCCH-HHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHCC-CCEEEEC T ss_conf 99997998759-9999999999999986998999978999899999999999669-9999985 No 330 >cd06285 PBP1_LacI_like_7 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=32.83 E-value=36 Score=14.82 Aligned_cols=14 Identities=21% Similarity=0.043 Sum_probs=6.5 Q ss_pred HHHHHHHCCCCEEC Q ss_conf 89999742783211 Q gi|254780420|r 111 FDAWLKSCGIIGLS 124 (396) Q Consensus 111 L~~wL~~~~IpgI~ 124 (396) +-+.|++.+||.+. T Consensus 70 ~~~~l~~~~iPvV~ 83 (265) T cd06285 70 FLDELTRRGVPFVL 83 (265) T ss_pred HHHHHHHCCCCEEE T ss_conf 99999967997899 No 331 >pfam00318 Ribosomal_S2 Ribosomal protein S2. Probab=32.82 E-value=36 Score=14.81 Aligned_cols=32 Identities=25% Similarity=0.377 Sum_probs=22.2 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEC Q ss_conf 5788998588888410112110122210257863478610 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICL 285 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGICl 285 (396) -||.||+.+ |.. ...+++++...+||+.|||= T Consensus 137 ~Pd~vii~d----~~~----~~~ai~Ea~~l~IP~I~ivD 168 (205) T pfam00318 137 LPDLVIVVD----PNK----EHIAIKEASKLGIPVIAIVD 168 (205) T ss_pred CCCEEEEEC----CCC----CHHHHHHHHHCCCCEEEECC T ss_conf 898699857----864----53899999875997565405 No 332 >PRK08005 ribulose-phosphate 3-epimerase; Validated Probab=32.53 E-value=37 Score=14.78 Aligned_cols=104 Identities=16% Similarity=0.152 Sum_probs=47.4 Q ss_pred HHHHHHCCCCEEC-----CCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHH---------CCCCCCCCHHHCC Q ss_conf 9999742783211-----2576899999985398423663276545897898887741---------5887432101101 Q gi|254780420|r 112 DAWLKSCGIIGLS-----GIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKN---------WSGLKGAELAKHV 177 (396) Q Consensus 112 ~~wL~~~~IpgI~-----GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~---------~~~~~~~dL~~~V 177 (396) -+++.+.|.--|+ --+-..+.++||+.|. ++.|+-.++. +++.+...+.. .|+..|+.+...+ T Consensus 74 i~~~~~~g~d~it~H~Ea~~~~~~~i~~Ik~~g~-k~GlAlnP~T--~i~~~~~~l~~vD~VLvMtV~PGf~GQ~Fi~~~ 150 (210) T PRK08005 74 LPWLAAIRPGWIFIHAESVQNPSEILADIRAIGA-KAGLALNPAT--PLLPYRYLALQLDALMIMTSEPDGRGQQFIAAM 150 (210) T ss_pred HHHHHHCCCCEEEEECCCCCCHHHHHHHHHHCCC-EEEEEECCCC--CHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHH T ss_conf 9999972998599935677699999999997498-0788837999--879987304007989998778999872117889 Q ss_pred CCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCCEEEE Q ss_conf 333321116865344543443467777714999967964102322222024541330 Q gi|254780420|r 178 TVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCIDYGIRSNLLRVLSSLGCRITIVG 234 (396) Q Consensus 178 s~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp 234 (396) --|-.. . +.......| -+|=|++...++.|.+.|+++.|+- T Consensus 151 ~~KI~~-~--------------r~~~~~~~I-~vDGGIn~~t~~~~~~aGad~~V~G 191 (210) T PRK08005 151 CEKVSQ-S--------------REHFPAAEC-WADGGITLRAARLLAAAGAQHLVIG 191 (210) T ss_pred HHHHHH-H--------------HHHCCCCCE-EEECCCCHHHHHHHHHCCCCEEEEC T ss_conf 999999-9--------------962877888-9978878899999998699999979 No 333 >cd01541 PBP1_AraR Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of AraR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which i Probab=32.16 E-value=37 Score=14.74 Aligned_cols=11 Identities=36% Similarity=0.401 Sum_probs=5.9 Q ss_pred HHHHHHHHCCC Q ss_conf 23222220245 Q gi|254780420|r 219 LLRVLSSLGCR 229 (396) Q Consensus 219 Ilr~L~~~g~~ 229 (396) .+++|.++|++ T Consensus 112 a~~~L~~~G~~ 122 (273) T cd01541 112 ATEYLIELGHR 122 (273) T ss_pred HHHHHHHCCCC T ss_conf 99999856996 No 334 >cd01144 BtuF Cobalamin binding protein BtuF. These proteins have been shown to function as initial receptors in ABC transport of vitamin B12 (cobalamin) in eubacterial and some archaeal species. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=32.04 E-value=18 Score=17.03 Aligned_cols=21 Identities=29% Similarity=0.572 Sum_probs=17.3 Q ss_pred CCCHHHHHHCCCCEEEECCCC Q ss_conf 657446441357889985888 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNGP 256 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnGP 256 (396) ..+.|.|.+.+||-||++..+ T Consensus 176 ~vs~E~i~~~nPDvI~v~~~~ 196 (245) T cd01144 176 QVSWEDVLAANPDVIVLSPCG 196 (245) T ss_pred CCCHHHHHHHCCCEEEEECCC T ss_conf 757889998599999995898 No 335 >COG1058 CinA Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA [General function prediction only] Probab=32.03 E-value=38 Score=14.73 Aligned_cols=57 Identities=25% Similarity=0.410 Sum_probs=30.2 Q ss_pred HHHHHHHHCCCC---EEEECCC-CHHHHH---HCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCC Q ss_conf 232222202454---1330465-744644---1357889985888884101121101222102578 Q gi|254780420|r 219 LLRVLSSLGCRI---TIVGAET-SYKDIM---SLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSG 277 (396) Q Consensus 219 Ilr~L~~~g~~v---~Vvp~~~-~~~~i~---~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~ 277 (396) |-+.|.++|++| ++|+=+- .+.+.+ ..++|-||+|+|=| |.+-+. -.+.+.+++... T Consensus 26 la~~L~~~G~~v~~~~~VgD~~~~I~~~l~~a~~r~D~vI~tGGLG-PT~DDi-T~e~vAka~g~~ 89 (255) T COG1058 26 LADELTELGVDLARITTVGDNPDRIVEALREASERADVVITTGGLG-PTHDDL-TAEAVAKALGRP 89 (255) T ss_pred HHHHHHHCCCEEEEEEECCCCHHHHHHHHHHHHHCCCEEEECCCCC-CCCCHH-HHHHHHHHHCCC T ss_conf 9999996496189999639999999999999971899899879858-996276-899999982998 No 336 >TIGR01467 cobI_cbiL precorrin-2 C20-methyltransferase; InterPro: IPR006364 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. In the anaerobic pathway, cobalt is inserted into precorrin-2 by CbiK to generate cobalt-precorrin-2, which is the substrate for CbiL, a C20 methyltransferase . In Clostridium difficile, CbiK and CbiL are fused into a bifunctional enzyme. In the aerobic pathway, CobI catalyses the methylation of precorrin-2 to precorrin-3a. This entry represents CbiL and CobI precorrin-2 C20-methyltransferases (2.1.1.130 from EC), both as stand-alone enzymes and when CbiL forms part of a bifunctional enzyme. This entry also includes other closely related S-adenosylmethionine-dependent methyltransferases involved in Cbl biosynthesis. ; GO: 0008757 S-adenosylmethionine-dependent methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=31.85 E-value=22 Score=16.36 Aligned_cols=42 Identities=26% Similarity=0.281 Sum_probs=29.9 Q ss_pred CCCEEEEEECC--CCCH----HHHHHHH--HCCCCEEEECCCCHHHHHHC Q ss_conf 77149999679--6410----2322222--02454133046574464413 Q gi|254780420|r 204 AKYHVVCIDYG--IRSN----LLRVLSS--LGCRITIVGAETSYKDIMSL 245 (396) Q Consensus 204 ~~~kIvvIDfG--iK~n----Ilr~L~~--~g~~v~Vvp~~~~~~~i~~~ 245 (396) .+..||++=.| .=|| +.+.|.. .+..+.++|-=||+..+-+. T Consensus 99 ~G~~Vaf~TlGDP~~YSTF~yl~~~l~~~~~~~~~e~vPGitS~~a~As~ 148 (243) T TIGR01467 99 EGRDVAFITLGDPSLYSTFSYLLQRLKEALPGIEVEVVPGITSFAACASA 148 (243) T ss_pred CCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHH T ss_conf 69828999845720223799999999863089617997488717899998 No 337 >pfam07090 DUF1355 Protein of unknown function (DUF1355). This family consists of several hypothetical bacterial proteins of around 250 residues in length. The function of this family is unknown. Probab=31.78 E-value=32 Score=15.24 Aligned_cols=47 Identities=19% Similarity=0.137 Sum_probs=33.0 Q ss_pred EEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHH----HH-CCCCEEEECC Q ss_conf 499996796410232222202454133046574464----41-3578899858 Q gi|254780420|r 207 HVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDI----MS-LQPDGVLLSN 254 (396) Q Consensus 207 kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i----~~-~~pdgi~lSn 254 (396) .-+-.+.|+++ ++..|.+.|.+|+-.|.....+++ .+ .++|.||||. T Consensus 23 ~s~~~~~ga~~-~~~al~~~~~~v~~mp~h~a~~~FP~t~eeL~~YD~VILSD 74 (183) T pfam07090 23 TSLKYDEGADL-LLFAILRSPYDVDYMPAHDAQIAFPVTLEELSAYDAVILSD 74 (183) T ss_pred CCCCHHCCHHH-HHHHHHHCCCEEEEECCCCCHHHCCCCHHHHHCCCEEEEEE T ss_conf 43110048799-99999867951897423104645773888884088899950 No 338 >TIGR01328 met_gam_lyase methionine gamma-lyase; InterPro: IPR006237 This family of sequences is a methionine gamma-lyase subset of a family of PLP-dependent trans-sulphuration enzymes. The member from the parasite Trichomonas vaginalis is described as catalyzing alpha gamma- and alpha-beta eliminations and gamma-replacement reactions on methionine, cysteine, and some derivatives. Likewise, the enzyme from Pseudomonas degrades cysteine as well as methionine.. Probab=31.74 E-value=38 Score=14.69 Aligned_cols=47 Identities=19% Similarity=0.393 Sum_probs=28.1 Q ss_pred CCCCCHHHHHHH-HHCCCCEEEECCCCHHHHHH-CCC-CEEEECCCCCCCC Q ss_conf 796410232222-20245413304657446441-357-8899858888841 Q gi|254780420|r 213 YGIRSNLLRVLS-SLGCRITIVGAETSYKDIMS-LQP-DGVLLSNGPGDPA 260 (396) Q Consensus 213 fGiK~nIlr~L~-~~g~~v~Vvp~~~~~~~i~~-~~p-dgi~lSnGPGdP~ 260 (396) ||.-+.+++.+. +.|.+|.-+|.+ ..++|.+ ++| --|+.=--|-+|. T Consensus 108 YGCTfall~~~ltkfgi~v~f~~~~-~pee~~a~i~dnT~ivYfETPaNPt 157 (392) T TIGR01328 108 YGCTFALLEDALTKFGIDVKFVDMA-SPEEVKAAIKDNTKIVYFETPANPT 157 (392) T ss_pred CCCHHHHHHHHHHHHCCEEEEECCC-CHHHHHHHCCCCCEEEEEECCCCCC T ss_conf 2014899873267737527983178-7799998448997799986599875 No 339 >cd06270 PBP1_GalS_like Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Transcription of the galactose regulon genes is regulated by Gal iso-repressor (GalS) and Gal repressor (GalR) in different ways, but both repressors recognize the same DNA binding site in the absence of D-galactose. GalS is a dimeric protein like GalR,and its major role is in regulating expression of the high-affinity galactose transporter encoded by the mgl operon, whereas GalR is the exclusive regulator of galactose permease, the low-affinity galactose transporter. GalS and GalR are members of the LacI-GalR family of transcription regulators and both contain the type I periplasmic binding protein-like fold. Hence, they are homologous to the periplasmic sugar bindi Probab=31.61 E-value=38 Score=14.68 Aligned_cols=54 Identities=15% Similarity=0.113 Sum_probs=22.0 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCCCEEC---CCCH---HHHHHHHHHCCCCCEEEEC Q ss_conf 666404654434552417989999742783211---2576---8999999853984236632 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGIIGLS---GIDT---RALTVWIRDYGVSNSVIAH 147 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~---GIDT---RaLTk~IRe~G~~~a~I~~ 147 (396) |+||.++.. |.--+--..+.+.++++|.-.+- +-|. +...+.+++++ .-|+|.. T Consensus 3 Gvivp~i~n-pff~~i~~~ie~~~~~~gy~~ll~~s~~~~~~e~~~l~~l~~~~-vdGiIi~ 62 (268) T cd06270 3 GLVVSDLDG-PFFGPLLSGVESVARKAGKHLIITAGHHSAEKEREAIEFLLERR-CDALILH 62 (268) T ss_pred EEEECCCCC-HHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHCC-CCEEEEE T ss_conf 999778877-99999999999999985999999958999899999999999659-9999995 No 340 >cd01542 PBP1_TreR_like Ligand-binding domain of DNA transcription repressor specific for trehalose (TreR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for trehalose (TreR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of TreR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding wh Probab=31.60 E-value=38 Score=14.68 Aligned_cols=11 Identities=27% Similarity=0.323 Sum_probs=5.0 Q ss_pred HHHHHHHHCCC Q ss_conf 23222220245 Q gi|254780420|r 219 LLRVLSSLGCR 229 (396) Q Consensus 219 Ilr~L~~~g~~ 229 (396) .+++|.++|++ T Consensus 105 a~~~L~~~Ghr 115 (259) T cd01542 105 LGEYLAQQGHK 115 (259) T ss_pred HHHHHHHHCCC T ss_conf 99999982898 No 341 >TIGR03099 dCO2ase_PEP1 pyridoxal-dependent decarboxylase, exosortase system type 1 associated. The sequences in this family contain the pyridoxal binding domain (pfam02784) and C-terminal sheet domain (pfam00278) of a family of Pyridoxal-dependent decarboxylases. Characterized enzymes in this family decarboxylate substrates such as ornithine, diaminopimelate and arginine. The genes of the family modeled here, with the exception of those observed in certain Burkholderia species, are all found in the context of exopolysaccharide biosynthesis loci containing the exosortase/PEP-CTERM protein sorting system. More specifically, these are characteristic of the type 1 exosortase system represented by the Genome Property GenProp0652. The substrate of these enzymes may be a precursor of the carrier or linker which is hypothesized to release the PEP-CTERM protein from the exosortase enzyme. These enzymes are apparently most closely related to the diaminopimelate decarboxylase modeled by TIGR01048 Probab=31.42 E-value=38 Score=14.66 Aligned_cols=38 Identities=16% Similarity=0.061 Sum_probs=16.3 Q ss_pred CCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECC Q ss_conf 7989999742783211257689999998539842366327 Q gi|254780420|r 109 MHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHH 148 (396) Q Consensus 109 ~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~ 148 (396) ..+-.+|.+.+. |+.=+=.-.| ++.+..|.....|... T Consensus 62 ~~il~~l~~~G~-g~DvaS~gEl-~~al~~G~~~~~Ii~~ 99 (398) T TIGR03099 62 PALLAHMAPLVD-GFDVASAGEL-AVALDTGYDPGCISFA 99 (398) T ss_pred HHHHHHHHHCCC-CEEEECHHHH-HHHHHCCCCCHHEEEC T ss_conf 999999998199-2999699999-9999849991001778 No 342 >COG3220 Uncharacterized protein conserved in bacteria [Function unknown] Probab=30.96 E-value=24 Score=16.12 Aligned_cols=40 Identities=25% Similarity=0.510 Sum_probs=24.9 Q ss_pred CCCCEEEEECCCCCEEEEH--HHCCC-CCEEEEEECCCCEEEE Q ss_conf 7897099961457615703--43699-8289999889980872 Q gi|254780420|r 315 ISNGTVEIVSMNHGFAIDA--SSLPA-GLEETHISLFDSSNCG 354 (396) Q Consensus 315 ~~t~kv~iTsqNHg~~v~~--~slp~-~~~v~~~~~nD~tieg 354 (396) +.-+.+|++++||||.+.. +.+|. .....|++--|.-.+| T Consensus 171 LDvNNiyVna~NHG~d~~~yL~~~p~~rV~eIHlAGh~~~~~G 213 (282) T COG3220 171 LDVNNIYVNAVNHGFDPEEYLAALPVERVGEIHLAGHDEEEEG 213 (282) T ss_pred EEECCEEEECCCCCCCHHHHHHCCCHHHHEEEEECCCCCCCCC T ss_conf 9614327850136888999984189878411122367767687 No 343 >cd06274 PBP1_FruR Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs. Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to members of the type I periplasmic binding protein superfamily. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor Probab=30.89 E-value=39 Score=14.60 Aligned_cols=21 Identities=24% Similarity=0.074 Sum_probs=10.8 Q ss_pred EEEE-CCCCCHHHHHHHHHCCC Q ss_conf 9996-79641023222220245 Q gi|254780420|r 209 VCID-YGIRSNLLRVLSSLGCR 229 (396) Q Consensus 209 vvID-fGiK~nIlr~L~~~g~~ 229 (396) |..| +..=+..+++|.++|++ T Consensus 96 V~~Dn~~~~~~a~~~L~~~Ghr 117 (264) T cd06274 96 VVSDNRDGAAELTRELLAAPPE 117 (264) T ss_pred EEECHHHHHHHHHHHHHHCCCC T ss_conf 9976699999999999985996 No 344 >KOG2708 consensus Probab=30.74 E-value=28 Score=15.58 Aligned_cols=50 Identities=26% Similarity=0.377 Sum_probs=25.7 Q ss_pred CCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE--ECHHHHHHHHC Q ss_conf 357889985888884101121101222102578634786--10058878752 Q gi|254780420|r 245 LQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI--CLGHQLLGLAL 294 (396) Q Consensus 245 ~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI--ClGhQ~l~~a~ 294 (396) .+.|-|-..-|||--+.....++....-.+-.++|+.|+ |.||--++... T Consensus 69 ~diD~icyTKGPGmgaPL~~vaivaRtlsllw~kPlv~VNHCigHIEMGR~i 120 (336) T KOG2708 69 DDIDCICYTKGPGMGAPLSVVAIVARTLSLLWNKPLVGVNHCIGHIEMGREI 120 (336) T ss_pred HHCCEEEECCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCEE T ss_conf 2287899727898787626689999999998279832301211113221022 No 345 >PRK10765 nitroreductase A; Provisional Probab=30.70 E-value=20 Score=16.66 Aligned_cols=48 Identities=13% Similarity=0.077 Sum_probs=26.1 Q ss_pred HHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCCHHHCCCCCC Q ss_conf 9998539842366327654589789888774158874321011013333 Q gi|254780420|r 133 VWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGAELAKHVTVSQ 181 (396) Q Consensus 133 k~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~dL~~~Vs~~~ 181 (396) ..|+++-+.+.- ...+-.+.+++.+.+.+...|.-.+....+-+-++. T Consensus 6 e~i~~rRSiR~F-~~~pv~~e~l~~il~aa~~ApSs~n~Q~~siI~V~D 53 (240) T PRK10765 6 ELILGHRSIRHF-TDEPISEAQLEAIINAARAASSSSFLQCSSIIRITD 53 (240) T ss_pred HHHHHCCCCCCC-CCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEECC T ss_conf 999976753477-999899999999999999786813588748999659 No 346 >cd06324 PBP1_ABC_sugar_binding_like_13 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=30.36 E-value=40 Score=14.54 Aligned_cols=16 Identities=6% Similarity=0.129 Sum_probs=7.7 Q ss_pred HHHHC--CCCEEEECCCC Q ss_conf 64413--57889985888 Q gi|254780420|r 241 DIMSL--QPDGVLLSNGP 256 (396) Q Consensus 241 ~i~~~--~pdgi~lSnGP 256 (396) +++.. +.|+||-.|-+ T Consensus 195 ~lL~~~Pdi~aI~~~ND~ 212 (305) T cd06324 195 NLLKRYPDVRLIWAANDQ 212 (305) T ss_pred HHHHHCCCCCEEEECCCH T ss_conf 999739999889978857 No 347 >PRK00124 hypothetical protein; Validated Probab=30.28 E-value=10 Score=18.71 Aligned_cols=81 Identities=12% Similarity=0.090 Sum_probs=44.0 Q ss_pred CCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHH---HHCCCCCCEEEEECHHHH Q ss_conf 79641023222220245413304657446441357889985888884101121101222---102578634786100588 Q gi|254780420|r 213 YGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIIC---KLVDSGMPMFGICLGHQL 289 (396) Q Consensus 213 fGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~---~~~~~~iPilGIClGhQ~ 289 (396) |=+|.-|.|.=.++++.|++|-+.. ...-..-.-.-|+.+.|+ ...+.++.+... -++...+|+-..||+-.. T Consensus 10 CPVk~~i~~~a~r~~i~v~~Van~~-~~~~~~~~i~~v~V~~g~---DaAD~~I~~~~~~gDiVIT~DipLAa~~l~Kga 85 (149) T PRK00124 10 CPVKEIIYRVAERHGIPVTLVANHF-LRVPYSPFIRTLYVDSGF---DAADNEIVERAEKGDIVITQDYPLAAEALEKGA 85 (149) T ss_pred CCHHHHHHHHHHHHCCEEEEEECCC-CCCCCCCCEEEEEECCCC---CHHHHHHHHHCCCCCEEEECCHHHHHHHHHCCC T ss_conf 9409999999998697699996897-069998764899989997---639999998388999999698899999998798 Q ss_pred HHHHCCCC Q ss_conf 78752854 Q gi|254780420|r 290 LGLALGAQ 297 (396) Q Consensus 290 l~~a~G~~ 297 (396) +++..-|+ T Consensus 86 ~vi~prG~ 93 (149) T PRK00124 86 IVLNPRGE 93 (149) T ss_pred EEECCCCC T ss_conf 99999986 No 348 >PRK05417 glutathione-dependent formaldehyde-activating enzyme; Provisional Probab=30.27 E-value=35 Score=14.93 Aligned_cols=37 Identities=22% Similarity=0.327 Sum_probs=25.3 Q ss_pred EEECCCCEEEECC-CCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 6867997789834-878877670016899999999997 Q gi|254780420|r 355 FRLIDQPVFSVQY-HPESSPGPQDSYYLFSTFIDFMHK 391 (396) Q Consensus 355 i~~~~~~~~~vQf-HPEa~PGP~D~~~~F~~F~~~i~~ 391 (396) |++++.|+||.-| |||-+.---=+.-=|.-|+..|-+ T Consensus 107 IEn~~HpF~GLDFvHtELs~~~GW~~P~FAaFVSSIIE 144 (191) T PRK05417 107 IENKDHPFYGLDFVHTELSQEQGWSAPGFAAFVSSIIE 144 (191) T ss_pred ECCCCCCCCCCCEECHHHCCCCCCCCCCHHHHHHHHHH T ss_conf 11678873463310110023358887308889999986 No 349 >PRK11749 putative oxidoreductase; Provisional Probab=30.16 E-value=40 Score=14.52 Aligned_cols=38 Identities=29% Similarity=0.520 Sum_probs=22.7 Q ss_pred HHHHHCCCCE---EEECCCCHHHHHHCCCCEEEECCCCCCCC Q ss_conf 2222024541---33046574464413578899858888841 Q gi|254780420|r 222 VLSSLGCRIT---IVGAETSYKDIMSLQPDGVLLSNGPGDPA 260 (396) Q Consensus 222 ~L~~~g~~v~---Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~ 260 (396) .|.+.|+++. .+.-+.+++++.+ ++|+|||.-|=..|. T Consensus 199 ~i~~~GV~~~~n~~vG~ditl~~L~~-~ydAV~lAtGa~~~r 239 (460) T PRK11749 199 RLLKLGVEIRTNTAVGRDITLDELRA-EYDAVFIGTGLGLPR 239 (460) T ss_pred HHHHCCCEEECCEEECCCCCHHHHHH-CCCEEEEECCCCCCC T ss_conf 99853978985558566432887741-148899944789886 No 350 >pfam00834 Ribul_P_3_epim Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate. Probab=29.93 E-value=41 Score=14.49 Aligned_cols=26 Identities=15% Similarity=0.420 Sum_probs=18.1 Q ss_pred EEEECCCCCHHHHHHHHHCCCCEEEE Q ss_conf 99967964102322222024541330 Q gi|254780420|r 209 VCIDYGIRSNLLRVLSSLGCRITIVG 234 (396) Q Consensus 209 vvIDfGiK~nIlr~L~~~g~~v~Vvp 234 (396) ..+|=|+....+..|.+.|+++.|.- T Consensus 169 I~vDGGIn~~ti~~l~~~Gad~~V~G 194 (201) T pfam00834 169 IEVDGGVNLDNIPQIAEAGADVLVAG 194 (201) T ss_pred EEEECCCCHHHHHHHHHCCCCEEEEC T ss_conf 99989888999999998799999978 No 351 >COG2243 CobF Precorrin-2 methylase [Coenzyme metabolism] Probab=29.79 E-value=23 Score=16.17 Aligned_cols=41 Identities=22% Similarity=0.280 Sum_probs=30.0 Q ss_pred CCCEEEEEECC------CCCHHHHHHHHHCCCCEEEECCCCHHHHHH Q ss_conf 77149999679------641023222220245413304657446441 Q gi|254780420|r 204 AKYHVVCIDYG------IRSNLLRVLSSLGCRITIVGAETSYKDIMS 244 (396) Q Consensus 204 ~~~kIvvIDfG------iK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~ 244 (396) .+..|+++=.| .=..|++.|..+|+++.+||-=+|+...-+ T Consensus 89 ~G~~VAf~~lGDP~~YsTf~~l~~~l~~~~~e~e~VPGVsS~~a~aa 135 (234) T COG2243 89 AGRDVAFLTLGDPTFYSTFMYLLERLRERGFEVEVVPGVSSFSACAA 135 (234) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHH T ss_conf 69928999814852888799999985306973599589436999999 No 352 >PRK11914 diacylglycerol kinase; Reviewed Probab=29.70 E-value=41 Score=14.46 Aligned_cols=42 Identities=19% Similarity=0.122 Sum_probs=19.4 Q ss_pred CHHHHHHHCCCCEECC-----CCHHHHHHHHHHCCCCCEEEECCCCCC Q ss_conf 9899997427832112-----576899999985398423663276545 Q gi|254780420|r 110 HFDAWLKSCGIIGLSG-----IDTRALTVWIRDYGVSNSVIAHHPDGQ 152 (396) Q Consensus 110 sL~~wL~~~~IpgI~G-----IDTRaLTk~IRe~G~~~a~I~~~~~~~ 152 (396) .+.+.|+++++.-..- -|.+.|++..-+.|. ..+|+.+.|+. T Consensus 28 ~i~~~l~~~g~~~~~~~t~~~~~a~~la~~a~~~g~-d~vv~~GGDGT 74 (304) T PRK11914 28 RAIARLHHRGVDVVEIVGTDAHHARHLVAAALAKGT-DALVVVGGDGV 74 (304) T ss_pred HHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCC-CEEEEEECCHH T ss_conf 999999987990999932787899999998886499-69999956259 No 353 >COG2242 CobL Precorrin-6B methylase 2 [Coenzyme metabolism] Probab=29.55 E-value=41 Score=14.45 Aligned_cols=82 Identities=16% Similarity=0.250 Sum_probs=43.8 Q ss_pred CCEEEEEECC--CCCHHHHHHHHHCCC-CEEEECCCCHHHHHHC-CCCEEEECCCCCCCCCCCCCCHHHHHHHCC--CCC Q ss_conf 7149999679--641023222220245-4133046574464413-578899858888841011211012221025--786 Q gi|254780420|r 205 KYHVVCIDYG--IRSNLLRVLSSLGCR-ITIVGAETSYKDIMSL-QPDGVLLSNGPGDPAVTSVYSSPIICKLVD--SGM 278 (396) Q Consensus 205 ~~kIvvIDfG--iK~nIlr~L~~~g~~-v~Vvp~~~~~~~i~~~-~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~--~~i 278 (396) .-||.+||-= .-..|-|++.+.|++ +.|+..+++ +.+..+ .||+||+..| |.= + .+.+.....+. +++ T Consensus 58 ~~~v~AIe~~~~a~~~~~~N~~~fg~~n~~vv~g~Ap-~~L~~~~~~daiFIGGg-~~i---~-~ile~~~~~l~~ggrl 131 (187) T COG2242 58 SGRVIAIERDEEALELIERNAARFGVDNLEVVEGDAP-EALPDLPSPDAIFIGGG-GNI---E-EILEAAWERLKPGGRL 131 (187) T ss_pred CCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEECCCH-HHHCCCCCCCEEEECCC-CCH---H-HHHHHHHHHCCCCCEE T ss_conf 8559999258889999999999849996799954645-76369999999998798-777---8-9999999971868769 Q ss_pred CEEEEECHHHHHHH Q ss_conf 34786100588787 Q gi|254780420|r 279 PMFGICLGHQLLGL 292 (396) Q Consensus 279 PilGIClGhQ~l~~ 292 (396) =+=-|-|=+...++ T Consensus 132 V~naitlE~~~~a~ 145 (187) T COG2242 132 VANAITLETLAKAL 145 (187) T ss_pred EEEEECHHHHHHHH T ss_conf 99860088899999 No 354 >TIGR01982 UbiB 2-polyprenylphenol 6-hydroxylase; InterPro: IPR010232 This entry represents the enzyme (UbiB) which catalyses the first hydroxylation step in the ubiquinone biosynthetic pathway in bacteria , . The gene is also known as AarF in certain species. It is believed that the reaction converts 2-polyprenylphenol to 6-hydroxy-2-polyprenylphenol. Members are mainly proteobacteria.; GO: 0006744 ubiquinone biosynthetic process. Probab=29.52 E-value=29 Score=15.56 Aligned_cols=79 Identities=23% Similarity=0.333 Sum_probs=35.2 Q ss_pred CEEEEEECCCCCHHHHHHHHHCCC--CEEEE--CCCCHHHHHHCCCCEEEECCCCCCCCCCCCC-CHHHHHHHCC--CCC Q ss_conf 149999679641023222220245--41330--4657446441357889985888884101121-1012221025--786 Q gi|254780420|r 206 YHVVCIDYGIRSNLLRVLSSLGCR--ITIVG--AETSYKDIMSLQPDGVLLSNGPGDPAVTSVY-SSPIICKLVD--SGM 278 (396) Q Consensus 206 ~kIvvIDfGiK~nIlr~L~~~g~~--v~Vvp--~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~-~~~~~~~~~~--~~i 278 (396) -+|++||||| +=+|.+..++ .+.+- -+-+++.|.+.+.|.=-+ |++.+.. ....++.+.+ -+. T Consensus 309 g~i~~~DFGI----VG~l~~~~r~~LAE~l~Gf~~rDY~~VA~~H~~aGyv------p~~~d~~~f~~a~R~~~EP~~g~ 378 (452) T TIGR01982 309 GKIIALDFGI----VGRLSEEDRRYLAEILYGFINRDYRRVAEVHFDAGYV------PSDTDMAEFEQAIRAIGEPIFGQ 378 (452) T ss_pred CEEEEEECCC----CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC------CHHHHHHHHHHHHHHCCCCCCCC T ss_conf 8089985374----2001378899999999999854299999999862588------84356999999862001543366 Q ss_pred CEEEEECHHHHHHHHC Q ss_conf 3478610058878752 Q gi|254780420|r 279 PMFGICLGHQLLGLAL 294 (396) Q Consensus 279 PilGIClGhQ~l~~a~ 294 (396) |+=-|-+|+-|..+-- T Consensus 379 Pl~~IS~g~~L~~LF~ 394 (452) T TIGR01982 379 PLKEISVGKLLAKLFK 394 (452) T ss_pred CHHHCCHHHHHHHHHH T ss_conf 6774028999999999 No 355 >PRK10701 DNA-binding transcriptional regulator RstA; Provisional Probab=29.22 E-value=42 Score=14.41 Aligned_cols=89 Identities=20% Similarity=0.217 Sum_probs=36.9 Q ss_pred EEEEEE--CCCCCHHHHHHHHHCCCCEEEECCCCH-HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE Q ss_conf 499996--796410232222202454133046574-46441357889985888884101121101222102578634786 Q gi|254780420|r 207 HVVCID--YGIRSNLLRVLSSLGCRITIVGAETSY-KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI 283 (396) Q Consensus 207 kIvvID--fGiK~nIlr~L~~~g~~v~Vvp~~~~~-~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI 283 (396) ||+++| --+-..+.+.|.+.|++|.+...-..+ +.+....||-|+|-=+- |...+......++.. ...||+=+ T Consensus 3 kILlVEDd~~l~~~l~~~L~~~g~~v~~a~~g~~a~~~~~~~~~DlvilDi~L--P~~dG~~l~~~iR~~--~~~PiI~l 78 (240) T PRK10701 3 TIVFVEDDAEVGSLIAAYLAKHDIDVTVEPRGDRAEETILREQPDLVLLDIML--PGKDGMTICRDLRPK--WSGPIVLL 78 (240) T ss_pred EEEEEECCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHCCCCEEEEECCC--CCCCCCEEEEECCCC--CCCCEEEE T ss_conf 89999799999999999998879999998999999999861799999992899--767887876311025--89878999 Q ss_pred E-CH---HHHHHHHCCCCCC Q ss_conf 1-00---5887875285446 Q gi|254780420|r 284 C-LG---HQLLGLALGAQTV 299 (396) Q Consensus 284 C-lG---hQ~l~~a~G~~~~ 299 (396) . .+ -.+.|+..||.-| T Consensus 79 ta~~~~~d~v~gl~~GADDY 98 (240) T PRK10701 79 TSLDSDMNHILALEMGACDY 98 (240) T ss_pred EECCCHHHHHHHHHCCCCEE T ss_conf 40575788999997688776 No 356 >KOG0196 consensus Probab=29.21 E-value=29 Score=15.55 Aligned_cols=47 Identities=21% Similarity=0.344 Sum_probs=20.3 Q ss_pred EEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEE-------CCCCCC Q ss_conf 99967964102322222024541330465744644135788998-------588888 Q gi|254780420|r 209 VCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLL-------SNGPGD 258 (396) Q Consensus 209 vvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~l-------SnGPGd 258 (396) +++||-+||-= ....++.. ..+...++.-.|..++|+.+.+ ..|+|. T Consensus 472 ~ildYEvky~e-k~~~e~~~--~~~~t~~~~~ti~gL~p~t~YvfqVRarT~aG~G~ 525 (996) T KOG0196 472 VILDYEVKYYE-KDEDERSY--STLKTKTTTATITGLKPGTVYVFQVRARTAAGYGP 525 (996) T ss_pred CCEEEEEEEEE-CCCCCCCE--EEEECCCCEEEEECCCCCCEEEEEEEEECCCCCCC T ss_conf 53157999860-44564411--57750451688624689867999999832567788 No 357 >PRK03815 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=29.15 E-value=42 Score=14.40 Aligned_cols=25 Identities=32% Similarity=0.398 Sum_probs=16.2 Q ss_pred CCCCEECCCC-----HHHHHHHHHHCCCCC Q ss_conf 2783211257-----689999998539842 Q gi|254780420|r 118 CGIIGLSGID-----TRALTVWIRDYGVSN 142 (396) Q Consensus 118 ~~IpgI~GID-----TRaLTk~IRe~G~~~ 142 (396) ..++||+|-. |-.|...|+..|+.. T Consensus 89 ~~~I~ITGTNGKsTtt~li~~iL~~~g~~~ 118 (401) T PRK03815 89 PFSIWISGTNGKTTTTQMLTHLLEDFGAVS 118 (401) T ss_pred CCCEEEECCCCHHHHHHHHHHHHHHCCCCE T ss_conf 874899847777899999999998669813 No 358 >PRK00099 rplJ 50S ribosomal protein L10; Reviewed Probab=29.07 E-value=42 Score=14.39 Aligned_cols=83 Identities=17% Similarity=0.278 Sum_probs=43.3 Q ss_pred EEEEEEC-CCCCHHHH----HHHHHCCCCEEEECCC--------CHHHHHH--CCCCEEEECCCCCCCCCCCCCCHHHHH Q ss_conf 4999967-96410232----2222024541330465--------7446441--357889985888884101121101222 Q gi|254780420|r 207 HVVCIDY-GIRSNLLR----VLSSLGCRITIVGAET--------SYKDIMS--LQPDGVLLSNGPGDPAVTSVYSSPIIC 271 (396) Q Consensus 207 kIvvIDf-GiK~nIlr----~L~~~g~~v~Vvp~~~--------~~~~i~~--~~pdgi~lSnGPGdP~~~~~~~~~~~~ 271 (396) -+++.|| |+.-+-+. .|++.|+.+.|+.++. .++.+.. ..|.++++|+ +||....+...+..+ T Consensus 24 ~~~~~~y~Gltv~~~~~LR~~l~~~g~~~~VvKNtL~k~Al~~t~~~~l~~~l~G~tav~f~~--eD~~~~aK~~~~f~K 101 (164) T PRK00099 24 SAVVADYRGLTVAQMTELRKELREAGVELKVVKNTLARRALEGTGFEGLDDLLKGPTAIAFSN--EDPVAAAKVLKDFAK 101 (164) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHCCCCCCCCHHCCCCEEEEEEC--CCCCHHHHHHHHHHH T ss_conf 699997689966999999999997696799960389999995468755410147875999826--886278999999986 Q ss_pred HHCCCCCCEEEEECHHHHHHHH Q ss_conf 1025786347861005887875 Q gi|254780420|r 272 KLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 272 ~~~~~~iPilGIClGhQ~l~~a 293 (396) -. .+.=+.|-++.-+++... T Consensus 102 -~~-~~~~ikgg~~eg~~l~~~ 121 (164) T PRK00099 102 -DN-KKLEIKGGAFEGKVLSAE 121 (164) T ss_pred -HC-CCCEEEEEEECCEECCHH T ss_conf -45-882789988778453899 No 359 >cd06278 PBP1_LacI_like_2 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=29.04 E-value=42 Score=14.39 Aligned_cols=11 Identities=36% Similarity=0.712 Sum_probs=4.4 Q ss_pred HHHHHHHHCCC Q ss_conf 23222220245 Q gi|254780420|r 219 LLRVLSSLGCR 229 (396) Q Consensus 219 Ilr~L~~~g~~ 229 (396) .+++|.++|++ T Consensus 106 a~~~L~~~Gh~ 116 (266) T cd06278 106 AAELLLAKGCR 116 (266) T ss_pred HHHHHHHCCCC T ss_conf 99999980998 No 360 >TIGR02461 osmo_MPG_phos mannosyl-3-phosphoglycerate phosphatase; InterPro: IPR012813 Members of this family are mannosyl-3-phosphoglycerate phosphatase (3.1.3.70 from EC). It acts sequentially after mannosyl-3-phosphoglycerate synthase (2.4.1.217 from EC) in a two-step pathway of biosynthesis of the compatible solute mannosylglycerate, a typical osmolyte of thermophiles.; GO: 0050531 mannosyl-3-phosphoglycerate phosphatase activity, 0051479 mannosylglycerate biosynthetic process, 0005737 cytoplasm. Probab=29.01 E-value=41 Score=14.42 Aligned_cols=40 Identities=15% Similarity=0.231 Sum_probs=22.4 Q ss_pred CEEEEEECCCCCHHHHHHHH---------H-C---CC-CEEEEC-CCCHHHHHHC Q ss_conf 14999967964102322222---------0-2---45-413304-6574464413 Q gi|254780420|r 206 YHVVCIDYGIRSNLLRVLSS---------L-G---CR-ITIVGA-ETSYKDIMSL 245 (396) Q Consensus 206 ~kIvvIDfGiK~nIlr~L~~---------~-g---~~-v~Vvp~-~~~~~~i~~~ 245 (396) -.-.||.+|.++.=+|...+ . . |- +-+..| +++.++|..+ T Consensus 88 G~y~V~eLG~~~~~IR~~L~~alaeacte~E~se~~Gn~~~~~yg~~t~Eev~~l 142 (248) T TIGR02461 88 GGYEVIELGKPVAEIRAALKSALAEACTEIEASEEAGNLSIKGYGDMTAEEVARL 142 (248) T ss_pred CCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCHHHHHHH T ss_conf 7606742367678999998999999999987444314501112278888899986 No 361 >cd01147 HemV-2 Metal binding protein HemV-2. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=28.82 E-value=42 Score=14.36 Aligned_cols=23 Identities=30% Similarity=0.401 Sum_probs=19.2 Q ss_pred CCCHHHHHHCCCCEEEECCCCCC Q ss_conf 65744644135788998588888 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNGPGD 258 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnGPGd 258 (396) ..+.|.|++.+||-||++.+.++ T Consensus 202 ~vs~E~i~~~nPDvI~~~~~~~~ 224 (262) T cd01147 202 EVSPEQILLWNPDVIFLDTGSFY 224 (262) T ss_pred CCCHHHHHHHCCCEEEEECCCCC T ss_conf 46999999859998999278753 No 362 >PRK00286 xseA exodeoxyribonuclease VII large subunit; Reviewed Probab=28.43 E-value=43 Score=14.32 Aligned_cols=86 Identities=17% Similarity=0.242 Sum_probs=53.6 Q ss_pred CCCCCEEEEEEC--C-CCCHHHHHHHHHC--CCCEEEECCC----CHHHHH-------H-CCCCEEEECCCCCCCCCCCC Q ss_conf 777714999967--9-6410232222202--4541330465----744644-------1-35788998588888410112 Q gi|254780420|r 202 SDAKYHVVCIDY--G-IRSNLLRVLSSLG--CRITIVGAET----SYKDIM-------S-LQPDGVLLSNGPGDPAVTSV 264 (396) Q Consensus 202 ~~~~~kIvvIDf--G-iK~nIlr~L~~~g--~~v~Vvp~~~----~~~~i~-------~-~~pdgi~lSnGPGdP~~~~~ 264 (396) |.-..+|+||== | +-+-|++.+.+|. |++.++|.-. ...+|. . .++|.|+|.=|=|..++.-. T Consensus 132 P~~P~~IgvITS~tgAa~~Di~~~~~~R~p~~~i~l~p~~VQG~~A~~~I~~ai~~~~~~~~~DvIIi~RGGGS~eDL~~ 211 (443) T PRK00286 132 PFFPKRIGVITSPTGAAIRDILTVLSRRFPSVEVIIYPTLVQGEGAAASIVEAIERANARGEVDVLIVARGGGSLEDLWA 211 (443) T ss_pred CCCCCEEEEEECCCHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHH T ss_conf 87855799983684389999999985049965999981456265479999999999852248888999368788889765 Q ss_pred CCH-HHHHHHCCCCCCEEEEECHHH Q ss_conf 110-122210257863478610058 Q gi|254780420|r 265 YSS-PIICKLVDSGMPMFGICLGHQ 288 (396) Q Consensus 265 ~~~-~~~~~~~~~~iPilGIClGhQ 288 (396) ..- .+.+.+...++||.- .-||+ T Consensus 212 FNdE~varaI~~s~iPVIS-aIGHE 235 (443) T PRK00286 212 FNDEAVARAIAASKIPVIS-AVGHE 235 (443) T ss_pred CCCHHHHHHHHHCCCCEEE-CCCCC T ss_conf 1879999999848997895-14667 No 363 >cd03129 GAT1_Peptidase_E_like Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E_like proteins. Type 1 glutamine amidotransferase (GATase1)-like domain found in peptidase E_like proteins. This group contains proteins similar to the aspartyl dipeptidases Salmonella typhimurium peptidase E and Xenopus laevis peptidase E and, extracellular cyanophycinases from Pseudomonas anguilliseptica BI (CphE) and Synechocystis sp. PCC 6803 CphB. In bacteria peptidase E is believed to play a role in degrading peptides generated by intracellular protein breakdown or imported into the cell as nutrient sources. Peptidase E uniquely hydrolyses only Asp-X dipeptides (where X is any amino acid), and one tripeptide Asp-Gly-Gly. Cyanophycinases are intracellular exopeptidases which hydrolyze the polymer cyanophycin (multi L-arginyl-poly-L-aspartic acid) to the dipeptide beta-Asp-Arg. Peptidase E and cyanophycinases are thought to have a Ser-His-Glu catalytic triad which differs from Probab=28.43 E-value=43 Score=14.32 Aligned_cols=87 Identities=20% Similarity=0.244 Sum_probs=48.0 Q ss_pred CCCEEEEEECCC------CCHHHHHHHHHCCCCEEEECCC--CHHHHHH--CCCCEEEECCCCCCCCCCCCC-----CHH Q ss_conf 771499996796------4102322222024541330465--7446441--357889985888884101121-----101 Q gi|254780420|r 204 AKYHVVCIDYGI------RSNLLRVLSSLGCRITIVGAET--SYKDIMS--LQPDGVLLSNGPGDPAVTSVY-----SSP 268 (396) Q Consensus 204 ~~~kIvvIDfGi------K~nIlr~L~~~g~~v~Vvp~~~--~~~~i~~--~~pdgi~lSnGPGdP~~~~~~-----~~~ 268 (396) ...||++|-... -.+..+.+.++||++..++.-. .-+++.+ .+-|+|+++ .||+...-.. ... T Consensus 28 ~~~~i~~iptAs~~~~~~~~~~~~~~~~lG~~~~~l~~~~~a~~~~~~~~l~~ad~i~~~--GG~~~~~~~~~~~t~~~~ 105 (210) T cd03129 28 AGARVLFIPTASGDRDEYGEEYRAAFERLGVEVVHLLLIDTANDPDVVARLLEADGIFVG--GGNQLRLLSVLRETPLLD 105 (210) T ss_pred CCCEEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHCCEEEEC--CCCHHHHHHHHHHCCHHH T ss_conf 996499994898880899999999999749948875122567871699999719999988--988999999997488999 Q ss_pred HHHHHCCCCCCEEEEECHHHHHHH Q ss_conf 222102578634786100588787 Q gi|254780420|r 269 IICKLVDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 269 ~~~~~~~~~iPilGIClGhQ~l~~ 292 (396) .+++.+..+.++.|.-=|.-+++. T Consensus 106 ~l~~~~~~G~v~~G~SAGa~~~~~ 129 (210) T cd03129 106 AILKRVARGVVIGGTSAGAAVMGE 129 (210) T ss_pred HHHHHHHCCCEEEEECHHHHHCCC T ss_conf 999999849909973557886287 No 364 >cd01982 Chlide_reductase_Z Chlide_reductase_Z : Z subunit of chlorophyllide (chlide) reductase (BchZ). Chlide reductase participates in photosynthetic pigment synthesis playing a role in the conversion of chlorophylls(Chl) into bacteriochlorophylls (BChl). Chlide reductase catalyzes the reduction of the B-ring of the tetrapyrolle. Chlide reductase is a three subunit enzyme (subunits are designated BchX, BchY and BchZ). The similarity between these three subunits and the subunits for nitrogenase suggests that BchX serves as an electron donor for the BchY-BchY catalytic subunits. Probab=28.39 E-value=41 Score=14.47 Aligned_cols=27 Identities=19% Similarity=0.365 Sum_probs=21.2 Q ss_pred HHHHHHHHCCCCEE-EECCCCHHHHHHC Q ss_conf 23222220245413-3046574464413 Q gi|254780420|r 219 LLRVLSSLGCRITI-VGAETSYKDIMSL 245 (396) Q Consensus 219 Ilr~L~~~g~~v~V-vp~~~~~~~i~~~ 245 (396) |-|.|...|.+|-+ .|..++.++|.++ T Consensus 177 ikRLL~~lGi~VN~V~P~gas~~dl~~L 204 (412) T cd01982 177 VKRLVTGIGAEVNHVYPFESHLAEIPKL 204 (412) T ss_pred HHHHHHHCCCEEEEEECCCCCHHHHHCC T ss_conf 9999997698478995499998998416 No 365 >TIGR01469 cobA_cysG_Cterm uroporphyrin-III C-methyltransferase; InterPro: IPR006366 This domain is found in a family of sequences that represent enzymes, or enzyme domains, with uroporphyrin-III C-methyltransferase activity. This enzyme catalyzes the first step committed to the biosynthesis of either siroheme or cobalamin (vitamin B12) rather than protoheme (heme). Cobalamin contains cobalt while siroheme contains iron. Siroheme is a cofactor for nitrite and sulphite reductases and therefore plays a role in cysteine biosynthesis; many members of this family are CysG, siroheme synthase, with an additional N-terminal domain and with additional oxidation and iron insertion activities.; GO: 0008168 methyltransferase activity, 0006779 porphyrin biosynthetic process. Probab=28.24 E-value=18 Score=16.92 Aligned_cols=22 Identities=32% Similarity=0.452 Sum_probs=9.5 Q ss_pred CCCCCCCCCCHHHHHHHCCCCCC Q ss_conf 88410112110122210257863 Q gi|254780420|r 257 GDPAVTSVYSSPIICKLVDSGMP 279 (396) Q Consensus 257 GdP~~~~~~~~~~~~~~~~~~iP 279 (396) |||...... -|....+.+.+|| T Consensus 87 GDP~vFGRg-GEE~~~L~~~GI~ 108 (242) T TIGR01469 87 GDPFVFGRG-GEEAEALAEAGIP 108 (242) T ss_pred CCCCEECCH-HHHHHHHHHCCCC T ss_conf 998437856-8999999966897 No 366 >TIGR00649 MG423 conserved hypothetical protein; InterPro: IPR004613 This is a family of conserved hypothetical proteins, the members of which contain an ATP-binding domain at the N-terminal end of the protein. It is possibly part of a superfamily of beta-lactmases.. Probab=28.19 E-value=43 Score=14.29 Aligned_cols=85 Identities=24% Similarity=0.320 Sum_probs=55.9 Q ss_pred EEEEEECCCCCH--HHHHH----HHHCCC---CEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCC Q ss_conf 499996796410--23222----220245---413304657446441357889985888884101121101222102578 Q gi|254780420|r 207 HVVCIDYGIRSN--LLRVL----SSLGCR---ITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSG 277 (396) Q Consensus 207 kIvvIDfGiK~n--Ilr~L----~~~g~~---v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~ 277 (396) -|+++|-|++.= .++.= .-.|.+ -.|.|-.+-..++.+ +-.|||||+|=-| +.+. +.-+++++ .. T Consensus 24 ev~i~D~G~~~PRVlIHed~di~~~~G~dL~~~~vIPD~~~l~~~~~-kvkgI~~tHGH~D--HIGa-v~yL~~~~--~~ 97 (593) T TIGR00649 24 EVVIIDAGIRFPRVLIHEDTDIAEMLGVDLIEKGVIPDFTYLKENED-KVKGIVITHGHED--HIGA-VPYLLHQY--GF 97 (593) T ss_pred EEEEEECCCCCCCEEEECCCCHHHHCCHHHHHCCCCCCCHHHHHHCC-CEEEEEECCCCCC--CCCH-HHHHHHHC--CC T ss_conf 59998232237964641552035440411432031689616752125-0678897087100--0122-88898656--88 Q ss_pred CCEEEEECHHHHHHHHCCCC Q ss_conf 63478610058878752854 Q gi|254780420|r 278 MPMFGICLGHQLLGLALGAQ 297 (396) Q Consensus 278 iPilGIClGhQ~l~~a~G~~ 297 (396) -||.|==|+..||-.-+-++ T Consensus 98 ~PiYGt~lt~~L~~~ki~~E 117 (593) T TIGR00649 98 PPIYGTPLTIALIKSKIKAE 117 (593) T ss_pred CCEECCHHHHHHHHHHHHHH T ss_conf 82404378999999988654 No 367 >pfam01497 Peripla_BP_2 Periplasmic binding protein. This family includes bacterial periplasmic binding proteins. Several of which are involved in iron transport. Probab=28.12 E-value=43 Score=14.28 Aligned_cols=20 Identities=25% Similarity=0.592 Sum_probs=16.9 Q ss_pred CCCHHHHHHCCCCEEEECCC Q ss_conf 65744644135788998588 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNG 255 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnG 255 (396) ..+.|.|++.+||-||+++. T Consensus 184 ~vs~E~i~~~nPD~I~~~~~ 203 (236) T pfam01497 184 PISFENILAADPDVIIVSGR 203 (236) T ss_pred CCCHHHHHHHCCCEEEEECC T ss_conf 78989999859999999789 No 368 >cd01545 PBP1_SalR Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. The SalR binds to glucose based compound Salicin which is chemically related to aspirin. The ligand-binding of SalR is structurally homologous to the periplasmic sugar-binding domain of ABC-transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand bind Probab=27.94 E-value=44 Score=14.26 Aligned_cols=28 Identities=11% Similarity=0.020 Sum_probs=13.0 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCC Q ss_conf 66640465443455241798999974278 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGI 120 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~I 120 (396) |+||.+...... -+-...+.+-++++|- T Consensus 3 Gvivp~i~npff-~~l~~gi~~~~~~~GY 30 (270) T cd01545 3 GLLYDNPSPGYV-SEIQLGALDACRDTGY 30 (270) T ss_pred EEEECCCCCHHH-HHHHHHHHHHHHHCCC T ss_conf 999799973899-9999999999998499 No 369 >PRK11205 tbpA thiamine transporter substrate binding subunit; Provisional Probab=27.91 E-value=34 Score=15.05 Aligned_cols=48 Identities=17% Similarity=0.109 Sum_probs=25.9 Q ss_pred CCCCCC----CCH-HHHHHHCCCCE--ECCCCHHH-HHHHHHHCCCCCEEEECCCC Q ss_conf 455241----798-99997427832--11257689-99999853984236632765 Q gi|254780420|r 103 SNYRAN----MHF-DAWLKSCGIIG--LSGIDTRA-LTVWIRDYGVSNSVIAHHPD 150 (396) Q Consensus 103 Sn~rs~----~sL-~~wL~~~~Ipg--I~GIDTRa-LTk~IRe~G~~~a~I~~~~~ 150 (396) .||.++ ..+ .+|=++.||-. +..=|.=+ |.|..+|.....|=+..+-| T Consensus 27 dsf~~d~g~~~~l~~~Fek~tGikV~~v~~~dg~~ll~Rl~~Eg~np~ADVv~gvD 82 (326) T PRK11205 27 DSFAADWGPGPAVKKAFEAECGCELKFVALEDGVSLLNRLRLEGKNSKADVVLGLD 82 (326) T ss_pred CCCCCCCCCCHHHHHHHHHHHCCEEEEEECCCHHHHHHHHHHCCCCCCCCEEEECC T ss_conf 54444668647999999898796899996796799999999738999765899817 No 370 >PRK10766 DNA-binding transcriptional regulator TorR; Provisional Probab=27.91 E-value=44 Score=14.26 Aligned_cols=90 Identities=17% Similarity=0.273 Sum_probs=42.6 Q ss_pred CEEEEEEC--CCCCHHHHHHHHHCCCCEEEECCCCH-HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEE Q ss_conf 14999967--96410232222202454133046574-4644135788998588888410112110122210257863478 Q gi|254780420|r 206 YHVVCIDY--GIRSNLLRVLSSLGCRITIVGAETSY-KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFG 282 (396) Q Consensus 206 ~kIvvIDf--GiK~nIlr~L~~~g~~v~Vvp~~~~~-~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilG 282 (396) +||.++|= -+-..+...|...|++|..+..-..+ +.+....||-|++.-.- |..-+......+++ ...+||+= T Consensus 3 ~kILlVEDD~~l~~~l~~~L~~~g~~V~~~~~~~~a~~~l~~~~~DlvilDi~l--p~~~G~el~~~iR~--~~~~piI~ 78 (224) T PRK10766 3 YHILVVEDEPVTRARLQGYFEQEGYRVSEAASGAGMREIMQNQHVDLILLDINL--PGEDGLMLTRELRS--RSTVGIIL 78 (224) T ss_pred CEEEEECCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHHHHCCCCEEEECCCC--CCCCCCCHHHHHHH--CCCCCEEE T ss_conf 719999199999999999999879999998999999999960899999988999--98876613767630--47855686 Q ss_pred EEC-H---HHHHHHHCCCCCC Q ss_conf 610-0---5887875285446 Q gi|254780420|r 283 ICL-G---HQLLGLALGAQTV 299 (396) Q Consensus 283 ICl-G---hQ~l~~a~G~~~~ 299 (396) ++= . ..+.|+..||.-| T Consensus 79 lta~~~~~~~i~al~~GaddY 99 (224) T PRK10766 79 VTGRTDSIDRIVGLEMGADDY 99 (224) T ss_pred ECCCCCHHHHHHHHHCCCHHH T ss_conf 335589899999998494145 No 371 >PRK09534 btuF corrinoid ABC transporter substrate-binding protein; Reviewed Probab=27.80 E-value=44 Score=14.24 Aligned_cols=112 Identities=16% Similarity=0.139 Sum_probs=52.4 Q ss_pred CCHHHHHHHCCCCEECCC---CHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCCHHHCCCCCCCEEC Q ss_conf 798999974278321125---76899999985398423663276545897898887741588743210110133332111 Q gi|254780420|r 109 MHFDAWLKSCGIIGLSGI---DTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGAELAKHVTVSQRRDW 185 (396) Q Consensus 109 ~sL~~wL~~~~IpgI~GI---DTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~dL~~~Vs~~~~~~~ 185 (396) ...-+.|++.||+-+.=- .=-.+-..|+.-|...+.- +.++.-.+++++++... .+++. T Consensus 131 ~~~v~~L~~~GI~V~v~~~a~SiddV~~~I~~iG~~tg~~---e~A~~lv~~Mk~~i~~v--------~~kv~------- 192 (364) T PRK09534 131 NDTVTRLRSAGITVFQFRAATSIDGVVEKTATIGRLTGNC---AAAAETNAEMRDRVDAV--------ADAVA------- 192 (364) T ss_pred HHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHHHCCH---HHHHHHHHHHHHHHHHH--------HHHHH------- T ss_conf 6789999976996999189889999999999999983985---68999999999999999--------99861------- Q ss_pred CCCCCCCCCCCCCCCCCCCCCEEEEEECC-----CCCHHHHHHHHH-C-CCC----EEEEC-CCCHHHHHHCCCCEEEEC Q ss_conf 68653445434434677777149999679-----641023222220-2-454----13304-657446441357889985 Q gi|254780420|r 186 SEKIWKWGEETSFLKSSDAKYHVVCIDYG-----IRSNLLRVLSSL-G-CRI----TIVGA-ETSYKDIMSLQPDGVLLS 253 (396) Q Consensus 186 ~~~~~~~~~~~~~~~~~~~~~kIvvIDfG-----iK~nIlr~L~~~-g-~~v----~Vvp~-~~~~~~i~~~~pdgi~lS 253 (396) ....+++| ++.+| -|...+..|... | -++ ..-.| ..|.|.|++.+||-||.+ T Consensus 193 ---------------~~~~kp~V-~y~~~~~~TaG~~TFi~elI~lAGG~NIAad~~~~gwpqvS~E~iI~~nPDVII~~ 256 (364) T PRK09534 193 ---------------DVEDRPRV-YYHLGDGYTAGPNTFIGALIEAAGGHNVAADVNTTSYPQLSEEVIVSQDPDVVVTG 256 (364) T ss_pred ---------------CCCCCCEE-EEECCCCCCCCCCCCHHHHHHHCCCEEEHHCCCCCCCCCCCHHHHHHHCCCEEEEC T ss_conf ---------------56889879-99837983778984589999980876531004887887378999986599889967 Q ss_pred C Q ss_conf 8 Q gi|254780420|r 254 N 254 (396) Q Consensus 254 n 254 (396) . T Consensus 257 ~ 257 (364) T PRK09534 257 V 257 (364) T ss_pred C T ss_conf 8 No 372 >cd01146 FhuD Fe3+-siderophore binding domain FhuD. These proteins have been shown to function as initial receptors in ABC transport of Fe3+-siderophores in many eubacterial species. They belong to the TroA-like superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA-like protein is comprised of two globular subdomains connected by a long alpha helix and binds its specific ligands in the cleft between these domains. Probab=27.74 E-value=44 Score=14.23 Aligned_cols=21 Identities=10% Similarity=0.078 Sum_probs=17.1 Q ss_pred CCCHHHHHHCCCCEEEECCCC Q ss_conf 657446441357889985888 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNGP 256 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnGP 256 (396) ..+.|.+.+.+||-||++... T Consensus 189 ~is~E~l~~~nPDvI~~~~~~ 209 (256) T cd01146 189 TISLERLAKADADVLFVFTYE 209 (256) T ss_pred CCCHHHHHHHCCCEEEEECCC T ss_conf 059889987589989997588 No 373 >cd06831 PLPDE_III_ODC_like_AZI Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme Ornithine Decarboxylase-like Antizyme Inhibitor. Antizyme inhibitor (AZI) is homologous to the fold type III PLP-dependent enzyme ODC but does not retain any decarboxylase activity. Like ODC, AZI is presumed to exist as a homodimer. Antizyme is a regulatory protein that binds directly to the ODC monomer to block its active site, leading to its degradation by the 26S proteasome. AZI binds to Antizyme with a higher affinity than ODC, preventing the formation of the Antizyme-ODC complex. Thus, AZI blocks the ability of Antizyme to promote ODC degradation, which leads to increased ODC enzymatic activity and polyamine levels. AZI also prevents the degradation of other proteins regulated by Antizyme, such as cyclin D1. Probab=27.51 E-value=44 Score=14.21 Aligned_cols=54 Identities=19% Similarity=0.138 Sum_probs=27.4 Q ss_pred CCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHC Q ss_conf 798999974278321125768999999853984236632765458978988877415 Q gi|254780420|r 109 MHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNW 165 (396) Q Consensus 109 ~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~ 165 (396) ..+-.+|.+.|. | .++-.+.=.++.+..|.....|......+ ..+++...++.- T Consensus 49 ~~il~~L~~~G~-g-~DvaS~gEl~~al~aG~~p~~Iif~gp~K-t~~ei~~A~~~G 102 (394) T cd06831 49 PAVLEILAALGT-G-FACSSKNEMALVQELGVSPENIIYTNPCK-QASQIKYAAKVG 102 (394) T ss_pred HHHHHHHHHCCC-C-EEEECHHHHHHHHHCCCCHHHEEECCCCC-CHHHHHHHHHCC T ss_conf 999999997499-8-89869999999998699879979779999-999999999859 No 374 >KOG1538 consensus Probab=27.38 E-value=45 Score=14.19 Aligned_cols=19 Identities=16% Similarity=0.167 Sum_probs=9.0 Q ss_pred EEEEECCCCEEEEEEECCC Q ss_conf 6899728987999975676 Q gi|254780420|r 14 AVLVLADGSVIEGMGCGAT 32 (396) Q Consensus 14 A~L~LedGt~f~G~~fG~~ 32 (396) -.|-+.|||+-.-..-|-+ T Consensus 147 lalG~~nGTIsiRNk~gEe 165 (1081) T KOG1538 147 LALGMFNGTISIRNKNGEE 165 (1081) T ss_pred EEEECCCCEEEEECCCCCC T ss_conf 9997158548862267885 No 375 >TIGR02541 flagell_FlgJ flagellar rod assembly protein/muramidase FlgJ; InterPro: IPR013377 The N-terminal region of this protein acts directly in flagellar rod assembly, while the C-terminal region is a flagellum-specific muramidase (peptidoglycan hydrolase) required for formation of the outer membrane L ring .; GO: 0016798 hydrolase activity acting on glycosyl bonds, 0001539 ciliary or flagellar motility, 0009296 flagellum biogenesis, 0009288 flagellin-based flagellum. Probab=27.26 E-value=10 Score=18.71 Aligned_cols=15 Identities=33% Similarity=0.508 Sum_probs=9.5 Q ss_pred CCCCCCEEEEECHHHHHHHHC Q ss_conf 257863478610058878752 Q gi|254780420|r 274 VDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 274 ~~~~iPilGIClGhQ~l~~a~ 294 (396) .+.+|| -|-+||+|. T Consensus 201 q~~GVp------~~l~lAQAA 215 (332) T TIGR02541 201 QQLGVP------PHLILAQAA 215 (332) T ss_pred HHHCCC------HHHHHHHHH T ss_conf 871883------678999987 No 376 >pfam02601 Exonuc_VII_L Exonuclease VII, large subunit. This family consist of exonuclease VII, large subunit EC:3.1.11.6 This enzyme catalyses exonucleolytic cleavage in either 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. This exonuclease VII enzyme is composed of one large subunit and 4 small ones. Probab=27.12 E-value=45 Score=14.16 Aligned_cols=17 Identities=18% Similarity=0.092 Sum_probs=8.8 Q ss_pred CCCEEEECCCCCCCCCC Q ss_conf 56173216897368458 Q gi|254780420|r 58 LGQIVNFTFPHIGNVGV 74 (396) Q Consensus 58 ~gQIlvfT~P~IGNyGv 74 (396) .-+++.|-.+.-|+... T Consensus 41 ~~~i~l~p~~VQG~~a~ 57 (295) T pfam02601 41 LVEIEIYPTLVQGDGAA 57 (295) T ss_pred CCEEEEECCCCCCCHHH T ss_conf 96799947357650489 No 377 >TIGR00700 GABAtrnsam 4-aminobutyrate transaminase; InterPro: IPR004632 Eukaryotic 4-aminobutyrate aminotransferase (2.6.1.19 from EC) is a class III pyridoxal-phosphate-dependent aminotransferase. The enzyme catalyses the conversion of 4-aminobutanoate and 2-oxoglutarate into succinate semialdehyde and L-glutamate. The degree of sequence difference between this set and known bacterial examples is greater than the distance between, in either set, the most similar enzyme with a distinct function, and so the prokaryotic and eukaryotic sets have been placed into separate families. This family describes known bacterial examples of the enzyme. The best archaeal matches are presumed but not trusted to have the equivalent function. Escherichia coli has two isozymes. Alternate names include GABA transaminase, gamma-amino-N-butyrate transaminase, and beta-alanine--oxoglutarate aminotransferase.; GO: 0003867 4-aminobutyrate transaminase activity, 0009448 gamma-aminobutyric acid metabolic process. Probab=27.09 E-value=36 Score=14.85 Aligned_cols=92 Identities=11% Similarity=0.181 Sum_probs=44.7 Q ss_pred EEEEECCCCC--------HHHHHHHHHCCCCEEEECC-CCHHHHHHCCC-----CEEEECCCCCCCCCCCCCCHHHHHHH Q ss_conf 9999679641--------0232222202454133046-57446441357-----88998588888410112110122210 Q gi|254780420|r 208 VVCIDYGIRS--------NLLRVLSSLGCRITIVGAE-TSYKDIMSLQP-----DGVLLSNGPGDPAVTSVYSSPIICKL 273 (396) Q Consensus 208 IvvIDfGiK~--------nIlr~L~~~g~~v~Vvp~~-~~~~~i~~~~p-----dgi~lSnGPGdP~~~~~~~~~~~~~~ 273 (396) |+++..|-.+ .++.+|..-++.|+-+|-- +=+|++-.+-| ..|||..|= +.++. +.. +.+. T Consensus 42 Iavln~GhshP~vV~AV~~Q~~ef~Ht~f~v~pYe~Yv~lAEkln~iaPgSg~kk~~ffnsGA---EAVEN-AVK-IAR~ 116 (427) T TIGR00700 42 IAVLNVGHSHPKVVEAVKRQVEEFTHTAFQVVPYESYVELAEKLNAIAPGSGEKKAVFFNSGA---EAVEN-AVK-IARK 116 (427) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHCCCCCEEECCHHHHHHHHHHCCCCCCCCCHHEEEECCCH---HHHHH-HHH-HHHH T ss_conf 232036898735899999999974388534521625899999971578897310002111543---78888-999-9987 Q ss_pred CCCCCCEEEEEC---HHHHHHHHCCCCCCCCCCC Q ss_conf 257863478610---0588787528544667767 Q gi|254780420|r 274 VDSGMPMFGICL---GHQLLGLALGAQTVKMYQG 304 (396) Q Consensus 274 ~~~~iPilGICl---GhQ~l~~a~G~~~~kl~~G 304 (396) ..++--+..--- |---|..|+-+|+.--|+| T Consensus 117 yTgrpgVvaF~~gfHGRTnmTMalT~Kv~PYK~G 150 (427) T TIGR00700 117 YTGRPGVVAFDRGFHGRTNMTMALTAKVKPYKIG 150 (427) T ss_pred HCCCCCEEEECCCCCCCHHHHHHHHHCCCCCCCC T ss_conf 4589616863687466126788765227873457 No 378 >TIGR02155 PA_CoA_ligase phenylacetate-CoA ligase; InterPro: IPR011880 Phenylacetate-CoA ligase (PA-CoA ligase) catalyzes the first step in aromatic catabolism of phenylacetic acid (PA) into phenylacetyl-CoA (PA-CoA) . Often located in a conserved gene cluster with enzymes involved in phenylacetic acid activation (paaG/H/I/J), phenylacetate-CoA ligase has been found among the proteobacteria as well as in Gram-positive prokaryotes . In the B-subclass proteobacterium Azoarcus evansii, phenylacetate-CoA ligase has been shown to be induced under aerobic and anaerobic growth conditions. It remains unclear however, whether this induction is due to the same enzyme or to another isoenzyme restricted to specific anaerobic growth conditions .. Probab=26.95 E-value=28 Score=15.64 Aligned_cols=24 Identities=21% Similarity=0.384 Sum_probs=13.7 Q ss_pred CEEEECC--CC---HHHHHHCCCCEEEEC Q ss_conf 4133046--57---446441357889985 Q gi|254780420|r 230 ITIVGAE--TS---YKDIMSLQPDGVLLS 253 (396) Q Consensus 230 v~Vvp~~--~~---~~~i~~~~pdgi~lS 253 (396) .+|||.. .+ ++=|..++||-|... T Consensus 146 ~~~vPiSGG~TekQ~qLI~Df~P~iI~~T 174 (431) T TIGR02155 146 CTVVPISGGQTEKQVQLIQDFKPDIIMVT 174 (431) T ss_pred CEEEECCCCCHHHHHHHHHCCCCCEEEEC T ss_conf 07871379780567776433798889857 No 379 >cd01143 YvrC Periplasmic binding protein YvrC. These proteins are predicted to function as initial receptors in ABC transport of metal ions in eubacteria and archaea. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. Probab=26.94 E-value=45 Score=14.14 Aligned_cols=17 Identities=29% Similarity=0.589 Sum_probs=13.8 Q ss_pred CCCHHHHHHCCCCEEEE Q ss_conf 65744644135788998 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLL 252 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~l 252 (396) ..+.|.|++.+||-||+ T Consensus 179 ~vs~E~i~~~nPDvI~v 195 (195) T cd01143 179 QVSPEEILKANPDVIIL 195 (195) T ss_pred CCCHHHHHHHCCCEEEC T ss_conf 55999999878399979 No 380 >TIGR01681 HAD-SF-IIIC HAD-superfamily phosphatase, subfamily IIIC; InterPro: IPR010033 This entry represents the IIIC subfamily of the Haloacid Dehalogenase (HAD) superfamily of aspartate nucleophile hydrolases. Subfamily III, which includes subfamily IIIA (IPR006549 from INTERPRO) and subfamily IIIB (IPR005519 from INTERPRO) contains sequences which do not contain either of the insert domains between the 1st and 2nd conserved catalytic motifs, subfamily I (IPR006388 from INTERPRO, IPR006402 from INTERPRO, IPR006439 from INTERPRO, IPR006383 from INTERPRO, IPR006545 from INTERPRO, IPR006434 from INTERPRO, and IPR006435 from INTERPRO), or between the 2nd and 3rd, subfamily II (IPR006357 from INTERPRO, and IPR006379 from INTERPRO). Subfamily IIIC contains five relatively distantly related clades: a family of viral proteins (IPR007827 from INTERPRO), a family of eukaryotic proteins called MDP-1 and a family of archaeal proteins most closely related to MDP-1 (IPR010036 from INTERPRO), a family of bacteria including the Streptomyces FkbH protein (IPR010037 from INTERPRO), and a small clade including the Pasteurella multocida BcbF and EcbF proteins. The overall lack of species overlap among these clades may indicate a conserved function, but the degree of divergence between the clades and the differences in architecture outside of the domain in some clades warns against such a conclusion. No member of this subfamily is characterised with respect to function, however the MDP-1 protein is a characterised phosphatase. All of the characterised enzymes within subfamily III are phosphatases, and all of the active site residues characteristic of HAD-superfamily phosphatases are present in subfamily IIIC.. Probab=26.90 E-value=40 Score=14.55 Aligned_cols=38 Identities=21% Similarity=0.427 Sum_probs=28.8 Q ss_pred HHHHHHHH---CCCCEEEECC-C--CHHHHHHCCCC--EEEECCCC Q ss_conf 23222220---2454133046-5--74464413578--89985888 Q gi|254780420|r 219 LLRVLSSL---GCRITIVGAE-T--SYKDIMSLQPD--GVLLSNGP 256 (396) Q Consensus 219 Ilr~L~~~---g~~v~Vvp~~-~--~~~~i~~~~pd--gi~lSnGP 256 (396) .|..|.++ ||-+.+..++ . -+.++++.+.+ +++||.|- T Consensus 54 ~L~~l~~~PWaG~~l~l~S~n~~~~~~~~~~~~~~~Yf~~i~~gG~ 99 (179) T TIGR01681 54 VLQTLKKNPWAGFLLALASYNDDEHVAYELLKIFKDYFGVIISGGH 99 (179) T ss_pred HHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHCHHHCCEEECCCE T ss_conf 9999972898867999864388579999987313102256751881 No 381 >COG0391 Uncharacterized conserved protein [Function unknown] Probab=26.82 E-value=46 Score=14.12 Aligned_cols=43 Identities=19% Similarity=0.424 Sum_probs=25.4 Q ss_pred HHHHHC-CCCEEEECCCCCCCCCC--CCCCH-HHHHHHCCCCCCEEEEE Q ss_conf 464413-57889985888884101--12110-12221025786347861 Q gi|254780420|r 240 KDIMSL-QPDGVLLSNGPGDPAVT--SVYSS-PIICKLVDSGMPMFGIC 284 (396) Q Consensus 240 ~~i~~~-~pdgi~lSnGPGdP~~~--~~~~~-~~~~~~~~~~iPilGIC 284 (396) +.|.+. +-|-|+| |||.|-.. ..... .+...+.+..-|+.+|| T Consensus 182 eaveAI~~AD~Ivi--GPgSl~TSIlP~Lllp~I~eaLr~~~ap~i~v~ 228 (323) T COG0391 182 EAVEAIKEADLIVI--GPGSLFTSILPILLLPGIAEALRETVAPIVYVC 228 (323) T ss_pred HHHHHHHHCCEEEE--CCCCCHHHHCHHHCCHHHHHHHHHCCCCEEEEC T ss_conf 99999985898998--688327664303010579999986789779861 No 382 >TIGR03676 aRF1/eRF1 peptide chain release factor 1, archaeal and eukaryotic forms. Directs the termination of nascent peptide synthesis (translation) in response to the termination codons UAA, UAG and UGA. This model identifies both archaeal (aRF1) and eukaryotic (eRF1) of the protein. Also known as translation termination factor 1. Probab=26.69 E-value=39 Score=14.64 Aligned_cols=48 Identities=17% Similarity=0.341 Sum_probs=28.0 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCC-CCCCEEEEECH-----HHHHHHHCC Q ss_conf 578899858888841011211012221025-78634786100-----588787528 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVD-SGMPMFGICLG-----HQLLGLALG 295 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~-~~iPilGIClG-----hQ~l~~a~G 295 (396) +-.||+|+ |||+-. ........+..-++ .-+.++-+|+| +|++-+|-- T Consensus 208 ~v~Giilg-Gp~~~K-~~f~~~~~ld~rL~~kVi~~~Dv~Y~gE~Gl~e~ie~s~d 261 (403) T TIGR03676 208 KLKGILIG-GPGPTK-EEFAEGDYLHHELKKKILGLFDVSYTGESGLRELVEKAED 261 (403) T ss_pred CEEEEEEC-CCCHHH-HHHHHCCCCCHHHHHHHHEEEECCCCCCHHHHHHHHHHHH T ss_conf 53799933-784176-7763145535889965033430478872449999999999 No 383 >pfam03262 Corona_6B_7B Coronavirus 6B/7B protein. Probab=26.68 E-value=24 Score=16.11 Aligned_cols=114 Identities=18% Similarity=0.346 Sum_probs=44.5 Q ss_pred EEEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHC-----CC-----CEEEECCCCCCCCCCCCCCHHHHHHHCCC Q ss_conf 499996796410232222202454133046574464413-----57-----88998588888410112110122210257 Q gi|254780420|r 207 HVVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSL-----QP-----DGVLLSNGPGDPAVTSVYSSPIICKLVDS 276 (396) Q Consensus 207 kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~-----~p-----dgi~lSnGPGdP~~~~~~~~~~~~~~~~~ 276 (396) .|..+-+|.+ +++=.--|+..+.=-+..++.+-... +| |-..+|-|-|+-.. ...++-+.-+. T Consensus 51 qiLsLPlga~---v~C~~veGf~CtwPGf~~~ahDHiDfyfDlsnPFYsFVd~FYis~g~~~~ri----~Lr~vga~pK~ 123 (209) T pfam03262 51 QILSLPLGAE---VECESVEGFNCTWPGFQNAAHDHIDFYFDLSNPFYSFVDTFYISVGEGNQKI----NLRLVGATPKD 123 (209) T ss_pred HEEECCCCCE---EEECCCCCEECCCCCCCCCCCCCEEEEECCCCCHHHHHHEEEEEECCCCCEE----EEEECCCCCCC T ss_conf 7575246876---8762546515368674431224403786079725611013899940576368----99980588711 Q ss_pred CCCEEEE-ECHHHHHHHHCCCCCCCCCCCCCCCCCCCE--EC-CCCEEEEEC---CC-CCEEEEH Q ss_conf 8634786-100588787528544667767521011234--77-897099961---45-7615703 Q gi|254780420|r 277 GMPMFGI-CLGHQLLGLALGAQTVKMYQGHHGVNHPVK--NI-SNGTVEIVS---MN-HGFAIDA 333 (396) Q Consensus 277 ~iPilGI-ClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~--~~-~t~kv~iTs---qN-Hg~~v~~ 333 (396) | .|-| |----.+.+-+|-.++- -|-.-|||. ++ -+.|+|+.. +| |||+..+ T Consensus 124 K--rLnvgc~~sf~vdLpfgtQiyh----Drd~~~~V~grhleCthrvyfVkyCp~~~hgyCf~d 182 (209) T pfam03262 124 K--RLNIGCHTSFSVDLPFGTQIYH----DRDMQLLVEGRHLECTHRVYFVKYCPYHAHGYCFKD 182 (209) T ss_pred C--EEEECCCCCCCCCCCCCEEEEE----CCCCEEECCCEECCCCCEEEEEEECCCCCCCEEECC T ss_conf 1--6876245663015765205670----453043316665013336899986899986333536 No 384 >cd06333 PBP1_ABC-type_HAAT_like Type I periplasmic binding component of ABC (ATPase Binding Cassette)-type transport systems that are predicted to be involved in uptake of amino acids. This subgroup includes the type I periplasmic binding component of ABC (ATPase Binding Cassette)-type transport systems that are predicted to be involved in uptake of amino acids. Members of this subgroup are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters (HAAT), such as leucine-isoleucine-valine-binding protein (LIVBP); their ligand specificity has not been determined experimentally, however. Probab=26.57 E-value=46 Score=14.10 Aligned_cols=161 Identities=17% Similarity=0.239 Sum_probs=73.2 Q ss_pred EEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCC----HHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCC Q ss_conf 66404654434552417989999742783211257----68999999853984236632765458978988877415887 Q gi|254780420|r 93 LVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGID----TRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGL 168 (396) Q Consensus 93 ~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GID----TRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~ 168 (396) +|+.+-...|. ++.....++..+.+|.+|-|-- |.++...+.+.|...-..+.... .......| T Consensus 42 lv~~Dd~~~p~--~a~~~a~~li~~~~V~~ivG~~~S~~~~a~~~~~~~~~vp~is~~~~~~-------~~~~~~~y--- 109 (312) T cd06333 42 LIVLDDGSDPT--KAVTNARKLIEEDKVDAIIGPSTTPATMAVAPVAEEAKTPMISLAPAAA-------IVEPKRKW--- 109 (312) T ss_pred EEEECCCCCHH--HHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHCCCEEEECCCCCC-------CCCCCCCC--- T ss_conf 99836999989--9999999986517933986687737788889999974986997478761-------22489870--- Q ss_pred CCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEE----CC--CCCHHHHHHHHHCCCCEE---EE-CCCC Q ss_conf 43210110133332111686534454344346777771499996----79--641023222220245413---30-4657 Q gi|254780420|r 169 KGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCID----YG--IRSNLLRVLSSLGCRITI---VG-AETS 238 (396) Q Consensus 169 ~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvID----fG--iK~nIlr~L~~~g~~v~V---vp-~~~~ 238 (396) +-.+...... ...... .+... .+-+||++|. || ....+.+.+.+.|.+|+- ++ .+++ T Consensus 110 -----~fr~~~~~~~-~~~~~~------~~~~~-~g~kkvail~~~~~yg~~~~~~~~~~~~~~G~~vv~~~~~~~~~~D 176 (312) T cd06333 110 -----VFKTPQNDRL-MAEAIL------ADMKK-RGVKTVAFIGFSDAYGESGLKELKALAPKYGIEVVADERYGRTDTS 176 (312) T ss_pred -----EEECCCCHHH-HHHHHH------HHHHH-CCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCEEEEEEEECCCCCC T ss_conf -----7961598799-999999------99997-3998899996176055999999999999779939999972798767 Q ss_pred H----HHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE Q ss_conf 4----46441357889985888884101121101222102578634786 Q gi|254780420|r 239 Y----KDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI 283 (396) Q Consensus 239 ~----~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI 283 (396) + ..+.+.+||.||+..-+.+ .. ......++ +.-+.|++|- T Consensus 177 fs~~l~ki~~~~pD~v~~~~~~~~---~~-~~~~q~~~-~G~~~~~~~~ 220 (312) T cd06333 177 VTAQLLKIRAARPDAVLIWGSGTP---AA-LPAKNLRE-RGYKGPIYQT 220 (312) T ss_pred HHHHHHHHHHCCCCEEEEECCCCH---HH-HHHHHHHH-CCCCCCEEEC T ss_conf 799999998569899999457516---99-99999997-6998708812 No 385 >cd06290 PBP1_LacI_like_9 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=26.52 E-value=46 Score=14.09 Aligned_cols=29 Identities=14% Similarity=0.072 Sum_probs=12.6 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCCCC Q ss_conf 666404654434552417989999742783 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCGII 121 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~Ip 121 (396) |+||.++... .--.--..+.+-++++|.- T Consensus 3 Gvivp~i~np-f~~~i~~gie~~~~~~gy~ 31 (265) T cd06290 3 GVLTQDFASP-FYGRILKGMERGLNGSGYS 31 (265) T ss_pred EEEECCCCCH-HHHHHHHHHHHHHHHCCCE T ss_conf 9997898769-9999999999999986998 No 386 >COG0131 HisB Imidazoleglycerol-phosphate dehydratase [Amino acid transport and metabolism] Probab=26.38 E-value=20 Score=16.61 Aligned_cols=23 Identities=43% Similarity=0.439 Sum_probs=16.1 Q ss_pred EEEECHHHHHHHHCCCCCCCCCCC Q ss_conf 786100588787528544667767 Q gi|254780420|r 281 FGICLGHQLLGLALGAQTVKMYQG 304 (396) Q Consensus 281 lGIClGhQ~l~~a~G~~~~kl~~G 304 (396) .||||| |+|..|+|-+--=-.|| T Consensus 72 ~gIaLG-qAl~~AlGdk~GI~Rfg 94 (195) T COG0131 72 TGIALG-QALKEALGDKRGIRRFG 94 (195) T ss_pred HHHHHH-HHHHHHHCCCCCCEECC T ss_conf 998999-99999856421531115 No 387 >cd01171 YXKO-related B.subtilis YXKO protein of unknown function and related proteins. Based on the conservation of the ATP binding site, the substrate binding site and the Mg2+binding site and structural homology this group is a member of the ribokinase-like superfamily. Probab=26.35 E-value=47 Score=14.07 Aligned_cols=30 Identities=7% Similarity=0.012 Sum_probs=14.9 Q ss_pred CCHHHHHHHCCCCEEC-C----CCHHHHHHHHHHC Q ss_conf 7989999742783211-2----5768999999853 Q gi|254780420|r 109 MHFDAWLKSCGIIGLS-G----IDTRALTVWIRDY 138 (396) Q Consensus 109 ~sL~~wL~~~~IpgI~-G----IDTRaLTk~IRe~ 138 (396) ..+..++.+..+..|- | =+|+.+++.++.+ T Consensus 69 ~~~~~~~~~~~aiviGpGlg~~~~~~~~~~~~~~~ 103 (254) T cd01171 69 EELLELLERADAVVIGPGLGRDEEAAEILEKALAK 103 (254) T ss_pred HHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHC T ss_conf 67776533587899827889999999999999844 No 388 >pfam10563 CdCA1 Cadmium carbonic anhydrase repeat. This domain is the cadmium carbonic anhydrase repeat unit of the beta-carbonic anhydrase of a marine diatom, that uses both zinc and cadmium for catalysis of the reversible hydration of carbon dioxide for use in inorganic carbon acquisition for photosynthesis (thus being a cambialistic enzyme). Compared with alpha- and gamma-carbonic anhydrases that use three histidines to coordinate the zinc-atom, this beta-carbonic anhydrase has two cysteines and one histidine, and rapidly binds cadmium. Probab=26.33 E-value=35 Score=14.91 Aligned_cols=35 Identities=29% Similarity=0.444 Sum_probs=26.3 Q ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEE Q ss_conf 02322222024541330465744644135788998 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLL 252 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~l 252 (396) -|+-.|..||..-++++...-+.++....|+|++- T Consensus 5 ~i~~aLq~RGW~A~Iv~~s~~~~~lvdV~p~G~lK 39 (209) T pfam10563 5 QIKAALEGRGWTATIVPRSEMASDLVDVGPDGILK 39 (209) T ss_pred HHHHHHHHCCCCEEEEEHHHHHHHHEEECCCCEEE T ss_conf 99999980898368730666333443208764388 No 389 >cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity. Probab=26.14 E-value=25 Score=16.01 Aligned_cols=26 Identities=15% Similarity=0.234 Sum_probs=17.3 Q ss_pred EECCCCHHHHHHHHHHCCCCCEEEECC Q ss_conf 211257689999998539842366327 Q gi|254780420|r 122 GLSGIDTRALTVWIRDYGVSNSVIAHH 148 (396) Q Consensus 122 gI~GIDTRaLTk~IRe~G~~~a~I~~~ 148 (396) +|.| =+|.|-+.+++......+++.+ T Consensus 33 ai~g-f~~~l~~l~~~~~p~~i~v~fD 58 (240) T cd00008 33 AVYG-FLNMLLKLIKEYKPTYVAVVFD 58 (240) T ss_pred HHHH-HHHHHHHHHHHCCCCEEEEEEC T ss_conf 8999-9999999998659988999972 No 390 >COG2235 ArcA Arginine deiminase [Amino acid transport and metabolism] Probab=25.96 E-value=26 Score=15.86 Aligned_cols=11 Identities=27% Similarity=0.440 Sum_probs=4.7 Q ss_pred EEEEECCCCCH Q ss_conf 99996796410 Q gi|254780420|r 208 VVCIDYGIRSN 218 (396) Q Consensus 208 IvvIDfGiK~n 218 (396) =+.++=|+--| T Consensus 166 ~~~i~~GV~in 176 (409) T COG2235 166 FASIGNGVSIN 176 (409) T ss_pred CEEEECCEEEC T ss_conf 33760877852 No 391 >cd00379 Ribosomal_L10_P0 Ribosomal protein L10 family; composed of the large subunit ribosomal protein called L10 in bacteria, P0 in eukaryotes, and L10e in archaea, as well as uncharacterized P0-like eukaryotic proteins. In all three kingdoms, L10 forms a tight complex with multiple copies of the small acidic protein L12(e). This complex forms a stalk structure on the large subunit of the ribosome. The N-terminal domain (NTD) of L10 interacts with L11 protein and forms the base of the L7/L12 stalk, while the extended C-terminal helix binds to two or three dimers of the NTD of L7/L12 (L7 and L12 are identical except for an acetylated N-terminus). The L7/L12 stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been sho Probab=25.92 E-value=47 Score=14.01 Aligned_cols=82 Identities=18% Similarity=0.299 Sum_probs=43.2 Q ss_pred EEEEEEC-CCCCHHHHHH----HHHCCCCEEEECCC--------CHHHHHH--CCCCEEEECCCCCCCCCCCCCCHHHHH Q ss_conf 4999967-9641023222----22024541330465--------7446441--357889985888884101121101222 Q gi|254780420|r 207 HVVCIDY-GIRSNLLRVL----SSLGCRITIVGAET--------SYKDIMS--LQPDGVLLSNGPGDPAVTSVYSSPIIC 271 (396) Q Consensus 207 kIvvIDf-GiK~nIlr~L----~~~g~~v~Vvp~~~--------~~~~i~~--~~pdgi~lSnGPGdP~~~~~~~~~~~~ 271 (396) .|+++|| |+.-+.+..| ++.|+++.|+.++. ..+++.. ..|-++++|++ ||....+.... +. T Consensus 19 ~v~v~~~~gl~~~~~~~lR~~l~~~~~~~~V~KNtL~k~Al~~~~~~~l~~~l~G~~a~~f~~~--d~~~~aK~~~~-f~ 95 (155) T cd00379 19 SVVVVDYRGLTVAQLTELRKELRESGAKLKVGKNTLMRRALKGTGFEELKPLLKGPTALAFTNE--DPVEVAKVLKD-FA 95 (155) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHCCCCCCCCCCCCCEEEEEECC--CCHHHHHHHHH-HH T ss_conf 8999977999789999999998857917999855889999860586544322547758998278--82589999999-99 Q ss_pred HHCCCCCCEEEEECHHHHHHH Q ss_conf 102578634786100588787 Q gi|254780420|r 272 KLVDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 272 ~~~~~~iPilGIClGhQ~l~~ 292 (396) +-. .+..+.|=++.-+++.. T Consensus 96 k~~-~~~~~k~G~~~~~~~~~ 115 (155) T cd00379 96 KEN-KKLFAKGGVVAGKVLDP 115 (155) T ss_pred HHC-CCCEEEEEEECCEEECH T ss_conf 756-78215788776845279 No 392 >PRK03803 murD UDP-N-acetylmuramoyl-L-alanyl-D-glutamate synthetase; Provisional Probab=25.77 E-value=48 Score=14.00 Aligned_cols=27 Identities=15% Similarity=0.223 Sum_probs=16.5 Q ss_pred HHHHCCCCEECCCC-----HHHHHHHHHHCCC Q ss_conf 99742783211257-----6899999985398 Q gi|254780420|r 114 WLKSCGIIGLSGID-----TRALTVWIRDYGV 140 (396) Q Consensus 114 wL~~~~IpgI~GID-----TRaLTk~IRe~G~ 140 (396) ++.+..+.||+|=. |..|..++++.|- T Consensus 105 ~~~~~~~IaVTGTnGKTTTtsli~~iL~~~g~ 136 (448) T PRK03803 105 RAAKAPIVAITGSNAKSTVTTLVGEMAKAAGK 136 (448) T ss_pred HHCCCCEEEECCCCCHHHHHHHHHHHHHHCCC T ss_conf 21789889985899838899999999986598 No 393 >cd06273 PBP1_GntR_like_1 This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational Probab=25.50 E-value=48 Score=13.96 Aligned_cols=12 Identities=25% Similarity=0.332 Sum_probs=5.0 Q ss_pred CCHHHHHHHCCC Q ss_conf 798999974278 Q gi|254780420|r 109 MHFDAWLKSCGI 120 (396) Q Consensus 109 ~sL~~wL~~~~I 120 (396) ..+.+.++++|. T Consensus 19 ~~ie~~~~~~Gy 30 (268) T cd06273 19 QAFQETLAAHGY 30 (268) T ss_pred HHHHHHHHHCCC T ss_conf 999999998599 No 394 >PRK13235 nifH nitrogenase reductase; Reviewed Probab=25.50 E-value=48 Score=13.96 Aligned_cols=57 Identities=18% Similarity=0.255 Sum_probs=34.0 Q ss_pred CCCCEEEEEECCCCCHHHHHHHHHCCCCEEEE------CCCCHHHHHHCCCCEEE--ECCCCCCCC Q ss_conf 77714999967964102322222024541330------46574464413578899--858888841 Q gi|254780420|r 203 DAKYHVVCIDYGIRSNLLRVLSSLGCRITIVG------AETSYKDIMSLQPDGVL--LSNGPGDPA 260 (396) Q Consensus 203 ~~~~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp------~~~~~~~i~~~~pdgi~--lSnGPGdP~ 260 (396) ..++||.+|||--|.+=.+.|...-..-+|+. .+.+.++|....+.|+. -|.|| .|. T Consensus 27 ~~GkkVl~IgcDPk~dsT~~L~gg~~~~tvld~l~~~~~~~~ledvi~~g~~gi~cveaggp-~pg 91 (274) T PRK13235 27 EMGKKVMVVGCDPKADSTRLLLGGLQQKTVLDTLREEGEDVELEDIIKEGYGNTRCTESGGP-EPG 91 (274) T ss_pred HCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHHHHCCCCCCHHHHEEECCCCEEEEECCCC-CCC T ss_conf 78997999898984536678738998997899998628776778944317898189868998-756 No 395 >COG0052 RpsB Ribosomal protein S2 [Translation, ribosomal structure and biogenesis] Probab=25.49 E-value=48 Score=13.96 Aligned_cols=31 Identities=19% Similarity=0.388 Sum_probs=22.0 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE Q ss_conf 578899858888841011211012221025786347861 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC 284 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC 284 (396) -||.+|+. ||..- ...+++...-+|||.+|| T Consensus 156 ~Pd~l~Vi----Dp~~e----~iAv~EA~klgIPVvAlv 186 (252) T COG0052 156 LPDVLFVI----DPRKE----KIAVKEANKLGIPVVALV 186 (252) T ss_pred CCCEEEEE----CCCHH----HHHHHHHHHCCCCEEEEE T ss_conf 99989996----88176----899999997599989984 No 396 >PRK08818 prephenate dehydrogenase; Provisional Probab=25.45 E-value=48 Score=13.96 Aligned_cols=29 Identities=21% Similarity=0.256 Sum_probs=17.6 Q ss_pred EEEEEEEEECCCCCCCCCCCCCCCCCCEEEEC Q ss_conf 48888985378888760141614256173216 Q gi|254780420|r 34 SIQAEICFNTSLTGYQEILTDPSYLGQIVNFT 65 (396) Q Consensus 34 ~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT 65 (396) +..-|+|.+--|+|=+++ ||-.||+++.| T Consensus 100 ~s~~eVv~~HPMfGpP~s---PsL~gqVVIlt 128 (373) T PRK08818 100 ASQAEVVGLHPMTAPPKS---PTLKGRVMVVC 128 (373) T ss_pred HCCCCEECCCCCCCCCCC---CCCCCEEEEEE T ss_conf 444422237887789998---64044189970 No 397 >COG3340 PepE Peptidase E [Amino acid transport and metabolism] Probab=25.15 E-value=49 Score=13.92 Aligned_cols=47 Identities=19% Similarity=0.189 Sum_probs=31.3 Q ss_pred EEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCC Q ss_conf 36664046544345524179899997427832112576899999985398 Q gi|254780420|r 91 VGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGV 140 (396) Q Consensus 91 ~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~ 140 (396) .|+.+.++...-++ ...+..+|.+..+.-+.|=.|+-|-+.+||.|. T Consensus 61 lg~~v~~L~l~~~~---~~~Ie~~l~~~d~IyVgGGNTF~LL~~lke~gl 107 (224) T COG3340 61 LGLEVSELHLSKPP---LAAIENKLMKADIIYVGGGNTFNLLQELKETGL 107 (224) T ss_pred CCCEEEEEECCCCC---HHHHHHHHHHCCEEEECCCHHHHHHHHHHHHCC T ss_conf 69715542546799---999997651155899888518999999998475 No 398 >cd03145 GAT1_cyanophycinase Type 1 glutamine amidotransferase (GATase1)-like domain found in cyanophycinase. Type 1 glutamine amidotransferase (GATase1)-like domain found in cyanophycinase. This group contains proteins similar to the extracellular cyanophycinases from Pseudomonas anguilliseptica BI (CphE) and Synechocystis sp. PCC 6803 CphB. Cyanophycinases are intracellular exopeptidases which hydrolyze the polymer cyanophycin (multi L-arginyl-poly-L-aspartic acid) to the dipeptide beta-Asp-Arg. Cyanophycinase is believed to be a serine-type exopeptidase having a Ser-His-Glu catalytic triad which differs from the Cys-His-Glu catalytic triad typical of GATase1 domains by having a Ser in place of the reactive Cys at the nucleophile elbow. Probab=25.11 E-value=49 Score=13.91 Aligned_cols=89 Identities=17% Similarity=0.303 Sum_probs=52.2 Q ss_pred CCCEEEEEECCCC------CHHHHHHHHHCCC-CEEEECCC----CHHHHHH--CCCCEEEECCCCCCCCCCCCC----- Q ss_conf 7714999967964------1023222220245-41330465----7446441--357889985888884101121----- Q gi|254780420|r 204 AKYHVVCIDYGIR------SNLLRVLSSLGCR-ITIVGAET----SYKDIMS--LQPDGVLLSNGPGDPAVTSVY----- 265 (396) Q Consensus 204 ~~~kIvvIDfGiK------~nIlr~L~~~g~~-v~Vvp~~~----~~~~i~~--~~pdgi~lSnGPGdP~~~~~~----- 265 (396) .+.+|+++.-... ..-.+.+.++|+. +.+++-+. +-.++.+ .+-|+||++ .|||...-.. T Consensus 28 ~~~ri~viptAs~~~~~~~~~~~~~f~~lG~~~v~~l~~~~r~~a~~~~~~~~l~~a~~i~~~--GG~q~~~~~~~~~t~ 105 (217) T cd03145 28 AGARIVVIPAASEEPAEVGEEYRDVFERLGAREVEVLVIDSREAANDPEVVARLRDADGIFFT--GGDQLRITSALGGTP 105 (217) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCHHHCCCHHHHHHHHCCCEEEEC--CCCHHHHHHHHHCCC T ss_conf 987599985887885999999999999839995059822683544898999999649989993--899999999996396 Q ss_pred CHHHHHHHCCCCCCEEEEECHHHHHHHHC Q ss_conf 10122210257863478610058878752 Q gi|254780420|r 266 SSPIICKLVDSGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 266 ~~~~~~~~~~~~iPilGIClGhQ~l~~a~ 294 (396) ....+++.++.+.||.|.-=|.-+++... T Consensus 106 ~~~~l~~~~~~G~vi~G~SAGA~~~~~~~ 134 (217) T cd03145 106 LLDALRKVYRGGVVIGGTSAGAAVMSDTM 134 (217) T ss_pred HHHHHHHHHHCCCEEEECCHHHHHHHHHC T ss_conf 99999999986994884117888745421 No 399 >PRK09482 xni exonuclease IX; Provisional Probab=25.06 E-value=49 Score=13.91 Aligned_cols=21 Identities=24% Similarity=0.252 Sum_probs=13.7 Q ss_pred HHHHHHHHHHCCCCCEEEECC Q ss_conf 689999998539842366327 Q gi|254780420|r 128 TRALTVWIRDYGVSNSVIAHH 148 (396) Q Consensus 128 TRaLTk~IRe~G~~~a~I~~~ 148 (396) +++|.+.|++......+++.+ T Consensus 35 ~~~l~kli~~~~P~~i~v~fD 55 (256) T PRK09482 35 QHALDKLIRHSQPTHAVAVFD 55 (256) T ss_pred HHHHHHHHHHCCCCEEEEEEE T ss_conf 999999999719987999984 No 400 >KOG3323 consensus Probab=24.93 E-value=49 Score=13.89 Aligned_cols=24 Identities=17% Similarity=0.520 Sum_probs=13.6 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 34878877670016899999999997 Q gi|254780420|r 366 QYHPESSPGPQDSYYLFSTFIDFMHK 391 (396) Q Consensus 366 QfHPEa~PGP~D~~~~F~~F~~~i~~ 391 (396) |||--. +|+.+..|+..|++++++ T Consensus 92 dfH~am--~~~ea~elY~qfl~~l~k 115 (149) T KOG3323 92 DFHLAM--KGEEAKELYNQFLELLRK 115 (149) T ss_pred CHHHHC--CCHHHHHHHHHHHHHHHH T ss_conf 554406--954669999999999987 No 401 >COG1935 Uncharacterized conserved protein [Function unknown] Probab=24.79 E-value=50 Score=13.87 Aligned_cols=41 Identities=20% Similarity=0.268 Sum_probs=27.2 Q ss_pred CCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCC-CEEEEC Q ss_conf 79641023222220245413304657446441357-889985 Q gi|254780420|r 213 YGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQP-DGVLLS 253 (396) Q Consensus 213 fGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~p-dgi~lS 253 (396) +|+-.|.+..|.+.+.+-..+..--++..++.++| |-+||. T Consensus 7 tGIs~~vie~l~~~~~rTieiRsa~N~~tv~rl~~GDlVFlT 48 (122) T COG1935 7 TGISRRVIESLLRNPIRTIEIRSARNLLTVLRLHEGDLVFLT 48 (122) T ss_pred ECHHHHHHHHHHHCCCEEEEEECCCCHHHHHCCCCCCEEEEE T ss_conf 151199999998688539999744236776447888889996 No 402 >TIGR01465 cobM_cbiF precorrin-4 C11-methyltransferase; InterPro: IPR006362 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobM and CibF precorrin-4 C11-methyltransferase (2.1.1.133 from EC), both as stand-alone enzymes and when CobJ forms part of a bifunctional enzyme. In the aerobic pathway, CobM catalyses the methylation of precorrin-4 at C-11 to yield precorrin-5. The extruded acyl group is then removed in the subsequent step catalysed by CobF. In the anaerobic pathway, CibF catalyses the methylation of cobalt-precorrin-4 to cobalt-precoriin-5 . Nomenclature note: occasionally precorrin-4 C11-methyltransferase is one of two methyltransferases often referred to as precorrin-3 methylase (the other is precorrin-3 methylase (the other is precorrin-4 C11-methyltransferase, 2.1.1.131 from EC). ; GO: 0046026 precorrin-4 C11-methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=24.49 E-value=32 Score=15.25 Aligned_cols=85 Identities=19% Similarity=0.338 Sum_probs=54.8 Q ss_pred CEEEEEE------CCCCCHHHHHHHHHCCCCEEEECCCCHH---HHHHCC---C---CEEEECCCCCCCCCCCCCCHHHH Q ss_conf 1499996------7964102322222024541330465744---644135---7---88998588888410112110122 Q gi|254780420|r 206 YHVVCID------YGIRSNLLRVLSSLGCRITIVGAETSYK---DIMSLQ---P---DGVLLSNGPGDPAVTSVYSSPII 270 (396) Q Consensus 206 ~kIvvID------fGiK~nIlr~L~~~g~~v~Vvp~~~~~~---~i~~~~---p---dgi~lSnGPGdP~~~~~~~~~~~ 270 (396) +.||=|= ||.=.-+++.|.+.|...+|+|-=+++. -.++.. | .-|+|.=+=|-=..+. ..|.+ T Consensus 72 K~VvRLHsGDPsIYGA~~EQ~~~L~~~gI~~e~vPGvSsf~AAAA~l~~ELT~P~vsQtvilTR~eG~RtPmP--e~E~l 149 (252) T TIGR01465 72 KLVVRLHSGDPSIYGAIAEQMQLLEALGIPYEVVPGVSSFFAAAAALGAELTVPEVSQTVILTRAEGRRTPMP--EGEKL 149 (252) T ss_pred CEEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCCCC--CHHHH T ss_conf 8499985087557766999999998678977986887389899997300147884034244675434354577--66789 Q ss_pred HHHCCCCCCEEEEECHHHHHHHH Q ss_conf 21025786347861005887875 Q gi|254780420|r 271 CKLVDSGMPMFGICLGHQLLGLA 293 (396) Q Consensus 271 ~~~~~~~iPilGIClGhQ~l~~a 293 (396) ..+.+. --.|+|=||-+++-.. T Consensus 150 ~~lA~h-gaTm~IfLs~~~~~~v 171 (252) T TIGR01465 150 ADLAKH-GATMAIFLSAHIIDKV 171 (252) T ss_pred HHHHHH-CCHHHHHHHHHHHHHH T ss_conf 988741-2313567789999999 No 403 >cd06280 PBP1_LacI_like_4 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=24.47 E-value=50 Score=13.83 Aligned_cols=27 Identities=15% Similarity=0.147 Sum_probs=12.1 Q ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHCC Q ss_conf 6664046544345524179899997427 Q gi|254780420|r 92 GLVIKAEITEPSNYRANMHFDAWLKSCG 119 (396) Q Consensus 92 g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~ 119 (396) |+||.++.. |.--.--..+.+.++++| T Consensus 3 Gvivpdi~n-pf~~~i~~gi~~~~~~~g 29 (263) T cd06280 3 GLIVADIRN-PFFTAVSRAVEDAAYRAG 29 (263) T ss_pred EEEECCCCC-HHHHHHHHHHHHHHHHCC T ss_conf 999888877-899999999999999869 No 404 >COG1671 Uncharacterized protein conserved in bacteria [Function unknown] Probab=24.46 E-value=18 Score=17.00 Aligned_cols=69 Identities=17% Similarity=0.272 Sum_probs=39.5 Q ss_pred CCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHH---HCCCCCCEEEEEC Q ss_conf 796410232222202454133046574464413578899858888841011211012221---0257863478610 Q gi|254780420|r 213 YGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICK---LVDSGMPMFGICL 285 (396) Q Consensus 213 fGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~---~~~~~iPilGICl 285 (396) |=+|.-|.|--.+.|+.|++|.+..-.-- ..-.-.-++.+.|+ ...+..+.+...+ .+...||+-..|| T Consensus 11 CPVk~~i~r~A~r~~~~v~~Van~~~~~~-~~~~i~~v~V~~g~---DaaD~~Iv~~a~~gDlVVT~Di~LA~~ll 82 (150) T COG1671 11 CPVKDEIYRVAERMGLKVTFVANFPHRVP-PSPEIRTVVVDAGF---DAADDWIVNLAEKGDLVVTADIPLASLLL 82 (150) T ss_pred CCHHHHHHHHHHHHCCEEEEEECCCCCCC-CCCCEEEEEECCCC---CHHHHHHHHHCCCCCEEEECCHHHHHHHH T ss_conf 85189999999996975999917875689-99853689955773---32889999837789989977647899999 No 405 >PRK00750 lysK lysyl-tRNA synthetase; Reviewed Probab=24.44 E-value=50 Score=13.84 Aligned_cols=21 Identities=48% Similarity=0.912 Sum_probs=17.3 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCC Q ss_conf 8898537888876014161425617321689736845 Q gi|254780420|r 37 AEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVG 73 (396) Q Consensus 37 GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyG 73 (396) +++||.||.| || -.|||||.| T Consensus 29 ~~vv~~tG~~--------pS--------G~~HiG~~~ 49 (513) T PRK00750 29 GEVLFETGYG--------PS--------GLPHIGTFG 49 (513) T ss_pred CEEEEECCCC--------CC--------CCCCCCCCC T ss_conf 7399936868--------99--------886230452 No 406 >pfam04187 DUF399 Protein of unknown function, DUF399. No function is known for any member of this family. Probab=24.42 E-value=50 Score=13.83 Aligned_cols=27 Identities=15% Similarity=0.177 Sum_probs=15.1 Q ss_pred EEEEEEECCCCCCCCCCCCCCHHHHHH Q ss_conf 236664046544345524179899997 Q gi|254780420|r 90 AVGLVIKAEITEPSNYRANMHFDAWLK 116 (396) Q Consensus 90 ~~g~Iv~~~~~~~Sn~rs~~sL~~wL~ 116 (396) +.-+++-|..+.+.|++.+..+-.-|. T Consensus 13 advVllGE~Hdn~~~H~~Q~~il~~L~ 39 (213) T pfam04187 13 ADVVLLGEQHDNPAHHRLQLWLLRALH 39 (213) T ss_pred CCEEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 998997888799889999999999999 No 407 >KOG4328 consensus Probab=24.37 E-value=46 Score=14.09 Aligned_cols=54 Identities=13% Similarity=0.116 Sum_probs=24.0 Q ss_pred CCCCCCCEEEEEECC--CCCH-HHHHHHHHCCCCEEEECC-CCHHHHHHCCCCEEEEC Q ss_conf 677777149999679--6410-232222202454133046-57446441357889985 Q gi|254780420|r 200 KSSDAKYHVVCIDYG--IRSN-LLRVLSSLGCRITIVGAE-TSYKDIMSLQPDGVLLS 253 (396) Q Consensus 200 ~~~~~~~kIvvIDfG--iK~n-Ilr~L~~~g~~v~Vvp~~-~~~~~i~~~~pdgi~lS 253 (396) ..++++-.|-+-||| .|.+ =+......+.-|.-+... ++...|++..+||-+=+ T Consensus 204 a~GdK~G~VG~Wn~~~~~~d~d~v~~f~~hs~~Vs~l~F~P~n~s~i~ssSyDGtiR~ 261 (498) T KOG4328 204 AVGDKGGQVGLWNFGTQEKDKDGVYLFTPHSGPVSGLKFSPANTSQIYSSSYDGTIRL 261 (498) T ss_pred EECCCCCCEEEEECCCCCCCCCCEEEECCCCCCCCCEEECCCCHHHEEEECCCCEEEE T ss_conf 9715777277876677777667548845677622343755798435565124761566 No 408 >PRK03967 histidinol-phosphate aminotransferase; Provisional Probab=24.23 E-value=51 Score=13.80 Aligned_cols=54 Identities=19% Similarity=0.236 Sum_probs=25.1 Q ss_pred HHHHCCCCEEEECC----CCHHHHHH--CCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCC Q ss_conf 22202454133046----57446441--3578899858888841011211012221025786 Q gi|254780420|r 223 LSSLGCRITIVGAE----TSYKDIMS--LQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGM 278 (396) Q Consensus 223 L~~~g~~v~Vvp~~----~~~~~i~~--~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~i 278 (396) ....|+.+.-+|.+ .+.+.+.+ .+.+.|||.| |.+|.-.- ...+.+.++++... T Consensus 111 ~~~~g~~~~~vpl~~~f~~d~~~~~~~~~~~~~v~l~n-PnNPTG~~-~~~~el~~~~~~~~ 170 (337) T PRK03967 111 AKLNGIPVIDVPLKEDFTIDGDRIVEKAKDASAVFICS-PNNPTGNL-QPIEEILKVLDTGK 170 (337) T ss_pred HHHCCCEEEEEECCCCCCCCHHHHHHCCCCCCEEEEEC-CCCCCCCC-CCHHHHHHHHHCCC T ss_conf 99839859996153456789999985177887899938-99987621-67899998650088 No 409 >PRK05990 precorrin-2 C(20)-methyltransferase; Reviewed Probab=24.18 E-value=28 Score=15.65 Aligned_cols=51 Identities=18% Similarity=0.066 Sum_probs=28.2 Q ss_pred CCEEEEEECC------CCCHHHHHHHHHCCCCEEEECCCCHHHHHH-------CCCCEEEECCCC Q ss_conf 7149999679------641023222220245413304657446441-------357889985888 Q gi|254780420|r 205 KYHVVCIDYG------IRSNLLRVLSSLGCRITIVGAETSYKDIMS-------LQPDGVLLSNGP 256 (396) Q Consensus 205 ~~kIvvIDfG------iK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~-------~~pdgi~lSnGP 256 (396) +.+||++=.| .=..+++.|.+ ++.+.|+|-=+++...-+ ..-+.+.+.+|+ T Consensus 98 G~~Va~lt~GDp~~YsT~~yl~~~l~~-~~~vevIPGItS~~AaAA~~g~pL~~~~e~l~Iip~~ 161 (241) T PRK05990 98 GRDVAVICEGDPFFYGSYMYLHDRLAP-RYETEVVPGVCSMLGGASVLGAPLVYRNQSLSVLSGV 161 (241) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHC-CCCEEEECCHHHHHHHHHHHCCCCEECCCEEEEEECC T ss_conf 996999957794366469999999627-9998998976699999997099726168316998067 No 410 >PRK05948 precorrin-2 methyltransferase; Provisional Probab=24.06 E-value=27 Score=15.72 Aligned_cols=57 Identities=14% Similarity=0.007 Sum_probs=35.1 Q ss_pred CCEEEEEECC------CCCHHHHHHHHH--CCCCEEEECCCCHHHHHH-------CCCCEEEECCCCCCCCC Q ss_conf 7149999679------641023222220--245413304657446441-------35788998588888410 Q gi|254780420|r 205 KYHVVCIDYG------IRSNLLRVLSSL--GCRITIVGAETSYKDIMS-------LQPDGVLLSNGPGDPAV 261 (396) Q Consensus 205 ~~kIvvIDfG------iK~nIlr~L~~~--g~~v~Vvp~~~~~~~i~~-------~~pdgi~lSnGPGdP~~ 261 (396) +.+||++-.| .=.-+++.|.++ ++.+.|+|-=+++...-+ ..-+-+.+.+++.+++. T Consensus 92 G~~Va~l~~GDp~~Yst~~yl~~~l~~~~~~i~vevIPGIsS~~aaAA~~g~pL~~~~e~l~Vip~~~~~~~ 163 (238) T PRK05948 92 GEDVAFACEGDVSLYSTFTYLAQTLRELHPQVAIQTIPGVCSPLAAAAALGLPLTLGSERLAILPALYRLEE 163 (238) T ss_pred CCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHCCCCCCCCCCEEEEEECCCCHHH T ss_conf 993999944683066648999999984199943997388216999999719974627850899967899899 No 411 >cd04865 LigD_Pol_like_2 LigD_Pol_like_2: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD, subgroup 2. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase domain, resulting in short stretches of RNA incorporated into the genomic DNA. The Pol domains of PaeLigD and Mycobacterium tuberculosis (Mt)LigD are stimulated by manganese, are error-prone, and prefer adding rNTPs to dNTPs in vitro; however PaeLigD and MtLigD belong to other subgroups, proteins in this subgroup await functional characterization. Probab=23.96 E-value=49 Score=13.89 Aligned_cols=55 Identities=24% Similarity=0.402 Sum_probs=32.9 Q ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEEC--CCCCCCCCCCCCCHHHHHHHCC Q ss_conf 023222220245413304657446441357889985--8888841011211012221025 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLS--NGPGDPAVTSVYSSPIICKLVD 275 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lS--nGPGdP~~~~~~~~~~~~~~~~ 275 (396) .-|-.|.+.|| ++.-||.+..+++. +||-+++- +|||.+-..-..+...++++++ T Consensus 73 ~~L~~lan~g~-iE~H~w~sr~~~~~--~PD~lvfDLDP~~~~~f~~v~~~A~~vr~~L~ 129 (228) T cd04865 73 AALLWLANLGC-IELHPWPSRAGDLD--HPDELVIDLDPQPGTSFEDVVEVALLVREVLD 129 (228) T ss_pred HHHHHHHHCCC-EEEECCCCCCCCCC--CCCEEEEECCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99999986294-67417877778778--99879982899988999999999999999999 No 412 >smart00475 53EXOc 5'-3' exonuclease. Probab=23.94 E-value=33 Score=15.10 Aligned_cols=26 Identities=15% Similarity=0.226 Sum_probs=11.6 Q ss_pred EECCCCHHHHHHHHHHCCCCCEEEECC Q ss_conf 211257689999998539842366327 Q gi|254780420|r 122 GLSGIDTRALTVWIRDYGVSNSVIAHH 148 (396) Q Consensus 122 gI~GIDTRaLTk~IRe~G~~~a~I~~~ 148 (396) +|.|. +|.|-+.+++...-..+++.+ T Consensus 32 ai~gf-~~~l~~l~~~~~p~~i~v~fD 57 (259) T smart00475 32 AVYGF-LRMLLKLIKEEKPTYVAVVFD 57 (259) T ss_pred HHHHH-HHHHHHHHHHCCCCEEEEEEE T ss_conf 79999-999999998769977999981 No 413 >pfam01702 TGT Queuine tRNA-ribosyltransferase. This is a family of queuine tRNA-ribosyltransferases EC:2.4.2.29, also known as tRNA-guanine transglycosylase and guanine insertion enzyme. Queuine tRNA-ribosyltransferase modifies tRNAs for asparagine, aspartic acid, histidine and tyrosine with queuine. It catalyses the exchange of guanine-34 at the wobble position with 7-aminomethyl-7-deazaguanine, and the addition of a cyclopentenediol moiety to 7-aminomethyl-7-deazaguanine-34 tRNA; giving a hypermodified base queuine in the wobble position. The aligned region contains a zinc binding motif C-x-C-x2-C-x29-H, and important tRNA and 7-aminomethyl-7deazaguanine binding residues. Probab=23.77 E-value=52 Score=13.74 Aligned_cols=69 Identities=17% Similarity=0.288 Sum_probs=39.9 Q ss_pred CHHHHHHCCCCEEEECC-CCCCCCCCCCCCHHHHHHHCCCCCC--EEEEECHHHHH-HHHCCCCCCCCCCCCC Q ss_conf 74464413578899858-8888410112110122210257863--47861005887-8752854466776752 Q gi|254780420|r 238 SYKDIMSLQPDGVLLSN-GPGDPAVTSVYSSPIICKLVDSGMP--MFGICLGHQLL-GLALGAQTVKMYQGHH 306 (396) Q Consensus 238 ~~~~i~~~~pdgi~lSn-GPGdP~~~~~~~~~~~~~~~~~~iP--ilGIClGhQ~l-~~a~G~~~~kl~~Ghr 306 (396) +++++.+.++||..|++ ++|.+...-....+.+...+....| ++|+.-=.+++ +.++|..+..-.+..| T Consensus 73 s~~~l~~~~~~GyaIgGl~~ge~~~~~~~~l~~~~~~Lp~~kPr~l~G~g~P~~i~~~v~~GiD~FD~~~ptr 145 (238) T pfam01702 73 SAEELAELDFDGYAIGGLSVGEEKEEMLELVDAVTPLLPEDKPRYLMGVGTPEDILEAVALGVDMFDCVFPTR 145 (238) T ss_pred HHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCEEECCCCCHHHHHHHHHHCCCCEECCCHHH T ss_conf 9999744899879856877788999999999999855887785640699999999999995888100341379 No 414 >cd00532 MGS-like MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase, which catalyzes the enolization of dihydroxyacetone phosphate (DHAP) to produce methylglyoxal. The family also includes the C-terminal domain in carbamoyl phosphate synthetase (CPS) where it catalyzes the last phosphorylation of a coaboxyphosphate intermediate to form the product carbamoyl phosphate and may also play a regulatory role. This family also includes inosine monophosphate cyclohydrolase. The known structures in this family show a common phosphate binding site. Probab=23.71 E-value=52 Score=13.74 Aligned_cols=66 Identities=17% Similarity=0.126 Sum_probs=39.7 Q ss_pred CHHHHHHHHHCCCCEEEECCC-----CH-HHHHH-CCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEE Q ss_conf 102322222024541330465-----74-46441-35788998588888410112110122210257863478 Q gi|254780420|r 217 SNLLRVLSSLGCRITIVGAET-----SY-KDIMS-LQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFG 282 (396) Q Consensus 217 ~nIlr~L~~~g~~v~Vvp~~~-----~~-~~i~~-~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilG 282 (396) ...-+.|.+.|..+..+.--. .. +.|.+ -+.|-|+-.+.|+...........+.+..++.++|.+- T Consensus 32 ~GTa~~L~~~Gi~~~~V~Ki~e~g~p~v~d~i~~~g~I~lVINt~~~~~~~~~~~Dg~~IRr~Av~~~IP~~T 104 (112) T cd00532 32 GGTSRVLADAGIPVRAVSKRHEDGEPTVDAAIAEKGKFDVVINLRDPRRDRCTDEDGTALLRLARLYKIPVTT 104 (112) T ss_pred HHHHHHHHHCCCEEEEEEECCCCCCCCHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCEEC T ss_conf 6899999987983489850487899889999866896799998888887733245489999999976999884 No 415 >PRK09615 ggt gamma-glutamyltranspeptidase periplasmic precursor; Reviewed Probab=23.70 E-value=47 Score=14.01 Aligned_cols=14 Identities=29% Similarity=0.285 Sum_probs=7.3 Q ss_pred CEEEEECHHHHHHH Q ss_conf 34786100588787 Q gi|254780420|r 279 PMFGICLGHQLLGL 292 (396) Q Consensus 279 PilGIClGhQ~l~~ 292 (396) |-.||.|-.|+-.. T Consensus 422 p~tGi~lNN~~~~F 435 (581) T PRK09615 422 GESGILLNNQMDDF 435 (581) T ss_pred CCCCEEECCCCCCC T ss_conf 89464776665355 No 416 >PRK03379 vitamin B12-transporter protein BtuF; Provisional Probab=23.69 E-value=52 Score=13.73 Aligned_cols=78 Identities=18% Similarity=0.168 Sum_probs=41.3 Q ss_pred EEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCC-CCHHHHCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCHHHHHHHC Q ss_conf 8537888876014161425617321689736845-882342046533321023666404654434552417989999742 Q gi|254780420|r 40 CFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVG-VNSEDFESISRKNFKGAVGLVIKAEITEPSNYRANMHFDAWLKSC 118 (396) Q Consensus 40 VFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyG-vn~~d~ES~~~~~~~~~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~ 118 (396) |=.|..+-|++... .-|-||+|. .|.|..-+..+- +|+.... ++ ....+ +-|++. T Consensus 44 Vgv~~~~dyP~~a~-----------~lp~VG~~~~~~~E~IlaL~PD-------LVia~~~---~~--~~~~~-~~L~~~ 99 (265) T PRK03379 44 VGVSSYSDYPPQAQ-----------KIEQVSTWQGMNLERIVALKPD-------LVLAWRG---GN--AERQV-DQLASL 99 (265) T ss_pred EEEECCCCCCHHHH-----------CCCCCCCCCCCCHHHHHHHCCC-------EEEEECC---CC--CHHHH-HHHHHC T ss_conf 88668899986785-----------4870278889899999963999-------8999568---89--58999-999816 Q ss_pred CCCEECCCCHHHH---HHHHHHCCCCC Q ss_conf 7832112576899---99998539842 Q gi|254780420|r 119 GIIGLSGIDTRAL---TVWIRDYGVSN 142 (396) Q Consensus 119 ~IpgI~GIDTRaL---Tk~IRe~G~~~ 142 (396) |||-+ -+|.+.| ...||.-|... T Consensus 100 GI~v~-~~~~~sl~di~~~i~~lG~~~ 125 (265) T PRK03379 100 GIKVM-WVDATSIEQIADALRQLAPWS 125 (265) T ss_pred CCEEE-ECCCCCHHHHHHHHHHHHHHC T ss_conf 97588-359999999999999999865 No 417 >cd06287 PBP1_LacI_like_8 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=23.47 E-value=52 Score=13.71 Aligned_cols=25 Identities=8% Similarity=0.227 Sum_probs=13.4 Q ss_pred HHHHHHCCCCCCEEEEECHHHHHHH Q ss_conf 1222102578634786100588787 Q gi|254780420|r 268 PIICKLVDSGMPMFGICLGHQLLGL 292 (396) Q Consensus 268 ~~~~~~~~~~iPilGIClGhQ~l~~ 292 (396) ....+++...-|.-+|+-....+|+ T Consensus 167 ~a~~~ll~~~~~~~AI~~~nD~~A~ 191 (269) T cd06287 167 AACAQLLAQHPDLDALCVPVDAFAV 191 (269) T ss_pred HHHHHHHCCCCCCCEEEEECHHHHH T ss_conf 9999997179999779982799999 No 418 >COG3243 PhaC Poly(3-hydroxyalkanoate) synthetase [Lipid metabolism] Probab=23.43 E-value=35 Score=14.97 Aligned_cols=81 Identities=17% Similarity=0.281 Sum_probs=46.3 Q ss_pred EEEEECCCCCHHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCCCCHHHHHHHCC-CCCCEEEEECH Q ss_conf 99996796410232222202454133046574464413578899858888841011211012221025-78634786100 Q gi|254780420|r 208 VVCIDYGIRSNLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSVYSSPIICKLVD-SGMPMFGICLG 286 (396) Q Consensus 208 IvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~-~~iPilGIClG 286 (396) --++|.=-..+.+|.|.+.|..|-|+.|-.+-....+.+.+--++ .... .+++.++.+.. .+|=.+|-|.| T Consensus 120 ~yI~DL~~~~s~V~~l~~~g~~vfvIsw~nPd~~~~~~~~edYi~-------e~l~-~aid~V~~itg~~dInliGyC~G 191 (445) T COG3243 120 FYILDLSPEKSLVRWLLEQGLDVFVISWRNPDASLAAKNLEDYIL-------EGLS-EAIDTVKDITGQKDINLIGYCVG 191 (445) T ss_pred EEEEECCCCCHHHHHHHHCCCCEEEEECCCCHHHHHHCCHHHHHH-------HHHH-HHHHHHHHHHCCCCCCEEEECCC T ss_conf 247867877408999997598368996158607655346999999-------9999-99999998747651235546145 Q ss_pred HHHHHHHCCC Q ss_conf 5887875285 Q gi|254780420|r 287 HQLLGLALGA 296 (396) Q Consensus 287 hQ~l~~a~G~ 296 (396) =-+++.|.+. T Consensus 192 Gt~~~~a~a~ 201 (445) T COG3243 192 GTLLAAALAL 201 (445) T ss_pred HHHHHHHHHH T ss_conf 6999999997 No 419 >COG4586 ABC-type uncharacterized transport system, ATPase component [General function prediction only] Probab=23.34 E-value=53 Score=13.69 Aligned_cols=53 Identities=19% Similarity=0.293 Sum_probs=35.7 Q ss_pred CCCEEEEEE-------CCCCCHHHHHHHHHCC--CCEEEECCCCHHHHHHCCCCEEEECCCC Q ss_conf 771499996-------7964102322222024--5413304657446441357889985888 Q gi|254780420|r 204 AKYHVVCID-------YGIRSNLLRVLSSLGC--RITIVGAETSYKDIMSLQPDGVLLSNGP 256 (396) Q Consensus 204 ~~~kIvvID-------fGiK~nIlr~L~~~g~--~v~Vvp~~~~~~~i~~~~pdgi~lSnGP 256 (396) .+++|+++| -=+|.||.+.|.+.+. ..+|+=..-.+++|..+.-.-+.|++|- T Consensus 173 h~p~VLfLDEpTvgLDV~aq~~ir~Flke~n~~~~aTVllTTH~~~di~~lc~rv~~I~~Gq 234 (325) T COG4586 173 HPPKVLFLDEPTVGLDVNAQANIREFLKEYNEERQATVLLTTHIFDDIATLCDRVLLIDQGQ 234 (325) T ss_pred CCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEEECCHHHHHHHHHHEEEEECCC T ss_conf 89838974487457514389999999999877537369998411213888634369960782 No 420 >COG2875 CobM Precorrin-4 methylase [Coenzyme metabolism] Probab=23.28 E-value=47 Score=14.05 Aligned_cols=93 Identities=19% Similarity=0.314 Sum_probs=61.2 Q ss_pred CCCCCHHHHHHHHHCCCCEEEECCCCHHH---HHHC------CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE Q ss_conf 79641023222220245413304657446---4413------57889985888884101121101222102578634786 Q gi|254780420|r 213 YGIRSNLLRVLSSLGCRITIVGAETSYKD---IMSL------QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI 283 (396) Q Consensus 213 fGiK~nIlr~L~~~g~~v~Vvp~~~~~~~---i~~~------~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI 283 (396) ||.=+-++|.|.++|...+|+|--+++.. .+.. --..|+|.-.-|. ... -..+.+..+... --.+.| T Consensus 89 YgA~~EQm~~L~~~gI~yevvPGVss~~AAAA~L~~ELT~P~vsQtvilTR~sgr-t~v--pe~e~l~~la~~-~aTm~I 164 (254) T COG2875 89 YGALAEQMRELEALGIPYEVVPGVSSFAAAAAALGIELTVPGVSQTVILTRPSGR-TPV--PEKESLAALAKH-GATMVI 164 (254) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHCCEEECCCCCEEEEEECCCCC-CCC--CCHHHHHHHHHC-CCEEEE T ss_conf 8899999999997699859937905999999983854405774106999713468-789--966589999854-862686 Q ss_pred ECHHHHHHHHCCCCCCCCCCCCCCCCCCCE Q ss_conf 100588787528544667767521011234 Q gi|254780420|r 284 CLGHQLLGLALGAQTVKMYQGHHGVNHPVK 313 (396) Q Consensus 284 ClGhQ~l~~a~G~~~~kl~~GhrG~N~Pv~ 313 (396) -||-|.|... +.+|.-||.+..-||- T Consensus 165 ~L~v~~I~~v----v~~L~~g~y~~dtPVa 190 (254) T COG2875 165 FLGVHAIDKV----VEELLEGGYPPDTPVA 190 (254) T ss_pred EEHHHHHHHH----HHHHHCCCCCCCCCEE T ss_conf 5255689999----9998607899999789 No 421 >pfam06511 IpaD Invasion plasmid antigen IpaD. This family consists of several invasion plasmid antigen IpaD proteins. Entry of Shigella flexneri into epithelial cells and lysis of the phagosome involve the IpaB, IpaC, and IpaD proteins, which are secreted by type III secretion machinery. Probab=23.25 E-value=53 Score=13.68 Aligned_cols=19 Identities=16% Similarity=0.483 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 1689999999999735788 Q gi|254780420|r 378 SYYLFSTFIDFMHKRKTMC 396 (396) Q Consensus 378 ~~~~F~~F~~~i~~~k~~~ 396 (396) +..|||+.+......-+.| T Consensus 311 ansl~dnlvkvlsstiss~ 329 (337) T pfam06511 311 ANSLFDNLVKVLSSTISSC 329 (337) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 0278999999999887878 No 422 >cd04861 LigD_Pol_like LigD_Pol_like: Polymerase (Pol) domain of bacterial LigD proteins similar to Pseudomonas aeruginosa (Pae) LigD. The LigD Pol domain belongs to the archaeal/eukaryal primase (AEP) superfamily. In prokaryotes, LigD along with Ku is required for non-homologous end joining (NHEJ)-mediated repair of DNA double-strand breaks (DSB). NHEJ-mediated DNA DSB repair is error-prone. PaeLigD is monomeric, containing an N-terminal phosphoesterase module, a central polymerase (Pol) domain, and a C-terminal ATP-dependent ligase domain. Mycobacterium tuberculosis (Mt)LigD, also found in this group, is monomeric and contains the same modules but these are arranged differently: an N-terminal Pol domain, a central phosphoesterase module, and a C-terminal ligase domain. It has been suggested that LigD Pol contributes to NHEJ-mediated DNA DSB repair in vivo, by filling in short 5'-overhangs with ribonucleotides; the filled in termini would then be sealed by the associated LigD ligase do Probab=23.23 E-value=49 Score=13.94 Aligned_cols=55 Identities=22% Similarity=0.288 Sum_probs=33.3 Q ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCCCCCCC--CCHHHHHHHCC Q ss_conf 02322222024541330465744644135788998588888410112--11012221025 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDPAVTSV--YSSPIICKLVD 275 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~~~~--~~~~~~~~~~~ 275 (396) .-|-.|.+.|| ++.-||.+..+++. +||-+++-=-||++..... .+...++++++ T Consensus 72 ~~L~~lan~~~-lE~H~w~sr~~~~~--~PD~lvfDLDP~~~~~f~~v~~~A~~~r~~L~ 128 (227) T cd04861 72 AGLVWLANLGA-IELHPWLSRADDLE--RPDRLVFDLDPGPGVPFEDVVEAALLLRELLD 128 (227) T ss_pred HHHHHHHHHCC-EEECCCCCCCCCCC--CCCEEEEECCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99999987093-76217767667778--99879997899879999999999999999999 No 423 >pfam08918 PhoQ_Sensor PhoQ Sensor. The PhoQ Sensor is required for the virulence of various Gram-negative bacteria by allowing interaction of PhoPQ with the intracellular membrane, resulting in remodelling of the bacterial cell surface and subsequent bacterial resistance to host antimicrobial peptides. The domain contains a major flat acidic surface, which binds to at least 3 calcium ions, neutralising the domain's negative charge and allowing interaction with the negatively charged membrane. Probab=23.18 E-value=49 Score=13.88 Aligned_cols=76 Identities=20% Similarity=0.328 Sum_probs=36.5 Q ss_pred HHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHCCCCCCCCHHHCCCCCCCEECCCCCCC Q ss_conf 99997427832112576899999985398423663276545897898887741588743210110133332111686534 Q gi|254780420|r 112 DAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERAKNWSGLKGAELAKHVTVSQRRDWSEKIWK 191 (396) Q Consensus 112 ~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~~~~~~~~~~dL~~~Vs~~~~~~~~~~~~~ 191 (396) .+||+++| .+.|||.-= +.++++.. -..+.++++++..-+...+.-.|.+.. |.- T Consensus 102 ~eWL~k~g---~~Eidtd~~--------~s~~~L~~-------~~~~q~~L~~~d~~d~~~lTHSVaVN~-Y~A------ 156 (180) T pfam08918 102 PEWLKKTG---FHELDTDTD--------TSSALLGN-------NPEAQDQLKAYDDNDDNELTHSVAVNV-YPA------ 156 (180) T ss_pred HHHCCCCC---CEEEECCCC--------HHHHHHCC-------CHHHHHHHHHCCCCCCCCCEEEEEEEC-CCC------ T ss_conf 97605799---468764765--------47887656-------989999987546778873268899852-315------ Q ss_pred CCCCCCCCCCCCCCCEEEEEECCCCCHHHHHHHH Q ss_conf 4543443467777714999967964102322222 Q gi|254780420|r 192 WGEETSFLKSSDAKYHVVCIDYGIRSNLLRVLSS 225 (396) Q Consensus 192 ~~~~~~~~~~~~~~~kIvvIDfGiK~nIlr~L~~ 225 (396) ...-....|||+|- |-.+|.+ T Consensus 157 --------T~~lP~LtIVVVDt-----IPqeLq~ 177 (180) T pfam08918 157 --------TARLPALTIVVVDT-----IPQELQR 177 (180) T ss_pred --------CCCCCCEEEEEECC-----CCHHHHH T ss_conf --------35798608999607-----8778742 No 424 >PRK11248 tauB taurine transporter ATP-binding subunit; Provisional Probab=22.99 E-value=41 Score=14.48 Aligned_cols=52 Identities=21% Similarity=0.412 Sum_probs=31.2 Q ss_pred CEEEEEE--C-----CCCCHHHHHHHH----HCCCCEEEECCCCHHHHHHCCCCEEEECCCCCCC Q ss_conf 1499996--7-----964102322222----0245413304657446441357889985888884 Q gi|254780420|r 206 YHVVCID--Y-----GIRSNLLRVLSS----LGCRITIVGAETSYKDIMSLQPDGVLLSNGPGDP 259 (396) Q Consensus 206 ~kIvvID--f-----GiK~nIlr~L~~----~g~~v~Vvp~~~~~~~i~~~~pdgi~lSnGPGdP 259 (396) ++|.++| + -.+..+.+.|.+ .|..+..+-+| .++...+.=.-+++++|||.- T Consensus 147 P~iLllDEPt~~LD~~~r~~l~~ll~~l~~~~g~Til~vTHd--l~ea~~ladrv~vm~~gp~~i 209 (255) T PRK11248 147 PQLLLLDEPFGALDAFTREQMQELLLKLWQETGKQVLLITHD--IEEAVFMATELVLLSPGPGRV 209 (255) T ss_pred CCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCC--HHHHHHHCCEEEEEECCCEEE T ss_conf 999998088777998999999999999999619999998868--999999699999982899089 No 425 >COG4825 Uncharacterized membrane-anchored protein conserved in bacteria [Function unknown] Probab=22.77 E-value=54 Score=13.61 Aligned_cols=186 Identities=21% Similarity=0.246 Sum_probs=108.6 Q ss_pred CCCEEEEECCCCEEEEEEECCCCEE----EEEEEEECCCCCCCCCCCCCCCCCCEEEECC---C-CCCCCCCCHHHHCCC Q ss_conf 8966899728987999975676248----8889853788887601416142561732168---9-736845882342046 Q gi|254780420|r 11 KPTAVLVLADGSVIEGMGCGATGSI----QAEICFNTSLTGYQEILTDPSYLGQIVNFTF---P-HIGNVGVNSEDFESI 82 (396) Q Consensus 11 ~~~A~L~LedGt~f~G~~fG~~~~~----~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~---P-~IGNyGvn~~d~ES~ 82 (396) .-+-.++-|+|.+|.|.--=+.++. .-|=.|.-+-.|.-|-++ ||+|--+-|.- | +|---||++-+..-. T Consensus 102 dGkKvrL~E~gl~~~G~R~~~rg~~~~~~~ia~~~~~A~~~l~~r~e--affgNTi~fi~sE~pllidg~gIPD~~n~~~ 179 (395) T COG4825 102 DGKKVRLREGGLVYKGDRRLIRGTERTDHDIADLMREAKSGLAERLE--AFFGNTIEFIKSESPLLIDGIGIPDLDNDLR 179 (395) T ss_pred CCCEEEEECCCEEECCCEEHCCCCCCCCHHHHHHHHHHHHHHHHHHH--HHHHCHHHHHHCCCHHHHCCCCCCCCCCHHC T ss_conf 67368872276111030000036503624499999987764999999--9860579887325644416889986301204 Q ss_pred CCCCCCEEEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHH Q ss_conf 53332102366640465443455241798999974278321125768999999853984236632765458978988877 Q gi|254780420|r 83 SRKNFKGAVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQFDLEDLKERA 162 (396) Q Consensus 83 ~~~~~~~~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~~d~~~~~~~~ 162 (396) +++ -+||... +|+-.--++|..|+++.++.-| |||+-|=+ +|+.|-...+|.-+++.- ..+.+.. T Consensus 180 gR~------VlIva~~---~~~~~dLk~lk~fi~ey~pvlI-gVdgaAD~--l~~~GykP~lIvGdpd~i-~~~~Lr~-- 244 (395) T COG4825 180 GRH------VLIVADE---PSFEDDLKSLKPFIKEYQPVLI-GVDGAADV--LRKAGYKPQLIVGDPDQI-STEALRC-- 244 (395) T ss_pred CCE------EEEECCC---CCHHHHHHHHHHHHHHHCCEEE-EECCHHHH--HHHCCCCCCEEECCCCHH-HHHHHHC-- T ss_conf 757------9997389---8668889998888875087799-70550788--997479864465380121-0888841-- Q ss_pred HHCCCCCCCCHHHCCCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEE--EECCCCCHHHHHHHHHCCCCEEEECCCCHH Q ss_conf 415887432101101333321116865344543443467777714999--967964102322222024541330465744 Q gi|254780420|r 163 KNWSGLKGAELAKHVTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVC--IDYGIRSNLLRVLSSLGCRITIVGAETSYK 240 (396) Q Consensus 163 ~~~~~~~~~dL~~~Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvv--IDfGiK~nIlr~L~~~g~~v~Vvp~~~~~~ 240 (396) - ++| |+- .|=+.- =+.++..+|.--+.+|.--+.. T Consensus 245 --g---------a~v------------------------------vlpad~dGhAp--GleRiQdLGvgAmTFP~~gsst 281 (395) T COG4825 245 --G---------AKV------------------------------VLPADADGHAP--GLERIQDLGVGAMTFPAAGSST 281 (395) T ss_pred --C---------CCE------------------------------EECCCCCCCCH--HHHHHHHCCCCEEECCCCCCHH T ss_conf --6---------626------------------------------65357888870--0789986286513136678602 Q ss_pred HH---HH--CCCCEEEECCCC Q ss_conf 64---41--357889985888 Q gi|254780420|r 241 DI---MS--LQPDGVLLSNGP 256 (396) Q Consensus 241 ~i---~~--~~pdgi~lSnGP 256 (396) ++ ++ ..+.-+++-++| T Consensus 282 DlAllLAd~hga~~lv~vG~~ 302 (395) T COG4825 282 DLALLLADHHGAALLVTVGHR 302 (395) T ss_pred HHHHHHHHCCCCCEEEECCCC T ss_conf 389998741582027861774 No 426 >pfam09998 DUF2239 Uncharacterized protein conserved in bacteria (DUF2239). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=22.76 E-value=54 Score=13.61 Aligned_cols=42 Identities=21% Similarity=0.336 Sum_probs=35.8 Q ss_pred EEEEEEECCCCCCCCCCCCCCHHHHHHHCCCCEECCCCHHHHHHHHHHCCC Q ss_conf 236664046544345524179899997427832112576899999985398 Q gi|254780420|r 90 AVGLVIKAEITEPSNYRANMHFDAWLKSCGIIGLSGIDTRALTVWIRDYGV 140 (396) Q Consensus 90 ~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~~IpgI~GIDTRaLTk~IRe~G~ 140 (396) --|+|-||.+.-|-|| +||..| ||=.-+--|.|+-.-|..+. T Consensus 72 KLGVVaREVTLLPRHW-------dWLa~Q--pGGASvaLRkLVd~Arr~~~ 113 (173) T pfam09998 72 KLGVVAREVTLLPRHW-------DWLAAQ--PGGASVALRKLVEEARRAHA 113 (173) T ss_pred CCCEEEEEEECCCHHH-------HHHHCC--CCCHHHHHHHHHHHHHHCCC T ss_conf 7552421453252467-------998169--98463999999999985388 No 427 >cd02070 corrinoid_protein_B12-BD B12 binding domain of corrinoid proteins. A family of small methanogenic corrinoid proteins that bind methyl-Co(III) 5-hydroxybenzimidazolylcobamide as a cofactor. They play a role on the methanogenesis from trimethylamine, dimethylamine or monomethylamine, which is initiated by a series of corrinoid-dependent methyltransferases. Probab=22.75 E-value=54 Score=13.61 Aligned_cols=84 Identities=17% Similarity=0.138 Sum_probs=45.9 Q ss_pred ECCCCCHHH-HHHHHHCCCCEEEECCCCHHHHH----HCCCCEEEECCCCC--CCCCCCCCCHHHHHHH-CCCCCCEEEE Q ss_conf 679641023-22222024541330465744644----13578899858888--8410112110122210-2578634786 Q gi|254780420|r 212 DYGIRSNLL-RVLSSLGCRITIVGAETSYKDIM----SLQPDGVLLSNGPG--DPAVTSVYSSPIICKL-VDSGMPMFGI 283 (396) Q Consensus 212 DfGiK~nIl-r~L~~~g~~v~Vvp~~~~~~~i~----~~~pdgi~lSnGPG--dP~~~~~~~~~~~~~~-~~~~iPilGI 283 (396) |+|. ||+ ..|+..|++|+-+-.|++.+++. +.+||-|-||--=- -|. ... .++.+++. +..++||+-= T Consensus 96 ~lG~--~mv~~~l~~~G~~V~~LG~~vp~e~~v~~~~~~~~divglS~~~~~~~~~-~~~-~i~~lr~~~~~~~v~i~vG 171 (201) T cd02070 96 DIGK--NLVATMLEANGFEVIDLGRDVPPEEFVEAVKEHKPDILGLSALMTTTMGG-MKE-VIEALKEAGLRDKVKVMVG 171 (201) T ss_pred HHHH--HHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHCCCEEEEECCCCCCHHH-HHH-HHHHHHHCCCCCCCEEEEE T ss_conf 8999--99999998789779977899997999999997298999996256688999-999-9999997289889859998 Q ss_pred EC-HHHHHHHHCCCCCC Q ss_conf 10-05887875285446 Q gi|254780420|r 284 CL-GHQLLGLALGAQTV 299 (396) Q Consensus 284 Cl-GhQ~l~~a~G~~~~ 299 (396) .. =.+-++...|+.-+ T Consensus 172 G~a~~~~~a~~~GAD~~ 188 (201) T cd02070 172 GAPVNQEFADEIGADGY 188 (201) T ss_pred CCCCCHHHHHHHCCCEE T ss_conf 80179999999298878 No 428 >PRK09604 putative DNA-binding/iron metalloprotein/AP endonuclease; Reviewed Probab=22.74 E-value=51 Score=13.78 Aligned_cols=48 Identities=21% Similarity=0.302 Sum_probs=29.2 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEE--ECHHHHHHHH Q ss_conf 57889985888884101121101222102578634786--1005887875 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGI--CLGHQLLGLA 293 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGI--ClGhQ~l~~a 293 (396) +.|+|=.+.|||=+-.......-...-....++|+.|| +.||-+.++- T Consensus 72 ~id~IAvT~gPGL~g~L~VG~~~Ak~La~~~~~Pli~VnH~~gHi~~~~l 121 (335) T PRK09604 72 DIDAIAVTAGPGLVGALLVGATAAKALALALNKPLIGVNHLEGHLLAARL 121 (335) T ss_pred HCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEECCHHHHHHHHHHC T ss_conf 89789994799961139999999999999809982411257777766540 No 429 >PTZ00340 O-sialoglycoprotein endopeptidase; Provisional Probab=22.60 E-value=54 Score=13.59 Aligned_cols=46 Identities=22% Similarity=0.281 Sum_probs=25.4 Q ss_pred CCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEE--CHHHHHH Q ss_conf 578899858888841011211012221025786347861--0058878 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGIC--LGHQLLG 291 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIC--lGhQ~l~ 291 (396) +.|+|-.+.|||=+...-....-...-....++|+.||- .||-+.+ T Consensus 77 did~IAvT~gPGL~g~L~VG~~~AK~La~~~~~Pli~VnHlegHi~~~ 124 (348) T PTZ00340 77 DISLICYTKGPGMGAPLAVGATVAKTLSLLWGKPLVGVNHCVAHIEMG 124 (348) T ss_pred CCCEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHHH T ss_conf 185799727998516589999999999998099835213024334212 No 430 >PRK13606 LPPG:FO 2-phospho-L-lactate transferase; Provisional Probab=22.34 E-value=55 Score=13.56 Aligned_cols=38 Identities=26% Similarity=0.607 Sum_probs=23.6 Q ss_pred CCCEEEECCCCCCCCCC--CCCCHHHHHH-HCCCCCCEEEEEC Q ss_conf 57889985888884101--1211012221-0257863478610 Q gi|254780420|r 246 QPDGVLLSNGPGDPAVT--SVYSSPIICK-LVDSGMPMFGICL 285 (396) Q Consensus 246 ~pdgi~lSnGPGdP~~~--~~~~~~~~~~-~~~~~iPilGICl 285 (396) +.|.|+| |||+|-.. .....+-+++ +.+.+.|+..||= T Consensus 186 ~AD~IVi--gPsnp~tSI~PiL~VpgireAL~~s~a~vVaVSP 226 (309) T PRK13606 186 DADVVVI--GPSNPVTSIGPILAVPGIREALRETGAPVVAVSP 226 (309) T ss_pred HCCCEEE--CCCCCEEEEEHHHCCHHHHHHHHHCCCCEEEECC T ss_conf 3992898--6897503531131572699999857998899897 No 431 >cd02065 B12-binding_like B12 binding domain (B12-BD). Most of the members bind different cobalamid derivates, like B12 (adenosylcobamide) or methylcobalamin or methyl-Co(III) 5-hydroxybenzimidazolylcobamide. This domain is found in several enzymes, such as glutamate mutase, methionine synthase and methylmalonyl-CoA mutase. Cobalamin undergoes a conformational change on binding the protein; the dimethylbenzimidazole group, which is coordinated to the cobalt in the free cofactor, moves away from the corrin and is replaced by a histidine contributed by the protein. The sequence Asp-X-His-X-X-Gly, which contains this histidine ligand, is conserved in many cobalamin-binding proteins. Not all members of this family contain the conserved binding motif. Probab=22.10 E-value=56 Score=13.52 Aligned_cols=39 Identities=21% Similarity=0.178 Sum_probs=28.3 Q ss_pred HHHHHHHHHCCCCEEEECCCCHHHHH----HCCCCEEEECCCC Q ss_conf 02322222024541330465744644----1357889985888 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGAETSYKDIM----SLQPDGVLLSNGP 256 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~~~~~~~i~----~~~pdgi~lSnGP 256 (396) .+...|...|+++..+..+.+.+++. ..+||.|-+|--= T Consensus 18 ~v~~~l~~~g~~v~~lg~~~~~e~~v~~~~~~~~dvVgiS~~~ 60 (125) T cd02065 18 IVAIALRDNGFEVIDLGVDVPPEEIVEAAKEEDADVVGLSALS 60 (125) T ss_pred HHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHCCCEEEEECCH T ss_conf 9999999789989989998899999999986089999985211 No 432 >TIGR01354 cyt_deam_tetra cytidine deaminase; InterPro: IPR006262 Cytidine deaminase is a small homotetrameric zinc metalloprotein. It is found in humans and most bacteria. A related homodimeric form, IPR006263 from INTERPRO, with a much larger subunit is found in E. coli and in Arabidopsis. Both types may act on deoxycytidine as well as cytidine.; GO: 0004126 cytidine deaminase activity, 0046087 cytidine metabolic process. Probab=22.08 E-value=56 Score=13.52 Aligned_cols=72 Identities=8% Similarity=0.170 Sum_probs=32.3 Q ss_pred CCCEE-EEEECCCCCHHHHHHHHHCCCCEEEEC--CCCHHH------HHH----CC----CCEEEECCCCCC-CCCCCCC Q ss_conf 77149-999679641023222220245413304--657446------441----35----788998588888-4101121 Q gi|254780420|r 204 AKYHV-VCIDYGIRSNLLRVLSSLGCRITIVGA--ETSYKD------IMS----LQ----PDGVLLSNGPGD-PAVTSVY 265 (396) Q Consensus 204 ~~~kI-vvIDfGiK~nIlr~L~~~g~~v~Vvp~--~~~~~~------i~~----~~----pdgi~lSnGPGd-P~~~~~~ 265 (396) .+++| |++.. +.+ +.+ .|++|+-=.| ..-+|. |.+ -+ .+.|.+.....+ +...=-. T Consensus 19 S~F~VGAa~~t--~dg--~~f--~G~NvENASY~~~~CAEr~Ai~~~~~~yP~GG~r~G~f~~~~~~~~~~~D~~~PCG~ 92 (135) T TIGR01354 19 SNFKVGAALLT--KDG--RVF--TGVNVENASYGLTICAERSAIGKAISAYPAGGRRDGKFVAIAVAASANDDSVSPCGA 92 (135) T ss_pred CCCCEEEEEEE--CCC--CEE--EEEEEEECCCCCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEEECCCCCCCCCCCH T ss_conf 68811699985--699--889--876342136767046899999999860887875355127999862698883267603 Q ss_pred CHHHHHHHCC-CCCCEE Q ss_conf 1012221025-786347 Q gi|254780420|r 266 SSPIICKLVD-SGMPMF 281 (396) Q Consensus 266 ~~~~~~~~~~-~~iPil 281 (396) +...+.++.. .+.||+ T Consensus 93 CRQvl~EF~~h~d~~i~ 109 (135) T TIGR01354 93 CRQVLAEFAKHPDTKII 109 (135) T ss_pred HHHHHHHHCCCCCCEEE T ss_conf 57778875089983389 No 433 >PRK13984 putative oxidoreductase; Provisional Probab=21.88 E-value=56 Score=13.49 Aligned_cols=40 Identities=25% Similarity=0.395 Sum_probs=25.1 Q ss_pred HHHHHHCCCCE---EEECCCCHHHHHHCCCCEEEECCCCCCCCC Q ss_conf 22222024541---330465744644135788998588888410 Q gi|254780420|r 221 RVLSSLGCRIT---IVGAETSYKDIMSLQPDGVLLSNGPGDPAV 261 (396) Q Consensus 221 r~L~~~g~~v~---Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~ 261 (396) ..|.+.|+++. .+--+.+++++++ .+|+|||.-|-+.|.. T Consensus 341 ~~i~~~GV~f~~n~~VGkDit~eeL~~-~yDAVfLa~Ga~~~r~ 383 (604) T PRK13984 341 AFIEALGVKIHTNTRVGKDISLEELRE-KHDAVFVSTGFTLGRS 383 (604) T ss_pred HHHHHCCCEEECCCEECCCCCHHHHHH-CCCEEEEECCCCCCCC T ss_conf 999972989976857798478999970-5899999538887766 No 434 >PRK07337 aminotransferase; Validated Probab=21.70 E-value=57 Score=13.47 Aligned_cols=13 Identities=46% Similarity=0.979 Sum_probs=5.8 Q ss_pred CCEEEECCCCCCCC Q ss_conf 78899858888841 Q gi|254780420|r 247 PDGVLLSNGPGDPA 260 (396) Q Consensus 247 pdgi~lSnGPGdP~ 260 (396) ..+|+|.| |.+|. T Consensus 164 tk~iil~n-P~NPt 176 (388) T PRK07337 164 TRGVLLAS-PSNPT 176 (388) T ss_pred CEEEEECC-CCCCC T ss_conf 53999789-97988 No 435 >cd01142 TroA_e Periplasmic binding protein TroA_e. These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains. Probab=21.53 E-value=57 Score=13.45 Aligned_cols=21 Identities=24% Similarity=0.545 Sum_probs=17.2 Q ss_pred CCCHHHHHHCCCCEEEECCCC Q ss_conf 657446441357889985888 Q gi|254780420|r 236 ETSYKDIMSLQPDGVLLSNGP 256 (396) Q Consensus 236 ~~~~~~i~~~~pdgi~lSnGP 256 (396) ..+.|.|++++||-||++++. T Consensus 209 ~is~E~i~~~nPDvI~~~~~~ 229 (289) T cd01142 209 EVSLEQLLKWNPDVIIVGNAD 229 (289) T ss_pred CCCHHHHHHHCCCEEEECCCC T ss_conf 428999986599889984764 No 436 >pfam01921 tRNA-synt_1f tRNA synthetases class I (K). This family includes only lysyl tRNA synthetases from prokaryotes. Probab=21.52 E-value=50 Score=13.83 Aligned_cols=21 Identities=48% Similarity=0.793 Sum_probs=16.3 Q ss_pred EEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCC Q ss_conf 8898537888876014161425617321689736845 Q gi|254780420|r 37 AEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVG 73 (396) Q Consensus 37 GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyG 73 (396) |++||.||+| || -.|||||-+ T Consensus 19 ~~~v~~tG~~--------PS--------G~~HIGnfr 39 (355) T pfam01921 19 GEILVETGIG--------PS--------GLPHIGNFR 39 (355) T ss_pred CCEEEECCCC--------CC--------CCCEEECCC T ss_conf 8589952668--------99--------886421560 No 437 >PRK12775 putative trifunctional 2-polyprenylphenol hydroxylase/glutamate synthase subunit beta/ferritin domain-containing protein; Provisional Probab=21.51 E-value=57 Score=13.44 Aligned_cols=42 Identities=24% Similarity=0.502 Sum_probs=27.0 Q ss_pred HHHHHHCCCC---EEEECCCCHHHHHH-CCCCEEEECCCCCCCCCC Q ss_conf 2222202454---13304657446441-357889985888884101 Q gi|254780420|r 221 RVLSSLGCRI---TIVGAETSYKDIMS-LQPDGVLLSNGPGDPAVT 262 (396) Q Consensus 221 r~L~~~g~~v---~Vvp~~~~~~~i~~-~~pdgi~lSnGPGdP~~~ 262 (396) ..|.+.|+++ +++--+.+++++++ ..+|+|||.-|-|.|... T Consensus 490 ~~l~~lGVef~~n~~VGkditl~eL~~e~gyDAVFIg~GA~~~~~l 535 (993) T PRK12775 490 QRLKDIGVKFETNKVVGKTFTIPQLMNDRGFDAVFVAAGAGAPTFL 535 (993) T ss_pred HHHHHCCEEEECCCEECCCCCHHHHHHCCCCCEEEEECCCCCCCCC T ss_conf 9998789499838886560789999534598989995587867318 No 438 >PRK01213 phosphoribosylformylglycinamidine synthase II; Provisional Probab=21.49 E-value=55 Score=13.55 Aligned_cols=40 Identities=30% Similarity=0.364 Sum_probs=26.1 Q ss_pred CCCEEEEEEECCCCEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCCH Q ss_conf 898799997567624888898537888876014161425617321689736845882 Q gi|254780420|r 20 DGSVIEGMGCGATGSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVGVNS 76 (396) Q Consensus 20 dGt~f~G~~fG~~~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyGvn~ 76 (396) .|-.-.|..||-+. +.||+.| ||+|.+.-||+. |-+|+=. T Consensus 149 ~G~s~YGN~~G~Pt-v~G~~~F------------~~~y~~nplv~a----~~vG~v~ 188 (736) T PRK01213 149 AGIGGYGNCVGVPT-VGGEVYF------------DPSYNGNPLVNA----MCVGLVE 188 (736) T ss_pred CCHHHCCCCCCCCC-CCEEEEE------------CCCCCCCEEEEE----EEEEEEE T ss_conf 43210574446575-3338897------------156477623675----5788741 No 439 >KOG1907 consensus Probab=21.48 E-value=44 Score=14.27 Aligned_cols=88 Identities=16% Similarity=0.239 Sum_probs=44.4 Q ss_pred CCCCEEEEEE-CCCCC--HHHHHHHHHCCCCEEEECCCCHHHHHHC-----CCCEEEECCCC--CCCC------CCCCC- Q ss_conf 7771499996-79641--0232222202454133046574464413-----57889985888--8841------01121- Q gi|254780420|r 203 DAKYHVVCID-YGIRS--NLLRVLSSLGCRITIVGAETSYKDIMSL-----QPDGVLLSNGP--GDPA------VTSVY- 265 (396) Q Consensus 203 ~~~~kIvvID-fGiK~--nIlr~L~~~g~~v~Vvp~~~~~~~i~~~-----~pdgi~lSnGP--GdP~------~~~~~- 265 (396) +..+|||+|- =|+-- -+.-++...|+++.= .+-.|+++- ++-||..-+|= ||-- +.... T Consensus 1056 s~~PkVAilREeGvNg~rEMa~af~~AgF~~~D----VtmtDlL~G~~~ld~frGlaf~GGFSYaDvLgSakGWAasil~ 1131 (1320) T KOG1907 1056 STAPKVAILREEGVNGDREMAAAFYAAGFETVD----VTMTDLLAGRHHLDDFRGLAFCGGFSYADVLGSAKGWAASILF 1131 (1320) T ss_pred CCCCCEEEEECCCCCCHHHHHHHHHHCCCCEEE----EEEEHHHCCCEEHHHHCCEEEECCCCHHHHHCCCCCHHHHEEE T ss_conf 678706886110556279999999985881364----4620543571258670544652574367651454332443010 Q ss_pred ---CHHHHHHHCC-CCCCEEEEECHHHHHHHHC Q ss_conf ---1012221025-7863478610058878752 Q gi|254780420|r 266 ---SSPIICKLVD-SGMPMFGICLGHQLLGLAL 294 (396) Q Consensus 266 ---~~~~~~~~~~-~~iPilGIClGhQ~l~~a~ 294 (396) ...-+.++.. ...=-||||=|.|++++-- T Consensus 1132 ne~v~~QF~~F~~R~DtFslGiCNGCQlms~Lg 1164 (1320) T KOG1907 1132 NESVRSQFEAFFNRQDTFSLGICNGCQLMSRLG 1164 (1320) T ss_pred CHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHC T ss_conf 733779999985378731220241759999864 No 440 >cd06275 PBP1_PurR Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. This dimeric PurR belongs to the LacI-GalR family of transcription regulators and is activated to bind to DNA operator sites by initially binding either of high affinity corepressors, hypoxanthine or guanine. PurR is composed of two functional domains: aan N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the purine transcription repressor undergoes a Probab=21.47 E-value=57 Score=13.44 Aligned_cols=12 Identities=0% Similarity=0.110 Sum_probs=4.5 Q ss_pred HHHHHHHHHHHH Q ss_conf 689999999999 Q gi|254780420|r 379 YYLFSTFIDFMH 390 (396) Q Consensus 379 ~~~F~~F~~~i~ 390 (396) ..-++.++++|+ T Consensus 238 ~~a~~~L~~~i~ 249 (269) T cd06275 238 ELAVNMLLERIK 249 (269) T ss_pred HHHHHHHHHHHC T ss_conf 999999999966 No 441 >PRK01917 hemerythrin family protein; Provisional Probab=21.41 E-value=57 Score=13.46 Aligned_cols=14 Identities=21% Similarity=0.311 Sum_probs=5.6 Q ss_pred HHHHHHHHCCCCEE Q ss_conf 23222220245413 Q gi|254780420|r 219 LLRVLSSLGCRITI 232 (396) Q Consensus 219 Ilr~L~~~g~~v~V 232 (396) ..+.|++.|+.+.+ T Consensus 124 la~~Lk~~G~~~~~ 137 (142) T PRK01917 124 MVSHLKMLGFAMVD 137 (142) T ss_pred HHHHHHHCCCCCCC T ss_conf 99999980982013 No 442 >TIGR01922 purO_arch IMP cyclohydrolase; InterPro: IPR010191 This entry represents IMP cyclohydrolase, the final step in the biosynthesis of inosine monophosphate (IMP) in archaea . In bacteria this step is catalysed by a bifunctional enzyme (purH).; GO: 0003937 IMP cyclohydrolase activity, 0006164 purine nucleotide biosynthetic process, 0006188 IMP biosynthetic process. Probab=21.39 E-value=56 Score=13.50 Aligned_cols=71 Identities=24% Similarity=0.278 Sum_probs=41.5 Q ss_pred HHHHHHCC-CCEEEECCCCHHHHHHCCC-----------CEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEECHHH Q ss_conf 22222024-5413304657446441357-----------88998588888410112110122210257863478610058 Q gi|254780420|r 221 RVLSSLGC-RITIVGAETSYKDIMSLQP-----------DGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICLGHQ 288 (396) Q Consensus 221 r~L~~~g~-~v~Vvp~~~~~~~i~~~~p-----------dgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGIClGhQ 288 (396) |.+..+.- .+.|||-+-+..+-. -|| +-+++|||- +.+ .+.+ |+ +.+.|. T Consensus 31 R~~~~~~~n~~~vvP~~~~e~~~f-rNPYvtYnC~rivd~~~Vv~NG~----htD-~IAd---Kl-~~G~pp-------- 92 (209) T TIGR01922 31 RKIEVREENAVAVVPEDDSEITEF-RNPYVTYNCIRIVDETAVVSNGS----HTD-PIAD---KL-ELGVPP-------- 92 (209) T ss_pred CEEEECCCCCEEEEECCCCCCEEE-CCCCEEEEEEEEECCEEEECCCC----CCC-CHHH---HH-HCCCCH-------- T ss_conf 715660798379840688720020-48856554466646868973687----226-1134---55-448986-------- Q ss_pred HHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 87875285446677675210112 Q gi|254780420|r 289 LLGLALGAQTVKMYQGHHGVNHP 311 (396) Q Consensus 289 ~l~~a~G~~~~kl~~GhrG~N~P 311 (396) =-|++.-+.=|-|-|---|-| T Consensus 93 --RdAL~~~l~aMDYEkDeY~TP 113 (209) T TIGR01922 93 --RDALASVLLAMDYEKDEYNTP 113 (209) T ss_pred --HHHHHHHHHHEEECCCCCCCC T ss_conf --898885131001013688877 No 443 >cd00674 LysRS_core_class_I This is the catalytic core domain of lysyl tRNA synthetase (LysRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding. The class I LysRS is found only in archaea and some bacteria and has evolved separately from class II LysRS, as the two do not share structural or sequence similarity. Probab=21.39 E-value=56 Score=13.50 Aligned_cols=22 Identities=27% Similarity=0.540 Sum_probs=14.7 Q ss_pred EEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCCCC Q ss_conf 88898537888876014161425617321689736845 Q gi|254780420|r 36 QAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGNVG 73 (396) Q Consensus 36 ~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGNyG 73 (396) -+++||+||+| || -.|||||.+ T Consensus 18 k~~~v~~tG~~--------PS--------G~~HIGn~r 39 (354) T cd00674 18 KEKYVVASGIS--------PS--------GLIHIGNFR 39 (354) T ss_pred CCCEEEECCCC--------CC--------CCCCCCCCC T ss_conf 98589947879--------99--------886320550 No 444 >PRK06425 histidinol-phosphate aminotransferase; Validated Probab=21.36 E-value=58 Score=13.42 Aligned_cols=35 Identities=17% Similarity=0.287 Sum_probs=14.5 Q ss_pred HHCCCCEEEECCC---CHHHHHHCCCCEEEECCCCCCCC Q ss_conf 2024541330465---74464413578899858888841 Q gi|254780420|r 225 SLGCRITIVGAET---SYKDIMSLQPDGVLLSNGPGDPA 260 (396) Q Consensus 225 ~~g~~v~Vvp~~~---~~~~i~~~~pdgi~lSnGPGdP~ 260 (396) ..|+++..+|.+. ..+.+.+.+||-|||.| |.+|. T Consensus 98 ~~g~~~~~vp~~~~~~~~~~~~~~~~~lv~i~n-PNNPT 135 (332) T PRK06425 98 THGIRISALPFNLINNNPEILNNYNFDLIFIVS-PDNPL 135 (332) T ss_pred HCCCEEEEECCCCCCCCHHHHHCCCCCEEEEEC-CCCCC T ss_conf 869989995743332588888458989899941-81720 No 445 >TIGR01735 FGAM_synt phosphoribosylformylglycinamidine synthase; InterPro: IPR010073 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis. This form is found mostly in eukaryotes and proteobacteria. In Bacillus subtilis PurL (FGAM synthase II) and PurQ (FGAM synthase I) perform the equivalent function; the unrelated small protein PurS is also required and may be a third subunit .; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process. Probab=21.24 E-value=42 Score=14.35 Aligned_cols=166 Identities=16% Similarity=0.225 Sum_probs=86.0 Q ss_pred CCCEECCCCHHHHHHHHHHCCCCCEEEECCCCCC-------------------------CCHHHHHHHHHHCCCCCCCCH Q ss_conf 7832112576899999985398423663276545-------------------------897898887741588743210 Q gi|254780420|r 119 GIIGLSGIDTRALTVWIRDYGVSNSVIAHHPDGQ-------------------------FDLEDLKERAKNWSGLKGAEL 173 (396) Q Consensus 119 ~IpgI~GIDTRaLTk~IRe~G~~~a~I~~~~~~~-------------------------~d~~~~~~~~~~~~~~~~~dL 173 (396) =|.=+..=|+.+.+.++|++|..-.++....... .+.-.-.+++++-|.--..+- T Consensus 1015 aV~qV~~~~~~~v~~~~~~~~L~~~~~~~~~~~~~~~i~i~~~~~~~~s~~~~~L~~~W~ets~Q~qRLrdnp~Ca~~E~ 1094 (1401) T TIGR01735 1015 AVIQVAEEDLAAVLELLRAAGLTALILGIGTPTDEPSIEISVNGATLLSEKRSELRDIWEETSFQLQRLRDNPECAEEEF 1094 (1401) T ss_pred CEEEECCCHHHHHHHHHHHCCCCEEEEECCCCCCCCEEEEEECCEEEEHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 36883241178999999864871333131401567778986088465025689999988889999998323733699999 Q ss_pred HHC-------CCCCCCEECCCCCCCCCCCCCCCCCCCCCCEEEEEE-CCCCC--HHHHHHHHHCCCCEEEECCCCHHHHH Q ss_conf 110-------133332111686534454344346777771499996-79641--02322222024541330465744644 Q gi|254780420|r 174 AKH-------VTVSQRRDWSEKIWKWGEETSFLKSSDAKYHVVCID-YGIRS--NLLRVLSSLGCRITIVGAETSYKDIM 243 (396) Q Consensus 174 ~~~-------Vs~~~~~~~~~~~~~~~~~~~~~~~~~~~~kIvvID-fGiK~--nIlr~L~~~g~~v~Vvp~~~~~~~i~ 243 (396) ... .+.+-.|+.++.+ .....+.+.++|||+|= =|+=- =+--.+.+-||++. |..-+|++ T Consensus 1095 ~~~~~~~~p~l~~~l~fd~~~d~------~~P~I~~G~~PkvAILREqG~NG~~EMAaAF~rAGF~~~----DVhmsDL~ 1164 (1401) T TIGR01735 1095 EGLKDRDEPGLKLSLTFDVNEDI------AAPFINKGVKPKVAILREQGVNGDIEMAAAFDRAGFEAI----DVHMSDLL 1164 (1401) T ss_pred HHHCCCCCCCCCCCCCCCCCCCC------CCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHCCCCEE----EEEEEEHC T ss_conf 98405788862023478865555------686223689861788850686700689999974589578----52310010 Q ss_pred H-----CCCCEEEECCC--CCCCCCCCC-------C---CHHHHHHHCC-CCCCEEEEECHHHHHH-HHC Q ss_conf 1-----35788998588--888410112-------1---1012221025-7863478610058878-752 Q gi|254780420|r 244 S-----LQPDGVLLSNG--PGDPAVTSV-------Y---SSPIICKLVD-SGMPMFGICLGHQLLG-LAL 294 (396) Q Consensus 244 ~-----~~pdgi~lSnG--PGdP~~~~~-------~---~~~~~~~~~~-~~iPilGIClGhQ~l~-~a~ 294 (396) . ..+.||+.-+| -||-=...+ . ...-+.+++. .+-=-||||=|-|+|+ |+- T Consensus 1165 ~Grv~Ld~F~glvacGGFSYgDVLGag~GWA~SIlF~~~~~~qF~~Ff~rpdtf~LGVCNGCQm~snL~~ 1234 (1401) T TIGR01735 1165 AGRVHLDEFRGLVACGGFSYGDVLGAGKGWAKSILFNPRLRDQFAAFFKRPDTFSLGVCNGCQMLSNLLE 1234 (1401) T ss_pred CCCCCHHHCCEEEEECCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCEEECCCHHHHHHHHHHH T ss_conf 0566522455589971753021301124467766405778999998605888423032107899997643 No 446 >COG0615 TagD Cytidylyltransferase [Cell envelope biogenesis, outer membrane / Lipid metabolism] Probab=21.10 E-value=58 Score=13.39 Aligned_cols=23 Identities=26% Similarity=0.453 Sum_probs=18.9 Q ss_pred ECCCCHHHHHHCCCCEEEECCCC Q ss_conf 04657446441357889985888 Q gi|254780420|r 234 GAETSYKDIMSLQPDGVLLSNGP 256 (396) Q Consensus 234 p~~~~~~~i~~~~pdgi~lSnGP 256 (396) |++.+++.+..++||-|++.+-+ T Consensus 74 p~~~~~~~i~~~k~Div~lG~D~ 96 (140) T COG0615 74 PWDIKFEDIEEYKPDIVVLGDDQ 96 (140) T ss_pred CCCCCHHHHHHHCCCEEEECCCC T ss_conf 65467689999599999977987 No 447 >PRK08574 cystathionine gamma-synthase; Provisional Probab=21.05 E-value=58 Score=13.38 Aligned_cols=16 Identities=19% Similarity=0.189 Sum_probs=8.6 Q ss_pred CCCCHHHHHHHHHHHH Q ss_conf 6700168999999999 Q gi|254780420|r 374 GPQDSYYLFSTFIDFM 389 (396) Q Consensus 374 GP~D~~~~F~~F~~~i 389 (396) |=.|..-|.+++.+-+ T Consensus 366 GlEd~~DLi~DL~~AL 381 (384) T PRK08574 366 GLEDVEDLIEDLDQAL 381 (384) T ss_pred CCCCHHHHHHHHHHHH T ss_conf 1099999999999998 No 448 >cd07062 Peptidase_S66_mccF_like Microcin C7 self-immunity protein determines resistance to exogenous microcin C7. Microcin C7 self-immunity protein (mccF): MccF, a homolog of the LD-carboxypeptidase family, mediates resistance against exogenously added microcin C7 (MccC7), a ribosomally-encoded peptide antibiotic that contains a phosphoramidate linkage to adenosine monophosphate at its C-terminus. The plasmid-encoded mccF gene is transcribed in the opposite direction to the other five genes (mccA-E) and is required for the full expression of immunity but not for production. The catalytic triad residues (Ser, His, Glu) of LD-carboxypeptidase are also conserved in MccF, strongly suggesting that MccF shares the hydrolytic activity with LD-carboxypeptidases. Substrates of MccF have not been deduced, but could likely be microcin C7 precursors. The possible role of MccF is to defend producer cells against exogenous microcin from re-entering after having been exported. It is suggested that M Probab=20.88 E-value=59 Score=13.35 Aligned_cols=59 Identities=22% Similarity=0.310 Sum_probs=33.9 Q ss_pred CCCCCCCCCCCCEEEECCCCCCCCCCCH----HHHCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCHHHHHHHC-CCCEE Q ss_conf 6014161425617321689736845882----342046533321023666404654434552417989999742-78321 Q gi|254780420|r 49 QEILTDPSYLGQIVNFTFPHIGNVGVNS----EDFESISRKNFKGAVGLVIKAEITEPSNYRANMHFDAWLKSC-GIIGL 123 (396) Q Consensus 49 qE~lTDPSY~gQIlvfT~P~IGNyGvn~----~d~ES~~~~~~~~~~g~Iv~~~~~~~Sn~rs~~sL~~wL~~~-~IpgI 123 (396) ++++.||+-..= ++..|-||... -|++.....+ . ++--+++. ..|..+|..+ +++.+ T Consensus 59 ~~a~~Dp~i~aI-----~~~rGGyga~rlL~~LD~~~i~~~P-K-----~~iGySDi-------TaL~~al~~~~g~~t~ 120 (308) T cd07062 59 MAAFADPSIKAI-----IPTIGGDDSNELLPYLDYELIKKNP-K-----IFIGYSDI-------TALHLAIYKKTGLVTY 120 (308) T ss_pred HHHHHCCCCCEE-----EECCCCCCHHHHHHHCCHHHHHHCC-C-----EEEECCHH-------HHHHHHHHHHCCCEEE T ss_conf 998629998999-----9917877788887646755686399-4-----89971464-------7999999997298699 Q ss_pred CC Q ss_conf 12 Q gi|254780420|r 124 SG 125 (396) Q Consensus 124 ~G 125 (396) +| T Consensus 121 hG 122 (308) T cd07062 121 YG 122 (308) T ss_pred EC T ss_conf 78 No 449 >COG0159 TrpA Tryptophan synthase alpha chain [Amino acid transport and metabolism] Probab=20.85 E-value=59 Score=13.35 Aligned_cols=36 Identities=19% Similarity=0.254 Sum_probs=17.4 Q ss_pred EEEEECCCCCH--HHHHHHHHCCC-CEEEECCCCHHHHH Q ss_conf 99996796410--23222220245-41330465744644 Q gi|254780420|r 208 VVCIDYGIRSN--LLRVLSSLGCR-ITIVGAETSYKDIM 243 (396) Q Consensus 208 IvvIDfGiK~n--Ilr~L~~~g~~-v~Vvp~~~~~~~i~ 243 (396) +++.|+-.-+. +.....++|.+ +-.++.+++.+.+. T Consensus 126 livpDLP~ee~~~~~~~~~~~gi~~I~lvaPtt~~~rl~ 164 (265) T COG0159 126 LLVPDLPPEESDELLKAAEKHGIDPIFLVAPTTPDERLK 164 (265) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHH T ss_conf 985789866777899999976986798869999989999 No 450 >PRK12809 putative oxidoreductase Fe-S binding subunit; Reviewed Probab=20.85 E-value=59 Score=13.35 Aligned_cols=40 Identities=18% Similarity=0.409 Sum_probs=25.7 Q ss_pred HHHHHHCCCCE---EEECCCCHHHHHHCCCCEEEECCCCCCCCC Q ss_conf 22222024541---330465744644135788998588888410 Q gi|254780420|r 221 RVLSSLGCRIT---IVGAETSYKDIMSLQPDGVLLSNGPGDPAV 261 (396) Q Consensus 221 r~L~~~g~~v~---Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~ 261 (396) ..|.+.|+++. .+--+.+++++++ ++|+|||.-|-+.|.. T Consensus 368 ~~l~~lGV~f~~n~~VGkDit~~eL~~-~yDAVflg~Ga~~~~~ 410 (639) T PRK12809 368 EIFTAMGIDFHLNCEIGRDITFSDLTS-EYDAVFIGVGTYGMMR 410 (639) T ss_pred HHHHHCCCEEEECCEECCCCCHHHHHH-HCCEEEEEECCCCCCC T ss_conf 999864988991967798688999973-1798999736788854 No 451 >PRK06848 hypothetical protein; Validated Probab=20.77 E-value=59 Score=13.34 Aligned_cols=12 Identities=25% Similarity=0.479 Sum_probs=5.8 Q ss_pred CEEEEECHHHHHHH Q ss_conf 34786100588787 Q gi|254780420|r 279 PMFGICLGHQLLGL 292 (396) Q Consensus 279 PilGIClGhQ~l~~ 292 (396) +==|+|. |.|.. T Consensus 95 sPCG~CR--Qvl~E 106 (139) T PRK06848 95 SPCGACR--ELISD 106 (139) T ss_pred CCCHHHH--HHHHH T ss_conf 7827899--99999 No 452 >PTZ00170 D-ribulose-5-phosphate 3-epimerase; Provisional Probab=20.75 E-value=59 Score=13.34 Aligned_cols=25 Identities=16% Similarity=0.441 Sum_probs=17.7 Q ss_pred EEEECCCCCHHHHHHHHHCCCCEEE Q ss_conf 9996796410232222202454133 Q gi|254780420|r 209 VCIDYGIRSNLLRVLSSLGCRITIV 233 (396) Q Consensus 209 vvIDfGiK~nIlr~L~~~g~~v~Vv 233 (396) +-+|=|+...-++.|.+.|+++.|. T Consensus 173 I~VDGGIn~~ti~~l~~aGad~~V~ 197 (224) T PTZ00170 173 IQVDGGINPDTIDLAAEAGANVIVA 197 (224) T ss_pred EEEECCCCHHHHHHHHHCCCCEEEE T ss_conf 9995899989999999869999997 No 453 >TIGR01087 murD UDP-N-acetylmuramoylalanine--D-glutamate ligase; InterPro: IPR005762 The bacterial cell wall provides strength and rigidity to counteract internal osmotic pressure, and protection against the environment. The peptidoglycan layer gives the cell wall its strength, and helps maintain the overall shape of the cell. The basic peptidoglycan structure of both Gram-positive and Gram-negative bacteria is comprised of a sheet of glycan chains connected by short cross-linking polypeptides. Biosynthesis of peptidoglycan is a multi-step (11-12 steps) process comprising three main stages: (1) formation of UDP-N-acetylmuramic acid (UDPMurNAc) from N-acetylglucosamine (GlcNAc). (2) addition of a short polypeptide chain to the UDPMurNAc. (3) addition of a second GlcNAc to the disaccharide-pentapeptide building block and transport of this unit through the cytoplasmic membrane and incorporation into the growing peptidoglycan layer. Stage two involves four key Mur ligase enzymes: MurC (6.3.2.8 from EC) , MurD (6.3.2.9 from EC) , MurE (6.3.2.13 from EC) and MurF (6.3.2.10 from EC) . These four Mur ligases are responsible for the successive additions of L-alanine, D-glutamate, meso-diaminopimelate or L-lysine, and D-alanyl-D-alanine to UDP-N-acetylmuramic acid. All four Mur ligases are topologically similar to one another, even though they display low sequence identity. They are each composed of three domains: an N-terminal Rossmann-fold domain responsible for binding the UDPMurNAc substrate; a central domain (similar to ATP-binding domains of several ATPases and GTPases); and a C-terminal domain (similar to dihydrofolate reductase fold) that appears to be associated with binding the incoming amino acid. The conserved sequence motifs found in the four Mur enzymes also map to other members of the Mur ligase family, including folylpolyglutamate synthetase, cyanophycin synthetase and the capB enzyme from Bacillales . This entry represents UDP-N-acetylmuramoylalanine-D-glutamate ligase (MurD). MurD catalyses the addition of d-glutamate to the nucleotide precursor UDP-N-acetylmuramoyl-l-alanine. ; GO: 0005524 ATP binding, 0008764 UDP-N-acetylmuramoylalanine-D-glutamate ligase activity, 0008360 regulation of cell shape, 0009252 peptidoglycan biosynthetic process, 0009273 peptidoglycan-based cell wall biogenesis, 0051301 cell division, 0005737 cytoplasm. Probab=20.74 E-value=59 Score=13.34 Aligned_cols=38 Identities=16% Similarity=0.369 Sum_probs=25.0 Q ss_pred CCCHHHHHH----HCCCCEECCCC-----HHHHHHHHHHCCCCCEEEE Q ss_conf 179899997----42783211257-----6899999985398423663 Q gi|254780420|r 108 NMHFDAWLK----SCGIIGLSGID-----TRALTVWIRDYGVSNSVIA 146 (396) Q Consensus 108 ~~sL~~wL~----~~~IpgI~GID-----TRaLTk~IRe~G~~~a~I~ 146 (396) +..|--++. ..++.||+|=+ |+.|+..++..| .++.++ T Consensus 96 di~L~~~~~~~~~~~~~~aITGTnGKtTTT~L~~~~L~~~G-~~a~~g 142 (476) T TIGR01087 96 DIELALRLVDFPEPAKVVAITGTNGKTTTTSLLYHLLKAAG-LKAVLG 142 (476) T ss_pred HHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHCC-CCEEEE T ss_conf 67999974157898737999726860799999999998469-977862 No 454 >TIGR03351 PhnX-like phosphonatase-like hydrolase. This clade of sequences are the closest homologs to the PhnX enzyme, phosphonoacetaldehyde (Pald) hydrolase (phosphonatase, TIGR01422). This phosphonatase-like enzyme and PhnX itself are members of the haloacid dehalogenase (HAD) superfamily (pfam00702) having a a number of distinctive features that set them apart from typical HAD enzymes. The typical HAD N-terminal motif DxDx(T/V) here is DxAGT and the usual conserved lysine prior to the C-terminal motif is instead an arginine. Also distinctive of phosphonatase, and particular to its bi-catalytic mechanism is a conserved lysine in the variable "cap" domain. This lysine forms a Schiff base with the aldehyde of phosphonoacetaldehyde, providing, through the resulting positive charge, a polarization of the C-P bond necesary for cleavage as well as a route to the initial product of cleavage, an ene-amine. The conservation of these elements in this phosphonatase-like enzyme suggests that the Probab=20.64 E-value=59 Score=13.32 Aligned_cols=32 Identities=13% Similarity=0.126 Sum_probs=17.1 Q ss_pred HHHHCCCCEECCCCHHHHHHHHHHCCCCCEEEEC Q ss_conf 9974278321125768999999853984236632 Q gi|254780420|r 114 WLKSCGIIGLSGIDTRALTVWIRDYGVSNSVIAH 147 (396) Q Consensus 114 wL~~~~IpgI~GIDTRaLTk~IRe~G~~~a~I~~ 147 (396) +...+.+.-+-|+ +.|...||++|...|+.++ T Consensus 80 ~~~~~~~~~~pG~--~elL~~L~~~gi~~av~T~ 111 (220) T TIGR03351 80 AYDDGPPVALPGA--EEAFRSLRSSGIKVALTTG 111 (220) T ss_pred HHHCCCCEECCCH--HHHHHHHHHCCCCEEEECC T ss_conf 9861898067119--9999999987997766359 No 455 >pfam07801 DUF1647 Protein of unknown function (DUF1647). The sequences making up this family are all derived from hypothetical proteins expressed by C. elegans. The region in question is approximately 160 amino acids long. The GO annotation for this protein indicates the protein to be involved in nematode larval development and to have a positive regulation on growth rate. Probab=20.54 E-value=60 Score=13.31 Aligned_cols=48 Identities=10% Similarity=0.077 Sum_probs=35.3 Q ss_pred CEEEEEECCCCCHHHHHHHHHCCCCEEEECCCC-----HHHHHHCCCCEEEEC Q ss_conf 149999679641023222220245413304657-----446441357889985 Q gi|254780420|r 206 YHVVCIDYGIRSNLLRVLSSLGCRITIVGAETS-----YKDIMSLQPDGVLLS 253 (396) Q Consensus 206 ~kIvvIDfGiK~nIlr~L~~~g~~v~Vvp~~~~-----~~~i~~~~pdgi~lS 253 (396) .|+++.++|++.+-++.|.+..+.+++--.|++ ....+.++..+++|. T Consensus 90 ~K~ILY~L~ls~~~i~~L~~~~~n~E~R~Fn~S~YP~yV~nw~~YrFKpLIlA 142 (142) T pfam07801 90 HKYILYSLNLSEAYISKLPKNPKNFEFRVFNTSKYPKYVNNWMEYRFKPLILA 142 (142) T ss_pred CCEEEEECCCCHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHCCCCCEEC T ss_conf 65799846888889987343777448885368778378888987334100309 No 456 >PRK12769 putative oxidoreductase Fe-S binding subunit; Reviewed Probab=20.40 E-value=60 Score=13.29 Aligned_cols=39 Identities=21% Similarity=0.392 Sum_probs=25.9 Q ss_pred HHHHHCCCCE---EEECCCCHHHHHHCCCCEEEECCCCCCCCC Q ss_conf 2222024541---330465744644135788998588888410 Q gi|254780420|r 222 VLSSLGCRIT---IVGAETSYKDIMSLQPDGVLLSNGPGDPAV 261 (396) Q Consensus 222 ~L~~~g~~v~---Vvp~~~~~~~i~~~~pdgi~lSnGPGdP~~ 261 (396) .|.+.|+++. .+--+.+++++++ ++|+|||.-|-+.|.. T Consensus 386 ~l~~lGV~f~~n~~VGkDitl~eL~~-~yDAVfla~Ga~~~r~ 427 (654) T PRK12769 386 IFSAMGIHFELNCEVGKDISLESLLE-DYDAVFVGVGTYRSMK 427 (654) T ss_pred HHHHCCCEEECCCEECCCCCHHHHHH-CCCEEEEECCCCCCCC T ss_conf 99826988983717687658999973-6998999517887866 No 457 >PTZ00254 40S ribosomal protein SA; Provisional Probab=20.35 E-value=60 Score=13.28 Aligned_cols=33 Identities=18% Similarity=0.428 Sum_probs=23.1 Q ss_pred CCCCEEEECCCCCCCCCCCCCCHHHHHHHCCCCCCEEEEEC Q ss_conf 35788998588888410112110122210257863478610 Q gi|254780420|r 245 LQPDGVLLSNGPGDPAVTSVYSSPIICKLVDSGMPMFGICL 285 (396) Q Consensus 245 ~~pdgi~lSnGPGdP~~~~~~~~~~~~~~~~~~iPilGICl 285 (396) ..||.+|++ ||..- ...+++...-+||+.|||= T Consensus 117 ~eP~lliV~----DP~~d----~~AV~EA~~~nIPvIal~D 149 (242) T PTZ00254 117 QEPRLLIVT----DPRTD----HQAIREASYVNIPVIALCD 149 (242) T ss_pred CCCCEEEEE----CCCCC----HHHHHHHHHCCCCEEEEEC T ss_conf 688889996----67544----0899999985998898616 No 458 >cd02069 methionine_synthase_B12_BD B12 binding domain of methionine synthase. This domain binds methylcobalamin, which it uses as an intermediate methyl carrier from methyltetrahydrofolate (CH3H4folate) to homocysteine (Hcy). Probab=20.30 E-value=60 Score=13.27 Aligned_cols=41 Identities=29% Similarity=0.424 Sum_probs=31.7 Q ss_pred ECCCCCHHHH-HHHHHCCCCEEEECCCCHHHHH----HCCCCEEEECC Q ss_conf 6796410232-2222024541330465744644----13578899858 Q gi|254780420|r 212 DYGIRSNLLR-VLSSLGCRITIVGAETSYKDIM----SLQPDGVLLSN 254 (396) Q Consensus 212 DfGiK~nIlr-~L~~~g~~v~Vvp~~~~~~~i~----~~~pdgi~lSn 254 (396) |.| +||+. .|...|++|.=+-.|.+.+.+. +.+||-|-||- T Consensus 102 diG--kniv~~~l~~~Gf~V~dLG~dv~~e~~v~~~~~~~~divglSa 147 (213) T cd02069 102 DIG--KNLVGVILSNNGYEVIDLGVMVPIEKILEAAKEHKADIIGLSG 147 (213) T ss_pred CHH--HHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEC T ss_conf 121--8999999997896688505899899999999974999999832 No 459 >COG2984 ABC-type uncharacterized transport system, periplasmic component [General function prediction only] Probab=20.29 E-value=60 Score=13.27 Aligned_cols=115 Identities=18% Similarity=0.251 Sum_probs=56.7 Q ss_pred CCCCCEE-EEEECCCCCH--HH----HHHHHHCCCCEEEECCCC-----HHHHHHCCCCEEEECCCCCCCCCCCCCCHHH Q ss_conf 7777149-9996796410--23----222220245413304657-----4464413578899858888841011211012 Q gi|254780420|r 202 SDAKYHV-VCIDYGIRSN--LL----RVLSSLGCRITIVGAETS-----YKDIMSLQPDGVLLSNGPGDPAVTSVYSSPI 269 (396) Q Consensus 202 ~~~~~kI-vvIDfGiK~n--Il----r~L~~~g~~v~Vvp~~~~-----~~~i~~~~pdgi~lSnGPGdP~~~~~~~~~~ 269 (396) ++.+ +| +++-=|--++ .+ +.+.+.|.+|+......+ +-..+.-++|.|++ |-|...... ..-+ T Consensus 157 Pnak-~Igv~Y~p~E~ns~~l~eelk~~A~~~Gl~vve~~v~~~ndi~~a~~~l~g~~d~i~~---p~dn~i~s~-~~~l 231 (322) T COG2984 157 PNAK-SIGVLYNPGEANSVSLVEELKKEARKAGLEVVEAAVTSVNDIPRAVQALLGKVDVIYI---PTDNLIVSA-IESL 231 (322) T ss_pred CCCE-EEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHCCCCCEEEE---ECCHHHHHH-HHHH T ss_conf 8870-6999957988660899999999998779889998347632008999973478767998---660677888-9999 Q ss_pred HHHHCCCCCCEEEEE-----------CHHH--HHHHHCCCCCCCCCCCCCCCCCCCEECCCCEEE Q ss_conf 221025786347861-----------0058--878752854466776752101123477897099 Q gi|254780420|r 270 ICKLVDSGMPMFGIC-----------LGHQ--LLGLALGAQTVKMYQGHHGVNHPVKNISNGTVE 321 (396) Q Consensus 270 ~~~~~~~~iPilGIC-----------lGhQ--~l~~a~G~~~~kl~~GhrG~N~Pv~~~~t~kv~ 321 (396) +....+.++|+++=- +|.+ -++..-|.-+.|+-.|.+-.++|+......+.. T Consensus 232 ~~~a~~~kiPli~sd~~~V~~Ga~aA~gvdy~~~G~qtg~~v~~ILkG~~p~dip~~~~~~~~~~ 296 (322) T COG2984 232 LQVANKAKIPLIASDTSSVKEGALAALGVDYKDLGKQTGEMVVKILKGKKPKDIPVSVAAGFKLV 296 (322) T ss_pred HHHHHHHCCCEECCCHHHHHCCCCEEECCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCEEE T ss_conf 99988708973547778876686036606799988999999999976999555685114664178 No 460 >KOG1252 consensus Probab=20.29 E-value=45 Score=14.16 Aligned_cols=60 Identities=20% Similarity=0.461 Sum_probs=28.4 Q ss_pred HHHHHHCCCCEEEECCC---C-------HHHHHHCCCCEEEEC--CCCCCCCCCCCCCHHHHHHHCCCCCCE Q ss_conf 22222024541330465---7-------446441357889985--888884101121101222102578634 Q gi|254780420|r 221 RVLSSLGCRITIVGAET---S-------YKDIMSLQPDGVLLS--NGPGDPAVTSVYSSPIICKLVDSGMPM 280 (396) Q Consensus 221 r~L~~~g~~v~Vvp~~~---~-------~~~i~~~~pdgi~lS--nGPGdP~~~~~~~~~~~~~~~~~~iPi 280 (396) ..|+.+|.+|+..|... . ++++++.-|++++|- .-||||..-.....+.+.+-+.+++-| T Consensus 144 ~~l~a~Gaeii~tp~a~~~~~~e~ai~~a~~l~~~~pna~~l~Qf~np~Np~~hy~ttg~EI~~q~~g~vDi 215 (362) T KOG1252 144 ILLRALGAEIILTPPAAGMKGPESAIGKAEELLNKTPNAYILDQFHNPGNPLAHYETTGPEIWRQLDGKVDI 215 (362) T ss_pred HHHHHCCCEEEECCHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCCCCCCCCHHHHHHHCCCCCE T ss_conf 999971887995686872477188999999999868873878874288885301256428999982699778 No 461 >pfam06520 consensus Probab=20.22 E-value=57 Score=13.47 Aligned_cols=70 Identities=11% Similarity=0.209 Sum_probs=36.2 Q ss_pred CEEEEEEEEECCCCCCCCCCCCCCCCCCEEEECCCCCCC-CCCCHHHHCCCCCCCCCEEEEEEEECCCCCCCCCCCCCCH Q ss_conf 248888985378888760141614256173216897368-4588234204653332102366640465443455241798 Q gi|254780420|r 33 GSIQAEICFNTSLTGYQEILTDPSYLGQIVNFTFPHIGN-VGVNSEDFESISRKNFKGAVGLVIKAEITEPSNYRANMHF 111 (396) Q Consensus 33 ~~~~GEvVFnT~mtGYqE~lTDPSY~gQIlvfT~P~IGN-yGvn~~d~ES~~~~~~~~~~g~Iv~~~~~~~Sn~rs~~sL 111 (396) ..-.|.++|+++.++=-+++-+-+=.. |-|+|- ++.|.++ .+|-|... | -....| T Consensus 8 ~~q~GDIIFq~~~s~~~~aI~~AT~S~------~SHvGIIv~~~g~~--------------~~V~EA~~-p---V~~TpL 63 (186) T pfam06520 8 QYETGDIIFQCIGSAQFKAISIATRCW------YNHVGIIIGHNGKD--------------YLVAEAVV-P---VKITTL 63 (186) T ss_pred CCCCCCEEEEECCCCHHHHHHHHHCCC------CCCEEEEEEECCCE--------------EEEEEECC-C---EEEEEH T ss_conf 676678999988960879999985898------76248999877966--------------89998238-6---434279 Q ss_pred HHHHHH--CCCCEECCC Q ss_conf 999974--278321125 Q gi|254780420|r 112 DAWLKS--CGIIGLSGI 126 (396) Q Consensus 112 ~~wL~~--~~IpgI~GI 126 (396) ++|+++ ++--.+.-+ T Consensus 64 ~~fi~Rg~~~~~~V~RL 80 (186) T pfam06520 64 STFISRGKNQRYVIKRL 80 (186) T ss_pred HHHHHCCCCCCEEEEEE T ss_conf 99985387884799980 No 462 >TIGR01992 PTS-IIBC-Tre PTS system, trehalose-specific IIBC component; InterPro: IPR011296 This entry represents the fused enzyme II B and C components of the trehalose-specific PTS sugar transporter system . Trehalose is converted to trehalose-6-phosphate in the process of translocation into the cell. These transporters lack their own IIA domains and instead use the glucose IIA protein (IIAglc or Crr) . The exceptions to this rule are Staphylococci and Streptococci which contain their own A domain as a C-terminal fusion. This family is closely related to the sucrose transporting PTS IIBC enzymes described by the IIB component and the IIC component (IPR004719 from INTERPRO), respectively. In Escherichia coli, Bacillus subtilis and Pseudomonas fluorescens the presence of this gene is associated with the presence of trehalase which degrades T6P to glucose and glucose-6-P. Trehalose may also be transported (in Salmonella) via the mannose PTS or galactose permease systems , or (in Sinorhizobium, Thermococcus and Sulfolobus, for instance) by ABC transporters , , .; GO: 0015574 trehalose transmembrane transporter activity, 0015771 trehalose transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=20.19 E-value=25 Score=15.93 Aligned_cols=34 Identities=35% Similarity=0.531 Sum_probs=19.9 Q ss_pred CCCCEEEEECHH-----HHHHHH-CCCCCCCCCCCCCCCCCCCEE Q ss_conf 786347861005-----887875-285446677675210112347 Q gi|254780420|r 276 SGMPMFGICLGH-----QLLGLA-LGAQTVKMYQGHHGVNHPVKN 314 (396) Q Consensus 276 ~~iPilGIClGh-----Q~l~~a-~G~~~~kl~~GhrG~N~Pv~~ 314 (396) +.-||||||||- |||.-. +|.+.-.++ .|-||=| T Consensus 198 G~~~iLGIVLGl~LVhPqLlNAy~~g~~~~~i~-----g~iPvWd 237 (489) T TIGR01992 198 GGSQILGIVLGLMLVHPQLLNAYAVGTALAEIK-----GNIPVWD 237 (489) T ss_pred CCCCHHHHHHHHHHCCHHHHHHHHHHHHCCCCC-----CCCCCEE T ss_conf 787136777767740776875888743202567-----8876120 No 463 >PRK10953 cysJ sulfite reductase subunit alpha; Provisional Probab=20.16 E-value=61 Score=13.25 Aligned_cols=37 Identities=11% Similarity=0.090 Sum_probs=28.8 Q ss_pred CCCCHHHHHHHCCCCEECCCC-----------------HHHHHHHHHHCCCCCE Q ss_conf 417989999742783211257-----------------6899999985398423 Q gi|254780420|r 107 ANMHFDAWLKSCGIIGLSGID-----------------TRALTVWIRDYGVSNS 143 (396) Q Consensus 107 s~~sL~~wL~~~~IpgI~GID-----------------TRaLTk~IRe~G~~~a 143 (396) .-..|-+||...+.|-+.|+- -|.|-++|-+.|+.+- T Consensus 127 Na~~F~~~L~~~~~p~L~~l~faVfGLGDssY~~Fc~~gk~ld~rL~~LGA~rl 180 (599) T PRK10953 127 EAVALHKFLFSKKAPKLENTAFAVFGLGDTSYEFFCQSGKDFDSKLAELGGERL 180 (599) T ss_pred CHHHHHHHHHHCCCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 299999998608786568964899706875489999999999999997699723 No 464 >pfam03446 NAD_binding_2 NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossmann fold. Probab=20.10 E-value=36 Score=14.84 Aligned_cols=27 Identities=22% Similarity=0.305 Sum_probs=14.5 Q ss_pred EEEEEECC-CCCHHHHHHHHHCCCCEEE Q ss_conf 49999679-6410232222202454133 Q gi|254780420|r 207 HVVCIDYG-IRSNLLRVLSSLGCRITIV 233 (396) Q Consensus 207 kIvvIDfG-iK~nIlr~L~~~g~~v~Vv 233 (396) +|.+|=+| +=..|.++|.+.|++|.++ T Consensus 3 ~Ig~IGlG~MG~~ma~~L~~~g~~v~v~ 30 (163) T pfam03446 3 KIGFIGLGVMGSPMALNLLKAGYTVTVY 30 (163) T ss_pred EEEEEEEHHHHHHHHHHHHHCCCEEEEE T ss_conf 8999836798999999999779969999 No 465 >TIGR00551 nadB L-aspartate oxidase; InterPro: IPR005288 L-aspartate oxidase is the B protein, NadB, of the quinolinate synthetase complex. Quinolinate synthetase makes a precursor of the pyridine nucleotide portion of NAD. This model identifies proteins that cluster as L-aspartate oxidase (a flavoprotein difficult to separate from the set of closely related flavoprotein subunits of succinate dehydrogenase and fumarate reductase) by both UPGMA and neighbour-joining trees.; GO: 0008734 L-aspartate oxidase activity, 0009435 NAD biosynthetic process, 0005737 cytoplasm. Probab=20.01 E-value=33 Score=15.09 Aligned_cols=30 Identities=20% Similarity=0.249 Sum_probs=19.0 Q ss_pred EEECCCCEEEECCCCCCCCCCCC--HHHHHHH Q ss_conf 68679977898348788776700--1689999 Q gi|254780420|r 355 FRLIDQPVFSVQYHPESSPGPQD--SYYLFST 384 (396) Q Consensus 355 i~~~~~~~~~vQfHPEa~PGP~D--~~~~F~~ 384 (396) +.-++-.-|.-.|||-.-=-|+| ++=+|.+ T Consensus 269 Lvd~~G~Rf~~~~hp~GeLAPRDiVaRAI~~~ 300 (546) T TIGR00551 269 LVDRDGKRFMADVHPRGELAPRDIVARAIDEE 300 (546) T ss_pred EECCCCCEECCCCCCCCCCCCHHHHHHHHHHH T ss_conf 35588867026778888767404889999999 No 466 >PRK05568 flavodoxin; Provisional Probab=20.00 E-value=61 Score=13.23 Aligned_cols=33 Identities=21% Similarity=0.262 Sum_probs=26.5 Q ss_pred HHHHHHHHHCCCCEEEEC-CCCHHHHHHCCCCEEEE Q ss_conf 023222220245413304-65744644135788998 Q gi|254780420|r 218 NLLRVLSSLGCRITIVGA-ETSYKDIMSLQPDGVLL 252 (396) Q Consensus 218 nIlr~L~~~g~~v~Vvp~-~~~~~~i~~~~pdgi~l 252 (396) .|..-|.+.|++|.++.. +++.++|+ +.|+|+| T Consensus 21 ~Iaegl~~~Gv~V~~~~~~~~~~~di~--~~d~i~~ 54 (142) T PRK05568 21 LIAEGAKENGLEVKLLNVSEASVDDVE--NADVVAL 54 (142) T ss_pred HHHHHHHHCCCEEEEEEECCCCHHHHH--HCCEEEE T ss_conf 999999866983999980049988997--3996899 Done!