Query gi|254780200|ref|YP_003064613.1| ferrochelatase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 343 No_of_seqs 129 out of 1620 Neff 6.6 Searched_HMMs 39220 Date Mon May 23 12:01:40 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780200.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00035 hemH ferrochelatase; 100.0 0 0 786.5 34.3 323 14-343 1-323 (325) 2 COG0276 HemH Protoheme ferro-l 100.0 0 0 742.5 33.3 319 15-341 2-320 (320) 3 pfam00762 Ferrochelatase Ferro 100.0 0 0 733.7 33.3 311 18-339 1-312 (312) 4 TIGR00109 hemH ferrochelatase; 100.0 0 0 734.7 30.4 325 14-340 1-339 (339) 5 PRK12435 ferrochelatase; Provi 100.0 0 0 665.4 31.8 303 15-343 2-310 (316) 6 KOG1321 consensus 100.0 0 0 623.5 27.3 322 12-341 32-362 (395) 7 cd03411 Ferrochelatase_N Ferro 100.0 1.4E-45 0 338.0 14.4 158 19-183 1-159 (159) 8 cd00419 Ferrochelatase_C Ferro 100.0 3.9E-43 0 320.9 14.4 135 188-322 1-135 (135) 9 cd03409 Chelatase_Class_II Cla 99.3 9.4E-12 2.4E-16 102.4 7.0 87 208-308 2-88 (101) 10 pfam06180 CbiK Cobalt chelatas 99.1 7.8E-07 2E-11 68.2 23.3 193 122-339 57-255 (256) 11 COG2138 Sirohydrochlorin ferro 98.6 4.2E-05 1.1E-09 56.1 19.7 226 87-341 10-242 (245) 12 PRK00923 sirohydrochlorin coba 98.5 2.4E-06 6.2E-11 64.8 10.1 83 112-198 36-130 (130) 13 COG0276 HemH Protoheme ferro-l 98.3 0.0001 2.6E-09 53.5 14.9 156 157-341 18-179 (320) 14 PRK02395 hypothetical protein; 98.2 3E-05 7.7E-10 57.2 10.6 205 113-338 37-260 (284) 15 cd03414 CbiX_SirB_C Sirohydroc 98.2 0.00016 4.1E-09 52.1 13.9 115 207-339 2-117 (117) 16 PRK00923 sirohydrochlorin coba 98.1 0.0002 5.1E-09 51.5 13.4 119 206-339 4-130 (130) 17 PRK12435 ferrochelatase; Provi 98.1 0.0002 5.1E-09 51.5 13.1 24 177-200 285-308 (316) 18 cd03409 Chelatase_Class_II Cla 98.1 2E-05 5.2E-10 58.4 7.9 55 91-148 15-70 (101) 19 PRK00035 hemH ferrochelatase; 97.9 0.00047 1.2E-08 48.9 12.7 77 121-200 242-321 (325) 20 pfam00762 Ferrochelatase Ferro 97.9 0.00045 1.1E-08 49.0 12.0 109 224-340 63-174 (312) 21 cd03414 CbiX_SirB_C Sirohydroc 97.9 0.00019 4.8E-09 51.6 9.5 92 99-198 23-117 (117) 22 COG4822 CbiK Cobalamin biosynt 97.8 0.0041 1.1E-07 42.3 23.4 205 102-339 47-257 (265) 23 cd03415 CbiX_CbiC Archaeal sir 97.5 0.0014 3.6E-08 45.6 9.3 114 207-337 2-124 (125) 24 pfam01903 CbiX CbiX. The funct 97.5 0.0023 5.8E-08 44.1 10.3 97 227-332 9-105 (106) 25 pfam01903 CbiX CbiX. The funct 97.4 0.0012 3.2E-08 45.9 8.3 91 101-192 14-106 (106) 26 TIGR00109 hemH ferrochelatase; 97.1 0.01 2.6E-07 39.6 10.5 160 157-340 20-190 (339) 27 cd03415 CbiX_CbiC Archaeal sir 97.1 0.00082 2.1E-08 47.2 4.4 93 97-197 21-125 (125) 28 PRK05782 precorrin-8X methylmu 96.9 0.021 5.2E-07 37.5 10.4 98 96-201 23-132 (332) 29 PRK05782 precorrin-8X methylmu 96.8 0.057 1.4E-06 34.4 12.2 59 264-331 263-323 (332) 30 pfam06180 CbiK Cobalt chelatas 96.8 0.0087 2.2E-07 40.1 7.9 32 19-50 1-36 (256) 31 cd03416 CbiX_SirB_N Sirohydroc 96.8 0.04 1E-06 35.5 10.9 57 264-321 44-100 (101) 32 KOG1321 consensus 96.4 0.056 1.4E-06 34.4 9.7 108 89-202 246-364 (395) 33 cd03416 CbiX_SirB_N Sirohydroc 96.2 0.037 9.4E-07 35.7 8.0 50 98-150 21-71 (101) 34 COG2138 Sirohydrochlorin ferro 96.1 0.072 1.8E-06 33.6 8.8 24 18-41 2-25 (245) 35 cd03411 Ferrochelatase_N Ferro 95.9 0.18 4.7E-06 30.8 10.3 95 223-324 63-159 (159) 36 cd03413 CbiK_C Anaerobic cobal 94.9 0.038 9.6E-07 35.6 4.1 44 102-145 21-64 (103) 37 cd03412 CbiK_N Anaerobic cobal 94.4 0.5 1.3E-05 27.8 10.4 43 122-169 55-97 (127) 38 cd03412 CbiK_N Anaerobic cobal 94.1 0.59 1.5E-05 27.3 9.2 107 229-339 14-127 (127) 39 PRK13129 consensus 84.6 3.6 9.2E-05 21.8 11.0 215 18-289 18-239 (267) 40 TIGR01371 met_syn_B12ind 5-met 81.4 4.7 0.00012 21.0 7.0 141 118-293 427-591 (778) 41 cd06811 PLPDE_III_yhfX_like Ty 81.2 4.8 0.00012 21.0 6.6 44 96-144 39-85 (382) 42 PRK09545 znuA high-affinity zi 79.4 5.5 0.00014 20.6 12.4 211 95-337 62-302 (308) 43 PRK08176 pdxK pyridoxal kinase 78.3 5.9 0.00015 20.4 10.4 197 119-335 26-243 (281) 44 PRK09613 thiH thiamine biosynt 78.0 6 0.00015 20.3 5.9 61 265-335 367-427 (471) 45 PRK13609 diacylglycerol glucos 74.2 7.5 0.00019 19.6 9.2 103 171-285 128-237 (388) 46 KOG0159 consensus 72.9 2.3 5.8E-05 23.2 1.9 27 254-289 435-461 (519) 47 PRK05756 pyridoxamine kinase; 72.1 8.4 0.00021 19.3 11.6 195 120-335 14-233 (287) 48 CHL00194 ycf39 Ycf39; Provisio 70.8 8.9 0.00023 19.1 10.1 141 127-288 92-234 (319) 49 TIGR02684 dnstrm_HI1420 probab 70.3 9.1 0.00023 19.0 4.9 63 33-104 13-86 (91) 50 cd01173 pyridoxal_pyridoxamine 68.3 10 0.00026 18.7 10.3 197 120-336 11-229 (254) 51 cd07993 LPLAT_DHAPAT-like Lyso 68.1 10 0.00026 18.7 4.4 62 225-293 85-153 (205) 52 PRK13139 consensus 67.5 10 0.00027 18.6 10.7 227 20-308 18-250 (254) 53 COG4750 LicC CTP:phosphocholin 66.2 6.4 0.00016 20.1 3.0 71 85-181 8-78 (231) 54 COG1312 UxuA D-mannonate dehyd 65.7 11 0.00029 18.4 4.2 76 232-311 230-311 (362) 55 pfam04273 DUF442 Putative phos 65.6 11 0.00029 18.4 7.5 65 268-337 18-84 (110) 56 cd04182 GT_2_like_f GT_2_like_ 65.4 11 0.00029 18.4 5.8 24 120-143 24-47 (186) 57 PRK01506 consensus 64.4 12 0.0003 18.2 11.8 163 128-320 79-278 (317) 58 PRK04117 consensus 63.5 12 0.00031 18.1 14.9 164 129-321 74-273 (309) 59 PRK06765 homoserine O-acetyltr 63.2 12 0.00032 18.1 6.3 66 265-337 308-377 (387) 60 KOG3062 consensus 63.0 7.3 0.00019 19.7 2.8 52 98-150 63-114 (281) 61 PRK01999 consensus 62.6 13 0.00032 18.0 13.3 162 129-321 75-274 (311) 62 COG0761 lytB 4-Hydroxy-3-methy 62.6 13 0.00033 18.0 6.0 46 268-323 204-249 (294) 63 COG1453 Predicted oxidoreducta 61.1 14 0.00035 17.8 10.1 114 99-220 37-160 (391) 64 PRK06849 hypothetical protein; 60.0 14 0.00036 17.7 5.4 17 228-244 236-252 (387) 65 TIGR00677 fadh2_euk methylenet 59.9 14 0.00036 17.7 4.7 53 86-143 45-100 (312) 66 PRK00077 eno phosphopyruvate h 59.7 14 0.00036 17.7 5.7 186 136-333 161-380 (427) 67 TIGR00249 sixA phosphohistidin 59.6 14 0.00037 17.7 4.4 52 231-285 64-117 (169) 68 cd01020 TroA_b Metal binding p 57.3 16 0.0004 17.4 11.5 143 177-337 94-258 (264) 69 TIGR00110 ilvD dihydroxy-acid 56.4 14 0.00035 17.8 3.3 56 89-144 31-104 (601) 70 PRK00175 metX homoserine O-ace 56.0 16 0.00042 17.2 5.8 68 265-339 299-370 (379) 71 PRK13137 consensus 55.9 16 0.00042 17.2 10.6 220 20-297 27-251 (266) 72 TIGR02720 pyruv_oxi_spxB pyruv 55.3 14 0.00036 17.7 3.2 56 128-198 5-60 (577) 73 TIGR00216 ispH_lytB 4-hydroxy- 54.5 17 0.00044 17.1 4.3 19 20-43 87-105 (354) 74 PRK13135 consensus 54.3 17 0.00044 17.1 10.6 218 18-294 17-240 (267) 75 cd01979 Pchlide_reductase_N Pc 54.2 17 0.00045 17.1 11.6 58 116-178 66-123 (396) 76 KOG4357 consensus 53.4 13 0.00032 18.1 2.7 29 15-43 62-90 (164) 77 KOG1322 consensus 53.3 18 0.00046 17.0 6.7 88 113-213 34-121 (371) 78 PRK12677 xylose isomerase; Pro 51.7 19 0.00049 16.8 4.3 17 318-334 340-356 (389) 79 PRK13121 consensus 51.5 19 0.00049 16.8 9.8 189 20-252 19-212 (265) 80 TIGR00538 hemN oxygen-independ 50.6 20 0.0005 16.7 3.3 191 86-341 111-313 (462) 81 COG1213 Predicted sugar nucleo 50.1 16 0.0004 17.4 2.7 30 111-143 23-52 (239) 82 pfam06888 Put_Phosphatase Puta 50.0 20 0.00052 16.6 3.4 21 122-143 73-95 (234) 83 pfam05013 FGase N-formylglutam 49.7 20 0.00052 16.6 5.4 31 185-218 113-143 (221) 84 PRK13113 consensus 49.2 21 0.00053 16.5 10.4 191 18-252 17-212 (263) 85 TIGR00676 fadh2 5,10-methylene 49.1 21 0.00053 16.5 4.2 45 95-143 61-108 (302) 86 PRK08332 ribonucleotide-diphos 48.9 21 0.00054 16.5 6.1 86 207-293 1368-1482(1748) 87 TIGR03599 YloV DAK2 domain fus 48.4 21 0.00055 16.4 10.8 53 260-322 343-395 (530) 88 pfam06685 DUF1186 Protein of u 47.8 22 0.00056 16.4 4.9 77 220-306 141-219 (249) 89 cd06594 GH31_glucosidase_YihQ 47.5 22 0.00056 16.4 4.9 74 122-202 70-156 (317) 90 COG1608 Predicted archaeal kin 47.2 13 0.00032 18.1 1.9 92 112-212 73-169 (252) 91 pfam05378 Hydant_A_N Hydantoin 47.1 22 0.00057 16.3 4.3 47 259-311 124-175 (175) 92 PRK12360 4-hydroxy-3-methylbut 46.9 22 0.00057 16.3 5.9 28 270-299 203-230 (281) 93 PRK13575 3-dehydroquinate dehy 46.2 23 0.00059 16.2 6.8 46 90-135 12-59 (238) 94 TIGR01137 cysta_beta cystathio 46.2 15 0.00037 17.6 2.1 36 9-53 295-330 (527) 95 PRK13116 consensus 45.6 24 0.0006 16.2 10.5 248 20-337 19-272 (278) 96 TIGR00284 TIGR00284 dihydropte 44.8 24 0.00062 16.1 6.9 163 15-197 185-368 (529) 97 TIGR02134 transald_staph trans 44.4 13 0.00033 18.0 1.6 12 266-277 199-211 (237) 98 TIGR00689 rpiB_lacA_lacB sugar 44.3 25 0.00063 16.0 3.8 18 223-240 38-55 (146) 99 TIGR01258 pgm_1 phosphoglycera 44.1 17 0.00043 17.2 2.2 14 154-167 183-196 (248) 100 PRK10714 undecaprenyl phosphat 43.2 25 0.00065 15.9 5.5 25 118-143 101-125 (324) 101 PRK06129 3-hydroxyacyl-CoA deh 42.1 26 0.00068 15.8 3.0 34 258-291 155-191 (308) 102 PRK13118 consensus 41.8 27 0.00068 15.8 10.1 222 17-295 15-242 (269) 103 TIGR01793 cit_synth_euk citrat 41.6 27 0.00069 15.8 5.9 23 86-112 96-118 (430) 104 pfam08645 PNK3P Polynucleotide 41.6 27 0.00069 15.8 6.6 24 119-143 28-51 (158) 105 cd06422 NTP_transferase_like_1 41.3 27 0.00069 15.7 5.2 48 120-177 28-75 (221) 106 pfam02543 CmcH_NodU Carbamoylt 40.9 18 0.00046 16.9 1.9 24 233-258 253-278 (360) 107 PRK00881 purH bifunctional pho 40.8 28 0.00071 15.7 3.6 41 254-296 302-343 (514) 108 cd01310 TatD_DNAse TatD like p 40.2 28 0.00072 15.6 12.4 137 121-285 13-152 (251) 109 TIGR01481 ccpA catabolite cont 40.1 16 0.00041 17.3 1.6 202 68-313 45-264 (332) 110 pfam02401 LYTB LytB protein. T 39.1 29 0.00075 15.5 6.0 30 268-299 200-229 (280) 111 PRK07105 pyridoxamine kinase; 39.0 29 0.00075 15.5 8.6 192 120-335 16-232 (284) 112 PRK09432 metF 5,10-methylenete 38.1 30 0.00077 15.4 3.8 53 87-144 63-118 (296) 113 TIGR01415 trpB_rel pyridoxal-p 38.1 30 0.00077 15.4 5.8 123 71-202 56-215 (426) 114 COG2240 PdxK Pyridoxal/pyridox 37.6 31 0.00079 15.3 9.8 200 121-335 13-227 (281) 115 TIGR02767 TraG-Ti Ti-type conj 37.2 31 0.0008 15.3 2.6 133 67-214 193-350 (641) 116 TIGR00962 atpA ATP synthase F1 36.8 32 0.00081 15.3 5.4 91 117-224 240-343 (520) 117 TIGR02759 TraD_Ftype type IV c 36.7 32 0.00081 15.2 6.2 131 36-185 98-257 (613) 118 pfam08029 HisG_C HisG, C-termi 36.7 32 0.00081 15.2 3.1 23 123-145 48-70 (73) 119 PRK04781 histidinol-phosphate 36.3 32 0.00082 15.2 7.4 14 323-336 347-360 (362) 120 TIGR02843 CyoB cytochrome o ub 36.1 24 0.00061 16.1 1.9 31 185-215 71-107 (646) 121 PTZ00081 enolase (2-phospho-D- 35.7 33 0.00084 15.1 5.4 182 136-333 171-395 (442) 122 pfam08981 consensus 35.7 33 0.00084 15.1 6.0 144 123-291 12-168 (181) 123 COG0803 LraI ABC-type metal io 35.3 34 0.00085 15.1 9.6 143 176-338 131-297 (303) 124 cd01465 vWA_subgroup VWA subgr 35.2 31 0.00079 15.3 2.4 38 279-323 128-165 (170) 125 cd06431 GT8_LARGE_C LARGE cata 34.5 24 0.00062 16.1 1.7 36 129-169 90-125 (280) 126 KOG0467 consensus 34.4 35 0.00088 15.0 7.0 49 225-277 202-250 (887) 127 pfam07302 AroM AroM protein. T 33.8 33 0.00085 15.1 2.3 187 25-295 6-196 (221) 128 COG0406 phoE Broad specificity 33.5 36 0.00091 14.9 9.5 122 88-220 26-160 (208) 129 PRK12391 tryptophan synthase s 33.1 36 0.00092 14.9 8.7 96 27-153 27-134 (419) 130 PRK10481 hypothetical protein; 32.8 36 0.00092 14.9 2.4 193 17-294 3-198 (224) 131 TIGR01211 ELP3 histone acetylt 32.8 32 0.0008 15.3 2.1 166 125-314 249-457 (573) 132 cd01841 NnaC_like NnaC (CMP-Ne 32.7 37 0.00094 14.8 7.2 23 128-150 80-103 (174) 133 COG1250 FadB 3-hydroxyacyl-CoA 32.3 35 0.0009 14.9 2.3 35 258-293 155-192 (307) 134 COG0434 SgcQ Predicted TIM-bar 32.1 37 0.00096 14.7 3.1 34 179-213 120-153 (263) 135 TIGR01916 F420_puta_dedox F420 32.0 32 0.00081 15.3 2.0 55 188-242 137-210 (260) 136 TIGR03455 HisG_C-term ATP phos 32.0 38 0.00096 14.7 2.9 23 265-287 75-97 (100) 137 KOG1461 consensus 31.8 38 0.00097 14.7 3.1 21 123-143 56-76 (673) 138 pfam09379 FERM_N FERM N-termin 31.6 38 0.00097 14.7 3.7 36 231-266 20-56 (79) 139 PRK00553 ribose-phosphate pyro 31.4 39 0.00098 14.7 11.0 64 232-298 189-276 (340) 140 COG3739 Uncharacterized integr 31.4 19 0.00049 16.8 0.8 33 134-166 100-132 (263) 141 cd06592 GH31_glucosidase_KIAA1 31.0 39 0.001 14.6 11.2 99 94-200 27-153 (303) 142 pfam07429 Fuc4NAc_transf 4-alp 30.9 39 0.001 14.6 5.4 26 52-83 87-112 (361) 143 PRK13122 consensus 30.6 40 0.001 14.6 9.9 212 16-288 1-217 (242) 144 TIGR02079 THD1 threonine dehyd 30.6 40 0.001 14.6 3.3 56 154-210 120-177 (415) 145 TIGR01422 phosphonatase phosph 30.4 40 0.001 14.6 8.1 121 55-236 45-167 (199) 146 pfam07796 DUF1638 Protein of u 30.3 40 0.001 14.5 4.3 131 96-244 12-148 (166) 147 pfam05675 DUF817 Protein of un 30.2 40 0.001 14.5 2.4 33 134-166 89-121 (235) 148 pfam03786 UxuA D-mannonate deh 30.1 40 0.001 14.5 4.5 93 211-317 212-312 (350) 149 TIGR02810 agaZ_gatZ D-tagatose 30.1 40 0.001 14.5 3.7 257 9-295 20-396 (430) 150 pfam01888 CbiD CbiD. CbiD is e 29.7 41 0.001 14.5 6.0 77 177-285 174-250 (261) 151 cd01714 ETF_beta The electron 29.7 41 0.001 14.5 6.3 51 89-144 34-84 (202) 152 TIGR03624 conserved hypothetic 29.6 11 0.00029 18.3 -0.6 48 229-276 136-190 (345) 153 cd06425 M1P_guanylylT_B_like_N 29.5 41 0.0011 14.5 3.7 24 120-143 29-52 (233) 154 PRK10426 alpha-glucosidase; Pr 29.5 41 0.0011 14.4 7.0 202 122-336 313-563 (682) 155 pfam02571 CbiJ Precorrin-6x re 29.4 41 0.0011 14.4 4.9 48 125-179 52-100 (246) 156 PRK13112 consensus 29.4 42 0.0011 14.4 10.2 221 17-294 16-242 (279) 157 PRK09029 O-succinylbenzoic aci 29.3 42 0.0011 14.4 2.7 35 249-285 346-386 (458) 158 cd03013 PRX5_like Peroxiredoxi 29.2 42 0.0011 14.4 3.1 22 125-146 53-74 (155) 159 COG1489 SfsA DNA-binding prote 29.2 42 0.0011 14.4 2.8 17 178-194 186-202 (235) 160 PRK13117 consensus 28.9 42 0.0011 14.4 11.1 221 17-294 15-242 (268) 161 KOG3112 consensus 28.8 42 0.0011 14.4 2.7 29 116-144 93-121 (262) 162 PRK09932 glycerate kinase II; 28.5 43 0.0011 14.3 2.4 63 230-300 265-337 (381) 163 smart00855 PGAM Phosphoglycera 28.5 43 0.0011 14.3 6.9 37 88-126 23-59 (155) 164 PTZ00110 helicase; Provisional 28.5 43 0.0011 14.3 5.8 60 93-163 262-322 (602) 165 KOG0488 consensus 28.4 11 0.00028 18.5 -0.9 67 208-279 177-244 (309) 166 KOG0884 consensus 28.3 43 0.0011 14.3 3.3 15 286-300 118-132 (161) 167 TIGR02494 PFLE_PFLC glycyl-rad 28.3 43 0.0011 14.3 8.3 128 77-217 125-270 (305) 168 PRK07238 bifunctional RNase H/ 28.3 43 0.0011 14.3 10.1 97 228-332 231-331 (375) 169 COG1208 GCD1 Nucleoside-diphos 28.2 43 0.0011 14.3 2.8 23 121-143 31-53 (358) 170 PRK00050 mraW S-adenosyl-methy 28.1 44 0.0011 14.3 4.0 17 206-222 235-251 (309) 171 cd01019 ZnuA Zinc binding prot 28.0 44 0.0011 14.3 11.8 215 95-337 43-281 (286) 172 TIGR01125 TIGR01125 MiaB-like 27.9 44 0.0011 14.3 6.1 143 34-193 103-280 (475) 173 cd01994 Alpha_ANH_like_IV This 27.7 44 0.0011 14.2 11.9 127 129-284 16-144 (194) 174 PRK09627 oorA 2-oxoglutarate-a 27.5 45 0.0011 14.2 6.9 35 303-337 340-374 (375) 175 TIGR03571 lucif_BA3436 lucifer 27.5 45 0.0011 14.2 3.8 36 264-299 255-290 (298) 176 COG0159 TrpA Tryptophan syntha 27.5 45 0.0011 14.2 10.5 190 19-252 18-212 (265) 177 pfam12098 DUF3574 Protein of u 27.3 27 0.0007 15.7 1.0 65 250-321 38-102 (104) 178 PRK09997 hydroxypyruvate isome 27.1 45 0.0012 14.2 7.0 85 125-210 87-177 (258) 179 PRK08208 coproporphyrinogen II 26.7 46 0.0012 14.1 13.1 117 85-208 103-231 (436) 180 TIGR00423 TIGR00423 conserved 26.6 46 0.0012 14.1 5.0 43 96-138 77-128 (331) 181 TIGR00588 ogg 8-oxoguanine DNA 26.5 30 0.00077 15.4 1.1 60 223-293 229-290 (379) 182 PRK09533 bifunctional transald 26.2 47 0.0012 14.1 6.5 66 118-193 436-503 (950) 183 pfam05990 DUF900 Alpha/beta hy 26.1 47 0.0012 14.0 11.9 24 120-143 75-99 (230) 184 COG4555 NatA ABC-type Na+ tran 26.0 47 0.0012 14.0 2.7 26 193-218 171-196 (245) 185 KOG3182 consensus 26.0 47 0.0012 14.0 3.5 12 229-240 134-145 (212) 186 pfam01795 Methyltransf_5 MraW 25.8 48 0.0012 14.0 4.3 17 206-222 236-252 (310) 187 PRK00347 sugar fermentation st 25.7 48 0.0012 14.0 3.9 23 256-278 187-209 (234) 188 COG3384 Aromatic ring-opening 25.7 48 0.0012 14.0 4.6 14 19-32 9-22 (268) 189 TIGR02207 lipid_A_htrB lipid A 25.6 48 0.0012 14.0 2.5 11 207-217 124-134 (308) 190 TIGR01494 ATPase_P-type ATPase 25.6 48 0.0012 14.0 5.9 114 180-300 238-360 (478) 191 TIGR02208 lipid_A_msbB lipid A 25.6 48 0.0012 14.0 2.4 18 127-144 114-131 (307) 192 PRK00025 lpxB lipid-A-disaccha 25.4 49 0.0012 14.0 6.7 16 322-337 365-380 (382) 193 cd01451 vWA_Magnesium_chelatas 25.0 49 0.0013 13.9 5.3 53 270-336 125-177 (178) 194 pfam06821 DUF1234 Alpha/Beta h 24.9 50 0.0013 13.9 3.7 34 181-218 34-67 (171) 195 TIGR03099 dCO2ase_PEP1 pyridox 24.9 45 0.0012 14.2 1.7 43 87-131 23-70 (398) 196 pfam12309 KBP_C KIF-1 binding 24.8 50 0.0013 13.9 2.0 19 177-195 152-170 (365) 197 pfam02629 CoA_binding CoA bind 24.8 50 0.0013 13.9 3.1 30 114-143 63-93 (96) 198 COG1090 Predicted nucleoside-d 24.8 50 0.0013 13.9 6.1 24 305-336 272-295 (297) 199 PRK10302 hypothetical protein; 24.7 50 0.0013 13.9 4.8 79 120-200 137-226 (272) 200 PRK13111 trpA tryptophan synth 24.6 50 0.0013 13.9 10.4 218 18-294 9-232 (256) 201 KOG2599 consensus 24.5 50 0.0013 13.9 9.3 175 146-338 54-240 (308) 202 cd02523 PC_cytidylyltransferas 24.3 51 0.0013 13.8 2.2 25 120-144 27-51 (229) 203 TIGR01162 purE phosphoribosyla 24.2 51 0.0013 13.8 3.5 29 191-219 14-42 (159) 204 cd04724 Tryptophan_synthase_al 24.2 51 0.0013 13.8 10.8 186 22-252 4-194 (242) 205 PRK10799 putative hydrolase-ox 23.8 52 0.0013 13.8 9.6 41 298-339 194-234 (247) 206 PRK13115 consensus 23.6 52 0.0013 13.7 9.7 214 17-289 22-242 (269) 207 pfam00113 Enolase_C Enolase, C 23.4 53 0.0013 13.7 5.5 120 178-332 130-249 (296) 208 pfam10103 DUF2342 Uncharacteri 23.3 22 0.00055 16.4 -0.1 46 231-276 140-192 (341) 209 PRK02269 ribose-phosphate pyro 23.2 53 0.0014 13.7 14.6 165 128-321 75-279 (321) 210 TIGR01615 A_thal_3542 uncharac 23.1 54 0.0014 13.7 1.9 48 214-266 89-141 (144) 211 COG4623 Predicted soluble lyti 23.0 54 0.0014 13.7 5.2 17 258-276 385-401 (473) 212 TIGR02812 fadR_gamma fatty aci 22.8 54 0.0014 13.6 2.3 122 71-201 7-163 (275) 213 KOG0460 consensus 22.8 54 0.0014 13.6 5.5 95 208-312 104-205 (449) 214 COG0474 MgtA Cation transport 22.8 54 0.0014 13.6 4.5 51 250-312 536-586 (917) 215 TIGR02094 more_P_ylases alpha- 22.7 46 0.0012 14.1 1.5 51 225-275 413-466 (618) 216 cd01017 AdcA Metal binding pro 22.5 55 0.0014 13.6 12.3 204 95-335 43-274 (282) 217 PRK10927 essential cell divisi 22.5 55 0.0014 13.6 3.4 54 225-284 247-315 (319) 218 COG1350 Predicted alternative 22.2 56 0.0014 13.6 5.9 68 71-149 65-139 (432) 219 cd04198 eIF-2B_gamma_N The N-t 22.1 56 0.0014 13.5 5.2 24 121-144 30-53 (214) 220 PRK06827 phosphoribosylpyropho 22.1 56 0.0014 13.5 3.6 149 129-298 115-314 (381) 221 cd02538 G1P_TT_short G1P_TT_sh 21.9 56 0.0014 13.5 2.2 25 120-144 29-53 (240) 222 cd04197 eIF-2B_epsilon_N The N 21.9 56 0.0014 13.5 7.5 24 121-144 30-53 (217) 223 TIGR01710 typeII_sec_gspG gene 21.8 57 0.0014 13.5 3.1 58 59-135 15-72 (137) 224 cd06595 GH31_xylosidase_XylS-l 21.7 57 0.0015 13.5 6.7 76 122-202 73-149 (292) 225 pfam03945 Endotoxin_N delta en 21.6 57 0.0015 13.5 3.7 55 114-170 100-154 (224) 226 cd01461 vWA_interalpha_trypsin 21.6 57 0.0015 13.5 4.2 44 266-319 117-160 (171) 227 PRK13463 phosphatase PhoE; Pro 21.5 57 0.0015 13.5 9.2 124 88-221 26-159 (203) 228 pfam11633 Nsp3 Replicase polyp 21.5 58 0.0015 13.5 1.9 54 266-325 80-133 (142) 229 PRK13389 UTP--glucose-1-phosph 21.4 58 0.0015 13.5 3.0 25 120-144 37-61 (302) 230 PRK05282 peptidase E; Validate 20.9 59 0.0015 13.4 6.3 13 35-47 18-30 (233) 231 PRK10015 hypothetical protein; 20.8 59 0.0015 13.4 5.8 12 206-217 272-283 (429) 232 TIGR01429 AMP_deaminase AMP de 20.7 14 0.00035 17.8 -1.6 42 223-266 526-582 (616) 233 pfam07090 DUF1355 Protein of u 20.7 60 0.0015 13.4 5.5 121 142-273 15-140 (183) 234 PRK10122 UTP--glucose-1-phosph 20.5 60 0.0015 13.3 5.1 25 120-144 32-56 (297) 235 PRK13134 consensus 20.4 60 0.0015 13.3 10.9 189 18-252 19-213 (257) 236 pfam06309 Torsin Torsin. This 20.4 60 0.0015 13.3 3.7 37 180-217 27-63 (127) 237 TIGR01392 homoserO_Ac_trn homo 20.3 61 0.0015 13.3 4.8 59 263-328 320-379 (395) 238 PRK13405 bchH magnesium chelat 20.3 61 0.0015 13.3 12.6 184 17-217 113-316 (1207) 239 COG3222 Uncharacterized protei 20.3 61 0.0015 13.3 4.7 91 189-284 88-186 (211) 240 KOG0434 consensus 20.2 61 0.0016 13.3 5.9 25 177-201 848-872 (1070) 241 cd06376 PBP1_mGluR_groupIII Li 20.1 61 0.0016 13.3 9.4 21 20-44 106-126 (463) 242 TIGR02416 CO_dehy_Mo_lg carbon 20.1 59 0.0015 13.4 1.6 33 226-258 499-546 (775) No 1 >PRK00035 hemH ferrochelatase; Reviewed Probab=100.00 E-value=0 Score=786.46 Aligned_cols=323 Identities=41% Similarity=0.728 Sum_probs=313.8 Q ss_pred CCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 77786799997178889999889999999874895441178688899876665125747889999987077887404577 Q gi|254780200|r 14 VKFGKIGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRT 93 (343) Q Consensus 14 ~~~~K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~ 93 (343) .+|+|+||||||||||++++ ||++||++||+|++||++|+++|+++|+++|+|+|+++++++|++|| |+|||+. T Consensus 1 ~~~~k~gvll~nlG~P~~~~--~v~~yL~~fl~d~~vi~~p~~~~~~~l~~~I~~~R~~~~~~~Y~~I~----g~SPL~~ 74 (325) T PRK00035 1 MMMKKVGVLLLNLGGPETPE--DVRPYLKNFLSDRRVIDLPRPLWQPLLAGIILPERLPKVAKHYESIG----GGSPLNV 74 (325) T ss_pred CCCCCEEEEEEECCCCCCHH--HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCC----CCCHHHH T ss_conf 98776699998578999979--99999999838997467864889999875630551599999998657----9990899 Q ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 75999999998652247843001121045424789999987099658996317111001468999999999987179986 Q gi|254780200|r 94 HTRDQATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPS 173 (343) Q Consensus 94 ~t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~ 173 (343) +|+.|+++|+++|+...+++|++|||||+|+|++++++|+++||++|+++|||||||++||||+++.+.+++++.++.++ T Consensus 75 ~T~~q~~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPlyPqyS~sTtgs~~~~~~~~l~~~~~~~~ 154 (325) T PRK00035 75 ITRAQAEALEARLAQGVDLPVYLGMRYGEPSIEDAVEQLKADGVDRIVVLPLYPQYSYSTTASYFDDAARALAKLRGQPE 154 (325) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 99999999999853599933998754689977899999985798559999775114176775999999999986599996 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 42133455780068999999999997338998669998336750267627998889999999999972667567047862 Q gi|254780200|r 174 LRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQ 253 (343) Q Consensus 174 ~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQ 253 (343) +++|++||++|.||++++++|++++++.+ ++++|||||||||++++++||||.+||++|+++|+++||++.++|.+||| T Consensus 155 ~~~I~~~~~~p~yI~a~~~~I~~~l~~~~-~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~~~fQ 233 (325) T PRK00035 155 IRTIRSYYDHPGYIEALAESIREALAQLG-RPDRLLFSAHGLPKRYIDAGDPYPQQCEETARLIAEALGLGPEEYDLTYQ 233 (325) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHHHCC-CCCEEEEECCCCHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEE T ss_conf 48847745798999999999999998679-97679996065628888649908999999999999982899534189876 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHH Q ss_conf 25777654430268999989976998399974405422125588875899999997599758981689879899999999 Q gi|254780200|r 254 SRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEK 333 (343) Q Consensus 254 Sr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~ 333 (343) ||+||++||+|+|+|+|++|+++|+|+|+|+||||++||+|||||||+|+||.|.++||++|.|+|||||||.||++|++ T Consensus 234 Sr~Gp~~WL~P~t~~~l~~l~~~G~k~V~v~p~gFvsD~lETl~Eidie~~e~~~~~G~~~~~~vp~lN~~~~fi~~L~~ 313 (325) T PRK00035 234 SRFGPEPWLEPYTDDTLEELAEKGVKKIVVVPPGFVSDHLETLEEIDIEYREIAEEAGGKEYRRIPCLNDSPEFIEALAD 313 (325) T ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHH T ss_conf 37789886887589999998857997599989820453446599988999999998599558980799999899999999 Q ss_pred HHHHHHCCCC Q ss_conf 9999842889 Q gi|254780200|r 334 ITRRELMGWI 343 (343) Q Consensus 334 lv~~~l~gw~ 343 (343) +|+++|+||. T Consensus 314 lv~~~l~~~~ 323 (325) T PRK00035 314 LVLERLQGWP 323 (325) T ss_pred HHHHHHCCCC T ss_conf 9999856999 No 2 >COG0276 HemH Protoheme ferro-lyase (ferrochelatase) [Coenzyme metabolism] Probab=100.00 E-value=0 Score=742.51 Aligned_cols=319 Identities=42% Similarity=0.744 Sum_probs=309.8 Q ss_pred CCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHH Q ss_conf 77867999971788899998899999998748954411786888998766651257478899999870778874045777 Q gi|254780200|r 15 KFGKIGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTH 94 (343) Q Consensus 15 ~~~K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~ 94 (343) .|+|+||||||||||++++ ||++||+|||+||+|+++|+++|++ |+++|++.|+++++++|++|| |+|||+.+ T Consensus 2 ~~~k~avLL~nlG~P~~~e--~v~~yL~~~~~d~~v~~~~~~~~~~-l~~~I~~~R~~~~~~~Y~~ig----g~sPL~~~ 74 (320) T COG0276 2 KMKKTAVLLLNLGGPETLE--DVRPYLKNFLSDRRVIELPRPLWYP-LAGIILPLRLKKVAKNYESIG----GKSPLNVI 74 (320) T ss_pred CCCCEEEEEEECCCCCCHH--HHHHHHHHHHCCCCCCCCCHHHHHH-HHHHHHHHCCHHHHHHHHHHC----CCCCCHHH T ss_conf 9875079998569999778--9999999983699877762455543-543310100188999999846----88943999 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 59999999986522478430011210454247899999870996589963171110014689999999999871799864 Q gi|254780200|r 95 TRDQATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSL 174 (343) Q Consensus 95 t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~ 174 (343) |+.|+++|+++|+ ..++.|++|||||+|+|++++++|+++|+++||++|||||||++||||+.+.+++++.+.+..+++ T Consensus 75 T~~q~~~L~~~L~-~~~~~V~~amry~~P~i~~~v~~l~~~gv~~iv~~pLyPqyS~sTt~s~~~~~~~al~~~~~~~~i 153 (320) T COG0276 75 TRAQAAALEERLD-LPDFKVYLAMRYGPPFIEEAVEELKKDGVERIVVLPLYPQYSSSTTGSYVDELARALKELRGQPKI 153 (320) T ss_pred HHHHHHHHHHHHC-CCCCCEEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCE T ss_conf 9999999999737-998607976347898689999999971987599997776302301899999999999853789866 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEC Q ss_conf 21334557800689999999999973389986699983367502676279988899999999999726675670478622 Q gi|254780200|r 175 RTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQS 254 (343) Q Consensus 175 ~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQS 254 (343) ++|++||++|.||++|+++|++++++.+.++++|||||||||++++++||||.+||.+|+++|++.||++.++|.++||| T Consensus 154 ~~I~~~~~~p~yI~a~a~~I~~~~~~~~~~~~~llfSaHglP~~~~~~GDpY~~q~~~t~~li~e~lg~~~~~~~~~~QS 233 (320) T COG0276 154 STIPDYYDEPLYIEALADSIREKLAKHPRDDDVLLFSAHGLPKRYIDEGDPYPQQCQETTRLIAEALGLPEEEYDLTFQS 233 (320) T ss_pred EEECCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHCCCCHHHHHHHHHHHHHHHCCCCCHHEEEEEEC T ss_conf 88167557868999999999999974787770899956899655655699669999999999999809980340588625 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHH Q ss_conf 57776544302689999899769983999744054221255888758999999975997589816898798999999999 Q gi|254780200|r 255 RFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKI 334 (343) Q Consensus 255 r~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~l 334 (343) |+||+|||+|+|+|++++|.++|+|+++|+||||+|||+|||||||+|+++.|.++||++|.|+|||||||.||++|+++ T Consensus 234 ~~G~~~WL~P~t~~~l~~L~~~g~k~iiv~pigFvsDhlETL~Eid~e~~e~~~~~Gg~~y~rip~lN~~p~fi~~la~l 313 (320) T COG0276 234 RFGPEPWLQPYTDDLLEELGEKGVKKIIVVPIGFVSDHLETLYEIDHEYRELAEEAGGKKYVRIPCLNDSPEFIDALADL 313 (320) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHH T ss_conf 99998888977999999988638984999897010212888998888899999980885189668999999999999999 Q ss_pred HHHHHCC Q ss_conf 9998428 Q gi|254780200|r 335 TRRELMG 341 (343) Q Consensus 335 v~~~l~g 341 (343) |++++.+ T Consensus 314 v~~~~~~ 320 (320) T COG0276 314 VRELLNE 320 (320) T ss_pred HHHHHCC T ss_conf 9987329 No 3 >pfam00762 Ferrochelatase Ferrochelatase. Probab=100.00 E-value=0 Score=733.66 Aligned_cols=311 Identities=44% Similarity=0.747 Sum_probs=300.6 Q ss_pred CEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 67999971788899998899999998748954411786888998766651257478899999870778874045777599 Q gi|254780200|r 18 KIGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRD 97 (343) Q Consensus 18 K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~ 97 (343) |+||||+|||||++++ +|++||++||+|++|+++|+++|+ +++|+++|+++++++|++| ||+|||+.+|+. T Consensus 1 k~aVLL~n~G~P~s~~--~v~~yL~~~l~d~~vi~~p~~~~~---~~~I~~~R~~~~~~~Y~~i----gg~SPl~~~t~~ 71 (312) T pfam00762 1 KTAVLLLNLGGPETPE--DVRPFLRNFLSDRRVIELPLQLWL---AGIILPERPKKSAEAYRKI----GGGSPLNVITRA 71 (312) T ss_pred CEEEEEEECCCCCCHH--HHHHHHHHHCCCCCCCCCCHHHHH---HHHCCCCCCHHHHHHHHHC----CCCCHHHHHHHH T ss_conf 9799998678989979--999999997489986768779997---6312423579999999977----998858999999 Q ss_pred HHHHHHHHHHCC-CCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 999999865224-7843001121045424789999987099658996317111001468999999999987179986421 Q gi|254780200|r 98 QATNLAKRLESI-SSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRT 176 (343) Q Consensus 98 qa~~L~~~L~~~-~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~ 176 (343) |+++|+++|++. .+++|++|||||+|+|++++++|+++|+++|+++|||||||++||||+++.+.+++++.++.+++++ T Consensus 72 q~~~L~~~l~~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lplyPqyS~~Tt~s~~~~~~~~~~~~~~~~~~~~ 151 (312) T pfam00762 72 QAEALQKRLGERGADVKVYLAMRYGPPSIEDALEELKADGVDRIVVLPLYPQYSASTTGSYLDELARALKKGRPAPEVRV 151 (312) T ss_pred HHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEE T ss_conf 99999999871499953898855689977889999996598769999777542303076999999999986589985127 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCC Q ss_conf 33455780068999999999997338998669998336750267627998889999999999972667567047862257 Q gi|254780200|r 177 VPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRF 256 (343) Q Consensus 177 I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~ 256 (343) |++||+||.||++|+++|++++++.+.++++|||||||||++++++||||.+||.+|+++|+++||++ +|.+|||||+ T Consensus 152 I~~~~~~p~yi~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gdpY~~~~~~t~~~i~~~l~~~--~~~~~fQSr~ 229 (312) T pfam00762 152 IRRYYDHPGYIEALADSIREALAKLGDDPDRLLFSAHGLPERYIKKGDPYPAQCEETARLVAERLGLS--DYRLAFQSRF 229 (312) T ss_pred ECCCCCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHCCCCHHHHHHHHHHHHHHHCCCC--CEEEEEEECC T ss_conf 06757898999999999999998569877469997056604677569907999999999999866987--2889998468 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 77654430268999989976998399974405422125588875899999997599758981689879899999999999 Q gi|254780200|r 257 GRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKITR 336 (343) Q Consensus 257 Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv~ 336 (343) ||++||+|+|+++|++|+++|+|+|+|+||||++||+|||||||+|+||.|.++||++|.|+|||||||.||++|+++|+ T Consensus 230 g~~~Wl~P~t~~~l~~l~~~G~k~v~v~p~gFv~D~lETl~Eidie~~e~~~~~G~~~~~~ip~lNd~~~fi~~L~~lv~ 309 (312) T pfam00762 230 GPEPWLEPYTDDTLEELAKEGVKKVVVVPIGFVSDHLETLYELDIEYRELAEEAGGENYRRIPCLNDSPAFIEALADLVR 309 (312) T ss_pred CCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHH T ss_conf 99998998889999999976997399988845442467699988999999997599739980899999899999999999 Q ss_pred HHH Q ss_conf 984 Q gi|254780200|r 337 REL 339 (343) Q Consensus 337 ~~l 339 (343) ++| T Consensus 310 ~~l 312 (312) T pfam00762 310 EHL 312 (312) T ss_pred HHC T ss_conf 649 No 4 >TIGR00109 hemH ferrochelatase; InterPro: IPR001015 Synonym(s): Protoheme ferro-lyase, Iron chelatase, etc. Ferrochelatase catalyzes the last step in heme biosynthesis: the chelation of a ferrous ion to proto-porphyrin IX, to form protoheme , . In eukaryotic cells, it binds to the mitochondrial inner membrane with its active site on the matrix side of the membrane. The X-ray structure of Bacillus subtilis and human ferrochelatase have been solved , . The human enzyme exists as a homodimer. Each subunit contains one [Fe2S2] cluster. The monomer is folded into two similar domains, each with a four-stranded parallel beta-sheet flanked by an alpha-helix in a beta-alpha-beta motif that is reminiscent of the fold found in the periplasmic binding proteins. The topological similarity between the domains suggests that they have arisen from a gene duplication event. However, significant differences exist between the two domains, including an N-terminal section (residues 80-130) that forms part of the active site pocket, and a C-terminal extension (residues 390-423) that is involved in coordination of the [Fe2S2]cluster and in stabilisation of the homodimer. The [Fe2S2] cluster ligands are Cys196, Cys403, Cys406 and Cys411. The experiments with Co(II) binding show that His230 and Asp383 are part of the enzyme active site . Ferrochelatase seems to have a structurally conserved core region that is common to the enzyme from bacteria, plants and mammals. Porphyrin binds in the identified cleft; this cleft also includes the metal-binding site of the enzyme. It is likely that the structure of the cleft region will have different conformations upon substrate binding and release .; GO: 0004325 ferrochelatase activity, 0006783 heme biosynthetic process. Probab=100.00 E-value=0 Score=734.67 Aligned_cols=325 Identities=35% Similarity=0.623 Sum_probs=311.0 Q ss_pred CCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 77786799997178889999889999999874895441178688899876665125747889999987077887404577 Q gi|254780200|r 14 VKFGKIGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRT 93 (343) Q Consensus 14 ~~~~K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~ 93 (343) +.++|+||||||||||++|+.+||++||++||.||+||++|+..||++++.+|+++|.+|+++.|++||+ .|||||.. T Consensus 1 ~k~~k~GvlL~NlGgP~~P~~e~v~~fL~~~f~Dp~~~~~~~~~~~K~~~~~I~~lR~~k~~K~Y~~iG~--EGgSPL~~ 78 (339) T TIGR00109 1 TKRKKTGVLLLNLGGPDKPKLEEVERFLKQLFADPRIIDISRAKWRKLLAKMILPLRSKKIAKNYEEIGT--EGGSPLLQ 78 (339) T ss_pred CCCCCEEEEEEECCCCCCCCHHHHHHHHHHHCCCCCEECCCCHHCCCCHHHHHHHHHHHHHHHHHHHHCC--CCCCHHHH T ss_conf 9888638888406642336512268999975178300116512006428999765210889999997277--78871468 Q ss_pred HHHHHHHHHHHHHHC----CCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 759999999986522----4784300112104542478999998709965899631711100146899999999998717 Q gi|254780200|r 94 HTRDQATNLAKRLES----ISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMR 169 (343) Q Consensus 94 ~t~~qa~~L~~~L~~----~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~ 169 (343) ||+.|+.+|++.|+. .-+..|++|||||+||+++++++|.+++++++|+||||||||+|||||+++.+.+++++.+ T Consensus 79 it~~Q~~~L~~~L~~~~~~~~~~~~~~am~Y~~P~~~~av~~l~~d~v~~~V~l~lYPhfS~sTtgS~f~~l~~~~~~~~ 158 (339) T TIGR00109 79 ITEQQAEALEKRLDKEIQEEIDAKVYIAMRYGEPFTEEAVKELLKDGVERLVVLPLYPHFSSSTTGSSFNELAEALKKLR 158 (339) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 99999999986244224356873304532127821899999986454648888514873000127889999999998348 Q ss_pred CC--CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC--C-CEEEEEECCCCHHHHHH-CCCCHHHHHHHHHHHHHHHCC Q ss_conf 99--864213345578006899999999999733899--8-66999833675026762-799888999999999997266 Q gi|254780200|r 170 WS--PSLRTVPPYYEDSDYISALAQSIREHFESIQWT--P-EMLLVSFHQMPVSYLLK-GDPYGCHCHKTARLLKEFLSW 243 (343) Q Consensus 170 ~~--~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~--~-~~llfSaHglP~~~~~~-gDpY~~q~~~t~~~i~~~l~~ 243 (343) .. +.+++|++||++++||+||+++|++.|++.+.+ + .+|||||||||+++++. ||+|++||++|+++|++.||. T Consensus 159 slR~~~~~~I~~w~~~~~y~~a~~d~i~~~l~~~p~p~Rd~~~LlfS~HglP~~~~d~aGD~Y~~~~~~~~~~i~~~l~~ 238 (339) T TIGR00109 159 SLRSLKISVIESWYDNEKYIKALADSIKETLKKFPEPERDKAVLLFSAHGLPQSYVDEAGDPYPAECEATTRLIAEKLGL 238 (339) T ss_pred CCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCHHCEEEEEEECCCCHHHHHCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 96576314751445688799999999999984077950110687774167844777426895379999999999997133 Q ss_pred ---CCCCCEEEEEC-CCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf ---75670478622-57776544302689999899769983999744054221255888758999999975997589816 Q gi|254780200|r 244 ---PDDRFKICFQS-RFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVP 319 (343) Q Consensus 244 ---~~~~~~l~fQS-r~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp 319 (343) ..++|.++||| |+||.|||+|+|+|.++.|.++|+.+++|+||||++||+|||+|||+|||+.|.+.|+.+|.|+| T Consensus 239 knt~~~~y~l~~QSe~~GP~~WL~P~T~e~~~~l~~~~~~~~~~~P~gF~~Dh~ETL~EiD~e~R~~~e~~g~~~y~~~p 318 (339) T TIGR00109 239 KNTAPNEYRLTYQSERVGPEPWLGPYTEELLEKLGEKGVDHIVVVPIGFTADHLETLYEIDEEYREVAEEAGGKKYQRIP 318 (339) T ss_pred CCCCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 57867643788621676886566632799999986506880899886301245788887566666665541244343468 Q ss_pred CCCCCHHHHHHHHHHHHHHHC Q ss_conf 898798999999999999842 Q gi|254780200|r 320 CLNSSNLSIDLLEKITRRELM 340 (343) Q Consensus 320 ~lN~~~~fi~~La~lv~~~l~ 340 (343) |||+++.||++||++|.+++. T Consensus 319 ~LN~~~~FI~~~a~~V~~~l~ 339 (339) T TIGR00109 319 ALNAKPEFIEALAELVLKKLE 339 (339) T ss_pred CCCCCHHHHHHHHHHHHHHCC T ss_conf 988787999999999998549 No 5 >PRK12435 ferrochelatase; Provisional Probab=100.00 E-value=0 Score=665.45 Aligned_cols=303 Identities=21% Similarity=0.358 Sum_probs=274.1 Q ss_pred CCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHH Q ss_conf 77867999971788899998899999998748954411786888998766651257478899999870778874045777 Q gi|254780200|r 15 KFGKIGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTH 94 (343) Q Consensus 15 ~~~K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~ 94 (343) .++|+||||||||||++++ ||++||++|++|+++++ ..+ .+..++|++| ||+|||+.+ T Consensus 2 ~k~K~gVLL~nlG~P~s~~--dV~pyL~~i~~gr~p~~-------~~l---------~~~~~~Y~~i----gg~SPl~~~ 59 (316) T PRK12435 2 KKKKIGLLVMAYGTPESLD--DVEAYYTHIRHGRKPSE-------EAL---------EDLIGRYKAI----GGISPLAKI 59 (316) T ss_pred CCCCEEEEEEECCCCCCHH--HHHHHHHHHCCCCCCCH-------HHH---------HHHHHHHHHH----CCCCHHHHH T ss_conf 9885899998789999876--89999999706899989-------999---------9999999982----898854999 Q ss_pred HHHHHHHHHHHHHCC---CCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 599999999865224---78430011210454247899999870996589963171110014689999999999871799 Q gi|254780200|r 95 TRDQATNLAKRLESI---SSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWS 171 (343) Q Consensus 95 t~~qa~~L~~~L~~~---~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~ 171 (343) |+.|+++|+++|++. .++.|++|||||+|+|++++++|+++|+++|+++|||||||.+||+|+.+.+.++.++. +. T Consensus 60 t~~qa~~L~~~L~~~~~~~~~~v~~gmRy~~P~i~~~l~~l~~~g~~~ii~lplyPqyS~~t~~s~~~~~~~~~~~~-~~ 138 (316) T PRK12435 60 TKEQAHKLTDSMNNMFTEYEFNCYLGLKHTAPFIEDAVEEMKRDGIEQAISIVLAPHYSTFSIKAYNERAIRLSEEI-GG 138 (316) T ss_pred HHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHC-CC T ss_conf 99999999999715258987516774289999899999999984995699986676445420899999999999862-99 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC-CCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 86421334557800689999999999973389-98669998336750267627998889999999999972667567047 Q gi|254780200|r 172 PSLRTVPPYYEDSDYISALAQSIREHFESIQW-TPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKI 250 (343) Q Consensus 172 ~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~-~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l 250 (343) |.+++|++||+||+||+|++++|++++++... +..+|||||||||++++++||||.+||++|+++|+++|++. +|.+ T Consensus 139 p~i~~I~~~~~~p~yI~a~a~~I~~~l~~~~~~~~~~llFSaHglP~~~i~~gDpY~~q~~~t~~~va~~l~~~--~~~~ 216 (316) T PRK12435 139 PVIEPIDQWYDEPKFISYWADQIKETFTKIEDKEKAVVIFSAHSLPEKIIAAGDPYVEQLQHTADLIAAAANIQ--NYTI 216 (316) T ss_pred CEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHCCCCCHHHHHHHHHHHHHHCCCC--CCEE T ss_conf 86898268789879999999999999986636566579996277855563568980899999999999974999--8569 Q ss_pred EEECCCC-CCCCCCHHHHHHHHHHHHC-CCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHH Q ss_conf 8622577-7654430268999989976-9983999744054221255888758999999975997589816898798999 Q gi|254780200|r 251 CFQSRFG-RIKCLEPPTDKTVEKLAHD-GIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSI 328 (343) Q Consensus 251 ~fQSr~G-p~~WL~P~t~~~l~~L~~~-G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi 328 (343) |||||+| |.+||+|+|+|++++|+++ |+|+|+|+||||+|||+|||||||+|+|+.|.+.|+ .|.|+|||||||.|| T Consensus 217 ~yQSr~g~p~~WL~P~t~d~l~~l~~~~~~k~vvv~Pi~FvsDhlETL~EldiE~ke~a~e~G~-~~~rvp~lN~~p~fI 295 (316) T PRK12435 217 GWQSAGNTPDPWIGPDVQDLTRDLYEEHGYESFIYCPVGFVAEHLEVLYDNDYECKVVTDELNA-AYFRPNMPNAQSTFI 295 (316) T ss_pred EEECCCCCCCCCCCCCHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHCCC-CEEECCCCCCCHHHH T ss_conf 9889999899999987999999999827975899989743033078899999999999997599-579668999998999 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999999842889 Q gi|254780200|r 329 DLLEKITRRELMGWI 343 (343) Q Consensus 329 ~~La~lv~~~l~gw~ 343 (343) ++|+++|+++|+.|+ T Consensus 296 ~~La~lV~~~l~~~~ 310 (316) T PRK12435 296 DCLATIVSRKMKEIV 310 (316) T ss_pred HHHHHHHHHHHHHHH T ss_conf 999999999986431 No 6 >KOG1321 consensus Probab=100.00 E-value=0 Score=623.54 Aligned_cols=322 Identities=26% Similarity=0.462 Sum_probs=301.9 Q ss_pred CCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHH Q ss_conf 98777867999971788899998899999998748954411786888998766651257478899999870778874045 Q gi|254780200|r 12 PKVKFGKIGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESIL 91 (343) Q Consensus 12 p~~~~~K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL 91 (343) +...++||||||||||||++.+ ||.+||.++|.|+++|++|+.+ |.-|+.||+.+|.+|+++.|+.| |||||+ T Consensus 32 ~~~~k~ktgilllNmGGP~~ld--dV~~FL~rLfaD~DiI~Lp~~~-Q~~lakfIak~RtPKvqe~Y~~I----GGGSPi 104 (395) T KOG1321 32 VAQKKPKTGILLLNMGGPETLD--DVQDFLYRLFADPDIIPLPAFL-QKTLAKFIAKRRTPKVQEQYREI----GGGSPI 104 (395) T ss_pred CCCCCCCCEEEEECCCCCCCHH--HHHHHHHHHHCCCCEEECCHHH-HHHHHHHHHHHCCCHHHHHHHHC----CCCCCH T ss_conf 2236887238997179986221--5788999983488701077888-75688788875283377777750----699800 Q ss_pred HHHHHHHHHHHHHHHHCCC----CCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7775999999998652247----843001121045424789999987099658996317111001468999999999987 Q gi|254780200|r 92 RTHTRDQATNLAKRLESIS----SIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIH 167 (343) Q Consensus 92 ~~~t~~qa~~L~~~L~~~~----~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~ 167 (343) +.+|+.|++.+.+.|++.. ...+++|||||+|+.|+++++++++|++++|++|+|||||++||||+++.+++.+.+ T Consensus 105 r~wT~~Q~~~m~k~Ld~~~petaphk~YVgfRY~~PlTEea~~qikkd~v~r~VafsqYPQyS~sTsGSSln~l~r~~r~ 184 (395) T KOG1321 105 RKWTEYQAEEMCKILDKKCPETAPHKPYVGFRYAHPLTEEALEQIKKDGVTRAVAFSQYPQYSCSTSGSSLNELWRQFRE 184 (395) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCHHHHHHHHHCCCEEEEEECCCCCEEEECCCCCHHHHHHHHHH T ss_conf 67888889999999974195547765346665248564999999975276047863158750231576429999999984 Q ss_pred HCC--CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC---CCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHC Q ss_conf 179--9864213345578006899999999999733899---86699983367502676279988899999999999726 Q gi|254780200|r 168 MRW--SPSLRTVPPYYEDSDYISALAQSIREHFESIQWT---PEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLS 242 (343) Q Consensus 168 ~~~--~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~---~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~ 242 (343) .+. ..++.+|++|+.+++||++++++|.+.|+.+..+ +..++|||||+|++++.+||||+.|+.+|+++||++|+ T Consensus 185 ~~~~~~~~wsiIdrW~t~~glIkafA~~I~keL~~F~~~~r~~VVIlFSAHslPms~Vn~GDpY~~Ei~atv~~iMeeL~ 264 (395) T KOG1321 185 DGYERDIKWSIIDRWPTREGLIKAFAENIEKELQTFPEPVRDDVVILFSAHSLPMSVVNAGDPYPAEIAATVDLIMEELK 264 (395) T ss_pred CCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCHHHHHCCCCCHHHHHHHHHHHHHHHC T ss_conf 47656881586054455516999999999999974687555647999944778478774589757899999999999961 Q ss_pred CCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 67567047862257776544302689999899769983999744054221255888758999999975997589816898 Q gi|254780200|r 243 WPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLN 322 (343) Q Consensus 243 ~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN 322 (343) .. |+|.++||||+||.+||+|+|+++++.|.++|.||++.|||+|+|||+|||+|||+||+|++.+.|++++.|+++|| T Consensus 265 ~~-N~y~lawQSkVGP~pWL~p~Tde~i~~lgk~g~knll~VPIaFvSeHIETL~EiD~ey~e~a~k~gve~~~Rv~sln 343 (395) T KOG1321 265 YK-NPYRLAWQSKVGPLPWLGPATDEVIEGLGKKGVKNLLLVPIAFVSEHIETLHEIDIEYIEEALKKGVENWKRVESLN 343 (395) T ss_pred CC-CCCHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCCC T ss_conf 37-93024343036775445600589999987612244589752165677878998418999999997654010146889 Q ss_pred CCHHHHHHHHHHHHHHHCC Q ss_conf 7989999999999998428 Q gi|254780200|r 323 SSNLSIDLLEKITRRELMG 341 (343) Q Consensus 323 ~~~~fi~~La~lv~~~l~g 341 (343) ++|.||++|||+|.+||+. T Consensus 344 ~~p~fI~~lADlV~ehL~s 362 (395) T KOG1321 344 GNPTFIEGLADLVAEHLKS 362 (395) T ss_pred CCCCHHHHHHHHHHHHHHH T ss_conf 9821899899999986213 No 7 >cd03411 Ferrochelatase_N Ferrochelatase, N-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus. Probab=100.00 E-value=1.4e-45 Score=338.00 Aligned_cols=158 Identities=45% Similarity=0.746 Sum_probs=150.9 Q ss_pred EEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 79999717888999988999999987489544117868889987666512574788999998707788740457775999 Q gi|254780200|r 19 IGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRDQ 98 (343) Q Consensus 19 ~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~q 98 (343) +||||+|||||++++ +|++||++||+|++||++|.++| ++|+++|+++|+++++++|++|| |||||+.+|+.| T Consensus 1 ~~VlL~n~G~P~~~~--~v~~yL~~~l~d~~vi~~p~~~~-~~l~~~I~~~R~~k~~~~Y~~i~----~~SPL~~~t~~q 73 (159) T cd03411 1 TAVLLVNLGGPESLE--DVRPFLKNFLSDRRVIELPRPLR-PILAGIILPRRPPKVAKNYKKIG----GGSPLNEITRAQ 73 (159) T ss_pred CEEEEEECCCCCCHH--HHHHHHHHHCCCCCCCCCCCHHH-HHHHHHCCHHHHHHHHHHHHHHC----CCCHHHHHHHHH T ss_conf 979999689989869--99999999858986310761268-99888807000699999999826----988789999999 Q ss_pred HHHHHHHHHCCC-CCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 999998652247-8430011210454247899999870996589963171110014689999999999871799864213 Q gi|254780200|r 99 ATNLAKRLESIS-SIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTV 177 (343) Q Consensus 99 a~~L~~~L~~~~-~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I 177 (343) +++|+++|++.+ +++|++|||||+|+++|++++|+++|+++|+++|||||||++||||+++++.+++++.++.+++++| T Consensus 74 ~~~L~~~L~~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~iv~lPlyPqyS~sTt~S~~~~~~~~~~~~~~~~~i~~I 153 (159) T cd03411 74 AEALEKALDERGIDVKVYLAMRYGPPSIEEALEELKADGVDRIVVLPLYPQYSASTTGSYLDEVERALKKLRPAPELRVI 153 (159) T ss_pred HHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 99999998732899518987432799899999999854888499997987413144899999999999964899848982 Q ss_pred CCCCCC Q ss_conf 345578 Q gi|254780200|r 178 PPYYED 183 (343) Q Consensus 178 ~~~~~~ 183 (343) ++|||| T Consensus 154 ~~f~d~ 159 (159) T cd03411 154 RSFYDH 159 (159) T ss_pred CCCCCC T ss_conf 886788 No 8 >cd00419 Ferrochelatase_C Ferrochelatase, C-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus. Probab=100.00 E-value=3.9e-43 Score=320.87 Aligned_cols=135 Identities=44% Similarity=0.761 Sum_probs=131.5 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHH Q ss_conf 99999999999733899866999833675026762799888999999999997266756704786225777654430268 Q gi|254780200|r 188 SALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTD 267 (343) Q Consensus 188 ~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~ 267 (343) ++++++|++++++.+.++++|||||||||++++++||||.+||++|+++|+++|+.+.++|.++||||+||++||+|+|+ T Consensus 1 ~a~a~~I~~~l~~~~~~~~~llfS~HglP~~~~~~gd~Y~~q~~~t~~~i~~~l~~~~~~~~~~~QS~~g~~~Wl~P~~~ 80 (135) T cd00419 1 EALADHIREALAELPREKDRLLFSAHGLPVRDIKKGDPYPDQCEETARLVAERLGLPFDEYELAYQSRFGPGEWLEPSTD 80 (135) T ss_pred CHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCHHEEEEEECCCCCCCCCCCCHH T ss_conf 94799999999967998878999726475877638997799999999999998299810168998648888887899879 Q ss_pred HHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 9999899769983999744054221255888758999999975997589816898 Q gi|254780200|r 268 KTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLN 322 (343) Q Consensus 268 ~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN 322 (343) ++|++|+++|+++|+|+||||++||+|||||||+|+++.|.++||++|.|+|||| T Consensus 81 ~~l~~l~~~G~~~v~v~p~gF~sD~lETl~eld~e~~~~~~~~Gg~~~~~vp~lN 135 (135) T cd00419 81 DALEELAKEGVKNVVVVPIGFVSDHLETLYELDIEYRELAEEAGGENYRRVPCLN 135 (135) T ss_pred HHHHHHHHCCCCCEEEECCEEECCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 9999998669971999898530365144888789999999983995588728999 No 9 >cd03409 Chelatase_Class_II Class II Chelatase: a family of ATP-independent monomeric or homodimeric enzymes that catalyze the insertion of metal into protoporphyrin rings. This family includes protoporphyrin IX ferrochelatase (HemH), sirohydrochlorin ferrochelatase (SirB) and the cobaltochelatases, CbiK and CbiX. HemH and SirB are involved in heme and siroheme biosynthesis, respectively, while the cobaltochelatases are associated with cobalamin biosynthesis. Excluded from this family are the ATP-dependent heterotrimeric chelatases (class I) and the multifunctional homodimeric enzymes with dehydrogenase and chelatase activities (class III). Probab=99.28 E-value=9.4e-12 Score=102.45 Aligned_cols=87 Identities=31% Similarity=0.390 Sum_probs=78.7 Q ss_pred EEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCH Q ss_conf 99983367502676279988899999999999726675670478622577765443026899998997699839997440 Q gi|254780200|r 208 LLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPG 287 (343) Q Consensus 208 llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~g 287 (343) ||++.||.|.. |||..+++++++.|+++++ +.++.+||||. ++|++++++++|.++|+++|+++|.. T Consensus 2 iL~v~hGs~~~-----dpy~~~ie~~a~~i~~~~~--~~~v~~~f~e~------~~P~i~eai~~l~~~G~~~ivvvP~~ 68 (101) T cd03409 2 LLVVGHGSPYK-----DPYKKDIEAQAHNLAESLP--DFPYYVGFQSG------LGPDTEEAIRELAEEGYQRVVIVPLA 68 (101) T ss_pred EEEEECCCCCC-----CCHHHHHHHHHHHHHHHCC--CCCEEHHHHHH------CCCCHHHHHHHHHHCCCCEEEEEEEE T ss_conf 69996699976-----6608999999999998879--98241437977------19999999999997698679998643 Q ss_pred HHHCCHHHHHHHHHHHHHHHH Q ss_conf 542212558887589999999 Q gi|254780200|r 288 FSSDCLETSYEIAHEAKEIFV 308 (343) Q Consensus 288 FvsD~lETl~Eidie~~e~~~ 308 (343) |+ +|.||++||+.|...... T Consensus 69 l~-~g~h~~~Dip~el~~~~~ 88 (101) T cd03409 69 PV-SGDEVFYDIDSEIGLVRK 88 (101) T ss_pred EE-CCHHHHHHHHHHHHHHHH T ss_conf 00-560577330799978888 No 10 >pfam06180 CbiK Cobalt chelatase (CbiK). This family consists of several bacterial cobalt chelatase (CbiK) proteins (EC:4.99.1.-). Probab=99.06 E-value=7.8e-07 Score=68.24 Aligned_cols=193 Identities=12% Similarity=0.192 Sum_probs=134.5 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 5424789999987099658996317111001468999999999987179-986421334557800689999999999973 Q gi|254780200|r 122 KPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRW-SPSLRTVPPYYEDSDYISALAQSIREHFES 200 (343) Q Consensus 122 ~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~-~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~ 200 (343) -++..++|++|+++|+++|++.||+ ..-|.-++.+.+.+.+... ..++.+-.+.-.++.=++..++.|.+.+.. T Consensus 57 i~~p~eaL~~L~~~G~~~V~VQplh-----ii~G~Ey~~l~~~v~~~~~~F~~i~~g~PLL~~~~D~~~v~~al~~~~~~ 131 (256) T pfam06180 57 IDTPLQALNKLADQGYEEVIVQPLH-----IIPGEEYEKLKREVNKFSPDFKRIKLGRPLLDYPEDYEEVVEALKDQIPP 131 (256) T ss_pred CCCHHHHHHHHHHCCCCEEEEEECC-----EECCHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCC T ss_conf 7999999999998799889992362-----43747699999999997531884997315668868999999999984533 Q ss_pred CCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCC Q ss_conf 38998669998336750267627998889999999999972667567047862257776544302689999899769983 Q gi|254780200|r 201 IQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKS 280 (343) Q Consensus 201 ~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~ 280 (343) .. +.+.+||-.||-+-. + ...+...+......+.++ +.++= + +-| |++++++++|.+.|+|+ T Consensus 132 ~~-~~~a~VlmGHGt~h~----a----~~~Y~~l~~~l~~~~~~~--v~vgt---v--Eg~--P~~e~vl~~L~~~g~k~ 193 (256) T pfam06180 132 LR-KDEALVFMGHGTDHH----S----NAVYACLDHVMRNYPFPN--VFVGT---V--EGY--PGVDDVIAELRKKGITE 193 (256) T ss_pred CC-CCCEEEEECCCCCCC----C----CHHHHHHHHHHHHCCCCC--EEEEE---E--CCC--CCHHHHHHHHHHCCCCE T ss_conf 36-898699981898985----3----089999999999569998--79999---6--897--99999999999769985 Q ss_pred EEEECCHHHH-CCHHHHHHHHH----HHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 9997440542-21255888758----99999997599758981689879899999999999984 Q gi|254780200|r 281 LAIITPGFSS-DCLETSYEIAH----EAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKITRREL 339 (343) Q Consensus 281 v~v~p~gFvs-D~lETl~Eidi----e~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv~~~l 339 (343) |.++|.=+|+ ||- ..|+.- --+..+.++|..--.....|...|.|.+...+-+++.+ T Consensus 194 V~L~PfM~VAGdHa--~nDmaGde~dSWks~L~~~G~~v~~~l~GLGe~~~i~~i~i~hl~~a~ 255 (256) T pfam06180 194 VTLMPLMLVAGDHA--KNDMASDEEDSWKNRFEAAGIKVEIILSGLGEIDEIRSIFIDHIKDAI 255 (256) T ss_pred EEEECHHHHCHHHH--HHCCCCCCCCHHHHHHHHCCCEEEEEECCCCCCHHHHHHHHHHHHHHH T ss_conf 99965076332515--432368994159999997798667985367589999999999999984 No 11 >COG2138 Sirohydrochlorin ferrochelatase [Inorganic ion transport and metabolism] Probab=98.61 E-value=4.2e-05 Score=56.14 Aligned_cols=226 Identities=19% Similarity=0.178 Sum_probs=136.4 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCC-CCCHHH-HHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHH Q ss_conf 7404577759999999986522478-430011-21045424789999987099658996317111001468999999999 Q gi|254780200|r 87 NESILRTHTRDQATNLAKRLESISS-IVVDWA-MRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQE 164 (343) Q Consensus 87 g~SPL~~~t~~qa~~L~~~L~~~~~-~~V~~a-mry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~ 164 (343) -||++....+... ++.+.+.+..+ -.|..| |-..+|+++++++.+..+|+++|+++|||=....-|+--.-.++... T Consensus 10 HGsr~p~~~~~~~-~~a~~~~~~~~~~~v~~~f~e~~~P~l~~~~~al~~~G~~~ivvVPlfl~~g~H~~~DIP~~L~~~ 88 (245) T COG2138 10 HGSRLPRGREVAE-AIAARLEERGDFPPVRVAFLELAEPSLREALQALVARGVDRIVVVPLFLAAGYHTKRDIPAELGLA 88 (245) T ss_pred CCCCCCCHHHHHH-HHHHHHHHHCCCCCCHHHHHHHCCCCHHHHHHHHHHCCCCEEEEEEHHHCCCCHHHCCCHHHHHHH T ss_conf 4888862789999-999999864599630247877538998999999986699819994442245731101518999876 Q ss_pred HHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHH--HHHHHHHHHHHHHC Q ss_conf 9871799864213345578006899999999999733899866999833675026762799888--99999999999726 Q gi|254780200|r 165 LIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGC--HCHKTARLLKEFLS 242 (343) Q Consensus 165 l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~--q~~~t~~~i~~~l~ 242 (343) -.+ .+...+. ++-.+|.-.+.+.+++.+.......+-+.+++..+| .+++.. ..+..++.+.+.. T Consensus 89 ~~~---~~~~~~~-p~G~~~~~~~~~~~r~~~~~~~~~~~~~~vv~~~~G--------s~~~~~~~~~~~va~~l~~~~- 155 (245) T COG2138 89 RQA---HPQVDLS-PLGTHPAVLDLLGQRLEDAGADEADDAERVVLEPRG--------SSDPIANAAVYRVARLLGEGT- 155 (245) T ss_pred HHC---CCCCCCC-CCCCCHHHHHHHHHHHHHHCCCCCCCCCEEEEECCC--------CCCCCCHHHHHHHHHHHHHCC- T ss_conf 635---8965566-567754799999999887305643442159996058--------776431257899999997066- Q ss_pred CCCCCCE---EEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 6756704---7862257776544302689999899769983999744054221255888758999999975997589816 Q gi|254780200|r 243 WPDDRFK---ICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVP 319 (343) Q Consensus 243 ~~~~~~~---l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp 319 (343) +.|. .+|+-- + +|.+.+....+. .+++ ++-|-|+.|.+.|.+....+. ...-...-.....+ T Consensus 156 ---~~~~~~~~~~~~~--~----~~~l~~~~~~~~---~~~~-vv~P~fL~~G~l~~~~~~~~~--~~~~~~~~~i~~~~ 220 (245) T COG2138 156 ---ASWKAVITLFTGV--A----EPGLAGETARLR---YRRV-VVLPYFLFDGLLTDRIRPEVE--LRLAVPEGEIALAR 220 (245) T ss_pred ---CCCEEEEEEECCC--C----CCCHHHHHHHCC---CCCE-EEEEHHHHCCHHHHHHHHHHH--HHHCCCCCCEEECC T ss_conf ---7731232311046--6----765243333114---6717-997766747535654577664--22036775257657 Q ss_pred CCCCCHHHHHHHHHHHHHHHCC Q ss_conf 8987989999999999998428 Q gi|254780200|r 320 CLNSSNLSIDLLEKITRRELMG 341 (343) Q Consensus 320 ~lN~~~~fi~~La~lv~~~l~g 341 (343) -++.+|...+++.+.+.+...+ T Consensus 221 ~lG~~p~l~~~~~~r~~~~~~~ 242 (245) T COG2138 221 PLGTHPRLADAVLDRVREARAG 242 (245) T ss_pred CCCCCHHHHHHHHHHHHHHHHH T ss_conf 5788878999999999998752 No 12 >PRK00923 sirohydrochlorin cobaltochelatase; Reviewed Probab=98.46 E-value=2.4e-06 Score=64.77 Aligned_cols=83 Identities=14% Similarity=0.283 Sum_probs=55.6 Q ss_pred CCCHHHH-HCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHH-----------HHCCCCCCCCCCC Q ss_conf 4300112-104542478999998709965899631711100146899999999998-----------7179986421334 Q gi|254780200|r 112 IVVDWAM-RYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELI-----------HMRWSPSLRTVPP 179 (343) Q Consensus 112 ~~V~~am-ry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~-----------~~~~~~~~~~I~~ 179 (343) ..|+.|+ -..+|+++++++++.++|+++|+++|+|=.-..-+. +.+-+.+. ......++++-++ T Consensus 36 ~~V~~afle~~~P~i~~~~~~l~~~G~~~ivvvP~fL~~G~H~~----~DIP~~l~~~~~~~~~~~~~~~~~v~i~~~~p 111 (130) T PRK00923 36 YIVEVGFMEFNEPTIPEALKKFAGTGVDKIIVVPVFLAHGVHTK----RDIPKILGLDEGEKEGEIEEDGKDVELVYAEP 111 (130) T ss_pred CEEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEEHHHCCCCCCC----HHHHHHHHHHHHHCCCCHHHCCCCEEEEECCC T ss_conf 82988776157898999999999669988999863016682422----00188987767516542221388527997788 Q ss_pred CCCCCHHHHHHHHHHHHHH Q ss_conf 5578006899999999999 Q gi|254780200|r 180 YYEDSDYISALAQSIREHF 198 (343) Q Consensus 180 ~~~~p~yI~a~~~~I~~~l 198 (343) ...||..++.+.++++++| T Consensus 112 lG~~p~l~dvl~~R~~EAL 130 (130) T PRK00923 112 IGADPRIADIVLDRANEAL 130 (130) T ss_pred CCCCHHHHHHHHHHHHHHC T ss_conf 8899799999999999759 No 13 >COG0276 HemH Protoheme ferro-lyase (ferrochelatase) [Coenzyme metabolism] Probab=98.29 E-value=0.0001 Score=53.53 Aligned_cols=156 Identities=21% Similarity=0.314 Sum_probs=100.2 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHH----HHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHH Q ss_conf 99999999987179986421334557800689999----99999997338998669998336750267627998889999 Q gi|254780200|r 157 AQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALA----QSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHK 232 (343) Q Consensus 157 ~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~----~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~ 232 (343) ..+++..-+......+.+.-++.+.-.| .-..++ +.++...+...+. .|=....++ T Consensus 18 ~~e~v~~yL~~~~~d~~v~~~~~~~~~~-l~~~I~~~R~~~~~~~Y~~igg~-------------------sPL~~~T~~ 77 (320) T COG0276 18 TLEDVRPYLKNFLSDRRVIELPRPLWYP-LAGIILPLRLKKVAKNYESIGGK-------------------SPLNVITRA 77 (320) T ss_pred CHHHHHHHHHHHHCCCCCCCCCHHHHHH-HHHHHHHHCCHHHHHHHHHHCCC-------------------CCCHHHHHH T ss_conf 7789999999983699877762455543-54331010018899999984688-------------------943999999 Q ss_pred HHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECC--HHHHCCHHHHHHHHHHHHHHHHHC Q ss_conf 999999972667567047862257776544302689999899769983999744--054221255888758999999975 Q gi|254780200|r 233 TARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITP--GFSSDCLETSYEIAHEAKEIFVNG 310 (343) Q Consensus 233 t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~--gFvsD~lETl~Eidie~~e~~~e~ 310 (343) -++++.++|+.+ ++.+.+--|.| .|++++++++|.++|+++++++|- -|.+--..+-.+ --.+.+-... T Consensus 78 q~~~L~~~L~~~--~~~V~~amry~-----~P~i~~~v~~l~~~gv~~iv~~pLyPqyS~sTt~s~~~--~~~~al~~~~ 148 (320) T COG0276 78 QAAALEERLDLP--DFKVYLAMRYG-----PPFIEEAVEELKKDGVERIVVLPLYPQYSSSTTGSYVD--ELARALKELR 148 (320) T ss_pred HHHHHHHHHCCC--CCCEEEEECCC-----CCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHH--HHHHHHHHCC T ss_conf 999999973799--86079763478-----98689999999971987599997776302301899999--9999998537 Q ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 9975898168987989999999999998428 Q gi|254780200|r 311 GGEKFTQVPCLNSSNLSIDLLEKITRRELMG 341 (343) Q Consensus 311 Gg~~~~rvp~lN~~~~fi~~La~lv~~~l~g 341 (343) +...+..|++.-++|.||+++++.|+++++. T Consensus 149 ~~~~i~~I~~~~~~p~yI~a~a~~I~~~~~~ 179 (320) T COG0276 149 GQPKISTIPDYYDEPLYIEALADSIREKLAK 179 (320) T ss_pred CCCCEEEECCCCCCHHHHHHHHHHHHHHHHH T ss_conf 8986688167557868999999999999974 No 14 >PRK02395 hypothetical protein; Provisional Probab=98.20 E-value=3e-05 Score=57.17 Aligned_cols=205 Identities=15% Similarity=0.138 Sum_probs=132.8 Q ss_pred CCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHH---------HHHHCCCCCCCCCCCCCCC Q ss_conf 3001121045424789999987099658996317111001468999999999---------9871799864213345578 Q gi|254780200|r 113 VVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQE---------LIHMRWSPSLRTVPPYYED 183 (343) Q Consensus 113 ~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~---------l~~~~~~~~~~~I~~~~~~ 183 (343) +|..|+---+|++.++|+.+. .++++++|||=-.-+.|-.-+=+++.=. +.......++++-.+...| T Consensus 37 eV~~~f~keeP~l~~vL~~v~---a~~V~VVPlfis~GYfT~~vIPreLgL~~~~~~~~~g~~~~~~g~~i~Y~~PvGth 113 (284) T PRK02395 37 EVREGFWKEEPSLRQVLRTVE---SDEVYVVPLFISEGYFTETVIPRELGLEHQGPVPTGGLVAVDGGKRVHYTGPVGTH 113 (284) T ss_pred HHHHHHHHCCCCHHHHHHHHC---CCCEEEEEEEECCCCCCCCCCCHHHCCCCCCCCCCCCCEEECCCCEEEECCCCCCC T ss_conf 899998725998899998706---78479974430454001440658656776777787773145089338965677888 Q ss_pred CHHHHHHHHHHHHHHHHCCC-CCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCC Q ss_conf 00689999999999973389-98669998336750267627998889999999999972667567047862257776544 Q gi|254780200|r 184 SDYISALAQSIREHFESIQW-TPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCL 262 (343) Q Consensus 184 p~yI~a~~~~I~~~l~~~~~-~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL 262 (343) |..-+.++.+.++.+.+... ....|+...||.-.. -.-..-++..+++|+++=. .......|-.. T Consensus 114 p~m~dvi~~RA~~v~~~~~~~~~t~L~lvgHGT~rn-----~ns~~ai~~~a~~ire~~~--FaEV~~~Fmde------- 179 (284) T PRK02395 114 PGMADVIAQRARDVTGDPEDADDTALLLVGHGTERN-----ENSAKAIYYHADRLRERGR--FAEVEAVFLDE------- 179 (284) T ss_pred CCHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCC-----CCHHHHHHHHHHHHHHCCC--CHHHHHHHHCC------- T ss_conf 237899999999852788888773599984577888-----6537889999999997487--18888876305------- Q ss_pred CHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHH-----HHH----CCCCEEEEECCCCCCHHHHHHHHH Q ss_conf 30268999989976998399974405422125588875899999-----997----599758981689879899999999 Q gi|254780200|r 263 EPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEI-----FVN----GGGEKFTQVPCLNSSNLSIDLLEK 333 (343) Q Consensus 263 ~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~-----~~e----~Gg~~~~rvp~lN~~~~fi~~La~ 333 (343) +|.+.+.-+.+. .++|+++| =|++|.+-|.+||-...--. +.. .-|....+...++++|...+.+.+ T Consensus 180 eP~v~~~~e~~~---~~~VVvVP-fFiAdG~Ht~edIP~~lGlt~~~~~~~~~P~~~~g~~i~ya~~vGt~p~~advile 255 (284) T PRK02395 180 EPEVDDWPDLVE---ADDVVVVP-LFVSEGWHTQEDIPEDMGLTDDYRTGPDSPTAVDGHTIWYAGPVGTEPLMADVILE 255 (284) T ss_pred CCCCCCHHHHCC---CCCEEEEE-EEECCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHH T ss_conf 875232677617---88679996-37526753344574651888787547889866367058971366775348999999 Q ss_pred HHHHH Q ss_conf 99998 Q gi|254780200|r 334 ITRRE 338 (343) Q Consensus 334 lv~~~ 338 (343) ++.+. T Consensus 256 ~a~~a 260 (284) T PRK02395 256 LAADA 260 (284) T ss_pred HHHHC T ss_conf 99881 No 15 >cd03414 CbiX_SirB_C Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), C-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both CbiX and SirB are found in a wide range of bacteria. Probab=98.17 E-value=0.00016 Score=52.13 Aligned_cols=115 Identities=20% Similarity=0.264 Sum_probs=61.3 Q ss_pred EEEEEECCCCHHHHHHCCC-CHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 6999833675026762799-888999999999997266756704786225777654430268999989976998399974 Q gi|254780200|r 207 MLLVSFHQMPVSYLLKGDP-YGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIIT 285 (343) Q Consensus 207 ~llfSaHglP~~~~~~gDp-Y~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p 285 (343) .||+-+||=- || ...++.+.++.+.++++.. ....||-.. -+|++.+.++++.++|.++|+|+| T Consensus 2 avllvgHGSr-------~p~a~~~~~~la~~l~~~~~~~--~v~~~fle~------~~P~l~e~l~~l~~~G~~~ivvvP 66 (117) T cd03414 2 AVVLVGRGSS-------DPDANADVAKIARLLEEGTGFA--RVETAFAAA------TRPSLPEALERLRALGARRVVVLP 66 (117) T ss_pred EEEEEECCCC-------CHHHHHHHHHHHHHHHHHCCCC--EEEEEEECC------CCCCHHHHHHHHHHCCCCEEEEEE T ss_conf 8999958999-------8789999999999999768998--188877536------799899999999976997599976 Q ss_pred CHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 405422125588875899999997599758981689879899999999999984 Q gi|254780200|r 286 PGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKITRREL 339 (343) Q Consensus 286 ~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv~~~l 339 (343) . |..... -..++.-+..+.. +.-+.++...+.|+.||..++.+++.+++.| T Consensus 67 ~-fL~~G~-~~~~i~~~~~~~~-~~p~~~i~~~~pLG~~p~l~~~l~~Ri~eal 117 (117) T cd03414 67 Y-LLFTGV-LMDRIEEQVAELA-AEPGIEFVLAPPLGPHPELAEALLERVREAL 117 (117) T ss_pred H-HHCCCC-HHHHHHHHHHHHH-HCCCCEEEECCCCCCCHHHHHHHHHHHHHHC T ss_conf 2-312784-2766799999998-4889718988889899899999999999769 No 16 >PRK00923 sirohydrochlorin cobaltochelatase; Reviewed Probab=98.11 E-value=0.0002 Score=51.46 Aligned_cols=119 Identities=15% Similarity=0.131 Sum_probs=85.2 Q ss_pred CEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 66999833675026762799888999999999997266756704786225777654430268999989976998399974 Q gi|254780200|r 206 EMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIIT 285 (343) Q Consensus 206 ~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p 285 (343) ..||+-+||-... .........++.++++.+. .....||-.. -+|++.+.++++.++|+++|+|+| T Consensus 4 ~alllvgHGSr~~------~~~~~~~~~a~~l~~~~~~--~~V~~afle~------~~P~i~~~~~~l~~~G~~~ivvvP 69 (130) T PRK00923 4 IGLLLVGHGSRLP------YNKEVVTEIAEKIKEKFPD--YIVEVGFMEF------NEPTIPEALKKFAGTGVDKIIVVP 69 (130) T ss_pred EEEEEEECCCCCH------HHHHHHHHHHHHHHHHCCC--CEEEEEEEEC------CCCCHHHHHHHHHHCCCCEEEEEE T ss_conf 6999997899976------7899999999999985899--8298877615------789899999999966998899986 Q ss_pred CHHHHCCHHHHHHHHHHHHH-------HHHHCC-CCEEEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 40542212558887589999-------999759-9758981689879899999999999984 Q gi|254780200|r 286 PGFSSDCLETSYEIAHEAKE-------IFVNGG-GEKFTQVPCLNSSNLSIDLLEKITRREL 339 (343) Q Consensus 286 ~gFvsD~lETl~Eidie~~e-------~~~e~G-g~~~~rvp~lN~~~~fi~~La~lv~~~l 339 (343) -|.++..=+..||=.+... .+.+.+ +..+.+.+.+++||..++.+++.+++.| T Consensus 70 -~fL~~G~H~~~DIP~~l~~~~~~~~~~~~~~~~~v~i~~~~plG~~p~l~dvl~~R~~EAL 130 (130) T PRK00923 70 -VFLAHGVHTKRDIPKILGLDEGEKEGEIEEDGKDVELVYAEPIGADPRIADIVLDRANEAL 130 (130) T ss_pred -HHHCCCCCCCHHHHHHHHHHHHHCCCCHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHC T ss_conf -3016682422001889877675165422213885279977888899799999999999759 No 17 >PRK12435 ferrochelatase; Provisional Probab=98.09 E-value=0.0002 Score=51.47 Aligned_cols=24 Identities=17% Similarity=0.211 Sum_probs=11.9 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 334557800689999999999973 Q gi|254780200|r 177 VPPYYEDSDYISALAQSIREHFES 200 (343) Q Consensus 177 I~~~~~~p~yI~a~~~~I~~~l~~ 200 (343) ++...++|.||+++++.|.+.+++ T Consensus 285 vp~lN~~p~fI~~La~lV~~~l~~ 308 (316) T PRK12435 285 PNMPNAQSTFIDCLATIVSRKMKE 308 (316) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 689999989999999999999864 No 18 >cd03409 Chelatase_Class_II Class II Chelatase: a family of ATP-independent monomeric or homodimeric enzymes that catalyze the insertion of metal into protoporphyrin rings. This family includes protoporphyrin IX ferrochelatase (HemH), sirohydrochlorin ferrochelatase (SirB) and the cobaltochelatases, CbiK and CbiX. HemH and SirB are involved in heme and siroheme biosynthesis, respectively, while the cobaltochelatases are associated with cobalamin biosynthesis. Excluded from this family are the ATP-dependent heterotrimeric chelatases (class I) and the multifunctional homodimeric enzymes with dehydrogenase and chelatase activities (class III). Probab=98.08 E-value=2e-05 Score=58.37 Aligned_cols=55 Identities=33% Similarity=0.452 Sum_probs=45.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHHC-CCCCHHHHHHHHHHCCCCEEEEEECCHH Q ss_conf 577759999999986522478430011210-4542478999998709965899631711 Q gi|254780200|r 91 LRTHTRDQATNLAKRLESISSIVVDWAMRY-GKPSVKEIINNLREEGCDRLLIFPLYPQ 148 (343) Q Consensus 91 L~~~t~~qa~~L~~~L~~~~~~~V~~amry-~~P~i~~~l~~l~~~g~~~ii~lPLyPq 148 (343) -+...+++++.++++++ +.+|+.|+.. ..|++++++++|.++|+++|+++|+|.. T Consensus 15 y~~~ie~~a~~i~~~~~---~~~v~~~f~e~~~P~i~eai~~l~~~G~~~ivvvP~~l~ 70 (101) T cd03409 15 YKKDIEAQAHNLAESLP---DFPYYVGFQSGLGPDTEEAIRELAEEGYQRVVIVPLAPV 70 (101) T ss_pred HHHHHHHHHHHHHHHCC---CCCEEHHHHHHCCCCHHHHHHHHHHCCCCEEEEEEEEEE T ss_conf 08999999999998879---982414379771999999999999769867999864300 No 19 >PRK00035 hemH ferrochelatase; Reviewed Probab=97.94 E-value=0.00047 Score=48.88 Aligned_cols=77 Identities=17% Similarity=0.342 Sum_probs=33.4 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHH---HHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 454247899999870996589963171110014689999---99999987179986421334557800689999999999 Q gi|254780200|r 121 GKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQD---KVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREH 197 (343) Q Consensus 121 ~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~---~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~ 197 (343) -.|++.+++++|.++|+++|+++|. -|.+=...+.+| +..+...+. ..-++..|+...++|.||+++++.|+++ T Consensus 242 L~P~t~~~l~~l~~~G~k~V~v~p~--gFvsD~lETl~Eidie~~e~~~~~-G~~~~~~vp~lN~~~~fi~~L~~lv~~~ 318 (325) T PRK00035 242 LEPYTDDTLEELAEKGVKKIVVVPP--GFVSDHLETLEEIDIEYREIAEEA-GGKEYRRIPCLNDSPEFIEALADLVLER 318 (325) T ss_pred CCCCHHHHHHHHHHCCCCEEEEECC--CCCCCCHHHHHHHHHHHHHHHHHC-CCCEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 8875899999988579975999898--204534465999889999999985-9955898079999989999999999998 Q ss_pred HHH Q ss_conf 973 Q gi|254780200|r 198 FES 200 (343) Q Consensus 198 l~~ 200 (343) ++. T Consensus 319 l~~ 321 (325) T PRK00035 319 LQG 321 (325) T ss_pred HCC T ss_conf 569 No 20 >pfam00762 Ferrochelatase Ferrochelatase. Probab=97.90 E-value=0.00045 Score=49.03 Aligned_cols=109 Identities=16% Similarity=0.253 Sum_probs=43.6 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECC--HHHHCCHHHHHHHHH Q ss_conf 998889999999999972667567047862257776544302689999899769983999744--054221255888758 Q gi|254780200|r 224 DPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITP--GFSSDCLETSYEIAH 301 (343) Q Consensus 224 DpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~--gFvsD~lETl~Eidi 301 (343) .|=......-++++.++|+-...++.+.+--|.|+ |+++++++++.++|+++++++|. -|.+=...+..|- T Consensus 63 SPl~~~t~~q~~~L~~~l~~~~~~~~V~~amry~~-----P~i~~~l~~l~~~g~~~ii~lplyPqyS~~Tt~s~~~~-- 135 (312) T pfam00762 63 SPLNVITRAQAEALQKRLGERGADVKVYLAMRYGP-----PSIEDALEELKADGVDRIVVLPLYPQYSASTTGSYLDE-- 135 (312) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCC-----CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHH-- T ss_conf 85899999999999999871499953898855689-----97788999999659876999977754230307699999-- Q ss_pred HHHHHHHH-CCCCEEEEECCCCCCHHHHHHHHHHHHHHHC Q ss_conf 99999997-5997589816898798999999999999842 Q gi|254780200|r 302 EAKEIFVN-GGGEKFTQVPCLNSSNLSIDLLEKITRRELM 340 (343) Q Consensus 302 e~~e~~~e-~Gg~~~~rvp~lN~~~~fi~~La~lv~~~l~ 340 (343) ..+...+ .....+..|+...++|.||+++++.|+++++ T Consensus 136 -~~~~~~~~~~~~~~~~I~~~~~~p~yi~a~~~~I~~~l~ 174 (312) T pfam00762 136 -LARALKKGRPAPEVRVIRRYYDHPGYIEALADSIREALA 174 (312) T ss_pred -HHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHHHHHHH T ss_conf -999998658998512706757898999999999999998 No 21 >cd03414 CbiX_SirB_C Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), C-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both CbiX and SirB are found in a wide range of bacteria. Probab=97.86 E-value=0.00019 Score=51.61 Aligned_cols=92 Identities=22% Similarity=0.362 Sum_probs=64.5 Q ss_pred HHHHHHHHHCCCCCCCHHHHH-CCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHC--CCCCCC Q ss_conf 999998652247843001121-04542478999998709965899631711100146899999999998717--998642 Q gi|254780200|r 99 ATNLAKRLESISSIVVDWAMR-YGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMR--WSPSLR 175 (343) Q Consensus 99 a~~L~~~L~~~~~~~V~~amr-y~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~--~~~~~~ 175 (343) ++.++++++ ...|+.|+- ...|+++++++++.+.|+++|+++|+|= ++ |...+.+.+++.+.. ...++. T Consensus 23 a~~l~~~~~---~~~v~~~fle~~~P~l~e~l~~l~~~G~~~ivvvP~fL-~~----G~~~~~i~~~~~~~~~~p~~~i~ 94 (117) T cd03414 23 ARLLEEGTG---FARVETAFAAATRPSLPEALERLRALGARRVVVLPYLL-FT----GVLMDRIEEQVAELAAEPGIEFV 94 (117) T ss_pred HHHHHHHCC---CCEEEEEEECCCCCCHHHHHHHHHHCCCCEEEEEEHHH-CC----CCHHHHHHHHHHHHHHCCCCEEE T ss_conf 999997689---98188877536799899999999976997599976231-27----84276679999999848897189 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 13345578006899999999999 Q gi|254780200|r 176 TVPPYYEDSDYISALAQSIREHF 198 (343) Q Consensus 176 ~I~~~~~~p~yI~a~~~~I~~~l 198 (343) +-++...||...+++.+++++++ T Consensus 95 ~~~pLG~~p~l~~~l~~Ri~eal 117 (117) T cd03414 95 LAPPLGPHPELAEALLERVREAL 117 (117) T ss_pred ECCCCCCCHHHHHHHHHHHHHHC T ss_conf 88889899899999999999769 No 22 >COG4822 CbiK Cobalamin biosynthesis protein CbiK, Co2+ chelatase [Coenzyme metabolism] Probab=97.85 E-value=0.0041 Score=42.30 Aligned_cols=205 Identities=17% Similarity=0.249 Sum_probs=128.4 Q ss_pred HHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCC-CCCCC-CC Q ss_conf 99865224784300112104542478999998709965899631711100146899999999998717998-64213-34 Q gi|254780200|r 102 LAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSP-SLRTV-PP 179 (343) Q Consensus 102 L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~-~~~~I-~~ 179 (343) +-++|.++++.. ...| -.+|.+|+++|.+++++=||+= --|--++.+-+...+..... ++..- +- T Consensus 47 IIkkLK~rdgi~------~dTP--~~aL~klk~~gy~eviiQ~lhi-----IpG~EyEklvr~V~~~~~dF~~lkig~Pl 113 (265) T COG4822 47 IIKKLKERDGID------FDTP--IQALNKLKDQGYEEVIIQPLHI-----IPGIEYEKLVREVNKYSNDFKRLKIGRPL 113 (265) T ss_pred HHHHHHHHCCCC------CCCH--HHHHHHHHHCCCHHEEEEEEEE-----CCCHHHHHHHHHHHHHHHHHHEEECCCCE T ss_conf 999988626942------2888--9999999971513125333221-----47267999999999876330314617731 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCC Q ss_conf 55780068999999999997338998669998336750267627998889999999999972667567047862257776 Q gi|254780200|r 180 YYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRI 259 (343) Q Consensus 180 ~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~ 259 (343) .|....|- -+.+.|++..... .+.+.++|--||.-- -...|+..-+.+...-|..+ ..+| .+ . T Consensus 114 Ly~k~DYe-~~v~aik~~~ppl-~k~e~~vlmgHGt~h--------~s~~~YacLd~~~~~~~f~~--v~v~---~v--e 176 (265) T COG4822 114 LYYKNDYE-ICVEAIKDQIPPL-NKDEILVLMGHGTDH--------HSNAAYACLDHVLDEYGFDN--VFVA---AV--E 176 (265) T ss_pred EECHHHHH-HHHHHHHHHCCCC-CCCEEEEEEECCCCC--------CHHHHHHHHHHHHHHCCCCC--EEEE---EE--C T ss_conf 54010599-9999999766886-767279998358875--------17789999999998648885--6999---82--4 Q ss_pred CCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHH----HHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHH Q ss_conf 54430268999989976998399974405422125588875----89999999759975898168987989999999999 Q gi|254780200|r 260 KCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIA----HEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKIT 335 (343) Q Consensus 260 ~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eid----ie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv 335 (343) -. |.++++++.|.+.|++.|-++|.=||+-.--+ -|+. ...++.+..+|.+.-.+.--|+..|+|-+...+-+ T Consensus 177 ~y--P~~d~vi~~l~~~~~~~v~L~PlMlvAG~Ha~-nDMasddedswk~il~~~G~~v~~~l~GLGE~~~iq~ifi~Hi 253 (265) T COG4822 177 GY--PLVDTVIEYLRKNGIKEVHLIPLMLVAGDHAK-NDMASDDEDSWKNILEKNGFKVEVYLHGLGENPAIQAIFIDHI 253 (265) T ss_pred CC--CCHHHHHHHHHHCCCCEEEEEEEEEEECHHHH-HHHCCCCHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHH T ss_conf 78--75899999999768764888656886040223-2111263478999998679546887605777678999999999 Q ss_pred HHHH Q ss_conf 9984 Q gi|254780200|r 336 RREL 339 (343) Q Consensus 336 ~~~l 339 (343) +..+ T Consensus 254 k~ai 257 (265) T COG4822 254 KDAI 257 (265) T ss_pred HHHH T ss_conf 9998 No 23 >cd03415 CbiX_CbiC Archaeal sirohydrochlorin cobalt chelatase (CbiX) single domain. Proteins in this subgroup contain a single CbiX domain N-terminal to a precorrin-8X methylmutase (CbiC) domain. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, while CbiC catalyzes the conversion of cobalt-precorrin 8 to cobyrinic acid by methyl rearrangement. Both CbiX and CbiC are involved in vitamin B12 biosynthesis. Probab=97.48 E-value=0.0014 Score=45.58 Aligned_cols=114 Identities=18% Similarity=0.135 Sum_probs=65.1 Q ss_pred EEEEEECCCCHHHHHHCCCC-HHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 69998336750267627998-88999999999997266756704786225777654430268999989976998399974 Q gi|254780200|r 207 MLLVSFHQMPVSYLLKGDPY-GCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIIT 285 (343) Q Consensus 207 ~llfSaHglP~~~~~~gDpY-~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p 285 (343) .+|+.+||== ++| ....++.++.+.++++.+ ..++| .+--+|++.+.+.++.++|+++|+|+| T Consensus 2 AilLv~HGSR-------~~~~~e~~~~la~~v~e~~~~~---v~~~f------mElaePsi~e~l~~~v~~G~~~IiVvP 65 (125) T cd03415 2 AIIIITHGSR-------RNTFNEDMEEWAAYLERKLGVP---VYLTY------NEYAEPNWRDLLNELLSEGYGHIIIAL 65 (125) T ss_pred EEEEEECCCC-------CHHHHHHHHHHHHHHHHHCCCC---EEEEE------ECCCCCCHHHHHHHHHHCCCCCEEEEE T ss_conf 6999946789-------5778999999999999861995---58755------110699889999999984998199995 Q ss_pred CHHHHCCHHHHHHHHHHHHHHH------HHCCCCE--EEEECCCCCCHHHHHHHHHHHHH Q ss_conf 4054221255888758999999------9759975--89816898798999999999999 Q gi|254780200|r 286 PGFSSDCLETSYEIAHEAKEIF------VNGGGEK--FTQVPCLNSSNLSIDLLEKITRR 337 (343) Q Consensus 286 ~gFvsD~lETl~Eidie~~e~~------~e~Gg~~--~~rvp~lN~~~~fi~~La~lv~~ 337 (343) . |.+...-+..||=-+.-..- ...+|+. ..+.+.+.+||.-.++|...|++ T Consensus 66 ~-FL~~G~Hv~~DIP~~lg~~~~~~~~~~~~~gk~v~~~~a~Plg~~plv~~~l~~Rv~~ 124 (125) T cd03415 66 A-FLGRGNHVARDIMGELGVSRFYKWVMSKYGGKEILVYVTEPLADSPLVKLALFYRVKR 124 (125) T ss_pred E-EECCCCCHHHCCHHHHCCCCCCCEEEEEECCCEEEEEECCCCCCCHHHHHHHHHHHHC T ss_conf 3-2068865010265450888567435786269447899848989888899999999861 No 24 >pfam01903 CbiX CbiX. The function of CbiX is uncertain, however it is found in cobalamin biosynthesis operons and so may have a related function. Some CbiX proteins contain a striking histidine-rich region at their C-terminus, which suggests that it might be involved in metal chelation. Probab=97.47 E-value=0.0023 Score=44.08 Aligned_cols=97 Identities=21% Similarity=0.241 Sum_probs=54.9 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHH Q ss_conf 88999999999997266756704786225777654430268999989976998399974405422125588875899999 Q gi|254780200|r 227 GCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEI 306 (343) Q Consensus 227 ~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~ 306 (343) ...+...++.+.+..+. .....||-.. -+|++.+.+++|.++|.++|+|+|. |.+...-+..||--+..+. T Consensus 9 ~~~~~~l~~~l~~~~~~--~~V~~~fle~------~~P~l~~~~~~l~~~G~~~ivvvP~-fL~~G~H~~~DIp~~l~~~ 79 (106) T pfam01903 9 NEDFEELARLLREKLGF--VPVEVGFLEL------AEPSLPEALRELVAQGARRIVVVPL-FLFAGGHVKRDIPEELAEA 79 (106) T ss_pred HHHHHHHHHHHHHHCCC--CEEEEEEEEC------CCCCHHHHHHHHHHCCCCEEEEEEE-EEECCCCHHHHHHHHHHHH T ss_conf 99999999999965799--8499998844------7898999999999669964999875-7405620176899999999 Q ss_pred HHHCCCCEEEEECCCCCCHHHHHHHH Q ss_conf 99759975898168987989999999 Q gi|254780200|r 307 FVNGGGEKFTQVPCLNSSNLSIDLLE 332 (343) Q Consensus 307 ~~e~Gg~~~~rvp~lN~~~~fi~~La 332 (343) ..+..+.++.+.+.|+.||..++.|+ T Consensus 80 ~~~~p~i~i~~~~~LG~~p~l~~ll~ 105 (106) T pfam01903 80 KAAHPGIEVHYGRPLGPHPLLAELLA 105 (106) T ss_pred HHHCCCCEEEECCCCCCCHHHHHHHC T ss_conf 98889966997888889979999971 No 25 >pfam01903 CbiX CbiX. The function of CbiX is uncertain, however it is found in cobalamin biosynthesis operons and so may have a related function. Some CbiX proteins contain a striking histidine-rich region at their C-terminus, which suggests that it might be involved in metal chelation. Probab=97.41 E-value=0.0012 Score=45.93 Aligned_cols=91 Identities=16% Similarity=0.210 Sum_probs=59.8 Q ss_pred HHHHHHHCCC-CCCCHHHHH-CCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 9998652247-843001121-04542478999998709965899631711100146899999999998717998642133 Q gi|254780200|r 101 NLAKRLESIS-SIVVDWAMR-YGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVP 178 (343) Q Consensus 101 ~L~~~L~~~~-~~~V~~amr-y~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~ 178 (343) .+.+.+.+.. ...|..|+- ...|++++++++|.+.|+++|+++|+|=.-..-+....-+.+.+ ++......++.+-+ T Consensus 14 ~l~~~l~~~~~~~~V~~~fle~~~P~l~~~~~~l~~~G~~~ivvvP~fL~~G~H~~~DIp~~l~~-~~~~~p~i~i~~~~ 92 (106) T pfam01903 14 ELARLLREKLGFVPVEVGFLELAEPSLPEALRELVAQGARRIVVVPLFLFAGGHVKRDIPEELAE-AKAAHPGIEVHYGR 92 (106) T ss_pred HHHHHHHHHCCCCEEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEEEEEECCCCHHHHHHHHHHH-HHHHCCCCEEEECC T ss_conf 99999996579984999988447898999999999669964999875740562017689999999-99888996699788 Q ss_pred CCCCCCHHHHHHHH Q ss_conf 45578006899999 Q gi|254780200|r 179 PYYEDSDYISALAQ 192 (343) Q Consensus 179 ~~~~~p~yI~a~~~ 192 (343) +...||..++++++ T Consensus 93 ~LG~~p~l~~ll~~ 106 (106) T pfam01903 93 PLGPHPLLAELLAE 106 (106) T ss_pred CCCCCHHHHHHHCC T ss_conf 88899799999719 No 26 >TIGR00109 hemH ferrochelatase; InterPro: IPR001015 Synonym(s): Protoheme ferro-lyase, Iron chelatase, etc. Ferrochelatase catalyzes the last step in heme biosynthesis: the chelation of a ferrous ion to proto-porphyrin IX, to form protoheme , . In eukaryotic cells, it binds to the mitochondrial inner membrane with its active site on the matrix side of the membrane. The X-ray structure of Bacillus subtilis and human ferrochelatase have been solved , . The human enzyme exists as a homodimer. Each subunit contains one [Fe2S2] cluster. The monomer is folded into two similar domains, each with a four-stranded parallel beta-sheet flanked by an alpha-helix in a beta-alpha-beta motif that is reminiscent of the fold found in the periplasmic binding proteins. The topological similarity between the domains suggests that they have arisen from a gene duplication event. However, significant differences exist between the two domains, including an N-terminal section (residues 80-130) that forms part of the active site pocket, and a C-terminal extension (residues 390-423) that is involved in coordination of the [Fe2S2]cluster and in stabilisation of the homodimer. The [Fe2S2] cluster ligands are Cys196, Cys403, Cys406 and Cys411. The experiments with Co(II) binding show that His230 and Asp383 are part of the enzyme active site . Ferrochelatase seems to have a structurally conserved core region that is common to the enzyme from bacteria, plants and mammals. Porphyrin binds in the identified cleft; this cleft also includes the metal-binding site of the enzyme. It is likely that the structure of the cleft region will have different conformations upon substrate binding and release .; GO: 0004325 ferrochelatase activity, 0006783 heme biosynthetic process. Probab=97.13 E-value=0.01 Score=39.60 Aligned_cols=160 Identities=21% Similarity=0.237 Sum_probs=103.2 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCH---HHHHHHH-HHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHH Q ss_conf 99999999987179986421334557800---6899999-9999997338998669998336750267627998889999 Q gi|254780200|r 157 AQDKVFQELIHMRWSPSLRTVPPYYEDSD---YISALAQ-SIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHK 232 (343) Q Consensus 157 ~~~~~~~~l~~~~~~~~~~~I~~~~~~p~---yI~a~~~-~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~ 232 (343) ..+++.+-+....-++.+--|+.-+-.+. +|.-+.. +|+.-++... .+.|.|=..-.+. T Consensus 20 ~~e~v~~fL~~~f~Dp~~~~~~~~~~~K~~~~~I~~lR~~k~~K~Y~~iG-----------------~EGgSPL~~it~~ 82 (339) T TIGR00109 20 KLEEVERFLKQLFADPRIIDISRAKWRKLLAKMILPLRSKKIAKNYEEIG-----------------TEGGSPLLQITEQ 82 (339) T ss_pred CHHHHHHHHHHHCCCCCEECCCCHHCCCCHHHHHHHHHHHHHHHHHHHHC-----------------CCCCCHHHHHHHH T ss_conf 51226899997517830011651200642899976521088999999727-----------------7788714689999 Q ss_pred HHHHHHHHHC--CCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEC--CHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 9999999726--6756704786225777654430268999989976998399974--40542212558887589999999 Q gi|254780200|r 233 TARLLKEFLS--WPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIIT--PGFSSDCLETSYEIAHEAKEIFV 308 (343) Q Consensus 233 t~~~i~~~l~--~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p--~gFvsD~lETl~Eidie~~e~~~ 308 (343) -+.++.++|. ++. ....+-..| .+...|++.+++++|.++|+++++++| |-|-+=-.--- .+ ++.+... T Consensus 83 Q~~~L~~~L~~~~~~---~~~~~~~~a-m~Y~~P~~~~av~~l~~d~v~~~V~l~lYPhfS~sTtgS~--f~-~l~~~~~ 155 (339) T TIGR00109 83 QAEALEKRLDKEIQE---EIDAKVYIA-MRYGEPFTEEAVKELLKDGVERLVVLPLYPHFSSSTTGSS--FN-ELAEALK 155 (339) T ss_pred HHHHHHHHCCCCCCC---CCCCCEEEE-EECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHH--HH-HHHHHHH T ss_conf 999998624422435---687330453-2127821899999986454648888514873000127889--99-9999998 Q ss_pred HCCCC---EEEEECCCCCCHHHHHHHHHHHHHHHC Q ss_conf 75997---589816898798999999999999842 Q gi|254780200|r 309 NGGGE---KFTQVPCLNSSNLSIDLLEKITRRELM 340 (343) Q Consensus 309 e~Gg~---~~~rvp~lN~~~~fi~~La~lv~~~l~ 340 (343) +.++. .+..|+.-.+++.||.+++|.|++.|+ T Consensus 156 ~~~slR~~~~~~I~~w~~~~~y~~a~~d~i~~~l~ 190 (339) T TIGR00109 156 KLRSLRSLKISVIESWYDNEKYIKALADSIKETLK 190 (339) T ss_pred HCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHH T ss_conf 34896576314751445688799999999999984 No 27 >cd03415 CbiX_CbiC Archaeal sirohydrochlorin cobalt chelatase (CbiX) single domain. Proteins in this subgroup contain a single CbiX domain N-terminal to a precorrin-8X methylmutase (CbiC) domain. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, while CbiC catalyzes the conversion of cobalt-precorrin 8 to cobyrinic acid by methyl rearrangement. Both CbiX and CbiC are involved in vitamin B12 biosynthesis. Probab=97.07 E-value=0.00082 Score=47.18 Aligned_cols=93 Identities=17% Similarity=0.243 Sum_probs=61.8 Q ss_pred HHHHHHHHHHHCCCCCCCHHH-HHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHH------- Q ss_conf 999999986522478430011-210454247899999870996589963171110014689999999999871------- Q gi|254780200|r 97 DQATNLAKRLESISSIVVDWA-MRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHM------- 168 (343) Q Consensus 97 ~qa~~L~~~L~~~~~~~V~~a-mry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~------- 168 (343) ..++.+++++ +.+|+.| |-+.+|++++++.++.++|+++|+++|+|= +.. .-..+++-+.+.-. T Consensus 21 ~la~~v~e~~----~~~v~~~fmElaePsi~e~l~~~v~~G~~~IiVvP~FL--~~G--~Hv~~DIP~~lg~~~~~~~~~ 92 (125) T cd03415 21 EWAAYLERKL----GVPVYLTYNEYAEPNWRDLLNELLSEGYGHIIIALAFL--GRG--NHVARDIMGELGVSRFYKWVM 92 (125) T ss_pred HHHHHHHHHC----CCCEEEEEECCCCCCHHHHHHHHHHCCCCCEEEEEEEE--CCC--CCHHHCCHHHHCCCCCCCEEE T ss_conf 9999999861----99558755110699889999999984998199995320--688--650102654508885674357 Q ss_pred ----CCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf ----79986421334557800689999999999 Q gi|254780200|r 169 ----RWSPSLRTVPPYYEDSDYISALAQSIREH 197 (343) Q Consensus 169 ----~~~~~~~~I~~~~~~p~yI~a~~~~I~~~ 197 (343) +....+-+-++..++|.--+++..+|+++ T Consensus 93 ~~~~gk~v~~~~a~Plg~~plv~~~l~~Rv~~a 125 (125) T cd03415 93 SKYGGKEILVYVTEPLADSPLVKLALFYRVKRA 125 (125) T ss_pred EEECCCEEEEEECCCCCCCHHHHHHHHHHHHCC T ss_conf 862694478998489898888999999998619 No 28 >PRK05782 precorrin-8X methylmutase; Validated Probab=96.91 E-value=0.021 Score=37.45 Aligned_cols=98 Identities=18% Similarity=0.217 Sum_probs=61.1 Q ss_pred HHHHHHHHHHHHCCCCCCCH-HHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHH-------- Q ss_conf 99999999865224784300-112104542478999998709965899631711100146899999999998-------- Q gi|254780200|r 96 RDQATNLAKRLESISSIVVD-WAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELI-------- 166 (343) Q Consensus 96 ~~qa~~L~~~L~~~~~~~V~-~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~-------- 166 (343) +.++..+++.|. ++++ .-|-|.+|..++.+.++.+.|.++|+++|+|=- -.. -+.+++..++. T Consensus 23 ~~la~~~~e~lG----~~~~~~y~e~a~P~l~~gl~~l~~~g~~~ivv~p~FL~-~G~---HV~rDI~g~~g~~~~~~~~ 94 (332) T PRK05782 23 ENLAAYLEEKLG----VPAYLTYNEYAEPNWRSLLNDLLREGYDDVVIALAFLG-RGN---HVARDIMGELGVSEFGRWV 94 (332) T ss_pred HHHHHHHHHHCC----CCEEEEECCCCCCCHHHHHHHHHHHCCCCEEEEEEECC-CCC---CHHHHHHHHHCCCHHHHHH T ss_conf 999999997449----84686330026997799999999816786799985506-886---0043424452752466677 Q ss_pred -H--HCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf -7--1799864213345578006899999999999733 Q gi|254780200|r 167 -H--MRWSPSLRTVPPYYEDSDYISALAQSIREHFESI 201 (343) Q Consensus 167 -~--~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~ 201 (343) . .+...++-+-++-.++|.--.++..+++.++... T Consensus 95 ~~~~~g~~~~~~~t~~l~~~~~v~~~~~~R~~~a~~~~ 132 (332) T PRK05782 95 ESKYFGKRVELYVTEPLGDSPLVKLALLYRIKRALGAG 132 (332) T ss_pred HHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 75303743699976888987389999999998602688 No 29 >PRK05782 precorrin-8X methylmutase; Validated Probab=96.84 E-value=0.057 Score=34.39 Aligned_cols=59 Identities=22% Similarity=0.349 Sum_probs=34.0 Q ss_pred HHHHHHHHHHHHCCCCCEEE--ECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHH Q ss_conf 02689999899769983999--744054221255888758999999975997589816898798999999 Q gi|254780200|r 264 PPTDKTVEKLAHDGIKSLAI--ITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLL 331 (343) Q Consensus 264 P~t~~~l~~L~~~G~k~v~v--~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~L 331 (343) |.---.|-++.++|.+.=+| +|+|||. -.|.||...+.+.-.....-.-+.|+--++.+ T Consensus 263 PTAL~~ll~li~eg~~~pLVIG~PVGFVg---------AaESKe~L~~~~vP~I~~~GrKGGS~vAaAiv 323 (332) T PRK05782 263 PTALAEALKIWKEGADIPFIVATPPGFTN---------ASEVKEELIKSGIPAVVVRGTYGGSGIAVAII 323 (332) T ss_pred HHHHHHHHHHHHCCCCCCEEEECCCCCCC---------HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHH T ss_conf 58999999999759999849978976467---------49999999868999799816987699999999 No 30 >pfam06180 CbiK Cobalt chelatase (CbiK). This family consists of several bacterial cobalt chelatase (CbiK) proteins (EC:4.99.1.-). Probab=96.81 E-value=0.0087 Score=40.05 Aligned_cols=32 Identities=19% Similarity=0.238 Sum_probs=20.9 Q ss_pred EEEEEECCCCCCCC----CHHHHHHHHHHHCCCHHH Q ss_conf 79999717888999----988999999987489544 Q gi|254780200|r 19 IGVLLVNLGTPDGH----DFFSLRRYLREFLLDKRV 50 (343) Q Consensus 19 ~gvLL~n~G~P~~~----~~~~V~~yL~~fl~D~~V 50 (343) +||||+.+||--.- +...++.-+++=|-|-.| T Consensus 1 KAILlvsFGTs~~~~r~~ti~~ie~~v~~afP~~~V 36 (256) T pfam06180 1 KAILVVSFGTSYKDTREKTIDKIEKKVAAEFPDYDV 36 (256) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCEE T ss_conf 979999788995678999999999999998889818 No 31 >cd03416 CbiX_SirB_N Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), N-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both are found in a wide range of bacteria. This subgroup also contains single domain proteins from archaea and bacteria which may represent the ancestral form of class II chelatases before domain duplication occurred. Probab=96.76 E-value=0.04 Score=35.46 Aligned_cols=57 Identities=16% Similarity=0.223 Sum_probs=23.5 Q ss_pred HHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 0268999989976998399974405422125588875899999997599758981689 Q gi|254780200|r 264 PPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCL 321 (343) Q Consensus 264 P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~l 321 (343) |++.+.++++.++|.++|+|+| -|.+...=+..||=-+..+...+..+.++.+.++| T Consensus 44 P~l~~~l~~l~~~G~~~ivvvP-~fL~~G~H~~~DIP~~l~~~~~~~p~~~~~~~~~l 100 (101) T cd03416 44 PSLAEALDELAAQGATRIVVVP-LFLLAGGHVKEDIPAALAAARARHPGVRIRYAPPL 100 (101) T ss_pred CCHHHHHHHHHHCCCCEEEEEE-EEECCCCCHHHHHHHHHHHHHHHCCCCEEEECCCC T ss_conf 9899999999966998599966-14135743076899999999988899589966999 No 32 >KOG1321 consensus Probab=96.39 E-value=0.056 Score=34.40 Aligned_cols=108 Identities=17% Similarity=0.266 Sum_probs=70.9 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCC----HHHHH-CCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHH--HHHHH- Q ss_conf 04577759999999986522478430----01121-045424789999987099658996317111001468--99999- Q gi|254780200|r 89 SILRTHTRDQATNLAKRLESISSIVV----DWAMR-YGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTG--TAQDK- 160 (343) Q Consensus 89 SPL~~~t~~qa~~L~~~L~~~~~~~V----~~amr-y~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtg--S~~~~- 160 (343) -|-.....+-..++-++|+...++.- .+|-. .-.|..+++++.|.+.|..+++++|.. +|+. ..+++ T Consensus 246 DpY~~Ei~atv~~iMeeL~~~N~y~lawQSkVGP~pWL~p~Tde~i~~lgk~g~knll~VPIa-----FvSeHIETL~Ei 320 (395) T KOG1321 246 DPYPAEIAATVDLIMEELKYKNPYRLAWQSKVGPLPWLGPATDEVIEGLGKKGVKNLLLVPIA-----FVSEHIETLHEI 320 (395) T ss_pred CCCHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEEH-----HHHHHHHHHHHH T ss_conf 975789999999999996137930243430367754456005899999876122445897521-----656778789984 Q ss_pred ---HHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC Q ss_conf ---999998717998642133455780068999999999997338 Q gi|254780200|r 161 ---VFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQ 202 (343) Q Consensus 161 ---~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~ 202 (343) +.+.+.+.+ .-+++-+.....+|.||+++++.+.++|+... T Consensus 321 D~ey~e~a~k~g-ve~~~Rv~sln~~p~fI~~lADlV~ehL~s~~ 364 (395) T KOG1321 321 DIEYIEEALKKG-VENWKRVESLNGNPTFIEGLADLVAEHLKSNQ 364 (395) T ss_pred HHHHHHHHHHHH-HHHHEECCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 189999999976-54010146889982189989999998621312 No 33 >cd03416 CbiX_SirB_N Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), N-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both are found in a wide range of bacteria. This subgroup also contains single domain proteins from archaea and bacteria which may represent the ancestral form of class II chelatases before domain duplication occurred. Probab=96.21 E-value=0.037 Score=35.69 Aligned_cols=50 Identities=28% Similarity=0.488 Sum_probs=36.0 Q ss_pred HHHHHHHHHHCCCCCCCHHHHH-CCCCCHHHHHHHHHHCCCCEEEEEECCHHHH Q ss_conf 9999998652247843001121-0454247899999870996589963171110 Q gi|254780200|r 98 QATNLAKRLESISSIVVDWAMR-YGKPSVKEIINNLREEGCDRLLIFPLYPQYS 150 (343) Q Consensus 98 qa~~L~~~L~~~~~~~V~~amr-y~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS 150 (343) .++.+++++. ...|..|+- ...|++++++.++.++|+++|+++|+|=.-. T Consensus 21 l~~~l~~~~~---~~~V~~afle~~~P~l~~~l~~l~~~G~~~ivvvP~fL~~G 71 (101) T cd03416 21 LAERLRERLP---GDEVELAFLELAEPSLAEALDELAAQGATRIVVVPLFLLAG 71 (101) T ss_pred HHHHHHHHCC---CCEEEEEEEECCCCCHHHHHHHHHHCCCCEEEEEEEEECCC T ss_conf 9999997689---98698665404799899999999966998599966141357 No 34 >COG2138 Sirohydrochlorin ferrochelatase [Inorganic ion transport and metabolism] Probab=96.06 E-value=0.072 Score=33.64 Aligned_cols=24 Identities=17% Similarity=-0.010 Sum_probs=15.2 Q ss_pred CEEEEEECCCCCCCCCHHHHHHHH Q ss_conf 679999717888999988999999 Q gi|254780200|r 18 KIGVLLVNLGTPDGHDFFSLRRYL 41 (343) Q Consensus 18 K~gvLL~n~G~P~~~~~~~V~~yL 41 (343) ..++|++..||+.......+..+. T Consensus 2 ~~~~llvgHGsr~p~~~~~~~~~a 25 (245) T COG2138 2 MPALLLVGHGSRLPRGREVAEAIA 25 (245) T ss_pred CCCEEEEECCCCCCCHHHHHHHHH T ss_conf 724001004888862789999999 No 35 >cd03411 Ferrochelatase_N Ferrochelatase, N-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus. Probab=95.88 E-value=0.18 Score=30.81 Aligned_cols=95 Identities=16% Similarity=0.310 Sum_probs=60.5 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEC--CHHHHCCHHHHHHHH Q ss_conf 799888999999999997266756704786225777654430268999989976998399974--405422125588875 Q gi|254780200|r 223 GDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIIT--PGFSSDCLETSYEIA 300 (343) Q Consensus 223 gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p--~gFvsD~lETl~Eid 300 (343) |.|=....+.-++.+.+.|+-...++.+-+--|.| .|++.|+++++.++|+++|+++| |-|.+-...+..|- T Consensus 63 ~SPL~~~t~~q~~~L~~~L~~~~~~~~V~~amry~-----~P~i~~~l~~l~~~g~~~iv~lPlyPqyS~sTt~S~~~~- 136 (159) T cd03411 63 GSPLNEITRAQAEALEKALDERGIDVKVYLAMRYG-----PPSIEEALEELKADGVDRIVVLPLYPQYSASTTGSYLDE- 136 (159) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEECC-----CCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHH- T ss_conf 88789999999999999987328995189874327-----998999999998548884999979874131448999999- Q ss_pred HHHHHHHHHCCCCEEEEECCCCCC Q ss_conf 899999997599758981689879 Q gi|254780200|r 301 HEAKEIFVNGGGEKFTQVPCLNSS 324 (343) Q Consensus 301 ie~~e~~~e~Gg~~~~rvp~lN~~ 324 (343) -.+........-.+..|++..+| T Consensus 137 -~~~~~~~~~~~~~i~~I~~f~d~ 159 (159) T cd03411 137 -VERALKKLRPAPELRVIRSFYDH 159 (159) T ss_pred -HHHHHHHCCCCCEEEEECCCCCC T ss_conf -99999964899848982886788 No 36 >cd03413 CbiK_C Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), C-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases, and is a homomeric enzyme that does not require ATP for its enzymatic activity. Probab=94.93 E-value=0.038 Score=35.63 Aligned_cols=44 Identities=20% Similarity=0.366 Sum_probs=24.4 Q ss_pred HHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEEC Q ss_conf 99865224784300112104542478999998709965899631 Q gi|254780200|r 102 LAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPL 145 (343) Q Consensus 102 L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPL 145 (343) |+..|.+.+.-.|++|.--+.|.+++++++|++.|+.++.++|+ T Consensus 21 l~~~l~~~~~~nv~vgtvEg~P~~e~vl~~L~~~g~k~V~L~Pl 64 (103) T cd03413 21 LEYVLREEDPANVFVGTVEGYPGLDDVLAKLKKAGIKKVTLMPL 64 (103) T ss_pred HHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHCCCCEEEEECH T ss_conf 99999964998789999578589999999999769986999610 No 37 >cd03412 CbiK_N Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), N-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases and is a homomeric enzyme that does not require ATP for its enzymatic activity. Probab=94.44 E-value=0.5 Score=27.79 Aligned_cols=43 Identities=16% Similarity=0.304 Sum_probs=31.7 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 542478999998709965899631711100146899999999998717 Q gi|254780200|r 122 KPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMR 169 (343) Q Consensus 122 ~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~ 169 (343) -++..++|++|.++|+++|++.|++ ..-|.-++.+.+.+.+.. T Consensus 55 i~~p~eaL~~L~~~G~~~V~VQslh-----ii~G~Ey~~l~~~v~~~~ 97 (127) T cd03412 55 VDTPEEALAKLAADGYTEVIVQSLH-----IIPGEEYEKLKREVDAFK 97 (127) T ss_pred CCCHHHHHHHHHHCCCCEEEEEECC-----EECCHHHHHHHHHHHHHH T ss_conf 9999999999998799889994463-----667587999999999974 No 38 >cd03412 CbiK_N Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), N-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases and is a homomeric enzyme that does not require ATP for its enzymatic activity. Probab=94.11 E-value=0.59 Score=27.32 Aligned_cols=107 Identities=19% Similarity=0.205 Sum_probs=65.5 Q ss_pred HHHHHHHHHHHHH--CCCCCCCEEEEECCCCCCCC-----CCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHH Q ss_conf 9999999999972--66756704786225777654-----4302689999899769983999744054221255888758 Q gi|254780200|r 229 HCHKTARLLKEFL--SWPDDRFKICFQSRFGRIKC-----LEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAH 301 (343) Q Consensus 229 q~~~t~~~i~~~l--~~~~~~~~l~fQSr~Gp~~W-----L~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidi 301 (343) +..+|...|.+.+ ..|+-+...||-|++=..+| --|+..++|++|..+|+++|+|.|- .+....|=- +|-- T Consensus 14 ~~~~ti~~ie~~v~~afPd~~v~~AfTS~~I~~kl~~~g~~i~~p~eaL~~L~~~G~~~V~VQsl-hii~G~Ey~-~l~~ 91 (127) T cd03412 14 TAEKTIDAIEDKVRAAFPDYEVRWAFTSRMIRKKLKKRGIEVDTPEEALAKLAADGYTEVIVQSL-HIIPGEEYE-KLKR 91 (127) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCEEEEEEC-CEECCHHHH-HHHH T ss_conf 89999999999999988898499642289999999986999999999999999879988999446-366758799-9999 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 99999997599758981689879899999999999984 Q gi|254780200|r 302 EAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKITRREL 339 (343) Q Consensus 302 e~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv~~~l 339 (343) +... .+.+.+.......|=.+++=.+.+++.+.+++ T Consensus 92 ~v~~--~~~~f~~i~~g~PLL~~~eD~~~v~~al~~~~ 127 (127) T cd03412 92 EVDA--FKKGFKKIKLGRPLLYSPEDYEEVAAALKDQI 127 (127) T ss_pred HHHH--HHCCCCEEEEECCCCCCHHHHHHHHHHHHHHC T ss_conf 9999--74579679970567698799999999987339 No 39 >PRK13129 consensus Probab=84.56 E-value=3.6 Score=21.83 Aligned_cols=215 Identities=15% Similarity=0.185 Sum_probs=103.8 Q ss_pred CEE-EEEECCCCCCCCCHHHHHHHHHHHCCC-HHHHCCCHHHHHHHHHHHHHCCCCHHHHHH-HHHHHCCCCCCCHHHHH Q ss_conf 679-999717888999988999999987489-544117868889987666512574788999-99870778874045777 Q gi|254780200|r 18 KIG-VLLVNLGTPDGHDFFSLRRYLREFLLD-KRVVELPSWQWRPILFGYILNFRPSKIKHA-YAKIWNTAKNESILRTH 94 (343) Q Consensus 18 K~g-vLL~n~G~P~~~~~~~V~~yL~~fl~D-~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~-Y~~Iw~~~gg~SPL~~~ 94 (343) |.| |-.+..|=|+ .+.-.+++..+-.. -++|++.-++--|+--| +-+++. ++.+ ..|.-+.. T Consensus 18 ~~ali~yitaG~P~---~e~s~~~~~~l~~~GaDiiEiGiPfSDP~ADG-------pvIq~A~~~AL----~~G~~~~~- 82 (267) T PRK13129 18 RCALMPFLMAGDPD---LETTAEALLILQENGADLIELGIPYSDPLADG-------PVIQAAATRAL----QSGTTLEK- 82 (267) T ss_pred CCEEEEEEECCCCC---HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHH- T ss_conf 95589887071899---89999999999977999999799888877658-------99999999999----76987899- Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHCCCCCH----HHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 5999999998652247843001121045424----789999987099658996317111001468999999999987179 Q gi|254780200|r 95 TRDQATNLAKRLESISSIVVDWAMRYGKPSV----KEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRW 170 (343) Q Consensus 95 t~~qa~~L~~~L~~~~~~~V~~amry~~P~i----~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~ 170 (343) ...+.+.+.+. .+.++ +-|.|++|.. ++-++++++.|++-+++ |=.|.- -.+.+.+.+.+.+. T Consensus 83 ~~~~~~~~r~~----~~~Pi-vlM~Y~N~i~~~G~e~F~~~~~~~GvdGvIi-pDLP~e-------E~~~~~~~~~~~gl 149 (267) T PRK13129 83 VLEMLESLKGK----LTIPI-ILFTYYNPLLNRGMERFCEQAAAAGVAGLVV-PDLPLE-------EAEKLSPIAAERGI 149 (267) T ss_pred HHHHHHHHHCC----CCCCE-EEEEEHHHHHHHHHHHHHHHHHHCCCCEEEC-CCCCHH-------HHHHHHHHHHHCCC T ss_conf 99999985434----78888-9986107898855999999998669875767-899989-------99999999985398 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 98642133455780068999999999997338998669998336750267627998889999999999972667567047 Q gi|254780200|r 171 SPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKI 250 (343) Q Consensus 171 ~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l 250 (343) . .+.+|.+=.. .++|+.-.+...+ ...+.|.-|+.=. + ..-..++.+..+.|.+.-+. +..+ T Consensus 150 ~-~I~lvaPtt~--------~~Ri~~i~~~~~g--FiY~vs~~GvTG~---~-~~~~~~~~~~i~~ik~~t~~---Pv~v 211 (267) T PRK13129 150 D-LILLVAPTTP--------AERMKRIAQQSRG--FTYLVSVTGVTGE---R-SQMENRVESLLQQLRQVTSK---PIAV 211 (267) T ss_pred E-EEEEECCCCC--------HHHHHHHHHCCCC--EEEEEECCCCCCC---C-CCCCHHHHHHHHHHHHHCCC---CEEE T ss_conf 1-6899489996--------8999999816898--0898734665676---5-44508899999999983489---8178 Q ss_pred EEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHH Q ss_conf 862257776544302689999899769983999744054 Q gi|254780200|r 251 CFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFS 289 (343) Q Consensus 251 ~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFv 289 (343) +| | =++.+...++.+-|..-| |+==+|| T Consensus 212 GF----G------Is~~e~v~~~~~~~ADGv-IVGSaiV 239 (267) T PRK13129 212 GF----G------ISGPEQARQVREWGADGA-IVGSAFV 239 (267) T ss_pred EE----C------CCCHHHHHHHHHCCCCEE-EECHHHH T ss_conf 84----4------799999999985499999-9878999 No 40 >TIGR01371 met_syn_B12ind 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase; InterPro: IPR006276 This group represents cobalamin-independent methionine synthase. A family of uncharacterised archaeal proteins having substantial homology to the C-terminal region of this family, are not included in this group, though they are contained in a larger family of vitamin-B12 independent methionine synthase.; GO: 0003871 5-methyltetrahydropteroyltriglutamate-homocysteine S-methyltransferase activity, 0009086 methionine biosynthetic process. Probab=81.43 E-value=4.7 Score=21.02 Aligned_cols=141 Identities=18% Similarity=0.241 Sum_probs=90.4 Q ss_pred HHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHH-----HHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 210454247899999870996589963171110014689-----999999999871799864213345578006899999 Q gi|254780200|r 118 MRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGT-----AQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQ 192 (343) Q Consensus 118 mry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS-----~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~ 192 (343) .|--.||.+-...|=+. +-||++| .||.|| .++....+.++-. -+...|-+++.+ T Consensus 427 ~~R~~pf~~R~~~Q~~~------L~LP~LP---TTTIGSFPQT~evR~~R~~~r~g~-----------iS~e~Ye~~~k~ 486 (778) T TIGR01371 427 FRRRSPFKERLELQQER------LNLPLLP---TTTIGSFPQTKEVRKARAAFRKGE-----------ISEEEYEKFIKE 486 (778) T ss_pred CCCCCCHHHHHHHHHHH------HCCCCCC---CCCCCCCCCCHHHHHHHHHHHCCC-----------CCHHHHHHHHHH T ss_conf 35788988999999998------0765588---666688788888999999973258-----------778999999998 Q ss_pred HHHHHHHHCC-CCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHC---CCCCCCEEEEECC-CC------CCCC Q ss_conf 9999997338-9986699983367502676279988899999999999726---6756704786225-77------7654 Q gi|254780200|r 193 SIREHFESIQ-WTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLS---WPDDRFKICFQSR-FG------RIKC 261 (343) Q Consensus 193 ~I~~~l~~~~-~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~---~~~~~~~l~fQSr-~G------p~~W 261 (343) .|++.++-.. -.=|.|+ ||=|+|. -+++=-.|.|+ ...+-|-.+|=|| += =+.| T Consensus 487 eI~~~I~~QEelGLDVLV---HGEfERN------------DMVEYFGE~L~GFafT~nGWVQSYGSRCVkPPIIyGDVsR 551 (778) T TIGR01371 487 EIKKVIKIQEELGLDVLV---HGEFERN------------DMVEYFGEKLAGFAFTQNGWVQSYGSRCVKPPIIYGDVSR 551 (778) T ss_pred HHHHHHHHHHHCCCCEEE---CCCCCCC------------CHHHHHHHHHCHHEEEECCEEEECCCEECCCCEEEECCCC T ss_conf 899999874232786010---3775548------------3368888721201221030152047210357778713211 Q ss_pred CCHHHHHHHHH---HHHCCCC-----CEEEECCHHHHCCH Q ss_conf 43026899998---9976998-----39997440542212 Q gi|254780200|r 262 LEPPTDKTVEK---LAHDGIK-----SLAIITPGFSSDCL 293 (343) Q Consensus 262 L~P~t~~~l~~---L~~~G~k-----~v~v~p~gFvsD~l 293 (343) .+|-|.+=..- |=++=+| =|-|..=||+=|=+ T Consensus 552 p~PMTV~w~~YAQSLT~KpVKGMLTGPVTIL~WSF~R~D~ 591 (778) T TIGR01371 552 PKPMTVKWSVYAQSLTSKPVKGMLTGPVTILNWSFVRDDI 591 (778) T ss_pred CCCCCHHHHHHHCCCCCCCCCCCCCCCHHCCCCCCCCCCC T ss_conf 7777788897712146889776568321103553555688 No 41 >cd06811 PLPDE_III_yhfX_like Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzyme yhfX. This subfamily is composed of the uncharacterized protein yhfX from Escherichia coli K-12 and similar bacterial proteins. These proteins are homologous to bacterial alanine racemases (AR), which are fold type III PLP-dependent enzymes containing an N-terminal PLP-binding TIM-barrel domain and a C-terminal beta-sandwich domain. AR exists as homodimers with active sites that lie at the interface between the TIM barrel domain of one subunit and the beta-sandwich domain of the other subunit. It catalyzes the interconversion between L- and D-alanine, which is an essential component of the peptidoglycan layer of bacterial cell walls. Members of this subfamily may act as PLP-dependent enzymes. Probab=81.20 E-value=4.8 Score=20.96 Aligned_cols=44 Identities=20% Similarity=0.329 Sum_probs=22.3 Q ss_pred HHHHHHHHHHHHCCCCCCCHHHHH--CC-CCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 999999998652247843001121--04-54247899999870996589963 Q gi|254780200|r 96 RDQATNLAKRLESISSIVVDWAMR--YG-KPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 96 ~~qa~~L~~~L~~~~~~~V~~amr--y~-~P~i~~~l~~l~~~g~~~ii~lP 144 (343) .+-++.|.+.-+. .++.. +.|. .| +|++...+.+ .|+++++++= T Consensus 39 ~~NAr~l~~~A~~-~gi~L-y~MtKQ~GRNP~l~~~l~~---~G~~g~VaVD 85 (382) T cd06811 39 EENARLLAETAEK-YGIEL-YFMTKQFGRNPFLARALLE---AGIPGAVAVD 85 (382) T ss_pred HHHHHHHHHHHHH-CCCEE-EEEEECCCCCHHHHHHHHH---CCCCCEEEEE T ss_conf 9999999999997-39689-9996105899899999997---3998568960 No 42 >PRK09545 znuA high-affinity zinc transporter periplasmic component; Reviewed Probab=79.36 E-value=5.5 Score=20.56 Aligned_cols=211 Identities=10% Similarity=0.101 Sum_probs=116.4 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHH------HH Q ss_conf 599999999865224784300112104542478999998709965899631711100146899999999998------71 Q gi|254780200|r 95 TRDQATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELI------HM 168 (343) Q Consensus 95 t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~------~~ 168 (343) +-.+..+|++ .++.++.|.-+ +++++..++.+... +++.+..-+.-..-...+.. .....-. .. T Consensus 62 ~Psd~~~l~~-----ADli~~~G~~l-E~~l~k~l~~~~~~---~~i~~~~~~~~~~~~~~~~~-~~~~~~~h~~~~~~~ 131 (308) T PRK09545 62 RPSDVKRLQS-----ADLVVWVGPEM-EAFLEKPVSKLPEN---KQVTIAQLPDVKPLLMKSAH-HDEDEDDHDHAAEKH 131 (308) T ss_pred CHHHHHHHHC-----CCEEEEECCCH-HHHHHHHHHHCCCC---CEEEEEECCCCCCCCCCCCC-CCCCCCCCCCCCCCC T ss_conf 9999999860-----99999958656-77899999736657---80699822785423334545-655555555444556 Q ss_pred -CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCC Q ss_conf -7998642133455780068999999999997338998669998336750267627998889999999999972667567 Q gi|254780200|r 169 -RWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDR 247 (343) Q Consensus 169 -~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~ 247 (343) .....-.+-+++|-+|.....+++.|.+.|.+.+-.. -.-+-.+.+-|..++.+.-+.+.+.+.-..++ T Consensus 132 ~~~h~hg~~DPH~WldP~na~~~a~~I~~~L~~~dP~~----------~~~y~~N~~~~~~~L~~ld~~~~~~la~~~~r 201 (308) T PRK09545 132 DEDHHHGDYNMHLWLSPEIARATAVAIHDKLVELMPQS----------KAKLDANLKDFEAQLAATDKQIGNQLAPVKGK 201 (308) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 76555699997144799999999999999999868451----------79999999999999999999999972577788 Q ss_pred CEEEEE-------CCCC--------CCCCCCHHH---HHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 047862-------2577--------765443026---8999989976998399974405422125588875899999997 Q gi|254780200|r 248 FKICFQ-------SRFG--------RIKCLEPPT---DKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVN 309 (343) Q Consensus 248 ~~l~fQ-------Sr~G--------p~~WL~P~t---~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e 309 (343) -.++|. .++| ..+=.+|+. .+..+...++++ +++++-|.|-..-.||+ +++ T Consensus 202 ~~v~~HdAf~Yfa~~ygL~~~~~~~~~~e~~Ps~~~l~~l~~~ik~~~v-~~If~Ep~~~~k~~~~i----------a~~ 270 (308) T PRK09545 202 GYFVFHDAYGYFEKHYGLTPLGHFTVNPEIQPGAQRLHEIRTQLVEHKA-TCVFAEPQFRPAVIESV----------AKG 270 (308) T ss_pred EEEEECCCHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCC-CEEEECCCCCHHHHHHH----------HHH T ss_conf 6999772079999986993878871287767899999999999998399-79996599994999999----------986 Q ss_pred CCCCEEEEECCCCCC-----HHHHHHHHHHHHH Q ss_conf 599758981689879-----8999999999999 Q gi|254780200|r 310 GGGEKFTQVPCLNSS-----NLSIDLLEKITRR 337 (343) Q Consensus 310 ~Gg~~~~rvp~lN~~-----~~fi~~La~lv~~ 337 (343) .|. ....+..|+.+ +.+++++.+++.+ T Consensus 271 ~g~-kv~~lDpLg~~~~~~~~~Y~~~m~~~~~~ 302 (308) T PRK09545 271 TSV-RMGTLDPLGTNIKLGKTSYSNFLSQLANQ 302 (308) T ss_pred CCC-EEEEECCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 598-59996787556888811799999999999 No 43 >PRK08176 pdxK pyridoxal kinase; Reviewed Probab=78.35 E-value=5.9 Score=20.36 Aligned_cols=197 Identities=15% Similarity=0.144 Sum_probs=110.4 Q ss_pred HCCCCCHHHHHHHHHHCCCCEE----EEEECCHHHHHHH----HHHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCHHHHH Q ss_conf 1045424789999987099658----9963171110014----6899999999998717998642-13345578006899 Q gi|254780200|r 119 RYGKPSVKEIINNLREEGCDRL----LIFPLYPQYSAAT----TGTAQDKVFQELIHMRWSPSLR-TVPPYYEDSDYISA 189 (343) Q Consensus 119 ry~~P~i~~~l~~l~~~g~~~i----i~lPLyPqyS~sT----tgS~~~~~~~~l~~~~~~~~~~-~I~~~~~~p~yI~a 189 (343) -||+=...-++--|...|++-. ++|+=.|+|...+ +...++.+.+.+.+++...++. ++..|-.++..+++ T Consensus 26 ~~G~vgn~aaip~Ls~~G~~v~~lpTv~lSnhtgy~~~~g~~~~~~~~~~~l~~l~~~~~l~~~daI~TGylgs~~qi~~ 105 (281) T PRK08176 26 VYGSVGNSIAVPAIKANGLRVAAVPTVLLSNTPHYPTFYGGAIPDEWFAGYLRALQERDALRQLRAVTTGYMGSASQIKI 105 (281) T ss_pred ECCEECHHHHHHHHHHCCCEEEEECCEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHH T ss_conf 62546248899999977982125562654579999983178589999999999999759723478578610499999999 Q ss_pred HHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCC-CCHHHHHHHHHHHHHHH-----CCCCCCCEEEEECCCCCCCC-C Q ss_conf 99999999973389986699983367502676279-98889999999999972-----66756704786225777654-4 Q gi|254780200|r 190 LAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGD-PYGCHCHKTARLLKEFL-----SWPDDRFKICFQSRFGRIKC-L 262 (343) Q Consensus 190 ~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gD-pY~~q~~~t~~~i~~~l-----~~~~~~~~l~fQSr~Gp~~W-L 262 (343) .++-|++.-++.+ +.+++. =|+ +.++|- .|-. .+.+...+++| -+.+|.++.++-+ |. +. - T Consensus 106 V~~~i~~~k~~~p---~~~~vv---DPV-MGD~g~~lyv~--~~~~~~~r~~Llp~AdiITPN~~Ea~~Lt--g~-~i~~ 173 (281) T PRK08176 106 LAEWLTALRADHP---DLLIMV---DPV-IGDIDSGIYVK--PDLPEAYRQHLLPLAQGLTPNIFELEILT--GK-PCRT 173 (281) T ss_pred HHHHHHHHHHHCC---CCEEEE---CCE-EECCCCCCCCC--HHHHHHHHHHHHCCCCEECCCHHHHHHHH--CC-CCCC T ss_conf 9999999973289---976996---727-64499983678--77999999960130639738999999973--98-8699 Q ss_pred CHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECC-----CCCCHHHHHHHHHHH Q ss_conf 3026899998997699839997440542212558887589999999759975898168-----987989999999999 Q gi|254780200|r 263 EPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPC-----LNSSNLSIDLLEKIT 335 (343) Q Consensus 263 ~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~-----lN~~~~fi~~La~lv 335 (343) ..+..+.+++|.+.|.+.|+|-..++-.++-+. ..+. ...+...+...|- -++-+.|.++|+.-. T Consensus 174 ~~~~~~aa~~L~~~G~~~VvITs~~~~~~~~~~-~~~~-------~~~~~~~~~~~p~i~~~~~GTGDlfaA~l~a~l 243 (281) T PRK08176 174 LDEAIAAAKSLLSDTLKWVVITSAAGNEPNQEM-QVAV-------VTADSVNVISHPRVDTDLKGTGDLFCAELVSGL 243 (281) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCE-EEEE-------ECCCEEEEEEECCCCCCCCCCCHHHHHHHHHHH T ss_conf 999999999999719998999806577788716-8999-------859958999714128998873299999999999 No 44 >PRK09613 thiH thiamine biosynthesis protein ThiH; Reviewed Probab=78.03 E-value=6 Score=20.29 Aligned_cols=61 Identities=16% Similarity=0.227 Sum_probs=34.8 Q ss_pred HHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHH Q ss_conf 26899998997699839997440542212558887589999999759975898168987989999999999 Q gi|254780200|r 265 PTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKIT 335 (343) Q Consensus 265 ~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv 335 (343) ++++++..|.++|. =|||.+-|--. =--+-.+.++++...+.+|. .+|.=-.|-+.|.|-. T Consensus 367 s~dEvi~~l~~~gy------iPSfCTACYR~-GRTG~~FM~lak~g~I~~~C---~PNAl~T~~EyL~DYa 427 (471) T PRK09613 367 SLDEVIRELCEMGY------IPSFCTACYRK-GRTGDRFMSLAKSGQIKNFC---QPNALLTFKEYLLDYA 427 (471) T ss_pred CHHHHHHHHHHCCC------CCCCCCCCCCC-CCCHHHHHHHHHCCCCCCCC---CCCHHHHHHHHHHHHC T ss_conf 99999999997799------77300105468-98578899862043623665---8007889999988728 No 45 >PRK13609 diacylglycerol glucosyltransferase; Provisional Probab=74.19 E-value=7.5 Score=19.61 Aligned_cols=103 Identities=6% Similarity=0.081 Sum_probs=49.8 Q ss_pred CCCCCCCCCCCCCCHHHHHHH-------HHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCC Q ss_conf 986421334557800689999-------9999999733899866999833675026762799888999999999997266 Q gi|254780200|r 171 SPSLRTVPPYYEDSDYISALA-------QSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSW 243 (343) Q Consensus 171 ~~~~~~I~~~~~~p~yI~a~~-------~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~ 243 (343) .|-+.+|.+|.-|+..+.--+ +.+++.+.+.+.+++++.. .|+|++-.=.. ++ ....+.+++|+ T Consensus 128 iP~~tViTD~~~H~~Wi~~~~D~y~Vase~~k~~l~~~Gv~~~kI~v--tGiPVr~~F~~-~~------~k~~~r~~lgL 198 (388) T PRK13609 128 IPVYNVLTDFCVHKIWIHREVDRYFVATDHVKEVMVDIGVPAEQIVE--TGIPIRSSFEL-KI------NPDIIYNKYQL 198 (388) T ss_pred CCEEEEECCCCCHHHHCCCCCCEEEECCHHHHHHHHHHCCCHHHEEE--ECCCCCHHHCC-CC------CHHHHHHHHCC T ss_conf 98899947852046455789997993989999999980998889998--89843878727-58------87899998289 Q ss_pred CCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 756704786225777654430268999989976998399974 Q gi|254780200|r 244 PDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIIT 285 (343) Q Consensus 244 ~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p 285 (343) .++...+ .-+ |.+......+.+.+++|.+..--.++|+| T Consensus 199 ~~d~~~v-Lim--gGg~G~~g~i~~l~~~L~~~~~~qiiVVc 237 (388) T PRK13609 199 CKNKKIL-LIV--AGAHGVLGNVKELCQSFMSVPDLQVVVVC 237 (388) T ss_pred CCCCCEE-EEE--CCHHHHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 9878479-997--66012114799999997458992499990 No 46 >KOG0159 consensus Probab=72.93 E-value=2.3 Score=23.23 Aligned_cols=27 Identities=30% Similarity=0.459 Sum_probs=20.7 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHH Q ss_conf 257776544302689999899769983999744054 Q gi|254780200|r 254 SRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFS 289 (343) Q Consensus 254 Sr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFv 289 (343) ++|-|+.||++++ +-...-+.+|.||= T Consensus 435 ~~F~PeRWL~~~~---------~~~~pF~~LPFGfG 461 (519) T KOG0159 435 EEFLPERWLKPST---------KTIHPFASLPFGFG 461 (519) T ss_pred CCCCHHHHCCCCC---------CCCCCCEECCCCCC T ss_conf 0268055416555---------77687553377777 No 47 >PRK05756 pyridoxamine kinase; Validated Probab=72.05 E-value=8.4 Score=19.27 Aligned_cols=195 Identities=14% Similarity=0.169 Sum_probs=111.0 Q ss_pred CCCCCHHHHHHHHHHCCCCEE----EEEECCHHHHHHH----HHHHHHHHHHHHHHHCCCCCCC-CCCCCCCCCHHHHHH Q ss_conf 045424789999987099658----9963171110014----6899999999998717998642-133455780068999 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRL----LIFPLYPQYSAAT----TGTAQDKVFQELIHMRWSPSLR-TVPPYYEDSDYISAL 190 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~i----i~lPLyPqyS~sT----tgS~~~~~~~~l~~~~~~~~~~-~I~~~~~~p~yI~a~ 190 (343) ||.=...-++.-|...|++-. +.++-.|+|...+ ...-++.+.+.+..++..-+++ ++..|-.++..+++. T Consensus 14 ~G~vGn~aa~~~Lq~~G~~v~~v~Tv~~Snhtgy~~~~g~~~~~~~l~~~~~~l~~~~~l~~~davltGYlgs~~~~~~i 93 (287) T PRK05756 14 YGHVGNSAAVFPMQRLGVNVWPLNTVQFSNHTGYGKWTGCVMPPSHITEIVQGIAAIGWLGECDAVLSGYLGSAEQGEAI 93 (287) T ss_pred CCCCHHHHHHHHHHHCCCEEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEECCCCCHHHHHHH T ss_conf 78512687799999869866775058803689999877876999999999998986264455888997376999999999 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCC-CCHHHHHHHHHHHHHHH-----CCCCCCCEEEEECCCCCCCCCCH Q ss_conf 9999999973389986699983367502676279-98889999999999972-----66756704786225777654430 Q gi|254780200|r 191 AQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGD-PYGCHCHKTARLLKEFL-----SWPDDRFKICFQSRFGRIKCLEP 264 (343) Q Consensus 191 ~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gD-pY~~q~~~t~~~i~~~l-----~~~~~~~~l~fQSr~Gp~~WL~P 264 (343) ++.|++.-+..+ ..+.+ -. |+ +.++|- .|-.+ +....+++.| -+.+|+|++++-+ |..-=-.- T Consensus 94 ~~~i~~~k~~np---~~~~v-~D--PV-mGD~g~g~yv~~--~~~~~~~~~llp~AdiiTPN~fEle~Lt--G~~i~~~~ 162 (287) T PRK05756 94 LGAVRRVKAANP---QALYL-CD--PV-MGHPEKGCIVAP--GVAEFLRDRALPAADIITPNLFELEWLS--GHPVETLE 162 (287) T ss_pred HHHHHHHHHHCC---CCEEE-EC--CE-EECCCCCEECCH--HHHHHHHHHCCCCCCEEECCHHHHHHHC--CCCCCCHH T ss_conf 999999985299---82799-66--52-504898553287--6999999852253579838889999976--99659999 Q ss_pred HHHHHHHHHHHCCCCCEEEECCH---HHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCC-------CCCHHHHHHHHHH Q ss_conf 26899998997699839997440---5422125588875899999997599758981689-------8798999999999 Q gi|254780200|r 265 PTDKTVEKLAHDGIKSLAIITPG---FSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCL-------NSSNLSIDLLEKI 334 (343) Q Consensus 265 ~t~~~l~~L~~~G~k~v~v~p~g---FvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~l-------N~~~~fi~~La~l 334 (343) +..+++..|.+.|.|.|+|-... +-.|+++++.- ...+....+.|.+ ++-+.|...|... T Consensus 163 ~~~~A~~~l~~~g~~~VvvtS~~~~~~~~~~~~~l~~----------~~~~~~~~~~p~i~~~~~~~GtGDlfaAll~a~ 232 (287) T PRK05756 163 DAVAAARALIARGPKIVLVTSLARAGYPADRFEMLLV----------TADGAWHISRPLVDFPRQPVGVGDLTSALFLAR 232 (287) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCEEEEEE----------CCCCEEEEEEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 9999999999719988999168777788883689998----------189439997322147779898269999999999 Q ss_pred H Q ss_conf 9 Q gi|254780200|r 335 T 335 (343) Q Consensus 335 v 335 (343) . T Consensus 233 l 233 (287) T PRK05756 233 L 233 (287) T ss_pred H T ss_conf 8 No 48 >CHL00194 ycf39 Ycf39; Provisional Probab=70.83 E-value=8.9 Score=19.09 Aligned_cols=141 Identities=16% Similarity=0.194 Sum_probs=73.0 Q ss_pred HHHHHHHHCCCCEEEEEECC--HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC Q ss_conf 89999987099658996317--1110014689999999999871799864213345578006899999999999733899 Q gi|254780200|r 127 EIINNLREEGCDRLLIFPLY--PQYSAATTGTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWT 204 (343) Q Consensus 127 ~~l~~l~~~g~~~ii~lPLy--PqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~ 204 (343) ..++..++.|++++|-++.. .+++.+.....-....+.+...+ .++.++++=+--..++..++.-| ++. T Consensus 92 ~li~AAk~aGVkr~V~lS~lga~~~~~~p~~~~K~~~E~~L~~Sg--l~~TIlRPs~F~q~l~~~~a~pi---~~~---- 162 (319) T CHL00194 92 ALIEAAKAAKVKRFIFFSILNAEQYPQVPLMKIKSDIEEKLKQSG--INYTIFRLAGFFQGLISQYAIPI---LDS---- 162 (319) T ss_pred HHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCC--CCEEEECCHHHHHHHHHHHHHHH---HCC---- T ss_conf 999999984998899961356666887567787999999998679--98599847399998899876776---307---- Q ss_pred CCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEE Q ss_conf 86699983367502676279988899999999999726675670478622577765443026899998997699839997 Q gi|254780200|r 205 PEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAII 284 (343) Q Consensus 205 ~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~ 284 (343) ..+..+.-+.|..++.-.| -++.++..|..+.. ..-.|.= .||.-|.--.+.+.++++..+. .+++=+ T Consensus 163 -~~v~~~~~~~~ia~I~~~D--------VA~~~a~aL~~~~~-~gk~y~L-~GP~a~T~~EIa~l~~~~~Gk~-~~i~~v 230 (319) T CHL00194 163 -QTIWITGESTPIAYIDTQD--------AAKFALKSLSLPET-KNKTFPL-VGPKSWNSSEIISLCEQLSGQK-AKVTRI 230 (319) T ss_pred -CCEEECCCCCCCCCCCHHH--------HHHHHHHHHCCCCC-CCCEEEE-ECCCCCCHHHHHHHHHHHHCCC-CCEEEC T ss_conf -8577669987528877999--------99999999589775-8989995-4986389999999999985999-877868 Q ss_pred CCHH Q ss_conf 4405 Q gi|254780200|r 285 TPGF 288 (343) Q Consensus 285 p~gF 288 (343) |+.+ T Consensus 231 P~~~ 234 (319) T CHL00194 231 PLFL 234 (319) T ss_pred CHHH T ss_conf 9899 No 49 >TIGR02684 dnstrm_HI1420 probable addiction module antidote protein; InterPro: IPR014057 Members of this strictly bacterial protein family are small, at roughly 100 amino acids. The gene is almost invariably the downstream member of a gene pair. It is a predicted DNA-binding protein from a clade within the helix-turn-helix family IPR001387 from INTERPRO. These gene pairs, when found on the bacterial chromosome, are located often with prophage regions, but also both in integrated plasmid regions and in housekeeping gene regions. Analysis suggests that the gene pair may serve as an addiction module.. Probab=70.30 E-value=9.1 Score=19.01 Aligned_cols=63 Identities=16% Similarity=0.158 Sum_probs=38.1 Q ss_pred CHHHHHHHHHHHCCCH--HHHCCCHHHHHHHHHHHHHCCCC-HHHH-------H-HHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 9889999999874895--44117868889987666512574-7889-------9-9998707788740457775999999 Q gi|254780200|r 33 DFFSLRRYLREFLLDK--RVVELPSWQWRPILFGYILNFRP-SKIK-------H-AYAKIWNTAKNESILRTHTRDQATN 101 (343) Q Consensus 33 ~~~~V~~yL~~fl~D~--~Vi~~p~~~~~~il~~~I~~~R~-~~~~-------~-~Y~~Iw~~~gg~SPL~~~t~~qa~~ 101 (343) +.++|+.||.+-|-|. + |..+...+ |.|++-|- ..++ | .|+.. .++|.|-..-..+-+.+ T Consensus 13 ~~E~~a~YL~~a~e~gGdD----p~~~~~AL--G~iArArGMtqlA~~tGlsREsLYkAL---s~~GnP~f~T~lkV~~A 83 (91) T TIGR02684 13 TDEEVAEYLAQALEDGGDD----PALIAAAL--GVIARARGMTQLAKKTGLSRESLYKAL---SGGGNPTFDTILKVTKA 83 (91) T ss_pred CHHHHHHHHHHHHHCCCCC----HHHHHHHH--HHHHHHCCHHHHHHHHCCCHHHHHHHH---CCCCCCCHHHHHHHHHH T ss_conf 8789999999997436999----88999998--899865165789998287678878873---56889566888999984 Q ss_pred HHH Q ss_conf 998 Q gi|254780200|r 102 LAK 104 (343) Q Consensus 102 L~~ 104 (343) |.- T Consensus 84 LG~ 86 (91) T TIGR02684 84 LGL 86 (91) T ss_pred CCC T ss_conf 086 No 50 >cd01173 pyridoxal_pyridoxamine_kinase Pyridoxal kinase plays a key role in the synthesis of the active coenzyme pyridoxal-5'-phosphate (PLP), by catalyzing the phosphorylation of the precursor vitamin B6 in the presence of Zn2+ and ATP. Mammals are unable to synthesize PLP de novo and require its precursors in the form of vitamin B6 (pyridoxal, pyridoxine, and pyridoxamine) from their diet. Pyridoxal kinase encoding genes are also found in many other species including yeast and bacteria. Probab=68.28 E-value=10 Score=18.73 Aligned_cols=197 Identities=14% Similarity=0.165 Sum_probs=108.8 Q ss_pred CCCCCHHHHHHHHHHCCCCEE----EEEECCHHHHH----HHHHHHHHHHHHHHHHHCCCCCCCC-CCCCCCCCHHHHHH Q ss_conf 045424789999987099658----99631711100----1468999999999987179986421-33455780068999 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRL----LIFPLYPQYSA----ATTGTAQDKVFQELIHMRWSPSLRT-VPPYYEDSDYISAL 190 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~i----i~lPLyPqyS~----sTtgS~~~~~~~~l~~~~~~~~~~~-I~~~~~~p~yI~a~ 190 (343) ||.=...-++.-|...|++-. ++++-.|+|-. ......++.+.+.+.+.+...++.- ...|-.++.-++++ T Consensus 11 ~G~vg~~aai~~l~~~G~~v~~lpTv~lSnhtg~~~~~g~~~~~~~l~~~l~~~~~~~~~~~~daI~tGyl~s~~~i~~v 90 (254) T cd01173 11 HGYVGNSAAVFPLQRLGWDVDALPTVQFSNHTGYGTWTGFVLSAEELEDLLEGLEALGLLLEYDAVLTGYLGSAEQVEAV 90 (254) T ss_pred CCCCHHHHHHHHHHHCCCEEEEEEEEECCCCCCCCCCCCEECCHHHHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHH T ss_conf 78512799999999879855787558723379999855044889999999999997498666999999116999999999 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHH-----CCCCCCCEEEEECCCCCCCCCCHH Q ss_conf 999999997338998669998336750267627998889999999999972-----667567047862257776544302 Q gi|254780200|r 191 AQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFL-----SWPDDRFKICFQSRFGRIKCLEPP 265 (343) Q Consensus 191 ~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l-----~~~~~~~~l~fQSr~Gp~~WL~P~ 265 (343) ++.|++.-+..+ ...++. =|+ +.++|--|. --.+....+.+.| -+.+|.+++++-+.. +.+ -..+ T Consensus 91 ~~~i~~~k~~~~--~~~~v~----DPV-mgD~g~~y~-~~~~~~~~~r~~L~p~AdviTPN~~Ea~~Ltg~-~i~-~~~~ 160 (254) T cd01173 91 AEIVKRLKEKNP--NLLYVC----DPV-MGDNGKLYV-VAEEIVPVYRDLLVPLADIITPNQFELELLTGK-KIN-DLED 160 (254) T ss_pred HHHHHHHHHCCC--CCCEEE----CCE-EECCCCCCC-CCHHHHHHHHHHHHCCCCEECCCHHHHHHHHCC-CCC-CHHH T ss_conf 999999862489--873896----515-852998843-788999999997313100715898999998299-979-9999 Q ss_pred HHHHHHHHHHCCCCCEEEECCHHHHCC-HHHHHHHHHHHHHHHHHCCCCE-E--EEEC--C--CCCCHHHHHHHHHHHH Q ss_conf 689999899769983999744054221-2558887589999999759975-8--9816--8--9879899999999999 Q gi|254780200|r 266 TDKTVEKLAHDGIKSLAIITPGFSSDC-LETSYEIAHEAKEIFVNGGGEK-F--TQVP--C--LNSSNLSIDLLEKITR 336 (343) Q Consensus 266 t~~~l~~L~~~G~k~v~v~p~gFvsD~-lETl~Eidie~~e~~~e~Gg~~-~--~rvp--~--lN~~~~fi~~La~lv~ 336 (343) ..+.+++|.+.|.++|+|-...+-.|. +.++. ...+... + .+++ + -++-+.|.++++.... T Consensus 161 ~~~aa~~L~~~g~~~Vvitg~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~GTGD~fsa~laa~l~ 229 (254) T cd01173 161 AKAAARALHAKGPKTVVVTSVELADDDRIEMLG----------STATEAWLVQRPKIPFPAYFNGTGDLFAALLLARLL 229 (254) T ss_pred HHHHHHHHHHHCCCEEEECCCCCCCCCEEEEEE----------ECCCEEEEEEEEEECCCCCCCCHHHHHHHHHHHHHH T ss_conf 999999999708999999986578888167999----------829915999975878999915599999999999998 No 51 >cd07993 LPLAT_DHAPAT-like Lysophospholipid Acyltransferases (LPLATs) of Glycerophospholipid Biosynthesis: GPAT-like. Lysophospholipid acyltransferase (LPLAT) superfamily member: acyltransferases of de novo and remodeling pathways of glycerophospholipid biosynthesis which catalyze the incorporation of an acyl group from either acylCoAs or acyl-acyl carrier proteins (acylACPs) into acceptors such as glycerol 3-phosphate, dihydroxyacetone phosphate or lyso-phosphatidic acid. Included in this subgroup are such LPLATs as dihydroxyacetone phosphate acyltransferase (DHAPAT, also known as 1 glycerol-3-phosphate O-acyltransferase 1) and similar proteins. Probab=68.13 E-value=10 Score=18.71 Aligned_cols=62 Identities=23% Similarity=0.222 Sum_probs=38.9 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCEEEEE----CCCCCCCCCCHH---HHHHHHHHHHCCCCCEEEECCHHHHCCH Q ss_conf 98889999999999972667567047862----257776544302---6899998997699839997440542212 Q gi|254780200|r 225 PYGCHCHKTARLLKEFLSWPDDRFKICFQ----SRFGRIKCLEPP---TDKTVEKLAHDGIKSLAIITPGFSSDCL 293 (343) Q Consensus 225 pY~~q~~~t~~~i~~~l~~~~~~~~l~fQ----Sr~Gp~~WL~P~---t~~~l~~L~~~G~k~v~v~p~gFvsD~l 293 (343) -|.....+.+..+.+. | .-..-|. ||.| +++.|- +.-+++...+....+|.++|++...|.+ T Consensus 85 ly~~vl~~yi~~ll~~-g----~~l~~F~EGtRSR~G--kl~~pK~GlL~~~~~a~~~~~~~dv~ivPVsIsYd~v 153 (205) T cd07993 85 LYRAVLQEYVQELLKN-G----QPLEFFIEGTRSRTG--KLLPPKLGLLSVVVEAYLKGSVPDVLIVPVSISYDRV 153 (205) T ss_pred HHHHHHHHHHHHHHHC-C----CCEEEEECCCCCCCC--CCCCCHHHHHHHHHHHHHHCCCCCEEEEEEEEEECHH T ss_conf 7999999999999848-9----848999378767789--8667568899999999970788973999877863011 No 52 >PRK13139 consensus Probab=67.47 E-value=10 Score=18.62 Aligned_cols=227 Identities=16% Similarity=0.157 Sum_probs=102.7 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHCCC-HHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH-HHHHCCCCCCCHHHHHHHH Q ss_conf 9999717888999988999999987489-5441178688899876665125747889999-9870778874045777599 Q gi|254780200|r 20 GVLLVNLGTPDGHDFFSLRRYLREFLLD-KRVVELPSWQWRPILFGYILNFRPSKIKHAY-AKIWNTAKNESILRTHTRD 97 (343) Q Consensus 20 gvLL~n~G~P~~~~~~~V~~yL~~fl~D-~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y-~~Iw~~~gg~SPL~~~t~~ 97 (343) =|--+..|=|+ .++-.+.+..+-.. -++|++.-++--|+.-| +-+++.+ +.+ .+|.-+... .. T Consensus 18 li~yitaG~P~---~e~s~~~~~~l~~~GaDiiElGiPFSDP~ADG-------pvIq~A~~~AL----~~G~~~~~~-~~ 82 (254) T PRK13139 18 LMTHIVLGYPS---LQANREVIRAMVAGGVDLMELQIPFSEPMADG-------PVILKANQAAL----AGGFKVREC-FD 82 (254) T ss_pred EEEEECCCCCC---HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHHH-HH T ss_conf 88785584899---79999999999966999999789888866658-------99999999999----769979999-99 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHCCCCCHH----HHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 99999986522478430011210454247----89999987099658996317111001468999999999987179986 Q gi|254780200|r 98 QATNLAKRLESISSIVVDWAMRYGKPSVK----EIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPS 173 (343) Q Consensus 98 qa~~L~~~L~~~~~~~V~~amry~~P~i~----~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~ 173 (343) +.+.+.++. +.++ +-|.|++|... .-++++++.|++-+++ |=.|.- -.+++.+.+.+.+.. . T Consensus 83 ~~~~~~~~~----~~pi-vlM~Y~N~i~~~G~e~F~~~~~~~Gv~GvIi-pDLP~e-------E~~~~~~~~~~~gl~-~ 148 (254) T PRK13139 83 FAREVVAAF----NIPF-LFMTYYNILFKYGVERFIDEVADIGVKGLIV-PDLPPE-------QAQDYIAQCRAKGMA-P 148 (254) T ss_pred HHHHHHHCC----CCCE-EEEEEHHHHHHCCHHHHHHHHHHCCCCEEEC-CCCCHH-------HHHHHHHHHHHCCCC-E T ss_conf 999997248----9768-9995259998709999999999759985864-799978-------899999999846975-7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 42133455780068999999999997338998669998336750267627998889999999999972667567047862 Q gi|254780200|r 174 LRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQ 253 (343) Q Consensus 174 ~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQ 253 (343) +.+|.+= ..+ ++|+.-.+..++ .....|--|+-= .+ ........+..+.+.+.-+. +..++| T Consensus 149 I~lvaPt-t~~-------~Ri~~i~~~a~g--FiY~vs~~GvTG---~~-~~~~~~~~~~i~~ik~~t~~---Pv~vGF- 210 (254) T PRK13139 149 IGIYAPT-STD-------ERMGKIAAAADG--FIYCVARRGVTG---SK-TSFDEHVGAFLHRCRAATPL---PLAVGF- 210 (254) T ss_pred EEEECCC-CCH-------HHHHHHHHCCCC--EEEEEECCCCCC---CC-CCCHHHHHHHHHHHHHCCCC---CEEEEC- T ss_conf 9994589-998-------999999851698--699996666679---88-66458899999999855899---879973- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 2577765443026899998997699839997440542212558887589999999 Q gi|254780200|r 254 SRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFV 308 (343) Q Consensus 254 Sr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~ 308 (343) |= ++.+.++.+. .+..-| |+==.|| ..+ +|-+.+.-+.|. T Consensus 211 ---GI------~~~e~v~~~~-~~aDGv-IVGSaiV-k~i---e~~g~~~v~~f~ 250 (254) T PRK13139 211 ---GV------KSAADVDYLK-GKADIA-VVGSQAI-RLF---DEAGAAAVEPFF 250 (254) T ss_pred ---CC------CCHHHHHHHH-CCCCEE-EECHHHH-HHH---HHCCHHHHHHHH T ss_conf ---77------9999999997-169999-9888999-999---975999999999 No 53 >COG4750 LicC CTP:phosphocholine cytidylyltransferase involved in choline phosphorylation for cell surface LPS epitopes [Cell envelope biogenesis, outer membrane] Probab=66.19 E-value=6.4 Score=20.09 Aligned_cols=71 Identities=27% Similarity=0.298 Sum_probs=45.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHH Q ss_conf 88740457775999999998652247843001121045424789999987099658996317111001468999999999 Q gi|254780200|r 85 AKNESILRTHTRDQATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQE 164 (343) Q Consensus 85 ~gg~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~ 164 (343) .|-||.++-+|+.--..|-+ -+|+|.||-.+++|++.|++.|+++. .+..+.++. T Consensus 8 AG~gsR~~plT~~tpK~Llk--------------V~g~plIErqI~~L~e~gI~dI~IVv-----------GYlkE~FeY 62 (231) T COG4750 8 AGLGSRFVPLTQSTPKSLLK--------------VNGEPLIERQIEQLREAGIDDITIVV-----------GYLKEQFEY 62 (231) T ss_pred CCCCCCCCCCCCCCCHHHHH--------------HCCCCCHHHHHHHHHHCCCCEEEEEE-----------EEHHHHHHH T ss_conf 25566533131028767887--------------35855099999999977986189996-----------314899999 Q ss_pred HHHHCCCCCCCCCCCCC Q ss_conf 98717998642133455 Q gi|254780200|r 165 LIHMRWSPSLRTVPPYY 181 (343) Q Consensus 165 l~~~~~~~~~~~I~~~~ 181 (343) ++..- ...+.+-+.|. T Consensus 63 LkdKy-~vtLvyN~kY~ 78 (231) T COG4750 63 LKDKY-DVTLVYNPKYR 78 (231) T ss_pred HHHHC-CEEEEECCHHH T ss_conf 98715-74999574077 No 54 >COG1312 UxuA D-mannonate dehydratase [Carbohydrate transport and metabolism] Probab=65.66 E-value=11 Score=18.39 Aligned_cols=76 Identities=14% Similarity=0.259 Sum_probs=32.7 Q ss_pred HHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHC----CCCCEEEE-CCHHH-HCCHHHHHHHHHHHHH Q ss_conf 999999997266756704786225777654430268999989976----99839997-44054-2212558887589999 Q gi|254780200|r 232 KTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHD----GIKSLAII-TPGFS-SDCLETSYEIAHEAKE 305 (343) Q Consensus 232 ~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~----G~k~v~v~-p~gFv-sD~lETl~Eidie~~e 305 (343) +..+.+.+...-+.|-..+|=-| +|- =.--++.+.+...+.. -.+||-.- +-+|. +.|+|---|+ +...+ T Consensus 230 ed~~~ll~~vdSp~NGiTlCtGS-~G~--~~dnDl~~mir~F~~rI~F~H~Rnv~~~~~~sF~E~~hleG~~Dm-~~vmk 305 (362) T COG1312 230 EDYQRLLEMVDSPYNGITLCTGS-LGA--RADNDLPEMIREFASRIHFVHLRNVKREEPGSFHETAHLEGDTDM-VAVMK 305 (362) T ss_pred HHHHHHHHHCCCCCCCEEEECCC-CCC--CCCCCHHHHHHHHHHHCCHHHHHHCCCCCCCCEEECCCCCCCCCH-HHHHH T ss_conf 99999997336876776773464-346--753458999999851110322112146799870133566798449-99999 Q ss_pred HHHHCC Q ss_conf 999759 Q gi|254780200|r 306 IFVNGG 311 (343) Q Consensus 306 ~~~e~G 311 (343) ...+.| T Consensus 306 a~~ee~ 311 (362) T COG1312 306 ALHEEG 311 (362) T ss_pred HHHHCC T ss_conf 999644 No 55 >pfam04273 DUF442 Putative phosphatase (DUF442). Although this domain is uncharacterized it seems likely that it performs a phosphatase function. Probab=65.57 E-value=11 Score=18.38 Aligned_cols=65 Identities=9% Similarity=0.151 Sum_probs=33.1 Q ss_pred HHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECC--CCCCHHHHHHHHHHHHH Q ss_conf 99998997699839997440542212558887589999999759975898168--98798999999999999 Q gi|254780200|r 268 KTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPC--LNSSNLSIDLLEKITRR 337 (343) Q Consensus 268 ~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~--lN~~~~fi~~La~lv~~ 337 (343) +-++.|+++|+|.|+-.=|---..+--+. -+.++.+.++| ..|.++|- -+-++.-|+.+++++.+ T Consensus 18 ~di~~la~~GfktIInnRPd~E~~~qp~~----~~~~~~a~~~G-l~y~~iPv~~~~~t~~~v~~f~~~l~~ 84 (110) T pfam04273 18 DDIAAAARAGFRSVINNRPDGEEPGQPSN----AAEQAAARAAG-LAYRFIPVISGQITEADVEAFQRALAA 84 (110) T ss_pred HHHHHHHHCCCCEEEECCCCCCCCCCCCH----HHHHHHHHHCC-CEEEEEECCCCCCCHHHHHHHHHHHHH T ss_conf 99999998598388533888777899888----99999999839-979996447789899999999999985 No 56 >cd04182 GT_2_like_f GT_2_like_f is a subfamily of the glycosyltransferase family 2 (GT-2) with unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families. Probab=65.39 E-value=11 Score=18.35 Aligned_cols=24 Identities=21% Similarity=0.501 Sum_probs=19.3 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 045424789999987099658996 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRLLIF 143 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~ii~l 143 (343) .+.|.++.+++.+...++++++++ T Consensus 24 ~g~pli~~~i~~~~~~~~~~i~vv 47 (186) T cd04182 24 DGKPLLRHALDAALAAGLSRVIVV 47 (186) T ss_pred CCCCHHHHHHHHHHHCCCCCEEEC T ss_conf 973199999999997499832523 No 57 >PRK01506 consensus Probab=64.44 E-value=12 Score=18.23 Aligned_cols=163 Identities=14% Similarity=0.142 Sum_probs=82.4 Q ss_pred HHHHHHHCCCCEE-EEEECCHHHHHHHH----HH--HHHHHHHHHHHHCCC------CCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 9999987099658-99631711100146----89--999999999871799------86421334557800689999999 Q gi|254780200|r 128 IINNLREEGCDRL-LIFPLYPQYSAATT----GT--AQDKVFQELIHMRWS------PSLRTVPPYYEDSDYISALAQSI 194 (343) Q Consensus 128 ~l~~l~~~g~~~i-i~lPLyPqyS~sTt----gS--~~~~~~~~l~~~~~~------~~~~~I~~~~~~p~yI~a~~~~I 194 (343) .+..+++.|.++| +++|-|| ||+-.- |. +.+.+.+.+...+.. +...-+..||+-|.-.=.-...+ T Consensus 79 ~i~a~r~~gA~~It~ViPY~~-YaRQDr~~~~ge~isak~vA~ll~~~G~d~vitvDlH~~~i~~fF~ip~~~l~~~~~l 157 (317) T PRK01506 79 MIDALKRASAKTINIVIPYYG-YARQDRKARSREPITAKLVANLLETAGATRVITLDLHAPQIQGFFDIPIDHLMGVPIL 157 (317) T ss_pred HHHHHHHCCCCEEEEECCCCC-CCCCCCCCCCCCCEEHHHHHHHHHCCCCCEEEEECCCCHHHHHCCCCCCCEEEECHHH T ss_conf 999988738974899626765-2437643358876138988878741588537996278487750169984315850888 Q ss_pred HHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCC-CC------------ Q ss_conf 99997338998669998336750267627998889999999999972667567047862257776-54------------ Q gi|254780200|r 195 REHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRI-KC------------ 261 (343) Q Consensus 195 ~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~-~W------------ 261 (343) .++++..+.+ ..++.|= +.| ..+-++.+++.||.+ +.+....|.++. -. T Consensus 158 ~~~~~~~~~~-~~vVVsP--------D~G------a~kra~~~a~~L~~~---~~~~~K~R~~~~~v~~~~~~gdV~Gr~ 219 (317) T PRK01506 158 SDYFETKGLK-DIVIVSP--------DHG------GVTRARKMADRLKAP---IAIIDKRRPRPNVAEVMNIIGNIEGKT 219 (317) T ss_pred HHHHHHCCCC-CCEEEEC--------CCH------HHHHHHHHHHHCCCC---EEEEEEECCCCCEEEEECCCCCCCCCE T ss_conf 8899854887-7489904--------924------899999999972996---688986417998046302456557962 Q ss_pred ---------CCHHHHHHHHHHHHCCCCCEEEECC-HHHH-CCHHHHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf ---------4302689999899769983999744-0542-212558887589999999759975898168 Q gi|254780200|r 262 ---------LEPPTDKTVEKLAHDGIKSLAIITP-GFSS-DCLETSYEIAHEAKEIFVNGGGEKFTQVPC 320 (343) Q Consensus 262 ---------L~P~t~~~l~~L~~~G~k~v~v~p~-gFvs-D~lETl~Eidie~~e~~~e~Gg~~~~rvp~ 320 (343) .+=++.++.+.|.++|.++|.+++. |..+ +.+|-| .++|........+ T Consensus 220 vIIVDDiI~TGgTl~~aa~~Lk~~GA~~V~~~~THglfs~~A~~rl-----------~~s~i~~iv~TnT 278 (317) T PRK01506 220 AILIDDIIDTAGTITLAANALVENGASEVYACCTHPVLSGPAIERI-----------QNSKIKELVVTNS 278 (317) T ss_pred EEEECCHHHCCCHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHH-----------HCCCCCEEEEECC T ss_conf 7875132213505999999998659975899997700681799998-----------6189988999388 No 58 >PRK04117 consensus Probab=63.53 E-value=12 Score=18.12 Aligned_cols=164 Identities=15% Similarity=0.130 Sum_probs=80.1 Q ss_pred HHHHHHCCCCEEE-EEECCHHHHHHHH----H--HHHHHHHHHHHHHCCC----CCC--CCCCCCCCCCHHHHHHHHHHH Q ss_conf 9999870996589-9631711100146----8--9999999999871799----864--213345578006899999999 Q gi|254780200|r 129 INNLREEGCDRLL-IFPLYPQYSAATT----G--TAQDKVFQELIHMRWS----PSL--RTVPPYYEDSDYISALAQSIR 195 (343) Q Consensus 129 l~~l~~~g~~~ii-~lPLyPqyS~sTt----g--S~~~~~~~~l~~~~~~----~~~--~~I~~~~~~p~yI~a~~~~I~ 195 (343) +..+++.|..+|. ++|-|| ||+-.- | -+.+.+.+.+...+.. .++ .-+..||+-|.-.=.-...+. T Consensus 74 ~~a~r~agA~~It~ViPY~~-YsRQDr~~~~ge~isak~vA~lL~~~G~d~vitvDlH~~~i~~fF~ip~~~l~~~~~~~ 152 (309) T PRK04117 74 TDALRRSSANSITAVVPYFG-YARQDRKAAPRVPITAKLVANLMQAAGIDRVVTMDLHAGQIQGFFDIPVDNLYGSIVFR 152 (309) T ss_pred HHHHHHCCCCEEEEEECCCC-CCCCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCHHHHHHCCCCCCCEEECCHHHH T ss_conf 99998748973889603550-02144456999870189999888642776589961786887403689733034538778 Q ss_pred HHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCC--C------------ Q ss_conf 99973389986699983367502676279988899999999999726675670478622577765--4------------ Q gi|254780200|r 196 EHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIK--C------------ 261 (343) Q Consensus 196 ~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~--W------------ 261 (343) +++...+.++ .++.| | + .-..+-++.+++.||.+ +.++...|.++.+ = T Consensus 153 ~~i~~~~~~~-~vvVs----P----D------~Ga~~ra~~~A~~Lg~~---~~~~~K~R~~~~~~~~~~~~gdV~Gk~v 214 (309) T PRK04117 153 DYIKSKNLKN-PIIAS----P----D------IGGVARARYFAKKLGLD---MVIVDKRREKANESEVMNIIGDVKGKDV 214 (309) T ss_pred HHHHHHCCCC-CEEEC----C----C------CHHHHHHHHHHHHCCCC---EEEEEECCCCCCCCCCCCCCCCCCCCEE T ss_conf 9999618887-46971----5----9------31999999998742788---7998511378885432344566489868 Q ss_pred --------CCHHHHHHHHHHHHCCCCCEEEECC-HHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf --------4302689999899769983999744-05422125588875899999997599758981689 Q gi|254780200|r 262 --------LEPPTDKTVEKLAHDGIKSLAIITP-GFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCL 321 (343) Q Consensus 262 --------L~P~t~~~l~~L~~~G~k~v~v~p~-gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~l 321 (343) .+=++.++.+.|.++|.++|.+++. |-.++. +.+...++|.++.....++ T Consensus 215 IIVDDiI~TGgTi~~aa~~L~~~GA~~V~~~~THglfs~~----------A~~rl~~s~i~~ivvTnTi 273 (309) T PRK04117 215 ILVDDMIDTAGTIVKAAEALKEKGATSVMACCTHAVLSGP----------AYERIAKGALDELVVTNTI 273 (309) T ss_pred EEECCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCH----------HHHHHHHCCCCEEEECCCC T ss_conf 9757256527999999999986699748999945545836----------9999983899789982898 No 59 >PRK06765 homoserine O-acetyltransferase; Provisional Probab=63.24 E-value=12 Score=18.08 Aligned_cols=66 Identities=11% Similarity=0.139 Sum_probs=40.4 Q ss_pred HHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCC-CEEEEECCCCCCHHHH---HHHHHHHHH Q ss_conf 268999989976998399974405422125588875899999997599-7589816898798999---999999999 Q gi|254780200|r 265 PTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGG-EKFTQVPCLNSSNLSI---DLLEKITRR 337 (343) Q Consensus 265 ~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg-~~~~rvp~lN~~~~fi---~~La~lv~~ 337 (343) ++.++|... . -+++| +++-+|-+=+-+|..-.++. ..++|. ..|..+++...|+.|. +.+...|++ T Consensus 308 ~l~~aL~~I---~-a~~Lv--i~i~sD~lFpp~e~~e~~~~-l~~~~~~~~~~eI~S~~GHDaFLie~~~~~~~i~~ 377 (387) T PRK06765 308 SLEEALSNI---E-ANVLM--IPCKQDLLQPPRYNYKMVDI-LQKQGKYAEVYEIEGINGHMAGVFDIHLFEKKIYE 377 (387) T ss_pred CHHHHHHHC---C-CCEEE--EEECCCCCCCHHHHHHHHHH-HHHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHH T ss_conf 899999738---9-98899--98577317698999999999-98559987999979999836106289999999999 No 60 >KOG3062 consensus Probab=63.00 E-value=7.3 Score=19.69 Aligned_cols=52 Identities=15% Similarity=0.146 Sum_probs=22.6 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHH Q ss_conf 99999986522478430011210454247899999870996589963171110 Q gi|254780200|r 98 QATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYS 150 (343) Q Consensus 98 qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS 150 (343) +..++++.|.. +++.+.-+..|-+=|--+-.-.-++.+...+++...-||-- T Consensus 63 L~S~v~R~Lsk-~~iVI~DslNyIKGfRYeLyC~ak~~~tt~Cvv~t~vp~e~ 114 (281) T KOG3062 63 LRSAVDRSLSK-GDIVIVDSLNYIKGFRYELYCEAKAARTTYCVVHTAVPQEL 114 (281) T ss_pred HHHHHHHHCCC-CCEEEEECCCCCCCCEEEEEEEHHCCCEEEEEEEECCCHHH T ss_conf 99999853356-86899814441122020365510105446899985279899 No 61 >PRK01999 consensus Probab=62.64 E-value=13 Score=18.01 Aligned_cols=162 Identities=14% Similarity=0.217 Sum_probs=78.9 Q ss_pred HHHHHHCCCCEEE-EEECCHHHHHHHH----H--HHHHHHHHHHHHHCCC------CCCCCCCCCCCCCHH-HHHHHHHH Q ss_conf 9999870996589-9631711100146----8--9999999999871799------864213345578006-89999999 Q gi|254780200|r 129 INNLREEGCDRLL-IFPLYPQYSAATT----G--TAQDKVFQELIHMRWS------PSLRTVPPYYEDSDY-ISALAQSI 194 (343) Q Consensus 129 l~~l~~~g~~~ii-~lPLyPqyS~sTt----g--S~~~~~~~~l~~~~~~------~~~~~I~~~~~~p~y-I~a~~~~I 194 (343) +..+++.|..+|. ++|-|| ||+--- + -+.+.+.+.+...+.. +...-+..||+-|.- +.+ ...+ T Consensus 75 ~dA~r~~~A~~It~ViPY~~-YaRQDr~~~~~e~isak~va~lL~~~G~d~vitvDlH~~~i~gfF~ipv~~l~a-~~~~ 152 (311) T PRK01999 75 IDAAKRASARSITAVIPYFG-WARQDRKDKPRVSIGAKLVADLLSVAGIDRLITMDLHADQIQGFFDKPVDHLYA-STVF 152 (311) T ss_pred HHHHHHCCCCEEEEECCCCC-CCCCCCCCCCCCCEEHHHHHHHHHCCCCCEEEEEECCCHHHHHCCCCCCCCCCC-CHHH T ss_conf 98898658863899625544-333766568998676777443430157865899826838887447998764437-5889 Q ss_pred HHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCC-------------- Q ss_conf 999973389986699983367502676279988899999999999726675670478622577765-------------- Q gi|254780200|r 195 REHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIK-------------- 260 (343) Q Consensus 195 ~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~-------------- 260 (343) .++++....++ .+|.| |. . ...+-++.+++.||.+ +.++.-.|.++.+ T Consensus 153 ~~~~~~~~~~~-~vvVs----PD----~------g~~kra~~~A~~Lg~~---~~~~~K~R~~~~~v~~~~~~gdV~Gk~ 214 (311) T PRK01999 153 LPYIRSLKLDN-LVIAT----PD----V------GGSKRASAYSKYLGVP---VVICYKSREKANEIASMQIIGDVKGKN 214 (311) T ss_pred HHHHHHCCCCC-EEEEC----CC----H------HHHHHHHHHHHHHCCC---EEEEECCCCCCCCCEECCCCCCCCCCE T ss_conf 99999639876-08981----59----0------4999999999861787---233110246788411313343547966 Q ss_pred -C-------CCHHHHHHHHHHHHCCCCCEEEECC-HH-HHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf -4-------4302689999899769983999744-05-422125588875899999997599758981689 Q gi|254780200|r 261 -C-------LEPPTDKTVEKLAHDGIKSLAIITP-GF-SSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCL 321 (343) Q Consensus 261 -W-------L~P~t~~~l~~L~~~G~k~v~v~p~-gF-vsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~l 321 (343) . .+=++.++.+.|.++|.++|.+++. |- .-+.+|- ..++|..+.....++ T Consensus 215 ~IIVDDiIdTGgTl~~aa~~L~~~GA~~V~~~~THglfs~~A~~r-----------l~~s~i~~iv~TdTi 274 (311) T PRK01999 215 VILVDDMVDTAGTITKAADLMMEKGAKSVRAIASHAVMSGPASER-----------VENSKLTEMIFTDSI 274 (311) T ss_pred EEEECCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHH-----------HHHCCCCEEEECCCC T ss_conf 999754444347899999999855997389999455458559999-----------974899889983785 No 62 >COG0761 lytB 4-Hydroxy-3-methylbut-2-enyl diphosphate reductase IspH [Lipid metabolism] Probab=62.58 E-value=13 Score=18.00 Aligned_cols=46 Identities=20% Similarity=0.192 Sum_probs=28.6 Q ss_pred HHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 99998997699839997440542212558887589999999759975898168987 Q gi|254780200|r 268 KTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNS 323 (343) Q Consensus 268 ~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~ 323 (343) +.+.+|+.+ -.+.++--|+=|-|.--|+|+ +.+.|...| .|...++ T Consensus 204 ~Avk~la~~--~Dl~iVVG~~nSSNs~rL~ei-------A~~~g~~ay-lId~~~e 249 (294) T COG0761 204 DAVKELAPE--VDLVIVVGSKNSSNSNRLAEI-------AKRHGKPAY-LIDDAEE 249 (294) T ss_pred HHHHHHHHC--CCEEEEECCCCCCCHHHHHHH-------HHHHCCCEE-EECCHHH T ss_conf 999987603--889999789898667899999-------998599818-9689376 No 63 >COG1453 Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only] Probab=61.06 E-value=14 Score=17.83 Aligned_cols=114 Identities=11% Similarity=0.080 Sum_probs=50.5 Q ss_pred HHHHHHHHHCCCCCCCHHHHHC----CCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999986522478430011210----454247899999870996589963171110014689999999999871799864 Q gi|254780200|r 99 ATNLAKRLESISSIVVDWAMRY----GKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSL 174 (343) Q Consensus 99 a~~L~~~L~~~~~~~V~~amry----~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~ 174 (343) .+.+...++.+-+ -|..|+-| ++|++..+|.+-.+..+.=..-+|..|--+. ..--+.+.+.|.+.+. T Consensus 37 ~~~i~~aie~GiN-yidTA~~Yh~g~sE~~lgkaL~~~~Rekv~LaTKlp~~~~~~~---edm~r~fneqLekl~~---- 108 (391) T COG1453 37 NETIDYAIEHGIN-YIDTAWPYHGGESEEFLGKALKDGYREKVKLATKLPSWPVKDR---EDMERIFNEQLEKLGT---- 108 (391) T ss_pred HHHHHHHHHCCCC-EEEECCCCCCCCCHHHHHHHHHHCCCCEEEEEEECCCCCCCCH---HHHHHHHHHHHHHHCC---- T ss_conf 9999999972876-4762032458874699999864215113798740587534479---9999999999998488---- Q ss_pred CCCCCCCCCCHHHHHHHHH----HHHHHHHCC--CCCCEEEEEECCCCHHHH Q ss_conf 2133455780068999999----999997338--998669998336750267 Q gi|254780200|r 175 RTVPPYYEDSDYISALAQS----IREHFESIQ--WTPEMLLVSFHQMPVSYL 220 (343) Q Consensus 175 ~~I~~~~~~p~yI~a~~~~----I~~~l~~~~--~~~~~llfSaHglP~~~~ 220 (343) ..++-|--|-.=-+.|... .-+++++.. ++=.++-||+||-+.-.. T Consensus 109 Dy~D~yliH~l~~e~~~k~~~~g~~df~~kak~eGkIr~~GFSfHgs~e~~~ 160 (391) T COG1453 109 DYIDYYLIHGLNTETWEKIERLGVFDFLEKAKAEGKIRNAGFSFHGSTEVFK 160 (391) T ss_pred CHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHHCCCEEEEEECCCCCHHHHH T ss_conf 6101020111467789999703748999999856967886315789789999 No 64 >PRK06849 hypothetical protein; Provisional Probab=60.02 E-value=14 Score=17.70 Aligned_cols=17 Identities=6% Similarity=0.012 Sum_probs=10.1 Q ss_pred HHHHHHHHHHHHHHCCC Q ss_conf 89999999999972667 Q gi|254780200|r 228 CHCHKTARLLKEFLSWP 244 (343) Q Consensus 228 ~q~~~t~~~i~~~l~~~ 244 (343) .++.+.++.+.+.+++. T Consensus 236 p~i~e~v~~f~~~~~~t 252 (387) T PRK06849 236 PRIEEFVTHFVKELNYT 252 (387) T ss_pred HHHHHHHHHHHHHCCCE T ss_conf 89999999999847851 No 65 >TIGR00677 fadh2_euk methylenetetrahydrofolate reductase; InterPro: IPR004621 The enzyme activities methylenetetrahydrofolate reductase (1.5.1.20 from EC) and 5,10-methylenetetrahydrofolate reductase (FADH) (1.7.99.5 from EC) differ in that the former (assigned in many eukaryotes) is defined to use NADP+ as an acceptor, while the latter (assigned in many bacteria) is flexible with respect to the acceptor. Both convert 5-methyltetrahydrofolate to 5,10-methylenetetrahydrofolate. From a larger set of proteins assigned as one or the other, this family describes the subset of proteins found in eukaryotes, and currently designated methylenetetrahydrofolate reductase (1.5.1.20 from EC). This protein is an FAD-containing flavoprotein.; GO: 0004489 methylenetetrahydrofolate reductase (NADPH) activity, 0006555 methionine metabolic process. Probab=59.91 E-value=14 Score=17.69 Aligned_cols=53 Identities=9% Similarity=0.163 Sum_probs=39.9 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHH---HHHHHHHHCCCCEEEEE Q ss_conf 87404577759999999986522478430011210454247---89999987099658996 Q gi|254780200|r 86 KNESILRTHTRDQATNLAKRLESISSIVVDWAMRYGKPSVK---EIINNLREEGCDRLLIF 143 (343) Q Consensus 86 gg~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~---~~l~~l~~~g~~~ii~l 143 (343) .||+- ...|-..+...|+.+ .+++-+-|.+++=.++ ++|+..+..||++|++| T Consensus 45 AGG~~-~~~t~~ias~~q~~~----~~et~MHLTCTN~~~e~ID~AL~~~~~~G~~NiLAL 100 (312) T TIGR00677 45 AGGTT-AELTLTIASTAQNVL----GVETCMHLTCTNMPIEMIDDALERAKSNGIQNILAL 100 (312) T ss_pred CCCCC-HHHHHHHHHHHHHHH----CCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEECC T ss_conf 78861-456699999987551----455300355789757889999999986565334003 No 66 >PRK00077 eno phosphopyruvate hydratase; Provisional Probab=59.70 E-value=14 Score=17.67 Aligned_cols=186 Identities=15% Similarity=0.183 Sum_probs=91.5 Q ss_pred CCCEEEEEEC-CHHHHHH--HHHHHHHHHHHHHHHHCCCCC----CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCC-CE Q ss_conf 9965899631-7111001--468999999999987179986----42133455780068999999999997338998-66 Q gi|254780200|r 136 GCDRLLIFPL-YPQYSAA--TTGTAQDKVFQELIHMRWSPS----LRTVPPYYEDSDYISALAQSIREHFESIQWTP-EM 207 (343) Q Consensus 136 g~~~ii~lPL-yPqyS~s--TtgS~~~~~~~~l~~~~~~~~----~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~-~~ 207 (343) .+++.+++|. ++-|+.+ ....++..+.+.+++.+.... =-|.+++-++..-++.+.+.|.. ...++ .. T Consensus 161 diQEfmIiP~ga~sf~eal~~~~ev~~~Lk~~L~~kg~~t~vGDEGGfaP~~~~~eeaL~ll~~AI~~----aGy~~G~d 236 (427) T PRK00077 161 DIQEFMIMPVGAPSFKEALRMGAEVFHALKKVLKDKGLSTAVGDEGGFAPNLKSNEEALELILEAIEK----AGYEPGED 236 (427) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH----CCCCCCCE T ss_conf 61478871799888899999999999999999986599876567877489998849999999999997----08887870 Q ss_pred EEE----EECCCC--HHHHHHCC---CCHHHHHHHHHHHHHHH-------CCCCCCCE--EEEECCCCC-----CCCCCH Q ss_conf 999----833675--02676279---98889999999999972-------66756704--786225777-----654430 Q gi|254780200|r 208 LLV----SFHQMP--VSYLLKGD---PYGCHCHKTARLLKEFL-------SWPDDRFK--ICFQSRFGR-----IKCLEP 264 (343) Q Consensus 208 llf----SaHglP--~~~~~~gD---pY~~q~~~t~~~i~~~l-------~~~~~~~~--l~fQSr~Gp-----~~WL~P 264 (343) +-+ .|+.+= .+|.-+++ --..+..+....+.++. ++.+++|. -.+..++|. +-=|-- T Consensus 237 i~ialD~AASefy~~gkY~~~~~~~~~s~~eli~~y~~l~~~YPI~sIEDp~~EdD~~gw~~lt~~lg~k~~iVGDDL~v 316 (427) T PRK00077 237 IALALDCAASEFYEDGKYVLEGEGKVLTSEEQIDYLAELVDKYPIVSIEDGLDENDWDGWALLTERLGDKVQLVGDDLFV 316 (427) T ss_pred EEEEECCCHHHHHHCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHHHHCCCEEEECCCCCC T ss_conf 89972420676752782465477764588999999999998799778746888226899999999709967998884134 Q ss_pred HHHHHHHHHHHCCCCCEEEEC---CHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHH Q ss_conf 268999989976998399974---405422125588875899999997599758981689879899999999 Q gi|254780200|r 265 PTDKTVEKLAHDGIKSLAIIT---PGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEK 333 (343) Q Consensus 265 ~t~~~l~~L~~~G~k~v~v~p---~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~ 333 (343) .-.+.|++-.+++.-|-+++= +|-++|-+| +-+++.++|.....--.+=-+.+.||.-||= T Consensus 317 Tn~~rl~kgi~~~~~NaiLiK~NQiGTvset~e--------ai~la~~~g~~~ivShRSGETeD~~IaDLAV 380 (427) T PRK00077 317 TNPKILKEGIEKGAANSILIKVNQIGTLTETLE--------AIELAKRAGYTAVISHRSGETEDTTIADLAV 380 (427) T ss_pred CCHHHHHHHHHCCCCCEEEEECCCCCCHHHHHH--------HHHHHHHCCCEEEEECCCCCCCCCHHHHHHH T ss_conf 688999725540766479984434673999999--------9999998698799978989866215999898 No 67 >TIGR00249 sixA phosphohistidine phosphatase SixA; InterPro: IPR004449 Phosphohistidine phosphatase SixA (3.1.3 from EC) from Escherichia coli exhibits phosphatase activity towards the HPT domain of the ArcB sensor involved in the multistep his-asp phosphorelay.; GO: 0008969 phosphohistidine phosphatase activity. Probab=59.56 E-value=14 Score=17.65 Aligned_cols=52 Identities=17% Similarity=0.279 Sum_probs=28.8 Q ss_pred HHHHHHHHHHHC--CCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 999999999726--6756704786225777654430268999989976998399974 Q gi|254780200|r 231 HKTARLLKEFLS--WPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIIT 285 (343) Q Consensus 231 ~~t~~~i~~~l~--~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p 285 (343) +.|++.|++.++ ++. +.+ . -+-..|..=++=-+.|.+++|.++|++.|++|. T Consensus 64 ~qT~~~v~~~~~leL~~-~~e-~-~~~L~P~~di~~G~~~~le~l~~~gv~sv~lvg 117 (169) T TIGR00249 64 EQTAEIVADALDLELES-EAE-V-LEGLTPLGDIGLGVADYLEKLKEEGVKSVLLVG 117 (169) T ss_pred HHHHHHHHHHCCCCCCC-HHH-C-CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 89999999870874631-110-0-368887331102089999875206846899850 No 68 >cd01020 TroA_b Metal binding protein TroA_b. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=57.33 E-value=16 Score=17.40 Aligned_cols=143 Identities=12% Similarity=0.118 Sum_probs=78.3 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECC- Q ss_conf 3345578006899999999999733899866999833675026762799888999999999997266756704786225- Q gi|254780200|r 177 VPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSR- 255 (343) Q Consensus 177 I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr- 255 (343) =+++|-+|.....+++.|.+.|.+.+-.. -..|-.+..-|..++.+.-+.+.+.+.-..++-.++|... T Consensus 94 dPHiWldp~~~~~~a~~I~~~L~~~dP~~----------~~~y~~N~~~~~~~L~~L~~~~~~~~~~~~~~~~v~~H~af 163 (264) T cd01020 94 NPHLWYDPETMSKVANALADALVKADPDN----------KKYYQANAKKFVASLKPLAAKIAELSAKYKGAPVAATEPVF 163 (264) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHCCHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCH T ss_conf 98937799999999999999999869021----------79999999999999999999999851457898699965417 Q ss_pred ------CCCC----C---C-----CCH--HHHHHHHH-HHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCE Q ss_conf ------7776----5---4-----430--26899998-997699839997440542212558887589999999759975 Q gi|254780200|r 256 ------FGRI----K---C-----LEP--PTDKTVEK-LAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEK 314 (343) Q Consensus 256 ------~Gp~----~---W-----L~P--~t~~~l~~-L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~ 314 (343) +|=. . | -+| .....+.+ +.+.|+ +++++-+.|..--.|++- +.+.+.|..- T Consensus 164 ~Yf~~~yGl~~~~~~~~~~~~~~~~eps~~~l~~l~~~ik~~~v-~~If~e~q~~~~~a~~i~-------~~a~~~~v~v 235 (264) T cd01020 164 DYLLDALGMKERTPKGYTATTESETEPSPADIAAFQNAIKNRQI-DALIVNPQQASSATTNIT-------GLAKRSGVPV 235 (264) T ss_pred HHHHHHCCCEEECCCCEEECCCCCCCCCHHHHHHHHHHHHHCCC-CEEEEECCCCHHHHHHHH-------HHHHHCCCEE T ss_conf 99999779945044203412378899999999999999998699-999982898808999999-------9998559569 Q ss_pred EEEECCCCCCHHHHHHHHHHHHH Q ss_conf 89816898798999999999999 Q gi|254780200|r 315 FTQVPCLNSSNLSIDLLEKITRR 337 (343) Q Consensus 315 ~~rvp~lN~~~~fi~~La~lv~~ 337 (343) ............+++++..-++. T Consensus 236 ~~l~e~~~~~~tY~~~M~~Nl~~ 258 (264) T cd01020 236 VEVTETMPNGTTYLTWMLKQVDQ 258 (264) T ss_pred EECCCCCCCCCCHHHHHHHHHHH T ss_conf 84256789999899999999999 No 69 >TIGR00110 ilvD dihydroxy-acid dehydratase; InterPro: IPR004404 Two dehydratases, dihydroxy-acid dehydratase (gene ilvD or ILV3) and 6-phosphogluconate dehydratase (gene edd) have been shown to be evolutionary related . Dihydroxy-acid dehydratase catalyzes the fourth step in the biosynthesis of isoleucine and valine, the dehydratation of 2,3-dihydroxy-isovaleic acid into alpha-ketoisovaleric acid. 6-Phosphogluconate dehydratase catalyzes the first step in the Entner-Doudoroff pathway, the dehydratation of 6-phospho-D-gluconate into 6-phospho-2-dehydro-3-deoxy-D-gluconate. Another protein containing this signature is the Escherichia coli hypothetical protein yjhG. The N-terminal part of the proteins contains a cysteine that could be involved in the binding of a 2Fe-2S iron-sulphur cluster . This family represents dihydroxy-acid dehydratase (DAD). It contains a catalytically essential [4Fe-4S] cluster and catalyses the fourth step in valine and isoleucine biosynthesis.; GO: 0004160 dihydroxy-acid dehydratase activity, 0009082 branched chain family amino acid biosynthetic process. Probab=56.36 E-value=14 Score=17.76 Aligned_cols=56 Identities=20% Similarity=0.355 Sum_probs=34.5 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCC--------------HHHHHCCCCCHH---HHHHHHHHC-CCCEEEEEE Q ss_conf 04577759999999986522478430--------------011210454247---899999870-996589963 Q gi|254780200|r 89 SILRTHTRDQATNLAKRLESISSIVV--------------DWAMRYGKPSVK---EIINNLREE-GCDRLLIFP 144 (343) Q Consensus 89 SPL~~~t~~qa~~L~~~L~~~~~~~V--------------~~amry~~P~i~---~~l~~l~~~-g~~~ii~lP 144 (343) -|=..+-+.+++.+++.+.+-+.++. .-||||+=||-| |+++.|... ..|=+|++| T Consensus 31 vPGh~hL~dl~~~v~~~i~~AGgva~eFNTI~i~DGIAMGh~GM~YSLpSREiIADsvE~m~~Ah~~Dg~v~Ip 104 (601) T TIGR00110 31 VPGHVHLRDLGKRVKEGIEAAGGVAFEFNTIAICDGIAMGHEGMKYSLPSREIIADSVETMVNAHRFDGLVCIP 104 (601) T ss_pred CCCCCCHHHHHHHHHHHHHHHCCCCEECCEEEEECCCCCCCCCCCEECCCCHHHHHHHHHHHHHHHCCCEEEEC T ss_conf 36732488899999999997088601046337757710288765101454012021289998874169379815 No 70 >PRK00175 metX homoserine O-acetyltransferase; Provisional Probab=55.96 E-value=16 Score=17.25 Aligned_cols=68 Identities=9% Similarity=0.190 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCC-CEEEEECCCCCCHHHH---HHHHHHHHHHH Q ss_conf 268999989976998399974405422125588875899999997599-7589816898798999---99999999984 Q gi|254780200|r 265 PTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGG-EKFTQVPCLNSSNLSI---DLLEKITRREL 339 (343) Q Consensus 265 ~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg-~~~~rvp~lN~~~~fi---~~La~lv~~~l 339 (343) ++.++|... . -+++|+ ++-+|-+=..+|....++.+ .++|. ..|.-+++...|+.|. +-+..+|+..| T Consensus 299 dl~~AL~~I---~-a~~Lvi--~i~sD~lFPp~~~~~~a~~l-~~~~~~~~~~eI~S~~GHDaFLie~~~~~~~i~~FL 370 (379) T PRK00175 299 DLAAALAGI---K-ARFLVV--SFTTDWLFPPARSREIVDAL-LAAGADVSYAEIDSPHGHDAFLLDDPRYGPLVRAFL 370 (379) T ss_pred CHHHHHHCC---C-CCEEEE--ECCCCCCCCHHHHHHHHHHH-HHCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 899997308---9-988999--60755072899999999999-866997699996899974501558899999999999 No 71 >PRK13137 consensus Probab=55.90 E-value=16 Score=17.24 Aligned_cols=220 Identities=15% Similarity=0.174 Sum_probs=106.9 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH-HHHHCCCCCCCHHHHHHHHH Q ss_conf 99997178889999889999999874895441178688899876665125747889999-98707788740457775999 Q gi|254780200|r 20 GVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAY-AKIWNTAKNESILRTHTRDQ 98 (343) Q Consensus 20 gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y-~~Iw~~~gg~SPL~~~t~~q 98 (343) =|-.+.-|-|+-. .-...+..+...-++|++.-++--|+--| +-+++.+ +.+ ..|--+.. ...+ T Consensus 27 li~yitaG~P~~~---~s~~~~~~l~~gaDiiElGiPFSDP~ADG-------PvIQ~A~~~AL----~~G~~l~~-~l~~ 91 (266) T PRK13137 27 FIPFLTAGYPSAA---GFLQVAEELLAYADLLEVGIPYSDPLGDG-------PTIQRASEQAL----AGGTSTRR-TLEL 91 (266) T ss_pred EEEEECCCCCCHH---HHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHH-HHHH T ss_conf 8878668188878---99999999973899899789988856657-------99999999999----77986778-9999 Q ss_pred HHHHHHHHHCCCCCCCHHHHHCCCCC----HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99999865224784300112104542----47899999870996589963171110014689999999999871799864 Q gi|254780200|r 99 ATNLAKRLESISSIVVDWAMRYGKPS----VKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSL 174 (343) Q Consensus 99 a~~L~~~L~~~~~~~V~~amry~~P~----i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~ 174 (343) .+.+.+ ..+.++ +-|-|.+|. +++-++++++.|++-+++ |=-|.- -.+++.+...+.+. .-+ T Consensus 92 ~~~~r~----~~~~Pi-vlM~Y~N~i~~yG~e~F~~~a~~aGvdGlIi-pDLP~e-------E~~~~~~~~~~~gi-~~I 157 (266) T PRK13137 92 VRELRA----LTDTPL-VIMTYLNPIYAVGPEEFMRLFQEAGVDGLIL-PDLPPD-------QDPEIADLAAEIGL-AVT 157 (266) T ss_pred HHHHCC----CCCCCE-EEEECHHHHHHHCHHHHHHHHHHCCCCEEEE-CCCCHH-------HHHHHHHHHHHCCC-CEE T ss_conf 997555----689878-9993458998758999999999769609994-799978-------88999999987599-789 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEEC Q ss_conf 21334557800689999999999973389986699983367502676279988899999999999726675670478622 Q gi|254780200|r 175 RTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQS 254 (343) Q Consensus 175 ~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQS 254 (343) .+|.+=-. -++|+.-.+...+ .....|.-|+-= .+..-...++.+..+.+.+.-++ +..++| T Consensus 158 ~lvaPtT~--------~eRi~~i~~~a~G--FiY~Vs~~GvTG---~r~~~~~~~l~~~i~~ik~~t~~---Pv~vGF-- 219 (266) T PRK13137 158 FLIAPTST--------PERVKLVARACTG--FLYAVSVTGVTG---MREGLALGEVPDMLRLARQYAAL---PVAVGF-- 219 (266) T ss_pred EEECCCCC--------HHHHHHHHHHCCC--CEEEEECCCCCC---CCCCCCHHHHHHHHHHHHHCCCC---CEEEEC-- T ss_conf 99379999--------9999999960888--289974467667---77667879999999999863899---879982-- Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHH Q ss_conf 5777654430268999989976998399974405422125588 Q gi|254780200|r 255 RFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSY 297 (343) Q Consensus 255 r~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~ 297 (343) |= -.| +..+.+.+. -.-+|+ =|=+.+++|--- T Consensus 220 --GI---s~~---e~~~~~~~~--aDGvIV-GSaiV~~i~e~~ 251 (266) T PRK13137 220 --GV---KDK---ATAAQVAQV--ADGVVV-GSAFINAVEEGR 251 (266) T ss_pred --CC---CCH---HHHHHHHHH--CCEEEE-HHHHHHHHHHCC T ss_conf --66---988---999999831--999998-099999999589 No 72 >TIGR02720 pyruv_oxi_spxB pyruvate oxidase; InterPro: IPR014092 Members of this family are examples of pyruvate oxidase (1.2.3.3 from EC), an enzyme with FAD and TPP as cofactors that catalyses the reaction pyruvate + phosphate + O2 + H2O = acetyl phosphate + CO2 + H2O2. It should not be confused with pyruvate dehydrogenase (1.2.2.2 from EC) as in Escherichia coli PoxB, although the E. coli enzyme is closely homologous and has pyruvate oxidase as an alternate name.. Probab=55.30 E-value=14 Score=17.69 Aligned_cols=56 Identities=21% Similarity=0.309 Sum_probs=31.8 Q ss_pred HHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 99999870996589963171110014689999999999871799864213345578006899999999999 Q gi|254780200|r 128 IINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHF 198 (343) Q Consensus 128 ~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l 198 (343) +|+=|++.|+++|.++| |.+++.+|.+|...+. ++++|.-=|.+-+ |+|...+.+| T Consensus 5 ~LkVl~~WgV~~iYG~p----------GGS~~S~M~AL~~~~~--rl~YiQVRHEe~G---AlAA~adaKL 60 (577) T TIGR02720 5 VLKVLEAWGVDHIYGIP----------GGSFNSTMDALAEERD--RLKYIQVRHEEVG---ALAAAADAKL 60 (577) T ss_pred HHHHHHHCCCCEEECCC----------CCCHHHHHHHHHHHHC--CCEEEEEEHHHHH---HHHHHHHHCC T ss_conf 77688656875651368----------8757789999986101--1204564047788---9999985303 No 73 >TIGR00216 ispH_lytB 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; InterPro: IPR003451 Terpenes are among the largest groups of natural products and include compounds such as vitamins, cholesterol and carotenoids. The biosynthesis of all terpenoids begins with one or both of the two C5 precursors of the pathway: isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). In animals, fungi, and certain bacteria, the synthesis of IPP and DMAPP occurs via the well-known mevalonate pathway, however, a second, nonmevalonate terpenoid pathway has been identified in many eubacteria, algae and the chloroplasts of higher plants . LytB(IspH) catalyses the conversion of 1-hydroy-2-methyl-2-(E)-butenyl 4-diphosphate into IPP and DMAPP in this second pathway The enzyme appears to be responsible for a branch-step in the nonmevalonate pathway, in that IPP and DMAPP are produced in parallel from a single precursor although the exact mechanism of this is not currently fully understood . Escherichia coli LytB protein had been found to regulate the activity of RelA (guanosine 3',5'-bispyrophosphate synthetase I), which in turn controls the level of a regulatory metabolite. It is involved in penicillin tolerance and the stringent response .; GO: 0019288 isopentenyl diphosphate biosynthetic process mevalonate-independent pathway. Probab=54.47 E-value=17 Score=17.09 Aligned_cols=19 Identities=26% Similarity=0.471 Sum_probs=11.7 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHH Q ss_conf 999971788899998899999998 Q gi|254780200|r 20 GVLLVNLGTPDGHDFFSLRRYLRE 43 (343) Q Consensus 20 gvLL~n~G~P~~~~~~~V~~yL~~ 43 (343) -|++=++|=| | ++++-|++ T Consensus 87 ~Vi~rAHGv~--~---~v~e~l~~ 105 (354) T TIGR00216 87 TVIIRAHGVP--P---EVREELEK 105 (354) T ss_pred EEEEECCCCC--H---HHHHHHHH T ss_conf 5999727989--8---89999985 No 74 >PRK13135 consensus Probab=54.28 E-value=17 Score=17.07 Aligned_cols=218 Identities=14% Similarity=0.153 Sum_probs=106.7 Q ss_pred CEEEEEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHH-HHHHHCCCCCCCHHHHHH Q ss_conf 6799997178889999889999999874-89544117868889987666512574788999-998707788740457775 Q gi|254780200|r 18 KIGVLLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHA-YAKIWNTAKNESILRTHT 95 (343) Q Consensus 18 K~gvLL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~-Y~~Iw~~~gg~SPL~~~t 95 (343) |-=|-.+..|-|+ .+.-.++++.+- +.-++|++.-++--|+--| +-+++. .+.+ .+|.-+... T Consensus 17 ~ali~yitaG~P~---~~~s~~~l~~l~~~GaDiiElGiPfSDP~ADG-------PvIq~A~~rAL----~~G~~~~~~- 81 (267) T PRK13135 17 KALVTFITAGDPD---LATTEALIPLLAESGADIIELGVPFSDPMADG-------PTIQLSSERAL----AAGTTLPRI- 81 (267) T ss_pred CEEEEEECCCCCC---HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHHH- T ss_conf 5489887171899---89999999999975999999789989866658-------99999999999----769849999- Q ss_pred HHHHHHHHHHHHCCCCCCCHHHHHCCCCC----HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999865224784300112104542----47899999870996589963171110014689999999999871799 Q gi|254780200|r 96 RDQATNLAKRLESISSIVVDWAMRYGKPS----VKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWS 171 (343) Q Consensus 96 ~~qa~~L~~~L~~~~~~~V~~amry~~P~----i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~ 171 (343) ..+.+.+.+. .+.++ +-|.|++|. ++.-++++++.|++-+++-=|-|.. .+++.+.+.+.+.. T Consensus 82 ~~~~~~~r~~----~~~Pi-vlM~Y~N~i~~yG~e~F~~~~~~~GvdGlIipDLP~ee--------~~~~~~~~~~~~l~ 148 (267) T PRK13135 82 LAMVRSVRRR----CQVPI-VLMGYYNPIFAYGLERFAADAAAAGVDGVLLVDLPPEE--------AEEFKACADRHGLD 148 (267) T ss_pred HHHHHHHHCC----CCCCE-EEEECCHHHHHHHHHHHHHHHHHCCCCEEECCCCCHHH--------HHHHHHHHHHCCCC T ss_conf 9999986335----89988-99842309988468999999997499747637899788--------89999999872961 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 86421334557800689999999999973389986699983367502676279988899999999999726675670478 Q gi|254780200|r 172 PSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKIC 251 (343) Q Consensus 172 ~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~ 251 (343) -+.+|.+--+ .++|+.-.+...+ .....|--|+-= ...+...++.+..+.+.+..++ +..++ T Consensus 149 -~I~lvsPtt~--------~~Ri~~i~~~s~G--FiY~Vs~~GvTG----~~~~~~~~~~~~i~~ik~~t~~---Pv~vG 210 (267) T PRK13135 149 -VIFLLTPTSD--------ESRIRTVARLGRG--FVYYVSVTGVTG----ARSGVEATVGGNVAKIREKITV---PVVVG 210 (267) T ss_pred -EEEEECCCCC--------HHHHHHHHHCCCC--EEEEEECCCCCC----CCCCCHHHHHHHHHHHHHHCCC---CEEEE T ss_conf -8998089895--------7999999961898--189985456667----7644448899999999860689---84898 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHH Q ss_conf 6225777654430268999989976998399974405422125 Q gi|254780200|r 252 FQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLE 294 (343) Q Consensus 252 fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lE 294 (343) | |= ++.+...++. .+ -.-+|+==+|| +.+| T Consensus 211 F----GI------~~~e~v~~i~-~~-ADGvIVGSaiV-k~ie 240 (267) T PRK13135 211 F----GI------STPQQAADVA-AM-ADGVVVGSALV-KLFE 240 (267) T ss_pred E----CC------CCHHHHHHHH-CC-CCEEEECHHHH-HHHH T ss_conf 1----67------9999999998-05-99999878999-9998 No 75 >cd01979 Pchlide_reductase_N Pchlide_reductase_N: N protein of the NB protein complex of Protochlorophyllide (Pchlide)_reductase. Pchlide reductase catalyzes the reductive formation of chlorophyllide (chlide) from protochlorophyllide (pchlide) during biosynthesis of chlorophylls and bacteriochlorophylls. This group contains both the light-independent Pchlide reductase (DPOR) and light-dependent Pchlide reductase (LPOR). Angiosperms contain only LPOR, cyanobacteria, algae and gymnosperms contain both DPOR and LPOR, primitive anoxygenic photosynthetic bacteria contain only DPOR. NB is structurally similar to the FeMo protein of nitrogenase, forming an N2B2 heterotetramer. N and B are homologous to the FeMo alpha and beta subunits respectively. Also in common with nitrogenase in vitro DPOR activity requires ATP hydrolysis and dithoionite or ferredoxin as electron donor. The NB protein complex may serve as a catalytic site for Pchlide reduction similar to MoFe for nitrogen reduction. Probab=54.15 E-value=17 Score=17.05 Aligned_cols=58 Identities=5% Similarity=0.083 Sum_probs=40.9 Q ss_pred HHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 112104542478999998709965899631711100146899999999998717998642133 Q gi|254780200|r 116 WAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVP 178 (343) Q Consensus 116 ~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~ 178 (343) .||--++.-...++.+++++.-..++.+ ||+++++-+-+.+.++.++....+.+++|+ T Consensus 66 ~~~~~~~~eL~r~v~~i~~~r~p~~iFl-----vgtCpseVIk~DLe~~A~rls~~~~v~Vv~ 123 (396) T cd01979 66 SALLNDYAELDRVVTQIKRDRNPSVIFL-----IGSCTTEVIKMDLEGAAPRLSAEIGVPILV 123 (396) T ss_pred HHCCCCHHHHHHHHHHHHHHCCCCEEEE-----ECCCCHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 4236966999999999997489947999-----725857777524999988753267955999 No 76 >KOG4357 consensus Probab=53.45 E-value=13 Score=18.06 Aligned_cols=29 Identities=31% Similarity=0.423 Sum_probs=15.5 Q ss_pred CCCCEEEEEECCCCCCCCCHHHHHHHHHH Q ss_conf 77867999971788899998899999998 Q gi|254780200|r 15 KFGKIGVLLVNLGTPDGHDFFSLRRYLRE 43 (343) Q Consensus 15 ~~~K~gvLL~n~G~P~~~~~~~V~~yL~~ 43 (343) .++||-.|++..-+|++|+.+++|||-.+ T Consensus 62 kkgktlmlfv~v~~psqp~~kd~rpftee 90 (164) T KOG4357 62 KKGKTLMLFVGVSDPSQPDEKDIRPFTEE 90 (164) T ss_pred HCCCEEEEEEEECCCCCCCHHHCCCHHHH T ss_conf 25866999997068899872112402589 No 77 >KOG1322 consensus Probab=53.26 E-value=18 Score=16.96 Aligned_cols=88 Identities=9% Similarity=0.140 Sum_probs=55.8 Q ss_pred CCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 30011210454247899999870996589963171110014689999999999871799864213345578006899999 Q gi|254780200|r 113 VVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQ 192 (343) Q Consensus 113 ~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~ 192 (343) -|.+| -+|-|...++++.+.|+++|+++--|= +.|..+...++. +....+.++-..-..|.--+-=++ T Consensus 34 lVpfg---n~pmI~hqieal~nsGi~~I~la~~y~------s~sl~~~~~k~y---~~~lgVei~~s~eteplgtaGpl~ 101 (371) T KOG1322 34 LVPFG---NKPMILHQIEALINSGITKIVLATQYN------SESLNRHLSKAY---GKELGVEILASTETEPLGTAGPLA 101 (371) T ss_pred CCCCC---CCHHHHHHHHHHHHCCCCEEEEEEECC------CHHHHHHHHHHH---HHCCCEEEEEEECCCCCCCCCHHH T ss_conf 65427---630527789998757974799997138------289999999976---124555999982068876546699 Q ss_pred HHHHHHHHCCCCCCEEEEEEC Q ss_conf 999999733899866999833 Q gi|254780200|r 193 SIREHFESIQWTPEMLLVSFH 213 (343) Q Consensus 193 ~I~~~l~~~~~~~~~llfSaH 213 (343) .+++.|..+... ..++++-| T Consensus 102 laR~~L~~~~~~-~ffVLnsD 121 (371) T KOG1322 102 LARDFLWVFEDA-PFFVLNSD 121 (371) T ss_pred HHHHHHHHCCCC-CEEEECCC T ss_conf 999886403798-67995577 No 78 >PRK12677 xylose isomerase; Provisional Probab=51.72 E-value=19 Score=16.80 Aligned_cols=17 Identities=35% Similarity=0.440 Sum_probs=8.4 Q ss_pred ECCCCCCHHHHHHHHHH Q ss_conf 16898798999999999 Q gi|254780200|r 318 VPCLNSSNLSIDLLEKI 334 (343) Q Consensus 318 vp~lN~~~~fi~~La~l 334 (343) .|.|.....+.+.+|+. T Consensus 340 ~~tl~~~~s~~~l~a~~ 356 (389) T PRK12677 340 VPTLAAGESLADLLADR 356 (389) T ss_pred CCCCCCCCCHHHHHHHH T ss_conf 76457887899998767 No 79 >PRK13121 consensus Probab=51.46 E-value=19 Score=16.77 Aligned_cols=189 Identities=14% Similarity=0.147 Sum_probs=89.4 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 99997178889999889999999874-89544117868889987666512574788999998707788740457775999 Q gi|254780200|r 20 GVLLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRDQ 98 (343) Q Consensus 20 gvLL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~q 98 (343) =|--+..|=|+-. .-.++++.+- +.-++|++.-++--|+--| +-+++.+++-- ..|.-+... ... T Consensus 19 li~y~taG~P~~~---~s~~~~~~l~~~GaDiiElGiPfSDP~ADG-------PvIq~A~~rAL---~~G~~~~~~-~~~ 84 (265) T PRK13121 19 LIPFITAGDPDPA---KTVELMHALVEGGADVIELGVPFSDPMADG-------PVIQRASERAL---AHGVSLRQV-LAM 84 (265) T ss_pred EEEEEECCCCCHH---HHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH---HCCCCHHHH-HHH T ss_conf 8988707189989---999999999976999999789889977658-------99999999999---779984677-999 Q ss_pred HHHHHHHHHCCCCCCCHHHHHCCCCC----HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99999865224784300112104542----47899999870996589963171110014689999999999871799864 Q gi|254780200|r 99 ATNLAKRLESISSIVVDWAMRYGKPS----VKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSL 174 (343) Q Consensus 99 a~~L~~~L~~~~~~~V~~amry~~P~----i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~ 174 (343) .+.+.+. ..+.++ +-|.|++|- +++-++++++.|++-+++ |=.|-- -.+++.+.+.+.+.. .+ T Consensus 85 ~~~~r~~---~~~~Pi-vlM~Y~N~i~~yG~e~F~~~~~~aGvdGlIi-pDLP~e-------E~~~~~~~~~~~gl~-~I 151 (265) T PRK13121 85 VKEFRET---NQTTPV-VLMGYANPIERMGYDAFAAAARAAGVDGVLV-VDYPPE-------ECEEFAAKMRAAGID-PI 151 (265) T ss_pred HHHHHCC---CCCCCE-EEEEHHHHHHHHHHHHHHHHHHHCCCCEEEC-CCCCHH-------HHHHHHHHHHHCCCC-EE T ss_conf 9983103---799998-9862145999971999999998729873434-899989-------999999999865996-68 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 213345578006899999999999733899866999833675026762799888999999999997266756704786 Q gi|254780200|r 175 RTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICF 252 (343) Q Consensus 175 ~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~f 252 (343) .+|.+-.. .++++.-.+...+ .....|.-|+-=. ++ .-..++.+..+.|.+.-++ +..++| T Consensus 152 ~lvaPtt~--------~~Ri~~i~~~~~g--FiY~Vs~~GvTG~---~~-~~~~~~~~~i~~ik~~t~~---Pv~vGF 212 (265) T PRK13121 152 FLLAPTST--------DERIAAVARVASG--YVYYVSLKGVTGA---AT-LDVSSVAAKLPAIRSHVPL---PVGVGF 212 (265) T ss_pred EEECCCCC--------HHHHHHHHHHCCC--EEEEEECCCCCCC---CC-CCHHHHHHHHHHHHHCCCC---CEEEEC T ss_conf 99589998--------9999999962898--0999755556677---75-6628899999999854799---859976 No 80 >TIGR00538 hemN oxygen-independent coproporphyrinogen III oxidase; InterPro: IPR004558 This family represents HemN, the oxygen-independent coproporphyrinogen III oxidase that replaces HemF function under anaerobic conditions. HemN catalyses the anaerobic transformation of coproporhyrinogen-III into protoporphyrinogen-IX during porphyrin biosynthesis. Several species, including Escherichia coli, Helicobacter pylori, and Aquifex aeolicus, have both a member of this family and a member of another, closely related family for which there is no evidence of coproporphyrinogen III oxidase activity, IPR004559 from INTERPRO. Members of this family have a perfectly conserved motif PYRT[SC]YP in a region N-terminal to the region of homology with the related uncharacterised protein.; GO: 0004109 coproporphyrinogen oxidase activity, 0006779 porphyrin biosynthetic process, 0005737 cytoplasm. Probab=50.62 E-value=20 Score=16.68 Aligned_cols=191 Identities=21% Similarity=0.337 Sum_probs=102.2 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHC-------CCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHH Q ss_conf 87404577759999999986522-------47843001121045424789999987099658996317111001468999 Q gi|254780200|r 86 KNESILRTHTRDQATNLAKRLES-------ISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQ 158 (343) Q Consensus 86 gg~SPL~~~t~~qa~~L~~~L~~-------~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~ 158 (343) |||.| +..|..|.+.|-+.+.+ .-++-+++--||- -.|.+..|++.|..||=. | + T Consensus 111 GGGTP-~YL~~~Q~~~l~~~i~~~F~nf~~daEiSiEidPR~~---~~e~~~~L~~~GFNRlS~------------G--v 172 (462) T TIGR00538 111 GGGTP-TYLSPEQIEELMKEIREAFPNFSEDAEISIEIDPRYL---TKEVIKALRDEGFNRLSF------------G--V 172 (462) T ss_pred CCCCC-CCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCC---CHHHHHHHHHCCCCEEEC------------C--E T ss_conf 78983-3378899999999999873201158447765237413---788999999758966423------------5--2 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 99999998717998642133455780068999999999997338998669998336750267627998889999999999 Q gi|254780200|r 159 DKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLK 238 (343) Q Consensus 159 ~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~ 238 (343) ..+...+++. +.-|.+ ..+|....+.+|++ .+..-+.-|| -|||.- ...-..+|.+.|+ T Consensus 173 QDfd~~VQ~a-----vnR~QP----~e~i~~~~~~~R~~--Gf~SiN~DLI---YGLP~Q-------t~esF~~Tl~~v~ 231 (462) T TIGR00538 173 QDFDKEVQQA-----VNRIQP----EEMIFELMEKAREA--GFESINLDLI---YGLPKQ-------TKESFEKTLEKVA 231 (462) T ss_pred ECCCHHHHHH-----HCCCCC----HHHHHHHHHHHHHC--CCCEEEEEEE---CCCCCC-------CHHHHHHHHHHHH T ss_conf 1078555444-----313486----89999999999866--9827874201---388887-------8678999999985 Q ss_pred HHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCC-----CC Q ss_conf 9726675670478622577765443026899998997699839997440542212558887589999999759-----97 Q gi|254780200|r 239 EFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGG-----GE 313 (343) Q Consensus 239 ~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~G-----g~ 313 (343) +. .+++.. +| | +-..||++| +...+.++ ..|+ .-|=|+.|.++ -..+.++| .. T Consensus 232 ~L---nPDRlA-vF-n-yAyvP~vk~----~q~k~~~~------~LPS--~~~KL~Il~~~----I~~L~~~gY~fIGMD 289 (462) T TIGR00538 232 EL---NPDRLA-VF-N-YAYVPWVKP----AQRKIPEE------ALPS--AEEKLEILEET----IAFLTEAGYVFIGMD 289 (462) T ss_pred HC---CCCEEH-HH-H-HHHHCCHHH----HHCCCCHH------CCCC--HHHHHHHHHHH----HHHHHHCCCEEECCC T ss_conf 31---877001-21-0-222101577----85027620------0588--78999999999----999975797585144 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 5898168987989999999999998428 Q gi|254780200|r 314 KFTQVPCLNSSNLSIDLLEKITRRELMG 341 (343) Q Consensus 314 ~~~rvp~lN~~~~fi~~La~lv~~~l~g 341 (343) +|.+ -|+++.|+-=..-..++-|| T Consensus 290 HFAk----pddELavAqr~geL~RNFQG 313 (462) T TIGR00538 290 HFAK----PDDELAVAQRKGELHRNFQG 313 (462) T ss_pred CCCC----CCHHHHHHHHCCCCCCCCCC T ss_conf 5779----71389999850530005765 No 81 >COG1213 Predicted sugar nucleotidyltransferases [Cell envelope biogenesis, outer membrane] Probab=50.13 E-value=16 Score=17.42 Aligned_cols=30 Identities=20% Similarity=0.436 Sum_probs=23.0 Q ss_pred CCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 843001121045424789999987099658996 Q gi|254780200|r 111 SIVVDWAMRYGKPSVKEIINNLREEGCDRLLIF 143 (343) Q Consensus 111 ~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~l 143 (343) ..-|.+| +.|-|.-.++.|.+.|+++++++ T Consensus 23 K~Lvev~---gr~ii~~~i~~L~~~gi~e~vvV 52 (239) T COG1213 23 KALVEVG---GREIIYRTIENLAKAGITEFVVV 52 (239) T ss_pred CHHHHCC---CEEEHHHHHHHHHHCCCCEEEEE T ss_conf 1443158---86738999999987698639999 No 82 >pfam06888 Put_Phosphatase Putative Phosphatase. This family contains a number of putative eukaryotic acid phosphatases. Some family members represent the products of the PSI14 phosphatase family in Lycopersicon esculentum (Tomato). Probab=50.00 E-value=20 Score=16.62 Aligned_cols=21 Identities=38% Similarity=0.754 Sum_probs=10.5 Q ss_pred CCCHHHHHHHHHH--CCCCEEEEE Q ss_conf 5424789999987--099658996 Q gi|254780200|r 122 KPSVKEIINNLRE--EGCDRLLIF 143 (343) Q Consensus 122 ~P~i~~~l~~l~~--~g~~~ii~l 143 (343) .|-+.++++.+++ .|++ ++++ T Consensus 73 ~pgm~~li~~l~~~~~~~d-~~Ii 95 (234) T pfam06888 73 VPGMVRLIKFLAKNGLGCD-LIII 95 (234) T ss_pred CCCHHHHHHHHHHCCCCCE-EEEE T ss_conf 9458999999984767831-9999 No 83 >pfam05013 FGase N-formylglutamate amidohydrolase. Formylglutamate amidohydrolase (FGase) catalyses the terminal reaction in the five-step pathway for histidine utilisation in Pseudomonas putida. By this action, N-formyl-L-glutamate (FG) is hydrolysed to produce L-glutamate plus formate. Probab=49.70 E-value=20 Score=16.59 Aligned_cols=31 Identities=19% Similarity=0.251 Sum_probs=22.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHH Q ss_conf 0689999999999973389986699983367502 Q gi|254780200|r 185 DYISALAQSIREHFESIQWTPEMLLVSFHQMPVS 218 (343) Q Consensus 185 ~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~ 218 (343) -|=+++.+.|++..+..+ ..++++.||+|-. T Consensus 113 PyH~~l~~~i~~~~~~~g---~~~lid~HS~~~~ 143 (221) T pfam05013 113 PYHAALAALLARLRARFG---RAVLIDCHSFTPV 143 (221) T ss_pred HHHHHHHHHHHHHHHHCC---CEEEEEECCCCCC T ss_conf 999999999999998579---8799995279865 No 84 >PRK13113 consensus Probab=49.21 E-value=21 Score=16.53 Aligned_cols=191 Identities=16% Similarity=0.154 Sum_probs=89.9 Q ss_pred CEEEEEECCCCCCCCCHHHHHHHHHHHCCC-HHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHH Q ss_conf 679999717888999988999999987489-5441178688899876665125747889999987077887404577759 Q gi|254780200|r 18 KIGVLLVNLGTPDGHDFFSLRRYLREFLLD-KRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTR 96 (343) Q Consensus 18 K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D-~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~ 96 (343) |.=|.-+..|=|+- +.-.+.++.+-.. -++|++.-++--|+--| +-+++..++-- .+|.-+... . T Consensus 17 ~ali~yitaG~P~~---e~s~~~~~~l~~~GaDiiElGiPFSDP~ADG-------PvIq~A~~rAL---~~G~~~~~~-~ 82 (263) T PRK13113 17 KAFVAYVMAGDPDY---DTSLEVMRGLPAAGVDIIELGMPFTDPMADG-------PTIQLAGQRAL---EGGMTLDRT-L 82 (263) T ss_pred CEEEEEECCCCCCH---HHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH---HCCCCHHHH-H T ss_conf 54898873828997---9999999999976999999789888877658-------99999999999---779838899-9 Q ss_pred HHHHHHHHHHHCCCCCCCHHHHHCCCCC----HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999999865224784300112104542----478999998709965899631711100146899999999998717998 Q gi|254780200|r 97 DQATNLAKRLESISSIVVDWAMRYGKPS----VKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSP 172 (343) Q Consensus 97 ~qa~~L~~~L~~~~~~~V~~amry~~P~----i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~ 172 (343) .+.+.+.+. ....++ +-|.|++|. +++-++++++.|++-+++ |=.|.- -.+++.+.+.+.+.. T Consensus 83 ~~v~~~r~~---~~~~Pi-vlM~Y~N~i~~~G~e~F~~~~~~~GvdGvIi-pDLP~e-------E~~~~~~~~~~~~l~- 149 (263) T PRK13113 83 DMVRAFRKE---DDTTPI-VMMGYYNPIYSRGVDRFLAEAKEAGIDGLIV-VDLPPE-------EDSELCLPAQAAGLN- 149 (263) T ss_pred HHHHHHHCC---CCCCCE-EEEECHHHHHHHHHHHHHHHHHHCCCCEEEE-CCCCHH-------HHHHHHHHHHHCCCC- T ss_conf 999975123---899888-9983136898856999999987779436971-799978-------889999999977986- Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 64213345578006899999999999733899866999833675026762799888999999999997266756704786 Q gi|254780200|r 173 SLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICF 252 (343) Q Consensus 173 ~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~f 252 (343) -+.+|.+=.. -++|+.-.+... ......|.-|+-= .+ .+...+..+..+.|.+..+. +..++| T Consensus 150 ~I~lvaPtt~--------~~Ri~~i~~~a~--gFiY~Vs~~GvTG---~~-~~~~~~~~~~i~~ik~~t~~---Pv~vGF 212 (263) T PRK13113 150 FIRLATPTTD--------DRRLPKVLQNTS--GFVYYVSITGITG---AA-AAQAADVAPEVARIKAATDL---PVIVGF 212 (263) T ss_pred EEEEECCCCC--------HHHHHHHHHCCC--CCEEEEECCCCCC---CC-CCCHHHHHHHHHHHHHCCCC---CEEEEC T ss_conf 7999479999--------999999983389--8489983455668---77-55437799999999854799---889983 No 85 >TIGR00676 fadh2 5,10-methylenetetrahydrofolate reductase; InterPro: IPR004620 The enzyme activities methylenetetrahydrofolate reductase (1.5.1.20 from EC) and 5,10-methylenetetrahydrofolate reductase (FADH) (1.7.99.5 from EC) differ in that the former (assigned in many eukaryotes) is defined to use NADP+ as an acceptor, while the latter (assigned in many bacteria) is flexible with respect to the acceptor. Both convert 5-methyltetrahydrofolate to 5,10-methylenetetrahydrofolate. From a larger set of proteins assigned as one or the other, this family describes the subset of proteins found in bacteria, and currently designated 5,10-methylenetetrahydrofolate reductase. This protein is an FAD-containing flavoprotein.; GO: 0009086 methionine biosynthetic process. Probab=49.15 E-value=21 Score=16.53 Aligned_cols=45 Identities=20% Similarity=0.257 Sum_probs=33.3 Q ss_pred HHHHHHHHHHHHHCCCCCCCH---HHHHCCCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 599999999865224784300---1121045424789999987099658996 Q gi|254780200|r 95 TRDQATNLAKRLESISSIVVD---WAMRYGKPSVKEIINNLREEGCDRLLIF 143 (343) Q Consensus 95 t~~qa~~L~~~L~~~~~~~V~---~amry~~P~i~~~l~~l~~~g~~~ii~l 143 (343) |...+.++++..+ +++- ..--..+--|+++|+++++.|+++||+| T Consensus 61 T~~~~~~I~~~~~----~~~v~HLtC~~~t~~e~~~~L~~y~~~Gi~~ilAL 108 (302) T TIGR00676 61 TVRIVRRIKKETG----IPTVPHLTCIGSTREEIEEILREYRELGIRNILAL 108 (302) T ss_pred HHHHHHHHHHHCC----CCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEE T ss_conf 8999999987328----86300000236898999999999987488679874 No 86 >PRK08332 ribonucleotide-diphosphate reductase subunit alpha; Validated Probab=48.92 E-value=21 Score=16.50 Aligned_cols=86 Identities=17% Similarity=0.208 Sum_probs=44.2 Q ss_pred EEEEEECCCCHHHHHHCCCCH-HHHHHHHHHHHHHHC-----------CCCCCCEEEEECCC-----------CCCCCCC Q ss_conf 699983367502676279988-899999999999726-----------67567047862257-----------7765443 Q gi|254780200|r 207 MLLVSFHQMPVSYLLKGDPYG-CHCHKTARLLKEFLS-----------WPDDRFKICFQSRF-----------GRIKCLE 263 (343) Q Consensus 207 ~llfSaHglP~~~~~~gDpY~-~q~~~t~~~i~~~l~-----------~~~~~~~l~fQSr~-----------Gp~~WL~ 263 (343) ++=+-..|+-.-.+.-|=||. .+-.+.++.+++.+. -....|-+.-.|+. .+..|.. T Consensus 1368 KIGLGvMGlADlLikLGIPYnSeEg~~fa~kvM~fI~~~A~~aS~eLAkERG~FPlfe~S~Y~~g~~PveG~y~~e~~~~ 1447 (1748) T PRK08332 1368 RIGVGMMGLADALFKLGIPYNSEEGFDFMRKATEYLTFYAYKRSVEAAKERGPFPLYDKTEYKDGELPVEGFYHPEIWNL 1447 (1748) T ss_pred CCCEEHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHCCCCCCCCCCCCCCHHHCCC T ss_conf 02400556999999738987868999999999999999999999999980699998402205566677566654222145 Q ss_pred HHHHHHHHHHHHCCCCCEEEEC------CHHHHCCH Q ss_conf 0268999989976998399974------40542212 Q gi|254780200|r 264 PPTDKTVEKLAHDGIKSLAIIT------PGFSSDCL 293 (343) Q Consensus 264 P~t~~~l~~L~~~G~k~v~v~p------~gFvsD~l 293 (343) |. ++.+++..+.|++|..+.. +|.++||. T Consensus 1448 ~w-d~l~~ei~~~gvRNAtvTTIAPTGTISmIAdtS 1482 (1748) T PRK08332 1448 PW-DELVEEIKKYGLRNAMVTTCPPTGSVSMIADTS 1482 (1748) T ss_pred CH-HHHHHHHHHCCCCCEEEEEECCCCCEEEEECCC T ss_conf 46-889998740354420145567876357750677 No 87 >TIGR03599 YloV DAK2 domain fusion protein YloV. This model describes a protein family that contains an N-terminal DAK2 domain (pfam02734), so named because of similarity to the dihydroxyacetone kinase family family. The GTP-binding protein CgtA (a member of the obg family) is a bacterial GTPase associated with ribosome biogenesis, and it has a characteristic extension (TIGR03595) in certain lineages. This protein family described here was found, by the method of partial phylognetic profiling, to have a phylogenetic distribution strongly correlated to that of TIGR03595. This correlation implies some form of functional coupling. Probab=48.37 E-value=21 Score=16.45 Aligned_cols=53 Identities=11% Similarity=0.197 Sum_probs=26.7 Q ss_pred CCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 544302689999899769983999744054221255888758999999975997589816898 Q gi|254780200|r 260 KCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLN 322 (343) Q Consensus 260 ~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN 322 (343) .=.-|+|.|-++.+.+-+.++|+|.|=. -+++ -..+.+.+.-..+...+|+-+ T Consensus 343 qt~NPSt~dil~Ai~~~~a~~ViiLPNn-----~Nii-----laA~qaa~~~~k~V~VipT~s 395 (530) T TIGR03599 343 QTMNPSTEDILKAIEKVNAKNVFVLPNN-----KNII-----LAAEQAAELADKNVVVIPTKT 395 (530) T ss_pred CCCCCCHHHHHHHHHHCCCCEEEEECCC-----CHHH-----HHHHHHHHHCCCCEEEEECCC T ss_conf 9899689999999985798739994799-----1088-----999999874289789994798 No 88 >pfam06685 DUF1186 Protein of unknown function (DUF1186). This family consists of several hypothetical bacterial proteins of around 250 residues in length and is found in several Chlamydia and Anabaena species. The function of this family is unknown. Probab=47.79 E-value=22 Score=16.39 Aligned_cols=77 Identities=23% Similarity=0.179 Sum_probs=30.0 Q ss_pred HHHCCCCHHHHHHHHHHH-HHHHCCCCCCCEEEEECCCCCCCCCCH-HHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHH Q ss_conf 762799888999999999-997266756704786225777654430-268999989976998399974405422125588 Q gi|254780200|r 220 LLKGDPYGCHCHKTARLL-KEFLSWPDDRFKICFQSRFGRIKCLEP-PTDKTVEKLAHDGIKSLAIITPGFSSDCLETSY 297 (343) Q Consensus 220 ~~~gDpY~~q~~~t~~~i-~~~l~~~~~~~~l~fQSr~Gp~~WL~P-~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~ 297 (343) +..|-.-+.++....+.+ ..++.-.+ +.+|+|=+-..-=|.| .+...+.+.-+.| .+.+.|+ .+|..+ T Consensus 141 v~~g~isRe~vi~Yf~~Lf~~~l~r~~---s~vW~sLv~~~~~L~p~EL~e~Ir~af~~g-----Lvd~~fi--~~edvE 210 (249) T pfam06685 141 VAEGVISREEVIRYFRELFNEKLEKEP---SYVWTSLVASSCALYPGELYEDIKKAFEDG-----LVDSFFI--SLEDVE 210 (249) T ss_pred HHHCCCCHHHHHHHHHHHHHHHCCCCC---CHHHHHHHHHHHHHCHHHHHHHHHHHHHCC-----CCCCCCC--CHHHHH T ss_conf 980765589999999999861233585---157999999998768798899999998769-----9871005--799999 Q ss_pred HHHHHHHHH Q ss_conf 875899999 Q gi|254780200|r 298 EIAHEAKEI 306 (343) Q Consensus 298 Eidie~~e~ 306 (343) ..=.+.+|. T Consensus 211 ~~l~~~~e~ 219 (249) T pfam06685 211 TILAEEKER 219 (249) T ss_pred HHHHHHHHH T ss_conf 999864589 No 89 >cd06594 GH31_glucosidase_YihQ YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Probab=47.53 E-value=22 Score=16.36 Aligned_cols=74 Identities=14% Similarity=0.194 Sum_probs=39.4 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCC----CCCCCCCC---------CCCCHHHH Q ss_conf 542478999998709965899631711100146899999999998717998----64213345---------57800689 Q gi|254780200|r 122 KPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSP----SLRTVPPY---------YEDSDYIS 188 (343) Q Consensus 122 ~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~----~~~~I~~~---------~~~p~yI~ 188 (343) -|..+..+++|++.|++ +++-+-|.....+. .++ ++..+.+... .-.++-++ +.+|.-.+ T Consensus 70 FPdp~~mv~~L~~~G~k--~v~~i~P~i~~~~~-~~~----~e~~~~g~~v~~~~G~~~~~~~w~g~~~~~Dftnp~ar~ 142 (317) T cd06594 70 YPGLDELIEELKARGIR--VLTYINPYLADDGP-LYY----EEAKDAGYLVKDADGSPYLVDFGEFDCGVLDLTNPAARD 142 (317) T ss_pred CCCHHHHHHHHHHCCCE--EEEECCCCCCCCCC-HHH----HHHHHCCCEEECCCCCEEEEECCCCCCCCCCCCCHHHHH T ss_conf 96989999999988998--99966887467982-669----999979968888999855641467873430589989999 Q ss_pred HHHHHHHHHHHHCC Q ss_conf 99999999997338 Q gi|254780200|r 189 ALAQSIREHFESIQ 202 (343) Q Consensus 189 a~~~~I~~~l~~~~ 202 (343) .|.+++++.+.+.+ T Consensus 143 Ww~~~~~~~~~~~G 156 (317) T cd06594 143 WFKQVIKEMLLDLG 156 (317) T ss_pred HHHHHHHHHHHHCC T ss_conf 99999999998717 No 90 >COG1608 Predicted archaeal kinase [General function prediction only] Probab=47.20 E-value=13 Score=18.07 Aligned_cols=92 Identities=13% Similarity=0.102 Sum_probs=42.1 Q ss_pred CCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHH----HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC-HH Q ss_conf 430011210454247899999870996589963171110014----6899999999998717998642133455780-06 Q gi|254780200|r 112 IVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAAT----TGTAQDKVFQELIHMRWSPSLRTVPPYYEDS-DY 186 (343) Q Consensus 112 ~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sT----tgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p-~y 186 (343) ..|..||+.-.-. +++.|.+.|+. .+|.+|.-+++- ..++.+.+..++++ +.. ++-+=+-..++. +| T Consensus 73 ~~~~~am~~L~~~---V~~~l~~~Gv~---av~~~P~s~~~~~gr~~~~~l~~i~~~l~~-gfv-Pvl~GDVv~d~~~g~ 144 (252) T COG1608 73 SLTHLAMLELNSI---VVDALLDAGVR---AVSVVPISFSTFNGRILYTYLEAIKDALEK-GFV-PVLYGDVVPDDDNGY 144 (252) T ss_pred HHHHHHHHHHHHH---HHHHHHHCCCC---CCCCCCCCEEECCCCEEECHHHHHHHHHHC-CCE-EEEECCEEECCCCCE T ss_conf 9999999999999---99999866985---322268640331770322029999999876-987-623334687489854 Q ss_pred HHHHHHHHHHHHHHCCCCCCEEEEEE Q ss_conf 89999999999973389986699983 Q gi|254780200|r 187 ISALAQSIREHFESIQWTPEMLLVSF 212 (343) Q Consensus 187 I~a~~~~I~~~l~~~~~~~~~llfSa 212 (343) ==...+.|-.+|.+. ..++.++|-- T Consensus 145 ~IiSGDdIv~~LA~~-l~pd~v~f~t 169 (252) T COG1608 145 EIISGDDIVLHLAKE-LKPDRVIFLT 169 (252) T ss_pred EEEECCHHHHHHHHH-HCCCEEEEEE T ss_conf 898511899999997-4987799984 No 91 >pfam05378 Hydant_A_N Hydantoinase/oxoprolinase N-terminal region. This family is found at the N-terminus of the pfam01968 family. Probab=47.09 E-value=22 Score=16.31 Aligned_cols=47 Identities=19% Similarity=0.369 Sum_probs=34.7 Q ss_pred CCCCCH----HHHHHHHHHHHCCCCCEEEECC-HHHHCCHHHHHHHHHHHHHHHHHCC Q ss_conf 654430----2689999899769983999744-0542212558887589999999759 Q gi|254780200|r 259 IKCLEP----PTDKTVEKLAHDGIKSLAIITP-GFSSDCLETSYEIAHEAKEIFVNGG 311 (343) Q Consensus 259 ~~WL~P----~t~~~l~~L~~~G~k~v~v~p~-gFvsD~lETl~Eidie~~e~~~e~G 311 (343) +.-++| .+.+.++++.++|+++|+|+.. ||.....|. ..+|.++|.| T Consensus 124 G~~i~~ld~~~v~~~~~~l~~~gv~aiAI~~~fS~~Np~hE~------~v~eii~e~g 175 (175) T pfam05378 124 GEVIKPLDEEAVREALKELKDAGVEAVAVSLLHSYRNPEHEL------RVKEIIREEG 175 (175) T ss_pred CEEEECCCHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHH------HHHHHHHHCC T ss_conf 619724899999999999997799899999532779839999------9999999529 No 92 >PRK12360 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Provisional Probab=46.94 E-value=22 Score=16.30 Aligned_cols=28 Identities=14% Similarity=0.248 Sum_probs=10.3 Q ss_pred HHHHHHCCCCCEEEECCHHHHCCHHHHHHH Q ss_conf 998997699839997440542212558887 Q gi|254780200|r 270 VEKLAHDGIKSLAIITPGFSSDCLETSYEI 299 (343) Q Consensus 270 l~~L~~~G~k~v~v~p~gFvsD~lETl~Ei 299 (343) +.+|+++ -.+.++--|.-|-|---|+|+ T Consensus 203 ~~~La~~--vD~miVVGg~nSsNT~rL~ei 230 (281) T PRK12360 203 ADELSKE--VDVMFVIGGKHSSNTQKLVKI 230 (281) T ss_pred HHHHHHH--CCEEEEECCCCCCCHHHHHHH T ss_conf 9999975--999999888899648999999 No 93 >PRK13575 3-dehydroquinate dehydratase; Provisional Probab=46.25 E-value=23 Score=16.23 Aligned_cols=46 Identities=15% Similarity=0.177 Sum_probs=21.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCC--CCCCHHHHHCCCCCHHHHHHHHHHC Q ss_conf 457775999999998652247--8430011210454247899999870 Q gi|254780200|r 90 ILRTHTRDQATNLAKRLESIS--SIVVDWAMRYGKPSVKEIINNLREE 135 (343) Q Consensus 90 PL~~~t~~qa~~L~~~L~~~~--~~~V~~amry~~P~i~~~l~~l~~~ 135 (343) |-....+.+...+.++.+..+ +|.|.+---+....+.+++++++.. T Consensus 12 p~~~~~~~~~~~~~~~~~gaDiVE~RvD~~~~~~~~~v~~~l~~lr~~ 59 (238) T PRK13575 12 PQLSIEETLIQKINHRIDAIDVLELRIDQWENVTVDQVAEMITKLKVM 59 (238) T ss_pred CCCCHHHHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHH T ss_conf 864377999999872447875899985401237989999999999985 No 94 >TIGR01137 cysta_beta cystathionine beta-synthase; InterPro: IPR005857 This model discriminates cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain from cysteine synthases. Cysteine synthase (O-acetylserine (thiol)-lyase) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid - the function of many of these enzymes is unproven.; GO: 0004122 cystathionine beta-synthase activity, 0019343 cysteine biosynthetic process via cystathionine, 0005737 cytoplasm. Probab=46.21 E-value=15 Score=17.59 Aligned_cols=36 Identities=33% Similarity=0.653 Sum_probs=27.2 Q ss_pred CCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCC Q ss_conf 889987778679999717888999988999999987489544117 Q gi|254780200|r 9 RNHPKVKFGKIGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVEL 53 (343) Q Consensus 9 ~~~p~~~~~K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~ 53 (343) +.||....+..=|+|+ |+ ++|-||..||+|....+- T Consensus 295 ~~hP~l~~~~~~VvlL-------PD--S~R~YmtK~~nD~Wl~~~ 330 (527) T TIGR01137 295 EDHPELKEDQVIVVLL-------PD--SIRNYMTKFLNDEWLLDN 330 (527) T ss_pred HHCCCCCCCCEEEEEE-------CC--CCCHHHHHEECHHHHHHC T ss_conf 7267888887899984-------68--730001001043788645 No 95 >PRK13116 consensus Probab=45.60 E-value=24 Score=16.16 Aligned_cols=248 Identities=10% Similarity=0.110 Sum_probs=109.5 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH-HHHHCCCCCCCHHHHHHHH Q ss_conf 99997178889999889999999874-895441178688899876665125747889999-9870778874045777599 Q gi|254780200|r 20 GVLLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHAY-AKIWNTAKNESILRTHTRD 97 (343) Q Consensus 20 gvLL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y-~~Iw~~~gg~SPL~~~t~~ 97 (343) =|-.+.-|-|+- +.-.+.+..+- +.-++|++.-++--|+--| +-+++.+ +.+ .+|.-+.. ... T Consensus 19 lI~yitaG~P~~---~~s~~~l~~l~~~GaDiiElGiPFSDP~ADG-------PvIQ~A~~rAL----~~G~~~~~-~~~ 83 (278) T PRK13116 19 FVPFIMLSDPSP---EEAFQIISTAIEAGADALELGVPFSDPVADG-------PTVAESHLRAL----DGGATVDS-ALE 83 (278) T ss_pred EEEEECCCCCCH---HHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHH-HHH T ss_conf 898854848998---9999999999966999999799988856668-------99999999999----76986789-999 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHCCCCC----HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 999999865224784300112104542----4789999987099658996317111001468999999999987179986 Q gi|254780200|r 98 QATNLAKRLESISSIVVDWAMRYGKPS----VKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPS 173 (343) Q Consensus 98 qa~~L~~~L~~~~~~~V~~amry~~P~----i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~ 173 (343) +.+++++.. .+.++ +-|.|.+|. ++.-++++++.|++-+++ |=-|.- -.+.+.+...+.+.. . T Consensus 84 ~v~~ir~~~---~~~Pi-vlM~Y~N~i~~~G~e~F~~~~~~aGvdGlIi-pDLP~e-------E~~~~~~~~~~~~i~-~ 150 (278) T PRK13116 84 QIKRVRAAY---PEVPI-GMLIYGNVPFTRGLDRFYQEFAEAGADSILL-PDVPVR-------EGAPFSAAAAAAGID-P 150 (278) T ss_pred HHHHHCCCC---CCCCE-EEEECCCHHHHCCHHHHHHHHHHCCCCEEEE-CCCCHH-------HHHHHHHHHHHCCCC-E T ss_conf 999840358---98768-9980572887727999999997769758994-699978-------889999999865766-6 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 42133455780068999999999997338998669998336750267627998889999999999972667567047862 Q gi|254780200|r 174 LRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQ 253 (343) Q Consensus 174 ~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQ 253 (343) +.++.+ -..+ ++++.-.+... ......|--|+-= .+.+.....+.+..+.|.+.-+. +..++| T Consensus 151 I~l~~p-tt~~-------~ri~~I~~~s~--GFiY~VS~~GvTG---~~~~~~~~~l~~~i~~ik~~t~~---Pv~vGF- 213 (278) T PRK13116 151 IYIAPA-NASE-------KTLEGVSAASK--GYIYAISRDGVTG---TERESSTDGLSAVVDNIKKFDGA---PILLGF- 213 (278) T ss_pred EEEECC-CCCH-------HHHHHHHHHCC--CEEEEEECCCCCC---CCCCCCHHHHHHHHHHHHHCCCC---CEEEEC- T ss_conf 999379-9959-------99999997189--7399986352226---88666678999999999845799---879981- Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHH Q ss_conf 25777654430268999989976998399974405422125588875899999997599758981689879899999999 Q gi|254780200|r 254 SRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEK 333 (343) Q Consensus 254 Sr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~ 333 (343) | =++.+..++..+.+..-|+| ==+|| +.+ .+++-..=.....+++-..++..|++ T Consensus 214 ---G------Is~~e~v~~~~~~~aDGVIV-GSAiV-k~I--------------e~~~~~~~~~~~~~~~~~~~~~~l~~ 268 (278) T PRK13116 214 ---G------ISSPQHVADAIAAGASGAIT-GSAIT-KII--------------ASHCEGEHPNPSTIRDMDGLKKDLTE 268 (278) T ss_pred ---C------CCCHHHHHHHHHCCCCEEEE-CHHHH-HHH--------------HHCCCCCCCCCHHHCCHHHHHHHHHH T ss_conf ---6------79899999998668999998-77999-999--------------85276555550112369999999999 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780200|r 334 ITRR 337 (343) Q Consensus 334 lv~~ 337 (343) -|+. T Consensus 269 ~v~~ 272 (278) T PRK13116 269 FISA 272 (278) T ss_pred HHHH T ss_conf 9999 No 96 >TIGR00284 TIGR00284 dihydropteroate synthase-related protein; InterPro: IPR005236 The proteins of this family have been found so far only in the four archaeal species. The central region of the proteins shows considerable homology to the amino-terminal half of dihydropteroate synthases. . Probab=44.81 E-value=24 Score=16.08 Aligned_cols=163 Identities=15% Similarity=0.108 Sum_probs=90.5 Q ss_pred CCCCEEEEEECCCCC-CCCCHHHHHHHHHHHCCCH--HHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHH Q ss_conf 778679999717888-9999889999999874895--4411786888998766651257478899999870778874045 Q gi|254780200|r 15 KFGKIGVLLVNLGTP-DGHDFFSLRRYLREFLLDK--RVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESIL 91 (343) Q Consensus 15 ~~~K~gvLL~n~G~P-~~~~~~~V~~yL~~fl~D~--~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL 91 (343) .|-+.|+=++.+|.| ...+...|..-++..++=- .+|.+ -+....|.|+.|- -|-|-. T Consensus 185 ~~~r~gad~v~lG~~~~~~d~d~vk~kvk~a~d~~kD~pv~~----------------dt~~~~el~ea~~---aGAs~v 245 (529) T TIGR00284 185 RRLRDGADMVALGTGVFDDDADKVKEKVKIALDLLKDSPVSV----------------DTESLKELKEAVK---AGASFV 245 (529) T ss_pred HHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHHHCCCCEEEE----------------ECCCHHHHHHHHH---CCCCEE T ss_conf 998559968984575466677899999999997207860686----------------2688789999986---386568 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCHHHHHCCC----CCH-------HHHHHHHHHCCCCEEEEEE-CCHHHHH-HHHHHHH Q ss_conf 7775999999998652247843001121045----424-------7899999870996589963-1711100-1468999 Q gi|254780200|r 92 RTHTRDQATNLAKRLESISSIVVDWAMRYGK----PSV-------KEIINNLREEGCDRLLIFP-LYPQYSA-ATTGTAQ 158 (343) Q Consensus 92 ~~~t~~qa~~L~~~L~~~~~~~V~~amry~~----P~i-------~~~l~~l~~~g~~~ii~lP-LyPqyS~-sTtgS~~ 158 (343) ......-...|....+..++.-|..-..+-. -.| .+.++.+++.|+++|++=| |-|-.-. ++--+++ T Consensus 246 ~~~~~~n~~eL~~~~k~~e~Afvv~p~n~~~nyvPetieG~~~~l~~~vkkl~~~G~~K~~aDP~LsPpl~gGCsl~eSi 325 (529) T TIGR00284 246 LSLDVENLEELAEIKKLSEAAFVVVPTNQKENYVPETIEGKIKELAEAVKKLKDSGYEKVVADPVLSPPLRGGCSLVESI 325 (529) T ss_pred EECCCCHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCHHHHHHH T ss_conf 72471027889860689987389842577756557420103789999999998669846998665686100131488999 Q ss_pred HHHHHHHHHHCCCCCCCC-----CCCCCCCCHHHHHHHHHHHHH Q ss_conf 999999987179986421-----334557800689999999999 Q gi|254780200|r 159 DKVFQELIHMRWSPSLRT-----VPPYYEDSDYISALAQSIREH 197 (343) Q Consensus 159 ~~~~~~l~~~~~~~~~~~-----I~~~~~~p~yI~a~~~~I~~~ 197 (343) ..+.+ .++.....++-| -.-+-.+..=++|+-..|... T Consensus 326 i~~r~-~~~~l~d~Pl~fg~~NVtEl~dADS~GvnALLaaia~E 368 (529) T TIGR00284 326 IAFRR-VKKLLKDVPLLFGVANVTELVDADSIGVNALLAAIAAE 368 (529) T ss_pred HHHHH-HHHHHCCCCEEEEHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 99999-99872668533202335543320511389999999987 No 97 >TIGR02134 transald_staph transaldolase; InterPro: IPR011861 This small family of proteins belong to the transaldolases. Coxiella and Staphylococcus lack members of the known transaldolase families and appear to require a transaldolase activity for completion of the pentose phosphate pathway.. Probab=44.36 E-value=13 Score=17.99 Aligned_cols=12 Identities=33% Similarity=0.346 Sum_probs=6.0 Q ss_pred HHHHHHHHH-HCC Q ss_conf 689999899-769 Q gi|254780200|r 266 TDKTVEKLA-HDG 277 (343) Q Consensus 266 t~~~l~~L~-~~G 277 (343) +.|.+++|+ ..| T Consensus 199 ~~d~~aKlptllG 211 (237) T TIGR02134 199 AADVVAKLPTLLG 211 (237) T ss_pred HHHHHHHHHHHCC T ss_conf 0688874021027 No 98 >TIGR00689 rpiB_lacA_lacB sugar-phosphate isomerases, RpiB/LacA/LacB family; InterPro: IPR003500 This entry represents the sugar isomerase enzymes ribose 5-phosphate isomerase B (rpiB), galactose isomerase subunit A (LacA) and galactose isomerase subunit B (LacB). Galactose-6-phosphate isomerase (5.3.1.26 from EC) is a heteromultimeric protein consisting of subunits LacA and LacB, and catalyses the conversion of D-galactose 6-phosphate to D-tagatose and 6-phosphate in the tagatose 6-phosphate pathway of lactose catabolism . Galactose-6-phosphate isomerase is induced by galactose or lactose. This entry represents the LacB subunit. Ribose 5-phosphate isomerase (5.3.1.6 from EC) forms a homodimer and catalyses the interconversion of D-ribose 5-phosphate and D-ribulose 5-phosphate in the non-oxidative branch of the pentose phosphate pathway. This reaction permits the synthesis of ribose from other sugars, as well as the recycling of sugars from nucleotide breakdown. Two unrelated enzymes can catalyse this reaction: RpiA (found in most organisms) and RpiB (found in some bacteria and eukaryotes). RpiB is also involved in metabolism of the rare sugar, allose, in addition to ribose sugars. The structures of RpiA and RpiB are distinct, RpiB having a Rossmann-type alpha/beta/alpha sandwich topology .; GO: 0005975 carbohydrate metabolic process. Probab=44.34 E-value=25 Score=16.03 Aligned_cols=18 Identities=17% Similarity=-0.055 Sum_probs=7.9 Q ss_pred CCCCHHHHHHHHHHHHHH Q ss_conf 799888999999999997 Q gi|254780200|r 223 GDPYGCHCHKTARLLKEF 240 (343) Q Consensus 223 gDpY~~q~~~t~~~i~~~ 240 (343) .|.|++.+..-++.|.+. T Consensus 38 ~~DYPdYA~~Va~~v~~~ 55 (146) T TIGR00689 38 SVDYPDYAKLVAQKVVEG 55 (146) T ss_pred CCCCHHHHHHHHHHHHCC T ss_conf 988747899999998628 No 99 >TIGR01258 pgm_1 phosphoglycerate mutase 1 family; InterPro: IPR005952 Most members of this family are phosphoglycerate mutase (5.4.2.1 from EC). This enzyme interconverts 2-phosphoglycerate and 3-phosphoglycerate.2-phospho-D-glycerate + 2,3-diphosphoglycerate = 3-phospho-D-glycerate + 2,3-diphosphoglycerate. The enzyme is transiently phosphorylated on an active site histidine by 2,3-diphosphoglyerate, which is both substrate and product. Some members of this family have are phosphoglycerate mutase as a minor activity and act primarily as a bisphoglycerate mutase, interconverting 2,3-diphosphoglycerate and 1,3-diphosphoglycerate (5.4.2.4 from EC). ; GO: 0016868 intramolecular transferase activity phosphotransferases, 0006096 glycolysis. Probab=44.12 E-value=17 Score=17.15 Aligned_cols=14 Identities=14% Similarity=0.223 Sum_probs=6.7 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 68999999999987 Q gi|254780200|r 154 TGTAQDKVFQELIH 167 (343) Q Consensus 154 tgS~~~~~~~~l~~ 167 (343) .|-|++.+.+++.. T Consensus 183 HGNSLRALvKhL~~ 196 (248) T TIGR01258 183 HGNSLRALVKHLEG 196 (248) T ss_pred CCHHHHHHHHHHHC T ss_conf 28048999998612 No 100 >PRK10714 undecaprenyl phosphate 4-deoxy-4-formamido-L-arabinose transferase; Provisional Probab=43.23 E-value=25 Score=15.92 Aligned_cols=25 Identities=12% Similarity=0.275 Sum_probs=14.4 Q ss_pred HHCCCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 21045424789999987099658996 Q gi|254780200|r 118 MRYGKPSVKEIINNLREEGCDRLLIF 143 (343) Q Consensus 118 mry~~P~i~~~l~~l~~~g~~~ii~l 143 (343) +.+-+-.|++-++++ ++|+|-+... T Consensus 101 LQdpPe~Ip~li~~~-~~G~DvV~~~ 125 (324) T PRK10714 101 LQNPPEEIPRLVAKA-DEGYDVVGTV 125 (324) T ss_pred CCCCHHHHHHHHHHH-HCCCCEEEEE T ss_conf 875988999999987-0488689998 No 101 >PRK06129 3-hydroxyacyl-CoA dehydrogenase; Validated Probab=42.09 E-value=26 Score=15.80 Aligned_cols=34 Identities=18% Similarity=0.224 Sum_probs=13.9 Q ss_pred CCCCCCHHHHHHHHHHHHC-CCCCEEEE--CCHHHHC Q ss_conf 7654430268999989976-99839997--4405422 Q gi|254780200|r 258 RIKCLEPPTDKTVEKLAHD-GIKSLAII--TPGFSSD 291 (343) Q Consensus 258 p~~WL~P~t~~~l~~L~~~-G~k~v~v~--p~gFvsD 291 (343) |++|+.|.|.+...++.++ |.+-|+|- .|||+.+ T Consensus 155 ~g~~Ts~~~v~~~~~~~~~lGk~PV~v~ke~pGFi~N 191 (308) T PRK06129 155 PAPWTAPATVARAHALYRAAGQSPVRLRREIDGFVLN 191 (308) T ss_pred CCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCHHHH T ss_conf 7999898999999999998399889990256215899 No 102 >PRK13118 consensus Probab=41.80 E-value=27 Score=15.77 Aligned_cols=222 Identities=15% Similarity=0.153 Sum_probs=101.8 Q ss_pred CCEE-EEEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHH Q ss_conf 8679-9997178889999889999999874-8954411786888998766651257478899999870778874045777 Q gi|254780200|r 17 GKIG-VLLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTH 94 (343) Q Consensus 17 ~K~g-vLL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~ 94 (343) +|+| +..+..|=|+ .+.-.+++..+- +.-++|++.-++--|+--| +-++..+++-- .+|.-+.. T Consensus 15 ~~~ali~yitaG~P~---~e~t~~~~~~l~~~GaDiiElGiPfSDP~ADG-------PvIq~A~~rAL---~~G~~~~~- 80 (269) T PRK13118 15 NRKALVPFVTAGDPS---PEASVPLLHGLVAAGADVIELGMPFSDPMADG-------PAIQLASERAL---AAGQGLAD- 80 (269) T ss_pred CCCEEEEEECCCCCC---HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH---HCCCCHHH- T ss_conf 995599887171899---89999999999976999999789888866657-------99999999999---67986889- Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHCCCCC----HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 599999999865224784300112104542----4789999987099658996317111001468999999999987179 Q gi|254780200|r 95 TRDQATNLAKRLESISSIVVDWAMRYGKPS----VKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRW 170 (343) Q Consensus 95 t~~qa~~L~~~L~~~~~~~V~~amry~~P~----i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~ 170 (343) ...+.+.+.+.. .+.++ +-|.|.+|. +++-++++++.|++-+++ |=.|. .-.+++.+...+.+. T Consensus 81 ~~~~v~~~r~~~---~~~Pi-vlM~Y~N~i~~~G~e~F~~~~~~~GvdGvIi-pDLP~-------ee~~~~~~~~~~~gl 148 (269) T PRK13118 81 VLQMVREFRQGD---QTTPV-VLMGYLNPIEIYGYERFVAQAKEAGVDGLIL-VDLPP-------EEADELRAPAQAHGL 148 (269) T ss_pred HHHHHHHHHHCC---CCCCE-EEEEECCHHHHHHHHHHHHHHHHCCCCEEEC-CCCCH-------HHHHHHHHHHHHCCC T ss_conf 999999986438---99998-9974000787863999999999859974645-89997-------899999999997598 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 98642133455780068999999999997338998669998336750267627998889999999999972667567047 Q gi|254780200|r 171 SPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKI 250 (343) Q Consensus 171 ~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l 250 (343) ..+.+|.+=.. -++|+.-.+... ......|.-|+-=. ++. -...+.+..+.+.+.-+. +..+ T Consensus 149 -~~I~lvaPtt~--------~~Ri~~i~~~a~--gFiY~vs~~GvTG~---~~~-~~~~~~~~i~~ik~~t~~---Pv~v 210 (269) T PRK13118 149 -DFIRLTSPTTS--------DERLPRVLEHAS--GYLYYVSLAGVTGA---AAL-DTEHVEEAVARLRRHTDL---PVVV 210 (269) T ss_pred -CHHEEECCCCC--------HHHHHHHHHCCC--CCEEEEECCCCCCC---CCC-CHHHHHHHHHHHHHHCCC---CEEE T ss_conf -46403698987--------899999984378--83899854566787---766-719899999999962589---8178 Q ss_pred EEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHH Q ss_conf 862257776544302689999899769983999744054221255 Q gi|254780200|r 251 CFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLET 295 (343) Q Consensus 251 ~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lET 295 (343) +| |= ++.+..+.+.+. . .-+|+= |-+.+.+|. T Consensus 211 GF----GI------s~~e~~~~v~~~-a-DGvIVG-Sa~Vk~i~~ 242 (269) T PRK13118 211 GF----GI------RDAESAAAIARL-A-DGVVVG-SALVDAIAE 242 (269) T ss_pred EE----CC------CCHHHHHHHHCC-C-CEEEEC-HHHHHHHHH T ss_conf 71----67------999999999800-9-999985-899999985 No 103 >TIGR01793 cit_synth_euk citrate (Si)-synthase, eukaryotic; InterPro: IPR010109 Citrate synthase 2.3.3.1 from EC is a member of a small family of enzymes that can directly form a carbon-carbon bond without the presence of metal ion cofactors. It catalyses the first reaction in the Krebs' cycle, namely the conversion of oxaloacetate and acetyl-coenzyme A into citrate and coenzyme A. This reaction is important for energy generation and for carbon assimilation. The reaction proceeds via a non-covalently bound citryl-coenzyme A intermediate in a 2-step process (aldol-Claisen condensation followed by the hydrolysis of citryl-CoA). Citrate synthase enzymes are found in two distinct structural types: type I enzymes (found in eukaryotes, Gram-positive bacteria and archaea) form homodimers and have shorter sequences than type II enzymes, which are found in Gram-negative bacteria and are hexameric in structure. In both types, the monomer is composed of two domains: a large alpha-helical domain consisting of two structural repeats, where the second repeat is interrupted by a small alpha-helical domain. The cleft between these domains forms the active site, where both citrate and acetyl-coenzyme A bind. The enzyme undergoes a conformational change upon binding of the oxaloacetate ligand, whereby the active site cleft closes over in order to form the acetyl-CoA binding site . The energy required for domain closure comes from the interaction of the enzyme with the substrate. Type II enzymes possess an extra N-terminal beta-sheet domain, and some type II enzymes are allosterically inhibited by NADH . This entry includes both mitochondrial and peroxisomal forms of citrate synthase. Peroxisomal forms of the enzyme, recognized by the C-terminal targeting motif SKL, act in the glyoxylate cycle. Eukaryotic homologs include a Tetrahymena thermophila citrate synthase that doubles as a filament protein, a putative citrate synthase from Plasmodium falciparum (no TCA cycle), and a methylcitrate synthase from Aspergillus nidulans (Emericella nidulans).; GO: 0004108 citrate (Si)-synthase activity, 0044262 cellular carbohydrate metabolic process. Probab=41.63 E-value=27 Score=15.75 Aligned_cols=23 Identities=13% Similarity=0.227 Sum_probs=16.3 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 874045777599999999865224784 Q gi|254780200|r 86 KNESILRTHTRDQATNLAKRLESISSI 112 (343) Q Consensus 86 gg~SPL~~~t~~qa~~L~~~L~~~~~~ 112 (343) -|.-| |++|+++|.+++.++.++ T Consensus 96 TG~VP----se~Qv~alsaEw~aRadl 118 (430) T TIGR01793 96 TGEVP----SEEQVDALSAEWRARADL 118 (430) T ss_pred CCCCC----HHHHHHHHHHHHHHHHHH T ss_conf 58998----088999999999855002 No 104 >pfam08645 PNK3P Polynucleotide kinase 3 phosphatase. Polynucleotide kinase 3 phosphatases play a role in the repair of single breaks in DNA induced by DNA-damaging agents such as gamma radiation and camptothecin. Probab=41.60 E-value=27 Score=15.75 Aligned_cols=24 Identities=38% Similarity=0.721 Sum_probs=18.4 Q ss_pred HCCCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 1045424789999987099658996 Q gi|254780200|r 119 RYGKPSVKEIINNLREEGCDRLLIF 143 (343) Q Consensus 119 ry~~P~i~~~l~~l~~~g~~~ii~l 143 (343) .+..|.+.++|++|.++|+. ||++ T Consensus 28 ~~~~p~V~~~L~~L~~~gY~-iVIv 51 (158) T pfam08645 28 KWLYPSVPEKLKKLHEDGYK-IVIF 51 (158) T ss_pred EECCCCHHHHHHHHHHCCCE-EEEE T ss_conf 88077889999999988987-9999 No 105 >cd06422 NTP_transferase_like_1 NTP_transferase_like_1 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. Probab=41.32 E-value=27 Score=15.72 Aligned_cols=48 Identities=19% Similarity=0.311 Sum_probs=30.8 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 0454247899999870996589963171110014689999999999871799864213 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTV 177 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I 177 (343) .+.|-|+-+++++++.|++++++..-| - -+.+.+.+.+..+..++.++ T Consensus 28 ~~kpii~~~i~~l~~~gi~~iii~~~y---~-------~~~i~~~~~~~~~~~~i~~~ 75 (221) T cd06422 28 AGKPLIDHALDRLAAAGIRRIVVNTHH---L-------ADQIEAHLGDSRFGLRITIS 75 (221) T ss_pred CCEEHHHHHHHHHHHCCCCEEEEECCC---C-------HHHHHHHHHCCCCCCCEEEE T ss_conf 998999999999998299679983253---2-------78899987225578737997 No 106 >pfam02543 CmcH_NodU Carbamoyltransferase. This family consists of NodU from Rhizobium and CmcH from Nocardia lactamdurans. NodU a Rhizobium nodulation protein involved in the synthesis of nodulation factors has 6-O-carbamoyltransferase-like activity. CmcH is involved in cephamycin (antibiotic) biosynthesis and has 3-hydroxymethylcephem carbamoyltransferase activity, EC:2.1.3.7 catalysing the reaction: Carbamoyl phosphate + 3-hydroxymethylceph-3-EM-4-carboxylate <= phosphate + 3-carbamoyloxymethylcephem. Probab=40.94 E-value=18 Score=16.93 Aligned_cols=24 Identities=33% Similarity=0.469 Sum_probs=13.6 Q ss_pred HHHHHHHHHCCCCCCCEEEEECC--CCC Q ss_conf 99999997266756704786225--777 Q gi|254780200|r 233 TARLLKEFLSWPDDRFKICFQSR--FGR 258 (343) Q Consensus 233 t~~~i~~~l~~~~~~~~l~fQSr--~Gp 258 (343) ..+.+++.|. ++++-=-||.| +|| T Consensus 253 ~~~~vA~~L~--~gkIVg~fqGr~E~GP 278 (360) T pfam02543 253 IPSEAAELLA--DGKVVGWFQGRMEFGP 278 (360) T ss_pred HHHHHHHHHH--CCCEEEEEECCCCCCC T ss_conf 7999999986--7998999947866787 No 107 >PRK00881 purH bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional Probab=40.78 E-value=28 Score=15.67 Aligned_cols=41 Identities=29% Similarity=0.417 Sum_probs=22.7 Q ss_pred CCCCCC-CCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHH Q ss_conf 257776-5443026899998997699839997440542212558 Q gi|254780200|r 254 SRFGRI-KCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETS 296 (343) Q Consensus 254 Sr~Gp~-~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl 296 (343) |.||.. --=.|-..++.+++.+ .+-.| |+.|+|.-|.||.| T Consensus 302 SAFGGIva~N~~vd~~~A~~i~~-~F~Ev-IiAP~f~~eAL~iL 343 (514) T PRK00881 302 SAFGGIIAFNREVDAETAEAIVK-IFLEV-IIAPSFDEEALEIL 343 (514) T ss_pred CCCCCEEEECCCCCHHHHHHHHH-HHHHH-HCCCCCCHHHHHHH T ss_conf 54574799626268999999999-88874-11788998999998 No 108 >cd01310 TatD_DNAse TatD like proteins; E.coli TatD is a cytoplasmic protein, shown to have magnesium dependent DNase activity. Probab=40.22 E-value=28 Score=15.61 Aligned_cols=137 Identities=11% Similarity=0.106 Sum_probs=61.7 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 45424789999987099658996317111001468999999999987179986421334557800689999999999973 Q gi|254780200|r 121 GKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFES 200 (343) Q Consensus 121 ~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~ 200 (343) ....++++++.+++.|+..++....- -...+.+.+-.++.. . +...+.-||.+++-+.+...+.++. T Consensus 13 ~~~d~~~vi~~a~~~gv~~ii~~~~~--------~~~~~~~~~la~~~~---~--i~~~~GiHP~~~~~~~~~~~~~l~~ 79 (251) T cd01310 13 FDADRDDVLARAREAGVIKIIVVGTD--------LKSSKRALELAKKYD---N--VYAAVGLHPHDADEHVDEDLDLLEL 79 (251) T ss_pred HCCCHHHHHHHHHHCCCCEEEEECCC--------HHHHHHHHHHHHHCC---C--EEEEECCCHHHHHHCCHHHHHHHHH T ss_conf 44699999999998699889996899--------999999999997599---8--6898236837651235779999999 Q ss_pred CCCCCCEEEEEECCCCHHHHHHCCCCHHH--HHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCC- Q ss_conf 38998669998336750267627998889--999999999972667567047862257776544302689999899769- Q gi|254780200|r 201 IQWTPEMLLVSFHQMPVSYLLKGDPYGCH--CHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDG- 277 (343) Q Consensus 201 ~~~~~~~llfSaHglP~~~~~~gDpY~~q--~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G- 277 (343) ....+..+-.-==||.-.. ..++...| +.+.-=.+++.++.| +.-.||- ..+++++-|.+.+ T Consensus 80 ~~~~~~~vaIGEiGLD~~~--~~~~~~~Q~~~F~~ql~lA~~~~lP-----v~iH~R~--------A~~~~l~iL~~~~~ 144 (251) T cd01310 80 LAANPKVVAIGEIGLDYYR--DKSPREVQKEVFRAQLELAKELNLP-----VVIHSRD--------AHEDVLEILKEYGP 144 (251) T ss_pred HHCCCCEEEEEEECCCCCC--CCCCHHHHHHHHHHHHHHHHHHCCC-----EEEEECC--------CHHHHHHHHHHCCC T ss_conf 9728990699850148888--8998999999999999999875788-----4898645--------47999999997088 Q ss_pred CCCEEEEC Q ss_conf 98399974 Q gi|254780200|r 278 IKSLAIIT 285 (343) Q Consensus 278 ~k~v~v~p 285 (343) ..++++-| T Consensus 145 ~~~~v~H~ 152 (251) T cd01310 145 PKRGVFHC 152 (251) T ss_pred CCCEEEEE T ss_conf 87738885 No 109 >TIGR01481 ccpA catabolite control protein A; InterPro: IPR006377 Catabolite control protein A is a LacI family global transcriptional regulator found in Gram-positive bacteria. CcpA is involved in repressing carbohydrate utilization genes [ex: alpha-amylase (amyE), acetyl-coenzyme A synthase (acsA)] and in activating genes involved in transporting excess carbon from the cell [e.g. acetate kinase (ackA), alpha-acetolactate synthase (alsS)]. Additionally, disruption of CcpA in Bacillus megaterium, Staphylococcus xylosus, Lactobacillus casei and Lactocacillus pentosus also decreases growth rate, which suggests CcpA is involved in the regulation of other metabolic pathways.; GO: 0030528 transcription regulator activity, 0045449 regulation of transcription. Probab=40.12 E-value=16 Score=17.29 Aligned_cols=202 Identities=15% Similarity=0.181 Sum_probs=97.8 Q ss_pred CCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCC---CCCCHHHHHCCCCCHH-HHHHHHHHCCCCEEEEE Q ss_conf 2574788999998707788740457775999999998652247---8430011210454247-89999987099658996 Q gi|254780200|r 68 NFRPSKIKHAYAKIWNTAKNESILRTHTRDQATNLAKRLESIS---SIVVDWAMRYGKPSVK-EIINNLREEGCDRLLIF 143 (343) Q Consensus 68 ~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~L~~~L~~~~---~~~V~~amry~~P~i~-~~l~~l~~~g~~~ii~l 143 (343) ..||-.+|.--++==| .-=|--+..+|...-..|++-++..- .+.+.+++-=-.+-=| +++.-|-.+.+|-||-+ T Consensus 45 ~YRPNAVARGLASKrT-TTVGViIPDIsN~yya~LarGIeDIA~MYKYNIILsNSD~~~ekE~~v~NTLl~KQVDGiIfm 123 (332) T TIGR01481 45 DYRPNAVARGLASKRT-TTVGVIIPDISNIYYAELARGIEDIATMYKYNIILSNSDEDEEKEVQVLNTLLSKQVDGIIFM 123 (332) T ss_pred CCCCCHHHHCCCCCCE-EEEEEEECCCCHHHHHHCCCCCHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHHHHCCEEEEC T ss_conf 7985445322667740-234688158615768501676025751101233565268987521676665331010113444 Q ss_pred ECCHHHHHHHHHHHH-HHHHHHHHHHCCCCCCC--CCCCCC----CCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCC Q ss_conf 317111001468999-99999998717998642--133455----78006899999999999733899866999833675 Q gi|254780200|r 144 PLYPQYSAATTGTAQ-DKVFQELIHMRWSPSLR--TVPPYY----EDSDYISALAQSIREHFESIQWTPEMLLVSFHQMP 216 (343) Q Consensus 144 PLyPqyS~sTtgS~~-~~~~~~l~~~~~~~~~~--~I~~~~----~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP 216 (343) |..+ |++.+.+.+-+ .|=+= .++.=+ =+=.|.+|-.+-+.+.+++ +. .++=|.-=+|- T Consensus 124 -----------G~~Ite~~re~F~rs~-~PVVLAGt~D~~~~~pSVNIDy~~A~kEav~~l~~~-Gh--K~iAfv~G~L~ 188 (332) T TIGR01481 124 -----------GGEITEKLREEFSRSP-VPVVLAGTVDKENELPSVNIDYKQATKEAVKELIAK-GH--KKIAFVSGSLS 188 (332) T ss_pred -----------CHHHHHHHHHHHHCCC-CCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHC-CC--CEEEEEECCCC T ss_conf -----------3021089999971289-988995354865467731016789999999998633-58--26888736757 Q ss_pred HHHHHHCCC---CHHHHHHHHHHHHHHHCCCCCCCEEEEE----CCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHH Q ss_conf 026762799---8889999999999972667567047862----257776544302689999899769983999744054 Q gi|254780200|r 217 VSYLLKGDP---YGCHCHKTARLLKEFLSWPDDRFKICFQ----SRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFS 289 (343) Q Consensus 217 ~~~~~~gDp---Y~~q~~~t~~~i~~~l~~~~~~~~l~fQ----Sr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFv 289 (343) ++ +..-++ |+.-+ +..|+..++ .+.|. -+-|- ...++|.+.+-+-=+|++ T Consensus 189 ~~-~N~~~~l~GykE~L--------~~~gI~~~e-~L~~~~~YsY~~G~---------K~~~~l~~s~p~~tAv~v---- 245 (332) T TIGR01481 189 DS-VNGEVRLEGYKEAL--------KKAGIAYEE-DLVIEAKYSYDAGV---------KLAAELLESSPKITAVVV---- 245 (332) T ss_pred CC-CCCEECCCHHHHHH--------HHCCCCCCC-CEEEECCCCCHHHH---------HHHHHHHHCCCCCCEEEE---- T ss_conf 11-04520133046788--------736874142-21330455504469---------999999853899658998---- Q ss_pred HCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 221255888758999999975997 Q gi|254780200|r 290 SDCLETSYEIAHEAKEIFVNGGGE 313 (343) Q Consensus 290 sD~lETl~Eidie~~e~~~e~Gg~ 313 (343) +-.|+.+..-.-+.++|+. T Consensus 246 -----~~DE~A~G~lnaa~D~Gi~ 264 (332) T TIGR01481 246 -----ADDELAIGILNAALDAGIK 264 (332) T ss_pred -----CCHHHHHHHHHHHHHCCCC T ss_conf -----1828899999999857898 No 110 >pfam02401 LYTB LytB protein. The mevalonate-independent 2-C-methyl-D-erythritol 4-phosphate (MEP) pathway for isoprenoid biosynthesis is essential in many eubacteria, plants, and the malaria parasite. The LytB gene is involved in the trunk line of the MEP pathway. Probab=39.05 E-value=29 Score=15.49 Aligned_cols=30 Identities=23% Similarity=0.299 Sum_probs=14.1 Q ss_pred HHHHHHHHCCCCCEEEECCHHHHCCHHHHHHH Q ss_conf 99998997699839997440542212558887 Q gi|254780200|r 268 KTVEKLAHDGIKSLAIITPGFSSDCLETSYEI 299 (343) Q Consensus 268 ~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Ei 299 (343) +.+.+|+++ -.+.++--|+-|-|---|+|+ T Consensus 200 ~a~~~lA~~--vD~miVVGg~nSsNT~~L~ei 229 (280) T pfam02401 200 EAVRELAPE--VDVVLVVGGKNSSNSNRLAEI 229 (280) T ss_pred HHHHHHHHH--CCEEEEECCCCCCCHHHHHHH T ss_conf 999977754--989999579999548999999 No 111 >PRK07105 pyridoxamine kinase; Validated Probab=39.04 E-value=29 Score=15.49 Aligned_cols=192 Identities=13% Similarity=0.210 Sum_probs=94.2 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEE-ECCHHHH-------HHHHHHHHHHHHHHHHHHCCCCCCCC-CCCCCCCCHHHHHH Q ss_conf 045424789999987099658996-3171110-------01468999999999987179986421-33455780068999 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRLLIF-PLYPQYS-------AATTGTAQDKVFQELIHMRWSPSLRT-VPPYYEDSDYISAL 190 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~ii~l-PLyPqyS-------~sTtgS~~~~~~~~l~~~~~~~~~~~-I~~~~~~p~yI~a~ 190 (343) ||.=.+.-.+.-|.+.|++-..+. -+.-|.+ .-.....++.+.+...+.+ .++.- ...|-.++.-|+.+ T Consensus 16 ~G~agl~A~i~~lsa~G~~~~~vpTallS~~T~g~~~~~~~~l~~~l~~~~~~w~~~~--i~~dai~tG~Lgs~~~i~~v 93 (284) T PRK07105 16 FGRVALTASIPIMSSMGLQVCPLPTALLSSHTGGFQNPSILDLTDGMQAFLTHWKSLN--LKFDAIYSGYLGSPRQVQIV 93 (284) T ss_pred CCHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCEECCHHHHHHHHHHHHHHCC--CCCCEEEECCCCCHHHHHHH T ss_conf 4677999999999986995406568994568998464500172899999999998669--98798998987999999999 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHH----CCCCCCCEEEEECCCCCCCCC---- Q ss_conf 999999997338998669998336750267627998889999999999972----667567047862257776544---- Q gi|254780200|r 191 AQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFL----SWPDDRFKICFQSRFGRIKCL---- 262 (343) Q Consensus 191 ~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l----~~~~~~~~l~fQSr~Gp~~WL---- 262 (343) ++.|+.. .. +...++.- |+ +.++|.-|...-.+.++.+.+.+ -+.+|..+.++-+ | .++. T Consensus 94 ~~~l~~~-~~---~~~~iVvD----PV-Mgd~G~ly~~~~~~~v~~~r~Llp~AdiITPN~~Ea~~Lt--g-~~~~~~~~ 161 (284) T PRK07105 94 SEFIDDF-KK---KDLLVVVD----PV-MGDNGKLYQGFDQKMVEEMRKLIQKADVITPNLTEACLLL--D-KPYLEKSY 161 (284) T ss_pred HHHHHHH-CC---CCCCEEEC----CC-CCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHH--C-CCCCCCCC T ss_conf 9999985-57---89848969----65-5768876677879999999998440746279999999996--8-87687779 Q ss_pred -CHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEE-----EECC--CCCCHHHHHHHHHH Q ss_conf -302689999899769983999744054221255888758999999975997589-----8168--98798999999999 Q gi|254780200|r 263 -EPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFT-----QVPC--LNSSNLSIDLLEKI 334 (343) Q Consensus 263 -~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~-----rvp~--lN~~~~fi~~La~l 334 (343) .-+..+.+++|.+.|.++|+|--..+-.|++..++- ...+..+. |+|. -++-+.|.++|+.- T Consensus 162 ~~~~~~~~a~~L~~~G~~~Vvitg~~~~~~~~~~~~~----------~~~~~~~~~~~~~~i~~~~~GTGd~fss~iaa~ 231 (284) T PRK07105 162 DEEEIKDYLRKLADLGPKIVIITSVPFEEGKIGVAYY----------DRASDRFWKHFGKYIPAHYPGTGDIFTSVITGS 231 (284) T ss_pred CHHHHHHHHHHHHHHCCCEEEEECCCCCCCEEEEEEE----------CCCCCEEEEEECCCCCCCCCCCCHHHHHHHHHH T ss_conf 9999999999999708988999045477871679998----------589988999626541899898417899999999 Q ss_pred H Q ss_conf 9 Q gi|254780200|r 335 T 335 (343) Q Consensus 335 v 335 (343) . T Consensus 232 l 232 (284) T PRK07105 232 L 232 (284) T ss_pred H T ss_conf 9 No 112 >PRK09432 metF 5,10-methylenetetrahydrofolate reductase; Provisional Probab=38.14 E-value=30 Score=15.39 Aligned_cols=53 Identities=13% Similarity=0.217 Sum_probs=34.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHC---CCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 7404577759999999986522478430011210---454247899999870996589963 Q gi|254780200|r 87 NESILRTHTRDQATNLAKRLESISSIVVDWAMRY---GKPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 87 g~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry---~~P~i~~~l~~l~~~g~~~ii~lP 144 (343) ||| -...|...+..+++.. ++++-.-+.+ ..--+.+.+.++++.|+++|++|- T Consensus 63 ggs-tr~~T~~i~~~i~~~~----~i~~~~HLTCvg~t~~~i~~~L~~~~~~GI~nILALR 118 (296) T PRK09432 63 NSG-ERDRTHSIIKGIKKRT----GLEAAPHLTCIDATPEELRTIAKDYWNNGIRHIVALR 118 (296) T ss_pred CCC-CHHHHHHHHHHHHHHH----CCCCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEECC T ss_conf 997-7356999999999874----9981126662699999999999999975975486548 No 113 >TIGR01415 trpB_rel pyridoxal-phosphate dependent TrpB-like enzyme; InterPro: IPR006316 These sequences represent a family of pyridoxal-phosphate dependent enzymes that are closely related to the beta subunit of tryptophan synthase. ; GO: 0004834 tryptophan synthase activity, 0030170 pyridoxal phosphate binding, 0000162 tryptophan biosynthetic process. Probab=38.09 E-value=30 Score=15.39 Aligned_cols=123 Identities=20% Similarity=0.256 Sum_probs=69.8 Q ss_pred CHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHH----CCCCCHHHHHHHH---HHCCCCEEEEE Q ss_conf 4788999998707788740457775999999998652247843001121----0454247899999---87099658996 Q gi|254780200|r 71 PSKIKHAYAKIWNTAKNESILRTHTRDQATNLAKRLESISSIVVDWAMR----YGKPSVKEIINNL---REEGCDRLLIF 143 (343) Q Consensus 71 ~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amr----y~~P~i~~~l~~l---~~~g~~~ii~l 143 (343) |.++.+.|..| |-..||.+- +.|++.|+..+.+.+|+=+= -|.=-|..+|.|. +.+|+++|+== T Consensus 56 P~Evl~~Y~~~----GRPTPL~RA-----~~LEe~Lgt~gr~rIyyK~E~~~~tGSHK~NTA~AqAYYak~~G~k~l~TE 126 (426) T TIGR01415 56 PEEVLKAYAKI----GRPTPLIRA-----KGLEELLGTPGRARIYYKYESVSPTGSHKINTAIAQAYYAKKEGAKRLVTE 126 (426) T ss_pred CHHHHHHHHHH----CCCCHHHHH-----HHHHHHHCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEECC T ss_conf 87899999750----788634665-----444786289952789874057577888524479999998755484374323 Q ss_pred ECCHHH-HHHHHHH--------------HH--HHHHHHHHHHC-----CCCC------CCCCCCC-CCCCHHHH-HHHHH Q ss_conf 317111-0014689--------------99--99999998717-----9986------4213345-57800689-99999 Q gi|254780200|r 144 PLYPQY-SAATTGT--------------AQ--DKVFQELIHMR-----WSPS------LRTVPPY-YEDSDYIS-ALAQS 193 (343) Q Consensus 144 PLyPqy-S~sTtgS--------------~~--~~~~~~l~~~~-----~~~~------~~~I~~~-~~~p~yI~-a~~~~ 193 (343) -=.=|. |+.+.++ ++ +-+.|.+.+.- .+|+ =++.+.. -+||+=+- |+++- T Consensus 127 TGAGQWG~AlS~A~al~~L~~~VfMvr~Sf~~KP~Rk~lM~~yGa~V~PSPS~~T~~GR~~L~e~gP~~PGSLGiAISEA 206 (426) T TIGR01415 127 TGAGQWGSALSLAGALFGLKVKVFMVRVSFQQKPYRKYLMELYGAEVIPSPSELTEFGRKVLKEDGPDHPGSLGIAISEA 206 (426) T ss_pred CCCCHHHHHHHHHHHHHCCEEEEEEEECCHHCCHHHHHHHHHCCCEECCCCCCCCHHHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 68864789999999872892589887412104846899998708843368877513468887406777786167899999 Q ss_pred HHHHHHHCC Q ss_conf 999997338 Q gi|254780200|r 194 IREHFESIQ 202 (343) Q Consensus 194 I~~~l~~~~ 202 (343) |..++++.+ T Consensus 207 ~e~al~~~~ 215 (426) T TIGR01415 207 IEEALSDED 215 (426) T ss_pred HHHHHHCCC T ss_conf 999970899 No 114 >COG2240 PdxK Pyridoxal/pyridoxine/pyridoxamine kinase [Coenzyme metabolism] Probab=37.65 E-value=31 Score=15.34 Aligned_cols=200 Identities=18% Similarity=0.108 Sum_probs=113.4 Q ss_pred CCCCHHHHHHHHHHCCCCEE----EEEECCHHHHHHHHH----HHHHHHHHHHHHHCCCCCCCC-CCCCCCCCHHHHHHH Q ss_conf 45424789999987099658----996317111001468----999999999987179986421-334557800689999 Q gi|254780200|r 121 GKPSVKEIINNLREEGCDRL----LIFPLYPQYSAATTG----TAQDKVFQELIHMRWSPSLRT-VPPYYEDSDYISALA 191 (343) Q Consensus 121 ~~P~i~~~l~~l~~~g~~~i----i~lPLyPqyS~sTtg----S~~~~~~~~l~~~~~~~~~~~-I~~~~~~p~yI~a~~ 191 (343) |+=.-.-++--|+.-|.+-. ++||=.|+|-..|.. +.++.+.+.+....+..+++- ...|.-++.=+++++ T Consensus 13 G~vGn~AA~f~lq~~G~~V~~vpTV~fSnHtgyg~~~g~v~~~e~l~~~l~~l~~~~~~~~~davltGYlgs~~qv~~i~ 92 (281) T COG2240 13 GSVGNSAAIFPLQRLGLDVWAVPTVQFSNHTGYGKWTGIVMPPEQLADLLNGLEAIDKLGECDAVLTGYLGSAEQVRAIA 92 (281) T ss_pred CCCCHHHHHHHHHHCCCCEEEECEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCEEEECCCCCHHHHHHHH T ss_conf 02330777879998598235435388248888897788579889999999999854413446889970479889999999 Q ss_pred HHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHH-HHHHHH--HHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHH Q ss_conf 9999999733899866999833675026762799888-999999--9999972667567047862257776544302689 Q gi|254780200|r 192 QSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGC-HCHKTA--RLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDK 268 (343) Q Consensus 192 ~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~-q~~~t~--~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~ 268 (343) +-+++.=+. ++ ..+++-= |+ +-+.|-.|.. .+.+.. +++-..-.+.+|.|++.+-| |..-=..=+..+ T Consensus 93 ~~v~~vk~~--~P-~~~~l~D---PV-MGD~gglYV~~~~~~~~~~~lip~AdiiTPN~fELe~Lt--g~~~~~~~da~~ 163 (281) T COG2240 93 GIVKAVKEA--NP-NALYLCD---PV-MGDPGGLYVAPEVAEAYRDELLPLADIITPNIFELEILT--GKPLNTLDDAVK 163 (281) T ss_pred HHHHHHHCC--CC-CEEEEEC---CC-CCCCCCEEECCCHHHHHHHHHCCHHHEECCCHHHHHHHH--CCCCCCHHHHHH T ss_conf 999998624--98-7389967---74-367996352643189999864213007488799999984--988788899999 Q ss_pred HHHHHHHCCCCCEEEECC---HHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHH Q ss_conf 999899769983999744---0542212558887589999999759975898168987989999999999 Q gi|254780200|r 269 TVEKLAHDGIKSLAIITP---GFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKIT 335 (343) Q Consensus 269 ~l~~L~~~G~k~v~v~p~---gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv 335 (343) ....|.+.|.+.|+|-.. ++-.++.+.+..-. +..+. .. ..+... --++-|.|...+..-+ T Consensus 164 aa~~L~~~gp~~vlVTS~~~~~~~~~~~~~~~~~~-~~~~h---~~-~~v~~~-~~GtGDL~sallla~l 227 (281) T COG2240 164 AARKLGADGPKIVLVTSLSRAGMSTGNFEMLGKSA-ELAWH---IS-PLVPFI-PNGTGDLFSALLLARL 227 (281) T ss_pred HHHHHHHCCCCEEEEECCCCCCCCCCEEEEECCCH-HHHHH---HH-HCCCCC-CCCCHHHHHHHHHHHH T ss_conf 99997514898899953453678773499961312-23323---43-227889-9982499999999999 No 115 >TIGR02767 TraG-Ti Ti-type conjugative transfer system protien TraG; InterPro: IPR014135 This entry contains the Agrobacterium tumefaciens Ti-plasmid TraG, it is responsible for conjugative transfer of the entire plasmid among Agrobacterium strains . The protein is distantly related to the F-type conjugation system TraG protein. Both of these systems are examples of type IV secretion systems.. Probab=37.20 E-value=31 Score=15.30 Aligned_cols=133 Identities=20% Similarity=0.287 Sum_probs=64.5 Q ss_pred HCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH-----CCCCCCCHHHHHCC------CCCHH---HHHHHH Q ss_conf 12574788999998707788740457775999999998652-----24784300112104------54247---899999 Q gi|254780200|r 67 LNFRPSKIKHAYAKIWNTAKNESILRTHTRDQATNLAKRLE-----SISSIVVDWAMRYG------KPSVK---EIINNL 132 (343) Q Consensus 67 ~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~L~~~L~-----~~~~~~V~~amry~------~P~i~---~~l~~l 132 (343) +||+|. =+.=|+ .||+|||+.+....--.=.--+. +.+...|=-|.||+ .||.| -+.++= T Consensus 193 ~PF~p~-----d~~tWG-~GGkaPLLtf~~dFgStHmifFAGSGGfKTTS~vvPTaL~y~Gplv~LDPStEvAPMV~~hR 266 (641) T TIGR02767 193 LPFRPD-----DRATWG-KGGKAPLLTFDLDFGSTHMIFFAGSGGFKTTSVVVPTALKYGGPLVCLDPSTEVAPMVVEHR 266 (641) T ss_pred CCCCCC-----CCCCCC-CCCCCCCCCCCCCCCCEEEEEEECCCCEEECCCCCCCCCCCCCCEEEECCCCCCCCHHHHHH T ss_conf 889852-----786567-78885100103688850014870787811046423654027878798688766440068998 Q ss_pred HHCCCCEEEEEECCHHHHHHH-HHHHHHHHHHH----HHHHCCCCC------CCCCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 870996589963171110014-68999999999----987179986------4213345578006899999999999733 Q gi|254780200|r 133 REEGCDRLLIFPLYPQYSAAT-TGTAQDKVFQE----LIHMRWSPS------LRTVPPYYEDSDYISALAQSIREHFESI 201 (343) Q Consensus 133 ~~~g~~~ii~lPLyPqyS~sT-tgS~~~~~~~~----l~~~~~~~~------~~~I~~~~~~p~yI~a~~~~I~~~l~~~ 201 (343) ++.|-+++||| =|- +..+ -=.++|.+.+. .+|...... ...++.=.....|-++-|.++-.+|=. T Consensus 267 r~~g~R~ViVL--DP~-~~~~kgFNvLDwi~~~~vPr~~kEedIVg~a~~lltd~~r~ess~~~yF~~~A~~LLTaLlA- 342 (641) T TIGR02767 267 RKAGDRKVIVL--DPA-AGVVKGFNVLDWIERDGVPRLKKEEDIVGVAKLLLTDKVRTESSRDDYFRAQAHNLLTALLA- 342 (641) T ss_pred HHHCCCEEEEE--CCC-CCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHH- T ss_conf 74149648997--688-88746640001412378865541467999998885277554565311578889999998876- Q ss_pred CCCCCEEEEEECC Q ss_conf 8998669998336 Q gi|254780200|r 202 QWTPEMLLVSFHQ 214 (343) Q Consensus 202 ~~~~~~llfSaHg 214 (343) |++||-|- T Consensus 343 -----hV~lS~ey 350 (641) T TIGR02767 343 -----HVILSDEY 350 (641) T ss_pred -----HHHCCCCC T ss_conf -----54227764 No 116 >TIGR00962 atpA ATP synthase F1, alpha subunit; InterPro: IPR005294 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis . These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents the alpha subunit found in the F1 complex of F-ATPases. In F-ATPases, there are three copies each of the alpha and beta subunits that form the catalytic core of the F1 complex, while the remaining F1 subunits (gamma, delta, epsilon) form part of the stalks. There is a substrate-binding site on each of the alpha and beta subunits, those on the beta subunits being catalytic, while those on the alpha subunits are regulatory. The alpha-subunit contains a highly conserved adenine-specific non-catalytic nucleotide-binding domain, with a conserved amino acid sequence of Gly-X-X-X-X-Gly-Lys. The alpha and beta subunits form a cylinder that is attached to the central stalk. The alpha/beta subunits undergo a sequence of conformational changes leading to the formation of ATP from ADP, which are induced by the rotation of the gamma subunit, itself is driven by the movement of protons through the F0 complex C subunit . More information about this protein can be found at Protein of the Month: ATP Synthases .; GO: 0016820 hydrolase activity acting on acid anhydrides catalyzing transmembrane movement of substances, 0015986 ATP synthesis coupled proton transport, 0016469 proton-transporting two-sector ATPase complex. Probab=36.80 E-value=32 Score=15.25 Aligned_cols=91 Identities=21% Similarity=0.285 Sum_probs=44.6 Q ss_pred HHHCCCCCHHHHHHH-HHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHC------CCCCCCC---CCCCCCCCHH Q ss_conf 121045424789999-98709965899631711100146899999999998717------9986421---3345578006 Q gi|254780200|r 117 AMRYGKPSVKEIINN-LREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMR------WSPSLRT---VPPYYEDSDY 186 (343) Q Consensus 117 amry~~P~i~~~l~~-l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~------~~~~~~~---I~~~~~~p~y 186 (343) +|.|-.||..-++.+ +..+| .+++++ ||.+-|+...+| ..|+=|- =+-||-|..+ T Consensus 240 ~lqYLAPY~G~tmaEyF~d~G-khaLIi--------------YDDLSKqA~AYR~iSLLLRRPPGREAyPGDVFYLHSRL 304 (520) T TIGR00962 240 SLQYLAPYTGCTMAEYFRDNG-KHALII--------------YDDLSKQAVAYRQISLLLRRPPGREAYPGDVFYLHSRL 304 (520) T ss_pred HHHHHHHHHHHHHHHHHHHCC-CEEEEE--------------EECCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHH T ss_conf 245420688999999998659-737999--------------74740789999999987188658716997401454568 Q ss_pred HHHHHHHHHHHHHHCCCCCCEEE-EEECCCCHHHHH--HCC Q ss_conf 89999999999973389986699-983367502676--279 Q gi|254780200|r 187 ISALAQSIREHFESIQWTPEMLL-VSFHQMPVSYLL--KGD 224 (343) Q Consensus 187 I~a~~~~I~~~l~~~~~~~~~ll-fSaHglP~~~~~--~gD 224 (343) ++=-|+.=+++.+.+...+.+== =|-=.||. ++ .|| T Consensus 305 LERAAKLsdeYve~FtkGevkGgtGSlTALPI--IETQAGD 343 (520) T TIGR00962 305 LERAAKLSDEYVEKFTKGEVKGGTGSLTALPI--IETQAGD 343 (520) T ss_pred HHHHHHCCHHHHHHHHCCCCCCCCCCCCCCCC--EEECCCC T ss_conf 99986407888854305632466535236772--2521487 No 117 >TIGR02759 TraD_Ftype type IV conjugative transfer system coupling protein TraD; InterPro: IPR014128 The TraD protein performs an essential coupling function in conjugative type IV secretion systems. This protein sits at the inner membrane in contact with the assembled pilus and its scaffold as well as the relaxosome-plasmid DNA complex (through TraM) , .; GO: 0000746 conjugation. Probab=36.74 E-value=32 Score=15.25 Aligned_cols=131 Identities=17% Similarity=0.190 Sum_probs=73.5 Q ss_pred HHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCH Q ss_conf 99999998748954411786888998766651257478899999870778874045777599999999865224784300 Q gi|254780200|r 36 SLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRDQATNLAKRLESISSIVVD 115 (343) Q Consensus 36 ~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~L~~~L~~~~~~~V~ 115 (343) .=..|+...|.+|.+++.-.-+|+......+ | ..-+....+-....=++++=++...-.+. T Consensus 98 ~Y~~~~~~~L~~p~~~~~~~~~~~~~~~s~~---------------~----AL~~~~~~~~~~~~~f~k~G~~q~~d~~~ 158 (613) T TIGR02759 98 TYKTNQAQILKNPYVIEAYDQLWQALQLSLL---------------A----ALAISLVVFLLFMRFFKKRGEQQSKDQFI 158 (613) T ss_pred EEECHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------H----HHHHHHHHHHHHHHHHHHHCHHHHHHHCC T ss_conf 2201055665413899999999999999999---------------9----99999999999999998603000020011 Q ss_pred HHHHC-CCCC-HHHHHHHHHHC----C-----CC--EEEEEECCHHHHHH-------HHHH----HHHHHHHHHHHHC-- Q ss_conf 11210-4542-47899999870----9-----96--58996317111001-------4689----9999999998717-- Q gi|254780200|r 116 WAMRY-GKPS-VKEIINNLREE----G-----CD--RLLIFPLYPQYSAA-------TTGT----AQDKVFQELIHMR-- 169 (343) Q Consensus 116 ~amry-~~P~-i~~~l~~l~~~----g-----~~--~ii~lPLyPqyS~s-------TtgS----~~~~~~~~l~~~~-- 169 (343) -|.++ ..|. +...|++-.+. | +. +|-.+||+|++|.. |||| +++.+.+.+.+.+ T Consensus 159 rG~~l~~~~~~~~~~lk~~~r~lkkqGvGnGk~Sdi~i~g~~l~K~~sE~Qh~L~~GTtG~GKs~~lr~LL~~iR~rGd~ 238 (613) T TIGR02759 159 RGSQLASSPKELAKKLKKSKRALKKQGVGNGKASDIKIGGLPLIKKGSEVQHILIHGTTGSGKSVALRKLLRWIRQRGDR 238 (613) T ss_pred CCCEEECCHHHHHHHHHHHHHHHHHCCCCCCCCCCCEECCCCCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCE T ss_conf 36423068799999999876546541677865463000780678777742252664541743899999999999863985 Q ss_pred ---CCCCCCCCCCCCCCCH Q ss_conf ---9986421334557800 Q gi|254780200|r 170 ---WSPSLRTVPPYYEDSD 185 (343) Q Consensus 170 ---~~~~~~~I~~~~~~p~ 185 (343) ++....+++.|||... T Consensus 239 AIiYDkgC~f~~~fyd~~~ 257 (613) T TIGR02759 239 AIIYDKGCTFVSRFYDPSQ 257 (613) T ss_pred EEEEECCCCCCCCCCCCCC T ss_conf 8998257420213268888 No 118 >pfam08029 HisG_C HisG, C-terminal domain. Probab=36.66 E-value=32 Score=15.24 Aligned_cols=23 Identities=30% Similarity=0.660 Sum_probs=16.1 Q ss_pred CCHHHHHHHHHHCCCCEEEEEEC Q ss_conf 42478999998709965899631 Q gi|254780200|r 123 PSVKEIINNLREEGCDRLLIFPL 145 (343) Q Consensus 123 P~i~~~l~~l~~~g~~~ii~lPL 145 (343) --+.+.+.+|++.|.+-|+++|+ T Consensus 48 ~~~~~~~~~Lk~~GA~~Ilv~pI 70 (73) T pfam08029 48 KEVWEIMDELKALGASGILVLPI 70 (73) T ss_pred HHHHHHHHHHHHCCCCEEEEEEE T ss_conf 89999999999879977999662 No 119 >PRK04781 histidinol-phosphate aminotransferase; Provisional Probab=36.28 E-value=32 Score=15.20 Aligned_cols=14 Identities=7% Similarity=0.230 Sum_probs=9.0 Q ss_pred CCHHHHHHHHHHHH Q ss_conf 79899999999999 Q gi|254780200|r 323 SSNLSIDLLEKITR 336 (343) Q Consensus 323 ~~~~fi~~La~lv~ 336 (343) +++.|+++|..+.+ T Consensus 347 en~~ll~aL~~i~e 360 (362) T PRK04781 347 QNDRVLAALQRTQE 360 (362) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999996 No 120 >TIGR02843 CyoB cytochrome o ubiquinol oxidase, subunit I; InterPro: IPR014207 This entry represents subunit 1 (CyoB) of the cytochrome o terminal oxidase complex, which is a component of the aerobic respiratory chain that reacts with oxygen, reducing it to water with the concomitant transport of 4 protons across the membrane. Also known as the cytochrome bo complex, cytochrome o ubiquinol oxidase contains four subunits, two heme b cofactors and a copper atom which is believed to be the oxygen active site. This complex is structurally related to the cytochrome caa3 oxidases which utilise cytochrome c as the reductant and contain heme a cofactors, as well as the intermediate form aa3 oxidases which also react directly with quinones as the reductant.. Probab=36.13 E-value=24 Score=16.09 Aligned_cols=31 Identities=16% Similarity=0.272 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHHHHCCC----CCCE--EEEEECCC Q ss_conf 0689999999999973389----9866--99983367 Q gi|254780200|r 185 DYISALAQSIREHFESIQW----TPEM--LLVSFHQM 215 (343) Q Consensus 185 ~yI~a~~~~I~~~l~~~~~----~~~~--llfSaHgl 215 (343) +|-||.--|.++++...+. .++| =||||||+ T Consensus 71 GFaDAimmR~qqala~g~~~GyL~p~HyDqiFtAHGV 107 (646) T TIGR02843 71 GFADAIMMRTQQALASGESEGYLPPEHYDQIFTAHGV 107 (646) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHH T ss_conf 3688999999999963688997686421112212248 No 121 >PTZ00081 enolase (2-phospho-D-glycerate hydrolase); Provisional Probab=35.71 E-value=33 Score=15.14 Aligned_cols=182 Identities=13% Similarity=0.124 Sum_probs=92.8 Q ss_pred CCCEEEEEEC-CHHHHHH--HHHHHHHHHHHHHHHHCCCC------CCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9965899631-7111001--46899999999998717998------6421334557800689999999999973389986 Q gi|254780200|r 136 GCDRLLIFPL-YPQYSAA--TTGTAQDKVFQELIHMRWSP------SLRTVPPYYEDSDYISALAQSIREHFESIQWTPE 206 (343) Q Consensus 136 g~~~ii~lPL-yPqyS~s--TtgS~~~~~~~~l~~~~~~~------~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~ 206 (343) .+++.+++|. ++.|+.+ ....++..+.+.+++..... +=-|.+++-+...-++.+.+.|..+ ..+. T Consensus 171 diQEfMIiP~ga~sf~eal~~~~ev~~~lk~~L~~k~g~~~t~vgDEGGfaP~~~~~eeaL~li~eAi~~a----G~~~- 245 (442) T PTZ00081 171 PFQEFMIAPVGAPSFREALRMGAEVYHTLKSIIKKKYGQDATNVGDEGGFAPNIKSANEALDLLVEAIEKA----GHEG- 245 (442) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHC----CCCC- T ss_conf 84269987899987899999999999999999987608886765777670799999799999999999970----7775- Q ss_pred EEEEEECCCCH--HHHHHCCC--C-------------HHHHHHHHHHHHHHH-------CCCCCCCE--EEEECCCCC-- Q ss_conf 69998336750--26762799--8-------------889999999999972-------66756704--786225777-- Q gi|254780200|r 207 MLLVSFHQMPV--SYLLKGDP--Y-------------GCHCHKTARLLKEFL-------SWPDDRFK--ICFQSRFGR-- 258 (343) Q Consensus 207 ~llfSaHglP~--~~~~~gDp--Y-------------~~q~~~t~~~i~~~l-------~~~~~~~~--l~fQSr~Gp-- 258 (343) .+-+ ||=. +..-+++. | ..+..+....+.++. ++.+++|. -.+.+++|. T Consensus 246 ~i~i---alD~AAsefy~~~~~~Y~l~~k~~~~~~~s~~eli~~y~~l~~~YPI~sIEDpl~E~D~~gw~~lt~~lg~~~ 322 (442) T PTZ00081 246 KVAI---CMDVAASEFYDAEKKMYNLTFKCPEPTYKTGAELVETYEDWVAKYPIVSIEDPFDQDDWEEYAKITEALGGKV 322 (442) T ss_pred CCCE---ECCHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHCCCCE T ss_conf 6542---0023567776368963775330577772599999999999983098789958988467999999999709988 Q ss_pred ---CCCCCHHHHHHHHHHHHCCCCCEEEECC---HHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHH Q ss_conf ---6544302689999899769983999744---0542212558887589999999759975898168987989999999 Q gi|254780200|r 259 ---IKCLEPPTDKTVEKLAHDGIKSLAIITP---GFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLE 332 (343) Q Consensus 259 ---~~WL~P~t~~~l~~L~~~G~k~v~v~p~---gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La 332 (343) +-=|--.-.+.|++-.+++.-|-+++=| |-++|-+| +-+++.++|.....--.+=-+.+.||.-|| T Consensus 323 ~iVGDDL~vTn~~rl~~gi~~~~~naiLIK~NQiGTlset~e--------ai~~A~~~g~~~ivShRSGETeD~~iaDLA 394 (442) T PTZ00081 323 QIVGDDLLVTNPKRIKTALEKKACNALLLKVNQIGSITEAIE--------ACKLCMKNGWGVMVSHRSGETEDTFIADLV 394 (442) T ss_pred EEECCCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHH--------HHHHHHHCCCEEEEECCCCCCCCCHHHHHH T ss_conf 998976557689999999854877638983323220999999--------999999879879995798876630088877 Q ss_pred H Q ss_conf 9 Q gi|254780200|r 333 K 333 (343) Q Consensus 333 ~ 333 (343) = T Consensus 395 V 395 (442) T PTZ00081 395 V 395 (442) T ss_pred H T ss_conf 7 No 122 >pfam08981 consensus Probab=35.67 E-value=33 Score=15.13 Aligned_cols=144 Identities=14% Similarity=0.228 Sum_probs=73.7 Q ss_pred CCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC--CCCCHHHHHHHHHHHHHHHH Q ss_conf 4247899999870996589963171110014689999999999871799864213345--57800689999999999973 Q gi|254780200|r 123 PSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVPPY--YEDSDYISALAQSIREHFES 200 (343) Q Consensus 123 P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~~~--~~~p~yI~a~~~~I~~~l~~ 200 (343) --++-+++..++.|+.+|++- ||+|.+-..+.+.++.. ..++-.|.+- +..|+..+ +.+.+++.|++ T Consensus 12 ~tl~~a~~ra~e~gI~~iVvA--------SssG~TA~k~~e~~~~~--~~~lvvVth~~GF~~pg~~e-~~~e~~~~L~~ 80 (181) T pfam08981 12 DTLELAAERAKELGIKHIVVA--------SSSGETALKAAEALEGT--NLNVVVVTHHAGFSEPGEQE-MDPEVRKELEE 80 (181) T ss_pred HHHHHHHHHHHHCCCCEEEEE--------ECCCHHHHHHHHHHCCC--CCEEEEEECCCCCCCCCCCC-CCHHHHHHHHH T ss_conf 999999999997699769998--------07877999999871358--95599994645778998563-89999999997 Q ss_pred CCCCCCEEEEEECCC--CHHHHHH--CCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECC-----CCCCCCCCHHHHHHHH Q ss_conf 389986699983367--5026762--799888999999999997266756704786225-----7776544302689999 Q gi|254780200|r 201 IQWTPEMLLVSFHQM--PVSYLLK--GDPYGCHCHKTARLLKEFLSWPDDRFKICFQSR-----FGRIKCLEPPTDKTVE 271 (343) Q Consensus 201 ~~~~~~~llfSaHgl--P~~~~~~--gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr-----~Gp~~WL~P~t~~~l~ 271 (343) .. ..++..-|-+ .+|-+.+ |--|+.++.+.+-+ .+| .-..+|+--. -|-.++= ++++. T Consensus 81 ~G---~~V~t~tH~lsg~eR~is~kfgG~~p~eiiA~tLR---~fg---qG~KV~vEi~lMAaDaGlIp~~----eeVIa 147 (181) T pfam08981 81 RG---VKVLTGTHALSGLERAISNKFGGISPVEIIAETLR---MLG---QGVKVCVEIALMAADAGLIPVG----EEVIA 147 (181) T ss_pred CC---CEEEEECCCCCCCHHHHCCCCCCCCHHHHHHHHHH---HHC---CCCEEEEEEEEEECCCCCCCCC----CEEEE T ss_conf 79---88999503134423541100479798999999999---728---9747999998875067877889----83899 Q ss_pred HHH--HCCCCCEEEECCHHHHC Q ss_conf 899--76998399974405422 Q gi|254780200|r 272 KLA--HDGIKSLAIITPGFSSD 291 (343) Q Consensus 272 ~L~--~~G~k~v~v~p~gFvsD 291 (343) +. ..|..-.+|+-|+-..+ T Consensus 148 -iGGT~~GaDTA~Vi~PAhs~~ 168 (181) T pfam08981 148 -IGGTGRGADTALVIRPAHSNN 168 (181) T ss_pred -ECCCCCCCCEEEEEEECCCCC T ss_conf -776667763699983455676 No 123 >COG0803 LraI ABC-type metal ion transport system, periplasmic component/surface adhesin [Inorganic ion transport and metabolism] Probab=35.27 E-value=34 Score=15.09 Aligned_cols=143 Identities=16% Similarity=0.096 Sum_probs=83.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCC-CCCEEEEEC Q ss_conf 1334557800689999999999973389986699983367502676279988899999999999726675-670478622 Q gi|254780200|r 176 TVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPD-DRFKICFQS 254 (343) Q Consensus 176 ~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~-~~~~l~fQS 254 (343) .=++.|-+|.....+++.|++.|.+.+-.. -..+-++..-|..++.+-.+.....+.-.+ .++.++... T Consensus 131 ~dpH~Wldp~na~~~v~~I~~~L~~~dP~~----------~~~y~~N~~~y~~kL~~l~~~~~~~~~~~~~~r~~vt~h~ 200 (303) T COG0803 131 NDPHVWLDPKNAKIYAENIADALVELDPEN----------KETYEKNAEAYLKKLNKLDEEAKAKLSKIPAQRDVVTSHG 200 (303) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECC T ss_conf 999820489999999999999998869413----------8999999999999999999999999844764664998637 Q ss_pred CCC---C-------CCC-------CCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCC----C Q ss_conf 577---7-------654-------430268999989976998399974405422125588875899999997599----7 Q gi|254780200|r 255 RFG---R-------IKC-------LEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGG----E 313 (343) Q Consensus 255 r~G---p-------~~W-------L~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg----~ 313 (343) .|+ . .-| -+|.....+.++.+++--.++++-..+..+-+||| +.+.|. . T Consensus 201 af~Y~~~~~g~~~~~i~~~~~~~e~s~~~l~~l~~~ik~~~i~~If~e~~~~~~~~~~l----------a~e~g~~~~~~ 270 (303) T COG0803 201 AFGYLARDYGLKQVAIAGISPEAEPSPKDLAKLVDLIKKKNIKAIFVESNVSSKSAETL----------AKETGVKILGL 270 (303) T ss_pred CHHHHHHHCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHH----------HHHHCCCCCCC T ss_conf 20654510598512203658556999999999999998569978998188882799999----------99859860210 Q ss_pred EEEEECCCCCCH--HHHHHHHHHHHHH Q ss_conf 589816898798--9999999999998 Q gi|254780200|r 314 KFTQVPCLNSSN--LSIDLLEKITRRE 338 (343) Q Consensus 314 ~~~rvp~lN~~~--~fi~~La~lv~~~ 338 (343) .+.-.-+.++.+ .+++++.+.+..- T Consensus 271 ~~~d~~~~~~~~~~~y~~~~~~n~~~i 297 (303) T COG0803 271 LYLDSLGDKDSKGDTYISMMKANLDTI 297 (303) T ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 022344567876452899999999999 No 124 >cd01465 vWA_subgroup VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if n Probab=35.25 E-value=31 Score=15.34 Aligned_cols=38 Identities=21% Similarity=0.297 Sum_probs=23.5 Q ss_pred CCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 839997440542212558887589999999759975898168987 Q gi|254780200|r 279 KSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNS 323 (343) Q Consensus 279 k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~ 323 (343) ..+-|+.+||=+|+-+. ..+++|...|| +|.+|+..++ T Consensus 128 ~~i~i~tiGiG~~~~~~------~L~~iA~~~~G-~~~~v~~~~~ 165 (170) T cd01465 128 SGITLSTLGFGDNYNED------LMEAIADAGNG-NTAYIDNLAE 165 (170) T ss_pred CCCCCEEEEECCCCCHH------HHHHHHHCCCC-EEEECCCHHH T ss_conf 38862489808879999------99999975798-8998499999 No 125 >cd06431 GT8_LARGE_C LARGE catalytic domain has closest homology to GT8 glycosyltransferase involved in lipooligosaccharide synthesis. The catalytic domain of LARGE is a putative glycosyltransferase. Mutations of LARGE in mouse and human cause dystroglycanopathies, a disease associated with hypoglycosylation of the membrane protein alpha-dystroglycan (alpha-DG) and consequent loss of extracellular ligand binding. LARGE needs to both physically interact with alpha-dystroglycan and function as a glycosyltransferase in order to stimulate alpha-dystroglycan hyperglycosylation. LARGE localizes to the Golgi apparatus and contains three conserved DxD motifs. While two of the motifs are indispensible for glycosylation function, one is important for localization of th eenzyme. LARGE was originally named because it covers approximately large trunck of genomic DNA, more than 600bp long. The predicted protein structure contains an N-terminal cytoplasmic domain, a transmembrane region, a coiled-coil Probab=34.51 E-value=24 Score=16.08 Aligned_cols=36 Identities=6% Similarity=0.185 Sum_probs=21.5 Q ss_pred HHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99998709965899631711100146899999999998717 Q gi|254780200|r 129 INNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMR 169 (343) Q Consensus 129 l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~ 169 (343) +.++....+++++.+ .+-.+..+-++++.+.+.+.. T Consensus 90 Lp~iLP~~vd~vi~l-----DtDi~F~~~I~eLW~~F~~f~ 125 (280) T cd06431 90 LTEALPSDLEKVIVL-----DTDITFATDIAELWKIFHKFT 125 (280) T ss_pred HHHHCCCCCCEEEEE-----ECCEEECCCHHHHHHHHHHCC T ss_conf 766522224627999-----525000578999999987459 No 126 >KOG0467 consensus Probab=34.42 E-value=35 Score=15.00 Aligned_cols=49 Identities=18% Similarity=0.282 Sum_probs=19.5 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 98889999999999972667567047862257776544302689999899769 Q gi|254780200|r 225 PYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDG 277 (343) Q Consensus 225 pY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G 277 (343) .|..-|+..++..++.++..+.--.. +-.| .-|+-|+|+..++.-.-+| T Consensus 202 ~~~f~~~~fak~~~~kl~~k~~al~k---~lwg-d~y~~~ktk~I~~~~~~~g 250 (887) T KOG0467 202 GWGFGIEQFAKFYAKKLGLKDAALLK---FLWG-DRYIDPKTKRICEGKKLKG 250 (887) T ss_pred CCCCCHHHHHHHHHHHCCHHHHHHHH---HHCC-CEEECCHHHHHHCCCCCCC T ss_conf 56201999999998744734665213---2104-5243320466643567666 No 127 >pfam07302 AroM AroM protein. This family consists of several bacterial and archaeal AroM proteins. In Escherichia coli the aroM gene is cotranscribed with aroL. The function of this family is unknown. Probab=33.78 E-value=33 Score=15.09 Aligned_cols=187 Identities=18% Similarity=0.250 Sum_probs=96.3 Q ss_pred CCC-CCCCCCHHHHHHHHHHHCCCH-HHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH Q ss_conf 178-889999889999999874895-441178688899876665125747889999987077887404577759999999 Q gi|254780200|r 25 NLG-TPDGHDFFSLRRYLREFLLDK-RVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRDQATNL 102 (343) Q Consensus 25 n~G-~P~~~~~~~V~~yL~~fl~D~-~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~L 102 (343) ..| +|++ ||.|-+..++..+ .+.... .|.++ -...+ ..|. ...|.-+| T Consensus 6 TIGQsPR~----DV~pei~~~l~~~v~i~e~G------aLDgl----s~~eI-~~~a----P~~ge~~L----------- 55 (221) T pfam07302 6 TIGQSPRT----DVLPEITEHLGEPIEITEIG------ALDGL----SREEI-AALA----PGPGEDVL----------- 55 (221) T ss_pred EECCCCCH----HHHHHHHHHHCCCCCEEEEC------CCCCC----CHHHH-HHHC----CCCCCCEE----------- T ss_conf 83579851----42699998708997289843------55799----99999-9719----99998441----------- Q ss_pred HHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 98652247843001121045424789999987099658996317111001468999999999987179986421334557 Q gi|254780200|r 103 AKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVPPYYE 182 (343) Q Consensus 103 ~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~~~~~ 182 (343) -.+|.. +-.|.++-++-.|.+.+.+++|.++|++-|+++. ||. +..+...+ ..|. T Consensus 56 vTRL~D--G~~v~~s~~~v~~~lq~~i~~Le~~G~d~ilLlC---------TG~-----F~~l~~~~-----~ll~---- 110 (221) T pfam07302 56 VTRLRD--GQQVTVSKQKIERRLQQVIEKLDKEGVDVILLLC---------TGE-----FPKLSARN-----LLLE---- 110 (221) T ss_pred EEEECC--CCEEEEEHHHHHHHHHHHHHHHHHCCCCEEEEEE---------CCC-----CCCCCCCC-----CEEE---- T ss_conf 687069--9878988899889899999999867998999961---------477-----89865677-----2441---- Q ss_pred CCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCC--CC Q ss_conf 8006899999999999733899866999833675026762799888999999999997266756704786225777--65 Q gi|254780200|r 183 DSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGR--IK 260 (343) Q Consensus 183 ~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp--~~ 260 (343) |.-| +...++.-.. +...- +|+ |-..|+..+.+.-.. ++ ..+.|-.. -| +. T Consensus 111 -P~ri--l~~lV~a~~~---~~~lG-Vvv-------------P~~eQ~~~~~~kW~~-~~-----~~~~~~~a-sPy~~~ 163 (221) T pfam07302 111 -PQRI--LPPLVASIVD---GHQLG-VIV-------------PLPEQISQQANKWQK-LV-----KEVVVVAA-SPYHGS 163 (221) T ss_pred -HHHH--HHHHHHHHCC---CCEEE-EEE-------------CCHHHHHHHHHHHHH-CC-----CCCEEEEE-CCCCCC T ss_conf -1886--7899998678---98599-995-------------579999999998874-58-----87179971-898898 Q ss_pred CCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHH Q ss_conf 44302689999899769983999744054221255 Q gi|254780200|r 261 CLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLET 295 (343) Q Consensus 261 WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lET 295 (343) | ..+.++.++|.++|..=|+.=|+||...|=.. T Consensus 164 ~--~~l~~Aa~~L~~~gadlivLDCiGYt~~~r~~ 196 (221) T pfam07302 164 E--DRLEEAARELLDQGADLIVLDCIGYTQAMKDM 196 (221) T ss_pred H--HHHHHHHHHHHHCCCCEEEEECCCCCHHHHHH T ss_conf 8--99999999999759899999887774999999 No 128 >COG0406 phoE Broad specificity phosphatase PhoE and related phosphatases [General function prediction only] Probab=33.52 E-value=36 Score=14.90 Aligned_cols=122 Identities=21% Similarity=0.248 Sum_probs=56.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHH--HHHHHHH-HCCCCEEEEEECCHHHHHHHHHH----HHHH Q ss_conf 404577759999999986522478430011210454247--8999998-70996589963171110014689----9999 Q gi|254780200|r 88 ESILRTHTRDQATNLAKRLESISSIVVDWAMRYGKPSVK--EIINNLR-EEGCDRLLIFPLYPQYSAATTGT----AQDK 160 (343) Q Consensus 88 ~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~--~~l~~l~-~~g~~~ii~lPLyPqyS~sTtgS----~~~~ 160 (343) .+||...-.+||+.+.+.|.... +....-|+.|... ++.+.+. ..|.. +...+. ......|+ ..++ T Consensus 26 d~pLt~~G~~QA~~l~~~l~~~~---~~~~~i~sS~l~Ra~~TA~~~a~~~~~~-~~~~~~---l~E~~~G~~eg~~~~e 98 (208) T COG0406 26 DSPLTEEGRAQAEALAERLAARD---IGFDAIYSSPLKRAQQTAEPLAEELGLP-LEVDDR---LREIDFGDWEGLTIDE 98 (208) T ss_pred CCCCCHHHHHHHHHHHHHHHHCC---CCCCEEEECCHHHHHHHHHHHHHHCCCC-CEECCC---CCCCCCCCCCCCCHHH T ss_conf 99968779999999999986467---8977688275441999999999872997-324533---2436764357988899 Q ss_pred HHHH----HHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC--CCCEEEEEECCCCHHHH Q ss_conf 9999----987179986421334557800689999999999973389--98669998336750267 Q gi|254780200|r 161 VFQE----LIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQW--TPEMLLVSFHQMPVSYL 220 (343) Q Consensus 161 ~~~~----l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~--~~~~llfSaHglP~~~~ 220 (343) +.+. ...+...+.....+. -+.+.+ +.+++...+++... .+..+++..||...+.+ T Consensus 99 ~~~~~p~~~~~~~~~~~~~~~~~---gEs~~~-~~~R~~~~i~~~~~~~~~~~vlvVsHg~~ir~l 160 (208) T COG0406 99 LAEEPPEELAAWLADPYLAPPPG---GESLAD-VSKRVVAALAELLRSPPGNNVLVVSHGGVIRAL 160 (208) T ss_pred HHHCCHHHHHHHCCCCCCCCCCC---CCCHHH-HHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHH T ss_conf 98607788862124842258998---844999-999999999999985589859999894999999 No 129 >PRK12391 tryptophan synthase subunit beta; Reviewed Probab=33.07 E-value=36 Score=14.85 Aligned_cols=96 Identities=21% Similarity=0.392 Sum_probs=54.4 Q ss_pred CCCCCCCHHHHHH-----HHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 8889999889999-----99987489544117868889987666512574788999998707788740457775999999 Q gi|254780200|r 27 GTPDGHDFFSLRR-----YLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRDQATN 101 (343) Q Consensus 27 G~P~~~~~~~V~~-----yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~ 101 (343) |.|-+|+ ++.+ -+.+=++..|-|++|. .+.+.| +|| -.|||.. |+. T Consensus 27 ~~p~~~~--~l~~~fp~~li~qe~~~er~i~iPe-----------------~v~~~y-~~g----RPTPL~~-----A~~ 77 (419) T PRK12391 27 GEPVGPE--DLAPIFPMELIEQEVSTERWIDIPE-----------------EVREIY-RLW----RPTPLIR-----ARR 77 (419) T ss_pred CCCCCHH--HHHHHCHHHHHHHHHCCCCCCCCCH-----------------HHHHHH-HCC----CCCHHHH-----HHH T ss_conf 8959978--9645361999975505566656978-----------------999998-637----9966688-----888 Q ss_pred HHHHHHCCCCCCCHH----HHHCCCCCHHHHHHHH---HHCCCCEEEEEECCHHHHHHH Q ss_conf 998652247843001----1210454247899999---870996589963171110014 Q gi|254780200|r 102 LAKRLESISSIVVDW----AMRYGKPSVKEIINNL---REEGCDRLLIFPLYPQYSAAT 153 (343) Q Consensus 102 L~~~L~~~~~~~V~~----amry~~P~i~~~l~~l---~~~g~~~ii~lPLyPqyS~sT 153 (343) |.+.|+. ...+++ -.--|.=-|..++.|. ++.|.++|+.=.=.=|.-.+| T Consensus 78 L~k~lgt--~AkIYlKrEdl~htGsHKiNnai~QallaK~mGkkriiaETGAGQhG~A~ 134 (419) T PRK12391 78 LEKALGT--PAKIYYKYEGVSPAGSHKPNTAVAQAYYNKKEGIKRLTTETGAGQWGSAL 134 (419) T ss_pred HHHHHCC--CCEEEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHH T ss_conf 9997599--96799975677877776412199999999981995798635753677999 No 130 >PRK10481 hypothetical protein; Provisional Probab=32.81 E-value=36 Score=14.85 Aligned_cols=193 Identities=17% Similarity=0.213 Sum_probs=95.8 Q ss_pred CCEEEEEECCC-CCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHH Q ss_conf 86799997178-88999988999999987489544117868889987666512574788999998707788740457775 Q gi|254780200|r 17 GKIGVLLVNLG-TPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHT 95 (343) Q Consensus 17 ~K~gvLL~n~G-~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t 95 (343) .|.|+ +..| +|++ ||-|-+..++.+-.+.... .|.++ -...+ +.|. ...|.-+|. T Consensus 3 ~~ig~--vTIGQsPR~----DV~pel~~~l~~~~i~e~G------aLDgl----s~~eI-~~~a----P~~ge~~Lv--- 58 (224) T PRK10481 3 ASLAI--LTIGQSPRT----DVLPLLTEYLDEIEITHHG------LLDGL----SREEI-AAYA----PEDGEDTLV--- 58 (224) T ss_pred CEEEE--EEECCCCCH----HHHHHHHHHCCCCCEEEEC------CCCCC----CHHHH-HHCC----CCCCCCEEE--- T ss_conf 16999--995479850----4169999655877578732------45789----98999-8619----999985427--- Q ss_pred HHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999865224784300112104542478999998709965899631711100146899999999998717998642 Q gi|254780200|r 96 RDQATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLR 175 (343) Q Consensus 96 ~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~ 175 (343) .+|.. +-.|.++-+.-.|.+.+.+++|.++|++-|+++. ||. + +++. T Consensus 59 --------TRL~D--G~~V~~s~~~v~~~lq~~i~~le~~G~d~ilLlC---------TG~-----F---------~~l~ 105 (224) T PRK10481 59 --------TRLRD--GQQAHVSRQKVERDLQSVIEVLDNQGYDVILLLC---------TGN-----F---------PSLT 105 (224) T ss_pred --------EEECC--CCEEEEEHHHHHHHHHHHHHHHHHCCCCEEEEEE---------CCC-----C---------CCCC T ss_conf --------88069--9879988899788899999999867998999972---------467-----8---------8866 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECC Q ss_conf 13345578006899999999999733899866999833675026762799888999999999997266756704786225 Q gi|254780200|r 176 TVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSR 255 (343) Q Consensus 176 ~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr 255 (343) --+..--.|.-| +...++ ++-. +...- +|. |-..|+....+. - .......+|-.. T Consensus 106 ~~~~lll~P~ri--l~~lV~-ai~~--~~~lG-Vvv-------------P~~eQ~~~~~~k----W--~~~~~~~~~~~a 160 (224) T PRK10481 106 ARNTLLLEPSRI--LPPLVA-AIVD--GHQLG-VIV-------------PVEEQLALQAKK----W--QVLQKPPVVSLA 160 (224) T ss_pred CCCCEEECHHHH--HHHHHH-HHCC--CCEEE-EEE-------------CCHHHHHHHHHH----H--HHCCCCEEEEEE T ss_conf 678436515886--689999-8568--98699-996-------------278999999998----8--734786169983 Q ss_pred CCC--CCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHH Q ss_conf 777--654430268999989976998399974405422125 Q gi|254780200|r 256 FGR--IKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLE 294 (343) Q Consensus 256 ~Gp--~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lE 294 (343) -| +.| ..+.++-++|..+|..=|+.=|+||...|=. T Consensus 161 -sPY~g~~--~~l~~Aa~~L~~~gadlivLDCiGYt~~~r~ 198 (224) T PRK10481 161 -SPYHGSE--QKLIDAGKELLAQGADLIVLDCLGYHQRHRD 198 (224) T ss_pred -CCCCCCH--HHHHHHHHHHHHCCCCEEEEECCCCCHHHHH T ss_conf -7988988--9999999999865999999987777599999 No 131 >TIGR01211 ELP3 histone acetyltransferase, ELP3 family; InterPro: IPR005910 Histone acetylation is carried out by a class of enzymes known as histone acetyltransferases (HATs), which catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones . Early indication that HATs were involved in transcription came from the observation that in actively transcribed regions of chromatin, histones tend to be hyperacetylated, whereas in transcriptionally silent regions histones are hypoacetylated. The histone acetyltransferases are divided into five families. These include the Gcn5-related acetyltransferases (GNATs); the MYST (for 'MOZ, Ybf2/Sas3, Sas2 and Tip60)-related HATs; p300/CBP HATs; the general transcription factor HATs, which include the TFIID subunit TAF250; and the nuclear hormone-related HATs SRC1 and ACTR (SRC3). The GCN5-related N-acetyltransferase superfamily includes such enzymes as the histone acetyltransferases GCN5 and Hat1, the elongator complex subunit Elp3, the mediator-complex subunit Nut1, and Hpa2 . Many GNATs share several functional domains, including an N-terminal region of variable length, an acetyltransferase domain that encompasses the conserved sequence motifs described above, a region that interacts with the coactivator Ada2, and a C-terminal bromodomain that is believed to interact with acetyl-lysine residues. Members of the GNAT family are important for the regulation of cell growth and development. In mice, knockouts of Gcn5L are embryonic lethal. Yeast Gcn5 is needed for normal progression through the G2M boundary and mitotic gene expression. The importance of GNATs is probably related to their role in transcription and DNA repair. The yeast GCN5 (yGCN5) transcriptional coactivator functions as a histone acetyltransferase (HAT) to promote transcriptional activation. The crystal structure of the yeast histone acetyltransferase Hat1-acetyl coenzyme A (AcCoA) shows that Hat1 has an elongated, curved structure, and the AcCoA molecule is bound in a cleft on the concave surface of the protein, marking the active site of the enzyme. A channel of variable width and depth that runs across the protein is probably the binding site for the histone substrate . The central protein core associated with AcCoA binding that appears to be structurally conserved among a superfamily of N-acetyltransferases, including yeast histone acetyltransferase 1 and Serratia marcescens aminoglycoside 3-N-acetyltransferase . The Saccharomyces cerevisiae member YPL086C has been characterised in vitro as an N-terminal acetyltransferase (2.3.1.48 from EC) for all four core histones. It is a component of the RNA polymerase II holoenzyme, designated Elp3p for Elongator Protein 3. Members of this family are found in eukaryotes and archaea. These proteins are part of the larger set of GNAT acetyltransferases. In vivo, ELP3 gene deletion confers typical ELP phenotypes such as slow growth adaptation, slow gene activation, and temperature sensitivity. This suggests a role for the proteins as novel, tightly RNAPII-associated histone acetyltransferases in transcription of DNA packaged in chromatin .. Probab=32.77 E-value=32 Score=15.27 Aligned_cols=166 Identities=17% Similarity=0.225 Sum_probs=90.9 Q ss_pred HHHHHHHHHHCCCCEE-EEEECCHHHHHHHHHHHHHHHHHHHHHHCC------------CCCCCCCCCCCCCCHHHHHHH Q ss_conf 4789999987099658-996317111001468999999999987179------------986421334557800689999 Q gi|254780200|r 125 VKEIINNLREEGCDRL-LIFPLYPQYSAATTGTAQDKVFQELIHMRW------------SPSLRTVPPYYEDSDYISALA 191 (343) Q Consensus 125 i~~~l~~l~~~g~~~i-i~lPLyPqyS~sTtgS~~~~~~~~l~~~~~------------~~~~~~I~~~~~~p~yI~a~~ 191 (343) -++-|++|.+-|+++| |++ .|.++.+.+..++=+. +-.+++ .||--|+.-..-- T Consensus 249 ~e~~id~ML~~G~TrVElGV-----------Qtiy~~i~~~~kRGH~V~~~~~at~llrDaG~KV--~yH~MPGlPGs~f 315 (573) T TIGR01211 249 REEEIDRMLKLGATRVELGV-----------QTIYNDILERIKRGHTVRDVVEATRLLRDAGLKV--VYHIMPGLPGSSF 315 (573) T ss_pred CHHHHHHHHHCCCCEEEEEE-----------ECCHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCE--EEECCCCCCCCCH T ss_conf 86889999835984899752-----------0726899998378985899999987766504622--0311753335663 Q ss_pred HHHHHH----HHHCCCCCCEE----EEEECCCCHH-HHHHCCCC-HHHHHHHHHHHHHHHCCCCCCCE-E---------- Q ss_conf 999999----97338998669----9983367502-67627998-88999999999997266756704-7---------- Q gi|254780200|r 192 QSIREH----FESIQWTPEML----LVSFHQMPVS-YLLKGDPY-GCHCHKTARLLKEFLSWPDDRFK-I---------- 250 (343) Q Consensus 192 ~~I~~~----l~~~~~~~~~l----lfSaHglP~~-~~~~gDpY-~~q~~~t~~~i~~~l~~~~~~~~-l---------- 250 (343) ++-.+. +++..+++|+| =+.--|.+.= +=++|+ | +...+|++++|++...+.+ +|. + T Consensus 316 ErDl~~Fr~~Fedp~FkPDmLKIYPTLV~rGT~LY~lWk~G~-Y~PY~~eEaveLiv~~~~~~P-~WvR~~RiQRDIP~~ 393 (573) T TIGR01211 316 ERDLKMFREIFEDPRFKPDMLKIYPTLVTRGTELYELWKRGE-YKPYTTEEAVELIVEIKKIMP-KWVRIQRIQRDIPAK 393 (573) T ss_pred HHHHHHHHHHHCCCCCCCCCEEECCEEEECCCCHHHHHHCCC-CCCCCHHHHHHHHHHHHHHCC-CCEEEEEEEECCCCC T ss_conf 568999988626879897861566710234761168886789-987767899999999997389-735775774057851 Q ss_pred -----EEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEC--CHHHHCCHHHHHHHHH--HHHHHHHHCCCCE Q ss_conf -----86225777654430268999989976998399974--4054221255888758--9999999759975 Q gi|254780200|r 251 -----CFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIIT--PGFSSDCLETSYEIAH--EAKEIFVNGGGEK 314 (343) Q Consensus 251 -----~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p--~gFvsD~lETl~Eidi--e~~e~~~e~Gg~~ 314 (343) ...|-+|- =+...|.+.|++==.|=| +|-..++--+..|+|. -.+|....+||.+ T Consensus 394 ~iv~GV~kSNLr~---------Lv~~rm~e~G~~CrcIRcREVGh~~~~~gv~pe~e~v~L~ve~Y~A~GG~E 457 (573) T TIGR01211 394 LIVAGVKKSNLRE---------LVYRRMKEEGIKCRCIRCREVGHQMVKPGVQPEVENVELIVEEYAASGGRE 457 (573) T ss_pred EEECCCCCCCHHH---------HHHHHHHHCCCCCCEEEEEEECCCCCCCCCCCCCCCEEEEEEEEECCCCEE T ss_conf 0132645267789---------999999956898870466760200468885531101377763101488568 No 132 >cd01841 NnaC_like NnaC (CMP-NeuNAc synthetase) _like subfamily of SGNH_hydrolases, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles two of the three components of typical Ser-His-Asp(Glu) triad from other serine hydrolases. E. coli NnaC appears to be involved in polysaccharide synthesis. Probab=32.66 E-value=37 Score=14.81 Aligned_cols=23 Identities=26% Similarity=0.505 Sum_probs=10.2 Q ss_pred HHHHHHHCCC-CEEEEEECCHHHH Q ss_conf 9999987099-6589963171110 Q gi|254780200|r 128 IINNLREEGC-DRLLIFPLYPQYS 150 (343) Q Consensus 128 ~l~~l~~~g~-~~ii~lPLyPqyS 150 (343) .++++++... .+|+++.+.|... T Consensus 80 ii~~i~~~~p~~~Ii~~~~~P~~~ 103 (174) T cd01841 80 IIEQIREEFPNTKIYLLSVLPVLE 103 (174) T ss_pred HHHHHHHHCCCCEEEEEECCCCCC T ss_conf 999999978998799983278777 No 133 >COG1250 FadB 3-hydroxyacyl-CoA dehydrogenase [Lipid metabolism] Probab=32.30 E-value=35 Score=14.94 Aligned_cols=35 Identities=29% Similarity=0.294 Sum_probs=15.1 Q ss_pred CCCCCCHHHHHHHHHHHHC-CCCCEE--EECCHHHHCCH Q ss_conf 7654430268999989976-998399--97440542212 Q gi|254780200|r 258 RIKCLEPPTDKTVEKLAHD-GIKSLA--IITPGFSSDCL 293 (343) Q Consensus 258 p~~WL~P~t~~~l~~L~~~-G~k~v~--v~p~gFvsD~l 293 (343) +++++.+.|.+++.+++++ | |.++ --+|||+.+++ T Consensus 155 ~g~~T~~e~~~~~~~~~~~ig-K~~vv~~D~pGFi~NRi 192 (307) T COG1250 155 RGEKTSDETVERVVEFAKKIG-KTPVVVKDVPGFIVNRL 192 (307) T ss_pred CCCCCCHHHHHHHHHHHHHCC-CCCEEECCCCCEEHHHH T ss_conf 688789999999999999859-98874267886125777 No 134 >COG0434 SgcQ Predicted TIM-barrel enzyme [General function prediction only] Probab=32.14 E-value=37 Score=14.75 Aligned_cols=34 Identities=18% Similarity=0.225 Sum_probs=19.6 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEEC Q ss_conf 45578006899999999999733899866999833 Q gi|254780200|r 179 PYYEDSDYISALAQSIREHFESIQWTPEMLLVSFH 213 (343) Q Consensus 179 ~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaH 213 (343) -+..+.++++--+....++....+ ...++++-.| T Consensus 120 ~~~tdqGiieg~A~e~~r~r~~L~-~~v~vlADv~ 153 (263) T COG0434 120 AYATDQGIIEGNAAELARYRARLG-SRVKVLADVH 153 (263) T ss_pred EEECCCCCCCCHHHHHHHHHHHCC-CCCEEEEECH T ss_conf 276356501444889999898616-7737976111 No 135 >TIGR01916 F420_puta_dedox F420-dependent oxidoreductase, putative; InterPro: IPR008225 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.. Probab=31.96 E-value=32 Score=15.25 Aligned_cols=55 Identities=18% Similarity=0.184 Sum_probs=36.6 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEE-----------------ECCCCHHHHHHC--CCCHHHHHHHHHHHHHHHC Q ss_conf 999999999997338998669998-----------------336750267627--9988899999999999726 Q gi|254780200|r 188 SALAQSIREHFESIQWTPEMLLVS-----------------FHQMPVSYLLKG--DPYGCHCHKTARLLKEFLS 242 (343) Q Consensus 188 ~a~~~~I~~~l~~~~~~~~~llfS-----------------aHglP~~~~~~g--DpY~~q~~~t~~~i~~~l~ 242 (343) ++.|+.|++.|++.-+...-+|.| +=||+.---.+| |.|..++.-|..+|++.|. T Consensus 137 D~SA~~ir~~L~el~G~~VgVIITDT~GRpFR~G~vGvAiG~aG~~~l~D~~G~~DlyGreL~~T~vAvADELA 210 (260) T TIGR01916 137 DASAEKIRRGLRELLGKDVGVIITDTNGRPFREGQVGVAIGAAGVKALKDWRGEKDLYGRELEVTVVAVADELA 210 (260) T ss_pred HHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCCCCEEEEEECCCHHHHHCCCCCCCCCCCCCEEEEEEHHHHH T ss_conf 78999999999986398478999718888401687437881004047873278959888822122221034788 No 136 >TIGR03455 HisG_C-term ATP phosphoribosyltransferase, C-terminal domain. This domain corresponds to the C-terminal third of the HisG protein. It is absent in many lineages. Probab=31.95 E-value=38 Score=14.73 Aligned_cols=23 Identities=13% Similarity=0.364 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHCCCCCEEEECCH Q ss_conf 26899998997699839997440 Q gi|254780200|r 265 PTDKTVEKLAHDGIKSLAIITPG 287 (343) Q Consensus 265 ~t~~~l~~L~~~G~k~v~v~p~g 287 (343) .+-+.+++|.+.|.+.|+++|+. T Consensus 75 ~~~~~i~~Lk~~GA~~Ilv~pIe 97 (100) T TIGR03455 75 VVNELIDKLKAAGARDILVLPIE 97 (100) T ss_pred HHHHHHHHHHHCCCCEEEEEEHH T ss_conf 99999999998799779991365 No 137 >KOG1461 consensus Probab=31.79 E-value=38 Score=14.71 Aligned_cols=21 Identities=19% Similarity=0.556 Sum_probs=13.6 Q ss_pred CCHHHHHHHHHHCCCCEEEEE Q ss_conf 424789999987099658996 Q gi|254780200|r 123 PSVKEIINNLREEGCDRLLIF 143 (343) Q Consensus 123 P~i~~~l~~l~~~g~~~ii~l 143 (343) |-|+-+|.-|.+.|++++.++ T Consensus 56 pmIdYtL~~L~~agV~eVfvf 76 (673) T KOG1461 56 PMIDYTLEWLERAGVEEVFVF 76 (673) T ss_pred HHHHHHHHHHHHCCCEEEEEE T ss_conf 679999999986596189999 No 138 >pfam09379 FERM_N FERM N-terminal domain. This domain is the N-terminal ubiquitin-like structural domain of the FERM domain. Probab=31.61 E-value=38 Score=14.69 Aligned_cols=36 Identities=11% Similarity=0.134 Sum_probs=26.9 Q ss_pred HHHHHHHHHHHCCCCCC-CEEEEECCCCCCCCCCHHH Q ss_conf 99999999972667567-0478622577765443026 Q gi|254780200|r 231 HKTARLLKEFLSWPDDR-FKICFQSRFGRIKCLEPPT 266 (343) Q Consensus 231 ~~t~~~i~~~l~~~~~~-~~l~fQSr~Gp~~WL~P~t 266 (343) .+-...|.+.+|+.+.. +.++|+.+-|...||.|+- T Consensus 20 ~~ll~~Vc~~l~l~E~~yFGL~y~~~~~~~~WLd~~k 56 (79) T pfam09379 20 QELLDQVCQRLGLREKDYFGLQFNDKKNERIWLDLDK 56 (79) T ss_pred HHHHHHHHHHCCCCCCCEEEEEEECCCCCCCCCCCCC T ss_conf 9999999977596644633689999999808289998 No 139 >PRK00553 ribose-phosphate pyrophosphokinase; Provisional Probab=31.41 E-value=39 Score=14.67 Aligned_cols=64 Identities=20% Similarity=0.198 Sum_probs=37.9 Q ss_pred HHHHHHHHHHCCCCCCCEEEEECCCCC---------------CCC-------CCHHHHHHHHHHHHCCCCCEEEECC--H Q ss_conf 999999997266756704786225777---------------654-------4302689999899769983999744--0 Q gi|254780200|r 232 KTARLLKEFLSWPDDRFKICFQSRFGR---------------IKC-------LEPPTDKTVEKLAHDGIKSLAIITP--G 287 (343) Q Consensus 232 ~t~~~i~~~l~~~~~~~~l~fQSr~Gp---------------~~W-------L~P~t~~~l~~L~~~G~k~v~v~p~--g 287 (343) +-++.+++.||.+ +.+....|.++ .-. .+=++.++.+.|.++|.++|.+++. = T Consensus 189 krA~~~A~~L~~~---~a~~~K~R~~~~~~e~~~~iGdV~Gk~vIIVDDiIdTGgTl~~aa~~Lk~~GA~~V~a~aTHgv 265 (340) T PRK00553 189 KRARLIAESLELP---LAIIDKRRPKHNVAESINVLGEVKNKNCLIVDDMIDTGGTVIAAAKLLKKQKAKKVCVMATHGL 265 (340) T ss_pred HHHHHHHHHCCCC---EEEEECCCCCCCCCCCCCCCCCCCCCEEEEECCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCC T ss_conf 9999999982996---7998534578986411344555468548952524431256999999999879987999997951 Q ss_pred HHHCCHHHHHH Q ss_conf 54221255888 Q gi|254780200|r 288 FSSDCLETSYE 298 (343) Q Consensus 288 FvsD~lETl~E 298 (343) |.-+++|-+.| T Consensus 266 fsg~A~eri~~ 276 (340) T PRK00553 266 FNKNAIQLFDE 276 (340) T ss_pred CCHHHHHHHHH T ss_conf 48579999985 No 140 >COG3739 Uncharacterized integral membrane protein [Function unknown] Probab=31.35 E-value=19 Score=16.77 Aligned_cols=33 Identities=21% Similarity=0.361 Sum_probs=27.9 Q ss_pred HCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHH Q ss_conf 709965899631711100146899999999998 Q gi|254780200|r 134 EEGCDRLLIFPLYPQYSAATTGTAQDKVFQELI 166 (343) Q Consensus 134 ~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~ 166 (343) +.|+.+|..+|||--|-++.+||+..+..+.+. T Consensus 100 e~~~~Ki~GVPLysGFMYaaVgSYi~qaWr~fD 132 (263) T COG3739 100 EFGYSKIFGVPLYSGFMYAAVGSYICQAWRLFD 132 (263) T ss_pred CHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 046777538503767799999999999999823 No 141 >cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. Probab=30.98 E-value=39 Score=14.62 Aligned_cols=99 Identities=13% Similarity=0.281 Sum_probs=53.5 Q ss_pred HHHHHHHHHHHHHHCCC-CCCC-----HH--HHHC------CCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHH Q ss_conf 75999999998652247-8430-----01--1210------454247899999870996589963171110014689999 Q gi|254780200|r 94 HTRDQATNLAKRLESIS-SIVV-----DW--AMRY------GKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQD 159 (343) Q Consensus 94 ~t~~qa~~L~~~L~~~~-~~~V-----~~--amry------~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~ 159 (343) +|++....+.+++.+.+ +..| .+ +++. -=|..++.+++|++.|++ +++-+-|..+..+ . T Consensus 27 ~~q~ev~~va~~~r~~~iP~dvi~lD~~~~~~~~~Ftwd~~rFPdp~~mi~~L~~~G~k--lv~~i~P~v~~~~--~--- 99 (303) T cd06592 27 INQETVLNYAQEIIDNGFPNGQIEIDDNWETCYGDFDFDPTKFPDPKGMIDQLHDLGFR--VTLWVHPFINTDS--E--- 99 (303) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEEEECHHHCCCCCEEECCCCCCCHHHHHHHHHHCCCE--EEEEECCCCCCCC--H--- T ss_conf 78999999999999849998869981724458682375821289989999999986994--8999555105885--8--- Q ss_pred HHHHHHHHHCCC---C--CCCCCCCC---------CCCCHHHHHHHHHHHHHHHH Q ss_conf 999999871799---8--64213345---------57800689999999999973 Q gi|254780200|r 160 KVFQELIHMRWS---P--SLRTVPPY---------YEDSDYISALAQSIREHFES 200 (343) Q Consensus 160 ~~~~~l~~~~~~---~--~~~~I~~~---------~~~p~yI~a~~~~I~~~l~~ 200 (343) .++++.+.+.. + ....+..| +.+|.-.+.|.+.+++.+.+ T Consensus 100 -~~~e~~~~g~~vk~~~g~~p~~~~~W~G~~~~~Dftnp~ar~Ww~~~~~~~~~~ 153 (303) T cd06592 100 -NFREAVEKGYLVSEPSGDIPALTRWWNGTAAVLDFTNPEAVDWFLSRLKSLQEK 153 (303) T ss_pred -HHHHHHHCCCEEECCCCCCCCEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf -789898689789758998454001258976531689979999999999988874 No 142 >pfam07429 Fuc4NAc_transf 4-alpha-L-fucosyltransferase (Fuc4NAc transferase). This family contains the bacterial enzyme 4-alpha-L-fucosyltransferase (Fuc4NAc transferase) (EC 2.4.1.-) (approximately 360 residues long). This catalyses the synthesis of Fuc4NAc-ManNAcA-GlcNAc-PP-Und (lipid III) as part of the biosynthetic pathway of enterobacterial common antigen (ECA), a polysaccharide comprised of the trisaccharide repeat unit Fuc4NAc-ManNAcA-GlcNAc. Probab=30.87 E-value=39 Score=14.61 Aligned_cols=26 Identities=27% Similarity=0.618 Sum_probs=17.5 Q ss_pred CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHC Q ss_conf 17868889987666512574788999998707 Q gi|254780200|r 52 ELPSWQWRPILFGYILNFRPSKIKHAYAKIWN 83 (343) Q Consensus 52 ~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~ 83 (343) ..+.++|..+|.|-|.+.+ .|=.||+ T Consensus 87 QFN~~LWLALl~Gki~~~q------~~WhIWG 112 (361) T pfam07429 87 QFNPWLWLALLSGKIKPAQ------FSWHIWG 112 (361) T ss_pred CCCHHHHHHHHHCCCCHHC------EEEEEEC T ss_conf 6888999999809755020------6999987 No 143 >PRK13122 consensus Probab=30.59 E-value=40 Score=14.58 Aligned_cols=212 Identities=16% Similarity=0.163 Sum_probs=95.6 Q ss_pred CCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHH-HHHHHCCCCCCCHHHHH Q ss_conf 78679999717888999988999999987489544117868889987666512574788999-99870778874045777 Q gi|254780200|r 16 FGKIGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHA-YAKIWNTAKNESILRTH 94 (343) Q Consensus 16 ~~K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~-Y~~Iw~~~gg~SPL~~~ 94 (343) |.|-=|=.+ +|.|+. .+..+.|.+ +.-++|++.-++--|+.-| +-++.. ++.+ .+|.-+... T Consensus 1 ~~K~fipyi-~g~pd~---~~~~~~l~~--~GaDiiElGiPfSDP~ADG-------pvIQ~A~~rAL----~~G~~~~~~ 63 (242) T PRK13122 1 MTKLFIPYI-MGNKDL---IENATLLSE--NGADIIEIGVPFSDPVADG-------PVIMEAGQQAI----KQGITIDYI 63 (242) T ss_pred CCEEEEEEE-CCCCCH---HHHHHHHHH--CCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHHH T ss_conf 972377762-689999---999999997--5999999789888866658-------99999999999----769989999 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHH----HHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 599999999865224784300112104542478----9999987099658996317111001468999999999987179 Q gi|254780200|r 95 TRDQATNLAKRLESISSIVVDWAMRYGKPSVKE----IINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRW 170 (343) Q Consensus 95 t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~----~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~ 170 (343) .+ +.+++++. .+.++ +-|.|++|...- -++++++.|++-+++ |=.|.- -.+++.+.+.+.+. T Consensus 64 ~~-~l~~~r~~----~~~pi-vlM~Y~N~i~~~G~~~F~~~~~~~GvdGvIi-pDLP~e-------e~~~~~~~~~~~gi 129 (242) T PRK13122 64 FN-QLEKHGDQ----IKCNY-VLMTYYNIICHYGEQAFFEKCRDTGVYGLII-PDLPYE-------LSQRLKQQFSHYGV 129 (242) T ss_pred HH-HHHHHCCC----CCCCE-EEEEECHHHHHHCHHHHHHHHHHCCCCEEEC-CCCCHH-------HHHHHHHHHHHCCC T ss_conf 99-99973136----79877-9998516988727999999998769986777-899878-------89999999986798 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 98642133455780068999999999997338998669998336750267627998889999999999972667567047 Q gi|254780200|r 171 SPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKI 250 (343) Q Consensus 171 ~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l 250 (343) . .+.+|.+=. .+ ++|+.-.+..++ .....|.-|+-= .+. .....+.+..+.|.+.-+. +..+ T Consensus 130 ~-~I~lvaPtt-~~-------~Ri~~i~~~s~G--FiY~vs~~GvTG---~~~-~~~~~~~~~i~~ik~~t~~---Pv~v 191 (242) T PRK13122 130 K-IISLVAMTT-DD-------KRIKDIVSHAEG--FIYTVTMNATTG---QNG-AFHPELKRKIESIKAIANV---PVVA 191 (242) T ss_pred C-EEEEECCCC-CH-------HHHHHHHHHCCC--CEEEEECCCCCC---CCC-CCCHHHHHHHHHHHHHCCC---CEEE T ss_conf 6-898718999-89-------999999982999--669873354357---655-5658899999999972599---8587 Q ss_pred EEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHH Q ss_conf 86225777654430268999989976998399974405 Q gi|254780200|r 251 CFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGF 288 (343) Q Consensus 251 ~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gF 288 (343) +| | =++.+.++++.+ +..-| |+==+| T Consensus 192 GF----G------I~~~e~v~~i~~-~ADGv-IVGSai 217 (242) T PRK13122 192 GF----G------IRTPQHVADIKE-VADGI-VIGSEI 217 (242) T ss_pred EC----C------CCCHHHHHHHHH-HCCEE-EECHHH T ss_conf 15----8------799999999981-19999-984899 No 144 >TIGR02079 THD1 threonine dehydratase; InterPro: IPR011820 This entry represents threonine dehydratase, the first step in the pathway converting threonine into isoleucine. At least two other clades of biosynthetic threonine dehydratases are described by IPR005787 from INTERPRO and IPR005789 from INTERPRO. The sequences described by this entry are exclusively found in species containing the rest of the isoleucine pathway and which are generally lacking in members of the other two clades of threonine dehydratases. The sequences associated with this entry are often cluster with other elements of the isoleucine pathway.; GO: 0004794 L-threonine ammonia-lyase activity, 0030170 pyridoxal phosphate binding, 0009097 isoleucine biosynthetic process. Probab=30.58 E-value=40 Score=14.58 Aligned_cols=56 Identities=18% Similarity=0.254 Sum_probs=39.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC--CCCEEEE Q ss_conf 68999999999987179986421334557800689999999999973389--9866999 Q gi|254780200|r 154 TGTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQW--TPEMLLV 210 (343) Q Consensus 154 tgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~--~~~~llf 210 (343) +|+++|+...+..+....-.-.+|+|| ||+.-|+-=...--|-+++.+. +.|.++. T Consensus 120 ~GDtFD~~aa~A~~~~~~~g~~~iPPF-DD~~IIeGQGTva~Eil~ql~~~~k~D~V~~ 177 (415) T TIGR02079 120 VGDTFDEAAAAAKESVEKEGKTFIPPF-DDPDIIEGQGTVAVEILEQLEEETKIDYVVV 177 (415) T ss_pred ECCCHHHHHHHHHHHHHHCCCEECCCC-CCHHHHHCHHHHHHHHHHHCCCCCCCCEEEE T ss_conf 557057899999999997189016899-8756651102765667750586767768998 No 145 >TIGR01422 phosphonatase phosphonoacetaldehyde hydrolase; InterPro: IPR006323 These sequences represent an enzyme that catalyzes the cleavage of the carbon phosphorous bond of a phosphonate. The mechanism depends on the substrate having a carbonyl one carbon away from the cleavage position. This enzyme is a member of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases, and contains a modified version of the conserved catalytic motifs of that superfamily: the first motif is usually DxDx(T/V), here it is DxAxT, and in the third motif the normal conserved lysine is instead an arginine. Additionally, the enzyme contains a unique conserved catalytic lysine (B. cereus pos. 53) which is involved in the binding and activation of the substrate through the formation of a Schiff base . The substrate of this enzyme is the product of 2-aminoethylphosphonate (AEP) transaminase, phosphonoacetaldehyde. This degradation pathway for AEP may be related to its toxic properties which are utilised by microorganisms as a chemical warfare agent.. Probab=30.41 E-value=40 Score=14.56 Aligned_cols=121 Identities=17% Similarity=0.287 Sum_probs=64.4 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCC--CHHHHHHHH Q ss_conf 688899876665125747889999987077887404577759999999986522478430011210454--247899999 Q gi|254780200|r 55 SWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRDQATNLAKRLESISSIVVDWAMRYGKP--SVKEIINNL 132 (343) Q Consensus 55 ~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry~~P--~i~~~l~~l 132 (343) -.-|++|-..- ..|.++++|++.-......+=+..+=++.... .+..=-+|..| ...+++++| T Consensus 45 lgK~dHiral~----~~P~iA~rw~~vFgrlp~~~Di~aiYe~F~pl-----------~ia~~a~haapIp~~levia~l 109 (199) T TIGR01422 45 LGKWDHIRALT----ELPRIAERWRRVFGRLPTEADIQAIYEEFMPL-----------QIAKVAEHAAPIPAVLEVIAEL 109 (199) T ss_pred CCHHHHHHHHH----HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH-----------HHHHHHHHCCCCHHHHHHHHHH T ss_conf 63689999986----30289999999708999774589999861689-----------9998655405553689999999 Q ss_pred HHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEE Q ss_conf 87099658996317111001468999999999987179986421334557800689999999999973389986699983 Q gi|254780200|r 133 REEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSF 212 (343) Q Consensus 133 ~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSa 212 (343) +..|+. |. |||| +-+++++.+ | ..-+...+++|+|+-+ T Consensus 110 R~rGiK--IG---------ScsG-Ypr~vMd~v----------------------------~-~~Aa~~GY~pD~vVat- 147 (199) T TIGR01422 110 RERGIK--IG---------SCSG-YPREVMDIV----------------------------V-AEAALQGYKPDFVVAT- 147 (199) T ss_pred HHCCCE--EE---------CCCC-CHHHHHHHH----------------------------H-HHHHHCCCCCCEEECC- T ss_conf 738844--42---------4798-757899999----------------------------9-9997478976642466- Q ss_pred CCCCHHHHHHCCCCHHHHHHHHHH Q ss_conf 367502676279988899999999 Q gi|254780200|r 213 HQMPVSYLLKGDPYGCHCHKTARL 236 (343) Q Consensus 213 HglP~~~~~~gDpY~~q~~~t~~~ 236 (343) --+| ..|-||+.+|++++=. T Consensus 148 Ddvp----q~GRP~P~~alkNvie 167 (199) T TIGR01422 148 DDVP----QAGRPYPFMALKNVIE 167 (199) T ss_pred CCCC----CCCCCHHHHHHHHHHH T ss_conf 2246----7889747899998987 No 146 >pfam07796 DUF1638 Protein of unknown function (DUF1638). This family contains sequences covering an approximately 270 amino acid stretch of a group of hypothetical proteins. These proteins are expressed by archaeal species of the Methanosarcina genus. Probab=30.30 E-value=40 Score=14.54 Aligned_cols=131 Identities=13% Similarity=0.124 Sum_probs=57.5 Q ss_pred HHHHHHHHHHHHCC----CCCCCHHHHHCCC--CCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999865224----7843001121045--42478999998709965899631711100146899999999998717 Q gi|254780200|r 96 RDQATNLAKRLESI----SSIVVDWAMRYGK--PSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMR 169 (343) Q Consensus 96 ~~qa~~L~~~L~~~----~~~~V~~amry~~--P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~ 169 (343) +.+..+|++.+++. +.+-|.+|- +|. .-++..+++ .|+.+ |--++.=..-.|+- +..+.+. T Consensus 12 ~~L~~~v~~~i~~~~~~~d~Ill~YG~-CG~g~g~l~~~~~~---~~~~~----~~~~dCi~~~lG~~--~~~~~~~--- 78 (166) T pfam07796 12 EKLKSAVYEAIDEMREFSDGILLFYGD-CGTGLGLLERDCAE---LGVLR----PIGDDCISFLLGGN--AFYAELL--- 78 (166) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEECC-CCCCCHHHHHHHHH---CCCCC----CCCCCHHHHHCCHH--HHHHHHH--- T ss_conf 999999999999705026828999724-88871269998876---59510----68853899981968--8999973--- Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCC Q ss_conf 998642133455780068999999999997338998669998336750267627998889999999999972667 Q gi|254780200|r 170 WSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWP 244 (343) Q Consensus 170 ~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~ 244 (343) +-...||-.|..++.|-+.+-+.+.....+.-.-.+-.|=-.+-+++.|..+.....+.++..++.+|++ T Consensus 79 -----~~~gtffLT~~w~~~~~~~~~~~~G~d~~~~l~~~~f~~Y~rv~~i~tg~~~~~~~~~~a~~~A~~lgl~ 148 (166) T pfam07796 79 -----REPGTFYLTPGWARQWDAFVIKRLGLDRHPELRDMYFGHYRKVVYIDTGLYQTDDFEAKAREFADRLGLE 148 (166) T ss_pred -----CCCCEEEECHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCC T ss_conf -----6799699556999989999998628773468999998278679998179987677999999999983997 No 147 >pfam05675 DUF817 Protein of unknown function (DUF817). This family consists of several bacterial proteins of unknown function. Probab=30.23 E-value=40 Score=14.54 Aligned_cols=33 Identities=21% Similarity=0.407 Sum_probs=27.9 Q ss_pred HCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHH Q ss_conf 709965899631711100146899999999998 Q gi|254780200|r 134 EEGCDRLLIFPLYPQYSAATTGTAQDKVFQELI 166 (343) Q Consensus 134 ~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~ 166 (343) ++|+-+|..+|||--|=++++||++.++.|-.. T Consensus 89 e~~~~ki~gVPLfsGFMYAaVGSYi~rawR~fd 121 (235) T pfam05675 89 EEGLLKIFGVPLFSGFMYASVGSYICRAWRRFD 121 (235) T ss_pred CCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHE T ss_conf 543202536553021289999999999999950 No 148 >pfam03786 UxuA D-mannonate dehydratase (UxuA). UxuA (this family) and UxuB are required for hexuronate degradation. Probab=30.11 E-value=40 Score=14.52 Aligned_cols=93 Identities=15% Similarity=0.189 Sum_probs=43.7 Q ss_pred EECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECC-CCCCCCCCHHHHHHHHHHHHC----CCCCEEEEC Q ss_conf 833675026762799888999999999997266756704786225-777654430268999989976----998399974 Q gi|254780200|r 211 SFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSR-FGRIKCLEPPTDKTVEKLAHD----GIKSLAIIT 285 (343) Q Consensus 211 SaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr-~Gp~~WL~P~t~~~l~~L~~~----G~k~v~v~p 285 (343) +.-|+|.- +.. .+..+.+.+...-+. ..+||.+- +| .--+=++.+.+++++.. -.|||-..+ T Consensus 212 ~l~GlPRI-~s~--------~ed~~~ll~~v~Sp~--NGitfC~GSlg--~~~~nDl~~~ir~f~~RI~FvH~RnV~~~~ 278 (350) T pfam03786 212 PIFGLPRI-VTN--------IEDYQRLLELVDSPY--NGITLCTGSYG--ARPDNDLPEMIREFADRIYFAHLRNIKREE 278 (350) T ss_pred CCCCCCEE-CCC--------HHHHHHHHHHCCCCC--CCEEEECCCCC--CCCCCCHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 55787633-189--------999999998479941--26787114566--788899999999984310245430111358 Q ss_pred C--HHH-HCCHHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 4--054-2212558887589999999759975898 Q gi|254780200|r 286 P--GFS-SDCLETSYEIAHEAKEIFVNGGGEKFTQ 317 (343) Q Consensus 286 ~--gFv-sD~lETl~Eidie~~e~~~e~Gg~~~~r 317 (343) . +|. ++|+|=-.|+---.+. ..+.|...+.| T Consensus 279 ~~~~F~E~~hl~G~~DM~~v~ka-l~~~~~~gp~R 312 (350) T pfam03786 279 GPKDFYETAHLEGSTDMASVMKA-YHEEGFRGPMR 312 (350) T ss_pred CCCCEEECCCCCCCCHHHHHHHH-HHHHCCCCCCC T ss_conf 99984753666788309999999-99809998534 No 149 >TIGR02810 agaZ_gatZ D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit; InterPro: IPR012062 Escherichia coli and other enteric bacteria contain two closely related D-tagatose 1,6-bisphosphate (TagBP)-specific aldolases involved in catabolism of galactitol (genes gatY gatZ) and of N-acetyl-galactosamine and D-galactosamine (genes kbaY, kbaZ, also called agaY, agaZ). The catalytic subunits GatY/KbaY alone are sufficient to show aldolase activity and contain most or all of the residues that have been identified as essential in substrate/product recognition and catalysis for class II aldolases , . However, these aldolases differ from other Class II aldolases (which are homodimeric enzymes) in that they require subunits GatZ/KbaZ for full activity and for good in vivo and in vitro stability. The Z subunits alone do not show any aldolase activity . It should be noted that the previous suggestion of a tagatose 6P-kinase function for AgaZ and other members of this family turned out to be erroneous , .; GO: 0009024 tagatose-6-phosphate kinase activity, 0019402 galactitol metabolic process. Probab=30.06 E-value=40 Score=14.52 Aligned_cols=257 Identities=20% Similarity=0.218 Sum_probs=134.0 Q ss_pred CCCCCC--------CCCCEEEEEE-------CCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHH-C---- Q ss_conf 889987--------7786799997-------1788899998899999998748954411786888998766651-2---- Q gi|254780200|r 9 RNHPKV--------KFGKIGVLLV-------NLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILFGYIL-N---- 68 (343) Q Consensus 9 ~~~p~~--------~~~K~gvLL~-------n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~-~---- 68 (343) |+||.| .-.++-||+= .+||=...|..|=+.|-..+ -|+ +++|+- +.||.|==| | T Consensus 20 SAHPlVlEAA~~~A~~~~~~VLIEATsNQVNQfGGYTGMtP~DFr~fV~~I-A~~--~gfP~~--rl~LGGDHLGPN~Wq 94 (430) T TIGR02810 20 SAHPLVLEAALRRARASGTPVLIEATSNQVNQFGGYTGMTPADFRDFVETI-ADR--IGFPRE--RLILGGDHLGPNPWQ 94 (430) T ss_pred CCCHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCHHHHHHHHHHH-HHH--CCCHHH--HHHHCCCCCCCCCCC T ss_conf 787188999999887528971377311430357888799707899999999-973--157379--886067757878876 Q ss_pred CCC--------HHHHHHHHHHHCC----------CCCCCHHHHHHHHHHHHH-----HHHHH----C-------CCCCCC Q ss_conf 574--------7889999987077----------887404577759999999-----98652----2-------478430 Q gi|254780200|r 69 FRP--------SKIKHAYAKIWNT----------AKNESILRTHTRDQATNL-----AKRLE----S-------ISSIVV 114 (343) Q Consensus 69 ~R~--------~~~~~~Y~~Iw~~----------~gg~SPL~~~t~~qa~~L-----~~~L~----~-------~~~~~V 114 (343) -++ ..-.+.|-+=|+. .|.++||-.-|-.-.++- ++... + +++||| T Consensus 95 ~lpA~~AM~~A~~lv~aYv~AGF~KIHLDaSM~CagDP~PL~D~t~A~RAArLc~VAE~a~~~~~ge~~p~YVIGTEVPv 174 (430) T TIGR02810 95 HLPAEEAMEKAEALVKAYVEAGFTKIHLDASMSCAGDPIPLPDATIAERAARLCAVAEAAATEERGETKPVYVIGTEVPV 174 (430) T ss_pred CCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEECC T ss_conf 77679999999999999998588501011664578878663605789999999999999987422788850365102248 Q ss_pred HHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHH--HH-------CCCCCCCCCCCCCCCCH Q ss_conf 0112104542478999998709965899631711100146899999999998--71-------79986421334557800 Q gi|254780200|r 115 DWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELI--HM-------RWSPSLRTVPPYYEDSD 185 (343) Q Consensus 115 ~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~--~~-------~~~~~~~~I~~~~~~p~ 185 (343) ==|-.=|.- |+.+.|.... +-+--|+ +..+|-.+.++.|...- .. -..|.+.| ||.. T Consensus 175 PGGetGGDa---DA~e~l~~he-----l~VT~Pe-aA~~Tl~~HR~AF~~~GL~~aw~RVialVVQPGVEF-----dh~~ 240 (430) T TIGR02810 175 PGGETGGDA---DALEALQEHE-----LAVTTPE-AARETLEAHRKAFAARGLEDAWPRVIALVVQPGVEF-----DHSN 240 (430) T ss_pred CCCCCCCCH---HHHHHHHCCC-----CCCCCHH-HHHHHHHHHHHHHHHHCCHHHHHCEEEEEECCCCCC-----CCCC T ss_conf 878887650---4676653047-----7888888-899999999999987171223111033442484215-----9873 Q ss_pred ---HHHHHHHHHHHHHHHCCCCCCEEEEEECCC----CHHH---HHH------------------------------CCC Q ss_conf ---689999999999973389986699983367----5026---762------------------------------799 Q gi|254780200|r 186 ---YISALAQSIREHFESIQWTPEMLLVSFHQM----PVSY---LLK------------------------------GDP 225 (343) Q Consensus 186 ---yI~a~~~~I~~~l~~~~~~~~~llfSaHgl----P~~~---~~~------------------------------gDp 225 (343) |=..-++.+.+.+++. ..+||=|||. |..+ |+. +++ T Consensus 241 V~~Y~p~~A~~LS~~i~~~----p~lVFEAHSTDYQT~~Al~~LV~DHFAILKVGPaLTFALREALFAL~~Ie~eL~p~~ 316 (430) T TIGR02810 241 VIDYQPERAQALSQVIEDT----PTLVFEAHSTDYQTAAALRALVRDHFAILKVGPALTFALREALFALAKIEDELVPAE 316 (430) T ss_pred CCCCCHHHHHHHHHHHCCC----CCEEEEECCCCCCCHHHHHHHHHCCCCEECCCHHHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 2148978999999986268----975887447656777899999763885102560235999999999999998527964 Q ss_pred CHHH-HHHHHHHHHHHHCCCCCCCEEEEEC------------CCCCCC--CCCHHHHHHHHHHHHC--CCCCEEEECCHH Q ss_conf 8889-9999999999726675670478622------------577765--4430268999989976--998399974405 Q gi|254780200|r 226 YGCH-CHKTARLLKEFLSWPDDRFKICFQS------------RFGRIK--CLEPPTDKTVEKLAHD--GIKSLAIITPGF 288 (343) Q Consensus 226 Y~~q-~~~t~~~i~~~l~~~~~~~~l~fQS------------r~Gp~~--WL~P~t~~~l~~L~~~--G~k~v~v~p~gF 288 (343) =..+ |.++.+.||-. - +..|.==|++ =..+.. |-.|.+.+..++|... | + =+|-+- T Consensus 317 ~~~~rL~~v~E~vMl~--~-P~~W~~YY~G~~~~q~L~r~ySySDRIRYYWP~P~~~~Av~~L~~nL~~-~---~iPl~L 389 (430) T TIGR02810 317 ARSHRLREVIEQVMLD--E-PGYWKKYYHGDAAEQRLDRHYSYSDRIRYYWPHPRIAAAVETLMANLDG-Q---PIPLTL 389 (430) T ss_pred HHHCCCHHHHHHHHHH--H-HHHHCCCCCCCHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHCCC-C---CCCHHH T ss_conf 3201457999999886--1-4202026898888999986077213533579707899999999985378-8---896477 Q ss_pred HHCCHHH Q ss_conf 4221255 Q gi|254780200|r 289 SSDCLET 295 (343) Q Consensus 289 vsD~lET 295 (343) +|=.|=- T Consensus 390 lSQYLP~ 396 (430) T TIGR02810 390 LSQYLPE 396 (430) T ss_pred HHHHHHH T ss_conf 9998589 No 150 >pfam01888 CbiD CbiD. CbiD is essential for cobalamin biosynthesis in both S. typhimurium and B. megaterium, no functional role has been ascribed to the protein. The CbiD protein has a putative S-AdoMet binding site. It is possible that CbiD might have the same role as CobF in undertaking the C-1 methylation and deacylation reactions required during the ring contraction process. Probab=29.75 E-value=41 Score=14.48 Aligned_cols=77 Identities=18% Similarity=0.302 Sum_probs=46.7 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCC Q ss_conf 33455780068999999999997338998669998336750267627998889999999999972667567047862257 Q gi|254780200|r 177 VPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRF 256 (343) Q Consensus 177 I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~ 256 (343) |.+ ++++.|.+.+...++.+... ..++++|+-=+.- -+..++.++++.+.+- +. T Consensus 174 V~P-~S~~a~~~si~~~l~v~~a~---g~~~lvl~~G~~g-----------------~~~a~~~~~l~~~~~V-----~~ 227 (261) T pfam01888 174 VRP-MSTEAYKASLAQQIDVAAAS---GYQRLVFVPGNIG-----------------EKYARQQFGVPDDAFV-----QM 227 (261) T ss_pred EEE-CCCHHHHHHHHHHHHHHHHC---CCCEEEECCCHHH-----------------HHHHHHHCCCCHHHEE-----EE T ss_conf 985-78499999999999999987---9796999448089-----------------9999986298848879-----95 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCCCEEEEC Q ss_conf 77654430268999989976998399974 Q gi|254780200|r 257 GRIKCLEPPTDKTVEKLAHDGIKSLAIIT 285 (343) Q Consensus 257 Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p 285 (343) | -|+++ +|+..++.|+++|+++- T Consensus 228 g--n~iG~----~L~~A~~~g~~~ill~G 250 (261) T pfam01888 228 A--NFVGF----MLDEAKKVGVEEILLVG 250 (261) T ss_pred E--HHHHH----HHHHHHHCCCCEEEEEE T ss_conf 2--72799----99999986999899974 No 151 >cd01714 ETF_beta The electron transfer flavoprotein (ETF) serves as a specific electron acceptor for various mitochondrial dehydrogenases. ETF transfers electrons to the main respiratory chain via ETF-ubiquinone oxidoreductase. ETF is an heterodimer that consists of an alpha and a beta subunit which binds one molecule of FAD per dimer . A similar system also exists in some bacteria. The homologous pair of proteins (FixA/FixB) are essential for nitrogen fixation. The beta subunit protein is distantly related to and forms a heterodimer with the alpha subunit. Probab=29.68 E-value=41 Score=14.47 Aligned_cols=51 Identities=16% Similarity=0.304 Sum_probs=35.5 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 04577759999999986522478430011210454247899999870996589963 Q gi|254780200|r 89 SILRTHTRDQATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 89 SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lP 144 (343) +|.-.+.-+.|-.++++.+ +.+ .+...|++..+++|++..+-|+|+.+++. T Consensus 34 n~~D~~AlE~Alrlke~~g--~~V---tvlsvGp~~a~~~Lr~alAmGaD~ai~i~ 84 (202) T cd01714 34 NPYDEYAVEEALRLKEKYG--GEV---TVVSMGPPQAEEALREALAMGADRAILVS 84 (202) T ss_pred CCCCHHHHHHHHHHHHCCC--CEE---EEEEECCHHHHHHHHHHHHHCCCEEEEEC T ss_conf 9543899999999866039--889---99993747889999999970898359980 No 152 >TIGR03624 conserved hypothetical protein. Members of this protein family have a phylogenetic distribution skewed toward the Actinobacteria (high GC Gram-positive bacteria), but with a few members occuring in the archaea. The function is unknown. Probab=29.63 E-value=11 Score=18.33 Aligned_cols=48 Identities=21% Similarity=0.304 Sum_probs=28.3 Q ss_pred HHHHHHHHHHHHHCCCCCCCE-------EEEECCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 999999999997266756704-------786225777654430268999989976 Q gi|254780200|r 229 HCHKTARLLKEFLSWPDDRFK-------ICFQSRFGRIKCLEPPTDKTVEKLAHD 276 (343) Q Consensus 229 q~~~t~~~i~~~l~~~~~~~~-------l~fQSr~Gp~~WL~P~t~~~l~~L~~~ 276 (343) -+..+....++.|+++.+++. .+.|..|+..|||..++.+.++++++. T Consensus 136 lV~~NI~~~~~~l~v~~~d~rlwlalhE~ah~~lF~~vPWLr~~l~~~i~~~~~~ 190 (345) T TIGR03624 136 LVPPNIVAVERGLGVPPDDFRLWVALHEVTHRRQFRAVPWLRDHLEGAVEELAAG 190 (345) T ss_pred EHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf 3057899999873999999999999999999999961489999999999999874 No 153 >cd06425 M1P_guanylylT_B_like_N N-terminal domain of the M1P-guanylyltransferase B-isoform like proteins. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain and a C-terminal Lefthanded-beta-Helix fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repression of GDP-mannose pyrophosphorylase in yeast leads to phenotypes, such as cell lysis, defective cell wall, and failure of polarized growth and cell separation. Probab=29.51 E-value=41 Score=14.45 Aligned_cols=24 Identities=21% Similarity=0.486 Sum_probs=18.3 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 045424789999987099658996 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRLLIF 143 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~ii~l 143 (343) .+.|.|+..++.|++.|++++++. T Consensus 29 ~gkpli~~~i~~l~~~gi~~iii~ 52 (233) T cd06425 29 CNKPMIEHQIEALAKAGVKEIILA 52 (233) T ss_pred CCEEHHHHHHHHHHHCCCCCCCCH T ss_conf 998699999999998599732101 No 154 >PRK10426 alpha-glucosidase; Provisional Probab=29.46 E-value=41 Score=14.45 Aligned_cols=202 Identities=12% Similarity=0.143 Sum_probs=89.6 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCC----CCCCCCC---------CCCCHHHH Q ss_conf 5424789999987099658996317111001468999999999987179986----4213345---------57800689 Q gi|254780200|r 122 KPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPS----LRTVPPY---------YEDSDYIS 188 (343) Q Consensus 122 ~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~----~~~I~~~---------~~~p~yI~ 188 (343) -|-.+..+++|++.|++ +.+-.-|.-+..+ ..++++++.++..+ -..+-+| +.+|.=.+ T Consensus 313 fPDPk~mI~eL~~~G~r--v~vwInP~i~~~s------~ly~Ea~~kGylvk~~~G~~~~~~~~~~~~~~lDfTNPeA~~ 384 (682) T PRK10426 313 YPQLDSRIKQLNEEGIQ--FLAYINPYLASDG------DLCEEAAKHGYLAKDASGGDYLVEFGEFYAGVVDLTNPEAYD 384 (682) T ss_pred CCCHHHHHHHHHHCCCE--EEEEECCCCCCCC------HHHHHHHHCCCEEECCCCCEEEEEECCCCEEEEECCCHHHHH T ss_conf 87989999999978988--9999678427898------789999878948988999975785158871688597988999 Q ss_pred HHHHHHHHHHHHCCCCC------CE--EEEEEC-CCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECC---- Q ss_conf 99999999997338998------66--999833-675026762799888999999999997266756704786225---- Q gi|254780200|r 189 ALAQSIREHFESIQWTP------EM--LLVSFH-QMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSR---- 255 (343) Q Consensus 189 a~~~~I~~~l~~~~~~~------~~--llfSaH-glP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr---- 255 (343) .+.+.|++.+.+...+. +. .=..+| |.+-... ..-|...-.++.....+..+-..+ ..+-+| T Consensus 385 W~~~~l~k~l~d~GVdgfk~DfGE~lP~D~~~~~G~~~~~~--hN~Yp~l~ak~~ye~~~~~~~~~~---~v~f~RSg~a 459 (682) T PRK10426 385 WFKEVIKKNMIGLGCSGWMADFGEYLPTDAYLHNGVSAEIM--HNAWPALWAKCNYEALEETGKLGE---ILFFMRAGYT 459 (682) T ss_pred HHHHHHHHHHHHCCCCEEECCCCCCCCCCCEEECCCCHHHH--CCHHHHHHHHHHHHHHHHHCCCCC---CEEEECCCCC T ss_conf 99999998888509828961689877775247379983241--458899999999999998436787---1699505566 Q ss_pred ----CCCCCCCCH-----HHHHHHHHHHH----CCCCCEEEEC---CHHHHCC-HHHHHHHHHHHHHHH------HHCCC Q ss_conf ----777654430-----26899998997----6998399974---4054221-255888758999999------97599 Q gi|254780200|r 256 ----FGRIKCLEP-----PTDKTVEKLAH----DGIKSLAIIT---PGFSSDC-LETSYEIAHEAKEIF------VNGGG 312 (343) Q Consensus 256 ----~Gp~~WL~P-----~t~~~l~~L~~----~G~k~v~v~p---~gFvsD~-lETl~Eidie~~e~~------~e~Gg 312 (343) .++.-|.+= ++.|-|..... .|.--+...+ -||.+.. ...-.||=+.--+.. +-+|| T Consensus 460 GsQry~~~~W~GD~~ssW~~~dgL~~~I~~gLs~glSG~~~w~~DIGGf~~~~~~~~~~EL~~RW~Q~gaF~P~~R~H~g 539 (682) T PRK10426 460 GSQKYSTLFWAGDQNVDWSLHDGLASVVPAALSLGMSGHGLHHSDIGGYTTLFNMKRSKELLLRWCEFSAFTPVMRTHEG 539 (682) T ss_pred CCCCCCCCEECCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEECCCC T ss_conf 77656553567876666764345999999987402237764455436664667899998999999998645830013689 Q ss_pred CEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 758981689879899999999999 Q gi|254780200|r 313 EKFTQVPCLNSSNLSIDLLEKITR 336 (343) Q Consensus 313 ~~~~rvp~lN~~~~fi~~La~lv~ 336 (343) ......-....+++.++.++..++ T Consensus 540 ~~p~~pw~~~~~~e~~~~~~k~~~ 563 (682) T PRK10426 540 NRPGDNWQFDGDAETIAHFARMTT 563 (682) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 999887556798789999999999 No 155 >pfam02571 CbiJ Precorrin-6x reductase CbiJ/CobK. This family consists of Precorrin-6x reductase EC:1.3.1.54. This enzyme catalyses the reaction: precorrin-6Y + NADP(+) <= precorrin-6X + NADPH. CbiJ and CobK both catalyse the reduction of macocycle in the colbalmin biosynthesis pathway. Probab=29.42 E-value=41 Score=14.44 Aligned_cols=48 Identities=27% Similarity=0.352 Sum_probs=22.9 Q ss_pred HHHHHHH-HHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 4789999-987099658996317111001468999999999987179986421334 Q gi|254780200|r 125 VKEIINN-LREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVPP 179 (343) Q Consensus 125 i~~~l~~-l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~~ 179 (343) -++.+.+ +++.+++-||= --.| |+..-+..+. ++.... ..|=+||.++ T Consensus 52 ~~~~m~~~i~~~~i~~vID-ATHP-fA~~is~na~----~a~~~~-~ipyiR~eRp 100 (246) T pfam02571 52 GADGLAAYLREEGIDAVID-ATHP-FAAQISRNAA----AACKEL-GVPLLRLERP 100 (246) T ss_pred CHHHHHHHHHHCCCCEEEE-CCCC-CHHHHHHHHH----HHHHHH-CCCEEEECCC T ss_conf 9999999999779979998-9999-6899999999----999985-9968996062 No 156 >PRK13112 consensus Probab=29.36 E-value=42 Score=14.44 Aligned_cols=221 Identities=15% Similarity=0.159 Sum_probs=99.7 Q ss_pred CCEE-EEEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHH Q ss_conf 8679-9997178889999889999999874-8954411786888998766651257478899999870778874045777 Q gi|254780200|r 17 GKIG-VLLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTH 94 (343) Q Consensus 17 ~K~g-vLL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~ 94 (343) +|.| |..+..|-|+-.+ -.+.+..+- +.-++|++.-++--|+--| +-++..+++-- .+|.-+... T Consensus 16 ~r~ali~yitaG~P~~~~---s~~~l~~l~~~GaDiiElGiPFSDPvADG-------PvIQ~A~~rAL---~~G~~~~~~ 82 (279) T PRK13112 16 GRPALVTYFMGGDPDLET---SLKIMKALPKAGADIIELGMPFSDPMADG-------PAIQAAGLRAL---KAGQTLAKT 82 (279) T ss_pred CCCEEEEEEECCCCCHHH---HHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH---HCCCCHHHH T ss_conf 995589886073899789---99999999877999899789989866657-------99999999999---769968899 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHCCCCC----HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 599999999865224784300112104542----4789999987099658996317111001468999999999987179 Q gi|254780200|r 95 TRDQATNLAKRLESISSIVVDWAMRYGKPS----VKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRW 170 (343) Q Consensus 95 t~~qa~~L~~~L~~~~~~~V~~amry~~P~----i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~ 170 (343) ..+...+.+.- .+.++ +-|-|++|. ++.-++++++.|++-+++ |=.|.-- .+++.+.+.+.+. T Consensus 83 -~~~~~~ir~~~---~~~Pi-vlM~Y~N~i~~~G~e~F~~~~~~aGvdGvIi-pDLP~eE-------~~~~~~~~~~~~i 149 (279) T PRK13112 83 -LYLAREFRKDD---DTTPI-VLMGYYNPIYIYGVERFLTDAKAAGVDGLIV-VDLPPEM-------DAELCIPAMKAGI 149 (279) T ss_pred -HHHHHHHHCCC---CCCCE-EEEEECHHHHHHCHHHHHHHHHHCCCCEEEE-CCCCHHH-------HHHHHHHHHHCCC T ss_conf -99999851348---99887-9985124998847999999999739987984-6999788-------8999999985783 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEE Q ss_conf 98642133455780068999999999997338998669998336750267627998889999999999972667567047 Q gi|254780200|r 171 SPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKI 250 (343) Q Consensus 171 ~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l 250 (343) ..+.++.+--. .++|+.-.+... ......|--|+-=. ...-..++.+..+.|.+.... +..+ T Consensus 150 -~~I~lvaPtt~--------~eRi~~i~~~s~--GFiY~Vs~~GvTG~----~~~~~~~~~~~i~~ik~~t~~---Pv~v 211 (279) T PRK13112 150 -NFIRLATPTTD--------DKRLPKVLANTS--GFVYYVSMTGITGS----ALADTSAVGEAVARIKRHTDL---PVCV 211 (279) T ss_pred -CEEEEECCCCC--------HHHHHHHHHCCC--CCEEEEECCCCCCC----CCCCHHHHHHHHHHHHHHCCC---CCEE T ss_conf -46998258998--------999999985278--80899835666676----645648899999999971789---8767 Q ss_pred EEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHH Q ss_conf 86225777654430268999989976998399974405422125 Q gi|254780200|r 251 CFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLE 294 (343) Q Consensus 251 ~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lE 294 (343) +| |= ++.+....+. ++. .-+|+= |=+...+| T Consensus 212 GF----GI------s~~e~~~~~~-~~a-DGvIVG-SAiVk~Ie 242 (279) T PRK13112 212 GF----GV------KTPEQARAIA-AHA-DGVVVG-TAIVNALA 242 (279) T ss_pred EE----CC------CCHHHHHHHH-CCC-CEEEEC-HHHHHHHH T ss_conf 83----56------9999999997-259-999987-79999998 No 157 >PRK09029 O-succinylbenzoic acid--CoA ligase; Provisional Probab=29.31 E-value=42 Score=14.43 Aligned_cols=35 Identities=9% Similarity=0.270 Sum_probs=18.3 Q ss_pred EEEEECCCC-----CCCCCCHHHHH-HHHHHHHCCCCCEEEEC Q ss_conf 478622577-----76544302689-99989976998399974 Q gi|254780200|r 249 KICFQSRFG-----RIKCLEPPTDK-TVEKLAHDGIKSLAIIT 285 (343) Q Consensus 249 ~l~fQSr~G-----p~~WL~P~t~~-~l~~L~~~G~k~v~v~p 285 (343) .+-|..|.. .+.++.|.-.+ +|. ...+++.++|++ T Consensus 346 ~l~~~GR~~d~Ik~~G~~V~p~EIE~~L~--~hp~V~eaaVvg 386 (458) T PRK09029 346 ELTILGRLDNLFISGGENIQPEEIERVIN--QHPLVQQVFVVP 386 (458) T ss_pred EEEEEEECCCEEEECCEEECHHHHHHHHH--HCCCCCEEEEEE T ss_conf 38990453578998999999999999998--699916799995 No 158 >cd03013 PRX5_like Peroxiredoxin (PRX) family, PRX5-like subfamily; members are similar to the human protein, PRX5, a homodimeric TRX peroxidase, widely expressed in tissues and found cellularly in mitochondria, peroxisomes and the cytosol. The cellular location of PRX5 suggests that it may have an important antioxidant role in organelles that are major sources of reactive oxygen species (ROS), as well as a role in the control of signal transduction. PRX5 has been shown to reduce hydrogen peroxide, alkyl hydroperoxides and peroxynitrite. As with all other PRXs, the N-terminal peroxidatic cysteine of PRX5 is oxidized into a sulfenic acid intermediate upon reaction with peroxides. Human PRX5 is able to resolve this intermediate by forming an intramolecular disulfide bond with its C-terminal cysteine (the resolving cysteine), which can then be reduced by TRX, just like an atypical 2-cys PRX. This resolving cysteine, however, is not conserved in other members of the subfamily. In such cases Probab=29.21 E-value=42 Score=14.42 Aligned_cols=22 Identities=18% Similarity=0.370 Sum_probs=13.1 Q ss_pred HHHHHHHHHHCCCCEEEEEECC Q ss_conf 4789999987099658996317 Q gi|254780200|r 125 VKEIINNLREEGCDRLLIFPLY 146 (343) Q Consensus 125 i~~~l~~l~~~g~~~ii~lPLy 146 (343) ..+-++++++.|+++|+++..= T Consensus 53 ~i~~~~~~~~kGvd~I~~isvn 74 (155) T cd03013 53 YVENADELKAKGVDEVICVSVN 74 (155) T ss_pred HHHHHHHHHHCCCCEEEEEEEC T ss_conf 7776999997699689999809 No 159 >COG1489 SfsA DNA-binding protein, stimulates sugar fermentation [General function prediction only] Probab=29.18 E-value=42 Score=14.42 Aligned_cols=17 Identities=24% Similarity=0.366 Sum_probs=6.3 Q ss_pred CCCCCCCHHHHHHHHHH Q ss_conf 34557800689999999 Q gi|254780200|r 178 PPYYEDSDYISALAQSI 194 (343) Q Consensus 178 ~~~~~~p~yI~a~~~~I 194 (343) +.++-+|.|-+++.+.+ T Consensus 186 P~~e~Dp~fa~~l~~A~ 202 (235) T COG1489 186 PNREIDPKFAELLREAI 202 (235) T ss_pred CCCCCCHHHHHHHHHHH T ss_conf 33014988999999999 No 160 >PRK13117 consensus Probab=28.90 E-value=42 Score=14.38 Aligned_cols=221 Identities=14% Similarity=0.103 Sum_probs=101.0 Q ss_pred CCEEE-EEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHH-HHHHHCCCCCCCHHHH Q ss_conf 86799-997178889999889999999874-89544117868889987666512574788999-9987077887404577 Q gi|254780200|r 17 GKIGV-LLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHA-YAKIWNTAKNESILRT 93 (343) Q Consensus 17 ~K~gv-LL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~-Y~~Iw~~~gg~SPL~~ 93 (343) +|.|+ -.+.-|-|+- +.-.+++..+- +.-++|++.-++--|+--| +-+++. .+.+ .+|--+.. T Consensus 15 ~~~ali~yitaG~P~~---~~t~~~~~~l~~~GaDiiElGiPfSDP~ADG-------pvIq~A~~rAL----~~G~~~~~ 80 (268) T PRK13117 15 KEGAFVPFVTLGDPSP---ELSLKIIDTLIEAGADALELGIPFSDPLADG-------PTIQNANLRAL----AAGVTPAQ 80 (268) T ss_pred CCCEEEEEECCCCCCH---HHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHH T ss_conf 9914898872708997---9999999999966999899789988856557-------99999999998----45996999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHH----HHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 759999999986522478430011210454247----8999998709965899631711100146899999999998717 Q gi|254780200|r 94 HTRDQATNLAKRLESISSIVVDWAMRYGKPSVK----EIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMR 169 (343) Q Consensus 94 ~t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~----~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~ 169 (343) ...+.++++++. .+.++ +-|.|.+|... .-++++++.|++-+++-=|=+-. .+++.+...+.+ T Consensus 81 -~~~~~~~ir~~~---~~~pi-vlM~Y~N~i~~~G~e~F~~~~~~aGvdGvIipDLP~eE--------~~~~~~~~~~~g 147 (268) T PRK13117 81 -CFELLAKIRAKY---PTIPI-GLLLYANLVFANGIDNFYARCAEAGVDSVLIADVPVEE--------SAPFRQAAKKHG 147 (268) T ss_pred -HHHHHHHHHCCC---CCCCE-EEEECCCHHHHCCHHHHHHHHHHCCCCEEEECCCCHHH--------HHHHHHHHHHCC T ss_conf -999998850047---89877-99732628987179999999997698779857999788--------589999998679 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCE Q ss_conf 99864213345578006899999999999733899866999833675026762799888999999999997266756704 Q gi|254780200|r 170 WSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFK 249 (343) Q Consensus 170 ~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~ 249 (343) .. .+.++.+=- -.++|++-.+...+ ...+-|--|+-=. + ..-..++.+..+.|.+.-+. +.. T Consensus 148 l~-~I~lv~Ptt--------~~~Ri~~i~~~a~G--FiY~vs~~GvTG~---~-~~~~~~~~~~i~~ik~~t~~---Pv~ 209 (268) T PRK13117 148 IA-PIFICPPNA--------DDDTLRQIASLGRG--YTYLLSRAGVTGA---E-NKAAAPLNHLVEKLKEYNAP---PPL 209 (268) T ss_pred CC-EEEEECCCC--------CHHHHHHHHHHCCC--EEEEEECCCCCCC---C-CCCCHHHHHHHHHHHHCCCC---CEE T ss_conf 83-799847999--------99999999974798--5999836777889---8-66627799999999964799---869 Q ss_pred EEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHH Q ss_conf 786225777654430268999989976998399974405422125 Q gi|254780200|r 250 ICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLE 294 (343) Q Consensus 250 l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lE 294 (343) ++| |= ++.+...+..+.|..-| |+= |=+.+++| T Consensus 210 vGF----GI------s~~e~v~~~~~~~aDGv-IVG-SaiV~~i~ 242 (268) T PRK13117 210 QGF----GI------SEPEQVKAAIKAGAAGA-ISG-SAIVKIIE 242 (268) T ss_pred EEE----CC------CCHHHHHHHHHCCCCEE-EEC-HHHHHHHH T ss_conf 983----78------99999999986389989-987-89999998 No 161 >KOG3112 consensus Probab=28.83 E-value=42 Score=14.38 Aligned_cols=29 Identities=21% Similarity=0.527 Sum_probs=25.2 Q ss_pred HHHHCCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 11210454247899999870996589963 Q gi|254780200|r 116 WAMRYGKPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 116 ~amry~~P~i~~~l~~l~~~g~~~ii~lP 144 (343) ++-++..||-++.+.=++..|+.+||+|+ T Consensus 93 ~vk~~~~~F~e~l~~~~kSSG~~~VIVLS 121 (262) T KOG3112 93 WVKRHTAHFQEELVELLKSSGARRVIVLS 121 (262) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 88753168999999999846984699972 No 162 >PRK09932 glycerate kinase II; Provisional Probab=28.52 E-value=43 Score=14.34 Aligned_cols=63 Identities=14% Similarity=0.097 Sum_probs=36.5 Q ss_pred HHHHHHHHHHHHCCC----CCCC------EEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHH Q ss_conf 999999999972667----5670------478622577765443026899998997699839997440542212558887 Q gi|254780200|r 230 CHKTARLLKEFLSWP----DDRF------KICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEI 299 (343) Q Consensus 230 ~~~t~~~i~~~l~~~----~~~~------~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Ei 299 (343) +..-.+.|.+.+++. +.++ ++-.||.+|+.++ .+.+++++--+-|+++| |.+-+..|.+|+. T Consensus 265 l~~G~d~v~~~~~l~~~i~~aDLVITGEG~~D~Qtl~GK~p~-------gVa~~A~~~~~Pviai~-G~~~~~~~~~~~~ 336 (381) T PRK09932 265 IKPGIEIVLNAVNLEQAVQGAALVITGEGRIDSQTAGGKAPL-------GVASVAKQFNVPVIGIA-GVLGDGVEVVHQY 336 (381) T ss_pred ECCHHHHHHHHHCHHHHHCCCCEEEECCCCCCCCCCCCCHHH-------HHHHHHHHCCCCEEEEE-CCCCCCHHHHHHC T ss_conf 845699999875989884789999989987766688988599-------99999998499989996-2458887889864 Q ss_pred H Q ss_conf 5 Q gi|254780200|r 300 A 300 (343) Q Consensus 300 d 300 (343) + T Consensus 337 G 337 (381) T PRK09932 337 G 337 (381) T ss_pred C T ss_conf 9 No 163 >smart00855 PGAM Phosphoglycerate mutase family. Phosphoglycerate mutase (PGAM) and bisphosphoglycerate mutase (BPGM) are structurally related enzymes that catalyse reactions involving the transfer of phospho groups between the three carbon atoms of phosphoglycerate PUBMED:2847721, PUBMED:2831102, PUBMED:10958932. Both enzymes can catalyse three different reactions with different specificities, the isomerization of 2-phosphoglycerate (2-PGA) to 3-phosphoglycerate (3-PGA) with 2,3-diphosphoglycerate (2,3-DPG) as the primer of the reaction, the synthesis of 2,3-DPG from 1,3-DPG with 3-PGA as a primer and the degradation of 2,3-DPG to 3-PGA (phosphatase activity). In mammals, PGAM is a dimeric protein with two isoforms, the M (muscle) and B (brain) forms. In yeast, PGAM is a tetrameric protein. Probab=28.52 E-value=43 Score=14.34 Aligned_cols=37 Identities=27% Similarity=0.419 Sum_probs=22.7 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHH Q ss_conf 404577759999999986522478430011210454247 Q gi|254780200|r 88 ESILRTHTRDQATNLAKRLESISSIVVDWAMRYGKPSVK 126 (343) Q Consensus 88 ~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~ 126 (343) .+||...-..|+..+.+.|.+..... +.--|+.|+.. T Consensus 23 d~~Lt~~G~~Qa~~l~~~l~~~~~~~--~~~i~sSpl~R 59 (155) T smart00855 23 DSPLTELGRAQAEALGELLASLGRLR--FDVIYSSPLLR 59 (155) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCC--CCEEEECCCHH T ss_conf 98839789999999999999736999--89899797589 No 164 >PTZ00110 helicase; Provisional Probab=28.46 E-value=43 Score=14.33 Aligned_cols=60 Identities=20% Similarity=0.229 Sum_probs=31.3 Q ss_pred HHHHHHHHHHHHHHHCCC-CCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHH Q ss_conf 775999999998652247-84300112104542478999998709965899631711100146899999999 Q gi|254780200|r 93 THTRDQATNLAKRLESIS-SIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQ 163 (343) Q Consensus 93 ~~t~~qa~~L~~~L~~~~-~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~ 163 (343) .=|+++|..+.+.+...+ ...+..+.-||--.+.+-+.+| +.|++=|| +|-|-.++.+.+ T Consensus 262 aPTRELA~QI~~e~~~~~~~~~ir~~~i~GG~~~~~Q~~~L-~~G~dIvV----------ATPGRLiDlL~~ 322 (602) T PTZ00110 262 APTRELAEQIREQALQFGRSSKLKNSVAYGGVPKRFQTYAL-RRGVEILI----------ACPGRLIDFLES 322 (602) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHH-CCCCCEEE----------ECCHHHHHHHHC T ss_conf 38399999999999997154785499997996879999987-16999999----------792389999964 No 165 >KOG0488 consensus Probab=28.43 E-value=11 Score=18.46 Aligned_cols=67 Identities=27% Similarity=0.111 Sum_probs=38.1 Q ss_pred EEEEECCCCHHHHHHCCCCHHHHHHH-HHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 99983367502676279988899999-9999997266756704786225777654430268999989976998 Q gi|254780200|r 208 LLVSFHQMPVSYLLKGDPYGCHCHKT-ARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIK 279 (343) Q Consensus 208 llfSaHglP~~~~~~gDpY~~q~~~t-~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k 279 (343) -+||-|-|=. +++-.-|+..+-.- =..+|+.||+.+.+..+-||+| +.||=.= +....+.+.+.|.+ T Consensus 177 TaFT~~Ql~~--LEkrF~~QKYLS~~DR~~LA~~LgLTdaQVKtWfQNR--RtKWKrq-~a~g~~~~~~~~~~ 244 (309) T KOG0488 177 TAFSDHQLFE--LEKRFEKQKYLSVADRIELAASLGLTDAQVKTWFQNR--RTKWKRQ-TAEGGELLYQAGNS 244 (309) T ss_pred HHHHHHHHHH--HHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHH--HHHHHHH-HHHHHCCCCCCCCC T ss_conf 1212889999--9998877502567889999998299255688877603--6788777-88652111136666 No 166 >KOG0884 consensus Probab=28.34 E-value=43 Score=14.32 Aligned_cols=15 Identities=40% Similarity=0.246 Sum_probs=12.6 Q ss_pred CHHHHCCHHHHHHHH Q ss_conf 405422125588875 Q gi|254780200|r 286 PGFSSDCLETSYEIA 300 (343) Q Consensus 286 ~gFvsD~lETl~Eid 300 (343) .|=|.|.+|||.||+ T Consensus 118 fgkvidg~etldele 132 (161) T KOG0884 118 FGKVIDGLETLDELE 132 (161) T ss_pred EEEECCCHHHHHHHH T ss_conf 410002155477885 No 167 >TIGR02494 PFLE_PFLC glycyl-radical enzyme activating protein family; InterPro: IPR012839 This subset of the radical-SAM domain includes a number of probable activating proteins acting on different enzymes all requiring an amino-acid-centred radical. The closest relatives to this family are the pyruvate-formate lyase activating enzyme (PflA, 1.97.1.4 from EC, IPR012838 from INTERPRO) and the anaerobic ribonucleotide reductase activating enzyme (IPR012837 from INTERPRO). Included within this subfamily are activators of hydroxyphenyl acetate decarboxylase (HdpA, ), benzylsuccinate synthase (BssD, ), gycerol dehydratase (DhaB2, ) as well as enzymes annotated in Escherichia coli as activators of different isozymes of pyruvate-formate lyase (PFLC and PFLE) however, these appear to lack characterisation and may activate enzymes with distinctive functions. Most of the sequence-level variability between these forms is concentrated within an N-terminal domain, which follows a conserved group of three cysteines and contains a variable pattern of 0 to 8 additional cysteines.. Probab=28.30 E-value=43 Score=14.32 Aligned_cols=128 Identities=13% Similarity=0.117 Sum_probs=66.3 Q ss_pred HHHHHHCC--CCCCCHHHHHHHHHHHHHHHHHHCCC-CCCCH-HHH------HCCCCCHHHHHHHHHH---CCCCEEEEE Q ss_conf 99987077--88740457775999999998652247-84300-112------1045424789999987---099658996 Q gi|254780200|r 77 AYAKIWNT--AKNESILRTHTRDQATNLAKRLESIS-SIVVD-WAM------RYGKPSVKEIINNLRE---EGCDRLLIF 143 (343) Q Consensus 77 ~Y~~Iw~~--~gg~SPL~~~t~~qa~~L~~~L~~~~-~~~V~-~am------ry~~P~i~~~l~~l~~---~g~~~ii~l 143 (343) .|++=|+. ..||=||..--. +..|-++..+.+ +.-|+ -|+ --.-|++.=.|-++|. ...++++.. T Consensus 125 fY~~SGGGvTlSGGEPl~Q~eF--~~~LL~~c~~~gihTAvET~gft~~~~~~~~~~~~DLfL~DiK~~D~~~H~~~tG~ 202 (305) T TIGR02494 125 FYRNSGGGVTLSGGEPLLQPEF--ALALLKACRERGIHTAVETSGFTPWETIEKVLPYVDLFLYDIKHLDDERHKEVTGV 202 (305) T ss_pred HHHHCCCEEEECCCCCCCCHHH--HHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCC T ss_conf 6651399067348711401589--99999999758994676055688889999888877699872641180120553389 Q ss_pred ECCHHHHHHHHHHHHHHHHHHHHHH-----CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCH Q ss_conf 3171110014689999999999871-----7998642133455780068999999999997338998669998336750 Q gi|254780200|r 144 PLYPQYSAATTGTAQDKVFQELIHM-----RWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPV 217 (343) Q Consensus 144 PLyPqyS~sTtgS~~~~~~~~l~~~-----~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~ 217 (343) . -.-+++.+.+.+..- ....++-+|+.|-|++.=|+|+++-+++.... +....+ |+.||++-. T Consensus 203 ~---------N~~IL~NL~~L~~~~~~GG~~v~iR~PvIpG~Nds~~~i~a~~~f~~~~~~~-N~~~i~-LLPyH~lG~ 270 (305) T TIGR02494 203 D---------NELILENLEALLAAGKDGGKNVVIRIPVIPGFNDSEENIEAIAEFLRKLLPG-NVKEID-LLPYHRLGE 270 (305) T ss_pred C---------HHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHHCC-CCCEEE-ECCCHHHHH T ss_conf 8---------3789999999997178899558998720489898989999999999985178-976567-448605567 No 168 >PRK07238 bifunctional RNase H/acid phosphatase; Provisional Probab=28.27 E-value=43 Score=14.31 Aligned_cols=97 Identities=12% Similarity=-0.010 Sum_probs=41.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHH--- Q ss_conf 89999999999972667567047862257776544302689999899769983999744054221255888758999--- Q gi|254780200|r 228 CHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAK--- 304 (343) Q Consensus 228 ~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~--- 304 (343) ..|.+|++.|++.+|++-. ..-.+.=. .-+.|=+=...+..++.+++= +.- .-.|.|....=|++.|+.-... T Consensus 231 ~RA~eTAe~Ia~~lgl~v~-~d~~LrE~-dfG~WEG~t~~Ei~~~~Pe~~-~~W-~~dp~~~pPgGES~~~v~~Rv~~al 306 (375) T PRK07238 231 QRAYDTAEAAAKALGLDVT-VDDDLIET-DFGAWEGLTFAEAAERDPELH-RRW-LADTSVAPPGGESFDDVARRVRRAR 306 (375) T ss_pred HHHHHHHHHHHHHCCCCCE-ECHHCCCC-CCCCCCCCCHHHHHHHCHHHH-HHH-HCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 9999999999985199927-87310137-888535881899987799999-987-6088999959968999999999999 Q ss_pred -HHHHHCCCCEEEEECCCCCCHHHHHHHH Q ss_conf -9999759975898168987989999999 Q gi|254780200|r 305 -EIFVNGGGEKFTQVPCLNSSNLSIDLLE 332 (343) Q Consensus 305 -e~~~e~Gg~~~~rvp~lN~~~~fi~~La 332 (343) ++..++.|++...| +|-.-|.+|. T Consensus 307 ~~i~~~h~G~tVlVV----sHggvIR~ll 331 (375) T PRK07238 307 DRIIAEYPGATVLVV----SHVTPIKTLL 331 (375) T ss_pred HHHHHHCCCCCEEEE----ECHHHHHHHH T ss_conf 999987899939999----6859999999 No 169 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=28.18 E-value=43 Score=14.30 Aligned_cols=23 Identities=22% Similarity=0.638 Sum_probs=17.4 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 45424789999987099658996 Q gi|254780200|r 121 GKPSVKEIINNLREEGCDRLLIF 143 (343) Q Consensus 121 ~~P~i~~~l~~l~~~g~~~ii~l 143 (343) +.|.++-+++.|++.|++++++. T Consensus 31 gkPii~~~l~~L~~~Gv~eivi~ 53 (358) T COG1208 31 GKPLIEYVLEALAAAGVEEIVLV 53 (358) T ss_pred CEEHHHHHHHHHHHCCCCEEEEE T ss_conf 86459999999998799789998 No 170 >PRK00050 mraW S-adenosyl-methyltransferase MraW; Provisional Probab=28.08 E-value=44 Score=14.29 Aligned_cols=17 Identities=18% Similarity=0.438 Sum_probs=14.3 Q ss_pred CEEEEEECCCCHHHHHH Q ss_conf 66999833675026762 Q gi|254780200|r 206 EMLLVSFHQMPVSYLLK 222 (343) Q Consensus 206 ~~llfSaHglP~~~~~~ 222 (343) -.++.|||||=.|.|++ T Consensus 235 rl~vISfHSLEDRiVK~ 251 (309) T PRK00050 235 RLSVISFHSLEDRIVKR 251 (309) T ss_pred EEEEEEECCHHHHHHHH T ss_conf 39999843325789999 No 171 >cd01019 ZnuA Zinc binding protein ZnuA. These proteins have been shown to function as initial receptors in the ABC uptake of Zn2+. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a single helix and bind their specific ligands in the cleft between these domains. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=27.99 E-value=44 Score=14.28 Aligned_cols=215 Identities=13% Similarity=0.163 Sum_probs=105.8 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHH-HHHHHCCCCC Q ss_conf 599999999865224784300112104542478999998709965899631711100146899999999-9987179986 Q gi|254780200|r 95 TRDQATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQ-ELIHMRWSPS 173 (343) Q Consensus 95 t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~-~l~~~~~~~~ 173 (343) +-.+..+|++ .++.|+.|+-+- ++++..++......+-.+..+.-...+.........+.-.. .-......-. T Consensus 43 ~p~d~~~l~~-----Adliv~~G~~lE-~wl~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~ 116 (286) T cd01019 43 RPSDARKLQE-----ADLVVWIGPDLE-AFLDKVLQGRKKGKVLTLAKLIDLKTLEDGASHGDHEHDHEHAHGEHDGHEE 116 (286) T ss_pred CHHHHHHHHC-----CCEEEEECCCCH-HHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 9999999966-----999999699724-6899999747788837853355655433444345545555555565444556 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 42133455780068999999999997338998669998336750267627998889999999999972667567047862 Q gi|254780200|r 174 LRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQ 253 (343) Q Consensus 174 ~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQ 253 (343) -..=+++|-+|.....+++.|.+.|.+.+-... .-+-++..-|..++.+.-+.+.+.+.-..++..++|. T Consensus 117 ~~~DPH~Wldp~~~~~~a~~I~~~L~~~dP~~~----------~~y~~N~~~~~~~L~~l~~~~~~~l~~~~~~~~v~~H 186 (286) T cd01019 117 GGLDPHLWLSPENAAEVAQAVAEKLSALDPDNA----------ATYAANLEAFNARLAELDATIKERLAPVKTKPFFVFH 186 (286) T ss_pred CCCCCEEECCHHHHHHHHHHHHHHHHHHCHHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEC T ss_conf 899981634847799999999999988682209----------9999999999999999999999974455652289856 Q ss_pred C-------CCC--------CCCCCCHHH---HHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEE Q ss_conf 2-------577--------765443026---8999989976998399974405422125588875899999997599758 Q gi|254780200|r 254 S-------RFG--------RIKCLEPPT---DKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKF 315 (343) Q Consensus 254 S-------r~G--------p~~WL~P~t---~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~ 315 (343) . ++| ..+=.+|+. .+..+.+.+.|+ +++++-++|-..-.|++ +.+.|+. . T Consensus 187 ~af~Yfa~~ygl~~~~~~~~~~~~eps~~~l~~l~~~ik~~~v-~~If~e~~~~~k~a~~i----------a~e~g~~-v 254 (286) T cd01019 187 DAYGYFEKRYGLTQAGVFTIDPEIDPGAKRLAKIRKEIKEKGA-TCVFAEPQFHPKIAETL----------AEGTGAK-V 254 (286) T ss_pred CCHHHHHHHCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHCCC-CEEEEECCCCHHHHHHH----------HHHCCCC-E T ss_conf 6368999866983867872246778999999999999998399-88998289893999999----------9971993-8 Q ss_pred EEECCCCC-----CHHHHHHHHHHHHH Q ss_conf 98168987-----98999999999999 Q gi|254780200|r 316 TQVPCLNS-----SNLSIDLLEKITRR 337 (343) Q Consensus 316 ~rvp~lN~-----~~~fi~~La~lv~~ 337 (343) ..+..+.. .+.+++++...+.. T Consensus 255 ~~ld~l~~~~~~~~~~Y~~~m~~n~~~ 281 (286) T cd01019 255 GELDPLGGLIELGKNSYVNFLRNLADS 281 (286) T ss_pred EEECCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 996377677888740699999999999 No 172 >TIGR01125 TIGR01125 MiaB-like tRNA modifying enzyme YliG, TIGR01125; InterPro: IPR005840 This is a family of mainly hypothetical proteins of unknown function. . Probab=27.88 E-value=44 Score=14.27 Aligned_cols=143 Identities=16% Similarity=0.204 Sum_probs=69.2 Q ss_pred HHHHHHHHHHHCCCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHH------ Q ss_conf 88999999987489544117868889987666512574788999998707788740457775999999998652------ Q gi|254780200|r 34 FFSLRRYLREFLLDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRDQATNLAKRLE------ 107 (343) Q Consensus 34 ~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~L~~~L~------ 107 (343) +.+|-.++.+..+..||.+.-+. + .+| . ++.-+-..|-..|...+|+.----+.+.+- T Consensus 103 ~~~i~~~i~~~~~~qrv~~~~~~--~----~~~--------~--~~~pg~ragve~p~ren~rhgdvf~pR~~~Tpr~YA 166 (475) T TIGR01125 103 VEEILNAIESLEPGQRVKDKLPF--K----SEI--------P--EEEPGMRAGVETPQRENTRHGDVFVPRIKLTPRHYA 166 (475) T ss_pred HHHHHHHHHHHCCCCCHHHHCCC--C----CCC--------C--CCCCCCCCCCCCCCCCCCCCCCEEECCCCCCCCCEE T ss_conf 89999999742265203553677--5----641--------1--247752012456431134556457402036877403 Q ss_pred -----CCC----CCCCHHHHH---CCCCCHHHHHHH---HHHCCCCEEEEEE---------CCH---HHHHHHHHHHHHH Q ss_conf -----247----843001121---045424789999---9870996589963---------171---1100146899999 Q gi|254780200|r 108 -----SIS----SIVVDWAMR---YGKPSVKEIINN---LREEGCDRLLIFP---------LYP---QYSAATTGTAQDK 160 (343) Q Consensus 108 -----~~~----~~~V~~amr---y~~P~i~~~l~~---l~~~g~~~ii~lP---------LyP---qyS~sTtgS~~~~ 160 (343) ++- -|....-|| =++| |++++.+ |.++|+.+|+++. ||- .|....+.+.+.+ T Consensus 167 YlKvaEGC~~~CaFCiIP~~rG~~rSR~-ie~i~~Ea~~L~~~GvKEiiliaQDTt~YG~DL~~R~~~~~~e~v~~~L~~ 245 (475) T TIGR01125 167 YLKVAEGCNRRCAFCIIPSLRGKLRSRP-IEEILKEAKRLVDQGVKEIILIAQDTTAYGVDLYTRESEFDGEQVKSKLVE 245 (475) T ss_pred EEEECCCCCCCCCEECCCCCCCCCCCCC-HHHHHHHHHHHHHCCCCEEEEEEEECCCCCCEEEECCCCCCCHHHHHHHHH T ss_conf 6870057789865213623367767768-888999999998439838999964034776411110552240145789999 Q ss_pred HHHHHHHHCCC--CCCCCCCCCCCCCHHHHHHHHH Q ss_conf 99999871799--8642133455780068999999 Q gi|254780200|r 161 VFQELIHMRWS--PSLRTVPPYYEDSDYISALAQS 193 (343) Q Consensus 161 ~~~~l~~~~~~--~~~~~I~~~~~~p~yI~a~~~~ 193 (343) +.+.|.+.... .++.++=|+.-++.-|+++++. T Consensus 246 Ll~~L~k~~G~~WiR~~YlYP~~~~~~vI~~m~~~ 280 (475) T TIGR01125 246 LLEELGKLGGIYWIRLLYLYPDELTDDVIDLMAEG 280 (475) T ss_pred HHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHCC T ss_conf 99974005896227888760888866788998638 No 173 >cd01994 Alpha_ANH_like_IV This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=27.73 E-value=44 Score=14.25 Aligned_cols=127 Identities=14% Similarity=0.177 Sum_probs=77.2 Q ss_pred HHHHHHCCCCEEEEEECCHHHHHHHH--HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99998709965899631711100146--8999999999987179986421334557800689999999999973389986 Q gi|254780200|r 129 INNLREEGCDRLLIFPLYPQYSAATT--GTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPE 206 (343) Q Consensus 129 l~~l~~~g~~~ii~lPLyPqyS~sTt--gS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~ 206 (343) +-.+.++|.+-+.++.+.|+...+.. +-..+-+.......+ .| +..+..-.+...|.+.+.+.+++. .+... + T Consensus 16 l~~a~~~g~~v~~L~~~~~~~~~~~~~H~~~~~l~~~qAealg-iP-l~~~~~~~~~~~~~~~l~~~L~~~-k~~gi--~ 90 (194) T cd01994 16 LYRALEEGHEVVALLNLTPEEGSSMMYHTVNHELLELQAEAMG-IP-LIRIEISGEEEDEVEDLKELLRKL-KEEGV--D 90 (194) T ss_pred HHHHHHCCCEEEEEEEEECCCCCEEECCCCCHHHHHHHHHHCC-CC-EEEEECCCCCCHHHHHHHHHHHHH-HHCCC--C T ss_conf 9999986992599999963998805250557899999999859-96-699966898717999999999999-97599--5 Q ss_pred EEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEE Q ss_conf 699983367502676279988899999999999726675670478622577765443026899998997699839997 Q gi|254780200|r 207 MLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAII 284 (343) Q Consensus 207 ~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~ 284 (343) .++| ||-+......-.+.+.+++|+.. + -|- |-. +..+.+.++.+.|.+.+++. T Consensus 91 ~vv~------------GdI~s~~qr~~~e~~c~~lgl~~--~--------~PL-W~~-~~~~ll~e~i~~Gf~aiiv~ 144 (194) T cd01994 91 AVVF------------GAILSEYQRTRVERVCERLGLEP--L--------APL-WGR-DQEELLREMIEAGFKAIIIK 144 (194) T ss_pred EEEE------------CCCCCHHHHHHHHHHHHHCCCEE--E--------CHH-CCC-CHHHHHHHHHHCCCEEEEEE T ss_conf 9999------------96332889999999999739888--7--------001-079-99999999998799099999 No 174 >PRK09627 oorA 2-oxoglutarate-acceptor oxidoreductase subunit OorA; Reviewed Probab=27.54 E-value=45 Score=14.23 Aligned_cols=35 Identities=14% Similarity=0.016 Sum_probs=21.8 Q ss_pred HHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHH Q ss_conf 99999975997589816898798999999999999 Q gi|254780200|r 303 AKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKITRR 337 (343) Q Consensus 303 ~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv~~ 337 (343) .+++....|......+-..|..|-..+-|.+.|++ T Consensus 340 ~~~i~~~~~~~~~~~i~k~~G~pf~~~ei~~~i~e 374 (375) T PRK09627 340 LEEIERVMQRDDFHFLGKANGRPISPSEIIAKVKE 374 (375) T ss_pred HHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHH T ss_conf 99999973898751570069985799999999961 No 175 >TIGR03571 lucif_BA3436 luciferase-type oxidoreductase, BA3436 family. This family is a distinct subgroup among members of the luciferase monooxygenase domain family. The larger family contains both FMN-binding enzymes (luciferase, alkane monooxygenase) and F420-binding enzymes (methylenetetrahydromethanopterin reductase, secondary alcohol dehydrogenase, glucose-6-phosphate dehydrogenase). Although some members of the domain family bind coenzyme F420 rather than FMN, members of this family are from species that lack the genes for F420 biosynthesis. A crystal structure, but not function, is known (but unpublished) for the member from Bacillus cereus, PDB|2B81. Probab=27.48 E-value=45 Score=14.22 Aligned_cols=36 Identities=14% Similarity=0.123 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHH Q ss_conf 026899998997699839997440542212558887 Q gi|254780200|r 264 PPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEI 299 (343) Q Consensus 264 P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Ei 299 (343) ..+.|.|+++.+.|+.+|++.....-.+..|+|+.+ T Consensus 255 ~~iie~l~~y~eaGv~~vil~~~~~~~~~~e~le~f 290 (298) T TIGR03571 255 NALIELLDALREAGVNHVALNLRPSRRPAAEVLDEL 290 (298) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHH T ss_conf 999999999997699789973788999989999999 No 176 >COG0159 TrpA Tryptophan synthase alpha chain [Amino acid transport and metabolism] Probab=27.46 E-value=45 Score=14.22 Aligned_cols=190 Identities=16% Similarity=0.227 Sum_probs=85.6 Q ss_pred EEEEEECCCCCCCCCHHHHHHHHHHHCCC-HHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 79999717888999988999999987489-54411786888998766651257478899999870778874045777599 Q gi|254780200|r 19 IGVLLVNLGTPDGHDFFSLRRYLREFLLD-KRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRD 97 (343) Q Consensus 19 ~gvLL~n~G~P~~~~~~~V~~yL~~fl~D-~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~ 97 (343) -=|-.+..|-|+ .+...++++.+..+ -++|++.-++--|+--| +.++....+=- .+|-|+ ..+.. T Consensus 18 a~i~yit~GdP~---~e~s~e~i~~L~~~GaD~iELGvPfSDPvADG-------P~Iq~A~~rAL--~~g~t~--~~~le 83 (265) T COG0159 18 ALIPYVTAGDPD---LETSLEIIKTLVEAGADILELGVPFSDPVADG-------PTIQAAHLRAL--AAGVTL--EDTLE 83 (265) T ss_pred CEEEEEECCCCC---HHHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH--HCCCCH--HHHHH T ss_conf 748889488999---89999999999867988899668888867668-------89999899999--779988--99999 Q ss_pred HHHHHHHHHHCCCCCCCHHHHHCCCCCHH----HHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 99999986522478430011210454247----89999987099658996317111001468999999999987179986 Q gi|254780200|r 98 QATNLAKRLESISSIVVDWAMRYGKPSVK----EIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPS 173 (343) Q Consensus 98 qa~~L~~~L~~~~~~~V~~amry~~P~i~----~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~ 173 (343) +.+.+.+. ..++++ .-|.|.+|--. .=++++++.|++-+++.=|=|-.+ +.+.+...+.+.. . T Consensus 84 l~~~~r~~---~~~~Pi-vlm~Y~Npi~~~Gie~F~~~~~~~GvdGlivpDLP~ee~--------~~~~~~~~~~gi~-~ 150 (265) T COG0159 84 LVEEIRAK---GVKVPI-VLMTYYNPIFNYGIEKFLRRAKEAGVDGLLVPDLPPEES--------DELLKAAEKHGID-P 150 (265) T ss_pred HHHHHHHC---CCCCCE-EEEEECCHHHHHHHHHHHHHHHHCCCCEEEECCCCHHHH--------HHHHHHHHHCCCC-E T ss_conf 99999861---899988-998701188773599999999975998798578986677--------7899999976986-7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 4213345578006899999999999733899866999833675026762799888999999999997266756704786 Q gi|254780200|r 174 LRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICF 252 (343) Q Consensus 174 ~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~f 252 (343) +.++.+=.. + ++++.-.+.. +....+.|--|.-= ++ .++...+.+-.+.|.+.-+. +.-+.| T Consensus 151 I~lvaPtt~-----~---~rl~~i~~~a--~GFiY~vs~~GvTG--~~--~~~~~~~~~~v~~vr~~~~~---Pv~vGF 212 (265) T COG0159 151 IFLVAPTTP-----D---ERLKKIAEAA--SGFIYYVSRMGVTG--AR--NPVSADVKELVKRVRKYTDV---PVLVGF 212 (265) T ss_pred EEEECCCCC-----H---HHHHHHHHHC--CCCEEEEECCCCCC--CC--CCCCHHHHHHHHHHHHHCCC---CEEEEC T ss_conf 988699999-----8---9999999747--98589996666667--77--65304699999999974489---738744 No 177 >pfam12098 DUF3574 Protein of unknown function (DUF3574). This family of proteins is functionally uncharacterized. This protein is found in bacteria and viruses. Proteins in this family are typically between 144 to 163 amino acids in length. This protein has a conserved TPRF sequence motif. Probab=27.32 E-value=27 Score=15.70 Aligned_cols=65 Identities=14% Similarity=0.172 Sum_probs=44.0 Q ss_pred EEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 786225777654430268999989976998399974405422125588875899999997599758981689 Q gi|254780200|r 250 ICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCL 321 (343) Q Consensus 250 l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~l 321 (343) ++..- |.+.|.++.+.. +..|..|-|.|+-+. ..|.-.-+.+|-..||+.|...+.+.....-|. T Consensus 38 lTv~D--a~GqW~~~~~g~----~~rE~skvv~I~~~~-~~~~~~~i~~I~~aYK~~F~Q~SVl~~~~p~cv 102 (104) T pfam12098 38 LTVFD--AYGQWRDRATGR----IVRENSKVVIIVHPD-TPAKRADIEAIREAYKQRFHQQSVLRVDQPVCV 102 (104) T ss_pred EEEEE--CCCEECCCCCCC----EEEECCEEEEEEECC-CHHHHHHHHHHHHHHHHHCCCCEEEEECCCEEE T ss_conf 07996--753571688885----020031799999089-863799999999999987667528998741672 No 178 >PRK09997 hydroxypyruvate isomerase; Provisional Probab=27.06 E-value=45 Score=14.17 Aligned_cols=85 Identities=8% Similarity=-0.013 Sum_probs=40.7 Q ss_pred HHHHHHHHHHCCCCEEEEEECCHHH--H----HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 4789999987099658996317111--0----014689999999999871799864213345578006899999999999 Q gi|254780200|r 125 VKEIINNLREEGCDRLLIFPLYPQY--S----AATTGTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHF 198 (343) Q Consensus 125 i~~~l~~l~~~g~~~ii~lPLyPqy--S----~sTtgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l 198 (343) ++.+++--++-||..|.+++=-..- + ..|....++...+.+.+.+...-+.-|++ +++|+|.=.-++...+-+ T Consensus 87 i~~ai~yA~~lg~~~i~vmaG~~~~g~~~~~~~~~~venL~~aa~~a~~~gv~l~iEplN~-~d~pgy~l~~~~~a~~ii 165 (258) T PRK09997 87 VAAAIRYARALGNKKINCLVGKTPSGFSSEQIHDTLVENLRYAANMLMKEDILLLIEPINH-FDIPGFHLTGTRQALKLI 165 (258) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECCC-CCCCCCCCCCHHHHHHHH T ss_conf 9999999998399968881135888889899999999999999999986598799986240-137888048899999999 Q ss_pred HHCCCCCCEEEE Q ss_conf 733899866999 Q gi|254780200|r 199 ESIQWTPEMLLV 210 (343) Q Consensus 199 ~~~~~~~~~llf 210 (343) +..+.++.+++| T Consensus 166 ~~v~~pnv~l~~ 177 (258) T PRK09997 166 DDVGCCNLKIQY 177 (258) T ss_pred HHCCCCCEEEEE T ss_conf 980998557640 No 179 >PRK08208 coproporphyrinogen III oxidase; Validated Probab=26.71 E-value=46 Score=14.13 Aligned_cols=117 Identities=12% Similarity=0.083 Sum_probs=53.4 Q ss_pred CCCCCHHHHHHH---HHHHHHHHHHHCCCCCCCHHHHHCCCC--CHHHHHHHHHHCCCCEE-EEEECCHHHHHHHHHH-- Q ss_conf 887404577759---999999986522478430011210454--24789999987099658-9963171110014689-- Q gi|254780200|r 85 AKNESILRTHTR---DQATNLAKRLESISSIVVDWAMRYGKP--SVKEIINNLREEGCDRL-LIFPLYPQYSAATTGT-- 156 (343) Q Consensus 85 ~gg~SPL~~~t~---~qa~~L~~~L~~~~~~~V~~amry~~P--~i~~~l~~l~~~g~~~i-i~lPLyPqyS~sTtgS-- 156 (343) .|||.|-.--.+ .+.+.+++.++- ..-.+++.+=- +| ..++-++.+++.|+++| +.+=-|-.---...|- T Consensus 103 ~GGGTPS~L~~~~l~~ll~~l~~~f~~-~~~~~EiTiE~-nP~~~~~~~l~~l~~~GvNRiSlGVQsf~~~~L~~lgR~h 180 (436) T PRK08208 103 VGGGTPTLLNIAQLEKLFFSVFDVLGV-DLWNIPKSVET-SPTTTTAEKLALLNAFGVNRVSIGVQSFHDSELHALHRRQ 180 (436) T ss_pred ECCCHHHCCCHHHHHHHHHHHHHHCCC-CCCCEEEEEEE-CCCCCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHCCCC T ss_conf 679432219999999999999985899-84671599986-6363609999999973987278741448989999846889 Q ss_pred HHHHHHHHH---HHHCC-CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEE Q ss_conf 999999999---87179-98642133455780068999999999997338998669 Q gi|254780200|r 157 AQDKVFQEL---IHMRW-SPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEML 208 (343) Q Consensus 157 ~~~~~~~~l---~~~~~-~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~l 208 (343) ..+++.+++ .+.+. ..++..|-.... .=.+.|.+.|++.++- .++|| T Consensus 181 ~~~~~~~ai~~~r~~gf~niniDLIyGlPg--Qt~~~~~~~l~~~l~l---~p~HI 231 (436) T PRK08208 181 NAASVHQALELIRAAHFPSLNIDLIYGIPG--QTHASFMESLHQALVY---RPEEL 231 (436) T ss_pred CHHHHHHHHHHHHHCCCCEEEEEEEECCCC--CCHHHHHHHHHHHHCC---CCCEE T ss_conf 999999999999981998575524436999--9999999999999827---98989 No 180 >TIGR00423 TIGR00423 conserved hypothetical protein TIGR00423; InterPro: IPR005244 This entry describes a family of conserved hypothetical proteins with no known function. . Probab=26.59 E-value=46 Score=14.11 Aligned_cols=43 Identities=19% Similarity=0.369 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHH---CC-----CCCCCHHHHHCCCCCH-HHHHHHHHHCCCC Q ss_conf 999999998652---24-----7843001121045424-7899999870996 Q gi|254780200|r 96 RDQATNLAKRLE---SI-----SSIVVDWAMRYGKPSV-KEIINNLREEGCD 138 (343) Q Consensus 96 ~~qa~~L~~~L~---~~-----~~~~V~~amry~~P~i-~~~l~~l~~~g~~ 138 (343) ..+.++++++.+ +. +..+|+++=+...=++ +++|+.||+.|.+ T Consensus 77 ~~l~~~Ik~~~pPyG~~hiHafSp~Ev~f~A~~~~~~~e~EvL~~LK~aGL~ 128 (331) T TIGR00423 77 EELFRAIKQEFPPYGDVHIHAFSPMEVYFLAKNEGLSIEKEVLKRLKKAGLD 128 (331) T ss_pred HHHHHHHHHCCCCCCCEEEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHCCC T ss_conf 9999999741789652476146868999999861897889999998885035 No 181 >TIGR00588 ogg 8-oxoguanine DNA-glycosylase (ogg); InterPro: IPR004577 All proteins in this family for which functions are known are 8-oxo-guanaine DNA glycosylases that function in base excision repair. The enzyme incises DNA at 8-oxoG residues, and excises 7,8-dihydro-8-oxoguanine from damaged DNA. It has beta-lyase activity that nicks DNA 3' to the lesion.; GO: 0008534 oxidized purine base lesion DNA N-glycosylase activity, 0006281 DNA repair, 0005634 nucleus. Probab=26.54 E-value=30 Score=15.40 Aligned_cols=60 Identities=18% Similarity=0.221 Sum_probs=32.1 Q ss_pred CCCCHH-HHHHHHHHHHHHHCCCCCCCEEEEECCC-CCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCH Q ss_conf 799888-9999999999972667567047862257-7765443026899998997699839997440542212 Q gi|254780200|r 223 GDPYGC-HCHKTARLLKEFLSWPDDRFKICFQSRF-GRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCL 293 (343) Q Consensus 223 gDpY~~-q~~~t~~~i~~~l~~~~~~~~l~fQSr~-Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~l 293 (343) |=-|++ .+.+|++.|++..+..+-.-..+|--+. -.... -+..+.|-.| .|+.-- |+||+ T Consensus 229 G~GYRA~Yi~~tar~l~ee~~~~nitsdta~LQ~ic~~~~Y--edar~~L~~l--~GVG~K-------VADCi 290 (379) T TIGR00588 229 GLGYRARYIRETARALLEEQGGRNITSDTAWLQQICKDADY--EDAREALLEL--PGVGPK-------VADCI 290 (379) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCH--HHHHHHHHCC--CCCCCH-------HHHHH T ss_conf 88754068999999988412664200235799986066886--7899997216--999704-------88888 No 182 >PRK09533 bifunctional transaldolase/phosoglucose isomerase; Validated Probab=26.24 E-value=47 Score=14.07 Aligned_cols=66 Identities=11% Similarity=0.172 Sum_probs=37.8 Q ss_pred HHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHH--HHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 210454247899999870996589963171110014689999--9999998717998642133455780068999999 Q gi|254780200|r 118 MRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQD--KVFQELIHMRWSPSLRTVPPYYEDSDYISALAQS 193 (343) Q Consensus 118 mry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~--~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~ 193 (343) |.---+-.+.-.++++++|++++|++=| -||++- -+.+...+....|+++++++ .+|.+|.+..+. T Consensus 436 ~~~~~~~~~~fa~~vr~~g~t~vvl~GM--------GGSSLaPEVia~tf~~~~g~P~L~VLDS--TDP~~V~a~~~~ 503 (950) T PRK09533 436 ELAKLADYEAFAAEVKAQGFTDAVVLGM--------GGSSLGPEVLAETFGQRDGFPKLHVLDS--TDPAQVRALEAA 503 (950) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECC--------CCCHHHHHHHHHHHCCCCCCCEEEEECC--CCHHHHHHHHHC T ss_conf 9876899999999998669876999478--------7320409999987233579970799659--898999999854 No 183 >pfam05990 DUF900 Alpha/beta hydrolase of unknown function (DUF900). This family consists of several hypothetical proteins of unknown function mostly found in Rhizobium species. Members of this family have an alpha/beta hydrolase fold. Probab=26.08 E-value=47 Score=14.05 Aligned_cols=24 Identities=13% Similarity=0.422 Sum_probs=12.3 Q ss_pred CCCCCHHHHHHHHHH-CCCCEEEEE Q ss_conf 045424789999987-099658996 Q gi|254780200|r 120 YGKPSVKEIINNLRE-EGCDRLLIF 143 (343) Q Consensus 120 y~~P~i~~~l~~l~~-~g~~~ii~l 143 (343) +..|..++.|..+.+ .|.++|.++ T Consensus 75 ~s~~~l~~~L~~l~~~~~~~~I~il 99 (230) T pfam05990 75 YSRDALERLLRYLATTPPVKRIHLI 99 (230) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 9999999999999866698748999 No 184 >COG4555 NatA ABC-type Na+ transport system, ATPase component [Energy production and conversion / Inorganic ion transport and metabolism] Probab=26.04 E-value=47 Score=14.05 Aligned_cols=26 Identities=12% Similarity=0.192 Sum_probs=19.1 Q ss_pred HHHHHHHHCCCCCCEEEEEECCCCHH Q ss_conf 99999973389986699983367502 Q gi|254780200|r 193 SIREHFESIQWTPEMLLVSFHQMPVS 218 (343) Q Consensus 193 ~I~~~l~~~~~~~~~llfSaHglP~~ 218 (343) .+.+.+.+...+.-.+|||.|.+++- T Consensus 171 ~~~dfi~q~k~egr~viFSSH~m~Ev 196 (245) T COG4555 171 KFHDFIKQLKNEGRAVIFSSHIMQEV 196 (245) T ss_pred HHHHHHHHHHCCCCEEEEECCCHHHH T ss_conf 99999998525794899961317999 No 185 >KOG3182 consensus Probab=26.01 E-value=47 Score=14.04 Aligned_cols=12 Identities=17% Similarity=0.075 Sum_probs=4.7 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999997 Q gi|254780200|r 229 HCHKTARLLKEF 240 (343) Q Consensus 229 q~~~t~~~i~~~ 240 (343) .++++++.|+.. T Consensus 134 ple~iArqI~t~ 145 (212) T KOG3182 134 PLEEIARQIVTA 145 (212) T ss_pred CHHHHHHHHHHC T ss_conf 689999999863 No 186 >pfam01795 Methyltransf_5 MraW methylase family. Members of this family are probably SAM dependent methyltransferases based on Escherichia coli mraW. This family appears to be related to pfam01596. Probab=25.81 E-value=48 Score=14.02 Aligned_cols=17 Identities=18% Similarity=0.401 Sum_probs=14.5 Q ss_pred CEEEEEECCCCHHHHHH Q ss_conf 66999833675026762 Q gi|254780200|r 206 EMLLVSFHQMPVSYLLK 222 (343) Q Consensus 206 ~~llfSaHglP~~~~~~ 222 (343) -.++.|||||=.|.|++ T Consensus 236 rl~VISFHSLEDRiVK~ 252 (310) T pfam01795 236 RLSVISFHSLEDRIVKN 252 (310) T ss_pred EEEEEEECCHHHHHHHH T ss_conf 89999854346799999 No 187 >PRK00347 sugar fermentation stimulation protein A; Reviewed Probab=25.73 E-value=48 Score=14.01 Aligned_cols=23 Identities=13% Similarity=0.356 Sum_probs=10.0 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCC Q ss_conf 77765443026899998997699 Q gi|254780200|r 256 FGRIKCLEPPTDKTVEKLAHDGI 278 (343) Q Consensus 256 ~Gp~~WL~P~t~~~l~~L~~~G~ 278 (343) |.|..=+-|...+++.+..+.|+ T Consensus 187 f~p~~~~Dp~fa~~l~~A~~~GV 209 (234) T PRK00347 187 FAPADEIDPEYAELLREAVAAGV 209 (234) T ss_pred EEECHHCCHHHHHHHHHHHHCCC T ss_conf 80864439999999999998898 No 188 >COG3384 Aromatic ring-opening dioxygenase, catalytic LigB subunit related enzyme [Amino acid transport and metabolism] Probab=25.70 E-value=48 Score=14.00 Aligned_cols=14 Identities=21% Similarity=0.365 Sum_probs=9.3 Q ss_pred EEEEEECCCCCCCC Q ss_conf 79999717888999 Q gi|254780200|r 19 IGVLLVNLGTPDGH 32 (343) Q Consensus 19 ~gvLL~n~G~P~~~ 32 (343) +-.|.+..|+|... T Consensus 9 ~p~LflshgsP~~~ 22 (268) T COG3384 9 MPALFLSHGSPMLA 22 (268) T ss_pred CCCEEECCCCCCCC T ss_conf 60115628981223 No 189 >TIGR02207 lipid_A_htrB lipid A biosynthesis lauroyl (or palmitoleoyl) acyltransferase; InterPro: IPR011920 Bacterial lipopolysachharides (LPS) are glycolipids that make up the outer monolayer of the outer membranes of most Gram-negative bacteria. Though LPS moleculesare variable, they all show the same general features: an outer polysaccharide which is attached to the lipid component, termed lipid A . The polysaccharide component consists of a variable repeat-structure polysaccharide known as the O-antigen, and a highly conserved short core oligosaccharide which connects the O-antigen to lipid A. Lipid A is a glucosamine-based phospholipid that makes up the membrane anchor region of LPS . The structure of lipid A is relatively invariant between species, presumably reflecting its fundamental role in membrane integrity. Recognition of lipid A by the innate immune system can lead to a response even at picomolar levels. In some genera, such as Neisseria and Haemophilus, lipooligosaccharides (LOS) are the predominant glycolipids . These are analogous to LPS except that they lack O-antigens, with the LOS oligosaccharide structures limited to 10 saccharide units. This entry represents a narrow clade of acyltransferases, nearly all of which transfer a lauroyl group to KDO2-lipid IV-A, a lipid A precursor; these proteins are termed lipid A biosynthesis lauroyl acyltransferase, HtrB. An exception is a closely related paralog of Escherichia coli HtrB, LpxP, which acts in cold shock conditions by transferring a palmitoleoyl rather than lauroyl group to the lipid A precursor. Members of this family are homologous to the family of acyltransferases responsible for the next step in lipid A biosynthesis .; GO: 0016747 transferase activity transferring groups other than amino-acyl groups, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=25.64 E-value=48 Score=14.00 Aligned_cols=11 Identities=36% Similarity=0.655 Sum_probs=7.2 Q ss_pred EEEEEECCCCH Q ss_conf 69998336750 Q gi|254780200|r 207 MLLVSFHQMPV 217 (343) Q Consensus 207 ~llfSaHglP~ 217 (343) .||++-|.|+. T Consensus 124 VlLv~~HfltL 134 (308) T TIGR02207 124 VLLVGVHFLTL 134 (308) T ss_pred EEEEEHHCCCH T ss_conf 89971300455 No 190 >TIGR01494 ATPase_P-type ATPase, P-type (transporting), HAD superfamily, subfamily IC; InterPro: IPR001757 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (F-, V- and A-ATPases contain rotary motors) and in the type of ions they transport , . F-ATPases (F1F0-ATPases) in mitochondria, chloroplasts and bacterial plasma membranes are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases) are primarily found in eukaryotic vacuoles, catalysing ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases) are found in Archaea and function like F-ATPases. P-ATPases (E1E2-ATPases) are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles . P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H^+, Na^+, K^+, Mg^2+, Ca^2+, Ag^+ and Ag^2+, Zn^2+, Co^2+, Pb^2+, Ni^2+, Cd^2+, Cu^+ and Cu^2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the several classes of P-type ATPases, including those that transport K^+ (3.6.3.12 from EC), Mg^2+ (3.6.3.2 from EC), Cd^2+ (3.6.3.3 from EC), Cu ^2+ (3.6.3.4 from EC), Zn^2+ (3.6.3.5 from EC), Na^+ (3.6.3.7 from EC), Ca^2+ (3.6.3.8 from EC), Na^+/K^+ (3.6.3.9 from EC), and H^+/K^+ (3.6.3.10 from EC). These P-ATPases are found in both prokaryotes and eukaryotes. More information about this protein can be found at Protein of the Month: ATP Synthases . ; GO: 0005524 ATP binding, 0015662 ATPase activity coupled to transmembrane movement of ions phosphorylative mechanism, 0006810 transport, 0016020 membrane. Probab=25.63 E-value=48 Score=14.00 Aligned_cols=114 Identities=24% Similarity=0.323 Sum_probs=60.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCCC-EEEEEECCCCHHHHHHCCC-C-HHHHHHHHHHHHHHHCCCCCCCEEEEECC- Q ss_conf 557800689999999999973389986-6999833675026762799-8-88999999999997266756704786225- Q gi|254780200|r 180 YYEDSDYISALAQSIREHFESIQWTPE-MLLVSFHQMPVSYLLKGDP-Y-GCHCHKTARLLKEFLSWPDDRFKICFQSR- 255 (343) Q Consensus 180 ~~~~p~yI~a~~~~I~~~l~~~~~~~~-~llfSaHglP~~~~~~gDp-Y-~~q~~~t~~~i~~~l~~~~~~~~l~fQSr- 255 (343) +....--=.|+.+..++.++..+++.+ +-+=.-+.-|....-||-| - ...|..+-+.+.+. +-. .--.+++.+| T Consensus 238 ~~~~~P~e~Alv~~a~~~~~~~~F~s~~krm~~i~~~~~~~~~KGa~~~~~~~~~~~~~~~~~~-~~~-G~R~l~~a~~~ 315 (478) T TIGR01494 238 YQAGSPDEKALVKFAKKILDVLPFTSERKRMSVIVRDPDGKYVKGAPEVILERCSDTEEHLEEL-ASE-GLRTLAVAYKE 315 (478) T ss_pred CCCCCHHHHHHHHHHHHHCCEECCCCCEEEEEEEEECCCCEEECCCHHHHHHHHHHHHHHHHHH-HHC-CCEEEEEEEEC T ss_conf 3247745799998654431122035540478999985896587183467899888899998876-317-87089999806 Q ss_pred --CC---CCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHH Q ss_conf --77---7654430268999989976998399974405422125588875 Q gi|254780200|r 256 --FG---RIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIA 300 (343) Q Consensus 256 --~G---p~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eid 300 (343) .| =.-=|+|++.+++++|.+.|+| |.++ +=||.||=..|. T Consensus 316 ~~lG~~~~~D~lr~~~~~~i~~L~~agi~-v~ml----TGD~~~tA~~ia 360 (478) T TIGR01494 316 ELLGLTAIEDPLREDVKETIEELKRAGIK-VWML----TGDNVETAIAIA 360 (478) T ss_pred EEEEEEEEECCCCCCHHHHHHHHHHCCCE-EEEE----ECCCHHHHHHHH T ss_conf 07899997345777268999999865998-9999----579879999999 No 191 >TIGR02208 lipid_A_msbB lipid A biosynthesis (KDO)2-(lauroyl)-lipid IVA acyltransferase; InterPro: IPR011921 Bacterial lipopolysachharides (LPS) are glycolipids that make up the outer monolayer of the outer membranes of most Gram-negative bacteria. Though LPS moleculesare variable, they all show the same general features: an outer polysaccharide which is attached to the lipid component, termed lipid A . The polysaccharide component consists of a variable repeat-structure polysaccharide known as the O-antigen, and a highly conserved short core oligosaccharide which connects the O-antigen to lipid A. Lipid A is a glucosamine-based phospholipid that makes up the membrane anchor region of LPS . The structure of lipid A is relatively invariant between species, presumably reflecting its fundamental role in membrane integrity. Recognition of lipid A by the innate immune system can lead to a response even at picomolar levels. In some genera, such as Neisseria and Haemophilus, lipooligosaccharides (LOS) are the predominant glycolipids . These are analogous to LPS except that they lack O-antigens, with the LOS oligosaccharide structures limited to 10 saccharide units. This family consists of MsbB in Escherichia coli and closely related proteins in other species. MsbB is homologous to HtrB (IPR011920 from INTERPRO) and acts immediately after it in the biosynthesis of KDO-2 lipid A (also called Re LPS and Re endotoxin). These two enzymes act after creation of KDO-2 lipid IV-A by addition of the KDO sugars.; GO: 0016747 transferase activity transferring groups other than amino-acyl groups, 0009103 lipopolysaccharide biosynthetic process, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=25.59 E-value=48 Score=13.99 Aligned_cols=18 Identities=33% Similarity=0.591 Sum_probs=11.9 Q ss_pred HHHHHHHHCCCCEEEEEE Q ss_conf 899999870996589963 Q gi|254780200|r 127 EIINNLREEGCDRLLIFP 144 (343) Q Consensus 127 ~~l~~l~~~g~~~ii~lP 144 (343) |-|++++++|-.=|+++| T Consensus 114 E~~~~~~~~G~~vIllvP 131 (307) T TIGR02208 114 EHIEEAKAEGKKVILLVP 131 (307) T ss_pred HHHHHHHHCCCCEEEEEC T ss_conf 127999962896399717 No 192 >PRK00025 lpxB lipid-A-disaccharide synthase; Reviewed Probab=25.37 E-value=49 Score=13.96 Aligned_cols=16 Identities=0% Similarity=0.152 Sum_probs=8.1 Q ss_pred CCCHHHHHHHHHHHHH Q ss_conf 8798999999999999 Q gi|254780200|r 322 NSSNLSIDLLEKITRR 337 (343) Q Consensus 322 N~~~~fi~~La~lv~~ 337 (343) |.+..-++.+.+++++ T Consensus 365 gas~raA~~Il~~l~~ 380 (382) T PRK00025 365 GAAERAAQAVLELLKQ 380 (382) T ss_pred CHHHHHHHHHHHHHHH T ss_conf 9999999999999973 No 193 >cd01451 vWA_Magnesium_chelatase Magnesium chelatase: Mg-chelatase catalyses the insertion of Mg into protoporphyrin IX (Proto). In chlorophyll biosynthesis, insertion of Mg2+ into protoporphyrin IX is catalysed by magnesium chelatase in an ATP-dependent reaction. Magnesium chelatase is a three sub-unit (BchI, BchD and BchH) enzyme with a novel arrangement of domains: the C-terminal helical domain is located behind the nucleotide binding site. The BchD domain contains a AAA domain at its N-terminus and a VWA domain at its C-terminus. The VWA domain has been speculated to be involved in mediating protein-protein interactions. Probab=25.04 E-value=49 Score=13.92 Aligned_cols=53 Identities=23% Similarity=0.343 Sum_probs=29.4 Q ss_pred HHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 9989976998399974405422125588875899999997599758981689879899999999999 Q gi|254780200|r 270 VEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKITR 336 (343) Q Consensus 270 l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv~ 336 (343) -++++++|+.-+ .++|-.+... +-.++++|...|| +|.++|.+|... |+++|+ T Consensus 125 a~~~~~~gi~~~---vId~~~~~~~-----~~~~~~LA~~~~g-~Y~~id~l~~~~-----i~~~v~ 177 (178) T cd01451 125 ARKLRARGISAL---VIDTEGRPVR-----RGLAKDLARALGG-QYVRLPDLSADA-----IASAVR 177 (178) T ss_pred HHHHHHCCCCEE---EEECCCCCCC-----HHHHHHHHHHCCC-CEEECCCCCHHH-----HHHHHC T ss_conf 999986699789---9979999767-----4899999994299-699899799889-----999872 No 194 >pfam06821 DUF1234 Alpha/Beta hydrolase family of unknown function (DUF1234). The Crystal Structure Of The Yden Gene Product from B. Subtilis has been solved. The structure shows an alpha-beta hydrolase fold suggesting an enzymatic function for these proteins. Probab=24.94 E-value=50 Score=13.91 Aligned_cols=34 Identities=3% Similarity=0.092 Sum_probs=16.5 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHH Q ss_conf 57800689999999999973389986699983367502 Q gi|254780200|r 181 YEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVS 218 (343) Q Consensus 181 ~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~ 218 (343) +++|. .+.|.+.+++.+...+ ...++.+||+-.- T Consensus 34 ~~~P~-~~~W~~~L~~~l~~~d---~~~ilVaHSLGc~ 67 (171) T pfam06821 34 WLQPV-LDDWVAALSAAVAAAP---GPVVLVAHSLGCL 67 (171) T ss_pred CCCCC-HHHHHHHHHHHHHHCC---CCEEEEEECHHHH T ss_conf 89989-9999999999997368---9859997357899 No 195 >TIGR03099 dCO2ase_PEP1 pyridoxal-dependent decarboxylase, exosortase system type 1 associated. The sequences in this family contain the pyridoxal binding domain (pfam02784) and C-terminal sheet domain (pfam00278) of a family of Pyridoxal-dependent decarboxylases. Characterized enzymes in this family decarboxylate substrates such as ornithine, diaminopimelate and arginine. The genes of the family modeled here, with the exception of those observed in certain Burkholderia species, are all found in the context of exopolysaccharide biosynthesis loci containing the exosortase/PEP-CTERM protein sorting system. More specifically, these are characteristic of the type 1 exosortase system represented by the Genome Property GenProp0652. The substrate of these enzymes may be a precursor of the carrier or linker which is hypothesized to release the PEP-CTERM protein from the exosortase enzyme. These enzymes are apparently most closely related to the diaminopimelate decarboxylase modeled by TIGR01048 Probab=24.91 E-value=45 Score=14.19 Aligned_cols=43 Identities=14% Similarity=0.241 Sum_probs=17.6 Q ss_pred CCCHHHHHHHHH----HHHHHHHHHCCCCCCCHHHHH-CCCCCHHHHHHH Q ss_conf 740457775999----999998652247843001121-045424789999 Q gi|254780200|r 87 NESILRTHTRDQ----ATNLAKRLESISSIVVDWAMR-YGKPSVKEIINN 131 (343) Q Consensus 87 g~SPL~~~t~~q----a~~L~~~L~~~~~~~V~~amr-y~~P~i~~~l~~ 131 (343) |++|+..+.... .+.+++.|+ ..+.+++||+ +..|.+-..+.+ T Consensus 23 g~TPlyVyd~~~l~~~~~~l~~a~p--~~~~i~YAvKaN~~~~il~~l~~ 70 (398) T TIGR03099 23 GGTPFYAYDRGLVSERVAALRKALP--EELAIHYAVKANPMPALLAHMAP 70 (398) T ss_pred CCCCEEEEEHHHHHHHHHHHHHHCC--CCCEEEEEEECCCCHHHHHHHHH T ss_conf 8999899669999999999998579--99849999602899999999998 No 196 >pfam12309 KBP_C KIF-1 binding protein C terminal. This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 365 and 621 amino acids in length. There is a conserved LLP sequence motif. KBP is a binding partner for KIF1Balpha that is a regulator of its transport function and thus represents a type of kinesin interacting protein. Probab=24.83 E-value=50 Score=13.90 Aligned_cols=19 Identities=11% Similarity=0.260 Sum_probs=8.0 Q ss_pred CCCCCCCCHHHHHHHHHHH Q ss_conf 3345578006899999999 Q gi|254780200|r 177 VPPYYEDSDYISALAQSIR 195 (343) Q Consensus 177 I~~~~~~p~yI~a~~~~I~ 195 (343) -..||.-.+|..-.++.++ T Consensus 152 Ak~yy~ldg~vTdhi~I~q 170 (365) T pfam12309 152 AKEYYTLDGHVTDHIEILQ 170 (365) T ss_pred HHHHHHCCCHHHHHHHHHH T ss_conf 9889300342889999999 No 197 >pfam02629 CoA_binding CoA binding domain. This domain has a Rossmann fold and is found in a number of proteins including succinyl CoA synthetases, malate and ATP-citrate ligases. Probab=24.82 E-value=50 Score=13.89 Aligned_cols=30 Identities=27% Similarity=0.336 Sum_probs=12.9 Q ss_pred CHHHHHCCC-CCHHHHHHHHHHCCCCEEEEE Q ss_conf 001121045-424789999987099658996 Q gi|254780200|r 114 VDWAMRYGK-PSVKEIINNLREEGCDRLLIF 143 (343) Q Consensus 114 V~~amry~~-P~i~~~l~~l~~~g~~~ii~l 143 (343) +..|.-+-+ ++..+++.++.+.|++-++.+ T Consensus 63 idiaii~VP~~~a~~~~~~~v~~GIk~i~nf 93 (96) T pfam02629 63 VDVAVITVPAPFAQEAIDELVDAGIKGIVNI 93 (96) T ss_pred CCEEEEEECHHHHHHHHHHHHHCCCCEEEEE T ss_conf 8789999478998999999998699899994 No 198 >COG1090 Predicted nucleoside-diphosphate sugar epimerase [General function prediction only] Probab=24.78 E-value=50 Score=13.89 Aligned_cols=24 Identities=17% Similarity=0.119 Sum_probs=11.9 Q ss_pred HHHHHCCCCEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 99997599758981689879899999999999 Q gi|254780200|r 305 EIFVNGGGEKFTQVPCLNSSNLSIDLLEKITR 336 (343) Q Consensus 305 e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv~ 336 (343) +...++|. .|.+. +.=++|++++. T Consensus 272 ~kl~~aGF-~F~y~-------dl~~AL~~il~ 295 (297) T COG1090 272 KKLEAAGF-QFQYP-------DLEEALADILK 295 (297) T ss_pred HHHHHCCC-EEECC-------CHHHHHHHHHH T ss_conf 99987798-16657-------79999999972 No 199 >PRK10302 hypothetical protein; Provisional Probab=24.69 E-value=50 Score=13.88 Aligned_cols=79 Identities=13% Similarity=0.246 Sum_probs=44.8 Q ss_pred CCCCCHHHHHHH-HHHCCCCEEEEEECCHHHHH-HHHHHHHHHHHHHHHH-------HCCCCCCCCC--CCCCCCCHHHH Q ss_conf 045424789999-98709965899631711100-1468999999999987-------1799864213--34557800689 Q gi|254780200|r 120 YGKPSVKEIINN-LREEGCDRLLIFPLYPQYSA-ATTGTAQDKVFQELIH-------MRWSPSLRTV--PPYYEDSDYIS 188 (343) Q Consensus 120 y~~P~i~~~l~~-l~~~g~~~ii~lPLyPqyS~-sTtgS~~~~~~~~l~~-------~~~~~~~~~I--~~~~~~p~yI~ 188 (343) |.++..+.++.+ |.+.|+++|+.= --|-||. .++.. ..+..+.-.+ ....+-+|+| ++..++..|++ T Consensus 137 F~k~~~e~~L~~lL~~~~v~rVi~D-sr~l~~~~~~~~~-~~daq~kKP~vPvh~~~t~~~p~VRfiG~~~~~~n~~~l~ 214 (272) T PRK10302 137 FAKGEEEQALNRGLHQRGVNRVILD-SRPVHAARPHSEA-VRDAQRKKPKVPVHAVVTATNPLVRFIGSDDMAQNRELFA 214 (272) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEC-CCCCCCCCCCCHH-HHHHHHCCCCCCEEEECCCCCCEEEEECCCCHHHHHHHHH T ss_conf 0770778999999997298757603-4233468999788-9999864999861443168975799817984223469999 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999973 Q gi|254780200|r 189 ALAQSIREHFES 200 (343) Q Consensus 189 a~~~~I~~~l~~ 200 (343) .|+++|..=+++ T Consensus 215 ~W~~kl~~W~~~ 226 (272) T PRK10302 215 VWLQKLPQWHQT 226 (272) T ss_pred HHHHHHHHHHHC T ss_conf 999999999976 No 200 >PRK13111 trpA tryptophan synthase subunit alpha; Provisional Probab=24.60 E-value=50 Score=13.87 Aligned_cols=218 Identities=17% Similarity=0.190 Sum_probs=102.9 Q ss_pred CEEEEEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHH-HHHCCCCCCCHHHHHH Q ss_conf 6799997178889999889999999874-8954411786888998766651257478899999-8707788740457775 Q gi|254780200|r 18 KIGVLLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHAYA-KIWNTAKNESILRTHT 95 (343) Q Consensus 18 K~gvLL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~-~Iw~~~gg~SPL~~~t 95 (343) |-=|-.+..|=|+-. .-.+.+..+- +.-++|++.-++--|+.-| +-+++.++ .+ .+|--+.. . T Consensus 9 ~ali~y~taG~P~~e---~~~~~~~~l~~~Gad~iEiGiPfSDP~aDG-------pvIq~a~~~AL----~~G~~~~~-~ 73 (256) T PRK13111 9 GALIPYITAGDPDLE---TSLEILKALVEAGADIIELGIPFSDPVADG-------PVIQRASLRAL----AAGVTLAD-V 73 (256) T ss_pred CEEEEEEECCCCCHH---HHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHH-H T ss_conf 668988707089989---999999999965999999788878876657-------99999999999----77996999-9 Q ss_pred HHHHHHHHHHHHCCCCCCCHHHHHCCCCC----HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99999999865224784300112104542----47899999870996589963171110014689999999999871799 Q gi|254780200|r 96 RDQATNLAKRLESISSIVVDWAMRYGKPS----VKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWS 171 (343) Q Consensus 96 ~~qa~~L~~~L~~~~~~~V~~amry~~P~----i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~ 171 (343) ..+.+.++++ .+.++ +-|.|++|. +++-++++++.|++-+++ |=.|-- -.+++.+++++.+.. T Consensus 74 f~~~~~~r~~----~~~pi-vlM~Y~N~i~~~G~e~F~~~~~~~GvdGvIi-pDLP~e-------E~~~~~~~~~~~gi~ 140 (256) T PRK13111 74 LELLREIRAK----PTIPI-VLMTYYNPIFQYGVEAFAADAAEAGVDGLII-PDLPPE-------EAEEFRAAAKKHGID 140 (256) T ss_pred HHHHHHHHCC----CCCCE-EEEEECCHHHHHCHHHHHHHHHHCCCCEEEE-CCCCHH-------HHHHHHHHHHHCCCE T ss_conf 9999998606----89988-9985030898709999999999759977981-699978-------889999999975980 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 86421334557800689999999999973389986699983367502676279988899999999999726675670478 Q gi|254780200|r 172 PSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKIC 251 (343) Q Consensus 172 ~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~ 251 (343) .+.+|.+=.. .++|++-.+...+ ...+-|--|+-= .+ ..-.....+..+.|.+.-++ +..++ T Consensus 141 -~I~lvaPtt~--------~~Ri~~i~~~s~g--fiY~vs~~GvTG---~~-~~~~~~~~~~i~~ik~~t~~---Pi~vG 202 (256) T PRK13111 141 -LIFLVAPTTT--------DERLKKIASHASG--FVYYVSRAGVTG---AR-SADAADVADLLARLKAHTDL---PVAVG 202 (256) T ss_pred -EEEEECCCCC--------HHHHHHHHHHCCC--EEEEEECCCCCC---CC-CCCHHHHHHHHHHHHHCCCC---CEEEE T ss_conf -8999699998--------8999999962698--599985677678---87-66628899999999870689---75885 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHH Q ss_conf 6225777654430268999989976998399974405422125 Q gi|254780200|r 252 FQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLE 294 (343) Q Consensus 252 fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lE 294 (343) | |= ++.+.++.+.+ |..-| |+= |=+.+++| T Consensus 203 F----GI------s~~e~v~~~~~-~aDGv-IVG-Saiv~~i~ 232 (256) T PRK13111 203 F----GI------STPEQAAAIAE-GADGV-IVG-SALVKIIE 232 (256) T ss_pred C----CC------CCHHHHHHHHC-CCCEE-EEC-HHHHHHHH T ss_conf 2----88------99999999974-59999-986-89999998 No 201 >KOG2599 consensus Probab=24.53 E-value=50 Score=13.86 Aligned_cols=175 Identities=17% Similarity=0.190 Sum_probs=99.1 Q ss_pred CHHHHHHH-HHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCC Q ss_conf 71110014-68999999999987179986421334557800689999999999973389986699983367502676279 Q gi|254780200|r 146 YPQYSAAT-TGTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGD 224 (343) Q Consensus 146 yPqyS~sT-tgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gD 224 (343) ||++-... ...-+.++.+.+.......=-.+...|-.+..|++.+++-+++. .+.+ + .+++..- |+ +-++|- T Consensus 54 Y~~~kG~~~~~~eL~dL~egl~~nn~~~Y~~vLTGY~~n~~~l~~i~~iv~~l-k~~n-p--~~~wv~D--PV-mGDnG~ 126 (308) T KOG2599 54 YAHVKGQVLNEEELEDLYEGLLLNNLNKYDAVLTGYLPNVSFLQKIADIVKKL-KKKN-P--NLTWVCD--PV-MGDNGR 126 (308) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHHH-HHCC-C--CEEEEEC--CC-CCCCCC T ss_conf 74111631689999999998764366332245530568826999999999999-8429-9--8499967--63-467851 Q ss_pred CC-HHHHHHHHHHHHHHH--CCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECC---HHHHCCHHHHHH Q ss_conf 98-889999999999972--667567047862257776544302689999899769983999744---054221255888 Q gi|254780200|r 225 PY-GCHCHKTARLLKEFL--SWPDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITP---GFSSDCLETSYE 298 (343) Q Consensus 225 pY-~~q~~~t~~~i~~~l--~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~---gFvsD~lETl~E 298 (343) -| ..+|....+.+.-.| -+.+|+|++---+ |..==.+-+...+++.|.+.|+++|+|-+. .|... ||+= T Consensus 127 lYV~eelipvYr~~i~~ladiiTPNqFE~EiLt--g~~I~t~eda~~a~~~lhq~~v~~vVITS~~~~~~~g~---~l~c 201 (308) T KOG2599 127 LYVPEELIPVYRDLIIPLADIITPNQFEAEILT--GMEIRTEEDAKRAVEKLHQKGVKTVVITSFDLGEFTGE---TLRC 201 (308) T ss_pred EECCHHHHHHHHHHHCCHHHHCCCCCHHHHHHC--CCEECCHHHHHHHHHHHHHHCCCEEEEEEEEECCCCCC---EEEE T ss_conf 744587779999863025551077611111432--87424499999999999974898799974430788885---7999 Q ss_pred HHHHHHHHHHHCCCCEEEEECCC-----CCCHHHHHHHHHHHHHH Q ss_conf 75899999997599758981689-----87989999999999998 Q gi|254780200|r 299 IAHEAKEIFVNGGGEKFTQVPCL-----NSSNLSIDLLEKITRRE 338 (343) Q Consensus 299 idie~~e~~~e~Gg~~~~rvp~l-----N~~~~fi~~La~lv~~~ 338 (343) ++-.... -.-+..+|-+ ++-+.|.++|.+.+.+. T Consensus 202 ~gs~~~~------~~f~~~ipki~~~FtGTGDLfsaLLla~~~~~ 240 (308) T KOG2599 202 IGSSCGS------ERFRYLIPKIDGVFTGTGDLFSALLLAWLHES 240 (308) T ss_pred EEECCCC------CEEEEEECCCCEEEECCCHHHHHHHHHHHHCC T ss_conf 9861688------25899922555289605189999999997427 No 202 >cd02523 PC_cytidylyltransferase Phosphocholine cytidylyltransferases catalyze the synthesis of CDP-choline. This family contains proteins similar to prokaryotic phosphocholine (P-cho) cytidylyltransferases. Phosphocholine (PC) cytidylyltransferases catalyze the transfer of a cytidine monophosphate from CTP to phosphocholine to form CDP-choline. PC is the most abundant phospholipid in eukaryotic membranes and it is also important in prokaryotic membranes. For pathogenic prokaryotes, the cell surface PC facilitates the interaction with host surface and induces attachment and invasion. In addition cell wall PC serves as scaffold for a group of choline-binding proteins that are secreted from the cells. Phosphocholine (PC) cytidylyltransferase is a key enzyme in the prokaryotic choline metabolism pathway. It has been hypothesized to consist of a choline transport system, a choline kinase, CTP:phosphocholine cytidylyltransferase, and a choline phosphotransferase that transfers P-Cho from CDP Probab=24.26 E-value=51 Score=13.82 Aligned_cols=25 Identities=36% Similarity=0.664 Sum_probs=19.7 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 0454247899999870996589963 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~ii~lP 144 (343) .+.|-++-+++.+++.|+++++++. T Consensus 27 ~gkpli~~~i~~l~~~gi~~i~iv~ 51 (229) T cd02523 27 NGKPLLERQIETLKEAGIDDIVIVT 51 (229) T ss_pred CCEEHHHHHHHHHHHCCCCCCEEEC T ss_conf 9879999999999984998536655 No 203 >TIGR01162 purE phosphoribosylaminoimidazole carboxylase, catalytic subunit; InterPro: IPR000031 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR, the sixth step of de novo purine biosynthesis. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. Some members of this family contain two copies of this domain . The crystal structure of PurE indicates a unique quaternary structure that confirms the octameric nature of the enzyme .; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process, 0009320 phosphoribosylaminoimidazole carboxylase complex. Probab=24.22 E-value=51 Score=13.82 Aligned_cols=29 Identities=24% Similarity=0.249 Sum_probs=16.5 Q ss_pred HHHHHHHHHHCCCCCCEEEEEECCCCHHH Q ss_conf 99999999733899866999833675026 Q gi|254780200|r 191 AQSIREHFESIQWTPEMLLVSFHQMPVSY 219 (343) Q Consensus 191 ~~~I~~~l~~~~~~~~~llfSaHglP~~~ 219 (343) .+.+.+.|++++.+.+.-|-|||=.|+++ T Consensus 14 m~~a~~~L~~fgi~~e~~V~SAHRTP~~~ 42 (159) T TIGR01162 14 MKKAAEILEEFGIPYELRVVSAHRTPELM 42 (159) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCCHHHH T ss_conf 99999999855996678986067780889 No 204 >cd04724 Tryptophan_synthase_alpha Ttryptophan synthase (TRPS) alpha subunit (TSA). TPRS is a bifunctional tetrameric enzyme (2 alpha and 2 beta subunits) that catalyzes the last two steps of L-tryptophan biosynthesis. Alpha and beta subunit catalyze two distinct reactions which are both strongly stimulated by the formation of the complex. The alpha subunit catalyzes the cleavage of indole 3-glycerol phosphate (IGP) to indole and d-glyceraldehyde 3-phosphate (G3P). Indole is then channeled to the active site of the beta subunit, a PLP-dependent enzyme that catalyzes a replacement reaction to convert L-serine into L-tryptophan. Probab=24.22 E-value=51 Score=13.82 Aligned_cols=186 Identities=17% Similarity=0.229 Sum_probs=83.9 Q ss_pred EEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 997178889999889999999874-8954411786888998766651257478899999870778874045777599999 Q gi|254780200|r 22 LLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRDQAT 100 (343) Q Consensus 22 LL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~ 100 (343) --+..|=|+-. .-.+++..+- +.-++|++.-++--|+.-| +-+++.+++-- .+|.-+... ..+.+ T Consensus 4 ~y~taG~P~~~---~~~~~~~~l~~~G~d~iEiGiPfsDP~aDG-------pvIq~A~~~aL---~~g~~~~~~-~~~~~ 69 (242) T cd04724 4 PYITAGDPDLE---TTLEILKALVEAGADIIELGIPFSDPVADG-------PVIQAASERAL---ANGVTLKDV-LELVK 69 (242) T ss_pred EEEECCCCCHH---HHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH---HCCCCHHHH-HHHHH T ss_conf 78737789979---999999999976999999789988877658-------99999999999---769949999-99999 Q ss_pred HHHHHHHCCCCCCCHHHHHCCCCCH----HHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9998652247843001121045424----789999987099658996317111001468999999999987179986421 Q gi|254780200|r 101 NLAKRLESISSIVVDWAMRYGKPSV----KEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRT 176 (343) Q Consensus 101 ~L~~~L~~~~~~~V~~amry~~P~i----~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~ 176 (343) ++++. .+.++ +-|.|++|.. ++-++++++.|++-+++ |=.|- .-.+.+.+.+.+.+.. .+.+ T Consensus 70 ~~r~~----~~~pi-vlM~Y~N~i~~~G~e~F~~~~~~~Gv~Gvii-pDLP~-------ee~~~~~~~~~~~~i~-~I~l 135 (242) T cd04724 70 EIRKK----NTIPI-VLMGYYNPILQYGLERFLRDAKEAGVDGLII-PDLPP-------EEAEEFREAAKEYGLD-LIFL 135 (242) T ss_pred HHHHC----CCCCE-EEEEECHHHHHHCHHHHHHHHHHCCCCEEEE-CCCCH-------HHHHHHHHHHHHCCCE-EEEE T ss_conf 98734----79888-9998445766528999999999759975870-69995-------7846899999865983-8899 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEE Q ss_conf 3345578006899999999999733899866999833675026762799888999999999997266756704786 Q gi|254780200|r 177 VPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICF 252 (343) Q Consensus 177 I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~f 252 (343) |.+=. -.++|++-.+... ....+.|.-|+-=. + . +-.....+..+.|.+..+. +..++| T Consensus 136 vsPtt--------~~~ri~~i~~~s~--gfiY~vs~~GvTG~--~-~-~~~~~~~~~i~~ik~~t~~---Pv~vGF 194 (242) T cd04724 136 VAPTT--------PDERIKKIAELAS--GFIYYVSRTGVTGA--R-T-ELPDDLKELIKRIRKYTDL---PIAVGF 194 (242) T ss_pred ECCCC--------CHHHHHHHHHHCC--CEEEEEECCCCCCC--C-C-CCHHHHHHHHHHHHHHCCC---CEEEEC T ss_conf 68988--------7899999997479--84999857777787--7-5-5649999999999871689---748743 No 205 >PRK10799 putative hydrolase-oxidase; Provisional Probab=23.80 E-value=52 Score=13.76 Aligned_cols=41 Identities=12% Similarity=0.084 Sum_probs=29.9 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 875899999997599758981689879899999999999984 Q gi|254780200|r 298 EIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLEKITRREL 339 (343) Q Consensus 298 Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La~lv~~~l 339 (343) |+.......+.+.| ..+.-+-=-++.--++++|++.+++++ T Consensus 194 d~k~h~~~~A~e~g-l~lid~GHy~tE~~gv~~L~~~L~~~f 234 (247) T PRK10799 194 EVSEQTIHSAREQG-LHFYAAGHHATERGGIRALSEWLNENT 234 (247) T ss_pred CCCHHHHHHHHHCC-CEEEECCCHHHHHHHHHHHHHHHHHHH T ss_conf 85538899999779-969975835888999999999999986 No 206 >PRK13115 consensus Probab=23.63 E-value=52 Score=13.74 Aligned_cols=214 Identities=14% Similarity=0.124 Sum_probs=100.1 Q ss_pred CCEEE-EEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH-HHHHCCCCCCCHHHH Q ss_conf 86799-997178889999889999999874-895441178688899876665125747889999-987077887404577 Q gi|254780200|r 17 GKIGV-LLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHAY-AKIWNTAKNESILRT 93 (343) Q Consensus 17 ~K~gv-LL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~Y-~~Iw~~~gg~SPL~~ 93 (343) +|.++ -.+..|-|+- +.-.+.+..+- +.-++|++.-++--|+--| +-+++.. +.+ .+|.-+.. T Consensus 22 ~k~alI~yitaG~P~~---e~t~~~i~~l~~~GaDiiElGiPFSDP~ADG-------PvIQ~A~~rAL----~~G~~~~~ 87 (269) T PRK13115 22 GRAALIGYLPAGFPDV---DTSIAAMTALVESGCDIVEVGLPYSDPVMDG-------PVIQAATEAAL----RGGVRVRD 87 (269) T ss_pred CCCEEEEEECCCCCCH---HHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHH T ss_conf 9944887852738998---9999999999966999999799988856668-------99999999999----77995999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHCCCCC----HHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 7599999999865224784300112104542----478999998709965899631711100146899999999998717 Q gi|254780200|r 94 HTRDQATNLAKRLESISSIVVDWAMRYGKPS----VKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMR 169 (343) Q Consensus 94 ~t~~qa~~L~~~L~~~~~~~V~~amry~~P~----i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~ 169 (343) .-+ +.+.+ .+ .+.++. -|.|++|. +++-++++++.|++-+++ |=.|.- -.+.+.+.+.+.+ T Consensus 88 ~f~-~v~~~----~~-~~~Piv-lM~Y~N~i~~yG~e~F~~~~~~~GvdGvIi-pDLP~e-------E~~~~~~~~~~~g 152 (269) T PRK13115 88 VFR-AVEAI----SA-AGGRAL-VMTYWNPVLRYGVDRFARDLAAAGGAGLIT-PDLIPD-------EAGEWLAASERHG 152 (269) T ss_pred HHH-HHHHH----CC-CCCCEE-EHHHHHHHHHHCHHHHHHHHHHCCCCEEEC-CCCCHH-------HHHHHHHHHHHCC T ss_conf 999-99984----15-799888-547548998736999999999739980764-789978-------9999999998658 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCE Q ss_conf 99864213345578006899999999999733899866999833675026762799888999999999997266756704 Q gi|254780200|r 170 WSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFK 249 (343) Q Consensus 170 ~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~ 249 (343) .. .+.+|.+=.. .++|+.-.+...+ .....|.-|+-= . + ..-.....+..+.|.+...+ +.. T Consensus 153 i~-~I~LvaPtt~--------~eRi~~i~~~a~G--FIY~Vs~~GvTG--~-~-~~~~~~~~~~i~~ik~~t~~---Pv~ 214 (269) T PRK13115 153 LD-RIFLVAPSST--------PERLAETVEASRG--FVYAASTMGVTG--A-R-DAVSSAAPELVARVRAASDI---PVC 214 (269) T ss_pred EE-EEEEECCCCC--------HHHHHHHHHCCCC--CEEEEECCCCCC--C-C-CCCCHHHHHHHHHHHHHCCC---CCE T ss_conf 12-8998589998--------8999999844888--089975454567--7-6-44417799999999971799---817 Q ss_pred EEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHH Q ss_conf 7862257776544302689999899769983999744054 Q gi|254780200|r 250 ICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFS 289 (343) Q Consensus 250 l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFv 289 (343) ++| |= ++.+.++++.+- -.-+|+-=+|| T Consensus 215 vGF----GI------s~~e~~~~~~~~--aDGvIVGSa~V 242 (269) T PRK13115 215 VGL----GV------SSAAQAAEIAGY--ADGVIVGSALV 242 (269) T ss_pred EEC----CC------CCHHHHHHHHHH--CCEEEECHHHH T ss_conf 972----78------999999999802--99999868999 No 207 >pfam00113 Enolase_C Enolase, C-terminal TIM barrel domain. Probab=23.39 E-value=53 Score=13.71 Aligned_cols=120 Identities=14% Similarity=0.126 Sum_probs=68.4 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCC Q ss_conf 34557800689999999999973389986699983367502676279988899999999999726675670478622577 Q gi|254780200|r 178 PPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFG 257 (343) Q Consensus 178 ~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~G 257 (343) +.+.+...+|+.+.+.+.++ . |+|- -||...+=.+.-..+.+++|- +..++ | T Consensus 130 ~~~~t~~elid~y~~l~~~y---------P-IisI----------EDpl~e~D~~gw~~lt~~lg~---~~qiV-----G 181 (296) T pfam00113 130 SKWLTSDQLADLYKELIKKY---------P-IVSI----------EDPFDEDDWEAWKKLTASLGD---KIQIV-----G 181 (296) T ss_pred CCEECHHHHHHHHHHHHHHC---------C-EEEE----------ECCCCCHHHHHHHHHHHHHCC---CEEEE-----C T ss_conf 62208999999999999869---------9-8899----------679881249999999997499---57998-----9 Q ss_pred CCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHH Q ss_conf 765443026899998997699839997440542212558887589999999759975898168987989999999 Q gi|254780200|r 258 RIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSNLSIDLLE 332 (343) Q Consensus 258 p~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~~fi~~La 332 (343) -=|--.-.+.|++-.+++.-|-+++-|.=+---.||+ ++-++++++|.....--..--+.+.||.-|| T Consensus 182 --DDL~~Tn~~rl~~gi~~~~~NaiLiK~NQiGTlset~-----e~~~~ak~~g~~~ivShRSGETeD~~iaDLA 249 (296) T pfam00113 182 --DDLTVTNPKRIAKAIEKKACNSLLLKVNQIGSVTESL-----AAVKMAKDAGWGVMVSHRSGETEDTFIADLV 249 (296) T ss_pred --CCCCCCCHHHHHHHHHCCCCCCEEEECHHHHHHHHHH-----HHHHHHHHCCCEEEEECCCCCCCCCHHHHHH T ss_conf --8755679999999985487771697121110199999-----9999999869649986698877652299988 No 208 >pfam10103 DUF2342 Uncharacterized conserved protein (DUF2342). Members of this family of bacterial hypothetical and uncharacterized proteins have no known function. Probab=23.29 E-value=22 Score=16.41 Aligned_cols=46 Identities=20% Similarity=0.213 Sum_probs=24.7 Q ss_pred HHHHHHHHHHHCCCCCCCE-------EEEECCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 9999999997266756704-------786225777654430268999989976 Q gi|254780200|r 231 HKTARLLKEFLSWPDDRFK-------ICFQSRFGRIKCLEPPTDKTVEKLAHD 276 (343) Q Consensus 231 ~~t~~~i~~~l~~~~~~~~-------l~fQSr~Gp~~WL~P~t~~~l~~L~~~ 276 (343) ..+.....+.|+++.+++. .+.|-.|...|||..++.+.+++++.. T Consensus 140 ~~NI~~~~~~l~v~~~d~rlwvalhE~aH~~lF~~~PWLr~~l~~~v~~~~~~ 192 (341) T pfam10103 140 PPNIAAVARGLGVPPDDFRLWLALHEAAHRRLFTAVPWLRDHLLGAVEEYARG 192 (341) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 67899999983999999999999999999999830828999999999999875 No 209 >PRK02269 ribose-phosphate pyrophosphokinase; Provisional Probab=23.24 E-value=53 Score=13.69 Aligned_cols=165 Identities=17% Similarity=0.193 Sum_probs=84.3 Q ss_pred HHHHHHHCCCCEEE-EEECCHHHHHHHH----H--HHHHHHHHHHHHHCCC------CCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 99999870996589-9631711100146----8--9999999999871799------86421334557800689999999 Q gi|254780200|r 128 IINNLREEGCDRLL-IFPLYPQYSAATT----G--TAQDKVFQELIHMRWS------PSLRTVPPYYEDSDYISALAQSI 194 (343) Q Consensus 128 ~l~~l~~~g~~~ii-~lPLyPqyS~sTt----g--S~~~~~~~~l~~~~~~------~~~~~I~~~~~~p~yI~a~~~~I 194 (343) .+..+++.|..+|. ++|-|| ||+-.- | -+.+.+.+.+...+.. +...-+..||+-|...=.-...+ T Consensus 75 ~idAlr~a~A~~It~ViPY~~-YaRQDr~~~~ge~isak~vA~lL~~~g~d~vitvDlH~~~i~gfF~ipv~~l~~~~~~ 153 (321) T PRK02269 75 MVDALKRASAESISVVMPYYG-YARQDRKARSREPITSKLVANMLEVAGVDRLLTVDLHAAQIQGFFDIPVDHLMGAPLI 153 (321) T ss_pred HHHHHHHCCCCEEEEEECCCC-CCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEECCCCHHHHHCCCCCCCCEEECHHH T ss_conf 999998748973999813553-1304544578997269999998874487779996267477763269984111303899 Q ss_pred HHHHHHCCCC-CCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCC--------------- Q ss_conf 9999733899-866999833675026762799888999999999997266756704786225777--------------- Q gi|254780200|r 195 REHFESIQWT-PEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGR--------------- 258 (343) Q Consensus 195 ~~~l~~~~~~-~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp--------------- 258 (343) .+++++.... ++.++.| |- .| ..+-++.+++.||.+ +.+...-|.+. T Consensus 154 ~~~i~~~~~~~~~~vvVs----PD----~G------~~~ra~~~A~~l~~~---~~~~~K~R~~~~~~~~~~~~~~gdV~ 216 (321) T PRK02269 154 ADYFDRHGLVGEDVVVVS----PD----HG------GVTRARKLAQFLKTP---IAIIDKRRSVDKMNTSEVMNIIGNVS 216 (321) T ss_pred HHHHHHHCCCCCCEEEEC----CC----CH------HHHHHHHHHHHCCCC---EEEEEEECCCCCCCCEEECCCCCCCC T ss_conf 999998479876708971----79----31------999999999862787---89999873689887434203577406 Q ss_pred --CCC-------CCHHHHHHHHHHHHCCCCCEEEECC-HHHH-CCHHHHHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf --654-------4302689999899769983999744-0542-2125588875899999997599758981689 Q gi|254780200|r 259 --IKC-------LEPPTDKTVEKLAHDGIKSLAIITP-GFSS-DCLETSYEIAHEAKEIFVNGGGEKFTQVPCL 321 (343) Q Consensus 259 --~~W-------L~P~t~~~l~~L~~~G~k~v~v~p~-gFvs-D~lETl~Eidie~~e~~~e~Gg~~~~rvp~l 321 (343) .-. .+=++.++.+.|.++|.++|.+++. |..+ +.+|-| .++|.++.....+. T Consensus 217 Gk~vIIVDDiIdTGgTl~~aa~~Lk~~GA~~V~~~~THgvfs~~A~~rl-----------~~s~i~~ivvTdTi 279 (321) T PRK02269 217 GKKCILIDDMIDTAGTICHAADALAEAGATAVYASCTHPVLSGPALDNI-----------QKSAIEKLIVLDTI 279 (321) T ss_pred CCEEEEECCHHHCHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHH-----------HCCCCCEEEEECCC T ss_conf 9769996624314266999999998489982799997802784799998-----------60899889980897 No 210 >TIGR01615 A_thal_3542 uncharacterized plant-specific domain TIGR01615; InterPro: IPR006502 This family of uncharacterised plant proteins are defined by a region found toward the C-terminus. This region is strongly conserved (greater than 30 equence identity between most pairs of members) but flanked by highly divergent regions including stretches of low-complexity sequence. . Probab=23.10 E-value=54 Score=13.67 Aligned_cols=48 Identities=21% Similarity=0.227 Sum_probs=24.4 Q ss_pred CCCHHHHHHCCCCHHHHHHHHHHHHHHH---CCCCCCCEE--EEECCCCCCCCCCHHH Q ss_conf 6750267627998889999999999972---667567047--8622577765443026 Q gi|254780200|r 214 QMPVSYLLKGDPYGCHCHKTARLLKEFL---SWPDDRFKI--CFQSRFGRIKCLEPPT 266 (343) Q Consensus 214 glP~~~~~~gDpY~~q~~~t~~~i~~~l---~~~~~~~~l--~fQSr~Gp~~WL~P~t 266 (343) +||.-+|.++|.=..-+.=.+.+...-| +.+-.+|+- --|| +||+|+. T Consensus 89 ~LP~VfVGk~~rL~~~v~l~c~a~k~SLk~~~m~~PPWR~~~YMqs-----KWlg~~~ 141 (144) T TIGR01615 89 SLPEVFVGKEERLKQLVRLVCDAAKKSLKKKGMPLPPWRKNRYMQS-----KWLGPYK 141 (144) T ss_pred HCCEEEECCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCC-----CCCCCCC T ss_conf 3994773041678899999999999878746688888666545201-----1488877 No 211 >COG4623 Predicted soluble lytic transglycosylase fused to an ABC-type amino acid-binding protein [Cell envelope biogenesis, outer membrane] Probab=22.99 E-value=54 Score=13.66 Aligned_cols=17 Identities=18% Similarity=0.016 Sum_probs=10.6 Q ss_pred CCCCCCHHHHHHHHHHHHC Q ss_conf 7654430268999989976 Q gi|254780200|r 258 RIKCLEPPTDKTVEKLAHD 276 (343) Q Consensus 258 p~~WL~P~t~~~l~~L~~~ 276 (343) |..| =++.+.|..|+++ T Consensus 385 Pd~W--~dvk~rLp~l~qk 401 (473) T COG4623 385 PDSW--FDVKKRLPLLAQK 401 (473) T ss_pred CHHH--HHHHHHHHHHHHC T ss_conf 6058--8999870788753 No 212 >TIGR02812 fadR_gamma fatty acid metabolism transcriptional regulator FadR; InterPro: IPR014178 Members of this entry are transcriptional regulators of fatty acid metabolism (FadR), including both biosynthesis and beta-oxidation reactions, which are found exclusively in a subset of Gammaproteobacteria with strictly one copy per genome. Proteins have an N-terminal DNA-binding domain and a less well conserved C-terminal long chain acyl-CoA-binding domain. FadR when expressed in Escherichia coli shows differences in regulatory response and fatty acid binding profiles. All members of this entry are expected to have the same function.; GO: 0000062 acyl-CoA binding, 0003677 DNA binding, 0003700 transcription factor activity, 0019217 regulation of fatty acid metabolic process. Probab=22.78 E-value=54 Score=13.63 Aligned_cols=122 Identities=18% Similarity=0.265 Sum_probs=59.1 Q ss_pred CHHHHHHH--HHHHCC-CCCCCHHHHHHHHHHHH----------HHHHHHCCCCC---------CCHHHHHCCCCCHHHH Q ss_conf 47889999--987077-88740457775999999----------99865224784---------3001121045424789 Q gi|254780200|r 71 PSKIKHAY--AKIWNT-AKNESILRTHTRDQATN----------LAKRLESISSI---------VVDWAMRYGKPSVKEI 128 (343) Q Consensus 71 ~~~~~~~Y--~~Iw~~-~gg~SPL~~~t~~qa~~----------L~~~L~~~~~~---------~V~~amry~~P~i~~~ 128 (343) |..-||.| ++||.. ++=||-|... |+++++ |=|+|. +++| .|---|--+-+.|=|| T Consensus 7 PA~fAEeYIveSIWn~~FPPGSiLPAE-RELsElIGVTRTTLREVLQRLA-RDGWLTIQHGKPTKVNnfWETSGLNILeT 84 (275) T TIGR02812 7 PAGFAEEYIVESIWNNRFPPGSILPAE-RELSELIGVTRTTLREVLQRLA-RDGWLTIQHGKPTKVNNFWETSGLNILET 84 (275) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCCHHH-HHHHHHCCCCCCHHHHHHHHHH-HHCCCCCCCCCCCCCCCHHHCCCCCHHHH T ss_conf 542354433221016434658987246-6575423766303789998876-41134365889986276243027357899 Q ss_pred HHHHHHCCCCEEE--EEECCHHHHHHHHHHHHHHHHHHHHHHCC---------CCCCCCCCCCCCCCHHHHH--HHHHHH Q ss_conf 9999870996589--96317111001468999999999987179---------9864213345578006899--999999 Q gi|254780200|r 129 INNLREEGCDRLL--IFPLYPQYSAATTGTAQDKVFQELIHMRW---------SPSLRTVPPYYEDSDYISA--LAQSIR 195 (343) Q Consensus 129 l~~l~~~g~~~ii--~lPLyPqyS~sTtgS~~~~~~~~l~~~~~---------~~~~~~I~~~~~~p~yI~a--~~~~I~ 195 (343) |-+|-.+++-.|| +|+ ++|....+| +.+|++.... ...+.......+...||++ +++.|+ T Consensus 85 L~~LD~~~~P~l~dnLLS-----ARTnIS~IY--ir~A~K~np~~~~~~~~~VI~s~e~L~~~~s~~~F~~~SP~~~~~~ 157 (275) T TIGR02812 85 LIRLDGESVPSLIDNLLS-----ARTNISAIY--IRAAFKNNPEESEELIENVIKSLEVLANAESFAEFVEASPLAEKVK 157 (275) T ss_pred HHHHCCCCCCHHHHHHHH-----HHHCCCHHH--HHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHH T ss_conf 986187777079999997-----764137788--9998622835689999999999999830401799985082689999 Q ss_pred HHHHHC Q ss_conf 999733 Q gi|254780200|r 196 EHFESI 201 (343) Q Consensus 196 ~~l~~~ 201 (343) +.+++. T Consensus 158 ~~l~d~ 163 (275) T TIGR02812 158 KELEDT 163 (275) T ss_pred HHHHCC T ss_conf 874203 No 213 >KOG0460 consensus Probab=22.77 E-value=54 Score=13.63 Aligned_cols=95 Identities=16% Similarity=0.109 Sum_probs=47.2 Q ss_pred EEEEEC---CCCHHHHHHCCCCHHHHHHHHHHHHHHHCC---CCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHCCCCCE Q ss_conf 999833---675026762799888999999999997266---75670478622577765443026899998997699839 Q gi|254780200|r 208 LLVSFH---QMPVSYLLKGDPYGCHCHKTARLLKEFLSW---PDDRFKICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSL 281 (343) Q Consensus 208 llfSaH---glP~~~~~~gDpY~~q~~~t~~~i~~~l~~---~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v 281 (343) -|=+|| ..+.||-.+-|+- -.++-|.+..-. .+. --+.-----||. |-|.+.|--..+-|++++ T Consensus 104 TIn~aHveYeTa~RhYaH~DCP-----GHADYIKNMItGaaqMDG-aILVVaatDG~M----PQTrEHlLLArQVGV~~i 173 (449) T KOG0460 104 TINAAHVEYETAKRHYAHTDCP-----GHADYIKNMITGAAQMDG-AILVVAATDGPM----PQTREHLLLARQVGVKHI 173 (449) T ss_pred EEEEEEEEEECCCCCCCCCCCC-----CHHHHHHHHHCCCCCCCC-EEEEEECCCCCC----CCHHHHHHHHHHCCCCEE T ss_conf 6764356642244300147899-----638899875327323673-499997478988----406888888987287649 Q ss_pred EEECC-HHHHCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 99744-05422125588875899999997599 Q gi|254780200|r 282 AIITP-GFSSDCLETSYEIAHEAKEIFVNGGG 312 (343) Q Consensus 282 ~v~p~-gFvsD~lETl~Eidie~~e~~~e~Gg 312 (343) +|+-= .=..|--|.|+=.+.|.||+..+.|. T Consensus 174 vvfiNKvD~V~d~e~leLVEmE~RElLse~gf 205 (449) T KOG0460 174 VVFINKVDLVDDPEMLELVEMEIRELLSEFGF 205 (449) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 99971202468889999999999999997299 No 214 >COG0474 MgtA Cation transport ATPase [Inorganic ion transport and metabolism] Probab=22.76 E-value=54 Score=13.63 Aligned_cols=51 Identities=27% Similarity=0.287 Sum_probs=34.8 Q ss_pred EEEECCCCCCCCCCHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 786225777654430268999989976998399974405422125588875899999997599 Q gi|254780200|r 250 ICFQSRFGRIKCLEPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGG 312 (343) Q Consensus 250 l~fQSr~Gp~~WL~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg 312 (343) +.|..-.|=.-=++|++.++++.+.+.|+|-+.+ +=||.|| ++..+++.|. T Consensus 536 l~~lGl~g~~Dppr~~v~~aI~~l~~AGI~v~Mi-----TGD~~~T-------A~aIa~~~Gi 586 (917) T COG0474 536 LVFLGLTGIEDPPREDVKEAIEELREAGIKVWMI-----TGDHVET-------AIAIAKECGI 586 (917) T ss_pred CEEEEEEEECCCCCCCHHHHHHHHHHCCCCEEEE-----CCCCHHH-------HHHHHHHHCC T ss_conf 0402001212899711899999999889947998-----7999899-------9999998199 No 215 >TIGR02094 more_P_ylases alpha-glucan phosphorylases; InterPro: IPR011834 This family consists of known phosphorylases, and homologues believed to share the function of using inorganic phosphate to cleave an alpha 1,4 linkage between the terminal glucose residue and the rest of the polymer (maltodextrin, glycogen, etc.). The name of the glucose storage polymer substrate, and therefore the name of this enzyme, depends on the chain lengths and branching patterns. A number of the members of this family have been shown to operate on small maltodextrins, as may be obtained by utilization of exogenous sources. This family represents a distinct clade from the related families: Glycogen/starch/alpha-glucan phosphorylase and Glycosyl transferase, family 35.; GO: 0004645 phosphorylase activity, 0030170 pyridoxal phosphate binding, 0005975 carbohydrate metabolic process. Probab=22.71 E-value=46 Score=14.11 Aligned_cols=51 Identities=14% Similarity=0.176 Sum_probs=21.1 Q ss_pred CCHH--HHHHHHHHHHHHHCC-CCCCCEEEEECCCCCCCCCCHHHHHHHHHHHH Q ss_conf 9888--999999999997266-75670478622577765443026899998997 Q gi|254780200|r 225 PYGC--HCHKTARLLKEFLSW-PDDRFKICFQSRFGRIKCLEPPTDKTVEKLAH 275 (343) Q Consensus 225 pY~~--q~~~t~~~i~~~l~~-~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L~~ 275 (343) +|++ -+....++|++-|+. ++-+..+.|=-|-=|.-=-|=.+...+-+..+ T Consensus 413 tYKRa~Llf~D~eRL~~il~~n~~RPVQ~vFAGKAHP~D~~GK~~Iq~i~~~s~ 466 (618) T TIGR02094 413 TYKRATLLFRDLERLKRILNNNPERPVQIVFAGKAHPADEAGKELIQRIVEFSK 466 (618) T ss_pred CCCCHHHHCCCHHHHHHHHCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHH T ss_conf 002613220777899998356888856889715567997545899999999862 No 216 >cd01017 AdcA Metal binding protein AcdA. These proteins have been shown to function in the ABC uptake of Zn2+ and Mn2+ and in competence for genetic transformation and adhesion. The AcdA proteins belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and they bind their ligand in the cleft between these domains. In addition, many of these proteins have a low complexity region containing metal binding histidine-rich motif (repetitive HDH sequence). Probab=22.53 E-value=55 Score=13.60 Aligned_cols=204 Identities=12% Similarity=0.095 Sum_probs=102.2 Q ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHH---HHHHHHCCC Q ss_conf 59999999986522478430011210454247899999870996589963171110014689999999---999871799 Q gi|254780200|r 95 TRDQATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVF---QELIHMRWS 171 (343) Q Consensus 95 t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~---~~l~~~~~~ 171 (343) +-.+..+|++ .++.|..|.-+ +|++...++.+...++. ++-+- .+...-... ..-...... T Consensus 43 ~p~d~~~l~~-----Adli~~nG~~l-E~~l~~~~~~~~~~~~~---vi~~~-------~~~~~~~~~~~~~~~~~~~~~ 106 (282) T cd01017 43 SPKDIARIAD-----ADVFVYNGLGM-ETWAEKVLKSLQNKKLK---VVEAS-------KGIKLLKAGGAEHDHDHSHSH 106 (282) T ss_pred CHHHHHHHHC-----CCEEEEECCCC-HHHHHHHHHHCCCCCCE---EEECC-------CCCCCCCCCCCCCCCCCCCCC T ss_conf 9999999966-----99999918783-68999999853689977---99877-------895235567765555555445 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 86421334557800689999999999973389986699983367502676279988899999999999726675670478 Q gi|254780200|r 172 PSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKIC 251 (343) Q Consensus 172 ~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~ 251 (343) ..-..=+++|-+|.....+++.|.+.|.+.+-.. -..+-++.+-|..++.+.-+.+.+.+.-..++--++ T Consensus 107 ~~~~~DPH~Wldp~~~~~~a~~I~~~L~~~dP~~----------~~~y~~N~~~~~~~L~~l~~~~~~~l~~~~~~~~v~ 176 (282) T cd01017 107 HHGDYDPHVWLSPVLAIQQVENIKDALIKLDPDN----------KEYYEKNAAAYAKKLEALDQEYRAKLAKAKGKTFVT 176 (282) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCHHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEE T ss_conf 5799998255781559999999999999868441----------799999999999999999999999874436785687 Q ss_pred EEC-------CCC----------CCCCCCHH-HHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 622-------577----------76544302-689999899769983999744054221255888758999999975997 Q gi|254780200|r 252 FQS-------RFG----------RIKCLEPP-TDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGE 313 (343) Q Consensus 252 fQS-------r~G----------p~~WL~P~-t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~ 313 (343) |.. ++| ++.=.+|. +.+.++.+.+.|++ ++++-+.|-..-.|++ +.+.|+. T Consensus 177 ~H~af~Yf~~~ygl~~~~~~~~~~~~eps~~~l~~l~~~ik~~~v~-~If~e~~~~~~~~~~i----------a~e~g~~ 245 (282) T cd01017 177 QHAAFGYLARRYGLKQIAIVGVSPEVEPSPKQLAELVEFVKKSDVK-YIFFEENASSKIAETL----------AKETGAK 245 (282) T ss_pred ECCHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCC-EEEEECCCCCHHHHHH----------HHHHCCC T ss_conf 3533789998669848654334888899999999999999985998-9998189990999999----------9981996 Q ss_pred EEEEECCCCC-------CHHHHHHHHHHH Q ss_conf 5898168987-------989999999999 Q gi|254780200|r 314 KFTQVPCLNS-------SNLSIDLLEKIT 335 (343) Q Consensus 314 ~~~rvp~lN~-------~~~fi~~La~lv 335 (343) -...-|+-+. ...+++++..-+ T Consensus 246 v~~l~pl~~~~~~~~~~~~~Y~~~m~~N~ 274 (282) T cd01017 246 LLVLNPLETLTKEEIDDGKDYFSLMKENL 274 (282) T ss_pred EEEECCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 78544766665234887366999999999 No 217 >PRK10927 essential cell division protein; Provisional Probab=22.52 E-value=55 Score=13.60 Aligned_cols=54 Identities=19% Similarity=0.340 Sum_probs=27.1 Q ss_pred CCHHHHHH-----HHHHHHHHHCCCCCCCEEEEECCCC-CCCC----CCHHH-----HHHHHHHHHCCCCCEEEE Q ss_conf 98889999-----9999999726675670478622577-7654----43026-----899998997699839997 Q gi|254780200|r 225 PYGCHCHK-----TARLLKEFLSWPDDRFKICFQSRFG-RIKC----LEPPT-----DKTVEKLAHDGIKSLAII 284 (343) Q Consensus 225 pY~~q~~~-----t~~~i~~~l~~~~~~~~l~fQSr~G-p~~W----L~P~t-----~~~l~~L~~~G~k~v~v~ 284 (343) .|.-||-. -++-|..+|.+. ++-||+. .+.| ||||- +.++..|...|+.+-+.. T Consensus 247 r~mvQCGSFK~~dQAEsvrA~LAF~------GieSrIts~ggWnRVvlGPy~~k~~a~~~~~~l~~ag~~~cI~~ 315 (319) T PRK10927 247 RWMVQCGSFRGAEQAETVRAQLAFE------GFDSKITTNNGWNRVVIGPVKGKENADSTLNRLKMAGHTNCIRL 315 (319) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHC------CHHHEEECCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCCCEEC T ss_conf 3666304564543688888999752------42213402698656884477857789999999986687773553 No 218 >COG1350 Predicted alternative tryptophan synthase beta-subunit (paralog of TrpB) [General function prediction only] Probab=22.25 E-value=56 Score=13.56 Aligned_cols=68 Identities=21% Similarity=0.326 Sum_probs=41.9 Q ss_pred CHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHC----CCCCHHHHHHHH---HHCCCCEEEEE Q ss_conf 47889999987077887404577759999999986522478430011210----454247899999---87099658996 Q gi|254780200|r 71 PSKIKHAYAKIWNTAKNESILRTHTRDQATNLAKRLESISSIVVDWAMRY----GKPSVKEIINNL---REEGCDRLLIF 143 (343) Q Consensus 71 ~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry----~~P~i~~~l~~l---~~~g~~~ii~l 143 (343) |.++.+.|..| |-..||.+- ..|++.|+ +...+++-|-= |.-.|..++.|. +.+|+++++-= T Consensus 65 P~Ev~e~Y~~~----gRPTPL~RA-----~~LE~~L~--tparIYyK~Eg~tptGSHKiNTAlAqaYyak~eg~~rl~TE 133 (432) T COG1350 65 PEEVREAYLQI----GRPTPLIRA-----KNLEEALG--TPARIYYKYEGVTPTGSHKINTALAQAYYAKKEGAKRLTTE 133 (432) T ss_pred CHHHHHHHHHH----CCCCCHHHH-----HHHHHHHC--CCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHCCCEEEECC T ss_conf 69999999972----799721336-----45999849--98079996045478887772349999999875273034304 Q ss_pred ECCHHH Q ss_conf 317111 Q gi|254780200|r 144 PLYPQY 149 (343) Q Consensus 144 PLyPqy 149 (343) -=.=|+ T Consensus 134 TGAGQW 139 (432) T COG1350 134 TGAGQW 139 (432) T ss_pred CCCCHH T ss_conf 577517 No 219 >cd04198 eIF-2B_gamma_N The N-terminal domain of gamma subunit of the eIF-2B is a subfamily of glycosyltransferase 2. N-terminal domain of gamma subunit of the eukaryotic translation initiation factor 2B (eIF-2B): eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit gamma shares sequence similarity with epsilon subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=22.14 E-value=56 Score=13.55 Aligned_cols=24 Identities=17% Similarity=0.353 Sum_probs=21.4 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 454247899999870996589963 Q gi|254780200|r 121 GKPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 121 ~~P~i~~~l~~l~~~g~~~ii~lP 144 (343) +.|-|+-+++.|.+.|+++++++. T Consensus 30 n~P~i~y~l~~l~~~G~~~i~i~~ 53 (214) T cd04198 30 NKPMIWYPLDWLEKAGFEDVIVVV 53 (214) T ss_pred CEEHHHHHHHHHHHCCCCEEEEEC T ss_conf 997499999999986998899995 No 220 >PRK06827 phosphoribosylpyrophosphate synthetase; Provisional Probab=22.11 E-value=56 Score=13.54 Aligned_cols=149 Identities=17% Similarity=0.263 Sum_probs=73.4 Q ss_pred HHHHHHCCCCEE-EEEECCHHHHHHHHHH-----HHHHHHHHHHHHCCC----CCCC--CCCCCC---------CCCHHH Q ss_conf 999987099658-9963171110014689-----999999999871799----8642--133455---------780068 Q gi|254780200|r 129 INNLREEGCDRL-LIFPLYPQYSAATTGT-----AQDKVFQELIHMRWS----PSLR--TVPPYY---------EDSDYI 187 (343) Q Consensus 129 l~~l~~~g~~~i-i~lPLyPqyS~sTtgS-----~~~~~~~~l~~~~~~----~~~~--~I~~~~---------~~p~yI 187 (343) +.++++....+| +++|.|| ||+---.+ +.+-+.+.+...+.. ..++ -|..|+ ..+.|+ T Consensus 115 IdA~kraSA~rItaViPy~g-YaRQDrk~~RepItAkLvA~lL~~aGvdrviT~DLHa~qIQgfF~i~pvDnl~~~~~~~ 193 (381) T PRK06827 115 IDAIRQAKANRITVIMPLLY-YSRQHKRKGRESLDCALALQELEELGVDHILTFDIHDPNIQNAIPLMSFENLYPSYEIV 193 (381) T ss_pred HHHHHHCCCCEEEEECCCCC-HHCCCCCCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCHHHH T ss_conf 98743435874899856881-31257777888747999999999719996999757872247787788754677538999 Q ss_pred HHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCC------CCC-- Q ss_conf 999999999997338998669998336750267627998889999999999972667567047862257------776-- Q gi|254780200|r 188 SALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRF------GRI-- 259 (343) Q Consensus 188 ~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~------Gp~-- 259 (343) +.+.+..... ..+ .+..++.|= +.|.. +-++..++.||.+ +-+.|--|- |+. T Consensus 194 ~~l~~~~~~~--~~~-~~~~vVVSP--------D~Ggv------~RA~~~A~~L~~~---laii~KrR~~s~v~~~~n~i 253 (381) T PRK06827 194 KSLLENEKLL--ELD-KENLVVISP--------DTGAM------DRAKYYSSVLKVP---LGLFYKRRDYSTVVNGKNPI 253 (381) T ss_pred HHHHHHCCCC--CCC-CCCCEEEEC--------CCCCH------HHHHHHHHHCCCC---EEEEEEECCCCCCCCCCCCE T ss_conf 9999845312--368-677779926--------99738------9999999971989---89999764676445667843 Q ss_pred ---CC------------------CCHHHHHHHHHHHHCCCCCEEEECC-HHHHCCHHHHHH Q ss_conf ---54------------------4302689999899769983999744-054221255888 Q gi|254780200|r 260 ---KC------------------LEPPTDKTVEKLAHDGIKSLAIITP-GFSSDCLETSYE 298 (343) Q Consensus 260 ---~W------------------L~P~t~~~l~~L~~~G~k~v~v~p~-gFvsD~lETl~E 298 (343) +- .+=++.++.+.|.+.|.++|.+++- |..||.+|.+.+ T Consensus 254 ~~~~~lggdV~GK~~IIvDDmIdTggTl~~aA~~Lk~~GA~~V~a~aTHgvfS~a~e~~~~ 314 (381) T PRK06827 254 VVHELLGRDVEGKDVFIVDDMIASGGSVLDAAKELKSRGAKKIICAVSFPFFTEGLEKFDK 314 (381) T ss_pred EEEEEECCCCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCCEEEEEEECHHCCCHHHHHHH T ss_conf 6752413556798899972431452689999999998799889999973660476999976 No 221 >cd02538 G1P_TT_short G1P_TT_short is the short form of glucose-1-phosphate thymidylyltransferase. This family is the short form of glucose-1-phosphate thymidylyltransferase. Glucose-1-phosphate thymidylyltransferase catalyses the formation of dTDP-glucose, from dTTP and glucose 1-phosphate. It is the first enzyme in the biosynthesis of dTDP-L-rhamnose, a cell wall constituent and a feedback inhibitor of the enzyme.There are two forms of Glucose-1-phosphate thymidylyltransferase in bacteria and archeae; short form and long form. The homotetrameric, feedback inhibited short form is found in numerous bacterial species that produce dTDP-L-rhamnose. The long form, which has an extra 50 amino acids c-terminal, is found in many species for which it serves as a sugar-activating enzyme for antibiotic biosynthesis and or other, unknown pathways, and in which dTDP-L-rhamnose is not necessarily produced. Probab=21.94 E-value=56 Score=13.52 Aligned_cols=25 Identities=28% Similarity=0.461 Sum_probs=20.6 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 0454247899999870996589963 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~ii~lP 144 (343) .+.|.++-.+++|.+.|+++++++. T Consensus 29 ~~kPli~~~l~~l~~~Gi~~i~iv~ 53 (240) T cd02538 29 YDKPMIYYPLSTLMLAGIREILIIS 53 (240) T ss_pred CCEEEHHHHHHHHHHCCCCCCEEEC T ss_conf 9997189999999985996231002 No 222 >cd04197 eIF-2B_epsilon_N The N-terminal domain of epsilon subunit of the eIF-2B is a subfamily of glycosyltransferase 2. N-terminal domain of epsilon subunit of the eukaryotic translation initiation factor 2B (eIF-2B): eIF-2B is a guanine nucleotide-exchange factor which mediates the exchange of GDP (bound to initiation factor eIF2) for GTP, generating active eIF2.GTP complex. EIF2B is a complex multimeric protein consisting of five subunits named alpha, beta, gamma, delta and epsilon. Subunit epsilon shares sequence similarity with gamma subunit, and with a family of bifunctional nucleotide-binding enzymes such as ADP-glucose pyrophosphorylase, suggesting that epsilon subunit may play roles in nucleotide binding activity. In yeast, eIF2B gamma enhances the activity of eIF2B-epsilon leading to the idea that these subunits form the catalytic subcomplex. Probab=21.86 E-value=56 Score=13.51 Aligned_cols=24 Identities=17% Similarity=0.378 Sum_probs=21.2 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 454247899999870996589963 Q gi|254780200|r 121 GKPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 121 ~~P~i~~~l~~l~~~g~~~ii~lP 144 (343) +.|-|+-+++.|.+.|+++|+++- T Consensus 30 n~Pli~y~l~~L~~~G~~ei~v~~ 53 (217) T cd04197 30 NVPLIDYTLEFLALNGVEEVFVFC 53 (217) T ss_pred CEEHHHHHHHHHHHCCCCEEEEEC T ss_conf 998399999999987997799994 No 223 >TIGR01710 typeII_sec_gspG general secretion pathway protein G; InterPro: IPR010054 This entry represents GspG, protein G of the main terminal branch of the general secretion pathway, also called type II secretion. It transports folded proteins across the bacterial outer membrane and is widely distributed in Gram-negative pathogens.; GO: 0008565 protein transporter activity, 0015628 protein secretion by the type II secretion system, 0015627 type II protein secretion system complex. Probab=21.79 E-value=57 Score=13.50 Aligned_cols=58 Identities=16% Similarity=0.120 Sum_probs=36.5 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHHHHHHHHHHC Q ss_conf 99876665125747889999987077887404577759999999986522478430011210454247899999870 Q gi|254780200|r 59 RPILFGYILNFRPSKIKHAYAKIWNTAKNESILRTHTRDQATNLAKRLESISSIVVDWAMRYGKPSVKEIINNLREE 135 (343) Q Consensus 59 ~~il~~~I~~~R~~~~~~~Y~~Iw~~~gg~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~~~l~~l~~~ 135 (343) -=+|+++|+|.=-.+..+. -..-++.|.++|++.|+- +.-..+. .|+-|+.|+.|+.+ T Consensus 15 lGLLaalv~P~l~s~~d~A-------------~~~~a~~qi~~l~~ALd~---yrLDng~---yPt~eQGL~ALV~k 72 (137) T TIGR01710 15 LGLLAALVVPKLFSQADKA-------------KAQVAKAQIKALKNALDM---YRLDNGR---YPTEEQGLAALVKK 72 (137) T ss_pred HHHHHHHHHHHHCCCCCHH-------------HHHHHHHHHHHHHHHHHH---HHHHHCC---CCCCHHHHHHHCCC T ss_conf 9999999865534875535-------------788899999999999876---6566258---98815789982779 No 224 >cd06595 GH31_xylosidase_XylS-like This family represents an uncharacterized glycosyl hydrolase family 31 (GH31) enzyme found in bacteria and eukaryotes that is related to the XylS xylosidase of Sulfolobus solfataricus. Alpha-xylosidases catalyze the release of an alpha-xylose residue from the non-reducing end of alpha-xyloside substrates. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. Probab=21.68 E-value=57 Score=13.49 Aligned_cols=76 Identities=11% Similarity=0.235 Sum_probs=44.6 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC-CCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 5424789999987099658996317111001468999999999987179986421-334557800689999999999973 Q gi|254780200|r 122 KPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRT-VPPYYEDSDYISALAQSIREHFES 200 (343) Q Consensus 122 ~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~-I~~~~~~p~yI~a~~~~I~~~l~~ 200 (343) -|..+..+++|++.|+. +++-+.|+.........++.+.+. ....+.... +.-.+.+|.+.+.+.+.+...+.. T Consensus 73 FPdp~~m~~~l~~~G~~--~v~~~~P~~~~~~~~~~y~~~~~~---~~~~~~~~~~~~~d~~np~~~~~~~~~~~~~~~~ 147 (292) T cd06595 73 FPDPEKLLQDLHDRGLK--VTLNLHPADGIRAHEDQYPEMAKA---LGVDPATEGPILFDLTNPKFMDAYFDNVHRPLEK 147 (292) T ss_pred CCCHHHHHHHHHHCCCE--EEEEECCCCCCCCCHHHHHHHHHH---CCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHC T ss_conf 99999999999978998--999817875567545779999885---0888888996150489989999999987555430 Q ss_pred CC Q ss_conf 38 Q gi|254780200|r 201 IQ 202 (343) Q Consensus 201 ~~ 202 (343) .. T Consensus 148 ~G 149 (292) T cd06595 148 QG 149 (292) T ss_pred CC T ss_conf 68 No 225 >pfam03945 Endotoxin_N delta endotoxin, N-terminal domain. This family contains insecticidal toxins produced by Bacillus species of bacteria. During spore formation the bacteria produce crystals of this protein. When an insect ingests these proteins they are activated by proteolytic cleavage. The N terminus is cleaved in all of the proteins and a C terminal extension is cleaved in some members. Once activated the endotoxin binds to the gut epithelium and causes cell lysis leading to death. This activated region of the delta endotoxin is composed of three structural domains. The N-terminal helical domain is involved in membrane insertion and pore formation. The second and third domains are involved in receptor binding. Probab=21.63 E-value=57 Score=13.48 Aligned_cols=55 Identities=13% Similarity=0.061 Sum_probs=31.5 Q ss_pred CHHHHHCCCCCHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 001121045424789999987099658996317111001468999999999987179 Q gi|254780200|r 114 VDWAMRYGKPSVKEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRW 170 (343) Q Consensus 114 V~~amry~~P~i~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~ 170 (343) |.-.+..-+-.+...+..++..|. ++.+||+|-|.+. -.-.-++....--.+|+. T Consensus 100 v~~~f~~~~~~f~~~~~~F~~~gy-ev~lLp~YaqaAn-LHLlLLrd~i~yg~~Wg~ 154 (224) T pfam03945 100 VRTRFNILDSLFTNAIPSFAVSGY-EVLLLPVYAQAAN-LHLLLLRDAVIFGNRWGL 154 (224) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHCC T ss_conf 999999999999997799826885-2568999999999-999999999999999399 No 226 >cd01461 vWA_interalpha_trypsin_inhibitor vWA_interalpha trypsin inhibitor (ITI): ITI is a glycoprotein composed of three polypeptides- two heavy chains and one light chain (bikunin). Bikunin confers the protease-inhibitor function while the heavy chains are involved in rendering stability to the extracellular matrix by binding to hyaluronic acid. The heavy chains carry the VWA domain with a conserved MIDAS motif. Although the exact role of the VWA domains remains unknown, it has been speculated to be involved in mediating protein-protein interactions with the components of the extracellular matrix. Probab=21.57 E-value=57 Score=13.47 Aligned_cols=44 Identities=18% Similarity=0.160 Sum_probs=28.6 Q ss_pred HHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 689999899769983999744054221255888758999999975997589816 Q gi|254780200|r 266 TDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVP 319 (343) Q Consensus 266 t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp 319 (343) +.+.+.+..+ .++-|.++||=.|+-+- ..+++|...|| .|.+|. T Consensus 117 ~~~~~~~~~~---~~i~i~tig~G~~~~~~------~L~~iA~~~~G-~~~~v~ 160 (171) T cd01461 117 ILKNVREALS---GRIRLFTFGIGSDVNTY------LLERLAREGRG-IARRIY 160 (171) T ss_pred HHHHHHHHHC---CCCEEEEEEECCCCCHH------HHHHHHHCCCC-EEEECC T ss_conf 9999999744---89639999978979999------99999972898-899889 No 227 >PRK13463 phosphatase PhoE; Provisional Probab=21.52 E-value=57 Score=13.46 Aligned_cols=124 Identities=15% Similarity=0.224 Sum_probs=58.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCCCHH--HHHHHHHH-CCCCEEEEEECC-----HHHHHHHHHHHHH Q ss_conf 404577759999999986522478430011210454247--89999987-099658996317-----1110014689999 Q gi|254780200|r 88 ESILRTHTRDQATNLAKRLESISSIVVDWAMRYGKPSVK--EIINNLRE-EGCDRLLIFPLY-----PQYSAATTGTAQD 159 (343) Q Consensus 88 ~SPL~~~t~~qa~~L~~~L~~~~~~~V~~amry~~P~i~--~~l~~l~~-~g~~~ii~lPLy-----PqyS~sTtgS~~~ 159 (343) .+||...-+.||++|.+.|+.. .+ ..-|+.|.-. ++.+-+.. .++. ++..+-+ ..+..-|.....+ T Consensus 26 D~~Lt~~G~~QA~~l~~~l~~~---~~--~~i~sSpl~Ra~eTA~~i~~~~~l~-i~~~~~L~E~~~G~weG~~~~e~~~ 99 (203) T PRK13463 26 NSALTENGILQAKQLGERMKDL---SI--HAIYSSPSERTLHTAELIKGERDIP-IIADEHFYEINMGIWEGQTIDDIER 99 (203) T ss_pred CCCCCHHHHHHHHHHHHHHCCC---CC--CEEECCCCCCCHHHHHHHHHCCCCC-EEECCCCCCCCCCCCCCCCHHHHHH T ss_conf 9983999999999998761478---87--8898078657899999998537997-7877041127887336991899998 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHC--CCCCCEEEEEECCCCHHHHH Q ss_conf 999999871799864213345578006899999999999733--89986699983367502676 Q gi|254780200|r 160 KVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESI--QWTPEMLLVSFHQMPVSYLL 221 (343) Q Consensus 160 ~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~--~~~~~~llfSaHglP~~~~~ 221 (343) .+-+....+...|.. +- ...-+.|. .+.+++.++++.. ......++..+||-..+.+- T Consensus 100 ~~p~~~~~~~~~~~~-~~--~~~GEs~~-~~~~Rv~~~l~~i~~~~~~~~vlvVsHg~~ir~ll 159 (203) T PRK13463 100 QYPDDIQLFWNEPHL-FQ--STSGENFE-AVHKRVIEGMQLLLEKHKGESILIVSHAAAAKLLV 159 (203) T ss_pred HCHHHHHHHHHCCCC-CC--CCCCCCHH-HHHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHH T ss_conf 799999998629122-68--99998999-99999999999999888989199993979999999 No 228 >pfam11633 Nsp3 Replicase polyprotein 1ab. This family of proteins represents Nsp3, the product of ORF1a in group 2 coronavirus.This is the largest replicase subunit. Probab=21.46 E-value=58 Score=13.45 Aligned_cols=54 Identities=17% Similarity=0.182 Sum_probs=31.6 Q ss_pred HHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCH Q ss_conf 689999899769983999744054221255888758999999975997589816898798 Q gi|254780200|r 266 TDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGEKFTQVPCLNSSN 325 (343) Q Consensus 266 t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~~~~rvp~lN~~~ 325 (343) +.|++..|-.-| +-++.+|+||+...+. |.+-....| ..-.-+...+|....=| T Consensus 80 l~dVi~alNs~~-~~l~~mPfGyVthG~~-La~sa~~mR----~ltvP~vv~v~S~~~Vp 133 (142) T pfam11633 80 LADVITALNSLG-KPLCMMPFGYVTHGLD-LAVSAVSMR----SLTVPHVVVVPSKDQVP 133 (142) T ss_pred HHHHHHHHHHCC-CCEEEEEEEEEECCCC-HHHHHHHHH----HCCCCEEEEECCCCCCH T ss_conf 899999886228-9707757678752722-899876876----14556699942824035 No 229 >PRK13389 UTP--glucose-1-phosphate uridylyltransferase subunit GalU; Provisional Probab=21.44 E-value=58 Score=13.45 Aligned_cols=25 Identities=16% Similarity=0.402 Sum_probs=18.7 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 0454247899999870996589963 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~ii~lP 144 (343) +..|-|.-+++++.+.|+++|+++. T Consensus 37 ~dkPii~~~vee~~~aGI~ei~iV~ 61 (302) T PRK13389 37 VDKPLIQYVVNECIAAGITEIVLVT 61 (302) T ss_pred CCEEHHHHHHHHHHHCCCCEEEEEE T ss_conf 9887599999999986997899992 No 230 >PRK05282 peptidase E; Validated Probab=20.93 E-value=59 Score=13.38 Aligned_cols=13 Identities=15% Similarity=0.061 Sum_probs=5.9 Q ss_pred HHHHHHHHHHCCC Q ss_conf 8999999987489 Q gi|254780200|r 35 FSLRRYLREFLLD 47 (343) Q Consensus 35 ~~V~~yL~~fl~D 47 (343) +-..+.+++||.+ T Consensus 18 ~~~~~~i~~~~~~ 30 (233) T PRK05282 18 EHALPLIAELLAG 30 (233) T ss_pred HHHHHHHHHHHHC T ss_conf 9999999999847 No 231 >PRK10015 hypothetical protein; Provisional Probab=20.76 E-value=59 Score=13.36 Aligned_cols=12 Identities=33% Similarity=0.454 Sum_probs=8.9 Q ss_pred CEEEEEECCCCH Q ss_conf 669998336750 Q gi|254780200|r 206 EMLLVSFHQMPV 217 (343) Q Consensus 206 ~~llfSaHglP~ 217 (343) ..+=++||-||+ T Consensus 272 ~~~eY~A~~Ipe 283 (429) T PRK10015 272 KLLEYSAHMVPE 283 (429) T ss_pred EEEEECCCCCCC T ss_conf 798862543587 No 232 >TIGR01429 AMP_deaminase AMP deaminase; InterPro: IPR006329 These sequences describe AMP deaminase, a large, well-conserved eukaryotic protein involved in energy metabolism. Most members of the family have an additional, poorly alignable region of 150 amino acids or more N-terminal to the conserved domain which defines this group of sequences.; GO: 0003876 AMP deaminase activity, 0009168 purine ribonucleoside monophosphate biosynthetic process. Probab=20.72 E-value=14 Score=17.80 Aligned_cols=42 Identities=29% Similarity=0.370 Sum_probs=21.9 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCCCCCEE------------EEECCCC---CCCCCCHHH Q ss_conf 7998889999999999972667567047------------8622577---765443026 Q gi|254780200|r 223 GDPYGCHCHKTARLLKEFLSWPDDRFKI------------CFQSRFG---RIKCLEPPT 266 (343) Q Consensus 223 gDpY~~q~~~t~~~i~~~l~~~~~~~~l------------~fQSr~G---p~~WL~P~t 266 (343) .|| -|.+-|-++++|...+..=-|.| +-||=|- +.+||||+- T Consensus 526 DDP--LQFhYTkEaLmEEYaiAAQvwkLS~cD~CElARNSVLQSGfe~~~K~~wLG~dY 582 (616) T TIGR01429 526 DDP--LQFHYTKEALMEEYAIAAQVWKLSACDMCELARNSVLQSGFEHQVKEHWLGPDY 582 (616) T ss_pred CCC--CHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCH T ss_conf 874--201231112578999999988740422778763344432510343234238773 No 233 >pfam07090 DUF1355 Protein of unknown function (DUF1355). This family consists of several hypothetical bacterial proteins of around 250 residues in length. The function of this family is unknown. Probab=20.72 E-value=60 Score=13.35 Aligned_cols=121 Identities=9% Similarity=0.075 Sum_probs=67.2 Q ss_pred EEECCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECC-----CC Q ss_conf 9631711100146899999999998717998642133455780068999999999997338998669998336-----75 Q gi|254780200|r 142 IFPLYPQYSAATTGTAQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQ-----MP 216 (343) Q Consensus 142 ~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHg-----lP 216 (343) -+==|.+|+.++..+..+.+..++.+.+. +++.++.--....|-..+ +.|. ..|.+|||==| || T Consensus 15 h~KGfd~f~s~~~~~ga~~~~~al~~~~~--~v~~mp~h~a~~~FP~t~-----eeL~----~YD~VILSDiGantllL~ 83 (183) T pfam07090 15 HIKGFDQFTSLKYDEGADLLLFAILRSPY--DVDYMPAHDAQIAFPVTL-----EELS----AYDAVILSDIGSNTLLLQ 83 (183) T ss_pred EECCCCCCCCCCHHCCHHHHHHHHHHCCC--EEEEECCCCCHHHCCCCH-----HHHH----CCCEEEEEECCCCEEECC T ss_conf 75352355431100487999999986795--189742310464577388-----8884----088899950576605318 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHH Q ss_conf 026762799888999999999997266756704786225777654430268999989 Q gi|254780200|r 217 VSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKICFQSRFGRIKCLEPPTDKTVEKL 273 (343) Q Consensus 217 ~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~fQSr~Gp~~WL~P~t~~~l~~L 273 (343) -..-.+|..-...|....+-|.+-=++.----.++||--.|...|-.--++++|.-- T Consensus 84 p~t~~~g~~~pnrl~li~~yV~~GG~l~MiGGy~sF~Gi~g~~~~~~Tpv~~vLPV~ 140 (183) T pfam07090 84 PDTWYRSKRPPNRLDLIADYVANGGALLMVGGYLSFQGIDGKARYANTPLAEVLPVT 140 (183) T ss_pred CHHHHCCCCCCCHHHHHHHHHHHCCCEEEECCCEEEECCCCCCCCCCCCHHHCCCEE T ss_conf 336652650830999999999808967996431664103666561587086712534 No 234 >PRK10122 UTP--glucose-1-phosphate uridylyltransferase subunit GalF; Provisional Probab=20.51 E-value=60 Score=13.32 Aligned_cols=25 Identities=16% Similarity=0.472 Sum_probs=21.3 Q ss_pred CCCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 0454247899999870996589963 Q gi|254780200|r 120 YGKPSVKEIINNLREEGCDRLLIFP 144 (343) Q Consensus 120 y~~P~i~~~l~~l~~~g~~~ii~lP 144 (343) +..|-|+-+++++.+.|+++|+++. T Consensus 32 ~dkPii~~~vee~~~aGI~eiiiV~ 56 (297) T PRK10122 32 VDKPMIQYIVDEIVAAGIKEIVLVT 56 (297) T ss_pred CCEEHHHHHHHHHHHCCCCEEEEEE T ss_conf 9853699999999987997899991 No 235 >PRK13134 consensus Probab=20.44 E-value=60 Score=13.31 Aligned_cols=189 Identities=14% Similarity=0.105 Sum_probs=87.5 Q ss_pred CEEEEEECCCCCCCCCHHHHHHHHHHHC-CCHHHHCCCHHHHHHHHHHHHHCCCCHHHHHH-HHHHHCCCCCCCHHHHHH Q ss_conf 6799997178889999889999999874-89544117868889987666512574788999-998707788740457775 Q gi|254780200|r 18 KIGVLLVNLGTPDGHDFFSLRRYLREFL-LDKRVVELPSWQWRPILFGYILNFRPSKIKHA-YAKIWNTAKNESILRTHT 95 (343) Q Consensus 18 K~gvLL~n~G~P~~~~~~~V~~yL~~fl-~D~~Vi~~p~~~~~~il~~~I~~~R~~~~~~~-Y~~Iw~~~gg~SPL~~~t 95 (343) |-=|..+..|=|+. +.-.+.++.+- ..-++|++.-++--|+.-| +-+++. ++.+ .+|.-+.... T Consensus 19 ~ali~yitaG~P~~---e~s~~~i~~l~~~GaDiiEiGiPfSDP~ADG-------PvIq~A~~rAL----~~G~~~~~~~ 84 (257) T PRK13134 19 PALIPFLTAGFPTS---ERFWDELEALDAAGADIIEVGVPFSDPVADG-------PVVAAASQRAL----ESGVTLRWIM 84 (257) T ss_pred CEEEEEEECCCCCH---HHHHHHHHHHHHCCCCEEEECCCCCCCCCCC-------HHHHHHHHHHH----HCCCCHHHHH T ss_conf 57998870707997---9999999999977999999789888876558-------99999999999----6799878999 Q ss_pred HHHHHHHHHHHHCCCCCCCHHHHHCCCCCH----HHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999999998652247843001121045424----7899999870996589963171110014689999999999871799 Q gi|254780200|r 96 RDQATNLAKRLESISSIVVDWAMRYGKPSV----KEIINNLREEGCDRLLIFPLYPQYSAATTGTAQDKVFQELIHMRWS 171 (343) Q Consensus 96 ~~qa~~L~~~L~~~~~~~V~~amry~~P~i----~~~l~~l~~~g~~~ii~lPLyPqyS~sTtgS~~~~~~~~l~~~~~~ 171 (343) + +.+.++++ .+.++ +-|.|++|.. ++-++++++.|++-+++ |=.|-- --+.+...+.+.+. T Consensus 85 ~-~~~~~~~~----~~~pi-vlMtY~N~i~~yG~e~F~~~~~~aGvdGvIi-pDLP~e-------E~~~~~~~~~~~gi- 149 (257) T PRK13134 85 D-GLAARKGR----LRAGL-VLMGYLNPFMQYGFERFVRDAADAGVAGCII-PDLPLD-------EDADLRALLAARGM- 149 (257) T ss_pred H-HHHHHHCC----CCCCE-EEEECCHHHHHHHHHHHHHHHHHCCCCEEEE-CCCCHH-------HHHHHHHHHHHCCC- T ss_conf 9-99987446----89998-9985345999746899999998679875994-699977-------88999999997598- Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 86421334557800689999999999973389986699983367502676279988899999999999726675670478 Q gi|254780200|r 172 PSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWPDDRFKIC 251 (343) Q Consensus 172 ~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~~~~~~l~ 251 (343) ..+.+|.+= ..+ ++|+.-.....+ .....|--|+-= .+. .-..+..+..+.|.+..++ +..++ T Consensus 150 ~~I~lvaPt-t~~-------~Ri~~i~~~s~g--FIY~vs~~GvTG--~~~--~~~~~~~~~i~~ik~~t~~---Pv~vG 212 (257) T PRK13134 150 DLIALVGPN-TGE-------GRMREYAAVASG--YVYVVSVMGTTG--VRD--GLPVEVADTLARARQCFSI---PVALG 212 (257) T ss_pred CEEEECCCC-CCH-------HHHHHHHHHCCC--EEEEEECCCCCC--CCC--CCHHHHHHHHHHHHHHCCC---CEEEE T ss_conf 269963899-999-------999999962888--089984355668--764--5528899999999970699---87998 Q ss_pred E Q ss_conf 6 Q gi|254780200|r 252 F 252 (343) Q Consensus 252 f 252 (343) | T Consensus 213 F 213 (257) T PRK13134 213 F 213 (257) T ss_pred C T ss_conf 0 No 236 >pfam06309 Torsin Torsin. This family consists of several eukaryotic torsin proteins. Torsion dystonia is an autosomal dominant movement disorder characterized by involuntary, repetitive muscle contractions and twisted postures. The most severe early-onset form of dystonia has been linked to mutations in the human DYT1 (TOR1A) gene encoding a protein termed torsinA. While causative genetic alterations have been identified, the function of torsin proteins and the molecular mechanism underlying dystonia remain unknown. Phylogenetic analysis of the torsin protein family indicates these proteins share distant sequence similarity with the large and diverse family of (pfam00004) proteins. It has been suggested that torsins play a role in effectively managing protein folding and that possible breakdown in a neuroprotective mechanism that is, in part, mediated by torsins may be responsible for the neuronal dysfunction associated with dystonia. Probab=20.42 E-value=60 Score=13.31 Aligned_cols=37 Identities=19% Similarity=0.302 Sum_probs=20.4 Q ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCH Q ss_conf 55780068999999999997338998669998336750 Q gi|254780200|r 180 YYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPV 217 (343) Q Consensus 180 ~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~ 217 (343) .|-++.=.+.+.+.|+.+++. ......|++|+||-|= T Consensus 27 lfGQhla~~~v~~al~~~l~~-~~p~KpLVlSfHG~tG 63 (127) T pfam06309 27 LFGQHLVKQLVVRSVKGHWEN-PKPRKPLVLSFHGWTG 63 (127) T ss_pred CCCHHHHHHHHHHHHHHHHCC-CCCCCCEEEEECCCCC T ss_conf 347798999999999999748-9999974887018999 No 237 >TIGR01392 homoserO_Ac_trn homoserine O-acetyltransferase; InterPro: IPR006296 This family describes homoserine-O-acetyltransferase, an enzyme of methionine biosynthesis. The family does not contain a number of homologs that appear to represent sequences derived from within the family but to have changed substantially.; GO: 0004414 homoserine O-acetyltransferase activity, 0009086 methionine biosynthetic process. Probab=20.35 E-value=61 Score=13.30 Aligned_cols=59 Identities=10% Similarity=0.159 Sum_probs=41.0 Q ss_pred CHHHHHHHHHHHHCCCCCEEEECCHHHHCCHHHHHHHHHHHHHHHHHCCCC-EEEEECCCCCCHHHH Q ss_conf 302689999899769983999744054221255888758999999975997-589816898798999 Q gi|254780200|r 263 EPPTDKTVEKLAHDGIKSLAIITPGFSSDCLETSYEIAHEAKEIFVNGGGE-KFTQVPCLNSSNLSI 328 (343) Q Consensus 263 ~P~t~~~l~~L~~~G~k~v~v~p~gFvsD~lETl~Eidie~~e~~~e~Gg~-~~~rvp~lN~~~~fi 328 (343) ..++.++|+++.. . +.=+||.||-|=+..|- .|..+.+..++.. .|.-|++.-.||.|. T Consensus 320 ~~~~~~AL~~~~~----~--~lv~s~~sD~Lfp~~~~-~e~~~~l~~~~~~V~Y~ei~S~~GHDAFL 379 (395) T TIGR01392 320 RGSLTEALAAIKA----P--FLVVSITSDWLFPPAES-RELAKALPAAGLRVTYAEIESPYGHDAFL 379 (395) T ss_pred CCCHHHHHHHCCC----C--EEEEEEECCCCCCHHHH-HHHHHHHHCCCCEEEEEEEECCCCCCCCC T ss_conf 9897999983798----8--89998606876688999-99999985036406688852689986000 No 238 >PRK13405 bchH magnesium chelatase subunit H; Provisional Probab=20.35 E-value=61 Score=13.30 Aligned_cols=184 Identities=16% Similarity=0.104 Sum_probs=83.0 Q ss_pred CCEEEEEECCCCCCCCCHHHHHHHHHHHCCCHHHHCCCHHHHHHHHH--HHHHCCCCHHHHH--HHHHHHCCCCCCCHHH Q ss_conf 86799997178889999889999999874895441178688899876--6651257478899--9998707788740457 Q gi|254780200|r 17 GKIGVLLVNLGTPDGHDFFSLRRYLREFLLDKRVVELPSWQWRPILF--GYILNFRPSKIKH--AYAKIWNTAKNESILR 92 (343) Q Consensus 17 ~K~gvLL~n~G~P~~~~~~~V~~yL~~fl~D~~Vi~~p~~~~~~il~--~~I~~~R~~~~~~--~Y~~Iw~~~gg~SPL~ 92 (343) .|.|=+-| ++|.+ ..-.+|+.+.....=-+-. .-.-.+|+ .-|++|-|.|.+. +|-.+..-.-+||+-| T Consensus 113 t~~G~f~m--~~~~~----~~~~~~~~~~~~~~~~~~~-~~~~~~lr~~pk~Lk~iPgkaqD~r~~~~~~qYWl~gs~eN 185 (1207) T PRK13405 113 TRMGKFDM--SGPAS----GALALLKKLRGKKGSGSSG-AGQMAMLRRLPKILRFIPGTAQDLRAYFLTLQYWLAGSDEN 185 (1207) T ss_pred HCCCCEEE--CCCCC----CHHHHHHHHCCCCCCCCHH-HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHH T ss_conf 45244550--79985----6689999854889998656-63888875320543348843899999999999882799899 Q ss_pred HHHHHHHHHHHHHHHCC----------CCCC-CHHHHHCCCCCHH----HHHHHHHHC-CCCEEEEEECCHHHHHHHHHH Q ss_conf 77599999999865224----------7843-0011210454247----899999870-996589963171110014689 Q gi|254780200|r 93 THTRDQATNLAKRLESI----------SSIV-VDWAMRYGKPSVK----EIINNLREE-GCDRLLIFPLYPQYSAATTGT 156 (343) Q Consensus 93 ~~t~~qa~~L~~~L~~~----------~~~~-V~~amry~~P~i~----~~l~~l~~~-g~~~ii~lPLyPqyS~sTtgS 156 (343) .. .....|..++... ..-+ .+..|-.++|... +.++.+... +-.-.|.+-+|=-|--+.--. T Consensus 186 ~~--~m~~~l~~~Y~~~~~~~~~~~~~~~~PveyP~~GlyHP~~~~~~~~~~~~~~~~~~~~ptVgilfyRs~~l~gn~~ 263 (1207) T PRK13405 186 VA--NMVRFLVSRYAAGPRAALRGIAKAEAPIEYPDVGLYHPRMPGRITEDLDDLPRPAGAKGTVGLLLMRSYVLAGNTA 263 (1207) T ss_pred HH--HHHHHHHHHHCCCCCHHHCCCCCCCCCCCCCCCEEECCCCCCCCCCCHHHCCCCCCCCCEEEEEEEHHHHHCCCCH T ss_conf 99--9999999985356312213788999871057741028999855320587555545789809999606557238848 Q ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCCH Q ss_conf 9999999998717998642133455780068999999999997338998669998336750 Q gi|254780200|r 157 AQDKVFQELIHMRWSPSLRTVPPYYEDSDYISALAQSIREHFESIQWTPEMLLVSFHQMPV 217 (343) Q Consensus 157 ~~~~~~~~l~~~~~~~~~~~I~~~~~~p~yI~a~~~~I~~~l~~~~~~~~~llfSaHglP~ 217 (343) .++.+.+++...+-. .|+-|...=..- ..|+..+..........|.|..|-+. T Consensus 264 ~idalI~alE~~Gl~----viPvF~~gld~~----~ai~~~f~~~~~~~vDalvs~tgFsL 316 (1207) T PRK13405 264 HYDGVIEAFEAKGLR----VVPAFASGLDGR----PAIERYFMKDGRPTVDAVVSLTGFSL 316 (1207) T ss_pred HHHHHHHHHHHCCCE----EEEEEECCCCCC----HHHHHHHHHCCCCCCEEEEEECCEEC T ss_conf 999999999987991----889986787871----77999986268985118986031103 No 239 >COG3222 Uncharacterized protein conserved in bacteria [Function unknown] Probab=20.29 E-value=61 Score=13.29 Aligned_cols=91 Identities=14% Similarity=0.121 Sum_probs=46.8 Q ss_pred HHHHHHHHHH-HHCCCCCCEEEEEECCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCC-CCCCEEEEECCCCC-----CCC Q ss_conf 9999999999-7338998669998336750267627998889999999999972667-56704786225777-----654 Q gi|254780200|r 189 ALAQSIREHF-ESIQWTPEMLLVSFHQMPVSYLLKGDPYGCHCHKTARLLKEFLSWP-DDRFKICFQSRFGR-----IKC 261 (343) Q Consensus 189 a~~~~I~~~l-~~~~~~~~~llfSaHglP~~~~~~gDpY~~q~~~t~~~i~~~l~~~-~~~~~l~fQSr~Gp-----~~W 261 (343) .+.++.++.. ..+.+. ..++.--|--|.- +. .|-.+|.+.-..+-..||.. +..|.+-=--||-| .+| T Consensus 88 d~gdRlars~~~a~~~~-~~VliIg~DcP~l--t~--elLa~a~taL~~~paVLGpa~dGGy~llgLrr~~pe~fe~ipw 162 (211) T COG3222 88 DLGDRLARSHVDAFDGS-YPVLIIGMDCPGL--TA--ELLADAFTALLQIPAVLGPAFDGGYYLLGLRRFAPELFEAIPW 162 (211) T ss_pred CHHHHHHHHHHHHHCCC-CCEEEEECCCCCC--CH--HHHHHHHHHHHCCCCEECCCCCCCEEEEEEECCCHHHHHCCCC T ss_conf 88899999999975688-7279982589764--79--9999999997147610222225837899731048999950888 Q ss_pred CCHHHH-HHHHHHHHCCCCCEEEE Q ss_conf 430268-99998997699839997 Q gi|254780200|r 262 LEPPTD-KTVEKLAHDGIKSLAII 284 (343) Q Consensus 262 L~P~t~-~~l~~L~~~G~k~v~v~ 284 (343) =+|.+- .+++.+...|...-++- T Consensus 163 g~~~v~~lTl~~lrqng~~~~llp 186 (211) T COG3222 163 GTPDVLELTLKALRQNGIDVYLLP 186 (211) T ss_pred CCCHHHHHHHHHHHHCCCCCCCCC T ss_conf 874599999999997499611367 No 240 >KOG0434 consensus Probab=20.23 E-value=61 Score=13.28 Aligned_cols=25 Identities=16% Similarity=0.255 Sum_probs=11.5 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 3345578006899999999999733 Q gi|254780200|r 177 VPPYYEDSDYISALAQSIREHFESI 201 (343) Q Consensus 177 I~~~~~~p~yI~a~~~~I~~~l~~~ 201 (343) +---|.++.|++-.....+--+++. T Consensus 848 ~ivi~~d~~~ledik~l~~yI~eEl 872 (1070) T KOG0434 848 LIVIHSDEEYLEDIKSLERYILEEL 872 (1070) T ss_pred EEEECCCHHHHHHHHHHHHHHHHHH T ss_conf 3787487899877999999999860 No 241 >cd06376 PBP1_mGluR_groupIII Ligand-binding domain of the group III metabotropic glutamate receptor. Ligand-binding domain of the group III metabotropic glutamate receptor, a family which contains mGlu4R, mGluR6R, mGluR7, and mGluR8; all of which inhibit adenylyl cyclase. The metabotropic glutamate receptor is a member of the family C of G-protein-coupled receptors that transduce extracellular signals into G-protein activation and ultimately into intracellular responses. The mGluRs are classified into three groups which comprise eight subtypes. Probab=20.15 E-value=61 Score=13.27 Aligned_cols=21 Identities=29% Similarity=0.444 Sum_probs=11.8 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHH Q ss_conf 9999717888999988999999987 Q gi|254780200|r 20 GVLLVNLGTPDGHDFFSLRRYLREF 44 (343) Q Consensus 20 gvLL~n~G~P~~~~~~~V~~yL~~f 44 (343) ||+ |+-.+...-.|...|..| T Consensus 106 avI----G~~~S~~s~~va~ll~~f 126 (463) T cd06376 106 GVI----GASASSVSIMVANILRLF 126 (463) T ss_pred EEE----CCCCCCHHHHHHHHHHCC T ss_conf 998----998741899999987247 No 242 >TIGR02416 CO_dehy_Mo_lg carbon-monoxide dehydrogenase, large subunit; InterPro: IPR012780 Carbon monoxide dehydrogenases catalyse the reversible oxidation of carbon monoxide to carbon dioxide as shown below , :CO + H(2)O + A == CO(2) + AH(2) A variety of electron acceptors can be used by these enzymes including ferredoxin, methyl viologen and benzyl viologen. Under anaerobic conditions, carbon monoxide is utilised by a number of methanogenic archaea and acetogenic, sulphate-reducing or phototrophic bacteria. Under aerobic conditions carbon monoxide can be used as sole carbon and energy source by the carboxidotrophic bacteria, a taxonomically diverse group of obligate or facultative chemolithoautotrophic species. The anaerobic dehydrogenases are mostly nickel-containing iron-sulphur enzymes, while the The aerobic enzymes categorised so far are molybdenum-containing iron-sulphur proteins. This entry represents the large subunit of the molybdenum-containing carbon monoxide dehydrogenase found in aerobic bacteria. This enzyme is a dimer of heterotrimers, where each heterotrimer consists of a small iron-sulphur subunit, a medium FAD-containing flavoprotein subunit and a large molybdopterin-containign subunit which contains the active site , . The large subunit is divided into two domains. The N-terminal domain is composed of mainly of a five-and four-stranded mixed beta-sheet and interacts with the small and medium subunits. The C-terminal domain contains a region which interacts with its counterpart in the overall dimer, providing the main dimer contact, and a beta-sheet surrounded by alpha helices. The active site located in this subunit contains a novel dinuclear heterometal [CuSMo(=O)OH] cluster.; GO: 0005507 copper ion binding, 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0030151 molybdenum ion binding. Probab=20.13 E-value=59 Score=13.37 Aligned_cols=33 Identities=21% Similarity=0.322 Sum_probs=20.6 Q ss_pred CHHHHHHH--HHHHHHHHCCCCCCCEE-------------EEECCCCC Q ss_conf 88899999--99999972667567047-------------86225777 Q gi|254780200|r 226 YGCHCHKT--ARLLKEFLSWPDDRFKI-------------CFQSRFGR 258 (343) Q Consensus 226 Y~~q~~~t--~~~i~~~l~~~~~~~~l-------------~fQSr~Gp 258 (343) -+-|=++| +|.|++.||+|.+...+ +|=||..| T Consensus 499 sQGQGH~TTyAQIiA~ELGIP~e~I~v~~GdTd~aPyGLGTYGSRSTP 546 (775) T TIGR02416 499 SQGQGHETTYAQIIATELGIPAEDIDVEEGDTDTAPYGLGTYGSRSTP 546 (775) T ss_pred CCCCCCCCHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCCCCCCCCCC T ss_conf 369864304666655552788011067207878888566567765542 Done!