Query gi|254781051|ref|YP_003065464.1| alpha-ketoglutarate decarboxylase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 957 No_of_seqs 189 out of 1404 Neff 4.9 Searched_HMMs 39220 Date Mon May 30 06:02:45 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781051.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK09404 kgd alpha-ketoglutara 100.0 0 0 3148.4 91.8 919 2-957 1-930 (931) 2 PRK12270 kgd alpha-ketoglutara 100.0 0 0 3043.7 90.6 838 93-957 386-1233(1234) 3 KOG0450 consensus 100.0 0 0 2749.6 75.3 918 11-957 47-1012(1017) 4 COG0567 SucA 2-oxoglutarate de 100.0 0 0 2708.0 71.0 897 6-957 1-905 (906) 5 KOG0451 consensus 100.0 0 0 2190.4 53.4 835 98-957 53-913 (913) 6 TIGR00239 2oxo_dh_E1 2-oxoglut 100.0 0 0 1717.5 57.0 924 14-956 1-990 (990) 7 cd02016 TPP_E1_OGDC_like Thiam 100.0 0 0 1047.6 24.4 262 224-494 1-265 (265) 8 pfam00676 E1_dh Dehydrogenase 100.0 7E-27 1.8E-31 223.0 27.6 291 218-544 3-298 (303) 9 TIGR03182 PDH_E1_alph_y pyruva 99.9 2.9E-19 7.3E-24 165.5 29.6 296 208-542 1-300 (315) 10 cd02000 TPP_E1_PDC_ADC_BCADC T 99.9 2.4E-19 6.1E-24 166.0 27.8 287 214-539 1-292 (293) 11 CHL00149 odpA pyruvate dehydro 99.9 4.2E-19 1.1E-23 164.2 28.8 314 198-543 11-328 (343) 12 TIGR03181 PDH_E1_alph_x pyruva 99.9 8.8E-19 2.2E-23 161.8 28.2 295 201-543 16-314 (341) 13 COG1071 AcoA Pyruvate/2-oxoglu 99.8 9.6E-15 2.4E-19 131.2 27.9 312 204-552 22-339 (358) 14 pfam02779 Transket_pyr Transke 99.7 2.3E-16 5.8E-21 143.5 11.3 171 603-796 2-173 (174) 15 KOG0225 consensus 99.6 2.7E-12 6.9E-17 112.7 22.7 303 196-544 45-357 (394) 16 smart00861 Transket_pyr Transk 99.4 2.3E-12 6E-17 113.2 9.3 164 606-793 2-168 (168) 17 PRK12315 1-deoxy-D-xylulose-5- 99.2 7.9E-07 2E-11 71.3 30.5 409 333-911 113-536 (581) 18 PRK11892 pyruvate dehydrogenas 99.2 1.3E-08 3.4E-13 84.8 21.2 296 613-955 151-458 (464) 19 PTZ00182 3-methyl-2-oxobutanat 99.2 8.7E-09 2.2E-13 86.2 20.0 308 577-937 13-333 (355) 20 CHL00144 odpB pyruvate dehydro 99.2 1.5E-08 3.8E-13 84.4 20.6 294 613-954 13-318 (326) 21 PRK09212 pyruvate dehydrogenas 99.1 7.1E-08 1.8E-12 79.3 21.2 300 607-955 7-319 (327) 22 cd07036 TPP_PYR_E1-PDHc-beta_l 98.9 2E-08 5.1E-13 83.4 9.2 156 611-792 4-167 (167) 23 COG0022 AcoB Pyruvate/2-oxoglu 98.7 1.1E-05 2.8E-10 62.7 19.2 265 611-918 9-283 (324) 24 PRK05444 1-deoxy-D-xylulose-5- 98.6 0.00014 3.6E-09 54.3 29.9 421 333-931 117-545 (576) 25 KOG1182 consensus 97.7 0.0039 9.9E-08 43.4 14.2 335 203-582 80-426 (432) 26 PRK12571 1-deoxy-D-xylulose-5- 97.6 0.011 2.9E-07 40.0 26.4 226 668-931 364-597 (642) 27 cd02012 TPP_TK Thiamine pyroph 97.5 0.0011 2.8E-08 47.6 9.5 123 338-480 110-232 (255) 28 KOG0525 consensus 97.3 0.0019 4.8E-08 45.8 7.8 246 623-910 60-314 (362) 29 cd07033 TPP_PYR_DXS_TK_like Py 97.2 0.0045 1.2E-07 42.9 9.4 145 615-792 8-156 (156) 30 cd02007 TPP_DXS Thiamine pyrop 97.1 0.0075 1.9E-07 41.3 9.5 113 332-472 74-187 (195) 31 cd02006 TPP_Gcl Thiamine pyrop 96.9 0.017 4.4E-07 38.5 9.7 166 285-477 9-196 (202) 32 PRK08659 2-oxoglutarate ferred 96.7 0.043 1.1E-06 35.5 11.0 90 837-934 274-365 (377) 33 cd00568 TPP_enzymes Thiamine p 96.7 0.01 2.6E-07 40.3 7.7 146 298-473 11-167 (168) 34 cd02002 TPP_BFDC Thiamine pyro 96.6 0.024 6.2E-07 37.4 8.8 157 285-473 2-177 (178) 35 cd02013 TPP_Xsc_like Thiamine 96.5 0.032 8.2E-07 36.5 8.8 173 285-493 5-191 (196) 36 PRK11269 glyoxylate carboligas 96.5 0.017 4.3E-07 38.6 7.3 165 286-477 372-558 (591) 37 cd02005 TPP_PDC_IPDC Thiamine 96.4 0.035 9E-07 36.2 8.9 159 286-475 4-174 (183) 38 TIGR03336 IOR_alpha indolepyru 96.2 0.09 2.3E-06 33.1 9.9 158 246-474 4-163 (595) 39 cd02004 TPP_BZL_OCoD_HPCL Thia 96.0 0.056 1.4E-06 34.6 8.0 156 286-474 1-171 (172) 40 pfam02775 TPP_enzyme_C Thiamin 95.9 0.06 1.5E-06 34.5 7.8 110 333-472 28-150 (150) 41 PRK07119 2-ketoisovalerate fer 95.6 0.1 2.6E-06 32.7 7.9 61 837-902 246-308 (350) 42 cd02015 TPP_AHAS Thiamine pyro 95.5 0.18 4.6E-06 30.8 9.1 156 285-476 2-175 (186) 43 PRK06112 acetolactate synthase 95.4 0.055 1.4E-06 34.7 6.1 38 431-474 525-562 (581) 44 PRK08273 pyruvate decarboxylas 95.4 0.25 6.4E-06 29.7 9.4 204 285-537 367-591 (597) 45 cd02003 TPP_IolD Thiamine pyro 95.3 0.29 7.3E-06 29.3 9.5 157 287-475 2-184 (205) 46 cd02014 TPP_POX Thiamine pyrop 95.1 0.24 6.2E-06 29.8 8.5 158 285-474 3-172 (178) 47 cd02010 TPP_ALS Thiamine pyrop 95.0 0.26 6.6E-06 29.6 8.7 156 286-475 1-170 (177) 48 KOG0523 consensus 94.8 0.38 9.7E-06 28.3 8.9 412 340-881 125-552 (632) 49 PRK09627 oorA 2-oxoglutarate-a 94.5 0.094 2.4E-06 32.9 5.3 89 837-934 275-365 (375) 50 COG0674 PorA Pyruvate:ferredox 94.2 0.13 3.3E-06 31.9 5.5 93 835-934 255-351 (365) 51 PRK05899 transketolase; Review 94.0 0.58 1.5E-05 27.0 31.5 457 339-902 124-615 (661) 52 PRK07525 sulfoacetaldehyde ace 93.9 0.38 9.8E-06 28.3 7.4 39 750-788 518-558 (589) 53 PRK06154 hypothetical protein; 93.9 0.3 7.6E-06 29.2 6.8 24 246-270 11-34 (556) 54 COG0028 IlvB Thiamine pyrophos 93.7 0.12 3.1E-06 32.1 4.6 162 285-477 360-533 (550) 55 TIGR03457 sulphoacet_xsc sulfo 93.4 0.4 1E-05 28.2 6.8 15 763-777 526-540 (579) 56 COG4231 Indolepyruvate ferredo 93.3 0.75 1.9E-05 26.1 9.7 211 401-632 108-332 (640) 57 cd02008 TPP_IOR_alpha Thiamine 93.1 0.49 1.2E-05 27.6 6.8 157 265-472 6-174 (178) 58 PRK08199 acetolactate synthase 92.8 0.83 2.1E-05 25.8 7.7 19 251-270 6-24 (553) 59 TIGR03393 indolpyr_decarb indo 92.6 0.71 1.8E-05 26.3 7.0 23 446-472 141-163 (539) 60 PRK05858 hypothetical protein; 92.4 0.94 2.4E-05 25.4 7.5 41 598-642 210-252 (543) 61 PRK06882 acetolactate synthase 92.1 0.55 1.4E-05 27.2 6.0 45 593-641 206-252 (574) 62 cd03371 TPP_PpyrDC Thiamine py 91.8 1.1 2.9E-05 24.8 9.1 109 337-475 52-163 (188) 63 PRK09259 putative oxalyl-CoA d 91.4 0.57 1.5E-05 27.0 5.5 37 750-788 508-544 (572) 64 PRK07064 hypothetical protein; 91.4 0.88 2.2E-05 25.6 6.5 20 758-777 495-514 (544) 65 PRK06276 acetolactate synthase 91.4 0.58 1.5E-05 27.0 5.5 113 333-475 419-543 (586) 66 PRK08366 vorA 2-ketoisovalerat 91.3 1.3 3.3E-05 24.3 8.6 79 837-918 264-344 (394) 67 PRK07449 2-succinyl-6-hydroxy- 91.2 1.3 3.3E-05 24.3 8.0 20 251-271 13-32 (548) 68 cd00956 Transaldolase_FSA Tran 91.1 1 2.6E-05 25.1 6.5 100 414-517 23-139 (211) 69 TIGR03394 indol_phenyl_DC indo 90.7 0.93 2.4E-05 25.4 6.0 25 445-473 139-163 (535) 70 PRK09124 pyruvate dehydrogenas 90.6 0.38 9.7E-06 28.4 4.0 37 749-787 488-524 (574) 71 PRK08979 acetolactate synthase 90.6 0.56 1.4E-05 27.1 4.8 15 762-776 516-530 (572) 72 PRK08327 acetolactate synthase 90.4 1.5 3.9E-05 23.8 8.0 14 251-264 11-24 (568) 73 PRK08617 acetolactate synthase 90.4 0.57 1.4E-05 27.0 4.7 15 764-778 364-378 (552) 74 TIGR00553 pabB para-aminobenzo 90.4 0.081 2.1E-06 33.4 0.4 35 145-179 268-306 (368) 75 PRK09107 acetolactate synthase 90.3 0.67 1.7E-05 26.5 5.0 38 749-788 511-548 (594) 76 PRK06456 acetolactate synthase 90.2 1 2.6E-05 25.1 5.9 45 593-641 207-253 (572) 77 cd02001 TPP_ComE_PpyrDC Thiami 90.2 1.6 4E-05 23.7 7.5 104 337-474 46-154 (157) 78 PRK06546 pyruvate dehydrogenas 89.7 0.99 2.5E-05 25.2 5.5 71 430-517 491-561 (578) 79 PRK01362 putative translaldola 89.6 1.8 4.5E-05 23.3 6.7 58 414-475 24-87 (214) 80 PRK06725 acetolactate synthase 89.3 1.2 3.1E-05 24.5 5.7 46 593-642 214-261 (570) 81 PRK09622 porA pyruvate flavodo 88.7 2 5.1E-05 22.9 9.4 81 835-919 266-349 (407) 82 TIGR03254 oxalate_oxc oxalyl-C 87.7 1.8 4.5E-05 23.3 5.7 25 752-776 499-523 (554) 83 PRK08367 porA pyruvate ferredo 87.6 2.3 5.9E-05 22.4 8.7 80 836-918 261-342 (395) 84 cd03375 TPP_OGFOR Thiamine pyr 87.4 2.4 6E-05 22.4 8.0 160 267-473 3-182 (193) 85 PRK08978 acetolactate synthase 87.2 2 5.1E-05 22.9 5.7 28 445-472 129-157 (548) 86 PRK07710 acetolactate synthase 87.0 2.5 6.4E-05 22.2 6.4 24 246-270 15-38 (571) 87 PRK12656 fructose-6-phosphate 85.5 2.2 5.7E-05 22.6 5.2 58 414-474 24-90 (222) 88 PRK06965 acetolactate synthase 85.4 3 7.6E-05 21.6 5.8 62 749-811 519-584 (587) 89 PRK12474 hypothetical protein; 84.8 3.2 8.1E-05 21.4 8.8 19 251-270 9-27 (518) 90 CHL00099 ilvB acetohydroxyacid 84.2 2.2 5.6E-05 22.6 4.7 38 749-788 509-546 (588) 91 KOG1185 consensus 84.0 3.4 8.6E-05 21.2 7.6 108 678-788 436-555 (571) 92 PRK07789 acetolactate synthase 83.7 3.5 8.9E-05 21.1 7.5 72 749-821 533-608 (612) 93 PRK12376 putative translaldola 82.5 2.5 6.3E-05 22.2 4.4 59 413-475 30-96 (238) 94 PRK06466 acetolactate synthase 82.5 3.8 9.8E-05 20.8 5.4 19 251-270 8-26 (574) 95 pfam10237 N6-adenineMlase Prob 82.5 2.8 7.1E-05 21.8 4.7 104 746-855 14-133 (161) 96 PRK08611 pyruvate oxidase; Pro 81.8 4.1 0.0001 20.6 5.7 38 749-788 488-525 (576) 97 PRK12655 fructose-6-phosphate 81.0 3.5 8.8E-05 21.1 4.7 57 414-474 24-88 (220) 98 cd03372 TPP_ComE Thiamine pyro 80.5 4.5 0.00011 20.3 8.5 113 337-489 46-165 (179) 99 PRK13029 2-oxoacid ferredoxin 78.5 5.1 0.00013 19.8 9.2 62 846-915 1021-1095(1186) 100 cd06250 M14_PaAOTO_like An unc 78.4 5.2 0.00013 19.8 5.5 106 368-474 29-168 (359) 101 cd01987 USP_OKCHK USP domain i 78.1 5.3 0.00013 19.7 7.0 59 205-276 42-100 (124) 102 PRK06457 pyruvate dehydrogenas 77.9 5.3 0.00014 19.7 7.7 38 749-788 477-514 (549) 103 PRK07524 hypothetical protein; 77.3 5.5 0.00014 19.6 7.9 19 251-270 6-24 (534) 104 PRK13030 2-oxoacid ferredoxin 76.2 5.9 0.00015 19.4 9.7 45 655-700 814-858 (1168) 105 PRK09193 indolepyruvate ferred 75.3 6.2 0.00016 19.2 9.5 35 655-690 809-843 (1155) 106 cd02009 TPP_SHCHC_synthase Thi 74.9 6.3 0.00016 19.1 9.0 119 324-473 42-173 (175) 107 PRK07586 hypothetical protein; 73.5 6.8 0.00017 18.9 9.1 20 938-957 484-503 (514) 108 TIGR01824 PabB-clade2 para-ami 73.0 2.2 5.5E-05 22.7 1.8 41 140-180 272-317 (373) 109 COG0147 TrpE Anthranilate/para 72.9 1.1 2.9E-05 24.8 0.3 36 765-800 227-269 (462) 110 PRK06048 acetolactate synthase 71.9 7.3 0.00019 18.6 8.9 46 594-643 208-255 (562) 111 PTZ00089 transketolase; Provis 71.8 7.4 0.00019 18.6 7.8 118 340-471 123-245 (670) 112 PRK05940 anthranilate synthase 71.1 1.3 3.3E-05 24.3 0.3 76 243-326 189-269 (459) 113 PRK07508 para-aminobenzoate sy 71.0 1.2 3.1E-05 24.5 0.2 20 97-116 30-49 (377) 114 PRK07418 acetolactate synthase 70.6 7.8 0.0002 18.4 7.8 38 749-788 517-554 (615) 115 PRK13572 anthranilate synthase 70.5 1.7 4.2E-05 23.5 0.7 10 767-776 203-212 (430) 116 TIGR02351 thiH thiazole biosyn 70.1 2.6 6.6E-05 22.1 1.7 118 9-172 233-366 (378) 117 PRK13574 anthranilate synthase 69.9 1.6 4.1E-05 23.7 0.6 11 767-777 195-205 (422) 118 KOG1184 consensus 69.8 4.9 0.00012 20.0 3.0 80 444-529 192-280 (561) 119 COG3961 Pyruvate decarboxylase 69.7 5.1 0.00013 19.8 3.1 85 439-529 184-277 (557) 120 PRK07282 acetolactate synthase 68.3 8.7 0.00022 18.1 8.1 25 750-774 501-525 (566) 121 PRK13570 anthranilate synthase 67.2 1.9 4.9E-05 23.1 0.5 17 436-452 249-265 (454) 122 PRK13565 anthranilate synthase 67.1 2.4 6E-05 22.4 1.0 21 247-267 222-242 (490) 123 COG1590 Uncharacterized conser 66.1 6.2 0.00016 19.2 2.9 70 431-515 92-170 (208) 124 PRK09070 hypothetical protein; 65.4 2 5.2E-05 22.9 0.3 41 434-484 242-283 (451) 125 PRK07092 benzoylformate decarb 64.1 10 0.00026 17.5 8.8 32 246-278 2-33 (521) 126 PRK13571 anthranilate synthase 64.0 3 7.6E-05 21.6 1.0 23 245-267 231-253 (506) 127 PRK12653 fructose-6-phosphate 63.2 11 0.00027 17.4 5.1 57 414-474 24-88 (220) 128 PRK08155 acetolactate synthase 63.2 11 0.00027 17.4 8.9 26 237-270 10-35 (564) 129 PRK13569 anthranilate synthase 62.4 2.9 7.4E-05 21.7 0.7 24 245-268 232-255 (506) 130 PRK06404 anthranilate synthase 60.7 3.4 8.7E-05 21.2 0.8 87 230-326 91-182 (351) 131 TIGR00959 ffh signal recogniti 60.3 9.6 0.00024 17.8 3.0 144 431-589 154-343 (439) 132 PRK13564 anthranilate synthase 58.5 3.1 7.8E-05 21.5 0.2 25 244-268 244-268 (521) 133 COG2166 sufE Cysteine desulfur 58.5 4 0.0001 20.7 0.8 36 870-906 38-74 (144) 134 PRK13573 anthranilate synthase 57.1 3.3 8.4E-05 21.3 0.2 51 244-301 231-281 (503) 135 COG1091 RfbD dTDP-4-dehydrorha 56.8 14 0.00035 16.6 4.5 112 386-530 158-277 (281) 136 COG0499 SAM1 S-adenosylhomocys 56.2 11 0.00028 17.3 2.8 74 395-472 42-127 (420) 137 smart00604 MD MD domain. 55.3 6.8 0.00017 18.9 1.6 13 324-337 21-33 (145) 138 PRK06163 hypothetical protein; 54.0 15 0.00038 16.3 9.0 106 337-474 61-172 (202) 139 pfam11300 DUF3102 Protein of u 53.5 15 0.00039 16.3 3.6 68 100-171 58-129 (130) 140 pfam02256 Fe_hyd_SSU Iron hydr 53.0 12 0.00031 17.0 2.6 29 8-36 14-45 (60) 141 cd06230 M14_ASTE_ASPA_like The 52.1 14 0.00035 16.6 2.7 87 374-475 6-95 (252) 142 pfam00456 Transketolase_N Tran 51.4 16 0.00042 16.0 13.6 127 340-480 119-249 (333) 143 pfam08423 Rad51 Rad51. Rad51 i 51.1 16 0.00042 16.0 3.0 60 716-776 54-129 (261) 144 PRK07912 salicylate synthase M 51.0 7.8 0.0002 18.4 1.3 33 766-798 218-257 (449) 145 TIGR02730 carot_isom carotene 50.8 7.4 0.00019 18.6 1.2 21 370-390 471-491 (506) 146 TIGR01820 TrpE-arch anthranila 50.6 9.7 0.00025 17.7 1.8 34 146-179 411-448 (508) 147 TIGR01989 COQ6 Ubiquinone bios 50.5 4.9 0.00012 20.0 0.2 50 231-294 247-296 (481) 148 KOG4748 consensus 50.2 12 0.0003 17.1 2.1 156 453-624 127-297 (364) 149 TIGR02461 osmo_MPG_phos mannos 48.8 18 0.00045 15.7 3.7 87 155-241 95-204 (248) 150 TIGR02876 spore_yqfD sporulati 47.5 9.9 0.00025 17.7 1.4 22 743-764 14-35 (406) 151 KOG0182 consensus 46.8 14 0.00035 16.6 2.0 89 389-477 30-118 (246) 152 COG3737 Uncharacterized conser 46.6 6.9 0.00018 18.9 0.5 49 780-846 31-79 (127) 153 COG3914 Spy Predicted O-linked 46.2 12 0.0003 17.1 1.6 47 420-479 302-349 (620) 154 KOG4450 consensus 46.0 5.7 0.00014 19.5 -0.0 21 367-387 21-41 (168) 155 pfam01074 Glyco_hydro_38 Glyco 45.9 20 0.0005 15.4 6.4 69 709-779 188-258 (269) 156 KOG0630 consensus 45.1 11 0.00028 17.3 1.3 22 271-292 241-262 (838) 157 COG3394 Uncharacterized protei 44.3 6.2 0.00016 19.2 -0.0 13 187-199 134-146 (257) 158 TIGR02134 transald_staph trans 43.4 7.6 0.00019 18.5 0.3 115 385-530 12-134 (237) 159 KOG2243 consensus 43.2 12 0.00031 17.0 1.3 18 613-630 1059-1076(5019) 160 pfam03492 Methyltransf_7 SAM d 42.9 22 0.00055 15.1 5.4 67 440-525 137-221 (331) 161 TIGR00838 argH argininosuccina 42.8 22 0.00055 15.1 4.3 38 502-539 33-70 (469) 162 TIGR02743 TraW type-F conjugat 42.0 22 0.00057 15.0 3.2 33 185-217 143-177 (217) 163 TIGR02237 recomb_radB DNA repa 41.5 16 0.00042 16.0 1.8 20 747-766 54-82 (223) 164 PRK10696 C32 tRNA thiolase; Pr 41.5 11 0.00028 17.4 0.8 121 172-298 91-236 (311) 165 PTZ00124 adenosine deaminase; 41.1 23 0.00058 14.9 10.1 90 370-461 219-323 (362) 166 TIGR02470 sucr_synth sucrose s 40.2 18 0.00046 15.7 1.8 140 306-458 543-731 (790) 167 pfam05489 Phage_tail_X Phage T 40.1 2.9 7.5E-05 21.7 -2.2 14 469-482 10-23 (60) 168 COG0299 PurN Folate-dependent 39.3 24 0.00062 14.7 4.6 109 381-508 7-116 (200) 169 pfam02780 Transketolase_C Tran 39.1 24 0.00062 14.7 10.7 96 838-937 11-108 (124) 170 PRK10150 beta-D-glucuronidase; 39.1 25 0.00062 14.7 3.5 43 748-790 342-415 (605) 171 KOG1000 consensus 39.0 18 0.00046 15.7 1.6 22 746-767 584-605 (689) 172 pfam01855 POR_N domain. This f 38.6 25 0.00063 14.6 6.3 100 380-486 65-167 (230) 173 pfam05637 Glyco_transf_34 gala 37.6 25 0.00065 14.6 2.2 58 446-506 22-80 (242) 174 cd03415 CbiX_CbiC Archaeal sir 37.2 26 0.00067 14.5 3.8 35 445-487 11-45 (125) 175 COG5618 Predicted periplasmic 36.8 26 0.00067 14.4 5.6 109 579-715 43-156 (206) 176 cd06254 M14_ASTE_ASPA_like_4 A 36.6 27 0.00068 14.4 3.3 86 371-474 22-115 (288) 177 TIGR00936 ahcY adenosylhomocys 36.4 27 0.00068 14.4 3.8 79 393-471 36-127 (422) 178 PRK04235 hypothetical protein; 36.4 27 0.00068 14.4 3.1 72 431-515 92-170 (200) 179 PRK13559 hypothetical protein; 36.2 27 0.00069 14.4 4.2 58 418-480 74-131 (363) 180 pfam10109 FluMu_gp41 Mu-like p 36.0 27 0.00069 14.3 2.3 45 752-799 36-88 (105) 181 cd01854 YjeQ_engC YjeQ/EngC. 36.0 17 0.00044 15.9 1.1 70 738-811 89-162 (287) 182 PRK04081 hypothetical protein; 35.7 5 0.00013 19.9 -1.6 61 461-537 44-106 (212) 183 PRK05439 pantothenate kinase; 35.5 14 0.00035 16.6 0.6 64 560-623 95-166 (312) 184 TIGR01706 NAPA periplasmic nit 35.5 12 0.00031 17.0 0.3 32 609-640 433-477 (930) 185 cd04821 PA_M28_1_2 PA_M28_1_2: 35.4 22 0.00056 15.0 1.6 51 377-427 24-77 (157) 186 TIGR03494 salicyl_syn salicyla 35.3 13 0.00033 16.8 0.5 51 286-336 6-56 (425) 187 pfam02030 Lipoprotein_8 Hypoth 35.2 21 0.00052 15.3 1.5 35 372-408 267-301 (493) 188 TIGR02812 fadR_gamma fatty aci 35.1 22 0.00056 15.0 1.6 24 165-191 203-226 (275) 189 COG1162 Predicted GTPases [Gen 34.9 19 0.00049 15.5 1.3 70 738-811 90-165 (301) 190 PRK05301 pyrroloquinoline quin 34.5 29 0.00073 14.2 4.8 51 203-265 44-95 (375) 191 TIGR02114 coaB_strep phosphopa 34.5 25 0.00064 14.6 1.8 13 561-573 186-198 (253) 192 cd04814 PA_M28_1 PA_M28_1: Pro 34.5 20 0.00051 15.3 1.3 41 378-418 23-63 (142) 193 cd02524 G1P_cytidylyltransfera 34.4 29 0.00073 14.2 2.8 28 248-275 30-57 (253) 194 TIGR00564 trpE_most anthranila 34.1 22 0.00057 15.0 1.5 40 140-179 426-470 (543) 195 TIGR01418 PEP_synth phosphoeno 33.9 13 0.00033 16.8 0.2 25 440-464 543-572 (877) 196 pfam05221 AdoHcyase S-adenosyl 33.9 29 0.00074 14.1 2.6 36 436-471 92-127 (430) 197 PRK06772 salicylate synthase I 33.8 19 0.00048 15.6 1.1 35 561-595 262-296 (434) 198 KOG2414 consensus 33.4 26 0.00065 14.5 1.7 59 402-460 292-356 (488) 199 cd01316 CAD_DHOase The eukaryo 33.3 30 0.00076 14.0 2.7 13 238-250 240-252 (344) 200 PTZ00186 heat shock 70 kDa pre 33.3 28 0.00072 14.2 1.9 36 251-286 341-381 (657) 201 COG3250 LacZ Beta-galactosidas 33.1 30 0.00076 14.0 2.8 16 374-389 306-321 (808) 202 PRK13410 molecular chaperone D 32.6 29 0.00074 14.1 1.9 36 367-402 428-463 (719) 203 pfam04952 AstE_AspA Succinylgl 32.2 31 0.00079 13.9 3.3 95 371-474 7-107 (289) 204 pfam06898 YqfD Putative stage 32.0 26 0.00066 14.5 1.5 30 440-469 324-354 (383) 205 cd07034 TPP_PYR_PFOR_IOR-alpha 32.0 31 0.0008 13.9 6.3 84 380-471 70-157 (160) 206 COG0652 PpiB Peptidyl-prolyl c 31.8 26 0.00066 14.5 1.5 15 805-819 33-47 (158) 207 TIGR01420 pilT_fam twitching m 31.0 18 0.00046 15.7 0.6 72 105-185 86-172 (350) 208 KOG0628 consensus 30.9 32 0.00083 13.8 3.6 116 372-496 265-425 (511) 209 PRK01433 hscA chaperone protei 30.8 26 0.00067 14.4 1.4 22 246-267 293-314 (595) 210 pfam04194 PDCD2_C Programmed c 30.7 24 0.00061 14.8 1.2 24 581-608 117-140 (169) 211 COG2374 Predicted extracellula 30.6 27 0.00068 14.4 1.4 15 384-398 275-289 (798) 212 smart00087 PTH Parathyroid hor 30.5 17 0.00044 15.8 0.4 19 183-201 16-34 (36) 213 TIGR01283 nifE nitrogenase MoF 30.5 33 0.00084 13.7 2.6 53 411-469 92-161 (470) 214 pfam12076 Wax2_C WAX2 C-termin 30.5 33 0.00084 13.7 2.8 37 751-790 16-52 (164) 215 pfam05320 Pox_RNA_Pol_19 Poxvi 30.4 12 0.00031 17.0 -0.3 17 105-121 80-96 (167) 216 COG5499 Predicted transcriptio 30.3 28 0.0007 14.3 1.4 70 96-173 36-111 (120) 217 pfam03652 UPF0081 Uncharacteri 30.2 33 0.00085 13.7 3.3 62 158-229 70-133 (134) 218 COG3608 Predicted deacylase [G 30.1 33 0.00085 13.7 4.6 94 369-477 51-148 (331) 219 PRK13246 dihydrobiliverdin:fer 30.0 33 0.00085 13.7 1.9 21 210-230 205-225 (236) 220 cd01815 BMSC_UbP_N BMSC_UbP (b 29.9 14 0.00035 16.5 -0.1 46 790-851 1-57 (75) 221 KOG1520 consensus 29.9 10 0.00026 17.6 -0.8 175 603-821 140-343 (376) 222 cd00671 ArgRS_core This is the 29.6 34 0.00087 13.6 2.4 20 103-122 28-47 (267) 223 pfam00425 Chorismate_bind chor 29.5 17 0.00044 15.9 0.3 40 140-179 163-207 (239) 224 PRK06923 isochorismate synthas 29.4 21 0.00053 15.2 0.7 17 373-389 350-368 (399) 225 cd02754 MopB_Nitrate-R-NapA-li 29.1 27 0.0007 14.3 1.3 61 509-581 225-285 (565) 226 KOG2560 consensus 29.1 25 0.00063 14.6 1.0 87 381-500 291-377 (529) 227 cd05816 CBM20_DPE2_repeat2 Dis 28.9 35 0.00089 13.5 2.2 67 616-697 9-76 (99) 228 cd02128 PA_TfR PA_TfR: Proteas 28.8 35 0.00089 13.5 2.8 36 286-327 146-182 (183) 229 PRK05183 hscA chaperone protei 28.7 35 0.0009 13.5 1.8 25 374-398 442-466 (621) 230 KOG1119 consensus 28.5 15 0.00039 16.2 -0.1 32 675-712 122-155 (199) 231 TIGR03142 cytochro_ccmI cytoch 28.4 36 0.00091 13.5 4.3 38 504-541 38-81 (117) 232 PRK13566 anthranilate synthase 28.3 18 0.00046 15.7 0.2 72 249-327 246-322 (724) 233 PRK01889 ribosome-associated G 28.3 24 0.00062 14.7 0.9 26 739-764 122-147 (353) 234 TIGR00612 ispG_gcpE 4-hydroxy- 28.3 36 0.00091 13.4 2.3 157 132-332 114-294 (633) 235 pfam01786 AOX Alternative oxid 28.1 36 0.00091 13.4 2.7 34 107-142 153-186 (214) 236 KOG4156 consensus 28.1 32 0.00082 13.8 1.5 58 782-850 892-951 (1329) 237 COG5097 MED6 RNA polymerase II 28.0 14 0.00036 16.5 -0.4 24 491-514 98-126 (210) 238 TIGR00576 dut dUTP diphosphata 28.0 22 0.00055 15.1 0.6 17 722-738 45-61 (151) 239 TIGR01161 purK phosphoribosyla 27.8 21 0.00055 15.1 0.5 121 103-251 13-136 (386) 240 COG1359 Uncharacterized conser 27.7 19 0.00048 15.5 0.2 30 668-697 28-57 (100) 241 KOG3171 consensus 27.5 10 0.00026 17.6 -1.2 91 239-331 170-265 (273) 242 cd04820 PA_M28_1_1 PA_M28_1_1: 27.2 29 0.00075 14.1 1.1 44 373-416 20-63 (137) 243 COG4098 comFA Superfamily II D 27.1 24 0.00061 14.7 0.7 15 681-695 397-411 (441) 244 PRK05647 purN phosphoribosylgl 27.1 37 0.00095 13.3 6.3 108 381-508 8-117 (200) 245 COG4223 Uncharacterized protei 27.1 25 0.00063 14.7 0.7 53 696-755 313-376 (422) 246 pfam12550 GCR1_C Transcription 26.8 24 0.00062 14.7 0.7 24 30-53 4-27 (81) 247 smart00833 CobW_C Cobalamin sy 26.8 38 0.00096 13.3 1.7 66 866-931 4-78 (92) 248 pfam10925 DUF2680 Protein of u 26.8 38 0.00096 13.3 5.0 43 493-538 12-54 (59) 249 COG0769 MurE UDP-N-acetylmuram 26.7 38 0.00097 13.2 2.4 40 418-457 396-436 (475) 250 pfam06785 UPF0242 Uncharacteri 26.6 18 0.00046 15.7 -0.0 75 558-644 123-197 (401) 251 COG2210 Peroxiredoxin family p 26.5 38 0.00097 13.2 2.3 41 107-147 68-113 (137) 252 PRK01286 deoxyguanosinetriphos 26.4 38 0.00098 13.2 2.6 16 106-122 40-55 (338) 253 PRK11869 2-oxoacid ferredoxin 26.2 39 0.00098 13.2 3.9 109 338-473 69-195 (284) 254 PTZ00075 S-adenosyl-L-homocyst 26.2 39 0.00098 13.2 3.5 209 393-636 40-270 (476) 255 PRK03906 mannonate dehydratase 26.2 27 0.0007 14.3 0.8 29 164-197 87-115 (389) 256 PRK01184 hypothetical protein; 26.2 13 0.00034 16.7 -0.8 54 413-470 4-68 (183) 257 pfam11339 DUF3141 Protein of u 26.1 11 0.00027 17.4 -1.3 42 447-504 426-467 (581) 258 KOG4269 consensus 25.7 25 0.00065 14.6 0.6 24 847-870 922-946 (1112) 259 KOG3350 consensus 25.6 39 0.001 13.1 2.6 74 805-889 130-209 (217) 260 PRK06389 argininosuccinate lya 25.5 40 0.001 13.1 3.9 31 367-400 183-213 (434) 261 COG1166 SpeA Arginine decarbox 25.5 40 0.001 13.1 3.0 300 158-537 62-426 (652) 262 TIGR00747 fabH 3-oxoacyl-(acyl 25.5 19 0.00047 15.6 -0.1 25 674-721 110-134 (329) 263 PRK00098 ribosome-associated G 25.4 33 0.00084 13.7 1.1 153 739-919 92-253 (298) 264 cd02753 MopB_Formate-Dh-H Form 25.3 40 0.001 13.1 3.8 58 510-579 223-280 (512) 265 KOG1845 consensus 25.3 40 0.001 13.1 1.5 39 838-876 661-702 (775) 266 TIGR02403 trehalose_treC alpha 25.3 40 0.001 13.1 1.7 35 92-126 169-203 (555) 267 pfam02633 Creatininase Creatin 25.2 40 0.001 13.1 5.3 52 230-284 71-122 (235) 268 pfam02836 Glyco_hydro_2_C Glyc 25.2 40 0.001 13.1 3.0 23 369-391 16-38 (297) 269 PRK08849 2-octaprenyl-3-methyl 25.2 35 0.0009 13.5 1.2 15 443-457 9-23 (384) 270 COG0128 AroA 5-enolpyruvylshik 25.0 37 0.00094 13.3 1.3 68 564-637 348-415 (428) 271 pfam12565 DUF3747 Protein of u 24.8 32 0.00082 13.8 1.0 14 619-632 96-109 (183) 272 TIGR02010 IscR iron-sulfur clu 24.8 41 0.001 13.0 1.5 17 17-33 36-52 (140) 273 TIGR03159 cas_Csc1 CRISPR-asso 24.8 41 0.001 13.0 1.6 29 604-636 25-53 (223) 274 PHA00098 hypothetical protein 24.7 32 0.00082 13.8 1.0 32 477-517 38-70 (112) 275 PRK11544 hycI hydrogenase 3 ma 24.7 41 0.001 13.0 3.5 40 733-777 13-53 (156) 276 pfam04007 DUF354 Protein of un 24.6 6.8 0.00017 18.9 -2.5 37 422-460 89-125 (335) 277 TIGR02169 SMC_prok_A chromosom 24.6 20 0.00051 15.3 -0.1 44 155-199 135-187 (1202) 278 TIGR00655 PurU formyltetrahydr 24.5 41 0.0011 13.0 2.0 70 160-231 101-172 (294) 279 COG1444 Predicted P-loop ATPas 24.4 28 0.00071 14.3 0.6 46 581-628 537-593 (758) 280 KOG2582 consensus 24.3 42 0.0011 12.9 1.7 78 383-485 332-409 (422) 281 PRK12713 flgJ peptidoglycan hy 24.3 31 0.00079 13.9 0.8 30 2-31 55-84 (339) 282 COG3769 Predicted hydrolase (H 24.2 42 0.0011 12.9 4.2 36 154-192 98-133 (274) 283 pfam02676 TYW3 Methyltransfera 24.2 42 0.0011 12.9 6.0 72 431-514 90-167 (191) 284 cd01020 TroA_b Metal binding p 23.7 43 0.0011 12.9 10.3 105 417-536 35-140 (264) 285 KOG0674 consensus 23.7 20 0.0005 15.4 -0.3 40 696-735 79-118 (406) 286 cd06422 NTP_transferase_like_1 23.6 43 0.0011 12.9 2.5 26 248-273 31-56 (221) 287 PRK12712 flgJ peptidoglycan hy 23.6 33 0.00084 13.7 0.8 32 1-32 54-85 (344) 288 cd01989 STK_N The N-terminal d 23.5 43 0.0011 12.8 6.4 50 222-275 68-117 (146) 289 TIGR00908 2A0305 ethanolamine 23.5 23 0.00059 14.9 0.0 26 770-795 381-408 (456) 290 cd06251 M14_ASTE_ASPA_like_1 A 23.4 43 0.0011 12.8 3.9 86 374-474 28-116 (287) 291 TIGR02382 wecD_rffC TDP-D-fuco 23.2 35 0.0009 13.5 0.9 37 629-665 99-136 (201) 292 pfam08754 consensus 23.2 44 0.0011 12.8 5.5 37 439-475 3-40 (113) 293 PRK10446 ribosomal protein S6 23.1 25 0.00065 14.6 0.2 49 100-150 14-63 (300) 294 PRK13463 phosphatase PhoE; Pro 23.1 44 0.0011 12.8 8.5 97 222-336 103-203 (203) 295 TIGR02744 TrbI_Ftype type-F co 23.1 44 0.0011 12.8 4.5 41 499-540 38-78 (112) 296 TIGR03234 OH-pyruv-isom hydrox 23.0 44 0.0011 12.8 3.9 54 214-267 82-142 (254) 297 pfam10544 T5orf172 T5orf172 do 23.0 27 0.0007 14.3 0.3 40 412-452 8-50 (79) 298 cd06425 M1P_guanylylT_B_like_N 22.9 43 0.0011 12.8 1.3 26 248-273 32-57 (233) 299 PRK09060 dihydroorotase; Valid 22.6 27 0.0007 14.3 0.3 39 746-793 345-384 (444) 300 PRK08948 consensus 22.5 38 0.00096 13.3 1.0 11 563-573 360-370 (392) 301 PRK12753 transketolase; Review 22.4 45 0.0011 12.7 24.0 184 663-876 401-590 (662) 302 PRK11069 recC exonuclease V su 22.4 41 0.001 13.0 1.1 12 20-31 66-77 (1122) 303 cd05817 CBM20_DSP Dual-specifi 22.3 45 0.0012 12.7 1.9 62 620-697 12-75 (100) 304 pfam09011 DUF1898 Domain of un 22.3 32 0.00082 13.8 0.6 22 737-758 81-102 (164) 305 TIGR01950 SoxR redox-sensitive 22.3 35 0.00089 13.5 0.7 28 861-888 45-72 (142) 306 KOG4166 consensus 22.3 45 0.0012 12.7 2.6 12 239-250 90-101 (675) 307 cd02742 GH20_hexosaminidase Be 22.2 45 0.0012 12.6 1.7 18 430-447 142-159 (303) 308 pfam09820 AAA-ATPase_like Pred 22.1 35 0.0009 13.5 0.7 11 233-243 188-198 (279) 309 PRK12700 flgH flagellar basal 22.0 41 0.001 13.0 1.1 19 851-869 64-82 (230) 310 PRK13738 conjugal transfer pil 22.0 46 0.0012 12.6 5.4 21 179-199 124-144 (210) 311 TIGR00821 EII-GUT PTS system, 22.0 27 0.00068 14.4 0.1 12 746-757 52-63 (181) 312 pfam11432 DUF3197 Protein of u 21.8 46 0.0012 12.6 1.8 51 236-294 52-102 (113) 313 TIGR01063 gyrA DNA gyrase, A s 21.7 46 0.0012 12.6 3.4 106 87-221 260-381 (864) 314 PRK13389 UTP--glucose-1-phosph 21.7 46 0.0012 12.6 2.4 25 248-272 40-64 (302) 315 pfam11810 DUF3332 Domain of un 21.6 41 0.001 13.0 1.0 31 397-427 53-85 (175) 316 cd06252 M14_ASTE_ASPA_like_2 A 21.6 47 0.0012 12.6 3.1 90 371-475 39-132 (316) 317 pfam03881 Fructosamin_kin Fruc 21.6 33 0.00084 13.7 0.5 15 468-484 208-222 (287) 318 PRK12451 arginyl-tRNA syntheta 21.4 47 0.0012 12.5 2.4 38 419-480 300-338 (562) 319 PRK12709 flgJ peptidoglycan hy 21.4 40 0.001 13.1 0.9 31 1-31 51-81 (318) 320 PRK12288 ribosome-associated G 21.4 45 0.0011 12.7 1.2 75 740-820 134-215 (344) 321 pfam04908 SH3BGR SH3-binding, 21.4 47 0.0012 12.5 4.2 63 164-229 22-90 (92) 322 TIGR01719 euk_UDPppase uridine 21.3 27 0.00068 14.4 -0.0 22 100-121 102-125 (301) 323 TIGR02735 purC_vibrio phosphor 21.1 29 0.00074 14.1 0.1 23 35-57 87-109 (365) 324 KOG1278 consensus 20.9 32 0.00083 13.8 0.4 10 748-757 541-550 (628) 325 PRK12308 bifunctional arginino 20.8 48 0.0012 12.5 5.4 45 366-426 188-232 (614) 326 PRK00192 mannosyl-3-phosphogly 20.8 48 0.0012 12.4 4.0 11 104-114 30-40 (275) 327 pfam03992 ABM Antibiotic biosy 20.3 34 0.00087 13.6 0.4 28 669-696 27-54 (77) 328 TIGR00520 asnASE_II L-asparagi 20.3 31 0.00079 13.9 0.1 15 94-108 49-63 (360) 329 pfam01521 Fe-S_biosyn Iron-sul 20.3 24 0.0006 14.8 -0.5 33 673-712 29-64 (91) 330 PRK12702 mannosyl-3-phosphogly 20.3 50 0.0013 12.4 3.4 23 154-176 95-117 (302) 331 PRK07333 2-octaprenyl-6-methox 20.2 45 0.0012 12.7 1.0 43 562-611 360-402 (403) 332 pfam06950 DUF1293 Protein of u 20.1 45 0.0012 12.7 1.0 23 486-517 51-73 (115) 333 PRK11181 23S rRNA (guanosine-2 20.1 50 0.0013 12.4 7.0 21 250-270 109-129 (244) 334 COG1208 GCD1 Nucleoside-diphos 20.1 50 0.0013 12.4 2.8 36 248-296 33-68 (358) 335 pfam04329 consensus 20.1 39 0.001 13.1 0.7 33 745-777 57-89 (135) No 1 >PRK09404 kgd alpha-ketoglutarate decarboxylase; Reviewed Probab=100.00 E-value=0 Score=3148.39 Aligned_cols=919 Identities=52% Similarity=0.880 Sum_probs=872.7 Q ss_pred CCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 62388999834763678968999999998738050798999998736764333333434443222222221112333445 Q gi|254781051|r 2 LQDLNKMFSLSSFLDGTNCCYIEDLYKSYQEDPSSVCKDWYPLFSFLDENSEEYDNLEDGIASFLKEESVASAVFSQKKS 81 (957) Q Consensus 2 ~~~~~~~f~~sSFL~g~Na~YIEeLYekYL~DP~SVd~sWr~fF~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (957) |+++|+.|+++|||+|+|+.|||+||++|++||+|||++|+.||++++.+......... +... ... T Consensus 1 ~~~~~~~~~~~SfL~G~Na~YIEeLYe~yl~DP~SVd~sWr~fF~~l~~~~~~~~~~~s---~~~~-----------~~~ 66 (931) T PRK09404 1 QQSMMKQFLESSFLFGANAGYIEELYERYLEDPDSVDEEWRAFFDGLPDGGSAPDVAHS---PVRE-----------SFR 66 (931) T ss_pred CCHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCC---CCCC-----------CCC T ss_conf 94489999854466788999999999998539765789999999727877765445776---3200-----------011 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCCCCCCHHHCCCCHHHCCCEEECCCC-CCCCCC Q ss_conf 5566655563367899999999999998817045401677777766787767876669997881715725665-787777 Q gi|254781051|r 82 AVESSCSLGETQSVKDFFQVMKMIDAYRSYGHFKANIDPLGYNSHQKDLSELSPAHYGFVKADYDRKICMKGV-LGLESA 160 (957) Q Consensus 82 ~~~~~~~~~~~~~~~kq~rV~~LI~AYR~~GHl~A~lDPLgl~~~~~~~peL~~~~yGLse~DLd~~f~~~~~-~g~~~~ 160 (957) ...........+...++++|.+||+|||.|||+.|+||||++.. ++..|+|+|++|||+++|||++|+++++ +|.+.+ T Consensus 67 ~~~~~~~~~~~~~~~k~~rV~~LI~AYR~~GHl~A~LDPL~l~~-~~~~peL~~~~yGLte~DLd~~f~~~~~~~g~~~~ 145 (931) T PRK09404 67 RLAASASVSDPDGDAKQVKVLQLINAYRFRGHLAANLDPLGLWK-RPDVPELDPAFYGLTEADLDRTFNTGSLALGKETA 145 (931) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHEECCCCCCCC-CCCCCCCCHHHCCCCHHHHCCCCCCCCCCCCCCCC T ss_conf 12335666653667888999999999998472220479999999-99898769766698989938864368766788888 Q ss_pred CHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECC Q ss_conf 89999999998406430277724899899999999841457788879889999999999999899999864033622225 Q gi|254781051|r 161 TIPEIVDVLSHLYCSNIGVEFMHIVDSIERDWVRNTIEDYDFSSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFGA 240 (957) Q Consensus 161 tL~eIi~~L~~tYCgsIGvEymHI~d~eer~Wlq~riE~~~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsL 240 (957) ||+|||++|++||||+||+|||||.|++||+|||+|||+..+.+.|++++|++||++|++||+||+|||+||||+||||| T Consensus 146 tLreIi~~Lk~tYCgsIG~EymHI~d~eer~Wl~~riE~~~~~~~~s~eeKk~IL~~L~~Ae~FE~FL~~Kf~g~KRFsl 225 (931) T PRK09404 146 TLREIIEALKKTYCGSIGVEFMHISDPEERRWLQERIESPRGRPTFSAEEKKAILERLTAAEGFERFLHTKFVGQKRFSL 225 (931) T ss_pred CHHHHHHHHHHHHCCCHHHEEECCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCEEECC T ss_conf 29999999999846758640007899899999999973847789989999999999999999999999887256336505 Q ss_pred CCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEC Q ss_conf 64658999999999998737910799806765526899988537989999985588743223455510015026889835 Q gi|254781051|r 241 DGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQIC 320 (957) Q Consensus 241 EG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~ 320 (957) ||||||||+|++||+.|+++||++||||||||||||||+|||+||++.||+||+|+.+.+..++||||||||+|++++++ T Consensus 226 EG~EslIp~l~~li~~~~~~gv~~iviGMaHRGRLNvL~nvl~kp~~~iF~EF~g~~~~~~~~sGDVKYHlG~s~~~~~~ 305 (931) T PRK09404 226 EGGESLIPMLDEIIRRAGKLGVKEIVIGMAHRGRLNVLVNVLGKPPRDLFAEFEGKHADEVLGSGDVKYHLGFSSDRETD 305 (931) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHCCCHHHHHHHHCCCCCHHCCCCCCCCCCCEEEEEEEEC T ss_conf 67016999999999999876987689625654318999988568999999874589830103567715367056566508 Q ss_pred CCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCC Q ss_conf 96189997178773100044553236777764166654455565445404788862575333467258876451067744 Q gi|254781051|r 321 GKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYT 400 (957) Q Consensus 321 g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~ 400 (957) |++|||+|+|||||||+|||||+|+||||||+++|. ++++|||||||||||||||||||||||||+|+||+ T Consensus 306 g~~v~~sL~~NPSHLEaVnPVv~G~~RA~Qd~~~d~---------~~~kvlpiliHGDAAfaGQGvV~Etl~ls~l~gY~ 376 (931) T PRK09404 306 GGEVHLSLAFNPSHLEIVNPVVEGSVRARQDRRGDT---------DRSKVLPILIHGDAAFAGQGVVAETLNLSQLRGYR 376 (931) T ss_pred CCEEEEEECCCCCCEEEECCEECCCHHHHHHHHCCC---------CCCEEEEEEEECCHHHCCCCHHHHHHHHHCCCCCC T ss_conf 965899834897642566661102013445651777---------65625788874444321465669876541388865 Q ss_pred CCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCC Q ss_conf 36579998416020124701024677840011220340587426788999999999999999738973998642112565 Q gi|254781051|r 401 VAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGH 480 (957) Q Consensus 401 tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GH 480 (957) |||||||||||||||||+|+|+|||+||||||||++||||||||||||||++|++||++||++|+||||||||||||||| T Consensus 377 tGGTIHiIvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~eyR~~F~kDVvIDlvcYRr~GH 456 (931) T PRK09404 377 TGGTIHIVINNQIGFTTNPRDSRSTPYCTDVAKMIQAPIFHVNGDDPEAVVFATQLALEYRQKFKKDVVIDLVCYRRHGH 456 (931) T ss_pred CCCEEEEEECCCCCCCCCHHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCC T ss_conf 48648999616422214730213676631167874684688648987999999999999999858870797776520377 Q ss_pred CCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC----- Q ss_conf 643542235578888867189989999999864489898999999999999977566654125876566665686----- Q gi|254781051|r 481 NEGDEPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPEKLGLLHNGE----- 555 (957) Q Consensus 481 NE~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~~~~~~~~~~----- 555 (957) ||+|||+||||+||++|++||+++++|+++|+++|+||++++++++++|++.|+++++.++.+.|...+++...| T Consensus 457 NE~DeP~fTQP~mY~~I~~hp~v~~~Y~~~L~~~g~it~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~w~~~~~ 536 (931) T PRK09404 457 NEGDEPSFTQPLMYKKIKKHPTTRKLYADKLVAEGVITEEEADEMVNEYRDALDAGFEVVKEYKPNKADKLDGDWSPYLG 536 (931) T ss_pred CCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHCCC T ss_conf 76688666787888887649988999999987649989999999999999999999876401367776312365322057 Q ss_pred ----CCCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCC Q ss_conf ----2666778999999998874368998620567788899899873343564211478988898863583157850100 Q gi|254781051|r 556 ----NERKNSVSKEILKKIGSSIIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGYKVRLSGQDC 631 (957) Q Consensus 556 ----~~~~T~v~~~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~VRlsGQD~ 631 (957) ..++|||+++.|+.|+.+++++|++|++||+|+|++++|++|++++.+|||||||+|||||||.||++|||||||| T Consensus 537 ~~~~~~~~T~v~~~~L~~~~~~~~~~P~~f~~h~~v~k~~~~R~~~~~~~~~iDWa~AE~LAf~SLl~eG~~VRlsGQD~ 616 (931) T PRK09404 537 HEWDEAVDTGVPLERLKELAEKLTTIPEGFNVHPKVKKILEDRRKMAEGEKPIDWGMAEALAFASLLDEGYPVRLSGQDS 616 (931) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCC T ss_conf 55567788877999999999997038876642689999999999998358875557889999999986696266446415 Q ss_pred CCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHH Q ss_conf 22333230279997788844556344278984599982431111013432022113865311100002442454102256 Q gi|254781051|r 632 ERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILD 711 (957) Q Consensus 632 ~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiD 711 (957) +||||||||||||||+|+++|+||+||+++|++|+||||+||||||||||||||+++|++|||||||||||+|||||||| T Consensus 617 ~RGTFshRHavl~dq~t~~~y~pL~~~~~~q~~~~v~nS~LSE~avlgFEyGYs~~~p~~LviWEAQFGDF~NgAQiiiD 696 (931) T PRK09404 617 GRGTFSHRHAVLHDQKTGETYIPLNHLSDGQASFEVIDSPLSEEAVLGFEYGYSTAEPNTLVIWEAQFGDFANGAQVVID 696 (931) T ss_pred CCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECHHHHHHHHHHHHHHCCCCCCCEEEEHHHHCCCCCCCEEEEE T ss_conf 66543453158886379958857655546686499993412577776545300025887505424331455467348975 Q ss_pred HHHHHHHHHHCCCCCCEEECCCCCCCCCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECC Q ss_conf 55420257623336813543688888887656440578875214378269724889999999999743534886898566 Q gi|254781051|r 712 QFITAGEQKWLCTSNLVCLLPHGYEGQGPEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAP 791 (957) Q Consensus 712 qfiss~e~KW~~~sglvllLPHGyeGqGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tP 791 (957) |||||||+||+|+||||||||||||||||||||||||||||||||||||||||||||||||+|||||+|+|||||||||| T Consensus 697 qfi~s~e~KW~~~sglvllLPHGyeGqGPEHSSaR~ERfLQl~ae~N~~V~~~ttpA~~FH~LRrQ~~r~~rkPLiv~tp 776 (931) T PRK09404 697 QFISSGEQKWGRLSGLVMLLPHGYEGQGPEHSSARLERFLQLCAEDNMQVCNPTTPAQYFHLLRRQVLRPFRKPLVVMTP 776 (931) T ss_pred EHHHHHHHHHCCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHCHHCCCEEECCCCHHHHHHHHHHHHCCCCCCCEEEECC T ss_conf 00103677761016748987787678896444410789997530238189905987999999999850346787598473 Q ss_pred CHHHCCCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCCCCCEEEEEEECC Q ss_conf 02210801268878937998603515776433444544457954576899868857899999999769986899984014 Q gi|254781051|r 792 KSLLRHKRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRNIADIYLLRIEQL 871 (957) Q Consensus 792 KsLLR~~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~~~~vaivRiEqL 871 (957) |||||||.|+|+++||++|+ |++||+|+. ..++++|+|||||||||||||+++|++++.+|||||||||| T Consensus 777 KsLLR~~~~~S~~~e~~~g~-F~~vi~d~~---------~~~~~~v~rvi~CsGKvyydL~~~r~~~~~~~vaivRiEqL 846 (931) T PRK09404 777 KSLLRHKLAVSSLEELAEGS-FQPVIGDID---------ELDPKKVKRVVLCSGKVYYDLLEARRKRGIDDVAIVRIEQL 846 (931) T ss_pred HHHHCCCCCCCCHHHCCCCC-CEEECCCCC---------CCCHHCCCEEEECCCHHHHHHHHHHHHCCCCCEEEEEEHHC T ss_conf 67634854579989907994-516327987---------68840086899827689999999999769997799982112 Q ss_pred CCCHHHHHHHHHHHCCCC-EEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHH Q ss_conf 868289999998537998-3998067853178457899769999875056776018975366467233867899999999 Q gi|254781051|r 872 YPFPEDYLIKVLSRFVQA-EIVWCQEEPQNMGAWTFIEPYLEKVLHSIGADYSRVRYVGRLPSASTAVGHMSRHLEQLSS 950 (957) Q Consensus 872 yPfP~~~l~~~l~~y~~~-e~vW~QEEp~N~GaW~~v~~rl~~~l~~~~~~~~~~~y~gR~~~aspA~G~~~~H~~e~~~ 950 (957) ||||+++|+++|++|||+ +||||||||+|||||+||+|||+.++.+ +..+++||||+++||||||+.++|++||++ T Consensus 847 ~PfP~~~l~~~l~~y~~~~~~vW~QEEp~N~GaW~~v~~rl~~~~~~---~~~~~~y~gR~~~aspA~G~~~~H~~e~~~ 923 (931) T PRK09404 847 YPFPHKELAAELAKYPNVKEVVWCQEEPKNQGAWYFIQHHLEEVLPE---GAQKLRYAGRPASASPAVGYMSLHKKQQEA 923 (931) T ss_pred CCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHH---CCCCEEEECCCCCCCCCCCCHHHHHHHHHH T ss_conf 79978999999986899871799655756357888999999998622---387227956798888887887888999999 Q ss_pred HHHHHHC Q ss_conf 9998719 Q gi|254781051|r 951 FIEDALK 957 (957) Q Consensus 951 li~~a~~ 957 (957) ||++||. T Consensus 924 ii~~a~~ 930 (931) T PRK09404 924 LVEDALG 930 (931) T ss_pred HHHHHHC T ss_conf 9999858 No 2 >PRK12270 kgd alpha-ketoglutarate decarboxylase; Reviewed Probab=100.00 E-value=0 Score=3043.66 Aligned_cols=838 Identities=47% Similarity=0.820 Sum_probs=806.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCCCCCCHHHCCCCHHHCCCEEECCCCCCCCCCCHHHHHHHHHHH Q ss_conf 67899999999999998817045401677777766787767876669997881715725665787777899999999984 Q gi|254781051|r 93 QSVKDFFQVMKMIDAYRSYGHFKANIDPLGYNSHQKDLSELSPAHYGFVKADYDRKICMKGVLGLESATIPEIVDVLSHL 172 (957) Q Consensus 93 ~~~~kq~rV~~LI~AYR~~GHl~A~lDPLgl~~~~~~~peL~~~~yGLse~DLd~~f~~~~~~g~~~~tL~eIi~~L~~t 172 (957) ..+.|++||++||+|||.||||.|+||||+++. +.+|+|+|++|||+++|||++|+++++.|.+.+||||||++||++ T Consensus 386 ~~~~k~~rV~~LI~AYR~rGHl~A~lDPL~~~~--~~~pdLd~~~~GLt~~DLdr~F~~g~~~G~~~~tLreIl~~Lr~t 463 (1234) T PRK12270 386 DIVDKNARVMELIHAYRVRGHLMADTDPLEYDQ--RSHPDLDIETHGLTLWDLDREFPVGGFGGKETMKLRDVLGVLRDA 463 (1234) T ss_pred HHHHHHHHHHHHHHHHHHHCHHHHCCCCCCCCC--CCCCCCCHHHCCCCHHHHCCEEECCCCCCCCCCCHHHHHHHHHHH T ss_conf 555689999999999997360432279888899--999987965559997992985625884786777799999999987 Q ss_pred HCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCHHHHHHHHHH Q ss_conf 06430277724899899999999841457788879889999999999999899999864033622225646589999999 Q gi|254781051|r 173 YCSNIGVEFMHIVDSIERDWVRNTIEDYDFSSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFGADGSEVIIPAIEE 252 (957) Q Consensus 173 YCgsIGvEymHI~d~eer~Wlq~riE~~~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsLEG~EslIP~L~~ 252 (957) |||+||+|||||.|++||+|||+|||... ..++.++|++||++|++||+||+|||+||+|+|||||||||||||+||+ T Consensus 464 YCg~IG~EymHI~d~eer~Wlq~riE~~~--~~~s~~~k~~iL~~L~~Ae~FE~FL~~Ky~GqKRFsLEG~EslIP~ld~ 541 (1234) T PRK12270 464 YCRKVGIEYMHIQDPEQRKWLQDRVERKH--EKPTRAEQKYILSKLNAAEAFETFLQTKYVGQKRFSLEGAESVIPLLDA 541 (1234) T ss_pred HCCCCEEEEECCCCHHHHHHHHHHHHCCC--CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECCCCHHHHHHHHHH T ss_conf 43461356767798899999999975788--8999899999999999999999999864677534356772589999999 Q ss_pred HHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCC-CCCCCCCCCCCCCCCEEEEEEE-CCCEEEEEECC Q ss_conf 999987379107998067655268999885379899999855887-4322345551001502688983-59618999717 Q gi|254781051|r 253 IIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKG-PSEKEYSGDVKYHLGLCCNRQI-CGKDVKLLLQS 330 (957) Q Consensus 253 ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~-~~~~~~sGDVKYHLG~s~~~~~-~g~~v~lsL~~ 330 (957) ||++|++.||++||||||||||||||+||||||+++||+||+|+. |....+|||||||||+|+++.+ +|++|||+|++ T Consensus 542 ~i~~aa~~Gv~evviGMaHRGRLNVL~Ni~gKp~~~iF~EFeg~~~p~~~~gsGDVKYHlG~s~~~~~~~G~~v~lsL~~ 621 (1234) T PRK12270 542 VIDQAAEHGLDEVVIGMAHRGRLNVLANIVGKPYSQIFREFEGNLDPRQAHGSGDVKYHLGAEGTYTQMFGNEIKVSLAA 621 (1234) T ss_pred HHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCHHHHHHHHCCCCCCHHCCCCCCCCCCCCCCCEEECCCCCEEEEEECC T ss_conf 99999876988799816874079999987089999999985799880212677872446765311333789778898448 Q ss_pred CCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEEC Q ss_conf 87731000445532367777641666544555654454047888625753334672588764510677443657999841 Q gi|254781051|r 331 NPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIIN 410 (957) Q Consensus 331 NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvN 410 (957) ||||||+|||||+|+||||||+++|.. ++.+||||||||||||+||||||||||||+|+||+|||||||||| T Consensus 622 NPSHLEaVnPVv~G~~RAkQd~~~~~~--------~~~~vlpiliHGDAAFaGQGVV~ETlnlS~L~gY~tGGTIHiVVN 693 (1234) T PRK12270 622 NPSHLEAVDPVLEGIVRAKQDLLDKGE--------DGFTVLPLLLHGDAAFAGQGVVAETLNLSQLRGYRTGGTIHIVVN 693 (1234) T ss_pred CCCCCEEECCEEEEEHHHHHHHHCCCC--------CCCEEEEEEEECHHHHCCCCHHHHHHHHCCCCCCCCCCEEEEEEC T ss_conf 986330116733100233356543577--------775146788743033225777898753115898766875899961 Q ss_pred CCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCC Q ss_conf 60201247010246778400112203405874267889999999999999997389739986421125656435422355 Q gi|254781051|r 411 NQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQ 490 (957) Q Consensus 411 NQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQ 490 (957) |||||||+|.|+|||+||||||||++||||||||||||||++|++||++|||+|+|||||||||||||||||+|||+||| T Consensus 694 NQIGFTT~P~daRSS~YcTDvAK~i~aPIfHVNgDDPEAVv~v~~LA~eyRq~F~kDVvIDLVcYRR~GHNE~DeP~fTQ 773 (1234) T PRK12270 694 NQIGFTTAPESSRSSEYATDVAKMIQAPIFHVNGDDPEAVVRVARLAFEYRQRFKKDVVIDLVCYRRRGHNEGDDPSMTQ 773 (1234) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCCCCCCCCCC T ss_conf 76011378211456667215788757856885589879999999999999998688727866754203776667866688 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC------CCCCCCCCCH Q ss_conf 7888886718998999999986448989899999999999997756665412587656666568------6266677899 Q gi|254781051|r 491 PMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPEKLGLLHNG------ENERKNSVSK 564 (957) Q Consensus 491 P~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~~~~~~~~~------~~~~~T~v~~ 564 (957) |+||++|++||+++++|+++|+++|+||++|++++.++|++.|+++|+.++.+.+....+.... ...++|+|+. T Consensus 774 PlMY~~I~~hpsv~~lY~~~Li~eG~it~~e~~~~~~~~~~~Le~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~T~v~~ 853 (1234) T PRK12270 774 PLMYDLIDEKRSVRKLYTEALIGRGDISMEEAEQALRDYQGQLERVFNEVREAEKKPIEPSESVESDQMLPAGLATAVSK 853 (1234) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCH T ss_conf 88999987399889999999986799899999999999999999999986413777776433456633456777887699 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEEEEE Q ss_conf 99999988743689986205677888998998733435642114789888988635831578501002233323027999 Q gi|254781051|r 565 EILKKIGSSIIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGYKVRLSGQDCERGTFSHRHAILC 644 (957) Q Consensus 565 ~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~VRlsGQD~~RGTFshRHavl~ 644 (957) +.|++|+.+++++|++|++||||+|++++|++|++++ +|||||||+|||||||.||++||||||||+|||||||||||| T Consensus 854 e~L~~i~~~~~~~Pe~F~~Hpkv~kil~~R~~m~~~g-~iDWg~AE~LAfgSLL~eG~~VRLsGQD~~RGTFshRHavl~ 932 (1234) T PRK12270 854 EVLARIGDAFVNLPEGFTVHPRVKPVLEKRREMAYEG-KIDWAFGELLAFGSLLLEGKPVRLSGQDSRRGTFSQRHAVLI 932 (1234) T ss_pred HHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCCEEEE T ss_conf 9999999997039998852689999999999998548-977899999999999708970575254046543231213564 Q ss_pred ECCCCCEEEEHHHCCCC-CCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 77888445563442789-84599982431111013432022113865311100002442454102256554202576233 Q gi|254781051|r 645 DQETGKRYFPLGNISKD-QGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWLC 723 (957) Q Consensus 645 dq~t~~~y~pL~~l~~~-q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~ 723 (957) ||+|+++|+||+||+++ |++|+||||+||||||||||||||+++|++|||||||||||+|||||||||||||||+||+| T Consensus 933 Dq~t~~~y~PL~~l~~~~q~~f~v~nS~LSE~avLGFEYGYS~~~P~~LviWEAQFGDFaNGAQvIIDQFIsSgE~KW~~ 1012 (1234) T PRK12270 933 DRETGEEFTPLQNLADDSQGKFLVYDSLLSEYAAMGFEYGYSVGNPDALVLWEAQFGDFANGAQTIIDEFISSGEAKWGQ 1012 (1234) T ss_pred ECCCCCEEEEHHHCCCCCCCEEEEECCCCHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHCCCEEEEEEEEECHHHHHHH T ss_conf 13678478577664836585189981521355555555244336897335406555344058648986578617776641 Q ss_pred CCCCEEECCCCCCCCCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHCCCCCCCC Q ss_conf 36813543688888887656440578875214378269724889999999999743534886898566022108012688 Q gi|254781051|r 724 TSNLVCLLPHGYEGQGPEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKSLLRHKRVVSS 803 (957) Q Consensus 724 ~sglvllLPHGyeGqGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~S~ 803 (957) +||||||||||||||||||||||||||||||||+|||||||||||||||+|||||+|+|||||||||||||||||.|+|+ T Consensus 1013 ~SgLVlLLPHGyEGQGPEHSSARiERfLQLcAe~NmqV~~pTTPAqyFHlLRRQ~~r~~RKPLIv~tPKSLLRh~~a~S~ 1092 (1234) T PRK12270 1013 RSGVVLLLPHGYEGQGPDHSSARIERFLQLCAEGNMTVAQPSTPANYFHLLRRQALSGIRRPLVVFTPKSMLRNKAAVSD 1092 (1234) T ss_pred HCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCCEEEECCHHHHCCCCCCCC T ss_conf 35757886788888996521466899998610478599934983899999999874157888698584787458334799 Q ss_pred HHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHHHH Q ss_conf 78937998603515776433444544457954576899868857899999999769986899984014868289999998 Q gi|254781051|r 804 LSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRNIADIYLLRIEQLYPFPEDYLIKVL 883 (957) Q Consensus 804 l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~~~~vaivRiEqLyPfP~~~l~~~l 883 (957) ++||++|+ ||+||+|+. ..++++|||||||||||||||+++|++++.+||||||||||||||+++|+++| T Consensus 1093 l~e~~~G~-F~~Vi~D~~---------~~~~~~VkRvilCSGKvYYDL~~~R~~~~~~dvAIvRiEQLyPfP~~~l~~~l 1162 (1234) T PRK12270 1093 VEDFTEGK-FRSVIDDPT---------VGDPAKVKRVLLCSGKLYYDLAARREKDGRDDVAIVRVEQLYPLPERRLRETL 1162 (1234) T ss_pred HHHCCCCC-CEECCCCCC---------CCCHHHCCEEEEECCHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHHHHH T ss_conf 78818995-446168987---------78841083899806721999999999769997699984002899889999999 Q ss_pred HHCCCC-EEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 537998-39980678531784578997699998750567760189753664672338678999999999998719 Q gi|254781051|r 884 SRFVQA-EIVWCQEEPQNMGAWTFIEPYLEKVLHSIGADYSRVRYVGRLPSASTAVGHMSRHLEQLSSFIEDALK 957 (957) Q Consensus 884 ~~y~~~-e~vW~QEEp~N~GaW~~v~~rl~~~l~~~~~~~~~~~y~gR~~~aspA~G~~~~H~~e~~~li~~a~~ 957 (957) ++|||+ |+|||||||+|||||+|+++||..++. +..+++|||||++||||||+.++|++||++||++||. T Consensus 1163 ~~Ypna~e~vW~QEEP~N~GAW~fv~~rl~~~l~----~~~~l~yvgR~~sASpAtG~~~~H~~Eq~~li~~AF~ 1233 (1234) T PRK12270 1163 ARYPNAEEVFWVQEEPANQGAWPFMGLNLPELLP----KLRGLRRVSRRASSSPATGSSKVHAVEQQELLDEAFA 1233 (1234) T ss_pred HHCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHC----CCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 7489988689953486635778889999999853----4885268577887888877878999999999999863 No 3 >KOG0450 consensus Probab=100.00 E-value=0 Score=2749.58 Aligned_cols=918 Identities=47% Similarity=0.813 Sum_probs=854.8 Q ss_pred HCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC-CCCCCCC Q ss_conf 347636789689999999987380507989999987367643333334344432222222211123334455-5666555 Q gi|254781051|r 11 LSSFLDGTNCCYIEDLYKSYQEDPSSVCKDWYPLFSFLDENSEEYDNLEDGIASFLKEESVASAVFSQKKSA-VESSCSL 89 (957) Q Consensus 11 ~sSFL~g~Na~YIEeLYekYL~DP~SVd~sWr~fF~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 89 (957) .++||+|+++.||||||+.|++||+||+.||.+||.++..+..........+.... .......++ ....... T Consensus 47 td~Fl~g~ss~YveEmy~AW~~dP~SVh~SWDayFrn~~~~~~~~~~a~~~p~~~~-------~~~~~t~~~~~~~~~t~ 119 (1017) T KOG0450 47 TDPFLSGTSSSYVEEMYRAWLEDPNSVHKSWDAYFRNVSAGAPPPAQASQAPLSRS-------AAVAGTQSAVAARPNTG 119 (1017) T ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC-------CCCCCCCCCCCCCCCCC T ss_conf 56444686137999999998719823555699998753367898544346887533-------11122212423466778 Q ss_pred CCHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCC-------CCCCCCCCHHHCCCCHHHCCCEEEC--CCCCC--CC Q ss_conf 6336789999999999999881704540167777776-------6787767876669997881715725--66578--77 Q gi|254781051|r 90 GETQSVKDFFQVMKMIDAYRSYGHFKANIDPLGYNSH-------QKDLSELSPAHYGFVKADYDRKICM--KGVLG--LE 158 (957) Q Consensus 90 ~~~~~~~kq~rV~~LI~AYR~~GHl~A~lDPLgl~~~-------~~~~peL~~~~yGLse~DLd~~f~~--~~~~g--~~ 158 (957) .+.+.+.++++|+.|||||+.|||..|+|||||+... ....++|.+++|||+|.|||++|.. .++++ .. T Consensus 120 ~s~~~v~~hl~vqlLiRaYQirGH~~A~LDPLGi~~~~~~~~~p~~i~~~~~l~~Ygf~E~DLDref~L~~t~Fi~~~~~ 199 (1017) T KOG0450 120 ISDKTVEDHLKVQLLIRAYQIRGHHKAKLDPLGINDADLDSSVPADIPEELTLAFYGFTEADLDREFHLPTTTFIGGGES 199 (1017) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCHHHCCCEECCCCCCCCCCCCC T ss_conf 54344999999999999998546401047864566500123478889704436543875321364330477642368976 Q ss_pred CCCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEE Q ss_conf 77899999999984064302777248998999999998414577888798899999999999998999998640336222 Q gi|254781051|r 159 SATIPEIVDVLSHLYCSNIGVEFMHIVDSIERDWVRNTIEDYDFSSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRF 238 (957) Q Consensus 159 ~~tL~eIi~~L~~tYCgsIGvEymHI~d~eer~Wlq~riE~~~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRF 238 (957) .+|||||+++|+++|||+||+|||||.|.++++||+++||... ..+|+.|+|+.||.||++|..||+||++||+..||| T Consensus 200 ~ltLreIl~rlE~~YC~~igvEfmhI~~~eqcnWir~k~EtP~-~~q~s~e~k~~il~RL~~st~FE~FLa~Kw~seKRF 278 (1017) T KOG0450 200 SLTLREILRRLEKAYCGSIGVEFMHINDLEQCNWIRQKFETPG-PMQYSHEQKRVILDRLTRSTRFEEFLATKWPSEKRF 278 (1017) T ss_pred CCCHHHHHHHHHHHHHCCCEEEEEEECCHHHHHHHHHHCCCCC-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 5529999999998763243057886085777279998636998-511577889999999887647999986508852013 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE Q ss_conf 25646589999999999987379107998067655268999885379899999855887432234555100150268898 Q gi|254781051|r 239 GADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQ 318 (957) Q Consensus 239 sLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~ 318 (957) ||||||+|||+|++||+++++.||++||||||||||||||+||++||+++||+||.|..+.+ +||||||||||.+.+|. T Consensus 279 GLEGcE~lIP~mK~iiDrS~elGVe~iviGMpHRGRLNvL~NVvRKpl~qIfseF~g~~~~D-eGSGDVKYHLG~~~~R~ 357 (1017) T KOG0450 279 GLEGCEVLIPAMKTIIDRSSELGVESIVIGMPHRGRLNVLANVVRKPLEQIFSEFSGLEAAD-EGSGDVKYHLGMYYERP 357 (1017) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCHHEEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCC-CCCCCEEEEECCCCCCC T ss_conf 65532545067898862354418021573377653268888887607999987516887776-78875346301330266 Q ss_pred E--CCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCC Q ss_conf 3--59618999717877310004455323677776416665445556544540478886257533346725887645106 Q gi|254781051|r 319 I--CGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGL 396 (957) Q Consensus 319 ~--~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l 396 (957) + +||.|+|+|++||||||||+|||+||+||+|.+.+|. +++|+|.|||||||||||||||||||+||.| T Consensus 358 ~r~s~k~i~LslVANPSHLEA~DPVV~GKtrA~q~y~~D~---------~~~k~m~ILiHGDaaFAgQGVVyET~hls~L 428 (1017) T KOG0450 358 NRVSGKNITLSLVANPSHLEAVDPVVMGKTRAEQFYTGDE---------EGKKVMGILIHGDAAFAGQGVVYETFHLSDL 428 (1017) T ss_pred CCCCCCEEEEEEECCCHHHCCCCCEEECHHHHHHHHCCCC---------CCCEEEEEEEECCHHHCCCCEEEEEECCCCC T ss_conf 6667853689995072220124862213388898860661---------3562578998545110367448886144457 Q ss_pred CCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 77443657999841602012470102467784001122034058742678899999999999999973897399864211 Q gi|254781051|r 397 SGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYR 476 (957) Q Consensus 397 ~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YR 476 (957) |+|+|||||||||||||||||+|+.+|||+|||||||+++||||||||||||||++||++|+|||++||||||||||||| T Consensus 429 P~YtT~GTvHvVvNNQIgFTTDPR~aRSspYcTDvar~v~aPIFHVNaDD~EAV~~vc~vAAeWR~tFh~DvVVDlVcyR 508 (1017) T KOG0450 429 PSYTTGGTVHVVVNNQIGFTTDPRFARSSPYCTDVARVVNAPIFHVNADDPEAVMHVCKVAAEWRKTFHKDVVVDLVCYR 508 (1017) T ss_pred CCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEEE T ss_conf 77656974899972621210486422479873568987089757655887699999999999999871667079988874 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-CCCCCCCC Q ss_conf 25656435422355788888671899899999998644898989999999999999775666541258765-66665686 Q gi|254781051|r 477 RFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPEK-LGLLHNGE 555 (957) Q Consensus 477 r~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~~-~~~~~~~~ 555 (957) ||||||.|||+||||+|||+|++||++.++|++||+++|+++++|+++..++|.+.||.+|+.+|+|.+.. .+|+..+| T Consensus 509 R~GHNE~DeP~FTQPlMYk~I~k~~~~l~~Y~ekLl~egtvs~~evd~~~~k~~~I~eeafe~sKd~~~~~~rdWL~spW 588 (1017) T KOG0450 509 RHGHNEIDEPMFTQPLMYKQIRKHKPVLQKYAEKLLSEGTVSQQEVDEEIKKYDNILEEAFERSKDYKPLHIRDWLDSPW 588 (1017) T ss_pred ECCCCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCC T ss_conf 14877556754346379999870885899999998742851499999999999999999998644454023455540888 Q ss_pred C-------------CCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 2-------------666778999999998874368998620567788899899873343564211478988898863583 Q gi|254781051|r 556 N-------------ERKNSVSKEILKKIGSSIIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGY 622 (957) Q Consensus 556 ~-------------~~~T~v~~~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~ 622 (957) . -..|||+.+.|+.||++++++||+|++|+.++|+|+.|.+|+++ ++||||+||+|||||||.||+ T Consensus 589 ~gF~spd~~~~i~~~~~Tgv~~e~L~~ig~~~ss~PE~F~~Hrgl~Ril~~R~~mi~~-~~iDwal~EalAFgsLl~EG~ 667 (1017) T KOG0450 589 PGFFSPDGQPKILPCPSTGVKEEILKHIGKVASSVPEGFKIHRGLKRILKNRAQMIKS-EGVDWALAEALAFGSLLKEGI 667 (1017) T ss_pred CCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHCCC T ss_conf 4435856787633479899997999998776236876662324199999988776651-366367888998888876072 Q ss_pred EEEEECCCCCCCCCCCCEEEEEECCCC-CEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCC Q ss_conf 157850100223332302799977888-4455634427898459998243111101343202211386531110000244 Q gi|254781051|r 623 KVRLSGQDCERGTFSHRHAILCDQETG-KRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGD 701 (957) Q Consensus 623 ~VRlsGQD~~RGTFshRHavl~dq~t~-~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGD 701 (957) +||||||||+||||||||+|||||+++ +.|+||+||.++|+.|+|+||+|||||||||||||||++||+|||||||||| T Consensus 668 hVRlSGQDVERGTFShRH~VLHDQ~~d~~~y~PlnhL~~~Qa~ytV~NSSLSEygVLGFElGYsm~sPNaLVlWEAQFGD 747 (1017) T KOG0450 668 HVRLSGQDVERGTFSHRHHVLHDQEVDKRTYIPLNHLWPNQAPYTVCNSSLSEYGVLGFELGYSMASPNALVLWEAQFGD 747 (1017) T ss_pred EEEEECCCCCCCCCCCCHHHHCCCCCCCCEECCHHHCCCCCCCEEEECCCHHHHHEECCEECCCCCCCCCEEEEEHHHCC T ss_conf 68740442255433420022042125752545165358777854564463345320000101112699716886133256 Q ss_pred CCCCCHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCCCCCCHHHHHHHCCCC------------------CEEEEE Q ss_conf 24541022565542025762333681354368888888765644057887521437------------------826972 Q gi|254781051|r 702 FANGAQVILDQFITAGEQKWLCTSNLVCLLPHGYEGQGPEHSSARLERFLQMCAEN------------------NMYVAN 763 (957) Q Consensus 702 F~NgAQviiDqfiss~e~KW~~~sglvllLPHGyeGqGPEHSSaRlERfLQl~a~~------------------N~~V~~ 763 (957) |+|+||+||||||||||+||.|+||||||||||||||||||||||+|||||||+|| ||||+| T Consensus 748 FaNtAQ~IiDQFIssGqaKW~rqsGlVllLPHGyeG~GPEHSSaR~ERfLQm~nddp~~~p~~~~~~~~Ql~dcNw~vvn 827 (1017) T KOG0450 748 FANTAQCIIDQFISSGQAKWVRQSGLVLLLPHGYEGMGPEHSSARPERFLQMSNDDPDVFPDEEEFLQRQLQDCNWQVVN 827 (1017) T ss_pred CCCCCHHHHHHHHCCCHHHHHHHCCEEEECCCCCCCCCCCCCCCCHHHHHHHCCCCCCCCCCCCHHHHHHHHCCCEEEEE T ss_conf 44630564876753423556652676997467767889654431489998741689743776057888777517759994 Q ss_pred CCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHCCCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEE Q ss_conf 48899999999997435348868985660221080126887893799860351577643344454445795457689986 Q gi|254781051|r 764 CTSPANYFHILRRQIYDRSSRPLIMMAPKSLLRHKRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILC 843 (957) Q Consensus 764 ~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~c 843 (957) |||||||||+||||++|+||||||||+||||||||+|.|+++||.+|+.||.||+|.. ....+|++||||||| T Consensus 828 ~tTPaNyfHvLRRQi~~~FRKPliif~pKsLLRHp~arS~~~ef~~g~~fq~vi~e~g-------~~~~~pe~vkrlv~c 900 (1017) T KOG0450 828 CTTPANYFHVLRRQIHRPFRKPLIIFTPKSLLRHPEARSSFSEFDEGTGFQRVIPEDG-------KAAQNPENVKRLVFC 900 (1017) T ss_pred CCCHHHHHHHHHHHHHHCCCCCEEEECCHHHHCCCCCCCCHHHHCCCCCCCEECCCCC-------CCCCCHHHCEEEEEE T ss_conf 2880799999999863112576588560776228100388777356778740241326-------555882235179996 Q ss_pred CCHHHHHHHHHHHHCC-CCCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 8857899999999769-986899984014868289999998537998399806785317845789976999987505677 Q gi|254781051|r 844 TGKVYYDLLDNRDMRN-IADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYLEKVLHSIGADY 922 (957) Q Consensus 844 sGKvyydL~~~r~~~~-~~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl~~~l~~~~~~~ 922 (957) ||||||||.++|++.+ .++|||+|||||+|||++.+++.+.+|||+|++||||||+|||+|.||+|||.++|..++ T Consensus 901 sGkVyydL~k~Rk~~~~~~~vAi~RvEQl~PFp~dli~~e~~~YpnaEivWcQEE~~NmG~w~Yv~PRl~T~l~~~~--- 977 (1017) T KOG0450 901 SGKVYYDLTKERKEVGLEGDVAITRVEQLSPFPFDLIQQELNKYPNAEIVWCQEEHKNMGAWDYVEPRLRTALKRLA--- 977 (1017) T ss_pred CCEEEHHHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHHHHCCCCEEEEHHHHHCCCCCHHHCCHHHHHHHHHHC--- T ss_conf 66675356688875275563358872011898579999999758982034201111036743220469999987607--- Q ss_pred CEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 60189753664672338678999999999998719 Q gi|254781051|r 923 SRVRYVGRLPSASTAVGHMSRHLEQLSSFIEDALK 957 (957) Q Consensus 923 ~~~~y~gR~~~aspA~G~~~~H~~e~~~li~~a~~ 957 (957) ++++|+||.++||||||+...|.+||+++++.||. T Consensus 978 r~v~Y~GR~Psaa~AtGnk~~Hl~eq~~~l~~a~~ 1012 (1017) T KOG0450 978 RPVKYAGRLPSAAPATGNKQTHLAEQKAFLNKAFQ 1012 (1017) T ss_pred CCCEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 86345366886665656256778999999999756 No 4 >COG0567 SucA 2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, and related enzymes [Energy production and conversion] Probab=100.00 E-value=0 Score=2707.99 Aligned_cols=897 Identities=52% Similarity=0.885 Sum_probs=846.6 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 89998347636789689999999987380507989999987367643333334344432222222211123334455566 Q gi|254781051|r 6 NKMFSLSSFLDGTNCCYIEDLYKSYQEDPSSVCKDWYPLFSFLDENSEEYDNLEDGIASFLKEESVASAVFSQKKSAVES 85 (957) Q Consensus 6 ~~~f~~sSFL~g~Na~YIEeLYekYL~DP~SVd~sWr~fF~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (957) |+.|.++|||+|+|++|||+||++|++||+|||++|+.||+++..+..... .. . T Consensus 1 ~~~~~~~~~l~g~n~~~ie~ly~~~~~~p~svd~~w~~~F~~~~~~~~~~~-------------------~~---~---- 54 (906) T COG0567 1 MKQFLNTSYLSGANAAYIEELYEQYLQDPNSVDPSWQEFFDGLSDPVRESF-------------------RR---L---- 54 (906) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCHHHHH-------------------HH---H---- T ss_conf 964123555577643789999999850976568899999986276433444-------------------55---3---- Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCCCCCCHHHCCCCHHHCCCEEE-CCCCCCCCCCCHHH Q ss_conf 65556336789999999999999881704540167777776678776787666999788171572-56657877778999 Q gi|254781051|r 86 SCSLGETQSVKDFFQVMKMIDAYRSYGHFKANIDPLGYNSHQKDLSELSPAHYGFVKADYDRKIC-MKGVLGLESATIPE 164 (957) Q Consensus 86 ~~~~~~~~~~~kq~rV~~LI~AYR~~GHl~A~lDPLgl~~~~~~~peL~~~~yGLse~DLd~~f~-~~~~~g~~~~tL~e 164 (957) .....+.....++++|.+||+|||.|||+.|+||||++ +++.+|+|+|++|||+++||++.|+ .++++|...++|+| T Consensus 55 ~~~~~~~~~~~~~~~~~~li~ayR~~Ghl~a~ldPL~~--~~~~~~~l~~~~~glt~~d~~~~~~~~~~~~~~~~~~l~e 132 (906) T COG0567 55 AKKGNDPDATLKSVKVLRLINAYRSRGHLHANLDPLGL--KRPDVPELDPEFHGLTEADLDETFNIGDGFLGKETMTLRE 132 (906) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHCHHHHCCCCCCC--CCCCCCCCCHHHCCCCHHHHHCCCCCCCCCCCCCCCCHHH T ss_conf 01356404667778999999999973315423784358--9998332787335999789605456555656534231999 Q ss_pred HHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCHH Q ss_conf 99999984064302777248998999999998414577888798899999999999998999998640336222256465 Q gi|254781051|r 165 IVDVLSHLYCSNIGVEFMHIVDSIERDWVRNTIEDYDFSSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFGADGSE 244 (957) Q Consensus 165 Ii~~L~~tYCgsIGvEymHI~d~eer~Wlq~riE~~~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsLEG~E 244 (957) +++.|+++|||+||+|||||.|++||+||++|||+ ..+.|+.|+|++||++|++||+||+|||+||||+||||||||| T Consensus 133 ~~~~l~~~Ycgsig~E~~hi~~~~~~~Wl~~riE~--~~~~~~~e~k~~~l~~L~~ae~fE~fl~~kf~g~KRFslEG~e 210 (906) T COG0567 133 LIEILKKTYCGSIGVEYMHISDPEEKRWLQERIES--GKPTFTAEEKKAILKRLTAAEGFERFLHTKFPGAKRFSLEGGE 210 (906) T ss_pred HHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHC--CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHH T ss_conf 99999987246503566335998999999999715--7888788999999999999999999862428897101535456 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEE Q ss_conf 89999999999987379107998067655268999885379899999855887432234555100150268898359618 Q gi|254781051|r 245 VIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQICGKDV 324 (957) Q Consensus 245 slIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v 324 (957) ||||||+++|++|+++||++||||||||||||||+|||||||+.||+||+|+...... +||||||||+|+++.+++++| T Consensus 211 slip~l~~~i~~~~~~G~~~vviGMaHRGRLNvL~nvlgKp~~~if~eF~g~~~~~~~-sGDVKYH~G~~~~~~~~~~~v 289 (906) T COG0567 211 SLIPMLDELIDRAGKQGVKEVVIGMAHRGRLNVLVNVLGKPYRDIFDEFEGKSAEPDL-SGDVKYHLGFSSDRQTDGGKV 289 (906) T ss_pred HHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCC-CCCCCCCCCCCCCCCCCCCEE T ss_conf 7889999999998866823078436554237889988379989999984799877766-666420356533334689806 Q ss_pred EEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCE Q ss_conf 99971787731000445532367777641666544555654454047888625753334672588764510677443657 Q gi|254781051|r 325 KLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGN 404 (957) Q Consensus 325 ~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGT 404 (957) ||+|++||||||+|||||+|+|||+||++.|. ++.|||||+|||||||+|||||+||||||+++||+|||| T Consensus 290 ~l~La~NPSHLE~v~PVV~G~vRa~Qd~~~d~---------~~~k~lpiliHGDAAfaGQGVV~Etlnls~~~gysvgGt 360 (906) T COG0567 290 HLSLAFNPSHLEIVNPVVEGSVRAKQDRLGDT---------ERDKVLPILIHGDAAFAGQGVVAETLNLSRLDGYSVGGT 360 (906) T ss_pred EEEECCCCCHHHHHCHHHHCCHHHHHHHHCCC---------CCCEEEEEEEECCHHCCCCCHHHHHHHHHCCCCCCCCCE T ss_conf 89834895254332446534067666541377---------645267888743720178528999987417997666975 Q ss_pred EEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCC Q ss_conf 99984160201247010246778400112203405874267889999999999999997389739986421125656435 Q gi|254781051|r 405 IHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGD 484 (957) Q Consensus 405 iHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~D 484 (957) |||||||||||||+|+|+|||+||||+|||++||||||||||||||++|+++|++||++|+|||||||+||||+||||+| T Consensus 361 iHiviNNQiGFTTsp~~sRSt~Y~TDvAKm~~aPifHVN~DDPEAv~~a~~~A~e~R~~F~kDvvIDlvcYRr~GHNE~D 440 (906) T COG0567 361 WHIVINNQIGFTTSPADARSTPYCTDVAKMIEAPIFHVNADDPEAVLFAPALALEYRNGFKKDVVIDLVCYRRHGHNEGD 440 (906) T ss_pred EEEEEECCCCCCCCCCCCCCCCCCCCHHHCCCCCEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCC T ss_conf 89997457799899652347888887444367854303568833302667778999863378722322557888987554 Q ss_pred CCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC------CCCCCCCCCCCC Q ss_conf 42235578888867189989999999864489898999999999999977566654125876------566665686266 Q gi|254781051|r 485 EPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPE------KLGLLHNGENER 558 (957) Q Consensus 485 eP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~------~~~~~~~~~~~~ 558 (957) ||++|||.||+.|++|||++++|+++|+++|++++++++.+.++|++.|+.++...+.+.+. |.+......... T Consensus 441 ePs~TqP~mY~~I~~h~t~r~~ya~~Lv~~gvis~~~~~~~~~~~r~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 520 (906) T COG0567 441 EPSVTQPLMYQKIKKHPTVRKLYADKLIAEGVISEEEADELVNDYRDALDQGFEVVKEYKEMDWLEGDWSGYLNAGLRHV 520 (906) T ss_pred CCCCCCHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 30012789999876499714447999976223259999999999999863054667666664304445654567774445 Q ss_pred CCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCC Q ss_conf 67789999999988743689986205677888998998733435642114789888988635831578501002233323 Q gi|254781051|r 559 KNSVSKEILKKIGSSIIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGYKVRLSGQDCERGTFSH 638 (957) Q Consensus 559 ~T~v~~~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~VRlsGQD~~RGTFsh 638 (957) +|+++++.|+.|+.+++++|++|++||+|+|++++|.+|.+++.++||||||.|||||||.+|++||||||||+|||||| T Consensus 521 ~t~v~~~~L~~L~~kl~~~Pe~f~~h~~v~~~~~~r~~~~~~~~~iDW~~aE~LAfatll~eG~~iRlsGqDs~RGTF~h 600 (906) T COG0567 521 DTGVPKKTLKELGKKLCTIPEGFEVHPRVKKILEDRKAMAEGGQGIDWGMAETLAFATLLDEGHPIRLSGQDSGRGTFSH 600 (906) T ss_pred CCCCCHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 66548899999878863077630124889999999998741455331667888615500036887442003179867534 Q ss_pred CEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 02799977888445563442789845999824311110134320221138653111000024424541022565542025 Q gi|254781051|r 639 RHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGE 718 (957) Q Consensus 639 RHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e 718 (957) ||+|+|||.+++.|+||+|++.+|++|+|+||+|||+||||||||||+.+|++|||||||||||+||||||||||||||| T Consensus 601 RHaVlhdq~~~~~y~PL~~l~~~q~~f~v~nS~LSEeAvlgFEYGYs~~~p~~lvlWEAQFGDFaNgAQvviDQfisSge 680 (906) T COG0567 601 RHAVLHDQKTGETYIPLNHLSKGQGKFEVINSPLSEEAVLGFEYGYSLANPKTLVLWEAQFGDFANGAQVVIDQFISSGE 680 (906) T ss_pred CCEEEECCCCCCCCCCHHHCCCCCCEEEEEECHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCEEEECCCCCCHH T ss_conf 42454235674221702331356644999814556888875653143238732210013205613687612136566598 Q ss_pred HHHCCCCCCEEECCCCCCCCCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHCCC Q ss_conf 76233368135436888888876564405788752143782697248899999999997435348868985660221080 Q gi|254781051|r 719 QKWLCTSNLVCLLPHGYEGQGPEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKSLLRHK 798 (957) Q Consensus 719 ~KW~~~sglvllLPHGyeGqGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~ 798 (957) +||+|+||||||||||||||||||||||||||||||||+|||||+|||||||||+||||++|.+|||||||||||||||| T Consensus 681 ~KW~r~sgLv~lLPHgyEGQGPEHSSaRlER~LQLcaE~NmqV~~pstpaq~fHlLRrq~~r~~rkPLiimtPKslLR~~ 760 (906) T COG0567 681 QKWGRMSGLVMLLPHGYEGQGPEHSSARLERFLQLCAENNMQVVVPSTPAQYFHLLRRQALRDFRKPLIVMTPKSLLRHK 760 (906) T ss_pred HHHHHHCCCEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCEEEECCCHHHHHHHHHHHHHHCCCCCEEECCHHHHHHCC T ss_conf 99988657227722767899976764346889986187588788338289999999998764156751855765565462 Q ss_pred CCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHH Q ss_conf 12688789379986035157764334445444579545768998688578999999997699868999840148682899 Q gi|254781051|r 799 RVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRNIADIYLLRIEQLYPFPEDY 878 (957) Q Consensus 799 ~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~~~~vaivRiEqLyPfP~~~ 878 (957) .|+|+++||++| .|++||.|.+. .++ +|+|||+|||||||||.+.|++.+.+||||+|||||||||.++ T Consensus 761 ~a~S~~~el~~~-~F~~vl~d~~~---------~~~-~v~rvvlcSGKvyydl~~~r~~~g~~dvaiiRiEqLyPfP~~~ 829 (906) T COG0567 761 LAVSSLEELTEG-TFQPVLEDIDE---------LDP-KVKRVVLCSGKVYYDLLEQREKDGRDDVAIVRIEQLYPFPAKA 829 (906) T ss_pred CCCCCHHHHCCC-HHHCCHHCCCC---------CCC-CEEEEEEECCCHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHH T ss_conf 668845552301-22100201111---------454-4135763044067999977864687652687520116766899 Q ss_pred HHHHHHHCCCC-EEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 99998537998-39980678531784578997699998750567760189753664672338678999999999998719 Q gi|254781051|r 879 LIKVLSRFVQA-EIVWCQEEPQNMGAWTFIEPYLEKVLHSIGADYSRVRYVGRLPSASTAVGHMSRHLEQLSSFIEDALK 957 (957) Q Consensus 879 l~~~l~~y~~~-e~vW~QEEp~N~GaW~~v~~rl~~~l~~~~~~~~~~~y~gR~~~aspA~G~~~~H~~e~~~li~~a~~ 957 (957) +.++|++|+|. |+|||||||+|||||.|++++|+.+++. ..+++|+|||++||||+|+.++|++||+.++++||. T Consensus 830 l~~~l~~y~~~~e~vW~QEEp~N~Gaw~~~~~~l~~~l~~----~~~l~yagRp~saSpA~G~~~~h~~eq~~l~~dal~ 905 (906) T COG0567 830 LAALLAKYPNVKEFVWCQEEPKNQGAWYYIQPHLEEVLPE----GDKLRYAGRPASASPAVGSMSVHQKQQEKLLEDALG 905 (906) T ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCC----CCHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 9999974343023556665777665277788899986212----320001478766676514199999999877675403 No 5 >KOG0451 consensus Probab=100.00 E-value=0 Score=2190.39 Aligned_cols=835 Identities=42% Similarity=0.750 Sum_probs=801.3 Q ss_pred HHHHHHHHHHHHHCCCHHHCCCCCCCCCCCCCCCCCCHHHCCCCHHHCCCEEECCCCC--CCCCCCHHHHHHHHHHHHCC Q ss_conf 9999999999988170454016777777667877678766699978817157256657--87777899999999984064 Q gi|254781051|r 98 FFQVMKMIDAYRSYGHFKANIDPLGYNSHQKDLSELSPAHYGFVKADYDRKICMKGVL--GLESATIPEIVDVLSHLYCS 175 (957) Q Consensus 98 q~rV~~LI~AYR~~GHl~A~lDPLgl~~~~~~~peL~~~~yGLse~DLd~~f~~~~~~--g~~~~tL~eIi~~L~~tYCg 175 (957) +..+.+||+|||.+||+.|++|||++....+...+|+|.+||+- .+.+++ ++...+|.++.+.|+.+||| T Consensus 53 ~a~~~rlv~afr~hGhk~A~vnPl~~l~~v~e~qeL~pa~yG~~--------~t~gLl~~~k~~~sl~~l~~~L~~IYCG 124 (913) T KOG0451 53 QANVYRLVEAFRQHGHKLAAVNPLSILTSVQELQELSPAFYGLQ--------RTDGLLSGPKVAHSLAQLEQLLKDIYCG 124 (913) T ss_pred CCCHHHHHHHHHHHCCHHHCCCCHHHHHCCHHHHHHCHHHHCCH--------HHHHHHCCCCCCCCHHHHHHHHHHHEEC T ss_conf 33278999999971613333680676534198987577750641--------1013440761012699999998763117 Q ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCHHHHHHHHHHHHH Q ss_conf 30277724899899999999841457788879889999999999999899999864033622225646589999999999 Q gi|254781051|r 176 NIGVEFMHIVDSIERDWVRNTIEDYDFSSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFGADGSEVIIPAIEEIIR 255 (957) Q Consensus 176 sIGvEymHI~d~eer~Wlq~riE~~~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsLEG~EslIP~L~~ii~ 255 (957) ++.+|||||+|.|||.||.+++|... ..+|.++|+++|-+.|++++.|++||++|||+.||||.||+|||+....++++ T Consensus 125 ~tsiEfs~v~~~eEr~W~a~nFE~l~-~e~l~keEr~~i~~Lmlksq~fD~FlatKFpTvKRYGgEGAESM~aFF~eLl~ 203 (913) T KOG0451 125 STSIEFSYVEDIEEREWLARNFETLD-QEQLGKEERCEIAELMLKSQAFDNFLATKFPTVKRYGGEGAESMLAFFWELLR 203 (913) T ss_pred CCHHHHHHHCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHH T ss_conf 20433556333778899998899888-98853798888999998666677787741635454055448889999999998 Q ss_pred HHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCC--CCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCC Q ss_conf 98737910799806765526899988537989999985588--7432234555100150268898359618999717877 Q gi|254781051|r 256 QGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGK--GPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPS 333 (957) Q Consensus 256 ~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~--~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPS 333 (957) .++..+|++||||||||||||+|+.+|+.||..||++..|. +|++....|||..||-.|-|.+..++++|++|+|||| T Consensus 204 ~sa~~~ie~viigmpHRGRlnLlt~Ll~fpP~~mFRK~~G~sEFpE~~~A~gDVlSHl~sS~dykg~~~~lhvtMlpNPS 283 (913) T KOG0451 204 DSAQANIEHVIIGMPHRGRLNLLTALLNFPPAKMFRKLSGASEFPEDIEAMGDVLSHLHSSEDYKGLGKKLHVTMLPNPS 283 (913) T ss_pred HHHHCCCCEEEEECCCCCCCHHHHHHHCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCHH T ss_conf 78862850588705666740588888439989999873073227503567777888732355440468844799558816 Q ss_pred CCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE Q ss_conf 31000445532367777641666544555654454047888625753334672588764510677443657999841602 Q gi|254781051|r 334 HLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI 413 (957) Q Consensus 334 HLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI 413 (957) ||||||||+|||+|+||..+++.+..++.+..-++.|+++++||||||+|||||.||++||++|+|++|||+|+|||||| T Consensus 284 HLEAvNPVAmGKtR~rqqsr~~Gdyspd~sa~~Gd~Vlnv~vHGDaaF~GQGiv~E~~~ls~~PHFrvGGsvHLivNNQv 363 (913) T KOG0451 284 HLEAVNPVAMGKTRSRQQSRGEGDYSPDSSAPFGDHVLNVIVHGDAAFAGQGIVQECLNLSYVPHFRVGGSVHLIVNNQV 363 (913) T ss_pred HHHCCCCHHHCCHHHHHHHHCCCCCCCCCCCCCCCCEEEEEEECCHHHCCCCCCHHHHHHCCCCCEEECCEEEEEECCCC T ss_conf 64015725322036777751378989877688877268999854511115863687774314873355655899962620 Q ss_pred ECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHH Q ss_conf 01247010246778400112203405874267889999999999999997389739986421125656435422355788 Q gi|254781051|r 414 GFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMM 493 (957) Q Consensus 414 GFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~m 493 (957) ||||....+|||.|||||||.|+||||||||||||+|++|++||++||.+|+|||||||+|||||||||+|||.||.|+| T Consensus 364 gfTtp~~rGRSs~ycsDiaK~~~~pviHVNGD~PEevvraTrLAf~Yqr~FRKDvfIdL~CfRrwgHnelddp~ftspvm 443 (913) T KOG0451 364 GFTTPGDRGRSSAYCSDIAKSIQAPVIHVNGDDPEEVVRATRLAFRYQREFRKDVFIDLNCFRRWGHNELDDPTFTSPVM 443 (913) T ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHEEEHHHHHHHCCCCCCCCCCCCHHH T ss_conf 02376446653024567898737877984799989999999999999998642225306777775234446744467047 Q ss_pred HHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC---------CCCCCCCCCH Q ss_conf 8886718998999999986448989899999999999997756665412587656666568---------6266677899 Q gi|254781051|r 494 YKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPEKLGLLHNG---------ENERKNSVSK 564 (957) Q Consensus 494 Y~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~~~~~~~~~---------~~~~~T~v~~ 564 (957) ||.|+.+.+|+++|+++|+++|++|+++++++.++|.++|++++..+..|.|-...+...| ...|+|||+. T Consensus 444 yk~v~aReSvPdlya~~L~~eg~~tee~vkE~~~~y~~~Ln~eL~~~~~y~Pp~~~~~~~W~gf~qapk~it~WdTGv~~ 523 (913) T KOG0451 444 YKEVEARESVPDLYAQQLAKEGVLTEEKVKEMRDEYMKYLNEELALAPAYQPPPSYFEKQWTGFQQAPKEITYWDTGVDY 523 (913) T ss_pred HHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCCHHHHCCCCCCCCH T ss_conf 77777641340899999986155539999999999999998998407766899634677522121580553111347646 Q ss_pred HHHHHHHHHHCCCCCCCCCCHHHHH-HHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEEEE Q ss_conf 9999998874368998620567788-899899873343564211478988898863583157850100223332302799 Q gi|254781051|r 565 EILKKIGSSIIRLPKSFNTHKIVER-LMANRQKMIETGKGIDWSMAESLAFGSLCYEGYKVRLSGQDCERGTFSHRHAIL 643 (957) Q Consensus 565 ~~L~~i~~~l~~~Pe~f~~h~~v~k-~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~VRlsGQD~~RGTFshRHavl 643 (957) +.|+.||.+.+++|++|++||.+.| .++.|++.++.|.+|||+|||+||+||||.+|++||||||||+|||||||||++ T Consensus 524 dLLrfiG~~SV~vPedf~~H~HLlKtHv~sRm~Km~~G~kiDWaTAEAlA~GSll~qG~nVRiSGqDVGRGTFshRHAM~ 603 (913) T KOG0451 524 DLLRFIGQQSVTVPEDFNIHPHLLKTHVNSRMKKMENGVKIDWATAEALAIGSLLYQGHNVRISGQDVGRGTFSHRHAML 603 (913) T ss_pred HHHHHHCCCCEECCHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCEEE T ss_conf 89998366764063211414799998899999987537764547788898888885367336603404764112121124 Q ss_pred EECCCCCEEEEHHHCCCCC-CEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 9778884455634427898-459998243111101343202211386531110000244245410225655420257623 Q gi|254781051|r 644 CDQETGKRYFPLGNISKDQ-GHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWL 722 (957) Q Consensus 644 ~dq~t~~~y~pL~~l~~~q-~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~ 722 (957) .||.|++.|+|||++.++| +.++|.||+|||+||||||||+|++||+.|.|||||||||+||||||||+||+|||+||+ T Consensus 604 VdQ~Td~~~IPLN~m~~~qkg~LEvans~LSEEAvLGFEyGmsienP~~L~iWEAQFGDFfNGAQIIiDTFi~sgE~KWl 683 (913) T KOG0451 604 VDQQTDEMFIPLNSMEGGQKGKLEVANSILSEEAVLGFEYGMSIENPNNLIIWEAQFGDFFNGAQIIIDTFIVSGETKWL 683 (913) T ss_pred EECCCCCEEEECCCCCCCCCCEEEECCCCCCHHHHHHHHCCCCCCCCCCCEEEHHHHCCCCCCCEEEEEEEECCCCHHHH T ss_conf 31256613536210377768716742662217666301103235786532453233303136754787640105512445 Q ss_pred CCCCCEEECCCCCCCCCCCCCCCCHHHHHHHCCCC---------CEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCH Q ss_conf 33681354368888888765644057887521437---------826972488999999999974353488689856602 Q gi|254781051|r 723 CTSNLVCLLPHGYEGQGPEHSSARLERFLQMCAEN---------NMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKS 793 (957) Q Consensus 723 ~~sglvllLPHGyeGqGPEHSSaRlERfLQl~a~~---------N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKs 793 (957) .+|||||||||||||+||||||||||||||||+.. ||.|+||||||||||+||||+.|+|||||||.+||. T Consensus 684 ~ssglvmLLPHGyDGAgpeHSSCRiERFLQlCDS~E~~vDGd~VNm~vvnPTTpAQYfHlLRRQ~vrNfRKPLiVv~PK~ 763 (913) T KOG0451 684 ESSGLVMLLPHGYDGAGPEHSSCRIERFLQLCDSKETSVDGDSVNMHVVNPTTPAQYFHLLRRQLVRNFRKPLIVVAPKT 763 (913) T ss_pred HHCCEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCCEEEECHHH T ss_conf 41874898257768889764315699999873233336777604579707997899999999998774258648855088 Q ss_pred HHCCCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCCCCC-EEEEEEECCC Q ss_conf 210801268878937998603515776433444544457954576899868857899999999769986-8999840148 Q gi|254781051|r 794 LLRHKRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRNIAD-IYLLRIEQLY 872 (957) Q Consensus 794 LLR~~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~~~~-vaivRiEqLy 872 (957) |||||.|+|+++||.+||.|++||.|. .++|+|||+|||||||.||.|.++|++++.+| +||+|+|.|| T Consensus 764 LLRlPaA~ST~~ef~PGTtf~nVigd~----------~~~p~kvkkvifcSGKH~y~l~k~Re~rgakd~~AI~RvE~LC 833 (913) T KOG0451 764 LLRLPAATSTHEEFQPGTTFHNVIGDT----------IAKPEKVKKVIFCSGKHYYTLAKEREKRGAKDTVAILRVESLC 833 (913) T ss_pred HHHCCCHHHHHHHCCCCCCCCCCCCCC----------CCCHHHHEEEEEECCCCHHHHHHHHHHCCCCCCEEEEEHHHCC T ss_conf 761700120476428985115433443----------3694670179996583143488878760555521467525408 Q ss_pred CCHHHHHHHHHHHCCCC-EEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHH Q ss_conf 68289999998537998-39980678531784578997699998750567760189753664672338678999999999 Q gi|254781051|r 873 PFPEDYLIKVLSRFVQA-EIVWCQEEPQNMGAWTFIEPYLEKVLHSIGADYSRVRYVGRLPSASTAVGHMSRHLEQLSSF 951 (957) Q Consensus 873 PfP~~~l~~~l~~y~~~-e~vW~QEEp~N~GaW~~v~~rl~~~l~~~~~~~~~~~y~gR~~~aspA~G~~~~H~~e~~~l 951 (957) |||.++|+++|++|+|+ ++||.||||+|||||+||+||+++.| .++|+|+|||+.+.||||.+++|++|.++| T Consensus 834 PFPi~~LQa~l~kY~~vqdfvWSQEEprNmGaWsFVrPRFEn~l------g~~L~~~GRpelp~pAtgIG~vH~~e~eei 907 (913) T KOG0451 834 PFPIQELQAQLAKYGNVQDFVWSQEEPRNMGAWSFVRPRFENLL------GQQLHYCGRPELPTPATGIGKVHKREVEEI 907 (913) T ss_pred CCCHHHHHHHHHHCCCHHHHCCCCCCCCCCCCCEEECHHHHHHH------HHHHEECCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 97469999999866871321103435666786111166778776------445100578999976422103546669998 Q ss_pred HHHHHC Q ss_conf 998719 Q gi|254781051|r 952 IEDALK 957 (957) Q Consensus 952 i~~a~~ 957 (957) |++.|. T Consensus 908 va~tFa 913 (913) T KOG0451 908 VAATFA 913 (913) T ss_pred HHHCCC T ss_conf 742259 No 6 >TIGR00239 2oxo_dh_E1 2-oxoglutarate dehydrogenase, E1 component; InterPro: IPR011603 2-oxoglutarate dehydrogenase is a key enzyme in the TCA cycle, converting 2-oxoglutarate, coenzyme A and NAD(+) to succinyl-CoA, NADH and carbon dioxide . This activity of this enzyme is tightly regulated and it is a major determinant of the metabolic flux through the TCA cycle. This enzyme is composed of multiple copies of three different subunits: 2-oxoglutarate dehydrogenase (E1), dihydrolipoamide succinyltransferase (E2) and lipoamide dehydrogenase (E3) which is often shared with similar enzymes such as pyruvate dehydrogenase . The E2 component forms a large multimeric core which binds the peripheral E1 and E3 subunits. The substrate is transferred between the active sites of the different subunits by a lipoyl moiety, bound to a lysine residue from the E2 polypeptide. This entry represents the E1 subunit of 2-oxoglutarate dehydrogenase. It catalyses the decarboxylation of this compound in a thiamine pyrophosphate-dependent manner, transferring the resultant succinyl group onto the liposyl moiety bound to the E2 subunit. The E1 ortholog from Corynebacterium glutamicum is unusual in having an N-terminal extension that resembles the E2 component of 2-oxoglutarate dehydrogenase enzyme. ; GO: 0004591 oxoglutarate dehydrogenase (succinyl-transferring) activity, 0030976 thiamin pyrophosphate binding, 0006096 glycolysis. Probab=100.00 E-value=0 Score=1717.45 Aligned_cols=924 Identities=45% Similarity=0.762 Sum_probs=836.5 Q ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 636789689999999987380507989999987367643-3333343444322222222111233344555666555633 Q gi|254781051|r 14 FLDGTNCCYIEDLYKSYQEDPSSVCKDWYPLFSFLDENS-EEYDNLEDGIASFLKEESVASAVFSQKKSAVESSCSLGET 92 (957) Q Consensus 14 FL~g~Na~YIEeLYekYL~DP~SVd~sWr~fF~~l~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 92 (957) |+.+.|..|+++||+.|+.||.||+++|+.+|+.++... .......... ..................... T Consensus 1 ~~~~~~~~~~~~~~~~~~~dp~~~~~~w~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~ 71 (990) T TIGR00239 1 FLGGANLGYIEELYEAYLKDPSSVDPSWRETFDELPGPGIPAPEQFHAPS---------PSYFRRLAGAAPKGTVTAGDP 71 (990) T ss_pred CCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCC---------HHHHHHHHCCCCCCHHHHHHH T ss_conf 97775226789999975057310232379898642777663543323454---------355655311110000001004 Q ss_pred HHHHHHHHHHHHH-HHHHHCCCHHHCCCCCCC----CCCCCCCCCCCHHHCCCCHHHCCCEEECCCC-CCCC----CCC- Q ss_conf 6789999999999-999881704540167777----7766787767876669997881715725665-7877----778- Q gi|254781051|r 93 QSVKDFFQVMKMI-DAYRSYGHFKANIDPLGY----NSHQKDLSELSPAHYGFVKADYDRKICMKGV-LGLE----SAT- 161 (957) Q Consensus 93 ~~~~kq~rV~~LI-~AYR~~GHl~A~lDPLgl----~~~~~~~peL~~~~yGLse~DLd~~f~~~~~-~g~~----~~t- 161 (957) ....+.+++.++| ++||.+||+.|++|||++ ....+..++|++.+|||++.|+++.|+++.. ++.. +.. T Consensus 72 ~~~~~~~~~~~~~c~~~~~~g~~~~~~~pl~~~~g~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~d~~~~~ 151 (990) T TIGR00239 72 DENASLVKLLQLLCRAYRFRGHLKANLDPLGYSFGSNWDEPKPPELDLSYYGLTEEDLDETFNLGPGVLPKDARDGPKKL 151 (990) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCCCCCCCHHHHH T ss_conf 55578899999988776530210013440000002223555553356320355345432321256300111000002233 Q ss_pred --HHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCHHHHHH-HHHHHHHHHHHHHHHHHHHCCCCEE Q ss_conf --9999999998406430277724899899999999841457788879889999-9999999998999998640336222 Q gi|254781051|r 162 --IPEIVDVLSHLYCSNIGVEFMHIVDSIERDWVRNTIEDYDFSSNFSKEEQRD-ILDKLVRAEGFEKFIDIKYKGAKRF 238 (957) Q Consensus 162 --L~eIi~~L~~tYCgsIGvEymHI~d~eer~Wlq~riE~~~~~~~~s~eeK~~-iL~~L~~Ae~FE~FL~~Kf~g~KRF 238 (957) +.++++.|+++||+++|+||+||.+.+++.|+++++|.......++.++|+. +|++|+.+++||+||++||||.||| T Consensus 152 ~~~~~~~~~l~~~~~~~~g~~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~~g~~~~l~~~~~g~~~~ 231 (990) T TIGR00239 152 GNGLELLEYLKKTYCGSIGLEFDHVPDFEEKDWLQEKIESGLEPFQFNSEEKLSAFLDRLTAAEGFERFLGAKFPGAKRF 231 (990) T ss_pred HHHHHHHHHHHHHHCCCCCCEEECCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 23789999998751221241120134246788998876410002331146788999987665434577652016662100 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCC----------CCCCCCCC Q ss_conf 25646589999999999987379107998067655268999885379899999855887432----------23455510 Q gi|254781051|r 239 GADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSE----------KEYSGDVK 308 (957) Q Consensus 239 sLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~----------~~~sGDVK 308 (957) ++||.|++||+++.+++.+...|++++++||+||||||+|+||++||++.+|.||.|..... ..++|||| T Consensus 232 ~~~g~~~~~p~~~~~~~~~~~~g~~~~~~g~~~~g~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~dp~~~~~~g~g~~~ 311 (990) T TIGR00239 232 SLEGLDALVPGLKELIRHSVKSGTEDVVLGMAHRGRLNVLVNVLGKPPESLFDEFAGKPASDLPSEDPLESATDGTGDVK 311 (990) T ss_pred CCCCCHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHH T ss_conf 22320245556899987554204112222000123045677653263135677641653101344342100125554112 Q ss_pred CCCC-EEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHH Q ss_conf 0150-268898359618999717877310004455323677776416665445556544540478886257533346725 Q gi|254781051|r 309 YHLG-LCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIV 387 (957) Q Consensus 309 YHLG-~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV 387 (957) ||+| .+++..++++.+++.|++||||||.++|||.|.+||+++...+. +.+..+..++++|++||||||+||||| T Consensus 312 ~~~g~~~~~~~~~~~~~~l~l~~~p~~l~~~~p~~~g~~~~~~~~~~~~----g~p~~~~~~~l~~~~~gd~~~~g~g~~ 387 (990) T TIGR00239 312 YHLGRFGSDFTTDGKLVHLPLAFNPSHLEIVSPVVLGSTRARLERLNDL----GYPVPEETKVLAILLHGDAAFAGQGVV 387 (990) T ss_pred HHHHCCCCCCCCCCCCCEEEECCCCCCHHHCCCHHHHHHHHHHHHHHHC----CCCCCCCCCEEEEEEECCCCCCCCCHH T ss_conf 3330012221125662012210364200010301210246666554313----786542332035665054223454235 Q ss_pred HHHHHHHCCCCCCCCCEEEEEECCCEECCC-CCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCC Q ss_conf 887645106774436579998416020124-7010246778400112203405874267889999999999999997389 Q gi|254781051|r 388 SETFGLSGLSGYTVAGNIHLIINNQIGFTT-NPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHK 466 (957) Q Consensus 388 ~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT-~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~k 466 (957) +||++|+.++||.+|||||||+|||||||| +|.++||+.||+|++|++++||||||+||||+|.++.++|++||.+|++ T Consensus 388 ~~~~~~~~~~~~~~gg~~~~~~~~~~g~~~~~~~~~~~~~~~~d~~~~~~~p~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 467 (990) T TIGR00239 388 QETLNLSSLRGYSVGGTIHIVINNQIGFTTSNPADARSTPYCSDLAKGIEAPIFHVNADDPEAVAFAFRLAVEYRNTFKR 467 (990) T ss_pred HHHHHHHCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHCCEEEECCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 55430100454223642688750521222366210014432234554320333662476446788999888887653121 Q ss_pred CEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHH----HHHHHHHHHHHHHHHHHC Q ss_conf 7399864211256564354223557888886718998999999986448989899999----999999997756665412 Q gi|254781051|r 467 SVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQS----LANNWHKYLEAEYKESES 542 (957) Q Consensus 467 DVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~----~~~~~~~~l~~~~~~~k~ 542 (957) |++||++|||||||||.|+|.+|||+||+.|++||++..+|.++|+.+|+++.+++++ +...+.+.++.+...... T Consensus 468 d~~~d~~~~~~~g~~~~d~p~~~~p~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~ 547 (990) T TIGR00239 468 DVLIDLVGYRRHGHNEADEPSATQPLLYDKIKKHPTPRKVYADKLVSEGVVTEEDVTEHKKWVWNLYRDALEKGDDVVPS 547 (990) T ss_pred CCEEEHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 00010011101467644564210036788775045302455656653011114566556778888777776544431121 Q ss_pred CCC----------CCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHH-HHHHCCCCCCCCHHHHH Q ss_conf 587----------6566665686266677899999999887436899862056778889989-98733435642114789 Q gi|254781051|r 543 YCP----------EKLGLLHNGENERKNSVSKEILKKIGSSIIRLPKSFNTHKIVERLMANR-QKMIETGKGIDWSMAES 611 (957) Q Consensus 543 ~~~----------~~~~~~~~~~~~~~T~v~~~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R-~~~~~~~~~iDW~~AE~ 611 (957) +.+ .+......|.....+.++...+..+++.+.++|+++.+|+.+.|++.+| ..+..++..+||+++|. T Consensus 548 ~~~~~~~~~~~~~~p~~l~~~~~~~~p~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~w~~~~~ 627 (990) T TIGR00239 548 WREWLTAAWEWLKSPPELPEEWLPGYPDKVPESRLKELGKLLGEWPEGLEFHPRLAKIYGDRGKALFDGEKLLDWGLAES 627 (990) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCHHHHCCCCCHHCCCHHH T ss_conf 11234455541146421244445545641037889999886531000134566788875200001221442000010124 Q ss_pred HHHHHHHHCCCEEEEECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCC Q ss_conf 88898863583157850100223332302799977888445563442789845999824311110134320221138653 Q gi|254781051|r 612 LAFGSLCYEGYKVRLSGQDCERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNA 691 (957) Q Consensus 612 LA~~SLl~eG~~VRlsGQD~~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~ 691 (957) |||++|+.+|.+||++|+|++||||+|||+|+||++++..|+||.|+...++.|.|+||+|||++||||||||++..|+. T Consensus 628 ~~~~~~~~~g~~~~~~g~d~~~g~~~~~~~~~~~~~~~~~~~pl~~~~~~~~~~~~~~~~l~~~~~~~~~~g~~~~~p~~ 707 (990) T TIGR00239 628 LAFATLLDDGTPVRLSGEDSGRGTFFHRHAVLHDQESGAGYTPLSHLHNGKGDFAVWNSPLSEGGVLGFEYGYALTSPET 707 (990) T ss_pred HHHHHHHHCCCCEEECCCCCCCCHHHHHHHHEECCCCCCEECCCHHCCCCCCCEEEECCCCCCCCHHHEECCCCCCCCCE T ss_conf 44444431574303215533222012211110035566301010000245432121012012000000001332358742 Q ss_pred CEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCCCCCCHHHHHHHCCCC-------------- Q ss_conf 111000024424541022565542025762333681354368888888765644057887521437-------------- Q gi|254781051|r 692 LTIWEAQFGDFANGAQVILDQFITAGEQKWLCTSNLVCLLPHGYEGQGPEHSSARLERFLQMCAEN-------------- 757 (957) Q Consensus 692 L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~~sglvllLPHGyeGqGPEHSSaRlERfLQl~a~~-------------- 757 (957) |++||||||||+|+||++|||||++|+.||++.+||||+||||||||||||||+|+|||||||+++ T Consensus 708 l~~w~~~~gd~~~~~~~~~d~~~~~g~~~w~~~~gl~~~lp~g~~g~gp~~~~~~~~~~l~l~~~~p~~~p~~~~~~~~~ 787 (990) T TIGR00239 708 LVLWEAQFGDFANGAQVVIDQFISGGEQKWGRKCGLVLLLPHGYEGQGPEHSSGRLERFLQLAAEDPRYFPSEEKLQRQH 787 (990) T ss_pred EEEEEHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHH T ss_conf 56630000332232132234445201343332226135515676777764204688999987513654476456776432 Q ss_pred ---CEEEEECCCHHHHHHHHHHHH---CCCCCCCEEEECCCHHHCCCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCC Q ss_conf ---826972488999999999974---35348868985660221080126887893799860351577643344454445 Q gi|254781051|r 758 ---NMYVANCTSPANYFHILRRQI---YDRSSRPLIMMAPKSLLRHKRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKL 831 (957) Q Consensus 758 ---N~~V~~~ttpAq~FH~LRRQ~---~r~~rkPLiv~tPKsLLR~~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~ 831 (957) ||+||+||||||+||+||||. +|.+|+||++|+||+|||||.++|+++++.+|+ |++++.|-... ... T Consensus 788 ~dc~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~pl~~~~~~~ll~~p~~~~~~~~~~~g~-~~~~~~~~~~~-----~~~ 861 (990) T TIGR00239 788 QDCNWQVCVPTTPAQLFHILRRQAKLLLREFRRPLVLLSPKSLLRHPLAVSSLEELSEGG-FQPVIGEIEEG-----GLS 861 (990) T ss_pred CCCCEEEEECCCHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHCCHHHHHHHHHHCCC-CHHHHHHHHCC-----CCC T ss_conf 035306751342478999988888888876414200012166651410245577652353-10223212202-----222 Q ss_pred CCHHHCCEEEEECCHHHHHH--HHHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHHCCCC-EEEEECCCCCCCCCHHHHH Q ss_conf 79545768998688578999--99999769986899984014868289999998537998-3998067853178457899 Q gi|254781051|r 832 KEDSHIRRVILCTGKVYYDL--LDNRDMRNIADIYLLRIEQLYPFPEDYLIKVLSRFVQA-EIVWCQEEPQNMGAWTFIE 908 (957) Q Consensus 832 ~~~~~v~rvi~csGKvyydL--~~~r~~~~~~~vaivRiEqLyPfP~~~l~~~l~~y~~~-e~vW~QEEp~N~GaW~~v~ 908 (957) .++++++|+++|+||+|||| ...|.+.+..+++++|+|||||||...+++.+.+||+. +++||||||.|||+|.|.. T Consensus 862 ~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p~p~~~~~~~~~~~~~~~~~~w~~~~p~~~g~w~~~~ 941 (990) T TIGR00239 862 LDPEEVKRLVLCSGKVYYDLSVHEQRRENGDKDLAILRIEQLYPFPHKGLKELLEPYPNLKEIVWCQEEPLNQGAWGYSE 941 (990) T ss_pred CCHHHCEEEEEECCCHHHHHHHHHHHHHCCCCCEEEEEHHHHCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCC T ss_conf 45113003233336122346666655413641010101122156754789999852145100223112433445310013 Q ss_pred HHHHHHHHHC-CCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 7699998750-56776018975366467233867899999999999871 Q gi|254781051|r 909 PYLEKVLHSI-GADYSRVRYVGRLPSASTAVGHMSRHLEQLSSFIEDAL 956 (957) Q Consensus 909 ~rl~~~l~~~-~~~~~~~~y~gR~~~aspA~G~~~~H~~e~~~li~~a~ 956 (957) |++..+++.. +.+..+++|+||+++++||.|+...|.+++..++..++ T Consensus 942 ~~~~~~~~~~d~~~d~~~~~~g~~~~~~~~~g~~~~~~~~~~~~~~~~~ 990 (990) T TIGR00239 942 PYLTEVLPEGDKYKDVKLRYAGRPPSGSPAVGYPSLHLKEEDNLLKDAL 990 (990) T ss_pred CHHHHHCCCCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 1022212454321210221216775555322413566555666775419 No 7 >cd02016 TPP_E1_OGDC_like Thiamine pyrophosphate (TPP) family, E1 of OGDC-like subfamily, TPP-binding module; composed of proteins similar to the E1 component of the 2-oxoglutarate dehydrogenase multienzyme complex (OGDC). OGDC catalyzes the oxidative decarboxylation of 2-oxoglutarate to succinyl-CoA and carbon dioxide, a key reaction of the tricarboxylic acid cycle. Probab=100.00 E-value=0 Score=1047.57 Aligned_cols=262 Identities=60% Similarity=1.049 Sum_probs=256.1 Q ss_pred HHHHHHHHHCCCCEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCC--CCCC Q ss_conf 99999864033622225646589999999999987379107998067655268999885379899999855887--4322 Q gi|254781051|r 224 FEKFIDIKYKGAKRFGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKG--PSEK 301 (957) Q Consensus 224 FE~FL~~Kf~g~KRFsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~--~~~~ 301 (957) ||+|||+||+|+|||||||||||||+|++||++|++.||++||||||||||||||+|||+||++.||+||+|+. |+.. T Consensus 1 FE~FL~~Kf~g~KRFslEG~EslIp~l~~li~~a~~~gv~~iviGMaHRGRLNvL~nvl~Kp~~~iF~eF~g~~~~~~~~ 80 (265) T cd02016 1 FEQFLATKFPGQKRFGLEGAESLIPALDELIDRAAELGVEEVVIGMAHRGRLNVLANVLGKPLEQIFSEFEGKSEFPEDD 80 (265) T ss_pred CHHHHHHHCCCCCEECCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCC T ss_conf 94046561888745415668899999999999998769867998256665188999986417999998527998896324 Q ss_pred CCCCCCCCCCCEEEEEEEC-CCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCC Q ss_conf 3455510015026889835-961899971787731000445532367777641666544555654454047888625753 Q gi|254781051|r 302 EYSGDVKYHLGLCCNRQIC-GKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAA 380 (957) Q Consensus 302 ~~sGDVKYHLG~s~~~~~~-g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAA 380 (957) .++||||||||+|++++++ |++|||+|+|||||||+|||||+|+||||||+++|. ++++||||||||||| T Consensus 81 ~~sGDVKYHlG~s~~~~~~~gk~v~lsL~~NPSHLEaVnPVv~G~~RAkQ~~~~~~---------~~~kvlpiliHGDAA 151 (265) T cd02016 81 EGSGDVKYHLGYSSDRKTPSGKKVHLSLAPNPSHLEAVNPVVMGKTRAKQDYRGDG---------ERDKVLPILIHGDAA 151 (265) T ss_pred CCCCCCCCCCCCEEEEEECCCCEEEEEECCCCCCCEEECCEECCHHHHHHHHHCCC---------CCCEEEEEEEECCCC T ss_conf 66677163784245786269988999975897641677777444056567764487---------766378999966611 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHH Q ss_conf 33467258876451067744365799984160201247010246778400112203405874267889999999999999 Q gi|254781051|r 381 FAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSF 460 (957) Q Consensus 381 faGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~y 460 (957) |+||||||||||||+|+||+|||||||||||||||||+|+++|||+||||||||++||||||||||||||++|++||++| T Consensus 152 faGQGvv~Etl~ls~l~gy~~GGtiHiivNNQIGFTT~p~~~Rss~Y~tDiaK~~~~Pi~HVNgddpeav~~a~~lA~~y 231 (265) T cd02016 152 FAGQGVVYETLNLSNLPGYTTGGTIHIVVNNQIGFTTDPRDSRSSPYCTDVAKMIGAPIFHVNGDDPEAVVRATRLALEY 231 (265) T ss_pred CCCCCHHHHHHHHHCCCCCCCCCEEEEEEECCEEECCCCHHHCCCCCCHHHHHHHCCCEEEECCCCHHHHHHHHHHHHHH T ss_conf 14565899876663389985476699998074100258134045764002688867988997689989999999999999 Q ss_pred HHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHH Q ss_conf 9973897399864211256564354223557888 Q gi|254781051|r 461 RMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMY 494 (957) Q Consensus 461 R~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY 494 (957) ||+|+|||||||||||||||||+|||+||||+|| T Consensus 232 R~~F~kDvvIDlvcYRr~GHNE~DeP~fTqP~mY 265 (265) T cd02016 232 RQKFKKDVVIDLVCYRRHGHNELDEPSFTQPLMY 265 (265) T ss_pred HHHHCCCEEEEEEEECCCCCCCCCCCCCCCCCCC T ss_conf 9986998799978734568787788677687769 No 8 >pfam00676 E1_dh Dehydrogenase E1 component. This family uses thiamine pyrophosphate as a cofactor. This family includes pyruvate dehydrogenase, 2-oxoglutarate dehydrogenase and 2-oxoisovalerate dehydrogenase. Probab=99.97 E-value=7e-27 Score=223.02 Aligned_cols=291 Identities=22% Similarity=0.279 Sum_probs=240.8 Q ss_pred HHHHHHHHHHHHHHHCCCCEEC----CCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHH Q ss_conf 9999989999986403362222----564658999999999998737910799806765526899988537989999985 Q gi|254781051|r 218 LVRAEGFEKFIDIKYKGAKRFG----ADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEF 293 (957) Q Consensus 218 L~~Ae~FE~FL~~Kf~g~KRFs----LEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF 293 (957) ++....||.-+...|.-+++.| -.|-|++-.++-..+ .-++.++.. ||+.-..|+. |-|+..||+|+ T Consensus 3 ~m~~~r~ed~~~~~~~~~~~~g~~h~~~GqEA~~vg~~~~l------~~~D~~~~~-yR~~g~~la~--G~~~~~~~ae~ 73 (303) T pfam00676 3 MMTLRRMEDARMALYQRKGRRGFCHLYAGQEALQVGIAAAL------NPGDYVIPT-YRDHGNLLAR--GVSLEQVMAEL 73 (303) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHC------CCCCEEEEC-HHHHHHHHHC--CCCHHHHHHHH T ss_conf 55999999999999973898246489888899999999976------999999806-6669999981--99999999997 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEE Q ss_conf 58874322345551001502688983596189997178773100044553236777764166654455565445404788 Q gi|254781051|r 294 KGKGPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPL 373 (957) Q Consensus 294 ~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpi 373 (957) .|+......|.|.- -|+-++. -...+.+-.|++-.--|++.|.+-|.+.... +.+.| T Consensus 74 ~Gk~~g~~~Grggs-mh~~~~~--------~~~~~~~~~~~vg~~~~~avG~A~a~k~~~~--------------~~v~v 130 (303) T pfam00676 74 TGNEAGCSKGKGGS-MHGYYAP--------KNNRFYGGNGIVGAQVPLGAGIALAAKYRGK--------------KEVAI 130 (303) T ss_pred CCCCCCCCCCCCCC-CCCCCCC--------CCCCCCCCCCCCCCCCCHHHHHHHHHHHHCC--------------CCEEE T ss_conf 78988998999997-7457666--------4455147874130482265668999996389--------------98799 Q ss_pred EEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHH Q ss_conf 86257533346725887645106774436579998416020124701024677840011220340587426788999999 Q gi|254781051|r 374 IIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRV 453 (957) Q Consensus 374 liHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~ 453 (957) ..+||+| +.||+++|+|||+.+...- .|+||.|||.+.+|..++.+++..-+|.|+.+++|.++|+|.||++|..+ T Consensus 131 ~~~GDGa-~~~G~f~EalN~A~~~~lP---vifv~eNN~yaist~~~~~~~~~~i~~ra~~~gi~~~~vDG~D~~~v~~~ 206 (303) T pfam00676 131 TLFGDGA-TNQGQFFEALNFAALWKLP---VIFVCENNQYAISTPAERSSASTYYADRARGYGIPGIRVDGMDPLAVYQA 206 (303) T ss_pred EEECCCC-CCCCHHHHHHHHHHHCCCC---EEEEEEECCCCCCCCHHHHCCCCCHHHHHHHCCCEEEECCCCCHHHHHHH T ss_conf 9967997-5232999999987543898---79999527733666664514755488886525802896488658999999 Q ss_pred HHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCC-HHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 9999999997389739986421125656435422355-788888671899899999998644898989999999999999 Q gi|254781051|r 454 VRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQ-PMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKY 532 (957) Q Consensus 454 ~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQ-P~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~ 532 (957) ++-|+++..+.++.++|+..|||..||.+.|+|.... |..++...++..+...+.++|+++|++|+++++++.++.++. T Consensus 207 ~~~a~~~aR~g~gP~lie~~tyR~~gHs~~D~~~~Yr~~~e~~~~~~~~DPi~~~~~~L~~~g~~~~~~~~~i~~e~~~e 286 (303) T pfam00676 207 VKFAAERARTGNGPTLIELVTYRYGGHSMSDDPSTYRTREEVEEVRKKKDPIKRLKKHLVSRGVVSEEELKEIEKEVRKE 286 (303) T ss_pred HHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 99999998648998899875113786678888767779899999886599699999999987999999999999999999 Q ss_pred HHHHHHHHHCCC Q ss_conf 775666541258 Q gi|254781051|r 533 LEAEYKESESYC 544 (957) Q Consensus 533 l~~~~~~~k~~~ 544 (957) .+++++.++.-. T Consensus 287 v~~a~~~A~~~p 298 (303) T pfam00676 287 IEEAVKKAESDP 298 (303) T ss_pred HHHHHHHHHHCC T ss_conf 999999998086 No 9 >TIGR03182 PDH_E1_alph_y pyruvate dehydrogenase E1 component, alpha subunit. Members of this protein family are the alpha subunit of the E1 component of pyruvate dehydrogenase (PDH). This model represents one branch of a larger family that E1-alpha proteins from 2-oxoisovalerate dehydrogenase, acetoin dehydrogenase, another PDH clade, etc. Probab=99.89 E-value=2.9e-19 Score=165.46 Aligned_cols=296 Identities=20% Similarity=0.225 Sum_probs=233.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCE--E--CCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHC Q ss_conf 889999999999999899999864033622--2--256465899999999999873791079980676552689998853 Q gi|254781051|r 208 KEEQRDILDKLVRAEGFEKFIDIKYKGAKR--F--GADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMN 283 (957) Q Consensus 208 ~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KR--F--sLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~ 283 (957) +|+-+.+++.|+.+-.||.-+..-|..-|= | .-.|-|+.-.++ +....-++.+.+ .||+.--.|+- | T Consensus 1 ke~ll~~yr~M~~~R~~e~~~~~l~~qG~i~g~~h~~~GqEa~~vg~------~~~l~~~D~i~~-~yR~hg~~la~--G 71 (315) T TIGR03182 1 KEELLELYRDMLLIRRFEEKAGQLYGMGKIGGFCHLYIGQEAVAVGL------IAALKPDDYVIT-SYRDHGHALAR--G 71 (315) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHH------HHHCCCCCEEEE-CCCCHHHHHHH--C T ss_conf 97999999999999999999999997798884623888769999999------997799888985-13378999995--8 Q ss_pred CCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 79899999855887432234555100150268898359618999717877310004455323677776416665445556 Q gi|254781051|r 284 KLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSIS 363 (957) Q Consensus 284 Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~ 363 (957) -+++.||+|+-|+......|.|- --|++.. .+.+.+-.|.+-.--|++.|.+-|.+.+.. T Consensus 72 ~~~~~~~ael~g~~~g~~~Grgg-s~H~~~~----------~~~~~~~~~ivg~~~p~A~G~A~a~k~~~~--------- 131 (315) T TIGR03182 72 VPPKEVMAELTGRATGCSKGKGG-SMHMFDR----------EKNFYGGHGIVGAQVPLATGLAFANKYRGN--------- 131 (315) T ss_pred CCHHHHHHHHHCCCCCCCCCCCC-CCCCCCC----------CCCCCCCCCCCCCCCCHHHHHHHHHHHHCC--------- T ss_conf 99999999983678899999999-7764452----------127778873040374387769999986089--------- Q ss_pred CCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEEC Q ss_conf 54454047888625753334672588764510677443657999841602012470102467784001122034058742 Q gi|254781051|r 364 LVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVN 443 (957) Q Consensus 364 ~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN 443 (957) .-+.+.-.||+|.+ ||+++|+|||+.+-.-- .|-||-|||.+-.|.....-+..--++-|+.++.|.+.|+ T Consensus 132 -----~~v~v~~~GDGa~~-~G~f~EalN~A~~~~lP---vifv~eNN~yaist~~~~~~~~~~i~~ra~~~gi~~~~vD 202 (315) T TIGR03182 132 -----DNVTACFFGDGAAN-QGQFYESFNMAALWKLP---VIFVIENNLYAMGTSVERSSSVTDLYKRGESFGIPGERVD 202 (315) T ss_pred -----CCEEEEEECCCCCC-CCHHHHHHHHHHCCCCC---EEEEEEECCCCCCCCHHHHCCCCHHHHHHHHCCCCEEEEE T ss_conf -----98799995788665-34399999887414898---8999952874155767775366336665543699858983 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHH Q ss_conf 67889999999999999997389739986421125656435422355788888671899899999998644898989999 Q gi|254781051|r 444 GDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQ 523 (957) Q Consensus 444 ~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~ 523 (957) |.||++|..+++-|+++-.+-+.-++|...|||-.||.+.|.-.+--+.--+..+++ .+...+.++|+++|++|+++++ T Consensus 203 GnD~~av~~~~~~Ai~~~R~g~gP~lie~~tyR~~gHs~~D~~~YR~~~E~~~w~~r-DPi~~~~~~L~~~g~~~~~~~~ 281 (315) T TIGR03182 203 GMDVLAVREAAKEAVERARSGKGPILLEMKTYRFRGHSMSDPAKYRSKEEVEEWRKR-DPIEKLKARLIEQGIASEEELK 281 (315) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHC-CHHHHHHHHHHHCCCCCHHHHH T ss_conf 577899999999999997559998899987426888899873024698999988847-9399999999987999999999 Q ss_pred HHHHHHHHHHHHHHHHHHC Q ss_conf 9999999997756665412 Q gi|254781051|r 524 SLANNWHKYLEAEYKESES 542 (957) Q Consensus 524 ~~~~~~~~~l~~~~~~~k~ 542 (957) ++.++.++..+++++.++. T Consensus 282 ~i~~~~~~ev~~a~~~A~~ 300 (315) T TIGR03182 282 EIDKEVRAEVEEAVEFAEN 300 (315) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999985 No 10 >cd02000 TPP_E1_PDC_ADC_BCADC Thiamine pyrophosphate (TPP) family, E1 of PDC_ADC_BCADC subfamily, TPP-binding module; composed of proteins similar to the E1 components of the human pyruvate dehydrogenase complex (PDC), the acetoin dehydrogenase complex (ADC) and the branched chain alpha-keto acid dehydrogenase/2-oxoisovalerate dehydrogenase complex (BCADC). PDC catalyzes the irreversible oxidative decarboxylation of pyruvate to produce acetyl-CoA in the bridging step between glycolysis and the citric acid cycle. ADC participates in the breakdown of acetoin while BCADC participates in the breakdown of branched chain amino acids. BCADC catalyzes the oxidative decarboxylation of 4-methyl-2-oxopentanoate, 3-methyl-2-oxopentanoate and 3-methyl-2-oxobutanoate (branched chain 2-oxo acids derived from the transamination of leucine, valine and isoleucine). Probab=99.89 E-value=2.4e-19 Score=166.05 Aligned_cols=287 Identities=18% Similarity=0.220 Sum_probs=225.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC-EE---CCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHH Q ss_conf 99999999989999986403362-22---256465899999999999873791079980676552689998853798999 Q gi|254781051|r 214 ILDKLVRAEGFEKFIDIKYKGAK-RF---GADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSI 289 (957) Q Consensus 214 iL~~L~~Ae~FE~FL~~Kf~g~K-RF---sLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~i 289 (957) +++.++..-.||.-+..-|-.-| +| +--|-|+.-.++-..+ .-++.+.+ .||+.--.|+- |-|+..+ T Consensus 1 ~y~~M~~~R~~e~~~~~l~~~G~~~~~~~~~~GqEa~~vg~~~~l------~~~D~~~~-~yR~~~~~la~--G~~~~~~ 71 (293) T cd02000 1 LYRTMVLIRRFDERLLELYRQGKIGGFYHLSIGQEAVAVGVAAAL------RPGDWVFP-TYRDHGHALAR--GVDLKEM 71 (293) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHC------CCCCEEEE-CCCHHHHHHHH--CCCHHHH T ss_conf 978999999999999999978968655208787699999999868------99898981-45208999990--9999999 Q ss_pred HHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCE Q ss_conf 99855887432234555100150268898359618999717877310004455323677776416665445556544540 Q gi|254781051|r 290 FYEFKGKGPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSR 369 (957) Q Consensus 290 F~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~ 369 (957) |+|+.|+......|.| ---|++. ..+.+.+-.|.+-.--|++.|.+-|.+.... . T Consensus 72 ~~e~~g~~~g~~~Gr~-gs~H~~~----------~~~~~~~~~~~vg~~~p~a~G~A~a~k~~~~--------------~ 126 (293) T cd02000 72 LAELFGKETGPCKGRG-GSMHIGD----------KEKNFFGGNGIVGGQVPLAAGAALALKYRGE--------------D 126 (293) T ss_pred HHHHHCCCCCCCCCCC-CCCCCCC----------CCCCEECCCCCCCCCCCHHHHHHHHHHHHCC--------------C T ss_conf 9998257778888888-8685887----------1020504654124663347767899997589--------------9 Q ss_pred EEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHH Q ss_conf 47888625753334672588764510677443657999841602012470102467784001122034058742678899 Q gi|254781051|r 370 VLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEA 449 (957) Q Consensus 370 vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEa 449 (957) -+.+...||+|. .||+++|+|||+.+..-- .|-||-|||.+-.|..++......-.|-|+.++.|.++|||.||++ T Consensus 127 ~v~v~~~GDGa~-~~G~f~EalN~A~~~~LP---vlfv~enN~yaist~~~~~~~~~~~~~r~~~~gi~~~~vDG~D~~~ 202 (293) T cd02000 127 RVAVCFFGDGAT-NEGDFHEALNFAALWKLP---VIFVCENNGYAISTPTSRQTAGTSIADRAAAYGIPGIRVDGNDVLA 202 (293) T ss_pred CEEEEEECCCCC-CCCHHHHHHHHHHHCCCC---EEEEEEECCEECCCCHHHHCCCCCHHHHHHHCCCCEEEECCCCHHH T ss_conf 889997168745-514799999988653798---7999974873023568873277448773532698769974878999 Q ss_pred HHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC-CCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 99999999999997389739986421125656435422-35578888867189989999999864489898999999999 Q gi|254781051|r 450 VIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPS-FTQPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANN 528 (957) Q Consensus 450 v~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~-fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~ 528 (957) |..+++-|+++-.+-++.++|.+.|||-+||...|+|. +-.+.--+.-++ ..+...+.+.|+++|++|+++++++.++ T Consensus 203 v~~~~~~Ai~~~R~~~gP~lie~~TyR~~gHs~~dd~~~YR~~~E~~~~~~-~DPi~~~~~~L~~~g~~te~~~~~i~~e 281 (293) T cd02000 203 VYEAAKEAVERARAGGGPTLIEAVTYRLGGHSTSDDPSRYRTKEEVEEWKK-RDPILRLRKYLIEAGILTEEELAAIEAE 281 (293) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHH-CCCHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 999999999998538998899998561675568898656889999998884-7919999999998799999999999999 Q ss_pred HHHHHHHHHHH Q ss_conf 99997756665 Q gi|254781051|r 529 WHKYLEAEYKE 539 (957) Q Consensus 529 ~~~~l~~~~~~ 539 (957) .++..+++++. T Consensus 282 i~~~V~~A~~~ 292 (293) T cd02000 282 VKAEVEEAVEF 292 (293) T ss_pred HHHHHHHHHHC T ss_conf 99999999951 No 11 >CHL00149 odpA pyruvate dehydrogenase E1 component alpha subunit; Reviewed Probab=99.89 E-value=4.2e-19 Score=164.22 Aligned_cols=314 Identities=18% Similarity=0.195 Sum_probs=233.3 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEC----CCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 145778887988999999999999989999986403362222----5646589999999999987379107998067655 Q gi|254781051|r 198 EDYDFSSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFG----ADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRG 273 (957) Q Consensus 198 E~~~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFs----LEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRG 273 (957) +...+...+|+|+-+.+++.|+..-.||.-+..-|..-|-+| --|-|+.-.++-. ...-++.+. -.||+ T Consensus 11 ~~~~~~~~ls~e~ll~~yr~M~l~R~fee~~~~l~~qG~i~g~~h~~~GqEA~~vg~~~------al~~~D~i~-~~yR~ 83 (343) T CHL00149 11 ATNLNSVNLNKEEALVLYRDMLLGRNFEDMCAQMYYRGKMFGFVHLYNGQEAVSTGVIK------ALKPTDYVC-STYRD 83 (343) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHH------HCCCCCEEE-ECCHH T ss_conf 88764589899999999999999999999999999779976064688868999999999------779999897-05152 Q ss_pred HHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHH Q ss_conf 26899988537989999985588743223455510015026889835961899971787731000445532367777641 Q gi|254781051|r 274 RLNVLSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLK 353 (957) Q Consensus 274 RLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~ 353 (957) .=-.|+- |-|++.|++|+-|+......|.| =--|++... ..+...-|++-.--|++.|.+-|.+.+. T Consensus 84 h~~~la~--G~~~~~~~ael~Gk~~G~~~Grg-gsmH~~~~~----------~~~~g~~~ivg~~~p~A~G~A~a~~~~~ 150 (343) T CHL00149 84 HVHALSK--GVPPREVMAELFGKETGCSKGRG-GSMHLFSAP----------HNFLGGFAFIGEGIPIALGAAFQSIYRQ 150 (343) T ss_pred HHHHHHC--CCCHHHHHHHHCCCCCCCCCCCC-CCCCCCCCC----------CCCCCCCCCCCCCCCHHHHHHHHHHHHH T ss_conf 7879981--99999999997588899999999-977766722----------2712455343345306999999999875 Q ss_pred CCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHH Q ss_conf 66654455565445404788862575333467258876451067744365799984160201247010246778400112 Q gi|254781051|r 354 AGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISK 433 (957) Q Consensus 354 ~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK 433 (957) .... ...++-+.|...||+|-+ ||.++|+|||+.+-.-- .|-||-|||.+-.|.....-+..--.+-|+ T Consensus 151 ~~~k-------~~g~~~V~v~ffGDGat~-eG~fhEalN~A~~~~LP---viFv~eNN~yaist~~~~~~~~~~i~~ra~ 219 (343) T CHL00149 151 QVLK-------DTQELRVTACFFGDGTTN-NGQFFECLNMAVLWKLP---IIFVVENNQWAIGMAHHRSTSEPEIHKKAE 219 (343) T ss_pred HHCC-------CCCCCCEEEEEECCCCCC-CCHHHHHHHHHHHHCCC---EEEEEEECCCCCCCCHHHCCCCCHHHHHHH T ss_conf 4302-------478982799983576332-03899999999984797---689998788146577323057633778886 Q ss_pred HHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 20340587426788999999999999999738973998642112565643542235578888867189989999999864 Q gi|254781051|r 434 SIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYADSLMK 513 (957) Q Consensus 434 ~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~ 513 (957) .++.|-+.|+|.|+++|..+++-|+++=.+-+.-++|...|||-.||-+.|...+--+.-= ..-+...+...+.++|++ T Consensus 220 ~yGi~~~~vDGnD~~av~~~~~~Ai~~aR~g~gP~lie~~TyR~~gHs~~D~~~YR~~~E~-e~w~~~DPi~~~~~~L~~ 298 (343) T CHL00149 220 AFGLPGIEVDGMDVLAVREVAKEAVERARSGQGPTLIEALTYRFRGHSLADPDELRSKQEK-EAWIARDPIKKLKSYIID 298 (343) T ss_pred HCCCCEECCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCCCCCCCCCHHHH-HHHHHCCCHHHHHHHHHH T ss_conf 5699720179878999999999999984569998899998607988789873556799999-988847969999999998 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 489898999999999999977566654125 Q gi|254781051|r 514 NQVISKQELQSLANNWHKYLEAEYKESESY 543 (957) Q Consensus 514 ~g~it~~e~~~~~~~~~~~l~~~~~~~k~~ 543 (957) .|++++++++++.++.++..+++++.+... T Consensus 299 ~g~~~~~~~~~i~~~~~~~v~~a~~~A~~~ 328 (343) T CHL00149 299 NELVSQKELNKIQKEVKIEIEDAVEFAISS 328 (343) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 799999999999999999999999999838 No 12 >TIGR03181 PDH_E1_alph_x pyruvate dehydrogenase E1 component, alpha subunit. Members of this protein family are the alpha subunit of the E1 component of pyruvate dehydrogenase (PDH). This model represents one branch of a larger family that E1-alpha proteins from 2-oxoisovalerate dehydrogenase, acetoin dehydrogenase, another PDH clade, etc. Probab=99.88 E-value=8.8e-19 Score=161.76 Aligned_cols=295 Identities=15% Similarity=0.160 Sum_probs=233.0 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCE---ECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHH Q ss_conf 7788879889999999999999899999864033622---2256465899999999999873791079980676552689 Q gi|254781051|r 201 DFSSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKR---FGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNV 277 (957) Q Consensus 201 ~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KR---FsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNV 277 (957) ...+.+|+|+-..+++.++..-.||.-+..-|---|= .+.-|-|+.-.+. +....-++.++ -.||+.=-. T Consensus 16 ~~~~~ls~e~ll~~yr~M~~~R~~e~~~~~l~~~G~ig~~~~~~GqEA~avg~------~~aL~~~D~i~-~~yR~hg~~ 88 (341) T TIGR03181 16 EPAPDLSDEELVELYRDMVLTRRFDTKALALQRQGRLGTYAPNLGQEAAQVGS------ALALRKDDWVF-PSYRDHAAM 88 (341) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHH------HHHCCCCCEEE-ECCCCHHHH T ss_conf 77899899999999999999999999999999679865767999739999999------97589879897-156723633 Q ss_pred HHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCC Q ss_conf 99885379899999855887432234555100150268898359618999717877310004455323677776416665 Q gi|254781051|r 278 LSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIV 357 (957) Q Consensus 278 L~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~ 357 (957) |+- |-++.+||+|+.|+..-. ..+ ....+.+.-|.+-+--|++.|.+-|.+.+..| T Consensus 89 la~--G~~~~~~~ae~~G~~~G~---~~~-----------------~~~~~~~~~~iVG~~~p~A~G~A~a~k~~~~~-- 144 (341) T TIGR03181 89 LAR--GVPLVEILLYWRGDERGS---WDP-----------------EGVNILPPNIPIGTQYLHAAGVAYALKLRGED-- 144 (341) T ss_pred EEC--CCCHHHHHHHHCCCCCCC---CCC-----------------CCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCC-- T ss_conf 127--998999999855977678---886-----------------01476888740134654147788899872999-- Q ss_pred CCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCC Q ss_conf 44555654454047888625753334672588764510677443657999841602012470102467784001122034 Q gi|254781051|r 358 GEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGI 437 (957) Q Consensus 358 ~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~a 437 (957) -+.+...||+|.+ ||.++|+|||+.+-.-- .|-||-|||.+-.|.....-+..--.+-|+.++. T Consensus 145 ------------~V~v~ffGDGA~~-eG~fhEalN~Aa~~~LP---viFv~eNN~yaist~~~~~~~~~~i~~ra~~yGi 208 (341) T TIGR03181 145 ------------NVAVTYFGDGGTS-EGDFYEALNFAGVFKAP---VVFFVQNNQWAISVPRSKQTAAPTLAQKAIAYGI 208 (341) T ss_pred ------------CEEEEEECCCCCC-CCHHHHHHHHHHHHCCC---EEEEEEECCCCCCCCHHHHCCCCHHHHHHHHCCC T ss_conf ------------8899994688643-24699999999983899---8999961773577763342377369875142498 Q ss_pred CEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC-CCCHHHHHHHHHCCCHHHHHHHHHHHCCC Q ss_conf 05874267889999999999999997389739986421125656435422-35578888867189989999999864489 Q gi|254781051|r 438 PIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPS-FTQPMMYKTIRSHKSVLQLYADSLMKNQV 516 (957) Q Consensus 438 PI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~-fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~ 516 (957) |-+.|+|.|+++|..+++-|+++=.+-+.-++|...|||-.||.+.|+|. +--+--.+.-+++ .+...+.+.|+++|+ T Consensus 209 ~~~~vDG~D~~av~~a~~~Av~~~R~g~gP~liE~~TyR~~gHs~~Dd~~~YR~~~E~~~w~~~-DPi~~~~~~L~~~g~ 287 (341) T TIGR03181 209 PGVQVDGNDVLAVYAVTKEAVERARSGGGPTLIEAVTYRLGPHTTADDPTRYRTKEEEEEWRKK-DPILRLRKYLERKGL 287 (341) T ss_pred CEEECCCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHC-CCHHHHHHHHHHCCC T ss_conf 8674488567999999999999985589988999985027886788998778898999998858-949999999998799 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 898999999999999977566654125 Q gi|254781051|r 517 ISKQELQSLANNWHKYLEAEYKESESY 543 (957) Q Consensus 517 it~~e~~~~~~~~~~~l~~~~~~~k~~ 543 (957) +|+++++++.++.++..+++++.++.- T Consensus 288 ~~e~e~~~i~~~i~~ev~~a~~~A~~s 314 (341) T TIGR03181 288 WDEEQEEALEEEAEAEVAEAVAEALAL 314 (341) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999999999999999999999999968 No 13 >COG1071 AcoA Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, alpha subunit [Energy production and conversion] Probab=99.77 E-value=9.6e-15 Score=131.23 Aligned_cols=312 Identities=18% Similarity=0.202 Sum_probs=224.8 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCE---ECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHH Q ss_conf 8879889999999999999899999864033-622---225646589999999999987379107998067655268999 Q gi|254781051|r 204 SNFSKEEQRDILDKLVRAEGFEKFIDIKYKG-AKR---FGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLS 279 (957) Q Consensus 204 ~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g-~KR---FsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~ 279 (957) ..+++++-+.++++++..-.||.-+..-|-- ... .+--|-|+.-.++...++. =++.+++ .||+.=-+|+ T Consensus 22 ~~~~~~~l~~~y~~M~l~R~fd~k~~~l~r~G~i~gf~~~~~GqEA~~vg~~~aL~~-----~~D~i~~-~YR~h~~~l~ 95 (358) T COG1071 22 AALSKEELLELYRLMLLIRRFDEKMLQLQRQGKIGGFYHLYIGQEAVQVGAAAALRP-----GEDWIFP-TYRDHGHLLA 95 (358) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCC-----CCCEEEC-CCCCCCEEEE T ss_conf 568999999999999999999999999985777665757766679999999984388-----9887503-5576501002 Q ss_pred HHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCC Q ss_conf 88537989999985588743223455510015026889835961899971787731000445532367777641666544 Q gi|254781051|r 280 QIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGE 359 (957) Q Consensus 280 Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~ 359 (957) - |-|++.||.|+.|+......|. +--.|++.. . ..+++-.+-.-+--|.+.|.+-|.+.+..+ T Consensus 96 ~--G~~~~~~~a~~~G~~~g~~kGr-~~~~h~~~~-~---------~~~~~~~~iVg~Q~~~AaG~A~a~k~~~~~---- 158 (358) T COG1071 96 R--GVPLKEIMAELLGKATGPCKGR-GGSMHYSDK-E---------KGFLGGSGIVGTQIPLAAGAALALKYRGTK---- 158 (358) T ss_pred C--CCCHHHHHHHHHCCCCCCCCCC-CCCCCCCCC-C---------CCCCCCCCEECCCCCHHHHHHHHHHHHCCC---- T ss_conf 5--9999999999855245888888-886310110-0---------045787840466534799999999974899---- Q ss_pred CCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCE Q ss_conf 55565445404788862575333467258876451067744365799984160201247010246778400112203405 Q gi|254781051|r 360 KSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPI 439 (957) Q Consensus 360 ~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI 439 (957) .-+.+..-||.|-+ ||-++|+|||+.+-.-- .|-+|.|||-+--+.-.....+....+=|..++.|= T Consensus 159 ---------~~Va~~~~GDGat~-qG~FhEalN~A~v~klP---vvf~ieNN~yAiSvp~~~q~~~~~~~~ra~aygipg 225 (358) T COG1071 159 ---------DGVAVAFFGDGATN-QGDFHEALNFAAVWKLP---VVFVIENNQYAISVPRSRQTAAEIIAARAAAYGIPG 225 (358) T ss_pred ---------CCEEEEEECCCCCC-CCHHHHHHHHHHHHCCC---EEEEEECCCCEEECCHHHCCCCHHHHHHHHCCCCCE T ss_conf ---------81899994578655-33399998999985698---799996687167045664033246775501058872 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCC-CCHHHHHHHHHCCCHHHHHHHHHHHCCCCC Q ss_conf 8742678899999999999999973897399864211256564354223-557888886718998999999986448989 Q gi|254781051|r 440 FHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSF-TQPMMYKTIRSHKSVLQLYADSLMKNQVIS 518 (957) Q Consensus 440 ~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~f-TQP~mY~~I~~hp~v~~~Y~~~Li~~g~it 518 (957) +-|+|-||.||..+++-|+++=..=+--++|..+|||-.||...|+|.- -...-=+.-+ +..+...+.+.|++.|++| T Consensus 226 v~VDG~D~~avy~~~~~A~e~AR~g~GPtLIE~~tYR~~~HS~sDd~~~YRskeE~~~~~-~~DPi~r~~~~L~~~g~~s 304 (358) T COG1071 226 VRVDGNDVLAVYEAAKEAVERARAGEGPTLIEAVTYRYGGHSTSDDPSKYRSKEEVEEWK-KRDPIVRLRKYLIEAGILS 304 (358) T ss_pred EEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCCHHHHHHHH-CCCHHHHHHHHHHHCCCCC T ss_conf 787797799999999999999974899889999875358887779833367989998775-0694999999999768989 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCCCCC Q ss_conf 8999999999999977566654125-876566665 Q gi|254781051|r 519 KQELQSLANNWHKYLEAEYKESESY-CPEKLGLLH 552 (957) Q Consensus 519 ~~e~~~~~~~~~~~l~~~~~~~k~~-~~~~~~~~~ 552 (957) +++.+++.++.++..+++.+.++.. .|.....+. T Consensus 305 ee~~~~i~~e~~~~V~ea~e~a~~~p~P~~~~~~~ 339 (358) T COG1071 305 EEELEAIEAEAKAEVDEAVEFAEASPYPDVSELFE 339 (358) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 99999999999999999999997089998467651 No 14 >pfam02779 Transket_pyr Transketolase, pyrimidine binding domain. This family includes transketolase enzymes, pyruvate dehydrogenases, and branched chain alpha-keto acid decarboxylases. Probab=99.70 E-value=2.3e-16 Score=143.51 Aligned_cols=171 Identities=30% Similarity=0.395 Sum_probs=149.4 Q ss_pred CCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHH Q ss_conf 64211478988898863583157850100223332302799977888445563442789845999824311110134320 Q gi|254781051|r 603 GIDWSMAESLAFGSLCYEGYKVRLSGQDCERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEY 682 (957) Q Consensus 603 ~iDW~~AE~LA~~SLl~eG~~VRlsGQD~~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEy 682 (957) .+.|-.|-..|..-++.+.-.|-+-|+|+..|||.+.....+.+ ++=.++|.+++|.+.+|+-. T Consensus 2 ~~~~r~a~~~al~~~~~~d~~vv~~~~D~~~g~~~~~~~~~~~~----------------~~~R~~~~gIaE~~~vg~a~ 65 (174) T pfam02779 2 KIATRKASGEALAELAERDPRVVGGGADVGGGTFTVTKGLLHPQ----------------GEGRVIDTGIAEQAMVGIAN 65 (174) T ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHCCCCCC----------------CCCCEEECCCCHHHHHHHHH T ss_conf 76499999999999997798999994877887551332465877----------------89856765817799999999 Q ss_pred CCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC-CEEECCCCCCCCCCCCCCCCHHHHHHHCCCCCEEE Q ss_conf 22113865311100002442454102256554202576233368-13543688888887656440578875214378269 Q gi|254781051|r 683 GYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWLCTSN-LVCLLPHGYEGQGPEHSSARLERFLQMCAENNMYV 761 (957) Q Consensus 683 Gys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~~sg-lvllLPHGyeGqGPEHSSaRlERfLQl~a~~N~~V 761 (957) |+|+.-. +.+||.||+||. |..-++.|...-++|+.... ++...|+|+.++||.|.|-..+.++..+- ||.| T Consensus 66 G~A~~G~--~~iv~~~f~~F~---~~r~~~~i~~~~~~~~~~v~~v~~~~g~~~g~~G~tHhs~~~~a~~~~iP--gl~V 138 (174) T pfam02779 66 GMALHGL--LPPVEATFGDFA---NIRADAAIRHYAALGKLPVPFVVTRDPIGVGEDGPTHQSQEDEAYLRAIP--GLKV 138 (174) T ss_pred HHHHHCC--CCCCCHHHHHHH---HHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHHHCC--CCEE T ss_conf 9997099--641112178888---88659999999985899998786664602567715465642899997589--9889 Q ss_pred EECCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHC Q ss_conf 72488999999999974353488689856602210 Q gi|254781051|r 762 ANCTSPANYFHILRRQIYDRSSRPLIMMAPKSLLR 796 (957) Q Consensus 762 ~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR 796 (957) +.|++|.+.+++||..+.++..+|.+++.||+|+| T Consensus 139 ~~Ps~~~da~~ll~~ai~~~~~~Pv~i~~~r~~~~ 173 (174) T pfam02779 139 VRPSDPAEAKGLLRAAIRRDDDDPVVFRLPRQLLR 173 (174) T ss_pred EECCCHHHHHHHHHHHHHCCCCCCEEEEECHHHCC T ss_conf 96199999999999999648998389983077678 No 15 >KOG0225 consensus Probab=99.60 E-value=2.7e-12 Score=112.68 Aligned_cols=303 Identities=21% Similarity=0.274 Sum_probs=224.6 Q ss_pred HHCC-CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECC----CCHHHHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 8414-57788879889999999999999899999864033622225----646589999999999987379107998067 Q gi|254781051|r 196 TIED-YDFSSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFGA----DGSEVIIPAIEEIIRQGVQQGVDEMILGMA 270 (957) Q Consensus 196 riE~-~~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsL----EG~EslIP~L~~ii~~a~~~Gv~eiviGMa 270 (957) ++|. +.....+++++-..+++.+..---||--...-|--.|=+|- .|-|+.-.+|+..|..+ +.||-.- T Consensus 45 ~l~~~p~~s~~~t~ee~L~~Y~~M~~~RrmE~aad~lYK~k~IRGFCHLy~GQEAvavGme~ait~~-----D~iItsY- 118 (394) T KOG0225 45 KLEEGPSTSVELTKEEALKYYRDMQTIRRMELAADQLYKAKKIRGFCHLYDGQEAVAVGMEAAITKS-----DSIITSY- 118 (394) T ss_pred ECCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCEEEEECCCHHHHHHHHHHHCCCC-----CCEEEEE- T ss_conf 7157998159950888999999999999999999998865211213663265799999999854677-----7447776- Q ss_pred CCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE---EECCCEEEEEECCCCCCCHHCCHHHHHHHH Q ss_conf 65526899988537989999985588743223455510015026889---835961899971787731000445532367 Q gi|254781051|r 271 HRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNR---QICGKDVKLLLQSNPSHLEFVDPVVIGSVR 347 (957) Q Consensus 271 HRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~---~~~g~~v~lsL~~NPSHLEaVnPVv~G~~R 347 (957) |--..+=+.|-+...||+|..|...--..| .|.|-.. .+-|+.=-| -|--|+-.|.+- T Consensus 119 ---R~Hg~~~~~G~S~~~v~aEL~Gr~~Gc~kG-------KGGSMHmy~k~FyGGnGIV---------GAQiPLGaGia~ 179 (394) T KOG0225 119 ---RCHGWTYLRGVSVREVLAELMGRQAGCSKG-------KGGSMHMYAKNFYGGNGIV---------GAQIPLGAGIAF 179 (394) T ss_pred ---EEEEEEEECCCCHHHHHHHHHCCCCCCCCC-------CCCCEEEECCCCCCCCCEE---------CCCCCCCCCHHH T ss_conf ---520588643861999999984621255667-------7861055032334766423---------257773112899 Q ss_pred HHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHH-CCCCC Q ss_conf 77764166654455565445404788862575333467258876451067744365799984160201247010-24677 Q gi|254781051|r 348 ARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSS-ARSCT 426 (957) Q Consensus 348 AkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d-~RSs~ 426 (957) |.+.++.+ -+++.+-||.| |-||-+.|.+||+.|=.-- .|-|+-||--|.-|+... +-|+- T Consensus 180 A~kY~~~~--------------~v~~alYGDGA-aNQGQ~fEa~NMA~LW~LP---~IFvCENN~yGMGTs~~Rasa~te 241 (394) T KOG0225 180 AQKYNRED--------------AVCFALYGDGA-ANQGQVFEAFNMAALWKLP---VIFVCENNHYGMGTSAERASASTE 241 (394) T ss_pred HHHHCCCC--------------CEEEEEECCCC-CCCHHHHHHHHHHHHHCCC---EEEEECCCCCCCCCCHHHHHCCHH T ss_conf 99743688--------------65999954665-4440389885399986898---799982688766760343203827 Q ss_pred CCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCC-CCCCCHHHHHHHHHCCCHHH Q ss_conf 84001122034058742678899999999999999973897399864211256564354-22355788888671899899 Q gi|254781051|r 427 YASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDE-PSFTQPMMYKTIRSHKSVLQ 505 (957) Q Consensus 427 Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~De-P~fTQP~mY~~I~~hp~v~~ 505 (957) |-+ +.--.|=+||+|-|+-||-.|+++|.+|-.+=+--++..+..||-+||.-.|. -++----=-+..++...+.. T Consensus 242 yyk---RG~yiPGl~VdGmdvlaVr~a~KfA~~~~~~g~GPilmE~~TYRy~GHSmSDPg~sYRtReEiq~vR~kRDPI~ 318 (394) T KOG0225 242 YYK---RGDYIPGLKVDGMDVLAVREATKFAKKYALEGKGPILMEMDTYRYHGHSMSDPGTSYRTREEIQEVRQKRDPIE 318 (394) T ss_pred HHH---CCCCCCCEEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCHHH T ss_conf 775---06878755787831245899999999997537998799986346546546899865211899998876058489 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999998644898989999999999999775666541258 Q gi|254781051|r 506 LYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYC 544 (957) Q Consensus 506 ~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~ 544 (957) ...++||+.++.|++|++++.++.++..|.+.+.++.-. T Consensus 319 ~lk~~li~~~late~ELKai~k~irkeVdeav~~A~~~p 357 (394) T KOG0225 319 GLKKRLIELGLATEEELKAIDKEIRKEVDEAVAFATASP 357 (394) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 999999870366799999999999999999998763289 No 16 >smart00861 Transket_pyr Transketolase, pyrimidine binding domain. Transketolase (TK) catalyzes the reversible transfer of a two-carbon ketol unit from xylulose 5-phosphate to an aldose receptor, such as ribose 5-phosphate, to form sedoheptulose 7-phosphate and glyceraldehyde 3- phosphate. This enzyme, together with transaldolase, provides a link between the glycolytic and pentose-phosphate pathways. TK requires thiamine pyrophosphate as a cofactor. In most sources where TK has been purified, it is a homodimer of approximately 70 Kd subunits. TK sequences from a variety of eukaryotic and prokaryotic sources show that the enzyme has been evolutionarily conserved. In the peroxisomes of methylotrophic yeast Hansenula polymorpha, there is a highly related enzyme, dihydroxy-acetone synthase (DHAS) (also known as formaldehyde transketolase), which exhibits a very unusual specificity by including formaldehyde amongst its substrates. Probab=99.39 E-value=2.3e-12 Score=113.17 Aligned_cols=164 Identities=26% Similarity=0.245 Sum_probs=124.5 Q ss_pred CHHHHHHHHHHHHHC-CCEEEEECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCC Q ss_conf 114789888988635-8315785010022333230279997788844556344278984599982431111013432022 Q gi|254781051|r 606 WSMAESLAFGSLCYE-GYKVRLSGQDCERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGY 684 (957) Q Consensus 606 W~~AE~LA~~SLl~e-G~~VRlsGQD~~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGy 684 (957) |..|-.-|+..|+.+ +..|.+.|+|++.+|..+++.+..+.+.. .+.+ ++|.++||.+.+|+..|+ T Consensus 2 ~r~a~~~~l~~l~~~~~~~iv~~~~Dl~~~~~~~~~~~f~~~~~~-----------~~~r--~i~~gIaE~~~vg~a~Gl 68 (168) T smart00861 2 TRKAFGEALAELAERDPERVVVSGADVGGSTGLDRGGVFPDTKGL-----------GPGR--VIDTGIAEQAMVGFAAGL 68 (168) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEEHHHCCCCCCCCCCCCCCCCCC-----------CCCC--EEECCCCHHHHHHHHHHH T ss_conf 189999999999856997199983674887577536610444456-----------9987--564483789999987599 Q ss_pred CCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCC-CCCCCCCCHH-HHHHHCCCCCEEEE Q ss_conf 113865311100002442454102256554202576233368135436888888-8765644057-88752143782697 Q gi|254781051|r 685 SLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWLCTSNLVCLLPHGYEGQ-GPEHSSARLE-RFLQMCAENNMYVA 762 (957) Q Consensus 685 s~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~~sglvllLPHGyeGq-GPEHSSaRlE-RfLQl~a~~N~~V~ 762 (957) |+.. +..|.++|.+|...|.-.|.++++. + +..-++...++|+.|+ ||.|.|-..+ .|..+. ||.|+ T Consensus 69 A~~G---~~pi~~~~~~f~~~a~~qi~~~~~~---~--~~~~v~~~~~~~~~g~~G~tH~s~~~~~~~~~ip---~~~v~ 137 (168) T smart00861 69 ALAG---LRPVVAIFFTFFDRAKDQIRSDGAM---G--RVPVVVRHDSGGGVGEDGPTHHSQEDEALLRAIP---GLKVV 137 (168) T ss_pred HHCC---CCCCCCHHHHHHHHHHHHHHHHHHC---C--CCCEEEEEECEEEECCCCCHHCCCCHHHHHHHCC---CCEEE T ss_conf 8759---9642102357887589999865340---6--6773788601005668864110444388886179---97899 Q ss_pred ECCCHHHHHHHHHHHHCCCCCCCEEEECCCH Q ss_conf 2488999999999974353488689856602 Q gi|254781051|r 763 NCTSPANYFHILRRQIYDRSSRPLIMMAPKS 793 (957) Q Consensus 763 ~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKs 793 (957) .|++|.+..++||+++..+.+.|.|.+++|+ T Consensus 138 ~P~~~~e~~~~l~~a~~~~~~p~~i~l~~~~ 168 (168) T smart00861 138 APSDPAEAKGLLRAAIRRDDGPPVIRLERKS 168 (168) T ss_pred EECCHHHHHHHHHHHHHCCCCCEEEEECCCC T ss_conf 7099999999999998488998899973899 No 17 >PRK12315 1-deoxy-D-xylulose-5-phosphate synthase; Provisional Probab=99.24 E-value=7.9e-07 Score=71.34 Aligned_cols=409 Identities=17% Similarity=0.175 Sum_probs=234.2 Q ss_pred CCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCC Q ss_conf 73100044553236777764166654455565445404788862575333467258876451067744365799984160 Q gi|254781051|r 333 SHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQ 412 (957) Q Consensus 333 SHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQ 412 (957) .||-.==.++.|++.|+.-.. ...+|.+|. ||++.. .|+++|.+++..-- . +--|=|+=+|+ T Consensus 113 G~~g~gls~avGmA~A~~~~~------------~~~~v~~v~--GDG~l~-eG~~~EA~~~Ag~~--~-~nlivi~ddN~ 174 (581) T PRK12315 113 GHTSTSIALATGMAKARDLKG------------EKENIIAVI--GDGSLS-GGLAFEGLNNAAEL--K-SNLIIIVNDNQ 174 (581) T ss_pred CCCHHHHHHHHHHHHHHHHHC------------CCCCEEEEE--CCCHHH-HHHHHHHHHHHCCC--C-CCEEEEEECCC T ss_conf 763789999999999999729------------998489998--551344-01679998763458--9-85599997688 Q ss_pred EECCCC-------CHHCCCC--CCCCHHHHHHCCCEEEE-CCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCC Q ss_conf 201247-------0102467--78400112203405874-2678899999999999999973897399864211256564 Q gi|254781051|r 413 IGFTTN-------PSSARSC--TYASDISKSIGIPIFHV-NGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNE 482 (957) Q Consensus 413 IGFTT~-------p~d~RSs--~Y~TDiAK~i~aPI~HV-N~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE 482 (957) |.-.-+ -.+.|.+ .-..++-+.++.-++.| ||.|+|++..|.+-|- ...+-++|-.....=+|.. T Consensus 175 isI~~~~~~~~~~~~~~~~~~~~~~~~~fe~~G~~~~~v~DGhd~~~i~~al~~ak----~~~~P~~I~v~T~kG~G~~- 249 (581) T PRK12315 175 MSIAENHGGLYKNLKELRDTNGQSENNLFKAMGLDYRYVEDGNDLESLIELFKEVK----DIDHPIVLHIHTLKGKGYK- 249 (581) T ss_pred EEECCCCHHHHHHHHHHHHHCCCCCHHHHHHCCCEEEECCCCCCHHHHHHHHHHHH----HCCCCEEEEEEECCCCCCC- T ss_conf 02358730777777887750375501069984986882579999999999999986----3699979999967863666- Q ss_pred CCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 35422355788888671899899999998644898989999999999999775666541258765666656862666778 Q gi|254781051|r 483 GDEPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPEKLGLLHNGENERKNSV 562 (957) Q Consensus 483 ~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~~~~~~~~~~~~~~T~v 562 (957) |.-..|.-| |- ..|- +++|+- T Consensus 250 ---~a~~~~~~~-----H~-----------------------------------------~~pf----------~~~~g~ 270 (581) T PRK12315 250 ---PAEENKEAF-----HW-----------------------------------------HMPF----------DLETGQ 270 (581) T ss_pred ---CCCCHHHHH-----CC-----------------------------------------CCCC----------CCCCCC T ss_conf ---110357763-----66-----------------------------------------5676----------613465 Q ss_pred CHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEEE Q ss_conf 99999999887436899862056778889989987334356421147898889886358315785010022333230279 Q gi|254781051|r 563 SKEILKKIGSSIIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGYKVRLSGQDCERGTFSHRHAI 642 (957) Q Consensus 563 ~~~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~VRlsGQD~~RGTFshRHav 642 (957) ++ .++.. -+|..+-.=+...++.++-+|---.-|..-+|. T Consensus 271 ~~------------~~~~~----------------------~~~~~~~~~~l~~~~~~~~~ivai~a~~~~~~g------ 310 (581) T PRK12315 271 SK------------VPASG----------------------ESYSSVVLDYLLKKIEEGKPVVAINAAIPGVFG------ 310 (581) T ss_pred CC------------CCCCC----------------------CCHHHHHHHHHHHHHHHCCCEEEECCCCCCCCC------ T ss_conf 45------------67875----------------------698999999999998746441542135787657------ Q ss_pred EEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHC Q ss_conf 99778884455634427898459998243111101343202211386531110000244245410225655420257623 Q gi|254781051|r 643 LCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWL 722 (957) Q Consensus 643 l~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~ 722 (957) ...+..-. ..+| ++--..|.+..+|--|.++. ++.-.-+-|++|.. --+||.+...- . T Consensus 311 ---------l~~f~~~~--P~R~--i~~GIaEq~mv~~a~GlA~~---G~~P~~~~fstFl~---ra~dqi~~dia---l 368 (581) T PRK12315 311 ---------LKEFRKKY--PDRY--VDVGIAEQHSVAFASGMAAA---GARPVIFVNSTFLQ---RAYDQLSHDLA---I 368 (581) T ss_pred ---------CCHHHHHC--CCCE--EECCHHHHHHHHHHHHHHHC---CCEEEEEEECHHHH---HHHHHHHHHHH---H T ss_conf ---------53132418--5016--75160888999998789757---99646888442331---46889989999---8 Q ss_pred CCCCCEEECCCC-CCCCCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHCCCCCC Q ss_conf 336813543688-8888876564405788752143782697248899999999997435348868985660221080126 Q gi|254781051|r 723 CTSNLVCLLPHG-YEGQGPEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKSLLRHKRVV 801 (957) Q Consensus 723 ~~sglvllLPHG-yeGqGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~ 801 (957) +....++..-|| --+-||-|-. +|-.--|.+=.||.|..|+.+.+.-.+++--+.+ ...|.++-.|+.-+.... T Consensus 369 ~~lpv~~~~~~~~igedGpTHq~--iedia~lR~iPN~~V~~PaD~~E~~~~~~~Al~~-~~gP~~iR~pr~~~~~~~-- 443 (581) T PRK12315 369 NNNPAVMIVFGGSISGNSVTHQG--SFDIAMISNIPNLVYLAPTSKEELIAMLEWALTQ-HEHPVAIRVPEHVLESGP-- 443 (581) T ss_pred HCCCEEEEEECCCCCCCCCCCCC--HHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHHC-CCCCEEEEECCCCCCCCC-- T ss_conf 36998999961777899987514--7788997256883899319999999999999971-699589993787677787-- Q ss_pred CCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCC---CCCEEEEEEECCCCCHHHH Q ss_conf 8878937998603515776433444544457954576899868857899999999769---9868999840148682899 Q gi|254781051|r 802 SSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRN---IADIYLLRIEQLYPFPEDY 878 (957) Q Consensus 802 S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~---~~~vaivRiEqLyPfP~~~ 878 (957) ....++..|. . .++.+ .+ +-+|+++|-....-+++.+... --++.+|-+--+-||..+. T Consensus 444 ~~~~~~~~g~-~-~v~~~--------------G~--dv~Iia~Gs~v~~A~~aa~~L~~~~gI~~~Vi~~~~vkPlD~~~ 505 (581) T PRK12315 444 TVDTDYSTLN-Y-EVAKA--------------GE--KVAILALGDFYELGEKVAKKLKEKLGIDATLINPKFITGLDEEL 505 (581) T ss_pred CCCCCCCCCC-E-EEEEC--------------CC--CEEEEEEHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHH T ss_conf 6555555441-8-99755--------------89--88999860778999999999875129987999577248879999 Q ss_pred HHHHHHHCCCCEEEEECCCCCCCCC-HHHHHHHH Q ss_conf 9999853799839980678531784-57899769 Q gi|254781051|r 879 LIKVLSRFVQAEIVWCQEEPQNMGA-WTFIEPYL 911 (957) Q Consensus 879 l~~~l~~y~~~e~vW~QEEp~N~Ga-W~~v~~rl 911 (957) +....+.| +.+.+=||+--.|+ ++.|...+ T Consensus 506 l~~~~~~~---~~ivtvEeh~~~GGlGs~v~~~l 536 (581) T PRK12315 506 LENLKEDH---ELVVTLEDGILDGGFGEKIARYY 536 (581) T ss_pred HHHHHHCC---CEEEEEECCCCCCCHHHHHHHHH T ss_conf 99998249---87999927976645899999999 No 18 >PRK11892 pyruvate dehydrogenase subunit beta; Provisional Probab=99.24 E-value=1.3e-08 Score=84.76 Aligned_cols=296 Identities=18% Similarity=0.215 Sum_probs=199.3 Q ss_pred HHHHHHHCCCEEEEECCCCCC--CCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCC Q ss_conf 889886358315785010022--333230279997788844556344278984599982431111013432022113865 Q gi|254781051|r 613 AFGSLCYEGYKVRLSGQDCER--GTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLN 690 (957) Q Consensus 613 A~~SLl~eG~~VRlsGQD~~R--GTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~ 690 (957) |.+-=+..--.|=+-|||+.. |.|-. -+-|.+.=|.=-|.|.|+||.|..|.--|.++.- T Consensus 151 a~~eEM~rD~~Vfv~GEdVg~~gG~fkv----------------T~GL~~~FG~~RV~DTPISE~gi~G~aiGaA~~G-- 212 (464) T PRK11892 151 AMAEEMRRDEDVFVMGEEVAEYQGAYKV----------------TQGLLQEFGARRVIDTPITEHGFAGVGVGAAMAG-- 212 (464) T ss_pred HHHHHHHHCCCEEEECCCCCCCCCCCCC----------------CHHHHHHHCCHHHCCCCCCCHHHHHHHHHHHHCC-- T ss_conf 9999985099889980400114870103----------------4227877583042078999567889999999759-- Q ss_pred CCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC------CEEECCCCC-CCCCCCCCCCCHHHHHHHCCCCCEEEEE Q ss_conf 311100002442454102256554202576233368------135436888-8888765644057887521437826972 Q gi|254781051|r 691 ALTIWEAQFGDFANGAQVILDQFITAGEQKWLCTSN------LVCLLPHGY-EGQGPEHSSARLERFLQMCAENNMYVAN 763 (957) Q Consensus 691 ~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~~sg------lvllLPHGy-eGqGPEHSSaRlERfLQl~a~~N~~V~~ 763 (957) --.|=|-||.||+-.| +||-|..+ +||.-+|| ||+-.|.|- -|+||.||- .+|-|+ +.--.+.|+. T Consensus 213 lRPIvEimf~dF~~~A---~DQIiN~A-AK~rymsgGq~~~PiViR~p~G~g~~~gaqHSQ-s~e~~f--ahiPGLkVv~ 285 (464) T PRK11892 213 LKPIVEFMTFNFAMQA---IDQIINSA-AKTLYMSGGQMGCPIVFRGPNGAAARVGAQHSQ-DYAAWY--SHIPGLKVVA 285 (464) T ss_pred CCCEEEEECHHHHHHH---HHHHHHHH-HHHHCCCCCCCCCCEEEEECCCCCCCCCCCCCC-CHHHHH--HCCCCCEEEE T ss_conf 9847998603168999---99999888-886404588614777998136666677766655-767786--2178707995 Q ss_pred CCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHCCCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEE Q ss_conf 48899999999997435348868985660221080126887893799860351577643344454445795457689986 Q gi|254781051|r 764 CTSPANYFHILRRQIYDRSSRPLIMMAPKSLLRHKRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILC 843 (957) Q Consensus 764 ~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~c 843 (957) ||||+....||+--+..+ -|.|.+-+|.|.+.+.-+...+|+.--- -+.-|- ..-+.| .||=. T Consensus 286 Pstp~DAkGLLkaAIrd~--nPVIflE~k~LY~~~~eVP~~~d~~iPi-GkA~V~-------------reG~Dv-TIVsy 348 (464) T PRK11892 286 PYSAADAKGLLKAAIRDP--NPVIFLENEILYGQSFEVPKLDDFVLPI-GKARIH-------------REGKDV-TIVSF 348 (464) T ss_pred ECCHHHHHHHHHHHHHCC--CCEEEEEEHHHCCCCCCCCCCCCCCCCC-CCEEEE-------------CCCCCE-EEEEC T ss_conf 189999999999997389--9889996021037840268754444566-405896-------------379978-99975 Q ss_pred CCHHHHHHHHHHHH-CCCCCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHH-HHCCCC Q ss_conf 88578999999997-6998689998401486828999999853799839980678531784578997699998-750567 Q gi|254781051|r 844 TGKVYYDLLDNRDM-RNIADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYLEKVL-HSIGAD 921 (957) Q Consensus 844 sGKvyydL~~~r~~-~~~~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl~~~l-~~~~~~ 921 (957) +.-+++-|..+..- ..--++-+|-+-.|+||..+.|.+-++|-. .+|=|+|-+++.|-++-+-.++.+-. ..| T Consensus 349 G~mv~~aleAAe~La~eGIs~EVIDLRtL~PLD~etI~~SVkKTg--RlvVVeE~~~~~GvgaEIaA~I~E~aF~~L--- 423 (464) T PRK11892 349 SIGMTYALKAAEELAKEGIDAEVIDLRTIRPMDTETIVESVKKTN--RLVTVEEGWPQSGVGAEIAARVMEQAFDYL--- 423 (464) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHC--CEEEEECCCCCCCHHHHHHHHHHHHHHHHC--- T ss_conf 726999999999998739978999788678899899999999879--899996898888679999999999879860--- Q ss_pred CCEEEEEECCCCCCCCCC-CHHHHHHHHHHHHHHH Q ss_conf 760189753664672338-6789999999999987 Q gi|254781051|r 922 YSRVRYVGRLPSASTAVG-HMSRHLEQLSSFIEDA 955 (957) Q Consensus 922 ~~~~~y~gR~~~aspA~G-~~~~H~~e~~~li~~a 955 (957) ...+.-|+-+-.+.|... .-+...-.-++|++.+ T Consensus 424 dAPv~RV~~~D~PiP~a~~LE~~~lP~~~~Iv~Av 458 (464) T PRK11892 424 DAPVLRVTGKDVPMPYAANLEKLALPSVAEVVEAV 458 (464) T ss_pred CCCCEEECCCCCCCCCCHHHHHHHCCCHHHHHHHH T ss_conf 89848978898798889779977589999999999 No 19 >PTZ00182 3-methyl-2-oxobutanate dehydrogenase; Provisional Probab=99.23 E-value=8.7e-09 Score=86.15 Aligned_cols=308 Identities=20% Similarity=0.250 Sum_probs=201.8 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCC-CCCCCCEEEEEECCCCCEEEEH Q ss_conf 899862056778889989987334356421147898889886358315785010022-3332302799977888445563 Q gi|254781051|r 577 LPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGYKVRLSGQDCER-GTFSHRHAILCDQETGKRYFPL 655 (957) Q Consensus 577 ~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~VRlsGQD~~R-GTFshRHavl~dq~t~~~y~pL 655 (957) -|..|..-|....-- ..-+-+.+-..-|=.-|..-.|.+.-.|=+-|||++. |.|-. - T Consensus 13 ~~~~~~~~~~~~~~~-----~~g~~~~~t~~~Ai~~al~~~m~~d~~v~~~GeDv~~GG~f~~----------------T 71 (355) T PTZ00182 13 QPSEFSFAPRFGSEE-----SSGNTKKMNLFQAINSALDIALSKDPKTVLFGEDVAFGGVFRC----------------S 71 (355) T ss_pred CCCCEEECCCCCCCC-----CCCCCEEEEHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHH----------------H T ss_conf 545424269999877-----7888226579999999999999659988998477366766664----------------0 Q ss_pred HHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCC-------CCE Q ss_conf 4427898459998243111101343202211386531110000244245410225655420257623336-------813 Q gi|254781051|r 656 GNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWLCTS-------NLV 728 (957) Q Consensus 656 ~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~~s-------glv 728 (957) ..|.+.-|+--|.|.|+||.+..|+--|.++.-- -.+=|-||+||. +.-+||-+..+ +||.-+| .|| T Consensus 72 ~gL~~~fGp~Rv~DtPIsE~~ivG~AvG~A~~G~--rPVvei~f~dFl---~~A~DQIvn~a-Ak~~~~sgg~~~~~plv 145 (355) T PTZ00182 72 LGLLKKYGEQRVFNTPLCEQGIVGFAIGLAAVGW--TAIAEIQFADYI---FPAFDQIVNEA-AKYRYRSGGQFDCGKLT 145 (355) T ss_pred HHHHHHHCCCCEECCCCCHHHHHHHHHHHHHCCC--CEEEEEEHHHHH---HHHHHHHHHHH-HHHHHCCCCEEECCEEE T ss_conf 2599985885003178558889999999997699--359997568789---99999999998-77750038703024289 Q ss_pred EECCCCCCCC-CCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHCCCCCCCCHHHH Q ss_conf 5436888888-876564405788752143782697248899999999997435348868985660221080126887893 Q gi|254781051|r 729 CLLPHGYEGQ-GPEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKSLLRHKRVVSSLSDM 807 (957) Q Consensus 729 llLPHGyeGq-GPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~d~ 807 (957) +-.|-|.-|. ||.|| ..+|-++ +.--|+.|+.||||+...++|+.-+.. .-|.|++-||.|.|....--+.+++ T Consensus 146 iR~p~G~~~~~g~~HS-qs~ea~f--~~iPGLkVv~Ps~p~DAkgLl~sAi~d--~~PVif~E~k~Ly~~~~~~vp~~~~ 220 (355) T PTZ00182 146 IRSTWGAVGHGGLYHS-QSPEAYF--AHCAGLKIVVPSDPYQAKGLLLACIRD--PNPVIFFEPKILYRSAVEEVPPDDY 220 (355) T ss_pred EECCCCCCCCCCCCCC-CCHHHHH--HCCCCCEEEECCCHHHHHHHHHHHHCC--CCCEEEEECHHHCCCCCCCCCCCCC T ss_conf 9537888777886101-7969998--268997899349999999999876107--9858999654540567667885445 Q ss_pred CCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCC---CCCEEEEEEECCCCCHHHHHHHHHH Q ss_conf 7998603515776433444544457954576899868857899999999769---9868999840148682899999985 Q gi|254781051|r 808 TCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRN---IADIYLLRIEQLYPFPEDYLIKVLS 884 (957) Q Consensus 808 ~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~---~~~vaivRiEqLyPfP~~~l~~~l~ 884 (957) .- -| ++..+ ..+-+. =.|++.|..-+..+++-++.. --++-+|-+--|.|+..+.|.+-++ T Consensus 221 ~i------pl-------Gka~i-~r~G~D--vTIVa~G~mv~~al~AA~~L~~e~GIs~EVIDlRtL~PlD~e~I~~SV~ 284 (355) T PTZ00182 221 TL------EL-------GKAEV-VKEGKD--VTMVGYGSQVGVMKKAAELAEKEHGISVEVIDLQTILPWDRETVADSVK 284 (355) T ss_pred CC------CC-------CEEEE-EEECCC--EEEEEECHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHH T ss_conf 44------46-------66799-961794--7999965689999999999975029977999547468999999999999 Q ss_pred HCCCCEEEEECCCCCCCCCHHHHHHHHHHH-HHHCCCCCCEEEEEECCCCCCCC Q ss_conf 379983998067853178457899769999-87505677601897536646723 Q gi|254781051|r 885 RFVQAEIVWCQEEPQNMGAWTFIEPYLEKV-LHSIGADYSRVRYVGRLPSASTA 937 (957) Q Consensus 885 ~y~~~e~vW~QEEp~N~GaW~~v~~rl~~~-l~~~~~~~~~~~y~gR~~~aspA 937 (957) |-. .++-|+|.++.-|-=+-|-.++.+- +..+ ...+.-++-+-.+.|. T Consensus 285 KTg--rllvVeE~~~~~G~gsEI~a~i~E~~f~~L---~ap~~Rv~~~d~P~P~ 333 (355) T PTZ00182 285 KTG--RVIVTHEAPKTSGMGAEIAATIQERCFLSL---EAPIKRVCGYDTPFPL 333 (355) T ss_pred HHC--CEEEEECCCCCCCHHHHHHHHHHHHHHHHC---CCCCEEECCCCCCCCH T ss_conf 879--899998899888689999999999868750---7986897889888986 No 20 >CHL00144 odpB pyruvate dehydrogenase E1 component beta subunit; Validated Probab=99.22 E-value=1.5e-08 Score=84.36 Aligned_cols=294 Identities=16% Similarity=0.217 Sum_probs=195.3 Q ss_pred HHHHHHHCCCEEEEECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCC Q ss_conf 88988635831578501002233323027999778884455634427898459998243111101343202211386531 Q gi|254781051|r 613 AFGSLCYEGYKVRLSGQDCERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNAL 692 (957) Q Consensus 613 A~~SLl~eG~~VRlsGQD~~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L 692 (957) |..-++.+-=.|=+-|||+++ -|-.|---..|.+.-+.=-|.|.++||.+.+|+--|.++.- -- T Consensus 13 al~~~m~~d~~v~~~GeDv~~--------------~gg~f~~t~gl~~kfg~~Rv~dtpIaE~~~vG~A~GlA~~G--~r 76 (326) T CHL00144 13 AIDEEMARDPRVAVIGEDVGH--------------YGGSYKVTKDLHPKYGDLRVLDTPIAENSFTGMAIGAAMTG--LR 76 (326) T ss_pred HHHHHHHHCCCEEEECCCCCC--------------CCCCHHHHHHHHHHHCCCCEECCCCCHHHHHHHHHHHHHCC--CE T ss_conf 999999759988998377675--------------67806750679987385510067735788999999887479--97 Q ss_pred EEECCCCCCCCCCCHHHHHHHHHHH-HHHHC----CCCCCEEECCCC-CCCCCCCCCCCCHHHHHHHCCCCCEEEEECCC Q ss_conf 1100002442454102256554202-57623----336813543688-88888765644057887521437826972488 Q gi|254781051|r 693 TIWEAQFGDFANGAQVILDQFITAG-EQKWL----CTSNLVCLLPHG-YEGQGPEHSSARLERFLQMCAENNMYVANCTS 766 (957) Q Consensus 693 ~iWEAQFGDF~NgAQviiDqfiss~-e~KW~----~~sglvllLPHG-yeGqGPEHSSaRlERfLQl~a~~N~~V~~~tt 766 (957) .+=|.||+||. +.-+||.+..+ --+|+ ...-+|.-.|-| +-|+||.||-. .|-+ ++.=.||.|+.||+ T Consensus 77 Piv~~~~~~F~---~~a~dQi~n~aa~~~~~~gg~~~vplvir~~~g~g~~~g~~Hs~~-~~a~--~~~iPgl~Vv~Ps~ 150 (326) T CHL00144 77 PVVEGMNMGFL---LLAFNQISNNAGMLHYTSGGNFTIPIVIRGPGGVGRQLGAEHSQR-LESY--FQAVPGLQIVACST 150 (326) T ss_pred EEEEEEHHHHH---HHHHHHHHHHHHHHHHCCCCCEEEEEEEEECCCCCCCCCCCCCCC-HHHH--HHCCCCCEEEECCC T ss_conf 99997467788---999999999988986034786135799984777678875412445-6889--83389958995698 Q ss_pred HHHHHHHHHHHHCCCCCCCEEEECCCHHHCCCCCCCCHHHH--CCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEEC Q ss_conf 99999999997435348868985660221080126887893--7998603515776433444544457954576899868 Q gi|254781051|r 767 PANYFHILRRQIYDRSSRPLIMMAPKSLLRHKRVVSSLSDM--TCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCT 844 (957) Q Consensus 767 pAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~d~--~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~cs 844 (957) |.....+|+.=+. ...|.+++-+|.|-+.+.-+. -+|+ .-|. - .|+ .+-+.| .|+|. T Consensus 151 ~~da~~ll~~ai~--~~~Pv~~~e~~~ly~~~~~vp-~~~~~~~~Gk-a-~v~--------------r~G~Dv--TIVa~ 209 (326) T CHL00144 151 PYNAKGLLKSAIR--SDNPVLFFEHVLLYNLKEEIP-DGEYLLPLEK-A-ELV--------------RDGEDI--TILTY 209 (326) T ss_pred HHHHHHHHHHHHH--CCCCEEEEECCCCCCCCCCCC-CCCCCCCCCE-E-EEE--------------EECCCE--EEEEC T ss_conf 7888999999997--289818995345578764477-6556654676-8-999--------------727987--99930 Q ss_pred CHHHHHHHHHHHHCCC--CCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHH-HCCCC Q ss_conf 8578999999997699--86899984014868289999998537998399806785317845789976999987-50567 Q gi|254781051|r 845 GKVYYDLLDNRDMRNI--ADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYLEKVLH-SIGAD 921 (957) Q Consensus 845 GKvyydL~~~r~~~~~--~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl~~~l~-~~~~~ 921 (957) |-.-+.-+++-+.... -++-+|-+--|.||..+.|.+-++|..+ +|=+.|-+.+-|-=+-|-.++.+-.. .+ T Consensus 210 G~mv~~al~Aae~L~~~gI~~eVIDlrtl~PlD~e~I~~sv~kT~r--lv~veE~~~~gG~Gsei~a~i~e~~f~~l--- 284 (326) T CHL00144 210 SRMRHHVIQAVKLLVTKGYDPEVIDLISLKPLDMTTISTSIKKTHK--VLIVEECMKTGGIGAELLALINENLFDEL--- 284 (326) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHCCC--EEEEECCCCCCCHHHHHHHHHHHHHHHHC--- T ss_conf 6779999999999997799768986045799999999999985498--99997699889789999999999758655--- Q ss_pred CCEEEEEECCCCCCCCCCCHHHHHHHH-HHHHHH Q ss_conf 760189753664672338678999999-999998 Q gi|254781051|r 922 YSRVRYVGRLPSASTAVGHMSRHLEQL-SSFIED 954 (957) Q Consensus 922 ~~~~~y~gR~~~aspA~G~~~~H~~e~-~~li~~ 954 (957) ..++.-+|-+..+.|........-.-+ ++|++. T Consensus 285 ~~p~~Ri~~~d~p~P~s~~lE~~~~p~~~~I~~a 318 (326) T CHL00144 285 DAPIFRLSSQDVPTPYNGSLEEATVIQPEQIIEA 318 (326) T ss_pred CCCCEEECCCCCCCCCCHHHHHHHCCCHHHHHHH T ss_conf 9997896889808989977998759999999999 No 21 >PRK09212 pyruvate dehydrogenase subunit beta; Validated Probab=99.14 E-value=7.1e-08 Score=79.27 Aligned_cols=300 Identities=18% Similarity=0.196 Sum_probs=202.2 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCCCC--CCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCC Q ss_conf 14789888988635831578501002--2333230279997788844556344278984599982431111013432022 Q gi|254781051|r 607 SMAESLAFGSLCYEGYKVRLSGQDCE--RGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGY 684 (957) Q Consensus 607 ~~AE~LA~~SLl~eG~~VRlsGQD~~--RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGy 684 (957) .-|=..|..-++.+--.|=+=|||+. .|.|. -...+.+.-|+=-++|.++||.+.+|+--|. T Consensus 7 ~~ai~~al~~~~~~d~~v~~~geDv~~~~G~f~----------------~t~gl~~~fgp~R~~d~gIaEq~~vg~A~Gl 70 (327) T PRK09212 7 REALRDAMREEMERDPKVFLMGEEVGEYQGAYK----------------VTQGLLEQFGPKRVIDTPITEHGFAGLAVGA 70 (327) T ss_pred HHHHHHHHHHHHHHCCCEEEECCCCCCCCCCHH----------------HHHHHHHHHCCCCEECCCCCHHHHHHHHHHH T ss_conf 999999999999559899998277664678035----------------4168999858750316774489999999989 Q ss_pred CCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC------CEEECCCCC-CCCCCCCCCCCHHHHHHHCCCC Q ss_conf 113865311100002442454102256554202576233368------135436888-8888765644057887521437 Q gi|254781051|r 685 SLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWLCTSN------LVCLLPHGY-EGQGPEHSSARLERFLQMCAEN 757 (957) Q Consensus 685 s~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~~sg------lvllLPHGy-eGqGPEHSSaRlERfLQl~a~~ 757 (957) ++.-- -.+=|-||.||. +.-+||-+..+ +||+.+|| +|...|.|. -|+||.|| .-+|-++ +.=. T Consensus 71 A~~G~--~Piv~~~~~~f~---~ra~dQI~n~~-ak~~~~~gg~~~vpvv~r~~~g~~~~~g~~Hs-~~~~a~~--~~iP 141 (327) T PRK09212 71 AFAGL--RPIVEFMTFNFA---MQAIDQIVNSA-AKTNYMSGGQLKCPIVFRGPNGAAARVAAQHS-QCYASWY--AHIP 141 (327) T ss_pred HHCCC--CCEEEEEHHHHH---HHHHHHHHHHH-HHHHHHCCCCCCCEEEEEECCCCCCCCCCCCC-CCHHHHH--CCCC T ss_conf 86799--448997556677---78999999988-87524127852301899817724678774333-1568896--2899 Q ss_pred CEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHCCCCCCCCHH-HHCCCCCEEECCCCCCCCCCCCCCCCCCHHH Q ss_conf 826972488999999999974353488689856602210801268878-9379986035157764334445444579545 Q gi|254781051|r 758 NMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKSLLRHKRVVSSLS-DMTCGSVFQAVLSDDAEYHGKTSVKLKEDSH 836 (957) Q Consensus 758 N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~-d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~ 836 (957) ||.|+.|++|++..-+|+. +.+ ...|.+++-+|.|.+.+.-+..-+ .+.-|. -.|+- +-+. T Consensus 142 gl~Vv~Ps~~~d~~~ll~~-a~~-~~~Pvi~~e~~~ly~~~~~v~~~~~~~~iGK--a~vlr--------------eG~D 203 (327) T PRK09212 142 GLKVVAPYFAADCKGLLKT-AIR-DNNPVIFLENEILYGHSHEVPDEEESIPFGK--AAILR--------------EGSD 203 (327) T ss_pred CCEEEECCCHHHHHHHHHH-HHH-CCCCEEEEECCCCCCCCCCCCCCCCCCCCCE--EEEEE--------------ECCC T ss_conf 9889964887788999999-876-5998499824434687656776530233776--89998--------------5797 Q ss_pred CCEEEEECCHHHHHHHHHHHHCC--CCCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHH Q ss_conf 76899868857899999999769--9868999840148682899999985379983998067853178457899769999 Q gi|254781051|r 837 IRRVILCTGKVYYDLLDNRDMRN--IADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYLEKV 914 (957) Q Consensus 837 v~rvi~csGKvyydL~~~r~~~~--~~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl~~~ 914 (957) | .|+|+|-.-+.-+++-+... --++.+|-+--|.|+..+.|.+..++..+ ++=|+|-..+-|-=+-|-.++.+- T Consensus 204 v--TIva~G~mv~~al~AAe~L~~~GI~~eVIdl~tlkPlD~e~I~~sv~kT~~--lv~veE~~~~gG~Gsevaa~l~e~ 279 (327) T PRK09212 204 V--TIVTFSIQVKLALEAAELLEKEGISVEVIDLRTLRPLDTETIIESVKKTNR--LVVVEEGWPFAGVGAEIAALIMKE 279 (327) T ss_pred E--EEEEECHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHCCC--EEEEECCCCCCCHHHHHHHHHHHH T ss_conf 4--999864578999999999997599867885212789999999999986197--899967987887899999999997 Q ss_pred HHHCCCCCCEEEEEECCCCCCCCCCCHHHH-HHHHHHHHHHH Q ss_conf 875056776018975366467233867899-99999999987 Q gi|254781051|r 915 LHSIGADYSRVRYVGRLPSASTAVGHMSRH-LEQLSSFIEDA 955 (957) Q Consensus 915 l~~~~~~~~~~~y~gR~~~aspA~G~~~~H-~~e~~~li~~a 955 (957) .... -..++..+|.+..+.|..-....+ .-..++|++.+ T Consensus 280 ~f~~--L~~p~~rv~~~d~p~P~a~~LE~~~~p~~~~I~~av 319 (327) T PRK09212 280 AFDY--LDAPVERVTGKDVPLPYAANLEKLALPSEEDIIEAV 319 (327) T ss_pred HHHH--CCCCCEEECCCCCCCCCCHHHHHHHCCCHHHHHHHH T ss_conf 5864--589867968898089899768988498999999999 No 22 >cd07036 TPP_PYR_E1-PDHc-beta_like Pyrimidine (PYR) binding domain of the beta subunits of the E1 components of human pyruvate dehydrogenase complex (E1- PDHc) and related proteins. Thiamine pyrophosphate (TPP) family, pyrimidine (PYR) binding domain of the beta subunits of the E1 components of: human pyruvate dehydrogenase complex (E1- PDHc), the acetoin dehydrogenase complex (ADC), and the branched chain alpha-keto acid dehydrogenase/2-oxoisovalerate dehydrogenase complex (BCADC), subfamily. The PYR domain is found in many key metabolic enzymes which use TPP (also known as thiamine diphosphate) as a cofactor. TPP binds in the cleft formed by a PYR domain and a PP domain. The PYR domain, binds the aminopyrimidine ring of TPP, the PP domain binds the diphosphate residue. A polar interaction between the conserved glutamate of the PYR domain and the N1' of the TPP aminopyrimidine ring is shared by most TPP-dependent enzymes, and participates in the activation of TPP. The PYR and PP domain Probab=98.86 E-value=2e-08 Score=83.40 Aligned_cols=156 Identities=24% Similarity=0.373 Sum_probs=112.5 Q ss_pred HHHHHHHHHCCCEEEEECCCCCC--CCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCC Q ss_conf 98889886358315785010022--3332302799977888445563442789845999824311110134320221138 Q gi|254781051|r 611 SLAFGSLCYEGYKVRLSGQDCER--GTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNN 688 (957) Q Consensus 611 ~LA~~SLl~eG~~VRlsGQD~~R--GTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~ 688 (957) ..|+.-++.+--.|=+-|||++. |-| ---..+.+.-++=-++|.+++|.+.+|+-.|.++. T Consensus 4 ~~al~~~~~~d~~v~~~g~dv~~~~g~~----------------~~t~gl~~~fgp~R~~d~gIaE~~~vg~a~GlAl~- 66 (167) T cd07036 4 NEALDEEMERDPRVVVLGEDVGDYGGVF----------------KVTKGLLDKFGPDRVIDTPIAEAGIVGLAVGAAMN- 66 (167) T ss_pred HHHHHHHHHHCCCEEEEECCCCCCCCCH----------------HHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHC- T ss_conf 9999999976989999957837477805----------------77479999857663237996889999999999985- Q ss_pred CCCCEEECCCCCCCCCCCHHHHHHHHHHH-HHHHC----CCCCCEEECCCCCC-CCCCCCCCCCHHHHHHHCCCCCEEEE Q ss_conf 65311100002442454102256554202-57623----33681354368888-88876564405788752143782697 Q gi|254781051|r 689 LNALTIWEAQFGDFANGAQVILDQFITAG-EQKWL----CTSNLVCLLPHGYE-GQGPEHSSARLERFLQMCAENNMYVA 762 (957) Q Consensus 689 p~~L~iWEAQFGDF~NgAQviiDqfiss~-e~KW~----~~sglvllLPHGye-GqGPEHSSaRlERfLQl~a~~N~~V~ 762 (957) -.-.++|.||.||..- -+||.+... ..+|. ....+|+..|+|.- |+||.|| ..+|-++. .=.||.|+ T Consensus 67 -G~~Pvv~~~~~~Fl~~---a~dQi~n~~a~~~~~s~g~~~~pvvir~~~g~~~~~g~~Hs-~~~~a~~~--~iPgl~V~ 139 (167) T cd07036 67 -GLRPIVEIMFADFALP---AFDQIVNEAAKLRYMSGGQFKVPIVIRGPNGGGIGGGAQHS-QSLEAWFA--HIPGLKVV 139 (167) T ss_pred -CCEEEEEEEEHHHHHH---HHHHHHHHHHHHEECCCCCCCCCEEEEECCCCCCCCCCCCC-CCHHHHHC--CCCCCEEE T ss_conf -9937988622555400---59999977567612038940588899960667767453003-68799980--79996899 Q ss_pred ECCCHHHHHHHHHHHHCCCCCCCEEEECCC Q ss_conf 248899999999997435348868985660 Q gi|254781051|r 763 NCTSPANYFHILRRQIYDRSSRPLIMMAPK 792 (957) Q Consensus 763 ~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPK 792 (957) .|+||.++..+||.=+.. ..|.|++-|| T Consensus 140 ~Ps~~~d~~~ll~~a~~~--~~Pv~~~E~k 167 (167) T cd07036 140 APSTPYDAKGLLKAAIRD--DDPVIFLEHK 167 (167) T ss_pred EECCHHHHHHHHHHHHHC--CCCEEEEECC T ss_conf 729999999999999839--9978999339 No 23 >COG0022 AcoB Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit [Energy production and conversion] Probab=98.72 E-value=1.1e-05 Score=62.74 Aligned_cols=265 Identities=20% Similarity=0.277 Sum_probs=177.1 Q ss_pred HHHHHHHHHCCCEEEEECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCC Q ss_conf 98889886358315785010022333230279997788844556344278984599982431111013432022113865 Q gi|254781051|r 611 SLAFGSLCYEGYKVRLSGQDCERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLN 690 (957) Q Consensus 611 ~LA~~SLl~eG~~VRlsGQD~~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~ 690 (957) ..|..--+..--.|=+-||||+. .|-.|---..|.+.-|.--|.|.||||.|..|.--|-++.-- T Consensus 9 ~~Am~~eM~rD~~V~v~GEDVg~--------------~GGvf~~T~GL~~kfG~~RV~DTPiaE~gi~G~avGaA~~Gl- 73 (324) T COG0022 9 NEAMDEEMERDERVVVLGEDVGV--------------YGGVFRVTKGLQEKFGEERVIDTPIAESGIAGIAVGAALTGL- 73 (324) T ss_pred HHHHHHHHHCCCCEEEECCCCCC--------------CCCCHHHCHHHHHHHCCCCEECCCCCHHHHHHHHHHHHHCCC- T ss_conf 99999998129988998235230--------------277012033388770854000576521216889999997089- Q ss_pred CCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCC-----CCCCEEECCCCCCCCCCCCCCCCHHHHHHHCCCCCEEEEECC Q ss_conf 311100002442454102256554202576233-----368135436888888876564405788752143782697248 Q gi|254781051|r 691 ALTIWEAQFGDFANGAQVILDQFITAGEQKWLC-----TSNLVCLLPHGYEGQGPEHSSARLERFLQMCAENNMYVANCT 765 (957) Q Consensus 691 ~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~-----~sglvllLPHGyeGqGPEHSSaRlERfLQl~a~~N~~V~~~t 765 (957) -.|=|-||.||+-- .+||-++.+-.---| .+-+|+=-|.|--=+|++|-|-.+|-|..-+ .-+.|+.|| T Consensus 74 -rPivEiqf~dF~~~---a~dqi~n~aAk~ryrsgG~~~~PiviR~p~G~g~~~~~~HSqs~ea~f~h~--PGlKVV~PS 147 (324) T COG0022 74 -RPIVEIQFADFIYP---AFDQIVNQAAKIRYRSGGQFTVPIVIRTPNGGGIGGGAQHSQSLEALFAHI--PGLKVVMPS 147 (324) T ss_pred -CCEEEEEECCHHHH---HHHHHHHHHHHHHHHCCCCEECCEEEECCCCCCCCCHHHCCCCHHHHHHCC--CCCEEEECC T ss_conf -62489863013699---999999999988410489643677997478777774443368889998338--983698179 Q ss_pred CHHHHHHHHHHHHCCCCCCCEEEECCCHHHCC-CCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEEC Q ss_conf 89999999999743534886898566022108-01268878937998603515776433444544457954576899868 Q gi|254781051|r 766 SPANYFHILRRQIYDRSSRPLIMMAPKSLLRH-KRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCT 844 (957) Q Consensus 766 tpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~-~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~cs 844 (957) ||+..--||.--+.. .-|.|.+-||.|.|. +.-+. ++ .|- | +-++.++..+.+= --|++- T Consensus 148 tpyDAKGLL~aAIrd--~dPViflE~k~lY~~~~~eVP------~~-~Y~--i-------PlGkA~i~reG~D-vTivty 208 (324) T COG0022 148 TPYDAKGLLKAAIRD--PDPVIFLEHKRLYRSFKGEVP------EE-DYT--I-------PLGKAKIVREGSD-VTIVTY 208 (324) T ss_pred CHHHHHHHHHHHHCC--CCCEEEEECHHHHCCCCCCCC------CC-CCC--C-------CCCCEEEEECCCC-EEEEEE T ss_conf 757778899998258--998799732887565655789------87-742--2-------1560336744885-699985 Q ss_pred CH-HHHHHHHHHHHC-CCCCEEEEEEECCCCCHHHHHHHHHHHCCCC-EEEEECCCCCCCCCHHHHHHHH-HHHHHHC Q ss_conf 85-789999999976-9986899984014868289999998537998-3998067853178457899769-9998750 Q gi|254781051|r 845 GK-VYYDLLDNRDMR-NIADIYLLRIEQLYPFPEDYLIKVLSRFVQA-EIVWCQEEPQNMGAWTFIEPYL-EKVLHSI 918 (957) Q Consensus 845 GK-vyydL~~~r~~~-~~~~vaivRiEqLyPfP~~~l~~~l~~y~~~-e~vW~QEEp~N~GaW~~v~~rl-~~~l~~~ 918 (957) |. |..-|.++-+.. .--++-+|-+--|.|+-.+-+. +.-.+. .++-++|-++-.|-=.-+-..+ ++++..+ T Consensus 209 g~mv~~al~AAe~l~~~Gis~EVIDLRTl~PlD~etIi---~SvkKTgR~viV~Ea~~~~g~gaei~A~i~e~~f~~L 283 (324) T COG0022 209 GAMVHTALEAAEELEKEGISAEVIDLRTLSPLDKETII---ASVKKTGRLVIVHEAPKTGGIGAEIAALIAEEAFDYL 283 (324) T ss_pred CHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHH---HHHHHHCCEEEEEECCCCCCHHHHHHHHHHHHHHHHH T ss_conf 16889999999998626987699724346766799999---9988619689998256567728999999999999751 No 24 >PRK05444 1-deoxy-D-xylulose-5-phosphate synthase; Provisional Probab=98.60 E-value=0.00014 Score=54.32 Aligned_cols=421 Identities=19% Similarity=0.257 Sum_probs=222.1 Q ss_pred CCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCC Q ss_conf 73100044553236777764166654455565445404788862575333467258876451067744365799984160 Q gi|254781051|r 333 SHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQ 412 (957) Q Consensus 333 SHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQ 412 (957) .||-.==+++.|++.|+.-... ..+|.+|+ ||.+.. .||++|.+++..- +. +.-|=|+-+|+ T Consensus 117 G~~g~gls~a~G~A~a~~l~~~------------~~~v~~~~--GDG~l~-eGis~EA~s~Ag~--~~-~~liii~~DN~ 178 (576) T PRK05444 117 GHSSTSISAALGMAKARDLKGE------------DRKVVAVI--GDGALT-GGMAFEALNNAGD--LK-SDLIVILNDNE 178 (576) T ss_pred CCCCCHHHHHHHHHHHHHHCCC------------CCEEEEEE--CCHHHC-CHHHHHHHHHHHH--CC-CCEEEEEECCC T ss_conf 8762178899999999984699------------95499998--140103-3148999975553--38-87699997798 Q ss_pred EECCCCCHHCCCCCCCCHHHHHHCCCEE-EECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCC--CCCCCCCCCC Q ss_conf 2012470102467784001122034058-742678899999999999999973897399864211256--5643542235 Q gi|254781051|r 413 IGFTTNPSSARSCTYASDISKSIGIPIF-HVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFG--HNEGDEPSFT 489 (957) Q Consensus 413 IGFTT~p~d~RSs~Y~TDiAK~i~aPI~-HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~G--HNE~DeP~fT 489 (957) |....+- .+ .+..++-+.++-=++ .|+|.|++++..|.+-|- +..|-++|-...-.=+| -+|.| T Consensus 179 isI~~~v-g~---~~~~~~Fea~Gw~~i~~iDGhd~~~i~~al~~ak----~~~~P~lI~~kT~~G~G~~~~e~~----- 245 (576) T PRK05444 179 MSISPNV-GA---LSNYTLFEELGFKYIGPIDGHDLDALVETLKNAK----DLKGPVLLHVVTKKGKGYAPAEAD----- 245 (576) T ss_pred CCCCCCC-CC---CCCCHHHHHCCCEEECCCCCCCHHHHHHHHHHHH----HCCCCEEEEEEEECCCCCCCHHCC----- T ss_conf 2007886-54---3310178980994868658989999999999888----669998999997056677812205----- Q ss_pred CHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHH Q ss_conf 57888886718998999999986448989899999999999997756665412587656666568626667789999999 Q gi|254781051|r 490 QPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPEKLGLLHNGENERKNSVSKEILKK 569 (957) Q Consensus 490 QP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~~~~~~~~~~~~~~T~v~~~~L~~ 569 (957) |.-| |-.. .++.++-+ ...+...+ .+ +.+-.+.|.+ T Consensus 246 -~~~~-----Hg~~------------~f~~~~~~--------------~~~~~~~~-----------~~-~~~f~~~L~~ 281 (576) T PRK05444 246 -PIKY-----HGVG------------KFDPETGK--------------QPKKPGKP-----------SY-TKVFGETLCE 281 (576) T ss_pred -HHHC-----CCCC------------CCCHHHCC--------------CCCCCCCC-----------CH-HHHHHHHHHH T ss_conf -1222-----5668------------87944343--------------46776650-----------69-9999999999 Q ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEEEEEECCCC Q ss_conf 98874368998620567788899899873343564211478988898863583157850100223332302799977888 Q gi|254781051|r 570 IGSSIIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGYKVRLSGQDCERGTFSHRHAILCDQETG 649 (957) Q Consensus 570 i~~~l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~VRlsGQD~~RGTFshRHavl~dq~t~ 649 (957) ++++ +..|-=.+| +|+-+| | -..|+. T Consensus 282 l~~~---------------------------~~~iv~ita-a~~~~t-----------~----l~~f~~----------- 307 (576) T PRK05444 282 LAEK---------------------------DPKIVAITA-AMPEGT-----------G----LDEFSK----------- 307 (576) T ss_pred HHHH---------------------------CCCCCEEHH-HCCCCC-----------C----CCHHHH----------- T ss_conf 9874---------------------------302003054-214786-----------6----443677----------- Q ss_pred CEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEE Q ss_conf 44556344278984599982431111013432022113865311100002442454102256554202576233368135 Q gi|254781051|r 650 KRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWLCTSNLVC 729 (957) Q Consensus 650 ~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~~sglvl 729 (957) -.| .+| +|--..|...+++--|.++. ++.-+=+-|+.|.+ --+||.+...- .+....++ T Consensus 308 --------~~P--~R~--~~~GIaEqhav~~aaGlA~~---G~~Pf~~~~stFl~---Ra~dQi~~d~a---l~~l~V~~ 366 (576) T PRK05444 308 --------RFP--DRY--FDVGIAEQHAVTFAAGLATE---GLKPVVAIYSTFLQ---RAYDQVIHDVA---LQNLPVTF 366 (576) T ss_pred --------HCC--CCE--EECCCHHHHHHHHHHHHHHC---CCCCEEEEECHHHH---HHHHHHHHHHH---HCCCCEEE T ss_conf --------587--634--11574577799997458656---99866888212877---76999987654---12797689 Q ss_pred ECCC-CCCC-CCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHCCCCCCCCHHHH Q ss_conf 4368-8888-8876564405788752143782697248899999999997435348868985660221080126887893 Q gi|254781051|r 730 LLPH-GYEG-QGPEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKSLLRHKRVVSSLSDM 807 (957) Q Consensus 730 lLPH-GyeG-qGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~d~ 807 (957) ..-| |.-| -||-|.. +|-.--|-+=.||.|+.|+.+.+.-.+++.-+. .-..|..+-.|+.-+-. ...+..+.| T Consensus 367 v~d~~GlvGeDGpTHq~--iedla~lR~iPnl~V~~PaD~~E~~~~~~~Al~-~~~gP~~irl~R~~~~~-~~~~~~~~~ 442 (576) T PRK05444 367 AIDRAGLVGADGPTHQG--AFDLSYLRCIPNMVIMAPSDENELRQMLYTALA-YDDGPTAIRYPRGSGVG-VELPELEPL 442 (576) T ss_pred EEECCCCCCCCCCCCCC--HHHHHHHCCCCCEEEEEECCHHHHHHHHHHHHH-CCCCCEEEEECCCCCCC-CCCCCCCCC T ss_conf 96078667899851003--799998526998189962999999999999985-78998899942788888-787765434 Q ss_pred CCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCCC--CCEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 79986035157764334445444579545768998688578999999997699--8689998401486828999999853 Q gi|254781051|r 808 TCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRNI--ADIYLLRIEQLYPFPEDYLIKVLSR 885 (957) Q Consensus 808 ~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~~--~~vaivRiEqLyPfP~~~l~~~l~~ 885 (957) ..|. . .++.+ .+ +-+|+++|-....-+++.++... -++.+|-+-.+-||+.+.+.+++++ T Consensus 443 ~~Gk-~-~vl~~--------------G~--dv~IiatGs~v~~Al~Aa~~L~~~Gi~~~VVs~~~ikPlD~e~l~~~~~~ 504 (576) T PRK05444 443 PIGK-G-EVLRE--------------GS--DVAILAFGTMLAEALEAAERLAAEGISATVVDARFVKPLDEELLLELAAK 504 (576) T ss_pred CCCE-E-EEEEC--------------CC--CEEEEECCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHH T ss_conf 4650-8-99962--------------89--78999538799999999999985699879995587788799999999954 Q ss_pred CCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCC-CCCEEEEEECC Q ss_conf 79983998067853178457899769999875056-77601897536 Q gi|254781051|r 886 FVQAEIVWCQEEPQNMGAWTFIEPYLEKVLHSIGA-DYSRVRYVGRL 931 (957) Q Consensus 886 y~~~e~vW~QEEp~N~GaW~~v~~rl~~~l~~~~~-~~~~~~y~gR~ 931 (957) |. .+++=||+--.|+|.- .+.+.+...+. ...++..+|=+ T Consensus 505 ~~---~ivtvEeh~~~GGlGs---~v~e~l~~~g~~~~~~~~~iGi~ 545 (576) T PRK05444 505 HE---LLVTVEENAIMGGFGS---AVLEFLAEHGLLKPVKVLNLGLP 545 (576) T ss_pred CC---EEEEEECCCCCCCHHH---HHHHHHHHCCCCCCCCEEEEEEC T ss_conf 89---7999938975765899---99999997599779746999848 No 25 >KOG1182 consensus Probab=97.71 E-value=0.0039 Score=43.43 Aligned_cols=335 Identities=18% Similarity=0.206 Sum_probs=214.5 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCE-------ECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHH Q ss_conf 88879889999999999999899999864033622-------22564658999999999998737910799806765526 Q gi|254781051|r 203 SSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKR-------FGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRL 275 (957) Q Consensus 203 ~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KR-------FsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRL 275 (957) .++++.+.-..+++.++.-...++-|--. .-|-| ||-||. -|-.|+...-+++|.|-=--- T Consensus 80 dp~~~ee~~~kmy~~M~~Ln~MD~IlYes-QRQGRiSFYmT~~GEEa~---------higSAAAL~p~Dli~gQYREa-- 147 (432) T KOG1182 80 DPQLSEEVVLKMYKDMTLLNIMDRILYES-QRQGRISFYMTNFGEEAI---------HIGSAAALEPQDLIYGQYREA-- 147 (432) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHH-HHCCEEEEEEECCCHHHH---------HHHHHHHCCCCCCCCCCCCCC-- T ss_conf 78846999999999889999999998887-636437898703441231---------012354178302223113557-- Q ss_pred HHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCC Q ss_conf 89998853798999998558874322345551001502688983596189997178773100044553236777764166 Q gi|254781051|r 276 NVLSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAG 355 (957) Q Consensus 276 NVL~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d 355 (957) -||. --|.+++...+.--|+.++-..| --..-|.|..- + +-+.|| |-|-.--|-+.|.+-|--.+.++ T Consensus 148 GVLl-wRgftle~f~~qCyGn~~d~gkG-rQMPvHyGs~e-l----nf~tis-----splatqlpqAvGaaYa~k~~~~n 215 (432) T KOG1182 148 GVLL-WRGFTLEEFMNQCYGNKSDLGKG-RQMPVHYGSKE-L----NFVTIS-----SPLATQLPQAVGAAYALKMRKKN 215 (432) T ss_pred CEEE-ECCCCHHHHHHHHCCCCCCCCCC-CCCCCCCCCCC-C----CEEEEC-----CHHHHCCCHHHHHHHHHHHCCCC T ss_conf 3688-84852999998742785301466-53652137552-3----269961-----55430262655456665404467 Q ss_pred CCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHH-CCCCCCCCHHHHH Q ss_conf 654455565445404788862575333467258876451067744365799984160201247010-2467784001122 Q gi|254781051|r 356 IVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSS-ARSCTYASDISKS 434 (957) Q Consensus 356 ~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d-~RSs~Y~TDiAK~ 434 (957) ....|.--||.|-+ .|=..-.||++-+-+- -.|-+.-||--.-.|...+ -||--- ---+.. T Consensus 216 -------------nac~V~yfGdG~aS-EGD~HA~~NfAAtle~---Pvif~CRNNG~AISTptseQyr~DGI-a~kG~a 277 (432) T KOG1182 216 -------------NACAVTYFGDGAAS-EGDAHAAFNFAATLEC---PVIFFCRNNGWAISTPTSEQYRGDGI-AVKGPA 277 (432) T ss_pred -------------CEEEEEEECCCCCC-CCCHHHHHHHHHHHCC---CEEEEECCCCEEECCCCHHHHCCCCE-EEECCC T ss_conf -------------80799994578666-6532333207877279---88999827975640564787067855-873465 Q ss_pred HCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCH---HHHHHHHHCCCHHHHHHHHH Q ss_conf 034058742678899999999999999973897399864211256564354223557---88888671899899999998 Q gi|254781051|r 435 IGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQP---MMYKTIRSHKSVLQLYADSL 511 (957) Q Consensus 435 i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP---~mY~~I~~hp~v~~~Y~~~L 511 (957) +++--+.|.|.|.-||.-|++-|-++--+=.+-|+|....||---|.-.||-.---+ +-|=.-..|| ...+.+-+ T Consensus 278 YGi~sIRVDGnD~lAvYnA~k~ARe~av~e~rPvliEamtYRvGHHSTSDDSt~YRsadEiq~W~~~~~p--isrfr~~i 355 (432) T KOG1182 278 YGIRSIRVDGNDALAVYNAVKEAREMAVTEQRPVLIEAMTYRVGHHSTSDDSTAYRSADEIQYWNKSRHP--ISRFRKYI 355 (432) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHCCCCC--HHHHHHHH T ss_conf 6357998668316799998899999987516860356456652466677753013551255454305680--79999988 Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCC Q ss_conf 64489898999999999999977566654125-876566665686266677899999999887436899862 Q gi|254781051|r 512 MKNQVISKQELQSLANNWHKYLEAEYKESESY-CPEKLGLLHNGENERKNSVSKEILKKIGSSIIRLPKSFN 582 (957) Q Consensus 512 i~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~-~~~~~~~~~~~~~~~~T~v~~~~L~~i~~~l~~~Pe~f~ 582 (957) ...|..+++.-.++.++.++++-+++..+..+ +|++...+.+-....+.-+..+ -..|.+.+.+.|+++- T Consensus 356 ~~~GWw~ee~E~~~rk~~rk~vl~a~~~aEk~~K~~~~~lF~dVYd~~P~~L~eq-~~~l~~~i~~~~e~Yp 426 (432) T KOG1182 356 ESNGWWSEEDESELRKNIRKKVLEAIAAAEKKEKPNLTELFEDVYDEMPKNLREQ-ELELKEHIEKHPEHYP 426 (432) T ss_pred HHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHCCCCCC T ss_conf 7467757146899999999999999998761358998998888865324768999-9999999986733385 No 26 >PRK12571 1-deoxy-D-xylulose-5-phosphate synthase; Provisional Probab=97.58 E-value=0.011 Score=39.95 Aligned_cols=226 Identities=15% Similarity=0.201 Sum_probs=140.2 Q ss_pred EECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHH-HHHHHCCCCCCEEECCCC-C-CCCCCCCCC Q ss_conf 8243111101343202211386531110000244245410225655420-257623336813543688-8-888876564 Q gi|254781051|r 668 VNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITA-GEQKWLCTSNLVCLLPHG-Y-EGQGPEHSS 744 (957) Q Consensus 668 ~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss-~e~KW~~~sglvllLPHG-y-eGqGPEHSS 744 (957) ++--..|.+..++--|.+. ..+.-.=+-|.+|. |-..||-+-+ +- +...+++..-|. . -..||-|-. T Consensus 364 ~~~GIrE~~m~a~a~GlA~---~G~~P~~~tf~~F~---~ra~~qi~~~~al----~~lpv~~v~~~~GlvGeDGpTHq~ 433 (642) T PRK12571 364 FDVGIAEQHAVTFAAGLAA---AGLKPFCAIYSTFL---QRGYDQVLHDVAL----QKLPVRFVIDRAGLVGADGATHAG 433 (642) T ss_pred CCCCHHHHHHHHHHHHHHH---CCCCEEEEEECCHH---HHHHHHHHHHHHH----CCCCEEEEEECCCCCCCCCCCCCC T ss_conf 5524556789999634764---69962899967445---5437887632551----598669999667324799876424 Q ss_pred CCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCHHHCCCCCCCCHHHHCCCCCEEECCCCCCCCC Q ss_conf 40578875214378269724889999999999743534886898566022108012688789379986035157764334 Q gi|254781051|r 745 ARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKSLLRHKRVVSSLSDMTCGSVFQAVLSDDAEYH 824 (957) Q Consensus 745 aRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~d~~~g~~F~~vi~d~~~~~ 824 (957) +|-.--|.+=.||.|+-|+.+.+.-.+++--+.+ -..|-++..|+.-+........-..+.-| . -.++.+. T Consensus 434 --iedla~lR~iPnl~V~~PaD~~E~~~a~~~al~~-~~gP~~i~l~R~~~~~~~~~~~~~~~~~g-~-~~~~~~g---- 504 (642) T PRK12571 434 --MFDMAFLTCLPNLRVMAPADEAELRHMLRTAVAH-DDGPIAVRYPRGEGVGVEIPAVGRVLGIG-K-GRVPREG---- 504 (642) T ss_pred --HHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHC-CCCCEEEEECCCCCCCCCCCCCCCCCCCC-E-EEEECCC---- T ss_conf --3778875479981899129999999999999866-89988999527878887766421113564-3-8994279---- Q ss_pred CCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCCC--CCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCC Q ss_conf 445444579545768998688578999999997699--868999840148682899999985379983998067853178 Q gi|254781051|r 825 GKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRNI--ADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMG 902 (957) Q Consensus 825 ~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~~--~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~G 902 (957) .-.=+|+++|-....-+++.++... -++.+|-+.-+-||..+.+.....++ ..+-+=|+..- | T Consensus 505 -----------~~dv~lia~Gs~v~~Al~Aa~~L~~~Gi~~~Vv~~~~~kp~D~~~~~~~~~~~---~~vv~~E~~~~-g 569 (642) T PRK12571 505 -----------RPDVAILSVGAHLHECLEAAELLEAEGISVTVADARFVKPLDEALIAQLARRH---RVLVTVEDGAM-G 569 (642) T ss_pred -----------CCCEEEEEEHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHC---CEEEEEECCCC-C T ss_conf -----------99989998609999999999999866998699816737877999999997009---88999907851-2 Q ss_pred CH-HHHHHHHHHHHHHCCC--CCCEEEEEECC Q ss_conf 45-7899769999875056--77601897536 Q gi|254781051|r 903 AW-TFIEPYLEKVLHSIGA--DYSRVRYVGRL 931 (957) Q Consensus 903 aW-~~v~~rl~~~l~~~~~--~~~~~~y~gR~ 931 (957) .| .++...+.. .+. ...+++.+|=| T Consensus 570 G~g~~v~~~~~~----~g~~~~~~~~~~~Gi~ 597 (642) T PRK12571 570 GFGAHVLHHLAD----AGLLDGGLKLRTLGLP 597 (642) T ss_pred CHHHHHHHHHHH----CCCCCCCCEEEEEECC T ss_conf 389999999997----4987789738998358 No 27 >cd02012 TPP_TK Thiamine pyrophosphate (TPP) family, Transketolase (TK) subfamily, TPP-binding module; TK catalyzes the transfer of a two-carbon unit from ketose phosphates to aldose phosphates. In heterotrophic organisms, TK provides a link between glycolysis and the pentose phosphate pathway and provides precursors for nucleotide, aromatic amino acid and vitamin biosynthesis. In addition, the enzyme plays a central role in the Calvin cycle in plants. Typically, TKs are homodimers. They require TPP and divalent cations, such as magnesium ions, for activity. Probab=97.54 E-value=0.0011 Score=47.57 Aligned_cols=123 Identities=20% Similarity=0.223 Sum_probs=88.7 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCC Q ss_conf 04455323677776416665445556544540478886257533346725887645106774436579998416020124 Q gi|254781051|r 338 VDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTT 417 (957) Q Consensus 338 VnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT 417 (957) -=|++.|.|-|..... ...+|.+|+ ||++.. .|+++|.|+++... ..+=-|=||=||++.-.. T Consensus 110 gls~a~G~A~a~k~~~------------~~~~v~~~i--GDGel~-EG~~wEAl~~A~~~--~L~nLi~ivD~N~~~~~g 172 (255) T cd02012 110 GLSVAVGMALAEKLLG------------FDYRVYVLL--GDGELQ-EGSVWEAASFAGHY--KLDNLIAIVDSNRIQIDG 172 (255) T ss_pred HHHHHHHHHHHHHHCC------------CCCCEEEEE--CCCCCC-CCHHHHHHHHHHHC--CCCCEEEEECCCCCEECC T ss_conf 8999999999998628------------887179994--251103-31289999998555--877569998689826256 Q ss_pred CCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCC Q ss_conf 701024677840011220340587426788999999999999999738973998642112565 Q gi|254781051|r 418 NPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGH 480 (957) Q Consensus 418 ~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GH 480 (957) ...+..+..--.+..+.++-.++.|||.|.+++..+.+-| +..-+|-++|-..+-.=+|- T Consensus 173 ~~~~~~~~~~l~~~~~sfG~~v~~vdGhd~~~i~~a~~~a---~~~~~kP~~I~~~T~KGkG~ 232 (255) T cd02012 173 PTDDILFTEDLAKKFEAFGWNVIEVDGHDVEEILAALEEA---KKSKGKPTLIIAKTIKGKGV 232 (255) T ss_pred CCCCCCCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHH---HHCCCCCEEEEEEEEECCCC T ss_conf 0302547688999999669811101799999999999999---86799958999996511488 No 28 >KOG0525 consensus Probab=97.26 E-value=0.0019 Score=45.82 Aligned_cols=246 Identities=22% Similarity=0.287 Sum_probs=160.1 Q ss_pred EEEEECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCC Q ss_conf 15785010022333230279997788844556344278984599982431111013432022113865311100002442 Q gi|254781051|r 623 KVRLSGQDCERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDF 702 (957) Q Consensus 623 ~VRlsGQD~~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF 702 (957) .--+-|+||.=|- |+.+.. .|.+.-|+=.|.|.||.|.|..||-.|... -.+-.|-|-||+|+ T Consensus 60 ~a~vfgedv~fgg------vfrct~---------gl~~kfgk~rvfntplceqgivgfgig~aa--~g~~aiaeiqfady 122 (362) T KOG0525 60 RAVVFGEDVAFGG------VFRCTT---------GLAEKFGKDRVFNTPLCEQGIVGFGIGLAA--MGATAIAEIQFADY 122 (362) T ss_pred CEEEECCCCCCCE------EEEEEC---------CHHHHHCCCCCCCCCHHHCCCCEECHHHHH--CCCCEEEEEEECCC T ss_conf 3278324542240------798510---------067773756302672022154200131254--06615888850221 Q ss_pred CCCCHHHHHHHHHHHHHHHCCCC-------CCEEECCCCCCCCCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHH Q ss_conf 45410225655420257623336-------81354368888888765644057887521437826972488999999999 Q gi|254781051|r 703 ANGAQVILDQFITAGEQKWLCTS-------NLVCLLPHGYEGQGPEHSSARLERFLQMCAENNMYVANCTSPANYFHILR 775 (957) Q Consensus 703 ~NgAQviiDqfiss~e~KW~~~s-------glvllLPHGyeGqGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LR 775 (957) .--| +||.+.- -+|+.-.| ||++--|-|--|-|.---|--.|-|. |--..+.|+.|-+|-|-=-||- T Consensus 123 ifpa---fdqivne-aakfryrsgnqfncg~ltir~p~gavghg~~yhsqspeaff--~h~pgikvviprsp~qakglll 196 (362) T KOG0525 123 IFPA---FDQIVNE-AAKFRYRSGNQFNCGGLTIRAPWGAVGHGALYHSQSPEAFF--CHVPGIKVVIPRSPRQAKGLLL 196 (362) T ss_pred CCHH---HHHHHHH-HHHHEECCCCCCCCCCEEEECCCCCCCCCCCCCCCCCHHHE--ECCCCCEEEECCCCCHHHCEEE T ss_conf 2255---8999878-77625235784456765774554424555301568823530--0489836982489504303142 Q ss_pred HHHCCCCCCCEEEECCCHHHCCCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHH Q ss_conf 97435348868985660221080126887893799860351577643344454445795457689986885789999999 Q gi|254781051|r 776 RQIYDRSSRPLIMMAPKSLLRHKRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNR 855 (957) Q Consensus 776 RQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r 855 (957) --+ .-..|.|-|-||-|.|...---|.+|..---.=-+||-+ -+.+ .+|----++..-+..+- T Consensus 197 sci--rdpnp~iffepk~lyr~a~edvp~~dy~iplsqaevire--------------g~di-tlv~wgtqvh~i~e~a~ 259 (362) T KOG0525 197 SCI--RDPNPCIFFEPKILYRQAVEDVPEGDYMIPLSQAEVIRE--------------GSDI-TLVAWGTQVHVIMEQAC 259 (362) T ss_pred EEC--CCCCCEEEECHHHHHHHHHHHCCCCCCCCCCCHHHHHHC--------------CCCE-EEEECCHHHHHHHHHHH T ss_conf 010--699965784528999876630787773343217777523--------------7843-89971104689999987 Q ss_pred -HH-CCCCCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHH Q ss_conf -97-69986899984014868289999998537998399806785317845789976 Q gi|254781051|r 856 -DM-RNIADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPY 910 (957) Q Consensus 856 -~~-~~~~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~r 910 (957) .+ +---+.-+|-+.-|-||-.+-+.+-..|-. .++-..|-|-.-|--.-+..- T Consensus 260 l~~ek~giscevidlkti~pwd~d~v~~sv~ktg--rllisheapvtggfgaeiast 314 (362) T KOG0525 260 LAKEKLGISCEVIDLKTIIPWDKDTVEESVQKTG--RLLISHEAPVTGGFGAEIAST 314 (362) T ss_pred HHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHC--EEEEECCCCCCCCCHHHHHHH T ss_conf 6677529846997300015753888999887416--178863477567521888999 No 29 >cd07033 TPP_PYR_DXS_TK_like Pyrimidine (PYR) binding domain of 1-deoxy-D-xylulose-5-phosphate synthase (DXS), transketolase (TK), and related proteins. Thiamine pyrophosphate (TPP) family, pyrimidine (PYR) binding domain of 1-deoxy-D-xylulose-5-phosphate synthase (DXS), transketolase (TK), and the beta subunits of the E1 component of the human pyruvate dehydrogenase complex (E1- PDHc), subfamily. The PYR domain is found in many key metabolic enzymes which use TPP (also known as thiamine diphosphate) as a cofactor. TPP binds in the cleft formed by a PYR domain and a PP domain. The PYR domain, binds the aminopyrimidine ring of TPP, the PP domain binds the diphosphate residue. A polar interaction between the conserved glutamate of the PYR domain and the N1' of the TPP aminopyrimidine ring is shared by most TPP-dependent enzymes, and participates in the activation of TPP. The PYR and PP domains have a common fold, but do not share strong sequence conservation. The PP domain is not included Probab=97.22 E-value=0.0045 Score=42.91 Aligned_cols=145 Identities=19% Similarity=0.222 Sum_probs=106.1 Q ss_pred HHHHHCCCEEEEECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEE Q ss_conf 98863583157850100223332302799977888445563442789845999824311110134320221138653111 Q gi|254781051|r 615 GSLCYEGYKVRLSGQDCERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTI 694 (957) Q Consensus 615 ~SLl~eG~~VRlsGQD~~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~i 694 (957) --++.+.-.|-+-++|+...+.... +..-.++ .++|.+++|.+..|.--|-|+.--. - T Consensus 8 ~~~~~~d~~vv~l~~Dl~~~~~~~~---------------f~~~~p~----r~in~GIaE~~~vg~a~GlA~~G~~---p 65 (156) T cd07033 8 LELAKKDPRIVALSADLGGSTGLDK---------------FAKKFPD----RFIDVGIAEQNMVGIAAGLALHGLK---P 65 (156) T ss_pred HHHHHHCCCEEEEECCCCCCCCHHH---------------HHHHCCC----CEECCCCHHHHHHHHHHHHHHCCCC---C T ss_conf 9999779799999687678877589---------------9987897----8444770199999999999975999---8 Q ss_pred ECCCCCCCCCCCHHHHHHHH-HHHHHHHCCCCCCEEECCCC---CCCCCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHH Q ss_conf 00002442454102256554-20257623336813543688---888887656440578875214378269724889999 Q gi|254781051|r 695 WEAQFGDFANGAQVILDQFI-TAGEQKWLCTSNLVCLLPHG---YEGQGPEHSSARLERFLQMCAENNMYVANCTSPANY 770 (957) Q Consensus 695 WEAQFGDF~NgAQviiDqfi-ss~e~KW~~~sglvllLPHG---yeGqGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~ 770 (957) |-..|..| .+-.+||.. ..+-. ...+++...|+ |-+.||.|.+ +|-.--+.+-.||.|..|++|... T Consensus 66 i~~~~~~F---~~r~~eqi~~~~a~~----~~~v~~v~~~~g~~~~~~G~tH~~--~ed~~~~~~~Pgl~i~~P~~~~e~ 136 (156) T cd07033 66 FVSTFSFF---LQRAYDQIRHDVALQ----NLPVKFVGTHAGISVGEDGPTHQG--IEDIALLRAIPNMTVLRPADANET 136 (156) T ss_pred CHHHHHHH---HHHHHHHHHHHHHHC----CCCEEEEEECCCCCCCCCCCCCCC--CCHHHHHHCCCCCEEEEECCHHHH T ss_conf 11357988---751099999997752----799659976885477789788055--005878723899489961999999 Q ss_pred HHHHHHHHCCCCCCCEEEECCC Q ss_conf 9999997435348868985660 Q gi|254781051|r 771 FHILRRQIYDRSSRPLIMMAPK 792 (957) Q Consensus 771 FH~LRRQ~~r~~rkPLiv~tPK 792 (957) -.+|+.=... .+|-++..|| T Consensus 137 ~~ll~~a~~~--~~P~~irl~r 156 (156) T cd07033 137 AAALEAALEY--DGPVYIRLPR 156 (156) T ss_pred HHHHHHHHHC--CCCEEEEEEC T ss_conf 9999999828--9988999769 No 30 >cd02007 TPP_DXS Thiamine pyrophosphate (TPP) family, DXS subfamily, TPP-binding module; 1-Deoxy-D-xylulose-5-phosphate synthase (DXS) is a regulatory enzyme of the mevalonate-independent pathway involved in terpenoid biosynthesis. Terpeniods are plant natural products with important pharmaceutical activity. DXS catalyzes a transketolase-type condensation of pyruvate with D-glyceraldehyde-3-phosphate to form 1-deoxy-D-xylulose-5-phosphate (DXP) and carbon dioxide. The formation of DXP leads to the formation of the terpene precursor IPP (isopentyl diphosphate) and to the formation of thiamine (vitamin B1) and pyridoxal (vitamin B6). Probab=97.09 E-value=0.0075 Score=41.27 Aligned_cols=113 Identities=20% Similarity=0.403 Sum_probs=79.1 Q ss_pred CCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECC Q ss_conf 77310004455323677776416665445556544540478886257533346725887645106774436579998416 Q gi|254781051|r 332 PSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINN 411 (957) Q Consensus 332 PSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNN 411 (957) .-||=-==|++.|+|-|..-.. ...+|.+++ ||.... +|+++|.++++.. +. .--|=||.+| T Consensus 74 tGsLG~Gls~a~G~Ala~k~~~------------~~~~v~~l~--GDGEl~-EG~~wEA~~~A~~--~~-~nli~iid~N 135 (195) T cd02007 74 TGHSSTSISAALGMAVARDLKG------------KKRKVIAVI--GDGALT-GGMAFEALNNAGY--LK-SNMIVILNDN 135 (195) T ss_pred CCCHHHHHHHHHHHHHHHHHCC------------CCCEEEEEE--CCCHHH-HHHHHHHHHHHHH--CC-CCEEEEEECC T ss_conf 8646889999999999995679------------998499997--781140-1899999999765--18-9869999679 Q ss_pred CEECCCCCHHCCCCCCCCHHHHHHCCCEEE-ECCCCHHHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 020124701024677840011220340587-4267889999999999999997389739986 Q gi|254781051|r 412 QIGFTTNPSSARSCTYASDISKSIGIPIFH-VNGDDPEAVIRVVRMAVSFRMKFHKSVVIDI 472 (957) Q Consensus 412 QIGFTT~p~d~RSs~Y~TDiAK~i~aPI~H-VN~DdpEav~~~~~lA~~yR~~F~kDVvIDl 472 (957) ++...++..+ ..+.-|.++.-++. |||.|.+++..|.+-|- +-+|-++|-. T Consensus 136 ~~~i~~~~~~------~~~~f~afGw~~v~~vDGhd~~~i~~al~~a~----~~~~P~~Iia 187 (195) T cd02007 136 EMSISPNVGT------PGNLFEELGFRYIGPVDGHNIEALIKVLKEVK----DLKGPVLLHV 187 (195) T ss_pred CEEECCCCCC------CCCHHHHCCCCEECCCCCCCHHHHHHHHHHHH----HCCCCEEEEE T ss_conf 8761488666------42368774886778607899999999999986----2899989999 No 31 >cd02006 TPP_Gcl Thiamine pyrophosphate (TPP) family, Gcl subfamily, TPP-binding module; composed of proteins similar to Escherichia coli glyoxylate carboligase (Gcl). E. coli glyoxylate carboligase, plays a key role in glyoxylate metabolism where it catalyzes the condensation of two molecules of glyoxylate to give tartronic semialdehyde and carbon dioxide. This enzyme requires TPP, magnesium ion and FAD as cofactors. Probab=96.86 E-value=0.017 Score=38.51 Aligned_cols=166 Identities=17% Similarity=0.244 Sum_probs=110.2 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-EECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 989999985588743223455510015026889-8359618999717877310004455323677776416665445556 Q gi|254781051|r 285 LPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNR-QICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSIS 363 (957) Q Consensus 285 p~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~-~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~ 363 (957) .|..++.|.....|.+.....|+=-|.-+.... ... ++-+.-....=.++=.=-|-++|-.=|.-| T Consensus 9 ~P~rv~~~l~~~lp~d~ivv~D~G~~~~~~~~~~~~~-~p~~~~~s~~~g~mG~glPaAiGaklA~Pd------------ 75 (202) T cd02006 9 KPQRVYEEMNKAFGRDVRYVTTIGLSQIAGAQMLHVY-KPRHWINCGQAGPLGWTVPAALGVAAADPD------------ 75 (202) T ss_pred CHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHCCCC-CCCEEECCCCCCHHHHHHHHHHHHHHHCCC------------ T ss_conf 9999999999757998199987868999999856868-999186589987033189999999987699------------ Q ss_pred CCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCC---------EECCCC------CHHCCCC--- Q ss_conf 5445404788862575333467258876451067744365799984160---------201247------0102467--- Q gi|254781051|r 364 LVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQ---------IGFTTN------PSSARSC--- 425 (957) Q Consensus 364 ~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQ---------IGFTT~------p~d~RSs--- 425 (957) +.|++ |-||.+|.=-+--.+|.--.+|| .|.||.||+ -.|..+ ..+..++ T Consensus 76 ----r~Vv~--i~GDGsf~m~~~EL~Ta~r~~lp------ii~vV~NN~~~g~ir~~q~~~~~~~~~~~~~~~~~~~~~~ 143 (202) T cd02006 76 ----RQVVA--LSGDYDFQFMIEELAVGAQHRIP------YIHVLVNNAYLGLIRQAQRAFDMDYQVNLAFENINSSELG 143 (202) T ss_pred ----CEEEE--EECCCCHHHHHHHHHHHHHHCCC------EEEEEEECCHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC T ss_conf ----85999--97684055129999999996998------3999997955689999999853474423254666885446 Q ss_pred CCCC---HHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 7840---011220340587426788999999999999999738973998642112 Q gi|254781051|r 426 TYAS---DISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRR 477 (957) Q Consensus 426 ~Y~T---DiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr 477 (957) .|.. -+|+.+++.=+.| ++||....|.+=|++.-.+-+..++||.+|=|. T Consensus 144 ~~~~D~~kiAea~G~~g~rV--~~~~eL~~Al~~A~~~~~~~~~P~lIeviidr~ 196 (202) T cd02006 144 GYGVDHVKVAEGLGCKAIRV--TKPEELAAAFEQAKKLMAEHRVPVVVEAILERV 196 (202) T ss_pred CCCCCHHHHHHHCCCEEEEE--CCHHHHHHHHHHHHHHHCCCCCCEEEEEEECCC T ss_conf 78989999999789989997--999999999999998751479978999997671 No 32 >PRK08659 2-oxoglutarate ferredoxin oxidoreductase subunit alpha; Validated Probab=96.74 E-value=0.043 Score=35.51 Aligned_cols=90 Identities=18% Similarity=0.226 Sum_probs=58.0 Q ss_pred CCEEEEECCHHHHHHHHHHHHCC--CCCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHH Q ss_conf 76899868857899999999769--9868999840148682899999985379983998067853178457899769999 Q gi|254781051|r 837 IRRVILCTGKVYYDLLDNRDMRN--IADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYLEKV 914 (957) Q Consensus 837 v~rvi~csGKvyydL~~~r~~~~--~~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl~~~ 914 (957) .+-+|+|.|=.|-...++-+... -.+|.++||-.|+|||.++|++.++++.. |-|=| .|+| |.. ..+... T Consensus 274 Adi~IV~~GS~~~~~~eAvd~Lr~~G~kvg~l~ir~l~PfP~~~i~~~l~~~k~---ViVvE--~N~G-Ql~--~~i~~~ 345 (377) T PRK08659 274 AEVVVVAYGSVARSARRAVKEAREEGIKVGLFRLITVWPFPEEAIRELAKKVKA---IVVPE--MNLG-QMS--LEVERV 345 (377) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEEEECCCCHHHHHHHHHCCCE---EEEEC--CCCH-HHH--HHHHHH T ss_conf 989999968557899999999996498688999722069998999999856998---99980--7756-899--999997 Q ss_pred HHHCCCCCCEEEEEECCCCC Q ss_conf 87505677601897536646 Q gi|254781051|r 915 LHSIGADYSRVRYVGRLPSA 934 (957) Q Consensus 915 l~~~~~~~~~~~y~gR~~~a 934 (957) +........-++|-|||-++ T Consensus 346 ~~~~~~~~~i~k~~G~p~~~ 365 (377) T PRK08659 346 VKGRAKVKGINKIGGELITP 365 (377) T ss_pred HCCCCCEEEEEEECCCCCCH T ss_conf 28998646883258981799 No 33 >cd00568 TPP_enzymes Thiamine pyrophosphate (TPP) enzyme family, TPP-binding module; found in many key metabolic enzymes which use TPP (also known as thiamine diphosphate) as a cofactor. These enzymes include, among others, the E1 components of the pyruvate, the acetoin and the branched chain alpha-keto acid dehydrogenase complexes. Probab=96.73 E-value=0.01 Score=40.25 Aligned_cols=146 Identities=20% Similarity=0.177 Sum_probs=90.4 Q ss_pred CCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEEC Q ss_conf 43223455510015026889835961899971787731000445532367777641666544555654454047888625 Q gi|254781051|r 298 PSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHG 377 (957) Q Consensus 298 ~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHG 377 (957) |++.....|+=-+..+....-...+.-+.-....-++|-.--|.++|.+-|..+ ++| |.+.| T Consensus 11 ~~~~ivv~d~G~~~~~~~~~~~~~~~~~~~~~~~~g~mG~~~p~AiGa~~a~p~----------------~~v--v~i~G 72 (168) T cd00568 11 PEDAIVVNDAGNSAYWAYRYLPLRRGRRFLTSTGFGAMGYGLPAAIGAALAAPD----------------RPV--VCIAG 72 (168) T ss_pred CCCCEEEECCHHHHHHHHHHCCCCCCCEEECCCCCCHHHHHHHHHHHHHHHCCC----------------CEE--EEEEC T ss_conf 999699987527699999975628999598289875377799999999987899----------------839--99977 Q ss_pred CCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCH-----------HCCCCCCCCHHHHHHCCCEEEECCCC Q ss_conf 7533346725887645106774436579998416020124701-----------02467784001122034058742678 Q gi|254781051|r 378 DAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPS-----------SARSCTYASDISKSIGIPIFHVNGDD 446 (957) Q Consensus 378 DAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~-----------d~RSs~Y~TDiAK~i~aPI~HVN~Dd 446 (957) |.+|.=.+--.+|..-.++| .+.||.||+..-.+... ...+..--..+|+.++++.++|+ + T Consensus 73 DG~f~~~~~el~ta~~~~l~------i~~iv~nN~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~a~~~G~~~~~v~--~ 144 (168) T cd00568 73 DGGFMMTGQELATAVRYGLP------VIVVVFNNGGYGTIRMHQEAFYGGRVSGTDLSNPDFAALAEAYGAKGVRVE--D 144 (168) T ss_pred CCCCEECCHHHHHHHHCCCC------EEEEEEECCHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHCCCEEEEEC--C T ss_conf 97421131766456531884------269999776136779999850289876565888877899997498699989--9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 899999999999999973897399864 Q gi|254781051|r 447 PEAVIRVVRMAVSFRMKFHKSVVIDIV 473 (957) Q Consensus 447 pEav~~~~~lA~~yR~~F~kDVvIDlv 473 (957) |+....+.+-|+ +.++-++||+. T Consensus 145 ~~el~~al~~a~----~~~~p~li~V~ 167 (168) T cd00568 145 PEDLEAALAEAL----AAGGPALIEVK 167 (168) T ss_pred HHHHHHHHHHHH----HCCCCEEEEEE T ss_conf 999999999998----38997899998 No 34 >cd02002 TPP_BFDC Thiamine pyrophosphate (TPP) family, BFDC subfamily, TPP-binding module; composed of proteins similar to Pseudomonas putida benzoylformate decarboxylase (BFDC). P. putida BFDC plays a role in the mandelate pathway, catalyzing the conversion of benzoylformate to benzaldehyde and carbon dioxide. This enzyme is dependent on TPP and a divalent metal cation as cofactors. Probab=96.59 E-value=0.024 Score=37.39 Aligned_cols=157 Identities=18% Similarity=0.229 Sum_probs=96.7 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-EECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 989999985588743223455510015026889-8359618999717877310004455323677776416665445556 Q gi|254781051|r 285 LPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNR-QICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSIS 363 (957) Q Consensus 285 p~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~-~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~ 363 (957) .|..++.+.....|++.....|+=.|..+.... ......-.++ +.-..+=.--|-++|..-|..+ T Consensus 2 ~P~~~~~~l~~~lp~~~ivv~D~G~~~~~~~~~~~~~~p~~~~~--~~~g~mG~~~paAiGa~lA~p~------------ 67 (178) T cd02002 2 TPEYLAAALAAALPEDAIIVDEAVTNGLPLRDQLPLTRPGSYFT--LRGGGLGWGLPAAVGAALANPD------------ 67 (178) T ss_pred CHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCCCCCCEEEE--CCCCHHHHHHHHHHHHHHHCCC------------ T ss_conf 88999999996589885999887054999999727489993980--7985376799999999985989------------ Q ss_pred CCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE-E-------------CCCCCHHC-CCCCCC Q ss_conf 54454047888625753334672588764510677443657999841602-0-------------12470102-467784 Q gi|254781051|r 364 LVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI-G-------------FTTNPSSA-RSCTYA 428 (957) Q Consensus 364 ~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI-G-------------FTT~p~d~-RSs~Y~ 428 (957) ++|++|. ||.+|.= -++|- .-..-|+.- .+-||+||+- | +++...+. ....-. T Consensus 68 ----r~vv~i~--GDG~f~m--~~~eL---~Ta~~~~lp-v~iiV~NN~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 135 (178) T cd02002 68 ----RKVVAII--GDGSFMY--TIQAL---WTAARYGLP-VTVVILNNRGYGALRSFLKRVGPEGPGENAPDGLDLLDPG 135 (178) T ss_pred ----CEEEEEE--CCCCCCC--CCHHH---EEEHHHCCC-EEEEEEECCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC T ss_conf ----8299997--8973115--52221---000141799-7999997983089999999973777776655555677899 Q ss_pred ---CHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf ---001122034058742678899999999999999973897399864 Q gi|254781051|r 429 ---SDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIV 473 (957) Q Consensus 429 ---TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv 473 (957) ..+|++++++-++| ++|+....+.+-|++ -.+-++||++ T Consensus 136 ~d~~~iA~a~G~~~~~v--~~~~el~~al~~a~~----~~~p~vIev~ 177 (178) T cd02002 136 IDFAAIAKAFGVEAERV--ETPEELDEALREALA----EGGPALIEVV 177 (178) T ss_pred CCHHHHHHHCCCCEEEE--CCHHHHHHHHHHHHH----CCCCEEEEEE T ss_conf 89999999839948997--899999999999983----8996899998 No 35 >cd02013 TPP_Xsc_like Thiamine pyrophosphate (TPP) family, Xsc-like subfamily, TPP-binding module; composed of proteins similar to Alcaligenes defragrans sulfoacetaldehyde acetyltransferase (Xsc). Xsc plays a key role in the degradation of taurine, catalyzing the desulfonation of 2-sulfoacetaldehyde into sulfite and acetyl phosphate. This enzyme requires TPP and divalent metal ions for activity. Probab=96.46 E-value=0.032 Score=36.50 Aligned_cols=173 Identities=21% Similarity=0.309 Sum_probs=110.0 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-EEC-CCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 989999985588743223455510015026889-835-961899971787731000445532367777641666544555 Q gi|254781051|r 285 LPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNR-QIC-GKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSI 362 (957) Q Consensus 285 p~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~-~~~-g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~ 362 (957) .|..+|.+.....|++.....|+=.|..+.... .+. .+... ...+=+.+=.==|.++|..-|..| T Consensus 5 ~P~~~~~~l~~~lp~d~ivv~D~G~~~~~~~~~~~~~~p~~~~--~~~~~g~mG~glpaAiGAk~A~Pd----------- 71 (196) T cd02013 5 HPRQVLRELEKAMPEDAIVSTDIGNICSVANSYLRFEKPRSFI--APLSFGNCGYALPAIIGAKAAAPD----------- 71 (196) T ss_pred CHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHCCCCCCCEEE--ECCCCCHHHHHHHHHHHHHHHCCC----------- T ss_conf 9999999999558988099986438799999966758999598--069866476799999999972789----------- Q ss_pred CCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE-EC---------CCC--CHHCCCCCCCCH Q ss_conf 654454047888625753334672588764510677443657999841602-01---------247--010246778400 Q gi|254781051|r 363 SLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI-GF---------TTN--PSSARSCTYASD 430 (957) Q Consensus 363 ~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI-GF---------TT~--p~d~RSs~Y~TD 430 (957) +.|++| -||.+|-=..--.+|.-=.+|| .+.||+||.. |. ... ..+-....| .. T Consensus 72 -----r~Vv~i--~GDGsf~m~~~EL~Ta~r~~lp------v~~vV~NN~~~g~i~~~q~~~~~~~~~~~~~~~~df-~~ 137 (196) T cd02013 72 -----RPVVAI--AGDGAWGMSMMEIMTAVRHKLP------VTAVVFRNRQWGAEKKNQVDFYNNRFVGTELESESF-AK 137 (196) T ss_pred -----CEEEEE--ECCCCHHHCHHHHHHHHHHCCC------EEEEEEECCHHHHHHHHHHHHHCCCCCCCCCCCCCH-HH T ss_conf -----749999--5797342144899999997889------069999780578999999987089742453699986-89 Q ss_pred HHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHH Q ss_conf 112203405874267889999999999999997389739986421125656435422355788 Q gi|254781051|r 431 ISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMM 493 (957) Q Consensus 431 iAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~m 493 (957) +|+.++++-+.| ++||....|.+-|++.+.+ ++-++||++|=+ |..-| |-|-.| T Consensus 138 ~A~a~G~~g~~V--~~~~el~~al~~Ala~~~~-~~P~lIeV~vD~-----e~~~P-~~~~~~ 191 (196) T cd02013 138 IAEACGAKGITV--DKPEDVGPALQKAIAMMAE-GKTTVIEIVCDQ-----ELGDP-FRRDAL 191 (196) T ss_pred HHHHCCCEEEEE--CCHHHHHHHHHHHHHHCCC-CCEEEEEEEECC-----CCCCC-CCHHHH T ss_conf 999759878997--8999999999999985679-986999999786-----65798-750230 No 36 >PRK11269 glyoxylate carboligase; Provisional Probab=96.46 E-value=0.017 Score=38.62 Aligned_cols=165 Identities=19% Similarity=0.261 Sum_probs=89.1 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCEEEE-EEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 8999998558874322345551001502688-983596189997178773100044553236777764166654455565 Q gi|254781051|r 286 PRSIFYEFKGKGPSEKEYSGDVKYHLGLCCN-RQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISL 364 (957) Q Consensus 286 ~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~-~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~ 364 (957) +..++.+.....+.+....-|+-.|..+... .... +.-+.-....=+.+=.=-|-++|-.-|+-| T Consensus 372 p~~v~~~l~~~~~~d~iiv~d~G~~~~~~~~~~~~~-~p~~~~~~g~~G~mG~glPaAIGA~lA~Pd------------- 437 (591) T PRK11269 372 PQRVYEEMNKAFGRDTCYVSTIGLSQIAAAQFLHVY-KPRHWINCGQAGPLGWTIPAALGVRAADPE------------- 437 (591) T ss_pred HHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHCCCC-CCCCEECCCCCCHHHHHHHHHHHHHHHCCC------------- T ss_conf 999999999767998889980607799999853357-998773676655589999999999997899------------- Q ss_pred CCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCC---------EECCC---------CCHHCCCCC Q ss_conf 445404788862575333467258876451067744365799984160---------20124---------701024677 Q gi|254781051|r 365 VERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQ---------IGFTT---------NPSSARSCT 426 (957) Q Consensus 365 ~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQ---------IGFTT---------~p~d~RSs~ 426 (957) ++|++|. ||.+|.=-+---+|.---+|| .++||+||+ .+|-. ++..-.... T Consensus 438 ---r~VV~i~--GDG~f~m~~qEL~Tavr~~lp------vv~vV~NN~~~G~Ir~~Q~~~~~~~~~~~~~~~~~~~~~~~ 506 (591) T PRK11269 438 ---RPVVALS--GDYDFQFLIEELAVGAQFNLP------YIHVLVNNAYLGLIRQAQRAFDMDYCVQLAFENINSPELNG 506 (591) T ss_pred ---CCEEEEE--CCHHHHHHHHHHHHHHHHCCC------CEEEEEECCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCC T ss_conf ---9689998--736875419999999995889------08999968823099999997445644220557778865567 Q ss_pred CCCH---HHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 8400---11220340587426788999999999999999738973998642112 Q gi|254781051|r 427 YASD---ISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRR 477 (957) Q Consensus 427 Y~TD---iAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr 477 (957) |-.| +|+.+++.=++| ++||.+--|.+=|++...++++-|+||.+-=|. T Consensus 507 ~~~Df~~~Aea~G~~g~rV--~~~~el~~Al~~Ala~~~~~~~P~lv~v~id~~ 558 (591) T PRK11269 507 YGVDHVKVAEGLGCKAIRV--FKPEDIAPAFEQAKALMAEFRVPVVVEVILERV 558 (591) T ss_pred CCCCHHHHHHHCCCEEEEE--CCHHHHHHHHHHHHHHCCCCCCCEEEEEEECCC T ss_conf 8888999999779879998--999999999999974113579988999994264 No 37 >cd02005 TPP_PDC_IPDC Thiamine pyrophosphate (TPP) family, PDC_IPDC subfamily, TPP-binding module; composed of proteins similar to pyruvate decarboxylase (PDC) and indolepyruvate decarboxylase (IPDC). PDC, a key enzyme in alcoholic fermentation, catalyzes the conversion of pyruvate to acetaldehyde and CO2. It is able to utilize other 2-oxo acids as substrates. In plants and various plant-associated bacteria, IPDC plays a role in the indole-3-pyruvic acid (IPA) pathway, a tryptophan-dependent biosynthetic route to indole-3-acetaldehyde (IAA). IPDC catalyzes the decarboxylation of IPA to IAA. Both PDC and IPDC depend on TPP and Mg2+ as cofactors. Probab=96.43 E-value=0.035 Score=36.17 Aligned_cols=159 Identities=16% Similarity=0.199 Sum_probs=92.6 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 89999985588743223455510015026889835961899971787731000445532367777641666544555654 Q gi|254781051|r 286 PRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLV 365 (957) Q Consensus 286 ~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~ 365 (957) ++.++.+.....|++.....|+=-|.-......+.... +.-....=+.+=.--|-++|..-|+.|+ T Consensus 4 p~rv~~~l~~~lp~daivv~D~G~~~~~~~~~~~~~~~-~~~~~~~~g~mG~~lpaAiGa~lA~p~~------------- 69 (183) T cd02005 4 QARLWQQVQNFLKPNDILVAETGTSWFGALDLKLPKGT-RFISQPLWGSIGYSVPAALGAALAAPDR------------- 69 (183) T ss_pred HHHHHHHHHHHCCCCCEEEECCCHHHHHHHHCCCCCCC-EEECCCCCCHHHHHHHHHHHHHHHCCCC------------- T ss_conf 89999999946899999998977889999763638999-8983777503778999999999978998------------- Q ss_pred CCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCC-HHCCCCCC-------CCHHHHHHCC Q ss_conf 454047888625753334672588764510677443657999841602012470-10246778-------4001122034 Q gi|254781051|r 366 ERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNP-SSARSCTY-------ASDISKSIGI 437 (957) Q Consensus 366 ~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p-~d~RSs~Y-------~TDiAK~i~a 437 (957) +|++ +-||.+|-=-+---+|.--.+|+ .|.||+||+ ||+... .......| -..+|+.+++ T Consensus 70 ---~Vv~--i~GDG~f~m~~~El~Ta~r~~l~------vi~vV~NN~-~~~~~r~~~~~~~~~~~~~~~df~~~A~a~G~ 137 (183) T cd02005 70 ---RVIL--LVGDGSFQMTVQELSTMIRYGLN------PIIFLINND-GYTIERAIHGPEASYNDIANWNYTKLPEVFGG 137 (183) T ss_pred ---CEEE--EECCHHHHHHHHHHHHHHHHCCC------EEEEEEECC-CCCEEEEECCCCCCCCCCCCCCHHHHHHHHCC T ss_conf ---4799--95673676208899999982998------199999899-43255751268656576999999999999478 Q ss_pred ----CEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf ----05874267889999999999999997389739986421 Q gi|254781051|r 438 ----PIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCY 475 (957) Q Consensus 438 ----PI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~Y 475 (957) +-+.| ++|+..-.|.+-|+. +..+-++||+++= T Consensus 138 ~~g~~g~rV--~~~~el~~al~~Al~---~~~~P~liev~vd 174 (183) T cd02005 138 GGGGLSFRV--KTEGELDEALKDALF---NRDKLSLIEVILP 174 (183) T ss_pred CCCCEEEEE--CCHHHHHHHHHHHHH---CCCCEEEEEEECC T ss_conf 668428997--899999999999997---2898299999748 No 38 >TIGR03336 IOR_alpha indolepyruvate ferredoxin oxidoreductase, alpha subunit. Indolepyruvate ferredoxin oxidoreductase (IOR) is an alpha 2/beta 2 tetramer related to ketoacid oxidoreductases for pyruvate (1.2.7.1, POR), 2-ketoglutarate (1.2.7.3, KOR), and 2-oxoisovalerate (1.2.7.7, VOR). These multi-subunit enzymes typically are found in anaerobes and are inactiviated by oxygen. IOR in Pyrococcus acts in fermentation of all three aromatic amino acids, following removal of the amino group by transamination. In Methanococcus maripaludis, by contrast, IOR acts in the opposite direction, in pathways of amino acid biosynthesis from phenylacetate, indoleacetate, and p-hydroxyphenylacetate. In M. maripaludis and many other species, iorA and iorB are found next to an apparent phenylacetate-CoA ligase. Probab=96.21 E-value=0.09 Score=33.11 Aligned_cols=158 Identities=17% Similarity=0.248 Sum_probs=88.7 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEEC-CCEE Q ss_conf 999999999998737910799806765526899988537989999985588743223455510015026889835-9618 Q gi|254781051|r 246 IIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQIC-GKDV 324 (957) Q Consensus 246 lIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~-g~~v 324 (957) |+-+-++|.+.+-+.|| .++=|-+ |-|...|..++.-...+ .|.- .+.. +.++ T Consensus 4 ll~GneAiarga~eaGv-~~vsgYp------------GsP~~ei~~~l~~~~~~-----------~~i~--~e~~~NEkv 57 (595) T TIGR03336 4 LLLGNEAIARGALEAGV-GVAAAYP------------GTPSSEITDTLAKVAKR-----------AGVY--FEWSVNEKV 57 (595) T ss_pred HHHHHHHHHHHHHHCCC-EEEEECC------------CCCHHHHHHHHHHHHHH-----------CCEE--EEECCCHHH T ss_conf 32258999999998198-4898659------------89878999999962042-----------6669--985578899 Q ss_pred EEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCE Q ss_conf 99971787731000445532367777641666544555654454047888625753334672588764510677443657 Q gi|254781051|r 325 KLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGN 404 (957) Q Consensus 325 ~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGT 404 (957) - +|..---..+=+|+-= .-|-..+-+-|||=+ ++|++.+ .||- T Consensus 58 A---------~e~a~gA~~~Gvr~l~----------------~~K~vGlnva~D~l~--------~~~~~G~----~GG~ 100 (595) T TIGR03336 58 A---------VEVAAGAAWSGLRAFC----------------TMKHVGLNVAADPLM--------TLAYTGV----KGGL 100 (595) T ss_pred H---------HHHHHHHHHCCCCEEE----------------ECCCCCCCHHHHHHH--------HHHHHCC----CCEE T ss_conf 9---------9998776442863899----------------856688441277888--------7652077----7559 Q ss_pred EEEEECCCEECC-CCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 999841602012-4701024677840011220340587426788999999999999999738973998642 Q gi|254781051|r 405 IHLIINNQIGFT-TNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVC 474 (957) Q Consensus 405 iHiVvNNQIGFT-T~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~ 474 (957) |=|+-.---... |++.|+| . .++.-..|||.=- ||..+....+.|++.=.+|+.-|.+-++. T Consensus 101 v~v~gDDpg~~SSq~eqdsr--~----~~~~a~iPvl~Ps--~~qE~~d~~~~af~LSr~~g~pV~lr~tt 163 (595) T TIGR03336 101 VVVVADDPSMHSSQNEQDTR--H----YAKFAKIPCLEPS--TPQEAKDMVKYAFELSEKFGLPVILRPTT 163 (595) T ss_pred EEEEECCCCCCCCCCHHHHH--H----HHHHCCCEEECCC--CHHHHHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 99994699985351677789--9----9998498267789--99999999999999999979987999887 No 39 >cd02004 TPP_BZL_OCoD_HPCL Thiamine pyrophosphate (TPP) family, BZL_OCoD_HPCL subfamily, TPP-binding module; composed of proteins similar to benzaldehyde lyase (BZL), oxalyl-CoA decarboxylase (OCoD) and 2-hydroxyphytanoyl-CoA lyase (2-HPCL). Pseudomonas fluorescens biovar I BZL cleaves the acyloin linkage of benzoin producing 2 molecules of benzaldehyde and enabling the Pseudomonas to grow on benzoin as the sole carbon and energy source. OCoD has a role in the detoxification of oxalate, catalyzing the decarboxylation of oxalyl-CoA to formate. 2-HPCL is a peroxisomal enzyme which plays a role in the alpha-oxidation of 3-methyl-branched fatty acids, catalyzing the cleavage of 2-hydroxy-3-methylacyl-CoA into formyl-CoA and a 2-methyl-branched fatty aldehyde. All these enzymes depend on Mg2+ and TPP for activity. Probab=95.98 E-value=0.056 Score=34.64 Aligned_cols=156 Identities=19% Similarity=0.138 Sum_probs=96.0 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-EEC-CCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 89999985588743223455510015026889-835-9618999717877310004455323677776416665445556 Q gi|254781051|r 286 PRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNR-QIC-GKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSIS 363 (957) Q Consensus 286 ~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~-~~~-g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~ 363 (957) |..++.+.....|++.....|+=.|..+.... .+. .+... ....=+.+=.=-|-++|..-|+.+ T Consensus 1 P~~v~~~l~~~lp~d~iv~~D~G~~~~~~~~~~~~~~p~~~~--~~~~~g~mG~~lpaaiGa~lA~p~------------ 66 (172) T cd02004 1 PYRVLHELQEALPDDAIIVSDGGNTMDWARYILRPRKPRHRL--DAGTFGTLGVGLGYAIAAALARPD------------ 66 (172) T ss_pred CHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCCCEEE--CCCCCCHHHHHHHHHHHHHHHCCC------------ T ss_conf 979999999648988099987737799999974737999498--079876477799999999997899------------ Q ss_pred CCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE-ECCCCC----H--------HCCCCCCCCH Q ss_conf 54454047888625753334672588764510677443657999841602-012470----1--------0246778400 Q gi|254781051|r 364 LVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI-GFTTNP----S--------SARSCTYASD 430 (957) Q Consensus 364 ~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI-GFTT~p----~--------d~RSs~Y~TD 430 (957) +.|++| -||.+|.=..---+|.---+|| .+.||+||+- |-...- . +.....| .. T Consensus 67 ----~~Vv~i--~GDG~f~m~~~EL~Ta~~~~lp------i~~iV~nN~~~~~i~~~q~~~~~~~~~~~~~~~~~df-~~ 133 (172) T cd02004 67 ----KRVVLV--EGDGAFGFSGMELETAVRYNLP------IVVVVGNNGGWYQGLDGQQLSYGLGLPVTTLLPDTRY-DL 133 (172) T ss_pred ----CEEEEE--ECCCCHHCCCHHHHHHHHHCCC------EEEEEEECCCHHHHHHHHHHHHCCCCCCEECCCCCCH-HH T ss_conf ----849999--7797340240788899984988------5999982883289999989872689874321899877-89 Q ss_pred HHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 11220340587426788999999999999999738973998642 Q gi|254781051|r 431 ISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVC 474 (957) Q Consensus 431 iAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~ 474 (957) +|+.++++-+.| ++|+..-.+.+-|++ -++-++||++. T Consensus 134 iA~a~G~~~~~v--~~~~el~~al~~a~~----~~~p~liev~i 171 (172) T cd02004 134 VAEAFGGKGELV--TTPEELKPALKRALA----SGKPALINVII 171 (172) T ss_pred HHHHCCCEEEEE--CCHHHHHHHHHHHHH----CCCCEEEEEEC T ss_conf 999759828997--899999999999995----89969999981 No 40 >pfam02775 TPP_enzyme_C Thiamine pyrophosphate enzyme, C-terminal TPP binding domain. Probab=95.89 E-value=0.06 Score=34.46 Aligned_cols=110 Identities=24% Similarity=0.357 Sum_probs=68.8 Q ss_pred CCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCC Q ss_conf 73100044553236777764166654455565445404788862575333467258876451067744365799984160 Q gi|254781051|r 333 SHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQ 412 (957) Q Consensus 333 SHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQ 412 (957) +.+-.--|.++|...|+.+ ++|++|. ||.+|.--+ .|.....+ |+. ..+.||.||+ T Consensus 28 g~mG~~~p~AiGa~~a~p~----------------~~vi~i~--GDG~f~~~~--~el~Ta~~---~~~-~i~~iv~nN~ 83 (150) T pfam02775 28 GTMGYGLPAAIGAKLARPD----------------RPVVAIA--GDGGFQMNG--QELATAVR---YNL-PITVVVLNNG 83 (150) T ss_pred CHHHHHHHHHHHHHHHCCC----------------CEEEEEE--CCCCHHCCC--CHHHHHHH---HCC-CCCEEEEECC T ss_conf 3232289999999997899----------------8499998--897211134--37999998---488-8568999764 Q ss_pred E-ECCC---CCHH--CCCCCC-------CCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 2-0124---7010--246778-------400112203405874267889999999999999997389739986 Q gi|254781051|r 413 I-GFTT---NPSS--ARSCTY-------ASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDI 472 (957) Q Consensus 413 I-GFTT---~p~d--~RSs~Y-------~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDl 472 (957) - |-+. .+.. .+++.+ -..+||.++++-+.| ++++....+.+=|+ +.++-++||+ T Consensus 84 ~yg~~~~~q~~~~~~~~~~~~~~~~~~d~~~lA~a~G~~~~~v--~~~~el~~al~~a~----~~~gP~~ieV 150 (150) T pfam02775 84 GYGMTRGQQTPFGGKRYSGPDGDLPPVDFAKLAEAYGAKGARV--ESPEELEEALKEAL----AHDGPALIDV 150 (150) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHCCCEEEEE--CCHHHHHHHHHHHH----HCCCCEEEEC T ss_conf 2588899988742997456567888878999999859869997--89999999999998----2899989959 No 41 >PRK07119 2-ketoisovalerate ferredoxin reductase; Validated Probab=95.55 E-value=0.1 Score=32.68 Aligned_cols=61 Identities=25% Similarity=0.266 Sum_probs=46.3 Q ss_pred CCEEEEECCHHHHHHHHHHHHCC--CCCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCC Q ss_conf 76899868857899999999769--9868999840148682899999985379983998067853178 Q gi|254781051|r 837 IRRVILCTGKVYYDLLDNRDMRN--IADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMG 902 (957) Q Consensus 837 v~rvi~csGKvyydL~~~r~~~~--~~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~G 902 (957) .+.+|+|.|-.|-...++.+... -.+|.++|+-.+||||.++|.+.++++.. |-+=|+ |.| T Consensus 246 a~~~ii~~Gs~~~~~~eav~~lr~~G~kvg~l~~~~~~PfP~~~i~~~l~~~~~---viVvE~--n~G 308 (350) T PRK07119 246 AELVLVAYGTSSRIAKSAVDMAREEGIKVGLFRPITLWPFPEKALEKLADKVKG---FLSVEM--SNG 308 (350) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEECEEEECEECCCCHHHHHHHHHCCCE---EEEECC--CCC T ss_conf 999999956346899999999997598126687651068999999999946998---999858--886 No 42 >cd02015 TPP_AHAS Thiamine pyrophosphate (TPP) family, Acetohydroxyacid synthase (AHAS) subfamily, TPP-binding module; composed of proteins similar to the large catalytic subunit of AHAS. AHAS catalyzes the condensation of two molecules of pyruvate to give the acetohydroxyacid, 2-acetolactate. 2-Acetolactate is the precursor of the branched chain amino acids, valine and leucine. AHAS also catalyzes the condensation of pyruvate and 2-ketobutyrate to form 2-aceto-2-hydroxybutyrate in isoleucine biosynthesis. In addition to requiring TPP and a divalent metal ion as cofactors, AHAS requires FAD. Probab=95.52 E-value=0.18 Score=30.82 Aligned_cols=156 Identities=21% Similarity=0.286 Sum_probs=97.3 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-EEC-CCEEEEEECCCCCCC---HHCCHHHHHHHHHHHHHHCCCCCC Q ss_conf 989999985588743223455510015026889-835-961899971787731---000445532367777641666544 Q gi|254781051|r 285 LPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNR-QIC-GKDVKLLLQSNPSHL---EFVDPVVIGSVRARQDLKAGIVGE 359 (957) Q Consensus 285 p~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~-~~~-g~~v~lsL~~NPSHL---EaVnPVv~G~~RAkQd~~~d~~~~ 359 (957) .|..+|.++....|++.....|.=.|.-+.... ++. .+.... +..+ =.==|-++|-.-|.-+ T Consensus 2 ~P~~~~~~L~~~lp~d~iiv~D~G~~~~~~~~~~~~~~p~~~~~-----s~~~g~mG~~lpaAIGa~la~p~-------- 68 (186) T cd02015 2 KPQEVIKELSELTPGDAIVTTDVGQHQMWAAQYYRFKKPRSWLT-----SGGLGTMGFGLPAAIGAKVARPD-------- 68 (186) T ss_pred CHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCCCEEEC-----CCCCCHHHHHHHHHHHHHHHCCC-------- T ss_conf 87999999996589983999778688999999736189993984-----79873566599999999986899-------- Q ss_pred CCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE-E---------CCCCC--HHC-CCCC Q ss_conf 555654454047888625753334672588764510677443657999841602-0---------12470--102-4677 Q gi|254781051|r 360 KSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI-G---------FTTNP--SSA-RSCT 426 (957) Q Consensus 360 ~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI-G---------FTT~p--~d~-RSs~ 426 (957) ++|+ .|.||.+|.=..--.+|..-.+|| .+.||+||+- | |-... .+. ...- T Consensus 69 --------r~vv--~i~GDG~f~m~~~EL~Ta~~~~lp------i~~iV~NN~~~g~i~~~q~~~~~~~~~~~~~~~~pd 132 (186) T cd02015 69 --------KTVI--CIDGDGSFQMNIQELATAAQYNLP------VKIVILNNGSLGMVRQWQELFYEGRYSHTTLDSNPD 132 (186) T ss_pred --------CEEE--EEECCCHHHCCHHHHHHHHHHCCC------CEEEEEECCCHHHHHHHHHHHCCCCCCCCCCCCCCC T ss_conf --------8189--997670775035999999986699------479999897068899999997289766656788877 Q ss_pred CCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECC Q ss_conf 84001122034058742678899999999999999973897399864211 Q gi|254781051|r 427 YASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYR 476 (957) Q Consensus 427 Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YR 476 (957) | ..+|+.++++-+.| ++++....+.+-|++ .++-++||+..=+ T Consensus 133 ~-~~iA~a~G~~~~~v--~~~~el~~al~~a~~----~~gP~lieV~idp 175 (186) T cd02015 133 F-VKLAEAYGIKGLRV--EKPEELEAALKEALA----SDGPVLLDVLVDP 175 (186) T ss_pred H-HHHHHHCCCEEEEE--CCHHHHHHHHHHHHH----CCCCEEEEEEECC T ss_conf 6-89999759769997--899999999999984----8997999999889 No 43 >PRK06112 acetolactate synthase catalytic subunit; Validated Probab=95.40 E-value=0.055 Score=34.71 Aligned_cols=38 Identities=18% Similarity=0.367 Sum_probs=15.7 Q ss_pred HHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 11220340587426788999999999999999738973998642 Q gi|254781051|r 431 ISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVC 474 (957) Q Consensus 431 iAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~ 474 (957) +|+.++++=++| ++||++-.|.+=|++ ..+-++||+++ T Consensus 525 lA~a~G~~g~rV--~~~~eL~~Al~~A~~----~~gP~lIeV~v 562 (581) T PRK06112 525 IARACGCDGVRV--EDPASLAQALAAAEA----APGPFLIEVMT 562 (581) T ss_pred HHHHCCCEEEEE--CCHHHHHHHHHHHHH----CCCCEEEEEEE T ss_conf 998789869997--999999999999984----89939999986 No 44 >PRK08273 pyruvate decarboxylase; Provisional Probab=95.37 E-value=0.25 Score=29.74 Aligned_cols=204 Identities=18% Similarity=0.284 Sum_probs=107.4 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEE-EEECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 98999998558874322345551001502688-98359618999717877310004455323677776416665445556 Q gi|254781051|r 285 LPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCN-RQICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSIS 363 (957) Q Consensus 285 p~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~-~~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~ 363 (957) .+..++.+.....|++.....|+=.|..+... .... +....++...=..+=.=-|-++|-.-|.-+ T Consensus 367 ~p~~v~~~l~~~lp~daIv~~D~G~~~~w~~r~l~~~-~~~~~~~sg~~gsMG~glPaAIGA~lA~P~------------ 433 (597) T PRK08273 367 NPQRVFWELSPRLPDNAILTADSGSCANWYARDLRMR-RGMMASLSGGLATMGPAVPYAIAAKFAHPD------------ 433 (597) T ss_pred CHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHCCC-CCCEEEECCCCCCCCCHHHHHHHHHHHCCC------------ T ss_conf 9999999999858987389956863089999882478-997377468766645078999999985899------------ Q ss_pred CCCCCEEEEEEEECCCCCCCCCHHHHHHHH-------HCCCCCCCCCEEEEEECCC-EECC--------CCCHHCCCCCC Q ss_conf 544540478886257533346725887645-------1067744365799984160-2012--------47010246778 Q gi|254781051|r 364 LVERSRVLPLIIHGDAAFAGQGIVSETFGL-------SGLSGYTVAGNIHLIINNQ-IGFT--------TNPSSARSCTY 427 (957) Q Consensus 364 ~~~~~~vlpiliHGDAAfaGQGVV~Etl~l-------s~l~gY~tGGTiHiVvNNQ-IGFT--------T~p~d~RSs~Y 427 (957) +.|++ |-||.+|.=-|+ .|-... .++| .|-||.||+ .|.- -+|...-+... T Consensus 434 ----r~Vv~--i~GDG~f~M~~~-~EL~Ta~ry~~~~~~lp------vvvvV~NN~~~g~i~~~q~~~~g~~~~~~~~~l 500 (597) T PRK08273 434 ----RPVIA--LVGDGAMQMNGM-AELITVAKYWRQWADPR------LIVLVLNNRDLNQVTWEQRVMEGDPKFEASQQL 500 (597) T ss_pred ----CCEEE--EECCCHHHHCHH-HHHHHHHHHHHHHCCCC------EEEEEEECCHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf ----95899--981615763508-89999999764014899------799998185157999999984588776764568 Q ss_pred -C---CHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCH Q ss_conf -4---001122034058742678899999999999999973897399864211256564354223557888886718998 Q gi|254781051|r 428 -A---SDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSV 503 (957) Q Consensus 428 -~---TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v 503 (957) - ..+|+.++++=+.| ++||.+-.|.+-|++ -.+-++||+++ .-|+|.+---+.++..+ T Consensus 501 p~~d~a~~A~a~G~~g~rV--~~~~el~~Al~~Al~----~~~P~lIeV~~-------Dp~~~p~pp~~~~~~~~----- 562 (597) T PRK08273 501 PDVPYARFAELLGLKGIRV--DDPEQLGAAWDEALA----ADRPVVLEVKT-------DPNVPPLPPHITLEQAK----- 562 (597) T ss_pred CCCCHHHHHHHCCCEEEEE--CCHHHHHHHHHHHHH----CCCCEEEEEEC-------CCCCCCCCCCCCHHHHH----- T ss_conf 9999999999789979998--999999999999984----89978999971-------89999999968799999----- Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999864489898999999999999977566 Q gi|254781051|r 504 LQLYADSLMKNQVISKQELQSLANNWHKYLEAEY 537 (957) Q Consensus 504 ~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~ 537 (957) -|+..+++ |. .++..-+++..++.|+.-+ T Consensus 563 --~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~ 591 (597) T PRK08273 563 --HFASALLK-GD--PDEGGVIVETAKQVLAEVL 591 (597) T ss_pred --HHHHHHHC-CC--CCHHHHHHHHHHHHHHHHC T ss_conf --99999966-89--4212689999999999865 No 45 >cd02003 TPP_IolD Thiamine pyrophosphate (TPP) family, IolD subfamily, TPP-binding module; composed of proteins similar to Rhizobium leguminosarum bv. viciae IolD. IolD plays an important role in myo-inositol catabolism. Probab=95.29 E-value=0.29 Score=29.29 Aligned_cols=157 Identities=17% Similarity=0.128 Sum_probs=99.5 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCCEEEE-EEEC-CCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 999998558874322345551001502688-9835-96189997178773100044553236777764166654455565 Q gi|254781051|r 287 RSIFYEFKGKGPSEKEYSGDVKYHLGLCCN-RQIC-GKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISL 364 (957) Q Consensus 287 ~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~-~~~~-g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~ 364 (957) .+++++.....|++.....|+=.|..+... ..+. .+..+++.. =+-+=.=-|-++|..-|.-| T Consensus 2 ~~Vv~~L~~~lp~daIv~~D~G~~~~~~~~~~~~~~p~~~~~~~g--~g~MG~~lPaAiGA~lA~Pd------------- 66 (205) T cd02003 2 TEVLGALNEAIGDDDVVINAAGSLPGDLHKLWRARTPGGYHLEYG--YSCMGYEIAAGLGAKLAKPD------------- 66 (205) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHEECCCCCEEECCCC--CHHHHHHHHHHHHHHHHCCC------------- T ss_conf 589999996689981999889850789996530589990875887--40688999999999985899------------- Q ss_pred CCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE-E-------------CCCCCHH-------CC Q ss_conf 4454047888625753334672588764510677443657999841602-0-------------1247010-------24 Q gi|254781051|r 365 VERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI-G-------------FTTNPSS-------AR 423 (957) Q Consensus 365 ~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI-G-------------FTT~p~d-------~R 423 (957) ++|+ .|-||.+|-=-+---+|.---+|| .+.||+||+. | |-|.-++ .- T Consensus 67 ---r~Vv--~i~GDG~f~m~~~EL~Tavr~~lp------v~~vV~NN~~yg~i~~~Q~~~~~~~~~~~~~~~~~~~~~~~ 135 (205) T cd02003 67 ---REVY--VLVGDGSYLMLHSEIVTAVQEGLK------IIIVLFDNHGFGCINNLQESTGSGSFGTEFRDRDQESGQLD 135 (205) T ss_pred ---CCEE--EEECCCCCCCCHHHHHHHHHHCCC------CEEEEEECCCCHHHHHHHHHHCCCCCCCEEECCCCCCCCCC T ss_conf ---8289--997674111520489999982995------46999989942899999997446763311101456555567 Q ss_pred CCCCCCHH---HHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 67784001---12203405874267889999999999999997389739986421 Q gi|254781051|r 424 SCTYASDI---SKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCY 475 (957) Q Consensus 424 Ss~Y~TDi---AK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~Y 475 (957) ....+.|. |+.++++-+.| ++|+..-.+.+=|++ .++-++||+.+= T Consensus 136 ~~~~~~Df~~~A~a~G~~g~~V--~~~~eL~~Al~~Al~----~~gP~vIev~vd 184 (205) T cd02003 136 GALLPVDFAANARSLGARVEKV--KTIEELKAALAKAKA----SDRTTVIVIKTD 184 (205) T ss_pred CCCCCCCHHHHHHHCCCEEEEE--CCHHHHHHHHHHHHH----CCCCEEEEEEEC T ss_conf 7778988999998749708993--888999999999996----899699999937 No 46 >cd02014 TPP_POX Thiamine pyrophosphate (TPP) family, Pyruvate oxidase (POX) subfamily, TPP-binding module; composed of proteins similar to Lactobacillus plantarum POX, which plays a key role in controlling acetate production under aerobic conditions. POX decarboxylates pyruvate, producing hydrogen peroxide and the energy-storage metabolite acetylphosphate. It requires FAD in addition to TPP and a divalent cation as cofactors. Probab=95.05 E-value=0.24 Score=29.83 Aligned_cols=158 Identities=22% Similarity=0.204 Sum_probs=95.9 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEE-EEEC-CCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 98999998558874322345551001502688-9835-961899971787731000445532367777641666544555 Q gi|254781051|r 285 LPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCN-RQIC-GKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSI 362 (957) Q Consensus 285 p~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~-~~~~-g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~ 362 (957) .|..+|.+.....|++.....|+=.|..+... .++. ++....+. .=+-+=.==|-++|..-|+-+ T Consensus 3 ~P~~~~~~l~~~lp~d~iiv~D~G~~~~~~~~~~~~~~p~~~~~~~--~~g~mG~glpaAiGa~la~p~----------- 69 (178) T cd02014 3 HPERVAAELNKRAPDDAIFTIDVGNVTVWAARHLRMNGKQRFILSG--LLATMGNGLPGAIAAKLAYPD----------- 69 (178) T ss_pred CHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCCCCCEEECCC--CCCHHHHHHHHHHHHHHHCCC----------- T ss_conf 9899999999448998099985838799999977858998487378--865377899999999984899----------- Q ss_pred CCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE-ECCC---CCHHCC---CCCCC---CHHH Q ss_conf 654454047888625753334672588764510677443657999841602-0124---701024---67784---0011 Q gi|254781051|r 363 SLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI-GFTT---NPSSAR---SCTYA---SDIS 432 (957) Q Consensus 363 ~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI-GFTT---~p~d~R---Ss~Y~---TDiA 432 (957) ++|++|. ||.+|-=-+--.+|..--+|| .+.||.||.- |... .....+ ....+ ..+| T Consensus 70 -----~~Vv~i~--GDG~f~m~~~El~Ta~~~~lp------i~~vV~nN~~~g~i~~~q~~~~~~~~~~~~~~~d~~~lA 136 (178) T cd02014 70 -----RQVIALS--GDGGFAMLMGDLITAVKYNLP------VIVVVFNNSDLGFIKWEQEVMGQPEFGVDLPNPDFAKIA 136 (178) T ss_pred -----CEEEEEE--CCCHHHCCHHHHHHHHHHCCC------CEEEEEECCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHH T ss_conf -----8089997--770674061899989985898------139999896158999999985799752537999889999 Q ss_pred HHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 220340587426788999999999999999738973998642 Q gi|254781051|r 433 KSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVC 474 (957) Q Consensus 433 K~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~ 474 (957) |.++++-+.|. +++....+.+-|++ .++-++||+.+ T Consensus 137 ~a~G~~~~~v~--~~~el~~al~~a~~----~~gP~lieV~v 172 (178) T cd02014 137 EAMGIKGIRVE--DPDELEAALDEALA----ADGPVVIDVVT 172 (178) T ss_pred HHCCCEEEEEC--CHHHHHHHHHHHHH----CCCCEEEEEEC T ss_conf 97697079978--99999999999996----89979999980 No 47 >cd02010 TPP_ALS Thiamine pyrophosphate (TPP) family, Acetolactate synthase (ALS) subfamily, TPP-binding module; composed of proteins similar to Klebsiella pneumoniae ALS, a catabolic enzyme required for butanediol fermentation. ALS catalyzes the conversion of 2 molecules of pyruvate to acetolactate and carbon dioxide. ALS does not contain FAD, and requires TPP and a divalent metal cation for activity. Probab=95.05 E-value=0.26 Score=29.63 Aligned_cols=156 Identities=17% Similarity=0.174 Sum_probs=97.7 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCEEEE-EEE-CCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 8999998558874322345551001502688-983-59618999717877310004455323677776416665445556 Q gi|254781051|r 286 PRSIFYEFKGKGPSEKEYSGDVKYHLGLCCN-RQI-CGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSIS 363 (957) Q Consensus 286 ~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~-~~~-~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~ 363 (957) |..++.+.....|++.....|+=.|..+... ..+ ..+....+ ..=+.+=.==|.++|-.-|.-+ T Consensus 1 P~~v~~~l~~~lp~d~iiv~D~G~~~~~~~~~~~~~~p~~~~~s--~~~g~mG~glpaAiGa~lA~p~------------ 66 (177) T cd02010 1 PQRIVHDLRAVMGDDDIVLLDVGAHKIWMARYYRTYAPNTCLIS--NGLATMGVALPGAIGAKLVYPD------------ 66 (177) T ss_pred CHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHCCCCCCCEEECC--CCCCHHHHHHHHHHHHHHHCCC------------ T ss_conf 97999999976899959998585879999996775899908748--9863688899999999987899------------ Q ss_pred CCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE-ECCC---CCHHCCCCCCCC--------HH Q ss_conf 54454047888625753334672588764510677443657999841602-0124---701024677840--------01 Q gi|254781051|r 364 LVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI-GFTT---NPSSARSCTYAS--------DI 431 (957) Q Consensus 364 ~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI-GFTT---~p~d~RSs~Y~T--------Di 431 (957) +.|++| -||.+|.=.+---+|..--+|| .+.||+||+- |... ..... ..+.+ .+ T Consensus 67 ----~~Vv~i--~GDG~f~m~~~EL~Ta~~~~lp------i~iiV~NN~~~g~i~~~q~~~~~--~~~~~~~~~~d~~~l 132 (177) T cd02010 67 ----RKVVAV--SGDGGFMMNSQELETAVRLKIP------LVVLIWNDNGYGLIKWKQEKEYG--RDSGVDFGNPDFVKY 132 (177) T ss_pred ----CEEEEE--ECCCCCEECCHHHHHHHHHCCC------EEEEEEECCHHHHHHHHHHHHCC--CCCCCCCCCCCHHHH T ss_conf ----859999--2898611066688899972884------03999989804899999998639--753461899998999 Q ss_pred HHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 12203405874267889999999999999997389739986421 Q gi|254781051|r 432 SKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCY 475 (957) Q Consensus 432 AK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~Y 475 (957) |+.++++-+.| ++|+..-.+.+-|++ -++-++||++.= T Consensus 133 A~a~G~~~~~V--~~~~el~~al~~a~~----~~gp~liev~vD 170 (177) T cd02010 133 AESFGAKGYRI--ESADDLLPVLERALA----ADGVHVIDCPVD 170 (177) T ss_pred HHHCCCEEEEE--CCHHHHHHHHHHHHH----CCCCEEEEEEEC T ss_conf 99879879998--999999999999984----899599999986 No 48 >KOG0523 consensus Probab=94.76 E-value=0.38 Score=28.35 Aligned_cols=412 Identities=17% Similarity=0.210 Sum_probs=209.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCC Q ss_conf 45532367777641666544555654454047888625753334672588764510677443657999841602012470 Q gi|254781051|r 340 PVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNP 419 (957) Q Consensus 340 PVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p 419 (957) .++.|+|-++-. .+. -.++|.+|+ ||.... .|+++|+++++. .|..+--|-|.=||||+--+.- T Consensus 125 s~a~GmA~~~k~-~~k----------~~~rv~~vl--GDG~~~-eG~~~Ea~s~Ag--~l~ldnLVai~D~n~is~~g~t 188 (632) T KOG0523 125 SNAVGMAYAGKH-LGK----------ASNRVYCVL--GDGCLT-EGSVWEAMSLAG--HLKLDNLVAIYDNNKISIDGAT 188 (632) T ss_pred HHHHHHHHHHHH-HCC----------CCCEEEEEE--CCCCCC-CCHHHHHHHHHH--HCCCCCEEEEECCCCCCCCCCC T ss_conf 777789988876-412----------446089997--685001-441888876640--2254878999726644578986 Q ss_pred HHCCCCCCCCHHHH----HHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHH Q ss_conf 10246778400112----20340587426788999999999999999738973998642112565643542235578888 Q gi|254781051|r 420 SSARSCTYASDISK----SIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYK 495 (957) Q Consensus 420 ~d~RSs~Y~TDiAK----~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~ 495 (957) .. -.+-||.| +++--+.-|+++|.|+..++..-|. +.=+|--.|-.-.+ T Consensus 189 ~~----~~~~dV~~~r~ea~g~~~~~V~~~d~d~i~ka~~~a~---~~k~kpt~i~~~t~-------------------- 241 (632) T KOG0523 189 SL----GFDEDVYQLRFEAFGWNVIIVDGGDVDEIRKAIGKAK---SVKGKPTAIKATTF-------------------- 241 (632) T ss_pred CC----CCCCCHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHH---HCCCCCEEEEEEEE-------------------- T ss_conf 55----7674278888887485579874858899999986554---22688503554555-------------------- Q ss_pred HHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC--CCCCCCCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 8671899899999998644898989999999999999775666541258--76566665686266677899999999887 Q gi|254781051|r 496 TIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYC--PEKLGLLHNGENERKNSVSKEILKKIGSS 573 (957) Q Consensus 496 ~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~--~~~~~~~~~~~~~~~T~v~~~~L~~i~~~ 573 (957) +..|....+ ...+... ..++...+..|+.. |...- .-. +.+. .+|....+..-.. T Consensus 242 ----------------~g~G~~~ig-~~~~Hg~--pl~~~~~~~~k~~~~~P~~~~--~v~-~~~~-~~p~~~~~~~~~~ 298 (632) T KOG0523 242 ----------------IGRGSPYIG-SESVHGA--PLGEDDVERVKSIKGLPVLIF--VVP-EKVK-LYPEKPVEDARAI 298 (632) T ss_pred ----------------EECCCCCCC-CCCCCCC--CCHHHHHHHHHHHCCCCCEEE--EEC-CCCC-CCCCCCCCCCCCC T ss_conf ----------------404755435-5322588--550668998886418762368--716-1125-4777540124465 Q ss_pred HCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH-CCCEE-EEEC--CCCCCCCCCCCEEEEEECCCC Q ss_conf 4368998620567788899899873343564211478988898863-58315-7850--100223332302799977888 Q gi|254781051|r 574 IIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCY-EGYKV-RLSG--QDCERGTFSHRHAILCDQETG 649 (957) Q Consensus 574 l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~-eG~~V-RlsG--QD~~RGTFshRHavl~dq~t~ 649 (957) ...+|+.-.-++. .=..-|=+-|--.+|+.-|. ++.-. |+-| -|-+-+||+ +.. T Consensus 299 ~~~i~~~~~~~~p-------------~yk~~Dk~~Atrk~~~~aL~~l~~~~~~vI~~~ad~~~st~t---------d~~ 356 (632) T KOG0523 299 SVRIPKIWEKSLP-------------TYKVGDKAVATRKAFGEALAALAEADPRVIGGSADLKNSTLT---------DFF 356 (632) T ss_pred CCCCCCCCCCCCC-------------CCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHH---------HHC T ss_conf 5676655323577-------------445586005479999999999764285648875466787166---------543 Q ss_pred CEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEE Q ss_conf 44556344278984599982431111013432022113865311100002442454102256554202576233368135 Q gi|254781051|r 650 KRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWLCTSNLVC 729 (957) Q Consensus 650 ~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~~sglvl 729 (957) .+-. +.+| .|--.-|....||--|-+...+ . . +=-|-|+|=-|-.+||---++- .++.++. T Consensus 357 ~~~~--------p~R~--i~~giaEq~mv~ia~G~a~~g~-~-~---Pf~~tf~~F~trA~dqvr~~a~----s~~~v~~ 417 (632) T KOG0523 357 PKRF--------PERF--IECGIAEQNMVGIANGIACRGR-T-I---PFCGTFAAFFTRAFDQVRMGAL----SQANVIY 417 (632) T ss_pred CCCC--------CCCE--EEEEEEHHHHHHHHHCHHCCCC-C-C---CHHHHHHHHHHHHHHHEEEHHH----CCCCCEE T ss_conf 1227--------6642--7873202116775402311788-6-6---5138799999875302443223----1688579 Q ss_pred ECCCCCCCC---CCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCC-HHHCCCCCCCCHH Q ss_conf 436888888---876564405788752143782697248899999999997435348868985660-2210801268878 Q gi|254781051|r 730 LLPHGYEGQ---GPEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPK-SLLRHKRVVSSLS 805 (957) Q Consensus 730 lLPHGyeGq---GPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPK-sLLR~~~a~S~l~ 805 (957) ..-|-==|- ||-| +-+|-.--+.+-.||.|-.|+--.-.+++.---+...- .|-|.-++. ++--.+ -++.+ T Consensus 418 v~th~~i~~GeDGPth--~~iedla~frsiPn~~v~~PaD~~et~~av~~Aa~~~~-~p~i~~~~r~~~~~~~--~~~~~ 492 (632) T KOG0523 418 VATHDSIGLGEDGPTH--QPIEDLAMFRSIPNMIVFRPADGNETENAVATAANTKG-TPSIRTLSRQNLPIYN--NTEIE 492 (632) T ss_pred EEEECCCCCCCCCCCC--CCHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHCCC-CEEEEEECCCCCCCCC--CCCHH T ss_conf 9970451136888655--63889999870788547710763789999999870489-6069984777656568--96554 Q ss_pred HHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCCC--CCEEEEEEECCCCCHHHHHHH Q ss_conf 9379986035157764334445444579545768998688578999999997699--868999840148682899999 Q gi|254781051|r 806 DMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRNI--ADIYLLRIEQLYPFPEDYLIK 881 (957) Q Consensus 806 d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~~--~~vaivRiEqLyPfP~~~l~~ 881 (957) +...|. -|+-|. + -+-+++-+|---|.-+++-+.... -++-+|-.=-..||-...|.+ T Consensus 493 ~igkg~---~vl~~~--------------~-~dV~LiG~Gs~v~~cl~AA~~L~e~gi~vrVvd~~~~kplD~~li~~ 552 (632) T KOG0523 493 EIGKGK---YVLQEV--------------E-PDVILIGTGSEVQECLEAAELLSEDGIKVRVVDPFTWKPLDVALIRS 552 (632) T ss_pred HHCCCC---EEEECC--------------C-CCEEEEECCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHH T ss_conf 532650---798638--------------8-87799926589999999999998649638995453200023587765 No 49 >PRK09627 oorA 2-oxoglutarate-acceptor oxidoreductase subunit OorA; Reviewed Probab=94.50 E-value=0.094 Score=32.95 Aligned_cols=89 Identities=21% Similarity=0.220 Sum_probs=61.5 Q ss_pred CCEEEEECCHHHHHHHHHHHHCCC--CCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHH Q ss_conf 768998688578999999997699--868999840148682899999985379983998067853178457899769999 Q gi|254781051|r 837 IRRVILCTGKVYYDLLDNRDMRNI--ADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYLEKV 914 (957) Q Consensus 837 v~rvi~csGKvyydL~~~r~~~~~--~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl~~~ 914 (957) .+-+|++.|=.|-...++.+.... ..|.++|+-.|+|||.++|++.++++.++ -+=| .|+|.+.-. ++.. T Consensus 275 Ae~~iV~~GSt~~~~~eAv~~Lr~~G~kvg~l~~~~l~PfP~~~i~e~l~~~k~v---~VvE--~N~Gq~~~~---i~~~ 346 (375) T PRK09627 275 AEILIIAYGSVSLSAKEAINRLREEGIKVGLFRPITLWPSPAKRLKEIGDKFEKI---LVIE--LNMGQYLEE---IERV 346 (375) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEEEECCCCHHHHHHHHHCCCEE---EEEE--CCCHHHHHH---HHHH T ss_conf 8789999564378999999999974990537975245799989999999579989---9980--781599999---9997 Q ss_pred HHHCCCCCCEEEEEECCCCC Q ss_conf 87505677601897536646 Q gi|254781051|r 915 LHSIGADYSRVRYVGRLPSA 934 (957) Q Consensus 915 l~~~~~~~~~~~y~gR~~~a 934 (957) ... .....-++|-|||-.+ T Consensus 347 ~~~-~~~~~i~k~~G~pf~~ 365 (375) T PRK09627 347 MQR-DDFHFLGKANGRPISP 365 (375) T ss_pred HCC-CCCCEEEEECCCCCCH T ss_conf 389-8751570069985799 No 50 >COG0674 PorA Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, alpha subunit [Energy production and conversion] Probab=94.23 E-value=0.13 Score=31.95 Aligned_cols=93 Identities=22% Similarity=0.154 Sum_probs=62.3 Q ss_pred HHCCEEEEECCHHHHHHHHHHHHCC---CCCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCC-HHHHHHH Q ss_conf 4576899868857899999999769---98689998401486828999999853799839980678531784-5789976 Q gi|254781051|r 835 SHIRRVILCTGKVYYDLLDNRDMRN---IADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGA-WTFIEPY 910 (957) Q Consensus 835 ~~v~rvi~csGKvyydL~~~r~~~~---~~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~Ga-W~~v~~r 910 (957) +..++||+|.|...-...++-.... -.+|.++|+-.++|||.+++.+.+.+.. +..|-.+| .|.|. |.-++-. T Consensus 255 ~DAe~viV~~Gss~~~~~~a~~~~~~~~g~kvg~l~vr~~rPFp~~~i~~~l~~~~-~~~Vl~~e--~~~g~~~~~l~~~ 331 (365) T COG0674 255 EDAEIVIVAMGSSKGSTAEAVVDLLRDKGEKVGLLKVRTLRPFPAEEIREVLPKTN-AVVVLDVE--ISLGGLAEPLYLE 331 (365) T ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCHHHHHHHHCCCC-EEEEEEEE--CCCCCCHHHHHHH T ss_conf 97278999967524868999999887418546899997756899899997613462-26999982--3675210028886 Q ss_pred HHHHHHHCCCCCCEEEEEECCCCC Q ss_conf 999987505677601897536646 Q gi|254781051|r 911 LEKVLHSIGADYSRVRYVGRLPSA 934 (957) Q Consensus 911 l~~~l~~~~~~~~~~~y~gR~~~a 934 (957) ....+ ......++-||+-.. T Consensus 332 ~~~~~----~~~~~~~~~g~~~~~ 351 (365) T COG0674 332 VLAAL----YVESRYKLGGRDFTP 351 (365) T ss_pred HHHHE----EECCCCCCCCEECCH T ss_conf 43120----102202218863699 No 51 >PRK05899 transketolase; Reviewed Probab=94.03 E-value=0.58 Score=26.99 Aligned_cols=457 Identities=16% Similarity=0.161 Sum_probs=210.5 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEE-CCCEECCC Q ss_conf 44553236777764166654455565445404788862575333467258876451067744365799984-16020124 Q gi|254781051|r 339 DPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLII-NNQIGFTT 417 (957) Q Consensus 339 nPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVv-NNQIGFTT 417 (957) =+++.|.+.|........ +. ........+|.+|+ ||++.. .|+.+|.||++.-.+-+ -+=+|+ +|.+.--. T Consensus 124 is~a~G~A~a~~~~~~~~-~~-~~~~~~~~~v~avi--GDG~l~-eG~a~EAln~a~~~~l~---nLIvI~ndN~~sIdg 195 (661) T PRK05899 124 LANAVGMALAEKLLAARF-NR-PGHDIVDHYTYVLL--GDGCLM-EGISHEACSLAGHLKLG---NLIVIYDDNRISIDG 195 (661) T ss_pred HHHHHHHHHHHHHHHHHH-CC-CCCCCCCCEEEEEE--EECCCC-CHHHHHHHHHHCCCCCC---CEEEEEECCCEEECC T ss_conf 999999999999986652-87-75566577499998--206435-67899999864504689---889999479700066 Q ss_pred CCHHCCCCCCCCHHHHHHCC---CEEEE-CCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHH Q ss_conf 70102467784001122034---05874-267889999999999999997389739986421125656435422355788 Q gi|254781051|r 418 NPSSARSCTYASDISKSIGI---PIFHV-NGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMM 493 (957) Q Consensus 418 ~p~d~RSs~Y~TDiAK~i~a---PI~HV-N~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~m 493 (957) +-. .....|+.+.+.+ =++.| +|.|++++..|.+-|.+ .+ .|-++|-. .-.-|+.- |.+-.+.. T Consensus 196 ~v~----~~~~~~l~~rf~a~Gw~vi~v~DGhD~~~I~~Ai~~Ak~--~~-~kPtvI~v--~TiKG~G~---p~~e~~~~ 263 (661) T PRK05899 196 DTS----GWFSEDTKKRFEAYGWHVIEVVDGHDVEAIDAAIEEAKA--ET-DKPTLIIA--KTIIGKGS---PNKEGTHK 263 (661) T ss_pred CCC----CCCHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHH--CC-CCCEEEEE--EECCCCCC---CCCCCCHH T ss_conf 404----210598999999769867843799999999999999986--46-99868999--82366787---32346513 Q ss_pred HHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCC------------CCCCCCCC-- Q ss_conf 8886718998999999986448989899999999999997756665412587656666------------56862666-- Q gi|254781051|r 494 YKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPEKLGLL------------HNGENERK-- 559 (957) Q Consensus 494 Y~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~~~~~~------------~~~~~~~~-- 559 (957) .| .+.++.+++...++.+. - +...-+.|...... ..|..... T Consensus 264 -----~H-------------g~~L~~ee~~~~~~~lg----~--~~~~f~~~~~v~~~~~~~~~~g~~~~~~~~~~~~~~ 319 (661) T PRK05899 264 -----VH-------------GAPLGAEEIAAARKALG----W--DYEPFEVPEEVYAHWRKAKERGAKAEAEWNEKFAAY 319 (661) T ss_pred -----HH-------------CCCCCHHHHHHHHHHCC----C--CCCCCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf -----32-------------58755168999999629----9--986666886888988752002337899999999985 Q ss_pred CCCCHHHHHHHHHHHC-CCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHC---CCE--EEEECCCCCC Q ss_conf 7789999999988743-689986205677888998998733435642114789888988635---831--5785010022 Q gi|254781051|r 560 NSVSKEILKKIGSSII-RLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYE---GYK--VRLSGQDCER 633 (957) Q Consensus 560 T~v~~~~L~~i~~~l~-~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~e---G~~--VRlsGQD~~R 633 (957) .....+...+..+.+. ..|.....- ...+.. .|.--|||..|.+ -.| |=+|. |..- T Consensus 320 ~~~~p~~~~~~~~~~~~~~~~~~~~~-------------~~~~~~----~Atr~afg~~L~~l~~~~p~ivg~sA-Dl~~ 381 (661) T PRK05899 320 AAAYPELAAEFERRAGELPAGWDAAL-------------PEKGKK----VATRKASGKALNALAAALPELVGGSA-DLAG 381 (661) T ss_pred HHHCHHHHHHHHHHHCCCCCCHHHHH-------------CCCCCC----HHHHHHHHHHHHHHHHHCCCEEEEEC-CCCC T ss_conf 34376789999875246674023331-------------124653----04999999999999854876178704-5478 Q ss_pred CCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHH Q ss_conf 33323027999778884455634427898459998243111101343202211386531110000244245410225655 Q gi|254781051|r 634 GTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQF 713 (957) Q Consensus 634 GTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqf 713 (957) .|... +. +.. .|... .-+++| +|-=..|.+..++--|.++.. .+.-+-+-|.=|. |-..||- T Consensus 382 St~~~----~~----~~~-~~~~e--~~~~r~--~~~GIaE~~mv~~aaGla~~G--~~~P~~~tf~~F~---~ra~dqi 443 (661) T PRK05899 382 SNNTE----IK----GSK-DFTPD--DYSGRY--IHYGVREFAMAAIMNGLALHG--GFIPFGGTFLVFS---DYARNAI 443 (661) T ss_pred CCCCC----CC----CCC-CCCCC--CCCCCC--CCCCCCHHHHHHHHHHHHHCC--CCCCEEEEHHHHH---HHHHHHH T ss_conf 86644----25----577-76765--789873--244620899999999998549--9884012079899---9989999 Q ss_pred HHHHHHHHCCCCCCEEECCC-CCC--CCCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEEC Q ss_conf 42025762333681354368-888--888765644057887521437826972488999999999974353488689856 Q gi|254781051|r 714 ITAGEQKWLCTSNLVCLLPH-GYE--GQGPEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMA 790 (957) Q Consensus 714 iss~e~KW~~~sglvllLPH-Gye--GqGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~t 790 (957) --++- ++ ...++.+-| |.. .-||-|. =+|-.-.|..=.||.|..|+.+.+.-++++.-+.+. ..|-.+-. T Consensus 444 r~~al---~~-~~v~~v~t~~Gi~vG~DG~THq--~i~dla~lr~iPn~~v~~P~d~~E~~~~~~~a~~~~-~gp~~ir~ 516 (661) T PRK05899 444 RLAAL---MK-LPVIYVFTHDSIGVGEDGPTHQ--PVEQLASLRAIPNLTVIRPADANETAAAWKLALERK-DGPSALVL 516 (661) T ss_pred HHHHH---HC-CCEEEEEECCCCCCCCCCCCHH--HHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHHCC-CCCEEEEE T ss_conf 98876---04-8749999787523489986612--467899985589967981599999999999999669-99789996 Q ss_pred CCHHHCC-CCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCCCC--CEEEEE Q ss_conf 6022108-0126887893799860351577643344454445795457689986885789999999976998--689998 Q gi|254781051|r 791 PKSLLRH-KRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRNIA--DIYLLR 867 (957) Q Consensus 791 PKsLLR~-~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~~~--~vaivR 867 (957) |+.-+-. .....+.+.+..|. .. ++.+. .-+=.|++||-....-+++.+....+ ++.+|- T Consensus 517 ~r~~~p~~~~~~~~~~~~~~G~-~~-~~~~~---------------~~dv~iia~G~~v~~al~Aa~~L~~~gi~~~Vv~ 579 (661) T PRK05899 517 SRQNLPVLETTEEQDEGVAKGG-YV-LKDSG---------------GPDVILIATGSEVHLALEAAKELEAEGIKVRVVS 579 (661) T ss_pred CCCCCCCCCCCCCCHHHHCCCC-EE-EEECC---------------CCCEEEEEECHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 6887787787765144421661-89-99879---------------9988999832999999999999986499869997 Q ss_pred EECCCCCHHH---HHHHHHHHCCCC-EEEEECCCCCCCC Q ss_conf 4014868289---999998537998-3998067853178 Q gi|254781051|r 868 IEQLYPFPED---YLIKVLSRFVQA-EIVWCQEEPQNMG 902 (957) Q Consensus 868 iEqLyPfP~~---~l~~~l~~y~~~-e~vW~QEEp~N~G 902 (957) +--+=||..+ .+..+|+. +. .+|=+ |+..-+| T Consensus 580 ~~s~~pld~~~~~~~~~vl~~--~~~~ivtv-E~~~~~g 615 (661) T PRK05899 580 MPSTELFDEQDAAYKESVLPA--AVTKRVAV-EAGVADG 615 (661) T ss_pred CCCCCHHHCCCHHHHHHHCCC--CCCCEEEE-CCCHHHH T ss_conf 688724546749899864276--78838998-4884765 No 52 >PRK07525 sulfoacetaldehyde acetyltransferase; Validated Probab=93.90 E-value=0.38 Score=28.33 Aligned_cols=39 Identities=8% Similarity=0.087 Sum_probs=19.2 Q ss_pred HHHHCCCCCEEEEECCCHHHHHHHHHHHHCC--CCCCCEEE Q ss_conf 8752143782697248899999999997435--34886898 Q gi|254781051|r 750 FLQMCAENNMYVANCTSPANYFHILRRQIYD--RSSRPLIM 788 (957) Q Consensus 750 fLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r--~~rkPLiv 788 (957) |-+++.--...-.-.+||+.+=-+|+|=+-. .--+|-+| T Consensus 518 f~~~A~a~G~~g~rV~~~~el~~al~~Al~~~~~~g~P~li 558 (589) T PRK07525 518 YAGIARAMGAEGVVVDTQEELGPALKRAIDAQMNEGKTTVI 558 (589) T ss_pred HHHHHHHCCCEEEEECCHHHHHHHHHHHHHHCCCCCCCEEE T ss_conf 89999977987999899999999999999733358994899 No 53 >PRK06154 hypothetical protein; Provisional Probab=93.90 E-value=0.3 Score=29.16 Aligned_cols=24 Identities=21% Similarity=0.498 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 9999999999987379107998067 Q gi|254781051|r 246 IIPAIEEIIRQGVQQGVDEMILGMA 270 (957) Q Consensus 246 lIP~L~~ii~~a~~~Gv~eiviGMa 270 (957) ||-.-|.|++.-.+.||+.|. |+| T Consensus 11 ~m~~ad~i~~~L~~~GV~~iF-G~P 34 (556) T PRK06154 11 LMKAVDEVARILKAEGTELLF-GFP 34 (556) T ss_pred CCCHHHHHHHHHHHCCCCEEE-ECC T ss_conf 755999999999987998999-554 No 54 >COG0028 IlvB Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [Amino acid transport and metabolism / Coenzyme metabolism] Probab=93.70 E-value=0.12 Score=32.08 Aligned_cols=162 Identities=20% Similarity=0.243 Sum_probs=97.0 Q ss_pred CHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEE-EECCCEEEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 989999985588743223455510015026889-8359618999717877310004455323677776416665445556 Q gi|254781051|r 285 LPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNR-QICGKDVKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSIS 363 (957) Q Consensus 285 p~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~-~~~g~~v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~ 363 (957) .|..++.+.....+.+...+.||=.|.-+.... .+. ++=+.-....-..+=.--|-++|..-|.-+ T Consensus 360 ~p~~v~~~l~~~~~~daiv~~d~G~~~~w~~~~~~~~-~p~~~~~s~~~GtMG~glPaAIGAkla~P~------------ 426 (550) T COG0028 360 KPQYVIKVLRELLPDDAIVVTDVGQHQMWAARYFDFY-RPRRFLTSGGLGTMGFGLPAAIGAKLAAPD------------ 426 (550) T ss_pred CHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHCCCC-CCCCEEECCCCCCCCCHHHHHHHHHHHCCC------------ T ss_conf 8899999999866998799978728689999845557-887156579986111248999998863899------------ Q ss_pred CCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCC-EECCCCCHHC----C-CCCC-----CCHHH Q ss_conf 5445404788862575333467258876451067744365799984160-2012470102----4-6778-----40011 Q gi|254781051|r 364 LVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQ-IGFTTNPSSA----R-SCTY-----ASDIS 432 (957) Q Consensus 364 ~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQ-IGFTT~p~d~----R-Ss~Y-----~TDiA 432 (957) ++|++|. ||.+|.=. ++|-.... -|+. -.+.||.||+ .|+...-.+. | |..+ -.-+| T Consensus 427 ----r~Vv~i~--GDG~F~m~--~qEL~Ta~---r~~l-pv~ivv~nN~~~g~v~~~q~~~~~~~~~~~~~~~~~f~klA 494 (550) T COG0028 427 ----RKVVAIA--GDGGFMMN--GQELETAV---RYGL-PVKIVVLNNGGYGMVRQWQELFYGGRYSGTDLGNPDFVKLA 494 (550) T ss_pred ----CCEEEEE--CCCHHHCC--HHHHHHHH---HHCC-CEEEEEEECCCHHHHHHHHHHHCCCCCCCCCCCCCCHHHHH T ss_conf ----8289997--02368507--98999999---8689-97999997844345089988734778551115887689999 Q ss_pred HHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCC Q ss_conf 220340587426788999999999999999738973998642112 Q gi|254781051|r 433 KSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRR 477 (957) Q Consensus 433 K~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr 477 (957) +.+++.-+.|. +||.+-.+.+=|++-.-. ++||.+-=+. T Consensus 495 ea~G~~g~rv~--~~~el~~al~~al~~~~p----~lidv~id~~ 533 (550) T COG0028 495 EAYGAKGIRVE--TPEELEEALEEALASDGP----VLIDVVVDPE 533 (550) T ss_pred HHCCCEEEEEC--CHHHHHHHHHHHHHCCCC----EEEEEEECCC T ss_conf 97798259968--889999999999707998----8999983776 No 55 >TIGR03457 sulphoacet_xsc sulfoacetaldehyde acetyltransferase. Members of this protein family are sulfoacetaldehyde acetyltransferase, an enzyme of taurine utilization. Taurine, or 2-aminoethanesulfonate, can be used by bacteria as a source of carbon, nitrogen, and sulfur. Probab=93.40 E-value=0.4 Score=28.21 Aligned_cols=15 Identities=13% Similarity=0.195 Sum_probs=5.7 Q ss_pred ECCCHHHHHHHHHHH Q ss_conf 248899999999997 Q gi|254781051|r 763 NCTSPANYFHILRRQ 777 (957) Q Consensus 763 ~~ttpAq~FH~LRRQ 777 (957) -.++|.++=-+|+|= T Consensus 526 ~V~~~~el~~Al~~A 540 (579) T TIGR03457 526 VVDKPEDVGPALKKA 540 (579) T ss_pred EECCHHHHHHHHHHH T ss_conf 979999999999999 No 56 >COG4231 Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits [Energy production and conversion] Probab=93.30 E-value=0.75 Score=26.12 Aligned_cols=211 Identities=17% Similarity=0.146 Sum_probs=117.2 Q ss_pred CCCEEEEEECCCEECCC--CCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCC Q ss_conf 36579998416020124--7010246778400112203405874267889999999999999997389739986421125 Q gi|254781051|r 401 VAGNIHLIINNQIGFTT--NPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRF 478 (957) Q Consensus 401 tGGTiHiVvNNQIGFTT--~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~ 478 (957) +.|-+=||+---=|--. +..|+| . .+|.-..|||-=- ||-++....+.|+|.-.+|+.-|++ -|.-|- T Consensus 108 v~GGlviv~aDDpg~~SSqneqdsr--~----y~~~a~iPvLeP~--d~Qea~d~~~~afelSe~~~~pVil--r~ttr~ 177 (640) T COG4231 108 VTGGLVIVVADDPGMHSSQNEQDSR--A----YGKFALIPVLEPS--DPQEAYDYVKYAFELSEKSGLPVIL--RTTTRV 177 (640) T ss_pred CCCCEEEEECCCCCCCCCCCHHHHH--H----HHHHCCCEEECCC--CHHHHHHHHHHHHHHHHHHCCCEEE--EEEEEE T ss_conf 6660899981698864453566768--9----8885076120689--8699999999998777873897899--998643 Q ss_pred CCCCCCCCCCCCHHHHHHHHHCCCHHHH-HHHHHHHCCCCCHHHH-HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC-CC Q ss_conf 6564354223557888886718998999-9999864489898999-9999999999775666541258765666656-86 Q gi|254781051|r 479 GHNEGDEPSFTQPMMYKTIRSHKSVLQL-YADSLMKNQVISKQEL-QSLANNWHKYLEAEYKESESYCPEKLGLLHN-GE 555 (957) Q Consensus 479 GHNE~DeP~fTQP~mY~~I~~hp~v~~~-Y~~~Li~~g~it~~e~-~~~~~~~~~~l~~~~~~~k~~~~~~~~~~~~-~~ 555 (957) +|-.+|= -...++..+..+.. ...+-....+.-++.. ...+.....+++++.+......-++..+..+ .. T Consensus 178 ~h~~~~V-------~~~~~~~~~~~~~~~~~~k~~~~~V~~p~~~~~~~~~~l~~k~~a~~~~~~~~~~n~v~~~~~~~l 250 (640) T COG4231 178 SHSRGDV-------EVGLNRRPIVEPEDEFFIKDPGRYVRVPANALRHRHRKLLEKWEAAEEFINANPLNRVEGSDDAKL 250 (640) T ss_pred ECCCEEE-------EECCCCCCCCCCCCCCCCCCCCCEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCE T ss_conf 0221028-------824666787765212434786632306742112567889999999999986486563136887856 Q ss_pred CCCCCCCCHHHHHHHHHHHCCCCCCCC---CCHHHHHHHHHHHHH------HCCCCCCCCHHHHHHHHHHHHHCCCEEEE Q ss_conf 266677899999999887436899862---056778889989987------33435642114789888988635831578 Q gi|254781051|r 556 NERKNSVSKEILKKIGSSIIRLPKSFN---THKIVERLMANRQKM------IETGKGIDWSMAESLAFGSLCYEGYKVRL 626 (957) Q Consensus 556 ~~~~T~v~~~~L~~i~~~l~~~Pe~f~---~h~~v~k~~~~R~~~------~~~~~~iDW~~AE~LA~~SLl~eG~~VRl 626 (957) --+.-|++...+++..+.+---.+=|+ +||--.+.+.+=.+- +|.+++ .-|.=.-..|..+|.+|.+ T Consensus 251 GII~~G~ay~yVkeAl~~lgl~~~~lklg~~~Plp~~~i~~F~~g~~~vlVVEE~~P----~iE~qv~~~l~~~~~~~~v 326 (640) T COG4231 251 GIIASGIAYNYVKEALEDLGLDDELLKLGTPYPLPEQLIENFLKGLERVLVVEEGEP----FIEEQVKALLYDAGLPVEV 326 (640) T ss_pred EEEECCCCHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHCCCEEEEEECCCH----HHHHHHHHHHHHCCCCEEE T ss_conf 899357217999999997588741789458757888999999826847999955870----4899999998754885474 Q ss_pred ECCCCC Q ss_conf 501002 Q gi|254781051|r 627 SGQDCE 632 (957) Q Consensus 627 sGQD~~ 632 (957) -|-|-+ T Consensus 327 ~GKd~g 332 (640) T COG4231 327 HGKDEG 332 (640) T ss_pred ECCCCC T ss_conf 034456 No 57 >cd02008 TPP_IOR_alpha Thiamine pyrophosphate (TPP) family, IOR-alpha subfamily, TPP-binding module; composed of proteins similar to indolepyruvate ferredoxin oxidoreductase (IOR) alpha subunit. IOR catalyzes the oxidative decarboxylation of arylpyruvates, such as indolepyruvate or phenylpyruvate, which are generated by the transamination of aromatic amino acids, to the corresponding aryl acetyl-CoA. Probab=93.07 E-value=0.49 Score=27.56 Aligned_cols=157 Identities=25% Similarity=0.286 Sum_probs=86.5 Q ss_pred EEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCC-CCCEEEEEEECCCEEEEEECCCCCCCHHCCHHHH Q ss_conf 998067655268999885379899999855887432234555100-1502688983596189997178773100044553 Q gi|254781051|r 265 MILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKY-HLGLCCNRQICGKDVKLLLQSNPSHLEFVDPVVI 343 (957) Q Consensus 265 iviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKY-HLG~s~~~~~~g~~v~lsL~~NPSHLEaVnPVv~ 343 (957) .+=|-+||+=+..|.+++.+ +....+|+=- -+++.... ..-..+. +.-.--|++. T Consensus 6 ~CpGCgh~~~~~al~~~~~~---------------~~v~~~~IGC~~~~~~~~~--~~~~~~~-------~mG~~~~~A~ 61 (178) T cd02008 6 LCPGCPHRPSFYALRKAFKK---------------DSIVSGDIGCYTLGALPPL--NAIDTCT-------CMGASIGVAI 61 (178) T ss_pred CCCCCCCCHHHHHHHHHHCC---------------CCEEEECCCHHHHHHCCCC--CEEEEEE-------CCCCCHHHHH T ss_conf 68899985999999974015---------------8679816371014126883--0000111-------3577378997 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCC----CC Q ss_conf 23677776416665445556544540478886257533346725887645106774436579998416020124----70 Q gi|254781051|r 344 GSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTT----NP 419 (957) Q Consensus 344 G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT----~p 419 (957) |..+|..| .+|+.+. ||..|..-|+ .|..+..+-. -..+.||.||++-=-| +| T Consensus 62 G~k~a~p~----------------~~Vv~~~--GDG~~~~~G~-~~l~~av~~~----~~it~iv~nN~~ygmTggQ~sp 118 (178) T cd02008 62 GMAKASED----------------KKVVAVI--GDSTFFHSGI-LGLINAVYNK----ANITVVILDNRTTAMTGGQPHP 118 (178) T ss_pred HHHHHCCC----------------CCEEEEE--CCHHHHHHHH-HHHHHHHHHC----CCEEEEEEECCCCCCCCCCCCC T ss_conf 78987899----------------8379995--7627766319-9999999829----7959999948530013675589 Q ss_pred ---HHCCCCCCCCHHH---HHHCCCEEEE-CCCCHHHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf ---1024677840011---2203405874-267889999999999999997389739986 Q gi|254781051|r 420 ---SSARSCTYASDIS---KSIGIPIFHV-NGDDPEAVIRVVRMAVSFRMKFHKSVVIDI 472 (957) Q Consensus 420 ---~d~RSs~Y~TDiA---K~i~aPI~HV-N~DdpEav~~~~~lA~~yR~~F~kDVvIDl 472 (957) ....+....-|++ +..++.-+.| +..|++....+.+-|+++ +-=.||+. T Consensus 119 ~~~~~~~~~~~~~d~~~~a~a~G~~~v~~~~~~d~~~~~~~l~~A~~~----~G~s~I~v 174 (178) T cd02008 119 GTGKTLTEPTTVIDIEALVRAIGVKRVVVVDPYDLKAIREELKEALAV----PGVSVIIA 174 (178) T ss_pred CCCCCCCCCCCCCCHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHCC----CCCEEEEE T ss_conf 988877787898798999997799999997844199999999999738----99789999 No 58 >PRK08199 acetolactate synthase 2 catalytic subunit; Validated Probab=92.82 E-value=0.83 Score=25.80 Aligned_cols=19 Identities=11% Similarity=0.267 Sum_probs=12.4 Q ss_pred HHHHHHHHHCCCCEEEEECC Q ss_conf 99999987379107998067 Q gi|254781051|r 251 EEIIRQGVQQGVDEMILGMA 270 (957) Q Consensus 251 ~~ii~~a~~~Gv~eiviGMa 270 (957) +.|++.-.+.||+.| +|.| T Consensus 6 ~~l~~~L~~~Gv~~v-Fg~p 24 (553) T PRK08199 6 QILVDALRANGVERV-FCVP 24 (553) T ss_pred HHHHHHHHHCCCCEE-EEEC T ss_conf 999999998799989-9968 No 59 >TIGR03393 indolpyr_decarb indolepyruvate decarboxylase, Erwinia family. A family of closely related, thiamine pyrophosphate-dependent enzymes includes indolepyruvate decarboxylase (EC 4.1.1.74), phenylpyruvate decarboxylase (EC 4.1.1.43), pyruvate decarboxylase (EC 4.1.1.1), branched-chain alpha-ketoacid decarboxylase, etc.. Members of this group of homologs may overlap in specificity. Within the larger family, this model represents a clade of bacterial indolepyruvate decarboxylases, part of a pathway for biosynthesis of the plant hormone indole-3-acetic acid. Typically, these species interact with plants, as pathogens or as beneficial, root-associated bacteria. Probab=92.57 E-value=0.71 Score=26.34 Aligned_cols=23 Identities=17% Similarity=0.228 Sum_probs=10.6 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 889999999999999997389739986 Q gi|254781051|r 446 DPEAVIRVVRMAVSFRMKFHKSVVIDI 472 (957) Q Consensus 446 dpEav~~~~~lA~~yR~~F~kDVvIDl 472 (957) .++.+.++...|+.-| +-|+|+| T Consensus 141 ~~~~i~~a~~~a~~~~----~Pv~l~i 163 (539) T TIGR03393 141 ATAEIDRVITTALRER----RPGYLML 163 (539) T ss_pred CHHHHHHHHHHHHHCC----CCEEEEE T ss_conf 1889999999998608----9758980 No 60 >PRK05858 hypothetical protein; Provisional Probab=92.35 E-value=0.94 Score=25.38 Aligned_cols=41 Identities=27% Similarity=0.442 Sum_probs=23.2 Q ss_pred HCCCCCCCCHHHH--HHHHHHHHHCCCEEEEECCCCCCCCCCCCEEE Q ss_conf 3343564211478--98889886358315785010022333230279 Q gi|254781051|r 598 IETGKGIDWSMAE--SLAFGSLCYEGYKVRLSGQDCERGTFSHRHAI 642 (957) Q Consensus 598 ~~~~~~iDW~~AE--~LA~~SLl~eG~~VRlsGQD~~RGTFshRHav 642 (957) +--|.++.|+-|+ +.+++..+ |.||--|-+ .||.|...|-. T Consensus 210 ii~G~gv~~~~a~~~l~~lae~l--g~Pv~tt~~--gkg~~p~~hpl 252 (543) T PRK05858 210 IMAGTNVWWGHAEAALLRLAEEL--HIPVLMNGM--ARGVVPADHPL 252 (543) T ss_pred EEECCCCCCCCHHHHHHHHHHHH--CCCEEECCC--CCCCCCCCCCH T ss_conf 99686422333399999999986--997787555--57768887704 No 61 >PRK06882 acetolactate synthase 3 catalytic subunit; Validated Probab=92.05 E-value=0.55 Score=27.18 Aligned_cols=45 Identities=11% Similarity=0.125 Sum_probs=18.3 Q ss_pred HHHHHHCCCCCCCCH--HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEE Q ss_conf 899873343564211--4789888988635831578501002233323027 Q gi|254781051|r 593 NRQKMIETGKGIDWS--MAESLAFGSLCYEGYKVRLSGQDCERGTFSHRHA 641 (957) Q Consensus 593 ~R~~~~~~~~~iDW~--~AE~LA~~SLl~eG~~VRlsGQD~~RGTFshRHa 641 (957) +++-.+--|.++.++ ..|..+|+-.+. .||=- .-.++|.|...|- T Consensus 206 A~rPvii~G~g~~~~~a~~~l~~lae~lg--~PV~t--t~~gkg~~p~~hp 252 (574) T PRK06882 206 AKKPVLFIGGGVITAECSEQLTQFAQRLN--LPVTS--SLMGLGAYPSTDK 252 (574) T ss_pred CCCCEEEECCCCCCHHHHHHHHHHHHHCC--CCEEE--HHCCCCCCCCCCC T ss_conf 79988997874344248999999998609--87131--2035777777776 No 62 >cd03371 TPP_PpyrDC Thiamine pyrophosphate (TPP) family, PpyrDC subfamily, TPP-binding module; composed of proteins similar to phosphonopyruvate decarboxylase (PpyrDC) proteins. PpyrDC is a homotrimeric enzyme which functions in the biosynthesis of C-P compounds such as bialaphos tripeptide in Streptomyces hygroscopicus. These proteins require TPP and divalent metal cation cofactors. Probab=91.80 E-value=1.1 Score=24.75 Aligned_cols=109 Identities=22% Similarity=0.287 Sum_probs=65.3 Q ss_pred HCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECC Q ss_conf 00445532367777641666544555654454047888625753334672588764510677443657999841602012 Q gi|254781051|r 337 FVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFT 416 (957) Q Consensus 337 aVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFT 416 (957) ..-|.++|.+-|+-+ ++|++ |-||.+|.=..=--+|+ --|..-=.+|||+||+---. T Consensus 52 ~a~p~AiG~ala~p~----------------r~Vv~--l~GDGs~lM~~geL~T~-----~~~~~~n~~~vv~nN~~~~s 108 (188) T cd03371 52 HASQIALGIALARPD----------------RKVVC--IDGDGAALMHMGGLATI-----GGLAPANLIHIVLNNGAHDS 108 (188) T ss_pred HHHHHHHHHHHHCCC----------------CEEEE--EECCHHHHHHHHHHHHH-----HHHCCCCEEEEEECCCCCCC T ss_conf 178999999983999----------------81899--95733888885799999-----97278877999987965223 Q ss_pred CC--CHHCCCCCCCCHHHHHHCC-CEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 47--0102467784001122034-05874267889999999999999997389739986421 Q gi|254781051|r 417 TN--PSSARSCTYASDISKSIGI-PIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCY 475 (957) Q Consensus 417 T~--p~d~RSs~Y~TDiAK~i~a-PI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~Y 475 (957) |- |..+- ..==..+|+..+. ..+.| +++|..-.+.+-|+ +-..-+|||+.|= T Consensus 109 ~g~q~t~~~-~~df~~iA~a~G~~~~~~v--~~~~el~~al~~al----~~~gP~~ieV~id 163 (188) T cd03371 109 VGGQPTVSF-DVSLPAIAKACGYRAVYEV--PSLEELVAALAKAL----AADGPAFIEVKVR 163 (188) T ss_pred CCCCCCCCC-CCCHHHHHHHCCCCCEEEE--CCHHHHHHHHHHHH----HCCCCEEEEEEEC T ss_conf 468766578-9899999998699827997--89999999999998----4899999999978 No 63 >PRK09259 putative oxalyl-CoA decarboxylase; Validated Probab=91.44 E-value=0.57 Score=27.02 Aligned_cols=37 Identities=24% Similarity=0.307 Sum_probs=22.1 Q ss_pred HHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 875214378269724889999999999743534886898 Q gi|254781051|r 750 FLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIM 788 (957) Q Consensus 750 fLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv 788 (957) |=+++.--...=..+++|+.+=.+|++-+. .++|-+| T Consensus 508 f~~lA~a~G~~g~~V~~~~eL~~Al~~A~~--~~~P~lI 544 (572) T PRK09259 508 YDKMMEAFGGVGYNVTTPDELRHALTEAIA--SGKPTLI 544 (572) T ss_pred HHHHHHHCCCEEEEECCHHHHHHHHHHHHH--CCCCEEE T ss_conf 999999689979997999999999999983--8994899 No 64 >PRK07064 hypothetical protein; Provisional Probab=91.42 E-value=0.88 Score=25.62 Aligned_cols=20 Identities=20% Similarity=0.237 Sum_probs=7.9 Q ss_pred CEEEEECCCHHHHHHHHHHH Q ss_conf 82697248899999999997 Q gi|254781051|r 758 NMYVANCTSPANYFHILRRQ 777 (957) Q Consensus 758 N~~V~~~ttpAq~FH~LRRQ 777 (957) ...-.-++||..+=-+|++= T Consensus 495 G~~g~rV~~~~eL~~al~~A 514 (544) T PRK07064 495 GLPHWRVTSADDFEAVLREA 514 (544) T ss_pred CCEEEEECCHHHHHHHHHHH T ss_conf 99799979999999999999 No 65 >PRK06276 acetolactate synthase catalytic subunit; Reviewed Probab=91.40 E-value=0.58 Score=27.01 Aligned_cols=113 Identities=20% Similarity=0.336 Sum_probs=51.5 Q ss_pred CCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCC Q ss_conf 73100044553236777764166654455565445404788862575333467258876451067744365799984160 Q gi|254781051|r 333 SHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQ 412 (957) Q Consensus 333 SHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQ 412 (957) +.+=.--|..+|-.-|..| ++|+ .|-||.+|-=-+---+|+--.+|| .+-||+||+ T Consensus 419 GsmG~glpaAiGA~lA~pd----------------r~Vv--~i~GDGsf~m~~qEL~Tavr~~lp------v~ivV~NN~ 474 (586) T PRK06276 419 GTMGFGFPAAIGAKVARPD----------------ANVI--AVTGDGGFLMNSQELATIAEYDIP------VIVVIFDNR 474 (586) T ss_pred CCCCCCHHHHHHHHHHCCC----------------CCEE--EEECCHHHHHHHHHHHHHHHHCCC------CEEEEEECC T ss_conf 7754547999999985899----------------9689--998856875509999999997889------489999787 Q ss_pred E-ECCCC----CHHCC-------CCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 2-01247----01024-------6778400112203405874267889999999999999997389739986421 Q gi|254781051|r 413 I-GFTTN----PSSAR-------SCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCY 475 (957) Q Consensus 413 I-GFTT~----p~d~R-------Ss~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~Y 475 (957) - |..-. ..+.| .++==..+|+.++++=++|+ +++..-.|.+=|+ ....-++||++.= T Consensus 475 ~yg~i~~~q~~~~~~~~~~~~~~~~pDf~~lA~a~G~~g~rV~--~~~el~~Al~~A~----~~~gP~lIeV~vD 543 (586) T PRK06276 475 TLGMVYQWQNLYYGKRQSAVHLGETPDFVKLAESYGVDALRIE--KPGEINEALKTAI----NSGEPYLLDIIID 543 (586) T ss_pred CCHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHCCCEEEEEC--CHHHHHHHHHHHH----HCCCCEEEEEEEC T ss_conf 2059999999862886665667999999999997799799979--9999999999998----0899789999956 No 66 >PRK08366 vorA 2-ketoisovalerate ferredoxin oxidoreductase subunit alpha; Reviewed Probab=91.25 E-value=1.3 Score=24.35 Aligned_cols=79 Identities=13% Similarity=0.124 Sum_probs=54.3 Q ss_pred CCEEEEECCHHHHHHHHHHHHCC--CCCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHH Q ss_conf 76899868857899999999769--9868999840148682899999985379983998067853178457899769999 Q gi|254781051|r 837 IRRVILCTGKVYYDLLDNRDMRN--IADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYLEKV 914 (957) Q Consensus 837 v~rvi~csGKvyydL~~~r~~~~--~~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl~~~ 914 (957) .+.||++-|-.+-...++-+... -..|.++|+-.|.|||.++|.+.++++..+=++=---++-..| +.|. -+... T Consensus 264 Ad~viV~~GS~~~~~~eaVd~lr~~G~kVG~l~vr~~rPFP~~~l~~~l~~~k~V~Vlern~~~g~~g-~l~~--~v~~a 340 (394) T PRK08366 264 ADFVFMGMGSLMGTVKEAVDLLRKEGYKVGYAKVRWFRPFPKEELYEIAESVKGIAVLDRNFSFGQEG-ILFT--EAKGA 340 (394) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEEEECCCCHHHHHHHHCCCCEEEEEECCCCCCCCC-CHHH--HHHHH T ss_conf 77999993751899999999999759866589998848999999999875799899965788988788-1599--99999 Q ss_pred HHHC Q ss_conf 8750 Q gi|254781051|r 915 LHSI 918 (957) Q Consensus 915 l~~~ 918 (957) |... T Consensus 341 l~~~ 344 (394) T PRK08366 341 LYNT 344 (394) T ss_pred HHCC T ss_conf 8568 No 67 >PRK07449 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylic acid synthase/2-oxoglutarate decarboxylase; Validated Probab=91.25 E-value=1.3 Score=24.34 Aligned_cols=20 Identities=15% Similarity=0.185 Sum_probs=14.9 Q ss_pred HHHHHHHHHCCCCEEEEECCC Q ss_conf 999999873791079980676 Q gi|254781051|r 251 EEIIRQGVQQGVDEMILGMAH 271 (957) Q Consensus 251 ~~ii~~a~~~Gv~eiviGMaH 271 (957) +.|++.-..+||+.|+ |+|= T Consensus 13 ~~lv~~L~~~GV~~vf-g~PG 32 (548) T PRK07449 13 AVILEELTRLGVRHVV-IAPG 32 (548) T ss_pred HHHHHHHHHCCCCEEE-ECCC T ss_conf 9999999987998999-8688 No 68 >cd00956 Transaldolase_FSA Transaldolase-like fructose-6-phosphate aldolases (FSA) found in bacteria and archaea, which are member of the MipB/TalC subfamily of class I aldolases. FSA catalyze an aldol cleavage of fructose 6-phosphate and do not utilize fructose, fructose 1-phosphate, fructose 1,6-phosphate, or dihydroxyacetone phosphate. The enzymes belong to the transaldolase family that serves in transfer reactions in the pentose phosphate cycle, and are more distantly related to fructose 1,6-bisphosphate aldolase. Probab=91.08 E-value=1 Score=25.08 Aligned_cols=100 Identities=20% Similarity=0.225 Sum_probs=58.1 Q ss_pred ECCCCCH-HCCCCC-----CCCHHHHHHCCCE-EEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECC---------- Q ss_conf 0124701-024677-----8400112203405-8742678899999999999999973897399864211---------- Q gi|254781051|r 414 GFTTNPS-SARSCT-----YASDISKSIGIPI-FHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYR---------- 476 (957) Q Consensus 414 GFTT~p~-d~RSs~-----Y~TDiAK~i~aPI-~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YR---------- 476 (957) |+||||. -.|+.. +..++++.+.-|| +.|.|||+|..+.-++.-. ++..+|+|-+=|-. T Consensus 23 GvTTNPsll~k~g~~~~~~~~~~i~~~~~~~ls~qv~~~~~~~m~~~a~~l~----~~~~ni~vKIP~t~~gl~ai~~L~ 98 (211) T cd00956 23 GVTTNPSLIAKSGRIDFEAVLKEICEIIDGPVSAQVVSTDAEGMVAEARKLA----SLGGNVVVKIPVTEDGLKAIKKLS 98 (211) T ss_pred EECCCHHHHHHCCCCCHHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHHH----HHCCCEEEEECCCHHHHHHHHHHH T ss_conf 5826889998749999999999999854998899997387999999999999----738957999268565999999999 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 25656435422355788888671899899999998644898 Q gi|254781051|r 477 RFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVI 517 (957) Q Consensus 477 r~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~i 517 (957) +.|..=.=---||-...+...+.-.+....|..++-+.|.- T Consensus 99 ~~gi~~n~Tav~s~~Qa~~Aa~aga~yvspf~GRi~d~G~d 139 (211) T cd00956 99 EEGIKTNVTAIFSAAQALLAAKAGATYVSPFVGRIDDLGGD 139 (211) T ss_pred HCCCCEEEEEECCHHHHHHHHHCCCCEEEEECCCHHHCCCC T ss_conf 85997677750689999999987997886303407545898 No 69 >TIGR03394 indol_phenyl_DC indolepyruvate/phenylpyruvate decarboxylase, Azospirillum family. A family of closely related, thiamine pyrophosphate-dependent enzymes includes indolepyruvate decarboxylase (EC 4.1.1.74), phenylpyruvate decarboxylase (EC 4.1.1.43), pyruvate decarboxylase (EC 4.1.1.1), branched-chain alpha-ketoacid decarboxylase, etc.. Members of this group of homologs may overlap in specificity. This model represents a clade that includes a Azospirillum brasilense member active as both phenylpyruvate decarboxylase and indolepyruvate decarboxylase. Probab=90.65 E-value=0.93 Score=25.45 Aligned_cols=25 Identities=24% Similarity=0.380 Sum_probs=14.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 78899999999999999973897399864 Q gi|254781051|r 445 DDPEAVIRVVRMAVSFRMKFHKSVVIDIV 473 (957) Q Consensus 445 DdpEav~~~~~lA~~yR~~F~kDVvIDlv 473 (957) +-|+.+.++.+.|++ + .+-|+|++= T Consensus 139 ~~~~~~~~a~~~A~~-~---~~Pv~l~iP 163 (535) T TIGR03394 139 TAPAEIARVLGSARE-L---SRPVYLEIP 163 (535) T ss_pred HHHHHHHHHHHHHHH-C---CCCEEEEEC T ss_conf 789999999999985-4---798599936 No 70 >PRK09124 pyruvate dehydrogenase; Provisional Probab=90.59 E-value=0.38 Score=28.36 Aligned_cols=37 Identities=11% Similarity=0.168 Sum_probs=19.6 Q ss_pred HHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEE Q ss_conf 887521437826972488999999999974353488689 Q gi|254781051|r 749 RFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLI 787 (957) Q Consensus 749 RfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLi 787 (957) .|-.++.--...-.-.++|+++=.+|+|=+.. .+|-+ T Consensus 488 Dfa~lA~a~G~~g~rV~~~~el~~al~~Al~~--~gP~l 524 (574) T PRK09124 488 DFAAIANAAGITGIRVEKASELDGALQRAFAH--DGPAL 524 (574) T ss_pred CHHHHHHHCCCEEEEECCHHHHHHHHHHHHHC--CCCEE T ss_conf 99999997799799979999999999999818--99799 No 71 >PRK08979 acetolactate synthase 3 catalytic subunit; Validated Probab=90.56 E-value=0.56 Score=27.11 Aligned_cols=15 Identities=20% Similarity=0.204 Sum_probs=5.8 Q ss_pred EECCCHHHHHHHHHH Q ss_conf 724889999999999 Q gi|254781051|r 762 ANCTSPANYFHILRR 776 (957) Q Consensus 762 ~~~ttpAq~FH~LRR 776 (957) +-.+||+++--+|+| T Consensus 516 ~rV~~~~el~~Al~~ 530 (572) T PRK08979 516 MTISDPAELESGLAK 530 (572) T ss_pred EEECCHHHHHHHHHH T ss_conf 997999999999999 No 72 >PRK08327 acetolactate synthase catalytic subunit; Validated Probab=90.44 E-value=1.5 Score=23.82 Aligned_cols=14 Identities=21% Similarity=0.183 Sum_probs=5.6 Q ss_pred HHHHHHHHHCCCCE Q ss_conf 99999987379107 Q gi|254781051|r 251 EEIIRQGVQQGVDE 264 (957) Q Consensus 251 ~~ii~~a~~~Gv~e 264 (957) +.|++.-.+.||+. T Consensus 11 ~~l~~~L~~~GV~~ 24 (568) T PRK08327 11 ALLLELLKELGVDY 24 (568) T ss_pred HHHHHHHHHCCCCE T ss_conf 99999999879999 No 73 >PRK08617 acetolactate synthase; Reviewed Probab=90.38 E-value=0.57 Score=27.05 Aligned_cols=15 Identities=13% Similarity=0.228 Sum_probs=6.7 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 488999999999974 Q gi|254781051|r 764 CTSPANYFHILRRQI 778 (957) Q Consensus 764 ~ttpAq~FH~LRRQ~ 778 (957) +..|...++.|++.+ T Consensus 364 ~~~p~~~~~~l~~~l 378 (552) T PRK08617 364 LLHPLRIIRALQDIV 378 (552) T ss_pred CCCHHHHHHHHHHHC T ss_conf 729999999999856 No 74 >TIGR00553 pabB para-aminobenzoate synthase, component I; InterPro: IPR005802 Para-aminobenzoate synthase component I (pabB) is closely related to anthranilate synthase component I (trpE). The apparent orthologs of pabB in Aquifex aeolicus and Helicobacter pylori score well below most TrpE proteins because of a different architecture, in which the less strongly conserved N-terminal domain is absent. The noise cut off is set to exclude most examples of TrpE.; GO: 0009396 folic acid and derivative biosynthetic process. Probab=90.37 E-value=0.081 Score=33.43 Aligned_cols=35 Identities=23% Similarity=0.409 Sum_probs=30.3 Q ss_pred CCCEEECCCCCCCCCCCHHHHHHHHHH----HHCCCEEE Q ss_conf 171572566578777789999999998----40643027 Q gi|254781051|r 145 YDRKICMKGVLGLESATIPEIVDVLSH----LYCSNIGV 179 (957) Q Consensus 145 Ld~~f~~~~~~g~~~~tL~eIi~~L~~----tYCgsIGv 179 (957) |.--|+.|.+.|.++.--.|||+.||+ +|||+||+ T Consensus 268 ~~AlFP~GSiTGAPK~RAMeII~elE~~~RG~YcGaiG~ 306 (368) T TIGR00553 268 FRALFPGGSITGAPKLRAMEIIKELEKQPRGVYCGAIGY 306 (368) T ss_pred HHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCCHHHHHH T ss_conf 985178887358146899999998636888951001330 No 75 >PRK09107 acetolactate synthase 3 catalytic subunit; Validated Probab=90.34 E-value=0.67 Score=26.49 Aligned_cols=38 Identities=16% Similarity=0.424 Sum_probs=24.0 Q ss_pred HHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 8875214378269724889999999999743534886898 Q gi|254781051|r 749 RFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIM 788 (957) Q Consensus 749 RfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv 788 (957) .|-.|+.--...=.-.++|+++--+|+|=+ .-.+|-+| T Consensus 511 Df~~lA~a~G~~g~rV~~~~el~~Al~~Al--~~~gP~lI 548 (594) T PRK09107 511 DFVKLAEAYGAVGIRCEKPGDLDDAIQEMI--DVDKPVIF 548 (594) T ss_pred CHHHHHHHCCCEEEEECCHHHHHHHHHHHH--HCCCCEEE T ss_conf 999999977997999799999999999998--28998899 No 76 >PRK06456 acetolactate synthase catalytic subunit; Reviewed Probab=90.25 E-value=1 Score=25.13 Aligned_cols=45 Identities=22% Similarity=0.385 Sum_probs=17.9 Q ss_pred HHHHHHCCCCCCCCH--HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEE Q ss_conf 899873343564211--4789888988635831578501002233323027 Q gi|254781051|r 593 NRQKMIETGKGIDWS--MAESLAFGSLCYEGYKVRLSGQDCERGTFSHRHA 641 (957) Q Consensus 593 ~R~~~~~~~~~iDW~--~AE~LA~~SLl~eG~~VRlsGQD~~RGTFshRHa 641 (957) +++-.+--|.++.|. ..|..+++-.+. .||=-| =..||+|...|- T Consensus 207 AkrPvIl~G~G~~~~~a~~~l~~lae~l~--~Pv~tT--~~gkg~~p~~hp 253 (572) T PRK06456 207 AERPIILVGTGVVWSNATPEVLELAELLH--IPIVST--FPGKSAIPHDHP 253 (572) T ss_pred CCCCEEEEECCCCCCCCHHHHHHHHHHHC--CCCEEC--CCCCCCCCCCCC T ss_conf 89988999558241436499999999839--981220--145555345786 No 77 >cd02001 TPP_ComE_PpyrDC Thiamine pyrophosphate (TPP) family, ComE and PpyrDC subfamily, TPP-binding module; composed of proteins similar to sulfopyruvate decarboxylase beta subunit (ComE) and phosphonopyruvate decarboxylase (Ppyr decarboxylase). Methanococcus jannaschii sulfopyruvate decarboxylase (ComDE) is a dodecamer of six alpha (D) subunits and six (E) beta subunits which, catalyzes the decarboxylation of sulfopyruvic acid to sulfoacetaldehyde in the coenzyme M pathway. Ppyr decarboxylase is a homotrimeric enzyme which functions in the biosynthesis of C-P compounds such as bialaphos tripeptide in Streptomyces hygroscopicus. Ppyr decarboxylase and ComDE require TPP and divalent metal cation cofactors. Probab=90.23 E-value=1.6 Score=23.70 Aligned_cols=104 Identities=11% Similarity=0.120 Sum_probs=66.9 Q ss_pred HCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHH---HCCCCCCCCCEEEEEECCCE Q ss_conf 004455323677776416665445556544540478886257533346725887645---10677443657999841602 Q gi|254781051|r 337 FVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGL---SGLSGYTVAGNIHLIINNQI 413 (957) Q Consensus 337 aVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~l---s~l~gY~tGGTiHiVvNNQI 413 (957) ...|+++|.+-|+. ++|++ |-||++|- |+| +-+..|.-=--+|||+||+. T Consensus 46 ~~~~~alG~al~~p-----------------~kVi~--idGDGs~l--------M~lg~LaTi~~~~~~nl~~ivlnN~~ 98 (157) T cd02001 46 LAGSIGLGLALGLS-----------------RKVIV--VDGDGSLL--------MNPGVLLTAGEFTPLNLILVVLDNRA 98 (157) T ss_pred HHHHHHHHHHHCCC-----------------CCEEE--EECCHHHH--------HCCCHHHHHHHCCCCCEEEEEEECCC T ss_conf 28899999985389-----------------96899--95755887--------52461999986389887999997975 Q ss_pred ECCC-C-CHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 0124-7-01024677840011220340587426788999999999999999738973998642 Q gi|254781051|r 414 GFTT-N-PSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVC 474 (957) Q Consensus 414 GFTT-~-p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~ 474 (957) ==.| . |..+ +..==..|||..+...++| +++|....+.+-++ .-..-++|++.+ T Consensus 99 ~~s~GgQ~t~~-~~~df~~iA~a~G~~~~~v--~~~~el~~al~~~l----~~~GP~fIeV~I 154 (157) T cd02001 99 YGSTGGQPTPS-SNVNLEAWAAACGYLVLSA--PLLGGLGSEFAGLL----ATTGPTLLHAPI 154 (157) T ss_pred CCCCCCCCCCC-CCCCHHHHHHHCCCCEEEE--CCHHHHHHHHHHHH----CCCCCEEEEEEE T ss_conf 32467856754-4669999999869857995--89999999999998----499998999995 No 78 >PRK06546 pyruvate dehydrogenase; Provisional Probab=89.69 E-value=0.99 Score=25.24 Aligned_cols=71 Identities=20% Similarity=0.262 Sum_probs=33.9 Q ss_pred HHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHH Q ss_conf 01122034058742678899999999999999973897399864211256564354223557888886718998999999 Q gi|254781051|r 430 DISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYAD 509 (957) Q Consensus 430 DiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~~ 509 (957) .+|+.++++=+.| ++|+....|.+=|++- .+-++||++.= -||...|..+.-..-+ =..+|.- T Consensus 491 ~lA~a~G~~g~rV--~~~~el~~al~~Al~~----~gP~lIeV~vD----p~~~~~p~~~~~~~~~-------~~~~~~~ 553 (578) T PRK06546 491 AIAAALGIHAVRV--EDPADIRGALKEAFAH----DGPALVDVVTD----PNALSIPPKITGEQVM-------GFALAAS 553 (578) T ss_pred HHHHHCCCEEEEE--CCHHHHHHHHHHHHHC----CCCEEEEEEEC----CCCCCCCCCCCHHHHH-------HHHHHHH T ss_conf 9999779989997--9999999999999818----99689999838----9987889988889999-------9999999 Q ss_pred HHHHCCCC Q ss_conf 98644898 Q gi|254781051|r 510 SLMKNQVI 517 (957) Q Consensus 510 ~Li~~g~i 517 (957) |++-.|-. T Consensus 554 ~~~~~~~~ 561 (578) T PRK06546 554 KIVLNGGV 561 (578) T ss_pred HHHHCCCC T ss_conf 99975872 No 79 >PRK01362 putative translaldolase; Provisional Probab=89.56 E-value=1.8 Score=23.34 Aligned_cols=58 Identities=31% Similarity=0.332 Sum_probs=39.8 Q ss_pred ECCCCCHH-CCCCCCC----CHHHHHHCCCE-EEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 01247010-2467784----00112203405-874267889999999999999997389739986421 Q gi|254781051|r 414 GFTTNPSS-ARSCTYA----SDISKSIGIPI-FHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCY 475 (957) Q Consensus 414 GFTT~p~d-~RSs~Y~----TDiAK~i~aPI-~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~Y 475 (957) |+||||.- .|+..=+ .+|.+.++-|| +.|.|||++..+.-++-- + +++++|+|-+=|- T Consensus 24 GvTTNPsll~k~g~~~~~~~~~i~~~~~~~is~ev~~~~~~~m~~qA~~l---~-~~~~nv~VKIP~t 87 (214) T PRK01362 24 GVTTNPSLIAKEGRDFKEVIKEICSIVDGPVSAEVIALDAEGMIKEGREL---A-KIAPNVVVKIPMT 87 (214) T ss_pred EEECCHHHHHHCCCCHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHH---H-HHCCCEEEECCCC T ss_conf 69589899987599989999999873179757996364199999999999---9-8487769981896 No 80 >PRK06725 acetolactate synthase 3 catalytic subunit; Validated Probab=89.27 E-value=1.2 Score=24.53 Aligned_cols=46 Identities=15% Similarity=0.205 Sum_probs=19.8 Q ss_pred HHHHHHCCCCCCCCH--HHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEEE Q ss_conf 899873343564211--47898889886358315785010022333230279 Q gi|254781051|r 593 NRQKMIETGKGIDWS--MAESLAFGSLCYEGYKVRLSGQDCERGTFSHRHAI 642 (957) Q Consensus 593 ~R~~~~~~~~~iDW~--~AE~LA~~SLl~eG~~VRlsGQD~~RGTFshRHav 642 (957) +++-.+--|.++.++ ..|+++|+-.+ |.||=-| =.+||.|...|.- T Consensus 214 AkrPvil~G~Gv~~~~a~~~l~~lae~~--~~PV~tt--~~gkg~~~~~hpl 261 (570) T PRK06725 214 AKRPLLYIGGGVIHSGGSDELIKFAREN--RIPVVST--LMGLGAYPPGDPL 261 (570) T ss_pred CCCCEEECCCCCCCCCHHHHHHHHHHHH--CCCEEEC--CCCCCCCCCCCCC T ss_conf 6898798367600464899999999986--9989942--4568889999855 No 81 >PRK09622 porA pyruvate flavodoxin oxidoreductase subunit alpha; Reviewed Probab=88.72 E-value=2 Score=22.91 Aligned_cols=81 Identities=15% Similarity=0.144 Sum_probs=55.7 Q ss_pred HHCCEEEEECCHHHHHHHHHHHH---CCCCCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHH Q ss_conf 45768998688578999999997---699868999840148682899999985379983998067853178457899769 Q gi|254781051|r 835 SHIRRVILCTGKVYYDLLDNRDM---RNIADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYL 911 (957) Q Consensus 835 ~~v~rvi~csGKvyydL~~~r~~---~~~~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl 911 (957) +..+.||+|-|-.+=...++-+. +|. .|.++|+--+-|||.++|.+.|+ +++-|=|=|--+.-|+..-+..-+ T Consensus 266 eDAe~viV~~GS~~~~~~~AVd~lR~~G~-KVGlir~r~~rPFP~~~l~~al~---~~K~V~Vldr~~~~g~~g~l~~~v 341 (407) T PRK09622 266 EDAEVAIFALGTTVESAIVAAKEMRKKGI-KAGVATIRVLRPFPYERLGQALK---NLKALAILDRSSPAGAMGALFNEV 341 (407) T ss_pred CCCCEEEEEECCCHHHHHHHHHHHHHCCC-EEEEEEEEEECCCCHHHHHHHHC---CCCEEEEEECCCCCCCCCHHHHHH T ss_conf 98779999807868999999999997498-67589999858999999999875---799899997889988877379999 Q ss_pred HHHHHHCC Q ss_conf 99987505 Q gi|254781051|r 912 EKVLHSIG 919 (957) Q Consensus 912 ~~~l~~~~ 919 (957) ...|.... T Consensus 342 ~~al~~~~ 349 (407) T PRK09622 342 TSAVYQTQ 349 (407) T ss_pred HHHHHHCC T ss_conf 99998527 No 82 >TIGR03254 oxalate_oxc oxalyl-CoA decarboxylase. In a number of bacteria, including Oxalobacter formigenes from the human gut, a two-gene operon of oxc (oxalyl-CoA decarboxylase) and frc (formyl-CoA transferase) encodes a system for degrading and therefore detoxifying oxalate. Members of this family are the thiamine pyrophosphate (TPP)-containing enzyme oxalyl-CoA decarboxylase. Probab=87.72 E-value=1.8 Score=23.32 Aligned_cols=25 Identities=20% Similarity=0.102 Sum_probs=10.0 Q ss_pred HHCCCCCEEEEECCCHHHHHHHHHH Q ss_conf 5214378269724889999999999 Q gi|254781051|r 752 QMCAENNMYVANCTSPANYFHILRR 776 (957) Q Consensus 752 Ql~a~~N~~V~~~ttpAq~FH~LRR 776 (957) .++.--...=..++||+.+=.+|.+ T Consensus 499 ~lA~a~G~~g~~V~~~~eL~~al~~ 523 (554) T TIGR03254 499 KMMKAFGGVGYNVTTPDELKAALNE 523 (554) T ss_pred HHHHHCCCEEEEECCHHHHHHHHHH T ss_conf 9998789989997999999999999 No 83 >PRK08367 porA pyruvate ferredoxin oxidoreductase subunit alpha; Reviewed Probab=87.58 E-value=2.3 Score=22.41 Aligned_cols=80 Identities=14% Similarity=0.163 Sum_probs=55.3 Q ss_pred HCCEEEEECCHHHHHHHHHHHHCCC--CCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHH Q ss_conf 5768998688578999999997699--86899984014868289999998537998399806785317845789976999 Q gi|254781051|r 836 HIRRVILCTGKVYYDLLDNRDMRNI--ADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYLEK 913 (957) Q Consensus 836 ~v~rvi~csGKvyydL~~~r~~~~~--~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl~~ 913 (957) ..+.||+|-|-.+=...++.+.... ..|-++|+--+.|||.++|++.+++ ++-|=|=|--...|++-=+..-+.. T Consensus 261 DAe~viV~~GS~~~~~k~AVd~lR~~G~KVGllr~r~~rPFP~e~l~~~~k~---~k~V~Vldr~~s~g~~g~l~~ev~~ 337 (395) T PRK08367 261 DAEIIFVTMGSLAGTLKEWVDKKREEGYKVGAAKITVYRPFPVEEIRELAKK---AKVLAFLEKNITFGLYGAVFQDASA 337 (395) T ss_pred CCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEEEECCCCHHHHHHHHCC---CCEEEEEECCCCCCCCCCHHHHHHH T ss_conf 9889999817428999999999997498576899888589999999998657---9889998688998889815999999 Q ss_pred HHHHC Q ss_conf 98750 Q gi|254781051|r 914 VLHSI 918 (957) Q Consensus 914 ~l~~~ 918 (957) .|... T Consensus 338 ~l~~~ 342 (395) T PRK08367 338 ALVNE 342 (395) T ss_pred HHHCC T ss_conf 98478 No 84 >cd03375 TPP_OGFOR Thiamine pyrophosphate (TPP family), 2-oxoglutarate ferredoxin oxidoreductase (OGFOR) subfamily, TPP-binding module; OGFOR catalyzes the oxidative decarboxylation of 2-oxo-acids, with ferredoxin acting as an electron acceptor. In the TCA cycle, OGFOR catalyzes the oxidative decarboxylation of 2-oxoglutarate to succinyl-CoA. In the reductive tricarboxylic acid cycle found in the anaerobic autotroph Hydrogenobacter thermophilus, OGFOR catalyzes the reductive carboxylation of succinyl-CoA to produce 2-oxoglutarate. Thauera aromatica OGFOR has been shown to provide reduced ferredoxin to benzoyl-CoA reductase, a key enzyme in the anaerobic metabolism of aromatic compounds. OGFOR is dependent on TPP and a divalent metal cation for activity. Probab=87.44 E-value=2.4 Score=22.35 Aligned_cols=160 Identities=17% Similarity=0.237 Sum_probs=91.4 Q ss_pred EECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEE--ECCCEEEEEECCCCCCCHHCCHHHHH Q ss_conf 8067655268999885379899999855887432234555100150268898--35961899971787731000445532 Q gi|254781051|r 267 LGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQ--ICGKDVKLLLQSNPSHLEFVDPVVIG 344 (957) Q Consensus 267 iGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~--~~g~~v~lsL~~NPSHLEaVnPVv~G 344 (957) -|-+||+=++.|...+.+ .|..+++....+| .|.++... .+-..+|.. ....-|++.| T Consensus 3 pGCgh~~~~~~~~~a~~~---------l~i~~~~~v~vs~----iGC~~~~~~~~~~~~~~~~-------hG~a~~~A~G 62 (193) T cd03375 3 PGCGDGSILKALAKALAE---------LGIDPEKVVVVSG----IGCSSRLPYYFNTYGFHTL-------HGRALAVATG 62 (193) T ss_pred CCCCCHHHHHHHHHHHHH---------HCCCCCCEEEEEC----CCCCCCCCCHHHCCCCCHH-------HCCCHHHHHH T ss_conf 988878999999999998---------1899667899958----7746766461115865202-------1160888867 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE-ECCC------ Q ss_conf 367777641666544555654454047888625753334672588764510677443657999841602-0124------ Q gi|254781051|r 345 SVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI-GFTT------ 417 (957) Q Consensus 345 ~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI-GFTT------ 417 (957) ..+|+.| .+| |.+-||..|..=|+- +.+++. ++--.-+-||+||++ |-|- T Consensus 63 ~k~a~p~----------------~~V--va~~GDG~~~~iG~~-~l~~A~----~rn~~i~~iv~DN~~ygmTGgQ~spt 119 (193) T cd03375 63 VKLANPD----------------LTV--IVVSGDGDLAAIGGN-HFIHAA----RRNIDITVIVHNNQIYGLTKGQASPT 119 (193) T ss_pred HHHHCCC----------------CEE--EEEEECCHHHHCCHH-HHHHHH----HHCCCEEEEEECCCCEECCCCCCCCC T ss_conf 9875999----------------739--999706557645618-899999----80997499998584044236834889 Q ss_pred CCHHC--CCCCC--------CCHHHHHHCCC-EEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 70102--46778--------40011220340-58742678899999999999999973897399864 Q gi|254781051|r 418 NPSSA--RSCTY--------ASDISKSIGIP-IFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIV 473 (957) Q Consensus 418 ~p~d~--RSs~Y--------~TDiAK~i~aP-I~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv 473 (957) .|... .+|+| .-.|++..++| |--++..||.....+.+-|+++ +-=-+|+.. T Consensus 120 T~~g~~t~ttp~G~~~~~~d~~~ia~a~G~~yVa~~~~~~~~~l~~~ik~Al~~----~G~s~I~v~ 182 (193) T cd03375 120 TPEGFKTKTTPYGNIEEPFNPLALALAAGATFVARGFSGDIKQLKEIIKKAIQH----KGFSFVEVL 182 (193) T ss_pred CCCCCEECCCCCCCCCCCCCHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHC----CCCEEEEEE T ss_conf 889855413677876578899999987599879996579999999999999808----998899998 No 85 >PRK08978 acetolactate synthase 2 catalytic subunit; Reviewed Probab=87.18 E-value=2 Score=22.89 Aligned_cols=28 Identities=21% Similarity=0.266 Sum_probs=11.9 Q ss_pred CCHHHHHHHHHHHHHHHHHHC-CCEEEEE Q ss_conf 788999999999999999738-9739986 Q gi|254781051|r 445 DDPEAVIRVVRMAVSFRMKFH-KSVVIDI 472 (957) Q Consensus 445 DdpEav~~~~~lA~~yR~~F~-kDVvIDl 472 (957) ++|+.+..+.+-|+..-..-+ .-|+|+| T Consensus 129 ~~~~~i~~~l~~A~~~A~~g~~GPV~l~i 157 (548) T PRK08978 129 QSLEELPRIMAEAFEIASSGRPGPVLVDI 157 (548) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCEEEEC T ss_conf 89999899999999998538997569865 No 86 >PRK07710 acetolactate synthase catalytic subunit; Reviewed Probab=86.98 E-value=2.5 Score=22.17 Aligned_cols=24 Identities=17% Similarity=0.375 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 9999999999987379107998067 Q gi|254781051|r 246 IIPAIEEIIRQGVQQGVDEMILGMA 270 (957) Q Consensus 246 lIP~L~~ii~~a~~~Gv~eiviGMa 270 (957) |+-+=|.|++.-.+.||+. |+|++ T Consensus 15 ~~tGa~~l~~~L~~~GV~~-vFGvp 38 (571) T PRK07710 15 LATGAQLLLEALEKEGVEV-IFGYP 38 (571) T ss_pred CCCHHHHHHHHHHHCCCCE-EEECC T ss_conf 4749999999999879999-99847 No 87 >PRK12656 fructose-6-phosphate aldolase; Reviewed Probab=85.53 E-value=2.2 Score=22.57 Aligned_cols=58 Identities=24% Similarity=0.372 Sum_probs=40.0 Q ss_pred ECCCCCHH-CCCCC-----CCCHHHHHHCC--CE-EEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 01247010-24677-----84001122034--05-87426788999999999999999738973998642 Q gi|254781051|r 414 GFTTNPSS-ARSCT-----YASDISKSIGI--PI-FHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVC 474 (957) Q Consensus 414 GFTT~p~d-~RSs~-----Y~TDiAK~i~a--PI-~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~ 474 (957) |+||||.- .|+.. +..+|++.+.- || +-|-|||+|..+.-+ .+++..++++|+|-+=+ T Consensus 24 GvTTNPsll~k~g~~d~~~~l~~i~~~i~~~~~ls~qv~~~d~~~mi~~a---~~i~~~~~~nv~VKIP~ 90 (222) T PRK12656 24 GVTSNPSIAKKEGDIDFFERIREVREIIGDEASIHVQVVAQDYEGILKDA---AEIRRQCGDDVYIKVPV 90 (222) T ss_pred EECCCHHHHHHCCCCCHHHHHHHHHHHHCCCCCEEEEEEECCHHHHHHHH---HHHHHHCCCCEEEECCC T ss_conf 57589999987699888999999999829998889997639899999999---99998557678996688 No 88 >PRK06965 acetolactate synthase 3 catalytic subunit; Validated Probab=85.41 E-value=3 Score=21.62 Aligned_cols=62 Identities=11% Similarity=0.189 Sum_probs=35.9 Q ss_pred HHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCE---EEECCCH-HHCCCCCCCCHHHHCCCC Q ss_conf 88752143782697248899999999997435348868---9856602-210801268878937998 Q gi|254781051|r 749 RFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPL---IMMAPKS-LLRHKRVVSSLSDMTCGS 811 (957) Q Consensus 749 RfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPL---iv~tPKs-LLR~~~a~S~l~d~~~g~ 811 (957) .|-.++.--...-.-.+||+++--+|++=+. .--+|. |+.-|+. ++=.=.+-++++||.-|+ T Consensus 519 df~~lA~a~G~~g~~V~~~~eL~~al~~Al~-~~~~p~lidV~iDp~~~~~P~~~~g~~~~~~~~~~ 584 (587) T PRK06965 519 DFVKLAEAYGHVGMRIEKTSDVEPALKEALR-LKDRTVFLDFQTDPTENVWPMVQAGKGITEMLLGS 584 (587) T ss_pred CHHHHHHHCCCEEEEECCHHHHHHHHHHHHH-CCCCCEEEEEEECCCCCCCCCCCCCCCHHHHCCCC T ss_conf 9999999779989997999999999999986-48980999999789888789799999875623687 No 89 >PRK12474 hypothetical protein; Provisional Probab=84.75 E-value=3.2 Score=21.40 Aligned_cols=19 Identities=11% Similarity=0.261 Sum_probs=11.9 Q ss_pred HHHHHHHHHCCCCEEEEECC Q ss_conf 99999987379107998067 Q gi|254781051|r 251 EEIIRQGVQQGVDEMILGMA 270 (957) Q Consensus 251 ~~ii~~a~~~Gv~eiviGMa 270 (957) |.+++.-.+.||+. |+|++ T Consensus 9 ~~lv~~L~~~GV~~-vFGip 27 (518) T PRK12474 9 DSVVDTLLNCGVEV-CFANP 27 (518) T ss_pred HHHHHHHHHCCCCE-EEECC T ss_conf 99999999879999-99898 No 90 >CHL00099 ilvB acetohydroxyacid synthase large subunit Probab=84.15 E-value=2.2 Score=22.62 Aligned_cols=38 Identities=13% Similarity=0.224 Sum_probs=24.6 Q ss_pred HHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 8875214378269724889999999999743534886898 Q gi|254781051|r 749 RFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIM 788 (957) Q Consensus 749 RfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv 788 (957) .|-+++.--...=.-.+||+++--+|++=+. -.+|-+| T Consensus 509 Df~~lA~a~G~~g~rV~~~~el~~Al~~Al~--~~gP~lI 546 (588) T CHL00099 509 DFVKLAEAFGIKGITIKSRKDLKSKLQRALD--YDGPVLV 546 (588) T ss_pred CHHHHHHHCCCEEEEECCHHHHHHHHHHHHC--CCCCEEE T ss_conf 9999999779989997999999999999982--9997899 No 91 >KOG1185 consensus Probab=84.02 E-value=3.4 Score=21.18 Aligned_cols=108 Identities=15% Similarity=0.182 Sum_probs=47.4 Q ss_pred HHHHHCCCCCCCCCCEEECCCCCCCCCCCHHH-HHHHHHHHHHH---HCCCCCCEEECCCCCCCC---CCCCCCCC---- Q ss_conf 34320221138653111000024424541022-56554202576---233368135436888888---87656440---- Q gi|254781051|r 678 LGFEYGYSLNNLNALTIWEAQFGDFANGAQVI-LDQFITAGEQK---WLCTSNLVCLLPHGYEGQ---GPEHSSAR---- 746 (957) Q Consensus 678 lgFEyGys~~~p~~L~iWEAQFGDF~NgAQvi-iDqfiss~e~K---W~~~sglvllLPHGyeGq---GPEHSSaR---- 746 (957) +||-.+-.+..|+..|+ .=+||=|-|=--| +|+|+-.--.- =.--||+-=|.+||++-- -|++-+.- T Consensus 436 ~Gfalaaa~~~P~~~V~--~veGDsaFGfSaME~ET~vR~~Lpvv~vV~NN~Giyg~d~~~~~~I~e~~~~~~~p~~~l~ 513 (571) T KOG1185 436 LGFALAAALAAPDRKVV--CVEGDSAFGFSAMELETFVRYKLPVVIVVGNNNGIYGLDDDGWKQISEQDPTLDLPPTALL 513 (571) T ss_pred HHHHHHHHHHCCCCEEE--EEECCCCCCCCHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCHHHHHHCCCCCCCCCCCCC T ss_conf 15777877627997389--9964765571056588888716876999962886346676448888516855689840124 Q ss_pred -HHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEE Q ss_conf -578875214378269724889999999999743534886898 Q gi|254781051|r 747 -LERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIM 788 (957) Q Consensus 747 -lERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv 788 (957) =-||=|+|----+.=-.++|+-.+-|.|+.+ .+.-++|-|| T Consensus 514 ~~~rY~~v~ka~G~kG~~v~t~~el~~~l~~a-~q~~~~psvI 555 (571) T KOG1185 514 ANTRYDKVAKAFGGKGYFVSTVEELLAALQQA-CQDTDKPSVI 555 (571) T ss_pred CCCCHHHHHHHCCCCCEEECCHHHHHHHHHHH-HHCCCCCEEE T ss_conf 43569999987188863647899999999999-8547997689 No 92 >PRK07789 acetolactate synthase 1 catalytic subunit; Validated Probab=83.66 E-value=3.5 Score=21.08 Aligned_cols=72 Identities=8% Similarity=0.234 Sum_probs=48.2 Q ss_pred HHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEE---ECCCHHH-CCCCCCCCHHHHCCCCCEEECCCCCC Q ss_conf 8875214378269724889999999999743534886898---5660221-08012688789379986035157764 Q gi|254781051|r 749 RFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIM---MAPKSLL-RHKRVVSSLSDMTCGSVFQAVLSDDA 821 (957) Q Consensus 749 RfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv---~tPKsLL-R~~~a~S~l~d~~~g~~F~~vi~d~~ 821 (957) -|-.++.--...-.-+++|.++=.+|++=+..+ -+|-+| ..|.-+. =.-..=.+++||--....+|..+|.. T Consensus 533 Df~~lA~a~G~~g~rv~~~~el~~al~~Al~~~-d~P~lIdv~vdp~~~~~P~~~~g~~~~~~~~~~~~~~~~~~~~ 608 (612) T PRK07789 533 DFVKLAEALGCVGLRCEREEDVDAVIEKARAIN-DRPVVIDFVVGADAMVWPMVAAGTSNDEIQYARGIRPLFDDEE 608 (612) T ss_pred CHHHHHHHCCCEEEEECCHHHHHHHHHHHHHCC-CCCEEEEEEECCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCHH T ss_conf 999999977987999799999999999998559-9958999998988770698899989899704768899878422 No 93 >PRK12376 putative translaldolase; Provisional Probab=82.54 E-value=2.5 Score=22.21 Aligned_cols=59 Identities=31% Similarity=0.471 Sum_probs=41.2 Q ss_pred EECCCCCHH-CCCCC-----CCCHHHHHH-CCCE-EEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEEC Q ss_conf 201247010-24677-----840011220-3405-874267889999999999999997389739986421 Q gi|254781051|r 413 IGFTTNPSS-ARSCT-----YASDISKSI-GIPI-FHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCY 475 (957) Q Consensus 413 IGFTT~p~d-~RSs~-----Y~TDiAK~i-~aPI-~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~Y 475 (957) =|+||||.- .|+.. ++.++.+.+ +.|| +-|-|||+|..+.-++.- + +++++|+|-+=|= T Consensus 30 ~GVTTNPsLl~k~G~~d~~~~~~~i~~~i~~~~is~EV~~~~~~~mi~qA~~l---~-~~~~nv~VKIP~t 96 (238) T PRK12376 30 KGFTTNPSLMKKAGITDYKAFAKEVLAEIPDYPISFEVFADDLETMEKEAEIL---A-SLGENVYVKIPIT 96 (238) T ss_pred CEEECCHHHHHHCCCCCHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHH---H-HHCCCEEEEECCC T ss_conf 57907889998669997899999999638998779999568778899999999---9-7589779997785 No 94 >PRK06466 acetolactate synthase 3 catalytic subunit; Validated Probab=82.51 E-value=3.8 Score=20.78 Aligned_cols=19 Identities=32% Similarity=0.508 Sum_probs=7.3 Q ss_pred HHHHHHHHHCCCCEEEEECC Q ss_conf 99999987379107998067 Q gi|254781051|r 251 EEIIRQGVQQGVDEMILGMA 270 (957) Q Consensus 251 ~~ii~~a~~~Gv~eiviGMa 270 (957) |.|++.-.+.||+. |+|.+ T Consensus 8 ~~l~~~L~~~GV~~-vFgip 26 (574) T PRK06466 8 EMLVRALRDEGVEY-IYGYP 26 (574) T ss_pred HHHHHHHHHCCCCE-EEECC T ss_conf 99999999879999-99778 No 95 >pfam10237 N6-adenineMlase Probable N6-adenine methyltransferase. This is a protein of approximately 200 residues which is conserved from plants to humans. It contains a highly conserved QFW motif close to the N-terminus and a DPPF motif in the centre. The DPPF motif is characteristic of N-6 adenine-specific DNA methylases, and this family is found in eukaryotes. Probab=82.45 E-value=2.8 Score=21.81 Aligned_cols=104 Identities=18% Similarity=0.256 Sum_probs=55.0 Q ss_pred CHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCC------CCCCCEEEECCC---HHHCCCCCCCCHHHHCCCCCEEEC Q ss_conf 05788752143782697248899999999997435------348868985660---221080126887893799860351 Q gi|254781051|r 746 RLERFLQMCAENNMYVANCTSPANYFHILRRQIYD------RSSRPLIMMAPK---SLLRHKRVVSSLSDMTCGSVFQAV 816 (957) Q Consensus 746 RlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r------~~rkPLiv~tPK---sLLR~~~a~S~l~d~~~g~~F~~v 816 (957) .|-+=+.-.+.++..|+-.|||+=|+++.-+...+ .|-|-.-++..+ -=...|.-. .+++ .| .|..| T Consensus 14 ~La~~~~~~~~~~~~IA~lS~PSly~~l~~~~~~~~~v~LfEyD~RF~~~~~~FvfYDyn~P~~l--p~~l-~~-~fD~v 89 (161) T pfam10237 14 KLAKALLEGADEGTVIACVSAPSVYAALKERPIPTKNVYLLEYDKRFAVYGGEFVFYDYNKPLEL--PEQL-KG-KFDRV 89 (161) T ss_pred HHHHHHHHHCCCCCEEEEECCCHHHHHHHHCCCCCCCEEEEEECHHHHHCCCCEEECCCCCCCCC--HHHH-HC-CCCEE T ss_conf 99999997368998799985868999998434887737996543058767994167778980108--7997-08-67889 Q ss_pred CCCCCCCCCCCC----C---CCCCHHHCCEEEEECCHHHHHHHHHH Q ss_conf 577643344454----4---45795457689986885789999999 Q gi|254781051|r 817 LSDDAEYHGKTS----V---KLKEDSHIRRVILCTGKVYYDLLDNR 855 (957) Q Consensus 817 i~d~~~~~~~~~----~---~~~~~~~v~rvi~csGKvyydL~~~r 855 (957) |-||....-.-- . .+..+ --|+|+|||.+--+++.+. T Consensus 90 i~DPPFLseec~~K~a~t~~~L~k~--~~kli~~Tg~~~~~~~~kl 133 (161) T pfam10237 90 IADPPFLSEECLTKTAITIKLLLKP--DSKLLLCTGERMEDLAAKL 133 (161) T ss_pred EECCCCCCHHHHHHHHHHHHHHHCC--CCEEEEECHHHHHHHHHHH T ss_conf 9899998999999999999998089--9879993489999999997 No 96 >PRK08611 pyruvate oxidase; Provisional Probab=81.83 E-value=4.1 Score=20.58 Aligned_cols=38 Identities=8% Similarity=0.135 Sum_probs=22.9 Q ss_pred HHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 8875214378269724889999999999743534886898 Q gi|254781051|r 749 RFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIM 788 (957) Q Consensus 749 RfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv 788 (957) +|-+++.--...=...+||..+-.+|++=+.+ .+|-+| T Consensus 488 Df~~~A~a~G~~g~rV~~~~el~~Al~~Al~~--~~P~lI 525 (576) T PRK08611 488 DYAKFAEACGGKGYTVKSADELDPAFEEALAQ--DKPVII 525 (576) T ss_pred CHHHHHHHCCCEEEEECCHHHHHHHHHHHHHC--CCCEEE T ss_conf 99999997799799979999999999999828--997999 No 97 >PRK12655 fructose-6-phosphate aldolase; Reviewed Probab=80.95 E-value=3.5 Score=21.12 Aligned_cols=57 Identities=26% Similarity=0.300 Sum_probs=36.7 Q ss_pred ECCCCCH-HCCCCCCCC----HHHHHH--CCCEE-EECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 0124701-024677840----011220--34058-7426788999999999999999738973998642 Q gi|254781051|r 414 GFTTNPS-SARSCTYAS----DISKSI--GIPIF-HVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVC 474 (957) Q Consensus 414 GFTT~p~-d~RSs~Y~T----DiAK~i--~aPI~-HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~ 474 (957) |+||||. -.|+..-+. ++++.+ +.|++ -|-|+|+|..+.-++.- +. ..++|||-+=+ T Consensus 24 GvTTNPsll~k~g~~~~~~~~~i~~~~~~~~~l~~ev~~~d~~~m~~~a~~l---~~-~~~~ivVKIP~ 88 (220) T PRK12655 24 GVTTNPSIIAASKESIWEVLPRLQKAIGGEGILFAQTMSRDAQGMVEEAKRL---RN-AIPGIVVKIPV 88 (220) T ss_pred EECCCHHHHHHCCCCHHHHHHHHHHHHCCCCCEEEEEECCCHHHHHHHHHHH---HH-HCCCEEEECCC T ss_conf 5828999998659987999999999828999789999559999999999999---97-47873998688 No 98 >cd03372 TPP_ComE Thiamine pyrophosphate (TPP) family, ComE subfamily, TPP-binding module; composed of proteins similar to Methanococcus jannaschii sulfopyruvate decarboxylase beta subunit (ComE). M. jannaschii sulfopyruvate decarboxylase (ComDE) is a dodecamer of six alpha (D) subunits and six (E) beta subunits, which catalyzes the decarboxylation of sulfopyruvic acid to sulfoacetaldehyde in the coenzyme M pathway. ComDE requires TPP and divalent metal cation cofactors. Probab=80.47 E-value=4.5 Score=20.26 Aligned_cols=113 Identities=19% Similarity=0.284 Sum_probs=67.7 Q ss_pred HCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHC---CCCCCCCCEEEEEECCCE Q ss_conf 00445532367777641666544555654454047888625753334672588764510---677443657999841602 Q gi|254781051|r 337 FVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSG---LSGYTVAGNIHLIINNQI 413 (957) Q Consensus 337 aVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~---l~gY~tGGTiHiVvNNQI 413 (957) ...|+++|.+-|+ + ++|++ |-||++|- |+|.. +-.|..---+|||.||+. T Consensus 46 ~~~p~aiGaa~a~----------------p-~kVi~--i~GDGs~l--------M~~geL~Ti~~~~~~n~~~ivlnN~~ 98 (179) T cd03372 46 LASSIGLGLALAQ----------------P-RKVIV--IDGDGSLL--------MNLGALATIAAEKPKNLIIVVLDNGA 98 (179) T ss_pred HHHHHHHHHHHCC----------------C-CCEEE--EECCHHHH--------HHHHHHHHHHHHCCCCEEEEEEECCC T ss_conf 0889999998657----------------9-96899--96780465--------43669999997268978999983887 Q ss_pred ECCCCCHHCCCCCCCCH---HHHHHCCCEE-EECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCC Q ss_conf 01247010246778400---1122034058-7426788999999999999999738973998642112565643542235 Q gi|254781051|r 414 GFTTNPSSARSCTYASD---ISKSIGIPIF-HVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFT 489 (957) Q Consensus 414 GFTT~p~d~RSs~Y~TD---iAK~i~aPI~-HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fT 489 (957) ==.|--..+- +..++| ||+..+.+-+ -| +++|..-.+.+.| -..-++|++.|=+ .|.+-|+.+ T Consensus 99 ~g~~g~q~t~-~~~~~Df~~iA~a~G~~~~~~v--~~~e~l~~~l~~~------~~gP~~iev~i~~----~~~~~p~~p 165 (179) T cd03372 99 YGSTGNQPTH-AGKKTDLEAVAKACGLDNVATV--ASEEAFEKAVEQA------LDGPSFIHVKIKP----GNTDVPNIP 165 (179) T ss_pred CCCCCCCCCC-CCCCCCHHHHHHHCCCCEEEEE--CCHHHHHHHHHHH------CCCCEEEEEEECC----CCCCCCCCC T ss_conf 5541350366-6789899999998799739998--9999999999984------7999999999879----998889999 No 99 >PRK13029 2-oxoacid ferredoxin oxidoreductase; Provisional Probab=78.52 E-value=5.1 Score=19.83 Aligned_cols=62 Identities=16% Similarity=0.224 Sum_probs=30.3 Q ss_pred HHHHHHHHHHHHCCCCCEEEEEEECCCCCH--HHHHHHHHHHCCCCEEEEE---------CCCC--CCCCCHHHHHHHHH Q ss_conf 578999999997699868999840148682--8999999853799839980---------6785--31784578997699 Q gi|254781051|r 846 KVYYDLLDNRDMRNIADIYLLRIEQLYPFP--EDYLIKVLSRFVQAEIVWC---------QEEP--QNMGAWTFIEPYLE 912 (957) Q Consensus 846 KvyydL~~~r~~~~~~~vaivRiEqLyPfP--~~~l~~~l~~y~~~e~vW~---------QEEp--~N~GaW~~v~~rl~ 912 (957) +-||-|.++.++-. || .||-=| .++|.+......+..|-+. .-.| .-.|+|.+.-.++- T Consensus 1021 r~lfKLMAYKDEYE---VA-----RL~td~~F~~~l~~qFeg~~kl~fhLAPPlL~r~d~~GrprK~~fG~Wm~~~frlL 1092 (1186) T PRK13029 1021 RNLFKLMAYKDEYE---VA-----RLYTDPAFLDQLAAEFEGEPRLSFHLAPPLFARRDERGHLVKVRYGPWMLPAFRVL 1092 (1186) T ss_pred HHHHHHHHHHHHHH---HH-----HHCCCHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCHHHCCCCHHHHHHHHH T ss_conf 99986841633989---98-----84188899999998649995179985897557889999880300770699999999 Q ss_pred HHH Q ss_conf 998 Q gi|254781051|r 913 KVL 915 (957) Q Consensus 913 ~~l 915 (957) .-+ T Consensus 1093 A~l 1095 (1186) T PRK13029 1093 ARL 1095 (1186) T ss_pred HHC T ss_conf 747 No 100 >cd06250 M14_PaAOTO_like An uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the the M14 family of metallocarboxypeptidases. This subgroup includes Pseudomonas aeruginosa AotO and related proteins. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. The gene encoding Probab=78.37 E-value=5.2 Score=19.80 Aligned_cols=106 Identities=20% Similarity=0.345 Sum_probs=61.2 Q ss_pred CEEEEEEEECCCCCCCCCHHHHHHH-HHCCCC-CCCCCEEEEE-ECCCEECCCCCHHCCCCCCCC----HHHHHHC---- Q ss_conf 4047888625753334672588764-510677-4436579998-416020124701024677840----0112203---- Q gi|254781051|r 368 SRVLPLIIHGDAAFAGQGIVSETFG-LSGLSG-YTVAGNIHLI-INNQIGFTTNPSSARSCTYAS----DISKSIG---- 436 (957) Q Consensus 368 ~~vlpiliHGDAAfaGQGVV~Etl~-ls~l~g-Y~tGGTiHiV-vNNQIGFTT~p~d~RSs~Y~T----DiAK~i~---- 436 (957) +-.+-=-||||= ..|+.|+.+.++ |.+++. =..-|||=+| +=|-+||...-....++||.. |+-+++- T Consensus 29 ~vyiqAalHGDE-l~G~~v~~~L~~~L~~~e~~~~L~GtviiVPv~Np~G~~q~~~~~~~GRf~~~d~~dlNR~FP~~~~ 107 (359) T cd06250 29 KVYIQASLHADE-LPGMLVLHHLIELLKKLEAEGRILGEITLVPVANPIGLNQRLGGFHLGRFDLASGTNFNRDFPDLAD 107 (359) T ss_pred EEEEECCCCHHC-CCHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 799980445100-2169999999999761322366723799993648788765423566666678998834657887532 Q ss_pred ---CCEEEECCCCHHHHHHHHHHHHH--------------------HHHHHCCCEEEEEEE Q ss_conf ---40587426788999999999999--------------------999738973998642 Q gi|254781051|r 437 ---IPIFHVNGDDPEAVIRVVRMAVS--------------------FRMKFHKSVVIDIVC 474 (957) Q Consensus 437 ---aPI~HVN~DdpEav~~~~~lA~~--------------------yR~~F~kDVvIDlv~ 474 (957) .=+=..=++|++.-+...+-++. .+.-...|++|||=| T Consensus 108 ~~~~~~~~~~~~~~~~~~~~~r~~~~~~l~~~~~~~~~~~~l~~~L~~la~~aD~vIDLHt 168 (359) T cd06250 108 AVADFVEGHLTSDPAANVALVRQALREALAALLPPTELQSTLRLTLQRLALDADIVLDLHC 168 (359) T ss_pred CHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 0233331135775788999999999998740367624778999999875406788998467 No 101 >cd01987 USP_OKCHK USP domain is located between the N-terminal sensor domain and C-terminal catalytic domain of this Osmosensitive K+ channel histidine kinase family. The family of KdpD sensor kinase proteins regulates the kdpFABC operon responsible for potassium transport. The USP domain is homologous to the universal stress protein Usp Usp is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. Usp enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. Probab=78.09 E-value=5.3 Score=19.74 Aligned_cols=59 Identities=27% Similarity=0.446 Sum_probs=41.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHH Q ss_conf 879889999999999999899999864033622225646589999999999987379107998067655268 Q gi|254781051|r 205 NFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLN 276 (957) Q Consensus 205 ~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLN 276 (957) .++.++++.+-+.+--|+. .|.+---++|.+.- ++|++.|.+.|++.||||=++|+++- T Consensus 42 ~~~~~~~~~l~~~~~la~~---------lga~~~~~~~~d~~----~~I~~~A~~~~~t~IVlG~~~~~~~~ 100 (124) T cd01987 42 RLSEAERRRLAEALRLAEE---------LGAEVVTLPGDDVA----EAIVEFAREHNVTQIVVGKSRRSRWR 100 (124) T ss_pred CCCHHHHHHHHHHHHHHHH---------CCCEEEEEECCCHH----HHHHHHHHHCCCCEEEECCCCCCHHH T ss_conf 5897999999999999998---------59989999479989----99999999849989997689885468 No 102 >PRK06457 pyruvate dehydrogenase; Provisional Probab=77.94 E-value=5.3 Score=19.71 Aligned_cols=38 Identities=8% Similarity=0.187 Sum_probs=25.3 Q ss_pred HHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 8875214378269724889999999999743534886898 Q gi|254781051|r 749 RFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIM 788 (957) Q Consensus 749 RfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv 788 (957) .|-+++.--...-...+||.++=-+|++=+.. .+|-+| T Consensus 477 Df~~~A~a~G~~g~~V~~~~el~~al~~A~~~--~gP~lI 514 (549) T PRK06457 477 DFSKLAESVGIKGFRVEDPNELEEAIEEFLNT--EGPAVL 514 (549) T ss_pred CHHHHHHHCCCEEEEECCHHHHHHHHHHHHCC--CCCEEE T ss_conf 99999997799799979999999999999818--995999 No 103 >PRK07524 hypothetical protein; Provisional Probab=77.31 E-value=5.5 Score=19.58 Aligned_cols=19 Identities=26% Similarity=0.527 Sum_probs=11.5 Q ss_pred HHHHHHHHHCCCCEEEEECC Q ss_conf 99999987379107998067 Q gi|254781051|r 251 EEIIRQGVQQGVDEMILGMA 270 (957) Q Consensus 251 ~~ii~~a~~~Gv~eiviGMa 270 (957) |.|++.-.+.||+.| +|.+ T Consensus 6 ~~l~~~L~~~Gv~~i-Fgvp 24 (534) T PRK07524 6 EALVRLLEAYGVETV-FGIP 24 (534) T ss_pred HHHHHHHHHCCCCEE-EECC T ss_conf 999999998799999-9807 No 104 >PRK13030 2-oxoacid ferredoxin oxidoreductase; Provisional Probab=76.20 E-value=5.9 Score=19.37 Aligned_cols=45 Identities=18% Similarity=0.046 Sum_probs=24.0 Q ss_pred HHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCC Q ss_conf 3442789845999824311110134320221138653111000024 Q gi|254781051|r 655 LGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFG 700 (957) Q Consensus 655 L~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFG 700 (957) |..+.++ -..-|.|+...-.|.+-..=..++..+.....-++.-| T Consensus 814 l~~~~~~-rT~aVvNt~~~PTg~Fv~~~d~~~p~~~~~~~l~~~~g 858 (1168) T PRK13030 814 LQTVRHG-RTRAVVNTHEIPTADFVRNPDANLHADALLAKIRHAAG 858 (1168) T ss_pred HHHHCCC-CEEEEEECCCCCHHHHHHCCCCCCCHHHHHHHHHHHHC T ss_conf 9750778-73899507867638774088888996999999998737 No 105 >PRK09193 indolepyruvate ferredoxin oxidoreductase; Validated Probab=75.26 E-value=6.2 Score=19.20 Aligned_cols=35 Identities=14% Similarity=0.109 Sum_probs=19.5 Q ss_pred HHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCC Q ss_conf 344278984599982431111013432022113865 Q gi|254781051|r 655 LGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLN 690 (957) Q Consensus 655 L~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~ 690 (957) |..++++ -..-|.||...-.|.+-..=..+...+. T Consensus 809 L~~~~~~-rT~aVvNt~~~PTg~Fv~d~d~~~p~~~ 843 (1155) T PRK09193 809 LSRMRPG-RTRAVVNTHETPTADFVRNPDWRFPGAS 843 (1155) T ss_pred HHHHCCC-CEEEEEECCCCCHHHHHCCCCCCCCHHH T ss_conf 9751767-7389950786765876218888998489 No 106 >cd02009 TPP_SHCHC_synthase Thiamine pyrophosphate (TPP) family, SHCHC synthase subfamily, TPP-binding module; composed of proteins similar to Escherichia coli 2-succinyl-6-hydroxyl-2,4-cyclohexadiene-1-carboxylic acid (SHCHC) synthase (also called MenD). SHCHC synthase plays a key role in the menaquinone biosynthetic pathway, converting isochorismate and 2-oxoglutarate to SHCHC, pyruvate and carbon dioxide. The enzyme requires TPP and a divalent metal cation for activity. Probab=74.93 E-value=6.3 Score=19.14 Aligned_cols=119 Identities=16% Similarity=0.227 Sum_probs=64.4 Q ss_pred EEEEECCCCCCCHHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCC Q ss_conf 89997178773100044553236777764166654455565445404788862575333467258876451067744365 Q gi|254781051|r 324 VKLLLQSNPSHLEFVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAG 403 (957) Q Consensus 324 v~lsL~~NPSHLEaVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGG 403 (957) +.+------|=.+-.=|-++|.+.|.. +.|++ |-||.+|- ==+.|-..+.+ |.. - T Consensus 42 ~~~~~NRG~sGIdg~lstAiGaa~a~~-----------------~pvv~--i~GDgsf~--~d~~eL~t~~~---~~~-p 96 (175) T cd02009 42 VRVFANRGASGIDGTLSTALGIALATD-----------------KPTVL--LTGDLSFL--HDLNGLLLGKQ---EPL-N 96 (175) T ss_pred CEEEECCCCCCCCCHHHHHHHHHHCCC-----------------CCEEE--EEECHHHH--CCHHHHHHHHH---CCC-C T ss_conf 279946885352208999999996589-----------------98899--97026866--16789999986---699-8 Q ss_pred EEEEEECCCEE--CCCCC----HHCCCCCCCC-------HHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 79998416020--12470----1024677840-------01122034058742678899999999999999973897399 Q gi|254781051|r 404 NIHLIINNQIG--FTTNP----SSARSCTYAS-------DISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVI 470 (957) Q Consensus 404 TiHiVvNNQIG--FTT~p----~d~RSs~Y~T-------DiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvI 470 (957) .+-||+||+=| |-.-| .+.....|.| .+|++++.|-..|+ ++|..-.+.+-|++ -..-++| T Consensus 97 v~ivV~NN~GggIf~~l~~~~~~~~~~~~~~tp~~~df~~lA~a~G~~~~~v~--~~~el~~al~~~~~----~~gp~lI 170 (175) T cd02009 97 LTIVVINNNGGGIFSLLPQASFEDEFERLFGTPQGLDFEHLAKAYGLEYRRVS--SLDELEQALESALA----QDGPHVI 170 (175) T ss_pred EEEEEEECCCCHHHHHHHHHHCCCCCCEEECCCCCCCHHHHHHHCCCCEEEEC--CHHHHHHHHHHHHH----CCCCEEE T ss_conf 29999948983399998775036864312157776689999998699779849--99999999999983----8998999 Q ss_pred EEE Q ss_conf 864 Q gi|254781051|r 471 DIV 473 (957) Q Consensus 471 Dlv 473 (957) ++. T Consensus 171 EV~ 173 (175) T cd02009 171 EVK 173 (175) T ss_pred EEE T ss_conf 996 No 107 >PRK07586 hypothetical protein; Validated Probab=73.48 E-value=6.8 Score=18.89 Aligned_cols=20 Identities=15% Similarity=0.169 Sum_probs=10.8 Q ss_pred CCCHHHHHHHHHHHHHHHHC Q ss_conf 38678999999999998719 Q gi|254781051|r 938 VGHMSRHLEQLSSFIEDALK 957 (957) Q Consensus 938 ~G~~~~H~~e~~~li~~a~~ 957 (957) .|....-..|.+..|++||+ T Consensus 484 ~g~~V~~~~el~~Al~~a~~ 503 (514) T PRK07586 484 PARRVTTAEEFADALARALA 503 (514) T ss_pred EEEEECCHHHHHHHHHHHHC T ss_conf 79997999999999999972 No 108 >TIGR01824 PabB-clade2 para-aminobenzoate synthase component I; InterPro: IPR010118 This clade of sequences is more closely related to TrpE (anthranilate synthase, IPR005256 from INTERPRO, IPR010116 from INTERPRO, IPR005257 from INTERPRO) than to the better characterised group of PabB enzymes (IPR005802 from INTERPRO, IPR010117 from INTERPRO). This clade includes one characterised enzyme from Lactococcus and the conserved function across the clade is supported by these pieces of evidence: 1) all genomes with a member in this clade also have a separate TrpE gene, 2) none of these genomes contain an apparent PabB from any of the other PabB clades, 3) none of these sequences are found in a region of the genome in association with other Trp biosynthesis genes, 4) all of these genomes apparently contain most if not all of the steps of the folate biosynthetic pathway (for which PABA is a precursor). Many of the sequences are annotated as TrpE enzymes, however, we believe that all members of this clade are, in fact, PabB. The sequences from Bacillus halodurans and B.subtilus, are also likely to be PabB enzymes, but are too closely related to TrpE to be separated at this time.. Probab=73.00 E-value=2.2 Score=22.66 Aligned_cols=41 Identities=20% Similarity=0.223 Sum_probs=33.4 Q ss_pred CCHHHC-CCEEECCCCCCCCCCCHHHHHHHHHHH----HCCCEEEE Q ss_conf 997881-715725665787777899999999984----06430277 Q gi|254781051|r 140 FVKADY-DRKICMKGVLGLESATIPEIVDVLSHL----YCSNIGVE 180 (957) Q Consensus 140 Lse~DL-d~~f~~~~~~g~~~~tL~eIi~~L~~t----YCgsIGvE 180 (957) ++-.|| +-.|+-|++-|-++..--+||+.||.+ |||++|+= T Consensus 272 ~~~~dl~~A~FPgGsITGAPK~r~m~iI~eLEp~~Rg~Y~Gs~G~~ 317 (373) T TIGR01824 272 LTLADLIAALFPGGSITGAPKVRAMEIIEELEPQRRGPYTGSVGWI 317 (373) T ss_pred CCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCEEEEEEE T ss_conf 7888999642878744489727899999986378987541238888 No 109 >COG0147 TrpE Anthranilate/para-aminobenzoate synthases component I [Amino acid transport and metabolism / Coenzyme metabolism] Probab=72.87 E-value=1.1 Score=24.81 Aligned_cols=36 Identities=14% Similarity=0.207 Sum_probs=26.3 Q ss_pred CCHHHHHHHHHHHHCCCC-------CCCEEEECCCHHHCCCCC Q ss_conf 889999999999743534-------886898566022108012 Q gi|254781051|r 765 TSPANYFHILRRQIYDRS-------SRPLIMMAPKSLLRHKRV 800 (957) Q Consensus 765 ttpAq~FH~LRRQ~~r~~-------rkPLiv~tPKsLLR~~~a 800 (957) ..|-++|..||.+--.+| -.-+|-.+|.++++.... T Consensus 227 ~~p~~~y~~Lr~~NPspY~~~l~~~~~~ivg~SPE~~v~v~~~ 269 (462) T COG0147 227 GDPLALYRRLRQRNPSPYMFFLRLGDFTLVGASPELFVKVDGN 269 (462) T ss_pred CCHHHHHHHHHHCCCCCHHHEEECCCCEEEEECHHHHEEECCC T ss_conf 9989999999734996112103169818999760563788088 No 110 >PRK06048 acetolactate synthase 3 catalytic subunit; Reviewed Probab=71.95 E-value=7.3 Score=18.64 Aligned_cols=46 Identities=17% Similarity=0.199 Sum_probs=18.1 Q ss_pred HHHHHCCCCCCCC--HHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEEEE Q ss_conf 9987334356421--1478988898863583157850100223332302799 Q gi|254781051|r 594 RQKMIETGKGIDW--SMAESLAFGSLCYEGYKVRLSGQDCERGTFSHRHAIL 643 (957) Q Consensus 594 R~~~~~~~~~iDW--~~AE~LA~~SLl~eG~~VRlsGQD~~RGTFshRHavl 643 (957) ++-.+--|.++.+ +.+|..+++-.+ |.||=-| -.+||+|...|--. T Consensus 208 ~rPvIl~G~g~~~~~a~~~l~~lae~l--~~Pv~tt--~~gkg~~p~~hp~~ 255 (562) T PRK06048 208 CRPVIYAGGGVISSNASAELVELAETL--NAPVTTT--LMGIGSIPTEHPLY 255 (562) T ss_pred CCCEEEECCCCCCCHHHHHHHHHHHHH--CCCEEEC--CCCCCCCCCCCCCC T ss_conf 997899577645122899999999985--9887863--34546677667422 No 111 >PTZ00089 transketolase; Provisional Probab=71.85 E-value=7.4 Score=18.62 Aligned_cols=118 Identities=19% Similarity=0.335 Sum_probs=73.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCC Q ss_conf 45532367777641666544555654454047888625753334672588764510677443657999841602012470 Q gi|254781051|r 340 PVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNP 419 (957) Q Consensus 340 PVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p 419 (957) +...|++=|........ +.+... -=-++|.+|+ ||.... .||++|.++++. .+..+--|=|+=+|+|--..+- T Consensus 123 ~~avGmAla~~~l~~~~-~~~~~~-i~d~~vy~l~--GDG~l~-EG~~~EA~~~Ag--~~~L~nLi~i~D~N~i~idg~~ 195 (670) T PTZ00089 123 SNAVGMAIAAHNLADKY-NTEEFK-IFDNYVYAIC--GDGCMQ-EGVFCEAASLAG--HLGLGRLILLYDDNKITIDGNT 195 (670) T ss_pred HHHHHHHHHHHHHHHHH-CCCCCC-CCCCEEEEEE--CCCCCC-CHHHHHHHHHHH--HCCCCCEEEEEECCCCCCCCCC T ss_conf 99999999999987561-877666-5463399997--775434-589999999864--6289988999917721146885 Q ss_pred HHCCCCCCCCHHHHHHCC---CEEEE-CCC-CHHHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 102467784001122034---05874-267-88999999999999999738973998 Q gi|254781051|r 420 SSARSCTYASDISKSIGI---PIFHV-NGD-DPEAVIRVVRMAVSFRMKFHKSVVID 471 (957) Q Consensus 420 ~d~RSs~Y~TDiAK~i~a---PI~HV-N~D-dpEav~~~~~lA~~yR~~F~kDVvID 471 (957) . -.+..|+++-+.+ -|++| +|. |.+++..+.+-| +..=.|-.+|- T Consensus 196 ~----~~~~~d~~~r~~a~gW~v~~v~~g~~d~~~i~~ai~~A---k~~~~kPtlI~ 245 (670) T PTZ00089 196 D----LSFTEDIEKKFEALKWEVRKVEDGNKDFAGILHEIEEA---KKNLKQPSLII 245 (670) T ss_pred C----CCCCHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHHHH---HHCCCCCEEEE T ss_conf 4----36566599999980771343366777999999999999---85589984899 No 112 >PRK05940 anthranilate synthase component I-like protein; Validated Probab=71.10 E-value=1.3 Score=24.31 Aligned_cols=76 Identities=11% Similarity=0.038 Sum_probs=41.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCC---CCCCCCCCCEEEEE-- Q ss_conf 6589999999999987379107998067655268999885379899999855887432234---55510015026889-- Q gi|254781051|r 243 SEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEY---SGDVKYHLGLCCNR-- 317 (957) Q Consensus 243 ~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~---sGDVKYHLG~s~~~-- 317 (957) -|..+-+++.+.+.-..-.+=+|++..--+... ..++-++|....-.+|..+.. .+|. .-+|.|-++ T Consensus 189 ~~~y~~~v~~~k~~I~~Gd~yQv~ls~r~~~~~-------~~~p~~~y~~Lr~~nPsPy~~~~~~~~~-~iissSPE~~~ 260 (459) T PRK05940 189 QQEYEAAVRQAKKYIQAGDIFQANLSLRFETTT-------KADSWQIYQRLQQINPSPFASYWRTPWG-EMISCSPERLV 260 (459) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCEEEEECC-------CCCHHHHHHHHHHHCCCCCEEEEECCCC-EEEECCHHHHH T ss_conf 999999999999998649889997776998268-------9999999999984389984478966995-09836999998 Q ss_pred EECCCEEEE Q ss_conf 835961899 Q gi|254781051|r 318 QICGKDVKL 326 (957) Q Consensus 318 ~~~g~~v~l 326 (957) +..++.|.. T Consensus 261 ~~~~~~v~t 269 (459) T PRK05940 261 QLQGNQAQT 269 (459) T ss_pred HHCCCEEEE T ss_conf 533997999 No 113 >PRK07508 para-aminobenzoate synthase component I; Provisional Probab=71.01 E-value=1.2 Score=24.51 Aligned_cols=20 Identities=25% Similarity=0.486 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHHHHCCCHHH Q ss_conf 99999999999988170454 Q gi|254781051|r 97 DFFQVMKMIDAYRSYGHFKA 116 (957) Q Consensus 97 kq~rV~~LI~AYR~~GHl~A 116 (957) +-..+..-|.+.+..||..| T Consensus 30 eV~~~l~~ie~a~~~G~~~a 49 (377) T PRK07508 30 EFAPALAAMEAARAAGKWLA 49 (377) T ss_pred HHHHHHHHHHHHHHCCCEEE T ss_conf 99999999999987799899 No 114 >PRK07418 acetolactate synthase 3 catalytic subunit; Reviewed Probab=70.58 E-value=7.8 Score=18.43 Aligned_cols=38 Identities=5% Similarity=0.152 Sum_probs=26.0 Q ss_pred HHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEE Q ss_conf 8875214378269724889999999999743534886898 Q gi|254781051|r 749 RFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIM 788 (957) Q Consensus 749 RfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv 788 (957) .|-+|+.--...=.-.+||.++=.+|++=+.. .+|-+| T Consensus 517 Df~~lA~a~G~~g~rV~~~~eL~~Al~~Al~~--~gP~li 554 (615) T PRK07418 517 DFVKLAEAFGVKGMLISERDQLKDAIAEALAH--DGPVLI 554 (615) T ss_pred CHHHHHHHCCCEEEEECCHHHHHHHHHHHHHC--CCCEEE T ss_conf 99999997799799979999999999999808--997899 No 115 >PRK13572 anthranilate synthase component I; Provisional Probab=70.50 E-value=1.7 Score=23.53 Aligned_cols=10 Identities=30% Similarity=0.600 Sum_probs=3.3 Q ss_pred HHHHHHHHHH Q ss_conf 9999999999 Q gi|254781051|r 767 PANYFHILRR 776 (957) Q Consensus 767 pAq~FH~LRR 776 (957) |-++|-.||+ T Consensus 203 p~~~Yr~LR~ 212 (430) T PRK13572 203 PFQLYRNLRE 212 (430) T ss_pred HHHHHHHHHH T ss_conf 9999999986 No 116 >TIGR02351 thiH thiazole biosynthesis protein ThiH; InterPro: IPR012726 Members of this protein family are the ThiH proteins involved in thiamine biosynthesis. They are homologues of the BioB protein of biotin biosynthesis. Genes encoding these proteins are generally found in operons with other thiamin biosynthesis genes. ; GO: 0005506 iron ion binding, 0051539 4 iron 4 sulfur cluster binding, 0009228 thiamin biosynthetic process. Probab=70.12 E-value=2.6 Score=22.06 Aligned_cols=118 Identities=18% Similarity=0.305 Sum_probs=72.9 Q ss_pred HHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 98347636789689999999987380507989999987367643333334344432222222211123334455566655 Q gi|254781051|r 9 FSLSSFLDGTNCCYIEDLYKSYQEDPSSVCKDWYPLFSFLDENSEEYDNLEDGIASFLKEESVASAVFSQKKSAVESSCS 88 (957) Q Consensus 9 f~~sSFL~g~Na~YIEeLYekYL~DP~SVd~sWr~fF~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (957) |-.|+|+.+.++.|++.-| |+.=|+ ....+.++...+++++ T Consensus 233 wR~Da~~~a~H~~YL~~~Y-------------w~~~~S--------------------------iS~PRLRP~~n~aPas 273 (378) T TIGR02351 233 WRTDAFFTAYHLRYLQKKY-------------WKTKYS--------------------------ISVPRLRPCTNDAPAS 273 (378) T ss_pred HHHHHHHHHHHHHHHHHHC-------------CCCCEE--------------------------ECCCCCCCCCCCCCCC T ss_conf 2289999999999998636-------------833030--------------------------3254336766897646 Q ss_pred CCCHHH-HHHHHHHHHHHHHHHHCCCHH-------------HCCCCCCCCCCCCCCCCCCHHHCC--CCHHHCCCEEECC Q ss_conf 563367-899999999999998817045-------------401677777766787767876669--9978817157256 Q gi|254781051|r 89 LGETQS-VKDFFQVMKMIDAYRSYGHFK-------------ANIDPLGYNSHQKDLSELSPAHYG--FVKADYDRKICMK 152 (957) Q Consensus 89 ~~~~~~-~~kq~rV~~LI~AYR~~GHl~-------------A~lDPLgl~~~~~~~peL~~~~yG--Lse~DLd~~f~~~ 152 (957) ...... .-+.....|||-|||-.=|.. =++=|||...-... ..-.|.-|- ..+.+|++ |.+ T Consensus 274 G~~p~~~~~~d~~LvQ~~cAyRLF~p~~~IslSTRE~~~FRDn~~pLGiT~~SAG-ssT~pGGy~eC~~~~~leQ-FeI- 350 (378) T TIGR02351 274 GLKPKVKIVTDRELVQLICAYRLFDPDVEISLSTRESKKFRDNVIPLGITKMSAG-SSTEPGGYSECSEKKGLEQ-FEI- 350 (378) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCC-CCCCCCCCCCCCCCCCCCC-CCC- T ss_conf 7856665257678899999998734104744020256545664336632201122-0158866203788888751-012- Q ss_pred CCCCCCCCCHHHHHHHHHHH Q ss_conf 65787777899999999984 Q gi|254781051|r 153 GVLGLESATIPEIVDVLSHL 172 (957) Q Consensus 153 ~~~g~~~~tL~eIi~~L~~t 172 (957) .+..|+.||.+.||+. T Consensus 351 ----~D~Rsv~Ev~~~lr~~ 366 (378) T TIGR02351 351 ----SDERSVAEVEEDLRKK 366 (378) T ss_pred ----CCCCCHHHHHHHHHHC T ss_conf ----4678888999999862 No 117 >PRK13574 anthranilate synthase component I; Provisional Probab=69.95 E-value=1.6 Score=23.65 Aligned_cols=11 Identities=27% Similarity=0.664 Sum_probs=3.9 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999997 Q gi|254781051|r 767 PANYFHILRRQ 777 (957) Q Consensus 767 pAq~FH~LRRQ 777 (957) |-+.|-.||+. T Consensus 195 p~~lY~~LR~~ 205 (422) T PRK13574 195 PMRFYYNLRKI 205 (422) T ss_pred HHHHHHHHHHH T ss_conf 99999999965 No 118 >KOG1184 consensus Probab=69.84 E-value=4.9 Score=19.97 Aligned_cols=80 Identities=20% Similarity=0.192 Sum_probs=48.0 Q ss_pred CCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCC--------CCCCCCCCHHHHHHH-HHCCCHHHHHHHHHHHC Q ss_conf 678899999999999999973897399864211256564--------354223557888886-71899899999998644 Q gi|254781051|r 444 GDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNE--------GDEPSFTQPMMYKTI-RSHKSVLQLYADSLMKN 514 (957) Q Consensus 444 ~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE--------~DeP~fTQP~mY~~I-~~hp~v~~~Y~~~Li~~ 514 (957) -.|+|.--.|...++++-++=+++|++-=.|-||++|-. .--|.|--|.==-.| ..||..--+| - T Consensus 192 ~s~~e~~~~~v~~i~e~i~~~~~Pvil~~~~~~~~~~~~~~~~l~~~~~~p~~vtp~~Kg~i~E~hp~y~Gvy------~ 265 (561) T KOG1184 192 PSNKEGLEEAVDAILELINKAKKPVILGDPKLRRAKAESAFVELADATGFPVFVTPMGKGFIPESHPHYGGVY------W 265 (561) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCEEEEE------E T ss_conf 7717889999999999864136873640432257678999999877537870572156555467688551367------0 Q ss_pred CCCCHHHHHHHHHHH Q ss_conf 898989999999999 Q gi|254781051|r 515 QVISKQELQSLANNW 529 (957) Q Consensus 515 g~it~~e~~~~~~~~ 529 (957) |.++..++.++.+.. T Consensus 266 G~vs~~~~~e~vesa 280 (561) T KOG1184 266 GAVSTPFVKEIVESA 280 (561) T ss_pred CCCCCHHHHHHHHHC T ss_conf 435638799998625 No 119 >COG3961 Pyruvate decarboxylase and related thiamine pyrophosphate-requiring enzymes [Carbohydrate transport and metabolism / Coenzyme metabolism / General function prediction only] Probab=69.66 E-value=5.1 Score=19.81 Aligned_cols=85 Identities=28% Similarity=0.358 Sum_probs=66.5 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCC--------CCCCCCCCCCHHHHHHHH-HCCCHHHHHHH Q ss_conf 587426788999999999999999738973998642112565--------643542235578888867-18998999999 Q gi|254781051|r 439 IFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGH--------NEGDEPSFTQPMMYKTIR-SHKSVLQLYAD 509 (957) Q Consensus 439 I~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GH--------NE~DeP~fTQP~mY~~I~-~hp~v~~~Y~~ 509 (957) .+++-.-|||+--.+...+++.=++=+|-++|-=.|-+|+|- |..--|.||-|+==-.|. .||+-.-+| T Consensus 184 ~~~~~~sd~e~~~e~i~~i~~lI~~ak~p~ILad~~~~r~~~~~~~~~l~~~t~~p~~~~pmGKg~idEs~P~y~GvY-- 261 (557) T COG3961 184 DLQLKTSDPEALSEVIDTIAELINKAKKPVILADALVSRFGLEKELKKLINATGFPVATLPMGKGVIDESHPNYLGVY-- 261 (557) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEECCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCEEEEE-- T ss_conf 444689998899999999999984568867851423312226999999998408976994246545566688700377-- Q ss_pred HHHHCCCCCHHHHHHHHHHH Q ss_conf 98644898989999999999 Q gi|254781051|r 510 SLMKNQVISKQELQSLANNW 529 (957) Q Consensus 510 ~Li~~g~it~~e~~~~~~~~ 529 (957) .|.++..++.+..+.- T Consensus 262 ----~G~~s~~~vre~vE~a 277 (557) T COG3961 262 ----NGKLSEPEVREAVESA 277 (557) T ss_pred ----ECCCCCHHHHHHHHCC T ss_conf ----5434878799986237 No 120 >PRK07282 acetolactate synthase catalytic subunit; Reviewed Probab=68.35 E-value=8.7 Score=18.10 Aligned_cols=25 Identities=16% Similarity=0.158 Sum_probs=10.5 Q ss_pred HHHHCCCCCEEEEECCCHHHHHHHH Q ss_conf 8752143782697248899999999 Q gi|254781051|r 750 FLQMCAENNMYVANCTSPANYFHIL 774 (957) Q Consensus 750 fLQl~a~~N~~V~~~ttpAq~FH~L 774 (957) |-+++.--...-.-++||..+=-+| T Consensus 501 f~~iA~a~G~~~~rv~~~~eL~~aL 525 (566) T PRK07282 501 FQLMAQAYGIKHYKFDNPETLAQDL 525 (566) T ss_pred HHHHHHHCCCEEEEECCHHHHHHHH T ss_conf 9999997798799989999999999 No 121 >PRK13570 anthranilate synthase component I; Provisional Probab=67.24 E-value=1.9 Score=23.06 Aligned_cols=17 Identities=24% Similarity=0.534 Sum_probs=7.7 Q ss_pred CCCEEEECCCCHHHHHH Q ss_conf 34058742678899999 Q gi|254781051|r 436 GIPIFHVNGDDPEAVIR 452 (957) Q Consensus 436 ~aPI~HVN~DdpEav~~ 452 (957) +.+=+++=|-.||.-++ T Consensus 249 ~~~~~~IigsSPE~~vr 265 (454) T PRK13570 249 DFGDYQVIGSSPESLVS 265 (454) T ss_pred ECCCCEEECCCHHHHCE T ss_conf 32561798479789503 No 122 >PRK13565 anthranilate synthase component I; Provisional Probab=67.10 E-value=2.4 Score=22.37 Aligned_cols=21 Identities=5% Similarity=0.028 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHHCCCCEEEE Q ss_conf 999999999987379107998 Q gi|254781051|r 247 IPAIEEIIRQGVQQGVDEMIL 267 (957) Q Consensus 247 IP~L~~ii~~a~~~Gv~eivi 267 (957) +-+++.+.+.-..-.+=+||+ T Consensus 222 ~~~V~~~k~~I~~GDifQv~l 242 (490) T PRK13565 222 LAAVRKAKEYIAAGDCMQVVP 242 (490) T ss_pred HHHHHHHHHHHHCCCEEEEEC T ss_conf 999999999987497799962 No 123 >COG1590 Uncharacterized conserved protein [Function unknown] Probab=66.11 E-value=6.2 Score=19.21 Aligned_cols=70 Identities=29% Similarity=0.299 Sum_probs=45.0 Q ss_pred HHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHC---------CCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCC Q ss_conf 11220340587426788999999999999999738---------973998642112565643542235578888867189 Q gi|254781051|r 431 ISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFH---------KSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHK 501 (957) Q Consensus 431 iAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~---------kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp 501 (957) +=-+++-|||||-+.|-|+.....++|.+- -|+ +-++|.|.+=-| +|.|-- -+..- T Consensus 92 iwl~~~ppIlHV~~r~~e~A~~L~~~a~~~--GFK~SgI~sis~~r~iVEI~~~er-----mdvpl~--------e~g~~ 156 (208) T COG1590 92 IWLKVQPPILHVAARDLEAAKKLLNIARSC--GFKHSGIKSISQKRVIVEIRGTER-----MDVPLG--------EDGKI 156 (208) T ss_pred EEEEEECCEEEEEECCHHHHHHHHHHHHHC--CCCCCCEEEECCCCEEEEECCCCE-----EEEEEC--------CCCCE T ss_conf 999832878998847678899999999862--975345267437728999746522-----564405--------78729 Q ss_pred CHHHHHHHHHHHCC Q ss_conf 98999999986448 Q gi|254781051|r 502 SVLQLYADSLMKNQ 515 (957) Q Consensus 502 ~v~~~Y~~~Li~~g 515 (957) -|.+-|.+.|++-. T Consensus 157 ~v~~eyl~~lv~~~ 170 (208) T COG1590 157 LVDDEYLKFLVEIA 170 (208) T ss_pred EECHHHHHHHHHHH T ss_conf 97689999999999 No 124 >PRK09070 hypothetical protein; Validated Probab=65.39 E-value=2 Score=22.86 Aligned_cols=41 Identities=17% Similarity=0.025 Sum_probs=23.5 Q ss_pred HHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEE-EEEECCCCCCCCCC Q ss_conf 2034058742678899999999999999973897399-86421125656435 Q gi|254781051|r 434 SIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVI-DIVCYRRFGHNEGD 484 (957) Q Consensus 434 ~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvI-Dlv~YRr~GHNE~D 484 (957) .++.+=++|=|-.||.-+++- +..|.+ =|=|=|+.|.|+.| T Consensus 242 yl~~~d~~IvgsSPE~lvrv~----------~~~V~trPIaGTr~RG~~~~~ 283 (451) T PRK09070 242 LFVAGGRAIVSSSPERLVSVQ----------GGVVQTRPIAGTRPRFVGDDD 283 (451) T ss_pred EEECCCCEEEECCHHHHCCCC----------CCEEEEECCCCCCCCCCCHHH T ss_conf 771588389916989970745----------897999657656789988789 No 125 >PRK07092 benzoylformate decarboxylase; Reviewed Probab=64.06 E-value=10 Score=17.51 Aligned_cols=32 Identities=16% Similarity=0.217 Sum_probs=19.3 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHH Q ss_conf 999999999998737910799806765526899 Q gi|254781051|r 246 IIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVL 278 (957) Q Consensus 246 lIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL 278 (957) |...-|.|++.-.+.||+. ++|.|=-.=|.++ T Consensus 2 m~Tv~d~i~~~L~~~Gv~~-iFgvpG~~~~~~~ 33 (521) T PRK07092 2 MTTVRDATIDLLRRFGITT-VFGNPGSTELPFL 33 (521) T ss_pred CCCHHHHHHHHHHHCCCCE-EEECCCCCHHHHH T ss_conf 8729999999999879999-9988874479999 No 126 >PRK13571 anthranilate synthase component I; Provisional Probab=63.99 E-value=3 Score=21.60 Aligned_cols=23 Identities=9% Similarity=0.030 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 89999999999987379107998 Q gi|254781051|r 245 VIIPAIEEIIRQGVQQGVDEMIL 267 (957) Q Consensus 245 slIP~L~~ii~~a~~~Gv~eivi 267 (957) ...-+++.+.+.-.+-.|=+||+ T Consensus 231 ey~~~V~kaKeyI~aGDIFQVVL 253 (506) T PRK13571 231 EYGAAVERLVEEIRAGEAFQVVP 253 (506) T ss_pred HHHHHHHHHHHHHHCCCEEEEEC T ss_conf 99999999999997498899987 No 127 >PRK12653 fructose-6-phosphate aldolase; Reviewed Probab=63.22 E-value=11 Score=17.41 Aligned_cols=57 Identities=25% Similarity=0.379 Sum_probs=36.9 Q ss_pred ECCCCCHH-CCCCCCCCHH----HHHHCC--CEE-EECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 01247010-2467784001----122034--058-7426788999999999999999738973998642 Q gi|254781051|r 414 GFTTNPSS-ARSCTYASDI----SKSIGI--PIF-HVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVC 474 (957) Q Consensus 414 GFTT~p~d-~RSs~Y~TDi----AK~i~a--PI~-HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~ 474 (957) |+||||.- .|+..=+.|+ .+.+.- |+. -|-|+|.|.-+.-++- ++ +..++|+|-+=| T Consensus 24 GvTTNPsll~k~g~~~~~~~~~i~~~~~~~~~l~~qv~~~~~e~M~~~a~~---l~-~~~~nvvVKIP~ 88 (220) T PRK12653 24 GVTTNPSIIAAGKKPLEVVLPQLHEAMGGQGRLFAQVMATTAEGMVNDARK---LR-SIIADIVVKVPV 88 (220) T ss_pred EECCCHHHHHHCCCCHHHHHHHHHHHHCCCCCEEEEEEECCHHHHHHHHHH---HH-HHCCCEEEEECC T ss_conf 585889999855989899999999980899858999875889999999999---98-735780899488 No 128 >PRK08155 acetolactate synthase catalytic subunit; Validated Probab=63.17 E-value=11 Score=17.40 Aligned_cols=26 Identities=27% Similarity=0.390 Sum_probs=11.1 Q ss_pred EECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 2225646589999999999987379107998067 Q gi|254781051|r 237 RFGADGSEVIIPAIEEIIRQGVQQGVDEMILGMA 270 (957) Q Consensus 237 RFsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMa 270 (957) |.-+-|+|.++=. -.+.||+.| +|.+ T Consensus 10 ~~k~tga~~lv~~-------L~~~Gv~~v-Fg~p 35 (564) T PRK08155 10 RKRFTGAELIVHL-------LERQGIRIV-TGIP 35 (564) T ss_pred CCEEEHHHHHHHH-------HHHCCCCEE-EEEC T ss_conf 7433799999999-------998799889-9968 No 129 >PRK13569 anthranilate synthase component I; Provisional Probab=62.42 E-value=2.9 Score=21.69 Aligned_cols=24 Identities=13% Similarity=0.131 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 899999999999873791079980 Q gi|254781051|r 245 VIIPAIEEIIRQGVQQGVDEMILG 268 (957) Q Consensus 245 slIP~L~~ii~~a~~~Gv~eiviG 268 (957) ..+-+++.+.+.-..-.|=+||+. T Consensus 232 ey~~~V~kaKeyI~aGDIfQVVLS 255 (506) T PRK13569 232 RFLADVERIKEYIKAGDIFQAVLS 255 (506) T ss_pred HHHHHHHHHHHHHHCCCEEEEECC T ss_conf 999999999999865988999864 No 130 >PRK06404 anthranilate synthase component I; Reviewed Probab=60.67 E-value=3.4 Score=21.16 Aligned_cols=87 Identities=16% Similarity=0.198 Sum_probs=42.3 Q ss_pred HHHCCCCEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCC---CCC Q ss_conf 64033622225646589999999999987379107998067655268999885379899999855887432234---555 Q gi|254781051|r 230 IKYKGAKRFGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEY---SGD 306 (957) Q Consensus 230 ~Kf~g~KRFsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~---sGD 306 (957) ++|.+.|.=|----|.+.-.++.+.+.-..-.+=+||+. + |+- +++ .+-.++..+.-.+|..... -|| T Consensus 91 ~~~~~~~~~sn~~~~ey~~~v~kike~I~~GDi~Qvvls--q--rf~--~~~---dp~~~y~~lr~~NPsPY~~y~~~~~ 161 (351) T PRK06404 91 KSFKTVKLKGNYNDISLSLKIKELIELIRAGEVLQVVIS--R--EFE--ANI---DFKEKLSEFINNDRSRYVFYYRFGK 161 (351) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEEE--E--EEC--CCC---CHHHHHHHHHHCCCCCEEEEEECCC T ss_conf 333431010458999999999999999866875999605--8--850--679---9899999997449997489999899 Q ss_pred CCCCCCEEEEE--EECCCEEEE Q ss_conf 10015026889--835961899 Q gi|254781051|r 307 VKYHLGLCCNR--QICGKDVKL 326 (957) Q Consensus 307 VKYHLG~s~~~--~~~g~~v~l 326 (957) .. =+|.|-++ +..|+.|.. T Consensus 162 ~~-iigaSPE~~l~~~~~~i~t 182 (351) T PRK06404 162 YR-VVGSSPENVFTVNGNIINV 182 (351) T ss_pred CE-EEEECHHHHEEECCCEEEE T ss_conf 48-9981867861863998999 No 131 >TIGR00959 ffh signal recognition particle protein; InterPro: IPR004780 The signal recognition particle (SRP) is a multimeric protein, which along with its conjugate receptor (SR), is involved in targeting secretory proteins to the rough endoplasmic reticulum (RER) membrane in eukaryotes, or to the plasma membrane in prokaryotes , . SRP recognises the signal sequence of the nascent polypeptide on the ribosome, retards its elongation, and docks the SRP-ribosome-polypeptide complex to the RER membrane via the SR receptor. SRP consists of six polypeptides (SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72) and a single 300 nucleotide 7S RNA molecule. The RNA component catalyses the interaction of SRP with its SR receptor . In higher eukaryotes, the SRP complex consists of the Alu domain and the S domain linked by the SRP RNA. The Alu domain consists of a heterodimer of SRP9 and SRP14 bound to the 5' and 3' terminal sequences of SRP RNA. This domain is necessary for retarding the elongation of the nascent polypeptide chain, which gives SRP time to dock the ribosome-polypeptide complex to the RER membrane. This entry represents various SRP subunits.; GO: 0008312 7S RNA binding, 0006614 SRP-dependent cotranslational protein targeting to membrane, 0048500 signal recognition particle. Probab=60.30 E-value=9.6 Score=17.76 Aligned_cols=144 Identities=20% Similarity=0.301 Sum_probs=81.7 Q ss_pred HHHHHCCCEEE-ECC-CCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHH--HHHHCCCHHHH Q ss_conf 11220340587-426-788999999999999999738973998642112565643542235578888--86718998999 Q gi|254781051|r 431 ISKSIGIPIFH-VNG-DDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYK--TIRSHKSVLQL 506 (957) Q Consensus 431 iAK~i~aPI~H-VN~-DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~--~I~~hp~v~~~ 506 (957) +++=+++|||+ ... -+|...+..++=|++|=++=+.|||| +===|==..|| .+|-+ .|++--.+-++ T Consensus 154 Lg~Q~gVpvf~h~~~~~~p~~Pv~ia~~Al~~Ak~~~~D~vI----~DTAGRL~ID~-----~LM~EL~~iK~~~nP~Ei 224 (439) T TIGR00959 154 LGEQVGVPVFAHLGKGQSPDDPVEIARQALEEAKENGFDVVI----VDTAGRLQIDE-----ELMEELAEIKEILNPDEI 224 (439) T ss_pred HHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEE----EECCCCHHHHH-----HHHHHHHHHHHHHCCCEE T ss_conf 767528871100478889887789999999999974897899----72675125559-----999999999888688705 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHH-HHHHH--------------HC--CCC-------CCCCCCCCCC------- Q ss_conf 99998644898989999999999999775-66654--------------12--587-------6566665686------- Q gi|254781051|r 507 YADSLMKNQVISKQELQSLANNWHKYLEA-EYKES--------------ES--YCP-------EKLGLLHNGE------- 555 (957) Q Consensus 507 Y~~~Li~~g~it~~e~~~~~~~~~~~l~~-~~~~~--------------k~--~~~-------~~~~~~~~~~------- 555 (957) -||-.+ +|=+|+=...+.|.+.|.= .+=-+ |. -.| ++...+.... T Consensus 225 ---LlVvDa-M~GQdAvn~A~~F~e~lgltG~vltK~DGDaRGGAALS~~~~tg~PIKFiG~GEK~~dLe~FhP~R~A~R 300 (439) T TIGR00959 225 ---LLVVDA-MTGQDAVNTAKTFNERLGLTGVVLTKLDGDARGGAALSVRSVTGKPIKFIGVGEKIEDLEPFHPDRMASR 300 (439) T ss_pred ---EECCHH-CCHHHHHHHHHHHCCCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHH T ss_conf ---412201-0216999999863660013547885475660578999999996896188841777233124674789863 Q ss_pred -----------CCCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHH Q ss_conf -----------2666778999999998874368998620567788 Q gi|254781051|r 556 -----------NERKNSVSKEILKKIGSSIIRLPKSFNTHKIVER 589 (957) Q Consensus 556 -----------~~~~T~v~~~~L~~i~~~l~~~Pe~f~~h~~v~k 589 (957) +...--|+.+..+++.+|+. - +.|++.--+.- T Consensus 301 ILGMGDv~SLvEKA~~~~D~~~a~~~~~K~~-~-~~FdleDfl~Q 343 (439) T TIGR00959 301 ILGMGDVLSLVEKAQEVVDEEEAKKLAEKML-K-GQFDLEDFLEQ 343 (439) T ss_pred HCCCCCHHHHHHHHHHHCCHHHHHHHHHHHH-C-CCCCHHHHHHH T ss_conf 2365522369999998518899999999863-0-58898999999 No 132 >PRK13564 anthranilate synthase component I; Provisional Probab=58.54 E-value=3.1 Score=21.53 Aligned_cols=25 Identities=4% Similarity=0.056 Sum_probs=10.8 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 5899999999999873791079980 Q gi|254781051|r 244 EVIIPAIEEIIRQGVQQGVDEMILG 268 (957) Q Consensus 244 EslIP~L~~ii~~a~~~Gv~eiviG 268 (957) |..+-+.+.+-+.-.+-.+=+||+. T Consensus 244 eeY~~~V~kaKeyI~~GDIFQVVLS 268 (521) T PRK13564 244 EEFCQVVRKLKEHIRAGDIFQVVPS 268 (521) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 9999999999999763987999534 No 133 >COG2166 sufE Cysteine desulfurase SufE subunit [Posttranslational modification, protein turnover, chaperones] Probab=58.45 E-value=4 Score=20.67 Aligned_cols=36 Identities=28% Similarity=0.474 Sum_probs=16.2 Q ss_pred CCCCCHHHHHHHHHHHCCCCE-EEEECCCCCCCCCHHH Q ss_conf 148682899999985379983-9980678531784578 Q gi|254781051|r 870 QLYPFPEDYLIKVLSRFVQAE-IVWCQEEPQNMGAWTF 906 (957) Q Consensus 870 qLyPfP~~~l~~~l~~y~~~e-~vW~QEEp~N~GaW~~ 906 (957) +|-|||.+.... ..+-+.++ -||...|++|-|.|.| T Consensus 38 ~Lp~lpe~~~~~-en~V~GC~S~vwL~~~~~~~~~~~F 74 (144) T COG2166 38 QLPPLPEELRAE-ENPVPGCQSQVWLVTEQNDDGTLHF 74 (144) T ss_pred HCCCCCHHHHHH-CCCCCCCCCCEEEEEEECCCCEEEE T ss_conf 089999788724-2678765423368886168863788 No 134 >PRK13573 anthranilate synthase component I; Provisional Probab=57.09 E-value=3.3 Score=21.26 Aligned_cols=51 Identities=12% Similarity=0.098 Sum_probs=21.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCC Q ss_conf 5899999999999873791079980676552689998853798999998558874322 Q gi|254781051|r 244 EVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEK 301 (957) Q Consensus 244 EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~ 301 (957) |...-+++.+.+.-..-.+=+||+.=--...+ ..++-+++......+|..+ T Consensus 231 eeY~~~V~~~k~~I~~Gdi~Qvvlsqrf~~~~-------~~~p~~lYr~LR~~NPsPy 281 (503) T PRK13573 231 EGYKAAVEKAKDYIRAGDIFQVVPSQRWTQDF-------RLPPFALYRSLRRTNPSPF 281 (503) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEEEEEEEEEC-------CCCHHHHHHHHHHHCCCCC T ss_conf 99999999999999749979998235999833-------8998999999863689831 No 135 >COG1091 RfbD dTDP-4-dehydrorhamnose reductase [Cell envelope biogenesis, outer membrane] Probab=56.76 E-value=14 Score=16.62 Aligned_cols=112 Identities=18% Similarity=0.138 Sum_probs=60.1 Q ss_pred HHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCC--------CEEEECCCCHHHHHHHHHHH Q ss_conf 2588764510677443657999841602012470102467784001122034--------05874267889999999999 Q gi|254781051|r 386 IVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGI--------PIFHVNGDDPEAVIRVVRMA 457 (957) Q Consensus 386 VV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~a--------PI~HVN~DdpEav~~~~~lA 457 (957) -|.-.+.+++ .|.. =-|||+|+|= -.|..|+|+.|-. +|+|+.++-. +--..+| T Consensus 158 Fv~tml~la~-----~~~~-l~vv~Dq~gs---------Pt~~~dlA~~i~~ll~~~~~~~~yH~~~~g~---~Swydfa 219 (281) T COG1091 158 FVKTMLRLAK-----EGKE-LKVVDDQYGS---------PTYTEDLADAILELLEKEKEGGVYHLVNSGE---CSWYEFA 219 (281) T ss_pred HHHHHHHHHH-----CCCC-EEEECCEEEC---------CCCHHHHHHHHHHHHHCCCCCCEEEEECCCC---CCHHHHH T ss_conf 8999999850-----5992-6997984538---------7469999999999983455586799807974---1199999 Q ss_pred HHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 9999973897399864211256564354223557888886718998999999986448989899999999999 Q gi|254781051|r 458 VSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWH 530 (957) Q Consensus 458 ~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~ 530 (957) -.-=..++.|. +.-+|.-+. -|....++|.-..+=.+|+.+.+.++.-+-+...+.+. T Consensus 220 ~~I~~~~~~~~-------------~v~~~~~~~--~~~~~a~RP~~S~L~~~k~~~~~~~~~~~w~~~l~~~~ 277 (281) T COG1091 220 KAIFEEAGVDG-------------EVIEPIASA--EYPTPAKRPANSSLDTKKLEKAFGLSLPEWREALKALL 277 (281) T ss_pred HHHHHHHCCCC-------------CCCCCCCCC--CCCCCCCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 99999838886-------------414555622--36766789755542528899974889825999999997 No 136 >COG0499 SAM1 S-adenosylhomocysteine hydrolase [Coenzyme metabolism] Probab=56.24 E-value=11 Score=17.35 Aligned_cols=74 Identities=22% Similarity=0.434 Sum_probs=48.1 Q ss_pred CCCCCCCCCEEEEEECCCEECCC------CCHHCCCCCCCC--HHHHHH----CCCEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 06774436579998416020124------701024677840--011220----340587426788999999999999999 Q gi|254781051|r 395 GLSGYTVAGNIHLIINNQIGFTT------NPSSARSCTYAS--DISKSI----GIPIFHVNGDDPEAVIRVVRMAVSFRM 462 (957) Q Consensus 395 ~l~gY~tGGTiHiVvNNQIGFTT------~p~d~RSs~Y~T--DiAK~i----~aPI~HVN~DdpEav~~~~~lA~~yR~ 462 (957) -|.|-+++|++|+-+---+=--| +-.-+-+-+++| |+|-.+ +.|||-.-|.+.|.-..++.-.++++ T Consensus 42 PlkG~~i~~~lH~t~kTAvLietL~a~GAeV~~a~cNplSTqD~vaaAl~~~~GipVfA~kGe~~eeY~~~~~~vl~~~- 120 (420) T COG0499 42 PLKGARIAGCLHMTAKTAVLIETLKAGGAEVRWASCNPLSTQDDVAAALAAKEGIPVFAWKGETLEEYYEAIDQVLDWE- 120 (420) T ss_pred CCCCCEEEEEEEEHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCC- T ss_conf 9776578998731587999999998558569985278876667899998750596179970787899999999995659- Q ss_pred HHCCCEEEEE Q ss_conf 7389739986 Q gi|254781051|r 463 KFHKSVVIDI 472 (957) Q Consensus 463 ~F~kDVvIDl 472 (957) -|++||= T Consensus 121 ---p~iiiDD 127 (420) T COG0499 121 ---PNIIIDD 127 (420) T ss_pred ---CCEEEEC T ss_conf ---9879824 No 137 >smart00604 MD MD domain. Probab=55.34 E-value=6.8 Score=18.91 Aligned_cols=13 Identities=23% Similarity=0.381 Sum_probs=5.5 Q ss_pred EEEEECCCCCCCHH Q ss_conf 89997178773100 Q gi|254781051|r 324 VKLLLQSNPSHLEF 337 (957) Q Consensus 324 v~lsL~~NPSHLEa 337 (957) ..|++-.+ |-|+. T Consensus 21 Cti~vR~q-S~lqV 33 (145) T smart00604 21 CTISVRSQ-SSLQV 33 (145) T ss_pred EEEEEEEC-CCEEE T ss_conf 89999835-64698 No 138 >PRK06163 hypothetical protein; Provisional Probab=54.04 E-value=15 Score=16.31 Aligned_cols=106 Identities=17% Similarity=0.250 Sum_probs=66.5 Q ss_pred HCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHH---CCCCCCCCCEEEEEECCCE Q ss_conf 0044553236777764166654455565445404788862575333467258876451---0677443657999841602 Q gi|254781051|r 337 FVDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLS---GLSGYTVAGNIHLIINNQI 413 (957) Q Consensus 337 aVnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls---~l~gY~tGGTiHiVvNNQI 413 (957) ...|+.+|.+-|+-+ ++|++|- ||.+|- |+|- -+-.|.-.--+|||+||.. T Consensus 61 ~as~iaLG~Ala~p~----------------k~Vi~iD--GDGs~L--------M~mGsLaTi~~~~p~Nl~hIvlnN~~ 114 (202) T PRK06163 61 LAFPIALGVALAQPK----------------RRVIALE--GDGSLL--------MQLGALGTIAALAPKNLTIIVMDNGV 114 (202) T ss_pred CHHHHHHHHHHHCCC----------------CEEEEEE--CCHHHH--------HCHHHHHHHHHHCCCCEEEEEECCCC T ss_conf 188999999987899----------------8289994--766887--------35688999997178856999986964 Q ss_pred ECCCC--CHHCCCCCCCCHHHHHHCCC-EEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEE Q ss_conf 01247--01024677840011220340-587426788999999999999999738973998642 Q gi|254781051|r 414 GFTTN--PSSARSCTYASDISKSIGIP-IFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVC 474 (957) Q Consensus 414 GFTT~--p~d~RSs~Y~TDiAK~i~aP-I~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~ 474 (957) -=.|- |..+-...==..|||..+.+ .+.| ++++....+.+-++ +...-+||++-+ T Consensus 115 ~~stGgQ~T~~~~~vDf~~iA~a~Gy~~~~~v--~~~~el~~~l~~~l----~~~GP~fi~Vki 172 (202) T PRK06163 115 YQITGGQPTLTSQTVDVVAIARGAGLENSHWA--ADEAHFEALVDQAL----SGPGPSFIAVRI 172 (202) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHCCCCCEEEE--CCHHHHHHHHHHHH----HCCCCEEEEEEE T ss_conf 00168745776788799999997799857997--79999999999998----389989999993 No 139 >pfam11300 DUF3102 Protein of unknown function (DUF3102). This family of proteins has no known function. Probab=53.52 E-value=15 Score=16.26 Aligned_cols=68 Identities=18% Similarity=0.257 Sum_probs=43.2 Q ss_pred HHHHHHHHHHHCCCHHHCCCCCCCCCCCCCCCCCCHH----HCCCCHHHCCCEEECCCCCCCCCCCHHHHHHHHHH Q ss_conf 9999999998817045401677777766787767876----66999788171572566578777789999999998 Q gi|254781051|r 100 QVMKMIDAYRSYGHFKANIDPLGYNSHQKDLSELSPA----HYGFVKADYDRKICMKGVLGLESATIPEIVDVLSH 171 (957) Q Consensus 100 rV~~LI~AYR~~GHl~A~lDPLgl~~~~~~~peL~~~----~yGLse~DLd~~f~~~~~~g~~~~tL~eIi~~L~~ 171 (957) -+.+|++.|+.||+..+..++=. .......+.|.+. -.|+.++|.++-+.- ...+.||.||+-+.+++ T Consensus 58 tA~~~M~v~~eyG~~~~~~~~~~-~~n~~~l~~L~ytqal~Ll~~PeeeR~efi~e---~dve~Ms~RELqqavkE 129 (130) T pfam11300 58 TAANLMRLYEEYGSKLASPSDDD-SSNSQPLGNLTYTQALILLGVPEEEREQFIAE---NDVESMSTRELQQAVKE 129 (130) T ss_pred HHHHHHHHHHHHCCCCCCCCCCC-CCCCCCCCCCCHHHHHHHHCCCHHHHHHHHHH---HCCCCCCHHHHHHHHHC T ss_conf 99999999999750125898654-32453243121999999927987899999997---24101139999999864 No 140 >pfam02256 Fe_hyd_SSU Iron hydrogenase small subunit. This family represents the small subunit of the Fe-only hydrogenases EC:1.18.99.1. The subunit is comprised of alternating random coil and alpha helical structures that encompasses the large subunit in a novel protein fold. Probab=53.05 E-value=12 Score=16.99 Aligned_cols=29 Identities=14% Similarity=0.423 Sum_probs=19.9 Q ss_pred HHHHCC---CCCCCCHHHHHHHHHHHHHCCCC Q ss_conf 998347---63678968999999998738050 Q gi|254781051|r 8 MFSLSS---FLDGTNCCYIEDLYKSYQEDPSS 36 (957) Q Consensus 8 ~f~~sS---FL~g~Na~YIEeLYekYL~DP~S 36 (957) .|..|+ .-....-..|.+||+.|+..|.| T Consensus 14 lY~~D~~~~~r~s~eNp~i~~lY~~~l~~p~s 45 (60) T pfam02256 14 LYKIDKSKPLRKSHENPAVKKLYEEFLGEPLS 45 (60) T ss_pred HHHHHCCCCCCCCCCCHHHHHHHHHHHCCCCH T ss_conf 99975167644676798999999999858573 No 141 >cd06230 M14_ASTE_ASPA_like The Peptidase M14 Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily belongs to the M14 family of metallocarboxypeptidases (MCPs), and includes ASTE, which catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) which cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=52.14 E-value=14 Score=16.57 Aligned_cols=87 Identities=14% Similarity=0.151 Sum_probs=51.3 Q ss_pred EEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCC-CHHHHHHCCCEEEECCCCHHHHHH Q ss_conf 8625753334672588764510677443657999841602012470102467784-001122034058742678899999 Q gi|254781051|r 374 IIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYA-SDISKSIGIPIFHVNGDDPEAVIR 452 (957) Q Consensus 374 liHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~-TDiAK~i~aPI~HVN~DdpEav~~ 452 (957) .+|||= ++|.=++..-+.--....-..|..+-+++=|--||-. .+||+ .|+-+.+.- ||+.-.+ T Consensus 6 gvHGnE-~~G~~~~~~l~~~~~~~~~~~g~~~~~~~~N~~a~~~------~~R~~d~DLNR~f~g--------~~~~~~~ 70 (252) T cd06230 6 GVHGNE-PCGVQAIRRLLAELDEGQALRGPVKLVPAANPLALEA------GQRYLDRDLNRIFPG--------DPDSGTY 70 (252) T ss_pred CCCCCC-HHHHHHHHHHHHCCCCCCCCCCCEEEEEECCHHHHHH------CCCCCCCCHHHCCCC--------CCCCCCH T ss_conf 626681-8999999999964833112588389999539999983------857899566650899--------9899998 Q ss_pred HHHHHHHHHHHHCC--CEEEEEEEC Q ss_conf 99999999997389--739986421 Q gi|254781051|r 453 VVRMAVSFRMKFHK--SVVIDIVCY 475 (957) Q Consensus 453 ~~~lA~~yR~~F~k--DVvIDlv~Y 475 (957) -.++|.+..+.+.. |++|||=+- T Consensus 71 e~r~A~~l~~~l~~~~d~~iDLHs~ 95 (252) T cd06230 71 EDRLAAELCPELEGLADAVLDLHST 95 (252) T ss_pred HHHHHHHHHHHHHHCCCEEEECCCC T ss_conf 9999999999875037899987889 No 142 >pfam00456 Transketolase_N Transketolase, thiamine diphosphate binding domain. This family includes transketolase enzymes EC:2.2.1.1. and also partially matches to 2-oxoisovalerate dehydrogenase beta subunit EC:1.2.4.4. Both these enzymes utilize thiamine pyrophosphate as a cofactor, suggesting there may be common aspects in their mechanism of catalysis. Probab=51.42 E-value=16 Score=16.02 Aligned_cols=127 Identities=20% Similarity=0.299 Sum_probs=81.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCC Q ss_conf 45532367777641666544555654454047888625753334672588764510677443657999841602012470 Q gi|254781051|r 340 PVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNP 419 (957) Q Consensus 340 PVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p 419 (957) +++.|+|.|......- .+.++ ...--++|.+|+ ||.-.. .|+++|.+.++. .+..+--|=|+=+|.|--..+- T Consensus 119 ~~avG~Ala~k~l~~~-~n~~~-~~~~d~~vy~l~--GDGel~-EG~~~EA~~~Ag--~~~L~nLi~i~D~N~iqidG~t 191 (333) T pfam00456 119 ANAVGMAIAEANLAAT-YNRPG-FDIVDHYTYVFL--GDGCLM-EGVSSEASSLAG--HLKLGNLIAFYDDNRISIDGET 191 (333) T ss_pred HHHHHHHHHHHHHHHH-HCCCC-CCCCCCEEEEEE--CCCCCC-CHHHHHHHHHHH--HCCCCCEEEEECCCCEECCCCC T ss_conf 6788799999998766-36676-665574499984--643125-398999999998--7178978999745754217984 Q ss_pred HHCCCCCCCCHHHHHHCC---CEEEE-CCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCC Q ss_conf 102467784001122034---05874-26788999999999999999738973998642112565 Q gi|254781051|r 420 SSARSCTYASDISKSIGI---PIFHV-NGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGH 480 (957) Q Consensus 420 ~d~RSs~Y~TDiAK~i~a---PI~HV-N~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GH 480 (957) .. ....|+++-+.+ -|+.| +|.|++++..|.+-| +..=.|-++|-.-.--=+|- T Consensus 192 ~~----~~~ed~~~rf~a~GW~v~~v~DGhD~~~I~~Ai~~A---k~~~~kPt~Ii~~TiiGkG~ 249 (333) T pfam00456 192 EI----SFTEDTAKRFEAYGWHVIEVEDGHDVEAIAAAIEEA---KAEKDKPTLIICRTVIGYGS 249 (333) T ss_pred CC----CCCCHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHH---HHCCCCCEEEEEEEEEECCC T ss_conf 53----555109999876490799946999999999999999---86589981698642151588 No 143 >pfam08423 Rad51 Rad51. Rad51 is a DNA repair and recombination protein and is a homologue of the bacterial ATPase RecA protein. Probab=51.12 E-value=16 Score=15.99 Aligned_cols=60 Identities=22% Similarity=0.327 Sum_probs=33.6 Q ss_pred HHHHHHCCCCCCEEECCCCCCCCCC-------CCCCCCHHHHHHHCC---------CCCEEEEECCCHHHHHHHHHH Q ss_conf 0257623336813543688888887-------656440578875214---------378269724889999999999 Q gi|254781051|r 716 AGEQKWLCTSNLVCLLPHGYEGQGP-------EHSSARLERFLQMCA---------ENNMYVANCTSPANYFHILRR 776 (957) Q Consensus 716 s~e~KW~~~sglvllLPHGyeGqGP-------EHSSaRlERfLQl~a---------~~N~~V~~~ttpAq~FH~LRR 776 (957) ||-+-+..+--+...||.-+.|.|- |+ +=+.||+.|++. -+|+.|..|.+..+.+++|.+ T Consensus 54 sGKTQlc~qlav~~qlp~~~gg~~g~vvyIDTEg-~f~~eRl~qia~~~~~~~~~~L~~I~v~r~~~~~~~~~~l~~ 129 (261) T pfam08423 54 TGKTQLCHTLCVTCQLPLEMGGGEGKALYIDTEG-TFRPERIVAIAERFGLDPEEVLDNIAYARAYNTEHQMQLLLQ 129 (261) T ss_pred CCHHHHHHHHHHHHHCHHHHCCCCCEEEEEECCC-CCCHHHHHHHHHHHCCCHHHHHHHCEECCCCCHHHHHHHHHH T ss_conf 8789999999999407096569997289993688-869899999999829997898753314168998999999999 No 144 >PRK07912 salicylate synthase MbtI; Reviewed Probab=50.98 E-value=7.8 Score=18.44 Aligned_cols=33 Identities=9% Similarity=0.068 Sum_probs=18.8 Q ss_pred CHHHHHHHHHHHHCCCC-------CCCEEEECCCHHHCCC Q ss_conf 89999999999743534-------8868985660221080 Q gi|254781051|r 766 SPANYFHILRRQIYDRS-------SRPLIMMAPKSLLRHK 798 (957) Q Consensus 766 tpAq~FH~LRRQ~~r~~-------rkPLiv~tPKsLLR~~ 798 (957) -|-+.|..||+.--.+| -.-+|=+||..|++.. T Consensus 218 d~~~~Y~~lR~~Npspy~f~l~~~~~~~iGaSPE~lv~v~ 257 (449) T PRK07912 218 DFPSTYRLGRRHNTPVRSFLLRLGGIRALGYSPELVTAVR 257 (449) T ss_pred CHHHHHHHHHCCCCCCCEEEEECCCEEEEEECHHHHEEEC T ss_conf 9899999973569995089997798389984708957980 No 145 >TIGR02730 carot_isom carotene isomerase; InterPro: IPR014101 Members of this family, including sll0033 (crtH) of Synechocystis sp. (strain PCC 6803), catalyse a cis-trans isomerization of carotenes to the all-trans lycopene, a reaction that can also occur non-enzymatically in light through photoisomerization.. Probab=50.83 E-value=7.4 Score=18.63 Aligned_cols=21 Identities=33% Similarity=0.570 Sum_probs=11.0 Q ss_pred EEEEEEECCCCCCCCCHHHHH Q ss_conf 478886257533346725887 Q gi|254781051|r 370 VLPLIIHGDAAFAGQGIVSET 390 (957) Q Consensus 370 vlpiliHGDAAfaGQGVV~Et 390 (957) |-.+-.=||..|.|||+++=. T Consensus 471 i~gLYCVGDS~FPGQGl~AVA 491 (506) T TIGR02730 471 IPGLYCVGDSCFPGQGLNAVA 491 (506) T ss_pred CCCCEEECCCCCCCCCHHHHH T ss_conf 576155046578875336766 No 146 >TIGR01820 TrpE-arch anthranilate synthase component I; InterPro: IPR010116 This model represents an archaeal clade of anthranilate synthase component I enzymes. This enzyme (4.1.3.27 from EC) is responsible for the first step of tryptophan biosynthesis from chorismate. The Sulfolobus enzyme has been reported to be part of a gene cluster for Trp biosynthesis . ; GO: 0004049 anthranilate synthase activity, 0000162 tryptophan biosynthetic process. Probab=50.63 E-value=9.7 Score=17.73 Aligned_cols=34 Identities=21% Similarity=0.259 Sum_probs=24.6 Q ss_pred CCEEECCCCCCCCCCCHHHHHHHHHHH----HCCCEEE Q ss_conf 715725665787777899999999984----0643027 Q gi|254781051|r 146 DRKICMKGVLGLESATIPEIVDVLSHL----YCSNIGV 179 (957) Q Consensus 146 d~~f~~~~~~g~~~~tL~eIi~~L~~t----YCgsIGv 179 (957) .-.|+.|++-|.++.---|||+.|+++ |-|.+|| T Consensus 411 ~A~FPAGTlSGAPK~RAMEII~~LE~~~RG~YgGgVGY 448 (508) T TIGR01820 411 KATFPAGTLSGAPKIRAMEIIDELEKEPRGVYGGGVGY 448 (508) T ss_pred EEECCCCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEC T ss_conf 21347765479853678888888607888732231321 No 147 >TIGR01989 COQ6 Ubiquinone biosynthesis mono0xygenase COQ6; InterPro: IPR000689 Ubiquinone (Q) functions as an electron carrier in the respiratory chain in mitochondria. Q biosynthesis involves a series of enzymatic steps, which are catalysed by the enzymes COQ1-COQ8 in Saccharomyces cerevisiae. COQ6, or ubiquinone biosynthesis monooxygenase, is a flavin-dependent enzyme localised to the matrix side of the inner mitochondrial membrane that is required for one or more steps in Q biosynthesis , .; GO: 0004497 monooxygenase activity. Probab=50.54 E-value=4.9 Score=19.98 Aligned_cols=50 Identities=14% Similarity=0.276 Sum_probs=27.1 Q ss_pred HHCCCCEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHC Q ss_conf 4033622225646589999999999987379107998067655268999885379899999855 Q gi|254781051|r 231 KYKGAKRFGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFK 294 (957) Q Consensus 231 Kf~g~KRFsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~ 294 (957) +=++--||==-|==++.|+=| .+..+|=-|.|+ .+.-|..=+++-|-++- T Consensus 247 n~~AWQRFLP~GPiAlLPl~D---------~~s~lVWSts~e-----~a~~L~~Lp~e~FV~~l 296 (481) T TIGR01989 247 NEVAWQRFLPTGPIALLPLSD---------NLSTLVWSTSPE-----EAKRLKSLPPEEFVDAL 296 (481) T ss_pred CCEEEECCCCCCCEEECCCCC---------CCCCCEEECCHH-----HHHHHCCCCCHHHHHHH T ss_conf 873541337876624457665---------336541216779-----99984258835789999 No 148 >KOG4748 consensus Probab=50.25 E-value=12 Score=17.11 Aligned_cols=156 Identities=15% Similarity=0.130 Sum_probs=77.3 Q ss_pred HHHHHHHHHHHHCCCEEE-EEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHH---HHHHHHCCCCCHHHHHHHHHH Q ss_conf 999999999973897399-8642112565643542235578888867189989999---999864489898999999999 Q gi|254781051|r 453 VVRMAVSFRMKFHKSVVI-DIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLY---ADSLMKNQVISKQELQSLANN 528 (957) Q Consensus 453 ~~~lA~~yR~~F~kDVvI-Dlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y---~~~Li~~g~it~~e~~~~~~~ 528 (957) +.+=-.+|-.+.+=.+++ ++-.++|+ -|.-++=---|..-+..++||..-=|+ .+.||-+-.++- .+. T Consensus 127 ~ikNridYA~rHgy~~~~~~~~~~~~~--~e~~~~W~KiP~Ir~tM~kyP~AeWIWWlD~DAlimn~~lsL------~~~ 198 (364) T KOG4748 127 SIKNRIDYARRHGYEFEYKNATLDKRY--HELPGVWAKLPAIRQTMLKYPDAEWIWWLDQDALIMNPDLSL------QDH 198 (364) T ss_pred HHHHHHHHHHHHCCEEEEEECCCCCCC--CCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCHHHCCCCCH------HHH T ss_conf 887089999984975999744123222--456644577499999998789971788851641320733022------677 Q ss_pred HHH--HHHHHHHH--HHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHH------H-HHHHHHHH Q ss_conf 999--97756665--4125876566665686266677899999999887436899862056778------8-89989987 Q gi|254781051|r 529 WHK--YLEAEYKE--SESYCPEKLGLLHNGENERKNSVSKEILKKIGSSIIRLPKSFNTHKIVE------R-LMANRQKM 597 (957) Q Consensus 529 ~~~--~l~~~~~~--~k~~~~~~~~~~~~~~~~~~T~v~~~~L~~i~~~l~~~Pe~f~~h~~v~------k-~~~~R~~~ 597 (957) ..+ .|+...-. .+.+.|-. ...... -....+.+..+.--+..=+.|++.-.-+- | +++.--+. T Consensus 199 ilk~~~L~~~l~~nd~~~~~~~n-----~~~~~~-~~~~~d~~~~~~~ii~qD~nG~naGSfLirns~~~~~llD~w~dp 272 (364) T KOG4748 199 ILKPENLVTHLLRNDQKSINPLN-----IFRLRP-RTPSLDDLEDIAFIIPQDCNGINAGSFLIRNSEWGRLLLDAWNDP 272 (364) T ss_pred HCCHHHHHHHHCCCCCCCCCCCC-----CCCCCC-CCCCCCCHHHHCEECCCCCCCCCCCCEEEECCCCCHHHHHHCCCH T ss_conf 53878788763346666326678-----622266-664312666624643557787343434885373201677742577 Q ss_pred HCCCCCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 334356421147898889886358315 Q gi|254781051|r 598 IETGKGIDWSMAESLAFGSLCYEGYKV 624 (957) Q Consensus 598 ~~~~~~iDW~~AE~LA~~SLl~eG~~V 624 (957) +--+ ..|+.+|.=|.+-++.-..-| T Consensus 273 ~l~~--~~~~~~Eq~al~~~~e~h~~l 297 (364) T KOG4748 273 LLYE--LLWGQKEQDALGHFLENHPQL 297 (364) T ss_pred HHHH--HCCCHHHHHHHHHHHHHCHHH T ss_conf 8876--130267899998988606425 No 149 >TIGR02461 osmo_MPG_phos mannosyl-3-phosphoglycerate phosphatase; InterPro: IPR012813 Members of this family are mannosyl-3-phosphoglycerate phosphatase (3.1.3.70 from EC). It acts sequentially after mannosyl-3-phosphoglycerate synthase (2.4.1.217 from EC) in a two-step pathway of biosynthesis of the compatible solute mannosylglycerate, a typical osmolyte of thermophiles.; GO: 0050531 mannosyl-3-phosphoglycerate phosphatase activity, 0051479 mannosylglycerate biosynthetic process, 0005737 cytoplasm. Probab=48.76 E-value=18 Score=15.73 Aligned_cols=87 Identities=21% Similarity=0.156 Sum_probs=34.8 Q ss_pred CCCCCCCHHHHHH--------HHHH-HHCCCEEEEEECCCCHHHHHHH-------------HHHHCCCCCCCC-CCHHHH Q ss_conf 7877778999999--------9998-4064302777248998999999-------------998414577888-798899 Q gi|254781051|r 155 LGLESATIPEIVD--------VLSH-LYCSNIGVEFMHIVDSIERDWV-------------RNTIEDYDFSSN-FSKEEQ 211 (957) Q Consensus 155 ~g~~~~tL~eIi~--------~L~~-tYCgsIGvEymHI~d~eer~Wl-------------q~riE~~~~~~~-~s~eeK 211 (957) +|.....+|.+++ .+++ --||++-+-+-|=.+.|+-.=| |+.-|..-.... -+-+.- T Consensus 95 LG~~~~~IR~~L~~alaeacte~E~se~~Gn~~~~~yg~~t~Eev~~lTgLp~e~a~lA~~REYsETi~~~~~Padw~~~ 174 (248) T TIGR02461 95 LGKPVAEIRAALKSALAEACTEIEASEEAGNLSIKGYGDMTAEEVARLTGLPREAARLAKRREYSETIFLESRPADWERL 174 (248) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHCCCCCHHHHHHHCCCCHHHHHHHHHCCCCEEEEECCCCCCHHHH T ss_conf 36767899999899999999998744431450111227888889998628998899998402752489873888656678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEECCC Q ss_conf 999999999998999998640336222256 Q gi|254781051|r 212 RDILDKLVRAEGFEKFIDIKYKGAKRFGAD 241 (957) Q Consensus 212 ~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsLE 241 (957) -.||..-+++.+|+-+...+|-.+---+-+ T Consensus 175 ~~~L~~~l~~~GL~~~~GgRFy~v~g~~~D 204 (248) T TIGR02461 175 EAILVTALRARGLEYTHGGRFYTVIGAGSD 204 (248) T ss_pred HHHHHHHHHHCCCEEEECCCEEEEEECCCC T ss_conf 899999997479369755800045523788 No 150 >TIGR02876 spore_yqfD sporulation protein YqfD; InterPro: IPR010690 This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis (P54469 from SWISSPROT) is known to be essential for efficient sporulation although its exact function is unknown .. Probab=47.46 E-value=9.9 Score=17.65 Aligned_cols=22 Identities=32% Similarity=0.650 Sum_probs=17.6 Q ss_pred CCCCHHHHHHHCCCCCEEEEEC Q ss_conf 6440578875214378269724 Q gi|254781051|r 743 SSARLERFLQMCAENNMYVANC 764 (957) Q Consensus 743 SSaRlERfLQl~a~~N~~V~~~ 764 (957) ++.-+||||++|..+-+.|-+. T Consensus 14 ~G~~iErFlN~~~~~gi~vwdv 35 (406) T TIGR02876 14 EGEDIERFLNLCWRNGILVWDV 35 (406) T ss_pred EECCCHHHHHHHHHCCEEEEEE T ss_conf 8167027888988787579987 No 151 >KOG0182 consensus Probab=46.80 E-value=14 Score=16.60 Aligned_cols=89 Identities=24% Similarity=0.257 Sum_probs=63.5 Q ss_pred HHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCE Q ss_conf 87645106774436579998416020124701024677840011220340587426788999999999999999738973 Q gi|254781051|r 389 ETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSV 468 (957) Q Consensus 389 Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDV 468 (957) +..|-.+|+.-.+-|-=-+||--|=--+-.-.|+++-.+---|.|-+++-|--.-||.--.|-+|---|.+||.+|+-|+ T Consensus 30 kAin~~gltsVavrgkDcavvvsqKkvpDKLld~~tvt~~f~itk~ig~v~tG~~aDar~~v~rar~eAa~~~yk~Gyem 109 (246) T KOG0182 30 KAINQAGLTSVAVRGKDCAVVVTQKKVPDKLLDSSTVTHLFRITKKIGCVITGMIADARSQVQRARYEAAEFRYKYGYEM 109 (246) T ss_pred HHHHCCCCCEEEECCCCEEEEEECCCCCCCCCCCCCCEEEEEEECCCEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 77522797158873786699997354871003644222789840541179952775578999999989875256428878 Q ss_pred EEEEEECCC Q ss_conf 998642112 Q gi|254781051|r 469 VIDIVCYRR 477 (957) Q Consensus 469 vIDlv~YRr 477 (957) -.|++|-|- T Consensus 110 p~DiL~k~~ 118 (246) T KOG0182 110 PCDILAKRM 118 (246) T ss_pred CHHHHHHHH T ss_conf 788999988 No 152 >COG3737 Uncharacterized conserved protein [Function unknown] Probab=46.56 E-value=6.9 Score=18.86 Aligned_cols=49 Identities=22% Similarity=0.339 Sum_probs=30.6 Q ss_pred CCCCCCEEEECCCHHHCCCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCH Q ss_conf 5348868985660221080126887893799860351577643344454445795457689986885 Q gi|254781051|r 780 DRSSRPLIMMAPKSLLRHKRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGK 846 (957) Q Consensus 780 r~~rkPLiv~tPKsLLR~~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGK 846 (957) +.+|++|++|-.- =+.-.+++++||++- .|.+|+.. ...|+-||+-||. T Consensus 31 ~sh~~SlL~lpdg---v~~W~v~~~~~Lt~e-~f~~vl~~--------------a~~~EilliGTG~ 79 (127) T COG3737 31 MSHRGSLLVLPDG---VCDWEVATLSDLTPE-DFERVLAE--------------APDVEILLIGTGA 79 (127) T ss_pred CCCCCCEEEECCC---CCCCCCCCHHHCCHH-HHHHHHHC--------------CCCCEEEEEECCC T ss_conf 0515657881696---544665675667999-99998731--------------7885699982585 No 153 >COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones] Probab=46.18 E-value=12 Score=17.15 Aligned_cols=47 Identities=15% Similarity=0.133 Sum_probs=26.0 Q ss_pred HHCCCCCCCCHHHHHHCCCEEEECC-CCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCC Q ss_conf 1024677840011220340587426-78899999999999999973897399864211256 Q gi|254781051|r 420 SSARSCTYASDISKSIGIPIFHVNG-DDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFG 479 (957) Q Consensus 420 ~d~RSs~Y~TDiAK~i~aPI~HVN~-DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~G 479 (957) .|+--+++.--+.|.+. .++ ||-|++ .+-|+. +-||.||+-+|-+-+ T Consensus 302 ~dal~~rI~a~~~~~~~-----~~~~dd~e~a-------~~I~~d-~IdILvDl~g~T~d~ 349 (620) T COG3914 302 TDALQERISAAVEKWYP-----IGRMDDAEIA-------NAIRTD-GIDILVDLDGHTVDT 349 (620) T ss_pred CHHHHHHHHHHHHHEEC-----CCCCCHHHHH-------HHHHHC-CCEEEEECCCCEECC T ss_conf 31678888876531410-----5886889999-------998725-870999656721034 No 154 >KOG4450 consensus Probab=46.04 E-value=5.7 Score=19.50 Aligned_cols=21 Identities=38% Similarity=0.640 Sum_probs=16.0 Q ss_pred CCEEEEEEEECCCCCCCCCHH Q ss_conf 540478886257533346725 Q gi|254781051|r 367 RSRVLPLIIHGDAAFAGQGIV 387 (957) Q Consensus 367 ~~~vlpiliHGDAAfaGQGVV 387 (957) -+.+|-=+|||||+|.|+|-- T Consensus 21 Nh~~L~d~~~g~A~F~gr~~T 41 (168) T KOG4450 21 NHFLLEDLINGDAVFIGRGTT 41 (168) T ss_pred CCHHHHHCCCCCEEEEEECEE T ss_conf 743665236885589971101 No 155 >pfam01074 Glyco_hydro_38 Glycosyl hydrolases family 38 N-terminal domain. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Probab=45.91 E-value=20 Score=15.42 Aligned_cols=69 Identities=20% Similarity=0.289 Sum_probs=43.2 Q ss_pred HHHHHHHHHHHHHCCCCCCEEECCCCCC--CCCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHC Q ss_conf 2565542025762333681354368888--8887656440578875214378269724889999999999743 Q gi|254781051|r 709 ILDQFITAGEQKWLCTSNLVCLLPHGYE--GQGPEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIY 779 (957) Q Consensus 709 iiDqfiss~e~KW~~~sglvllLPHGye--GqGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~ 779 (957) -++.++..-...+.....-.+|+|-|++ |.|| +-.=+|+.-.+-.....--+..|||+.||..|+.+.. T Consensus 188 ~~~~l~~~~~~~~~~~~~~~~l~~~G~gd~~~~~--~~~lie~i~~~~~~~~~~~i~~st~~~yf~~l~~~~~ 258 (269) T pfam01074 188 RADDLLDQAKKYADKTRTNHVLWPFGDGDGGGGP--TDELLEYINRWNELPGLPKVQFGTPSDYFDALEEATW 258 (269) T ss_pred CHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCC--HHHHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHHCC T ss_conf 7999999999999727799589961049999881--7999999997422689847997999999999986359 No 156 >KOG0630 consensus Probab=45.14 E-value=11 Score=17.35 Aligned_cols=22 Identities=14% Similarity=-0.007 Sum_probs=11.2 Q ss_pred CCCHHHHHHHHHCCCHHHHHHH Q ss_conf 6552689998853798999998 Q gi|254781051|r 271 HRGRLNVLSQIMNKLPRSIFYE 292 (957) Q Consensus 271 HRGRLNVL~Nvl~Kp~~~iF~E 292 (957) |--||.-++.-+|-+.+.|.-. T Consensus 241 h~~l~hYaarefGIa~~aI~~~ 262 (838) T KOG0630 241 HMDLLHYAAREFGIADEAIKPI 262 (838) T ss_pred CHHHHHHHHHHHCCCHHHEECC T ss_conf 4788999998729841211202 No 157 >COG3394 Uncharacterized protein conserved in bacteria [Function unknown] Probab=44.34 E-value=6.2 Score=19.21 Aligned_cols=13 Identities=15% Similarity=0.003 Sum_probs=5.8 Q ss_pred HHHHHHHHHHHCC Q ss_conf 8999999998414 Q gi|254781051|r 187 SIERDWVRNTIED 199 (957) Q Consensus 187 ~eer~Wlq~riE~ 199 (957) |...-|+..-.+. T Consensus 134 P~I~p~v~rlA~e 146 (257) T COG3394 134 PQIAPIVARLAEE 146 (257) T ss_pred CHHHHHHHHHHHC T ss_conf 1478999999751 No 158 >TIGR02134 transald_staph transaldolase; InterPro: IPR011861 This small family of proteins belong to the transaldolases. Coxiella and Staphylococcus lack members of the known transaldolase families and appear to require a transaldolase activity for completion of the pentose phosphate pathway.. Probab=43.42 E-value=7.6 Score=18.52 Aligned_cols=115 Identities=22% Similarity=0.277 Sum_probs=71.7 Q ss_pred CHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCC---CCCCCC----HHHHHHCCCE-EEECCCCHHHHHHHHHH Q ss_conf 725887645106774436579998416020124701024---677840----0112203405-87426788999999999 Q gi|254781051|r 385 GIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSAR---SCTYAS----DISKSIGIPI-FHVNGDDPEAVIRVVRM 456 (957) Q Consensus 385 GVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~R---Ss~Y~T----DiAK~i~aPI-~HVN~DdpEav~~~~~l 456 (957) |-+-|+|--.--.+|= =||||||.--| =+-|-. -+|..-++|| |-|=|||-|.-=. T Consensus 12 GA~lEeM~k~~~~~~v------------~GFTTNPsLM~kAGi~dY~aFA~ealaqItd~piSFEVFADdL~~MEk---- 75 (237) T TIGR02134 12 GADLEEMVKALKNKLV------------KGFTTNPSLMKKAGISDYKAFAKEALAQITDLPISFEVFADDLEEMEK---- 75 (237) T ss_pred CCCHHHHHHHHCCCCE------------EEEECCHHHHHHCCCCCHHHHHHHHHHHCCCCCEEEEEECCCHHHHHH---- T ss_conf 4118999998628930------------000178666744486645889999997347996245554031656899---- Q ss_pred HHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999973897399864211256564354223557888886718998999999986448989899999999999 Q gi|254781051|r 457 AVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWH 530 (957) Q Consensus 457 A~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~ 530 (957) -++|=..|+.-|+|-+=-----| .||-|+.-+.=++-= +|-=--+.|-++++++.+.+. T Consensus 76 eA~~ias~GnNV~vKIPvtntkG-------esT~PlIqkLSadgi--------~LNvTA~~TieQv~~v~~~~t 134 (237) T TIGR02134 76 EAEKIASYGNNVYVKIPVTNTKG-------ESTIPLIQKLSADGI--------KLNVTAVYTIEQVKKVVEALT 134 (237) T ss_pred HHHHHHHHCCCEEEEEEEECCCC-------CCCCHHHHHHHHCCC--------EEEEEEECCHHHHHHHHHHHH T ss_conf 99999870793278841241889-------515307866402486--------675435402588999999874 No 159 >KOG2243 consensus Probab=43.22 E-value=12 Score=17.03 Aligned_cols=18 Identities=28% Similarity=0.412 Sum_probs=11.4 Q ss_pred HHHHHHHCCCEEEEECCC Q ss_conf 889886358315785010 Q gi|254781051|r 613 AFGSLCYEGYKVRLSGQD 630 (957) Q Consensus 613 A~~SLl~eG~~VRlsGQD 630 (957) |.-|++--||++--+.|| T Consensus 1059 avrt~~gygynie~~dqd 1076 (5019) T KOG2243 1059 AVRTFLGYGYNIEAPDQD 1076 (5019) T ss_pred HHHHHHCCCCCCCCCCHH T ss_conf 999984466667898626 No 160 >pfam03492 Methyltransf_7 SAM dependent carboxyl methyltransferase. This family of plant methyltransferases contains enzymes that act on a variety of substrates including salicylic acid, jasmonic acid and 7-Methylxanthine. Caffeine is synthesized through sequential three-step methylation of xanthine derivatives at positions 7-N, 3-N, and 1-N. The protein 7-methylxanthine methyltransferase (designated as CaMXMT) catalyses the second step to produce theobromine. Probab=42.91 E-value=22 Score=15.10 Aligned_cols=67 Identities=25% Similarity=0.443 Sum_probs=39.1 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHHHCCC-----------------EEEEEEECCCCCCCCCCCCCCC-CHHHHHHHHHCC Q ss_conf 8742678899999999999999973897-----------------3998642112565643542235-578888867189 Q gi|254781051|r 440 FHVNGDDPEAVIRVVRMAVSFRMKFHKS-----------------VVIDIVCYRRFGHNEGDEPSFT-QPMMYKTIRSHK 501 (957) Q Consensus 440 ~HVN~DdpEav~~~~~lA~~yR~~F~kD-----------------VvIDlv~YRr~GHNE~DeP~fT-QP~mY~~I~~hp 501 (957) +|+.+-.|++|+. .|+.+|++| .|+-+.| | ++.| |.-+ .-.||+. T Consensus 137 I~i~~~s~~~V~~------AY~~Qf~~D~~~FL~~Ra~Elv~GG~mvl~~~G-r----~~~d-~~~~~~~~~~~l----- 199 (331) T pfam03492 137 IYISGASPEEVYK------AYLDQFKKDFSLFLRARAEELVSGGLMVLTFLG-R----PSVD-PSETECGIFWDL----- 199 (331) T ss_pred EEECCCCCHHHHH------HHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECC-C----CCCC-CCCCCCCHHHHH----- T ss_conf 6971699889999------999999999999999889985468659999605-6----8998-665664439999----- Q ss_pred CHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 989999999864489898999999 Q gi|254781051|r 502 SVLQLYADSLMKNQVISKQELQSL 525 (957) Q Consensus 502 ~v~~~Y~~~Li~~g~it~~e~~~~ 525 (957) ...=-..|+++|+|++++++.+ T Consensus 200 --l~~aL~dlv~eGlI~eekldsF 221 (331) T pfam03492 200 --LGDALNDLVSEGLIEEEKLDSF 221 (331) T ss_pred --HHHHHHHHHHCCCCCHHHHHHC T ss_conf --9999999998599679998324 No 161 >TIGR00838 argH argininosuccinate lyase; InterPro: IPR009049 This entry represents argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens. ; GO: 0004056 argininosuccinate lyase activity, 0042450 arginine biosynthetic process via ornithine. Probab=42.76 E-value=22 Score=15.08 Aligned_cols=38 Identities=13% Similarity=0.096 Sum_probs=31.9 Q ss_pred CHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 98999999986448989899999999999997756665 Q gi|254781051|r 502 SVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKE 539 (957) Q Consensus 502 ~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~ 539 (957) ..-.-|+.-|.+.|+||+||..++.+.-.+..++..+. T Consensus 33 ~~~~AH~~~L~k~GiLt~Ee~~~l~~gL~~l~~e~~~~ 70 (469) T TIGR00838 33 EGSIAHTKMLKKAGILTEEEAAKLIEGLNELKEEVREG 70 (469) T ss_pred CCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCC T ss_conf 33599999987527988889999999999999988608 No 162 >TIGR02743 TraW type-F conjugative transfer system protein TraW; InterPro: IPR014114 This entry represents TraW, an essential component of the F-type conjugative transfer system for plasmid DNA transfer that has been shown to be localized to the periplasm , .. Probab=42.04 E-value=22 Score=15.01 Aligned_cols=33 Identities=6% Similarity=0.209 Sum_probs=18.9 Q ss_pred CCHHHHHHHHHHHCCCC--CCCCCCHHHHHHHHHH Q ss_conf 99899999999841457--7888798899999999 Q gi|254781051|r 185 VDSIERDWVRNTIEDYD--FSSNFSKEEQRDILDK 217 (957) Q Consensus 185 ~d~eer~Wlq~riE~~~--~~~~~s~eeK~~iL~~ 217 (957) .|+++..|-+...+... .+.-++.-.=+.+-++ T Consensus 143 dD~~Q~~wA~~~~~~~~G~~K~ILt~Gs~~el~~~ 177 (217) T TIGR02743 143 DDPKQLAWAQQLLAQLPGEVKWILTGGSVKELEKR 177 (217) T ss_pred CCHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHH T ss_conf 88899999998732038983088717897999985 No 163 >TIGR02237 recomb_radB DNA repair and recombination protein RadB; InterPro: IPR011939 This family consists exclusively of archaeal RadB protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO) and DMC1 (IPR011940 from INTERPRO), and archaeal RadA (IPR011938 from INTERPRO) ,, .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=41.55 E-value=16 Score=16.01 Aligned_cols=20 Identities=30% Similarity=0.411 Sum_probs=15.4 Q ss_pred HHHHHHHCCCC---------CEEEEECCC Q ss_conf 57887521437---------826972488 Q gi|254781051|r 747 LERFLQMCAEN---------NMYVANCTS 766 (957) Q Consensus 747 lERfLQl~a~~---------N~~V~~~tt 766 (957) +|||+|+|.++ ||.|-+|.. T Consensus 54 ~ER~~q~~~~~~~D~e~~~~~~iv~~~~~ 82 (223) T TIGR02237 54 PERFKQIAEDRALDPERVLSNVIVFEVFD 82 (223) T ss_pred HHHHHHHHHCCCCCHHHHHCCEEEECCCC T ss_conf 89999986305889888841535523535 No 164 >PRK10696 C32 tRNA thiolase; Provisional Probab=41.46 E-value=11 Score=17.35 Aligned_cols=121 Identities=16% Similarity=0.300 Sum_probs=59.8 Q ss_pred HHCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHH-------------HHHHHHHHHCC---- Q ss_conf 4064302777248998999999998414577888798899999999999998-------------99999864033---- Q gi|254781051|r 172 LYCSNIGVEFMHIVDSIERDWVRNTIEDYDFSSNFSKEEQRDILDKLVRAEG-------------FEKFIDIKYKG---- 234 (957) Q Consensus 172 tYCgsIGvEymHI~d~eer~Wlq~riE~~~~~~~~s~eeK~~iL~~L~~Ae~-------------FE~FL~~Kf~g---- 234 (957) -||.++|++|-.+ ..+.-.+..+.+....+.-.+...-|+.+|.+..+..| .|.||=.=|-| T Consensus 91 ~yl~~lGvp~~i~-~~d~~~iv~~~~~egks~CslCsRlRRg~Ly~~A~e~G~nKIALGHH~DDi~ETfLMNlf~gG~Lk 169 (311) T PRK10696 91 EYLEKLGVEYKIV-EENTYGIVKEKIPEGKTTCSLCSRLRRGILYRTATELGATKIALGHHRDDILQTLFLNMFYGGKMK 169 (311) T ss_pred HHHHHCCCCCEEE-EEEHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHHHHCCCCC T ss_conf 9999749970799-840767998754368863789999999999999998699879860650558999999999668763 Q ss_pred -C--CEECCCCHHHHHHHH-----HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCC Q ss_conf -6--222256465899999-----999999873791079980676552689998853798999998558874 Q gi|254781051|r 235 -A--KRFGADGSEVIIPAI-----EEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGP 298 (957) Q Consensus 235 -~--KRFsLEG~EslIP~L-----~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~ 298 (957) . |-.+=+|.-.+|==| ..|++.|...+.-=|---. =| --.|+-++..+.|..+.+-..| T Consensus 170 tM~Pkl~~d~g~~~VIRPL~y~~E~di~~~a~~~~fPiIpcnl--cg---sq~~lqR~~ik~ml~~~~~~~p 236 (311) T PRK10696 170 GMPPKLMSDDGKHIVIRPLAYCREKDIERFADAKAFPIIPCNL--CG---SQPNLQRQVIADMLRDWDKRYP 236 (311) T ss_pred CCCCEEECCCCCEEEEEECCCCCHHHHHHHHHHCCCCEECCCC--CC---CCHHHHHHHHHHHHHHHHHHCC T ss_conf 6797667489975998304201599999999975998741667--78---8367899999999999999888 No 165 >PTZ00124 adenosine deaminase; Provisional Probab=41.06 E-value=23 Score=14.90 Aligned_cols=90 Identities=14% Similarity=0.173 Sum_probs=50.9 Q ss_pred EEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEEC---------CCEECCCCCHHCCCCCCCCHH-----HHHH Q ss_conf 47888625753334672588764510677443657999841---------602012470102467784001-----1220 Q gi|254781051|r 370 VLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIIN---------NQIGFTTNPSSARSCTYASDI-----SKSI 435 (957) Q Consensus 370 vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvN---------NQIGFTT~p~d~RSs~Y~TDi-----AK~i 435 (957) -+++.+|-.=+-.-++|....-.+-.|..=++|=-++++=+ ++|++|.-|...--+..+.++ .+++ T Consensus 219 Gl~~T~HAGE~~~p~~i~~v~~ai~~l~a~RIGHGv~~~~Dp~L~~~l~e~~I~LEvCPtSNv~t~~v~~~~~HPi~~l~ 298 (362) T PTZ00124 219 GVNLSVHAGEDVTPPNLNELYAAILDLGVKRIGHGIRVAESQELIDRVKEKDILLEVCPISNVLLNNSKSMDTHPIRKLY 298 (362) T ss_pred CCCEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCEECCCCHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCHHHHHH T ss_conf 99543556755785459999999998097043575340779899999986595489867655334566983217199999 Q ss_pred CCCE-EEECCCCHHHHHHHHHHHHHHH Q ss_conf 3405-8742678899999999999999 Q gi|254781051|r 436 GIPI-FHVNGDDPEAVIRVVRMAVSFR 461 (957) Q Consensus 436 ~aPI-~HVN~DdpEav~~~~~lA~~yR 461 (957) ++=| +.+|-|||- ++-+.|.-||. T Consensus 299 ~~G~~vtiNTDDp~--vf~t~L~~Ey~ 323 (362) T PTZ00124 299 DAGVKVSVNSDDPG--MFLTNIHDEYE 323 (362) T ss_pred HCCCEEEEECCCCC--CCCCCHHHHHH T ss_conf 78996999489930--20898899999 No 166 >TIGR02470 sucr_synth sucrose synthase; InterPro: IPR012820 This entry represents sucrose synthase an enzyme that despite its name, generally uses rather produces sucrose. Sucrose plus UDP (or ADP) becomes D-fructose plus UDP-glucose (or ADP-glucose), which is then available for cell wall (or starch) biosynthesis. The enzyme is homologous to sucrose phosphate synthase, which catalyses the penultimate step in sucrose synthesis. Sucrose synthase is found, so far, exclusively in plants and cyanobacteria.; GO: 0016157 sucrose synthase activity. Probab=40.24 E-value=18 Score=15.70 Aligned_cols=140 Identities=20% Similarity=0.320 Sum_probs=78.2 Q ss_pred CCCCCCCEEEEEEECCCEEEEEECC----------------CCCCCHHCCHHH-HHHHHHHHHHHCCCCCCC-------- Q ss_conf 5100150268898359618999717----------------877310004455-323677776416665445-------- Q gi|254781051|r 306 DVKYHLGLCCNRQICGKDVKLLLQS----------------NPSHLEFVDPVV-IGSVRARQDLKAGIVGEK-------- 360 (957) Q Consensus 306 DVKYHLG~s~~~~~~g~~v~lsL~~----------------NPSHLEaVnPVv-~G~~RAkQd~~~d~~~~~-------- 360 (957) |=.=|.|+=.| .+|++=-|||. ||.==|-+|=|| .|+.++++-. |..=.. T Consensus 543 ~~~~~~G~L~d---~~KPiiFsMARLDrvKNlTGLvE~yg~s~~Lr~l~NLv~VaG~~~~~~S~--DrEE~~Ei~kmh~l 617 (790) T TIGR02470 543 DNDEHIGYLKD---PNKPIIFSMARLDRVKNLTGLVELYGRSKKLRELANLVVVAGKLDAKESK--DREEQAEIEKMHNL 617 (790) T ss_pred CCCCEEEEECC---CCCCCEEEECCCCCCCCCHHHHHHHCCCHHHHHHCCEEEEECCCCCCCCC--CHHHHHHHHHHHHH T ss_conf 87300787207---68782663063001013477884850484478166888880656864467--87899999999998 Q ss_pred --CCCCCCCCEEEE-------------EEEECCCCCCCCCHHHHHHHH-------HCCCCCCC--CCEEEEEECCCEECC Q ss_conf --556544540478-------------886257533346725887645-------10677443--657999841602012 Q gi|254781051|r 361 --SISLVERSRVLP-------------LIIHGDAAFAGQGIVSETFGL-------SGLSGYTV--AGNIHLIINNQIGFT 416 (957) Q Consensus 361 --~~~~~~~~~vlp-------------iliHGDAAfaGQGVV~Etl~l-------s~l~gY~t--GGTiHiVvNNQIGFT 416 (957) .-..+..=+=+. +.-=-=.+|. |==-||.+.| |.||-|.| ||..=||+|+.=||- T Consensus 618 id~Y~L~G~iRWi~~q~~~~~~GElYR~IAD~~GiFV-QPA~yEAFGLTv~EAM~cGLPTFAT~~GGP~EII~dgvsGFh 696 (790) T TIGR02470 618 IDQYQLHGQIRWIGAQLNRVRNGELYRYIADTKGIFV-QPALYEAFGLTVVEAMTCGLPTFATRFGGPLEIIQDGVSGFH 696 (790) T ss_pred HHHHCCCCCEEEEEECCCCCCCCEEEEEEECCCCCEE-CCCHHHHHHHHHHHHHHCCCHHHHHHCCCCEEEEEECCCCCE T ss_conf 8864669716770000685314646764425775033-330444532579998630112467626896368772883542 Q ss_pred CCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHH Q ss_conf 470102467784001122034058742678899999999999 Q gi|254781051|r 417 TNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAV 458 (957) Q Consensus 417 T~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~ 458 (957) -||.+++.+ +.-|+..++. +-+||..=.++.+-|+ T Consensus 697 IdP~~~~~~--A~~l~~Ff~k-----C~~dP~~W~~iS~~gl 731 (790) T TIGR02470 697 IDPYHGEEA--AEKLVDFFEK-----CDEDPSYWQKISQGGL 731 (790) T ss_pred ECCCCHHHH--HHHHHHHHHH-----HCCCHHHHHHHHHHHH T ss_conf 267558999--9999999986-----2038368999999886 No 167 >pfam05489 Phage_tail_X Phage Tail Protein X. This domain is found in a family of phage tail proteins. Visual analysis suggests that it is related to pfam01476 (personal obs: C Yeats). The functional annotation of family members further confirms this hypothesis. Probab=40.12 E-value=2.9 Score=21.66 Aligned_cols=14 Identities=29% Similarity=0.880 Sum_probs=8.3 Q ss_pred EEEEEECCCCCCCC Q ss_conf 99864211256564 Q gi|254781051|r 469 VIDIVCYRRFGHNE 482 (957) Q Consensus 469 vIDlv~YRr~GHNE 482 (957) .+|.+|||.||+.. T Consensus 10 tvD~ic~r~yGrt~ 23 (60) T pfam05489 10 TLDALCWRYYGRTL 23 (60) T ss_pred HHHHHHHHHHCCCC T ss_conf 59999999977832 No 168 >COG0299 PurN Folate-dependent phosphoribosylglycinamide formyltransferase PurN [Nucleotide transport and metabolism] Probab=39.30 E-value=24 Score=14.71 Aligned_cols=109 Identities=17% Similarity=0.225 Sum_probs=79.7 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHH-HHHHHHHHHHH Q ss_conf 33467258876451067744365799984160201247010246778400112203405874267889-99999999999 Q gi|254781051|r 381 FAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPE-AVIRVVRMAVS 459 (957) Q Consensus 381 faGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpE-av~~~~~lA~~ 459 (957) .||.|-.++.+-=+.-.++-.+-...+|.| |...|+-+.|+-.+.|.++++-.+-+ -..+-..|+.. T Consensus 7 aSG~GSNlqaiida~~~~~~~a~i~~Visd------------~~~A~~lerA~~~gIpt~~~~~k~~~~r~~~d~~l~~~ 74 (200) T COG0299 7 ASGNGSNLQAIIDAIKGGKLDAEIVAVISD------------KADAYALERAAKAGIPTVVLDRKEFPSREAFDRALVEA 74 (200) T ss_pred EECCCCCHHHHHHHHHCCCCCCEEEEEEEC------------CCCCHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHH T ss_conf 947965599999987648988179999957------------97778999999849988974445678778999999999 Q ss_pred HHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHH Q ss_conf 9997389739986421125656435422355788888671899899999 Q gi|254781051|r 460 FRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYA 508 (957) Q Consensus 460 yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~ 508 (957) .+ +++-|+|+ |-||=|- =.|.|++..-.+.|.-||+.+-.|. T Consensus 75 l~-~~~~dlvv-LAGyMrI-----L~~~fl~~~~grIlNIHPSLLP~f~ 116 (200) T COG0299 75 LD-EYGPDLVV-LAGYMRI-----LGPEFLSRFEGRILNIHPSLLPAFP 116 (200) T ss_pred HH-HCCCCEEE-ECCHHHH-----CCHHHHHHHHCCEEECCCCCCCCCC T ss_conf 98-52999999-8664777-----5999999732651741753135899 No 169 >pfam02780 Transketolase_C Transketolase, C-terminal domain. The C-terminal domain of transketolase has been proposed as a regulatory molecule binding site. Probab=39.09 E-value=24 Score=14.69 Aligned_cols=96 Identities=14% Similarity=0.119 Sum_probs=68.3 Q ss_pred CEEEEECCHHHHHHHHHHHHCCC--CCEEEEEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHH Q ss_conf 68998688578999999997699--8689998401486828999999853799839980678531784578997699998 Q gi|254781051|r 838 RRVILCTGKVYYDLLDNRDMRNI--ADIYLLRIEQLYPFPEDYLIKVLSRFVQAEIVWCQEEPQNMGAWTFIEPYLEKVL 915 (957) Q Consensus 838 ~rvi~csGKvyydL~~~r~~~~~--~~vaivRiEqLyPfP~~~l~~~l~~y~~~e~vW~QEEp~N~GaW~~v~~rl~~~l 915 (957) .=.|+|+|-..+.-+++.+.... -++.+|-+-.+.||+.+.|.+.++++.+ ++=+.|-..+.|-=+.+...+.+-. T Consensus 11 di~ii~~G~~~~~al~aa~~L~~~gi~~~vid~~~ikPlD~~~i~~~~~~~~~--vvtvEe~~~~gG~Gs~i~~~l~e~~ 88 (124) T pfam02780 11 DVTIVAYGSMVHEALEAAEELAAEGISAEVIDLRTVKPLDEDTILESVKKTGR--LVVVEEAVPRGGFGAEVAAALAEEG 88 (124) T ss_pred CEEEEECCHHHHHHHHHHHHHHHCCCCEEEEEEEEECCCCHHHHHHHHHCCCC--EEEEECCCCCCCHHHHHHHHHHHHC T ss_conf 79999847899999999999877598569974236436678999998733486--8998468867868999999999856 Q ss_pred HHCCCCCCEEEEEECCCCCCCC Q ss_conf 7505677601897536646723 Q gi|254781051|r 916 HSIGADYSRVRYVGRLPSASTA 937 (957) Q Consensus 916 ~~~~~~~~~~~y~gR~~~aspA 937 (957) . ..-..++.-+|-|-...|. T Consensus 89 ~--~~l~~~v~~ig~pd~~ip~ 108 (124) T pfam02780 89 F--DYLDAPVLRVGGPDTPIPH 108 (124) T ss_pred H--HHCCCCEEEECCCCCCCCC T ss_conf 6--5449996997479898999 No 170 >PRK10150 beta-D-glucuronidase; Provisional Probab=39.05 E-value=25 Score=14.68 Aligned_cols=43 Identities=16% Similarity=0.308 Sum_probs=27.8 Q ss_pred HHHHHHCCCCCEEEEEC-------------------------------CCHHHHHHHHHHHHCCCCCCCEEEEC Q ss_conf 78875214378269724-------------------------------88999999999974353488689856 Q gi|254781051|r 748 ERFLQMCAENNMYVANC-------------------------------TSPANYFHILRRQIYDRSSRPLIMMA 790 (957) Q Consensus 748 ERfLQl~a~~N~~V~~~-------------------------------ttpAq~FH~LRRQ~~r~~rkPLiv~t 790 (957) +|||+||++--|-|..- .+-..+-..||+-+.|++--|=|||= T Consensus 342 ~~~~~lcD~lGllV~~E~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~emi~rd~NHPSIi~W 415 (605) T PRK10150 342 EEMLDLADRHGIVVIDETPAVGLNLSFGIGFEAGNKPKETWSEEAVNGETQQAHLQAIRELIARDKNHPSVVMW 415 (605) T ss_pred HHHHHHHHHCCCEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 99999998669899992344354433355533466557543413379889999999999999854699749998 No 171 >KOG1000 consensus Probab=38.95 E-value=18 Score=15.72 Aligned_cols=22 Identities=18% Similarity=0.105 Sum_probs=7.6 Q ss_pred CHHHHHHHCCCCCEEEEECCCH Q ss_conf 0578875214378269724889 Q gi|254781051|r 746 RLERFLQMCAENNMYVANCTSP 767 (957) Q Consensus 746 RlERfLQl~a~~N~~V~~~ttp 767 (957) |.-|-=|-++-+=...|---|. T Consensus 584 RaHRiGQkssV~v~ylvAKgT~ 605 (689) T KOG1000 584 RAHRIGQKSSVFVQYLVAKGTA 605 (689) T ss_pred HHHHCCCCCEEEEEEEEECCCH T ss_conf 5664056651138999954845 No 172 >pfam01855 POR_N domain. This family includes the N terminal structural domain of the pyruvate ferredoxin oxidoreductase. This domain binds thiamine diphosphate, and along with domains II and IV, is involved in inter subunit contacts. The family also includes pyruvate flavodoxin oxidoreductase as encoded by the nifJ gene in cyanobacterium which is required for growth on molecular nitrogen when iron is limited. Probab=38.65 E-value=25 Score=14.64 Aligned_cols=100 Identities=15% Similarity=0.204 Sum_probs=68.2 Q ss_pred CCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCC--CHHHHHHCCCEEEECCCCHHHHHHHHHHH Q ss_conf 3334672588764510677443657999841602012470102467784--00112203405874267889999999999 Q gi|254781051|r 380 AFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYA--SDISKSIGIPIFHVNGDDPEAVIRVVRMA 457 (957) Q Consensus 380 AfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~--TDiAK~i~aPI~HVN~DdpEav~~~~~lA 457 (957) |-+|||+..=.=+|+...|-.++..+|++.- -|=.| += +.++ +|+--+.+++++-.-..++..|..-+..| T Consensus 65 aTs~~Gl~lm~e~l~~aag~~~P~V~~v~~R--~~~~~----gl-~~~~eq~D~~~~rd~g~~vl~~~~~QEa~d~~~~A 137 (230) T pfam01855 65 ATSGQGLLLMIENLGKAAGEELPIVIHVVAR--AGPSP----GL-SIFGDQGDLMAARDTGWIVLAPENVQEAFDLALVA 137 (230) T ss_pred ECCCCCHHHHHHHHHHHHHCCCCEEEEEEEC--CCCCC----CC-CCCCCHHHHHHHHHCCCEEEECCCHHHHHHHHHHH T ss_conf 0267527675678889987368859999966--78998----77-66651378999985690899469999999999999 Q ss_pred HHHHHHHCCCEEEEEEECCC-CCCCCCCCC Q ss_conf 99999738973998642112-565643542 Q gi|254781051|r 458 VSFRMKFHKSVVIDIVCYRR-FGHNEGDEP 486 (957) Q Consensus 458 ~~yR~~F~kDVvIDlv~YRr-~GHNE~DeP 486 (957) ++.-.+|+--|++-.=+||- |.-.-.+-| T Consensus 138 ~~lae~~~~Pv~v~~Dgf~~SH~~~~v~~~ 167 (230) T pfam01855 138 FNLAEKVRTPVIHLFDGFRTSHEYEKVELP 167 (230) T ss_pred HHHHHHHCCCEEEEECCCEEECCCCEEECC T ss_conf 999999889989982566134426405469 No 173 >pfam05637 Glyco_transf_34 galactosyl transferase GMA12/MNN10 family. This family contains a number of glycosyltransferase enzymes that contain a DXD motif. This family includes a number of C. elegans homologues where the DXD is replaced by DXH. Some members of this family are included in glycosyltransferase family 34. Probab=37.58 E-value=25 Score=14.55 Aligned_cols=58 Identities=10% Similarity=0.077 Sum_probs=39.9 Q ss_pred CHHHHHHHHHHHHHHHHHHCCCEEE-EEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHH Q ss_conf 8899999999999999973897399-864211256564354223557888886718998999 Q gi|254781051|r 446 DPEAVIRVVRMAVSFRMKFHKSVVI-DIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQL 506 (957) Q Consensus 446 dpEav~~~~~lA~~yR~~F~kDVvI-Dlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~ 506 (957) |.+-..++.+=-.+|-++.+=++++ +.--|.+++.-+. =.--|.|=+...+||...=+ T Consensus 22 ~~~~l~k~i~NR~dYA~~HGY~~~~~~~~~~~~~~~~~~---W~Kv~~iR~aM~~~P~aew~ 80 (242) T pfam05637 22 DHEYLLKSIKNRVDYAERHGYGFEYVDNSALLPHEMPGY---WAKLPAIRQTMKKYPDAEWI 80 (242) T ss_pred CHHHHHHHHHHHHHHHHHHCCEEEEEEHHHCCCCCCCCC---EEHHHHHHHHHHHCCCCEEE T ss_conf 689999999869999998598599960132468557772---42689999999988993499 No 174 >cd03415 CbiX_CbiC Archaeal sirohydrochlorin cobalt chelatase (CbiX) single domain. Proteins in this subgroup contain a single CbiX domain N-terminal to a precorrin-8X methylmutase (CbiC) domain. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, while CbiC catalyzes the conversion of cobalt-precorrin 8 to cobyrinic acid by methyl rearrangement. Both CbiX and CbiC are involved in vitamin B12 biosynthesis. Probab=37.16 E-value=26 Score=14.47 Aligned_cols=35 Identities=20% Similarity=0.097 Sum_probs=15.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCC Q ss_conf 7889999999999999997389739986421125656435422 Q gi|254781051|r 445 DDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPS 487 (957) Q Consensus 445 DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~ 487 (957) .+|++.-...++|-.+|+++...|+ .||+|..||+ T Consensus 11 R~~~~~e~~~~la~~v~e~~~~~v~--------~~fmElaePs 45 (125) T cd03415 11 RRNTFNEDMEEWAAYLERKLGVPVY--------LTYNEYAEPN 45 (125) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCEE--------EEEECCCCCC T ss_conf 9577899999999999986199558--------7551106998 No 175 >COG5618 Predicted periplasmic lipoprotein [General function prediction only] Probab=36.77 E-value=26 Score=14.43 Aligned_cols=109 Identities=24% Similarity=0.293 Sum_probs=64.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCC-----CEEEEECCCCCCCCCCCCEEEEEECCCCCEEE Q ss_conf 9862056778889989987334356421147898889886358-----31578501002233323027999778884455 Q gi|254781051|r 579 KSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEG-----YKVRLSGQDCERGTFSHRHAILCDQETGKRYF 653 (957) Q Consensus 579 e~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG-----~~VRlsGQD~~RGTFshRHavl~dq~t~~~y~ 653 (957) .+|++.+-+.-|....---+...+.+-|+---+||-+ -..| +.||++|-=+.--|=|.---|=.|-..+.. T Consensus 43 sgfnPdkmva~IWa~kV~Pyl~~~AgpfaEV~ala~a--pk~gs~n~t~~vrftGkVvaaetksr~g~v~vd~dgdga-- 118 (206) T COG5618 43 SGFNPDKMVATIWAKKVMPYLNSNAGPFAEVKALASA--PKQGSNNWTFAVRFTGKVVAAETKSREGLVRVDIDGDGA-- 118 (206) T ss_pred CCCCCHHHHHHHHHHHHCHHHHCCCCCHHHHHHHHHC--CCCCCCCCEEEEEEEEEEEEEEECCCCCEEEEECCCCCC-- T ss_conf 5799314799998864023333158878999998738--122678974899986689975402555469984178752-- Q ss_pred EHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHH Q ss_conf 63442789845999824311110134320221138653111000024424541022565542 Q gi|254781051|r 654 PLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFIT 715 (957) Q Consensus 654 pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfis 715 (957) +++..+- -||-|-+.--|+. +-||+||-| ||-.-||=. T Consensus 119 --------~~RvQiG------PavrGtaiRDa~~--------fiqFndFkN--QIeyaq~gk 156 (206) T COG5618 119 --------DARVQIG------PAVRGTAIRDALD--------FIQFNDFKN--QIEYAQFGK 156 (206) T ss_pred --------EEEEEEC------CCCCCCEECCCCC--------EEEHHHHHH--HHHHHHHHH T ss_conf --------5798755------6305632101125--------053444555--889999999 No 176 >cd06254 M14_ASTE_ASPA_like_4 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=36.57 E-value=27 Score=14.41 Aligned_cols=86 Identities=16% Similarity=0.337 Sum_probs=52.3 Q ss_pred EEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEE-EECCCEECCC-----CCHHCCCCCCCCHHHHHHCCCEEEECC Q ss_conf 7888625753334672588764510677443657999-8416020124-----701024677840011220340587426 Q gi|254781051|r 371 LPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHL-IINNQIGFTT-----NPSSARSCTYASDISKSIGIPIFHVNG 444 (957) Q Consensus 371 lpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHi-VvNNQIGFTT-----~p~d~RSs~Y~TDiAK~i~aPI~HVN~ 444 (957) +-=-+|||- +.|+-++.+- +..|..-..-|||=+ =+=|-.||-. +|.| -+|+-+++- | T Consensus 22 i~agvHGdE-~~G~~~~~~L--~~~l~~~~l~G~viivP~~N~~g~~~~~r~~~p~D------~~nlNR~FP-------G 85 (288) T cd06254 22 ITAGVHGGE-YPGIQALQKL--AREIDPAKLSGTLIIVHVLNLSGFYARTPYIVPED------GKNLNRVFP-------G 85 (288) T ss_pred EECCCCCCH-HHHHHHHHHH--HHHCCHHHCCEEEEEEECCCHHHHHHCCCCCCCCC------CCCHHHCCC-------C T ss_conf 976766554-8899999999--87488757817799993589999983486778888------775644389-------9 Q ss_pred CCHHHHHHHHHHHHHHHHHH--CCCEEEEEEE Q ss_conf 78899999999999999973--8973998642 Q gi|254781051|r 445 DDPEAVIRVVRMAVSFRMKF--HKSVVIDIVC 474 (957) Q Consensus 445 DdpEav~~~~~lA~~yR~~F--~kDVvIDlv~ 474 (957) | ++. ...-++|...-.+. +.|++|||=| T Consensus 86 ~-~~G-s~teriA~~l~~~l~~~aD~~IDLHt 115 (288) T cd06254 86 D-KDG-TLTERIAYFLTEEVIDKADFLIDLHS 115 (288) T ss_pred C-CCC-CHHHHHHHHHHHHHHHHCCEEEEECC T ss_conf 9-999-98999999999988865989998236 No 177 >TIGR00936 ahcY adenosylhomocysteinase; InterPro: IPR000043 S-adenosyl-L-homocysteine hydrolase (3.3.1.1 from EC) (AdoHcyase) is an enzyme of the activated methyl cycle, responsible for the reversible hydration of S-adenosyl-L-homocysteine into adenosine and homocysteine. AdoHcyase is an ubiquitous enzyme which binds and requires NAD^+ as a cofactor. AdoHcyase is a highly conserved protein of about 430 to 470 amino acids. The family contains a glycine-rich region in the central part of AdoHcyase; a region thought to be involved in NAD-binding.; GO: 0004013 adenosylhomocysteinase activity, 0006730 one-carbon compound metabolic process. Probab=36.42 E-value=27 Score=14.39 Aligned_cols=79 Identities=19% Similarity=0.385 Sum_probs=61.6 Q ss_pred HHCCCCCCCCCEEEEEECCCEECCCCCHH------CCCCCCCC------HHHHHHCCCEEEECCCCHHHHHHHHHHHHH- Q ss_conf 51067744365799984160201247010------24677840------011220340587426788999999999999- Q gi|254781051|r 393 LSGLSGYTVAGNIHLIINNQIGFTTNPSS------ARSCTYAS------DISKSIGIPIFHVNGDDPEAVIRVVRMAVS- 459 (957) Q Consensus 393 ls~l~gY~tGGTiHiVvNNQIGFTT~p~d------~RSs~Y~T------DiAK~i~aPI~HVN~DdpEav~~~~~lA~~- 459 (957) ---|.|.++++++|+-+.--+=..|-..- +-|-+++| -++|..+.|||-.-|++.|.=.++.+-+++ T Consensus 36 ~kPl~G~~i~~cLH~~~~Ta~L~~TL~~~GAEv~~~~cNplSTQDdvaAA~~~~~G~~vyA~rGe~~E~Y~~~~~~~l~~ 115 (422) T TIGR00936 36 EKPLKGVRIAACLHLTVKTAVLIETLKAGGAEVALTACNPLSTQDDVAAALAKAAGIPVYAWRGETNEEYYEALEQVLDA 115 (422) T ss_pred CCCCCCEEEEEEECCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHH T ss_conf 58977527642432148899999999851974887447887745689999986269528885179848999999999852 Q ss_pred HHHHHCCCEEEE Q ss_conf 999738973998 Q gi|254781051|r 460 FRMKFHKSVVID 471 (957) Q Consensus 460 yR~~F~kDVvID 471 (957) +...-.-||||| T Consensus 116 ~~~~~~p~i~iD 127 (422) T TIGR00936 116 FKDDKEPNIIID 127 (422) T ss_pred HCCCCCCCEEEE T ss_conf 046899757882 No 178 >PRK04235 hypothetical protein; Provisional Probab=36.36 E-value=27 Score=14.39 Aligned_cols=72 Identities=22% Similarity=0.255 Sum_probs=43.2 Q ss_pred HHHHHCCCEEEECCCCHHHHHHHHHHHHH--HHHHH-----CCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCH Q ss_conf 11220340587426788999999999999--99973-----897399864211256564354223557888886718998 Q gi|254781051|r 431 ISKSIGIPIFHVNGDDPEAVIRVVRMAVS--FRMKF-----HKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSV 503 (957) Q Consensus 431 iAK~i~aPI~HVN~DdpEav~~~~~lA~~--yR~~F-----~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v 503 (957) |--.++.||+||-+-|.|+.-...++|.+ ||.-- .+-+++-+-| ++. +|-|-- | ..+--| T Consensus 92 l~~k~ep~ILHV~cr~le~A~~Ll~~A~~~GFr~SGI~s~~~~r~iVaIrs----~~~-levPl~-----~---~g~~lV 158 (200) T PRK04235 92 LWLKVQGPILHVAARDLETAKKLLELAREAGFKHSGIKSISDKRVIVEIRT----TER-MDVPLK-----E---DGKLLV 158 (200) T ss_pred EEEEECCCEEEEEECCHHHHHHHHHHHHHCCCCHHCCEECCCCCEEEEEEC----CCC-EEEEEC-----C---CCCCCC T ss_conf 899931978776438889999999999973852006364279828999941----571-776423-----4---981675 Q ss_pred HHHHHHHHHHCC Q ss_conf 999999986448 Q gi|254781051|r 504 LQLYADSLMKNQ 515 (957) Q Consensus 504 ~~~Y~~~Li~~g 515 (957) .+-|.+.|++.. T Consensus 159 ~~eyl~~Lv~~a 170 (200) T PRK04235 159 DDEYLKILVEIA 170 (200) T ss_pred CHHHHHHHHHHH T ss_conf 999999999999 No 179 >PRK13559 hypothetical protein; Provisional Probab=36.21 E-value=27 Score=14.37 Aligned_cols=58 Identities=26% Similarity=0.423 Sum_probs=43.6 Q ss_pred CCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCC Q ss_conf 701024677840011220340587426788999999999999999738973998642112565 Q gi|254781051|r 418 NPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGH 480 (957) Q Consensus 418 ~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GH 480 (957) |+...+-|-|..|=.-.-+|..++-...||++|-+. +-|+ +=++++-++++.|||-|. T Consensus 74 N~aF~~lTGYs~~EviGrncrfLqGp~t~~~~v~~i-r~ai----~~~~~~~~el~nyrKDGs 131 (363) T PRK13559 74 NQAFLDLTGYAAEEVVGRNCRFLQGAGTDPAAVAKI-RAAI----AAERVVVVELLNYRKDGS 131 (363) T ss_pred CHHHHHHHCCCHHHHCCCCCHHCCCCCCCHHHHHHH-HHHH----HCCCCEEEEEEEECCCCC T ss_conf 879997639897998499924108999998999999-9999----839946999999826998 No 180 >pfam10109 FluMu_gp41 Mu-like prophage FluMu protein gp41. Members of this family of prokaryotic proteins include various gp41 proteins and related sequences. Probab=35.97 E-value=27 Score=14.34 Aligned_cols=45 Identities=36% Similarity=0.450 Sum_probs=26.0 Q ss_pred HHCCC-----CCEEEEECCCHHHH-HHHHHHHHCC--CCCCCEEEECCCHHHCCCC Q ss_conf 52143-----78269724889999-9999997435--3488689856602210801 Q gi|254781051|r 752 QMCAE-----NNMYVANCTSPANY-FHILRRQIYD--RSSRPLIMMAPKSLLRHKR 799 (957) Q Consensus 752 Ql~a~-----~N~~V~~~ttpAq~-FH~LRRQ~~r--~~rkPLiv~tPKsLLR~~~ 799 (957) ||+|| +|=..+--++|+.+ .++||||+.. +..-| ++||-|.+... T Consensus 36 q~AAErv~~~~nG~pv~~~S~~l~G~elLrRQI~~IG~i~GP---~s~k~l~~l~~ 88 (105) T pfam10109 36 QLAAERVVMSDNGKPVAYTSPVLMGLELLRRQIEFIGNINGP---LSPKQLRKLST 88 (105) T ss_pred HHHHHHEEECCCCCEEEEECHHHHHHHHHHHHHHHHCCCCCC---CCHHHHHCCCH T ss_conf 765542476799957999683887799999999986287898---69999962499 No 181 >cd01854 YjeQ_engC YjeQ/EngC. YjeQ (YloQ in Bacillus subtilis) represents a protein family whose members are broadly conserved in bacteria and have been shown to be essential to the growth of E. coli and B. subtilis. Proteins of the YjeQ family contain all sequence motifs typical of the vast class of P-loop-containing GTPases, but show a circular permutation, with a G4-G1-G3 pattern of motifs as opposed to the regular G1-G3-G4 pattern seen in most GTPases. All YjeQ family proteins display a unique domain architecture, which includes an N-terminal OB-fold RNA-binding domain, the central permuted GTPase domain, and a zinc knuckle-like C-terminal cysteine domain. This domain architecture suggests a role for YjeQ as a regulator of translation. Probab=35.97 E-value=17 Score=15.86 Aligned_cols=70 Identities=14% Similarity=0.138 Sum_probs=41.1 Q ss_pred CCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHH----HHHHHHHHCCCCCCCEEEECCCHHHCCCCCCCCHHHHCCCC Q ss_conf 887656440578875214378269724889999----99999974353488689856602210801268878937998 Q gi|254781051|r 738 QGPEHSSARLERFLQMCAENNMYVANCTSPANY----FHILRRQIYDRSSRPLIMMAPKSLLRHKRVVSSLSDMTCGS 811 (957) Q Consensus 738 qGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~----FH~LRRQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~d~~~g~ 811 (957) ..|..+..||||||-+|...|+.++.+=|-+-. --.++.......--|.+..+.+. +.....|.++..|. T Consensus 89 ~~p~~~~~~idR~Lv~a~~~~i~pvivlnK~DL~~~~~~~~~~~~~~~~gy~v~~~S~~~----~~g~~~L~~~l~~k 162 (287) T cd01854 89 NEPFFNPRLLDRYLVAAEAAGIEPVIVLTKADLLDDEEEELELVEALALGYPVLAVSAKT----GEGLDELREYLKGK 162 (287) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEEECHHCCCCHHHHHHHHHHHCCCCEEEEEECCC----CCCHHHHHHHHCCC T ss_conf 899899899999999999779968999986221994899999999872998499996689----85889999874798 No 182 >PRK04081 hypothetical protein; Provisional Probab=35.69 E-value=5 Score=19.92 Aligned_cols=61 Identities=18% Similarity=0.159 Sum_probs=43.5 Q ss_pred HHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHH--HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 997389739986421125656435422355788888671899899--99999864489898999999999999977566 Q gi|254781051|r 461 RMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQ--LYADSLMKNQVISKQELQSLANNWHKYLEAEY 537 (957) Q Consensus 461 R~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~--~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~ 537 (957) -|+=+.=|||+= |.|--|++++++|+... +-.+.=-.|-+++++|++++.++...+.|+.- T Consensus 44 ~q~qgafviIEe----------------~~~g~YKi~eE~PS~~TRvilrd~dGtER~LS~eE~d~Li~eE~aKId~GT 106 (212) T PRK04081 44 SQQQGAFVIIEE----------------QADGSYKILEEYPSSETRIVLRDLNGTERVLSKEEMDKLIKEEEAKIDNGT 106 (212) T ss_pred HHCCCCEEEEEE----------------CCCCCEEEEECCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 750786899986----------------389944751006898624888648975001779999999999886640687 No 183 >PRK05439 pantothenate kinase; Provisional Probab=35.53 E-value=14 Score=16.56 Aligned_cols=64 Identities=19% Similarity=0.189 Sum_probs=41.5 Q ss_pred CCCCHHHHHHHHHH-HCCCCCCCCCC-----HHH--HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCE Q ss_conf 77899999999887-43689986205-----677--888998998733435642114789888988635831 Q gi|254781051|r 560 NSVSKEILKKIGSS-IIRLPKSFNTH-----KIV--ERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGYK 623 (957) Q Consensus 560 T~v~~~~L~~i~~~-l~~~Pe~f~~h-----~~v--~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~ 623 (957) -+|.+.+..++.+. +..+|++.++. -.+ .+.|++|-=|-++|-+=-+-+.-.+.|-+=+..|.+ T Consensus 95 VAVGKSTtARlLq~LL~r~~~~~~V~LvTTDGFLypNa~L~~rglm~RKGFPESYD~~~Ll~Fl~~vKsG~~ 166 (312) T PRK05439 95 VAVGKSTTARLLQALLSRWPEHPKVDLVTTDGFLYPNAELKERGLMKRKGFPESYDMRALLRFLSDVKSGKP 166 (312) T ss_pred EECCHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHCCCC T ss_conf 102628899999999950789994589934665578689987677124789854479999999999976998 No 184 >TIGR01706 NAPA periplasmic nitrate reductase, large subunit; InterPro: IPR010051 This entry represents the large subunit of a family of nitrate reductases found in proteobacteria which are localised to the periplasm. This subunit binds molybdopterin and contains a twin-arginine motif at the N terminus. The protein associates with NapB, a soluble haem-containing protein and NapC, a membrane-bound cytochrome c. The periplasmic nitrate reductases are not involved in the assimilation of nitrogen, and are not directly involved in the formation of electrochemical gradients (i.e. respiration) either. Rather, the purpose of this enzyme is either dissimilatory (i.e. to dispose of excess reductive equivalents) or indirectly respiratory by virtue of the consumption of electrons derived from NADH via the proton translocating NADH dehydrogenase. The enzymes from Alicagenes eutrophus and Paracoccus pantotrophus have been characterised. In Escherichia coli (as well as other organisms) this gene is part of a large nitrate reduction operon (napFDAGHBC).; GO: 0008940 nitrate reductase activity, 0030151 molybdenum ion binding, 0051539 4 iron 4 sulfur cluster binding, 0042128 nitrate assimilation. Probab=35.50 E-value=12 Score=17.03 Aligned_cols=32 Identities=41% Similarity=0.575 Sum_probs=22.5 Q ss_pred HHHHHHHHHHHCC-------CEEEEECCCC------CCCCCCCCE Q ss_conf 7898889886358-------3157850100------223332302 Q gi|254781051|r 609 AESLAFGSLCYEG-------YKVRLSGQDC------ERGTFSHRH 640 (957) Q Consensus 609 AE~LA~~SLl~eG-------~~VRlsGQD~------~RGTFshRH 640 (957) +..|+|--=|.-| -+--|+||-+ +=|||+||= T Consensus 433 ~n~~vY~lHlL~GK~a~PG~~pFSLTGQpSACGTAReVGTF~HRL 477 (930) T TIGR01706 433 ANNLVYNLHLLTGKIAKPGNSPFSLTGQPSACGTAREVGTFSHRL 477 (930) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 888899988751654688887534577766666523466323348 No 185 >cd04821 PA_M28_1_2 PA_M28_1_2: Protease-associated (PA) domain, peptidase family M28, subfamily-1, subgroup 2. A subgroup of PA-domain containing proteins belonging to the peptidase family M28. Family M28 contains aminopeptidases and carboxypeptidases, and has co-catalytic zinc ions. The PA domain is an insert domain in a diverse fraction of proteases. The significance of the PA domain to many of the proteins in which it is inserted is undetermined. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate. Proteins into which the PA domain is inserted include the following members of the peptidase family M28: i) prostate-specific membrane antigen (PSMA), ii) yeast aminopeptidase Y, and ii) human TfR (transferrin receptor)1 and human TfR2. The proteins listed above belong to other subgroups; relatively litt Probab=35.40 E-value=22 Score=15.03 Aligned_cols=51 Identities=31% Similarity=0.595 Sum_probs=41.0 Q ss_pred CCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCC-CH--HCCCCCC Q ss_conf 575333467258876451067744365799984160201247-01--0246778 Q gi|254781051|r 377 GDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTN-PS--SARSCTY 427 (957) Q Consensus 377 GDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~-p~--d~RSs~Y 427 (957) -+.-|+|-||++.-.+-..-.|-.+-|-|-||..|-=||.+. |. .++.-+| T Consensus 24 a~~VfvGyGi~ape~~~dDy~g~Dv~gK~vv~l~~~P~f~~~~~~~f~g~~mTy 77 (157) T cd04821 24 SPLVFVGYGIVAPEYGWDDYKGLDVKGKTVVILVNDPGFATPDSGLFNGKAMTY 77 (157) T ss_pred CCEEEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCCCCHHHCCCCCCC T ss_conf 998997357058866864235877679889998379986777702206754420 No 186 >TIGR03494 salicyl_syn salicylate synthase. Members of this protein family are salicylate synthases, bifunctional enzymes that make salicylate, in two steps, from chorismate. Members are homologous to anthranilate synthase component I from Trp biosynthesis. Members typically are found in gene regions associated with siderophore or other secondary metabolite biosynthesis. Probab=35.31 E-value=13 Score=16.75 Aligned_cols=51 Identities=12% Similarity=0.111 Sum_probs=26.1 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCH Q ss_conf 899999855887432234555100150268898359618999717877310 Q gi|254781051|r 286 PRSIFYEFKGKGPSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLE 336 (957) Q Consensus 286 ~~~iF~EF~g~~~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLE 336 (957) ...+.++|++.++--++-.|+.-.-+|.-..+..+...++++..-...+-+ T Consensus 6 ~a~ll~~~~~~~y~~YEr~~ew~~g~G~~a~l~~d~~~~~~~~~g~~~~~~ 56 (425) T TIGR03494 6 LAAVLSEFAEEDYYVYERDGEWYLGIGVQAMLTIDSDGRRVTFTGVTDRQR 56 (425) T ss_pred HHHHHHHCCCCCEEEEEECCEEEEEECCEEEEEECCCCEEEEECCCCCEEC T ss_conf 999986378887499997997999837649999868855998637500002 No 187 >pfam02030 Lipoprotein_8 Hypothetical lipoprotein (MG045 family). This family includes hypothetical lipoproteins, the amino terminal part of this protein is related to pfam01547, a family of solute binding proteins. This suggests this family also has a solute binding function. Probab=35.20 E-value=21 Score=15.26 Aligned_cols=35 Identities=20% Similarity=0.268 Sum_probs=24.6 Q ss_pred EEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEE Q ss_conf 8886257533346725887645106774436579998 Q gi|254781051|r 372 PLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLI 408 (957) Q Consensus 372 piliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiV 408 (957) .|+--|||+||.||=-+-+--=+ +-.-+|-+.||| T Consensus 267 ~I~YNGD~~fAA~GGd~~~e~~~--n~~p~~~nfHIV 301 (493) T pfam02030 267 GIVYNGDAVYAALGGDLRDEADE--NMKPTGDNFHIV 301 (493) T ss_pred CEEECCCEEEHHCCCCCCCCCCC--CCCCCCCCEEEE T ss_conf 26864410010148755455565--678777741374 No 188 >TIGR02812 fadR_gamma fatty acid metabolism transcriptional regulator FadR; InterPro: IPR014178 Members of this entry are transcriptional regulators of fatty acid metabolism (FadR), including both biosynthesis and beta-oxidation reactions, which are found exclusively in a subset of Gammaproteobacteria with strictly one copy per genome. Proteins have an N-terminal DNA-binding domain and a less well conserved C-terminal long chain acyl-CoA-binding domain. FadR when expressed in Escherichia coli shows differences in regulatory response and fatty acid binding profiles. All members of this entry are expected to have the same function.; GO: 0000062 acyl-CoA binding, 0003677 DNA binding, 0003700 transcription factor activity, 0019217 regulation of fatty acid metabolic process. Probab=35.12 E-value=22 Score=15.03 Aligned_cols=24 Identities=29% Similarity=0.571 Sum_probs=10.3 Q ss_pred HHHHHHHHHCCCEEEEEECCCCHHHHH Q ss_conf 999999840643027772489989999 Q gi|254781051|r 165 IVDVLSHLYCSNIGVEFMHIVDSIERD 191 (957) Q Consensus 165 Ii~~L~~tYCgsIGvEymHI~d~eer~ 191 (957) |+.-|+..|. .||-=|= .+|+-|. T Consensus 203 ILNglKglY~-RiG~yYF--~~peaRe 226 (275) T TIGR02812 203 ILNGLKGLYT-RIGLYYF--ASPEARE 226 (275) T ss_pred HCCHHHHHHH-HHHHHCC--CCHHHHH T ss_conf 2001014555-5511015--8878889 No 189 >COG1162 Predicted GTPases [General function prediction only] Probab=34.91 E-value=19 Score=15.46 Aligned_cols=70 Identities=11% Similarity=0.283 Sum_probs=41.6 Q ss_pred CCCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHH---HHHCCCCC---CCEEEECCCHHHCCCCCCCCHHHHCCCC Q ss_conf 88765644057887521437826972488999999999---97435348---8689856602210801268878937998 Q gi|254781051|r 738 QGPEHSSARLERFLQMCAENNMYVANCTSPANYFHILR---RQIYDRSS---RPLIMMAPKSLLRHKRVVSSLSDMTCGS 811 (957) Q Consensus 738 qGPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LR---RQ~~r~~r---kPLiv~tPKsLLR~~~a~S~l~d~~~g~ 811 (957) .-|+-|+.+|+|||=+|-..|++.+.|=|-+.+---.- +++.+.++ =|++..+-|+= .....|.+...|. T Consensus 90 ~~P~~~~~~ldR~Lv~ae~~gi~pvIvlnK~DL~~~~~~~~~~~~~~y~~~gy~v~~~s~~~~----~~~~~l~~~l~~~ 165 (301) T COG1162 90 VDPDFNTNLLDRYLVLAEAGGIEPVIVLNKIDLLDDEEAAVKELLREYEDIGYPVLFVSAKNG----DGLEELAELLAGK 165 (301) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCEEEEEECCCCC----CCHHHHHHHHCCC T ss_conf 689989889999999997769957999975336761677899999999867935999537676----5589999975588 No 190 >PRK05301 pyrroloquinoline quinone biosynthesis protein PqqE; Provisional Probab=34.51 E-value=29 Score=14.18 Aligned_cols=51 Identities=18% Similarity=0.287 Sum_probs=40.0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCHHHHH-HHHHHHHHHHHHCCCCEE Q ss_conf 888798899999999999998999998640336222256465899-999999999873791079 Q gi|254781051|r 203 SSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFGADGSEVII-PAIEEIIRQGVQQGVDEM 265 (957) Q Consensus 203 ~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsLEG~EslI-P~L~~ii~~a~~~Gv~ei 265 (957) ..++|.+|-+++++.|.+ .|.+++.+-|||-|+ |-+.+|++.+.+.|+.-. T Consensus 44 ~~ELs~~e~~~~id~l~~------------~Gv~~v~~tGGEPllr~D~~ei~~~a~~~G~~~~ 95 (375) T PRK05301 44 GAELSTAEWIRVLREARA------------LGVLQLHFSGGEPLLRKDLEELVAHARRLGLYTN 95 (375) T ss_pred CCCCCHHHHHHHHHHHHH------------CCCCEEEECCCCCCCCCCHHHHHHHHHHCCCEEE T ss_conf 578999999999999998------------6998899618652456689999999997697589 No 191 >TIGR02114 coaB_strep phosphopantothenate--cysteine ligase; InterPro: IPR011848 In most bacteria, a single bifunctional protein catalyses phosphopantothenoylcysteine decarboxylase and phosphopantothenate--cysteine ligase activities, sequential steps in coenzyme A biosynthesis. These activities reside in separate proteins encoded by tandem genes in some bacterial lineages. This entry describes proteins from the genera Streptococcus and Enterococcus homologous to the C-terminal region of IPR005252 from INTERPRO, corresponding to phosphopantothenate--cysteine ligase activity .. Probab=34.50 E-value=25 Score=14.61 Aligned_cols=13 Identities=31% Similarity=0.468 Sum_probs=7.5 Q ss_pred CCCHHHHHHHHHH Q ss_conf 7899999999887 Q gi|254781051|r 561 SVSKEILKKIGSS 573 (957) Q Consensus 561 ~v~~~~L~~i~~~ 573 (957) -|+.|.|-+++++ T Consensus 186 nV~~e~Lv~vAr~ 198 (253) T TIGR02114 186 NVTQEELVKVARA 198 (253) T ss_pred CCCHHHHHHHHHH T ss_conf 3786568999998 No 192 >cd04814 PA_M28_1 PA_M28_1: Protease-associated (PA) domain, peptidase family M28, subfamily-1. A subfamily of PA-domain containing proteins belonging to the peptidase family M28. Family M28 contains aminopeptidases and carboxypeptidases, and has co-catalytic zinc ions. The PA domain is an insert domain in a diverse fraction of proteases. The significance of the PA domain to many of the proteins in which it is inserted is undetermined. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate. Proteins into which the PA domain is inserted include the following members of the peptidase family M28: i) prostate-specific membrane antigen (PSMA), ii) yeast aminopeptidase Y, and ii) human TfR (transferrin receptor)1 and human TfR2. The proteins listed above belong to other subfamilies, relatively little is known a Probab=34.47 E-value=20 Score=15.35 Aligned_cols=41 Identities=20% Similarity=0.370 Sum_probs=35.5 Q ss_pred CCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCC Q ss_conf 75333467258876451067744365799984160201247 Q gi|254781051|r 378 DAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTN 418 (957) Q Consensus 378 DAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~ 418 (957) +--|+|-||++.-.+-.+-.|-.+-|-|-||..|-=+|.+. T Consensus 23 ~lVFvGYGi~ape~~~dDy~g~Dv~GKivvvl~ndP~~~~~ 63 (142) T cd04814 23 PLVFVGYGIKAPELSWDDYAGLDVKGKVVVVLRNDPQGEPG 63 (142) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCCEEEEEECCCCCCCC T ss_conf 87997146157766855345877679789998079999887 No 193 >cd02524 G1P_cytidylyltransferase G1P_cytidylyltransferase catalyzes the production of CDP-D-Glucose. Alpha-D-Glucose-1-phosphate Cytidylyltransferase catalyzes the production of CDP-D-Glucose from alpha-D-Glucose-1-phosphate and MgCTP as substrate. CDP-D-Glucose is the precursor for synthesizing four of the five naturally occurring 3,6-dideoxy sugars-abequose (3,6-dideoxy-D-Xylo-hexose), ascarylose (3,6-dideoxy-L-arabino-hexose), paratose (3,6-dideoxy-D-ribohexose), and tyvelose (3,6-dideoxy-D-arabino-hexose. Deoxysugars are ubiquitous in nature where they function in a variety of biological processes, including cell adhesion, immune response, determination of ABO blood groups, fertilization, antibiotic function, and microbial pathogenicity. Probab=34.38 E-value=29 Score=14.16 Aligned_cols=28 Identities=21% Similarity=0.446 Sum_probs=24.2 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCHH Q ss_conf 9999999998737910799806765526 Q gi|254781051|r 248 PAIEEIIRQGVQQGVDEMILGMAHRGRL 275 (957) Q Consensus 248 P~L~~ii~~a~~~Gv~eiviGMaHRGRL 275 (957) |+|+.+++..+..|+++|+|...|.|.. T Consensus 30 Pii~~ii~~l~~~gi~~iil~~gy~~~~ 57 (253) T cd02524 30 PILWHIMKIYSHYGHNDFILCLGYKGHV 57 (253) T ss_pred EHHHHHHHHHHHCCCCEEEEECCCCCHH T ss_conf 8999999999986998799953225113 No 194 >TIGR00564 trpE_most anthranilate synthase component I; InterPro: IPR005256 This enzyme resembles some other chorismate-binding enzymes, including para-aminobenzoate synthase (pabB) and isochorismate synthase. There is a fairly deep split between two sets, seen in the pattern of gaps as well as in amino acid sequence differences. This group includes eukaryotes, archaea, and many bacterial lineages; sequences from the second group may resemble pabB more closely than other trpE from the other group. The other group includes Gram-negative proteobacteria such as Escherichia coli and Helicobacter pylori but also the Gram-positive organism Corynebacterium glutamicum, and is described by IPR005257 from INTERPRO. A sequence from Bacillus subtilis that scores above the trusted cut-off is annotated as PabB rather than TrpE. However, it is part of an operon that is required for Trp as well as folate biosynthesis, is Trp-repressible, and contains TrpG. It is likely that this sequence annotated as PabB functions both as PabB and as TrpE. ; GO: 0004049 anthranilate synthase activity, 0000162 tryptophan biosynthetic process. Probab=34.10 E-value=22 Score=14.99 Aligned_cols=40 Identities=18% Similarity=0.252 Sum_probs=32.7 Q ss_pred CCHHH-CCCEEECCCCCCCCCCCHHHHHHHHHHH----HCCCEEE Q ss_conf 99788-1715725665787777899999999984----0643027 Q gi|254781051|r 140 FVKAD-YDRKICMKGVLGLESATIPEIVDVLSHL----YCSNIGV 179 (957) Q Consensus 140 Lse~D-Ld~~f~~~~~~g~~~~tL~eIi~~L~~t----YCgsIGv 179 (957) ++-.| |-..|+.|++-|-++.--=|||+.||.+ |-|-||+ T Consensus 426 ~t~~DALra~lPAGTlSGAPK~RAme~I~ELE~~~RG~YgGavGy 470 (543) T TIGR00564 426 LTAIDALRATLPAGTLSGAPKVRAMELIDELEPEKRGIYGGAVGY 470 (543) T ss_pred CCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCEEEEC T ss_conf 773026643367764578844899999998717884235306871 No 195 >TIGR01418 PEP_synth phosphoenolpyruvate synthase; InterPro: IPR006319 This family represents phosphoenolpyruvate synthase; also called pyruvate,water dikinase and PEP synthase. The member from Methanococcus jannaschii contains a large intein. This enzyme generates phosphoenolpyruvate (PEP) from pyruvate, hydrolyzing ATP to AMP and releasing inorganic phosphate in the process. The enzyme shows extensive homology to other enzymes that use PEP as substrate or product. This enzyme may provide PEP for gluconeogenesis, for PTS-type carbohydrate transport systems, or for other processes. ; GO: 0008986 pyruvate water dikinase activity, 0006094 gluconeogenesis. Probab=33.95 E-value=13 Score=16.79 Aligned_cols=25 Identities=24% Similarity=0.450 Sum_probs=14.9 Q ss_pred EEECCCCHHHHHHHH-HHHHH----HHHHH Q ss_conf 874267889999999-99999----99973 Q gi|254781051|r 440 FHVNGDDPEAVIRVV-RMAVS----FRMKF 464 (957) Q Consensus 440 ~HVN~DdpEav~~~~-~lA~~----yR~~F 464 (957) +-+|==.||..-+.+ .|-.| .|.+| T Consensus 543 i~mNvg~Pe~Af~~ar~lP~dGVGLaR~Ef 572 (877) T TIGR01418 543 IYMNVGNPERAFRFARALPNDGVGLARIEF 572 (877) T ss_pred EEEECCCCCHHHHHHHHCCCCCCHHHHHHH T ss_conf 898538854578997525788622788999 No 196 >pfam05221 AdoHcyase S-adenosyl-L-homocysteine hydrolase. Probab=33.91 E-value=29 Score=14.11 Aligned_cols=36 Identities=22% Similarity=0.487 Sum_probs=14.3 Q ss_pred CCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 340587426788999999999999999738973998 Q gi|254781051|r 436 GIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVID 471 (957) Q Consensus 436 ~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvID 471 (957) ++|||-..|+++|.--....-+++|...-+-|++|| T Consensus 92 Gi~VfA~~g~t~eey~~~~~~~L~~~~~~~P~iiiD 127 (430) T pfam05221 92 GVPVFAWKGETLEEYWWCIERALTFKDGQGPNMILD 127 (430) T ss_pred CCEEEEECCCCHHHHHHHHHHHHCCCCCCCCCEEEC T ss_conf 976999379998999999999846588989656542 No 197 >PRK06772 salicylate synthase Irp9; Reviewed Probab=33.76 E-value=19 Score=15.55 Aligned_cols=35 Identities=11% Similarity=-0.008 Sum_probs=15.3 Q ss_pred CCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH Q ss_conf 78999999998874368998620567788899899 Q gi|254781051|r 561 SVSKEILKKIGSSIIRLPKSFNTHKIVERLMANRQ 595 (957) Q Consensus 561 ~v~~~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R~ 595 (957) |-+.+.-+.+...|..-|++..=|--+.+...+.+ T Consensus 262 g~d~~eD~~l~~eLl~d~KE~aEH~mvVdl~rndL 296 (434) T PRK06772 262 MGNPEHNKAKEAELLHDSKEVLEHILSVKEAIAEL 296 (434) T ss_pred CCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 96989999999988549688999999999999887 No 198 >KOG2414 consensus Probab=33.37 E-value=26 Score=14.53 Aligned_cols=59 Identities=27% Similarity=0.292 Sum_probs=34.8 Q ss_pred CCEEEEEECCCEECCCCC--HH--CCCCCCCCHHHHHHCCCEE--EECCCCHHHHHHHHHHHHHH Q ss_conf 657999841602012470--10--2467784001122034058--74267889999999999999 Q gi|254781051|r 402 AGNIHLIINNQIGFTTNP--SS--ARSCTYASDISKSIGIPIF--HVNGDDPEAVIRVVRMAVSF 460 (957) Q Consensus 402 GGTiHiVvNNQIGFTT~p--~d--~RSs~Y~TDiAK~i~aPI~--HVN~DdpEav~~~~~lA~~y 460 (957) +-|||-+.|||+=+--+- -| .-.+-|||||.+..-+-=+ -+..|==|||..+-+=-+.| T Consensus 292 a~tIHY~~Nnq~l~d~emVLvDaGcelgGYvSDITRTWP~sGkFs~~Qr~LYeavL~vq~ecik~ 356 (488) T KOG2414 292 ANTIHYVRNNQLLKDDEMVLVDAGCELGGYVSDITRTWPISGKFSDAQRDLYEAVLQVQEECIKY 356 (488) T ss_pred CCEEEEEECCCCCCCCCEEEEECCCCCCCEECCCEECCCCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 52688862455467884799945744586671440016787832707899999999999999986 No 199 >cd01316 CAD_DHOase The eukaryotic CAD protein is a trifunctional enzyme of carbamoylphosphate synthetase-aspartate transcarbamoylase-dihydroorotase, which catalyzes the first three steps of de novo pyrimidine nucleotide biosynthesis. Dihydroorotase (DHOase) catalyzes the third step, the reversible interconversion of carbamoyl aspartate to dihydroorotate. Probab=33.29 E-value=30 Score=14.04 Aligned_cols=13 Identities=31% Similarity=0.493 Sum_probs=4.9 Q ss_pred ECCCCHHHHHHHH Q ss_conf 2256465899999 Q gi|254781051|r 238 FGADGSEVIIPAI 250 (957) Q Consensus 238 FsLEG~EslIP~L 250 (957) +|+-|.|+++|+| T Consensus 240 ~G~~GlEt~lp~~ 252 (344) T cd01316 240 PGFPGVETSLPLL 252 (344) T ss_pred CCCCHHHHHHHHH T ss_conf 9856277689999 No 200 >PTZ00186 heat shock 70 kDa precursor protein; Provisional Probab=33.28 E-value=28 Score=14.23 Aligned_cols=36 Identities=14% Similarity=0.314 Sum_probs=15.9 Q ss_pred HHHHHHHH--HCCCCEEEE-ECCCCCHH--HHHHHHHCCCH Q ss_conf 99999987--379107998-06765526--89998853798 Q gi|254781051|r 251 EEIIRQGV--QQGVDEMIL-GMAHRGRL--NVLSQIMNKLP 286 (957) Q Consensus 251 ~~ii~~a~--~~Gv~eivi-GMaHRGRL--NVL~Nvl~Kp~ 286 (957) +..++.|. ...+.+|++ |=+-|=-+ ..|.+++||++ T Consensus 341 ~~aL~dAgl~~~dId~VILVGGSTRIP~Vq~~l~~~Fgkep 381 (657) T PTZ00186 341 KQCMKDAGVELKEINDVVLVGGMTRMPKVVEEVKKFFQKDP 381 (657) T ss_pred HHHHHHHCCCHHHEEEEEEECCCCCCHHHHHHHHHHHCCCC T ss_conf 99999818998780089995784154799999999858997 No 201 >COG3250 LacZ Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism] Probab=33.13 E-value=30 Score=14.02 Aligned_cols=16 Identities=19% Similarity=0.183 Sum_probs=7.3 Q ss_pred EEECCCCCCCCCHHHH Q ss_conf 8625753334672588 Q gi|254781051|r 374 IIHGDAAFAGQGIVSE 389 (957) Q Consensus 374 liHGDAAfaGQGVV~E 389 (957) --|-|...-|+++-.+ T Consensus 306 nrHe~~~~~G~~~~~~ 321 (808) T COG3250 306 NRHEDDPILGRVTDED 321 (808) T ss_pred ECCCCCCCCCCCCCHH T ss_conf 0466877655454777 No 202 >PRK13410 molecular chaperone DnaK; Provisional Probab=32.61 E-value=29 Score=14.13 Aligned_cols=36 Identities=17% Similarity=0.263 Sum_probs=17.8 Q ss_pred CCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCC Q ss_conf 540478886257533346725887645106774436 Q gi|254781051|r 367 RSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVA 402 (957) Q Consensus 367 ~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tG 402 (957) ...|.--..-|.-.++..-.---.|.|+++|.=--| T Consensus 428 Qt~v~I~V~qGEr~~~~dn~~Lg~F~L~gippap~G 463 (719) T PRK13410 428 QTLVEIHVLQGEREMASDNKSLGRFKLTGIPPAPRG 463 (719) T ss_pred CCEEEEEEEECCCCCCCCCCEEEEEEECCCCCCCCC T ss_conf 715899998337853102846758886187989999 No 203 >pfam04952 AstE_AspA Succinylglutamate desuccinylase / Aspartoacylase family. This family includes Succinylglutamate desuccinylase EC:3.1.-.- that catalyses the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway. The family also include aspartoacylase EC:3.5.1.15 which cleaves acylaspartate into a fatty acid and aspartate. Mutations in human aspartoacylase (ACY-2) lead to Canavan disease. This family is probably structurally related to pfam00246 (Bateman A pers. obs.). Probab=32.24 E-value=31 Score=13.92 Aligned_cols=95 Identities=23% Similarity=0.350 Sum_probs=56.6 Q ss_pred EEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCE-EEEEECCCEECCCCCHHCCCCC-CCCHHHHHHCCCEEEECC--CC Q ss_conf 7888625753334672588764510677443657-9998416020124701024677-840011220340587426--78 Q gi|254781051|r 371 LPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGN-IHLIINNQIGFTTNPSSARSCT-YASDISKSIGIPIFHVNG--DD 446 (957) Q Consensus 371 lpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGT-iHiVvNNQIGFTT~p~d~RSs~-Y~TDiAK~i~aPI~HVN~--Dd 446 (957) +-=-+|||- +.|+-++.+-+.. |..-..-|+ |.+.+=|-+||...- |-+. --+|+-+++-- --+| .. T Consensus 7 i~aG~HGdE-~~G~~~~~~L~~~--l~~~~l~G~ii~vP~~Np~a~~~~~---R~~p~d~~nlNR~FPG---~~~g~~~~ 77 (289) T pfam04952 7 LSAGIHGNE-LAGVELLRRLLRQ--LDPAQRAGTRILVPLANPPAFRAGS---RYIPIDRRDLNRAFPG---RADGSSGS 77 (289) T ss_pred EECCCCCCH-HHHHHHHHHHHHC--CCHHHCCCEEEEEECCCHHHHHHCE---ECCCCCCCCHHCCCCC---CCCCCCCC T ss_conf 986877652-8799999999865--7354459319999634999997277---4578898830114898---63578530 Q ss_pred HHHHHHHHHHHHHHHHHH--CCCEEEEEEE Q ss_conf 899999999999999973--8973998642 Q gi|254781051|r 447 PEAVIRVVRMAVSFRMKF--HKSVVIDIVC 474 (957) Q Consensus 447 pEav~~~~~lA~~yR~~F--~kDVvIDlv~ 474 (957) +......-++|-.+..+. ..|++|||=| T Consensus 78 ~~~~t~teria~~~~~~~~~~aD~~iDLHt 107 (289) T pfam04952 78 PYRATRAERIADAFFRALLPRADIVLDLHT 107 (289) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 135379999999999754405778999046 No 204 >pfam06898 YqfD Putative stage IV sporulation protein YqfD. This family consists of several putative bacterial stage IV sporulation (SpoIV) proteins. YqfD of Bacillus subtilis is known to be essential for efficient sporulation although its exact function is unknown. Probab=31.97 E-value=26 Score=14.50 Aligned_cols=30 Identities=20% Similarity=0.065 Sum_probs=13.0 Q ss_pred EEECCCCHHHHHHHHHHHHHH-HHHHCCCEE Q ss_conf 874267889999999999999-997389739 Q gi|254781051|r 440 FHVNGDDPEAVIRVVRMAVSF-RMKFHKSVV 469 (957) Q Consensus 440 ~HVN~DdpEav~~~~~lA~~y-R~~F~kDVv 469 (957) ..+.-+.-||+-.|.+.|.+. ..++.+|+- T Consensus 324 ~~~~~t~eeA~~~a~e~a~~~l~~~l~~~a~ 354 (383) T pfam06898 324 KVVKLTKEEAVEKGKKLAKKKLEKEIDKDAK 354 (383) T ss_pred EEEEECHHHHHHHHHHHHHHHHHHHCCCCCE T ss_conf 9999299999999999999999963799999 No 205 >cd07034 TPP_PYR_PFOR_IOR-alpha_like Pyrimidine (PYR) binding domain of pyruvate ferredoxin oxidoreductase (PFOR), indolepyruvate ferredoxin oxidoreductase alpha subunit (IOR-alpha), and related proteins. Thiamine pyrophosphate (TPP family), pyrimidine (PYR) binding domain, of pyruvate ferredoxin oxidoreductase (PFOR), indolepyruvate ferredoxin oxidoreductase (IOR) alpha subunit (IOR-alpha), and related proteins, subfamily. The PYR domain is found in many key metabolic enzymes which use TPP (also known as thiamine diphosphate) as a cofactor. TPP binds in the cleft formed by a PYR domain and a PP domain. The PYR domain, binds the aminopyrimidine ring of TPP, the PP domain binds the diphosphate residue. A polar interaction between the conserved glutamate of the PYR domain and the N1' of the TPP aminopyrimidine ring is shared by most TPP-dependent enzymes, and participates in the activation of TPP. The PYR and PP domains have a common fold, but do not share strong sequence conservation. Th Probab=31.97 E-value=31 Score=13.89 Aligned_cols=84 Identities=14% Similarity=0.086 Sum_probs=47.7 Q ss_pred CCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCC----CHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHH Q ss_conf 333467258876451067744365799984160201247----0102467784001122034058742678899999999 Q gi|254781051|r 380 AFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTN----PSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVR 455 (957) Q Consensus 380 AfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~----p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~ 455 (957) +-+|||+..=.=.|+...+-. .+.|-++++--..-|.. ..|..+.+|+. -|++.....|++.|...+. T Consensus 70 ~ts~~Gl~l~~e~l~~a~~~~-~p~Vi~~~~r~g~~~~~~~~~q~D~~~~~~~g-------~~~~vl~p~~~qE~~d~~~ 141 (160) T cd07034 70 ATSGPGLNLMAEALYLAAGAE-LPLVIVVAQRPGPSTGLPKPDQSDLMAARYGG-------HPWPVLAPSSVQEAFDLAL 141 (160) T ss_pred EECCCCHHHHHHHHHHHHHCC-CCEEEEEEECCCCCCCCCCCCCHHHHHHHHCC-------CCEEEEECCCHHHHHHHHH T ss_conf 306533988887999998478-88899997458886545554417899999679-------9879993799999999999 Q ss_pred HHHHHHHHHCCCEEEE Q ss_conf 9999999738973998 Q gi|254781051|r 456 MAVSFRMKFHKSVVID 471 (957) Q Consensus 456 lA~~yR~~F~kDVvID 471 (957) .|++.-.+++.-|++= T Consensus 142 ~A~~lae~~~~Pvi~~ 157 (160) T cd07034 142 EAFELAEKYRLPVIVL 157 (160) T ss_pred HHHHHHHHHCCCEEEE T ss_conf 9999999868998995 No 206 >COG0652 PpiB Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones] Probab=31.80 E-value=26 Score=14.51 Aligned_cols=15 Identities=20% Similarity=0.496 Sum_probs=8.0 Q ss_pred HHHCCCCCEEECCCC Q ss_conf 893799860351577 Q gi|254781051|r 805 SDMTCGSVFQAVLSD 819 (957) Q Consensus 805 ~d~~~g~~F~~vi~d 819 (957) +-+..|+.|++||++ T Consensus 33 ~g~Ydg~~FHRVi~~ 47 (158) T COG0652 33 EGFYDGTIFHRVIPG 47 (158) T ss_pred CCCCCCCEEEEEECC T ss_conf 277799567885158 No 207 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=31.02 E-value=18 Score=15.70 Aligned_cols=72 Identities=19% Similarity=0.339 Sum_probs=46.0 Q ss_pred HHHHHHCCCHHH----------CCCCCCCCCCCCCCCCCCHHHCCCCHHHCCCEEECCCCCCCCCCCHHHHHHHHHHHHC Q ss_conf 999988170454----------0167777776678776787666999788171572566578777789999999998406 Q gi|254781051|r 105 IDAYRSYGHFKA----------NIDPLGYNSHQKDLSELSPAHYGFVKADYDRKICMKGVLGLESATIPEIVDVLSHLYC 174 (957) Q Consensus 105 I~AYR~~GHl~A----------~lDPLgl~~~~~~~peL~~~~yGLse~DLd~~f~~~~~~g~~~~tL~eIi~~L~~tYC 174 (957) ||+|.+||=..+ .+.=|||+ ++-..++--...|| ..-+|.-..+++.||--+||++++.-- T Consensus 86 vN~f~QRg~~a~vlR~ip~~Ip~fe~LGLP--~~v~~~~a~~~~GL-------iLVTGPTGSGKSTTlAsmIDyIN~~~~ 156 (350) T TIGR01420 86 VNAFKQRGGVALVLRLIPSKIPTFEELGLP--RPVLRELAERPRGL-------ILVTGPTGSGKSTTLASMIDYINKNKA 156 (350) T ss_pred EHHHHHCCHHHHHHHHCCCCCCCHHHCCCC--HHHHHHHHHCCCCC-------EEEECCCCCCHHHHHHHHHHHHHCCCC T ss_conf 203235000642323115346216663798--78999998366993-------898768898678999999978740388 Q ss_pred CCE-----EEEEECCC Q ss_conf 430-----27772489 Q gi|254781051|r 175 SNI-----GVEFMHIV 185 (957) Q Consensus 175 gsI-----GvEymHI~ 185 (957) ++| =+||+|=+ T Consensus 157 ~HIiTIEDPIEyvh~~ 172 (350) T TIGR01420 157 GHIITIEDPIEYVHKN 172 (350) T ss_pred CCCEEEECCEEEEECC T ss_conf 8825631773141047 No 208 >KOG0628 consensus Probab=30.93 E-value=32 Score=13.77 Aligned_cols=116 Identities=24% Similarity=0.356 Sum_probs=62.6 Q ss_pred EEEEECCCCCCCCC-HHHHHHHHHC----CCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHH-------------- Q ss_conf 88862575333467-2588764510----6774436579998416020124701024677840011-------------- Q gi|254781051|r 372 PLIIHGDAAFAGQG-IVSETFGLSG----LSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDIS-------------- 432 (957) Q Consensus 372 piliHGDAAfaGQG-VV~Etl~ls~----l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiA-------------- 432 (957) .|-+|=|||+||-- ++.|-=.+.+ ...|.+.=.=-..|| |+-+|.--|+..+-|+.- T Consensus 265 glWLHVDAAYAGsa~iCpE~r~l~rGie~aDSfn~n~hK~~~vn----fDCs~lWvkd~~~~~~~~~vdp~yL~h~~~~~ 340 (511) T KOG0628 265 GLWLHVDAAYAGSAFICPEFRYLMRGIEYADSFNFNPHKWLLVN----FDCSPLWVKDGTKLSRAFNVDPLYLKHAYQGS 340 (511) T ss_pred CEEEEEEHHHCCCCCCCHHHHHHHHCCHHHCCCCCCHHHEEEEE----EEEECCEEECCCEEEEEEECCHHHHCCHHHCC T ss_conf 87899634531543158878877502233303357852603776----53000345047604666503827644433236 Q ss_pred ----HHHCCCEEEECC-------------CC-HHHHHHHHHHHHHHHHHHCCC--------EEEEEEECCCCCCCCCCCC Q ss_conf ----220340587426-------------78-899999999999999973897--------3998642112565643542 Q gi|254781051|r 433 ----KSIGIPIFHVNG-------------DD-PEAVIRVVRMAVSFRMKFHKS--------VVIDIVCYRRFGHNEGDEP 486 (957) Q Consensus 433 ----K~i~aPI~HVN~-------------Dd-pEav~~~~~lA~~yR~~F~kD--------VvIDlv~YRr~GHNE~DeP 486 (957) +--+.|.-|==. +. -+-+-+-.+||-+|-+--.+| ++.+|||||.-|+||.- T Consensus 341 ~~Dyrhwqipl~rRfRSLKlWfv~R~~Gve~lq~~iR~h~~La~~fe~lv~~d~~FE~~~~~~lgLvcFRlk~~N~~n-- 418 (511) T KOG0628 341 APDYRHWQIPLGRRFRSLKLWFVLRSYGVENLQNYIREHVRLAKEFETLVRADPRFEIVNKRILGLVCFRLKGDNEIN-- 418 (511) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCEEEEEECCCCHHH-- T ss_conf 888210115200113267899999988688999999999999999999860487415502210021577525874789-- Q ss_pred CCCCHHHHHH Q ss_conf 2355788888 Q gi|254781051|r 487 SFTQPMMYKT 496 (957) Q Consensus 487 ~fTQP~mY~~ 496 (957) .-+++.. T Consensus 419 ---e~Ll~~i 425 (511) T KOG0628 419 ---EALLNRL 425 (511) T ss_pred ---HHHHHHH T ss_conf ---9999999 No 209 >PRK01433 hscA chaperone protein HscA; Provisional Probab=30.77 E-value=26 Score=14.44 Aligned_cols=22 Identities=27% Similarity=0.391 Sum_probs=10.4 Q ss_pred HHHHHHHHHHHHHHCCCCEEEE Q ss_conf 9999999999987379107998 Q gi|254781051|r 246 IIPAIEEIIRQGVQQGVDEMIL 267 (957) Q Consensus 246 lIP~L~~ii~~a~~~Gv~eivi 267 (957) .+..++..++.|....+++|++ T Consensus 293 ~~~~v~~~L~~a~~~dId~ViL 314 (595) T PRK01433 293 TINIAQECLEQAGNPNIDGVIL 314 (595) T ss_pred HHHHHHHHHHHCCCCCCCEEEE T ss_conf 9999999999769353554899 No 210 >pfam04194 PDCD2_C Programmed cell death protein 2, C-terminal putative domain. Probab=30.68 E-value=24 Score=14.76 Aligned_cols=24 Identities=21% Similarity=0.501 Sum_probs=11.5 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCCHH Q ss_conf 6205677888998998733435642114 Q gi|254781051|r 581 FNTHKIVERLMANRQKMIETGKGIDWSM 608 (957) Q Consensus 581 f~~h~~v~k~~~~R~~~~~~~~~iDW~~ 608 (957) |++-|++...|+.. ..+.++|||+ T Consensus 117 fQlmP~li~~L~~~----~~~~~~dwGt 140 (169) T pfam04194 117 FQIMPQLISYLEVE----SEGVGMDWGT 140 (169) T ss_pred EEECHHHHHHHHCC----CCCCCCCEEE T ss_conf 77536788774035----6667842058 No 211 >COG2374 Predicted extracellular nuclease [General function prediction only] Probab=30.65 E-value=27 Score=14.40 Aligned_cols=15 Identities=47% Similarity=0.694 Sum_probs=6.9 Q ss_pred CCHHHHHHHHHCCCC Q ss_conf 672588764510677 Q gi|254781051|r 384 QGIVSETFGLSGLSG 398 (957) Q Consensus 384 QGVV~Etl~ls~l~g 398 (957) -|.|.|-+.|+++.. T Consensus 275 tG~V~Ey~~~tq~~~ 289 (798) T COG2374 275 TGTVSEYYGLTQLFA 289 (798) T ss_pred EEEEEEECCCCCCCC T ss_conf 888876316654112 No 212 >smart00087 PTH Parathyroid hormone. Probab=30.52 E-value=17 Score=15.85 Aligned_cols=19 Identities=26% Similarity=0.718 Sum_probs=16.0 Q ss_pred CCCCHHHHHHHHHHHCCCC Q ss_conf 4899899999999841457 Q gi|254781051|r 183 HIVDSIERDWVRNTIEDYD 201 (957) Q Consensus 183 HI~d~eer~Wlq~riE~~~ 201 (957) ||++.+.+.||++.||..- T Consensus 16 ~lq~~~R~~wL~~~le~vh 34 (36) T smart00087 16 HLQSLERREWLRKKLQDVH 34 (36) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 7999999999999998723 No 213 >TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=30.49 E-value=33 Score=13.71 Aligned_cols=53 Identities=25% Similarity=0.350 Sum_probs=32.1 Q ss_pred CCEECCCCCHH-----CC------------CCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEE Q ss_conf 60201247010-----24------------6778400112203405874267889999999999999997389739 Q gi|254781051|r 411 NQIGFTTNPSS-----AR------------SCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVV 469 (957) Q Consensus 411 NQIGFTT~p~d-----~R------------Ss~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVv 469 (957) |..||||+=.+ +| -.+|-=-..=++..=|.=+-|||.|||++++ .+||+.-|| T Consensus 92 yR~gFsTDL~E~DVifGrGEKKL~~aI~e~v~~Y~P~AvFVYsTCv~alIGDD~~AVCk~A------~~k~G~~Vi 161 (470) T TIGR01283 92 YRLGFSTDLTEKDVIFGRGEKKLFHAIREIVERYHPAAVFVYSTCVVALIGDDLEAVCKAA------AEKYGIPVI 161 (470) T ss_pred CCCCCCCCCCCCCEEECCHHHHHHHHHHHHHHHCCCCEEEEECCEECCEECCCHHHHHHHH------HHCCCCCEE T ss_conf 1366632660246673314478999999998632997489945713102448889999999------723586067 No 214 >pfam12076 Wax2_C WAX2 C-terminal domain. This presumed domain is functionally uncharacterized. This domain is found in eukaryotes. This domain is about 170 amino acids in length. This domain is found associated with pfam04116. This domain has a conserved LEGW sequence motif. This region has similarity to short chain dehydrogenases. Probab=30.46 E-value=33 Score=13.71 Aligned_cols=37 Identities=16% Similarity=0.364 Sum_probs=29.3 Q ss_pred HHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEEC Q ss_conf 7521437826972488999999999974353488689856 Q gi|254781051|r 751 LQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMA 790 (957) Q Consensus 751 LQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~t 790 (957) |-|| +.|+||+.. .-..|+.|++++-...+.-|+.-| T Consensus 16 ~aLC-~rgi~V~m~--~ke~~~~lk~e~~~~~~~~Lv~s~ 52 (164) T pfam12076 16 LALC-KKGIKVVML--SKEEFEKLKPEAPEKVQDNLVLST 52 (164) T ss_pred HHHH-HCCCEEEEE--CHHHHHHHHHHCCHHHHHHEEEEC T ss_conf 9998-669989995--689999999873887663378725 No 215 >pfam05320 Pox_RNA_Pol_19 Poxvirus DNA-directed RNA polymerase 19 kDa subunit. This family contains several DNA-directed RNA polymerase 19 kDa polypeptides. The Poxvirus DNA-directed RNA polymerase (EC: 2.7.7.6) catalyses DNA-template-directed extension of the 3'-end of an RNA strand by one nucleotide at a time. Probab=30.39 E-value=12 Score=16.95 Aligned_cols=17 Identities=18% Similarity=0.280 Sum_probs=8.8 Q ss_pred HHHHHHCCCHHHCCCCC Q ss_conf 99998817045401677 Q gi|254781051|r 105 IDAYRSYGHFKANIDPL 121 (957) Q Consensus 105 I~AYR~~GHl~A~lDPL 121 (957) |.-|..-|-++-..|-| T Consensus 80 iSLfEiTGIiaESyNLL 96 (167) T pfam05320 80 ISLFEITGIIAESYNLL 96 (167) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 33788778999888898 No 216 >COG5499 Predicted transcription regulator containing HTH domain [Transcription] Probab=30.27 E-value=28 Score=14.30 Aligned_cols=70 Identities=24% Similarity=0.290 Sum_probs=45.4 Q ss_pred HHHHHHH-HHHHHHHHCCCHHHCCCCCCCCCCCCCCCCCCHHHCCCCHHHCCCEEEC----CCCCCCCCC-CHHHHHHHH Q ss_conf 9999999-9999998817045401677777766787767876669997881715725----665787777-899999999 Q gi|254781051|r 96 KDFFQVM-KMIDAYRSYGHFKANIDPLGYNSHQKDLSELSPAHYGFVKADYDRKICM----KGVLGLESA-TIPEIVDVL 169 (957) Q Consensus 96 ~kq~rV~-~LI~AYR~~GHl~A~lDPLgl~~~~~~~peL~~~~yGLse~DLd~~f~~----~~~~g~~~~-tL~eIi~~L 169 (957) .+.++|. .||.||...-|..|-=||... - .--++-|||+.+||..+|-+ ++++.+..+ || |+|.+| T Consensus 36 ~dr~~Ilatl~eAyE~kh~~i~aP~pve~--I-----~t~Md~~glt~~dLa~~iGSks~vS~iL~~rraLTl-e~ikkL 107 (120) T COG5499 36 LDRADILATLIEAYEFKHYPIAAPDPVEV--I-----RTLMDQYGLTLADLANEIGSKSRVSNILSGRRALTL-EHIKKL 107 (120) T ss_pred HHHHHHHHHHHHHHHHHHCHHHCCCHHHH--H-----HHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHHHHH-HHHHHH T ss_conf 89999999998606655140316888899--9-----999999387188889986761889999732067669-999999 Q ss_pred HHHH Q ss_conf 9840 Q gi|254781051|r 170 SHLY 173 (957) Q Consensus 170 ~~tY 173 (957) .+-+ T Consensus 108 ~q~~ 111 (120) T COG5499 108 HQRF 111 (120) T ss_pred HHHH T ss_conf 9986 No 217 >pfam03652 UPF0081 Uncharacterized protein family (UPF0081). Probab=30.17 E-value=33 Score=13.68 Aligned_cols=62 Identities=11% Similarity=0.365 Sum_probs=39.7 Q ss_pred CCCCHHHHHHHHHHHHCCCEEEEEE--CCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7778999999999840643027772--48998999999998414577888798899999999999998999998 Q gi|254781051|r 158 ESATIPEIVDVLSHLYCSNIGVEFM--HIVDSIERDWVRNTIEDYDFSSNFSKEEQRDILDKLVRAEGFEKFID 229 (957) Q Consensus 158 ~~~tL~eIi~~L~~tYCgsIGvEym--HI~d~eer~Wlq~riE~~~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~ 229 (957) .....++..+.|++.| .|-++|. +.++.+-+.-+++ . ..++.+++...+.+.++--+|.||. T Consensus 70 ~~~~v~~f~~~L~~~~--~i~v~~~DEr~TS~~A~~~l~~---~-----~~~~~k~k~~iD~~AA~iILq~~L~ 133 (134) T pfam03652 70 QTKRVRKFARRLKKRF--GLPVELVDERLTTVEAERILRE---A-----GLSRKKRKEVVDSIAAVIILQSYLD 133 (134) T ss_pred HHHHHHHHHHHHHHHH--CCCEEEEECCCCHHHHHHHHHH---C-----CCCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 9999999999999861--9986886343279999999997---3-----7442102521679999999999866 No 218 >COG3608 Predicted deacylase [General function prediction only] Probab=30.12 E-value=33 Score=13.67 Aligned_cols=94 Identities=17% Similarity=0.299 Sum_probs=48.2 Q ss_pred EEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEE-CCCEECCCCCHHCCCCCCCC-HHHHHHC-CCEEEECCC Q ss_conf 04788862575333467258876451067744365799984-16020124701024677840-0112203-405874267 Q gi|254781051|r 369 RVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLII-NNQIGFTTNPSSARSCTYAS-DISKSIG-IPIFHVNGD 445 (957) Q Consensus 369 ~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVv-NNQIGFTT~p~d~RSs~Y~T-DiAK~i~-aPI~HVN~D 445 (957) -.|-=-+|||- ..|+....+- |.+|+.=.+-|||+||= =| .|.-..-++|+. |--.+.. .| =--+|. T Consensus 51 ~~l~ag~HGdE-l~G~~al~~L--i~~L~~a~i~GtV~iVP~aN------~~a~~~~~R~~p~d~~N~NR~fP-g~~dgs 120 (331) T COG3608 51 VLLQAGVHGDE-LPGVIALRRL--IPALDPADISGTVIIVPIAN------PPAFEAQGRFSPGDDTNLNRAFP-GRPDGS 120 (331) T ss_pred EEEEECCCCCC-CCHHHHHHHH--HHHCCHHHCCCEEEEEECCC------HHHHHHHCCCCCCCCCCCCCCCC-CCCCCC T ss_conf 89971245554-6438999999--97567754474399972258------88898614437787876334588-998988 Q ss_pred CHHHHHHHH-HHHHHHHHHHCCCEEEEEEECCC Q ss_conf 889999999-99999999738973998642112 Q gi|254781051|r 446 DPEAVIRVV-RMAVSFRMKFHKSVVIDIVCYRR 477 (957) Q Consensus 446 dpEav~~~~-~lA~~yR~~F~kDVvIDlv~YRr 477 (957) ..+...... ++.++ ..|+|+||=|-.. T Consensus 121 ~t~ria~~l~r~L~~-----~aD~VlDlHsg~~ 148 (331) T COG3608 121 ATERIADRLKRLLLP-----LADIVLDLHSGGE 148 (331) T ss_pred HHHHHHHHHHHHHHC-----CCCEEEECCCCCC T ss_conf 799999999986420-----3678998157997 No 219 >PRK13246 dihydrobiliverdin:ferredoxin oxidoreductase; Provisional Probab=30.04 E-value=33 Score=13.66 Aligned_cols=21 Identities=14% Similarity=0.263 Sum_probs=11.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999989999986 Q gi|254781051|r 210 EQRDILDKLVRAEGFEKFIDI 230 (957) Q Consensus 210 eK~~iL~~L~~Ae~FE~FL~~ 230 (957) --+.+|+++.-.|-=|+|++. T Consensus 205 Par~ll~~~FG~eWae~yi~~ 225 (236) T PRK13246 205 PADKLFKSFFGEKWTKKFINK 225 (236) T ss_pred CHHHHHHHHHCHHHHHHHHHH T ss_conf 788899997094999999986 No 220 >cd01815 BMSC_UbP_N BMSC_UbP (bone marrow stromal cell-derived ubiquitin-like protein) has an N-terminal ubiquitin-like (UBQ) domain and a C-terminal ubiquitin-associated (UBA) domain, a domain architecture similar to those of the UBIN, Chap1, and ubiquilin proteins. This CD represents the N-terminal ubiquitin-like domain. Probab=29.89 E-value=14 Score=16.54 Aligned_cols=46 Identities=35% Similarity=0.481 Sum_probs=28.6 Q ss_pred CCCHHHCCCCC-----------CCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHH Q ss_conf 66022108012-----------688789379986035157764334445444579545768998688578999 Q gi|254781051|r 790 APKSLLRHKRV-----------VSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDL 851 (957) Q Consensus 790 tPKsLLR~~~a-----------~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL 851 (957) .|||++|.|.+ +|+++.+..+. .+.-+|||. -+ -+|.|--|.--|+ T Consensus 1 ~pks~~~~pe~e~gd~~~ggy~vs~lKQLia~~-L~dS~PDPe--------------LI-~li~CG~~LkDd~ 57 (75) T cd01815 1 APKSLFPFPEVELGDVSPGGYQVSTLKQLIAAQ-LPDSLPDPE--------------LI-DLIHCGRKLKDDQ 57 (75) T ss_pred CCCCCCCCCCCCCCCCCCCCEEHHHHHHHHHHH-CCCCCCCHH--------------HH-HHHCCCCEECCCC T ss_conf 973323388887776599834378999999755-654699845--------------72-2211141314654 No 221 >KOG1520 consensus Probab=29.88 E-value=10 Score=17.57 Aligned_cols=175 Identities=17% Similarity=0.168 Sum_probs=85.0 Q ss_pred CCCCHHHHHHHHHHHH---HCCCEEEEECC-CCCCCCCCCCEEEEEECCCCCEEEEHH----HCCC-CCCEEEEEECCCC Q ss_conf 6421147898889886---35831578501-002233323027999778884455634----4278-9845999824311 Q gi|254781051|r 603 GIDWSMAESLAFGSLC---YEGYKVRLSGQ-DCERGTFSHRHAILCDQETGKRYFPLG----NISK-DQGHCEVVNSFLS 673 (957) Q Consensus 603 ~iDW~~AE~LA~~SLl---~eG~~VRlsGQ-D~~RGTFshRHavl~dq~t~~~y~pL~----~l~~-~q~~f~v~nS~LS 673 (957) .|+|.... |.++ .||-++|+..+ |+-+ --+++=..+-.+|.+=+ -+.. .-|++--||-.=- T Consensus 140 ~V~p~g~~----a~~l~~~~~G~~~kf~N~ldI~~------~g~vyFTDSSsk~~~rd~~~a~l~g~~~GRl~~YD~~tK 209 (376) T KOG1520 140 KVGPEGGL----AELLADEAEGKPFKFLNDLDIDP------EGVVYFTDSSSKYDRRDFVFAALEGDPTGRLFRYDPSTK 209 (376) T ss_pred EECCCCCC----CEECCCCCCCEEEEECCCEEECC------CCEEEEECCCCCCCHHHEEEEEECCCCCCCEEEECCCCC T ss_conf 97788870----03246612590566327536767------871999326245562265776622797664687468664 Q ss_pred CCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHH-HHHHHHHHHCCCCCCEEECCCCCCCCCCCCCCCCHHHHHH Q ss_conf 110134320221138653111000024424541022565-5420257623336813543688888887656440578875 Q gi|254781051|r 674 EQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQ-FITAGEQKWLCTSNLVCLLPHGYEGQGPEHSSARLERFLQ 752 (957) Q Consensus 674 E~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDq-fiss~e~KW~~~sglvllLPHGyeGqGPEHSSaRlERfLQ 752 (957) +.-||-=+. -|+||-|.-.|| |+--+|+-=.|..-.-+ .||--.++ |=|-| T Consensus 210 ~~~VLld~L------------------~F~NGlaLS~d~sfvl~~Et~~~ri~rywi--------~g~k~gt~--EvFa~ 261 (376) T KOG1520 210 VTKVLLDGL------------------YFPNGLALSPDGSFVLVAETTTARIKRYWI--------KGPKAGTS--EVFAE 261 (376) T ss_pred CHHHHHHCC------------------CCCCCCCCCCCCCEEEEEEECCCEEEEEEE--------CCCCCCCH--HHHHH T ss_conf 044145216------------------455661088899889999502202466684--------47866726--56751 Q ss_pred -HC---------CCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCHH------HCC--CCCCCCHHHH-CCCCCE Q ss_conf -21---------4378269724889999999999743534886898566022------108--0126887893-799860 Q gi|254781051|r 753 -MC---------AENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKSL------LRH--KRVVSSLSDM-TCGSVF 813 (957) Q Consensus 753 -l~---------a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKsL------LR~--~~a~S~l~d~-~~g~~F 813 (957) |- .+.++.|...|.-... .|.-+..++-+-+++++|+.. ... |. |.+... .+| .. T Consensus 262 ~LPG~PDNIR~~~~G~fWVal~~~~~~~---~~~~~~~p~vr~~~~~~~~~~~~~~~~~~~~~p~--~~V~~~d~~G-~i 335 (376) T KOG1520 262 GLPGYPDNIRRDSTGHFWVALHSKRSTL---WRLLMKYPWVRKFIAKLPKYMELLYFLNNGGKPH--SAVKLSDETG-KI 335 (376) T ss_pred CCCCCCCCEEECCCCCEEEEEECCCCHH---HHHHHCCHHHHHHHHHHCCCHHHHHHHHCCCCCC--EEEEEECCCC-CE T ss_conf 5999986531989998899874266567---8765118279888876131033333343257873--5899955898-48 Q ss_pred EECCCCCC Q ss_conf 35157764 Q gi|254781051|r 814 QAVLSDDA 821 (957) Q Consensus 814 ~~vi~d~~ 821 (957) -+++.|.. T Consensus 336 l~~lhD~~ 343 (376) T KOG1520 336 LESLHDKE 343 (376) T ss_pred EEEEECCC T ss_conf 89983588 No 222 >cd00671 ArgRS_core This is the catalytic core domain of Arginyl tRNA synthetase (ArgRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. There are at least three subgroups of ArgRS. One type contains both characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding. The other subtype lacks the KMSKS motif; however, it has a lysine N-terminal to the HIGH motif, which serves as the functional counterpart to the second lysine of the KMSKS motif. A third group, which is found primarily in archaea and a few bacteria, lacks both the KMSKS motif and the HIGH loop lysine. Probab=29.57 E-value=34 Score=13.60 Aligned_cols=20 Identities=5% Similarity=-0.162 Sum_probs=11.7 Q ss_pred HHHHHHHHCCCHHHCCCCCC Q ss_conf 99999988170454016777 Q gi|254781051|r 103 KMIDAYRSYGHFKANIDPLG 122 (957) Q Consensus 103 ~LI~AYR~~GHl~A~lDPLg 122 (957) .|.|-++..||..-..+=+| T Consensus 28 slaril~~~G~~V~r~~yvn 47 (267) T cd00671 28 SLARILEFAGYDVTREYYIN 47 (267) T ss_pred HHHHHHHHCCCEEEEEEECC T ss_conf 99999998799588988507 No 223 >pfam00425 Chorismate_bind chorismate binding enzyme. This family includes the catalytic regions of the chorismate binding enzymes anthranilate synthase, isochorismate synthase, aminodeoxychorismate synthase and para-aminobenzoate synthase. Probab=29.46 E-value=17 Score=15.87 Aligned_cols=40 Identities=23% Similarity=0.229 Sum_probs=29.7 Q ss_pred CCHHHC-CCEEECCCCCCCCCCCHHHHHHHHHH----HHCCCEEE Q ss_conf 997881-71572566578777789999999998----40643027 Q gi|254781051|r 140 FVKADY-DRKICMKGVLGLESATIPEIVDVLSH----LYCSNIGV 179 (957) Q Consensus 140 Lse~DL-d~~f~~~~~~g~~~~tL~eIi~~L~~----tYCgsIGv 179 (957) .+..|+ ..-++++.+.|.+...-.++|..++. .|||.+|+ T Consensus 163 ~~~~~ll~~lhPt~av~G~Pk~~A~~~I~~~E~~~Rg~Y~G~~G~ 207 (239) T pfam00425 163 LSLLDLLAALHPTGAVTGAPKKRAMEIIAELEPFDRGLYAGAVGW 207 (239) T ss_pred CCHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEEEE T ss_conf 898999997299866788976999999998559988775789999 No 224 >PRK06923 isochorismate synthase DhbC; Validated Probab=29.42 E-value=21 Score=15.21 Aligned_cols=17 Identities=24% Similarity=0.544 Sum_probs=8.5 Q ss_pred EEEECCCC--CCCCCHHHH Q ss_conf 88625753--334672588 Q gi|254781051|r 373 LIIHGDAA--FAGQGIVSE 389 (957) Q Consensus 373 iliHGDAA--faGQGVV~E 389 (957) .++.|+.+ |+|=|||++ T Consensus 350 a~v~~~~~~l~AGaGIV~d 368 (399) T PRK06923 350 AEVEENTLRLYAGAGVVAG 368 (399) T ss_pred EEEECCEEEEEEEEEEECC T ss_conf 9998999999976689678 No 225 >cd02754 MopB_Nitrate-R-NapA-like Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite. Monomeric Nas is located in the cytoplasm and participates in nitrogen assimilation. Dimeric Nap is located in the periplasm and is coupled to quinol oxidation via a membrane-anchored tetraheme cytochrome. Members of the MopB_Nitrate-R-NapA CD belong to the molybdopterin_binding (MopB) superfamily of proteins. Probab=29.10 E-value=27 Score=14.31 Aligned_cols=61 Identities=11% Similarity=0.187 Sum_probs=34.1 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCC Q ss_conf 9986448989899999999999997756665412587656666568626667789999999988743689986 Q gi|254781051|r 509 DSLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPEKLGLLHNGENERKNSVSKEILKKIGSSIIRLPKSF 581 (957) Q Consensus 509 ~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~~~~~~~~~~~~~~T~v~~~~L~~i~~~l~~~Pe~f 581 (957) +.|+++|.++++.+++....+.. -.+..+.|.|+ +...-|||+.+.+++++..+.+-+... T Consensus 225 ~~ii~~~~~D~~fv~~~t~gfe~----~~~~v~~~tpe--------~aa~itGv~~~~I~~lA~~~a~~~~~~ 285 (565) T cd02754 225 HVLIEEGLIDRDFIDAHTEGFEE----LKAFVADYTPE--------KVAEITGVPEADIREAARLFGEARKVM 285 (565) T ss_pred EEEEECCCCCHHHHHHCCCCHHH----HHHHHHCCCHH--------HHHHHCCCCHHHHHHHHHHHHHCCCCE T ss_conf 58420565142346641442677----77776328887--------766752988899999999987177718 No 226 >KOG2560 consensus Probab=29.05 E-value=25 Score=14.64 Aligned_cols=87 Identities=11% Similarity=0.113 Sum_probs=62.9 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHH Q ss_conf 33467258876451067744365799984160201247010246778400112203405874267889999999999999 Q gi|254781051|r 381 FAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSF 460 (957) Q Consensus 381 faGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~y 460 (957) ++||++-++.+||---.+|.-|+-+|+|-+ | .-.|.+....+.+-++ T Consensus 291 ~SGe~~~~~qlq~FAweA~~kG~~~H~~A~--------P-------------------------tk~Ell~k~~k~kke~ 337 (529) T KOG2560 291 NSGEALEFNQLQMFAWEAFDKGVDVHMQAD--------P-------------------------TKLELLYKENKVKKET 337 (529) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCEEEEECC--------C-------------------------HHHHHHHHHHHHHHHH T ss_conf 053145579988999998605640232138--------0-------------------------7888999873888999 Q ss_pred HHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHC Q ss_conf 9973897399864211256564354223557888886718 Q gi|254781051|r 461 RMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSH 500 (957) Q Consensus 461 R~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~h 500 (957) =++-.++-++|-.|=--++--+.++--.+|-.-|-.+..+ T Consensus 338 lK~q~kq~il~kYG~~~~~d~~p~elll~qte~~iey~rk 377 (529) T KOG2560 338 LKKQTKQEILDKYGGGEHKDLPPKELLLAQTEEYIEYSRK 377 (529) T ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHCCCCHHHHHHHHC T ss_conf 9887789998612875224457676650132788887641 No 227 >cd05816 CBM20_DPE2_repeat2 Disproportionating enzyme 2 (DPE2), N-terminal CBM20 (carbohydrate-binding module, family 20) domain, repeat 2. DPE2 is a transglucosidase that is essential for the cytosolic metabolism of maltose in plant leaves at night. Maltose is an intermediate on the pathway from starch to sucrose and DPE2 is thought to metabolize the maltose that is exported from the chloroplast. DPE2 has two N-terminal CBM20 domains as well as a C-terminal amylomaltase (4-alpha-glucanotransferase) catalytic domain. DPE1, the plastid version of this enzyme, has a transglucosidase domain that is similar to that of DPE2 but lacks the N-terminal CBM20 domains. Included in this group are PDE2-like proteins from Dictyostelium, Entamoeba, and Bacteroides. The CBM20 domain is found in a large number of starch degrading enzymes including alpha-amylase, beta-amylase, glucoamylase, and CGTase (cyclodextrin glucanotransferase). CBM20 is also present in proteins that have a regulatory role in star Probab=28.88 E-value=35 Score=13.52 Aligned_cols=67 Identities=19% Similarity=0.374 Sum_probs=43.4 Q ss_pred HHHHCCCEEEEECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCC-CCEE Q ss_conf 886358315785010022333230279997788844556344278984599982431111013432022113865-3111 Q gi|254781051|r 616 SLCYEGYKVRLSGQDCERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLN-ALTI 694 (957) Q Consensus 616 SLl~eG~~VRlsGQD~~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~-~L~i 694 (957) ..+..|-.|.++|-..+=|--.--+|+.-....+..| .+.+.+-++. .-|||=|-+.+.+ ..++ T Consensus 9 p~~~~Gq~v~vvG~~~~LG~W~~~~a~~L~~~~~~~W---------~~~v~lp~~~------~~~EYKyvi~~~~~~~~~ 73 (99) T cd05816 9 PYVPKGQSVYVTGSSPELGNWDPQKALKLSDVGFPIW---------EADIDISKDS------FPFEYKYIIANKDSGVVS 73 (99) T ss_pred CCCCCCCEEEEECCHHHHCCCCHHHCEECCCCCCCCE---------EEEEEECCCC------CCEEEEEEEEECCCCEEE T ss_conf 8789999999987958987997555662578889989---------9999978999------658999999969999189 Q ss_pred ECC Q ss_conf 000 Q gi|254781051|r 695 WEA 697 (957) Q Consensus 695 WEA 697 (957) ||. T Consensus 74 WE~ 76 (99) T cd05816 74 WEN 76 (99) T ss_pred EEC T ss_conf 702 No 228 >cd02128 PA_TfR PA_TfR: Protease-associated domain containing proteins like transferrin receptor (TfR). This group contains various PA domain-containing proteins similar to human TfR1 and TfR2. TfR1 and TfR2 are type II membrane proteins, belonging to the peptidase M28 family. TfR1 is homodimeric, widely expressed, and a key player in the uptake of iron-loaded transferrin (Tf) into cells. The TfR1 homodimer binds two molecules of Tf and this complex is internalized. In addition to its role in iron uptake, TfR1 may participate in cell growth and proliferation. TfR2 also binds Tf but with a significantly lower affinity than does TfR1. TfR2 is expressed chiefly in hepatocytes, hematopoietic cells, and duodenal crypt cells; its expression overlaps with that of hereditary hemochromatosis protein (HFE). TfR2 is involved in iron homeostasis. HFE and TfR2 interact in cells. By one model for serum iron sensing, at low or basal iron concentrations, HFE and TFR1 form a complex at the plasma membra Probab=28.80 E-value=35 Score=13.51 Aligned_cols=36 Identities=31% Similarity=0.417 Sum_probs=23.8 Q ss_pred HHHHHHHHCCCC-CCCCCCCCCCCCCCCEEEEEEECCCEEEEE Q ss_conf 899999855887-432234555100150268898359618999 Q gi|254781051|r 286 PRSIFYEFKGKG-PSEKEYSGDVKYHLGLCCNRQICGKDVKLL 327 (957) Q Consensus 286 ~~~iF~EF~g~~-~~~~~~sGDVKYHLG~s~~~~~~g~~v~ls 327 (957) .+.|++...|.. |.++.+ ||+.||+|-.. +++|+|+ T Consensus 146 A~~lL~~L~G~~~P~~WqG-g~~tY~~Gpg~-----~~~v~l~ 182 (183) T cd02128 146 AAKLLSKMGGPVCPSGWKG-GDSTCRLGTSS-----SKNVKLT 182 (183) T ss_pred HHHHHHHCCCCCCCCCCCC-CCCCEEECCCC-----CCEEEEE T ss_conf 9999997599989813417-98886747998-----8648875 No 229 >PRK05183 hscA chaperone protein HscA; Provisional Probab=28.68 E-value=35 Score=13.50 Aligned_cols=25 Identities=16% Similarity=0.401 Sum_probs=10.7 Q ss_pred EEECCCCCCCCCHHHHHHHHHCCCC Q ss_conf 8625753334672588764510677 Q gi|254781051|r 374 IIHGDAAFAGQGIVSETFGLSGLSG 398 (957) Q Consensus 374 liHGDAAfaGQGVV~Etl~ls~l~g 398 (957) ..-|...++-.-.---.|.|+++|- T Consensus 442 V~qGEr~~~~dN~~Lg~f~L~gIpp 466 (621) T PRK05183 442 VVQGERELVSDCRSLARFELRGIPP 466 (621) T ss_pred EEEECCHHHHCCCEEEEEEECCCCC T ss_conf 9973554452472771799779899 No 230 >KOG1119 consensus Probab=28.49 E-value=15 Score=16.24 Aligned_cols=32 Identities=34% Similarity=0.661 Sum_probs=23.1 Q ss_pred CCHHHHHHCCCCCC--CCCCEEECCCCCCCCCCCHHHHHH Q ss_conf 10134320221138--653111000024424541022565 Q gi|254781051|r 675 QAVLGFEYGYSLNN--LNALTIWEAQFGDFANGAQVILDQ 712 (957) Q Consensus 675 ~avlgFEyGys~~~--p~~L~iWEAQFGDF~NgAQviiDq 712 (957) -|+-||+|-+.+++ .+.=.+|+ -|||-||||. T Consensus 122 GGCsGFQYkf~LD~~in~dD~vf~------e~~arVVvD~ 155 (199) T KOG1119 122 GGCSGFQYKFRLDNKINNDDRVFV------ENGARVVVDN 155 (199) T ss_pred CCCCCEEEEEEECCCCCCCCEEEE------ECCCEEEEEC T ss_conf 870335888884477787665786------1880899853 No 231 >TIGR03142 cytochro_ccmI cytochrome c-type biogenesis protein CcmI. This TPR repeat-containing protein is the CcmI protein (also called CycH) of c-type cytochrome biogenesis. CcmI is thought to act as an apo-cytochrome c chaperone. This model describes the N-terminal region of the protein, Members of this protein family Probab=28.39 E-value=36 Score=13.46 Aligned_cols=38 Identities=21% Similarity=0.364 Sum_probs=29.8 Q ss_pred HHHHHHHHH------HCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999986------44898989999999999999775666541 Q gi|254781051|r 504 LQLYADSLM------KNQVISKQELQSLANNWHKYLEAEYKESE 541 (957) Q Consensus 504 ~~~Y~~~Li------~~g~it~~e~~~~~~~~~~~l~~~~~~~k 541 (957) ..+|.++|. ++|+|+++|.+..+.+....|=++.+..+ T Consensus 38 ~~iyrdqL~ELe~d~~~G~l~~~e~~~~r~El~rrLL~d~~~~~ 81 (117) T TIGR03142 38 LAVYRDRLAELERDLAEGLLDEAEAEAARAELQRRLLADIPAAE 81 (117) T ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 99999999999999986998999999999999999986774233 No 232 >PRK13566 anthranilate synthase; Provisional Probab=28.31 E-value=18 Score=15.65 Aligned_cols=72 Identities=19% Similarity=0.306 Sum_probs=45.0 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCCCCCCC---CCCCCCCCCEEEEE--EECCCE Q ss_conf 9999999987379107998067655268999885379899999855887432234---55510015026889--835961 Q gi|254781051|r 249 AIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGPSEKEY---SGDVKYHLGLCCNR--QICGKD 323 (957) Q Consensus 249 ~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~~~~~~---sGDVKYHLG~s~~~--~~~g~~ 323 (957) +.+.+-+....-.+=+||++-- +.--+..+|-++|....-.+|..+.+ -||-.|=.|.|-+. ...|+. T Consensus 246 ~V~kake~i~~GDiFQvVlSqr-------f~~~~~~~p~~~Yr~LR~~NPSPYm~~~~~gd~~~lvGsSPE~lvrv~~~~ 318 (724) T PRK13566 246 LVEKAKESFRRGDLFEVVPGQT-------FYEPCERSPSELFRRLKRINPSPYGFLINLGDGEYLVGASPEMFVRVDGRR 318 (724) T ss_pred HHHHHHHHHHCCCEEEECCCEE-------ECCCCCCCHHHHHHHHHHCCCCCEEEEEECCCCCEEEECCCHHEEEEECCE T ss_conf 9999999985598699847157-------734589999999999970699973899987999889955842508986998 Q ss_pred EEEE Q ss_conf 8999 Q gi|254781051|r 324 VKLL 327 (957) Q Consensus 324 v~ls 327 (957) |... T Consensus 319 v~t~ 322 (724) T PRK13566 319 VETC 322 (724) T ss_pred EEEE T ss_conf 9995 No 233 >PRK01889 ribosome-associated GTPase; Reviewed Probab=28.29 E-value=24 Score=14.72 Aligned_cols=26 Identities=19% Similarity=0.285 Sum_probs=19.9 Q ss_pred CCCCCCCCHHHHHHHCCCCCEEEEEC Q ss_conf 87656440578875214378269724 Q gi|254781051|r 739 GPEHSSARLERFLQMCAENNMYVANC 764 (957) Q Consensus 739 GPEHSSaRlERfLQl~a~~N~~V~~~ 764 (957) .|+-+-.||||||-++.+.++..+.+ T Consensus 122 ~~~fn~~rieRyLv~a~~~g~~pviv 147 (353) T PRK01889 122 NHDFNLRRIERYLALAWESGAEPVVV 147 (353) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 99979789999999999869967999 No 234 >TIGR00612 ispG_gcpE 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; InterPro: IPR004588 This protein previously of unknown biochemical function is essential in Escherichia coli. It has now been characterised as 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate synthase, which converts 2C-methyl-D-erythritol 2,4-cyclodiphosphate (ME-2,4CPP) into 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate in the sixth step of nonmevalonate terpenoid biosynthesis. The family is restricted to bacteria, where it is widely but not universally distributed. No homology can be detected between this family and other proteins.; GO: 0016114 terpenoid biosynthetic process. Probab=28.26 E-value=36 Score=13.45 Aligned_cols=157 Identities=21% Similarity=0.267 Sum_probs=87.5 Q ss_pred CCCHHHCCCCHHHCCCEEECCCC----CCCCCCCHHHHHHHHHHHHCC-CEEEEEECCCCHHHHHHHHHHHCCCCCCCCC Q ss_conf 67876669997881715725665----787777899999999984064-3027772489989999999984145778887 Q gi|254781051|r 132 ELSPAHYGFVKADYDRKICMKGV----LGLESATIPEIVDVLSHLYCS-NIGVEFMHIVDSIERDWVRNTIEDYDFSSNF 206 (957) Q Consensus 132 eL~~~~yGLse~DLd~~f~~~~~----~g~~~~tL~eIi~~L~~tYCg-sIGvEymHI~d~eer~Wlq~riE~~~~~~~~ 206 (957) ..+|.++|+.+.|..++++++.- +-+-.-++++|++.+|+.=|. .||| |.=++ T Consensus 114 RINPGNig~k~~~~~~~~Y~~~~yA~~~~~~~e~v~~~Ve~ar~~G~a~RIGV----------------------N~GSL 171 (633) T TIGR00612 114 RINPGNIGFKKKDAKRKIYSDAKYAEILLKVEERVRDVVEKARRHGKAMRIGV----------------------NHGSL 171 (633) T ss_pred EECCCCCCCCCCHHHCCCCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEEC----------------------CCCCC T ss_conf 43787643467012004564257888630046788999999985789687503----------------------56766 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCC----HHHHHHHHHHHHHHHHHCCCC--EEEEECCCC-CHHHHHH Q ss_conf 988999999999999989999986403362222564----658999999999998737910--799806765-5268999 Q gi|254781051|r 207 SKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFGADG----SEVIIPAIEEIIRQGVQQGVD--EMILGMAHR-GRLNVLS 279 (957) Q Consensus 207 s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsLEG----~EslIP~L~~ii~~a~~~Gv~--eiviGMaHR-GRLNVL~ 279 (957) +.+-|+++|++= |.|-|++.- .|+|+--==.-++.+-+.|.. ++|+-|=== =--+|=| T Consensus 172 ~~~~~~~ll~~Y---------------g~~~~~~~~qev~~EA~V~SAl~~a~~~e~~~f~rd~~v~S~KaS~v~~~vaA 236 (633) T TIGR00612 172 EQVLKRRLLEKY---------------GDKGFSLTAQEVTAEAMVQSALESAEILEKLDFRRDNVVLSMKASDVAETVAA 236 (633) T ss_pred HHHHHHHHHHHH---------------CCCCCCCCHHHCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCHHHHHHH T ss_conf 089999999971---------------54683217431587899999999999998508944607999841876789999 Q ss_pred HHHCCCHHHHHHHHCC---C-CCCCC-------CCCCCCCCCCCEEEEEEEC-CCEEEEEECCCC Q ss_conf 8853798999998558---8-74322-------3455510015026889835-961899971787 Q gi|254781051|r 280 QIMNKLPRSIFYEFKG---K-GPSEK-------EYSGDVKYHLGLCCNRQIC-GKDVKLLLQSNP 332 (957) Q Consensus 280 Nvl~Kp~~~iF~EF~g---~-~~~~~-------~~sGDVKYHLG~s~~~~~~-g~~v~lsL~~NP 332 (957) | .+|++=.+ - .|..- +-+|=||.=+|.+.=..-+ |-+|.+||+..| T Consensus 237 y-------~~lA~~~D~~G~~~PLHLGvTEAG~g~~GivKS~~gig~LL~~GIGDTiRvSLt~~P 294 (633) T TIGR00612 237 Y-------RLLAERSDELGMLYPLHLGVTEAGMGVKGIVKSSVGIGILLAEGIGDTIRVSLTDDP 294 (633) T ss_pred H-------HHHHHHHHHCCCCCCCCCCCEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 9-------999999865035252425412028888734566789999998458957998537888 No 235 >pfam01786 AOX Alternative oxidase. The alternative oxidase is used as a second terminal oxidase in the mitochondria, electrons are transfered directly from reduced ubiquinol to oxygen forming water. This is not coupled to ATP synthesis and is not inhibited by cyanide, this pathway is a single step process. In rice the transcript levels of the alternative oxidase are increased by low temperature. Probab=28.13 E-value=36 Score=13.43 Aligned_cols=34 Identities=18% Similarity=0.249 Sum_probs=14.3 Q ss_pred HHHHCCCHHHCCCCCCCCCCCCCCCCCCHHHCCCCH Q ss_conf 998817045401677777766787767876669997 Q gi|254781051|r 107 AYRSYGHFKANIDPLGYNSHQKDLSELSPAHYGFVK 142 (957) Q Consensus 107 AYR~~GHl~A~lDPLgl~~~~~~~peL~~~~yGLse 142 (957) |...|-|...++|--.+. ....|++...+|+|.+ T Consensus 153 Av~tYT~~l~~id~Gki~--n~~aP~iAi~Yw~L~~ 186 (214) T pfam01786 153 AVVTYTHFLKDIEAGKLP--NVPAPEIAIDYWRLPE 186 (214) T ss_pred HHHHHHHHHHHHHCCCCC--CCCCCHHHHHHCCCCC T ss_conf 898999999998579989--9999778998749997 No 236 >KOG4156 consensus Probab=28.06 E-value=32 Score=13.79 Aligned_cols=58 Identities=19% Similarity=0.308 Sum_probs=27.5 Q ss_pred CCCCEEEECCCHHHCCCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCCCCCCHHHCCEEEE--ECCHHHHH Q ss_conf 4886898566022108012688789379986035157764334445444579545768998--68857899 Q gi|254781051|r 782 SSRPLIMMAPKSLLRHKRVVSSLSDMTCGSVFQAVLSDDAEYHGKTSVKLKEDSHIRRVIL--CTGKVYYD 850 (957) Q Consensus 782 ~rkPLiv~tPKsLLR~~~a~S~l~d~~~g~~F~~vi~d~~~~~~~~~~~~~~~~~v~rvi~--csGKvyyd 850 (957) -|-||.-|--..|=- ..--+.|+.+|+ |-.--.- ......+.+.++-..+ ||||+--. T Consensus 892 prlplasmdenamDa---nmDE~~Dl~t~k-~~SQ~~~-------~~~~~sDkke~~E~~~nl~SGkFTsq 951 (1329) T KOG4156 892 PRLPLASMDENAMDA---NMDELLDLCTGK-FTSQAEK-------HLPRKSDKKENMEELLNLCSGKFTSQ 951 (1329) T ss_pred CCCCCCCCCHHHHCC---CCHHHHHHHHCC-CCHHHHC-------CCCCCCCHHHHHHHHHHHCCCCCCHH T ss_conf 566533110443314---502777777455-7605540-------69863116789999976326754010 No 237 >COG5097 MED6 RNA polymerase II transcriptional regulation mediator [Transcription] Probab=28.03 E-value=14 Score=16.51 Aligned_cols=24 Identities=25% Similarity=0.514 Sum_probs=17.4 Q ss_pred HHHHHHHHHC-----CCHHHHHHHHHHHC Q ss_conf 7888886718-----99899999998644 Q gi|254781051|r 491 PMMYKTIRSH-----KSVLQLYADSLMKN 514 (957) Q Consensus 491 P~mY~~I~~h-----p~v~~~Y~~~Li~~ 514 (957) |+||-.|..| |+...||.-++... T Consensus 98 pl~~Y~i~g~NIY~aPt~y~i~~~Rm~~s 126 (210) T COG5097 98 PLMYYIIHGHNIYAAPTNYSIYRCRMGDS 126 (210) T ss_pred CCEEEEEECCCEEECCCHHHHHHHHHHHH T ss_conf 30489996672030762243565553014 No 238 >TIGR00576 dut dUTP diphosphatase; InterPro: IPR008181 Synonym(s): dUTP diphosphatase, Deoxyuridine-triphosphatase. The essential enzyme dUTP pyrophosphatase (3.6.1.23 from EC) is specific for dUTP and is critical for the fidelity of DNA replication and repair. dUTPase hydrolyzes dUTP to dUMP and pyrophosphate, simultaneously reducing dUTP levels and providing the dUMP for dTTP biosynthesis. dUTPase decreases the intracellular concentration of dUPT so that uracil cannot be incorporated into DNA . The crystal structure of human dUTPase reveals that each subunit of the dUTPase trimer folds into an eight-stranded jelly-roll beta barrel, with the C-terminal beta strands interchanged among the subunits. The structure is similar to that of the Escherichia coli enzyme, despite low sequence homology between the two enzymes . Other enzymes like deoxycytidine triphosphate deaminase (dCTP) (3.5.4.13 from EC) that specifically bind uridine also belong to this group suggesting that the signature may recognise a putative uridine-binding motif. Some retroviruses encode dUTPases. Retroviral dUTPase is synthesised as part of POL polyprotein that contains; an aspartyl protease, a reverse transcriptase, dUTPase and RNase H. ; GO: 0016787 hydrolase activity, 0046080 dUTP metabolic process. Probab=27.96 E-value=22 Score=15.09 Aligned_cols=17 Identities=35% Similarity=0.722 Sum_probs=13.2 Q ss_pred CCCCCCEEECCCCCCCC Q ss_conf 33368135436888888 Q gi|254781051|r 722 LCTSNLVCLLPHGYEGQ 738 (957) Q Consensus 722 ~~~sglvllLPHGyeGq 738 (957) +-..||.+-||+||+|| T Consensus 45 Lv~TG~~~~lP~g~~g~ 61 (151) T TIGR00576 45 LVPTGIAIELPEGYYGR 61 (151) T ss_pred EECCCEEEECCCCCEEE T ss_conf 75244299848994899 No 239 >TIGR01161 purK phosphoribosylaminoimidazole carboxylase, ATPase subunit; InterPro: IPR005875 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. This family represents PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, which catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. PurK belongs to the ATP grasp superfamily of C-N ligase enzymes. Each subunit of PurK is composed of three domains (A, B, and C). The B domain contains a flexible, glycine-rich loop (B loop, T123-G130) that is disordered in the sulphate-PurK structure and becomes ordered in the MgADP-PurK structure. MgADP is wedged between the B and C domains, as with all members of the ATP grasp superfamily. Other enzymes in this superfamily contain a conserved Omega loop proposed to interact with the B loop, define the specificity of their nonnucleotide substrate, and protect the acyl phosphate intermediate formed from this substrate. PurK contains a minimal Omega loop without conserved residues. In the reaction catalyzed by PurK, carboxyphosphate is the putative acyl phosphate intermediate. The sulphate of the sulphate ion-liganded PurK interacts electrostatically with Arg 242 and the backbone amide group of Asn 245, components of the J loop of the C domain. This sulphate may reveal the location of the carboxyphosphate binding site. Conserved residues within the C-terminus of the C domain define a pocket that is proposed to bind AIR in collaboration with an N-terminal strand loop helix motif in the A domain (P loop, G8-L1). The P loop is proposed to bind the phosphate of AIR on the basis of similar binding sites observed in PurN and PurE and proposed in PurD and PurT, four other enzymes in the purine pathway .; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process, 0009320 phosphoribosylaminoimidazole carboxylase complex. Probab=27.79 E-value=21 Score=15.11 Aligned_cols=121 Identities=19% Similarity=0.208 Sum_probs=57.0 Q ss_pred HHHHHHHHCCCHHHCCCCCCCCCCCCCCCCCCHHHCCCCHHHCCCEEECCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEE Q ss_conf 99999988170454016777777667877678766699978817157256657877778999999999840643027772 Q gi|254781051|r 103 KMIDAYRSYGHFKANIDPLGYNSHQKDLSELSPAHYGFVKADYDRKICMKGVLGLESATIPEIVDVLSHLYCSNIGVEFM 182 (957) Q Consensus 103 ~LI~AYR~~GHl~A~lDPLgl~~~~~~~peL~~~~yGLse~DLd~~f~~~~~~g~~~~tL~eIi~~L~~tYCgsIGvEym 182 (957) +|+.|=+..|-..+=|||=. ..|. ..+. |=...+..... +..-++++.+ =|.=|.+||- T Consensus 13 M~~~aA~~LG~~~~VL~~~~---~~PA-~q~A---------d~~~~v~a~~~---d~~~i~~La~-----~cDviT~E~E 71 (386) T TIGR01161 13 MLALAAKKLGIKVAVLDPDA---NSPA-KQVA---------DSREHVLASFT---DPEAIRELAE-----ACDVITFEFE 71 (386) T ss_pred HHHHHHHHCCCEEEEECCCC---CCCC-CCCC---------CCCCEEEEEEC---CHHHHHHHHH-----HCCEEEEEEC T ss_conf 99998712795899854898---9731-1128---------98557998514---8789999997-----6565543101 Q ss_pred CCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECC-C--CHHHHHHHHH Q ss_conf 4899899999999841457788879889999999999999899999864033622225-6--4658999999 Q gi|254781051|r 183 HIVDSIERDWVRNTIEDYDFSSNFSKEEQRDILDKLVRAEGFEKFIDIKYKGAKRFGA-D--GSEVIIPAIE 251 (957) Q Consensus 183 HI~d~eer~Wlq~riE~~~~~~~~s~eeK~~iL~~L~~Ae~FE~FL~~Kf~g~KRFsL-E--G~EslIP~L~ 251 (957) ||..+.-.+=..+. .+ .+...+.+-=..+.+|+.+=+ ||.+-=.-+=||.+ + .-+++-.+++ T Consensus 72 hv~~~~L~~L~~~g-~~--~~~~P~~~~l~~~~dk~~qK~----~L~~~g~~vp~f~~~~~~~~~~~~~~~~ 136 (386) T TIGR01161 72 HVDVEALEKLEARG-DK--VKVRPSPEALAIIQDKLTQKQ----FLQKLGLPVPPFVVIEIKDEEELDSALQ 136 (386) T ss_pred CCCCHHHHHHHHCC-CE--EEEECCHHHHHHHHCHHHHHH----HHHHCCCCCCCCCEEECCCHHHHHHHHH T ss_conf 37856789998359-84--776038899998836589999----9996689988412022368425789998 No 240 >COG1359 Uncharacterized conserved protein [Function unknown] Probab=27.68 E-value=19 Score=15.54 Aligned_cols=30 Identities=20% Similarity=0.286 Sum_probs=20.9 Q ss_pred EECCCCCCCHHHHHHCCCCCCCCCCEEECC Q ss_conf 824311110134320221138653111000 Q gi|254781051|r 668 VNSFLSEQAVLGFEYGYSLNNLNALTIWEA 697 (957) Q Consensus 668 ~nS~LSE~avlgFEyGys~~~p~~L~iWEA 697 (957) ......|.|+|.|++-=+..+|+.++++|. T Consensus 28 v~~sr~EeGcl~Y~l~~d~~~~~~~~~~E~ 57 (100) T COG1359 28 VEASRAEEGCLSYELHRDPDNPGRFVLIEV 57 (100) T ss_pred HHHHCCCCCCEEEEEEECCCCCCEEEEEEE T ss_conf 998706888679999965899978999999 No 241 >KOG3171 consensus Probab=27.53 E-value=10 Score=17.58 Aligned_cols=91 Identities=18% Similarity=0.231 Sum_probs=56.4 Q ss_pred CCCCHHHHHHHHHHHHHHHHHC---CCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCCCC--CCCCCCCCCCCCCCE Q ss_conf 2564658999999999998737---91079980676552689998853798999998558874--322345551001502 Q gi|254781051|r 239 GADGSEVIIPAIEEIIRQGVQQ---GVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGKGP--SEKEYSGDVKYHLGL 313 (957) Q Consensus 239 sLEG~EslIP~L~~ii~~a~~~---Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~~~--~~~~~sGDVKYHLG~ 313 (957) |+.|||+|--.|.-+-.....- .++.-++||.-|=|||||-.++=..-..+...|--... .+.-..|||..-|-- T Consensus 170 gi~gcealn~~~~cLAAeyP~vKFckikss~~gas~~F~~n~lP~LliYkgGeLIgNFv~va~qlgedffa~dle~FL~e 249 (273) T KOG3171 170 GIKGCEALNSSLTCLAAEYPIVKFCKIKSSNTGASDRFSLNVLPTLLIYKGGELIGNFVSVAEQLGEDFFAGDLESFLNE 249 (273) T ss_pred CCCHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCHHHHCCCCCCEEEEEECCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98427888656878650587236787520243414443016774589850872467899999997555526669999977 Q ss_pred EEEEEECCCEEEEEECCC Q ss_conf 688983596189997178 Q gi|254781051|r 314 CCNRQICGKDVKLLLQSN 331 (957) Q Consensus 314 s~~~~~~g~~v~lsL~~N 331 (957) -. -.+-+.||+.-.-| T Consensus 250 ~g--llpe~ev~vl~s~~ 265 (273) T KOG3171 250 YG--LLPEREVHVLESTK 265 (273) T ss_pred CC--CCCCCCEEEECCCC T ss_conf 48--87102413431566 No 242 >cd04820 PA_M28_1_1 PA_M28_1_1: Protease-associated (PA) domain, peptidase family M28, subfamily-1, subgroup 1. A subgroup of PA-domain containing proteins belonging to the peptidase family M28. Family M28 contains aminopeptidases and carboxypeptidases, and has co-catalytic zinc ions. The PA domain is an insert domain in a diverse fraction of proteases. The significance of the PA domain to many of the proteins in which it is inserted is undetermined. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate. Proteins into which the PA domain is inserted include the following members of the peptidase family M28: i) prostate-specific membrane antigen (PSMA), ii) yeast aminopeptidase Y, and ii) human TfR (transferrin receptor)1 and human TfR2. The proteins listed above belong to other subgroups; relatively litt Probab=27.20 E-value=29 Score=14.09 Aligned_cols=44 Identities=20% Similarity=0.395 Sum_probs=35.7 Q ss_pred EEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECC Q ss_conf 88625753334672588764510677443657999841602012 Q gi|254781051|r 373 LIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFT 416 (957) Q Consensus 373 iliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFT 416 (957) +-|-+..-|+|-||+.+-++..+..|-.+-|-|-||..|.-++. T Consensus 20 ~~v~a~~VfvGyGi~a~e~~~dDy~g~Dv~GKivv~l~g~P~~~ 63 (137) T cd04820 20 ASVEAPLVFVGYGLVAPELGHDDYAGLDVKGKIVVVLSGGPAGI 63 (137) T ss_pred CCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCEEEEECCCCCCC T ss_conf 55403379872582676458432258765787899982799877 No 243 >COG4098 comFA Superfamily II DNA/RNA helicase required for DNA uptake (late competence protein) [DNA replication, recombination, and repair] Probab=27.13 E-value=24 Score=14.74 Aligned_cols=15 Identities=20% Similarity=0.282 Sum_probs=8.4 Q ss_pred HHCCCCCCCCCCEEE Q ss_conf 202211386531110 Q gi|254781051|r 681 EYGYSLNNLNALTIW 695 (957) Q Consensus 681 EyGys~~~p~~L~iW 695 (957) --|-|++.|.+.|+. T Consensus 397 RvGRs~~~PtGdv~F 411 (441) T COG4098 397 RVGRSLERPTGDVLF 411 (441) T ss_pred HCCCCCCCCCCCEEE T ss_conf 316787689875899 No 244 >PRK05647 purN phosphoribosylglycinamide formyltransferase; Reviewed Probab=27.12 E-value=37 Score=13.30 Aligned_cols=108 Identities=18% Similarity=0.248 Sum_probs=63.7 Q ss_pred CCCCCHHHHHHHHHCCCCCCCCCEEE-EEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHH-HHHHHHHHH Q ss_conf 33467258876451067744365799-9841602012470102467784001122034058742678899-999999999 Q gi|254781051|r 381 FAGQGIVSETFGLSGLSGYTVAGNIH-LIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEA-VIRVVRMAV 458 (957) Q Consensus 381 faGQGVV~Etl~ls~l~gY~tGGTiH-iVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEa-v~~~~~lA~ 458 (957) +||.|-.++.+--+.-.|-- ...|- ||.|| +..++-..|+-.+.|.+++|-.+-+. ..+-.++ + T Consensus 8 ~SG~GSNl~aii~a~~~~~l-~~~i~~Visn~------------~~a~~l~~A~~~~Ip~~~i~~~~~~~r~~~d~~i-~ 73 (200) T PRK05647 8 ASGNGSNLQAIIDACAAGQL-PAEVVAVISDR------------PDAYGLERAEQAGIPTFVLDHKDFPSREAFDAAL-V 73 (200) T ss_pred ECCCCHHHHHHHHHHHCCCC-CCEEEEEEECC------------CCCHHHHHHHHCCCCEEEECCCCCCCHHHHHHHH-H T ss_conf 91580449999999875999-94899999789------------7536659999759976996776679989999999-9 Q ss_pred HHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHH Q ss_conf 99997389739986421125656435422355788888671899899999 Q gi|254781051|r 459 SFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYA 508 (957) Q Consensus 459 ~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~ 508 (957) +.-.++.-|.+ =|.||-|-= -|.|.+-.--+.|.-||+.+-.|. T Consensus 74 ~~L~~~~vDlI-vLAGyMril-----~~~~l~~~~~kIiNiHPsLLP~f~ 117 (200) T PRK05647 74 EALDAYEPDLV-VLAGFMRIL-----GPTFVSAYEGRIINIHPSLLPSFP 117 (200) T ss_pred HHHHHCCCCEE-EECCHHHHC-----CHHHHHHCCCCCEEECCCCCCCCC T ss_conf 99986499999-998813338-----989995377995677542035888 No 245 >COG4223 Uncharacterized protein conserved in bacteria [Function unknown] Probab=27.10 E-value=25 Score=14.65 Aligned_cols=53 Identities=34% Similarity=0.509 Sum_probs=40.7 Q ss_pred CCCCCCCCCCC----------HHHHHHHHHHHHHHHCCCCCCEEECCCC-CCCCCCCCCCCCHHHHHHHCC Q ss_conf 00024424541----------0225655420257623336813543688-888887656440578875214 Q gi|254781051|r 696 EAQFGDFANGA----------QVILDQFITAGEQKWLCTSNLVCLLPHG-YEGQGPEHSSARLERFLQMCA 755 (957) Q Consensus 696 EAQFGDF~NgA----------QviiDqfiss~e~KW~~~sglvllLPHG-yeGqGPEHSSaRlERfLQl~a 755 (957) -.|||+-+|-. |-++|+..+|+ ++||-.-|-| -||-||+---||||--|+-.+ T Consensus 313 ~~qF~~~AnamvsA~~~pd~nagl~~rL~~Sa-------~slVsVRpVGsveG~t~~a~iARmEa~L~~GD 376 (422) T COG4223 313 ATQFGAVANAMVSASNNPDPNAGLFDRLRSSA-------SSLVSVRPVGSVEGSTPDAMIARMEAALDNGD 376 (422) T ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH-------HCCEEEEECCCCCCCCCCHHHHHHHHHHHCCC T ss_conf 99878899899973348998702999999987-------43512432255578984068999999875445 No 246 >pfam12550 GCR1_C Transcriptional activator of glycolytic enzymes. This domain family is found in eukaryotes, and is approximately 80 amino acids in length. This family is activates the transcription of glycolytic enzymes. Probab=26.83 E-value=24 Score=14.72 Aligned_cols=24 Identities=17% Similarity=0.328 Sum_probs=19.6 Q ss_pred HHHCCCCCCHHHHHHHHHCCCCCC Q ss_conf 873805079899999873676433 Q gi|254781051|r 30 YQEDPSSVCKDWYPLFSFLDENSE 53 (957) Q Consensus 30 YL~DP~SVd~sWr~fF~~l~~~~~ 53 (957) ...+|.||.+-|++|+.+++..++ T Consensus 4 m~~~~~tV~eiw~Ew~~g~~g~ps 27 (81) T pfam12550 4 MSRDIKTVYELWREWYTGLNGQPS 27 (81) T ss_pred CCCCCCCHHHHHHHHHHCCCCCCC T ss_conf 678858699999999865589986 No 247 >smart00833 CobW_C Cobalamin synthesis protein cobW C-terminal domain. CobW proteins are generally found proximal to the trimeric cobaltochelatase subunit CobN, which is essential for vitamin B12 (cobalamin) biosynthesis PUBMED:12869542. They contain a P-loop nucleotide-binding loop in the N-terminal domain and a histidine-rich region in the C-terminal portion suggesting a role in metal binding, possibly as an intermediary between the cobalt transport and chelation systems. CobW might be involved in cobalt reduction leading to cobalt(I) corrinoids. This entry represents the C-terminal domain found in CobW, as well as in P47K, a Pseudomonas chlororaphis protein needed for nitrile hydratase expression PUBMED:7765511. Probab=26.77 E-value=38 Score=13.26 Aligned_cols=66 Identities=20% Similarity=0.312 Sum_probs=45.5 Q ss_pred EEEECCCCCHHHHHHHHHHHCC----CCE-EEEECCCCCCCCCHHHHHHHHHHHHHHC----CCCCCEEEEEECC Q ss_conf 9840148682899999985379----983-9980678531784578997699998750----5677601897536 Q gi|254781051|r 866 LRIEQLYPFPEDYLIKVLSRFV----QAE-IVWCQEEPQNMGAWTFIEPYLEKVLHSI----GADYSRVRYVGRL 931 (957) Q Consensus 866 vRiEqLyPfP~~~l~~~l~~y~----~~e-~vW~QEEp~N~GaW~~v~~rl~~~l~~~----~~~~~~~~y~gR~ 931 (957) +.++.=.||..+.|...|...+ .++ ++|+.+.|.--..|.+|..++.---... +....+|-.||+. T Consensus 4 ~~~~~~~p~d~~~l~~~l~~l~~~ilR~KG~~~~~~~~~~~~~~q~v~~~~~~~~~~~w~~~~~~~s~LVfIG~~ 78 (92) T smart00833 4 FVYRARRPFHPQRLLAALDELPEGVLRAKGFFWLASRPDLPGVLSGAGGRLRIEPAGAWPAAGDRRTRLVFIGRD 78 (92) T ss_pred EEEECCCCCCHHHHHHHHHHHHCCCEEEEEEEEECCCCCEEEEEEECCEEEECCCCCCCCCCCCCCEEEEEEECC T ss_conf 999789999989999999864006289768999379986899997327364246667544689853489999469 No 248 >pfam10925 DUF2680 Protein of unknown function (DUF2680). Members in this family of proteins are annotated as yckD however currently no function is known. Probab=26.75 E-value=38 Score=13.26 Aligned_cols=43 Identities=23% Similarity=0.342 Sum_probs=30.7 Q ss_pred HHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 8888671899899999998644898989999999999999775666 Q gi|254781051|r 493 MYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYK 538 (957) Q Consensus 493 mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~ 538 (957) +|+.|.. .+.--.+++++.|+||++..+.|.+....+++..-+ T Consensus 12 l~kqi~e---l~Kqii~KyVe~G~iT~eQa~~i~~~id~r~~~~~q 54 (59) T pfam10925 12 LYKQIAE---LRKQVIDKYVEAGVITKEQADHIKKNIDKRVEYMKQ 54 (59) T ss_pred HHHHHHH---HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999---999999999982756499999999999999999998 No 249 >COG0769 MurE UDP-N-acetylmuramyl tripeptide synthase [Cell envelope biogenesis, outer membrane] Probab=26.68 E-value=38 Score=13.25 Aligned_cols=40 Identities=28% Similarity=0.436 Sum_probs=24.2 Q ss_pred CCHHCCCCCCCCHHHHHHCCCE-EEECCCCHHHHHHHHHHH Q ss_conf 7010246778400112203405-874267889999999999 Q gi|254781051|r 418 NPSSARSCTYASDISKSIGIPI-FHVNGDDPEAVIRVVRMA 457 (957) Q Consensus 418 ~p~d~RSs~Y~TDiAK~i~aPI-~HVN~DdpEav~~~~~lA 457 (957) +|+....-..-.||.+.+..|- ..++.|--+|.-++..+| T Consensus 396 npR~edp~~i~~~i~~g~~~~~~~~~~~dr~~AI~~ai~~a 436 (475) T COG0769 396 NPRSEDPAVILADILAGIEAPEKYEIIEDREEAIRKALDLA 436 (475) T ss_pred CCCCCCHHHHHHHHHHCCCCCCCEEECHHHHHHHHHHHHHH T ss_conf 98876889999999836476641430110999999999863 No 250 >pfam06785 UPF0242 Uncharacterized protein family (UPF0242). Probab=26.64 E-value=18 Score=15.69 Aligned_cols=75 Identities=19% Similarity=0.297 Sum_probs=55.8 Q ss_pred CCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCC Q ss_conf 66778999999998874368998620567788899899873343564211478988898863583157850100223332 Q gi|254781051|r 558 RKNSVSKEILKKIGSSIIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGYKVRLSGQDCERGTFS 637 (957) Q Consensus 558 ~~T~v~~~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~VRlsGQD~~RGTFs 637 (957) .+|.-+.+.|..+...+.+ |+-.+.-++..+...+.+..+....++--.||+|||---|.|-| ..||+ T Consensus 123 ~ktK~~~q~lE~l~~hl~E--ENqcLq~Qldal~qEc~EK~ee~q~LnrELae~LAYQQ~LndEY----------QATF~ 190 (401) T pfam06785 123 IKTKGRYQKLEELSRHLKE--ENQCLQIQLDALVQECNEKIEENQELNRELAETLAYQQELNDEY----------QATFV 190 (401) T ss_pred HHCCCCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHH T ss_conf 7404518679999998887--64789999999999976668889999999999999999986899----------87899 Q ss_pred CCEEEEE Q ss_conf 3027999 Q gi|254781051|r 638 HRHAILC 644 (957) Q Consensus 638 hRHavl~ 644 (957) -.|.+|- T Consensus 191 EQhnMLD 197 (401) T pfam06785 191 EQHNMLD 197 (401) T ss_pred HHHHHHH T ss_conf 9874268 No 251 >COG2210 Peroxiredoxin family protein [General function prediction only] Probab=26.53 E-value=38 Score=13.23 Aligned_cols=41 Identities=15% Similarity=0.034 Sum_probs=19.6 Q ss_pred HHHHCCCHHHCCCCCCCCCC----CCCCCCCCHHHCCCCH-HHCCC Q ss_conf 99881704540167777776----6787767876669997-88171 Q gi|254781051|r 107 AYRSYGHFKANIDPLGYNSH----QKDLSELSPAHYGFVK-ADYDR 147 (957) Q Consensus 107 AYR~~GHl~A~lDPLgl~~~----~~~~peL~~~~yGLse-~DLd~ 147 (957) ..+...+..+.++=|--..+ +-...++..+-||+++ +|++. T Consensus 68 ~~~~k~~~~~~le~l~~ak~~~~vkv~AC~mtmdl~Gi~k~d~~~~ 113 (137) T COG2210 68 KKMMKKKNVAWLELLREAKKDGGVKVYACSMTMDLMGISKEDDLDD 113 (137) T ss_pred HHHHHHCCCCHHHHHHHHHHHCCCEEEEEHHHHHHHCCCHHHHCCC T ss_conf 9999852520799999875616928997053798858872654245 No 252 >PRK01286 deoxyguanosinetriphosphate triphosphohydrolase-like protein; Provisional Probab=26.42 E-value=38 Score=13.22 Aligned_cols=16 Identities=19% Similarity=0.148 Sum_probs=7.6 Q ss_pred HHHHHCCCHHHCCCCCC Q ss_conf 99988170454016777 Q gi|254781051|r 106 DAYRSYGHFKANIDPLG 122 (957) Q Consensus 106 ~AYR~~GHl~A~lDPLg 122 (957) .|||+..+. .++=||+ T Consensus 40 ~afRRL~~K-TQVf~~~ 55 (338) T PRK01286 40 KAFRRLKHK-TQVFINH 55 (338) T ss_pred HHHHCCCCC-CEECCCC T ss_conf 877614567-7364079 No 253 >PRK11869 2-oxoacid ferredoxin oxidoreductase subunit beta; Provisional Probab=26.23 E-value=39 Score=13.19 Aligned_cols=109 Identities=16% Similarity=0.216 Sum_probs=62.4 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCE-ECC Q ss_conf 0445532367777641666544555654454047888625753334672588764510677443657999841602-012 Q gi|254781051|r 338 VDPVVIGSVRARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQI-GFT 416 (957) Q Consensus 338 VnPVv~G~~RAkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQI-GFT 416 (957) .-||+.|.-+|.-| ..-|..-||.-..|-|...=.-.+-+ -=.-..||+|||| |-| T Consensus 69 alp~AtGiK~anPd------------------L~Viv~~GDGD~~~IGgnH~iHA~RR-----N~dit~Iv~nN~iYGlT 125 (284) T PRK11869 69 AIPAATAVKATNPN------------------LTVIAEGGDGDMYAEGGNHLIHAIRR-----NPDITVLIHNNQIYGLT 125 (284) T ss_pred HHHHHHHHHHHCCC------------------CEEEEEECCCHHHHHHHHHHHHHHHC-----CCCCEEEEECCEECCCC T ss_conf 07889999876899------------------76999812661543028889999974-----89808999888100115 Q ss_pred C------CCHH--CCCCCCCC--------HHHHHHCCCEE-EECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEE Q ss_conf 4------7010--24677840--------01122034058-742678899999999999999973897399864 Q gi|254781051|r 417 T------NPSS--ARSCTYAS--------DISKSIGIPIF-HVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIV 473 (957) Q Consensus 417 T------~p~d--~RSs~Y~T--------DiAK~i~aPI~-HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv 473 (957) . +|.. ..|++|-. .+|...+|+-. ..-.-||+......+-|++ ++-=-+||++ T Consensus 126 kGQ~SPTt~~G~~T~t~P~G~~e~p~np~~la~a~GatfVAR~~~~~~~~l~~~ik~Ai~----hkGfs~VeVl 195 (284) T PRK11869 126 KGQASPTTLKGTKTPTQPWGVFEEPFNPIALAIALDASFVARTFSGDIEETKEIIKEAIK----HKGLAIVDIF 195 (284) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHC----CCCCEEEEEE T ss_conf 798898999988055788998889979999999769978998712899999999999967----9981799996 No 254 >PTZ00075 S-adenosyl-L-homocysteine hydrolase; Provisional Probab=26.19 E-value=39 Score=13.19 Aligned_cols=209 Identities=15% Similarity=0.200 Sum_probs=114.0 Q ss_pred HHCCCCCCCCCEEEEEECC------------CEECCCCCHHCCCCCCCC--HHHHH----HCCCEEEECCCCHHHHHHHH Q ss_conf 5106774436579998416------------020124701024677840--01122----03405874267889999999 Q gi|254781051|r 393 LSGLSGYTVAGNIHLIINN------------QIGFTTNPSSARSCTYAS--DISKS----IGIPIFHVNGDDPEAVIRVV 454 (957) Q Consensus 393 ls~l~gY~tGGTiHiVvNN------------QIGFTT~p~d~RSs~Y~T--DiAK~----i~aPI~HVN~DdpEav~~~~ 454 (957) ---|.|-+++|++|+-+-- +|-.| -|-+++| |+|-. -++|||-..|+++|.--.+. T Consensus 40 ~kPlkG~rI~~~LHlt~kTAvL~~tL~~~GA~V~~~------~~Np~STQDdvaAAL~~~~gi~VfA~kget~eey~~~~ 113 (476) T PTZ00075 40 SKPLKGAKIAGCLHMTVETAVLIETLVALGAEVRWC------SCNIFSTQDNAAAAIAKKGGVTVFAWKGETIEEYWWCI 113 (476) T ss_pred CCCCCCCEEEEEEECHHHHHHHHHHHHHCCCEEEEE------CCCCCCCCHHHHHHHHHHCCCEEEEECCCCHHHHHHHH T ss_conf 699899989999822899999999999859869996------47975436699999886079269995799989999999 Q ss_pred HHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHHHHHHH--HHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 9999999973897399864211256564354223557888886718998999999--98644898989999999999999 Q gi|254781051|r 455 RMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVLQLYAD--SLMKNQVISKQELQSLANNWHKY 532 (957) Q Consensus 455 ~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~~~Y~~--~Li~~g~it~~e~~~~~~~~~~~ 532 (957) .-|++|-..-+-|++||==| | ++..|.+-.-....|.+ +|-.-.....++. +..... T Consensus 114 ~~~L~~~~~~~P~iiiDDGg---------D--------l~~llH~~~~~e~~~~~~~~~~~~~~~~~~~~----~~~~~~ 172 (476) T PTZ00075 114 EQTLKWSDDGGPNLIVDDGG---------D--------LTNLVHDGVKLEKMYEKKGKLPDPSVAEPEDN----RCLLTF 172 (476) T ss_pred HHHHHCCCCCCCCEEEECCC---------H--------HHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHH----HHHHHH T ss_conf 99973466889877981561---------2--------77887521121012001356778766751788----999999 Q ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCHHHHH-HHHHHHHHHCCCCCCCCHHHHH Q ss_conf 775666541258765666656862666778999999998874368998620567788-8998998733435642114789 Q gi|254781051|r 533 LEAEYKESESYCPEKLGLLHNGENERKNSVSKEILKKIGSSIIRLPKSFNTHKIVER-LMANRQKMIETGKGIDWSMAES 611 (957) Q Consensus 533 l~~~~~~~k~~~~~~~~~~~~~~~~~~T~v~~~~L~~i~~~l~~~Pe~f~~h~~v~k-~~~~R~~~~~~~~~iDW~~AE~ 611 (957) |........+..|+-........+...||| .+|+.+.+.-.-.=+-++++.-+.| .+++|-..-++ -+|.=+ T Consensus 173 l~~~~~~~~~~~~~l~~~i~G~sEETTTGV--~RL~am~~~g~L~fP~i~VNDs~tK~~FDNrYGtgqS--~~dgi~--- 245 (476) T PTZ00075 173 LNHLVTKNPNKWPNLVKKIVGVSEETTTGV--KNLYKMLKKGKLLFPAINVNDSVTKSKFDNLYGCRES--LVDGIK--- 245 (476) T ss_pred HHHHHHHCCCCCHHHHHHEECCCCCHHHHH--HHHHHHHHCCCCCCEEEEECCHHHHHHHCCCCCHHHH--HHHHHH--- T ss_conf 998874155201467851004000544759--9999998779602118998765552340222211365--999999--- Q ss_pred HHHHHHHHCCCEEEEEC-CCCCCCCC Q ss_conf 88898863583157850-10022333 Q gi|254781051|r 612 LAFGSLCYEGYKVRLSG-QDCERGTF 636 (957) Q Consensus 612 LA~~SLl~eG~~VRlsG-QD~~RGTF 636 (957) -.--+|..|-.|=..| -||+||.= T Consensus 246 -raTn~liaGK~vVV~GYG~~GkG~A 270 (476) T PTZ00075 246 -RATDVMIAGKTVVVCGYGDVGKGCA 270 (476) T ss_pred -HHHCCEECCCEEEEEECCCCCHHHH T ss_conf -8628345475899950453012147 No 255 >PRK03906 mannonate dehydratase; Provisional Probab=26.19 E-value=27 Score=14.32 Aligned_cols=29 Identities=21% Similarity=0.432 Sum_probs=13.7 Q ss_pred HHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHH Q ss_conf 9999999840643027772489989999999984 Q gi|254781051|r 164 EIVDVLSHLYCSNIGVEFMHIVDSIERDWVRNTI 197 (957) Q Consensus 164 eIi~~L~~tYCgsIGvEymHI~d~eer~Wlq~ri 197 (957) +.|..|-+.=-.+|.|-||=+-| |.|..+ T Consensus 87 ~sirNLa~~GI~viCYNFMPv~D-----WtRTdL 115 (389) T PRK03906 87 QTLRNLAAAGIKVVCYNFMPVFD-----WTRTDL 115 (389) T ss_pred HHHHHHHHCCCCEEEEECCCCCC-----CCCCCC T ss_conf 99999997699789650366465-----003656 No 256 >PRK01184 hypothetical protein; Provisional Probab=26.16 E-value=13 Score=16.68 Aligned_cols=54 Identities=24% Similarity=0.300 Sum_probs=35.3 Q ss_pred EECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHH-HH----------HHHHHHHHHHHHHHCCCEEE Q ss_conf 201247010246778400112203405874267889-99----------99999999999973897399 Q gi|254781051|r 413 IGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPE-AV----------IRVVRMAVSFRMKFHKSVVI 470 (957) Q Consensus 413 IGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpE-av----------~~~~~lA~~yR~~F~kDVvI 470 (957) ||+|-.|--+.|+ .+++++-.++||+ ..+|.- .. -....++.+.|++|+.|++. T Consensus 4 IGlTG~iGSGKst--va~i~~e~G~~vi--~~~Divr~~v~~~g~~~~~~~~~~~~~~lR~~~G~~~~a 68 (183) T PRK01184 4 IIVTGMPGSGKGE--FSKIARELGIPVV--VMGDVIREEVKKRGLPPTDENIGKVATDLRKELGMDAVA 68 (183) T ss_pred EEEECCCCCCHHH--HHHHHHHCCCEEE--ECCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCHHHH T ss_conf 9996899887899--9999997799399--860778999998389997789999999999871955899 No 257 >pfam11339 DUF3141 Protein of unknown function (DUF3141). This family of proteins with unknown function appears to be restricted to Proteobacteria. Probab=26.11 E-value=11 Score=17.43 Aligned_cols=42 Identities=19% Similarity=0.179 Sum_probs=26.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHH Q ss_conf 8999999999999999738973998642112565643542235578888867189989 Q gi|254781051|r 447 PEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVL 504 (957) Q Consensus 447 pEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~ 504 (957) =++|.++.++....=..|-.-.|= .++.|.+-+.+++.-+.+ T Consensus 426 F~~VarvSe~n~~~Y~~fv~P~vr----------------a~~tp~~A~~lr~~HPlR 467 (581) T pfam11339 426 FAAVARVSEINLALYDLFVQPWVR----------------AAVTPQSAELLRELHPLR 467 (581) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH----------------HHCCHHHHHHHHHHCCHH T ss_conf 799999999999999998759999----------------736999999999859147 No 258 >KOG4269 consensus Probab=25.69 E-value=25 Score=14.56 Aligned_cols=24 Identities=21% Similarity=0.357 Sum_probs=14.7 Q ss_pred HHHHHHHHHH-HCCCCCEEEEEEEC Q ss_conf 7899999999-76998689998401 Q gi|254781051|r 847 VYYDLLDNRD-MRNIADIYLLRIEQ 870 (957) Q Consensus 847 vyydL~~~r~-~~~~~~vaivRiEq 870 (957) |-|--+++.+ .+|++.+.|-||-- T Consensus 922 VVyrCvEyle~~RgieEeGIyRlSG 946 (1112) T KOG4269 922 VVYRCVEYLESCRGIEEEGIYRLSG 946 (1112) T ss_pred HHHHHHHHHHHCCCCCHHCEEEECC T ss_conf 8999999998614620204477325 No 259 >KOG3350 consensus Probab=25.64 E-value=39 Score=13.12 Aligned_cols=74 Identities=18% Similarity=0.264 Sum_probs=40.1 Q ss_pred HHHCCCCCEEECCCCCCCCCCC----CCCCC--CCHHHCCEEEEECCHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHH Q ss_conf 8937998603515776433444----54445--79545768998688578999999997699868999840148682899 Q gi|254781051|r 805 SDMTCGSVFQAVLSDDAEYHGK----TSVKL--KEDSHIRRVILCTGKVYYDLLDNRDMRNIADIYLLRIEQLYPFPEDY 878 (957) Q Consensus 805 ~d~~~g~~F~~vi~d~~~~~~~----~~~~~--~~~~~v~rvi~csGKvyydL~~~r~~~~~~~vaivRiEqLyPfP~~~ 878 (957) +++. + .|..||.||....-. +..++ -... -+|||+|||-+--+.+.++-. |+.--.-|=.... T Consensus 130 ~~lk-~-~fdiivaDPPfL~~eCl~Kts~tik~L~r~-~~kvilCtGeimee~~s~~l~--------~~~~sF~PeH~~n 198 (217) T KOG3350 130 DELK-A-HFDIIVADPPFLSEECLAKTSETIKRLQRN-QKKVILCTGEIMEEWASALLP--------VLKCSFRPEHERN 198 (217) T ss_pred HHHH-H-CCCEEEECCCCCCCHHHHHHHHHHHHHHCC-CCEEEEECHHHHHHHHHHHHH--------HHHCCCCCHHHCC T ss_conf 9997-4-016897178654616666668999997068-853998340676899998753--------3210013544305 Q ss_pred HHHHHHHCCCC Q ss_conf 99998537998 Q gi|254781051|r 879 LIKVLSRFVQA 889 (957) Q Consensus 879 l~~~l~~y~~~ 889 (957) |.....-|.|- T Consensus 199 LaNeF~cyaNf 209 (217) T KOG3350 199 LANEFRCYANF 209 (217) T ss_pred CCCCEEEEEEC T ss_conf 35001577742 No 260 >PRK06389 argininosuccinate lyase; Provisional Probab=25.55 E-value=40 Score=13.10 Aligned_cols=31 Identities=23% Similarity=0.190 Sum_probs=17.4 Q ss_pred CCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCC Q ss_conf 5404788862575333467258876451067744 Q gi|254781051|r 367 RSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYT 400 (957) Q Consensus 367 ~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~ 400 (957) +-.+.|+ |-||.+|-..-----.++++=||. T Consensus 183 r~n~~PL---GsgA~aGt~~piDR~~~a~~LGF~ 213 (434) T PRK06389 183 DLREMPY---GYGSGYGSPSSVKFNQMSELLGME 213 (434) T ss_pred HHCCCCC---CCCCCCCCCCCCCHHHHHHHCCCC T ss_conf 6144875---300025888876889999982999 No 261 >COG1166 SpeA Arginine decarboxylase (spermidine biosynthesis) [Amino acid transport and metabolism] Probab=25.51 E-value=40 Score=13.10 Aligned_cols=300 Identities=17% Similarity=0.218 Sum_probs=148.4 Q ss_pred CCCCHHHHHHHHHHH-HCCCEEEEEECCCCHHHHH---HHHHHHCCCCCC------CCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 777899999999984-0643027772489989999---999984145778------887988999999999999989999 Q gi|254781051|r 158 ESATIPEIVDVLSHL-YCSNIGVEFMHIVDSIERD---WVRNTIEDYDFS------SNFSKEEQRDILDKLVRAEGFEKF 227 (957) Q Consensus 158 ~~~tL~eIi~~L~~t-YCgsIGvEymHI~d~eer~---Wlq~riE~~~~~------~~~s~eeK~~iL~~L~~Ae~FE~F 227 (957) .+..|.+|++.|+.. |-...=+-|-||---..|+ =+...+|..... .+.-..+.+++-+.|+++. T Consensus 62 ~~~dL~elV~~l~~~g~~LPlL~rFp~IL~~Rl~~ln~aF~~Ai~ey~Y~g~Y~~VyPIKvNQ~r~vVe~Lv~~g----- 136 (652) T COG1166 62 ARVDLAELVKALRDRGLRLPLLLRFPQILQHRLRSLNAAFARAIEEYGYPGGYFAVYPIKVNQHRRVVESLVASG----- 136 (652) T ss_pred CCCCHHHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEEECCHHHHHHHHHHCC----- T ss_conf 522099999999965789856985269999999999999999999808898605898766554188999998636----- Q ss_pred HHHHHCCCCEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCC--HHHHHHHHHCCCHH----------HHHHHHC- Q ss_conf 9864033622225646589999999999987379107998067655--26899988537989----------9999855- Q gi|254781051|r 228 IDIKYKGAKRFGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRG--RLNVLSQIMNKLPR----------SIFYEFK- 294 (957) Q Consensus 228 L~~Kf~g~KRFsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRG--RLNVL~Nvl~Kp~~----------~iF~EF~- 294 (957) --.-||||-|-- |-|=..+..+..-|.-=||=|---|. ||-.+.--||..+- .+..|=+ T Consensus 137 ------~~~~~GLEAGSK--~ELm~vLA~~~~~~~~IvCNGyKDrEyI~lAlig~kLGh~v~ivIEklsEl~~Vl~eA~~ 208 (652) T COG1166 137 ------KGYPLGLEAGSK--AELMAVLAHAGNPGSLIVCNGYKDREYIRLALIGEKLGHKVYIVIEKLSELDLVLEEAKQ 208 (652) T ss_pred ------CCCCCCCCCCCH--HHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHH T ss_conf ------898776667887--999999983389997178457460999999999998279549999226799999999997 Q ss_pred -CCCCC-------------CCCCCCCCCCCCCEEEEE--E----E--CC-----CEEEEEECCCCCCCHHCCHHHHHHHH Q ss_conf -88743-------------223455510015026889--8----3--59-----61899971787731000445532367 Q gi|254781051|r 295 -GKGPS-------------EKEYSGDVKYHLGLCCNR--Q----I--CG-----KDVKLLLQSNPSHLEFVDPVVIGSVR 347 (957) Q Consensus 295 -g~~~~-------------~~~~sGDVKYHLG~s~~~--~----~--~g-----~~v~lsL~~NPSHLEaVnPVv~G~~R 347 (957) |..|. .+..||--|.-.|.|+.- + . .+ .-+|..| -|-+-.+.+|..|.-- T Consensus 209 lgvkP~lGvR~RL~sqGsGkW~~SgG~ksKFGLsa~qvL~~v~~Lre~~~Ld~l~llHFHl---GSQisnI~~ik~~~rE 285 (652) T COG1166 209 LGVKPRLGVRARLASQGSGKWQSSGGEKSKFGLSATQVLQVVERLREANLLDSLQLLHFHL---GSQISNIRDIKTGVRE 285 (652) T ss_pred CCCCCCCEEEEEEECCCCCCCCCCCCCHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHH---CCHHHHHHHHHHHHHH T ss_conf 3999763368886235567500046831014888899999999987626577667876410---0013456999999999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCC Q ss_conf 77764166654455565445404788862575333467258876451067744365799984160201247010246778 Q gi|254781051|r 348 ARQDLKAGIVGEKSISLVERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTY 427 (957) Q Consensus 348 AkQd~~~d~~~~~~~~~~~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y 427 (957) |-|.+..=. - -| .++..+.+||-+ -.+..-+||..| T Consensus 286 A~r~YvEL~-------------k-------------lG--------a~i~~~dVGGGL----------gVDYdGt~t~~~ 321 (652) T COG1166 286 AARFYVELR-------------K-------------LG--------ANIKYFDVGGGL----------GVDYDGTRTQSD 321 (652) T ss_pred HHHHHHHHH-------------H-------------CC--------CCCEEEECCCCE----------EECCCCCCCCCC T ss_conf 999999999-------------8-------------27--------886078516844----------332167644663 Q ss_pred CCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEE---------------EEEECCCCCCCCCCCCCCCCHH Q ss_conf 4001122034058742678899999999999999973897399---------------8642112565643542235578 Q gi|254781051|r 428 ASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVI---------------DIVCYRRFGHNEGDEPSFTQPM 492 (957) Q Consensus 428 ~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvI---------------Dlv~YRr~GHNE~DeP~fTQP~ 492 (957) ||-==-+ -+-.-.||+..+=|++=++-=+-+||+ +.++--|+.-| |.|..--|. T Consensus 322 ~S~NY~l---------~eYA~dVV~~l~d~C~~~~~p~P~IisESGRaitAHhaVLI~~Vi~v~~~~~~--~~p~~~~~~ 390 (652) T COG1166 322 CSKNYGL---------NEYANDVVWALKDACEEKGLPHPTIISESGRAITAHHAVLIANVIGVERHEYN--DAPLPDAPR 390 (652) T ss_pred CCCCCCH---------HHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHCCEEEEEEECCCCCCCCC--CCCCCCCCC T ss_conf 2336788---------99998999999999984599998078104423220422898630002457877--788888843 Q ss_pred HHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 888867189989999999864489898999999999999977566 Q gi|254781051|r 493 MYKTIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEY 537 (957) Q Consensus 493 mY~~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~ 537 (957) ..||-++.+| .--+-+++..+.++..+-...++.+. T Consensus 391 -----~~~~l~~~~~----e~~~~i~~r~~~E~~hds~~~~~~~~ 426 (652) T COG1166 391 -----NLPPLWRTLQ----ELYESITARNLREWYHDSQDDLEDAH 426 (652) T ss_pred -----CCHHHHHHHH----HHHCCCCHHHHHHHHHHHHHHHHHHH T ss_conf -----2458999999----98634577889999987676799999 No 262 >TIGR00747 fabH 3-oxoacyl-(acyl-carrier-protein) synthase III; InterPro: IPR004655 Beta-ketoacyl-acyl carrier protein synthase III (FabH), in general, initiates elongation in type II fatty acid synthase systems found in bacteria and plants. It is responsible for producing the multitude of fatty acid structures found in bacterial membranes . The two members of this subfamily from Bacillus subtilis differ from each other, and from FabH from Escherichia coli, in acyl group specificity. Active site residues include Cys112, His244 and Asn274 of Escherichia coli FabH. Cys-112 is the site of acyl group attachment. ; GO: 0004315 3-oxoacyl-[acyl-carrier-protein] synthase activity, 0006633 fatty acid biosynthetic process. Probab=25.48 E-value=19 Score=15.60 Aligned_cols=25 Identities=40% Similarity=0.658 Sum_probs=17.0 Q ss_pred CCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 110134320221138653111000024424541022565542025762 Q gi|254781051|r 674 EQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKW 721 (957) Q Consensus 674 E~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW 721 (957) -+||=||=||-|+. +|||-||..|- T Consensus 110 ~AACaGF~Y~Ls~A-----------------------~~~i~SG~y~~ 134 (329) T TIGR00747 110 SAACAGFIYGLSVA-----------------------KQYIESGKYKT 134 (329) T ss_pred HHHHHHHHHHHHHH-----------------------HHHHHCCCCCE T ss_conf 65676689999999-----------------------99861577026 No 263 >PRK00098 ribosome-associated GTPase; Reviewed Probab=25.37 E-value=33 Score=13.70 Aligned_cols=153 Identities=12% Similarity=0.153 Sum_probs=73.1 Q ss_pred CCCCCCCCHHHHHHHCCCCCEEEEECCCHHHHH---HHHH--HHHCCCCCCCEEEECCCHHHCCCCCCCCHHHHCCCCCE Q ss_conf 876564405788752143782697248899999---9999--97435348868985660221080126887893799860 Q gi|254781051|r 739 GPEHSSARLERFLQMCAENNMYVANCTSPANYF---HILR--RQIYDRSSRPLIMMAPKSLLRHKRVVSSLSDMTCGSVF 813 (957) Q Consensus 739 GPEHSSaRlERfLQl~a~~N~~V~~~ttpAq~F---H~LR--RQ~~r~~rkPLiv~tPKsLLR~~~a~S~l~d~~~g~~F 813 (957) .|+.+..+|+|||-+|...|+.++.+=|-+-.. -.++ -+..+..--|.+..+.+. +.....|.++..|. - T Consensus 92 ~P~~~~~~idR~Lv~a~~~~i~pvivlnK~DL~~~~~~~~~~~~~y~~~gy~v~~~S~~~----~~g~~~L~~~l~~k-~ 166 (298) T PRK00098 92 EPDFSTDLLDRFLVLAEANDIKPIIVLNKIDLVDDLEEARERLALYRAIGYDVLELSAKE----GPGLDELKPLLAGK-V 166 (298) T ss_pred CCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEECCC----CCCHHHHHHHHCCC-E T ss_conf 898998999999999998699589996424564779999999999987899789996899----98999999985798-6 Q ss_pred EECCCCCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHH--HHHHHCCCCCEEEEEEECCCCCHHHHHHHHHHHCCCC-E Q ss_conf 351577643344454445795457689986885789999--9999769986899984014868289999998537998-3 Q gi|254781051|r 814 QAVLSDDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLL--DNRDMRNIADIYLLRIEQLYPFPEDYLIKVLSRFVQA-E 890 (957) Q Consensus 814 ~~vi~d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~--~~r~~~~~~~vaivRiEqLyPfP~~~l~~~l~~y~~~-e 890 (957) --++. ++.. -+.+-+..++ |..--.-- .+...+|.. ..|--+|||+|- ..|.-=|.. + T Consensus 167 sv~~G-~SGV--------GKSSLiN~L~---~~~~~~t~~is~~~~rGrH---TTt~~~l~~l~~----G~iiDTPG~r~ 227 (298) T PRK00098 167 TVLAG-QSGV--------GKSTLLNALV---PELELKTGEISEALGRGKH---TTTHVELYDLPG----GLLIDTPGFSS 227 (298) T ss_pred EEEEC-CCCC--------CHHHHHHHHC---CHHHHHHHHHHHHCCCCCE---EEEEEEEEECCC----CEEEECCCCCC T ss_conf 99987-8988--------7888887607---1444566755665189721---330468998699----75981898775 Q ss_pred E-EEECCCCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 9-9806785317845789976999987505 Q gi|254781051|r 891 I-VWCQEEPQNMGAWTFIEPYLEKVLHSIG 919 (957) Q Consensus 891 ~-vW~QEEp~N~GaW~~v~~rl~~~l~~~~ 919 (957) | +|-- ++.. -.+..|-|.......+ T Consensus 228 ~~l~~~-~~~~---l~~~F~e~~~~~~~Ck 253 (298) T PRK00098 228 FGLEHL-EAED---LEHAFPEFRPLSGECK 253 (298) T ss_pred CCCCCC-CHHH---HHHHHHHHHHHHCCCC T ss_conf 566779-9799---9886788999868898 No 264 >cd02753 MopB_Formate-Dh-H Formate dehydrogenase H (Formate-Dh-H) catalyzes the reversible oxidation of formate to CO2 with the release of a proton and two electrons. It is a component of the anaerobic formate hydrogen lyase complex. The E. coli formate dehydrogenase H (Fdh-H) is a monomer composed of a single polypeptide chain with a Mo active site region and a [4Fe-4S] center. Members of the MopB_Formate-Dh-H CD belong to the molybdopterin_binding (MopB) superfamily of proteins. Probab=25.35 E-value=40 Score=13.08 Aligned_cols=58 Identities=14% Similarity=0.241 Sum_probs=32.1 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHCCCCC Q ss_conf 9864489898999999999999977566654125876566665686266677899999999887436899 Q gi|254781051|r 510 SLMKNQVISKQELQSLANNWHKYLEAEYKESESYCPEKLGLLHNGENERKNSVSKEILKKIGSSIIRLPK 579 (957) Q Consensus 510 ~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~k~~~~~~~~~~~~~~~~~~T~v~~~~L~~i~~~l~~~Pe 579 (957) -|+++|.++++-+++....+.+. .+..+.|.|++. ..-|||+.+.+++++..+.+-.. T Consensus 223 ~ii~~~~~d~~fi~~~t~gf~~~----~~~~~~~tpe~~--------a~itgv~~~~I~~~A~~~a~~~~ 280 (512) T cd02753 223 VIIEEGLYDEEFIEERTEGFEEL----KEIVEKYTPEYA--------ERITGVPAEDIREAARMYATAKS 280 (512) T ss_pred EEECCCCCCHHHHHHHCCCHHHH----HHHHHHCCHHHH--------HHHHCCCHHHHHHHHHHHHHCCC T ss_conf 85006664514577605779999----998650899999--------98859889999999999975698 No 265 >KOG1845 consensus Probab=25.29 E-value=40 Score=13.09 Aligned_cols=39 Identities=15% Similarity=-0.004 Sum_probs=16.3 Q ss_pred CEEEEECCHHHHHHHHHHHHCCCC---CEEEEEEECCCCCHH Q ss_conf 689986885789999999976998---689998401486828 Q gi|254781051|r 838 RRVILCTGKVYYDLLDNRDMRNIA---DIYLLRIEQLYPFPE 876 (957) Q Consensus 838 ~rvi~csGKvyydL~~~r~~~~~~---~vaivRiEqLyPfP~ 876 (957) +....-+|+-+-+.+++..++.+. +=-.+++||+-|.-. T Consensus 661 ~~~~~~~~~~~~~~l~~~~~~~l~~~~~~~~t~~~q~~~~n~ 702 (775) T KOG1845 661 KLKSEQKQKKTLVELEETRKKWLRSMLNQSLTAGEQLKSLNQ 702 (775) T ss_pred CCCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 555121151799999999999999863333444531010143 No 266 >TIGR02403 trehalose_treC alpha,alpha-phosphotrehalase; InterPro: IPR012769 Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.; GO: 0008788 alphaalpha-phosphotrehalase activity, 0005993 trehalose catabolic process, 0005737 cytoplasm. Probab=25.26 E-value=40 Score=13.07 Aligned_cols=35 Identities=11% Similarity=0.086 Sum_probs=23.8 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCCCC Q ss_conf 36789999999999999881704540167777776 Q gi|254781051|r 92 TQSVKDFFQVMKMIDAYRSYGHFKANIDPLGYNSH 126 (957) Q Consensus 92 ~~~~~kq~rV~~LI~AYR~~GHl~A~lDPLgl~~~ 126 (957) .+.-.=.-.|..+|+-+|..|=.==+||=++|-.. T Consensus 169 W~Np~VR~~~~~v~~FW~dKGV~GFRlDVINLIsK 203 (555) T TIGR02403 169 WENPEVREELKDVVNFWRDKGVDGFRLDVINLISK 203 (555) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCEEEEEEEECCC T ss_conf 77888999999999999866987135427740347 No 267 >pfam02633 Creatininase Creatinine amidohydrolase. Creatinine amidohydrolase (EC:3.5.2.10), or creatininase, catalyses the hydrolysis of creatinine to creatine. Probab=25.24 E-value=40 Score=13.06 Aligned_cols=52 Identities=17% Similarity=0.397 Sum_probs=42.1 Q ss_pred HHHCCCCEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCC Q ss_conf 6403362222564658999999999998737910799806765526899988537 Q gi|254781051|r 230 IKYKGAKRFGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNK 284 (957) Q Consensus 230 ~Kf~g~KRFsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~K 284 (957) ..|||+= || ..|+++..|..|++....+|++.|+|==.|-|-.++|..++.+ T Consensus 71 ~~fpGTi--sl-~~~t~~~~l~di~~sl~~~Gf~~ivilNgHgGN~~~l~~~~~~ 122 (235) T pfam02633 71 MGFPGTI--SL-SPETLIALLRDIGRSLARHGFRRLVIVNGHGGNIALLQEAARE 122 (235) T ss_pred CCCCCEE--EE-CHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHH T ss_conf 2889736--50-9999999999999999977975899991687748999999999 No 268 >pfam02836 Glyco_hydro_2_C Glycosyl hydrolases family 2, TIM barrel domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities. Probab=25.22 E-value=40 Score=13.06 Aligned_cols=23 Identities=13% Similarity=0.117 Sum_probs=13.3 Q ss_pred EEEEEEEECCCCCCCCCHHHHHH Q ss_conf 04788862575333467258876 Q gi|254781051|r 369 RVLPLIIHGDAAFAGQGIVSETF 391 (957) Q Consensus 369 ~vlpiliHGDAAfaGQGVV~Etl 391 (957) +.-.+.-|+|-+..|-.+-.|.+ T Consensus 16 ~l~Gvnrh~~~~~~G~a~~~~~~ 38 (297) T pfam02836 16 YFRGVNRHEDHPRRGRAFDEDDM 38 (297) T ss_pred EEEEEECCCCCCCCCCCCCHHHH T ss_conf 87514757578656631899999 No 269 >PRK08849 2-octaprenyl-3-methyl-6-methoxy-1,4-benzoquinol hydroxylase; Provisional Probab=25.18 E-value=35 Score=13.47 Aligned_cols=15 Identities=13% Similarity=0.138 Sum_probs=7.5 Q ss_pred CCCCHHHHHHHHHHH Q ss_conf 267889999999999 Q gi|254781051|r 443 NGDDPEAVIRVVRMA 457 (957) Q Consensus 443 N~DdpEav~~~~~lA 457 (957) -|--|-..+-|+.|| T Consensus 9 vGaG~vGl~lAl~La 23 (384) T PRK08849 9 VGGGMVGAATAIGFA 23 (384) T ss_pred ECCCHHHHHHHHHHH T ss_conf 992499999999999 No 270 >COG0128 AroA 5-enolpyruvylshikimate-3-phosphate synthase [Amino acid transport and metabolism] Probab=24.97 E-value=37 Score=13.35 Aligned_cols=68 Identities=21% Similarity=0.302 Sum_probs=47.2 Q ss_pred HHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCC Q ss_conf 99999998874368998620567788899899873343564211478988898863583157850100223332 Q gi|254781051|r 564 KEILKKIGSSIIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAESLAFGSLCYEGYKVRLSGQDCERGTFS 637 (957) Q Consensus 564 ~~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~LA~~SLl~eG~~VRlsGQD~~RGTFs 637 (957) .+.|+++|.++.+.++++.+++. .++-... .+.+ -=|-=+|=++|.+.|+.++- |++-+-+|-+=||- T Consensus 348 a~eL~klG~~v~e~~Dgl~I~g~-~~l~g~~--~v~s--~~DHRiaMa~aia~l~~~~~-v~I~~~~~v~kSfP 415 (428) T COG0128 348 ATELRKLGVEVEETEDGLIITGG-TKLKGAG--TVDS--YGDHRIAMAFAVAGLLSEGG-VRIDDAECVAKSFP 415 (428) T ss_pred HHHHHHCCCEEEECCCEEEEECC-CCCCCCC--CCCC--CCCHHHHHHHHHHHHHCCCC-EEECCCCCEECCCC T ss_conf 99999658677864883899789-8789985--1047--68689999999999605897-89727461034584 No 271 >pfam12565 DUF3747 Protein of unknown function (DUF3747). This family of proteins is found in bacteria. Proteins in this family are typically between 215 and 413 amino acids in length. There is a conserved DSNGYS sequence motif. Probab=24.84 E-value=32 Score=13.80 Aligned_cols=14 Identities=36% Similarity=0.807 Sum_probs=7.0 Q ss_pred HCCCEEEEECCCCC Q ss_conf 35831578501002 Q gi|254781051|r 619 YEGYKVRLSGQDCE 632 (957) Q Consensus 619 ~eG~~VRlsGQD~~ 632 (957) +.||++|+.|||-+ T Consensus 96 SNGYSlR~~G~Dlg 109 (183) T pfam12565 96 SNGYSLRIGGEDLG 109 (183) T ss_pred CCCCEEEECCEECC T ss_conf 88747877880257 No 272 >TIGR02010 IscR iron-sulfur cluster assembly transcription factor IscR; InterPro: IPR010242 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] . FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins , . It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly . The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA . SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA , acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins . Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen . This entry describes IscR, an iron-sulphur binding transcription factor of the ISC iron-sulphur cluster assembly system.; GO: 0003690 double-stranded DNA binding, 0003700 transcription factor activity, 0006355 regulation of transcription DNA-dependent. Probab=24.84 E-value=41 Score=13.02 Aligned_cols=17 Identities=18% Similarity=0.503 Sum_probs=11.1 Q ss_pred CCCHHHHHHHHHHHHHC Q ss_conf 78968999999998738 Q gi|254781051|r 17 GTNCCYIEDLYKSYQED 33 (957) Q Consensus 17 g~Na~YIEeLYekYL~D 33 (957) +-+++|+|||+.|=+++ T Consensus 36 ~ISLSYLEQLFaKLRk~ 52 (140) T TIGR02010 36 NISLSYLEQLFAKLRKA 52 (140) T ss_pred CCCHHHHHHHHHHHCHH T ss_conf 61356798998863210 No 273 >TIGR03159 cas_Csc1 CRISPR-associated protein Csc1. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) is a widespread family of prokaryotic direct repeats with spacers of unique sequence between consecutive repeats. This protein family is a CRISPR-associated (Cas) family strictly associated with the Cyano subtype of CRISPR/Cas locus, found in several species of Cyanobacteria and several archaeal species. This family is designated Csc1 for CRISPR/Cas Subtype Cyano protein 1, as it is often the first gene upstream of the core cas genes, cas3-cas4-cas1-cas2. Probab=24.75 E-value=41 Score=13.00 Aligned_cols=29 Identities=24% Similarity=0.260 Sum_probs=22.7 Q ss_pred CCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCC Q ss_conf 421147898889886358315785010022333 Q gi|254781051|r 604 IDWSMAESLAFGSLCYEGYKVRLSGQDCERGTF 636 (957) Q Consensus 604 iDW~~AE~LA~~SLl~eG~~VRlsGQD~~RGTF 636 (957) .+||.. ||.=+..|..||+.|++.+|=|. T Consensus 25 ~N~AL~----YAlg~~~~~y~~~~~~~~q~P~Y 53 (223) T TIGR03159 25 GNTALD----YALGYILGEYVDLHGQATQRPTY 53 (223) T ss_pred HHHHHH----HHHHCCCCCEEECCCCCCCCCCH T ss_conf 677999----98714577502012432579757 No 274 >PHA00098 hypothetical protein Probab=24.69 E-value=32 Score=13.80 Aligned_cols=32 Identities=25% Similarity=0.452 Sum_probs=20.7 Q ss_pred CCCCCC-CCCCCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 256564-35422355788888671899899999998644898 Q gi|254781051|r 477 RFGHNE-GDEPSFTQPMMYKTIRSHKSVLQLYADSLMKNQVI 517 (957) Q Consensus 477 r~GHNE-~DeP~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~i 517 (957) |+|--| +|-|..-||+|-+ --|+++|++.|.+ T Consensus 38 ~~~lG~~~dvp~~~qpLrId---------~~YA~rL~~t~Af 70 (112) T PHA00098 38 QYGLGLNTDIPFNKQPLRIE---------PTYAKRLIETRAF 70 (112) T ss_pred HHCCCCCCCCCCCCCCEEEC---------HHHHHHHHHHCCC T ss_conf 50445678767677744756---------8899998871276 No 275 >PRK11544 hycI hydrogenase 3 maturation protease; Provisional Probab=24.68 E-value=41 Score=12.99 Aligned_cols=40 Identities=23% Similarity=0.342 Sum_probs=28.0 Q ss_pred CCCCCCCCCCCCCCHHHHHHHCCCCCEEEEEC-CCHHHHHHHHHHH Q ss_conf 88888887656440578875214378269724-8899999999997 Q gi|254781051|r 733 HGYEGQGPEHSSARLERFLQMCAENNMYVANC-TSPANYFHILRRQ 777 (957) Q Consensus 733 HGyeGqGPEHSSaRlERfLQl~a~~N~~V~~~-ttpAq~FH~LRRQ 777 (957) .|=||.||. +=+-|+----.||.|+++ |+|-||.+.+|+- T Consensus 13 ~gDDG~G~~-----v~e~L~~~~~~~~~vidgGt~pe~~~~~i~~~ 53 (156) T PRK11544 13 MGDDGAGPL-----LAEKCAAAPKGGWVVIDGGSAPENDIVAIREL 53 (156) T ss_pred CCCCCHHHH-----HHHHHHHCCCCCEEEEECCCCHHHHHHHHHCC T ss_conf 055747599-----99999737899849997988869999998701 No 276 >pfam04007 DUF354 Protein of unknown function (DUF354). Members of this family are around 350 amino acids in length. They are found in archaebacteria and have no known function. Probab=24.65 E-value=6.8 Score=18.89 Aligned_cols=37 Identities=14% Similarity=0.250 Sum_probs=20.0 Q ss_pred CCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHH Q ss_conf 246778400112203405874267889999999999999 Q gi|254781051|r 422 ARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSF 460 (957) Q Consensus 422 ~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~y 460 (957) |..|.+++-+|.+++.|.+..+ |++.|. .+.+|.+-| T Consensus 89 s~~S~~a~~va~~LgipsI~f~-Dteha~-~~~~Lt~Pf 125 (335) T pfam04007 89 MKNSMELPRVAFGLRIPSIIVL-DNEHAL-AANKLTFPL 125 (335) T ss_pred ECCCHHHHHHHHHCCCCEEEEE-CCHHHC-CCCEEEEEC T ss_conf 4488019999988299879994-775541-233023123 No 277 >TIGR02169 SMC_prok_A chromosome segregation protein SMC; InterPro: IPR011891 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by IPR011890 from INTERPRO. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent .. Probab=24.59 E-value=20 Score=15.34 Aligned_cols=44 Identities=23% Similarity=0.280 Sum_probs=20.7 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCEEEEE---------ECCCCHHHHHHHHHHHCC Q ss_conf 787777899999999984064302777---------248998999999998414 Q gi|254781051|r 155 LGLESATIPEIVDVLSHLYCSNIGVEF---------MHIVDSIERDWVRNTIED 199 (957) Q Consensus 155 ~g~~~~tL~eIi~~L~~tYCgsIGvEy---------mHI~d~eer~Wlq~riE~ 199 (957) +++..++|.||-+.|-+.=--.=||=+ .-+ ++-||+=|.++|=+ T Consensus 135 lNG~~~~l~ei~d~L~~~gI~p~gYNvVlQGDvt~~I~M-s~~ERRkiIdeiAG 187 (1202) T TIGR02169 135 LNGKSVRLSEIHDFLAAAGIYPEGYNVVLQGDVTKIISM-SAVERRKIIDEIAG 187 (1202) T ss_pred ECCCCCCHHHHHHHHHHCCCCCCCCEEEEECCHHHHCCC-CHHHHHHHHHHHCC T ss_conf 708203576689999861768898706743541231038-84356788888548 No 278 >TIGR00655 PurU formyltetrahydrofolate deformylase; InterPro: IPR004810 An Escherichia coli gene designated purU has been identified and characterised. The gene codes for a 280-amino-acid protein, PurU (P37051 from SWISSPROT, 3.5.1.10 from EC). PurU is an enzyme that catalyses the hydrolysis of 10-formyltetrahydrofolate (formyl-FH4) to FH4 and formate ,.10-formyltetrahydrofolate + H(2)O = formate +tetrahydrofolate Formyl-FH4 hydrolase generates the formate that is used by purT-encoded 5'-phosphoribosylglycinamide transformylase for step three of de novo purine nucleotide synthesis. Formyl-FH4 hydrolase, a hexamer of 32kDa subunits, is activated by methionine and inhibited by glycine. Heterotropic cooperativity is observed for activation by methionine in the presence of glycine and for inhibition by glycine in the presence of methionine. These results suggest that formyl-FH4 hydrolase is a regulatory enzyme whose main function is to balance the pools of FH4 and C1-FH4 in response to changing growth conditions. The enzyme uses methionine and glycine to sense the pools of C1-FH4 and FH4, respectively. ; GO: 0008864 formyltetrahydrofolate deformylase activity, 0006189 'de novo' IMP biosynthetic process. Probab=24.46 E-value=41 Score=12.96 Aligned_cols=70 Identities=11% Similarity=0.059 Sum_probs=42.8 Q ss_pred CCHHHHHHHHHHHHCCCEEEEEE-CCCCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH-HHHH Q ss_conf 78999999999840643027772-489989999999984145778887988999999999999989999-9864 Q gi|254781051|r 160 ATIPEIVDVLSHLYCSNIGVEFM-HIVDSIERDWVRNTIEDYDFSSNFSKEEQRDILDKLVRAEGFEKF-IDIK 231 (957) Q Consensus 160 ~tL~eIi~~L~~tYCgsIGvEym-HI~d~eer~Wlq~riE~~~~~~~~s~eeK~~iL~~L~~Ae~FE~F-L~~K 231 (957) ...-=+.|.|=+.|.|..-+|-. =|+|-+.=+=+-++|.-+-...+-+++.| ....=-.+|.+++| |..+ T Consensus 101 Ke~HCL~DLL~r~~~GeL~~ei~~VIsNH~~lr~lve~F~iPF~~~p~~~~~r--~e~Ek~~Lell~~yelkh~ 172 (294) T TIGR00655 101 KEDHCLGDLLWRWYSGELDAEIALVISNHEDLRELVERFGIPFHYIPATKDTR--LEHEKKELELLKQYELKHK 172 (294) T ss_pred CCHHHHHHHHHHHCCCCCEEEEEEEECCCHHHHHHHHHCCCCEEEECCCCCHH--HHHHHHHHHHHHCCCCCEE T ss_conf 41203589999873898237999986498789988665389889863788505--6899999998740332110 No 279 >COG1444 Predicted P-loop ATPase fused to an acetyltransferase [General function prediction only] Probab=24.44 E-value=28 Score=14.28 Aligned_cols=46 Identities=30% Similarity=0.584 Sum_probs=22.5 Q ss_pred CCCCHHHHHH-HHHHH-HHHCCC--CCCCC------HHHHHHHHHHHHHCCC-EEEEEC Q ss_conf 6205677888-99899-873343--56421------1478988898863583-157850 Q gi|254781051|r 581 FNTHKIVERL-MANRQ-KMIETG--KGIDW------SMAESLAFGSLCYEGY-KVRLSG 628 (957) Q Consensus 581 f~~h~~v~k~-~~~R~-~~~~~~--~~iDW------~~AE~LA~~SLl~eG~-~VRlsG 628 (957) ..+||-+.+. +..|. +++... .++|| .+.|.+.| =+..|| +|+||- T Consensus 537 IAvhPe~q~~GiGsrlL~~l~~~a~~~~DwlgvsFG~t~~L~rF--W~rnGF~pVhls~ 593 (758) T COG1444 537 IAVHPELQRMGIGSRLLALLIEEARKGLDWLGVSFGYTEELLRF--WLRNGFVPVHLSP 593 (758) T ss_pred EEECHHHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHH--HHHCCEEEEEECC T ss_conf 87088786358889999999999845898897456788899999--9975907987368 No 280 >KOG2582 consensus Probab=24.35 E-value=42 Score=12.95 Aligned_cols=78 Identities=21% Similarity=0.273 Sum_probs=46.9 Q ss_pred CCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 46725887645106774436579998416020124701024677840011220340587426788999999999999999 Q gi|254781051|r 383 GQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRM 462 (957) Q Consensus 383 GQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~ 462 (957) .|-|-.+.+||=+ -|+||.++|--|=||++|.+--|.. .|-| ...-|..-.+. -. T Consensus 332 ~qevek~Ilqmie------~~~i~a~iNG~v~f~~n~e~~~Spe-------------M~~n--k~~~~~~L~e~----l~ 386 (422) T KOG2582 332 AQEVEKYILQMIE------DGEIFASINGMVFFTDNPEKYNSPE-------------MHEN--KIDLCIQLIEA----LK 386 (422) T ss_pred HHHHHHHHHHHHC------CCCEEEEECCEEEEECCCCCCCCHH-------------HHHH--HHHHHHHHHHH----HH T ss_conf 6899999999851------6734898624489823835479988-------------8764--77899999999----98 Q ss_pred HHCCCEEEEEEECCCCCCCCCCC Q ss_conf 73897399864211256564354 Q gi|254781051|r 463 KFHKSVVIDIVCYRRFGHNEGDE 485 (957) Q Consensus 463 ~F~kDVvIDlv~YRr~GHNE~De 485 (957) .|...+-.+=+-||+-+---.|+ T Consensus 387 ~~e~si~l~s~~~~K~m~~~~d~ 409 (422) T KOG2582 387 AMEESIRLNSVYVSKSMGSQEDD 409 (422) T ss_pred HCCHHEEECCEEEEHHHHHHHCC T ss_conf 40011550756662655520011 No 281 >PRK12713 flgJ peptidoglycan hydrolase; Provisional Probab=24.27 E-value=31 Score=13.91 Aligned_cols=30 Identities=7% Similarity=0.160 Sum_probs=11.1 Q ss_pred CCHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 623889998347636789689999999987 Q gi|254781051|r 2 LQDLNKMFSLSSFLDGTNCCYIEDLYKSYQ 31 (957) Q Consensus 2 ~~~~~~~f~~sSFL~g~Na~YIEeLYekYL 31 (957) +|+|-+....++.|++....+.++||++=+ T Consensus 55 lKsMR~a~~~~~l~~s~~~~~y~~m~DqQl 84 (339) T PRK12713 55 LKRMREATPKEGLFDSQQTEMLQGMADEQL 84 (339) T ss_pred HHHHHHCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 999984677788888717789999999999 No 282 >COG3769 Predicted hydrolase (HAD superfamily) [General function prediction only] Probab=24.24 E-value=42 Score=12.93 Aligned_cols=36 Identities=25% Similarity=0.329 Sum_probs=22.0 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHH Q ss_conf 578777789999999998406430277724899899999 Q gi|254781051|r 154 VLGLESATIPEIVDVLSHLYCSNIGVEFMHIVDSIERDW 192 (957) Q Consensus 154 ~~g~~~~tL~eIi~~L~~tYCgsIGvEymHI~d~eer~W 192 (957) -+|..-.++|+.++.|+++|-= .-|.-..+.+.-.| T Consensus 98 elg~~l~~ire~l~kLee~~g~---~~~~~~d~~ei~e~ 133 (274) T COG3769 98 ELGKVLEKIREKLDKLEEHFGF---TTFDDVDDEEIAEW 133 (274) T ss_pred EHHHHHHHHHHHHHHHHHHHCE---EEECCCCHHHHHHH T ss_conf 3011199999999988887295---67346778999987 No 283 >pfam02676 TYW3 Methyltransferase TYW3. The methyltransferase TYW3 (tRNA-yW- synthesising protein 3) has been identified in yeast to be involved in wybutosine (yW) biosynthesis. yW is a complexly modified guanosine residue that contains a tricyclic base and is found at the 3' position adjacent the anticodon of phenylalanine tRNA. TYW3 is an N-4 methylase that methylates yW-86 to yield yW-72 in an Ado-Met-dependent manner. Probab=24.15 E-value=42 Score=12.92 Aligned_cols=72 Identities=26% Similarity=0.212 Sum_probs=41.4 Q ss_pred HHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEE------EEECCCCCCCCCCCCCCCCHHHHHHHHHCCCHH Q ss_conf 11220340587426788999999999999999738973998------642112565643542235578888867189989 Q gi|254781051|r 431 ISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVID------IVCYRRFGHNEGDEPSFTQPMMYKTIRSHKSVL 504 (957) Q Consensus 431 iAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvID------lv~YRr~GHNE~DeP~fTQP~mY~~I~~hp~v~ 504 (957) |--.++.||+||-+-|.|+.....++|.+ .-|+.-=++- +|+-| +..-+|-|-..+- +.-|. T Consensus 90 i~~k~ep~IlHV~cr~le~A~~Ll~~A~~--~GFk~SGi~s~~~~~~iV~Ir--s~~~le~Pl~~~g--------~~lV~ 157 (191) T pfam02676 90 IWLKFEPPILHVQCRDLEDAQKLLSAARS--SGFRESGIKSLSGKRVIVEIR--STERLDVPIGYDG--------KLLVD 157 (191) T ss_pred EEEEECCCEEEEEECCHHHHHHHHHHHHH--CCCCHHCCEECCCCCEEEEEE--CCCCEEEEEECCC--------CCCCC T ss_conf 89992098989995889999999999997--383121737516895699993--4674666760399--------47469 Q ss_pred HHHHHHHHHC Q ss_conf 9999998644 Q gi|254781051|r 505 QLYADSLMKN 514 (957) Q Consensus 505 ~~Y~~~Li~~ 514 (957) +-|.+.|++. T Consensus 158 ~eyl~~lv~~ 167 (191) T pfam02676 158 EEYLKFLVEI 167 (191) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 284 >cd01020 TroA_b Metal binding protein TroA_b. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=23.74 E-value=43 Score=12.87 Aligned_cols=105 Identities=20% Similarity=0.258 Sum_probs=66.6 Q ss_pred CCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCCCCC-CCHHHHH Q ss_conf 470102467784001122034058742678899999999999999973897399864211256564354223-5578888 Q gi|254781051|r 417 TNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFRMKFHKSVVIDIVCYRRFGHNEGDEPSF-TQPMMYK 495 (957) Q Consensus 417 T~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR~~F~kDVvIDlv~YRr~GHNE~DeP~f-TQP~mY~ 495 (957) ++|++-. +=++|++|+-+|=+|=.||-+-|. +..+++- -.+..++..++.--+++.++|+|.+ +-|.... T Consensus 35 ~dPH~ye--p~p~d~~~l~~Adlvv~nG~~lE~--wl~k~~~-----~~~~~~~~~~~~~~~~~~~~~dPHiWldp~~~~ 105 (264) T cd01020 35 VDPHDFE--PTPTDAAKVSTADIVVYNGGGYDP--WMTKLLA-----DTKDVIVIAADLDGHDDKEGDNPHLWYDPETMS 105 (264) T ss_pred CCCCCCC--CCHHHHHHHHCCCEEEECCCCCHH--HHHHHHH-----CCCCCEEEECCCCCCCCCCCCCCCCCCCHHHHH T ss_conf 8965687--999999999629899995888168--9999985-----289966883366666677999989377999999 Q ss_pred HHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 86718998999999986448989899999999999997756 Q gi|254781051|r 496 TIRSHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAE 536 (957) Q Consensus 496 ~I~~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~ 536 (957) .+.+ .-++.|.+.---.++.+++-.+.|.+.|++- T Consensus 106 ~~a~------~I~~~L~~~dP~~~~~y~~N~~~~~~~L~~L 140 (264) T cd01020 106 KVAN------ALADALVKADPDNKKYYQANAKKFVASLKPL 140 (264) T ss_pred HHHH------HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH T ss_conf 9999------9999999869021799999999999999999 No 285 >KOG0674 consensus Probab=23.66 E-value=20 Score=15.43 Aligned_cols=40 Identities=30% Similarity=0.487 Sum_probs=30.6 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEECCCCC Q ss_conf 0002442454102256554202576233368135436888 Q gi|254781051|r 696 EAQFGDFANGAQVILDQFITAGEQKWLCTSNLVCLLPHGY 735 (957) Q Consensus 696 EAQFGDF~NgAQviiDqfiss~e~KW~~~sglvllLPHGy 735 (957) -|-|-+|+|-++..+=||-.+-|+||..--|-|-|+|--- T Consensus 79 sa~F~~FsnK~kTLv~q~tVkheQ~~dcgggyiKl~~~d~ 118 (406) T KOG0674 79 SAKFKPFSNKGKTLVIQFTVKHEQKIDCGGGYIKLFPADL 118 (406) T ss_pred ECCCCCCCCCCCEEEEEEEECCCCCCCCCCEEEEEEECCC T ss_conf 2016654456855899998335401467750588631545 No 286 >cd06422 NTP_transferase_like_1 NTP_transferase_like_1 is a member of the nucleotidyl transferase family. This is a subfamily of nucleotidyl transferases. Nucleotidyl transferases transfer nucleotides onto phosphosugars. The activated sugars are precursors for synthesis of lipopolysaccharide, glycolipids and polysaccharides. Other subfamilies of nucleotidyl transferases include Alpha-D-Glucose-1-Phosphate Cytidylyltransferase, Mannose-1-phosphate guanyltransferase, and Glucose-1-phosphate thymidylyltransferase. Probab=23.64 E-value=43 Score=12.85 Aligned_cols=26 Identities=15% Similarity=0.418 Sum_probs=22.0 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 99999999987379107998067655 Q gi|254781051|r 248 PAIEEIIRQGVQQGVDEMILGMAHRG 273 (957) Q Consensus 248 P~L~~ii~~a~~~Gv~eiviGMaHRG 273 (957) |+++.+|+...+.|+++|+|---|++ T Consensus 31 pii~~~i~~l~~~gi~~iii~~~y~~ 56 (221) T cd06422 31 PLIDHALDRLAAAGIRRIVVNTHHLA 56 (221) T ss_pred EHHHHHHHHHHHCCCCEEEEECCCCH T ss_conf 99999999999829967998325327 No 287 >PRK12712 flgJ peptidoglycan hydrolase; Provisional Probab=23.64 E-value=33 Score=13.71 Aligned_cols=32 Identities=6% Similarity=0.135 Sum_probs=20.9 Q ss_pred CCCHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 96238899983476367896899999999873 Q gi|254781051|r 1 MLQDLNKMFSLSSFLDGTNCCYIEDLYKSYQE 32 (957) Q Consensus 1 M~~~~~~~f~~sSFL~g~Na~YIEeLYekYL~ 32 (957) |+|+|-+...+++.|++....+..+||++=+. T Consensus 54 mlKsMR~a~~~~~~~~s~~~~~y~~m~DqQla 85 (344) T PRK12712 54 VLKSMRDATPQDGLFDNEQSKLYMSMMDQQLA 85 (344) T ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99999846777888787167899999999999 No 288 >cd01989 STK_N The N-terminal domain of Eukaryotic Serine Threonine kinases. The Serine Threonine kinases are enzymes that belong to a very extensive family of proteins which share a conserved catalytic core common with both serine/threonine and tyrosine protein kinases. The N-terminal domain is homologous to the USP family which has a ATP binding fold. The N-terminal domain is predicted to be involved in ATP binding. Probab=23.52 E-value=43 Score=12.83 Aligned_cols=50 Identities=10% Similarity=0.242 Sum_probs=39.5 Q ss_pred HHHHHHHHHHHCCCCEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHH Q ss_conf 989999986403362222564658999999999998737910799806765526 Q gi|254781051|r 222 EGFEKFIDIKYKGAKRFGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRL 275 (957) Q Consensus 222 e~FE~FL~~Kf~g~KRFsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRL 275 (957) +-|.+++..|=+..+-.-+||.+ .=+.|++...+++|+.+|||.+=||.+ T Consensus 68 ~~~~~~C~~~~V~~e~~v~e~~d----~~~~I~e~v~~~~i~~LVmGs~~~~~~ 117 (146) T cd01989 68 LPYRCFCSRKGVQCEDVVLEDDD----VAKAIVEYVADHGITKLVMGASSDNHF 117 (146) T ss_pred HHHHHHHHHCCCEEEEEEEECCC----HHHHHHHHHHHCCCCEEEECCCCCCCE T ss_conf 99999998759728999983488----889999999975998999862799851 No 289 >TIGR00908 2A0305 ethanolamine permease; InterPro: IPR004757 Amino acid permeases are integral membrane proteins involved in the transport of amino acids into the cell. A number of such proteins have been found to be evolutionary related , , . These proteins seem to contain up to 12 transmembrane segments. The best conserved region in this family is located in the second transmembrane segment.. Probab=23.49 E-value=23 Score=14.87 Aligned_cols=26 Identities=19% Similarity=0.497 Sum_probs=16.4 Q ss_pred HHHHHHHH--HCCCCCCCEEEECCCHHH Q ss_conf 99999997--435348868985660221 Q gi|254781051|r 770 YFHILRRQ--IYDRSSRPLIMMAPKSLL 795 (957) Q Consensus 770 ~FH~LRRQ--~~r~~rkPLiv~tPKsLL 795 (957) +|.|=||| |.||||-|+-+++|--=| T Consensus 381 ~f~LR~~~P~meRPyr~p~~i~~p~~AL 408 (456) T TIGR00908 381 FFKLRKKRPKMERPYRVPLYILLPAVAL 408 (456) T ss_pred HHHHHCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 9997124888778842763356799999 No 290 >cd06251 M14_ASTE_ASPA_like_1 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=23.38 E-value=43 Score=12.82 Aligned_cols=86 Identities=26% Similarity=0.337 Sum_probs=53.0 Q ss_pred EEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEE-CCCEECCCCCHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHH Q ss_conf 862575333467258876451067744365799984-1602012470102467784001122034058742678899999 Q gi|254781051|r 374 IIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLII-NNQIGFTTNPSSARSCTYASDISKSIGIPIFHVNGDDPEAVIR 452 (957) Q Consensus 374 liHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVv-NNQIGFTT~p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~ 452 (957) -+|||- +.|+-++.+.+. .|..-..-|||=+|= =|.+||... +|.+.=-+|+-+++-- --+|.. T Consensus 28 gvHGdE-~~G~~~~~~L~~--~l~~~~l~G~iiivP~~N~~a~~~~---~r~~~dg~nlNR~FPG---~~~Gs~------ 92 (287) T cd06251 28 AIHGDE-LNGVEIIRRLLR--QLDPKTLRGTVIAVPVVNVFGFLNQ---SRYLPDRRDLNRSFPG---SKNGSL------ 92 (287) T ss_pred CCCCCH-HHHHHHHHHHHH--HCCCCCCCEEEEEEECCCHHHHHHC---CCCCCCCCCHHHCCCC---CCCCCH------ T ss_conf 767541-679999999997--3880017547999924899999726---5458998755200799---999999------ Q ss_pred HHHHHHHHHHH-HC-CCEEEEEEE Q ss_conf 99999999997-38-973998642 Q gi|254781051|r 453 VVRMAVSFRMK-FH-KSVVIDIVC 474 (957) Q Consensus 453 ~~~lA~~yR~~-F~-kDVvIDlv~ 474 (957) .-++|...-++ +. .|++|||=| T Consensus 93 teriA~~l~~~l~~~aD~~iDLHs 116 (287) T cd06251 93 ASRIAHLFFTEILSHADYGIDLHT 116 (287) T ss_pred HHHHHHHHHHHHHHHCCEEEEECC T ss_conf 999999999998854779999426 No 291 >TIGR02382 wecD_rffC TDP-D-fucosamine acetyltransferase; InterPro: IPR012752 This entry represents the WecD protein (formerly RffC) for the biosynthesis of enterobacterial common antigen (ECA), an outer leaflet, outer membrane glycolipid with a trisaccharide repeat unit. WecD is a member of the GNAT family of acetyltransferases.. Probab=23.24 E-value=35 Score=13.47 Aligned_cols=37 Identities=22% Similarity=0.441 Sum_probs=29.4 Q ss_pred CCCCCCCCCCCEEEEEECCCC-CEEEEHHHCCCCCCEE Q ss_conf 100223332302799977888-4455634427898459 Q gi|254781051|r 629 QDCERGTFSHRHAILCDQETG-KRYFPLGNISKDQGHC 665 (957) Q Consensus 629 QD~~RGTFshRHavl~dq~t~-~~y~pL~~l~~~q~~f 665 (957) |..-+|||-|-=.++.|-... .-|+-|..|.+.-++. T Consensus 99 EnAV~GTFDHqCL~~~da~g~p~G~VtLR~L~d~dARI 136 (201) T TIGR02382 99 ENAVLGTFDHQCLLLRDAAGDPRGYVTLRELDDSDARI 136 (201) T ss_pred HHHCCCCCCCHHHHHHCCCCCCEEEEEEEECCCCCCCE T ss_conf 86505676401352224477940468740047876533 No 292 >pfam08754 consensus Probab=23.17 E-value=44 Score=12.79 Aligned_cols=37 Identities=24% Similarity=0.478 Sum_probs=24.5 Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHHHCC-CEEEEEEEC Q ss_conf 5874267889999999999999997389-739986421 Q gi|254781051|r 439 IFHVNGDDPEAVIRVVRMAVSFRMKFHK-SVVIDIVCY 475 (957) Q Consensus 439 I~HVN~DdpEav~~~~~lA~~yR~~F~k-DVvIDlv~Y 475 (957) ||||+.|||+-.-.+..-+--+-+-+.. ++=|-+|++ T Consensus 3 v~hvd~~d~~~~~~~L~n~~N~~~~~~~~~~~I~vV~~ 40 (113) T pfam08754 3 VFHIDSNDPEKVNLVLSNINNLLNAYGEENLEIEVVAH 40 (113) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEE T ss_conf 99936888899999999999999746887548999980 No 293 >PRK10446 ribosomal protein S6 modification protein; Provisional Probab=23.13 E-value=25 Score=14.56 Aligned_cols=49 Identities=20% Similarity=0.227 Sum_probs=30.8 Q ss_pred HHHHHHHHHHHCCCHHHCCCCCCCCCCC-CCCCCCCHHHCCCCHHHCCCEEE Q ss_conf 9999999998817045401677777766-78776787666999788171572 Q gi|254781051|r 100 QVMKMIDAYRSYGHFKANIDPLGYNSHQ-KDLSELSPAHYGFVKADYDRKIC 150 (957) Q Consensus 100 rV~~LI~AYR~~GHl~A~lDPLgl~~~~-~~~peL~~~~yGLse~DLd~~f~ 150 (957) ...+|+.|.+.|||...-+||+...-.. ...+.+ .+.|..-.|+|..++ T Consensus 14 st~RL~eaa~~rGh~v~vidp~~~~~~i~~~~~~v--~~~g~~L~~~Davip 63 (300) T PRK10446 14 SCKRLREAAIQRGHLVEILDPLSCYMNINPAASSI--HYKGRKLPHFDAVIP 63 (300) T ss_pred HHHHHHHHHHHCCCEEEEEEHHHCEEEECCCCCEE--EECCEECCCCCEEEE T ss_conf 79999999998799599961489189824898649--989808687888998 No 294 >PRK13463 phosphatase PhoE; Provisional Probab=23.13 E-value=44 Score=12.78 Aligned_cols=97 Identities=12% Similarity=0.109 Sum_probs=57.2 Q ss_pred HHHHHHHHHHHCCCCEECCCCHHHHHHHHHHHHHH---HHHCCCCEEEEECCCCCHHHHHH-HHHCCCHHHHHHHHCCCC Q ss_conf 98999998640336222256465899999999999---87379107998067655268999-885379899999855887 Q gi|254781051|r 222 EGFEKFIDIKYKGAKRFGADGSEVIIPAIEEIIRQ---GVQQGVDEMILGMAHRGRLNVLS-QIMNKLPRSIFYEFKGKG 297 (957) Q Consensus 222 e~FE~FL~~Kf~g~KRFsLEG~EslIP~L~~ii~~---a~~~Gv~eiviGMaHRGRLNVL~-Nvl~Kp~~~iF~EF~g~~ 297 (957) +.+..|... ...|...||||+.-+.+.+.+. ..+..-.+-|+=.+|-|=+.+|. .++|.|++.++.... T Consensus 103 ~~~~~~~~~----~~~~~~~~GEs~~~~~~Rv~~~l~~i~~~~~~~~vlvVsHg~~ir~ll~~~lg~~~~~~~~~~~--- 175 (203) T PRK13463 103 DDIQLFWNE----PHLFQSTSGENFEAVHKRVIEGMQLLLEKHKGESILIVSHAAAAKLLVGHFAGIEIENVWDDPF--- 175 (203) T ss_pred HHHHHHHHC----CCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHHHHCCCHHHHHCCCC--- T ss_conf 999998629----1226899998999999999999999998889891999939799999999981989789725688--- Q ss_pred CCCCCCCCCCCCCCCEEEEEEECCCEEEEEECCCCCCCH Q ss_conf 432234555100150268898359618999717877310 Q gi|254781051|r 298 PSEKEYSGDVKYHLGLCCNRQICGKDVKLLLQSNPSHLE 336 (957) Q Consensus 298 ~~~~~~sGDVKYHLG~s~~~~~~g~~v~lsL~~NPSHLE 336 (957) . +-+.-+.+.++++.-.|...++-|||| T Consensus 176 -~----------~n~sisvi~~~~g~~~l~~~nd~sHL~ 203 (203) T PRK13463 176 -M----------HSASLSIIEFEDGKGEVKQFADISHFQ 203 (203) T ss_pred -C----------CCCEEEEEEEECCEEEEEEECCCCCCC T ss_conf -7----------676799999989969999981313259 No 295 >TIGR02744 TrbI_Ftype type-F conjugative transfer system protein TrbI; InterPro: IPR014115 This entry represents TrbI, an essential component of the F-type conjugative transfer system for plasmid DNA transfer that has been shown to be localized to the periplasm , .. Probab=23.08 E-value=44 Score=12.77 Aligned_cols=41 Identities=29% Similarity=0.425 Sum_probs=35.5 Q ss_pred HCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 189989999999864489898999999999999977566654 Q gi|254781051|r 499 SHKSVLQLYADSLMKNQVISKQELQSLANNWHKYLEAEYKES 540 (957) Q Consensus 499 ~hp~v~~~Y~~~Li~~g~it~~e~~~~~~~~~~~l~~~~~~~ 540 (957) ..+.+.+-|.+.|.+.. +|+++.++..+.|...|++.++.- T Consensus 38 dmK~tldaf~~~l~~~~-l~E~~~k~~~~rF~~~L~~~L~~~ 78 (112) T TIGR02744 38 DMKSTLDAFFDSLSQKK-LSEAQQKKLLKRFNALLEAELQAW 78 (112) T ss_pred CCHHHHHHHHHHHHHHH-CCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 10357408999998720-378999888999988779999999 No 296 >TIGR03234 OH-pyruv-isom hydroxypyruvate isomerase. This enzyme interconverts tartronate semi-aldehyde (TSA, aka 2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E. coli enzyme has been characterized and found to be specific for TSA, contain no cofactors, and have a rather high Km for hydroxypyruvate of 12.5 mM. The gene is ofter found in association with glyoxalate carboligase (which produces TSA), but has been shown to have no effect on growth on glyoxalate when knocked out. This is consistent with the fact that the gene for tartronate semialdehyde reductase (glxR) is also associated and may have primary responsibility for the catabolism of TSA. Probab=23.04 E-value=44 Score=12.77 Aligned_cols=54 Identities=9% Similarity=0.126 Sum_probs=26.9 Q ss_pred HHHHHHHHHHHHHHHHHHHC----CCCEECCCC---HHHHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 99999999989999986403----362222564---6589999999999987379107998 Q gi|254781051|r 214 ILDKLVRAEGFEKFIDIKYK----GAKRFGADG---SEVIIPAIEEIIRQGVQQGVDEMIL 267 (957) Q Consensus 214 iL~~L~~Ae~FE~FL~~Kf~----g~KRFsLEG---~EslIP~L~~ii~~a~~~Gv~eivi 267 (957) ..+.+.++-.+-+-|..+.+ |.-..+... -+.++-.|..+.+.+.+.||+=.+- T Consensus 82 ~~~~~~~ai~~a~~lg~~~i~~~~G~~~~~~~~e~~~~~~~e~l~~~a~~a~~~Gv~l~iE 142 (254) T TIGR03234 82 FREGVALAIAYARALGCPQVNCLAGKRPAGVSPEEARATLVENLRYAADALDRIGLTLLIE 142 (254) T ss_pred HHHHHHHHHHHHHHHCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 8873899999999968986886756888899989999999999999999999659889886 No 297 >pfam10544 T5orf172 T5orf172 domain. This domain was identified by Iyer and colleagues. Probab=23.01 E-value=27 Score=14.31 Aligned_cols=40 Identities=20% Similarity=0.258 Sum_probs=17.1 Q ss_pred CEECCCCCHHCCCCCCCCHHHHHHC-CCEEE--ECCCCHHHHHH Q ss_conf 0201247010246778400112203-40587--42678899999 Q gi|254781051|r 412 QIGFTTNPSSARSCTYASDISKSIG-IPIFH--VNGDDPEAVIR 452 (957) Q Consensus 412 QIGFTT~p~d~RSs~Y~TDiAK~i~-aPI~H--VN~DdpEav~~ 452 (957) -||+|++|. .|-+.+.+....... .-++. .|+...|..++ T Consensus 8 KIG~T~n~~-~R~~~l~~~~~~~~~~~~~~~~~~~~~~~E~~lh 50 (79) T pfam10544 8 KIGYTTNLE-RRLKELNTGSPFPDELVAVIELVDNARKLERLLH 50 (79) T ss_pred EEECCCCHH-HHHHHHCCCCCCCCEEEEEEEECCCHHHHHHHHH T ss_conf 897729999-9999863579999758999985299999999999 No 298 >cd06425 M1P_guanylylT_B_like_N N-terminal domain of the M1P-guanylyltransferase B-isoform like proteins. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain and a C-terminal Lefthanded-beta-Helix fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repression of GDP-mannose pyrophosphorylase in yeast leads to phenotypes, such as cell lysis, defective cell wall, and failure of polarized growth and cell separation. Probab=22.86 E-value=43 Score=12.80 Aligned_cols=26 Identities=38% Similarity=0.533 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 99999999987379107998067655 Q gi|254781051|r 248 PAIEEIIRQGVQQGVDEMILGMAHRG 273 (957) Q Consensus 248 P~L~~ii~~a~~~Gv~eiviGMaHRG 273 (957) |+++.+|+...+.|+++|+|-..|++ T Consensus 32 pli~~~i~~l~~~gi~~iii~~~y~~ 57 (233) T cd06425 32 PMIEHQIEALAKAGVKEIILAVNYRP 57 (233) T ss_pred EHHHHHHHHHHHCCCCCCCCHHHHHH T ss_conf 69999999999859973210112357 No 299 >PRK09060 dihydroorotase; Validated Probab=22.61 E-value=27 Score=14.31 Aligned_cols=39 Identities=26% Similarity=0.407 Sum_probs=26.0 Q ss_pred CHHHHHHHCCCCCEEEEECCCHHHHHHHHHH-HHCCCCCCCEEEECCCH Q ss_conf 0578875214378269724889999999999-74353488689856602 Q gi|254781051|r 746 RLERFLQMCAENNMYVANCTSPANYFHILRR-QIYDRSSRPLIMMAPKS 793 (957) Q Consensus 746 RlERfLQl~a~~N~~V~~~ttpAq~FH~LRR-Q~~r~~rkPLiv~tPKs 793 (957) -+||+..++ ++.||+.|-+-++ .+.-.+.==|||+-|+. T Consensus 345 sl~~~v~~~---------s~nPAki~GL~~kG~i~~G~dADlvi~Dp~~ 384 (444) T PRK09060 345 SLERFVDLT---------SAGPARIFGIAGKGRIAVGYDADFTIVDLKR 384 (444) T ss_pred CHHHHHHHH---------HHHHHHHHCCCCCCCCCCCCCCCEEEECCCC T ss_conf 299999997---------1749998498988735799966589995999 No 300 >PRK08948 consensus Probab=22.48 E-value=38 Score=13.27 Aligned_cols=11 Identities=9% Similarity=0.139 Sum_probs=4.2 Q ss_pred CHHHHHHHHHH Q ss_conf 99999999887 Q gi|254781051|r 563 SKEILKKIGSS 573 (957) Q Consensus 563 ~~~~L~~i~~~ 573 (957) +...++.+|-. T Consensus 360 ~~~~~R~~gl~ 370 (392) T PRK08948 360 PLVVGRNLGLM 370 (392) T ss_pred HHHHHHHHHHH T ss_conf 89999999999 No 301 >PRK12753 transketolase; Reviewed Probab=22.39 E-value=45 Score=12.68 Aligned_cols=184 Identities=14% Similarity=0.190 Sum_probs=108.5 Q ss_pred CEEEEEECCCCCCCHHHHHHCCCCCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCEEECCCCCCCC---C Q ss_conf 4599982431111013432022113865311100002442454102256554202576233368135436888888---8 Q gi|254781051|r 663 GHCEVVNSFLSEQAVLGFEYGYSLNNLNALTIWEAQFGDFANGAQVILDQFITAGEQKWLCTSNLVCLLPHGYEGQ---G 739 (957) Q Consensus 663 ~~f~v~nS~LSE~avlgFEyGys~~~p~~L~iWEAQFGDF~NgAQviiDqfiss~e~KW~~~sglvllLPHGyeGq---G 739 (957) ++| ++-=-.|.+..++--|.++.. .|.-+=+-|.=| .|-+.||-.-++-. ....+..+-|.==|- | T Consensus 401 gr~--~~~GI~E~~m~~~a~Gla~~g--gl~P~~~tf~~F---~~~~~~~ir~~a~~----~~~~i~v~Thd~i~vGeDG 469 (662) T PRK12753 401 GNY--IHYGVREFGMTAIANGIAHHG--GFVPYTATFLMF---VEYARNAARMAALM----KARQIMVYTHDSIGLGEDG 469 (662) T ss_pred CCE--EEECHHHHHHHHHHHHHHHCC--CCEEEEEEHHHH---HHHHHHHHHHHHHH----CCCCEEEEECCCCCCCCCC T ss_conf 762--661257887999998998728--974887659999---99868999999872----5882799953554046887 Q ss_pred CCCCCCCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHCCCCCCCEEEECCCH-HHCCCCCCCCHHHHCCCCCEEECCC Q ss_conf 765644057887521437826972488999999999974353488689856602-2108012688789379986035157 Q gi|254781051|r 740 PEHSSARLERFLQMCAENNMYVANCTSPANYFHILRRQIYDRSSRPLIMMAPKS-LLRHKRVVSSLSDMTCGSVFQAVLS 818 (957) Q Consensus 740 PEHSSaRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ~~r~~rkPLiv~tPKs-LLR~~~a~S~l~d~~~g~~F~~vi~ 818 (957) |-|-. +|-.--|.+=.||.|..|.-.-.--++++.-+.+. ..|-.+..+.. +--.+......+++..|. -|+- T Consensus 470 pTHQ~--ie~la~lR~iPn~~v~~PaD~~E~~~a~~~a~~~~-~gP~~l~l~R~~~~~~~~~~~~~~~~~~G~---yil~ 543 (662) T PRK12753 470 PTHQP--VEQLASLRLTPNFSTWRPCDQVEAAVAWKLAIERH-NGPTALILSRQNLAQVERTPEQVKNIARGG---YILK 543 (662) T ss_pred CCCCH--HHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHCC-CCCEEEEEECCCCCCCCCCCHHHHCCCCEE---EEEE T ss_conf 65330--78999998555877993188899999999999625-897499985787787688711132156525---9995 Q ss_pred CCCCCCCCCCCCCCCHHHCCEEEEECCHHHHHHHHHHHHCCCC--CEEEEEEECCCCCHH Q ss_conf 7643344454445795457689986885789999999976998--689998401486828 Q gi|254781051|r 819 DDAEYHGKTSVKLKEDSHIRRVILCTGKVYYDLLDNRDMRNIA--DIYLLRIEQLYPFPE 876 (957) Q Consensus 819 d~~~~~~~~~~~~~~~~~v~rvi~csGKvyydL~~~r~~~~~~--~vaivRiEqLyPfP~ 876 (957) |.. .+.+=.|+.||-.-..-+++.++...+ ++.+|-+--+-||-. T Consensus 544 ~~~-------------~~pdv~iiatGs~v~~Al~Aa~~L~~~gi~~~Vvs~~s~~~~d~ 590 (662) T PRK12753 544 DSG-------------GKPDLILIATGSEVEITLQAAEKLTGEGRNVRVVSMPSTDIFDA 590 (662) T ss_pred ECC-------------CCCCEEEEECCHHHHHHHHHHHHHHHCCCCEEEEECCCCHHHHH T ss_conf 059-------------99989999630999999999999996799789997898548773 No 302 >PRK11069 recC exonuclease V subunit gamma; Provisional Probab=22.39 E-value=41 Score=12.98 Aligned_cols=12 Identities=17% Similarity=0.407 Sum_probs=6.5 Q ss_pred HHHHHHHHHHHH Q ss_conf 689999999987 Q gi|254781051|r 20 CCYIEDLYKSYQ 31 (957) Q Consensus 20 a~YIEeLYekYL 31 (957) +.||-+||.+-+ T Consensus 66 ~~fiw~l~~~vl 77 (1122) T PRK11069 66 ATFIWDMFTRVL 77 (1122) T ss_pred HHHHHHHHHHHC T ss_conf 999999999864 No 303 >cd05817 CBM20_DSP Dual-specificity phosphatase (DSP), N-terminal CBM20 (carbohydrate-binding module, family 20) domain. This CBM20 domain is located at the N-terminus of a protein tyrosine phosphatase of unknown function found in slime molds and ciliated protozoans. The CBM20 domain is found in a large number of starch degrading enzymes including alpha-amylase, beta-amylase, glucoamylase, and CGTase (cyclodextrin glucanotransferase). CBM20 is also present in proteins that have a regulatory role in starch metabolism in plants (e.g. alpha-amylase) or glycogen metabolism in mammals (e.g. laforin). CBM20 folds as an antiparallel beta-barrel structure with two starch binding sites. These two sites are thought to differ functionally with site 1 acting as the initial starch recognition site and site 2 involved in the specific recognition of appropriate regions of starch. Probab=22.33 E-value=45 Score=12.67 Aligned_cols=62 Identities=21% Similarity=0.332 Sum_probs=40.8 Q ss_pred CCCEEEEECCCCCCCCCCCCEEEEEECCCCCEEEEHHHCCCCCCEEEEEECCCCCCCHHHHHHCCCCCCCC--CCEEECC Q ss_conf 58315785010022333230279997788844556344278984599982431111013432022113865--3111000 Q gi|254781051|r 620 EGYKVRLSGQDCERGTFSHRHAILCDQETGKRYFPLGNISKDQGHCEVVNSFLSEQAVLGFEYGYSLNNLN--ALTIWEA 697 (957) Q Consensus 620 eG~~VRlsGQD~~RGTFshRHavl~dq~t~~~y~pL~~l~~~q~~f~v~nS~LSE~avlgFEyGys~~~p~--~L~iWEA 697 (957) -|-.|.++|-..+=|--+-..|+--+...+..|. +.+++- +-.-|||=|-+.+++ .-++||. T Consensus 12 ~Gq~v~vvG~~~~LG~W~~~~ai~L~~~~~~~W~---------~~v~lp-------~~~~~EYKyvi~~~d~~~~v~WE~ 75 (100) T cd05817 12 FGEAVYISGNCNQLGNWNPSKAKRMQWNEGDLWT---------VDVGIP-------ESVYIEYKYFVSNYDDPNTVLWES 75 (100) T ss_pred CCCEEEEEECCHHHCCCCHHHCCCCCCCCCCCEE---------EEEEEC-------CCCCEEEEEEEEECCCCCEEEECC T ss_conf 9988999979388689888887216566899699---------999967-------999699999997348897689676 No 304 >pfam09011 DUF1898 Domain of unknown function (DUF1898). This domain is predominantly found in Maelstrom homolog proteins. It has no known function. Probab=22.31 E-value=32 Score=13.79 Aligned_cols=22 Identities=27% Similarity=0.433 Sum_probs=9.7 Q ss_pred CCCCCCCCCCHHHHHHHCCCCC Q ss_conf 8887656440578875214378 Q gi|254781051|r 737 GQGPEHSSARLERFLQMCAENN 758 (957) Q Consensus 737 GqGPEHSSaRlERfLQl~a~~N 758 (957) +++|--|--|.-||+|||+.-| T Consensus 81 ~~a~v~~~~~~~~~~ql~~~~~ 102 (164) T pfam09011 81 KNAPKFSDFRKPGFFQLNDQRN 102 (164) T ss_pred CCCCCCCCCCCCCCEEECCCCC T ss_conf 6776433322211100136677 No 305 >TIGR01950 SoxR redox-sensitive transcriptional activator SoxR; InterPro: IPR010211 SoxR is a MerR-family homodimeric transcription factor with a 2Fe-2S cluster in each monomer. The motif CIGCGCxxxxxC is conserved. Oxidation of the iron-sulphur cluster activates SoxR. The physiological role in Escherichia coli is response to oxidative stress. It is activated by superoxide, singlet oxygen, nitric oxide (NO), and hydrogen peroxide. In Escherichia coli, SoxR increases expression of transcription factor SoxS; different downstream targets may exist in other species.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0005506 iron ion binding, 0006118 electron transport, 0006355 regulation of transcription DNA-dependent, 0006979 response to oxidative stress. Probab=22.30 E-value=35 Score=13.53 Aligned_cols=28 Identities=11% Similarity=0.260 Sum_probs=23.7 Q ss_pred CCEEEEEEECCCCCHHHHHHHHHHHCCC Q ss_conf 8689998401486828999999853799 Q gi|254781051|r 861 ADIYLLRIEQLYPFPEDYLIKVLSRFVQ 888 (957) Q Consensus 861 ~~vaivRiEqLyPfP~~~l~~~l~~y~~ 888 (957) .-||+||+=|--=.|-++|.+.|+..|. T Consensus 45 RRVa~Ik~AQ~vGipL~~I~~ALa~LP~ 72 (142) T TIGR01950 45 RRVAVIKVAQRVGIPLAEIAEALAELPE 72 (142) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHCCC T ss_conf 1567433233258988899999973468 No 306 >KOG4166 consensus Probab=22.30 E-value=45 Score=12.67 Aligned_cols=12 Identities=17% Similarity=0.673 Sum_probs=5.6 Q ss_pred CCCCHHHHHHHH Q ss_conf 256465899999 Q gi|254781051|r 239 GADGSEVIIPAI 250 (957) Q Consensus 239 sLEG~EslIP~L 250 (957) |.-|||-+.-|| T Consensus 90 g~tGg~If~emm 101 (675) T KOG4166 90 GRTGGDIFVEMM 101 (675) T ss_pred CCCHHHHHHHHH T ss_conf 774268999999 No 307 >cd02742 GH20_hexosaminidase Beta-N-acetylhexosaminidases of glycosyl hydrolase family 20 (GH20) catalyze the removal of beta-1,4-linked N-acetyl-D-hexosamine residues from the non-reducing ends of N-acetyl-beta-D-hexosaminides including N-acetylglucosides and N-acetylgalactosides. These enzymes are broadly distributed in microorganisms, plants and animals, and play roles in various key physiological and pathological processes. These processes include cell structural integrity, energy storage, cellular signaling, fertilization, pathogen defense, viral penetration, the development of carcinomas, inflammatory events and lysosomal storage disorders. The GH20 enzymes include the eukaryotic beta-N-acetylhexosaminidases A and B, the bacterial chitobiases, dispersin B, and lacto-N-biosidase. The GH20 hexosaminidases are thought to act via a catalytic mechanism in which the catalytic nucleophile is not provided by the solvent or the enzyme, but by the substrate itself. Probab=22.17 E-value=45 Score=12.65 Aligned_cols=18 Identities=22% Similarity=0.495 Sum_probs=12.6 Q ss_pred HHHHHHCCCEEEECCCCH Q ss_conf 011220340587426788 Q gi|254781051|r 430 DISKSIGIPIFHVNGDDP 447 (957) Q Consensus 430 DiAK~i~aPI~HVN~Ddp 447 (957) +++.++..|.||+-||.. T Consensus 142 e~~~lf~~~~~HiGgDE~ 159 (303) T cd02742 142 EIAELFPDRYLHIGGDEA 159 (303) T ss_pred HHHHHCCCCEEEECCCCC T ss_conf 999867998599667578 No 308 >pfam09820 AAA-ATPase_like Predicted AAA-ATPase. This family contains many hypothetical bacterial proteins. This family was previously the N-terminal part of the Pfam DUF1703 (pfam08011) family before it was split into two. This region is predicted to be an AAA-ATPase domain. Probab=22.05 E-value=35 Score=13.47 Aligned_cols=11 Identities=18% Similarity=0.374 Sum_probs=4.1 Q ss_pred CCCCEECCCCH Q ss_conf 33622225646 Q gi|254781051|r 233 KGAKRFGADGS 243 (957) Q Consensus 233 ~g~KRFsLEG~ 243 (957) .|.=||+.+|. T Consensus 188 TGI~~i~k~si 198 (279) T pfam09820 188 TGVSKFSKVSI 198 (279) T ss_pred EEEEECCCCCC T ss_conf 41100211344 No 309 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=22.00 E-value=41 Score=13.01 Aligned_cols=19 Identities=37% Similarity=0.676 Sum_probs=7.8 Q ss_pred HHHHHHHCCCCCEEEEEEE Q ss_conf 9999997699868999840 Q gi|254781051|r 851 LLDNRDMRNIADIYLLRIE 869 (957) Q Consensus 851 L~~~r~~~~~~~vaivRiE 869 (957) |.+-|+++.+.||..|.|+ T Consensus 64 Lf~D~RA~~VGDilTV~i~ 82 (230) T PRK12700 64 LFEDRRPRNVGDIVTIVLE 82 (230) T ss_pred HHCCCCCCCCCCEEEEEEE T ss_conf 0066644789986999998 No 310 >PRK13738 conjugal transfer pilus assembly protein TraW; Provisional Probab=21.99 E-value=46 Score=12.62 Aligned_cols=21 Identities=14% Similarity=0.368 Sum_probs=14.2 Q ss_pred EEEECCCCHHHHHHHHHHHCC Q ss_conf 777248998999999998414 Q gi|254781051|r 179 VEFMHIVDSIERDWVRNTIED 199 (957) Q Consensus 179 vEymHI~d~eer~Wlq~riE~ 199 (957) .=|-.=.|+++..|...+... T Consensus 124 LvF~DgdD~~Qv~Wa~~~~~~ 144 (210) T PRK13738 124 LYFINGDDPAQVAWMKRQTPP 144 (210) T ss_pred EEEEECCCHHHHHHHHHHCCC T ss_conf 899959899999999974005 No 311 >TIGR00821 EII-GUT PTS system, glucitol/sorbitol-specific, IIC component; InterPro: IPR004699 Bacterial PTS transporters transport and concomitantly phosphorylate their sugar substrates, and typically consist of multiple subunits or protein domains.The Man family is unique in several respects among PTS permease families. It is the only PTS family in which members possess a IID protein. It is the only PTS family in which the IIB constituent is phosphorylated on a histidyl rather than a cysteyl residue. Its permease members exhibit broad specificity for a range of sugars, rather than being specific for just one or a few sugars. The Gut family consists only of glucitol-specific transporters, but these occur both in Gram-negative and Gram-positive bacteria. Escherichia coli consists of IIA protein, a IIC protein and a IIBC protein. This family is specific for the IIC component.; GO: 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016021 integral to membrane. Probab=21.97 E-value=27 Score=14.41 Aligned_cols=12 Identities=58% Similarity=1.071 Sum_probs=7.6 Q ss_pred CHHHHHHHCCCC Q ss_conf 057887521437 Q gi|254781051|r 746 RLERFLQMCAEN 757 (957) Q Consensus 746 RlERfLQl~a~~ 757 (957) |+|||=|-||.| T Consensus 52 ~~~rfA~~~~~N 63 (181) T TIGR00821 52 RIERFAKKCAGN 63 (181) T ss_pred HHHHHHHHHCCC T ss_conf 799998864034 No 312 >pfam11432 DUF3197 Protein of unknown function (DUF3197). This bacterial family of proteins has no known function. Probab=21.83 E-value=46 Score=12.60 Aligned_cols=51 Identities=14% Similarity=0.292 Sum_probs=36.3 Q ss_pred CEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHC Q ss_conf 22225646589999999999987379107998067655268999885379899999855 Q gi|254781051|r 236 KRFGADGSEVIIPAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFK 294 (957) Q Consensus 236 KRFsLEG~EslIP~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~ 294 (957) -|||.||.++|.-+..-+++++.. ..|.|++-.- .+-+|.-|+++...-.. T Consensus 52 PafGp~G~~ALaELv~w~~~~G~r--~~E~V~~p~e------f~~ll~~p~~e~~~~l~ 102 (113) T pfam11432 52 PAYGPGGAEALAELVRWALAQGAR--FYETVLSPGD------FTRVLAEPDAEEVKRLI 102 (113) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCC--CEEEECCHHH------HHHHHCCCCHHHHHHHH T ss_conf 675852789999999999975997--0122138688------89998189589999998 No 313 >TIGR01063 gyrA DNA gyrase, A subunit; InterPro: IPR005743 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type II topoisomerases are ATP-dependent enzymes, and can be subdivided according to their structure and reaction mechanisms: type IIA (topoisomerase II or gyrase, and topoisomerase IV) and type IIB (topoisomerase VI). These enzymes are responsible for relaxing supercoiled DNA as well as for introducing both negative and positive supercoils . Topoisomerase II (called gyrase in bacteria) primarily introduces negative supercoils into DNA. In bacteria, topoisomerase II consists of two polypeptide subunits, gyrA and gyrB, which form a heterotetramer: (BA)2. In most eukaryotes, topoisomerase II consists of a single polypeptide, where the N- and C-terminal regions correspond to gyrB and gyrA, respectively. This entry represents the A subunit (gyrA) as found predominantly in bacteria, but does not include the topoisomerase II enzymes composed of a single polypeptide, as are found in most eukaryotes. GyrA has two functional domains: an N-terminal that forms the covalent DNA-protein bridge that is responsible for the breaking- and rejoining function, and a C-terminal that can bind DNA non-specifically . More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003918 DNA topoisomerase (ATP-hydrolyzing) activity, 0005524 ATP binding, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=21.70 E-value=46 Score=12.58 Aligned_cols=106 Identities=14% Similarity=0.298 Sum_probs=62.7 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCCHHHCCCCC-----CCCC----CCCCCCCCCHHHCCCCHHHCCCEEECCCCC-- Q ss_conf 55563367899999999999998817045401677-----7777----667877678766699978817157256657-- Q gi|254781051|r 87 CSLGETQSVKDFFQVMKMIDAYRSYGHFKANIDPL-----GYNS----HQKDLSELSPAHYGFVKADYDRKICMKGVL-- 155 (957) Q Consensus 87 ~~~~~~~~~~kq~rV~~LI~AYR~~GHl~A~lDPL-----gl~~----~~~~~peL~~~~yGLse~DLd~~f~~~~~~-- 155 (957) .....+..+. ++..||++=+.-| .|++.=- |+|- .+-..|+.=+-.. ++..+|+..|.+..+. T Consensus 260 YqVNKa~Lie---kIAeLv~ek~ieG--IsdiRDESSDr~GiRiVIE~KRD~~a~VvLN~L-Yk~T~lQ~~Fg~NmLALv 333 (864) T TIGR01063 260 YQVNKARLIE---KIAELVREKKIEG--ISDIRDESSDREGIRIVIELKRDAVAEVVLNNL-YKQTQLQVSFGINMLALV 333 (864) T ss_pred CCCHHHHHHH---HHHHHHHCCCCCC--CCCCCCCCCCCCCCEEEEEEECCCCCCHHHHHH-HCCCCCCEECCEEEEEEE T ss_conf 8520268889---9998740000003--212442566877437999974289830154212-112645001481232110 Q ss_pred -CC-CCCCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCC---HHHHHHHHHHHHHH Q ss_conf -87-77789999999998406430277724899899999999841457788879---88999999999999 Q gi|254781051|r 156 -GL-ESATIPEIVDVLSHLYCSNIGVEFMHIVDSIERDWVRNTIEDYDFSSNFS---KEEQRDILDKLVRA 221 (957) Q Consensus 156 -g~-~~~tL~eIi~~L~~tYCgsIGvEymHI~d~eer~Wlq~riE~~~~~~~~s---~eeK~~iL~~L~~A 221 (957) |. ..++|+++|.. |+..|.|-...+..|. .++|-|||+-|+.| T Consensus 334 ~G~Pk~lnLk~~l~~-----------------------f~~HR~~Vi~RRT~feLrKA~eRaHiL~GL~~A 381 (864) T TIGR01063 334 DGLPKVLNLKELLEA-----------------------FVEHRKDVITRRTIFELRKAEERAHILEGLLIA 381 (864) T ss_pred CCCCCEECHHHHHHH-----------------------HHHCCEEEEEEEHHCCHHHHHHHHHHHHHHHHH T ss_conf 789750178899999-----------------------984231335521000102265578999999999 No 314 >PRK13389 UTP--glucose-1-phosphate uridylyltransferase subunit GalU; Provisional Probab=21.68 E-value=46 Score=12.58 Aligned_cols=25 Identities=20% Similarity=0.482 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 9999999998737910799806765 Q gi|254781051|r 248 PAIEEIIRQGVQQGVDEMILGMAHR 272 (957) Q Consensus 248 P~L~~ii~~a~~~Gv~eiviGMaHR 272 (957) |+++.+++.+.+.||++|+|=-.++ T Consensus 40 Pii~~~vee~~~aGI~ei~iV~~~~ 64 (302) T PRK13389 40 PLIQYVVNECIAAGITEIVLVTHSS 64 (302) T ss_pred EHHHHHHHHHHHCCCCEEEEEECCC T ss_conf 7599999999986997899992798 No 315 >pfam11810 DUF3332 Domain of unknown function (DUF3332). This family of proteins are functionally uncharacterized. This family is only found in bacteria. Probab=21.64 E-value=41 Score=13.03 Aligned_cols=31 Identities=26% Similarity=0.414 Sum_probs=13.2 Q ss_pred CCCCCCCEEEEEECCCEECCC--CCHHCCCCCC Q ss_conf 774436579998416020124--7010246778 Q gi|254781051|r 397 SGYTVAGNIHLIINNQIGFTT--NPSSARSCTY 427 (957) Q Consensus 397 ~gY~tGGTiHiVvNNQIGFTT--~p~d~RSs~Y 427 (957) |-|.+.+..-++|=|-|-|=| ||--...+.+ T Consensus 53 pvYgi~~~~D~~v~NsiEFWtG~Npi~~~pav~ 85 (175) T pfam11810 53 PVYGIAALADLLVFNSIEFWSGSNPITGKPAVA 85 (175) T ss_pred HHHHHHHHHHEEEEEEEEEECCCCCCCCCCCCC T ss_conf 387786551236770068654888778997534 No 316 >cd06252 M14_ASTE_ASPA_like_2 A functionally uncharacterized subgroup of the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily which is part of the M14 family of metallocarboxypeptidases. ASTE catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway, and aspartoacylase (ASPA, also known as aminoacylase 2, and ACY-2; EC:3.5.1.15) cleaves N-acetyl L-aspartic acid (NAA) into aspartate and acetate. NAA is abundant in the brain, and hydrolysis of NAA by ASPA may help maintain white matter. ASPA is an NAA scavenger in other tissues. Mutations in the gene encoding ASPA cause Canavan disease (CD), a fatal progressive neurodegenerative disorder involving dysmyelination and spongiform degeneration of white matter in children. This enzyme binds zinc which is necessary for activity. Measurement of elevated NAA levels in urine is used in the diagnosis of CD. Probab=21.59 E-value=47 Score=12.57 Aligned_cols=90 Identities=20% Similarity=0.252 Sum_probs=54.9 Q ss_pred EEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEE-ECCCEECCCCCHHCCCCC-CCCHHHHHHCCCEEEECCCCHH Q ss_conf 78886257533346725887645106774436579998-416020124701024677-8400112203405874267889 Q gi|254781051|r 371 LPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLI-INNQIGFTTNPSSARSCT-YASDISKSIGIPIFHVNGDDPE 448 (957) Q Consensus 371 lpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiV-vNNQIGFTT~p~d~RSs~-Y~TDiAK~i~aPI~HVN~DdpE 448 (957) +-=-||||- +.|+-|+++.+. .|..-..-|||=|| +=|-.||-... |.++ --+|+-+++- | |++ T Consensus 39 i~agvHGdE-~~G~~v~~~L~~--~l~~~~l~G~viivP~~N~~g~~~~~---R~~P~dg~nlNR~FP-------G-~~~ 104 (316) T cd06252 39 LTGGNHGDE-YEGQIALLRLAR--RLDPEEVRGRVIILPALNFPAVQAGT---RTSPIDGGNLNRVFP-------G-DPD 104 (316) T ss_pred EECCCCCCH-HHHHHHHHHHHH--HCCHHHCEEEEEEEECCCHHHHHHCC---CCCCCCCCCCCCCCC-------C-CCC T ss_conf 971655532-779999999987--48854682889999148989994122---247889986122489-------9-989 Q ss_pred HHHHHHHHHHHHHHHH--CCCEEEEEEEC Q ss_conf 9999999999999973--89739986421 Q gi|254781051|r 449 AVIRVVRMAVSFRMKF--HKSVVIDIVCY 475 (957) Q Consensus 449 av~~~~~lA~~yR~~F--~kDVvIDlv~Y 475 (957) . ...-++|..+.+++ +.|++|||=|- T Consensus 105 G-s~teriA~~l~~~l~~~aD~~iDLHsg 132 (316) T cd06252 105 G-TVTEMIAHYLTTELLPRADYVIDLHSG 132 (316) T ss_pred C-CHHHHHHHHHHHHHHHHCCEEEEECCC T ss_conf 9-989999999998557529899990568 No 317 >pfam03881 Fructosamin_kin Fructosamine kinase. This family includes eukaryotic fructosamine-3-kinase enzymes. The family also includes bacterial members that have not been characterized but probably have a similar or identical function. Probab=21.58 E-value=33 Score=13.69 Aligned_cols=15 Identities=47% Similarity=1.059 Sum_probs=7.1 Q ss_pred EEEEEEECCCCCCCCCC Q ss_conf 39986421125656435 Q gi|254781051|r 468 VVIDIVCYRRFGHNEGD 484 (957) Q Consensus 468 VvIDlv~YRr~GHNE~D 484 (957) ++||=-|| |||+|.| T Consensus 208 ~liDPA~y--yG~rE~D 222 (287) T pfam03881 208 VIFDPACY--YGDRECD 222 (287) T ss_pred EEECCHHC--CCCCHHH T ss_conf 89754311--5770776 No 318 >PRK12451 arginyl-tRNA synthetase; Reviewed Probab=21.44 E-value=47 Score=12.54 Aligned_cols=38 Identities=18% Similarity=0.169 Sum_probs=24.7 Q ss_pred CHHCCCCCCCCHHHHHHCCCEEEECCCCHHHHHHHHHHHHHHH-HHHCCCEEEEEEECCCCCC Q ss_conf 0102467784001122034058742678899999999999999-9738973998642112565 Q gi|254781051|r 419 PSSARSCTYASDISKSIGIPIFHVNGDDPEAVIRVVRMAVSFR-MKFHKSVVIDIVCYRRFGH 480 (957) Q Consensus 419 p~d~RSs~Y~TDiAK~i~aPI~HVN~DdpEav~~~~~lA~~yR-~~F~kDVvIDlv~YRr~GH 480 (957) ..|+.++-.++||| +..|| ++|+-|-+|+++|=+-+|| T Consensus 300 ksDGt~tY~t~DiA------------------------~~~~k~~~~~~d~~I~V~gadh~~~ 338 (562) T PRK12451 300 KSDGATIYATRDLT------------------------AALYRQNTFGFDKALYVVGPEQSLH 338 (562) T ss_pred CCCCCCEEECCHHH------------------------HHHHHHHHCCCCEEEEEECCCHHHH T ss_conf 18997213166478------------------------9999987309975899956747679 No 319 >PRK12709 flgJ peptidoglycan hydrolase; Provisional Probab=21.38 E-value=40 Score=13.10 Aligned_cols=31 Identities=10% Similarity=0.179 Sum_probs=14.6 Q ss_pred CCCHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 9623889998347636789689999999987 Q gi|254781051|r 1 MLQDLNKMFSLSSFLDGTNCCYIEDLYKSYQ 31 (957) Q Consensus 1 M~~~~~~~f~~sSFL~g~Na~YIEeLYekYL 31 (957) |+|+|-+...+++.|++....+.++||++=+ T Consensus 51 mlKsMR~a~~~~~l~~s~~~~~y~~M~DqQl 81 (318) T PRK12709 51 MLKSMRDATPSDGLFDSHTSKMYTSMLDQQL 81 (318) T ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 9999984676788878706789999999999 No 320 >PRK12288 ribosome-associated GTPase; Reviewed Probab=21.37 E-value=45 Score=12.69 Aligned_cols=75 Identities=13% Similarity=0.230 Sum_probs=45.8 Q ss_pred CCCCCCCHHHHHHHCCCCCEEEEECCCHHHHH----HHHHHHHCCCCCC---CEEEECCCHHHCCCCCCCCHHHHCCCCC Q ss_conf 76564405788752143782697248899999----9999974353488---6898566022108012688789379986 Q gi|254781051|r 740 PEHSSARLERFLQMCAENNMYVANCTSPANYF----HILRRQIYDRSSR---PLIMMAPKSLLRHKRVVSSLSDMTCGSV 812 (957) Q Consensus 740 PEHSSaRlERfLQl~a~~N~~V~~~ttpAq~F----H~LRRQ~~r~~rk---PLiv~tPKsLLR~~~a~S~l~d~~~g~~ 812 (957) |+-|+.+|+|||-+|...++..+.+=|-+-.. -.--++.+..|++ |.+..+-|+ ......|.++..|. T Consensus 134 P~~~~~liDRyLv~ae~~~I~pvIvlNK~DL~~~~~~~~~~~~l~~Y~~lGY~v~~~Sa~~----~~gl~~L~~~L~~k- 208 (344) T PRK12288 134 PELSLNIIDRYLVACETLGIEPLIVLNKIDLLDDEERPFVNEQLDIYRNIGYRVLMVSSHT----GEGLEPLEAALTGR- 208 (344) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEEEECHHCCCHHHHHHHHHHHHHHHHCCCEEEEEECCC----CCCHHHHHHHHCCC- T ss_conf 9989789999999999869977999731440897789999999999986797399973688----62899999987678- Q ss_pred EEECCCCC Q ss_conf 03515776 Q gi|254781051|r 813 FQAVLSDD 820 (957) Q Consensus 813 F~~vi~d~ 820 (957) ..|+--+ T Consensus 209 -tsvf~Gq 215 (344) T PRK12288 209 -ISIFVGQ 215 (344) T ss_pred -EEEEEEC T ss_conf -5999806 No 321 >pfam04908 SH3BGR SH3-binding, glutamic acid-rich protein. Probab=21.35 E-value=47 Score=12.53 Aligned_cols=63 Identities=21% Similarity=0.373 Sum_probs=40.2 Q ss_pred HHHHHHHHHHCCCEEEEEECCC-CHHHHHHHHHHHCCC-CCCCCCCHHHH----HHHHHHHHHHHHHHHHHH Q ss_conf 9999999840643027772489-989999999984145-77888798899----999999999998999998 Q gi|254781051|r 164 EIVDVLSHLYCSNIGVEFMHIV-DSIERDWVRNTIEDY-DFSSNFSKEEQ----RDILDKLVRAEGFEKFID 229 (957) Q Consensus 164 eIi~~L~~tYCgsIGvEymHI~-d~eer~Wlq~riE~~-~~~~~~s~eeK----~~iL~~L~~Ae~FE~FL~ 229 (957) +|+..|+. -.|-+|.--|. +.+.|.|.|++.... ...|++=.+++ -.-+...++.+.++.||. T Consensus 22 ~v~~iL~s---~kI~ye~vDIt~~ee~r~~Mr~~~~~~~~lpPQIFn~d~YcGdye~F~~A~E~~~l~~FLk 90 (92) T pfam04908 22 RVLMILDA---NKIPFDEVDITKDEEQRRWMRENPPNGAPLPPQIFNEDQYCGDYDAFFEAVEANTLYEFLG 90 (92) T ss_pred HHHHHHHH---CCCCCEEEECCCCHHHHHHHHHHCCCCCCCCCEEECCCEECCCHHHHHHHHHHHHHHHHHC T ss_conf 99999998---5997389855589899999998469999889907219863157999998851447999858 No 322 >TIGR01719 euk_UDPppase uridine phosphorylase; InterPro: IPR010059 This entry represents a clade of mainly eukaryotic uridine phosphorylases. Genes from human and mouse have been characterised. This enzyme is a member of the PHP/UDP subfamily and is closely related to the bacterial uridine (IPR010058 from INTERPRO) and inosine (IPR004402 from INTERPRO) phosphorylase equivalogs. In addition to the eukaryotes, a gene from Mycobacterium leprae is included in this equivalog and may have resulted from lateral gene transfer.; GO: 0004850 uridine phosphorylase activity, 0009166 nucleotide catabolic process, 0005737 cytoplasm. Probab=21.32 E-value=27 Score=14.41 Aligned_cols=22 Identities=27% Similarity=0.449 Sum_probs=15.1 Q ss_pred HHHHHH--HHHHHCCCHHHCCCCC Q ss_conf 999999--9998817045401677 Q gi|254781051|r 100 QVMKMI--DAYRSYGHFKANIDPL 121 (957) Q Consensus 100 rV~~LI--~AYR~~GHl~A~lDPL 121 (957) -...|| --|=.||-|.|-=||. T Consensus 102 ~L~ELiKL~~~A~rGDLea~~~p~ 125 (301) T TIGR01719 102 MLHELIKLLYYAKRGDLEACKDPV 125 (301) T ss_pred HHHHHHHHHHHHCCCCHHHHCCCE T ss_conf 999999999983777501206856 No 323 >TIGR02735 purC_vibrio phosphoribosylaminoimidazole-succinocarboxamide synthase; InterPro: IPR014106 Members of this protein family appear to represent a novel form of phosphoribosylaminoimidazole-succinocarboxamide synthase (SAICAR synthetase), significantly different in sequence from (IPR001636 from INTERPRO), which is found in a broad range of bacteria and eukaryotes. Members of this family are found within the gammaproteobacteria in the genera Vibrio, Shewanella, and Colwellia, and also (reported as a fragment) in the primitive eukaryote Guillardia theta (Cryptomonas phi).. Probab=21.05 E-value=29 Score=14.13 Aligned_cols=23 Identities=13% Similarity=0.384 Sum_probs=12.8 Q ss_pred CCCCHHHHHHHHHCCCCCCCCCC Q ss_conf 50798999998736764333333 Q gi|254781051|r 35 SSVCKDWYPLFSFLDENSEEYDN 57 (957) Q Consensus 35 ~SVd~sWr~fF~~l~~~~~~~~~ 57 (957) |.|...|=..|..-+...+-+-. T Consensus 87 NAiS~~WF~~F~~~gLA~SHIL~ 109 (365) T TIGR02735 87 NAISNHWFKLFKEAGLAESHILD 109 (365) T ss_pred HHHHHHHHHHHHHCCCCHHHHCC T ss_conf 88889999998745740010015 No 324 >KOG1278 consensus Probab=20.93 E-value=32 Score=13.77 Aligned_cols=10 Identities=40% Similarity=0.976 Sum_probs=7.9 Q ss_pred HHHHHHCCCC Q ss_conf 7887521437 Q gi|254781051|r 748 ERFLQMCAEN 757 (957) Q Consensus 748 ERfLQl~a~~ 757 (957) --|.|||+|| T Consensus 541 l~Yf~LC~Ed 550 (628) T KOG1278 541 LTYFQLCAED 550 (628) T ss_pred HHHHHHHHCC T ss_conf 9999986322 No 325 >PRK12308 bifunctional argininosuccinate lyase/N-acetylglutamate synthase; Provisional Probab=20.83 E-value=48 Score=12.46 Aligned_cols=45 Identities=27% Similarity=0.285 Sum_probs=25.8 Q ss_pred CCCEEEEEEEECCCCCCCCCHHHHHHHHHCCCCCCCCCEEEEEECCCEECCCCCHHCCCCC Q ss_conf 4540478886257533346725887645106774436579998416020124701024677 Q gi|254781051|r 366 ERSRVLPLIIHGDAAFAGQGIVSETFGLSGLSGYTVAGNIHLIINNQIGFTTNPSSARSCT 426 (957) Q Consensus 366 ~~~~vlpiliHGDAAfaGQGVV~Etl~ls~l~gY~tGGTiHiVvNNQIGFTT~p~d~RSs~ 426 (957) .+-.+.|+ |-||++|-..--.--.++++=||. +-|.|+.|+-|++ T Consensus 188 ~r~n~~PL---GsgAlAGT~~pIDR~~tA~~LGF~-------------~~~~NSlDAVSdR 232 (614) T PRK12308 188 TRLDTCPL---GSGALAGTAYPIDREALAHNLGFR-------------RATRNSLDSVSDR 232 (614) T ss_pred HHHCCCCC---CCCCCCCCCCCCCHHHHHHHCCCC-------------CCCCCHHHHHHHH T ss_conf 98575975---431346898987999999975999-------------8667888898725 No 326 >PRK00192 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=20.78 E-value=48 Score=12.45 Aligned_cols=11 Identities=18% Similarity=0.199 Sum_probs=5.9 Q ss_pred HHHHHHHCCCH Q ss_conf 99999881704 Q gi|254781051|r 104 MIDAYRSYGHF 114 (957) Q Consensus 104 LI~AYR~~GHl 114 (957) -|+.-|..||. T Consensus 30 ai~~l~~~Gi~ 40 (275) T PRK00192 30 TLKALKEKGIP 40 (275) T ss_pred HHHHHHHCCCE T ss_conf 99999978999 No 327 >pfam03992 ABM Antibiotic biosynthesis monooxygenase. This domain is found in monooxygenases involved in the biosynthesis of several antibiotics by Streptomyces species. It's occurrence as a repeat in Streptomyces coelicolor SCO1909 is suggestive that the other proteins function as multimers. There is also a conserved histidine which is likely to be an active site residue. Probab=20.31 E-value=34 Score=13.60 Aligned_cols=28 Identities=29% Similarity=0.373 Sum_probs=19.6 Q ss_pred ECCCCCCCHHHHHHCCCCCCCCCCEEEC Q ss_conf 2431111013432022113865311100 Q gi|254781051|r 669 NSFLSEQAVLGFEYGYSLNNLNALTIWE 696 (957) Q Consensus 669 nS~LSE~avlgFEyGys~~~p~~L~iWE 696 (957) ....+|-|+++|++--+..+|+..+++| T Consensus 27 ~~~~~epG~l~~~l~~~~~~~~~~~~~e 54 (77) T pfam03992 27 EASRAEPGCLGYELLRSLEDPDEYVILE 54 (77) T ss_pred HHHHCCCCCEEEEEECCCCCCCEEEEEE T ss_conf 9995799945898872799997799999 No 328 >TIGR00520 asnASE_II L-asparaginases, type II; InterPro: IPR004550 L-asparaginase catalyses the conversion of L-asparagine to L-aspartate. Two related families of asparaginase (L-asparagine amidohydrolase, 3.5.1.1 from EC) are designated type I and type II according to the terminology in Escherichia coli, which has both: L-asparaginase I is a low-affinity enzyme found in the cytoplasm, while L-asparaginase II is a high-affinity periplasmic enzyme synthesized with a cleavable signal sequence. This family describes L-asparaginases related to type II of E. coli. Both the cytoplasmic and the cell wall asparaginases of Saccharomyces cerevisiae belong to this set. Members of this set from Acinetobacter glutaminasificans and Pseudomonas fluorescens are described as having both glutaminase and asparaginase activities , . All members are homotetrameric. ; GO: 0004067 asparaginase activity, 0006528 asparagine metabolic process. Probab=20.28 E-value=31 Score=13.92 Aligned_cols=15 Identities=20% Similarity=0.104 Sum_probs=6.3 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 789999999999999 Q gi|254781051|r 94 SVKDFFQVMKMIDAY 108 (957) Q Consensus 94 ~~~kq~rV~~LI~AY 108 (957) +...++-|..||+|- T Consensus 49 Y~vG~lgv~~Li~Av 63 (360) T TIGR00520 49 YKVGELGVEDLIEAV 63 (360) T ss_pred CEEEEEEHHHHHHHH T ss_conf 132013076787740 No 329 >pfam01521 Fe-S_biosyn Iron-sulphur cluster biosynthesis. This family is involved in iron-sulphur cluster biosynthesis. Its members include proteins that are involved in nitrogen fixation such as the HesB and HesB-like proteins. Probab=20.27 E-value=24 Score=14.81 Aligned_cols=33 Identities=33% Similarity=0.619 Sum_probs=19.2 Q ss_pred CCCCHHHHHHCCCCCCCCCCEEECCCCCCC---CCCCHHHHHH Q ss_conf 111013432022113865311100002442---4541022565 Q gi|254781051|r 673 SEQAVLGFEYGYSLNNLNALTIWEAQFGDF---ANGAQVILDQ 712 (957) Q Consensus 673 SE~avlgFEyGys~~~p~~L~iWEAQFGDF---~NgAQviiDq 712 (957) ..-|+-||+|+.++++ |++-+|. .||..|+||. T Consensus 29 ~~gGC~G~~Y~l~~~~-------~~~~~D~~~~~~g~~v~Id~ 64 (91) T pfam01521 29 RYGGCSGFSYGLTFED-------EAGEGDEVFEQDGVTVVVDE 64 (91) T ss_pred ECCCCCCCEEEEEECC-------CCCCCCEEEEECCCEEEEEH T ss_conf 6699798188789853-------57778599998992999916 No 330 >PRK12702 mannosyl-3-phosphoglycerate phosphatase; Reviewed Probab=20.25 E-value=50 Score=12.37 Aligned_cols=23 Identities=17% Similarity=0.224 Sum_probs=16.9 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCC Q ss_conf 57877778999999999840643 Q gi|254781051|r 154 VLGLESATIPEIVDVLSHLYCSN 176 (957) Q Consensus 154 ~~g~~~~tL~eIi~~L~~tYCgs 176 (957) .+|.....++.+++.|++-|-.. T Consensus 95 ~lg~~~~~i~~~L~~l~~~~~~~ 117 (302) T PRK12702 95 ALGLPYPCLRHILQQVRQDSHLD 117 (302) T ss_pred ECCCCHHHHHHHHHHHHHHHCCC T ss_conf 25887999999999998975767 No 331 >PRK07333 2-octaprenyl-6-methoxyphenyl hydroxylase; Provisional Probab=20.20 E-value=45 Score=12.67 Aligned_cols=43 Identities=12% Similarity=0.133 Sum_probs=17.4 Q ss_pred CCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 89999999988743689986205677888998998733435642114789 Q gi|254781051|r 562 VSKEILKKIGSSIIRLPKSFNTHKIVERLMANRQKMIETGKGIDWSMAES 611 (957) Q Consensus 562 v~~~~L~~i~~~l~~~Pe~f~~h~~v~k~~~~R~~~~~~~~~iDW~~AE~ 611 (957) .....++.++-.+. +-.|-++|.+-++. |.-.++-=-|..+|+ T Consensus 360 ~~~~~~R~~gl~~~------~~~~~lK~~~~~~a-~Gl~~~~p~l~~~~~ 402 (403) T PRK07333 360 TLLRSVRDIGLGLV------DRLPPLKSFFIRQA-AGLTGDTPRLLKGEA 402 (403) T ss_pred HHHHHHHHHHHHHH------HHCHHHHHHHHHHH-CCCCCCCCHHHCCCC T ss_conf 79999999999998------52999999999996-588999864645898 No 332 >pfam06950 DUF1293 Protein of unknown function (DUF1293). This family consists of several bacterial and phage proteins of around 115 residues in length. The function of this family is unknown. Probab=20.14 E-value=45 Score=12.66 Aligned_cols=23 Identities=35% Similarity=0.589 Sum_probs=13.4 Q ss_pred CCCCCHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 22355788888671899899999998644898 Q gi|254781051|r 486 PSFTQPMMYKTIRSHKSVLQLYADSLMKNQVI 517 (957) Q Consensus 486 P~fTQP~mY~~I~~hp~v~~~Y~~~Li~~g~i 517 (957) |..-||+|-. ..|+.+|++.|.+ T Consensus 51 pq~dqpL~Id---------~~YA~kLe~tgA~ 73 (115) T pfam06950 51 PQWDQPLMID---------HTYALKLEKTGAF 73 (115) T ss_pred CCCCCCEEEC---------HHHHHHHHHHCCC T ss_conf 5568855856---------8899888860375 No 333 >PRK11181 23S rRNA (guanosine-2'-O-)-methyltransferase; Provisional Probab=20.13 E-value=50 Score=12.35 Aligned_cols=21 Identities=19% Similarity=0.324 Sum_probs=12.3 Q ss_pred HHHHHHHHHHCCCCEEEEECC Q ss_conf 999999987379107998067 Q gi|254781051|r 250 IEEIIRQGVQQGVDEMILGMA 270 (957) Q Consensus 250 L~~ii~~a~~~Gv~eiviGMa 270 (957) +-+||+.|...|++-|++.-- T Consensus 109 lGaIiRsA~afGv~~vil~~~ 129 (244) T PRK11181 109 LGACLRSADAAGVHAVIVPKD 129 (244) T ss_pred HHHHHHHHHHCCCCEEEECCC T ss_conf 999999999819989997899 No 334 >COG1208 GCD1 Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis] Probab=20.12 E-value=50 Score=12.35 Aligned_cols=36 Identities=28% Similarity=0.351 Sum_probs=28.1 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCHHHHHHHHCCC Q ss_conf 9999999998737910799806765526899988537989999985588 Q gi|254781051|r 248 PAIEEIIRQGVQQGVDEMILGMAHRGRLNVLSQIMNKLPRSIFYEFKGK 296 (957) Q Consensus 248 P~L~~ii~~a~~~Gv~eiviGMaHRGRLNVL~Nvl~Kp~~~iF~EF~g~ 296 (957) |+|+-+++..+..||++|++.+ +...+.|-..|... T Consensus 33 Pii~~~l~~L~~~Gv~eivi~~-------------~y~~~~i~~~~~d~ 68 (358) T COG1208 33 PLIEYVLEALAAAGVEEIVLVV-------------GYLGEQIEEYFGDG 68 (358) T ss_pred EHHHHHHHHHHHCCCCEEEEEC-------------CCCHHHHHHHHHHC T ss_conf 4599999999987997899987-------------74689999998626 No 335 >pfam04329 consensus Probab=20.09 E-value=39 Score=13.12 Aligned_cols=33 Identities=12% Similarity=0.184 Sum_probs=14.5 Q ss_pred CCHHHHHHHCCCCCEEEEECCCHHHHHHHHHHH Q ss_conf 405788752143782697248899999999997 Q gi|254781051|r 745 ARLERFLQMCAENNMYVANCTSPANYFHILRRQ 777 (957) Q Consensus 745 aRlERfLQl~a~~N~~V~~~ttpAq~FH~LRRQ 777 (957) .=|++|+.+|...++.+|.-.....+.-++... T Consensus 57 ~li~~F~~~a~~~g~~~~fy~v~~~~~~~~~~~ 89 (135) T pfam04329 57 ELIERFLELADAHGWRPVFYQVSEEDLPLYHDL 89 (135) T ss_pred HHHHHHHHHHHHCCCEEEEEEECHHHHHHHHHC T ss_conf 999999999998799889999888999999876 Done!