Query gi|254780664|ref|YP_003065077.1| bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 536 No_of_seqs 170 out of 2250 Neff 4.9 Searched_HMMs 39220 Date Sun May 29 20:53:02 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780664.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK00881 purH bifunctional pho 100.0 0 0 1663.0 48.4 511 13-536 1-514 (514) 2 COG0138 PurH AICAR transformyl 100.0 0 0 1602.0 43.0 513 15-536 1-515 (515) 3 KOG2555 consensus 100.0 0 0 1170.7 30.8 503 17-536 5-588 (588) 4 PRK07106 5-aminoimidazole-4-ca 100.0 0 0 924.1 25.7 311 222-536 3-391 (391) 5 pfam01808 AICARFT_IMPCHas AICA 100.0 0 0 906.1 26.4 312 147-462 1-315 (315) 6 TIGR00355 purH bifunctional pu 100.0 0 0 896.0 25.1 520 17-536 1-569 (569) 7 smart00798 AICARFT_IMPCHas AIC 100.0 0 0 894.7 25.0 310 147-462 1-311 (311) 8 cd01421 IMPCH Inosine monophos 100.0 0 0 606.6 20.2 186 17-204 1-187 (187) 9 pfam02142 MGS MGS-like domain. 99.5 2.5E-14 6.4E-19 124.7 6.7 92 28-142 1-92 (92) 10 smart00851 MGS MGS-like domain 99.4 2.7E-13 6.8E-18 117.2 7.1 89 28-142 1-90 (90) 11 cd00532 MGS-like MGS-like doma 99.0 1.5E-09 3.9E-14 89.9 9.2 106 19-163 2-109 (112) 12 cd01424 MGS_CPS_II Methylglyox 98.6 4.2E-08 1.1E-12 79.4 4.9 106 18-167 2-109 (110) 13 cd01423 MGS_CPS_I_III Methylgl 98.6 1.1E-07 2.9E-12 76.3 7.1 112 17-166 1-114 (116) 14 PRK05294 carB carbamoyl phosph 98.4 5.9E-06 1.5E-10 63.9 11.7 66 226-300 526-604 (1063) 15 PRK12815 carB carbamoyl phosph 98.4 7.2E-06 1.8E-10 63.2 11.5 13 287-299 592-604 (1068) 16 TIGR01369 CPSaseII_lrg carbamo 97.6 9E-05 2.3E-09 55.2 4.2 130 12-170 957-1089(1089) 17 pfam06230 DUF1009 Protein of u 97.3 0.0071 1.8E-07 41.5 10.7 113 403-527 83-211 (212) 18 pfam06506 PrpR_N Propionate ca 92.8 0.66 1.7E-05 27.1 7.1 126 19-151 9-141 (169) 19 TIGR00315 cdhB CO dehydrogenas 90.0 0.27 6.8E-06 30.0 2.7 44 26-69 41-90 (170) 20 KOG0370 consensus 88.5 0.51 1.3E-05 27.9 3.3 222 107-339 716-1016(1435) 21 pfam07085 DRTGG DRTGG domain. 85.6 1.6 4.1E-05 24.3 4.5 46 489-534 51-96 (105) 22 cd01422 MGS Methylglyoxal synt 84.8 2.4 6.1E-05 23.1 5.1 106 18-163 2-111 (115) 23 pfam01081 Aldolase KDPG and KH 84.0 1.7 4.4E-05 24.2 4.1 88 261-360 13-100 (196) 24 cd00452 KDPG_aldolase KDPG and 83.8 1.6 4E-05 24.4 3.8 84 265-360 13-96 (190) 25 PRK05718 keto-hydroxyglutarate 83.8 1.7 4.3E-05 24.2 3.9 89 259-359 18-106 (212) 26 PRK08104 consensus 83.1 1.8 4.5E-05 24.1 3.8 93 255-359 14-106 (212) 27 PRK07455 keto-hydroxyglutarate 82.5 1.9 4.9E-05 23.8 3.8 40 321-360 66-105 (210) 28 PRK06015 keto-hydroxyglutarate 82.1 2 5.1E-05 23.7 3.8 89 260-360 19-107 (212) 29 PRK09140 2-dehydro-3-deoxy-6-p 81.0 1.9 4.9E-05 23.8 3.4 82 266-360 20-103 (206) 30 PRK06857 consensus 80.8 2.4 6.2E-05 23.1 3.9 88 260-359 16-103 (209) 31 PRK07114 keto-hydroxyglutarate 80.8 1.9 4.8E-05 23.8 3.3 93 259-360 19-112 (223) 32 PRK08904 consensus 80.6 2.5 6.4E-05 23.0 3.8 93 256-360 10-102 (207) 33 PRK05234 mgsA methylglyoxal sy 80.4 1.6 4.1E-05 24.3 2.8 113 17-169 7-123 (146) 34 COG0117 RibD Pyrimidine deamin 79.8 5.3 0.00013 20.6 8.0 81 416-505 12-101 (146) 35 COG3494 Uncharacterized protei 79.8 5.3 0.00013 20.6 11.8 118 403-527 141-274 (279) 36 PRK08782 consensus 78.2 3.2 8.2E-05 22.1 3.8 92 257-360 18-109 (219) 37 PRK13210 putative L-xylulose 5 77.8 6 0.00015 20.2 7.8 20 98-117 100-119 (284) 38 PRK06552 keto-hydroxyglutarate 76.1 4 0.0001 21.5 3.8 93 259-360 16-108 (209) 39 PRK00489 hisG ATP phosphoribos 74.6 6.2 0.00016 20.1 4.5 83 47-150 17-102 (287) 40 pfam04273 DUF442 Putative phos 73.9 7.5 0.00019 19.5 5.3 17 499-515 57-75 (110) 41 TIGR00107 deoD purine nucleosi 72.3 5.4 0.00014 20.5 3.7 80 414-503 96-177 (234) 42 pfam00591 Glycos_transf_3 Glyc 71.5 5.4 0.00014 20.5 3.5 19 343-361 78-96 (254) 43 PRK00945 acetyl-CoA decarbonyl 71.0 4.2 0.00011 21.3 2.9 35 26-60 48-83 (171) 44 COG2240 PdxK Pyridoxal/pyridox 70.0 6.4 0.00016 20.0 3.6 79 250-339 146-227 (281) 45 COG0800 Eda 2-keto-3-deoxy-6-p 69.7 5.5 0.00014 20.5 3.3 86 263-360 20-105 (211) 46 PRK05693 short chain dehydroge 69.1 9.5 0.00024 18.7 6.4 82 16-122 1-91 (274) 47 cd01286 deoxycytidylate_deamin 68.0 9.9 0.00025 18.6 7.5 74 432-512 23-122 (131) 48 PRK06153 hypothetical protein; 66.3 2.9 7.5E-05 22.4 1.3 19 320-338 248-266 (393) 49 COG0813 DeoD Purine-nucleoside 66.0 11 0.00028 18.3 4.3 154 331-504 24-180 (236) 50 COG0276 HemH Protoheme ferro-l 65.5 11 0.00028 18.3 4.2 19 351-369 105-123 (320) 51 PRK00194 hypothetical protein; 65.2 10 0.00026 18.5 3.9 32 16-47 1-35 (90) 52 PRK06444 prephenate dehydrogen 64.0 9.7 0.00025 18.7 3.6 54 31-84 15-79 (197) 53 cd01987 USP_OKCHK USP domain i 63.8 12 0.0003 18.1 4.0 42 491-533 54-95 (124) 54 pfam01976 DUF116 Protein of un 63.7 9.4 0.00024 18.8 3.5 40 78-119 103-142 (158) 55 PRK08228 L(+)-tartrate dehydra 63.3 12 0.00031 18.0 4.2 121 221-370 12-145 (201) 56 COG1936 Predicted nucleotide k 62.6 9.6 0.00025 18.7 3.4 106 20-144 4-124 (180) 57 PRK05773 3,4-dihydroxy-2-butan 62.3 13 0.00032 17.9 4.6 38 491-528 165-208 (218) 58 PRK08577 hypothetical protein; 61.7 7 0.00018 19.7 2.5 64 18-83 58-126 (135) 59 PRK07102 short chain dehydroge 61.6 13 0.00033 17.8 4.2 40 16-56 1-41 (243) 60 PRK06842 fumarate hydratase; P 59.9 14 0.00035 17.6 4.4 41 326-369 103-144 (185) 61 PRK02261 methylaspartate mutas 59.7 14 0.00035 17.5 4.5 79 27-107 18-114 (137) 62 TIGR00511 aIF-2BII_fam transla 57.5 7.7 0.0002 19.4 2.1 32 131-171 9-41 (303) 63 TIGR02296 HpaC 4-hydroxyphenyl 57.4 3.3 8.5E-05 22.0 0.3 31 37-70 8-38 (157) 64 cd03822 GT1_ecORF704_like This 56.8 8.4 0.00021 19.1 2.2 30 18-47 2-39 (366) 65 TIGR00216 ispH_lytB 4-hydroxy- 56.3 7.9 0.0002 19.3 2.0 35 72-116 29-63 (354) 66 pfam11009 DUF2847 Protein of u 56.3 16 0.0004 17.1 5.4 52 283-336 19-72 (104) 67 TIGR02906 spore_CotS spore coa 56.2 5.2 0.00013 20.6 1.1 107 234-351 59-183 (321) 68 KOG3010 consensus 55.7 8.3 0.00021 19.2 2.1 71 298-369 4-83 (261) 69 cd06330 PBP1_Arsenic_SBP_like 55.6 16 0.00041 17.1 4.1 20 340-359 195-214 (346) 70 CHL00202 argB acetylglutamate 55.1 15 0.00039 17.2 3.4 37 75-114 21-61 (284) 71 pfam02679 ComA (2R)-phospho-3- 55.1 16 0.00042 17.0 3.7 51 15-66 43-107 (245) 72 PRK01388 arginine deiminase; P 54.9 13 0.00033 17.8 2.9 31 475-508 359-389 (410) 73 PRK07985 oxidoreductase; Provi 52.5 18 0.00046 16.7 8.0 45 2-46 35-79 (294) 74 TIGR02717 AcCoA-syn-alpha acet 52.3 14 0.00036 17.5 2.8 173 336-520 209-400 (457) 75 COG2355 Zn-dependent dipeptida 51.9 18 0.00047 16.7 8.1 40 284-328 184-224 (313) 76 cd01915 CODH Carbon monoxide d 51.6 15 0.00039 17.2 2.9 44 312-366 345-388 (613) 77 TIGR01090 apt adenine phosphor 51.1 7.6 0.00019 19.4 1.3 17 155-171 23-39 (175) 78 COG4109 Predicted transcriptio 51.1 19 0.00048 16.6 4.0 76 261-346 261-336 (432) 79 COG0547 TrpD Anthranilate phos 50.8 18 0.00047 16.7 3.2 66 285-364 106-172 (338) 80 pfam09989 DUF2229 CoA enzyme a 50.5 19 0.00049 16.5 7.1 95 29-151 14-109 (218) 81 KOG0832 consensus 50.5 19 0.00049 16.5 3.9 16 132-149 44-59 (251) 82 PRK10786 ribD bifunctional dia 49.7 20 0.0005 16.4 6.4 25 79-108 71-95 (367) 83 pfam02603 Hpr_kinase_N HPr Ser 49.5 20 0.00051 16.4 4.4 40 254-294 54-93 (127) 84 pfam02593 DUF166 Uncharacteriz 48.8 20 0.00052 16.3 4.6 10 286-295 144-153 (215) 85 pfam05683 Fumerase_C Fumarase 48.7 20 0.00052 16.3 4.5 29 494-524 116-144 (205) 86 pfam02401 LYTB LytB protein. T 48.5 12 0.0003 18.1 1.9 31 72-110 26-56 (280) 87 TIGR02153 gatD_arch glutamyl-t 48.0 6.1 0.00016 20.1 0.4 49 485-533 292-347 (413) 88 PRK13805 bifunctional acetalde 47.9 21 0.00053 16.2 4.6 304 98-458 162-558 (862) 89 cd01968 Nitrogenase_NifE_I Nit 47.9 14 0.00037 17.4 2.3 32 491-528 348-379 (410) 90 pfam07799 DUF1643 Protein of u 47.7 11 0.00028 18.3 1.7 49 99-151 39-87 (135) 91 TIGR02988 YaaA_near_RecF S4 do 47.5 6.4 0.00016 20.0 0.4 35 26-62 5-40 (60) 92 pfam00762 Ferrochelatase Ferro 46.1 22 0.00057 16.1 4.1 15 343-357 255-269 (312) 93 PRK06180 short chain dehydroge 45.9 22 0.00057 16.0 4.2 150 14-198 2-168 (277) 94 pfam07905 PucR Purine cataboli 44.6 23 0.0006 15.9 4.7 43 489-531 61-106 (122) 95 pfam00383 dCMP_cyt_deam_1 Cyti 44.4 24 0.0006 15.9 8.4 83 418-505 8-102 (104) 96 LOAD_Ccd2 consensus 43.4 24 0.00062 15.8 3.4 54 94-155 59-113 (114) 97 pfam11152 DUF2930 Protein of u 43.4 24 0.00062 15.8 7.2 92 408-509 71-166 (196) 98 TIGR00160 MGSA methylglyoxal s 43.2 18 0.00045 16.8 2.1 48 17-64 4-55 (143) 99 pfam00625 Guanylate_kin Guanyl 43.2 8.4 0.00022 19.1 0.5 69 132-202 95-165 (182) 100 PRK08251 short chain dehydroge 43.2 25 0.00063 15.7 5.1 39 17-56 3-42 (248) 101 TIGR01137 cysta_beta cystathio 43.0 14 0.00035 17.6 1.5 15 145-159 83-97 (527) 102 pfam02602 HEM4 Uroporphyrinoge 42.6 25 0.00064 15.7 5.5 41 328-368 76-120 (229) 103 PRK06128 oxidoreductase; Provi 41.6 26 0.00066 15.6 7.6 38 8-45 47-84 (300) 104 TIGR02518 EutH_ACDH acetaldehy 41.4 8.6 0.00022 19.0 0.3 38 136-173 192-270 (528) 105 PRK13111 trpA tryptophan synth 41.4 26 0.00067 15.6 4.8 36 333-368 106-144 (256) 106 TIGR01158 SUI1_rel translation 41.3 7.1 0.00018 19.7 -0.1 29 19-51 51-81 (111) 107 TIGR02873 spore_ylxY probable 41.2 26 0.00067 15.5 3.5 36 492-528 170-205 (269) 108 PRK06953 short chain dehydroge 40.7 27 0.00068 15.5 5.0 40 16-55 1-40 (222) 109 COG1880 CdhB CO dehydrogenase/ 40.6 26 0.00067 15.5 2.7 46 15-60 34-83 (170) 110 TIGR00506 ribB 3,4-dihydroxy-2 40.0 27 0.00068 15.5 2.7 91 429-528 100-222 (230) 111 PRK12360 4-hydroxy-3-methylbut 39.9 21 0.00054 16.2 2.1 30 74-111 31-60 (281) 112 TIGR00337 PyrG CTP synthase; I 39.6 25 0.00064 15.7 2.5 35 79-114 184-219 (571) 113 pfam02091 tRNA-synt_2e Glycyl- 39.4 10 0.00027 18.4 0.5 20 221-240 55-74 (284) 114 TIGR01830 3oxo_ACP_reduc 3-oxo 39.4 28 0.00071 15.3 6.0 44 418-468 107-158 (238) 115 TIGR00070 hisG ATP phosphoribo 39.0 19 0.00047 16.6 1.7 15 343-357 131-145 (317) 116 PRK02491 putative deoxyribonuc 38.6 22 0.00055 16.1 2.0 42 416-459 176-217 (328) 117 COG2428 Uncharacterized conser 38.4 25 0.00065 15.6 2.3 52 29-82 17-71 (196) 118 PRK02304 adenine phosphoribosy 38.1 11 0.00028 18.3 0.4 23 150-172 21-44 (174) 119 COG0108 RibB 3,4-dihydroxy-2-b 38.0 29 0.00075 15.2 4.1 38 491-528 146-190 (203) 120 PRK13113 consensus 37.3 30 0.00076 15.1 4.5 34 335-368 117-153 (263) 121 COG3185 4-hydroxyphenylpyruvat 37.2 30 0.00077 15.1 4.0 66 290-359 213-297 (363) 122 TIGR02364 dha_pts dihydroxyace 36.9 23 0.00059 15.9 1.9 31 19-49 4-37 (128) 123 PRK00164 moaA molybdenum cofac 36.8 30 0.00078 15.1 7.2 75 27-113 82-167 (334) 124 KOG2811 consensus 36.5 22 0.00055 16.1 1.7 49 271-321 248-309 (420) 125 pfam08149 BING4CT BING4CT (NUC 36.2 8.9 0.00023 18.9 -0.3 34 475-508 7-40 (80) 126 PRK02492 deoxyhypusine synthas 36.2 31 0.00079 15.0 3.2 22 27-48 72-95 (344) 127 cd04724 Tryptophan_synthase_al 36.0 31 0.0008 15.0 4.8 32 337-368 101-135 (242) 128 PRK13127 consensus 35.7 32 0.00081 14.9 4.7 32 337-368 112-146 (262) 129 PRK01185 ppnK inorganic polyph 35.5 32 0.00081 14.9 3.7 44 16-59 1-47 (272) 130 pfam04210 MtrG Tetrahydrometha 35.2 30 0.00078 15.1 2.3 20 154-173 5-24 (70) 131 smart00854 PGA_cap Bacterial c 35.1 32 0.00082 14.9 4.7 11 106-116 30-40 (239) 132 PRK13137 consensus 34.7 33 0.00083 14.8 4.5 34 335-368 123-159 (266) 133 PRK13131 consensus 34.7 33 0.00083 14.8 4.3 10 301-310 57-66 (257) 134 cd01391 Periplasmic_Binding_Pr 34.6 33 0.00084 14.8 5.6 87 269-360 111-201 (269) 135 cd01981 Pchlide_reductase_B Pc 34.5 33 0.00084 14.8 3.2 70 253-340 167-236 (430) 136 cd06319 PBP1_ABC_sugar_binding 34.5 33 0.00084 14.8 4.7 54 305-360 175-231 (277) 137 KOG0774 consensus 34.5 33 0.00084 14.8 3.2 67 326-396 193-259 (334) 138 TIGR03627 arch_S9P archaeal ri 34.2 28 0.00071 15.3 2.0 22 444-465 64-85 (130) 139 PTZ00094 serine hydroxymethylt 34.1 27 0.00069 15.4 1.9 72 318-391 248-348 (450) 140 TIGR00338 serB phosphoserine p 34.0 34 0.00086 14.8 3.6 84 26-114 88-178 (223) 141 TIGR02346 chap_CCT_theta T-com 33.9 34 0.00086 14.7 3.8 43 407-457 380-428 (554) 142 PRK13978 ribose-5-phosphate is 33.7 34 0.00086 14.7 3.8 34 132-165 93-128 (228) 143 KOG4127 consensus 33.6 34 0.00087 14.7 5.6 52 268-328 255-307 (419) 144 PRK04046 translation initiatio 33.6 34 0.00087 14.7 2.6 64 290-367 62-135 (221) 145 PRK13122 consensus 33.6 34 0.00087 14.7 4.4 12 341-352 104-115 (242) 146 PRK09522 bifunctional anthrani 33.5 29 0.00074 15.2 2.0 68 284-365 301-369 (531) 147 PRK01792 ribB 3,4-dihydroxy-2- 33.4 34 0.00087 14.7 4.4 38 491-528 157-201 (214) 148 cd06578 HemD Uroporphyrinogen- 33.1 35 0.00088 14.7 5.8 43 328-370 84-130 (239) 149 TIGR01283 nifE nitrogenase MoF 32.7 35 0.0009 14.6 3.4 34 499-532 337-370 (470) 150 smart00654 eIF6 translation in 32.7 35 0.0009 14.6 2.6 62 291-366 61-134 (200) 151 PRK01026 tetrahydromethanopter 32.6 34 0.00086 14.8 2.2 20 154-173 6-25 (75) 152 PRK04200 cofactor-independent 32.5 35 0.0009 14.6 4.7 29 227-256 204-232 (395) 153 TIGR01702 CO_DH_cata carbon-mo 32.3 36 0.00091 14.6 2.4 58 327-393 373-436 (647) 154 cd03376 TPP_PFOR_porB_like Thi 32.2 26 0.00065 15.6 1.5 29 136-164 93-121 (235) 155 COG0752 GlyQ Glycyl-tRNA synth 32.1 24 0.00062 15.8 1.4 22 219-240 59-80 (298) 156 PRK06721 threonine synthase; R 31.8 36 0.00092 14.5 3.3 15 145-159 93-107 (352) 157 pfam01634 HisG ATP phosphoribo 31.8 26 0.00066 15.6 1.5 27 343-369 66-93 (161) 158 cd04630 CBS_pair_17 The CBS do 31.8 36 0.00093 14.5 4.6 92 352-444 14-105 (114) 159 cd00733 GlyRS_alpha_core Class 31.6 23 0.00058 16.0 1.2 21 220-240 55-75 (279) 160 PRK07576 short chain dehydroge 31.5 37 0.00094 14.5 6.9 40 15-54 7-46 (260) 161 CHL00079 rps9 ribosomal protei 31.4 25 0.00064 15.7 1.4 23 443-465 71-93 (134) 162 TIGR01326 OAH_OAS_sulfhy O-ace 31.4 37 0.00094 14.5 2.6 83 246-337 285-368 (434) 163 PRK12825 fabG 3-ketoacyl-(acyl 31.1 37 0.00095 14.4 5.7 90 12-119 3-96 (250) 164 PRK06701 short chain dehydroge 31.0 37 0.00095 14.4 7.3 42 7-48 36-77 (289) 165 PRK09348 glyQ glycyl-tRNA synt 31.0 20 0.00051 16.4 0.8 20 221-240 58-77 (291) 166 pfam05505 Ebola_NP Ebola nucle 30.9 28 0.00071 15.4 1.5 58 16-73 18-103 (717) 167 PRK13584 hisG ATP phosphoribos 30.7 28 0.00071 15.3 1.5 13 95-107 52-64 (204) 168 TIGR02124 hypE hydrogenase exp 30.6 24 0.00062 15.8 1.2 123 253-388 188-330 (345) 169 COG0462 PrsA Phosphoribosylpyr 30.2 25 0.00063 15.7 1.2 18 343-360 248-265 (314) 170 PRK07454 short chain dehydroge 30.1 39 0.00099 14.3 6.5 85 13-116 3-91 (241) 171 PRK13135 consensus 30.0 39 0.00099 14.3 4.4 34 335-368 116-152 (267) 172 PRK01686 hisG ATP phosphoribos 29.9 29 0.00074 15.2 1.5 57 48-116 16-73 (212) 173 CHL00200 trpA tryptophan synth 29.8 39 0.00099 14.3 4.4 44 266-312 27-72 (263) 174 PRK00011 glyA serine hydroxyme 29.8 35 0.0009 14.6 1.9 16 319-338 235-250 (415) 175 PRK13112 consensus 29.6 39 0.001 14.3 4.6 37 332-368 115-154 (279) 176 cd01310 TatD_DNAse TatD like p 29.6 39 0.001 14.3 4.3 29 144-172 22-51 (251) 177 PRK09291 short chain dehydroge 29.4 40 0.001 14.2 3.3 120 17-163 3-136 (257) 178 PRK03244 argD acetylornithine 29.4 40 0.001 14.2 4.0 17 504-520 370-386 (398) 179 COG4079 Uncharacterized protei 29.3 40 0.001 14.2 3.4 40 30-69 47-87 (293) 180 pfam01884 PcrB PcrB family. Th 29.1 40 0.001 14.2 4.1 50 482-531 163-212 (231) 181 PTZ00086 40S ribosomal protein 29.0 28 0.00071 15.3 1.3 22 444-465 74-95 (147) 182 KOG4226 consensus 29.0 15 0.00038 17.3 -0.1 44 342-386 192-236 (379) 183 PRK13222 phosphoglycolate phos 29.0 40 0.001 14.2 5.2 18 491-508 178-195 (228) 184 PRK09072 short chain dehydroge 28.9 40 0.001 14.2 4.5 123 14-160 3-139 (262) 185 TIGR00732 dprA DNA protecting 28.8 40 0.001 14.2 2.6 34 400-436 60-93 (238) 186 TIGR01584 citF citrate lyase, 28.8 22 0.00055 16.2 0.7 25 31-55 113-137 (496) 187 KOG4627 consensus 28.7 41 0.001 14.2 3.8 51 220-276 42-92 (270) 188 KOG0673 consensus 28.7 35 0.0009 14.6 1.7 32 34-65 79-110 (293) 189 PRK00087 4-hydroxy-3-methylbut 28.6 37 0.00094 14.5 1.8 47 288-335 184-230 (670) 190 TIGR00568 alkb alkylated DNA r 28.6 21 0.00055 16.2 0.6 90 411-536 101-197 (199) 191 TIGR02326 transamin_PhnW 2-ami 28.5 17 0.00044 16.9 0.1 127 177-323 37-178 (366) 192 COG1976 TIF6 Translation initi 28.4 41 0.001 14.1 2.4 34 313-360 96-129 (222) 193 KOG0923 consensus 28.3 41 0.0011 14.1 3.1 36 133-170 194-234 (902) 194 COG0112 GlyA Glycine/serine hy 28.3 35 0.00089 14.6 1.7 20 317-339 234-253 (413) 195 PRK08267 short chain dehydroge 28.3 41 0.0011 14.1 6.7 80 16-116 1-85 (258) 196 PRK05819 deoD purine nucleosid 28.1 42 0.0011 14.1 4.0 29 416-444 182-210 (235) 197 PRK00035 hemH ferrochelatase; 27.9 42 0.0011 14.1 4.0 15 343-357 262-276 (325) 198 PRK06209 glutamate-1-semialdeh 27.4 43 0.0011 14.0 4.8 28 257-289 167-194 (431) 199 pfam08727 P3A Poliovirus 3A pr 27.2 35 0.00089 14.6 1.5 15 509-523 26-40 (57) 200 COG0159 TrpA Tryptophan syntha 27.2 43 0.0011 14.0 4.5 12 267-278 193-204 (265) 201 TIGR00977 LeuA_rel 2-isopropyl 27.2 43 0.0011 14.0 2.2 84 92-196 46-134 (543) 202 PRK13324 pantothenate kinase; 27.1 43 0.0011 14.0 2.2 14 339-352 145-158 (258) 203 TIGR00109 hemH ferrochelatase; 27.1 43 0.0011 14.0 4.3 17 343-359 281-297 (339) 204 TIGR00519 asnASE_I L-asparagin 27.0 22 0.00057 16.0 0.5 25 343-367 225-250 (347) 205 pfam00380 Ribosomal_S9 Ribosom 27.0 27 0.00069 15.5 0.9 49 411-465 32-80 (121) 206 PRK13580 serine hydroxymethylt 27.0 40 0.001 14.2 1.8 26 404-429 423-449 (493) 207 PRK10425 DNase TatD; Provision 26.8 44 0.0011 13.9 4.4 32 144-175 22-54 (258) 208 PRK11864 2-ketoisovalerate fer 26.8 29 0.00075 15.2 1.1 29 136-164 105-133 (300) 209 KOG0525 consensus 26.7 44 0.0011 13.9 2.3 86 320-412 98-190 (362) 210 cd07571 ALP_N-acyl_transferase 26.7 33 0.00085 14.8 1.3 70 417-512 162-233 (270) 211 cd04872 ACT_1ZPV ACT domain pr 26.7 44 0.0011 13.9 3.5 29 18-46 1-32 (88) 212 PRK01045 ispH 4-hydroxy-3-meth 26.5 44 0.0011 13.9 1.9 29 74-110 30-58 (304) 213 PRK13371 4-hydroxy-3-methylbut 26.4 40 0.001 14.2 1.7 70 31-111 28-104 (392) 214 COG1013 PorB Pyruvate:ferredox 26.3 31 0.0008 15.0 1.1 29 136-164 101-129 (294) 215 PRK11867 2-oxoglutarate ferred 26.3 45 0.0011 13.9 4.1 29 136-164 99-127 (280) 216 TIGR00520 asnASE_II L-asparagi 26.3 25 0.00063 15.7 0.6 15 380-394 197-211 (360) 217 COG1587 HemD Uroporphyrinogen- 26.2 45 0.0011 13.8 5.9 145 328-510 85-233 (248) 218 TIGR01918 various_sel_PB selen 26.1 44 0.0011 13.9 1.9 38 426-463 153-190 (433) 219 TIGR03568 NeuC_NnaA UDP-N-acet 26.1 45 0.0011 13.8 4.4 32 29-60 16-58 (365) 220 PTZ00298 mevalonate kinase; Pr 26.1 25 0.00063 15.7 0.6 27 287-328 145-171 (328) 221 PRK12823 benD 1,6-dihydroxycyc 26.0 45 0.0012 13.8 8.3 38 12-49 4-41 (260) 222 COG1483 Predicted ATPase (AAA+ 26.0 45 0.0012 13.8 2.1 114 24-160 98-230 (774) 223 pfam11564 BpuJI_N Restriction 25.9 42 0.0011 14.0 1.8 19 406-424 266-284 (294) 224 cd00610 OAT_like Acetyl ornith 25.9 45 0.0012 13.8 3.1 18 504-521 391-408 (413) 225 pfam05327 RRN3 RNA polymerase 25.9 25 0.00064 15.7 0.6 72 129-205 150-223 (550) 226 COG1433 Uncharacterized conser 25.8 45 0.0012 13.8 2.3 58 330-392 55-112 (121) 227 KOG1203 consensus 25.8 19 0.00049 16.5 -0.0 16 143-158 188-203 (411) 228 PRK00474 rps9p 30S ribosomal p 25.8 43 0.0011 14.0 1.8 21 444-464 66-86 (132) 229 pfam01912 eIF-6 eIF-6 family. 25.6 46 0.0012 13.8 3.5 63 291-367 61-133 (198) 230 pfam00290 Trp_syntA Tryptophan 25.6 46 0.0012 13.8 4.4 33 336-368 110-145 (258) 231 PRK07814 short chain dehydroge 25.5 46 0.0012 13.8 6.4 87 13-117 7-96 (263) 232 KOG1753 consensus 25.5 39 0.00098 14.3 1.5 23 442-464 70-92 (145) 233 pfam04963 Sigma54_CBD Sigma-54 25.3 46 0.0012 13.7 4.5 101 265-395 67-183 (195) 234 PRK13123 consensus 25.2 47 0.0012 13.7 4.6 17 341-357 118-134 (256) 235 PRK13140 consensus 25.1 47 0.0012 13.7 4.7 101 266-369 26-151 (257) 236 PRK00188 trpD anthranilate pho 25.0 47 0.0012 13.7 2.9 40 324-363 130-170 (339) 237 PRK13034 serine hydroxymethylt 24.9 36 0.00091 14.6 1.2 18 318-338 240-257 (422) 238 PRK00230 orotidine 5'-phosphat 24.9 45 0.0011 13.8 1.7 58 138-201 89-155 (231) 239 PRK12418 cysteinyl-tRNA synthe 24.7 42 0.0011 14.1 1.5 115 13-154 9-123 (384) 240 pfam01045 consensus 24.7 25 0.00064 15.7 0.4 15 157-171 112-126 (411) 241 COG1803 MgsA Methylglyoxal syn 24.5 48 0.0012 13.6 3.9 116 15-170 4-123 (142) 242 PRK13120 consensus 24.5 48 0.0012 13.6 4.3 142 333-520 119-265 (285) 243 PRK13374 purine nucleoside pho 24.4 48 0.0012 13.6 3.7 26 417-442 184-209 (233) 244 pfam01244 Peptidase_M19 Membra 24.3 48 0.0012 13.6 7.2 187 96-327 35-232 (316) 245 PRK03675 consensus 23.9 40 0.001 14.2 1.4 24 479-504 234-257 (279) 246 PRK06500 short chain dehydroge 23.7 50 0.0013 13.5 4.8 44 13-56 3-46 (249) 247 PRK13973 thymidylate kinase; P 23.6 50 0.0013 13.5 3.2 100 52-161 26-138 (216) 248 cd03375 TPP_OGFOR Thiamine pyr 23.5 40 0.001 14.2 1.3 30 136-165 82-111 (193) 249 cd00587 HCP_like The HCP famil 23.5 50 0.0013 13.5 3.7 39 319-359 159-206 (258) 250 KOG2474 consensus 23.5 50 0.0013 13.5 1.9 55 259-319 230-293 (406) 251 PRK02842 light-independent pro 23.5 50 0.0013 13.5 3.5 30 27-56 26-55 (425) 252 PRK09607 rps11p 30S ribosomal 23.4 50 0.0013 13.5 4.0 21 50-70 94-114 (129) 253 PRK13583 hisG ATP phosphoribos 23.4 22 0.00057 16.1 -0.1 128 47-207 15-149 (226) 254 pfam01026 TatD_DNase TatD rela 23.4 50 0.0013 13.5 3.4 15 353-367 188-202 (255) 255 pfam09894 DUF2121 Uncharacteri 23.4 41 0.0011 14.1 1.3 40 30-69 47-86 (194) 256 cd02018 TPP_PFOR Thiamine pyro 23.3 41 0.001 14.2 1.3 29 136-164 100-128 (237) 257 KOG1712 consensus 23.3 28 0.00071 15.3 0.4 16 155-170 33-48 (183) 258 PRK09628 oorB 2-oxoglutarate-a 23.3 50 0.0013 13.5 3.8 30 135-164 98-127 (281) 259 TIGR02329 propionate_PrpR prop 23.1 51 0.0013 13.5 3.4 60 251-316 115-189 (658) 260 PRK10424 ilvG operon leader pe 23.1 35 0.00089 14.6 0.9 14 285-300 16-29 (32) 261 PRK07394 hypothetical protein; 23.1 51 0.0013 13.4 3.0 64 284-364 112-182 (342) 262 COG2046 MET3 ATP sulfurylase ( 22.9 51 0.0013 13.4 2.9 70 83-176 5-74 (397) 263 PRK00912 ribonuclease P protei 22.8 51 0.0013 13.4 3.9 19 28-46 20-38 (237) 264 TIGR01220 Pmev_kin_Gr_pos phos 22.8 31 0.0008 15.0 0.6 78 315-395 175-264 (367) 265 TIGR03447 mycothiol_MshC cyste 22.8 52 0.0013 13.4 1.8 108 21-155 44-151 (411) 266 TIGR01182 eda 2-dehydro-3-deox 22.7 52 0.0013 13.4 3.9 33 491-526 73-105 (205) 267 cd01398 RPI_A RPI_A: Ribose 5- 22.6 52 0.0013 13.4 4.1 30 135-164 90-121 (213) 268 TIGR02253 CTE7 HAD superfamily 22.6 52 0.0013 13.4 7.1 16 27-42 116-131 (244) 269 PRK13124 consensus 22.5 52 0.0013 13.4 4.2 12 341-352 114-125 (257) 270 TIGR02460 osmo_MPGsynth mannos 22.5 47 0.0012 13.7 1.5 23 253-277 162-189 (394) 271 KOG3185 consensus 22.5 51 0.0013 13.4 1.6 34 312-359 98-131 (245) 272 PRK11865 pyruvate ferredoxin o 22.3 33 0.00084 14.8 0.6 29 136-164 102-130 (298) 273 pfam00464 SHMT Serine hydroxym 22.3 42 0.0011 14.0 1.2 14 264-277 146-159 (380) 274 COG0449 GlmS Glucosamine 6-pho 22.2 53 0.0013 13.3 2.6 20 30-49 106-125 (597) 275 COG0120 RpiA Ribose 5-phosphat 22.2 53 0.0013 13.3 2.6 70 95-167 56-128 (227) 276 cd01977 Nitrogenase_VFe_alpha 22.2 47 0.0012 13.7 1.4 16 73-88 18-33 (415) 277 cd01301 rDP_like renal dipepti 22.2 53 0.0013 13.3 6.0 181 97-327 30-228 (309) 278 PRK11449 putative deoxyribonuc 22.2 53 0.0013 13.3 4.2 31 143-173 25-56 (258) 279 pfam01939 DUF91 Protein of unk 22.1 53 0.0014 13.3 2.2 61 284-370 150-212 (228) 280 PRK06182 short chain dehydroge 22.1 53 0.0014 13.3 6.1 40 15-55 2-42 (273) 281 TIGR01826 CofD_related conserv 22.1 53 0.0014 13.3 2.5 18 141-158 11-31 (331) 282 TIGR00601 rad23 UV excision re 22.1 42 0.0011 14.0 1.1 353 34-498 7-444 (453) 283 pfam08887 GAD-like GAD-like do 22.0 15 0.00039 17.3 -1.2 35 130-171 34-69 (109) 284 PRK13129 consensus 21.9 54 0.0014 13.3 4.3 32 337-368 120-154 (267) 285 TIGR02036 dsdC D-serine deamin 21.7 49 0.0013 13.5 1.4 15 512-526 233-247 (302) 286 cd06345 PBP1_ABC_ligand_bindin 21.7 54 0.0014 13.3 4.2 35 29-63 55-97 (344) 287 PRK10355 xylF D-xylose transpo 21.7 54 0.0014 13.3 4.9 43 317-359 211-256 (330) 288 PRK07168 bifunctional uroporph 21.5 54 0.0014 13.2 1.7 183 317-525 249-456 (474) 289 pfam02449 Glyco_hydro_42 Beta- 21.5 49 0.0012 13.6 1.4 29 300-328 166-194 (376) 290 cd01539 PBP1_GGBP Periplasmic 21.4 55 0.0014 13.2 5.3 17 320-336 204-220 (303) 291 cd05313 NAD_bind_2_Glu_DH NAD( 21.4 55 0.0014 13.2 1.6 38 323-361 129-170 (254) 292 PRK00168 coaD phosphopantethei 21.4 55 0.0014 13.2 2.4 48 136-192 13-60 (159) 293 cd06380 PBP1_iGluR_AMPA N-term 21.4 55 0.0014 13.2 3.3 26 480-505 316-347 (382) 294 TIGR02434 CobF precorrin-6A sy 21.4 46 0.0012 13.8 1.2 11 195-205 80-90 (259) 295 COG2820 Udp Uridine phosphoryl 21.3 46 0.0012 13.8 1.2 36 412-447 189-226 (248) 296 PRK13361 molybdenum cofactor b 21.1 55 0.0014 13.2 8.5 76 27-115 76-163 (329) 297 PRK13118 consensus 21.1 55 0.0014 13.2 3.4 13 340-352 122-134 (269) 298 PTZ00136 eukaryotic translatio 21.0 56 0.0014 13.2 3.7 40 313-366 99-138 (247) 299 PRK06846 deaminase; Validated 20.8 56 0.0014 13.1 2.8 15 70-84 34-48 (410) 300 PRK00132 rpsI 30S ribosomal pr 20.8 50 0.0013 13.5 1.3 22 443-464 67-88 (130) 301 PRK12828 short chain dehydroge 20.8 56 0.0014 13.1 6.2 43 13-55 4-46 (239) 302 PRK12826 3-ketoacyl-(acyl-carr 20.5 57 0.0015 13.1 2.9 43 13-55 3-45 (253) 303 COG1838 FumA Tartrate dehydrat 20.4 57 0.0015 13.1 3.8 55 313-370 88-145 (184) 304 COG4750 LicC CTP:phosphocholin 20.3 24 0.00062 15.8 -0.4 42 73-114 10-54 (231) 305 PRK00120 putative deoxyribonuc 20.2 33 0.00084 14.8 0.3 30 16-45 2-31 (197) 306 PRK02478 Maf-like protein; Rev 20.2 47 0.0012 13.7 1.0 46 227-277 147-194 (199) 307 PRK11866 2-oxoacid ferredoxin 20.0 58 0.0015 13.0 5.3 30 136-165 98-127 (286) No 1 >PRK00881 purH bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional Probab=100.00 E-value=0 Score=1662.98 Aligned_cols=511 Identities=59% Similarity=0.950 Sum_probs=493.3 Q ss_pred CCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCC Q ss_conf 76136989987545686999999998799999942768999988995179465348816519855323715553553308 Q gi|254780664|r 13 EIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIR 92 (536) Q Consensus 13 ~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r 92 (536) |.+||||||||||||||++||+.|+++||||+|||||+++|+++||||++|||+||||||||||||||||+||||||+|| T Consensus 1 r~~ikrALISV~DKtgl~~la~~L~~~g~eiiSTGGTak~l~~~gi~v~~Vs~~T~fpEil~GRVKTLHP~IhgGIL~~r 80 (514) T PRK00881 1 RRMIKRALISVSDKTGIVEFAKALVALGVEILSTGGTAKLLAEAGIPVTEVSDVTGFPEILDGRVKTLHPKIHGGILARR 80 (514) T ss_pred CCCCCEEEEEEECCCCHHHHHHHHHHCCCEEEECCHHHHHHHHCCCCEEEHHHCCCCHHHHCCCCCCCCCHHHHHHHCCC T ss_conf 99764799998866498999999998898999872789999987996588886049806529815646830226661378 Q ss_pred CCHHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCC-CHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHH Q ss_conf 89899999997799841499994888899731765-31123200233434699987401563135308999898898753 Q gi|254780664|r 93 DNPAHMKFMQDHELESIDLVVVNLYPFEESFCRED-DYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMD 171 (536) Q Consensus 93 ~~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~-~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~ 171 (536) ++++|+++|+++||++|||||||||||++++++++ +++++|||||||||||||||||||++|+|||||+||+.++++|+ T Consensus 81 ~~~~h~~~l~~~~i~~IDlVvvNLYPFe~~v~~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~~l~ 160 (514) T PRK00881 81 DNPEHVAALEEHGIEPIDLVVVNLYPFEETVAKPGVTLEEAIENIDIGGPTMVRAAAKNHKDVTVVVDPADYDAVLEELK 160 (514) T ss_pred CCHHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCCHHHHHHHCCCCCHHHHHHHHHCCCCEEEECCHHHHHHHHHHHH T ss_conf 98889999997699963389981454698750489998999884477749999998744787599647899999999998 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCC-CCCCCCC Q ss_conf 023456667889999999998742289999875331012113443210123333436788100650576047-7668210 Q gi|254780664|r 172 VNNGKIPYNFRKKMARQAFSRTASYDTAICRWLANAESENFPDYLNITAVKKQEMRYGENPHQKAALYSTPE-KKSGIAH 250 (536) Q Consensus 172 ~~~g~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~~~~~~~~p~~~~~~~~~~~~LRYGENPHQ~Aa~Y~~~~-~~~~~~~ 250 (536) ++ |++++++|++||.|||++||+||++|++||+++.++.+|+.+.+++++.++|||||||||+|++|.... ...++.+ T Consensus 161 ~~-g~~s~~~R~~lA~kAF~~Ta~YD~~Is~~l~~~~~~~~p~~~~~~~~~~~~LRYGENPHQ~aa~Y~~~~~~~~~~~~ 239 (514) T PRK00881 161 AN-GSTSLETRFRLAAKAFAHTAAYDAAIANYFASQAGEEFPETLNLSFEKKQDLRYGENPHQKAAFYRDKDGPAGGVAT 239 (514) T ss_pred HC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEECCCCCCCCCCCCCCCCHHHHCCCCCCCCCCCCC T ss_conf 67-99799999999999999987608999999865301458621787142034542688814131430577767766421 Q ss_pred HHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCH Q ss_conf 12105898883122212001167764245567279996156865654520168999999853321320045554275421 Q gi|254780664|r 251 AVLVQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQ 330 (536) Q Consensus 251 ~~ql~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~ 330 (536) ++||||||||||||+|+|+||+||+||.+ |+||||||+||||||+++++.+||++|++|||+||||||||||++||. T Consensus 240 ~~ql~GKeLSYNN~lD~daA~~lv~ef~~---pa~vIiKH~nPCGvA~~~~~~~Ay~~A~~~Dp~SAFGGIva~N~~vd~ 316 (514) T PRK00881 240 AKQLQGKELSYNNIADADAALECVKEFDE---PACVIVKHANPCGVAVGDTILEAYDKAYACDPVSAFGGIIAFNREVDA 316 (514) T ss_pred HHHCCCCCCCCCCHHHHHHHHHHHHHCCC---CEEEEEECCCCCCHHCCCCHHHHHHHHHHCCCCCCCCCEEEECCCCCH T ss_conf 11005876674447668999999983667---689999516874020465799999998704974545747996262689 Q ss_pred HHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCHHHHHCCCCCC-CCCCCCCCCCCCC Q ss_conf 457878865431111268798999885124553156316766777666402032102231012334-5310001246777 Q gi|254780664|r 331 EVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDNV-VDNKELTVVTKRS 409 (536) Q Consensus 331 ~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~~-~~~~~~~vVT~~~ 409 (536) ++|++|.+.|+||||||+|++|||++|++|||+|+|++... ....+++|++.||+|+|++|.. ++..+|++||+++ T Consensus 317 ~~A~~i~~~F~EvIiAP~f~~eAL~iL~~KKNlRvl~~~~~---~~~~~~~rsi~GG~LvQ~~D~~~~~~~~~k~VT~~~ 393 (514) T PRK00881 317 ETAEAIVKIFLEVIIAPSFDEEALEILAKKKNLRLLECGEL---PDPGLDFKSVSGGLLVQDRDNGMVDPADLKVVTKRQ 393 (514) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCEEEEECCCC---CCCCCEEEEEECEEEEEECCCCCCCHHHCEECCCCC T ss_conf 99999999888741178899899999863985699815887---888742899954299984577778988843113899 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCCC Q ss_conf 86889888999999860056677999968938997296888899999999998764200122223487099973668685 Q gi|254780664|r 410 PTDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSVIASEAFYPFP 489 (536) Q Consensus 410 pt~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSDAFFPF~ 489 (536) |+++|++||+|||+|||||||||||++||++|||||+||||||||++||++||++++. .++|+||||||||||| T Consensus 394 p~~~e~~DL~FAwkv~k~vKSNAIv~ak~~~tvGiGaGQ~sRVda~~iA~~ka~~~~~------~~~g~vlASDAFFPF~ 467 (514) T PRK00881 394 PTEQELKDLLFAWKVVKHVKSNAIVYAKDGQTVGIGAGQMSRVDSAKIALEKAGDAGL------DLKGSVLASDAFFPFR 467 (514) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEEECCCEEEECCCCCCHHHHHHHHHHHHHHHCC------CCCCCEEEECCCCCCC T ss_conf 8989986569999998636655699994893898678975689999999999886345------6678489956778874 Q ss_pred CHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEECCCCCCCC Q ss_conf 55899997199599938987798899999987597799747854479 Q gi|254780664|r 490 DGIVEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAMVFTGIRHFRH 536 (536) Q Consensus 490 D~ie~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m~fTg~RhF~H 536 (536) ||||.|+++||++||||||||||+|||++|||+||+|+|||.||||| T Consensus 468 D~ie~aa~~Gv~aIiQPGGSirD~evI~aan~~gi~M~fTg~RhF~H 514 (514) T PRK00881 468 DGVEAAAKAGITAIIQPGGSIRDQEVIAAADEHGIAMVFTGVRHFRH 514 (514) T ss_pred HHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHCCCEEEECCCCCCCC T ss_conf 47999998599099899975363999999997098799858766379 No 2 >COG0138 PurH AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) [Nucleotide transport and metabolism] Probab=100.00 E-value=0 Score=1602.05 Aligned_cols=513 Identities=58% Similarity=0.933 Sum_probs=496.7 Q ss_pred CEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCC Q ss_conf 13698998754568699999999879999994276899998899517946534881651985532371555355330889 Q gi|254780664|r 15 AVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDN 94 (536) Q Consensus 15 ~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~ 94 (536) +||||||||||||||++||+.|.++||||+|||||++.|+++||||++|||+||||||||||||||||+||||||+||+. T Consensus 1 ~ikrALiSVsdKtGive~ak~L~~~gvei~STGGTak~l~eaGi~V~~Vs~~TgfpE~ldGRVKTLHP~ihgGiL~~r~~ 80 (515) T COG0138 1 MIKRALLSVSDKTGIVEFAKALVELGVEILSTGGTAKLLAEAGIPVTEVSDITGFPEMLDGRVKTLHPKIHGGILARRDK 80 (515) T ss_pred CCCHHHEEECCCCCHHHHHHHHHHCCEEEEECCCHHHHHHHCCCCCCCHHHCCCCHHHHCCCCEEECCCCCCCEEECCCC T ss_conf 91222322003527099999998677799955977999996799777154224875673784134134501323320365 Q ss_pred HHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCC-CHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHCC Q ss_conf 899999997799841499994888899731765-3112320023343469998740156313530899989889875302 Q gi|254780664|r 95 PAHMKFMQDHELESIDLVVVNLYPFEESFCRED-DYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMDVN 173 (536) Q Consensus 95 ~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~-~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~~~ 173 (536) ++|+++|++++|.+|||||||||||++|+.+++ +++|+|||||||||||||||||||++|+|+|||.||+.++++|+.+ T Consensus 81 ~~h~~~l~e~~I~~iDlVvvNLYPF~~tv~~~~~~~~e~vEnIDIGGptmlRaAAKN~~~V~V~~dp~Dy~~vl~~l~~~ 160 (515) T COG0138 81 DEHMAALEEHGIEPIDLVVVNLYPFEETVAKPGVTLEEAVENIDIGGPTMLRAAAKNHKDVTVVVDPADYAAVLEELKAN 160 (515) T ss_pred HHHHHHHHHCCCCCCCEEEECCCCHHHHCCCCCCCHHHHHHCCCCCCHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHCC T ss_conf 77899999759877547998479825421679998898875655676177687663258705986601468999998547 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCHHH Q ss_conf 34566678899999999987422899998753310121134432101233334367881006505760477668210121 Q gi|254780664|r 174 NGKIPYNFRKKMARQAFSRTASYDTAICRWLANAESENFPDYLNITAVKKQEMRYGENPHQKAALYSTPEKKSGIAHAVL 253 (536) Q Consensus 174 ~g~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~~~~~~~~p~~~~~~~~~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~~~~q 253 (536) |++++++|++||.|||+|||+||++|++||++.....||+.+.+.+.+.++|||||||||+|+||.+.....+++.++| T Consensus 161 -~~~~~~~R~~lA~kaF~hTA~YDaaI~~~~~~~~~~~fp~~~~~~~~~~~~lRYGENPHQ~aa~Y~~~~~~~~va~a~q 239 (515) T COG0138 161 -GELSLATRKRLAAKAFEHTAAYDAAIANYFSELLGGEFPEQLNLSFTKKQDLRYGENPHQSAAFYRDGNAKGGVATAKQ 239 (515) T ss_pred -CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHEECCCCCCEEEECCCCCCCCCEEEECCCCCCCHHHHHH T ss_conf -8877899999999999987666999999997531133641110243200010057897778747841787876034787 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHH Q ss_conf 05898883122212001167764245567279996156865654520168999999853321320045554275421457 Q gi|254780664|r 254 VQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVA 333 (536) Q Consensus 254 l~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A 333 (536) ||||+||||||+|+|+||+||+||.+. .|+||||||+||||||+++++.+||.+||+|||+||||||||+|++||.+|| T Consensus 240 L~GK~lSYNNi~DaDaA~~~v~ef~~~-~pa~~ivKH~NPcGvA~~~~i~~Ay~~A~~~D~~SaFGGIIA~Nr~vD~etA 318 (515) T COG0138 240 LQGKELSYNNIADADAAWELVKEFDEP-GPACAIVKHANPCGVAVGDTIAEAYKRAYEADPTSAFGGIIALNREVDVETA 318 (515) T ss_pred HCCCCCCCCCHHHHHHHHHHHHHCCCC-CCEEEEEECCCCCHHCCCHHHHHHHHHHHCCCCCCCCCCEEEECCCCCHHHH T ss_conf 559725425255379999999855776-7618998437874010280599999999707972234788997474599999 Q ss_pred HHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCHHHHHCCCCCC-CCCCCCCCCCCCCCCH Q ss_conf 878865431111268798999885124553156316766777666402032102231012334-5310001246777868 Q gi|254780664|r 334 KEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDNV-VDNKELTVVTKRSPTD 412 (536) Q Consensus 334 ~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~~-~~~~~~~vVT~~~pt~ 412 (536) +.|+++|+||||||+|++|||+||++|||+|+|++..+ .+....+++|+|.||+|+|++|.. ++..+|++||+++||+ T Consensus 319 ~~i~~~F~EvIIAP~~~~~Al~il~kK~NlRvL~~~~~-~~~~~~~~~k~v~GGlLvQ~~d~~~~~~~~~~vVTkr~pt~ 397 (515) T COG0138 319 EAISKIFLEVIIAPSYTEEALEILAKKKNLRVLTIGPL-GPRAAGLEFKRVSGGLLVQERDDGMIDEAELKVVTKRQPTE 397 (515) T ss_pred HHHHHHHEEEEECCCCCHHHHHHHHHCCCEEEEECCCC-CCCCCCEEEEEEEEEEEEECCCCCCCCCCCEEEECCCCCCH T ss_conf 99986607999868999899999962576589964887-77776426899820589974644566730106742789998 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCHH Q ss_conf 89888999999860056677999968938997296888899999999998764200122223487099973668685558 Q gi|254780664|r 413 QELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSVIASEAFYPFPDGI 492 (536) Q Consensus 413 ~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSDAFFPF~D~i 492 (536) +||+||+|||+|||||||||||++||++|||||+||||||||+|||+.||++++.. ++|||||||||||||||| T Consensus 398 ~e~~dl~Fawkv~k~vKSNaIv~akd~~TvGiGaGQ~sRV~sariA~~kA~~~~~~------~~Gav~ASDAFFPF~D~i 471 (515) T COG0138 398 QELEDLLFAWKVVKHVKSNAIVYAKDGQTVGIGAGQMSRVDSARIAAEKAGDAGLR------AHGAVLASDAFFPFPDGI 471 (515) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCEEEEECCCCCCHHHHHHHHHHHHHHHHHH------CCCCEEEECCCCCCCCHH T ss_conf 99999999999986703453999709838875688511588999999987776531------268177523557886139 Q ss_pred HHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEECCCCCCCC Q ss_conf 99997199599938987798899999987597799747854479 Q gi|254780664|r 493 VEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAMVFTGIRHFRH 536 (536) Q Consensus 493 e~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m~fTg~RhF~H 536 (536) +.|+++||+|||||||||||+|||++||||||+|||||.||||| T Consensus 472 ~~aA~aGi~aIIqPGGSirD~eVI~aAde~giaMvfTg~RhF~H 515 (515) T COG0138 472 DAAAKAGIKAIIQPGGSIRDQEVIAAADEHGIAMVFTGVRHFRH 515 (515) T ss_pred HHHHHCCCEEEECCCCCCCCHHHHHHHHHCCCEEEECCCCCCCC T ss_conf 99997498699779986664999998886196799745433379 No 3 >KOG2555 consensus Probab=100.00 E-value=0 Score=1170.73 Aligned_cols=503 Identities=37% Similarity=0.613 Sum_probs=462.2 Q ss_pred EEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHH Q ss_conf 69899875456869999999987999999427689999889951794653488165198553237155535533088989 Q gi|254780664|r 17 KTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNPA 96 (536) Q Consensus 17 krALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~ 96 (536) |.|||||||||||.+|+++|.++|+.|++|||||+.|+++|++|++||++|+||||||||||||||.||||||+| |.++ T Consensus 5 k~AllSVsDKtgLl~La~gL~~~g~~lvaSGGTAk~lrdaG~~V~dVs~iT~fPEMLgGRVKTLHPaVHgGILAR-dies 83 (588) T KOG2555 5 KLALLSVSDKTGLLDLAKGLVELGVTLVASGGTAKMLRDAGLPVRDVSEITHFPEMLGGRVKTLHPAVHGGILAR-DIES 83 (588) T ss_pred EEEEEEEECCCCHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCCCHHHHHCCCHHHHCCCCCCCCCCCCCCEEEC-CCCH T ss_conf 379999615657588875788648289942861777986798500087642885874871100264100253314-5700 Q ss_pred HHHHHHHCCCCCEEEEEEECCCHHHHHCCCC-CHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHCCCC Q ss_conf 9999997799841499994888899731765-311232002334346999874015631353089998988987530234 Q gi|254780664|r 97 HMKFMQDHELESIDLVVVNLYPFEESFCRED-DYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMDVNNG 175 (536) Q Consensus 97 ~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~-~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~~~~g 175 (536) |+++|++++|..||+||||||||.+||++++ +.+|++||||||||||||||||||++|+|+|||.||+.|+++|... T Consensus 84 d~kdL~e~~i~~vdvVVcNLYPF~etVa~pgvtveeaVEnIDIGGvTLLRAAAKNH~rVtiv~dp~DY~~vlsel~~~-- 161 (588) T KOG2555 84 DEKDLKEQGIDKVDVVVCNLYPFKETVAKPGVTVEEAVENIDIGGVTLLRAAAKNHARVTIVCDPADYDAVLSELKSS-- 161 (588) T ss_pred HHHHHHHCCCCEEEEEEEECCCHHHHHCCCCCCHHHHHHHCCCCCHHHHHHHHHCCCCEEEEECHHHHHHHHHHHHCC-- T ss_conf 288998748874779998240137651689976787766426552999999873577179995636789999997426-- Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCC-EEEECCCCCCCCCCHHHC Q ss_conf 5666788999999999874228999987533101211344321012333343678810065-057604776682101210 Q gi|254780664|r 176 KIPYNFRKKMARQAFSRTASYDTAICRWLANAESENFPDYLNITAVKKQEMRYGENPHQKA-ALYSTPEKKSGIAHAVLV 254 (536) Q Consensus 176 ~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~~~~~~~~p~~~~~~~~~~~~LRYGENPHQ~A-a~Y~~~~~~~~~~~~~ql 254 (536) .++...|.+.|.|||+|||.||++|+.||.++..+. ....+||||+||||++ .+|... .++ .+..| T Consensus 162 ~~~~d~~~r~aLKaf~Hta~yD~Aisd~frKQy~~g---------vsq~slRYg~npHQ~paql~~~q---~~l-p~~vl 228 (588) T KOG2555 162 EISQDLRNRRALKAFEHTASYDAAISDYFRKQYSEG---------VSQLSLRYGMNPHQKPAQLYVVQ---GKL-PFKVL 228 (588) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHCCHHCCCCCCCCHHHHHHHC---CCC-CEEEE T ss_conf 522237889999888877657789999999865314---------43233310789666745576645---787-44786 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHCCC-CCCCEEEEECCCCCCCCCCCC-------------HHHH------HHHHHHHHCC Q ss_conf 5898883122212001167764245-567279996156865654520-------------1689------9999985332 Q gi|254780664|r 255 QGKPLSYNNINDLDAAFELVSEFRS-QDCAACVIVKHMNPCGVATAD-------------TLVE------AYRRALSCDP 314 (536) Q Consensus 255 ~GKeLSYNNllD~daA~~lv~ef~~-~~~Pa~vIvKH~NPCGvA~~~-------------~~~~------Ay~~A~~~DP 314 (536) +| +++|+||+|++++|.||+||++ ...|+|+++||++|||+|+|- ++.+ ||.+|+.+|| T Consensus 229 ~g-spgyiNl~DAln~w~lvkel~~~l~~paaasfKHvsPaGaAvg~pls~~e~kv~~v~Di~eltpla~AYarArgADr 307 (588) T KOG2555 229 CG-SPGYINLLDALNSWPLVKELSENLGLPAAASFKHVSPAGAAVGLPLSDVEAKVYFVDDIYELTPLACAYARARGADR 307 (588) T ss_pred CC-CCCHHHHHHHHCCHHHHHHHHHHCCCCCCCEECCCCCCCCCCCCCCCHHHHHEEEHHHHHHCCHHHHHHHHHHCCCC T ss_conf 18-98534577664240454677761388600001023764110267665455320013435534668889998746774 Q ss_pred CCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCHHHHHCCCC Q ss_conf 13200455542754214578788654311112687989998851245531563167667776664020321022310123 Q gi|254780664|r 315 ISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSGGILVQTRD 394 (536) Q Consensus 315 ~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D 394 (536) +|+||++||+|..||..||+.|++...+.||||+|++|||+||++|||..+.++.+.++|.+...|.|+++|++|.|.+| T Consensus 308 mSsFGdfvAls~~vDv~tAriIsrevsDGviApgYepeaLeiL~KkK~g~yciLq~dpny~p~~~e~rqv~G~~L~Qkrn 387 (588) T KOG2555 308 MSSFGDFVALSDVVDVVTARIISREVSDGVIAPGYEPEALEILSKKKNGKYCILQMDPNYVPGEVERRQVFGLTLQQKRN 387 (588) T ss_pred CCCCCCEEEEEEEHHHHHHHHHHCCCCCCEECCCCCHHHHHHHHCCCCCCEEEEEECCCCCCCCCEEEEEEEEEEEEECC T ss_conf 12446747752001145676750156676435887878889974256885699970777674500133450277343147 Q ss_pred CC-CCCCCC-CCCCCCCC-CHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCC- Q ss_conf 34-531000-12467778-688988899999986005667799996893899729688889999999999876420012- Q gi|254780664|r 395 NV-VDNKEL-TVVTKRSP-TDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQAD- 470 (536) Q Consensus 395 ~~-~~~~~~-~vVT~~~p-t~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~- 470 (536) +. ++..-| ++|+++++ ++..+.||.|||.++||+|||+|+++|||++||+|+||||||.|+|+|..||.+||+..+ T Consensus 388 da~i~~s~F~~vvsknk~l~e~a~~Dl~vAt~a~KytqSNsvcyAkng~vvGlGaGQqsRihctRlAgdkadnwwlr~hp 467 (588) T KOG2555 388 DAVIDQSLFSNVVSKNKDLPESALIDLIVATIALKYTQSNSVCYAKNGMVVGLGAGQQSRIHCTRLAGDKADNWWLRQHP 467 (588) T ss_pred CCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEECCEEEEECCCCCCCEEEEECCCCCCCCHHHHCCC T ss_conf 67510677877762389961677777789877665503461799708879961588755022210125543226665283 Q ss_pred -------------------------------------------------------CCCCCCCEEEEECCCCCCCCHHHHH Q ss_conf -------------------------------------------------------2223487099973668685558999 Q gi|254780664|r 471 -------------------------------------------------------VKSMTNGSVIASEAFYPFPDGIVEA 495 (536) Q Consensus 471 -------------------------------------------------------~~~~~~g~vlaSDAFFPF~D~ie~a 495 (536) |-++++|.+++|||||||||+|+.| T Consensus 468 ~vl~mkfkkgvkr~e~snaid~~v~g~~~e~~~~~~~~~~fe~~p~~~t~~e~~ewl~~l~~v~l~SDAFFPF~Dnv~ra 547 (588) T KOG2555 468 KVLGMKFKKGVKRAEKSNAIDLYVTGTVGEDPELSQWESKFEEVPEPLTKEERKEWLEKLKGVSLSSDAFFPFPDNVYRA 547 (588) T ss_pred HHHHHHHHHCCCCHHHCCCHHHHCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCEECCCCCCCCCHHHHHH T ss_conf 45413565314614324633431035324684056666655416653476888999997118442245436671579999 Q ss_pred HHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEECCCCCCCC Q ss_conf 97199599938987798899999987597799747854479 Q gi|254780664|r 496 IKAGVTAVIQPGGSVRDSEAITVADQHGIAMVFTGIRHFRH 536 (536) Q Consensus 496 a~~Gi~aIiqPGGSirD~evI~aan~~gi~m~fTg~RhF~H 536 (536) .+.||++|..||||++|+.||++|+++||.|.+|..|||+| T Consensus 548 ~qsGv~yiaaP~GSv~D~~v~~a~d~~~iv~~~t~lRlFhH 588 (588) T KOG2555 548 VQSGVKYIAAPSGSVMDKVVIDACDEFGIVLAETNLRLFHH 588 (588) T ss_pred HHCCCEEEECCCCCCHHHHHHHHHHHHCEEEEECCHHHHCC T ss_conf 86387288568875015799988876296888342442059 No 4 >PRK07106 5-aminoimidazole-4-carboxamide ribonucleotide transformylase; Provisional Probab=100.00 E-value=0 Score=924.08 Aligned_cols=311 Identities=35% Similarity=0.532 Sum_probs=276.5 Q ss_pred CCCCCCCCCCCCCCCEEEECCCCCCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHCCC-CCCCEEEEECCCCCCCCCCCC Q ss_conf 3333436788100650576047766821012105898883122212001167764245-567279996156865654520 Q gi|254780664|r 222 KKQEMRYGENPHQKAALYSTPEKKSGIAHAVLVQGKPLSYNNINDLDAAFELVSEFRS-QDCAACVIVKHMNPCGVATAD 300 (536) Q Consensus 222 ~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~GKeLSYNNllD~daA~~lv~ef~~-~~~Pa~vIvKH~NPCGvA~~~ 300 (536) ...+|||||||||+|++|.... +...++|||||+ |||||+|+|+||+||+||.+ .+.||||||||+||||||++. T Consensus 3 ~~~~LRYGENPHQ~aA~~~~~~---~~~~~~~l~Gk~-SYNN~lD~daA~~lv~ef~~~~~~Pa~viiKH~NPCGvA~~~ 78 (391) T PRK07106 3 NELELKYGCNPNQKPARIFIKE---GELPIEVLNGRP-GYINLLDALNSWQLVKELKEATGLPAAASFKHVSPAGAAVAV 78 (391) T ss_pred CCCCCCCCCCCCCCCCCEECCC---CCCCCEEECCCC-CHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHCCC T ss_conf 7787558888475320003468---866856666820-077899999999999975651599749999378767021066 Q ss_pred H------------------HHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCC Q ss_conf 1------------------6899999985332132004555427542145787886543111126879899988512455 Q gi|254780664|r 301 T------------------LVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPS 362 (536) Q Consensus 301 ~------------------~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKn 362 (536) + +.+||.+|++|||+||||||||||++||.++|++|.+.|+||||||+|++||||||++||| T Consensus 79 ~l~d~~~~~~~~~~~~~~~~~~Ay~~A~~~Dp~SAFGGIVa~N~~vd~~~A~~I~~~F~EvIIAP~f~~eAleiL~~KKn 158 (391) T PRK07106 79 PLSDTLKKIYFVDDGELSPLATAYARARGADRMSSYGDFIALSDVCDEATARILNREVSDGIIAPGYTPEALEILKNKRK 158 (391) T ss_pred CCHHHHHHHHCCCCCCCCHHHHHHHHHHCCCCCCCCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCC T ss_conf 41134543201244345539999999970798544687899847338999999998657641057889899999975112 Q ss_pred CEEEECCCCCCCCCCCCEECCCCHHHHHCCCCCC-CCCCCCC-CC-CCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECC Q ss_conf 3156316766777666402032102231012334-5310001-24-6777868898889999998600566779999689 Q gi|254780664|r 363 MRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDNV-VDNKELT-VV-TKRSPTDQELRDMKFAFKVVKHVKSNAVVYAKDG 439 (536) Q Consensus 363 lRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~~-~~~~~~~-vV-T~~~pt~~e~~dL~FA~kv~K~vkSNAIv~ak~~ 439 (536) +|++.+.....+.+..+++|++.||+|+|++|.. ++...|+ +| ++++|++++++||+|||+|||||||||||++||+ T Consensus 159 ~~l~il~~~~~~~~~~~~~r~i~Gg~L~Q~~d~~~~~~~~~~~~v~~~k~~~~e~~~dL~FAwkv~K~vKSNAIVlakn~ 238 (391) T PRK07106 159 GTYNIIKIDPAYRPAPIEHKEVFGITFEQGRNETKIDESLLKNIPTKNKILPDEAKRDLIIALITLKYTQSNSVCYAKDG 238 (391) T ss_pred CCEEEEECCCCCCCCCCEEEEEECEEEEECCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 56688754765677872599981607997578543686660360014899998999999999999976456639999689 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHHCCC------------------------------------------------- Q ss_conf 3899729688889999999999876420012------------------------------------------------- Q gi|254780664|r 440 RTVGIGSGQTSRVDSTRFAAIKAHNISAQAD------------------------------------------------- 470 (536) Q Consensus 440 ~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~------------------------------------------------- 470 (536) +|||||+||||||||++||++||+.|+++.+ T Consensus 239 ~tiGIGaGQ~SRVDsv~iA~~KA~~~~~r~~p~~~~l~~~~~~~~~e~~n~~~~~~~~d~~~~~~~~~~~~~f~~~~~~l 318 (391) T PRK07106 239 QAIGIGAGQQSRIHCTRLAGNKADIWYLRQHPKVLNLPFKEGIRRPDRDNTIDVYISDDYMDVLADGVWEQFFTRKPEPL 318 (391) T ss_pred CEEEECCCCCCHHHHHHHHHHHHHHHHHHHCHHHHCCHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCC T ss_conf 28986788646899999999999998764265652430332000133334788874011001244556665420365300 Q ss_pred -------CCCCCCCEEEEECCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEECCCCCCCC Q ss_conf -------222348709997366868555899997199599938987798899999987597799747854479 Q gi|254780664|r 471 -------VKSMTNGSVIASEAFYPFPDGIVEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAMVFTGIRHFRH 536 (536) Q Consensus 471 -------~~~~~~g~vlaSDAFFPF~D~ie~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m~fTg~RhF~H 536 (536) +...++|+||||||||||+||||.|+++||++||||||||||+|||++|||+||+|+|||.||||| T Consensus 319 ~~~e~~~~~~~~~g~vlASDAFFPF~D~Ie~aa~~GV~aIiQPGGSirD~eVI~aane~gi~MvfTG~RhF~H 391 (391) T PRK07106 319 TREEKRAWLDTLKGVALGSDAFFPFGDNIERAHKSGVSYIAQPGGSIRDDHVIETCDKYNITMAFTGIRLFHH 391 (391) T ss_pred HHHHHHHHHCCCCCCEEEECCCCCCCHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHCCEEEECCCCCCCC T ss_conf 0777776650258808991157887568999998599499879974150999999998698899868766188 No 5 >pfam01808 AICARFT_IMPCHas AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase EC:2.1.2.3 (AICARFT), this enzyme catalyses the formylation of AICAR with 10-formyl-tetrahydrofolate to yield FAICAR and tetrahydrofolate. The last step is catalysed by IMP (Inosine monophosphate) cyclohydrolase EC:3.5.4.10 (IMPCHase), cyclizing FAICAR (5-formylaminoimidazole-4-carboxamide ribonucleotide) to IMP. Probab=100.00 E-value=0 Score=906.13 Aligned_cols=312 Identities=52% Similarity=0.794 Sum_probs=294.2 Q ss_pred HHCCCCCCEEEECHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCHHHHHHHCCCCCCC Q ss_conf 740156313530899989889875302345666788999999999874228999987533101-2113443210123333 Q gi|254780664|r 147 AAKNHDYVTILTNPQDYPLFLAEMDVNNGKIPYNFRKKMARQAFSRTASYDTAICRWLANAES-ENFPDYLNITAVKKQE 225 (536) Q Consensus 147 AAKN~~~V~Vi~dp~dY~~~~~el~~~~g~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~~~~~-~~~p~~~~~~~~~~~~ 225 (536) |||||++|+|||||+||+.++++|+. +|+++++||++||.|||+|||+||++|++||+++.+ +.+|+.+.+++.+.++ T Consensus 1 AAKN~~~V~Vv~dp~dY~~vi~el~~-~g~~s~~~R~~lA~kAF~~Ta~YD~~Is~yl~~~~~~~~~~~~~~~~~~k~~~ 79 (315) T pfam01808 1 AAKNHKRVTIVVDPADYSAVLAELKK-EGSLTLQTRRRLALKAFEHTAAYDAAIADYFRKQYSPKHPERTFTLPGVRKQS 79 (315) T ss_pred CCCCCCCEEEECCHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHEEECCCCCCC T ss_conf 97789986999998999999999986-69999999999999999999999999999986442665761331005522357 Q ss_pred CCCCCCCCCCCEEEECCC-CCCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHH Q ss_conf 436788100650576047-7668210121058988831222120011677642455672799961568656545201689 Q gi|254780664|r 226 MRYGENPHQKAALYSTPE-KKSGIAHAVLVQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVE 304 (536) Q Consensus 226 LRYGENPHQ~Aa~Y~~~~-~~~~~~~~~ql~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~ 304 (536) |||||||||+|++|.+.. ...++.+++||||||||||||+|+|+||+||+||.+ .|+||||||+||||||+++++.+ T Consensus 80 LRYGENPHQ~Aa~Y~~~~~~~~~~~~~~qL~GKeLSYNNilD~daA~~lv~ef~~--~Pa~vIiKH~NPCGvA~~~~~~~ 157 (315) T pfam01808 80 LRYGENPHQQAAFYVDPTQKEGSIAPAEQLHGKEMSYNNLLDADAAWRLVKEFEN--QPAAAIVKHANPCGVAVGPSIAE 157 (315) T ss_pred CCCCCCCCCCCCEECCCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCC--CCEEEEEECCCCCEECCCCCHHH T ss_conf 7478880451258456687766641236426887884537657899724753589--97599994388862127989999 Q ss_pred HHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCC Q ss_conf 99999853321320045554275421457878865431111268798999885124553156316766777666402032 Q gi|254780664|r 305 AYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTV 384 (536) Q Consensus 305 Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi 384 (536) ||++||+|||+||||||||||++||.++|++|.+.|+||||||+|++|||+||++|||+|+|++.....+ .+.+++|+| T Consensus 158 Ay~~A~~~Dp~SAFGGIva~N~~vd~~~A~~i~~~F~EvIiAP~f~~eAl~iL~~KknlRvl~~~~~~~~-~~~~~~rsv 236 (315) T pfam01808 158 AYARAREADPMSAFGGIIALNDEVDVETAKEISEEFSEVIIAPGYEPEALEVLSRKKNLRVLVLQEDPAY-VPELELRQV 236 (315) T ss_pred HHHHHHHCCCCCCCCCEEEECCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCEEEEECCCCCCC-CCCEEEEEE T ss_conf 9999986087344586898657000889999998667642067899899999974887179963677666-786169996 Q ss_pred CHHHHHCCCCCC-CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 102231012334-531000124677786889888999999860056677999968938997296888899999999998 Q gi|254780664|r 385 SGGILVQTRDNV-VDNKELTVVTKRSPTDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKA 462 (536) Q Consensus 385 ~GG~LvQ~~D~~-~~~~~~~vVT~~~pt~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA 462 (536) .||+|+|++|.. ++..+|++||+++|+++||+||+|||+|||||||||||++||++|||||+||||||||++||++|| T Consensus 237 ~GG~LvQ~~d~~~~~~~~~kvVT~~~pt~~e~~dL~FAwkv~k~vKSNAIv~ak~~~tvGiGaGQ~sRVds~~iA~~KA 315 (315) T pfam01808 237 SGGLLIQDRDDGGVDDSEFSVVTKRKPTEQELEDLLFAWKVVKHVKSNAIVYAKNGMTVGIGAGQQSRVHSARLAGDKA 315 (315) T ss_pred ECCEEEECCCCCCCCHHHCEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHC T ss_conf 0667887676676888885351688999999999999999997778567999939938986899763999999999629 No 6 >TIGR00355 purH bifunctional purine biosynthesis protein PurH; InterPro: IPR002695 This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase 2.1.2.3 from EC (AICARFT), this enzyme catalyses the formylation of AICAR with 10-formyl-tetrahydrofolate to yield FAICAR and tetrahydrofolate . The last step is catalysed by IMP (Inosine monophosphate) cyclohydrolase 3.5.4.10 from EC (IMPCHase), cyclizing FAICAR (5-formylaminoimidazole-4-carboxamide ribonucleotide) to IMP .; GO: 0003937 IMP cyclohydrolase activity, 0004643 phosphoribosylaminoimidazolecarboxamide formyltransferase activity, 0006164 purine nucleotide biosynthetic process. Probab=100.00 E-value=0 Score=896.04 Aligned_cols=520 Identities=53% Similarity=0.835 Sum_probs=495.8 Q ss_pred EEEEEEEECCCCHHHHHHHHH-HCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCH Q ss_conf 698998754568699999999-8799999942768999988995179465348816519855323715553553308898 Q gi|254780664|r 17 KTALISVHNKTGVVEFASRLL-SRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNP 95 (536) Q Consensus 17 krALiSV~dKtgl~~la~~L~-~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~ 95 (536) ++||+||+||+|+++|++.|. +.|+++++||||++.|+++|+|+++|+++|||||+++||||||||++|||||++|+.+ T Consensus 1 ~~~~~~~~d~~g~~~~~~~~~~~~~~~~~~~gg~~~~~~~~g~p~~~~~~~~g~p~~~~g~~~~~~p~~~gg~~~~~~~~ 80 (569) T TIGR00355 1 KRALLSVSDKTGLVELAQGLVNEKGVELLSTGGTAKLLAEAGLPVTEVSDYTGFPELLGGRVKTLHPKVHGGILARRGDD 80 (569) T ss_pred CCCEEECCCCCCHHHHHHHHHHHHCCEEEECCCHHHHHHHCCCCCCHHHHHCCCCHHHCCCHHCCCCCCCCCEEECCCCC T ss_conf 94123112211137888876543131333156424667641786000011116502315620000410104311015762 Q ss_pred HHHHHHHHCCCCCEEEEEEECCCHHHHHCCCC-CHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHCCC Q ss_conf 99999997799841499994888899731765-31123200233434699987401563135308999898898753023 Q gi|254780664|r 96 AHMKFMQDHELESIDLVVVNLYPFEESFCRED-DYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMDVNN 174 (536) Q Consensus 96 ~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~-~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~~~~ 174 (536) +|..+++++++.++|+|+||||||++++.++. +..+.+|+||||||+|+|+++|||.+++++++|.||..++.++..++ T Consensus 81 ~~~~~~~~~~~~~~d~~~~~~~p~~~~~~~~~~~~~~~~~~~d~ggp~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 160 (569) T TIGR00355 81 EDDADLEEHGIEPLDLVVVNLYPFKETVAKPGVTLPEAVENIDIGGPTLLRAAAKNHADVTVLVDPKDYSAFLSELDENE 160 (569) T ss_pred HHHHHHHHHCCCCCCEEEEECCCHHHHHHCCCCCHHHHHHHCCCCCHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHCC T ss_conf 34567765057632167762120034430156623454542036741344433203241368745402578998764213 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC----------CCHHHHHHHCCCCCCCCCCCCCCCCC-CEEEECCC Q ss_conf 45666788999999999874228999987533101----------21134432101233334367881006-50576047 Q gi|254780664|r 175 GKIPYNFRKKMARQAFSRTASYDTAICRWLANAES----------ENFPDYLNITAVKKQEMRYGENPHQK-AALYSTPE 243 (536) Q Consensus 175 g~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~~~~~----------~~~p~~~~~~~~~~~~LRYGENPHQ~-Aa~Y~~~~ 243 (536) |.+++..|+.++.++|.||+.||.+|++||.+... ..||..+.+.+.+.+.|||||||||+ +.+|.... T Consensus 161 g~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~g~~~~~~p~~~~~~~~ 240 (569) T TIGR00355 161 GELSLALRFDLALKAFEHTAAYDAAIADYFGGLVGPYHGEEKEKCGRFPRTFGLGFTKKQPLRYGENPHQKPAGFYVTGN 240 (569) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHCCCHHHHCCCHHHHHHHHCCCCCCCCHHHHHHHHH T ss_conf 53025677777777776544677889987642033201200244201123412220112122116775433034443210 Q ss_pred CCCC-CCCHHHCCCCCCCCCCCCCCCHHHHHHHHCC-CCCCCEEEEECCCCCCCCCCCCHHHH----------------- Q ss_conf 7668-2101210589888312221200116776424-55672799961568656545201689----------------- Q gi|254780664|r 244 KKSG-IAHAVLVQGKPLSYNNINDLDAAFELVSEFR-SQDCAACVIVKHMNPCGVATADTLVE----------------- 304 (536) Q Consensus 244 ~~~~-~~~~~ql~GKeLSYNNllD~daA~~lv~ef~-~~~~Pa~vIvKH~NPCGvA~~~~~~~----------------- 304 (536) ...+ ++.+++++||+++|||+.|++++|.++.+|. ....|+|+++||.||||++++.++.+ T Consensus 241 ~~~~~~~~~~~~~g~~~~~~~~~d~~~~~~~~~~~~~~~~~p~~~~~~~~~p~g~~~g~~~~~~~~~~~~~~~~~~~~p~ 320 (569) T TIGR00355 241 DKEGSVATAEQLQGKELSYNNLLDADAALPLVKEFDASLNEPAAVIVKHANPCGVALGKTLLDVEKQVYFVADIENLSPL 320 (569) T ss_pred HCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCCCCCCCCHHHHHHHHEEEEECCCCCCHH T ss_conf 00001344444304443201234446678888765431034202110014521100042134432220011000011035 Q ss_pred --HHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHH-HHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEE Q ss_conf --9999985332132004555427542145787886-5431111268798999885124553156316766777666402 Q gi|254780664|r 305 --AYRRALSCDPISAFGGIIAFNREVDQEVAKEVIK-VFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVL 381 (536) Q Consensus 305 --Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~-~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~ 381 (536) +|.+|+.+||.|+|||++++|+++|..+|+.+.+ .|.|+++||++++++++++++|+|+|++.+..+..+.+...++ T Consensus 321 ~c~~~~~~~~d~~~~~gg~~~~~~~~d~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 400 (569) T TIGR00355 321 ACAYDRAFGADPTSAFGGIIALNRELDGPTAKALVKREFLEGIIAPGYSPEALEILAKKKNLRLLTLGDWANGVPELDEL 400 (569) T ss_pred HHHHHHHHCCCCCHHHCCHHHHCCHHHHHHHHHHHHHHHHHHEECCCCCHHHHHHHHHHCCCEEEEECCCCCCCCHHHHH T ss_conf 67666540344201101111101110035788877544432101267656789988641374178833532223014556 Q ss_pred CCCCHHHHHCCCCC-CCCCCCCC-CCCC-CCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHH Q ss_conf 03210223101233-45310001-2467-778688988899999986005667799996893899729688889999999 Q gi|254780664|r 382 KTVSGGILVQTRDN-VVDNKELT-VVTK-RSPTDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFA 458 (536) Q Consensus 382 rsi~GG~LvQ~~D~-~~~~~~~~-vVT~-~~pt~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA 458 (536) +.+.||+|+|+.|. ..+..+|+ ++++ ++|+++++.|+.|+|+++||++||+|++++++.++|+|+||++|+.++++| T Consensus 401 ~~~~gg~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~w~~~~~~~~~~~~~~~~~~~~g~g~g~~~~~~~~~~~ 480 (569) T TIGR00355 401 KRVNGGLLVQDRDDGGVDPSTLKEVVTKNKQPTEQELADLLFAWKVAKHVKSNAIVYAKNNLTVGLGAGQLSRVGSAKLA 480 (569) T ss_pred HHHHCCCEEECCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEECCCEEEEECCCCCCHHHHHHHH T ss_conf 55422110111210114645677765225673367888888888887542024047640660553045410001122332 Q ss_pred HHHHHHHHHCCCCC---------CCCCCEEEEECCCCCCCCHHHHHHHCCCCEEEE--CCCCCCHHHHHHHHHHCCCEEE Q ss_conf 99987642001222---------234870999736686855589999719959993--8987798899999987597799 Q gi|254780664|r 459 AIKAHNISAQADVK---------SMTNGSVIASEAFYPFPDGIVEAIKAGVTAVIQ--PGGSVRDSEAITVADQHGIAMV 527 (536) Q Consensus 459 ~~KA~~~~~~~~~~---------~~~~g~vlaSDAFFPF~D~ie~aa~~Gi~aIiq--PGGSirD~evI~aan~~gi~m~ 527 (536) +.++.+||+..++. ....|+.+++|+||||+|+++.++..|+++++| |||++||++.+.+++++++.|+ T Consensus 481 ~~~~~~ww~~~~p~~~~~~w~~g~~~~g~~~~~d~~~p~~~~~~~~~~~g~~~~~~~~pgg~~~~~~~~~~~~~~~~~~~ 560 (569) T TIGR00355 481 GDKADNWWFRQHPRVLEIKWDEGLEAPGSSLASDAFFPFPDGVEEAAAAGVTCVIQEIPGGSLRDEDSIWAADEHGLALV 560 (569) T ss_pred HHHHHCCHHHCCCCEEEEECCCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCEEEE T ss_conf 11110002112772256421224321100110002003554400244431232222237887640356655544040001 Q ss_pred ECCCCCCCC Q ss_conf 747854479 Q gi|254780664|r 528 FTGIRHFRH 536 (536) Q Consensus 528 fTg~RhF~H 536 (536) |||.|||+| T Consensus 561 ~~~~~~~~~ 569 (569) T TIGR00355 561 FTGLRHFPH 569 (569) T ss_pred ECCHHCCCC T ss_conf 001010379 No 7 >smart00798 AICARFT_IMPCHas AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase (AICARFT), this enzyme catalyses the formylation of AICAR with 10-formyl-tetrahydrofolate to yield FAICAR and tetrahydrofolate. The last step is catalysed by IMP (Inosine monophosphate) cyclohydrolase (IMPCHase), cyclizing FAICAR (5-formylaminoimidazole-4-carboxamide ribonucleotide) to IMP. Probab=100.00 E-value=0 Score=894.65 Aligned_cols=310 Identities=58% Similarity=0.905 Sum_probs=298.4 Q ss_pred HHCCCCCCEEEECHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCC Q ss_conf 74015631353089998988987530234566678899999999987422899998753310121134432101233334 Q gi|254780664|r 147 AAKNHDYVTILTNPQDYPLFLAEMDVNNGKIPYNFRKKMARQAFSRTASYDTAICRWLANAESENFPDYLNITAVKKQEM 226 (536) Q Consensus 147 AAKN~~~V~Vi~dp~dY~~~~~el~~~~g~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~~~~~~~~p~~~~~~~~~~~~L 226 (536) |||||++|+|||||+||+.++++|++ +|++|++||++||.|||+|||.||++|++||++...+.+|+.+.+++++.++| T Consensus 1 AAKN~~~V~Vi~dp~dY~~~~~el~~-~g~~s~~~R~~lA~kAF~~Ta~YD~~I~~yl~~~~~~~~p~~~~~~~~~~~~L 79 (311) T smart00798 1 AAKNHKDVTVVVDPADYAEVLEELKA-GGGLSLETRKRLAAKAFAHTAAYDAAISNYLAKQLASEFPETLTLSFEKKQDL 79 (311) T ss_pred CCCCCCCEEEECCHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHEEECCCCCCCC T ss_conf 97889987999998999999999996-69989999999999999999999999999983205555870067734303477 Q ss_pred CCCCCCCCCCEEEECCCCCCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHH Q ss_conf 36788100650576047766821012105898883122212001167764245567279996156865654520168999 Q gi|254780664|r 227 RYGENPHQKAALYSTPEKKSGIAHAVLVQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAY 306 (536) Q Consensus 227 RYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay 306 (536) ||||||||+|++|.+.....++..++||||||||||||||+|+||+||.||.+ |+||||||+||||||+++++.+|| T Consensus 80 RYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~GKeLSYNN~lD~daA~~lv~ef~~---pa~~IiKH~nPCGvA~~~~~~~A~ 156 (311) T smart00798 80 RYGENPHQKAAFYTDPDALGGIATAKQLQGKELSYNNILDADAALELVKEFDE---PACVIVKHANPCGVAVGDTLAEAY 156 (311) T ss_pred CCCCCCCHHHHHHCCCCCCCCCCCCEEECCCCCCCCCHHHHHHHHHHHHCCCC---CEEEEEECCCCCCCCCCCHHHHHH T ss_conf 57888221314305688666654435605887785567768999999862688---748998248876412450699999 Q ss_pred HHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCH Q ss_conf 99985332132004555427542145787886543111126879899988512455315631676677766640203210 Q gi|254780664|r 307 RRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSG 386 (536) Q Consensus 307 ~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~G 386 (536) ++|++|||+||||||||+|++||.++|++|.+.|+||||||+|++|||++|++|||+|||++.... ..+.+++|++.| T Consensus 157 ~~A~~~Dp~SAFGGIva~N~~vd~~~A~~i~~~F~EvIiAP~f~~eAl~iL~~KknlRil~~~~~~--~~~~~~~rsi~g 234 (311) T smart00798 157 RKAYAADPVSAFGGIIAFNRPVDEETAEAINKIFLEVIIAPDFDEEALEILSKKKNLRLLELGPLP--DPDGLEFKSVSG 234 (311) T ss_pred HHHHHCCCCCCCCCEEEECCEECHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCEEEEECCCCC--CCCCEEEEEEEC T ss_conf 998702985445877886462179999999999987711778998999999748865999668877--788714899957 Q ss_pred HHHHCCCCCC-CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 2231012334-531000124677786889888999999860056677999968938997296888899999999998 Q gi|254780664|r 387 GILVQTRDNV-VDNKELTVVTKRSPTDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKA 462 (536) Q Consensus 387 G~LvQ~~D~~-~~~~~~~vVT~~~pt~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA 462 (536) |+|+|++|.. ++..+|++||+++|++++|+||+|||+|||||||||||++||++|||||+||||||||++||++|| T Consensus 235 G~L~Q~~d~~~~~~~~~~~Vt~~~p~~~~~~dl~Fa~~v~k~vkSNaIv~ak~~~tvGiGaGQ~sRvda~~~A~~kA 311 (311) T smart00798 235 GLLVQDRDNGGIDPEDLKVVTKRQPTEEELKDLLFAWKVVKHVKSNAIVYAKDGQTVGIGAGQMSRVDSARIAAEKA 311 (311) T ss_pred EEEEECCCCCCCCHHHCEEECCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHC T ss_conf 19997267788998997880799989999999999999997677857999958907986898555999999999609 No 8 >cd01421 IMPCH Inosine monophosphate cyclohydrolase domain. This is the N-terminal domain in the purine biosynthesis pathway protein ATIC (purH). The bifunctional ATIC protein contains a C-terminal ATIC formylase domain that formylates 5-aminoimidazole-4-carboxamide-ribonucleotide. The IMPCH domain then converts the formyl-5-aminoimidazole-4-carboxamide-ribonucleotide to inosine monophosphate. This is the final step in de novo purine production. Probab=100.00 E-value=0 Score=606.62 Aligned_cols=186 Identities=58% Similarity=0.914 Sum_probs=181.9 Q ss_pred EEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHH Q ss_conf 69899875456869999999987999999427689999889951794653488165198553237155535533088989 Q gi|254780664|r 17 KTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNPA 96 (536) Q Consensus 17 krALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~ 96 (536) |||||||||||||++||+.|.++||+|+|||||+++|+++|++|++||++||||||||||||||||+||||||+||++++ T Consensus 1 krALISVsdKt~l~~la~~L~~~g~~IisTgGT~~~L~~~gi~v~~Vs~~TgfpEil~GRVKTLHPkI~gGIL~~r~~~~ 80 (187) T cd01421 1 KRALISVSDKTGLVEFAKELVELGVEILSTGGTAKFLKEAGIPVTDVSDITGFPEILGGRVKTLHPKIHGGILARRDNEE 80 (187) T ss_pred CEEEEEEECCCCHHHHHHHHHHCCCEEEECHHHHHHHHHCCCCEEEHHHHCCCHHHHCCCCCCCCCHHHHHHCCCCCCHH T ss_conf 90799864532779999999988988997808999998668860669873197377589765357055440215666635 Q ss_pred HHHHHHHCCCCCEEEEEEECCCHHHHHCCCC-CHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHCCCC Q ss_conf 9999997799841499994888899731765-311232002334346999874015631353089998988987530234 Q gi|254780664|r 97 HMKFMQDHELESIDLVVVNLYPFEESFCRED-DYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMDVNNG 175 (536) Q Consensus 97 ~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~-~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~~~~g 175 (536) |+ ++++++|++|||||||||||++++++++ +++++|||||||||||||||||||++|+|+|||+||+.++++|+.+ | T Consensus 81 h~-~~~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~~IEnIDIGGpsliRAAAKN~~~V~v~~dp~dY~~~i~~l~~~-g 158 (187) T cd01421 81 HK-DLEEHGIEPIDLVVVNLYPFEETVAKGNVTLEEAIENIDIGGPSLLRAAAKNYKDVTVLVDPADYQKVLEELKSN-G 158 (187) T ss_pred HH-HHHHCCCCCCCEEEEECCCHHHHHCCCCCCHHHHHHHCCCCCHHHHHHHHHCCCCEEEECCHHHHHHHHHHHHHC-C T ss_conf 89-999869997027998465777775558999899998446774899999982688128767999999999999976-9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 56667889999999998742289999875 Q gi|254780664|r 176 KIPYNFRKKMARQAFSRTASYDTAICRWL 204 (536) Q Consensus 176 ~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l 204 (536) ++++++|++||.|||.+||+||++|++|| T Consensus 159 ~~~~~~R~~lA~kAF~~ta~YD~~I~~~L 187 (187) T cd01421 159 SISEETRRRLALKAFAHTAEYDAAISNYL 187 (187) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99999999999999999999999999639 No 9 >pfam02142 MGS MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also includes inosicase. The known structures in this family show a common phosphate binding site. Probab=99.52 E-value=2.5e-14 Score=124.70 Aligned_cols=92 Identities=39% Similarity=0.599 Sum_probs=78.9 Q ss_pred CHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCC Q ss_conf 86999999998799999942768999988995179465348816519855323715553553308898999999977998 Q gi|254780664|r 28 GVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNPAHMKFMQDHELE 107 (536) Q Consensus 28 gl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~~~i~ 107 (536) ||+++|+.|.++||+|+||+||+++|+++||++ ++.+.+++| +++||+ +....+++. T Consensus 1 ~iv~~a~~L~~lGf~i~AT~GTa~~L~~~Gi~~-~~v~k~~~~-~~~~~~------------------~i~d~i~~~--- 57 (92) T pfam02142 1 GLVELAKALVELGFKLYATGGTAKFLKEAGIPT-EVVNKTGEG-RPGGRV------------------QIGDLIKNG--- 57 (92) T ss_pred CHHHHHHHHHHCCCEEEECHHHHHHHHHCCCCC-EEEEECCCC-CCCCCH------------------HHHHHHHCC--- T ss_conf 968999999988999998838999999559981-686340689-999847------------------699999829--- Q ss_pred CEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHH Q ss_conf 41499994888899731765311232002334346 Q gi|254780664|r 108 SIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPS 142 (536) Q Consensus 108 ~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGps 142 (536) .|||||+++|||++++.++.....+.+++||+++| T Consensus 58 ~IdlVInt~~~~~~~~~dg~~iRr~a~~~~Ip~~T 92 (92) T pfam02142 58 EIDLVINTLYPFKATVHDGYAIRRAAENIDIPLPT 92 (92) T ss_pred CCCEEEECCCCCCEECCCCHHHHHHHHHCCCCCCC T ss_conf 95199983898851125887899999926998549 No 10 >smart00851 MGS MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase and the domain is also found in Carbamoyl phosphate synthetase (CPS) where it forms a regulatory domain that binds to the allosteric effector ornithine. This family also includes inosicase. The known structures in this family show a common phosphate binding site PUBMED:10526357. Probab=99.44 E-value=2.7e-13 Score=117.24 Aligned_cols=89 Identities=44% Similarity=0.723 Sum_probs=75.6 Q ss_pred CHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCC Q ss_conf 86999999998799999942768999988995179465348816519855323715553553308898999999977998 Q gi|254780664|r 28 GVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNPAHMKFMQDHELE 107 (536) Q Consensus 28 gl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~~~i~ 107 (536) +++++|+.|.++||+|+||+||+++|+++||+| |||||++++|++ + .++..... T Consensus 1 ~~~~~ak~l~~lGf~l~AT~GTa~~L~~~Gi~v-----------------~~l~~ki~~g~~-------~--i~~~i~~~ 54 (90) T smart00851 1 GLVELAKRLAELGFELVATGGTAKFLREAGLPV-----------------KTLHPKVHGGIL-------A--ILDLIKNG 54 (90) T ss_pred CHHHHHHHHHHCCCEEEECCHHHHHHHHCCCCC-----------------EEEECCCCCCCC-------C--HHHHHCCC T ss_conf 968999999988998998125899999759971-----------------043024435610-------4--89981978 Q ss_pred CEEEEEEECCCHHHHHCC-CCCHHHHHHCCCCCCHH Q ss_conf 414999948888997317-65311232002334346 Q gi|254780664|r 108 SIDLVVVNLYPFEESFCR-EDDYYTMVENIDIGGPS 142 (536) Q Consensus 108 ~IDlVvvNLYPF~~~v~~-~~~~~~~IEnIDIGGps 142 (536) .|||||+++|||.+...+ +.....+.+++||+++| T Consensus 55 ~IdlVInt~~~~~~~~~~d~~~iRr~a~~~~Ip~~T 90 (90) T smart00851 55 EIDLVINTLYPLGAQPHEDGKALRRAAENIDIPGAT 90 (90) T ss_pred CEEEEEEECCCCCCHHHHCCCCHHHHHHCCCCCCCC T ss_conf 879999846876323653588899998808988709 No 11 >cd00532 MGS-like MGS-like domain. This domain composes the whole protein of methylglyoxal synthetase, which catalyzes the enolization of dihydroxyacetone phosphate (DHAP) to produce methylglyoxal. The family also includes the C-terminal domain in carbamoyl phosphate synthetase (CPS) where it catalyzes the last phosphorylation of a coaboxyphosphate intermediate to form the product carbamoyl phosphate and may also play a regulatory role. This family also includes inosine monophosphate cyclohydrolase. The known structures in this family show a common phosphate binding site. Probab=99.05 E-value=1.5e-09 Score=89.90 Aligned_cols=106 Identities=26% Similarity=0.372 Sum_probs=79.2 Q ss_pred EEEEEE--CCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHH Q ss_conf 899875--456869999999987999999427689999889951794653488165198553237155535533088989 Q gi|254780664|r 19 ALISVH--NKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNPA 96 (536) Q Consensus 19 ALiSV~--dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~ 96 (536) .||||. ||..++++|+.|.++||+|++|+||+++|+++||+|+.|..+... -+|.|..-|+.+ T Consensus 2 VfiSv~D~DK~~~i~~ak~l~~lGf~i~AT~GTa~~L~~~Gi~~~~V~Ki~e~----------g~p~v~d~i~~~----- 66 (112) T cd00532 2 VFLSVSDHVKAMLVDLAPKLSSDGFPLFATGGTSRVLADAGIPVRAVSKRHED----------GEPTVDAAIAEK----- 66 (112) T ss_pred EEEEEECCCHHHHHHHHHHHHHCCCEEEECHHHHHHHHHCCCEEEEEEECCCC----------CCCCHHHHHHCC----- T ss_conf 89998657566899999999988999998868999999879834898504878----------998899998668----- Q ss_pred HHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHH Q ss_conf 9999997799841499994888899731765311232002334346999874015631353089998 Q gi|254780664|r 97 HMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDY 163 (536) Q Consensus 97 ~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY 163 (536) ..|||||-...|.... ..+..|-.+-|+|-.+ +|.++|..+-. T Consensus 67 ----------g~I~lVINt~~~~~~~------------~~~~Dg~~IRr~Av~~--~IP~~Ttl~~A 109 (112) T cd00532 67 ----------GKFDVVINLRDPRRDR------------CTDEDGTALLRLARLY--KIPVTTPNATA 109 (112) T ss_pred ----------CCEEEEEECCCCCCCC------------CCCCCHHHHHHHHHHC--CCCEECCHHHH T ss_conf ----------9679999888888773------------3245489999999976--99988448986 No 12 >cd01424 MGS_CPS_II Methylglyoxal synthase-like domain from type II glutamine-dependent carbamoyl phosphate synthetase (CSP). CSP, a CarA and CarB heterodimer, catalyzes the production of carbamoyl phosphate which is subsequently employed in the metabolic pathways responsible for the synthesis of pyrimidine nucleotides or arginine. The MGS-like domain is the C-terminal domain of CarB and appears to play a regulatory role in CPS function by binding allosteric effector molecules, including UMP and ornithine. Probab=98.63 E-value=4.2e-08 Score=79.44 Aligned_cols=106 Identities=36% Similarity=0.500 Sum_probs=71.8 Q ss_pred EEEEEEE--CCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCH Q ss_conf 9899875--45686999999998799999942768999988995179465348816519855323715553553308898 Q gi|254780664|r 18 TALISVH--NKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNP 95 (536) Q Consensus 18 rALiSV~--dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~ 95 (536) +.|+||. ||..++++|+.|.++||+|++|.||+++|+++||+++.|..+.. | .|.| T Consensus 2 ~Vl~Sv~d~dK~e~l~~ak~l~~lGf~i~AT~GTa~~L~~~Gi~~~~v~k~~e------~-----~p~i----------- 59 (110) T cd01424 2 TVFISVADRDKPEAVEIAKRLAELGFKLVATEGTAKYLQEAGIPVEVVNKVSE------G-----RPNI----------- 59 (110) T ss_pred EEEEEEEHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHCCCCCEEEECCCC------C-----CCCH----------- T ss_conf 69999866547689999999998899999871699999864986403300567------9-----9768----------- Q ss_pred HHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHH Q ss_conf 999999977998414999948888997317653112320023343469998740156313530899989889 Q gi|254780664|r 96 AHMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFL 167 (536) Q Consensus 96 ~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~ 167 (536) .++=+. ..||||| |. |-. ... ++ | |-.|=|+|.. ..|.++|+.+--..++ T Consensus 60 ---~d~i~~--~~i~lVI-Nt-~~~-----~~~----~~--d--g~~IRr~Av~--~~Ip~~Tsi~~A~a~v 109 (110) T cd01424 60 ---VDLIKN--GEIQLVI-NT-PSG-----KRA----IR--D--GFSIRRAALE--YKVPYFTTLDTARAAV 109 (110) T ss_pred ---HHHHHC--CCEEEEE-EC-CCC-----CCC----CC--C--HHHHHHHHHH--CCCCEEECHHHHHHHH T ss_conf ---999977--9358999-77-889-----865----74--2--9999999997--6999996699999873 No 13 >cd01423 MGS_CPS_I_III Methylglyoxal synthase-like domain found in pyr1 and URA1-like carbamoyl phosphate synthetases (CPS), including ammonia-dependent CPS Type I, and glutamine-dependent CPS Type III. These are multidomain proteins, in which MGS is the C-terminal domain. Probab=98.63 E-value=1.1e-07 Score=76.29 Aligned_cols=112 Identities=35% Similarity=0.497 Sum_probs=73.5 Q ss_pred EEEEEEEEC--CCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCC Q ss_conf 698998754--568699999999879999994276899998899517946534881651985532371555355330889 Q gi|254780664|r 17 KTALISVHN--KTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDN 94 (536) Q Consensus 17 krALiSV~d--Ktgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~ 94 (536) |..||||.| |.-+.++|+.|.++||+|++|.||+++|+++||+|+.|..+...+ -+|+ |.+ T Consensus 1 KnVliSv~d~dK~~~l~~a~~l~~lGf~l~AT~GTa~~l~~~Gi~~~~v~k~~e~~--~~~~-----p~i---------- 63 (116) T cd01423 1 KGILISIGSYSKPELLPTAQKLSKLGYKLYATEGTADFLLENGIPVTPVAWPSEEP--QNDK-----PSL---------- 63 (116) T ss_pred CEEEEEEEHHHHHHHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCCEEEEEECCCC--CCCC-----CCH---------- T ss_conf 92999940203899999999999879999984189999998699657987862467--8999-----669---------- Q ss_pred HHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHH Q ss_conf 899999997799841499994888899731765311232002334346999874015631353089998988 Q gi|254780664|r 95 PAHMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLF 166 (536) Q Consensus 95 ~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~ 166 (536) ..-+++ ..||||| |+-- ..... ..| -| -.||.+|=. ..|..+|+++-=..+ T Consensus 64 ---~d~i~~---g~i~lVI-Ntp~-----~~~~~------~~~-~g-~~iRr~Av~-~~Ip~~T~~~~A~~~ 114 (116) T cd01423 64 ---RELLAE---GKIDLVI-NLPS-----NRGKR------VLD-ND-YVMRRAADD-FAVPLITNPKCAKLF 114 (116) T ss_pred ---HHHHHC---CCEEEEE-ECCC-----CCCCC------CCC-CC-HHHHHHHHH-CCCCEEECHHHHHHH T ss_conf ---999976---9868999-8889-----99864------335-45-899999997-599989779999987 No 14 >PRK05294 carB carbamoyl phosphate synthase large subunit; Reviewed Probab=98.44 E-value=5.9e-06 Score=63.88 Aligned_cols=66 Identities=23% Similarity=0.228 Sum_probs=31.0 Q ss_pred CCCCCCCCCCCEEEECCCCCC-------------CCCCHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCC Q ss_conf 436788100650576047766-------------8210121058988831222120011677642455672799961568 Q gi|254780664|r 226 MRYGENPHQKAALYSTPEKKS-------------GIAHAVLVQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMN 292 (536) Q Consensus 226 LRYGENPHQ~Aa~Y~~~~~~~-------------~~~~~~ql~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~N 292 (536) --=||=|-+...+|+.-+... |-...++-||-|--| .+..++..+.+.+. -+|+=-.| T Consensus 526 TcAgEF~a~T~Y~YSTY~~e~e~~~~~~~kvlvlGsGp~rIGqgiEFDY-------~~v~a~~alk~~G~--~~imIN~N 596 (1063) T PRK05294 526 TCAAEFEADTPYMYSTYEEECESNPSDRKKVMILGGGPNRIGQGIEFDY-------CCVHAALALREAGY--ETIMVNCN 596 (1063) T ss_pred CCCCCCCCCCCEEEEECCCCCCCCCCCCCCEEEECCCCCCCCCCCCCHH-------HHHHHHHHHHHCCC--EEEEECCC T ss_conf 6323456678726764577777677788637996577732044423006-------79999999996598--15895378 Q ss_pred CCCCCCCC Q ss_conf 65654520 Q gi|254780664|r 293 PCGVATAD 300 (536) Q Consensus 293 PCGvA~~~ 300 (536) |--|++.- T Consensus 597 PETVSTD~ 604 (1063) T PRK05294 597 PETVSTDY 604 (1063) T ss_pred CCCCCCCC T ss_conf 11111565 No 15 >PRK12815 carB carbamoyl phosphate synthase large subunit; Reviewed Probab=98.41 E-value=7.2e-06 Score=63.23 Aligned_cols=13 Identities=38% Similarity=0.424 Sum_probs=6.0 Q ss_pred EECCCCCCCCCCC Q ss_conf 9615686565452 Q gi|254780664|r 287 IVKHMNPCGVATA 299 (536) Q Consensus 287 IvKH~NPCGvA~~ 299 (536) |+=-+||--|++. T Consensus 592 ImiN~NPETVSTD 604 (1068) T PRK12815 592 IMINNNPETVSTD 604 (1068) T ss_pred EEECCCCCCCCCC T ss_conf 9962784312367 No 16 >TIGR01369 CPSaseII_lrg carbamoyl-phosphate synthase, large subunit; InterPro: IPR006275 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates , . CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate . The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain . CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites . The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein . The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP . There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia . CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains . This entry represents glutamine-dependent CPSase (6.3.5.5 from EC) from prokaryotes and eukaryotes (CPSase II). ; GO: 0004086 carbamoyl-phosphate synthase activity, 0006807 nitrogen compound metabolic process. Probab=97.57 E-value=9e-05 Score=55.25 Aligned_cols=130 Identities=25% Similarity=0.371 Sum_probs=85.9 Q ss_pred CCCCEE-EEEEEE--ECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHH Q ss_conf 776136-989987--54568699999999879999994276899998899517946534881651985532371555355 Q gi|254780664|r 12 GEIAVK-TALISV--HNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGI 88 (536) Q Consensus 12 ~~~~ik-rALiSV--~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggI 88 (536) ..+|-| ..|++| .||+.+.++++.|.+.||+|++|.||+++|.++||+++-|-.++.-+ ++-=- T Consensus 957 ~~~p~kg~~~~~~~d~dk~~~~~~a~~L~~~G~~~yat~gT~~~L~e~gi~~~~v~~~~e~~------~~~~C------- 1023 (1089) T TIGR01369 957 NRIPKKGSVLLSVRDKDKEELLDLAKKLAEKGYKIYATEGTAKFLEEAGIKAEVVLKVSEGR------EDDDC------- 1023 (1089) T ss_pred CCCCCCCEEEEEECCCCHHHHHHHHHHHHHCCCEEEEECCHHHHHHHCCCCHHHEEEECCCC------CCCCC------- T ss_conf 70578877999803443457999999998637378850202676765187633200003788------78888------- Q ss_pred HHCCCCHHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHH Q ss_conf 33088989999999779984149999488889973176531123200233434699987401563135308999898898 Q gi|254780664|r 89 LSIRDNPAHMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLA 168 (536) Q Consensus 89 L~~r~~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~ 168 (536) ..+|+=++.+++ ..|+|||--.++=.+-+. ...-+=-|=.+-|.|.+ .+|.++++++--..+++ T Consensus 1024 ---~~~p~~~~~~~~---~~~~~vIN~~~~~~~~~~--------~~~~~~~gy~iRR~A~~--~~vp~~t~~~~~~~~~~ 1087 (1089) T TIGR01369 1024 ---EAQPNILELIKN---GEIELVINLTSKGKEGVL--------LRETATDGYKIRREAVD--YGVPLITTLETAEAFLE 1087 (1089) T ss_pred ---CCCHHHHHHHHH---CCCEEEEECCCCCCCCEE--------ECCCCCHHHHHHHHHHH--CCCCEEECHHHHHHHHH T ss_conf ---887158999970---880699965788866312--------20245445899999996--49977865689999998 Q ss_pred HH Q ss_conf 75 Q gi|254780664|r 169 EM 170 (536) Q Consensus 169 el 170 (536) .| T Consensus 1088 al 1089 (1089) T TIGR01369 1088 AL 1089 (1089) T ss_pred HC T ss_conf 09 No 17 >pfam06230 DUF1009 Protein of unknown function (DUF1009). Family of uncharacterized bacterial proteins. Probab=97.25 E-value=0.0071 Score=41.46 Aligned_cols=113 Identities=25% Similarity=0.433 Sum_probs=84.2 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCCCEE---EEEECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEE Q ss_conf 1246777868898889999998600566779---9996893899729688889999999999876420012222348709 Q gi|254780664|r 403 TVVTKRSPTDQELRDMKFAFKVVKHVKSNAV---VYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSV 479 (536) Q Consensus 403 ~vVT~~~pt~~e~~dL~FA~kv~K~vkSNAI---v~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~v 479 (536) .+.|.++|+++++.|..|+|++++..-+=-| |+++++..+++=+= .-+.-.+.|+++.... +|+| T Consensus 83 G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVV~~g~VlavEa~-----EGTD~mI~R~~~l~~~-------~~~v 150 (212) T pfam06230 83 GVLTRRAPDKSELADIARGAEVAKALGALDIGQAVVVKDGAVLAVEAI-----EGTDAMLRRVGELRRG-------RGGV 150 (212) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEEECC-----CCHHHHHHHHHHHCCC-------CCEE T ss_conf 752578989899999999999999973468741699989989999553-----0689999999974568-------8808 Q ss_pred EE------ECCCCCCC----CHHHHHHHCCCCEE-EECCCCC--CHHHHHHHHHHCCCEEE Q ss_conf 99------73668685----55899997199599-9389877--98899999987597799 Q gi|254780664|r 480 IA------SEAFYPFP----DGIVEAIKAGVTAV-IQPGGSV--RDSEAITVADQHGIAMV 527 (536) Q Consensus 480 la------SDAFFPF~----D~ie~aa~~Gi~aI-iqPGGSi--rD~evI~aan~~gi~m~ 527 (536) +. -|=-|-+| +.|+.++++|.+.| ++.|.++ .-+++|+.||++||..+ T Consensus 151 lvK~~K~~QD~R~DlPtIG~~Ti~~~~~ag~~giavea~~~li~d~~~~i~~A~~~~i~i~ 211 (212) T pfam06230 151 LVKAPKPGQDLRFDLPTIGPDTIETAAEAGLAGIAVEAGKTLVLDREAVIAAADAAGLFVV 211 (212) T ss_pred EEECCCCCCCCCCCCCCCCHHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEE T ss_conf 9857787863124777748999999998399599996698799869999999998398995 No 18 >pfam06506 PrpR_N Propionate catabolism activator. This domain is found at the N terminus of several sigma54- dependent transcriptional activators including PrpR, which activates catabolism of propionate. Probab=92.78 E-value=0.66 Score=27.15 Aligned_cols=126 Identities=21% Similarity=0.349 Sum_probs=64.4 Q ss_pred EEEEEECCC--CHHHHHHHHHHCCCE-EEEEHHHHHHHHHC-CCCEEEHHHHCCCCHHCC--CCCCCCCCHHHHHHHHCC Q ss_conf 899875456--869999999987999-99942768999988-995179465348816519--855323715553553308 Q gi|254780664|r 19 ALISVHNKT--GVVEFASRLLSRGIK-IISTGGTCQLLEEE-GIPVTSVFDITKFPEIMG--GRVKTLHPKIYGGILSIR 92 (536) Q Consensus 19 ALiSV~dKt--gl~~la~~L~~~g~~-iisTgGTa~~l~~~-gi~v~~Vs~~TgfpEil~--GRVKTLHP~I~ggIL~~r 92 (536) +=+.|++.+ .-++.|+.|.+.|++ |||.|||+.+|+++ .+||.++. +|+|. ||. =+.|..+++| |+..-+ T Consensus 9 ~~i~v~~~~l~~av~~a~~~~~~g~dvIIsRGgta~~ir~~~~iPVv~I~-~s~~D-il~al~~a~~~~~ki--avvg~~ 84 (169) T pfam06506 9 ADVEVVDGGLEDAVEVARALVAEGVDVIISRGGTAAYLRDRLSVPVVEIK-VSGFD-LLRALARARRYGGRI--GLVGYE 84 (169) T ss_pred CEEEEEECCHHHHHHHHHHHHHCCCCEEEECCHHHHHHHHHCCCCEEEEE-CCHHH-HHHHHHHHHHHCCCE--EEEECC T ss_conf 72799977789999999999977995999896589999985899889982-78869-999999999758979--999276 Q ss_pred CCHHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCH-HHHHHCCCCCCHHHHHHHHCCC Q ss_conf 8989999999779984149999488889973176531-1232002334346999874015 Q gi|254780664|r 93 DNPAHMKFMQDHELESIDLVVVNLYPFEESFCREDDY-YTMVENIDIGGPSMIRAAAKNH 151 (536) Q Consensus 93 ~~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~-~~~IEnIDIGGpsmiRAAAKN~ 151 (536) +.-...+.+.++ -.+|+......--++.-..=... ++-+ .+=|||..-.+.|.|.- T Consensus 85 ~~~~~~~~~~~i--l~~~i~~~~~~~~~e~~~~i~~l~~~G~-~vvVG~~~~~~~A~~~G 141 (169) T pfam06506 85 NIIPGLKALSEL--LGLDIVQRAYQSEEEARAAVKELKAQGI-KVIVGDGLVCDLAEQAG 141 (169) T ss_pred CCCHHHHHHHHH--HCCCEEEEEECCHHHHHHHHHHHHHCCC-EEEECCCHHHHHHHHCC T ss_conf 303689999999--6993599996688999999999998699-59985828999999839 No 19 >TIGR00315 cdhB CO dehydrogenase/acetyl-CoA synthase complex, epsilon subunit; InterPro: IPR003704 Carbon monoxide dehydrogenase (Cdh) from Methanosarcina frisia Go1 is a Ni2+-, Fe2+-, and S2-containing alpha2beta2 heterotetramer . The CO dehydrogenase enzyme complex from Methanosarcina thermophila contains a corrinoid/iron-sulphur enzyme composed of two subunits (delta and gamma) . This family consists of carbon monoxide dehydrogenase I/II beta subunit 1.2.99.2 from EC and CO dehydrogenase (acetyl-CoA synthase epsilon subunit).; GO: 0003824 catalytic activity, 0008152 metabolic process. Probab=89.98 E-value=0.27 Score=30.01 Aligned_cols=44 Identities=23% Similarity=0.520 Sum_probs=35.9 Q ss_pred CCC-HHHHHHHHHHC-CCEEEEEHHHHHHHHHCCCC--E--EEHHHHCCC Q ss_conf 568-69999999987-99999942768999988995--1--794653488 Q gi|254780664|r 26 KTG-VVEFASRLLSR-GIKIISTGGTCQLLEEEGIP--V--TSVFDITKF 69 (536) Q Consensus 26 Ktg-l~~la~~L~~~-g~~iisTgGTa~~l~~~gi~--v--~~Vs~~Tgf 69 (536) +.- +.|+++.|.++ ++.+++|++|.+.|.|+||. + ..+-++|.| T Consensus 41 ~~Ek~~E~~~K~iekF~l~vvaTa~~~~~lieaGi~~~s~~~~L~~it~f 90 (170) T TIGR00315 41 KDEKLLELVVKFIEKFDLPVVATADAIKALIEAGIEEKSEEANLHEITQF 90 (170) T ss_pred HCCHHHHHHHHHHHHCCCCEEECCHHHHHHHHCCCCCCCCHHHHHHHHHH T ss_conf 21213455555564328864747311388885466211100379999975 No 20 >KOG0370 consensus Probab=88.50 E-value=0.51 Score=27.95 Aligned_cols=222 Identities=20% Similarity=0.190 Sum_probs=106.9 Q ss_pred CCEEEEEE-----ECCCHHHHHCCC-CCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHH----HHHHHHCCCCC Q ss_conf 84149999-----488889973176-531123200233434699987401563135308999898----89875302345 Q gi|254780664|r 107 ESIDLVVV-----NLYPFEESFCRE-DDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPL----FLAEMDVNNGK 176 (536) Q Consensus 107 ~~IDlVvv-----NLYPF~~~v~~~-~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~----~~~el~~~~g~ 176 (536) +..|.+|| .|--|..+..+- .+.....|-.-|| .+.--|..|--.-|-.-..-=++.. +.+||. T Consensus 716 pslDY~v~KiprWDl~kf~~vs~~igssmKSvgEvm~iG-R~feea~QKalr~vd~~~~Gf~~~~~~~~~~~eL~----- 789 (1435) T KOG0370 716 PSLDYCVVKIPRWDLSKFQRVSTEIGSSMKSVGEVMAIG-RTFEEAFQKALRMVDPSLLGFMSTPFLDDLDEELS----- 789 (1435) T ss_pred CCHHHEEEECCCCCHHHHHHHHHHHCHHHHHHHHHHHHH-HHHHHHHHHHHHHCCHHHCCCCCCCCHHHHHHHHC----- T ss_conf 532013440565337878788876252565246666221-30799998877514923237301021445777735----- Q ss_pred CCHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHC--------------CCCHHHHHHH----------------- Q ss_conf 666788999999999874228-------99998753310--------------1211344321----------------- Q gi|254780664|r 177 IPYNFRKKMARQAFSRTASYD-------TAICRWLANAE--------------SENFPDYLNI----------------- 218 (536) Q Consensus 177 ~~~~~R~~lA~kAF~~ta~YD-------~~Ia~~l~~~~--------------~~~~p~~~~~----------------- 218 (536) ..-..|-.-...|+.+- |. ..|-.||.... .+.+|..+.+ T Consensus 790 ~ptd~r~faia~a~~~~--~svd~~~elt~IdkWFL~k~~~i~~~~~~l~~~~~~~l~~~ll~~AK~~GFsD~QIa~~i~ 867 (1435) T KOG0370 790 TPTDRRVFAIAAALAKG--YSVDRIHELTRIDKWFLYKLMNIVNIYKLLESHSLSSLPKELLLRAKKLGFSDKQIAKFIG 867 (1435) T ss_pred CCCCCCHHHHHHHHHHC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCHHHHHHHHC T ss_conf 87541489999998725--7799999999899999999999999999987436566999999999872986899998857 Q ss_pred ----------------CCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCHHHCC----CCCCCCCC--CCCC-CHHHHHHH Q ss_conf ----------------0123333436788100650576047766821012105----89888312--2212-00116776 Q gi|254780664|r 219 ----------------TAVKKQEMRYGENPHQKAALYSTPEKKSGIAHAVLVQ----GKPLSYNN--INDL-DAAFELVS 275 (536) Q Consensus 219 ----------------~~~~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~----GKeLSYNN--llD~-daA~~lv~ 275 (536) ++.|.-+---+|=|-|.-.+|..-+....-..|.-++ |.- -|-= -+.. -.|..|++ T Consensus 868 s~El~vR~~R~~~gi~P~VKqIDTvAaE~Pa~TNYLY~tyn~~~hDv~F~~~g~mVlGsG-vYrIGSSVEFDwcaV~~~r 946 (1435) T KOG0370 868 STELAVRRLRKELGIHPFVKQIDTVAAEFPAQTNYLYTTYNATEHDVDFNEHGVMVLGSG-VYRIGSSVEFDWCAVGCAR 946 (1435) T ss_pred CCHHHHHHHHHHCCCCCHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCCCCCCEEEECCC-CEECCCCEEECHHHHHHHH T ss_conf 218999999986298704542212422376633369997057424653388865997243-2563562055133554799 Q ss_pred HCCCCCCCEEEEECCCCCCCCCCCCHHHH-------HHHHHHHHC-CCCCCCEEEEECCCCCHHHHHHHHHH Q ss_conf 42455672799961568656545201689-------999998533-21320045554275421457878865 Q gi|254780664|r 276 EFRSQDCAACVIVKHMNPCGVATADTLVE-------AYRRALSCD-PISAFGGIIAFNREVDQEVAKEVIKV 339 (536) Q Consensus 276 ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~-------Ay~~A~~~D-P~SAFGGIIa~N~~vd~~~A~~I~~~ 339 (536) ++.+.+.++ |+-.-||--|++.=+.-+ .|++-+.-= -..+=|-||++-..+--..|-.+.+. T Consensus 947 tLr~~g~kT--imvNyNPETVSTDyDecdrLYFeeis~E~vmDiYe~E~~~G~iis~GGQ~pnNiA~~L~r~ 1016 (1435) T KOG0370 947 TLRKLGKKT--IMVNYNPETVSTDYDECDRLYFEEISYERVMDIYELENSEGIIISVGGQLPNNIALKLHRN 1016 (1435) T ss_pred HHHHCCCCE--EEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHC T ss_conf 999709846--9995686543575677766767542366534355430677269983684862021375862 No 21 >pfam07085 DRTGG DRTGG domain. This presumed domain is about 120 amino acids in length. It is found associated with CBS domains pfam00571, as well as the CbiA domain pfam01656. The function of this domain is unknown. It is named the DRTGG domain after some of the most conserved residues. This domain may be very distantly related to a pair of CBS domains. There are no significant sequence similarities, but its length and association with CBS domains supports this idea (Bateman A, pers. obs.). Probab=85.56 E-value=1.6 Score=24.32 Aligned_cols=46 Identities=26% Similarity=0.434 Sum_probs=41.3 Q ss_pred CCHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEECCCCCC Q ss_conf 5558999971995999389877988999999875977997478544 Q gi|254780664|r 489 PDGIVEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAMVFTGIRHF 534 (536) Q Consensus 489 ~D~ie~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m~fTg~RhF 534 (536) +|.+..|..+++.+||=.||--=+++|++.|.+.+++++-|..-.| T Consensus 51 ~di~~~a~~~~~~~iIlTgg~~p~~~v~~la~~~~ipii~t~~dT~ 96 (105) T pfam07085 51 EDIQLAALLAGIAGLILTGGFEPSEEVLKLAEEAGLPVLSTPYDTF 96 (105) T ss_pred HHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHCCCEEEEECCCHH T ss_conf 8999999982487899948989899999999877983999668899 No 22 >cd01422 MGS Methylglyoxal synthase catalyzes the enolization of dihydroxyacetone phosphate (DHAP) to produce methylglyoxal. The first part of the catalytic mechanism is believed to be similar to TIM (triosephosphate isomerase) in that both enzymes utilize DHAP to form an ene-diolate phosphate intermediate. In MGS, the second catalytic step is characterized by the elimination of phosphate and collapse of the enediolate to form methylglyoxal instead of reprotonation to form the isomer glyceraldehyde 3-phosphate, as in TIM. This is the first reaction in the methylglyoxal bypass of the Embden-Myerhoff glycolytic pathway and is believed to provide physiological benefits under non-ideal growth conditions in bacteria. Probab=84.75 E-value=2.4 Score=23.10 Aligned_cols=106 Identities=23% Similarity=0.299 Sum_probs=64.2 Q ss_pred EEEEEEE-CCCCHHHHHHHHHHC--CCEEEEEHHHHHHHHHC-CCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCC Q ss_conf 9899875-456869999999987--99999942768999988-9951794653488165198553237155535533088 Q gi|254780664|r 18 TALISVH-NKTGVVEFASRLLSR--GIKIISTGGTCQLLEEE-GIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRD 93 (536) Q Consensus 18 rALiSV~-dKtgl~~la~~L~~~--g~~iisTgGTa~~l~~~-gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~ 93 (536) -|||.=- -|..+++|++.-.+. +.+|++||.|.+.|.++ |++|.. +.+--.|| T Consensus 2 IALIAHD~kK~~lv~f~~~~~~~L~~~~L~ATgtTG~~i~e~~~L~V~~-----------------~~sGplGG------ 58 (115) T cd01422 2 IALIAHDNKKEDLVEFVKQHQELLSRHRLVATGTTGLLIQEATGLTVNR-----------------MKSGPLGG------ 58 (115) T ss_pred EEEEECCCCHHHHHHHHHHHHHHHHCCCEEECCCHHHHHHHHHCCCEEE-----------------ECCCCCCH------ T ss_conf 8998345665999999999999970796686400999999864387688-----------------55799668------ Q ss_pred CHHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHH Q ss_conf 9899999997799841499994888899731765311232002334346999874015631353089998 Q gi|254780664|r 94 NPAHMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDY 163 (536) Q Consensus 94 ~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY 163 (536) |++-...---..||+|+.=--|... . +.+ -|| -+|+|.+.- .+|.+-|++.-= T Consensus 59 ---DqQIga~I~~g~id~viFf~DPl~~---~--ph~-----~Di--~aLlRlc~v--~niP~AtN~aTA 111 (115) T cd01422 59 ---DQQIGALIAEGEIDAVIFFRDPLTA---Q--PHE-----PDV--KALLRLCDV--YNIPLATNRSTA 111 (115) T ss_pred ---HHHHHHHHHHCCCCEEEEECCCCCC---C--CCC-----CCH--HHHHHHHHH--CCCCCCCCHHHH T ss_conf ---9999999981998889984289877---9--997-----029--999999986--196121799889 No 23 >pfam01081 Aldolase KDPG and KHG aldolase. This family includes the following members: 4-hydroxy-2-oxoglutarate aldolase (KHG-aldolase) Phospho-2-dehydro-3-deoxygluconate aldolase (KDPG-aldolase) Probab=83.99 E-value=1.7 Score=24.16 Aligned_cols=88 Identities=15% Similarity=0.230 Sum_probs=48.6 Q ss_pred CCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHH Q ss_conf 31222120011677642455672799961568656545201689999998533213200455542754214578788654 Q gi|254780664|r 261 YNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVF 340 (536) Q Consensus 261 YNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F 340 (536) --...|.+.|..++.-+-+.+.+ +.=|--++|. -.++.++..+--| +=+|..-.-+|.+.++...+.= T Consensus 13 V~r~~~~~~a~~~~~al~~~Gi~-~iEiTl~t~~-------a~~~I~~l~~~~p----~~~iGaGTV~~~e~~~~a~~aG 80 (196) T pfam01081 13 VIVIKDKEDALPLAEALAAGGIR-VLEVTLRTPC-------ALDAIRLLRKNRP----DALVGAGTVLNAQQLAEAAEAG 80 (196) T ss_pred EEECCCHHHHHHHHHHHHHCCCC-EEEEECCCHH-------HHHHHHHHHHHCC----CCEEEEEECCCHHHHHHHHHCC T ss_conf 99779999999999999987998-8999479827-------9999999996499----9679998376899999999749 Q ss_pred HHHHEECCCCHHHHHHHHCC Q ss_conf 31111268798999885124 Q gi|254780664|r 341 TEAIIAPTLSEEAADVLAKK 360 (536) Q Consensus 341 ~EvIiAP~f~~eAleiL~~K 360 (536) .+-++.|+++++-++...+. T Consensus 81 A~FivSP~~~~~v~~~a~~~ 100 (196) T pfam01081 81 AQFVVSPGLTADLLKHAVDV 100 (196) T ss_pred CCEEECCCCHHHHHHHHHHC T ss_conf 99999787639999999973 No 24 >cd00452 KDPG_aldolase KDPG and KHG aldolase. This family belongs to the class I adolases whose reaction mechanism involves Schiff base formation between a substrate carbonyl and lysine residue in the active site. 2-keto-3-deoxy-6-phosphogluconate (KDPG) aldolase, is best known for its role in the Entner-Doudoroff pathway of bacteria, where it catalyzes the reversible cleavage of KDPG to pyruvate and glyceraldehyde-3-phosphate. 2-keto-4-hydroxyglutarate (KHG) aldolase, which has enzymatic specificity toward glyoxylate, forming KHG in the presence of pyruvate, and is capable of regulating glyoxylate levels in the glyoxylate bypass, an alternate pathway when bacteria are grown on acetate carbon sources. Probab=83.78 E-value=1.6 Score=24.40 Aligned_cols=84 Identities=18% Similarity=0.283 Sum_probs=46.6 Q ss_pred CCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHH Q ss_conf 21200116776424556727999615686565452016899999985332132004555427542145787886543111 Q gi|254780664|r 265 NDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFTEAI 344 (536) Q Consensus 265 lD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvI 344 (536) .|.+.|..++.-+-+.+.+ ++=|--++|.. -...+.+.+.+ | .=+|..-.-+|.+.++...+-=.+-+ T Consensus 13 ~~~~~a~~~~~al~~~Gi~-~iEitl~t~~a----~~~i~~l~~~~---~----~~~iGaGTV~~~~~~~~a~~aGa~Fi 80 (190) T cd00452 13 DDAEDALALAEALIEGGIR-AIEITLRTPGA----LEAIRALRKEF---P----EALIGAGTVLTPEQADAAIAAGAQFI 80 (190) T ss_pred CCHHHHHHHHHHHHHCCCC-EEEEECCCCHH----HHHHHHHHHHC---C----CCEEEECCCCCHHHHHHHHHCCCCEE T ss_conf 9999999999999986998-89996788029----99999999868---9----80896523477999999998599899 Q ss_pred EECCCCHHHHHHHHCC Q ss_conf 1268798999885124 Q gi|254780664|r 345 IAPTLSEEAADVLAKK 360 (536) Q Consensus 345 iAP~f~~eAleiL~~K 360 (536) +.|+++++-++..++. T Consensus 81 vsP~~~~~v~~~a~~~ 96 (190) T cd00452 81 VSPGLDPEVVKAANRA 96 (190) T ss_pred ECCCCCHHHHHHHHHC T ss_conf 7377999999999982 No 25 >PRK05718 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=83.77 E-value=1.7 Score=24.21 Aligned_cols=89 Identities=19% Similarity=0.225 Sum_probs=42.7 Q ss_pred CCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHH Q ss_conf 88312221200116776424556727999615686565452016899999985332132004555427542145787886 Q gi|254780664|r 259 LSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIK 338 (536) Q Consensus 259 LSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~ 338 (536) +.--...|.+.|..++.-+-+.+.+ ++=|--++|.. .++.+...+.=| .=+|..-.-+|.+.++...+ T Consensus 18 i~Vlr~~~~~~a~~i~~al~~gGi~-~iEiTl~tp~a-------~~~I~~l~~~~p----~~~vGaGTV~~~e~~~~a~~ 85 (212) T PRK05718 18 VPVIVINKLEDAVPLAKALVAGGLP-VLEVTLRTPAA-------LEAIRAIRKEVP----EALIGAGTVLNPEQLAQAIE 85 (212) T ss_pred EEEEECCCHHHHHHHHHHHHHCCCC-EEEEECCCCHH-------HHHHHHHHHHCC----CCEEEEEEECCHHHHHHHHH T ss_conf 9999748999999999999987997-89995789619-------999999997589----81796533134889999998 Q ss_pred HHHHHHEECCCCHHHHHHHHC Q ss_conf 543111126879899988512 Q gi|254780664|r 339 VFTEAIIAPTLSEEAADVLAK 359 (536) Q Consensus 339 ~F~EvIiAP~f~~eAleiL~~ 359 (536) -=.+-++.|+++++-++.-.+ T Consensus 86 aGA~FiVSP~~~~~v~~~a~~ 106 (212) T PRK05718 86 AGAQFIVSPGLTPPLLKACQD 106 (212) T ss_pred CCCCEEECCCCCHHHHHHHHH T ss_conf 499899848998999999998 No 26 >PRK08104 consensus Probab=83.06 E-value=1.8 Score=24.06 Aligned_cols=93 Identities=23% Similarity=0.297 Sum_probs=54.8 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHH Q ss_conf 58988831222120011677642455672799961568656545201689999998533213200455542754214578 Q gi|254780664|r 255 QGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAK 334 (536) Q Consensus 255 ~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~ 334 (536) +.+-+---..-|.+.|..++.-+-+.+.+ ++=|--++|+. .++.++....-| .-+|..-.-+|.+.++ T Consensus 14 ~~~iipVir~~~~~~a~~la~al~~gGi~-~iEiTlrt~~a-------~~~I~~l~~~~p----~~~vGaGTV~~~e~~~ 81 (212) T PRK08104 14 RGPVVPVIVINKLEHAVPLAKALVAGGVR-VLEVTLRTPCA-------LEAIRAIAKEVP----EAIVGAGTVLNPQQLA 81 (212) T ss_pred HCCEEEEEECCCHHHHHHHHHHHHHCCCC-EEEEECCCCHH-------HHHHHHHHHHCC----CCEEEEEECCCHHHHH T ss_conf 49868999779999999999999987998-89996888149-------999999998689----8568542026799999 Q ss_pred HHHHHHHHHHEECCCCHHHHHHHHC Q ss_conf 7886543111126879899988512 Q gi|254780664|r 335 EVIKVFTEAIIAPTLSEEAADVLAK 359 (536) Q Consensus 335 ~I~~~F~EvIiAP~f~~eAleiL~~ 359 (536) ...+-=.+-++.|+++++-++.-.+ T Consensus 82 ~ai~aGA~FiVSP~~~~~v~~~a~~ 106 (212) T PRK08104 82 EVTEAGAQFAISPGLTEELLKAATE 106 (212) T ss_pred HHHHCCCCEEECCCCCHHHHHHHHH T ss_conf 9998599999848999999999998 No 27 >PRK07455 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=82.49 E-value=1.9 Score=23.81 Aligned_cols=40 Identities=18% Similarity=0.125 Sum_probs=23.2 Q ss_pred EEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCC Q ss_conf 5554275421457878865431111268798999885124 Q gi|254780664|r 321 IIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKK 360 (536) Q Consensus 321 IIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~K 360 (536) +|..-.-+|.+.++...+-=.+-++.|.++++-++...+. T Consensus 66 ~iGaGTV~~~e~~~~a~~aGA~FiVSP~~~~~vi~~a~~~ 105 (210) T PRK07455 66 IIGTGTLLTLEDLEEAIAAGAQFCFTPHVDLELIQAAVAA 105 (210) T ss_pred EEEEEECCCHHHHHHHHHCCCCEEECCCCCHHHHHHHHHC T ss_conf 8988818789999999986999998688889999999982 No 28 >PRK06015 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=82.10 E-value=2 Score=23.66 Aligned_cols=89 Identities=13% Similarity=0.181 Sum_probs=50.1 Q ss_pred CCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHH Q ss_conf 83122212001167764245567279996156865654520168999999853321320045554275421457878865 Q gi|254780664|r 260 SYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKV 339 (536) Q Consensus 260 SYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~ 339 (536) ---..-|.+.+..++.-+-+.+.+ ++=|--++|.. .++.+...+.=| +-+|..-.-+|.+.++...+- T Consensus 19 pVlr~~~~~~a~~~~~al~~gGi~-~iEITlrt~~a-------~~~I~~l~~~~p----~~~vGaGTVl~~e~~~~a~~a 86 (212) T PRK06015 19 PVLLIDDVEHAVPLARALARGGLP-AIEITLRTPAA-------LDAIRAVAAEVE----EAIVGAGTILNAKQFEDAAKA 86 (212) T ss_pred EEEECCCHHHHHHHHHHHHHCCCC-EEEEECCCCCH-------HHHHHHHHHHCC----CCEEEEEECCCHHHHHHHHHC T ss_conf 999779999999999999987998-89996899519-------999999998699----967954211569999999984 Q ss_pred HHHHHEECCCCHHHHHHHHCC Q ss_conf 431111268798999885124 Q gi|254780664|r 340 FTEAIIAPTLSEEAADVLAKK 360 (536) Q Consensus 340 F~EvIiAP~f~~eAleiL~~K 360 (536) =.+-++.|+++++-++.-++. T Consensus 87 GA~FiVSP~~~~~v~~~a~~~ 107 (212) T PRK06015 87 GSRFIVSPGTTQELLAAANDS 107 (212) T ss_pred CCCEEECCCCCHHHHHHHHHC T ss_conf 998998589999999999983 No 29 >PRK09140 2-dehydro-3-deoxy-6-phosphogalactonate aldolase; Reviewed Probab=81.00 E-value=1.9 Score=23.80 Aligned_cols=82 Identities=15% Similarity=0.180 Sum_probs=52.7 Q ss_pred CCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCE--EEEECCCCCHHHHHHHHHHHHHH Q ss_conf 1200116776424556727999615686565452016899999985332132004--55542754214578788654311 Q gi|254780664|r 266 DLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGG--IIAFNREVDQEVAKEVIKVFTEA 343 (536) Q Consensus 266 D~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGG--IIa~N~~vd~~~A~~I~~~F~Ev 343 (536) |.+.|..++.-+-+.+.+ ++=|--++|+.. ...+...+. ||. .|..-.-+|.+.++...+-=.+- T Consensus 20 ~~~~a~~~~~al~~~Gi~-~iEVTl~tp~a~----~~I~~l~~~--------~~~~~~iGAGTVlt~e~~~~ai~aGA~F 86 (206) T PRK09140 20 TPDEALAHVGALIEAGFR-AIEIPLNSPDPF----DSIAALVKA--------LGDDALIGAGTVLSPEQVDRLADAGGRL 86 (206) T ss_pred CHHHHHHHHHHHHHCCCC-EEEEECCCCCHH----HHHHHHHHH--------CCCCEEEEEEECCCHHHHHHHHHCCCCE T ss_conf 999999999999986998-899917997699----999999996--------7986599862046799999999859999 Q ss_pred HEECCCCHHHHHHHHCC Q ss_conf 11268798999885124 Q gi|254780664|r 344 IIAPTLSEEAADVLAKK 360 (536) Q Consensus 344 IiAP~f~~eAleiL~~K 360 (536) |+.|.++++-++..+++ T Consensus 87 iVSP~~~~~vi~~a~~~ 103 (206) T PRK09140 87 IVTPNIDPEVIRRAVAY 103 (206) T ss_pred EECCCCCHHHHHHHHHC T ss_conf 99999989999999982 No 30 >PRK06857 consensus Probab=80.83 E-value=2.4 Score=23.06 Aligned_cols=88 Identities=13% Similarity=0.156 Sum_probs=46.0 Q ss_pred CCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHH Q ss_conf 83122212001167764245567279996156865654520168999999853321320045554275421457878865 Q gi|254780664|r 260 SYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKV 339 (536) Q Consensus 260 SYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~ 339 (536) .-....|.+.+..++.-+-+.+.+ +.=|--++|+. .++.+...+.-| .=+|..-.-+|.+.++.-.+- T Consensus 16 pVir~~~~~~a~~~~~al~~gGi~-~iEiTlrt~~a-------~~~I~~l~~~~p----~~~vGaGTV~~~e~~~~a~~a 83 (209) T PRK06857 16 PVIAIDDAEDILPLAKVLAENGLP-VAEITFRSAAA-------AEAIRLLREAYP----DMLIGAGTVLTPEQVDAAKEA 83 (209) T ss_pred EEEECCCHHHHHHHHHHHHHCCCC-EEEEECCCCCH-------HHHHHHHHHHCC----CCEEEEEECCCHHHHHHHHHC T ss_conf 999759999999999999987998-89995899329-------999999997589----948999937679999999983 Q ss_pred HHHHHEECCCCHHHHHHHHC Q ss_conf 43111126879899988512 Q gi|254780664|r 340 FTEAIIAPTLSEEAADVLAK 359 (536) Q Consensus 340 F~EvIiAP~f~~eAleiL~~ 359 (536) =.+-++.|+++++-++...+ T Consensus 84 GA~FiVSP~~~~~v~~~a~~ 103 (209) T PRK06857 84 GADFIVSPGFNPNTVKYCQQ 103 (209) T ss_pred CCCEEECCCCCHHHHHHHHH T ss_conf 99999908999999999997 No 31 >PRK07114 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=80.79 E-value=1.9 Score=23.82 Aligned_cols=93 Identities=15% Similarity=0.173 Sum_probs=51.0 Q ss_pred CCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHH-HCCCCCCCEEEEECCCCCHHHHHHHH Q ss_conf 88312221200116776424556727999615686565452016899999985-33213200455542754214578788 Q gi|254780664|r 259 LSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALS-CDPISAFGGIIAFNREVDQEVAKEVI 337 (536) Q Consensus 259 LSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~-~DP~SAFGGIIa~N~~vd~~~A~~I~ 337 (536) +.---.-|.+.|..++.-+-+.+.++.=|- -.+|+. ....++..+.+. -.| +=+|..-.-+|.+.++... T Consensus 19 ipVvr~~~~e~a~~~a~aL~~gGi~~iEiT-lrt~~a----~~~i~~l~~~~~~~~p----~~~iGaGTVl~~~~~~~a~ 89 (223) T PRK07114 19 VPVFYHSDIEVAKKVVKACYDGGVRAFEFT-NRGDFA----HEVFGELVKYAAKECP----EMILGVGSIVDAATAALYI 89 (223) T ss_pred EEEEEECCHHHHHHHHHHHHHCCCCEEEEE-CCCCCH----HHHHHHHHHHHHHHCC----CCEEEEECCCCHHHHHHHH T ss_conf 999982899999999999998899889995-889658----9999999999986689----8089655188999999999 Q ss_pred HHHHHHHEECCCCHHHHHHHHCC Q ss_conf 65431111268798999885124 Q gi|254780664|r 338 KVFTEAIIAPTLSEEAADVLAKK 360 (536) Q Consensus 338 ~~F~EvIiAP~f~~eAleiL~~K 360 (536) +-=.+-++.|+++++-++...+. T Consensus 90 ~aGA~FiVSP~~~~~v~~~~~~~ 112 (223) T PRK07114 90 QLGANFVVGPLFNEDIAKVCNRR 112 (223) T ss_pred HCCCCEEECCCCCHHHHHHHHHC T ss_conf 85998999999999999999983 No 32 >PRK08904 consensus Probab=80.56 E-value=2.5 Score=22.96 Aligned_cols=93 Identities=22% Similarity=0.238 Sum_probs=57.9 Q ss_pred CCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHH Q ss_conf 89888312221200116776424556727999615686565452016899999985332132004555427542145787 Q gi|254780664|r 256 GKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKE 335 (536) Q Consensus 256 GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~ 335 (536) .+-+.-...-|.+.|..++.-+-+.+.+ ++=|--++|.. .++.+...+--| +=+|..-.-+|.+.++. T Consensus 10 ~~iipVir~~~~~~a~~~a~al~~~Gi~-~iEiTlrtp~a-------~~~i~~l~~~~p----~~~vGaGTVl~~e~~~~ 77 (207) T PRK08904 10 GAVVPVMAIDDLSTAVDLSRALVEGGIP-TLEITLRTPVG-------LDAIRLIAKEVP----NAIVGAGTVTNPEQLKA 77 (207) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHCCCC-EEEEECCCCHH-------HHHHHHHHHHCC----CCEEEEEECCCHHHHHH T ss_conf 9979999769999999999999987998-89995799139-------999999998689----87685531368999999 Q ss_pred HHHHHHHHHEECCCCHHHHHHHHCC Q ss_conf 8865431111268798999885124 Q gi|254780664|r 336 VIKVFTEAIIAPTLSEEAADVLAKK 360 (536) Q Consensus 336 I~~~F~EvIiAP~f~~eAleiL~~K 360 (536) ..+-=.+-|+.|+++++-++..++. T Consensus 78 a~~aGA~FiVSP~~~~~v~~~a~~~ 102 (207) T PRK08904 78 VEDAGAVFAISPGLHESLAKAGHNS 102 (207) T ss_pred HHHCCCCEEECCCCCHHHHHHHHHC T ss_conf 9984999998489989999999983 No 33 >PRK05234 mgsA methylglyoxal synthase; Validated Probab=80.36 E-value=1.6 Score=24.32 Aligned_cols=113 Identities=21% Similarity=0.279 Sum_probs=75.6 Q ss_pred EEEEEEEEC-CCCHHHHHHHHHHC--CCEEEEEHHHHHHHHHC-CCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCC Q ss_conf 698998754-56869999999987--99999942768999988-995179465348816519855323715553553308 Q gi|254780664|r 17 KTALISVHN-KTGVVEFASRLLSR--GIKIISTGGTCQLLEEE-GIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIR 92 (536) Q Consensus 17 krALiSV~d-Ktgl~~la~~L~~~--g~~iisTgGTa~~l~~~-gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r 92 (536) +-|||.=-. |..+++|++.-.+. +.+|++||.|.+.|.++ |++|. .+.+--.|| T Consensus 7 ~IALIAHD~kK~~lv~f~~~~~~~L~~~~L~ATgtTG~~i~~~~gL~v~-----------------~~~SGplGG----- 64 (146) T PRK05234 7 RIALIAHDHKKDDLVAWVKAHKELLEQHELYATGTTGGLIQEATGLDVT-----------------RLLSGPLGG----- 64 (146) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCCEEECCCHHHHHHHHHCCCEE-----------------EECCCCCCH----- T ss_conf 5899955576499999999999998279779667599999998798369-----------------973688418----- Q ss_pred CCHHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHH Q ss_conf 89899999997799841499994888899731765311232002334346999874015631353089998988987 Q gi|254780664|r 93 DNPAHMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAE 169 (536) Q Consensus 93 ~~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~e 169 (536) |.+-...---..||+|+.=--|... . +++ -|| -+|+|.+.- .+|.+.|+++-=+-++.. T Consensus 65 ----DqQIga~Ia~g~id~viFF~DPl~~---~--phd-----~Dv--~aLlRlc~v--~nIP~AtN~aTAe~li~~ 123 (146) T PRK05234 65 ----DQQIGALIAEGKIDMLIFFWDPLTA---Q--PHD-----PDV--KALLRLADV--WNIPVATNRATADFLISS 123 (146) T ss_pred ----HHHHHHHHHHCCCCEEEEEECCCCC---C--CCC-----CHH--HHHHHHHHH--HCCCCCCCHHHHHHHHCC T ss_conf ----9999999983998789994088755---7--775-----209--999999987--287642689789999708 No 34 >COG0117 RibD Pyrimidine deaminase [Coenzyme metabolism] Probab=79.80 E-value=5.3 Score=20.61 Aligned_cols=81 Identities=26% Similarity=0.257 Sum_probs=54.3 Q ss_pred HHHHHHHHHHHCCCCC---EEEEEECCEEEEECCCCCC-HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC-----CCC Q ss_conf 8899999986005667---7999968938997296888-899999999998764200122223487099973-----668 Q gi|254780664|r 416 RDMKFAFKVVKHVKSN---AVVYAKDGRTVGIGSGQTS-RVDSTRFAAIKAHNISAQADVKSMTNGSVIASE-----AFY 486 (536) Q Consensus 416 ~dL~FA~kv~K~vkSN---AIv~ak~~~tiGiGaGQ~s-RVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSD-----AFF 486 (536) ..|..||+..-.+..| .-|++|++++||.|.-+.- +=.|=..|+.+|++ .++|+.+-.- -|= T Consensus 12 ~Al~lA~k~~g~T~pNP~VG~VIV~~~~Ivg~G~h~~aG~pHAEv~Al~~ag~---------~a~Gat~yVTLEPCsH~G 82 (146) T COG0117 12 RALELAEKGQGTTSPNPSVGCVIVKDGEIVGEGYHEKAGGPHAEVCALRMAGE---------AARGATAYVTLEPCSHYG 82 (146) T ss_pred HHHHHHHHCCCCCCCCCCEEEEEEECCEEEEEEECCCCCCCCHHHHHHHHCCC---------CCCCCEEEEEECCCCCCC T ss_conf 99999985086678999666999889999866603778997289999997585---------557878999724766689 Q ss_pred CCCCHHHHHHHCCCCEEEE Q ss_conf 6855589999719959993 Q gi|254780664|r 487 PFPDGIVEAIKAGVTAVIQ 505 (536) Q Consensus 487 PF~D~ie~aa~~Gi~aIiq 505 (536) -=|-+.+.+.++||+-|+- T Consensus 83 rTPPC~~ali~agi~rVvv 101 (146) T COG0117 83 RTPPCADALIKAGVARVVV 101 (146) T ss_pred CCCCHHHHHHHCCCCEEEE T ss_conf 9964579999709888999 No 35 >COG3494 Uncharacterized protein conserved in bacteria [Function unknown] Probab=79.77 E-value=5.3 Score=20.60 Aligned_cols=118 Identities=22% Similarity=0.397 Sum_probs=78.6 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCCCEE---EEEECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEE Q ss_conf 1246777868898889999998600566779---9996893899729688889999999999876420012222348709 Q gi|254780664|r 403 TVVTKRSPTDQELRDMKFAFKVVKHVKSNAV---VYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSV 479 (536) Q Consensus 403 ~vVT~~~pt~~e~~dL~FA~kv~K~vkSNAI---v~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~v 479 (536) .+.|+..|+.++++|...|..++++.-+=-| ++++++..|++-+- .-+..-+.++.+....- . ..-+|.| T Consensus 141 g~lt~~~P~~~d~~dI~~g~~aA~~lg~lDVGQ~aV~~~g~vvAvEg~-----EGTd~ml~R~a~lr~~g-r-~~rr~gv 213 (279) T COG3494 141 GPLTKKEPDNEDLRDIELGIEAANALGALDVGQGAVVVGGRVVAVEGA-----EGTDAMLRRVADLRKAG-R-AERRGGV 213 (279) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCEEEEEEEC-----CCHHHHHHHHHHHHHCC-C-CCCCCCE T ss_conf 765678887224889999999998742034565469967769999603-----26499999999876435-5-5656878 Q ss_pred EEECCC--------CC--CCCHHHHHHHCCCCEEE-ECCCCC--CHHHHHHHHHHCCCEEE Q ss_conf 997366--------86--85558999971995999-389877--98899999987597799 Q gi|254780664|r 480 IASEAF--------YP--FPDGIVEAIKAGVTAVI-QPGGSV--RDSEAITVADQHGIAMV 527 (536) Q Consensus 480 laSDAF--------FP--F~D~ie~aa~~Gi~aIi-qPGGSi--rD~evI~aan~~gi~m~ 527 (536) +.-=+= .| =++.|+.|+++|.+.|. |-|-++ .-++.|++||++||-.+ T Consensus 214 LvK~aKp~QD~R~DlPtIG~~Ti~~a~kaGlaGIaieagr~lile~e~~l~~Ank~giFI~ 274 (279) T COG3494 214 LVKMAKPQQDERADLPTIGPNTIENAAKAGLAGIAIEAGRVLILEREATLKLANKLGIFIV 274 (279) T ss_pred EEECCCCCCCCCCCCCCCCHHHHHHHHHCCCCCEEEECCCEEEECCHHHHHHHHHCCEEEE T ss_conf 9982688854313687558899999997582113500393799653888986765783899 No 36 >PRK08782 consensus Probab=78.20 E-value=3.2 Score=22.15 Aligned_cols=92 Identities=14% Similarity=0.195 Sum_probs=55.6 Q ss_pred CCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHH Q ss_conf 98883122212001167764245567279996156865654520168999999853321320045554275421457878 Q gi|254780664|r 257 KPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEV 336 (536) Q Consensus 257 KeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I 336 (536) +-+.-...-|.+.|..++.-+-+.+.++ .=|--++|.. -...+.+.+. -| .=+|..-.-+|.+.++.. T Consensus 18 ~iipVir~~~~~~a~~~~eal~~gGi~~-iEiTlrt~~a----~~~i~~l~~~---~p----~~~vGaGTV~~~e~~~~a 85 (219) T PRK08782 18 GILPVVTVDTLDQARRVADALLEGGLPA-IELTLRTPVA----IEALAMLKRE---LP----NIVIGAGTVLSERQLRQS 85 (219) T ss_pred CEEEEEECCCHHHHHHHHHHHHHCCCCE-EEEECCCCHH----HHHHHHHHHH---CC----CCEEEEEEECCHHHHHHH T ss_conf 9789997599999999999999879987-9996799339----9999999986---89----947999970589999999 Q ss_pred HHHHHHHHEECCCCHHHHHHHHCC Q ss_conf 865431111268798999885124 Q gi|254780664|r 337 IKVFTEAIIAPTLSEEAADVLAKK 360 (536) Q Consensus 337 ~~~F~EvIiAP~f~~eAleiL~~K 360 (536) .+.=.+-++.|+++++-++...+. T Consensus 86 ~~aGA~FiVSP~~~~~v~~~a~~~ 109 (219) T PRK08782 86 VDAGADFLVTPGTPAPLARLLADA 109 (219) T ss_pred HHCCCCEEECCCCCHHHHHHHHHC T ss_conf 984998998789979999999981 No 37 >PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed Probab=77.79 E-value=6 Score=20.19 Aligned_cols=20 Identities=25% Similarity=0.539 Sum_probs=9.7 Q ss_pred HHHHHHCCCCCEEEEEEECC Q ss_conf 99999779984149999488 Q gi|254780664|r 98 MKFMQDHELESIDLVVVNLY 117 (536) Q Consensus 98 ~~~l~~~~i~~IDlVvvNLY 117 (536) +....++||..|.|+....| T Consensus 100 I~lA~~LGi~~I~l~g~dv~ 119 (284) T PRK13210 100 IRLAQDLGIRTIQLAGYDVY 119 (284) T ss_pred HHHHHHCCCCEEEECCCCCC T ss_conf 99999809978996887666 No 38 >PRK06552 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional Probab=76.06 E-value=4 Score=21.50 Aligned_cols=93 Identities=15% Similarity=0.221 Sum_probs=57.3 Q ss_pred CCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHH Q ss_conf 88312221200116776424556727999615686565452016899999985332132004555427542145787886 Q gi|254780664|r 259 LSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIK 338 (536) Q Consensus 259 LSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~ 338 (536) +.--...|.+.|..++.-+-+.+.+ +.=|--++|.. ....+...+.+...| +=.|..-.-+|.+.++...+ T Consensus 16 iaVlr~~~~~~a~~~~~al~~gGi~-~iEITl~tp~a----~~~i~~l~~~~~~~p----~~~iGaGTV~~~e~~~~a~~ 86 (209) T PRK06552 16 VAVVRGESKEEALKISLAVIKGGIK-AIEVTYTNPFA----SEVIKELVERYKDDP----EVLIGAGTVLDAVTARQAIL 86 (209) T ss_pred EEEEECCCHHHHHHHHHHHHHCCCC-EEEEECCCCCH----HHHHHHHHHHHCCCC----CEEEEEECCCCHHHHHHHHH T ss_conf 9999728999999999999987998-89996789759----999999999817799----81898872748999999998 Q ss_pred HHHHHHEECCCCHHHHHHHHCC Q ss_conf 5431111268798999885124 Q gi|254780664|r 339 VFTEAIIAPTLSEEAADVLAKK 360 (536) Q Consensus 339 ~F~EvIiAP~f~~eAleiL~~K 360 (536) -=.+-|+.|+++++-++..++. T Consensus 87 aGA~FiVSP~~~~~v~~~a~~~ 108 (209) T PRK06552 87 AGAQFIVSPSFNRETAKICNRY 108 (209) T ss_pred CCCCEEECCCCCHHHHHHHHHC T ss_conf 5998897699989999999985 No 39 >PRK00489 hisG ATP phosphoribosyltransferase; Reviewed Probab=74.60 E-value=6.2 Score=20.09 Aligned_cols=83 Identities=17% Similarity=0.279 Sum_probs=42.0 Q ss_pred HHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHH---HHHHCCCCHHHHHHHHHCCCCCEEEEEEECCCHHHHH Q ss_conf 2768999988995179465348816519855323715553---5533088989999999779984149999488889973 Q gi|254780664|r 47 GGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYG---GILSIRDNPAHMKFMQDHELESIDLVVVNLYPFEESF 123 (536) Q Consensus 47 gGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~g---gIL~~r~~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v 123 (536) ..|.++|+++|+++.. ++| .|.+.+-+ -++.- +++|+...-+.|.-.+-++=-|+. .| T Consensus 17 e~~~~ll~~aGl~~~~-----------~~R--~l~~~~~~~~i~~~~~--R~~DIp~yV~~G~~DlGI~G~D~l--~E-- 77 (287) T PRK00489 17 EPALELLARAGLKIRL-----------DSR--DLIATIEDNPIEVLFL--RPDDIPGYVADGVVDLGITGEDLL--EE-- 77 (287) T ss_pred HHHHHHHHHCCCCCCC-----------CCC--EEEEECCCCCEEEEEE--CHHHHHHHHHCCCCCEEEEEEEEH--HH-- T ss_conf 8999999987996578-----------985--2687728998799998--879899999769977755133121--20-- Q ss_pred CCCCCHHHHHHCCCCCCHHHHHHHHCC Q ss_conf 176531123200233434699987401 Q gi|254780664|r 124 CREDDYYTMVENIDIGGPSMIRAAAKN 150 (536) Q Consensus 124 ~~~~~~~~~IEnIDIGGpsmiRAAAKN 150 (536) ++.. -..+...+.|...|.=|+-++ T Consensus 78 -~~~~-v~~~~~LgfG~crlvvAvP~~ 102 (287) T PRK00489 78 -SDAD-VEELLDLGFGKCRLVLAVPED 102 (287) T ss_pred -CCCC-CEEEECCCCCCEEEEEEECCC T ss_conf -6898-169970688748999995177 No 40 >pfam04273 DUF442 Putative phosphatase (DUF442). Although this domain is uncharacterized it seems likely that it performs a phosphatase function. Probab=73.86 E-value=7.5 Score=19.48 Aligned_cols=17 Identities=18% Similarity=0.554 Sum_probs=6.2 Q ss_pred CCCEEEEC--CCCCCHHHH Q ss_conf 99599938--987798899 Q gi|254780664|r 499 GVTAVIQP--GGSVRDSEA 515 (536) Q Consensus 499 Gi~aIiqP--GGSirD~ev 515 (536) |++++-.| +|.+.++.| T Consensus 57 Gl~y~~iPv~~~~~t~~~v 75 (110) T pfam04273 57 GLAYRFIPVISGQITEADV 75 (110) T ss_pred CCEEEEEECCCCCCCHHHH T ss_conf 9979996447789899999 No 41 >TIGR00107 deoD purine nucleoside phosphorylase; InterPro: IPR004402 Purine nucleoside phosphorylase 2.4.2.1 from EC also called inosine phosphorylase (pnp) cleave guanosine or inosine to respective bases and sugar-1-phosphate molecules. These enzymes are involved in the salvage of nucleotides and nucleosides.; GO: 0004731 purine-nucleoside phosphorylase activity, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=72.35 E-value=5.4 Score=20.50 Aligned_cols=80 Identities=25% Similarity=0.308 Sum_probs=60.9 Q ss_pred HHHHHHHHHHHHHCCCCCEEEEEECC-EEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCCC-CH Q ss_conf 98889999998600566779999689-38997296888899999999998764200122223487099973668685-55 Q gi|254780664|r 414 ELRDMKFAFKVVKHVKSNAVVYAKDG-RTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSVIASEAFYPFP-DG 491 (536) Q Consensus 414 e~~dL~FA~kv~K~vkSNAIv~ak~~-~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSDAFFPF~-D~ 491 (536) .++|++.|.+.+---|-|-|-+...- =...+-. -+=++.|.+-|+..+.+ .-.|.|..+|+|+-++ |- T Consensus 96 kl~D~iiA~~A~tdS~~nr~~f~~~P~D~~~~Ad-----f~L~~~A~~tAk~~g~~-----~~vG~vF~~D~FY~~~~~~ 165 (234) T TIGR00107 96 KLRDVIIAMGASTDSKVNRVRFVEVPEDFAAIAD-----FELVELAYDTAKALGID-----VRVGNVFSADAFYSPDKDV 165 (234) T ss_pred CEEEEEEECCCCCHHHHHHHHCCCCCCCCCCCCC-----HHHHHHHHHHHHHCCCC-----EEECCEECCCCCCCCCHHH T ss_conf 6068897114232233555505687644334789-----89999999999867972-----7765233111130744889 Q ss_pred HHHHHHCCCCEE Q ss_conf 899997199599 Q gi|254780664|r 492 IVEAIKAGVTAV 503 (536) Q Consensus 492 ie~aa~~Gi~aI 503 (536) ++.+.++|+=+| T Consensus 166 ~~~~~ky~~LaV 177 (234) T TIGR00107 166 FDLMAKYGILAV 177 (234) T ss_pred HHHHHHCCCCEE T ss_conf 999987598722 No 42 >pfam00591 Glycos_transf_3 Glycosyl transferase family, a/b domain. This family includes anthranilate phosphoribosyltransferase (TrpD), thymidine phosphorylase. All these proteins can transfer a phosphorylated ribose substrate. Probab=71.54 E-value=5.4 Score=20.55 Aligned_cols=19 Identities=16% Similarity=0.238 Sum_probs=8.7 Q ss_pred HHEECCCCHHHHHHHHCCC Q ss_conf 1112687989998851245 Q gi|254780664|r 343 AIIAPTLSEEAADVLAKKP 361 (536) Q Consensus 343 vIiAP~f~~eAleiL~~KK 361 (536) .+-||.|-|....+..-+| T Consensus 78 FlfAp~~hPamk~v~~~R~ 96 (254) T pfam00591 78 FLFAPNYHPAMKHVAPVRK 96 (254) T ss_pred EECCHHHCHHHHHHHHHHH T ss_conf 8641120989999999998 No 43 >PRK00945 acetyl-CoA decarbonylase/synthase complex subunit epsilon; Provisional Probab=70.96 E-value=4.2 Score=21.30 Aligned_cols=35 Identities=20% Similarity=0.377 Sum_probs=30.1 Q ss_pred CCCHHHHHHHHHH-CCCEEEEEHHHHHHHHHCCCCE Q ss_conf 5686999999998-7999999427689999889951 Q gi|254780664|r 26 KTGVVEFASRLLS-RGIKIISTGGTCQLLEEEGIPV 60 (536) Q Consensus 26 Ktgl~~la~~L~~-~g~~iisTgGTa~~l~~~gi~v 60 (536) ..-|.+++..+.+ .++.|++||++.+-+.+.|+.+ T Consensus 48 d~elle~~iki~k~~~ipi~aTg~s~~~l~e~g~~~ 83 (171) T PRK00945 48 DEELLDRAIKIAKAKNIPVAATGSSYKGLIEKGVKS 83 (171) T ss_pred CHHHHHHHHHHHHHCCCCEEEECCCCHHHHHCCCCH T ss_conf 889999999999874985687445205456458871 No 44 >COG2240 PdxK Pyridoxal/pyridoxine/pyridoxamine kinase [Coenzyme metabolism] Probab=70.02 E-value=6.4 Score=20.01 Aligned_cols=79 Identities=22% Similarity=0.212 Sum_probs=57.8 Q ss_pred CHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCC---HHHHHHHHHHHHCCCCCCCEEEEECC Q ss_conf 012105898883122212001167764245567279996156865654520---16899999985332132004555427 Q gi|254780664|r 250 HAVLVQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATAD---TLVEAYRRALSCDPISAFGGIIAFNR 326 (536) Q Consensus 250 ~~~ql~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~---~~~~Ay~~A~~~DP~SAFGGIIa~N~ 326 (536) ..++|.|++ +||..|+-.|.+...+ ..|..++|+|..+-|...+. ....+....+.+ |...|. +|. T Consensus 146 ELe~Ltg~~--~~~~~da~~aa~~L~~----~gp~~vlVTS~~~~~~~~~~~~~~~~~~~~~~h~~-~~v~~~----~~G 214 (281) T COG2240 146 ELEILTGKP--LNTLDDAVKAARKLGA----DGPKIVLVTSLSRAGMSTGNFEMLGKSAELAWHIS-PLVPFI----PNG 214 (281) T ss_pred HHHHHHCCC--CCCHHHHHHHHHHHHH----CCCCEEEEECCCCCCCCCCEEEEECCCHHHHHHHH-HCCCCC----CCC T ss_conf 999984988--7888999999999751----48988999534536787734999613122332343-227889----998 Q ss_pred CCCHHHHHHHHHH Q ss_conf 5421457878865 Q gi|254780664|r 327 EVDQEVAKEVIKV 339 (536) Q Consensus 327 ~vd~~~A~~I~~~ 339 (536) +=|.-+|..+.+. T Consensus 215 tGDL~sallla~l 227 (281) T COG2240 215 TGDLFSALLLARL 227 (281) T ss_pred CHHHHHHHHHHHH T ss_conf 2499999999999 No 45 >COG0800 Eda 2-keto-3-deoxy-6-phosphogluconate aldolase [Carbohydrate transport and metabolism] Probab=69.74 E-value=5.5 Score=20.50 Aligned_cols=86 Identities=17% Similarity=0.290 Sum_probs=42.7 Q ss_pred CCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHH Q ss_conf 22212001167764245567279996156865654520168999999853321320045554275421457878865431 Q gi|254780664|r 263 NINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFTE 342 (536) Q Consensus 263 NllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~E 342 (536) -..|.+-|..++.-+-+.+.+ |+=|--++||. .....+..+.+ | .-+|.--.-+|.+.++....-=.+ T Consensus 20 r~~~~e~a~~~a~Ali~gGi~-~IEITl~sp~a----~e~I~~l~~~~---p----~~lIGAGTVL~~~q~~~a~~aGa~ 87 (211) T COG0800 20 RGDDVEEALPLAKALIEGGIP-AIEITLRTPAA----LEAIRALAKEF---P----EALIGAGTVLNPEQARQAIAAGAQ 87 (211) T ss_pred EECCHHHHHHHHHHHHHCCCC-EEEEECCCCCH----HHHHHHHHHHC---C----CCEECCCCCCCHHHHHHHHHCCCC T ss_conf 708999999999999976987-69996479878----99999999867---4----658824556699999999985997 Q ss_pred HHEECCCCHHHHHHHHCC Q ss_conf 111268798999885124 Q gi|254780664|r 343 AIIAPTLSEEAADVLAKK 360 (536) Q Consensus 343 vIiAP~f~~eAleiL~~K 360 (536) -|+.|++++|-++.-..+ T Consensus 88 fiVsP~~~~ev~~~a~~~ 105 (211) T COG0800 88 FIVSPGLNPEVAKAANRY 105 (211) T ss_pred EEECCCCCHHHHHHHHHC T ss_conf 898999999999999867 No 46 >PRK05693 short chain dehydrogenase; Provisional Probab=69.07 E-value=9.5 Score=18.75 Aligned_cols=82 Identities=16% Similarity=0.302 Sum_probs=47.7 Q ss_pred EEEEEEEEECCCCH-HHHHHHHHHCCCEEEEEHHHHHHHHH---CCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHC Q ss_conf 36989987545686-99999999879999994276899998---899517946534881651985532371555355330 Q gi|254780664|r 16 VKTALISVHNKTGV-VEFASRLLSRGIKIISTGGTCQLLEE---EGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSI 91 (536) Q Consensus 16 ikrALiSV~dKtgl-~~la~~L~~~g~~iisTgGTa~~l~~---~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~ 91 (536) ||.+||.=... || ..+|+.|.+.|+.++.++-....|.+ .|+....+ |+|...++ T Consensus 1 MKvvlITGass-GIG~alA~~la~~G~~V~~~~R~~~~l~~l~~~~~~~~~~-Dvtd~~~i------------------- 59 (274) T PRK05693 1 MPVVLITGCSS-GIGRALADAFKAAGYEVWATARKAEDVEALAAAGFTAVQL-DVNDGAAL------------------- 59 (274) T ss_pred CCEEEECCCCC-HHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHCCCCEEEE-ECCCHHHH------------------- T ss_conf 99899948885-8999999999987999999979999999998489918998-46998999------------------- Q ss_pred CCCHHHHHHHHHCCCCCEEEEEEEC-----CCHHHH Q ss_conf 8898999999977998414999948-----888997 Q gi|254780664|r 92 RDNPAHMKFMQDHELESIDLVVVNL-----YPFEES 122 (536) Q Consensus 92 r~~~~~~~~l~~~~i~~IDlVvvNL-----YPF~~~ 122 (536) ..-.+++.+ ....||++|.|= -||+++ T Consensus 60 ---~~~~~~~~~-~~g~iDiLVNNAG~~~~g~~~~~ 91 (274) T PRK05693 60 ---ARLAEELEA-EHQGLDVLINNAGYGAMGPLLDG 91 (274) T ss_pred ---HHHHHHHHH-HCCCCCEEEECCCCCCCCCHHHH T ss_conf ---999999999-72997689988867788758987 No 47 >cd01286 deoxycytidylate_deaminase Deoxycytidylate deaminase domain. Deoxycytidylate deaminase catalyzes the deamination of dCMP to dUMP, providing the nucleotide substrate for thymidylate synthase. The enzyme binds Zn++, which is required for catalytic activity. The activity of the enzyme is allosterically regulated by the ratio of dCTP to dTTP not only in eukaryotic cells but also in T-even phage-infected Escherichia coli, with dCTP acting as an activator and dTTP as an inhibitor. Probab=68.01 E-value=9.9 Score=18.60 Aligned_cols=74 Identities=20% Similarity=0.270 Sum_probs=49.3 Q ss_pred EEEEEECCEEEEECCC-----CCC---------------------HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 7999968938997296-----888---------------------89999999999876420012222348709997366 Q gi|254780664|r 432 AVVYAKDGRTVGIGSG-----QTS---------------------RVDSTRFAAIKAHNISAQADVKSMTNGSVIASEAF 485 (536) Q Consensus 432 AIv~ak~~~tiGiGaG-----Q~s---------------------RVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSDAF 485 (536) .=||++++..||.|-. ..+ =+.|-..|+.+|...+ ..++|+.|-.- - T Consensus 23 GaVIV~~~~iIs~GyNG~p~g~~~~~~~~~~~~~~~~~~~~~~~~~iHAE~nAi~~a~~~~------~~~~g~tlYvT-~ 95 (131) T cd01286 23 GAVIVKDKRIISTGYNGSPSGLPHCAEVGCERDDLPSGEDQKCCRTVHAEQNAILQAARHG------VSLEGATLYVT-L 95 (131) T ss_pred EEEEEECCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCC------CCCCCCEEECC-C T ss_conf 8999979938966448998888874000331124443323222221789999999999829------98888666607-8 Q ss_pred CCCCCHHHHHHHCCCCEEEECCCCCCH Q ss_conf 868555899997199599938987798 Q gi|254780664|r 486 YPFPDGIVEAIKAGVTAVIQPGGSVRD 512 (536) Q Consensus 486 FPF~D~ie~aa~~Gi~aIiqPGGSirD 512 (536) ||=+.+...+..+||+-||-+..=-.| T Consensus 96 ~PC~~Ca~~Ii~aGI~rVvy~~~y~~~ 122 (131) T cd01286 96 FPCIECAKLIIQAGIKKVVYAEPYDDD 122 (131) T ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCC T ss_conf 977999999998299999989976998 No 48 >PRK06153 hypothetical protein; Provisional Probab=66.31 E-value=2.9 Score=22.44 Aligned_cols=19 Identities=21% Similarity=0.359 Sum_probs=9.7 Q ss_pred EEEEECCCCCHHHHHHHHH Q ss_conf 4555427542145787886 Q gi|254780664|r 320 GIIAFNREVDQEVAKEVIK 338 (536) Q Consensus 320 GIIa~N~~vd~~~A~~I~~ 338 (536) ||++--.-+|.+....+.. T Consensus 248 gIv~h~~~i~~~nv~~L~g 266 (393) T PRK06153 248 GIVSHPYYIDEDNVDELDG 266 (393) T ss_pred CCCCCCEEECCCHHHHHCC T ss_conf 8424623763202556439 No 49 >COG0813 DeoD Purine-nucleoside phosphorylase [Nucleotide transport and metabolism] Probab=65.97 E-value=11 Score=18.32 Aligned_cols=154 Identities=18% Similarity=0.130 Sum_probs=72.8 Q ss_pred HHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCHHHHHCCCCC-CCCCCCCCCCCCCC Q ss_conf 45787886543111126879899988512455315631676677766640203210223101233-45310001246777 Q gi|254780664|r 331 EVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDN-VVDNKELTVVTKRS 409 (536) Q Consensus 331 ~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~-~~~~~~~~vVT~~~ 409 (536) --|+.|.+.|+|-.+--.=....+-.--..|.-|+-...-. .-.+++ +|+---|+|+..- .+ ..+-|--. T Consensus 24 lRAK~iAetfLe~~~~vnevR~mlgfTGtYKGk~iSimg~G--mGipS~---sIY~~ELi~~y~Vk~i----IRvGt~Ga 94 (236) T COG0813 24 LRAKYIAETFLENAVCVNEVRGMLGFTGTYKGKKISVMGHG--MGIPSI---SIYSRELITDYGVKKI----IRVGTCGA 94 (236) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCEEEEEEEC--CCCCCH---HHHHHHHHHHHCCCEE----EEEECCCC T ss_conf 33789999987423022321000103413438478998734--887508---9999999987385348----99873434 Q ss_pred C-CHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCC-C Q ss_conf 8-6889888999999860056677999968938997296888899999999998764200122223487099973668-6 Q gi|254780664|r 410 P-TDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSVIASEAFY-P 487 (536) Q Consensus 410 p-t~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSDAFF-P 487 (536) . .+-.+.|+..|...+-.-+=|-+.+. +--.-++---+-++.|.+.|++.+.+. --|-|++||.|. | T Consensus 95 l~~~v~l~DvVia~~A~tds~~~~~~f~------~~df~~~ad~~Ll~~a~~~A~e~gi~~-----hvgnv~ssD~FY~~ 163 (236) T COG0813 95 LSEDVKLRDVVIAQGASTDSNVNRIRFK------PHDFAPIADFELLEKAYETAKELGIDT-----HVGNVFSSDLFYNP 163 (236) T ss_pred CCCCCCCCEEEEECCCCCCCHHHHCCCC------CCCCCCCCCHHHHHHHHHHHHHHCCCE-----EEEEEEEEECCCCC T ss_conf 5677740047886144674124330157------655445599999999999999839871-----55216643013578 Q ss_pred CCCHHHHHHHCCCCEEE Q ss_conf 85558999971995999 Q gi|254780664|r 488 FPDGIVEAIKAGVTAVI 504 (536) Q Consensus 488 F~D~ie~aa~~Gi~aIi 504 (536) =++-.+.+++.||.+|= T Consensus 164 ~~~~~~~~~~~gvlaVe 180 (236) T COG0813 164 DTEMFDLMAKYGVLAVE 180 (236) T ss_pred CHHHHHHHHHHCCCEEE T ss_conf 77799999983973787 No 50 >COG0276 HemH Protoheme ferro-lyase (ferrochelatase) [Coenzyme metabolism] Probab=65.47 E-value=11 Score=18.26 Aligned_cols=19 Identities=32% Similarity=0.319 Sum_probs=7.0 Q ss_pred HHHHHHHHCCCCCEEEECC Q ss_conf 8999885124553156316 Q gi|254780664|r 351 EEAADVLAKKPSMRFLKTS 369 (536) Q Consensus 351 ~eAleiL~~KKnlRil~~~ 369 (536) ++|++-|++.-.-|++.++ T Consensus 105 ~~~v~~l~~~gv~~iv~~p 123 (320) T COG0276 105 EEAVEELKKDGVERIVVLP 123 (320) T ss_pred HHHHHHHHHCCCCEEEEEE T ss_conf 9999999971987599997 No 51 >PRK00194 hypothetical protein; Validated Probab=65.22 E-value=10 Score=18.51 Aligned_cols=32 Identities=22% Similarity=0.442 Sum_probs=21.2 Q ss_pred EEEEEEEEE--CCCCHH-HHHHHHHHCCCEEEEEH Q ss_conf 369899875--456869-99999998799999942 Q gi|254780664|r 16 VKTALISVH--NKTGVV-EFASRLLSRGIKIISTG 47 (536) Q Consensus 16 ikrALiSV~--dKtgl~-~la~~L~~~g~~iisTg 47 (536) |.||+|+|+ ||.||+ .+++.|.++|+.|+-.. T Consensus 1 m~~avITV~G~DrpGIVa~Vt~~La~~~~NI~Dis 35 (90) T PRK00194 1 MMKAIITVIGKDRVGIVAGVSTVLAELNVNILDIS 35 (90) T ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHCCCCEEECC T ss_conf 95699999879988789999999998699989551 No 52 >PRK06444 prephenate dehydrogenase; Provisional Probab=63.98 E-value=9.7 Score=18.67 Aligned_cols=54 Identities=15% Similarity=0.234 Sum_probs=45.1 Q ss_pred HHHHHHHHCCCEE---------EE--EHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHH Q ss_conf 9999999879999---------99--427689999889951794653488165198553237155 Q gi|254780664|r 31 EFASRLLSRGIKI---------IS--TGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKI 84 (536) Q Consensus 31 ~la~~L~~~g~~i---------is--TgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I 84 (536) -|+++|...|+++ +| .+-|...|+..|-...+|+.+--+++.+.|+|=-+||-- T Consensus 15 ~~~~~~~~~g~~v~i~k~D~ifiaVPI~~~~~iI~~~~~tiiDv~SVK~p~k~~~~~IIs~HPMF 79 (197) T PRK06444 15 VLCSILDDNGLGVYIKKADHAFLSVPIDAALNYIESYDNNFVEISSVKWPFKKYSGKIVSIHPLF 79 (197) T ss_pred HHHHHHCCCCCEEEECCCCEEEEEEEHHHHHHHHHHCCCEEEEEEECCCHHHHHCCCEEEECCCC T ss_conf 99999703784789727888999824588899998479738998863311777079879879987 No 53 >cd01987 USP_OKCHK USP domain is located between the N-terminal sensor domain and C-terminal catalytic domain of this Osmosensitive K+ channel histidine kinase family. The family of KdpD sensor kinase proteins regulates the kdpFABC operon responsible for potassium transport. The USP domain is homologous to the universal stress protein Usp Usp is a small cytoplasmic bacterial protein whose expression is enhanced when the cell is exposed to stress agents. Usp enhances the rate of cell survival during prolonged exposure to such conditions, and may provide a general "stress endurance" activity. Probab=63.78 E-value=12 Score=18.05 Aligned_cols=42 Identities=21% Similarity=0.376 Sum_probs=34.2 Q ss_pred HHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEECCCCC Q ss_conf 5899997199599938987798899999987597799747854 Q gi|254780664|r 491 GIVEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAMVFTGIRH 533 (536) Q Consensus 491 ~ie~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m~fTg~Rh 533 (536) ..++|.+.|...++-+|+.+- ++++++|.++++.++.-|.++ T Consensus 54 ~~~la~~lga~~~~~~~~d~~-~~I~~~A~~~~~t~IVlG~~~ 95 (124) T cd01987 54 ALRLAEELGAEVVTLPGDDVA-EAIVEFAREHNVTQIVVGKSR 95 (124) T ss_pred HHHHHHHCCCEEEEEECCCHH-HHHHHHHHHCCCCEEEECCCC T ss_conf 999999859989999479989-999999998499899976898 No 54 >pfam01976 DUF116 Protein of unknown function DUF116. This archaebacterial protein has no known function. The protein contains seven conserved cysteines and may also be an integral membrane protein. Probab=63.70 E-value=9.4 Score=18.78 Aligned_cols=40 Identities=23% Similarity=0.217 Sum_probs=14.4 Q ss_pred CCCCCHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEECCCH Q ss_conf 323715553553308898999999977998414999948888 Q gi|254780664|r 78 KTLHPKIYGGILSIRDNPAHMKFMQDHELESIDLVVVNLYPF 119 (536) Q Consensus 78 KTLHP~I~ggIL~~r~~~~~~~~l~~~~i~~IDlVvvNLYPF 119 (536) |..+|+.-=|+---||..+.+.++..++|+. +-|.|++|. T Consensus 103 ~~~~p~~IigVAC~~dL~~gi~~~~~~~ip~--~gV~l~~~g 142 (158) T pfam01976 103 KEYRPKAIIGVACERDLNEGIQDLKKKGIPV--QGVLLLRPG 142 (158) T ss_pred HHCCCCEEEEEECHHHHHHHHHHHHCCCCCE--EEEECCCCC T ss_conf 8729998999920489999999865089986--999759999 No 55 >PRK08228 L(+)-tartrate dehydratase subunit beta; Validated Probab=63.29 E-value=12 Score=17.98 Aligned_cols=121 Identities=15% Similarity=0.183 Sum_probs=57.2 Q ss_pred CCCCCCCCCCCCCCCCEEEECCCCCCCCCCHHHC-CCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCC--- Q ss_conf 2333343678810065057604776682101210-589888312221200116776424556727999615686565--- Q gi|254780664|r 221 VKKQEMRYGENPHQKAALYSTPEKKSGIAHAVLV-QGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGV--- 296 (536) Q Consensus 221 ~~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~~~~ql-~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGv--- 296 (536) +...+||-|+-=-=.+.+|.--+ .++.+.+ +|++|-+ +| -.. +|=|.-||-. T Consensus 12 edi~~L~vGD~V~LsG~iyTaRD----~AHkrl~e~g~~lP~--------------dl-----~g~-~iyh~GP~~~k~~ 67 (201) T PRK08228 12 EDLADIRVGDIIYLTGTLVTCRD----VAHRRLIELGRPLPV--------------DL-----NGG-AIFHAGPIVRKNG 67 (201) T ss_pred HHHHHCCCCCEEEEEEEEEEEHH----HHHHHHHHCCCCCCC--------------CC-----CCC-EEEEECCCCCCCC T ss_conf 99961889999999569998748----999999974999995--------------71-----337-6897468766799 Q ss_pred ------CCCCHHHHHHHHHHHHCC-CCCCCEEEEE-CCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCC-CCCEEEE Q ss_conf ------452016899999985332-1320045554-275421457878865431111268798999885124-5531563 Q gi|254780664|r 297 ------ATADTLVEAYRRALSCDP-ISAFGGIIAF-NREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKK-PSMRFLK 367 (536) Q Consensus 297 ------A~~~~~~~Ay~~A~~~DP-~SAFGGIIa~-N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~K-KnlRil~ 367 (536) |.|.| .+++..-..++ +..+|--+.+ -...+.++.+.+.+.=.=-..|++ .|-.++++. |..+.+- T Consensus 68 ~~~~i~s~GPT--TS~RMd~y~~~~l~~~G~~~~IGKGgrs~~~~~a~~~~gaVYl~a~G---GaaaL~~~~Ik~v~~V~ 142 (201) T PRK08228 68 DKWEMVSVGPT--TSMRMEKFEKEFIEQTGVKLIVGKGGMGPGTEEGCQEFKALHVVFPA---GCAVLAATQVEEIEDAQ 142 (201) T ss_pred CCCEEEEECCC--HHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHCCEEEEECCC---HHHHHHHHHCCEEEEEE T ss_conf 97234662686--05444677999998519269998587999999999976989997465---89999982042887870 Q ss_pred CCC Q ss_conf 167 Q gi|254780664|r 368 TSS 370 (536) Q Consensus 368 ~~~ 370 (536) +.. T Consensus 143 ~~d 145 (201) T PRK08228 143 WTD 145 (201) T ss_pred ECC T ss_conf 134 No 56 >COG1936 Predicted nucleotide kinase (related to CMP and AMP kinases) [Nucleotide transport and metabolism] Probab=62.60 E-value=9.6 Score=18.69 Aligned_cols=106 Identities=26% Similarity=0.343 Sum_probs=61.9 Q ss_pred EEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHH-------CCCCCCCCCCHHH--HHHHH Q ss_conf 99875456869999999987999999427689999889951794653488165-------1985532371555--35533 Q gi|254780664|r 20 LISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEI-------MGGRVKTLHPKIY--GGILS 90 (536) Q Consensus 20 LiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEi-------l~GRVKTLHP~I~--ggIL~ 90 (536) +|.=.--+|=-.+|+.|.++|++.++-+ .+.+++|+ +|+++|. ++.-.+-|+|.+. .+|+- T Consensus 4 ~ITGTPGvGKTT~~~~L~~lg~~~i~l~---el~~e~~~-------~~~~de~r~s~~vD~d~~~~~le~~~~~~~~Ivd 73 (180) T COG1936 4 AITGTPGVGKTTVCKLLRELGYKVIELN---ELAKENGL-------YTEYDELRKSVIVDVDKLRKRLEELLREGSGIVD 73 (180) T ss_pred EEECCCCCCHHHHHHHHHHHCCCEEEHH---HHHHHCCC-------EECCCCCCCEEEEEHHHHHHHHHHHHCCCCEEEE T ss_conf 9937999866879999998298466199---99986697-------1143776615886679999888887505884761 Q ss_pred CCCCHHHHHHHHHCCCCCEEEEEE-ECCCHH---HHHCCCCCHHHHHHCCC--CCCHHHH Q ss_conf 088989999999779984149999-488889---97317653112320023--3434699 Q gi|254780664|r 91 IRDNPAHMKFMQDHELESIDLVVV-NLYPFE---ESFCREDDYYTMVENID--IGGPSMI 144 (536) Q Consensus 91 ~r~~~~~~~~l~~~~i~~IDlVvv-NLYPF~---~~v~~~~~~~~~IEnID--IGGpsmi 144 (536) .|...| .+.+||||| -..|.. +--.++-+.+.+-||++ |=|.-+. T Consensus 74 -----~H~~hl----~~~~dlVvVLR~~p~~L~~RLk~RGy~~eKI~ENveAEi~~vi~~ 124 (180) T COG1936 74 -----SHLSHL----LPDCDLVVVLRADPEVLYERLKGRGYSEEKILENVEAEILDVILI 124 (180) T ss_pred -----CHHHHC----CCCCCEEEEECCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH T ss_conf -----266621----787888999718989999999876998789887799999999999 No 57 >PRK05773 3,4-dihydroxy-2-butanone 4-phosphate synthase; Validated Probab=62.34 E-value=13 Score=17.86 Aligned_cols=38 Identities=29% Similarity=0.392 Sum_probs=31.3 Q ss_pred HHHHHHHCCCC------EEEECCCCCCHHHHHHHHHHCCCEEEE Q ss_conf 58999971995------999389877988999999875977997 Q gi|254780664|r 491 GIVEAIKAGVT------AVIQPGGSVRDSEAITVADQHGIAMVF 528 (536) Q Consensus 491 ~ie~aa~~Gi~------aIiqPGGSirD~evI~aan~~gi~m~f 528 (536) +|+++.-+|.+ -|+..|||++-++..+.|++|++.++- T Consensus 165 sVdL~~lAGl~P~avicEilddg~~m~~~~~~~fA~~~~l~~is 208 (218) T PRK05773 165 SLILAEAAGLPPSAVIAEMLGEGESLSKEEAKRYARELGIPLVE 208 (218) T ss_pred HHHHHHHCCCCCEEEEEEEECCCCCCCHHHHHHHHHHCCCCEEE T ss_conf 99999982999618999996699865689999999983997998 No 58 >PRK08577 hypothetical protein; Provisional Probab=61.67 E-value=7 Score=19.70 Aligned_cols=64 Identities=16% Similarity=0.225 Sum_probs=48.6 Q ss_pred EEEEEEECCCC-HHHHHHHHHHCCCEEEEEHHHHHHHHHCCC----CEEEHHHHCCCCHHCCCCCCCCCCH Q ss_conf 98998754568-699999999879999994276899998899----5179465348816519855323715 Q gi|254780664|r 18 TALISVHNKTG-VVEFASRLLSRGIKIISTGGTCQLLEEEGI----PVTSVFDITKFPEIMGGRVKTLHPK 83 (536) Q Consensus 18 rALiSV~dKtg-l~~la~~L~~~g~~iisTgGTa~~l~~~gi----~v~~Vs~~TgfpEil~GRVKTLHP~ 83 (536) .--+-|.|+.| |-+++..|.++|+.|++|.- ..|+.-++ -+.+||.-+--|+-|.++.|-|-|- T Consensus 58 ei~l~v~DrpGvLA~is~~La~h~vdil~te~--~~l~r~e~ae~v~IvDvS~~~~~~~el~~~Lk~le~V 126 (135) T PRK08577 58 ELELVVEDRPGVLAKISGLLAEHGVDILATEC--EELKRGELAECVIIVDVSKSDIDLKELEEELKALEEV 126 (135) T ss_pred EEEEEEECCCCHHHHHHHHHHHCCCCEEEHHH--HEEECCCCEEEEEEEECCCCCCCHHHHHHHHHHCCCE T ss_conf 99999945975699999999874853111021--1000376100899998545779999999998724531 No 59 >PRK07102 short chain dehydrogenase; Provisional Probab=61.64 E-value=13 Score=17.77 Aligned_cols=40 Identities=20% Similarity=0.233 Sum_probs=29.6 Q ss_pred EEEEEEEEECCCCH-HHHHHHHHHCCCEEEEEHHHHHHHHHC Q ss_conf 36989987545686-999999998799999942768999988 Q gi|254780664|r 16 VKTALISVHNKTGV-VEFASRLLSRGIKIISTGGTCQLLEEE 56 (536) Q Consensus 16 ikrALiSV~dKtgl-~~la~~L~~~g~~iisTgGTa~~l~~~ 56 (536) ||++||.=.- +|| ..+|+.|.+.|++++-++-....|++. T Consensus 1 MK~vlITGas-sGIG~a~A~~la~~G~~v~l~~R~~~~l~~~ 41 (243) T PRK07102 1 MKKILIIGAT-SDIARACARRYAAAGARLYLAARDTERLERI 41 (243) T ss_pred CCEEEEECCC-HHHHHHHHHHHHHCCCEEEEEECCHHHHHHH T ss_conf 9979991574-5999999999998799899998988999999 No 60 >PRK06842 fumarate hydratase; Provisional Probab=59.93 E-value=14 Score=17.57 Aligned_cols=41 Identities=20% Similarity=0.102 Sum_probs=18.6 Q ss_pred CCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCC-CCCEEEECC Q ss_conf 75421457878865431111268798999885124-553156316 Q gi|254780664|r 326 REVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKK-PSMRFLKTS 369 (536) Q Consensus 326 ~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~K-KnlRil~~~ 369 (536) ...+.++.+.+.+.=.=-..|++ .|-.+|++. |..+++-.. T Consensus 103 G~~s~~~~~a~~~~gaVYl~~~G---G~aALla~~Ik~~~~v~~~ 144 (185) T PRK06842 103 GARNDEVIESIKKNKAVYFGAIG---GAAALIAKSIKKSEIIAYE 144 (185) T ss_pred CCCCHHHHHHHHCCCEEEEECCC---CHHHHHHHHEEEEEEEECC T ss_conf 87998999998629989998068---0999997435478789757 No 61 >PRK02261 methylaspartate mutase subunit S; Provisional Probab=59.74 E-value=14 Score=17.55 Aligned_cols=79 Identities=19% Similarity=0.118 Sum_probs=43.1 Q ss_pred CCHHHHHHHHHHCCCEEEEEHH------HHHHHHHCCCCEEEHHHHCCCCHHC-----C-----CCCCCCCCHHHHHHHH Q ss_conf 6869999999987999999427------6899998899517946534881651-----9-----8553237155535533 Q gi|254780664|r 27 TGVVEFASRLLSRGIKIISTGG------TCQLLEEEGIPVTSVFDITKFPEIM-----G-----GRVKTLHPKIYGGILS 90 (536) Q Consensus 27 tgl~~la~~L~~~g~~iisTgG------Ta~~l~~~gi~v~~Vs~~TgfpEil-----~-----GRVKTLHP~I~ggIL~ 90 (536) -|..-++..|...|||.+-+|- -...-.+.+..+.-+|.+.|.-+.+ + |+-. -|-+-||++. T Consensus 18 ~G~~iva~~l~~~GfeVi~lG~~~~pe~~v~~A~~~~ad~igiSsl~G~~~~~~~~l~~~l~~~g~~d--i~vvvGG~i~ 95 (137) T PRK02261 18 VGNKILDRALTEAGFEVINLGVMTSQEEFIDAAIETDADAILVSSLYGHGEIDCRGLREKCIEAGLGD--ILLYVGGNLV 95 (137) T ss_pred HHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCCC--CEEEECCCCC T ss_conf 88999999999789749846887999999999987399999971111266127999999999679999--9699836216 Q ss_pred --CCCCHHHHHHHHHCCCC Q ss_conf --08898999999977998 Q gi|254780664|r 91 --IRDNPAHMKFMQDHELE 107 (536) Q Consensus 91 --~r~~~~~~~~l~~~~i~ 107 (536) .-+-+.+.+.|+++|+. T Consensus 96 i~~~dp~~~~~~L~~~Gv~ 114 (137) T PRK02261 96 VGKHDFEEVEKKFKEMGFD 114 (137) T ss_pred CCCCCCHHHHHHHHHCCCC T ss_conf 7887839999999977979 No 62 >TIGR00511 aIF-2BII_fam translation initiation factor, putative, aIF-2BII family; InterPro: IPR005250 The delineation of this family is based in part on a discussion and neighbour-joining phylogenetic study , , of archaeal and other proteins homologous to the alpha, beta, and delta subunits of eukaryotic initiation factor 2B (eIF-2B), a five-subunit molecule that catalyzes GTP recycling for eIF-2. The archaeal proteins are related to the common ancestor of eIF-2B alpha, beta, and delta rather then specifically to any one of them, and that designation of particular archaeal members as corresponding to eIF-2B alpha or eIF-2B delta is imprecise. It has been suggested that designating the archaeal proteins as translation initiation factors remains unproven. So far, members of this family are found only in the euryarchaeota.. Probab=57.49 E-value=7.7 Score=19.43 Aligned_cols=32 Identities=25% Similarity=0.516 Sum_probs=19.3 Q ss_pred HHHHCCCCCCHHH-HHHHHCCCCCCEEEECHHHHHHHHHHHH Q ss_conf 2320023343469-9987401563135308999898898753 Q gi|254780664|r 131 TMVENIDIGGPSM-IRAAAKNHDYVTILTNPQDYPLFLAEMD 171 (536) Q Consensus 131 ~~IEnIDIGGpsm-iRAAAKN~~~V~Vi~dp~dY~~~~~el~ 171 (536) |-|-+..|=|-+= -|+|||-- ..|..=++.|+ T Consensus 9 EKI~~ME~RGA~RI~R~AA~~L---------~~yA~K~Shln 41 (303) T TIGR00511 9 EKIKSMEVRGAARIARAAAKAL---------KKYALKISHLN 41 (303) T ss_pred HHHHCCCCCCHHHHHHHHHHHH---------HHHHHHHCCCC T ss_conf 5553121232468999999999---------99987501012 No 63 >TIGR02296 HpaC 4-hydroxyphenylacetate 3-monooxygenase, reductase component; InterPro: IPR011982 This entry identifies the reductase component (HpaC) of 4-hydroxyphenylacetate 3-monooxygenase . This enzyme catalyses the first step (hydroxylation at the 3-position) in the degradation of 4-hydroxyphenylacetate to succinate and pyruvate. 4-hydroxyphenylacetate arises from the degradation of tyrosine. These reductases catalyse the reduction of free flavins by NADPH. The flavin is then utilised by the large subunit of the monooxygenase.; GO: 0016651 oxidoreductase activity acting on NADH or NADPH, 0051287 NAD binding, 0042537 benzene and derivative metabolic process. Probab=57.41 E-value=3.3 Score=22.05 Aligned_cols=31 Identities=35% Similarity=0.562 Sum_probs=28.1 Q ss_pred HHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCC Q ss_conf 9879999994276899998899517946534881 Q gi|254780664|r 37 LSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFP 70 (536) Q Consensus 37 ~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~Tgfp 70 (536) ...+|.||-|+|-|-.. ||.+|.|-.+|.-| T Consensus 8 LaAaVnIiTT~G~AG~~---GIT~TAVCSVTDtP 38 (157) T TIGR02296 8 LAAAVNIITTNGIAGKC---GITATAVCSVTDTP 38 (157) T ss_pred HHCCCEEEECCCCCCCE---EEEEEEEECCCCCC T ss_conf 33464148739987723---22277751665887 No 64 >cd03822 GT1_ecORF704_like This family is most closely related to the GT1 family of glycosyltransferases. ORF704 in E. coli has been shown to be involved in the biosynthesis of O-specific mannose homopolysaccharides. Probab=56.75 E-value=8.4 Score=19.14 Aligned_cols=30 Identities=30% Similarity=0.500 Sum_probs=16.1 Q ss_pred EEEEEEEC--CCCHH----HHHHHHHHCCCE--EEEEH Q ss_conf 98998754--56869----999999987999--99942 Q gi|254780664|r 18 TALISVHN--KTGVV----EFASRLLSRGIK--IISTG 47 (536) Q Consensus 18 rALiSV~d--Ktgl~----~la~~L~~~g~~--iisTg 47 (536) =|+||-|- +.|++ +|++.|.++|.+ +++.+ T Consensus 2 I~~i~~ypP~~GGi~~~~~~La~~L~~~Gh~V~v~~~~ 39 (366) T cd03822 2 IALVSPYPPRKCGIATFTTDLVNALSARGPDVLVVSVA 39 (366) T ss_pred EEEEECCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 89991899999838999999999998679989999588 No 65 >TIGR00216 ispH_lytB 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; InterPro: IPR003451 Terpenes are among the largest groups of natural products and include compounds such as vitamins, cholesterol and carotenoids. The biosynthesis of all terpenoids begins with one or both of the two C5 precursors of the pathway: isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). In animals, fungi, and certain bacteria, the synthesis of IPP and DMAPP occurs via the well-known mevalonate pathway, however, a second, nonmevalonate terpenoid pathway has been identified in many eubacteria, algae and the chloroplasts of higher plants . LytB(IspH) catalyses the conversion of 1-hydroy-2-methyl-2-(E)-butenyl 4-diphosphate into IPP and DMAPP in this second pathway The enzyme appears to be responsible for a branch-step in the nonmevalonate pathway, in that IPP and DMAPP are produced in parallel from a single precursor although the exact mechanism of this is not currently fully understood . Escherichia coli LytB protein had been found to regulate the activity of RelA (guanosine 3',5'-bispyrophosphate synthetase I), which in turn controls the level of a regulatory metabolite. It is involved in penicillin tolerance and the stringent response .; GO: 0019288 isopentenyl diphosphate biosynthetic process mevalonate-independent pathway. Probab=56.26 E-value=7.9 Score=19.33 Aligned_cols=35 Identities=29% Similarity=0.399 Sum_probs=30.5 Q ss_pred HCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEEC Q ss_conf 519855323715553553308898999999977998414999948 Q gi|254780664|r 72 IMGGRVKTLHPKIYGGILSIRDNPAHMKFMQDHELESIDLVVVNL 116 (536) Q Consensus 72 il~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~~~i~~IDlVvvNL 116 (536) -.+++|=|+||-|| |+.=++.|++.|+..+|.| |+ T Consensus 29 ~~g~~vyt~g~IvH--------N~~v~~~L~~~g~~~~D~~--~~ 63 (354) T TIGR00216 29 ELGKPVYTLGEIVH--------NPQVVERLREKGVVIFDVV--NL 63 (354) T ss_pred CCCCCEEEECCCCC--------CHHHHHHHHHCCCEEECCC--HH T ss_conf 27897788576033--------7589999984797775220--00 No 66 >pfam11009 DUF2847 Protein of unknown function (DUF2847). Some members in this bacterial family of proteins with unknown function are annotated as YtxJ, a putative general stress protein. This cannot be confirmed. Probab=56.25 E-value=16 Score=17.15 Aligned_cols=52 Identities=17% Similarity=0.317 Sum_probs=32.9 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHCC--CCCCCEEEEECCCCCHHHHHHH Q ss_conf 27999615686565452016899999985332--1320045554275421457878 Q gi|254780664|r 283 AACVIVKHMNPCGVATADTLVEAYRRALSCDP--ISAFGGIIAFNREVDQEVAKEV 336 (536) Q Consensus 283 Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP--~SAFGGIIa~N~~vd~~~A~~I 336 (536) ..++|+||.+-||++. ....-|++-+..++ +..|==.|.-+|++--..|+.. T Consensus 19 ~pv~ifKHSt~C~IS~--~a~~~~e~~~~~~~~~~~~y~ldv~~~R~vSn~IAe~~ 72 (104) T pfam11009 19 KPVLIFKHSTTCPISR--MALKQFEAFAEEDEEDLPVYYLDVQEYRDVSNEIAEKF 72 (104) T ss_pred CCEEEEECCCCCCCHH--HHHHHHHHHHHCCCCCCCEEEEEEEECCHHHHHHHHHH T ss_conf 8289994399872109--99999998862552348879999641855789999995 No 67 >TIGR02906 spore_CotS spore coat protein, CotS family; InterPro: IPR014255 Members of this entry include the spore coat proteins CotS and YtaA from Bacillus subtilis and, from other endospore-forming bacteria, homologues that are more closely related to these two than to the spore coat proteins YutH (IPR014254 from INTERPRO) and YsxE (IPR014253 from INTERPRO). The CotS family is more broadly distributed than YutH or YsxE, but still is not universal among spore-formers.. Probab=56.19 E-value=5.2 Score=20.63 Aligned_cols=107 Identities=17% Similarity=0.158 Sum_probs=57.8 Q ss_pred CCCEEEECCCCCCCCCCHHHCCCCCCC-CCCCCCCCHHHHHHHHCCCCCCCEEEEECC-CCCCCCCCCC----------- Q ss_conf 065057604776682101210589888-312221200116776424556727999615-6865654520----------- Q gi|254780664|r 234 QKAALYSTPEKKSGIAHAVLVQGKPLS-YNNINDLDAAFELVSEFRSQDCAACVIVKH-MNPCGVATAD----------- 300 (536) Q Consensus 234 Q~Aa~Y~~~~~~~~~~~~~ql~GKeLS-YNNllD~daA~~lv~ef~~~~~Pa~vIvKH-~NPCGvA~~~----------- 300 (536) +.+..|...+...-+ -.+.+.|+|++ |+|.-|+..|...+.+|... ..+= .-|=|.-... T Consensus 59 k~g~~y~~~n~~ly~-l~ew~eGrec~P~~n~~dl~~~~~~La~~H~a------S~Gy~~~p~~~~~~~~~~Lgkw~~~f 131 (321) T TIGR02906 59 KDGELYVKLNGDLYV-LTEWIEGRECDPFNNPIDLKKAAKALALLHEA------SKGYRVPPDGSKIRSKNELGKWPKQF 131 (321) T ss_pred CCCCCEEEECCEEEE-EECCCCCCCCCCCCCHHHHHHHHHHHHHHHHH------CCCCCCCCCCCCCCHHHHHCHHHHHH T ss_conf 888744654781899-84330264698888989999999999855650------66751077766410344302016799 Q ss_pred -----HHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCH Q ss_conf -----168999999853321320045554275421457878865431111268798 Q gi|254780664|r 301 -----TLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSE 351 (536) Q Consensus 301 -----~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~ 351 (536) .+..-...|...-|.|.|.-+. ...+|.-... .+.-.|-+-.-+|.+ T Consensus 132 ~~~~~~l~~~k~~~~~~~~~~~Fd~~y--l~~~D~~l~~--~~~~~~~l~~S~Y~~ 183 (321) T TIGR02906 132 EKRLEELEKFKKIAKEKKYKDEFDKLY--LKEVDYFLER--AKKALELLNKSKYYD 183 (321) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHH--HHHHHHHHHH--HHHHHHHHHCCCCHH T ss_conf 999999999999986212257003689--9887899999--999999863277226 No 68 >KOG3010 consensus Probab=55.74 E-value=8.3 Score=19.18 Aligned_cols=71 Identities=21% Similarity=0.270 Sum_probs=38.9 Q ss_pred CCCHHHHHHHHHHHHCCCCCCCEEEEECCCC---------CHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEEC Q ss_conf 5201689999998533213200455542754---------2145787886543111126879899988512455315631 Q gi|254780664|r 298 TADTLVEAYRRALSCDPISAFGGIIAFNREV---------DQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKT 368 (536) Q Consensus 298 ~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~v---------d~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~ 368 (536) ..+...+.|..|+-.=|.+=|--|-+++..- +-.+|..|.++ +|=|||-|.+++-|++++++++.|.... T Consensus 4 ~~~~~a~~Y~~ARP~YPtdw~~~ia~~~~~h~~a~DvG~G~Gqa~~~iae~-~k~VIatD~s~~mL~~a~k~~~~~y~~t 82 (261) T KOG3010 4 LFDKQAADYLNARPSYPTDWFKKIASRTEGHRLAWDVGTGNGQAARGIAEH-YKEVIATDVSEAMLKVAKKHPPVTYCHT 82 (261) T ss_pred CCCCCHHHHHHCCCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHH-HHHHEEECCCHHHHHHHHCCCCCCCCCC T ss_conf 002018888625889868999999842888645888456887114788875-4343130687999998611898620457 Q ss_pred C Q ss_conf 6 Q gi|254780664|r 369 S 369 (536) Q Consensus 369 ~ 369 (536) + T Consensus 83 ~ 83 (261) T KOG3010 83 P 83 (261) T ss_pred C T ss_conf 8 No 69 >cd06330 PBP1_Arsenic_SBP_like Periplasmic solute-binding domain of active transport proteins. Periplasmic solute-binding domain of active transport proteins found in bacteria and Archaea that is predicted to be involved in the efflux of toxic compounds. Members of this subgroup include proteins from Herminiimonas arsenicoxydans, which is resistant to arsenic and various heavy metals such as cadmium and zinc. Moreover, they show significant sequence similarity to the cluster of AmiC and active transport systems for short-chain amides and urea (FmdDEF), and thus are likely to exhibit a ligand-binding mode similar to that of the amide sensor protein AmiC from Pseudomonas aeruginosa. Probab=55.61 E-value=16 Score=17.08 Aligned_cols=20 Identities=10% Similarity=0.175 Sum_probs=9.1 Q ss_pred HHHHHEECCCCHHHHHHHHC Q ss_conf 43111126879899988512 Q gi|254780664|r 340 FTEAIIAPTLSEEAADVLAK 359 (536) Q Consensus 340 F~EvIiAP~f~~eAleiL~~ 359 (536) =.|+|+...+.+++..++++ T Consensus 195 ~pd~v~~~~~~~~~~~~~~q 214 (346) T cd06330 195 KPDAIFSSLWGGDLVTFVRQ 214 (346) T ss_pred CCCEEEEECCCHHHHHHHHH T ss_conf 99999990664579999999 No 70 >CHL00202 argB acetylglutamate kinase; Provisional Probab=55.13 E-value=15 Score=17.22 Aligned_cols=37 Identities=22% Similarity=0.089 Sum_probs=19.8 Q ss_pred CCCCCCCCHHHHHHHHCCCCH----HHHHHHHHCCCCCEEEEEE Q ss_conf 855323715553553308898----9999999779984149999 Q gi|254780664|r 75 GRVKTLHPKIYGGILSIRDNP----AHMKFMQDHELESIDLVVV 114 (536) Q Consensus 75 GRVKTLHP~I~ggIL~~r~~~----~~~~~l~~~~i~~IDlVvv 114 (536) =|=||.-=++-|.++.+.... +|+.-|+.+|+.+ |+| T Consensus 21 ~rgktfVIk~gG~~~~d~~l~~~~~~DialL~~lGik~---VlV 61 (284) T CHL00202 21 FRGRIMVIKYGGAAMKNLILKADIIKDILFLSCIGLKI---VVV 61 (284) T ss_pred HCCCEEEEEECCEEEECHHHHHHHHHHHHHHHHCCCEE---EEE T ss_conf 68998999989666637548999999999999889979---998 No 71 >pfam02679 ComA (2R)-phospho-3-sulfolactate synthase (ComA). In methanobacteria (2R)-phospho-3-sulfolactate synthase (ComA) catalyses the first step of the biosynthesis of coenzyme M from phosphoenolpyruvate (P-enolpyruvate). This novel enzyme catalyses the stereospecific Michael addition of sulfite to P-enolpyruvate, forming L-2-phospho-3-sulfolactate (PSL). It is suggested that the ComA-catalysed reaction is analogous to those reactions catalysed by beta-elimination enzymes that proceed through an enolate intermediate. Probab=55.08 E-value=16 Score=17.02 Aligned_cols=51 Identities=18% Similarity=0.115 Sum_probs=30.7 Q ss_pred CEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHH--------------HHHHHCCCCEEEHHHH Q ss_conf 136989987545686999999998799999942768--------------9999889951794653 Q gi|254780664|r 15 AVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTC--------------QLLEEEGIPVTSVFDI 66 (536) Q Consensus 15 ~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa--------------~~l~~~gi~v~~Vs~~ 66 (536) |+-..--+++++.-|.+..+-.+++|+.. ++|||. +..++.|+.+.+||+- T Consensus 43 K~g~gt~~l~p~~~l~eKI~l~~~~~V~v-~~GGtlfE~a~~~~~~d~y~~~~~~lGf~~iEiSdg 107 (245) T pfam02679 43 KFGWGTSALMPEDILKEKIDLAHEHGVYV-YTGGTLFEIAILQGKFDEYLRECKELGFDAIEISDG 107 (245) T ss_pred EECCCEEEECCHHHHHHHHHHHHHCCCEE-ECCCHHHHHHHHCCCHHHHHHHHHHCCCCEEEECCC T ss_conf 97687650178899999999999859948-479699999997383999999999869988995688 No 72 >PRK01388 arginine deiminase; Provisional Probab=54.94 E-value=13 Score=17.80 Aligned_cols=31 Identities=23% Similarity=0.290 Sum_probs=17.6 Q ss_pred CCCEEEEECCCCCCCCHHHHHHHCCCCEEEECCC Q ss_conf 4870999736686855589999719959993898 Q gi|254780664|r 475 TNGSVIASEAFYPFPDGIVEAIKAGVTAVIQPGG 508 (536) Q Consensus 475 ~~g~vlaSDAFFPF~D~ie~aa~~Gi~aIiqPGG 508 (536) .-|-|++.|- -.-.-+.+.++||+.|--||. T Consensus 359 ~Pg~Vv~Y~r---N~~Tn~~L~~~Gi~Vi~i~~s 389 (410) T PRK01388 359 APGVVVGYDR---NTVTNTLLRKAGIEVITIPGS 389 (410) T ss_pred CCCEEEEECC---CHHHHHHHHHCCCEEEEECCH T ss_conf 6988998568---789999999779989997024 No 73 >PRK07985 oxidoreductase; Provisional Probab=52.47 E-value=18 Score=16.73 Aligned_cols=45 Identities=16% Similarity=0.150 Sum_probs=24.2 Q ss_pred CCCCCCCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEE Q ss_conf 720157788677613698998754568699999999879999994 Q gi|254780664|r 2 DCFHRKDGDHGEIAVKTALISVHNKTGVVEFASRLLSRGIKIIST 46 (536) Q Consensus 2 ~~~~~~~~~~~~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisT 46 (536) ||=...--.+|+++=|+|||.=.-+-==...|+.|.+.|.+++-+ T Consensus 35 ~~~~~~~~~~GrL~gKvalVTGas~GIG~aiA~~lA~~GA~Vvi~ 79 (294) T PRK07985 35 DCGEKTYVGSGRLKDRKALVTGGDSGIGRAAAIAYAREGADVAIS 79 (294) T ss_pred CCCCCCCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEE T ss_conf 889878887777899979991726699999999999879999994 No 74 >TIGR02717 AcCoA-syn-alpha acetyl coenzyme A synthetase (ADP forming), alpha domain; InterPro: IPR014089 This group of ADP-dependent acetyl-CoA synthetases (ACS) act in the direction of acetate and ATP production in the organisms in which they have been characterised , , . In most species this protein is bifunctional, existing as fused alpha-beta domains. In Pyrococcus and related species, however, the domains exist as separate polypeptides. This entry represents the alpha (N-terminal) domain. In Pyrococcus and related species there appears to have been the development of a paralogous family such that four other proteins are close relatives. One of these (along with its beta-domain partner) was characterised as ACS-II showing specificity for phenylacetyl-CoA . This entry excludes non-ACS-I paralogs. . Probab=52.32 E-value=14 Score=17.51 Aligned_cols=173 Identities=18% Similarity=0.187 Sum_probs=71.6 Q ss_pred HHHHHHHHHEE-CCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCHHHHHCCCCCCCCCCCC-CCCCCCCCCHH Q ss_conf 88654311112-68798999885124553156316766777666402032102231012334531000-12467778688 Q gi|254780664|r 336 VIKVFTEAIIA-PTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDNVVDNKEL-TVVTKRSPTDQ 413 (536) Q Consensus 336 I~~~F~EvIiA-P~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~~~~~~~~-~vVT~~~pt~~ 413 (536) +.-.|.|.|-. +.|=+-|-++=++|| +-+|+... ... ..---.| .-|-|.=+.- ..+ .=| ++-.=|--|-+ T Consensus 209 ~I~~Y~Eg~~DG~~Fl~~A~~~s~~KP-iv~lKsG~--s~~-GakAA~S-HTGaLAGs~~-~y~-aaf~q~G~iRa~~~~ 281 (457) T TIGR02717 209 VILLYLEGIKDGRKFLKTAKEISKKKP-IVVLKSGT--SEA-GAKAASS-HTGALAGSDE-AYD-AAFKQAGVIRADSIE 281 (457) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHCCC-EEEEECCC--CHH-HHHHHHH-CCCHHHHHHH-HHH-HHHCCCCEEEEECHH T ss_conf 899971787041689999888630598-89993688--834-5676521-0231336689-999-874301438870177 Q ss_pred HHHHHHHHHHHHHCC-CCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHHHHH-----------HHCCCCCCCCCCEEEE Q ss_conf 988899999986005-6677999968938997296888899999999998764-----------2001222234870999 Q gi|254780664|r 414 ELRDMKFAFKVVKHV-KSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNI-----------SAQADVKSMTNGSVIA 481 (536) Q Consensus 414 e~~dL~FA~kv~K~v-kSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~-----------~~~~~~~~~~~g~vla 481 (536) ||=|+.-..---.+. |.|-|+++-|+- |.|.==+.|++..-+-+..-.++ ...........|+ +| T Consensus 282 ELfd~A~~L~~~~~~~~g~~~~IiTN~G--G~Gvia~D~~~~~Gl~L~~~~~~t~~~L~~~LP~~as~~NPVD~~Gs-DA 358 (457) T TIGR02717 282 ELFDLARLLSNQPLPPKGNRVAIITNAG--GPGVIATDACEEVGLELAELSEKTKEKLRNILPPEASIKNPVDVLGS-DA 358 (457) T ss_pred HHHHHHHHHHCCCCCCCCCEEEEEECCC--CHHHHHHHHHHHCCCEEECCCHHHHHHHHHHCCCCCCCCCCCEEEEC-CC T ss_conf 8899999983589899887699997896--16778765677749745558589999999747611477875125522-78 Q ss_pred ECCCCCCCCHHHHHHHC----CCCEEEECCCCCC-HHHHHHHHH Q ss_conf 73668685558999971----9959993898779-889999998 Q gi|254780664|r 482 SEAFYPFPDGIVEAIKA----GVTAVIQPGGSVR-DSEAITVAD 520 (536) Q Consensus 482 SDAFFPF~D~ie~aa~~----Gi~aIiqPGGSir-D~evI~aan 520 (536) .-.- |.+.++++++- ||=.|.+|.=.++ =+||-+.+= T Consensus 359 ~~~~--Y~~~l~~v~eD~nVd~~~vv~~~~a~~~~~~~va~~~~ 400 (457) T TIGR02717 359 TAER--YAKALKIVAEDENVDGVVVVLTPTAMTDKPEEVAKGII 400 (457) T ss_pred CHHH--HHHHHHHHHCCCCCCEEEEEEECCCHHHHHHHHHHHHH T ss_conf 9899--99999998348888889996425302467999999887 No 75 >COG2355 Zn-dependent dipeptidase, microsomal dipeptidase homolog [Amino acid transport and metabolism] Probab=51.93 E-value=18 Score=16.67 Aligned_cols=40 Identities=25% Similarity=0.336 Sum_probs=26.5 Q ss_pred EEEEECCCCCCCCCC-CCHHHHHHHHHHHHCCCCCCCEEEEECCCC Q ss_conf 799961568656545-201689999998533213200455542754 Q gi|254780664|r 284 ACVIVKHMNPCGVAT-ADTLVEAYRRALSCDPISAFGGIIAFNREV 328 (536) Q Consensus 284 a~vIvKH~NPCGvA~-~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~v 328 (536) .-+|.-|.||-.+-- ..|+.+.-.+|...- ||+|.+|.-. T Consensus 184 ~PviaSHSN~~al~~h~RNl~D~qlkaI~~~-----gGvIgv~~~~ 224 (313) T COG2355 184 APVVASHSNARALVDHPRNLSDEQLKAIAET-----GGVIGVNFIP 224 (313) T ss_pred CCEEEECCCCHHCCCCCCCCCHHHHHHHHHC-----CCEEEEEEEH T ss_conf 8668743772410588889889999999964-----9979998645 No 76 >cd01915 CODH Carbon monoxide dehydrogenase (CODH) is found in acetogenic and methanogenic organisms and is responsible for the synthesis and breakdown of acetyl-CoA, respectively. CODH has two types of metal clusters, a cubane [Fe4-S4] center (B-cluster) similar to that of hybrid cluster protein (HCP) and a Ni-Fe-S center (C-cluster) where carbon monoxide oxidation occurs. Bifunctional CODH forms a heterotetramer with acetyl-CoA synthase (ACS) consisting of two CODH and two ACS subunits while monofunctional CODH forms a homodimer. Bifunctional CODH reduces carbon dioxide to carbon monoxide and ACS then synthesizes acetyl-CoA from carbon monoxide, CoA, and a methyl group donated by another protein (CoFeSP), while monofunctional CODH oxidizes carbon monoxide to carbon dioxide. CODH and ACS each have a metal cluster referred to as the C- and A-clusters, respectively. Probab=51.64 E-value=15 Score=17.23 Aligned_cols=44 Identities=16% Similarity=0.233 Sum_probs=27.7 Q ss_pred HCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEE Q ss_conf 3321320045554275421457878865431111268798999885124553156 Q gi|254780664|r 312 CDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFL 366 (536) Q Consensus 312 ~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil 366 (536) ++|..-|-|.. --+.|.+.|.++.+.-+ ..|+|-+++++..++- T Consensus 345 ts~~ak~~ga~--hi~f~~~~a~e~A~~Iv---------~~AIe~fk~R~~~~v~ 388 (613) T cd01915 345 TSDVAKIPGAE--HIDFDPEEADESAKEII---------RMAIEAFKRRKKSKVY 388 (613) T ss_pred CCHHCCCCCCE--ECCCCCCCHHHHHHHHH---------HHHHHHHHHCCCCCCC T ss_conf 55110078981--04778345899999999---------9999998716998577 No 77 >TIGR01090 apt adenine phosphoribosyltransferase; InterPro: IPR005764 Adenine phosphoribosyltransferase (APRTase, 2.4.2.7 from EC) is a widely distributed enzyme, and its deficiency in humans causes the accumulation of 2,8-dihydroxyadenine. It is the sole catalyst for adenine recycling in most eukaryotes.AMP + diphosphate = adenine + 5-phospho-alpha-D-ribose 1-diphosphate ; GO: 0003999 adenine phosphoribosyltransferase activity, 0006168 adenine salvage. Probab=51.11 E-value=7.6 Score=19.43 Aligned_cols=17 Identities=18% Similarity=0.477 Sum_probs=8.9 Q ss_pred EEEECHHHHHHHHHHHH Q ss_conf 35308999898898753 Q gi|254780664|r 155 TILTNPQDYPLFLAEMD 171 (536) Q Consensus 155 ~Vi~dp~dY~~~~~el~ 171 (536) +++-||+-|..+++.|. T Consensus 23 plL~~~~~f~~~id~l~ 39 (175) T TIGR01090 23 PLLNNPELFRFLIDLLV 39 (175) T ss_pred CCCCCHHHHHHHHHHHH T ss_conf 01068778999999999 No 78 >COG4109 Predicted transcriptional regulator containing CBS domains [Transcription] Probab=51.09 E-value=19 Score=16.58 Aligned_cols=76 Identities=8% Similarity=0.050 Sum_probs=38.7 Q ss_pred CCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHH Q ss_conf 31222120011677642455672799961568656545201689999998533213200455542754214578788654 Q gi|254780664|r 261 YNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVF 340 (536) Q Consensus 261 YNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F 340 (536) +-|..-.-|...-++-++..+ -.-|+=-|.+-.|+-+..+++++.++.-.--- +-.+++...|+.+.+.. T Consensus 261 tv~~~tsVAsvaq~MiwE~ie-m~PVv~~n~~llGiitR~dvlk~lq~~q~qpq---------vget~~d~I~~~l~e~~ 330 (432) T COG4109 261 TVRAKTSVASVAQMMIWEGIE-MLPVVDSNNTLLGIITRQDVLKSLQMIQRQPQ---------VGETISDQIANNLSEKG 330 (432) T ss_pred EECCCCHHHHHHHHHHHCCCE-EEEEECCCCEEEEEEEHHHHHHHHHHHCCCCC---------CCCCHHHHHHHHHHHHC T ss_conf 542522688998888764552-65688378618989889999999987512886---------55447799985331313 Q ss_pred HHHHEE Q ss_conf 311112 Q gi|254780664|r 341 TEAIIA 346 (536) Q Consensus 341 ~EvIiA 346 (536) .|.-+. T Consensus 331 ~~~~~t 336 (432) T COG4109 331 DEYGVT 336 (432) T ss_pred CCCCEE T ss_conf 566248 No 79 >COG0547 TrpD Anthranilate phosphoribosyltransferase [Amino acid transport and metabolism] Probab=50.77 E-value=18 Score=16.66 Aligned_cols=66 Identities=20% Similarity=0.242 Sum_probs=36.1 Q ss_pred EEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHH-HHHHHHHHHHHEECCCCHHHHHHHHCCCCC Q ss_conf 9996156865654520168999999853321320045554275421457-878865431111268798999885124553 Q gi|254780664|r 285 CVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVA-KEVIKVFTEAIIAPTLSEEAADVLAKKPSM 363 (536) Q Consensus 285 ~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A-~~I~~~F~EvIiAP~f~~eAleiL~~KKnl 363 (536) +=|.||.|-.- ++..-. +|=+.|+| +|-..+.+.+ +.|.+.=+=-+-||.|.|.-..+.--++.| T Consensus 106 v~VaKHGnrs~-sSksGs---------aDvleaLG----v~l~~~~e~~~~~l~~~g~~FlfAp~~hp~~k~v~~vR~~L 171 (338) T COG0547 106 VPVAKHGNRSV-SSKSGS---------ADVLEALG----VNLELSPEQAARALEETGIGFLFAPAYHPAMKHVAPVRKEL 171 (338) T ss_pred CCEEEECCCCC-CCCCCH---------HHHHHHCC----CCCCCCHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHHHC T ss_conf 91776789877-887768---------99999759----98789999999999855948987611388999889999970 Q ss_pred E Q ss_conf 1 Q gi|254780664|r 364 R 364 (536) Q Consensus 364 R 364 (536) . T Consensus 172 G 172 (338) T COG0547 172 G 172 (338) T ss_pred C T ss_conf 8 No 80 >pfam09989 DUF2229 CoA enzyme activase uncharacterized domain (DUF2229). Members of this family include various bacterial hypothetical proteins, as well as CoA enzyme activases. The exact function of this domain has not, as yet, been defined. Probab=50.54 E-value=19 Score=16.52 Aligned_cols=95 Identities=20% Similarity=0.207 Sum_probs=61.8 Q ss_pred HHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCCC Q ss_conf 69999999987999999427689999889951794653488165198553237155535533088989999999779984 Q gi|254780664|r 29 VVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNPAHMKFMQDHELES 108 (536) Q Consensus 29 l~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~~~i~~ 108 (536) ..-+..++.++|++++-|.-|-+.+-+.|+....-+ |=|| ||-.| .|..+|.+.+. T Consensus 14 ~p~w~~fF~~LG~~VV~S~~t~k~~l~~G~~~~~~e--~C~P------~Kl~~--------------Ghv~~L~~kg~-- 69 (218) T pfam09989 14 YPFWFTFFTELGFEVVLSPPTNKEIYELGVETIPSE--TCYP------AKLFH--------------GHVADLLDKGV-- 69 (218) T ss_pred HHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHCCCC--CCHH------HHHHH--------------HHHHHHHHCCC-- T ss_conf 599999999859679989988399999886607887--4432------69988--------------99999986699-- Q ss_pred EEEEEEECCCHHHHHCCCCC-HHHHHHCCCCCCHHHHHHHHCCC Q ss_conf 14999948888997317653-11232002334346999874015 Q gi|254780664|r 109 IDLVVVNLYPFEESFCREDD-YYTMVENIDIGGPSMIRAAAKNH 151 (536) Q Consensus 109 IDlVvvNLYPF~~~v~~~~~-~~~~IEnIDIGGpsmiRAAAKN~ 151 (536) |-+ +||---.+.+... ...-.--.-.|-|-|+|++-.+- T Consensus 70 -D~I---F~P~~~~~~~~~~~~~~~~CP~~~g~Pd~i~~~~~~~ 109 (218) T pfam09989 70 -DYI---FYPRIVSERKEKSEDNHYNCPKVQGLPEMIKNNIDLL 109 (218) T ss_pred -CEE---EECCCCCCCCCCCCCCCCCCCEECCHHHHHHHHCCCC T ss_conf -999---9798845026656777432876566999999858632 No 81 >KOG0832 consensus Probab=50.45 E-value=19 Score=16.51 Aligned_cols=16 Identities=19% Similarity=0.322 Sum_probs=7.7 Q ss_pred HHHCCCCCCHHHHHHHHC Q ss_conf 320023343469998740 Q gi|254780664|r 132 MVENIDIGGPSMIRAAAK 149 (536) Q Consensus 132 ~IEnIDIGGpsmiRAAAK 149 (536) .+|+|+| +.|++|.+- T Consensus 44 ~~~~~~v--~~L~~agvH 59 (251) T KOG0832 44 VPELISV--EELFNAGVH 59 (251) T ss_pred CHHHCCH--HHHHHCCCC T ss_conf 5655029--999741241 No 82 >PRK10786 ribD bifunctional diaminohydroxyphosphoribosylaminopyrimidine deaminase/5-amino-6-(5-phosphoribosylamino)uracil reductase; Provisional Probab=49.66 E-value=20 Score=16.43 Aligned_cols=25 Identities=24% Similarity=0.303 Sum_probs=10.5 Q ss_pred CCCCHHHHHHHHCCCCHHHHHHHHHCCCCC Q ss_conf 237155535533088989999999779984 Q gi|254780664|r 79 TLHPKIYGGILSIRDNPAHMKFMQDHELES 108 (536) Q Consensus 79 TLHP~I~ggIL~~r~~~~~~~~l~~~~i~~ 108 (536) ||-|--|-| | .|-..+.+-+.||+. T Consensus 71 TLEPC~H~G----k-TpPC~~aIi~agI~r 95 (367) T PRK10786 71 TLEPCSHHG----R-TPPCCDALIAAGVAR 95 (367) T ss_pred EECCCCCCC----C-CCHHHHHHHHHCCCE T ss_conf 835734679----8-945999999828988 No 83 >pfam02603 Hpr_kinase_N HPr Serine kinase N terminus. This family represents the N-terminal region of Hpr Serine/threonine kinase PtsK. This kinase is the sensor in a multicomponent phospho-relay system in control of carbon catabolic repression in bacteria. This kinase in unusual in that it recognizes the tertiary structure of its target and is a member of a novel family unrelated to any previously described protein phosphorylating enzymes. X-ray analysis of the full-length crystalline enzyme from Staphylococcus xylosus at a resolution of 1.95 A shows the enzyme to consist of two clearly separated domains that are assembled in a hexameric structure resembling a three-bladed propeller. The blades are formed by two N-terminal domains each, and the compact central hub assembles the C-terminal kinase domains. Probab=49.52 E-value=20 Score=16.41 Aligned_cols=40 Identities=23% Similarity=0.205 Sum_probs=23.3 Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCC Q ss_conf 05898883122212001167764245567279996156865 Q gi|254780664|r 254 VQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPC 294 (536) Q Consensus 254 l~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPC 294 (536) ++-+|++|-+-++-+-....+..+-+.+ |.|+||=++.++ T Consensus 54 ~G~~E~~yl~~l~~e~r~~~l~~l~~~~-~P~iIvt~~~~~ 93 (127) T pfam02603 54 LGKTELSYLDQLTEEERKERLEKLFSYD-TPCLIVTRGLEP 93 (127) T ss_pred ECHHHHHHHHHCCHHHHHHHHHHHHCCC-CCEEEEECCCCC T ss_conf 8579999999699999999999985759-988999799999 No 84 >pfam02593 DUF166 Uncharacterized ArCR, COG1810. Probab=48.79 E-value=20 Score=16.34 Aligned_cols=10 Identities=50% Similarity=0.826 Sum_probs=4.1 Q ss_pred EEECCCCCCC Q ss_conf 9961568656 Q gi|254780664|r 286 VIVKHMNPCG 295 (536) Q Consensus 286 vIvKH~NPCG 295 (536) |-|.-..||| T Consensus 144 V~VlR~aPCG 153 (215) T pfam02593 144 VRVLRGAPCG 153 (215) T ss_pred EEEEECCCCC T ss_conf 9898478986 No 85 >pfam05683 Fumerase_C Fumarase C-terminus. This family consists of the C terminal region of several bacterial fumarate hydratase proteins (FumA and FumB). Fumarase, or fumarate hydratase (EC 4.2.1.2), is a component of the citric acid cycle. In facultative anaerobes such as Escherichia coli, fumarase also engages in the reductive pathway from oxaloacetate to succinate during anaerobic growth. Probab=48.73 E-value=20 Score=16.33 Aligned_cols=29 Identities=28% Similarity=0.433 Sum_probs=19.2 Q ss_pred HHHHCCCCEEEECCCCCCHHHHHHHHHHCCC Q ss_conf 9997199599938987798899999987597 Q gi|254780664|r 494 EAIKAGVTAVIQPGGSVRDSEAITVADQHGI 524 (536) Q Consensus 494 ~aa~~Gi~aIiqPGGSirD~evI~aan~~gi 524 (536) +..+.|++++|--|| |.++++++|.+||- T Consensus 116 ~l~~~G~~~~IGKG~--rs~~~~~a~~~~~a 144 (205) T pfam05683 116 MLEKTGLLGMIGKGE--RGPAVIEAIKKHGA 144 (205) T ss_pred HHHHCCCEEEEECCC--CCHHHHHHHHHCCE T ss_conf 998439379997477--99999999997490 No 86 >pfam02401 LYTB LytB protein. The mevalonate-independent 2-C-methyl-D-erythritol 4-phosphate (MEP) pathway for isoprenoid biosynthesis is essential in many eubacteria, plants, and the malaria parasite. The LytB gene is involved in the trunk line of the MEP pathway. Probab=48.54 E-value=12 Score=18.10 Aligned_cols=31 Identities=13% Similarity=0.110 Sum_probs=26.4 Q ss_pred HCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCCCEE Q ss_conf 519855323715553553308898999999977998414 Q gi|254780664|r 72 IMGGRVKTLHPKIYGGILSIRDNPAHMKFMQDHELESID 110 (536) Q Consensus 72 il~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~~~i~~ID 110 (536) -.+++|-||||-|| |+.-++.|++.|+..+| T Consensus 26 ~~~~~vy~lG~iVH--------N~~vv~~L~~~Gv~~v~ 56 (280) T pfam02401 26 KSGGPVYVRGEIVH--------NRHVVNRLRERGAIFVE 56 (280) T ss_pred HCCCCEEEECCCCC--------CHHHHHHHHHCCCEEEC T ss_conf 76997798377764--------98999999988399813 No 87 >TIGR02153 gatD_arch glutamyl-tRNA(Gln) amidotransferase, subunit D; InterPro: IPR011878 This peptide is found only in the Archaea. It is part of a heterodimer, with GatD (IPR004414 from INTERPRO), that acts as an amidotransferase on misacylated Glu-tRNA(Gln) to produce Gln-tRNA(Gln). The analogous amidotransferase found in bacteria is the GatABC system, although GatABC homologs in the Archaea appear to act instead on Asp-tRNA(Asn) .; GO: 0006450 regulation of translational fidelity. Probab=48.01 E-value=6.1 Score=20.14 Aligned_cols=49 Identities=27% Similarity=0.485 Sum_probs=24.8 Q ss_pred CCCC--CCHHHHHHHCCCCEEEECC---CCCCHH--HHHHHHHHCCCEEEECCCCC Q ss_conf 6868--5558999971995999389---877988--99999987597799747854 Q gi|254780664|r 485 FYPF--PDGIVEAIKAGVTAVIQPG---GSVRDS--EAITVADQHGIAMVFTGIRH 533 (536) Q Consensus 485 FFPF--~D~ie~aa~~Gi~aIiqPG---GSirD~--evI~aan~~gi~m~fTg~Rh 533 (536) |||= |+=||.....|.+.||==| |-+-++ ++|+.|-+.|+.-++|.|=. T Consensus 292 fyPG~~p~il~~~~d~GykGiViEGTGLGHvs~~~ip~i~ra~d~Gv~V~MTSQCl 347 (413) T TIGR02153 292 FYPGLDPEILEFLVDKGYKGIVIEGTGLGHVSEDWIPSIKRATDDGVPVVMTSQCL 347 (413) T ss_pred ECCCCCHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEEEE T ss_conf 43898888999985187159998337875552358999999875896899961143 No 88 >PRK13805 bifunctional acetaldehyde-CoA/alcohol dehydrogenase; Provisional Probab=47.93 E-value=21 Score=16.24 Aligned_cols=304 Identities=17% Similarity=0.216 Sum_probs=141.9 Q ss_pred HHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHC------CCCCCHHHHHHHH----------------------- Q ss_conf 99999779984149999488889973176531123200------2334346999874----------------------- Q gi|254780664|r 98 MKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVEN------IDIGGPSMIRAAA----------------------- 148 (536) Q Consensus 98 ~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEn------IDIGGpsmiRAAA----------------------- 148 (536) .+.+.+.| .|-|+|.|==-|=-+. ..+++.+ +--|||.|+|||- T Consensus 162 ~~a~~~aG-aP~~li~~i~~ps~e~------t~~LM~h~~v~lilaTGg~~mVkaAysSGkPaigvG~GN~p~~Id~tAd 234 (862) T PRK13805 162 LDAAVAAG-APKDIIQWIEEPSVEA------TNALMNHPGIALILATGGPGMVKAAYSSGKPALGVGAGNTPVYIDKTAD 234 (862) T ss_pred HHHHHHCC-CCCCCEEECCCCCHHH------HHHHHCCCCCCEEEECCCHHHHHHHHCCCCCCEEECCCCCCEEEECCCC T ss_conf 99999859-8914365538899899------9998569994479955977999999707998165578987769727889 Q ss_pred -----------CCCCCC-------EEEECHHHHHHHHHHHHCCCCC-CCHHHHHHHHHHHHHH-----HHHHHHHHHHHH Q ss_conf -----------015631-------3530899989889875302345-6667889999999998-----742289999875 Q gi|254780664|r 149 -----------KNHDYV-------TILTNPQDYPLFLAEMDVNNGK-IPYNFRKKMARQAFSR-----TASYDTAICRWL 204 (536) Q Consensus 149 -----------KN~~~V-------~Vi~dp~dY~~~~~el~~~~g~-~~~~~R~~lA~kAF~~-----ta~YD~~Ia~~l 204 (536) |.|.+= .||++.+-|++++.+|+.+.+- .+.+..++|..-.|.. ...+=..=+.|+ T Consensus 235 i~~Aa~~Ii~sktFDng~iCasEq~viv~~~iyd~~~~~l~~~Gay~l~~eE~~kl~~~~~~~~~g~ln~~iVGk~a~~I 314 (862) T PRK13805 235 IKRAVNDILLSKTFDNGMICASEQSVIVDDEIYDEVKEEFKSHGAYFLNKKELKKLEKVIFNKANGALNADIVGQSAYKI 314 (862) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCEEECHHHHHHHHHHHCCCCCCCCCHHHHCCCHHHH T ss_conf 99999999862126688635677628862888999999999889798089999999987424889847825418789999 Q ss_pred HHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHCCCC--CC Q ss_conf 3310121134432101233334367881006505760477668210121058988831222120011677642455--67 Q gi|254780664|r 205 ANAESENFPDYLNITAVKKQEMRYGENPHQKAALYSTPEKKSGIAHAVLVQGKPLSYNNINDLDAAFELVSEFRSQ--DC 282 (536) Q Consensus 205 ~~~~~~~~p~~~~~~~~~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~GKeLSYNNllD~daA~~lv~ef~~~--~~ 282 (536) .+..+-+.|+.-.+-. +|-. . .....++.. +.|. --|++--.-|.+.|.+.+.++-+. -. T Consensus 315 A~~AGi~vP~~tkvLi--------~e~~---~-----vg~~~P~s~-EKLs-PVLa~y~~~d~~~A~~~~~~il~~~G~G 376 (862) T PRK13805 315 AEMAGFKVPEDTKILI--------AEVK---G-----VGESEPLSH-EKLS-PVLAMYKAKSFEDAVEKAEKLVEFGGLG 376 (862) T ss_pred HHHCCCCCCCCCEEEE--------EECC---C-----CCCCCCCCC-CCCC-CEEEEEEECCHHHHHHHHHHHHHCCCCC T ss_conf 9982998898975999--------9677---7-----788887631-4014-0059989589999999999999717998 Q ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHHHC-----CCCCCCEEEEECC-----------------CCCHHHHHHHHH-- Q ss_conf 2799961568656545201689999998533-----2132004555427-----------------542145787886-- Q gi|254780664|r 283 AACVIVKHMNPCGVATADTLVEAYRRALSCD-----PISAFGGIIAFNR-----------------EVDQEVAKEVIK-- 338 (536) Q Consensus 283 Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~D-----P~SAFGGIIa~N~-----------------~vd~~~A~~I~~-- 338 (536) .+|+| |++ .+.....|......- .-|+|||+=.++. .-|--+.+.+.+ T Consensus 377 HT~~I--Hs~------d~~~i~~fa~~~p~~RilVN~p~s~g~~G~~~n~l~pslTLGCGs~Ggns~s~Nv~~~hllNik 448 (862) T PRK13805 377 HTAVI--YTN------DDELVKEFGLRMKAARILVNTPSSQGGIGDLYNKLAPSLTLGCGSWGGNSISENVGPKHLLNIK 448 (862) T ss_pred CEEEE--ECC------CHHHHHHHHHHCCEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHH T ss_conf 62677--659------9999999985397538997278667654523358885612315688898676788788876455 Q ss_pred ---------HHHHHHEECCCCHHHHHHHHCCCC-CEEEECCCCCCCCCCCCEECCCCHHHHHCCCCCCCCC--CCCCCCC Q ss_conf ---------543111126879899988512455-3156316766777666402032102231012334531--0001246 Q gi|254780664|r 339 ---------VFTEAIIAPTLSEEAADVLAKKPS-MRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDNVVDN--KELTVVT 406 (536) Q Consensus 339 ---------~F~EvIiAP~f~~eAleiL~~KKn-lRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~~~~~--~~~~vVT 406 (536) +.+-|=----|++.++..|++-++ -|.+.+... . -+.-|++-+-.+. ++. -++.+-. T Consensus 449 rva~rr~n~~wfkvP~~iyf~~~~~~~l~~~~~~kralIVTD~-~---------~~~~G~vd~v~~~-L~~~gi~~~vFd 517 (862) T PRK13805 449 TVAKRRENMQWFKLPKKIYFERGSLPYLLDEPGKKRAFIVTDR-G---------MVELGYVDKVTDV-LRANGVETEVFS 517 (862) T ss_pred HHHHHCCCCEEEECCCEEEECCCHHHHHHHCCCCCEEEEECCH-H---------HHHCCCHHHHHHH-HHHCCCEEEEEC T ss_conf 7987634780553697467567829999866899779998785-1---------8767889999999-998698399966 Q ss_pred C--CCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHH Q ss_conf 7--778688988899999986005667799996893899729688889999999 Q gi|254780664|r 407 K--RSPTDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFA 458 (536) Q Consensus 407 ~--~~pt~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA 458 (536) + -.|+.+....- ....+--+ -...|++|.| |-+|+.|.. T Consensus 518 ~V~pdPt~~~V~~G---~~~~~~~~--------~D~IIalGGG--S~iDaAKai 558 (862) T PRK13805 518 EVEPDPTLSTVRKG---AELMRSFK--------PDTIIALGGG--SPMDAAKIM 558 (862) T ss_pred CCCCCCCHHHHHHH---HHHHHHCC--------CCEEEEECCC--CHHHHHHHH T ss_conf 98999397999999---99998649--------9999994783--488999999 No 89 >cd01968 Nitrogenase_NifE_I Nitrogenase_NifE_I: a subgroup of the NifE subunit of the NifEN complex: NifE forms an alpha2beta2 tetramer with NifN. NifE and NifN are structurally homologous to nitrogenase MoFe protein alpha and beta subunits respectively. NifEN participates in the synthesis of the iron-molybdenum cofactor (FeMoco) of the MoFe protein. NifB-co (an iron and sulfur containing precursor of the FeMoco) from NifB is transferred to the NifEN complex where it is further processed to FeMoco. The NifEN bound precursor of FeMoco has been identified as a molybdenum-free, iron- and sulfur- containing analog of FeMoco. It has been suggested that this NifEN bound precursor also acts as a cofactor precursor in nitrogenase systems which require a cofactor other than FeMoco: i.e. iron-vanadium cofactor (FeVco) or iron only cofactor (FeFeco). Probab=47.92 E-value=14 Score=17.44 Aligned_cols=32 Identities=13% Similarity=0.084 Sum_probs=14.0 Q ss_pred HHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEE Q ss_conf 58999971995999389877988999999875977997 Q gi|254780664|r 491 GIVEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAMVF 528 (536) Q Consensus 491 ~ie~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m~f 528 (536) --+.+.+.++--+| |+| +.. ..|.+.|++++- T Consensus 348 l~~~i~~~~~Dlii--g~s-~~~---~~a~rlGiP~~~ 379 (410) T cd01968 348 LKKLLKEKKADLLV--AGG-KER---YLALKLGIPFCD 379 (410) T ss_pred HHHHHHHCCCCEEE--ECC-CHH---HHHHHHCCCEEE T ss_conf 99998656999999--577-324---889980898898 No 90 >pfam07799 DUF1643 Protein of unknown function (DUF1643). The members of this family are all sequences found within hypothetical proteins expressed by various bacterial species. The region concerned is approximately 150 residues long. Probab=47.71 E-value=11 Score=18.28 Aligned_cols=49 Identities=22% Similarity=0.295 Sum_probs=22.2 Q ss_pred HHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCC Q ss_conf 99997799841499994888899731765311232002334346999874015 Q gi|254780664|r 99 KFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNH 151 (536) Q Consensus 99 ~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~ 151 (536) .-.+..|+. .++++|||||-.|- +..+...-.-|..---.-|+.++|.. T Consensus 39 ~fa~~~Gyg--g~~~~NLfa~rst~--P~~L~~~~dpig~~Nd~~i~~~~~~a 87 (135) T pfam07799 39 RFARAWGYG--GLVVVNLFAWRATD--PKDLRRAADPVGPENDAHLLRAAKWA 87 (135) T ss_pred HHHHHCCCC--EEEEEECCCCCCCC--HHHHHHCCCCCCHHHHHHHHHHHHHC T ss_conf 999877998--49998513121699--89997564887876999999998713 No 91 >TIGR02988 YaaA_near_RecF S4 domain protein YaaA; InterPro: IPR014330 This small protein has a single S4 domain (IPR002942 from INTERPRO); it is also found in bacterial S4 ribosomal proteins, some pseudouridine synthases and tyrosyl-tRNA synthetases. The S4 domain may bind RNA. Members of this protein are found almost exclusively in the Firmicutes, and almost invariably just a few nucleotides upstream of the gene for the DNA replication and repair protein RecF. The few members that are not near recF are found instead near dnaA and/or dnaN, the usual neighbours of recF, near the origin of replication. The conserved location suggests a possible role in replication in the Firmicutes lineage.. Probab=47.54 E-value=6.4 Score=19.98 Aligned_cols=35 Identities=29% Similarity=0.456 Sum_probs=24.6 Q ss_pred CCCHHHHHHHHHHCCCEEEEEHHHHH-HHHHCCCCEEE Q ss_conf 56869999999987999999427689-99988995179 Q gi|254780664|r 26 KTGVVEFASRLLSRGIKIISTGGTCQ-LLEEEGIPVTS 62 (536) Q Consensus 26 Ktgl~~la~~L~~~g~~iisTgGTa~-~l~~~gi~v~~ 62 (536) +|-.+.|-++|... .+|+|||.|| ||.++.+-|.. T Consensus 5 ~tEyITLgQlLK~~--~~i~sGG~AK~fL~e~~V~vNG 40 (60) T TIGR02988 5 KTEYITLGQLLKEL--GIIDSGGQAKWFLQENEVLVNG 40 (60) T ss_pred CCCEECHHHHHHHH--HHHCCHHHHHHHHHCCCEEECC T ss_conf 43416276786688--5622825788987069555478 No 92 >pfam00762 Ferrochelatase Ferrochelatase. Probab=46.09 E-value=22 Score=16.05 Aligned_cols=15 Identities=7% Similarity=0.257 Sum_probs=7.3 Q ss_pred HHEECCCCHHHHHHH Q ss_conf 111268798999885 Q gi|254780664|r 343 AIIAPTLSEEAADVL 357 (536) Q Consensus 343 vIiAP~f~~eAleiL 357 (536) +|+.|+|..|-+|.| T Consensus 255 ~v~p~gFv~D~lETl 269 (312) T pfam00762 255 VVVPIGFVSDHLETL 269 (312) T ss_pred EEECCCCCCCCHHHH T ss_conf 998884544246769 No 93 >PRK06180 short chain dehydrogenase; Provisional Probab=45.94 E-value=22 Score=16.03 Aligned_cols=150 Identities=20% Similarity=0.214 Sum_probs=77.6 Q ss_pred CCEEEEEEEEECCCCH-HHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCC Q ss_conf 6136989987545686-999999998799999942768999988995179465348816519855323715553553308 Q gi|254780664|r 14 IAVKTALISVHNKTGV-VEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIR 92 (536) Q Consensus 14 ~~ikrALiSV~dKtgl-~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r 92 (536) -.+|.+||.=.- +|| ..+|+.|.+.|+.++.++-....|++. .+..++++..+. .+- T Consensus 2 ~~~KvvlITGas-sGIG~aiA~~l~~~G~~Vi~~~R~~~~l~~l-------------~~~~~~~~~~~~--------~Dv 59 (277) T PRK06180 2 ASMKTWLITGVS-SGFGRALAQAALAAGHRVVGTVRSAAARRDF-------------EALHPGRALARV--------LDV 59 (277) T ss_pred CCCCEEEECCCC-CHHHHHHHHHHHHCCCEEEEEECCHHHHHHH-------------HHHCCCCEEEEE--------EEC T ss_conf 999889991787-3999999999998799999998999999999-------------986799579999--------837 Q ss_pred CCHHHHHHHHH---CCCCCEEEEEEEC-----CCHHHHHCCCCCHHHHHHCCC--C-CCHHHHHHHHC-----CCCCCEE Q ss_conf 89899999997---7998414999948-----888997317653112320023--3-43469998740-----1563135 Q gi|254780664|r 93 DNPAHMKFMQD---HELESIDLVVVNL-----YPFEESFCREDDYYTMVENID--I-GGPSMIRAAAK-----NHDYVTI 156 (536) Q Consensus 93 ~~~~~~~~l~~---~~i~~IDlVvvNL-----YPF~~~v~~~~~~~~~IEnID--I-GGpsmiRAAAK-----N~~~V~V 156 (536) .+++..+.+-+ ....+||++|.|= =||+++ +.++.-+.+| . |---|.|++.. ..-++.. T Consensus 60 td~~~v~~~v~~~~~~~G~iDvLVNNAG~~~~~~~e~~-----~~~~~~~~~~vN~~g~~~~~~a~lp~m~~~~~G~Ivn 134 (277) T PRK06180 60 TDFDAIDGVVADAEATVGPIDVLVNNAGYGHEGAIEES-----PLAEMRRQFEVNVFGAVAMIKAVLPGMRARRRGHIVN 134 (277) T ss_pred CCHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHC-----CHHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCEEEE T ss_conf 99999999999999981998699989977888863339-----9999999998853776544200488889658965775 Q ss_pred EECHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 308999898898753023456667889999999998742289 Q gi|254780664|r 157 LTNPQDYPLFLAEMDVNNGKIPYNFRKKMARQAFSRTASYDT 198 (536) Q Consensus 157 i~dp~dY~~~~~el~~~~g~~~~~~R~~lA~kAF~~ta~YD~ 198 (536) +++..-+.. .-| .+.=.--+.|..+|..+-+.+. T Consensus 135 isS~ag~~~-------~p~-~~~Y~aSK~Al~~lt~sLa~El 168 (277) T PRK06180 135 ITSMGGLIT-------MPG-IAYYCGSKFALEGISEALAKEV 168 (277) T ss_pred ECCHHHCCC-------CCC-CHHHHHHHHHHHHHHHHHHHHH T ss_conf 354665257-------999-8279999999999999999984 No 94 >pfam07905 PucR Purine catabolism regulatory protein-like family. The bacterial proteins found in this family are similar to the purine catabolism regulatory protein expressed by Bacillus subtilis (PucR). PucR is thought to be a transcriptional activator involved in the induction of the purine degradation pathway, and may contain a LysR-like DNA-binding domain. It is similar to LysR-type regulators in that it represses its own expression. The other members of this family are also annotated as being putative regulatory proteins. Probab=44.56 E-value=23 Score=15.89 Aligned_cols=43 Identities=21% Similarity=0.231 Sum_probs=30.3 Q ss_pred CCHHHHHHHCCCCE-EEECCCC--CCHHHHHHHHHHCCCEEEECCC Q ss_conf 55589999719959-9938987--7988999999875977997478 Q gi|254780664|r 489 PDGIVEAIKAGVTA-VIQPGGS--VRDSEAITVADQHGIAMVFTGI 531 (536) Q Consensus 489 ~D~ie~aa~~Gi~a-IiqPGGS--irD~evI~aan~~gi~m~fTg~ 531 (536) .+=|+.+++.|+.+ +|..|-- -=.+++|++||+++++++.... T Consensus 61 ~~~i~~L~~~g~agL~i~~g~~~~~iP~~~i~~a~~~~~Pli~iP~ 106 (122) T pfam07905 61 REFVRELAEAGAAGLGIKTGRYIPEIPEELIAAANRLGLPLIELPR 106 (122) T ss_pred HHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHCCCCEEEECC T ss_conf 9999999978964999942555466999999999974997799459 No 95 >pfam00383 dCMP_cyt_deam_1 Cytidine and deoxycytidylate deaminase zinc-binding region. Probab=44.42 E-value=24 Score=15.87 Aligned_cols=83 Identities=22% Similarity=0.301 Sum_probs=56.1 Q ss_pred HHHHHHHHHCC---CCC----EEEEEECCEEEEECCC-----CCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 99999986005---667----7999968938997296-----88889999999999876420012222348709997366 Q gi|254780664|r 418 MKFAFKVVKHV---KSN----AVVYAKDGRTVGIGSG-----QTSRVDSTRFAAIKAHNISAQADVKSMTNGSVIASEAF 485 (536) Q Consensus 418 L~FA~kv~K~v---kSN----AIv~ak~~~tiGiGaG-----Q~sRVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSDAF 485 (536) |..|....+.. .+| |+++..|+.+|+.|.- +..-+.|-..|+.+|..... ...++|+.+.+- - T Consensus 8 m~~A~~~A~~a~~~~~~~~VGaviv~~~g~ii~~g~N~~~~~~~~~~HAE~~Ai~~a~~~~~----~~~~~~~~lyvT-~ 82 (104) T pfam00383 8 MRLALEAAKRAYTPYSNFPVGAVIVKNDGGIIATGYNGENAGYDPTIHAERNAIRNAGRLGE----GIKLEGATLYVT-L 82 (104) T ss_pred HHHHHHHHHHCCCCCCCCCEEEEEECCCCCEEEEEECEEECCCCCCHHHHHHHHHHHHHHCC----CCCCCCCEEECC-C T ss_conf 99999999866798899978999992799799988570756999630189999999999669----971126323269-8 Q ss_pred CCCCCHHHHHHHCCCCEEEE Q ss_conf 86855589999719959993 Q gi|254780664|r 486 YPFPDGIVEAIKAGVTAVIQ 505 (536) Q Consensus 486 FPF~D~ie~aa~~Gi~aIiq 505 (536) .|=+-+-..+..+||+-|+- T Consensus 83 ePC~mC~~ai~~~gi~~Vvy 102 (104) T pfam00383 83 EPCGMCRQAIIESGIKKVVF 102 (104) T ss_pred CCHHHHHHHHHHHCCCEEEE T ss_conf 97699999999969199998 No 96 >LOAD_Ccd2 consensus Probab=43.38 E-value=24 Score=15.76 Aligned_cols=54 Identities=17% Similarity=0.185 Sum_probs=39.0 Q ss_pred CHHHHHHHHHCCCCCEEEEEEECCCHHHHHCC-CCCHHHHHHCCCCCCHHHHHHHHCCCCCCE Q ss_conf 98999999977998414999948888997317-653112320023343469998740156313 Q gi|254780664|r 94 NPAHMKFMQDHELESIDLVVVNLYPFEESFCR-EDDYYTMVENIDIGGPSMIRAAAKNHDYVT 155 (536) Q Consensus 94 ~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~-~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~ 155 (536) ..+.++.+.++|++ +.+|. ..... +-+.++++|.++|+|+.-+...++....|. T Consensus 59 ~~~~~~~~~~~Gv~---i~vC~-----~~~~~rgi~~~dl~~gv~i~g~~~l~~~~~~~d~vi 113 (114) T LOAD_Ccd2 59 LADLLKLAKEYGVK---VYVCG-----MSLDARGLKEDDLIEGVDIAGVAALAELLLEGDAVI 113 (114) T ss_pred HHHHHHHHHHCCCE---EEEEH-----HHHHHCCCCHHHCCCCCEEECHHHHHHHHHHCCEEE T ss_conf 99999999976987---99879-----999884999788647778959999999999779212 No 97 >pfam11152 DUF2930 Protein of unknown function (DUF2930). This family of proteins has no known function. Probab=43.35 E-value=24 Score=15.76 Aligned_cols=92 Identities=12% Similarity=0.135 Sum_probs=54.0 Q ss_pred CCCCHHHHHHHHHHHHHH-HCCCCCEEEEEECCEEEEE-CCCCCC--HHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC Q ss_conf 778688988899999986-0056677999968938997-296888--899999999998764200122223487099973 Q gi|254780664|r 408 RSPTDQELRDMKFAFKVV-KHVKSNAVVYAKDGRTVGI-GSGQTS--RVDSTRFAAIKAHNISAQADVKSMTNGSVIASE 483 (536) Q Consensus 408 ~~pt~~e~~dL~FA~kv~-K~vkSNAIv~ak~~~tiGi-GaGQ~s--RVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSD 483 (536) ...++...++|..||... ..+-.-++++..+++++=- |.-... -+..-.+.- ++.+ .-+-..|+-- T Consensus 71 ~~l~~~~k~ELAWas~~LLt~T~a~svlV~~~g~~llrRG~~~~~~~~~~pG~i~~-r~~~---------~~~~~~Lvn~ 140 (196) T pfam11152 71 PNLSEALKEELAWASHLLLTATPAASVLVYWDGQVLLRRGILGPPVSKFEPGEICK-RAME---------TQQLVYLVNL 140 (196) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEECCCCCCCCCCCHHHHHH-HHHH---------HCCCCCHHHC T ss_conf 87789999999999999986377608999989999996203579877678679999-9986---------0564554443 Q ss_pred CCCCCCCHHHHHHHCCCCEEEECCCC Q ss_conf 66868555899997199599938987 Q gi|254780664|r 484 AFYPFPDGIVEAIKAGVTAVIQPGGS 509 (536) Q Consensus 484 AFFPF~D~ie~aa~~Gi~aIiqPGGS 509 (536) .+||-++..+..-..==.-++||.|. T Consensus 141 ~lyPGr~Ef~~lP~ntqsV~vqPlg~ 166 (196) T pfam11152 141 KLYPGRIEFDYLPPNTQAVIVQPLGN 166 (196) T ss_pred CCCCCHHHHHHCCCCCCEEEEEECCC T ss_conf 63798577762699998699998689 No 98 >TIGR00160 MGSA methylglyoxal synthase; InterPro: IPR004363 Methylglyoxal synthase (MGS) catalyzes the conversion of dihydroxyacetone phosphate to methylglyoxal and phosphate:Glycerone phosphate = methylglyoxal + phosphate It provides bacteria with an alternative to triosephosphate isomerase for metabolizing dihydroxyacetone phosphate. MGS is a small protein of about 13 to 17 kDa. An aspartate residue is involved in the catalytic mechanism. Methylglyoxal synthase contains a domain shared by other enzymes. Other proteins containing this domain include purine biosynthesis protein PurH and carbamoyl phosphate synthetase.; GO: 0008929 methylglyoxal synthase activity, 0019242 methylglyoxal biosynthetic process, 0005737 cytoplasm. Probab=43.20 E-value=18 Score=16.80 Aligned_cols=48 Identities=27% Similarity=0.442 Sum_probs=34.1 Q ss_pred EEEEEEEE-CCCCHHHHHHHHHHC--CCEEEEEHHHHHHHHHC-CCCEEEHH Q ss_conf 69899875-456869999999987--99999942768999988-99517946 Q gi|254780664|r 17 KTALISVH-NKTGVVEFASRLLSR--GIKIISTGGTCQLLEEE-GIPVTSVF 64 (536) Q Consensus 17 krALiSV~-dKtgl~~la~~L~~~--g~~iisTgGTa~~l~~~-gi~v~~Vs 64 (536) |=|||-== -|..|+.|.+.-.+. .+.+|+||-|-..++++ |+.|..+- T Consensus 4 ~IALiAHD~kK~~Lvnfv~~h~~~L~~h~LyATGTTG~~i~~~TGL~i~~~~ 55 (143) T TIGR00160 4 KIALIAHDKKKQDLVNFVQQHKELLKKHDLYATGTTGELIERATGLAIEKLL 55 (143) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEECCCHHHHHHHCCCCEECCC T ss_conf 7422235134688999999989875416455316662776652077011005 No 99 >pfam00625 Guanylate_kin Guanylate kinase. Probab=43.18 E-value=8.4 Score=19.12 Aligned_cols=69 Identities=20% Similarity=0.334 Sum_probs=42.1 Q ss_pred HHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHCCCCCCC--HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 32002334346999874015631353089998988987530234566--678899999999987422899998 Q gi|254780664|r 132 MVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMDVNNGKIP--YNFRKKMARQAFSRTASYDTAICR 202 (536) Q Consensus 132 ~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~~~~g~~~--~~~R~~lA~kAF~~ta~YD~~Ia~ 202 (536) +|=+||+-|..-+|.. .+.-++|...|.+.+.+...|..-..... .+.|..-|.+-+++...||..|-| T Consensus 95 vvl~id~~g~~~lk~~--~~~~~~IfI~pps~~~L~~RL~~Rg~~~~~~i~~Rl~~a~~e~~~~~~fD~vIvN 165 (182) T pfam00625 95 CILDVDIQGVKQLRKA--ELSPISVFIKPPSLKVLQRRLKGRGTEQEEKINKRMEAAEQEFQHYAEFDYIIVN 165 (182) T ss_pred EEEEECHHHHHHHHHH--CCCCEEEEEECCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCEEEEC T ss_conf 9999728999999874--9574899993879999999998148888999999999999997348619999989 No 100 >PRK08251 short chain dehydrogenase; Provisional Probab=43.17 E-value=25 Score=15.74 Aligned_cols=39 Identities=21% Similarity=0.290 Sum_probs=27.3 Q ss_pred EEEEEEEECCCCH-HHHHHHHHHCCCEEEEEHHHHHHHHHC Q ss_conf 6989987545686-999999998799999942768999988 Q gi|254780664|r 17 KTALISVHNKTGV-VEFASRLLSRGIKIISTGGTCQLLEEE 56 (536) Q Consensus 17 krALiSV~dKtgl-~~la~~L~~~g~~iisTgGTa~~l~~~ 56 (536) |++||.=... || .++|+.|.+.|+.++-++-....|++. T Consensus 3 K~vlITGAss-GIG~alA~~la~~G~~v~l~~r~~~~l~~~ 42 (248) T PRK08251 3 QKILITGASS-GLGAGMAREFAAKGRDLALCARRTDRLEEL 42 (248) T ss_pred CEEEEECCCC-HHHHHHHHHHHHCCCEEEEEECCHHHHHHH T ss_conf 9899947863-999999999998799899998988899999 No 101 >TIGR01137 cysta_beta cystathionine beta-synthase; InterPro: IPR005857 This model discriminates cystathionine beta-synthase, a protein found primarily in eukaryotes and carrying a C-terminal CBS domain from cysteine synthases. Cysteine synthase (O-acetylserine (thiol)-lyase) is the enzyme responsible for the formation of cysteine from O-acetyl-serine and hydrogen sulphide with the concomitant release of acetic acid - the function of many of these enzymes is unproven.; GO: 0004122 cystathionine beta-synthase activity, 0019343 cysteine biosynthetic process via cystathionine, 0005737 cytoplasm. Probab=43.04 E-value=14 Score=17.60 Aligned_cols=15 Identities=27% Similarity=0.313 Sum_probs=7.6 Q ss_pred HHHHCCCCCCEEEEC Q ss_conf 987401563135308 Q gi|254780664|r 145 RAAAKNHDYVTILTN 159 (536) Q Consensus 145 RAAAKN~~~V~Vi~d 159 (536) -||-|-|+-+.|+.| T Consensus 83 ~Aa~kGYk~Iiv~Pe 97 (527) T TIGR01137 83 VAAIKGYKCIIVLPE 97 (527) T ss_pred HHHHCCCEEEEECCC T ss_conf 995268639998687 No 102 >pfam02602 HEM4 Uroporphyrinogen-III synthase HemD. This family consists of uroporphyrinogen-III synthase HemD EC:4.2.1.75 also known as Hydroxymethylbilane hydrolyase (cyclizing) from eukaryotes, bacteria and archaea. This enzyme catalyses the reaction: Hydroxymethylbilane <= uroporphyrinogen-III + H(2)O. Some members of this family are multi-functional proteins possessing other enzyme activities related to porphyrin biosynthesis, with pfam00590, however the aligned region corresponds with the uroporphyrinogen-III synthase EC:4.2.1.75 activity only. Uroporphyrinogen-III synthase is the fourth enzyme in the heme pathway. Mutant forms of the Uroporphyrinogen-III synthase gene cause congenital erythropoietic porphyria in humans a recessive inborn error of metabolism also known as Gunther disease. Probab=42.65 E-value=25 Score=15.69 Aligned_cols=41 Identities=17% Similarity=0.104 Sum_probs=17.5 Q ss_pred CCHHHHHHHHHHHHHHHEE--CCCCHHHH-HHHHC-CCCCEEEEC Q ss_conf 4214578788654311112--68798999-88512-455315631 Q gi|254780664|r 328 VDQEVAKEVIKVFTEAIIA--PTLSEEAA-DVLAK-KPSMRFLKT 368 (536) Q Consensus 328 vd~~~A~~I~~~F~EvIiA--P~f~~eAl-eiL~~-KKnlRil~~ 368 (536) |-..||+.+.+..+.+..- .+|+.+.| +.+.+ -++.|++.. T Consensus 76 vG~~Ta~~l~~~G~~~~~~~~~~~~~~~L~~~i~~~~~~~~il~~ 120 (229) T pfam02602 76 VGPKTARALREAGLTPDFVPSAEGTAEGLAEELAELLAGKRVLLL 120 (229) T ss_pred ECHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHCCCCCEEEEE T ss_conf 378999999984999608798888999999977642689869997 No 103 >PRK06128 oxidoreductase; Provisional Probab=41.55 E-value=26 Score=15.57 Aligned_cols=38 Identities=18% Similarity=0.129 Sum_probs=16.9 Q ss_pred CCCCCCCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEE Q ss_conf 78867761369899875456869999999987999999 Q gi|254780664|r 8 DGDHGEIAVKTALISVHNKTGVVEFASRLLSRGIKIIS 45 (536) Q Consensus 8 ~~~~~~~~ikrALiSV~dKtgl~~la~~L~~~g~~iis 45 (536) -..+||++=|+|||.=...-==...|+.|.+.|..++- T Consensus 47 y~g~grL~GKvAlVTGgssGIG~AiA~~lA~eGA~Vvi 84 (300) T PRK06128 47 YKGFGRLQGRKALITGADSGIGRATAIAFAREGADIVL 84 (300) T ss_pred CCCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEE T ss_conf 98888789995899173669999999999986999999 No 104 >TIGR02518 EutH_ACDH acetaldehyde dehydrogenase (acetylating); InterPro: IPR013357 Aldehyde dehydrogenases (1.2.1.3 from EC and 1.2.1.5 from EC) are enzymes which oxidize a wide variety of aliphatic and aromatic aldehydes using NAD(P) as a cofactor. In eukaryotes, at least twenty distinct aldehyde dehydrogenase families have been classified . Many aldehyde dehydrogenases have also been found in prokaryotic species. A glutamic acid and a cysteine residue have been implicated in the catalytic activity of mammalian aldehyde dehydrogenase, though these residues are not necessarily conserved in all the enzymes of this family. The acetaldehyde dehydrogenase family (1.2.1.10 from EC) of bacterial enzymes catalyse the formation of acetyl-CoA from acetaldehyde in the degradation of phenols, cresols and catechol . Proteins in this entry occur in bacterial species and are annotated as probable acetaldehyde dehydrogenases, though this function has not been experimentally verified.. Probab=41.38 E-value=8.6 Score=19.04 Aligned_cols=38 Identities=21% Similarity=0.381 Sum_probs=25.1 Q ss_pred CCCCCHHHHHHHH----------------------------------CCCCCC-------EEEECHHHHHHHHHHHHCC Q ss_conf 2334346999874----------------------------------015631-------3530899989889875302 Q gi|254780664|r 136 IDIGGPSMIRAAA----------------------------------KNHDYV-------TILTNPQDYPLFLAEMDVN 173 (536) Q Consensus 136 IDIGGpsmiRAAA----------------------------------KN~~~V-------~Vi~dp~dY~~~~~el~~~ 173 (536) |=-||++|+|||= |-|.+= .|||.+..-+.|++||+.. T Consensus 192 lATGG~aMVkAAYSSG~PAIGVGpGNgPAyIe~~Anv~~Av~~Il~SKtFDnGtICASEQSvivE~~n~~aVv~Elk~Q 270 (528) T TIGR02518 192 LATGGEAMVKAAYSSGTPAIGVGPGNGPAYIERTANVKKAVRKILDSKTFDNGTICASEQSVIVEKCNKDAVVEELKKQ 270 (528) T ss_pred EECCCHHHHHHECCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHCCCCCCCCEEEEECCCEEEECCCHHHHHHHHHHC T ss_conf 9607702533000377862121777860101156787899988740256677616531350686157726899999852 No 105 >PRK13111 trpA tryptophan synthase subunit alpha; Provisional Probab=41.35 E-value=26 Score=15.55 Aligned_cols=36 Identities=25% Similarity=0.373 Sum_probs=17.7 Q ss_pred HHHHHHHHHHHHEECCCCHHHHHHH-H--CCCCCEEEEC Q ss_conf 7878865431111268798999885-1--2455315631 Q gi|254780664|r 333 AKEVIKVFTEAIIAPTLSEEAADVL-A--KKPSMRFLKT 368 (536) Q Consensus 333 A~~I~~~F~EvIiAP~f~~eAleiL-~--~KKnlRil~~ 368 (536) .+...+.=++.+|-||.-.|--+-+ . ++.++.++.+ T Consensus 106 ~~~~~~~GvdGvIipDLP~eE~~~~~~~~~~~gi~~I~l 144 (256) T PRK13111 106 AADAAEAGVDGLIIPDLPPEEAEEFRAAAKKHGIDLIFL 144 (256) T ss_pred HHHHHHCCCCEEEECCCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 999997599779816999788899999999759808999 No 106 >TIGR01158 SUI1_rel translation initation factor SUI1, putative; InterPro: IPR005872 This family of archaeal and bacterial proteins is homologous to the eukaryotic translation intiation factor SUI1 involved in directing the ribosome to the proper start site of translation by functioning in concert with eIF-2 and the initiator tRNA-Met.; GO: 0003743 translation initiation factor activity, 0006412 translation. Probab=41.25 E-value=7.1 Score=19.68 Aligned_cols=29 Identities=28% Similarity=0.362 Sum_probs=22.7 Q ss_pred EEEEEECCC--CHHHHHHHHHHCCCEEEEEHHHHH Q ss_conf 899875456--869999999987999999427689 Q gi|254780664|r 19 ALISVHNKT--GVVEFASRLLSRGIKIISTGGTCQ 51 (536) Q Consensus 19 ALiSV~dKt--gl~~la~~L~~~g~~iisTgGTa~ 51 (536) -+|+=-|.+ +|.+||+.|.+ ..+||||+| T Consensus 51 TiI~Gld~~~~~L~~LAk~LK~----~cacGGTvK 81 (111) T TIGR01158 51 TIIEGLDLSDIDLKELAKELKS----KCACGGTVK 81 (111) T ss_pred EEEECCCCCHHHHHHHHHHHHH----HHCCCCCEE T ss_conf 8861578336789999998778----846897403 No 107 >TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family; InterPro: IPR014228 Members of this entry include YlxY, and are related to IPR014132 from INTERPRO, which represents a subset of the polysaccharide deacetylase family found in a species if, and only if, the species forms endospores e.g. Bacillus subtilis or Clostridium tetani. Proteins in this entry are likewise restricted to spore-formers, but are not universally found among endospore-forming species.. Probab=41.24 E-value=26 Score=15.54 Aligned_cols=36 Identities=25% Similarity=0.388 Sum_probs=27.1 Q ss_pred HHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEE Q ss_conf 8999971995999389877988999999875977997 Q gi|254780664|r 492 IVEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAMVF 528 (536) Q Consensus 492 ie~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m~f 528 (536) |+.+....++=..=|.||=|| +||+.|.+++|-=|+ T Consensus 170 i~a~i~~~pkWFAPPSGSF~d-~Vv~~Aad~~M~Tim 205 (269) T TIGR02873 170 IEATIGVKPKWFAPPSGSFRD-EVVQIAADLQMGTIM 205 (269) T ss_pred HHHHHCCCCCEECCCCCCHHH-HHHHHHHHCCCCEEE T ss_conf 999857987040288876415-778778746896388 No 108 >PRK06953 short chain dehydrogenase; Provisional Probab=40.75 E-value=27 Score=15.49 Aligned_cols=40 Identities=25% Similarity=0.293 Sum_probs=28.7 Q ss_pred EEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHH Q ss_conf 3698998754568699999999879999994276899998 Q gi|254780664|r 16 VKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEE 55 (536) Q Consensus 16 ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~ 55 (536) ||++||.=..+-==..+|+.|.+.|+.++.++-....+.+ T Consensus 1 MK~~LVTGas~GIG~a~a~~la~~G~~V~~~~R~~~~l~~ 40 (222) T PRK06953 1 MKTVLIVGASRGIGLEFVRQYRADGWRVIATARDAAGLAA 40 (222) T ss_pred CCEEEECCCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHH T ss_conf 9999994757299999999999888999999688888999 No 109 >COG1880 CdhB CO dehydrogenase/acetyl-CoA synthase epsilon subunit [Energy production and conversion] Probab=40.62 E-value=26 Score=15.53 Aligned_cols=46 Identities=24% Similarity=0.314 Sum_probs=34.2 Q ss_pred CEEEEEEEEECC---CCHHHHHHHHHHCC-CEEEEEHHHHHHHHHCCCCE Q ss_conf 136989987545---68699999999879-99999427689999889951 Q gi|254780664|r 15 AVKTALISVHNK---TGVVEFASRLLSRG-IKIISTGGTCQLLEEEGIPV 60 (536) Q Consensus 15 ~ikrALiSV~dK---tgl~~la~~L~~~g-~~iisTgGTa~~l~~~gi~v 60 (536) +-||-|+=|--. ..+.|++..+.+.. +.+++|+++..-+.+.|+.+ T Consensus 34 kAkrPLlivGp~~~dee~~E~~vKi~ekfnipivaTa~~~~~~~~~~i~~ 83 (170) T COG1880 34 KAKRPLLIVGPLALDEELLELAVKIIEKFNIPIVATASSMGNLIGRGIGS 83 (170) T ss_pred HCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCEEECCHHHCCHHHCCCCC T ss_conf 62795498655325889999999999864874475321102023246563 No 110 >TIGR00506 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; InterPro: IPR000422 3,4-Dihydroxy-2-butanone 4-phosphate is biosynthesized from ribulose 5-phosphate and serves as the biosynthetic precursor for the xylene ring of riboflavin . It is sometimes found as a bifunctional enzyme with GTP cyclohydrolase II that catalyses the first committed step in the biosynthesis of riboflavin (IPR000926 from INTERPRO). No sequences with significant homology to DHBP synthase are found in the metazoa.; GO: 0008686 34-dihydroxy-2-butanone-4-phosphate synthase activity, 0009231 riboflavin biosynthetic process. Probab=39.96 E-value=27 Score=15.46 Aligned_cols=91 Identities=22% Similarity=0.286 Sum_probs=59.3 Q ss_pred CCCEEEEEEC-CEEEEECCCCCCHHHHHHHHHHH-HHH--HHHCCCCCCC------------CCCEEEEECCCCCCCC-- Q ss_conf 6677999968-93899729688889999999999-876--4200122223------------4870999736686855-- Q gi|254780664|r 429 KSNAVVYAKD-GRTVGIGSGQTSRVDSTRFAAIK-AHN--ISAQADVKSM------------TNGSVIASEAFYPFPD-- 490 (536) Q Consensus 429 kSNAIv~ak~-~~tiGiGaGQ~sRVda~~iA~~K-A~~--~~~~~~~~~~------------~~g~vlaSDAFFPF~D-- 490 (536) -|=+|-+++. +.+.||.+ ..|.--+++++.- +.. .+...++.+. ..|-|+ =|- T Consensus 100 ~~fti~~~~r~et~TGIsa--~DR~~Ti~~~l~~~~~~vmmgr~~dFG~df~rPGHvfpLRAa~GgVL-------~R~GH 170 (230) T TIGR00506 100 STFTIDVAHRKETTTGISA--NDRALTIRAALADVVKPVMMGRKEDFGSDFRRPGHVFPLRAADGGVL-------KRAGH 170 (230) T ss_pred EEEEEEEECCCCCEECCCH--HHHHHHHHHHHCCCCCEEECCCCCCCHHHCCCCCCCCHHHHCCCCEE-------CCCCC T ss_conf 3789997477885005546--47889998874676642222762455114368886541543278751-------25773 Q ss_pred ---HHHHHHHCCCCEEE-------E--CCCCCCHHH--HHHHHHHCCCEEEE Q ss_conf ---58999971995999-------3--898779889--99999875977997 Q gi|254780664|r 491 ---GIVEAIKAGVTAVI-------Q--PGGSVRDSE--AITVADQHGIAMVF 528 (536) Q Consensus 491 ---~ie~aa~~Gi~aIi-------q--PGGSirD~e--vI~aan~~gi~m~f 528 (536) ||+++.=+|.+-+. . -|-+.|-+| .+++|+||||.|+- T Consensus 171 TEasVdL~~lAGl~pa~VICE~~nd~edG~mar~~ekf~~~yA~Kh~l~~i~ 222 (230) T TIGR00506 171 TEASVDLAELAGLKPAGVICEIMNDEEDGTMARKPEKFLVEYAKKHGLKLIS 222 (230) T ss_pred CHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCCEE T ss_conf 3488899997089972899862377203310366648899999881985011 No 111 >PRK12360 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Provisional Probab=39.88 E-value=21 Score=16.20 Aligned_cols=30 Identities=20% Similarity=0.276 Sum_probs=25.4 Q ss_pred CCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCCCEEE Q ss_conf 98553237155535533088989999999779984149 Q gi|254780664|r 74 GGRVKTLHPKIYGGILSIRDNPAHMKFMQDHELESIDL 111 (536) Q Consensus 74 ~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~~~i~~IDl 111 (536) +++|-||||-|| |+.-.+.|++.|+..|+. T Consensus 31 ~~~vy~lG~iVH--------N~~vv~~L~~~Gv~~v~~ 60 (281) T PRK12360 31 KKKIYTLGPLIH--------NNQVVSDLEEKGVKVIEE 60 (281) T ss_pred CCCEEEECCCCC--------CHHHHHHHHHCCCEEEEC T ss_conf 996898377767--------989999999779979813 No 112 >TIGR00337 PyrG CTP synthase; InterPro: IPR004468 CTP synthase is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. The enzyme catalyzes the reaction L-glutamine + H2O + UTP + ATP = CTP + phosphate + ADP + L-glutamate. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. This gene has been found circa 500 bp 5 upstream of enolase in both beta (Nitrosomonas europaea) and gamma (Escherichia coli) subdivisions of Proteobacterium .; GO: 0003883 CTP synthase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=39.60 E-value=25 Score=15.69 Aligned_cols=35 Identities=20% Similarity=0.349 Sum_probs=17.0 Q ss_pred CCCCHHHHH-HHHCCCCHHHHHHHHHCCCCCEEEEEE Q ss_conf 237155535-533088989999999779984149999 Q gi|254780664|r 79 TLHPKIYGG-ILSIRDNPAHMKFMQDHELESIDLVVV 114 (536) Q Consensus 79 TLHP~I~gg-IL~~r~~~~~~~~l~~~~i~~IDlVvv 114 (536) ||=|.|+|+ =|=-.-.+.=.++|.++||.| ||.|| T Consensus 184 TLVP~i~aagE~KTKPTQhSVKeLRs~Gi~P-D~i~c 219 (571) T TIGR00337 184 TLVPYIAAAGELKTKPTQHSVKELRSLGIQP-DIIIC 219 (571) T ss_pred EECCCCCCCCCCCCCCCHHHHHHHHHCCCCC-CEEEE T ss_conf 0026314487478775127899998609888-68998 No 113 >pfam02091 tRNA-synt_2e Glycyl-tRNA synthetase alpha subunit. Probab=39.44 E-value=10 Score=18.45 Aligned_cols=20 Identities=35% Similarity=0.683 Sum_probs=13.6 Q ss_pred CCCCCCCCCCCCCCCCEEEE Q ss_conf 23333436788100650576 Q gi|254780664|r 221 VKKQEMRYGENPHQKAALYS 240 (536) Q Consensus 221 ~~~~~LRYGENPHQ~Aa~Y~ 240 (536) .+-.+=||||||.----+|+ T Consensus 55 rRP~DgRYGenPnRlq~y~Q 74 (284) T pfam02091 55 RRPTDGRYGENPNRLQHYYQ 74 (284) T ss_pred CCCCCCCCCCCCCHHHHHEE T ss_conf 79999877789204200223 No 114 >TIGR01830 3oxo_ACP_reduc 3-oxoacyl-(acyl-carrier-protein) reductase; InterPro: IPR011284 This entry represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis found in many plant and bacterial species. This enzyme is involved in type II fatty acid biosynthesis, where the individual metabolic transformations are carried out by different enzymes rather than by a single enzyme as occurs in type I fatty acid biosynthesis . Structural studies show that the enzyme is a tetramer which forms a typical Rossman fold , . Unlike other members of the short-chain dehydrogenase/reductase superfamily, the enzyme undergoes a marked conformational change upon binding of the NADP(H)cofactor. This conformational change aligns the side chains of the catalytic triad at the active site in an active conformation and increases the affinity of the enzyme for its substrate.; GO: 0004316 3-oxoacyl-[acyl-carrier-protein] reductase activity, 0051287 NAD binding, 0006633 fatty acid biosynthetic process. Probab=39.38 E-value=28 Score=15.34 Aligned_cols=44 Identities=30% Similarity=0.466 Sum_probs=18.9 Q ss_pred HHHHHHHHHCCCCCEEEEEECCEEEEEC--------CCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 9999998600566779999689389972--------96888899999999998764200 Q gi|254780664|r 418 MKFAFKVVKHVKSNAVVYAKDGRTVGIG--------SGQTSRVDSTRFAAIKAHNISAQ 468 (536) Q Consensus 418 L~FA~kv~K~vkSNAIv~ak~~~tiGiG--------aGQ~sRVda~~iA~~KA~~~~~~ 468 (536) |.=.|+++|.+ +-...=.|.|..|-|+ +||+| =|..||+--++. T Consensus 107 L~g~F~~t~~v-~~~M~K~R~GrIINisSVVG~~GN~GQaN------YaASKAG~IGft 158 (238) T TIGR01830 107 LKGVFNLTQAV-LRPMIKQRSGRIINISSVVGLMGNAGQAN------YAASKAGVIGFT 158 (238) T ss_pred HHHHHHHHHHH-HHHHHHHCCCCEEEEEEHHHHHCCCCHHH------HHHHHHHHHHHH T ss_conf 26687888998-89887506743486100200006874267------888875589999 No 115 >TIGR00070 hisG ATP phosphoribosyltransferase; InterPro: IPR001348 ATP phosphoribosyltransferase (2.4.2.17 from EC) is the enzyme that catalyzes the first step in the biosynthesis of histidine in bacteria, fungi and plants as shown below. It is a member of the larger phosphoribosyltransferase superfamily of enzymes which catalyse the condensation of 5-phospho-alpha-D-ribose 1-diphosphate with nitrogenous bases in the presence of divalent metal ions .ATP + 5-phospho-alpha-D-ribose 1-diphosphate = 1-(5-phospho-D-ribosyl)-ATP + diphosphate Histidine biosynthesis is an energetically expensive process and ATP phosphoribosyltransferase activity is subject to control at several levels. Transcriptional regulation is based primarily on nutrient conditions and determines the amount of enzyme present in the cell, while feedback inihibition rapidly modulates activity in response to cellular conditions. The enzyme has been shown to be inhibited by 1-(5-phospho-D-ribosyl)-ATP, histidine, ppGpp (a signal associated with adverse environmental conditions) and ADP and AMP (which reflect the overall energy status of the cell). As this pathway of histidine biosynthesis is present only in prokayrotes, plants and fungi, this enzyme is a promising target for the development of novel antimicrobial compounds and herbicides. This entry represents the two forms of ATP phosphoribosyltransferase: a long form containing two catalytic domains and a C-terminal regulatory domain, and a short form in which the regulatory domain is missing. The long form is catalytically competent, but in organisms with the short form, a histidyl-tRNA synthetase paralogue, HisZ, is required for enzyme activity . The structures of the long form enzymes from Escherichia coli (P60757 from SWISSPROT) and Mycobacterium tuberculosis (P60759 from SWISSPROT) have been determined , . The enzyme itself exists in equilibrium between an active dimeric form, an inactive hexameric form and higher aggregates. Interconversion between the various forms is largely reversible and is influenced by the binding of the natural substrates and inhibitors of the enzyme. The two catalytic domains are linked by a two-stranded beta-sheet and togther form a "periplamsic binding protein fold". A crevice between these domains contains the active site. The C-terminal domain is not directly involved in catalysis but appears to be involved the formation of hexamers, induced by the binding of inhibitors such as histidine to the enzyme, thus regulating activity.; GO: 0003879 ATP phosphoribosyltransferase activity, 0000105 histidine biosynthetic process. Probab=39.03 E-value=19 Score=16.63 Aligned_cols=15 Identities=20% Similarity=0.169 Sum_probs=5.8 Q ss_pred HHEECCCCHHHHHHH Q ss_conf 111268798999885 Q gi|254780664|r 343 AIIAPTLSEEAADVL 357 (536) Q Consensus 343 vIiAP~f~~eAleiL 357 (536) |.||-+|=.=|.+-| T Consensus 131 v~IAT~FP~~a~~Yf 145 (317) T TIGR00070 131 VRIATEFPHLAKKYF 145 (317) T ss_pred EEEEECCHHHHHHHH T ss_conf 899826757999999 No 116 >PRK02491 putative deoxyribonucleotide triphosphate pyrophosphatase/unknown domain fusion protein; Reviewed Probab=38.61 E-value=22 Score=16.12 Aligned_cols=42 Identities=21% Similarity=0.115 Sum_probs=18.4 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHH Q ss_conf 88999999860056677999968938997296888899999999 Q gi|254780664|r 416 RDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAA 459 (536) Q Consensus 416 ~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~ 459 (536) ..++=|..+.+. ++-.|||.|--+.=-.-|..--|.|.|-|. T Consensus 176 NA~lKA~~~ak~--tG~pvLADDSGL~VDAL~G~PGIySARyAG 217 (328) T PRK02491 176 NARLKAETISRL--TGKMVLADDSGLKVDALGGLPGVWSARFSG 217 (328) T ss_pred HHHHHHHHHHHH--HCCCEEECCCCEEEEECCCCCCCCEEEECC T ss_conf 999999999998--799689627761881048998722334158 No 117 >COG2428 Uncharacterized conserved protein [Function unknown] Probab=38.44 E-value=25 Score=15.65 Aligned_cols=52 Identities=23% Similarity=0.283 Sum_probs=32.2 Q ss_pred HHHHHHHHHHCCCEEEEEHHHH---HHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCC Q ss_conf 6999999998799999942768---99998899517946534881651985532371 Q gi|254780664|r 29 VVEFASRLLSRGIKIISTGGTC---QLLEEEGIPVTSVFDITKFPEIMGGRVKTLHP 82 (536) Q Consensus 29 l~~la~~L~~~g~~iisTgGTa---~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP 82 (536) |.++...+.+.|++.+-|+-+- ..|-+.|.++..+| +|..| .=++||=-|.| T Consensus 17 llEy~e~a~~~G~~~~vtna~pe~p~vlak~g~~~i~e~-~~~~~-l~r~rvivLDl 71 (196) T COG2428 17 LLEYKEVARWWGDEFIVTNAKPEEPEVLAKIGLSGIPES-ITRLP-LDRSRVIVLDL 71 (196) T ss_pred HHHHHHHHHHHCHHEEEECCCCCHHHHHHHHCCCCCCHH-HHHCC-CCCCCEEEECC T ss_conf 999999999834430660578621579998265668636-76263-67897799668 No 118 >PRK02304 adenine phosphoribosyltransferase; Provisional Probab=38.12 E-value=11 Score=18.33 Aligned_cols=23 Identities=13% Similarity=0.112 Sum_probs=13.6 Q ss_pred CCCCCEEEE-CHHHHHHHHHHHHC Q ss_conf 156313530-89998988987530 Q gi|254780664|r 150 NHDYVTILT-NPQDYPLFLAEMDV 172 (536) Q Consensus 150 N~~~V~Vi~-dp~dY~~~~~el~~ 172 (536) +|++++.+. ||+-+..++++|.. T Consensus 21 ~F~Ditpll~dp~~~~~~~~~l~~ 44 (174) T PRK02304 21 LFRDITPLLADPEALREVIDALVE 44 (174) T ss_pred EEEECHHHHCCHHHHHHHHHHHHH T ss_conf 688582476599999999999999 No 119 >COG0108 RibB 3,4-dihydroxy-2-butanone 4-phosphate synthase [Coenzyme metabolism] Probab=38.00 E-value=29 Score=15.19 Aligned_cols=38 Identities=24% Similarity=0.358 Sum_probs=28.0 Q ss_pred HHHHHHHCCCC-----E-EEE-CCCCCCHHHHHHHHHHCCCEEEE Q ss_conf 58999971995-----9-993-89877988999999875977997 Q gi|254780664|r 491 GIVEAIKAGVT-----A-VIQ-PGGSVRDSEAITVADQHGIAMVF 528 (536) Q Consensus 491 ~ie~aa~~Gi~-----a-Iiq-PGGSirD~evI~aan~~gi~m~f 528 (536) +|+++.-+|.+ | |+- .|++.|-+++++.|.+|||.|+- T Consensus 146 sVdLarlAGl~Pa~VicEi~~~dG~mar~~~~~~fa~~h~l~~it 190 (203) T COG0108 146 AVDLARLAGLKPAGVICEIMNDDGTMARLPELEEFAKEHGLPVIT 190 (203) T ss_pred HHHHHHHCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHCCCCEEE T ss_conf 999999839997379999867986615738999999983996888 No 120 >PRK13113 consensus Probab=37.26 E-value=30 Score=15.11 Aligned_cols=34 Identities=15% Similarity=0.279 Sum_probs=17.5 Q ss_pred HHHHHHHHHHEECCCCHHHHH-HHH--CCCCCEEEEC Q ss_conf 788654311112687989998-851--2455315631 Q gi|254780664|r 335 EVIKVFTEAIIAPTLSEEAAD-VLA--KKPSMRFLKT 368 (536) Q Consensus 335 ~I~~~F~EvIiAP~f~~eAle-iL~--~KKnlRil~~ 368 (536) ...+.=++.+|-||.-.|.-+ +.+ ++-++.++.+ T Consensus 117 ~~~~~GvdGvIipDLP~eE~~~~~~~~~~~~l~~I~l 153 (263) T PRK13113 117 EAKEAGIDGLIVVDLPPEEDSELCLPAQAAGLNFIRL 153 (263) T ss_pred HHHHCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 9877794369717999788899999999779867999 No 121 >COG3185 4-hydroxyphenylpyruvate dioxygenase and related hemolysins [Amino acid transport and metabolism / General function prediction only] Probab=37.18 E-value=30 Score=15.11 Aligned_cols=66 Identities=21% Similarity=0.326 Sum_probs=28.5 Q ss_pred CCCCCCC--------CCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHH----------HHHHHHHEEC-CCC Q ss_conf 5686565--------45201689999998533213200455542754214578788----------6543111126-879 Q gi|254780664|r 290 HMNPCGV--------ATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVI----------KVFTEAIIAP-TLS 350 (536) Q Consensus 290 H~NPCGv--------A~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~----------~~F~EvIiAP-~f~ 350 (536) -++|||. +...++.+-|.+.+.+--+ -=|||...==.+|+..+- +.++|-+=+- .++ T Consensus 213 m~Sp~G~vrlplN~s~~~~sqi~efl~~y~G~GI----QHIA~~T~dI~~tv~~lr~rG~~fl~ip~tYYd~~~~R~~~~ 288 (363) T COG3185 213 MVSPCGKVRLPLNESADDKSQIGEFLREYRGEGI----QHIAFGTDDIYATVAALRERGVKFLPIPETYYDDLDARFPLH 288 (363) T ss_pred EECCCCCEEEECCCCCCCHHHHHHHHHHHCCCCC----EEEEECCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCC T ss_conf 7568885786245687762589999997378760----378843627999999999759866778642788876217897 Q ss_pred HHHHHHHHC Q ss_conf 899988512 Q gi|254780664|r 351 EEAADVLAK 359 (536) Q Consensus 351 ~eAleiL~~ 359 (536) +|-++-|++ T Consensus 289 ~e~ld~Lr~ 297 (363) T COG3185 289 GEFLDALRE 297 (363) T ss_pred HHHHHHHHH T ss_conf 677999885 No 122 >TIGR02364 dha_pts dihydroxyacetone kinase, phosphotransfer subunit; InterPro: IPR012844 In Escherichia coli and many other bacteria, unlike the yeasts and a few bacteria such as Citrobacter freundii, the dihydroxyacetone kinase (also called glycerone kinase) transfers a phosphate from a phosphoprotein rather than from ATP and contains multiple subunits.. Probab=36.86 E-value=23 Score=15.93 Aligned_cols=31 Identities=42% Similarity=0.659 Sum_probs=24.7 Q ss_pred EEEEEECC--CCHHHHHHHHHH-CCCEEEEEHHH Q ss_conf 89987545--686999999998-79999994276 Q gi|254780664|r 19 ALISVHNK--TGVVEFASRLLS-RGIKIISTGGT 49 (536) Q Consensus 19 ALiSV~dK--tgl~~la~~L~~-~g~~iisTgGT 49 (536) -|+|=|-| .|+.+|++.... ..|+|++.||| T Consensus 4 VlVSHS~~iaeG~~~L~~qM~~S~dV~I~~aGGt 37 (128) T TIGR02364 4 VLVSHSKKIAEGVKELVKQMAGSDDVKIISAGGT 37 (128) T ss_pred EEEECCHHHHHHHHHHHHHHHCCCCEEEEEECCC T ss_conf 8987646688899999998536795468871064 No 123 >PRK00164 moaA molybdenum cofactor biosynthesis protein A; Reviewed Probab=36.81 E-value=30 Score=15.07 Aligned_cols=75 Identities=20% Similarity=0.310 Sum_probs=34.5 Q ss_pred CCHHHHHHHHHHC-CCE--EEEEHHH-----HHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCH--- Q ss_conf 6869999999987-999--9994276-----8999988995179465348816519855323715553553308898--- Q gi|254780664|r 27 TGVVEFASRLLSR-GIK--IISTGGT-----CQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNP--- 95 (536) Q Consensus 27 tgl~~la~~L~~~-g~~--iisTgGT-----a~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~--- 95 (536) .+|++|.+.|.+. |++ -+.|-|+ ++.|+++|+.-..||= -||.|..|.-|-.+.+-+ T Consensus 82 ~di~~li~~l~~~~gi~~v~lTTNG~lL~~~a~~Lk~aGL~riNISL------------DsLd~~~f~~IT~~~~l~~Vl 149 (334) T PRK00164 82 KDLEDIIARLAALPGIRDLALTTNGYLLARRAAALKDAGLTRVNVSL------------DSLDPERFKAITGRDRLDQVL 149 (334) T ss_pred CCHHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHHHHCCCCEEEEEE------------EECCHHHHHHHHCCCCHHHHH T ss_conf 57899999986327975178844488999999999985998699711------------318999999984899759999 Q ss_pred HHHHHHHHCCCCCEEEEE Q ss_conf 999999977998414999 Q gi|254780664|r 96 AHMKFMQDHELESIDLVV 113 (536) Q Consensus 96 ~~~~~l~~~~i~~IDlVv 113 (536) .-+..+.+.|+.++-+-+ T Consensus 150 ~gI~~A~~~G~~~vKiN~ 167 (334) T PRK00164 150 AGIDAALAAGLEPVKVNA 167 (334) T ss_pred HHHHHHHHCCCCCEEEEE T ss_conf 999999958987616899 No 124 >KOG2811 consensus Probab=36.51 E-value=22 Score=16.14 Aligned_cols=49 Identities=27% Similarity=0.377 Sum_probs=30.8 Q ss_pred HHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHH-------------HHHHHHHHHCCCCCCCEE Q ss_conf 167764245567279996156865654520168-------------999999853321320045 Q gi|254780664|r 271 FELVSEFRSQDCAACVIVKHMNPCGVATADTLV-------------EAYRRALSCDPISAFGGI 321 (536) Q Consensus 271 ~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~-------------~Ay~~A~~~DP~SAFGGI 321 (536) ++-+-||...+.|.++|=|| -||+|+.-++. .+..-|+-|--.|-|+.- T Consensus 248 L~~i~el~~~~~p~vaisKH--LCG~ATDLtLRCl~~s~~~~~p~l~~i~IAlCCHH~c~wr~Y 309 (420) T KOG2811 248 LNAIPELKNSGKPYVAISKH--LCGAATDLTLRCLLSSGDASSPVLAGILIALCCHHRCRWRSY 309 (420) T ss_pred CCCCCCCCCCCCCEEEEECC--CCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHCCCCCHHHH T ss_conf 55320004569837998422--346212367888606854335303457887743010334443 No 125 >pfam08149 BING4CT BING4CT (NUC141) domain. This C terminal domain is found in the BING4 family of nucleolar WD40 repeat proteins. Probab=36.25 E-value=8.9 Score=18.94 Aligned_cols=34 Identities=35% Similarity=0.572 Sum_probs=29.8 Q ss_pred CCCEEEEECCCCCCCCHHHHHHHCCCCEEEECCC Q ss_conf 4870999736686855589999719959993898 Q gi|254780664|r 475 TNGSVIASEAFYPFPDGIVEAIKAGVTAVIQPGG 508 (536) Q Consensus 475 ~~g~vlaSDAFFPF~D~ie~aa~~Gi~aIiqPGG 508 (536) ..|....|=.|-||-|-+-.....|++.||-||- T Consensus 7 ~~~~~i~~~~F~PfEDvLgvGh~~G~ssiiVPGs 40 (80) T pfam08149 7 KPGSKISSVRFCPFEDVLGIGHSGGFSSIIVPGS 40 (80) T ss_pred CCCCCEEEEEEECHHHEEEECCCCCEEEEEECCC T ss_conf 7998141457744578357303576256771699 No 126 >PRK02492 deoxyhypusine synthase-like protein; Provisional Probab=36.23 E-value=31 Score=15.00 Aligned_cols=22 Identities=23% Similarity=0.480 Sum_probs=11.0 Q ss_pred CCHHHHHHHHHHCCCE--EEEEHH Q ss_conf 6869999999987999--999427 Q gi|254780664|r 27 TGVVEFASRLLSRGIK--IISTGG 48 (536) Q Consensus 27 tgl~~la~~L~~~g~~--iisTgG 48 (536) +|+-+..+.|.++|+= |++||+ T Consensus 72 aGlr~~i~~lIr~~~Vd~IvtTgA 95 (344) T PRK02492 72 AGCMQVYIDLVENNMVDAIVATGA 95 (344) T ss_pred HHHHHHHHHHHHCCCEEEEECCCC T ss_conf 668999999997698569972788 No 127 >cd04724 Tryptophan_synthase_alpha Ttryptophan synthase (TRPS) alpha subunit (TSA). TPRS is a bifunctional tetrameric enzyme (2 alpha and 2 beta subunits) that catalyzes the last two steps of L-tryptophan biosynthesis. Alpha and beta subunit catalyze two distinct reactions which are both strongly stimulated by the formation of the complex. The alpha subunit catalyzes the cleavage of indole 3-glycerol phosphate (IGP) to indole and d-glyceraldehyde 3-phosphate (G3P). Indole is then channeled to the active site of the beta subunit, a PLP-dependent enzyme that catalyzes a replacement reaction to convert L-serine into L-tryptophan. Probab=35.98 E-value=31 Score=14.98 Aligned_cols=32 Identities=19% Similarity=0.286 Sum_probs=13.2 Q ss_pred HHHHHHHHEECCCCHHHHHHHH---CCCCCEEEEC Q ss_conf 8654311112687989998851---2455315631 Q gi|254780664|r 337 IKVFTEAIIAPTLSEEAADVLA---KKPSMRFLKT 368 (536) Q Consensus 337 ~~~F~EvIiAP~f~~eAleiL~---~KKnlRil~~ 368 (536) .+.=+..+|.||.-.|..+.+. ++-++..+.+ T Consensus 101 ~~~Gv~GviipDLP~ee~~~~~~~~~~~~i~~I~l 135 (242) T cd04724 101 KEAGVDGLIIPDLPPEEAEEFREAAKEYGLDLIFL 135 (242) T ss_pred HHCCCCEEEECCCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 97599758706999578468999998659838899 No 128 >PRK13127 consensus Probab=35.70 E-value=32 Score=14.95 Aligned_cols=32 Identities=22% Similarity=0.252 Sum_probs=14.2 Q ss_pred HHHHHHHHEECCCCHHHHHHHHC---CCCCEEEEC Q ss_conf 86543111126879899988512---455315631 Q gi|254780664|r 337 IKVFTEAIIAPTLSEEAADVLAK---KPSMRFLKT 368 (536) Q Consensus 337 ~~~F~EvIiAP~f~~eAleiL~~---KKnlRil~~ 368 (536) .+.=++.+|-||.-.|.-+.+.+ +.++.++.+ T Consensus 112 ~~~GvdGlIipDLP~eE~~~~~~~~~~~gi~~I~l 146 (262) T PRK13127 112 AEAGVSGLIIPDLPVEEATDLREACKKHGLDLVFL 146 (262) T ss_pred HHCCCCEEEECCCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 87599769966999789999999998558327998 No 129 >PRK01185 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=35.48 E-value=32 Score=14.92 Aligned_cols=44 Identities=18% Similarity=0.320 Sum_probs=33.6 Q ss_pred EEEEEEEEECCCCHHHHHHHHHH---CCCEEEEEHHHHHHHHHCCCC Q ss_conf 36989987545686999999998---799999942768999988995 Q gi|254780664|r 16 VKTALISVHNKTGVVEFASRLLS---RGIKIISTGGTCQLLEEEGIP 59 (536) Q Consensus 16 ikrALiSV~dKtgl~~la~~L~~---~g~~iisTgGTa~~l~~~gi~ 59 (536) ||-++++-.|+....++|+.|.+ .+++++-..-||+.|...|++ T Consensus 1 MKi~iv~~~~~~~a~~~a~~l~~~L~~~~~v~~d~~~a~~l~~~~~~ 47 (272) T PRK01185 1 MKVAFVIRKDCKRCANIARSIISLLPSDWEIIYDKEAAKFLKKPGLD 47 (272) T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCCCCCCC T ss_conf 98999997899899999999999996699199856888870999998 No 130 >pfam04210 MtrG Tetrahydromethanopterin S-methyltransferase, subunit G. The N5-methyltetrahydromethanopterin: coenzyme M (EC:2.1.1.86) of Methanosarcina mazei Go1 is a membrane-associated, corrinoid-containing protein that uses a transmethylation reaction to drive an energy-conserving sodium ion pump. Probab=35.20 E-value=30 Score=15.07 Aligned_cols=20 Identities=15% Similarity=0.315 Sum_probs=16.6 Q ss_pred CEEEECHHHHHHHHHHHHCC Q ss_conf 13530899989889875302 Q gi|254780664|r 154 VTILTNPQDYPLFLAEMDVN 173 (536) Q Consensus 154 V~Vi~dp~dY~~~~~el~~~ 173 (536) -+++++|+||.++.+.|+.- T Consensus 5 P~~iv~p~df~ei~~rLd~i 24 (70) T pfam04210 5 PKVVVPPDDFNEIHKRLDEI 24 (70) T ss_pred CEEECCHHHHHHHHHHHHHH T ss_conf 86651889999999999999 No 131 >smart00854 PGA_cap Bacterial capsule synthesis protein PGA_cap. This protein is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein. Probab=35.08 E-value=32 Score=14.88 Aligned_cols=11 Identities=45% Similarity=0.640 Sum_probs=8.7 Q ss_pred CCCEEEEEEEC Q ss_conf 98414999948 Q gi|254780664|r 106 LESIDLVVVNL 116 (536) Q Consensus 106 i~~IDlVvvNL 116 (536) +..-|+++||| T Consensus 30 ~~~aD~~~~Nl 40 (239) T smart00854 30 LRAADLAIGNL 40 (239) T ss_pred HHHCCEEEEEC T ss_conf 97499999964 No 132 >PRK13137 consensus Probab=34.74 E-value=33 Score=14.84 Aligned_cols=34 Identities=12% Similarity=0.116 Sum_probs=16.5 Q ss_pred HHHHHHHHHHEECCCCHHHHHHHH---CCCCCEEEEC Q ss_conf 788654311112687989998851---2455315631 Q gi|254780664|r 335 EVIKVFTEAIIAPTLSEEAADVLA---KKPSMRFLKT 368 (536) Q Consensus 335 ~I~~~F~EvIiAP~f~~eAleiL~---~KKnlRil~~ 368 (536) ...+.=+..+|-||.-.|--+-|. ++.++-++.+ T Consensus 123 ~a~~aGvdGlIipDLP~eE~~~~~~~~~~~gi~~I~l 159 (266) T PRK13137 123 LFQEAGVDGLILPDLPPDQDPEIADLAAEIGLAVTFL 159 (266) T ss_pred HHHHCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 9997696099947999788899999998759978999 No 133 >PRK13131 consensus Probab=34.72 E-value=33 Score=14.84 Aligned_cols=10 Identities=40% Similarity=0.388 Sum_probs=3.4 Q ss_pred HHHHHHHHHH Q ss_conf 1689999998 Q gi|254780664|r 301 TLVEAYRRAL 310 (536) Q Consensus 301 ~~~~Ay~~A~ 310 (536) .+-+|..+|+ T Consensus 57 vIQ~a~~rAL 66 (257) T PRK13131 57 TIQASHLRAL 66 (257) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 134 >cd01391 Periplasmic_Binding_Protein_Type_1 Type 1 periplasmic binding fold superfamily. Type 1 periplasmic binding fold superfamily. This model and hierarchy represent the ligand binding domains of the LacI family of transcriptional regulators, periplasmic binding proteins of the ABC-type transport systems, the family C G-protein couples receptors (GPCRs), membrane bound guanylyl cyclases including the family of natriuretic peptide receptors (NPRs), and the N-terminal leucine/isoleucine/valine- binding protein (LIVBP)-like domains of the ionotropic glutamate receptors (iGluRs). In LacI-like transcriptional regulator and the bacterial periplasmic binding proteins the ligands are monosaccharides including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars, with a few exceptions. Periplasmic sugar binding proteins are one of the components of ABC transporters and are involved in the active transport of water-soluble ligands. The LacI family of proteins con Probab=34.62 E-value=33 Score=14.83 Aligned_cols=87 Identities=17% Similarity=0.114 Sum_probs=37.3 Q ss_pred HHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCC--CHHHHHHHHHH--HHHHH Q ss_conf 011677642455672799961568656545201689999998533213200455542754--21457878865--43111 Q gi|254780664|r 269 AAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREV--DQEVAKEVIKV--FTEAI 344 (536) Q Consensus 269 aA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~v--d~~~A~~I~~~--F~EvI 344 (536) .+..++..+.+...-..+++-..+. ..+....+.|+.++..-+...- .+.-.+... +...+..+.+. =.++| T Consensus 111 ~~~~~~~~l~~~g~~~i~~i~~~~~---~~~~~r~~g~~~~~~~~g~~~~-~~~~~~~~~~~~~~~~~~~l~~~~~~~ai 186 (269) T cd01391 111 AGEAAAEYLAEKGWKRVALIYGDDG---AYGRERLEGFKAALKKAGIEVV-AIEYGDLDTEKGFQALLQLLKAAPKPDAI 186 (269) T ss_pred HHHHHHHHHHHHCCCEEEEECCCCC---HHHHHHHHHHHHHHHHCCCCEE-EEEEECCCCCHHHHHHHHHHHHCCCCEEE T ss_conf 9999986898718961699737985---7799999999999998699867-99971343214899999998618997399 Q ss_pred EECCCCHHHHHHHHCC Q ss_conf 1268798999885124 Q gi|254780664|r 345 IAPTLSEEAADVLAKK 360 (536) Q Consensus 345 iAP~f~~eAleiL~~K 360 (536) ++-+. ..|+.+++.= T Consensus 187 ~~~~d-~~a~g~~~a~ 201 (269) T cd01391 187 FACND-EMAAGALKAA 201 (269) T ss_pred EECCH-HHHHHHHHHH T ss_conf 98888-9999999999 No 135 >cd01981 Pchlide_reductase_B Pchlide_reductase_B: B protein of the NB protein complex of Protochlorophyllide (Pchlide)_reductase. Pchlide reductase catalyzes the reductive formation of chlorophyllide (chlide) from protochlorophyllide (pchlide) during biosynthesis of chlorophylls and bacteriochlorophylls. This group contains both the light-independent Pchlide reductase (DPOR) and light-dependent Pchlide reductase (LPOR). Angiosperms contain only LPOR, cyanobacteria, algae and gymnosperms contain both DPOR and LPOR, primitive anoxygenic photosynthetic bacteria contain only DPOR. NB is structurally similar to the FeMo protein of nitrogenase, forming an N2B2 heterotetramer. N and B are homologous to the FeMo alpha and beta subunits respectively. Also in common with nitrogenase in vitro DPOR activity requires ATP hydrolysis and dithoionite or ferredoxin as electron donor. The NB protein complex may serve as a catalytic site for Pchlide reduction similar to MoFe for nitrogen reduction. Probab=34.54 E-value=33 Score=14.82 Aligned_cols=70 Identities=20% Similarity=0.201 Sum_probs=35.0 Q ss_pred HCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHH Q ss_conf 10589888312221200116776424556727999615686565452016899999985332132004555427542145 Q gi|254780664|r 253 LVQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEV 332 (536) Q Consensus 253 ql~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~ 332 (536) .+.-.-+.|+|.-|+..--+++.++.- ..-+++ |+|.. .+.++++-.++=. |++.+.....+ T Consensus 167 iig~~~~g~~~~gD~~eikrll~~~Gi---~v~~~~----p~~~t-----~~ei~~~~~A~ln------vv~~~~~~~~~ 228 (430) T cd01981 167 LIGPSSLGFHNRHDCRELKRLLHTLGI---EVNVVI----PEGAS-----VDDLNELPKAWFN------IVPYREYGLSA 228 (430) T ss_pred EEECCCCCCCCHHHHHHHHHHHHHCCC---EEEEEE----CCCCC-----HHHHHHHHHCCEE------EEEHHHHHHHH T ss_conf 982476777775689999999998399---189981----89999-----9999864126677------86049899999 Q ss_pred HHHHHHHH Q ss_conf 78788654 Q gi|254780664|r 333 AKEVIKVF 340 (536) Q Consensus 333 A~~I~~~F 340 (536) |+.+.+.| T Consensus 229 a~~Le~~~ 236 (430) T cd01981 229 ALYLEEEF 236 (430) T ss_pred HHHHHHHH T ss_conf 99999985 No 136 >cd06319 PBP1_ABC_sugar_binding_like_10 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=34.51 E-value=33 Score=14.82 Aligned_cols=54 Identities=13% Similarity=0.076 Sum_probs=32.1 Q ss_pred HHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHH--H-HHEECCCCHHHHHHHHCC Q ss_conf 9999985332132004555427542145787886543--1-111268798999885124 Q gi|254780664|r 305 AYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFT--E-AIIAPTLSEEAADVLAKK 360 (536) Q Consensus 305 Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~--E-vIiAP~f~~eAleiL~~K 360 (536) +.+..+...|- +-+|++.|...-....+.+.+.=. + .|+.=|.++++++.++.- T Consensus 175 ~~~~~L~~~p~--i~ai~~~~d~~a~Ga~~a~~~~G~~~~i~vvg~D~~~~~~~~i~~G 231 (277) T cd06319 175 YTNDLLTANPD--IRAIWLQGSDRYQGALDAIATAGKTGKVLLICFDAEPEFIELLKSG 231 (277) T ss_pred HHHHHHHHCCC--CCEEEECCCHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHCC T ss_conf 99999973899--7689966988999999999975999985899637979999998759 No 137 >KOG0774 consensus Probab=34.47 E-value=33 Score=14.81 Aligned_cols=67 Identities=19% Similarity=0.193 Sum_probs=53.6 Q ss_pred CCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCHHHHHCCCCCC Q ss_conf 75421457878865431111268798999885124553156316766777666402032102231012334 Q gi|254780664|r 326 REVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDNV 396 (536) Q Consensus 326 ~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~~ 396 (536) |..++.+-+.+.+.|+.-+--|-=++||.+-|.++=|+.|-+...+.. . ..+|..-.+...|+.+++ T Consensus 193 RNFsK~aTeiLneyF~~h~~nPYPSee~K~eLAkqCnItvsQvsnwfg--n--krIrykK~~~k~~ee~~l 259 (334) T KOG0774 193 RNFSKQATEILNEYFYSHLSNPYPSEEAKEELAKQCNITVSQVSNWFG--N--KRIRYKKNMGKNQEEANL 259 (334) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCEEHHHCCCCC--C--CEEEHHHHHHHHHHHHHH T ss_conf 164421799999999973379998378999999872934056323224--5--403215310023655546 No 138 >TIGR03627 arch_S9P archaeal ribosomal protein S9P. This model describes exclusively the archaeal ribosomal protein S9P. Homologous eukaryotic and bacterial ribosomal proteins are excluded from this model. Probab=34.17 E-value=28 Score=15.35 Aligned_cols=22 Identities=23% Similarity=0.265 Sum_probs=19.4 Q ss_pred ECCCCCCHHHHHHHHHHHHHHH Q ss_conf 7296888899999999998764 Q gi|254780664|r 444 IGSGQTSRVDSTRFAAIKAHNI 465 (536) Q Consensus 444 iGaGQ~sRVda~~iA~~KA~~~ 465 (536) -|.|.++..+|+++|+.||-.. T Consensus 64 ~GGG~sgQa~AirlaIaraL~~ 85 (130) T TIGR03627 64 SGGGIMGQADAARTAIARGLVE 85 (130) T ss_pred EECCEEHHHHHHHHHHHHHHHH T ss_conf 8288306887999999999998 No 139 >PTZ00094 serine hydroxymethyltransferase; Provisional Probab=34.06 E-value=27 Score=15.42 Aligned_cols=72 Identities=18% Similarity=0.271 Sum_probs=40.1 Q ss_pred CCEEEEECCCCCHHHHHHHHHHHHHHH-------------------EECCCCHHHHHHHHC---------CCCCEEEECC Q ss_conf 004555427542145787886543111-------------------126879899988512---------4553156316 Q gi|254780664|r 318 FGGIIAFNREVDQEVAKEVIKVFTEAI-------------------IAPTLSEEAADVLAK---------KPSMRFLKTS 369 (536) Q Consensus 318 FGGIIa~N~~vd~~~A~~I~~~F~EvI-------------------iAP~f~~eAleiL~~---------KKnlRil~~~ 369 (536) =||+|-+|+..+.+.++.|.+-.+-.+ .-|+|.+=|..+++. +...+++. T Consensus 248 rGGiIl~~k~~~~e~~~kId~aVFPg~qggph~h~IAa~Ava~~Ea~~~efk~Ya~qVv~NAkaLa~~L~~~G~~vvs-- 325 (450) T PTZ00094 248 RSGMIFFRKKIRPDIESKINQAVFPGLQGGPHFHQIAAIAVQLKEVCSPEWKEYAKQVVDNAKALAAELEKRGFDLVT-- 325 (450) T ss_pred CCEEEEECCCCHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCEEEE-- T ss_conf 842898446640899998545128854467778799999999999757679999999999999999999978996950-- Q ss_pred CCCCCCCCCCEECCCC-HHHHHC Q ss_conf 7667776664020321-022310 Q gi|254780664|r 370 SLLDFHGEEIVLKTVS-GGILVQ 391 (536) Q Consensus 370 ~~~~~~~~~~~~rsi~-GG~LvQ 391 (536) ...+...--+++|+.. +|.-++ T Consensus 326 ggTdnHlvlvDl~~~g~~G~~ae 348 (450) T PTZ00094 326 GGTDNHLILWNLRPFGITGSKVE 348 (450) T ss_pred CCCCCEEEEEECCCCCCCHHHHH T ss_conf 89976279997777899999999 No 140 >TIGR00338 serB phosphoserine phosphatase SerB; InterPro: IPR004469 Phosphoserine phosphatase (SerB), (3.1.3.3 from EC), also known as O-phosphoserine phosphohydrolase, is involved in both serine and glycine biosynthesis. It catalyzes the reaction 3-phospho-serine + H2O = L-serine + phosphate, which is the last step in the biosynthesis of serine from carbohydrates. The reaction proceeds via the formation of a phosphoryl-enzyme intermediate. It acts as a homodimer, and requires magnesium as a cofactor.; GO: 0004647 phosphoserine phosphatase activity, 0006564 L-serine biosynthetic process. Probab=33.97 E-value=34 Score=14.75 Aligned_cols=84 Identities=21% Similarity=0.380 Sum_probs=47.8 Q ss_pred CCCHHHHHHHHHHCCCEEEEEHHHHHHH-----HHCCCCEEEHHHHCCCCHHC-CCCCCCCCCHHHHHHHHCC-CCHHHH Q ss_conf 5686999999998799999942768999-----98899517946534881651-9855323715553553308-898999 Q gi|254780664|r 26 KTGVVEFASRLLSRGIKIISTGGTCQLL-----EEEGIPVTSVFDITKFPEIM-GGRVKTLHPKIYGGILSIR-DNPAHM 98 (536) Q Consensus 26 Ktgl~~la~~L~~~g~~iisTgGTa~~l-----~~~gi~v~~Vs~~TgfpEil-~GRVKTLHP~I~ggIL~~r-~~~~~~ 98 (536) -.|+.|+++.|++.||.+.=-+|-..+. .+.||+.|.+-- +-=++- ||+ |-=.+-|-|+... .-+-=. T Consensus 88 ~~G~~Elv~~L~~~gykVav~SGGF~~~a~~~k~~L~LD~tn~fa--N~L~~~~dG~---LTG~v~G~~v~~~~K~~~l~ 162 (223) T TIGR00338 88 TEGAEELVKTLKEKGYKVAVISGGFDLFAEHLKDKLGLDYTNAFA--NRLEVEFDGK---LTGLVEGDIVDASYKGETLL 162 (223) T ss_pred CCCHHHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHCCCHHHHHH--CEEEEEECCC---CCCCCCCCCCCCCCCHHHHH T ss_conf 978789999998869079998577688898889985788647210--2004622773---14842365000003508999 Q ss_pred HHHHHCCCCCEEEEEE Q ss_conf 9999779984149999 Q gi|254780664|r 99 KFMQDHELESIDLVVV 114 (536) Q Consensus 99 ~~l~~~~i~~IDlVvv 114 (536) .-++++||++=+.|.| T Consensus 163 ~l~~~e~i~~~~~vav 178 (223) T TIGR00338 163 KLLEKEGISLENTVAV 178 (223) T ss_pred HHHHHHCCCHHHEEEE T ss_conf 9999738882218776 No 141 >TIGR02346 chap_CCT_theta T-complex protein 1, theta subunit; InterPro: IPR012721 Members of this eukaryotic family are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT theta chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=33.92 E-value=34 Score=14.75 Aligned_cols=43 Identities=33% Similarity=0.424 Sum_probs=21.2 Q ss_pred CCCCCHHHHHHHHHHH----HHHHCCCCCEEEEEECC--EEEEECCCCCCHHHHHHH Q ss_conf 7778688988899999----98600566779999689--389972968888999999 Q gi|254780664|r 407 KRSPTDQELRDMKFAF----KVVKHVKSNAVVYAKDG--RTVGIGSGQTSRVDSTRF 457 (536) Q Consensus 407 ~~~pt~~e~~dL~FA~----kv~K~vkSNAIv~ak~~--~tiGiGaGQ~sRVda~~i 457 (536) -|-.|..-++|++=|. +++| +++||+ ..+ -|+|=|-=.=|-|| T Consensus 380 LRGsT~N~LDDiERAiDDGVN~~K-------~L~KD~GGrl~-pGaGAtEieL~~~L 428 (554) T TIGR02346 380 LRGSTKNLLDDIERAIDDGVNVIK-------ALVKDNGGRLL-PGAGATEIELAKRL 428 (554) T ss_pred ECCCCCCHHHHHHHHHHCCHHHHH-------EEECCCCCCEE-CCCCCHHHHHHHHH T ss_conf 523000013444444304312101-------00058986150-57772489999999 No 142 >PRK13978 ribose-5-phosphate isomerase A; Provisional Probab=33.73 E-value=34 Score=14.73 Aligned_cols=34 Identities=12% Similarity=0.184 Sum_probs=22.7 Q ss_pred HHHCCCCCCHHHHHH--HHCCCCCCEEEECHHHHHH Q ss_conf 320023343469998--7401563135308999898 Q gi|254780664|r 132 MVENIDIGGPSMIRA--AAKNHDYVTILTNPQDYPL 165 (536) Q Consensus 132 ~IEnIDIGGpsmiRA--AAKN~~~V~Vi~dp~dY~~ 165 (536) -.-.|-=||.+|+|- -|.+-+...+|+|..-+-. T Consensus 93 ~lnlIKGgGgal~rEKivA~~ak~~IiI~D~sK~v~ 128 (228) T PRK13978 93 SLNIIKGGGGALFREKVIDEMASRFVVVVDETKIVQ 128 (228) T ss_pred CCCEEEECCHHHHHHHHHHHCCCCEEEEEECCHHHH T ss_conf 878898784999999999970384899983654345 No 143 >KOG4127 consensus Probab=33.61 E-value=34 Score=14.72 Aligned_cols=52 Identities=23% Similarity=0.277 Sum_probs=28.9 Q ss_pred CHHHHHHHHCCCCCCCEEEEECCCCCCCCCC-CCHHHHHHHHHHHHCCCCCCCEEEEECCCC Q ss_conf 0011677642455672799961568656545-201689999998533213200455542754 Q gi|254780664|r 268 DAAFELVSEFRSQDCAACVIVKHMNPCGVAT-ADTLVEAYRRALSCDPISAFGGIIAFNREV 328 (536) Q Consensus 268 daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~-~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~v 328 (536) ++.++-+-+..+ |-||+-|.++-++-- ..|+-+-..+.++-- ||+|-+|.-- T Consensus 255 ~atm~~aL~vS~----APVIFSHSsA~~vcns~rNVPDdVL~llk~N-----gGvVMVnfy~ 307 (419) T KOG4127 255 DATMRDALEVSR----APVIFSHSSAYSVCNSSRNVPDDVLQLLKEN-----GGVVMVNFYP 307 (419) T ss_pred HHHHHHHHHHHC----CCEEEECCCHHHHHCCCCCCCHHHHHHHHHC-----CCEEEEEEEC T ss_conf 899999997523----8657633557877537567958999987534-----9789998420 No 144 >PRK04046 translation initiation factor IF-6; Provisional Probab=33.58 E-value=34 Score=14.71 Aligned_cols=64 Identities=23% Similarity=0.415 Sum_probs=41.5 Q ss_pred CCCCCCCCCCCHHHH----HHHHHHH------HCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHC Q ss_conf 568656545201689----9999985------332132004555427542145787886543111126879899988512 Q gi|254780664|r 290 HMNPCGVATADTLVE----AYRRALS------CDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAK 359 (536) Q Consensus 290 H~NPCGvA~~~~~~~----Ay~~A~~------~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~ 359 (536) -.|-=|..+.....+ ..++... -+..+|.|-+|++|... .++-|++++++.++++. T Consensus 62 ~gNsnGllvp~~~~d~El~~l~~~~~v~V~~l~~~~nAlGN~Il~ND~~--------------Alv~p~l~~e~~e~I~d 127 (221) T PRK04046 62 VGNSNGILVPRIVLDEELELLKNALDVNVEVLPSKLTALGNLILANDKG--------------ALVSPELSDEARKVIED 127 (221) T ss_pred EECCCEEEECCCCCHHHHHHHHHCCCCEEEEECCCCCCCCCEEEECCCC--------------EEECCCCCHHHHHHHHH T ss_conf 7058759937847899999887547917999788666754589975864--------------69768799899999988 Q ss_pred CCCCEEEE Q ss_conf 45531563 Q gi|254780664|r 360 KPSMRFLK 367 (536) Q Consensus 360 KKnlRil~ 367 (536) -=+..+.+ T Consensus 128 ~L~Vev~~ 135 (221) T PRK04046 128 VLGVEVIR 135 (221) T ss_pred HHCCEEEE T ss_conf 63956999 No 145 >PRK13122 consensus Probab=33.56 E-value=34 Score=14.71 Aligned_cols=12 Identities=33% Similarity=0.465 Sum_probs=4.6 Q ss_pred HHHHEECCCCHH Q ss_conf 311112687989 Q gi|254780664|r 341 TEAIIAPTLSEE 352 (536) Q Consensus 341 ~EvIiAP~f~~e 352 (536) ++.+|-||.-.| T Consensus 104 vdGvIipDLP~e 115 (242) T PRK13122 104 VYGLIIPDLPYE 115 (242) T ss_pred CCEEECCCCCHH T ss_conf 986777899878 No 146 >PRK09522 bifunctional anthranilate synthase/anthranilate phosphoribosyltransferase; Provisional Probab=33.51 E-value=29 Score=15.23 Aligned_cols=68 Identities=18% Similarity=0.300 Sum_probs=37.6 Q ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHH-HHHHHHHHHHEECCCCHHHHHHHHCCCC Q ss_conf 799961568656545201689999998533213200455542754214578-7886543111126879899988512455 Q gi|254780664|r 284 ACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAK-EVIKVFTEAIIAPTLSEEAADVLAKKPS 362 (536) Q Consensus 284 a~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~-~I~~~F~EvIiAP~f~~eAleiL~~KKn 362 (536) .+-|.||.|.. +++..- .+|=+.++| +|-..+.+.++ .|.+.=+=-+.||.|.+.-..+-.-+|. T Consensus 301 Gv~VaKHGNrs-~sSksG---------SADvLealG----v~i~~~~~~~~~~l~~~gi~FlfAp~~Hpamk~v~pvRk~ 366 (531) T PRK09522 301 GLKVAKHGNRS-VSSKSG---------SSDLLAAFG----INLDMNADKSRQALDELGVCFLFAPKYHTGFRHAMPVRQQ 366 (531) T ss_pred CCEEEEECCCC-CCCCCC---------HHHHHHHCC----CCCCCCHHHHHHHHHHCCEEEEECHHHCHHHHHHHHHHHH T ss_conf 98676058987-798862---------999999709----9988999999999997491586234336669999999997 Q ss_pred CEE Q ss_conf 315 Q gi|254780664|r 363 MRF 365 (536) Q Consensus 363 lRi 365 (536) |.+ T Consensus 367 Lg~ 369 (531) T PRK09522 367 LKT 369 (531) T ss_pred CCC T ss_conf 198 No 147 >PRK01792 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional Probab=33.43 E-value=34 Score=14.70 Aligned_cols=38 Identities=24% Similarity=0.316 Sum_probs=27.7 Q ss_pred HHHHHHHCCCC------EEE-ECCCCCCHHHHHHHHHHCCCEEEE Q ss_conf 58999971995------999-389877988999999875977997 Q gi|254780664|r 491 GIVEAIKAGVT------AVI-QPGGSVRDSEAITVADQHGIAMVF 528 (536) Q Consensus 491 ~ie~aa~~Gi~------aIi-qPGGSirD~evI~aan~~gi~m~f 528 (536) +|+++.-+|.+ -|+ .+|.+.|-+|..+.|.+|+++|+- T Consensus 157 avdL~~lAGl~P~avicEil~ddG~ma~~~el~~fA~~~~lp~is 201 (214) T PRK01792 157 AVDLARLAGYKEAGVICEITNDDGTMARTPEIVEFAKKFGYSVVT 201 (214) T ss_pred HHHHHHHCCCCCEEEEEEEECCCCCCCCHHHHHHHHHHCCCCEEE T ss_conf 999999829997199999845998736779999999983997998 No 148 >cd06578 HemD Uroporphyrinogen-III synthase (HemD) catalyzes the asymmetrical cyclization of tetrapyrrole (linear) to uroporphyrinogen-III, the fourth step in the biosynthesis of heme. This ubiquitous enzyme is present in eukaryotes, bacteria and archaea. Mutations in the human uroporphyrinogen-III synthase gene cause congenital erythropoietic porphyria, a recessive inborn error of metabolism also known as Gunther disease. Probab=33.09 E-value=35 Score=14.66 Aligned_cols=43 Identities=16% Similarity=0.151 Sum_probs=25.8 Q ss_pred CCHHHHHHHHHHHHHHH-EECCCCHHHH-HHHHC--CCCCEEEECCC Q ss_conf 42145787886543111-1268798999-88512--45531563167 Q gi|254780664|r 328 VDQEVAKEVIKVFTEAI-IAPTLSEEAA-DVLAK--KPSMRFLKTSS 370 (536) Q Consensus 328 vd~~~A~~I~~~F~EvI-iAP~f~~eAl-eiL~~--KKnlRil~~~~ 370 (536) |-..||+.+.+..+.+. +.+.++.|.| +.+.. .++.+++.... T Consensus 84 VG~~Ta~~l~~~g~~~~~~~~~~~se~L~~~~~~~~~~~~~il~~~g 130 (239) T cd06578 84 VGPKTAEALREAGLTADFVPEEGDSEGLLELLELQDGKGKRILRPRG 130 (239) T ss_pred CCHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHHHCCCCCEEEEECC T ss_conf 48899999997299855778889999999999873468986999847 No 149 >TIGR01283 nifE nitrogenase MoFe cofactor biosynthesis protein NifE; InterPro: IPR005973 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. ; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=32.70 E-value=35 Score=14.61 Aligned_cols=34 Identities=29% Similarity=0.508 Sum_probs=27.1 Q ss_pred CCCEEEECCCCCCHHHHHHHHHHCCCEEEECCCC Q ss_conf 9959993898779889999998759779974785 Q gi|254780664|r 499 GVTAVIQPGGSVRDSEAITVADQHGIAMVFTGIR 532 (536) Q Consensus 499 Gi~aIiqPGGSirD~evI~aan~~gi~m~fTg~R 532 (536) |=++.|--||..+=.-||.|..++||..|.||++ T Consensus 337 GKkaaiY~GGa~KswSlv~Al~dLGMeVV~~GTq 370 (470) T TIGR01283 337 GKKAAIYTGGAVKSWSLVSALQDLGMEVVATGTQ 370 (470) T ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCEEEEEEEC T ss_conf 9889986586478999998884579179998300 No 150 >smart00654 eIF6 translation initiation factor 6. Probab=32.69 E-value=35 Score=14.61 Aligned_cols=62 Identities=23% Similarity=0.418 Sum_probs=40.8 Q ss_pred CCCCCCCCCCHHHH----HHHHHHH--------HCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHH Q ss_conf 68656545201689----9999985--------33213200455542754214578788654311112687989998851 Q gi|254780664|r 291 MNPCGVATADTLVE----AYRRALS--------CDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLA 358 (536) Q Consensus 291 ~NPCGvA~~~~~~~----Ay~~A~~--------~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~ 358 (536) .|-=|..+.....+ -.++.+. -++++|.|-+|++|... .++-|+++++..++++ T Consensus 61 GNsnGllvp~~~~d~El~~l~~~l~~~v~V~~l~~~~~AlGN~I~~ND~~--------------Alv~p~l~~e~~e~I~ 126 (200) T smart00654 61 GNSNGLLVPNTTTDQELQHLRNSLPDSVEVQRVEERLTALGNLILCNDHG--------------ALASPDLSKETEEIIS 126 (200) T ss_pred ECCCEEEECCCCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCEEEECCCC--------------EEECCCCCHHHHHHHH T ss_conf 62885892797999999998863745741999688655656789976865--------------7986879989999998 Q ss_pred CCCCCEEE Q ss_conf 24553156 Q gi|254780664|r 359 KKPSMRFL 366 (536) Q Consensus 359 ~KKnlRil 366 (536) .-=+..+. T Consensus 127 dvL~Vev~ 134 (200) T smart00654 127 DVLGVEVF 134 (200) T ss_pred HHHCCEEE T ss_conf 74093599 No 151 >PRK01026 tetrahydromethanopterin S-methyltransferase subunit G; Provisional Probab=32.56 E-value=34 Score=14.76 Aligned_cols=20 Identities=25% Similarity=0.544 Sum_probs=16.5 Q ss_pred CEEEECHHHHHHHHHHHHCC Q ss_conf 13530899989889875302 Q gi|254780664|r 154 VTILTNPQDYPLFLAEMDVN 173 (536) Q Consensus 154 V~Vi~dp~dY~~~~~el~~~ 173 (536) -++++||+||.++.+.|+.- T Consensus 6 P~~~v~~~df~ei~~rLd~i 25 (75) T PRK01026 6 PTVVVDPKDYKEIMKRLDEI 25 (75) T ss_pred CEEECCHHHHHHHHHHHHHH T ss_conf 81035889999999999999 No 152 >PRK04200 cofactor-independent phosphoglycerate mutase; Provisional Probab=32.51 E-value=35 Score=14.59 Aligned_cols=29 Identities=14% Similarity=0.052 Sum_probs=12.9 Q ss_pred CCCCCCCCCCEEEECCCCCCCCCCHHHCCC Q ss_conf 367881006505760477668210121058 Q gi|254780664|r 227 RYGENPHQKAALYSTPEKKSGIAHAVLVQG 256 (536) Q Consensus 227 RYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~G 256 (536) +-|++|--.-++ ........+..|....| T Consensus 204 ~~g~~paN~ill-rg~G~~p~lp~F~e~~g 232 (395) T PRK04200 204 SEGKDPANSIWL-WGQGYAPKMPTFKERYG 232 (395) T ss_pred HCCCCCCCEEEE-CCCCCCCCCCCHHHHHC T ss_conf 769986657876-42798988999768738 No 153 >TIGR01702 CO_DH_cata carbon-monoxide dehydrogenase, catalytic subunit; InterPro: IPR010047 This entry describes the Ni-containing carbon monoxide dehydrogenase (CODH) family (1.2.99.2 from EC). These enzymes reversibly oxidise CO to CO(2), and play key roles in the energy-yielding pathways of various autotrophic anaerobes, allowing these organisms to grow with CO as the sole carbon source . The active sites of these enzymes contain Ni-iron-sulphur (NiFeS) clusters. This family also includes the CODH subunit of bifunctional CODH/acetyl-CoA synthetase (CODH/ACS), which contains two Ni-containing active sites to catalyse the formation of acetyl-CoA from CO(2)-derived CO , . CODH is related to hybrid cluster proteins (HCP, or prismane) (IPR010048 from INTERPRO), a complex whose activity is not yet fully described; the hybrid Fe-S cluster in HCPs is similar to the NiFeS cluster in CODH .; GO: 0016151 nickel ion binding, 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0051539 4 iron 4 sulfur cluster binding, 0006091 generation of precursor metabolites and energy, 0005737 cytoplasm. Probab=32.28 E-value=36 Score=14.57 Aligned_cols=58 Identities=19% Similarity=0.142 Sum_probs=27.5 Q ss_pred CCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCC-CEEEECCCCC--CCCCCCCE---ECCCCHHHHHCCC Q ss_conf 542145787886543111126879899988512455-3156316766--77766640---2032102231012 Q gi|254780664|r 327 EVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPS-MRFLKTSSLL--DFHGEEIV---LKTVSGGILVQTR 393 (536) Q Consensus 327 ~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKn-lRil~~~~~~--~~~~~~~~---~rsi~GG~LvQ~~ 393 (536) +.|.+.|+++.+.-. +.|+|=|++||. -+...++..+ +..---+. +....|+.+-|++ T Consensus 373 ~y~~~~A~E~A~~iI---------~~AIeaFk~R~~s~~~~~~~~~kG~~k~vvGfS~Eal~~~Lg~~~~~~P 436 (647) T TIGR01702 373 PYDPEKAEETAKTII---------EMAIEAFKERKESKQKVAIPKEKGKQKVVVGFSEEALVKALGGKLADDP 436 (647) T ss_pred CCCHHHHHHHHHHHH---------HHHHHHHHCCCCCCCCCCCCCCCCCCEEEEEECHHHHHHHHCCCCCCCC T ss_conf 988767889999999---------9999976303467886565400486217883077999997246423475 No 154 >cd03376 TPP_PFOR_porB_like Thiamine pyrophosphate (TPP family), PFOR porB-like subfamily, TPP-binding module; composed of proteins similar to the beta subunit (porB) of the Helicobacter pylori four-subunit pyruvate ferredoxin oxidoreductase (PFOR), which are also found in archaea and some hyperthermophilic bacteria. PFOR catalyzes the oxidative decarboxylation of pyruvate to form acetyl-CoA, a crucial step in many metabolic pathways. Archaea, anaerobic bacteria and eukaryotes that lack mitochondria (and therefore pyruvate dehydrogenase) use PFOR to oxidatively decarboxylate pyruvate, with ferredoxin or flavodoxin as the electron acceptor. The 36-kDa porB subunit contains the binding sites for the cofactors, TPP and a divalent metal cation, which are required for activity. Probab=32.24 E-value=26 Score=15.61 Aligned_cols=29 Identities=31% Similarity=0.423 Sum_probs=24.9 Q ss_pred CCCCCHHHHHHHHCCCCCCEEEECHHHHH Q ss_conf 23343469998740156313530899989 Q gi|254780664|r 136 IDIGGPSMIRAAAKNHDYVTILTNPQDYP 164 (536) Q Consensus 136 IDIGGpsmiRAAAKN~~~V~Vi~dp~dY~ 164 (536) -|||+-.++.|+..|..-+.++.|-.-|. T Consensus 93 ~dIG~~~l~haa~Rn~~i~~iv~dN~~Yg 121 (235) T cd03376 93 ADIGFQALSGAAERGHDILYICYDNEAYM 121 (235) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCHHHC T ss_conf 54207889999982998399998882100 No 155 >COG0752 GlyQ Glycyl-tRNA synthetase, alpha subunit [Translation, ribosomal structure and biogenesis] Probab=32.14 E-value=24 Score=15.80 Aligned_cols=22 Identities=32% Similarity=0.640 Sum_probs=15.6 Q ss_pred CCCCCCCCCCCCCCCCCCEEEE Q ss_conf 0123333436788100650576 Q gi|254780664|r 219 TAVKKQEMRYGENPHQKAALYS 240 (536) Q Consensus 219 ~~~~~~~LRYGENPHQ~Aa~Y~ 240 (536) +..+-.+=||||||.---.+|. T Consensus 59 PSRRP~DGRYGenPNRlq~yyQ 80 (298) T COG0752 59 PSRRPTDGRYGENPNRLQHYYQ 80 (298) T ss_pred CCCCCCCCCCCCCCHHHHHHEE T ss_conf 6779988877889266523315 No 156 >PRK06721 threonine synthase; Reviewed Probab=31.82 E-value=36 Score=14.51 Aligned_cols=15 Identities=13% Similarity=0.197 Sum_probs=9.4 Q ss_pred HHHHCCCCCCEEEEC Q ss_conf 987401563135308 Q gi|254780664|r 145 RAAAKNHDYVTILTN 159 (536) Q Consensus 145 RAAAKN~~~V~Vi~d 159 (536) -||+++++-+.++.+ T Consensus 93 ~aa~~G~~~~IvmP~ 107 (352) T PRK06721 93 YAARLGMKCIIVIPE 107 (352) T ss_pred HHHHHCCCEEEEECC T ss_conf 999848966999247 No 157 >pfam01634 HisG ATP phosphoribosyltransferase. Probab=31.80 E-value=26 Score=15.57 Aligned_cols=27 Identities=19% Similarity=0.159 Sum_probs=14.1 Q ss_pred HHEECCCCHHHHHHHHCC-CCCEEEECC Q ss_conf 111268798999885124-553156316 Q gi|254780664|r 343 AIIAPTLSEEAADVLAKK-PSMRFLKTS 369 (536) Q Consensus 343 vIiAP~f~~eAleiL~~K-KnlRil~~~ 369 (536) .-||-.|-.=+-+.|.++ -+.+++.+. T Consensus 66 ~rIATkYp~it~~~~~~~gi~~~ii~l~ 93 (161) T pfam01634 66 KRIATKYPNLARKYFRKNGIDAEIIKLD 93 (161) T ss_pred CEEEEECHHHHHHHHHHCCCEEEEEECC T ss_conf 7999965399999999819826899755 No 158 >cd04630 CBS_pair_17 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic gener Probab=31.76 E-value=36 Score=14.51 Aligned_cols=92 Identities=11% Similarity=0.142 Sum_probs=35.0 Q ss_pred HHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC Q ss_conf 99988512455315631676677766640203210223101233453100012467778688988899999986005667 Q gi|254780664|r 352 EAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDNVVDNKELTVVTKRSPTDQELRDMKFAFKVVKHVKSN 431 (536) Q Consensus 352 eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~~~~~~~~~vVT~~~pt~~e~~dL~FA~kv~K~vkSN 431 (536) ||++++.++.-.+++...........-+..|.+...++.+..+.. ...=..+.|+.-.+-..-.++..|.+....-+=. T Consensus 14 eA~~~M~~~~i~~v~V~~~~~~~~vGiiT~~Di~~~~~~~~~~~~-~~~V~~vMt~~v~tv~~d~~~~~a~~~m~~~~i~ 92 (114) T cd04630 14 EALQLMKEHGVSSLVVEKRRESDAYGIVTMRDILKKVVAEGRDPD-RVNVYEIMTKPLISVSPDMDIKYCARLMERTNIR 92 (114) T ss_pred HHHHHHHHHCCCEEEEEECCCCCEEEEEEEHHHHHHHHHCCCCCC-CCCHHHHCCCCEEEECCCCCHHHHHHHHHHCCCC T ss_conf 999999982998899997899608999981898999984678944-4498682703549999949499999999977972 Q ss_pred EEEEEECCEEEEE Q ss_conf 7999968938997 Q gi|254780664|r 432 AVVYAKDGRTVGI 444 (536) Q Consensus 432 AIv~ak~~~tiGi 444 (536) .+.+.+++..+|| T Consensus 93 ~lpVvd~~~lvGi 105 (114) T cd04630 93 RAPVVENNELIGI 105 (114) T ss_pred EEEEEECCEEEEE T ss_conf 9999999999999 No 159 >cd00733 GlyRS_alpha_core Class II Glycyl-tRNA synthetase (GlyRS) alpha subunit core catalytic domain. GlyRS functions as a homodimer in eukaryotes, archaea and some bacteria and as a heterotetramer in the remainder of prokaryotes and in arabidopsis. It is responsible for the attachment of glycine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. This alignment contains only sequences from the GlyRS form which heterotetramerizes. The homodimer form of GlyRS is in a different family of class II aaRS. Class II assignment is based upon structure and the presence of three characteristic sequence motifs. Probab=31.61 E-value=23 Score=16.01 Aligned_cols=21 Identities=33% Similarity=0.668 Sum_probs=14.4 Q ss_pred CCCCCCCCCCCCCCCCCEEEE Q ss_conf 123333436788100650576 Q gi|254780664|r 220 AVKKQEMRYGENPHQKAALYS 240 (536) Q Consensus 220 ~~~~~~LRYGENPHQ~Aa~Y~ 240 (536) -.+-.+=||||||.---.+|. T Consensus 55 srRP~DgRYG~NPNRlq~y~Q 75 (279) T cd00733 55 SRRPTDGRYGENPNRLQHYYQ 75 (279) T ss_pred CCCCCCCCCCCCCCHHCEEEE T ss_conf 779999877789323100034 No 160 >PRK07576 short chain dehydrogenase; Provisional Probab=31.46 E-value=37 Score=14.47 Aligned_cols=40 Identities=10% Similarity=0.087 Sum_probs=21.5 Q ss_pred CEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHH Q ss_conf 1369899875456869999999987999999427689999 Q gi|254780664|r 15 AVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLE 54 (536) Q Consensus 15 ~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~ 54 (536) +=|.|||.=.-+-==...|+.|.+.|..++-++-+...+. T Consensus 7 ~gK~alVTGgs~GIG~aia~~la~~Ga~V~i~~r~~~~~~ 46 (260) T PRK07576 7 AGKNVFVVGGTSGINLGIAQAFARAGANVAVASRSQEKVD 46 (260) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCHHHHH T ss_conf 8998999589619999999999987999999979889999 No 161 >CHL00079 rps9 ribosomal protein S9 Probab=31.42 E-value=25 Score=15.65 Aligned_cols=23 Identities=17% Similarity=0.255 Sum_probs=19.0 Q ss_pred EECCCCCCHHHHHHHHHHHHHHH Q ss_conf 97296888899999999998764 Q gi|254780664|r 443 GIGSGQTSRVDSTRFAAIKAHNI 465 (536) Q Consensus 443 GiGaGQ~sRVda~~iA~~KA~~~ 465 (536) =-|.|.++..+|+++|+.||-.. T Consensus 71 V~GGG~sgQa~Air~aIaRaL~~ 93 (134) T CHL00079 71 VKGGGLTGQAEAIKLGIARALCS 93 (134) T ss_pred EECCCHHHHHHHHHHHHHHHHHH T ss_conf 97887543999999999999999 No 162 >TIGR01326 OAH_OAS_sulfhy O-acetylhomoserine aminocarboxypropyltransferase/cysteine synthase; InterPro: IPR006235 This family of sequences is a distinct clade of the Cys/Met metabolism pyridoxal phosphate-dependent enzyme superfamily. Members include examples of OAH/OAS sulfhydrylase, an enzyme with activity both as O-acetylhomoserine sulfhydrylase (OAH, 2.5.1.49 from EC) and O-acetylserine sulfhydrylase (OAS, 2.5.1.47 from EC). An alternate name for OAH sulfhydrylase is homocysteine synthase.; GO: 0016765 transferase activity transferring alkyl or aryl (other than methyl) groups, 0006520 amino acid metabolic process. Probab=31.37 E-value=37 Score=14.46 Aligned_cols=83 Identities=23% Similarity=0.297 Sum_probs=54.5 Q ss_pred CCCCCHHHCCCCC-CCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 6821012105898-883122212001167764245567279996156865654520168999999853321320045554 Q gi|254780664|r 246 SGIAHAVLVQGKP-LSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAF 324 (536) Q Consensus 246 ~~~~~~~ql~GKe-LSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~ 324 (536) +|...|-.|||=| ||-===-=.++|++++.=|.+. |. |+-.|=-|.++... |++|-.==|---||+|+.| T Consensus 285 sPFnAFLllQGLETLsLR~ERH~~NA~kVA~fL~~H--p~---V~wV~YPGL~s~~~----h~lAkkYl~~G~~G~vLsF 355 (434) T TIGR01326 285 SPFNAFLLLQGLETLSLRMERHVENALKVAEFLEAH--PK---VAWVNYPGLASHPH----HALAKKYLPKGLFGAVLSF 355 (434) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC--CC---CCCCCCCCCCCCHH----HHHHHHHHCCCCEEEEEEC T ss_conf 178999999866678767888889999999997278--84---45134888788753----7889987417942678741 Q ss_pred CCCCCHHHHHHHH Q ss_conf 2754214578788 Q gi|254780664|r 325 NREVDQEVAKEVI 337 (536) Q Consensus 325 N~~vd~~~A~~I~ 337 (536) --+==.+.++++. T Consensus 356 e~kGG~ea~~kfi 368 (434) T TIGR01326 356 EIKGGREAGKKFI 368 (434) T ss_pred CCCCCHHHHHHHH T ss_conf 3218288999998 No 163 >PRK12825 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional Probab=31.08 E-value=37 Score=14.43 Aligned_cols=90 Identities=21% Similarity=0.252 Sum_probs=47.0 Q ss_pred CCCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHH-HHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHH Q ss_conf 776136989987545686999999998799999942768-9999889951794653488165198553237155535533 Q gi|254780664|r 12 GEIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTC-QLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILS 90 (536) Q Consensus 12 ~~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa-~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~ 90 (536) ++++=|.|||.=.-+-==..+|+.|.+.|+.++-++..- +.+.+. .+++ +-.++++..+. . T Consensus 3 ~~LkgK~~lITGas~GIG~aia~~la~~G~~V~~~~~~~~~~~~~~------~~~~----~~~~~~~~~~~--------~ 64 (250) T PRK12825 3 GSLSGRVALVTGAARGIGRAIALRLAAAGADVIVHPPSDEAAAEET------VAAV----EALGRRAQAVQ--------A 64 (250) T ss_pred CCCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCHHHHHHH------HHHH----HHCCCCEEEEE--------E T ss_conf 6609788999389558999999999987998999979887899999------9999----85399489999--------4 Q ss_pred CCCCHHHHHHHHH---CCCCCEEEEEEECCCH Q ss_conf 0889899999997---7998414999948888 Q gi|254780664|r 91 IRDNPAHMKFMQD---HELESIDLVVVNLYPF 119 (536) Q Consensus 91 ~r~~~~~~~~l~~---~~i~~IDlVvvNLYPF 119 (536) +-.++++.+++-+ ....+||++|.|--++ T Consensus 65 Dl~~~~~~~~~~~~~~~~~g~iDilInnAg~~ 96 (250) T PRK12825 65 DVTDAAALEAAVEELVERFGAIDILVNNAGIT 96 (250) T ss_pred ECCCHHHHHHHHHHHHHHCCCCCEEEECCCCC T ss_conf 18999999999999999769998999899889 No 164 >PRK06701 short chain dehydrogenase; Provisional Probab=31.00 E-value=37 Score=14.42 Aligned_cols=42 Identities=19% Similarity=0.058 Sum_probs=34.1 Q ss_pred CCCCCCCCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHH Q ss_conf 778867761369899875456869999999987999999427 Q gi|254780664|r 7 KDGDHGEIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGG 48 (536) Q Consensus 7 ~~~~~~~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgG 48 (536) .-..+|+++=|.|||+=..+-==..+|+.|.+.|..+.-++. T Consensus 36 ~~~~~grL~GKvalVTGgs~GIG~aiA~~la~~GA~V~i~~~ 77 (289) T PRK06701 36 NYKGSGKLKGKVALITGGDSGIGRAVAVAFAKEGADIAIVYL 77 (289) T ss_pred CCCCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEEC T ss_conf 886688779998999682579999999999987998999828 No 165 >PRK09348 glyQ glycyl-tRNA synthetase subunit alpha; Validated Probab=30.98 E-value=20 Score=16.37 Aligned_cols=20 Identities=35% Similarity=0.683 Sum_probs=14.1 Q ss_pred CCCCCCCCCCCCCCCCEEEE Q ss_conf 23333436788100650576 Q gi|254780664|r 221 VKKQEMRYGENPHQKAALYS 240 (536) Q Consensus 221 ~~~~~LRYGENPHQ~Aa~Y~ 240 (536) .+-.+=||||||.---.+|. T Consensus 58 rRP~DgRYGeNPNRlq~y~Q 77 (291) T PRK09348 58 RRPTDGRYGENPNRLQHYYQ 77 (291) T ss_pred CCCCCCCCCCCCHHHHHEEE T ss_conf 78887754689124201055 No 166 >pfam05505 Ebola_NP Ebola nucleoprotein. This family consists of Ebola and Marburg virus nucleoproteins. These proteins are responsible for encapsidation of genomic RNA. It has been found that nucleoprotein DNA vaccines can offer protection from the virus. Probab=30.90 E-value=28 Score=15.37 Aligned_cols=58 Identities=17% Similarity=0.315 Sum_probs=45.0 Q ss_pred EEEEEEEEECCCCHHHHHHHHHH---C-----------------------CCEEEEEHHHHHHHHHCCCC--EEEHHHHC Q ss_conf 36989987545686999999998---7-----------------------99999942768999988995--17946534 Q gi|254780664|r 16 VKTALISVHNKTGVVEFASRLLS---R-----------------------GIKIISTGGTCQLLEEEGIP--VTSVFDIT 67 (536) Q Consensus 16 ikrALiSV~dKtgl~~la~~L~~---~-----------------------g~~iisTgGTa~~l~~~gi~--v~~Vs~~T 67 (536) +|.-.|+||+-++++++|+-.++ . .++.+-+.+..|||+.+|.. |++-..+| T Consensus 18 VRqrvi~vy~vnnle~iCqlIIqAfeaGvDfqd~adsfLLmLClhHaYqgDyk~FleS~avkYLeghGfrFev~kk~gv~ 97 (717) T pfam05505 18 VRQKVIPVYDVNNLESICQLIIQAFEAGIDFQDNADSFLLMLCLHHAYQGDYKLFLESPAVKYLEGHGFRFEVRKKDGVT 97 (717) T ss_pred HHCCEEEEEECCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHCCCHHHHHHCCCCEEEEEECCCHH T ss_conf 13104557862659999999999997267644312447999999888612188773130778875277368885145646 Q ss_pred CCCHHC Q ss_conf 881651 Q gi|254780664|r 68 KFPEIM 73 (536) Q Consensus 68 gfpEil 73 (536) .+.|+| T Consensus 98 Rleell 103 (717) T pfam05505 98 RLEELL 103 (717) T ss_pred HHHHHC T ss_conf 899856 No 167 >PRK13584 hisG ATP phosphoribosyltransferase catalytic subunit; Provisional Probab=30.68 E-value=28 Score=15.35 Aligned_cols=13 Identities=0% Similarity=0.158 Sum_probs=6.8 Q ss_pred HHHHHHHHHCCCC Q ss_conf 8999999977998 Q gi|254780664|r 95 PAHMKFMQDHELE 107 (536) Q Consensus 95 ~~~~~~l~~~~i~ 107 (536) ++|+.+.-++|.- T Consensus 52 ~~DIp~yV~~G~~ 64 (204) T PRK13584 52 GSDVPIYVEQGMA 64 (204) T ss_pred HHHHHHHHHCCCC T ss_conf 6888999974986 No 168 >TIGR02124 hypE hydrogenase expression/formation protein HypE; InterPro: IPR011854 This family contains HypE (or HupE), a protein required for expression of catalytically active hydrogenase in many systems. It appears to be an accessory protein involved in maturation rather than a regulatory protein involved in expression. HypE shows considerable homology to the thiamine-monophosphate kinase ThiL (IPR006283 from INTERPRO) and other enzymes.. Probab=30.59 E-value=24 Score=15.77 Aligned_cols=123 Identities=20% Similarity=0.240 Sum_probs=77.9 Q ss_pred HCCCCCCCCCCCCCCCHHHHHHHHCCCCC--CCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCC-- Q ss_conf 10589888312221200116776424556--72799961568656545201689999998533213200455542754-- Q gi|254780664|r 253 LVQGKPLSYNNINDLDAAFELVSEFRSQD--CAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREV-- 328 (536) Q Consensus 253 ql~GKeLSYNNllD~daA~~lv~ef~~~~--~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~v-- 328 (536) +=+|=++.-|=--|-.+=|.+|..+-+.. .|.+-.++=.+-=|+|.-=| +-|-++. =||..-=..+ T Consensus 188 ~Regl~f~~~i~SDCapL~~lv~~ll~~~g~~~~v~~~RD~TRGGLA~vLN-----E~A~~sg-----~~i~l~E~~iPV 257 (345) T TIGR02124 188 VREGLGFETNIESDCAPLNGLVEALLAVGGYEPAVHAMRDATRGGLAAVLN-----EIAESSG-----VGIELEEEKIPV 257 (345) T ss_pred HHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCHHHHHH-----HHHHHHC-----CCEEEEECCCCC T ss_conf 650255256733327765899999984379871423761678550478999-----9999619-----927998524788 Q ss_pred CHHHH--HHH---------HH-HHHHHHEECCCCHHHHHHHHCCC---CCE-EEECCCCCCCCCCCCEECCCCHHH Q ss_conf 21457--878---------86-54311112687989998851245---531-563167667776664020321022 Q gi|254780664|r 329 DQEVA--KEV---------IK-VFTEAIIAPTLSEEAADVLAKKP---SMR-FLKTSSLLDFHGEEIVLKTVSGGI 388 (536) Q Consensus 329 d~~~A--~~I---------~~-~F~EvIiAP~f~~eAleiL~~KK---nlR-il~~~~~~~~~~~~~~~rsi~GG~ 388 (536) -+++. -+| ++ .|+ +++.|.+.+++|++||+.+ +-+ |+- .-.+.......+++.+|+- T Consensus 258 ~eeV~gaCE~LGldPl~lANEG~~v-~~V~~E~A~~vLe~lk~hp~G~~A~YiIG--~V~e~~~~~V~l~t~~G~~ 330 (345) T TIGR02124 258 KEEVKGACELLGLDPLYLANEGKLV-LAVPPEAAEKVLEILKSHPLGKDAAYIIG--EVVEKKEGLVVLKTAYGGK 330 (345) T ss_pred CHHHHHHHHHHCCCHHHHHCCCEEE-EEECHHHHHHHHHHHHHCCCCCCCCEEEE--EEEECCCCEEEEEECCCCE T ss_conf 3789999986170325420476289-98283779999999860776433215630--1473798779997068840 No 169 >COG0462 PrsA Phosphoribosylpyrophosphate synthetase [Nucleotide transport and metabolism / Amino acid transport and metabolism] Probab=30.16 E-value=25 Score=15.75 Aligned_cols=18 Identities=22% Similarity=0.143 Sum_probs=8.5 Q ss_pred HHEECCCCHHHHHHHHCC Q ss_conf 111268798999885124 Q gi|254780664|r 343 AIIAPTLSEEAADVLAKK 360 (536) Q Consensus 343 vIiAP~f~~eAleiL~~K 360 (536) +..-|=|+..|.+.|++- T Consensus 248 ~~tH~vfs~~a~~~l~~~ 265 (314) T COG0462 248 AATHGVFSGAALERLEAS 265 (314) T ss_pred EEECHHHCHHHHHHHHCC T ss_conf 998366070899998648 No 170 >PRK07454 short chain dehydrogenase; Provisional Probab=30.07 E-value=39 Score=14.31 Aligned_cols=85 Identities=15% Similarity=0.108 Sum_probs=47.7 Q ss_pred CCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCC Q ss_conf 76136989987545686999999998799999942768999988995179465348816519855323715553553308 Q gi|254780664|r 13 EIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIR 92 (536) Q Consensus 13 ~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r 92 (536) +-++|.|||.=.-.-==...|+.|.+.|+.++-++-+...|++. .+++. -.++++..+.- +- T Consensus 3 ~~~mKvalITGas~GIG~a~A~~la~~G~~V~l~~R~~~~l~~~------~~e~~----~~g~~~~~~~~--------Dv 64 (241) T PRK07454 3 LNSMPTALITGASRGIGKATALAFAKAGWDLALVARSQDALEAL------AEELR----STGVKVAAYSI--------DL 64 (241) T ss_pred CCCCCEEEECCCCCHHHHHHHHHHHHCCCEEEEEECCHHHHHHH------HHHHH----HCCCCEEEEEE--------EC T ss_conf 89998899917587899999999998799899998999999999------99999----65992899995--------18 Q ss_pred CCHHHH----HHHHHCCCCCEEEEEEEC Q ss_conf 898999----999977998414999948 Q gi|254780664|r 93 DNPAHM----KFMQDHELESIDLVVVNL 116 (536) Q Consensus 93 ~~~~~~----~~l~~~~i~~IDlVvvNL 116 (536) .++++. +++.+ .+..||++|.|= T Consensus 65 t~~~~v~~~~~~~~~-~~G~iDiLVnNA 91 (241) T PRK07454 65 SNPEAIAPGIAELLE-QFGCPSVLINNA 91 (241) T ss_pred CCHHHHHHHHHHHHH-HCCCCCEEEECC T ss_conf 999999999999999-759988999889 No 171 >PRK13135 consensus Probab=29.97 E-value=39 Score=14.30 Aligned_cols=34 Identities=9% Similarity=0.180 Sum_probs=17.4 Q ss_pred HHHHHHHHHHEECCCCHHHHHHHH---CCCCCEEEEC Q ss_conf 788654311112687989998851---2455315631 Q gi|254780664|r 335 EVIKVFTEAIIAPTLSEEAADVLA---KKPSMRFLKT 368 (536) Q Consensus 335 ~I~~~F~EvIiAP~f~~eAleiL~---~KKnlRil~~ 368 (536) ...+.=++.+|.||.-.|-.+-+. ++.++-++.+ T Consensus 116 ~~~~~GvdGlIipDLP~ee~~~~~~~~~~~~l~~I~l 152 (267) T PRK13135 116 DAAAAGVDGVLLVDLPPEEAEEFKACADRHGLDVIFL 152 (267) T ss_pred HHHHCCCCEEECCCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 9997499747637899788899999998729618998 No 172 >PRK01686 hisG ATP phosphoribosyltransferase catalytic subunit; Reviewed Probab=29.85 E-value=29 Score=15.23 Aligned_cols=57 Identities=21% Similarity=0.131 Sum_probs=26.6 Q ss_pred HHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHH-HHHHHHCCCCHHHHHHHHHCCCCCEEEEEEEC Q ss_conf 7689999889951794653488165198553237155-53553308898999999977998414999948 Q gi|254780664|r 48 GTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKI-YGGILSIRDNPAHMKFMQDHELESIDLVVVNL 116 (536) Q Consensus 48 GTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I-~ggIL~~r~~~~~~~~l~~~~i~~IDlVvvNL 116 (536) .|.++|+++|+.+....+ ++|- |-+.. ..+|-..--+++|+...-++|.- |+=||-. T Consensus 16 ~~~~ll~~~G~~~~~~~~--------~~R~--l~~~~~~~~i~~~~~r~~DIp~~V~~G~~--DlGI~G~ 73 (212) T PRK01686 16 ETLPLLAAAGIDPSEDPD--------KSRK--LIFPTPDPDVRFLLVRATDVPTYVEHGAA--DLGIVGK 73 (212) T ss_pred HHHHHHHHCCCCCCCCCC--------CCCE--EEEECCCCCEEEEEECHHHHHHHHHCCCC--CEEEEEH T ss_conf 999999987997445788--------8741--78543899979999876888999977997--6886158 No 173 >CHL00200 trpA tryptophan synthase alpha subunit; Provisional Probab=29.85 E-value=39 Score=14.29 Aligned_cols=44 Identities=20% Similarity=0.193 Sum_probs=17.7 Q ss_pred CCCHHHHHHHHCCCCCCCEEEEECCCCCC--CCCCCCHHHHHHHHHHHH Q ss_conf 12001167764245567279996156865--654520168999999853 Q gi|254780664|r 266 DLDAAFELVSEFRSQDCAACVIVKHMNPC--GVATADTLVEAYRRALSC 312 (536) Q Consensus 266 D~daA~~lv~ef~~~~~Pa~vIvKH~NPC--GvA~~~~~~~Ay~~A~~~ 312 (536) |.+..++++..+.+. .|=+|-=.-|. -+|-|..+-+|..+|++. T Consensus 27 ~~e~s~~~~~~l~~~---GaDiiElGiPfSDP~ADGpvIq~A~~~AL~~ 72 (263) T CHL00200 27 DIVITKKALKILDKK---GADIIELGIPYSDPLADGPIIQEASNRALKQ 72 (263) T ss_pred CHHHHHHHHHHHHHC---CCCEEEECCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 878999999999976---9999997898888666589999999999977 No 174 >PRK00011 glyA serine hydroxymethyltransferase; Reviewed Probab=29.81 E-value=35 Score=14.59 Aligned_cols=16 Identities=44% Similarity=0.696 Sum_probs=9.5 Q ss_pred CEEEEECCCCCHHHHHHHHH Q ss_conf 04555427542145787886 Q gi|254780664|r 319 GGIIAFNREVDQEVAKEVIK 338 (536) Q Consensus 319 GGIIa~N~~vd~~~A~~I~~ 338 (536) ||+|.+|. +.++.|.+ T Consensus 235 GGiIl~~~----e~~~kId~ 250 (415) T PRK00011 235 GGLILTNE----ELAKKINS 250 (415) T ss_pred CEEEEECC----HHHHHHHC T ss_conf 60797340----66666541 No 175 >PRK13112 consensus Probab=29.59 E-value=39 Score=14.26 Aligned_cols=37 Identities=16% Similarity=0.252 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHEECCCCHHHHHHHH---CCCCCEEEEC Q ss_conf 578788654311112687989998851---2455315631 Q gi|254780664|r 332 VAKEVIKVFTEAIIAPTLSEEAADVLA---KKPSMRFLKT 368 (536) Q Consensus 332 ~A~~I~~~F~EvIiAP~f~~eAleiL~---~KKnlRil~~ 368 (536) .++...+.=++.+|-||.-.|.-+-+. ++.++.++.+ T Consensus 115 F~~~~~~aGvdGvIipDLP~eE~~~~~~~~~~~~i~~I~l 154 (279) T PRK13112 115 FLTDAKAAGVDGLIVVDLPPEMDAELCIPAMKAGINFIRL 154 (279) T ss_pred HHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 9999997399879846999788899999998578346998 No 176 >cd01310 TatD_DNAse TatD like proteins; E.coli TatD is a cytoplasmic protein, shown to have magnesium dependent DNase activity. Probab=29.58 E-value=39 Score=14.26 Aligned_cols=29 Identities=14% Similarity=-0.040 Sum_probs=15.9 Q ss_pred HHHHHCCCCCC-EEEECHHHHHHHHHHHHC Q ss_conf 99874015631-353089998988987530 Q gi|254780664|r 144 IRAAAKNHDYV-TILTNPQDYPLFLAEMDV 172 (536) Q Consensus 144 iRAAAKN~~~V-~Vi~dp~dY~~~~~el~~ 172 (536) -||.+.+-..+ ++=++|.++..+++-.+. T Consensus 22 ~~a~~~gv~~ii~~~~~~~~~~~~~~la~~ 51 (251) T cd01310 22 ARAREAGVIKIIVVGTDLKSSKRALELAKK 51 (251) T ss_pred HHHHHCCCCEEEEECCCHHHHHHHHHHHHH T ss_conf 999986998899968999999999999975 No 177 >PRK09291 short chain dehydrogenase; Provisional Probab=29.45 E-value=40 Score=14.24 Aligned_cols=120 Identities=14% Similarity=0.136 Sum_probs=58.1 Q ss_pred EEEEEEEECCCCH-HHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCH Q ss_conf 6989987545686-999999998799999942768999988995179465348816519855323715553553308898 Q gi|254780664|r 17 KTALISVHNKTGV-VEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNP 95 (536) Q Consensus 17 krALiSV~dKtgl-~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~ 95 (536) |++||.=.- +|| ..+|+.|.+.|+.++.++-+...+.+.- ++ .+-.++++....- +-.++ T Consensus 3 K~vLITGAs-sGIGraiA~~la~~G~~Vi~~~r~~~~l~~l~------~~----~~~~g~~~~~~~l--------Dv~~~ 63 (257) T PRK09291 3 KTILITGAG-SGFGREVALRLARKGHRVIAGVQIAPQVTELR------AE----AARRGLALRVEKL--------DLTDA 63 (257) T ss_pred CEEEEECCC-CHHHHHHHHHHHHCCCEEEEEECCHHHHHHHH------HH----HHHCCCCEEEEEC--------CCCCH T ss_conf 989996898-58999999999987998999968789999999------99----9852995599989--------88999 Q ss_pred HHHHHHHHCCCCCEEEEEEE-----CCCHHHHHCCCCCHHHHHHCC--CC-CCHHHHHHHHC-----CCCCCEEEECHHH Q ss_conf 99999997799841499994-----888899731765311232002--33-43469998740-----1563135308999 Q gi|254780664|r 96 AHMKFMQDHELESIDLVVVN-----LYPFEESFCREDDYYTMVENI--DI-GGPSMIRAAAK-----NHDYVTILTNPQD 162 (536) Q Consensus 96 ~~~~~l~~~~i~~IDlVvvN-----LYPF~~~v~~~~~~~~~IEnI--DI-GGpsmiRAAAK-----N~~~V~Vi~dp~d 162 (536) .+..... ...||++|.| ..||.++ +.++.-+.+ .+ |-.-|.|+..+ +.-++..+++-.- T Consensus 64 ~~~~~~~---~~~iDvLVNNAGi~~~g~i~e~-----~~~~~~~~~~vNv~g~~~ltq~~lp~M~~~~~G~IV~isS~ag 135 (257) T PRK09291 64 IDRARAA---EWDVDVLLNNAGIGEAGALVDI-----PVELVRELFETNVFGPLELTQGVVRKMVARGKGKIVFVSSIAG 135 (257) T ss_pred HHHHHHC---CCCCCEEEECCCCCCCCCHHHC-----CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHH T ss_conf 9999980---8999999989856899773449-----9999999999997999999999789998769968999878776 Q ss_pred H Q ss_conf 8 Q gi|254780664|r 163 Y 163 (536) Q Consensus 163 Y 163 (536) + T Consensus 136 ~ 136 (257) T PRK09291 136 L 136 (257) T ss_pred C T ss_conf 6 No 178 >PRK03244 argD acetylornithine aminotransferase; Provisional Probab=29.36 E-value=40 Score=14.23 Aligned_cols=17 Identities=12% Similarity=0.190 Sum_probs=7.8 Q ss_pred EECCCCCCHHHHHHHHH Q ss_conf 93898779889999998 Q gi|254780664|r 504 IQPGGSVRDSEAITVAD 520 (536) Q Consensus 504 iqPGGSirD~evI~aan 520 (536) +.|-=-|-++|+=+.++ T Consensus 370 ~~PPl~it~~eid~~~~ 386 (398) T PRK03244 370 LAPPLIITDAQVDAFVA 386 (398) T ss_pred EECCCCCCHHHHHHHHH T ss_conf 94995789999999999 No 179 >COG4079 Uncharacterized protein conserved in archaea [Function unknown] Probab=29.29 E-value=40 Score=14.22 Aligned_cols=40 Identities=23% Similarity=0.302 Sum_probs=34.8 Q ss_pred HHHHHHHHHCCCEEEEEHHHHHHHHH-CCCCEEEHHHHCCC Q ss_conf 99999999879999994276899998-89951794653488 Q gi|254780664|r 30 VEFASRLLSRGIKIISTGGTCQLLEE-EGIPVTSVFDITKF 69 (536) Q Consensus 30 ~~la~~L~~~g~~iisTgGTa~~l~~-~gi~v~~Vs~~Tgf 69 (536) +||++.+.++|++|.-|.-+.|.-+- .++-|-+||..+-+ T Consensus 47 EEL~r~aeel~Vki~vtDdr~KVrk~~d~VvvGEV~s~~~~ 87 (293) T COG4079 47 EELARKAEELGVKITVTDDRNKVRKRNDGVVVGEVSSVERG 87 (293) T ss_pred HHHHHHHHHCCCEEEEECCHHHHHCCCCCEEEEEEECCCCC T ss_conf 99999999769779997164765223686799875203565 No 180 >pfam01884 PcrB PcrB family. This family contains proteins that are related to PcrB. The function of these proteins is unknown. Probab=29.10 E-value=40 Score=14.20 Aligned_cols=50 Identities=24% Similarity=0.335 Sum_probs=44.1 Q ss_pred ECCCCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEECCC Q ss_conf 73668685558999971995999389877988999999875977997478 Q gi|254780664|r 482 SEAFYPFPDGIVEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAMVFTGI 531 (536) Q Consensus 482 SDAFFPF~D~ie~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m~fTg~ 531 (536) |-|+.|-|.-+..+.+.-....+-=||-||+.|....+=++|--++-||. T Consensus 163 SGa~~~vp~~vi~~~~~~~~~~LivGGGIrs~e~A~~~~~aGAD~IVvGn 212 (231) T pfam01884 163 SGAPGPVPEEVIAVKKVLDDARLIVGGGIKSGEKAKEMARAGADVIVTGN 212 (231) T ss_pred CCCCCCCCHHHHHHHHCCCCCCEEEECCCCCHHHHHHHHHCCCCEEEECC T ss_conf 99999989999999964689768996997999999999977999999797 No 181 >PTZ00086 40S ribosomal protein S16; Provisional Probab=29.01 E-value=28 Score=15.34 Aligned_cols=22 Identities=27% Similarity=0.285 Sum_probs=19.6 Q ss_pred ECCCCCCHHHHHHHHHHHHHHH Q ss_conf 7296888899999999998764 Q gi|254780664|r 444 IGSGQTSRVDSTRFAAIKAHNI 465 (536) Q Consensus 444 iGaGQ~sRVda~~iA~~KA~~~ 465 (536) -|.|.++..+|+++|+.||-.. T Consensus 74 ~GGG~sgQa~Air~aIaRALv~ 95 (147) T PTZ00086 74 RGGGQVAQIYAIRQAIAKGIVA 95 (147) T ss_pred ECCCHHHHHHHHHHHHHHHHHH T ss_conf 6787989999999999999998 No 182 >KOG4226 consensus Probab=28.99 E-value=15 Score=17.30 Aligned_cols=44 Identities=11% Similarity=0.116 Sum_probs=19.2 Q ss_pred HHHEE-CCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCH Q ss_conf 11112-6879899988512455315631676677766640203210 Q gi|254780664|r 342 EAIIA-PTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSG 386 (536) Q Consensus 342 EvIiA-P~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~G 386 (536) .||+| -.|+..--|-|+=+|.-|+-++.. ++..++.|.-|...| T Consensus 192 ~vVvaLYsFsssndeELsFeKGerleivd~-Pe~DPdWwkarn~~G 236 (379) T KOG4226 192 HVVVALYSFSSSNDEELSFEKGERLEIVDK-PENDPDWWKARNARG 236 (379) T ss_pred EEEEEEECCCCCCHHHCCCCCCCEEEECCC-CCCCCHHHHHCCCCC T ss_conf 999998314679722215445855574248-878806776404577 No 183 >PRK13222 phosphoglycolate phosphatase; Provisional Probab=28.96 E-value=40 Score=14.18 Aligned_cols=18 Identities=33% Similarity=0.239 Sum_probs=8.5 Q ss_pred HHHHHHHCCCCEEEECCC Q ss_conf 589999719959993898 Q gi|254780664|r 491 GIVEAIKAGVTAVIQPGG 508 (536) Q Consensus 491 ~ie~aa~~Gi~aIiqPGG 508 (536) .|+.|.++|+.+|.-.-| T Consensus 178 Di~aA~~aG~~~i~v~~G 195 (228) T PRK13222 178 DIQAAKAAGCPSVGVTYG 195 (228) T ss_pred HHHHHHHCCCEEEEECCC T ss_conf 999999969949998789 No 184 >PRK09072 short chain dehydrogenase; Provisional Probab=28.91 E-value=40 Score=14.18 Aligned_cols=123 Identities=15% Similarity=0.213 Sum_probs=62.9 Q ss_pred CCEEEEEEEEECCCCH-HHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCC Q ss_conf 6136989987545686-999999998799999942768999988995179465348816519855323715553553308 Q gi|254780664|r 14 IAVKTALISVHNKTGV-VEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIR 92 (536) Q Consensus 14 ~~ikrALiSV~dKtgl-~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r 92 (536) ++=|++||.=.-. || ..+|+.|.+.|+.++.++-+...|.+.- .|+.++++..+. ++- T Consensus 3 l~~K~vlITGass-GIG~a~A~~la~~G~~vil~~R~~~~L~~~~------------~~l~~~~~~~~~--------~Dl 61 (262) T PRK09072 3 LKDKRVLLTGASG-GIGEALAEALCAAGARLLLVGRNAEKLEALA------------ARPYPGRVRWVV--------ADL 61 (262) T ss_pred CCCCEEEEECCCC-HHHHHHHHHHHHCCCEEEEEECCHHHHHHHH------------HHHCCCCEEEEE--------EEC T ss_conf 8998899948623-9999999999987998999989899999999------------984589769999--------717 Q ss_pred CCHHHHHHHHH--CCCCCEEEEEEEC-----CCHHHHHCCCCCHHHHHHCCCC-CCHHHHHHHHC-----CCCCCEEEEC Q ss_conf 89899999997--7998414999948-----8889973176531123200233-43469998740-----1563135308 Q gi|254780664|r 93 DNPAHMKFMQD--HELESIDLVVVNL-----YPFEESFCREDDYYTMVENIDI-GGPSMIRAAAK-----NHDYVTILTN 159 (536) Q Consensus 93 ~~~~~~~~l~~--~~i~~IDlVvvNL-----YPF~~~v~~~~~~~~~IEnIDI-GGpsmiRAAAK-----N~~~V~Vi~d 159 (536) .++++.+.+.+ .....||++|-|= -||+++- ...++++++ +.. |--.+.|++.. +..++..+++ T Consensus 62 s~~~~~~~~~~~~~~~g~iDiLInNAG~~~~~~~~~~~--~~~~~~~~~-vN~~g~~~lt~~~lp~m~~~~~G~IvnisS 138 (262) T PRK09072 62 TSEAGREAVLARAREMGGINVLINNAGVNHFALLDQQD--PEAIDRLLA-LNLTAPMQLTRALLPLLRAQPSAAVVNVGS 138 (262) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHCC--HHHHHHHHH-HHHHHHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 99999999999999849998999899778898635499--999999999-956899999999999998769948999668 Q ss_pred H Q ss_conf 9 Q gi|254780664|r 160 P 160 (536) Q Consensus 160 p 160 (536) . T Consensus 139 ~ 139 (262) T PRK09072 139 T 139 (262) T ss_pred H T ss_conf 6 No 185 >TIGR00732 dprA DNA protecting protein DprA; InterPro: IPR003488 The SMF family (DNA processing chain A, dprA) are a group of bacterial proteins. In Helicobacter pylori, dprA is required for natural chromosomal and plasmid transformation .; GO: 0009294 DNA mediated transformation. Probab=28.84 E-value=40 Score=14.17 Aligned_cols=34 Identities=24% Similarity=0.416 Sum_probs=17.8 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEE Q ss_conf 0001246777868898889999998600566779999 Q gi|254780664|r 400 KELTVVTKRSPTDQELRDMKFAFKVVKHVKSNAVVYA 436 (536) Q Consensus 400 ~~~~vVT~~~pt~~e~~dL~FA~kv~K~vkSNAIv~a 436 (536) ..|=+|-.|.||..= ..-+-+.++..--|.++|+ T Consensus 60 ~~~aiVGTR~pT~~g---~~~~~~l~~~la~~G~~Iv 93 (238) T TIGR00732 60 RKVAIVGTRRPTKYG---ERWTRKLSKELAKNGVTIV 93 (238) T ss_pred EEEEEEECCCCCHHH---HHHHHHHHHHHHHCCCEEE T ss_conf 057677437655447---9999999999865794898 No 186 >TIGR01584 citF citrate lyase, alpha subunit; InterPro: IPR006472 These sequences, from both Gram-positive and Gram-negative bacteria, represent the alpha subunit of the holoenzyme citrate lyase composed of alpha (2.8.3.10 from EC), beta, and acyl carrier protein subunits in a stoichiometric relationship of 6:6:6. Citrate lyase is an enzyme which converts citrate to oxaloacetate. In bacteria, this reaction is involved in citrate fermentation. The alpha subunit catalyzes the reaction Acetyl-CoA + citrate = acetate + (3S)-citryl-CoA. The protein from Lactococcus lactis subsp. lactis has been experimentally characterised . ; GO: 0008814 citrate CoA-transferase activity, 0006084 acetyl-CoA metabolic process, 0005737 cytoplasm, 0009346 citrate lyase complex. Probab=28.83 E-value=22 Score=16.16 Aligned_cols=25 Identities=28% Similarity=0.490 Sum_probs=12.2 Q ss_pred HHHHHHHHCCCEEEEEHHHHHHHHH Q ss_conf 9999999879999994276899998 Q gi|254780664|r 31 EFASRLLSRGIKIISTGGTCQLLEE 55 (536) Q Consensus 31 ~la~~L~~~g~~iisTgGTa~~l~~ 55 (536) +..|++-+.=|-|=|-||=|+.++. T Consensus 113 ~iS~G~Lk~PViirSHGGRarave~ 137 (496) T TIGR01584 113 EISKGILKKPVIIRSHGGRARAVES 137 (496) T ss_pred HHHHHHHCCCEEEEECCCEEEEEEC T ss_conf 8736440388478607872778824 No 187 >KOG4627 consensus Probab=28.72 E-value=41 Score=14.15 Aligned_cols=51 Identities=18% Similarity=0.466 Sum_probs=29.9 Q ss_pred CCCCCCCCCCCCCCCCCEEEECCCCCCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHH Q ss_conf 123333436788100650576047766821012105898883122212001167764 Q gi|254780664|r 220 AVKKQEMRYGENPHQKAALYSTPEKKSGIAHAVLVQGKPLSYNNINDLDAAFELVSE 276 (536) Q Consensus 220 ~~~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~GKeLSYNNllD~daA~~lv~e 276 (536) ..+...||||||--|+--.+++.... .+ |.-+|| +|=-.-|.--.+..|.- T Consensus 42 i~r~e~l~Yg~~g~q~VDIwg~~~~~-kl--fIfIHG---GYW~~g~rk~clsiv~~ 92 (270) T KOG4627 42 IIRVEHLRYGEGGRQLVDIWGSTNQA-KL--FIFIHG---GYWQEGDRKMCLSIVGP 92 (270) T ss_pred CCCHHCCCCCCCCCEEEEEECCCCCC-CE--EEEEEC---CHHHCCCHHCCCCHHHH T ss_conf 66211025578876589876577776-08--999936---61330752103020115 No 188 >KOG0673 consensus Probab=28.67 E-value=35 Score=14.61 Aligned_cols=32 Identities=22% Similarity=0.361 Sum_probs=16.9 Q ss_pred HHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHH Q ss_conf 99998799999942768999988995179465 Q gi|254780664|r 34 SRLLSRGIKIISTGGTCQLLEEEGIPVTSVFD 65 (536) Q Consensus 34 ~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~ 65 (536) +.|.+.|+.|+-+-|...||...|+..+++-| T Consensus 79 k~l~~kgv~IWd~ngsrefLds~G~~~re~GD 110 (293) T KOG0673 79 KVLEEKGVHIWDGNGSREFLDSVGFTAREEGD 110 (293) T ss_pred CHHHHCCCEEECCCCHHHHHHHCCCCCCCCCC T ss_conf 13654584651487618889853987000377 No 189 >PRK00087 4-hydroxy-3-methylbut-2-enyl diphosphate reductase/S1 RNA-binding domain protein; Reviewed Probab=28.58 E-value=37 Score=14.47 Aligned_cols=47 Identities=26% Similarity=0.162 Sum_probs=32.3 Q ss_pred ECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHH Q ss_conf 615686565452016899999985332132004555427542145787 Q gi|254780664|r 288 VKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKE 335 (536) Q Consensus 288 vKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~ 335 (536) .-|++=|- |+.+-+.+|.+.|-.+|-+=--||-=.-|..-=.++|+. T Consensus 184 ~v~nTIC~-AT~~RQ~a~~~LA~~vD~miVVGG~nSSNT~rL~eia~~ 230 (670) T PRK00087 184 KVFNTICN-ATAVRQEAAEKLAKKVDVMIVVGGKNSSNTTKLYEICKS 230 (670) T ss_pred CCCCCCCH-HHHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHHH T ss_conf 67898544-268899999999865999999899897259999999997 No 190 >TIGR00568 alkb alkylated DNA repair protein AlkB; InterPro: IPR004574 Proteins repair alkylation damage to DNA. The Escherichia coli alkB gene product protects against cell killing by S(N)2-alkylating agents through DNA repair by a novel direct reversal DNA repair mechanism: the oxidative demethylation of N1-methyladenine or N3-methylcytosine DNA lesions. This reaction occurs on both single- and double-stranded DNA, and requires AlkB-bound non-heme Fe(2+), O(2) and alpha-ketogluterate to oxidize the offending methyl group. This is followed by the release of succinate, CO(2) and formaldehyde, and the restoration of undamaged A or C in DNA .; GO: 0006281 DNA repair. Probab=28.57 E-value=21 Score=16.18 Aligned_cols=90 Identities=20% Similarity=0.307 Sum_probs=58.2 Q ss_pred CHHHHHHH--HHHHHHHH--CCCCCEEEEE--ECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECC Q ss_conf 68898889--99999860--0566779999--689389972968888999999999987642001222234870999736 Q gi|254780664|r 411 TDQELRDM--KFAFKVVK--HVKSNAVVYA--KDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSVIASEA 484 (536) Q Consensus 411 t~~e~~dL--~FA~kv~K--~vkSNAIv~a--k~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSDA 484 (536) =++.+-|| .-+|.++- +.|-.|=++= .-|.++|+=. .|- + .|- T Consensus 101 ~P~~L~~L~~~~v~~~~gf~~~~aeA~lvN~Y~~g~~Ls~H~-------------D~~----E--------------~D~ 149 (199) T TIGR00568 101 FPKDLGDLVEKRVATALGFPDFKAEACLVNFYAPGATLSLHQ-------------DRD----E--------------EDL 149 (199) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCEEEECCCCCCCCCCCCC-------------CCC----C--------------CCC T ss_conf 556788899999998728866782304540347887002310-------------376----5--------------566 Q ss_pred CCCCCCHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEEEECCC-CCCCC Q ss_conf 68685558999971995999389877988999999875977997478-54479 Q gi|254780664|r 485 FYPFPDGIVEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAMVFTGI-RHFRH 536 (536) Q Consensus 485 FFPF~D~ie~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m~fTg~-RhF~H 536 (536) =+|. +-...|-+||-|=||--|++.--.--=+.|=.|+++|. |.|-| T Consensus 150 ~~PL-----~S~SlG~~AiF~~Gg~~R~e~~~~L~L~sGDv~~~~G~SRl~fH 197 (199) T TIGR00568 150 RLPL-----LSLSLGLSAIFLIGGLKRDEPPKALRLHSGDVVIMSGESRLAFH 197 (199) T ss_pred CCCE-----EEEECCCHHHHHHCCCCCCCCCCHHHHHCCCEEEECCCHHHHHC T ss_conf 8875-----66653611454407745565561225424987885551224213 No 191 >TIGR02326 transamin_PhnW 2-aminoethylphosphonate--pyruvate transaminase; InterPro: IPR012703 Phosphonates are a class of organophosphorus compounds, characterised by a stable C-P bond, which are found in a variety of biologically produced molecules including antiobiotics, lipids, proteins and polysaccharides . The functions of these molecules include phosphorus storage, cell communication, host recognition and chemical warfare. 2-Aminoethylphosphonate (AEP), the most common naturally occurring phosphonate, is an important precursor used in the biosynthesis of phosphonolipids, phosphonoproteins, and phosphonoglycans. This entry represents 2-aminoethylphosphonate-pyruvate transaminase (AEPT)(2.6.1.37 from EC) which catalyses the interconversion of AEP and phosphonoacetaldehyde (P-Ald), coupled with the interconversion of pryuvate and L-alanine. In some bacterial species this is the first step in an AEP degradation pathway which allows them to utilise this compound as a source of carbon, nitrogen and phosphorus. Phosphonoacetaldeyhde hydrolase, often encoded by an adjacent gene, then converts P-Ald to acetaldehyde and phosphate.Species with this pathway generally have an identified phosphonate ABC transporter but do not also have the multisubunit C-P lyase complex as found in Escherichia coli. The crystal structure of AEPT from Salmonella typhimurium has been studied to 2.2 A resolution . The protein is a homodimer where each subunit is composed of two domains, large and small. The large domain forms an alpha-beta complex characteristic of the aspartate transaminase family, while the small domain is comosed of highly twisted beta strands. The active site contains a pyridoxal 5'-phosphate cofactor and is found at the domain interface, being composed of residues from both subunits.. Probab=28.50 E-value=17 Score=16.85 Aligned_cols=127 Identities=21% Similarity=0.290 Sum_probs=67.8 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHH---HHHHH--CCCCHH-H-HHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCC Q ss_conf 66678899999999987422899998---75331--012113-4-43210123333436788100650576047766821 Q gi|254780664|r 177 IPYNFRKKMARQAFSRTASYDTAICR---WLANA--ESENFP-D-YLNITAVKKQEMRYGENPHQKAALYSTPEKKSGIA 249 (536) Q Consensus 177 ~~~~~R~~lA~kAF~~ta~YD~~Ia~---~l~~~--~~~~~p-~-~~~~~~~~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~ 249 (536) +...-|++|-.-|=.+-..|-+..-+ -|+-. .+.-.| . -..+ -..+==||.-=-|-|.+|..+. T Consensus 37 ~~~~ir~~L~~la~~~~~~YT~VllQGSGtf~VEA~iGs~~pr~~~klL---v~~NGAYG~R~~~ia~~l~i~~------ 107 (366) T TIGR02326 37 VVESIRQKLLKLATAEEEKYTTVLLQGSGTFAVEAVIGSAVPRKDGKLL---VVINGAYGKRIAQIAEYLQIPH------ 107 (366) T ss_pred HHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHHHCCCCCCCCEEE---EEECCCHHHHHHHHHHHHCCCE------ T ss_conf 7999999999986068985467861275125556654112478897389---9717853478999999847861------ Q ss_pred CHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCC-CHHHHH-------HHHHHHHCCCCCCCEE Q ss_conf 01210589888312221200116776424556727999615686565452-016899-------9999853321320045 Q gi|254780664|r 250 HAVLVQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATA-DTLVEA-------YRRALSCDPISAFGGI 321 (536) Q Consensus 250 ~~~ql~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~-~~~~~A-------y~~A~~~DP~SAFGGI 321 (536) ..++-.|-+--|+.++++.++-=.+. .=+|+| | |=-.+| =|+.+| |=+..=-|-+|+|||| T Consensus 108 --~~~~~~E~~~~~~~~ve~~L~~dp~I-----tH~A~v-H---cETtTGiLNPi~~~~~~~~~hGk~~ivDAMSSfGgi 176 (366) T TIGR02326 108 --HVVETEENEPTNVVEVEAILAADPAI-----THVALV-H---CETTTGILNPIEAVAKVAKRHGKVTIVDAMSSFGGI 176 (366) T ss_pred --EEEECCCCCCCCHHHHHHHHHCCCCC-----EEEEEE-E---CCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCCCCC T ss_conf --67515877777889999998518971-----278898-3---675266211289999999855977998534113760 Q ss_pred EE Q ss_conf 55 Q gi|254780664|r 322 IA 323 (536) Q Consensus 322 Ia 323 (536) =. T Consensus 177 ~i 178 (366) T TIGR02326 177 PI 178 (366) T ss_pred HH T ss_conf 01 No 192 >COG1976 TIF6 Translation initiation factor 6 (eIF-6) [Translation, ribosomal structure and biogenesis] Probab=28.43 E-value=41 Score=14.12 Aligned_cols=34 Identities=29% Similarity=0.485 Sum_probs=28.8 Q ss_pred CCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCC Q ss_conf 321320045554275421457878865431111268798999885124 Q gi|254780664|r 313 DPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKK 360 (536) Q Consensus 313 DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~K 360 (536) ++++|+|-+|.+|... .++-|+|+++|++....- T Consensus 96 ~k~nAlGN~Il~ND~~--------------Alvhp~l~~~a~k~I~d~ 129 (222) T COG1976 96 TKLNALGNLILANDKG--------------ALVHPDLSDEAEKEIEDV 129 (222) T ss_pred CCCCCCCCEEEECCCC--------------EEECCCCCHHHHHHHHHH T ss_conf 5010115589965830--------------386676487799998764 No 193 >KOG0923 consensus Probab=28.34 E-value=41 Score=14.11 Aligned_cols=36 Identities=22% Similarity=0.372 Sum_probs=18.3 Q ss_pred HHCCCCCCHHHHHHHHCCCC-----CCEEEECHHHHHHHHHHH Q ss_conf 20023343469998740156-----313530899989889875 Q gi|254780664|r 133 VENIDIGGPSMIRAAAKNHD-----YVTILTNPQDYPLFLAEM 170 (536) Q Consensus 133 IEnIDIGGpsmiRAAAKN~~-----~V~Vi~dp~dY~~~~~el 170 (536) -|.--+ +-+|++..||+-+ +-.|.+|-. --.|+.+. T Consensus 194 WEd~ql-~~a~~~~ga~~~~~~~~~ye~vfdd~~-~idF~~e~ 234 (902) T KOG0923 194 WEDHQL-GNAMLKFGAKDAKEGSDGYEFVFDDQI-VIDFIQES 234 (902) T ss_pred HHHHHH-HHHHHHCCCCCCCCCCCCCEEECCCHH-HHHHHHHH T ss_conf 899986-655764278757555667501325315-56377764 No 194 >COG0112 GlyA Glycine/serine hydroxymethyltransferase [Amino acid transport and metabolism] Probab=28.32 E-value=35 Score=14.64 Aligned_cols=20 Identities=40% Similarity=0.547 Sum_probs=15.3 Q ss_pred CCCEEEEECCCCCHHHHHHHHHH Q ss_conf 20045554275421457878865 Q gi|254780664|r 317 AFGGIIAFNREVDQEVAKEVIKV 339 (536) Q Consensus 317 AFGGIIa~N~~vd~~~A~~I~~~ 339 (536) .=||+|-+|. +++++.|.+- T Consensus 234 PrGG~Il~~~---eel~kkin~a 253 (413) T COG0112 234 PRGGIILTND---EELAKKINSA 253 (413) T ss_pred CCCEEEEECC---HHHHHHHHHH T ss_conf 9841798362---9999876400 No 195 >PRK08267 short chain dehydrogenase; Provisional Probab=28.30 E-value=41 Score=14.10 Aligned_cols=80 Identities=15% Similarity=0.048 Sum_probs=45.4 Q ss_pred EEEEEEEEECCCCH-HHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCC Q ss_conf 36989987545686-99999999879999994276899998899517946534881651985532371555355330889 Q gi|254780664|r 16 VKTALISVHNKTGV-VEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDN 94 (536) Q Consensus 16 ikrALiSV~dKtgl-~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~ 94 (536) ||++||.=.-. || ..+|+.|.+.|+.++-++-....|++.- +++. ++++.+.+- +-.+ T Consensus 1 MK~vlITGass-GIG~a~A~~~a~~G~~V~~~~r~~~~l~~~~------~~l~------~~~~~~~~~--------Dvtd 59 (258) T PRK08267 1 MKSIFITGAAS-GIGRATARLFAARGWRVGAYDINEDGLAALA------AELG------AERAWTGAL--------DVTD 59 (258) T ss_pred CCEEEEECCCC-HHHHHHHHHHHHCCCEEEEEECCHHHHHHHH------HHHC------CCCEEEEEE--------CCCC T ss_conf 99899907226-8999999999987999999988899999999------9836------996799991--------1799 Q ss_pred HHHH----HHHHHCCCCCEEEEEEEC Q ss_conf 8999----999977998414999948 Q gi|254780664|r 95 PAHM----KFMQDHELESIDLVVVNL 116 (536) Q Consensus 95 ~~~~----~~l~~~~i~~IDlVvvNL 116 (536) +++. ++..+.....||++|.|= T Consensus 60 ~~~v~~~~~~~~~~~~G~iDiLVNNA 85 (258) T PRK08267 60 RAAWDAALADFCAATGGRLDVLFNNA 85 (258) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEECC T ss_conf 99999999999999589986899888 No 196 >PRK05819 deoD purine nucleoside phosphorylase; Reviewed Probab=28.14 E-value=42 Score=14.08 Aligned_cols=29 Identities=10% Similarity=0.094 Sum_probs=19.5 Q ss_pred HHHHHHHHHHHCCCCCEEEEEECCEEEEE Q ss_conf 88999999860056677999968938997 Q gi|254780664|r 416 RDMKFAFKVVKHVKSNAVVYAKDGRTVGI 444 (536) Q Consensus 416 ~dL~FA~kv~K~vkSNAIv~ak~~~tiGi 444 (536) ...+|+.--.+..|.-+|..+-+...-|= T Consensus 182 taaLf~la~l~G~ka~sv~~V~dnl~t~e 210 (235) T PRK05819 182 AAALYGLAAEYGAKALTILTVSDHIRTGE 210 (235) T ss_pred HHHHHHHHHHCCCCEEEEEEEEEEECCCC T ss_conf 89999999981986799999963305688 No 197 >PRK00035 hemH ferrochelatase; Reviewed Probab=27.87 E-value=42 Score=14.05 Aligned_cols=15 Identities=13% Similarity=0.483 Sum_probs=8.7 Q ss_pred HHEECCCCHHHHHHH Q ss_conf 111268798999885 Q gi|254780664|r 343 AIIAPTLSEEAADVL 357 (536) Q Consensus 343 vIiAP~f~~eAleiL 357 (536) +|+.|+|..|-+|.| T Consensus 262 ~v~p~gFvsD~lETl 276 (325) T PRK00035 262 VVVPPGFVSDHLETL 276 (325) T ss_pred EEECCCCCCCCHHHH T ss_conf 998982045344659 No 198 >PRK06209 glutamate-1-semialdehyde 2,1-aminomutase; Provisional Probab=27.36 E-value=43 Score=13.99 Aligned_cols=28 Identities=36% Similarity=0.743 Sum_probs=12.7 Q ss_pred CCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEEC Q ss_conf 988831222120011677642455672799961 Q gi|254780664|r 257 KPLSYNNINDLDAAFELVSEFRSQDCAACVIVK 289 (536) Q Consensus 257 KeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvK 289 (536) ....||++-++. +++.+.. +..||+|+- T Consensus 167 ~~~~y~d~~~le---~~~~~~~--~~iAAvIvE 194 (431) T PRK06209 167 KTFRYNDIASLE---ALFEDYP--GRIACVILE 194 (431) T ss_pred CCCCCCCHHHHH---HHHHHCC--CCEEEEEEE T ss_conf 564765599999---9876455--666788850 No 199 >pfam08727 P3A Poliovirus 3A protein like. This domain is found in positive-strand RNA viruses. The 3A protein is a critical component of the poliovirus replication complex, and is also an inhibitor of host cell ER to Golgi transport. Probab=27.22 E-value=35 Score=14.64 Aligned_cols=15 Identities=27% Similarity=0.406 Sum_probs=10.0 Q ss_pred CCCHHHHHHHHHHCC Q ss_conf 779889999998759 Q gi|254780664|r 509 SVRDSEAITVADQHG 523 (536) Q Consensus 509 SirD~evI~aan~~g 523 (536) |++-+||+++|.+.| T Consensus 26 SVds~EVr~YC~~kg 40 (57) T pfam08727 26 SVDSPEVREYCEKKG 40 (57) T ss_pred HCCCHHHHHHHHHCC T ss_conf 658799999999779 No 200 >COG0159 TrpA Tryptophan synthase alpha chain [Amino acid transport and metabolism] Probab=27.20 E-value=43 Score=13.97 Aligned_cols=12 Identities=8% Similarity=0.266 Sum_probs=5.1 Q ss_pred CCHHHHHHHHCC Q ss_conf 200116776424 Q gi|254780664|r 267 LDAAFELVSEFR 278 (536) Q Consensus 267 ~daA~~lv~ef~ 278 (536) +...++-++++. T Consensus 193 ~~~~v~~vr~~~ 204 (265) T COG0159 193 VKELVKRVRKYT 204 (265) T ss_pred HHHHHHHHHHHC T ss_conf 999999999744 No 201 >TIGR00977 LeuA_rel 2-isopropylmalate synthase/homocitrate synthase family protein; InterPro: IPR005675 This model represents largely uncharacterised proteins related to 2-isopropylmalate synthases and homocitrate synthases but phylogenetically distinct. ; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=27.20 E-value=43 Score=13.97 Aligned_cols=84 Identities=18% Similarity=0.266 Sum_probs=41.9 Q ss_pred CCCHHHHH---HHHHCCCCCEEEEEEECCCHHHHHCCCC--CHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHH Q ss_conf 88989999---9997799841499994888899731765--311232002334346999874015631353089998988 Q gi|254780664|r 92 RDNPAHMK---FMQDHELESIDLVVVNLYPFEESFCRED--DYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLF 166 (536) Q Consensus 92 r~~~~~~~---~l~~~~i~~IDlVvvNLYPF~~~v~~~~--~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~ 166 (536) --||+|.. .+++.+++...++ -|..|-.-.. ..+..| -|||.|.. +.||+---.=|- -| T Consensus 46 gaNPKD~~FF~~~~~~~~~~~~~~-----AF~STRRP~~~v~~D~~~-------~~l~~A~T---~~~TIFGKSWDL-HV 109 (543) T TIGR00977 46 GANPKDVAFFEKVKEMNFKNAKIV-----AFSSTRRPDKKVEEDKQV-------QALIKAET---PVVTIFGKSWDL-HV 109 (543) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEE-----EEECCCCCCCCCCCCHHH-------HHHHHCCC---EEEEEECCCHHH-HH T ss_conf 987256899999852771100466-----752367886332334567-------89872698---169985473578-88 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 987530234566678899999999987422 Q gi|254780664|r 167 LAEMDVNNGKIPYNFRKKMARQAFSRTASY 196 (536) Q Consensus 167 ~~el~~~~g~~~~~~R~~lA~kAF~~ta~Y 196 (536) .+.|+ ++|+...+|-...-++.-.| T Consensus 110 ~~aL~-----T~L~ENL~MI~dTv~YLk~~ 134 (543) T TIGR00977 110 LEALK-----TTLEENLKMIYDTVAYLKRY 134 (543) T ss_pred HHHHH-----HHHHHHHHHHHHHHHHHHHH T ss_conf 88776-----42999999999999999872 No 202 >PRK13324 pantothenate kinase; Reviewed Probab=27.13 E-value=43 Score=13.96 Aligned_cols=14 Identities=14% Similarity=0.242 Sum_probs=6.0 Q ss_pred HHHHHHEECCCCHH Q ss_conf 54311112687989 Q gi|254780664|r 339 VFTEAIIAPTLSEE 352 (536) Q Consensus 339 ~F~EvIiAP~f~~e 352 (536) .|.=++|+|+..-- T Consensus 145 ~~lGG~I~PGl~l~ 158 (258) T PRK13324 145 KYLSGSIMPGVKLS 158 (258) T ss_pred CEEEEEECCCHHHH T ss_conf 68899988459999 No 203 >TIGR00109 hemH ferrochelatase; InterPro: IPR001015 Synonym(s): Protoheme ferro-lyase, Iron chelatase, etc. Ferrochelatase catalyzes the last step in heme biosynthesis: the chelation of a ferrous ion to proto-porphyrin IX, to form protoheme , . In eukaryotic cells, it binds to the mitochondrial inner membrane with its active site on the matrix side of the membrane. The X-ray structure of Bacillus subtilis and human ferrochelatase have been solved , . The human enzyme exists as a homodimer. Each subunit contains one [Fe2S2] cluster. The monomer is folded into two similar domains, each with a four-stranded parallel beta-sheet flanked by an alpha-helix in a beta-alpha-beta motif that is reminiscent of the fold found in the periplasmic binding proteins. The topological similarity between the domains suggests that they have arisen from a gene duplication event. However, significant differences exist between the two domains, including an N-terminal section (residues 80-130) that forms part of the active site pocket, and a C-terminal extension (residues 390-423) that is involved in coordination of the [Fe2S2]cluster and in stabilisation of the homodimer. The [Fe2S2] cluster ligands are Cys196, Cys403, Cys406 and Cys411. The experiments with Co(II) binding show that His230 and Asp383 are part of the enzyme active site . Ferrochelatase seems to have a structurally conserved core region that is common to the enzyme from bacteria, plants and mammals. Porphyrin binds in the identified cleft; this cleft also includes the metal-binding site of the enzyme. It is likely that the structure of the cleft region will have different conformations upon substrate binding and release .; GO: 0004325 ferrochelatase activity, 0006783 heme biosynthetic process. Probab=27.12 E-value=43 Score=13.96 Aligned_cols=17 Identities=6% Similarity=0.253 Sum_probs=13.6 Q ss_pred HHEECCCCHHHHHHHHC Q ss_conf 11126879899988512 Q gi|254780664|r 343 AIIAPTLSEEAADVLAK 359 (536) Q Consensus 343 vIiAP~f~~eAleiL~~ 359 (536) +||.|+|+-|.+|.|.. T Consensus 281 ~~~P~gF~~Dh~ETL~E 297 (339) T TIGR00109 281 VVVPIGFTADHLETLYE 297 (339) T ss_pred EEECCCCHHHHHHHHHH T ss_conf 99886301245788887 No 204 >TIGR00519 asnASE_I L-asparaginases, type I; InterPro: IPR006033 Two related families of asparaginase are designated type I and type II according to the terminology in Escherichia coli, which has both: L-asparaginase I is a low-affinity enzyme found in the cytoplasm, while L-asparaginase II is a high-affinity secreted enzyme synthesized with a cleavable signal sequence. This family includes L-asparaginases related to type I of E. coli. Archaeal putative asparaginases are of this type but contain an extra ~80 residues in a conserved N-terminal region. These archaeal homologs are included in this family. ; GO: 0004067 asparaginase activity, 0006520 amino acid metabolic process. Probab=27.01 E-value=22 Score=16.03 Aligned_cols=25 Identities=16% Similarity=0.130 Sum_probs=15.0 Q ss_pred HHEECCCCHHHHHHHHCCC-CCEEEE Q ss_conf 1112687989998851245-531563 Q gi|254780664|r 343 AIIAPTLSEEAADVLAKKP-SMRFLK 367 (536) Q Consensus 343 vIiAP~f~~eAleiL~~KK-nlRil~ 367 (536) +=+=|+.++|-++.+..|. ..-+|+ T Consensus 225 ik~yPGl~~di~~~~~~~~YkGiVie 250 (347) T TIGR00519 225 IKLYPGLSPDIIRAYLSKGYKGIVIE 250 (347) T ss_pred EEECCCCCHHHHHHHCCCCCEEEEEE T ss_conf 98668988899998626993489982 No 205 >pfam00380 Ribosomal_S9 Ribosomal protein S9/S16. This family includes small ribosomal subunit S9 from prokaryotes and S16 from eukaryotes. Probab=26.96 E-value=27 Score=15.45 Aligned_cols=49 Identities=12% Similarity=0.079 Sum_probs=29.1 Q ss_pred CHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 6889888999999860056677999968938997296888899999999998764 Q gi|254780664|r 411 TDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNI 465 (536) Q Consensus 411 t~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~ 465 (536) ++.....+..-..++..... .|-...=-|.|.++..+|+++|+.||-.. T Consensus 32 ~~~~r~~v~~Pl~~~~~~~~------~Di~i~V~GGG~sgQa~AirlaIaRaLv~ 80 (121) T pfam00380 32 NETLRMKILEPLELTGTLGK------FDIVVTVKGGGISGQAGAIRLAIARALVA 80 (121) T ss_pred CHHHHHHHHHHHHHHHHCCC------CCEEEEEECCCHHHHHHHHHHHHHHHHHH T ss_conf 89999999979998600466------42699996888858999999999999999 No 206 >PRK13580 serine hydroxymethyltransferase; Provisional Probab=26.96 E-value=40 Score=14.17 Aligned_cols=26 Identities=15% Similarity=0.333 Sum_probs=12.3 Q ss_pred CCCCCCCCHHHHHHH-HHHHHHHHCCC Q ss_conf 246777868898889-99999860056 Q gi|254780664|r 404 VVTKRSPTDQELRDM-KFAFKVVKHVK 429 (536) Q Consensus 404 vVT~~~pt~~e~~dL-~FA~kv~K~vk 429 (536) -+|.|--.++|+... .|--++++..+ T Consensus 423 a~TtRGm~e~em~~ia~~I~~~l~~~~ 449 (493) T PRK13580 423 ALTTLGMGSDEMDEVAELIVKVLRNTT 449 (493) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 676189998999999999999995524 No 207 >PRK10425 DNase TatD; Provisional Probab=26.78 E-value=44 Score=13.92 Aligned_cols=32 Identities=19% Similarity=-0.010 Sum_probs=19.5 Q ss_pred HHHHHCCCCCCEEE-ECHHHHHHHHHHHHCCCC Q ss_conf 99874015631353-089998988987530234 Q gi|254780664|r 144 IRAAAKNHDYVTIL-TNPQDYPLFLAEMDVNNG 175 (536) Q Consensus 144 iRAAAKN~~~V~Vi-~dp~dY~~~~~el~~~~g 175 (536) =||.+.+-..+.++ +||++....++--+...+ T Consensus 22 ~rA~~~gV~~ii~~g~~~~~~~~~~~la~~~~~ 54 (258) T PRK10425 22 ARAFAAGVNGLLITGTNLRESQQAQKLARQYPS 54 (258) T ss_pred HHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCC T ss_conf 999987999899946999999999999986887 No 208 >PRK11864 2-ketoisovalerate ferredoxin oxidoreductase subunit beta; Provisional Probab=26.77 E-value=29 Score=15.17 Aligned_cols=29 Identities=31% Similarity=0.472 Sum_probs=25.7 Q ss_pred CCCCCHHHHHHHHCCCCCCEEEECHHHHH Q ss_conf 23343469998740156313530899989 Q gi|254780664|r 136 IDIGGPSMIRAAAKNHDYVTILTNPQDYP 164 (536) Q Consensus 136 IDIGGpsmiRAAAKN~~~V~Vi~dp~dY~ 164 (536) -|||+-.++.|+..|+.-+.++.|-.-|. T Consensus 105 ~dIG~~~l~ha~~Rn~~i~~iv~DNe~Y~ 133 (300) T PRK11864 105 ADIGFQALSGAAERGHDILYIMYDNEAYM 133 (300) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCCEEE T ss_conf 54006878999982898599998571310 No 209 >KOG0525 consensus Probab=26.70 E-value=44 Score=13.91 Aligned_cols=86 Identities=20% Similarity=0.309 Sum_probs=40.2 Q ss_pred EEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCH----HHHH--CCC Q ss_conf 4555427542145787886543111126879899988512455315631676677766640203210----2231--012 Q gi|254780664|r 320 GIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSG----GILV--QTR 393 (536) Q Consensus 320 GIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~G----G~Lv--Q~~ 393 (536) ||+.|---+-..-|..|.++-+---|-|.|++=.-|--| -|+-. ...++...+.+|+-+| |-|. |++ T Consensus 98 givgfgig~aa~g~~aiaeiqfadyifpafdqivneaak----fryrs---gnqfncg~ltir~p~gavghg~~yhsqsp 170 (362) T KOG0525 98 GIVGFGIGLAAMGATAIAEIQFADYIFPAFDQIVNEAAK----FRYRS---GNQFNCGGLTIRAPWGAVGHGALYHSQSP 170 (362) T ss_pred CCCEECHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHH----HEECC---CCCCCCCCEEEECCCCCCCCCCCCCCCCC T ss_conf 542001312540661588885022122558999878776----25235---78445676577455442455530156882 Q ss_pred CCCCC-CCCCCCCCCCCCCH Q ss_conf 33453-10001246777868 Q gi|254780664|r 394 DNVVD-NKELTVVTKRSPTD 412 (536) Q Consensus 394 D~~~~-~~~~~vVT~~~pt~ 412 (536) ....- -.-+|+|--|.|.+ T Consensus 171 eaff~h~pgikvviprsp~q 190 (362) T KOG0525 171 EAFFCHVPGIKVVIPRSPRQ 190 (362) T ss_pred HHHEECCCCCEEEECCCCCH T ss_conf 35300489836982489504 No 210 >cd07571 ALP_N-acyl_transferase Apolipoprotein N-acyl transferase (class 9 nitrilases). ALP N-acyl transferase (Lnt), is an essential membrane-bound enzyme in gram-negative bacteria, which catalyzes the N-acylation of apolipoproteins, the final step in lipoprotein maturation. This is a reverse amidase (i.e. condensation) reaction. This subgroup belongs to a larger nitrilase superfamily comprised of nitrile- or amide-hydrolyzing enzymes and amide-condensing enzymes, which depend on a Glu-Lys-Cys catalytic triad. This superfamily has been classified in the literature based on global and structure based sequence analysis into thirteen different enzyme classes (referred to as 1-13), this subgroup corresponds to class 9. Probab=26.69 E-value=33 Score=14.78 Aligned_cols=70 Identities=21% Similarity=0.341 Sum_probs=41.8 Q ss_pred HHHHHHHHHHCCCC--CEEEEEECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCHHHH Q ss_conf 89999998600566--7799996893899729688889999999999876420012222348709997366868555899 Q gi|254780664|r 417 DMKFAFKVVKHVKS--NAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSVIASEAFYPFPDGIVE 494 (536) Q Consensus 417 dL~FA~kv~K~vkS--NAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSDAFFPF~D~ie~ 494 (536) |..|...+-+.++. +-|+..-|..--|-..|+.- -..+|..||-|.+.. ++ . T Consensus 162 E~~f~~~~r~~~~~ga~~lv~~tNdaWfg~s~~~~Q---hl~~ar~RAiE~~r~----------~v-------------r 215 (270) T cd07571 162 ESIFPELVRDAVRQGADLLVNITNDAWFGDSAGPYQ---HLAMARLRAIETGRP----------LV-------------R 215 (270) T ss_pred HHHCHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHH---HHHHHHHHHHHHCCC----------EE-------------E T ss_conf 343649999986448709999736645678841799---999989889982996----------39-------------8 Q ss_pred HHHCCCCEEEECCCCCCH Q ss_conf 997199599938987798 Q gi|254780664|r 495 AIKAGVTAVIQPGGSVRD 512 (536) Q Consensus 495 aa~~Gi~aIiqPGGSirD 512 (536) +++.||+++|-|-|.|.. T Consensus 216 ~~ntGiSa~Id~~G~i~~ 233 (270) T cd07571 216 AANTGISAVIDPDGRIVA 233 (270) T ss_pred ECCCEEEEEECCCCCEEE T ss_conf 669626899999987888 No 211 >cd04872 ACT_1ZPV ACT domain proteins similar to the yet uncharacterized Streptococcus pneumoniae ACT domain protein. This CD, ACT_1ZPV, includes those single ACT domain proteins similar to the yet uncharacterized Streptococcus pneumoniae ACT domain protein (pdb structure 1ZPV). Members of this CD belong to the superfamily of ACT regulatory domains. Probab=26.68 E-value=44 Score=13.91 Aligned_cols=29 Identities=24% Similarity=0.525 Sum_probs=17.6 Q ss_pred EEEEEEE--CCCCHH-HHHHHHHHCCCEEEEE Q ss_conf 9899875--456869-9999999879999994 Q gi|254780664|r 18 TALISVH--NKTGVV-EFASRLLSRGIKIIST 46 (536) Q Consensus 18 rALiSV~--dKtgl~-~la~~L~~~g~~iisT 46 (536) ||.|+|+ ||.||+ .+++.|.++|+.|.-. T Consensus 1 kaVITviG~Dr~GIVa~vt~~Lae~~iNI~Di 32 (88) T cd04872 1 KAVITVVGKDRVGIVAGVSTKLAELNVNILDI 32 (88) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHCCCCEEEE T ss_conf 96999977998878999999999869988953 No 212 >PRK01045 ispH 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed Probab=26.55 E-value=44 Score=13.89 Aligned_cols=29 Identities=10% Similarity=0.135 Sum_probs=24.7 Q ss_pred CCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCCCEE Q ss_conf 9855323715553553308898999999977998414 Q gi|254780664|r 74 GGRVKTLHPKIYGGILSIRDNPAHMKFMQDHELESID 110 (536) Q Consensus 74 ~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~~~i~~ID 110 (536) +..|-||+|-|| |+.-++.|++.|+..+| T Consensus 30 ~~~iytlG~iIH--------N~~vv~~L~~~Gv~~v~ 58 (304) T PRK01045 30 GAPIYVRHEIVH--------NKYVVERLKKKGAIFVE 58 (304) T ss_pred CCCEEEECCCCC--------CHHHHHHHHHCCCEEEC T ss_conf 996898478864--------98999999988699961 No 213 >PRK13371 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Provisional Probab=26.41 E-value=40 Score=14.21 Aligned_cols=70 Identities=17% Similarity=0.232 Sum_probs=47.3 Q ss_pred HHHHHHHHCCCEEEEEHHHHHHHHHCCCC-----EEEHH--HHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHH Q ss_conf 99999998799999942768999988995-----17946--534881651985532371555355330889899999997 Q gi|254780664|r 31 EFASRLLSRGIKIISTGGTCQLLEEEGIP-----VTSVF--DITKFPEIMGGRVKTLHPKIYGGILSIRDNPAHMKFMQD 103 (536) Q Consensus 31 ~la~~L~~~g~~iisTgGTa~~l~~~gi~-----v~~Vs--~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~ 103 (536) +|.+.|.++|+++---+=|.+.-+..|.= ..++. ....|| |.||=||||-|| ||.=.+.|++ T Consensus 28 ~~i~~~~~~~~~~~~g~~~i~LA~~~GFC~GVeRAI~ia~ea~~~~p---~~~Iy~~geIIH--------Np~Vv~~L~~ 96 (392) T PRK13371 28 SLIQSIRDNGYVLQRGDVTIKLAEAFGFCWGVERAVAMAYETRRHFP---DERIWITNEIIH--------NPSVNQHLRE 96 (392) T ss_pred HHHHHHHHCCCEEEECCEEEEEHHHCCCCCCHHHHHHHHHHHHHHCC---CCCEEEECCCCC--------CHHHHHHHHH T ss_conf 99999997497675577799966546875358999999999998689---997897178746--------9799999997 Q ss_pred CCCCCEEE Q ss_conf 79984149 Q gi|254780664|r 104 HELESIDL 111 (536) Q Consensus 104 ~~i~~IDl 111 (536) .|+..|+- T Consensus 97 ~Gv~~i~~ 104 (392) T PRK13371 97 MGVKFIPV 104 (392) T ss_pred CCCEEEEC T ss_conf 79999624 No 214 >COG1013 PorB Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit [Energy production and conversion] Probab=26.32 E-value=31 Score=14.99 Aligned_cols=29 Identities=38% Similarity=0.630 Sum_probs=24.4 Q ss_pred CCCCCHHHHHHHHCCCCCCEEEECHHHHH Q ss_conf 23343469998740156313530899989 Q gi|254780664|r 136 IDIGGPSMIRAAAKNHDYVTILTNPQDYP 164 (536) Q Consensus 136 IDIGGpsmiRAAAKN~~~V~Vi~dp~dY~ 164 (536) .|||+..++-++..|+.=..++.|-..|. T Consensus 101 ~dIG~~~l~ha~~Rn~dit~iv~DNevYg 129 (294) T COG1013 101 YDIGGNHLIHALRRNHDITYIVVDNEVYG 129 (294) T ss_pred HHCCHHHHHHHHHCCCCEEEEEECCEECC T ss_conf 54101677899970798599997770045 No 215 >PRK11867 2-oxoglutarate ferredoxin oxidoreductase subunit beta; Reviewed Probab=26.31 E-value=45 Score=13.86 Aligned_cols=29 Identities=41% Similarity=0.540 Sum_probs=22.9 Q ss_pred CCCCCHHHHHHHHCCCCCCEEEECHHHHH Q ss_conf 23343469998740156313530899989 Q gi|254780664|r 136 IDIGGPSMIRAAAKNHDYVTILTNPQDYP 164 (536) Q Consensus 136 IDIGGpsmiRAAAKN~~~V~Vi~dp~dY~ 164 (536) -+|||--+|-|+-.|..-+.++.|-.-|. T Consensus 99 ~~IGgnH~iHA~rRN~dit~iv~nN~iYG 127 (280) T PRK11867 99 LAIGGNHFIHALRRNIDITIILFNNQIYG 127 (280) T ss_pred HHCHHHHHHHHHHHCCCCEEEEECCEEEC T ss_conf 20238889999980898189998882010 No 216 >TIGR00520 asnASE_II L-asparaginases, type II; InterPro: IPR004550 L-asparaginase catalyses the conversion of L-asparagine to L-aspartate. Two related families of asparaginase (L-asparagine amidohydrolase, 3.5.1.1 from EC) are designated type I and type II according to the terminology in Escherichia coli, which has both: L-asparaginase I is a low-affinity enzyme found in the cytoplasm, while L-asparaginase II is a high-affinity periplasmic enzyme synthesized with a cleavable signal sequence. This family describes L-asparaginases related to type II of E. coli. Both the cytoplasmic and the cell wall asparaginases of Saccharomyces cerevisiae belong to this set. Members of this set from Acinetobacter glutaminasificans and Pseudomonas fluorescens are described as having both glutaminase and asparaginase activities , . All members are homotetrameric. ; GO: 0004067 asparaginase activity, 0006528 asparagine metabolic process. Probab=26.30 E-value=25 Score=15.70 Aligned_cols=15 Identities=20% Similarity=0.264 Sum_probs=8.7 Q ss_pred EECCCCHHHHHCCCC Q ss_conf 020321022310123 Q gi|254780664|r 380 VLKTVSGGILVQTRD 394 (536) Q Consensus 380 ~~rsi~GG~LvQ~~D 394 (536) .||+..-|+|=.=.| T Consensus 197 TF~~~n~G~lG~~~n 211 (360) T TIGR00520 197 TFKSENQGYLGYIHN 211 (360) T ss_pred CCCCCCCCCEEEEEC T ss_conf 121025541046762 No 217 >COG1587 HemD Uroporphyrinogen-III synthase [Coenzyme metabolism] Probab=26.21 E-value=45 Score=13.85 Aligned_cols=145 Identities=18% Similarity=0.146 Sum_probs=66.6 Q ss_pred CCHHHHHHHHHHHHHHHEECC-CCHH-HHHHHHCCCC--CEEEECCCCCCCCCCCCEECCCCHHHHHCCCCCCCCCCCCC Q ss_conf 421457878865431111268-7989-9988512455--31563167667776664020321022310123345310001 Q gi|254780664|r 328 VDQEVAKEVIKVFTEAIIAPT-LSEE-AADVLAKKPS--MRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDNVVDNKELT 403 (536) Q Consensus 328 vd~~~A~~I~~~F~EvIiAP~-f~~e-AleiL~~KKn--lRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~~~~~~~~~ 403 (536) |-..||+.+.+..+.+.+-|. ++.+ -++.|..-.+ -|++...... -|.+....|.+.--.. .++. T Consensus 85 VG~~Ta~~l~~~G~~~~~~p~~~~~~~l~~~l~~~~~~~~~vl~~~~~~--------~r~~l~~~L~~~G~~v---~~~~ 153 (248) T COG1587 85 VGEKTAEALRKLGIKVDFIPEDGDSEGLLEELPELLKGGKRVLILRGNG--------GREVLEEKLEERGAEV---REVE 153 (248) T ss_pred ECHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHCCCCCEEEEECCCC--------CCHHHHHHHHHCCCEE---EEEE T ss_conf 8899999999829987758988777999999874256787599973777--------7389999999769779---9997 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEC Q ss_conf 24677786889888999999860056677999968938997296888899999999998764200122223487099973 Q gi|254780664|r 404 VVTKRSPTDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSVIASE 483 (536) Q Consensus 404 vVT~~~pt~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSD 483 (536) +=+ ..|.......+ +...+.-+-.+|++.-- -+++.-..-+...... .+..+..+| T Consensus 154 ~Y~-~~~~~~~~~~~---~~~~~~~~~d~v~ftS~--------------~~v~~~~~~~~~~~~~-----~~~~~~v~~- 209 (248) T COG1587 154 VYR-TEPPPLDEATL---IELLKLGEVDAVVFTSS--------------SAVRALLALAPESGIE-----FLERKRVAS- 209 (248) T ss_pred EEE-EECCCCCHHHH---HHHHHCCCCCEEEECCH--------------HHHHHHHHHCCCCCCH-----HHCCCEEEE- T ss_conf 775-30687557789---88851178559998799--------------9999999863522201-----212857999- Q ss_pred CCCCCCCHHHHHHHCCCCEEEECCCCC Q ss_conf 668685558999971995999389877 Q gi|254780664|r 484 AFYPFPDGIVEAIKAGVTAVIQPGGSV 510 (536) Q Consensus 484 AFFPF~D~ie~aa~~Gi~aIiqPGGSi 510 (536) .-|..-+.+.+.|++.++||--+. T Consensus 210 ---IG~~Ta~~l~~~G~~~~~~~~~~~ 233 (248) T COG1587 210 ---IGPRTAETLKELGITVDIAAEKPT 233 (248) T ss_pred ---ECHHHHHHHHHHCCCCEECCCCCC T ss_conf ---888999999982996233256677 No 218 >TIGR01918 various_sel_PB selenoprotein B, glycine/betaine/sarcosine/D-proline reductase family; InterPro: IPR010187 This entry represents selenoprotein B of glycine reductase, sarcosine reductase, betaine reductase, D-proline reductase, and perhaps others. All members are expected to contain an internal UGA codon, encoding selenocysteine, which may be misinterpreted as a stop codon.; GO: 0030699 glycine reductase activity, 0050485 oxidoreductase activity acting on X-H and Y-H to form an X-Y bond with a disulfide as acceptor, 0030700 glycine reductase complex. Probab=26.13 E-value=44 Score=13.87 Aligned_cols=38 Identities=11% Similarity=0.037 Sum_probs=12.9 Q ss_pred HCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHHH Q ss_conf 00566779999689389972968888999999999987 Q gi|254780664|r 426 KHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAH 463 (536) Q Consensus 426 K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~ 463 (536) |+.|---|.=+...--+-=|.-.+.=.+--+-+.+|+- T Consensus 153 K~~~GE~Ig~~~EeGy~~RG~R~~~f~ed~kp~s~R~V 190 (433) T TIGR01918 153 KLAKGEEIGSPDEEGYFERGVRVNAFREDRKPGSERAV 190 (433) T ss_pred HHHCCCCCCCCCCCCCCCCCCEECCCCCCCCHHHHHHH T ss_conf 86435655886546722268210120013762578999 No 219 >TIGR03568 NeuC_NnaA UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing. This family of enzymes catalyzes the combined epimerization and UDP-hydrolysis of UDP-N-acetylglucosamine to N-acetylmannosamine. This is in contrast to the related enzyme WecB (TIGR00236) which retains the UDP moiety. NeuC acts in concert with NeuA and NeuB to synthesize CMP-N5-acetyl-neuraminate. Probab=26.12 E-value=45 Score=13.84 Aligned_cols=32 Identities=19% Similarity=0.366 Sum_probs=17.4 Q ss_pred HHHHHHHHHHC---CCEEEEEHHH--------HHHHHHCCCCE Q ss_conf 69999999987---9999994276--------89999889951 Q gi|254780664|r 29 VVEFASRLLSR---GIKIISTGGT--------CQLLEEEGIPV 60 (536) Q Consensus 29 l~~la~~L~~~---g~~iisTgGT--------a~~l~~~gi~v 60 (536) +-++.+.|.+. .+.++.||-- .+.+...|+.. T Consensus 16 lapli~~l~~~~~~~~~li~TGqH~~~~~g~t~~~i~~d~~~~ 58 (365) T TIGR03568 16 LRPLLKALQDDPDLELQLIVTGMHLSPEYGNTVNEIEKDGFDI 58 (365) T ss_pred HHHHHHHHHHCCCCCEEEEEECCCCCHHHCCHHHHHHHCCCCC T ss_conf 9999999972899888999907778411070899999757987 No 220 >PTZ00298 mevalonate kinase; Provisional Probab=26.08 E-value=25 Score=15.71 Aligned_cols=27 Identities=30% Similarity=0.567 Sum_probs=19.4 Q ss_pred EECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCC Q ss_conf 961568656545201689999998533213200455542754 Q gi|254780664|r 287 IVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREV 328 (536) Q Consensus 287 IvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~v 328 (536) -+-|.+|||+ ++ -+|+|||.|.|.+.- T Consensus 145 ~~~hG~pSGi---D~------------~~s~~Gg~i~f~~~~ 171 (328) T PTZ00298 145 GGYHGTPSGA---DN------------TAATYGGLISYRRVN 171 (328) T ss_pred HHHCCCCCHH---HH------------HHHHCCCEEEEEECC T ss_conf 9855899727---69------------999459879998548 No 221 >PRK12823 benD 1,6-dihydroxycyclohexa-2,4-diene-1-carboxylate dehydrogenase; Provisional Probab=26.00 E-value=45 Score=13.82 Aligned_cols=38 Identities=16% Similarity=0.128 Sum_probs=25.9 Q ss_pred CCCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHH Q ss_conf 77613698998754568699999999879999994276 Q gi|254780664|r 12 GEIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGGT 49 (536) Q Consensus 12 ~~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGT 49 (536) .+++=|+|||.=.-+-==..+|+.|.+.|+.++-++-+ T Consensus 4 ~~l~gKvalITGas~GIG~aiA~~la~~Ga~V~~~~r~ 41 (260) T PRK12823 4 QRFAGKVAVVTGAAQGIGRGVALRAAAEGARVVLVDRS 41 (260) T ss_pred CCCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECC T ss_conf 88799989994886789999999999879999999694 No 222 >COG1483 Predicted ATPase (AAA+ superfamily) [General function prediction only] Probab=25.99 E-value=45 Score=13.82 Aligned_cols=114 Identities=17% Similarity=0.103 Sum_probs=73.1 Q ss_pred ECCCCHHHHHHHHHHCCCEEEE--EHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHHHHHHH Q ss_conf 5456869999999987999999--42768999988995179465348816519855323715553553308898999999 Q gi|254780664|r 24 HNKTGVVEFASRLLSRGIKIIS--TGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNPAHMKFM 101 (536) Q Consensus 24 ~dKtgl~~la~~L~~~g~~iis--TgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~~~~~l 101 (536) .|+.+|.++.-.+..++..++- -+|| ++-|.+++. +| |||+-+.|-=-+. .-....+- T Consensus 98 ~d~e~i~el~e~~~~l~~~~v~v~~~~~------~~~P~~p~~---------~~-Vkt~wg~ia~~lg----~y~~v~~~ 157 (774) T COG1483 98 EDLEPIEELVERIKNLSVPIVVVYGKGT------APQPRQPLE---------GG-VKTEWGYIAYSLG----EYEIVEDV 157 (774) T ss_pred HCHHHHHHHHHHHHHCCCCEEEEECCCC------CCCCCCCCC---------CC-CCCCHHHHHHHCC----CCCHHHHH T ss_conf 0302158899999707886699977887------899987601---------66-5420899999724----77401103 Q ss_pred HHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCC-----------------CHHHHHHHHCCCCCCEEEECH Q ss_conf 97799841499994888899731765311232002334-----------------346999874015631353089 Q gi|254780664|r 102 QDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIG-----------------GPSMIRAAAKNHDYVTILTNP 160 (536) Q Consensus 102 ~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIG-----------------GpsmiRAAAKN~~~V~Vi~dp 160 (536) .+.-..+.+=+.++|--+++.. ..++++.+.||=+ =.+|+++|.|+-+.+.|.+-| T Consensus 158 de~~t~~~~~~~~~ll~~~~v~---~lidei~deid~~~~~~~e~~r~y~~~~dn~La~v~sa~~~s~~~~v~~il 230 (774) T COG1483 158 DENLTAPDPELLDELLDLEEVL---ALIDEIGDEIDTLIRSAFEEDRRYAGNADNPLARVLSALKGSKDEMVAIIL 230 (774) T ss_pred HHHHCCCCCCHHHHHCCHHHHH---HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEE T ss_conf 4653378842001430337778---889888888877513000001123321341899999874258630467775 No 223 >pfam11564 BpuJI_N Restriction endonuclease BpuJI - N terminal. BpuJI is a restriction endonuclease which recognizes the asymmetric sequence 5'-CCCGT and cuts at multiple sites in the surrounding area of the target sequence. This family of proteins is the N terminal domain of BpuJI which has DNA recognition functions. The recognition domain has two subdomains D1 and D2. The recognition of the target sequence occurs through major groove contacts of amino acids on the helix-turn-helix region and the N-terminal arm. Probab=25.94 E-value=42 Score=14.03 Aligned_cols=19 Identities=11% Similarity=0.139 Sum_probs=12.3 Q ss_pred CCCCCCHHHHHHHHHHHHH Q ss_conf 6777868898889999998 Q gi|254780664|r 406 TKRSPTDQELRDMKFAFKV 424 (536) Q Consensus 406 T~~~pt~~e~~dL~FA~kv 424 (536) .+.+.|++..+...-+|-- T Consensus 266 ~~~~tt~ed~k~f~~~Wf~ 284 (294) T pfam11564 266 GNASTSEDDKKNFFKAWFE 284 (294) T ss_pred HCCCCCHHHHHHHHHHHHH T ss_conf 2254758889999999999 No 224 >cd00610 OAT_like Acetyl ornithine aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to ornithine aminotransferase, acetylornithine aminotransferase, alanine-glyoxylate aminotransferase, dialkylglycine decarboxylase, 4-aminobutyrate aminotransferase, beta-alanine-pyruvate aminotransferase, adenosylmethionine-8-amino-7-oxononanoate aminotransferase, and glutamate-1-semialdehyde 2,1-aminomutase. All the enzymes belonging to this family act on basic amino acids and their derivatives are involved in transamination or decarboxylation. Probab=25.93 E-value=45 Score=13.81 Aligned_cols=18 Identities=17% Similarity=0.108 Sum_probs=10.0 Q ss_pred EECCCCCCHHHHHHHHHH Q ss_conf 938987798899999987 Q gi|254780664|r 504 IQPGGSVRDSEAITVADQ 521 (536) Q Consensus 504 iqPGGSirD~evI~aan~ 521 (536) +.|-=.|.|+|+-+..+. T Consensus 391 ~~Ppl~it~~eid~~~~~ 408 (413) T cd00610 391 LLPPLIITEEEIDEGLDA 408 (413) T ss_pred EECCCCCCHHHHHHHHHH T ss_conf 978836899999999999 No 225 >pfam05327 RRN3 RNA polymerase I specific transcription initiation factor RRN3. This family consists of several eukaryotic proteins which are homologous to the yeast RRN3 protein. RRN3 is one of the RRN genes specifically required for the transcription of rDNA by RNA polymerase I (Pol I) in Saccharomyces cerevisiae. Probab=25.88 E-value=25 Score=15.69 Aligned_cols=72 Identities=11% Similarity=0.040 Sum_probs=43.8 Q ss_pred HHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHH--HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1123200233434699987401563135308999898--8987530234566678899999999987422899998753 Q gi|254780664|r 129 YYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPL--FLAEMDVNNGKIPYNFRKKMARQAFSRTASYDTAICRWLA 205 (536) Q Consensus 129 ~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~--~~~el~~~~g~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~ 205 (536) +..++..|=-+-..|.-.=+++|.+. .+|..... +-+-|.-.. -..+.|...=...+.+.-.-|..|..-+. T Consensus 150 L~~il~lVP~s~~~L~~~L~~~FP~~---~~~~~~~~~Yv~NlL~l~~--Y~p~Lr~~IL~lIieklikiDVeiq~~~d 223 (550) T pfam05327 150 LRYILRLVPSAPSFLLPILAKNFPHK---TKSKKVLVGYVRNLLRLTE--YCPELGFDILELIIERLLKLDVEIQNELD 223 (550) T ss_pred HHHHHHHCCCCHHHHHHHHHHHCCCC---CCCHHHHHHHHHHHHHHHH--CCHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 99999985898578999999859999---9956889999999999984--54878999999999999994347678500 No 226 >COG1433 Uncharacterized conserved protein [Function unknown] Probab=25.83 E-value=45 Score=13.80 Aligned_cols=58 Identities=16% Similarity=0.197 Sum_probs=38.0 Q ss_pred HHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCHHHHHCC Q ss_conf 145787886543111126879899988512455315631676677766640203210223101 Q gi|254780664|r 330 QEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSGGILVQT 392 (536) Q Consensus 330 ~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~GG~LvQ~ 392 (536) ..+|+.+.+.=+|+||++.+-+.|++.|+.+ .+++...... .-.--++.+..|-|.+. T Consensus 55 ~~~a~~l~~~gvdvvi~~~iG~~a~~~l~~~-GIkv~~~~~~----~V~e~i~~~~~g~l~~~ 112 (121) T COG1433 55 IRIAELLVDEGVDVVIASNIGPNAYNALKAA-GIKVYVAPGG----TVEEAIKAFLEGELEEA 112 (121) T ss_pred HHHHHHHHHCCCCEEEECCCCHHHHHHHHHC-CCEEEECCCC----CHHHHHHHHHCCCCCCC T ss_conf 7799999976998999776697799999974-9579965998----99999999861876556 No 227 >KOG1203 consensus Probab=25.82 E-value=19 Score=16.53 Aligned_cols=16 Identities=19% Similarity=0.075 Sum_probs=8.3 Q ss_pred HHHHHHCCCCCCEEEE Q ss_conf 9998740156313530 Q gi|254780664|r 143 MIRAAAKNHDYVTILT 158 (536) Q Consensus 143 miRAAAKN~~~V~Vi~ 158 (536) +.-|..||+..|..+. T Consensus 188 ~~~aGvk~~vlv~si~ 203 (411) T KOG1203 188 CKKAGVKRVVLVGSIG 203 (411) T ss_pred HHHHCCCEEEEEEEEC T ss_conf 9983874599997634 No 228 >PRK00474 rps9p 30S ribosomal protein S9P; Reviewed Probab=25.78 E-value=43 Score=13.98 Aligned_cols=21 Identities=24% Similarity=0.311 Sum_probs=18.8 Q ss_pred ECCCCCCHHHHHHHHHHHHHH Q ss_conf 729688889999999999876 Q gi|254780664|r 444 IGSGQTSRVDSTRFAAIKAHN 464 (536) Q Consensus 444 iGaGQ~sRVda~~iA~~KA~~ 464 (536) -|.|.+...+|+++|+.||-. T Consensus 66 ~GGG~sgQA~AirlaIaRAL~ 86 (132) T PRK00474 66 SGGGIMGQADAVRTAIARGLV 86 (132) T ss_pred ECCCHHHHHHHHHHHHHHHHH T ss_conf 778725999999999999999 No 229 >pfam01912 eIF-6 eIF-6 family. This family includes eukaryotic translation initiation factor 6 as well as presumed archaebacterial homologues. Probab=25.59 E-value=46 Score=13.77 Aligned_cols=63 Identities=22% Similarity=0.406 Sum_probs=41.6 Q ss_pred CCCCCCCCCCHHHH----HHHHHH------HHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCC Q ss_conf 68656545201689----999998------53321320045554275421457878865431111268798999885124 Q gi|254780664|r 291 MNPCGVATADTLVE----AYRRAL------SCDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKK 360 (536) Q Consensus 291 ~NPCGvA~~~~~~~----Ay~~A~------~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~K 360 (536) .|-=|..+.....+ ..++.+ --+.++|.|-+|++|... .++-|++++|+.++++.- T Consensus 61 gN~nGllvp~~~~d~El~~L~~~~~v~V~~l~~~~~AlGN~Il~ND~~--------------Alv~p~l~~e~~e~I~dv 126 (198) T pfam01912 61 GNSNGLLVPSTATDEELDHLKESLDVNVERLEEKLTALGNVILANDKG--------------ALVHPDLSRETEEIIADV 126 (198) T ss_pred ECCCEEECCCCCCHHHHHHHHHCCCEEEEEECCCCCCCCCEEEECCCC--------------EEECCCCCHHHHHHHHHH T ss_conf 258827818978999999877427827999688556510489974754--------------596577998999999886 Q ss_pred CCCEEEE Q ss_conf 5531563 Q gi|254780664|r 361 PSMRFLK 367 (536) Q Consensus 361 KnlRil~ 367 (536) =+..+.+ T Consensus 127 LgVev~~ 133 (198) T pfam01912 127 LGVEVFR 133 (198) T ss_pred HCCEEEE T ss_conf 3963999 No 230 >pfam00290 Trp_syntA Tryptophan synthase alpha chain. Probab=25.57 E-value=46 Score=13.77 Aligned_cols=33 Identities=24% Similarity=0.350 Sum_probs=15.7 Q ss_pred HHHHHHHHHEECCCCHHHHHHHHC---CCCCEEEEC Q ss_conf 886543111126879899988512---455315631 Q gi|254780664|r 336 VIKVFTEAIIAPTLSEEAADVLAK---KPSMRFLKT 368 (536) Q Consensus 336 I~~~F~EvIiAP~f~~eAleiL~~---KKnlRil~~ 368 (536) ..+.=++.+|-||.-.|-.+-+.+ +.++.++.+ T Consensus 110 ~~~~GvdGvIipDLP~eE~~~~~~~~~~~~l~~I~l 145 (258) T pfam00290 110 AAEAGVDGLIIPDLPPEEADPLREAAEKHGIDLIFL 145 (258) T ss_pred HHHCCCCEEECCCCCHHHHHHHHHHHHHCCCCEEEE T ss_conf 997599778707999889999999998458435888 No 231 >PRK07814 short chain dehydrogenase; Provisional Probab=25.52 E-value=46 Score=13.76 Aligned_cols=87 Identities=16% Similarity=0.182 Sum_probs=46.3 Q ss_pred CCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCC Q ss_conf 76136989987545686999999998799999942768999988995179465348816519855323715553553308 Q gi|254780664|r 13 EIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIR 92 (536) Q Consensus 13 ~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r 92 (536) +++=|.|||.=.-+-==...|+.|.+.|.+++-++-..+.|.+. .+++.. .++++..+ .++- T Consensus 7 ~L~gKvalITGgs~GIG~aia~~la~~Ga~V~i~~~~~~~l~~~------~~~i~~----~g~~~~~~--------~~Dv 68 (263) T PRK07814 7 RLDGQVAVVTGAGRGLGAAIALAFAEAGADVLIAARTESQLDEV------AEQIRA----AGRRAHVV--------AADL 68 (263) T ss_pred CCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCHHHHHHH------HHHHHH----CCCEEEEE--------ECCC T ss_conf 89999899958966899999999998799899996989999999------999985----29928999--------8158 Q ss_pred CCHHHHHHHHH---CCCCCEEEEEEECC Q ss_conf 89899999997---79984149999488 Q gi|254780664|r 93 DNPAHMKFMQD---HELESIDLVVVNLY 117 (536) Q Consensus 93 ~~~~~~~~l~~---~~i~~IDlVvvNLY 117 (536) .++++.+.+-+ ..+..||++|.|-- T Consensus 69 ~~~~~v~~~v~~~~~~~G~iDiLVnNAg 96 (263) T PRK07814 69 AHPEATAGLAGQAVEAFGRLDIVVNNVG 96 (263) T ss_pred CCHHHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 9999999999999998299889998986 No 232 >KOG1753 consensus Probab=25.52 E-value=39 Score=14.32 Aligned_cols=23 Identities=30% Similarity=0.238 Sum_probs=16.2 Q ss_pred EEECCCCCCHHHHHHHHHHHHHH Q ss_conf 99729688889999999999876 Q gi|254780664|r 442 VGIGSGQTSRVDSTRFAAIKAHN 464 (536) Q Consensus 442 iGiGaGQ~sRVda~~iA~~KA~~ 464 (536) ---|.|.+|||-++++|+.||-- T Consensus 70 ~~~ggghvsqiyairqa~~kalv 92 (145) T KOG1753 70 RVKGGGHVSQIYAIRQAIAKALV 92 (145) T ss_pred EEECCCHHHHHHHHHHHHHHHHH T ss_conf 98358508889999987367799 No 233 >pfam04963 Sigma54_CBD Sigma-54 factor, core binding domain. This domain makes a direct interaction with the core RNA polymerase, to form an enhancer dependent holoenzyme. The centre of this domain contains a very weak similarity to a helix-turn-helix motif which may represent the other DNA binding domain. Probab=25.28 E-value=46 Score=13.73 Aligned_cols=101 Identities=19% Similarity=0.316 Sum_probs=59.2 Q ss_pred CCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHH Q ss_conf 21200116776424556727999615686565452016899999985332132004555427542145787886543111 Q gi|254780664|r 265 NDLDAAFELVSEFRSQDCAACVIVKHMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFTEAI 344 (536) Q Consensus 265 lD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvI 344 (536) .++..+++.++.| -|+||+. .|+.|.+.--+.-.+ .....|..|...++|.+ T Consensus 67 ~~ie~~l~~iQ~l--------------eP~GIgA-rnL~EcL~lQL~~~~-------------~~~~~a~~ii~~~le~l 118 (195) T pfam04963 67 DEVEAVLAVLQQF--------------DPAGVGA-RDLSECLLLQLRQLD-------------RWLDPAMEALVDHLELL 118 (195) T ss_pred HHHHHHHHHHHCC--------------CCCCCCC-CCHHHHHHHHHHHCC-------------CCHHHHHHHHHHHHHHH T ss_conf 9999999999748--------------9982350-889999999986528-------------65689999999889998 Q ss_pred EECCCCH-------------HHHHHHHC---CCCCEEEECCCCCCCCCCCCEECCCCHHHHHCCCCC Q ss_conf 1268798-------------99988512---455315631676677766640203210223101233 Q gi|254780664|r 345 IAPTLSE-------------EAADVLAK---KPSMRFLKTSSLLDFHGEEIVLKTVSGGILVQTRDN 395 (536) Q Consensus 345 iAP~f~~-------------eAleiL~~---KKnlRil~~~~~~~~~~~~~~~rsi~GG~LvQ~~D~ 395 (536) ..-.|.. +|++++++ ||...+-. ....+-.+.+-++...|+|.++-.+. T Consensus 119 ~~~~~~~i~k~l~~s~~~i~~a~~~I~~LnPkPg~~f~~--~~~~yI~PDi~V~~~~~~~~v~lN~~ 183 (195) T pfam04963 119 ARRDFATLKRICGVDEEDLREMIEEIRSLNPKPGSAFES--GEAEYVVPDVIVRKGNGGWLVELNPD 183 (195) T ss_pred HCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCC--CCCCCCCCCEEEEEECCEEEEEECCC T ss_conf 526899999996919999999999999679996111378--98887589679998599899998778 No 234 >PRK13123 consensus Probab=25.24 E-value=47 Score=13.73 Aligned_cols=17 Identities=29% Similarity=0.534 Sum_probs=7.8 Q ss_pred HHHHEECCCCHHHHHHH Q ss_conf 31111268798999885 Q gi|254780664|r 341 TEAIIAPTLSEEAADVL 357 (536) Q Consensus 341 ~EvIiAP~f~~eAleiL 357 (536) ++.+|-||.-.|.-+-+ T Consensus 118 vdGvIipDLP~eE~~~~ 134 (256) T PRK13123 118 VKGLIIPDLPYEHQDFI 134 (256) T ss_pred CCEEECCCCCHHHHHHH T ss_conf 97897379996789999 No 235 >PRK13140 consensus Probab=25.08 E-value=47 Score=13.71 Aligned_cols=101 Identities=13% Similarity=0.207 Sum_probs=46.0 Q ss_pred CCCHHHHHHHHCCCCCCCEEEEECCCCCC--CCCCCCHHHHHHHHHHHHCC--CCCCC-----------EEEEE---CCC Q ss_conf 12001167764245567279996156865--65452016899999985332--13200-----------45554---275 Q gi|254780664|r 266 DLDAAFELVSEFRSQDCAACVIVKHMNPC--GVATADTLVEAYRRALSCDP--ISAFG-----------GIIAF---NRE 327 (536) Q Consensus 266 D~daA~~lv~ef~~~~~Pa~vIvKH~NPC--GvA~~~~~~~Ay~~A~~~DP--~SAFG-----------GIIa~---N~~ 327 (536) |.+..++++..+.+. .|=++-=.-|- -+|-|..+-.|..+|++..- ...|. -+|.+ |-- T Consensus 26 ~~~~s~~~~~~l~~~---GaDiiElGiPfSDP~ADGpvIq~A~~rAL~~G~~~~~~~~~~~~~r~~~~~pivlM~Y~N~i 102 (257) T PRK13140 26 TLEDTVSIIKDLEKH---GVDMIEIGIPFSDPLADGPVIQKSATQALKNGMTLKLLFEQLKDIRKEVQIPLILMGYLNPI 102 (257) T ss_pred CHHHHHHHHHHHHHC---CCCEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCEEEEECHHHH T ss_conf 879999999999975---99999978988987765899999999999869989999999999743689888999055999 Q ss_pred CC---HHHHHHHHHHHHHHHEECCCCHHH-HHHH---HCCCCCEEEECC Q ss_conf 42---145787886543111126879899-9885---124553156316 Q gi|254780664|r 328 VD---QEVAKEVIKVFTEAIIAPTLSEEA-ADVL---AKKPSMRFLKTS 369 (536) Q Consensus 328 vd---~~~A~~I~~~F~EvIiAP~f~~eA-leiL---~~KKnlRil~~~ 369 (536) .. ++..+...+.=++.+|-||.-.|- .+.+ .++.++.++.+- T Consensus 103 ~~~G~e~F~~~~~~~GvdGlIipDLP~ee~~~~~~~~~~~~~i~~I~lv 151 (257) T PRK13140 103 MQYGFEKFCKKCAETGIDGVIIPDLPFDDYQEHYKEIFEKYGLKNIMLI 151 (257) T ss_pred HHHCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 8517999999999849986983599856758999999998699779986 No 236 >PRK00188 trpD anthranilate phosphoribosyltransferase; Provisional Probab=25.00 E-value=47 Score=13.70 Aligned_cols=40 Identities=15% Similarity=0.207 Sum_probs=20.1 Q ss_pred ECCCCCHHHHHHHH-HHHHHHHEECCCCHHHHHHHHCCCCC Q ss_conf 42754214578788-65431111268798999885124553 Q gi|254780664|r 324 FNREVDQEVAKEVI-KVFTEAIIAPTLSEEAADVLAKKPSM 363 (536) Q Consensus 324 ~N~~vd~~~A~~I~-~~F~EvIiAP~f~~eAleiL~~KKnl 363 (536) ++...+.+.++... +.=+=-+-||.|.|....+..-+|.+ T Consensus 130 ~~~~~s~~~~~~~~~~~g~~fl~Ap~~~pa~~~~~~~R~~l 170 (339) T PRK00188 130 INLDLTPEQVARCLDEVGIGFLFAPLYHPAMKHVAPVRKEL 170 (339) T ss_pred CCCCCCHHHHHHHHHHCCCEEEECHHHCHHHHHHHHHHHCC T ss_conf 76689999999999980956962143498889888775404 No 237 >PRK13034 serine hydroxymethyltransferase; Reviewed Probab=24.93 E-value=36 Score=14.57 Aligned_cols=18 Identities=44% Similarity=0.689 Sum_probs=10.9 Q ss_pred CCEEEEECCCCCHHHHHHHHH Q ss_conf 004555427542145787886 Q gi|254780664|r 318 FGGIIAFNREVDQEVAKEVIK 338 (536) Q Consensus 318 FGGIIa~N~~vd~~~A~~I~~ 338 (536) =||+|.+| |++.++.|.+ T Consensus 240 rGGiIl~~---~~el~~kId~ 257 (422) T PRK13034 240 RGGMILTN---DEEIAKKINS 257 (422) T ss_pred CCCEEECC---HHHHHHHHHC T ss_conf 84357526---6999998643 No 238 >PRK00230 orotidine 5'-phosphate decarboxylase; Reviewed Probab=24.89 E-value=45 Score=13.83 Aligned_cols=58 Identities=31% Similarity=0.364 Sum_probs=31.1 Q ss_pred CCCHHHHHHHHCC---CC------CCEEEECHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 3434699987401---56------3135308999898898753023456667889999999998742289999 Q gi|254780664|r 138 IGGPSMIRAAAKN---HD------YVTILTNPQDYPLFLAEMDVNNGKIPYNFRKKMARQAFSRTASYDTAIC 201 (536) Q Consensus 138 IGGpsmiRAAAKN---~~------~V~Vi~dp~dY~~~~~el~~~~g~~~~~~R~~lA~kAF~~ta~YD~~Ia 201 (536) .||+.|++|+.+. +. -|+++|+.++=+ +.++. .++. -.+.=.++|..|+. +.-|..|+ T Consensus 89 ~~G~~~l~a~~~~~~~~~~~~kll~Vt~LTS~~~~~--l~~~~-~~~~-~~~~v~~~a~~a~~--~g~dGiVc 155 (231) T PRK00230 89 SGGARMMAAAREGLEPGGSRPLLIAVTVLTSMDEED--LAELG-INLS-LEEQVLRLAKLAQE--AGLDGVVC 155 (231) T ss_pred CCCHHHHHHHHHHHHHCCCCCEEEEEEEECCCCHHH--HHHCC-CCCC-HHHHHHHHHHHHHH--HCCCEEEC T ss_conf 785999999999887148987599999623689889--98667-5789-99999999999999--69984863 No 239 >PRK12418 cysteinyl-tRNA synthetase; Provisional Probab=24.73 E-value=42 Score=14.06 Aligned_cols=115 Identities=16% Similarity=0.058 Sum_probs=67.9 Q ss_pred CCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCC Q ss_conf 76136989987545686999999998799999942768999988995179465348816519855323715553553308 Q gi|254780664|r 13 EIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIR 92 (536) Q Consensus 13 ~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r 92 (536) ++.|=--=..|||.+.|-..-..+.- .=-.++|+..|.+|+-|..+|.-..=+--|-+.. .+.-.-|++| T Consensus 9 ~V~mY~CGPTVYd~~HiGhaRt~v~f--------D~l~R~L~~~gy~V~~v~NiTDVDDKii~~A~~~--g~~~~ela~~ 78 (384) T PRK12418 9 TATMYVCGITPYDATHLGHAATYLAF--------DLVQRVWLDAGHDVHYVQNVTDVDDPLLERAARD--GVDWRDLAAR 78 (384) T ss_pred EEEEEECCCCCCCCCCCCCCHHHHHH--------HHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHH--CCCHHHHHHH T ss_conf 68999668868888542335168999--------9999999980996499981050638999999985--9399999999 Q ss_pred CCHHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCC Q ss_conf 89899999997799841499994888899731765311232002334346999874015631 Q gi|254780664|r 93 DNPAHMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYV 154 (536) Q Consensus 93 ~~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V 154 (536) -...-.++|+.+||.+-|. || .-+...+++|+.|. +--.|.|.++ T Consensus 79 ~~~~f~~d~~~Lnv~~p~~-----~p-----raTe~I~~ii~~I~-------~Li~~G~AY~ 123 (384) T PRK12418 79 EIALFREDMAALRVLPPRD-----YV-----GAVESIPEVVELVE-------KLLASGAAYV 123 (384) T ss_pred HHHHHHHHHHHCCCCCCCE-----EC-----CCCCCHHHHHHHHH-------HHHHCCCEEE T ss_conf 9999999999829999950-----03-----21137999999999-------9996795577 No 240 >pfam01045 consensus Probab=24.70 E-value=25 Score=15.66 Aligned_cols=15 Identities=13% Similarity=0.310 Sum_probs=5.9 Q ss_pred EECHHHHHHHHHHHH Q ss_conf 308999898898753 Q gi|254780664|r 157 LTNPQDYPLFLAEMD 171 (536) Q Consensus 157 i~dp~dY~~~~~el~ 171 (536) +-|..|+++-++.|+ T Consensus 112 lndstqwddwv~kme 126 (411) T pfam01045 112 LNDSTQWDDRVDKME 126 (411) T ss_pred CCCCCHHHHHHHHHH T ss_conf 465330888999998 No 241 >COG1803 MgsA Methylglyoxal synthase [Carbohydrate transport and metabolism] Probab=24.54 E-value=48 Score=13.64 Aligned_cols=116 Identities=22% Similarity=0.314 Sum_probs=63.3 Q ss_pred CEEEEEEEEECC-CCHHHHHHHHHHC--CCEEEEEHHHHHHHHHC-CCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHH Q ss_conf 136989987545-6869999999987--99999942768999988-9951794653488165198553237155535533 Q gi|254780664|r 15 AVKTALISVHNK-TGVVEFASRLLSR--GIKIISTGGTCQLLEEE-GIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILS 90 (536) Q Consensus 15 ~ikrALiSV~dK-tgl~~la~~L~~~--g~~iisTgGTa~~l~~~-gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~ 90 (536) ..+-|||.--.| ..++.|++.-.+. .++|++||-|...++++ |++|..+-+ =| ||| T Consensus 4 ~k~IALIAHD~kK~~lv~f~~~~~~~L~~h~L~ATGTTG~~i~~atgL~v~~~~S---GP--mGG--------------- 63 (142) T COG1803 4 RKHIALIAHDHKKDDLVNFVQAHKELLSRHDLYATGTTGGLIQEATGLNVHRLKS---GP--MGG--------------- 63 (142) T ss_pred CCEEEEEECCHHHHHHHHHHHHHHHHHHHCEEEEECCCHHHHHHHHCCCEEEEEC---CC--CCC--------------- T ss_conf 5158888431108899999999899753156898447127889875871577404---78--884--------------- Q ss_pred CCCCHHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHH Q ss_conf 08898999999977998414999948888997317653112320023343469998740156313530899989889875 Q gi|254780664|r 91 IRDNPAHMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEM 170 (536) Q Consensus 91 ~r~~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el 170 (536) ++.+-.+ .-=..||+++-=--|... . +++ -|+ -+|+|.|-- .++.+-++-+--+-++.-+ T Consensus 64 ----DQQiGa~--Iaeg~id~lIFf~DPLta---q--PHd-----pDV--kAL~Rl~~V--~nIP~A~N~aTAe~li~~~ 123 (142) T COG1803 64 ----DQQIGAL--IAEGKIDVLIFFWDPLTA---Q--PHD-----PDV--KALLRLATV--YNIPVATNRATAEFLIKSL 123 (142) T ss_pred ----CHHHHHH--HHCCCCEEEEEEECCCCC---C--CCC-----CCH--HHHHHHHHH--HCCCCHHHHHHHHHHHHCC T ss_conf ----0788788--762862299997067878---9--998-----579--999999886--1561131175799998352 No 242 >PRK13120 consensus Probab=24.50 E-value=48 Score=13.63 Aligned_cols=142 Identities=20% Similarity=0.227 Sum_probs=60.4 Q ss_pred HHHHHHHHHHHHEECCCCHHH-HHHHH--CCCCCEEEECCCCCCCCCCC-CEECCCCHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 787886543111126879899-98851--24553156316766777666-402032102231012334531000124677 Q gi|254780664|r 333 AKEVIKVFTEAIIAPTLSEEA-ADVLA--KKPSMRFLKTSSLLDFHGEE-IVLKTVSGGILVQTRDNVVDNKELTVVTKR 408 (536) Q Consensus 333 A~~I~~~F~EvIiAP~f~~eA-leiL~--~KKnlRil~~~~~~~~~~~~-~~~rsi~GG~LvQ~~D~~~~~~~~~vVT~~ 408 (536) ++...+.=++.+|-||--.|- -++.. +|.++-++.+-..... ... ..+-....||+..-. ..=||.. T Consensus 119 ~~~~~~aGvdGlIIpDLP~EE~~~~~~~~~~~gi~~I~LiaPtT~-~eRi~~I~~~s~GFvY~VS--------~~GVTG~ 189 (285) T PRK13120 119 AQAAQAAGVDGVLVVDYPPEEVDEFAAMLAEAGVAPIFLLAPTST-EARIEAIGRVARGYVYYVS--------LKGVTGA 189 (285) T ss_pred HHHHHHCCCCEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCC-HHHHHHHHHHCCCCEEEEE--------CCCCCCC T ss_conf 999998398779647999799999999999669965899579998-9999999950898189986--------5654688 Q ss_pred -CCCHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf -7868898889999998600566779999689389972968888999999999987642001222234870999736686 Q gi|254780664|r 409 -SPTDQELRDMKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAIKAHNISAQADVKSMTNGSVIASEAFYP 487 (536) Q Consensus 409 -~pt~~e~~dL~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~KA~~~~~~~~~~~~~~g~vlaSDAFFP 487 (536) ....+++.+++ -++=+++ + .-|.+|=|=-++-++-+++- .+.|+|..|- T Consensus 190 ~~~~~~~l~~~i--~~ik~~t-~---------~Pv~vGFGIs~~e~v~~~~~--------------~ADGvIVGSA---- 239 (285) T PRK13120 190 GSLDTDDVARKL--ALIRRHV-H---------IPVGVGFGIRDAASAQRIAA--------------HADAVVIGSK---- 239 (285) T ss_pred CCCCHHHHHHHH--HHHHHCC-C---------CCEEEEECCCCHHHHHHHHC--------------CCCEEEECHH---- T ss_conf 755668899999--9999726-9---------97599962598999999970--------------2999998789---- Q ss_pred CCCHHHHHHHCCCCEEEECCCCCCHHHHHHHHH Q ss_conf 855589999719959993898779889999998 Q gi|254780664|r 488 FPDGIVEAIKAGVTAVIQPGGSVRDSEAITVAD 520 (536) Q Consensus 488 F~D~ie~aa~~Gi~aIiqPGGSirD~evI~aan 520 (536) =|+...++|= |.+.+.+.+.-..++- T Consensus 240 ---iVk~Iee~~~----~~~~~~~~~~~~~~~~ 265 (285) T PRK13120 240 ---LIETMEQAGA----QAGADQKNEAAIAAAQ 265 (285) T ss_pred ---HHHHHHHCCC----CCCCCCCHHHHHHHHH T ss_conf ---9999997075----4565556889999999 No 243 >PRK13374 purine nucleoside phosphorylase; Provisional Probab=24.43 E-value=48 Score=13.62 Aligned_cols=26 Identities=8% Similarity=0.029 Sum_probs=15.3 Q ss_pred HHHHHHHHHHCCCCCEEEEEECCEEE Q ss_conf 89999998600566779999689389 Q gi|254780664|r 417 DMKFAFKVVKHVKSNAVVYAKDGRTV 442 (536) Q Consensus 417 dL~FA~kv~K~vkSNAIv~ak~~~ti 442 (536) ..+|+.--.+..|--+|..+-+.-+- T Consensus 184 aaL~~la~~~g~kA~svl~V~dn~~~ 209 (233) T PRK13374 184 AGLYGLAAYLGAEALAILTVSDHIIT 209 (233) T ss_pred HHHHHHHHHCCCCEEEEEEEEEECCC T ss_conf 99999999849857899999600476 No 244 >pfam01244 Peptidase_M19 Membrane dipeptidase (Peptidase family M19). Probab=24.27 E-value=48 Score=13.60 Aligned_cols=187 Identities=17% Similarity=0.215 Sum_probs=89.3 Q ss_pred HHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHCCCC Q ss_conf 99999997799841499994888899731765311232002334346999874015631353089998988987530234 Q gi|254780664|r 96 AHMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMDVNNG 175 (536) Q Consensus 96 ~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~~~~g 175 (536) .|...|++ ..+++++..+|-=+ ........+.+.|+||. +-|-.++|-..+..+.+.+|.....++=+.. - T Consensus 35 vdlprlr~---Ggv~~q~fai~v~~-~~~~~~~~~~al~~id~----~~~~~~~~~~~~~~~~s~~di~~a~~~gk~a-~ 105 (316) T pfam01244 35 TDLERLRA---GGVGAQFFAIFVPC-DAQYEDAVQATLEQIDI----FRRLVRKNPEQLRLVRTADDIERAKKEGKIA-I 105 (316) T ss_pred CCHHHHHH---CCCCEEEEEEEECC-CCCCHHHHHHHHHHHHH----HHHHHHHCCCCEEEECCHHHHHHHHHCCCEE-E T ss_conf 17999986---79987999999578-77750899999999999----9999996989689969999999999779989-9 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCCCHHHCC Q ss_conf 56667889999999998742289999875331012113443210123333436788100650576047766821012105 Q gi|254780664|r 176 KIPYNFRKKMARQAFSRTASYDTAICRWLANAESENFPDYLNITAVKKQEMRYGENPHQKAALYSTPEKKSGIAHAVLVQ 255 (536) Q Consensus 176 ~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~~~~~~~~p~~~~~~~~~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~ 255 (536) -++.|--.-+...--..-..|+. + -..+.+++.....+=.|..+ .....|+..+ T Consensus 106 ~l~~Eg~~~l~~dl~~l~~~y~~----------G---vR~~~Lt~n~~N~~a~g~~~---------~~~~~GLt~~---- 159 (316) T pfam01244 106 LLGIEGAHALGDDLALLRTFYAL----------G---VRYLGLTWNCNNLFADGCLE---------RKRDGGLTRF---- 159 (316) T ss_pred EEEEECHHHHHCCHHHHHHHHHC----------C---CEEEEEECCCCCCCCCCCCC---------CCCCCCCCHH---- T ss_conf 99520767764509999999961----------9---37998523788854036787---------9889998877---- Q ss_pred CCC----C-CCCCCCCCC-----HHHHHHHHCCCCCCCEEEEECCCCCCCCCC-CCHHHHHHHHHHHHCCCCCCCEEEEE Q ss_conf 898----8-831222120-----011677642455672799961568656545-20168999999853321320045554 Q gi|254780664|r 256 GKP----L-SYNNINDLD-----AAFELVSEFRSQDCAACVIVKHMNPCGVAT-ADTLVEAYRRALSCDPISAFGGIIAF 324 (536) Q Consensus 256 GKe----L-SYNNllD~d-----aA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~-~~~~~~Ay~~A~~~DP~SAFGGIIa~ 324 (536) ||+ | ...=++|+. +.|..+ ++.+ .| +|.-|+|+-.+.- ..|+.+.-.+|++. -||+|.+ T Consensus 160 G~~~v~~mn~lGmivDlSH~s~~~~~d~~-~~s~--~P--viaSHSn~ral~~h~RNl~De~i~aia~-----~gGviGi 229 (316) T pfam01244 160 GKEVVREMNRLGILIDLSHLSERTFWDVL-ELSK--AP--VVASHSNARALCDHPRNLTDEQLKAIAE-----SGGVIGV 229 (316) T ss_pred HHHHHHHHHHCCCEEECCCCCHHHHHHHH-HHCC--CC--EEEECCCHHHCCCCCCCCCHHHHHHHHH-----CCCEEEE T ss_conf 99999999976967764879889999999-9709--99--8997878554058888888999999986-----4987999 Q ss_pred CCC Q ss_conf 275 Q gi|254780664|r 325 NRE 327 (536) Q Consensus 325 N~~ 327 (536) |.- T Consensus 230 ~~~ 232 (316) T pfam01244 230 NFY 232 (316) T ss_pred EEC T ss_conf 600 No 245 >PRK03675 consensus Probab=23.94 E-value=40 Score=14.16 Aligned_cols=24 Identities=25% Similarity=0.386 Sum_probs=11.7 Q ss_pred EEEECCCCCCCCHHHHHHHCCCCEEE Q ss_conf 99973668685558999971995999 Q gi|254780664|r 479 VIASEAFYPFPDGIVEAIKAGVTAVI 504 (536) Q Consensus 479 vlaSDAFFPF~D~ie~aa~~Gi~aIi 504 (536) ++++-+-|. .+.++.+.++ |.-|+ T Consensus 234 ~~aTHglfs-~~A~~~l~~~-i~~iv 257 (279) T PRK03675 234 VVATHGVFA-EGAIERVSKA-VDELA 257 (279) T ss_pred EEEECCCCC-CHHHHHHHHH-CCEEE T ss_conf 999781068-0699999974-88999 No 246 >PRK06500 short chain dehydrogenase; Provisional Probab=23.72 E-value=50 Score=13.53 Aligned_cols=44 Identities=27% Similarity=0.283 Sum_probs=29.8 Q ss_pred CCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHC Q ss_conf 76136989987545686999999998799999942768999988 Q gi|254780664|r 13 EIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEE 56 (536) Q Consensus 13 ~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~ 56 (536) +++=|+|||.=.-+-==...|+.|.+.|.+++-++-..+.|.+. T Consensus 3 rl~gK~~lITGas~GIG~aiA~~la~~Ga~V~i~~r~~~~l~~~ 46 (249) T PRK06500 3 RLQGKTALITGGTSGIGLETARQFAAEGARVAITGRDAATLEAA 46 (249) T ss_pred CCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCHHHHHHH T ss_conf 98998899937687899999999998799999996998999999 No 247 >PRK13973 thymidylate kinase; Provisional Probab=23.59 E-value=50 Score=13.51 Aligned_cols=100 Identities=15% Similarity=0.159 Sum_probs=52.7 Q ss_pred HHHHCCCCEEEHHHHCCCC------H-HCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEECCCHHHHHC Q ss_conf 9998899517946534881------6-51985532371555355330889899999997799841499994888899731 Q gi|254780664|r 52 LLEEEGIPVTSVFDITKFP------E-IMGGRVKTLHPKIYGGILSIRDNPAHMKFMQDHELESIDLVVVNLYPFEESFC 124 (536) Q Consensus 52 ~l~~~gi~v~~Vs~~Tgfp------E-il~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~ 124 (536) .|++.|++|.-..+-+|.| + +|+|...-++|....= |+--++-+|..++-+-....=..|||+=|=+...+- T Consensus 26 ~L~~~g~~v~~trePg~t~~~e~ir~~ll~~~~~~~~~~~e~l-LfaA~R~eh~~~~I~paL~~g~~VI~DRy~~Ss~AY 104 (216) T PRK13973 26 RLRAAGYDVLVTREPGGSPGAEAIRHVLLSGAAELYGPAMEAL-LFAAARDDHVEEVIRPALARGKIVLCDRFIDSTRAY 104 (216) T ss_pred HHHHCCCCEEEEECCCCCCCHHHHHHHHHCCCCCCCCHHHHHH-HHHHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHH T ss_conf 9997799579940989984089999998466657899899999-999999999999999999779989867862579999 Q ss_pred CCC----CH--HHHHHCCCCCCHHHHHHHHCCCCCCEEEECHH Q ss_conf 765----31--12320023343469998740156313530899 Q gi|254780664|r 125 RED----DY--YTMVENIDIGGPSMIRAAAKNHDYVTILTNPQ 161 (536) Q Consensus 125 ~~~----~~--~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~ 161 (536) ++. +. -+.++.+-+|+| .++++++-|-. T Consensus 105 Qg~~~~~~~~~l~~l~~~~~~~~---------~PDlti~LDv~ 138 (216) T PRK13973 105 QGVTGNVDPALLAALERVAINGV---------MPDLTLILDIP 138 (216) T ss_pred HHHCCCCCHHHHHHHHHHHHCCC---------CCCEEEEECCC T ss_conf 77427889899999999996799---------99989998188 No 248 >cd03375 TPP_OGFOR Thiamine pyrophosphate (TPP family), 2-oxoglutarate ferredoxin oxidoreductase (OGFOR) subfamily, TPP-binding module; OGFOR catalyzes the oxidative decarboxylation of 2-oxo-acids, with ferredoxin acting as an electron acceptor. In the TCA cycle, OGFOR catalyzes the oxidative decarboxylation of 2-oxoglutarate to succinyl-CoA. In the reductive tricarboxylic acid cycle found in the anaerobic autotroph Hydrogenobacter thermophilus, OGFOR catalyzes the reductive carboxylation of succinyl-CoA to produce 2-oxoglutarate. Thauera aromatica OGFOR has been shown to provide reduced ferredoxin to benzoyl-CoA reductase, a key enzyme in the anaerobic metabolism of aromatic compounds. OGFOR is dependent on TPP and a divalent metal cation for activity. Probab=23.52 E-value=40 Score=14.21 Aligned_cols=30 Identities=43% Similarity=0.570 Sum_probs=23.8 Q ss_pred CCCCCHHHHHHHHCCCCCCEEEECHHHHHH Q ss_conf 233434699987401563135308999898 Q gi|254780664|r 136 IDIGGPSMIRAAAKNHDYVTILTNPQDYPL 165 (536) Q Consensus 136 IDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~ 165 (536) -|||++.|+.|+-.|..-+.++.|-.-|.. T Consensus 82 ~~iG~~~l~~A~~rn~~i~~iv~DN~~ygm 111 (193) T cd03375 82 AAIGGNHFIHAARRNIDITVIVHNNQIYGL 111 (193) T ss_pred HHCCHHHHHHHHHHCCCEEEEEECCCCEEC T ss_conf 645618899999809974999985840442 No 249 >cd00587 HCP_like The HCP family of iron-sulfur proteins includes hybrid cluster protein (HCP), acetyl-CoA synthase (ACS), and carbon monoxide dehydrogenase (CODH), all of which contain [Fe4-S4] metal clusters at their active sites. These proteins have a conserved alpha-beta rossman fold domain. HCP, formerly known as prismane, is thought to play a role in nitrogen metabolism but its specific function is unknown. Acetyl-CoA synthase (ACS), is found in acetogenic and methanogenic organisms and is responsible for the synthesis and breakdown of acetyl-CoA. ACS forms a heterotetramer with carbon monoxide dehydrogenase (CODH) consisting of two ACS and two CODH subunits. CODH reduces carbon dioxide to carbon monoxide and ACS then synthesizes acetyl-CoA from carbon monoxide and CoA. Probab=23.50 E-value=50 Score=13.50 Aligned_cols=39 Identities=18% Similarity=0.322 Sum_probs=20.3 Q ss_pred CEEEEECCCCCHHHHHHHHHHHH--------HHHEECC-CCHHHHHHHHC Q ss_conf 04555427542145787886543--------1111268-79899988512 Q gi|254780664|r 319 GGIIAFNREVDQEVAKEVIKVFT--------EAIIAPT-LSEEAADVLAK 359 (536) Q Consensus 319 GGIIa~N~~vd~~~A~~I~~~F~--------EvIiAP~-f~~eAleiL~~ 359 (536) |..+-..|-++..+ .+.+.|= =+-.||. +++.|..|..- T Consensus 159 GSCVDnsri~~la~--~lA~~lG~vdi~dLP~~gsApe~~seKAvAIgt~ 206 (258) T cd00587 159 GNCVDNSHAANLAL--KLANMFGGYDRSDLPAVASAPGAYSQKAAAIATG 206 (258) T ss_pred CCCHHHHHHHHHHH--HHHHHHCCCCHHCCCCEEECHHHHHHHHHHHHHH T ss_conf 53142789999999--9999846676200751031715565899999999 No 250 >KOG2474 consensus Probab=23.47 E-value=50 Score=13.50 Aligned_cols=55 Identities=18% Similarity=0.265 Sum_probs=31.7 Q ss_pred CCCCC-CCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCC-CCC-------CHHHHHHHHHHHHCCCCCCC Q ss_conf 88312-221200116776424556727999615686565-452-------01689999998533213200 Q gi|254780664|r 259 LSYNN-INDLDAAFELVSEFRSQDCAACVIVKHMNPCGV-ATA-------DTLVEAYRRALSCDPISAFG 319 (536) Q Consensus 259 LSYNN-llD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGv-A~~-------~~~~~Ay~~A~~~DP~SAFG 319 (536) +++|+ -.++...|+-..+. .-.+.+|+.|-|+ +++ .+.-|.|+-+|+.---+-|| T Consensus 230 P~~~~~~~~a~~sw~~i~l~------~~~~~~h~~P~g~~~I~va~~~alst~pE~Fe~~~ns~~a~Gf~ 293 (406) T KOG2474 230 PSGNDVEEAAASSWRRIHLL------TLSITVHNHPEGLIGIQVALGAALSTFPEGFESFYNSSLAIGFG 293 (406) T ss_pred CCCCCCCCCHHHHHHHHHHH------CCHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCHH T ss_conf 87776224313335777641------50220025854035778887766502718889885230000124 No 251 >PRK02842 light-independent protochlorophyllide reductase subunit N; Provisional Probab=23.46 E-value=50 Score=13.49 Aligned_cols=30 Identities=20% Similarity=0.245 Sum_probs=15.7 Q ss_pred CCHHHHHHHHHHCCCEEEEEHHHHHHHHHC Q ss_conf 686999999998799999942768999988 Q gi|254780664|r 27 TGVVEFASRLLSRGIKIISTGGTCQLLEEE 56 (536) Q Consensus 27 tgl~~la~~L~~~g~~iisTgGTa~~l~~~ 56 (536) |++.=|-+++...=+=++.|---+.+|+.+ T Consensus 26 t~i~WLhrki~DaFFLVvGsrTCahllQsA 55 (425) T PRK02842 26 TSVVWLHRKIKDSFFLVVGTRTCAHLLQSA 55 (425) T ss_pred HHHHHHHHHCCCCEEEEEECCHHHHHHHHC T ss_conf 347989986476549998450278787406 No 252 >PRK09607 rps11p 30S ribosomal protein S11P; Reviewed Probab=23.44 E-value=50 Score=13.49 Aligned_cols=21 Identities=24% Similarity=0.499 Sum_probs=9.9 Q ss_pred HHHHHHCCCCEEEHHHHCCCC Q ss_conf 899998899517946534881 Q gi|254780664|r 50 CQLLEEEGIPVTSVFDITKFP 70 (536) Q Consensus 50 a~~l~~~gi~v~~Vs~~Tgfp 70 (536) -+.|...||.++.|.|+|-.| T Consensus 94 ir~l~~~Gl~I~~I~DvTpiP 114 (129) T PRK09607 94 IRALARAGLRIGRIEDVTPIP 114 (129) T ss_pred HHHHHHCCCEEEEEEEECCCC T ss_conf 999986897898988709899 No 253 >PRK13583 hisG ATP phosphoribosyltransferase catalytic subunit; Provisional Probab=23.44 E-value=22 Score=16.06 Aligned_cols=128 Identities=13% Similarity=0.182 Sum_probs=67.3 Q ss_pred HHHHHHHHHCCCCEEEHHH---H----CCCCHHCCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEECCCH Q ss_conf 2768999988995179465---3----48816519855323715553553308898999999977998414999948888 Q gi|254780664|r 47 GGTCQLLEEEGIPVTSVFD---I----TKFPEIMGGRVKTLHPKIYGGILSIRDNPAHMKFMQDHELESIDLVVVNLYPF 119 (536) Q Consensus 47 gGTa~~l~~~gi~v~~Vs~---~----TgfpEil~GRVKTLHP~I~ggIL~~r~~~~~~~~l~~~~i~~IDlVvvNLYPF 119 (536) ..|.+.|+++|+++....+ + .++|++ ++.-+ ++.|+.+.-++|.-.+-+|=-++ . T Consensus 15 ~~~~~ll~~~Gl~~~~~~~~R~~~~~~~~~~~i---~i~~~-------------r~~DIp~yV~~G~~DlGItG~D~--l 76 (226) T PRK13583 15 EKTFAWFEKAGYTLVRPGSDREYRGRVEGLDDV---ELLFL-------------SASEIPRELGAGRVDLGVTGEDL--V 76 (226) T ss_pred HHHHHHHHHCCCCCCCCCCCHHEEEECCCCCCE---EEEEE-------------CHHHHHHHHHCCCCCEEEEEHHH--H T ss_conf 999999998799853488862205540599985---89997-------------55568999976986566502255--5 Q ss_pred HHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99731765311232002334346999874015631353089998988987530234566678899999999987422899 Q gi|254780664|r 120 EESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMDVNNGKIPYNFRKKMARQAFSRTASYDTA 199 (536) Q Consensus 120 ~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~~~~g~~~~~~R~~lA~kAF~~ta~YD~~ 199 (536) .|.-..-...-+.+....+|...|+-|+-++|.+|.-+.|.. ++..+++..+|. |.+.| +.|-.. T Consensus 77 ~E~~~~~~~~v~~~~~LgfG~c~LvvAvP~~w~dv~s~~dl~---~~~~~~~~~~~~-----~lrIA-------Tkyp~l 141 (226) T PRK13583 77 REKLADWDKRVEIVARLGFGHADLVVAVPEIWIDVDTMADLD---DVAADFRARHGR-----RLRIA-------TKYWRL 141 (226) T ss_pred HHCCCCCCCCEEEEECCCCCCEEEEEEECCCCCCCCCHHHHH---HHHHHHHHCCCC-----CEEEE-------ECCCHH T ss_conf 422446145277632178561669999444013554400055---664554421454-----14764-------047326 Q ss_pred HHHHHHHH Q ss_conf 99875331 Q gi|254780664|r 200 ICRWLANA 207 (536) Q Consensus 200 Ia~~l~~~ 207 (536) -.+||.+. T Consensus 142 t~~~f~~~ 149 (226) T PRK13583 142 TQQFLSQK 149 (226) T ss_pred HHHHHHHC T ss_conf 57889866 No 254 >pfam01026 TatD_DNase TatD related DNase. This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme. Probab=23.40 E-value=50 Score=13.49 Aligned_cols=15 Identities=33% Similarity=0.425 Sum_probs=7.6 Q ss_pred HHHHHHCCCCCEEEE Q ss_conf 998851245531563 Q gi|254780664|r 353 AADVLAKKPSMRFLK 367 (536) Q Consensus 353 AleiL~~KKnlRil~ 367 (536) ..++++.-+-=|+|. T Consensus 188 ~~~~~~~iPl~rlll 202 (255) T pfam01026 188 LREVVAEIPLDRLLV 202 (255) T ss_pred HHHHHHHCCCCEEEE T ss_conf 999997499625998 No 255 >pfam09894 DUF2121 Uncharacterized protein conserved in archaea (DUF2121). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=23.37 E-value=41 Score=14.11 Aligned_cols=40 Identities=20% Similarity=0.151 Sum_probs=32.6 Q ss_pred HHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCC Q ss_conf 9999999987999999427689999889951794653488 Q gi|254780664|r 30 VEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKF 69 (536) Q Consensus 30 ~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~Tgf 69 (536) +||.+...++|++|.=|...-|.-+-.|+-|-+|+.++|+ T Consensus 47 eEL~kkA~E~~V~i~i~D~~~KV~~~~~vlvGEV~s~~g~ 86 (194) T pfam09894 47 EELKKKASEFGVKIYITDDREKVRKVGDVLVGEVTSIEGK 86 (194) T ss_pred HHHHHHHHHCCCEEEEECCHHHEEEECCEEEEEEEEECCC T ss_conf 9999889875947999627245378589999988885066 No 256 >cd02018 TPP_PFOR Thiamine pyrophosphate (TPP family), Pyruvate ferredoxin/flavodoxin oxidoreductase (PFOR) subfamily, TPP-binding module; PFOR catalyzes the oxidative decarboxylation of pyruvate to form acetyl-CoA, a crucial step in many metabolic pathways. Archaea, anaerobic bacteria and eukaryotes that lack mitochondria (and therefore pyruvate dehydrogenase) use PFOR to oxidatively decarboxylate pyruvate, with ferredoxin or flavodoxin as the electron acceptor. PFORs can be homodimeric, heterodimeric, or heterotetrameric, depending on the organism. These enzymes are dependent on TPP and a divalent metal cation as cofactors. Probab=23.31 E-value=41 Score=14.16 Aligned_cols=29 Identities=21% Similarity=0.328 Sum_probs=23.5 Q ss_pred CCCCCHHHHHHHHCCCCCCEEEECHHHHH Q ss_conf 23343469998740156313530899989 Q gi|254780664|r 136 IDIGGPSMIRAAAKNHDYVTILTNPQDYP 164 (536) Q Consensus 136 IDIGGpsmiRAAAKN~~~V~Vi~dp~dY~ 164 (536) -|||+-.++.|+..|..-..++.|-.-|. T Consensus 100 ~~IG~~~l~ha~~Rn~~i~~I~~dN~~Yg 128 (237) T cd02018 100 YDIGFGALSHSLFRGEDITVIVLDNEVYS 128 (237) T ss_pred HHCCHHHHHHHHHCCCCEEEEEECCCCCC T ss_conf 65007889999982898399998881344 No 257 >KOG1712 consensus Probab=23.27 E-value=28 Score=15.34 Aligned_cols=16 Identities=13% Similarity=0.465 Sum_probs=7.7 Q ss_pred EEEECHHHHHHHHHHH Q ss_conf 3530899989889875 Q gi|254780664|r 155 TILTNPQDYPLFLAEM 170 (536) Q Consensus 155 ~Vi~dp~dY~~~~~el 170 (536) +++.||.-+..+++-+ T Consensus 33 pll~dP~af~~lidlf 48 (183) T KOG1712 33 PLLLDPKAFKKLIDLF 48 (183) T ss_pred HHHCCHHHHHHHHHHH T ss_conf 6643979999999999 No 258 >PRK09628 oorB 2-oxoglutarate-acceptor oxidoreductase subunit OorB; Reviewed Probab=23.27 E-value=50 Score=13.47 Aligned_cols=30 Identities=33% Similarity=0.445 Sum_probs=23.2 Q ss_pred CCCCCCHHHHHHHHCCCCCCEEEECHHHHH Q ss_conf 023343469998740156313530899989 Q gi|254780664|r 135 NIDIGGPSMIRAAAKNHDYVTILTNPQDYP 164 (536) Q Consensus 135 nIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~ 164 (536) ..+|||--+|-||-+|..-+.++.|-.-|. T Consensus 98 ~~aIGgnH~iHAaRRN~dit~Iv~NN~iYG 127 (281) T PRK09628 98 GLAIGGNHTIHGCRRNIDLNHILINNFIYG 127 (281) T ss_pred HHHCHHHHHHHHHHHCCCCEEEEECCEEEE T ss_conf 420308899999980899289998777230 No 259 >TIGR02329 propionate_PrpR propionate catabolism operon regulatory protein PrpR; InterPro: IPR012704 At least five distinct pathways exist for the catabolism of propionate by way of propionyl-CoA. Members of this family represent the transcriptional regulatory protein PrpR which is, in most cases, divergently transcribed from the operon that encodes the genes involved in the methylcitric acid cycle of propionate catabolism. This protein is required for the expression of the proteins involved in this pathway . 2-methylcitric acid, an intermediate in this pathway, has been proposed to be a co-activator of PrpR .; GO: 0000156 two-component response regulator activity, 0003677 DNA binding, 0005524 ATP binding, 0000160 two-component signal transduction system (phosphorelay), 0019629 propionate catabolic process 2-methylcitrate cycle, 0005737 cytoplasm. Probab=23.14 E-value=51 Score=13.45 Aligned_cols=60 Identities=20% Similarity=0.213 Sum_probs=31.0 Q ss_pred HHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCCCC---------------CCHHHHHHHHHHHHCCC Q ss_conf 121058988831222120011677642455672799961568656545---------------20168999999853321 Q gi|254780664|r 251 AVLVQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGVAT---------------ADTLVEAYRRALSCDPI 315 (536) Q Consensus 251 ~~ql~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~---------------~~~~~~Ay~~A~~~DP~ 315 (536) |.+--+=.+.+-.|.-.+=|-.||.+++..+. -||| =.|..+ ++++..||..|++-=.+ T Consensus 115 Fq~~f~l~~~~r~y~t~eDAr~~v~~Lra~G~--~~vV----G~GL~tDlA~~AgL~gvflYSadsvR~Af~~Al~~~~~ 188 (658) T TIGR02329 115 FQKAFNLDIEQRSYVTEEDARSCVNDLRASGI--EVVV----GAGLITDLAEQAGLVGVFLYSADSVRQAFDDALDLARA 188 (658) T ss_pred HHHHHCCCHHHHHHCCHHHHHHHHHHHHHCCC--EEEE----CCCHHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHHC T ss_conf 99986341688750467789999999976897--5988----76568888876796478763378999999999998632 Q ss_pred C Q ss_conf 3 Q gi|254780664|r 316 S 316 (536) Q Consensus 316 S 316 (536) + T Consensus 189 ~ 189 (658) T TIGR02329 189 T 189 (658) T ss_pred C T ss_conf 0 No 260 >PRK10424 ilvG operon leader peptide; Provisional Probab=23.12 E-value=35 Score=14.64 Aligned_cols=14 Identities=43% Similarity=0.933 Sum_probs=10.2 Q ss_pred EEEECCCCCCCCCCCC Q ss_conf 9996156865654520 Q gi|254780664|r 285 CVIVKHMNPCGVATAD 300 (536) Q Consensus 285 ~vIvKH~NPCGvA~~~ 300 (536) ++|| .-|||+|.|. T Consensus 16 vvii--ippcgaalgr 29 (32) T PRK10424 16 VVII--IPPCGAALGR 29 (32) T ss_pred EEEE--ECCCHHHHCC T ss_conf 9999--6884576347 No 261 >PRK07394 hypothetical protein; Provisional Probab=23.07 E-value=51 Score=13.44 Aligned_cols=64 Identities=16% Similarity=0.170 Sum_probs=36.2 Q ss_pred EEEEECCCCC-----CCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCC-CC-HHHHHHHHHHHHHHHEECCCCHHHHHH Q ss_conf 7999615686-----5654520168999999853321320045554275-42-145787886543111126879899988 Q gi|254780664|r 284 ACVIVKHMNP-----CGVATADTLVEAYRRALSCDPISAFGGIIAFNRE-VD-QEVAKEVIKVFTEAIIAPTLSEEAADV 356 (536) Q Consensus 284 a~vIvKH~NP-----CGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~-vd-~~~A~~I~~~F~EvIiAP~f~~eAlei 356 (536) .+-|+||.|. ||+.++ +++++. | +|-. ++ +++++.+.+.=+=-+-||.|.|.-..+ T Consensus 112 G~~VaKHGnrsvssk~G~~sa-DvLe~l------------G----v~i~~l~~e~~~~~l~~~g~~FlfAp~~HPamk~v 174 (342) T PRK07394 112 GQPVVLHGGDRMPTKYGVPLI-ELWQGL------------G----VDLTGLSLEQVQEGFEQTGLAFIYQPDHFPLAESL 174 (342) T ss_pred CCEEEEECCCCCCCCCCCCHH-HHHHHC------------C----CCCCCCCHHHHHHHHHHCCEEEEECHHHCHHHHHH T ss_conf 995998889988765674279-999975------------9----98677999999999997094798760118899998 Q ss_pred HHCCCCCE Q ss_conf 51245531 Q gi|254780664|r 357 LAKKPSMR 364 (536) Q Consensus 357 L~~KKnlR 364 (536) ..-+|.|. T Consensus 175 ~~vRk~Lg 182 (342) T PRK07394 175 IPYRDEIG 182 (342) T ss_pred HHHHHHCC T ss_conf 99999819 No 262 >COG2046 MET3 ATP sulfurylase (sulfate adenylyltransferase) [Inorganic ion transport and metabolism] Probab=22.91 E-value=51 Score=13.42 Aligned_cols=70 Identities=27% Similarity=0.269 Sum_probs=44.2 Q ss_pred HHHHHHHHCCCCHHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHH Q ss_conf 55535533088989999999779984149999488889973176531123200233434699987401563135308999 Q gi|254780664|r 83 KIYGGILSIRDNPAHMKFMQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQD 162 (536) Q Consensus 83 ~I~ggIL~~r~~~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~d 162 (536) .=|||=|-+|..++-.......+.+.|.|= .-.| --||.|-+|+=| -++=.-.-+| T Consensus 5 ~phgg~Lv~r~~~~~~~~~~~~~~~~ield---~~~~-----------~dl~lIa~G~fS----------Pl~GFMne~d 60 (397) T COG2046 5 PPHGGKLVRRVAEERDAMKSIRKLPRIELD---QNSF-----------GDLELIAYGAFS----------PLTGFMNEKD 60 (397) T ss_pred CCCCCHHHHHHCCCCCHHHHHCCCCEEEEC---HHHH-----------HHHHHHHCCCCC----------CCCCCCCHHH T ss_conf 887521655530233128775138618986---3367-----------789998703777----------4402254888 Q ss_pred HHHHHHHHHCCCCC Q ss_conf 89889875302345 Q gi|254780664|r 163 YPLFLAEMDVNNGK 176 (536) Q Consensus 163 Y~~~~~el~~~~g~ 176 (536) |.+|+++++-.+|. T Consensus 61 y~sVv~~mRL~~G~ 74 (397) T COG2046 61 YESVVESMRLANGT 74 (397) T ss_pred HHHHHHHCCCCCCC T ss_conf 99998735156897 No 263 >PRK00912 ribonuclease P protein component 3; Provisional Probab=22.85 E-value=51 Score=13.41 Aligned_cols=19 Identities=26% Similarity=0.230 Sum_probs=10.4 Q ss_pred CHHHHHHHHHHCCCEEEEE Q ss_conf 8699999999879999994 Q gi|254780664|r 28 GVVEFASRLLSRGIKIIST 46 (536) Q Consensus 28 gl~~la~~L~~~g~~iisT 46 (536) -+.+++.....+||+-|.- T Consensus 20 t~~ela~~A~~lGy~~iai 38 (237) T PRK00912 20 TVLELAEEASHLGFSGICL 38 (237) T ss_pred HHHHHHHHHHHCCCCEEEE T ss_conf 7999999999869868998 No 264 >TIGR01220 Pmev_kin_Gr_pos phosphomevalonate kinase; InterPro: IPR005917 Protein kinases are a group of enzymes that possess a catalytic subunit which transfers the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. The enzymes fall into two broad classes, characterised with respect to substrate specificity: serine/threonine specific and tyrosine specific . Protein kinase function has been evolutionarily conserved from Escherichia coli to Homo sapiens. Protein kinases play a role in a mulititude of cellular processes, including division, proliferation, apoptosis, and differentiation . Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved , leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases . Phosphomevalonate kinase (2.7.4.2 from EC)catalyzes the phosphorylation of 5-phosphomevalonate into 5-diphosphomevalonate, an essential step in isoprenoid biosynthesis via the mevalonate pathway. This pathway starts with three molecules of acetyl-CoA, which, in a series of six different enzyme reactions, are converted to isopentanyl pyrophosphate (IPP), the basic C5 isoprene unit and a common intermediate for a number of pathways including isoprenoid and cholesterol biosynthesis. There are two unrelated types of PMK. The first type belongs to the GHMP kinase and includes a eukaryotic group which is typified by Saccharomyces cerevisiae ERG8 and a bacterial group typified by by Streptococcus pneumoniae PMK (MvaK2) . The second type includes animal PMKs, typified by human PMK . PMKs of the first type are present in eubacteria, fungi, and plants, while the second type is found only in animals, indicative of a nonorthologous gene displacement early in animal evolution . PMKs of the GHMP kinase type are closely related to mevalonate kinases. The two types of PMK have different consensus ATP-binding motifs: a protein kinase motif in the ERG8 orthologs versus a P-loop or Walker A motif in the animal orthologs. The fact that ERG8 orthologs are found in pathogenic eubacteria and fungi but not in humans makes them attractive targets for the development of antibacterial and/or antifungal drugs . This group of enzymes belongs to the GHMP kinase domain superfamily. GHMP kinases are a unique class of ATP-dependent enzymes (the abbreviation of which refers to the original members: galactokinase, homoserine kinase, mevalonate kinase, and phosphomevalonate kinase) . Enzymes belonging to this superfamily contain three well-conserved motifs, the second of which has the typical sequence Pro-X-X-X-Gly-Leu-X-Ser-Ser-Ala and is involved in ATP binding . The phosphate binding loop in GHMP kinases is distinct from the classical P-loops found in many ATP/GTP binding proteins. The bound ADP molecule adopts a rare syn conformation and is in the opposite orientation from those bound to the P-loop-containing proteins . GHMP kinases display a distinctly bilobal appearance with their N-terminal subdomains dominated by a mixed beta-sheet flanked on one side by alpha-helices and their C-terminal subdomains containing a four stranded anti-parallel beta-sheet , , , .. Probab=22.79 E-value=31 Score=14.98 Aligned_cols=78 Identities=19% Similarity=0.278 Sum_probs=49.4 Q ss_pred CCCCCEEEEECCCCCHHHHH-HHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCC-----------CCCCCEEC Q ss_conf 13200455542754214578-7886543111126879899988512455315631676677-----------76664020 Q gi|254780664|r 315 ISAFGGIIAFNREVDQEVAK-EVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDF-----------HGEEIVLK 382 (536) Q Consensus 315 ~SAFGGIIa~N~~vd~~~A~-~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~-----------~~~~~~~r 382 (536) .|.|||+||+.. .|.+-.+ .+.+.=++=+++-+.=.-.++-|+.=||||+++ .+... +..+.+-. T Consensus 175 aS~ygGWiAyst-fDhdwV~q~~~~~~v~~~L~~~WPgl~i~pL~aPk~l~lli--GWTGsPASt~~LVs~vhr~k~~~~ 251 (367) T TIGR01220 175 ASVYGGWIAYST-FDHDWVKQKVEEVSVDEVLKKDWPGLSIEPLKAPKNLELLI--GWTGSPASTADLVSEVHRRKEEKS 251 (367) T ss_pred HHHHHHHHHHCC-CCHHHHHHHHHHHCHHHHHHCCCCCCEEECCCCCCCCEEEE--EECCCCCCCHHHHHHHHHCCCCCC T ss_conf 101110333057-66688875541303588861477885022067998677988--631788777678888874046762 Q ss_pred CCCHHHHHCCCCC Q ss_conf 3210223101233 Q gi|254780664|r 383 TVSGGILVQTRDN 395 (536) Q Consensus 383 si~GG~LvQ~~D~ 395 (536) +.++-||-|.++. T Consensus 252 a~y~rFl~~s~~C 264 (367) T TIGR01220 252 AAYERFLEKSRDC 264 (367) T ss_pred HHHHHHHHHHHHH T ss_conf 3688888767899 No 265 >TIGR03447 mycothiol_MshC cysteine--1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase. Members of this protein family are MshC, l-cysteine:1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase, an enzyme that uses ATP to ligate a Cys residue to a mycothiol precursor molecule, in the second to last step in mycothiol biosynthesis. This enzyme shows considerable homology to Cys--tRNA ligases, and many instances are misannotated as such. Mycothiol is found in Mycobacterium tuberculosis, Corynebacterium glutamicum, Streptomyces coelicolor, and various other members of the Actinobacteria. Mycothiol is an analog to glutathione. Probab=22.75 E-value=52 Score=13.40 Aligned_cols=108 Identities=14% Similarity=0.055 Sum_probs=63.4 Q ss_pred EEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHHHHHH Q ss_conf 98754568699999999879999994276899998899517946534881651985532371555355330889899999 Q gi|254780664|r 21 ISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNPAHMKF 100 (536) Q Consensus 21 iSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~~~~~ 100 (536) ..|||-..|-..-..+.- .=-.++|+..|.+|+-|..+|.-..=+=-|-+.. .+.-.-|+++-...-.++ T Consensus 44 pTVYd~~HiGhaR~~v~f--------D~l~R~L~~~G~~V~~v~NiTDiDDKii~~A~~~--g~~~~ela~~y~~~f~~d 113 (411) T TIGR03447 44 ITPYDATHLGHAATYLTF--------DLVNRVWRDAGHRVHYVQNVTDVDDPLFERAERD--GVDWRELGTSQIDLFRED 113 (411) T ss_pred CCCCCCCCCCCCHHHHHH--------HHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHH--CCCHHHHHHHHHHHHHHH T ss_conf 847888862234168999--------9999999983985589961465425999999882--989999999999999999 Q ss_pred HHHCCCCCEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHCCCCCCE Q ss_conf 9977998414999948888997317653112320023343469998740156313 Q gi|254780664|r 101 MQDHELESIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRAAAKNHDYVT 155 (536) Q Consensus 101 l~~~~i~~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRAAAKN~~~V~ 155 (536) |+.+||.+-| .|| .-+...+++||.|. +--.|+|.+++ T Consensus 114 ~~~Lni~~p~-----~~p-----raTe~I~~ii~~I~-------~Li~kG~AY~~ 151 (411) T TIGR03447 114 MEALRVLPPR-----DYI-----GAVESIDEVIEMVE-------KLLAAGAAYEV 151 (411) T ss_pred HHHCCCCCCC-----CCC-----CCCCCHHHHHHHHH-------HHHHCCCEEEE T ss_conf 9974999998-----776-----74347999999999-------99868967982 No 266 >TIGR01182 eda 2-dehydro-3-deoxyphosphogluconate aldolase/4-hydroxy-2-oxoglutarate aldolase; InterPro: IPR000887 4-Hydroxy-2-oxoglutarate aldolase (4.1.3.16 from EC) (KHG-aldolase) catalyzes the interconversion of 4-hydroxy-2-oxoglutarate into pyruvate and glyoxylate. Phospho-2-dehydro-3-deoxygluconate aldolase (4.1.2.14 from EC) (KDPG-aldolase) catalyzes the interconversion of 6-phospho-2-dehydro-3-deoxy-D-gluconate into pyruvate and glyceraldehyde 3-phosphate. These two enzymes are structurally and functionally related . They are both homotrimeric proteins of approximately 220 amino-acid residues. They are class I aldolases whose catalytic mechanism involves the formation of a Schiff-base intermediate between the substrate and the epsilon-amino group of a lysine residue. In both enzymes, an arginine is required for catalytic activity.; GO: 0003824 catalytic activity, 0008152 metabolic process. Probab=22.66 E-value=52 Score=13.39 Aligned_cols=33 Identities=27% Similarity=0.481 Sum_probs=15.2 Q ss_pred HHHHHHHCCCCEEEECCCCCCHHHHHHHHHHCCCEE Q ss_conf 589999719959993898779889999998759779 Q gi|254780664|r 491 GIVEAIKAGVTAVIQPGGSVRDSEAITVADQHGIAM 526 (536) Q Consensus 491 ~ie~aa~~Gi~aIiqPGGSirD~evI~aan~~gi~m 526 (536) -++.|.++|-..||-||= +++.+++|.+++|+. T Consensus 73 Q~~~A~~AGA~F~vSPG~---~p~l~~~~~~~~~P~ 105 (205) T TIGR01182 73 QLRQAVAAGAQFIVSPGL---TPELAKHAKDKGIPI 105 (205) T ss_pred HHHHHHHCCCCEEECCCC---CHHHHHHHHHCCCCE T ss_conf 999999708957876978---889999985088812 No 267 >cd01398 RPI_A RPI_A: Ribose 5-phosphate isomerase type A (RPI_A) subfamily; RPI catalyzes the reversible conversion of ribose-5-phosphate to ribulose 5-phosphate, the first step of the non-oxidative branch of the pentose phosphate pathway. This reaction leads to the conversion of phosphosugars into glycolysis intermediates, which are precursors for the synthesis of amino acids, vitamins, nucleotides, and cell wall components. In plants, RPI is part of the Calvin cycle as ribulose 5-phosphate is the carbon dioxide receptor in the first dark reaction of photosynthesis. There are two unrelated types of RPIs (A and B), which catalyze the same reaction, at least one type of RPI is present in an organism. RPI_A is more widely distributed than RPI_B in bacteria, eukaryotes, and archaea. Probab=22.62 E-value=52 Score=13.38 Aligned_cols=30 Identities=17% Similarity=0.287 Sum_probs=18.1 Q ss_pred CCCCCCHHHHH--HHHCCCCCCEEEECHHHHH Q ss_conf 02334346999--8740156313530899989 Q gi|254780664|r 135 NIDIGGPSMIR--AAAKNHDYVTILTNPQDYP 164 (536) Q Consensus 135 nIDIGGpsmiR--AAAKN~~~V~Vi~dp~dY~ 164 (536) .|-=||-+|+| -.|.+-+.+.+|+|..-+- T Consensus 90 lIKGgGgal~rEKiva~~a~~~I~i~D~sK~v 121 (213) T cd01398 90 LIKGGGGALLREKIVASAAKKFIVIADESKLV 121 (213) T ss_pred EEEECCHHHHHHHHHHHHHHEEEEEECCCCEE T ss_conf 89708289898999997422099996601160 No 268 >TIGR02253 CTE7 HAD superfamily (subfamily IA) hydrolase, TIGR02253; InterPro: IPR011950 This family of sequences from archaea and metazoans includes the human uncharacterised protein CTE7. Pyrococcus species appear to have three different forms of this enzyme, so it is unclear whether all members of this family have the same function. This family is a member of the haloacid dehalogenase (HAD) superfamily of hydrolases which are characterised by three conserved sequence motifs. By virtue of an alpha helical domain in-between the first and second conserved motif, this family is a member of subfamily IA .. Probab=22.61 E-value=52 Score=13.38 Aligned_cols=16 Identities=25% Similarity=0.329 Sum_probs=7.4 Q ss_pred CCHHHHHHHHHHCCCE Q ss_conf 6869999999987999 Q gi|254780664|r 27 TGVVEFASRLLSRGIK 42 (536) Q Consensus 27 tgl~~la~~L~~~g~~ 42 (536) .+.++....|-+.||. T Consensus 116 p~~~~~L~~LR~~Gy~ 131 (244) T TIGR02253 116 PDVVDTLMELRESGYR 131 (244) T ss_pred CCHHHHHHHHHHCCCE T ss_conf 7688999999864778 No 269 >PRK13124 consensus Probab=22.55 E-value=52 Score=13.37 Aligned_cols=12 Identities=25% Similarity=0.562 Sum_probs=4.8 Q ss_pred HHHHEECCCCHH Q ss_conf 311112687989 Q gi|254780664|r 341 TEAIIAPTLSEE 352 (536) Q Consensus 341 ~EvIiAP~f~~e 352 (536) +..+|-||.-.| T Consensus 114 v~GvIipDLP~e 125 (257) T PRK13124 114 IDGLLIPDLPLE 125 (257) T ss_pred CCEEECCCCCHH T ss_conf 984777899979 No 270 >TIGR02460 osmo_MPGsynth mannosyl-3-phosphoglycerate synthase; InterPro: IPR012812 This family consists of examples of mannosyl-3-phosphoglycerate synthase (MPGS), which together with mannosyl-3-phosphoglycerate phosphatase (MPGP), comprises a two-step pathway for mannosylglycerate biosynthesis. Mannosylglycerate is a compatible solute that tends to be restricted to extreme thermophiles of archaea and bacteria. Note that in Rhodothermus marinus, this pathway is one of two; the other is condensation of GDP-mannose with D-glycerate by mannosylglycerate synthase.; GO: 0050504 mannosyl-3-phosphoglycerate synthase activity, 0051479 mannosylglycerate biosynthetic process, 0005737 cytoplasm. Probab=22.49 E-value=47 Score=13.67 Aligned_cols=23 Identities=26% Similarity=0.320 Sum_probs=15.4 Q ss_pred HCCCCCCCCCCCCCCCH-----HHHHHHHC Q ss_conf 10589888312221200-----11677642 Q gi|254780664|r 253 LVQGKPLSYNNINDLDA-----AFELVSEF 277 (536) Q Consensus 253 ql~GKeLSYNNllD~da-----A~~lv~ef 277 (536) .++|+ -|.-+.|+|+ +|.-|+.| T Consensus 162 ~~~G~--~YvGFvDsDNY~PGAv~EYv~~Y 189 (394) T TIGR02460 162 AALGA--EYVGFVDSDNYFPGAVNEYVKIY 189 (394) T ss_pred HHHCC--CEEEEEECCCCCCCCHHHHHHHH T ss_conf 98638--70036740678774256889999 No 271 >KOG3185 consensus Probab=22.46 E-value=51 Score=13.43 Aligned_cols=34 Identities=35% Similarity=0.634 Sum_probs=27.2 Q ss_pred HCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHC Q ss_conf 332132004555427542145787886543111126879899988512 Q gi|254780664|r 312 CDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAK 359 (536) Q Consensus 312 ~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~ 359 (536) -.++||.|-+|+||.-| .++-|+.+.|.-||++. T Consensus 98 eErlsALGNviaCNDyv--------------AlvH~dldketEeii~d 131 (245) T KOG3185 98 EERLSALGNVIACNDYV--------------ALVHPDLDKETEEIIAD 131 (245) T ss_pred HHHHHHHCCEEEECCEE--------------EEECCCCCHHHHHHHHH T ss_conf 66776415787754555--------------77337633338999998 No 272 >PRK11865 pyruvate ferredoxin oxidoreductase subunit beta; Provisional Probab=22.35 E-value=33 Score=14.82 Aligned_cols=29 Identities=28% Similarity=0.426 Sum_probs=24.6 Q ss_pred CCCCCHHHHHHHHCCCCCCEEEECHHHHH Q ss_conf 23343469998740156313530899989 Q gi|254780664|r 136 IDIGGPSMIRAAAKNHDYVTILTNPQDYP 164 (536) Q Consensus 136 IDIGGpsmiRAAAKN~~~V~Vi~dp~dY~ 164 (536) -|||+-.|+.|++.|+.-..++.|-.-|. T Consensus 102 ~dIG~~~l~ha~~Rn~~i~~iv~DNe~Y~ 130 (298) T PRK11865 102 ADIGFQALSGAMERGHNIVYIMYDNEAYM 130 (298) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCEEEE T ss_conf 54217889999982998599998471210 No 273 >pfam00464 SHMT Serine hydroxymethyltransferase. Probab=22.30 E-value=42 Score=14.02 Aligned_cols=14 Identities=21% Similarity=0.161 Sum_probs=7.2 Q ss_pred CCCCCHHHHHHHHC Q ss_conf 22120011677642 Q gi|254780664|r 264 INDLDAAFELVSEF 277 (536) Q Consensus 264 llD~daA~~lv~ef 277 (536) .+|.|.--+++.++ T Consensus 146 ~ID~d~l~~~a~~~ 159 (380) T pfam00464 146 LIDYDQLEKNAKLF 159 (380) T ss_pred CCCHHHHHHHHHHC T ss_conf 66899999999970 No 274 >COG0449 GlmS Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains [Cell envelope biogenesis, outer membrane] Probab=22.24 E-value=53 Score=13.33 Aligned_cols=20 Identities=25% Similarity=0.297 Sum_probs=8.2 Q ss_pred HHHHHHHHHCCCEEEEEHHH Q ss_conf 99999999879999994276 Q gi|254780664|r 30 VEFASRLLSRGIKIISTGGT 49 (536) Q Consensus 30 ~~la~~L~~~g~~iisTgGT 49 (536) .+|-..|.+.|+...|.--| T Consensus 106 ~eLr~eL~~~G~~F~S~TDT 125 (597) T COG0449 106 AELKEELEAKGYVFKSDTDT 125 (597) T ss_pred HHHHHHHHHCCCEEECCCCH T ss_conf 99999999669888567614 No 275 >COG0120 RpiA Ribose 5-phosphate isomerase [Carbohydrate transport and metabolism] Probab=22.24 E-value=53 Score=13.33 Aligned_cols=70 Identities=11% Similarity=0.099 Sum_probs=38.6 Q ss_pred HHHHHHHHHCCCCCEEEEEEECCCHHHHHCCCC-CHHHHHHCCCCCCHHHHHH--HHCCCCCCEEEECHHHHHHHH Q ss_conf 899999997799841499994888899731765-3112320023343469998--740156313530899989889 Q gi|254780664|r 95 PAHMKFMQDHELESIDLVVVNLYPFEESFCRED-DYYTMVENIDIGGPSMIRA--AAKNHDYVTILTNPQDYPLFL 167 (536) Q Consensus 95 ~~~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~~-~~~~~IEnIDIGGpsmiRA--AAKN~~~V~Vi~dp~dY~~~~ 167 (536) ..-...+.++||+-.++--++ --...-.++ ..+.-.-.|-=||-+|+|- -|-+-+...||+|..-+-..+ T Consensus 56 ~~t~~l~~~~GI~v~~l~~~~---~lDl~iDGADEvd~~~~lIKGGGgAl~rEKIva~~ak~~IvIvDesKlV~~L 128 (227) T COG0120 56 FQTEELARELGIPVSSLNEVD---SLDLAIDGADEVDPNLNLIKGGGGALLREKIVASAAKRFIVIVDESKLVEVL 128 (227) T ss_pred HHHHHHHHHCCCEECCCCCCC---CCCEEEECCCCCCCCCCEECCCHHHHHHHHHHHHHCCEEEEEEECCCCHHHC T ss_conf 999999998297643733367---6665764641118888767037589889899997447189999676015331 No 276 >cd01977 Nitrogenase_VFe_alpha Nitrogenase_VFe_alpha -like: Nitrogenase VFe protein, alpha subunit like. This group contains proteins similar to the alpha subunits of, the VFe protein of the vanadium-dependent (V-) nitrogenase and the FeFe protein of the iron only (Fe-) nitrogenase Nitrogenase catalyzes the ATP-dependent reduction of dinitrogen (N2) to ammonia. In addition to V- and Fe- nitrogenases there is a molybdenum (Mo)-dependent nitrogenase which is the most widespread and best characterized of these systems. These systems consist of component 1 (VFe protein, FeFe protein or, MoFe protein respectively) and, component 2 (Fe protein). MoFe is an alpha2beta2 tetramer, V-and Fe- nitrogenases are alpha2beta2delta2 hexamers. The alpha and beta subunits of VFe and FeFe are similar to the alpha and beta subunits of MoFe. For MoFe each alphabeta pair contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha sub Probab=22.24 E-value=47 Score=13.67 Aligned_cols=16 Identities=31% Similarity=0.557 Sum_probs=7.9 Q ss_pred CCCCCCCCCCHHHHHH Q ss_conf 1985532371555355 Q gi|254780664|r 73 MGGRVKTLHPKIYGGI 88 (536) Q Consensus 73 l~GRVKTLHP~I~ggI 88 (536) +.|-+|--||-+||.+ T Consensus 18 ~~g~i~d~~~ivHgp~ 33 (415) T cd01977 18 IGGVIKDVIHVIHGPV 33 (415) T ss_pred HHCCCCCEEEEEECCC T ss_conf 5045476599852772 No 277 >cd01301 rDP_like renal dipeptidase (rDP), best studied in mammals and also called membrane or microsomal dipeptidase, is a membrane-bound glycoprotein hydrolyzing dipeptides and is involved in hydrolytic metabolism of penem and carbapenem beta-lactam antibiotics. Although the biological function of the enzyme is still unknown, it has been suggested to play a role in the renal glutathione metabolism. Probab=22.22 E-value=53 Score=13.33 Aligned_cols=181 Identities=20% Similarity=0.243 Sum_probs=84.1 Q ss_pred HHHHHHHCCCCCEEEEEEECCCHHHHHCCC--CCHHHHHHCCCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHC-- Q ss_conf 999999779984149999488889973176--5311232002334346999874015631353089998988987530-- Q gi|254780664|r 97 HMKFMQDHELESIDLVVVNLYPFEESFCRE--DDYYTMVENIDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMDV-- 172 (536) Q Consensus 97 ~~~~l~~~~i~~IDlVvvNLYPF~~~v~~~--~~~~~~IEnIDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~~-- 172 (536) |...|++-++. +.+--+|--...-..+ ..++.+.|+|| .+-|-.+++-..+..+.+++|......+=+. T Consensus 30 Dlprlr~Ggv~---~q~fai~v~~~~~~~~~~~~~~~al~qi~----~~~~~~~~~~~~~~l~~~~~di~~~~~~gki~~ 102 (309) T cd01301 30 DLPRLREGGVG---GQVFAIFVPPGELQPTWLDALERALEQID----RVRRLIAAYPRIFVLATSSADIRRALKEGKLAA 102 (309) T ss_pred CHHHHHHCCCC---EEEEEEEECCCCCCCCHHHHHHHHHHHHH----HHHHHHHHCCCCEEEECCHHHHHHHHHCCCEEE T ss_conf 79999866998---79999995685789506799999999999----999999869897898789999999997799799 Q ss_pred ---CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCCCCCCCCCEEEECCCCCCCCC Q ss_conf ---23456667889999999998742289999875331012113443210123333436788100650576047766821 Q gi|254780664|r 173 ---NNGKIPYNFRKKMARQAFSRTASYDTAICRWLANAESENFPDYLNITAVKKQEMRYGENPHQKAALYSTPEKKSGIA 249 (536) Q Consensus 173 ---~~g~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~~~~~~~~p~~~~~~~~~~~~LRYGENPHQ~Aa~Y~~~~~~~~~~ 249 (536) -+|.-.++.. |.. .-..|+. + -..+.+++.....+=-|..+ ....|+. T Consensus 103 ~l~iEg~~~l~~~--l~~----l~~~y~~----------G---vR~~~lt~n~~n~~~~~~~~----------~~~~GLt 153 (309) T cd01301 103 IISIEGAHALGGD--LAL----LRLLYRL----------G---VRYLGLTWNGDNKFADGCGE----------KRGGGLT 153 (309) T ss_pred EEEEECHHHHHCC--HHH----HHHHHHC----------C---CEEEEEEECCCCCCCCCCCC----------CCCCCCC T ss_conf 9855074455267--999----9999972----------9---76887542489875257899----------8899989 Q ss_pred CHHHCCCCC----C-CCCCCCCCC-----HHHHHHHHCCCCCCCEEEEECCCCCCCCCC-CCHHHHHHHHHHHHCCCCCC Q ss_conf 012105898----8-831222120-----011677642455672799961568656545-20168999999853321320 Q gi|254780664|r 250 HAVLVQGKP----L-SYNNINDLD-----AAFELVSEFRSQDCAACVIVKHMNPCGVAT-ADTLVEAYRRALSCDPISAF 318 (536) Q Consensus 250 ~~~ql~GKe----L-SYNNllD~d-----aA~~lv~ef~~~~~Pa~vIvKH~NPCGvA~-~~~~~~Ay~~A~~~DP~SAF 318 (536) .+ ||+ | .-+=++|+. +.|.++ ++.+ +-+|.-|+|+-.+.- -.|+.+.-.+|.+-- T Consensus 154 ~~----G~~~v~~mn~lgm~iDvSH~s~~~~~d~~-~~s~----~PviaSHSn~ral~~h~RNl~D~~i~aia~~----- 219 (309) T cd01301 154 PF----GKELVREMNRLGIIIDLSHLSERTFWDVL-DISN----APVIASHSNARALCDHPRNLTDAQLKAIAET----- 219 (309) T ss_pred HH----HHHHHHHHHHHCCEEECCCCCHHHHHHHH-HHCC----CCEEEECCCCCCCCCCCCCCCHHHHHHHHHC----- T ss_conf 89----99999999860967754879889999999-8458----9989857883321488788999999999984----- Q ss_pred CEEEEECCC Q ss_conf 045554275 Q gi|254780664|r 319 GGIIAFNRE 327 (536) Q Consensus 319 GGIIa~N~~ 327 (536) ||+|.+|.- T Consensus 220 gGviGi~~~ 228 (309) T cd01301 220 GGVIGVNFY 228 (309) T ss_pred CCEEEEECC T ss_conf 988999514 No 278 >PRK11449 putative deoxyribonuclease YjjV; Provisional Probab=22.17 E-value=53 Score=13.32 Aligned_cols=31 Identities=19% Similarity=0.175 Sum_probs=18.0 Q ss_pred HHHHHHCCCCCCE-EEECHHHHHHHHHHHHCC Q ss_conf 9998740156313-530899989889875302 Q gi|254780664|r 143 MIRAAAKNHDYVT-ILTNPQDYPLFLAEMDVN 173 (536) Q Consensus 143 miRAAAKN~~~V~-Vi~dp~dY~~~~~el~~~ 173 (536) +-||...+-..+. +-+||+++...++--++. T Consensus 25 i~~a~~~gv~~i~~~g~~~~~~~~~~~l~~~~ 56 (258) T PRK11449 25 LQRAAQAGVGKIIVPATEAANFARVLALAERY 56 (258) T ss_pred HHHHHHCCCCEEEEECCCHHHHHHHHHHHHHC T ss_conf 99999829999999349999999999998759 No 279 >pfam01939 DUF91 Protein of unknown function DUF91. The function of this prokaryotic protein is unknown. Probab=22.14 E-value=53 Score=13.32 Aligned_cols=61 Identities=20% Similarity=0.330 Sum_probs=40.4 Q ss_pred EEEEEC--CCCCCCCCCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCC Q ss_conf 799961--568656545201689999998533213200455542754214578788654311112687989998851245 Q gi|254780664|r 284 ACVIVK--HMNPCGVATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKP 361 (536) Q Consensus 284 a~vIvK--H~NPCGvA~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KK 361 (536) -++=+| ++.+-| -.++.-|-.++.-||.-+ ..--+++||+.++.|.++|+.+ T Consensus 150 VvVEvKRr~a~ida----V~QL~RYve~l~rd~~~~---------------------~VRGIlvAp~It~~Ar~Ll~d~- 203 (228) T pfam01939 150 VILELKRRKADLHA----VSQLKRYVDSLKEDYVEA---------------------KVRGVLVAPSITEGAKKLLEEE- 203 (228) T ss_pred EEEEEEEECCCHHH----HHHHHHHHHHHHHCCCCC---------------------CCEEEEECCCCCHHHHHHHHHC- T ss_conf 99999740378207----999999999986554888---------------------6039999985898899999975- Q ss_pred CCEEEECCC Q ss_conf 531563167 Q gi|254780664|r 362 SMRFLKTSS 370 (536) Q Consensus 362 nlRil~~~~ 370 (536) +++.+.+.. T Consensus 204 Gle~v~ldp 212 (228) T pfam01939 204 GLEFRKLEP 212 (228) T ss_pred CCEEEEECC T ss_conf 972899368 No 280 >PRK06182 short chain dehydrogenase; Validated Probab=22.13 E-value=53 Score=13.31 Aligned_cols=40 Identities=20% Similarity=0.387 Sum_probs=28.5 Q ss_pred CEEEEEEEEECCCCH-HHHHHHHHHCCCEEEEEHHHHHHHHH Q ss_conf 136989987545686-99999999879999994276899998 Q gi|254780664|r 15 AVKTALISVHNKTGV-VEFASRLLSRGIKIISTGGTCQLLEE 55 (536) Q Consensus 15 ~ikrALiSV~dKtgl-~~la~~L~~~g~~iisTgGTa~~l~~ 55 (536) +=|.|||.=... || ..+|+.|.+.|+.++.++-+...|++ T Consensus 2 k~Kv~lITGass-GIG~a~a~~la~~G~~V~~~~r~~~~l~~ 42 (273) T PRK06182 2 KKKVALVTGASS-GIGKATARKLIAEGFTVYGAARRVDKMED 42 (273) T ss_pred CCCEEEEECCCH-HHHHHHHHHHHHCCCEEEEEECCHHHHHH T ss_conf 469899906320-99999999999879989999798999999 No 281 >TIGR01826 CofD_related conserved hypothetical protein; InterPro: IPR010119 This entry represents a subfamily of conserved hypothetical proteins that forms a sister group to the family of CofD, (IPR010115 from INTERPRO), LPPG:Fo 2-phospho-L-lactate transferase, an enzyme of cytochrome F420 biosynthesis.. Probab=22.11 E-value=53 Score=13.31 Aligned_cols=18 Identities=17% Similarity=0.296 Sum_probs=8.8 Q ss_pred HHHHHHHHCCC---CCCEEEE Q ss_conf 46999874015---6313530 Q gi|254780664|r 141 PSMIRAAAKNH---DYVTILT 158 (536) Q Consensus 141 psmiRAAAKN~---~~V~Vi~ 158 (536) ++|||+.-+-. .++|-|+ T Consensus 11 s~~L~gLk~~~~ri~~iTAIV 31 (331) T TIGR01826 11 SVLLRGLKKLDSRISNITAIV 31 (331) T ss_pred HHHHHHHHHCCCCEEEEEEEE T ss_conf 789988753289414776899 No 282 >TIGR00601 rad23 UV excision repair protein Rad23; InterPro: IPR004806 All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. Rad23 contains a ubiquitin-like domain that interacts with catalytically active proteasomes and two ubiquitin (Ub)-associated (UBA) sequences that bind Ub. Rad23 interacts with ubiquitinated cellular proteins through the synergistic action of its UBA domains. In humans, Rad23 complexes with the XPC protein.; GO: 0006289 nucleotide-excision repair, 0005634 nucleus. Probab=22.05 E-value=42 Score=14.05 Aligned_cols=353 Identities=18% Similarity=0.254 Sum_probs=157.4 Q ss_pred HHHHHCCCEE--EEEHHHHHHHHHCCCCEEEHHHHCC---CCHHCCCCCCCCCCHHHHH-HHHCCCCHHHHHHHHHCCCC Q ss_conf 9999879999--9942768999988995179465348---8165198553237155535-53308898999999977998 Q gi|254780664|r 34 SRLLSRGIKI--ISTGGTCQLLEEEGIPVTSVFDITK---FPEIMGGRVKTLHPKIYGG-ILSIRDNPAHMKFMQDHELE 107 (536) Q Consensus 34 ~~L~~~g~~i--isTgGTa~~l~~~gi~v~~Vs~~Tg---fpEil~GRVKTLHP~I~gg-IL~~r~~~~~~~~l~~~~i~ 107 (536) |-|.+..|+| ++-.-|-+.||++ |++--| ||+ +|-|- ||.| ||. |.+.+.+++|. T Consensus 7 KtLq~~~f~Isd~~P~~tv~~lK~K------I~~~~gkDa~pv---~~qKL----IY~GKiL~------Dd~t~~Ey~I~ 67 (453) T TIGR00601 7 KTLQQQKFKISDVEPDETVKELKEK------IEQEQGKDAYPV---AQQKL----IYSGKILS------DDKTVSEYKIK 67 (453) T ss_pred ECCCCCEEEEECCCCCCHHHHHHHH------HHHHCCCCCCCH---HCCEE----EECCCCCC------CCCCHHHCCCC T ss_conf 5145775366307713048999999------997608735620---02021----00453316------75415442665 Q ss_pred CEEEEEEECCCHHHHHCCCCCHHHHHHCCCCCCHHHHHH-------------------HHC-CCCCCE------------ Q ss_conf 414999948888997317653112320023343469998-------------------740-156313------------ Q gi|254780664|r 108 SIDLVVVNLYPFEESFCREDDYYTMVENIDIGGPSMIRA-------------------AAK-NHDYVT------------ 155 (536) Q Consensus 108 ~IDlVvvNLYPF~~~v~~~~~~~~~IEnIDIGGpsmiRA-------------------AAK-N~~~V~------------ 155 (536) -=|-|||=+== .|+... . .--...---.--|+-+-+ -+. |-.-+. T Consensus 68 E~~FvVvMV~k-~K~~~t-a-vPattqPsstPsPttvsssPav~~~~~P~~~P~sAp~P~p~~~~~as~~~~~~~APa~~ 144 (453) T TIGR00601 68 EKDFVVVMVSK-PKTTTT-A-VPATTQPSSTPSPTTVSSSPAVTKTTEPAATPESAPTPTPENAPPASPAEKKSAAPAST 144 (453) T ss_pred CCCEEEEEECC-CCCCCC-C-CCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 48758998515-765331-2-34455777777875111553435567887774325788788888888886566758656 Q ss_pred --------------EEECHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---CC-CCHHHHHH Q ss_conf --------------5308999898898753023456667889999999998742289999875331---01-21134432 Q gi|254780664|r 156 --------------ILTNPQDYPLFLAEMDVNNGKIPYNFRKKMARQAFSRTASYDTAICRWLANA---ES-ENFPDYLN 217 (536) Q Consensus 156 --------------Vi~dp~dY~~~~~el~~~~g~~~~~~R~~lA~kAF~~ta~YD~~Ia~~l~~~---~~-~~~p~~~~ 217 (536) +...++-|.. ..|..+.+--..=|.-+|..=+.|..+|.+-|.=. +. +..- T Consensus 145 ~~~~~~~~e~~~~~~~~~~~~~~~-------s~~~~~r~nlfEDA~S~lV~G~~~e~~~~~Im~MGGGmYeRe~V~---- 213 (453) T TIGR00601 145 VAESTPTEEEVTATAPASPDTSAP-------SEGDSDRNNLFEDAASTLVVGSEREETIEEIMEMGGGMYEREEVE---- 213 (453) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCC-------CCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHH---- T ss_conf 776788875443566688776666-------776432345444433111010007889999986278835088999---- Q ss_pred HCCCCCCCCCCC-CCCCCCCEEEECCCCCCCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHCCCCCCCEEEEECCCCCCCC Q ss_conf 101233334367-8810065057604776682101210589888312221200116776424556727999615686565 Q gi|254780664|r 218 ITAVKKQEMRYG-ENPHQKAALYSTPEKKSGIAHAVLVQGKPLSYNNINDLDAAFELVSEFRSQDCAACVIVKHMNPCGV 296 (536) Q Consensus 218 ~~~~~~~~LRYG-ENPHQ~Aa~Y~~~~~~~~~~~~~ql~GKeLSYNNllD~daA~~lv~ef~~~~~Pa~vIvKH~NPCGv 296 (536) .-||=- =||| +|.=| .|-|=+=+.-+-.--.+|-.++..+... ++ ..|.+. T Consensus 214 ------rALRAafNNPd-RAVEY-------------LL~GIPe~~~~~~p~~sa~~~aA~~q~~--~~------~~~~~~ 265 (453) T TIGR00601 214 ------RALRAAFNNPD-RAVEY-------------LLTGIPEDLEAREPEESAAALAAAEQPS--EA------AATTET 265 (453) T ss_pred ------HHHHHHCCCHH-HHHHH-------------HCCCCCCCCCCCCHHHHHHHHHHHHCCC--CC------CCCCCC T ss_conf ------99974027814-78985-------------3068886500146356899998760773--00------356778 Q ss_pred CCCCHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCC---CCEEEECCCCCC Q ss_conf 45201689999998533213200455542754214578788654311112687989998851245---531563167667 Q gi|254780664|r 297 ATADTLVEAYRRALSCDPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKP---SMRFLKTSSLLD 373 (536) Q Consensus 297 A~~~~~~~Ay~~A~~~DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KK---nlRil~~~~~~~ 373 (536) -...++.++=..+=++-|.. . ++..-...++ +.=..-|++|+.-+ +||=+.-. T Consensus 266 p~~~~v~~~Aa~~~N~~~~~----l--~~~~~~t~~~--------------~~G~~PLefLR~~PqFq~lRq~vq~---- 321 (453) T TIGR00601 266 PQEDDVFEQAALAENALSLA----L--FTEETATDAA--------------GVGGDPLEFLRNQPQFQQLRQVVQQ---- 321 (453) T ss_pred CCCHHHHHHHCCCCCCCCCC----C--CCCCCCCCCC--------------CCCCCCHHHCCCCHHHHHHHHHHHC---- T ss_conf 88636998710467765112----3--5666665434--------------3677732211673656767277612---- Q ss_pred CCCCCCEECCCCHHHHHCC--CC-CC---CC--CCCC-CCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEEE------- Q ss_conf 7766640203210223101--23-34---53--1000-12467778688988899999986005667799996------- Q gi|254780664|r 374 FHGEEIVLKTVSGGILVQT--RD-NV---VD--NKEL-TVVTKRSPTDQELRDMKFAFKVVKHVKSNAVVYAK------- 437 (536) Q Consensus 374 ~~~~~~~~rsi~GG~LvQ~--~D-~~---~~--~~~~-~vVT~~~pt~~e~~dL~FA~kv~K~vkSNAIv~ak------- 437 (536) ++ ++.--+|.|= .| .+ |. .+.| .. -..|-++.+.|.+= +..=+.-. T Consensus 322 -NP------~~L~~lL~~iG~~nPQL~~~I~~h~E~F~ql--L~Epv~~nmqDv~~---------~~~d~~~gGGG~~~~ 383 (453) T TIGR00601 322 -NP------QLLPPLLQQIGQENPQLLQQISQHPEEFLQL--LNEPVEENMQDVLE---------GESDMVGGGGGEDIE 383 (453) T ss_pred -CH------HHHHHHHHHHCCCCHHHHHHHHHCCHHHHHH--HCCCCCHHHHHHHC---------CCCCCCCCCCCCCCC T ss_conf -66------7899999973453737999965260678887--50473135898732---------467776787556776 Q ss_pred -CCEEEEECCCCCC-----HHHHH---HHHHHHHHHHHHCCCCCCCCCCEEEEECCCCCCCCHHHHHHHC Q ss_conf -8938997296888-----89999---9999998764200122223487099973668685558999971 Q gi|254780664|r 438 -DGRTVGIGSGQTS-----RVDST---RFAAIKAHNISAQADVKSMTNGSVIASEAFYPFPDGIVEAIKA 498 (536) Q Consensus 438 -~~~tiGiGaGQ~s-----RVda~---~iA~~KA~~~~~~~~~~~~~~g~vlaSDAFFPF~D~ie~aa~~ 498 (536) .|..=--|.||++ .|+-+ +-||+|-+.+|++ +|-| =.|||==.=+=|+|+++ T Consensus 384 ~~G~iae~g~gq~~GeGsf~I~vTp~e~EAIeRL~~LGF~-------r~LV--IqAYfACdKNEeLAAN~ 444 (453) T TIGR00601 384 VTGEIAEAGVGQMNGEGSFQIQVTPEEKEAIERLCALGFD-------RGLV--IQAYFACDKNEELAANY 444 (453) T ss_pred CCCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHCCCC-------CCCE--EEEEECCCCCHHHHHHH T ss_conf 4355524467865665211676341257789999971798-------6751--43221011017999989 No 283 >pfam08887 GAD-like GAD-like domain. This domain is functionally uncharacterized, but it appears to be distantly related to the GAD domain pfam02938. Probab=21.97 E-value=15 Score=17.27 Aligned_cols=35 Identities=20% Similarity=0.186 Sum_probs=20.2 Q ss_pred HHHHHCC-CCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHH Q ss_conf 1232002-33434699987401563135308999898898753 Q gi|254780664|r 130 YTMVENI-DIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMD 171 (536) Q Consensus 130 ~~~IEnI-DIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~ 171 (536) ++++|.- ..|..++. .--.-+|+|.||..+++++- T Consensus 34 ~~Ll~~W~~~G~g~~~-------dG~f~~vnP~ey~~vl~~w~ 69 (109) T pfam08887 34 EQLLEYWIEHGWGGYG-------DGLFWIVNPQEYKPVVDSWL 69 (109) T ss_pred HHHHHHHHHCCCCHHC-------CCEEEEECHHHHHHHHHHHH T ss_conf 9999999980960220-------85599758799899999997 No 284 >PRK13129 consensus Probab=21.87 E-value=54 Score=13.28 Aligned_cols=32 Identities=16% Similarity=0.266 Sum_probs=16.2 Q ss_pred HHHHHHHHEECCCCHHHHHHHH---CCCCCEEEEC Q ss_conf 8654311112687989998851---2455315631 Q gi|254780664|r 337 IKVFTEAIIAPTLSEEAADVLA---KKPSMRFLKT 368 (536) Q Consensus 337 ~~~F~EvIiAP~f~~eAleiL~---~KKnlRil~~ 368 (536) .+.=++.+|.||.-.|-.+-+. ++-++.++.+ T Consensus 120 ~~~GvdGvIipDLP~eE~~~~~~~~~~~gl~~I~l 154 (267) T PRK13129 120 AAAGVAGLVVPDLPLEEAEKLSPIAAERGIDLILL 154 (267) T ss_pred HHCCCCEEECCCCCHHHHHHHHHHHHHCCCEEEEE T ss_conf 86698757678999899999999998539816899 No 285 >TIGR02036 dsdC D-serine deaminase transcriptional activator; InterPro: IPR011781 This family, part of the LysR family of transcriptional regulators, activates transcription of the gene for D-serine deaminase, dsdA. Trusted members of this family so far are found adjacent to dsdA and only in Gammaproteobacteria, including Escherichia coli, Vibrio cholerae, and Colwellia psychrerythraea.. Probab=21.69 E-value=49 Score=13.54 Aligned_cols=15 Identities=33% Similarity=0.561 Sum_probs=8.1 Q ss_pred HHHHHHHHHHCCCEE Q ss_conf 889999998759779 Q gi|254780664|r 512 DSEAITVADQHGIAM 526 (536) Q Consensus 512 D~evI~aan~~gi~m 526 (536) |=.||+|-|..|||| T Consensus 233 DLAVIAA~NnaGvAM 247 (302) T TIGR02036 233 DLAVIAAMNNAGVAM 247 (302) T ss_pred HHHHHHHHHHCCHHC T ss_conf 799999985044110 No 286 >cd06345 PBP1_ABC_ligand_binding_like_10 Type I periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems that are predicted to be involved in uptake of amino acids, peptides, or inorganic ions. This subgroup includes the type I periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems that are predicted to be involved in uptake of amino acids, peptides, or inorganic ions. This subgroup has high sequence similarity to members of the family of hydrophobic amino acid transporters (HAAT), such as leucine/isoleucine/valine binding protein (LIVBP); however its ligand specificity has not been determined experimentally. Probab=21.68 E-value=54 Score=13.25 Aligned_cols=35 Identities=17% Similarity=0.238 Sum_probs=15.5 Q ss_pred HHHHHHHHHH-CCCEEEE----EHHH---HHHHHHCCCCEEEH Q ss_conf 6999999998-7999999----4276---89999889951794 Q gi|254780664|r 29 VVEFASRLLS-RGIKIIS----TGGT---CQLLEEEGIPVTSV 63 (536) Q Consensus 29 l~~la~~L~~-~g~~iis----TgGT---a~~l~~~gi~v~~V 63 (536) -...++.|.+ .++.++- |+-| +..+.++++|+... T Consensus 55 a~~~a~~Li~~~~V~~iiG~~~S~~~~a~~~~~~~~~vp~i~~ 97 (344) T cd06345 55 AVRAFERLVSQDKVDAVVGGYSSEVVLALQDVAAENKVPFIVT 97 (344) T ss_pred HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHCCCEEEEE T ss_conf 9999999996599708746768578899999999869549980 No 287 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=21.67 E-value=54 Score=13.25 Aligned_cols=43 Identities=12% Similarity=0.041 Sum_probs=24.7 Q ss_pred CCCEEEEECCCCCHHHHHHHHHHHHH---HHEECCCCHHHHHHHHC Q ss_conf 20045554275421457878865431---11126879899988512 Q gi|254780664|r 317 AFGGIIAFNREVDQEVAKEVIKVFTE---AIIAPTLSEEAADVLAK 359 (536) Q Consensus 317 AFGGIIa~N~~vd~~~A~~I~~~F~E---vIiAP~f~~eAleiL~~ 359 (536) .+-+|++.|...-....+.+.+.=.. .|..-|-+++|++-.+. T Consensus 211 ~id~V~a~ND~mAlGA~~Al~~aG~~~~v~V~G~Dg~~~al~~I~~ 256 (330) T PRK10355 211 KIDAVVASNDATAGGAIQALSAQGLSGKVAISGQDADLAAIKRIVA 256 (330) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCCCCEEECCCCCHHHHHHHHC T ss_conf 5169998997148999999996699999769902799899999974 No 288 >PRK07168 bifunctional uroporphyrinogen-III methyltransferase/uroporphyrinogen-III synthase; Reviewed Probab=21.53 E-value=54 Score=13.23 Aligned_cols=183 Identities=12% Similarity=0.150 Sum_probs=78.8 Q ss_pred CCCEEEEECCCCCH--HHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEEECCCCCCCCCCCCEECCCCHHHH-HCCC Q ss_conf 20045554275421--45787886543111126879899988512455315631676677766640203210223-1012 Q gi|254780664|r 317 AFGGIIAFNREVDQ--EVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFLKTSSLLDFHGEEIVLKTVSGGIL-VQTR 393 (536) Q Consensus 317 AFGGIIa~N~~vd~--~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil~~~~~~~~~~~~~~~rsi~GG~L-vQ~~ 393 (536) =||==|.+.|+=+. +++..+.+.=-|++--|.++.+-..-+. .. +....+| +.+-+.|..|--. .+.- T Consensus 249 LfG~rVLVtRtreQA~~ls~~L~~~GA~v~E~PtI~~~~~~~~~-----~~--i~~l~~y--dwlvFTS~NgV~~Ff~~L 319 (474) T PRK07168 249 LHGKKVLFTSATNKTSVMKQKLQEAGAEIYQIPTFKKEEYTLTL-----EQ--INEIFNV--DRLVFCSAESVEILMQSC 319 (474) T ss_pred CCCCEEEECCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHH-----HH--HHHHCCC--CEEEEECHHHHHHHHHHH T ss_conf 78867997786677699999999769938994453168856379-----99--8753338--989993878999999999 Q ss_pred C-CCCCCCCC--CCCCCCCCCHHHHHH-HHHHHHHHHCCCCCEEEEEECCEEEEECCCCCCHHHHHHHHHH--------- Q ss_conf 3-34531000--124677786889888-9999998600566779999689389972968888999999999--------- Q gi|254780664|r 394 D-NVVDNKEL--TVVTKRSPTDQELRD-MKFAFKVVKHVKSNAVVYAKDGRTVGIGSGQTSRVDSTRFAAI--------- 460 (536) Q Consensus 394 D-~~~~~~~~--~vVT~~~pt~~e~~d-L~FA~kv~K~vkSNAIv~ak~~~tiGiGaGQ~sRVda~~iA~~--------- 460 (536) . ...|-..+ ++..=-..|.++++. =+|+..+ +...+--+++. -..+|+..++--+- T Consensus 320 ~~~g~D~R~L~~kiaaiG~aT~~~L~~~Gll~~~~-~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~t~~ 388 (474) T PRK07168 320 SKYKKDIRSLQAELQHMNVATQEKLMQYGLLSKEA-KFSSDTTVYLG----------RNINRIAFIQEKIGAGSYMMTHK 388 (474) T ss_pred HHCCCCHHHCCCEEEEECHHHHHHHHHCCCCCCCC-CCCCCEEEECC----------CCHHHHHHHHHCCCCCCEEEECC T ss_conf 97398457607617887589999999769734422-03686589804----------55116788870578753032100 Q ss_pred -----HHHHHHHCCCCCCCCCCEEEEECCCCCCCCHHHH----HHHCCCCEEEECCCCCCHHHHHHHHHHCCCE Q ss_conf -----9876420012222348709997366868555899----9971995999389877988999999875977 Q gi|254780664|r 461 -----KAHNISAQADVKSMTNGSVIASEAFYPFPDGIVE----AIKAGVTAVIQPGGSVRDSEAITVADQHGIA 525 (536) Q Consensus 461 -----KA~~~~~~~~~~~~~~g~vlaSDAFFPF~D~ie~----aa~~Gi~aIiqPGGSirD~evI~aan~~gi~ 525 (536) |-.+.... ...+..-.+| +||=+-+|+. ..+.|..-|..--=|--|+-+.++||+.|.. T Consensus 389 ~~~~~~~~~~~~r-~~~~~~~~~v-----~f~~~~~~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 456 (474) T PRK07168 389 YTIDHRFDEVHSR-MLSEFLWDSI-----VFEGRASIDTFLAEVKRLGFIDIVTLPFSYTDVPTLHYANKVGFH 456 (474) T ss_pred EEEHHHHHHHHHH-HCCCCCEEEE-----EECCHHHHHHHHHHHHHCCCHHHHCCCCHHCCCHHHHHHHHCCCC T ss_conf 0001236777763-2343540389-----965800699999998751512122277300372888888760754 No 289 >pfam02449 Glyco_hydro_42 Beta-galactosidase. This group of beta-galactosidase enzymes belong to the glycosyl hydrolase 42 family. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues. Probab=21.46 E-value=49 Score=13.57 Aligned_cols=29 Identities=28% Similarity=0.244 Sum_probs=18.3 Q ss_pred CHHHHHHHHHHHHCCCCCCCEEEEECCCC Q ss_conf 01689999998533213200455542754 Q gi|254780664|r 300 DTLVEAYRRALSCDPISAFGGIIAFNREV 328 (536) Q Consensus 300 ~~~~~Ay~~A~~~DP~SAFGGIIa~N~~v 328 (536) +.+.+||-.+|=+-..|+|.-|..=+... T Consensus 166 d~LN~aWgt~fWs~~y~~w~~i~~P~~~~ 194 (376) T pfam02449 166 DALNEAWGTAFWSQTYSDFDEIEPPRPAP 194 (376) T ss_pred HHHHHHHHHHHHCCCCCCHHHCCCCCCCC T ss_conf 99999964554236069888768876788 No 290 >cd01539 PBP1_GGBP Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. GGBP is a member of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic GGBP is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=21.41 E-value=55 Score=13.21 Aligned_cols=17 Identities=18% Similarity=0.229 Sum_probs=6.2 Q ss_pred EEEEECCCCCHHHHHHH Q ss_conf 45554275421457878 Q gi|254780664|r 320 GIIAFNREVDQEVAKEV 336 (536) Q Consensus 320 GIIa~N~~vd~~~A~~I 336 (536) +|++.|...-...++.+ T Consensus 204 ~I~a~ND~mAlGA~~Al 220 (303) T cd01539 204 AVIANNDAMALGAIEAL 220 (303) T ss_pred EEEECCCHHHHHHHHHH T ss_conf 99978947799999999 No 291 >cd05313 NAD_bind_2_Glu_DH NAD(P) binding domain of glutamate dehydrogenase, subgroup 2. Amino acid dehydrogenase (DH) is a widely distributed family of enzymes that catalyzes the oxidative deamination of an amino acid to its keto acid and ammonia with concomitant reduction of NADP+. Glutamate DH is a multidomain enzyme that catalyzes the reaction from glutamate to 2-oxyoglutarate and ammonia in the presence of NAD or NADP. It is present in all organisms. Enzymes involved in ammonia asimilation are typically NADP+-dependent, while those involved in glutamate catabolism are generally NAD+-dependent. Amino acid DH-like NAD(P)-binding domains are members of the Rossmann fold superfamily and include glutamate, leucine, and phenylalanine DHs, methylene tetrahydrofolate DH, methylene-tetrahydromethanopterin DH, methylene-tetrahydropholate DH/cyclohydrolase, Shikimate DH-like proteins, malate oxidoreductases, and glutamyl tRNA reductase. Amino acid DHs catalyze the deamination of amino acids t Probab=21.40 E-value=55 Score=13.22 Aligned_cols=38 Identities=34% Similarity=0.383 Sum_probs=25.4 Q ss_pred EECCCCCHHHHHHHH----HHHHHHHEECCCCHHHHHHHHCCC Q ss_conf 542754214578788----654311112687989998851245 Q gi|254780664|r 323 AFNREVDQEVAKEVI----KVFTEAIIAPTLSEEAADVLAKKP 361 (536) Q Consensus 323 a~N~~vd~~~A~~I~----~~F~EvIiAP~f~~eAleiL~~KK 361 (536) |+-..|+.+.|..|. +..+|.-=.| .++||-+||.+|. T Consensus 129 A~en~I~~~nA~~i~a~~ck~IvEgANgP-~T~eAd~iL~~kG 170 (254) T cd05313 129 ATQNEVDAEDAKLLVKNGCKYVAEGANMP-CTAEAIEVFRQAG 170 (254) T ss_pred CCCCCCCHHHHHHHHHCCCEEEEECCCCC-CCHHHHHHHHHCC T ss_conf 54667899999999967982998658899-9878999999889 No 292 >PRK00168 coaD phosphopantetheine adenylyltransferase; Provisional Probab=21.38 E-value=55 Score=13.21 Aligned_cols=48 Identities=19% Similarity=0.180 Sum_probs=34.9 Q ss_pred CCCCCHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHH Q ss_conf 233434699987401563135308999898898753023456667889999999998 Q gi|254780664|r 136 IDIGGPSMIRAAAKNHDYVTILTNPQDYPLFLAEMDVNNGKIPYNFRKKMARQAFSR 192 (536) Q Consensus 136 IDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~~~~el~~~~g~~~~~~R~~lA~kAF~~ 192 (536) |-.|--.+++-|++-|..|.|++-.+ -. .++-++.+.|..|..+++.+ T Consensus 13 it~GHldii~ra~~lfD~viv~v~~N--------p~-K~~~f~~eeR~~mi~~~~~~ 60 (159) T PRK00168 13 ITNGHLDIIERASKLFDEVIVAVAIN--------PS-KKPLFSLEERVELIREVTAH 60 (159) T ss_pred CCHHHHHHHHHHHHHCCEEEEECCCC--------CC-CCCCCCHHHHHHHHHHHHCC T ss_conf 84479999999998679999940569--------88-77778999999999998515 No 293 >cd06380 PBP1_iGluR_AMPA N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the AMPA receptor. N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the AMPA (alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid) receptor, a member of the glutamate-receptor ion channels (iGluRs). AMPA receptors are the major mediators of excitatory synaptic transmission in the central nervous system. While this N-terminal domain belongs to the periplasmic-binding fold type I superfamily, the glutamate-binding domain of the iGluR is structurally homologous to the periplasmic-binding fold type II. The LIVBP-like domain of iGluRs is thought to play a role in the initial assembly of iGluR subunits, but it is not well understood how this domain is arranged and functions in intact iGluR. AMPA receptors consist of four types of subunits (GluR1, GluR2, GluR3, and GluR4) which combine to form a tetramer and play an important roles in mediating the rapid excita Probab=21.36 E-value=55 Score=13.21 Aligned_cols=26 Identities=23% Similarity=0.305 Sum_probs=13.3 Q ss_pred EEECCCCCCCCHHHHHHHC------CCCEEEE Q ss_conf 9973668685558999971------9959993 Q gi|254780664|r 480 IASEAFYPFPDGIVEAIKA------GVTAVIQ 505 (536) Q Consensus 480 laSDAFFPF~D~ie~aa~~------Gi~aIiq 505 (536) +.++..=||..|-.+.... |++.-|| T Consensus 316 C~~~~~~~w~~G~~l~~~ik~v~~~GlTG~I~ 347 (382) T cd06380 316 CLANPAVPWEHGIDIERALKKVQFEGLTGNVQ 347 (382) T ss_pred CCCCCCCCCCCHHHHHHHHHHCCCCCCEEEEE T ss_conf 57777876676789999997188866422249 No 294 >TIGR02434 CobF precorrin-6A synthase (deacetylating); InterPro: IPR012797 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobF precorrin-6A synthase (2.1.1.152 from EC), a pathway-specific enzyme in the aerobic pathway. After precorrin-4 is methylated at C-11 by CobM to produce precorrin-5, the extruded acyl group is then removed in the subsequent step, which also sees a methyl group added at C-1 in a reaction catalysed by CobF. The product of this reaction is precorrin-6A, which is subsequently reduced by an NADH-dependent reductase to precorrin-6B . This entry identifies CobF in high GC Gram-positive, alphaproteobacteria and pseudomonas-related species.. Probab=21.36 E-value=46 Score=13.76 Aligned_cols=11 Identities=36% Similarity=0.767 Sum_probs=5.1 Q ss_pred HHHHHHHHHHH Q ss_conf 22899998753 Q gi|254780664|r 195 SYDTAICRWLA 205 (536) Q Consensus 195 ~YD~~Ia~~l~ 205 (536) .|..++.+|=. T Consensus 80 ~Y~~~V~~WH~ 90 (259) T TIGR02434 80 DYRAAVDDWHA 90 (259) T ss_pred CHHHHHHHHHH T ss_conf 42778999999 No 295 >COG2820 Udp Uridine phosphorylase [Nucleotide transport and metabolism] Probab=21.29 E-value=46 Score=13.76 Aligned_cols=36 Identities=14% Similarity=0.234 Sum_probs=20.4 Q ss_pred HHHHH-HHHHHHHHHHCCCCCEEEEE-ECCEEEEECCC Q ss_conf 88988-89999998600566779999-68938997296 Q gi|254780664|r 412 DQELR-DMKFAFKVVKHVKSNAVVYA-KDGRTVGIGSG 447 (536) Q Consensus 412 ~~e~~-dL~FA~kv~K~vkSNAIv~a-k~~~tiGiGaG 447 (536) .-||+ ..+|..--++..++-+|..+ -|...-.++.- T Consensus 189 ~~EMEsAtlftl~~~~G~rag~V~~vi~n~~~~e~~~~ 226 (248) T COG2820 189 NIEMESATLFTLGSLRGLRAGAVLGVIANRTQGEQGVA 226 (248) T ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCC T ss_conf 66778999999998718342418999824224221433 No 296 >PRK13361 molybdenum cofactor biosynthesis protein A; Provisional Probab=21.14 E-value=55 Score=13.18 Aligned_cols=76 Identities=21% Similarity=0.369 Sum_probs=40.4 Q ss_pred CCHHHHHHHHHHC-CCE-E-EEEHHH-----HHHHHHCCCCEEEHHHHCCCCHHCCCCCCCCCCHHHHHHHHCCCCHHH- Q ss_conf 6869999999987-999-9-994276-----899998899517946534881651985532371555355330889899- Q gi|254780664|r 27 TGVVEFASRLLSR-GIK-I-ISTGGT-----CQLLEEEGIPVTSVFDITKFPEIMGGRVKTLHPKIYGGILSIRDNPAH- 97 (536) Q Consensus 27 tgl~~la~~L~~~-g~~-i-isTgGT-----a~~l~~~gi~v~~Vs~~TgfpEil~GRVKTLHP~I~ggIL~~r~~~~~- 97 (536) .+|.+|.+.|.+. |++ | +.|-|+ ++.|+++|+.-..||= -||.|..|.-|- +++.-.. T Consensus 76 ~dl~~li~~l~~~~gi~~islTTNG~lL~~~a~~Lk~aGL~rvNISL------------DsLd~~~f~~IT-r~~~l~~V 142 (329) T PRK13361 76 TGCDQLVARLGKLPGLEELSMTTNGSRLARFAAELADAGLKRLNISL------------DTLNPDLFAALT-RNGRLERV 142 (329) T ss_pred CCHHHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHCCCCEEEEEC------------CCCCHHHHHHHH-CCCCHHHH T ss_conf 68899999986179977189966477689999999977998699735------------779999999772-89976999 Q ss_pred ---HHHHHHCCCCCEEEEEEE Q ss_conf ---999997799841499994 Q gi|254780664|r 98 ---MKFMQDHELESIDLVVVN 115 (536) Q Consensus 98 ---~~~l~~~~i~~IDlVvvN 115 (536) +..+.+.|+.++-+=+|= T Consensus 143 l~gI~aA~~~G~~~VKiN~V~ 163 (329) T PRK13361 143 IAGIDAAKAAGFERIKLNAVI 163 (329) T ss_pred HHHHHHHHHCCCCEEEEEEEE T ss_conf 999999997799738899998 No 297 >PRK13118 consensus Probab=21.10 E-value=55 Score=13.17 Aligned_cols=13 Identities=23% Similarity=0.240 Sum_probs=6.2 Q ss_pred HHHHHEECCCCHH Q ss_conf 4311112687989 Q gi|254780664|r 340 FTEAIIAPTLSEE 352 (536) Q Consensus 340 F~EvIiAP~f~~e 352 (536) =+..+|-||.-.| T Consensus 122 GvdGvIipDLP~e 134 (269) T PRK13118 122 GVDGLILVDLPPE 134 (269) T ss_pred CCCEEECCCCCHH T ss_conf 9974645899978 No 298 >PTZ00136 eukaryotic translation initiation factor 6; Provisional Probab=20.99 E-value=56 Score=13.16 Aligned_cols=40 Identities=30% Similarity=0.472 Sum_probs=31.8 Q ss_pred CCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHEECCCCHHHHHHHHCCCCCEEE Q ss_conf 321320045554275421457878865431111268798999885124553156 Q gi|254780664|r 313 DPISAFGGIIAFNREVDQEVAKEVIKVFTEAIIAPTLSEEAADVLAKKPSMRFL 366 (536) Q Consensus 313 DP~SAFGGIIa~N~~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~~KKnlRil 366 (536) +..+|.|-+|++|... .++-|++++|+.++++.-=+.-+. T Consensus 99 ~k~tAlGN~Il~ND~~--------------AlvhP~l~~e~~e~I~dvLgVev~ 138 (247) T PTZ00136 99 ERLSALGNCIACNDYV--------------ALIHTDLDRETEEIIADVLGVEVF 138 (247) T ss_pred CCCCCCCCEEEECCCE--------------EEECCCCCHHHHHHHHHHCCCCEE T ss_conf 5434501499975731--------------685488998999999762486289 No 299 >PRK06846 deaminase; Validated Probab=20.83 E-value=56 Score=13.13 Aligned_cols=15 Identities=20% Similarity=0.386 Sum_probs=10.4 Q ss_pred CHHCCCCCCCCCCHH Q ss_conf 165198553237155 Q gi|254780664|r 70 PEIMGGRVKTLHPKI 84 (536) Q Consensus 70 pEil~GRVKTLHP~I 84 (536) =+|=|||++.+.|.. T Consensus 34 i~I~dGkI~~I~~~~ 48 (410) T PRK06846 34 LEIQDGKIVAIRENK 48 (410) T ss_pred EEEECCEEEEEECCC T ss_conf 998589898870588 No 300 >PRK00132 rpsI 30S ribosomal protein S9; Reviewed Probab=20.79 E-value=50 Score=13.51 Aligned_cols=22 Identities=18% Similarity=0.220 Sum_probs=18.7 Q ss_pred EECCCCCCHHHHHHHHHHHHHH Q ss_conf 9729688889999999999876 Q gi|254780664|r 443 GIGSGQTSRVDSTRFAAIKAHN 464 (536) Q Consensus 443 GiGaGQ~sRVda~~iA~~KA~~ 464 (536) =-|.|.+...+|+++|+.||-. T Consensus 67 V~GGG~sGQa~AirlaIaRaL~ 88 (130) T PRK00132 67 VKGGGLSGQAGAIRHGIARALL 88 (130) T ss_pred EECCCHHHHHHHHHHHHHHHHH T ss_conf 9798898999999999999999 No 301 >PRK12828 short chain dehydrogenase; Provisional Probab=20.78 E-value=56 Score=13.13 Aligned_cols=43 Identities=19% Similarity=0.097 Sum_probs=31.7 Q ss_pred CCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHH Q ss_conf 7613698998754568699999999879999994276899998 Q gi|254780664|r 13 EIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEE 55 (536) Q Consensus 13 ~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~ 55 (536) +++=|.|||+=.-+-==...|+.|.+.|..++-++-..+.+.+ T Consensus 4 ~L~gKvalITGas~GIG~aia~~la~~Ga~V~i~~~~~~~~~~ 46 (239) T PRK12828 4 SLQGKVVAITGGFGGLGRATAAWLAARGARVALIGRGAAPLSQ 46 (239) T ss_pred CCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCHHHHHH T ss_conf 9698989994725489999999999879989999798778999 No 302 >PRK12826 3-ketoacyl-(acyl-carrier-protein) reductase; Reviewed Probab=20.55 E-value=57 Score=13.09 Aligned_cols=43 Identities=16% Similarity=0.135 Sum_probs=28.1 Q ss_pred CCCEEEEEEEEECCCCHHHHHHHHHHCCCEEEEEHHHHHHHHH Q ss_conf 7613698998754568699999999879999994276899998 Q gi|254780664|r 13 EIAVKTALISVHNKTGVVEFASRLLSRGIKIISTGGTCQLLEE 55 (536) Q Consensus 13 ~~~ikrALiSV~dKtgl~~la~~L~~~g~~iisTgGTa~~l~~ 55 (536) +++=|.|||.=.-+-==..+|+.|.+.|+.++-++-..+.+++ T Consensus 3 ~L~gK~alITGgs~GIG~aia~~la~~G~~V~~~~r~~~~~~~ 45 (253) T PRK12826 3 DLMGRVALVTGAARGIGRAIAVRFAADGADVIVVDICGQAAAA 45 (253) T ss_pred CCCCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCHHHHHH T ss_conf 8899889994897789999999999879989999898899999 No 303 >COG1838 FumA Tartrate dehydratase beta subunit/Fumarate hydratase class I, C-terminal domain [Energy production and conversion] Probab=20.37 E-value=57 Score=13.07 Aligned_cols=55 Identities=22% Similarity=0.192 Sum_probs=24.7 Q ss_pred CCCCCCCEEEEECC--CCCHHHHHHHHHHHHHHHEECCCCHHHHHHHH-CCCCCEEEECCC Q ss_conf 32132004555427--54214578788654311112687989998851-245531563167 Q gi|254780664|r 313 DPISAFGGIIAFNR--EVDQEVAKEVIKVFTEAIIAPTLSEEAADVLA-KKPSMRFLKTSS 370 (536) Q Consensus 313 DP~SAFGGIIa~N~--~vd~~~A~~I~~~F~EvIiAP~f~~eAleiL~-~KKnlRil~~~~ 370 (536) |.+=+.+|+.++=. ....++.+.+.++=.=-++||+= |-.+++ .=|..|.+.+.. T Consensus 88 ~~~l~~~G~~~~iGKG~~~~~~~ea~~~~kavyl~~~gG---aA~L~a~~IK~~~~v~~~d 145 (184) T COG1838 88 DELLEQTGVLAMIGKGGRGPETVEACKKHKAVYLVAPGG---AAALAAKSIKSVRCVAYED 145 (184) T ss_pred HHHHHHCCEEEEEECCCCCHHHHHHHHHCCEEEEECCCH---HHHHHHHHHHHEEEEEECC T ss_conf 999974182999964876778899998759289972626---9999998640323786234 No 304 >COG4750 LicC CTP:phosphocholine cytidylyltransferase involved in choline phosphorylation for cell surface LPS epitopes [Cell envelope biogenesis, outer membrane] Probab=20.29 E-value=24 Score=15.77 Aligned_cols=42 Identities=17% Similarity=0.340 Sum_probs=19.2 Q ss_pred CCCCCCCCCCHHHHHHHHCCC---CHHHHHHHHHCCCCCEEEEEE Q ss_conf 198553237155535533088---989999999779984149999 Q gi|254780664|r 73 MGGRVKTLHPKIYGGILSIRD---NPAHMKFMQDHELESIDLVVV 114 (536) Q Consensus 73 l~GRVKTLHP~I~ggIL~~r~---~~~~~~~l~~~~i~~IDlVvv 114 (536) ||-|.+-|.-..+-++|---+ .+..+++|++-||..|-+||- T Consensus 10 ~gsR~~plT~~tpK~LlkV~g~plIErqI~~L~e~gI~dI~IVvG 54 (231) T COG4750 10 LGSRFVPLTQSTPKSLLKVNGEPLIERQIEQLREAGIDDITIVVG 54 (231) T ss_pred CCCCCCCCCCCCCHHHHHHCCCCCHHHHHHHHHHCCCCEEEEEEE T ss_conf 566533131028767887358550999999999779861899963 No 305 >PRK00120 putative deoxyribonucleotide triphosphate pyrophosphatase; Reviewed Probab=20.19 E-value=33 Score=14.83 Aligned_cols=30 Identities=27% Similarity=0.460 Sum_probs=17.0 Q ss_pred EEEEEEEEECCCCHHHHHHHHHHCCCEEEE Q ss_conf 369899875456869999999987999999 Q gi|254780664|r 16 VKTALISVHNKTGVVEFASRLLSRGIKIIS 45 (536) Q Consensus 16 ikrALiSV~dKtgl~~la~~L~~~g~~iis 45 (536) |++-++.-.++..+.|+.+.|...|+++++ T Consensus 2 m~ki~~aT~N~~K~~E~~~il~~~~~~i~~ 31 (197) T PRK00120 2 MKKLVLASHNAGKLRELAALLAPFGLEVVS 31 (197) T ss_pred CCCEEEECCCHHHHHHHHHHHHHCCCEEEE T ss_conf 863999929867999999988765988975 No 306 >PRK02478 Maf-like protein; Reviewed Probab=20.17 E-value=47 Score=13.71 Aligned_cols=46 Identities=22% Similarity=0.350 Sum_probs=29.8 Q ss_pred CCCCCCCCCCEEEECCCCCCCCCCHHHCCCCCCCCCCCCCCC--HHHHHHHHC Q ss_conf 367881006505760477668210121058988831222120--011677642 Q gi|254780664|r 227 RYGENPHQKAALYSTPEKKSGIAHAVLVQGKPLSYNNINDLD--AAFELVSEF 277 (536) Q Consensus 227 RYGENPHQ~Aa~Y~~~~~~~~~~~~~ql~GKeLSYNNllD~d--aA~~lv~ef 277 (536) ++||.|-.+|.-|.... .|...++.+.| ||+|++-+- .-++++++| T Consensus 147 ~~~~~~~~kAG~Y~Iqg--~g~~~i~~I~G---d~~nVvGLPl~~l~~~L~e~ 194 (199) T PRK02478 147 RVGDKALSSVGAYQLEG--EGIQLFEKIEG---DYFTIVGLPLLPLLAKLREL 194 (199) T ss_pred HHCCCCCCEEEEEEECC--CHHHCEEEEEC---CCCCCCCCCHHHHHHHHHHC T ss_conf 74997465415997716--56764750389---86214565699999999974 No 307 >PRK11866 2-oxoacid ferredoxin oxidoreductase subunit beta; Provisional Probab=20.01 E-value=58 Score=13.01 Aligned_cols=30 Identities=40% Similarity=0.485 Sum_probs=26.0 Q ss_pred CCCCCHHHHHHHHCCCCCCEEEECHHHHHH Q ss_conf 233434699987401563135308999898 Q gi|254780664|r 136 IDIGGPSMIRAAAKNHDYVTILTNPQDYPL 165 (536) Q Consensus 136 IDIGGpsmiRAAAKN~~~V~Vi~dp~dY~~ 165 (536) -+|||-.++-|+-.|..-+.++.|-.-|.. T Consensus 98 ~~IG~nh~ihA~rRN~dit~iv~nN~iYGl 127 (286) T PRK11866 98 YGIGLQHLIHAARRNVDITYIVSNNQVYGL 127 (286) T ss_pred HCCHHHHHHHHHHHCCCCEEEEECCHHHCC T ss_conf 042388999999848993799988743214 Done!