Query gi|254780768|ref|YP_003065181.1| hypothetical protein CLIBASIA_03295 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 281 No_of_seqs 128 out of 720 Neff 6.7 Searched_HMMs 39220 Date Sun May 29 20:00:03 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780768.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3494 Uncharacterized protei 100.0 0 0 552.8 29.1 275 2-278 5-279 (279) 2 pfam06230 DUF1009 Protein of u 100.0 0 0 490.7 23.3 212 54-274 1-212 (212) 3 PRK00881 purH bifunctional pho 97.4 0.0061 1.6E-07 40.2 12.0 115 139-274 387-506 (514) 4 COG0138 PurH AICAR transformyl 97.0 0.0049 1.3E-07 40.9 7.8 117 136-273 385-506 (515) 5 PRK07106 5-aminoimidazole-4-ca 96.7 0.014 3.5E-07 37.9 8.2 121 140-274 203-383 (391) 6 COG2910 Putative NADH-flavin r 94.8 0.12 3E-06 31.6 6.4 152 3-165 2-198 (211) 7 PRK05647 purN phosphoribosylgl 94.6 0.15 3.8E-06 30.9 6.5 76 1-76 1-91 (200) 8 TIGR00639 PurN phosphoribosylg 94.0 0.21 5.3E-06 29.9 6.2 72 3-77 2-105 (215) 9 PRK09288 purT phosphoribosylgl 92.0 0.74 1.9E-05 26.2 6.6 70 1-71 12-81 (395) 10 PRK06019 phosphoribosylaminoim 90.4 0.9 2.3E-05 25.6 5.8 62 1-63 7-68 (377) 11 pfam07905 PucR Purine cataboli 89.3 0.82 2.1E-05 25.9 4.8 48 232-279 61-108 (122) 12 pfam01808 AICARFT_IMPCHas AICA 87.5 1.2 3.2E-05 24.7 4.8 41 140-183 257-297 (315) 13 pfam00289 CPSase_L_chain Carba 85.9 1.5 3.8E-05 24.2 4.4 69 5-74 4-81 (109) 14 smart00798 AICARFT_IMPCHas AIC 85.5 1.8 4.6E-05 23.6 4.7 41 140-183 253-293 (311) 15 COG1432 Uncharacterized conser 83.9 2.3 5.7E-05 22.9 4.6 85 105-199 55-156 (181) 16 PRK03202 6-phosphofructokinase 83.0 3.7 9.4E-05 21.5 5.4 31 1-31 2-38 (323) 17 PRK13175 consensus 81.6 3.2 8.1E-05 22.0 4.7 66 1-70 1-67 (206) 18 PRK11199 tyrA bifunctional cho 79.5 5.3 0.00013 20.5 6.8 60 2-61 99-162 (374) 19 PRK08655 prephenate dehydrogen 79.1 4 0.0001 21.3 4.5 60 3-62 2-79 (441) 20 COG0026 PurK Phosphoribosylami 77.9 5.9 0.00015 20.1 5.8 36 1-37 1-36 (375) 21 CHL00188 hisH imidazole glycer 77.0 6.2 0.00016 20.0 5.1 65 1-70 1-67 (210) 22 PRK10537 voltage-gated potassi 76.0 6.1 0.00015 20.1 4.7 137 3-157 205-348 (356) 23 PRK08462 biotin carboxylase; V 76.0 6.6 0.00017 19.8 5.5 63 5-67 7-78 (446) 24 PRK05568 flavodoxin; Provision 75.9 6.2 0.00016 20.0 4.7 125 1-162 1-141 (142) 25 PRK13174 consensus 75.6 6.8 0.00017 19.7 5.0 66 1-70 1-68 (212) 26 pfam07035 Mic1 Colon cancer-as 75.0 3.9 0.0001 21.3 3.5 83 50-136 76-165 (167) 27 PRK13180 consensus 74.8 6.8 0.00017 19.7 4.7 64 1-69 1-66 (209) 28 pfam00551 Formyl_trans_N Formy 74.7 7.1 0.00018 19.6 7.3 122 3-128 2-140 (181) 29 PRK13173 consensus 74.6 7.2 0.00018 19.6 4.9 75 1-81 1-77 (211) 30 pfam02844 GARS_N Phosphoribosy 74.3 7.3 0.00019 19.5 8.5 93 6-132 3-96 (99) 31 PRK13171 consensus 73.4 7.6 0.00019 19.4 5.9 65 1-70 1-67 (200) 32 PRK12743 acetoin dehydrogenase 71.2 7.8 0.0002 19.3 4.3 163 1-173 1-180 (253) 33 TIGR02339 thermosome_arch ther 70.2 8 0.0002 19.2 4.2 76 105-180 266-350 (522) 34 CHL00194 ycf39 Ycf39; Provisio 69.7 9.2 0.00023 18.8 7.3 34 3-36 2-35 (319) 35 PRK12384 sorbitol-6-phosphate 69.4 8.3 0.00021 19.1 4.1 166 1-176 1-184 (259) 36 PRK13172 consensus 69.1 9.5 0.00024 18.7 5.4 65 1-70 1-67 (213) 37 pfam03807 F420_oxidored NADP o 68.3 8.7 0.00022 19.0 4.0 32 3-35 1-32 (93) 38 pfam03446 NAD_binding_2 NAD bi 68.2 5.5 0.00014 20.3 3.0 152 1-166 1-156 (163) 39 PRK05642 DNA replication initi 68.0 9.1 0.00023 18.9 4.1 111 10-124 58-186 (234) 40 KOG3111 consensus 65.9 8.1 0.00021 19.2 3.5 76 176-267 136-215 (224) 41 PRK08903 hypothetical protein; 65.4 11 0.00028 18.3 5.0 112 9-124 54-177 (227) 42 TIGR01324 cysta_beta_ly_B cyst 65.4 11 0.00028 18.3 4.7 56 218-273 117-177 (389) 43 PRK06202 hypothetical protein; 65.2 8.4 0.00021 19.1 3.5 30 5-34 67-96 (233) 44 TIGR01285 nifN nitrogenase mol 63.2 12 0.00031 18.0 5.0 68 2-70 329-396 (451) 45 pfam02254 TrkA_N TrkA-N domain 62.1 10 0.00026 18.5 3.5 28 6-34 2-29 (115) 46 PRK05784 phosphoribosylamine-- 61.4 13 0.00034 17.8 9.8 126 6-165 4-145 (485) 47 PRK06444 prephenate dehydrogen 59.9 14 0.00036 17.6 6.7 60 3-75 2-61 (197) 48 TIGR03420 DnaA_homol_Hda DnaA 59.8 14 0.00036 17.6 5.8 113 9-124 50-179 (226) 49 COG3250 LacZ Beta-galactosidas 58.4 15 0.00038 17.5 5.9 62 209-274 297-362 (808) 50 pfam06189 5-nucleotidase 5'-nu 57.9 4.6 0.00012 20.9 1.0 22 56-77 51-74 (263) 51 TIGR00423 TIGR00423 conserved 57.8 14 0.00036 17.6 3.5 38 43-82 30-67 (331) 52 TIGR03023 WcaJ_sugtrans Undeca 57.8 15 0.00038 17.4 6.2 72 3-75 129-201 (451) 53 PRK06928 pyrroline-5-carboxyla 56.9 15 0.00039 17.3 3.6 25 1-26 1-25 (275) 54 PRK10124 putative UDP-glucose 56.4 16 0.00041 17.2 7.2 70 3-76 145-215 (464) 55 PTZ00331 alpha/beta hydrolase; 56.2 5.9 0.00015 20.1 1.4 42 233-274 168-211 (222) 56 PRK12446 N-acetylglucosaminyl 56.1 16 0.00041 17.2 7.4 75 1-75 1-101 (352) 57 PRK13148 consensus 55.7 16 0.00042 17.2 4.4 67 1-70 1-69 (225) 58 PRK12367 short chain dehydroge 54.0 17 0.00044 17.0 6.5 199 2-253 18-226 (250) 59 TIGR03025 EPS_sugtrans exopoly 54.0 17 0.00044 17.0 6.6 71 4-75 127-198 (445) 60 pfam09035 Tn916-Xis Excisionas 53.3 18 0.00045 16.9 5.1 40 229-268 24-63 (66) 61 TIGR03234 OH-pyruv-isom hydrox 52.8 18 0.00046 16.9 4.9 149 53-247 85-236 (254) 62 COG0299 PurN Folate-dependent 52.7 18 0.00046 16.8 6.8 73 3-75 2-89 (200) 63 PRK10200 putative racemase; Pr 52.6 18 0.00047 16.8 4.7 76 1-76 1-86 (230) 64 TIGR02379 ECA_wecE TDP-4-keto- 51.9 19 0.00048 16.8 5.3 97 154-274 55-151 (376) 65 TIGR01780 SSADH succinic semia 51.3 13 0.00034 17.7 2.6 30 106-136 242-271 (454) 66 PRK08643 acetoin reductase; Va 51.0 19 0.00049 16.7 4.2 75 1-75 1-90 (256) 67 PRK08727 hypothetical protein; 50.8 19 0.00049 16.6 6.9 110 11-124 55-182 (233) 68 pfam01370 Epimerase NAD depend 50.0 20 0.00051 16.6 8.3 72 5-76 2-76 (235) 69 cd01012 YcaC_related YcaC rela 49.9 20 0.00051 16.5 3.8 114 16-135 27-156 (157) 70 TIGR00506 ribB 3,4-dihydroxy-2 49.5 18 0.00047 16.8 3.0 197 15-276 5-224 (230) 71 PRK09496 trkA potassium transp 49.0 21 0.00053 16.5 4.0 70 5-76 3-73 (455) 72 pfam01936 DUF88 Protein of unk 48.8 21 0.00053 16.4 4.4 14 151-164 72-85 (140) 73 pfam00365 PFK Phosphofructokin 48.5 21 0.00054 16.4 5.5 212 2-254 1-225 (279) 74 COG1395 Predicted transcriptio 48.1 21 0.00054 16.4 4.6 21 116-136 212-232 (313) 75 cd02071 MM_CoA_mut_B12_BD meth 47.9 22 0.00055 16.3 5.6 63 14-76 16-90 (122) 76 PRK08084 DNA replication initi 47.8 22 0.00055 16.3 6.3 112 10-124 58-187 (235) 77 PRK03659 glutathione-regulated 47.0 22 0.00057 16.3 3.3 120 5-144 403-534 (602) 78 PRK06893 DNA replication initi 46.7 22 0.00057 16.2 3.7 113 9-124 51-181 (229) 79 PRK08666 5'-methylthioadenosin 46.6 11 0.00027 18.4 1.5 204 1-268 1-229 (261) 80 pfam00308 Bac_DnaA Bacterial d 45.2 24 0.0006 16.1 4.0 101 53-167 118-218 (219) 81 PTZ00093 nucleoside diphosphat 44.9 24 0.00061 16.0 3.5 74 109-202 18-91 (149) 82 PRK06242 flavodoxin; Provision 44.6 24 0.00061 16.0 3.6 51 111-165 92-147 (150) 83 PRK07707 consensus 44.6 24 0.00061 16.0 8.9 164 1-175 1-176 (239) 84 PRK12833 acetyl-CoA carboxylas 44.3 24 0.00062 16.0 5.6 63 6-68 9-80 (458) 85 TIGR02620 cas_VVA1548 putative 44.2 19 0.00049 16.7 2.4 28 10-37 39-66 (93) 86 pfam07085 DRTGG DRTGG domain. 44.0 25 0.00063 16.0 5.1 45 232-279 51-95 (105) 87 PRK11706 TDP-4-oxo-6-deoxy-D-g 43.7 25 0.00063 15.9 4.8 14 257-270 301-314 (375) 88 TIGR01977 am_tr_V_EF2568 cyste 43.4 25 0.00064 15.9 5.3 135 102-273 32-177 (384) 89 PRK03562 glutathione-regulated 42.9 25 0.00064 15.9 2.9 146 5-170 402-564 (615) 90 COG1492 CobQ Cobyric acid synt 42.3 3.3 8.5E-05 21.8 -1.7 100 67-166 158-264 (486) 91 PRK04301 radA DNA repair and r 42.1 14 0.00036 17.6 1.5 25 154-180 229-254 (318) 92 PRK05586 biotin carboxylase; V 41.7 27 0.00068 15.7 6.7 69 1-69 1-78 (447) 93 PTZ00325 malate dehydrogenase; 41.6 24 0.00061 16.0 2.6 104 1-123 1-110 (313) 94 PRK11064 wecC UDP-N-acetyl-D-m 40.8 27 0.0007 15.6 7.1 100 1-106 3-113 (415) 95 pfam06057 VirJ Bacterial virul 40.8 27 0.0007 15.6 3.6 70 1-74 1-75 (192) 96 COG0151 PurD Phosphoribosylami 40.2 28 0.00071 15.6 9.3 151 6-195 4-173 (428) 97 PRK00349 uvrA excinuclease ABC 39.8 28 0.00072 15.5 3.0 29 224-252 514-551 (944) 98 PRK08654 pyruvate carboxylase 39.0 29 0.00074 15.5 5.4 67 1-69 1-78 (497) 99 PRK00726 murG N-acetylglucosam 39.0 29 0.00074 15.4 8.6 76 1-76 1-103 (359) 100 PRK06123 short chain dehydroge 39.0 29 0.00075 15.4 4.6 168 1-177 2-189 (249) 101 COG0105 Ndk Nucleoside diphosp 38.7 21 0.00054 16.4 2.0 76 109-204 18-93 (135) 102 KOG3985 consensus 38.6 8.5 0.00022 19.1 -0.1 19 1-19 9-27 (283) 103 PRK10669 putative cation:proto 38.4 30 0.00076 15.4 3.5 28 6-33 421-448 (558) 104 cd00363 PFK Phosphofructokinas 38.3 30 0.00076 15.4 5.2 30 2-31 1-36 (338) 105 COG0399 WecE Predicted pyridox 38.1 30 0.00077 15.4 4.7 141 17-167 87-268 (374) 106 PRK00081 coaE dephospho-CoA ki 38.1 23 0.00058 16.2 2.1 29 1-29 2-30 (199) 107 cd02022 DPCK Dephospho-coenzym 38.0 24 0.00061 16.0 2.1 26 4-29 2-27 (179) 108 pfam01121 CoaE Dephospho-CoA k 37.2 25 0.00063 15.9 2.1 26 4-29 3-28 (179) 109 TIGR00315 cdhB CO dehydrogenas 37.2 26 0.00066 15.8 2.2 18 106-123 106-123 (170) 110 TIGR00630 uvra excinuclease AB 36.3 32 0.00082 15.2 3.5 43 208-266 536-587 (956) 111 PRK08057 cobalt-precorrin-6x r 35.9 33 0.00083 15.1 6.6 69 1-73 1-70 (241) 112 PRK09234 fbiC FO synthase; Rev 35.2 33 0.00085 15.1 4.3 226 23-270 521-824 (846) 113 COG0351 ThiD Hydroxymethylpyri 35.1 33 0.00085 15.0 4.1 54 124-182 129-182 (263) 114 pfam03721 UDPG_MGDP_dh_N UDP-g 34.7 34 0.00087 15.0 7.0 97 3-106 2-113 (185) 115 pfam11731 Cdd1 Pathogenicity l 34.6 19 0.00049 16.7 1.2 23 225-247 15-37 (92) 116 PRK12845 3-ketosteroid-delta-1 34.4 34 0.00088 15.0 3.2 26 6-32 20-45 (566) 117 PRK01184 hypothetical protein; 34.2 29 0.00075 15.4 2.1 30 1-30 1-30 (183) 118 pfam06948 consensus 33.9 35 0.00089 14.9 6.4 61 1-68 1-69 (112) 119 PRK01130 N-acetylmannosamine-6 33.9 27 0.00069 15.7 1.9 137 26-199 55-195 (222) 120 PRK05678 succinyl-CoA syntheta 33.7 35 0.0009 14.9 4.0 139 4-178 11-179 (289) 121 pfam01784 NIF3 NIF3 (NGG1p int 33.6 35 0.0009 14.9 3.6 84 179-275 134-218 (238) 122 cd03110 Fer4_NifH_child This p 33.3 36 0.00091 14.9 2.4 29 138-166 116-144 (179) 123 PRK08356 hypothetical protein; 32.9 36 0.00093 14.8 6.5 25 112-136 20-44 (195) 124 pfam04131 NanE Putative N-acet 32.9 28 0.00072 15.5 1.9 138 23-199 28-165 (192) 125 PRK05301 pyrroloquinoline quin 32.7 37 0.00093 14.8 5.3 79 48-133 46-129 (375) 126 COG0569 TrkA K+ transport syst 32.7 37 0.00093 14.8 3.9 69 5-75 3-73 (225) 127 PRK09856 fructoselysine 3-epim 32.1 37 0.00095 14.7 4.6 169 50-263 88-261 (276) 128 COG1060 ThiH Thiamine biosynth 31.8 38 0.00096 14.7 3.6 180 45-242 86-283 (370) 129 COG0237 CoaE Dephospho-CoA kin 31.2 38 0.00096 14.7 2.3 31 1-31 2-32 (201) 130 TIGR03035 trp_arylform arylfor 30.5 40 0.001 14.6 4.6 63 210-272 102-168 (206) 131 PRK08591 acetyl-CoA carboxylas 30.5 40 0.001 14.5 5.8 69 1-69 1-78 (449) 132 TIGR01026 fliI_yscN ATPase Fli 30.2 40 0.001 14.5 2.7 89 2-125 174-267 (455) 133 cd04955 GT1_like_6 This family 30.1 40 0.001 14.5 4.4 34 3-36 1-43 (363) 134 cd04729 NanE N-acetylmannosami 30.0 40 0.001 14.5 2.8 138 24-199 57-198 (219) 135 PRK10217 dTDP-glucose 4,6-dehy 30.0 40 0.001 14.5 7.7 75 1-76 1-85 (355) 136 PRK12824 acetoacetyl-CoA reduc 29.9 41 0.001 14.5 13.6 167 1-177 1-183 (245) 137 PRK13143 hisH imidazole glycer 29.8 41 0.001 14.5 5.8 58 1-62 1-59 (201) 138 PRK10799 putative hydrolase-ox 29.8 41 0.001 14.5 4.1 16 16-31 47-62 (247) 139 cd00431 cysteine_hydrolases Cy 29.7 41 0.001 14.5 4.6 19 57-75 101-119 (161) 140 TIGR03471 HpnJ hopanoid biosyn 29.6 41 0.001 14.4 3.4 53 223-278 281-350 (472) 141 PRK13525 glutamine amidotransf 29.3 42 0.0011 14.4 5.0 61 1-66 1-61 (191) 142 PRK05472 redox-sensing transcr 29.3 42 0.0011 14.4 4.8 108 105-236 91-203 (211) 143 pfam02887 PK_C Pyruvate kinase 29.0 42 0.0011 14.4 3.0 39 150-190 73-111 (117) 144 COG2210 Peroxiredoxin family p 28.7 43 0.0011 14.3 4.6 31 2-32 3-38 (137) 145 PRK00635 excinuclease ABC subu 28.6 43 0.0011 14.3 3.4 36 224-266 1412-1456(1809) 146 cd02037 MRP-like MRP (Multiple 28.5 43 0.0011 14.3 2.2 30 4-33 2-36 (169) 147 TIGR02053 MerA mercuric reduct 28.2 43 0.0011 14.3 3.0 37 101-137 216-254 (494) 148 TIGR01284 alt_nitrog_alph nitr 28.2 43 0.0011 14.3 2.2 77 23-127 165-242 (468) 149 pfam01041 DegT_DnrJ_EryC1 DegT 28.1 44 0.0011 14.3 5.0 15 257-271 291-305 (363) 150 COG3473 Maleate cis-trans isom 28.0 44 0.0011 14.3 2.2 24 107-130 129-152 (238) 151 TIGR02363 dhaK1 dihydroxyaceto 27.9 31 0.0008 15.2 1.4 52 56-107 117-171 (354) 152 KOG4471 consensus 27.8 44 0.0011 14.3 5.6 97 157-269 185-297 (717) 153 TIGR02545 ATP_syn_fliI flagell 27.8 44 0.0011 14.2 3.1 73 2-77 159-259 (439) 154 pfam02670 DXP_reductoisom 1-de 27.7 44 0.0011 14.2 4.1 55 5-74 2-56 (129) 155 TIGR02079 THD1 threonine dehyd 27.7 25 0.00064 15.9 0.9 176 4-217 149-339 (415) 156 PRK13010 purU formyltetrahydro 27.1 45 0.0012 14.2 3.4 77 53-139 157-236 (289) 157 COG1082 IolE Sugar phosphate i 26.9 46 0.0012 14.1 5.4 20 233-252 233-253 (274) 158 PRK10787 DNA-binding ATP-depen 26.9 46 0.0012 14.1 4.4 59 9-72 361-421 (784) 159 PRK13146 hisH imidazole glycer 26.8 46 0.0012 14.1 5.8 63 1-66 1-65 (208) 160 COG4597 BatB ABC-type amino ac 26.5 21 0.00054 16.4 0.3 44 211-254 308-357 (397) 161 PRK06027 purU formyltetrahydro 26.4 47 0.0012 14.1 4.0 33 43-75 136-174 (285) 162 TIGR02312 HpaH 2-oxo-hepta-3-e 26.3 28 0.00072 15.5 0.9 16 65-81 234-249 (271) 163 cd06167 LabA_like LabA_like pr 26.3 47 0.0012 14.1 3.8 55 111-178 55-109 (149) 164 COG0473 LeuB Isocitrate/isopro 26.2 47 0.0012 14.1 5.3 164 1-182 3-187 (348) 165 PRK04296 thymidine kinase; Pro 26.1 47 0.0012 14.0 5.3 32 1-32 1-37 (197) 166 PTZ00117 malate dehydrogenase; 26.1 47 0.0012 14.0 2.7 114 1-137 1-130 (313) 167 COG5014 Predicted Fe-S oxidore 25.9 47 0.0012 14.0 5.2 24 51-74 77-100 (228) 168 pfam04227 Indigoidine_A Indigo 25.7 48 0.0012 14.0 2.9 157 15-178 35-222 (293) 169 CHL00175 minD septum-site dete 25.6 48 0.0012 14.0 2.1 152 1-170 12-180 (279) 170 PRK11640 putative transcriptio 25.4 46 0.0012 14.1 1.9 46 107-152 4-49 (191) 171 cd06353 PBP1_BmpA_Med_like Per 25.2 49 0.0012 13.9 4.5 58 212-280 154-211 (258) 172 pfam02571 CbiJ Precorrin-6x re 25.2 49 0.0012 13.9 5.9 70 3-73 2-72 (246) 173 LOAD_Ccd2 consensus 25.1 49 0.0013 13.9 5.6 63 12-75 17-79 (114) 174 TIGR00763 lon ATP-dependent pr 24.8 50 0.0013 13.9 4.7 64 9-77 462-528 (941) 175 pfam10055 DUF2292 Uncharacteri 24.8 50 0.0013 13.9 3.7 30 158-187 4-34 (38) 176 TIGR03100 hydr1_PEP hydrolase, 24.7 50 0.0013 13.9 6.1 66 8-74 40-107 (274) 177 TIGR03466 HpnA hopanoid-associ 24.6 50 0.0013 13.9 6.7 70 3-74 2-73 (328) 178 COG1526 FdhD Uncharacterized p 24.5 50 0.0013 13.8 11.8 108 154-279 146-253 (266) 179 PRK06703 flavodoxin; Provision 24.4 51 0.0013 13.8 5.3 44 1-44 1-49 (151) 180 pfam00070 Pyr_redox Pyridine n 24.4 51 0.0013 13.8 5.6 54 6-67 3-56 (82) 181 COG0178 UvrA Excinuclease ATPa 23.7 52 0.0013 13.8 3.8 36 224-266 506-550 (935) 182 TIGR00486 TIGR00486 conserved 23.7 52 0.0013 13.8 4.1 202 14-249 57-286 (325) 183 PRK13361 molybdenum cofactor b 23.5 53 0.0013 13.7 7.6 117 47-179 43-167 (329) 184 PRK12999 pyruvate carboxylase; 23.5 53 0.0013 13.7 5.5 65 5-69 8-82 (1147) 185 TIGR01133 murG undecaprenyldip 23.0 54 0.0014 13.7 4.6 50 1-50 5-63 (368) 186 PRK00164 moaA molybdenum cofac 22.8 54 0.0014 13.6 7.8 171 47-246 49-228 (334) 187 COG0027 PurT Formate-dependent 22.4 55 0.0014 13.6 5.5 65 6-70 16-80 (394) 188 pfam09374 PG_binding_3 Predict 22.1 56 0.0014 13.6 1.9 13 229-241 20-32 (67) 189 cd03870 M14_CPA Peptidase M14 21.9 56 0.0014 13.5 4.9 62 110-173 170-238 (301) 190 PRK09087 hypothetical protein; 21.9 56 0.0014 13.5 3.2 67 52-124 104-173 (226) 191 PRK07178 acetyl-CoA carboxylas 21.7 57 0.0015 13.5 6.1 68 1-68 1-76 (471) 192 TIGR02361 dak_ATP dihydroxyace 21.7 52 0.0013 13.8 1.5 31 56-88 304-337 (627) 193 pfam00814 Peptidase_M22 Glycop 21.6 57 0.0015 13.5 2.1 16 234-249 173-188 (225) 194 PRK06714 S-adenosylhomocystein 21.5 58 0.0015 13.5 6.0 70 1-74 1-77 (236) 195 COG1903 CbiD Cobalamin biosynt 21.4 58 0.0015 13.5 3.2 64 18-82 195-264 (367) 196 COG5583 Uncharacterized small 21.3 58 0.0015 13.4 2.5 33 156-188 10-43 (54) 197 pfam09652 Cas_VVA1548 Putative 21.3 58 0.0015 13.4 2.4 24 11-34 40-63 (93) 198 PRK01216 DNA polymerase IV; Va 21.3 52 0.0013 13.7 1.5 25 105-129 145-170 (351) 199 pfam04994 TfoX_C TfoX C-termin 21.2 58 0.0015 13.4 2.1 16 227-242 4-19 (77) 200 TIGR00097 HMP-P_kinase phospho 21.0 59 0.0015 13.4 2.9 79 105-193 109-190 (264) 201 pfam04127 DFP DNA / pantothena 21.0 59 0.0015 13.4 6.5 66 10-76 27-95 (197) 202 PRK01372 ddl D-alanine--D-alan 20.8 59 0.0015 13.4 3.8 81 1-87 4-98 (304) 203 COG2344 AT-rich DNA-binding pr 20.8 59 0.0015 13.4 4.1 112 105-237 91-204 (211) 204 PRK13304 L-aspartate dehydroge 20.5 60 0.0015 13.3 4.7 34 1-35 1-34 (265) 205 PRK04140 hypothetical protein; 20.4 61 0.0015 13.3 4.9 48 189-239 254-302 (319) 206 PRK08887 nicotinic acid mononu 20.4 58 0.0015 13.5 1.5 31 1-32 1-35 (173) 207 TIGR00624 tag DNA-3-methyladen 20.3 61 0.0015 13.3 1.8 19 111-129 146-164 (185) 208 COG1647 Esterase/lipase [Gener 20.2 61 0.0016 13.3 3.5 122 8-133 25-151 (243) 209 pfam00743 FMO-like Flavin-bind 20.2 61 0.0016 13.3 4.4 31 2-33 2-32 (532) 210 TIGR02275 DHB_AMP_lig 2,3-dihy 20.0 62 0.0016 13.3 2.1 29 247-275 251-283 (534) No 1 >COG3494 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=0 Score=552.81 Aligned_cols=275 Identities=47% Similarity=0.738 Sum_probs=266.0 Q ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCH Q ss_conf 73999977980389999999978997899995486673103885678738999999999986495406770242321000 Q gi|254780768|r 2 KRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNV 81 (281) Q Consensus 2 ~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~ 81 (281) .|||||||+|.||.++|++++.+|+.||++.+++++|+++++|++..+++|.+|++++.|+.++|+++||+|+|++||.| T Consensus 5 ~klglIaG~G~LP~~va~~a~~~G~~~~ii~l~~eaD~~~~~~e~~~~~iG~vg~lik~l~~~~v~~vVl~G~v~~Rp~~ 84 (279) T COG3494 5 GKLGLIAGNGSLPLEVAENARNQGYAPFIIGLRGEADPELKEFEYKEVSIGEVGKLIKLLKTEGVDRVVLAGGVERRPNF 84 (279) T ss_pred CEEEEEECCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHCCCCEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCH T ss_conf 60799955986758999999867998279983575665552588707757879999999998488689996452568534 Q ss_pred HHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 11102603578999999875104803589999999997697482112225242035433455465323478899899999 Q gi|254780768|r 82 QDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSA 161 (281) Q Consensus 82 ~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~ 161 (281) .++++|+.+...+++++..+ .+|||++|+.++++||+.||+|+++|+++|++++++|.+|+..|+.++++||+.|++++ T Consensus 85 ~~L~~d~~~l~~lp~Iv~~~-~~gDDaLLk~vi~~~E~~GfKvigahei~~~ll~~~g~lt~~~P~~~d~~dI~~g~~aA 163 (279) T COG3494 85 RDLRPDKIGLAVLPKIVEAL-IRGDDALLKAVIDFIESRGFKVIGAHEIVPGLLAETGPLTKKEPDNEDLRDIELGIEAA 163 (279) T ss_pred HHCCCCCCHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHCCCEEECHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 54064300146779999986-14967899999999986586885576611110257876567888722488999999999 Q ss_pred HHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEEECCCCHHHHHHHHHC Q ss_conf 97332672259998197488962534217999999975431234566771899934888753121066279999999984 Q gi|254780768|r 162 EALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRADLPSIGAKTVQNVIKA 241 (281) Q Consensus 162 ~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~DlP~IG~~Ti~~~~~a 241 (281) +.+|++||||++||.+|+|+|+|++||||+||+||+++++.++ ...++|||||+|||+||+|+|||||||+|++++++| T Consensus 164 ~~lg~lDVGQ~aV~~~g~vvAvEg~EGTd~ml~R~a~lr~~gr-~~rr~gvLvK~aKp~QD~R~DlPtIG~~Ti~~a~ka 242 (279) T COG3494 164 NALGALDVGQGAVVVGGRVVAVEGAEGTDAMLRRVADLRKAGR-AERRGGVLVKMAKPQQDERADLPTIGPNTIENAAKA 242 (279) T ss_pred HHHCCCCCCCEEEEECCEEEEEEECCCHHHHHHHHHHHHHCCC-CCCCCCEEEECCCCCCCCCCCCCCCCHHHHHHHHHC T ss_conf 8742034565469967769999603264999999998764355-565687899826888543136875588999999975 Q ss_pred CCEEEEEECCCEEEECHHHHHHHHHHCCCEEEEECHH Q ss_conf 9909999739779985899999999879689995775 Q gi|254780768|r 242 GLAGIALEAGKSLVLEKELVKKHADEAGIFVCGIDRE 278 (281) Q Consensus 242 g~~giaiea~~~lild~~~~i~~a~~~~i~i~g~~~~ 278 (281) |++|||+|||+++++|+++++++||++||||+|++.+ T Consensus 243 GlaGIaieagr~lile~e~~l~~Ank~giFI~gi~~~ 279 (279) T COG3494 243 GLAGIAIEAGRVLILEREATLKLANKLGIFIVGIDRD 279 (279) T ss_pred CCCCEEEECCCEEEECCHHHHHHHHHCCEEEEEECCC T ss_conf 8211350039379965388898676578389961479 No 2 >pfam06230 DUF1009 Protein of unknown function (DUF1009). Family of uncharacterized bacterial proteins. Probab=100.00 E-value=0 Score=490.66 Aligned_cols=212 Identities=42% Similarity=0.661 Sum_probs=202.4 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHH Q ss_conf 99999999864954067702423210001110260357899999987510480358999999999769748211222524 Q gi|254780768|r 54 FCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVPE 133 (281) Q Consensus 54 ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ 133 (281) +|+++++||+++|+++||+|+| +||+|++++||+++...++++ ....+|||++|++|+++||++||+++++++|+|+ T Consensus 1 igkii~~Lk~~~i~~ivm~G~v-~rp~~~~~~~D~~~~~ll~~~--~~~~~gDd~lL~~i~~~fe~~G~~vi~~~~~l~~ 77 (212) T pfam06230 1 LGKLIKLLKAEGVTEVVMAGAV-KRPAFSPLRPDLRTLRLLPRL--AALRRGDDALLRAVIREFEEEGFKVVGAHEVAPD 77 (212) T ss_pred CHHHHHHHHHCCCCEEEEECCE-ECCCCCCCCCCHHHHHHHHHH--HHHHCCCHHHHHHHHHHHHHCCCEEECHHHHHHH T ss_conf 9789999998799989996253-578863438699999988762--6753597889999999999879899757896298 Q ss_pred HCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEE Q ss_conf 20354334554653234788998999999733267225999819748896253421799999997543123456677189 Q gi|254780768|r 134 LLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAGKSGVL 213 (281) Q Consensus 134 ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~~il 213 (281) +++++|++++++|++++++||++|+++++++|++||||||||++|+|+|+||+||||+||+|+++++++ +++|| T Consensus 78 ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVV~~g~VlavEa~EGTD~mI~R~~~l~~~------~~~vl 151 (212) T pfam06230 78 LLAPEGVLTRRAPDKSELADIARGAEVAKALGALDIGQAVVVKDGAVLAVEAIEGTDAMLRRVGELRRG------RGGVL 151 (212) T ss_pred HCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEEECCCCHHHHHHHHHHHCCC------CCEEE T ss_conf 644667525789898999999999999999734687416999899899995530689999999974568------88089 Q ss_pred EEECCCCCCCEEEECCCCHHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEE Q ss_conf 9934888753121066279999999984990999973977998589999999987968999 Q gi|254780768|r 214 VKMCKSQQDMRADLPSIGAKTVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEAGIFVCG 274 (281) Q Consensus 214 vK~~K~~QD~r~DlP~IG~~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~g 274 (281) ||+|||+||+|+|+|||||+|+++|++||++|||+||++|+++|+++++++||+||||||| T Consensus 152 vK~~K~~QD~R~DlPtIG~~Ti~~~~~ag~~giavea~~~li~d~~~~i~~A~~~~i~i~G 212 (212) T pfam06230 152 VKAPKPGQDLRFDLPTIGPDTIETAAEAGLAGIAVEAGKTLVLDREAVIAAADAAGLFVVG 212 (212) T ss_pred EECCCCCCCCCCCCCCCCHHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEC T ss_conf 8577878631247777489999999983995999966987998699999999983989959 No 3 >PRK00881 purH bifunctional phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase; Provisional Probab=97.42 E-value=0.0061 Score=40.23 Aligned_cols=115 Identities=23% Similarity=0.259 Sum_probs=81.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCC-----CHHHHHHHHHHHHHHCCCCCCCCEEE Q ss_conf 3345546532347889989999997332672259998197488962534-----21799999997543123456677189 Q gi|254780768|r 139 GSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIE-----GTDSMLQRIVDCRNNGRILAGKSGVL 213 (281) Q Consensus 139 G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~e-----GTD~mi~R~~~~~~~~~~~~~~~~il 213 (281) .+.|+++|++++++|..++|++++..-+=-| |+++|+..+++=+=+ -+.-.++++++... .-+|+|+ T Consensus 387 k~VT~~~p~~~e~~DL~FAwkv~k~vKSNAI---v~ak~~~tvGiGaGQ~sRVda~~iA~~ka~~~~~-----~~~g~vl 458 (514) T PRK00881 387 KVVTKRQPTEQELKDLLFAWKVVKHVKSNAI---VYAKDGQTVGIGAGQMSRVDSAKIALEKAGDAGL-----DLKGSVL 458 (514) T ss_pred EECCCCCCCHHHHHHHHHHHHHHHHCCCCEE---EEEECCCEEEECCCCCCHHHHHHHHHHHHHHHCC-----CCCCCEE T ss_conf 3113899898998656999999863665569---9994893898678975689999999999886345-----6678489 Q ss_pred EEECCCCCCCEEEECCCCHHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEE Q ss_conf 9934888753121066279999999984990999973977998589999999987968999 Q gi|254780768|r 214 VKMCKSQQDMRADLPSIGAKTVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEAGIFVCG 274 (281) Q Consensus 214 vK~~K~~QD~r~DlP~IG~~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~g 274 (281) . -|==|-+ ++.|+.++++|++.|+ +.|+++ .-+++|+.||++||..+= T Consensus 459 A------SDAFFPF----~D~ie~aa~~Gv~aIi-QPGGSi--rD~evI~aan~~gi~M~f 506 (514) T PRK00881 459 A------SDAFFPF----RDGVEAAAKAGITAII-QPGGSI--RDQEVIAAADEHGIAMVF 506 (514) T ss_pred E------ECCCCCC----CHHHHHHHHCCCEEEE-CCCCCC--CHHHHHHHHHHCCCEEEE T ss_conf 9------5677887----4479999985990998-999753--639999999970987998 No 4 >COG0138 PurH AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) [Nucleotide transport and metabolism] Probab=96.98 E-value=0.0049 Score=40.89 Aligned_cols=117 Identities=23% Similarity=0.330 Sum_probs=78.3 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCC---C--CCCCC Q ss_conf 354334554653234788998999999733267225999819748896253421799999997543123---4--56677 Q gi|254780768|r 136 VQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGR---I--LAGKS 210 (281) Q Consensus 136 ~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~---~--~~~~~ 210 (281) ....+.|++.||+++++|..++|++++..-+--| |+.+|+..+++ |--.| .|+..-+-... . ..-.| T Consensus 385 ~~~~vVTkr~pt~~e~~dl~Fawkv~k~vKSNaI---v~akd~~TvGi----GaGQ~-sRV~sariA~~kA~~~~~~~~G 456 (515) T COG0138 385 AELKVVTKRQPTEQELEDLLFAWKVVKHVKSNAI---VYAKDGQTVGI----GAGQM-SRVDSARIAAEKAGDAGLRAHG 456 (515) T ss_pred CCEEEECCCCCCHHHHHHHHHHHHHHHHHCCCEE---EEECCCEEEEE----CCCCC-CHHHHHHHHHHHHHHHHHHCCC T ss_conf 0106742789998999999999999867034539---99709838875----68851-1588999999987776531268 Q ss_pred EEEEEECCCCCCCEEEECCCCHHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEE Q ss_conf 189993488875312106627999999998499099997397799858999999998796899 Q gi|254780768|r 211 GVLVKMCKSQQDMRADLPSIGAKTVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEAGIFVC 273 (281) Q Consensus 211 ~ilvK~~K~~QD~r~DlP~IG~~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~ 273 (281) +|+- -|-=|-+ +++|..++++|++.|. +.|+++ .-+++|+.||+|||..+ T Consensus 457 av~A------SDAFFPF----~D~i~~aA~aGi~aII-qPGGSi--rD~eVI~aAde~giaMv 506 (515) T COG0138 457 AVLA------SDAFFPF----PDGIDAAAKAGIKAII-QPGGSI--RDQEVIAAADEHGIAMV 506 (515) T ss_pred CEEE------ECCCCCC----CCHHHHHHHCCCEEEE-CCCCCC--CCHHHHHHHHHCCCEEE T ss_conf 1775------2355788----6139999974986997-799866--64999998886196799 No 5 >PRK07106 5-aminoimidazole-4-carboxamide ribonucleotide transformylase; Provisional Probab=96.71 E-value=0.014 Score=37.92 Aligned_cols=121 Identities=17% Similarity=0.218 Sum_probs=74.7 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHH---------HHHHHHHHH--HC--CCC Q ss_conf 34554653234788998999999733267225999819748896253421799---------999997543--12--345 Q gi|254780768|r 140 SLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSM---------LQRIVDCRN--NG--RIL 206 (281) Q Consensus 140 ~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~m---------i~R~~~~~~--~~--~~~ 206 (281) +-+.+.|++++++|+.++|++++..-+=-| |+++|+..+++ |+-.| +++++...- +. ..+ T Consensus 203 v~~~k~~~~e~~~dL~FAwkv~K~vKSNAI---Vlakn~~tiGI----GaGQ~SRVDsv~iA~~KA~~~~~r~~p~~~~l 275 (391) T PRK07106 203 PTKNKILPDEAKRDLIIALITLKYTQSNSV---CYAKDGQAIGI----GAGQQSRIHCTRLAGNKADIWYLRQHPKVLNL 275 (391) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCCEE---EEEECCCEEEE----CCCCCCHHHHHHHHHHHHHHHHHHHCHHHHCC T ss_conf 014899998999999999999976456639---99968928986----78864689999999999999876426565243 Q ss_pred CCCCEE-----------------------------EEEECCC------------C------CCCEEEECCCCHHHHHHHH Q ss_conf 667718-----------------------------9993488------------8------7531210662799999999 Q gi|254780768|r 207 AGKSGV-----------------------------LVKMCKS------------Q------QDMRADLPSIGAKTVQNVI 239 (281) Q Consensus 207 ~~~~~i-----------------------------lvK~~K~------------~------QD~r~DlP~IG~~Ti~~~~ 239 (281) +.+.++ |.+.|-| - -|== +|- ++.|+.++ T Consensus 276 ~~~~~~~~~e~~n~~~~~~~~d~~~~~~~~~~~~~f~~~~~~l~~~e~~~~~~~~~g~vlASDAF--FPF--~D~Ie~aa 351 (391) T PRK07106 276 PFKEGIRRPDRDNTIDVYISDDYMDVLADGVWEQFFTRKPEPLTREEKRAWLDTLKGVALGSDAF--FPF--GDNIERAH 351 (391) T ss_pred HHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCEEEECCC--CCC--CHHHHHHH T ss_conf 03320001333347888740110012445566654203653000777776650258808991157--887--56899999 Q ss_pred HCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEE Q ss_conf 84990999973977998589999999987968999 Q gi|254780768|r 240 KAGLAGIALEAGKSLVLEKELVKKHADEAGIFVCG 274 (281) Q Consensus 240 ~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~g 274 (281) ++|.+.|+ +.|+++ --+++|+.||++||..+= T Consensus 352 ~~GV~aIi-QPGGSi--rD~eVI~aane~gi~Mvf 383 (391) T PRK07106 352 KSGVSYIA-QPGGSI--RDDHVIETCDKYNITMAF 383 (391) T ss_pred HCCCEEEE-CCCCCC--CHHHHHHHHHHHCCEEEE T ss_conf 85994998-799741--509999999986988998 No 6 >COG2910 Putative NADH-flavin reductase [General function prediction only] Probab=94.80 E-value=0.12 Score=31.59 Aligned_cols=152 Identities=13% Similarity=0.046 Sum_probs=86.6 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCC--------------------CCCCCEEE--EE---------- Q ss_conf 39999779803899999999789978999954866731--------------------03885678--73---------- Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFD--------------------WQDFECRE--LP---------- 50 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~--------------------~~~~~~~~--~~---------- 50 (281) |||||+-+|.--..+++++..+|+++..+.-.....+. +..++... +. T Consensus 2 KIaiIgAsG~~Gs~i~~EA~~RGHeVTAivRn~~K~~~~~~~~i~q~Difd~~~~a~~l~g~DaVIsA~~~~~~~~~~~~ 81 (211) T COG2910 2 KIAIIGASGKAGSRILKEALKRGHEVTAIVRNASKLAARQGVTILQKDIFDLTSLASDLAGHDAVISAFGAGASDNDELH 81 (211) T ss_pred EEEEEECCCHHHHHHHHHHHHCCCEEEEEEECHHHCCCCCCCEEECCCCCCHHHHHHHHCCCCEEEEECCCCCCCHHHHH T ss_conf 07899537456799999998679804899807676652235302000222745667663587669972157888715778 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCHH--HHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHH Q ss_conf 89999999999864954067702423210001--1102603578999999875104803589999999997697482112 Q gi|254780768|r 51 LGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQ--DLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAH 128 (281) Q Consensus 51 ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~--~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~ 128 (281) ......++..|+..++.+++.+|+.. ---++ ....| .|++.. -...... .....|. .|..+.= ++++ T Consensus 82 ~k~~~~li~~l~~agv~RllVVGGAG-SL~id~g~rLvD-~p~fP~--ey~~~A~-~~ae~L~----~Lr~~~~--l~WT 150 (211) T COG2910 82 SKSIEALIEALKGAGVPRLLVVGGAG-SLEIDEGTRLVD-TPDFPA--EYKPEAL-AQAEFLD----SLRAEKS--LDWT 150 (211) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCCC-CEEECCCCEEEC-CCCCCH--HHHHHHH-HHHHHHH----HHHHCCC--CCEE T ss_conf 88999999998615970599984742-058768845505-899856--6779998-7789999----9863567--6459 Q ss_pred HCCHHHCCCCCCCCCCCC-----------CHHHHHHHHHHHHHHHHHC Q ss_conf 225242035433455465-----------3234788998999999733 Q gi|254780768|r 129 EIVPELLVQVGSLGTCVP-----------NRDVKRDILAAMKSAEALS 165 (281) Q Consensus 129 ~~l~~ll~~~G~l~~~~p-----------~~~~~~dI~~g~~i~~~l~ 165 (281) .+.|..+..+|.-|+..- .++....-+|++.++.++- T Consensus 151 fvSPaa~f~PGerTg~yrlggD~ll~n~~G~SrIS~aDYAiA~lDe~E 198 (211) T COG2910 151 FVSPAAFFEPGERTGNYRLGGDQLLVNAKGESRISYADYAIAVLDELE 198 (211) T ss_pred EECCHHHCCCCCCCCCEEECCCEEEECCCCCEEEEHHHHHHHHHHHHH T ss_conf 967178457765568567636357774888503448999999998774 No 7 >PRK05647 purN phosphoribosylglycinamide formyltransferase; Reviewed Probab=94.61 E-value=0.15 Score=30.89 Aligned_cols=76 Identities=18% Similarity=0.211 Sum_probs=50.6 Q ss_pred CCEEEE-EECCCCHHHHHHHHHHHCCCCEEEE-EECCCCCCC-C-----CCCCEEEEEHH------HH-HHHHHHHHHCC Q ss_conf 973999-9779803899999999789978999-954866731-0-----38856787389------99-99999998649 Q gi|254780768|r 1 MKRLLI-IAGSGMLPYYVAKAARLKNDEPVIA-SVLNECSFD-W-----QDFECRELPLG------DF-CVLRSILHQYN 65 (281) Q Consensus 1 M~kigI-IAG~G~LP~~ia~~~~~~g~~~~ii-~l~~~~~~~-~-----~~~~~~~~~ig------~i-g~li~~Lk~~~ 65 (281) |+|+|+ ++|+|.==..+.++......+.-+. .+.+..+.. + .+.++..++-. ++ ..+++.|++++ T Consensus 1 Mkkiavl~SG~GSNl~aii~a~~~~~l~~~i~~Visn~~~a~~l~~A~~~~Ip~~~i~~~~~~~r~~~d~~i~~~L~~~~ 80 (200) T PRK05647 1 MKRIVVLASGNGSNLQAIIDACAAGQLPAEVVAVISDRPDAYGLERAEQAGIPTFVLDHKDFPSREAFDAALVEALDAYE 80 (200) T ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHCC T ss_conf 97899999158044999999987599994899999789753665999975997699677667998999999999998649 Q ss_pred CCEEEEECCCC Q ss_conf 54067702423 Q gi|254780768|r 66 IGRIVVAGAID 76 (281) Q Consensus 66 i~~ivmaG~V~ 76 (281) ++-+|+||++. T Consensus 81 vDlIvLAGyMr 91 (200) T PRK05647 81 PDLVVLAGFMR 91 (200) T ss_pred CCEEEECCHHH T ss_conf 99999988133 No 8 >TIGR00639 PurN phosphoribosylglycinamide formyltransferase; InterPro: IPR004607 This family describes phosphoribosylglycinamide formyltransferase (GAR transformylase), one of several proteins found in formyltransferase. This enzyme uses formyl tetrahydrofolate as a formyl group donor to produce 5'-phosphoribosyl-N-formylglycinamide. PurT, a different GAR transformylase, uses ATP and formate rather than formyl tetrahydrofolate. Experimental proof includes complementation of Escherichia coli purN mutants by orthologs from vertebrates (where it is a domain of a multifunctional protein), Bacillus subtilis, and Arabidopsis thaliana. In phylogenetic analyses, the member from Saccharomyces cerevisiae shows a long branch length but membership in the family, while the formyltetrahydrofolate deformylases form a closely related outgroup. ; GO: 0004644 phosphoribosylglycinamide formyltransferase activity, 0006189 'de novo' IMP biosynthetic process. Probab=94.03 E-value=0.21 Score=29.91 Aligned_cols=72 Identities=19% Similarity=0.313 Sum_probs=49.7 Q ss_pred EEE-EEECCCCHHHHHHHHHHH--CCCCEEEEEECCCC-CCC----------------CCCCCE------------EEEE Q ss_conf 399-997798038999999997--89978999954866-731----------------038856------------7873 Q gi|254780768|r 3 RLL-IIAGSGMLPYYVAKAARL--KNDEPVIASVLNEC-SFD----------------WQDFEC------------RELP 50 (281) Q Consensus 3 kig-IIAG~G~LP~~ia~~~~~--~g~~~~ii~l~~~~-~~~----------------~~~~~~------------~~~~ 50 (281) ||+ ||+|+|+.=..++++++. .+++-.|..+.... +.. .++|+. ..-. T Consensus 2 ~i~VLiSG~GSNLqAiiDA~~~~~~~~~A~v~lVisn~p~A~g~~RA~~~~iP~~v~~~~~f~sGiakeDkaarakar~~ 81 (215) T TIGR00639 2 RIVVLISGSGSNLQAIIDAIKEGQGKIDASVVLVISNKPKAYGLERAAQAGIPTKVLSLKDFDSGIAKEDKAARAKAREA 81 (215) T ss_pred EEEEEEECCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHH T ss_conf 36899842871799999998521278673689998658662667778875894677421678877531016788887899 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 899999999998649540677024232 Q gi|254780768|r 51 LGDFCVLRSILHQYNIGRIVVAGAIDR 77 (281) Q Consensus 51 ig~ig~li~~Lk~~~i~~ivmaG~V~k 77 (281) +.+ .|.+.|+.++++=|||||.| | T Consensus 82 ~d~--~i~~~l~~~~vdlvvlAGfM-R 105 (215) T TIGR00639 82 FDQ--AIAEELKAAEVDLVVLAGFM-R 105 (215) T ss_pred HHH--HHHHHHHHHCCCEEEECCHH-H T ss_conf 999--99999996099699984614-4 No 9 >PRK09288 purT phosphoribosylglycinamide formyltransferase 2; Validated Probab=92.01 E-value=0.74 Score=26.19 Aligned_cols=70 Identities=20% Similarity=0.141 Sum_probs=43.7 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEE Q ss_conf 97399997798038999999997899789999548667310388567873899999999998649540677 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVV 71 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivm 71 (281) ++|||| .|+|.|-+.++.++++-|++++++.-...+....-...+...++-+...+.++.++.+++-|.+ T Consensus 12 ~kkIgI-lGgGQLg~Mla~aA~~LG~~vivld~~~d~PA~~vAd~~~~~~~~D~~al~~~a~~~~~DvvT~ 81 (395) T PRK09288 12 ATRVML-LGSGELGKEVAIEAQRLGVEVIAVDRYANAPAMQVAHRSHVIDMLDGDALRAVIEREKPDLIVP 81 (395) T ss_pred CCEEEE-ECCCHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHCCEEEECCCCCHHHHHHHHHHHCCCEEEE T ss_conf 888999-8988999999999998799899984899594467286579778789999999999838998997 No 10 >PRK06019 phosphoribosylaminoimidazole carboxylase ATPase subunit; Reviewed Probab=90.45 E-value=0.9 Score=25.63 Aligned_cols=62 Identities=23% Similarity=0.182 Sum_probs=38.8 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHH Q ss_conf 973999977980389999999978997899995486673103885678738999999999986 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQ 63 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~ 63 (281) +.+|||| |+|.|-+.++.++++-|+++.++.-...+....-...+..-++-+...+.++.++ T Consensus 7 ~~tIGIl-GgGQLgrMla~aA~~lG~~v~vldp~~~~PA~~va~~~~~~~~~D~~al~~fa~~ 68 (377) T PRK06019 7 GKTIGII-GGGQLGRMLALAAAPLGYKVIVLDPDADCPAAQVADRVIVADYDDVAALRELAEQ 68 (377) T ss_pred CCEEEEE-CCCHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHCCEEEECCCCCHHHHHHHHHC T ss_conf 9999998-7868999999999978998999848984984783784898688989999999845 No 11 >pfam07905 PucR Purine catabolism regulatory protein-like family. The bacterial proteins found in this family are similar to the purine catabolism regulatory protein expressed by Bacillus subtilis (PucR). PucR is thought to be a transcriptional activator involved in the induction of the purine degradation pathway, and may contain a LysR-like DNA-binding domain. It is similar to LysR-type regulators in that it represses its own expression. The other members of this family are also annotated as being putative regulatory proteins. Probab=89.34 E-value=0.82 Score=25.91 Aligned_cols=48 Identities=23% Similarity=0.366 Sum_probs=40.6 Q ss_pred HHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEEECHHH Q ss_conf 999999998499099997397799858999999998796899957754 Q gi|254780768|r 232 AKTVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEAGIFVCGIDREF 279 (281) Q Consensus 232 ~~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~g~~~~~ 279 (281) .+.++.++++|++|+++.-|..+--=.++++++||++++=|+-+..+. T Consensus 61 ~~~i~~L~~~g~agL~i~~g~~~~~iP~~~i~~a~~~~~Pli~iP~~~ 108 (122) T pfam07905 61 REFVRELAEAGAAGLGIKTGRYIPEIPEELIAAANRLGLPLIELPREV 108 (122) T ss_pred HHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHCCCCEEEECCCC T ss_conf 999999997896499994255546699999999997499779945978 No 12 >pfam01808 AICARFT_IMPCHas AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase EC:2.1.2.3 (AICARFT), this enzyme catalyses the formylation of AICAR with 10-formyl-tetrahydrofolate to yield FAICAR and tetrahydrofolate. The last step is catalysed by IMP (Inosine monophosphate) cyclohydrolase EC:3.5.4.10 (IMPCHase), cyclizing FAICAR (5-formylaminoimidazole-4-carboxamide ribonucleotide) to IMP. Probab=87.52 E-value=1.2 Score=24.69 Aligned_cols=41 Identities=17% Similarity=0.221 Sum_probs=22.0 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEE Q ss_conf 34554653234788998999999733267225999819748896 Q gi|254780768|r 140 SLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVAL 183 (281) Q Consensus 140 ~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~Viai 183 (281) +.|+++|++++++|..++|++++..-+=-| |+++|+..++| T Consensus 257 vVT~~~pt~~e~~dL~FAwkv~k~vKSNAI---v~ak~~~tvGi 297 (315) T pfam01808 257 VVTKRKPTEQELEDLLFAWKVVKHVKSNAI---VYAKNGMTVGI 297 (315) T ss_pred ECCCCCCCHHHHHHHHHHHHHHHHCCCCEE---EEEECCEEEEE T ss_conf 516889999999999999999977785679---99939938986 No 13 >pfam00289 CPSase_L_chain Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosynthesis of arginine and/or pyrimidines. The carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes is a heterodimer of a small and large chain. The small chain promotes the hydrolysis of glutamine to ammonia, which is used by the large chain to synthesize carbamoyl phosphate. See pfam00988. The small chain has a GATase domain in the carboxyl terminus. See pfam00117. Probab=85.86 E-value=1.5 Score=24.18 Aligned_cols=69 Identities=13% Similarity=0.093 Sum_probs=44.4 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCC--EEEE-------EHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 99977980389999999978997899995486673103885--6787-------3899999999998649540677024 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFE--CREL-------PLGDFCVLRSILHQYNIGRIVVAGA 74 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~--~~~~-------~ig~ig~li~~Lk~~~i~~ivmaG~ 74 (281) -|||+.|.....+++.+++.|.+++++.-....+....... .+.. +.-....+++..++++|. .++-|+ T Consensus 4 vLIanrGeiA~ri~rt~re~gi~~v~i~s~~d~~s~~~~~ad~~~~~~~~~~~~~Yl~~~~Ii~ia~~~~~~-aihpGy 81 (109) T pfam00289 4 VLVANRGEIAVRIIRALRELGIETVAVNSNPDTVSTHVRLADEAYFLGPGPASESYLNIERILDIAEKEGAD-AIHPGY 81 (109) T ss_pred EEEECCCHHHHHHHHHHHHCCCCEEEEECHHHCCCHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCC-EEECCC T ss_conf 999888799999999999869979999633441522566505765347998321113799999999981889-687797 No 14 >smart00798 AICARFT_IMPCHas AICARFT/IMPCHase bienzyme. This is a family of bifunctional enzymes catalysing the last two steps in de novo purine biosynthesis. The bifunctional enzyme is found in both prokaryotes and eukaryotes. The second last step is catalysed by 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase (AICARFT), this enzyme catalyses the formylation of AICAR with 10-formyl-tetrahydrofolate to yield FAICAR and tetrahydrofolate. The last step is catalysed by IMP (Inosine monophosphate) cyclohydrolase (IMPCHase), cyclizing FAICAR (5-formylaminoimidazole-4-carboxamide ribonucleotide) to IMP. Probab=85.45 E-value=1.8 Score=23.61 Aligned_cols=41 Identities=17% Similarity=0.279 Sum_probs=20.2 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEE Q ss_conf 34554653234788998999999733267225999819748896 Q gi|254780768|r 140 SLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVAL 183 (281) Q Consensus 140 ~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~Viai 183 (281) +.|.++|++++++|+.++|++++..-+--| |+++|+..+++ T Consensus 253 ~Vt~~~p~~~~~~dl~Fa~~v~k~vkSNaI---v~ak~~~tvGi 293 (311) T smart00798 253 VVTKRQPTEEELKDLLFAWKVVKHVKSNAI---VYAKDGQTVGI 293 (311) T ss_pred EECCCCCCHHHHHHHHHHHHHHHHCCCCEE---EEEECCEEEEE T ss_conf 807999899999999999999976778579---99958907986 No 15 >COG1432 Uncharacterized conserved protein [Function unknown] Probab=83.89 E-value=2.3 Score=22.94 Aligned_cols=85 Identities=21% Similarity=0.142 Sum_probs=41.3 Q ss_pred CCHHHHHHHHH--HHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEE------- Q ss_conf 80358999999--999769748211222524203543345546532347889989999997332672259998------- Q gi|254780768|r 105 GNAAILKASID--LLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVS------- 175 (281) Q Consensus 105 gDd~iL~~i~~--~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv------- 175 (281) ++..+=+...+ ....+||.+...... .+.. ....+-.-|+..+..+.....+-.+---|.+ T Consensus 55 ~~~~~~~~~~~~~~~~~~g~~v~~~~~~--~~~~--------~~~~k~~vDv~la~D~~~l~~~~~~D~ivl~SgD~DF~ 124 (181) T COG1432 55 GDIEIQEVFLNEYELRSDGFTVVTSNLD--GFTD--------LRITKGDVDVELAVDAMELADKKNVDTIVLFSGDGDFI 124 (181) T ss_pred CHHHHHHHHHHHHHHHCCCEEEEEECCC--CCCC--------CCCCCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCCHH T ss_conf 6155413441288873377189830433--3342--------11010354599999999974114678699974776528 Q ss_pred --------ECCEEEEEECCCCHHHHHHHHHHH Q ss_conf --------197488962534217999999975 Q gi|254780768|r 176 --------IGGRVVALEGIEGTDSMLQRIVDC 199 (281) Q Consensus 176 --------~~g~ViaiEa~eGTD~mi~R~~~~ 199 (281) .+++|.-+...++|-.-+.++++. T Consensus 125 p~v~~~~~~G~rv~v~~~~~~~s~~L~~~aD~ 156 (181) T COG1432 125 PLVEAARDKGKRVEVAGIEPMTSSDLRNAADY 156 (181) T ss_pred HHHHHHHHCCCEEEEEECCCCCHHHHHHHCCC T ss_conf 89999996698799994577687999863261 No 16 >PRK03202 6-phosphofructokinase; Provisional Probab=83.03 E-value=3.7 Score=21.51 Aligned_cols=31 Identities=23% Similarity=0.170 Sum_probs=25.3 Q ss_pred CCEEEEEECCCCHHH------HHHHHHHHCCCCEEEE Q ss_conf 973999977980389------9999999789978999 Q gi|254780768|r 1 MKRLLIIAGSGMLPY------YVAKAARLKNDEPVIA 31 (281) Q Consensus 1 M~kigIIAG~G~LP~------~ia~~~~~~g~~~~ii 31 (281) |+|+||+-+.|.=|- .+.+.+...|++++-+ T Consensus 2 ~kriaIltsGG~~pGmNa~ir~vv~~a~~~g~~v~G~ 38 (323) T PRK03202 2 MKKIGVLTSGGDAPGMNAAIRGVVRTAISEGLEVYGI 38 (323) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 8669999368886778999999999999789999999 No 17 >PRK13175 consensus Probab=81.55 E-value=3.2 Score=21.96 Aligned_cols=66 Identities=21% Similarity=0.153 Sum_probs=47.0 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCCCEEE Q ss_conf 97399997798038999999997899789999548667310388567873-89999999999864954067 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNIGRIV 70 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i~~iv 70 (281) |+||+||==...--..++++++..|.++.++.- ..++.+++...++ .|.++..++.|++.+..+.+ T Consensus 1 M~kI~IiD~g~gn~~si~~al~~lg~~~~i~~~----~~~i~~~d~lILPGvGsf~~~~~~l~~~~l~~~i 67 (206) T PRK13175 1 MPKIAVIDYGMGNLHSVCKALERLGAEPILTSD----PADLLAADALILPGVGAFDPAMQNLRSRGLIPPI 67 (206) T ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEECC----HHHHHCCCEEEECCCCCCHHHHHHHHHCCCHHHH T ss_conf 987999979976899999999987998999799----8998328928968998526788755531871689 No 18 >PRK11199 tyrA bifunctional chorismate mutase/prephenate dehydrogenase; Provisional Probab=79.54 E-value=5.3 Score=20.47 Aligned_cols=60 Identities=18% Similarity=0.079 Sum_probs=41.7 Q ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCC--CCCCCC--EEEEEHHHHHHHHHHH Q ss_conf 73999977980389999999978997899995486673--103885--6787389999999999 Q gi|254780768|r 2 KRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSF--DWQDFE--CRELPLGDFCVLRSIL 61 (281) Q Consensus 2 ~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~--~~~~~~--~~~~~ig~ig~li~~L 61 (281) +++.||.|.|.+=..+++.+...||++.++.-.+..+. .+.+.. ...+|+..-...++.+ T Consensus 99 ~~i~IIGG~G~mG~~F~~~f~~sGy~V~ild~~dw~~~~~~~~~advViVsVPI~~T~~VI~~l 162 (374) T PRK11199 99 RPVVIVGGKGQLGRLFAKMLTLSGYQVRILEKDDWDRADDILADAGMVIVSVPIHLTEEVIEKL 162 (374) T ss_pred CEEEEEECCCCHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHCCCEEEEEECHHHHHHHHHHC T ss_conf 7179980798277999999996798799616444534898871799899981458899999857 No 19 >PRK08655 prephenate dehydrogenase; Provisional Probab=79.08 E-value=4 Score=21.28 Aligned_cols=60 Identities=17% Similarity=0.112 Sum_probs=40.3 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCC----------------CCCC--CCEEEEEHHHHHHHHHHHH Q ss_conf 3999977980389999999978997899995486673----------------1038--8567873899999999998 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSF----------------DWQD--FECRELPLGDFCVLRSILH 62 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~----------------~~~~--~~~~~~~ig~ig~li~~Lk 62 (281) |++||.|.|.+-..+++.++..|+++.+..-...... ...+ .-...+|+......++.+. T Consensus 2 kI~IIGG~G~MG~~Fa~~f~~sGyeV~I~gRd~~k~~~va~~LGv~~~~~~~e~~~~advVIvsVPI~~T~~VI~~la 79 (441) T PRK08655 2 KISIIGGTGGLGKWFARFLKDKGYEVIVWGRDPKKGKEVAKELGVEYASDNIDAAKDGDIVIVSVPINVTEDVIREVA 79 (441) T ss_pred EEEEEECCCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHCCCCCCCHHHHHHCCCEEEEECCHHHHHHHHHHHH T ss_conf 799994798177999999986798899981573135678887386222447889724999999848899999999974 No 20 >COG0026 PurK Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) [Nucleotide transport and metabolism] Probab=77.90 E-value=5.9 Score=20.14 Aligned_cols=36 Identities=25% Similarity=0.232 Sum_probs=30.7 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 9739999779803899999999789978999954866 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNEC 37 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~ 37 (281) |.+|||| |+|.|-+..+.++..-|++++++.....+ T Consensus 1 ~~tvgIl-GGGQLgrMm~~aa~~lG~~v~vLdp~~~~ 36 (375) T COG0026 1 MKTVGIL-GGGQLGRMMALAAARLGIKVIVLDPDADA 36 (375) T ss_pred CCEEEEE-CCCHHHHHHHHHHHHCCCEEEEECCCCCC T ss_conf 9769997-68699999999998649779995699999 No 21 >CHL00188 hisH imidazole glycerol phosphate synthase subunit hisH; Provisional Probab=76.99 E-value=6.2 Score=19.97 Aligned_cols=65 Identities=22% Similarity=0.289 Sum_probs=49.0 Q ss_pred CCEEEEEE-CCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCCCEEE Q ss_conf 97399997-798038999999997899789999548667310388567873-89999999999864954067 Q gi|254780768|r 1 MKRLLIIA-GSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNIGRIV 70 (281) Q Consensus 1 M~kigIIA-G~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i~~iv 70 (281) |.+||||- |.|++-- +.++++..|.++.++.-. .++.+.+...+| +|.++..++.|++.+..+.+ T Consensus 1 M~~I~Iid~g~GNi~S-v~~al~~~g~~~~ii~~~----~~i~~~d~lILPGVGsf~~~m~~L~~~~l~~~i 67 (210) T CHL00188 1 MMKIGIIDYSMGNLHS-VSRAIQQAGQQPCIINSE----SELAQVHALVLPGVGSFDLAMKKLEKKGLITPI 67 (210) T ss_pred CCEEEEEECCCHHHHH-HHHHHHHCCCCEEEECCH----HHHHHCCEEEECCCCCHHHHHHHHHHCCCHHHH T ss_conf 9779999588478999-999999869987997999----999648968988867389999988630516999 No 22 >PRK10537 voltage-gated potassium channel; Provisional Probab=76.03 E-value=6.1 Score=20.05 Aligned_cols=137 Identities=17% Similarity=0.137 Sum_probs=68.3 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHH Q ss_conf 39999779803899999999789978999954866731038856787389999999999864954067702423210001 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQ 82 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~ 82 (281) +==||+|.|..-..+++++++.|.+++++.-..+.....++++.. .|+... -+.|++.++++.--+=- -.++ T Consensus 205 dH~IICGyG~~g~~v~~eL~~~g~~vVVI~~~~~~~~~~~~~~~I---~GDaTd-d~~L~kAgI~~Ak~VI~--~~~~-- 276 (356) T PRK10537 205 DHFIICGHSPLAINTYLGLRQRGQAVTVIVPLGEEHRLPDDADLI---PGDSSD-SAVLKKAGAARARAILA--LRDN-- 276 (356) T ss_pred CCEEEECCCHHHHHHHHHHHHCCCCEEEEECCCHHHCCCCCCCEE---ECCCCC-HHHHHHCCCHHHCEEEE--CCCC-- T ss_conf 968998988379999999997299979994580443020568879---508997-89998759345079888--0787-- Q ss_pred HHCCCHHHHHH--HHHHHHH----HHCCCCHHHHHHHHHHHHHCC-CEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHH Q ss_conf 11026035789--9999987----510480358999999999769-7482112225242035433455465323478899 Q gi|254780768|r 83 DLCFSIKDSLR--ISKMIWQ----LVSGGNAAILKASIDLLESYG-VSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDIL 155 (281) Q Consensus 83 ~l~~D~~~~~~--l~k~l~~----~~~~gDd~iL~~i~~~fe~~G-~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~ 155 (281) |-...+. ..|-+.. ...-.|..-. +-+..-| =+|+++..+-.++++ +.+++...+.+...+.- T Consensus 277 ----Da~NvfivLTAR~LnP~v~IVA~a~~~en~----~KlrrAGAD~VISP~~lGg~LLA--~~l~ge~i~~~~~~~~~ 346 (356) T PRK10537 277 ----DADNAFVVLAAKEMSSDVKTVAAVNDSKNL----EKIKRVHPDMIFSPQLLGSELLA--RTLNGETIDNDMITSML 346 (356) T ss_pred ----CHHHHHHHHHHHHHCCCCEEEEEECCHHHH----HHHHHHCCCEEECHHHHHHHHHH--HHHCCCCCCHHHHHHHH T ss_conf ----778999998499868997699996767579----99986289999868887899999--98458886789999998 Q ss_pred HH Q ss_conf 89 Q gi|254780768|r 156 AA 157 (281) Q Consensus 156 ~g 157 (281) ++ T Consensus 347 ~~ 348 (356) T PRK10537 347 FG 348 (356) T ss_pred HH T ss_conf 61 No 23 >PRK08462 biotin carboxylase; Validated Probab=76.00 E-value=6.6 Score=19.80 Aligned_cols=63 Identities=8% Similarity=0.100 Sum_probs=42.1 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE---------HHHHHHHHHHHHHCCCC Q ss_conf 9997798038999999997899789999548667310388567873---------89999999999864954 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP---------LGDFCVLRSILHQYNIG 67 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~---------ig~ig~li~~Lk~~~i~ 67 (281) =|||+.|.....++++|+..|.+.+.+.=....+...-......+. .-.+..+++..+..+|+ T Consensus 7 vLIANRGEIA~Ri~ra~~elgi~tVavys~~D~~a~~v~~ad~~~~ig~~~~~~sYln~~~Ii~~a~~~~~d 78 (446) T PRK08462 7 ILIANRGEIALRAIRTIQEMGKEAIAIYSTADKDALYLKYADAKICIGGAKSSESYLNIPAIISAAEIFEAD 78 (446) T ss_pred EEEECCHHHHHHHHHHHHHCCCCEEEEECHHHHCCCHHHHCCEEEECCCCCCCCCCCCHHHHHHHHHHHCCC T ss_conf 999896099999999999829969999376771676287598989828998433414899999999995919 No 24 >PRK05568 flavodoxin; Provisional Probab=75.95 E-value=6.2 Score=20.01 Aligned_cols=125 Identities=20% Similarity=0.253 Sum_probs=67.8 Q ss_pred CCEEEEEECC--C---CHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 9739999779--8---0389999999978997899995486673103885678738999999999986495406770242 Q gi|254780768|r 1 MKRLLIIAGS--G---MLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 1 M~kigIIAG~--G---~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V 75 (281) |+|+.|+=.+ | .....|++.++..|.++.+..+......++.+ .+.+..|.= T Consensus 1 M~kv~IvY~S~~GnTe~mA~~Iaegl~~~Gv~V~~~~~~~~~~~di~~-----------------------~d~i~~Gsp 57 (142) T PRK05568 1 MKKLNIIYWSGTGNTEAMANLIAEGAKENGLEVKLLNVSEASVDDVEN-----------------------ADVVALGSP 57 (142) T ss_pred CCEEEEEEECCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHH-----------------------CCEEEEECC T ss_conf 972999998998549999999999998669839999800499889973-----------------------996899477 Q ss_pred CC----------CCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCC Q ss_conf 32----------10001110260357899999987510480358999999999769748211222524203543345546 Q gi|254780768|r 76 DR----------RPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCV 145 (281) Q Consensus 76 ~k----------rP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~ 145 (281) +- .|.++.+...+++.. . -+|..+.-|+..-.+.+.+.+++.|++++.. ++...-. T Consensus 58 T~~~~~~~~~~~~~~~~~~~~~~~gK~-~--a~FGSyGWg~Ge~v~~~~e~l~~~g~~~v~~-----------~l~v~~~ 123 (142) T PRK05568 58 AMGCEVLEEGEMEPFVEEISGLVKGKK-T--LLFGSYGWGTGEWMEDWKERMEGYGANLVND-----------GLIVNNT 123 (142) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCE-E--EEEEECCCCCCHHHHHHHHHHHHCCCEEECC-----------CEEEEEC T ss_conf 648766773667999999886659998-9--9999434898739999999999769978178-----------6799607 Q ss_pred CCHHHHHHH-HHHHHHHH Q ss_conf 532347889-98999999 Q gi|254780768|r 146 PNRDVKRDI-LAAMKSAE 162 (281) Q Consensus 146 p~~~~~~dI-~~g~~i~~ 162 (281) |++++.+.. ++|.++++ T Consensus 124 P~~e~l~~c~e~G~~~A~ 141 (142) T PRK05568 124 PEGEGIEKCKALGKALAK 141 (142) T ss_pred CCHHHHHHHHHHHHHHHC T ss_conf 998999999999999953 No 25 >PRK13174 consensus Probab=75.56 E-value=6.8 Score=19.72 Aligned_cols=66 Identities=20% Similarity=0.263 Sum_probs=48.0 Q ss_pred CCEEEEE-ECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCCCEEE Q ss_conf 9739999-7798038999999997899789999548667310388567873-89999999999864954067 Q gi|254780768|r 1 MKRLLII-AGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNIGRIV 70 (281) Q Consensus 1 M~kigII-AG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i~~iv 70 (281) |++|||| -|.|++-- +.++++..|.++.++. ... .++.+.+...+| +|.++..++.|++.+..+++ T Consensus 1 Mk~I~IiDyG~gNi~S-v~~al~~~g~~~~~i~--~~~-~~i~~~d~lIlPGVGsf~~~m~~L~~~~~~~~i 68 (212) T PRK13174 1 MQTVAVIDYGMGNLHS-VAKALEHVGAGRVLVT--SDA-AVIREADRVVFPGVGAIRDCMAEIRRLGFDSLV 68 (212) T ss_pred CCEEEEEECCCHHHHH-HHHHHHHCCCCEEEEC--CCH-HHHHHCCEEEECCCCCHHHHHHHHHHCCHHHHH T ss_conf 9889999588107999-9999997499879983--898-999638979965888689999998876969999 No 26 >pfam07035 Mic1 Colon cancer-associated protein Mic1-like. This family represents the C-terminus (approximately 160 residues) of a number of proteins that resemble colon cancer-associated protein Mic1. Probab=75.02 E-value=3.9 Score=21.32 Aligned_cols=83 Identities=12% Similarity=0.113 Sum_probs=52.4 Q ss_pred EHHHHHHHHHHHHHCC-CC-EEE--EE--CCCCCCCCHHHHCC-CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 3899999999998649-54-067--70--24232100011102-603578999999875104803589999999997697 Q gi|254780768|r 50 PLGDFCVLRSILHQYN-IG-RIV--VA--GAIDRRPNVQDLCF-SIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGV 122 (281) Q Consensus 50 ~ig~ig~li~~Lk~~~-i~-~iv--ma--G~V~krP~~~~l~~-D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~ 122 (281) +..|+ .++-||+-+ .. +|+ |. |+| -|.++-.+- .........+.|-.+...+||.++-+|-+||++.+. T Consensus 76 p~~QL--alDMLkRL~~a~d~IievLL~kg~v--l~ALR~~r~~~~~d~v~~rkFLEAA~~t~D~~~f~sV~~FF~~rn~ 151 (167) T pfam07035 76 PLRQL--GVDMLRRLSLAHDEIVEVLLQKGQV--LDALRFARKVRGLNDVLARKFLEAANQTNDRQTFYAILRFFSERNA 151 (167) T ss_pred HHHHH--HHHHHHHHCCCHHHHHHHHHHCCCC--HHHHHHHHHCCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHHHHHCC T ss_conf 89999--9999988340629999999956861--8999999981887766889999999874867899999999998340 Q ss_pred EEECHHHCCHHHCC Q ss_conf 48211222524203 Q gi|254780768|r 123 SVVGAHEIVPELLV 136 (281) Q Consensus 123 ~vi~~~~~l~~ll~ 136 (281) ++-+...|.|..-. T Consensus 152 ~l~~~~~F~~~e~c 165 (167) T pfam07035 152 KLSLYPSFTPGLYT 165 (167) T ss_pred HHCCCCCCCCHHHC T ss_conf 43289999933002 No 27 >PRK13180 consensus Probab=74.81 E-value=6.8 Score=19.70 Aligned_cols=64 Identities=23% Similarity=0.269 Sum_probs=46.2 Q ss_pred CCEEEEEE-CCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCCCEE Q ss_conf 97399997-798038999999997899789999548667310388567873-8999999999986495406 Q gi|254780768|r 1 MKRLLIIA-GSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNIGRI 69 (281) Q Consensus 1 M~kigIIA-G~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i~~i 69 (281) |+||+||= |.|++- -+.++++..|.++.++.- ..++.+.+...+| +|.++...+.|++.+-.+. T Consensus 1 MkkI~IiDyg~gNi~-Sv~~al~~~g~~~~i~~~----~~~~~~~d~lIlPGVGsf~~~m~~L~~~~~~~~ 66 (209) T PRK13180 1 MKRVVVLDYGSGNLR-SAQRALERVGAEVEVTAD----PDAALNADGLVVPGVGAFAACMAGLRAVGGDRI 66 (209) T ss_pred CCEEEEEECCCCHHH-HHHHHHHHCCCCEEEECC----HHHHHHCCEEEECCCCCHHHHHHHHHHCCCHHH T ss_conf 977999978943899-999999986998999799----999953896899998855777776553691999 No 28 >pfam00551 Formyl_trans_N Formyl transferase. This family includes the following members. Glycinamide ribonucleotide transformylase catalyses the third step in de novo purine biosynthesis, the transfer of a formyl group to 5'-phosphoribosylglycinamide. Formyltetrahydrofolate deformylase produces formate from formyl- tetrahydrofolate. Methionyl-tRNA formyltransferase transfers a formyl group onto the amino terminus of the acyl moiety of the methionyl aminoacyl-tRNA. Inclusion of the following members is supported by PSI-blast. HOXX_BRAJA (P31907) contains a related domain of unknown function. PRTH_PORGI (P46071) contains a related domain of unknown function. Y09P_MYCTU (Q50721) contains a related domain of unknown function. Probab=74.66 E-value=7.1 Score=19.57 Aligned_cols=122 Identities=19% Similarity=0.058 Sum_probs=65.5 Q ss_pred EEEE-EECCCCHHHHHHHHHHHCCCCEEEEEECCC-CCCC------CCCCCEEEEEH-------HHHHHHHHHHHHCCCC Q ss_conf 3999-977980389999999978997899995486-6731------03885678738-------9999999999864954 Q gi|254780768|r 3 RLLI-IAGSGMLPYYVAKAARLKNDEPVIASVLNE-CSFD------WQDFECRELPL-------GDFCVLRSILHQYNIG 67 (281) Q Consensus 3 kigI-IAG~G~LP~~ia~~~~~~g~~~~ii~l~~~-~~~~------~~~~~~~~~~i-------g~ig~li~~Lk~~~i~ 67 (281) |||+ ++|+|..-..+.++++..+.+.-++.+..+ .+.. ..+.++..++- ..-..+++.|+..+++ T Consensus 2 kiavl~SG~Gsnl~~Il~a~~~~~l~~~I~~Visn~~~~~~~~~a~~~~ip~~~~~~~~~~~r~~~~~~~~~~l~~~~~D 81 (181) T pfam00551 2 KIAVLISGTGSNLQALLDALRKGGHEVEIVAVVTNKDKAAGLERAEQAGIPVEVFEHKNFTPRSQFDSELADSLAALAPD 81 (181) T ss_pred EEEEEECCCCHHHHHHHHHHHHCCCCCEEEEEEECCCCHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHHCCC T ss_conf 89999907966599999999819999889999958957288889998599989806778998346189999999974999 Q ss_pred EEEEECCCCC--CCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHH Q ss_conf 0677024232--100011102603578999999875104803589999999997697482112 Q gi|254780768|r 68 RIVVAGAIDR--RPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAH 128 (281) Q Consensus 68 ~ivmaG~V~k--rP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~ 128 (281) -+|++|.... ++.++..+. +..-.=+.+|- ..+|-+.+-.++.+--..-|.++.-.+ T Consensus 82 liv~~g~~~il~~~~~~~~~~--~~iN~HpslLP--~~~G~~~~~~ai~~g~~~~G~Tih~v~ 140 (181) T pfam00551 82 LIVLAGYMRILPPEFLQAFPG--KILNIHPSLLP--RFRGAAPIQRALEAGDKETGVTVHQVD 140 (181) T ss_pred EEEEECHHHHCCHHHHHHCCC--CEEEECCCCCC--CCCCCCHHHHHHHCCCCCCCEEEEEEE T ss_conf 999801633569789700668--87995854052--888905999999819986724899973 No 29 >PRK13173 consensus Probab=74.55 E-value=7.2 Score=19.56 Aligned_cols=75 Identities=21% Similarity=0.226 Sum_probs=53.6 Q ss_pred CCEEEEE-ECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCCCEEEEECCCCCC Q ss_conf 9739999-7798038999999997899789999548667310388567873-8999999999986495406770242321 Q gi|254780768|r 1 MKRLLII-AGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNIGRIVVAGAIDRR 78 (281) Q Consensus 1 M~kigII-AG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i~~ivmaG~V~kr 78 (281) |+||||| -|.|++-- +.++++..|.++.++.= .. ++.+.+...+| +|.++..++.|++.+..+.+-- .+..+ T Consensus 1 M~kI~IiDyg~gNi~S-v~~al~~lg~~~~i~~~--~~--~i~~~d~lILPGVGsF~~~m~~L~~~~l~~~i~~-~~~~k 74 (211) T PRK13173 1 MTKIALLDYGMGNLHS-ASKALSAVGAEVSITND--PK--VIAAADKIVFPGVGAMRDCMAGMHEAGIDEVVRQ-AIFNK 74 (211) T ss_pred CCEEEEEECCCHHHHH-HHHHHHHCCCCEEEECC--HH--HHHHCCEEEECCCCCHHHHHHHHHHCCHHHHHHH-HHCCC T ss_conf 9879999689449999-99999986998899679--99--9843896897577868999999886297999998-86299 Q ss_pred CCH Q ss_conf 000 Q gi|254780768|r 79 PNV 81 (281) Q Consensus 79 P~~ 81 (281) |.+ T Consensus 75 piL 77 (211) T PRK13173 75 PVM 77 (211) T ss_pred CEE T ss_conf 989 No 30 >pfam02844 GARS_N Phosphoribosylglycinamide synthetase, N domain. Phosphoribosylglycinamide synthetase catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by Phosphoribosylglycinamide synthetase is the ATP- dependent addition of 5-phosphoribosylamine to glycine to form 5'phosphoribosylglycinamide. This domain is related to the N-terminal domain of biotin carboxylase/carbamoyl phosphate synthetase (see pfam00289). Probab=74.35 E-value=7.3 Score=19.52 Aligned_cols=93 Identities=20% Similarity=0.254 Sum_probs=62.1 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCC-CEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHH Q ss_conf 997798038999999997899789999548667310388-5678738999999999986495406770242321000111 Q gi|254780768|r 6 IIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDF-ECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDL 84 (281) Q Consensus 6 IIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~-~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l 84 (281) |+-|+|-=-..++.++.+.....-+....++. -...+ .+..++..++..+.++.++++++ +|++|.= -|.. T Consensus 3 LvIGsGgREHAi~~~l~~s~~~~~l~~~pgN~--gi~~~~~~~~i~~~d~~~i~~~~~~~~id-lviiGPE--~pL~--- 74 (99) T pfam02844 3 LVVGSGGREHALAWKLAQSPRVEKVYVAPGNP--GTAQLAKNVNIDITDFEALADFAKEENID-LVVVGPE--APLV--- 74 (99) T ss_pred EEECCCHHHHHHHHHHHCCCCCCEEEEECCCC--HHHHHCEEECCCCCCHHHHHHHHHHCCCC-EEEECCC--HHHH--- T ss_conf 99878879999999996499977699958981--57774750145844799999999981974-9998960--6778--- Q ss_pred CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCH Q ss_conf 026035789999998751048035899999999976974821122252 Q gi|254780768|r 85 CFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVP 132 (281) Q Consensus 85 ~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~ 132 (281) ..++++|+++|+.+++++.-.. T Consensus 75 --------------------------~Gl~D~l~~~gi~vfGP~k~aA 96 (99) T pfam02844 75 --------------------------AGIVDALRAAGIPVFGPSKAAA 96 (99) T ss_pred --------------------------HHHHHHHHHCCCEEECCCHHHH T ss_conf --------------------------7889999868991799286887 No 31 >PRK13171 consensus Probab=73.45 E-value=7.6 Score=19.38 Aligned_cols=65 Identities=18% Similarity=0.111 Sum_probs=48.6 Q ss_pred CCEEEEEE-CCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCCCEEE Q ss_conf 97399997-798038999999997899789999548667310388567873-89999999999864954067 Q gi|254780768|r 1 MKRLLIIA-GSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNIGRIV 70 (281) Q Consensus 1 M~kigIIA-G~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i~~iv 70 (281) |++|+||= |.|++- -+.++++..|.++.++. +. .++.+.+...+| +|.++..++.|++.+..+-+ T Consensus 1 M~~I~IiDyg~gNi~-Sv~~al~~lg~~~~ii~--~~--~~i~~~~~lIlPGVGsf~~~m~~L~~~~~~~~i 67 (200) T PRK13171 1 MTDVALIDAGGANLG-SVRYALERLGVEARVVR--DA--AGLQGAQRVILPGVGAAPEAMARLRAQGLIEPL 67 (200) T ss_pred CCEEEEEECCCCHHH-HHHHHHHHCCCCEEEEC--CH--HHHHCCCEEEECCCCCHHHHHHHHHHCCCHHHH T ss_conf 987999968951799-99999998499879978--99--999529989989988789999999872126789 No 32 >PRK12743 acetoin dehydrogenase; Provisional Probab=71.24 E-value=7.8 Score=19.30 Aligned_cols=163 Identities=12% Similarity=0.025 Sum_probs=79.3 Q ss_pred CCEEEEEECCCC-HHHHHHHHHHHCCCCEEEEEECCCCCC-----CCCCC--CEE--EEE---HHHHHHHHHHHHHC--C Q ss_conf 973999977980-389999999978997899995486673-----10388--567--873---89999999999864--9 Q gi|254780768|r 1 MKRLLIIAGSGM-LPYYVAKAARLKNDEPVIASVLNECSF-----DWQDF--ECR--ELP---LGDFCVLRSILHQY--N 65 (281) Q Consensus 1 M~kigIIAG~G~-LP~~ia~~~~~~g~~~~ii~l~~~~~~-----~~~~~--~~~--~~~---ig~ig~li~~Lk~~--~ 65 (281) |+|++||-|.+. +=+.+++.+-+.|.++++....++... ..+.. .+. ..+ ..++..+++...++ + T Consensus 1 M~KValITGgs~GIG~a~a~~la~~Ga~V~i~~~~~~~~~~~~~~~~~~~g~~~~~~~~Dv~~~~~~~~~~~~~~~~~G~ 80 (253) T PRK12743 1 MAQVAIVTASDSGIGKACALLLAQQGFDIGITWHSDEEGAKETAEEVVSHGVRAEIVHLDLSNLPEGAQAIEKLIQRLGR 80 (253) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHCCC T ss_conf 99989990758899999999999879989997489979999999999945991899990489999999999999998199 Q ss_pred CCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC--EEECHHHCCHHHCCCCCCCCC Q ss_conf 540677024232100011102603578999999875104803589999999997697--482112225242035433455 Q gi|254780768|r 66 IGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGV--SVVGAHEIVPELLVQVGSLGT 143 (281) Q Consensus 66 i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~--~vi~~~~~l~~ll~~~G~l~~ 143 (281) ++-+|=.-.+..+..+.+...+. ..+.+ ...-++---+.+.++..+.++|. +++........ ...++.. . T Consensus 81 iDilVNnAG~~~~~~~~~~~~~~-----w~~~~-~vNl~~~f~~~~~~~~~m~k~~~~G~IVnisS~~~~-~~~~~~~-~ 152 (253) T PRK12743 81 LDVLVNNAGAMTKAPFLDMAFDE-----WRKIF-TVDVDGAFLCSQIAARHMVKQGQGGRIINITSVHEH-TPLPDAS-A 152 (253) T ss_pred CCEEEECCCCCCCCCCHHCCHHH-----HHHHH-HHHHHHHHHHHHHHHHHHHHCCCCCEEEEEEEHHHC-CCCCCCH-H T ss_conf 98999899899999800299999-----99999-998599999999999999975899638999636655-7889858-9 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCEEEE Q ss_conf 465323478899899999973326722599 Q gi|254780768|r 144 CVPNRDVKRDILAAMKSAEALSELDVGQSA 173 (281) Q Consensus 144 ~~p~~~~~~dI~~g~~i~~~l~~~DiGQsv 173 (281) ...+|. .=+.+...++.++++..|===+ T Consensus 153 Y~asKa--al~~ltk~lA~ela~~gIrVN~ 180 (253) T PRK12743 153 YTAAKH--ALGGLTKAMALELVEHKILVNA 180 (253) T ss_pred HHHHHH--HHHHHHHHHHHHHHHHCEEEEE T ss_conf 999999--9999999999997021929999 No 33 >TIGR02339 thermosome_arch thermosome, various subunits, archaeal; InterPro: IPR012714 Thermosome is the name given to the archaeal rather than eukaryotic form of the group II chaperonin (counterpart to the group I chaperonin, GroEL/GroES, in bacteria), a toroidal, ATP-dependent molecular chaperone that assists in the folding or refolding of nascent or denatured proteins. Various homologous subunits, one to five per archaeal genome, may be designated alpha, beta, etc., but phylogenetic analysis does not show distinct alpha subunit and beta subunit lineages traceable to ancient paralogs.; GO: 0005524 ATP binding, 0051082 unfolded protein binding, 0006457 protein folding. Probab=70.21 E-value=8 Score=19.25 Aligned_cols=76 Identities=22% Similarity=0.357 Sum_probs=66.8 Q ss_pred CCHHHHHHHHHHHHHCCCEEE----CHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHH-----HHHHHHHCCCCCEEEEEE Q ss_conf 803589999999997697482----11222524203543345546532347889989-----999997332672259998 Q gi|254780768|r 105 GNAAILKASIDLLESYGVSVV----GAHEIVPELLVQVGSLGTCVPNRDVKRDILAA-----MKSAEALSELDVGQSAVS 175 (281) Q Consensus 105 gDd~iL~~i~~~fe~~G~~vi----~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g-----~~i~~~l~~~DiGQsvVv 175 (281) --++.||..++.+..-|-+|+ |..++...+|++.|+|.-++..++|.+-+.++ .+-++.+..-|.|.+=-| T Consensus 266 qEe~~Lke~VD~i~~~GAnVVfCQKGIDD~Aq~yLak~GIlAvRRVKkSDi~~LarATGA~i~s~i~e~~~~DLG~A~lV 345 (522) T TIGR02339 266 QEEAMLKEMVDKIADAGANVVFCQKGIDDLAQHYLAKAGILAVRRVKKSDIEKLARATGAKIVSSIKEITESDLGYAELV 345 (522) T ss_pred HHHHHHHHHHHHHHHCCCCEEEECCCHHHHHHHHHHHCCEEEECCCCHHHHHHHHHHCCCCEECCHHHCCHHHCCCEEEE T ss_conf 99999999988999828987982288689999987105602204335337999987318841257311696367614226 Q ss_pred ECCEE Q ss_conf 19748 Q gi|254780768|r 176 IGGRV 180 (281) Q Consensus 176 ~~g~V 180 (281) ....| T Consensus 346 E~r~v 350 (522) T TIGR02339 346 EERKV 350 (522) T ss_pred EEEEE T ss_conf 86542 No 34 >CHL00194 ycf39 Ycf39; Provisional Probab=69.75 E-value=9.2 Score=18.84 Aligned_cols=34 Identities=18% Similarity=0.099 Sum_probs=29.2 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 3999977980389999999978997899995486 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNE 36 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~ 36 (281) +|-++.|.|.+=..+++.+..+|+++.++.-... T Consensus 2 ~ILV~GATG~lGr~vVr~Ll~~G~~Vr~lvRnp~ 35 (319) T CHL00194 2 SLLVIGATGTLGRQIVRRALDEGYQVKCLVRNLR 35 (319) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCCEEEEECCHH T ss_conf 7999899858999999999968890899957867 No 35 >PRK12384 sorbitol-6-phosphate dehydrogenase; Provisional Probab=69.38 E-value=8.3 Score=19.15 Aligned_cols=166 Identities=10% Similarity=0.098 Sum_probs=86.4 Q ss_pred CCEEEEEECCC-CHHHHHHHHHHHCCCCEEEEEECCCCCCC----CC----CCCEE--EE---EHHHHHHHHHHHHHC-- Q ss_conf 97399997798-03899999999789978999954866731----03----88567--87---389999999999864-- Q gi|254780768|r 1 MKRLLIIAGSG-MLPYYVAKAARLKNDEPVIASVLNECSFD----WQ----DFECR--EL---PLGDFCVLRSILHQY-- 64 (281) Q Consensus 1 M~kigIIAG~G-~LP~~ia~~~~~~g~~~~ii~l~~~~~~~----~~----~~~~~--~~---~ig~ig~li~~Lk~~-- 64 (281) |+|++||-|.+ -+=..+++.+-+.|.+++++....+.... +. ..... .. +-.++.++++...++ T Consensus 1 mnKvalITG~s~GIG~aia~~la~~Ga~V~i~~~~~~~~~~~~~~l~~~~~~~~~~~~~~Dv~~~~~v~~~~~~~~~~~G 80 (259) T PRK12384 1 MNKVAVVIGGGQTLGAFLCHGLAEEGYRVAVADINSEKAANVAQEINAEYGEGMAYGFGADATSEQSVLALSRGVDEIFG 80 (259) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHHHC T ss_conf 99789994688689999999999879999999798899999999998624886089998327999999999999999829 Q ss_pred CCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC--CEEECHHHCCHHHCCCCCCCC Q ss_conf 954067702423210001110260357899999987510480358999999999769--748211222524203543345 Q gi|254780768|r 65 NIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYG--VSVVGAHEIVPELLVQVGSLG 142 (281) Q Consensus 65 ~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G--~~vi~~~~~l~~ll~~~G~l~ 142 (281) .++-+|-.-.+.....+.++-.+. ..+.+ ...-.|--.+.+.+...+.+++ =+++.........-. ++. + T Consensus 81 ~iDilVnnAG~~~~~~~~~~~~~~-----~~~~~-~vNl~~~~~~~~~~~~~m~~~~~~G~Iv~isS~~~~~~~-~~~-~ 152 (259) T PRK12384 81 RVDLLVYSAGIAKAAKITDFQLGD-----FDRSL-QVNLVGYFLCAREFSRLMIRDGIQGRIIQINSKSGKVGS-KHN-S 152 (259) T ss_pred CCCEEEECCCCCCCCCHHHCCHHH-----HHHHH-HHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHCCCC-CCC-C T ss_conf 971999899777889914599999-----99999-988644223467763689973898459998352545588-543-0 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEE Q ss_conf 5465323478899899999973326722599981 Q gi|254780768|r 143 TCVPNRDVKRDILAAMKSAEALSELDVGQSAVSI 176 (281) Q Consensus 143 ~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~ 176 (281) ....+|... ..+-..++.++++..|==-+|.- T Consensus 153 ~Y~asK~al--~~ltk~lA~e~a~~gIrVN~I~P 184 (259) T PRK12384 153 GYSAAKFGG--VGLTQSLALDLAEYGITVHSLML 184 (259) T ss_pred HHHHHHHHH--HHHHHHHHHHHHHHCEEEEEEEC T ss_conf 679999999--99999999996231979999838 No 36 >PRK13172 consensus Probab=69.12 E-value=9.5 Score=18.75 Aligned_cols=65 Identities=23% Similarity=0.191 Sum_probs=49.8 Q ss_pred CCEEEEE-ECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCCCEEE Q ss_conf 9739999-7798038999999997899789999548667310388567873-89999999999864954067 Q gi|254780768|r 1 MKRLLII-AGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNIGRIV 70 (281) Q Consensus 1 M~kigII-AG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i~~iv 70 (281) |++|||| -|.|++-- +.++++..|.++.++.-.. ++.+.+...+| +|.++..++.|++.+..+.+ T Consensus 1 M~~I~IIDyG~GNi~S-v~~al~~lg~~~~i~~~~~----~i~~~d~lIlPGVGsF~~~m~~L~~~~l~~~i 67 (213) T PRK13172 1 MSSVSIVDYGVGNLLS-VARAFQYFDASVNLVSTPE----EIMSADRLVLPGVGAFEDGMKGLTTLNFIEPI 67 (213) T ss_pred CCEEEEEECCCHHHHH-HHHHHHHCCCCEEEECCHH----HHHHCCEEEECCCCCHHHHHHHHHHCCCHHHH T ss_conf 9879999789429999-9999998699889977999----98338978976878469999988723863789 No 37 >pfam03807 F420_oxidored NADP oxidoreductase coenzyme F420-dependent. Probab=68.31 E-value=8.7 Score=19.00 Aligned_cols=32 Identities=28% Similarity=0.367 Sum_probs=27.2 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECC Q ss_conf 399997798038999999997899789999548 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKNDEPVIASVLN 35 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~ 35 (281) |||+| |.|+.-..+++.+.++|++.+++.-++ T Consensus 1 KIg~I-G~G~mg~ai~~~l~~~g~~~~~~~~r~ 32 (93) T pfam03807 1 KIGII-GAGNMGEALARGLAAAGHEVIIANSRN 32 (93) T ss_pred CEEEE-CCHHHHHHHHHHHHHCCCCCEEEECCC T ss_conf 98999-700999999999997799612786487 No 38 >pfam03446 NAD_binding_2 NAD binding domain of 6-phosphogluconate dehydrogenase. The NAD binding domain of 6-phosphogluconate dehydrogenase adopts a Rossmann fold. Probab=68.18 E-value=5.5 Score=20.32 Aligned_cols=152 Identities=14% Similarity=0.043 Sum_probs=84.1 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEE-EHHHHHHHHHHHHHCCCCEEEEECCCCCCC Q ss_conf 9739999779803899999999789978999954866731038856787-389999999999864954067702423210 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECREL-PLGDFCVLRSILHQYNIGRIVVAGAIDRRP 79 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~-~ig~ig~li~~Lk~~~i~~ivmaG~V~krP 79 (281) |.|||+| |-|..=.-+|+.+.++|+++++..........+.+....+. ++.+ ..+ +++ +|+.==-+ .+ T Consensus 1 M~~Ig~I-GlG~MG~~ma~~L~~~g~~v~v~d~~~~~~~~~~~~g~~~~~s~~e------~~~--~~d-vIi~~l~~-~~ 69 (163) T pfam03446 1 MAKIGFI-GLGVMGSPMALNLLKAGYTVTVYNRTPEKVEELVAEGAVAAASPAE------AAA--SAD-VVITMVPA-GA 69 (163) T ss_pred CCEEEEE-EEHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHCCCEECCCHHH------HHH--CCC-EEEEECCC-HH T ss_conf 9889998-3679899999999977996999979788779999839955399999------986--199-99992587-14 Q ss_pred CHHHHCCCHHHHHH-H--HHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHH Q ss_conf 00111026035789-9--99998751048035899999999976974821122252420354334554653234788998 Q gi|254780768|r 80 NVQDLCFSIKDSLR-I--SKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILA 156 (281) Q Consensus 80 ~~~~l~~D~~~~~~-l--~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~ 156 (281) ...+.-+...+... + -+++ -.....+-...+.+.+.+++.|++.+++--.=...-+..|.++---- -+..+++. T Consensus 70 ~v~~V~~~~~gl~~~~~~g~ii-id~sT~~p~~~~~~~~~~~~~g~~~lDaPVsGg~~~A~~G~L~~m~g--G~~~~~~~ 146 (163) T pfam03446 70 AVDAVIFGEDGLLPGLKPGDII-IDGSTISPDDTRRLAKELKEKGIRFLDAPVSGGEEGAEAGTLSIMVG--GDEEAFER 146 (163) T ss_pred HHHEEECCCCCHHHHCCCCCEE-EECCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHCCCEEEEEC--CCHHHHHH T ss_conf 5402202631332313589889-86798999999999998753034434787768989994488799975--89999999 Q ss_pred HHHHHHHHCC Q ss_conf 9999997332 Q gi|254780768|r 157 AMKSAEALSE 166 (281) Q Consensus 157 g~~i~~~l~~ 166 (281) ...+++.+++ T Consensus 147 ~~p~l~~~~~ 156 (163) T pfam03446 147 VKPILEAMGA 156 (163) T ss_pred HHHHHHHHCC T ss_conf 9999999739 No 39 >PRK05642 DNA replication initiation factor; Validated Probab=68.00 E-value=9.1 Score=18.86 Aligned_cols=111 Identities=14% Similarity=0.137 Sum_probs=65.0 Q ss_pred CCCHHHHHHHHHHHCCCCEEEEEECCCCCC--C----CCCCCEEEEE-H----------HHHHHHHHHHHHCCCCEEEEE Q ss_conf 980389999999978997899995486673--1----0388567873-8----------999999999986495406770 Q gi|254780768|r 10 SGMLPYYVAKAARLKNDEPVIASVLNECSF--D----WQDFECRELP-L----------GDFCVLRSILHQYNIGRIVVA 72 (281) Q Consensus 10 ~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~--~----~~~~~~~~~~-i----------g~ig~li~~Lk~~~i~~ivma 72 (281) .-+|=..++..+...+...+.+.+....+. + ++.++...+. + ..+-.+++.++..+. +++|+ T Consensus 58 KTHLL~A~~~~~~~~~~~~~yl~~~~~~~~~~~~~~~l~~~d~l~IDDi~~i~g~~~~e~~lF~l~N~~~~~~~-~llit 136 (234) T PRK05642 58 RSHLLQAACLRFEQRGEPAVYLPLAELLDRGPELLDNLEQYELVCIDDLDVIAGKADWEEALFHLFNRLRDSGR-RLLLA 136 (234) T ss_pred HHHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHCCEEEEECHHHHCCCHHHHHHHHHHHHHHHHCCC-EEEEE T ss_conf 89999999999980799679978999875449998624227989893645546885999999999999998399-59995 Q ss_pred CCCCCCCC-HHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 24232100-01110260357899999987510480358999999999769748 Q gi|254780768|r 73 GAIDRRPN-VQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSV 124 (281) Q Consensus 73 G~V~krP~-~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~v 124 (281) +.. .|. +....+|+++=+.. -..+.+..-.|+....++....+..|+.+ T Consensus 137 s~~--~P~~l~~~l~DL~SRl~~-~~~~~i~~l~d~~~~~iL~~~a~~rgi~l 186 (234) T PRK05642 137 ASK--SPRELPVKLPDLKSRLTL-ALVFQMRGLSDEDKLRALQLRASRRGLHL 186 (234) T ss_pred CCC--CHHHHCCCHHHHHHHHHC-CCEEEECCCCHHHHHHHHHHHHHHCCCCC T ss_conf 787--955523001679999957-81275148998999999999977546899 No 40 >KOG3111 consensus Probab=65.86 E-value=8.1 Score=19.19 Aligned_cols=76 Identities=17% Similarity=0.120 Sum_probs=51.6 Q ss_pred ECCEEEEEECCCCH----HHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEEECCCCHHHHHHHHHCCCEEEEEECC Q ss_conf 19748896253421----79999999754312345667718999348887531210662799999999849909999739 Q gi|254780768|r 176 IGGRVVALEGIEGT----DSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRADLPSIGAKTVQNVIKAGLAGIALEAG 251 (281) Q Consensus 176 ~~g~ViaiEa~eGT----D~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~DlP~IG~~Ti~~~~~ag~~giaiea~ 251 (281) ..-+|..||-=.|- ..|+..+..++.+. |+-|..+| --+||+|+..+++||.+.|+..++ T Consensus 136 D~vLvMtVePGFGGQkFme~mm~KV~~lR~ky---------------p~l~ieVD-GGv~p~ti~~~a~AGAN~iVaGsa 199 (224) T KOG3111 136 DMVLVMTVEPGFGGQKFMEDMMPKVEWLREKY---------------PNLDIEVD-GGVGPSTIDKAAEAGANMIVAGSA 199 (224) T ss_pred CEEEEEEECCCCCHHHHHHHHHHHHHHHHHHC---------------CCCEEEEC-CCCCCCHHHHHHHCCCCEEEECCE T ss_conf 57999985489750457899989999999868---------------98438854-886821377998758887986333 Q ss_pred CEEEECHHHHHHHHHH Q ss_conf 7799858999999998 Q gi|254780768|r 252 KSLVLEKELVKKHADE 267 (281) Q Consensus 252 ~~lild~~~~i~~a~~ 267 (281) -.=--|+.++|+.... T Consensus 200 vf~a~d~~~vi~~lr~ 215 (224) T KOG3111 200 VFGAADPSDVISLLRN 215 (224) T ss_pred EECCCCHHHHHHHHHH T ss_conf 4527998999999999 No 41 >PRK08903 hypothetical protein; Validated Probab=65.43 E-value=11 Score=18.27 Aligned_cols=112 Identities=13% Similarity=0.062 Sum_probs=61.3 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEEEECCCCCCC--CCCCCEE------EEE---HHHHHHHHHHHHHCCCCEEEEECCCCC Q ss_conf 79803899999999789978999954866731--0388567------873---899999999998649540677024232 Q gi|254780768|r 9 GSGMLPYYVAKAARLKNDEPVIASVLNECSFD--WQDFECR------ELP---LGDFCVLRSILHQYNIGRIVVAGAIDR 77 (281) Q Consensus 9 G~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~--~~~~~~~------~~~---ig~ig~li~~Lk~~~i~~ivmaG~V~k 77 (281) |.-+|=..++..+...+..+..+......... ....... .+. =..+-.+++.++.++-..+++++.- T Consensus 54 GKTHLl~a~~~~~~~~~~~~~yl~~~~~~~~~~~~~~~d~l~iDDi~~i~~~~q~~lF~l~N~~~~~~~~~ll~s~~~-- 131 (227) T PRK08903 54 GRSHLLQALVAAASEGGKNARYLDPASPLLAFDFDPRAELYAVDDVERLDDAQQIALFNLFNRVRAHGKTALLVAGPA-- 131 (227) T ss_pred CHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEEECCCC-- T ss_conf 888999999999980699749965110457774200189899964114895699999999999997299489971899-- Q ss_pred CCCHHHHCCCHHHHHHHHH-HHHHHHCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 1000111026035789999-9987510480358999999999769748 Q gi|254780768|r 78 RPNVQDLCFSIKDSLRISK-MIWQLVSGGNAAILKASIDLLESYGVSV 124 (281) Q Consensus 78 rP~~~~l~~D~~~~~~l~k-~l~~~~~~gDd~iL~~i~~~fe~~G~~v 124 (281) .|....+.+|+++ |+.. +...+..-.|+.....+.+.++..|+.+ T Consensus 132 ~p~~l~~~~DL~S--Rl~~gl~~~i~~pdde~~~~iL~~~a~~rgl~l 177 (227) T PRK08903 132 APLALDVREDLRT--RLGWGLVYEVKPLSDEDKIAALKAAAAERGLQL 177 (227) T ss_pred CHHHCCCCHHHHH--HHHCCCEEEEECCCHHHHHHHHHHHHHHCCCCC T ss_conf 7120120089999--993897389979799999999999999629999 No 42 >TIGR01324 cysta_beta_ly_B cystathionine beta-lyase; InterPro: IPR006233 Cystathionine beta-lyase (alternate name: beta-cystathionase) is one of several pyridoxal-dependent enzymes of cysteine, methionine, and homocysteine metabolism. This enzyme is involved in the biosynthesis of Met from Cys .; GO: 0004121 cystathionine beta-lyase activity, 0006520 amino acid metabolic process, 0005737 cytoplasm. Probab=65.41 E-value=11 Score=18.27 Aligned_cols=56 Identities=29% Similarity=0.356 Sum_probs=45.7 Q ss_pred CCCCCCEEEECCCCHHHHHHHHHCCCEEEEEECCCEEEEC---HHHHHHHHH--HCCCEEE Q ss_conf 8887531210662799999999849909999739779985---899999999--8796899 Q gi|254780768|r 218 KSQQDMRADLPSIGAKTVQNVIKAGLAGIALEAGKSLVLE---KELVKKHAD--EAGIFVC 273 (281) Q Consensus 218 K~~QD~r~DlP~IG~~Ti~~~~~ag~~giaiea~~~lild---~~~~i~~a~--~~~i~i~ 273 (281) |-+=|.++==|-|||+-|..+++.+=+.|.+||=.++-.| =+...+.|+ ..|+-++ T Consensus 117 r~gv~~~YYDP~igPe~ia~Li~pnTkv~flEaP~S~TfE~qDIPaIakaA~kv~~g~~~m 177 (389) T TIGR01324 117 RLGVEVEYYDPKIGPEDIAALIKPNTKVVFLEAPASLTFEVQDIPAIAKAARKVAAGVVVM 177 (389) T ss_pred HHCEEEEEECCCCCHHHHHHHCCCCCEEEEEECCCCCEECCCCHHHHHHHHHHCCCCEEEE T ss_conf 6344588727886888898740888448998378745103588468999986418982999 No 43 >PRK06202 hypothetical protein; Provisional Probab=65.24 E-value=8.4 Score=19.10 Aligned_cols=30 Identities=20% Similarity=0.129 Sum_probs=26.6 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 999779803899999999789978999954 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVL 34 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~ 34 (281) =|=+|+|++|+.+++.+++.|++.-+.++. T Consensus 67 DlGcG~Gdl~~~lar~a~~~g~~l~v~GiD 96 (233) T PRK06202 67 DLGCGGGDLAIDLARWARRDGLRLEVTAID 96 (233) T ss_pred EECCCCCHHHHHHHHHHHHCCCCEEEEEEC T ss_conf 834787579999999997559963899977 No 44 >TIGR01285 nifN nitrogenase molybdenum-iron cofactor biosynthesis protein NifN; InterPro: IPR005975 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I.; GO: 0005515 protein binding, 0006461 protein complex assembly, 0009399 nitrogen fixation. Probab=63.21 E-value=12 Score=18.00 Aligned_cols=68 Identities=16% Similarity=0.113 Sum_probs=56.0 Q ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEE Q ss_conf 739999779803899999999789978999954866731038856787389999999999864954067 Q gi|254780768|r 2 KRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIV 70 (281) Q Consensus 2 ~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~iv 70 (281) +|+| ||+.++|=..++..+...|.++.-+.=.-..++.+++..+.|+.+|++|-+-....+++.+=+| T Consensus 329 ~k~A-iAaePDlL~~~~~~~~~~G~~i~a~VttT~~s~~L~k~P~e~V~iGDLeDLe~La~~A~AdLli 396 (451) T TIGR01285 329 KKVA-IAAEPDLLAALATLLTEMGAQIVAAVTTTGNSPLLEKLPVETVKIGDLEDLEDLAREAGADLLI 396 (451) T ss_pred EEEE-EECCHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHCCCCEEEECCHHHHHHHHHHCCCCEEE T ss_conf 0232-0168478999999998579853021227767378762873457316868999886317950773 No 45 >pfam02254 TrkA_N TrkA-N domain. This domain is found in a wide variety of proteins. These protein include potassium channels, phosphoesterases, and various other transporters. This domain binds to NAD. Probab=62.15 E-value=10 Score=18.49 Aligned_cols=28 Identities=25% Similarity=0.203 Sum_probs=24.2 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 99779803899999999789978999954 Q gi|254780768|r 6 IIAGSGMLPYYVAKAARLKNDEPVIASVL 34 (281) Q Consensus 6 IIAG~G~LP~~ia~~~~~~g~~~~ii~l~ 34 (281) +|+|.|.+-..+++.+.+.| +++++... T Consensus 2 iI~G~g~~G~~la~~L~~~~-~v~vId~d 29 (115) T pfam02254 2 IIIGYGRVGRSLAEELREGG-PVVVIDKD 29 (115) T ss_pred EEECCCHHHHHHHHHHHHCC-CEEEEECC T ss_conf 99878889999999998089-99999998 No 46 >PRK05784 phosphoribosylamine--glycine ligase; Provisional Probab=61.36 E-value=13 Score=17.78 Aligned_cols=126 Identities=19% Similarity=0.104 Sum_probs=75.9 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCC------CCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCC Q ss_conf 9977980389999999978997899995486673103------8856787389999999999864954067702423210 Q gi|254780768|r 6 IIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQ------DFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRP 79 (281) Q Consensus 6 IIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~------~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP 79 (281) ||-|+|-=-..++..+.+...-.-+....++.++-.. +..+..++..++..+.++.++++| +++++|.= -| T Consensus 4 LVIGsGGREHAla~kl~~Sp~v~~l~~aPGn~NpG~~~~a~~~~~~~~~~~~~d~~~i~~fa~~~~I-DLvvVGPE--~P 80 (485) T PRK05784 4 LLVGDGAREHAIAEALAKSPKGYRIYALSSHLNPGIEEIVKKTGGEYFIGNPTSPGEVVKAAEEVSP-DLVVIGPE--EP 80 (485) T ss_pred EEECCCHHHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHCCCCEECCCCCCHHHHHHHHHHCCC-CEEEECCH--HH T ss_conf 9989888999999999609898989997389875256776425872120487799999999998199-99998973--88 Q ss_pred CHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHH----------HCCHHHCCCCCCCCCCCCCHH Q ss_conf 0011102603578999999875104803589999999997697482112----------225242035433455465323 Q gi|254780768|r 80 NVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAH----------EIVPELLVQVGSLGTCVPNRD 149 (281) Q Consensus 80 ~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~----------~~l~~ll~~~G~l~~~~p~~~ 149 (281) .. ..+++.|+++|+.+.+++ .|.+++|...|+-+... -. T Consensus 81 L~-----------------------------~Gi~D~l~~~gi~vFGP~k~aA~LE~SK~FaK~fm~~~~IPtta~--~~ 129 (485) T PRK05784 81 LF-----------------------------AGVADALREEGFPVFGASSKCAEIEKSKVFARSLMWKYGIPGRLR--YG 129 (485) T ss_pred HH-----------------------------HHHHHHHHHCCCCEECCCHHHHHHHCCHHHHHHHHHHCCCCCCCC--CC T ss_conf 76-----------------------------125899973799078968899414127889999999728996640--12 Q ss_pred HHHHHHHHHHHHHHHC Q ss_conf 4788998999999733 Q gi|254780768|r 150 VKRDILAAMKSAEALS 165 (281) Q Consensus 150 ~~~dI~~g~~i~~~l~ 165 (281) ...|.+-+.+.++..+ T Consensus 130 ~f~~~~~A~~~l~~~~ 145 (485) T PRK05784 130 VFKDVEEAYSFAEYGG 145 (485) T ss_pred CCCCHHHHHHHHHHCC T ss_conf 1389999999986489 No 47 >PRK06444 prephenate dehydrogenase; Provisional Probab=59.92 E-value=14 Score=17.62 Aligned_cols=60 Identities=15% Similarity=0.132 Sum_probs=44.0 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 3999977980389999999978997899995486673103885678738999999999986495406770242 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V 75 (281) +.+||.|.|.|-+.+++-+..+|+++++- ..++-+..+|+.+.-.+++ +.+ +.++=+..| T Consensus 2 ~~~iig~~gr~g~~~~~~~~~~g~~v~i~---------k~D~ifiaVPI~~~~~iI~---~~~-~tiiDv~SV 61 (197) T PRK06444 2 MEIIIGKNGRLGRVLCSILDDNGLGVYIK---------KADHAFLSVPIDAALNYIE---SYD-NNFVEISSV 61 (197) T ss_pred CEEEECCCCCHHHHHHHHHCCCCCEEEEC---------CCCEEEEEEEHHHHHHHHH---HCC-CEEEEEEEC T ss_conf 16996378716689999970378478972---------7888999824588899998---479-738998863 No 48 >TIGR03420 DnaA_homol_Hda DnaA regulatory inactivator Hda. Members of this protein family are Hda (Homologous to DnaA). These proteins are about half the length of DnaA and homologous over length of Hda. In the model species Escherichia coli, the initiation of DNA replication requires DnaA bound to ATP rather than ADP; Hda helps facilitate the conversion of DnaA-ATP to DnaA-ADP. Probab=59.82 E-value=14 Score=17.61 Aligned_cols=113 Identities=12% Similarity=0.120 Sum_probs=62.3 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEEEECCCCCC------CCCCCCEEEEE-H----------HHHHHHHHHHHHCCCCEEEE Q ss_conf 7980389999999978997899995486673------10388567873-8----------99999999998649540677 Q gi|254780768|r 9 GSGMLPYYVAKAARLKNDEPVIASVLNECSF------DWQDFECRELP-L----------GDFCVLRSILHQYNIGRIVV 71 (281) Q Consensus 9 G~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~------~~~~~~~~~~~-i----------g~ig~li~~Lk~~~i~~ivm 71 (281) |.-+|-..++.++..++..++.+......+. .+++.+...+. + ..+-.+++.++.++. ++++ T Consensus 50 GKTHLl~a~~~~~~~~~~~~~yl~~~~~~~~~~~~l~~l~~~d~l~iDDi~~i~~~~~~e~~lF~l~N~~~~~~~-~ili 128 (226) T TIGR03420 50 GKSHLLQAACAAAEERGKSAIYLPLAELAQADPEVLEGLEQADLVCLDDVEAIAGQPEWQEALFHLYNRVREAGG-RLLI 128 (226) T ss_pred CHHHHHHHHHHHHHCCCCCEEEECHHHHHHHHHHHHHHCCCCCEEEEECHHHHCCCHHHHHHHHHHHHHHHHHCC-EEEE T ss_conf 889999999999862699579952999877539999727448999996633343783789999999999986528-2898 Q ss_pred ECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 02423210001110260357899999987510480358999999999769748 Q gi|254780768|r 72 AGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSV 124 (281) Q Consensus 72 aG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~v 124 (281) ++... +..+.-..+|+++=+.. -+......-.|+.....+.+.++..|+++ T Consensus 129 ts~~~-p~~l~~~l~dL~SRl~~-~~~~~I~~pdd~~~~~iL~k~~~~r~i~i 179 (226) T TIGR03420 129 AGRAA-PAQLPLRLPDLRTRLAW-GLVFQLPPLSDEEKIAALQSRAARRGLQL 179 (226) T ss_pred ECCCC-HHHCCCCHHHHHHHHHC-CCEEECCCCCHHHHHHHHHHHHHHCCCCC T ss_conf 67888-23203201779999968-85685279999999999999999859988 No 49 >COG3250 LacZ Beta-galactosidase/beta-glucuronidase [Carbohydrate transport and metabolism] Probab=58.43 E-value=15 Score=17.45 Aligned_cols=62 Identities=13% Similarity=0.115 Sum_probs=52.1 Q ss_pred CCEEEEEECCCCCCCEEEECCCCHH----HHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEE Q ss_conf 7718999348887531210662799----99999984990999973977998589999999987968999 Q gi|254780768|r 209 KSGVLVKMCKSQQDMRADLPSIGAK----TVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEAGIFVCG 274 (281) Q Consensus 209 ~~~ilvK~~K~~QD~r~DlP~IG~~----Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~g 274 (281) ...|++|..---+|.-.=-++.+.+ .+++|+++|++.|=+. -.=+.+++.++||++||++|- T Consensus 297 Gkpvf~kGvnrHe~~~~~G~~~~~~~~~~dl~lmk~~n~N~vRts----HyP~~~~~ydLcDelGllV~~ 362 (808) T COG3250 297 GKPVFIRGVNRHEDDPILGRVTDEDAMERDLKLMKEANMNSVRTS----HYPNSEEFYDLCDELGLLVID 362 (808) T ss_pred CEEEEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEEC----CCCCCHHHHHHHHHHCCEEEE T ss_conf 868998610046687765545477789999999997389878725----888988999998874958997 No 50 >pfam06189 5-nucleotidase 5'-nucleotidase. This family consists of both eukaryotic and prokaryotic 5'-nucleotidase sequences (EC:3.1.3.5). Probab=57.86 E-value=4.6 Score=20.88 Aligned_cols=22 Identities=18% Similarity=0.342 Sum_probs=12.7 Q ss_pred HHHHHHHHC--CCCEEEEECCCCC Q ss_conf 999999864--9540677024232 Q gi|254780768|r 56 VLRSILHQY--NIGRIVVAGAIDR 77 (281) Q Consensus 56 ~li~~Lk~~--~i~~ivmaG~V~k 77 (281) ++++..+.+ .|++.+|.|+=+. T Consensus 51 RvfnSI~hygL~I~Ra~Ft~G~sp 74 (263) T pfam06189 51 RVFNSINHYGLDITRAAFTGGESP 74 (263) T ss_pred HHHHHHHHCCCCCCEEEECCCCCH T ss_conf 999989972987313554189882 No 51 >TIGR00423 TIGR00423 conserved hypothetical protein TIGR00423; InterPro: IPR005244 This entry describes a family of conserved hypothetical proteins with no known function. . Probab=57.80 E-value=14 Score=17.59 Aligned_cols=38 Identities=16% Similarity=0.225 Sum_probs=32.9 Q ss_pred CCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHH Q ss_conf 8856787389999999999864954067702423210001 Q gi|254780768|r 43 DFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQ 82 (281) Q Consensus 43 ~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~ 82 (281) +-+-++++..+|-+-.+...+.||++|++.|++. |.++ T Consensus 30 ~~~~Y~Ls~eEI~~Kv~ea~~~G~tE~~iQGGln--P~~~ 67 (331) T TIGR00423 30 DKDAYVLSLEEILRKVKEAVAKGATEICIQGGLN--PQLD 67 (331) T ss_pred CCCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCC--CCCC T ss_conf 9888140779999999999971982788523427--8876 No 52 >TIGR03023 WcaJ_sugtrans Undecaprenyl-phosphate glucose phosphotransferase. Colanic acid biosynthesis utilizes a glucose-undecaprenyl carrier, knockout of EpsB abolishes incorporation of UDP-glucose into the lipid phase and the C-terminal portion of GumD has been shown to be responsible for the glucosyl-1-transferase activity. Probab=57.79 E-value=15 Score=17.38 Aligned_cols=72 Identities=14% Similarity=0.151 Sum_probs=48.1 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCC-CEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 399997798038999999997899-7899995486673103885678738999999999986495406770242 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKND-EPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~-~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V 75 (281) +=-+|-|+|+--..+++.+.+... ...++++-+....... .....-.+|.+..+.+.++++++++|+.+-.- T Consensus 129 rrvLIIG~g~~~~~l~~~l~~~~~~G~~vvG~vdd~~~~~~-~~~~~pvlg~~~~l~~~i~~~~ideViia~~~ 201 (451) T TIGR03023 129 RRVLIVGAGELGRRLAERLARNPELGYRVVGFFDDRPDART-GVRGVPVLGKLDDLEELIREGEVDEVYIALPL 201 (451) T ss_pred CEEEEEECCHHHHHHHHHHHHCCCCCEEEEEEECCCCCCCC-CCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC T ss_conf 54999968689999999997194368489999838854444-45799736989999999996799889995483 No 53 >PRK06928 pyrroline-5-carboxylate reductase; Reviewed Probab=56.92 E-value=15 Score=17.34 Aligned_cols=25 Identities=20% Similarity=0.251 Sum_probs=23.2 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCC Q ss_conf 97399997798038999999997899 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKND 26 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~ 26 (281) |+|||+| |.|+.-..+++.+...|. T Consensus 1 M~kI~fI-G~GnMg~Aii~Gll~~~~ 25 (275) T PRK06928 1 MEKIGFI-GYGSMADMIATKLLETEV 25 (275) T ss_pred CCEEEEE-CCCHHHHHHHHHHHHCCC T ss_conf 9889998-678999999999997899 No 54 >PRK10124 putative UDP-glucose lipid carrier transferase; Provisional Probab=56.38 E-value=16 Score=17.23 Aligned_cols=70 Identities=13% Similarity=0.012 Sum_probs=49.4 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCC-CEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 399997798038999999997899-78999954866731038856787389999999999864954067702423 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKND-EPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAID 76 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~-~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~ 76 (281) |=.+|.|.|..-..+++..+.... -.-++++.+...+.-...+ -+|++..+.+..++++|++|..+-... T Consensus 145 r~v~IvG~~~~~~~l~~~i~~~p~~G~~vvG~~dd~~~~~~~~~----~lG~~~~l~~~~~~~~id~V~ialp~~ 215 (464) T PRK10124 145 RMVAVAGDLPAGQMLLESFRNQPWLGFEVVGVYHDPKPGGVSND----WAGNLQQLVEDAKAGKIHNVYIAMSMC 215 (464) T ss_pred EEEEEECCCHHHHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCC----CCCCHHHHHHHHHHCCCCEEEEECCCC T ss_conf 38999838889999999997296679669999668876667899----878999999999868999899966823 No 55 >PTZ00331 alpha/beta hydrolase; Provisional Probab=56.20 E-value=5.9 Score=20.11 Aligned_cols=42 Identities=19% Similarity=0.242 Sum_probs=23.5 Q ss_pred HHHHHHHHCCCEEEEEECCCEEEEC--HHHHHHHHHHCCCEEEE Q ss_conf 9999999849909999739779985--89999999987968999 Q gi|254780768|r 233 KTVQNVIKAGLAGIALEAGKSLVLE--KELVKKHADEAGIFVCG 274 (281) Q Consensus 233 ~Ti~~~~~ag~~giaiea~~~lild--~~~~i~~a~~~~i~i~g 274 (281) .|...+.+.|+++.+++..-.=+-. -++.++.-.++|+-++. T Consensus 168 ~Ta~DA~~~Gf~v~v~~D~~~~v~p~~~~~al~em~~~Gv~ii~ 211 (222) T PTZ00331 168 FTALDAVKEGFDVVVLEDATRAVDPAAWSKRRAELKKAGVVLAD 211 (222) T ss_pred HHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCEEEE T ss_conf 99999998799799962352035867899999999976999977 No 56 >PRK12446 N-acetylglucosaminyl transferase; Reviewed Probab=56.10 E-value=16 Score=17.20 Aligned_cols=75 Identities=9% Similarity=0.059 Sum_probs=47.6 Q ss_pred CCEEEEEECCCC---HHH-HHHHHHHHCCCCEEEEEECCCCCCC---CCCCCEEEEEH---------HH----------H Q ss_conf 973999977980---389-9999999789978999954866731---03885678738---------99----------9 Q gi|254780768|r 1 MKRLLIIAGSGM---LPY-YVAKAARLKNDEPVIASVLNECSFD---WQDFECRELPL---------GD----------F 54 (281) Q Consensus 1 M~kigIIAG~G~---LP~-~ia~~~~~~g~~~~ii~l~~~~~~~---~~~~~~~~~~i---------g~----------i 54 (281) |+|+-|.||..- +|- .+|+.++++|+++..+.-....+.. ..+++...++. .. + T Consensus 1 MkkIii~~GGTGGHi~Palala~~L~~~~~~v~~ig~~~g~e~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~l~~~~ 80 (352) T PRK12446 1 MKKIVFTGGGSAGHVTPNLAIIPKLIEDNWDISYIGSHQGIEKTIIEKENIPYYSISSGKLRRYFDLKNIKDPFLVMKGV 80 (352) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 98799995875888999999999998489959999889605430445049968995447727855299999999999999 Q ss_pred HHHHHHHHHCCCCEEEEECCC Q ss_conf 999999986495406770242 Q gi|254780768|r 55 CVLRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 55 g~li~~Lk~~~i~~ivmaG~V 75 (281) -+.+.+|++.+.+-|+--|+. T Consensus 81 ~~s~~il~~~kPd~Vig~GGY 101 (352) T PRK12446 81 MDAYVRIRKLKPDVIFSKGGF 101 (352) T ss_pred HHHHHHHHHCCCCEEEECCCH T ss_conf 999999996399999974987 No 57 >PRK13148 consensus Probab=55.72 E-value=16 Score=17.16 Aligned_cols=67 Identities=18% Similarity=0.156 Sum_probs=48.2 Q ss_pred CCEEEEE-ECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCCCEEE Q ss_conf 9739999-7798038999999997899789999548667310388567873-89999999999864954067 Q gi|254780768|r 1 MKRLLII-AGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNIGRIV 70 (281) Q Consensus 1 M~kigII-AG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i~~iv 70 (281) |++|||| -|.|++- -+.++++..|++.-+....+. .++.+.+...+| +|.++..++.|++.+..+.+ T Consensus 1 M~~I~IIDyg~GNl~-Sv~~al~~~g~~~~~~i~~~~--~~i~~~d~lILPGVGsF~~am~~L~~~~l~~~i 69 (225) T PRK13148 1 MSTIAIVDYGMGNFH-SVARALQHAAPDADIRICNRP--EQIDAADRVVFPGQGAMPDCMRTLNESGLRAAV 69 (225) T ss_pred CCEEEEEECCCHHHH-HHHHHHHHHCCCCCEEEECCH--HHHHHCCEEEECCCCCHHHHHHHHHHCCHHHHH T ss_conf 987999979944999-999999985789739998999--999549979989978879999998864879999 No 58 >PRK12367 short chain dehydrogenase; Provisional Probab=54.00 E-value=17 Score=16.97 Aligned_cols=199 Identities=17% Similarity=0.162 Sum_probs=109.2 Q ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCC-CCCEEEE--EHHHHHHHHHHHHHCCCCEEEEECCCCCC Q ss_conf 73999977980389999999978997899995486673103-8856787--38999999999986495406770242321 Q gi|254780768|r 2 KRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQ-DFECREL--PLGDFCVLRSILHQYNIGRIVVAGAIDRR 78 (281) Q Consensus 2 ~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~-~~~~~~~--~ig~ig~li~~Lk~~~i~~ivmaG~V~kr 78 (281) +++||=.++|.|=..+.+.++.+|.+++-+.-......+.. +-+..|+ ..|+-..+-..|++ ++=+|+--+|. T Consensus 18 KtIgITGAsGaLG~AL~k~f~~~GakVIalTh~~~~~~~~~~~~p~~wi~W~cG~E~~L~~~Lkk--iDILILNHGIn-- 93 (250) T PRK12367 18 KRIGITGASGALGKALTKLFRAKGAKVIGLTHSKINNSEISDESPNEWVSWECGKESSLDKTLAK--IDVLILNHGIN-- 93 (250) T ss_pred CEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEEEECCCHHHHHHHHHH--CCEEEECCCCC-- T ss_conf 87999678738999999999988998999836888875455678952898434998999999875--88999838777-- Q ss_pred CCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 00011102603578999999875104803589999999997697482112225242035433455465323478899899 Q gi|254780768|r 79 PNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAM 158 (281) Q Consensus 79 P~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~ 158 (281) |. .+ .. +..+.+...-+.-+.+|. .+.||+--. ..+ + ..+..+.-.. | ...|.-|. T Consensus 94 ~~-~~-----~~----~~~i~~s~EINalS~~Rl-lelF~~~~~---~~~----~-~~~kEiWvNT--S---EAEi~PA~ 149 (250) T PRK12367 94 PG-GR-----QN----SNDINKALEINALSSWRL-LQLFEDIAL---NNN----S-QIPKEIWVNT--S---EAEIQPAL 149 (250) T ss_pred CC-CC-----CC----HHHHHHHHHHHHHHHHHH-HHHHHHHHH---CCC----C-CCCCEEEEEC--C---HHHCCCCC T ss_conf 45-56-----59----789999998777879999-999999973---665----5-5783588615--1---66415433 Q ss_pred HHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEEECCCCH---H-- Q ss_conf 99997332672259998197488962534217999999975431234566771899934888753121066279---9-- Q gi|254780768|r 159 KSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRADLPSIGA---K-- 233 (281) Q Consensus 159 ~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~DlP~IG~---~-- 233 (281) +-+.++|+-=+||-|-.+....- .+. .+.+++-|..= --.|=+|--||. + T Consensus 150 sP~YEiSKrliGqLVslr~n~ld----~~~-------------------~~~~~irKliL--GPFkS~LNPiGiMs~~~V 204 (250) T PRK12367 150 SPVYEISKRLIGQLVSLKKNLLD----KNE-------------------RKKLIIRKLIL--GPFRSELNPIGIMSADFV 204 (250) T ss_pred CCHHHHHHHHHCCEEEEECCCCC----CCC-------------------CCCCEEEEEEC--CCCCCCCCCCCCCCHHHH T ss_conf 80378789874031155401145----222-------------------56845899842--665567887522288999 Q ss_pred --HHHHHHHCCCEEEEEECCCE Q ss_conf --99999984990999973977 Q gi|254780768|r 234 --TVQNVIKAGLAGIALEAGKS 253 (281) Q Consensus 234 --Ti~~~~~ag~~giaiea~~~ 253 (281) -|-..++.+++-|.|..+-. T Consensus 205 A~~I~~~a~~~~~lIIVtpNPl 226 (250) T PRK12367 205 AKQIVDQANLGLYLIIVTPNPL 226 (250) T ss_pred HHHHHHHHHCCCCEEEECCCCC T ss_conf 9999999844895699837872 No 59 >TIGR03025 EPS_sugtrans exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase. Certain closely related transferase enzymes such as Sinorhizobium ExoY and Lactococcus EpsD lack the N-terminal domain and are not found by this model. Probab=53.97 E-value=17 Score=16.97 Aligned_cols=71 Identities=17% Similarity=0.192 Sum_probs=47.2 Q ss_pred EEEEECCCCHHHHHHHHHHHCCC-CEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 99997798038999999997899-7899995486673103885678738999999999986495406770242 Q gi|254780768|r 4 LLIIAGSGMLPYYVAKAARLKND-EPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 4 igIIAG~G~LP~~ia~~~~~~g~-~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V 75 (281) =.+|-|+|.--..+++.++.+.. ...++++-+..... .......--+|.+..+.+.++++++++|+.+... T Consensus 127 rvlIIG~g~~~~~l~~~l~~~~~~g~~vvG~~dd~~~~-~~~~~~~p~lg~~~~l~~~i~~~~ideViIa~p~ 198 (445) T TIGR03025 127 RVLIVGTGELAEELAAALSRNPDLGYRVVGFVDDRPED-RVEVAGLPVLGKLDDLVELVRAHRVDEVIIALPL 198 (445) T ss_pred EEEEEECCHHHHHHHHHHHHCCCCCEEEEEEECCCCCC-CCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCC T ss_conf 39999084899999999982846884899997787544-4556788601889999999997699889996586 No 60 >pfam09035 Tn916-Xis Excisionase from transposon Tn916. The phage-encoded excisionase protein Tn916-Xis adopts a winged-helix structure that consists of a three-stranded anti-parallel beta-sheet that packs against a helix-turn-helix (HTH) motif and a third C-terminal alpha-helix. It is encoded for by Tn916, which also codes for the integrase Tn916-Int. The protein interacts with DNA by the insertion of helix alpha-2 into the major groove and the contact of the hairpin that connects strands beta-2 and beta-3 with the adjacent phosphodiester backbone and/or minor groove. Tn916-Xis stimulates phage excision and inhibits viral integration by stabilising distorted DNA structures. Probab=53.25 E-value=18 Score=16.90 Aligned_cols=40 Identities=15% Similarity=0.297 Sum_probs=25.5 Q ss_pred CCCHHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHC Q ss_conf 6279999999984990999973977998589999999987 Q gi|254780768|r 229 SIGAKTVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEA 268 (281) Q Consensus 229 ~IG~~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~ 268 (281) -||.++++.+++..-.+-++.-|.-.++-++++-++.|+. T Consensus 24 ~IG~~klR~lae~~~~~fvl~~Gnr~lIkR~~FEk~ld~~ 63 (66) T pfam09035 24 GIGENKLRSLAENPNCEFVLWVGTKILIKRKEFEKYIDTL 63 (66) T ss_pred CCCHHHHHHHHHCCCCCEEEEECCEEEEEHHHHHHHHHHH T ss_conf 4039999999858899879996888888799999999863 No 61 >TIGR03234 OH-pyruv-isom hydroxypyruvate isomerase. This enzyme interconverts tartronate semi-aldehyde (TSA, aka 2-hydroxy 3-oxopropionate) and hydroxypyruvate. The E. coli enzyme has been characterized and found to be specific for TSA, contain no cofactors, and have a rather high Km for hydroxypyruvate of 12.5 mM. The gene is ofter found in association with glyoxalate carboligase (which produces TSA), but has been shown to have no effect on growth on glyoxalate when knocked out. This is consistent with the fact that the gene for tartronate semialdehyde reductase (glxR) is also associated and may have primary responsibility for the catabolism of TSA. Probab=52.84 E-value=18 Score=16.85 Aligned_cols=149 Identities=9% Similarity=0.042 Sum_probs=73.9 Q ss_pred HHHHHHHHHHHCCCCEE-EEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCC Q ss_conf 99999999986495406-77024232100011102603578999999875104803589999999997697482112225 Q gi|254780768|r 53 DFCVLRSILHQYNIGRI-VVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIV 131 (281) Q Consensus 53 ~ig~li~~Lk~~~i~~i-vmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l 131 (281) .+.+.+++.+..+|..+ +++|... +..+. ....+.+. + -|+.++.+.++.|+++.= .. T Consensus 85 ~~~~ai~~a~~lg~~~i~~~~G~~~--~~~~~--------e~~~~~~~------e--~l~~~a~~a~~~Gv~l~i-Ep-- 143 (254) T TIGR03234 85 GVALAIAYARALGCPQVNCLAGKRP--AGVSP--------EEARATLV------E--NLRYAADALDRIGLTLLI-EP-- 143 (254) T ss_pred HHHHHHHHHHHHCCCEEEECCCCCC--CCCCH--------HHHHHHHH------H--HHHHHHHHHHHCCCEEEE-EE-- T ss_conf 3899999999968986886756888--89998--------99999999------9--999999999965988988-63-- Q ss_pred HHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCC--CC Q ss_conf 2420354334554653234788998999999733267225999819748896253421799999997543123456--67 Q gi|254780768|r 132 PELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILA--GK 209 (281) Q Consensus 132 ~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~--~~ 209 (281) ++...-........+-+..+++.++.-.+| +...=++ +.++.-. --+.|++.++...+-.... ++ T Consensus 144 ---------ln~~e~~g~~i~t~~~a~~lv~~v~~p~v~--l~~D~~H-~~~~~~d-~~~~i~~~~~~I~hvHiaD~~~R 210 (254) T TIGR03234 144 ---------INSFDMPGFFLTTTEQALAVIDDVGRENLK--LQYDLYH-MQRMGGD-LARTLAAYAPHIGHVQIADNPGR 210 (254) T ss_pred ---------CCHHCCCCCHHCCHHHHHHHHHHHCCCCEE--EEEECHH-HHHCCCC-HHHHHHHCCCCCCEEECCCCCCC T ss_conf ---------562116986107999999999980877656--8887056-5744889-99999970371568981799999 Q ss_pred CEEEEEECCCCCCCEEEECCCCHHHHHHHHHCCCEEEE Q ss_conf 71899934888753121066279999999984990999 Q gi|254780768|r 210 SGVLVKMCKSQQDMRADLPSIGAKTVQNVIKAGLAGIA 247 (281) Q Consensus 210 ~~ilvK~~K~~QD~r~DlP~IG~~Ti~~~~~ag~~gia 247 (281) + -|+. =-+|+| ..++.+.+.|+.|-+ T Consensus 211 ~-------~PG~-G~id~~----~i~~aL~~~gY~G~i 236 (254) T TIGR03234 211 H-------EPGT-GEINYR----FLFAVLDRLGYDGWV 236 (254) T ss_pred C-------CCCC-CCCCHH----HHHHHHHHHCCCCEE T ss_conf 8-------9999-973889----999999983999559 No 62 >COG0299 PurN Folate-dependent phosphoribosylglycinamide formyltransferase PurN [Nucleotide transport and metabolism] Probab=52.71 E-value=18 Score=16.84 Aligned_cols=73 Identities=16% Similarity=0.218 Sum_probs=46.7 Q ss_pred EEE-EEECCCCHHHHHHHHHHHCCCCEEEEE-ECCCCCCC-C-----CCCCEEEEEHHHH-------HHHHHHHHHCCCC Q ss_conf 399-997798038999999997899789999-54866731-0-----3885678738999-------9999999864954 Q gi|254780768|r 3 RLL-IIAGSGMLPYYVAKAARLKNDEPVIAS-VLNECSFD-W-----QDFECRELPLGDF-------CVLRSILHQYNIG 67 (281) Q Consensus 3 kig-IIAG~G~LP~~ia~~~~~~g~~~~ii~-l~~~~~~~-~-----~~~~~~~~~ig~i-------g~li~~Lk~~~i~ 67 (281) ||+ +++|+|+-=..++++++....+.-+.. +.+..++. + ...+...+.-..+ ..+.+.|+..+++ T Consensus 2 ki~VlaSG~GSNlqaiida~~~~~~~a~i~~Visd~~~A~~lerA~~~gIpt~~~~~k~~~~r~~~d~~l~~~l~~~~~d 81 (200) T COG0299 2 KIAVLASGNGSNLQAIIDAIKGGKLDAEIVAVISDKADAYALERAAKAGIPTVVLDRKEFPSREAFDRALVEALDEYGPD 81 (200) T ss_pred EEEEEEECCCCCHHHHHHHHHCCCCCCEEEEEEECCCCCHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 49999947965599999987648988179999957977789999998499889744456787789999999999852999 Q ss_pred EEEEECCC Q ss_conf 06770242 Q gi|254780768|r 68 RIVVAGAI 75 (281) Q Consensus 68 ~ivmaG~V 75 (281) =+|+||++ T Consensus 82 lvvLAGyM 89 (200) T COG0299 82 LVVLAGYM 89 (200) T ss_pred EEEECCHH T ss_conf 99986647 No 63 >PRK10200 putative racemase; Provisional Probab=52.58 E-value=18 Score=16.83 Aligned_cols=76 Identities=9% Similarity=-0.017 Sum_probs=41.3 Q ss_pred CCEEEEEECCCCHHH-----HHHHHHHHC-C-CCEEEEEECCCCCCCCC--CCCEEEEEHH-HHHHHHHHHHHCCCCEEE Q ss_conf 973999977980389-----999999978-9-97899995486673103--8856787389-999999999864954067 Q gi|254780768|r 1 MKRLLIIAGSGMLPY-----YVAKAARLK-N-DEPVIASVLNECSFDWQ--DFECRELPLG-DFCVLRSILHQYNIGRIV 70 (281) Q Consensus 1 M~kigIIAG~G~LP~-----~ia~~~~~~-g-~~~~ii~l~~~~~~~~~--~~~~~~~~ig-~ig~li~~Lk~~~i~~iv 70 (281) |++||||.|=|-..- .+.+....+ + ...-.+.+..-..++.. .....|-+.+ .+....+.|++.|++-++ T Consensus 1 Mk~IGIlGGmGp~AT~~yy~~i~~~t~a~~~~~h~~~l~~~s~~~~~~~~~~~~~~~d~~~~~L~~~a~~Le~aGAd~i~ 80 (230) T PRK10200 1 MKTIGLLGGMSWESTIPYYRLINEGIKQRLGGLHSAQLLLHSVDFHEIEECQRRGEWDKTGDILAEAALGLQRAGAEGIV 80 (230) T ss_pred CCEEEECCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCEEE T ss_conf 96789826679899999999999999887189788721122788577999886799653999999999999984999999 Q ss_pred EECCCC Q ss_conf 702423 Q gi|254780768|r 71 VAGAID 76 (281) Q Consensus 71 maG~V~ 76 (281) |+-.=. T Consensus 81 i~cNTa 86 (230) T PRK10200 81 LCTNTM 86 (230) T ss_pred ECCCHH T ss_conf 647478 No 64 >TIGR02379 ECA_wecE TDP-4-keto-6-deoxy-D-glucose transaminase; InterPro: IPR012749 This family consists of TDP-4-keto-6-deoxy-D-glucose transaminases, the WecE (formerly RffA) protein of enterobacterial common antigen (ECA) biosynthesis, from enterobacteria. It also includes closely matching sequence from species not expected to make ECA, but which contain other genes for the biosynthesis of TDP-4-keto-6-deoxy-D-Glc, an intermediate in the biosynthesis of other compounds as well and the substrate of WecA. This family belongs to the DegT/DnrJ/EryC1/StrS aminotransferase family.. Probab=51.85 E-value=19 Score=16.75 Aligned_cols=97 Identities=20% Similarity=0.270 Sum_probs=70.1 Q ss_pred HHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEEECCCCHH Q ss_conf 99899999973326722599981974889625342179999999754312345667718999348887531210662799 Q gi|254780768|r 154 ILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRADLPSIGAK 233 (281) Q Consensus 154 I~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~DlP~IG~~ 233 (281) ...+.+++.-|-...-|--||.=.=.-+.- -|+-.-| |...+|| |.|-|-=-|-+. T Consensus 55 CT~aLEmaAlL~Di~~GDEVIMPSyTFVST-----ANAFvLR------------Ga~~vFV-------Dir~dTmNiDe~ 110 (376) T TIGR02379 55 CTAALEMAALLLDIQPGDEVIMPSYTFVST-----ANAFVLR------------GAKIVFV-------DIRPDTMNIDET 110 (376) T ss_pred HHHHHHHHHHHHCCCCCCEEECCCCHHHHH-----HHHHHHC------------CCEEEEE-------EECCCCCCCCHH T ss_conf 567999999983046888367575145776-----6778672------------8827999-------604200046767 Q ss_pred HHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEE Q ss_conf 99999984990999973977998589999999987968999 Q gi|254780768|r 234 TVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEAGIFVCG 274 (281) Q Consensus 234 Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~g 274 (281) .|+.++...-+.|+.-.-.=.-.|.+.+.++|++|+||++- T Consensus 111 ~IE~AIt~rT~aIvpVHYaGVaC~M~tIM~la~k~~L~ViE 151 (376) T TIGR02379 111 KIEAAITERTKAIVPVHYAGVACDMDTIMALANKHELFVIE 151 (376) T ss_pred HHHHHHCCCCEEEEEEECCCEECCHHHHHHHHHHCCEEEEE T ss_conf 89886325871899873364210547898776434616996 No 65 >TIGR01780 SSADH succinic semialdehyde dehydrogenase; InterPro: IPR010102 Succinic semialdehyde dehydrogenase is one of three enzymes constituting 4-aminobutyrate (GABA) degradation in both prokaryotes and eukaryotes, catalysing the (NAD(P)+)-dependent catabolism reaction of succinic semialdehyde to succinate for metabolism by the citric acid cycle. In Escherichia coli, succinic semialdehyde dehydrogenase is located in a unidirectionally transcribed gene cluster encoding enzymes for GABA degradation and is suggested to be co-transcribed with succinic semialdehyde transaminase from a common promoter upstream of SSADH . Similar gene arrangements can be found in characterised Ralstonia eutropha and the genome analysis of Bacillus subtilis. Prokaryotic succinic semialdehyde dehydrogenases (1.2.1.16 from EC) share high sequence homology to characterised succinic semialdehyde dehydrogenases from rat and human (1.2.1.24 from EC), exhibiting conservation of proposed cofactor binding residues, and putative active sites (G-237 and G-242, C-293 and G-259 respectively of rat SSADH) . Eukaryotic SSADH enzymes exclusively utilise NAD+ as a cofactor, exhibiting little to no NADP+ activity . While a NADP+ preference has been detected in prokaryotes in addition to both NADP+- and NAD+-dependencies as in Escherichia coli, Pseudomonas, and Klebsiella pneumoniae . The function of this alternative SSADH currently is unknown, but has been suggested to play a possible role in 4-hydroxyphenylacetic degradation . SSADH enzyme belongs to the aldehyde dehydrogenase family, sharing a common evolutionary origin and enzymatic mechanism with lactaldehyde dehydrogenase . Like in lactaldehyde dehydrogenase and succinate semialdehyde dehydrogenase, the mammalian catalytic glutamic acid and cysteine residues are conserved in all the enzymes of this family. ; GO: 0016620 oxidoreductase activity acting on the aldehyde or oxo group of donors NAD or NADP as acceptor. Probab=51.33 E-value=13 Score=17.74 Aligned_cols=30 Identities=20% Similarity=0.149 Sum_probs=13.9 Q ss_pred CHHHHHHHHHHHHHCCCEEECHHHCCHHHCC Q ss_conf 0358999999999769748211222524203 Q gi|254780768|r 106 NAAILKASIDLLESYGVSVVGAHEIVPELLV 136 (281) Q Consensus 106 Dd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~ 136 (281) |..|=.+|-..+-. .|+=.+|+=+|++.+. T Consensus 242 DAd~D~AVE~a~As-KFRn~GQTCVCaNRly 271 (454) T TIGR01780 242 DADIDQAVEGAMAS-KFRNAGQTCVCANRLY 271 (454) T ss_pred CCCHHHHHHHHHHH-HHCCCCCEEECCCEEE T ss_conf 98876899999987-4527996377266346 No 66 >PRK08643 acetoin reductase; Validated Probab=50.99 E-value=19 Score=16.66 Aligned_cols=75 Identities=12% Similarity=0.094 Sum_probs=44.7 Q ss_pred CCEEEEEECC-CCHHHHHHHHHHHCCCCEEEEEECCCCCCC----CCC--CCEE--EEEH---HHHHHHHHHHHH--CCC Q ss_conf 9739999779-803899999999789978999954866731----038--8567--8738---999999999986--495 Q gi|254780768|r 1 MKRLLIIAGS-GMLPYYVAKAARLKNDEPVIASVLNECSFD----WQD--FECR--ELPL---GDFCVLRSILHQ--YNI 66 (281) Q Consensus 1 M~kigIIAG~-G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~----~~~--~~~~--~~~i---g~ig~li~~Lk~--~~i 66 (281) |+|+++|-|. +-+=..+++.+-+.|.+++++.+..+.-.. ++. .++. ..++ .++..+++...+ ..+ T Consensus 1 mnKvalVTGg~~GIG~aia~~la~~Ga~V~i~d~~~~~~~~~~~~~~~~~~~~~~~~~Dvt~~~~v~~~~~~~~~~~G~i 80 (256) T PRK08643 1 MSKVALVTGAGQGIGFAIAKRLVEDGFKVAIVDYNEETAKAAADKLSSDGGKAIAVKADVSNRDQVFDAVQQVVDTFGDL 80 (256) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHCCC T ss_conf 98499995757889999999999879999999698899999999998539909999805899999999999999982998 Q ss_pred CEEE-EECCC Q ss_conf 4067-70242 Q gi|254780768|r 67 GRIV-VAGAI 75 (281) Q Consensus 67 ~~iv-maG~V 75 (281) +-+| -||-. T Consensus 81 DiLVNnAG~~ 90 (256) T PRK08643 81 NVVVNNAGLA 90 (256) T ss_pred CEEEECCCCC T ss_conf 7999899889 No 67 >PRK08727 hypothetical protein; Validated Probab=50.84 E-value=19 Score=16.65 Aligned_cols=110 Identities=10% Similarity=0.004 Sum_probs=64.2 Q ss_pred CCHHHHHHHHHHHCCCCEEEEEECCCCCCC------CCCCCEEEEE----------H-HHHHHHHHHHHHCCCCEEEEEC Q ss_conf 803899999999789978999954866731------0388567873----------8-9999999999864954067702 Q gi|254780768|r 11 GMLPYYVAKAARLKNDEPVIASVLNECSFD------WQDFECRELP----------L-GDFCVLRSILHQYNIGRIVVAG 73 (281) Q Consensus 11 G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~------~~~~~~~~~~----------i-g~ig~li~~Lk~~~i~~ivmaG 73 (281) -+|-..+...+...|..+..+.+....... ++......+. . ..+-.+++.++.+++. ++|++ T Consensus 55 THLl~a~~~~~~~~~~~~~yl~l~~~~~~~~~~l~~le~~~ll~iDDid~i~g~~~~e~aLFhL~N~~~~~~~~-ll~ts 133 (233) T PRK08727 55 THLALSLCAAAEQAGRSSAYLPLQAAAGRLRDALEALEGRSLVALDGVDSIAGQREDEVALFDFHNRARAAGIT-LLYTA 133 (233) T ss_pred HHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHCCCCEEEEECCHHCCCCHHHHHHHHHHHHHHHHCCCE-EEEEC T ss_conf 99999999999827997288447885320256775310389789855011269827999999999999861983-89977 Q ss_pred CCCCCCCHHH-HCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 4232100011-10260357899999987510480358999999999769748 Q gi|254780768|r 74 AIDRRPNVQD-LCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSV 124 (281) Q Consensus 74 ~V~krP~~~~-l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~v 124 (281) .. .|.-.+ ..+|+++=+... ..+.+..-.|+.....+...++..||++ T Consensus 134 ~~--~P~~l~~~l~DL~SRL~~~-~~~~l~~~dD~~~~~iL~~~a~~rgl~l 182 (233) T PRK08727 134 RQ--MPDGLALVLPDLRSRLSQC-IRIGLPVLDDVARAAVLRDRAQRRGLAL 182 (233) T ss_pred CC--CHHHHCCCHHHHHHHHHCC-CEEEECCCCHHHHHHHHHHHHHHCCCCC T ss_conf 98--9566231002199999669-2288578897999999999999869999 No 68 >pfam01370 Epimerase NAD dependent epimerase/dehydratase family. This family of proteins utilize NAD as a cofactor. The proteins in this family use nucleotide-sugar substrates for a variety of chemical reactions. Probab=49.96 E-value=20 Score=16.56 Aligned_cols=72 Identities=15% Similarity=0.136 Sum_probs=52.2 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCC---CCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 9997798038999999997899789999548667310---38856787389999999999864954067702423 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDW---QDFECRELPLGDFCVLRSILHQYNIGRIVVAGAID 76 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~---~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~ 76 (281) -|..|+|-+=..+++.+.++|++++.+.......... ........++.+...+.+.++..+++.|+-+.... T Consensus 2 LItGasGfiG~~l~~~L~~~g~~v~~~~r~~~~~~~~~~~~~~~~~~~dl~~~~~~~~~~~~~~~D~VihlAa~~ 76 (235) T pfam01370 2 LVTGGTGFIGSALVRRLLQEGYEVIVLGRRRRSESLNTGRIRFRFHEGDLTDPDALERLLAEVQPDAVIHLAAQS 76 (235) T ss_pred EEECCCCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHCCCCCEEEEEECCCHHHHHHHHHHCCCCEEEECCCCC T ss_conf 997289799999999999787989999899730122211467659996588999999998538998999897747 No 69 >cd01012 YcaC_related YcaC related amidohydrolases; E.coli YcaC is an homooctameric hydrolase with unknown specificity. Despite its weak sequence similarity, it is structurally related to other amidohydrolases and shares conserved active site residues with them. Multimerisation interface seems not to be conserved in all members. Probab=49.86 E-value=20 Score=16.55 Aligned_cols=114 Identities=14% Similarity=0.094 Sum_probs=57.4 Q ss_pred HHHHHHHHCCCCEEEEEECCCC----CCCC----CCCCE---EEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHH- Q ss_conf 9999999789978999954866----7310----38856---7873899999999998649540677024232100011- Q gi|254780768|r 16 YVAKAARLKNDEPVIASVLNEC----SFDW----QDFEC---RELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQD- 83 (281) Q Consensus 16 ~ia~~~~~~g~~~~ii~l~~~~----~~~~----~~~~~---~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~- 83 (281) .+++.++..|.+++...-.... .+++ ++... ..++-..-..+-..|+++++++++++|-.+. -=... T Consensus 27 ~Li~~ar~~~~pVi~t~~~~~~~g~~~~el~~~~~~~~vi~K~~~saf~~t~L~~~L~~~gi~~lvi~G~~T~-~CV~~T 105 (157) T cd01012 27 KLAKAAKLLDVPVILTEQYPKGLGPTVPELREVFPDAPVIEKTSFSCWEDEAFRKALKATGRKQVVLAGLETH-VCVLQT 105 (157) T ss_pred HHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHCCCCCEEEECCCCCCCCCCHHHHHHHHCCCCEEEEEEEECC-HHHHHH T ss_conf 9999999849998999878888888576567249998457778147837813999998629998999998542-579997 Q ss_pred ----HCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHC Q ss_conf ----1026035789999998751048035899999999976974821122252420 Q gi|254780768|r 84 ----LCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVPELL 135 (281) Q Consensus 84 ----l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll 135 (281) ...+.+. .++.-.....+...=.+-.+.+...|..+.+...++-+|+ T Consensus 106 a~da~~~Gy~v-----~vv~Da~~s~~~~~h~~AL~~m~~~g~~v~t~e~i~~el~ 156 (157) T cd01012 106 ALDLLEEGYEV-----FVVADACGSRSKEDHELALARMRQAGAVLTTSESVLFELQ 156 (157) T ss_pred HHHHHHCCCEE-----EEECCCCCCCCHHHHHHHHHHHHHCCCEEECHHHHHHHHC T ss_conf 99999869969-----9946378899999999999999987999967999999973 No 70 >TIGR00506 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; InterPro: IPR000422 3,4-Dihydroxy-2-butanone 4-phosphate is biosynthesized from ribulose 5-phosphate and serves as the biosynthetic precursor for the xylene ring of riboflavin . It is sometimes found as a bifunctional enzyme with GTP cyclohydrolase II that catalyses the first committed step in the biosynthesis of riboflavin (IPR000926 from INTERPRO). No sequences with significant homology to DHBP synthase are found in the metazoa.; GO: 0008686 34-dihydroxy-2-butanone-4-phosphate synthase activity, 0009231 riboflavin biosynthetic process. Probab=49.55 E-value=18 Score=16.83 Aligned_cols=197 Identities=15% Similarity=0.131 Sum_probs=94.1 Q ss_pred HHHHHHHHHCCCCEEEEEECC-CCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHH Q ss_conf 999999997899789999548-6673103885678738999999999986495406770242321000111026035789 Q gi|254780768|r 15 YYVAKAARLKNDEPVIASVLN-ECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLR 93 (281) Q Consensus 15 ~~ia~~~~~~g~~~~ii~l~~-~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~ 93 (281) +.=|=++.++|.-+.|+--.+ |++-++= ++-+.+.- .-++++..++=.-||.+= + |...+ ++|+ T Consensus 5 i~~AL~Alk~G~~vlV~DdedREnEgDli-~aAe~~tp----E~i~~m~~~a~GLiC~~~--t--~~~~~-kLdL----- 69 (230) T TIGR00506 5 IEEALEALKKGEIVLVLDDEDRENEGDLI-VAAEFITP----EQIAFMRRHAGGLICVAL--T--EDIAD-KLDL----- 69 (230) T ss_pred HHHHHHHHHCCCEEEEEECCCCCCCHHHH-HCCCCCCH----HHHHHHHHHCCCEEEECC--C--HHHHH-HCCC----- T ss_conf 89999998569559997368998755541-01221588----897876664586573337--7--78897-4689----- Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHH---------CCHHHCCCCC--CCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 999998751048035899999999976974821122---------2524203543--34554653234788998999999 Q gi|254780768|r 94 ISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHE---------IVPELLVQVG--SLGTCVPNRDVKRDILAAMKSAE 162 (281) Q Consensus 94 l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~---------~l~~ll~~~G--~l~~~~p~~~~~~dI~~g~~i~~ 162 (281) |.+.-. +.++.+.|.++.--. +.=..-+.++ .-|+.. ++++. ......++. T Consensus 70 -P~Mv~~--------------~~~~~~k~pvlrkl~p~Di~Yd~~~~fti~~~~r~et~TGIs--a~DR~-~Ti~~~l~~ 131 (230) T TIGR00506 70 -PLMVDV--------------NTSASEKLPVLRKLTPDDIPYDEKSTFTIDVAHRKETTTGIS--ANDRA-LTIRAALAD 131 (230) T ss_pred -CCCCHH--------------HHHHHCCCCCCCCCCCCCCCCCCEEEEEEEEECCCCCEECCC--HHHHH-HHHHHHHCC T ss_conf -735001--------------002211355010006786787742378999747788500554--64788-999887467 Q ss_pred HHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCC---CCCCEEEEEECCCCCCCEEEECCCCHHHHHHHH Q ss_conf 73326722599981974889625342179999999754312345---667718999348887531210662799999999 Q gi|254780768|r 163 ALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRIL---AGKSGVLVKMCKSQQDMRADLPSIGAKTVQNVI 239 (281) Q Consensus 163 ~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~---~~~~~ilvK~~K~~QD~r~DlP~IG~~Ti~~~~ 239 (281) .-.. | .++.-.-.|+ +|.++|... +..||||- +.+|-- =||-+|. T Consensus 132 ~~~~------v------mmgr~~dFG~--------df~rPGHvfpLRAa~GgVL~---R~GHTE---------asVdL~~ 179 (230) T TIGR00506 132 VVKP------V------MMGRKEDFGS--------DFRRPGHVFPLRAADGGVLK---RAGHTE---------ASVDLAE 179 (230) T ss_pred CCCE------E------ECCCCCCCHH--------HCCCCCCCCHHHHCCCCEEC---CCCCCH---------HHHHHHH T ss_conf 6642------2------2276245511--------43688865415432787512---577334---------8889999 Q ss_pred HCCCEEEEE-----EC--CCEEEEC-HHHHHHHHHHCCCEEEEEC Q ss_conf 849909999-----73--9779985-8999999998796899957 Q gi|254780768|r 240 KAGLAGIAL-----EA--GKSLVLE-KELVKKHADEAGIFVCGID 276 (281) Q Consensus 240 ~ag~~giai-----ea--~~~lild-~~~~i~~a~~~~i~i~g~~ 276 (281) -||++.++| .+ .+++.-- ..-+.+||+||||-+++++ T Consensus 180 lAGl~pa~VICE~~nd~edG~mar~~ekf~~~yA~Kh~l~~i~i~ 224 (230) T TIGR00506 180 LAGLKPAGVICEIMNDEEDGTMARKPEKFLVEYAKKHGLKLISIE 224 (230) T ss_pred HCCCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHCCCCEEHH T ss_conf 708997289986237720331036664889999988198501178 No 71 >PRK09496 trkA potassium transporter peripheral membrane component; Reviewed Probab=49.05 E-value=21 Score=16.46 Aligned_cols=70 Identities=17% Similarity=0.173 Sum_probs=44.9 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCC-CCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 99977980389999999978997899995486673103-8856787389999999999864954067702423 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQ-DFECRELPLGDFCVLRSILHQYNIGRIVVAGAID 76 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~-~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~ 76 (281) -||+|.|..=..+|+.+...|+++++|.-..+.-..+. .+.+..+. |+ +.-.+.|++.++++.=+.=.++ T Consensus 3 IiI~GaG~vG~~La~~Ls~e~~dV~vID~d~~~~~~~~~~lDv~~i~-Gd-~~~~~~L~~Agi~~ad~~IAvT 73 (455) T PRK09496 3 IIILGAGQVGGTLAERLVGENNDVTVIDTDEERLRRLQDRLDVRTVV-GN-GSHPDVLREAGAEDADMLIAVT 73 (455) T ss_pred EEEECCCHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHCCEEEEE-EC-CCCHHHHHHCCCCCCCEEEEEC T ss_conf 99999888999999999868997999989999999988625868999-66-8999999965998699999957 No 72 >pfam01936 DUF88 Protein of unknown function DUF88. This highly conserved bacterial protein has no known function. The alignment contains many conserved aspartates, suggesting an enzymatic function such as an endonuclease or glycosyl hydrolase (Bateman A pers. obs). Probab=48.85 E-value=21 Score=16.44 Aligned_cols=14 Identities=21% Similarity=0.131 Sum_probs=6.6 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 78899899999973 Q gi|254780768|r 151 KRDILAAMKSAEAL 164 (281) Q Consensus 151 ~~dI~~g~~i~~~l 164 (281) -.|+.....+..-. T Consensus 72 ~~D~~l~vD~~~~~ 85 (140) T pfam01936 72 AVDVGLAVDALELA 85 (140) T ss_pred CHHHHHHHHHHHHH T ss_conf 03499999999997 No 73 >pfam00365 PFK Phosphofructokinase. Probab=48.52 E-value=21 Score=16.41 Aligned_cols=212 Identities=13% Similarity=0.135 Sum_probs=105.5 Q ss_pred CEEEEEECCCCHHH------HHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 73999977980389------999999978997899995486673103885678738999999999986495406770242 Q gi|254780768|r 2 KRLLIIAGSGMLPY------YVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 2 ~kigIIAG~G~LP~------~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V 75 (281) +||||+-+.|.=|- .+.+.+...|++++- +.+.-.--+. -+...++...+.. +++.-+. +.|.- T Consensus 1 KrI~IltsGG~~pG~Na~i~~vv~~a~~~g~~v~G--~~~G~~GL~~-~~~~~l~~~~v~~---~~~~gGt----~Lgts 70 (279) T pfam00365 1 KKIGVLTSGGDAPGMNAAIRAVVRSAIAEGLEVYG--IYEGYAGLVA-GNIKQLDWESVSD---IIQRGGT----FIGSA 70 (279) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEE--EECCHHHHHC-CCEEECCHHHHHH---HHHCCCC----CCCCC T ss_conf 95999907888667899999999999877999999--9056778826-9824098888988---9965984----12466 Q ss_pred CCCCCHHHHCCCHHHHHHHHHHHHH------HHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHH Q ss_conf 3210001110260357899999987------5104803589999999997697482112225242035433455465323 Q gi|254780768|r 76 DRRPNVQDLCFSIKDSLRISKMIWQ------LVSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRD 149 (281) Q Consensus 76 ~krP~~~~l~~D~~~~~~l~k~l~~------~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~ 149 (281) |.|.+.. .....++...|.+ ....||++. ......-++.|+.+++.-.=..+-++.. .++ |. T Consensus 71 -R~~~~~~----~~~~~~~~~~l~~~~Id~li~IGGd~S~-~~a~~L~~~~~i~vigIPkTIDNDl~~t-d~s---~G-- 138 (279) T pfam00365 71 -RCPEFRE----REGRLKAAENLKKHGIDALVVIGGDGSY-TGADLLTSEHGFNCVGLPGTIDNDICGT-DYT---IG-- 138 (279) T ss_pred -CCCCCCC----HHHHHHHHHHHHHHCCCEEEEECCCHHH-HHHHHHHHHCCCCEEEEECCCCCCCCCC-CCC---CC-- T ss_conf -7775545----6679999999998489979996695689-9999999972997898203456898777-678---78-- Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEEECC Q ss_conf 47889989999997332672259998197488962534217999999975431234566771899934888753121066 Q gi|254780768|r 150 VKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRADLPS 229 (281) Q Consensus 150 ~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~DlP~ 229 (281) ...-++.+.+....+- |-. -...+|.=||.+-..---|.-.+.+.. ++-++=.|-. .+|+-. T Consensus 139 f~TA~~~~~~~i~~l~--~ta----~s~~rv~ivEvMGR~~G~LAl~~~la~--------gad~iliPE~----~~~~~~ 200 (279) T pfam00365 139 FDTALNTIVEAIDRIR--DTA----SSHQRTFVVEVMGRHCGDLALVAGIAG--------GADYIFIPEA----PFWEEE 200 (279) T ss_pred HHHHHHHHHHHHHHHH--HHH----HCCCCEEEEEECCCCHHHHHHHHHHHC--------CCCEEEECCC----CCCHHH T ss_conf 8999999999999999--986----426864999979978689999998852--------8999996798----875899 Q ss_pred CCHHHHHHHHHCC-CEEEEEECCCEE Q ss_conf 2799999999849-909999739779 Q gi|254780768|r 230 IGAKTVQNVIKAG-LAGIALEAGKSL 254 (281) Q Consensus 230 IG~~Ti~~~~~ag-~~giaiea~~~l 254 (281) + .+.++...+.+ -.++++-+.++. T Consensus 201 ~-~~~i~~~~~~~k~~~ivvvsEG~~ 225 (279) T pfam00365 201 L-CNRLKRGRERGKRHNIVIVAEGAI 225 (279) T ss_pred H-HHHHHHHHHCCCCEEEEEEECCCC T ss_conf 9-999999998089805999978987 No 74 >COG1395 Predicted transcriptional regulator [Transcription] Probab=48.12 E-value=21 Score=16.37 Aligned_cols=21 Identities=14% Similarity=0.181 Sum_probs=7.3 Q ss_pred HHHHCCCEEECHHHCCHHHCC Q ss_conf 999769748211222524203 Q gi|254780768|r 116 LLESYGVSVVGAHEIVPELLV 136 (281) Q Consensus 116 ~fe~~G~~vi~~~~~l~~ll~ 136 (281) .+.+-||.+.+...-..+..+ T Consensus 212 ~L~~iG~~~~~~~~apfDava 232 (313) T COG1395 212 KLARIGFKVLPTKQAPFDAVA 232 (313) T ss_pred HHHHHCCCCCCHHHCCHHHHH T ss_conf 999846575013307377764 No 75 >cd02071 MM_CoA_mut_B12_BD methylmalonyl CoA mutase B12 binding domain. This domain binds to B12 (adenosylcobamide), which initiates the conversion of succinyl CoA and methylmalonyl CoA by forming an adenosyl radical, which then undergoes a rearrangement exchanging a hydrogen atom with a group attached to a neighboring carbon atom. This family is present in both mammals and bacteria. Bacterial members are heterodimers and involved in the fermentation of pyruvate to propionate. Mammalian members are homodimers and responsible for the conversion of odd-chain fatty acids and branched-chain amino acids via propionyl CoA to succinyl CoA for further degradation. Probab=47.88 E-value=22 Score=16.34 Aligned_cols=63 Identities=21% Similarity=0.187 Sum_probs=42.0 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCCCCCC-----CCCCEEEEE------HHHHHHHHHHHHHCCCC-EEEEECCCC Q ss_conf 8999999997899789999548667310-----388567873------89999999999864954-067702423 Q gi|254780768|r 14 PYYVAKAARLKNDEPVIASVLNECSFDW-----QDFECRELP------LGDFCVLRSILHQYNIG-RIVVAGAID 76 (281) Q Consensus 14 P~~ia~~~~~~g~~~~ii~l~~~~~~~~-----~~~~~~~~~------ig~ig~li~~Lk~~~i~-~ivmaG~V~ 76 (281) ...++..++.+|++++-.++....+... .+....-++ +..+..+++.|++++.+ -.+++|.+- T Consensus 16 ~~iva~~l~d~GfeVi~lG~~~s~eeiv~~A~~e~ad~IglSsL~g~h~~~~~~l~~~L~e~G~~di~v~vGG~I 90 (122) T cd02071 16 AKVIARALRDAGFEVIYTGLRQTPEEIVEAAIQEDVDVIGLSSLSGGHMTLFPEVIELLRELGAGDILVVGGGII 90 (122) T ss_pred HHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCC T ss_conf 999999999789769967998899999999997399899996465544789999999999769998469994564 No 76 >PRK08084 DNA replication initiation factor; Provisional Probab=47.85 E-value=22 Score=16.34 Aligned_cols=112 Identities=12% Similarity=0.042 Sum_probs=65.4 Q ss_pred CCCHHHHHHHHHHHCCCCEEEEEECCCCC--CC----CCCCCEEEEE-----------HHHHHHHHHHHHHCCCCEEEEE Q ss_conf 98038999999997899789999548667--31----0388567873-----------8999999999986495406770 Q gi|254780768|r 10 SGMLPYYVAKAARLKNDEPVIASVLNECS--FD----WQDFECRELP-----------LGDFCVLRSILHQYNIGRIVVA 72 (281) Q Consensus 10 ~G~LP~~ia~~~~~~g~~~~ii~l~~~~~--~~----~~~~~~~~~~-----------ig~ig~li~~Lk~~~i~~ivma 72 (281) .-+|=..+..++..++..++.+.+..... ++ ++.++...+. =..+-.+++.+...+-++++|+ T Consensus 58 KTHLLqA~~~~~~~~~~~~~yl~~~~~~~~~~~~l~~l~~~dll~iDDi~~i~g~~~~ee~lF~l~N~~~~~g~~~ll~t 137 (235) T PRK08084 58 RSHLLHAACAELSQRGDAVGYVPLDKRAWFVPEVLEGMEQLSLVCIDNIECIAGDELWEMAIFDLYNRILESGKTRLLIT 137 (235) T ss_pred HHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHHHHHCCEEEEECHHHHCCCHHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 89999999999970798579987798665179999876418989982745546997899999999999998489669996 Q ss_pred CCCCCCCCHHHH-CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 242321000111-0260357899999987510480358999999999769748 Q gi|254780768|r 73 GAIDRRPNVQDL-CFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSV 124 (281) Q Consensus 73 G~V~krP~~~~l-~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~v 124 (281) +.. .|.--++ .+|+++=+. .-..+.+..-.|+....++....+..|+.+ T Consensus 138 s~~--~P~~l~~~l~DL~SRl~-~g~~~~i~~~dde~~~~iL~~~a~~rgl~l 187 (235) T PRK08084 138 GDR--PPRQLNLGLPDLASRLD-WGQIYKLQPLSDEEKLQALQLRARLRGFEL 187 (235) T ss_pred CCC--CHHHCCCCCHHHHHHHH-CCCEEEECCCCHHHHHHHHHHHHHHCCCCC T ss_conf 798--82430231288999995-697278559998999999999999739999 No 77 >PRK03659 glutathione-regulated potassium-efflux system protein KefB; Provisional Probab=46.99 E-value=22 Score=16.25 Aligned_cols=120 Identities=13% Similarity=0.143 Sum_probs=67.5 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCC--CCEEEEEHHHHHHHHHHHHHCCCCE---EEEECCCCCCC Q ss_conf 999779803899999999789978999954866731038--8567873899999999998649540---67702423210 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQD--FECRELPLGDFCVLRSILHQYNIGR---IVVAGAIDRRP 79 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~--~~~~~~~ig~ig~li~~Lk~~~i~~---ivmaG~V~krP 79 (281) -||+|.|.+=..+++.++.+|.+++++....+......+ +++++=+-.+. +.|+..|+.+ +|.+ ++ -| T Consensus 403 VII~G~GR~Gq~var~L~~~gi~~vviD~d~~~V~~~r~~G~~v~yGDat~~----~vL~~AGi~~A~~vVia--i~-d~ 475 (602) T PRK03659 403 VIIVGFGRFGQVIGRLLMANKMRITVLERDISAVNLMRKYGYKVYYGDATQL----ELLRAAGAEKAEAIVIT--CN-EP 475 (602) T ss_pred EEEECCCHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHCCCEEEEECCCCH----HHHHHCCCCCCCEEEEE--EC-CH T ss_conf 8997887568999999997899989997867999999978990897589999----99986790405889998--29-89 Q ss_pred CHHHHCCCHHHHHHHHHHHHHH-------HCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCC Q ss_conf 0011102603578999999875-------1048035899999999976974821122252420354334554 Q gi|254780768|r 80 NVQDLCFSIKDSLRISKMIWQL-------VSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTC 144 (281) Q Consensus 80 ~~~~l~~D~~~~~~l~k~l~~~-------~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~ 144 (281) ....++.+...+. .+-.| +.=...+.+.|.+-+-..+|-..+-+....+..- T Consensus 476 ---------~~~~~iv~~~r~~~P~l~I~aRar~----~~~~~~L~~~Ga~~vv~Et~essL~l~~~~L~~l 534 (602) T PRK03659 476 ---------EDTMKLVELCQQHFPHLHILARARG----RVEAHELLQAGVTQFSRETFSSALELGRKTLVSL 534 (602) T ss_pred ---------HHHHHHHHHHHHHCCCCEEEEEECC----HHHHHHHHHCCCCEEECCHHHHHHHHHHHHHHHC T ss_conf ---------9999999999987869969998697----8999999978999786627899999999999980 No 78 >PRK06893 DNA replication initiation factor; Validated Probab=46.69 E-value=22 Score=16.22 Aligned_cols=113 Identities=12% Similarity=0.052 Sum_probs=64.9 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEEEECCCC--CCC----CCCCCEEEEE----------H-HHHHHHHHHHHHCCCCEEEE Q ss_conf 79803899999999789978999954866--731----0388567873----------8-99999999998649540677 Q gi|254780768|r 9 GSGMLPYYVAKAARLKNDEPVIASVLNEC--SFD----WQDFECRELP----------L-GDFCVLRSILHQYNIGRIVV 71 (281) Q Consensus 9 G~G~LP~~ia~~~~~~g~~~~ii~l~~~~--~~~----~~~~~~~~~~----------i-g~ig~li~~Lk~~~i~~ivm 71 (281) |.-+|=..+...+..++.+.+.+.+.... ++. +++.+...+. + ..+-.+++.+++.+.+-+++ T Consensus 51 GKTHLLqa~~~~~~~~~~~~~yi~~~~~~~~~~~~l~~l~~~d~l~iDDi~~i~g~~~~e~~lF~l~N~l~~~~~~~ll~ 130 (229) T PRK06893 51 GKSHLLKAVSNHYLLNQRTAIYIPLSKSQYFSPAVLENLEQQDLVCLDDLQAVIGNEEWELAIFDLFNRIKESGKTLLLI 130 (229) T ss_pred CHHHHHHHHHHHHHHHCCCEEEEEHHHHHHCCHHHHHHHHHCCEEEEECHHHHCCCHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 88999999999999718985999737756406999987654797999672342488389999999999999759917998 Q ss_pred ECCCCCCCC-HHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 024232100-01110260357899999987510480358999999999769748 Q gi|254780768|r 72 AGAIDRRPN-VQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSV 124 (281) Q Consensus 72 aG~V~krP~-~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~v 124 (281) .+.. .|. +.-..+|+++=+.. -.......-.|+....++.+.+++.|+++ T Consensus 131 ss~~--~p~~l~~~l~DL~SRl~~-~~~~~i~~~dd~~~~~iL~~~a~~rgl~l 181 (229) T PRK06893 131 SANQ--SPHALQIKLPDLASRLTW-GEIYQLNDLTDEQKIEVLQRNAYQRGIEL 181 (229) T ss_pred ECCC--CHHHHCCHHHHHHHHHHC-CEEEEECCCCHHHHHHHHHHHHHHCCCCC T ss_conf 5798--833221002679999968-83699667775799999999999649999 No 79 >PRK08666 5'-methylthioadenosine phosphorylase; Validated Probab=46.59 E-value=11 Score=18.39 Aligned_cols=204 Identities=17% Similarity=0.121 Sum_probs=86.5 Q ss_pred CCEEEEEECCCCHHHHHHHHHHH------CCCCEEEEE-ECCCCCCCCC------CCCEEEEEHHHHHHHHHHHHHCCCC Q ss_conf 97399997798038999999997------899789999-5486673103------8856787389999999999864954 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARL------KNDEPVIAS-VLNECSFDWQ------DFECRELPLGDFCVLRSILHQYNIG 67 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~------~g~~~~ii~-l~~~~~~~~~------~~~~~~~~ig~ig~li~~Lk~~~i~ 67 (281) |.|||||.|+|-+-....+..+. -|...+++. +.+....-+. .++.+.++. -.-+..||.-||+ T Consensus 1 m~kigIIgGSGl~~~~~l~~~~~~~v~TpyG~~~~~~g~~~g~~v~fl~RHG~~h~~~p~~iny---rAni~alk~LGv~ 77 (261) T PRK08666 1 MVRIAIIGGTGVYDPPILENIREETVETPYGEVKVKIGEYAGEEVAFLARHGKGHSVPPHLINY---RANIWALKELGVE 77 (261) T ss_pred CCEEEEECCCCCCCCHHCCCCEEEEEECCCCCCCEEEEEECCEEEEEECCCCCCCCCCCCCCCC---HHHHHHHHHCCCC T ss_conf 9749998887667600147660788589988865599998998999978889998878533760---3659999975997 Q ss_pred EEEEECCCCC-CCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCC Q ss_conf 0677024232-100011102603578999999875104803589999999997697482112225242035433455465 Q gi|254780768|r 68 RIVVAGAIDR-RPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVP 146 (281) Q Consensus 68 ~ivmaG~V~k-rP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p 146 (281) .++-...+-- +|.+..- ++-++............ .+++ +-..+.-+..-..| T Consensus 78 ~ii~tnA~Gsl~~~~~pG------dlvi~~d~Id~t~~r~~-------tf~~--------------~~~~~~~h~~~~~p 130 (261) T PRK08666 78 RILATSAVGSLNPNMKPG------DFVILDQFLDFTKNRHY-------TFYD--------------GGESGVVHTDFTEP 130 (261) T ss_pred EEEEECCCCCCCCCCCCC------CEEEEHHHHCCCCCCCC-------CCCC--------------CCCCCCCCCCCCCC T ss_conf 899840200155568998------68964033113688998-------6887--------------87678244787755 Q ss_pred CHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECC-CCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEE Q ss_conf 3234788998999999733267225999819748896253-421799999997543123456677189993488875312 Q gi|254780768|r 147 NRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGI-EGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRA 225 (281) Q Consensus 147 ~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~-eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~ 225 (281) ...+.+. ...+.+++++- . +...|..+.+|+. .-|-+=++....+ |+=+ T Consensus 131 y~~~Lr~--~l~~~a~~~~~-~-----~~~~GvY~~~~GPrfET~AEi~~~r~~----------GaDv------------ 180 (261) T PRK08666 131 YCPELRK--ALITAARELGL-T-----YHPGGTYVCTEGPRFETAAEIRMYRIL----------GGDL------------ 180 (261) T ss_pred CCHHHHH--HHHHHHHHCCC-C-----CCCCEEEEEEECCCCCCHHHHHHHHHH----------CCCE------------ T ss_conf 2999999--99999998599-7-----248959999889988979999999982----------8994------------ Q ss_pred EECCCCHHHH---HHHHHCCCEEEEEEC--CCEE-----EECHHHHHHHHHHC Q ss_conf 1066279999---999984990999973--9779-----98589999999987 Q gi|254780768|r 226 DLPSIGAKTV---QNVIKAGLAGIALEA--GKSL-----VLEKELVKKHADEA 268 (281) Q Consensus 226 DlP~IG~~Ti---~~~~~ag~~giaiea--~~~l-----ild~~~~i~~a~~~ 268 (281) ||+-|+ ..+.+.|+...++.- +..- -++-+++++.++++ T Consensus 181 ----VGMStvPEv~lAre~gi~ya~is~vTN~a~g~~~~~~s~eeV~~~~~~~ 229 (261) T PRK08666 181 ----VGMTQVPEAVLARELEMCYATVAIVTNYAAGISPTKLTHEEVVELMAQN 229 (261) T ss_pred ----ECCCCCHHHHHHHHCCCCEEEEEEEEECCCCCCCCCCCHHHHHHHHHHH T ss_conf ----0477357999998769967999987200227889988999999999998 No 80 >pfam00308 Bac_DnaA Bacterial dnaA protein. Probab=45.22 E-value=24 Score=16.08 Aligned_cols=101 Identities=14% Similarity=0.202 Sum_probs=55.8 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCH Q ss_conf 99999999986495406770242321000111026035789999998751048035899999999976974821122252 Q gi|254780768|r 53 DFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVP 132 (281) Q Consensus 53 ~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~ 132 (281) .+-.+++.++.++. ++++++... +..+..+.+|+++=+. .-+...+..-+|+.....+...+++.|+.+ .++.+. T Consensus 118 ~lf~l~N~~~~~~~-~lllts~~~-p~~l~~~~~dL~SRL~-~g~~~~i~~pdd~~~~~iL~~~a~~r~l~l--~~~v~~ 192 (219) T pfam00308 118 EFFHTFNALHENNK-QIVLTSDRP-PKELEGFEDRLRSRFE-WGLIIAIEPPDLETRLAILRKKAEEENINI--PNEVLN 192 (219) T ss_pred HHHHHHHHHHHCCC-EEEEECCCC-CCCCCCCCHHHHHHHH-CCEEEECCCCCHHHHHHHHHHHHHHCCCCC--CHHHHH T ss_conf 99999999997298-699977998-1002453277999986-875661169999999999999999849999--999999 Q ss_pred HHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 42035433455465323478899899999973326 Q gi|254780768|r 133 ELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSEL 167 (281) Q Consensus 133 ~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~ 167 (281) +++ .+- ..+.+++.-...-+...+++ T Consensus 193 -yl~------~r~--~R~~r~L~~~L~~L~~~~~~ 218 (219) T pfam00308 193 -FIA------QRI--TDNVRELEGALNRLLAFASF 218 (219) T ss_pred -HHH------HHC--CCCHHHHHHHHHHHHHHHCC T ss_conf -999------842--79899999999999985507 No 81 >PTZ00093 nucleoside diphosphate kinase; Provisional Probab=44.86 E-value=24 Score=16.04 Aligned_cols=74 Identities=15% Similarity=0.116 Sum_probs=41.8 Q ss_pred HHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCC Q ss_conf 89999999997697482112225242035433455465323478899899999973326722599981974889625342 Q gi|254780768|r 109 ILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEG 188 (281) Q Consensus 109 iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eG 188 (281) +...|++.|++.||+|+....+..+--.-+.+|...+- +.... .+..-|+ .|-|+|++ .|| T Consensus 18 l~g~Ii~~i~~~Gf~I~~~k~~~lt~e~a~~fY~~h~g-kpff~------~Lv~~mt-----------SGPvvalv-leg 78 (149) T PTZ00093 18 LVGEIIKRFEKKGYKLVALKFLQPTTEQAEEHYKEHKS-KPFFP------GLVSYIS-----------SGPVVCMV-WEG 78 (149) T ss_pred CHHHHHHHHHHCCCEEEEEEEECCCHHHHHHHHHHHCC-CCCHH------HHHHHHC-----------CCCEEEEE-EEC T ss_conf 87999999998799899987425899999999999718-83288------8999867-----------89879999-627 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 17999999975431 Q gi|254780768|r 189 TDSMLQRIVDCRNN 202 (281) Q Consensus 189 TD~mi~R~~~~~~~ 202 (281) -| .|....++..+ T Consensus 79 ~n-aI~~~R~l~Gp 91 (149) T PTZ00093 79 KN-VVKSGRVLLGA 91 (149) T ss_pred CC-HHHHHHHHHCC T ss_conf 88-89999998589 No 82 >PRK06242 flavodoxin; Provisional Probab=44.64 E-value=24 Score=16.02 Aligned_cols=51 Identities=25% Similarity=0.323 Sum_probs=30.0 Q ss_pred HHHHHHHHHCCCEEECHHHCCHHH-----CCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 999999997697482112225242-----0354334554653234788998999999733 Q gi|254780768|r 111 KASIDLLESYGVSVVGAHEIVPEL-----LVQVGSLGTCVPNRDVKRDILAAMKSAEALS 165 (281) Q Consensus 111 ~~i~~~fe~~G~~vi~~~~~l~~l-----l~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~ 165 (281) +.+.+.+++.|+++++ +-.++-+ +-.-|-++...|++ +|++.+.+.++.|- T Consensus 92 ~~l~~~L~~kg~~v~G-~F~CkG~dt~gp~k~~ggi~kghPn~---~Dl~~A~~Fa~~i~ 147 (150) T PRK06242 92 KALRKKLEEKGFIIIG-EFTCKGFDTFGPFKLIGGINKGHPNE---KDLENAKEFALNLK 147 (150) T ss_pred HHHHHHHHHCCCEEEE-EEEECCCCCCCCHHHCCCCCCCCCCH---HHHHHHHHHHHHHH T ss_conf 9999999968997978-89841502557443216654899498---99999999999987 No 83 >PRK07707 consensus Probab=44.56 E-value=24 Score=16.01 Aligned_cols=164 Identities=14% Similarity=0.060 Sum_probs=83.3 Q ss_pred CCEEEEEECC-CCHHHHHHHHHHHCCCCEEEEEECCCCCCC-----CCCCCEEE--E---EHHHHHHHHHHHHHCCCCEE Q ss_conf 9739999779-803899999999789978999954866731-----03885678--7---38999999999986495406 Q gi|254780768|r 1 MKRLLIIAGS-GMLPYYVAKAARLKNDEPVIASVLNECSFD-----WQDFECRE--L---PLGDFCVLRSILHQYNIGRI 69 (281) Q Consensus 1 M~kigIIAG~-G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~-----~~~~~~~~--~---~ig~ig~li~~Lk~~~i~~i 69 (281) |+|++||.|+ +-+=..+++.+-+.|.++++..-..+.... +.+..... . +..++.++++.+.+ .++-+ T Consensus 1 M~KvalVTG~s~GIG~aia~~la~~Ga~V~i~~~~~~~~~~~~~~~~~~~~~~~v~~Dl~~~~~~~~~~~~~~~-~iD~l 79 (239) T PRK07707 1 MKKYALVTGASGGIGQAISKQLAQDGYTVYLHYNNNEEKVNELQEELGEVIPVPVQADLASPDGAEKLWSQIEH-PLDAI 79 (239) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHHCCCCEEEEEEECCCHHHHHHHHHHHCC-CCCEE T ss_conf 99899996688789999999999879989998399989999999984436606999868999999999998578-89999 Q ss_pred EEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH-CCCEEECHHHCCHHHCCCCCCCCCCCCCH Q ss_conf 77024232100011102603578999999875104803589999999997-69748211222524203543345546532 Q gi|254780768|r 70 VVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLES-YGVSVVGAHEIVPELLVQVGSLGTCVPNR 148 (281) Q Consensus 70 vmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~-~G~~vi~~~~~l~~ll~~~G~l~~~~p~~ 148 (281) |-.-.+..+..+.++..+. ..+.+ ...-++--.+.+.+...+.+ .+=.++....+.... ..++. .-...+| T Consensus 80 VnnAG~~~~~~~~~~~~e~-----~~~~~-~~nl~~~~~~~~~~~p~m~~~~~G~II~isS~~~~~-g~~~~-~~Y~asK 151 (239) T PRK07707 80 VYNSGKSIFGLVTDVTNDE-----LNDMV-QLHVKSPYKLLSMLLPSMIQRKSGNIVAVSSIWGQI-GASCE-VLYSMVK 151 (239) T ss_pred EECCCCCCCCCCCCCCHHH-----HHHHH-HHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHCC-CCCCC-HHHHHHH T ss_conf 9899999998701099999-----99999-999899999999998999876996799973788747-68751-6889999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEE Q ss_conf 347889989999997332672259998 Q gi|254780768|r 149 DVKRDILAAMKSAEALSELDVGQSAVS 175 (281) Q Consensus 149 ~~~~dI~~g~~i~~~l~~~DiGQsvVv 175 (281) ... ..+-..+++++++..|==-+|. T Consensus 152 aav--~~ltr~lA~ela~~gIrVN~I~ 176 (239) T PRK07707 152 GAQ--NSFVKALAKELAPSGIRVNAVA 176 (239) T ss_pred HHH--HHHHHHHHHHHHHHCEEEEEEE T ss_conf 999--9999999999766396999997 No 84 >PRK12833 acetyl-CoA carboxylase biotin carboxylase subunit; Provisional Probab=44.31 E-value=24 Score=15.98 Aligned_cols=63 Identities=11% Similarity=0.014 Sum_probs=42.6 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEH---------HHHHHHHHHHHHCCCCE Q ss_conf 9977980389999999978997899995486673103885678738---------99999999998649540 Q gi|254780768|r 6 IIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPL---------GDFCVLRSILHQYNIGR 68 (281) Q Consensus 6 IIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~i---------g~ig~li~~Lk~~~i~~ 68 (281) |||-.|....-+++.|+..|++.+.+.-....+...-......+.+ -.+..++..-++.+|+- T Consensus 9 LIANRGEIA~Riirt~relgi~tVavys~~D~~~~hv~~ADeav~ig~~~~~~sYln~~~Ii~~A~~~g~dA 80 (458) T PRK12833 9 LVANRGEIAVRVIRAAHELGMRAVAVVSDADRDSLAARMADEAVHIGPSHAAKSYLNPAAILAAARQCGADA 80 (458) T ss_pred EEECCCHHHHHHHHHHHHCCCCEEEEECHHHCCCHHHHHCCEEEEECCCCCCCCCCCHHHHHHHHHHHCCCE T ss_conf 997786999999999998399899991857655835874999998289870013269999999999829999 No 85 >TIGR02620 cas_VVA1548 putative CRISPR-associated protein, VVA1548 family; InterPro: IPR013443 This entry represents a conserved region of about 95 amino acids found exclusively in species with CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats). In all bacterial species that contain this entry, the genes encoding the proteins are in the midst of a cluster of cas (CRISPR-associated) genes.. Probab=44.18 E-value=19 Score=16.69 Aligned_cols=28 Identities=21% Similarity=0.083 Sum_probs=23.6 Q ss_pred CCCHHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 9803899999999789978999954866 Q gi|254780768|r 10 SGMLPYYVAKAARLKNDEPVIASVLNEC 37 (281) Q Consensus 10 ~G~LP~~ia~~~~~~g~~~~ii~l~~~~ 37 (281) -|.||+.+|...-++|-++|-+.+.=.. T Consensus 39 iG~lPv~Laad~c~kga~Yf~ls~~v~~ 66 (93) T TIGR02620 39 IGTLPVSLAADICKKGARYFELSLNVPA 66 (93) T ss_pred EECCCHHHHHHHHHCCCEEEEEEEECCC T ss_conf 8412757887786469458998740223 No 86 >pfam07085 DRTGG DRTGG domain. This presumed domain is about 120 amino acids in length. It is found associated with CBS domains pfam00571, as well as the CbiA domain pfam01656. The function of this domain is unknown. It is named the DRTGG domain after some of the most conserved residues. This domain may be very distantly related to a pair of CBS domains. There are no significant sequence similarities, but its length and association with CBS domains supports this idea (Bateman A, pers. obs.). Probab=44.01 E-value=25 Score=15.95 Aligned_cols=45 Identities=31% Similarity=0.339 Sum_probs=37.3 Q ss_pred HHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEEECHHH Q ss_conf 999999998499099997397799858999999998796899957754 Q gi|254780768|r 232 AKTVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEAGIFVCGIDREF 279 (281) Q Consensus 232 ~~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~g~~~~~ 279 (281) .+.+..+.+++++|+.+..|... -++++++|++.++.++..++|. T Consensus 51 ~di~~~a~~~~~~~iIlTgg~~p---~~~v~~la~~~~ipii~t~~dT 95 (105) T pfam07085 51 EDIQLAALLAGIAGLILTGGFEP---SEEVLKLAEEAGLPVLSTPYDT 95 (105) T ss_pred HHHHHHHHHHCCCEEEEECCCCC---CHHHHHHHHHCCCEEEEECCCH T ss_conf 89999999824878999489898---9999999987798399966889 No 87 >PRK11706 TDP-4-oxo-6-deoxy-D-glucose transaminase; Provisional Probab=43.65 E-value=25 Score=15.92 Aligned_cols=14 Identities=29% Similarity=0.494 Sum_probs=6.8 Q ss_pred CHHHHHHHHHHCCC Q ss_conf 58999999998796 Q gi|254780768|r 257 EKELVKKHADEAGI 270 (281) Q Consensus 257 d~~~~i~~a~~~~i 270 (281) +|++++++.+++|| T Consensus 301 ~Rd~l~~~L~~~gI 314 (375) T PRK11706 301 DRSALINFLKEAGI 314 (375) T ss_pred HHHHHHHHHHHCCC T ss_conf 89999999998799 No 88 >TIGR01977 am_tr_V_EF2568 cysteine desulfurase family protein; InterPro: IPR010969 This entry describes a subfamily of probable pyridoxal phosphate-dependent enzymes in the aminotransferase class V family. Related families contain members active as cysteine desulphurases, selenocysteine lyases, or both. The members of this family form a distinct clade and all are shorter at the N terminus. The function of members of this family are unknown.. Probab=43.40 E-value=25 Score=15.89 Aligned_cols=135 Identities=20% Similarity=0.208 Sum_probs=86.0 Q ss_pred HCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCC-CCC--CCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEE-- Q ss_conf 10480358999999999769748211222524203-543--3455465323478899899999973326722599981-- Q gi|254780768|r 102 VSGGNAAILKASIDLLESYGVSVVGAHEIVPELLV-QVG--SLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSI-- 176 (281) Q Consensus 102 ~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~-~~G--~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~-- 176 (281) .+.+++..|+ ..-.|...++.+.+||= +.- +....+ . ..+.+++= |+.+.+ T Consensus 32 gRG~~~~~l~--------~~~~v~EtR~~lA~Lf~~~~~~~v~Ft~n----A----T~aLN~~L--------~Gl~lknr 87 (384) T TIGR01977 32 GRGRYRLALR--------ASREVEETRELLAKLFNAPSKAHVVFTNN----A----TTALNIAL--------KGLILKNR 87 (384) T ss_pred CCCCCHHHHH--------CCCHHHHHHHHHHHHHCCCCCCCEEEECC----H----HHHHHHHH--------HCCCCCCC T ss_conf 8887503454--------14228899999999848897462784060----7----79999998--------43300055 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCC-EEEEEECCCCCCCEEEECCCCHHHHHHHHHCC-CEEEEEECCCEE Q ss_conf 9748896253421799999997543123456677-18999348887531210662799999999849-909999739779 Q gi|254780768|r 177 GGRVVALEGIEGTDSMLQRIVDCRNNGRILAGKS-GVLVKMCKSQQDMRADLPSIGAKTVQNVIKAG-LAGIALEAGKSL 254 (281) Q Consensus 177 ~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~-~ilvK~~K~~QD~r~DlP~IG~~Ti~~~~~ag-~~giaiea~~~l 254 (281) .|.=+=+=.+|= ++ -.|- ...++..+ +|-|-..|++||-++| ||-|+.+.+.+ =+-|+|.....+ T Consensus 88 ~g~hVi~t~mEH-Ns-V~RP------L~~L~~~~~g~~~~iv~~~~~G~~~-----~e~i~~~~~~~kTk~iv~~HaSNV 154 (384) T TIGR01977 88 EGDHVITTPMEH-NS-VLRP------LEKLKEERIGVEITIVKCDNEGLIS-----PERIKRAIKKNKTKLIVVSHASNV 154 (384) T ss_pred CCCCEEECCCCH-HH-HHHH------HHHHHHCCCCEEEEEECCCCCCCCC-----HHHHHHHCCCCCCCEEEEECHHHH T ss_conf 999678647602-36-6438------9999735997589986278885216-----799998533788300432200301 Q ss_pred ---EECHHHHH-HHHHHCCCEEE Q ss_conf ---98589999-99998796899 Q gi|254780768|r 255 ---VLEKELVK-KHADEAGIFVC 273 (281) Q Consensus 255 ---ild~~~~i-~~a~~~~i~i~ 273 (281) ++.-||+- ++|.++|++++ T Consensus 155 ~G~ilpiEEIgR~lA~e~g~~~~ 177 (384) T TIGR01977 155 TGTILPIEEIGRELAKENGIFFI 177 (384) T ss_pred CCCCCCHHHHHHHHHHHCCCEEE T ss_conf 03112578999999986797399 No 89 >PRK03562 glutathione-regulated potassium-efflux system protein KefC; Provisional Probab=42.94 E-value=25 Score=15.89 Aligned_cols=146 Identities=13% Similarity=0.080 Sum_probs=77.3 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCC--CCEEEEEHHHHHHHHHHHHHCCCCE---EEEECCCCCCC Q ss_conf 999779803899999999789978999954866731038--8567873899999999998649540---67702423210 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQD--FECRELPLGDFCVLRSILHQYNIGR---IVVAGAIDRRP 79 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~--~~~~~~~ig~ig~li~~Lk~~~i~~---ivmaG~V~krP 79 (281) -||+|.|..=..+++.++.+|.+++++....+......+ +++++=+-.+ .+.|+..|+.+ +|.+ ++ -| T Consensus 402 VII~G~GRvGq~var~L~~~gi~~vviD~d~~~V~~~r~~G~~v~yGDat~----~~vL~~AGi~~Ar~vVia--id-d~ 474 (615) T PRK03562 402 VIIAGFGRFGQIVGRLLLSSGVKMVVLDHDPDHIETLRKFGMKVFYGDATR----MDLLESAGAAKAEVLINA--ID-DP 474 (615) T ss_pred EEEEECCCCHHHHHHHHHHCCCCEEEEECCHHHHHHHHHCCCEEEEECCCC----HHHHHHCCCCCCCEEEEE--EC-CH T ss_conf 899902804699999999789987999799999999996799089768999----999986791406889999--49-89 Q ss_pred CHHHHCCCHHHHHHHHHHHHHH-------HCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHH-H Q ss_conf 0011102603578999999875-------1048035899999999976974821122252420354334554653234-7 Q gi|254780768|r 80 NVQDLCFSIKDSLRISKMIWQL-------VSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDV-K 151 (281) Q Consensus 80 ~~~~l~~D~~~~~~l~k~l~~~-------~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~-~ 151 (281) ....++.+...+. .+-.| +.=...+.+.|.+-+-..+|-..+.+..-.+..--.+..+ + T Consensus 475 ---------~~~~~iv~~~r~~~P~l~IiaRard----~~~~~~L~~~Ga~~vv~Et~essL~l~~~~L~~lG~~~~~a~ 541 (615) T PRK03562 475 ---------QTNLQLTELVKEHFPHLQIIARARD----VDHYIRLRQAGVEKPERETFEGALKSGRLALESLGLGPYEAR 541 (615) T ss_pred ---------HHHHHHHHHHHHHCCCCEEEEEECC----HHHHHHHHHCCCCEEECCHHHHHHHHHHHHHHHCCCCHHHHH T ss_conf ---------9999999999975899869998397----788999997899989666589999999999998099999999 Q ss_pred HH----HHHHHHHHHHHCCCCCE Q ss_conf 88----99899999973326722 Q gi|254780768|r 152 RD----ILAAMKSAEALSELDVG 170 (281) Q Consensus 152 ~d----I~~g~~i~~~l~~~DiG 170 (281) +. -++-.+.++++.++.-- T Consensus 542 ~~~~~fr~~d~~~l~~~~~~~~d 564 (615) T PRK03562 542 ERADRFRRFNIQMVEEMAPHEND 564 (615) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99999999999999998643578 No 90 >COG1492 CobQ Cobyric acid synthase [Coenzyme metabolism] Probab=42.29 E-value=3.3 Score=21.79 Aligned_cols=100 Identities=23% Similarity=0.284 Sum_probs=55.9 Q ss_pred CEEEEECCCCCCCCHHHHCCC---H-HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC-CCEEECHHHCCHHHCC-CCCC Q ss_conf 406770242321000111026---0-35789999998751048035899999999976-9748211222524203-5433 Q gi|254780768|r 67 GRIVVAGAIDRRPNVQDLCFS---I-KDSLRISKMIWQLVSGGNAAILKASIDLLESY-GVSVVGAHEIVPELLV-QVGS 140 (281) Q Consensus 67 ~~ivmaG~V~krP~~~~l~~D---~-~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~-G~~vi~~~~~l~~ll~-~~G~ 140 (281) ..++++|.|+|-=.|..+-=. + ..+....+=..-...+||-++|.--++.+|+. |..+++...|+++.+. .+-- T Consensus 158 apvILV~DIdRGGvfAsl~GT~~lL~~~~r~~VkG~iINkFRGd~~ll~~gik~Le~~tg~~vlGv~P~~~~~~~p~EDS 237 (486) T COG1492 158 APVILVGDIDRGGVFASLYGTLALLPESDRALVKGFIINKFRGDESLLDPGLKWLEELTGVPVLGVLPYLKDALRPAEDS 237 (486) T ss_pred CCEEEEEEECCCCEEEEEEEHHEECCHHHHCCEEEEEEEEECCCHHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCCC T ss_conf 98899995115645417302122368767600147999631798788745999999861974675652555455764202 Q ss_pred CCCCCCCHHH-HHHHHHHHHHHHHHCC Q ss_conf 4554653234-7889989999997332 Q gi|254780768|r 141 LGTCVPNRDV-KRDILAAMKSAEALSE 166 (281) Q Consensus 141 l~~~~p~~~~-~~dI~~g~~i~~~l~~ 166 (281) +.-..++... ..+|..+.--+..++. T Consensus 238 ~~~~~~~~~~~~~~i~Iav~~lp~isN 264 (486) T COG1492 238 LSLEQPKAGGNKRAIRIAVIRLPRISN 264 (486) T ss_pred CCCHHHCCCCCCCCEEEEEECCCCCCC T ss_conf 270001025897754999965787556 No 91 >PRK04301 radA DNA repair and recombination protein RadA; Validated Probab=42.14 E-value=14 Score=17.60 Aligned_cols=25 Identities=8% Similarity=0.058 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHC-CCCCEEEEEEECCEE Q ss_conf 998999999733-267225999819748 Q gi|254780768|r 154 ILAAMKSAEALS-ELDVGQSAVSIGGRV 180 (281) Q Consensus 154 I~~g~~i~~~l~-~~DiGQsvVv~~g~V 180 (281) +..-...+..++ .++ =+||+-|+-. T Consensus 229 L~~~l~~L~~lA~~~n--iaVvvTNQV~ 254 (318) T PRK04301 229 LNKHLHDLLRLADLYN--AAVVVTNQVM 254 (318) T ss_pred HHHHHHHHHHHHHHCC--CEEEEEEEEE T ss_conf 9999999999999859--5799961367 No 92 >PRK05586 biotin carboxylase; Validated Probab=41.69 E-value=27 Score=15.72 Aligned_cols=69 Identities=14% Similarity=0.028 Sum_probs=47.7 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE---------HHHHHHHHHHHHHCCCCEE Q ss_conf 97399997798038999999997899789999548667310388567873---------8999999999986495406 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP---------LGDFCVLRSILHQYNIGRI 69 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~---------ig~ig~li~~Lk~~~i~~i 69 (281) |=|==|||-.|....-+++.|+..|.+.+.+.=........-......+. .-.+..+++..++.+|+-| T Consensus 1 M~~kvLIANRGEIA~Ri~rt~~~lgi~tVavys~~D~~a~hv~~Ade~~~lg~~~~~~sYln~~~ii~~A~~~g~dAi 78 (447) T PRK05586 1 MFKKILIANRGEIAVRIIRACREMGIETVAVYSEIDKDALHVQLADEAVCIGPASSKDSYLNIYNILSATVLTGAQAI 78 (447) T ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHCCEEEECCCCCHHHHHCCHHHHHHHHHHHCCCEE T ss_conf 975499989879999999999983995999917033678547649899873999956511489999999998499899 No 93 >PTZ00325 malate dehydrogenase; Provisional Probab=41.62 E-value=24 Score=16.04 Aligned_cols=104 Identities=10% Similarity=0.087 Sum_probs=58.5 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCC--CEEEEEECCCCCCCCCCCCEEEE--EHH--HHHHHHHHHHHCCCCEEEEECC Q ss_conf 97399997798038999999997899--78999954866731038856787--389--9999999998649540677024 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKND--EPVIASVLNECSFDWQDFECREL--PLG--DFCVLRSILHQYNIGRIVVAGA 74 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~--~~~ii~l~~~~~~~~~~~~~~~~--~ig--~ig~li~~Lk~~~i~~ivmaG~ 74 (281) |.|++||..+|..=-.+|-.+..+|. +.+++.+.. +...-.+..|... ... .-+..-+.++ +++=+|+++. T Consensus 1 m~KV~IIGA~G~VG~s~A~~l~~~~~~~elvL~Di~~-a~g~a~Dl~~~~~~~~~~~~~~~~~~e~~~--~aDIVVitaG 77 (313) T PTZ00325 1 MFKVAVLGAAGGIGQPLSLLLKRNPYVSTLSLYDIVG-APGVAADLSHIPSPAKVTGYAKGELHKAVD--GADVVLIVAG 77 (313) T ss_pred CCEEEEECCCCHHHHHHHHHHHHCCCCCEEEEEECCC-CHHHHHHHHCCCCCCCCCEECCCCHHHHHC--CCCEEEECCC T ss_conf 9389998999869999999998389977799980897-266898886755535665452798889848--9989998889 Q ss_pred CCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCE Q ss_conf 2321000111026035789999998751048035899999999976974 Q gi|254780768|r 75 IDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVS 123 (281) Q Consensus 75 V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~ 123 (281) +.++|.-+ +.| ++. ....+++.++.-+.+.+++ T Consensus 78 ~~rkpg~t--R~d---------Ll~-----~N~~I~~~i~~~i~~~~~~ 110 (313) T PTZ00325 78 VPRKPGMT--RDD---------LFN-----TNAGIVRDLVLACASSAPK 110 (313) T ss_pred CCCCCCCC--HHH---------HHH-----HHHHHHHHHHHHHHHHCCC T ss_conf 88997896--899---------999-----7069999999999976998 No 94 >PRK11064 wecC UDP-N-acetyl-D-mannosamine dehydrogenase; Provisional Probab=40.83 E-value=27 Score=15.63 Aligned_cols=100 Identities=11% Similarity=0.060 Sum_probs=60.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCC--CEEEEEHHHHHHHHHHHHHCC-C--------CEE Q ss_conf 97399997798038999999997899789999548667310388--567873899999999998649-5--------406 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDF--ECRELPLGDFCVLRSILHQYN-I--------GRI 69 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~--~~~~~~ig~ig~li~~Lk~~~-i--------~~i 69 (281) |+||++| |.|-.=.-.|-.+-.+|++++.+....+....+++- ++..-. +..+++.-...+ . .++ T Consensus 3 ~kkI~Vi-GlGYVGL~~a~~lA~~G~~Vig~D~d~~kv~~l~~g~~p~~Ep~---l~~ll~~~~~~~~l~~t~~~~~ad~ 78 (415) T PRK11064 3 FETISVI-GLGYIGLPTAAAFASRQKQVIGVDINQHAVDTINRGEIHIVEPD---LDMVVKAAVEGGYLRATTTPVPADA 78 (415) T ss_pred CCEEEEE-CCCHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHCCCCCCCCCC---HHHHHHHHHHCCCCEEECCHHHCCE T ss_conf 8879998-86877899999999688948999899999999978689988989---8999999986598365257467899 Q ss_pred EEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 7702423210001110260357899999987510480 Q gi|254780768|r 70 VVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGN 106 (281) Q Consensus 70 vmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gD 106 (281) +++= |. .|.-.+-.+|+.....+.+.+....++|+ T Consensus 79 iiI~-V~-TP~~~~~~~Dls~i~~a~~~I~~~l~~~~ 113 (415) T PRK11064 79 FLIA-VP-TPFKGDHEPDLSYVEAAAKSIAPVLKKGD 113 (415) T ss_pred EEEE-CC-CCCCCCCCCCCCHHHHHHHHHHHHCCCCC T ss_conf 9998-89-99789997202018889999997526886 No 95 >pfam06057 VirJ Bacterial virulence protein (VirJ). This family consists of several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localized to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium. Probab=40.80 E-value=27 Score=15.63 Aligned_cols=70 Identities=16% Similarity=0.150 Sum_probs=42.0 Q ss_pred CCEEEE-EEC---CCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHH-HHHCCCCEEEEECC Q ss_conf 973999-977---980389999999978997899995486673103885678738999999999-98649540677024 Q gi|254780768|r 1 MKRLLI-IAG---SGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSI-LHQYNIGRIVVAGA 74 (281) Q Consensus 1 M~kigI-IAG---~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~-Lk~~~i~~ivmaG~ 74 (281) |+++++ ++| +..|-..+++++.++|++++ ++....- -++...-... =..+.+++.. .++.+.++++++|+ T Consensus 1 ~dtlav~~SGDGGW~~lD~~va~~L~~~GvpVv--GvdSLrY-FW~~ktP~q~-a~Dl~~ii~~Y~~~w~~~~v~LiGY 75 (192) T pfam06057 1 SDTVAVFYSGDGGWRDLDKEVGSALQKQGVPVV--GVDSLRY-FWSERTPEEV-ADDLDRIIDTYRKRWKVKNVVLIGY 75 (192) T ss_pred CCEEEEEEECCCCHHHHHHHHHHHHHHCCCCEE--EECHHHH-HHCCCCHHHH-HHHHHHHHHHHHHHHCCCEEEEEEE T ss_conf 977999994688735551999999997798365--5326888-7565998999-9999999999999858965999961 No 96 >COG0151 PurD Phosphoribosylamine-glycine ligase [Nucleotide transport and metabolism] Probab=40.21 E-value=28 Score=15.57 Aligned_cols=151 Identities=17% Similarity=0.162 Sum_probs=96.6 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEH-HHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHH Q ss_conf 9977980389999999978997899995486673103885678738-999999999986495406770242321000111 Q gi|254780768|r 6 IIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPL-GDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDL 84 (281) Q Consensus 6 IIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~i-g~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l 84 (281) |+-|+|-=-..+|.++...+..-.+..-.++.-... ...+...++ .+...+.++-++++++ ++++|.= -|. T Consensus 4 LviGsGGREHAiA~kla~s~~v~~~~~apgN~G~a~-~~~~~~~~~~~~~~~lv~fA~~~~id-l~vVGPE--~pL---- 75 (428) T COG0151 4 LVIGSGGREHALAWKLAQSPLVLYVYVAPGNPGTAL-EAYLVNIEIDTDHEALVAFAKEKNVD-LVVVGPE--APL---- 75 (428) T ss_pred EEECCCCHHHHHHHHHHCCCCEEEEEEECCCCCCCH-HHHHCCCCCCCCHHHHHHHHHHCCCC-EEEECCC--HHH---- T ss_conf 998688169999999840876249999379974211-34433676555889999999973999-8998971--787---- Q ss_pred CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHH----------HCCHHHCCCCCCCCCCCCCHHHHHHH Q ss_conf 02603578999999875104803589999999997697482112----------22524203543345546532347889 Q gi|254780768|r 85 CFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAH----------EIVPELLVQVGSLGTCVPNRDVKRDI 154 (281) Q Consensus 85 ~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~----------~~l~~ll~~~G~l~~~~p~~~~~~dI 154 (281) .-.|.+.|++.|+.+.+++ .|..++|...|+-|...- .-.|. T Consensus 76 -------------------------~~GvvD~l~~~Gi~vFGPsk~AA~lE~SK~faK~fm~k~~IPta~y~---~f~~~ 127 (428) T COG0151 76 -------------------------VAGVVDALRAAGIPVFGPTKAAAQLEGSKAFAKDFMKKYGIPTAEYE---VFTDP 127 (428) T ss_pred -------------------------HHHHHHHHHHCCCCEECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCE---ECCCH T ss_conf -------------------------64468999877996238678898877539999999998199964111---00898 Q ss_pred HHHHHHHHHHCC--------CCCEEEEEEECCEEEEEECCCCHHHHHHH Q ss_conf 989999997332--------67225999819748896253421799999 Q gi|254780768|r 155 LAAMKSAEALSE--------LDVGQSAVSIGGRVVALEGIEGTDSMLQR 195 (281) Q Consensus 155 ~~g~~i~~~l~~--------~DiGQsvVv~~g~ViaiEa~eGTD~mi~R 195 (281) +-+...+++.+. |--|-.|+|... -=||.+-.++|+.. T Consensus 128 e~a~ayi~~~g~piVVKadGLaaGKGV~V~~~---~eeA~~a~~~~l~~ 173 (428) T COG0151 128 EEAKAYIDEKGAPIVVKADGLAAGKGVIVAMT---LEEAEAAVDEMLEG 173 (428) T ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCEEECCC---HHHHHHHHHHHHHH T ss_conf 99999999729988993454467898598277---99999999999740 No 97 >PRK00349 uvrA excinuclease ABC subunit A; Reviewed Probab=39.82 E-value=28 Score=15.53 Aligned_cols=29 Identities=31% Similarity=0.389 Sum_probs=19.9 Q ss_pred EEEECCCCH---------HHHHHHHHCCCEEEEEECCC Q ss_conf 121066279---------99999998499099997397 Q gi|254780768|r 224 RADLPSIGA---------KTVQNVIKAGLAGIALEAGK 252 (281) Q Consensus 224 r~DlP~IG~---------~Ti~~~~~ag~~giaiea~~ 252 (281) =+|=|+||+ ++++.+.+.|=+.|+||.+. T Consensus 514 VLDEPSiGLHprD~~rLi~~L~~Lrd~GNTVlVVEHD~ 551 (944) T PRK00349 514 VLDEPSIGLHQRDNDRLIETLKRLRDLGNTLIVVEHDE 551 (944) T ss_pred EECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCH T ss_conf 93687554798899999999999985798599983568 No 98 >PRK08654 pyruvate carboxylase subunit A; Validated Probab=39.03 E-value=29 Score=15.45 Aligned_cols=67 Identities=16% Similarity=0.215 Sum_probs=45.5 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCC--CCCCCEEEEEH---------HHHHHHHHHHHHCCCCEE Q ss_conf 9739999779803899999999789978999954866731--03885678738---------999999999986495406 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFD--WQDFECRELPL---------GDFCVLRSILHQYNIGRI 69 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~--~~~~~~~~~~i---------g~ig~li~~Lk~~~i~~i 69 (281) |=|==|||-.|....-|+++|+..|++.+.+.= ..|.. ........+.+ -.+.+++...++.+|+-| T Consensus 1 Mf~kvLIANRGEIA~RiiRa~relGi~tVaVyS--~~D~~s~hv~~ADe~~~ig~~~~~~sYLni~~Ii~~A~~~g~dAI 78 (497) T PRK08654 1 MFKKILIANRGEIAIRVMRACRELGIKTVAVYS--EADKNALFVKYADEAYPIGPAPPSKSYLNMERILEVAKKAGAEAI 78 (497) T ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCEEEEECC--HHHHCCCCHHHCCEEEECCCCCCCCCCCCHHHHHHHHHHHCCCEE T ss_conf 976089987879999999999985993999888--378548997888989984898832121689999999998099999 No 99 >PRK00726 murG N-acetylglucosaminyl transferase; Provisional Probab=38.99 E-value=29 Score=15.45 Aligned_cols=76 Identities=18% Similarity=0.279 Sum_probs=45.6 Q ss_pred CCEEEEEECCC---CHH-HHHHHHHHHCCCCEEEEEECCCCCCC-C--CCCCEEEEEH-------------------HHH Q ss_conf 97399997798---038-99999999789978999954866731-0--3885678738-------------------999 Q gi|254780768|r 1 MKRLLIIAGSG---MLP-YYVAKAARLKNDEPVIASVLNECSFD-W--QDFECRELPL-------------------GDF 54 (281) Q Consensus 1 M~kigIIAG~G---~LP-~~ia~~~~~~g~~~~ii~l~~~~~~~-~--~~~~~~~~~i-------------------g~i 54 (281) |+||-|.+|+. .+| ..+|++++++|++++++.-....+.. . ..+++..++. ..+ T Consensus 1 mkkI~i~~GGTGGHi~Palala~~L~~~g~ev~~ig~~~g~E~~~v~~~~~~~~~i~~~~~~~~~~~~~~~~~~~l~~~~ 80 (359) T PRK00726 1 MKKILLAGGGTGGHVFPALALAEELKKRGWEVLWLGTKRGMEARLVPKAGIEFHFIPIGGLRRKGSLANLKAPFKLLKGV 80 (359) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHHHHHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 98899995886899999999999998387989999788268654044149838997778889878799999999999999 Q ss_pred HHHHHHHHHCCCCEEEEE-CCCC Q ss_conf 999999986495406770-2423 Q gi|254780768|r 55 CVLRSILHQYNIGRIVVA-GAID 76 (281) Q Consensus 55 g~li~~Lk~~~i~~ivma-G~V~ 76 (281) -+...++++.+.+-|+-- |+++ T Consensus 81 ~~~~~il~~~kPd~Vig~GGY~s 103 (359) T PRK00726 81 LQARKILKRFKPDVVVGFGGYVS 103 (359) T ss_pred HHHHHHHHHCCCCEEEECCCHHH T ss_conf 99999999749999997897412 No 100 >PRK06123 short chain dehydrogenase; Provisional Probab=38.97 E-value=29 Score=15.44 Aligned_cols=168 Identities=13% Similarity=0.150 Sum_probs=83.1 Q ss_pred CCEEEEEECCCC-HHHHHHHHHHHCCCCEEEEEECCCCCCC-----CCC--CCEEEE--E---HHHHHHHHHHHHHC--C Q ss_conf 973999977980-3899999999789978999954866731-----038--856787--3---89999999999864--9 Q gi|254780768|r 1 MKRLLIIAGSGM-LPYYVAKAARLKNDEPVIASVLNECSFD-----WQD--FECREL--P---LGDFCVLRSILHQY--N 65 (281) Q Consensus 1 M~kigIIAG~G~-LP~~ia~~~~~~g~~~~ii~l~~~~~~~-----~~~--~~~~~~--~---ig~ig~li~~Lk~~--~ 65 (281) |+|++||-|... +=..+++.+-+.|.++++....++...+ .++ ..+..+ + -.++.++++...++ . T Consensus 2 ~nKvalITGas~GIG~aia~~la~~Ga~V~i~~~~~~~~~~~~~~~~~~~g~~~~~~~~Dv~~~~~v~~~~~~~~~~~G~ 81 (249) T PRK06123 2 MRKVMIITGASRGIGAATALLAAERGYAVCLNYLRNRDAAEAVVQAIRRQGGEALAVAADVADEADVLRLFEAVDRELGR 81 (249) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHHHHCC T ss_conf 99889996868799999999999879989998089878999999999964990999984799999999999999998299 Q ss_pred CCEEEEECCC-CCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC----CCEEECHHHCCHHHCCCCCC Q ss_conf 5406770242-321000111026035789999998751048035899999999976----97482112225242035433 Q gi|254780768|r 66 IGRIVVAGAI-DRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESY----GVSVVGAHEIVPELLVQVGS 140 (281) Q Consensus 66 i~~ivmaG~V-~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~----G~~vi~~~~~l~~ll~~~G~ 140 (281) ++-+|---.+ .++..+.++-.+ . ..+.+ ...-++--.+.+.++..+.++ +=.|+....+....-.+ +. T Consensus 82 iDiLVnNAG~~~~~~~~~~~~~~--~---w~~~~-~vNl~~~~~~~~~~~~~m~~~~~g~~g~IInisS~~~~~~~~-~~ 154 (249) T PRK06123 82 LDALVNNAGILEAQMRLEQMDAA--R---LTRIF-ATNVVGSFLCAREAVKRMSTRHGGRGGAIVNVSSMAARLGSP-GE 154 (249) T ss_pred CCEEEECCCCCCCCCCHHHCCHH--H---HHHHH-HHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCHHHCCCCC-CC T ss_conf 87899888557899972129999--9---99998-540699999999999999997089983799974476565898-30 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEC Q ss_conf 4554653234788998999999733267225999819 Q gi|254780768|r 141 LGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIG 177 (281) Q Consensus 141 l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~ 177 (281) +.....+|.. =...-..++.++++..|-=-+|.-+ T Consensus 155 ~~~Y~asKaa--v~~ltr~lA~ela~~gIrvN~IaPG 189 (249) T PRK06123 155 YIDYAASKGA--IDTMTIGLAKEVAAEGIRVNAVRPG 189 (249) T ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHCCEEEEEEEEC T ss_conf 6878999999--9999999999986559699999867 No 101 >COG0105 Ndk Nucleoside diphosphate kinase [Nucleotide transport and metabolism] Probab=38.70 E-value=21 Score=16.41 Aligned_cols=76 Identities=13% Similarity=0.124 Sum_probs=45.1 Q ss_pred HHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCC Q ss_conf 89999999997697482112225242035433455465323478899899999973326722599981974889625342 Q gi|254780768|r 109 ILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEG 188 (281) Q Consensus 109 iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eG 188 (281) +...|+..||+.||+++..+-..-+.-..+++|.-++-......-+++ +..|-|++. -.|| T Consensus 18 LIG~IisrfE~~Glkiva~K~~~~~~e~Ae~~Y~~h~~kpFf~~Lv~f------------------itSgPvv~~-VleG 78 (135) T COG0105 18 LIGEIISRFEKKGLKIVALKMVQLSRELAENHYAEHKGKPFFGELVEF------------------ITSGPVVAM-VLEG 78 (135) T ss_pred HHHHHHHHHHHCCCEEEEEEEECCCHHHHHHHHHHHCCCCCCHHHHHH------------------EECCCEEEE-EEEC T ss_conf 189999999977988876440204799997778987678752877862------------------312658999-9852 Q ss_pred HHHHHHHHHHHHHHCC Q ss_conf 1799999997543123 Q gi|254780768|r 189 TDSMLQRIVDCRNNGR 204 (281) Q Consensus 189 TD~mi~R~~~~~~~~~ 204 (281) .|+ +.+...+....+ T Consensus 79 e~a-i~~~R~l~GaTn 93 (135) T COG0105 79 ENA-ISVVRKLMGATN 93 (135) T ss_pred HHH-HHHHHHHHCCCC T ss_conf 769-999999977998 No 102 >KOG3985 consensus Probab=38.56 E-value=8.5 Score=19.07 Aligned_cols=19 Identities=32% Similarity=0.408 Sum_probs=14.9 Q ss_pred CCEEEEEECCCCHHHHHHH Q ss_conf 9739999779803899999 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAK 19 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~ 19 (281) |=+||||.|+|-.-..+.+ T Consensus 9 ~VklGIIGGsGl~dp~ile 27 (283) T KOG3985 9 TVKLGIIGGSGLYDPDILE 27 (283) T ss_pred EEEEEEECCCCCCCCHHHH T ss_conf 0799875367777831320 No 103 >PRK10669 putative cation:proton antiport protein; Provisional Probab=38.43 E-value=30 Score=15.39 Aligned_cols=28 Identities=11% Similarity=0.144 Sum_probs=13.8 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEEEE Q ss_conf 9977980389999999978997899995 Q gi|254780768|r 6 IIAGSGMLPYYVAKAARLKNDEPVIASV 33 (281) Q Consensus 6 IIAG~G~LP~~ia~~~~~~g~~~~ii~l 33 (281) ||+|.|..=..+++.++.+|.+++++.. T Consensus 421 ii~G~Gr~G~~va~~L~~~~~~~vvid~ 448 (558) T PRK10669 421 LLVGYGRVGSLLGEKLLASGIPLVVIET 448 (558) T ss_pred EEECCCHHHHHHHHHHHHCCCCEEEEEC T ss_conf 9989886699999999987998899989 No 104 >cd00363 PFK Phosphofructokinase, a key regulatory enzyme in glycolysis, catalyzes the phosphorylation of fructose-6-phosphate to fructose-1,6-biphosphate. The members belong to PFK family that includes ATP- and pyrophosphate (PPi)- dependent phosphofructokinases. Some members evolved by gene duplication and thus have a large C-terminal/N-terminal extension comprising a second PFK domain. Generally, ATP-PFKs are allosteric homotetramers, and PPi-PFKs are dimeric and nonallosteric except for plant PPi-PFKs which are allosteric heterotetramers. Probab=38.33 E-value=30 Score=15.38 Aligned_cols=30 Identities=20% Similarity=0.143 Sum_probs=24.1 Q ss_pred CEEEEEECCCCHHH------HHHHHHHHCCCCEEEE Q ss_conf 73999977980389------9999999789978999 Q gi|254780768|r 2 KRLLIIAGSGMLPY------YVAKAARLKNDEPVIA 31 (281) Q Consensus 2 ~kigIIAG~G~LP~------~ia~~~~~~g~~~~ii 31 (281) +||||+-+.|.=|- -+.+.+...|.+++-+ T Consensus 1 krIaIltsGG~~pGmNa~ir~vv~~a~~~g~~v~Gi 36 (338) T cd00363 1 KKIGVLTSGGDAPGMNAAIRGVVRSAIAEGLEVYGI 36 (338) T ss_pred CEEEEECCCCCCHHHHHHHHHHHHHHHHCCCEEEEE T ss_conf 959998688886688899999999999779999998 No 105 >COG0399 WecE Predicted pyridoxal phosphate-dependent enzyme apparently involved in regulation of cell wall biogenesis [Cell envelope biogenesis, outer membrane] Probab=38.14 E-value=30 Score=15.36 Aligned_cols=141 Identities=21% Similarity=0.184 Sum_probs=68.8 Q ss_pred HHHHHHHCCCCEEEEEECCCC---CCCC-----CC--CCEEEEEH----HHHHHHHHHHHHCCCCEEE------------ Q ss_conf 999999789978999954866---7310-----38--85678738----9999999999864954067------------ Q gi|254780768|r 17 VAKAARLKNDEPVIASVLNEC---SFDW-----QD--FECRELPL----GDFCVLRSILHQYNIGRIV------------ 70 (281) Q Consensus 17 ia~~~~~~g~~~~ii~l~~~~---~~~~-----~~--~~~~~~~i----g~ig~li~~Lk~~~i~~iv------------ 70 (281) -++.....|.+|+++.+...+ |++. .. ..+..+.+ -++..+.++-+++++-=|- T Consensus 87 Tan~i~~~Ga~PVFvDid~~T~nid~~~ie~aIt~~tKAIipVhl~G~~~dm~~i~~la~~~~l~vIEDaAqa~Ga~y~g 166 (374) T COG0399 87 TANAVLLVGAKPVFVDIDPDTLNIDPDLIEAAITPRTKAIIPVHLAGQPCDMDAIMALAKRHGLPVIEDAAQAHGATYKG 166 (374) T ss_pred HHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHCCCCCEEEEEEHHCCCCCCHHHHHHHHHHCCCEEEEECCHHCCCEECC T ss_conf 89999976996799964764147799999987155774999851216877989999999873984998760320672358 Q ss_pred -EECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCC----CHHHHHHHHHHHHHCCCEE----------ECHHHCCHHHC Q ss_conf -70242321000111026035789999998751048----0358999999999769748----------21122252420 Q gi|254780768|r 71 -VAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGG----NAAILKASIDLLESYGVSV----------VGAHEIVPELL 135 (281) Q Consensus 71 -maG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~g----Dd~iL~~i~~~fe~~G~~v----------i~~~~~l~~ll 135 (281) ++|.+..--. |++.+ .|.+-. ..+| ||.-|-..+..+..+|... ++.+.=+.++. T Consensus 167 k~vGt~Gd~~~-----fSF~~----~K~itt-gEGGav~tnd~ela~k~~~lr~hG~~~~~~~~y~~~~~G~N~rm~~iq 236 (374) T COG0399 167 KKVGSFGDIGA-----FSFHA----TKNLTT-GEGGAVVTNDEELAEKARSLRNHGLSRDAVFKYLHEELGYNYRLTEIQ 236 (374) T ss_pred CCCCCCCCEEE-----EEECC----CCCCCC-CCCEEEEECCHHHHHHHHHHHHHCCCCCCCCCCEEEECCCCCCHHHHH T ss_conf 42556664689-----98658----776556-675079838899999999999837677866564352133003877999 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 35433455465323478899899999973326 Q gi|254780768|r 136 VQVGSLGTCVPNRDVKRDILAAMKSAEALSEL 167 (281) Q Consensus 136 ~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~ 167 (281) +.-|...-.+.++...+--+.+....+.+..+ T Consensus 237 AAigl~QL~~l~~~~~~R~~~a~~Y~~~l~~~ 268 (374) T COG0399 237 AAIGLAQLERLDEINERRREIAQIYAEALKGL 268 (374) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999999888999999999999999986107 No 106 >PRK00081 coaE dephospho-CoA kinase; Reviewed Probab=38.12 E-value=23 Score=16.17 Aligned_cols=29 Identities=17% Similarity=-0.016 Sum_probs=24.0 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEE Q ss_conf 97399997798038999999997899789 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPV 29 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ 29 (281) |..+||..|-|.==-.+++.+...|++++ T Consensus 2 m~~IgiTG~igsGKStv~~~l~~~G~~vi 30 (199) T PRK00081 2 MLIIGLTGGIGSGKSTVANIFAELGVPVI 30 (199) T ss_pred CEEEEEECCCCCCHHHHHHHHHHCCCCEE T ss_conf 78999578887779999999998899399 No 107 >cd02022 DPCK Dephospho-coenzyme A kinase (DPCK, EC 2.7.1.24) catalyzes the phosphorylation of dephosphocoenzyme A (dCoA) to yield CoA, which is the final step in CoA biosynthesis. Probab=37.97 E-value=24 Score=16.04 Aligned_cols=26 Identities=19% Similarity=0.055 Sum_probs=13.9 Q ss_pred EEEEECCCCHHHHHHHHHHHCCCCEE Q ss_conf 99997798038999999997899789 Q gi|254780768|r 4 LLIIAGSGMLPYYVAKAARLKNDEPV 29 (281) Q Consensus 4 igIIAG~G~LP~~ia~~~~~~g~~~~ 29 (281) |||-.|-|+==-.+++.+++.|++++ T Consensus 2 igiTG~igSGKStv~~~l~~~g~~v~ 27 (179) T cd02022 2 IGLTGGIGSGKSTVAKLLKELGIPVI 27 (179) T ss_pred EEEECCCCCCHHHHHHHHHHCCCEEE T ss_conf 88637875789999999998799099 No 108 >pfam01121 CoaE Dephospho-CoA kinase. This family catalyses the phosphorylation of the 3'-hydroxyl group of dephosphocoenzyme A to form Coenzyme A EC:2.7.1.24. This enzyme uses ATP in its reaction. Probab=37.21 E-value=25 Score=15.95 Aligned_cols=26 Identities=19% Similarity=-0.055 Sum_probs=17.2 Q ss_pred EEEEECCCCHHHHHHHHHHHCCCCEE Q ss_conf 99997798038999999997899789 Q gi|254780768|r 4 LLIIAGSGMLPYYVAKAARLKNDEPV 29 (281) Q Consensus 4 igIIAG~G~LP~~ia~~~~~~g~~~~ 29 (281) |||-.|-|.==-.+++.++..|++++ T Consensus 3 IgiTG~IgsGKStv~~~l~~~G~~vi 28 (179) T pfam01121 3 VGLTGGIGSGKSTVANLFADLGVPIV 28 (179) T ss_pred EEEECCCCCCHHHHHHHHHHCCCCEE T ss_conf 99857864789999999998799199 No 109 >TIGR00315 cdhB CO dehydrogenase/acetyl-CoA synthase complex, epsilon subunit; InterPro: IPR003704 Carbon monoxide dehydrogenase (Cdh) from Methanosarcina frisia Go1 is a Ni2+-, Fe2+-, and S2-containing alpha2beta2 heterotetramer . The CO dehydrogenase enzyme complex from Methanosarcina thermophila contains a corrinoid/iron-sulphur enzyme composed of two subunits (delta and gamma) . This family consists of carbon monoxide dehydrogenase I/II beta subunit 1.2.99.2 from EC and CO dehydrogenase (acetyl-CoA synthase epsilon subunit).; GO: 0003824 catalytic activity, 0008152 metabolic process. Probab=37.18 E-value=26 Score=15.80 Aligned_cols=18 Identities=17% Similarity=0.037 Sum_probs=10.4 Q ss_pred CHHHHHHHHHHHHHCCCE Q ss_conf 035899999999976974 Q gi|254780768|r 106 NAAILKASIDLLESYGVS 123 (281) Q Consensus 106 Dd~iL~~i~~~fe~~G~~ 123 (281) |=.|+-.++.|.-.++.. T Consensus 106 Dlvlf~G~~YYy~sq~l~ 123 (170) T TIGR00315 106 DLVLFLGIIYYYLSQMLS 123 (170) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 588888779899999998 No 110 >TIGR00630 uvra excinuclease ABC, A subunit; InterPro: IPR004602 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). During the process of Escherichia coli nucleotide excision repair, DNA damage recognition and processing are achieved by the action of the uvrA, uvrB, and uvrC gene products . The UvrC protein contain 4 conserved regions: a central region which interact with UvrB (Uvr domain), a Helix hairpin Helix (HhH) domain important for 5 prime incision of damage DNA and the homology regions 1 and 2 of unknown function. UvrC homology region 2 is specific for UvrC proteins, whereas UvrC homology region 1 is also shared by few other nucleases.; GO: 0009381 excinuclease ABC activity, 0006289 nucleotide-excision repair, 0009380 excinuclease repair complex. Probab=36.26 E-value=32 Score=15.17 Aligned_cols=43 Identities=37% Similarity=0.458 Sum_probs=34.3 Q ss_pred CCCEEEEEECCCCCCCEEEECCCCH---------HHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHH Q ss_conf 6771899934888753121066279---------9999999849909999739779985899999999 Q gi|254780768|r 208 GKSGVLVKMCKSQQDMRADLPSIGA---------KTVQNVIKAGLAGIALEAGKSLVLEKELVKKHAD 266 (281) Q Consensus 208 ~~~~ilvK~~K~~QD~r~DlP~IG~---------~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~ 266 (281) +--|||+= +|=|.||+ +|++.+-.-|=+.|+||.+ +++|..|| T Consensus 536 ~L~GVLYV---------LDEPSIGLHqrDN~rLI~TL~~LRDLGNTliVVEHD-------e~Ti~~AD 587 (956) T TIGR00630 536 GLTGVLYV---------LDEPSIGLHQRDNERLINTLKRLRDLGNTLIVVEHD-------EETIRAAD 587 (956) T ss_pred CEEEEEEE---------ECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC-------HHHHHHCC T ss_conf 12567888---------417888887345689999999985039879999174-------79985579 No 111 >PRK08057 cobalt-precorrin-6x reductase; Reviewed Probab=35.86 E-value=33 Score=15.12 Aligned_cols=69 Identities=22% Similarity=0.177 Sum_probs=45.4 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCC-CCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 97399997798038999999997899789999548667310-38856787389999999999864954067702 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDW-QDFECRELPLGDFCVLRSILHQYNIGRIVVAG 73 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~-~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG 73 (281) |.||-|++|.++=..+ ++.+. +++ +++.+.+...... ...+...-+++....+.+++++++|+-||=|- T Consensus 1 m~~IlilgGT~Egr~l-a~~L~--~~~-~~~s~ag~~~~~~~~~~~i~~G~~~~~~~l~~~l~~~~i~~VIDAT 70 (241) T PRK08057 1 MPRILLLGGTSEARAL-ARALA--PDD-TVTSLAGRTLKPADLPGPVRVGGFGGAEGLAAYLREEGIDLVVDAT 70 (241) T ss_pred CCEEEEEECHHHHHHH-HHHHH--CCC-EEEEEEEECCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECC T ss_conf 9659999670899999-99974--899-8999853025565688767988889999999999967998999899 No 112 >PRK09234 fbiC FO synthase; Reviewed Probab=35.24 E-value=33 Score=15.06 Aligned_cols=226 Identities=13% Similarity=0.117 Sum_probs=118.2 Q ss_pred HCCCCEEEEEECCCCCCCCC--------------CCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCH Q ss_conf 78997899995486673103--------------8856787389999999999864954067702423210001110260 Q gi|254780768|r 23 LKNDEPVIASVLNECSFDWQ--------------DFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSI 88 (281) Q Consensus 23 ~~g~~~~ii~l~~~~~~~~~--------------~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~ 88 (281) ..|..+..+.=.+.+-.+.- +-+-+.++..++.+-...-...|+++|+|.|+++ |.+. +++ T Consensus 521 ~~GD~VTyVvNRNINyTNVC~~~C~FCAF~r~~~~~~aY~ls~eeI~~r~~EA~~~GaTEV~iqGGih--P~l~---~~~ 595 (846) T PRK09234 521 VVGDDVTYVVNRNINFTNICYTGCRFCAFAQRKGDADAYSLSLDEVADRAWEAWVAGATEVCMQGGID--PELP---GTG 595 (846) T ss_pred HCCCEEEEEEECCCCHHHHHHCCCCCCCCCCCCCCCCCEECCHHHHHHHHHHHHHCCCEEEEECCCCC--CCCC---HHH T ss_conf 71884799840676388775517973514478899876118999999999999976987998347879--8998---789 Q ss_pred HHH-HHH---------------HHHHHHHHCCCCHHHHHHHHHHHHHCCCEEEC--HHHCCHHHCCCCCCCCCCCCCHHH Q ss_conf 357-899---------------99998751048035899999999976974821--122252420354334554653234 Q gi|254780768|r 89 KDS-LRI---------------SKMIWQLVSGGNAAILKASIDLLESYGVSVVG--AHEIVPELLVQVGSLGTCVPNRDV 150 (281) Q Consensus 89 ~~~-~~l---------------~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~--~~~~l~~ll~~~G~l~~~~p~~~~ 150 (281) ..+ .+. ..+...... -.--++.+...+.+.|..-++ .-+++.+-.= -.++..|.+.++ T Consensus 596 Y~di~r~iK~~~P~ihihAFSp~EI~~~A~~--~g~s~~E~L~~LkeAGL~SlPGggAEIL~d~VR--~~Icp~K~~~~~ 671 (846) T PRK09234 596 YADLVRAVKARVPSMHVHAFSPMEIANGATR--SGLSIREWLTALREAGLDTIPGTAAEILDDEVR--WVLTKGKLPTAE 671 (846) T ss_pred HHHHHHHHHHHCCCCEEECCCHHHHHHHHHH--CCCCHHHHHHHHHHCCCCCCCCCCCHHCCHHHH--HHHCCCCCCHHH T ss_conf 9999999998689870450899999999998--299999999999980977799974132587999--976888888999 Q ss_pred HHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEEC------------- Q ss_conf 7889989999997332672259998197488962534217999999975431234566771899934------------- Q gi|254780768|r 151 KRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMC------------- 217 (281) Q Consensus 151 ~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~------------- 217 (281) |.+|-.. +.++| +--+.-+-=|+| |..|--=.-+.|..++.+..+ .---||=.| T Consensus 672 Wlev~~~---AH~lG---l~TtATMmyGHv---Et~e~rv~HL~~lR~lQdeTG----GFteFIPL~F~~~~tpl~~~g~ 738 (846) T PRK09234 672 WIEVVTT---AHEVG---LRSSSTMMYGHV---DTPRHWVAHLRVLRDIQDRTG----GFTEFVPLPFVHQSAPLYLAGA 738 (846) T ss_pred HHHHHHH---HHHCC---CCCCCEEECCCC---CCHHHHHHHHHHHHHHHHHHC----CEEEEEECCCCCCCCCHHHCCC T ss_conf 9999999---99859---975212435677---999999999999999998759----9559974675678880332268 Q ss_pred -CCCCC----C-----------------EEEECCCCHHHHHHHHHCCCEEE---------EEECCCEE--EECHHHHHHH Q ss_conf -88875----3-----------------12106627999999998499099---------99739779--9858999999 Q gi|254780768|r 218 -KSQQD----M-----------------RADLPSIGAKTVQNVIKAGLAGI---------ALEAGKSL--VLEKELVKKH 264 (281) Q Consensus 218 -K~~QD----~-----------------r~DlP~IG~~Ti~~~~~ag~~gi---------aiea~~~l--ild~~~~i~~ 264 (281) +++-. + ..--.++|+++-+.+-.+|.+=+ .=.||..- -+..+++..+ T Consensus 739 ~r~gpT~~e~l~~~AvsRL~L~dnI~nIQasWVklG~~~aq~aL~~GaNDlGGTlmeE~I~~aAGa~~g~~~t~~el~~l 818 (846) T PRK09234 739 ARPGPTHRDNRAVHALARILLHGRIDNIQTSWVKLGVEGTRVMLRGGANDLGGTLMEETISRMAGSEHGSAKTVAELEAI 818 (846) T ss_pred CCCCCCHHHHHHHHHHHHHHHHCCCCCEECCCHHCCHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHHH T ss_conf 87899889999999999997226886726150016799999999659977755611113311126898877999999999 Q ss_pred HHHCCC Q ss_conf 998796 Q gi|254780768|r 265 ADEAGI 270 (281) Q Consensus 265 a~~~~i 270 (281) +...|- T Consensus 819 I~~aGr 824 (846) T PRK09234 819 AEGAGR 824 (846) T ss_pred HHHCCC T ss_conf 998599 No 113 >COG0351 ThiD Hydroxymethylpyrimidine/phosphomethylpyrimidine kinase [Coenzyme metabolism] Probab=35.14 E-value=33 Score=15.05 Aligned_cols=54 Identities=19% Similarity=0.270 Sum_probs=41.0 Q ss_pred EECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEE Q ss_conf 82112225242035433455465323478899899999973326722599981974889 Q gi|254780768|r 124 VVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVA 182 (281) Q Consensus 124 vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~Via 182 (281) +++.-+++-.-+++...+++. +.-....|+..+.+++.++|. |.|+|++|+.-. T Consensus 129 LlP~a~vvTPNl~EA~~L~g~-~~i~~~~d~~~a~~~i~~~g~----~~VliKGGH~~~ 182 (263) T COG0351 129 LLPLATVVTPNLPEAEALSGL-PKIKTEEDMKEAAKLLHELGA----KAVLIKGGHLEG 182 (263) T ss_pred HHCCCEEECCCHHHHHHHCCC-CCCCCHHHHHHHHHHHHHHCC----CEEEECCCCCCC T ss_conf 515464956998999997199-766999999999999998389----989994898888 No 114 >pfam03721 UDPG_MGDP_dh_N UDP-glucose/GDP-mannose dehydrogenase family, NAD binding domain. The UDP-glucose/GDP-mannose dehydrogenaseses are a small group of enzymes which possesses the ability to catalyse the NAD-dependent 2-fold oxidation of an alcohol to an acid without the release of an aldehyde intermediate. Probab=34.67 E-value=34 Score=15.00 Aligned_cols=97 Identities=12% Similarity=0.057 Sum_probs=53.1 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCC--CEEEEEHHHHHHHHHHHHH-------------CCCC Q ss_conf 399997798038999999997899789999548667310388--5678738999999999986-------------4954 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDF--ECRELPLGDFCVLRSILHQ-------------YNIG 67 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~--~~~~~~ig~ig~li~~Lk~-------------~~i~ 67 (281) ||++| |-|-+=.-.|-.+..+|++++.+.........+.+- +...-.+. ++++.... .++ T Consensus 2 kI~Vi-GlGyVGl~~a~~la~~G~~V~g~D~d~~~v~~ln~g~~p~~E~~l~---~~l~~~~~~~~~~~~~~~~~i~~~- 76 (185) T pfam03721 2 RIAVI-GLGYVGLPTAVCLAEIGHDVVGVDINQSKIDKLNNGKIPIYEPGLE---ELLKANVSGRLRFTTDVAEAIKEA- 76 (185) T ss_pred EEEEE-CCCHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHCCCCCCCCCCHH---HHHHHHHCCCEEEECCHHHHHHHC- T ss_conf 79998-9787489999999948993999979989999986268974675889---999873408969987879988449- Q ss_pred EEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCC Q ss_conf 067702423210001110260357899999987510480 Q gi|254780768|r 68 RIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGN 106 (281) Q Consensus 68 ~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gD 106 (281) +++++- |. .|.-.+-.+|+.....+.+.+.+..++|+ T Consensus 77 d~i~I~-Vp-TP~~~~~~~d~s~l~~~~~~i~~~l~~~~ 113 (185) T pfam03721 77 DVIFIA-VP-TPSKKGGAPDLTYVESAARTIGPVLKKGK 113 (185) T ss_pred CEEEEE-CC-CCCCCCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 899997-36-87655767663599999999974467998 No 115 >pfam11731 Cdd1 Pathogenicity locus. Cdd1 is expressed as part of the pathogenicity locus operon in several different orders of bacteria. Many members of the family are annotated as being putative mitomycin resistance proteins but this could not be confirmed. Probab=34.65 E-value=19 Score=16.68 Aligned_cols=23 Identities=35% Similarity=0.570 Sum_probs=16.3 Q ss_pred EEECCCCHHHHHHHHHCCCEEEE Q ss_conf 21066279999999984990999 Q gi|254780768|r 225 ADLPSIGAKTVQNVIKAGLAGIA 247 (281) Q Consensus 225 ~DlP~IG~~Ti~~~~~ag~~gia 247 (281) -|+|-||+.|.+-+..-|+..++ T Consensus 15 ~~lPnIG~a~a~DL~~LGi~~~~ 37 (92) T pfam11731 15 TDLPNIGKATAKDLRLLGINSPA 37 (92) T ss_pred HCCCCCCHHHHHHHHHHCCCCHH T ss_conf 41897469999999991899899 No 116 >PRK12845 3-ketosteroid-delta-1-dehydrogenase; Reviewed Probab=34.38 E-value=34 Score=14.97 Aligned_cols=26 Identities=27% Similarity=0.240 Sum_probs=18.0 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEEE Q ss_conf 997798038999999997899789999 Q gi|254780768|r 6 IIAGSGMLPYYVAKAARLKNDEPVIAS 32 (281) Q Consensus 6 IIAG~G~LP~~ia~~~~~~g~~~~ii~ 32 (281) ||.|+|. =..-|-.+..+|.+|+++- T Consensus 20 vVVGsGA-Gl~AAi~Aae~G~~VivlE 45 (566) T PRK12845 20 LVVGSGT-GMAAALAAHELGLSVLIVE 45 (566) T ss_pred EEECHHH-HHHHHHHHHHCCCCEEEEE T ss_conf 9988469-9999999998899389996 No 117 >PRK01184 hypothetical protein; Provisional Probab=34.24 E-value=29 Score=15.42 Aligned_cols=30 Identities=20% Similarity=0.070 Sum_probs=19.9 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEE Q ss_conf 973999977980389999999978997899 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVI 30 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~i 30 (281) |.-|||..|-|.==-.+++-++..|++++- T Consensus 1 M~iIGlTG~iGSGKstva~i~~e~G~~vi~ 30 (183) T PRK01184 1 MMIIIVTGMPGSGKGEFSKIARELGIPVVV 30 (183) T ss_pred CEEEEEECCCCCCHHHHHHHHHHCCCEEEE T ss_conf 939999689988789999999977993998 No 118 >pfam06948 consensus Probab=33.94 E-value=35 Score=14.92 Aligned_cols=61 Identities=15% Similarity=0.117 Sum_probs=44.0 Q ss_pred CCEEEEEECCCC------HHHHHHHHHHHCC--CCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCE Q ss_conf 973999977980------3899999999789--9789999548667310388567873899999999998649540 Q gi|254780768|r 1 MKRLLIIAGSGM------LPYYVAKAARLKN--DEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGR 68 (281) Q Consensus 1 M~kigIIAG~G~------LP~~ia~~~~~~g--~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ 68 (281) |.|+.+|-=+|+ ++...|..++++| .++-++.+ |-...-..+.+ +.+.+++..|++.|+.- T Consensus 1 M~Kl~iiisS~d~~~k~~~~l~~A~ns~k~~w~edVkvi~f-GPSekllae~d------~~~~~~l~el~~~gv~~ 69 (112) T pfam06948 1 MAKLLVLISSGDNEEKFLTALMFAYNSKKNKWFEDVKVIFF-GPSEKLLAEGD------QEVLKILSELLQLGETP 69 (112) T ss_pred CCEEEEEEECCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEE-CCCHHHHHCCC------HHHHHHHHHHHHCCCEE T ss_conf 95189999759888999999999999877166133799998-85689884486------99999999999869816 No 119 >PRK01130 N-acetylmannosamine-6-phosphate 2-epimerase; Provisional Probab=33.87 E-value=27 Score=15.66 Aligned_cols=137 Identities=15% Similarity=0.149 Sum_probs=63.8 Q ss_pred CCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHH---HHH Q ss_conf 97899995486673103885678738999999999986495406770242321000111026035789999998---751 Q gi|254780768|r 26 DEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIW---QLV 102 (281) Q Consensus 26 ~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~---~~~ 102 (281) .++.||++... ++++.+++.-+ --+-...|-..|++=|.+=.--.+||.-.. ...+.+-+- ... T Consensus 55 v~lPIIGi~K~---~~~~s~VyITP---t~~ev~~l~~aGadiIA~DaT~R~RP~g~~-------~~~~i~~i~~~~~~l 121 (222) T PRK01130 55 VDVPIIGIIKR---DYPDSEVYITP---TLKEVDALAAAGADIIALDATLRPRPDGET-------LAELVKRIKEKPGQL 121 (222) T ss_pred CCCCEEEEEEC---CCCCCCEEECC---CHHHHHHHHHCCCCEEEEECCCCCCCCCCC-------HHHHHHHHHHHHCCE T ss_conf 79987999954---68999737517---699999999869999998467898989968-------999999999982987 Q ss_pred CCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCC-CCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEE Q ss_conf 048035899999999976974821122252420354334554-6532347889989999997332672259998197488 Q gi|254780768|r 103 SGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTC-VPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVV 181 (281) Q Consensus 103 ~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~-~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~Vi 181 (281) -.-|=+-+..- ..=++.|+.+++.+-. ..-..|.. .++. -|.+ +++.+...++ -|+ T Consensus 122 ~MAD~st~eea-~~A~~~G~D~V~TTLs------GYT~~t~~~~~~~---pD~~----lv~~l~~~~~---pvI------ 178 (222) T PRK01130 122 LMADCSTLEEG-LAAAKLGFDFIGTTLS------GYTEYTEGETPEE---PDFA----LLKELLKAGC---PVI------ 178 (222) T ss_pred EEEECCCHHHH-HHHHHCCCCEEECCCC------CCCCCCCCCCCCC---CCHH----HHHHHHHCCC---CEE------ T ss_conf 89854889999-9999849999972334------5676767787899---8699----9999995899---899------ Q ss_pred EEECCCCHHHHHHHHHHH Q ss_conf 962534217999999975 Q gi|254780768|r 182 ALEGIEGTDSMLQRIVDC 199 (281) Q Consensus 182 aiEa~eGTD~mi~R~~~~ 199 (281) .|+---|-++.+++.++ T Consensus 179 -aEGri~tPe~a~~al~~ 195 (222) T PRK01130 179 -AEGRINTPEQAKKALEL 195 (222) T ss_pred -EECCCCCHHHHHHHHHC T ss_conf -74798999999999984 No 120 >PRK05678 succinyl-CoA synthetase subunit alpha; Validated Probab=33.66 E-value=35 Score=14.89 Aligned_cols=139 Identities=14% Similarity=0.088 Sum_probs=66.5 Q ss_pred EEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCC-CCCCC-----------------CCCEEEEEHHHHHHHHHHHHHCC Q ss_conf 9999779803899999999789978999954866-73103-----------------88567873899999999998649 Q gi|254780768|r 4 LLIIAGSGMLPYYVAKAARLKNDEPVIASVLNEC-SFDWQ-----------------DFECRELPLGDFCVLRSILHQYN 65 (281) Q Consensus 4 igIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~-~~~~~-----------------~~~~~~~~ig~ig~li~~Lk~~~ 65 (281) +-+..=.|..-..-++.+..-|-+++. +..... -.... |....+++-...-..+...-..+ T Consensus 11 v~vqGiTG~~g~~h~~~m~~yGT~iVa-GVtPgkgG~~~~gvPVf~tV~eAv~~~~~d~svIfVPp~~a~dAi~EAi~ag 89 (289) T PRK05678 11 VIVQGITGKQGTFHTEQMLAYGTKIVG-GVTPGKGGTTHLGLPVFNTVKEAVEATGATASVIYVPPPFAADAILEAIDAG 89 (289) T ss_pred EEEECCCCCHHHHHHHHHHHHCCCEEE-EECCCCCCEEEECCCCCCCHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHCC T ss_conf 999878883777999999986895699-9779978517658333065999986128886999707899999999998679 Q ss_pred CCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHH------------CCHH Q ss_conf 5406770242321000111026035789999998751048035899999999976974821122------------2524 Q gi|254780768|r 66 IGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHE------------IVPE 133 (281) Q Consensus 66 i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~------------~l~~ 133 (281) |+.+|.+=- .-|-. | ...+.++.++.|.++++++. ++|. T Consensus 90 I~~iV~ITE--giP~~------------------------D---~~~i~~~a~~~g~riIGPNc~GiI~Pg~~kiGi~p~ 140 (289) T PRK05678 90 IDLIVCITE--GIPVL------------------------D---MLEVKRYLKGKKTRLIGPNCPGIITPGECKIGIMPG 140 (289) T ss_pred CCEEEEECC--CCCHH------------------------H---HHHHHHHHHHCCCEEECCCCCEEECCCCCCCCCCCC T ss_conf 988999438--99788------------------------8---999999873079889889997016677240123562 Q ss_pred HCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECC Q ss_conf 203543345546532347889989999997332672259998197 Q gi|254780768|r 134 LLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGG 178 (281) Q Consensus 134 ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g 178 (281) .+..+|.++-..-| =....+++..+....+|||.+|.=| T Consensus 141 ~~~~pG~VgiiSrS------GtLtye~~~~l~~~g~G~S~~VgiG 179 (289) T PRK05678 141 HIHKKGRVGIVSRS------GTLTYEAVAQLTDLGLGQSTCVGIG 179 (289) T ss_pred CCCCCCCEEEEECC------CCHHHHHHHHHHHCCCCEEEEEECC T ss_conf 24779868999777------6579999999997698669898527 No 121 >pfam01784 NIF3 NIF3 (NGG1p interacting factor 3). This family contains several NIF3 (NGG1p interacting factor 3) protein homologues. NIF3 interacts with the yeast transcriptional coactivator NGG1p which is part of the ADA complex, the exact function of this interaction is unknown. Probab=33.59 E-value=35 Score=14.89 Aligned_cols=84 Identities=15% Similarity=0.137 Sum_probs=36.0 Q ss_pred EEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEE-ECCCCHHHHHHHHHCCCEEEEEECCCEEEEC Q ss_conf 488962534217999999975431234566771899934888753121-0662799999999849909999739779985 Q gi|254780768|r 179 RVVALEGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRAD-LPSIGAKTVQNVIKAGLAGIALEAGKSLVLE 257 (281) Q Consensus 179 ~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~D-lP~IG~~Ti~~~~~ag~~giaiea~~~lild 257 (281) ++--++...-.+++++++.+.... ..+-+-...+.+-.|+- .|-=|-.-++.+.++|+.... .|.. T Consensus 134 ~ig~l~~~~~~~~~~~~vk~~l~~-------~~vr~~g~~~~~i~~Vai~~GsG~~~i~~a~~~gaD~~I--TGd~---- 200 (238) T pfam01784 134 RIGELPEPISLEDLAKRVKERLGL-------EGIRVVGDLDKPVKKVAICGGSGSSLIEEAKAKGADLLI--TGEL---- 200 (238) T ss_pred EEEECCCCCCHHHHHHHHHHHCCC-------CCEEEECCCCCCEEEEEEECCCCHHHHHHHHHCCCCEEE--ECCC---- T ss_conf 999839996899999999987299-------978996688873768999746667789998757996999--7688---- Q ss_pred HHHHHHHHHHCCCEEEEE Q ss_conf 899999999879689995 Q gi|254780768|r 258 KELVKKHADEAGIFVCGI 275 (281) Q Consensus 258 ~~~~i~~a~~~~i~i~g~ 275 (281) +......|...|++++-+ T Consensus 201 k~h~~~~a~~~gi~li~~ 218 (238) T pfam01784 201 KHHDALDAKEKGINLIDA 218 (238) T ss_pred CHHHHHHHHHCCCEEEEC T ss_conf 779999998779989976 No 122 >cd03110 Fer4_NifH_child This protein family's function is unkown. It contains nucleotide binding site. It uses NTP as energy source to transfer electron or ion. Probab=33.27 E-value=36 Score=14.86 Aligned_cols=29 Identities=14% Similarity=0.064 Sum_probs=17.2 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 43345546532347889989999997332 Q gi|254780768|r 138 VGSLGTCVPNRDVKRDILAAMKSAEALSE 166 (281) Q Consensus 138 ~G~l~~~~p~~~~~~dI~~g~~i~~~l~~ 166 (281) ...+--..|+.....|..+..+++++++. T Consensus 116 d~~iiVttP~~~si~d~~r~i~l~~~~~~ 144 (179) T cd03110 116 DAALLVTEPTPSGLHDLERAVELVRHFGI 144 (179) T ss_pred CEEEEECCCCHHHHHHHHHHHHHHHHCCC T ss_conf 94999819947899999999999998299 No 123 >PRK08356 hypothetical protein; Provisional Probab=32.87 E-value=36 Score=14.81 Aligned_cols=25 Identities=20% Similarity=0.367 Sum_probs=13.1 Q ss_pred HHHHHHHHCCCEEECHHHCCHHHCC Q ss_conf 9999999769748211222524203 Q gi|254780768|r 112 ASIDLLESYGVSVVGAHEIVPELLV 136 (281) Q Consensus 112 ~i~~~fe~~G~~vi~~~~~l~~ll~ 136 (281) .++++|++.||.+++..+.+-+++. T Consensus 20 tva~~l~~~G~~~~s~sd~lrd~~~ 44 (195) T PRK08356 20 TVAKFLEELGFCRISCSEPLIDILT 44 (195) T ss_pred HHHHHHHHCCCEEEECCHHHHHHHH T ss_conf 9999999869928842278999984 No 124 >pfam04131 NanE Putative N-acetylmannosamine-6-phosphate epimerase. This family represents a putative ManNAc-6-P-to-GlcNAc-6P epimerase in the N-acetylmannosamine (ManNAc) utilisation pathway found mainly in pathogenic bacteria. Probab=32.86 E-value=28 Score=15.53 Aligned_cols=138 Identities=22% Similarity=0.240 Sum_probs=65.5 Q ss_pred HCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 78997899995486673103885678738999999999986495406770242321000111026035789999998751 Q gi|254780768|r 23 LKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLV 102 (281) Q Consensus 23 ~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~ 102 (281) ++.-++.||++... ++++.+.+.-+- -+-...|-..|++=|.+=+---+|| .. +.......+-...+ T Consensus 28 k~~v~vPIIGi~K~---~~~~~~VyITPt---~~ev~~l~~aGadiIA~DaT~R~RP--~~----~~~lv~~i~~~~~l- 94 (192) T pfam04131 28 RAIVDLPIIGIVKR---DLPDSPVRITPT---MKDIDELANAGADIIALDGTDRPRP--VD----IESFIKRIKEKGQL- 94 (192) T ss_pred HHHCCCCEEEEEEC---CCCCCCCEECCC---HHHHHHHHHCCCCEEEEECCCCCCC--CC----HHHHHHHHHHHCCE- T ss_conf 98589987999856---789998165588---9999999985999999846789897--58----99999999981998- Q ss_pred CCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEE Q ss_conf 04803589999999997697482112225242035433455465323478899899999973326722599981974889 Q gi|254780768|r 103 SGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVA 182 (281) Q Consensus 103 ~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~Via 182 (281) -.-|=+-+..- ..=++.|+.+++.+-. |.-...+++. -| +++++.+... +.-+= T Consensus 95 ~MAD~st~eea-~~A~~~G~D~I~TTL~--------GYT~~t~~~~---pD----~~ll~~l~~~----------~~pvI 148 (192) T pfam04131 95 AMADCSTFEEG-LNAHKLGVDIVGTTLS--------GYTGGSNPAE---PD----FQLVKTLSEA----------GCFVI 148 (192) T ss_pred EEEECCCHHHH-HHHHHCCCCEEECCCC--------CCCCCCCCCC---CC----HHHHHHHHHC----------CCCEE T ss_conf 89974999999-9999859999982325--------5789999999---97----8999999868----------99399 Q ss_pred EECCCCHHHHHHHHHHH Q ss_conf 62534217999999975 Q gi|254780768|r 183 LEGIEGTDSMLQRIVDC 199 (281) Q Consensus 183 iEa~eGTD~mi~R~~~~ 199 (281) .|+---|-++.+++.++ T Consensus 149 aEGri~tPe~a~~a~~~ 165 (192) T pfam04131 149 AEGRYNTPELAKKAIEI 165 (192) T ss_pred EECCCCCHHHHHHHHHC T ss_conf 85798999999999983 No 125 >PRK05301 pyrroloquinoline quinone biosynthesis protein PqqE; Provisional Probab=32.70 E-value=37 Score=14.79 Aligned_cols=79 Identities=14% Similarity=0.306 Sum_probs=47.1 Q ss_pred EEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHH---HHHHCCCCHHHHHHHHHHHHHCCCEE Q ss_conf 873899999999998649540677024232100011102603578999999---87510480358999999999769748 Q gi|254780768|r 48 ELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMI---WQLVSGGNAAILKASIDLLESYGVSV 124 (281) Q Consensus 48 ~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l---~~~~~~gDd~iL~~i~~~fe~~G~~v 124 (281) .++..+.-++++.++..++..|+|.|+ + |.+. +|+.-.....+-. ..+..+|- -|-...++.|.+.|+.. T Consensus 46 ELs~~e~~~~id~l~~~Gv~~v~~tGG-E--Pllr---~D~~ei~~~a~~~G~~~~l~TNG~-lit~~~a~~L~~~gl~~ 118 (375) T PRK05301 46 ELSTAEWIRVLREARALGVLQLHFSGG-E--PLLR---KDLEELVAHARRLGLYTNLITSGV-GLTEARLAALKAAGLDH 118 (375) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEECCC-C--CCCC---CCHHHHHHHHHHCCCEEEEEECCC-CCCHHHHHHHHHHCCCE T ss_conf 899999999999999869988996186-5--2456---689999999997697589960674-55799999998509988 Q ss_pred E--CHHHCCHH Q ss_conf 2--11222524 Q gi|254780768|r 125 V--GAHEIVPE 133 (281) Q Consensus 125 i--~~~~~l~~ 133 (281) + |.+..-|+ T Consensus 119 v~vSlDg~~~e 129 (375) T PRK05301 119 IQLSFQDSDPE 129 (375) T ss_pred EEEECCCCCHH T ss_conf 99956779877 No 126 >COG0569 TrkA K+ transport systems, NAD-binding component [Inorganic ion transport and metabolism] Probab=32.67 E-value=37 Score=14.79 Aligned_cols=69 Identities=16% Similarity=0.156 Sum_probs=44.4 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCC-C-CCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 9997798038999999997899789999548667310-3-885678738999999999986495406770242 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDW-Q-DFECRELPLGDFCVLRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~-~-~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V 75 (281) -+|.|.|.+=..+|+.+...|++++++--..+....+ . .+.+..+--- +.-.+.|++.++.+.=++=.. T Consensus 3 iiIiG~G~vG~~va~~L~~~g~~Vv~Id~d~~~~~~~~~~~~~~~~v~gd--~t~~~~L~~agi~~aD~vva~ 73 (225) T COG0569 3 IIIIGAGRVGRSVARELSEEGHNVVLIDRDEERVEEFLADELDTHVVIGD--ATDEDVLEEAGIDDADAVVAA 73 (225) T ss_pred EEEECCCHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHCCCEEEEEEC--CCCHHHHHHCCCCCCCEEEEE T ss_conf 99989857889999999878990899976889999863200044999926--889899986798638999998 No 127 >PRK09856 fructoselysine 3-epimerase; Provisional Probab=32.08 E-value=37 Score=14.72 Aligned_cols=169 Identities=13% Similarity=0.137 Sum_probs=77.5 Q ss_pred EHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHH Q ss_conf 38999999999986495406770242321000111026035789999998751048035899999999976974821122 Q gi|254780768|r 50 PLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHE 129 (281) Q Consensus 50 ~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~ 129 (281) .+..+.+.++.-..-++..++....-. .+ ..+ + ...-+. + - .-|+.+.++=++.|+++. T Consensus 88 ~i~~~k~~id~A~~Lga~~v~v~~~~~-~~-~~~------~-~~~w~~---~----~-e~l~~l~~~A~~~Gv~l~---- 146 (276) T PRK09856 88 SLDMIKLAMDMAKEMNAGYTLISAAHA-GY-LTP------P-NVIWGR---L----A-ENLSELCEYAENIGMDLI---- 146 (276) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCC-CC-CCC------H-HHHHHH---H----H-HHHHHHHHHHHHCCCEEE---- T ss_conf 999999999999984998499936877-78-889------7-999999---9----9-999999999997399899---- Q ss_pred CCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCC Q ss_conf 25242035433455465323478899899999973326722599981974889625342179999999754312345667 Q gi|254780768|r 130 IVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAGK 209 (281) Q Consensus 130 ~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~ 209 (281) +..+..-+..+.. ..+-+..++..++.-.++=.+=+. -..+|+ |--.+++++.++...+ T Consensus 147 -lE~l~~~e~~li~---------~~~~~~~~~~~v~~~~~~~~lD~~---h~~~~~-~~~~~~~~~~g~~l~H------- 205 (276) T PRK09856 147 -LEPLTPYESNVVC---------NANDVLHALALVPSPRLFSMVDIC---APYVQA-EPVMSYFDKLGDKLRH------- 205 (276) T ss_pred -EEECCCCCCCHHC---------CHHHHHHHHHHCCCCCEEEEEECH---HHHHCC-CCHHHHHHHHCCCEEE------- T ss_conf -9517611142006---------899999999857998648998541---375538-9999999985887469------- Q ss_pred CEEEEEECCCCCCCEEEECCCC----HHHHHHHHHCCCEEE-EEECCCEEEECHHHHHH Q ss_conf 7189993488875312106627----999999998499099-99739779985899999 Q gi|254780768|r 210 SGVLVKMCKSQQDMRADLPSIG----AKTVQNVIKAGLAGI-ALEAGKSLVLEKELVKK 263 (281) Q Consensus 210 ~~ilvK~~K~~QD~r~DlP~IG----~~Ti~~~~~ag~~gi-aiea~~~lild~~~~i~ 263 (281) |=+.=..-..|.+. +|--| .+.++.+.+.|++|- .+|-..-..-|+++..+ T Consensus 206 --vHi~D~~g~~d~hl-~pG~G~~d~~~il~~L~~~gY~G~vsvEl~~~y~~~P~~~a~ 261 (276) T PRK09856 206 --LHIVDSDGASDTHY-IPGEGKMPLRELMRDIIDRGYEGYCTVELVTMYMNEPRLYAR 261 (276) T ss_pred --EEEECCCCCCCCCC-CCCCCCCCHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHHH T ss_conf --98767999866773-799987788999999998599815999972576579999999 No 128 >COG1060 ThiH Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes [Coenzyme metabolism / General function prediction only] Probab=31.83 E-value=38 Score=14.70 Aligned_cols=180 Identities=11% Similarity=0.041 Sum_probs=88.3 Q ss_pred CEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHH-HHCCCHHHHHHHHH-------------HHHHHHCCCCHHHH Q ss_conf 56787389999999999864954067702423210001-11026035789999-------------99875104803589 Q gi|254780768|r 45 ECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQ-DLCFSIKDSLRISK-------------MIWQLVSGGNAAIL 110 (281) Q Consensus 45 ~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~-~l~~D~~~~~~l~k-------------~l~~~~~~gDd~iL 110 (281) +-++++..++.+.+..+.+.||++++|.|+.+ |.+. ..-.+. .+..+ .=+.-..+...-.. T Consensus 86 ~~y~Ls~eeI~~~~~~~~~~G~~Evli~gG~~--p~~~~~y~~~~---~~~ik~~~p~~~i~a~s~~ei~~~~~~~~~s~ 160 (370) T COG1060 86 KAYTLSPEEILEEVREAVKRGITEVLIVGGEH--PELSLEYYEEL---FRTIKEEFPDLHIHALSAGEILFLAREGGLSY 160 (370) T ss_pred CCCCCCHHHHHHHHHHHHHCCCEEEEEECCCC--CCCCHHHHHHH---HHHHHHHCCCHHHCCCCHHHHHHHHHCCCCCH T ss_conf 55316999999999999875986999805768--77436799999---99998857303430167888679874368889 Q ss_pred HHHHHHHHHCCCEEEC--HHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCC Q ss_conf 9999999976974821--12225242035433455465323478899899999973326722599981974889625342 Q gi|254780768|r 111 KASIDLLESYGVSVVG--AHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEG 188 (281) Q Consensus 111 ~~i~~~fe~~G~~vi~--~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eG 188 (281) +-+...|.+.|.-.++ +.+++.+-.- ..+..++-+...+. ...+.|-++|--.++- +.+..+|+.|= T Consensus 161 ~E~l~~Lk~aGldsmpg~~aeil~e~vr--~~~~p~K~~~~~wl---e~~~~Ah~lGI~~tat------ml~Gh~E~~ed 229 (370) T COG1060 161 EEVLKRLKEAGLDSMPGGGAEILSEEVR--KIHCPPKKSPEEWL---EIHERAHRLGIPTTAT------MLLGHVETRED 229 (370) T ss_pred HHHHHHHHHCCCCCCCCCCEEECHHHHH--HHHCCCCCCHHHHH---HHHHHHHHCCCCCCCE------EEEEECCCHHH T ss_conf 9999999976987674754114167799--86379889999999---9999999769984203------47873288899 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEE-ECCCCC-CCEEEECCCCHHHHHHHHHCC Q ss_conf 179999999754312345667718999-348887-531210662799999999849 Q gi|254780768|r 189 TDSMLQRIVDCRNNGRILAGKSGVLVK-MCKSQQ-DMRADLPSIGAKTVQNVIKAG 242 (281) Q Consensus 189 TD~mi~R~~~~~~~~~~~~~~~~ilvK-~~K~~Q-D~r~DlP~IG~~Ti~~~~~ag 242 (281) -=.-+.+...+.+.-+.. -.-|+-. .|+++. -+...-|.-+.+.++..+-|. T Consensus 230 ~~~hl~~ir~lQ~~~gg~--~~fI~~~f~p~~~~~~~~~~~~~~~~~~l~~iAiaR 283 (370) T COG1060 230 RIDHLEHIRDLQDETGGF--QEFIPLRFRPENGPLPAEVVPEASLEQDLKAIALAR 283 (370) T ss_pred HHHHHHHHHHHHHHHCCC--EEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 999999999999985895--799805545788876666789899899999999999 No 129 >COG0237 CoaE Dephospho-CoA kinase [Coenzyme metabolism] Probab=31.24 E-value=38 Score=14.69 Aligned_cols=31 Identities=23% Similarity=-0.009 Sum_probs=22.9 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEE Q ss_conf 9739999779803899999999789978999 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIA 31 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii 31 (281) |..|||-.|-|.==-.+++.+...|++++.. T Consensus 2 ~~iIglTG~igsGKStva~~~~~~G~~vida 32 (201) T COG0237 2 MLIIGLTGGIGSGKSTVAKILAELGFPVIDA 32 (201) T ss_pred CEEEEEECCCCCCHHHHHHHHHHCCCEEEEC T ss_conf 6499995788778899999999779939988 No 130 >TIGR03035 trp_arylform arylformamidase. One of several pathways of tryptophan degradation is as follows: tryptophan 2,3-dioxygenase (1.13.11.11) uses 02 to convert Trp to L-formylkynurenine. Arylformamidase (3.5.1.9) hydrolyzes the product to L-kynurenine and formate. Kynureninase (3.7.1.3) hydrolyzes L-kynurenine to anthranilate plus alanine. Members of the seed alignment for this model are bacterial predicted metal-dependent hydrolases. All are supported as arylformamidase (3.5.1.9) by an operon structure in which kynureninase and/or tryptophan 2,3-dioxygenase genes are adjacent. The members from Bacillus cereus, Pseudomonas aeruginosa and Ralstonia metallidurans were characterized. An example from Pseudomonas fluorescens is given the gene symbol qbsH instead of kynB because of its role in quinolobactin biosynthesis, which begins with tryptophan. All members of this family should be arylformamidase (3.5.1.9). Probab=30.51 E-value=40 Score=14.55 Aligned_cols=63 Identities=16% Similarity=0.246 Sum_probs=38.1 Q ss_pred CEEEEEECCCCCCC--EEEECCCCHHHHHHHHHCCCEEEEEECCCEEEEC-HH-HHHHHHHHCCCEE Q ss_conf 71899934888753--1210662799999999849909999739779985-89-9999999879689 Q gi|254780768|r 210 SGVLVKMCKSQQDM--RADLPSIGAKTVQNVIKAGLAGIALEAGKSLVLE-KE-LVKKHADEAGIFV 272 (281) Q Consensus 210 ~~ilvK~~K~~QD~--r~DlP~IG~~Ti~~~~~ag~~giaiea~~~lild-~~-~~i~~a~~~~i~i 272 (281) ..+++++...++.. .-|.|.+.++..+.+++.|+++|.+.+=.+=-.+ ++ .+=+..-++++.+ T Consensus 102 ~~vli~T~~~~~~~~~~~~~P~l~~eaa~~L~e~~i~~vGiD~~S~D~~~~~~~~~H~~ll~~gi~i 168 (206) T TIGR03035 102 PRVLLRTYLPAPANAWPDDFPAVAPDTIELLAEKGVRLIGIDTPSVDPLDSKTLDAHHALFRHGMAI 168 (206) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHHCCEEE T ss_conf 7899976886871107767861599999999978985999747867876677629999998799689 No 131 >PRK08591 acetyl-CoA carboxylase biotin carboxylase subunit; Validated Probab=30.48 E-value=40 Score=14.55 Aligned_cols=69 Identities=12% Similarity=0.054 Sum_probs=46.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEE---------EHHHHHHHHHHHHHCCCCEE Q ss_conf 9739999779803899999999789978999954866731038856787---------38999999999986495406 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECREL---------PLGDFCVLRSILHQYNIGRI 69 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~---------~ig~ig~li~~Lk~~~i~~i 69 (281) |=|==|||-.|....-|++.|+..|++.+.+.=....+...-......+ +.-++..++...++.+|+-| T Consensus 1 M~~kvLIANRGEIA~RiiRt~~elgi~tVavys~~D~~a~hv~~ADeav~ig~~~~~~sYln~~~Ii~~A~~~g~dAi 78 (449) T PRK08591 1 MFDKILIANRGEIALRILRACKELGIKTVAVHSTADRDALHVQLADEAVCIGPAPSKKSYLNIPAIISAAEITGADAI 78 (449) T ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCCEEEECCHHHCCCCHHHHCCEEEEECCCCCCCCCCCHHHHHHHHHHHCCCEE T ss_conf 964488967849999999999984994999868575278528859888995898843330489999999998299989 No 132 >TIGR01026 fliI_yscN ATPase FliI/YscN family; InterPro: IPR005714 Proteins in this entry show extensive homology to the ATP synthase F1 beta subunit, and are involved in type III protein secretion. They fall into the two separate functional groups outlined below. The first group, exemplified by the Salmonella typhimurium FliI protein (P26465 from SWISSPROT), is needed for flagellar assembly. Most structural components of the bacterial flagellum are translocated through the central channel of the growing flagellar structure by the type III flagellar protein-export apparatus in an ATPase-driven manner, to be assembled at the growing end. FliI is the ATPase that couples ATP hydrolysis to the translocation reaction , . The second group couples ATP hydrolysis to protein translocation in non-flagellar type III secretion systems. Often these systems are involved in virulence and pathogenicity. YscN (P40290 from SWISSPROT) from pathogenic Yersinia species, for example, energises the injection of antihost factors directly into eukaryotic cells, thus overcoming host defences .; GO: 0016887 ATPase activity, 0009058 biosynthetic process, 0015031 protein transport, 0005737 cytoplasm. Probab=30.16 E-value=40 Score=14.51 Aligned_cols=89 Identities=21% Similarity=0.320 Sum_probs=53.7 Q ss_pred CEEEEEECCCC----HHHHHHH-HHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 73999977980----3899999-999789978999954866731038856787389999999999864954067702423 Q gi|254780768|r 2 KRLLIIAGSGM----LPYYVAK-AARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAID 76 (281) Q Consensus 2 ~kigIIAG~G~----LP~~ia~-~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~ 76 (281) -||||.||+|. |=-.||+ +. .-++.||+|.||==.+.++| +-+.|..+|-++-|.+=.=+ T Consensus 174 QR~GIFAGSGVGKStLlGMIARn~~---~ADv~ViALIGERGREV~EF------------IE~~LG~EGLkrSV~VVaTS 238 (455) T TIGR01026 174 QRIGIFAGSGVGKSTLLGMIARNNT---EADVNVIALIGERGREVKEF------------IEKDLGEEGLKRSVVVVATS 238 (455) T ss_pred CCEEEEECCCCCHHHHHHHHHHCCC---CCCEEEEEEECCCCCCHHHH------------HHHHCCCCCCCEEEEEEECC T ss_conf 5013550376003445667630067---89827998643777741578------------86313656660117998368 Q ss_pred CCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEE Q ss_conf 2100011102603578999999875104803589999999997697482 Q gi|254780768|r 77 RRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVV 125 (281) Q Consensus 77 krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi 125 (281) |..| +.|+ +|.-. =-+|++||-++|-+|+ T Consensus 239 ----------D~SP---l~R~------~GAy~-At~iAEYFrdqGk~Vl 267 (455) T TIGR01026 239 ----------DQSP---LLRL------KGAYV-ATAIAEYFRDQGKDVL 267 (455) T ss_pred ----------CCCH---HHHH------HHHHE-EHHHHHHHHHCCCEEE T ss_conf ----------8638---8887------32640-0254354652187056 No 133 >cd04955 GT1_like_6 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=30.08 E-value=40 Score=14.50 Aligned_cols=34 Identities=15% Similarity=0.073 Sum_probs=27.7 Q ss_pred EEEEE---------ECCCCHHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 39999---------77980389999999978997899995486 Q gi|254780768|r 3 RLLII---------AGSGMLPYYVAKAARLKNDEPVIASVLNE 36 (281) Q Consensus 3 kigII---------AG~G~LP~~ia~~~~~~g~~~~ii~l~~~ 36 (281) ||||| .|.+..-..+++.+.++|+++.++..... T Consensus 1 ~ia~i~~~~~Pp~~GG~e~~v~~La~~L~~~Gh~V~v~t~~~~ 43 (363) T cd04955 1 KIAIIGTRGIPAKYGGFETFVEELAPRLVARGHEVTVYCRSPY 43 (363) T ss_pred CEEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 9899916848999987899999999999977997999987898 No 134 >cd04729 NanE N-acetylmannosamine-6-phosphate epimerase (NanE) converts N-acetylmannosamine-6-phosphate to N-acetylglucosamine-6-phosphate. This reaction is part of the pathway that allows the usage of sialic acid as a carbohydrate source. Sialic acids are a family of related sugars that are found as a component of glycoproteins, gangliosides, and other sialoglycoconjugates. Probab=30.00 E-value=40 Score=14.50 Aligned_cols=138 Identities=14% Similarity=0.121 Sum_probs=64.3 Q ss_pred CCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHH-- Q ss_conf 899789999548667310388567873-89999999999864954067702423210001110260357899999987-- Q gi|254780768|r 24 KNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQ-- 100 (281) Q Consensus 24 ~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~-- 100 (281) +-.++.||++... ++++.+++.-+ + +-...|-..|++=|.+=+--.+||.-. +...+.+-+-. T Consensus 57 ~~v~lPIIGi~K~---~~~~s~VyITPt~----~ev~~l~~aGadiIA~DaT~R~RP~g~-------~l~~~i~~i~~~~ 122 (219) T cd04729 57 ARVDLPIIGLIKR---DYPDSEVYITPTI----EEVDALAAAGADIIALDATDRPRPDGE-------TLAELIKRIHEEY 122 (219) T ss_pred HCCCCCEEEEEEC---CCCCCCEEECCCH----HHHHHHHHCCCCEEEEECCCCCCCCCC-------CHHHHHHHHHHHH T ss_conf 3289988999956---8899984566889----999999985999999946788798997-------8999999999986 Q ss_pred -HHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCE Q ss_conf -5104803589999999997697482112225242035433455465323478899899999973326722599981974 Q gi|254780768|r 101 -LVSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGR 179 (281) Q Consensus 101 -~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ 179 (281) ..-.-|=+-+..- ..=.+-|+.+++.+-. . ||...+. .+.-|.++=.++.+.++ -. T Consensus 123 ~~l~MAD~st~ee~-~~A~~~G~D~vgTTL~--G-------YT~~t~~-~~~PD~~lv~~l~~~~~------------~p 179 (219) T cd04729 123 NCLLMADISTLEEA-LNAAKLGFDIIGTTLS--G-------YTEETAK-TEDPDFELLKELRKALG------------IP 179 (219) T ss_pred CCEEEEECCCHHHH-HHHHHCCCCEEECCCC--C-------CCCCCCC-CCCCCHHHHHHHHHHCC------------CC T ss_conf 97788754889999-9999849989970214--5-------6778788-99987899999999759------------93 Q ss_pred EEEEECCCCHHHHHHHHHHH Q ss_conf 88962534217999999975 Q gi|254780768|r 180 VVALEGIEGTDSMLQRIVDC 199 (281) Q Consensus 180 ViaiEa~eGTD~mi~R~~~~ 199 (281) +=.|+---|-++.+++.++ T Consensus 180 -vIaEGri~tPe~a~~a~~~ 198 (219) T cd04729 180 -VIAEGRINSPEQAAKALEL 198 (219) T ss_pred -EEEECCCCCHHHHHHHHHC T ss_conf -9970698999999999983 No 135 >PRK10217 dTDP-glucose 4,6-dehydratase; Provisional Probab=29.99 E-value=40 Score=14.49 Aligned_cols=75 Identities=15% Similarity=0.249 Sum_probs=50.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCC-CCC---------CCCCCEEEEEHHHHHHHHHHHHHCCCCEEE Q ss_conf 9739999779803899999999789978999954866-731---------038856787389999999999864954067 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNEC-SFD---------WQDFECRELPLGDFCVLRSILHQYNIGRIV 70 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~-~~~---------~~~~~~~~~~ig~ig~li~~Lk~~~i~~iv 70 (281) |+|+-|..|.|-+=..+++.+..++.+.+++. .... ... .+++.....++.+...+-+.++..+.+.|+ T Consensus 1 MKkILVTGg~GFIGs~Lv~~Ll~~~~~~v~~~-d~~~~~~~~~~~~~~~~~~~~~~~~~Di~d~~~l~~~~~~~~pD~Vi 79 (355) T PRK10217 1 MRKILITGGAGFIGSALVRYIINETSDAVVVV-DKLTYAGNLMSLAPVAQSERFAFEKVDICDRAELARVFTEYQPDCVM 79 (355) T ss_pred CCEEEECCCCCHHHHHHHHHHHHCCCCEEEEE-ECCCCCCCHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHCCCCEEE T ss_conf 99699937875799999999997699889998-28987652544445412787169980058899999999861998899 Q ss_pred EECCCC Q ss_conf 702423 Q gi|254780768|r 71 VAGAID 76 (281) Q Consensus 71 maG~V~ 76 (281) -+.... T Consensus 80 HlAa~~ 85 (355) T PRK10217 80 HLAAES 85 (355) T ss_pred EECCCC T ss_conf 942422 No 136 >PRK12824 acetoacetyl-CoA reductase; Provisional Probab=29.90 E-value=41 Score=14.48 Aligned_cols=167 Identities=9% Similarity=0.058 Sum_probs=90.6 Q ss_pred CCEEEEEECCC-CHHHHHHHHHHHCCCCEEEEEECCCCCC-----CCC--CCCEEEE-----EHHHHHHHHHHHHHC--C Q ss_conf 97399997798-0389999999978997899995486673-----103--8856787-----389999999999864--9 Q gi|254780768|r 1 MKRLLIIAGSG-MLPYYVAKAARLKNDEPVIASVLNECSF-----DWQ--DFECREL-----PLGDFCVLRSILHQY--N 65 (281) Q Consensus 1 M~kigIIAG~G-~LP~~ia~~~~~~g~~~~ii~l~~~~~~-----~~~--~~~~~~~-----~ig~ig~li~~Lk~~--~ 65 (281) |+|++||.|++ -+=..+++.+.++|+++++.....+... ... ...+..+ +-.++.++++...++ . T Consensus 1 M~KvalITGas~GIG~a~a~~la~~G~~Vv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~d~~~~~~~v~~~~~~~g~ 80 (245) T PRK12824 1 MKKIALVTGAKRGIGSAIARELLADGYRVIATYFGNYDAAKDWFEEYGFTEDQVRLKSLDVTDTEECQEALARIEEEEGP 80 (245) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHCCC T ss_conf 98599994788889999999999879989999588077899999987404993899991389999999999999997499 Q ss_pred CCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC-CEEECHHHCCHHHCCCCCCCCCC Q ss_conf 54067702423210001110260357899999987510480358999999999769-74821122252420354334554 Q gi|254780768|r 66 IGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYG-VSVVGAHEIVPELLVQVGSLGTC 144 (281) Q Consensus 66 i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G-~~vi~~~~~l~~ll~~~G~l~~~ 144 (281) ++-+|-.-.+..+..+.++-.+ ...+.+ ...-++--.+.+.++..+.+++ =+++.........-. ++... . T Consensus 81 iDiLVnnAG~~~~~~~~~~~~e-----~w~~~~-~vNl~~~f~~~~~~~~~m~~~~~G~IVnisS~~~~~~~-~~~~~-Y 152 (245) T PRK12824 81 VDILVNNAGITRDSGFKRMSHQ-----EWNDVI-NTNLNSVFNVTQPLFPAMCEQGYGRIINISSVNGLKGQ-FGQTN-Y 152 (245) T ss_pred CCEEEECCCCCCCCCCHHCCHH-----HHHHHH-HHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHCCCC-CCCHH-H T ss_conf 9899989888999990239999-----999999-99734159999999999998399559997467757788-99689-9 Q ss_pred CCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEEC Q ss_conf 653234788998999999733267225999819 Q gi|254780768|r 145 VPNRDVKRDILAAMKSAEALSELDVGQSAVSIG 177 (281) Q Consensus 145 ~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~ 177 (281) ..+|. .=+.+-..+++++++..|-=-+|.-+ T Consensus 153 ~asKa--al~~ltk~lA~E~a~~gIrvN~I~PG 183 (245) T PRK12824 153 SAAKA--GMIGFTKALASEGARYGITVNCIAPG 183 (245) T ss_pred HHHHH--HHHHHHHHHHHHHHHHCEEEEEEEEC T ss_conf 99999--99999999999972549199999744 No 137 >PRK13143 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=29.85 E-value=41 Score=14.48 Aligned_cols=58 Identities=22% Similarity=0.104 Sum_probs=37.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHH Q ss_conf 97399997798038999999997899789999548667310388567873-899999999998 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILH 62 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk 62 (281) |++|+||==...--..++++++..|.++.++.- ..++.+++...++ .|.++...+.|+ T Consensus 1 Mk~I~Iid~G~~n~~si~~~l~~lg~~~~i~~~----~~~l~~~d~iILPGvG~~~~~~~~l~ 59 (201) T PRK13143 1 MKMIVIIDYGLGNLRSVSKGLERAGADVEITSD----PEEILSADGIVLPGVGAFGDVMENLG 59 (201) T ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCEEEECCC----HHHHHHCCEEEECCCCCHHHHHHHHH T ss_conf 988999988971999999999986985999289----99984379679738881889998765 No 138 >PRK10799 putative hydrolase-oxidase; Provisional Probab=29.77 E-value=41 Score=14.47 Aligned_cols=16 Identities=19% Similarity=0.125 Sum_probs=7.0 Q ss_pred HHHHHHHHCCCCEEEE Q ss_conf 9999999789978999 Q gi|254780768|r 16 YVAKAARLKNDEPVIA 31 (281) Q Consensus 16 ~ia~~~~~~g~~~~ii 31 (281) .+.+++...|.+.++. T Consensus 47 ~vi~eAi~~~aDliit 62 (247) T PRK10799 47 ALLDEAVRLQADAVIV 62 (247) T ss_pred HHHHHHHHCCCCEEEE T ss_conf 9999999859999998 No 139 >cd00431 cysteine_hydrolases Cysteine hydrolases; This family contains amidohydrolases, like CSHase (N-carbamoylsarcosine amidohydrolase), involved in creatine metabolism and nicotinamidase, converting nicotinamide to nicotinic acid and ammonia in the pyridine nucleotide cycle. It also contains isochorismatase, an enzyme that catalyzes the conversion of isochorismate to 2,3-dihydroxybenzoate and pyruvate, via the hydrolysis of the vinyl ether bond, and other related enzymes with unknown function. Probab=29.73 E-value=41 Score=14.47 Aligned_cols=19 Identities=32% Similarity=0.538 Sum_probs=7.8 Q ss_pred HHHHHHHCCCCEEEEECCC Q ss_conf 9999986495406770242 Q gi|254780768|r 57 LRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 57 li~~Lk~~~i~~ivmaG~V 75 (281) +-..|++++|++++++|-. T Consensus 101 L~~~L~~~~i~~lii~G~~ 119 (161) T cd00431 101 LDELLRERGIDTLVVCGIA 119 (161) T ss_pred HHHHHHHCCCCCEEEEEHH T ss_conf 8999997399867996444 No 140 >TIGR03471 HpnJ hopanoid biosynthesis associated radical SAM protein HpnJ. One of the well-described hopanoid intermediates is bacteriohopanetetrol. In the conversion from hopene several reactions must occur in the side chain for which a radical mechanism might be reasonable. These include the four (presumably anaerobic) hydroxylations and a methyl shift. Probab=29.56 E-value=41 Score=14.45 Aligned_cols=53 Identities=25% Similarity=0.391 Sum_probs=36.3 Q ss_pred CEEEECCCCHHHHHHHHHCCCEEEE--EECCCEEEEC----------HHHHHHHHHHCCCEE-----EEECHH Q ss_conf 3121066279999999984990999--9739779985----------899999999879689-----995775 Q gi|254780768|r 223 MRADLPSIGAKTVQNVIKAGLAGIA--LEAGKSLVLE----------KELVKKHADEAGIFV-----CGIDRE 278 (281) Q Consensus 223 ~r~DlP~IG~~Ti~~~~~ag~~gia--iea~~~lild----------~~~~i~~a~~~~i~i-----~g~~~~ 278 (281) .|+|+ -.++++.|++|||..|. +|+|.--+|+ -.++++.|.++||.+ +|+..| T Consensus 281 ~Rv~~---d~E~l~~mk~AGc~~v~~GiESgsq~iL~~i~K~~t~e~~~~av~~~k~~GI~v~~~FIiG~PgE 350 (472) T TIGR03471 281 ARANV---DYETLKVMKENGLRLLLVGYESGDQQILKNIKKGLTVEIARRFTRDCHKLGIKVHGTFILGLPGE 350 (472) T ss_pred EECCC---CHHHHHHHHHHCCEEEEEECCCCCHHHHHHHCCCCCHHHHHHHHHHHHHCCCEEEEEEEECCCCC T ss_conf 30348---99999999983984899803758999999853899899999999988757987999998779999 No 141 >PRK13525 glutamine amidotransferase subunit PdxT; Provisional Probab=29.28 E-value=42 Score=14.42 Aligned_cols=61 Identities=18% Similarity=0.219 Sum_probs=38.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCC Q ss_conf 973999977980389999999978997899995486673103885678738999999999986495 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNI 66 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i 66 (281) |-|||+++=.|++- .+.+++++.|.++.++.=. .++.+.+...+|=|..+.+.+.|++.+. T Consensus 1 m~~IGvl~~qGn~r-s~~~aL~~lG~~~~~v~~~----~di~~ad~lILPGG~s~am~~ll~~~gl 61 (191) T PRK13525 1 MMKIGVLALQGAVR-EHIAALEALGAEAVEVRRP----EDLDEIDGLILPGGESTTMGKLLRDFGL 61 (191) T ss_pred CCEEEEEECCCCHH-HHHHHHHHCCCCEEEECCH----HHHHHCCEEEECCCCHHHHHHHHHHCCC T ss_conf 93799996478899-9999999879978998999----9995399899789765899999998686 No 142 >PRK05472 redox-sensing transcriptional repressor Rex; Provisional Probab=29.26 E-value=42 Score=14.41 Aligned_cols=108 Identities=13% Similarity=0.182 Sum_probs=49.5 Q ss_pred CCHHHHHHHH--HHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEE Q ss_conf 8035899999--99997697482112225242035433455465323478899899999973326722599981974889 Q gi|254780768|r 105 GNAAILKASI--DLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVA 182 (281) Q Consensus 105 gDd~iL~~i~--~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~Via 182 (281) |-.++=++++ ..|+++||+++.+=+.-|+..=.. + .+... ....+++ +..++. ..++| ||+ T Consensus 91 GaGnLG~AL~~y~gf~~~gf~Iva~FD~dp~kiG~~-i-~gi~V--~~i~~L~---~~i~~~-~i~ia---------Iia 153 (211) T PRK05472 91 GAGNLGRALLNYNGFKKRGFKIVAAFDVDPEKVGTK-I-GGIPV--YHIDELE---EVIKEN-DIEIA---------ILT 153 (211) T ss_pred CCCHHHHHHHHCCCHHHCCCEEEEEECCCHHHCCCE-E-CCEEE--ECHHHHH---HHHHHH-CCCEE---------EEE T ss_conf 887799999848762318978999974897885988-3-88387--3499999---999981-99389---------995 Q ss_pred E---ECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEEECCCCHHHHH Q ss_conf 6---25342179999999754312345667718999348887531210662799999 Q gi|254780768|r 183 L---EGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRADLPSIGAKTVQ 236 (281) Q Consensus 183 i---Ea~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~DlP~IG~~Ti~ 236 (281) + .|-|=+|.|++- | .+.-. .-.++-.+.|+--+=..+|+ +.+++|+. T Consensus 154 VP~~~AQ~vad~Lv~a-G-Ik~Il----NFap~~L~vP~~V~V~nvDl-~~~L~~L~ 203 (211) T PRK05472 154 VPAEAAQEVADRLVEA-G-IKGIL----NFAPVRLNVPEDVIVQNVDL-TVELQTLS 203 (211) T ss_pred CCHHHHHHHHHHHHHH-C-CEEEE----ECCCCCCCCCCCCEEEECCH-HHHHHHHH T ss_conf 5768899999999981-9-83999----76872447999978997370-98999998 No 143 >pfam02887 PK_C Pyruvate kinase, alpha/beta domain. As well as being found in pyruvate kinase this family is found as an isolated domain in some bacterial proteins. Probab=29.00 E-value=42 Score=14.38 Aligned_cols=39 Identities=26% Similarity=0.338 Sum_probs=31.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHH Q ss_conf 47889989999997332672259998197488962534217 Q gi|254780768|r 150 VKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTD 190 (281) Q Consensus 150 ~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD 190 (281) ....+..+.+.+++-+-+.-|+.+||-.|.-.+.++ +|| T Consensus 73 ~~~~i~~a~~~l~~~g~~~~GD~vVvv~G~p~~~~g--~TN 111 (117) T pfam02887 73 TDEIIAEALRVAKDAGLIKKGDLVVVTAGVPVGTSG--GTN 111 (117) T ss_pred HHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCC--CCE T ss_conf 999999999999986999999989998463599887--645 No 144 >COG2210 Peroxiredoxin family protein [General function prediction only] Probab=28.68 E-value=43 Score=14.35 Aligned_cols=31 Identities=32% Similarity=0.262 Sum_probs=25.5 Q ss_pred CEEEEEECCCCHHH-----HHHHHHHHCCCCEEEEE Q ss_conf 73999977980389-----99999997899789999 Q gi|254780768|r 2 KRLLIIAGSGMLPY-----YVAKAARLKNDEPVIAS 32 (281) Q Consensus 2 ~kigIIAG~G~LP~-----~ia~~~~~~g~~~~ii~ 32 (281) +|++||..+|.+-+ .++..+.+.|+++.+.. T Consensus 3 ~k~~IIl~SG~~dk~~~a~iias~A~A~G~EV~VF~ 38 (137) T COG2210 3 KKLGIILASGTLDKAYAALIIASGAAAMGYEVTVFF 38 (137) T ss_pred CEEEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 457999957978899999999999997697589998 No 145 >PRK00635 excinuclease ABC subunit A; Provisional Probab=28.56 E-value=43 Score=14.33 Aligned_cols=36 Identities=19% Similarity=0.230 Sum_probs=28.1 Q ss_pred EEEECCCCH---------HHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHH Q ss_conf 121066279---------9999999849909999739779985899999999 Q gi|254780768|r 224 RADLPSIGA---------KTVQNVIKAGLAGIALEAGKSLVLEKELVKKHAD 266 (281) Q Consensus 224 r~DlP~IG~---------~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~ 266 (281) =+|=|+||+ +|++.+.+.|=+.|+||.+ +++|+.|| T Consensus 1412 VLDEPSIGLHprD~~rLi~~Lk~Lrd~GNTVIVVEHD-------~~~I~~AD 1456 (1809) T PRK00635 1412 LLEDPLSGLHPQDAPTLVQLLKELVANNNTVIATDRS-------GSLIPHAD 1456 (1809) T ss_pred EECCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECC-------HHHHHHCC T ss_conf 9478866778545999999999998669918999568-------89997488 No 146 >cd02037 MRP-like MRP (Multiple Resistance and pH adaptation) is a homologue of the Fer4_NifH superfamily. Like the other members of the superfamily, MRP contains a ATP-binding domain at the N-termini. It is found in bacteria as a membrane-spanning protein and functions as a Na+/H+ antiporter. Probab=28.52 E-value=43 Score=14.33 Aligned_cols=30 Identities=13% Similarity=0.169 Sum_probs=20.5 Q ss_pred EEEEECCC-----CHHHHHHHHHHHCCCCEEEEEE Q ss_conf 99997798-----0389999999978997899995 Q gi|254780768|r 4 LLIIAGSG-----MLPYYVAKAARLKNDEPVIASV 33 (281) Q Consensus 4 igIIAG~G-----~LP~~ia~~~~~~g~~~~ii~l 33 (281) +++.+|.| .+...+|.++.++|+++.++.. T Consensus 2 i~v~s~kggvgkst~~~~la~~l~~~g~~v~~~d~ 36 (169) T cd02037 2 IAVMSGKGGVGKSTVAVNLALALAKLGYKVGLLDA 36 (169) T ss_pred EEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 89974999881999999999999987997899971 No 147 >TIGR02053 MerA mercuric reductase; InterPro: IPR011796 This entry represents the mercuric reductase found in the mer operon for the detoxification of mercury compounds. MerA is a FAD-containing flavoprotein which reduces Hg(II) to Hg(0) utilizing NADPH .; GO: 0016152 mercury (II) reductase activity, 0045340 mercury ion binding, 0050660 FAD binding, 0050661 NADP binding, 0006118 electron transport, 0050787 detoxification of mercury ion. Probab=28.24 E-value=43 Score=14.30 Aligned_cols=37 Identities=19% Similarity=0.223 Sum_probs=22.7 Q ss_pred HHCCCCHHHHHHHHHHHHH-CCCEEECHH-HCCHHHCCC Q ss_conf 5104803589999999997-697482112-225242035 Q gi|254780768|r 101 LVSGGNAAILKASIDLLES-YGVSVVGAH-EIVPELLVQ 137 (281) Q Consensus 101 ~~~~gDd~iL~~i~~~fe~-~G~~vi~~~-~~l~~ll~~ 137 (281) +..+-|-.+=..|-+.|++ +|++++... .+-.----+ T Consensus 216 ll~~~epeis~~V~~~l~~eeGi~~~~~~r~~~~v~~rn 254 (494) T TIGR02053 216 LLPREEPEISAAVEEALAEEEGIEVVTSARQVKAVSVRN 254 (494) T ss_pred HCCCCCHHHHHHHHHHHCCCCCEEEEECCEEEEEEEECC T ss_conf 446468889999999841478779980440355444527 No 148 >TIGR01284 alt_nitrog_alph nitrogenase alpha chain; InterPro: IPR005974 The enzyme responsible for nitrogen fixation, the nitrogenase, shows a high degree of conservation of structure, function, and amino acid sequence across wide phylogenetic ranges. All known Mo-nitrogenases consist of two components, component I (also called dinitrogenase, or Fe-Mo protein), an alpha2beta2 tetramer encoded by the nifD and nifK genes, and component II (dinitrogenase reductase, or Fe protein) a homodimer encoded by the nifH gene. Two operons, nifDK and nifEN, encode a tetrameric (alpha2beta2 and N2E2) enzymatic complex. Nitrogenase contains two unusual rare metal clusters; one of them is the iron molybdenum cofactor (FeMo-co), which is considered to be the site of dinitrogen reduction and whose biosynthesis requires the products of nifNE and of some other nif genes. It has been proposed that NifNE might serve as a scaffold upon which FeMo-co is built and then inserted into component I. This model represents the alpha chains of various forms of the nitrogen-fixing enzyme nitrogenase: vanadium-iron, iron-iron, and molybdenum-iron. Most examples of NifD, the molybdenum-iron type nitrogenase alpha chain, are excluded from this model and described instead by equivalog model IPR005972 from INTERPRO.; GO: 0016163 nitrogenase activity, 0051536 iron-sulfur cluster binding, 0009399 nitrogen fixation. Probab=28.23 E-value=43 Score=14.30 Aligned_cols=77 Identities=12% Similarity=0.079 Sum_probs=50.0 Q ss_pred HCC-CCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHH Q ss_conf 789-9789999548667310388567873899999999998649540677024232100011102603578999999875 Q gi|254780768|r 23 LKN-DEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQL 101 (281) Q Consensus 23 ~~g-~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~ 101 (281) ..+ .+++++.-.|.+-+.-+ --|+.++++=+.. .+|.++ +|.-++- ..-++..- T Consensus 165 e~p~vDvf~~n~PGFaGpsQS-kGH~v~Ni~win~--------------kVgT~E-peiT~ey---------~~N~iGey 219 (468) T TIGR01284 165 EIPDVDVFAVNAPGFAGPSQS-KGHHVLNIAWIND--------------KVGTVE-PEITTEY---------DVNLIGEY 219 (468) T ss_pred HCCCCCEEEEECCCCCCCCCC-CCHHHHHHHHHHH--------------HCCCCC-CCCCCCC---------CEEEEECC T ss_conf 279942899817789898757-7601223453442--------------112348-8768750---------06555333 Q ss_pred HCCCCHHHHHHHHHHHHHCCCEEECH Q ss_conf 10480358999999999769748211 Q gi|254780768|r 102 VSGGNAAILKASIDLLESYGVSVVGA 127 (281) Q Consensus 102 ~~~gDd~iL~~i~~~fe~~G~~vi~~ 127 (281) .-.||-.+|. +||++-||+|++. T Consensus 220 NIqGD~~vl~---~Yf~rmGI~Vlst 242 (468) T TIGR01284 220 NIQGDLEVLK---KYFERMGIQVLST 242 (468) T ss_pred CCHHHHHHHH---HHHHHCCCEEEEE T ss_conf 3400089999---9998639458876 No 149 >pfam01041 DegT_DnrJ_EryC1 DegT/DnrJ/EryC1/StrS aminotransferase family. The members of this family are probably all pyridoxal-phosphate-dependent aminotransferase enzymes with a variety of molecular functions. The family includes StsA, StsC, and StsS. The aminotransferase activity was demonstrated for purified StsC protein as the L-glutamine:scyllo-inosose aminotransferase EC:2.6.1.50, which catalyses the first amino transfer in the biosynthesis of the streptidine subunit of streptomycin. Probab=28.07 E-value=44 Score=14.28 Aligned_cols=15 Identities=27% Similarity=0.313 Sum_probs=8.5 Q ss_pred CHHHHHHHHHHCCCE Q ss_conf 589999999987968 Q gi|254780768|r 257 EKELVKKHADEAGIF 271 (281) Q Consensus 257 d~~~~i~~a~~~~i~ 271 (281) +|++++++..++||. T Consensus 291 ~R~~l~~~L~~~gI~ 305 (363) T pfam01041 291 SRDELVEALKEAGIG 305 (363) T ss_pred HHHHHHHHHHHCCCC T ss_conf 399999999987995 No 150 >COG3473 Maleate cis-trans isomerase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=27.95 E-value=44 Score=14.26 Aligned_cols=24 Identities=21% Similarity=0.427 Sum_probs=17.9 Q ss_pred HHHHHHHHHHHHHCCCEEECHHHC Q ss_conf 358999999999769748211222 Q gi|254780768|r 107 AAILKASIDLLESYGVSVVGAHEI 130 (281) Q Consensus 107 d~iL~~i~~~fe~~G~~vi~~~~~ 130 (281) +.+=+-.++|||++||+|++..-+ T Consensus 129 ~evn~~e~ef~~~~Gfeiv~~~~L 152 (238) T COG3473 129 DEVNQREIEFLEANGFEIVDFKGL 152 (238) T ss_pred HHHHHHHHHHHHHCCEEEEEEECC T ss_conf 544158999998489279975415 No 151 >TIGR02363 dhaK1 dihydroxyacetone kinase, DhaK subunit; InterPro: IPR012736 In bacteria, dihydroxyacetone is formed by the oxidation of glycerol or the aldol cleavage of fructose-6-phosphate. Dihydroxyacetone kinase converts this compound to the glycolytic intermediate dihydroxyacetone phosphate. Two forms of this enzyme have been shown to exist, using either ATP or a phosphoprotein of the phosphoenolpyruvate: sugar phosphotransferase system (PTS) as phosphate donor. The ATP-dependent kinases (2.7.1.29 from EC) are single-subunit enzymes that are composed of two domains . The phosphoprotein-dependent kinases are composed of three subunits: DhaK and DhaL, which are homologous to the domains from the ATP-dependent enzyme; and DhaM which is a component of the PTS system . This entry represents the DhaK subunit of phosphoprotein-dependent dihydroxyacetone kinase. The model excludes the DhaK paralog DhaK2 (IPR012735 from INTERPRO) encoded in the same operon as DhaK in the firmicutes, with which it sometimes exists as a DhaK/DhaK2 fusion. This subunit is a homodimer which contains the dihydroxyacetone-binding site . The overall fold consists of two six-stranded mixed beta-sheets surrounded by nine alpha-helices and a beta-ribbon covering the exposed edge strand of one sheet. . Probab=27.94 E-value=31 Score=15.24 Aligned_cols=52 Identities=12% Similarity=0.087 Sum_probs=25.4 Q ss_pred HHHHHHHHCCCC--EEEEECCCCC-CCCHHHHCCCHHHHHHHHHHHHHHHCCCCH Q ss_conf 999999864954--0677024232-100011102603578999999875104803 Q gi|254780768|r 56 VLRSILHQYNIG--RIVVAGAIDR-RPNVQDLCFSIKDSLRISKMIWQLVSGGNA 107 (281) Q Consensus 56 ~li~~Lk~~~i~--~ivmaG~V~k-rP~~~~l~~D~~~~~~l~k~l~~~~~~gDd 107 (281) -.-+.....+|+ +||-===|.- --.|.-=|.=.-+....-|++..+.++|++ T Consensus 117 MA~ElAE~eGI~V~~VvVdDDiAVeDS~YT~GRRGVAGTvfVhKI~GAaAE~G~~ 171 (354) T TIGR02363 117 MAAELAEAEGIKVEQVVVDDDIAVEDSLYTAGRRGVAGTVFVHKIAGAAAEKGAS 171 (354) T ss_pred HHHHHHHCCCCEEEEEEEECCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 8999862279766579980643322485044550420578889999997406888 No 152 >KOG4471 consensus Probab=27.84 E-value=44 Score=14.25 Aligned_cols=97 Identities=18% Similarity=0.227 Sum_probs=59.6 Q ss_pred HHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCC------CCCEEEEEECCCCCCCEEEECCC Q ss_conf 999999733267225999819748896253421799999997543123456------67718999348887531210662 Q gi|254780768|r 157 AMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILA------GKSGVLVKMCKSQQDMRADLPSI 230 (281) Q Consensus 157 g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~------~~~~ilvK~~K~~QD~r~DlP~I 230 (281) .|.|-+--+.++.=||-= ..+.|- .-=||+++.|++.++...+... +.|+|+.. ---|-+ T Consensus 185 ~WRIs~iNs~Y~LC~SYP----~~l~VP-~~isD~eL~~VasFRsr~RlPvlsw~Hp~sgAvIaR---------cSQPlV 250 (717) T KOG4471 185 SWRISKINSNYKLCDSYP----AKLVVP-KSISDEELLRVASFRSRCRLPVLSWRHPESGAVIAR---------CSQPLV 250 (717) T ss_pred HEEEECCCCCCCCCCCCC----CCEEEC-CCCCHHHHHHHHHHHHCCCCCEEEEECCCCCCEEEE---------CCCCCC T ss_conf 245301455631215676----105740-424889999775044137661588875788865875---------288643 Q ss_pred CHH---------HHHHHHHCCCEEEEEECCCEEEEC-HHHHHHHHHHCC Q ss_conf 799---------999999849909999739779985-899999999879 Q gi|254780768|r 231 GAK---------TVQNVIKAGLAGIALEAGKSLVLE-KELVKKHADEAG 269 (281) Q Consensus 231 G~~---------Ti~~~~~ag~~giaiea~~~lild-~~~~i~~a~~~~ 269 (281) |.- -+...++|...- .|..+-+|+| ++-+-+.||+++ T Consensus 251 G~~g~Rn~~DEkll~~i~~a~A~~--~e~~KL~I~DARp~~nAvANkAk 297 (717) T KOG4471 251 GWSGKRNKDDEKLLQAIADANAQD--GERRKLLIVDARPYTNAVANKAK 297 (717) T ss_pred CHHCCCCCCHHHHHHHHHHHCCCC--CCCCEEEEEECCCCHHHHHCCCC T ss_conf 111145655699999999865445--66532899856620466640566 No 153 >TIGR02545 ATP_syn_fliI flagellar protein export ATPase FliI; InterPro: IPR013379 FliI proteins are involved in bacterial flagellum systems, acting to drive protein export for flagellar biosynthesis. The most closely related proteins are the YscN proteins of bacterial type III secretion systems.; GO: 0016887 ATPase activity, 0001539 ciliary or flagellar motility, 0009296 flagellum biogenesis, 0009288 flagellin-based flagellum. Probab=27.76 E-value=44 Score=14.24 Aligned_cols=73 Identities=19% Similarity=0.333 Sum_probs=46.6 Q ss_pred CEEEEEECCCC----HHHHHHHHHHHCCCCEEEEEECCCCCCCCCCC------------CEEEE------EHHHH----- Q ss_conf 73999977980----38999999997899789999548667310388------------56787------38999----- Q gi|254780768|r 2 KRLLIIAGSGM----LPYYVAKAARLKNDEPVIASVLNECSFDWQDF------------ECREL------PLGDF----- 54 (281) Q Consensus 2 ~kigIIAG~G~----LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~------------~~~~~------~ig~i----- 54 (281) .||||.||+|. |=-.+|++- .-+++||+|.||=-.+.++| .+.-. ++-+. T Consensus 159 QR~GIFAGSGVGKSTLLGMiAr~t---~ADV~VIALIGERGREV~EFiE~~LG~eGl~kSVVVVATSD~spl~R~~aA~~ 235 (439) T TIGR02545 159 QRLGIFAGSGVGKSTLLGMIARYT---EADVNVIALIGERGREVKEFIEDDLGEEGLKKSVVVVATSDESPLMRIRAAYA 235 (439) T ss_pred CEEEEECCCCHHHHHHHHHHHCCC---CCCEEEEEEECCCCCCCHHHHHHHCCCCCCCCEEEEEECCCCCHHHHHHHHHH T ss_conf 102663377444788988875066---58878998444656643135543035110254079982799868999888899 Q ss_pred -HHHHHHHHHCCCCEEEEECCCCC Q ss_conf -99999998649540677024232 Q gi|254780768|r 55 -CVLRSILHQYNIGRIVVAGAIDR 77 (281) Q Consensus 55 -g~li~~Lk~~~i~~ivmaG~V~k 77 (281) =.+=+.|+.+|.+=+.|.=.|+| T Consensus 236 A~~iAEYFRDqGk~VLL~~DSlTR 259 (439) T TIGR02545 236 ATAIAEYFRDQGKDVLLLMDSLTR 259 (439) T ss_pred HHHHHHHHHHCCCCEEEEECHHHH T ss_conf 999999998649834776211788 No 154 >pfam02670 DXP_reductoisom 1-deoxy-D-xylulose 5-phosphate reductoisomerase. This is a family of 1-deoxy-D-xylulose 5-phosphate reductoisomerases. This enzyme catalyses the formation of 2-C-methyl-D-erythritol 4-phosphate from 1-deoxy-D-xylulose-5-phosphate in the presence of NADPH. This reaction is part of the terpenoid biosynthesis pathway. Probab=27.69 E-value=44 Score=14.23 Aligned_cols=55 Identities=13% Similarity=0.171 Sum_probs=24.5 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 9997798038999999997899789999548667310388567873899999999998649540677024 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGA 74 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~ 74 (281) +|+.-.|+.=..-.+-+++...++-+.++....+ ..++.+..++.+.+.+++.-. T Consensus 2 ~IlGsTGSIG~~tL~Vi~~~~~~f~v~~Lsa~~N---------------~~~L~~q~~~f~p~~v~i~~~ 56 (129) T pfam02670 2 TILGSTGSIGTQTLDVIRRNPDRFEVVALSAGRN---------------VELLAEQIKEFKPKYVAVADE 56 (129) T ss_pred EEECCCCHHHHHHHHHHHHCCCCEEEEEEEECCC---------------HHHHHHHHHHCCCCEEEECCH T ss_conf 8976786889999999995956718999983478---------------999999999739979999589 No 155 >TIGR02079 THD1 threonine dehydratase; InterPro: IPR011820 This entry represents threonine dehydratase, the first step in the pathway converting threonine into isoleucine. At least two other clades of biosynthetic threonine dehydratases are described by IPR005787 from INTERPRO and IPR005789 from INTERPRO. The sequences described by this entry are exclusively found in species containing the rest of the isoleucine pathway and which are generally lacking in members of the other two clades of threonine dehydratases. The sequences associated with this entry are often cluster with other elements of the isoleucine pathway.; GO: 0004794 L-threonine ammonia-lyase activity, 0030170 pyridoxal phosphate binding, 0009097 isoleucine biosynthetic process. Probab=27.69 E-value=25 Score=15.88 Aligned_cols=176 Identities=19% Similarity=0.253 Sum_probs=97.8 Q ss_pred EEEEECCCCHHHHHHHHHHHC-CCCEEEEEECCC-CCCC----CC----CCCEEE-EEHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 999977980389999999978-997899995486-6731----03----885678-738999999999986495406770 Q gi|254780768|r 4 LLIIAGSGMLPYYVAKAARLK-NDEPVIASVLNE-CSFD----WQ----DFECRE-LPLGDFCVLRSILHQYNIGRIVVA 72 (281) Q Consensus 4 igIIAG~G~LP~~ia~~~~~~-g~~~~ii~l~~~-~~~~----~~----~~~~~~-~~ig~ig~li~~Lk~~~i~~ivma 72 (281) ..||+|+|.+-.+|.+.+... ..+.+++++-|. ...- ++ .-++.- -|=|.. .+-..|++ .++|=. T Consensus 149 ~~IIeGQGTva~Eil~ql~~~~k~D~V~~pVGGGGLisG~~~yl~~~s~~TkiiGvEP~GAP-sm~~sL~~---gevvTL 224 (415) T TIGR02079 149 PDIIEGQGTVAVEILEQLEEETKIDYVVVPVGGGGLISGVTSYLKEKSPKTKIIGVEPEGAP-SMKASLEA---GEVVTL 224 (415) T ss_pred HHHHHCHHHHHHHHHHHCCCCCCCCEEEECCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHH-HHHHHHHC---CCCEEE T ss_conf 56651102765667750586767768998477405788899987311786417874688648-89999964---896741 Q ss_pred CCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHC---CCCCCCCCCCCCHH Q ss_conf 242321000111026035789999998751048035899999999976974821122252420---35433455465323 Q gi|254780768|r 73 GAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVPELL---VQVGSLGTCVPNRD 149 (281) Q Consensus 73 G~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll---~~~G~l~~~~p~~~ 149 (281) -+|++ | .|= .+.+.+..| .-++++ ....+--+..++-..+|.-+| -.+|++..+- . T Consensus 225 ~kiD~---F----VDG----AAV~rvG~L----nF~~~k---~~~d~v~~~~v~EGaVC~tiLdlYn~EgiVaEPA-G-- 283 (415) T TIGR02079 225 DKIDK---F----VDG----AAVKRVGDL----NFKALK---KVVDEVEVTLVPEGAVCSTILDLYNLEGIVAEPA-G-- 283 (415) T ss_pred CCCCC---C----CCH----HHHHHHHHH----HHHHHH---HHHCCEEEEECCCCCHHHHHHHHHHCCCEEEECC-H-- T ss_conf 36785---3----041----789878668----899998---7650147586379852466788541682698151-3-- Q ss_pred HHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCC-CCEEEEEEC Q ss_conf 47889989999997332672259998197488962534217999999975431234566-771899934 Q gi|254780768|r 150 VKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAG-KSGVLVKMC 217 (281) Q Consensus 150 ~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~-~~~ilvK~~ 217 (281) +..+..|..+++--+ .+..|++| +-|-|+=|.|..+.+++.-...| +.=-+|+.| T Consensus 284 -----ALSiAAL~~~~r~ei------~gKTVVC~--vSGGNNDi~R~~Ei~eRsl~y~GLKhYFiv~FP 339 (415) T TIGR02079 284 -----ALSIAALEELSREEI------KGKTVVCV--VSGGNNDIERTEEIRERSLLYEGLKHYFIVRFP 339 (415) T ss_pred -----HHHHHHHHHHCHHCC------CCCEEEEE--EECCCCCCCCCHHHHHHHHHHCCCCEEEEECCC T ss_conf -----667999875122004------69727999--527757533136789888873177036875278 No 156 >PRK13010 purU formyltetrahydrofolate deformylase; Reviewed Probab=27.14 E-value=45 Score=14.17 Aligned_cols=77 Identities=12% Similarity=0.054 Sum_probs=39.8 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCC-CHHH--HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHH Q ss_conf 9999999998649540677024232100011102-6035--789999998751048035899999999976974821122 Q gi|254780768|r 53 DFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCF-SIKD--SLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHE 129 (281) Q Consensus 53 ~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~-D~~~--~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~ 129 (281) +-.++++.+++++++-+|+||++ + - +++-.. .+.+ .-.=+.+|- ..+|-...=. -+ +.|.+++|++- T Consensus 157 ~e~~~~~~~~~~~~d~vvla~ym-~-i-l~~~~~~~~~~~iinih~s~lP--~f~G~~~~~~----a~-~~gvk~~G~T~ 226 (289) T PRK13010 157 QEAQILDLIETSGAELVVLARYM-Q-V-LSDDLSRKLSGRAINIHHSFLP--GFKGARPYHQ----AH-ARGVKLIGATA 226 (289) T ss_pred HHHHHHHHHHHCCCCEEEEECHH-H-H-CCHHHHHHCCCCEEEECHHHCC--CCCCCCHHHH----HH-HCCCCEEECCE T ss_conf 89999999871398899870546-6-7-6989997578512663521225--7899877999----99-76997882763 Q ss_pred CCHHHCCCCC Q ss_conf 2524203543 Q gi|254780768|r 130 IVPELLVQVG 139 (281) Q Consensus 130 ~l~~ll~~~G 139 (281) ..=.--+.+| T Consensus 227 H~v~~~lD~G 236 (289) T PRK13010 227 HFVTDDLDEG 236 (289) T ss_pred EEECCCCCCC T ss_conf 7827888888 No 157 >COG1082 IolE Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism] Probab=26.91 E-value=46 Score=14.14 Aligned_cols=20 Identities=20% Similarity=0.297 Sum_probs=15.1 Q ss_pred HHHHHHHHCCCEE-EEEECCC Q ss_conf 9999999849909-9997397 Q gi|254780768|r 233 KTVQNVIKAGLAG-IALEAGK 252 (281) Q Consensus 233 ~Ti~~~~~ag~~g-iaiea~~ 252 (281) .-++.+.++|++| +.+|.-. T Consensus 233 ~i~~~L~~~gy~g~~~iE~~~ 253 (274) T COG1082 233 AIFSALREAGYDGWLVVEVFA 253 (274) T ss_pred HHHHHHHHCCCCEEEEEEECC T ss_conf 999999967996269998537 No 158 >PRK10787 DNA-binding ATP-dependent protease La; Provisional Probab=26.86 E-value=46 Score=14.14 Aligned_cols=59 Identities=14% Similarity=0.023 Sum_probs=42.2 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEEEECCCCCC-CCCCCCEEEEEHH-HHHHHHHHHHHCCCCEEEEE Q ss_conf 7980389999999978997899995486673-1038856787389-99999999986495406770 Q gi|254780768|r 9 GSGMLPYYVAKAARLKNDEPVIASVLNECSF-DWQDFECRELPLG-DFCVLRSILHQYNIGRIVVA 72 (281) Q Consensus 9 G~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~-~~~~~~~~~~~ig-~ig~li~~Lk~~~i~~ivma 72 (281) |..+|-.-||+++ |.+++-+++.|-.|. +...+... -+| ..|+++..|++.++.+=||. T Consensus 361 GKTSl~~sIA~al---~r~f~rislGGv~DeaeirGHrrT--YvgampGrii~~l~~a~~~nPv~l 421 (784) T PRK10787 361 GKTSLGQSIAKAT---GRKYVRMALGGVRDEAEIRGHRRT--YIGSMPGKLIQKMAKVGVKNPLFL 421 (784) T ss_pred CHHHHHHHHHHHH---CCCEEEEECCCCCCHHHHCCCCCC--CCCCCCHHHHHHHHHHCCCCCEEE T ss_conf 7246999999985---898699806887888882564334--344368389999997489885665 No 159 >PRK13146 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional Probab=26.77 E-value=46 Score=14.13 Aligned_cols=63 Identities=22% Similarity=0.165 Sum_probs=41.9 Q ss_pred CCEEEEE-ECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEE-HHHHHHHHHHHHHCCC Q ss_conf 9739999-7798038999999997899789999548667310388567873-8999999999986495 Q gi|254780768|r 1 MKRLLII-AGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELP-LGDFCVLRSILHQYNI 66 (281) Q Consensus 1 M~kigII-AG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~-ig~ig~li~~Lk~~~i 66 (281) |.||+|| .|.|++- -+.++++.-+...-++...+ ..++++.+...+| +|.++..++.|++.+. T Consensus 1 M~kI~IiDyg~gNi~-Sv~~al~~~~~~~~i~i~~~--~~~i~~~d~lIlPGVGsf~~~m~~L~~~~~ 65 (208) T PRK13146 1 MMSVAIIDYGSGNLR-SAARALERAAPGADVTVTAD--PDAVRAADRLVLPGVGAFADCMRGLRAVGG 65 (208) T ss_pred CCEEEEEECCCCHHH-HHHHHHHHHCCCCEEEEECC--HHHHHHCCEEEECCCCCHHHHHHHHHHCCC T ss_conf 988999937955899-99999998579973999699--999842887898388977999998764076 No 160 >COG4597 BatB ABC-type amino acid transport system, permease component [Amino acid transport and metabolism] Probab=26.51 E-value=21 Score=16.39 Aligned_cols=44 Identities=18% Similarity=0.340 Sum_probs=26.8 Q ss_pred EEEEEECCCCCCCEEEECCCCHHHHHH------HHHCCCEEEEEECCCEE Q ss_conf 189993488875312106627999999------99849909999739779 Q gi|254780768|r 211 GVLVKMCKSQQDMRADLPSIGAKTVQN------VIKAGLAGIALEAGKSL 254 (281) Q Consensus 211 ~ilvK~~K~~QD~r~DlP~IG~~Ti~~------~~~ag~~giaiea~~~l 254 (281) +.-.+..---|-+|+=.|-.--+-+.. +++.|+--++.-+|-++ T Consensus 308 ~~t~RlVivPQAlRiIIPPLTSQYLNLtKNSSLAiAIGYpDlv~vagT~l 357 (397) T COG4597 308 SLTLRLVIVPQALRIIIPPLTSQYLNLTKNSSLAIAIGYPDLVAVAGTVL 357 (397) T ss_pred CCEEEEEEECCCCEEECCCCCHHHHHHCCCCHHHHCCCCCCEEEEECEEE T ss_conf 54479998411015641762188873323542320027776688733022 No 161 >PRK06027 purU formyltetrahydrofolate deformylase; Reviewed Probab=26.35 E-value=47 Score=14.08 Aligned_cols=33 Identities=15% Similarity=0.299 Sum_probs=25.2 Q ss_pred CCCEEEEEH------HHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 885678738------999999999986495406770242 Q gi|254780768|r 43 DFECRELPL------GDFCVLRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 43 ~~~~~~~~i------g~ig~li~~Lk~~~i~~ivmaG~V 75 (281) +.+++++++ .+-.++++.+++++++-+|+||++ T Consensus 136 ~IPF~~iPv~~~~K~e~E~~i~~~~~~~~~d~ivla~ym 174 (285) T PRK06027 136 GIPFHHVPVTKETKAEAEAQLLELIDEYQPDLVVLARYM 174 (285) T ss_pred CCCEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEHHHH T ss_conf 998288268766537799999999873497199763368 No 162 >TIGR02312 HpaH 2-oxo-hepta-3-ene-1,7-dioic acid hydratase; InterPro: IPR012690 This entry represents the enzyme which hydrates the double bond of 2-oxo-hepta-3-ene-1,7-dioic acid to form 4-hydroxy-2-oxo-heptane-1,7-dioic acid in the catabolism of 4-hydroxyphenylacetic acid. The gene for this enzyme is generally found adjacent to other genes of this pathway.. Probab=26.34 E-value=28 Score=15.55 Aligned_cols=16 Identities=38% Similarity=0.746 Sum_probs=12.6 Q ss_pred CCCEEEEECCCCCCCCH Q ss_conf 95406770242321000 Q gi|254780768|r 65 NIGRIVVAGAIDRRPNV 81 (281) Q Consensus 65 ~i~~ivmaG~V~krP~~ 81 (281) .-.+|||+|.++ ||-- T Consensus 234 eaGqvvLAGSFT-RPV~ 249 (271) T TIGR02312 234 EAGQVVLAGSFT-RPVE 249 (271) T ss_pred CCCCEEECCCCC-CCCC T ss_conf 788578656654-6421 No 163 >cd06167 LabA_like LabA_like proteins. A well conserved group of bacterial proteins with no defined function. LabA, a member from Synechococcus elongatus PCC 7942, has been shown to play a role in cyanobacterial circadian timing. It is required for negative feedback regulation of the autokinase/autophosphatase KaiC, a central component of the circadian clock system. In particular, LabA seems necessary for KaiC-dependent repression of gene expression. Probab=26.26 E-value=47 Score=14.07 Aligned_cols=55 Identities=13% Similarity=0.126 Sum_probs=32.7 Q ss_pred HHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECC Q ss_conf 99999999769748211222524203543345546532347889989999997332672259998197 Q gi|254780768|r 111 KASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGG 178 (281) Q Consensus 111 ~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g 178 (281) +...+.+.+.||+++....... ...+.-.|+..+..++.....-.+...|+|.+- T Consensus 55 ~~~~~~L~~~g~~~~~~~~~~~-------------~~~kk~~Dv~la~D~~~~a~~~~~D~~vLvSgD 109 (149) T cd06167 55 RGFLDALRRLGFEPIQKPLRTR-------------GSGKKGVDVALAIDALELAYKRRIDTIVLVSGD 109 (149) T ss_pred HHHHHHHHHCCCEEEEEEEEEC-------------CCCCCCCHHHHHHHHHHHHHCCCCCEEEEEECC T ss_conf 8999999975984799611304-------------776664209999999999613899989999577 No 164 >COG0473 LeuB Isocitrate/isopropylmalate dehydrogenase [Amino acid transport and metabolism] Probab=26.15 E-value=47 Score=14.05 Aligned_cols=164 Identities=20% Similarity=0.148 Sum_probs=79.0 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHH-----HH-----HHHHHHHHHCCCCEEE Q ss_conf 9739999779803899999999789978999954866731038856787389-----99-----9999999864954067 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLG-----DF-----CVLRSILHQYNIGRIV 70 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig-----~i-----g~li~~Lk~~~i~~iv 70 (281) |.+|++|.|.|-=|..++++.+--. .-.+ ..-+++...+.+| +- ...++.+++ .+.+ T Consensus 3 ~~~IavipGDGIGpEv~~~a~kVl~---------a~~~-~~~~~e~~~~~~G~~~~~~~G~~lpeetl~~~~~---~Dai 69 (348) T COG0473 3 TYTIAVIPGDGIGPEVMAAALKVLE---------AAAE-FGLDFEFEEAEVGGEAYDKHGEPLPEETLESLKK---ADAI 69 (348) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHH---------HHHH-CCCCEEEEEEHHHHHHHHHCCCCCCHHHHHHHHH---CCEE T ss_conf 5589995789877889999999999---------8663-3774389995005789997099889999999975---8979 Q ss_pred EECCCCCCCCHHHHCCCHH-HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCC--CCCC--- Q ss_conf 7024232100011102603-578999999875104803589999999997697482112225242035433--4554--- Q gi|254780768|r 71 VAGAIDRRPNVQDLCFSIK-DSLRISKMIWQLVSGGNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGS--LGTC--- 144 (281) Q Consensus 71 maG~V~krP~~~~l~~D~~-~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~--l~~~--- 144 (281) +.|.|. .|.+.......+ ..+.+.|-+-...+-.-...+..+...+. +|+.++=.++....+....+. .... T Consensus 70 L~Gavg-~P~~~~~~~~~~~~ll~lRk~l~lyANlRP~k~~~~~k~~~~-~~~D~viVREnTeG~Y~G~~~~~~~~~eva 147 (348) T COG0473 70 LFGAVG-GPKWDPLPRPERGLLLALRKELDLYANLRPAKSLPGLKSPLV-KGVDIVIVRENTEGLYFGEEGRILGGGEVA 147 (348) T ss_pred EECCCC-CCCCCCCCCCCCCHHHHHHHHCCCEEEEEECCCCCCCCCCCC-CCCCEEEEEECCCCCCCCCCCCCCCCCEEE T ss_conf 983667-887788897630157899986382154003245799987666-895089996388760148876545897279 Q ss_pred ----CCC-HHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEE Q ss_conf ----653-23478899899999973326722599981974889 Q gi|254780768|r 145 ----VPN-RDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVA 182 (281) Q Consensus 145 ----~p~-~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~Via 182 (281) .-| ....+=+.++++.|++-+ -..=..|....|+. T Consensus 148 ~~~~~~Tr~~~eRI~r~AFe~A~~R~---~kkvTsv~KaNVl~ 187 (348) T COG0473 148 IDTKVITRKGSERIARFAFELARKRG---RKKVTSVHKANVLK 187 (348) T ss_pred EEEEEECHHHHHHHHHHHHHHHHHHC---CCCEEEEEHHHHHH T ss_conf 99875008889999999999999607---99468986001056 No 165 >PRK04296 thymidine kinase; Provisional Probab=26.10 E-value=47 Score=14.05 Aligned_cols=32 Identities=22% Similarity=0.246 Sum_probs=20.3 Q ss_pred CCEEEEEEC---CCCHHHHH--HHHHHHCCCCEEEEE Q ss_conf 973999977---98038999--999997899789999 Q gi|254780768|r 1 MKRLLIIAG---SGMLPYYV--AKAARLKNDEPVIAS 32 (281) Q Consensus 1 M~kigIIAG---~G~LP~~i--a~~~~~~g~~~~ii~ 32 (281) |.+|-+|.| +|.=-.++ ++.+..+|.++.++- T Consensus 1 mg~L~~i~GpMfSGKTteLi~~~~~~~~~gkkvl~~k 37 (197) T PRK04296 1 MAKLYFIYGAMNSGKSTELLQRAYNYEERGMKVLVFT 37 (197) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 9559999934278889999999999998799599998 No 166 >PTZ00117 malate dehydrogenase; Provisional Probab=26.07 E-value=47 Score=14.04 Aligned_cols=114 Identities=18% Similarity=0.237 Sum_probs=62.8 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCE-EEEEECCCC-C---CCCCC------CCEEEEEHHHHHHHHHHHHHCCCCEE Q ss_conf 9739999779803899999999789978-999954866-7---31038------85678738999999999986495406 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEP-VIASVLNEC-S---FDWQD------FECRELPLGDFCVLRSILHQYNIGRI 69 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~-~ii~l~~~~-~---~~~~~------~~~~~~~ig~ig~li~~Lk~~~i~~i 69 (281) |+|++|| |.|..=-.+|..+..++..- +++.+.... . .++.+ .+...+.-+. .+.+ .+++=| T Consensus 1 M~KV~II-GaG~VG~~~A~~l~~~~~~eivLiDi~~~~a~g~a~Dl~h~~~~~~~~~~i~~~~d----y~~~--~daDiV 73 (313) T PTZ00117 1 MKKISII-GSGQIGAIVGLLLLQENLGDVYLYDVIEGVPQGKALDLKHFSTIIGVNANILGTNN----YEDI--KDSDVI 73 (313) T ss_pred CCEEEEE-CCCHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCEEEECCC----HHHH--CCCCEE T ss_conf 9789998-97989999999997089987999958898308899887724203689857983799----9996--899999 Q ss_pred EEECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCEE-----ECHHHCCHHHCCC Q ss_conf 7702423210001110260357899999987510480358999999999769748-----2112225242035 Q gi|254780768|r 70 VVAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESYGVSV-----VGAHEIVPELLVQ 137 (281) Q Consensus 70 vmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~~v-----i~~~~~l~~ll~~ 137 (281) |++.++.++|.-+ +.| ++ . ....|++.++..+.+.+++- -.+-+++-..+.. T Consensus 74 VitAG~~rk~g~t--R~d---------Ll-~----~N~~I~~~i~~~i~~~~p~aiiivvtNPvDimt~v~~k 130 (313) T PTZ00117 74 VITAGVQRKEGMT--RED---------LI-G----VNGKIMKSVAESVKKHCPNAFVICVSNPLDIMVNVFKK 130 (313) T ss_pred EECCCCCCCCCCC--HHH---------HH-H----HHHHHHHHHHHHHHCCCCCEEEEECCCCHHHHHHHHHH T ss_conf 9898998997998--899---------99-8----76777888887762358980899789948999999998 No 167 >COG5014 Predicted Fe-S oxidoreductase [General function prediction only] Probab=25.95 E-value=47 Score=14.03 Aligned_cols=24 Identities=17% Similarity=0.029 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 899999999998649540677024 Q gi|254780768|r 51 LGDFCVLRSILHQYNIGRIVVAGA 74 (281) Q Consensus 51 ig~ig~li~~Lk~~~i~~ivmaG~ 74 (281) ..-..++.++-|+.+|+.+-..|. T Consensus 77 ~eVaeRL~ei~K~~g~d~vRiSG~ 100 (228) T COG5014 77 EEVAERLLEISKKRGCDLVRISGA 100 (228) T ss_pred HHHHHHHHHHHHHCCCCEEEEECC T ss_conf 999999999988558868996289 No 168 >pfam04227 Indigoidine_A Indigoidine synthase A like protein. Indigoidine is a blue pigment synthesized by Erwinia chrysanthemi implicated in pathogenicity and protection from oxidative stress. IdgA is involved in indigoidine biosynthesis, but its specific function is unknown. Probab=25.70 E-value=48 Score=14.00 Aligned_cols=157 Identities=15% Similarity=0.161 Sum_probs=73.7 Q ss_pred HHHHHHHHHCCCCEEEEEECCCCC------CCCCC-----CCEEEEEHHHH--------------HHHHHHHHHCCCCEE Q ss_conf 999999997899789999548667------31038-----85678738999--------------999999986495406 Q gi|254780768|r 15 YYVAKAARLKNDEPVIASVLNECS------FDWQD-----FECRELPLGDF--------------CVLRSILHQYNIGRI 69 (281) Q Consensus 15 ~~ia~~~~~~g~~~~ii~l~~~~~------~~~~~-----~~~~~~~ig~i--------------g~li~~Lk~~~i~~i 69 (281) ..+-+..+.+|--|-.+++.+... .+++. ..+.-++-.++ ...+..-+..||+=. T Consensus 35 ~~~e~~vr~~GavPAtiai~~G~~~vGl~~~el~~la~~~~~~~K~S~RDl~~~~a~~~~GaTTVsaTm~iA~~aGI~VF 114 (293) T pfam04227 35 REVEQIVRENGAVPATIAIIDGRIKVGLSDEELERLAQAGKDVAKVSRRDLPYVVATGKTGATTVAATMILAHLAGIKVF 114 (293) T ss_pred HHHHHHHHHCCCCCEEEEEECCEEEECCCHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCEEE T ss_conf 99999999779962168999898774699999999972476654355302799981587652109999999998698089 Q ss_pred EE--ECCCCCCCCHHHHCCCHHHHHHHHHHH-HHHHCCCCHHHH--HHHHHHHHHCCCEEECHHH-CCHHHCCCCCCCCC Q ss_conf 77--024232100011102603578999999-875104803589--9999999976974821122-25242035433455 Q gi|254780768|r 70 VV--AGAIDRRPNVQDLCFSIKDSLRISKMI-WQLVSGGNAAIL--KASIDLLESYGVSVVGAHE-IVPELLVQVGSLGT 143 (281) Q Consensus 70 vm--aG~V~krP~~~~l~~D~~~~~~l~k~l-~~~~~~gDd~iL--~~i~~~fe~~G~~vi~~~~-~l~~ll~~~G~l~~ 143 (281) += +|+|+ |-.- --+|...|+.....- ....--|--+|| ..-.+++|..|+.|++.+. .+|-|+....-+. T Consensus 115 aTGGIGGVH-rg~~--~t~DiSaDL~eL~~tpv~VVcaG~KsILDi~~TlE~LET~GV~V~gy~td~fPaFy~~~Sg~~- 190 (293) T pfam04227 115 ATGGIGGVH-RGAE--ESFDISADLTELARTPVAVVCAGAKSILDIPKTLEYLETQGVPVIGYGTDEFPAFYSRDSGFK- 190 (293) T ss_pred EECCCCCCC-CCCC--CCCCCCCCHHHHHCCCEEEEECCCHHHHCCHHHHHHHHHCCCEEEEECCCCCCCEEECCCCCC- T ss_conf 746745125-7864--563214347888159759994260765054468999997594389745876564230588998- Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECC Q ss_conf 46532347889989999997332672259998197 Q gi|254780768|r 144 CVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGG 178 (281) Q Consensus 144 ~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g 178 (281) ......+.+-..++++.-.++....+++|.|- T Consensus 191 ---~~~~~~~~~eiA~i~~~~~~lgl~~g~LianP 222 (293) T pfam04227 191 ---VPYRLDSAEEIAAIIRARWALGLQGGVLVANP 222 (293) T ss_pred ---CCCCCCCHHHHHHHHHHHHHCCCCCCEEEECC T ss_conf ---86613899999999999997189983599668 No 169 >CHL00175 minD septum-site determining protein; Validated Probab=25.57 E-value=48 Score=13.98 Aligned_cols=152 Identities=14% Similarity=0.142 Sum_probs=69.0 Q ss_pred CCE-EEEEECCCC-----HHHHHHHHHHHCCCCEEEEEEC-CCCCCCC-CCCCE-EEEEHHHHHHHHHHHHH-CCCCEEE Q ss_conf 973-999977980-----3899999999789978999954-8667310-38856-78738999999999986-4954067 Q gi|254780768|r 1 MKR-LLIIAGSGM-----LPYYVAKAARLKNDEPVIASVL-NECSFDW-QDFEC-RELPLGDFCVLRSILHQ-YNIGRIV 70 (281) Q Consensus 1 M~k-igIIAG~G~-----LP~~ia~~~~~~g~~~~ii~l~-~~~~~~~-~~~~~-~~~~ig~ig~li~~Lk~-~~i~~iv 70 (281) |.| |++..|.|- +-.-++-++...|+++.++.+. +..+... -..+. .... +...+.. ..+.+.+ T Consensus 12 m~kiIaV~s~KGGVGKTT~a~NLa~aLA~~G~kVlliD~D~~~~n~~~~lg~~~~~~~~------~~~vl~g~~~l~~~~ 85 (279) T CHL00175 12 MTRIIVITSGKGGVGKTTTTANLGMSIARLGYRVALIDADIGLRNLDLLLGLENRVVYT------AMEVLEGECRLDQAL 85 (279) T ss_pred CCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCCCCCCC------HHHHCCCCCCCCCCE T ss_conf 88699997489984489999999999997899889995789999875326866666674------766407876643013 Q ss_pred EECCCCCCCCHHHHCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC--CCEEECHHHCCHH-----HCCCCCCCCC Q ss_conf 70242321000111026035789999998751048035899999999976--9748211222524-----2035433455 Q gi|254780768|r 71 VAGAIDRRPNVQDLCFSIKDSLRISKMIWQLVSGGNAAILKASIDLLESY--GVSVVGAHEIVPE-----LLVQVGSLGT 143 (281) Q Consensus 71 maG~V~krP~~~~l~~D~~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~--G~~vi~~~~~l~~-----ll~~~G~l~~ 143 (281) +-.. ..+.+.-+-.. .+. .. .. .....+ ..+++.+.+. -|-+++.-.=+.. +.+....+-- T Consensus 86 i~~~--~~~~l~ll~~~-~~~---~~--~~---~~~~~~-~~ll~~l~~~~yDyiiID~ppgl~~~~~~al~aad~viIv 153 (279) T CHL00175 86 IRDK--RWPNLSLLPIS-KNR---QR--YN---VTRKNM-NMLVDSLKERNYDYILIDCPAGIDVGFINAIAPAKEAIVV 153 (279) T ss_pred EECC--CCCCEEEEECC-CHH---HH--HH---HHHHHH-HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCEEEEE T ss_conf 4257--77877999789-705---44--57---419999-9999999727999999818998889999999978906997 Q ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCE Q ss_conf 465323478899899999973326722 Q gi|254780768|r 144 CVPNRDVKRDILAAMKSAEALSELDVG 170 (281) Q Consensus 144 ~~p~~~~~~dI~~g~~i~~~l~~~DiG 170 (281) ..|.....+|.....++++..+..+++ T Consensus 154 ttpe~~al~da~~~i~~~~~~~~~~~~ 180 (279) T CHL00175 154 TTPEITAIRDADRVAGLLEANGIYNIK 180 (279) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCCCCE T ss_conf 899789999999999999975998621 No 170 >PRK11640 putative transcriptional regulator; Provisional Probab=25.43 E-value=46 Score=14.11 Aligned_cols=46 Identities=15% Similarity=0.270 Sum_probs=33.3 Q ss_pred HHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHH Q ss_conf 3589999999997697482112225242035433455465323478 Q gi|254780768|r 107 AAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKR 152 (281) Q Consensus 107 d~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~ 152 (281) +.||.+-.+.|..+||+-+|.+.++.+==+.+--+...=|||++.- T Consensus 4 eriL~~A~~LFy~~Gi~atGid~I~~eAgVaK~TLY~hF~SKd~LI 49 (191) T PRK11640 4 EDVLGEALKLLEQQGIANTTLEMVAERVDYPLDELQRFWPDKEALL 49 (191) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHCCCHHHHH T ss_conf 9999999999997494204799999995998999998779889999 No 171 >cd06353 PBP1_BmpA_Med_like Periplasmic binding domain of the basic membrane lipoprotein Med in Bacillus and its close homologs from other bacteria and Archaea. Periplasmic binding domain of the basic membrane lipoprotein Med in Bacillus and its close homologs from other bacteria and Archaea. Med, a cell-surface localized protein, which regulates the competence transcription factor gene comK in Bacillus subtilis, lacks the DNA binding domain when compared with structures of transcription regulators from the LacI family. Nevertheless, Med has significant overall sequence homology to various periplasmic substrate-binding proteins. Moreover, the structure of Med shows a striking similarity to PnrA, a periplasmic nucleoside binding protein of an ATP-binding cassette transport system. Members of this group contain the type I periplasmic sugar-binding protein-like fold. Probab=25.20 E-value=49 Score=13.94 Aligned_cols=58 Identities=19% Similarity=0.173 Sum_probs=44.9 Q ss_pred EEEEECCCCCCCEEEECCCCHHHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEEECHHHC Q ss_conf 899934888753121066279999999984990999973977998589999999987968999577545 Q gi|254780768|r 212 VLVKMCKSQQDMRADLPSIGAKTVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEAGIFVCGIDREFA 280 (281) Q Consensus 212 ilvK~~K~~QD~r~DlP~IG~~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~g~~~~~~ 280 (281) +.++..- -++=|.-|-+.-+.+...|+..|.-.++. .-+++.|.+.|.+.+|++.||. T Consensus 154 v~~~y~g-----sf~D~~~~~~~a~~l~~~GaDVI~~~ag~------~gv~~aa~e~g~~~IG~d~dq~ 211 (258) T cd06353 154 VKVIWTG-----SWFDPAKEKEAALALIDQGADVIYQHTDS------PGVIQAAEEKGVYAIGYVSDMS 211 (258) T ss_pred EEEEEEC-----CCCCCHHHHHHHHHHHHCCCCEEEECCCC------HHHHHHHHHCCCEEEECCCCHH T ss_conf 4899958-----86782899999999997499889744897------5899999972987995467666 No 172 >pfam02571 CbiJ Precorrin-6x reductase CbiJ/CobK. This family consists of Precorrin-6x reductase EC:1.3.1.54. This enzyme catalyses the reaction: precorrin-6Y + NADP(+) <= precorrin-6X + NADPH. CbiJ and CobK both catalyse the reduction of macocycle in the colbalmin biosynthesis pathway. Probab=25.15 E-value=49 Score=13.93 Aligned_cols=70 Identities=20% Similarity=0.183 Sum_probs=43.8 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCC-CCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 399997798038999999997899789999548667-31038856787389999999999864954067702 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECS-FDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAG 73 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~-~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG 73 (281) ||=|++|.++=. .+++.+...|+-.++....+... +.........=+++....+.+++++++|+-||=|- T Consensus 2 ~IlilgGT~e~r-~la~~L~~~g~~~v~t~~~~~~~~~~~~~~~~~~G~l~~~~~m~~~i~~~~i~~vIDAT 72 (246) T pfam02571 2 RILILGGTTEAR-ALAAALAAAGVVSVVTSLAGRTAAPRLPPLPVRVGGFGGADGLAAYLREEGIDAVIDAT 72 (246) T ss_pred EEEEEEECHHHH-HHHHHHHHCCCEEEEECCCHHHCCCCCCCCCEEECCCCCHHHHHHHHHHCCCCEEEECC T ss_conf 699997368999-99999985698799984755443766788508979989999999999977997999899 No 173 >LOAD_Ccd2 consensus Probab=25.07 E-value=49 Score=13.92 Aligned_cols=63 Identities=14% Similarity=0.050 Sum_probs=41.0 Q ss_pred CHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 0389999999978997899995486673103885678738999999999986495406770242 Q gi|254780768|r 12 MLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAI 75 (281) Q Consensus 12 ~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V 75 (281) ..|..+|..+...|+++.++...+....-.++..........+.++++.+...|++ +.-++.- T Consensus 17 ~~a~~~A~~a~~~g~ev~vf~~~dgv~~~~~~~~~~~~~~~~~~~~~~~~~~~Gv~-i~vC~~~ 79 (114) T LOAD_Ccd2 17 YEAFILALAAAALGYEVEVFFTEDGVYALLKDQKPEVKDVPLLADLLKLAKEYGVK-VYVCGMS 79 (114) T ss_pred HHHHHHHHHHHHCCCCEEEEEEHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHCCCE-EEEEHHH T ss_conf 99999999999689978999863699986166885445665799999999976987-9987999 No 174 >TIGR00763 lon ATP-dependent protease La; InterPro: IPR004815 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This signature defines the bacterial and eukaryotic lon proteases, which are ATP-dependent serine peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SF). This family of sequences does not include the archaeal lon homologs, IPR004663 from INTERPRO. In the eukaryotes the majority of the proteins are located in the mitochondrial matrix , . In yeast, Pim1, is located in the mitochondrial matrix, is required for mitochondrial function, is constitutively expressed but is increased after thermal stress, suggesting that Pim1 may play a role in the heat shock response .; GO: 0004176 ATP-dependent peptidase activity, 0005524 ATP binding, 0006510 ATP-dependent proteolysis. Probab=24.85 E-value=50 Score=13.89 Aligned_cols=64 Identities=19% Similarity=0.099 Sum_probs=46.7 Q ss_pred CCCCHHHHHHHHHHHCCCCEEEEEECCCCC-CCCCCCCEEEEEHH-HHHHHHHHHHHCCCCE-EEEECCCCC Q ss_conf 798038999999997899789999548667-31038856787389-9999999998649540-677024232 Q gi|254780768|r 9 GSGMLPYYVAKAARLKNDEPVIASVLNECS-FDWQDFECRELPLG-DFCVLRSILHQYNIGR-IVVAGAIDR 77 (281) Q Consensus 9 G~G~LP~~ia~~~~~~g~~~~ii~l~~~~~-~~~~~~~~~~~~ig-~ig~li~~Lk~~~i~~-ivmaG~V~k 77 (281) |..+|-.-||+++ |.++|=+++-|-.| ++.+.|-.. -+| -.|++|..||+.+..+ |+|+==|+| T Consensus 462 GKTSlg~SIA~AL---nRkFvR~SlGG~~DeAEIrGHRRT--YvGAMPGriiQ~lk~~~t~NPl~LlDEIDK 528 (941) T TIGR00763 462 GKTSLGKSIAKAL---NRKFVRFSLGGVRDEAEIRGHRRT--YVGAMPGRIIQGLKKAKTKNPLILLDEIDK 528 (941) T ss_pred CHHHHHHHHHHHH---CCEEEEEEECCCEEHHHCCCCCCC--CCCCCHHHHHHHHHHCCCCCCEEEEEEEEE T ss_conf 4222789999996---880499952672203112786432--034672578999876041588068620220 No 175 >pfam10055 DUF2292 Uncharacterized small protein (DUF2292). Members of this family of hypothetical bacterial proteins have no known function. Probab=24.77 E-value=50 Score=13.88 Aligned_cols=30 Identities=30% Similarity=0.425 Sum_probs=23.2 Q ss_pred HHHHHHHCCCCCEE-EEEEECCEEEEEECCC Q ss_conf 99999733267225-9998197488962534 Q gi|254780768|r 158 MKSAEALSELDVGQ-SAVSIGGRVVALEGIE 187 (281) Q Consensus 158 ~~i~~~l~~~DiGQ-svVv~~g~ViaiEa~e 187 (281) .++.+.+..+.-|- +++|++|+|+-+|.-| T Consensus 4 ~~I~~~l~~l~yGsvti~Vhdg~VvQIe~~E 34 (38) T pfam10055 4 EKIRKALEKIKYGSVTITIQDGKVVQIEKTE 34 (38) T ss_pred HHHHHHHHHCCCCEEEEEEECCEEEEEEEHH T ss_conf 9999998417503499999889899998123 No 176 >TIGR03100 hydr1_PEP hydrolase, ortholog 1, exosortase system type 1 associated. This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 2, TIGR03101) of the same superfamily. Probab=24.67 E-value=50 Score=13.87 Aligned_cols=66 Identities=14% Similarity=-0.060 Sum_probs=47.8 Q ss_pred ECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHC--CCCEEEEECC Q ss_conf 779803899999999789978999954866731038856787389999999999864--9540677024 Q gi|254780768|r 8 AGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQY--NIGRIVVAGA 74 (281) Q Consensus 8 AG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~--~i~~ivmaG~ 74 (281) .|.+.+=..+|+.+-+.|+.++-+.+.|.-+.+- ++....-.+.++...++.+++. +..++++.|- T Consensus 40 ~g~~r~~v~la~~la~~g~~~~rfd~~g~g~s~g-~~~~~~~~~~D~~aA~d~~~~~~~~~~~ivl~G~ 107 (274) T TIGR03100 40 VGSHRQFVLLARRLAEAGFPVLRFDYRGMGDSEG-ENLGFEGIDADIAAAIDAFREAAPHLRRIVAWGL 107 (274) T ss_pred CCCCHHHHHHHHHHHHCCCEEEEECCCCCCCCCC-CCCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEEE T ss_conf 6764279999999997898799975888878898-8787255489999999999863778673799997 No 177 >TIGR03466 HpnA hopanoid-associated sugar epimerase. The sequences in this family are members of the pfam01370 superfamily of NAD-dependent epimerases and dehydratases typically acting on nucleotide-sugar substrates. The genes of the family modeled here are generally in the same locus with genes involved in the biosynthesis and elaboration of hopene, the cyclization product of the polyisoprenoid squalene. Probab=24.62 E-value=50 Score=13.87 Aligned_cols=70 Identities=16% Similarity=0.157 Sum_probs=45.3 Q ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCC--EEEEEHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 3999977980389999999978997899995486673103885--67873899999999998649540677024 Q gi|254780768|r 3 RLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFE--CRELPLGDFCVLRSILHQYNIGRIVVAGA 74 (281) Q Consensus 3 kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~--~~~~~ig~ig~li~~Lk~~~i~~ivmaG~ 74 (281) |+-+..|+|-+=..+++.+.++|+++.++.........+.+.. ...-++.+...+-+.++ +|+.|+-+.. T Consensus 2 riLVTGgtGfiG~~l~~~L~~~G~~V~~l~r~~~~~~~~~~~~~~~~~gDl~d~~~~~~~~~--~~d~ViH~Aa 73 (328) T TIGR03466 2 KVLVTGATGFVGSAVVRLLLERGEEVRVLVRPTSDRRNLEGLDVEIVEGDLRDPASLRKALA--GCRALFHVAA 73 (328) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCEEEEEECCCCCHHHHCCCCCEEEEEECCCHHHHHHHHH--CCCEEEEECC T ss_conf 49998677799999999999784989999899986556521797799820799999999971--7858976134 No 178 >COG1526 FdhD Uncharacterized protein required for formate dehydrogenase activity [Energy production and conversion] Probab=24.47 E-value=50 Score=13.85 Aligned_cols=108 Identities=17% Similarity=0.145 Sum_probs=76.7 Q ss_pred HHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEEECCCCHH Q ss_conf 99899999973326722599981974889625342179999999754312345667718999348887531210662799 Q gi|254780768|r 154 ILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRADLPSIGAK 233 (281) Q Consensus 154 I~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~DlP~IG~~ 233 (281) +.....+.++-|...-.==+...++.++-+|++-=-++.-+=+|.....+ ...++.|++=+-+. .-+ T Consensus 146 l~~~q~~~~~TG~~H~Aal~~~~g~l~~~~EDVGRHNAvDKliG~~~~~G--~~~~~~vl~tSGR~-----------S~E 212 (266) T COG1526 146 LEKAQPLYRKTGGVHAAALFDPDGELLLVREDVGRHNAVDKLIGRALLEG--IPLSGKVLVTSGRA-----------SSE 212 (266) T ss_pred HHHHHHHHHHHCCEEEEEEECCCCCEEEEEEECCCHHHHHHHHHHHHHCC--CCCCCEEEEECCCC-----------CHH T ss_conf 99854877752758668898489978999974340268999999998779--97478699991776-----------289 Q ss_pred HHHHHHHCCCEEEEEECCCEEEECHHHHHHHHHHCCCEEEEECHHH Q ss_conf 9999998499099997397799858999999998796899957754 Q gi|254780768|r 234 TVQNVIKAGLAGIALEAGKSLVLEKELVKKHADEAGIFVCGIDREF 279 (281) Q Consensus 234 Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~~~~i~i~g~~~~~ 279 (281) -+..++.+|+..|+=-+.-| +.-++.|+++||+++|+-... T Consensus 213 mV~Kaa~~Gipil~S~SAPT-----~Lai~~Ae~~~iTLvgf~R~~ 253 (266) T COG1526 213 MVQKAAMAGIPILASVSAPT-----SLAIEAAERLGLTLVGFVRGG 253 (266) T ss_pred HHHHHHHHCCCEEEECCCCH-----HHHHHHHHHCCCEEEEEECCC T ss_conf 99999981995899844306-----999999998098899984288 No 179 >PRK06703 flavodoxin; Provisional Probab=24.43 E-value=51 Score=13.84 Aligned_cols=44 Identities=14% Similarity=0.099 Sum_probs=34.4 Q ss_pred CCEEEEEECCC-----CHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCC Q ss_conf 97399997798-----038999999997899789999548667310388 Q gi|254780768|r 1 MKRLLIIAGSG-----MLPYYVAKAARLKNDEPVIASVLNECSFDWQDF 44 (281) Q Consensus 1 M~kigIIAG~G-----~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~ 44 (281) |.||.|+=|+. .+...+++.++..|.++.++.+.......+.++ T Consensus 1 M~Kv~I~YgS~tGnte~~A~~i~~~l~~~g~~v~~~~~~~~~~~~l~~~ 49 (151) T PRK06703 1 MAKILIAYASMSGNTEDIADLIKVSLDAFDHEVVLQEMDGMDAEELLAY 49 (151) T ss_pred CCEEEEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHCC T ss_conf 9818999989861789999999999985799638976031998898428 No 180 >pfam00070 Pyr_redox Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain. Probab=24.43 E-value=51 Score=13.84 Aligned_cols=54 Identities=15% Similarity=0.233 Sum_probs=38.0 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCC Q ss_conf 99779803899999999789978999954866731038856787389999999999864954 Q gi|254780768|r 6 IIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIG 67 (281) Q Consensus 6 IIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~ 67 (281) +|-|+|..-.++|..+...|.++.++...+.. ++.++. .+ -..+.+.|++++|+ T Consensus 3 ~iiGgG~ig~E~A~~l~~~G~~Vtiie~~~~~---l~~~d~---~~--~~~~~~~l~~~GV~ 56 (82) T pfam00070 3 VVVGGGYIGLEFASALAKLGSKVTVVERRDRL---LRGFDE---EI--AKILQEKLEKNGIE 56 (82) T ss_pred EEECCCHHHHHHHHHHHHCCCEEEEECCCCCC---CHHCCH---HH--HHHHHHHHHHCCCE T ss_conf 99998899999999998639278998125733---022798---89--99999999866999 No 181 >COG0178 UvrA Excinuclease ATPase subunit [DNA replication, recombination, and repair] Probab=23.74 E-value=52 Score=13.76 Aligned_cols=36 Identities=33% Similarity=0.373 Sum_probs=22.6 Q ss_pred EEEECCCCH---------HHHHHHHHCCCEEEEEECCCEEEECHHHHHHHHH Q ss_conf 121066279---------9999999849909999739779985899999999 Q gi|254780768|r 224 RADLPSIGA---------KTVQNVIKAGLAGIALEAGKSLVLEKELVKKHAD 266 (281) Q Consensus 224 r~DlP~IG~---------~Ti~~~~~ag~~giaiea~~~lild~~~~i~~a~ 266 (281) =+|=|+||+ +|++.+...|=+.|+||.. +.++..|| T Consensus 506 VLDEPSIGLHqrDn~rLi~tL~~LRDlGNTviVVEHD-------edti~~AD 550 (935) T COG0178 506 VLDEPSIGLHQRDNERLIETLKRLRDLGNTVIVVEHD-------EDTIRAAD 550 (935) T ss_pred EECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEEEECC-------HHHHHHCC T ss_conf 8058866777230899999999998659859999569-------89996468 No 182 >TIGR00486 TIGR00486 conserved hypothetical protein TIGR00486; InterPro: IPR002678 This family contains several NIF3 (NGG1p interacting factor 3) protein homologues. NIF3 interacts with the yeast transcriptional coactivator NGG1p which is part of the ADA complex, the exact function of this interaction is unknown .. Probab=23.69 E-value=52 Score=13.75 Aligned_cols=202 Identities=15% Similarity=0.075 Sum_probs=92.3 Q ss_pred HHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHH Q ss_conf 89999999978997899995486673103885678738999999999986495406770242321000111026035789 Q gi|254780768|r 14 PYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLR 93 (281) Q Consensus 14 P~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~ 93 (281) -..++++|..+|.+.+|.--.-.-.+....+.. + .-++++++|-+++|.-..+==.++.-+.+ - T Consensus 57 s~~~~~~A~~~~~d~i~~HHp~~~~~~~~~~~~--~---~~~~~~k~Ll~~~i~lY~~HtnlD~~~~~-----------G 120 (325) T TIGR00486 57 SESVAKEAKEKNADLIITHHPLIFKPLKRLIRG--I---KPKRLLKILLENDISLYSAHTNLDAHRGL-----------G 120 (325) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCC--C---CCHHHHHHHHHCCCCEEECCCCCHHHCCC-----------C T ss_conf 999999998537979998078642872223666--4---10367889886796134104600233047-----------6 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHC---C------------------CEEECHHHCCHHHCC----CCCCCCCCCCCH Q ss_conf 999998751048035899999999976---9------------------748211222524203----543345546532 Q gi|254780768|r 94 ISKMIWQLVSGGNAAILKASIDLLESY---G------------------VSVVGAHEIVPELLV----QVGSLGTCVPNR 148 (281) Q Consensus 94 l~k~l~~~~~~gDd~iL~~i~~~fe~~---G------------------~~vi~~~~~l~~ll~----~~G~l~~~~p~~ 148 (281) .-+.|..++...+ + |...|+++ | ++--|..++.-+..+ +.-...++... T Consensus 121 ~N~~La~~L~~~~-~----V~~~~~~~~~~~lGnaGaGhiGeyshcaf~ikshPyeevaydiyPve~T~~~~~~G~~~~- 194 (325) T TIGR00486 121 NNDVLAEALGLEN-N----VPKEFEEETVSALGNAGAGHIGEYSHCAFMIKSHPYEEVAYDIYPVEETDDLIGLGRMVG- 194 (325) T ss_pred HHHHHHHHHCCCC-C----HHHHHHHHHHHHCCCCCCCCCCCHHHHEEEEECCCCHHHHEEECCCCCCHHHHCCCCEEE- T ss_conf 0689999818742-0----125543001211036676543320110112103760121101024424712315772267- Q ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEEEC Q ss_conf 34788998999999733267225999819748896253421799999997543123456677189993488875312106 Q gi|254780768|r 149 DVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRADLP 228 (281) Q Consensus 149 ~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~DlP 228 (281) ....++..-..+.+.-.++++...+++.++.--|- ..++++|++-+...|.. ....+.+..+.++|.-+ T Consensus 195 ~~~~~~~~~~~~~~~~~~L~v~~~~~~~~~~PsaW-----nq~~v~kvA~vsG~G~s---~ks~~~~a~~~gvd~yi--- 263 (325) T TIGR00486 195 ELKASIESKEEVLEIKKKLNVKPLLVVKKGEPSAW-----NQEKVKKVAVVSGSGLS---DKSFIMKALREGVDLYI--- 263 (325) T ss_pred EECCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCC-----CCCCCCEEEEEECCCCC---HHHHHHHHHHCCCCEEE--- T ss_conf 53455105788888887449971788506887523-----10216638999557875---27899999970896897--- Q ss_pred CCC---HHHHHHHHHCCCEEEEEE Q ss_conf 627---999999998499099997 Q gi|254780768|r 229 SIG---AKTVQNVIKAGLAGIALE 249 (281) Q Consensus 229 ~IG---~~Ti~~~~~ag~~giaie 249 (281) -| -++...+.+-|+..|.-. T Consensus 264 -TGd~~h~~~~~A~e~Gl~~i~ag 286 (325) T TIGR00486 264 -TGDLSHHTAHLARELGLNVIDAG 286 (325) T ss_pred -CCCCHHHHHHHHHHCCEEEEECC T ss_conf -16540667887643882798558 No 183 >PRK13361 molybdenum cofactor biosynthesis protein A; Provisional Probab=23.53 E-value=53 Score=13.73 Aligned_cols=117 Identities=18% Similarity=0.267 Sum_probs=81.7 Q ss_pred EEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHH------HCCCCHHHHHHHHHHHHHC Q ss_conf 7873899999999998649540677024232100011102603578999999875------1048035899999999976 Q gi|254780768|r 47 RELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQL------VSGGNAAILKASIDLLESY 120 (281) Q Consensus 47 ~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~------~~~gDd~iL~~i~~~fe~~ 120 (281) ..+++.++..+.+.+.+.||++|=+.|+ .|.+.. |+-. +.+.+... .---...+|...+..+.+. T Consensus 43 ~~Ls~eEi~~l~~~~~~~Gi~kvRlTGG---EPLlR~---dl~~---li~~l~~~~gi~~islTTNG~lL~~~a~~Lk~a 113 (329) T PRK13361 43 QVLTLEELAWLAQAFTELGVRKIRLTGG---EPLVRT---GCDQ---LVARLGKLPGLEELSMTTNGSRLARFAAELADA 113 (329) T ss_pred CCCCHHHHHHHHHHHHHCCCEEEEEECC---CCCCCC---CHHH---HHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHC T ss_conf 4689999999999999729528996278---822356---8899---999986179977189966477689999999977 Q ss_pred CCEE--ECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCE Q ss_conf 9748--2112225242035433455465323478899899999973326722599981974 Q gi|254780768|r 121 GVSV--VGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGR 179 (281) Q Consensus 121 G~~v--i~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ 179 (281) |+.= +|.+.+-|+.+.. +|+. .....+..|+..|.+.|--.|==-+|+..|. T Consensus 114 GL~rvNISLDsLd~~~f~~---ITr~----~~l~~Vl~gI~aA~~~G~~~VKiN~V~lrg~ 167 (329) T PRK13361 114 GLKRLNISLDTLNPDLFAA---LTRN----GRLERVIAGIDAAKAAGFERIKLNAVILRGQ 167 (329) T ss_pred CCCEEEEECCCCCHHHHHH---HHCC----CCHHHHHHHHHHHHHCCCCEEEEEEEEECCC T ss_conf 9986997357799999997---7289----9769999999999977997388999983687 No 184 >PRK12999 pyruvate carboxylase; Reviewed Probab=23.52 E-value=53 Score=13.73 Aligned_cols=65 Identities=14% Similarity=0.047 Sum_probs=41.2 Q ss_pred EEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCC--CCCEEEE--------EHHHHHHHHHHHHHCCCCEE Q ss_conf 99977980389999999978997899995486673103--8856787--------38999999999986495406 Q gi|254780768|r 5 LIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQ--DFECREL--------PLGDFCVLRSILHQYNIGRI 69 (281) Q Consensus 5 gIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~--~~~~~~~--------~ig~ig~li~~Lk~~~i~~i 69 (281) =|||-.|....-|+++|+..|.+.+.+.=........- ..+.+.+ +.-.+..+++.-++.+|+-| T Consensus 8 vLiANRGEIA~Ri~Ra~~elgi~tVavys~~D~~s~h~~~ADea~~ig~~~~p~~sYL~~~~ii~~A~~~~~dAi 82 (1147) T PRK12999 8 VLVANRGEIAIRIFRAATELGIRTVAIYSEEDKLSLHRFKADEAYLIGEGKGPIEAYLDIDEIIRVAKQAGVDAI 82 (1147) T ss_pred EEEECCCHHHHHHHHHHHHCCCCEEEEECHHHCCCHHHHHCCEEEEECCCCCHHHHHCCHHHHHHHHHHHCCCEE T ss_conf 999668199999999999839958999784645782588588567728999713111299999999999498999 No 185 >TIGR01133 murG undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase; InterPro: IPR006009 The murG gene of Escherichia coli encodes the N-acetylglucosaminyltransferase, UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase, responsible for the final step in the formation of the lipid-linked disaccharide-pentapeptide subunit of peptidoglycan. The enzyme is peripherally associated with the inner face of the cytoplasmic membrane. Therefore, the peptidoglycan subunit is completely assembled before it traverses the cytoplasmic membrane .; GO: 0050511 undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase activity, 0019277 UDP-N-acetylgalactosamine biosynthetic process, 0009276 1-2nm peptidoglycan-based cell wall. Probab=23.00 E-value=54 Score=13.66 Aligned_cols=50 Identities=22% Similarity=0.282 Sum_probs=32.8 Q ss_pred CCEEEEEECC--CCHHH-HHHHHHHHCC--CCEEEEEECCCCCCCC-C-C--CCEEEEE Q ss_conf 9739999779--80389-9999999789--9789999548667310-3-8--8567873 Q gi|254780768|r 1 MKRLLIIAGS--GMLPY-YVAKAARLKN--DEPVIASVLNECSFDW-Q-D--FECRELP 50 (281) Q Consensus 1 M~kigIIAG~--G~LP~-~ia~~~~~~g--~~~~ii~l~~~~~~~~-~-~--~~~~~~~ 50 (281) |.+|.|-+|+ |-+|- .+++++++++ ++++.++-.+..+..+ + + ++.+.++ T Consensus 5 ~~~~~~~gGGTGG~fPAlA~a~~l~~~~~~~~v~~lG~~~g~e~~lv~~~~~~~~~~i~ 63 (368) T TIGR01133 5 MKKVALAGGGTGGIFPALAVAEELIKRGPEVEVVWLGTKRGLEERLVPSKEGIEFLTIP 63 (368) T ss_pred CEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEE T ss_conf 22899972783026899999999997489369998506775000034321574177774 No 186 >PRK00164 moaA molybdenum cofactor biosynthesis protein A; Reviewed Probab=22.78 E-value=54 Score=13.63 Aligned_cols=171 Identities=16% Similarity=0.198 Sum_probs=98.0 Q ss_pred EEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHCCCHHHHHHHHHHHHHH------HCCCCHHHHHHHHHHHHHC Q ss_conf 7873899999999998649540677024232100011102603578999999875------1048035899999999976 Q gi|254780768|r 47 RELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDLCFSIKDSLRISKMIWQL------VSGGNAAILKASIDLLESY 120 (281) Q Consensus 47 ~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l~~D~~~~~~l~k~l~~~------~~~gDd~iL~~i~~~fe~~ 120 (281) ..++..++..+++.+.+.||++|=+.|+ .|.+.. |+-. +.+.+... .---...+|.-.+..+.+. T Consensus 49 ~~Ls~eEi~~i~~~~~~lGi~kiRlTGG---EPLlR~---di~~---li~~l~~~~gi~~v~lTTNG~lL~~~a~~Lk~a 119 (334) T PRK00164 49 ELLSLEEIERLVRAFAALGVRKIRLTGG---EPLLRK---DLED---IIARLAALPGIRDLALTTNGYLLARRAAALKDA 119 (334) T ss_pred HCCCHHHHHHHHHHHHHCCCCEEEECCC---CCCCCC---CHHH---HHHHHHHCCCCCCEEEECCHHHHHHHHHHHHHC T ss_conf 2299999999999999709627986078---843235---7899---999986327975178844488999999999985 Q ss_pred CCEEE--CHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEEEECCCCHHHHHHHHHH Q ss_conf 97482--1122252420354334554653234788998999999733267225999819748896253421799999997 Q gi|254780768|r 121 GVSVV--GAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVALEGIEGTDSMLQRIVD 198 (281) Q Consensus 121 G~~vi--~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~ViaiEa~eGTD~mi~R~~~ 198 (281) |+.=+ |.+.+-|+.+. .+|+. .....+..|++.|.+.|---|==-+|+..|.- .|+.+. ..+ T Consensus 120 GL~riNISLDsLd~~~f~---~IT~~----~~l~~Vl~gI~~A~~~G~~~vKiN~V~~~g~N--------~dEi~~-li~ 183 (334) T PRK00164 120 GLTRVNVSLDSLDPERFK---AITGR----DRLDQVLAGIDAALAAGLEPVKVNAVLMKGVN--------DDEIPD-LLR 183 (334) T ss_pred CCCEEEEEEEECCHHHHH---HHHCC----CCHHHHHHHHHHHHHCCCCCEEEEEEEECCCC--------HHHHHH-HHH T ss_conf 998699711318999999---98489----97599999999999589876168999637989--------899999-999 Q ss_pred HHHHCCCCCCCCEEEEEECCCCCCCEEE-ECCCCHHHHHHHHHCCCEEE Q ss_conf 5431234566771899934888753121-06627999999998499099 Q gi|254780768|r 199 CRNNGRILAGKSGVLVKMCKSQQDMRAD-LPSIGAKTVQNVIKAGLAGI 246 (281) Q Consensus 199 ~~~~~~~~~~~~~ilvK~~K~~QD~r~D-lP~IG~~Ti~~~~~ag~~gi 246 (281) +....+. .-=|+-..--+...+.. -..+..+.+....+.++.-. T Consensus 184 ~~~~~~i----~vRFIE~Mp~g~~~~~~~~~~~~~~~i~~~l~~~~~l~ 228 (334) T PRK00164 184 WAKDRGI----QLRFIELMPTGEGNEWFRDHHLSGAEIRARLAEGFTLQ 228 (334) T ss_pred HHHHCCC----EEEEEEEECCCCCCCHHHCCCCCHHHHHHHHHHHCCCE T ss_conf 9964696----59999982167776435306548999999998528961 No 187 >COG0027 PurT Formate-dependent phosphoribosylglycinamide formyltransferase (GAR transformylase) [Nucleotide transport and metabolism] Probab=22.37 E-value=55 Score=13.58 Aligned_cols=65 Identities=20% Similarity=0.113 Sum_probs=40.4 Q ss_pred EEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHHCCCCEEE Q ss_conf 99779803899999999789978999954866731038856787389999999999864954067 Q gi|254780768|r 6 IIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQYNIGRIV 70 (281) Q Consensus 6 IIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~~~i~~iv 70 (281) ++-|+|.|-.+++=+|..-|.+++.+.--..+.+.--.+.++.+++-+-..+-...++.+-+.+| T Consensus 16 mLLGSGELGKEvaIe~QRLG~eVIAVDrY~~APAmqVAhrs~Vi~MlD~~al~avv~re~Pd~IV 80 (394) T COG0027 16 MLLGSGELGKEVAIEAQRLGVEVIAVDRYANAPAMQVAHRSYVIDMLDGDALRAVVEREKPDYIV 80 (394) T ss_pred EEECCCCCCHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHEEEEECCCHHHHHHHHHHHCCCEEE T ss_conf 99638864469999888638779996376898066421111345456999999999865898133 No 188 >pfam09374 PG_binding_3 Predicted Peptidoglycan domain. This family contains a potential peptidoglycan binding domain. Probab=22.15 E-value=56 Score=13.55 Aligned_cols=13 Identities=31% Similarity=0.424 Sum_probs=7.2 Q ss_pred CCCHHHHHHHHHC Q ss_conf 6279999999984 Q gi|254780768|r 229 SIGAKTVQNVIKA 241 (281) Q Consensus 229 ~IG~~Ti~~~~~a 241 (281) .|||+|+..+..+ T Consensus 20 iIGp~Tl~Av~~~ 32 (67) T pfam09374 20 IIGPKTLAALASM 32 (67) T ss_pred CCCHHHHHHHHHC T ss_conf 8289999999967 No 189 >cd03870 M14_CPA Peptidase M14 Carboxypeptidase (CP) A (CPA) belongs to the A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPA enzymes generally favor hydrophobic residues. A/B subfamily enzymes are normally synthesized as inactive precursors containing preceding signal peptide, followed by a globular N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The procarboxypeptidase A (PCPA) is produced by the exocrine pancreas and stored as a stable zymogen in the pancreatic granules until secretion into the digestive tract occurs. This subfamily includes CPA1, CPA2 and CPA4 forms. Within these A forms, there are slightly different specificities, with CPA1 preferring aliphatic and small aromatic residues, and CPA2 p Probab=21.94 E-value=56 Score=13.52 Aligned_cols=62 Identities=19% Similarity=0.316 Sum_probs=44.3 Q ss_pred HHHHHHHHHHC-CCEE-ECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHC-----CCCCEEEE Q ss_conf 99999999976-9748-21122252420354334554653234788998999999733-----26722599 Q gi|254780768|r 110 LKASIDLLESY-GVSV-VGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALS-----ELDVGQSA 173 (281) Q Consensus 110 L~~i~~~fe~~-G~~v-i~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~-----~~DiGQsv 173 (281) -++|.++++++ .|.. ++.|.+-.-++.|-|.-+...|+..++.++ |...+++|. .+-+|+++ T Consensus 170 t~Av~~~~~~~~~i~~~i~~HSygq~il~P~gyt~~~~pd~~~l~~~--a~~~a~ai~~~~g~~Y~~G~~~ 238 (301) T cd03870 170 VKSIVDFVKSHGNFKAFISIHSYSQLLLYPYGYTTQSIPDKTELNQV--AKSAVAALKSLYGTSYKYGSII 238 (301) T ss_pred HHHHHHHHHHCCCCEEEEEECCCCCEEEECCCCCCCCCCCHHHHHHH--HHHHHHHHHHHHCCCCCCCCHH T ss_conf 99999999848782699997156655885676788799899999999--9999999999769986257555 No 190 >PRK09087 hypothetical protein; Validated Probab=21.91 E-value=56 Score=13.52 Aligned_cols=67 Identities=13% Similarity=0.233 Sum_probs=40.8 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHH-CCCHHHHHHHHHHH-HHHHCCCCHHHHHHH-HHHHHHCCCEE Q ss_conf 999999999986495406770242321000111-02603578999999-875104803589999-99999769748 Q gi|254780768|r 52 GDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDL-CFSIKDSLRISKMI-WQLVSGGNAAILKAS-IDLLESYGVSV 124 (281) Q Consensus 52 g~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l-~~D~~~~~~l~k~l-~~~~~~gDd~iL~~i-~~~fe~~G~~v 124 (281) ..+-.+++.++..++. ++|++.- .|.-.++ .+|+++ |+...+ +.. ..=||.++.++ .+.|+..|+.+ T Consensus 104 e~LFhl~N~~~~~~~~-LLlts~~--~p~~l~~~L~DL~S--RL~~~~~~~I-~~pdD~ll~~~L~k~~~~r~l~l 173 (226) T PRK09087 104 TGLFHLINSVRQAGTS-LLMTSRL--WPSAWNVKLPDLKS--RLKAATVVEI-GEPDDALLSGVIFKLFADRQLYV 173 (226) T ss_pred HHHHHHHHHHHHCCCE-EEEECCC--CHHHCCCCCHHHHH--HHHCCEEEEE-CCCCHHHHHHHHHHHHHHCCCCC T ss_conf 9999999999853987-9998898--95666762468999--9857857983-59998999999999987576578 No 191 >PRK07178 acetyl-CoA carboxylase; Validated Probab=21.67 E-value=57 Score=13.49 Aligned_cols=68 Identities=12% Similarity=0.035 Sum_probs=44.2 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHH--------HHHHHHHHHHHCCCCE Q ss_conf 9739999779803899999999789978999954866731038856787389--------9999999998649540 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLG--------DFCVLRSILHQYNIGR 68 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig--------~ig~li~~Lk~~~i~~ 68 (281) |=|==|||-.|....-+++.|+..|++.+.+.=....+...-......+.+| .+.++++.-+..+|+- T Consensus 1 M~~kvLIANRGEIA~RiiRt~~elgi~tVavys~~D~~a~hv~~ADea~~ig~~~~~sYln~~~Ii~~A~~~g~dA 76 (471) T PRK07178 1 MIKKILIANRGEIAVRIVRACAEMGIRSVAIYSEADRHALHVKRADEAHSIGADPLAGYLNPRKLVNLAVETGCDA 76 (471) T ss_pred CCCEEEEECCCHHHHHHHHHHHHCCCCEEEEECHHHCCCHHHHHCCEEEECCCCHHHHHCCHHHHHHHHHHHCCCE T ss_conf 9760899778699999999999839948999083756683688488888718872665449999999999969999 No 192 >TIGR02361 dak_ATP dihydroxyacetone kinase; InterPro: IPR012734 This family consists of examples of the single chain form of dihydroxyacetone kinase (also called glycerone kinase) that uses ATP (2.7.1.29 from EC) as the phosphate donor, rather than a phosphoprotein as in Escherichia coli. This form has separable domains homologous to the K and L subunits of the E. coli enzyme, and is found in yeasts and other eukaryotes and in some bacteria, including Citrobacter freundii. The member from tomato has been shown to phosphorylate dihydroxyacetone, 3,4-dihydroxy-2-butanone, and some other aldoses and ketoses . ; GO: 0004371 glycerone kinase activity, 0005524 ATP binding. Probab=21.66 E-value=52 Score=13.76 Aligned_cols=31 Identities=19% Similarity=0.238 Sum_probs=11.7 Q ss_pred HHHHHHHHCC--C-CEEEEECCCCCCCCHHHHCCCH Q ss_conf 9999998649--5-4067702423210001110260 Q gi|254780768|r 56 VLRSILHQYN--I-GRIVVAGAIDRRPNVQDLCFSI 88 (281) Q Consensus 56 ~li~~Lk~~~--i-~~ivmaG~V~krP~~~~l~~D~ 88 (281) +..+.|..+. + ...++.|.+. =+|+-.-|++ T Consensus 304 ~~~~~L~~~~Y~~~p~~~~~G~~~--TSLng~GFSi 337 (627) T TIGR02361 304 EVVEQLAKEKYKIIPVRIISGTFM--TSLNGPGFSI 337 (627) T ss_pred HHHHHHHHCCCCEEEEEEECCCCH--HHCCCCCCEE T ss_conf 999987517873024588610323--1101786146 No 193 >pfam00814 Peptidase_M22 Glycoprotease family. The Peptidase M22 proteins are part of the HSP70-actin superfamily. The region represented here is an insert into the fold and is not found in the rest of the family (beyond the Peptidase M22 family). Included in this family are the Rhizobial NodU proteins and the HypF regulator. This region also contains the histidine dyad believed to coordinate the metal ion and hence provide catalytic activity. Interestingly the histidines are not well conserved, and there is a lack of experimental evidence to support peptidase activity as a general property of this family. There also appear to be instances of this domain outside of the HSP70-actin superfamily. Probab=21.58 E-value=57 Score=13.48 Aligned_cols=16 Identities=13% Similarity=0.206 Sum_probs=7.4 Q ss_pred HHHHHHHCCCEEEEEE Q ss_conf 9999998499099997 Q gi|254780768|r 234 TVQNVIKAGLAGIALE 249 (281) Q Consensus 234 Ti~~~~~ag~~giaie 249 (281) |.+.+...+.+.+++. T Consensus 173 ~~~a~~~~~~k~lvv~ 188 (225) T pfam00814 173 TERALKHTGAKELVIL 188 (225) T ss_pred HHHHHHHHCCCEEEEE T ss_conf 9999998694989998 No 194 >PRK06714 S-adenosylhomocysteine nucleosidase; Validated Probab=21.49 E-value=58 Score=13.46 Aligned_cols=70 Identities=13% Similarity=0.153 Sum_probs=35.1 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCC--CCCCCCCCCEEE--EEHHHHHHHHH---HHHHCCCCEEEEEC Q ss_conf 973999977980389999999978997899995486--673103885678--73899999999---99864954067702 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLNE--CSFDWQDFECRE--LPLGDFCVLRS---ILHQYNIGRIVVAG 73 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~~--~~~~~~~~~~~~--~~ig~ig~li~---~Lk~~~i~~ivmaG 73 (281) |+|||||+ .+|.++..-...-. +.....+.+. ..-.+.+.++.. -.+|++..... .+.+.+++.++|.| T Consensus 1 M~kIgIIg---AM~~Ev~~l~~~~~-~~~~~~~~~~~fy~G~l~g~~VVl~~sGIGKVnAA~~t~~LI~~F~~d~IIntG 76 (236) T PRK06714 1 MKRIAIVA---AWEPELTYLHQSYP-SERVEKRAAWEFHFHTINDLEIISVITGVGKVSCASCVQLLISEFQPDELFMTG 76 (236) T ss_pred CCEEEEEE---CCHHHHHHHHHHHC-CCEEEEECCEEEEEEEECCEEEEEEECCCCHHHHHHHHHHHHHHCCCCEEEECC T ss_conf 96699995---28999999999730-071899988899999999999999967822999999999999844999999878 Q ss_pred C Q ss_conf 4 Q gi|254780768|r 74 A 74 (281) Q Consensus 74 ~ 74 (281) - T Consensus 77 v 77 (236) T PRK06714 77 I 77 (236) T ss_pred C T ss_conf 6 No 195 >COG1903 CbiD Cobalamin biosynthesis protein CbiD [Coenzyme metabolism] Probab=21.42 E-value=58 Score=13.46 Aligned_cols=64 Identities=13% Similarity=0.089 Sum_probs=41.3 Q ss_pred HHHHHHCCCCEEEEEECCCCCCCCC-----CCCEEEEEHH-HHHHHHHHHHHCCCCEEEEECCCCCCCCHH Q ss_conf 9999978997899995486673103-----8856787389-999999999864954067702423210001 Q gi|254780768|r 18 AKAARLKNDEPVIASVLNECSFDWQ-----DFECRELPLG-DFCVLRSILHQYNIGRIVVAGAIDRRPNVQ 82 (281) Q Consensus 18 a~~~~~~g~~~~ii~l~~~~~~~~~-----~~~~~~~~ig-~ig~li~~Lk~~~i~~ivmaG~V~krP~~~ 82 (281) .+.++..+++.+++.. |++..++. ..+...+.++ -+|.+++...++++++++|+|..-|--.+. T Consensus 195 l~~~r~~~~~~iv~~~-Gn~g~~~a~~~~~~~~~~~v~~~n~vG~~l~~a~~~~~~~i~i~G~pGKL~KlA 264 (367) T COG1903 195 LDVARAAGLDHVVFCP-GNTGEDYARKLFILPEQAIVKMGNFVGSMLKEARELGVKEILIFGHPGKLIKLA 264 (367) T ss_pred HHHHHHCCCCEEEECC-CHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHH T ss_conf 9999863775799856-826889999866870377766887777999999866888799976717899986 No 196 >COG5583 Uncharacterized small protein [Function unknown] Probab=21.32 E-value=58 Score=13.44 Aligned_cols=33 Identities=27% Similarity=0.356 Sum_probs=22.5 Q ss_pred HHHHHHHHHCCCCCEE-EEEEECCEEEEEECCCC Q ss_conf 8999999733267225-99981974889625342 Q gi|254780768|r 156 AAMKSAEALSELDVGQ-SAVSIGGRVVALEGIEG 188 (281) Q Consensus 156 ~g~~i~~~l~~~DiGQ-svVv~~g~ViaiEa~eG 188 (281) +--++-+.+-.++-|- .++|+||.|+-+|+.|- T Consensus 10 ~~ekI~~~Le~lkyGsV~ItVhdgqViQIE~~EK 43 (54) T COG5583 10 VIEKIKKALEGLKYGSVTITVHDGQVIQIEASEK 43 (54) T ss_pred HHHHHHHHHHHCCCCEEEEEEECCEEEEEEHHHH T ss_conf 9999999986366333999997898999960220 No 197 >pfam09652 Cas_VVA1548 Putative CRISPR-associated protein (Cas_VVA1548). This entry represents a conserved region of about 95 amino acids found exclusively in species with CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats). In all bacterial species that contain this entry, the genes encoding the proteins are in the midst of a cluster of cas (CRISPR-associated) genes. Probab=21.31 E-value=58 Score=13.44 Aligned_cols=24 Identities=25% Similarity=0.144 Sum_probs=20.4 Q ss_pred CCHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 803899999999789978999954 Q gi|254780768|r 11 GMLPYYVAKAARLKNDEPVIASVL 34 (281) Q Consensus 11 G~LP~~ia~~~~~~g~~~~ii~l~ 34 (281) |.||..+|.+.-.+|-.++-+.+. T Consensus 40 GtLPvhLaa~vc~kga~y~~l~l~ 63 (93) T pfam09652 40 GTLPVHLAADVCEKGARYFHLSLD 63 (93) T ss_pred EECCHHHHHHHHHCCCEEEEEEEC T ss_conf 807789999997447179999815 No 198 >PRK01216 DNA polymerase IV; Validated Probab=21.31 E-value=52 Score=13.74 Aligned_cols=25 Identities=20% Similarity=0.384 Sum_probs=11.4 Q ss_pred CCHHHHHHHHHHHHH-CCCEEECHHH Q ss_conf 803589999999997-6974821122 Q gi|254780768|r 105 GNAAILKASIDLLES-YGVSVVGAHE 129 (281) Q Consensus 105 gDd~iL~~i~~~fe~-~G~~vi~~~~ 129 (281) +.+.+|..++..+++ +|+.++++.+ T Consensus 145 a~nk~lAKiAs~~~KP~G~~~v~~~~ 170 (351) T PRK01216 145 GPNKVFAKIIADMAKPNGLGVISPEE 170 (351) T ss_pred CCCHHHHHHHHHHCCCCCEEEECHHH T ss_conf 65499999988723999579988368 No 199 >pfam04994 TfoX_C TfoX C-terminal domain. TfoX may play a key role in the development of genetic competence by regulating the expression of late competence-specific genes. This family corresponds to the C-terminal presumed domain of TfoX. The domain is found associated with pfam00383 in a cytidine and deoxycytidylate deaminase family protein from Neisseria meningitidis serogroup B. It is also found as an isolated domain in some proteins suggesting this is an autonomous domain. Probab=21.20 E-value=58 Score=13.43 Aligned_cols=16 Identities=44% Similarity=0.665 Sum_probs=5.4 Q ss_pred ECCCCHHHHHHHHHCC Q ss_conf 0662799999999849 Q gi|254780768|r 227 LPSIGAKTVQNVIKAG 242 (281) Q Consensus 227 lP~IG~~Ti~~~~~ag 242 (281) +|-+|+..-+.+.++| T Consensus 4 LpNlg~~~er~L~~vG 19 (77) T pfam04994 4 LPNLGLKLERMLKKAG 19 (77) T ss_pred CCCCCHHHHHHHHHCC T ss_conf 8898999999999949 No 200 >TIGR00097 HMP-P_kinase phosphomethylpyrimidine kinase; InterPro: IPR004399 Phosphomethylpyrimidine kinase (2.7.4.7 from EC), also known as HMP-phosphate kinase, catalyses the phosphorylation of HMP-P to HMP-PP in the reaction: ATP + 4-amino-2-methyl-5-phosphomethylpyrimidine = ADP + 4-amino-2-methyl-5-diphosphomethylpyrimidine during the biosynthesis of thiamine.; GO: 0008972 phosphomethylpyrimidine kinase activity, 0009228 thiamin biosynthetic process. Probab=21.04 E-value=59 Score=13.40 Aligned_cols=79 Identities=19% Similarity=0.283 Sum_probs=51.9 Q ss_pred CCHHHHHHHHHHHHHCCCEEECHHHCCHHHCCCCCCCCC-CCCCHHHHHHHHHHHHHHH-HHCCCCCEEEEEEECCEEEE Q ss_conf 803589999999997697482112225242035433455-4653234788998999999-73326722599981974889 Q gi|254780768|r 105 GNAAILKASIDLLESYGVSVVGAHEIVPELLVQVGSLGT-CVPNRDVKRDILAAMKSAE-ALSELDVGQSAVSIGGRVVA 182 (281) Q Consensus 105 gDd~iL~~i~~~fe~~G~~vi~~~~~l~~ll~~~G~l~~-~~p~~~~~~dI~~g~~i~~-~l~~~DiGQsvVv~~g~Via 182 (281) |+.=+-...++.|.++ ++|.-+++-.-+++...|.+ .+ -++..|...+.+.+. ++|. +.|++++|+.=- T Consensus 109 G~~Ll~~~a~~~l~~~---LLP~A~~~TPN~pEAe~L~g~~~--I~~~~dm~~AAk~l~~~lG~----~~vl~KGGHl~h 179 (264) T TIGR00097 109 GAALLEEEAVEALKKR---LLPLATLVTPNLPEAEALLGATK--IRTEEDMIKAAKKLREELGP----KAVLIKGGHLEH 179 (264) T ss_pred CCCCCCHHHHHHHHHH---HCCCEEEECCCHHHHHHHCCCCC--CCCHHHHHHHHHHHHHHHCC----CCEEEECCCCCC T ss_conf 7546547899999997---38711430689799999808966--58989999999999885178----808982773444 Q ss_pred EECCC-CHHHHH Q ss_conf 62534-217999 Q gi|254780768|r 183 LEGIE-GTDSML 193 (281) Q Consensus 183 iEa~e-GTD~mi 193 (281) . .=| ++|-.. T Consensus 180 f-~~~~~~Dv~~ 190 (264) T TIGR00097 180 F-GDEDAVDVLY 190 (264) T ss_pred C-CCCCCCEEEE T ss_conf 6-7777631688 No 201 >pfam04127 DFP DNA / pantothenate metabolism flavoprotein. The DNA/pantothenate metabolism flavoprotein (EC:4.1.1.36) affects synthesis of DNA, and pantothenate metabolism. Probab=20.99 E-value=59 Score=13.40 Aligned_cols=66 Identities=11% Similarity=0.051 Sum_probs=42.0 Q ss_pred CCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCEEEEEHHHHHHHHHHHHH---CCCCEEEEECCCC Q ss_conf 980389999999978997899995486673103885678738999999999986---4954067702423 Q gi|254780768|r 10 SGMLPYYVAKAARLKNDEPVIASVLNECSFDWQDFECRELPLGDFCVLRSILHQ---YNIGRIVVAGAID 76 (281) Q Consensus 10 ~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~~~~~~~~~~~~ig~ig~li~~Lk~---~~i~~ivmaG~V~ 76 (281) +|..=..+|+++...|.++.++.=.....+..+. ....+++.....+...+.. ..++-++|+-.|. T Consensus 27 SGkmG~alA~~~~~~Ga~V~li~g~~~~~~~~~~-~~~~i~v~t~~em~~~~~~~~~~~~D~~I~aAAVs 95 (197) T pfam04127 27 SGKMGAAIAEEFLKRGADVTLIAGKGSLKPEPSG-NLLIIPVETAEEMLNALKEIAPDLHDVFILAAAVS 95 (197) T ss_pred CHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCC-CEEEEEECCHHHHHHHHHHHCCCCCCEEEEEEECC T ss_conf 5599999999999789938997235655668988-81899968999999999974756577999910004 No 202 >PRK01372 ddl D-alanine--D-alanine ligase; Reviewed Probab=20.82 E-value=59 Score=13.37 Aligned_cols=81 Identities=12% Similarity=0.084 Sum_probs=49.7 Q ss_pred CCEEEEEECCCCHHH--------HHHHHHHHCCCCEEEEEECCCCCCCCC--CCCEEE----EEHHHHHHHHHHHHHCCC Q ss_conf 973999977980389--------999999978997899995486673103--885678----738999999999986495 Q gi|254780768|r 1 MKRLLIIAGSGMLPY--------YVAKAARLKNDEPVIASVLNECSFDWQ--DFECRE----LPLGDFCVLRSILHQYNI 66 (281) Q Consensus 1 M~kigIIAG~G~LP~--------~ia~~~~~~g~~~~ii~l~~~~~~~~~--~~~~~~----~~ig~ig~li~~Lk~~~i 66 (281) ++|||++.|+-+-=+ .+.+++.+.||+++.+.........++ +++..+ =+.|+=|.+-.+|...+| T Consensus 4 ~~kI~vl~GG~S~E~eVSl~S~~~v~~aL~~~~y~v~~id~~~~~~~~l~~~~~D~vf~~lHG~~GEDG~iQglLe~~~I 83 (304) T PRK01372 4 FGKVAVLMGGTSAEREVSLNSGAAVLAALREAGYDAHPIDPGEDPAAELKELGFDRVFNALHGRGGEDGTIQGLLELLGI 83 (304) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHCCC T ss_conf 77189993678712899999999999988375997999828964677875339999999068999856499999998599 Q ss_pred CEEEEECCCCCCCCHHHHCCC Q ss_conf 406770242321000111026 Q gi|254780768|r 67 GRIVVAGAIDRRPNVQDLCFS 87 (281) Q Consensus 67 ~~ivmaG~V~krP~~~~l~~D 87 (281) -+ .|. .+.-+.+-+| T Consensus 84 PY---tGs---~~~asal~mD 98 (304) T PRK01372 84 PY---TGS---GVLASALAMD 98 (304) T ss_pred CC---CCC---CHHHHHHHHH T ss_conf 83---579---9899877653 No 203 >COG2344 AT-rich DNA-binding protein [General function prediction only] Probab=20.76 E-value=59 Score=13.36 Aligned_cols=112 Identities=21% Similarity=0.203 Sum_probs=54.3 Q ss_pred CCHHHHHHHHH--HHHHCCCEEECHHHCCHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCEEEE Q ss_conf 80358999999--9997697482112225242035433455465323478899899999973326722599981974889 Q gi|254780768|r 105 GNAAILKASID--LLESYGVSVVGAHEIVPELLVQVGSLGTCVPNRDVKRDILAAMKSAEALSELDVGQSAVSIGGRVVA 182 (281) Q Consensus 105 gDd~iL~~i~~--~fe~~G~~vi~~~~~l~~ll~~~G~l~~~~p~~~~~~dI~~g~~i~~~l~~~DiGQsvVv~~g~Via 182 (281) |-.+|=+++.+ +.+++||+++.+=++-|+.. |-..+.-| -.+..+++.- +-+.|+--++. .|=+ T Consensus 91 G~GnlG~All~Y~f~~~~~~~iv~~FDv~~~~V---G~~~~~v~-V~~~d~le~~------v~~~dv~iaiL----tVPa 156 (211) T COG2344 91 GVGNLGRALLNYNFSKKNGMKIVAAFDVDPDKV---GTKIGDVP-VYDLDDLEKF------VKKNDVEIAIL----TVPA 156 (211) T ss_pred CCCHHHHHHHCCCCHHHCCCEEEEEECCCHHHH---CCCCCCEE-EECHHHHHHH------HHHCCCCEEEE----ECCH T ss_conf 057088898607423326935999961787883---75248814-5526888889------88617618999----7348 Q ss_pred EECCCCHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCCCCEEEECCCCHHHHHH Q ss_conf 6253421799999997543123456677189993488875312106627999999 Q gi|254780768|r 183 LEGIEGTDSMLQRIVDCRNNGRILAGKSGVLVKMCKSQQDMRADLPSIGAKTVQN 237 (281) Q Consensus 183 iEa~eGTD~mi~R~~~~~~~~~~~~~~~~ilvK~~K~~QD~r~DlP~IG~~Ti~~ 237 (281) ..|-|=+|.+++--- .+.+ .-.++-...|+--+=..+|+ +.+++|+.- T Consensus 157 ~~AQ~vad~Lv~aGV-----kGIl-NFtPv~l~~pe~V~V~~iDL-t~eLqtL~~ 204 (211) T COG2344 157 EHAQEVADRLVKAGV-----KGIL-NFTPVRLQVPEGVIVENIDL-TAELQTLLY 204 (211) T ss_pred HHHHHHHHHHHHCCC-----CEEE-ECCCEEECCCCCCEEEEEEH-HHHHHHHHH T ss_conf 989999999998387-----3588-42635751898737998305-788999999 No 204 >PRK13304 L-aspartate dehydrogenase; Reviewed Probab=20.48 E-value=60 Score=13.33 Aligned_cols=34 Identities=18% Similarity=0.301 Sum_probs=24.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHHCCCCEEEEEECC Q ss_conf 97399997798038999999997899789999548 Q gi|254780768|r 1 MKRLLIIAGSGMLPYYVAKAARLKNDEPVIASVLN 35 (281) Q Consensus 1 M~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l~~ 35 (281) |-|+||| |.|.+-..+++.+.....+.-+.++.+ T Consensus 1 M~rVgiI-G~GaIG~~Va~~l~~g~~~~~L~~V~~ 34 (265) T PRK13304 1 MLKIGIV-GCGAIANLITKAIDSGRINAELLAFYD 34 (265) T ss_pred CCEEEEE-CCCHHHHHHHHHHHCCCCCEEEEEEEC T ss_conf 9499998-657899999999867998649999978 No 205 >PRK04140 hypothetical protein; Provisional Probab=20.38 E-value=61 Score=13.31 Aligned_cols=48 Identities=10% Similarity=0.185 Sum_probs=27.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCEEE-EEECCCCCCCEEEECCCCHHHHHHHH Q ss_conf 1799999997543123456677189-99348887531210662799999999 Q gi|254780768|r 189 TDSMLQRIVDCRNNGRILAGKSGVL-VKMCKSQQDMRADLPSIGAKTVQNVI 239 (281) Q Consensus 189 TD~mi~R~~~~~~~~~~~~~~~~il-vK~~K~~QD~r~DlP~IG~~Ti~~~~ 239 (281) |.++.+|+.-..+-+... +..+++ ++-. .++.==..|+|+.+.++.+. T Consensus 254 ~~~~~rrA~~~~~is~v~-~s~sv~I~~~~--~~~~i~~~~~I~~~eL~~i~ 302 (319) T PRK04140 254 TEETKRRARIVSSISEVT-NSESVFILEKK--TKESVKGIPVISIEELEKID 302 (319) T ss_pred CHHHHHHHHHHHHHHHHH-CCCEEEEEECC--EEEEECCEEEEEHHHHHHCC T ss_conf 288999999999999986-68539998055--02345788787699976448 No 206 >PRK08887 nicotinic acid mononucleotide adenylyltransferase; Provisional Probab=20.35 E-value=58 Score=13.46 Aligned_cols=31 Identities=16% Similarity=0.169 Sum_probs=19.2 Q ss_pred CCEEEEEECCCCHH----HHHHHHHHHCCCCEEEEE Q ss_conf 97399997798038----999999997899789999 Q gi|254780768|r 1 MKRLLIIAGSGMLP----YYVAKAARLKNDEPVIAS 32 (281) Q Consensus 1 M~kigIIAG~G~LP----~~ia~~~~~~g~~~~ii~ 32 (281) |+||||..|+=+=| ..+++++.. =.++.++. T Consensus 1 MkkIgifGGSFDP~H~GHl~i~~~l~~-lD~v~~iP 35 (173) T PRK08887 1 MKKIAVFGSAFNPPSLGHKSVIESLSH-FDLVLLVP 35 (173) T ss_pred CCEEEEEEECCCCCCHHHHHHHHHHHC-CCEEEEEE T ss_conf 948999742889866989999999875-89899997 No 207 >TIGR00624 tag DNA-3-methyladenine glycosylase I; InterPro: IPR004597 One common form of damage to DNA occurs when alkyl groups become covalently linked to DNA, leading to the formation of mutagenic or toxic lesions. Alkylating agents are ubiquitous in the environment and can also exist endogenously; for example S-adenosylmethionine, a commonly used methyl donor in many cellualr pathways, can produce methylation damage. Given the prevalence of alkylating agents it is not surprising that all cellular organisms studied so far have developed repair systems to deal with DNA alkylation . DNA-3-methyladenine glycosylase I (also known as TagI) is part of the broader helix-hairpin-helix DNA repair glycosylase supefamily. It specifically catalyses the release of 3-methylated adenine and, to a lesser extent, guanosine bases from DNA by hydrolysis of the deoxyribose N-glycosidic bond , . A phylogenomic study suggests that TagI evolved in, and is specific to, the bacteria .; GO: 0008725 DNA-3-methyladenine glycosylase I activity, 0006284 base-excision repair. Probab=20.32 E-value=61 Score=13.30 Aligned_cols=19 Identities=32% Similarity=0.309 Sum_probs=14.9 Q ss_pred HHHHHHHHHCCCEEECHHH Q ss_conf 9999999976974821122 Q gi|254780768|r 111 KASIDLLESYGVSVVGAHE 129 (281) Q Consensus 111 ~~i~~~fe~~G~~vi~~~~ 129 (281) .|+.+.+++.||+.+|+|. T Consensus 146 ~Amsk~LKkrGF~FVGpTi 164 (185) T TIGR00624 146 KAMSKELKKRGFRFVGPTI 164 (185) T ss_pred HHHHHHHHHCCCCCCCHHH T ss_conf 9999998756883001689 No 208 >COG1647 Esterase/lipase [General function prediction only] Probab=20.24 E-value=61 Score=13.29 Aligned_cols=122 Identities=11% Similarity=-0.018 Sum_probs=70.1 Q ss_pred ECCCCHHHHHHHHHHHCCCCEEEEEECCCCCC--CCCCCC-EEEEEHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHH Q ss_conf 77980389999999978997899995486673--103885-678738999999999986495406770242321000111 Q gi|254780768|r 8 AGSGMLPYYVAKAARLKNDEPVIASVLNECSF--DWQDFE-CRELPLGDFCVLRSILHQYNIGRIVVAGAIDRRPNVQDL 84 (281) Q Consensus 8 AG~G~LP~~ia~~~~~~g~~~~ii~l~~~~~~--~~~~~~-~~~~~ig~ig~li~~Lk~~~i~~ivmaG~V~krP~~~~l 84 (281) .|+-.=-+..++.+..+|+.+++-.++|+-.. ++-... .-| +..+......|++.++.+|..+|-= --=.| .+ T Consensus 25 TGt~~Dvr~Lgr~L~e~GyTv~aP~ypGHG~~~e~fl~t~~~DW--~~~v~d~Y~~L~~~gy~eI~v~GlS-mGGv~-al 100 (243) T COG1647 25 TGTPRDVRMLGRYLNENGYTVYAPRYPGHGTLPEDFLKTTPRDW--WEDVEDGYRDLKEAGYDEIAVVGLS-MGGVF-AL 100 (243) T ss_pred CCCCHHHHHHHHHHHHCCCEEECCCCCCCCCCHHHHHCCCHHHH--HHHHHHHHHHHHHCCCCEEEEEEEC-CHHHH-HH T ss_conf 89808899999999878955746888999999899850799999--9999999999987579858998512-03699-99 Q ss_pred CCCH-HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC-EEECHHHCCHH Q ss_conf 0260-3578999999875104803589999999997697-48211222524 Q gi|254780768|r 85 CFSI-KDSLRISKMIWQLVSGGNAAILKASIDLLESYGV-SVVGAHEIVPE 133 (281) Q Consensus 85 ~~D~-~~~~~l~k~l~~~~~~gDd~iL~~i~~~fe~~G~-~vi~~~~~l~~ 133 (281) ++.. .+...+..+-.....+.++.++..+.++|++... +-.++..+-.+ T Consensus 101 kla~~~p~K~iv~m~a~~~~k~~~~iie~~l~y~~~~kk~e~k~~e~~~~e 151 (243) T COG1647 101 KLAYHYPPKKIVPMCAPVNVKSWRIIIEGLLEYFRNAKKYEGKDQEQIDKE 151 (243) T ss_pred HHHHHCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 998537955236404776655410345789999998565148898999999 No 209 >pfam00743 FMO-like Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase. Probab=20.19 E-value=61 Score=13.29 Aligned_cols=31 Identities=23% Similarity=0.269 Sum_probs=25.5 Q ss_pred CEEEEEECCCCHHHHHHHHHHHCCCCEEEEEE Q ss_conf 73999977980389999999978997899995 Q gi|254780768|r 2 KRLLIIAGSGMLPYYVAKAARLKNDEPVIASV 33 (281) Q Consensus 2 ~kigIIAG~G~LP~~ia~~~~~~g~~~~ii~l 33 (281) +|+||| |+|.-=+.-++.|...|.+++|+-- T Consensus 2 KrVAII-GAG~SGL~a~K~lle~G~~~~~FE~ 32 (532) T pfam00743 2 KKVAVI-GAGVSGLSSIKCCLEEGLEPTCFER 32 (532) T ss_pred CEEEEE-CCCHHHHHHHHHHHHCCCCEEEEEC T ss_conf 879998-9729999999999877998299977 No 210 >TIGR02275 DHB_AMP_lig 2,3-dihydroxybenzoate-AMP ligase; InterPro: IPR011963 Siderophores are low molecular weight iron-chelating compounds synthesised by many bacteria to aid in the aquisition of this vital trace element . Proteins in this entry are adenylation components of non-ribosomal peptide synthases (NRPSs) involved in the biosynthesis of siderophores. These proteins belong to the AMP-binding family and are mostly thought to activate 2,3-dihydroxybenzoate (DHB) by ligation of AMP from ATP with the release of pyrophosphate. One of these proteins, however, is believed to differ in function, activating salicylate rather than DHB. The crystal structure of 2,3-dihydroxybenzoate-AMP ligase (P40871 from SWISSPROT) from Bacillus subtilis has been examined . This protein is composed of a large N-terminal domain (~420 aa) and a more compact C-terminal domain (~110 aa), with an overall "hammer-and-anvil" fold similar to that of firefly luciferase . The active site is located in a deep compartment located at the interface of the domains, with a p-loop thought to be involved in catalysis located at the entrance to the cavity. Relatively little conformational change was observed during catalysis.; GO: 0008668 (23-dihydroxybenzoyl)adenylate synthase activity, 0019290 siderophore biosynthetic process. Probab=20.02 E-value=62 Score=13.26 Aligned_cols=29 Identities=17% Similarity=0.161 Sum_probs=21.3 Q ss_pred EEECCCEEEE----CHHHHHHHHHHCCCEEEEE Q ss_conf 9973977998----5899999999879689995 Q gi|254780768|r 247 ALEAGKSLVL----EKELVKKHADEAGIFVCGI 275 (281) Q Consensus 247 aiea~~~lil----d~~~~i~~a~~~~i~i~g~ 275 (281) |.-||++++| ....+..+.++|++.++++ T Consensus 251 VF~AGGcVVLa~dPsP~~cFpLIE~HkVt~TAL 283 (534) T TIGR02275 251 VFYAGGCVVLAADPSPTDCFPLIERHKVTVTAL 283 (534) T ss_pred HHHHCCEEEECCCCCHHHHCHHHHHCCCEEEEC T ss_conf 621178679768888212110445547307873 Done!