Query gi|254780579|ref|YP_003064992.1| hypothetical protein CLIBASIA_02330 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 207 No_of_seqs 147 out of 1321 Neff 7.2 Searched_HMMs 39220 Date Sun May 29 23:16:23 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780579.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK10494 hypothetical protein; 100.0 2.3E-35 5.8E-40 236.4 21.1 154 31-189 73-254 (259) 2 pfam02698 DUF218 DUF218 domain 100.0 2.1E-30 5.2E-35 206.0 14.3 139 36-179 1-148 (148) 3 cd06259 YdcF-like YdcF-like. Y 100.0 2.9E-30 7.4E-35 205.1 14.6 137 38-179 1-150 (150) 4 COG1434 Uncharacterized conser 99.9 1.2E-22 3.1E-27 158.5 17.9 142 33-182 61-208 (223) 5 PRK10834 hypothetical protein; 99.8 1.4E-18 3.5E-23 133.7 19.0 143 30-187 38-194 (239) 6 COG2949 SanA Uncharacterized m 99.6 1.2E-13 3E-18 103.4 15.8 144 30-186 50-207 (235) 7 KOG4533 consensus 94.8 0.022 5.7E-07 34.4 2.8 106 50-155 122-239 (317) 8 PRK00179 pgi glucose-6-phospha 94.2 0.45 1.2E-05 26.4 8.9 107 22-133 113-235 (525) 9 cd05015 SIS_PGI_1 Phosphogluco 92.9 0.74 1.9E-05 25.1 9.4 108 35-151 18-137 (158) 10 cd02509 GDP-M1P_Guanylyltransf 92.0 0.97 2.5E-05 24.3 12.4 77 50-131 36-114 (274) 11 cd05005 SIS_PHI Hexulose-6-pho 84.7 3.1 7.8E-05 21.3 7.9 42 91-134 76-117 (179) 12 PRK00973 glucose-6-phosphate i 84.1 3.3 8.4E-05 21.1 8.5 90 35-129 70-178 (454) 13 pfam00342 PGI Phosphoglucose i 83.5 3.5 8.9E-05 21.0 9.1 97 33-134 92-202 (483) 14 COG0836 {ManC} Mannose-1-phosp 81.9 4 0.0001 20.6 7.2 75 60-139 49-125 (333) 15 PRK01581 speE spermidine synth 80.3 4.6 0.00012 20.2 6.2 80 77-158 203-292 (363) 16 pfam10686 DUF2493 Protein of u 79.9 4.7 0.00012 20.1 5.2 51 40-91 5-60 (71) 17 TIGR00364 TIGR00364 exsB prote 77.6 3.5 8.8E-05 21.0 3.8 57 39-95 1-62 (227) 18 cd05013 SIS_RpiR RpiR-like pro 76.1 6.1 0.00016 19.5 7.0 57 91-156 61-117 (139) 19 COG0603 Predicted PP-loop supe 76.0 5.8 0.00015 19.6 4.6 60 37-96 3-63 (222) 20 TIGR02351 thiH thiazole biosyn 74.6 6.7 0.00017 19.2 4.9 57 102-158 105-169 (378) 21 PRK10618 phosphotransfer inter 72.4 7.3 0.00019 19.0 4.4 130 2-139 14-182 (881) 22 PRK00129 upp uracil phosphorib 70.6 8.2 0.00021 18.7 6.6 113 40-155 74-186 (208) 23 TIGR02477 PFKA_PPi diphosphate 70.3 8.3 0.00021 18.6 5.3 122 34-159 72-213 (566) 24 COG1105 FruK Fructose-1-phosph 70.2 8.4 0.00021 18.6 4.8 87 38-127 130-222 (310) 25 PRK09240 thiH thiamine biosynt 69.5 8.7 0.00022 18.5 4.9 53 101-153 103-162 (371) 26 TIGR03127 RuMP_HxlB 6-phospho 69.1 8.9 0.00023 18.5 7.7 43 91-135 73-115 (179) 27 PRK11557 putative DNA-binding 69.0 8.9 0.00023 18.4 7.5 99 48-157 120-237 (282) 28 TIGR01007 eps_fam capsular exo 68.8 9 0.00023 18.4 4.7 87 63-154 99-195 (207) 29 pfam03641 Lysine_decarbox Poss 68.3 9.2 0.00024 18.4 4.8 23 35-57 51-73 (130) 30 pfam02887 PK_C Pyruvate kinase 68.2 9.2 0.00024 18.4 9.3 86 36-131 16-104 (117) 31 cd06378 PBP1_iGluR_NMDA_NR2 N- 67.1 9.7 0.00025 18.2 4.7 42 99-140 115-156 (362) 32 PRK07573 sdhA succinate dehydr 67.0 9.8 0.00025 18.2 5.7 40 27-67 24-63 (638) 33 TIGR00725 TIGR00725 conserved 65.6 10 0.00027 18.0 4.0 36 35-71 97-132 (176) 34 TIGR00674 dapA dihydrodipicoli 64.6 11 0.00028 17.9 5.8 88 67-155 41-135 (288) 35 PRK03868 glucose-6-phosphate i 64.4 11 0.00028 17.9 8.0 107 36-151 57-173 (409) 36 PRK05954 precorrin-8X methylmu 63.3 12 0.00029 17.8 5.4 33 39-71 123-155 (203) 37 COG1611 Predicted Rossmann fol 62.3 12 0.00031 17.7 5.4 36 39-77 16-61 (205) 38 cd00610 OAT_like Acetyl ornith 60.8 13 0.00033 17.5 6.0 27 38-64 97-123 (413) 39 PRK08286 cbiC precorrin-8X met 60.2 13 0.00033 17.4 5.2 12 49-60 62-73 (213) 40 PRK06264 cbiC precorrin-8X met 59.6 13 0.00034 17.4 5.7 11 120-130 128-138 (210) 41 PRK05639 4-aminobutyrate amino 59.5 13 0.00034 17.4 6.5 153 43-206 67-233 (457) 42 TIGR00454 TIGR00454 conserved 59.5 13 0.00032 17.5 3.5 103 49-156 30-144 (204) 43 pfam01630 Glyco_hydro_56 Hyalu 59.3 14 0.00035 17.3 7.4 81 48-128 245-337 (339) 44 PRK12483 threonine dehydratase 57.9 14 0.00037 17.2 6.5 38 36-74 186-223 (521) 45 COG2082 CobH Precorrin isomera 57.8 14 0.00037 17.2 5.7 10 112-121 146-155 (210) 46 cd01545 PBP1_SalR Ligand-bindi 55.5 16 0.0004 16.9 11.0 140 50-193 106-254 (270) 47 TIGR01812 sdhA_frdA_Gneg succi 55.1 16 0.00041 16.9 4.0 63 38-103 1-69 (636) 48 pfam07071 DUF1341 Protein of u 54.6 16 0.00041 16.9 6.7 70 49-123 136-207 (218) 49 cd06273 PBP1_GntR_like_1 This 54.5 16 0.00042 16.8 10.3 95 56-150 111-213 (268) 50 TIGR01361 DAHP_synth_Bsub phos 53.2 17 0.00044 16.7 3.9 41 85-125 126-167 (262) 51 COG1509 KamA Lysine 2,3-aminom 53.1 17 0.00044 16.7 10.2 102 46-152 142-257 (369) 52 pfam04122 CW_binding_2 Putativ 52.6 17 0.00045 16.7 7.8 72 45-126 4-78 (90) 53 cd06298 PBP1_CcpA_like Ligand- 52.2 18 0.00045 16.6 12.7 96 53-150 108-212 (268) 54 PRK11302 DNA-binding transcrip 52.0 18 0.00046 16.6 7.5 81 49-134 117-216 (284) 55 TIGR03581 EF_0839 conserved hy 51.9 18 0.00046 16.6 6.6 70 49-123 136-207 (236) 56 PRK09224 threonine dehydratase 51.6 18 0.00046 16.6 7.7 38 36-74 169-206 (504) 57 cd06294 PBP1_ycjW_transcriptio 51.1 18 0.00047 16.5 10.5 95 56-150 117-218 (270) 58 COG2081 Predicted flavoprotein 51.0 19 0.00047 16.5 4.4 48 40-87 6-65 (408) 59 KOG3339 consensus 50.7 19 0.00048 16.5 7.2 78 1-81 1-85 (211) 60 cd06288 PBP1_sucrose_transcrip 50.1 19 0.00049 16.4 10.0 98 53-150 108-212 (269) 61 COG1737 RpiR Transcriptional r 49.4 20 0.0005 16.3 7.5 55 91-154 178-232 (281) 62 cd06274 PBP1_FruR Ligand bindi 47.1 21 0.00054 16.1 10.1 97 53-150 108-213 (264) 63 cd06271 PBP1_AglR_RafR_like Li 46.1 22 0.00057 16.0 10.8 96 54-150 113-216 (268) 64 PRK05953 precorrin-8X methylmu 45.1 23 0.00059 15.9 5.3 11 50-60 54-64 (208) 65 COG0166 Pgi Glucose-6-phosphat 44.9 23 0.00059 15.9 7.9 94 36-134 78-181 (446) 66 cd06283 PBP1_RegR_EndR_KdgR_li 44.8 23 0.00059 15.9 10.2 97 54-151 109-214 (267) 67 pfam08981 consensus 44.4 24 0.0006 15.9 8.3 28 101-128 8-35 (181) 68 PRK11337 DNA-binding transcrip 44.3 24 0.0006 15.9 7.7 60 92-160 190-249 (293) 69 KOG2094 consensus 44.3 15 0.00038 17.1 1.9 30 121-150 135-171 (490) 70 cd01746 GATase1_CTP_Synthase T 43.5 22 0.00056 16.0 2.7 37 34-70 53-93 (235) 71 PRK08639 threonine dehydratase 42.9 25 0.00063 15.7 7.8 89 36-130 177-268 (418) 72 cd01829 SGNH_hydrolase_peri2 S 42.8 25 0.00064 15.7 3.4 19 49-67 96-114 (200) 73 KOG1201 consensus 42.8 25 0.00064 15.7 11.4 72 37-117 37-111 (300) 74 TIGR02128 G6PI_arch bifunction 42.3 25 0.00065 15.7 6.2 99 35-153 25-128 (338) 75 pfam04820 Trp_halogenase Trypt 42.2 25 0.00065 15.7 5.2 52 40-93 2-62 (457) 76 PRK06382 threonine dehydratase 41.4 26 0.00067 15.6 7.7 101 34-140 165-268 (400) 77 PRK09221 beta alanine--pyruvat 41.2 26 0.00067 15.6 5.8 85 44-131 64-153 (445) 78 PRK06186 hypothetical protein; 40.5 27 0.00069 15.5 3.3 36 33-68 50-89 (229) 79 TIGR01373 soxB sarcosine oxida 40.1 27 0.0007 15.5 3.5 35 30-65 24-58 (407) 80 PRK05575 cbiC precorrin-8X met 40.0 28 0.0007 15.5 5.3 11 50-60 58-68 (204) 81 PRK03501 ppnK inorganic polyph 39.9 28 0.0007 15.4 4.4 28 30-58 33-60 (264) 82 cd06285 PBP1_LacI_like_7 Ligan 39.5 28 0.00072 15.4 9.9 98 53-150 106-210 (265) 83 cd04605 CBS_pair_MET2_assoc Th 38.6 29 0.00074 15.3 6.9 81 48-128 11-95 (110) 84 pfam06637 PV-1 PV-1 protein (P 38.1 29 0.00075 15.3 2.9 27 1-27 25-51 (442) 85 PRK06411 NADH dehydrogenase su 38.0 30 0.00076 15.3 3.2 40 30-69 74-114 (195) 86 COG2240 PdxK Pyridoxal/pyridox 37.9 30 0.00076 15.2 2.8 39 90-130 145-183 (281) 87 TIGR00337 PyrG CTP synthase; I 36.8 31 0.00079 15.1 3.3 136 48-196 126-285 (571) 88 COG2143 Thioredoxin-related pr 36.0 32 0.00081 15.1 5.3 86 1-86 2-114 (182) 89 COG3962 Acetolactate synthase 36.0 32 0.00081 15.1 5.0 44 49-92 218-261 (617) 90 PRK08198 threonine dehydratase 35.2 33 0.00084 15.0 9.0 101 35-141 171-274 (406) 91 KOG3153 consensus 35.1 33 0.00084 15.0 3.2 13 106-118 107-119 (250) 92 cd06284 PBP1_LacI_like_6 Ligan 34.4 34 0.00086 14.9 10.3 96 55-151 109-212 (267) 93 cd06290 PBP1_LacI_like_9 Ligan 34.0 34 0.00088 14.9 10.1 98 53-150 107-211 (265) 94 TIGR02529 EutJ ethanolamine ut 33.1 36 0.00091 14.8 3.8 69 74-149 155-225 (240) 95 COG1058 CinA Predicted nucleot 33.0 36 0.00091 14.8 4.9 31 39-70 39-69 (255) 96 cd06299 PBP1_LacI_like_13 Liga 33.0 36 0.00091 14.8 12.0 96 53-151 108-211 (265) 97 KOG1276 consensus 32.4 36 0.00093 14.7 5.0 88 37-132 11-102 (491) 98 TIGR02247 HAD-1A3-hyp Epoxide 32.3 33 0.00084 15.0 2.1 36 97-133 100-135 (228) 99 PRK07334 threonine dehydratase 32.2 37 0.00094 14.7 9.0 101 34-140 166-267 (399) 100 PRK04761 ppnK inorganic polyph 32.0 37 0.00095 14.7 4.4 34 30-64 19-52 (246) 101 cd06267 PBP1_LacI_sugar_bindin 31.5 38 0.00096 14.6 11.6 134 55-194 110-253 (264) 102 TIGR02152 D_ribokin_bact ribok 31.3 38 0.00097 14.6 8.4 29 97-126 195-223 (303) 103 PRK08526 threonine dehydratase 31.3 38 0.00097 14.6 7.4 95 35-136 167-264 (403) 104 COG0848 ExbD Biopolymer transp 31.1 38 0.00098 14.6 5.9 57 74-130 79-136 (137) 105 KOG3085 consensus 30.8 39 0.00099 14.5 5.1 38 99-139 110-147 (237) 106 cd01994 Alpha_ANH_like_IV This 30.7 39 0.00099 14.5 11.2 103 39-141 2-113 (194) 107 TIGR00957 MRP_assoc_pro multi 30.6 27 0.0007 15.5 1.5 15 33-47 827-841 (1542) 108 TIGR02685 pter_reduc_Leis pter 30.4 32 0.00081 15.1 1.8 50 42-96 5-57 (283) 109 pfam05044 Prox1 Homeobox prosp 29.9 31 0.0008 15.1 1.7 45 127-175 751-806 (905) 110 pfam00205 TPP_enzyme_M Thiamin 29.7 41 0.001 14.4 3.9 41 51-91 2-42 (138) 111 TIGR01825 gly_Cac_T_rel pyrido 29.6 41 0.001 14.4 4.8 129 38-178 239-384 (392) 112 TIGR01923 menE O-succinylbenzo 29.4 8 0.0002 18.7 -1.4 47 102-148 173-219 (490) 113 PRK10294 6-phosphofructokinase 29.3 41 0.0011 14.4 5.0 13 50-62 42-54 (309) 114 PRK09162 hypoxanthine-guanine 28.9 42 0.0011 14.3 4.7 88 39-129 44-135 (181) 115 COG1234 ElaC Metal-dependent h 28.7 42 0.0011 14.3 5.2 91 52-144 180-278 (292) 116 cd06296 PBP1_CatR_like Ligand- 28.5 43 0.0011 14.3 10.1 95 55-150 111-213 (270) 117 TIGR01816 sdhA_forward succina 28.4 43 0.0011 14.3 3.0 16 37-52 196-211 (615) 118 PRK11543 gutQ D-arabinose 5-ph 28.3 43 0.0011 14.3 7.5 41 92-134 91-131 (321) 119 PRK12595 bifunctional 3-deoxy- 28.2 42 0.0011 14.3 2.1 35 94-128 228-262 (360) 120 PRK10270 hypothetical protein; 28.1 43 0.0011 14.2 7.5 45 1-46 1-48 (340) 121 cd05006 SIS_GmhA Phosphoheptos 27.8 44 0.0011 14.2 4.5 60 71-132 82-141 (177) 122 cd04619 CBS_pair_6 The CBS dom 27.5 44 0.0011 14.2 7.2 80 49-128 11-99 (114) 123 cd01992 PP-ATPase N-terminal d 27.5 44 0.0011 14.2 5.1 86 39-124 2-102 (185) 124 PRK10703 DNA-binding transcrip 27.3 45 0.0011 14.2 10.6 95 55-150 172-274 (335) 125 PRK10892 D-arabinose 5-phospha 27.0 45 0.0012 14.1 7.4 83 49-134 35-136 (326) 126 COG2086 FixA Electron transfer 26.9 45 0.0012 14.1 6.3 69 49-120 42-113 (260) 127 pfam06508 ExsB ExsB. This fami 26.9 45 0.0012 14.1 5.2 53 39-91 2-55 (137) 128 PRK06931 diaminobutyrate--2-ox 26.7 46 0.0012 14.1 6.7 28 103-132 128-155 (454) 129 PRK11269 glyoxylate carboligas 26.7 46 0.0012 14.1 5.4 49 44-92 187-235 (591) 130 COG2102 Predicted ATPases of P 26.5 46 0.0012 14.1 11.2 106 38-148 2-116 (223) 131 COG2876 AroA 3-deoxy-D-arabino 26.3 47 0.0012 14.1 4.4 41 86-126 147-187 (286) 132 PRK13508 tagatose-6-phosphate 26.0 47 0.0012 14.0 4.4 23 103-126 199-221 (309) 133 PRK02231 ppnK inorganic polyph 25.6 48 0.0012 14.0 3.8 34 34-68 40-73 (272) 134 pfam07509 DUF1523 Protein of u 25.6 27 0.0007 15.5 0.7 43 1-53 1-43 (174) 135 cd01575 PBP1_GntR Ligand-bindi 25.5 48 0.0012 14.0 10.9 84 56-140 111-198 (268) 136 cd06295 PBP1_CelR Ligand bindi 25.4 48 0.0012 14.0 10.1 95 56-151 120-222 (275) 137 PRK10833 putative assembly pro 25.4 48 0.0012 13.9 2.7 23 1-23 1-23 (617) 138 cd06277 PBP1_LacI_like_1 Ligan 25.2 49 0.0012 13.9 12.0 92 56-150 113-212 (268) 139 cd01471 vWA_micronemal_protein 25.1 49 0.0012 13.9 7.4 54 99-154 83-147 (186) 140 PRK07233 hypothetical protein; 25.0 49 0.0013 13.9 3.4 10 43-52 192-201 (430) 141 pfam10742 DUF2555 Protein of u 24.8 46 0.0012 14.1 1.8 41 100-140 7-47 (57) 142 cd06282 PBP1_GntR_like_2 Ligan 24.5 50 0.0013 13.8 10.3 96 53-150 108-211 (266) 143 cd01574 PBP1_LacI Ligand-bindi 24.0 52 0.0013 13.8 9.4 95 53-150 108-209 (264) 144 cd06270 PBP1_GalS_like Ligand 23.3 53 0.0014 13.7 10.8 95 55-150 110-212 (268) 145 PRK04759 consensus 23.2 53 0.0014 13.7 3.6 34 34-68 62-95 (294) 146 KOG0404 consensus 23.2 53 0.0014 13.7 7.3 45 100-149 166-212 (322) 147 PRK00561 ppnK inorganic polyph 22.9 54 0.0014 13.7 5.2 34 33-67 30-63 (259) 148 TIGR02733 desat_CrtD C-3',4' d 22.5 55 0.0014 13.6 3.3 26 161-186 387-413 (499) 149 cd06275 PBP1_PurR Ligand-bindi 22.4 55 0.0014 13.6 12.8 95 55-150 111-213 (269) 150 COG1148 HdrA Heterodisulfide r 22.4 55 0.0014 13.6 4.6 32 36-67 123-154 (622) 151 COG0028 IlvB Thiamine pyrophos 22.3 56 0.0014 13.6 4.5 46 48-93 188-233 (550) 152 PTZ00300 pyruvate kinase; Prov 22.2 56 0.0014 13.6 8.0 28 112-139 342-369 (454) 153 pfam06057 VirJ Bacterial virul 22.0 56 0.0014 13.5 5.0 67 64-130 5-78 (192) 154 cd01995 ExsB ExsB is a transcr 21.7 57 0.0015 13.5 4.6 77 39-124 2-82 (169) 155 PRK07482 hypothetical protein; 21.7 57 0.0015 13.5 6.0 150 46-206 67-231 (461) 156 TIGR01318 gltD_gamma_fam gluta 21.7 44 0.0011 14.2 1.2 18 38-55 293-313 (480) 157 TIGR01466 cobJ_cbiH precorrin- 21.5 58 0.0015 13.5 4.7 113 40-154 47-166 (254) 158 TIGR01457 HAD-SF-IIA-hyp2 HAD- 21.2 59 0.0015 13.4 7.4 72 46-125 15-90 (251) 159 cd04615 CBS_pair_2 The CBS dom 20.9 59 0.0015 13.4 5.8 81 49-129 11-99 (113) 160 cd06332 PBP1_aromatic_compound 20.9 59 0.0015 13.4 8.8 45 100-144 115-160 (333) 161 PRK05192 tRNA uridine 5-carbox 20.9 60 0.0015 13.4 4.0 33 34-67 4-36 (621) 162 PRK13360 omega amino acid--pyr 20.8 60 0.0015 13.4 6.8 54 78-131 92-149 (441) 163 PRK05259 consensus 20.7 60 0.0015 13.4 6.1 98 40-141 161-260 (310) 164 PRK06427 phosphomethylpyrimidi 20.6 60 0.0015 13.4 7.5 91 36-127 73-178 (266) 165 cd05014 SIS_Kpsf KpsF-like pro 20.4 61 0.0016 13.3 5.4 44 90-135 47-90 (128) 166 TIGR03309 matur_yqeB selenium- 20.3 49 0.0013 13.9 1.2 72 40-116 1-72 (256) 167 PRK08674 bifunctional phosphog 20.3 61 0.0016 13.3 7.2 87 33-133 30-119 (328) 168 TIGR02867 spore_II_P stage II 20.2 61 0.0016 13.3 2.7 30 87-116 184-214 (221) 169 cd06291 PBP1_Qymf_like Ligand 20.2 61 0.0016 13.3 10.6 95 55-150 106-208 (265) 170 PRK11267 biopolymer transport 20.2 62 0.0016 13.3 6.0 57 72-128 79-136 (141) 171 TIGR02168 SMC_prok_B chromosom 20.1 62 0.0016 13.3 4.5 40 37-77 1096-1139(1191) No 1 >PRK10494 hypothetical protein; Provisional Probab=100.00 E-value=2.3e-35 Score=236.37 Aligned_cols=154 Identities=20% Similarity=0.129 Sum_probs=131.1 Q ss_pred CCCCCCCCEEEEECCCH-----------------HHHHHHHHHHHHCCCCCEECCCCC---CCCCHHHHHHHHHHHCCCC Q ss_conf 76545687899946985-----------------799999999996899824515876---5467899999876515886 Q gi|254780579|r 31 IPDHPSVSAIVVLTGEP-----------------IRIERAFELLENQIGEKIFISGVH---HSVSKDILLQKIPIRQDLA 90 (207) Q Consensus 31 ~~~~~~~DaIVVLgGg~-----------------~Ri~~a~~L~~~g~~~~ii~SG~~---~~~~~~~~~~~~~~~~~i~ 90 (207) .+..+++|+|||||||. +|+.+|++|||+++..+|++||+. +..+|++.++..+.+.|++ T Consensus 73 ~~~~~~~d~IVVLGGg~~~~~~~~~~~~~~~~a~~Rl~~g~~L~r~~~~~~li~SGG~~~~~~~sea~~~~~~~~~lGv~ 152 (259) T PRK10494 73 WNGSQKVDYIVVLGGGYTWNPQWAPSSNLINNSLPRLNEGIRLWRANPGAKLIFTGGVAKTNTVSTAEVGARVAQSLGVP 152 (259) T ss_pred CCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCC T ss_conf 77777778799926851158876764444464899999999999838998399968878889998999999999983999 Q ss_pred HHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCC--CCCCCC---CCC Q ss_conf 133220466666025699999999864998189851144219999999986299879996416766--675421---169 Q gi|254780579|r 91 ECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISHD--LEENSS---IFK 165 (207) Q Consensus 91 ~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~~--~~~~~W---w~~ 165 (207) +++|++|++|+||+|||.++++++.+ ++++||||++||+||...|++. |++++|+|++... ...+.| +.+ T Consensus 153 ~~~I~~e~~srnT~EnA~~~~~ll~~---~~~lLVTSA~HMpRA~~~F~~~--G~~viP~P~d~~~~~~~~~~~~~~lP~ 227 (259) T PRK10494 153 REDIITLDLPKDTEEEAAAVKQAIGD---APFLLVTSASHLPRAMIFFQQE--GLNPLPAPANQLAIDSPLNPWERAIPS 227 (259) T ss_pred HHHEEECCCCCCHHHHHHHHHHHHCC---CCEEEECCHHHHHHHHHHHHHC--CCEEEEECCCCCCCCCCCCCHHHCCCC T ss_conf 89915256777889999999998488---9789983733538999999987--992243046552257888708762998 Q ss_pred HH---HHHHHHHHHHHHHHHHHHHHHC Q ss_conf 89---9999999999999999987605 Q gi|254780579|r 166 IK---ILRVLLIEYLKILLLSIQLSLS 189 (207) Q Consensus 166 ~~---~~~~~~~Ey~K~l~~~lr~~i~ 189 (207) .. ....+++||+++++|++|+.-+ T Consensus 228 ~~aL~~s~~aihE~lG~lwy~lrG~~~ 254 (259) T PRK10494 228 PVWLMHSDRAGYETLGRIWQWLKGSSG 254 (259) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 799999899999999999998727789 No 2 >pfam02698 DUF218 DUF218 domain. This large family of proteins contains several highly conserved charged amino acids, suggesting this may be an enzymatic domain (Bateman A pers. obs). The family includes SanA, which is involved in Vancomycin resistance. This protein may be involved in murein synthesis. Probab=99.97 E-value=2.1e-30 Score=206.02 Aligned_cols=139 Identities=26% Similarity=0.350 Sum_probs=116.4 Q ss_pred CCCEEEEECCC----HHHHHHHHHHHHHCCCCCEECCCCCC---CCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHH Q ss_conf 68789994698----57999999999968998245158765---467899999876515886133220466666025699 Q gi|254780579|r 36 SVSAIVVLTGE----PIRIERAFELLENQIGEKIFISGVHH---SVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQ 108 (207) Q Consensus 36 ~~DaIVVLgGg----~~Ri~~a~~L~~~g~~~~ii~SG~~~---~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~ 108 (207) ++|+||||||| .+|+++|++||++|++++|++||+.. ..+|++.|++++.+.|+++++|++|++|+||+|||+ T Consensus 1 ~~d~IVVlG~~~~~~~~R~~~a~~l~~~g~~~~ii~sGg~~~~~~~~ea~~~~~~l~~~gvp~~~I~~e~~s~nT~ena~ 80 (148) T pfam02698 1 PADAIVVLGGGSPALAARLDAAAELYRAGPAPRIIVSGGAGGGEPVSEAEVMRRYLVELGVPAEAILLEPQSRNTYENAR 80 (148) T ss_pred CCCEEEECCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHH T ss_conf 99889989999877999999999999809998899848988888878999999999986989999774556778355199 Q ss_pred HHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCC--CCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 999999864998189851144219999999986299879996416766--67542116989999999999999 Q gi|254780579|r 109 EASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISHD--LEENSSIFKIKILRVLLIEYLKI 179 (207) Q Consensus 109 ~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~~--~~~~~Ww~~~~~~~~~~~Ey~K~ 179 (207) +++++++++++++++||||+|||+||.++|++.+++ ..++++.... .....||. ..+.+++||+|+ T Consensus 81 ~~~~~~~~~~~~~v~lVTs~~H~~Ra~~~f~~~~~~--~~~~~~~~~~~~~~~~~~~~---~~~~~~~E~~~~ 148 (148) T pfam02698 81 FSAELLRERGLRRVLLVTSAFHMRRALLLFRRAGPE--VVPVPADYPTPLPPRALWLR---LARAALREYLGL 148 (148) T ss_pred HHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCE--EEEEECCCCCCCCCCHHHHH---HHHHHHHHHHCC T ss_conf 999999976997599989888999999999980990--59986798888887256888---889999999578 No 3 >cd06259 YdcF-like YdcF-like. YdcF-like is a large family of mainly bacterial proteins, with a few members found in fungi, plants, and archaea. Escherichia coli YdcF has been shown to bind S-adenosyl-L-methionine (AdoMet), but a biochemical function has not been idenitified. The family also includes Escherichia coli sanA and Salmonella typhimurium sfiX, which are involved in vancomycin resistance; sfiX may also be involved in murein synthesis. Probab=99.97 E-value=2.9e-30 Score=205.10 Aligned_cols=137 Identities=29% Similarity=0.412 Sum_probs=124.1 Q ss_pred CEEEEECCCHH----------HHHHHHHHHHHCCCCCEECCCCCC---CCCHHHHHHHHHHHCCCCHHHEECCCCCCCHH Q ss_conf 78999469857----------999999999968998245158765---46789999987651588613322046666602 Q gi|254780579|r 38 SAIVVLTGEPI----------RIERAFELLENQIGEKIFISGVHH---SVSKDILLQKIPIRQDLAECCIDIGYKALNTE 104 (207) Q Consensus 38 DaIVVLgGg~~----------Ri~~a~~L~~~g~~~~ii~SG~~~---~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ 104 (207) |+|||||||.. |+++|++||++|++++|++||+.. ..+|++.|++++.+.|++.++|++|++|+||+ T Consensus 1 daIvVLG~~~~~~~~~~~~~~R~~~a~~L~~~~~~~~ii~sGg~~~~~~~~Ea~~~~~~l~~~gv~~~~i~~e~~s~~T~ 80 (150) T cd06259 1 DAIVVLGGGVNGDGPSPILAERLDAAAELYRAGPAPKLIVSGGQGPGEGYSEAEAMARYLIELGVPAEAILLEDRSTNTY 80 (150) T ss_pred CEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCHHHHHHHHHHHCCCCHHHEEECCCCCCCH T ss_conf 98998999879999897999999999999981999889995887999988999999999998599989968313477702 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 569999999986499818985114421999999998629987999641676667542116989999999999999 Q gi|254780579|r 105 GNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISHDLEENSSIFKIKILRVLLIEYLKI 179 (207) Q Consensus 105 ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~~~~~~~Ww~~~~~~~~~~~Ey~K~ 179 (207) |||.+++++++++++++++||||+|||+||.++|++.+++..++++|+....... +....+.+++||+|+ T Consensus 81 ena~~~~~~~~~~~~~~i~lVTs~~H~~Ra~~~~~~~~~~~~~~~~p~~~~~~~~-----~~~~~~~~~~E~~~~ 150 (150) T cd06259 81 ENARFSAELLRERGIRSVLLVTSAYHMPRALLIFRKAGLDVEVVPAPTDFYSLSS-----ALRLLRLALREYLGL 150 (150) T ss_pred HHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCC-----HHHHHHHHHHHHHCC T ss_conf 4599999999976998799999888999999999980998459868877613600-----466678999998569 No 4 >COG1434 Uncharacterized conserved protein [Function unknown] Probab=99.91 E-value=1.2e-22 Score=158.47 Aligned_cols=142 Identities=26% Similarity=0.218 Sum_probs=112.0 Q ss_pred CCCCCCEEEEECCCH------HHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHH Q ss_conf 545687899946985------79999999999689982451587654678999998765158861332204666660256 Q gi|254780579|r 33 DHPSVSAIVVLTGEP------IRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGN 106 (207) Q Consensus 33 ~~~~~DaIVVLgGg~------~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~en 106 (207) .+...|+||++||+. .|+.++..+++..+.+.+..||+... ....+.+....|++.++|+.|++|+||+|| T Consensus 61 ~~~~~~~iv~~Gg~~~~~~~~~rl~~~~~~~~~~~~~~v~~s~~~~~---~~~~~~~~~~~gv~~~~i~~e~~s~~T~eN 137 (223) T COG1434 61 LPGLADAIVVLGGGSRLTDHLIRLLEAARLAKILPISGVLESGGVIE---IQATRRYLENLGVPAERIILEDRSRNTVEN 137 (223) T ss_pred CCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH---HHHHHHHHHHCCCCHHHEEEECCCCCHHHH T ss_conf 54568889984887543589999999999999724367066765214---778889999859987782420777768999 Q ss_pred HHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 9999999986499818985114421999999998629987999641676667542116989999999999999999 Q gi|254780579|r 107 AQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISHDLEENSSIFKIKILRVLLIEYLKILLL 182 (207) Q Consensus 107 a~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~~~~~~~Ww~~~~~~~~~~~Ey~K~l~~ 182 (207) |.++++++++++++++++|||+|||+||...|++.++++ +|+|+...... |............+|.+.... T Consensus 138 a~~s~~~l~~~~~~~~ilVTs~~Hm~Ra~~~~~~~g~~~--~p~p~~~~~~~---~~~~~~~~~~~~~~~~~~~~~ 208 (223) T COG1434 138 ARFSRRLLRTQGPESVILVTSPYHMPRALLLFRKLGISV--IPYPVGFLDRN---ILRLPLSLLALAEALLKSLRA 208 (223) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCC--CCCCCCCCCCC---CHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999739837999889862899999999828988--54542345532---102055551567999999999 No 5 >PRK10834 hypothetical protein; Provisional Probab=99.83 E-value=1.4e-18 Score=133.66 Aligned_cols=143 Identities=19% Similarity=0.221 Sum_probs=119.4 Q ss_pred CCCCCCCCCEEEEECCC------------HHHHHHHHHHHHHCCCCCEECCCCC--CCCCHHHHHHHHHHHCCCCHHHEE Q ss_conf 67654568789994698------------5799999999996899824515876--546789999987651588613322 Q gi|254780579|r 30 HIPDHPSVSAIVVLTGE------------PIRIERAFELLENQIGEKIFISGVH--HSVSKDILLQKIPIRQDLAECCID 95 (207) Q Consensus 30 ~~~~~~~~DaIVVLgGg------------~~Ri~~a~~L~~~g~~~~ii~SG~~--~~~~~~~~~~~~~~~~~i~~~~I~ 95 (207) ..++.|..++.+|||-+ .+|++.|++||++|+.++|++||.+ ....|...|+.++.+.|+|++.|. T Consensus 38 ~i~~vP~~~valVLGtak~~~~G~pn~~~~~RldaA~~LY~~GKv~~iLvSGDn~~~~YnEp~~Mk~~Li~~GVP~e~I~ 117 (239) T PRK10834 38 ELQDLPYRQVGVVLGTAKYYRTGVINQYYRYRIQGAINAYNSGKVNYLLLSGDNALQSYNEPMTMRKDLIAAGVDPADIV 117 (239) T ss_pred CHHHCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCCCCHHHHHHHHHHCCCCHHHEE T ss_conf 87477987569994576467899819899999999999998699748986689998889828999999998599899950 Q ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 04666660256999999998649981898511442199999999862998799964167666754211698999999999 Q gi|254780579|r 96 IGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISHDLEENSSIFKIKILRVLLIE 175 (207) Q Consensus 96 ~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~~~~~~~Ww~~~~~~~~~~~E 175 (207) +|...-+|++....+++.. +.+++++||..||++||..+.++. |++.+.+.+++.. .. ++.-++| T Consensus 118 ~D~AGfrT~DS~vRAk~VF---~~~~~iIVTQ~FH~~RAlfiAr~~--GidA~g~~a~~~~----~~------~k~r~RE 182 (239) T PRK10834 118 LDYAGFRTLDSIVRTRKVF---DTNDFIIITQRFHCERALFIALHM--GIQAQCYAVPSPK----NM------LSVRVRE 182 (239) T ss_pred CCCCCCCHHHHHHHHHHHH---CCCCEEEEECCHHHHHHHHHHHHC--CCCEEEEECCCCC----CC------CHHHHHH T ss_conf 5667654899999999980---998289994601389999999974--9836997168865----44------1105999 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999876 Q gi|254780579|r 176 YLKILLLSIQLS 187 (207) Q Consensus 176 y~K~l~~~lr~~ 187 (207) ++.=+...+-.. T Consensus 183 ~~AR~ka~~Dl~ 194 (239) T PRK10834 183 FAARFGALADLY 194 (239) T ss_pred HHHHHHHHHHHH T ss_conf 999998885654 No 6 >COG2949 SanA Uncharacterized membrane protein [Function unknown] Probab=99.60 E-value=1.2e-13 Score=103.44 Aligned_cols=144 Identities=20% Similarity=0.229 Sum_probs=115.0 Q ss_pred CCCCCCCCCEEEEECCC------------HHHHHHHHHHHHHCCCCCEECCCCCCC--CCHHHHHHHHHHHCCCCHHHEE Q ss_conf 67654568789994698------------579999999999689982451587654--6789999987651588613322 Q gi|254780579|r 30 HIPDHPSVSAIVVLTGE------------PIRIERAFELLENQIGEKIFISGVHHS--VSKDILLQKIPIRQDLAECCID 95 (207) Q Consensus 30 ~~~~~~~~DaIVVLgGg------------~~Ri~~a~~L~~~g~~~~ii~SG~~~~--~~~~~~~~~~~~~~~i~~~~I~ 95 (207) ...+.|.-.+-||||-+ .+|++.|++||++|+..++++||++.. ..|..-|++.+...|+|...|. T Consensus 50 ~~~~lP~r~vgvVLGtsky~~~g~~N~yy~~Ri~aA~~ly~~gKV~~LLlSGDN~~~sYnEp~tM~kdL~~~GVp~~~i~ 129 (235) T COG2949 50 DIQDLPARQVGVVLGTSKYLAKGPPNRYYTYRIDAAIALYKAGKVNYLLLSGDNATVSYNEPRTMRKDLIAAGVPAKNIF 129 (235) T ss_pred CHHHCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCHHHEE T ss_conf 85458862048997223002479955768999999999986477049998168775346545899999998399888903 Q ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 04666660256999999998649981898511442199999999862998799964167666754211698999999999 Q gi|254780579|r 96 IGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISHDLEENSSIFKIKILRVLLIE 175 (207) Q Consensus 96 ~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~~~~~~~Ww~~~~~~~~~~~E 175 (207) +|...-+|+|....+++.. |-++++++|..||..||.++.+.. |++-+...+++... ..++..-++| T Consensus 130 lDyAGFrTLDSvvRA~kVF---~~~~ftIItQ~FHceRAlfiA~~~--gIdAic~~ap~p~~--------~~~~~vrlRE 196 (235) T COG2949 130 LDYAGFRTLDSVVRARKVF---GTNDFTIITQRFHCERALFIARQM--GIDAICFAAPDPEG--------RSGLSVRLRE 196 (235) T ss_pred ECCCCCCHHHHHHHHHHHC---CCCCEEEEECCCCCHHHHHHHHHH--CCCEEEECCCCCCC--------CCCCEEHHHH T ss_conf 2003741789899899870---767279982012348899999982--97547865899444--------6770207999 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999987 Q gi|254780579|r 176 YLKILLLSIQL 186 (207) Q Consensus 176 y~K~l~~~lr~ 186 (207) .+.=+-..+-. T Consensus 197 ~~ARv~Av~D~ 207 (235) T COG2949 197 FLARVKAVLDL 207 (235) T ss_pred HHHHHHHHHHH T ss_conf 99888787410 No 7 >KOG4533 consensus Probab=94.84 E-value=0.022 Score=34.36 Aligned_cols=106 Identities=14% Similarity=0.029 Sum_probs=74.6 Q ss_pred HHHHHHHHHHCCCCCEECCCCCC-----CCCHHHHHHHHHHHCCCC--HHHEECCCCCCCHHHHHHHHHHHHHHC---CC Q ss_conf 99999999968998245158765-----467899999876515886--133220466666025699999999864---99 Q gi|254780579|r 50 IERAFELLENQIGEKIFISGVHH-----SVSKDILLQKIPIRQDLA--ECCIDIGYKALNTEGNAQEASAWAEKN---NF 119 (207) Q Consensus 50 i~~a~~L~~~g~~~~ii~SG~~~-----~~~~~~~~~~~~~~~~i~--~~~I~~e~~s~~T~ena~~~~~~l~~~---~~ 119 (207) +..|+++..+.....+++||+.. ..++++..=...+.-|-. ..+-..|.-|++.+||..++.--..+- -- T Consensus 122 ~~~gid~~~~DdeslLlfsgg~tr~~agp~seaqsyy~~s~~~~knev~sr~~TEEfarDSfeNllfSv~RF~Evt~~yP 201 (317) T KOG4533 122 SNDGIDVPNFDDESLLLFSGGKTRLSAGPVSEAQSYYGGSITFGKNEVRSRALTEEFARDSFENLLFSVYRFEEVTKKYP 201 (317) T ss_pred CCCCCCCCCCCCCEEEEEECCCCCCCCCCCCHHHHHEEEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 11466666778621355307631257788516443111245525215655441687867668766664341775751378 Q ss_pred CEEEEECCHHHHHHHHHHHHHHC--CCCEEEEEECCCC Q ss_conf 81898511442199999999862--9987999641676 Q gi|254780579|r 120 HHVLIVTHDYHMPRTFLELQRIN--STVQFIPYPIISH 155 (207) Q Consensus 120 ~~iiLVTs~yHm~RA~~~f~~~~--p~i~i~~~pv~~~ 155 (207) ++|.+|+-+|.|+|=.-.-++++ |+-+++...++++ T Consensus 202 QkITvvsfdFK~~RF~~lHrkAi~fPes~f~yiGidP~ 239 (317) T KOG4533 202 QKITVVSFDFKMPRFISLHRKAIDFPESNFTYIGIDPK 239 (317) T ss_pred CCEEEEEEECCCHHHHHHHHHHCCCCCCCEEEEEECCC T ss_conf 61389985116547777678654796101389830798 No 8 >PRK00179 pgi glucose-6-phosphate isomerase; Reviewed Probab=94.18 E-value=0.45 Score=26.38 Aligned_cols=107 Identities=16% Similarity=0.145 Sum_probs=66.9 Q ss_pred HHHHHCCCC--CCCCCCCCEEEEEC-CCH----HHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHE Q ss_conf 999721346--76545687899946-985----79999999999689982451587654678999998765158861332 Q gi|254780579|r 22 FIRYVKQMH--IPDHPSVSAIVVLT-GEP----IRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCI 94 (207) Q Consensus 22 f~~~~~~~~--~~~~~~~DaIVVLg-Gg~----~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I 94 (207) |...+.... ....++...||..| ||. .=+..|++=+..+.-...++|++++.. ....+......+..+ T Consensus 113 f~~~i~sG~~~g~tgk~i~~VV~IGIGGS~LGp~~~~~AL~~~~~~~~~i~FvsNvD~~~-----l~~~l~~ldpe~TLf 187 (525) T PRK00179 113 FAARIRSGAWKGYTGKDITDVVNIGIGGSDLGPRAVIEALRPYFDNLLRVHFVSNVDPTY-----IAEVLKKLDPKTTLF 187 (525) T ss_pred HHHHHHCCCEECCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHHH-----HHHHHHHCCHHHEEE T ss_conf 999997598117889833438996656601769999999986466984179967888689-----999997289223389 Q ss_pred ECCCCCCCHHH---HHHHHHHHHHHCC-----C-CEEEEECCHHHHHH Q ss_conf 20466666025---6999999998649-----9-81898511442199 Q gi|254780579|r 95 DIGYKALNTEG---NAQEASAWAEKNN-----F-HHVLIVTHDYHMPR 133 (207) Q Consensus 95 ~~e~~s~~T~e---na~~~~~~l~~~~-----~-~~iiLVTs~yHm~R 133 (207) ++-.+|-+|.| |+..+++|+.+++ + ++++.||++---.. T Consensus 188 iv~SKSftT~ETl~N~~~a~~wl~~~~g~~~~~~~h~vavT~~~~~a~ 235 (525) T PRK00179 188 IVISKSGTTTETATNFRIARDWLLKKGGKELEAAKHFVAVTDNAKGAL 235 (525) T ss_pred EEECCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHEEEECCCHHHHH T ss_conf 996698782889999999999999837983436552556428857899 No 9 >cd05015 SIS_PGI_1 Phosphoglucose isomerase (PGI) contains two SIS (Sugar ISomerase) domains. This classification is based on the alignment of the first SIS domain. PGI is a multifunctional enzyme which as an intracellular dimer catalyzes the reversible isomerization of glucose 6-phosphate to fructose 6-phosphate. As an extracellular protein, PGI also has functions equivalent to neuroleukin (NLK), autocrine motility factor (AMF), and maturation factor (MF). Evidence suggests that PGI, NLK, AMF, and MF are closely related or identical. NLK is a neurotrophic growth factor that promotes regeneration and survival of neurons. The dimeric form of NLK has isomerase function, whereas its monomeric form carries out neurotrophic activity. AMF is a cytokine that stimulates cell migration and metastasis. MF mediates the differentiation of human myeloid leukemic HL-60 cells to terminal monocytic cells. Probab=92.90 E-value=0.74 Score=25.06 Aligned_cols=108 Identities=18% Similarity=0.112 Sum_probs=62.8 Q ss_pred CCCCEEEEECC-CHH---H-HHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHH---H Q ss_conf 56878999469-857---9-99999999968998245158765467899999876515886133220466666025---6 Q gi|254780579|r 35 PSVSAIVVLTG-EPI---R-IERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEG---N 106 (207) Q Consensus 35 ~~~DaIVVLgG-g~~---R-i~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~e---n 106 (207) ++.+.||++|- |.. | +..++.=+....-...++++.++. .....+.+....+..+++-.+|-+|.| | T Consensus 18 ~~~~~iv~iGiGGS~LG~k~l~~al~~~~~~~~~i~f~dn~Dp~-----~~~~~l~~l~~~~TlfiviSKSg~T~ETl~~ 92 (158) T cd05015 18 KKITDVVVIGIGGSDLGPRAVYEALKPYFKGGLRLHFVSNVDPD-----DLAELLKKLDPETTLFIVISKSGTTLETLAN 92 (158) T ss_pred CCCCEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEECCCCHH-----HHHHHHHHCCCHHEEEEEECCCCCCHHHHHH T ss_conf 98886999814333899999999976245489569995799989-----9999997279301499997599986899999 Q ss_pred HHHHHHHHHHCCC----CEEEEECCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 9999999986499----81898511442199999999862998799964 Q gi|254780579|r 107 AQEASAWAEKNNF----HHVLIVTHDYHMPRTFLELQRINSTVQFIPYP 151 (207) Q Consensus 107 a~~~~~~l~~~~~----~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~p 151 (207) +....+|+++++. ++++.||++--- +...++.. +.++...| T Consensus 93 ~~~~~~~l~~~~~~~~~~~~vaIT~~~s~--l~~~a~~~--~~~i~~~~ 137 (158) T cd05015 93 ARLAREWLEEAGGDDLAKHFVAITDNGSG--LLKKAGIE--GLNTFEIP 137 (158) T ss_pred HHHHHHHHHHHCCHHHHHHEEEEECCCHH--HHHHHCCC--CCCEEECC T ss_conf 99999999983203556607898059669--99985758--85888899 No 10 >cd02509 GDP-M1P_Guanylyltransferase GDP-M1P_Guanylyltransferase catalyzes the formation of GDP-Mannose. GDP-mannose-1-phosphate guanylyltransferase, also called GDP-mannose pyrophosphorylase (GDP-MP), catalyzes the formation of GDP-Mannose from mannose-1-phosphate and GTP. Mannose is a key monosaccharide for glycosylation of proteins and lipids. GDP-Mannose is the activated donor for mannosylation of various biomolecules. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase and mannose-1-phosphate guanylyltransferase. This CD covers the N-terminal GDP-mannose-1-phosphate guanylyltransferase domain, whereas the isomerase function is located at the C-terminal half. GDP-MP is a member of the nucleotidyltransferase family of enzymes. Probab=91.98 E-value=0.97 Score=24.35 Aligned_cols=77 Identities=17% Similarity=0.193 Sum_probs=49.5 Q ss_pred HHHHHHHHHH--CCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 9999999996--89982451587654678999998765158861332204666660256999999998649981898511 Q gi|254780579|r 50 IERAFELLEN--QIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTH 127 (207) Q Consensus 50 i~~a~~L~~~--g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs 127 (207) ++..++=... +....+++++.. .....++.+.+ ..++.+|++|+.++||.--...+..++.+++-..+++|+. T Consensus 36 lq~T~~R~~~~~~~~~i~IvTn~~----~~~~v~~ql~~-~~~~~~ii~EP~~rnTApAI~laa~~~~~~~~d~~l~v~P 110 (274) T cd02509 36 LQQTLDRLKGLVPPDRILVVTNEE----YRFLVREQLPE-GLPEENIILEPEGRNTAPAIALAALYLAKRDPDAVLLVLP 110 (274) T ss_pred HHHHHHHHHCCCCCCCCEEEECCH----HHHHHHHHHHH-CCCCCCEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 999999874689867809995721----79999999884-3986648967688897999999999999759997799984 Q ss_pred HHHH Q ss_conf 4421 Q gi|254780579|r 128 DYHM 131 (207) Q Consensus 128 ~yHm 131 (207) +=|. T Consensus 111 sDH~ 114 (274) T cd02509 111 SDHL 114 (274) T ss_pred CCCC T ss_conf 7402 No 11 >cd05005 SIS_PHI Hexulose-6-phosphate isomerase (PHI). PHI is a member of the SIS (Sugar ISomerase domain) superfamily. In the ribulose monophosphate pathway of formaldehyde fixation, hexulose-6-phosphate synthase catalyzes the condensation of ribulose-5-phosphate with formadelhyde to become hexulose-6-phosphate, which is then isomerized to fructose-6-phosphate by PHI. Probab=84.75 E-value=3.1 Score=21.27 Aligned_cols=42 Identities=17% Similarity=0.048 Sum_probs=30.2 Q ss_pred HHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHH Q ss_conf 13322046666602569999999986499818985114421999 Q gi|254780579|r 91 ECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRT 134 (207) Q Consensus 91 ~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA 134 (207) .+.++.-..|.+|.+- ....+.++++|. +++.+|+..+.+=| T Consensus 76 ~Dv~I~iS~SG~T~~~-~~~~~~aK~~ga-~iI~IT~~~~S~la 117 (179) T cd05005 76 GDLLIAISGSGETSSV-VNAAEKAKKAGA-KVVLITSNPDSPLA 117 (179) T ss_pred CCEEEEECCCCCCHHH-HHHHHHHHHCCC-EEEEEECCCCCHHH T ss_conf 9999998199995689-999999998799-19999798999789 No 12 >PRK00973 glucose-6-phosphate isomerase; Provisional Probab=84.10 E-value=3.3 Score=21.10 Aligned_cols=90 Identities=21% Similarity=0.200 Sum_probs=55.1 Q ss_pred CCCCEEEEEC-CCHH---H-HHHHHHHHH-------HCCCCCEE-CCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCC Q ss_conf 5687899946-9857---9-999999999-------68998245-15876546789999987651588613322046666 Q gi|254780579|r 35 PSVSAIVVLT-GEPI---R-IERAFELLE-------NQIGEKIF-ISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKAL 101 (207) Q Consensus 35 ~~~DaIVVLg-Gg~~---R-i~~a~~L~~-------~g~~~~ii-~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~ 101 (207) ++.|.+||+| ||.+ | +.+|+.=+. +...+++. ++.+++ ......+......+..+++-.+|- T Consensus 70 k~~~~vV~IGIGGS~LG~~a~~~aL~~~~~n~~~~~~~~~~~i~f~~NvDp-----~~l~~~l~~ld~~~TlfiViSKSg 144 (454) T PRK00973 70 KNFENVVVLGIGGSALGNLALHTALLPLNYNELSKEERNGYRIFVLDNVDP-----EKTASILDVIDPKKTLFNVISKSG 144 (454) T ss_pred CCCCEEEEECCCHHHHHHHHHHHHHHHHCCCCCCHHCCCCCEEEEEECCCH-----HHHHHHHHHCCCCCEEEEEEECCC T ss_conf 789879995435367899999999500112310011168975999808998-----999999974793307999984897 Q ss_pred CHHHHH---HHHHHHHHHCCC---CEEEEECCHH Q ss_conf 602569---999999986499---8189851144 Q gi|254780579|r 102 NTEGNA---QEASAWAEKNNF---HHVLIVTHDY 129 (207) Q Consensus 102 ~T~ena---~~~~~~l~~~~~---~~iiLVTs~y 129 (207) +|.|.+ ...++|+++.|. ++++.||++- T Consensus 145 tT~ETl~n~~~~r~~l~~~g~~~~~h~vaiT~~~ 178 (454) T PRK00973 145 NTAETLANYLIVRGLLEKYGLDPKKHLVFTTDPE 178 (454) T ss_pred CCHHHHHHHHHHHHHHHHCCCCHHHCEEEECCCC T ss_conf 9689999999999999964956765657752675 No 13 >pfam00342 PGI Phosphoglucose isomerase. Phosphoglucose isomerase catalyses the interconversion of glucose-6-phosphate and fructose-6-phosphate. Probab=83.53 E-value=3.5 Score=20.95 Aligned_cols=97 Identities=14% Similarity=0.217 Sum_probs=63.7 Q ss_pred CCCCCCEEEEEC-CCH----HHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHH- Q ss_conf 545687899946-985----79999999999689982451587654678999998765158861332204666660256- Q gi|254780579|r 33 DHPSVSAIVVLT-GEP----IRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGN- 106 (207) Q Consensus 33 ~~~~~DaIVVLg-Gg~----~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~en- 106 (207) ..++.+-||++| ||. .=+.+|++=+..+.-...++|+.++.. ....+......+.-+++-.+|-+|.|. T Consensus 92 ~g~~~~~vV~IGIGGS~LGp~~~~~AL~~~~~~~~~~~FvsN~Dp~~-----~~~~l~~ld~~~TlfiV~SKSf~T~ETl 166 (483) T pfam00342 92 TGKAITDVVNIGIGGSDLGPRMVIEALKHYSENALIVFFVSNVDGTH-----IAEVLKKLNPETTLINVASKTFTTAETM 166 (483) T ss_pred CCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECCCCHHH-----HHHHHHCCCCCCEEEEEECCCCCCHHHH T ss_conf 89823438995656500769999999865336982289956898789-----9999960892025999977887847799 Q ss_pred --HHHHHHHHHHCCC------CEEEEECCHHHHHHH Q ss_conf --9999999986499------818985114421999 Q gi|254780579|r 107 --AQEASAWAEKNNF------HHVLIVTHDYHMPRT 134 (207) Q Consensus 107 --a~~~~~~l~~~~~------~~iiLVTs~yHm~RA 134 (207) +...++|+.+++. ++++.||++-.-.+. T Consensus 167 ~n~~~~~~wl~~~~~~~~~~~~h~vavT~~~~~a~~ 202 (483) T pfam00342 167 TNARSAREWLLKKLGEKSAVAKHFVALSTNAKKVEK 202 (483) T ss_pred HHHHHHHHHHHHHCCCHHHHHCEEEEECCCHHHHHH T ss_conf 999999999998349855341148996288899997 No 14 >COG0836 {ManC} Mannose-1-phosphate guanylyltransferase [Cell envelope biogenesis, outer membrane] Probab=81.91 E-value=4 Score=20.57 Aligned_cols=75 Identities=13% Similarity=0.048 Sum_probs=46.1 Q ss_pred CCCCCEECCCCCCCCCHHHHHHHHHHH--CCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHH Q ss_conf 899824515876546789999987651--588613322046666602569999999986499818985114421999999 Q gi|254780579|r 60 QIGEKIFISGVHHSVSKDILLQKIPIR--QDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLE 137 (207) Q Consensus 60 g~~~~ii~SG~~~~~~~~~~~~~~~~~--~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~ 137 (207) +....+++++... ....++.+.+ .+.+.. |++|+.++||.--.-.++..+.++.-..+++|..+=|.=+---. T Consensus 49 ~~~~~~vVtne~~----~f~v~eql~e~~~~~~~~-illEP~gRnTApAIA~aa~~~~~~~~d~~~lVlpsDH~I~d~~a 123 (333) T COG0836 49 DIEEPLVVTNEKY----RFIVKEQLPEIDIENAAG-IILEPEGRNTAPAIALAALSATAEGGDALVLVLPSDHVIADEEA 123 (333) T ss_pred CCCCEEEEECHHH----HHHHHHHHHHHHHCCCCC-EEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCEECCHHH T ss_conf 8566099957799----999999866410125660-68545787749999999999987589838999447556556899 Q ss_pred HH Q ss_conf 99 Q gi|254780579|r 138 LQ 139 (207) Q Consensus 138 f~ 139 (207) |. T Consensus 124 f~ 125 (333) T COG0836 124 FL 125 (333) T ss_pred HH T ss_conf 99 No 15 >PRK01581 speE spermidine synthase; Validated Probab=80.28 E-value=4.6 Score=20.23 Aligned_cols=80 Identities=10% Similarity=-0.034 Sum_probs=44.8 Q ss_pred HHHHHHHHHHCCCCHHHEEC-CCCCCCHHHHHHHHHHH-------HHHCCCCEEEEECCHHHHHHHHHHHHHHCC--CCE Q ss_conf 99999876515886133220-46666602569999999-------986499818985114421999999998629--987 Q gi|254780579|r 77 DILLQKIPIRQDLAECCIDI-GYKALNTEGNAQEASAW-------AEKNNFHHVLIVTHDYHMPRTFLELQRINS--TVQ 146 (207) Q Consensus 77 ~~~~~~~~~~~~i~~~~I~~-e~~s~~T~ena~~~~~~-------l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p--~i~ 146 (207) ++++.......... +.|++ -+...|.--+-.++.+. |...| .-++=-||+||.|+|.|...+.+. |.. T Consensus 203 ~DAf~wL~~~~~~F-DvIIVDlPDP~n~~L~KLYS~eFY~Ll~~~La~dG-~~vVQSTSPyfApkaFWsI~~Tl~aAGl~ 280 (363) T PRK01581 203 CDAKEFLSSPSSLY-DVIIIDFPDPATELLSTLYTSELFARIATFLTEDG-AFVCQSNSPADAPLVYWSIGNTIEHAGLT 280 (363) T ss_pred CCHHHHHHCCCCCC-CEEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCCC-EEEEEECCCCCCCCCEEEHHHHHHHCCCC T ss_conf 10899986167544-28999589998624666735999999998619885-39996079765762336777779873675 Q ss_pred EEEEECCCCCCC Q ss_conf 999641676667 Q gi|254780579|r 147 FIPYPIISHDLE 158 (207) Q Consensus 147 i~~~pv~~~~~~ 158 (207) +.||.+.-+.|. T Consensus 281 t~pYHv~VPSFG 292 (363) T PRK01581 281 VKSYHTIVPSFG 292 (363) T ss_pred CCCCEECCCCCC T ss_conf 413420688886 No 16 >pfam10686 DUF2493 Protein of unknown function (DUF2493). Members of this family are all Proteobacteria. The function is not known. Probab=79.86 E-value=4.7 Score=20.14 Aligned_cols=51 Identities=12% Similarity=0.174 Sum_probs=37.7 Q ss_pred EEEECCCH-----HHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCH Q ss_conf 99946985-----79999999999689982451587654678999998765158861 Q gi|254780579|r 40 IVVLTGEP-----IRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAE 91 (207) Q Consensus 40 IVVLgGg~-----~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~ 91 (207) .|+.+||+ +++..+++-++......++++|+.+.. ...++..++.+.+++. T Consensus 5 ~V~~~Ggrd~~D~~~i~~~Ld~~~~~~pd~vlihGG~~kG-ad~lA~~WA~~~gv~~ 60 (71) T pfam10686 5 RVAFSGGRDFNDHRLIWDALDKVHARHPDMVLLHGGAPKG-AERIAARWARRRGVPQ 60 (71) T ss_pred EEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCC-HHHHHHHHHHHCCCCE T ss_conf 8999808986509999999999998689879997798633-7999999999869976 No 17 >TIGR00364 TIGR00364 exsB protein; InterPro: IPR004479 This protein family is represented by a single member in nearly every completed large (> 1000 genes) prokaryotic genome. In Sinorhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA , . In Arthrobacter viscosus, the homologous gene is designated alu1 and is associated with an aluminum tolerance phenotype. When expressed in Escherichia coli, it conferred aliminium tolerance . The entry also contains the gene queC, which is responsible for the conversion of GTP to 7-cyano-7-deazaguanine (preQ0). The biosynthesis of hypermodified tRNA nucleoside queuosine only occurs in eubacteria. It occupies the wobble position for all known tRNAs that are specific for Asp, Asn, His or Tyr .. Probab=77.63 E-value=3.5 Score=20.96 Aligned_cols=57 Identities=21% Similarity=0.117 Sum_probs=39.7 Q ss_pred EEEEECCCHHHHHHHHHHHHHC-CCCCEECC-C-CCC-CCCHHHHHHHHHHHCCCCHH-HEE Q ss_conf 8999469857999999999968-99824515-8-765-46789999987651588613-322 Q gi|254780579|r 39 AIVVLTGEPIRIERAFELLENQ-IGEKIFIS-G-VHH-SVSKDILLQKIPIRQDLAEC-CID 95 (207) Q Consensus 39 aIVVLgGg~~Ri~~a~~L~~~g-~~~~ii~S-G-~~~-~~~~~~~~~~~~~~~~i~~~-~I~ 95 (207) |||||+||.|=---++-+.++| +-....+| . +.. +..|-+.++.....+|++.. .|. T Consensus 1 Av~~lSGG~DStT~~~~a~~~~GkyeV~a~TF~YGQR~H~~Ele~A~~ia~~Lgi~~~~~~D 62 (227) T TIGR00364 1 AVVVLSGGQDSTTVLLIALDEGGKYEVHAITFDYGQRAHSRELESARKIAEALGIRHHFVID 62 (227) T ss_pred CEEEEECCHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 96874373468999999996179507998546501378999999999999980897078617 No 18 >cd05013 SIS_RpiR RpiR-like protein. RpiR contains a SIS (Sugar ISomerase) domain, which is found in many phosphosugar isomerases and phosphosugar binding proteins. In E. coli, rpiR negatively regulates the expression of rpiB gene. Both rpiB and rpiA are ribose phosphate isomerases that catalyze the reversible reactions of ribose 5-phosphate into ribulose 5-phosphate. Probab=76.07 E-value=6.1 Score=19.46 Aligned_cols=57 Identities=14% Similarity=0.054 Sum_probs=35.2 Q ss_pred HHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCC Q ss_conf 133220466666025699999999864998189851144219999999986299879996416766 Q gi|254780579|r 91 ECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISHD 156 (207) Q Consensus 91 ~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~~ 156 (207) ++.++.-..+..| .+.....+.++++|. +++.+|+..+.+=+ +.+ |+ ...+|..... T Consensus 61 ~d~~i~iS~sg~~-~~~~~~~~~ak~~g~-~ii~IT~~~~s~l~----~~a--d~-~l~~~~~~~~ 117 (139) T cd05013 61 GDVVIAISFSGET-KETVEAAEIAKERGA-KVIAITDSANSPLA----KLA--DI-VLLVSSEEGD 117 (139) T ss_pred CCEEEEECCCCCC-HHHHHHHHHHHHCCC-EEEEEECCCCCHHH----HHC--CE-EEECCCCCCC T ss_conf 9999997686363-789999999998699-79999799999779----969--99-9982886555 No 19 >COG0603 Predicted PP-loop superfamily ATPase [General function prediction only] Probab=75.97 E-value=5.8 Score=19.58 Aligned_cols=60 Identities=10% Similarity=0.011 Sum_probs=40.3 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCCCEECCC-CCCCCCHHHHHHHHHHHCCCCHHHEEC Q ss_conf 878999469857999999999968998245158-765467899999876515886133220 Q gi|254780579|r 37 VSAIVVLTGEPIRIERAFELLENQIGEKIFISG-VHHSVSKDILLQKIPIRQDLAECCIDI 96 (207) Q Consensus 37 ~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG-~~~~~~~~~~~~~~~~~~~i~~~~I~~ 96 (207) --|||+|+||.+=.-.+....++++.-..+.-. +..+..|-+.++......|++...|.+ T Consensus 3 ~kavvl~SGG~DStt~l~~a~~~~~ev~alsfdYGQrh~~Ele~A~~iak~lgv~~~iid~ 63 (222) T COG0603 3 KKAVVLLSGGLDSTTCLAWAKKEGYEVHALTFDYGQRHRKELEAAKELAKKLGVPHHIIDV 63 (222) T ss_pred CEEEEECCCCHHHHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEECH T ss_conf 3499991688138999999996098899997507898689999999999981998289541 No 20 >TIGR02351 thiH thiazole biosynthesis protein ThiH; InterPro: IPR012726 Members of this protein family are the ThiH proteins involved in thiamine biosynthesis. They are homologues of the BioB protein of biotin biosynthesis. Genes encoding these proteins are generally found in operons with other thiamin biosynthesis genes. ; GO: 0005506 iron ion binding, 0051539 4 iron 4 sulfur cluster binding, 0009228 thiamin biosynthetic process. Probab=74.56 E-value=6.7 Score=19.22 Aligned_cols=57 Identities=14% Similarity=0.205 Sum_probs=42.0 Q ss_pred CHHHHHHHHHHHHHHCC-CCEEEEECCHH--H-----HHHHHHHHHHHCCCCEEEEEECCCCCCC Q ss_conf 60256999999998649-98189851144--2-----1999999998629987999641676667 Q gi|254780579|r 102 NTEGNAQEASAWAEKNN-FHHVLIVTHDY--H-----MPRTFLELQRINSTVQFIPYPIISHDLE 158 (207) Q Consensus 102 ~T~ena~~~~~~l~~~~-~~~iiLVTs~y--H-----m~RA~~~f~~~~p~i~i~~~pv~~~~~~ 158 (207) =|.|+...=.+.+.+.| +++|+|||=-. | ..|+.-+.|.-|+.+.+-.+|...++|+ T Consensus 105 Ln~~Ei~~E~~aI~k~gPf~~iLlvtGE~e~~~g~~Yi~~~~~l~k~~F~~l~iEv~Pl~~eeYk 169 (378) T TIGR02351 105 LNEEEIEREIEAIKKSGPFKEILLVTGESEKAAGVEYIEEAIKLAKEYFSSLAIEVQPLTEEEYK 169 (378) T ss_pred CCHHHHHHHHHHHHHCCCCHHEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCHHHHH T ss_conf 88889999999986207701300111577755883789999999875278544887307704568 No 21 >PRK10618 phosphotransfer intermediate protein in two-component regulatory system with RcsBC; Provisional Probab=72.37 E-value=7.3 Score=18.98 Aligned_cols=130 Identities=16% Similarity=0.138 Sum_probs=67.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEC-CCHHHHHH----HHHHHHHCCCCCEECCCCCCCCCH Q ss_conf 1799999999999999999999972134676545687899946-98579999----999999689982451587654678 Q gi|254780579|r 2 RYFWYGLFVCLIFFIMGFISFIRYVKQMHIPDHPSVSAIVVLT-GEPIRIER----AFELLENQIGEKIFISGVHHSVSK 76 (207) Q Consensus 2 R~~~i~l~~~~~~~~~~~~~f~~~~~~~~~~~~~~~DaIVVLg-Gg~~Ri~~----a~~L~~~g~~~~ii~SG~~~~~~~ 76 (207) |+|+++++++++..+.....+...+..-. ..+++=.+ +-+.|++. +-++|+.-.++.---++ +...+ T Consensus 14 R~F~lfi~lL~~~l~l~~yny~nawl~~K------~~al~~ia~~lq~rId~YRy~t~QIY~~~~~~~~~~~~--~~~~e 85 (881) T PRK10618 14 RFFLLFIIVLLVTMGVMVYSAVNAWLKDK------SYQIVDITHAIQKRIDTYRYVTWQIYDNIAATTSPSSG--EGLQE 85 (881) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC--CCCCC T ss_conf 99999999999999999999999999877------87899999999998877877899999873477789976--77663 Q ss_pred HHHHHH--HHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHH--------------CC-CCEEEEECCH----------- Q ss_conf 999998--7651588613322046666602569999999986--------------49-9818985114----------- Q gi|254780579|r 77 DILLQK--IPIRQDLAECCIDIGYKALNTEGNAQEASAWAEK--------------NN-FHHVLIVTHD----------- 128 (207) Q Consensus 77 ~~~~~~--~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~--------------~~-~~~iiLVTs~----------- 128 (207) ..+... ++++.+...++++++....+|.+-|.....+|+- +| -.+.+|||.- T Consensus 86 trLrpDVy~lek~~~ktDAlIFG~h~~atl~ma~rlS~YLdilWGA~n~~~SMYYLNGqDNsLiLItT~plk~~~~rfKE 165 (881) T PRK10618 86 TRLRPDVYYLEKPRRKTEALIFGSHDSATLEMTQRMSTYLDTLWGAENVPWSMYYLNGQDNSLILISTLPLKDLTSRFKE 165 (881) T ss_pred EEECCCEEEECCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCEEEEECCCCCCCHHHHHH T ss_conf 12127647845788613379965887068999999998898773437775215786079885799974544561555445 Q ss_pred ------HHHHHHHHHHH Q ss_conf ------42199999999 Q gi|254780579|r 129 ------YHMPRTFLELQ 139 (207) Q Consensus 129 ------yHm~RA~~~f~ 139 (207) --.|||-|.-+ T Consensus 166 Sylt~~aEsrRaEMLqQ 182 (881) T PRK10618 166 SYITDIVESRRAEMLQQ 182 (881) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 57777688789999986 No 22 >PRK00129 upp uracil phosphoribosyltransferase; Reviewed Probab=70.63 E-value=8.2 Score=18.66 Aligned_cols=113 Identities=12% Similarity=0.103 Sum_probs=65.8 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 99946985799999999996899824515876546789999987651588613322046666602569999999986499 Q gi|254780579|r 40 IVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNF 119 (207) Q Consensus 40 IVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~ 119 (207) +-||-.|-.=.+...+++-.-..-.|.+.- ++...+.+..-.. ....+.+..+++-.--..|-+.+..+-+.++++|. T Consensus 74 V~ILRAGl~m~~~~~~~~P~a~vg~Igi~R-de~T~~p~~yy~k-LP~~i~~~~VlllDPmlATG~s~~~ai~~L~~~G~ 151 (208) T PRK00129 74 VPILRAGLGMVDGVLKLIPSARVGHVGAYR-DEETLEPVEYLEK-LPSDIDERTVIVVDPMLATGGSMIAAIDLLKKRGA 151 (208) T ss_pred EEEECCHHHHHHHHHHHCCCCCCCEEEEEE-CCCCCCEEEEHHH-CCCCCCCCEEEEECCHHHCCHHHHHHHHHHHHCCC T ss_conf 986165165889999868875400588995-4777770675632-86444668799878556437669999999996699 Q ss_pred CEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 818985114421999999998629987999641676 Q gi|254780579|r 120 HHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISH 155 (207) Q Consensus 120 ~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~ 155 (207) ++|++|+ --=.+-..-.+.+.+|+++++...+++. T Consensus 152 ~~I~~v~-~ias~~Gi~~v~~~~P~v~I~ta~iD~~ 186 (208) T PRK00129 152 KRIKVLC-LVAAPEGIKALEEAHPDVEIYTAAIDEK 186 (208) T ss_pred CEEEEEE-EEECHHHHHHHHHHCCCCEEEEEEECCC T ss_conf 7389999-9964899999998787978999983777 No 23 >TIGR02477 PFKA_PPi diphosphate--fructose-6-phosphate 1-phosphotransferase; InterPro: IPR011183 Diphosphate--fructose-6-phosphate 1-phosphotransferase catalyses the addition of phosphate from diphosphate (PPi) to fructose 6-phosphate to give fructose 1,6-bisphosphate (2.7.1.90 from EC). The enzyme is also known as pyrophosphate-dependent phosphofructokinase. The usage of PPi-dependent enzymes in glycolysis presumably frees up ATP for other processes . IPR012828 from INTERPRO represents the ATP-dependent 6-phosphofructokinase enzyme contained within Phosphofructokinase. This entry contains primarily bacterial, plant alpha, and plant beta sequences. These may be dimeric nonallosteric enzymes (bacteria) or allosteric heterotetramers (plants) . For additional information please see , , , .; GO: 0005524 ATP binding, 0047334 diphosphate-fructose-6-phosphate 1-phosphotransferase activity, 0006096 glycolysis, 0005945 6-phosphofructokinase complex. Probab=70.34 E-value=8.3 Score=18.62 Aligned_cols=122 Identities=21% Similarity=0.280 Sum_probs=51.3 Q ss_pred CCCCCEEEEECCCHH--------HHHHHHHHHHHCCCCCE--ECCCCC-----CCC-CHHHHHHHHHHHCCCCHHHEECC Q ss_conf 456878999469857--------99999999996899824--515876-----546-78999998765158861332204 Q gi|254780579|r 34 HPSVSAIVVLTGEPI--------RIERAFELLENQIGEKI--FISGVH-----HSV-SKDILLQKIPIRQDLAECCIDIG 97 (207) Q Consensus 34 ~~~~DaIVVLgGg~~--------Ri~~a~~L~~~g~~~~i--i~SG~~-----~~~-~~~~~~~~~~~~~~i~~~~I~~e 97 (207) .++-..=|||+||+- =+-.|++..+. ..++ |..|.. ... =.++++..|--.-|++ +|--. T Consensus 72 ~~~l~~GviLSGGqAPGGHNVi~GLFD~lk~~np--~SkLyGF~~Gp~GL~~~~~~elT~~~i~~YRN~GGFD--~iGSG 147 (566) T TIGR02477 72 HQPLKVGVILSGGQAPGGHNVISGLFDALKKLNP--ESKLYGFLGGPEGLLDNNYVELTKELIDEYRNTGGFD--IIGSG 147 (566) T ss_pred CCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCC--CCCCCCCCCCCCCCCCCCEEEECHHHHHCCCCCCCEE--EECCC T ss_conf 6875688987579897711467778999997377--6720010207444315871576378872227887742--00168 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCEEEEE-CCHHHHHHHHH--HHHHHCCCCEEEEEECC-CCCCCC Q ss_conf 6666602569999999986499818985-11442199999--99986299879996416-766675 Q gi|254780579|r 98 YKALNTEGNAQEASAWAEKNNFHHVLIV-THDYHMPRTFL--ELQRINSTVQFIPYPII-SHDLEE 159 (207) Q Consensus 98 ~~s~~T~ena~~~~~~l~~~~~~~iiLV-Ts~yHm~RA~~--~f~~~~p~i~i~~~pv~-~~~~~~ 159 (207) ..=--|.|+...+.+-+++.+++=+++| =|+..+-=|++ .|.+...++.|+-+|.+ +.|++. T Consensus 148 RtKI~T~Eq~~~al~~~k~l~LdgLVIIGGDdSNTnAA~LAEyF~~~~~~t~viGVPKTIDGDLKn 213 (566) T TIGR02477 148 RTKIETEEQFAKALETAKKLKLDGLVIIGGDDSNTNAALLAEYFAKKGLKTQVIGVPKTIDGDLKN 213 (566) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHCC T ss_conf 545568889999999987608964899747986799999999999738992278640254721102 No 24 >COG1105 FruK Fructose-1-phosphate kinase and related fructose-6-phosphate kinase (PfkB) [Carbohydrate transport and metabolism] Probab=70.16 E-value=8.4 Score=18.60 Aligned_cols=87 Identities=20% Similarity=0.211 Sum_probs=38.2 Q ss_pred CEEEEECCC------HHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHH Q ss_conf 789994698------57999999999968998245158765467899999876515886133220466666025699999 Q gi|254780579|r 38 SAIVVLTGE------PIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEAS 111 (207) Q Consensus 38 DaIVVLgGg------~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~ 111 (207) +.+||++|+ .+-..+=+++.++.-.+.++=+. ++...+.--+.-.+++.+..+-..++. ...++.++.+..+ T Consensus 130 ~d~VvlsGSlP~g~~~d~y~~li~~~~~~g~~vilD~S-g~~L~~~L~~~P~lIKPN~~EL~~~~g-~~~~~~~d~~~~a 207 (310) T COG1105 130 DDIVVLSGSLPPGVPPDAYAELIRILRQQGAKVILDTS-GEALLAALEAKPWLIKPNREELEALFG-RELTTLEDVIKAA 207 (310) T ss_pred CCEEEEECCCCCCCCHHHHHHHHHHHHHCCCEEEEECC-HHHHHHHHCCCCCEEECCHHHHHHHHC-CCCCCHHHHHHHH T ss_conf 88999908899999979999999999865983999798-599999870799389458899999968-9889748899999 Q ss_pred HHHHHCCCCEEEEECC Q ss_conf 9998649981898511 Q gi|254780579|r 112 AWAEKNNFHHVLIVTH 127 (207) Q Consensus 112 ~~l~~~~~~~iiLVTs 127 (207) +-+...|+++++ ||- T Consensus 208 ~~l~~~g~~~Vi-VSl 222 (310) T COG1105 208 RELLAEGIENVI-VSL 222 (310) T ss_pred HHHHHCCCCEEE-EEE T ss_conf 999877998899-975 No 25 >PRK09240 thiH thiamine biosynthesis protein ThiH; Reviewed Probab=69.52 E-value=8.7 Score=18.51 Aligned_cols=53 Identities=23% Similarity=0.226 Sum_probs=25.9 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEECCH-------HHHHHHHHHHHHHCCCCEEEEEECC Q ss_conf 6602569999999986499818985114-------4219999999986299879996416 Q gi|254780579|r 101 LNTEGNAQEASAWAEKNNFHHVLIVTHD-------YHMPRTFLELQRINSTVQFIPYPII 153 (207) Q Consensus 101 ~~T~ena~~~~~~l~~~~~~~iiLVTs~-------yHm~RA~~~f~~~~p~i~i~~~pv~ 153 (207) +=|.|+...-.+.+.+.|+++|+|||-. --+.++.-..++.++.+.+-.+|.. T Consensus 103 ~Ls~eEI~~E~~ai~~~G~k~ILLvtGE~~~~~~~~Yi~~~v~~ik~~f~~v~iev~Pl~ 162 (371) T PRK09240 103 TLDEEEIEREMAAIKKLGFEHILLVTGEHEAKVGVDYIRRALPLAREYFSSVAIEVQPLS 162 (371) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 289999999999999769523885405787769889999999999975674079952599 No 26 >TIGR03127 RuMP_HxlB 6-phospho 3-hexuloisomerase. Members of this protein family are 6-phospho 3-hexuloisomerase (PHI), or the PHI domain of a fusion protein. This enzyme is part of the ribulose monophosphate (RuMP) pathway, which in one direction removes the toxic metabolite formaldehyde by assimilation into fructose-6-phosphate. In the other direction, in species lacking a complete pentose phosphate pathway, the RuMP pathway yields ribulose-5-phosphate, necessary for nucleotide biosynthesis, at the cost of also yielding formaldehyde. These latter species tend usually have a formaldehyde-activating enzyme to attach formaldehyde to the C1 carrier tetrahydromethanopterin. Probab=69.14 E-value=8.9 Score=18.46 Aligned_cols=43 Identities=12% Similarity=-0.137 Sum_probs=31.2 Q ss_pred HHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHH Q ss_conf 133220466666025699999999864998189851144219999 Q gi|254780579|r 91 ECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTF 135 (207) Q Consensus 91 ~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~ 135 (207) .+.++.-..|.+|.| .....+.++++|. +++.+|+..+.+=|. T Consensus 73 ~Dv~I~iS~SGeT~e-~~~~~~~aK~~ga-~ii~IT~~~~S~Lak 115 (179) T TIGR03127 73 GDLLIAISGSGETES-LVTVAKKAKEIGA-TVAAITTNPESTLGK 115 (179) T ss_pred CCEEEEECCCCCCHH-HHHHHHHHHHCCC-EEEEEECCCCCHHHH T ss_conf 999999819999689-9999999998799-299997989897799 No 27 >PRK11557 putative DNA-binding transcriptional regulator; Provisional Probab=69.01 E-value=8.9 Score=18.45 Aligned_cols=99 Identities=11% Similarity=-0.031 Sum_probs=52.9 Q ss_pred HHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCC-------------------CCHHHEECCCCCCCHHHHHH Q ss_conf 79999999999689982451587654678999998765158-------------------86133220466666025699 Q gi|254780579|r 48 IRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQD-------------------LAECCIDIGYKALNTEGNAQ 108 (207) Q Consensus 48 ~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~-------------------i~~~~I~~e~~s~~T~ena~ 108 (207) +.+.+++++..+ +.+|.+-|.+....-++.+...+...| -++++++.-..+..|.+ .. T Consensus 120 ~~l~~av~~i~~--A~~I~i~G~G~S~~vA~~~~~kl~rig~~~~~~~d~~~~~~~~~~l~~~Dv~I~iS~sG~t~~-~~ 196 (282) T PRK11557 120 EKLHECVTMLRS--ARRIILTGIGASGLVAQNFAWKLMKIGINAVAERDMHALLATVQALSPDDLLLAISYSGERRE-LN 196 (282) T ss_pred HHHHHHHHHHHC--CCCEEEEECCCHHHHHHHHHHHHHHHCCCEEECCCHHHHHHHHHHCCCCCEEEEEECCCCCHH-HH T ss_conf 999999999964--895899970436999999999999809963740886899999981899998999859999789-99 Q ss_pred HHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 9999998649981898511442199999999862998799964167666 Q gi|254780579|r 109 EASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISHDL 157 (207) Q Consensus 109 ~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~~~ 157 (207) .+.+.++++|. +++.+|+..+.+=| +.+ |+.++ .+.....+ T Consensus 197 ~~~~~Ak~~Ga-~iIaIT~~~~spLa----~~a--D~~L~-~~~~e~~~ 237 (282) T PRK11557 197 LAADEALRVGG-KVLAITGFTPNALQ----QRA--SHCLY-TIAEEQAT 237 (282) T ss_pred HHHHHHHHCCC-EEEEECCCCCCCHH----HHC--CEEEE-ECCCCCCC T ss_conf 99999998799-39997298998158----869--99998-47853101 No 28 >TIGR01007 eps_fam capsular exopolysaccharide family; InterPro: IPR005702 This family describes the capsular exopolysaccharide proteins in bacteria. The exopolysaccharide gene cluster consists of several genes which encode a number of proteins which regulate the exoploysaccharide biosynthesis (EPS). At least 13 genes EpsA to EpsM in streptococcus species seem to direct the EPS proteins and share high homology. ; GO: 0030234 enzyme regulator activity, 0045227 capsule polysaccharide biosynthetic process. Probab=68.78 E-value=9 Score=18.42 Aligned_cols=87 Identities=13% Similarity=0.049 Sum_probs=55.6 Q ss_pred CCEECCC-CCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHH-- Q ss_conf 8245158-7654678---999998765158861332204666660256999999998649981898511442199999-- Q gi|254780579|r 63 EKIFISG-VHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFL-- 136 (207) Q Consensus 63 ~~ii~SG-~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~-- 136 (207) -.++.|| +.|.+++ .+-++.......-.=+-|++|.-.-++.-.|+-.++.+ +..+|||.++...|... T Consensus 99 L~vi~sG~vPPNPt~LL~s~~F~~l~e~~~~~fD~iiiDTPPig~V~DAai~a~~~-----d~~~LV~~A~~~~k~~v~K 173 (207) T TIGR01007 99 LDVITSGPVPPNPTELLQSSNFKTLIETLRKYFDYIIIDTPPIGTVIDAAIIARAV-----DASILVTDAGKIKKREVKK 173 (207) T ss_pred CEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHH-----CCEEEEEECCCCCHHHHHH T ss_conf 72751788787754788889999999998716888999518866678899999872-----9779887225326467899 Q ss_pred ---HHHHHCCC-CEEEEEECCC Q ss_conf ---99986299-8799964167 Q gi|254780579|r 137 ---ELQRINST-VQFIPYPIIS 154 (207) Q Consensus 137 ---~f~~~~p~-i~i~~~pv~~ 154 (207) .+++.+.. +.|+-..++. T Consensus 174 AK~~LEq~G~~~LGvvLNK~d~ 195 (207) T TIGR01007 174 AKEQLEQAGSKFLGVVLNKVDI 195 (207) T ss_pred HHHHHHHHCCCEEEEEECCCCC T ss_conf 9999986178411588888257 No 29 >pfam03641 Lysine_decarbox Possible lysine decarboxylase. The members of this family share a highly conserved motif PGGXGTXXE that is probably functionally important. This family includes proteins annotated as lysine decarboxylases, although the evidence for this is not clear. Probab=68.26 E-value=9.2 Score=18.35 Aligned_cols=23 Identities=35% Similarity=0.420 Sum_probs=16.3 Q ss_pred CCCCEEEEECCCHHHHHHHHHHH Q ss_conf 56878999469857999999999 Q gi|254780579|r 35 PSVSAIVVLTGEPIRIERAFELL 57 (207) Q Consensus 35 ~~~DaIVVLgGg~~Ri~~a~~L~ 57 (207) ..+||.|||-||.+-+++-.+.+ T Consensus 51 ~~sDafI~lPGG~GTLdElfevl 73 (130) T pfam03641 51 RLADAFVALPGGFGTLEELFEIL 73 (130) T ss_pred HHCCEEEEECCCCCHHHHHHHHH T ss_conf 86698999079753299999999 No 30 >pfam02887 PK_C Pyruvate kinase, alpha/beta domain. As well as being found in pyruvate kinase this family is found as an isolated domain in some bacterial proteins. Probab=68.25 E-value=9.2 Score=18.35 Aligned_cols=86 Identities=17% Similarity=0.152 Sum_probs=53.6 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHH Q ss_conf 68789994698579999999999689982451587654678999998765158861332204666660256999999998 Q gi|254780579|r 36 SVSAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAE 115 (207) Q Consensus 36 ~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~ 115 (207) .+.+|||++-.-. .|..+.+.-+..+|+.-- ......+......|+.. +.+++...++.+....+.++++ T Consensus 16 ~akaIv~~T~sG~---tA~~iS~~RP~~pI~a~T-----~~~~~~r~l~l~~GV~p--~~~~~~~~~~~~~i~~a~~~l~ 85 (117) T pfam02887 16 GAKAIVVLTESGR---TARLVSKYRPGAPIIAVT-----PNEGTARRLALYWGVHP--VLGDERSISTDEIIAEALRVAK 85 (117) T ss_pred CCCEEEEECCCCH---HHHHHHHHCCCCCEEEEC-----CCHHHHHHHHHHCCCEE--EEECCCCCCHHHHHHHHHHHHH T ss_conf 9999999889877---999999549998689965-----99899998887659705--9825777999999999999999 Q ss_pred HCCC---CEEEEECCHHHH Q ss_conf 6499---818985114421 Q gi|254780579|r 116 KNNF---HHVLIVTHDYHM 131 (207) Q Consensus 116 ~~~~---~~iiLVTs~yHm 131 (207) ++|+ ...++|++..++ T Consensus 86 ~~g~~~~GD~vVvv~G~p~ 104 (117) T pfam02887 86 DAGLIKKGDLVVVTAGVPV 104 (117) T ss_pred HCCCCCCCCEEEEEECCCC T ss_conf 8699999998999846359 No 31 >cd06378 PBP1_iGluR_NMDA_NR2 N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the NR2 subunit of NMDA receptor family. N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the NR2 subunit of NMDA receptor family. The ionotropic N-methyl-d-asparate (NMDA) subtype of glutamate receptor serves critical functions in neuronal development, functioning, and degeneration in the mammalian central nervous system. The functional NMDA receptor is a heterotetramer composed of two NR1 and two NR2 (A, B, C, and D) or of NR3 (A and B) subunits. The receptor controls a cation channel that is highly permeable to monovalent ions and calcium and exhibits voltage-dependent inhibition by magnesium. Dual agonists, glutamate and glycine, are required for efficient activation of the NMDA receptor. Among NMDA receptor subtypes, the NR2B subunit containing receptors appear particularly important for pain perception; thus NR2B-selective antagonists may be useful in Probab=67.12 E-value=9.7 Score=18.21 Aligned_cols=42 Identities=19% Similarity=0.177 Sum_probs=34.1 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHH Q ss_conf 666602569999999986499818985114421999999998 Q gi|254780579|r 99 KALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQR 140 (207) Q Consensus 99 ~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~ 140 (207) -+-+..+.|.-..++|++++|..+.+|||.++-.|+....-+ T Consensus 115 ~~ps~~hQa~vm~~iLe~y~W~~FsVVtS~~~G~~~Fl~~v~ 156 (362) T cd06378 115 FGPSIEQQAAVMLKIMEEYDWHAFSVVTSRFPGYDDFVSAVR 156 (362) T ss_pred CCCCHHHHHHHHHHHHHHCCCEEEEEEEECCCCHHHHHHHHH T ss_conf 489989999999999997797289999827888799999999 No 32 >PRK07573 sdhA succinate dehydrogenase flavoprotein subunit; Reviewed Probab=67.02 E-value=9.8 Score=18.20 Aligned_cols=40 Identities=28% Similarity=0.172 Sum_probs=29.2 Q ss_pred CCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCEEC Q ss_conf 13467654568789994698579999999999689982451 Q gi|254780579|r 27 KQMHIPDHPSVSAIVVLTGEPIRIERAFELLENQIGEKIFI 67 (207) Q Consensus 27 ~~~~~~~~~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~ 67 (207) ....+++..+.|+|||=+|+.+ +.+|+++.+.|....++. T Consensus 24 ~~~~~~~~~~~DVlVIGsG~AG-L~AAi~aa~~G~~V~vl~ 63 (638) T PRK07573 24 KLVNPANKRKFDIIVVGTGLAG-ASAAATLGELGYNVKVFC 63 (638) T ss_pred CCCCCCCCCCCCEEEECCCHHH-HHHHHHHHHCCCCEEEEE T ss_conf 5568664664888999966999-999999997499569999 No 33 >TIGR00725 TIGR00725 conserved hypothetical protein TIGR00725; InterPro: IPR005268 This family of conserved hypothetical proteins has no known function. . Probab=65.63 E-value=10 Score=18.03 Aligned_cols=36 Identities=14% Similarity=0.061 Sum_probs=23.0 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 5687899946985799999999996899824515876 Q gi|254780579|r 35 PSVSAIVVLTGEPIRIERAFELLENQIGEKIFISGVH 71 (207) Q Consensus 35 ~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~ 71 (207) .++|++|++||+.+-+.+++.-|+- ..|.|+..+-+ T Consensus 97 ~S~d~~~~vGG~~GT~~Ei~~A~~l-~~pvvv~~~tG 132 (176) T TIGR00725 97 ESADVVVSVGGGYGTLIEILIAYEL-GGPVVVLLKTG 132 (176) T ss_pred EECCEEEEEECCHHHHHHHHHHHHC-CCCEEEEECCC T ss_conf 5265899962882589999999715-89899980788 No 34 >TIGR00674 dapA dihydrodipicolinate synthase; InterPro: IPR005263 Dihydropicolinate synthase (DHDPS) is the key enzyme in lysine biosynthesis via the diaminopimelate pathway of prokaryotes, some phycomycetes and higher plants. The enzyme catalyses the condensation of L-aspartate-beta-semialdehyde and pyruvate to dihydropicolinic acid via a ping-pong mechanism in which pyruvate binds to the enzyme by forming a Schiff-base with a lysine residue . Three other proteins are structurally related to DHDPS and probably also act via a similar catalytic mechanism. These are Escherichia coli N-acetylneuraminate lyase (4.1.3.3 from EC, IPR005264 from INTERPRO) (gene nanA), which catalyzes the condensation of N-acetyl-D-mannosamine and pyruvate to form N-acetylneuraminate; Sinorhizobium meliloti protein mosA , which is involved in the biosynthesis of the rhizopine 3-O-methyl-scyllo-inosamine; and E. coli hypothetical protein yjhH. The sequences of DHDPS from different sources are well-conserved. The structure takes the form of a homotetramer, in which 2 monomers are related by an approximate 2-fold symmetry . Each monomer comprises 2 domains: an 8-fold alpha-/beta-barrel, and a C-terminal alpha-helical domain. The fold resembles that of N-acetylneuraminate lyase. The active site lysine is located in the barrel domain, and has access via 2 channels on the C-terminal side of the barrel. This family represents a subclass of dihydrodipicolinate synthase. ; GO: 0008840 dihydrodipicolinate synthase activity, 0019877 diaminopimelate biosynthetic process. Probab=64.60 E-value=11 Score=17.91 Aligned_cols=88 Identities=24% Similarity=0.232 Sum_probs=55.2 Q ss_pred CCCCCCCCCHHHHHHHHHHHCCCCHHH--EECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHH--- Q ss_conf 158765467899999876515886133--220466666025699999999864998189851144219999999986--- Q gi|254780579|r 67 ISGVHHSVSKDILLQKIPIRQDLAECC--IDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRI--- 141 (207) Q Consensus 67 ~SG~~~~~~~~~~~~~~~~~~~i~~~~--I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~--- 141 (207) .||.++-.|..|-.+-...-......| |+-+.-|++| +||....+.+++-|+.=++.||--|-=|-..-.++|. T Consensus 41 TTGEs~TLs~EE~~~~i~~~~~~~~~R~pvIaG~GsN~T-~Eai~l~~~a~~~G~dg~L~vtPyYNKP~q~Gl~~HFkai 119 (288) T TIGR00674 41 TTGESATLSHEEHKKVIEFVVDLVKGRVPVIAGTGSNAT-EEAIELTKFAEKLGVDGFLVVTPYYNKPTQEGLYQHFKAI 119 (288) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCHH-HHHHHHHHHHHHCCCCEEECCCCCCCCCCCCHHHHHHHHH T ss_conf 558864468888899999998776287789853773258-9999999999868956884588755188821389999999 Q ss_pred CC--CCEEEEEECCCC Q ss_conf 29--987999641676 Q gi|254780579|r 142 NS--TVQFIPYPIISH 155 (207) Q Consensus 142 ~p--~i~i~~~pv~~~ 155 (207) .. ++=++-|-|++. T Consensus 120 a~~~~lPiiLYNvPsR 135 (288) T TIGR00674 120 AEEVDLPIILYNVPSR 135 (288) T ss_pred HHHCCCCEEEECCCCC T ss_conf 9871698898428764 No 35 >PRK03868 glucose-6-phosphate isomerase; Provisional Probab=64.42 E-value=11 Score=17.89 Aligned_cols=107 Identities=14% Similarity=0.126 Sum_probs=57.9 Q ss_pred CCCEEEEEC-CCHH---HH-HHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHH Q ss_conf 687899946-9857---99-999999996899824515876546789999987651588613322046666602569999 Q gi|254780579|r 36 SVSAIVVLT-GEPI---RI-ERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEA 110 (207) Q Consensus 36 ~~DaIVVLg-Gg~~---Ri-~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~ 110 (207) .-+-|||+| ||.. |. ..+++-++....+..+++..++. -....+......+..+++-.+|-+|.|..... T Consensus 57 ~~~~vV~IGIGGS~LG~~ai~~aL~~~~~~~~~i~F~~N~Dp~-----~~~~~l~~l~~~~TlfiViSKSgtT~ET~~~~ 131 (409) T PRK03868 57 AIKNIVVIGIGGSSLGVKAIYSMLKNKKQLKRELHFLDNTDPI-----SINKTLKKINLEETLFIVISKSGTTIETISIF 131 (409) T ss_pred CCCEEEEEEECCHHHHHHHHHHHHCCCCCCCCEEEEEECCCHH-----HHHHHHHHCCCCCEEEEEECCCCCCHHHHHHH T ss_conf 4987999825528899999999841204589749998488989-----99999960893517999985999838999999 Q ss_pred HHHHHHCC-----CCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEE Q ss_conf 99998649-----981898511442199999999862998799964 Q gi|254780579|r 111 SAWAEKNN-----FHHVLIVTHDYHMPRTFLELQRINSTVQFIPYP 151 (207) Q Consensus 111 ~~~l~~~~-----~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~p 151 (207) +.+++..+ -++++.||++-- ++..+.+. .++.+.+.| T Consensus 132 ~~~~~~~~~~~~~~~~~i~iTd~~s---~L~~~a~~-~~~~~f~ip 173 (409) T PRK03868 132 KYILEHFKLDKLELKNFLVITDPDS---PLEQFAKE-NGIKSFNIP 173 (409) T ss_pred HHHHHHHCCCCCCCCCEEEEECCCH---HHHHHHHH-CCCEEEECC T ss_conf 9999972455330025799936841---66654442-586087267 No 36 >PRK05954 precorrin-8X methylmutase; Provisional Probab=63.34 E-value=12 Score=17.77 Aligned_cols=33 Identities=21% Similarity=0.240 Sum_probs=12.4 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCCCEECCCCC Q ss_conf 899946985799999999996899824515876 Q gi|254780579|r 39 AIVVLTGEPIRIERAFELLENQIGEKIFISGVH 71 (207) Q Consensus 39 aIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~ 71 (207) +|+|.|-.+-=+.+-++|.++|..++-++-|.. T Consensus 123 aIvvIGNAPTAL~~Lleli~~g~~~PaLVIG~P 155 (203) T PRK05954 123 AIYVIGNAPTALLALCEEIRAGRVKPALVIGVP 155 (203) T ss_pred CEEEEECCHHHHHHHHHHHHCCCCCCCEEEECC T ss_conf 689993876999999999984899986799668 No 37 >COG1611 Predicted Rossmann fold nucleotide-binding protein [General function prediction only] Probab=62.29 E-value=12 Score=17.65 Aligned_cols=36 Identities=31% Similarity=0.304 Sum_probs=17.9 Q ss_pred EEEEECCCH---H---HHHHHHHH----HHHCCCCCEECCCCCCCCCHH Q ss_conf 899946985---7---99999999----996899824515876546789 Q gi|254780579|r 39 AIVVLTGEP---I---RIERAFEL----LENQIGEKIFISGVHHSVSKD 77 (207) Q Consensus 39 aIVVLgGg~---~---Ri~~a~~L----~~~g~~~~ii~SG~~~~~~~~ 77 (207) -|+|.+|+. . =.+.|.+| .++| .++++|++++.=++ T Consensus 16 ~i~V~~gs~~~~~~~~~~~~A~~lg~~la~~g---~~V~tGG~~GiMea 61 (205) T COG1611 16 QIVVICGSARGIEPEEYYELARELGRELAKRG---LLVITGGGPGVMEA 61 (205) T ss_pred EEEEEEECCCCCCCHHHHHHHHHHHHHHHHCC---CEEEECCCHHHHHH T ss_conf 48999838888898899999999999997679---88996986525057 No 38 >cd00610 OAT_like Acetyl ornithine aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to ornithine aminotransferase, acetylornithine aminotransferase, alanine-glyoxylate aminotransferase, dialkylglycine decarboxylase, 4-aminobutyrate aminotransferase, beta-alanine-pyruvate aminotransferase, adenosylmethionine-8-amino-7-oxononanoate aminotransferase, and glutamate-1-semialdehyde 2,1-aminomutase. All the enzymes belonging to this family act on basic amino acids and their derivatives are involved in transamination or decarboxylation. Probab=60.83 E-value=13 Score=17.49 Aligned_cols=27 Identities=19% Similarity=0.305 Sum_probs=11.9 Q ss_pred CEEEEECCCHHHHHHHHHHHHHCCCCC Q ss_conf 789994698579999999999689982 Q gi|254780579|r 38 SAIVVLTGEPIRIERAFELLENQIGEK 64 (207) Q Consensus 38 DaIVVLgGg~~Ri~~a~~L~~~g~~~~ 64 (207) |-+.-..+|.+-.+.|+++.++...++ T Consensus 97 ~~v~f~~sGsEA~e~AiklAr~~t~r~ 123 (413) T cd00610 97 DKVFFVNSGTEAVEAALKLARAYTGRK 123 (413) T ss_pred CEEEEECCHHHHHHHHHHHHHHHCCCC T ss_conf 879981877799999999999973998 No 39 >PRK08286 cbiC precorrin-8X methylmutase; Validated Probab=60.20 E-value=13 Score=17.42 Aligned_cols=12 Identities=25% Similarity=0.457 Sum_probs=7.0 Q ss_pred HHHHHHHHHHHC Q ss_conf 999999999968 Q gi|254780579|r 49 RIERAFELLENQ 60 (207) Q Consensus 49 Ri~~a~~L~~~g 60 (207) =+..|++-++.| T Consensus 62 av~ag~~AL~~g 73 (213) T PRK08286 62 ALEKICDALRNP 73 (213) T ss_pred HHHHHHHHHHCC T ss_conf 999999999779 No 40 >PRK06264 cbiC precorrin-8X methylmutase; Validated Probab=59.64 E-value=13 Score=17.36 Aligned_cols=11 Identities=0% Similarity=0.144 Sum_probs=4.5 Q ss_pred CEEEEECCHHH Q ss_conf 81898511442 Q gi|254780579|r 120 HHVLIVTHDYH 130 (207) Q Consensus 120 ~~iiLVTs~yH 130 (207) ..|++|-++.- T Consensus 128 ~aIvvIGNAPT 138 (210) T PRK06264 128 GGIVVIGNAPT 138 (210) T ss_pred CCEEEECCCHH T ss_conf 97899678779 No 41 >PRK05639 4-aminobutyrate aminotransferase; Provisional Probab=59.54 E-value=13 Score=17.35 Aligned_cols=153 Identities=12% Similarity=0.102 Sum_probs=64.9 Q ss_pred ECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEE Q ss_conf 46985799999999996899824515876546789999987651588613322046666602569999999986499818 Q gi|254780579|r 43 LTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHV 122 (207) Q Consensus 43 LgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~i 122 (207) ||-+..++..|+.=.-+. ... ...+........++++.+..........+.+-+....--|.|...++.... =+.| T Consensus 67 lGH~~p~i~~Ai~~Q~~~-l~~-~~~~~~~~~~~~~lae~L~~~~P~~~~~v~f~~sGSEAvE~AiKlAr~~t~--r~~I 142 (457) T PRK05639 67 TGYSHPKLVKAVKEQVEL-IQH-SMIGYTHSERAIRVAEKLVKISPIKNSKVIFGLSGSDAVDMAIKVSKFSTR--RPWI 142 (457) T ss_pred HCCCCHHHHHHHHHHHHH-CCC-CCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHHCCC--CCEE T ss_conf 617969999999999864-447-657774887999999999975898877599818658999999999864259--8869 Q ss_pred EEECCHHHHHHH--H--H--------HHHHHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 985114421999--9--9--------999862998799964167666754211698999999999999999999876055 Q gi|254780579|r 123 LIVTHDYHMPRT--F--L--------ELQRINSTVQFIPYPIISHDLEENSSIFKIKILRVLLIEYLKILLLSIQLSLST 190 (207) Q Consensus 123 iLVTs~yHm~RA--~--~--------~f~~~~p~i~i~~~pv~~~~~~~~~Ww~~~~~~~~~~~Ey~K~l~~~lr~~i~~ 190 (207) +-.+..||-+=. . . -|....|+....|+|-+.......+-+..++....-+.+| +...+.. T Consensus 143 is~~~~yHG~T~ga~s~~~~~~~~~~~~~~~~~~~~~~p~p~~~r~~~~~~~~~~~~~~~~~~~~~-------le~~i~~ 215 (457) T PRK05639 143 LAFIGAYHGQTLGATSVASFQVSQKRGYSPLMPNVFWLPYPNPYRNIWGINGYEEPNELINRVLDY-------LENYVFS 215 (457) T ss_pred EEEECCCCCCCCCHHCCCCCCCCCCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCHHHHHHHHHHH-------HHHHHHH T ss_conf 999357798651400014687544568888899847817986334533567633789999999999-------9999986 Q ss_pred C--CCCCCCCCCCCCCCC Q ss_conf 6--766541101742025 Q gi|254780579|r 191 Q--TASQFFITLIEEITV 206 (207) Q Consensus 191 ~--~~~~~~~~~~~~~~~ 206 (207) + ..++-.+.++||||. T Consensus 216 ~~~~~~~iAavi~EPiqg 233 (457) T PRK05639 216 HVVPPDEVAALFAEPIQG 233 (457) T ss_pred HCCCCCCEEEEEEECCCC T ss_conf 146889789999827668 No 42 >TIGR00454 TIGR00454 conserved hypothetical protein TIGR00454; InterPro: IPR005245 This entry describes a family of conserved hypothetical proteins with no known function. . Probab=59.51 E-value=13 Score=17.54 Aligned_cols=103 Identities=13% Similarity=0.111 Sum_probs=59.5 Q ss_pred HHHHHHHHHHHCC--CCCEECCCCCCCCCHHHHHHHHHHHCCCCHHH-EECCCCCCCHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 9999999999689--98245158765467899999876515886133-22046666602569999999986499818985 Q gi|254780579|r 49 RIERAFELLENQI--GEKIFISGVHHSVSKDILLQKIPIRQDLAECC-IDIGYKALNTEGNAQEASAWAEKNNFHHVLIV 125 (207) Q Consensus 49 Ri~~a~~L~~~g~--~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~-I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLV 125 (207) =+++.+.-|++-. ...+++|.-....|+.-+. +.+..-.+ +.++...+-=.|...++-..++..=-.-+++| T Consensus 30 lIDhvl~~l~~~~GV~~ii~~TSPhTP~Te~y~~-----~~~~~~~~ivvidasGkGYiEDl~E~~~hlE~~~~EP~lV~ 104 (204) T TIGR00454 30 LIDHVLEALKKAKGVDNIIVVTSPHTPKTEEYVA-----EKYKEYKRIVVIDASGKGYIEDLREVLSHLELAFSEPLLVV 104 (204) T ss_pred HHHHHHHHHHHCCCCCEEEEEECCCCCCCHHHHH-----HCCCEEEEEEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEE T ss_conf 4577778664047875379983777976378873-----05850457889976999602527899998776633985787 Q ss_pred CCHHHHHHHHHHH---H------HHCCCCEEEEEECCCCC Q ss_conf 1144219999999---9------86299879996416766 Q gi|254780579|r 126 THDYHMPRTFLEL---Q------RINSTVQFIPYPIISHD 156 (207) Q Consensus 126 Ts~yHm~RA~~~f---~------~~~p~i~i~~~pv~~~~ 156 (207) +|+-=.-|...+= . +-.||++-....++.+. T Consensus 105 ssDl~~~r~~~id~Ivd~y~~~~~ptPdveAl~V~~p~e~ 144 (204) T TIGR00454 105 SSDLVLVRDKIIDSIVDAYYKIKKPTPDVEALAVVVPDEK 144 (204) T ss_pred ECCHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEEECCCC T ss_conf 4360243367787776432104888887103677524225 No 43 >pfam01630 Glyco_hydro_56 Hyaluronidase. Probab=59.28 E-value=14 Score=17.33 Aligned_cols=81 Identities=16% Similarity=0.104 Sum_probs=35.7 Q ss_pred HHHHHHHHHHHHCCCC---CEEC-------CCCCCCCCHHHHHHHHHHH--CCCCHHHEECCCCCCCHHHHHHHHHHHHH Q ss_conf 7999999999968998---2451-------5876546789999987651--58861332204666660256999999998 Q gi|254780579|r 48 IRIERAFELLENQIGE---KIFI-------SGVHHSVSKDILLQKIPIR--QDLAECCIDIGYKALNTEGNAQEASAWAE 115 (207) Q Consensus 48 ~Ri~~a~~L~~~g~~~---~ii~-------SG~~~~~~~~~~~~~~~~~--~~i~~~~I~~e~~s~~T~ena~~~~~~l~ 115 (207) .|+.+|+++.+.+..+ +++. .......+++++.....+. +|...-.|+-+..-.+|.++....++++. T Consensus 245 ~Rv~EA~Rva~~~~~~~~~PVf~Y~r~~y~~~~~~fLs~~DL~~TigesaalGaaGvViWG~~~~~~s~~~C~~l~~Yl~ 324 (339) T pfam01630 245 HRVKEAIRVSKVGDASHPLPVFVYIRYVFYDRTDTFLSEDDLVSTIGESAALGAAGIVIWGSSSLTNSKESCQHLREYMD 324 (339) T ss_pred HHHHHHHHHHHHCCCCCCCCEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 89999999986137888885587772221477556358899999999999849988999547300168899898999986 Q ss_pred HCCCCEEEEECCH Q ss_conf 6499818985114 Q gi|254780579|r 116 KNNFHHVLIVTHD 128 (207) Q Consensus 116 ~~~~~~iiLVTs~ 128 (207) ..=-.-++-||.+ T Consensus 325 ~~LGPyivNVt~a 337 (339) T pfam01630 325 TTLGPYIVNVTLA 337 (339) T ss_pred CCCCCCEEEEHHH T ss_conf 1766500221051 No 44 >PRK12483 threonine dehydratase; Reviewed Probab=57.93 E-value=14 Score=17.19 Aligned_cols=38 Identities=18% Similarity=0.163 Sum_probs=25.7 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCC Q ss_conf 687899946985799999999996899824515876546 Q gi|254780579|r 36 SVSAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSV 74 (207) Q Consensus 36 ~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~ 74 (207) +.|+|+|.-||-+ +..|+..+-+...|.+=+-|+.+.. T Consensus 186 ~lD~V~vpVGGGG-LiaGia~~~K~~~P~ikVIGVEpe~ 223 (521) T PRK12483 186 PLDAIFVPVGGGG-LIAGIAAYVKYLRPEIKVIGVEPDD 223 (521) T ss_pred CCCEEEECCCCCC-HHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 9888997678741-7889999998619997089999678 No 45 >COG2082 CobH Precorrin isomerase [Coenzyme metabolism] Probab=57.81 E-value=14 Score=17.17 Aligned_cols=10 Identities=10% Similarity=0.156 Sum_probs=3.3 Q ss_pred HHHHHCCCCE Q ss_conf 9998649981 Q gi|254780579|r 112 AWAEKNNFHH 121 (207) Q Consensus 112 ~~l~~~~~~~ 121 (207) +++++++++- T Consensus 146 elie~~~~~p 155 (210) T COG2082 146 ELIEEGGIKP 155 (210) T ss_pred HHHHCCCCCC T ss_conf 9998069998 No 46 >cd01545 PBP1_SalR Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor SalR, a member of the LacI-GalR family of bacterial transcription regulators. The SalR binds to glucose based compound Salicin which is chemically related to aspirin. The ligand-binding of SalR is structurally homologous to the periplasmic sugar-binding domain of ABC-transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand bind Probab=55.50 E-value=16 Score=16.94 Aligned_cols=140 Identities=10% Similarity=0.007 Sum_probs=70.7 Q ss_pred HHHHHH-HHHHCCCCCEECCCCCCCCC---HHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHH-HHHHHHCCCCEEEE Q ss_conf 999999-99968998245158765467---89999987651588613322046666602569999-99998649981898 Q gi|254780579|r 50 IERAFE-LLENQIGEKIFISGVHHSVS---KDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEA-SAWAEKNNFHHVLI 124 (207) Q Consensus 50 i~~a~~-L~~~g~~~~ii~SG~~~~~~---~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~-~~~l~~~~~~~iiL 124 (207) ...+++ |.++|+....+++|...... ..+-.++.+.+.|++.....+.. ...|++.+... .++++.+.-.+.++ T Consensus 106 ~~~a~~~L~~~G~~~I~~i~g~~~~~~~~~R~~Gf~~a~~~~gi~~~~~~i~~-~~~s~~~~~~~~~~ll~~~~~~tai~ 184 (270) T cd01545 106 AREMTRHLIDLGHRRIAFIAGPPDHRASAERLEGYRDALAEAGLPLDPELVAQ-GDFTFESGLEAAEALLALPDRPTAIF 184 (270) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEC-CCCCCHHHHHHHHHHHHCCCCCCEEE T ss_conf 99999999974996599936997773699999999999998399977212202-65310269999999985599985899 Q ss_pred ECCHHHHHHHHHHHHHHC---C-CCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 511442199999999862---9-98799964167666754211698999999999999999999876055676 Q gi|254780579|r 125 VTHDYHMPRTFLELQRIN---S-TVQFIPYPIISHDLEENSSIFKIKILRVLLIEYLKILLLSIQLSLSTQTA 193 (207) Q Consensus 125 VTs~yHm~RA~~~f~~~~---p-~i~i~~~pv~~~~~~~~~Ww~~~~~~~~~~~Ey~K~l~~~lr~~i~~~~~ 193 (207) ++++.....++..+++.+ | ++.++.+- +..+.. ..+....+++.=..|-.+..+-.|-..+.-+.. T Consensus 185 ~~nD~~A~g~~~~l~~~g~~iP~dvsiigfD--~~~~~~-~~~P~LTti~q~~~~~g~~A~~~Ll~~i~~~~~ 254 (270) T cd01545 185 ASNDDMAAGVLAVAHRRGLRVPDDLSVVGFD--DTPIAT-TIWPPLTTVRQPIAEMARAAAELLIARLRGAGP 254 (270) T ss_pred ECCHHHHHHHHHHHHHHCCCCCCCEEEEEEC--CHHHHH-HCCCCCEEEECCHHHHHHHHHHHHHHHHCCCCC T ss_conf 7888999999999998189999986999988--859998-358982499719999999999999999569899 No 47 >TIGR01812 sdhA_frdA_Gneg succinate dehydrogenase or fumarate reductase, flavoprotein subunit; InterPro: IPR014006 Succinate:quinone oxidoreductase (1.3.5.1 from EC) refers collectively to succinate:quinone reductase (SQR, or Complex II) and quinol:fumarate reductase (QFR) . SQR is found in aerobic organisms, and catalyses the oxidation of succinate to fumarate in the citric acid cycle and donates the electrons to quinone in the membrane. QFR can be found in anaerobic cells respiring with fumarate as terminal electron acceptor. SQR and QFR are very similar in composition and structure, despite catalysing opposite reactions in vivo. They are thought to have evolved from a common ancestor, and in Escherichia coli they are capable of functionally replacing each other . Succinate:quinone oxidoreductases consist of a peripheral domain, exposed to the cytoplasm in bacteria and to the matrix in mitochondria, and a membrane-integral anchor domain that spans the membrane (Fig. 1). The peripheral part, which contains the dicarboxylate binding site, is composed of a flavoprotein subunit, with one covalently bound FAD, and an iron-sulphur protein subunit containing three iron-sulphur clusters. The membrane-integral domain functions to anchor the peripheral domain to the membrane and is required for quinone reduction and oxidation. The anchor domain shows the largest variability in composition and primary sequence, being composed either of one large subunit, or two smaller subunits, which may, or may not, contain protoheme groups. This entry represents the flavoprotein subunit found in both the SQR and QFR enzymes. This subunit contains an N-terminal domain which binds the FAD cofactor, a central catalytic domain with an unsual fold, and a C-terminal domain whose role is unclear , , . The dicarboxylate binding site is located between the FAD and catalytic domains.. Probab=55.07 E-value=16 Score=16.90 Aligned_cols=63 Identities=16% Similarity=0.026 Sum_probs=38.1 Q ss_pred CEEEEECCCHHHHHHHHHHHHHCC--CCCEECCCCCCCCCH---HHHHHHHHHH-CCCCHHHEECCCCCCCH Q ss_conf 789994698579999999999689--982451587654678---9999987651-58861332204666660 Q gi|254780579|r 38 SAIVVLTGEPIRIERAFELLENQI--GEKIFISGVHHSVSK---DILLQKIPIR-QDLAECCIDIGYKALNT 103 (207) Q Consensus 38 DaIVVLgGg~~Ri~~a~~L~~~g~--~~~ii~SG~~~~~~~---~~~~~~~~~~-~~i~~~~I~~e~~s~~T 103 (207) |+|||=|||++ +.+|++..++|+ ....++|-+.|-.|. |+=-...... +.-+++. .|-.+-|| T Consensus 1 D~vIVGaGgAG-lRAA~e~a~~gPp~~~~AvisKvyP~RSHtvAAQGG~~Aal~~n~d~~D~--~e~H~yDT 69 (636) T TIGR01812 1 DVVIVGAGGAG-LRAAVEAAKAGPPNLNVAVISKVYPTRSHTVAAQGGIAAALANNVDPDDS--WEWHAYDT 69 (636) T ss_pred CEEEECCCHHH-HHHHHHHHHCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHCCCCCCCCC--CCCCEEEE T ss_conf 97888186689-99999997357796107999556988520153567899995778643207--21230411 No 48 >pfam07071 DUF1341 Protein of unknown function (DUF1341). This family consists of several hypothetical bacterial proteins of around 220 residues in length. The function of this family is unknown. Probab=54.63 E-value=16 Score=16.85 Aligned_cols=70 Identities=16% Similarity=0.107 Sum_probs=43.1 Q ss_pred HHHHHHHHHHH--CCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 99999999996--8998245158765467899999876515886133220466666025699999999864998189 Q gi|254780579|r 49 RIERAFELLEN--QIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVL 123 (207) Q Consensus 49 Ri~~a~~L~~~--g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~ii 123 (207) +++.|++|.+. +.+-+.+--|+-....|-..+...+.+.|+ .+|+..-=..||..+..+.+.+.|+++++ T Consensus 136 ~vetAiaml~dmG~~SiKffPm~Gl~~leE~~avA~aca~~g~-----~lEPTGGIdl~N~~~I~~i~ldaGv~kvi 207 (218) T pfam07071 136 PVETAIAMLKDMGGSSVKFFPMGGLKHLDEYRAVAKACAEHGF-----YLEPTGGIDLDNFEEIVQIALDAGVEKVI 207 (218) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCC-----EECCCCCCCHHHHHHHHHHHHHCCCCEEC T ss_conf 0999999999729880566426761109999999999997693-----66667884677799999999984998543 No 49 >cd06273 PBP1_GntR_like_1 This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors which are highly similar to that of the repressor specific for gluconate (GntR), a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational Probab=54.47 E-value=16 Score=16.84 Aligned_cols=95 Identities=9% Similarity=0.033 Sum_probs=44.3 Q ss_pred HHHHCCCCCEECCCCCCCCCH----HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHH Q ss_conf 999689982451587654678----9999987651588613322046666602569999999986499818985114421 Q gi|254780579|r 56 LLENQIGEKIFISGVHHSVSK----DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHM 131 (207) Q Consensus 56 L~~~g~~~~ii~SG~~~~~~~----~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm 131 (207) |.++|+....+++|.....+. ..-++..+.+.++............+..+-....++++....-...++..++.-. T Consensus 111 L~~~G~~~i~~i~~~~~~~~~~~~R~~gf~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ai~~~nd~~A 190 (268) T cd06273 111 LIALGHRRIAMIFGPTQGNDRARARRAGVRAALAEAGLELPELWQVEAPYSIADGRAALRQLLEQPPRPTAVICGNDVLA 190 (268) T ss_pred HHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHEEECCCCHHHHHHHHHHHHHCCCCCCEEEECCHHHH T ss_conf 98729874799946877777899999999999998099987142031578989999999999847999848997787999 Q ss_pred HHHHHHHHHHC---C-CCEEEEE Q ss_conf 99999999862---9-9879996 Q gi|254780579|r 132 PRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 132 ~RA~~~f~~~~---p-~i~i~~~ 150 (207) ..++..+++.+ | ++.++.+ T Consensus 191 ~g~i~~l~~~g~~iP~disvigf 213 (268) T cd06273 191 LGALYEARRLGLSVPEDLSIVGF 213 (268) T ss_pred HHHHHHHHHCCCCCCCCEEEEEE T ss_conf 99999999639999996799999 No 50 >TIGR01361 DAHP_synth_Bsub phospho-2-dehydro-3-deoxyheptonate aldolase; InterPro: IPR006268 These sequences are one of at least three types of phospho-2-dehydro-3-deoxyheptonate aldolase (DAHP synthase). This enzyme catalyzes the first of 7 steps in the biosynthesis of chorismate, that last common precursor of all three aromatic amino acids and of PABA, ubiquinone and menaquinone. Some members of this family, including an experimentally characterised member from Bacillus subtilis, are bifunctional, with a chorismate mutase domain N-terminal to this region. The member of this family from Synechocystis PCC 6803, CcmA, was shown to be essential for carboxysome formation. However, no other candidate for this enzyme is present in that species, chorismate biosynthesis does occur, other species having this protein lack carboxysomes but appear to make chorismate, and a requirement of CcmA for carboxysome formation does not prohibit a role in chorismate biosynthesis.; GO: 0016832 aldehyde-lyase activity, 0009073 aromatic amino acid family biosynthetic process. Probab=53.16 E-value=17 Score=16.71 Aligned_cols=41 Identities=10% Similarity=-0.080 Sum_probs=25.9 Q ss_pred HHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCC-CCEEEEE Q ss_conf 5158861332204666660256999999998649-9818985 Q gi|254780579|r 85 IRQDLAECCIDIGYKALNTEGNAQEASAWAEKNN-FHHVLIV 125 (207) Q Consensus 85 ~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~-~~~iiLV 125 (207) ..-|-...-|+|-.---||.+|-..+++|+-..| -.+|||+ T Consensus 126 ~~vG~~~KPVLLKRG~~aTi~EwL~AAEYIl~~GsN~~ViLC 167 (262) T TIGR01361 126 KEVGKQGKPVLLKRGMGATIEEWLEAAEYILSEGSNGNVILC 167 (262) T ss_pred HHHHHCCCCEEEECCCCHHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 997223797553077215899999999999846889954899 No 51 >COG1509 KamA Lysine 2,3-aminomutase [Amino acid transport and metabolism] Probab=53.12 E-value=17 Score=16.70 Aligned_cols=102 Identities=18% Similarity=0.145 Sum_probs=52.1 Q ss_pred CHHHHHHHHHHHHHCCCCC-EECCCCCCCC-CHHHHHHHHHHHCCCCHHHEE-CCCCC------CCHHHHHHHHHHHHHH Q ss_conf 8579999999999689982-4515876546-789999987651588613322-04666------6602569999999986 Q gi|254780579|r 46 EPIRIERAFELLENQIGEK-IFISGVHHSV-SKDILLQKIPIRQDLAECCID-IGYKA------LNTEGNAQEASAWAEK 116 (207) Q Consensus 46 g~~Ri~~a~~L~~~g~~~~-ii~SG~~~~~-~~~~~~~~~~~~~~i~~~~I~-~e~~s------~~T~ena~~~~~~l~~ 116 (207) +.++++.|++..++++.-. +++|||++-. +..++......-..++--.|+ +..+. +=| ....+++.+ T Consensus 142 ~~~~~~~al~YIa~hPeI~eVllSGGDPL~ls~~~L~~ll~~L~~IpHv~iiRi~TR~pvv~P~RIt----~~L~~~l~~ 217 (369) T COG1509 142 NKEEWDKALDYIAAHPEIREVLLSGGDPLSLSDKKLEWLLKRLRAIPHVKIIRIGTRLPVVLPQRIT----DELCEILGK 217 (369) T ss_pred CHHHHHHHHHHHHCCCHHHEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEEEEECCCCEECHHHHC----HHHHHHHHC T ss_conf 8899999999997395165177407875636889999999987548964699862467431544406----999998723 Q ss_pred CCCCEEEEECCHHHHHHHHHHHHHHC-----CCCEEEEEEC Q ss_conf 49981898511442199999999862-----9987999641 Q gi|254780579|r 117 NNFHHVLIVTHDYHMPRTFLELQRIN-----STVQFIPYPI 152 (207) Q Consensus 117 ~~~~~iiLVTs~yHm~RA~~~f~~~~-----p~i~i~~~pv 152 (207) - .+++.++|+--|-+----+++.++ -|+.+....| T Consensus 218 ~-~~~v~~~tH~NHp~Eit~e~~~A~~~L~~aGv~l~NQsV 257 (369) T COG1509 218 S-RKPVWLVTHFNHPNEITPEAREACAKLRDAGVPLLNQSV 257 (369) T ss_pred C-CCEEEEECCCCCHHHCCHHHHHHHHHHHHCCCEEECCHH T ss_conf 5-860799803588354689999999999975956532410 No 52 >pfam04122 CW_binding_2 Putative cell wall binding repeat 2. This repeat is found in multiple tandem copies in proteins including amidase enhancers and adhesins. Probab=52.63 E-value=17 Score=16.65 Aligned_cols=72 Identities=11% Similarity=0.155 Sum_probs=39.9 Q ss_pred CCHHHHHHHHHHHHHCC--CCC-EECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCE Q ss_conf 98579999999999689--982-451587654678999998765158861332204666660256999999998649981 Q gi|254780579|r 45 GEPIRIERAFELLENQI--GEK-IFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHH 121 (207) Q Consensus 45 Gg~~Ri~~a~~L~~~g~--~~~-ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~ 121 (207) +|.+|.+.++++.++.. ... ++.+|.+ ...+-....+.-..+.| |++-+. +.. ..+.+.++..+.++ T Consensus 4 ~G~dRyeTa~~va~~~~~~~~~v~la~G~~--~~DaLsa~~lAa~~~~P---Illt~~---~~~--~~~~~~l~~~~~~~ 73 (90) T pfam04122 4 GGKDRYETAAKIAKEFGGKADTVYVANGED--FADALSAAPLAAKNGAP---ILLTGD---SLP--SSTKDYLKSKKIKK 73 (90) T ss_pred CCCCHHHHHHHHHHHHCCCCCEEEEEECCC--CCCHHHHHHHHHHCCCE---EEEECC---CCC--HHHHHHHHHCCCCE T ss_conf 766899999999998577999899995886--43168789999856981---999899---999--99999999769987 Q ss_pred EEEEC Q ss_conf 89851 Q gi|254780579|r 122 VLIVT 126 (207) Q Consensus 122 iiLVT 126 (207) +.+|= T Consensus 74 v~iiG 78 (90) T pfam04122 74 VYIIG 78 (90) T ss_pred EEEEC T ss_conf 99989 No 53 >cd06298 PBP1_CcpA_like Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation. Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation (CCR), a process in which enzymes necessary for the metabolism of alternative sugars are inhibited in the presence of glucose. In gram-positive bacteria, CCR is controlled by HPr, a phosphoenolpyruvate:sugar phsophotrasnferase system (PTS) and a transcriptional regulator CcpA. Moreover, CcpA can regulate sporulation and antibiotic resistance as well as play a role in virulence development of certain pathogens such as the group A streptococcus. The ligand binding domain of CcpA is a member of the LacI-GalR family of bacterial transcription regulators. Probab=52.21 E-value=18 Score=16.61 Aligned_cols=96 Identities=16% Similarity=0.013 Sum_probs=48.9 Q ss_pred HHHHHHHCCCCCEECCCCCCCCCH----HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHH-HHHHHHCCCCEEEEECC Q ss_conf 999999689982451587654678----9999987651588613322046666602569999-99998649981898511 Q gi|254780579|r 53 AFELLENQIGEKIFISGVHHSVSK----DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEA-SAWAEKNNFHHVLIVTH 127 (207) Q Consensus 53 a~~L~~~g~~~~ii~SG~~~~~~~----~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~-~~~l~~~~~~~iiLVTs 127 (207) +-.|+++|+....+++|....... .+-.++.+.+.|++.....+. .+..+.+.+... ..++++.. -..+++++ T Consensus 108 ~~~L~~~G~~~I~~i~~~~~~~~~~~~R~~gf~~a~~~~g~~~~~~~i~-~~~~~~~~~~~~~~~l~~~~~-~~ai~~~n 185 (268) T cd06298 108 TELLIKNGHKKIAFISGPLEDSINGDERLAGYKEALSEANIEFDESLIF-EGDYTYESGYELAEELLEDGK-PTAAFVTD 185 (268) T ss_pred HHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHEE-CCCHHHHHHHHHHHHHHHCCC-CCEEEECC T ss_conf 9999972996699996787776259999999999999849997601200-354037789999999983699-97899668 Q ss_pred HHHHHHHHHHHHHHC---C-CCEEEEE Q ss_conf 442199999999862---9-9879996 Q gi|254780579|r 128 DYHMPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 128 ~yHm~RA~~~f~~~~---p-~i~i~~~ 150 (207) +.-.-.++..+++.+ | ++.++.+ T Consensus 186 D~~A~g~l~~l~~~g~~vP~disvigf 212 (268) T cd06298 186 DELAIGILNAAQDAGLKVPEDFEIIGF 212 (268) T ss_pred HHHHHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 799999999999749999987499997 No 54 >PRK11302 DNA-binding transcriptional regulator HexR; Provisional Probab=51.99 E-value=18 Score=16.59 Aligned_cols=81 Identities=21% Similarity=0.125 Sum_probs=43.2 Q ss_pred HHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHH-------------------HCCCCHHHEECCCCCCCHHHHHHH Q ss_conf 9999999999689982451587654678999998765-------------------158861332204666660256999 Q gi|254780579|r 49 RIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPI-------------------RQDLAECCIDIGYKALNTEGNAQE 109 (207) Q Consensus 49 Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~-------------------~~~i~~~~I~~e~~s~~T~ena~~ 109 (207) -++.++++..+ +.+|++-|.+....-+..+...+. ..--+.++++.-..+..|.+- .. T Consensus 117 ~l~~av~li~~--A~~I~i~G~G~S~~vA~~~~~kl~r~g~~~~~~~d~~~~~~~a~~~~~~Dvvi~iS~sG~t~e~-i~ 193 (284) T PRK11302 117 AINRAVDLLTQ--AKKISFFGLGASAAVAHDAQNKFFRFNVPVVYFDDIVMQRMSCMNSSDGDVVVLISHTGRTKSL-VE 193 (284) T ss_pred HHHHHHHHHHC--CCEEEEEECCCCHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHHHCCCCCEEEEECCCCCCHHH-HH T ss_conf 99999999971--8958999648509999999999984398146428789999999719998889997379998799-99 Q ss_pred HHHHHHHCCCCEEEEECCHHHHHHH Q ss_conf 9999986499818985114421999 Q gi|254780579|r 110 ASAWAEKNNFHHVLIVTHDYHMPRT 134 (207) Q Consensus 110 ~~~~l~~~~~~~iiLVTs~yHm~RA 134 (207) ..+.++++|. ++|.+||+ ..|=| T Consensus 194 ~~~~Ak~~ga-~vIaIT~~-~SpLa 216 (284) T PRK11302 194 LAQLARENGA-TVIGITAA-GSPLA 216 (284) T ss_pred HHHHHHHCCC-CEEEECCC-CCHHH T ss_conf 9999998799-58997799-98148 No 55 >TIGR03581 EF_0839 conserved hypothetical protein EF_0839/AHA_3917. Members of this family of relatively uncommon proteins are found in both Gram-positive (e.g. Enterococcus faecalis) and Gram-negative (e.g. Aeromonas hydrophila) bacteria, as part of a cluster of conserved proteins. The function is unknown. Probab=51.92 E-value=18 Score=16.59 Aligned_cols=70 Identities=17% Similarity=0.129 Sum_probs=45.5 Q ss_pred HHHHHHHHHHH--CCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 99999999996--8998245158765467899999876515886133220466666025699999999864998189 Q gi|254780579|r 49 RIERAFELLEN--QIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVL 123 (207) Q Consensus 49 Ri~~a~~L~~~--g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~ii 123 (207) +++.|++|.+. +.+-+.+--|+-....|-..+...+.+.|+ .+|+..-=..+|..+..+.+.+.|+++++ T Consensus 136 ~vetAiaml~dmG~~SiKffPm~Gl~~leE~~avA~aca~~g~-----~lEPTGGIdl~N~~~I~~i~ldaGv~kvi 207 (236) T TIGR03581 136 PIETAIAMLKDMGGSSVKFFPMGGLKHLEEYAAVAKACAKHGF-----YLEPTGGIDLDNFEEIVQIALDAGVEKVI 207 (236) T ss_pred EHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHCCE-----EECCCCCCCHHHHHHHHHHHHHCCCCEEC T ss_conf 3999999999739882667506773419999999999997691-----56667985677799999999984998543 No 56 >PRK09224 threonine dehydratase; Reviewed Probab=51.59 E-value=18 Score=16.55 Aligned_cols=38 Identities=21% Similarity=0.184 Sum_probs=25.8 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCC Q ss_conf 687899946985799999999996899824515876546 Q gi|254780579|r 36 SVSAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSV 74 (207) Q Consensus 36 ~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~ 74 (207) +.|+|+|.-||-+ +..|+..|-+...|.+=+-|+.+.. T Consensus 169 ~~D~V~vpVGGGG-LiaGia~~~K~~~P~ikVIGVEpe~ 206 (504) T PRK09224 169 PLDAIFVPVGGGG-LIAGVAAYIKQLRPEIKVIGVEPED 206 (504) T ss_pred CCCEEEEECCCCH-HHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 8888999558437-8899999998639987389999568 No 57 >cd06294 PBP1_ycjW_transcription_regulator_like Ligand-binding domain of uncharacterized transcription regulator ycjW which is a member of the LacI-GalR family repressors. This group includes the ligand-binding domain of uncharacterized transcription regulator ycjW which is a member of the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=51.11 E-value=18 Score=16.51 Aligned_cols=95 Identities=12% Similarity=0.077 Sum_probs=47.2 Q ss_pred HHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHH Q ss_conf 999689982451587654678---99999876515886133220466666025699999999864998189851144219 Q gi|254780579|r 56 LLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMP 132 (207) Q Consensus 56 L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~ 132 (207) |+++|+....+++|....... .+-.+..+.+.|++.....+.....+..+-...+.++++...--..+++.++.-.- T Consensus 117 L~~~Ghr~I~~i~~~~~~~~~~~R~~Gf~~al~~~g~~~~~~~~~~~~~~~~~~~~~~~~ll~~~~~~tai~~~nD~~A~ 196 (270) T cd06294 117 LIKLGHKKIAFVGGDLDLEVTQDRLQGYKQALEDHGIPDRNEVIISLDFSEEGGYKALKKLLEQHPRPTAIVATDDLLAL 196 (270) T ss_pred HHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCHHHHHHHHHHHHHCCCCCCEEEECCHHHHH T ss_conf 99769956999818998658999999999999986999773333137766788999999998469998489987749999 Q ss_pred HHHHHHHHHC---C-CCEEEEE Q ss_conf 9999999862---9-9879996 Q gi|254780579|r 133 RTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 133 RA~~~f~~~~---p-~i~i~~~ 150 (207) -++..+++.+ | |+.++.+ T Consensus 197 g~~~al~~~g~~iP~disvvgf 218 (270) T cd06294 197 GVLKVLNELGLKVPEDLSIIGF 218 (270) T ss_pred HHHHHHHHCCCCCCCCEEEEEE T ss_conf 9999999839999998499998 No 58 >COG2081 Predicted flavoproteins [General function prediction only] Probab=51.00 E-value=19 Score=16.50 Aligned_cols=48 Identities=19% Similarity=0.358 Sum_probs=29.9 Q ss_pred EEEECCCHHHHHHHHHHHHHC-----------CCCCEECCCCCCC-CCHHHHHHHHHHHC Q ss_conf 999469857999999999968-----------9982451587654-67899999876515 Q gi|254780579|r 40 IVVLTGEPIRIERAFELLENQ-----------IGEKIFISGVHHS-VSKDILLQKIPIRQ 87 (207) Q Consensus 40 IVVLgGg~~Ri~~a~~L~~~g-----------~~~~ii~SG~~~~-~~~~~~~~~~~~~~ 87 (207) ++|.|||+-=+..|+.+.+.| .+.+|++||++.. .+..+...+++.+. T Consensus 6 viIIGgGpAGlMaA~~aa~~G~~V~lid~~~k~GrKil~sGgGrCN~Tn~~~~~~~ls~~ 65 (408) T COG2081 6 VIIIGGGPAGLMAAISAAKAGRRVLLIDKGPKLGRKILMSGGGRCNFTNSEAPDEFLSRN 65 (408) T ss_pred EEEECCCHHHHHHHHHHHHCCCEEEEEECCCCCCCEEEECCCCCCCCCCCCCHHHHHHHC T ss_conf 899888788999999886469879998058642213685378874332650589999758 No 59 >KOG3339 consensus Probab=50.70 E-value=19 Score=16.47 Aligned_cols=78 Identities=17% Similarity=0.273 Sum_probs=39.2 Q ss_pred CHHHHHHHHHH-HHHHHHHHHHHHHHHCCCCCCC-CCCCCEEEEECCCHH-----HHHHHHHHHHHCCCCCEECCCCCCC Q ss_conf 91799999999-9999999999999721346765-456878999469857-----9999999999689982451587654 Q gi|254780579|r 1 MRYFWYGLFVC-LIFFIMGFISFIRYVKQMHIPD-HPSVSAIVVLTGEPI-----RIERAFELLENQIGEKIFISGVHHS 73 (207) Q Consensus 1 mR~~~i~l~~~-~~~~~~~~~~f~~~~~~~~~~~-~~~~DaIVVLgGg~~-----Ri~~a~~L~~~g~~~~ii~SG~~~~ 73 (207) |..|.+..+++ .+..+..+..|.........++ .++..-.||||.|.. |+..+. ++-+.++-.+-|..+. T Consensus 1 ~~~~~l~~lvig~~~~li~~~v~~~~~~~~k~~Pk~~s~~~lVvlGSGGHT~EMlrLl~~l---~~~y~~r~yI~a~tD~ 77 (211) T KOG3339 1 MKTFVLAALVIGYVIVLIALLVFRHTQRSTKKDPKDKSLSTLVVLGSGGHTGEMLRLLEAL---QDLYSPRSYIAADTDE 77 (211) T ss_pred CCEEEHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEEEEEECCCCCHHHHHHHHHHH---HHHCCCEEEEEECCCH T ss_conf 9244099999999999999999999985157897678610899986897689999999988---7531853799965835 Q ss_pred CCHHHHHH Q ss_conf 67899999 Q gi|254780579|r 74 VSKDILLQ 81 (207) Q Consensus 74 ~~~~~~~~ 81 (207) .++..+.. T Consensus 78 mS~~k~~~ 85 (211) T KOG3339 78 MSEQKARS 85 (211) T ss_pred HHHHHHHH T ss_conf 42899873 No 60 >cd06288 PBP1_sucrose_transcription_regulator Ligand-binding domain of DNA-binding regulatory proteins specific to sucrose that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of DNA-binding regulatory proteins specific to sucrose that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=50.13 E-value=19 Score=16.41 Aligned_cols=98 Identities=10% Similarity=0.051 Sum_probs=53.5 Q ss_pred HHHHHHHCCCCCEECCCCCCCCCHH---HHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHH Q ss_conf 9999996899824515876546789---9999876515886133220466666025699999999864998189851144 Q gi|254780579|r 53 AFELLENQIGEKIFISGVHHSVSKD---ILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDY 129 (207) Q Consensus 53 a~~L~~~g~~~~ii~SG~~~~~~~~---~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~y 129 (207) +-.|.++|+....+++|........ +-....+.+.++............+..+-....+++++...--..++++++. T Consensus 108 ~~~L~~~G~~~i~~i~~~~~~~~~~~R~~gf~~a~~~~~i~~~~~~i~~~~~~~~~~~~~~~~ll~~~~~~dai~~~~d~ 187 (269) T cd06288 108 TRHLLAAGHRRIAFINGEPWMLAAKDRLKGYRQALAEAGIPFDPDLVVHGDWSADDGYEAAAALLDLDDRPTAIFCGNDR 187 (269) T ss_pred HHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEECCCCCHHHHHHHHHHHHHCCCCCCEECCCCHH T ss_conf 99999848875578748988757999999999999976998664224126776677777779998558998610147789 Q ss_pred HHHHHHHHHHHHC---C-CCEEEEE Q ss_conf 2199999999862---9-9879996 Q gi|254780579|r 130 HMPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 130 Hm~RA~~~f~~~~---p-~i~i~~~ 150 (207) -...++..++..+ | |+.++.+ T Consensus 188 ~A~g~~~~l~~~gi~vP~di~iigf 212 (269) T cd06288 188 MAMGAYQALLERGLRIPQDVSVVGF 212 (269) T ss_pred HHHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 9999999999859999998899998 No 61 >COG1737 RpiR Transcriptional regulators [Transcription] Probab=49.44 E-value=20 Score=16.35 Aligned_cols=55 Identities=15% Similarity=0.086 Sum_probs=33.7 Q ss_pred HHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 1332204666660256999999998649981898511442199999999862998799964167 Q gi|254780579|r 91 ECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIIS 154 (207) Q Consensus 91 ~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~ 154 (207) ++.++.-..+..|.| .....+.++++|. +++.+|+..+.|=|. .+ ++.+. +|+.. T Consensus 178 ~Dv~i~iS~sG~t~e-~i~~a~~ak~~ga-~vIaiT~~~~spla~----~A--d~~L~-~~~~~ 232 (281) T COG1737 178 GDVVIAISFSGYTRE-IVEAAELAKERGA-KVIAITDSADSPLAK----LA--DIVLL-VPVAE 232 (281) T ss_pred CCEEEEEECCCCCHH-HHHHHHHHHHCCC-EEEEEECCCCCCHHH----HC--CEEEE-ECCCC T ss_conf 988999848999689-9999999997799-299983999993666----54--77887-12676 No 62 >cd06274 PBP1_FruR Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs. Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to members of the type I periplasmic binding protein superfamily. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor Probab=47.11 E-value=21 Score=16.12 Aligned_cols=97 Identities=12% Similarity=0.037 Sum_probs=47.4 Q ss_pred HHHHHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHH-HHHHHHHHHCCC-CEEEEECC Q ss_conf 999999689982451587654678---9999987651588613322046666602569-999999986499-81898511 Q gi|254780579|r 53 AFELLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNA-QEASAWAEKNNF-HHVLIVTH 127 (207) Q Consensus 53 a~~L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena-~~~~~~l~~~~~-~~iiLVTs 127 (207) +-.|+++|+....+++|....... .+-++..+.+.|++.....+..... +.+.. ...++++.+++- -..+++++ T Consensus 108 ~~~L~~~Ghr~I~~i~~~~~~~~~~~R~~Gf~~al~~~g~~~~~~~i~~~~~-~~~~g~~~~~~~l~~~~~~p~ai~~~n 186 (264) T cd06274 108 TRELLAAPPEEVLFLGGLPELSPSRERLAGFRQALADAGLPVQPDWIYAEGY-SPESGYQLMAELLARLGRLPRALFTTS 186 (264) T ss_pred HHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEECCC-CHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 9999985996699976898773099999999999997799877407982688-878999999999986568874220176 Q ss_pred HHHHHHHHHHHHHHC---C-CCEEEEE Q ss_conf 442199999999862---9-9879996 Q gi|254780579|r 128 DYHMPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 128 ~yHm~RA~~~f~~~~---p-~i~i~~~ 150 (207) +...--++..++..+ | |+.|+.+ T Consensus 187 d~~a~g~l~~l~~~g~~vP~disvvgf 213 (264) T cd06274 187 YTLLEGVLRFLRERPGLAPSDLRIATF 213 (264) T ss_pred HHHHHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 999999999999859999998799998 No 63 >cd06271 PBP1_AglR_RafR_like Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the represso Probab=46.06 E-value=22 Score=16.02 Aligned_cols=96 Identities=8% Similarity=-0.027 Sum_probs=46.0 Q ss_pred HHHHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHH-HHHHHHHHCCCCEEEEECCHH Q ss_conf 99999689982451587654678---99999876515886133220466666025699-999999864998189851144 Q gi|254780579|r 54 FELLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQ-EASAWAEKNNFHHVLIVTHDY 129 (207) Q Consensus 54 ~~L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~-~~~~~l~~~~~~~iiLVTs~y 129 (207) -.|+++|+....+++|....... .+-++..+.+.|++.....+.. ...+.+++. ..+++++..+--+.++++++. T Consensus 113 ~~L~~~G~~~I~~i~~~~~~~~~~~R~~Gf~~al~~~gl~~~~~~~~~-~~~~~~~~~~~~~~ll~~~~~p~Ai~~~nD~ 191 (268) T cd06271 113 RRLIALGHRRIALLNPPEDLTFAQHRRAGYRRALAEAGLPLDPALIVS-GDMTEEGGYAAAAELLALPDRPTAIVCSSEL 191 (268) T ss_pred HHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEE-CCCCHHHHHHHHHHHHHCCCCCCEEEECCHH T ss_conf 999985987487548875460799999999999998399977113650-6875777999999999659998689977779 Q ss_pred HHHHHHHHHHHHC---C-CCEEEEE Q ss_conf 2199999999862---9-9879996 Q gi|254780579|r 130 HMPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 130 Hm~RA~~~f~~~~---p-~i~i~~~ 150 (207) -..-++..+++.+ | |+.++.+ T Consensus 192 ~A~g~l~~l~~~g~~vP~Disvigf 216 (268) T cd06271 192 MALGVLAALAEAGLRPGRDVSVVGF 216 (268) T ss_pred HHHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 9999999999829998999799997 No 64 >PRK05953 precorrin-8X methylmutase; Validated Probab=45.09 E-value=23 Score=15.93 Aligned_cols=11 Identities=9% Similarity=0.187 Sum_probs=4.5 Q ss_pred HHHHHHHHHHC Q ss_conf 99999999968 Q gi|254780579|r 50 IERAFELLENQ 60 (207) Q Consensus 50 i~~a~~L~~~g 60 (207) ++.|++-++.| T Consensus 54 v~ag~~AL~~G 64 (208) T PRK05953 54 LQAGAAALAAR 64 (208) T ss_pred HHHHHHHHHCC T ss_conf 99999999769 No 65 >COG0166 Pgi Glucose-6-phosphate isomerase [Carbohydrate transport and metabolism] Probab=44.94 E-value=23 Score=15.92 Aligned_cols=94 Identities=19% Similarity=0.141 Sum_probs=48.3 Q ss_pred CCCEEEEEC-CCHHHHHHHHHHHHHCCC---CC-EECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHH---HH Q ss_conf 687899946-985799999999996899---82-45158765467899999876515886133220466666025---69 Q gi|254780579|r 36 SVSAIVVLT-GEPIRIERAFELLENQIG---EK-IFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEG---NA 107 (207) Q Consensus 36 ~~DaIVVLg-Gg~~Ri~~a~~L~~~g~~---~~-ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~e---na 107 (207) +.|.||+.| ||.+=...++.-.-..+. +. .+++.+++. . +...+...+..+..+++-.+|-+|.| |+ T Consensus 78 ~~~~iV~IGIGGS~LG~~~~~~aL~~~~~~~~~~~Fv~nid~~-~----~~~~l~~i~~~~tl~iviSKSGtT~Et~~n~ 152 (446) T COG0166 78 KITDIVNIGIGGSDLGPRAVTEALRPYAPNGPRVHFVSNVDPT-Y----LAEVLKKLDPETTLFIVISKSGTTLETLTNF 152 (446) T ss_pred CCCEEEEECCCHHHHHHHHHHHHHHHHCCCCCCEEEECCCCCH-H----HHHHHHCCCCCCEEEEEEECCCCCHHHHHHH T ss_conf 7443999787555899999999864404688745982578822-3----4678852685627999995898958899889 Q ss_pred HHHHHHHHHC--CCCEEEEECCHHHHHHH Q ss_conf 9999999864--99818985114421999 Q gi|254780579|r 108 QEASAWAEKN--NFHHVLIVTHDYHMPRT 134 (207) Q Consensus 108 ~~~~~~l~~~--~~~~iiLVTs~yHm~RA 134 (207) ...++|+.++ ..++..++||.--...+ T Consensus 153 ~~~r~~~~~~~~~~~~~~v~~~~~~~~l~ 181 (446) T COG0166 153 RLARKWLEKKEEAAKKHFVATSTNGGALA 181 (446) T ss_pred HHHHHHHHHHHHHHHCEEEEECCCCHHHH T ss_conf 99999998646645147999828715788 No 66 >cd06283 PBP1_RegR_EndR_KdgR_like Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR. Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR, all of which are members of the LacI-GalR family of bacterial transcription regulators. RegR regulates bacterial competence and the expression of virulence factors, including hyaluronidase. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA b Probab=44.79 E-value=23 Score=15.90 Aligned_cols=97 Identities=9% Similarity=0.086 Sum_probs=51.0 Q ss_pred HHHHHHCCCCCEECCCCCCCCCH----HHHHHHHHHHCCCCHHHEECCCCCCCHHHHH-HHHHHHHHHCCCCEEEEECCH Q ss_conf 99999689982451587654678----9999987651588613322046666602569-999999986499818985114 Q gi|254780579|r 54 FELLENQIGEKIFISGVHHSVSK----DILLQKIPIRQDLAECCIDIGYKALNTEGNA-QEASAWAEKNNFHHVLIVTHD 128 (207) Q Consensus 54 ~~L~~~g~~~~ii~SG~~~~~~~----~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena-~~~~~~l~~~~~~~iiLVTs~ 128 (207) -.|+++|.....++++.....+. .+-+...+.+.+........+. ..++.+.+ ...+++++.+.--..++++++ T Consensus 109 ~~L~~~G~~~i~~i~~~~~~~~~~~~R~~g~~~a~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~aii~~~D 187 (267) T cd06283 109 DHLIEKGYERILFVTEPLDEISPRMERYEGFKEALAEHGIGVNEELIEI-DDEDADELDERLRQLLNKPKKKTAIFAANG 187 (267) T ss_pred HHHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEE-CCCCHHHHHHHHHHHHHCCCCCCEECCCCC T ss_conf 9999759972999956877883799999999999997599765148983-354267799999999847999864203770 Q ss_pred HHHHHHHHHHHHHC---C-CCEEEEEE Q ss_conf 42199999999862---9-98799964 Q gi|254780579|r 129 YHMPRTFLELQRIN---S-TVQFIPYP 151 (207) Q Consensus 129 yHm~RA~~~f~~~~---p-~i~i~~~p 151 (207) .-..-+...++..+ | ++.|+.+. T Consensus 188 ~~A~g~~~~l~~~g~~vP~disVvg~d 214 (267) T cd06283 188 LILLEVLKALKELGIRIPEDVGLIGFD 214 (267) T ss_pred HHHHHHHHHHHHCCCCCCCCCEEEEEC T ss_conf 999999999998299889876688538 No 67 >pfam08981 consensus Probab=44.40 E-value=24 Score=15.86 Aligned_cols=28 Identities=21% Similarity=0.323 Sum_probs=11.3 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEECCH Q ss_conf 6602569999999986499818985114 Q gi|254780579|r 101 LNTEGNAQEASAWAEKNNFHHVLIVTHD 128 (207) Q Consensus 101 ~~T~ena~~~~~~l~~~~~~~iiLVTs~ 128 (207) .||.+-.....+.+++.|++++++-++. T Consensus 8 ~NT~~tl~~a~~ra~e~gI~~iVvASss 35 (181) T pfam08981 8 ENTEDTLELAAERAKELGIKHIVVASSS 35 (181) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 1389999999999997699769998078 No 68 >PRK11337 DNA-binding transcriptional repressor RpiR; Provisional Probab=44.35 E-value=24 Score=15.86 Aligned_cols=60 Identities=20% Similarity=0.212 Sum_probs=36.1 Q ss_pred HHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCC Q ss_conf 332204666660256999999998649981898511442199999999862998799964167666754 Q gi|254780579|r 92 CCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPIISHDLEEN 160 (207) Q Consensus 92 ~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~~~~~~~~ 160 (207) ++++.-..+..|.+ ...+.+.++++|.+ +|.||+..+.|=| +.+ |+. ...|+....+... T Consensus 190 Dv~i~iS~sG~t~e-~i~~~~~Ak~~Ga~-vI~IT~~~~SpLa----~~a--D~v-L~~~~~e~~~~~~ 249 (293) T PRK11337 190 DVVLVVSHSGRTSD-VKAAVELAKQNGAK-IICITHSYHSPIA----KLA--DYI-ICSPAPETPLLGR 249 (293) T ss_pred CEEEEEECCCCCHH-HHHHHHHHHHCCCE-EEEECCCCCCHHH----HHC--CEE-EECCCCCCCCCCC T ss_conf 88999818999889-99999999987994-9997699998468----958--998-8648876443444 No 69 >KOG2094 consensus Probab=44.27 E-value=15 Score=17.05 Aligned_cols=30 Identities=10% Similarity=0.256 Sum_probs=21.5 Q ss_pred EEEEECCHHHHHHHH-------HHHHHHCCCCEEEEE Q ss_conf 189851144219999-------999986299879996 Q gi|254780579|r 121 HVLIVTHDYHMPRTF-------LELQRINSTVQFIPY 150 (207) Q Consensus 121 ~iiLVTs~yHm~RA~-------~~f~~~~p~i~i~~~ 150 (207) .-.|-|++||.||=- .+.++..|++.++|. T Consensus 135 ~smlsTaNY~ARkFGVRaaMPgFIarklCPdLiiVP~ 171 (490) T KOG2094 135 KSMLSTANYVARKFGVRAAMPGFIARKLCPDLIIVPL 171 (490) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCEEEECC T ss_conf 4553220488887412423806777602886489579 No 70 >cd01746 GATase1_CTP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in Cytidine Triphosphate Synthetase. Type 1 glutamine amidotransferase (GATase1) domain found in Cytidine Triphosphate Synthetase (CTP). CTP is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. CTPs produce CTP from UTP and glutamine and regulate intracellular CTP levels through interactions with four ribonucleotide triphosphates. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. CTP is derived form UTP in three separate steps involving two active sites. In one active site, the UTP O4 oxygen is activated by Mg-ATP-dependent phosphorylation, followed by displacement of the resulting 4-phosphate moiety by ammonia. At a separate site, ammonia is generated via rate limiting glutamine hydrolysis (glutaminase) activity. A gated channel that spans between the glutamine hydrolysis and amidoligase active sites provides a path for ammonia diffusion. CTPs belong to th Probab=43.52 E-value=22 Score=16.04 Aligned_cols=37 Identities=19% Similarity=-0.034 Sum_probs=28.1 Q ss_pred CCCCCEEEEECCCHHH----HHHHHHHHHHCCCCCEECCCC Q ss_conf 4568789994698579----999999999689982451587 Q gi|254780579|r 34 HPSVSAIVVLTGEPIR----IERAFELLENQIGEKIFISGV 70 (207) Q Consensus 34 ~~~~DaIVVLgGg~~R----i~~a~~L~~~g~~~~ii~SG~ 70 (207) ....|+|+|.||...| ...|++..|+..-|.+=+-.| T Consensus 53 L~~~dgilvpgGFG~rG~eGki~Ai~yARen~iPfLGIClG 93 (235) T cd01746 53 LKGADGILVPGGFGIRGVEGKILAIKYARENNIPFLGICLG 93 (235) T ss_pred HHCCCCEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 73069789578888776458899999999739972544103 No 71 >PRK08639 threonine dehydratase; Validated Probab=42.94 E-value=25 Score=15.73 Aligned_cols=89 Identities=11% Similarity=0.047 Sum_probs=43.5 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCC---CCHHHEECCCCCCCHHHHHHHHHH Q ss_conf 68789994698579999999999689982451587654678999998765158---861332204666660256999999 Q gi|254780579|r 36 SVSAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQD---LAECCIDIGYKALNTEGNAQEASA 112 (207) Q Consensus 36 ~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~---i~~~~I~~e~~s~~T~ena~~~~~ 112 (207) ..|+|+|.-||-+ +..|+..|-+...|.+=+-|+.+....+ |...+.... .+....+.|.-+..+-+ ..+-+ T Consensus 177 ~~D~vvvpvGGGG-LiaGia~~lK~~~P~ikIiGVEp~~a~~--m~~Sl~~g~~v~~~~~~t~aDG~av~~~G--~~tf~ 251 (418) T PRK08639 177 EPDYVFVPVGGGG-LISGVTTYLKEVSPKTKVIGVEPAGAAS--MKAALEAGKPVTLEKIDKFVDGAAVARVG--DLTFE 251 (418) T ss_pred CCCEEEEEECCCH-HHHHHHHHHHHHCCCCEEEEEEECCCHH--HHHHHHCCCCCCCCCCCCCCCCCCCCCCC--HHHHH T ss_conf 7888999618736-8999999998509997189981589768--99999859960068767611334335455--67899 Q ss_pred HHHHCCCCEEEEECCHHH Q ss_conf 998649981898511442 Q gi|254780579|r 113 WAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 113 ~l~~~~~~~iiLVTs~yH 130 (207) ++++. +..+++|+.+-= T Consensus 252 i~~~~-vd~iv~V~e~eI 268 (418) T PRK08639 252 ILKDV-VDDVILVPEGAV 268 (418) T ss_pred HHHHH-CCCCEEECHHHH T ss_conf 99862-897077289999 No 72 >cd01829 SGNH_hydrolase_peri2 SGNH_peri2; putative periplasmic member of the SGNH-family of hydrolases, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases. Probab=42.81 E-value=25 Score=15.71 Aligned_cols=19 Identities=16% Similarity=0.289 Sum_probs=9.3 Q ss_pred HHHHHHHHHHHCCCCCEEC Q ss_conf 9999999999689982451 Q gi|254780579|r 49 RIERAFELLENQIGEKIFI 67 (207) Q Consensus 49 Ri~~a~~L~~~g~~~~ii~ 67 (207) |++.-++..++..++.+.+ T Consensus 96 rv~~~l~~~~~~g~~ViWv 114 (200) T cd01829 96 RIDELLNVARAKGVPVIWV 114 (200) T ss_pred HHHHHHHHHHHCCCEEEEE T ss_conf 9999999997459829998 No 73 >KOG1201 consensus Probab=42.77 E-value=25 Score=15.71 Aligned_cols=72 Identities=13% Similarity=0.128 Sum_probs=30.9 Q ss_pred CCEEEEECCCH---HHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHH Q ss_conf 87899946985---799999999996899824515876546789999987651588613322046666602569999999 Q gi|254780579|r 37 VSAIVVLTGEP---IRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAW 113 (207) Q Consensus 37 ~DaIVVLgGg~---~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~ 113 (207) +..+|+-+||. +|. -|.++.+.|. ++++-..++...++..... .+.| .+..-..--+-.||...+++- T Consensus 37 ~g~~vLITGgg~GlGr~-ialefa~rg~--~~vl~Din~~~~~etv~~~--~~~g----~~~~y~cdis~~eei~~~a~~ 107 (300) T KOG1201 37 SGEIVLITGGGSGLGRL-IALEFAKRGA--KLVLWDINKQGNEETVKEI--RKIG----EAKAYTCDISDREEIYRLAKK 107 (300) T ss_pred CCCEEEEECCCCHHHHH-HHHHHHHHCC--EEEEEECCCCCHHHHHHHH--HHCC----CEEEEEECCCCHHHHHHHHHH T ss_conf 69889996898607899-9999997078--4899955651239999999--8448----525899558988999999999 Q ss_pred HHHC Q ss_conf 9864 Q gi|254780579|r 114 AEKN 117 (207) Q Consensus 114 l~~~ 117 (207) +++. T Consensus 108 Vk~e 111 (300) T KOG1201 108 VKKE 111 (300) T ss_pred HHHH T ss_conf 9986 No 74 >TIGR02128 G6PI_arch bifunctional phosphoglucose/phosphomannose isomerase; InterPro: IPR011857 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyzes the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals . Three classes of PMI have been defined . These bifunctional isomerases form a distinct phylogenetic cluster within the larger phosphoglucose isomerase (PGI) superfamily . They show relatively low sequence identity to other PGIs, but contain similar structural elements and show almost complete conservation of the catalytic residues in the active site, indicating they use a similar catalytic mechanism . The family appears to have originated in the archaea, with the bacterial proteins being acquired through horizontal transfer.; GO: 0004347 glucose-6-phosphate isomerase activity, 0004476 mannose-6-phosphate isomerase activity. Probab=42.31 E-value=25 Score=15.67 Aligned_cols=99 Identities=14% Similarity=0.037 Sum_probs=59.6 Q ss_pred CCCCEEEEEC-CC---HHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCC-HHHEECCCCCCCHHHHHHH Q ss_conf 5687899946-98---57999999999968998245158765467899999876515886-1332204666660256999 Q gi|254780579|r 35 PSVSAIVVLT-GE---PIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLA-ECCIDIGYKALNTEGNAQE 109 (207) Q Consensus 35 ~~~DaIVVLg-Gg---~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~-~~~I~~e~~s~~T~ena~~ 109 (207) ...|-||..| || .+|+...+.+-++...|.+++-- |.....+. +..++.-.-|-||.|.... T Consensus 25 e~~~~iv~~GmGGSGi~G~~~~~~~~~~~~~~Pvfvv~D-------------Y~~p~~v~~~~~lIAvSYSGNT~ETl~~ 91 (338) T TIGR02128 25 EAYDEIVISGMGGSGIAGRILSELLLEKSFEVPVFVVKD-------------YRLPAFVDGKTLLIAVSYSGNTEETLSA 91 (338) T ss_pred CCCCCEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEECC-------------CCCCCCCCCCEEEEEEECCCCHHHHHHH T ss_conf 655622787256578999999998653247888888707-------------7896988985179998617866899999 Q ss_pred HHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEECC Q ss_conf 99999864998189851144219999999986299879996416 Q gi|254780579|r 110 ASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIPYPII 153 (207) Q Consensus 110 ~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~~pv~ 153 (207) ...- .+++ .+++.|||.=-+ .-..++. ++.++-.|-. T Consensus 92 ~e~A-~~~~-~~~~aITSGG~L---~e~A~e~--g~~~i~iP~G 128 (338) T TIGR02128 92 VEEA-KKKG-AKVIAITSGGKL---EEMAKEE--GLDVIKIPKG 128 (338) T ss_pred HHHH-HHCC-CCEEEEECCCHH---HHHCCCC--CCCEEEECCC T ss_conf 9999-8657-955999448538---8630114--8756882886 No 75 >pfam04820 Trp_halogenase Tryptophan halogenase. Tryptophan halogenase catalyses the chlorination of tryptophan to form 7-chlorotryptophan. This is the first step in the biosynthesis of pyrrolnitrin, an antibiotic with broad-spectrum anti-fungal activity. Tryptophan halogenase is NADH-dependent. Probab=42.19 E-value=25 Score=15.66 Aligned_cols=52 Identities=17% Similarity=0.193 Sum_probs=28.3 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCC-CE-EC-------CCCCCCCCHHHHHHHHHHHCCCCHHH Q ss_conf 999469857999999999968998-24-51-------58765467899999876515886133 Q gi|254780579|r 40 IVVLTGEPIRIERAFELLENQIGE-KI-FI-------SGVHHSVSKDILLQKIPIRQDLAECC 93 (207) Q Consensus 40 IVVLgGg~~Ri~~a~~L~~~g~~~-~i-i~-------SG~~~~~~~~~~~~~~~~~~~i~~~~ 93 (207) |||+|||..=-..|+-|.+..+.. .| ++ .|+++... ..++..+...|+++.. T Consensus 2 IvIvGGGTAGWmtA~~L~~~~~~~~~ItlIES~~i~~iGVGE~T~--p~~~~~l~~lGI~e~~ 62 (457) T pfam04820 2 IVIVGGGTAGWMAAAALARALKGGLDVTLVESEEIGTVGVGEATI--PSIRTFNRMLGIDEAE 62 (457) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCH--HHHHHHHHHCCCCHHH T ss_conf 899898799999999999865999879999679999878457637--9999999983999899 No 76 >PRK06382 threonine dehydratase; Provisional Probab=41.40 E-value=26 Score=15.58 Aligned_cols=101 Identities=14% Similarity=0.158 Sum_probs=52.6 Q ss_pred CCCCCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCC---CCHHHEECCCCCCCHHHHHHHH Q ss_conf 4568789994698579999999999689982451587654678999998765158---8613322046666602569999 Q gi|254780579|r 34 HPSVSAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQD---LAECCIDIGYKALNTEGNAQEA 110 (207) Q Consensus 34 ~~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~---i~~~~I~~e~~s~~T~ena~~~ 110 (207) .+..|+|+|.-||-+ +..|+.++-+...|.+=+-|+.+....+ |...+.... .+....+.|.-+...-++ .+ T Consensus 165 ~pd~D~vvvpvGGGG-LisGia~a~K~~~P~ikViGVEpe~a~~--~~~sl~~g~~v~~~~~~tiaDGiav~~~G~--~t 239 (400) T PRK06382 165 LPDLDQIIVPVGGGG-LISGIALAAKHINPNVKIIGIESELSDS--MKASLREGKIVAHTSGVSICDGISVKYPGD--LT 239 (400) T ss_pred CCCCCEEEEECCCCH-HHHHHHHHHHHHCCCCEEEEECCCCCHH--HHHHHHCCCCCCCCCCCEEECCCCCCCCCC--CC T ss_conf 689998999368715-8899999999859997599979689878--999987799620588773642403565551--15 Q ss_pred HHHHHHCCCCEEEEECCHHHHHHHHHHHHH Q ss_conf 999986499818985114421999999998 Q gi|254780579|r 111 SAWAEKNNFHHVLIVTHDYHMPRTFLELQR 140 (207) Q Consensus 111 ~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~ 140 (207) .+++++. +..+++||.+-=..=-+..+++ T Consensus 240 f~i~~~~-vD~vv~V~e~eI~~A~~~l~e~ 268 (400) T PRK06382 240 FDIAKNY-VDDIVTVTEESVSKAIYKLFER 268 (400) T ss_pred HHHHHHC-CCCEEEECHHHHHHHHHHHHHH T ss_conf 9999840-8977998999999999999996 No 77 >PRK09221 beta alanine--pyruvate transaminase; Provisional Probab=41.23 E-value=26 Score=15.56 Aligned_cols=85 Identities=7% Similarity=-0.130 Sum_probs=36.9 Q ss_pred CCCHHHHHHHHHHHHHCCCCCEECCC-CCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHH-HHCC--- Q ss_conf 69857999999999968998245158-765467899999876515886133220466666025699999999-8649--- Q gi|254780579|r 44 TGEPIRIERAFELLENQIGEKIFISG-VHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWA-EKNN--- 118 (207) Q Consensus 44 gGg~~Ri~~a~~L~~~g~~~~ii~SG-~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l-~~~~--- 118 (207) |-+..++.+|+.=. ...... .+. ........++++.+.....-.-+.+.+-+....-.|.|...+... +.+| T Consensus 64 Gh~hp~i~~Ai~~Q-~~~l~~--~~~~~~~~~~~~~lae~L~~~~p~~l~~v~f~~SGsEAvE~AiKlAr~y~~~~g~~~ 140 (445) T PRK09221 64 GHGRPEIVEAVARQ-AATLDY--APAFQMGHPLAFELAERLAELAPGGLDHVFFTNSGSESVDTALKIALAYHRARGQGT 140 (445) T ss_pred CCCCHHHHHHHHHH-HHHCCC--CCCCCCCCHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 78979999999999-865687--245556888999999999984888758799608789999999999999999719997 Q ss_pred CCEEEEECCHHHH Q ss_conf 9818985114421 Q gi|254780579|r 119 FHHVLIVTHDYHM 131 (207) Q Consensus 119 ~~~iiLVTs~yHm 131 (207) -++++-....||- T Consensus 141 r~~ii~~~~syHG 153 (445) T PRK09221 141 RTRLIGRERGYHG 153 (445) T ss_pred CCEEEEECCCCCC T ss_conf 5269620377479 No 78 >PRK06186 hypothetical protein; Validated Probab=40.51 E-value=27 Score=15.49 Aligned_cols=36 Identities=17% Similarity=0.126 Sum_probs=25.9 Q ss_pred CCCCCCEEEEECCCHHH----HHHHHHHHHHCCCCCEECC Q ss_conf 54568789994698579----9999999996899824515 Q gi|254780579|r 33 DHPSVSAIVVLTGEPIR----IERAFELLENQIGEKIFIS 68 (207) Q Consensus 33 ~~~~~DaIVVLgGg~~R----i~~a~~L~~~g~~~~ii~S 68 (207) .....|+|+|.+|+..| ...|++..|+..-|.+=+- T Consensus 50 ~L~~~dgilv~pGfG~RG~eGki~Ai~yARen~iP~LGIC 89 (229) T PRK06186 50 DLAGFDGIWCVPGSPYRNEDGALTAIRFARENGIPFLGTC 89 (229) T ss_pred HHCCCCEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEH T ss_conf 2225998998998777656389999999987699878642 No 79 >TIGR01373 soxB sarcosine oxidase, beta subunit family; InterPro: IPR006278 These sequences represent the beta subunit of a family of known and putative heterotetrameric sarcosine oxidases. Five operons of such oxidases are found in Mesorhizobium loti and three in Agrobacterium tumefaciens, a high enough copy number to suggest that not all members share the same function. The model is designated as subfamily rather than equivalog for this reason. Sarcosine oxidase catalyzes the oxidative demethylation of sarcosine to glycine. The reaction converts tetrahydrofolate to 5,10-methylene-tetrahydrofolate . Bacterial sarcosine oxidases have been isolated from over a dozen different organisms and fall into two major classes (1) monomeric form that contains only covalent flavin and (2) heterotetrameric (alpha, beta, gamma, delta) forms that contain a covalent and noncovalent flavin, this entry represents the heterotetrameric form.; GO: 0008115 sarcosine oxidase activity, 0046653 tetrahydrofolate metabolic process. Probab=40.13 E-value=27 Score=15.46 Aligned_cols=35 Identities=17% Similarity=0.168 Sum_probs=24.6 Q ss_pred CCCCCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCE Q ss_conf 676545687899946985799999999996899824 Q gi|254780579|r 30 HIPDHPSVSAIVVLTGEPIRIERAFELLENQIGEKI 65 (207) Q Consensus 30 ~~~~~~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~i 65 (207) .+++++..|+|||=|||.+ +.+|.=|.+++=-..+ T Consensus 24 ~P~PK~~YDviIvGgGGHG-LATAYYLA~~hGItnV 58 (407) T TIGR01373 24 SPEPKPEYDVIIVGGGGHG-LATAYYLAKEHGITNV 58 (407) T ss_pred CCCCCCCCCEEEECCCCCH-HHHHHHHHHHCCCCEE T ss_conf 7336885467888688606-8999998864694068 No 80 >PRK05575 cbiC precorrin-8X methylmutase; Validated Probab=40.05 E-value=28 Score=15.45 Aligned_cols=11 Identities=45% Similarity=0.531 Sum_probs=4.3 Q ss_pred HHHHHHHHHHC Q ss_conf 99999999968 Q gi|254780579|r 50 IERAFELLENQ 60 (207) Q Consensus 50 i~~a~~L~~~g 60 (207) ++.|++-++.| T Consensus 58 v~ag~~AL~~G 68 (204) T PRK05575 58 IEAAKEILKKG 68 (204) T ss_pred HHHHHHHHHCC T ss_conf 99999999779 No 81 >PRK03501 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=39.93 E-value=28 Score=15.44 Aligned_cols=28 Identities=18% Similarity=0.217 Sum_probs=19.2 Q ss_pred CCCCCCCCCEEEEECCCHHHHHHHHHHHH Q ss_conf 67654568789994698579999999999 Q gi|254780579|r 30 HIPDHPSVSAIVVLTGEPIRIERAFELLE 58 (207) Q Consensus 30 ~~~~~~~~DaIVVLgGg~~Ri~~a~~L~~ 58 (207) ..++++.+|.|||+||+-- +..|++... T Consensus 33 ~vdd~~~aDliI~iGGDGT-~L~a~r~~~ 60 (264) T PRK03501 33 VVDDPKNANIIVSIGGDGT-FLQAVRKTG 60 (264) T ss_pred EECCCCCCCEEEEECCHHH-HHHHHHHHH T ss_conf 8489889899999997199-999999873 No 82 >cd06285 PBP1_LacI_like_7 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=39.46 E-value=28 Score=15.39 Aligned_cols=98 Identities=11% Similarity=0.107 Sum_probs=54.4 Q ss_pred HHHHHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHH Q ss_conf 999999689982451587654678---99999876515886133220466666025699999999864998189851144 Q gi|254780579|r 53 AFELLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDY 129 (207) Q Consensus 53 a~~L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~y 129 (207) +-.|+++|+.+..+++|....... .+-....+.+.+++.....+.....+..+.....+++++...-.+.++++++. T Consensus 106 ~~~Li~~G~~~i~~l~~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~l~~~~~~~ai~~~nD~ 185 (265) T cd06285 106 TRHLLDLGHRRIAVLAGPDYASTARDRLAGFRAALAEAGIEVPPERIVYSGFDIEGGEAAAEKLLRSDSPPTAIFAVNDF 185 (265) T ss_pred HHHHHHHCCCCEEECCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHEECCCCCHHHHHHHHHHHHHCCCCCCCEECCCHH T ss_conf 99999838865875478877705999999999999986999875660136621556999999998569998810037889 Q ss_pred HHHHHHHHHHHHC---C-CCEEEEE Q ss_conf 2199999999862---9-9879996 Q gi|254780579|r 130 HMPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 130 Hm~RA~~~f~~~~---p-~i~i~~~ 150 (207) =...++..+++.+ | ++.++.+ T Consensus 186 ~A~g~~~~l~~~g~~iP~di~iigf 210 (265) T cd06285 186 AAIGVMGAARDRGLRVPDDVALVGY 210 (265) T ss_pred HHHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 9999999999859999998799998 No 83 >cd04605 CBS_pair_MET2_assoc This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the MET2 domain. Met2 is a key enzyme in the biosynthesis of methionine. It encodes a homoserine transacetylase involved in converting homoserine to O-acetyl homoserine. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Probab=38.65 E-value=29 Score=15.32 Aligned_cols=81 Identities=11% Similarity=0.074 Sum_probs=45.2 Q ss_pred HHHHHHHHHHHHCCCCCEECCCCC-C---CCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 799999999996899824515876-5---467899999876515886133220466666025699999999864998189 Q gi|254780579|r 48 IRIERAFELLENQIGEKIFISGVH-H---SVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVL 123 (207) Q Consensus 48 ~Ri~~a~~L~~~g~~~~ii~SG~~-~---~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~ii 123 (207) .-+..|+++.+++....+.+...+ . -.++.++.+.........++-..-.....+..+...++.+.|.+++++++- T Consensus 11 ~tv~~a~~~m~~~~~~~lpVvd~~~~~vGiit~~Di~~~~~~~~~~v~~im~~~~~~~~~~~~~~~a~~~m~~~~i~~lP 90 (110) T cd04605 11 ASIKEAAKLMIEENINHLPVVDEDGRLVGIVTSWDISKAVARDKKSVEDIMTRNVITATPDEPIDVAARKMERHNISALP 90 (110) T ss_pred CCHHHHHHHHHHCCCCEEEEECCCCCEEEEEECHHHHHHHHCCCCCHHHCEEECHHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 99999999999729988999979991999996227888987079763360430012014655899999999983961999 Q ss_pred EECCH Q ss_conf 85114 Q gi|254780579|r 124 IVTHD 128 (207) Q Consensus 124 LVTs~ 128 (207) +|-++ T Consensus 91 VvD~~ 95 (110) T cd04605 91 VVDAE 95 (110) T ss_pred EECCC T ss_conf 99189 No 84 >pfam06637 PV-1 PV-1 protein (PLVAP). This family consists of several PV-1 (PLVAP) proteins which seem to be specific to mammals. PV-1 is a novel protein component of the endothelial fenestral and stomatal diaphragms. The function of this family is unknown. Probab=38.15 E-value=29 Score=15.27 Aligned_cols=27 Identities=26% Similarity=0.637 Sum_probs=19.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 917999999999999999999999721 Q gi|254780579|r 1 MRYFWYGLFVCLIFFIMGFISFIRYVK 27 (207) Q Consensus 1 mR~~~i~l~~~~~~~~~~~~~f~~~~~ 27 (207) ||+||+|.-++-.+.+.|++.|.-.-. T Consensus 25 LryfFlFvSLIQfLIIlgLVLFmVYGn 51 (442) T pfam06637 25 LRYFFLFVSLIQFLIILGLVLFMIYGN 51 (442) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 999999999999999999999964077 No 85 >PRK06411 NADH dehydrogenase subunit B; Validated Probab=37.99 E-value=30 Score=15.25 Aligned_cols=40 Identities=15% Similarity=0.100 Sum_probs=30.7 Q ss_pred CCCCCCCCCEEEEECCCHHHHHHHHH-HHHHCCCCCEECCC Q ss_conf 67654568789994698579999999-99968998245158 Q gi|254780579|r 30 HIPDHPSVSAIVVLTGEPIRIERAFE-LLENQIGEKIFISG 69 (207) Q Consensus 30 ~~~~~~~~DaIVVLgGg~~Ri~~a~~-L~~~g~~~~ii~SG 69 (207) ...+++++|.++|-|--......++. +|+|-+.|+.+++- T Consensus 74 ~~aSPRqADvliVtG~vT~km~~~l~~~YeqmPePK~VIA~ 114 (195) T PRK06411 74 FRASPRQADLMIVAGTLTNKMAPALRRLYDQMPEPKWVISM 114 (195) T ss_pred ECCCCCCCCEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 60587524478970523188899999999858997079984 No 86 >COG2240 PdxK Pyridoxal/pyridoxine/pyridoxamine kinase [Coenzyme metabolism] Probab=37.85 E-value=30 Score=15.24 Aligned_cols=39 Identities=28% Similarity=0.196 Sum_probs=28.3 Q ss_pred CHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHH Q ss_conf 61332204666660256999999998649981898511442 Q gi|254780579|r 90 AECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 90 ~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yH 130 (207) .+-++..+ .+-+|.+++..+++.+.+.|. +.+||||.-- T Consensus 145 fELe~Ltg-~~~~~~~da~~aa~~L~~~gp-~~vlVTS~~~ 183 (281) T COG2240 145 FELEILTG-KPLNTLDDAVKAARKLGADGP-KIVLVTSLSR 183 (281) T ss_pred HHHHHHHC-CCCCCHHHHHHHHHHHHHCCC-CEEEEECCCC T ss_conf 99999849-887888999999999751489-8899953453 No 87 >TIGR00337 PyrG CTP synthase; InterPro: IPR004468 CTP synthase is involved in pyrimidine ribonucleotide/ribonucleoside metabolism. The enzyme catalyzes the reaction L-glutamine + H2O + UTP + ATP = CTP + phosphate + ADP + L-glutamate. The enzyme exists as a dimer of identical chains that aggregates as a tetramer. This gene has been found circa 500 bp 5 upstream of enolase in both beta (Nitrosomonas europaea) and gamma (Escherichia coli) subdivisions of Proteobacterium .; GO: 0003883 CTP synthase activity, 0006221 pyrimidine nucleotide biosynthetic process. Probab=36.83 E-value=31 Score=15.14 Aligned_cols=136 Identities=13% Similarity=0.040 Sum_probs=75.3 Q ss_pred HHHHHHHHH--HHHCCCCCEECC--CC----CCCCCHHHHHHHHHHHCCC-----------CHHHEECCCCCCCHHHHHH Q ss_conf 799999999--996899824515--87----6546789999987651588-----------6133220466666025699 Q gi|254780579|r 48 IRIERAFEL--LENQIGEKIFIS--GV----HHSVSKDILLQKIPIRQDL-----------AECCIDIGYKALNTEGNAQ 108 (207) Q Consensus 48 ~Ri~~a~~L--~~~g~~~~ii~S--G~----~~~~~~~~~~~~~~~~~~i-----------~~~~I~~e~~s~~T~ena~ 108 (207) +|+..+++- =..|..+-+++. || -+....-|+.++...+-|- |-..-.-|-+.+=|....+ T Consensus 126 ~~I~~~A~~P~eDtG~~~Dv~IvEiGGTVGDIEs~PFLEAiRQ~~~e~G~Env~~iHvTLVP~i~aagE~KTKPTQhSVK 205 (571) T TIGR00337 126 DRIKRVAKKPVEDTGSGADVVIVEIGGTVGDIESLPFLEAIRQLKKEVGRENVLFIHVTLVPYIAAAGELKTKPTQHSVK 205 (571) T ss_pred HHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCCHHHHH T ss_conf 99999603776456799747999837700000362589999999987389867999840026314487478775127899 Q ss_pred HHHHHHHHCCCC-EEEEECCHHHHHHHH----HHHHHHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 999999864998-189851144219999----999986299879996416766675421169899999999999999999 Q gi|254780579|r 109 EASAWAEKNNFH-HVLIVTHDYHMPRTF----LELQRINSTVQFIPYPIISHDLEENSSIFKIKILRVLLIEYLKILLLS 183 (207) Q Consensus 109 ~~~~~l~~~~~~-~iiLVTs~yHm~RA~----~~f~~~~p~i~i~~~pv~~~~~~~~~Ww~~~~~~~~~~~Ey~K~l~~~ 183 (207) +++. -|+. .+|++=|+--+.... +.|+.+- .-.|+.+.-.+.-|.-.--..... -.+++..+ T Consensus 206 eLRs----~Gi~PD~i~cRs~~~l~~~~k~KiAlFCdV~-~~aVI~~~Da~sIY~vPl~L~~eg--------l~~~~~~r 272 (571) T TIGR00337 206 ELRS----LGIQPDIIICRSSEPLDKSLKKKIALFCDVE-EEAVINAEDASSIYEVPLLLKKEG--------LDDYLCRR 272 (571) T ss_pred HHHH----CCCCCCEEEECCCCCCCHHHHHHHHHCCCCC-CCCEEEEECCCCCCCCCCEECCCC--------HHHHHHHH T ss_conf 9986----0988868998188768977764521005889-732578655524011352100043--------35799998 Q ss_pred HHHHHCCCCCCCC Q ss_conf 9876055676654 Q gi|254780579|r 184 IQLSLSTQTASQF 196 (207) Q Consensus 184 lr~~i~~~~~~~~ 196 (207) |++.-.|.+.++. T Consensus 273 l~L~~~~~~ee~k 285 (571) T TIGR00337 273 LNLNEKPLIEEEK 285 (571) T ss_pred HCCCCCCCCHHHH T ss_conf 4378777420122 No 88 >COG2143 Thioredoxin-related protein [Posttranslational modification, protein turnover, chaperones] Probab=36.02 E-value=32 Score=15.06 Aligned_cols=86 Identities=15% Similarity=0.146 Sum_probs=43.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHH-----HHHCCCCCCCCCCCCEEEEECCC--------------HHHHHHHHH------ Q ss_conf 91799999999999999999999-----97213467654568789994698--------------579999999------ Q gi|254780579|r 1 MRYFWYGLFVCLIFFIMGFISFI-----RYVKQMHIPDHPSVSAIVVLTGE--------------PIRIERAFE------ 55 (207) Q Consensus 1 mR~~~i~l~~~~~~~~~~~~~f~-----~~~~~~~~~~~~~~DaIVVLgGg--------------~~Ri~~a~~------ 55 (207) ||.+.++++++++++.+--..+- ..........+.+--.+++.+.. ..|+.+.+. T Consensus 2 mRvl~i~Lliis~fl~a~~s~~e~~s~~~~~~d~ksi~~~~Kylllmfes~~C~yC~~fKKd~~~~krlrEylk~hf~~~ 81 (182) T COG2143 2 MRVLLIVLLIISLFLSACKSNNEKRSNIDVFDDNKSISPNDKYLLLMFESNGCSYCERFKKDLKNVKRLREYLKEHFSAY 81 (182) T ss_pred CCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHCCHHHHHHHHHHCEEEE T ss_conf 00199999999999999717804432255677787459567589999758997079999886246387999976275899 Q ss_pred -HHHHCCCCCEECCCCC-CCCCHHHHHHHHHHH Q ss_conf -9996899824515876-546789999987651 Q gi|254780579|r 56 -LLENQIGEKIFISGVH-HSVSKDILLQKIPIR 86 (207) Q Consensus 56 -L~~~g~~~~ii~SG~~-~~~~~~~~~~~~~~~ 86 (207) |+-.+..|..+.-|+. ...++.++++.+.+. T Consensus 82 ~l~i~~skpv~f~~g~kee~~s~~ELa~kf~vr 114 (182) T COG2143 82 YLNISYSKPVLFKVGDKEEKMSTEELAQKFAVR 114 (182) T ss_pred EEEECCCCCEEEECCCEEEEECHHHHHHHHCCC T ss_conf 998315762674127654554399999873555 No 89 >COG3962 Acetolactate synthase [Amino acid transport and metabolism] Probab=36.01 E-value=32 Score=15.06 Aligned_cols=44 Identities=11% Similarity=0.062 Sum_probs=31.6 Q ss_pred HHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHH Q ss_conf 99999999996899824515876546789999987651588613 Q gi|254780579|r 49 RIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAEC 92 (207) Q Consensus 49 Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~ 92 (207) .+..|++|.|..+.|.|+..|+-......+..+.+.+..++|-. T Consensus 218 eL~~A~~lik~ak~PlIvaGGGv~YS~A~~~L~af~E~~~iPv~ 261 (617) T COG3962 218 ELADAAALIKSAKKPLIVAGGGVLYSGAREALRAFAETHGIPVV 261 (617) T ss_pred HHHHHHHHHHHCCCCEEEECCCEEECHHHHHHHHHHHHCCCCEE T ss_conf 99999999983579779966865443079999999986499657 No 90 >PRK08198 threonine dehydratase; Provisional Probab=35.21 E-value=33 Score=14.98 Aligned_cols=101 Identities=15% Similarity=0.108 Sum_probs=56.1 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCC---CCHHHEECCCCCCCHHHHHHHHH Q ss_conf 568789994698579999999999689982451587654678999998765158---86133220466666025699999 Q gi|254780579|r 35 PSVSAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQD---LAECCIDIGYKALNTEGNAQEAS 111 (207) Q Consensus 35 ~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~---i~~~~I~~e~~s~~T~ena~~~~ 111 (207) +..|+|+|.-||-+ +..|+..+-+...|.+=+-|+.+....+ |...+.... ++....+.|.-+..+-+ ..+- T Consensus 171 p~~D~vvvpvGGGG-LiaGia~a~K~~~P~ikViGVEpe~a~~--~~~Sl~ag~~v~~~~~~tiADGiav~~~G--~~tf 245 (406) T PRK08198 171 PDVDTVVVPIGGGG-LISGVATAVKALRPNVRVIGVQAEGAPA--MPLSLAAGRPVELESVDTIADGIAVKRPG--DLTF 245 (406) T ss_pred CCCCEEEECCCCHH-HHHHHHHHHHHHCCCCCEEEEEECCCHH--HHHHHHCCCCEECCCCCEEECCCCCCCCC--HHHH T ss_conf 77787886378626-8899999999868997089996578768--99999849956559999797114468777--8899 Q ss_pred HHHHHCCCCEEEEECCHHHHHHHHHHHHHH Q ss_conf 999864998189851144219999999986 Q gi|254780579|r 112 AWAEKNNFHHVLIVTHDYHMPRTFLELQRI 141 (207) Q Consensus 112 ~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~ 141 (207) +++++. +..+++||.+-=..=-+..|++. T Consensus 246 ~i~~~~-vD~vv~V~e~eI~~A~~~l~e~~ 274 (406) T PRK08198 246 PIIREL-VDDVVTVSDEEIARAILLLLERA 274 (406) T ss_pred HHHHHC-CCCEEEECHHHHHHHHHHHHHHC T ss_conf 999962-99789989999999999999974 No 91 >KOG3153 consensus Probab=35.13 E-value=33 Score=14.97 Aligned_cols=13 Identities=8% Similarity=0.475 Sum_probs=5.7 Q ss_pred HHHHHHHHHHHCC Q ss_conf 6999999998649 Q gi|254780579|r 106 NAQEASAWAEKNN 118 (207) Q Consensus 106 na~~~~~~l~~~~ 118 (207) ....+.+|++++. T Consensus 107 DftKcv~~i~~~~ 119 (250) T KOG3153 107 DFTKCVKWIQEHK 119 (250) T ss_pred HHHHHHHHHHHHC T ss_conf 3899999998862 No 92 >cd06284 PBP1_LacI_like_6 Ligand-binding domain of an uncharacterized transcription regulator from Actinobacillus succinogenes and its close homologs from other bacteria. This group includes the ligand-binding domain of an uncharacterized transcription regulator from Actinobacillus succinogenes and its close homologs from other bacteria. This group belongs to the the LacI-GalR family repressors and are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding. Probab=34.42 E-value=34 Score=14.90 Aligned_cols=96 Identities=8% Similarity=-0.017 Sum_probs=53.2 Q ss_pred HHHHHCCCCCEECCCCCCCCCHH---HHHHHHHHHCCCCHHHEECCCCCCCHHHHH-HHHHHHHHHCCCCEEEEECCHHH Q ss_conf 99996899824515876546789---999987651588613322046666602569-99999998649981898511442 Q gi|254780579|r 55 ELLENQIGEKIFISGVHHSVSKD---ILLQKIPIRQDLAECCIDIGYKALNTEGNA-QEASAWAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 55 ~L~~~g~~~~ii~SG~~~~~~~~---~~~~~~~~~~~i~~~~I~~e~~s~~T~ena-~~~~~~l~~~~~~~iiLVTs~yH 130 (207) .|.++|+.+..+++|........ +-+...+.+.++......... ...|++++ ...++++..++--..++++++.- T Consensus 109 ~L~~~G~~~i~~i~~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~ai~~~nD~~ 187 (267) T cd06284 109 HLISLGHRRIALITGPRDNPLARDRLEGYRQALAEAGLPADEELIQE-GDFSLESGYAAARRLLALPDRPTAIFCFSDEM 187 (267) T ss_pred HHHHHCCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEC-CCCCHHHHHHHHHHHHHCCCCCCEEEECCCHH T ss_conf 99970997699962888872599999999999998399855127732-78767778899999996599998026578688 Q ss_pred HHHHHHHHHHHC---C-CCEEEEEE Q ss_conf 199999999862---9-98799964 Q gi|254780579|r 131 MPRTFLELQRIN---S-TVQFIPYP 151 (207) Q Consensus 131 m~RA~~~f~~~~---p-~i~i~~~p 151 (207) .-.++..++..+ | ++.++.+- T Consensus 188 A~g~~~~l~~~g~~vP~di~vigfd 212 (267) T cd06284 188 AIGAISALKELGLRVPEDISVVGFD 212 (267) T ss_pred HHHHHHHHHHCCCCCCCEEEEEEEC T ss_conf 9999999998399999627999877 No 93 >cd06290 PBP1_LacI_like_9 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=33.95 E-value=34 Score=14.86 Aligned_cols=98 Identities=13% Similarity=0.058 Sum_probs=51.7 Q ss_pred HHHHHHHCCCCCEECCCCCCCCCHHH---HHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHH Q ss_conf 99999968998245158765467899---999876515886133220466666025699999999864998189851144 Q gi|254780579|r 53 AFELLENQIGEKIFISGVHHSVSKDI---LLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDY 129 (207) Q Consensus 53 a~~L~~~g~~~~ii~SG~~~~~~~~~---~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~y 129 (207) +-.|+++|+....+++|........+ -.++.+.+.|++...-.+.....+..+....+.++++++.--..++++++. T Consensus 107 ~~~L~~~G~~~i~~i~~~~~~~~~~~R~~Gf~~a~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ai~~~nD~ 186 (265) T cd06290 107 TQHLIDLGHRRIAHITGPRGHIDARDRLAGYRKALEEAGLEVQPDLIVQGDFEEESGLEAVEELLQRGPDFTAIFAANDQ 186 (265) T ss_pred HHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHEECCCCCCHHHHHHHHHHHHHCCCCCCCEECCHH T ss_conf 99999829986556048878701999999999999986999988883416677178999999999719998842105789 Q ss_pred HHHHHHHHHHHHC---C-CCEEEEE Q ss_conf 2199999999862---9-9879996 Q gi|254780579|r 130 HMPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 130 Hm~RA~~~f~~~~---p-~i~i~~~ 150 (207) =.--++..++..+ | ++.++.+ T Consensus 187 ~A~g~~~~~~~~g~~vP~di~vigf 211 (265) T cd06290 187 TAYGARLALYRRGLRVPEDVSLIGF 211 (265) T ss_pred HHHHHHHHHHHHCCCCCCCEEEEEE T ss_conf 9999999999809999998799999 No 94 >TIGR02529 EutJ ethanolamine utilization protein EutJ family protein; InterPro: IPR013366 Salmonella typhimurium is capable of growth on ethanolamine as a sole source of carbon nitrogen and energy . During growth on this compound the cells form a multimolecular structure known as a metabolosome, which is similar to the carboxysome used by some photosynthetic bacteria to fix CO2, and is thought to contain the enzymes needed to metabolise this compound to acetyl-CoA. The metabolosome is not directly involved in the biochemistry of ethanolamine utilization - instead its role is thought to be to concentrate the enzymes involved in this process, while also protecting the cell from the build-up of toxic intermediates . The genes involved in growth on ethanolamine are encoded in a 17-gene operon known as the ethanolamine utilization (eut) operon. EutJ shows similarity to chaperonins and may play a role in assembly of the metabolosme , though it is not necessary for growth on this compound.. Probab=33.07 E-value=36 Score=14.77 Aligned_cols=69 Identities=6% Similarity=-0.032 Sum_probs=48.8 Q ss_pred CCHHHHHHHH-HHHCCCCHH-HEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 6789999987-651588613-322046666602569999999986499818985114421999999998629987999 Q gi|254780579|r 74 VSKDILLQKI-PIRQDLAEC-CIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFIP 149 (207) Q Consensus 74 ~~~~~~~~~~-~~~~~i~~~-~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~~ 149 (207) ..|||-+|+. ..+..+... .-.+++ =|.-.+.|++.++++++.||==+--..-..-+|+|.+|+++++. T Consensus 155 ~EEAEe~K~~~k~e~E~F~~v~PV~qK-------mA~Iv~~hi~~~~v~~~yLVGGac~~~~f~~~F~k~~Pg~~v~~ 225 (240) T TIGR02529 155 FEEAEEIKRDKKKEEEVFSVVKPVVQK-------MASIVKKHIEGQKVKDLYLVGGACSFSGFEDVFEKELPGLNVIK 225 (240) T ss_pred HHHHHHHHHCCCCCEEEEEEECCHHHH-------HHHHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCC T ss_conf 889887863058843788760413777-------89999998511883368982440052246788876427876216 No 95 >COG1058 CinA Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA [General function prediction only] Probab=33.01 E-value=36 Score=14.76 Aligned_cols=31 Identities=19% Similarity=0.317 Sum_probs=24.9 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCCCEECCCC Q ss_conf 89994698579999999999689982451587 Q gi|254780579|r 39 AIVVLTGEPIRIERAFELLENQIGEKIFISGV 70 (207) Q Consensus 39 aIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~ 70 (207) -+.+.|-..+|+..+++...+. +..+++||| T Consensus 39 ~~~~VgD~~~~I~~~l~~a~~r-~D~vI~tGG 69 (255) T COG1058 39 RITTVGDNPDRIVEALREASER-ADVVITTGG 69 (255) T ss_pred EEEECCCCHHHHHHHHHHHHHC-CCEEEECCC T ss_conf 9996399999999999999718-998998798 No 96 >cd06299 PBP1_LacI_like_13 Ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This group includes the ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This regulatory protein is a member of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial trans Probab=33.01 E-value=36 Score=14.76 Aligned_cols=96 Identities=8% Similarity=0.009 Sum_probs=46.0 Q ss_pred HHHHHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHH-HHHHHHHCCCCEEEEECCH Q ss_conf 999999689982451587654678---999998765158861332204666660256999-9999986499818985114 Q gi|254780579|r 53 AFELLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQE-ASAWAEKNNFHHVLIVTHD 128 (207) Q Consensus 53 a~~L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~-~~~~l~~~~~~~iiLVTs~ 128 (207) +-.|.++|+....+++|.....+. .+-.+..+.+.+.+.....+... ..+.+.+.. ..++++ ++ ...+++.++ T Consensus 108 ~~~L~~~G~~~i~~i~~~~~~~~~~~R~~gf~~a~~~~~~~~~~~~i~~~-~~~~~~~~~~~~~~l~-~~-p~ai~~~nD 184 (265) T cd06299 108 VSLLVALGHKKIGYISGPQDTSTGRERLEAFRQACASLGLEVNEDLVVLG-GYSQESGYAGATKLLD-QG-ATAIIAGDS 184 (265) T ss_pred HHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCEEEEE-CCCHHHHHHHHHHHHH-CC-CCEEECCCC T ss_conf 99999749983886048988756999999999999976999993059984-0355789999999996-39-985450685 Q ss_pred HHHHHHHHHHHHHC---C-CCEEEEEE Q ss_conf 42199999999862---9-98799964 Q gi|254780579|r 129 YHMPRTFLELQRIN---S-TVQFIPYP 151 (207) Q Consensus 129 yHm~RA~~~f~~~~---p-~i~i~~~p 151 (207) ...--++..+++.+ | ++.++.+- T Consensus 185 ~~A~g~~~~l~~~g~~vP~dvsvvgfd 211 (265) T cd06299 185 MMTIGAIRAIHDAGLVIGEDISLIGFD 211 (265) T ss_pred HHHHHHHHHHHHHCCCCCCCEEEEEEC T ss_conf 889999999998389999934999999 No 97 >KOG1276 consensus Probab=32.43 E-value=36 Score=14.70 Aligned_cols=88 Identities=14% Similarity=-0.025 Sum_probs=53.2 Q ss_pred CCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHH----HHH Q ss_conf 8789994698579999999999689982451587654678999998765158861332204666660256999----999 Q gi|254780579|r 37 VSAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQE----ASA 112 (207) Q Consensus 37 ~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~----~~~ 112 (207) .--++|+|||..=+-.|..|.+.+..+.|.+...++.....- + .....+-+++|.-+ +|...|-. +.. T Consensus 11 ~~~vaVvGGGiSGL~aay~L~r~~p~~~i~l~Ea~~RvGGwi--r-----S~r~~ng~ifE~GP-rtlrpag~~g~~~l~ 82 (491) T KOG1276 11 GMTVAVVGGGISGLCAAYYLARLGPDVTITLFEASPRVGGWI--R-----SDRMQNGFIFEEGP-RTLRPAGPGGAETLD 82 (491) T ss_pred CCEEEEECCCHHHHHHHHHHHHCCCCCEEEEEECCCCCCCEE--E-----ECCCCCCEEECCCC-CCCCCCCCCHHHHHH T ss_conf 666999888536889999998548995599984278666525--6-----32689854421377-756768830457899 Q ss_pred HHHHCCCCEEEEECCHHHHH Q ss_conf 99864998189851144219 Q gi|254780579|r 113 WAEKNNFHHVLIVTHDYHMP 132 (207) Q Consensus 113 ~l~~~~~~~iiLVTs~yHm~ 132 (207) ++.+.|++.-++--+.-|-. T Consensus 83 lv~dLGl~~e~~~i~~~~pa 102 (491) T KOG1276 83 LVSDLGLEDELQPIDISHPA 102 (491) T ss_pred HHHHCCCCCEEEECCCCCHH T ss_conf 99870863114335887822 No 98 >TIGR02247 HAD-1A3-hyp Epoxide hydrolase N-terminal domain-like phosphatase; InterPro: IPR011945 This entry represents a small clade of sequences from the metazoa and bacteria. In eukaryotes, this domain exists as an N-terminal fusion to the soluble epoxide hydrolase enzyme and has recently been shown to be an active phosphatase, although the nature of the biological substrate is unclear . These appear to be members of the haloacid dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases by general homology and the conservation of all of the recognized catalytic motifs (although the first motif is unusual in the replacement of the more common aspartate with glycine). The variable domain is found in between motifs 1 and 2, indicating membership in subfamily I and phylogeny and prediction of the alpha helical nature of the variable domain indicate membership in subfamily IA.. Probab=32.31 E-value=33 Score=14.96 Aligned_cols=36 Identities=8% Similarity=0.144 Sum_probs=28.3 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHH Q ss_conf 4666660256999999998649981898511442199 Q gi|254780579|r 97 GYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPR 133 (207) Q Consensus 97 e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~R 133 (207) |.+.+--.-|...+.+.++.+|+| ..++|+++|+-| T Consensus 100 ~~q~tkl~p~m~~~i~~~~~~Gfk-t~~lTNN~~l~~ 135 (228) T TIGR02247 100 EIQVTKLLPEMVKAIKSLRAKGFK-TALLTNNFYLDK 135 (228) T ss_pred CEECCCCCHHHHHHHHHHHHCCCE-EEEEECCCCCCC T ss_conf 400025687899999999757775-888616730567 No 99 >PRK07334 threonine dehydratase; Provisional Probab=32.18 E-value=37 Score=14.68 Aligned_cols=101 Identities=16% Similarity=0.084 Sum_probs=54.0 Q ss_pred CCCCCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCH-HHEECCCCCCCHHHHHHHHHH Q ss_conf 4568789994698579999999999689982451587654678999998765158861-332204666660256999999 Q gi|254780579|r 34 HPSVSAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAE-CCIDIGYKALNTEGNAQEASA 112 (207) Q Consensus 34 ~~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~-~~I~~e~~s~~T~ena~~~~~ 112 (207) .+..|+|||.-||-+ +..|+.++-+...|.+=+-|+.+....+ |........++. ..-+.|.-+...- ...+-+ T Consensus 166 ~p~~D~vvvpvGGGG-LisGia~a~K~~~P~ikViGVEpe~a~~--m~~sl~~~~~~~~~~TiADGlav~~~--G~~tf~ 240 (399) T PRK07334 166 APDLDTLVVPIGGGG-LISGIATAAKALKPDIEIIGVQTELYPS--MYAALKGVALPCGGSTIAEGIAVKSP--GQLTLA 240 (399) T ss_pred CCCCCEEEEECCCCH-HHHHHHHHHHCCCCCCEEEEECCCCCHH--HHHHHCCCCCCCCCCCCCCCCCCCCC--CHHHHH T ss_conf 789998999278604-7999999996049997899976587769--99984699988998732012246888--777899 Q ss_pred HHHHCCCCEEEEECCHHHHHHHHHHHHH Q ss_conf 9986499818985114421999999998 Q gi|254780579|r 113 WAEKNNFHHVLIVTHDYHMPRTFLELQR 140 (207) Q Consensus 113 ~l~~~~~~~iiLVTs~yHm~RA~~~f~~ 140 (207) ++++. +..+++||.+-=..=-+..+++ T Consensus 241 i~~~~-vD~vv~Vse~eI~~Am~~l~e~ 267 (399) T PRK07334 241 IVRRL-VDDILLVDEADIEQAIALLLEI 267 (399) T ss_pred HHHHH-CCCEEEECHHHHHHHHHHHHHH T ss_conf 99962-8977998999999999999996 No 100 >PRK04761 ppnK inorganic polyphosphate/ATP-NAD kinase; Reviewed Probab=31.97 E-value=37 Score=14.66 Aligned_cols=34 Identities=15% Similarity=0.059 Sum_probs=21.0 Q ss_pred CCCCCCCCCEEEEECCCHHHHHHHHHHHHHCCCCC Q ss_conf 67654568789994698579999999999689982 Q gi|254780579|r 30 HIPDHPSVSAIVVLTGEPIRIERAFELLENQIGEK 64 (207) Q Consensus 30 ~~~~~~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ 64 (207) ...+.+.+|.|||+||+-- +..+++.+.....|. T Consensus 19 ~~~~~~eaDliIvlGGDGT-~L~a~r~~~~~~~Pi 52 (246) T PRK04761 19 GDVPIEEADVIVALGGDGF-MLQTLHRYMNSGKPV 52 (246) T ss_pred CCCCHHHCCEEEEECCCHH-HHHHHHHHHCCCCCE T ss_conf 8999657999999898799-999999874149937 No 101 >cd06267 PBP1_LacI_sugar_binding_like Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. In most cases, ligands are monosaccharide including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor. In this case, the domain sugar binding changes the DNA binding activity of the repressor domain. Probab=31.51 E-value=38 Score=14.61 Aligned_cols=134 Identities=11% Similarity=0.026 Sum_probs=64.6 Q ss_pred HHHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHH-HHHHHHHHHCCCCEEEEECCHHH Q ss_conf 9999689982451587654678---9999987651588613322046666602569-99999998649981898511442 Q gi|254780579|r 55 ELLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNA-QEASAWAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 55 ~L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena-~~~~~~l~~~~~~~iiLVTs~yH 130 (207) .|.++|+....+++|....... .+-.+..+.+.+.+........ ...+.+++ ....++++++.--..++++++.. T Consensus 110 ~L~~~G~~~i~~i~~~~~~~~~~~R~~gf~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~ai~~~~D~~ 188 (264) T cd06267 110 HLIELGHRRIAFIGGPPDLSTARERLEGYREALEEAGIPLDEELIVE-GDFSEESGYEAARELLASGERPTAIFAANDLM 188 (264) T ss_pred HHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCHHEEEC-CCCCHHHHHHHHHHHHHCCCCCCEEEECCHHH T ss_conf 99871888568963898770699999999999998699988530752-78977999999999996599998998788799 Q ss_pred HHHHHHHHHHHC---C-CCEEEEEECCCCCCCCCCCCCCHH--HHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 199999999862---9-987999641676667542116989--999999999999999998760556766 Q gi|254780579|r 131 MPRTFLELQRIN---S-TVQFIPYPIISHDLEENSSIFKIK--ILRVLLIEYLKILLLSIQLSLSTQTAS 194 (207) Q Consensus 131 m~RA~~~f~~~~---p-~i~i~~~pv~~~~~~~~~Ww~~~~--~~~~~~~Ey~K~l~~~lr~~i~~~~~~ 194 (207) ...+...+++.+ | ++.++.+- +.+.. .+..+. +...=..|..+..+-.|...+.-...+ T Consensus 189 A~g~~~~l~~~g~~iP~di~Iig~d--~~~~~---~~~~p~lttv~~~~~~ig~~a~~~l~~~i~~~~~~ 253 (264) T cd06267 189 AIGALRALRELGLRVPEDVSVVGFD--DIPLA---ELLTPPLTTVRQPIEEMGRAAAELLLERIEGGDAP 253 (264) T ss_pred HHHHHHHHHHCCCCCCCCEEEEEEC--CHHHH---HCCCCCCEEEEECHHHHHHHHHHHHHHHHCCCCCC T ss_conf 9999999998299999977999999--84898---53489947999099999999999999996597999 No 102 >TIGR02152 D_ribokin_bact ribokinase; InterPro: IPR011877 This entry describes ribokinase, an enzyme catalyzing the first step in ribose catabolism. The rbsK gene encoding ribokinase typically is found with ribose transport genes. Ribokinase belongs to the carbohydrate kinase pfkB family. ; GO: 0004747 ribokinase activity, 0006014 D-ribose metabolic process. Probab=31.35 E-value=38 Score=14.59 Aligned_cols=29 Identities=17% Similarity=0.160 Sum_probs=15.7 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 466666025699999999864998189851 Q gi|254780579|r 97 GYKALNTEGNAQEASAWAEKNNFHHVLIVT 126 (207) Q Consensus 97 e~~s~~T~ena~~~~~~l~~~~~~~iiLVT 126 (207) -+..-+|.|++...+++|.++|++ .++|| T Consensus 195 tg~~v~~~~~~~~AA~~L~~~G~~-~VIiT 223 (303) T TIGR02152 195 TGIEVTDEEDAEKAAEKLLEKGVK-NVIIT 223 (303) T ss_pred CCCCCCCHHHHHHHHHHHHHCCCE-EEEEE T ss_conf 399878989999999999867840-69886 No 103 >PRK08526 threonine dehydratase; Provisional Probab=31.28 E-value=38 Score=14.59 Aligned_cols=95 Identities=15% Similarity=0.084 Sum_probs=46.0 Q ss_pred CCCCEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCC---CHHHEECCCCCCCHHHHHHHHH Q ss_conf 5687899946985799999999996899824515876546789999987651588---6133220466666025699999 Q gi|254780579|r 35 PSVSAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDL---AECCIDIGYKALNTEGNAQEAS 111 (207) Q Consensus 35 ~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i---~~~~I~~e~~s~~T~ena~~~~ 111 (207) +..|+|+|.-||-+ +..|+..+-+...|.+=+-|+.+....+ |...+..... +....+.|.-+...-+ ..+- T Consensus 167 ~d~D~vvvpvGGGG-LisGia~a~K~~~P~ikViGVEpe~a~~--m~~Sl~~g~~v~~~~~~tiaDGlav~~~g--~~tf 241 (403) T PRK08526 167 SDLDMIVVPVGGGG-LISGIASAAKQINPNIKIIGVGAKGAPA--MKESFHAKKIKNSKSVRTIADGIAVRDAS--PITL 241 (403) T ss_pred CCCCEEEECCCCCC-HHHHHHHHHHHHCCCCCEEEECCCCCHH--HHHHHHCCCCEECCCCCCEECCCCCCCCC--HHHH T ss_conf 68998998588861-6899999998629988389966478768--99998769945348878240342357666--2569 Q ss_pred HHHHHCCCCEEEEECCHHHHHHHHH Q ss_conf 9998649981898511442199999 Q gi|254780579|r 112 AWAEKNNFHHVLIVTHDYHMPRTFL 136 (207) Q Consensus 112 ~~l~~~~~~~iiLVTs~yHm~RA~~ 136 (207) +++++. +..+++||.+- +.+|+. T Consensus 242 ~i~~~~-vD~iv~V~e~e-I~~A~~ 264 (403) T PRK08526 242 AIILEC-VDDFVQVDDEE-IANAIL 264 (403) T ss_pred HHHHHC-CCCEEEECHHH-HHHHHH T ss_conf 999970-89679979999-999999 No 104 >COG0848 ExbD Biopolymer transport protein [Intracellular trafficking and secretion] Probab=31.09 E-value=38 Score=14.57 Aligned_cols=57 Identities=16% Similarity=0.060 Sum_probs=40.2 Q ss_pred CCHHHHHHHHHHHCCC-CHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHH Q ss_conf 6789999987651588-61332204666660256999999998649981898511442 Q gi|254780579|r 74 VSKDILLQKIPIRQDL-AECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 74 ~~~~~~~~~~~~~~~i-~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yH 130 (207) .+..++........+. .+..|.+.-.....|+......+.+++.|++++-|+|.+-+ T Consensus 79 v~~~~l~~~l~~~~~~~~~~~v~i~aD~~v~y~~vv~vm~~l~~aG~~~v~L~t~~~~ 136 (137) T COG0848 79 VSLEELEAALAALAKGKKNPRVVIRADKNVKYGTVVKVMDLLKEAGFKKVGLVTEKKK 136 (137) T ss_pred CCHHHHHHHHHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEEECCCC T ss_conf 2589999999998547889619998378888889999999999869846999954778 No 105 >KOG3085 consensus Probab=30.78 E-value=39 Score=14.53 Aligned_cols=38 Identities=18% Similarity=-0.004 Sum_probs=23.0 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHH Q ss_conf 66660256999999998649981898511442199999999 Q gi|254780579|r 99 KALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQ 139 (207) Q Consensus 99 ~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~ 139 (207) ....+.++++.+.+.+++.| .+|.+..++-+|...+.. T Consensus 110 ~~~~~~~~~~~~lq~lR~~g---~~l~iisN~d~r~~~~l~ 147 (237) T KOG3085 110 SAWKYLDGMQELLQKLRKKG---TILGIISNFDDRLRLLLL 147 (237) T ss_pred CCCEECCHHHHHHHHHHHCC---EEEEEECCCCHHHHHHHH T ss_conf 57211637999999998679---399995277277777764 No 106 >cd01994 Alpha_ANH_like_IV This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This subfamily of proteins is predicted to bind ATP. This domainhas a strongly conserved motif SGGKD at the N terminus. Probab=30.69 E-value=39 Score=14.52 Aligned_cols=103 Identities=12% Similarity=0.111 Sum_probs=63.7 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCCCEECCCCC---C----CCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHH Q ss_conf 899946985799999999996899824515876---5----467899999876515886133220466666025699999 Q gi|254780579|r 39 AIVVLTGEPIRIERAFELLENQIGEKIFISGVH---H----SVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEAS 111 (207) Q Consensus 39 aIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~---~----~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~ 111 (207) +++-.+||.+=........++|.....+++=.. + +.-..++.+..+...|+|-..+.....-.+-.++..... T Consensus 2 v~~l~SGGKDS~lAl~~a~~~g~~v~~L~~~~~~~~~~~~~H~~~~~l~~~qAealgiPl~~~~~~~~~~~~~~~l~~~L 81 (194) T cd01994 2 VVALISGGKDSCYALYRALEEGHEVVALLNLTPEEGSSMMYHTVNHELLELQAEAMGIPLIRIEISGEEEDEVEDLKELL 81 (194) T ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEEEEECCCCCEEECCCCCHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHH T ss_conf 89997785999999999998699259999996399880525055789999999985996699966898717999999999 Q ss_pred HHHHHCCCCEEEE--ECCHHHHHHHHHHHHHH Q ss_conf 9998649981898--51144219999999986 Q gi|254780579|r 112 AWAEKNNFHHVLI--VTHDYHMPRTFLELQRI 141 (207) Q Consensus 112 ~~l~~~~~~~iiL--VTs~yHm~RA~~~f~~~ 141 (207) +-++++|++.++- +-|.||..|-....++. T Consensus 82 ~~~k~~gi~~vv~GdI~s~~qr~~~e~~c~~l 113 (194) T cd01994 82 RKLKEEGVDAVVFGAILSEYQRTRVERVCERL 113 (194) T ss_pred HHHHHCCCCEEEECCCCCHHHHHHHHHHHHHC T ss_conf 99997599599999633288999999999973 No 107 >TIGR00957 MRP_assoc_pro multi drug resistance-associated protein (MRP); InterPro: IPR005292 ABC transporters belong to the ATP-Binding Cassette (ABC) superfamily, which uses the hydrolysis of ATP to energize diverse biological systems. ABC transporters are minimally constituted of two conserved regions: a highly conserved ATP binding cassette (ABC) and a less conserved transmembrane domain (TMD). These regions can be found on the same protein or on two different ones. Most ABC transporters function as a dimer and therefore are constituted of four domains, two ABC modules and two TMDs. ABC transporters are involved in the export or import of a wide variety of substrates ranging from small ions to macromolecules. The major function of ABC import systems is to provide essential nutrients to bacteria. They are found only in prokaryotes and their four constitutive domains are usually encoded by independent polypeptides (two ABC proteins and two TMD proteins). Prokaryotic importers require additional extracytoplasmic binding proteins (one or more per systems) for function. In contrast, export systems are involved in the extrusion of noxious substances, the export of extracellular toxins and the targeting of membrane components. They are found in all living organisms and in general the TMD is fused to the ABC module in a variety of combinations. Some eukaryotic exporters encode the four domains on the same polypeptide chain . The ABC module (approximately two hundred amino acid residues) is known to bind and hydrolyze ATP, thereby coupling transport to ATP hydrolysis in a large number of biological processes. The cassette is duplicated in several subfamilies. Its primary sequence is highly conserved, displaying a typical phosphate-binding loop: Walker A, and a magnesium binding site: Walker B. Besides these two regions, three other conserved motifs are present in the ABC cassette: the switch region which contains a histidine loop, postulated to polarize the attaching water molecule for hydrolysis, the signature conserved motif (LSGGQ) specific to the ABC transporter, and the Q-motif (between Walker A and the signature), which interacts with the gamma phosphate through a water bond. The Walker A, Walker B, Q-loop and switch region form the nucleotide binding site , , . The 3D structure of a monomeric ABC module adopts a stubby L-shape with two distinct arms. ArmI (mainly beta-strand) contains Walker A and Walker B. The important residues for ATP hydrolysis and/or binding are located in the P-loop. The ATP-binding pocket is located at the extremity of armI. The perpendicular armII contains mostly the alpha helical subdomain with the signature motif. It only seems to be required for structural integrity of the ABC module. ArmII is in direct contact with the TMD. The hinge between armI and armII contains both the histidine loop and the Q-loop, making contact with the gamma phosphate of the ATP molecule. ATP hydrolysis leads to a conformational change that could facilitate ADP release. In the dimer the two ABC cassettes contact each other through hydrophobic interactions at the antiparallel beta-sheet of armI by a two-fold axis , , , , , . Proteins known to belong to this family are classified in several functional subfamilies depending on the substrate used (for further information see http://www.tcdb.org/tcdb/index.php?tc=3.A.1). This family consists of multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to the ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux reduced its function. The C-terminal domain seems to be highly conserved. ; GO: 0005215 transporter activity, 0006810 transport, 0016021 integral to membrane. Probab=30.59 E-value=27 Score=15.45 Aligned_cols=15 Identities=40% Similarity=0.629 Sum_probs=11.9 Q ss_pred CCCCCCEEEEECCCH Q ss_conf 545687899946985 Q gi|254780579|r 33 DHPSVSAIVVLTGEP 47 (207) Q Consensus 33 ~~~~~DaIVVLgGg~ 47 (207) =.|+.|.||||+||+ T Consensus 827 yLPQ~D~IIVl~~G~ 841 (1542) T TIGR00957 827 YLPQVDVIIVLSGGK 841 (1542) T ss_pred CCCCCCEEEEECCCE T ss_conf 156437899844874 No 108 >TIGR02685 pter_reduc_Leis pteridine reductase; InterPro: IPR014058 Pteridine reductase is an enzyme used by trypanosomatids (including Trypanosoma cruzi and Leishmania major) to obtain reduced pteridines by salvage rather than biosynthetic pathways. Enzymes in T. cruzi described as pteridine reductase 1 (PTR1) and pteridine reductase 2 (PTR2) have different activity profiles. PTR1 is more active with fully oxidized biopterin and folate than with reduced forms, while PTR2 reduces dihydrobiopterin and dihydrofolate but not oxidized pteridines. T. cruzi PTR1 and PTR2 are more similar to each other in sequence than either is to the pteridine reductase of Leishmania major, and all are included in this family.. Probab=30.41 E-value=32 Score=15.07 Aligned_cols=50 Identities=20% Similarity=0.182 Sum_probs=26.9 Q ss_pred EECCCHHHHHH--HHHHHHHCCCCCEECCCCCCCCCHHHHHHHHH-HHCCCCHHHEEC Q ss_conf 94698579999--99999968998245158765467899999876-515886133220 Q gi|254780579|r 42 VLTGEPIRIER--AFELLENQIGEKIFISGVHHSVSKDILLQKIP-IRQDLAECCIDI 96 (207) Q Consensus 42 VLgGg~~Ri~~--a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~-~~~~i~~~~I~~ 96 (207) |.||+.+|+-+ |.+|+++||.-.|= .+.+++....... .+..-+...|.. T Consensus 5 ~vTGaAkRiG~sIAv~LH~~GyrVv~H-----Yh~Sa~aA~~LaaeLNa~R~nsAv~~ 57 (283) T TIGR02685 5 VVTGAAKRIGRSIAVKLHQEGYRVVVH-----YHRSAAAASTLAAELNAERANSAVVC 57 (283) T ss_pred EEECHHHHHHHHHHHHHHHCCCEEEEE-----ECCCHHHHHHHHHHHHHCCCCCEEEE T ss_conf 650025552189999985089889994-----02456789999999734078976999 No 109 >pfam05044 Prox1 Homeobox prospero-like protein (PROX1). The homeobox gene Prox1 is expressed in a subpopulation of endothelial cells that, after budding from veins, gives rise to the mammalian lymphatic system. Prox1 has been found to be an early specific marker for the developing liver and pancreas in the mammalian foregut endoderm. This family contains an atypical homeobox domain. Probab=29.90 E-value=31 Score=15.11 Aligned_cols=45 Identities=9% Similarity=0.046 Sum_probs=25.2 Q ss_pred CHHHHHHHHHHHH-----------HHCCCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHH Q ss_conf 1442199999999-----------862998799964167666754211698999999999 Q gi|254780579|r 127 HDYHMPRTFLELQ-----------RINSTVQFIPYPIISHDLEENSSIFKIKILRVLLIE 175 (207) Q Consensus 127 s~yHm~RA~~~f~-----------~~~p~i~i~~~pv~~~~~~~~~Ww~~~~~~~~~~~E 175 (207) ++.|+|.|++.|= -.||||+|--.-+.. --.|+.+++-|-++-.| T Consensus 751 TPmHLRKAKLMFFwvRYPSSavLKMYFpDIKFNKnnTAQ----LVKWFSNFREFYYIQME 806 (905) T pfam05044 751 TPMHLRKAKLMFFYVRYPSSAVLKMYFPDIKFNKNNTAQ----LVKWFSNFREFYYIQME 806 (905) T ss_pred CHHHHHHHHHEEEEEECCCHHHHHHHCCCCCCCCCCHHH----HHHHHHHHHHHHHHHHH T ss_conf 746665523214664155267787636634446531899----99998631689999899 No 110 >pfam00205 TPP_enzyme_M Thiamine pyrophosphate enzyme, central domain. The central domain of TPP enzymes contains a 2-fold Rossman fold. Probab=29.65 E-value=41 Score=14.42 Aligned_cols=41 Identities=20% Similarity=0.208 Sum_probs=22.0 Q ss_pred HHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCH Q ss_conf 99999999689982451587654678999998765158861 Q gi|254780579|r 51 ERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAE 91 (207) Q Consensus 51 ~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~ 91 (207) ++++++.++...|.|++.++.......+..+++.+..++|- T Consensus 2 ~~~~~~l~~AkrPvii~G~g~~~~~a~~~l~~lae~~~~Pv 42 (138) T pfam00205 2 EKAAELLAAAKRPVILVGGGVRRSGASEELRALAEKLGIPV 42 (138) T ss_pred HHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCE T ss_conf 89999999689919998978352218999999999849987 No 111 >TIGR01825 gly_Cac_T_rel pyridoxal phosphate-dependent acyltransferase, putative; InterPro: IPR010962 Pyridoxal phosphate is the active form of vitamin B6 (pyridoxine or pyridoxal). PLP is a versatile catalyst, acting as a coenzyme in a multitude of reactions, including decarboxylation, deamination and transamination , , . PLP-dependent enzymes are primarily involved in the biosynthesis of amino acids and amino acid-derived metabolites, but they are also found in the biosynthetic pathways of amino sugars and in the synthesis or catabolism of neurotransmitters; pyridoxal phosphate can also inhibit DNA polymerases and several steroid receptors . Inadequate levels of pyridoxal phosphate in the brain can cause neurological dysfunction, particularly epilepsy . PLP enzymes exist in their resting state as a Schiff base, the aldehyde group of PLP forming a linkage with the epsilon-amino group of an active site lysine residue on the enzyme. The alpha-amino group of the substrate displaces the lysine epsilon-amino group, in the process forming a new aldimine with the substrate. This aldimine is the common central intermediate for all PLP-catalyzed reactions, enzymatic and non-enzymatic . This entry contains a number of enzyme families: Serine palmitoyltransferase Glycine C-acetyltransferase (2-amino-3-ketobutyrate coenzyme A ligase) 5-aminolevulinic acid synthase 8-amino-7-oxononanoate synthase All transfer the R-group (acetyl, succinyl, or 6-carboxyhexanoyl) from coenzyme A to an amino acid (Gly, Gly, Ala, respectively), with release of CO2 for the latter two reactions.; GO: 0016740 transferase activity, 0030170 pyridoxal phosphate binding. Probab=29.64 E-value=41 Score=14.41 Aligned_cols=129 Identities=18% Similarity=0.105 Sum_probs=70.0 Q ss_pred CEEEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHH--HHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHH Q ss_conf 78999469857999999999968998245158765467899999--8765158861332204666660256999999998 Q gi|254780579|r 38 SAIVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQ--KIPIRQDLAECCIDIGYKALNTEGNAQEASAWAE 115 (207) Q Consensus 38 DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~--~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~ 115 (207) -||=|+||=.-=-...++++++.--|.+|.|.-.|....|-..+ ....+ +..-.-=|+|+.|.++.|. T Consensus 239 KA~GvvGGY~AG~~~lidyL~~~aRPFLFSta~~Pa~~~A~~~Av~~l~~~----------~~l~~kLW~Nt~ylk~~L~ 308 (392) T TIGR01825 239 KAIGVVGGYVAGSKELIDYLKARARPFLFSTALPPAVVAALIEAVDVLQES----------PELMEKLWDNTRYLKAGLS 308 (392) T ss_pred HHHCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHCCC----------CCHHHHHHHHHHHHHHHHH T ss_conf 322256642348889999998628986012578807899999999987079----------3147776788999998767 Q ss_pred HCCCC---------EEEEECCHHHHHHHHHHHHHHCCCC--EEEEEECCCCCCCCCCCC----CCHHHHHHHHHHHHH Q ss_conf 64998---------1898511442199999999862998--799964167666754211----698999999999999 Q gi|254780579|r 116 KNNFH---------HVLIVTHDYHMPRTFLELQRINSTV--QFIPYPIISHDLEENSSI----FKIKILRVLLIEYLK 178 (207) Q Consensus 116 ~~~~~---------~iiLVTs~yHm~RA~~~f~~~~p~i--~i~~~pv~~~~~~~~~Ww----~~~~~~~~~~~Ey~K 178 (207) +-|+. -|++=--.-=---++..|.+. |+ +-+.+|+.+.+..+=.-- ++.+.+..++.=|-| T Consensus 309 ~LGydl~~S~tPItPv~~Gde~~A~efS~rL~~~y--GvFa~sivFPTVp~g~ARiR~~~tA~HTkd~L~~al~aye~ 384 (392) T TIGR01825 309 KLGYDLGESETPITPVLIGDEKVAQEFSRRLLDEY--GVFAQSIVFPTVPRGTARIRNIPTAEHTKDDLDQALDAYEK 384 (392) T ss_pred HHCCCCCCCCCCCCCEEECCCHHHHHHHHHHHHHC--CEEEEEEECCCCCCCCCCCCCCCCHHCCHHHHHHHHHHHHH T ss_conf 61641147668612458765156899999877752--90899842165669884212465613026679999999998 No 112 >TIGR01923 menE O-succinylbenzoate-CoA ligase; InterPro: IPR010192 This entry represents the enzyme O-succinylbenzoate-CoA ligase (MenE), which is involved in the fourth step of the menaquinone (vitamin K2) biosynthesis pathway. In bacteria, menaquinone is used during fumarate reduction in anaerobic respiration. In green sulphur bacteria and heliobacteria, menaquinones are thought to be used as a loosely bound secondary electron acceptor in the photosynthetic reaction centre. In cyanobacteria, the product of the menaquinone pathway is phylloquinone (2-methyl-3-phytyl-1,4-naphthoquinone), a molecule used exclusively as an electron transfer cofactor in Photosystem (PS) I. .; GO: 0008756 o-succinylbenzoate-CoA ligase activity, 0009234 menaquinone biosynthetic process. Probab=29.43 E-value=8 Score=18.73 Aligned_cols=47 Identities=11% Similarity=0.022 Sum_probs=26.6 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 60256999999998649981898511442199999999862998799 Q gi|254780579|r 102 NTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRINSTVQFI 148 (207) Q Consensus 102 ~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~p~i~i~ 148 (207) |=|.+|.-+++-|.-..-.+++++=-=||.-=--++||....|..+. T Consensus 173 nHy~sA~g~~~~L~f~~~D~WL~~LPLyHvSG~sil~r~l~~G~tl~ 219 (490) T TIGR01923 173 NHYESAVGSAENLGFTESDNWLLSLPLYHVSGLSILFRWLLEGATLK 219 (490) T ss_pred HHHHHHHHHHHHCCCCCCCCEEEECCHHHHHHHHHHHHHHHCCEEEE T ss_conf 89999999998568988875245420456526999999875474788 No 113 >PRK10294 6-phosphofructokinase 2; Provisional Probab=29.27 E-value=41 Score=14.37 Aligned_cols=13 Identities=15% Similarity=-0.100 Sum_probs=5.8 Q ss_pred HHHHHHHHHHCCC Q ss_conf 9999999996899 Q gi|254780579|r 50 IERAFELLENQIG 62 (207) Q Consensus 50 i~~a~~L~~~g~~ 62 (207) ++.|..+-+.|-. T Consensus 42 ~NvA~al~~LG~~ 54 (309) T PRK10294 42 INVARAIAHLGGS 54 (309) T ss_pred HHHHHHHHHCCCC T ss_conf 9999999986998 No 114 >PRK09162 hypoxanthine-guanine phosphoribosyltransferase; Provisional Probab=28.92 E-value=42 Score=14.34 Aligned_cols=88 Identities=13% Similarity=0.056 Sum_probs=52.0 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCCC----EECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHH Q ss_conf 89994698579999999999689982----45158765467899999876515886133220466666025699999999 Q gi|254780579|r 39 AIVVLTGEPIRIERAFELLENQIGEK----IFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWA 114 (207) Q Consensus 39 aIVVLgGg~~Ri~~a~~L~~~g~~~~----ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l 114 (207) .|.||.|+- ..+.+|.++=..+. +-+|......+..++.........+....|++-..--+|=....+..+++ T Consensus 44 iI~vL~Ga~---~F~adL~r~l~~~~~vdfi~~sSY~~~~ssg~v~~~~~~~~~i~gk~VLiVeDIvDTG~Tl~~l~~~l 120 (181) T PRK09162 44 VLCVMNGGL---VFTGQLLPRLDFPLEFDYLHATRYRNETTGGELVWKVKPRESLKGRTVLVVDDILDEGHTLAAIRDWC 120 (181) T ss_pred EEEEECCCH---HHHHHHHHHCCCCEEEEEEEEECCCCCCCCCCEEEEECCCCCCCCCEEEEEEEEECCHHHHHHHHHHH T ss_conf 999957849---99999998579983777778612489775784799735775668986999931876279999999999 Q ss_pred HHCCCCEEEEECCHH Q ss_conf 864998189851144 Q gi|254780579|r 115 EKNNFHHVLIVTHDY 129 (207) Q Consensus 115 ~~~~~~~iiLVTs~y 129 (207) +.+|-+|+-++|=-. T Consensus 121 ~~~gp~Svki~~Lld 135 (181) T PRK09162 121 LEMGAAEVYSAVLVD 135 (181) T ss_pred HHCCCCEEEEEEEEE T ss_conf 704998899999996 No 115 >COG1234 ElaC Metal-dependent hydrolases of the beta-lactamase superfamily III [General function prediction only] Probab=28.72 E-value=42 Score=14.32 Aligned_cols=91 Identities=15% Similarity=0.118 Sum_probs=45.6 Q ss_pred HHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCC---CCCH-HHHHHHHHHHHHHCCCCEEEE--E Q ss_conf 999999968998245158765467899999876515886133220466---6660-256999999998649981898--5 Q gi|254780579|r 52 RAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYK---ALNT-EGNAQEASAWAEKNNFHHVLI--V 125 (207) Q Consensus 52 ~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~---s~~T-~ena~~~~~~l~~~~~~~iiL--V 125 (207) .+..+-....+..|.+||+.... +..+......+.=+-++ ...+.. +..- .-++.+++++.++.|.++++| + T Consensus 180 ~~~~~~~~~~G~~v~ysGDT~p~-~~~~~~a~~aDlLiHEa-t~~~~~~~~a~~~~HsT~~eAa~iA~~A~vk~LiLtH~ 257 (292) T COG1234 180 PADRIGEPRKGKSVVYSGDTRPC-DELIDLAKGADLLIHEA-TFEDDLEDLANEGGHSTAEEAAEIAKEAGVKKLILTHF 257 (292) T ss_pred HHHEEECCCCCEEEEEECCCCCC-HHHHHHHCCCCEEEEEE-ECCCCHHHHHCCCCCCCHHHHHHHHHHCCCCEEEEEEE T ss_conf 88853058897099997999777-68999766898999961-24765134404479999999999999759886999822 Q ss_pred CCHH--HHHHHHHHHHHHCCC Q ss_conf 1144--219999999986299 Q gi|254780579|r 126 THDY--HMPRTFLELQRINST 144 (207) Q Consensus 126 Ts~y--Hm~RA~~~f~~~~p~ 144 (207) +..| +-..-+...++.+|+ T Consensus 258 s~ry~~~~~~~~~ea~~~f~~ 278 (292) T COG1234 258 SPRYPKDDEELLKEARAIFPG 278 (292) T ss_pred CCCCCCHHHHHHHHHHHHCCC T ss_conf 766644079999999985899 No 116 >cd06296 PBP1_CatR_like Ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation. This group includes the ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation. This group belongs to the the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=28.49 E-value=43 Score=14.29 Aligned_cols=95 Identities=8% Similarity=-0.000 Sum_probs=44.3 Q ss_pred HHHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHH-HHHHHHHHHCCCCEEEEECCHHH Q ss_conf 9999689982451587654678---9999987651588613322046666602569-99999998649981898511442 Q gi|254780579|r 55 ELLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNA-QEASAWAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 55 ~L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena-~~~~~~l~~~~~~~iiLVTs~yH 130 (207) .|.++|+....+++|....... -+-.+..+.+.|++....... ....+.+.+ ..+.++++..+--..++++++.. T Consensus 111 ~L~~~G~~~I~~i~~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~i~-~~~~~~~~~~~~~~~ll~~~~~~~aii~~~D~~ 189 (270) T cd06296 111 HLLELGHRRIGFITGPPDLLCSRARLDGYRAALAEAGIPVDPALVR-EGDFSTESGFRAAAELLALPERPTAIFAGNDLM 189 (270) T ss_pred HHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHEE-ECCCHHHHHHHHHHHHHHCCCCCCCEEECCCHH T ss_conf 9998589728884599886559999999999999869998855533-065017899999999995499886203478689 Q ss_pred HHHHHHHHHHHC---C-CCEEEEE Q ss_conf 199999999862---9-9879996 Q gi|254780579|r 131 MPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 131 m~RA~~~f~~~~---p-~i~i~~~ 150 (207) ...+....++.+ | ++.++.+ T Consensus 190 A~g~~~~l~~~gi~vP~di~iigf 213 (270) T cd06296 190 ALGVYEAARERGLRIPEDLSVVGF 213 (270) T ss_pred HHHHHHHHHHCCCCCCCCCEEEEE T ss_conf 999999999869999998879997 No 117 >TIGR01816 sdhA_forward succinate dehydrogenase, flavoprotein subunit; InterPro: IPR011281 Succinate dehydrogenase and fumarate reductase are homologous enzymes reversible in principle but favoured under different circumstances. This entry represents a narrowly defined clade of the succinate dehydrogenase flavoprotein subunit as found in mitochondria, in Rickettsia, in Escherichia coli and other proteobacteria, and in a few other lineages. However, excluded are all known fumarate reductases. It also excludes putative succinate dehydrogenases that appear to diverged before the split between E. coli succinate dehydrogenase and fumarate reductase.; GO: 0016627 oxidoreductase activity acting on the CH-CH group of donors, 0050660 FAD binding, 0006099 tricarboxylic acid cycle. Probab=28.45 E-value=43 Score=14.29 Aligned_cols=16 Identities=38% Similarity=0.399 Sum_probs=8.2 Q ss_pred CCEEEEECCCHHHHHH Q ss_conf 8789994698579999 Q gi|254780579|r 37 VSAIVVLTGEPIRIER 52 (207) Q Consensus 37 ~DaIVVLgGg~~Ri~~ 52 (207) |-|+|+=+||.+|+-. T Consensus 196 AKaVvlATGG~GR~Y~ 211 (615) T TIGR01816 196 AKAVVLATGGYGRIYK 211 (615) T ss_pred EEEEEEEECCCCHHHH T ss_conf 4107897378640567 No 118 >PRK11543 gutQ D-arabinose 5-phosphate isomerase; Provisional Probab=28.33 E-value=43 Score=14.27 Aligned_cols=41 Identities=10% Similarity=-0.067 Sum_probs=20.7 Q ss_pred HHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHH Q ss_conf 3322046666602569999999986499818985114421999 Q gi|254780579|r 92 CCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRT 134 (207) Q Consensus 92 ~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA 134 (207) +.+++-.+|-+|.| .....+++++.+. .++-+|++-...=| T Consensus 91 D~~i~~S~SG~t~E-l~~~~~~~k~~~~-~ii~it~~~~S~La 131 (321) T PRK11543 91 DVMLFISYSGGAKE-LDLIIPRLEDKSI-ALLAMTGKPTSPLG 131 (321) T ss_pred CEEEEEECCCCCHH-HHHHHHHHHHCCC-CEEEEECCCCCHHH T ss_conf 98999958988177-8877278876698-68999789999768 No 119 >PRK12595 bifunctional 3-deoxy-7-phosphoheptulonate synthase/chorismate mutase; Reviewed Probab=28.16 E-value=42 Score=14.32 Aligned_cols=35 Identities=6% Similarity=0.013 Sum_probs=18.0 Q ss_pred EECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCH Q ss_conf 22046666602569999999986499818985114 Q gi|254780579|r 94 IDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHD 128 (207) Q Consensus 94 I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~ 128 (207) |.+-.--..|.++-..+++++...|..+++|+--. T Consensus 228 VLlKrg~~ati~ewl~AaEyi~~~Gn~~vilceRG 262 (360) T PRK12595 228 VLLKRGLSATIEEFIYAAEYIMSQGNDQIILCERG 262 (360) T ss_pred EEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEECC T ss_conf 79607999999999999999986799878999177 No 120 >PRK10270 hypothetical protein; Provisional Probab=28.08 E-value=43 Score=14.25 Aligned_cols=45 Identities=13% Similarity=0.071 Sum_probs=18.6 Q ss_pred CHHHHHHHHHHHHHHH--HHHHHHHHHHCCCCCCCCCCCCEEEEE-CCC Q ss_conf 9179999999999999--999999997213467654568789994-698 Q gi|254780579|r 1 MRYFWYGLFVCLIFFI--MGFISFIRYVKQMHIPDHPSVSAIVVL-TGE 46 (207) Q Consensus 1 mR~~~i~l~~~~~~~~--~~~~~f~~~~~~~~~~~~~~~DaIVVL-gGg 46 (207) ||.+.+.++++++++. +|+..+... .....+.+.+.+.+|.. .|. T Consensus 1 Mkk~~~~~~~l~~~l~i~~g~~~~~~~-~~~~~~~~~~~~vvv~I~~G~ 48 (340) T PRK10270 1 MKKVLLIILLLLVVLGIAAGVGVWKVR-HLADSKLLIKEETIFTLKPGT 48 (340) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCCCCCCCCEEEEECCCC T ss_conf 902899999999999999999998741-002589999988899989999 No 121 >cd05006 SIS_GmhA Phosphoheptose isomerase is a member of the SIS (Sugar ISomerase) superfamily. Phosphoheptose isomerase catalyzes the isomerization of sedoheptulose 7-phosphate into D-glycero-D-mannoheptose 7-phosphate. This is the first step of the biosynthesis of gram-negative bacteria inner core lipopolysaccharide precursor, L-glycero-D-mannoheptose (Gmh). Probab=27.77 E-value=44 Score=14.21 Aligned_cols=60 Identities=8% Similarity=-0.094 Sum_probs=40.4 Q ss_pred CCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHH Q ss_conf 65467899999876515886133220466666025699999999864998189851144219 Q gi|254780579|r 71 HHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMP 132 (207) Q Consensus 71 ~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~ 132 (207) ..+.+..+++.+.....+-+.+.++.-..|.+ -.|...+.++.+++|.+ ++.+|..-.-+ T Consensus 82 ~ND~~~~~if~~ql~~~~~~gDili~iS~sG~-s~nii~a~~~ak~~g~~-~i~ltg~~gg~ 141 (177) T cd05006 82 ANDYGYEEVFSRQVEALGQPGDVLIGISTSGN-SPNVLKALEAAKERGMK-TIALTGRDGGK 141 (177) T ss_pred HCCCCHHHHHHHHHHHHCCCCCEEEEECCCCC-CHHHHHHHHHHHHCCCE-EEEEECCCCCH T ss_conf 56532999999999974468988999818999-98999999999987998-99998789861 No 122 >cd04619 CBS_pair_6 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=27.50 E-value=44 Score=14.18 Aligned_cols=80 Identities=6% Similarity=0.054 Sum_probs=50.6 Q ss_pred HHHHHHHHHHHCCCCCEECC-CCCC---CCCHHHHHHHHHHHCCCCH----HHE-ECCCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 99999999996899824515-8765---4678999998765158861----332-2046666602569999999986499 Q gi|254780579|r 49 RIERAFELLENQIGEKIFIS-GVHH---SVSKDILLQKIPIRQDLAE----CCI-DIGYKALNTEGNAQEASAWAEKNNF 119 (207) Q Consensus 49 Ri~~a~~L~~~g~~~~ii~S-G~~~---~~~~~~~~~~~~~~~~i~~----~~I-~~e~~s~~T~ena~~~~~~l~~~~~ 119 (207) -+.+|+++..++.-.-+++. +.+. -.++.++.+......+.+. ..+ .-+.-+-+--+....+.+.|.++++ T Consensus 11 ti~eA~~~M~~~~ig~l~Vvd~~g~lvGIiTe~Di~r~~~~~~~~~~~~~V~~vMt~~vitv~~~~~i~~a~~~M~~~~i 90 (114) T cd04619 11 TLQRAAKILGEPGIDLVVVCDPHGKLAGVLTKTDVVRQMGRCGGPGCTAPVENVMTRAVVSCRPGDLLHDVWQVMKQRGL 90 (114) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHHCCCCCCCCCHHHHCCCCCEEECCCCCHHHHHHHHHHCCC T ss_conf 39999999987499889999899959999994999999873079866789999615898899998909999998887698 Q ss_pred CEEEEECCH Q ss_conf 818985114 Q gi|254780579|r 120 HHVLIVTHD 128 (207) Q Consensus 120 ~~iiLVTs~ 128 (207) +++.+|-.+ T Consensus 91 ~~lPVVD~~ 99 (114) T cd04619 91 KNIPVVDEN 99 (114) T ss_pred CEEEEEECC T ss_conf 588999079 No 123 >cd01992 PP-ATPase N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]. This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This domain has a strongly conserved motif SGGXD at the N terminus. Probab=27.49 E-value=44 Score=14.18 Aligned_cols=86 Identities=16% Similarity=0.256 Sum_probs=41.5 Q ss_pred EEEEECCCHHHHH--HHHHHHHHCCCCCEECC----CCC-CCCCHHHHHHHHHHHCCCCHHHEE--CCCC-CCCHHHHHH Q ss_conf 8999469857999--99999996899824515----876-546789999987651588613322--0466-666025699 Q gi|254780579|r 39 AIVVLTGEPIRIE--RAFELLENQIGEKIFIS----GVH-HSVSKDILLQKIPIRQDLAECCID--IGYK-ALNTEGNAQ 108 (207) Q Consensus 39 aIVVLgGg~~Ri~--~a~~L~~~g~~~~ii~S----G~~-~~~~~~~~~~~~~~~~~i~~~~I~--~e~~-s~~T~ena~ 108 (207) .+|-++||.+=+. +.+.-++......+..- |-. +...+++..+.++...+++-.... .... ..+..+.|+ T Consensus 2 i~vavSGG~DS~~Ll~~l~~~~~~~~~~l~a~hvdh~lr~~s~~~~~~v~~~~~~~~i~~~i~~~~~~~~~~~~~e~~aR 81 (185) T cd01992 2 ILVAVSGGPDSMALLHLLSELKPRLGLRLVAVHVDHGLRPESDEEAAFVADLCAKLGIPLYILVVALAPKPGGNLEAAAR 81 (185) T ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCHHHHHH T ss_conf 99996784999999999999999749948999981898888889999999999985998899997753678999999999 Q ss_pred H-----HHHHHHHCCCCEEEE Q ss_conf 9-----999998649981898 Q gi|254780579|r 109 E-----ASAWAEKNNFHHVLI 124 (207) Q Consensus 109 ~-----~~~~l~~~~~~~iiL 124 (207) . ..+++++.+...+++ T Consensus 82 ~~Ry~~l~~~~~~~~~~~i~l 102 (185) T cd01992 82 EARYDFFAEIAKEHGADVLLT 102 (185) T ss_pred HHHHHHHHHHHHHHCCCCEEE T ss_conf 999999999998735450420 No 124 >PRK10703 DNA-binding transcriptional repressor PurR; Provisional Probab=27.30 E-value=45 Score=14.16 Aligned_cols=95 Identities=9% Similarity=-0.027 Sum_probs=48.0 Q ss_pred HHHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHH-HHHHHHHHHCCCCEEEEECCHHH Q ss_conf 9999689982451587654678---9999987651588613322046666602569-99999998649981898511442 Q gi|254780579|r 55 ELLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNA-QEASAWAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 55 ~L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena-~~~~~~l~~~~~~~iiLVTs~yH 130 (207) .|++.|+....+++|.....+. .+-....+.+.|++...-.+-... .+.++. ....+++.....-+.+++.++.- T Consensus 172 ~L~~~Ghr~Ia~i~~~~~~~~~~~R~~gf~~al~~~gl~~~~~~i~~~~-~~~~~g~~~~~~ll~~~~~ptAi~~~nD~~ 250 (335) T PRK10703 172 YLIERGHRDIGVIPGPLERNTGAGRLAGFMKAMEEANIKVPEEWIVQGD-FEPESGYEAMQQILSQEHRPTAVFCGGDIM 250 (335) T ss_pred HHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCHHEEEECC-CCHHHHHHHHHHHHHCCCCCCEEEECCHHH T ss_conf 9997599869999588765217899999999999859998813289657-885889999999995699986689687599 Q ss_pred HHHHHHHHHHHC---C-CCEEEEE Q ss_conf 199999999862---9-9879996 Q gi|254780579|r 131 MPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 131 m~RA~~~f~~~~---p-~i~i~~~ 150 (207) .--++..++..+ | |+.++.+ T Consensus 251 A~g~~~~l~~~g~~VP~Disvigf 274 (335) T PRK10703 251 AMGAICAADEMGLRVPQDISVIGY 274 (335) T ss_pred HHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 999999999719999997499998 No 125 >PRK10892 D-arabinose 5-phosphate isomerase; Provisional Probab=27.01 E-value=45 Score=14.13 Aligned_cols=83 Identities=7% Similarity=-0.078 Sum_probs=42.6 Q ss_pred HHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCC-------------------CHHHEECCCCCCCHHHHHHH Q ss_conf 99999999996899824515876546789999987651588-------------------61332204666660256999 Q gi|254780579|r 49 RIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDL-------------------AECCIDIGYKALNTEGNAQE 109 (207) Q Consensus 49 Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i-------------------~~~~I~~e~~s~~T~ena~~ 109 (207) ....|+++....+ -+++++|+++..--+.-...-+.+.|. +++.+++-.+|-+|.| ... T Consensus 35 ~f~~av~~i~~~k-Gkvi~~GvGKSg~ia~KiaaTl~StGtps~flhp~ea~HGDlG~i~~~D~~i~~S~SG~t~E-l~~ 112 (326) T PRK10892 35 DFTLACEKMFWCK-GKVVVMGMGKSGHIGRKMAATFASTGTPSFFVHPGEAAHGDLGMVTPQDVVIAISNSGESSE-ILA 112 (326) T ss_pred HHHHHHHHHHHCC-CCEEEECCCHHHHHHHHHHHHHHCCCCCCEECCHHHCCCCCCCCCCCCCEEEEEECCCCCHH-HHH T ss_conf 8999999998289-98899838657999999999986548751545743406465567778998999958987577-876 Q ss_pred HHHHHHHCCCCEEEEECCHHHHHHH Q ss_conf 9999986499818985114421999 Q gi|254780579|r 110 ASAWAEKNNFHHVLIVTHDYHMPRT 134 (207) Q Consensus 110 ~~~~l~~~~~~~iiLVTs~yHm~RA 134 (207) ..+++++.+. .++-+|++-+..=| T Consensus 113 ll~~~~~~~~-~iI~it~~~~S~l~ 136 (326) T PRK10892 113 LIPVLKRLHV-PLICITGRPESSMA 136 (326) T ss_pred HHHHHHHCCC-CEEEEECCCCCHHH T ss_conf 4178776698-58999569999668 No 126 >COG2086 FixA Electron transfer flavoprotein, beta subunit [Energy production and conversion] Probab=26.89 E-value=45 Score=14.12 Aligned_cols=69 Identities=16% Similarity=0.193 Sum_probs=40.9 Q ss_pred HHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECC---CCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 9999999999689982451587654678999998765158861332204---66666025699999999864998 Q gi|254780579|r 49 RIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIG---YKALNTEGNAQEASAWAEKNNFH 120 (207) Q Consensus 49 Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e---~~s~~T~ena~~~~~~l~~~~~~ 120 (207) =+++|++|-.+++...+..-..++...+..+ ..+..+|..+. |++. ...-+|+.-|.-.+..+++.+.. T Consensus 42 AvEeAlrLke~~~~~eV~vlt~Gp~~a~~~l--r~aLAmGaDra-ili~d~~~~~~d~~~ta~~Laa~~~~~~~~ 113 (260) T COG2086 42 AVEEALRLKEKGYGGEVTVLTMGPPQAEEAL--REALAMGADRA-ILITDRAFAGADPLATAKALAAAVKKIGPD 113 (260) T ss_pred HHHHHHHHHCCCCCCEEEEEEECCHHHHHHH--HHHHHCCCCEE-EEEECCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 9999998614688866999994645339999--99985488759-997032236755899999999999874998 No 127 >pfam06508 ExsB ExsB. This family includes putative transcriptional regulators from Bacteria and Archaea. Probab=26.87 E-value=45 Score=14.11 Aligned_cols=53 Identities=11% Similarity=-0.051 Sum_probs=24.6 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCCCEECCC-CCCCCCHHHHHHHHHHHCCCCH Q ss_conf 8999469857999999999968998245158-7654678999998765158861 Q gi|254780579|r 39 AIVVLTGEPIRIERAFELLENQIGEKIFISG-VHHSVSKDILLQKIPIRQDLAE 91 (207) Q Consensus 39 aIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG-~~~~~~~~~~~~~~~~~~~i~~ 91 (207) ++|+|+||.+=.-.+....++|+...-+.-. +..+..|-+.++..+...+++- T Consensus 2 avvl~SGG~DSt~~l~~a~~~~~~v~ait~dYGQ~~~~Ei~~A~~ia~~l~i~h 55 (137) T pfam06508 2 AVVLLSGGLDSTTCLAWAKKEGYEVYALTFDYGQRHSKELECAKKIAKALGVEH 55 (137) T ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCCCC T ss_conf 899917878999999999986996899981478873999999999999829975 No 128 >PRK06931 diaminobutyrate--2-oxoglutarate aminotransferase; Provisional Probab=26.71 E-value=46 Score=14.10 Aligned_cols=28 Identities=14% Similarity=0.055 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCHHHHH Q ss_conf 025699999999864998189851144219 Q gi|254780579|r 103 TEGNAQEASAWAEKNNFHHVLIVTHDYHMP 132 (207) Q Consensus 103 T~ena~~~~~~l~~~~~~~iiLVTs~yHm~ 132 (207) -.|.|...++.- .+=++|+-....||-+ T Consensus 128 AvE~AlKlAr~~--tgR~~iis~~~sYHG~ 155 (454) T PRK06931 128 AVEAAIKLAKTY--TGRSNVISFSGGYHGM 155 (454) T ss_pred HHHHHHHHHHHH--CCCCEEEEEECCCCCC T ss_conf 999999998760--6998199980688997 No 129 >PRK11269 glyoxylate carboligase; Provisional Probab=26.70 E-value=46 Score=14.10 Aligned_cols=49 Identities=16% Similarity=0.140 Sum_probs=35.3 Q ss_pred CCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHH Q ss_conf 6985799999999996899824515876546789999987651588613 Q gi|254780579|r 44 TGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAEC 92 (207) Q Consensus 44 gGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~ 92 (207) ......+++++++.++...|.|+..|+-......+....+.+..++|-. T Consensus 187 ~~~~~~i~~a~~lL~~AkrPvIi~G~G~~~~~a~~~l~~lae~l~iPV~ 235 (591) T PRK11269 187 AATRAQIEKALAMLNAAERPLIVAGGGVINADASDLLVEFAELTGVPVI 235 (591) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHCCCCEE T ss_conf 9999999999999972789879958880311159999999985399789 No 130 >COG2102 Predicted ATPases of PP-loop superfamily [General function prediction only] Probab=26.50 E-value=46 Score=14.07 Aligned_cols=106 Identities=8% Similarity=0.042 Sum_probs=70.8 Q ss_pred CEEEEECCCHHHHHHHHHHHHHCCCCCEECC----CC---CCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHH Q ss_conf 7899946985799999999996899824515----87---6546789999987651588613322046666602569999 Q gi|254780579|r 38 SAIVVLTGEPIRIERAFELLENQIGEKIFIS----GV---HHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEA 110 (207) Q Consensus 38 DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~S----G~---~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~ 110 (207) ++|+-.+||.+=.-+....+++|.--.-+++ .. ..+.-..++....+...|+| +.....+..-..+.... T Consensus 2 k~~aL~SGGKDS~~Al~~a~~~G~eV~~Ll~~~p~~~dS~m~H~~n~~~~~~~Ae~~gi~---l~~~~~~g~~e~eve~L 78 (223) T COG2102 2 KVIALYSGGKDSFYALYLALEEGHEVVYLLTVKPENGDSYMFHTPNLELAELQAEAMGIP---LVTFDTSGEEEREVEEL 78 (223) T ss_pred CEEEEEECCCHHHHHHHHHHHCCCEEEEEEEEECCCCCEEEEECCCHHHHHHHHHHCCCC---EEEEECCCCCHHHHHHH T ss_conf 689998167188999999997598368999981599870442222567999998744883---59983476305569999 Q ss_pred HHHHHHCCCCEEE--EECCHHHHHHHHHHHHHHCCCCEEE Q ss_conf 9999864998189--8511442199999999862998799 Q gi|254780579|r 111 SAWAEKNNFHHVL--IVTHDYHMPRTFLELQRINSTVQFI 148 (207) Q Consensus 111 ~~~l~~~~~~~ii--LVTs~yHm~RA~~~f~~~~p~i~i~ 148 (207) ++.++.-+..-|+ =|-|.||-.|...+.++. +++.+ T Consensus 79 ~~~l~~l~~d~iv~GaI~s~yqk~rve~lc~~l--Gl~~~ 116 (223) T COG2102 79 KEALRRLKVDGIVAGAIASEYQKERVERLCEEL--GLKVY 116 (223) T ss_pred HHHHHHCCCCEEEECHHHHHHHHHHHHHHHHHH--CCEEE T ss_conf 999973763379973433099999999999972--97885 No 131 >COG2876 AroA 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase [Amino acid transport and metabolism] Probab=26.31 E-value=47 Score=14.05 Aligned_cols=41 Identities=7% Similarity=-0.056 Sum_probs=25.8 Q ss_pred HCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 15886133220466666025699999999864998189851 Q gi|254780579|r 86 RQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVT 126 (207) Q Consensus 86 ~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVT 126 (207) ..|-...-|++-..-.+|.|+...+++|+...|-.++||+- T Consensus 147 e~G~~~kPvLLKRg~~aTieEwL~AAEYI~s~GN~~vILCE 187 (286) T COG2876 147 EVGRQNKPVLLKRGLSATIEEWLNAAEYILSHGNGNVILCE 187 (286) T ss_pred HHCCCCCCEEEECCCCCCHHHHHHHHHHHHHCCCCCEEEEE T ss_conf 82355997697247412499999999999967999579971 No 132 >PRK13508 tagatose-6-phosphate kinase; Provisional Probab=25.97 E-value=47 Score=14.01 Aligned_cols=23 Identities=13% Similarity=0.049 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 025699999999864998189851 Q gi|254780579|r 103 TEGNAQEASAWAEKNNFHHVLIVT 126 (207) Q Consensus 103 T~ena~~~~~~l~~~~~~~iiLVT 126 (207) ..+.+....+.+..+|.+ .++|| T Consensus 199 ~~~~~~~a~~~l~~~Gv~-~VivS 221 (309) T PRK13508 199 DLDELKEVLQQPLFEGIE-WIIVS 221 (309) T ss_pred CHHHHHHHHHHHHHCCCC-EEEEE T ss_conf 699999999999864888-89990 No 133 >PRK02231 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=25.60 E-value=48 Score=13.97 Aligned_cols=34 Identities=15% Similarity=0.039 Sum_probs=22.5 Q ss_pred CCCCCEEEEECCCHHHHHHHHHHHHHCCCCCEECC Q ss_conf 45687899946985799999999996899824515 Q gi|254780579|r 34 HPSVSAIVVLTGEPIRIERAFELLENQIGEKIFIS 68 (207) Q Consensus 34 ~~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~S 68 (207) .+++|.|||+||+ +-+..+++.+.....|.+=+- T Consensus 40 ~~~~Dlii~iGGD-GT~L~a~r~~~~~~iPilGiN 73 (272) T PRK02231 40 GQRAQLAIVIGGD-GNMLGRARVLAKYDIPLIGIN 73 (272) T ss_pred CCCCCEEEEECCC-HHHHHHHHHHHCCCCCEEEEC T ss_conf 0177899997875-899999998600599789653 No 134 >pfam07509 DUF1523 Protein of unknown function (DUF1523). Probab=25.59 E-value=27 Score=15.45 Aligned_cols=43 Identities=14% Similarity=0.359 Sum_probs=26.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCHHHHHHH Q ss_conf 91799999999999999999999972134676545687899946985799999 Q gi|254780579|r 1 MRYFWYGLFVCLIFFIMGFISFIRYVKQMHIPDHPSVSAIVVLTGEPIRIERA 53 (207) Q Consensus 1 mR~~~i~l~~~~~~~~~~~~~f~~~~~~~~~~~~~~~DaIVVLgGg~~Ri~~a 53 (207) ||++.+.+.+++.+++++++.+ ..|+.|..-|.|...+|++.+ T Consensus 1 M~y~k~~~~~~~~~~~~~~lhY----------~lPq~dvvrItgtevkR~d~~ 43 (174) T pfam07509 1 MKYVKWIVLLLFLLVLALFLHY----------TLPQYDVVRITGVEVKRMDLG 43 (174) T ss_pred CCEEHHHHHHHHHHHHHHHEEE----------CCCCCCEEEEECCEEEEECCC T ss_conf 9375147999999999987452----------478753899947389986377 No 135 >cd01575 PBP1_GntR Ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. This group represents the ligand-binding domain of DNA transcription repressor GntR specific for gluconate, a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of GntR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding, Probab=25.49 E-value=48 Score=13.96 Aligned_cols=84 Identities=11% Similarity=-0.024 Sum_probs=28.5 Q ss_pred HHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHH-HHHHHHHCCCCEEEEECCHHHH Q ss_conf 999689982451587654678---999998765158861332204666660256999-9999986499818985114421 Q gi|254780579|r 56 LLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQE-ASAWAEKNNFHHVLIVTHDYHM 131 (207) Q Consensus 56 L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~-~~~~l~~~~~~~iiLVTs~yHm 131 (207) |.++|+....+++|....... .+-.+..+.+.|.+...+..... .++.+++.. ..++++++.--..++++++.=. T Consensus 111 L~~~G~~~i~~i~~~~~~~~~~~R~~G~~~al~~~g~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~dai~~~nD~~A 189 (268) T cd01575 111 LLARGYRRIGFLGARMDDTRAQQRLEGFRAALRAAGLDPPLVVTTPE-PSSFALGRELLAELLARWPDLDAVFCSNDDLA 189 (268) T ss_pred HHHCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEECC-CCCHHHHHHHHHHHHHCCCCCCEEEECCHHHH T ss_conf 99749976999848988736999999999999976989880899548-99889999999999956999837998788999 Q ss_pred HHHHHHHHH Q ss_conf 999999998 Q gi|254780579|r 132 PRTFLELQR 140 (207) Q Consensus 132 ~RA~~~f~~ 140 (207) .-++..+++ T Consensus 190 ~g~~~~l~~ 198 (268) T cd01575 190 LGALFECQR 198 (268) T ss_pred HHHHHHHHH T ss_conf 999999998 No 136 >cd06295 PBP1_CelR Ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. This group includes the ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. The binding of CelR to the celE promoter is inhibited specifically by cellobiose. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn chang Probab=25.41 E-value=48 Score=13.95 Aligned_cols=95 Identities=13% Similarity=0.046 Sum_probs=44.6 Q ss_pred HHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHH-HHHHHHHHCCCCEEEEECCHHHH Q ss_conf 999689982451587654678---99999876515886133220466666025699-99999986499818985114421 Q gi|254780579|r 56 LLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQ-EASAWAEKNNFHHVLIVTHDYHM 131 (207) Q Consensus 56 L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~-~~~~~l~~~~~~~iiLVTs~yHm 131 (207) |.++|+....+++|....... ..-.+..+.+.+++........ +..+.+.+. ..++++++.+--..++++++.-. T Consensus 120 L~~~G~~~I~~i~~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~ai~~~nD~~A 198 (275) T cd06295 120 LLARGRRRIAFLGGPQDMPEGEERLEGYREALAEAGLPLDPRLVAP-GDFTEESGRAAMRALLERGPDFDAVFAASDLMA 198 (275) T ss_pred HHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEE-CCCCHHHHHHHHHHHHHCCCCCCEEEECCCHHH T ss_conf 9980998798705886672699999999999998699999417996-577668799998889854999870341475878 Q ss_pred HHHHHHHHHHC---C-CCEEEEEE Q ss_conf 99999999862---9-98799964 Q gi|254780579|r 132 PRTFLELQRIN---S-TVQFIPYP 151 (207) Q Consensus 132 ~RA~~~f~~~~---p-~i~i~~~p 151 (207) --++..+++.. | ++.++.+- T Consensus 199 ~g~~~~~~~~g~~iP~disIigfd 222 (275) T cd06295 199 LGALRALREAGRRVPEDVAVVGFD 222 (275) T ss_pred HHHHHHHHHCCCCCCCCEEEEEEC T ss_conf 999999997499989856999967 No 137 >PRK10833 putative assembly protein; Provisional Probab=25.35 E-value=48 Score=13.94 Aligned_cols=23 Identities=26% Similarity=0.613 Sum_probs=17.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 91799999999999999999999 Q gi|254780579|r 1 MRYFWYGLFVCLIFFIMGFISFI 23 (207) Q Consensus 1 mR~~~i~l~~~~~~~~~~~~~f~ 23 (207) ||+|+..+.++++++++++..++ T Consensus 1 MKrll~~l~illvvvv~gl~aLv 23 (617) T PRK10833 1 MRRFLTTLMILLVVLVAGLSALV 23 (617) T ss_pred CCHHHHHHHHHHHHHHHHHHHHH T ss_conf 94189999999999999999881 No 138 >cd06277 PBP1_LacI_like_1 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=25.22 E-value=49 Score=13.93 Aligned_cols=92 Identities=12% Similarity=-0.006 Sum_probs=39.8 Q ss_pred HHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHC-CCCEEEEECCHHHH Q ss_conf 999689982451587654678---99999876515886133220466666025699999999864-99818985114421 Q gi|254780579|r 56 LLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKN-NFHHVLIVTHDYHM 131 (207) Q Consensus 56 L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~-~~~~iiLVTs~yHm 131 (207) |.++|+....+++|.....+. .+-.++.+.+.|++...-.......+..+.+. +++... .-.+.++++++.-. T Consensus 113 L~~~Ghr~I~~i~~~~~~~~~~~R~~Gf~~al~~~gl~~~~~~~~~~~~~~~~~~~---~~~~~~~~~ptAi~~~nD~~A 189 (268) T cd06277 113 LIEKGHRKIGFVGDPLYSPSFEERYEGYKKALLDHGIPFNEDYDITEKEEDEEDIG---KFIDELKPLPTAFFCSNDGVA 189 (268) T ss_pred HHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCHHHHHH---HHHHHCCCCCCEEEECCHHHH T ss_conf 99709983699727888812999999999999985999982246625877799999---999847889987996885999 Q ss_pred HHHHHHHHHHC---C-CCEEEEE Q ss_conf 99999999862---9-9879996 Q gi|254780579|r 132 PRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 132 ~RA~~~f~~~~---p-~i~i~~~ 150 (207) --++..++..+ | ++.++.+ T Consensus 190 ~g~~~~l~~~g~~vP~Disivgf 212 (268) T cd06277 190 FLLIKVLKEMGIRVPEDVSVIGF 212 (268) T ss_pred HHHHHHHHHCCCCCCCCCEEEEE T ss_conf 99999999859999986289997 No 139 >cd01471 vWA_micronemal_protein Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners. Probab=25.15 E-value=49 Score=13.92 Aligned_cols=54 Identities=13% Similarity=0.005 Sum_probs=22.9 Q ss_pred CCCCHHHHHHHHHHHHH-HCC----CC-EEEEECCHH-----HHHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 66660256999999998-649----98-189851144-----2199999999862998799964167 Q gi|254780579|r 99 KALNTEGNAQEASAWAE-KNN----FH-HVLIVTHDY-----HMPRTFLELQRINSTVQFIPYPIIS 154 (207) Q Consensus 99 ~s~~T~ena~~~~~~l~-~~~----~~-~iiLVTs~y-----Hm~RA~~~f~~~~p~i~i~~~pv~~ 154 (207) ..++|-.-..++.+.+- .+| .. =++|+|+.. +..++-...|.. |+++++..|.. T Consensus 83 g~T~Tg~AL~~a~~~~f~~~g~R~~vpkv~illTDG~s~d~~~~~~~a~~Lr~~--GV~ifavGVG~ 147 (186) T cd01471 83 GSTNTTSALLVVEKHLFDTRGNRENAPQLVIIMTDGIPDSKFRTLKEARKLRER--GVIIAVLGVGQ 147 (186) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHC--CCEEEEEECCC T ss_conf 967799999999997211468899998599999069877852589999999988--99999998343 No 140 >PRK07233 hypothetical protein; Provisional Probab=24.99 E-value=49 Score=13.90 Aligned_cols=10 Identities=20% Similarity=0.032 Sum_probs=3.7 Q ss_pred ECCCHHHHHH Q ss_conf 4698579999 Q gi|254780579|r 43 LTGEPIRIER 52 (207) Q Consensus 43 LgGg~~Ri~~ 52 (207) +-||.+++.. T Consensus 192 ~~GG~~~l~~ 201 (430) T PRK07233 192 LEGGFGTLLD 201 (430) T ss_pred CCCCHHHHHH T ss_conf 0698999999 No 141 >pfam10742 DUF2555 Protein of unknown function (DUF2555). This family is conserved in Cyanobacteria. The function is not known. Probab=24.81 E-value=46 Score=14.08 Aligned_cols=41 Identities=20% Similarity=0.246 Sum_probs=30.4 Q ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHH Q ss_conf 66602569999999986499818985114421999999998 Q gi|254780579|r 100 ALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQR 140 (207) Q Consensus 100 s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~ 140 (207) +.-|.+.....++-+++.++.+--=.-.++|.-||..+=|- T Consensus 7 ~~f~ee~va~LA~RLE~DdY~~pFd~L~DWhlLRalA~~Rp 47 (57) T pfam10742 7 AAFTEEDVAELAKRLEEDDYPNPFDGLKDWHLLRALAIHRP 47 (57) T ss_pred HHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 66699999999986323258871245667899999995180 No 142 >cd06282 PBP1_GntR_like_2 Ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. This group includes the ligand-binding domain of putative DNA transcription repressors highly similar to that of the repressor specific for gluconate (GntR) which is a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding Probab=24.50 E-value=50 Score=13.84 Aligned_cols=96 Identities=11% Similarity=0.032 Sum_probs=52.0 Q ss_pred HHHHHHHCCCCCEECCCCCCCCC--H--HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCH Q ss_conf 99999968998245158765467--8--9999987651588613322046666602569999999986499818985114 Q gi|254780579|r 53 AFELLENQIGEKIFISGVHHSVS--K--DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHD 128 (207) Q Consensus 53 a~~L~~~g~~~~ii~SG~~~~~~--~--~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~ 128 (207) +-.|+++|+....+++|...... . .+-.++.+.+.+++...+.. ...+..+......+++.++.--+.++++++ T Consensus 108 ~~~L~~~G~~~i~~i~~~~~~~~~~~~R~~Gf~~a~~~~~~~~~~~~~--~~~~~~~~~~~~~~~l~~~~~~~ai~~~nD 185 (266) T cd06282 108 AQALAALGHRRIAMLAGRLAASDRARQRYAGYRAAMRAAGLAPLPPVE--IPFNTAALPSALLALLTAHPAPTAIFCSND 185 (266) T ss_pred HHHHHHCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEE--ECCCHHHHHHHHHHHHHCCCCCCEEEECCC T ss_conf 999997399569999777888868999999999999986999761799--348726899999999837999847985388 Q ss_pred HHHHHHHHHHHHHC---C-CCEEEEE Q ss_conf 42199999999862---9-9879996 Q gi|254780579|r 129 YHMPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 129 yHm~RA~~~f~~~~---p-~i~i~~~ 150 (207) ...-.++..+++.+ | ++.++.+ T Consensus 186 ~~A~g~l~al~~~g~~vP~disIigf 211 (266) T cd06282 186 LLALAVIRALRRLGLRVPDDLSVVGF 211 (266) T ss_pred HHHHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 89999999999849999998499998 No 143 >cd01574 PBP1_LacI Ligand-binding domain of DNA transcription repressor LacI specific for lactose, a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor LacI specific for lactose, a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of LacI is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA b Probab=23.96 E-value=52 Score=13.78 Aligned_cols=95 Identities=12% Similarity=0.008 Sum_probs=49.5 Q ss_pred HHHHHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHH Q ss_conf 999999689982451587654678---99999876515886133220466666025699999999864998189851144 Q gi|254780579|r 53 AFELLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDY 129 (207) Q Consensus 53 a~~L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~y 129 (207) +-.|.++|+....+++|....... .+-....+.+.+++...+... ..+.+.+....+-+.++.--..++++++. T Consensus 108 ~~~L~~~G~~~i~~i~~~~~~~~~~~R~~G~~~al~~~~i~~~~~~~~---~~~~~~~~~~~~~l~~~~~~~aii~~~D~ 184 (264) T cd01574 108 TEHLLELGHRTIAHVAGPEEWLSARARLAGWRAALEAAGIAPPPVLEG---DWSAESGYRAGRELLREGDPTAVFAANDQ 184 (264) T ss_pred HHHHHHCCCCCEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCEEEEC---CCCHHHHHHHHHHHHHCCCCCEEEECCHH T ss_conf 999998499827325899777579999999999999869897448866---89979999999999837999689636429 Q ss_pred HHHHHHHHHHHHC---C-CCEEEEE Q ss_conf 2199999999862---9-9879996 Q gi|254780579|r 130 HMPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 130 Hm~RA~~~f~~~~---p-~i~i~~~ 150 (207) -.-.+...+++.+ | ++.++.+ T Consensus 185 ~A~g~i~~l~~~g~~iP~di~iigf 209 (264) T cd01574 185 MALGVLRALHELGLRVPDDVSVVGF 209 (264) T ss_pred HHHHHHHHHHHHCCCCCCCCEEEEE T ss_conf 9999999999829988988479986 No 144 >cd06270 PBP1_GalS_like Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Transcription of the galactose regulon genes is regulated by Gal iso-repressor (GalS) and Gal repressor (GalR) in different ways, but both repressors recognize the same DNA binding site in the absence of D-galactose. GalS is a dimeric protein like GalR,and its major role is in regulating expression of the high-affinity galactose transporter encoded by the mgl operon, whereas GalR is the exclusive regulator of galactose permease, the low-affinity galactose transporter. GalS and GalR are members of the LacI-GalR family of transcription regulators and both contain the type I periplasmic binding protein-like fold. Hence, they are homologous to the periplasmic sugar bindi Probab=23.25 E-value=53 Score=13.70 Aligned_cols=95 Identities=14% Similarity=0.003 Sum_probs=50.5 Q ss_pred HHHHHCCCCCEECCCCCCCCCH---HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHH-HHHHHHCCCCEEEEECCHHH Q ss_conf 9999689982451587654678---9999987651588613322046666602569999-99998649981898511442 Q gi|254780579|r 55 ELLENQIGEKIFISGVHHSVSK---DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEA-SAWAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 55 ~L~~~g~~~~ii~SG~~~~~~~---~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~-~~~l~~~~~~~iiLVTs~yH 130 (207) .|+++|+....+++|.....+. .+-.+..+.+.|++.....+.. +..+.+.+... .+++++..--+.++++++.- T Consensus 110 ~L~~~G~~~i~~i~~~~~~~~~~~R~~Gf~~al~~~gl~~~~~~i~~-~~~~~~~g~~~~~~ll~~~~~~~ai~~~nD~~ 188 (268) T cd06270 110 HLIELGHRKIACITGPLTKEDARLRLQGYRDALAEAGIALDESLIIE-GDFTEEGGYAAMQELLARGAPFTAVFCANDEM 188 (268) T ss_pred HHHHCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCCCEEEEE-CCCCHHHHHHHHHHHHHCCCCCCEEEECCHHH T ss_conf 99983998389845988883199999999999998599978004761-67767889998899995699987455337799 Q ss_pred HHHHHHHHHHHC---C-CCEEEEE Q ss_conf 199999999862---9-9879996 Q gi|254780579|r 131 MPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 131 m~RA~~~f~~~~---p-~i~i~~~ 150 (207) .-.++..+++.+ | ++.++.+ T Consensus 189 A~g~~~~l~~~g~~vP~disvigf 212 (268) T cd06270 189 AAGAISALREHGISVPQDVSIIGF 212 (268) T ss_pred HHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 999999999829999997699998 No 145 >PRK04759 consensus Probab=23.22 E-value=53 Score=13.69 Aligned_cols=34 Identities=15% Similarity=0.079 Sum_probs=22.6 Q ss_pred CCCCCEEEEECCCHHHHHHHHHHHHHCCCCCEECC Q ss_conf 45687899946985799999999996899824515 Q gi|254780579|r 34 HPSVSAIVVLTGEPIRIERAFELLENQIGEKIFIS 68 (207) Q Consensus 34 ~~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~S 68 (207) ..++|.||||||+- -+..+++++.....|.+=+. T Consensus 62 ~~~~Dlvi~lGGDG-TlL~aar~~~~~~~PilgiN 95 (294) T PRK04759 62 GKKADLAIVVGGDG-NMLGAARVLSRFDISVIGVN 95 (294) T ss_pred CCCCCEEEEECCCH-HHHHHHHHHCCCCCCEEEEE T ss_conf 76656899984785-89999998601699689884 No 146 >KOG0404 consensus Probab=23.17 E-value=53 Score=13.69 Aligned_cols=45 Identities=16% Similarity=0.180 Sum_probs=27.7 Q ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHC--CCCEEEE Q ss_conf 6660256999999998649981898511442199999999862--9987999 Q gi|254780579|r 100 ALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFLELQRIN--STVQFIP 149 (207) Q Consensus 100 s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~~f~~~~--p~i~i~~ 149 (207) ..+-.|+|.+..++. +++.++--.-|.|-+...-+|+. |++.++. T Consensus 166 GDsA~EEA~fLtkya-----skVyii~Rrd~fRAs~~Mq~ra~~npnI~v~~ 212 (322) T KOG0404 166 GDSAMEEALFLTKYA-----SKVYIIHRRDHFRASKIMQQRAEKNPNIEVLY 212 (322) T ss_pred CHHHHHHHHHHHHHC-----CEEEEEEEHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 678878999887414-----37999997124467789999875399769993 No 147 >PRK00561 ppnK inorganic polyphosphate/ATP-NAD kinase; Provisional Probab=22.91 E-value=54 Score=13.66 Aligned_cols=34 Identities=12% Similarity=-0.052 Sum_probs=21.5 Q ss_pred CCCCCCEEEEECCCHHHHHHHHHHHHHCCCCCEEC Q ss_conf 54568789994698579999999999689982451 Q gi|254780579|r 33 DHPSVSAIVVLTGEPIRIERAFELLENQIGEKIFI 67 (207) Q Consensus 33 ~~~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~ 67 (207) .+..+|.|||+||+ +-+..|++.+.....|.+=+ T Consensus 30 ~~d~~DlviviGGD-GT~L~a~~~~~~~~iPilGI 63 (259) T PRK00561 30 VEDGADYLFVLGGD-GFFVSTAANYNCAGCKVVGI 63 (259) T ss_pred CCCCCCEEEEECCH-HHHHHHHHHHCCCCCCEEEE T ss_conf 78899999998971-99999999855479968999 No 148 >TIGR02733 desat_CrtD C-3',4' desaturase CrtD; InterPro: IPR014104 Members of this family are encoded by slr1293, a carotenoid biosynthesis protein which was shown to be the C-3',4' desaturase (CrtD) of myxoxanthophyll biosynthesis in Synechocystis sp. (strain PCC 6803), and close homologues (presumed to be functionally equivalent) from other cyanobacteria, where myxoxanthophyll biosynthesis is either known or expected. This enzyme can act on neurosporene and so presumably catalyses the first step that is committed to myxoxanthophyll.. Probab=22.47 E-value=55 Score=13.60 Aligned_cols=26 Identities=8% Similarity=-0.087 Sum_probs=10.3 Q ss_pred CCCC-CHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2116-9899999999999999999987 Q gi|254780579|r 161 SSIF-KIKILRVLLIEYLKILLLSIQL 186 (207) Q Consensus 161 ~Ww~-~~~~~~~~~~Ey~K~l~~~lr~ 186 (207) .|+. +.......=.+|.+-+...|-. T Consensus 387 ~W~~l~~~~Y~~~K~~y~~~~~~~L~~ 413 (499) T TIGR02733 387 DWSSLDEEDYTAKKKQYTETILERLGE 413 (499) T ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 211772678999999999999998432 No 149 >cd06275 PBP1_PurR Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. Ligand-binding domain of purine repressor, PurR, which functions as the master regulatory protein of de novo purine nucleotide biosynthesis in Escherichia coli. This dimeric PurR belongs to the LacI-GalR family of transcription regulators and is activated to bind to DNA operator sites by initially binding either of high affinity corepressors, hypoxanthine or guanine. PurR is composed of two functional domains: aan N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the purine transcription repressor undergoes a Probab=22.39 E-value=55 Score=13.59 Aligned_cols=95 Identities=11% Similarity=-0.018 Sum_probs=43.2 Q ss_pred HHHHHCCCCCEECCCCCCCCCHH---HHHHHHHHHCCCCHHHEECCCCCCCHHHHH-HHHHHHHHHCCCCEEEEECCHHH Q ss_conf 99996899824515876546789---999987651588613322046666602569-99999998649981898511442 Q gi|254780579|r 55 ELLENQIGEKIFISGVHHSVSKD---ILLQKIPIRQDLAECCIDIGYKALNTEGNA-QEASAWAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 55 ~L~~~g~~~~ii~SG~~~~~~~~---~~~~~~~~~~~i~~~~I~~e~~s~~T~ena-~~~~~~l~~~~~~~iiLVTs~yH 130 (207) .|.++|+....+++|........ +-+++.+.+.+++-....+.. +..+.+.+ ....+++..+.--+.++++++.= T Consensus 111 ~Li~~G~~~i~~i~~~~~~~~~~~R~~Gf~~al~~~~i~~~~~~i~~-~~~~~~~~~~~~~~~l~~~~~~~aii~~~d~~ 189 (269) T cd06275 111 HLIELGHRRIGCITGPLEKAPAQQRLAGFRRAMAEAGLPVNPGWIVE-GDFECEGGYEAMQRLLAQPKRPTAVFCGNDLM 189 (269) T ss_pred HHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHCC-CCCCHHHHHHHHHHHHHHCCCCCEEECCCHHH T ss_conf 99982997177556997771699999999999998599976555036-77776889999999997089966123187499 Q ss_pred HHHHHHHHHHHC---C-CCEEEEE Q ss_conf 199999999862---9-9879996 Q gi|254780579|r 131 MPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 131 m~RA~~~f~~~~---p-~i~i~~~ 150 (207) ...++..++..+ | ++.++.+ T Consensus 190 A~g~~~~l~~~gi~vP~di~vvgf 213 (269) T cd06275 190 AMGALCAAQEAGLRVPQDLSIIGY 213 (269) T ss_pred HHHHHHHHHHCCCCCCCCCEEEEE T ss_conf 999999999808878998679987 No 150 >COG1148 HdrA Heterodisulfide reductase, subunit A and related polyferredoxins [Energy production and conversion] Probab=22.39 E-value=55 Score=13.59 Aligned_cols=32 Identities=19% Similarity=0.222 Sum_probs=26.1 Q ss_pred CCCEEEEECCCHHHHHHHHHHHHHCCCCCEEC Q ss_conf 68789994698579999999999689982451 Q gi|254780579|r 36 SVSAIVVLTGEPIRIERAFELLENQIGEKIFI 67 (207) Q Consensus 36 ~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~ 67 (207) -++-+.|.|||..-+.+|++|.+.|+.-.++= T Consensus 123 v~~svLVIGGGvAGitAAl~La~~G~~v~LVE 154 (622) T COG1148 123 VSKSVLVIGGGVAGITAALELADMGFKVYLVE 154 (622) T ss_pred HCCCEEEECCCHHHHHHHHHHHHCCCEEEEEE T ss_conf 30535998684898999999997598089994 No 151 >COG0028 IlvB Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] [Amino acid transport and metabolism / Coenzyme metabolism] Probab=22.31 E-value=56 Score=13.58 Aligned_cols=46 Identities=17% Similarity=0.135 Sum_probs=33.1 Q ss_pred HHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHH Q ss_conf 7999999999968998245158765467899999876515886133 Q gi|254780579|r 48 IRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECC 93 (207) Q Consensus 48 ~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~ 93 (207) +-+..++++.++.+.|.|+..|+.......+..+++.+..++|-.+ T Consensus 188 ~~i~~aa~~L~~AkrPvIl~G~G~~~a~a~~~l~~lae~~~~Pv~~ 233 (550) T COG0028 188 EAIRKAAELLAEAKRPVILAGGGVRRAGASEELRELAEKLGAPVVT 233 (550) T ss_pred HHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHCCCEEE T ss_conf 9999999999738997899888656413499999999987989897 No 152 >PTZ00300 pyruvate kinase; Provisional Probab=22.20 E-value=56 Score=13.57 Aligned_cols=28 Identities=4% Similarity=0.068 Sum_probs=15.4 Q ss_pred HHHHHCCCCEEEEECCHHHHHHHHHHHH Q ss_conf 9998649981898511442199999999 Q gi|254780579|r 112 AWAEKNNFHHVLIVTHDYHMPRTFLELQ 139 (207) Q Consensus 112 ~~l~~~~~~~iiLVTs~yHm~RA~~~f~ 139 (207) ....+-+.+-|+..|..=++.|...-|| T Consensus 342 ~~a~~~~a~aIv~~T~sG~tA~~vs~~R 369 (454) T PTZ00300 342 NSVYETKAKALVVLSNTGRSARLVAKYR 369 (454) T ss_pred HHHHHCCCCEEEEECCCCHHHHHHHHHC T ss_conf 9998569989999828858999999659 No 153 >pfam06057 VirJ Bacterial virulence protein (VirJ). This family consists of several bacterial VirJ virulence proteins. VirJ is thought to be involved in the type IV secretion system. It is thought that the substrate proteins localized to the periplasm may associate with the pilus in a manner that is mediated by VirJ, and suggest a two-step process for type IV secretion in Agrobacterium. Probab=21.97 E-value=56 Score=13.54 Aligned_cols=67 Identities=13% Similarity=0.143 Sum_probs=37.1 Q ss_pred CEECCCCCCCCCHHHHHHHHHHHCCCCHHHEE---CCCCCCCHHHHHHHHHHHHH----HCCCCEEEEECCHHH Q ss_conf 24515876546789999987651588613322---04666660256999999998----649981898511442 Q gi|254780579|r 64 KIFISGVHHSVSKDILLQKIPIRQDLAECCID---IGYKALNTEGNAQEASAWAE----KNNFHHVLIVTHDYH 130 (207) Q Consensus 64 ~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~---~e~~s~~T~ena~~~~~~l~----~~~~~~iiLVTs~yH 130 (207) -+++||.+--....+-....+.+.|+|---+. .-=+.++-.|-|.-+.++++ +.+-++++||=-.|= T Consensus 5 av~~SGDGGW~~lD~~va~~L~~~GvpVvGvdSLrYFW~~ktP~q~a~Dl~~ii~~Y~~~w~~~~v~LiGYSFG 78 (192) T pfam06057 5 AVFYSGDGGWRDLDKEVGSALQKQGVPVVGVDSLRYFWSERTPEEVADDLDRIIDTYRKRWKVKNVVLIGYSFG 78 (192) T ss_pred EEEEECCCCHHHHHHHHHHHHHHCCCCEEEECHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEEEECCC T ss_conf 99994688735551999999997798365532688875659989999999999999999858965999961787 No 154 >cd01995 ExsB ExsB is a transcription regulator related protein. It is a subfamily of a Adenosine nucleotide binding superfamily of proteins. This protein family is represented by a single member in nearly every completed large ( 1000 genes) prokaryotic genome. In Rhizobium meliloti, a species in which the exo genes make succinoglycan, a symbiotically important exopolysaccharide, exsB is located nearby and affects succinoglycan levels, probably through polar effects on exsA expression or the same polycistronic mRNA. In Arthrobacter viscosus, the homologous gene is designated ALU1 and is associated with an aluminum tolerance phenotype. The function is unknown Probab=21.72 E-value=57 Score=13.51 Aligned_cols=77 Identities=14% Similarity=0.027 Sum_probs=42.8 Q ss_pred EEEEECCCHHHHHHHHHHHHHCCCCCEECCC-CCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHH---HHHHH Q ss_conf 8999469857999999999968998245158-7654678999998765158861332204666660256999---99999 Q gi|254780579|r 39 AIVVLTGEPIRIERAFELLENQIGEKIFISG-VHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQE---ASAWA 114 (207) Q Consensus 39 aIVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG-~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~---~~~~l 114 (207) ++|+|+||.|=.-.+..+.++|+...-+.-. +..+..|.+..+......+.. ..+.. + |+.+ +..+. T Consensus 2 avvllSGGlDSt~~l~~~~~~g~~v~~l~~dYGQr~~~E~~~a~~i~~~l~~~---~~VP~--R----N~ifls~A~~~A 72 (169) T cd01995 2 AVVLLSGGLDSTTCLAWAKKEGYEVHALSFDYGQRHAKEEEAAKLIAEKLGPS---TYVPA--R----NLIFLSIAAAYA 72 (169) T ss_pred EEEEECCCHHHHHHHHHHHHCCCEEEEEEEECCCCHHHHHHHHHHHHHHHCCC---CEECC--C----HHHHHHHHHHHH T ss_conf 89990787889999999998499699999756985499999999999996899---46528--3----899999999999 Q ss_pred HHCCCCEEEE Q ss_conf 8649981898 Q gi|254780579|r 115 EKNNFHHVLI 124 (207) Q Consensus 115 ~~~~~~~iiL 124 (207) ...|.+.|.+ T Consensus 73 ~~~g~~~v~~ 82 (169) T cd01995 73 EALGAEAIII 82 (169) T ss_pred HHCCCCEEEE T ss_conf 9829996899 No 155 >PRK07482 hypothetical protein; Provisional Probab=21.70 E-value=57 Score=13.51 Aligned_cols=150 Identities=7% Similarity=-0.074 Sum_probs=62.3 Q ss_pred CHHHHHHHHHHHHHCCCCCE-ECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHH-CC---CC Q ss_conf 85799999999996899824-515876546789999987651588613322046666602569999999986-49---98 Q gi|254780579|r 46 EPIRIERAFELLENQIGEKI-FISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEK-NN---FH 120 (207) Q Consensus 46 g~~Ri~~a~~L~~~g~~~~i-i~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~-~~---~~ 120 (207) +..++.+|+.=. ....... ...+ .......++++.+.....-.-.++.+-+....--|+|...+..-.. .| -+ T Consensus 67 ~~p~v~~Ai~~Q-~~~l~~~~~~~~-~~~~~~~~lAe~L~~~~p~~l~~v~f~~SGSEAvE~AiKlAr~y~~~~g~~~r~ 144 (461) T PRK07482 67 GRTEVAEAIAEQ-AKELAYYHTYVG-HGTEASIELSKRIIDRAPAGMSKVYYGLSGSDANETQIKLVWYYNNVLGRPEKK 144 (461) T ss_pred CCHHHHHHHHHH-HHHCCCCCCCCC-CCCHHHHHHHHHHHHHCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 949999999999-965876114677-568899999999997578886879961866999999999999998861887654 Q ss_pred EEEEECCHHHHHHHHHHHH--------HHC--CCCEEEEEECCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 1898511442199999999--------862--998799964167666754211698999999999999999999876055 Q gi|254780579|r 121 HVLIVTHDYHMPRTFLELQ--------RIN--STVQFIPYPIISHDLEENSSIFKIKILRVLLIEYLKILLLSIQLSLST 190 (207) Q Consensus 121 ~iiLVTs~yHm~RA~~~f~--------~~~--p~i~i~~~pv~~~~~~~~~Ww~~~~~~~~~~~Ey~K~l~~~lr~~i~~ 190 (207) +|+-....||-+ +..... ..+ |-..+...+.+. .++..+... .-.++.......+..-+.- T Consensus 145 ~Iis~~~syHG~-T~~a~s~tg~~~~~~~~~~~~~~~~~~~~P~------~~~~~~~~~--~~~~~~~~~~~~le~~i~~ 215 (461) T PRK07482 145 KIISRWRGYHGS-GIVTGSLTGLSLFHQHFDLPIDRVLHTEAPH------YYRRADAGM--SEEQFSAYCADELEELILA 215 (461) T ss_pred EEEEECCCCCCC-CCCCCCCCCCCCCCCCCCCCCCCCEECCCCC------CCCCCCCCC--CHHHHHHHHHHHHHHHHHH T ss_conf 599961785567-5313002466333466679987832246885------456788663--3777789999999999985 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 6766541101742025 Q gi|254780579|r 191 QTASQFFITLIEEITV 206 (207) Q Consensus 191 ~~~~~~~~~~~~~~~~ 206 (207) +..++-.+.++|||+. T Consensus 216 ~~~~~iAAvI~EPi~g 231 (461) T PRK07482 216 EGPDTIAAFIGEPVLG 231 (461) T ss_pred HCCCCEEEEEECCCCC T ss_conf 0988489998056357 No 156 >TIGR01318 gltD_gamma_fam glutamate synthase, small subunit; InterPro: IPR006006 One pathway for the assimilation of ammonia and glutamate biosynthesis involves glutamate synthase (1.4.1.13 from EC) which transfers the amide group of glutamine to 2-oxoglutarate to yield two molecules of glutamate.2 L-glutamate + NADP^+ = L-glutamine + 2-oxoglutarate + NADPH + H^+. This model describes both glutatate synthase small subunit and closely related paralogs of unknown function from a number of gamma and alpha subdivision Proteobacteria, including Escherichia coli. ; GO: 0016639 oxidoreductase activity acting on the CH-NH2 group of donors NAD or NADP as acceptor, 0006537 glutamate biosynthetic process. Probab=21.67 E-value=44 Score=14.22 Aligned_cols=18 Identities=28% Similarity=0.344 Sum_probs=10.8 Q ss_pred CEEEEECCCH---HHHHHHHH Q ss_conf 7899946985---79999999 Q gi|254780579|r 38 SAIVVLTGEP---IRIERAFE 55 (207) Q Consensus 38 DaIVVLgGg~---~Ri~~a~~ 55 (207) --+||||||- +=+..|++ T Consensus 293 k~VVVLGGGDTaMDCvRTaiR 313 (480) T TIGR01318 293 KRVVVLGGGDTAMDCVRTAIR 313 (480) T ss_pred CEEEEECCCCCHHHHHHHHHH T ss_conf 668985888752578899998 No 157 >TIGR01466 cobJ_cbiH precorrin-3B C17-methyltransferase; InterPro: IPR006363 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobJ and CbiH precorrin-3B C(17)-methyltransferase (2.1.1.131 from EC), both as stand-alone enzymes and when CobJ forms part of a bifunctional enzyme. In the aerobic pathway, once CobG has generated precorrin-3b, CobJ catalyses the methylation of precorrin-3b at C-17 to form precorrin-4 (the extruded methylated C-20 fragment is left attached as an acyl group at C-1) . In the corresponding anaerobic pathway, CbiH carries out this ring contraction, using cobalt-precorrin-3b as a substrate to generate a tetramethylated delta-lactone. These proteins belong to the superfamily of tetrapyrrole (corrin/porphyrin) methylases (IPR000878 from INTERPRO), which includes methylases that use S-adenosylmethionine (S-AdoMet) in the methylation of diverse substrates. A number of other methylases in the cobalamin biosynthesis pathway also belong to this domain superfamily (precorrin-3 methylase, IPR012203 from INTERPRO, IPR012059 from INTERPRO, IPR012382 from INTERPRO, IPR012797 from INTERPRO and others), and a fusion of precorrin-3B C17-methyltransferases with precorrin isomerase is represented by IPR014422 from INTERPRO. Nomenclature note: precorrin-3B C17-methyltransferase is one of the two methyltransferases often referred to as precorrin-3 methylase (the other is precorrin-4 C11-methyltransferase, 2.1.1.133 from EC). ; GO: 0008168 methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=21.53 E-value=58 Score=13.48 Aligned_cols=113 Identities=20% Similarity=0.173 Sum_probs=57.9 Q ss_pred EEEECCCH---HHHHHHHHHHHHCCCCCEECCCCCCC-CCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHH Q ss_conf 99946985---79999999999689982451587654-678999998765158861332204666660256999999998 Q gi|254780579|r 40 IVVLTGEP---IRIERAFELLENQIGEKIFISGVHHS-VSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAE 115 (207) Q Consensus 40 IVVLgGg~---~Ri~~a~~L~~~g~~~~ii~SG~~~~-~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~ 115 (207) -|+=+|.. +|.+.|+++.++|+.--++.|| +++ ..-|-+.=+.....+.++..+.+|=-.-=|--++--+ .+=. T Consensus 47 ~~~~~~M~~E~~Ra~~Ai~~A~~G~~VAlvSSG-DpGiYgMA~l~~E~~~~~~~~~~~~dieV~PGiTA~~aAAs-~LGa 124 (254) T TIGR01466 47 EVITSGMREEIERAELAIELAKEGKTVALVSSG-DPGIYGMAALVFEILEKKGLEDVAIDIEVIPGITAASAAAS-LLGA 124 (254) T ss_pred EEEECCCCHHHHHHHHHHHHHHCCCEEEEEECC-CHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHH-HHCC T ss_conf 487677962789999999998609948999368-75278889999999862389977234798588337899999-7336 Q ss_pred HCCCCEEEEECCHHHHHHHHHHHH--HHCC-CCEEEEEECCC Q ss_conf 649981898511442199999999--8629-98799964167 Q gi|254780579|r 116 KNNFHHVLIVTHDYHMPRTFLELQ--RINS-TVQFIPYPIIS 154 (207) Q Consensus 116 ~~~~~~iiLVTs~yHm~RA~~~f~--~~~p-~i~i~~~pv~~ 154 (207) .-+..-+.|==|+.=.|.+..+-| .+.. |.-+--|-..+ T Consensus 125 PL~HDFc~ISLSDlLtPw~~Ie~R~~aAA~aDFVia~YNP~S 166 (254) T TIGR01466 125 PLGHDFCVISLSDLLTPWPVIEKRLRAAAEADFVIAIYNPRS 166 (254) T ss_pred CCCCCEEEEECCCCCCCHHHHHHHHHHHHCCCEEEEEECCCC T ss_conf 311378888745556787899999999844992899974898 No 158 >TIGR01457 HAD-SF-IIA-hyp2 HAD-superfamily subfamily IIA hydrolase, TIGR01457; InterPro: IPR006354 These sequences are all members of the Class IIA subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. The sequences are all from the Gram-positive (low-GC) bacteria.. Probab=21.16 E-value=59 Score=13.44 Aligned_cols=72 Identities=10% Similarity=0.138 Sum_probs=35.2 Q ss_pred CHHHHHHHHHHH---HHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCC-CE Q ss_conf 857999999999---96899824515876546789999987651588613322046666602569999999986499-81 Q gi|254780579|r 46 EPIRIERAFELL---ENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNF-HH 121 (207) Q Consensus 46 g~~Ri~~a~~L~---~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~-~~ 121 (207) |.+|+.+|.+.. ++...|.+|++.- ...+...+...+..+..+|... |. =|.....+.++|.+.+- .+ T Consensus 15 G~e~I~ea~~FV~~L~kr~~pYLFVTNN-strtPe~Va~~L~snfdipat~---e~----VfT~smAta~y~~~~k~~~~ 86 (251) T TIGR01457 15 GKEKIEEAVEFVKELKKRDVPYLFVTNN-STRTPESVAELLASNFDIPATK---EQ----VFTASMATADYLADLKKEKS 86 (251) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEEECC-CCCCHHHHHHHHHHHCCCCCCC---CH----HHHHHHHHHHHHHHCCCCEE T ss_conf 8530167899999886479863687278-8888179999987516899972---10----23447899999983288028 Q ss_pred EEEE Q ss_conf 8985 Q gi|254780579|r 122 VLIV 125 (207) Q Consensus 122 iiLV 125 (207) +.+| T Consensus 87 vyVi 90 (251) T TIGR01457 87 VYVI 90 (251) T ss_pred EEEE T ss_conf 9998 No 159 >cd04615 CBS_pair_2 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera Probab=20.92 E-value=59 Score=13.41 Aligned_cols=81 Identities=10% Similarity=0.072 Sum_probs=45.5 Q ss_pred HHHHHHHHHHHCCCCCEECCCC-CC---CCCHHHHHHHHHHHCCCCH---HHEE-CCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 9999999999689982451587-65---4678999998765158861---3322-0466666025699999999864998 Q gi|254780579|r 49 RIERAFELLENQIGEKIFISGV-HH---SVSKDILLQKIPIRQDLAE---CCID-IGYKALNTEGNAQEASAWAEKNNFH 120 (207) Q Consensus 49 Ri~~a~~L~~~g~~~~ii~SG~-~~---~~~~~~~~~~~~~~~~i~~---~~I~-~e~~s~~T~ena~~~~~~l~~~~~~ 120 (207) -+..|++...+.....+.+... +. -.++.++.+........++ ..+. -+...-+.-+....+.+.|.+++++ T Consensus 11 ~l~~A~~~m~~~~~~~~pVvd~~~~lvGivT~~Di~~~~~~~~~~~~~~v~~iMt~~~~ti~~~~~l~~a~~~m~~~~i~ 90 (113) T cd04615 11 DIARAVAEMYTSGSRALPVVDDKKRLVGIITRYDVLSYALESEELKDAKVREVMNSPVITIDANDSIAKARWLMSNNNIS 90 (113) T ss_pred CHHHHHHHHHHCCCCEEEEECCCCEEEEEEEHHHHHHHHHHCCCCCCCCHHHHCCCCCEEEECCCCHHHHHHHHHHHCCE T ss_conf 99999999997399789999489979999995999999980798245899992756978990899399999999974994 Q ss_pred EEEEECCHH Q ss_conf 189851144 Q gi|254780579|r 121 HVLIVTHDY 129 (207) Q Consensus 121 ~iiLVTs~y 129 (207) ++.+|-++- T Consensus 91 ~lpVvd~~g 99 (113) T cd04615 91 RLPVLDDKG 99 (113) T ss_pred EEEEECCCC T ss_conf 899997999 No 160 >cd06332 PBP1_aromatic_compounds_like Type I periplasmic binding proteins of active transport systems that are predicted to be involved in transport of aromatic compounds such as 2-nitrobenzoic acid and alkylbenzenes. This group includes the type I periplasmic binding proteins of active transport systems that are predicted to be involved in transport of aromatic compounds such as 2-nitrobenzoic acid and alkylbenzenes; their substrate specificities are not well characterized, however. Members also exhibit close similarity to active transport systems for short chain amides and/or urea found in bacteria and archaea. Probab=20.88 E-value=59 Score=13.40 Aligned_cols=45 Identities=13% Similarity=0.312 Sum_probs=32.9 Q ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHH-HHHHHCCC Q ss_conf 6660256999999998649981898511442199999-99986299 Q gi|254780579|r 100 ALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTFL-ELQRINST 144 (207) Q Consensus 100 s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~~-~f~~~~p~ 144 (207) +.++.+.+....+++.+.+++++.++++++-.-|... .|++.+.+ T Consensus 115 ~~~~~~~~~~~~~~~~~~~~kkvail~~d~~~g~~~~~~~~~~~~~ 160 (333) T cd06332 115 SWQNDQVHEAMGKYAADKGYKKVVIIAPDYAAGKDAVAGFKRTFKG 160 (333) T ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHHHCC T ss_conf 5986999999999999718977999957864769999999997270 No 161 >PRK05192 tRNA uridine 5-carboxymethylaminomethyl modification enzyme GidA; Validated Probab=20.85 E-value=60 Score=13.40 Aligned_cols=33 Identities=18% Similarity=0.122 Sum_probs=21.2 Q ss_pred CCCCCEEEEECCCHHHHHHHHHHHHHCCCCCEEC Q ss_conf 4568789994698579999999999689982451 Q gi|254780579|r 34 HPSVSAIVVLTGEPIRIERAFELLENQIGEKIFI 67 (207) Q Consensus 34 ~~~~DaIVVLgGg~~Ri~~a~~L~~~g~~~~ii~ 67 (207) ..+.|+||| |||.-=.++|..-.+.|..-.++. T Consensus 4 ~~~yDVIVi-G~GhAG~EAa~aaar~G~~t~lit 36 (621) T PRK05192 4 PEEYDVIVV-GGGHAGCEAALAAARMGAKTLLLT 36 (621) T ss_pred CCCCCEEEE-CCCHHHHHHHHHHHHCCCCEEEEE T ss_conf 887988998-975799999999996799679996 No 162 >PRK13360 omega amino acid--pyruvate transaminase; Provisional Probab=20.75 E-value=60 Score=13.39 Aligned_cols=54 Identities=9% Similarity=-0.058 Sum_probs=22.6 Q ss_pred HHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHH-HHCC---CCEEEEECCHHHH Q ss_conf 9999876515886133220466666025699999999-8649---9818985114421 Q gi|254780579|r 78 ILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWA-EKNN---FHHVLIVTHDYHM 131 (207) Q Consensus 78 ~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l-~~~~---~~~iiLVTs~yHm 131 (207) ++++.+..-..-.-+.+.+-+....-.|.|...+... +..| -++++-.+..||- T Consensus 92 ~lae~L~~~~p~~l~~v~f~~SGsEA~E~AiKlAr~y~~~~g~~~r~~ii~~~~syHG 149 (441) T PRK13360 92 ELATRIAEIAPGGLNHVFFTNSGSESVDTALKIALAYHRARGQGQRTRLIGRERGYHG 149 (441) T ss_pred HHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCC T ss_conf 9999999857888687998487899999999999999886099872289932786279 No 163 >PRK05259 consensus Probab=20.67 E-value=60 Score=13.37 Aligned_cols=98 Identities=15% Similarity=0.148 Sum_probs=51.6 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCH-HHEECCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 9994698579999999999689982451587654678999998765158861-332204666660256999999998649 Q gi|254780579|r 40 IVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAE-CCIDIGYKALNTEGNAQEASAWAEKNN 118 (207) Q Consensus 40 IVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~-~~I~~e~~s~~T~ena~~~~~~l~~~~ 118 (207) .||.+-+.+....+-.+.+.-..+.-++........+.+.+... ..+.. .||++ ..--+|-+....+.+.++++| T Consensus 161 ~vvVsPD~G~~~ra~~~a~~l~~~~~~~~K~R~~~~~~~~~~~~---gdV~Gk~~IIv-DDiIdTGgTl~~aa~~Lk~~G 236 (310) T PRK05259 161 VMVVSPDVGGVVRARALAKRLDAPLAIVDKRRERPGESEVMNVI---GDVSGRDCILI-DDIVDSGGTLCNAAEALLANG 236 (310) T ss_pred EEEECCCCHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCC---CCCCCCEEEEE-CCHHHCHHHHHHHHHHHHHCC T ss_conf 39991492699999999998199678999860689954334545---66231248951-734416889999999998779 Q ss_pred CCEEE-EECCHHHHHHHHHHHHHH Q ss_conf 98189-851144219999999986 Q gi|254780579|r 119 FHHVL-IVTHDYHMPRTFLELQRI 141 (207) Q Consensus 119 ~~~ii-LVTs~yHm~RA~~~f~~~ 141 (207) .+++. .+|++-=...|.--+... T Consensus 237 A~~V~~~~THgvfs~~A~~ri~~s 260 (310) T PRK05259 237 ANSVTAYITHGVLSGGAVARIASS 260 (310) T ss_pred CCEEEEEEECCCCCCHHHHHHHCC T ss_conf 986999997853685699998628 No 164 >PRK06427 phosphomethylpyrimidine kinase; Reviewed Probab=20.58 E-value=60 Score=13.36 Aligned_cols=91 Identities=15% Similarity=0.112 Sum_probs=50.8 Q ss_pred CCCEEEEE-CCCHHHHHHHHHHHHHCCCCCEEC-----CCCCCCCCHH---HHHHHH------HHHCCCCHHHEECCCCC Q ss_conf 68789994-698579999999999689982451-----5876546789---999987------65158861332204666 Q gi|254780579|r 36 SVSAIVVL-TGEPIRIERAFELLENQIGEKIFI-----SGVHHSVSKD---ILLQKI------PIRQDLAECCIDIGYKA 100 (207) Q Consensus 36 ~~DaIVVL-gGg~~Ri~~a~~L~~~g~~~~ii~-----SG~~~~~~~~---~~~~~~------~~~~~i~~~~I~~e~~s 100 (207) +.|+|-+= =|..+-++.-.+..++.+..++++ |+.+....+. +.+++. .+-.+.++.+++.+... T Consensus 73 ~i~aiKiGmL~s~~~i~~v~~~l~~~~~~~vV~DPVl~s~sG~~l~~~~~~~~l~~~Llp~a~liTPN~~Ea~~L~g~~~ 152 (266) T PRK06427 73 GIDAVKTGMLASAEIIETVAEALKRYPPPPVVLDPVMIAKSGDPLLADDAVAALRERLLPLATLITPNLPEAEALTGLPI 152 (266) T ss_pred CCCEEEECCCCCHHHHHHHHHHHHHCCCCCEEECCCEECCCCCCCCCHHHHHHHHHHHCCCCCEECCCHHHHHHHHCCCC T ss_conf 98989999969899999999999967998899567251489985656789999998606634141588899999829988 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 660256999999998649981898511 Q gi|254780579|r 101 LNTEGNAQEASAWAEKNNFHHVLIVTH 127 (207) Q Consensus 101 ~~T~ena~~~~~~l~~~~~~~iiLVTs 127 (207) .++.++.+..++.+.+.|.+. +|||. T Consensus 153 ~~~~d~~~~aa~~l~~~G~~~-VlikG 178 (266) T PRK06427 153 ADTEDEMKAAARALLALGCKA-VLIKG 178 (266) T ss_pred CCCHHHHHHHHHHHHHHCCCE-EEEEC T ss_conf 990999999999999608984-99947 No 165 >cd05014 SIS_Kpsf KpsF-like protein. KpsF is an arabinose-5-phosphate isomerase which contains SIS (Sugar ISomerase) domains. SIS domains are found in many phosphosugar isomerases and phosphosugar binding proteins. KpsF catalyzes the reversible reaction of ribulose 5-phosphate to arabinose 5-phosphate. This is the second step in the CMP-Kdo biosynthesis pathway. Probab=20.40 E-value=61 Score=13.34 Aligned_cols=44 Identities=5% Similarity=-0.092 Sum_probs=29.1 Q ss_pred CHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHH Q ss_conf 6133220466666025699999999864998189851144219999 Q gi|254780579|r 90 AECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYHMPRTF 135 (207) Q Consensus 90 ~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yHm~RA~ 135 (207) +.+.++.-..|-.|.|-. ...+.++++|.+ ++.+|+.-+.+=|. T Consensus 47 ~~Dv~i~iS~SG~T~e~~-~~~~~ak~~g~~-vI~iT~~~~S~La~ 90 (128) T cd05014 47 PGDVVIAISNSGETDELL-NLLPHLKRRGAP-IIAITGNPNSTLAK 90 (128) T ss_pred CCCEEEEEECCCCCHHHH-HHHHHHHHCCCC-EEEEECCCCCHHHH T ss_conf 999999997999986799-999999863785-89998799996799 No 166 >TIGR03309 matur_yqeB selenium-dependent molybdenum hydroxylase system protein, YqeB family. Members of this protein family are probable accessory proteins for the biosynthesis of enzymes with labile selenium-containing centers, different from selenocysteine-containing proteins. Probab=20.33 E-value=49 Score=13.89 Aligned_cols=72 Identities=8% Similarity=0.129 Sum_probs=33.2 Q ss_pred EEEECCCHHHHHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHH Q ss_conf 99946985799999999996899824515876546789999987651588613322046666602569999999986 Q gi|254780579|r 40 IVVLTGEPIRIERAFELLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEK 116 (207) Q Consensus 40 IVVLgGg~~Ri~~a~~L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~ 116 (207) |||+|+|.==---|.+|++.|+. ++++...... .+.+...-...+.+....+|.-.---.++..+..+.+++ T Consensus 1 viVrG~gDiAsgva~~L~raGf~--Vv~~e~~~P~---~iRR~vaF~dAvy~g~~~VEgv~a~ra~~~~e~~~~~~~ 72 (256) T TIGR03309 1 VVVRGAGDLATGVAHRLHRSGFK--VLMTETEQPT---VIRRTVAFAEAIYTGEVTVEGVTARRISNLAEVLKLLQQ 72 (256) T ss_pred CEECCCCHHHHHHHHHHHHCCCE--EEEECCCCCC---EEEEEEEEEEEEECCCEEEEEEEEEECCCHHHHHHHHHC T ss_conf 98915766787775886638967--9992489997---265200711015889579831799986999999999854 No 167 >PRK08674 bifunctional phosphoglucose/phosphomannose isomerase; Validated Probab=20.30 E-value=61 Score=13.33 Aligned_cols=87 Identities=14% Similarity=0.088 Sum_probs=42.5 Q ss_pred CCCCCCEEEEEC-CCHHHHHHHHH-HHHHCCCCCEECCCCCCCCCHHHHHHHHHHHCCCCH-HHEECCCCCCCHHHHHHH Q ss_conf 545687899946-98579999999-999689982451587654678999998765158861-332204666660256999 Q gi|254780579|r 33 DHPSVSAIVVLT-GEPIRIERAFE-LLENQIGEKIFISGVHHSVSKDILLQKIPIRQDLAE-CCIDIGYKALNTEGNAQE 109 (207) Q Consensus 33 ~~~~~DaIVVLg-Gg~~Ri~~a~~-L~~~g~~~~ii~SG~~~~~~~~~~~~~~~~~~~i~~-~~I~~e~~s~~T~ena~~ 109 (207) +..+.|.|||.| ||..-.-.=++ |..+....++.+.-+ |....-+.+ .-++.-..|-||.|.... T Consensus 30 ~~~~~~~Ivi~GmGGS~i~Gdv~~~l~~~~~~iPv~v~~~------------y~lP~~v~~~tLVIavSySGnTeETL~a 97 (328) T PRK08674 30 EIEPYDNIVISGMGGSGIGGDLLRSLLLDEWKKPVFVVRD------------YFLPAFVDRKTLVIAVSYSGNTEETLSA 97 (328) T ss_pred CCCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCEEEECC------------CCCCCCCCCCCEEEEECCCCCCHHHHHH T ss_conf 1476572999957564899999999984479986798558------------8799865888579998289997799999 Q ss_pred HHHHHHHCCCCEEEEECCHHHHHH Q ss_conf 999998649981898511442199 Q gi|254780579|r 110 ASAWAEKNNFHHVLIVTHDYHMPR 133 (207) Q Consensus 110 ~~~~l~~~~~~~iiLVTs~yHm~R 133 (207) .++ ..++|. +++.|||.=-+.+ T Consensus 98 ~~~-A~~rga-~vi~ItsGG~L~~ 119 (328) T PRK08674 98 VEQ-AKKRGA-KIIAITSGGKLAE 119 (328) T ss_pred HHH-HHHCCC-CEEEEECCCCHHH T ss_conf 999-997599-5899948970678 No 168 >TIGR02867 spore_II_P stage II sporulation protein P; InterPro: IPR010897 This family contains the bacterial stage II sporulation protein P (SpoIIP) (approximately 350 residues long). It has been shown that a block in polar cytokinesis in Bacillus subtilis is mediated partly by transcription of spoIID, spoIIM and spoIIP. This inhibition of polar division is involved in the locking in of asymmetry after the formation of a polar septum during sporulation . SpoIIP is one of the three genes (spoIID, spoIIM and spoIIP, , , ), under the control of sigma E, that have been shown to be essential for the engulfment of the forespore by the mother cell. Their products are involved in degradation of the septal peptidoglycan and mutations in spoIID, spoIIM or spoIIP block sporulation at morphological stage II, prior to the stage of engulfment. These three genes are absolutely conserved (sometimes even duplicated) in all endospore formers .. Probab=20.21 E-value=61 Score=13.31 Aligned_cols=30 Identities=27% Similarity=0.211 Sum_probs=21.8 Q ss_pred CCCCHHHEECC-CCCCCHHHHHHHHHHHHHH Q ss_conf 58861332204-6666602569999999986 Q gi|254780579|r 87 QDLAECCIDIG-YKALNTEGNAQEASAWAEK 116 (207) Q Consensus 87 ~~i~~~~I~~e-~~s~~T~ena~~~~~~l~~ 116 (207) +.+.+.+|++| +...||.|||..++++|.+ T Consensus 184 QDls~nA~LiE~Gg~~Nt~~Ea~~sa~~LA~ 214 (221) T TIGR02867 184 QDLSPNAILIEVGGVDNTLEEAKRSADYLAE 214 (221) T ss_pred CCCCCCEEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 4333684799975873858999999999999 No 169 >cd06291 PBP1_Qymf_like Ligand binding domain of the lacI-like transcription regulator from a novel metal-reducing bacterium Alkaliphilus Metalliredigens (strain Qymf) and its close homologs. This group includes the ligand binding domain of the lacI-like transcription regulator from a novel metal-reducing bacterium Alkaliphilus Metalliredigens (strain Qymf) and its close homologs. Qymf is a strict anaerobe that could be grown in the presence of borax and its cells are straight rods that produce endospores. This group is a member of the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription Probab=20.20 E-value=61 Score=13.31 Aligned_cols=95 Identities=19% Similarity=0.172 Sum_probs=51.0 Q ss_pred HHHHHCCCCCEECCCCCCCCCH----HHHHHHHHHHCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCHHH Q ss_conf 9999689982451587654678----999998765158861332204666660256999999998649981898511442 Q gi|254780579|r 55 ELLENQIGEKIFISGVHHSVSK----DILLQKIPIRQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHDYH 130 (207) Q Consensus 55 ~L~~~g~~~~ii~SG~~~~~~~----~~~~~~~~~~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~yH 130 (207) .|+++|.....+++|.....+. .+-++..+.+.|++...+..+. ..+..+-.....+++++..--+.++++++.- T Consensus 106 ~L~~~Ghr~I~~i~~~~~~~~~~~~R~~Gf~~al~~~g~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~~~~Ai~~~nD~~ 184 (265) T cd06291 106 ELIERGCKHIAHIGGPNNTVSPTNLRYEGFLDVLKENGLEVRIIEIQE-NFDDAEKKEEIKELLEEYPDIDGIFASNDLT 184 (265) T ss_pred HHHHCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECC-CCCHHHHHHHHHHHHHCCCCCCEEECCCHHH T ss_conf 999739964999966888874899999999999997699960899768-8986999999999985599988432166899 Q ss_pred HHHHHHHHHHHC---C-CCEEEEE Q ss_conf 199999999862---9-9879996 Q gi|254780579|r 131 MPRTFLELQRIN---S-TVQFIPY 150 (207) Q Consensus 131 m~RA~~~f~~~~---p-~i~i~~~ 150 (207) .-.+...++..+ | ++.++.+ T Consensus 185 A~g~~~al~~~g~~vP~disvigf 208 (265) T cd06291 185 AILVLKEAQQRGIRVPEDLQIIGY 208 (265) T ss_pred HHHHHHHHHHCCCCCCCCEEEEEE T ss_conf 999999999819999998699998 No 170 >PRK11267 biopolymer transport protein ExbD; Provisional Probab=20.16 E-value=62 Score=13.31 Aligned_cols=57 Identities=5% Similarity=0.072 Sum_probs=39.9 Q ss_pred CCCCHHHHHHHHHH-HCCCCHHHEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCH Q ss_conf 54678999998765-1588613322046666602569999999986499818985114 Q gi|254780579|r 72 HSVSKDILLQKIPI-RQDLAECCIDIGYKALNTEGNAQEASAWAEKNNFHHVLIVTHD 128 (207) Q Consensus 72 ~~~~~~~~~~~~~~-~~~i~~~~I~~e~~s~~T~ena~~~~~~l~~~~~~~iiLVTs~ 128 (207) ...+..++...... ..+-++..|.+.......|+......+.+++.|++++-|||.. T Consensus 79 ~~v~~~~L~~~l~~~~~~~~~~~v~i~aD~~~~y~~vv~Vmd~~~~aG~~kv~lvte~ 136 (141) T PRK11267 79 DPVTDETMITALDALTEGKKDTTIFFRADKTVDYETLMKVMDTLHQAGYLKIGLVGEE 136 (141) T ss_pred CCCCHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 0357999999999998328996299975899868899999999998699869998468 No 171 >TIGR02168 SMC_prok_B chromosome segregation protein SMC; InterPro: IPR011890 The SMC (structural maintenance of chromosomes) family of proteins, exist in virtually all organisms including both bacteria and archaea. The SMC proteins are essential for successful chromosome transmission during replication and segregation of the genome in all organisms and form three types of heterodimer (SMC1SMC3, SMC2SMC4, SMC5SMC6), which are core components of large multiprotein complexes. The best known complexes are cohesin, which is responsible for sister-chromatid cohesion, and condensin, which is required for full chromosome condensation in mitosis. SMCs are generally present as single proteins in bacteria, and as at least six distinct proteins in eukaryotes. The proteins range in size from approximately 110 to 170 kDa, and share a five-domain structure, with globular N- and C-terminal (IPR003395 from INTERPRO) domains separated by a long (circa 100 nm or 900 residues) coiled coil segment in the centre of which is a globular ''hinge'' domain, characterised by a set of four highly conserved glycine residues that are typical of flexible regions in a protein. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif (XXXXD, where X is any hydrophobic residue), and a LSGG motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases . All SMC proteins appear to form dimers, either forming homodimers with themselves, as in the case of prokaryotic SMC proteins, or heterodimers between different but related SMC proteins. The dimers are arranged in an antiparallel alignment. This orientation brings the N- and C-terminal globular domains (from either different or identical protamers) together, which unites an ATP binding site (Walker A motif) within the N-terminal domain with a Walker B motif (DA box) within the C-terminal domain, to form a potentially functional ATPase. Protein interaction and microscopy data suggest that SMC dimers form a ring-like structure which might embrace DNA molecules. Non-SMC subunits associate with the SMC amino- and carboxy-terminal domains. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. SMCs share not only sequence similarity but also structural similarity with ABC proteins. SMC proteins function together with other proteins in a range of chromosomal transactions, including chromosome condensation, sister-chromatid cohesion, recombination, DNA repair and epigenetic silencing of gene expression . The smc gene is often associated with scpB (IPR005234 from INTERPRO) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle , , .; GO: 0005515 protein binding, 0005524 ATP binding, 0005694 chromosome. Probab=20.06 E-value=62 Score=13.29 Aligned_cols=40 Identities=13% Similarity=0.164 Sum_probs=17.1 Q ss_pred CCEEEEECCCHHHHHHHHHH----HHHCCCCCEECCCCCCCCCHH Q ss_conf 87899946985799999999----996899824515876546789 Q gi|254780579|r 37 VSAIVVLTGEPIRIERAFEL----LENQIGEKIFISGVHHSVSKD 77 (207) Q Consensus 37 ~DaIVVLgGg~~Ri~~a~~L----~~~g~~~~ii~SG~~~~~~~~ 77 (207) ...|=-|+||.. ...|+.| |+-.|+|.+|+--++-+..++ T Consensus 1096 ~~~l~LLSGGEK-ALTA~ALlFAIf~~~P~PFC~LDEVDAPLDda 1139 (1191) T TIGR02168 1096 NQNLSLLSGGEK-ALTALALLFAIFKVKPAPFCILDEVDAPLDDA 1139 (1191) T ss_pred CCCCCCCCCHHH-HHHHHHHHHHHHCCCCCCEEEECCCCCCCCHH T ss_conf 766430201579-99999999999600788726321233679826 Done!