Query gi|254780670|ref|YP_003065083.1| phosphopyruvate hydratase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 424 No_of_seqs 163 out of 2152 Neff 5.6 Searched_HMMs 39220 Date Sun May 29 20:40:55 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780670.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 TIGR01060 eno phosphopyruvate 100.0 0 0 1511.1 39.4 421 3-423 1-429 (430) 2 PRK00077 eno phosphopyruvate h 100.0 0 0 1282.6 42.4 423 1-423 2-425 (427) 3 cd03313 enolase Enolase: Enola 100.0 0 0 1237.0 40.2 404 5-408 1-408 (408) 4 PTZ00081 enolase (2-phospho-D- 100.0 0 0 1226.9 41.1 420 1-423 2-440 (442) 5 COG0148 Eno Enolase [Carbohydr 100.0 0 0 1226.7 39.7 421 2-424 3-423 (423) 6 KOG2670 consensus 100.0 0 0 1159.6 30.9 417 1-422 1-432 (433) 7 pfam00113 Enolase_C Enolase, C 100.0 0 0 841.6 27.2 286 137-424 1-296 (296) 8 PRK08350 hypothetical protein; 100.0 0 0 839.3 27.3 337 2-418 3-339 (341) 9 pfam03952 Enolase_N Enolase, N 100.0 0 0 367.6 14.0 131 2-132 1-132 (132) 10 cd03316 MR_like Mandelate race 99.8 5.5E-16 1.4E-20 138.7 24.8 305 2-383 1-318 (357) 11 cd03318 MLE Muconate Lactonizi 99.7 1.6E-14 4.1E-19 127.8 26.1 292 10-385 23-318 (365) 12 cd03325 D-galactonate_dehydrat 99.7 3.4E-13 8.7E-18 118.1 27.8 298 2-381 1-303 (352) 13 cd03322 rpsA The starvation se 99.7 4E-13 1E-17 117.5 24.6 288 2-380 1-291 (361) 14 cd03329 MR_like_4 Mandelate ra 99.6 6.1E-13 1.6E-17 116.2 23.7 280 18-382 33-313 (368) 15 cd03327 MR_like_2 Mandelate ra 99.6 3.6E-12 9.3E-17 110.5 25.0 288 2-381 1-293 (341) 16 cd03328 MR_like_3 Mandelate ra 99.6 2.3E-12 5.8E-17 112.0 23.6 277 15-380 26-305 (352) 17 cd03321 mandelate_racemase Man 99.6 1.5E-12 3.9E-17 113.2 22.6 283 1-367 1-298 (355) 18 cd03319 L-Ala-DL-Glu_epimerase 99.6 3.4E-12 8.7E-17 110.7 24.0 286 8-386 18-309 (316) 19 cd03317 NAAAR N-acylamino acid 99.6 7.7E-13 2E-17 115.5 19.7 287 8-384 17-306 (354) 20 COG4948 L-alanine-DL-glutamate 99.6 6.2E-12 1.6E-16 108.7 23.6 298 1-380 1-313 (372) 21 cd00308 enolase_like Enolase-s 99.6 3E-13 7.6E-18 118.5 14.8 200 15-381 22-222 (229) 22 cd03324 rTSbeta_L-fuconate_deh 99.5 6E-10 1.5E-14 94.1 25.2 317 1-379 1-367 (415) 23 cd03326 MR_like_1 Mandelate ra 99.4 1.1E-09 2.8E-14 92.2 21.5 287 15-382 24-335 (385) 24 cd03315 MLE_like Muconate lact 99.3 1.1E-09 2.8E-14 92.1 20.1 210 110-384 46-258 (265) 25 cd03323 D-glucarate_dehydratas 99.3 9.9E-09 2.5E-13 85.0 24.9 318 2-380 1-337 (395) 26 cd03314 MAL Methylaspartate am 99.2 1.3E-08 3.2E-13 84.3 19.7 336 18-406 13-366 (369) 27 TIGR01502 B_methylAsp_ase meth 98.9 1.7E-06 4.4E-11 68.5 19.0 313 19-387 51-387 (414) 28 cd03320 OSBS o-Succinylbenzoat 98.9 3.2E-06 8.1E-11 66.5 20.4 142 213-381 110-253 (263) 29 TIGR01928 menC_lowGC/arch o-su 98.8 1E-06 2.6E-11 70.1 17.2 257 18-363 22-283 (326) 30 PRK05105 O-succinylbenzoate sy 98.7 3E-05 7.5E-10 59.3 20.3 118 258-383 167-288 (326) 31 PRK02714 O-succinylbenzoate sy 98.6 9.5E-06 2.4E-10 63.0 16.3 144 212-383 145-293 (320) 32 TIGR02534 mucon_cyclo muconate 98.5 1.1E-05 2.9E-10 62.4 13.7 290 14-389 24-322 (369) 33 COG3799 Mal Methylaspartate am 98.2 0.00055 1.4E-08 50.0 17.0 308 19-387 52-382 (410) 34 pfam07476 MAAL_C Methylasparta 98.0 0.00013 3.4E-09 54.5 10.3 147 261-409 87-247 (249) 35 pfam02746 MR_MLE_N Mandelate r 97.8 0.00028 7.1E-09 52.1 9.1 92 17-133 26-117 (117) 36 TIGR03247 glucar-dehydr glucar 97.5 0.014 3.5E-07 39.6 20.4 280 56-379 51-353 (441) 37 TIGR01927 menC_gamma/gm+ o-suc 95.7 0.033 8.3E-07 36.8 5.9 123 214-362 162-292 (336) 38 pfam05034 MAAL_N Methylasparta 95.3 0.29 7.5E-06 29.8 9.4 104 19-134 51-154 (158) 39 PRK02901 O-succinylbenzoate sy 95.1 0.087 2.2E-06 33.7 6.4 123 245-379 129-253 (327) 40 pfam01188 MR_MLE Mandelate rac 95.0 0.13 3.4E-06 32.3 7.0 69 213-302 28-96 (98) 41 KOG2367 consensus 94.7 0.46 1.2E-05 28.3 10.3 40 304-343 338-377 (560) 42 PRK11382 frlB fructoselysine-6 89.0 1.3 3.3E-05 25.0 5.6 30 332-362 269-298 (347) 43 COG1105 FruK Fructose-1-phosph 88.3 2.3 5.9E-05 23.1 10.4 103 251-362 101-221 (310) 44 PRK06375 consensus 88.0 2.4 6.1E-05 23.0 13.4 32 257-289 172-203 (381) 45 PRK11557 putative DNA-binding 85.0 3.4 8.7E-05 21.9 7.6 61 330-390 180-242 (282) 46 cd01539 PBP1_GGBP Periplasmic 84.1 1.4 3.5E-05 24.8 3.6 64 260-323 181-252 (303) 47 TIGR03586 PseI pseudaminic aci 82.9 4.1 0.0001 21.3 11.0 115 263-386 75-220 (327) 48 TIGR01754 flav_RNR ribonucleot 81.1 4.7 0.00012 20.9 6.0 94 212-324 13-115 (145) 49 COG1415 Uncharacterized conser 79.6 1.3 3.3E-05 25.0 2.2 72 156-227 273-347 (373) 50 COG1441 MenC O-succinylbenzoat 79.0 3.4 8.8E-05 21.9 4.1 81 275-363 183-264 (321) 51 PRK06107 aspartate aminotransf 78.5 5.7 0.00014 20.3 13.9 31 257-288 179-210 (402) 52 PRK09271 flavodoxin; Provision 76.9 2.5 6.5E-05 22.8 3.0 112 206-340 7-127 (160) 53 cd05710 SIS_1 A subgroup of th 76.0 2.2 5.6E-05 23.3 2.4 55 329-383 47-101 (120) 54 PRK06575 consensus 75.4 6.8 0.00017 19.7 14.1 30 257-287 177-207 (399) 55 pfam04748 Polysacc_deac_2 Dive 75.3 6.9 0.00017 19.7 9.8 135 233-370 46-189 (213) 56 cd06319 PBP1_ABC_sugar_binding 74.1 7.3 0.00019 19.5 5.6 102 213-342 140-247 (277) 57 PRK02271 methylenetetrahydrome 74.1 7.3 0.00019 19.5 6.1 30 338-367 280-310 (324) 58 cd03242 ABC_RecF RecF is a rec 74.0 7.1 0.00018 19.5 4.6 55 266-323 200-254 (270) 59 pfam00289 CPSase_L_chain Carba 73.8 7.4 0.00019 19.4 5.1 71 345-422 13-85 (109) 60 cd05008 SIS_GlmS_GlmD_1 SIS (S 73.7 2.1 5.5E-05 23.4 1.9 65 329-393 46-110 (126) 61 smart00475 53EXOc 5'-3' exonuc 72.7 7.8 0.0002 19.2 5.2 94 213-311 33-132 (259) 62 cd06312 PBP1_ABC_sugar_binding 72.4 8 0.0002 19.2 6.2 76 262-343 166-246 (271) 63 PRK08069 consensus 71.0 8.5 0.00022 19.0 12.5 49 257-306 176-230 (390) 64 cd05017 SIS_PGI_PMI_1 The memb 70.8 5.7 0.00014 20.3 3.5 71 329-405 43-113 (119) 65 TIGR03600 phage_DnaB phage rep 70.6 8.7 0.00022 18.9 12.8 140 236-379 209-373 (421) 66 PRK07550 hypothetical protein; 70.0 8.9 0.00023 18.8 8.0 31 257-288 177-207 (387) 67 pfam04017 DUF366 Domain of unk 69.8 5.3 0.00014 20.5 3.2 93 241-335 21-131 (183) 68 cd06306 PBP1_TorT-like TorT-li 69.5 8.9 0.00023 18.8 4.3 76 261-342 166-245 (268) 69 PRK08960 hypothetical protein; 68.5 9.5 0.00024 18.6 12.4 30 257-287 178-207 (387) 70 TIGR03471 HpnJ hopanoid biosyn 68.0 9.8 0.00025 18.5 11.9 125 259-388 225-373 (472) 71 TIGR00593 pola DNA polymerase 68.0 9.8 0.00025 18.5 7.6 121 176-311 6-135 (1005) 72 PRK07366 succinyldiaminopimela 67.6 10 0.00025 18.5 13.3 29 258-287 179-207 (388) 73 PRK00064 recF recombination pr 67.1 10 0.00026 18.4 5.0 56 266-324 290-346 (355) 74 TIGR03555 F420_mer 5,10-methyl 66.3 11 0.00027 18.3 5.8 30 339-368 281-311 (325) 75 PRK06056 consensus 66.3 11 0.00027 18.3 11.8 180 71-287 5-210 (402) 76 TIGR01420 pilT_fam twitching m 66.2 2.9 7.3E-05 22.5 1.2 135 172-310 43-208 (350) 77 cd06309 PBP1_YtfQ_like Peripla 65.5 11 0.00028 18.2 5.4 59 261-322 166-232 (273) 78 KOG1220 consensus 62.5 12 0.00031 17.8 5.1 76 202-283 284-363 (607) 79 pfam02593 DUF166 Uncharacteriz 61.9 13 0.00032 17.7 4.8 39 265-303 87-129 (215) 80 TIGR03569 NeuB_NnaB N-acetylne 61.6 13 0.00032 17.7 12.5 148 257-414 69-263 (329) 81 PRK06108 aspartate aminotransf 61.5 13 0.00032 17.7 10.8 30 257-287 171-200 (382) 82 PTZ00340 O-sialoglycoprotein e 60.7 13 0.00033 17.6 4.5 114 63-191 54-175 (348) 83 cd06318 PBP1_ABC_sugar_binding 60.5 13 0.00034 17.6 5.1 75 260-341 171-251 (282) 84 PRK08835 consensus 60.5 13 0.00034 17.5 4.0 16 329-344 544-559 (931) 85 cd01540 PBP1_arabinose_binding 60.4 11 0.00029 18.0 3.4 147 169-342 104-259 (289) 86 cd06310 PBP1_ABC_sugar_binding 60.2 13 0.00034 17.5 5.2 76 261-342 165-245 (273) 87 PRK08928 consensus 59.8 14 0.00035 17.5 4.7 16 9-24 94-109 (861) 88 PRK07997 consensus 59.5 14 0.00035 17.4 4.7 18 329-346 542-559 (928) 89 PRK06256 biotin synthase; Vali 59.3 14 0.00035 17.4 8.3 55 334-390 245-301 (325) 90 PRK13234 nifH nitrogenase redu 58.6 14 0.00036 17.3 9.1 123 160-296 145-271 (293) 91 KOG2450 consensus 58.3 14 0.00037 17.3 4.6 125 261-413 313-451 (501) 92 PRK09513 fruK 1-phosphofructok 58.0 14 0.00037 17.3 8.8 12 57-68 38-49 (312) 93 PRK11337 DNA-binding transcrip 57.5 15 0.00038 17.2 8.9 54 330-383 189-242 (293) 94 TIGR01533 lipo_e_P4 5'-nucleot 57.2 15 0.00038 17.2 4.6 92 237-376 85-181 (295) 95 cd01173 pyridoxal_pyridoxamine 56.5 15 0.00039 17.1 7.1 17 367-383 215-231 (254) 96 COG3854 SpoIIIAA ncharacterize 56.2 15 0.00039 17.1 5.6 40 336-376 224-267 (308) 97 PRK00331 glucosamine--fructose 55.5 8.6 0.00022 18.9 2.1 33 329-361 335-367 (604) 98 PRK06245 cofG FO synthase subu 55.3 9 0.00023 18.8 2.2 15 217-231 286-300 (336) 99 PRK05755 DNA polymerase I; Pro 55.1 16 0.00041 16.9 5.1 13 181-193 470-482 (889) 100 PRK09265 aminotransferase AlaT 54.9 16 0.00041 16.9 11.1 31 257-288 181-211 (404) 101 PRK07310 consensus 54.9 16 0.00041 16.9 10.7 48 257-305 177-230 (395) 102 PRK05797 consensus 54.7 16 0.00042 16.9 5.1 18 329-346 484-501 (869) 103 PRK08443 consensus 52.1 18 0.00045 16.6 12.7 30 257-287 175-204 (388) 104 COG1012 PutA NAD-dependent ald 51.5 5.2 0.00013 20.6 0.5 64 139-211 187-251 (472) 105 TIGR03470 HpnH hopanoid biosyn 51.1 18 0.00047 16.5 10.0 118 260-385 58-195 (318) 106 cd06314 PBP1_tmGBP Periplasmic 50.2 19 0.00048 16.4 4.7 77 261-342 161-241 (271) 107 PRK06507 consensus 50.0 19 0.00049 16.4 12.0 31 257-288 177-208 (400) 108 pfam03102 NeuB NeuB family. Ne 50.0 19 0.00049 16.4 9.4 120 257-386 49-199 (240) 109 PRK05464 Na(+)-translocating N 49.5 16 0.0004 17.0 2.7 35 261-295 312-352 (408) 110 cd06334 PBP1_ABC_ligand_bindin 49.3 20 0.0005 16.3 7.5 97 214-337 154-251 (351) 111 pfam12103 Lipl32 Surface lipop 48.9 4.9 0.00012 20.8 -0.0 43 329-376 130-172 (183) 112 PRK08434 consensus 48.8 20 0.00051 16.2 4.9 12 269-280 577-588 (887) 113 PRK08654 pyruvate carboxylase 48.6 13 0.00033 17.6 2.1 24 217-240 256-279 (497) 114 PRK07543 consensus 48.5 20 0.00051 16.2 11.8 27 257-284 177-204 (400) 115 TIGR02525 plasmid_TraJ plasmid 48.5 18 0.00047 16.5 2.9 24 263-286 163-189 (374) 116 PRK05794 consensus 48.2 20 0.00052 16.2 12.3 30 257-287 176-205 (397) 117 pfam00682 HMGL-like HMGL-like. 48.0 20 0.00052 16.2 10.7 13 261-273 101-113 (237) 118 pfam01380 SIS SIS domain. SIS 46.8 16 0.00042 16.9 2.4 57 330-386 54-110 (131) 119 PRK13236 nitrogenase reductase 46.6 21 0.00055 16.0 8.9 135 147-295 133-271 (295) 120 PRK07337 aminotransferase; Val 46.6 21 0.00055 16.0 9.6 31 257-288 176-206 (388) 121 PRK08068 transaminase; Reviewe 46.4 22 0.00055 16.0 13.8 30 257-287 180-209 (389) 122 PRK07568 aspartate aminotransf 45.6 22 0.00056 15.9 10.9 32 257-289 175-206 (396) 123 PRK05929 consensus 45.5 22 0.00057 15.9 4.8 12 268-279 555-566 (870) 124 TIGR01487 SPP-like SPP-like hy 45.2 15 0.00039 17.1 2.1 14 217-230 124-137 (223) 125 COG2084 MmsB 3-hydroxyisobutyr 44.9 23 0.00058 15.8 5.5 93 249-357 87-197 (286) 126 PRK08786 consensus 44.8 23 0.00058 15.8 4.9 18 329-346 541-558 (927) 127 PRK05957 aspartate aminotransf 44.5 23 0.00058 15.8 13.8 31 257-288 173-203 (389) 128 COG2024 Phenylalanyl-tRNA synt 44.4 22 0.00056 15.9 2.8 57 271-327 198-258 (536) 129 PRK05586 biotin carboxylase; V 44.4 17 0.00044 16.7 2.2 57 217-279 256-321 (447) 130 PRK06749 replicative DNA helic 44.1 23 0.00059 15.7 6.0 137 237-377 202-367 (428) 131 PRK12581 oxaloacetate decarbox 44.1 23 0.00059 15.7 7.2 29 182-210 163-192 (468) 132 TIGR02090 LEU1_arch isopropylm 43.9 23 0.0006 15.7 9.4 104 257-363 15-134 (371) 133 pfam03641 Lysine_decarbox Poss 43.7 24 0.0006 15.7 3.7 23 329-351 53-75 (130) 134 cd06335 PBP1_ABC_ligand_bindin 43.7 24 0.0006 15.7 6.6 83 214-321 152-235 (347) 135 cd06300 PBP1_ABC_sugar_binding 43.3 24 0.00061 15.6 4.5 50 261-312 167-218 (272) 136 PRK07625 consensus 43.2 24 0.00061 15.6 5.1 15 330-344 537-551 (922) 137 cd06320 PBP1_allose_binding Pe 43.2 24 0.00061 15.6 5.5 75 261-342 164-244 (275) 138 PRK10355 xylF D-xylose transpo 42.8 24 0.00062 15.6 6.5 63 261-323 192-257 (330) 139 PRK09534 btuF corrinoid ABC tr 42.7 19 0.00047 16.5 2.2 70 271-343 243-314 (364) 140 PRK13508 tagatose-6-phosphate 42.6 24 0.00062 15.6 6.8 49 183-242 116-164 (309) 141 PRK06348 aspartate aminotransf 42.3 25 0.00063 15.5 10.5 31 257-288 175-205 (383) 142 pfam09806 CDK2AP Cyclin-depend 41.9 21 0.00055 16.0 2.4 17 395-411 100-116 (156) 143 TIGR01979 sufS cysteine desulf 41.3 25 0.00065 15.4 5.2 106 212-363 89-201 (409) 144 PRK05756 pyridoxamine kinase; 41.3 25 0.00065 15.4 7.2 79 260-357 83-176 (287) 145 TIGR02318 phosphono_phnM phosp 41.3 17 0.00044 16.7 1.9 96 248-364 181-283 (391) 146 cd06321 PBP1_ABC_sugar_binding 40.1 27 0.00068 15.3 4.4 50 261-313 163-216 (271) 147 PRK12457 2-dehydro-3-deoxyphos 39.0 28 0.0007 15.2 9.8 23 214-236 73-95 (281) 148 PRK08176 pdxK pyridoxal kinase 38.7 28 0.00071 15.2 7.5 18 340-357 172-189 (281) 149 PRK08076 consensus 38.4 28 0.00072 15.1 4.6 14 180-193 455-468 (877) 150 COG1879 RbsB ABC-type sugar tr 37.7 29 0.00073 15.0 4.7 75 259-340 199-279 (322) 151 pfam03444 DUF293 Domain of unk 37.6 19 0.0005 16.3 1.6 52 166-218 22-74 (79) 152 COG5170 CDC55 Serine/threonine 37.3 29 0.00074 15.0 2.7 19 161-179 205-223 (460) 153 PRK09389 (R)-citramalate synth 37.3 29 0.00074 15.0 8.2 48 171-225 106-153 (487) 154 cd06308 PBP1_sensor_kinase_lik 37.3 29 0.00074 15.0 5.7 72 261-337 164-241 (270) 155 PRK07367 consensus 37.2 29 0.00075 15.0 11.4 31 257-288 174-204 (385) 156 cd00984 DnaB_C DnaB helicase C 37.1 29 0.00075 15.0 8.8 13 351-363 158-171 (242) 157 PRK11241 gabD succinate-semial 36.8 11 0.00028 18.2 0.2 125 138-289 199-325 (482) 158 PRK11559 garR tartronate semia 36.7 30 0.00076 14.9 5.8 13 285-297 272-284 (295) 159 cd07084 ALDH_KGSADH-like ALDH 36.2 17 0.00042 16.8 1.1 59 140-210 156-215 (442) 160 pfam03659 Glyco_hydro_71 Glyco 36.2 30 0.00077 14.9 5.8 72 290-361 254-340 (386) 161 cd06313 PBP1_ABC_sugar_binding 36.0 30 0.00078 14.9 5.7 77 260-341 164-243 (272) 162 TIGR03540 DapC_direct LL-diami 35.2 31 0.0008 14.8 10.8 30 257-287 177-206 (383) 163 cd06248 M14_CPA_CPB_like Pepti 34.8 32 0.00081 14.7 3.6 29 61-91 70-98 (304) 164 TIGR00273 TIGR00273 iron-sulfu 34.6 22 0.00057 15.9 1.5 139 100-253 282-433 (450) 165 COG1806 Uncharacterized protei 34.5 32 0.00082 14.7 4.5 89 279-374 169-263 (273) 166 COG2805 PilT Tfp pilus assembl 34.3 21 0.00054 16.0 1.4 29 257-286 136-164 (353) 167 PRK00103 SPOUT methyltransfera 34.1 33 0.00083 14.6 4.2 36 254-289 73-112 (156) 168 cd07094 ALDH_F21_LactADH-like 34.1 15 0.00037 17.2 0.5 24 263-289 277-300 (453) 169 TIGR00653 GlnA glutamine synth 33.8 15 0.00037 17.2 0.5 228 35-299 169-465 (486) 170 PRK09276 aspartate aminotransf 33.8 33 0.00084 14.6 13.2 30 257-287 179-208 (385) 171 PRK13795 hypothetical protein; 33.7 33 0.00084 14.6 9.1 17 177-193 324-342 (630) 172 TIGR01804 BADH betaine aldehyd 33.6 33 0.00084 14.6 3.0 299 56-398 59-426 (471) 173 COG0276 HemH Protoheme ferro-l 33.6 33 0.00084 14.6 6.0 67 159-232 115-181 (320) 174 KOG1354 consensus 33.5 33 0.00085 14.6 3.1 19 161-179 197-215 (433) 175 PRK07898 consensus 33.4 33 0.00085 14.6 4.7 13 181-193 480-492 (902) 176 PRK11858 aksA trans-homoaconit 33.2 34 0.00086 14.6 11.8 17 171-187 109-125 (378) 177 PRK07778 consensus 33.1 34 0.00086 14.5 13.2 32 257-289 175-206 (386) 178 cd07669 BAR_SNX33 The Bin/Amph 33.1 9.1 0.00023 18.8 -0.7 43 204-246 44-86 (207) 179 COG2145 ThiM Hydroxyethylthiaz 33.0 34 0.00086 14.5 3.4 22 261-282 141-162 (265) 180 PRK00090 bioD dithiobiotin syn 32.6 34 0.00087 14.5 11.1 100 257-384 84-194 (223) 181 PRK00366 ispG 4-hydroxy-3-meth 32.6 34 0.00087 14.5 8.9 17 364-383 309-325 (367) 182 cd06301 PBP1_rhizopine_binding 32.6 34 0.00088 14.5 5.4 76 261-342 165-246 (272) 183 TIGR01735 FGAM_synt phosphorib 32.5 34 0.00086 14.6 2.2 23 29-53 313-336 (1401) 184 TIGR01761 thiaz-red thiazoliny 32.3 35 0.00088 14.4 2.6 97 109-223 43-150 (378) 185 PRK09701 D-allose transporter 32.3 35 0.00089 14.4 5.5 76 261-342 198-278 (311) 186 cd03870 M14_CPA Peptidase M14 32.2 35 0.00089 14.4 3.4 29 61-91 69-97 (301) 187 PRK07777 aminotransferase; Val 32.2 35 0.00089 14.4 13.2 30 257-287 172-201 (386) 188 TIGR00538 hemN oxygen-independ 31.9 35 0.0009 14.4 3.0 146 202-360 109-286 (462) 189 PRK13794 hypothetical protein; 31.8 35 0.0009 14.4 8.8 78 215-295 232-315 (473) 190 TIGR00470 sepS O-phosphoseryl- 31.7 35 0.0009 14.4 2.3 149 139-324 109-265 (558) 191 PRK01122 potassium-transportin 31.7 35 0.0009 14.4 3.6 91 279-382 444-539 (684) 192 PTZ00295 glucosamine-fructose- 31.7 35 0.0009 14.4 2.2 32 329-360 422-453 (691) 193 PRK05764 aspartate aminotransf 31.6 36 0.00091 14.4 11.5 30 257-287 177-206 (389) 194 TIGR02967 guan_deamin guanine 31.5 36 0.00091 14.4 2.8 116 171-298 89-233 (426) 195 TIGR02050 gshA_cyan_rel unchar 31.5 26 0.00067 15.3 1.5 38 159-196 48-87 (297) 196 pfam03828 PAP_assoc Poly(A) po 31.4 25 0.00065 15.4 1.4 10 278-287 47-56 (56) 197 cd05014 SIS_Kpsf KpsF-like pro 31.3 36 0.00092 14.3 3.0 55 329-383 47-101 (128) 198 COG4360 APA2 ATP adenylyltrans 31.2 27 0.00069 15.2 1.5 40 145-184 140-179 (298) 199 KOG1016 consensus 31.0 23 0.00058 15.8 1.1 82 153-238 339-440 (1387) 200 pfam00239 Resolvase Resolvase, 31.0 36 0.00093 14.3 9.3 85 262-369 13-100 (139) 201 COG4018 Uncharacterized protei 30.9 36 0.00093 14.3 6.0 23 109-131 91-115 (505) 202 COG1195 RecF Recombinational D 30.9 36 0.00093 14.3 4.5 60 266-328 292-351 (363) 203 pfam03808 Glyco_tran_WecB Glyc 30.9 36 0.00093 14.3 5.7 48 262-309 57-108 (172) 204 PRK09423 gldA glycerol dehydro 30.8 37 0.00093 14.3 6.2 27 104-130 83-113 (366) 205 PRK07556 consensus 30.7 37 0.00094 14.3 5.2 18 329-346 586-603 (977) 206 TIGR01941 nqrF NADH:ubiquinone 30.2 24 0.0006 15.7 1.1 37 261-297 322-364 (425) 207 COG1247 Sortase and related ac 30.2 37 0.00095 14.2 8.3 123 264-393 13-147 (169) 208 smart00148 PLCXc Phospholipase 30.0 37 0.00094 14.2 2.0 46 264-315 75-121 (135) 209 TIGR03599 YloV DAK2 domain fus 30.0 38 0.00096 14.2 6.3 183 186-400 241-441 (530) 210 TIGR01977 am_tr_V_EF2568 cyste 29.9 38 0.00096 14.2 3.9 68 295-362 107-179 (384) 211 cd06302 PBP1_LsrB_Quorum_Sensi 29.7 38 0.00097 14.1 5.6 76 261-343 167-248 (298) 212 cd02812 PcrB_like PcrB_like pr 29.7 31 0.0008 14.8 1.6 55 256-319 156-213 (219) 213 pfam06924 DUF1281 Protein of u 29.7 35 0.00089 14.4 1.9 75 112-186 8-100 (134) 214 pfam02739 5_3_exonuc_N 5'-3' e 29.6 38 0.00097 14.1 5.3 94 213-311 35-134 (169) 215 cd07626 BAR_SNX9_like The Bin/ 29.5 34 0.00086 14.5 1.8 12 264-275 179-190 (199) 216 cd07668 BAR_SNX9 The Bin/Amphi 28.6 39 0.001 14.0 2.0 11 264-274 187-197 (210) 217 COG0761 lytB 4-Hydroxy-3-methy 28.4 40 0.001 14.0 5.4 83 248-335 156-245 (294) 218 COG5310 Homospermidine synthas 28.3 32 0.00083 14.7 1.5 109 186-305 72-209 (481) 219 PRK11183 D-lactate dehydrogena 27.9 41 0.001 13.9 4.7 18 229-246 313-330 (565) 220 PRK08462 biotin carboxylase; V 27.8 41 0.001 13.9 4.0 58 217-279 258-324 (446) 221 cd06246 M14_CPB2 Peptidase M14 27.6 41 0.0011 13.9 3.5 35 61-97 69-109 (300) 222 cd00008 53EXOc 5'-3' exonuclea 27.4 41 0.0011 13.9 4.9 93 214-311 35-133 (240) 223 cd05025 S-100A1 S-100A1: S-100 27.3 42 0.0011 13.9 4.3 48 49-100 19-66 (92) 224 PRK11320 prpB 2-methylisocitra 27.0 42 0.0011 13.8 4.5 13 140-152 201-214 (292) 225 PRK07683 aminotransferase A; V 26.8 43 0.0011 13.8 12.1 31 257-288 174-204 (387) 226 cd07101 ALDH_SSADH2_GabD2 Myco 26.7 43 0.0011 13.8 3.1 125 137-288 170-294 (454) 227 TIGR03250 PhnAcAld_DH putative 26.6 43 0.0011 13.8 3.8 120 138-289 192-316 (472) 228 cd05026 S-100Z S-100Z: S-100Z 26.6 43 0.0011 13.8 4.3 47 49-99 20-66 (93) 229 pfam00465 Fe-ADH Iron-containi 26.5 43 0.0011 13.8 7.5 36 112-147 90-127 (312) 230 COG0439 AccC Biotin carboxylas 26.5 43 0.0011 13.8 6.1 29 104-132 103-134 (449) 231 COG1503 eRF1 Peptide chain rel 26.4 43 0.0011 13.7 4.5 25 163-187 249-273 (411) 232 cd06305 PBP1_methylthioribose_ 26.3 43 0.0011 13.7 6.0 77 262-342 164-247 (273) 233 COG0630 VirB11 Type IV secreto 26.3 43 0.0011 13.7 3.6 26 275-300 169-195 (312) 234 COG5054 ERV1 Mitochondrial sul 26.1 39 0.001 14.0 1.6 42 158-199 127-169 (181) 235 TIGR01310 L7 60S ribosomal pro 26.0 35 0.00089 14.4 1.4 39 247-288 147-189 (248) 236 PRK09147 aminotransferase; Pro 25.9 44 0.0011 13.7 9.3 30 257-287 179-208 (397) 237 PRK12795 fliM flagellar motor 25.9 33 0.00085 14.6 1.2 70 80-149 45-141 (378) 238 PRK07682 hypothetical protein; 25.7 44 0.0011 13.7 13.2 30 257-287 167-196 (378) 239 cd07149 ALDH_y4uC Uncharacteri 25.7 44 0.0011 13.7 2.9 15 139-153 177-191 (453) 240 cd07088 ALDH_LactADH-AldA Esch 25.6 45 0.0011 13.6 3.7 125 138-289 186-312 (468) 241 TIGR02881 spore_V_K stage V sp 25.6 45 0.0011 13.6 4.4 72 195-277 114-194 (261) 242 PRK05942 aspartate aminotransf 25.5 45 0.0011 13.6 15.2 32 257-289 183-214 (394) 243 pfam10231 DUF2315 Uncharacteri 25.2 45 0.0012 13.6 2.6 19 258-276 74-92 (127) 244 PRK07212 consensus 25.1 45 0.0012 13.6 12.0 30 257-287 175-204 (378) 245 KOG3248 consensus 25.0 46 0.0012 13.6 2.6 28 260-287 232-266 (421) 246 COG2089 SpsE Sialic acid synth 24.8 46 0.0012 13.5 12.0 13 261-273 251-263 (347) 247 cd06307 PBP1_uncharacterized_s 24.7 46 0.0012 13.5 5.2 73 262-341 169-246 (275) 248 TIGR01075 uvrD DNA helicase II 24.6 46 0.0012 13.5 1.9 158 161-355 108-299 (741) 249 PHA01633 putative glycosyl tra 24.5 40 0.001 14.0 1.4 16 389-404 307-322 (335) 250 PRK10060 RNase II stability mo 24.4 47 0.0012 13.5 4.0 123 263-391 506-642 (663) 251 TIGR03286 methan_mark_15 putat 24.3 47 0.0012 13.5 2.4 28 204-231 169-196 (404) 252 cd07146 ALDH_PhpJ Streptomyces 24.2 36 0.00092 14.3 1.1 123 138-289 173-297 (451) 253 COG3096 MukB Uncharacterized p 23.9 38 0.00096 14.2 1.2 21 62-82 86-106 (1480) 254 cd07082 ALDH_F11_NP-GAPDH NADP 23.6 36 0.00092 14.3 1.0 121 138-288 194-317 (473) 255 COG2204 AtoC Response regulato 23.5 48 0.0012 13.4 2.8 110 138-252 75-218 (464) 256 PRK07590 L,L-diaminopimelate a 23.5 48 0.0012 13.4 10.5 30 258-288 190-219 (409) 257 COG2871 NqrF Na+-transporting 23.5 49 0.0012 13.4 2.5 38 261-298 314-357 (410) 258 TIGR03217 4OH_2_O_val_ald 4-hy 23.4 49 0.0012 13.4 10.9 28 215-242 231-258 (333) 259 PRK07309 aromatic amino acid a 23.3 49 0.0012 13.3 12.6 31 257-288 179-209 (390) 260 TIGR00003 TIGR00003 copper ion 23.2 40 0.001 14.0 1.2 24 208-231 42-65 (66) 261 cd06346 PBP1_ABC_ligand_bindin 23.2 49 0.0013 13.3 7.3 46 264-311 179-225 (312) 262 cd07103 ALDH_F5_SSADH_GabD Mit 23.0 49 0.0013 13.3 2.6 125 139-290 171-297 (451) 263 KOG0053 consensus 23.0 50 0.0013 13.3 4.6 15 366-380 390-404 (409) 264 pfam00580 UvrD-helicase UvrD/R 23.0 50 0.0013 13.3 4.0 151 259-413 184-373 (494) 265 TIGR01386 cztS_silS_copS heavy 22.9 49 0.0012 13.4 1.6 64 263-366 341-407 (483) 266 PRK09140 2-dehydro-3-deoxy-6-p 22.9 50 0.0013 13.3 8.1 64 262-328 20-84 (206) 267 COG1019 Predicted nucleotidylt 22.8 50 0.0013 13.3 2.7 62 219-288 24-86 (158) 268 smart00631 Zn_pept Zn_pept. 22.7 50 0.0013 13.3 3.2 28 61-90 65-92 (277) 269 PRK08674 bifunctional phosphog 22.5 51 0.0013 13.2 5.1 28 324-351 270-297 (328) 270 pfam09161 consensus 22.4 35 0.00089 14.4 0.8 30 279-312 294-323 (391) 271 KOG2599 consensus 22.2 51 0.0013 13.2 8.1 16 259-274 126-141 (308) 272 cd03871 M14_CPB Peptidase M14 22.2 51 0.0013 13.2 3.4 19 79-97 85-109 (300) 273 PRK06253 O-phosphoseryl-tRNA s 22.0 52 0.0013 13.2 4.1 75 261-336 188-267 (527) 274 KOG2792 consensus 22.0 44 0.0011 13.7 1.2 75 207-287 151-232 (280) 275 pfam09226 Endonuc-HincII Restr 21.9 17 0.00044 16.7 -0.9 30 126-157 4-34 (257) 276 PRK12833 acetyl-CoA carboxylas 21.8 52 0.0013 13.1 4.2 31 217-247 258-288 (458) 277 COG5011 Uncharacterized protei 21.4 53 0.0014 13.1 3.6 69 213-282 18-95 (228) 278 cd03872 M14_CPA6 Carboxypeptid 21.4 53 0.0014 13.1 3.1 29 199-227 197-225 (300) 279 TIGR03419 NifU_clost FeS clust 21.4 53 0.0014 13.1 1.6 65 56-132 51-117 (121) 280 cd06333 PBP1_ABC-type_HAAT_lik 21.4 53 0.0014 13.1 9.1 55 265-322 176-231 (312) 281 PRK06427 phosphomethylpyrimidi 21.4 53 0.0014 13.1 2.2 18 367-384 212-229 (266) 282 pfam08963 DUF1878 Protein of u 21.3 28 0.00071 15.1 0.1 39 265-303 9-55 (113) 283 cd02517 CMP-KDO-Synthetase CMP 21.2 53 0.0014 13.1 3.5 38 341-378 195-235 (239) 284 PRK03359 putative electron tra 21.2 54 0.0014 13.0 7.6 28 6-38 5-32 (256) 285 KOG0481 consensus 21.1 54 0.0014 13.0 3.5 71 213-290 448-520 (729) 286 cd07140 ALDH_F1L_FTFDH 10-form 21.1 54 0.0014 13.0 2.9 24 263-289 304-327 (486) 287 COG4301 Uncharacterized conser 21.0 38 0.00096 14.2 0.7 76 212-302 168-248 (321) 288 cd07102 ALDH_EDX86601 Uncharac 20.8 54 0.0014 13.0 3.7 125 138-289 169-294 (452) 289 PRK10919 ATP-dependent DNA hel 20.8 54 0.0014 13.0 4.7 13 35-47 19-31 (672) 290 PRK00915 2-isopropylmalate syn 20.7 55 0.0014 13.0 11.4 80 171-278 113-192 (511) 291 cd01538 PBP1_ABC_xylose_bindin 20.7 55 0.0014 13.0 6.0 74 261-341 171-251 (288) 292 pfam05153 DUF706 Family of unk 20.6 55 0.0014 13.0 2.8 28 250-277 213-241 (248) 293 cd06247 M14_CPO Peptidase M14 20.2 56 0.0014 12.9 3.5 21 331-351 272-293 (298) 294 COG1139 Uncharacterized conser 20.2 56 0.0014 12.9 2.2 19 6-24 117-135 (459) 295 TIGR00109 hemH ferrochelatase; 20.1 56 0.0014 12.9 6.1 60 160-226 126-187 (339) 296 TIGR03374 ABALDH 1-pyrroline d 20.1 56 0.0014 12.9 1.7 124 139-289 190-315 (472) 297 cd06905 Peptidase_M14-like_8 A 20.1 56 0.0014 12.9 3.6 37 61-99 71-113 (360) 298 PRK10916 ADP-heptose:LPS hepto 20.1 56 0.0014 12.9 5.7 110 275-397 178-296 (348) No 1 >TIGR01060 eno phosphopyruvate hydratase; InterPro: IPR000941 Enolase (2-phospho-D-glycerate hydrolase) is an essential glycolytic enzyme that catalyses the interconversion of 2-phosphoglycerate and phosphoenolpyruvate , . In vertebrates, there are 3 different, tissue-specific isoenzymes, designated alpha, beta and gamma. Alpha is present in most tissues, beta is localised in muscle tissue, and gamma is found only in nervous tissue. The functional enzyme exists as a dimer of any 2 isoforms. In immature organs and in adult liver, it is usually an alpha homodimer, in adult skeletal muscle, a beta homodimer, and in adult neurons, a gamma homodimer. In developing muscle, it is usually an alpha/beta heterodimer, and in the developing nervous system, an alpha/gamma heterodimer . The tissue specific forms display minor kinetic differences. Tau-crystallin, one of the major lens proteins in some fish, reptiles and birds, has been shown to be evolutionary related to enolase. Neuron-specific enolase is released in a variety of neurological diseases, such as multiple sclerosis and after seizures or acute stroke. Several tumour cells have also been found positive for neuron-specific enolase. Beta-enolase deficiency is associated with glycogenosis type XIII defect.; GO: 0004634 phosphopyruvate hydratase activity, 0006096 glycolysis, 0000015 phosphopyruvate hydratase complex. Probab=100.00 E-value=0 Score=1511.09 Aligned_cols=421 Identities=67% Similarity=1.052 Sum_probs=415.4 Q ss_pred EEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCC-CCCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 0589986886079784299999984897676634478664775467626898-651680489999999875678975988 Q gi|254780670|r 3 INDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQE-KRYFGKGVLKAIAFVNDEIRTALLGCD 81 (424) Q Consensus 3 I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~-~~~~G~gV~~Av~~in~~i~~~L~g~~ 81 (424) |.+|+|||||||||||||||+|.|.+|.+|||+||||||||++||+||||+| +||.||||++||+|||++|+|+|+|+| T Consensus 1 I~~i~AReilDSRGNPTVev~V~l~~G~~G~A~VPSGASTG~~EAlELRD~Dk~RY~GKGVl~Av~nVn~~I~~~L~G~~ 80 (430) T TIGR01060 1 IKDIRAREILDSRGNPTVEVEVILEDGTFGRAAVPSGASTGEREALELRDGDKKRYLGKGVLKAVENVNEIIAPALIGLD 80 (430) T ss_pred CCEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 92589988872287872899999768976888536787642220455645578754884389999999998786605788 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCE Q ss_conf 55789888646530366432212322220244899875266349848882253245647543000003664556776610 Q gi|254780670|r 82 ARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQ 161 (424) Q Consensus 82 ~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQ 161 (424) +.||..||+.|++||||||||+||||||||||||+|||||+..++||||||+|...++||+|||||||||+||+|.+||| T Consensus 81 A~~Q~~iD~~l~~lDGTpNKs~LGANAILgVSlA~AkAAA~~~~~PLY~YLGG~~~~~LPvPMmNiINGG~HAdN~~D~Q 160 (430) T TIGR01060 81 ATDQREIDQILIELDGTPNKSKLGANAILGVSLAVAKAAAKSLGLPLYRYLGGVNAYVLPVPMMNIINGGAHADNNLDFQ 160 (430) T ss_pred CCCHHHHHHHHHHCCCCCCHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCE T ss_conf 01379999999964588885341521778999999999999717871446541046416876422123652668783521 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCC--CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 2565148877489998999999999998752000--01367655764488684689999999999753853111023431 Q gi|254780670|r 162 EFMIMPVGAENIREAIRMGAEVFHTLKKELKSKG--YSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIAL 239 (424) Q Consensus 162 EfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g--~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~l 239 (424) ||||+|+||++|+|||||++||||+||++|+++| ++|+||||||||||+++|+|||++|.+||++|||+||+||.||| T Consensus 161 EFMI~P~GA~~~~EAlR~gAEVfH~LK~~L~~~g~~~~T~VGDEGGFAPnL~sn~eAL~~~~~AIekAGy~Pg~dv~lAL 240 (430) T TIGR01060 161 EFMIMPVGAKSFREALRMGAEVFHALKKLLKEKGPSLATGVGDEGGFAPNLASNEEALEIIVEAIEKAGYKPGEDVALAL 240 (430) T ss_pred EEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 12204678441689998789999999999876188542432566740588677688999999999850798513235453 Q ss_pred CHHHHHHCCCC--CEEECCCCC---CCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEE Q ss_conf 10465320278--511114686---5687899999998765448699776763104237998864238836850430000 Q gi|254780670|r 240 DCAASAFFKKE--KYILKGENL---ELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFV 314 (424) Q Consensus 240 D~AAsefy~~~--kY~~~~~~~---~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~v 314 (424) ||||||||+++ +|.|+.+++ .+|++||++||++||++|||+||||||+|+|||||+.||+++|+||||||||||| T Consensus 241 D~AaSEFY~~~~~~Y~~~g~~~~~~~lt~~e~~~yy~~L~~~YPI~SIEDgl~E~DweG~~~LT~~lG~~vQiVGDDlFV 320 (430) T TIGR01060 241 DCAASEFYDEETGKYVLKGENKANKQLTSEEMIEYYEELVEKYPIVSIEDGLSEEDWEGWAELTKELGDKVQIVGDDLFV 320 (430) T ss_pred HHHHHHHEECCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHCCEEEEECCCCCC T ss_conf 02312104068615887403042322478999999999972389078527888462899999999738705886088223 Q ss_pred CCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHHH Q ss_conf 21488852100386522342545300389999999999987986999507888723589989998199801007561144 Q gi|254780670|r 315 TNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARSD 394 (424) Q Consensus 315 Tn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~E 394 (424) ||++||++||+++.|||||||||||||||||++||++|+++||.+||||||||||||||||||||+++||||||||+||| T Consensus 321 TN~~~L~kGIe~g~ANsILIK~NQIGTlTET~~Av~lA~~~gY~~viSHRSGETEDttIADLAVA~~aGqIKTGs~sRse 400 (430) T TIGR01060 321 TNTEILKKGIEMGVANSILIKLNQIGTLTETLDAVELAKKAGYTAVISHRSGETEDTTIADLAVALNAGQIKTGSLSRSE 400 (430) T ss_pred CCHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHHCCCCCCCCCCCCHH T ss_conf 76899987654278624775027775488899999999966984899715789831269999998338874116785114 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHCC Q ss_conf 89999999999998311045576253047 Q gi|254780670|r 395 RIAKYNQLIRIEESLGKQAKFAGRSILIH 423 (424) Q Consensus 395 R~aKyNrLlrIee~Lg~~a~~~g~~~~~~ 423 (424) |+||||||||||||||..|+|+|++.|.+ T Consensus 401 RiAKYNqLlrIEeeLg~~A~y~G~~~F~~ 429 (430) T TIGR01060 401 RIAKYNQLLRIEEELGDSARYAGKNAFYR 429 (430) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 78999889999998556301045102578 No 2 >PRK00077 eno phosphopyruvate hydratase; Provisional Probab=100.00 E-value=0 Score=1282.56 Aligned_cols=423 Identities=65% Similarity=1.020 Sum_probs=418.2 Q ss_pred CEEEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCC-CCCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 910589986886079784299999984897676634478664775467626898-6516804899999998756789759 Q gi|254780670|r 1 MFINDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQE-KRYFGKGVLKAIAFVNDEIRTALLG 79 (424) Q Consensus 1 m~I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~-~~~~G~gV~~Av~~in~~i~~~L~g 79 (424) .+|++|++||||||||||||||||++++|.+||++||||||||+|||+||||++ ++|+||||.+||+|||+.|+|+|+| T Consensus 2 ~~I~~i~~r~IlDSRG~PTVEvev~~~~g~~gra~vPSGAStG~~EA~elrD~d~~ry~GkgV~~Av~~vn~~I~~~L~G 81 (427) T PRK00077 2 SKIEDVIAREILDSRGNPTVEAEVLLEDGTFGRAAVPSGASTGEREAVELRDGDKSRYLGKGVLKAVENVNEEIAPALIG 81 (427) T ss_pred CCCEEEEEEEEECCCCCEEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 95207999999769997248999998899889994457767776651120579975257702999999999988999779 Q ss_pred CCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCC Q ss_conf 88557898886465303664322123222202448998752663498488822532456475430000036645567766 Q gi|254780670|r 80 CDARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALD 159 (424) Q Consensus 80 ~~~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~ 159 (424) ++++||.+||++|++||||+|||+||||+|+|||||+|||+|+.+++|||+||++...++||+|||||||||+||+|++| T Consensus 82 ~~~~dQ~~iD~~LielDgT~nKs~lGaNailavSlA~akAaA~~~~~pLy~yl~~~~~~~lP~Pm~NIinGG~HA~n~ld 161 (427) T PRK00077 82 LDAFDQAGIDKAMIELDGTENKSKLGANAILAVSLAVAKAAAASLGLPLYRYLGGPNAYSLPVPMMNIINGGAHADNNVD 161 (427) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCEEEEECCCCCCCCCCC T ss_conf 97462899999999732985412244267899999999998776189699953766665478422787426446899766 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 10256514887748999899999999999875200001367655764488684689999999999753853111023431 Q gi|254780670|r 160 FQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIAL 239 (424) Q Consensus 160 iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~l 239 (424) ||||||+|.++.||+|+|+|++||||+||++|++++++|+|||||||||++++++++|++|++||+++||+||+||+||| T Consensus 162 iQEfmIiP~ga~sf~eal~~~~ev~~~Lk~~L~~kg~~t~vGDEGGfaP~~~~~eeaL~ll~~AI~~aGy~~G~di~ial 241 (427) T PRK00077 162 IQEFMIMPVGAPSFKEALRMGAEVFHALKKVLKDKGLSTAVGDEGGFAPNLKSNEEALELILEAIEKAGYEPGEDIALAL 241 (427) T ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEEE T ss_conf 14788717998888999999999999999999865998765678774899988499999999999970888787089972 Q ss_pred CHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHH Q ss_conf 10465320278511114686568789999999876544869977676310423799886423883685043000021488 Q gi|254780670|r 240 DCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPER 319 (424) Q Consensus 240 D~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~r 319 (424) ||||||||+++||.|..++..+|++||++||.+|+++|||+||||||+||||++|++||+++|++||||||||||||++| T Consensus 242 D~AASefy~~gkY~~~~~~~~~s~~eli~~y~~l~~~YPI~sIEDp~~EdD~~gw~~lt~~lg~k~~iVGDDL~vTn~~r 321 (427) T PRK00077 242 DCAASEFYEDGKYVLEGEGKVLTSEEQIDYLAELVDKYPIVSIEDGLDENDWDGWALLTERLGDKVQLVGDDLFVTNPKI 321 (427) T ss_pred CCCHHHHHHCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHHHHCCCEEEECCCCCCCCHHH T ss_conf 42067675278246547776458899999999999879977874688822689999999970996799888413468899 Q ss_pred HHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHHHHHHHH Q ss_conf 85210038652234254530038999999999998798699950788872358998999819980100756114489999 Q gi|254780670|r 320 LHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARSDRIAKY 399 (424) Q Consensus 320 l~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~ER~aKy 399 (424) |++||++++||++|||||||||||||++++++|+++||.+|||||||||||+||||||||+||+|||+|+|+||||+||| T Consensus 322 l~kgi~~~~~NaiLiK~NQiGTvset~eai~la~~~g~~~ivShRSGETeD~~IaDLAVg~ga~~iK~Ga~~R~ER~aKy 401 (427) T PRK00077 322 LKEGIEKGAANSILIKVNQIGTLTETLEAIELAKRAGYTAVISHRSGETEDTTIADLAVATNAGQIKTGSLSRSERVAKY 401 (427) T ss_pred HHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHCCCCHHHCCCCCCHHHHHHH T ss_conf 97255407664799844346739999999999998698799978989866215999898608973205898407889999 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHCC Q ss_conf 999999998311045576253047 Q gi|254780670|r 400 NQLIRIEESLGKQAKFAGRSILIH 423 (424) Q Consensus 400 NrLlrIee~Lg~~a~~~g~~~~~~ 423 (424) |||||||||||++++|+|+++|++ T Consensus 402 NrLlrIeeeLg~~a~~~g~~~~~~ 425 (427) T PRK00077 402 NQLLRIEEELGEKAVYAGRKAFYN 425 (427) T ss_pred HHHHHHHHHHHHCCEECCHHHHHC T ss_conf 999999998621467476666502 No 3 >cd03313 enolase Enolase: Enolases are homodimeric enzymes that catalyse the reversible dehydration of 2-phospho-D-glycerate to phosphoenolpyruvate as part of the glycolytic and gluconeogenesis pathways. The reaction is facilitated by the presence of metal ions. Probab=100.00 E-value=0 Score=1237.04 Aligned_cols=404 Identities=63% Similarity=0.999 Sum_probs=398.2 Q ss_pred EEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCC-CCCCCHHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 899868860797842999999848976766344786647754676268986-5168048999999987567897598855 Q gi|254780670|r 5 DIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEK-RYFGKGVLKAIAFVNDEIRTALLGCDAR 83 (424) Q Consensus 5 ~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~-~~~G~gV~~Av~~in~~i~~~L~g~~~~ 83 (424) +|+|||||||||||||||||++++|.+|||+||||||||+|||+||||+++ +|+||||.+||+|||+.|+|+|+|++++ T Consensus 1 kv~areIlDSRG~PTVevev~~~~g~~gra~vPSGAStG~~EA~elrD~d~~ry~GkgV~~Av~~vn~~I~~~L~G~~~~ 80 (408) T cd03313 1 KIKAREILDSRGNPTVEVEVTTEDGGVGRAAVPSGASTGEHEAVELRDGDKSRYLGKGVLKAVKNVNEIIAPALIGMDVT 80 (408) T ss_pred CEEEEEEECCCCCEEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHH T ss_conf 94589986489980399999989999899954577678873556714799531478159999999999889997699867 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEE Q ss_conf 78988864653036643221232222024489987526634984888225324564754300000366455677661025 Q gi|254780670|r 84 DQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQEF 163 (424) Q Consensus 84 dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEf 163 (424) ||.+||++|++||||+|||+||||+|||||||+|||+|+.+++|||+||++...++||+|||||||||+||+|++||||| T Consensus 81 dQ~~iD~~LielDgT~nKs~lGaNailavSlA~akAaA~~~~~pLy~yl~~~~~~~lP~P~~NIinGG~HA~n~ldiQEf 160 (408) T cd03313 81 DQRAIDKLLIELDGTPNKSKLGANAILGVSLAVAKAAAAALGLPLYRYLGGLAAYVLPVPMFNVINGGAHAGNKLDFQEF 160 (408) T ss_pred HHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCCEEEEECCCCCCCCCCCCEEE T ss_conf 68999999997208866344651799999999999988764994999845555556774226874374568998763279 Q ss_pred EEECCCCCCHHHHHHHHHHHHHHHHHHHHHCC--CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCH Q ss_conf 65148877489998999999999998752000--0136765576448868468999999999975385311102343110 Q gi|254780670|r 164 MIMPVGAENIREAIRMGAEVFHTLKKELKSKG--YSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDC 241 (424) Q Consensus 164 mIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g--~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~ 241 (424) ||+|.++.||+|+|||++||||+||++|++++ ++|+|||||||||++++++++|++|++||+++||+||+||+||||| T Consensus 161 mI~P~ga~sf~eal~~~~ev~~~lk~~L~~k~~~~~t~vgDEGGfaP~~~~~eeaL~li~~AI~~aGy~~G~di~ialD~ 240 (408) T cd03313 161 MIVPVGAPSFSEALRMGAEVYHTLKKVLKKKGGLLATNVGDEGGFAPNLSSNEEALDLLVEAIEKAGYEPGKKIAIALDV 240 (408) T ss_pred EEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCEEEEECC T ss_conf 98579988889999999999999999998737877775266767289999859999999999997088888727998422 Q ss_pred HHHHHCCCCCEEECC-CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHH Q ss_conf 465320278511114-6865687899999998765448699776763104237998864238836850430000214888 Q gi|254780670|r 242 AASAFFKKEKYILKG-ENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERL 320 (424) Q Consensus 242 AAsefy~~~kY~~~~-~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl 320 (424) ||||||++++|.|.. ++..+|++||++||.+|+++|||+||||||+||||+||++||+++|++||||||||||||++|| T Consensus 241 AASEfy~~~~y~y~~~~~~~~s~~e~i~~y~~l~~kYPIisIEDp~~E~D~~gw~~lt~~lg~~~qiVGDDL~vTn~~rl 320 (408) T cd03313 241 AASEFYDEGKYVYDSDEGKKLTSEELIDYYKELVKKYPIVSIEDPFDEDDWEGWAKLTAKLGDKIQIVGDDLFVTNPERL 320 (408) T ss_pred CHHHHHHCCCEEEECCCCCEECHHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHH T ss_conf 16766316761473588851089999999999997598778856887212789999998659807998874244788998 Q ss_pred HHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHHHHHHHHH Q ss_conf 52100386522342545300389999999999987986999507888723589989998199801007561144899999 Q gi|254780670|r 321 HKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARSDRIAKYN 400 (424) Q Consensus 321 ~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~ER~aKyN 400 (424) ++||++++||++|||||||||||||+|++++|+++||.+|||||||||||+||||||||+||+|||+|+|+||||+|||| T Consensus 321 ~~gi~~~~~naiLiK~NQiGTvset~ea~~la~~~g~~~ivShRSGETeD~~IaDLAVg~ga~~iK~G~~~r~ER~aKyN 400 (408) T cd03313 321 KKGIEKKAANALLIKVNQIGTLTETIEAIKLAKKNGYGVVVSHRSGETEDTFIADLAVALGAGQIKTGAPCRSERTAKYN 400 (408) T ss_pred HHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHCCCCCEECCCCCCHHHHHHHH T ss_conf 60565176637998054577299999999999986987999789988650179998987089800258984168899999 Q ss_pred HHHHHHHH Q ss_conf 99999998 Q gi|254780670|r 401 QLIRIEES 408 (424) Q Consensus 401 rLlrIee~ 408 (424) |||||||| T Consensus 401 rLlrIeee 408 (408) T cd03313 401 QLLRIEEE 408 (408) T ss_pred HHHHHHCC T ss_conf 87886349 No 4 >PTZ00081 enolase (2-phospho-D-glycerate hydrolase); Provisional Probab=100.00 E-value=0 Score=1226.92 Aligned_cols=420 Identities=53% Similarity=0.838 Sum_probs=404.4 Q ss_pred CEEEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCC-CCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 9105899868860797842999999848976766344786647754676268986-516804899999998756789759 Q gi|254780670|r 1 MFINDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEK-RYFGKGVLKAIAFVNDEIRTALLG 79 (424) Q Consensus 1 m~I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~-~~~G~gV~~Av~~in~~i~~~L~g 79 (424) |+|++|++||||||||||||||||++++| .+|++||||||||+|||+||||+++ +|+||||.+||+|||+.|+|+|+| T Consensus 2 ~~I~~v~areIlDSRG~PTVEveV~~~~g-~~ra~vPSGAStG~~EA~elrD~~~~ry~GkgV~~Av~~in~~I~~~L~G 80 (442) T PTZ00081 2 MVIQKVHAREILDSRGNPTVEVDLTTEAG-VFRAAVPSGASTGIYEALELRDGDKSRYLGKGVLKAVKNVNEVLAPALVG 80 (442) T ss_pred CCEEEEEEEEEECCCCCEEEEEEEEECCC-CEEEECCCCCCCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 93158999999769997248999997889-77886557877776550021689964568865999999998887999869 Q ss_pred CCHHHHHHHHHHHHH-CCCCCC-----CCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCC-----CCCCCCCCCCEEEC Q ss_conf 885578988864653-036643-----2212322220244899875266349848882253-----24564754300000 Q gi|254780670|r 80 CDARDQLLIDKIMID-LDGTPN-----KSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGG-----CSAHILPVPLMNIL 148 (424) Q Consensus 80 ~~~~dQ~~iD~~Li~-lDgT~n-----ks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~-----~~~~~lP~P~~Nii 148 (424) +++.||.+||++|++ ||||+| ||+||||+++|||||+|||+|+.+++|||+||++ ...++||+|||||| T Consensus 81 ~~~~dQ~~iD~~li~~lDgT~n~~~~~Ks~lGaNailAvSlA~akAaA~~~~~pLy~yl~~l~~~~~~~~~lPvP~~NIi 160 (442) T PTZ00081 81 KDELDQKELDKLMVEELDGTKNEWGYCKSKLGANAILGCSMAISRAAAAAKGVPLYRYLAQLAGKPTDQIVLPVPCLNVI 160 (442) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCEECCCCEEEEE T ss_conf 98788887789999873599664310132068157889999999999987499399998876178777622164316763 Q ss_pred CCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHC--CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 3664556776610256514887748999899999999999875200--00136765576448868468999999999975 Q gi|254780670|r 149 NGGIHADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSK--GYSTNVGDEGGFSPNLKTADSALDLIANSIEK 226 (424) Q Consensus 149 nGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~--g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~ 226 (424) |||+||+|++|||||||+|.++.||+|+|+|++||||+||++|+++ +.+|+|||||||+|+++++|++|++|++||++ T Consensus 161 nGG~HA~n~ldiQEfMIiP~ga~sf~eal~~~~ev~~~lk~~L~~k~g~~~t~vgDEGGfaP~~~~~eeaL~li~eAi~~ 240 (442) T PTZ00081 161 NGGSHAGNSLPFQEFMIAPVGAPSFREALRMGAEVYHTLKSIIKKKYGQDATNVGDEGGFAPNIKSANEALDLLVEAIEK 240 (442) T ss_pred CCCCCCCCCCCCCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 27456898888426998789998789999999999999999998760888676577767079999979999999999997 Q ss_pred CCCCCCCCHHHHHCHHHHHHCCCCC--EEECC---CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHC Q ss_conf 3853111023431104653202785--11114---686568789999999876544869977676310423799886423 Q gi|254780670|r 227 AGYNAGKDLLIALDCAASAFFKKEK--YILKG---ENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKI 301 (424) Q Consensus 227 aGy~~g~di~i~lD~AAsefy~~~k--Y~~~~---~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~ 301 (424) +||+ .+|.|||||||||||++++ |.+.+ ++..+|++||++||.+|+++|||+||||||+||||+||++||+++ T Consensus 241 aG~~--~~i~ialD~AAsefy~~~~~~Y~l~~k~~~~~~~s~~eli~~y~~l~~~YPI~sIEDpl~E~D~~gw~~lt~~l 318 (442) T PTZ00081 241 AGHE--GKVAICMDVAASEFYDAEKKMYNLTFKCPEPTYKTGAELVETYEDWVAKYPIVSIEDPFDQDDWEEYAKITEAL 318 (442) T ss_pred CCCC--CCCCEECCHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCHHHHHHHHHHHHHC T ss_conf 0777--56542002356777636896377533057777259999999999998309878995898846799999999970 Q ss_pred CCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHC Q ss_conf 88368504300002148885210038652234254530038999999999998798699950788872358998999819 Q gi|254780670|r 302 GSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATN 381 (424) Q Consensus 302 g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~ 381 (424) |++||||||||||||++||++||++++||++|||||||||||||+|++++|+++||.+|||||||||||+||||||||++ T Consensus 319 g~~~~iVGDDL~vTn~~rl~~gi~~~~~naiLIK~NQiGTlset~eai~~A~~~g~~~ivShRSGETeD~~iaDLAVg~g 398 (442) T PTZ00081 319 GGKVQIVGDDLLVTNPKRIKTALEKKACNALLLKVNQIGSITEAIEACKLCMKNGWGVMVSHRSGETEDTFIADLVVGLG 398 (442) T ss_pred CCCEEEECCCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHCC T ss_conf 99889989765576899999998548776389833232209999999999998798799957988766300888777448 Q ss_pred CCHHEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCC Q ss_conf 980100756114489999999999998311045576253047 Q gi|254780670|r 382 CGQIKTGSLARSDRIAKYNQLIRIEESLGKQAKFAGRSILIH 423 (424) Q Consensus 382 a~~iK~G~~~R~ER~aKyNrLlrIee~Lg~~a~~~g~~~~~~ 423 (424) |+|||+|+|+||||+||||||||||||||++|+|||.+||.. T Consensus 399 a~~iK~G~~~r~ER~aKyNrLLrIeeeLg~~a~~~g~~~~~~ 440 (442) T PTZ00081 399 TGQIKTGAPCRSERNAKYNQLLRIEEELGSRATYAGPGFRTQ 440 (442) T ss_pred CCCEECCCCCCHHHHHHHHHHHHHHHHHCCCCEECCCCCCCC T ss_conf 982033898417889999999999998654537377565766 No 5 >COG0148 Eno Enolase [Carbohydrate transport and metabolism] Probab=100.00 E-value=0 Score=1226.66 Aligned_cols=421 Identities=65% Similarity=1.020 Sum_probs=415.9 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 10589986886079784299999984897676634478664775467626898651680489999999875678975988 Q gi|254780670|r 2 FINDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCD 81 (424) Q Consensus 2 ~I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~ 81 (424) .|++|++|+||||||||||||+|++++|..|||+||||||||.|||+||||+|+||+||||.+||++||++|+|+|+|+| T Consensus 3 ~I~~i~aReIlDSRGnpTVEveV~~~~g~~g~a~vPSGAStG~~EavElrdgd~ry~gkGV~~AV~nVn~~Iap~LiG~d 82 (423) T COG0148 3 AIEDVIAREILDSRGNPTVEVEVTLEDGFGGRAAVPSGASTGEHEAVELRDGDSRYLGKGVLKAVANVNEIIAPALIGLD 82 (423) T ss_pred CCCEEEEEEEECCCCCCEEEEEEEECCCCCCEEECCCCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 10005778887278982189999976887521326788888873567853776311452199999999877789874998 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCE Q ss_conf 55789888646530366432212322220244899875266349848882253245647543000003664556776610 Q gi|254780670|r 82 ARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQ 161 (424) Q Consensus 82 ~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQ 161 (424) +.||..||+.|++||||+|||+||||+|||||||+|||+|+++++|||+||+|...+.||+|||||||||+||+|.+||| T Consensus 83 a~dQ~~ID~~lielDGT~Nks~lGaNailgVS~AvAkAAA~~l~~PLy~YlGG~~a~~lPvPm~NvinGG~HA~n~~d~Q 162 (423) T COG0148 83 ATDQALIDSLLIELDGTENKSKLGANAILGVSLAVAKAAAASLGIPLYRYLGGLNALVLPVPMMNVINGGAHADNNLDIQ 162 (423) T ss_pred CCCHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCCEEEEECCCCCCCCCCCCE T ss_conf 42478999999980298762446608999999999999997659948998557666505654056423512478875314 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCH Q ss_conf 25651488774899989999999999987520000136765576448868468999999999975385311102343110 Q gi|254780670|r 162 EFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDC 241 (424) Q Consensus 162 EfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~ 241 (424) ||||+|+|++||.|++||++||||+||++|+++|++|+|||||||+|+++++++||++|.+||+++||+||+||.+|||| T Consensus 163 EFmI~p~ga~sf~ealr~~~ev~h~lk~~l~~~g~~t~vGDEGgfAP~l~~~eeald~i~~Aie~agy~~g~~i~~alD~ 242 (423) T COG0148 163 EFMIMPVGAESFKEALRAGAEVFHHLKKLLKEKGLSTGVGDEGGFAPNLKSNEEALDILVEAIEEAGYEPGEDIALALDV 242 (423) T ss_pred EEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCEEEEEHH T ss_conf 67884067677999999888999999999865376455567766379887648999999999998277877524654004 Q ss_pred HHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHH Q ss_conf 46532027851111468656878999999987654486997767631042379988642388368504300002148885 Q gi|254780670|r 242 AASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLH 321 (424) Q Consensus 242 AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~ 321 (424) ||||||++++|+|+++ .+|++||++||.+|+++|||+||||||+|+||+||..||+++|+++|||||||||||++||+ T Consensus 243 Aasefy~~~~Y~~~~~--~~~~~e~i~~~~~Lv~~YpivsiEDpl~E~Dweg~~~lt~~~g~kvqivGDDLfvTN~~~l~ 320 (423) T COG0148 243 AASEFYKDGKYVLEGE--SLTSEELIEYYLELVKKYPIVSIEDPLSEDDWEGFAELTKRLGDKVQIVGDDLFVTNPKRLK 320 (423) T ss_pred HHHHHCCCCEEEECCC--CCCHHHHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHHHHCCEEEEECCCCEECCHHHHH T ss_conf 4555302770463376--46879999999999976997897089884307899999986098389976754232889999 Q ss_pred HHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHHHHHHHHHH Q ss_conf 21003865223425453003899999999999879869995078887235899899981998010075611448999999 Q gi|254780670|r 322 KGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARSDRIAKYNQ 401 (424) Q Consensus 322 ~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~ER~aKyNr 401 (424) +||+++++|++|||||||||||||++++++|+++||.+|||||||||||+||||||||++|+|||||+|+||||+||||| T Consensus 321 ~gi~~~~aNaiLIK~NQIGTLTEt~~ai~~A~~~gy~~viSHRSGETeD~tIAdLAVa~~agqIKTGs~sRseRiaKyNq 400 (423) T COG0148 321 KGIEKGAANAILIKPNQIGTLTETLEAINLAKDAGYTAVISHRSGETEDTTIADLAVATNAGQIKTGSLSRSERVAKYNE 400 (423) T ss_pred HHHHHCCCCEEEEECHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHCCCEEECCCCCCHHHHHHHHH T ss_conf 88874567538995200231899999999999879808875377886643388899981898011378764147999999 Q ss_pred HHHHHHHHHHCCCCCCHHHHCCC Q ss_conf 99999983110455762530479 Q gi|254780670|r 402 LIRIEESLGKQAKFAGRSILIHQ 424 (424) Q Consensus 402 LlrIee~Lg~~a~~~g~~~~~~~ 424 (424) ||||||+||+.++|+|+++|++. T Consensus 401 LlrIEeeLg~~a~y~g~~~f~~~ 423 (423) T COG0148 401 LLRIEEELGDKARYAGIKEFKGL 423 (423) T ss_pred HHHHHHHHHHCCCCCCHHHHCCC T ss_conf 99999986221224886753459 No 6 >KOG2670 consensus Probab=100.00 E-value=0 Score=1159.63 Aligned_cols=417 Identities=53% Similarity=0.835 Sum_probs=403.3 Q ss_pred CEEEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCC-CCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 9105899868860797842999999848976766344786647754676268986-516804899999998756789759 Q gi|254780670|r 1 MFINDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEK-RYFGKGVLKAIAFVNDEIRTALLG 79 (424) Q Consensus 1 m~I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~-~~~G~gV~~Av~~in~~i~~~L~g 79 (424) |.|.+|++|||||||||||||||++|++| ++|++||||||||.|||+||||||+ +|.||||.+||.|||+.|+|+|++ T Consensus 1 m~~~kv~aR~I~dSRGnPTVEVdL~T~~G-~fRaavPSGAStGi~EAlELrDgdK~~y~GkgV~kaV~niN~~i~pali~ 79 (433) T KOG2670 1 MSIIKVKARQIYDSRGNPTVEVDLTTEKG-VFRAAVPSGASTGIYEALELRDGDKSKYMGKGVLKAVGNINNTIAPALIK 79 (433) T ss_pred CCCEEEEHHHHHCCCCCCEEEEEEEECCC-EEEEECCCCCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 97326434556315899527899871475-26531678865233666640048733015510999987677776699871 Q ss_pred C--CHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCC---C-CCCCCCCCEEECCCCCC Q ss_conf 8--8557898886465303664322123222202448998752663498488822532---4-56475430000036645 Q gi|254780670|r 80 C--DARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGC---S-AHILPVPLMNILNGGIH 153 (424) Q Consensus 80 ~--~~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~---~-~~~lP~P~~NiinGG~H 153 (424) + ++.+|..||++|++||||+|||+||||+||+||+|+|||+|+.+++|||+||++. . ++.||||.|||||||.| T Consensus 80 ~~~dv~~Q~~iD~~mi~LDGTeNKsklGaNaIlgvSlavckagAa~k~vplykhia~lag~~~~~vlPVPaFNVlNGGsH 159 (433) T KOG2670 80 KNLDVTDQKAIDNFMIELDGTENKSKLGANAILGVSLAVCKAGAAEKGVPLYKHIADLAGNKQPYVLPVPAFNVLNGGSH 159 (433) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCCCCCEEECCCCEEEECCCCC T ss_conf 47880327889889885158764121463046688899876214214882999999862899752742552244037755 Q ss_pred CCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHC--CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 56776610256514887748999899999999999875200--0013676557644886846899999999997538531 Q gi|254780670|r 154 ADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSK--GYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNA 231 (424) Q Consensus 154 A~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~--g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~ 231 (424) |+|+|.+|||||.|+|+.||+|+|||++||||+||.++++| ...|+|||||||||++.+++|+|++|.+||.++||+. T Consensus 160 AGn~lAmQEfMIlP~ga~sf~eamr~GsevYh~LK~vik~kyG~~atnVGDEGGfAPnI~~~~E~L~Li~~Ai~kagyt~ 239 (433) T KOG2670 160 AGNKLAMQEFMILPVGADSFAEAMRMGSEVYHHLKSVIKEKYGADATNVGDEGGFAPNIQTNEEALDLIKEAINKAGYTG 239 (433) T ss_pred CCCHHHHHHHEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCC T ss_conf 55421033110124571569999986289999999999987075545656666769985444889999999999659888 Q ss_pred CCCHHHHHCHHHHHHCCCCCEEECCCC------CCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCE Q ss_conf 110234311046532027851111468------65687899999998765448699776763104237998864238836 Q gi|254780670|r 232 GKDLLIALDCAASAFFKKEKYILKGEN------LELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNC 305 (424) Q Consensus 232 g~di~i~lD~AAsefy~~~kY~~~~~~------~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~ 305 (424) +|.||||+||||||+++||+++++. ..+|+++|.|+|.+++++|||+||||||++|||++|++|... ..| T Consensus 240 --kikIgmDvAaseF~~dgkYDLdfk~~~~d~s~~~s~~~L~dlY~~~~k~yPivSiEDPFdqdDw~~w~~~~~~--~~i 315 (433) T KOG2670 240 --KVKIGMDVAASEFYKDGKYDLDFKSPNSDPSRWLSGDQLADLYKSFIKDYPIVSIEDPFDQDDWEAWSKFFKE--VGI 315 (433) T ss_pred --CEEEEEEECHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHC--CCE T ss_conf --5489875123554127754455768999712324889999999999860976641288650238899887635--644 Q ss_pred EEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHH Q ss_conf 85043000021488852100386522342545300389999999999987986999507888723589989998199801 Q gi|254780670|r 306 QLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQI 385 (424) Q Consensus 306 ~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~i 385 (424) |||||||+||||+|++++|++++||++|+|+||||||||+++++++|+++||.+|||||||||||+||+||.||++++|| T Consensus 316 qiVgDDLtvTnpkri~~Ai~~k~cN~LLlKvNQIGtvtEsieA~~~a~~~GwgvMvSHRSGETeDtFIaDL~VGl~tgqI 395 (433) T KOG2670 316 QIVGDDLTVTNPKRIATAIEEKACNALLLKVNQIGTVTESIEAAKLARSAGWGVMVSHRSGETEDTFIADLVVGLGTGQI 395 (433) T ss_pred EEECCCCCCCCHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHCCCCC T ss_conf 99637532158899999998741141575301153049999999999875960787525788643026663321012401 Q ss_pred EECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHC Q ss_conf 0075611448999999999999831104557625304 Q gi|254780670|r 386 KTGSLARSDRIAKYNQLIRIEESLGKQAKFAGRSILI 422 (424) Q Consensus 386 K~G~~~R~ER~aKyNrLlrIee~Lg~~a~~~g~~~~~ 422 (424) |+|+||||||++|||||||||||||+.++|||++|+. T Consensus 396 KtGApcRsERlaKYNqLLRIeEelg~~a~~AG~~f~~ 432 (433) T KOG2670 396 KTGAPCRSERLAKYNQLLRIEEELGDDARYAGENFRN 432 (433) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHCCCCEECCCCCCC T ss_conf 2689750777888888999999856563662313468 No 7 >pfam00113 Enolase_C Enolase, C-terminal TIM barrel domain. Probab=100.00 E-value=0 Score=841.60 Aligned_cols=286 Identities=53% Similarity=0.869 Sum_probs=277.1 Q ss_pred CCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCC--CCCCCCCCCCCCCCCCCHH Q ss_conf 56475430000036645567766102565148877489998999999999998752000--0136765576448868468 Q gi|254780670|r 137 AHILPVPLMNILNGGIHADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKG--YSTNVGDEGGFSPNLKTAD 214 (424) Q Consensus 137 ~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g--~~t~vgDEGGfap~~~~~e 214 (424) +++||+|||||||||+||+|++|||||||+|.++.||+|+|+|++||||+||++|++++ .+++|||||||+|++++++ T Consensus 1 p~~lP~P~~NiinGG~HA~~~ldiQEfmIiP~ga~sf~eal~~~~evy~~lk~iL~~k~~~~~~~vgDEGGfaP~~~~~e 80 (296) T pfam00113 1 PYVLPVPMMNVINGGSHAGNKLAMQEFMILPTGASSFTEAMRMGSEVYHNLKSVIKAKYGQDATNVGDEGGFAPNIQSNK 80 (296) T ss_pred CCCCCCCEEEEECCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHH T ss_conf 99888340556437025899988858998569999999999999999999999999747877787577878799998819 Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCC--CCCEEECCCC------CCCCHHHHHHHHHHHHHHCCEEEEECCC Q ss_conf 9999999999753853111023431104653202--7851111468------6568789999999876544869977676 Q gi|254780670|r 215 SALDLIANSIEKAGYNAGKDLLIALDCAASAFFK--KEKYILKGEN------LELQPNEMASYLANLVDQYPIYSIEDGM 286 (424) Q Consensus 215 eaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~--~~kY~~~~~~------~~~t~~elid~y~~l~~~yPI~sIEDP~ 286 (424) ++|++|++||+++||++ ||+|||||||||||+ +++|.+++++ ..+|++||+|||.+||++|||+|||||| T Consensus 81 ~aL~li~eAi~~ag~~~--di~ialD~AAsefy~~~~g~Y~l~~~~~~~~~~~~~t~~elid~y~~l~~~yPIisIEDpl 158 (296) T pfam00113 81 EALDLIVEAIEKAGYTG--KVKIAMDVASSEFYNKKDGKYDLDFKNPKSDPSKWLTSDQLADLYKELIKKYPIVSIEDPF 158 (296) T ss_pred HHHHHHHHHHHHCCCCC--CEEEEECCCHHHHHCCCCCEEEECCCCCCCCCCCEECHHHHHHHHHHHHHHCCEEEEECCC T ss_conf 99999999999748988--6489960256866216797167425656777562208999999999999869988996798 Q ss_pred CCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCC Q ss_conf 31042379988642388368504300002148885210038652234254530038999999999998798699950788 Q gi|254780670|r 287 SEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSG 366 (424) Q Consensus 287 ~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSG 366 (424) +|+||+||++||+++|++||||||||||||++||++||++++||++|||||||||||||++++++|+++||.+||||||| T Consensus 159 ~e~D~~gw~~lt~~lg~~~qiVGDDL~~Tn~~rl~~gi~~~~~NaiLiK~NQiGTlset~e~~~~ak~~g~~~ivShRSG 238 (296) T pfam00113 159 DEDDWEAWKKLTASLGDKIQIVGDDLTVTNPKRIAKAIEKKACNSLLLKVNQIGSVTESLAAVKMAKDAGWGVMVSHRSG 238 (296) T ss_pred CCHHHHHHHHHHHHHCCCEEEECCCCCCCCHHHHHHHHHCCCCCCEEEECHHHHHHHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 81249999999997499579989875567999999998548777169712111019999999999998696499866988 Q ss_pred CCCHHHHHHHHHHHCCCHHEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCC Q ss_conf 8723589989998199801007561144899999999999983110455762530479 Q gi|254780670|r 367 ETEDHTIADLAVATNCGQIKTGSLARSDRIAKYNQLIRIEESLGKQAKFAGRSILIHQ 424 (424) Q Consensus 367 ETeD~~iaDLAVg~~a~~iK~G~~~R~ER~aKyNrLlrIee~Lg~~a~~~g~~~~~~~ 424 (424) ||||+||||||||++|+|||+|+|+||||++|||||||||||||++|+|||++||.-| T Consensus 239 ETeD~~iaDLAVg~~a~~iK~G~~~R~ER~aKyNrLlrIeeeLg~~a~~~g~~~~~~~ 296 (296) T pfam00113 239 ETEDTFIADLVVGLNAGQIKTGAPCRSERLAKYNQLLRIEEELGSKAKYAGRNFRKPQ 296 (296) T ss_pred CCCCCHHHHHHHHCCCCCEECCCCCCHHHHHHHHHHHHHHHHHCCCCEECCCCCCCCC T ss_conf 7765229998886088812447996313468899999999986545574886678999 No 8 >PRK08350 hypothetical protein; Provisional Probab=100.00 E-value=0 Score=839.28 Aligned_cols=337 Identities=27% Similarity=0.432 Sum_probs=302.4 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 10589986886079784299999984897676634478664775467626898651680489999999875678975988 Q gi|254780670|r 2 FINDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCD 81 (424) Q Consensus 2 ~I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~ 81 (424) +|++|+|||||||||||||||||++++| +|||+||||+|+|.||+. +.+||+|||++|+|+|+|+| T Consensus 3 ~I~~I~aReIlDSRGnPTVEvdv~~~~g-~gra~vPs~~~~~~~~~~-------------~~kAV~nVn~~Iap~LiG~d 68 (341) T PRK08350 3 VIQNIIGRVVVLRGGKYSVEVDVITDEG-FGRFAAPIDENPMLYIAE-------------ARRAVSEVDEIIGPELIGFD 68 (341) T ss_pred CCCEEEEEEEECCCCCCEEEEEEEECCC-EEEEECCCCCCCCCCCCC-------------CCHHHHHHHHHHHHHHCCCC T ss_conf 7747987899879998508999998895-389976359997501002-------------11378999988899987998 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCE Q ss_conf 55789888646530366432212322220244899875266349848882253245647543000003664556776610 Q gi|254780670|r 82 ARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQ 161 (424) Q Consensus 82 ~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQ 161 (424) +.||..||++|++||||+|||+||||+|||||||+|||+|+.+++|||+||++...+.||+|||||+||+ .+ T Consensus 69 ~~dQ~~ID~~Li~LDGT~NKs~LGANAiLaVSlA~akAaA~~~~~PLy~ylgg~~~~~LPvPmmNIiNG~--------~~ 140 (341) T PRK08350 69 ASEQELIDSYLWEIDGTEDFSHIGANTALAVSVAVAKAAANSKNMPLYSYIGGTFTTELPVPILEFASDE--------NF 140 (341) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCCHHCCCCH--------HH T ss_conf 6658999999998329987355574333998999999998505986899868677776890302103758--------79 Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCH Q ss_conf 25651488774899989999999999987520000136765576448868468999999999975385311102343110 Q gi|254780670|r 162 EFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDC 241 (424) Q Consensus 162 EfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~ 241 (424) |||++|.....+.+.+++..++++.+ .+.+|+.+.+||+++||++|.||.++++. T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~-------------------------~e~ale~i~~Aie~aG~~~G~di~l~~~~ 195 (341) T PRK08350 141 DYYVMVRDLLEVTDIVDAVNKVLENL-------------------------KEATVEELSKASEKVGSELGLEVALGIAM 195 (341) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH-------------------------HHHHHHHHHHHHHHHCCCCCCCEEEECCC T ss_conf 98886999998999999999886343-------------------------69999999999998288777326860265 Q ss_pred HHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHH Q ss_conf 46532027851111468656878999999987654486997767631042379988642388368504300002148885 Q gi|254780670|r 242 AASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLH 321 (424) Q Consensus 242 AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~ 321 (424) +....++++..|+++|||.||||++++++| .+++... +.+|++|||||+|| T Consensus 196 --------------------~~~~~~e~~~~l~~~y~i~~ie~~~~~~~~---~~~~~~~-~~v~i~GDdlf~~~----- 246 (341) T PRK08350 196 --------------------KKEMETEKVLNIVEDNNVAYIKPIGDPELF---LELIAGT-HGVFIDGEYLFRQK----- 246 (341) T ss_pred --------------------CCHHHHHHHHHHHHHCCEEEECCCCCHHHH---HHHHCCC-CCEEEECCCCEECC----- T ss_conf --------------------851049999999986999883499885789---9997088-82698778310224----- Q ss_pred HHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHHHHHHHHHH Q ss_conf 21003865223425453003899999999999879869995078887235899899981998010075611448999999 Q gi|254780670|r 322 KGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARSDRIAKYNQ 401 (424) Q Consensus 322 ~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~ER~aKyNr 401 (424) ++|++++|||+|||||||||||||++++++|+++||.+|||||||||||+||||||||++|+|||+ ||||+||||| T Consensus 247 ~~I~~~~aNaILIK~NQIGTlTETleai~lA~~~g~~~viSHRSGETEDt~IADLAVa~~~gqI~~----RseR~AKyNq 322 (341) T PRK08350 247 DILDRRYYNALSIKPINLGTLTDLYNFVNDVKSEKITPILAEAKYEPADETLPHLALGLRCPAMLI----RKDSIEKINE 322 (341) T ss_pred HHHHHCCCCEEEEECCCCEEHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHCCCCEEE----CHHHHHHHHH T ss_conf 777740377079822222129999999999998799489985789864016999999718982222----2068999999 Q ss_pred HHHHHHHHHHCCCCCCH Q ss_conf 99999983110455762 Q gi|254780670|r 402 LIRIEESLGKQAKFAGR 418 (424) Q Consensus 402 LlrIee~Lg~~a~~~g~ 418 (424) |||||||||++|+|.|. T Consensus 323 LLRIEEeLG~~ar~~~~ 339 (341) T PRK08350 323 LNRIAEELGERGRIITF 339 (341) T ss_pred HHHHHHHHCCCCCCCCC T ss_conf 99999985302111414 No 9 >pfam03952 Enolase_N Enolase, N-terminal domain. Probab=100.00 E-value=0 Score=367.63 Aligned_cols=131 Identities=68% Similarity=0.973 Sum_probs=129.0 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCC-CCCCCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 10589986886079784299999984897676634478664775467626898-65168048999999987567897598 Q gi|254780670|r 2 FINDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQE-KRYFGKGVLKAIAFVNDEIRTALLGC 80 (424) Q Consensus 2 ~I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~-~~~~G~gV~~Av~~in~~i~~~L~g~ 80 (424) +|++|++||||||||+|||||||++++|.+||++||||||||+|||+||||++ ++|+||||.+||++||+.|+|+|+|+ T Consensus 1 ~I~~i~~r~IlDSRG~PTvev~v~~~~g~~g~a~~PSGAStG~~EA~elrD~~~~~y~GkgV~~AV~~in~~i~~~Lig~ 80 (132) T pfam03952 1 TITKIKAREILDSRGNPTVEVEVTLEDGTFGRAAVPSGASTGSHEAVELRDGDKSRYGGKGVLKAVENVNEIIAPALIGM 80 (132) T ss_pred CCCEEEEEEEECCCCCEEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 92189999998799984799999989998899967777788871446771699641477458999999999999998296 Q ss_pred CHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHC Q ss_conf 8557898886465303664322123222202448998752663498488822 Q gi|254780670|r 81 DARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYL 132 (424) Q Consensus 81 ~~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl 132 (424) ++.||++||++|++||||+|||+||||+|||||||+|||+|+.+++|||+|| T Consensus 81 ~~~dQ~~iD~~LielDgT~nks~lGaNailAvSlA~akAaA~~~~~pLy~yl 132 (132) T pfam03952 81 DATDQRAIDKTLIELDGTPNKSKLGANAILAVSLAVAKAAAAALGLPLYRYL 132 (132) T ss_pred CHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHC T ss_conf 5314999999999815996401013699999999999999987399837629 No 10 >cd03316 MR_like Mandelate racemase (MR)-like subfamily of the enolase superfamily. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. Members of the MR subgroup are mandelate racemase, D-glucarate/L-idarate dehydratase (GlucD), D-altronate/D-mannonate dehydratase , D-galactonate dehydratase (GalD) , D-gluconate dehydratase (GlcD), and L-rhamnonate dehydratase (RhamD). Probab=99.79 E-value=5.5e-16 Score=138.69 Aligned_cols=305 Identities=17% Similarity=0.182 Sum_probs=206.8 Q ss_pred EEEEEEEEEEEE--------CCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 105899868860--------797842999999848976766344786647754676268986516804899999998756 Q gi|254780670|r 2 FINDIIAREVID--------SRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEI 73 (424) Q Consensus 2 ~I~~i~~r~IlD--------SRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i 73 (424) +|++|....+-- ...+.+|=|+|++++|.+|.+-+.++.+ .......+++.+ T Consensus 1 KI~~i~~~~~~~Pl~~~~g~~~~~~~vlV~i~td~G~~G~GE~~~~~~--------------------~~~~~~~i~~~~ 60 (357) T cd03316 1 KITDVETFVLRVPLPEPGGAVTWRNLVLVRVTTDDGITGWGEAYPGGR--------------------PSAVAAAIEDLL 60 (357) T ss_pred CCCEEEEEEECCCCCCCCCCEEEEEEEEEEEEECCCCEEEEEECCCCC--------------------HHHHHHHHHHHH T ss_conf 948699999637866688425567899999998999999987089877--------------------699999999999 Q ss_pred HHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCC Q ss_conf 78975988557898886465303664322123222202448998752663498488822532456475430000036645 Q gi|254780670|r 74 RTALLGCDARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIH 153 (424) Q Consensus 74 ~~~L~g~~~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~H 153 (424) +|.|+|+++.+..++-+.|...-...........|+-|+.+|++-+.++..++|||+.|++.....+|+-. +.+.. T Consensus 61 ~~~liG~~~~~~~~i~~~l~~~~~~~~~~~~~~~A~said~AlwDl~gk~~~~Pl~~LLGg~~r~~i~~y~-s~~~~--- 136 (357) T cd03316 61 APLLIGRDPLDIERLWEKLYRRLFWRGRGGVAMAAISAVDIALWDIKGKAAGVPVYKLLGGKVRDRVRVYA-SGGGY--- 136 (357) T ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCEEEEE-EECCC--- T ss_conf 99977998043999999999612103665078899999999999986546399899982886557623677-62478--- Q ss_pred CCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC---CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 56776610256514887748999899999999999875200001---367655764488684689999999999753853 Q gi|254780670|r 154 ADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYS---TNVGDEGGFSPNLKTADSALDLIANSIEKAGYN 230 (424) Q Consensus 154 A~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~---t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~ 230 (424) ..+..+..+ ..++.+ ++|+. ..+|.... ...+.++-++.+...-+.. T Consensus 137 ----------------~~~~~~~~~-------~~~~~~-~~Gf~~~K~k~g~~~~---~~~~~~~di~~v~~ir~~~--- 186 (357) T cd03316 137 ----------------DDSPEELAE-------EAKRAV-AEGFTAVKLKVGGPDS---GGEDLREDLARVRAVREAV--- 186 (357) T ss_pred ----------------CCCHHHHHH-------HHHHHH-HCCCCEEEECCCCCCC---CCCCHHHHHHHHHHHHHHH--- T ss_conf ----------------999999999-------999999-7699789853688864---4126999999999999982--- Q ss_pred CCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECC Q ss_conf 11102343110465320278511114686568789999999876544869977676310423799886423883685043 Q gi|254780670|r 231 AGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGD 310 (424) Q Consensus 231 ~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGD 310 (424) |.++.|.+|+.. -.+.++-+.+... +++|.+.++|+|+..+|+++.++|.++.+ +-|.+| T Consensus 187 -g~~~~l~vDaN~----------------~~~~~~A~~~~~~-l~~~~l~~iEqP~~~~d~~~~~~l~~~~~--~pI~~~ 246 (357) T cd03316 187 -GPDVDLMVDANG----------------RWDLAEAIRLARA-LEEYDLFWFEEPVPPDDLEGLARLRQATS--VPIAAG 246 (357) T ss_pred -CCCCEEEECCCC----------------CCCHHHHHHHHHH-HHHHHHHHHCCCCCCCCHHHHHHHHHCCC--CCEEEC T ss_conf -999569865765----------------5579999999998-86544665058989557999999986289--968817 Q ss_pred CEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHH--HHHHHHHHHCCC Q ss_conf 0000214888521003865223425453003899999999999879869995078887235--899899981998 Q gi|254780670|r 311 DLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDH--TIADLAVATNCG 383 (424) Q Consensus 311 DL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~--~iaDLAVg~~a~ 383 (424) .- +.++.-+.+.|+.++++.+-+|+..+|-+|+++..+.+|+.+|..++ .|.. ++.=. .-+|||.++... T Consensus 247 Es-~~~~~~~~~~i~~~a~di~~~d~~~~GGit~~~~i~~~A~~~gi~~~-~h~~-~~~i~~aa~~hlaaa~~n~ 318 (357) T cd03316 247 EN-LYTRWEFRDLLEAGAVDIIQPDVTKVGGITEAKKIAALAEAHGVRVA-PHGA-GGPIGLAASLHLAAALPNF 318 (357) T ss_pred CC-CCCHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCEEE-ECCC-CCHHHHHHHHHHHHHCCCC T ss_conf 88-78878888888707776376267446987999999999998699798-5589-9789999999999978998 No 11 >cd03318 MLE Muconate Lactonizing Enzyme (MLE), an homooctameric enzyme, catalyses the conversion of cis,cis-muconate (CCM) to muconolactone (ML) in the catechol branch of the beta-ketoadipate pathway. This pathway is used in soil microbes to breakdown lignin-derived aromatics, catechol and protocatechuate, to citric acid cycle intermediates. Some bacterial species are also capable of dehalogenating chloroaromatic compounds by the action of chloromuconate lactonizing enzymes (Cl-MLEs). MLEs are members of the enolase superfamily characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and that is stabilized by coordination to the essential Mg2+ ion. Probab=99.75 E-value=1.6e-14 Score=127.84 Aligned_cols=292 Identities=16% Similarity=0.203 Sum_probs=201.7 Q ss_pred EEEECCCCCEEEEEEEECCCCEEEEECCC-CCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 88607978429999998489767663447-86647754676268986516804899999998756789759885578988 Q gi|254780670|r 10 EVIDSRGSPTIEVDVCLEDGSTGRAMVPS-GASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLI 88 (424) Q Consensus 10 ~IlDSRG~PTvEvev~~~~g~~gra~~PS-GaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~i 88 (424) ..+++| +.|=|+|++++|..|.+-+.+ +. ..|.+..+..+...+++.++|.|+|+++.+...+ T Consensus 23 ~~~~~~--~~v~V~i~td~G~~G~GE~~~~~~--------------~~~~~~t~~~~~~~l~~~l~p~lig~d~~~~~~~ 86 (365) T cd03318 23 TTMHTQ--SLVLVRLTTSDGVVGIGEATTPGG--------------PAWGGESPETIKAIIDRYLAPLLIGRDATNIGAA 86 (365) T ss_pred EEEEEE--EEEEEEEEECCCCEEEEECCCCCC--------------CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 899999--999999998999989985146788--------------7637679999999999988898769994799999 Q ss_pred HHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECC Q ss_conf 86465303664322123222202448998752663498488822532456475430000036645567766102565148 Q gi|254780670|r 89 DKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQEFMIMPV 168 (424) Q Consensus 89 D~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~ 168 (424) -+.|-.+-.. ..-|+-|+.+|++-+.++..++|||+.|++....++|+-. .+++| T Consensus 87 ~~~l~~~~~~------~~~A~said~AlwDl~gK~~g~Pl~~lLGg~~r~~i~~~~--~~~~~----------------- 141 (365) T cd03318 87 MALLDRAVAG------NLFAKAAIEMALLDAQGRRLGLPVSELLGGRVRDSLPVAW--TLASG----------------- 141 (365) T ss_pred HHHHHHHCCC------CHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCCCCCCEEE--EECCC----------------- T ss_conf 9999874046------6789999999999997768399399981887677774478--83589----------------- Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHCCCC---CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHH Q ss_conf 87748999899999999999875200001---367655764488684689999999999753853111023431104653 Q gi|254780670|r 169 GAENIREAIRMGAEVFHTLKKELKSKGYS---TNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASA 245 (424) Q Consensus 169 g~~sf~eal~~~~ev~~~lk~~L~~~g~~---t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAse 245 (424) +..+.+.. ..+.+...++. ..+|. .+.++-++.+...-+.. |.++.|.+|+- T Consensus 142 ---~~~~~~~~-------~~~~~~~~g~~~~KiKig~--------~~~~~di~~v~~ir~~~----g~~~~l~vDan--- 196 (365) T cd03318 142 ---DTERDIAE-------AEEMLEAGRHRRFKLKMGA--------RPPADDLAHVEAIAKAL----GDRASVRVDVN--- 196 (365) T ss_pred ---CHHHHHHH-------HHHHHHHCCCCEEEEECCC--------CCHHHHHHHHHHHHHHC----CCCCEEEECCC--- T ss_conf ---81999999-------9999980899879996599--------99899999999999865----78968984677--- Q ss_pred HCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHH Q ss_conf 20278511114686568789999999876544869977676310423799886423883685043000021488852100 Q gi|254780670|r 246 FFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGIN 325 (424) Q Consensus 246 fy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~ 325 (424) .-.+.++-+.+... +++|.+.++|+|+..+|+++.++|+++.+ +-|.+|. .+.+++-..+.|+ T Consensus 197 -------------~~~~~~~Ai~~~~~-l~~~~~~~~EeP~~~~d~~~~~~l~~~~~--ipIa~gE-~~~~~~~~~~~i~ 259 (365) T cd03318 197 -------------QAWDESTAIRALPR-LEAAGVELIEQPVPRENLDGLARLRSRNR--VPIMADE-SVSGPADAFELAR 259 (365) T ss_pred -------------CCCCHHHHHHHHHH-HHHHCCCEEECCCCCCCHHHHHHHHHCCC--CCEEECC-CCCCHHHHHHHHH T ss_conf -------------78899999999999-77512420337999536999998862589--8889378-8799999999987 Q ss_pred CCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHH Q ss_conf 386522342545300389999999999987986999507888723589989998199801 Q gi|254780670|r 326 EEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQI 385 (424) Q Consensus 326 ~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~i 385 (424) .++++-+-+|+..+|-+|++++.+.+|+.+|+.+++.| .-|+.=.+.+.+.++...+.+ T Consensus 260 ~~a~div~~d~~~~GGit~~~kia~~A~~~gi~~~~~~-~~~~~i~~aa~~h~~aa~p~~ 318 (365) T cd03318 260 RGAADVFSLKIAKSGGLRRAQKVAAIAEAAGIALYGGT-MLESSIGTAASAHLFATLPSL 318 (365) T ss_pred HCCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCEEECC-CCCCHHHHHHHHHHHHHCCCC T ss_conf 33444235323667978999999999998699364589-877689999999999968998 No 12 >cd03325 D-galactonate_dehydratase D-galactonate dehydratase catalyses the dehydration of galactonate to 2-keto-3-deoxygalactnate (KDGal), as part of the D-galactonate nonphosphorolytic catabolic Entner-Doudoroff pathway. D-galactonate dehydratase belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=99.70 E-value=3.4e-13 Score=118.06 Aligned_cols=298 Identities=16% Similarity=0.179 Sum_probs=199.9 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 10589986886079784299999984897676634478664775467626898651680489999999875678975988 Q gi|254780670|r 2 FINDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCD 81 (424) Q Consensus 2 ~I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~ 81 (424) +|++|..--+ .. + =|=|+|.|++|.+|.+-+. ..|. +..+..++ + .+.|.|+|+| T Consensus 1 kit~~~~~~~-~~--~-~vlV~I~td~G~~G~GE~~---~~~~--------------~~~~~~~i---~-~l~~~liG~d 55 (352) T cd03325 1 KITKIETFVV-PP--R-WLFVKIETDEGVVGWGEPT---VEGK--------------ARTVEAAV---Q-ELEDYLIGKD 55 (352) T ss_pred CCEEEEEEEE-CC--C-EEEEEEEECCCCEEEEECC---CCCC--------------HHHHHHHH---H-HHHHHHCCCC T ss_conf 9128999997-89--8-8999999899998985467---7888--------------79999999---9-9999857999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCE Q ss_conf 55789888646530366432212322220244899875266349848882253245647543000003664556776610 Q gi|254780670|r 82 ARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQ 161 (424) Q Consensus 82 ~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQ 161 (424) |.+...|=+.|...-- .....+...++-|+-+|++-..++..++|||+.|+|....++|+ ...+++ T Consensus 56 p~~i~~i~~~~~~~~~-~~~~~~~~~A~saiD~ALwDl~gK~~g~Pv~~LLGG~~r~~i~~--y~~~~~----------- 121 (352) T cd03325 56 PMNIEHHWQVMYRGGF-YRGGPVLMSAISGIDQALWDIKGKVLGVPVHQLLGGQVRDRVRV--YSWIGG----------- 121 (352) T ss_pred HHHHHHHHHHHHHHCC-CCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCEEE--EEECCC----------- T ss_conf 0479999999986321-25880688899999999999988784994999727766774268--775688----------- Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC---CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHH Q ss_conf 256514887748999899999999999875200001---36765576448868468999999999975385311102343 Q gi|254780670|r 162 EFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYS---TNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIA 238 (424) Q Consensus 162 EfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~---t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~ 238 (424) .+..+..+. .++ +.++|+. ...+++.++....++.++.++.+...-+. .|.++.|. T Consensus 122 ---------~~~~~~~~~-------a~~-~~~~Gf~~~K~k~~~~~~~~~~~~~~~~di~~v~~vR~~----~G~~~~l~ 180 (352) T cd03325 122 ---------DRPSDVAEA-------ARA-RREAGFTAVKMNATEELQWIDTSKKVDAAVERVAALREA----VGPDIDIG 180 (352) T ss_pred ---------CCHHHHHHH-------HHH-HHHCCCCEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHH----HCCCCCEE T ss_conf ---------998999999-------999-996799879986777434466403199999999999998----68998199 Q ss_pred HCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHH Q ss_conf 11046532027851111468656878999999987654486997767631042379988642388368504300002148 Q gi|254780670|r 239 LDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPE 318 (424) Q Consensus 239 lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~ 318 (424) +|+- + ..+.++-+.+...| ++|.+.++|+|+..+|+++.++|.++.+ +-|.+|.-. ..+. T Consensus 181 vDan-------~---------~~~~~~A~~~~~~l-~~~~l~w~EeP~~~~d~~~~~~l~~~~~--~PIa~gE~~-~~~~ 240 (352) T cd03325 181 VDFH-------G---------RVSKPMAKDLAKEL-EPYRLLFIEEPVLPENVEALAEIAARTT--IPIATGERL-FSRW 240 (352) T ss_pred EECC-------C---------CCCHHHHHHHHHHH-HHCCCHHEECCCCCCCHHHHHHHHHHCC--CCEECCCCC-CCHH T ss_conf 9889-------9---------98999999999986-1046231127889889999999998769--999717837-8767 Q ss_pred HHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHH--HHHHHHC Q ss_conf 88521003865223425453003899999999999879869995078887235899--8999819 Q gi|254780670|r 319 RLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIA--DLAVATN 381 (424) Q Consensus 319 rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~ia--DLAVg~~ 381 (424) -.++.++.++++-+-+|+..+|-+|+++..+.+|+.+|..++ .|.. ++.=.+.| |++.++. T Consensus 241 ~~~~~i~~~a~di~~~d~~~~GGit~~~kia~~a~~~gi~~~-~h~~-~~~i~~~a~lh~~aa~p 303 (352) T cd03325 241 DFKELLEDGAVDIIQPDISHAGGITELKKIAAMAEAYDVALA-PHCP-LGPIALAASLHVDASTP 303 (352) T ss_pred HHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHCCCEEE-EECC-CCHHHHHHHHHHHHHCC T ss_conf 789999768776682177735989999999999998699098-6078-88799999999999787 No 13 >cd03322 rpsA The starvation sensing protein RpsA from E.coli and its homologs are lactonizing enzymes whose putative targets are homoserine lactone (HSL)-derivative. They are part of the mandelate racemase (MR)-like subfamily of the enolase superfamily. Enzymes of this subfamily share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and catalytic residues, a partially conserved Lys-X-Lys motif and a conserved histidine-aspartate dyad. Probab=99.67 E-value=4e-13 Score=117.52 Aligned_cols=288 Identities=19% Similarity=0.292 Sum_probs=193.5 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 10589986886079784299999984897676634478664775467626898651680489999999875678975988 Q gi|254780670|r 2 FINDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCD 81 (424) Q Consensus 2 ~I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~ 81 (424) +|+++..- .-.=|+..|=|+|+|++|.+|-+-+..+ |. -......+++.++|.|+|+| T Consensus 1 ki~~~~~~--~~~p~r~~vlVkI~Td~Gi~G~GEa~~~---~~-----------------~~~~~~~i~~~l~p~liG~d 58 (361) T cd03322 1 KITAIEVI--VTCPGRNFVTLKITTDQGVTGLGDATLN---GR-----------------ELAVKAYLREHLKPLLIGRD 58 (361) T ss_pred CCEEEEEE--EECCCCCEEEEEEEECCCCEEEEECCCC---CC-----------------HHHHHHHHHHHHHHHHCCCC T ss_conf 94378999--9889974899999989999899726578---88-----------------69999999999999976998 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCE Q ss_conf 55789888646530366432212322220244899875266349848882253245647543000003664556776610 Q gi|254780670|r 82 ARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQ 161 (424) Q Consensus 82 ~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQ 161 (424) |.+.+.|=+.|....-. ....+..+++-|+-+|++-+.++..++|||+.|+|.....+|+ ....+| T Consensus 59 p~~ie~i~~~l~~~~~~-~~g~~~~~A~saIDiALWDi~gK~~g~Pv~~LLGG~~r~~i~~--y~~~~~----------- 124 (361) T cd03322 59 ANRIEDIWQYLYRGAYW-RRGPVTMNAIAAVDMALWDIKGKAAGMPLYQLLGGKSRDGIMV--YSHASG----------- 124 (361) T ss_pred HHHHHHHHHHHHHHCCC-CCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCEEEE--EEECCC----------- T ss_conf 65899999999874234-6881788899999999999999885995999818977874789--996388----------- Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCH Q ss_conf 25651488774899989999999999987520000136765576448868468999999999975385311102343110 Q gi|254780670|r 162 EFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDC 241 (424) Q Consensus 162 EfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~ 241 (424) .+..+.++. .++.+ ++|+.. ..++. ++. .+|+.++ .|.++.|-+|+ T Consensus 125 ---------~~~~e~~~~-------a~~~~-~~Gf~~---------~K~k~----~~~-i~avR~~---~G~~~~l~vDa 170 (361) T cd03322 125 ---------RDIPELLEA-------VERHL-AQGYRA---------IRVQL----PKL-FEAVREK---FGFEFHLLHDV 170 (361) T ss_pred ---------CCHHHHHHH-------HHHHH-HCCCCE---------EEECH----HHH-HHHHHHH---HCCCCEEECCC T ss_conf ---------996999999-------99888-659887---------87452----999-9999997---29997085125 Q ss_pred HHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHH Q ss_conf 46532027851111468656878999999987654486997767631042379988642388368504300002148885 Q gi|254780670|r 242 AASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLH 321 (424) Q Consensus 242 AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~ 321 (424) +...+.++=+.+... +++|.+.++|+|+..+|+++++.+.++.+ +-|.+|.- .+...-.+ T Consensus 171 ----------------n~~~~~~~Ai~~~~~-l~~~~l~w~EeP~~~~d~~~~~~l~~~~~--iPIa~gE~-~~~~~~~~ 230 (361) T cd03322 171 ----------------HHRLTPNQAARFGKD-VEPYRLFWMEDPTPAENQEAFRLIRQHTA--TPLAVGEV-FNSIWDWQ 230 (361) T ss_pred ----------------CCCCCHHHHHHHHHH-HHHHHHHCCCCCCCCCCHHHHHHHHHHCC--CCEEECCC-CCCHHHHH T ss_conf ----------------568999999999999-87420210217988658899999997469--98882787-68999999 Q ss_pred HHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCC-CCHHHHH--HHHHHH Q ss_conf 2100386522342545300389999999999987986999507888-7235899--899981 Q gi|254780670|r 322 KGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGE-TEDHTIA--DLAVAT 380 (424) Q Consensus 322 ~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGE-TeD~~ia--DLAVg~ 380 (424) +.++.++++-+-+++...|-+|+++....+|..+|..+. .|-.++ +-=.+.| |++.++ T Consensus 231 ~~i~~~a~divq~d~~~~GGit~~~kia~lA~~~gi~~~-~H~~~~~~~i~~aa~~H~~aa~ 291 (361) T cd03322 231 NLIQERLIDYIRTTVSHAGGITPARKIADLASLYGVRTG-WHGPTDLSPVGMAAALHLDLWV 291 (361) T ss_pred HHHHHCCCCCCCCCCCEECCHHHHHHHHHHHHHCCCCEE-ECCCCCCCHHHHHHHHHHHHHC T ss_conf 999827656205566604879999999999998599374-0478874799999999999867 No 14 >cd03329 MR_like_4 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 4. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown. Probab=99.65 E-value=6.1e-13 Score=116.18 Aligned_cols=280 Identities=18% Similarity=0.207 Sum_probs=184.6 Q ss_pred CEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 42999999848976766344786647754676268986516804899999998756789759885578988864653036 Q gi|254780670|r 18 PTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMIDLDG 97 (424) Q Consensus 18 PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~lDg 97 (424) .+|=|+|.|++|.+|.+..++ -......++..++|.|+|+||.+.+.+=+.|.... T Consensus 33 ~~~lV~i~TD~Gi~G~g~~~~-----------------------~~~~~~~i~~~lap~liG~dp~~~e~~~~~m~~~~- 88 (368) T cd03329 33 KLALLTIETDEGAKGHAFGGR-----------------------PVTDPALVDRFLKKVLIGQDPLDRERLWQDLWRLQ- 88 (368) T ss_pred EEEEEEEEECCCCEEEEECCC-----------------------HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH- T ss_conf 999999996899989996687-----------------------47789999999899868998314999999999873- Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCHHHHH Q ss_conf 64322123222202448998752663498488822532456475430000036645567766102565148877489998 Q gi|254780670|r 98 TPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQEFMIMPVGAENIREAI 177 (424) Q Consensus 98 T~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf~eal 177 (424) ....-.++-|+-+|++-..++..++|||+.|+| ...++|+-. -..++.+.. +..+-++.. T Consensus 89 ----~g~~~~A~saIDiALWDl~gK~~g~Pv~~LLGG-~r~~vp~Y~--s~~~~~~~~-------------~~~~~e~~~ 148 (368) T cd03329 89 ----RGLTDRGLGLVDIALWDLAGKYLGLPVHRLLGG-YREKIPAYA--STMVGDDLE-------------GLESPEAYA 148 (368) T ss_pred ----CCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHCC-CCCCCCEEE--EEECCCCCC-------------CCCCHHHHH T ss_conf ----574178999999999999998849979998389-877631268--871366534-------------458999999 Q ss_pred HHHHHHHHHHHHHHHHCCCCC-CCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECC Q ss_conf 999999999998752000013-6765576448868468999999999975385311102343110465320278511114 Q gi|254780670|r 178 RMGAEVFHTLKKELKSKGYST-NVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKG 256 (424) Q Consensus 178 ~~~~ev~~~lk~~L~~~g~~t-~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~ 256 (424) + ..+.+.++|+.. .+.--++ + +.++-++.+ .++.++ .|.|+.|.+|+- T Consensus 149 ~--------~a~~~~~~Gf~~~K~k~~~~--~---~~~~d~~~v-~avRea---vG~d~~l~vDan-------------- 197 (368) T cd03329 149 D--------FAEECKALGYRAIKLHPWGP--G---VVRRDLKAC-LAVREA---VGPDMRLMHDGA-------------- 197 (368) T ss_pred H--------HHHHHHHCCCCEEEECCCCC--C---CHHHHHHHH-HHHHHH---HCCCEEEEECCC-------------- T ss_conf 9--------99999976998998658989--7---689999999-999997---389708987685-------------- Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECC Q ss_conf 68656878999999987654486997767631042379988642388368504300002148885210038652234254 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKP 336 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~ 336 (424) ...+.++-+.+.. .+++|.+..+|||+..+|++++.+|..++. +-|.++.-..+.....+.-++.+++.-+-+++ T Consensus 198 --~~~~~~~Ai~~~~-~le~~~~~w~EeP~~~~d~~~~~~l~~~~~--ipIa~gE~~~~~~~~~~~~i~~~avdivq~d~ 272 (368) T cd03329 198 --HWYSRADALRLGR-ALEELGFFWYEDPLREASISSYRWLAEKLD--IPILGTEHSRGALESRADWVLAGATDFLRADV 272 (368) T ss_pred --CCCCHHHHHHHHH-HHCCCCEEEEECCCCCCCHHHHHHHHHHCC--CCEECCCCCCCHHHHHHHHHHCCCCCEECCCC T ss_conf --4554788999876-433465157517998447999999996099--99996898776289999998759741672365 Q ss_pred CCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCC Q ss_conf 5300389999999999987986999507888723589989998199 Q gi|254780670|r 337 NQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNC 382 (424) Q Consensus 337 NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a 382 (424) ..+|-+|+++....+|+.+|..+. .|-.+ ..-.||+.++.. T Consensus 273 ~~~GGit~~~kia~lA~~~gi~v~-~H~~~----~a~lhl~aa~~n 313 (368) T cd03329 273 NLVGGITGAMKTAHLAEAFGLDVE-LHGNG----AANLHVIAAIRN 313 (368) T ss_pred CCCCCHHHHHHHHHHHHHCCCEEE-ECCHH----HHHHHHHHHCCC T ss_conf 414739999999999998699070-44718----999999996646 No 15 >cd03327 MR_like_2 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 2. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown. Probab=99.62 E-value=3.6e-12 Score=110.46 Aligned_cols=288 Identities=20% Similarity=0.280 Sum_probs=189.6 Q ss_pred EEEEEEEEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCC Q ss_conf 10589986886079784299999984897676634478664775467626898651680489999999875678975988 Q gi|254780670|r 2 FINDIIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCD 81 (424) Q Consensus 2 ~I~~i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~ 81 (424) +|+.++-|. ..|=|+|.|++|.+|.+-+..+ ..++..+++.++|.|+|+| T Consensus 1 ~~~~~~t~~-------~~vlVrI~Td~Gi~G~GE~~~~-----------------------~~~~~~i~~~l~p~liG~d 50 (341) T cd03327 1 KIKSVRTRV-------GWLFVEIETDDGTVGYANTTGG-----------------------PVACWIVDQHLARFLIGKD 50 (341) T ss_pred CCEEEEECC-------CEEEEEEEECCCCEEEEECCCC-----------------------HHHHHHHHHHHHHHHCCCC T ss_conf 914998538-------8899999989999899941694-----------------------8999999999999966998 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCE Q ss_conf 55789888646530366432212322220244899875266349848882253245647543000003664556776610 Q gi|254780670|r 82 ARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQ 161 (424) Q Consensus 82 ~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQ 161 (424) |.+...+=+.|....-.-....++..++-|+-+|++-+.++..++|||+.|+|....++|+=. +++ T Consensus 51 p~~~e~l~~~~~~~~~~~g~~g~~~~A~saiDiAlwDl~gK~~g~Pv~~LLGG~~r~~i~~Ya----s~~---------- 116 (341) T cd03327 51 PSDIEKLWDQMYRATLAYGRKGIAMAAISAVDLALWDLLGKIRGEPVYKLLGGRTRDKIPAYA----SGL---------- 116 (341) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCEEEEE----ECC---------- T ss_conf 213999999999843330676278899999999999999988599799993798767602678----527---------- Q ss_pred EEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC---CCCCCCCCCCC--CCCHHHHHHHHHHHHHHCCCCCCCCHH Q ss_conf 2565148877489998999999999998752000013---67655764488--684689999999999753853111023 Q gi|254780670|r 162 EFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYST---NVGDEGGFSPN--LKTADSALDLIANSIEKAGYNAGKDLL 236 (424) Q Consensus 162 EfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t---~vgDEGGfap~--~~~~eeaL~ll~eAi~~aGy~~g~di~ 236 (424) ...+..+..+. .++ +.++|+.. .+| +.|. -.+.++.++.+. ++.++ .|.++. T Consensus 117 -------~~~~~~~~~~~-------a~~-~~~~Gf~~~Kik~g----~~p~~~~~~~~~d~~~v~-avr~~---~G~~~~ 173 (341) T cd03327 117 -------YPTDLDELPDE-------AKE-YLKEGYRGMKMRFG----YGPSDGHAGLRKNVELVR-AIREA---VGYDVD 173 (341) T ss_pred -------CCCCHHHHHHH-------HHH-HHHCCCCEEEECCC----CCCCCCHHHHHHHHHHHH-HHHHH---HCCCCE T ss_conf -------89999999999-------999-99839987997157----788750445999999999-99998---399966 Q ss_pred HHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECC Q ss_conf 43110465320278511114686568789999999876544869977676310423799886423883685043000021 Q gi|254780670|r 237 IALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTN 316 (424) Q Consensus 237 i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn 316 (424) |.+|+-. ..+.++-+.+... +++|.+..+|+|+..+|+++.+++.++.+ +-|.+|.- .++ T Consensus 174 l~vD~n~----------------~~~~~~A~~~~~~-l~~~~~~w~EeP~~~~d~~~~~~l~~~~~--ipIa~gE~-~~~ 233 (341) T cd03327 174 LMLDCYM----------------SWNLNYAIKMARA-LEKYELRWIEEPLIPDDIEGYAELKKATG--IPISTGEH-EYT 233 (341) T ss_pred EECCCCC----------------CCCHHHHHHHHHH-HHHCCCEEEECCCCCCCHHHHHHHHHCCC--CCEEECCC-CCC T ss_conf 9703555----------------8999999999776-41046013646899678999999873099--98980988-889 Q ss_pred HHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHC Q ss_conf 48885210038652234254530038999999999998798699950788872358998999819 Q gi|254780670|r 317 PERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATN 381 (424) Q Consensus 317 ~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~ 381 (424) ....++-++.++++-+-+++..+|=+|+++....+|..+|..++ .|-++ ..-.|++.+.. T Consensus 234 ~~~~~~~i~~~a~di~~~d~~~~GGit~~~~ia~~A~~~gi~v~-pH~~~----~~~~h~~aa~~ 293 (341) T cd03327 234 VYGFKRLLEGRAVDILQPDVNWVGGITELKKIAALAEAYGVPVV-PHASQ----IYNYHFIMSEP 293 (341) T ss_pred HHHHHHHHHHHCCCEEEECCCEECCHHHHHHHHHHHHHCCCCEE-ECCHH----HHHHHHHHHHH T ss_conf 99999999861889798243531869999999999998599185-11559----99999999714 No 16 >cd03328 MR_like_3 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 3. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown. Probab=99.61 E-value=2.3e-12 Score=111.95 Aligned_cols=277 Identities=17% Similarity=0.214 Sum_probs=187.3 Q ss_pred CCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 97842999999848976766344786647754676268986516804899999998756789759885578988864653 Q gi|254780670|r 15 RGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMID 94 (424) Q Consensus 15 RG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~ 94 (424) ...+.|-|+|.+ +|.+|.+-+..+ .++ ...|++.++|.|+|.|+.+...+=+.|.. T Consensus 26 ~~~~~v~V~v~t-dG~~G~G~~~~~--------------------~~~---~~~i~~~lap~liG~d~~~~e~l~~~m~~ 81 (352) T cd03328 26 DATTLVLVEVRA-GGRTGLGYTYAD--------------------AAA---AALVDGLLAPVVEGRDALDPPAAWEAMQR 81 (352) T ss_pred EEEEEEEEEEEE-CCCEEEEEECCH--------------------HHH---HHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 303899999994-897899997781--------------------679---99999988899679998899999999999 Q ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCHH Q ss_conf 03664322123222202448998752663498488822532456475430000036645567766102565148877489 Q gi|254780670|r 95 LDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQEFMIMPVGAENIR 174 (424) Q Consensus 95 lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf~ 174 (424) ..-......+...|+-|+-||++-+.++..++|||+.|++. ...+| ... +||-. ..+.. T Consensus 82 ~~~~~g~~g~~~~A~saIDiALWDl~gK~~g~Pv~~LLG~~-rd~i~--~y~--s~~~~----------------~~~~~ 140 (352) T cd03328 82 AVRNAGRPGVAAMAISAVDIALWDLKARLLGLPLARLLGRA-HDSVP--VYG--SGGFT----------------SYDDD 140 (352) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHCCC-CCCEE--EEE--ECCCC----------------CCCHH T ss_conf 87752885088999999999999999888498199984799-87447--875--26867----------------89989 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCC---CCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCC Q ss_conf 998999999999998752000013---67655764488684689999999999753853111023431104653202785 Q gi|254780670|r 175 EAIRMGAEVFHTLKKELKSKGYST---NVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEK 251 (424) Q Consensus 175 eal~~~~ev~~~lk~~L~~~g~~t---~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~k 251 (424) +..+ .+...+ ++|+.. .+|. +.++-++.+ +++.++ .|.++.|.+|+ T Consensus 141 ~~~~-------~~~~~~-~~G~~~~K~k~g~---------~~~~d~~~v-~avR~~---~G~~~~l~vDa---------- 189 (352) T cd03328 141 RLRE-------QLSGWV-AQGIPRVKMKIGR---------DPRRDPDRV-AAARRA---IGPDAELFVDA---------- 189 (352) T ss_pred HHHH-------HHHHHH-HCCCCEEECCCCC---------CHHHHHHHH-HHHHHH---HCCCCEEEECC---------- T ss_conf 9999-------999999-7599866034799---------978889999-999997---48895398678---------- Q ss_pred EEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCE Q ss_conf 11114686568789999999876544869977676310423799886423883685043000021488852100386522 Q gi|254780670|r 252 YILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANA 331 (424) Q Consensus 252 Y~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~Na 331 (424) +.-.+.++-+.+... +++|.+..+|||+..+|+++.+++.++.+.++.|.++.- ...+.-.++.|+.++++- T Consensus 190 ------n~~~~~~~Ai~~~~~-l~~~~l~w~EeP~~~~d~~~~~~lr~~~~~~~pia~gE~-~~~~~~~~~~i~~~a~Di 261 (352) T cd03328 190 ------NGAYSRKQALALARA-FADEGVTWFEEPVSSDDLAGLRLVRERGPAGMDIAAGEY-AYTLAYFRRLLEAHAVDV 261 (352) T ss_pred ------CCCCCHHHHHHHHHH-HHHHCCCEEECCCCCCCHHHHHHHHHHCCCCCCEECCCC-CCCHHHHHHHHHCCCCCE T ss_conf ------768899999999998-423063738789996689999999983799976414866-589899999998178785 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHH Q ss_conf 3425453003899999999999879869995078887235899899981 Q gi|254780670|r 332 ILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVAT 380 (424) Q Consensus 332 iliK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~ 380 (424) +-+++..+|=+|++++.+.+|..+|..+. .|.+. ..-+||+.++ T Consensus 262 ~~~d~~~~GGit~~~kia~lA~a~gv~~~-~H~~~----~~~~hl~aa~ 305 (352) T cd03328 262 LQADVTRCGGVTGFLQAAALAAAHHVDLS-AHCAP----ALHAHVACAV 305 (352) T ss_pred EEECCCEECCHHHHHHHHHHHHHCCCEEC-CCCHH----HHHHHHHHHC T ss_conf 84041100869999999999998499045-35489----9999999868 No 17 >cd03321 mandelate_racemase Mandelate racemase (MR) catalyzes the Mg2+-dependent 1,1-proton transfer reaction that interconverts the enantiomers of mandelic acid. MR is the first enzyme in the bacterial pathway that converts mandelic acid to benzoic acid and allows this pathway to utilize either enantiomer of mandelate. MR belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=99.61 E-value=1.5e-12 Score=113.21 Aligned_cols=283 Identities=24% Similarity=0.338 Sum_probs=186.9 Q ss_pred CEEEEEEEEEEE--------ECCC----CCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHH Q ss_conf 910589986886--------0797----8429999998489767663447866477546762689865168048999999 Q gi|254780670|r 1 MFINDIIAREVI--------DSRG----SPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAF 68 (424) Q Consensus 1 m~I~~i~~r~Il--------DSRG----~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~ 68 (424) |+|++|..+.+- .|+| .++|=|+|+|++|..|.+-+.+ |.+. ...++.. T Consensus 1 MkI~~Ie~~~~~iPl~~P~~~s~g~~~~~~~viV~v~td~G~~G~ge~~~------------------~~~~-~~~~~~~ 61 (355) T cd03321 1 VLITGLRARAVNVPMQYPVHTSVGTVATAPLVLIDLATDEGVTGHSYLFT------------------YTPA-ALKSLKQ 61 (355) T ss_pred CEEEEEEEEEEECCCCCCEECCCEEEEEEEEEEEEEEECCCCEEEEEECC------------------CCCC-HHHHHHH T ss_conf 98969999999836888868978799999999999998999989994308------------------7732-4999999 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEEC Q ss_conf 98756789759885578988864653036643221232222024489987526634984888225324564754300000 Q gi|254780670|r 69 VNDEIRTALLGCDARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNIL 148 (424) Q Consensus 69 in~~i~~~L~g~~~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~Nii 148 (424) +-+.++|.|+|.+.... .+.+.|...-.......+...++-|+.+|++-+.++..++|||+.|+|... |+|....+ T Consensus 62 ~~~~l~~~lig~~~~~~-~~~~~~~~~~~~~~~~g~~~~A~aaid~AlwDl~gK~~~~Pv~~LLGg~~~---~v~~~~s~ 137 (355) T cd03321 62 LLDDMAALLVGEPLAPA-ELERALAKRFRLLGYTGLVRMAAAGIDMAAWDALAKVHGLPLAKLLGGNPR---PVQAYDSH 137 (355) T ss_pred HHHHHHHHHCCCCCCHH-HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCC---CCEEEECC T ss_conf 99999999779989849-999999997665033327889999999983569998849939999089987---50256236 Q ss_pred CCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC---CCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 3664556776610256514887748999899999999999875200001---3676557644886846899999999997 Q gi|254780670|r 149 NGGIHADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYS---TNVGDEGGFSPNLKTADSALDLIANSIE 225 (424) Q Consensus 149 nGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~---t~vgDEGGfap~~~~~eeaL~ll~eAi~ 225 (424) + ..+-.+.. .++ ++.+ ++|+. ..+| .|+ .++-++.+ +++. T Consensus 138 ~--------------------~~~~~~~~---~~a----~~~~-~~Gf~~~KiKvG-----~~~---~~~D~~~v-~~ir 180 (355) T cd03321 138 G--------------------LDGAKLAT---ERA----VTAA-EEGFHAVKTKIG-----YPT---ADEDLAVV-RSIR 180 (355) T ss_pred C--------------------CCCHHHHH---HHH----HHHH-HCCCCEEEEECC-----CCC---HHHHHHHH-HHHH T ss_conf 7--------------------89999999---999----9999-759998999799-----999---89999999-9999 Q ss_pred HCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCE Q ss_conf 53853111023431104653202785111146865687899999998765448699776763104237998864238836 Q gi|254780670|r 226 KAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNC 305 (424) Q Consensus 226 ~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~ 305 (424) ++ .|.++.|.+|+ +.-.|.++-+.+.. .+++|.+.++|+|+..+|+++...|.++.+ + T Consensus 181 ~~---~g~~~~l~vDa----------------N~~~~~~~A~~~~~-~l~~~~~~~~EqP~~~~d~~~~~~l~~~~~--~ 238 (355) T cd03321 181 QA---VGDGVGLMVDY----------------NQSLTVPEAIERGQ-ALDQEGLTWIEEPTLQHDYEGHARIASALR--T 238 (355) T ss_pred HH---HCCCCEEEECC----------------CCCCCHHHHHHHHH-HHHHCCCCEEECCCCCCCHHHHHHHHHCCC--C T ss_conf 97---19996597357----------------56768899999999-752305775537899889999999985489--9 Q ss_pred EEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCC Q ss_conf 85043000021488852100386522342545300389999999999987986999507888 Q gi|254780670|r 306 QLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGE 367 (424) Q Consensus 306 ~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGE 367 (424) -|.+|.- +..+.-....++.++++.+-+|+...|=+|++++.+.+|+.+|..+. +|...+ T Consensus 239 pI~~~E~-~~~~~~~~~~i~~~a~d~~~~d~~~~GGit~~~kia~~A~~~gi~v~-~h~~~~ 298 (355) T cd03321 239 PVQMGEN-WLGPEEMFKALSAGACDLVMPDLMKIGGVTGWLRASALAEQAGIPMS-SHLFQE 298 (355) T ss_pred CEEECCC-CCCHHHHHHHHHCCCCCCCCCCCCEECCHHHHHHHHHHHHHCCCEEE-CCCHHH T ss_conf 9993788-89999999999849996103674520779999999999998599671-287899 No 18 >cd03319 L-Ala-DL-Glu_epimerase L-Ala-D/L-Glu epimerase catalyzes the epimerization of L-Ala-D/L-Glu and other dipeptides. The genomic context and the substrate specificity of characterized members of this family from E.coli and B.subtilis indicates a possible role in the metabolism of the murein peptide of peptidoglycan, of which L-Ala-D-Glu is a component. L-Ala-D/L-Glu epimerase is a member of the enolase-superfamily, which is characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=99.61 E-value=3.4e-12 Score=110.65 Aligned_cols=286 Identities=23% Similarity=0.319 Sum_probs=188.5 Q ss_pred EEEEEECCCCCEEEEEEEECCCCEEEEEC-CCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 86886079784299999984897676634-47866477546762689865168048999999987567897598855789 Q gi|254780670|r 8 AREVIDSRGSPTIEVDVCLEDGSTGRAMV-PSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQL 86 (424) Q Consensus 8 ~r~IlDSRG~PTvEvev~~~~g~~gra~~-PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~ 86 (424) ++..+++| .++-|+|.+ +|.+|.+-+ |.. .|.|.....+...+++ +.|.|+|.++.... T Consensus 18 s~g~~~~r--~~~lV~v~~-dG~~G~GE~~p~~----------------~~~get~~~~~~~l~~-~~~~l~g~~~~~~~ 77 (316) T cd03319 18 ARGSRTEA--ENVIVEIEL-DGITGYGEAAPTP----------------RVTGETVESVLAALKS-VRPALIGGDPRLEK 77 (316) T ss_pred CCEEEEEE--EEEEEEEEE-CCEEEEEECCCCC----------------CCCCCCHHHHHHHHHH-HHHHHCCCCCCHHH T ss_conf 69999997--899999999-9989999725788----------------8662589999999999-78986589977499 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEE Q ss_conf 88864653036643221232222024489987526634984888225324564754300000366455677661025651 Q gi|254780670|r 87 LIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQEFMIM 166 (424) Q Consensus 87 ~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIi 166 (424) ........+.+ +.-+.-|+.||++-+.++..++|||+.+++.....+|. ...+ T Consensus 78 ~~~~~~~~~~~-------~~~a~said~AlwDl~gk~~~~pl~~llGg~~~~~i~~--~~~~------------------ 130 (316) T cd03319 78 LLEALQELLPG-------NGAARAAVDIALWDLEAKLLGLPLYQLWGGGAPRPLET--DYTI------------------ 130 (316) T ss_pred HHHHHHHHCCC-------CHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCEE--EEEC------------------ T ss_conf 99999863568-------77999999999999976064997789637777877404--6506------------------ Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCC---CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHH Q ss_conf 4887748999899999999999875200001---3676557644886846899999999997538531110234311046 Q gi|254780670|r 167 PVGAENIREAIRMGAEVFHTLKKELKSKGYS---TNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAA 243 (424) Q Consensus 167 P~g~~sf~eal~~~~ev~~~lk~~L~~~g~~---t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AA 243 (424) +..+-.+.. ..+.+.+ ++|+. ..+|. +.++-++.+ +++.++ + | |+.|.+|+ T Consensus 131 --~~~~~e~~~-------~~~~~~~-~~G~~~~KiKvg~---------~~~~d~~~v-~~vr~~-~--~-~~~l~vDa-- 184 (316) T cd03319 131 --SIDTPEAMA-------AAAKKAA-KRGFPLLKIKLGG---------DLEDDIERI-RAIREA-A--P-DARLRVDA-- 184 (316) T ss_pred --CCCCHHHHH-------HHHHHHH-HCCCCEEEEECCC---------CHHHHHHHH-HHHHHH-C--C-CCEEEEEC-- T ss_conf --899999999-------9999999-7598768632489---------979999999-999966-8--9-96299846-- Q ss_pred HHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHH Q ss_conf 53202785111146865687899999998765448699776763104237998864238836850430000214888521 Q gi|254780670|r 244 SAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKG 323 (424) Q Consensus 244 sefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~g 323 (424) + .-.+.++-+.+... +++|+|.+||+|+..+||+++.+|.++.+ +-|..|.-.. .+.-+++. T Consensus 185 -----N---------~~~~~~~A~~~~~~-l~~~~i~~iEqP~~~~d~~~~~~l~~~~~--~pIa~dEs~~-~~~d~~~~ 246 (316) T cd03319 185 -----N---------QGWTPEEAVELLRE-LAELGVELIEQPVPAGDDDGLAYLRDKSP--LPIMADESCF-SAADAARL 246 (316) T ss_pred -----C---------CCCCHHHHHHHHHH-HHHHHHHHHCCCCCCCCHHHHHHHHHHCC--CCEEECCCCC-CHHHHHHH T ss_conf -----8---------88899999999997-52443444308989999999999999689--9999358879-99999999 Q ss_pred HHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHH--HHHHHHHHCCCHHE Q ss_conf 0038652234254530038999999999998798699950788872358--99899981998010 Q gi|254780670|r 324 INEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHT--IADLAVATNCGQIK 386 (424) Q Consensus 324 i~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~--iaDLAVg~~a~~iK 386 (424) ++.+++..+-+|+...|-+|++.+.+.+|+.+|+.+++++-. ||.=.. .+|||-++ +.|.- T Consensus 247 ~~~~a~d~v~~k~~~~GGit~~~~ia~~A~~~gi~~~~g~~~-es~i~~aA~~hlaaa~-~~f~d 309 (316) T cd03319 247 AGGGAYDGINIKLMKTGGLTEALRIADLARAAGLKVMVGCMV-ESSLSIAAAAHLAAAK-ADFVD 309 (316) T ss_pred HHCCCCCEEEECCHHHCCHHHHHHHHHHHHHCCCEEEECCCC-CCHHHHHHHHHHHHCC-CCEEE T ss_conf 976998869845144089899999999999869949981885-4699999999998308-99896 No 19 >cd03317 NAAAR N-acylamino acid racemase (NAAAR), an octameric enzyme that catalyzes the racemization of N-acylamino acids. NAAARs act on a broad range of N-acylamino acids rather than amino acids. Enantiopure amino acids are of industrial interest as chiral building blocks for antibiotics, herbicides, and drugs. NAAAR is a member of the enolase superfamily, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=99.59 E-value=7.7e-13 Score=115.45 Aligned_cols=287 Identities=20% Similarity=0.293 Sum_probs=194.5 Q ss_pred EEEEEECCCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 86886079784299999984897676634478664775467626898651680489999999875678975988557898 Q gi|254780670|r 8 AREVIDSRGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLL 87 (424) Q Consensus 8 ~r~IlDSRG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~ 87 (424) +...+++| ++|=|+|++++|.+|.+-+++.. .+.|.+..+..++..+++.++|.|+|+++.+... T Consensus 17 s~~~~~~~--~~vlv~I~td~G~~G~GE~~~~~-------------~p~~~~e~~~~~~~~l~~~~~p~l~g~~~~~~~~ 81 (354) T cd03317 17 SFGTLNER--EFLIVELTDEEGITGYGEVVAFE-------------GPFYTEETNATAWHILKDYLLPLLLGREFSHPEE 81 (354) T ss_pred CCEEEEEE--EEEEEEEEECCCCEEEEECCCCC-------------CCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHH T ss_conf 79999999--99999999899999998325788-------------8875704499999999999899976999899999 Q ss_pred HHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEEC Q ss_conf 88646530366432212322220244899875266349848882253245647543000003664556776610256514 Q gi|254780670|r 88 IDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQEFMIMP 167 (424) Q Consensus 88 iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP 167 (424) +-+.+..+-|.+ -++-|+-||++-..++..++|||+.|++.. ..+|+-.. .| T Consensus 82 ~~~~~~~~~g~~-------~A~said~ALwDl~gK~~~~Pv~~LLGg~~-~~i~~~~s----~~---------------- 133 (354) T cd03317 82 VSERLAPIKGNN-------MAKAGLEMAVWDLYAKAQGQSLAQYLGGTR-DSIPVGVS----IG---------------- 133 (354) T ss_pred HHHHHHHCCCCH-------HHHHHHHHHHHHHHHHHCCCCHHHHHCCCC-CEEEEEEE----CC---------------- T ss_conf 999987544857-------999999999999999883996799808987-60576687----16---------------- Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCCCC---CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHH Q ss_conf 887748999899999999999875200001---36765576448868468999999999975385311102343110465 Q gi|254780670|r 168 VGAENIREAIRMGAEVFHTLKKELKSKGYS---TNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAAS 244 (424) Q Consensus 168 ~g~~sf~eal~~~~ev~~~lk~~L~~~g~~---t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAs 244 (424) . ..+..+.++. +++. .++|+. ..++ |.. + .+.+ +++.++- | |+.|.+|+ T Consensus 134 ~-~~~~e~~~~~-------~~~~-~~~Gf~~~Kikv~------~~~-d-i~~v----~~ir~~~---~-d~~l~vDa--- 185 (354) T cd03317 134 I-QDDVEQLLKQ-------IERY-LEEGYKRIKLKIK------PGW-D-VEPL----KAVRERF---P-DIPLMADA--- 185 (354) T ss_pred C-CCCHHHHHHH-------HHHH-HHCCCCEEEEECC------CHH-H-HHHH----HHHHHHC---C-CCEEEEEC--- T ss_conf 8-9999999999-------9999-9779876764018------512-5-9999----9999865---7-97399626--- Q ss_pred HHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHH Q ss_conf 32027851111468656878999999987654486997767631042379988642388368504300002148885210 Q gi|254780670|r 245 AFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGI 324 (424) Q Consensus 245 efy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi 324 (424) + .-.+.++. ..+.. +++|.+.++|+|+..+|+++..++.++.+ +-|.+|- .+....-+++.+ T Consensus 186 ----N---------~~~~~~~a-~~~~~-l~~~~l~~~EeP~~~~d~~~~~~l~~~~~--~pIa~~E-~~~~~~~~~~~i 247 (354) T cd03317 186 ----N---------SAYTLADI-PLLKR-LDEYGLLMIEQPLAADDLIDHAELQKLLK--TPICLDE-SIQSAEDARKAI 247 (354) T ss_pred ----C---------CCCCHHHH-HHHHH-HCCCCEEEEECCCCCCCHHHHHHHHCCCC--CCEECCC-CCCCHHHHHHHH T ss_conf ----8---------88887899-99997-33067013444687227777655641589--9888188-868878899998 Q ss_pred HCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCH Q ss_conf 038652234254530038999999999998798699950788872358998999819980 Q gi|254780670|r 325 NEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQ 384 (424) Q Consensus 325 ~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~ 384 (424) +.+++..+-+|+..+|-+|++++.+.+|+.+|..+.+++-- ||.=.+.+.++++.-..+ T Consensus 248 ~~~a~div~~d~~~~GGit~~~~ia~~A~~~gi~~~~g~~~-~s~i~~aa~~~la~~~~~ 306 (354) T cd03317 248 ELGACKIINIKPGRVGGLTEALKIHDLCQEHGIPVWCGGML-ESGIGRAHNVALASLPNF 306 (354) T ss_pred HCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCEEECCCC-CCHHHHHHHHHHHHCCCC T ss_conf 76997646545653670999999999999839978876864-139999999999848786 No 20 >COG4948 L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily [Cell envelope biogenesis, outer membrane / General function prediction only] Probab=99.58 E-value=6.2e-12 Score=108.73 Aligned_cols=298 Identities=21% Similarity=0.250 Sum_probs=188.8 Q ss_pred CEEEEEEEEEEEECCCC------------CEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHH Q ss_conf 91058998688607978------------429999998489767663447866477546762689865168048999999 Q gi|254780670|r 1 MFINDIIAREVIDSRGS------------PTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAF 68 (424) Q Consensus 1 m~I~~i~~r~IlDSRG~------------PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~ 68 (424) |.|.+|+...+.--+-. ..|-|+|.+++|..|.+-++.+.....-+ ..+. T Consensus 1 ~~i~~i~~~~~~~~~~~~~~~~~~~~~~~~~v~v~i~~d~G~~G~GE~~~~~~~~~~~-----------------~~~~- 62 (372) T COG4948 1 MKIMKITVIPVAVPLSPPFVTSGGTVRFFTRVIVEITTDDGIVGWGEAVPGGRARYGE-----------------EAEA- 62 (372) T ss_pred CCCCEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEECCCCCEEEECCCCCCCCCHH-----------------HHHH- T ss_conf 9752688998552156664557675543026999999889983574046765555125-----------------6667- Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEEC Q ss_conf 98756789759885578988864653036643221232222024489987526634984888225324564754300000 Q gi|254780670|r 69 VNDEIRTALLGCDARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNIL 148 (424) Q Consensus 69 in~~i~~~L~g~~~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~Nii 148 (424) + .+++.++|.++.+...+.+.|....... ...+.-.++-||.+|++-..|+..++|||+.|++... -.++..+.. T Consensus 63 ~--~~~~~l~g~d~~~i~~~~~~~~~~~~~~-~~~~~~~a~sAvd~ALwDl~gK~~g~Pv~~LLGg~~r--~~v~~y~~~ 137 (372) T COG4948 63 V--LLAPLLIGRDPFDIERIWQKLYRAGFAR-RGGITMAAISAVDIALWDLAGKALGVPVYKLLGGKVR--DEVRAYASG 137 (372) T ss_pred H--HHHHHHCCCCCHHHHHHHHHHHHHCCCC-CCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHCCCCC--CEEEEEEEC T ss_conf 7--7778771348424689999998743356-6625689999999999998437346799998288756--368998851 Q ss_pred CCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 36645567766102565148877489998999999999998752000013676557644886846899999999997538 Q gi|254780670|r 149 NGGIHADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAG 228 (424) Q Consensus 149 nGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aG 228 (424) .++ .+..+..... .+.+.+.|+..-.=+-| .++.+ +.++.+...=+. T Consensus 138 ~~~-------------------~~~~e~~~~~-------~~~~~~~G~~~~Klk~g--~~~~~---~d~~~v~avRea-- 184 (372) T COG4948 138 GGG-------------------EDPEEMAAEA-------ARALVELGFKALKLKVG--VGDGD---EDLERVRALREA-- 184 (372) T ss_pred CCC-------------------CCCHHHHHHH-------HHHHHHCCCCEEEECCC--CCCHH---HHHHHHHHHHHH-- T ss_conf 677-------------------8829999999-------99999779883876258--88668---899999999986-- Q ss_pred CCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEE Q ss_conf 53111023431104653202785111146865687899999998765448699776763104237998864238836850 Q gi|254780670|r 229 YNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLV 308 (424) Q Consensus 229 y~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~Iv 308 (424) .|.++.|.+|+ ++ -.|.++=+.+...| ++|.|.+||||+..+|.++..++++... +-|. T Consensus 185 --~g~~~~l~iDa-------n~---------~~~~~~A~~~~~~l-~~~~l~~iEeP~~~~d~~~~~~l~~~~~--~PIa 243 (372) T COG4948 185 --VGDDVRLMVDA-------NG---------GWTLEEAIRLARAL-EEYGLEWIEEPLPPDDLEGLRELRAATS--TPIA 243 (372) T ss_pred --CCCCCEEEEEC-------CC---------CCCHHHHHHHHHHH-HHCCCCEEECCCCCCCHHHHHHHHCCCC--CCEE T ss_conf --59998899968-------98---------86899999999986-4349867988999888899999860689--9887 Q ss_pred CCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEE-ECCCCCCCHHHHH--HHHHHH Q ss_conf 43000021488852100386522342545300389999999999987986999-5078887235899--899981 Q gi|254780670|r 309 GDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSII-SHRSGETEDHTIA--DLAVAT 380 (424) Q Consensus 309 GDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~iv-ShRSGETeD~~ia--DLAVg~ 380 (424) .|.- +.+..-.++-++.++++.+-+|+.-+|.+||++....+|+ ++.+.+ +|.. +.=...+ ||+... T Consensus 244 ~gEs-~~~~~~~~~l~~~~a~div~~d~~~~GGite~~kia~~A~--~~~~~v~~h~~--~~i~~aa~~hla~~~ 313 (372) T COG4948 244 AGES-VYTRWDFRRLLEAGAVDIVQPDLARVGGITEALKIAALAE--GFGVMVGPHVE--GPISLAAALHLAAAL 313 (372) T ss_pred ECCC-CCCHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHHHHH--HCCCEEEECCC--CHHHHHHHHHHCCCC T ss_conf 2874-6799999999971898789037413378889999999998--67965995277--279999999853254 No 21 >cd00308 enolase_like Enolase-superfamily, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Enolase superfamily contains different enzymes, like enolases, glutarate-, fucanate- and galactonate dehydratases, o-succinylbenzoate synthase, N-acylamino acid racemase, L-alanine-DL-glutamate epimerase, mandelate racemase, muconate lactonizing enzyme and 3-methylaspartase. Probab=99.56 E-value=3e-13 Score=118.49 Aligned_cols=200 Identities=22% Similarity=0.347 Sum_probs=156.2 Q ss_pred CCCCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 97842999999848976766344786647754676268986516804899999998756789759885578988864653 Q gi|254780670|r 15 RGSPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMID 94 (424) Q Consensus 15 RG~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~ 94 (424) ..+|+|-|.|.|++|.+|.+- T Consensus 22 ~~~~~~~V~l~td~G~~G~Ge----------------------------------------------------------- 42 (229) T cd00308 22 DTNDTVLVKLTTDSGVVGWGE----------------------------------------------------------- 42 (229) T ss_pred EECEEEEEEEEECCCCEEEEH----------------------------------------------------------- T ss_conf 822179999998999967426----------------------------------------------------------- Q ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCHH Q ss_conf 03664322123222202448998752663498488822532456475430000036645567766102565148877489 Q gi|254780670|r 95 LDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQEFMIMPVGAENIR 174 (424) Q Consensus 95 lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf~ 174 (424) ++-|+.+|++-+.++..++|||++|+|.....+|+= T Consensus 43 -------------a~said~AlwDl~gK~~~~Pv~~llgG~~~~~v~~y------------------------------- 78 (229) T cd00308 43 -------------VISGIDMALWDLAAKALGVPLAELLGGGSRDRVPAY------------------------------- 78 (229) T ss_pred -------------HHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCC------------------------------- T ss_conf -------------999999999989798859969998088445777550------------------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEE Q ss_conf 99899999999999875200001367655764488684689999999999753853111023431104653202785111 Q gi|254780670|r 175 EAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYIL 254 (424) Q Consensus 175 eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~ 254 (424) .. .+.++.+.++ .|.++.|.+|+- T Consensus 79 ----------~s---------------------------~~~v~~ir~~-------~G~~~~l~vDaN------------ 102 (229) T cd00308 79 ----------GS---------------------------IERVRAVREA-------FGPDARLAVDAN------------ 102 (229) T ss_pred ----------HH---------------------------HHHHHHHHHH-------CCCCCEEEEECC------------ T ss_conf ----------69---------------------------9999999996-------699986999899------------ Q ss_pred CCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEE Q ss_conf 14686568789999999876544869977676310423799886423883685043000021488852100386522342 Q gi|254780670|r 255 KGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILI 334 (424) Q Consensus 255 ~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~Naili 334 (424) ..++.++-+.+...| ++|++.++|||+..+||+++++|.++.+ +.|.+|.- +.++...++.|+.++++.+-+ T Consensus 103 ----~~~~~~~A~~~~~~l-~~~~l~wiEeP~~~~d~~~~~~l~~~~~--ipIa~gE~-~~~~~~~~~~i~~~a~di~~~ 174 (229) T cd00308 103 ----GAWTPKEAIRLIRAL-EKYGLAWIEEPCAPDDLEGYAALRRRTG--IPIAADES-VTTVDDALEALELGAVDILQI 174 (229) T ss_pred ----CCCCHHHHHHHHHHH-HHCCCCEEECCCCCCCHHHHHHHHCCCC--CCEECCCC-CCCHHHHHHHHHHHHHHCCCC T ss_conf ----999999999999985-2139837646776627589999961369--98637876-789999999999742210445 Q ss_pred CCCCHHHHHHHHHHHHHHHHCCCEEEEECCCC-CCCHHHHHHHHHHHC Q ss_conf 54530038999999999998798699950788-872358998999819 Q gi|254780670|r 335 KPNQIGSVSETLNTIEKAQMAGYPSIISHRSG-ETEDHTIADLAVATN 381 (424) Q Consensus 335 K~NQiGTvset~ea~~~a~~~g~~~ivShRSG-ETeD~~iaDLAVg~~ 381 (424) |+..+|-+|++++.+.+|+.+|+.+++.+..| --.....+||+.++. T Consensus 175 d~~~~GGit~~~kia~~A~~~gi~~~~h~~~~s~i~~aa~~hl~aa~p 222 (229) T cd00308 175 KPTRVGGLTESRRAADLAEAFGIRVMVHGTLESSIGTAAALHLAAALP 222 (229) T ss_pred CCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHHCC T ss_conf 766316399999999999986997997899766999999999999689 No 22 >cd03324 rTSbeta_L-fuconate_dehydratase Human rTS beta is encoded by the rTS gene which, through alternative RNA splicing, also encodes rTS alpha whose mRNA is complementary to thymidylate synthase mRNA. rTS beta expression is associated with the production of small molecules that appear to mediate the down-regulation of thymidylate synthase protein by a novel intercellular signaling mechanism. A member of this family, from Xanthomonas, has been characterized to be a L-fuconate dehydratase. rTS beta belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=99.46 E-value=6e-10 Score=94.07 Aligned_cols=317 Identities=17% Similarity=0.206 Sum_probs=184.3 Q ss_pred CEEEEEEEEEEEEC--------------CCCCEEEEEEEECC-CCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHH Q ss_conf 91058998688607--------------97842999999848-9767663447866477546762689865168048999 Q gi|254780670|r 1 MFINDIIAREVIDS--------------RGSPTIEVDVCLED-GSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKA 65 (424) Q Consensus 1 m~I~~i~~r~IlDS--------------RG~PTvEvev~~~~-g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~A 65 (424) .+|++|..|.|-=- +....+=|.|.|++ |.+|-+-+-+. |+|. .+ T Consensus 1 ~kIt~v~~~~~r~P~~~~~~~s~~~h~~~~~~~~~V~i~TD~~Gi~G~G~~ft~-------------------~~g~-~~ 60 (415) T cd03324 1 IKITALEVRDVRFPTSLELDGSDAMNPDPDYSAAYVVLRTDAAGLKGHGLTFTI-------------------GRGN-EI 60 (415) T ss_pred CEEEEEEEEEEEECCCCCCCCCCCCCCCCCCCEEEEEEEECCCCCEEEEEEECC-------------------CCCC-HH T ss_conf 969899999998058788888787568999651799999769997489999678-------------------8875-89 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHH-HHHHHHCCCCCCCCCCC------CCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCC Q ss_conf 99998756789759885578988-86465303664322123------222202448998752663498488822532456 Q gi|254780670|r 66 IAFVNDEIRTALLGCDARDQLLI-DKIMIDLDGTPNKSRLG------ANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAH 138 (424) Q Consensus 66 v~~in~~i~~~L~g~~~~dQ~~i-D~~Li~lDgT~nks~lG------aNailAvSlA~akA~A~~~~~pLy~yl~~~~~~ 138 (424) +..+-+.++|.|+|+|+.+...+ .++-..|....+-.-+| -+|+-|+-+|++-..++..++|||+.|+|.... T Consensus 61 ~~~~i~~l~p~liG~d~~~i~~~~~~~w~~l~~~~~~~w~G~~~Gv~~~AiSaIDiALWDi~GK~~g~Pvy~LLGG~~r~ 140 (415) T cd03324 61 VCAAIEALAHLVVGRDLESIVADMGKFWRRLTSDSQLRWIGPEKGVIHLATAAVVNAVWDLWAKAEGKPLWKLLVDMTPE 140 (415) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCHH T ss_conf 99999999998769988999998999999860376424306773187769999999998997746499399985377745 Q ss_pred CCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHH---HHHHHHHHHHHHHCCCCCCCCCCCC---------- Q ss_conf 475430000036645567766102565148877489998999---9999999987520000136765576---------- Q gi|254780670|r 139 ILPVPLMNILNGGIHADNALDFQEFMIMPVGAENIREAIRMG---AEVFHTLKKELKSKGYSTNVGDEGG---------- 205 (424) Q Consensus 139 ~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~---~ev~~~lk~~L~~~g~~t~vgDEGG---------- 205 (424) .+. .+ ++.. +.. ...+-.+++.+. ..-+..-...+...|+.. +....| T Consensus 141 ~i~-~~---~~~~-y~~-------------d~~~~~~~~~~~~~~~~~~~~r~~~~~~~g~~a-y~~~~gw~gy~~~~~~ 201 (415) T cd03324 141 ELV-SC---IDFR-YIT-------------DALTPEEALEILRRGQPGKAAREADLLAEGYPA-YTTSAGWLGYSDEKLR 201 (415) T ss_pred HHE-EE---ECCE-ECC-------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCE-EECCCCCCCCCHHHHH T ss_conf 640-22---0442-015-------------788878999998631004778899887559876-8636765677678899 Q ss_pred ----------CCC-CC---CCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHH Q ss_conf ----------448-86---8468999999999975385311102343110465320278511114686568789999999 Q gi|254780670|r 206 ----------FSP-NL---KTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLA 271 (424) Q Consensus 206 ----------fap-~~---~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~ 271 (424) |.. .+ .+.++-++.+ +|+.++ .|.|+.|.+|+ +...+..+=+.+-. T Consensus 202 ~~~~~~~~~Gf~~~K~kvg~~~~~d~~rv-~avRea---vG~d~~lmvD~----------------n~~~~~~~Ai~~~~ 261 (415) T cd03324 202 RLCKEALAQGFTHFKLKVGADLEDDIRRC-RLAREV---IGPDNKLMIDA----------------NQRWDVPEAIEWVK 261 (415) T ss_pred HHHHHHHHCCCCCCCCCCCCCHHHHHHHH-HHHHHH---HCCCCEEEECC----------------CCCCCHHHHHHHHH T ss_conf 99998776234501230478888989999-999998---28887497456----------------77767999999998 Q ss_pred HHHHHCCEEEEECCCCCCCHHHHHHHHHHCC-CCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHH Q ss_conf 8765448699776763104237998864238-836850430000214888521003865223425453003899999999 Q gi|254780670|r 272 NLVDQYPIYSIEDGMSEDDWHGWKILTNKIG-SNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIE 350 (424) Q Consensus 272 ~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g-~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~ 350 (424) . +++|.+..+|||+..+|++++.++.+.+. ..+-|.+..-.. +..-.++-++.+++.-+-..+..+|-+||++..+. T Consensus 262 ~-le~~~~~w~EEP~~p~d~~~~~~l~~~~~~~~iPia~GE~~~-~~~~f~~li~~~a~di~qpD~~~~GGite~~kia~ 339 (415) T cd03324 262 Q-LAEFKPWWIEEPTSPDDILGHAAIRKALAPLPIGVATGEHCQ-NRVVFKQLLQAGAIDVVQIDSCRLGGVNENLAVLL 339 (415) T ss_pred H-HHHCCCCEEECCCCCCCHHHHHHHHHHHCCCCCCEEECCCCC-CHHHHHHHHHCCCCCEECCCCCCCCCHHHHHHHHH T ss_conf 7-522497789889995577899999987356998623166658-89999999981896824357200287879999999 Q ss_pred HHHHCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 99987986999507888723589989998 Q gi|254780670|r 351 KAQMAGYPSIISHRSGETEDHTIADLAVA 379 (424) Q Consensus 351 ~a~~~g~~~ivShRSGETeD~~iaDLAVg 379 (424) +|..+|. .|..|-.|-.--..++||++. T Consensus 340 ~A~~~~v-~v~pH~g~~gl~~~~~hla~~ 367 (415) T cd03324 340 MAAKFGV-PVCPHAGGVGLCELVQHLSMI 367 (415) T ss_pred HHHHCCC-CEECCCCCHHHHHHHHHHHHH T ss_conf 9998299-162268833378999889877 No 23 >cd03326 MR_like_1 Mandelate racemase (MR)-like subfamily of the enolase superfamily, subgroup 1. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and conserved catalytic residues, a Lys-X-Lys motif and a conserved histidine-aspartate dyad. This subgroup's function is unknown. Probab=99.37 E-value=1.1e-09 Score=92.16 Aligned_cols=287 Identities=18% Similarity=0.258 Sum_probs=169.6 Q ss_pred CCCCEEEEEEEECCCCEEEEECCCC-CCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHH-----H Q ss_conf 9784299999984897676634478-664775467626898651680489999999875678975988557898-----8 Q gi|254780670|r 15 RGSPTIEVDVCLEDGSTGRAMVPSG-ASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLL-----I 88 (424) Q Consensus 15 RG~PTvEvev~~~~g~~gra~~PSG-aStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~-----i 88 (424) ...+++-|+|+|+.+..|+..+--| +|+|.+.. -..+.+.+.|.|+|.++..... + T Consensus 24 ~~~~~~lV~v~TD~~~~G~~~~G~g~~~~g~~~~------------------~~~i~~~~~p~lig~~p~~l~~~~~~~~ 85 (385) T cd03326 24 SGLTTSLVAVVTDVVRDGRPVVGYGFDSIGRYAQ------------------GGLLRERFIPRLLAAAPDSLLDDAGGNL 85 (385) T ss_pred EEEEEEEEEEEECCCCCCCEEEEEEECCCCCCCH------------------HHHHHHHHHHHHHCCCHHHHHHCCCCCC T ss_conf 4237899999974456886259972067864211------------------5788765576773379677763022225 Q ss_pred H-----HHHHHCCCCCCCCCCC--CCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCC---CCCCCCCEEECCCCCCCCCCC Q ss_conf 8-----6465303664322123--22220244899875266349848882253245---647543000003664556776 Q gi|254780670|r 89 D-----KIMIDLDGTPNKSRLG--ANAILGVSLAVSKAAAQTSNLPLYKYLGGCSA---HILPVPLMNILNGGIHADNAL 158 (424) Q Consensus 89 D-----~~Li~lDgT~nks~lG--aNailAvSlA~akA~A~~~~~pLy~yl~~~~~---~~lP~P~~NiinGG~HA~~~l 158 (424) | +.|.. .....+-| +.|+-|+-+|++-+.++..++|||+.|++... ..-.+|.. -+||-+.. T Consensus 86 d~~~~~~~~~~---~~~~g~~G~~~~A~aaiDiALWDl~gK~~g~Pl~~LLGg~~~~g~~~~~v~vY--~sgg~~~~--- 157 (385) T cd03326 86 DPARAWAAMMR---NEKPGGHGERAVAVGALDMAVWDAVAKIAGLPLYRLLARRYGRGQADPRVPVY--AAGGYYYP--- 157 (385) T ss_pred CHHHHHHHHHH---HCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHCCCCCCCCCCCEEEEE--EECCCCCC--- T ss_conf 89999999986---23578874689999999999998701533793899816434578777768899--87881788--- Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCC---CCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCH Q ss_conf 6102565148877489998999999999998752000013---6765576448868468999999999975385311102 Q gi|254780670|r 159 DFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYST---NVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDL 235 (424) Q Consensus 159 ~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t---~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di 235 (424) ..+..+. . ..+++.+ ++|+.. .+| + .+.++-++.+..+-+. .|.++ T Consensus 158 -----------~~~~~~l---~----~~~~~~~-~~G~~~~KikiG---~-----~~~~~D~~rv~avr~a----vG~~~ 206 (385) T cd03326 158 -----------GDDLGRL---R----DEMRRYL-DRGYTVVKIKIG---G-----APLDEDLRRIEAALDV----LGDGA 206 (385) T ss_pred -----------CCCHHHH---H----HHHHHHH-HCCCCEEEEECC---C-----CCHHHHHHHHHHHHHH----HCCCC T ss_conf -----------8899999---9----9999999-679998999489---9-----9889999999999997----38987 Q ss_pred HHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEEC Q ss_conf 34311046532027851111468656878999999987654486997767631042379988642388368504300002 Q gi|254780670|r 236 LIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVT 315 (424) Q Consensus 236 ~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vT 315 (424) .|.+|+- .-++..+-+.+-. .+++|.+..+|+|+..+|+++.+++.++..- -|.+|.-..| T Consensus 207 ~l~vDan----------------~~~~~~~Ai~~~~-~le~~~l~w~EeP~~~~d~~~~a~l~~~~~~--PIa~gE~~~s 267 (385) T cd03326 207 RLAVDAN----------------GRFDLETAIAYAK-ALAPYGLRWYEEPGDPLDYALQAELADHYDG--PIATGENLFS 267 (385) T ss_pred EEEEECC----------------CCCCHHHHHHHHH-HHHHCCCCEEECCCCCCCHHHHHHHHHHCCC--CEECCCCCCC T ss_conf 3886447----------------6789999999999-8553299189788995589999999963899--9981888659 Q ss_pred CHHHHHHHHHCCC----CCEEEECCCCHHHHHHHHHHHHHHHHCCCEE--EEECCCCCCCHHHHHHHHHHHCC Q ss_conf 1488852100386----5223425453003899999999999879869--99507888723589989998199 Q gi|254780670|r 316 NPERLHKGINEEV----ANAILIKPNQIGSVSETLNTIEKAQMAGYPS--IISHRSGETEDHTIADLAVATNC 382 (424) Q Consensus 316 n~~rl~~gi~~~~----~NailiK~NQiGTvset~ea~~~a~~~g~~~--ivShRSGETeD~~iaDLAVg~~a 382 (424) -. -.++-++.++ .+-+-+++...|-+|+++..+.+|..+|+.. +..|-. -..=.|++.+++. T Consensus 268 ~~-~~~~li~~ga~r~~~di~q~D~~~~GGit~~~ki~~~a~a~g~~~~~~~ph~g----~~~~~hlaaal~~ 335 (385) T cd03326 268 LQ-DARNLLRYGGMRPDRDVLQFDPGLSYGLPEYLRMLDVLEAHGWSRRRFFPHGG----HLMSLHIAAGLGL 335 (385) T ss_pred HH-HHHHHHHHCCCCCCCCEEEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCH----HHHHHHHHHHCCC T ss_conf 99-99999982775656887985753108898999999999983999652415737----8999999986687 No 24 >cd03315 MLE_like Muconate lactonizing enzyme (MLE) like subgroup of the enolase superfamily. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and residues that can function as general acid/base catalysts, a Lys-X-Lys motif and another conserved lysine. Despite these conserved residues, the members of the MLE subgroup, like muconate lactonizing enzyme, o-succinylbenzoate synthase (OSBS) and N-acylamino acid racemase (NAAAR), catalyze different reactions. Probab=99.34 E-value=1.1e-09 Score=92.15 Aligned_cols=210 Identities=19% Similarity=0.314 Sum_probs=141.8 Q ss_pred CHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 02448998752663498488822532456475430000036645567766102565148877489998999999999998 Q gi|254780670|r 110 LGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKK 189 (424) Q Consensus 110 lAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~ 189 (424) -|+.||++-..++..++|||+.+++.. .++|+ ...++ ..+..+..+. .++ T Consensus 46 said~AlwDl~gK~~~~Pv~~LlGg~~-~~i~~--~~~~~--------------------~~~~~~~~~~-------~~~ 95 (265) T cd03315 46 AAVDMALWDLWGKRLGVPVYLLLGGYR-DRVRV--AHMLG--------------------LGEPAEVAEE-------ARR 95 (265) T ss_pred HHHHHHHHHHHHHHCCCCHHHHCCCCC-CCEEE--EEECC--------------------CCCHHHHHHH-------HHH T ss_conf 999999999999981988999729999-97699--99779--------------------9999999999-------999 Q ss_pred HHHHCCCC---CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHH Q ss_conf 75200001---367655764488684689999999999753853111023431104653202785111146865687899 Q gi|254780670|r 190 ELKSKGYS---TNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEM 266 (424) Q Consensus 190 ~L~~~g~~---t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~el 266 (424) .+ ++|+. ..+|. +.++-++.+...-+. .|.++.|.+|+- .-.|.++- T Consensus 96 ~~-~~G~~~~Kikvg~---------~~~~d~~~v~~ir~~----~g~~~~l~vDan----------------~~~~~~~A 145 (265) T cd03315 96 AL-EAGFRTFKLKVGR---------DPARDVAVVAALREA----VGDDAELRVDAN----------------RGWTPKQA 145 (265) T ss_pred HH-HCCCCEEEEECCC---------CHHHHHHHHHHHHHH----CCCCCEEEECCC----------------CCCCHHHH T ss_conf 99-7599989997689---------859999999999987----299967985787----------------78899999 Q ss_pred HHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHH Q ss_conf 99999876544869977676310423799886423883685043000021488852100386522342545300389999 Q gi|254780670|r 267 ASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETL 346 (424) Q Consensus 267 id~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ 346 (424) +.+... +++|.+.++|||+..+|++++++|.++.+ +-|.+|.- ..+..-+++.++.++++.+-+|+.-+|-+|+++ T Consensus 146 ~~~~~~-l~~~~i~w~EeP~~~~d~~~~~~L~~~~~--ipIa~gEs-~~~~~~~~~~i~~~~~d~v~~d~~~~GGit~~~ 221 (265) T cd03315 146 IRALRA-LEDLGLDYVEQPLPADDLEGRAALARATD--TPIMADES-AFTPHDAFRELALGAADAVNIKTAKTGGLTKAQ 221 (265) T ss_pred HHHHHH-HCCCCCEEEECCCCCCCHHHHHHHHHCCC--CCEEECCC-CCCHHHHHHHHHHHCHHHEEECCCCCCHHHHHH T ss_conf 999986-40058548847899899999999985199--99990898-799999999998726134140877647499999 Q ss_pred HHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCH Q ss_conf 99999998798699950788872358998999819980 Q gi|254780670|r 347 NTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQ 384 (424) Q Consensus 347 ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~ 384 (424) +.+.+|+.+|+.++++|-. |+.=.+.|-+-++...+. T Consensus 222 kia~~A~~~gi~~~~~~~~-~s~i~~aA~~hlaaa~pn 258 (265) T cd03315 222 RVLAVAEALGLPVMVGSMI-ESGLGTLANAHLAAALRA 258 (265) T ss_pred HHHHHHHHCCCCEEECCCC-CCHHHHHHHHHHHHCCCC T ss_conf 9999999869958978965-359999999999973999 No 25 >cd03323 D-glucarate_dehydratase D-Glucarate dehydratase (GlucD) catalyzes the dehydration of both D-glucarate and L-idarate to form 5-keto-4-deoxy-D-glucarate (5-KDG) , the initial reaction of the catabolic pathway for (D)-glucarate. GlucD belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and that is stabilized by coordination to the essential Mg2+ ion. Probab=99.33 E-value=9.9e-09 Score=85.03 Aligned_cols=318 Identities=16% Similarity=0.177 Sum_probs=181.9 Q ss_pred EEEEEEEEEEE--ECC-----CCCE-----EEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHH Q ss_conf 10589986886--079-----7842-----99999984897676634478664775467626898651680489999999 Q gi|254780670|r 2 FINDIIAREVI--DSR-----GSPT-----IEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFV 69 (424) Q Consensus 2 ~I~~i~~r~Il--DSR-----G~PT-----vEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~i 69 (424) +|++|....|- |++ |.++ .=|+|.|++|.+|-+-+|.+.. .+....+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~tlV~I~TD~Gi~G~GE~~~~~~-------------------~~~~~~~~~ 61 (395) T cd03323 1 KITEMRVTPVAGHDSPLLNLSGAHEPFFTRNIVELTDDNGNTGVGESPGGAE-------------------ALEALLEAA 61 (395) T ss_pred CEEEEEEEEEECCCCCCCCCCCCCCCCEEEEEEEEEECCCCEEEEECCCCHH-------------------HHHHHHHHH T ss_conf 9007999987667876555667656751589999997999979886589768-------------------999999999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCCCC--C---CCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCC Q ss_conf 8756789759885578988864653036643--2---2123222202448998752663498488822532456475430 Q gi|254780670|r 70 NDEIRTALLGCDARDQLLIDKIMIDLDGTPN--K---SRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPL 144 (424) Q Consensus 70 n~~i~~~L~g~~~~dQ~~iD~~Li~lDgT~n--k---s~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~ 144 (424) ...+.+...+.++.....+-..+...+..-. . .+.-.+|+-|+-||++-..++..++|+|+.|+|....++|+-. T Consensus 62 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~AiSAIDiALWDi~GK~~g~Pv~~LLGG~~Rd~ip~~a 141 (395) T cd03323 62 RSLVGGDVFGAYLAVLESVRVAFADRDAGGRGLQTFDLRTTVHVVTAFEVALLDLLGQALGVPVADLLGGGQRDSVPFLA 141 (395) T ss_pred HHHHCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCEEEEEE T ss_conf 99838776663578999999873146665566443222210568999999999984616389399984886356277767 Q ss_pred EEECCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 00003664556776610256514887748999899999999999875200001367655764488684689999999999 Q gi|254780670|r 145 MNILNGGIHADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSI 224 (424) Q Consensus 145 ~NiinGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi 224 (424) .-...++.+.. ..+.+-+..-...+.+-.+.++ ..++...+.|+..-.=.-|.+. .++-++.+ +|+ T Consensus 142 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~e~~~~-------~a~~~~~~~Gf~~~K~k~g~~~-----~~~di~~v-~av 207 (395) T cd03323 142 YLFYKGDRHKT-DLPYPWFRDRWGEALTPEGVVR-------LARAAIDRYGFKSFKLKGGVLP-----GEEEIEAV-KAL 207 (395) T ss_pred EEECCCCCCCC-CCCHHHHHHCCCCCCCHHHHHH-------HHHHHHHHCCCCEEEEECCCCC-----HHHHHHHH-HHH T ss_conf 74013688754-2001234320210079999999-------9999998559865987469899-----79999999-999 Q ss_pred HHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCC Q ss_conf 75385311102343110465320278511114686568789999999876544869977676310423799886423883 Q gi|254780670|r 225 EKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSN 304 (424) Q Consensus 225 ~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~ 304 (424) .++. + |+.|.+|+ +. ..+.++-+.+-..| +.+ +.++|+|+. +++++..+..+.+ T Consensus 208 rea~--~--d~~l~vDa-------n~---------~w~~~~Ai~~~r~l-e~~-l~w~EePv~--~~d~~a~lr~~~~-- 261 (395) T cd03323 208 AEAF--P--GARLRLDP-------NG---------AWSLETAIRLAKEL-EGV-LAYLEDPCG--GREGMAEFRRATG-- 261 (395) T ss_pred HHHC--C--CCEEEEEC-------CC---------CCCHHHHHHHHHHH-HCH-HHCCCCCCC--CHHHHHHHHHHCC-- T ss_conf 9867--9--98399878-------88---------84999999999997-222-211348886--6579999987339-- Q ss_pred EEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHH--HHHHHH Q ss_conf 6850430000214888521003865223425453003899999999999879869995078887235899--899981 Q gi|254780670|r 305 CQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIA--DLAVAT 380 (424) Q Consensus 305 ~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~ia--DLAVg~ 380 (424) +-|.++. .+++...++..++.++++-+..++.-+|=+|++.....+|..+|..+. .|..+++.=++-| |++.+. T Consensus 262 iPIa~gE-~~~~~~~~~~~i~~~avDi~q~di~~~GGit~~~kIa~lAea~gi~v~-pH~~~~~~i~~aA~~Hl~aa~ 337 (395) T cd03323 262 LPLATNM-IVTDFRQLGHAIQLNAVDIPLADHHFWGGMRGSVRVAQVCETWGLGWG-MHSNNHLGISLAMMTHVAAAA 337 (395) T ss_pred CCCCCCC-EECCHHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHCCCEEE-ECCCCCCHHHHHHHHHHHHHC T ss_conf 9731464-104778889999748865464364535768899999999998199276-557876389999999999858 No 26 >cd03314 MAL Methylaspartate ammonia lyase (3-methylaspartase, MAL) is a homodimeric enzyme, catalyzing the magnesium-dependent reversible alpha,beta-elimination of ammonia from L-threo-(2S,3S)-3-methylaspartic acid to mesaconic acid. This reaction is part of the main catabolic pathway for glutamate. MAL belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=99.21 E-value=1.3e-08 Score=84.29 Aligned_cols=336 Identities=20% Similarity=0.275 Sum_probs=216.9 Q ss_pred CEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 42999999848976766344786647754676268986516804899999998756789759885578988864653036 Q gi|254780670|r 18 PTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMIDLDG 97 (424) Q Consensus 18 PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~lDg 97 (424) -+|-|.+.+++|.+.++-|.+..-.|.- ..|+.|. -..-+..+++.+.|.|+|.+++.-...-..+-.+ T Consensus 13 esisV~l~L~dG~va~GDCaaVqySGag------GRdPlF~---a~~~~~~~e~~i~p~L~g~d~~~Fr~~A~~~d~~-- 81 (369) T cd03314 13 EAISVMLVLEDGQVAVGDCAAVQYSGAG------GRDPLFL---AADFIPVIEKVIAPALVGRDVANFRPAAAVLDKM-- 81 (369) T ss_pred CEEEEEEEECCCCEEEECCHHEEECCCC------CCCCCCC---HHHHHHHHHHHCCHHHHCCCHHHHHHHHHHHHHC-- T ss_conf 5499999966997886010431304777------7775543---7677899874202876168588899999987424-- Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCC---CCCCCCCCCEEECCCCCCCCC-CCCCEEEEEECCCCCCH Q ss_conf 64322123222202448998752663498488822532---456475430000036645567-76610256514887748 Q gi|254780670|r 98 TPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGC---SAHILPVPLMNILNGGIHADN-ALDFQEFMIMPVGAENI 173 (424) Q Consensus 98 T~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~---~~~~lP~P~~NiinGG~HA~~-~l~iQEfmIiP~g~~sf 173 (424) +.+-.+|---.-++||-|...|+|+..+.-.-+-++.. ....-|+|.|---+...+.+- ++=++..=++|++.-+ T Consensus 82 ~~~g~rlHtAiRYGvsQALL~A~A~a~r~tmaeVi~~e~~~~~~~~pvPif~qsgd~ry~n~DKmIlK~v~vLPH~lin- 160 (369) T cd03314 82 RLDGNRLHTAIRYGVSQALLDAVALAQRRTMAEVLCDEYGLPLADEPVPIFAQSGDDRYINVDKMILKGADVLPHALIN- 160 (369) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCEEECCCCHHHHHHHHHHHHCCCCCCHHHHC- T ss_conf 5688620388987188999999999727609999999868999888666330478525553789998278867667532- Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEE Q ss_conf 99989999999999987520000136765576448868468999999999975385311102343110465320278511 Q gi|254780670|r 174 REAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYI 253 (424) Q Consensus 174 ~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~ 253 (424) +|-+- .++.....+=++-|.+-|.+.| .++..=.+-+|+ |- T Consensus 161 -------------------------~v~~~---G~~Ge~L~eYv~Wl~~Ri~~lg-~~~Y~P~lHiDV----------YG 201 (369) T cd03314 161 -------------------------NVEEK---GPKGEKLLEYVKWLSDRIRKLG-RPGYHPILHIDV----------YG 201 (369) T ss_pred -------------------------CHHHH---CCCCHHHHHHHHHHHHHHHHHC-CCCCCCCEEEEC----------CH T ss_conf -------------------------79775---7562369999999999999846-898876268872----------10 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHH-CC-EEEEECCCCCCC----HHHHHHHHHHC---CCCEEEECCCEEECCHHHHHHHH Q ss_conf 11468656878999999987654-48-699776763104----23799886423---88368504300002148885210 Q gi|254780670|r 254 LKGENLELQPNEMASYLANLVDQ-YP-IYSIEDGMSEDD----WHGWKILTNKI---GSNCQLVGDDLFVTNPERLHKGI 324 (424) Q Consensus 254 ~~~~~~~~t~~elid~y~~l~~~-yP-I~sIEDP~~e~D----~~gw~~l~~~~---g~~~~IvGDDL~vTn~~rl~~gi 324 (424) .-+.-...+.+.+.||+..|.+. .| =..||+|.+-.. .+...+|.+.+ |.+++||.|.- |-..+-++.-. T Consensus 202 tiG~~f~~d~~r~adyl~~le~aa~p~~L~IEgPmd~g~r~~QIe~l~~Lr~~L~~~G~~v~iVADEW-CNT~eDIr~Fa 280 (369) T cd03314 202 TIGQAFDPDPDRAADYLATLEEAAAPFPLRIEGPMDAGSREAQIERMAALRAELDRRGVGVRIVADEW-CNTLEDIRDFA 280 (369) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEHH-CCCHHHHHHHH T ss_conf 58888489889999999999974689724421775767769999999999999996489846985311-18889999998 Q ss_pred HCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHH--HHHHHHHHHCCCHHEECCCCHH--HH-HHHH Q ss_conf 03865223425453003899999999999879869995078887235--8998999819980100756114--48-9999 Q gi|254780670|r 325 NEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDH--TIADLAVATNCGQIKTGSLARS--DR-IAKY 399 (424) Q Consensus 325 ~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~--~iaDLAVg~~a~~iK~G~~~R~--ER-~aKy 399 (424) +.+++.-+=||.--.|.+-.+++|+-.++++|...-+.-..-||.-+ .-+|.|+|+++.|+-.- |+=| |- +--+ T Consensus 281 da~A~dmVQIKtPDLGgi~nti~AvlyC~~~gvgaYlGGsCNETd~SAr~~~hvalAt~~~q~LaK-PGmG~DEG~mIv~ 359 (369) T cd03314 281 DAGAAHMVQIKTPDLGGIDNTIDAVLYCKEHGVGAYLGGSCNETDISARVTVHVALATRADQMLAK-PGMGVDEGLMIVT 359 (369) T ss_pred HCCCCCEEEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCCHHHHHHHHHHHHCCHHHHHCC-CCCCCCCHHHHHH T ss_conf 668866798648975553659999997502782375256555440246653355532288887169-9887653488888 Q ss_pred HHHHHHH Q ss_conf 9999999 Q gi|254780670|r 400 NQLIRIE 406 (424) Q Consensus 400 NrLlrIe 406 (424) |++.|.- T Consensus 360 NEM~R~l 366 (369) T cd03314 360 NEMNRTL 366 (369) T ss_pred HHHHHHH T ss_conf 8898898 No 27 >TIGR01502 B_methylAsp_ase methylaspartate ammonia-lyase; InterPro: IPR006395 These sequences describe methylaspartate ammonia-lyase, also called beta-methylaspartase. It follows methylaspartate mutase (composed of S and E subunits) in one of several possible pathways of glutamate fermentation. . Probab=98.86 E-value=1.7e-06 Score=68.46 Aligned_cols=313 Identities=20% Similarity=0.284 Sum_probs=222.4 Q ss_pred EEEEEEEECCCCEEEEECC----CCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 2999999848976766344----786647754676268986516804899999998756789759885578988864653 Q gi|254780670|r 19 TIEVDVCLEDGSTGRAMVP----SGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMID 94 (424) Q Consensus 19 TvEvev~~~~g~~gra~~P----SGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~ 94 (424) .+-|++.+++|...++-+- |||= |-|+-|.-+.. ...|...|+|+|+|.|++.-..- -=.. T Consensus 51 slsv~lVL~Dg~vv~GDcaaVQYSGAG----------gRDpLF~a~d~---~~v~e~~vap~Ligrd~~~~~d~--CA~v 115 (414) T TIGR01502 51 SLSVELVLEDGRVVRGDCAAVQYSGAG----------GRDPLFRAKDF---SDVVEKAVAPKLIGRDVARYKDA--CAKV 115 (414) T ss_pred EEEEEEEEECCCEEECCEEEEEECCCC----------CCCHHHHHHHH---HHHHHHHHHHHHCCCHHHHCCCH--HHHH T ss_conf 589999963794786124567507888----------86744323212---56666666456404113321520--3788 Q ss_pred CCC-CCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCC---CCCCCCCCCEEECCCCCCCCC---CCCCEEEEEEC Q ss_conf 036-64322123222202448998752663498488822532---456475430000036645567---76610256514 Q gi|254780670|r 95 LDG-TPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGC---SAHILPVPLMNILNGGIHADN---ALDFQEFMIMP 167 (424) Q Consensus 95 lDg-T~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~---~~~~lP~P~~NiinGG~HA~~---~l~iQEfmIiP 167 (424) +|. |.+..+|---.-++||-|..+|+|+.++.-.-+-|... ...+-|||.| -+-|-.-.. ++=+++.=|+| T Consensus 116 fe~~~~~r~rLhtAirYGvsQAll~Aaa~~r~tT~aev~~de~~~~~~~~~VPvf--~QSGd~R~~nvdkMilK~vdVLP 193 (414) T TIGR01502 116 FEKLADERNRLHTAIRYGVSQALLAAAAKLRKTTMAEVVADEFDLELETEAVPVF--AQSGDDRERNVDKMILKEVDVLP 193 (414) T ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCEE--ECCHHHHHHCHHHHHHHHCCCCC T ss_conf 7554315787613444226799999999874334667756444789864442103--15847876133442230025576 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHC Q ss_conf 88774899989999999999987520000136765576448868468999999999975385311102343110465320 Q gi|254780670|r 168 VGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFF 247 (424) Q Consensus 168 ~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy 247 (424) +|-.+=.| --|-|.-.+.-|=++-|.+=+.+.|..+|..=.+-+|+ T Consensus 194 HgLiNsve----------------------------~k~G~dG~~l~EyvkWl~~R~~~lgv~~gY~P~fH~DV------ 239 (414) T TIGR01502 194 HGLINSVE----------------------------KKLGLDGEKLLEYVKWLSERIAKLGVSEGYRPVFHVDV------ 239 (414) T ss_pred CCHHCCHH----------------------------HHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEE------ T ss_conf 40120203----------------------------22288711557788889998875067889840688750------ Q ss_pred CCCCEEECCCCC--CCCHHHHHHHHHHHHHHCC--EEEEECCCCCCC----HHHHHHHHHHC---CCCEEEECCCEEECC Q ss_conf 278511114686--5687899999998765448--699776763104----23799886423---883685043000021 Q gi|254780670|r 248 KKEKYILKGENL--ELQPNEMASYLANLVDQYP--IYSIEDGMSEDD----WHGWKILTNKI---GSNCQLVGDDLFVTN 316 (424) Q Consensus 248 ~~~kY~~~~~~~--~~t~~elid~y~~l~~~yP--I~sIEDP~~e~D----~~gw~~l~~~~---g~~~~IvGDDL~vTn 316 (424) |=.-++-. -+++.-+.||...|.+-=| -.-||-|.|-+| .|.+.+|...| |....||.|.- |-. T Consensus 240 ----YG~iGe~fgPP~d~~a~adyi~~la~aa~pf~L~IEgPmD~gsrqaQieama~lr~~Ld~rGv~a~iVaDEw-Cnt 314 (414) T TIGR01502 240 ----YGTIGEEFGPPLDRKAVADYIRELAEAAKPFTLRIEGPMDVGSRQAQIEAMAELRSELDDRGVSAEIVADEW-CNT 314 (414) T ss_pred ----CCHHHHHCCCCCCHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCC-CCC T ss_conf ----323343238884678888899999984689854436778845278999999999998707896389985067-887 Q ss_pred HHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHH--HHHHHHHHHCCCHHEE Q ss_conf 4888521003865223425453003899999999999879869995078887235--8998999819980100 Q gi|254780670|r 317 PERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDH--TIADLAVATNCGQIKT 387 (424) Q Consensus 317 ~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~--~iaDLAVg~~a~~iK~ 387 (424) .+=++--.+.|++.=+=||-=-+|.|.|+-+|+.+++.+++.--+.--.-||.=+ .-+|.++|++|.|+-. T Consensus 315 ~eDv~~F~dakA~h~VQiKTPDvGg~~~~arA~~yCk~n~~gAYvGGtCnET~~SA~~~~hv~~At~A~~vLA 387 (414) T TIGR01502 315 VEDVKLFVDAKAAHLVQIKTPDVGGVADIARAVLYCKANDVGAYVGGTCNETDLSARASVHVALATDADQVLA 387 (414) T ss_pred HHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHHHHCHHHHHC T ss_conf 8899997307872357762487456556999998740388202256756632344666534554423466522 No 28 >cd03320 OSBS o-Succinylbenzoate synthase (OSBS) catalyzes the conversion of 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate (SHCHC) to 4-(2'-carboxyphenyl)-4-oxobutyrate (o-succinylbenzoate or OSB), a reaction in the menaquinone biosynthetic pathway. Menaquinone is an essential cofactor for anaerobic growth in eubacteria and some archaea. OSBS belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=98.86 E-value=3.2e-06 Score=66.49 Aligned_cols=142 Identities=22% Similarity=0.260 Sum_probs=104.9 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHH Q ss_conf 68999999999975385311102343110465320278511114686568789999999876544869977676310423 Q gi|254780670|r 213 ADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWH 292 (424) Q Consensus 213 ~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~ 292 (424) .++-+..+.. |.++ .|.++.|.+|+ |.-.|.++-+.+...| ++|.|.+||+|+..+||+ T Consensus 110 ~~~Di~~i~~-ir~~---~~~~~~l~vDa----------------N~~wt~~~A~~~~~~l-~~~~l~~iEqP~~~~d~~ 168 (263) T cd03320 110 FEEDLARLRA-LREA---LPADAKLRLDA----------------NGGWSLEEALAFLEAL-AAGRIEYIEQPLPPDDLA 168 (263) T ss_pred HHHHHHHHHH-HHHH---CCCCCEEEEEC----------------CCCCCHHHHHHHHHHH-CCCCEEEEECCCCCCCHH T ss_conf 5999999999-9996---69997799989----------------8774899999999871-516835886678999999 Q ss_pred HHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCH-- Q ss_conf 799886423883685043000021488852100386522342545300389999999999987986999507888723-- Q gi|254780670|r 293 GWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETED-- 370 (424) Q Consensus 293 gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD-- 370 (424) +...+. ..+.|..|. .+.++.-+.+.++.+++..+.|||...|-++++++.+++|+.+|..+++++ +-||.= T Consensus 169 ~~~~~~----~~ipi~~DE-s~~~~~d~~~~~~~~a~d~i~iK~~~~GGi~~~~~i~~~a~~~gi~~~i~~-~~Es~Ig~ 242 (263) T cd03320 169 ELRRLA----AGVPIALDE-SLRRLDDPLALAAAGALGALVLKPALLGGPRALLELAEEARARGIPAVVSS-ALESSIGL 242 (263) T ss_pred HHHHHC----CCCCEECCC-CCCCHHHHHHHHHCCCCCEEEECCHHCCCHHHHHHHHHHHHHCCCCEEEEC-CCCHHHHH T ss_conf 998603----799854356-447878899999809998899672010899999999999998599699948-54159999 Q ss_pred HHHHHHHHHHC Q ss_conf 58998999819 Q gi|254780670|r 371 HTIADLAVATN 381 (424) Q Consensus 371 ~~iaDLAVg~~ 381 (424) ...+|||-++- T Consensus 243 ~a~~~laa~~p 253 (263) T cd03320 243 GALAHLAAALP 253 (263) T ss_pred HHHHHHHHHCC T ss_conf 99999998588 No 29 >TIGR01928 menC_lowGC/arch o-succinylbenzoic acid (OSB) synthetase; InterPro: IPR010197 This entry describes the enzyme o-succinylbenzoic acid synthetase (menC) that is involved in one of the steps of the menaquinone biosynthesis pathway. It takes SHCHC and makes it into 2-succinylbenzoate. Also included are enzymes with the common name of N-acylamino acid racemase (or the more general term, racemase / racemase family), which refers to the enzyme's industrial application as racemases, and not to its biological function as o-succinylbenzoic acid synthetase . Members are restricted to the are low GC Gram-positive bacteria and archaea. . Probab=98.84 E-value=1e-06 Score=70.10 Aligned_cols=257 Identities=20% Similarity=0.311 Sum_probs=182.5 Q ss_pred CEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 42999999848976766344786647754676268986516804899999998756789759885578988864653036 Q gi|254780670|r 18 PTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMIDLDG 97 (424) Q Consensus 18 PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~lDg 97 (424) -++=+++.-++|..+-+-|-+=.+ +.|.---.-.+.+.+.+...|...++++.+..++-+.+..+.| T Consensus 22 ~~L~i~~~~~~G~~~fGE~~Af~~-------------p~Y~~Et~A~~~h~l~~~f~~~~~~k~f~~p~~~~~~~~~~~~ 88 (326) T TIGR01928 22 DALIIELIDDKGVAGFGEVVAFQT-------------PWYTEETIATVLHILEDFFLPNILQKEFEHPSEVLELVRSLKG 88 (326) T ss_pred CEEEEEEECCCCCCEEEEEECCCC-------------CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCC T ss_conf 078888873688613334640378-------------8898762578899998764433215665788899999725588 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHH-HCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCHHHH Q ss_conf 643221232222024489987526634984888-2253245647543000003664556776610256514887748999 Q gi|254780670|r 98 TPNKSRLGANAILGVSLAVSKAAAQTSNLPLYK-YLGGCSAHILPVPLMNILNGGIHADNALDFQEFMIMPVGAENIREA 176 (424) Q Consensus 98 T~nks~lGaNailAvSlA~akA~A~~~~~pLy~-yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf~ea 176 (424) ++ .|-=++-+|++.--++.++.||++ |+++...+ .|.=.. .|-.+..+- T Consensus 89 ~~-------mA~A~Le~al~d~y~k~~~~~l~~G~~gk~rdk-~~~G~v----------------------~Gl~~~e~~ 138 (326) T TIGR01928 89 TR-------MAKAGLEMALWDLYAKKKDKSLAEGFLGKLRDK-APAGVV----------------------VGLAKLEEM 138 (326) T ss_pred CH-------HHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCE-EECCEE----------------------EECCCHHHH T ss_conf 77-------799999999998876427973113211123571-320178----------------------714765789 Q ss_pred HHHHHHHHHHHHHHHHHC----CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCE Q ss_conf 899999999999875200----0013676557644886846899999999997538531110234311046532027851 Q gi|254780670|r 177 IRMGAEVFHTLKKELKSK----GYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKY 252 (424) Q Consensus 177 l~~~~ev~~~lk~~L~~~----g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY 252 (424) |+-. -+..+.+ ++- ++|..+. +.. ..++.. |. +|.+.+|+ ++.| T Consensus 139 lk~i-------e~~~~~g~~RIKlK--------i~Pq~~~--~L~----~~~~~~-FP---~i~L~~DA-------N~sy 186 (326) T TIGR01928 139 LKEI-------ESLKKEGYKRIKLK--------ISPQIEV--DLV----KKVRKR-FP---DIPLVVDA-------NESY 186 (326) T ss_pred HHHH-------HHHHHCCCEEEEEE--------ECCCCCH--HHH----HHHHHH-CC---CCEEEEEC-------CCCC T ss_conf 9999-------99985698078875--------1688753--476----776763-79---72389733-------7788 Q ss_pred EECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEE Q ss_conf 11146865687899999998765448699776763104237998864238836850430000214888521003865223 Q gi|254780670|r 253 ILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAI 332 (424) Q Consensus 253 ~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~Nai 332 (424) .. .++-. .+..++|-+.+||.||.-+|.--.+.|++.+-. =|+=| =-+|...-++++|+-..|.-+ T Consensus 187 ~~---------~D~~~--Lk~Ld~~~l~~IE~Pf~~dDl~~l~~l~~~~~t--picLD-Es~~s~~~~~~l~eL~~~~v~ 252 (326) T TIGR01928 187 EL---------KDIPR--LKELDRYDLLYIEEPFKADDLSELKELKKELLT--PICLD-ESITSLDDARKLIELGAVKVI 252 (326) T ss_pred CC---------CCCHH--HHHHHHCCCEEEECCCCCCHHHHHHHHHHCCCC--CEECC-CCCCCHHHHHHHHCCCCCEEE T ss_conf 85---------65657--655444062033278984226668777521788--63112-340368899898515773089 Q ss_pred EECCCCHHHHHHHHHHHHHHHHCCCEEEEEC Q ss_conf 4254530038999999999998798699950 Q gi|254780670|r 333 LIKPNQIGSVSETLNTIEKAQMAGYPSIISH 363 (424) Q Consensus 333 liK~NQiGTvset~ea~~~a~~~g~~~ivSh 363 (424) .|||.-+|=+++++.++.++|+|+..+.|.. T Consensus 253 n~Kp~RlGGl~~~~k~~d~~k~h~~~vwiGg 283 (326) T TIGR01928 253 NIKPGRLGGLTEALKAIDLCKEHGIKVWIGG 283 (326) T ss_pred EECCCCCCCHHHHHHHHHHHHHCCCEEEEEE T ss_conf 9545431516689999999987797699932 No 30 >PRK05105 O-succinylbenzoate synthase; Provisional Probab=98.67 E-value=3e-05 Score=59.34 Aligned_cols=118 Identities=22% Similarity=0.229 Sum_probs=87.4 Q ss_pred CCCCCHHHHHHHHHHHH--HHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEEC Q ss_conf 86568789999999876--5448699776763104237998864238836850430000214888521003865223425 Q gi|254780670|r 258 NLELQPNEMASYLANLV--DQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIK 335 (424) Q Consensus 258 ~~~~t~~elid~y~~l~--~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK 335 (424) |.-.|.+|-+++...|- .+++|-+||+|+...| ....|....+ +.|..|.-..+ +.. .-+......+++|| T Consensus 167 N~gws~~~A~~~~~~l~~~~~~~ie~iEqP~~~~~--d~~~~~~~~~--vpia~DEsl~~-~~~--~~~~~~~~~ai~iK 239 (326) T PRK05105 167 NRGWTLEKAQQFAKYVPPDYRSRIEFLEEPCKTPD--DSRAFARATG--IAIAWDESLRE-PDF--QFEAEPGVRAIVIK 239 (326) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCEEECCCCCCHH--HHHHHHHHCC--CCEEECCCCCC-CHH--HHHHHCCCCEEEEC T ss_conf 88979999999999744652688404258889867--8999998379--98898352468-205--56652568889985 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCC--HHHHHHHHHHHCCC Q ss_conf 4530038999999999998798699950788872--35899899981998 Q gi|254780670|r 336 PNQIGSVSETLNTIEKAQMAGYPSIISHRSGETE--DHTIADLAVATNCG 383 (424) Q Consensus 336 ~NQiGTvset~ea~~~a~~~g~~~ivShRSGETe--D~~iaDLAVg~~a~ 383 (424) |.-.|.++++++.++.|+.+|..+|||+ +-|+. =++++|||-+++.. T Consensus 240 ~~~~GGl~~a~~l~~~A~~~gl~~~v~s-~~ES~igl~a~a~lAa~~~~~ 288 (326) T PRK05105 240 PTLLGSLEKCQELIEQAHALGLRAVISS-SIESSLGLTQLARLAAWLTPD 288 (326) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCEEEEC-CCCHHHHHHHHHHHHHHHCCC T ss_conf 4154899999999999998799889983-510799999999999852889 No 31 >PRK02714 O-succinylbenzoate synthase; Provisional Probab=98.61 E-value=9.5e-06 Score=62.98 Aligned_cols=144 Identities=18% Similarity=0.188 Sum_probs=104.8 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHH---HHHHHHHHHHCCEEEEECCCCC Q ss_conf 4689999999999753853111023431104653202785111146865687899---9999987654486997767631 Q gi|254780670|r 212 TADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEM---ASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 212 ~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~el---id~y~~l~~~yPI~sIEDP~~e 288 (424) +.++-++.+.+..+.. |.++.|-+|+ ++. .|.++- +++... ...+.|-+||.|+.. T Consensus 145 ~~~~Dl~~l~air~~~----~~~~~LRvDA-------N~g---------ws~~~A~~~l~~l~~-l~~~~i~~iEQPl~~ 203 (320) T PRK02714 145 PLEQELKIFEQLLERL----PAGAKLRLDA-------NGG---------LSLEEAKRWLQLCDR-INPGKIEFLEQPLPP 203 (320) T ss_pred CCHHHHHHHHHHHHHC----CCCCEEEEEC-------CCC---------CCHHHHHHHHHHHHH-HCCCCHHHEECCCCC T ss_conf 9478999999999966----9886799957-------779---------899999999999987-134681330088995 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCC Q ss_conf 04237998864238836850430000214888521003865223425453003899999999999879869995078887 Q gi|254780670|r 289 DDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGET 368 (424) Q Consensus 289 ~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGET 368 (424) +||++...|+++.+ +-|.-|.= +.++..+.+.+..+++..+.|||...|.+.. ..++|+.+|..+|+|. +=|| T Consensus 204 ~d~~~l~~l~~~~~--~PIalDES-i~~~~d~~~~~~~~~~~~i~iKp~~~G~~~~---~~~~a~~~gi~~~igs-~lEs 276 (320) T PRK02714 204 DQFSEMLQLSQDYQ--TPIALDES-VANLAQLQACYQQGWRGIFVIKPAIAGSPSR---LRQFCQQHPLDAVFSS-VFET 276 (320) T ss_pred CCHHHHHHHHHHCC--CCEEECCC-CCCHHHHHHHHHCCCCCEEEECCCCCCCHHH---HHHHHHHCCCCEEEEE-CCHH T ss_conf 75999999998579--97997674-6882549999863899789987436778999---9999998699889990-4347 Q ss_pred C--HHHHHHHHHHHCCC Q ss_conf 2--35899899981998 Q gi|254780670|r 369 E--DHTIADLAVATNCG 383 (424) Q Consensus 369 e--D~~iaDLAVg~~a~ 383 (424) - =.+.+|||-.+... T Consensus 277 ~iG~~~~~~lAa~~~~~ 293 (320) T PRK02714 277 AIGRKAVLALAAELSRP 293 (320) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999999848899 No 32 >TIGR02534 mucon_cyclo muconate and chloromuconate cycloisomerases; InterPro: IPR013370 This family consists of muconate cycloisomerase (5.5.1.1 from EC) and chloromuconate cycloisomerase (5.5.1.7 from EC), enzymes that often overlap in specificity. It does not include more distantly related proteins such as mandelate racemase (5.1.2.2 from EC).; GO: 0018849 muconate cycloisomerase activity, 0018850 chloromuconate cycloisomerase activity, 0030145 manganese ion binding. Probab=98.47 E-value=1.1e-05 Score=62.43 Aligned_cols=290 Identities=21% Similarity=0.241 Sum_probs=204.0 Q ss_pred CCCCCEEEEEEEECCCCEEE--EECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 79784299999984897676--6344786647754676268986516804899999998756789759885578988864 Q gi|254780670|r 14 SRGSPTIEVDVCLEDGSTGR--AMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKI 91 (424) Q Consensus 14 SRG~PTvEvev~~~~g~~gr--a~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~ 91 (424) +.-+|=|=|.|.|+||..|- +-+| |+..|++-||..-=-+|.+-++|.|+|.++++ .|-.. T Consensus 24 M~~QtLvLV~irt~DGv~G~GEgtti---------------GGl~~G~ESpE~~K~~IDtYlAP~L~G~~a~~--n~~a~ 86 (369) T TIGR02534 24 MTEQTLVLVRIRTEDGVVGYGEGTTI---------------GGLWYGEESPETIKVNIDTYLAPVLVGRDATE--NIAAI 86 (369) T ss_pred CCCCEEEEEEEEECCCEEEEEECCCC---------------CCCCCCCCCCCEEEEEEEHHHHHHHCCCCCCC--CHHHH T ss_conf 05780536787532772898510006---------------86778884744046653001311020312442--47899 Q ss_pred HHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCCC Q ss_conf 65303664322123222202448998752663498488822532456475430000036645567766102565148877 Q gi|254780670|r 92 MIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLMNILNGGIHADNALDFQEFMIMPVGAE 171 (424) Q Consensus 92 Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~ 171 (424) |..|.+.-+-.|. |=.||-.|..-|-|+..|+|+.+-|+|.--.++||-.- | +.. T Consensus 87 m~~l~k~~~gnRF---AK~Av~tAL~DAqA~rLgVPV~eLLGG~~Rd~vdV~Wt--L--------------------asg 141 (369) T TIGR02534 87 MADLEKVVAGNRF---AKAAVDTALLDAQARRLGVPVSELLGGAVRDSVDVLWT--L--------------------ASG 141 (369) T ss_pred HHHHHHHHHCCHH---HHHHHHHHHHHHHHHHCCCCHHHHCCCCEECCCCHHHH--H--------------------CCC T ss_conf 9999998720613---68899999899877647872344278741144540242--1--------------------157 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCC---CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCC Q ss_conf 4899989999999999987520000---1367655764488684689999999999753853111023431104653202 Q gi|254780670|r 172 NIREAIRMGAEVFHTLKKELKSKGY---STNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFK 248 (424) Q Consensus 172 sf~eal~~~~ev~~~lk~~L~~~g~---~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~ 248 (424) ..... ..| .++.|.+|.+ .-.+| .-.|.- +...... |.++ +|..+.+=+|+ T Consensus 142 dtd~d---I~E----Ae~~~E~~rh~~FKLKiG---~~~~a~-DV~hV~~-----I~~A---Lg~r~~VRVDV------- 195 (369) T TIGR02534 142 DTDRD---IEE----AEELLEEKRHRSFKLKIG---ARDLAD-DVAHVVA-----IKKA---LGDRASVRVDV------- 195 (369) T ss_pred CHHHH---HHH----HHHHHHHHCCCCEEEECC---CCCHHH-HHHHHHH-----HHHH---HCCCEEEEEEC------- T ss_conf 53478---999----999999726882154437---888657-7999999-----9997---29952899720------- Q ss_pred CCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCC Q ss_conf 78511114686568789999999876544869977676310423799886423883685043000021488852100386 Q gi|254780670|r 249 KEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEV 328 (424) Q Consensus 249 ~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~ 328 (424) |+--+..+=+-|...|-+ =.|=.||.|..-|+.++-++||.++ .+=|..|--..|=.+=++- .+..+ T Consensus 196 ---------N~aW~e~~A~~~~~~Lad-aGv~LIEQPtpae~~~~LarLt~r~--~VpIMADE~v~gP~dAl~~-a~~~a 262 (369) T TIGR02534 196 ---------NAAWDELEALKYLPQLAD-AGVELIEQPTPAEQREALARLTRRF--NVPIMADESVTGPADALEI-AKASA 262 (369) T ss_pred ---------CCCCCHHHHHHHHHHHHH-CCHHHHCCCCCHHHHHHHHHHHHHC--CCCEEECCCCCCHHHHHHH-HHHHC T ss_conf ---------366787899999888885-0700004889620489999998433--8854304456882789999-99825 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCEE----EEECCCCCCCHHHHHHHHHHHCCCHHEECC Q ss_conf 5223425453003899999999999879869----995078887235899899981998010075 Q gi|254780670|r 329 ANAILIKPNQIGSVSETLNTIEKAQMAGYPS----IISHRSGETEDHTIADLAVATNCGQIKTGS 389 (424) Q Consensus 329 ~NailiK~NQiGTvset~ea~~~a~~~g~~~----ivShRSGETeD~~iaDLAVg~~a~~iK~G~ 389 (424) |.-+=+|..|-|=|++..+.+..|+.+|..+ |+=-.= =+||=+-....-++|+.|- T Consensus 263 ADVfAlK~~ksGGL~~~~~~AAIA~AAGi~lyGgTmLEGpi-----GtiASah~f~~~~~l~~GT 322 (369) T TIGR02534 263 ADVFALKIAKSGGLLESKKVAAIAKAAGIALYGGTMLEGPI-----GTIASAHFFAAFPELEFGT 322 (369) T ss_pred CCEEEEEHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCHH-----HHHHHHHHHHCCCCCCCCC T ss_conf 97787400014415789999999976483101650031026-----8999999984135666464 No 33 >COG3799 Mal Methylaspartate ammonia-lyase [Amino acid transport and metabolism] Probab=98.19 E-value=0.00055 Score=49.97 Aligned_cols=308 Identities=20% Similarity=0.283 Sum_probs=187.2 Q ss_pred EEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHC Q ss_conf 299999984897676634478664775467626898651680489999999875678975988557---89888646530 Q gi|254780670|r 19 TIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARD---QLLIDKIMIDL 95 (424) Q Consensus 19 TvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~d---Q~~iD~~Li~l 95 (424) .+-|++.+.+|...++-+.+.--.|.- +.|+-|. -..-+..+|..+.|.|+|.|+.- -..+=+.|+ T Consensus 52 ~lsv~lvL~dg~vv~GDc~aVQYSGAG------gRDpLF~---a~~~~~~~~~~v~p~LvgrDv~~~ldnA~vfe~l~-- 120 (410) T COG3799 52 CLSVQLVLSDGAVVVGDCAAVQYSGAG------GRDPLFL---AEHFIPFLNDHVKPLLVGRDVDAFLDNARVFEKLI-- 120 (410) T ss_pred EEEEEEEEECCCEEEEEEEEEEECCCC------CCCCHHH---HHHHHHHHHHHHHHHHHCCCHHHHCCHHHHHHHHC-- T ss_conf 068999995695897202357732788------8771235---65547777633236661752876454367667643-- Q ss_pred CCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCC---CCCCCCCCEEECCCCCC---CCCCCCCEEEEEECCC Q ss_conf 36643221232222024489987526634984888225324---56475430000036645---5677661025651488 Q gi|254780670|r 96 DGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCS---AHILPVPLMNILNGGIH---ADNALDFQEFMIMPVG 169 (424) Q Consensus 96 DgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~---~~~lP~P~~NiinGG~H---A~~~l~iQEfmIiP~g 169 (424) |+ ..|---.-++||.|...|+|...+.---+.++... ..+-|+|.+-- .|.. +-.++-.+-+=++|.| T Consensus 121 d~----~~LhtAvrYGvSQALl~Aaa~a~~tt~tevvcde~~lp~~te~vP~fgQ--SGd~R~~~vdkMiLK~vdVLPHg 194 (410) T COG3799 121 DG----NLLHTAVRYGVSQALLDAAALATGTTKTEVVCDEWQLPRVTESVPLFGQ--SGDDRYIAVDKMILKGVDVLPHG 194 (410) T ss_pred CC----CCCHHHHHHHHHHHHHHHHHHHHCCCHHEEEHHHHCCCCCCCCCCCCCC--CCCHHHHHHHHHHHHHCCCCCHH T ss_conf 67----7515888860779999999985245211232343289876666653466--84066652788888612746456 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH---CCCCCCCCHHHHHCHHHHHH Q ss_conf 774899989999999999987520000136765576448868468999999999975---38531110234311046532 Q gi|254780670|r 170 AENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEK---AGYNAGKDLLIALDCAASAF 246 (424) Q Consensus 170 ~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~---aGy~~g~di~i~lD~AAsef 246 (424) +-.++- + |.|.-.+..|-++-+.+-+.. .||.|- +-+|+ T Consensus 195 --------------------------LiNsve-~--~G~dG~~l~Eyv~Wls~R~~~~g~~gYhP~----lH~DV----- 236 (410) T COG3799 195 --------------------------LINSVE-E--LGFDGEKLREYVRWLSDRILSKGTSGYHPT----LHIDV----- 236 (410) T ss_pred --------------------------HHHHHH-H--HCCCHHHHHHHHHHHHHHHHHCCCCCCCCC----EEEEE----- T ss_conf --------------------------532278-7--597538799999999999875079987860----79861----- Q ss_pred CCCCCEEECCCCCCCCHHHHHHHHHHHHHHCC--EEEEECCCC----CCCHHHHHHHHHH---CCCCEEEECCCEEECCH Q ss_conf 02785111146865687899999998765448--699776763----1042379988642---38836850430000214 Q gi|254780670|r 247 FKKEKYILKGENLELQPNEMASYLANLVDQYP--IYSIEDGMS----EDDWHGWKILTNK---IGSNCQLVGDDLFVTNP 317 (424) Q Consensus 247 y~~~kY~~~~~~~~~t~~elid~y~~l~~~yP--I~sIEDP~~----e~D~~gw~~l~~~---~g~~~~IvGDDL~vTn~ 317 (424) |-.-++-.-+++.-..+|..+|-++-| -.+||-|.+ +.-.+...++++. +|..+.||.|.- |-.- T Consensus 237 -----YG~iGe~fg~dp~r~a~yi~~l~~~a~~~pL~IEgP~DaGs~~aQI~~~a~i~~~L~~~Gs~v~IVaDEw-Cnt~ 310 (410) T COG3799 237 -----YGTIGEIFGMDPLRCAQYIASLEKEAQGLPLYIEGPVDAGSKPAQIRLLAAITKELTRLGSGVKIVADEW-CNTY 310 (410) T ss_pred -----EEHHHHHHCCCHHHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEEHHH-CCCH T ss_conf -----0036778388889999999999853789825632665678888999999999999864587506865212-0538 Q ss_pred HHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCH--HHHHHHHHHHCCCHHEE Q ss_conf 88852100386522342545300389999999999987986999507888723--58998999819980100 Q gi|254780670|r 318 ERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETED--HTIADLAVATNCGQIKT 387 (424) Q Consensus 318 ~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD--~~iaDLAVg~~a~~iK~ 387 (424) .-+..-.+-++|.-+-||.--+|++-++..++.+++.+.+..-+....-||.= -.-+|.|+++.+.++-. T Consensus 311 ~Di~~F~dA~a~h~VQiKTPDvGsi~~~~rAvlyC~~~~~~AYvGGtCnETdvSAr~cvHValAt~a~~mLa 382 (410) T COG3799 311 QDIVDFTDAAACHMVQIKTPDVGSIHNIVRAVLYCNSHSMEAYVGGTCNETDVSARTCVHVALATRAMRMLA 382 (410) T ss_pred HHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHCCCCCCEEECCCCCCCCHHHHHEEHHHHHHCHHHHHC T ss_conf 899998764024379843898401788878886424775320116655655300333011221204787844 No 34 >pfam07476 MAAL_C Methylaspartate ammonia-lyase C-terminus. Methylaspartate ammonia-lyase EC:4.3.1.2 catalyses the second step of fermentation of glutamate. It is a homodimer. This family represents the C-terminal region of Methylaspartate ammonia-lyase and contains a TIM barrel fold similar to the pfam01188. This family represents the catalytic domain and contains a metal binding site. Probab=97.97 E-value=0.00013 Score=54.52 Aligned_cols=147 Identities=21% Similarity=0.301 Sum_probs=116.1 Q ss_pred CCHHHHHHHHHHHHHH-CC-EEEEECCCCCCC----HHHHHHHHHHC---CCCEEEECCCEEECCHHHHHHHHHCCCCCE Q ss_conf 6878999999987654-48-699776763104----23799886423---883685043000021488852100386522 Q gi|254780670|r 261 LQPNEMASYLANLVDQ-YP-IYSIEDGMSEDD----WHGWKILTNKI---GSNCQLVGDDLFVTNPERLHKGINEEVANA 331 (424) Q Consensus 261 ~t~~elid~y~~l~~~-yP-I~sIEDP~~e~D----~~gw~~l~~~~---g~~~~IvGDDL~vTn~~rl~~gi~~~~~Na 331 (424) .+.+.+.+|+..|.+. .| =..||+|.+-.+ .+....|++++ |.+++||.|.- |-..+-++.-.+.+++.- T Consensus 87 ~d~~~~adYl~~Le~aaap~~L~iEgPmd~g~r~~Qie~l~~Lr~~L~~~G~~v~iVADEW-CNT~eDI~~F~da~A~~m 165 (249) T pfam07476 87 NDLDRMADYLATLEEAAAPFPLRIEGPMDAGSKEAQIERLAALREKLDRRGIGVEIVADEW-CNTLEDIREFVDAGAADM 165 (249) T ss_pred CCHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEHH-CCCHHHHHHHHHCCCCCE T ss_conf 9789999999999985689724530775667759999999999999996489847984021-089899999996688667 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHH--HHHHHHHHHCCCHHEECCCCHH--H-HHHHHHHHHHHH Q ss_conf 3425453003899999999999879869995078887235--8998999819980100756114--4-899999999999 Q gi|254780670|r 332 ILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDH--TIADLAVATNCGQIKTGSLARS--D-RIAKYNQLIRIE 406 (424) Q Consensus 332 iliK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~--~iaDLAVg~~a~~iK~G~~~R~--E-R~aKyNrLlrIe 406 (424) +-||.--.|.|-.++||+-.++++|...-+.-..-||.-+ .-+|.|+|+++.|+-.- |+=| | =+--+|++.|.- T Consensus 166 VQIKtPDLGgi~n~ieAvlyC~~~g~gaYlGGtCNETd~SAr~~~hvalAt~~~q~LaK-PGmG~DEG~mIv~NEM~R~l 244 (249) T pfam07476 166 VQIKTPDLGGINNTIEAVLYCKEKGVGAYLGGTCNETDRSARVSVHVALATRPDQMLAK-PGMGVDEGLMIVTNEMNRTL 244 (249) T ss_pred EEECCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHHHHHCCHHHHHCC-CCCCCCCHHHHHHHHHHHHH T ss_conf 88648975554669999998621694165056555430257776567752188888169-98876525899998999999 Q ss_pred HHH Q ss_conf 983 Q gi|254780670|r 407 ESL 409 (424) Q Consensus 407 e~L 409 (424) .-+ T Consensus 245 al~ 247 (249) T pfam07476 245 ALL 247 (249) T ss_pred HHH T ss_conf 984 No 35 >pfam02746 MR_MLE_N Mandelate racemase / muconate lactonizing enzyme, N-terminal domain. SCOP reports fold similarity with enolase N-terminal domain. Probab=97.77 E-value=0.00028 Score=52.14 Aligned_cols=92 Identities=23% Similarity=0.266 Sum_probs=68.0 Q ss_pred CCEEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC Q ss_conf 84299999984897676634478664775467626898651680489999999875678975988557898886465303 Q gi|254780670|r 17 SPTIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMIDLD 96 (424) Q Consensus 17 ~PTvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~lD 96 (424) .+.|-|+|.+++|.+|.+-+.+ ..+ .-......+.+.++|.|+|+++.+.+.+-+.|...- T Consensus 26 ~~~vlV~l~t~dGi~G~Ge~~~--~~~-----------------~~~~~~~~i~~~l~p~lig~~~~~~~~~~~~~~~~~ 86 (117) T pfam02746 26 QSLVIVRIETSEGVVGIGEATS--VGG-----------------RAETIKAILDDHLAPLLIGRDAANISDLWQLMYRAA 86 (117) T ss_pred EEEEEEEEEECCCCEEEEEEEC--CCC-----------------CHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHC T ss_conf 2599999998999899988847--889-----------------779999999999999975899659999999976635 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCC Q ss_conf 6643221232222024489987526634984888225 Q gi|254780670|r 97 GTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLG 133 (424) Q Consensus 97 gT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~ 133 (424) . . ...++-|+-+|++-+.++..++|||++|+ T Consensus 87 ~-~-----~~~A~aaiD~AlwDl~gK~~g~Pl~~lLG 117 (117) T pfam02746 87 L-G-----NMSAKAAIDMALWDLKAKVLNLPLADLLG 117 (117) T ss_pred C-C-----HHHHHHHHHHHHHHHHHHHCCCCHHHHCC T ss_conf 5-4-----18899999999999989885986899758 No 36 >TIGR03247 glucar-dehydr glucarate dehydratase. Glucarate dehydratase converts D-glucarate (and L-idarate, a stereoisomer) to 5-dehydro-4-deoxyglucarate which is subsequently acted on by GarL, tartronate semialdehyde reductase and glycerate kinase (, GenProp0716). The E. coli enzyme has been well-characterized. Probab=97.52 E-value=0.014 Score=39.64 Aligned_cols=280 Identities=15% Similarity=0.185 Sum_probs=160.9 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCC---------CCCCCCCCHHHHHHHHHHHHHCCC Q ss_conf 51680489999999875678975988557898886465303664322---------123222202448998752663498 Q gi|254780670|r 56 RYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMIDLDGTPNKS---------RLGANAILGVSLAVSKAAAQTSNL 126 (424) Q Consensus 56 ~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~lDgT~nks---------~lGaNailAvSlA~akA~A~~~~~ 126 (424) -|+|.++.++++ ...|.|+|.++.+.+.|=+.+...-+..+.+ +...+++.|+-+||.-.-.+..++ T Consensus 51 ~~Gg~~~~~~L~----~~~~~vvG~~v~~~~~i~~~v~~~~~~~~~~g~g~~t~dlr~~~~a~sAiE~A~lDl~GK~lg~ 126 (441) T TIGR03247 51 VPGGEKIRATLE----DARPLVVGKPLGEYQNVLNDVRATFADRDAGGRGLQTFDLRTTIHAVTAIESALLDLLGQHLGV 126 (441) T ss_pred CCCCHHHHHHHH----HHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 889789999999----9788856897789999999999873577866777531000013465899999999862003288 Q ss_pred CHHHHCC-CCCCCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCH-----HHHHHHHHHHHHHHHHHHHHC-CCCCC Q ss_conf 4888225-3245647543000003664556776610256514887748-----999899999999999875200-00136 Q gi|254780670|r 127 PLYKYLG-GCSAHILPVPLMNILNGGIHADNALDFQEFMIMPVGAENI-----REAIRMGAEVFHTLKKELKSK-GYSTN 199 (424) Q Consensus 127 pLy~yl~-~~~~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf-----~eal~~~~ev~~~lk~~L~~~-g~~t~ 199 (424) |++.-|+ |..--..|+-..-..-|- +....++.. --|...... .|++. -|-.-.+.+.+.++ |+.. T Consensus 127 PV~~LLG~G~~RD~V~~~aYLFy~gd-~~~t~~~y~---~~~~~~d~W~~~r~~ealt--pe~iv~~A~a~~~~yGF~~- 199 (441) T TIGR03247 127 PVAALLGEGQQRDEVEMLGYLFFIGD-RKRTSLPYR---SGPQDDDDWFRLRHEEALT--PEAVVRLAEAAYDRYGFRD- 199 (441) T ss_pred CHHHHHCCCCCCCCEEEEEEEEEECC-CCCCCCCCC---CCCCCCCHHHHHHHHCCCC--HHHHHHHHHHHHHHCCCCE- T ss_conf 69998478842251046773214047-656788867---7987631266652001289--9999999999887418600- Q ss_pred CCCCCC-CCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 765576-4488684689999999999753853111023431104653202785111146865687899999998765448 Q gi|254780670|r 200 VGDEGG-FSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYP 278 (424) Q Consensus 200 vgDEGG-fap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yP 278 (424) .-=.|| |.|.- . -+++.-|.+|+-. ..+-|| .+--.|.++=|.+-.+| +.+ T Consensus 200 ~KLKgGV~~~~e-E-i~~v~AL~eafP~--------~~lrlD----------------PNgaWs~~~Airl~r~l-~~~- 251 (441) T TIGR03247 200 FKLKGGVLRGEE-E-IEAVTALAKRFPQ--------ARITLD----------------PNGAWSLDEAIALCKDL-KGV- 251 (441) T ss_pred EEECCCCCCHHH-H-HHHHHHHHHHCCC--------CCEEEC----------------CCCCCCHHHHHHHHHHH-HCC- T ss_conf 441478888188-9-9999999986899--------866257----------------99886889999999972-340- Q ss_pred EEEEECCCCCCC-HHH---HHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 699776763104-237---9988642388368504300002148885210038652234254530038999999999998 Q gi|254780670|r 279 IYSIEDGMSEDD-WHG---WKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQM 354 (424) Q Consensus 279 I~sIEDP~~e~D-~~g---w~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~ 354 (424) +-|+|||...++ |.| ..++.++.+ + =+.-.-.+|+-+-+..++..++..-+|-.|+== ++-.+.....+++. T Consensus 252 l~Y~EDP~~~e~g~~gre~MAe~r~~t~--i-PlATNm~~t~~~~~~~ai~~~AVdIvLaDpHfW-g~~~s~~vA~~C~~ 327 (441) T TIGR03247 252 LAYAEDPCGAEQGYSGREVMAEFRRATG--L-PTATNMIATDWRQMGHALQLQAVDIPLADPHFW-TMQGSVRVAQMCHD 327 (441) T ss_pred HHHHCCCCCCCCCCCHHHHHHHHHHCCC--C-CCCCCEEEECHHHHHHHHHHCCCCEECCCCCHH-HHHHHHHHHHHHHH T ss_conf 5565188754467226899999985389--8-730231650589978999708873772687502-15778999999998 Q ss_pred CCCEEEEECCCCCCCHHH--HHHHHHH Q ss_conf 798699950788872358--9989998 Q gi|254780670|r 355 AGYPSIISHRSGETEDHT--IADLAVA 379 (424) Q Consensus 355 ~g~~~ivShRSGETeD~~--iaDLAVg 379 (424) .|. +.-+|...+..=+. ..|+|.+ T Consensus 328 ~gl-~~~~HSn~hlgISLAam~HvaAa 353 (441) T TIGR03247 328 WGL-TWGSHSNNHFDISLAMFTHVAAA 353 (441) T ss_pred HCC-CEECCCCCCCCHHHHHHHHHHHC T ss_conf 398-63523577423999999987422 No 37 >TIGR01927 menC_gamma/gm+ o-succinylbenzoic acid (OSB) synthetase; InterPro: IPR010196 This entry describes the enzyme o-succinylbenzoic acid synthetase (MenC) that is involved in one of the steps of the menaquinone biosynthesis pathway. The biosynthesis of menaquinone has been studied most in E. colii . 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate (SHCHC) is dehydrated by MenC to give the aromatic compound O-succinylbenzoate (OSB) . MenC, which is a member of the enolase superfamily and x-ray structure has been determined , . Many of the common names of the proteins in this family are incorrectly described as O-succinylbenzoyl-CoA synthase. Members are restricted to bacteria. ; GO: 0000287 magnesium ion binding, 0016836 hydro-lyase activity, 0009234 menaquinone biosynthetic process. Probab=95.72 E-value=0.033 Score=36.83 Aligned_cols=123 Identities=21% Similarity=0.358 Sum_probs=91.6 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHH---HCC--EEEEECCCCC Q ss_conf 89999999999753853111023431104653202785111146865687899999998765---448--6997767631 Q gi|254780670|r 214 DSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVD---QYP--IYSIEDGMSE 288 (424) Q Consensus 214 eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~---~yP--I~sIEDP~~e 288 (424) -...+.|.+++-. ...|=||. |.-++.++=..|..-+-- .|| |-|||.|+.. T Consensus 162 ~~~~~~L~~~lPd-------~~~LRLDA----------------N~~l~~~~A~~f~~~~~pqre~~~~~i~flEePl~~ 218 (336) T TIGR01927 162 GMLVNLLLEALPD-------KAELRLDA----------------NRGLSPDEAQQFLKALDPQRENLRGRIAFLEEPLPD 218 (336) T ss_pred HHHHHHHHHHCCC-------CCEEHHCC----------------CCCCCHHHHHHHHHHCCCHHCCCCCCEEEECCCCCC T ss_conf 8999999984644-------34132101----------------457787899999861482001576523221188785 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCEEECC---HHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 0423799886423883685043000021---4888521003865223425453003899999999999879869995 Q gi|254780670|r 289 DDWHGWKILTNKIGSNCQLVGDDLFVTN---PERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIIS 362 (424) Q Consensus 289 ~D~~gw~~l~~~~g~~~~IvGDDL~vTn---~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivS 362 (424) ..++....|....|- =|-=|-= +.+ ..-+..-++.++-.+++||||=+|++....+.++.|+.-|..-|+| T Consensus 219 ~~~~~~~~f~~~tg~--aiAlDEs-l~~avr~~~lA~~~~~G~~g~~ViKP~~~Gs~~~~~~~~~~a~~lGl~AV~S 292 (336) T TIGR01927 219 EEIDEMLAFSEDTGI--AIALDES-LAEAVRLAQLAAEYEKGVRGALVIKPAIIGSPAKLRDLVQKAHRLGLQAVFS 292 (336) T ss_pred HHHHHHHHHHHHHCC--CHHHHHH-HHHHCCCHHHHHHHCCCCCCEEEECHHHCCCHHHHHHHHHHHHHCCCCHHHH T ss_conf 437899999887268--4178889-9986077679998628997247865012287899999999877568630168 No 38 >pfam05034 MAAL_N Methylaspartate ammonia-lyase N-terminus. Methylaspartate ammonia-lyase EC:4.3.1.2 catalyses the second step of fermentation of glutamate. It is a homodimer. This family represents the N-terminal region of Methylaspartate ammonia-lyase. This domain is structurally related to pfam03952. This domain is associated with the catalytic domain pfam07476. Probab=95.26 E-value=0.29 Score=29.78 Aligned_cols=104 Identities=20% Similarity=0.225 Sum_probs=68.5 Q ss_pred EEEEEEEECCCCEEEEECCCCCCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCC Q ss_conf 29999998489767663447866477546762689865168048999999987567897598855789888646530366 Q gi|254780670|r 19 TIEVDVCLEDGSTGRAMVPSGASTGIHEAFELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMIDLDGT 98 (424) Q Consensus 19 TvEvev~~~~g~~gra~~PSGaStG~~Ea~elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~lDgT 98 (424) +|-|.+.+++|.+.++-|.+..-.|.- ..|+.|. -..-+..++..|+|.|+|.|++.-...-..+-++ T Consensus 51 sisV~l~L~DG~va~GDCaaVqySGag------GRDPLF~---A~~~~~~~e~~v~p~L~g~d~~~Fr~~a~~~d~~--- 118 (158) T pfam05034 51 SLSVGLVLEDGTVVWGDCAAVQYSGAG------GRDPLFL---ASDFIPVTERVVAPRLIGRDVSRFLDNAAKVDKL--- 118 (158) T ss_pred EEEEEEEECCCCEEEEEEHHEEECCCC------CCCCCCC---HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC--- T ss_conf 799999957997887102431314777------7775443---7788899986411887088588999999998621--- Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCC Q ss_conf 432212322220244899875266349848882253 Q gi|254780670|r 99 PNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGG 134 (424) Q Consensus 99 ~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~ 134 (424) .+-.+|---.-++||-|...|+|+..+.-.-+-++. T Consensus 119 ~~g~rLhtAiRYGvSQALL~A~A~a~r~tmaeVi~~ 154 (158) T pfam05034 119 IDGNRLHTAIRYGVSQALLDAAAHARRTTMAEVVCD 154 (158) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 378730378887189999999999827609998887 No 39 >PRK02901 O-succinylbenzoate synthase; Provisional Probab=95.11 E-value=0.087 Score=33.68 Aligned_cols=123 Identities=20% Similarity=0.260 Sum_probs=86.5 Q ss_pred HHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEE--CCHHHHHH Q ss_conf 3202785111146865687899999998765448699776763104237998864238836850430000--21488852 Q gi|254780670|r 245 AFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFV--TNPERLHK 322 (424) Q Consensus 245 efy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~v--Tn~~rl~~ 322 (424) +..-+.+-.++. |.-.|.+|=+.....|...++|-|+|.|... .+....+..+++ +-|..|.-.. .++.++. T Consensus 129 ~a~p~~~iRvDA-Ng~Wsv~eA~~~l~~L~a~g~lEYvEQPcat--ieeLaeLR~rv~--vpIAaDEsiR~~~dp~~v~- 202 (327) T PRK02901 129 EALPADRVRVDA-NGGWSVDEAVAAAAALTADGPLEYLEQPCAT--VEELAELRRRVD--VPIAADESIRKAEDPLRVV- 202 (327) T ss_pred HHCCCCEEEEEC-CCCCCHHHHHHHHHHHHHCCCCHHHCCCCCC--HHHHHHHHHCCC--CCEEEHHHHHCCCCHHHHH- T ss_conf 747865089846-7884699999999987754784202187778--899999996089--8532025562668768887- Q ss_pred HHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHH Q ss_conf 100386522342545300389999999999987986999507888723589989998 Q gi|254780670|r 323 GINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVA 379 (424) Q Consensus 323 gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg 379 (424) ..++++-+++|+.-.|-+..+++ +|+..|..++||. .-||.=-.-+.|+.+ T Consensus 203 --~~~AADv~vlKv~PLGGi~ral~---iAe~~glpvVVSS-aLessVGiaagl~lA 253 (327) T PRK02901 203 --RAGAADVAVLKVAPLGGVRALLD---IAEQIGLPVVVSS-ALDTSVGIAAGLAAA 253 (327) T ss_pred --HHCCCCEEEEECCCCCHHHHHHH---HHHHHCCCEEEEC-CCCHHHHHHHHHHHH T ss_conf --60766067771456534999999---9998099859962-412388999999999 No 40 >pfam01188 MR_MLE Mandelate racemase / muconate lactonizing enzyme, C-terminal domain. C-terminal domain is TIM barrel fold, dehydratase-like domain. Manganese is associated with this domain. Probab=94.96 E-value=0.13 Score=32.31 Aligned_cols=69 Identities=17% Similarity=0.243 Sum_probs=48.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHH Q ss_conf 68999999999975385311102343110465320278511114686568789999999876544869977676310423 Q gi|254780670|r 213 ADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWH 292 (424) Q Consensus 213 ~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~ 292 (424) .++-++.+...-+. .|.++.|.+|+- ...|.++-+.+... +++|.+.++|+|+..+|++ T Consensus 28 ~~~d~~~i~~ir~~----~g~~~~l~vD~n----------------~~~~~~~A~~~~~~-l~~~~~~~~EeP~~~~d~~ 86 (98) T pfam01188 28 LADDLARVAAVREA----VGDDVRLRVDAN----------------QAWDVATAIRLARA-LEELGLTWIEEPLPADDIE 86 (98) T ss_pred HHHHHHHHHHHHHH----CCCCCEEEECCC----------------CCCCHHHHHHHHHH-HCCCCCEEEECCCCCCCHH T ss_conf 99999999999997----599987987586----------------54379999999987-3257956997989999999 Q ss_pred HHHHHHHHCC Q ss_conf 7998864238 Q gi|254780670|r 293 GWKILTNKIG 302 (424) Q Consensus 293 gw~~l~~~~g 302 (424) +.++|.++++ T Consensus 87 ~~~~l~~~~~ 96 (98) T pfam01188 87 GLARLRRALD 96 (98) T ss_pred HHHHHHHHCC T ss_conf 9999998578 No 41 >KOG2367 consensus Probab=94.67 E-value=0.46 Score=28.30 Aligned_cols=40 Identities=18% Similarity=0.312 Sum_probs=30.3 Q ss_pred CEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHH Q ss_conf 3685043000021488852100386522342545300389 Q gi|254780670|r 304 NCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVS 343 (424) Q Consensus 304 ~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvs 343 (424) +--|+|+-.|+.-...-|.||.++...=.+++|.-||..- T Consensus 338 ~~pi~G~~~Fth~SGiHqdaIlk~r~tYeiLdPeDiG~~~ 377 (560) T KOG2367 338 HKPIVGANAFTHESGIHQDAILKNRSTYEILDPEDIGMER 377 (560) T ss_pred CCCCCCCEEEEECCCCCHHHHHCCCCCCCCCCHHHCCCCC T ss_conf 7865451357741433377885489764147967838500 No 42 >PRK11382 frlB fructoselysine-6-P-deglycase; Provisional Probab=88.98 E-value=1.3 Score=25.02 Aligned_cols=30 Identities=17% Similarity=0.055 Sum_probs=15.0 Q ss_pred EEECCCCHHHHHHHHHHHHHHHHCCCEEEEE Q ss_conf 3425453003899999999999879869995 Q gi|254780670|r 332 ILIKPNQIGSVSETLNTIEKAQMAGYPSIIS 362 (424) Q Consensus 332 iliK~NQiGTvset~ea~~~a~~~g~~~ivS 362 (424) +++.|+ --+...+..+++..+..|-.+++- T Consensus 269 i~i~~~-d~~~~~~~~~i~evkarg~~vivi 298 (347) T PRK11382 269 LFLLGN-DESRHTTERAINFVKQRTDNVIVI 298 (347) T ss_pred EEEECC-CCHHHHHHHHHHHHHHCCCEEEEE T ss_conf 999548-505899999999999759908996 No 43 >COG1105 FruK Fructose-1-phosphate kinase and related fructose-6-phosphate kinase (PfkB) [Carbohydrate transport and metabolism] Probab=88.31 E-value=2.3 Score=23.13 Aligned_cols=103 Identities=18% Similarity=0.289 Sum_probs=74.4 Q ss_pred CEEECCCCCCCCHHH---HHHHHHHHHHHCCEEEEEC----CCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHH Q ss_conf 511114686568789---9999998765448699776----763104237998864238836850430000214888521 Q gi|254780670|r 251 KYILKGENLELQPNE---MASYLANLVDQYPIYSIED----GMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKG 323 (424) Q Consensus 251 kY~~~~~~~~~t~~e---lid~y~~l~~~yPI~sIED----P~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~g 323 (424) .+.+...+..++.+| +.+.+..++.+..++-+=- .+..+++..|...+++.|.++- +-|.-.-|..+ T Consensus 101 ~Tein~~Gp~is~~~~~~~l~~~~~~l~~~d~VvlsGSlP~g~~~d~y~~li~~~~~~g~~vi------lD~Sg~~L~~~ 174 (310) T COG1105 101 ETEINFPGPEISEAELEQFLEQLKALLESDDIVVLSGSLPPGVPPDAYAELIRILRQQGAKVI------LDTSGEALLAA 174 (310) T ss_pred EEEECCCCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCEEE------EECCHHHHHHH T ss_conf 687048899889999999999999756648899990889999997999999999986598399------97985999998 Q ss_pred HHCCCCCEEEECCCC----------HHHHHHHHHHHHHHHHCCCE-EEEE Q ss_conf 003865223425453----------00389999999999987986-9995 Q gi|254780670|r 324 INEEVANAILIKPNQ----------IGSVSETLNTIEKAQMAGYP-SIIS 362 (424) Q Consensus 324 i~~~~~NailiK~NQ----------iGTvset~ea~~~a~~~g~~-~ivS 362 (424) ++.+ -.|||||- ..++.|.+++++.-...|.. +||| T Consensus 175 L~~~---P~lIKPN~~EL~~~~g~~~~~~~d~~~~a~~l~~~g~~~ViVS 221 (310) T COG1105 175 LEAK---PWLIKPNREELEALFGRELTTLEDVIKAARELLAEGIENVIVS 221 (310) T ss_pred HCCC---CCEEECCHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCEEEEE T ss_conf 7079---9389458899999968988974889999999987799889997 No 44 >PRK06375 consensus Probab=88.05 E-value=2.4 Score=23.02 Aligned_cols=32 Identities=19% Similarity=0.326 Sum_probs=22.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 686568789999999876544869977676310 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) .|..++.+++-++ .+||.+|.++-|+|=.+.+ T Consensus 172 TG~v~s~e~l~~i-~~la~~~~i~iIsDEiY~~ 203 (381) T PRK06375 172 TGKVLSEKEIREL-VDFALENNLYIVSDEIYED 203 (381) T ss_pred CCCCCCHHHHHHH-HHHHHHCEEEEEECCCHHH T ss_conf 8999999999999-9988756589981320575 No 45 >PRK11557 putative DNA-binding transcriptional regulator; Provisional Probab=85.03 E-value=3.4 Score=21.91 Aligned_cols=61 Identities=10% Similarity=0.038 Sum_probs=43.6 Q ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHH--HHHCCCHHEECCC Q ss_conf 223425453003899999999999879869995078887235899899--9819980100756 Q gi|254780670|r 330 NAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLA--VATNCGQIKTGSL 390 (424) Q Consensus 330 NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLA--Vg~~a~~iK~G~~ 390 (424) +-++|=...-|.-.|++++++.|++.|-.+|.=-++..+.=.-+||.. +...-..++.++. T Consensus 180 ~Dv~I~iS~sG~t~~~~~~~~~Ak~~Ga~iIaIT~~~~spLa~~aD~~L~~~~~e~~~~~~~~ 242 (282) T PRK11557 180 DDLLLAISYSGERRELNLAADEALRVGGKVLAITGFTPNALQQRASHCLYTIAEEQATRSAAI 242 (282) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHCCEEEEECCCCCCCCCCCH T ss_conf 998999859999789999999999879939997298998158869999984785310134343 No 46 >cd01539 PBP1_GGBP Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. GGBP is a member of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic GGBP is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=84.12 E-value=1.4 Score=24.84 Aligned_cols=64 Identities=19% Similarity=0.222 Sum_probs=35.5 Q ss_pred CCCHHHHHHHHHHHHHHCC--E---EEEECCCCCCCHHHHHHHHHHC---CCCEEEECCCEEECCHHHHHHH Q ss_conf 5687899999998765448--6---9977676310423799886423---8836850430000214888521 Q gi|254780670|r 260 ELQPNEMASYLANLVDQYP--I---YSIEDGMSEDDWHGWKILTNKI---GSNCQLVGDDLFVTNPERLHKG 323 (424) Q Consensus 260 ~~t~~elid~y~~l~~~yP--I---~sIEDP~~e~D~~gw~~l~~~~---g~~~~IvGDDL~vTn~~rl~~g 323 (424) ..++++=.+....|+.+|| | +..-|+..---...++..-.+. .+++.|+|=|-...-.+.|+.| T Consensus 181 ~w~~~~a~~~~e~~L~~~~d~id~I~a~ND~mAlGA~~Al~~aG~~~~~~~~~I~VvG~Dg~~~~l~aI~~G 252 (303) T cd01539 181 NWDRAQAKDKMDALLLKYGDKIEAVIANNDAMALGAIEALQKYGYNKGDKSKNIPVVGVDALPEALELIKKG 252 (303) T ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHCCCCCCCCCCCCEEEEECCCHHHHHHHHCC T ss_conf 878899999999999858997359997894779999999997598766678886799867999999999839 No 47 >TIGR03586 PseI pseudaminic acid synthase. Probab=82.89 E-value=4.1 Score=21.29 Aligned_cols=115 Identities=16% Similarity=0.159 Sum_probs=59.2 Q ss_pred HHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCC-EEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHH Q ss_conf 789999999876544869977676310423799886423883-6850430000214888521003865223425453003 Q gi|254780670|r 263 PNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSN-CQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGS 341 (424) Q Consensus 263 ~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~-~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGT 341 (424) +.|+..-+.+.|++..|..+-.||++++.+ +..++|-. .-|-..|+ ||...|+.-.+.+ .-++|... ..| T Consensus 75 ~~e~~~~L~~~~k~~gI~f~~t~fd~~s~~----~l~~l~~~~~KIaS~d~--~n~~Li~~iak~~--kpviiStG-~s~ 145 (327) T TIGR03586 75 PWEWHKELFERAKELGLTIFSSPFDETAVD----FLESLDVPAYKIASFEI--TDLPLIRYVAKTG--KPIIMSTG-IAT 145 (327) T ss_pred CHHHHHHHHHHHHHCCCEEEECCCCHHHHH----HHHHCCCCEEEECCHHH--CCHHHHHHHHCCC--CCEEEECC-CCC T ss_conf 899999999999985997997889979999----99870799599894153--6656999986648--86799688-875 Q ss_pred HHHHHHHHHHHHHCCCE-EEEEC-C----------------------------CCCCCHHHHHHHHHHHCCCHHE Q ss_conf 89999999999987986-99950-7----------------------------8887235899899981998010 Q gi|254780670|r 342 VSETLNTIEKAQMAGYP-SIISH-R----------------------------SGETEDHTIADLAVATNCGQIK 386 (424) Q Consensus 342 vset~ea~~~a~~~g~~-~ivSh-R----------------------------SGETeD~~iaDLAVg~~a~~iK 386 (424) +.|.-++++..++.+.. .++=| - |.-|.+...+=.||++||..|- T Consensus 146 ~~EI~~av~~~~~~~~~~i~llhC~s~YPt~~~~~nL~~I~~lk~~f~~~vG~SDHt~g~~~~~~Ava~GA~iIE 220 (327) T TIGR03586 146 LEEIEEAVEAAREAGCKDLVLLKCTSSYPAPLEDANLRTIPDLAERFNVPVGLSDHTLGILAPVAAVALGACVIE 220 (327) T ss_pred HHHHHHHHHHHHHHCCCCEEECCCCCCCCCCHHHCCHHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHCCCEEEE T ss_conf 456788888776502557156525858998937743678999999749975838989982679999986994999 No 48 >TIGR01754 flav_RNR ribonucleotide reductase-associated flavodoxin, putative; InterPro: IPR010088 This entry represents a family of proteins found immediately downstream of ribonucleotide reductase genes in Xyella fastidiosa and some Gram-positive bacteria. It appears to be a highly divergent flavodoxin of the short chain type, more like the flavodoxins of the sulphate-reducing genus Desulfovibrio than like the NifF flavodoxins associated with nitrogen fixation.. Probab=81.14 E-value=4.7 Score=20.85 Aligned_cols=94 Identities=27% Similarity=0.323 Sum_probs=71.4 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCC-HHHHHCHHHHHHCCCCCEEECCCC-----CCCCHHHHHHHHHHHHH---HCCEEEE Q ss_conf 46899999999997538531110-234311046532027851111468-----65687899999998765---4486997 Q gi|254780670|r 212 TADSALDLIANSIEKAGYNAGKD-LLIALDCAASAFFKKEKYILKGEN-----LELQPNEMASYLANLVD---QYPIYSI 282 (424) Q Consensus 212 ~~eeaL~ll~eAi~~aGy~~g~d-i~i~lD~AAsefy~~~kY~~~~~~-----~~~t~~elid~y~~l~~---~yPI~sI 282 (424) +.++.=.+|++-+++-|++. | |..-|..-|..-++.+.|.|-+=| .=.||+||-||..+|.. |=|=+.| T Consensus 13 NT~eVA~~I~~~l~~~G~eV--DW~~~r~~~La~~pldPe~ydL~~LGtwT~~~GrTP~e~KdFiaEl~~liGKP~nvai 90 (145) T TIGR01754 13 NTKEVADIIRDILEKDGHEV--DWVEFRISTLADAPLDPEDYDLYLLGTWTVERGRTPDEMKDFIAELAVLIGKPKNVAI 90 (145) T ss_pred CHHHHHHHHHHHHHHCCCEE--EEEEECCHHHHCCCCCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCEEE T ss_conf 77899999999998479776--7664000464246789986315774312235789966689999999998369982488 Q ss_pred ECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHH Q ss_conf 767631042379988642388368504300002148885210 Q gi|254780670|r 283 EDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGI 324 (424) Q Consensus 283 EDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi 324 (424) = --| .+|--|+|+||--.+|+.+-. T Consensus 91 F----------------GTG-eTQwGG~d~yCgAVdr~a~ff 115 (145) T TIGR01754 91 F----------------GTG-ETQWGGDDYYCGAVDRLAKFF 115 (145) T ss_pred E----------------CCC-CCCCCCCCCCHHHHHHHHHHH T ss_conf 6----------------588-755288654024788878650 No 49 >COG1415 Uncharacterized conserved protein [Function unknown] Probab=79.64 E-value=1.3 Score=24.99 Aligned_cols=72 Identities=29% Similarity=0.403 Sum_probs=36.9 Q ss_pred CCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHC-CCCCCCC--CCCCCCCCCCCHHHHHHHHHHHHHHC Q ss_conf 776610256514887748999899999999999875200-0013676--55764488684689999999999753 Q gi|254780670|r 156 NALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSK-GYSTNVG--DEGGFSPNLKTADSALDLIANSIEKA 227 (424) Q Consensus 156 ~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~-g~~t~vg--DEGGfap~~~~~eeaL~ll~eAi~~a 227 (424) +..|||||+++|=-.++.--||...+||.+..+.--++- .++..+| |-=-|..+...-+++++++.++++++ T Consensus 273 ~p~Df~elLl~~GiGpstvRALalVAEvIyg~~~s~rDP~~yafA~GGKDgvP~pV~~~~yde~I~~l~~~ve~a 347 (373) T COG1415 273 NPDDFEELLLVPGIGPSTVRALALVAEVIYGEPPSFRDPVKYAFAVGGKDGVPFPVNRKTYDELIEFLEELVEKA 347 (373) T ss_pred CCCCHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 832499987406878899999999999980899886770464333058789987755302999999999999864 No 50 >COG1441 MenC O-succinylbenzoate synthase [Coenzyme metabolism] Probab=78.95 E-value=3.4 Score=21.87 Aligned_cols=81 Identities=19% Similarity=0.275 Sum_probs=49.2 Q ss_pred HHCCEEEEECCCCCCC-HHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHH Q ss_conf 5448699776763104-237998864238836850430000214888521003865223425453003899999999999 Q gi|254780670|r 275 DQYPIYSIEDGMSEDD-WHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQ 353 (424) Q Consensus 275 ~~yPI~sIEDP~~e~D-~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~ 353 (424) .+--|-++|.|..-.+ .- .|....| +-|.=|. .+..++.-..+ +.-..+++|||.-+|++-..-+-++.|+ T Consensus 183 ~R~RIaFLEEPCkt~aeSr---~Fa~eTg--IAIAWDE-s~readF~~e~--e~gv~avVIKPTL~GSl~r~~eli~qAh 254 (321) T COG1441 183 YRSRIAFLEEPCKTRAESR---AFARETG--IAIAWDE-SLREADFAFEA--EPGVRAVVIKPTLTGSLQRVRELVQQAH 254 (321) T ss_pred HHHHHHHHHCCCCCHHHHH---HHHHHCC--EEEEECC-HHCCCCCCCCC--CCCCEEEEECCCCHHHHHHHHHHHHHHH T ss_conf 8878998756447857888---8888628--0686000-10155620036--8872489963310125999999999998 Q ss_pred HCCCEEEEEC Q ss_conf 8798699950 Q gi|254780670|r 354 MAGYPSIISH 363 (424) Q Consensus 354 ~~g~~~ivSh 363 (424) ..|...+||. T Consensus 255 ~lGl~AVISS 264 (321) T COG1441 255 ALGLTAVISS 264 (321) T ss_pred HCCCEEEEEC T ss_conf 6285167560 No 51 >PRK06107 aspartate aminotransferase; Provisional Probab=78.46 E-value=5.7 Score=20.26 Aligned_cols=31 Identities=10% Similarity=0.247 Sum_probs=19.0 Q ss_pred CCCCCCHHHHHHHHHHHHHHC-CEEEEECCCCC Q ss_conf 686568789999999876544-86997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQY-PIYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~y-PI~sIEDP~~e 288 (424) .|..++.+++..+ .+++.+| .+.-|+|-.+. T Consensus 179 TG~v~s~~~l~~i-~~i~~~~~~i~iisDEiY~ 210 (402) T PRK06107 179 TGAAYSRDELRAL-ADVLLRHPHVLVLTDDIYD 210 (402) T ss_pred CCCCCCHHHHHHH-HHHHHHCCCCEEEEEHHHH T ss_conf 9966534899999-8877524893286222025 No 52 >PRK09271 flavodoxin; Provisional Probab=76.85 E-value=2.5 Score=22.84 Aligned_cols=112 Identities=16% Similarity=0.187 Sum_probs=66.5 Q ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCCC-CCHHHH-HCHHHHHH--CC---CCCEEECCCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 448868468999999999975385311-102343-11046532--02---785111146865687899999998765448 Q gi|254780670|r 206 FSPNLKTADSALDLIANSIEKAGYNAG-KDLLIA-LDCAASAF--FK---KEKYILKGENLELQPNEMASYLANLVDQYP 278 (424) Q Consensus 206 fap~~~~~eeaL~ll~eAi~~aGy~~g-~di~i~-lD~AAsef--y~---~~kY~~~~~~~~~t~~elid~y~~l~~~yP 278 (424) |+..--++|+.=++|.++++..|++.- .++-+. +.-++..| |+ =+.|-+ ..| -.|+||.|||.+|-+ T Consensus 7 YaS~TGNTE~vA~~I~~~l~~~G~eV~~~e~d~~~~~~~~~d~~~yDl~llG~yTw-gdG--~lPdE~~DF~e~L~~--- 80 (160) T PRK09271 7 YASLSGNTREVARMIAARCEEAGHAVTWVETDLQTLAQAPLDPEEYDLFLLGTWTD-NAG--RTPPEMKRFIAELVE--- 80 (160) T ss_pred EECCCCCHHHHHHHHHHHHHHCCCEEEEEEECCHHHHHCCCCCCCCCEEEEECCCC-CCC--CCCHHHHHHHHHHHH--- T ss_conf 98488768999999999999769823798701023220335611188899965704-899--896678999999986--- Q ss_pred EEEEECCCCCCCHHHHHHHHHHC--CCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHH Q ss_conf 69977676310423799886423--883685043000021488852100386522342545300 Q gi|254780670|r 279 IYSIEDGMSEDDWHGWKILTNKI--GSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIG 340 (424) Q Consensus 279 I~sIEDP~~e~D~~gw~~l~~~~--g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiG 340 (424) .|+.| +...-+ |+..+ |+|.||-...++.+.+....+ .+|.+|.- T Consensus 81 ----------~dl~g--k~~avFGsGDt~y--~~e~FC~AVd~~~~~f~~~~~---~LkIE~~P 127 (160) T PRK09271 81 ----------TIGKP--PNVAVFGTGETQW--GEEYYCGAVHRMARFFGSSYP---RLKIEQMP 127 (160) T ss_pred ----------HCCCC--CEEEEECCCCCCC--CCHHHHHHHHHHHHHHCCCCC---CEEEECCC T ss_conf ----------04568--7699987777765--644788889999999758887---43453499 No 53 >cd05710 SIS_1 A subgroup of the SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars. Probab=75.98 E-value=2.2 Score=23.31 Aligned_cols=55 Identities=16% Similarity=0.201 Sum_probs=43.7 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCC Q ss_conf 5223425453003899999999999879869995078887235899899981998 Q gi|254780670|r 329 ANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCG 383 (424) Q Consensus 329 ~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~ 383 (424) -|.++|=..|-|+-.|++++++.||+.|..+|-=-...++.=.-.+|..+-+|++ T Consensus 47 ~~~lvi~iS~SG~T~e~i~a~~~ak~~g~~tiaiT~~~~S~la~~aD~~i~~g~e 101 (120) T cd05710 47 EKSVVILASHSGNTKETVAAAKFAKEKGATVIGLTDDEDSPLAKLADYVIVYGFE 101 (120) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHCCEEEECCCC T ss_conf 9969999837989789999999999869959999898989879967988888996 No 54 >PRK06575 consensus Probab=75.38 E-value=6.8 Score=19.67 Aligned_cols=30 Identities=17% Similarity=0.400 Sum_probs=19.3 Q ss_pred CCCCCCHHHHHHHHHHHHHHCC-EEEEECCCC Q ss_conf 6865687899999998765448-699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYP-IYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yP-I~sIEDP~~ 287 (424) .|...+.+||-+. .++|.+|+ |.-|.|=.+ T Consensus 177 TG~v~~~eel~~i-~~~~~k~~~v~iisDEiY 207 (399) T PRK06575 177 TGASYNFEELENI-AKVLRKYPHVNVMSDDIY 207 (399) T ss_pred CCCCCCHHHHHHH-HHHHHHCCCEEEECHHHH T ss_conf 8887889999999-999874587378421356 No 55 >pfam04748 Polysacc_deac_2 Divergent polysaccharide deacetylase. This family is divergently related to pfam01522 (personal obs:Yeats C). Probab=75.27 E-value=6.9 Score=19.65 Aligned_cols=135 Identities=13% Similarity=0.180 Sum_probs=61.8 Q ss_pred CCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCC-EEEEECCCC---CCCHHHHHHHHHHCCCCEEEE Q ss_conf 1023431104653202785111146865687899999998765448-699776763---104237998864238836850 Q gi|254780670|r 233 KDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYP-IYSIEDGMS---EDDWHGWKILTNKIGSNCQLV 308 (424) Q Consensus 233 ~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yP-I~sIEDP~~---e~D~~gw~~l~~~~g~~~~Iv 308 (424) .++-|-+-..+....+.|...+. ...+.+|....+....+++| .+.+-+..- -.|-+....+.+.+..+=... T Consensus 46 ~EvllhlPMe~~~~~~~gp~~L~---~~~~~~~i~~~l~~~l~~~p~avGvnNhmGS~~t~~~~~m~~vl~~l~~~gl~f 122 (213) T pfam04748 46 HEVLLHLPMEPLGYKDPGPGTLT---VGMSAEEIEKRLEAALSRVPYAVGVNNHMGSRFTADRAAMRWVMEELKKRGLFF 122 (213) T ss_pred CEEEEECCCCCCCCCCCCCCCCC---CCCCHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEE T ss_conf 94999745663468999977588---899999999999999987888489954667554169999999999998779889 Q ss_pred CCCEEECC--HHHHHHHHHCC-CCCEEEECC-CCHHHHHH-HHHHHHHHHHCCCEEEEECCCCCCCH Q ss_conf 43000021--48885210038-652234254-53003899-99999999987986999507888723 Q gi|254780670|r 309 GDDLFVTN--PERLHKGINEE-VANAILIKP-NQIGSVSE-TLNTIEKAQMAGYPSIISHRSGETED 370 (424) Q Consensus 309 GDDL~vTn--~~rl~~gi~~~-~~NailiK~-NQiGTvse-t~ea~~~a~~~g~~~ivShRSGETeD 370 (424) =|+.+..+ ..++.+...-. +.+-+.|.. .....+.+ ..++.+.|+++|+.++|.|-.-+|-+ T Consensus 123 vDS~Tt~~S~a~~~A~~~gvp~~~rdvfLD~~~~~~~I~~ql~~~~~~A~~~G~aI~Igh~~p~Tl~ 189 (213) T pfam04748 123 VDSRTSGRSVAPKLAKELGVPAAKRDVFLDNEQTEAAIRRQLDQAAALARKEGSAIAIGHPHPETIA 189 (213) T ss_pred EECCCCCCCHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCEEEEECCCHHHHH T ss_conf 9147776658999999839986761031479999999999999999999863957999779989999 No 56 >cd06319 PBP1_ABC_sugar_binding_like_10 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=74.13 E-value=7.3 Score=19.46 Aligned_cols=102 Identities=18% Similarity=0.154 Sum_probs=54.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCC-EEEEECCCCCCCH Q ss_conf 689999999999753853111023431104653202785111146865687899999998765448-6997767631042 Q gi|254780670|r 213 ADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYP-IYSIEDGMSEDDW 291 (424) Q Consensus 213 ~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~ 291 (424) ..+-++-..+++++.|+.. +.+ . .....+.++-.+...+|+..|| +..|- .-+|. T Consensus 140 ~~~R~~Gf~~~l~~~~~~~---~~~----------------~--~~~~~~~~~~~~~~~~~L~~~p~i~ai~---~~~d~ 195 (277) T cd06319 140 GQKRTKGFKEAMKEAGCDL---AGI----------------R--QQKDFSYQETFDYTNDLLTANPDIRAIW---LQGSD 195 (277) T ss_pred HHHHHHHHHHHHHHCCCCE---EEE----------------E--EECCCCHHHHHHHHHHHHHHCCCCCEEE---ECCCH T ss_conf 9999999999998759976---799----------------9--6089999999999999997389976899---66988 Q ss_pred H--HHHHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEE-EECCCCHHHH Q ss_conf 3--7998864238--836850430000214888521003865223-4254530038 Q gi|254780670|r 292 H--GWKILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAI-LIKPNQIGSV 342 (424) Q Consensus 292 ~--gw~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~Nai-liK~NQiGTv 342 (424) - |-..-.+..| +++.|+|=|......+.++.| ...+. .-.|-.+|-. T Consensus 196 ~a~Ga~~a~~~~G~~~~i~vvg~D~~~~~~~~i~~G----~~~~tv~q~p~~~G~~ 247 (277) T cd06319 196 RYQGALDAIATAGKTGKVLLICFDAEPEFIELLKSG----ALVGAGMQQPFLMGER 247 (277) T ss_pred HHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHCC----CCEEEEECCHHHHHHH T ss_conf 999999999975999985899637979999998759----9369996499999999 No 57 >PRK02271 methylenetetrahydromethanopterin reductase; Provisional Probab=74.10 E-value=7.3 Score=19.45 Aligned_cols=30 Identities=23% Similarity=0.357 Sum_probs=23.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCEEEEECC-CCC Q ss_conf 300389999999999987986999507-888 Q gi|254780670|r 338 QIGSVSETLNTIEKAQMAGYPSIISHR-SGE 367 (424) Q Consensus 338 QiGTvset~ea~~~a~~~g~~~ivShR-SGE 367 (424) =+||.-+.++-++.-++.|...++.+- .|. T Consensus 280 l~Gtpe~v~~~l~~~~d~Gvd~i~~~~~~g~ 310 (324) T PRK02271 280 IAGTPDDVVEKIEDLLEAGVTQIVAGSPIGP 310 (324) T ss_pred EECCHHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 8789999999999999669988998589997 No 58 >cd03242 ABC_RecF RecF is a recombinational DNA repair ATPase that maintains replication in the presence of DNA damage. When replication is prematurely disrupted by DNA damage, several recF pathway gene products play critical roles processing the arrested replication fork, allowing it to resume and complete its task. This CD represents the nucleotide binding domain of RecF. RecF belongs to a large superfamily of ABC transporters involved in the transport of a wide variety of different compounds including sugars, ions, peptides, and more complex organic molecules. The nucleotide binding domain shows the highest similarity between all members of the family. ABC transporters are a subset of nucleotide hydrolases with a signature motif, Q-loop, and H-loop/switch region, in addition to, the Walker A motif/P-loop and Walker B motif commonly found in a number of ATP- and GTP-binding and hydrolyzing proteins. Probab=74.02 E-value=7.1 Score=19.54 Aligned_cols=55 Identities=16% Similarity=0.226 Sum_probs=41.7 Q ss_pred HHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHH Q ss_conf 9999998765448699776763104237998864238836850430000214888521 Q gi|254780670|r 266 MASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKG 323 (424) Q Consensus 266 lid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~g 323 (424) .++++.+-..++||+-+-|.++|=|-.-+..+...+.+..|+. +++|+...+... T Consensus 200 q~~~~~~~~~~~PilLlDDi~seLD~~r~~~l~~~l~~~~Qvf---iT~t~~~~~~~~ 254 (270) T cd03242 200 EIQLIKEVSGEYPVLLLDDVLAELDLGRQAALLDAIEGRVQTF---VTTTDLADFDAL 254 (270) T ss_pred HHHHHHHHHCCCCEEEECCHHHHHCHHHHHHHHHHHCCCCCEE---EECCCHHHHHHH T ss_conf 9999999869997899723155409999999999870189799---975785774543 No 59 >pfam00289 CPSase_L_chain Carbamoyl-phosphate synthase L chain, N-terminal domain. Carbamoyl-phosphate synthase catalyses the ATP-dependent synthesis of carbamyl-phosphate from glutamine or ammonia and bicarbonate. This important enzyme initiates both the urea cycle and the biosynthesis of arginine and/or pyrimidines. The carbamoyl-phosphate synthase (CPS) enzyme in prokaryotes is a heterodimer of a small and large chain. The small chain promotes the hydrolysis of glutamine to ammonia, which is used by the large chain to synthesize carbamoyl phosphate. See pfam00988. The small chain has a GATase domain in the carboxyl terminus. See pfam00117. Probab=73.77 E-value=7.4 Score=19.40 Aligned_cols=71 Identities=20% Similarity=0.239 Sum_probs=52.8 Q ss_pred HHHHHHHHHHCCCEEEEECCCCCC--CHHHHHHHHHHHCCCHHEECCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHC Q ss_conf 999999999879869995078887--235899899981998010075611448999999999999831104557625304 Q gi|254780670|r 345 TLNTIEKAQMAGYPSIISHRSGET--EDHTIADLAVATNCGQIKTGSLARSDRIAKYNQLIRIEESLGKQAKFAGRSILI 422 (424) Q Consensus 345 t~ea~~~a~~~g~~~ivShRSGET--eD~~iaDLAVg~~a~~iK~G~~~R~ER~aKyNrLlrIee~Lg~~a~~~g~~~~~ 422 (424) ++..++.+++.|+.+++=+..-|+ .|.-.+|-++-++ |++. .|.-.-..++++|-++-|-.+.++|+-|+. T Consensus 13 A~ri~rt~re~gi~~v~i~s~~d~~s~~~~~ad~~~~~~------~~~~-~~~Yl~~~~Ii~ia~~~~~~aihpGyGfls 85 (109) T pfam00289 13 AVRIIRALRELGIETVAVNSNPDTVSTHVRLADEAYFLG------PGPA-SESYLNIERILDIAEKEGADAIHPGYGFLS 85 (109) T ss_pred HHHHHHHHHHCCCCEEEEECHHHCCCHHHHHHHHHHHCC------CCCC-CCCCCCHHHHHHHHHHHCCCEEECCCCCCC T ss_conf 999999999869979999633441522566505765347------9983-211137999999999818896877976233 No 60 >cd05008 SIS_GlmS_GlmD_1 SIS (Sugar ISomerase) domain repeat 1 found in Glucosamine 6-phosphate synthase (GlmS) and Glucosamine-6-phosphate deaminase (GlmD). The SIS domain is found in many phosphosugar isomerases and phosphosugar binding proteins. GlmS contains a N-terminal glutaminase domain and two C-terminal SIS domains and catalyzes the first step in hexosamine metabolism, converting fructose 6-phosphate into glucosamine 6-phosphate using glutamine as nitrogen source. The glutaminase domain hydrolyzes glutamine to glutamate and ammonia. Ammonia is transferred through a channel to the isomerase domain for glucosamine 6-phosphate synthesis. The end product of the pathway is N-acetylglucosamine, which plays multiple roles in eukaryotic cells including being a building block of bacterial and fungal cell walls. In the absence of glutamine, GlmS catalyzes the isomerization of fructose 6-phosphate into glucose 6- phosphate (PGI-like activity). Glucosamine-6-phosphate deaminase (GlmD) cont Probab=73.71 E-value=2.1 Score=23.39 Aligned_cols=65 Identities=15% Similarity=0.181 Sum_probs=54.5 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHH Q ss_conf 52234254530038999999999998798699950788872358998999819980100756114 Q gi|254780670|r 329 ANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARS 393 (424) Q Consensus 329 ~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~ 393 (424) -|.++|=..|-|.-+|++++++.|++.|..+|.=....++.=.-.+|..+-++++-=+.+....+ T Consensus 46 ~~~lvI~iS~SG~t~e~i~a~~~a~~~g~~~i~iT~~~~S~la~~ad~~i~~~~g~E~~~~~Tks 110 (126) T cd05008 46 EDTLVIAISQSGETADTLAALRLAKEKGAKTVAITNVVGSTLAREADYVLYLRAGPEISVAATKA 110 (126) T ss_pred CCCEEEEECCCCCCCCHHHHHHHHHHCCCCEEEEECCCCCHHHHHCCEEEECCCCCEEEEECCHH T ss_conf 98599998689798007889999998299489870699997899689879869877678708699 No 61 >smart00475 53EXOc 5'-3' exonuclease. Probab=72.70 E-value=7.8 Score=19.22 Aligned_cols=94 Identities=21% Similarity=0.277 Sum_probs=58.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCE-EECCCCCCCCHHHH---HHHHHHHHHHCCEEEEECCCCC Q ss_conf 6899999999997538531110234311046532027851-11146865687899---9999987654486997767631 Q gi|254780670|r 213 ADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKY-ILKGENLELQPNEM---ASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 213 ~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY-~~~~~~~~~t~~el---id~y~~l~~~yPI~sIEDP~~e 288 (424) ..--+..|...++. |.| ..+.++.|...+.| ..+-| .|+ .++...++++ +.+..++++.++|-.++-|-.| T Consensus 33 i~gf~~~l~~l~~~--~~p-~~i~v~fD~~~~~~-R~~i~p~YK-anR~~~p~~l~~q~~~i~~~l~~~gi~~~~~~g~E 107 (259) T smart00475 33 VYGFLRMLLKLIKE--EKP-TYVAVVFDAKGKTF-RHELYPEYK-ANRPKTPDELLEQIPLIKELLDALGIPVLEVEGYE 107 (259) T ss_pred HHHHHHHHHHHHHH--CCC-CEEEEEEECCCCCC-HHHHCHHHH-CCCCCCCHHHHHHHHHHHHHHHHCCCCEEECCCCC T ss_conf 99999999999987--699-77999981899865-144455553-28999978999999999999997699876238854 Q ss_pred -CCHHHH-HHHHHHCCCCEEEECCC Q ss_conf -042379-98864238836850430 Q gi|254780670|r 289 -DDWHGW-KILTNKIGSNCQLVGDD 311 (424) Q Consensus 289 -~D~~gw-~~l~~~~g~~~~IvGDD 311 (424) ||.-++ .+.....+..+.||.-| T Consensus 108 ADDvIatla~~~~~~~~~v~IvS~D 132 (259) T smart00475 108 ADDVIATLAKKAEAEGYEVRIVSGD 132 (259) T ss_pred HHHHHHHHHHHHHHCCCEEEEECCC T ss_conf 7999999999999869926995088 No 62 >cd06312 PBP1_ABC_sugar_binding_like_4 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=72.45 E-value=8 Score=19.18 Aligned_cols=76 Identities=21% Similarity=0.255 Sum_probs=45.5 Q ss_pred CHHHHHHHHHHHHHHCC-EEEEECCCCCCCHH--HHHHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEEEECC Q ss_conf 87899999998765448-69977676310423--7998864238--8368504300002148885210038652234254 Q gi|254780670|r 262 QPNEMASYLANLVDQYP-IYSIEDGMSEDDWH--GWKILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAILIKP 336 (424) Q Consensus 262 t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~--gw~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~ 336 (424) +..+-.+...+++..|| |-.| |.-+|.- |-.+..+..| +++.|+|=|......+.|++|. ..-.+.=.| T Consensus 166 d~~~a~~~~~~~L~~~pdi~~I---~~~nd~~a~ga~~Al~~ag~~~~i~vvg~D~~~~~~~~i~~G~---i~~tv~Q~p 239 (271) T cd06312 166 DPTEVASRIAAYLRANPDVDAV---LTLGAPSAAPAAKALKQAGLKGKVKLGGFDLSPATLQAIKAGY---IQFAIDQQP 239 (271) T ss_pred CHHHHHHHHHHHHHHCCCCCEE---EECCCHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHCCC---CEEEEECCH T ss_conf 9999999999999849898889---9889678999999999669999859999889999999998699---559995899 Q ss_pred CCHHHHH Q ss_conf 5300389 Q gi|254780670|r 337 NQIGSVS 343 (424) Q Consensus 337 NQiGTvs 343 (424) -+.|-++ T Consensus 240 ~~~Gy~a 246 (271) T cd06312 240 YLQGYLP 246 (271) T ss_pred HHHHHHH T ss_conf 9999999 No 63 >PRK08069 consensus Probab=71.04 E-value=8.5 Score=18.96 Aligned_cols=49 Identities=14% Similarity=0.198 Sum_probs=30.0 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC------CCHHHHHHHHHHCCCCEE Q ss_conf 68656878999999987654486997767631------042379988642388368 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE------DDWHGWKILTNKIGSNCQ 306 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e------~D~~gw~~l~~~~g~~~~ 306 (424) .|..++.+++-++ .++|.+|.++-|+|=.+. ..+..+..+...+.+++. T Consensus 176 TG~v~s~e~l~~i-~~la~~~~~~vi~DEiY~~l~~~~~~~~s~~~~~~~~~~~~i 230 (390) T PRK08069 176 TGMIYTKEELAAL-GEVCLEHDILIVSDEIYEKLIYGGAEHVSIAQLSNELKEQTI 230 (390) T ss_pred CCCCCCCHHHHHH-HHHHHHCCEEEEEEHHHHHHCCCCCCCCCHHHHCCCCCCCCC T ss_conf 6816550689999-999875630598533236540699898898994954345641 No 64 >cd05017 SIS_PGI_PMI_1 The members of this protein family contain the SIS (Sugar ISomerase) domain and have both the phosphoglucose isomerase (PGI) and the phosphomannose isomerase (PMI) functions. These functions catalyze the reversible reactions of glucose 6-phosphate to fructose 6-phosphate, and mannose 6-phosphate to fructose 6-phosphate, respectively at an equal rate. This protein contains two SIS domains. This alignment is based on the first SIS domain. Probab=70.76 E-value=5.7 Score=20.26 Aligned_cols=71 Identities=17% Similarity=0.156 Sum_probs=52.8 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHHHHHHHHHHHHHH Q ss_conf 52234254530038999999999998798699950788872358998999819980100756114489999999999 Q gi|254780670|r 329 ANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARSDRIAKYNQLIRI 405 (424) Q Consensus 329 ~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~ER~aKyNrLlrI 405 (424) .+.++|-...-|.-.||+.+++.|++.|-.+|.=.. +--+.++|---+..+++.-.-. .-|.+=..-+..+ T Consensus 43 ~~~lvi~~S~SGnTeEtl~~~~~a~~~ga~vi~its-----GG~L~~~a~~~~~p~v~iP~g~-~PR~a~g~~~~~l 113 (119) T cd05017 43 RKTLVIAVSYSGNTEETLSAVEQAKERGAKIVAITS-----GGKLLEMAREHGVPVIIIPKGL-QPRAAFPYLFTAL 113 (119) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEECC-----CCHHHHHHHHCCCCEEECCCCC-CCHHHHHHHHHHH T ss_conf 787899992897968999999999985990999849-----9559999988799989879999-9579999999999 No 65 >TIGR03600 phage_DnaB phage replicative helicase, DnaB family, HK022 subfamily. Members of this family are phage (or prophage-region) homologs of the bacterial homohexameric replicative helicase DnaB. Some phage may rely on host DnaB, while others encode their own verions. This model describes the largest phage-specific clade among the close homologs of DnaB, but there are, or course, other DnaB homologs from phage that fall outside the scope of this model. Probab=70.58 E-value=8.7 Score=18.89 Aligned_cols=140 Identities=19% Similarity=0.270 Sum_probs=73.6 Q ss_pred HHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECC-CCCCCHHHHHHHHHHCCCCEEEECCC--- Q ss_conf 34311046532027851111468656878999999987654486997767-63104237998864238836850430--- Q gi|254780670|r 236 LIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDG-MSEDDWHGWKILTNKIGSNCQLVGDD--- 311 (424) Q Consensus 236 ~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP-~~e~D~~gw~~l~~~~g~~~~IvGDD--- 311 (424) .+++++|.+---+.++-++-+ ..+.+.+|+..-+..-...-|.-.|..| +.++||+-......++.+. .+.=|| T Consensus 209 afalnia~n~A~~~g~~Vl~f-SLEMs~eql~~R~la~~s~i~~~~i~~g~l~~~~~~~~~~a~~~l~~~-~l~i~d~~~ 286 (421) T TIGR03600 209 TLALNIAENVALREGKPVLFF-SLEMSAEQLGERLLASKSGINTGNIRTGRFNDSDFNRLLNAVDRLSEK-DLYIDDTGG 286 (421) T ss_pred HHHHHHHHHHHHHCCCEEEEE-ECCCCHHHHHHHHHHHHCCCCHHHHHCCCCCHHHHHHHHHHHHHHHCC-CEEEECCCC T ss_conf 999999999998669838999-257999999999999854897766652899989999999999986168-789966998 Q ss_pred EEECCHH-HHHHHH-HCCCCCEEEE------C-------CCCHHHHHHHHHHHHHHHHCCCEEEEEC---CCCCC-CH-- Q ss_conf 0002148-885210-0386522342------5-------4530038999999999998798699950---78887-23-- Q gi|254780670|r 312 LFVTNPE-RLHKGI-NEEVANAILI------K-------PNQIGSVSETLNTIEKAQMAGYPSIISH---RSGET-ED-- 370 (424) Q Consensus 312 L~vTn~~-rl~~gi-~~~~~Naili------K-------~NQiGTvset~ea~~~a~~~g~~~ivSh---RSGET-eD-- 370 (424) +.+.+.+ ++++-. +.+-...++| + ..+++.+|+.+. .+|++.+.-+|.-+ |+.|. .| T Consensus 287 ~ti~~ir~~~r~~~~~~~~l~livIDYLqLi~~~~~~~r~~ei~~Isr~LK--~lAkel~ipVi~lsQLnR~~e~R~dkr 364 (421) T TIGR03600 287 LTVAQIRSIARRIKRKKGGLDLIVVDYIQLMAPTRGRDRNEELGGISRGLK--ALAKELDVPVVLLAQLNRGSEKRTDKR 364 (421) T ss_pred CCHHHHHHHHHHHHHHCCCCCEEEECHHHHCCCCCCCCHHHHHHHHHHHHH--HHHHHHCCCEEEECCCCCCCCCCCCCC T ss_conf 876789999999998628986999757865378888888999999999999--999997997899705786523368888 Q ss_pred HHHHHHHHH Q ss_conf 589989998 Q gi|254780670|r 371 HTIADLAVA 379 (424) Q Consensus 371 ~~iaDLAVg 379 (424) ..++||.=. T Consensus 365 P~lsDLReS 373 (421) T TIGR03600 365 PIMSDLRDS 373 (421) T ss_pred CCHHHHHHC T ss_conf 654344433 No 66 >PRK07550 hypothetical protein; Provisional Probab=70.01 E-value=8.9 Score=18.80 Aligned_cols=31 Identities=13% Similarity=0.124 Sum_probs=22.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 68656878999999987654486997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) .|..++.+++-++ .+|+.+|.++-|+|=.+. T Consensus 177 tG~v~s~e~l~~l-~~la~~~~~~iI~DEiY~ 207 (387) T PRK07550 177 TGVVYPPELLHEL-YDLARRRGLALILDETYR 207 (387) T ss_pred CCCCCCHHHHHHH-HHHCCCEEEEEEECCCHH T ss_conf 8851077985346-543033158999417578 No 67 >pfam04017 DUF366 Domain of unknown function (DUF366). Archaeal domain of unknown function. Probab=69.81 E-value=5.3 Score=20.46 Aligned_cols=93 Identities=18% Similarity=0.196 Sum_probs=53.1 Q ss_pred HHHHHHCCCCCEEECCCC-CCCCHHHHHHHHHHHHHHCCEEE------EECCCCCCCHHHHHHH-----------HHHCC Q ss_conf 046532027851111468-65687899999998765448699------7767631042379988-----------64238 Q gi|254780670|r 241 CAASAFFKKEKYILKGEN-LELQPNEMASYLANLVDQYPIYS------IEDGMSEDDWHGWKIL-----------TNKIG 302 (424) Q Consensus 241 ~AAsefy~~~kY~~~~~~-~~~t~~elid~y~~l~~~yPI~s------IEDP~~e~D~~gw~~l-----------~~~~g 302 (424) +|=..|-..+.-...+-+ ...+.+.|+|+ +++..+.||.| |-.-|+.-|...--.+ -...| T Consensus 21 wAy~~f~~~~dsIV~Frgp~~V~~e~miDl-eD~~~~~~I~S~dmi~FivE~fd~~dl~~a~~rQRLl~~i~~evl~~~g 99 (183) T pfam04017 21 WAYRNFGIQGDSIVVFRGPCDVKIEHMVDL-EDVREEKEIKSEDMLHFIVEHFDSTDLKLAYLRQRLLVAIAKEVLEDYG 99 (183) T ss_pred HHHHHCCCCCCCEEEEECCCCCCHHHCCCH-HHHHCCCCCCCHHHEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 788861867685899966615168866558-8850478655710104655427986688999999999999999999729 Q ss_pred CCEEEECCCEEECCHHHHHHHHHCCCCCEEEEC Q ss_conf 836850430000214888521003865223425 Q gi|254780670|r 303 SNCQLVGDDLFVTNPERLHKGINEEVANAILIK 335 (424) Q Consensus 303 ~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK 335 (424) .++.--|||||+-+. .+.-.|.....++.+|- T Consensus 100 ~~~~r~GDDlyv~~~-KlSVSIA~~s~vs~kiH 131 (183) T pfam04017 100 VKLTRDGDDLYVNGK-KLSVSIATASPVSMKIH 131 (183) T ss_pred CCCEECCCEEEECCC-EEEEEEEECCCCCEEEE T ss_conf 860655781687596-78999996488715899 No 68 >cd06306 PBP1_TorT-like TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. TorT-like proteins, a periplasmic binding protein family that activates induction of the Tor respiratory system upon trimethylamine N-oxide (TMAO) electron-acceptor binding in bacteria. The Tor respiratory system is consists of three proteins (TorC, TorA, and TorD) and is induced in the presence of TMAO. The TMAO control is tightly regulated by three proteins: TorS, TorT, and TorR. Thus, the disruption of any of these proteins can abolish the Tor respiratory induction. TorT shares homology with the sugar-binding domain of the type I periplasmic binding proteins. The members of TorT-like family bind TMAO or related compounds and are predicted to be involved in signal transduction and/or substrate transport. Probab=69.54 E-value=8.9 Score=18.82 Aligned_cols=76 Identities=13% Similarity=0.012 Sum_probs=41.1 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHHHHHHHHHH--CCCCEEEECCCEEECCHHHHHHHHHCCCCCEEE-ECC Q ss_conf 687899999998765448-6997767631042379988642--388368504300002148885210038652234-254 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDWHGWKILTNK--IGSNCQLVGDDLFVTNPERLHKGINEEVANAIL-IKP 336 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~gw~~l~~~--~g~~~~IvGDDL~vTn~~rl~~gi~~~~~Nail-iK~ 336 (424) .+.+.-.+.-.+|+..+| |-.|-. ....-.+=...... ..+++.|+|.|......+.+++| .-.+.+ -.| T Consensus 166 ~~~~~a~~~~~~~L~~~pdi~~i~~--~~~~a~~a~~a~~~~g~~~~v~vvg~d~~~~~~~~i~~G----~i~atv~Q~p 239 (268) T cd06306 166 TGKEVQRKLVEEALEAHPDIDYIVG--SAVAAEAAVGILRQRGLTDQIKIVSTYLSHAVYRGLKRG----KILAAPTDSM 239 (268) T ss_pred CCHHHHHHHHHHHHHHCCCCCEEEE--CCHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHCC----CEEEEEECCH T ss_conf 7499999999999862999889995--746799999999860779975999936998999999859----8379995699 Q ss_pred CCHHHH Q ss_conf 530038 Q gi|254780670|r 337 NQIGSV 342 (424) Q Consensus 337 NQiGTv 342 (424) -++|-+ T Consensus 240 ~~~G~~ 245 (268) T cd06306 240 VLQGRL 245 (268) T ss_pred HHHHHH T ss_conf 999999 No 69 >PRK08960 hypothetical protein; Provisional Probab=68.55 E-value=9.5 Score=18.59 Aligned_cols=30 Identities=13% Similarity=0.089 Sum_probs=20.6 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-++ .++|.+|.++-|+|=.+ T Consensus 178 TG~v~s~~~l~~l-~~~a~~~~~~vI~DEiY 207 (387) T PRK08960 178 TGTLLSRDELAAL-SQALKARGGHLVVDEIY 207 (387) T ss_pred CCCCCCHHHHHHH-HHHHHHCCCEEEEECCC T ss_conf 8631468999999-99876239779984461 No 70 >TIGR03471 HpnJ hopanoid biosynthesis associated radical SAM protein HpnJ. One of the well-described hopanoid intermediates is bacteriohopanetetrol. In the conversion from hopene several reactions must occur in the side chain for which a radical mechanism might be reasonable. These include the four (presumably anaerobic) hydroxylations and a methyl shift. Probab=68.00 E-value=9.8 Score=18.52 Aligned_cols=125 Identities=16% Similarity=0.178 Sum_probs=79.2 Q ss_pred CCCCHHHHHHHHHHHHHHCC-E--EEEECCCCCCCHHHHHHHHHHCCC-----CEEEECCCEEECCHHHHHHHHHCCCCC Q ss_conf 65687899999998765448-6--997767631042379988642388-----368504300002148885210038652 Q gi|254780670|r 259 LELQPNEMASYLANLVDQYP-I--YSIEDGMSEDDWHGWKILTNKIGS-----NCQLVGDDLFVTNPERLHKGINEEVAN 330 (424) Q Consensus 259 ~~~t~~elid~y~~l~~~yP-I--~sIEDP~~e~D~~gw~~l~~~~g~-----~~~IvGDDL~vTn~~rl~~gi~~~~~N 330 (424) +..|++..++-...|.++|| + +.+.|-.---|..-+..|+..+.. .|+.-.| .+.+.++.- ++.=|. T Consensus 225 R~RSpe~VvdEIe~l~~~y~gv~~~~f~DD~Ft~~~~r~~eic~~i~~l~i~W~~~~Rv~----~d~E~l~~m-k~AGc~ 299 (472) T TIGR03471 225 RTRSAESVIEEVKYALENFPEVREFFFDDDTFTDDKPRAEEIARKLGPLGVTWSCNARAN----VDYETLKVM-KENGLR 299 (472) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEECC----CCHHHHHHH-HHHCCE T ss_conf 215999999999999986689758999477667899999999999987698278763034----899999999-983984 Q ss_pred EE-----------EECCCCHHHHHHHHHHHHHHHHCCCEEEEE---CCCCCCCHHHHH--HHHHHHCCCHHEEC Q ss_conf 23-----------425453003899999999999879869995---078887235899--89998199801007 Q gi|254780670|r 331 AI-----------LIKPNQIGSVSETLNTIEKAQMAGYPSIIS---HRSGETEDHTIA--DLAVATNCGQIKTG 388 (424) Q Consensus 331 ai-----------liK~NQiGTvset~ea~~~a~~~g~~~ivS---hRSGETeD~~ia--DLAVg~~a~~iK~G 388 (424) .+ |=..|.--|+.+..++++.++++|+.+..+ -=-|||.+++-. ++|.-++..++... T Consensus 300 ~v~~GiESgsq~iL~~i~K~~t~e~~~~av~~~k~~GI~v~~~FIiG~PgET~Eti~~Ti~fa~~l~~d~~~~s 373 (472) T TIGR03471 300 LLLVGYESGDQQILKNIKKGLTVEIARRFTRDCHKLGIKVHGTFILGLPGETRETIRKTIDFAKELNPHTIQVS 373 (472) T ss_pred EEEEECCCCCHHHHHHHCCCCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHCCCCEEEEE T ss_conf 89980375899999985389989999999998875798799999877999988999999999997598908998 No 71 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=67.96 E-value=9.8 Score=18.51 Aligned_cols=121 Identities=18% Similarity=0.216 Sum_probs=79.3 Q ss_pred HHHHHHHHHHHHHH----HHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCC Q ss_conf 98999999999998----75200001367655764488684689999999999753853111023431104653202785 Q gi|254780670|r 176 AIRMGAEVFHTLKK----ELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEK 251 (424) Q Consensus 176 al~~~~ev~~~lk~----~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~k 251 (424) +--.++.-|+.++. +...+|..|+. ..=-..+|....+.- +|-.-|.++.|.-.-.|=.+.- T Consensus 6 G~~lafRayfAl~~~~~~L~~~~G~~t~A------------~yGF~~~L~~~l~~~--~Pm~~~~~aFD~~~~TFR~e~Y 71 (1005) T TIGR00593 6 GHSLAFRAYFALPNNKKPLTNSKGEPTNA------------VYGFTKMLLKLLKEE--KPMTYVAVAFDSGTKTFRHEAY 71 (1005) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCHHH------------HHHHHHHHHHHHHHH--CCCEEEEEEECCCCCCHHHHHC T ss_conf 34799999886221146666678871167------------788899999999741--8951699887688885166625 Q ss_pred EEECCCCCCCCHHH---HHHHHHHHHHHCCEEEEECCCCC-CC-HHHHHHHHHHCCCCEEEECCC Q ss_conf 11114686568789---99999987654486997767631-04-237998864238836850430 Q gi|254780670|r 252 YILKGENLELQPNE---MASYLANLVDQYPIYSIEDGMSE-DD-WHGWKILTNKIGSNCQLVGDD 311 (424) Q Consensus 252 Y~~~~~~~~~t~~e---lid~y~~l~~~yPI~sIEDP~~e-~D-~~gw~~l~~~~g~~~~IvGDD 311 (424) -.|+. |+..+|+| ++.+..+|+.++.|.-+|=+=.| || ...+++....-|-.+.||-.| T Consensus 72 ~~YKa-~R~~tP~el~~Qi~~i~e~l~algi~~~e~~GYEADD~IaTLA~~a~~~G~~V~I~sgD 135 (1005) T TIGR00593 72 EEYKA-NRAKTPEELIEQIPLIKELLEALGIPILEVEGYEADDVIATLAKQAEKEGYEVRIISGD 135 (1005) T ss_pred HHHCC-CCCCCCHHHHHHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 33157-88888874687689999999971981882288710128999888775468548998378 No 72 >PRK07366 succinyldiaminopimelate transaminase; Validated Probab=67.58 E-value=10 Score=18.46 Aligned_cols=29 Identities=7% Similarity=-0.111 Sum_probs=19.7 Q ss_pred CCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 865687899999998765448699776763 Q gi|254780670|r 258 NLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 258 ~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) |..++.+++-++ .++|.+|.++-|.|=.+ T Consensus 179 G~v~s~~~l~~i-~~~a~~~~i~lisDEiY 207 (388) T PRK07366 179 TAIAPLSFFQEA-VAFCQQHDLVLVHDFPY 207 (388) T ss_pred CEEEEHHHHHHH-HHHHHHCCEEEECCCEE T ss_conf 668038888889-99874055078622021 No 73 >PRK00064 recF recombination protein F; Reviewed Probab=67.06 E-value=10 Score=18.38 Aligned_cols=56 Identities=21% Similarity=0.125 Sum_probs=42.3 Q ss_pred HHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCC-CCEEEECCCEEECCHHHHHHHH Q ss_conf 9999998765448699776763104237998864238-8368504300002148885210 Q gi|254780670|r 266 MASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIG-SNCQLVGDDLFVTNPERLHKGI 324 (424) Q Consensus 266 lid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g-~~~~IvGDDL~vTn~~rl~~gi 324 (424) ..+++.+-..++||+-|-|.++|=|-..-..|...+. ..+|+. +++|+++.+..-. T Consensus 290 q~~~~~~~~~~~PIlLlDDv~sELD~~r~~~ll~~l~~~~~Qvf---iT~td~~~~~~~~ 346 (355) T PRK00064 290 EAELLKEETGEAPILLLDDVASELDDGRRAALLETLIALGAQVF---ITATDLEGFDALW 346 (355) T ss_pred HHHHHHHHHCCCCEEEEECCHHCCCHHHHHHHHHHHHHCCCEEE---EECCCHHHHHHHH T ss_conf 99999985499978999483001699999999999874798699---9757878876554 No 74 >TIGR03555 F420_mer 5,10-methylenetetrahydromethanopterin reductase. Members of this protein family are 5,10-methylenetetrahydromethanopterin reductase, an F420-dependent enzyme of methanogenesis. It is restricted to the Archaea. Probab=66.27 E-value=11 Score=18.28 Aligned_cols=30 Identities=20% Similarity=0.393 Sum_probs=22.8 Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECC-CCCC Q ss_conf 00389999999999987986999507-8887 Q gi|254780670|r 339 IGSVSETLNTIEKAQMAGYPSIISHR-SGET 368 (424) Q Consensus 339 iGTvset~ea~~~a~~~g~~~ivShR-SGET 368 (424) +||--+.++-++.-.+.|..-|+.+- .|.. T Consensus 281 ~Gtpe~v~~~i~~~~~~Gvd~i~~~~~~g~~ 311 (325) T TIGR03555 281 AGTPDDVIEKIEELLKAGVTQVVAGSPIGPD 311 (325) T ss_pred ECCHHHHHHHHHHHHHCCCCEEEEECCCCCC T ss_conf 8999999999999997499879982888979 No 75 >PRK06056 consensus Probab=66.26 E-value=11 Score=18.28 Aligned_cols=180 Identities=18% Similarity=0.215 Sum_probs=80.9 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHCCCC--CCCCCCCCCCCCHHHHHHHHHHHHHCCCCH-HHHCC--CC---------- Q ss_conf 7567897598855789888646530366--432212322220244899875266349848-88225--32---------- Q gi|254780670|r 71 DEIRTALLGCDARDQLLIDKIMIDLDGT--PNKSRLGANAILGVSLAVSKAAAQTSNLPL-YKYLG--GC---------- 135 (424) Q Consensus 71 ~~i~~~L~g~~~~dQ~~iD~~Li~lDgT--~nks~lGaNailAvSlA~akA~A~~~~~pL-y~yl~--~~---------- 135 (424) ..|+..+-+..++.-.+|+..-.++... +=-+-=.|+.-+.+.-.+..|.......|. ++|-. |. T Consensus 5 ~~~s~r~~~~~~s~~~~i~~~a~~l~~~G~~vi~l~~G~Pd~~~p~~~~~a~~~a~~~~~~~~Y~~~~G~~eLR~aia~~ 84 (402) T PRK06056 5 SRISARIGGIAESATLAVDAKAKALKAAGRPVIGFGAGEPDFPTPDHIVEAAVAACRDPKNHRYTPAAGLPELREAIAAK 84 (402) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCCEECCCCCCCCCCCHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 89999986499379999999999999779982788896999998899999999997599888579986819999999999 Q ss_pred --CCCCCCCC--CEEECCCCCCCCC-----CCCCEEEEEECC-CCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf --45647543--0000036645567-----766102565148-8774899989999999999987520000136765576 Q gi|254780670|r 136 --SAHILPVP--LMNILNGGIHADN-----ALDFQEFMIMPV-GAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGG 205 (424) Q Consensus 136 --~~~~lP~P--~~NiinGG~HA~~-----~l~iQEfmIiP~-g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGG 205 (424) ..+.+++- -+-|-+|+++|-. -++--+-.++|. ...+|..++++.- .+........++| T Consensus 85 ~~~~~g~~~~~d~I~iT~G~~~al~~~~~~l~~pGD~Vlv~~P~y~~y~~~~~~~g-----------~~~v~v~~~~~~~ 153 (402) T PRK06056 85 TLRDSGYEVSPSQVLVTNGGKQAVYEAFATLLDPGDEVILPAPYWTTYPEAIRLAG-----------GVPVEVVAGADQG 153 (402) T ss_pred HHHHHCCCCCHHHEEEECCCHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHCC-----------CCEEEECCCCCCC T ss_conf 99983899880237980681499999999857999989981798756199999728-----------9707822373337 Q ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCCCCC-HHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEEC Q ss_conf 44886846899999999997538531110-23431104653202785111146865687899999998765448699776 Q gi|254780670|r 206 FSPNLKTADSALDLIANSIEKAGYNAGKD-LLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIED 284 (424) Q Consensus 206 fap~~~~~eeaL~ll~eAi~~aGy~~g~d-i~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIED 284 (424) |.|+++..+.+ + .+... +.+. +.. .-.|..++.+++-++ .+||.+|.+.-|+| T Consensus 154 ~~~d~~~le~~-------~-----~~~tk~iil~---------~P~----NPtG~v~s~e~l~~l-~~la~~~~i~ii~D 207 (402) T PRK06056 154 YLVTVEQLEAA-------R-----TERTKVLLFV---------SPS----NPTGAVYSPEQVRAI-GRWAAEHGIWVISD 207 (402) T ss_pred CCCCHHHHHHH-------C-----CCCCEEEEEC---------CCC----CCCCCCCCHHHHHHH-HHHHHHHCCCEECC T ss_conf 78799999963-------6-----6687399984---------897----988887427889999-99999714615510 Q ss_pred CCC Q ss_conf 763 Q gi|254780670|r 285 GMS 287 (424) Q Consensus 285 P~~ 287 (424) =.+ T Consensus 208 EiY 210 (402) T PRK06056 208 EIY 210 (402) T ss_pred HHH T ss_conf 443 No 76 >TIGR01420 pilT_fam twitching motility protein; InterPro: IPR006321 These represent the PilT subfamily of proteins related to GspE, a protein involved in type II secretion (also called the General Secretion Pathway). PilT is an apparent cytosolic ATPase associated with type IV pilus systems. It is not required for pilin biogenesis, but is required for twitching motility and social gliding behaviors, shown in some species, powered by pilus retraction . Members of this family may be found in some species that do not have type IV pili but have related structures for DNA uptake and natural transformation. ; GO: 0005524 ATP binding, 0006810 transport. Probab=66.22 E-value=2.9 Score=22.45 Aligned_cols=135 Identities=15% Similarity=0.209 Sum_probs=79.0 Q ss_pred CHHHHHHHHHHHHH----HHHHHHHHC---CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHC-HHH Q ss_conf 48999899999999----999875200---0013676557644886846899999999997538531110234311-046 Q gi|254780670|r 172 NIREAIRMGAEVFH----TLKKELKSK---GYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALD-CAA 243 (424) Q Consensus 172 sf~eal~~~~ev~~----~lk~~L~~~---g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD-~AA 243 (424) |-++-.++..++.. .-.+.++++ -++.+|.+-+=|=.|.-...-+.-++..+|..-=-+. + .+||= --- T Consensus 43 t~~~~~~l~~~~l~~th~~~~~~f~~~~E~Dfs~~~~~~~RfRvN~f~QRg~~a~vlR~ip~~Ip~f-e--~LGLP~~v~ 119 (350) T TIGR01420 43 TPEDTQKLLREILSSTHEKQREEFEENGELDFSFSLPGLARFRVNAFKQRGGVALVLRLIPSKIPTF-E--ELGLPRPVL 119 (350) T ss_pred CHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCEEEEECCCCEEEEHHHHHCCHHHHHHHHCCCCCCCH-H--HCCCCHHHH T ss_conf 9899999999863845657775056506644466306732212203235000642323115346216-6--637987899 Q ss_pred HHHCCCCCEEEC-----CCCCCCCHHHHHHHHHHHHHHCCEEEEECCC-------------CC--CCHHHH-HHHHHHCC Q ss_conf 532027851111-----4686568789999999876544869977676-------------31--042379-98864238 Q gi|254780670|r 244 SAFFKKEKYILK-----GENLELQPNEMASYLANLVDQYPIYSIEDGM-------------SE--DDWHGW-KILTNKIG 302 (424) Q Consensus 244 sefy~~~kY~~~-----~~~~~~t~~elid~y~~l~~~yPI~sIEDP~-------------~e--~D~~gw-~~l~~~~g 302 (424) .+|.+..+.-.- +.|+.-|=.-||||..+= ..|-|+.||||. +| .|-.+| .-|-++|- T Consensus 120 ~~~a~~~~GLiLVTGPTGSGKSTTlAsmIDyIN~~-~~~HIiTIEDPIEyvh~~~~sli~QREvG~DT~sF~~ALraALR 198 (350) T TIGR01420 120 RELAERPRGLILVTGPTGSGKSTTLASMIDYINKN-KAGHIITIEDPIEYVHKNKRSLINQREVGLDTLSFANALRAALR 198 (350) T ss_pred HHHHHCCCCCEEEECCCCCCHHHHHHHHHHHHHCC-CCCCCEEEECCEEEEECCCEEEEECCCCCCCHHHHHHHHHHHHC T ss_conf 99983669938987688986789999999787403-88882563177314104770245436246754579999768410 Q ss_pred --CCEEEECC Q ss_conf --83685043 Q gi|254780670|r 303 --SNCQLVGD 310 (424) Q Consensus 303 --~~~~IvGD 310 (424) ..+-+||. T Consensus 199 eDPDvILiGE 208 (350) T TIGR01420 199 EDPDVILIGE 208 (350) T ss_pred CCCCEEEEEC T ss_conf 2898899825 No 77 >cd06309 PBP1_YtfQ_like Periplasmic binding domain of ABC-type YtfQ-like transport systems. Periplasmic binding domain of ABC-type YtfQ-like transport systems. The YtfQ protein from Escherichia coli is up-regulated under glucose-limited conditions and shares homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes; however their ligand specificity is not determined experimentally. Probab=65.55 E-value=11 Score=18.18 Aligned_cols=59 Identities=10% Similarity=0.068 Sum_probs=28.4 Q ss_pred CCHHHHHHHHHHHHHHCC--EEEEECCCCCCCHHHH--HHHHHHCC----CCEEEECCCEEECCHHHHHH Q ss_conf 687899999998765448--6997767631042379--98864238----83685043000021488852 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP--IYSIEDGMSEDDWHGW--KILTNKIG----SNCQLVGDDLFVTNPERLHK 322 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP--I~sIEDP~~e~D~~gw--~~l~~~~g----~~~~IvGDDL~vTn~~rl~~ 322 (424) .+.++-......|+..|| +..| |.-+|.-.. ..-.++.| +++.|+|=|-.....+.++. T Consensus 166 ~~~~~~~~~~~~~L~~~p~~i~ai---~~~nd~~a~Ga~~al~~~G~~~~~di~ivg~D~~~~~~~~i~~ 232 (273) T cd06309 166 FTRAKGKEVMEALLKAHGDDIDAV---YAHNDEMALGAIQAIKAAGKKPGKDIKIVSIDGTKDAFQAMAD 232 (273) T ss_pred CCHHHHHHHHHHHHHHCCCCCCEE---EECCCHHHHHHHHHHHHCCCCCCCCEEEEEECCCHHHHHHHHC T ss_conf 688999999999997278888889---8678579999999999839999998699998999999999876 No 78 >KOG1220 consensus Probab=62.50 E-value=12 Score=17.79 Aligned_cols=76 Identities=24% Similarity=0.267 Sum_probs=49.2 Q ss_pred CCCCCCC-CCCCHHH--HHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHH-HHHC Q ss_conf 5576448-8684689--9999999997538531110234311046532027851111468656878999999987-6544 Q gi|254780670|r 202 DEGGFSP-NLKTADS--ALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANL-VDQY 277 (424) Q Consensus 202 DEGGfap-~~~~~ee--aL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l-~~~y 277 (424) ..+-|.- ++.++|+ ||++..++. ...+-|+.++-|.-|..|---+| ..++.+.++.+|++.++..| .++| T Consensus 284 Pdp~FPt~~~PNPEek~aL~ls~~~a----~~n~~dlvlanDpDaDR~avaek--~~G~wr~fnGNElgALl~~~~le~~ 357 (607) T KOG1220 284 PDPMFPTVPFPNPEEKGALDLSIKAA----LKNSADLVLANDPDADRFAVAEK--VSGEWRVFNGNELGALLSWWVLEEH 357 (607) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHH----HCCCCCEEEECCCCCCHHHHEEC--CCCCCEECCCHHHHHHHHHHHHHHC T ss_conf 99999888998932177999999987----50488489950897423452013--5785035163289999999999866 Q ss_pred CEEEEE Q ss_conf 869977 Q gi|254780670|r 278 PIYSIE 283 (424) Q Consensus 278 PI~sIE 283 (424) |=.-+. T Consensus 358 k~~~~~ 363 (607) T KOG1220 358 KGSTPV 363 (607) T ss_pred CCCCCC T ss_conf 688853 No 79 >pfam02593 DUF166 Uncharacterized ArCR, COG1810. Probab=61.93 E-value=13 Score=17.72 Aligned_cols=39 Identities=5% Similarity=0.078 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHCC-EEEEECCCCC---CCHHHHHHHHHHCCC Q ss_conf 99999998765448-6997767631---042379988642388 Q gi|254780670|r 265 EMASYLANLVDQYP-IYSIEDGMSE---DDWHGWKILTNKIGS 303 (424) Q Consensus 265 elid~y~~l~~~yP-I~sIEDP~~e---~D~~gw~~l~~~~g~ 303 (424) .+..-+.+-+++|. -+..-.||.. +.......|.+.+|. T Consensus 87 g~~~~lk~~~e~~g~~~~~P~~~CsL~~~~~p~i~~F~~~FG~ 129 (215) T pfam02593 87 GLRKGLKEQLEEFGVEVEFPEPFCSLEPVGNPVIDEFAERFGR 129 (215) T ss_pred CHHHHHHHHHHHCCCEEECCCCCCCCCCCCCHHHHHHHHHHCC T ss_conf 1289999999975973877876557889888669999998199 No 80 >TIGR03569 NeuB_NnaB N-acetylneuraminate synthase. This family is a subset of the pfam03102 and is believed to include only authentic NeuB N-acetylneuraminate (sialic acid) synthase enzymes. The majority of the genes identified by this model are observed adjacent to both the NeuA and NeuC genes which together effect the biosynthesis of CMP-N-acetylneuraminate from UDP-N-acetylglucosamine. Probab=61.60 E-value=13 Score=17.68 Aligned_cols=148 Identities=18% Similarity=0.164 Sum_probs=84.7 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEE-EECCCEEECCHHHHHHHHHCCCCCEEEEC Q ss_conf 68656878999999987654486997767631042379988642388368-50430000214888521003865223425 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQ-LVGDDLFVTNPERLHKGINEEVANAILIK 335 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~-IvGDDL~vTn~~rl~~gi~~~~~NailiK 335 (424) +..+++.+++.++ .++|++..|..+-.||++++.+- ..+++-... |-..| .||...|++..+.+ .-++|. T Consensus 69 kk~els~~~~~~L-~~~~~~~gi~~~~s~fd~~s~~~----l~~~~~~~~KIaS~d--~~n~~Li~~i~~~~--kpiiiS 139 (329) T TIGR03569 69 KKLELSEEDHREL-KEYCESKGIEFLSTPFDLESADF----LEDLGVPRFKIPSGE--ITNAPLLKKIARFG--KPVILS 139 (329) T ss_pred HHHCCCHHHHHHH-HHHHHHHCCEEEECCCCHHHHHH----HHHCCCCEEEECCCC--CCCHHHHHHHHHHC--CCEEEE T ss_conf 8714899999999-99999729929989698999999----986499979979231--05489999999748--978996 Q ss_pred CCCHHHHHHHHHHHHHHHHCCC---EEEEEC-C----------------------------CCCCCHHHHHHHHHHHCCC Q ss_conf 4530038999999999998798---699950-7----------------------------8887235899899981998 Q gi|254780670|r 336 PNQIGSVSETLNTIEKAQMAGY---PSIISH-R----------------------------SGETEDHTIADLAVATNCG 383 (424) Q Consensus 336 ~NQiGTvset~ea~~~a~~~g~---~~ivSh-R----------------------------SGETeD~~iaDLAVg~~a~ 383 (424) . -..|+.|..++++..+.++. ..++=| - |.-|.+...+=+||++||. T Consensus 140 t-G~s~~~EI~~av~~~~~~~~~~~~i~llhC~s~YP~~~~~~nL~~i~~lk~~f~~~iG~SDHt~g~~~~~~Ava~GA~ 218 (329) T TIGR03569 140 T-GMATLEEIEAAVGVLRDAGTPDSNITLLHCTTEYPAPFEDVNLNAMDTLKEAFDLPVGYSDHTLGIEAPIAAVALGAT 218 (329) T ss_pred C-CCCHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCCCHHHCCHHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHCCCE T ss_conf 7-862399999999999974886557688643788999937647899999999859986847978983679999986996 Q ss_pred HHEE--------CCCC-----HHHHHHHHHHHHHHH-HHHHHCCC Q ss_conf 0100--------7561-----144899999999999-98311045 Q gi|254780670|r 384 QIKT--------GSLA-----RSDRIAKYNQLIRIE-ESLGKQAK 414 (424) Q Consensus 384 ~iK~--------G~~~-----R~ER~aKyNrLlrIe-e~Lg~~a~ 414 (424) .|-- -+|- -.+-..++-+-+|.- .-||+.-+ T Consensus 219 vIEKHfTldk~~~g~D~~~Sl~p~e~k~lv~~i~~~~~~lG~~~k 263 (329) T TIGR03569 219 VIEKHFTLDKNLPGPDHKASLEPDELKEMVQGIRNVEKALGDGVK 263 (329) T ss_pred EEEEEEECCCCCCCCCHHHCCCHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 999877768999999678709999999999999999998199988 No 81 >PRK06108 aspartate aminotransferase; Provisional Probab=61.48 E-value=13 Score=17.67 Aligned_cols=30 Identities=3% Similarity=0.086 Sum_probs=19.9 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-+. .++|.+|.++-|+|=.+ T Consensus 171 tG~v~s~e~l~~l-~~la~~~~v~iisDEiY 200 (382) T PRK06108 171 TGWTASRDDQQAI-LAHCRRHGLWIVADEVY 200 (382) T ss_pred CCHHCCHHHHHHH-HHHHHCCCCEEECHHHH T ss_conf 6620447889999-99876269753533433 No 82 >PTZ00340 O-sialoglycoprotein endopeptidase; Provisional Probab=60.73 E-value=13 Score=17.57 Aligned_cols=114 Identities=18% Similarity=0.172 Sum_probs=59.1 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHH--HCC-----CC Q ss_conf 99999998756789759885578988864653036643221232222024489987526634984888--225-----32 Q gi|254780670|r 63 LKAIAFVNDEIRTALLGCDARDQLLIDKIMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYK--YLG-----GC 135 (424) Q Consensus 63 ~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~--yl~-----~~ 135 (424) +..++++...+..+|..-+. + +.++|.-.--..-|=-..|-|-+.+||+-|...++||+- |+- .. T Consensus 54 r~H~~~i~~lv~~aL~~a~i------~--~~did~IAvT~gPGL~g~L~VG~~~AK~La~~~~~Pli~VnHlegHi~~~~ 125 (348) T PTZ00340 54 QHHRQHILSLVQEALEEAGI------T--LSDISLICYTKGPGMGAPLAVGATVAKTLSLLWGKPLVGVNHCVAHIEMGR 125 (348) T ss_pred HHHHHHHHHHHHHHHHHCCC------C--HHCCCEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHHHH T ss_conf 99999999999999998599------8--411857997279985165899999999999980998352130243342121 Q ss_pred CCCCCCCCCEEECCCCCCCC-CCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 45647543000003664556-776610256514887748999899999999999875 Q gi|254780670|r 136 SAHILPVPLMNILNGGIHAD-NALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKEL 191 (424) Q Consensus 136 ~~~~lP~P~~NiinGG~HA~-~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L 191 (424) -....+.|++=+++|| |.- =..+..+|-|+ | .|.-+| .-|.|-...+.| T Consensus 126 l~~~~~~P~~LlVSGG-HT~ll~~~~~~~~il--G-~T~DdA---~GEafDKvAr~L 175 (348) T PTZ00340 126 LVTGSENPVVLYVSGG-NTQVIAYSEKRYRVF--G-ETIDIA---VGNCLDRFARLL 175 (348) T ss_pred HCCCCCCCEEEEECCC-CEEEEEECCCEEEEE--E-EECCCC---CHHHHHHHHHHH T ss_conf 0379878569998689-769999868617897--6-652653---057999999980 No 83 >cd06318 PBP1_ABC_sugar_binding_like_9 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=60.54 E-value=13 Score=17.55 Aligned_cols=75 Identities=16% Similarity=0.247 Sum_probs=42.7 Q ss_pred CCCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHHHH---HHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEEE Q ss_conf 5687899999998765448-6997767631042379---98864238--8368504300002148885210038652234 Q gi|254780670|r 260 ELQPNEMASYLANLVDQYP-IYSIEDGMSEDDWHGW---KILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAIL 333 (424) Q Consensus 260 ~~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~gw---~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~Nail 333 (424) ..+.++-..-..+|+..|| |-.| |.-+|.-.. +-+. ..| +++.|+|-|-.....+.++.| ...-.+. T Consensus 171 ~~~~~~a~~~~~~~L~~~p~i~~i---~~~nD~~a~Ga~~Al~-~~G~~~~v~vvG~D~~~~~~~~i~~g---~~~~tv~ 243 (282) T cd06318 171 DWTREGGLKAMEDLLVAHPDINVV---YSENDDMALGAMRVLA-EAGKTDDVKVAAADGQKEALALIKGG---KYGATAL 243 (282) T ss_pred CCCHHHHHHHHHHHHHHCCCCEEE---EECCCHHHHHHHHHHH-HCCCCCCEEEEEECCCHHHHHHHHCC---CCEEEEE T ss_conf 977999999999999758997099---9889679999999999-66999982999989889999999839---9649993 Q ss_pred ECCCCHHH Q ss_conf 25453003 Q gi|254780670|r 334 IKPNQIGS 341 (424) Q Consensus 334 iK~NQiGT 341 (424) -.|..+|- T Consensus 244 q~p~~~G~ 251 (282) T cd06318 244 NDPDLVAR 251 (282) T ss_pred CCHHHHHH T ss_conf 49999999 No 84 >PRK08835 consensus Probab=60.48 E-value=13 Score=17.55 Aligned_cols=16 Identities=19% Similarity=0.297 Sum_probs=6.1 Q ss_pred CCEEEECCCCHHHHHH Q ss_conf 5223425453003899 Q gi|254780670|r 329 ANAILIKPNQIGSVSE 344 (424) Q Consensus 329 ~NailiK~NQiGTvse 344 (424) .|++.|.....-.+++ T Consensus 544 ~~GI~vD~~~L~~ls~ 559 (931) T PRK08835 544 RTGVLIDDMLLSAQSQ 559 (931) T ss_pred HHCEEECHHHHHHHHH T ss_conf 7284878999999999 No 85 >cd01540 PBP1_arabinose_binding Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. ABP is only involved in transport contrary to other related sugar-binding proteins such as the glucose/galactose-binding protein (GGBP) and the ribose-binding protein (RBP), both of which are involved in chemotaxis as well as transport. The periplasmic ABP consists of two alpha/beta globular domains connected by a three-stranded hinge, a Venus flytrap-like domain, which undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, ABP is homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR) and DNA-binding transcriptional repressors such a Probab=60.42 E-value=11 Score=18.05 Aligned_cols=147 Identities=17% Similarity=0.211 Sum_probs=72.6 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHCCCC-CCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHC Q ss_conf 87748999899999999999875200001-36765576448868468999999999975385311102343110465320 Q gi|254780670|r 169 GAENIREAIRMGAEVFHTLKKELKSKGYS-TNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFF 247 (424) Q Consensus 169 g~~sf~eal~~~~ev~~~lk~~L~~~g~~-t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy 247 (424) +..++..+-+++ +.+.+.+.+++.. ..++=-.+..+...+..+=.+=..++++..||. ++.+ T Consensus 104 g~dn~~~G~~~g----~~la~~l~~kg~~~~~~~vi~~~~~~~~~~~~R~~Gf~~~l~~~g~p---~~~v---------- 166 (289) T cd01540 104 GMSATKIGEQVG----EAIADEMKKRGWDPKEVGALRITYDELDTAKPRTDGALEALKAPGFP---EANI---------- 166 (289) T ss_pred EECHHHHHHHHH----HHHHHHHHHHCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHCCCC---CCEE---------- T ss_conf 614689999999----99999998747886635999916888821889987699999635999---6079---------- Q ss_pred CCCCEEECCCCCCCCHHHHHHHHHHHHHHCC-EE--EEECCCCCCCHHH---HHHHHHH--CCCCEEEECCCEEECCHHH Q ss_conf 2785111146865687899999998765448-69--9776763104237---9988642--3883685043000021488 Q gi|254780670|r 248 KKEKYILKGENLELQPNEMASYLANLVDQYP-IY--SIEDGMSEDDWHG---WKILTNK--IGSNCQLVGDDLFVTNPER 319 (424) Q Consensus 248 ~~~kY~~~~~~~~~t~~elid~y~~l~~~yP-I~--sIEDP~~e~D~~g---w~~l~~~--~g~~~~IvGDDL~vTn~~r 319 (424) +.......++++-.+.-.+|+.+|| +. .| |..+|.-. .+.+.+. -++++.+||-|-+-..... T Consensus 167 ------v~~~~~~~~~~~a~~~~~~~L~a~pdi~~~~i---~a~ND~mAlGA~~Al~~aGl~~~~v~vvgidg~~~~~~~ 237 (289) T cd01540 167 ------FQAPQKTTDTEGAFDAAASTLTKNPNVKNWII---YGLNDETVLGAVRATEQSGIAAADVIGVGINGSDAADEE 237 (289) T ss_pred ------EEECCCCCCHHHHHHHHHHHHHHCCCCCCEEE---EECCCHHHHHHHHHHHHCCCCCCCEEEEEEECCCHHHHH T ss_conf ------98456886699999999999975899884399---977966889999999974998897699997288259999 Q ss_pred HHHHHHCCCCCEEEECCCCHHHH Q ss_conf 85210038652234254530038 Q gi|254780670|r 320 LHKGINEEVANAILIKPNQIGSV 342 (424) Q Consensus 320 l~~gi~~~~~NailiK~NQiGTv 342 (424) ++.. +....-++.-.|.++|.+ T Consensus 238 ~~~~-~~~~~atv~Q~p~~~G~~ 259 (289) T cd01540 238 SKKQ-PTGFYGTVLISPTNHGYI 259 (289) T ss_pred HHCC-CCCEEEEEEECHHHHHHH T ss_conf 7433-676489995088999999 No 86 >cd06310 PBP1_ABC_sugar_binding_like_2 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=60.17 E-value=13 Score=17.51 Aligned_cols=76 Identities=16% Similarity=0.276 Sum_probs=43.1 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHH--HHHHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEEEEC Q ss_conf 687899999998765448-69977676310423--7998864238--836850430000214888521003865223425 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDWH--GWKILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAILIK 335 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~--gw~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK 335 (424) .+.++-.+...+|+.++| +..|- .-+|.. |-.+..+..| .++.|+|=|....+.+.|+.|. ..-.+.-. T Consensus 165 ~~~~~~~~~~~~~L~~~pdi~ai~---~~nd~~a~ga~~al~~~g~~~~i~vvg~D~~~~~~~~i~~G~---i~~tv~q~ 238 (273) T cd06310 165 SDYAKALDITEDLLTANPDLKGIF---GANEGSAVGAARAVRQAGKAGKVKVVGFDASPEQIDALREGV---IQALVVQN 238 (273) T ss_pred CHHHHHHHHHHHHHHHCCCCCEEE---ECCCHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHCCC---CEEEECCC T ss_conf 248999999999997589875798---438489999999999779999939999898699999998599---85998189 Q ss_pred CCCHHHH Q ss_conf 4530038 Q gi|254780670|r 336 PNQIGSV 342 (424) Q Consensus 336 ~NQiGTv 342 (424) |-+.|-. T Consensus 239 p~~~G~~ 245 (273) T cd06310 239 PYKMGYL 245 (273) T ss_pred HHHHHHH T ss_conf 9999999 No 87 >PRK08928 consensus Probab=59.77 E-value=14 Score=17.46 Aligned_cols=16 Identities=25% Similarity=0.306 Sum_probs=7.8 Q ss_pred EEEEECCCCCEEEEEE Q ss_conf 6886079784299999 Q gi|254780670|r 9 REVIDSRGSPTIEVDV 24 (424) Q Consensus 9 r~IlDSRG~PTvEvev 24 (424) +++++.-|-|.++++= T Consensus 94 ~~~~~~~gi~~~~~~g 109 (861) T PRK08928 94 REAVSALNIPIEEKNG 109 (861) T ss_pred HHHHHHCCCCEEEECC T ss_conf 9999987999854489 No 88 >PRK07997 consensus Probab=59.53 E-value=14 Score=17.43 Aligned_cols=18 Identities=33% Similarity=0.468 Sum_probs=9.8 Q ss_pred CCEEEECCCCHHHHHHHH Q ss_conf 522342545300389999 Q gi|254780670|r 329 ANAILIKPNQIGSVSETL 346 (424) Q Consensus 329 ~NailiK~NQiGTvset~ 346 (424) .+++.|.......+++.+ T Consensus 542 ~~GI~vD~~~L~~ls~el 559 (928) T PRK07997 542 RNGVKIDPAVLHAHSEEL 559 (928) T ss_pred HCCEEECHHHHHHHHHHH T ss_conf 539599899999999999 No 89 >PRK06256 biotin synthase; Validated Probab=59.28 E-value=14 Score=17.40 Aligned_cols=55 Identities=16% Similarity=0.047 Sum_probs=31.1 Q ss_pred ECCCCHHHHHHHHHHHHHHHHC--CCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCC Q ss_conf 2545300389999999999987--98699950788872358998999819980100756 Q gi|254780670|r 334 IKPNQIGSVSETLNTIEKAQMA--GYPSIISHRSGETEDHTIADLAVATNCGQIKTGSL 390 (424) Q Consensus 334 iK~NQiGTvset~ea~~~a~~~--g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~ 390 (424) +...+..|.-|.+..+-++|-. .-...+|..-...-++.- -++.+ ||.-+-+|.. T Consensus 245 l~~~~~l~~~e~lr~iAi~Rl~~P~~~Ir~agGr~~~~~~~~-~~~~~-gan~~~~G~~ 301 (325) T PRK06256 245 LEDLPELTPLECLKTIAIFRLINPDKEIRIAGGREIALRSLQ-PLALK-GANSIFVGNY 301 (325) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHH-HHHHH-HCHHHEECCC T ss_conf 688999899999999999999789954897078552255679-99986-1735146665 No 90 >PRK13234 nifH nitrogenase reductase; Reviewed Probab=58.64 E-value=14 Score=17.33 Aligned_cols=123 Identities=14% Similarity=0.061 Sum_probs=81.7 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 10256514887748999899999999999875200001367655764488684689999999999753853111023431 Q gi|254780670|r 160 FQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIAL 239 (424) Q Consensus 160 iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~l 239 (424) -||+.|+-.+. ..+|..+-+|.+.+++.-... -..-||...|-.+.+.-.+++.+-.++.|-..=-.|--.. T Consensus 145 A~evyIVtSge---~mslyAANnI~~~i~~~~~~g-----~~rlgGlI~N~r~~~~e~~~v~~fa~~~gt~ii~~IPrs~ 216 (293) T PRK13234 145 AQEIYIVMSGE---MMALYAANNIAKGILKYANSG-----GVRLGGLICNERQTDRELELAEALAKRLGSKLIHFVPRDN 216 (293) T ss_pred CCEEEEEECCC---HHHHHHHHHHHHHHHHHHHCC-----CCEEEEEEEECCCCCCHHHHHHHHHHHHCCCEEEECCCCH T ss_conf 76899994671---879999999999999986326-----9624689971789853799999999984993799779968 Q ss_pred CHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHH----HHCCEEEEECCCCCCCHHHHHH Q ss_conf 10465320278511114686568789999999876----5448699776763104237998 Q gi|254780670|r 240 DCAASAFFKKEKYILKGENLELQPNEMASYLANLV----DQYPIYSIEDGMSEDDWHGWKI 296 (424) Q Consensus 240 D~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~----~~yPI~sIEDP~~e~D~~gw~~ 296 (424) .+.-+|++..---.+.-+ .++.+.|.+|. +..|-..|-.|+..++.+.|.. T Consensus 217 ~v~~aE~~~~TviE~~P~------s~~A~~Yr~LA~~I~~n~~~~~iP~Pl~~~eL~~l~~ 271 (293) T PRK13234 217 IVQHAELRRMTVIEYAPD------SKQAGEYRALAEKIHANSGKGTIPTPITMEELEDMLM 271 (293) T ss_pred HHHHHHHCCCEEEEECCC------CHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHH T ss_conf 899999739777896889------8799999999999984788700188899899999999 No 91 >KOG2450 consensus Probab=58.26 E-value=14 Score=17.28 Aligned_cols=125 Identities=18% Similarity=0.202 Sum_probs=74.4 Q ss_pred CCHHHHHHHHHHHHHH-CCEEEEECCCCCCC-------------HHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHC Q ss_conf 6878999999987654-48699776763104-------------237998864238836850430000214888521003 Q gi|254780670|r 261 LQPNEMASYLANLVDQ-YPIYSIEDGMSEDD-------------WHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINE 326 (424) Q Consensus 261 ~t~~elid~y~~l~~~-yPI~sIEDP~~e~D-------------~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~ 326 (424) .--+|+++-+..+.++ +++- |||+|+- ..+|-+.-++-|.+++.-|+.+ + T Consensus 313 ~iydefv~~~v~~a~~~~kvG---dP~~~~~~qG~~i~~~q~ekI~~yi~~~k~eGa~l~~gG~~~----------g--- 376 (501) T KOG2450 313 SIYDEFVEKFVAAAKKKLKVG---DPFDEGTEQGPQISKTQYEKILGYIESGKKEGATLLCGGVRL----------G--- 376 (501) T ss_pred HHHHHHHHHHHHHHHCCCCCC---CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEECCCCC----------C--- T ss_conf 579999999999984365328---987766514660589999999999999986588899657233----------8--- Q ss_pred CCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHHHHHHHHHHHHHHH Q ss_conf 86522342545300389999999999987986999507888723589989998199801007561144899999999999 Q gi|254780670|r 327 EVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARSDRIAKYNQLIRIE 406 (424) Q Consensus 327 ~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~ER~aKyNrLlrIe 406 (424) -++-.|+|.-+.-|+ +.++++++.=|+.++.--.=.|+|.-|.. |-...| +++-+=-+--.|+.+|+. T Consensus 377 --~~Gyfi~Ptv~~~v~---~~m~i~~EEIFGPVv~v~~F~t~eE~i~~---AN~t~y----GLAa~V~t~dl~~a~~va 444 (501) T KOG2450 377 --DKGYFIKPTVFTNVT---DDMRIAKEEIFGPVVVVLKFKTEEEVIER---ANNTTY----GLAAGVFTKDLDKAHRVA 444 (501) T ss_pred --CCCEEECCEECCCCC---HHHHHHHHHCCCCEEEEEECCCHHHHHHH---HCCCCC----CCEEEEECCCHHHHHHHH T ss_conf --886387871415787---12454476535852799743719999987---417743----110057506737899999 Q ss_pred HHHHHCC Q ss_conf 9831104 Q gi|254780670|r 407 ESLGKQA 413 (424) Q Consensus 407 e~Lg~~a 413 (424) ..|.... T Consensus 445 ~~l~aGt 451 (501) T KOG2450 445 NALQAGT 451 (501) T ss_pred HHHCCCE T ss_conf 8742765 No 92 >PRK09513 fruK 1-phosphofructokinase; Provisional Probab=57.98 E-value=14 Score=17.25 Aligned_cols=12 Identities=33% Similarity=0.354 Sum_probs=6.8 Q ss_pred CCCCHHHHHHHH Q ss_conf 168048999999 Q gi|254780670|r 57 YFGKGVLKAIAF 68 (424) Q Consensus 57 ~~G~gV~~Av~~ 68 (424) -+|||+..|+.. T Consensus 38 ~GGKG~NvA~~l 49 (312) T PRK09513 38 AAGKGINVAKVL 49 (312) T ss_pred CCCHHHHHHHHH T ss_conf 587399999999 No 93 >PRK11337 DNA-binding transcriptional repressor RpiR; Provisional Probab=57.51 E-value=15 Score=17.20 Aligned_cols=54 Identities=17% Similarity=0.199 Sum_probs=43.1 Q ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCC Q ss_conf 223425453003899999999999879869995078887235899899981998 Q gi|254780670|r 330 NAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCG 383 (424) Q Consensus 330 NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~ 383 (424) +-++|=...-|.-.|++++++.||++|..+|.=-+++.+.=.-+||...-+.+. T Consensus 189 ~Dv~i~iS~sG~t~e~i~~~~~Ak~~Ga~vI~IT~~~~SpLa~~aD~vL~~~~~ 242 (293) T PRK11337 189 GDVVLVVSHSGRTSDVKAAVELAKQNGAKIICITHSYHSPIAKLADYIICSPAP 242 (293) T ss_pred CCEEEEEECCCCCHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHCCEEEECCCC T ss_conf 988999818999889999999999879949997699998468958998864887 No 94 >TIGR01533 lipo_e_P4 5'-nucleotidase, lipoprotein e(P4) family; InterPro: IPR006423 This group of sequences represents a set of bacterial lipoproteins belonging to a larger acid phosphatase family, which in turn belongs to the haloacid dehalogenase (HAD) superfamily of aspartate-dependent hydrolases. Members are found on the outer membrane of Gram-negative bacteria and the cytoplasmic membrane of Gram-positive bacteria. Most members have classic lipoprotein signal sequences. A critical role of this 5'-nucleotidase in Haemophilus influenzae is the degradation of external riboside in order to allow transport into the cell. An earlier suggested role in hemin transport is no longer current. This enzyme may also have other physiologically significant roles.. Probab=57.22 E-value=15 Score=17.16 Aligned_cols=92 Identities=23% Similarity=0.341 Sum_probs=54.5 Q ss_pred HHHCHHHHHHCC-CCCEE--ECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEE Q ss_conf 431104653202-78511--114686568789999999876544869977676310423799886423883685043000 Q gi|254780670|r 237 IALDCAASAFFK-KEKYI--LKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLF 313 (424) Q Consensus 237 i~lD~AAsefy~-~~kY~--~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~ 313 (424) ..||--...-=+ +.+|. ++.+-+.|+.....-|-..=+ =||++|||+-|......- T Consensus 85 ~~ld~~~~~~k~G~kk~AivlDlDETvLDNsPY~gy~v~~~---------k~f~~E~W~~W~~~~~a~------------ 143 (295) T TIGR01533 85 VRLDEELKKEKDGEKKYAIVLDLDETVLDNSPYQGYQVLNN---------KPFDPEDWDKWVQAAQAK------------ 143 (295) T ss_pred HHHHHHHHHCCCCCCCCEEEECCCCCCCCCCHHHHHHHHCC---------CCCCCCCHHHHHHHHCCC------------ T ss_conf 99997532124789874377115320103765579998807---------897987558998631177------------ Q ss_pred ECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEE-EECCCCCCC-HHHHHHH Q ss_conf 02148885210038652234254530038999999999998798699-950788872-3589989 Q gi|254780670|r 314 VTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSI-ISHRSGETE-DHTIADL 376 (424) Q Consensus 314 vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~i-vShRSGETe-D~~iaDL 376 (424) +|--++++.+.|..+|..+- ||-|+-+++ +.|+-+| T Consensus 144 ---------------------------~vaGA~~Fl~ya~~kGv~iFYvsnR~~~~~~~aTl~nL 181 (295) T TIGR01533 144 ---------------------------PVAGALEFLNYANSKGVKIFYVSNRSDEKEKEATLENL 181 (295) T ss_pred ---------------------------CCCCHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHH T ss_conf ---------------------------75107889999987592589965687301244207999 No 95 >cd01173 pyridoxal_pyridoxamine_kinase Pyridoxal kinase plays a key role in the synthesis of the active coenzyme pyridoxal-5'-phosphate (PLP), by catalyzing the phosphorylation of the precursor vitamin B6 in the presence of Zn2+ and ATP. Mammals are unable to synthesize PLP de novo and require its precursors in the form of vitamin B6 (pyridoxal, pyridoxine, and pyridoxamine) from their diet. Pyridoxal kinase encoding genes are also found in many other species including yeast and bacteria. Probab=56.48 E-value=15 Score=17.08 Aligned_cols=17 Identities=29% Similarity=0.020 Sum_probs=9.7 Q ss_pred CCCHHHHHHHHHHHCCC Q ss_conf 87235899899981998 Q gi|254780670|r 367 ETEDHTIADLAVATNCG 383 (424) Q Consensus 367 ETeD~~iaDLAVg~~a~ 383 (424) -|-|.|-|=|+..+..+ T Consensus 215 GTGD~fsa~laa~l~~g 231 (254) T cd01173 215 GTGDLFAALLLARLLKG 231 (254) T ss_pred CHHHHHHHHHHHHHHCC T ss_conf 59999999999999879 No 96 >COG3854 SpoIIIAA ncharacterized protein conserved in bacteria [Function unknown] Probab=56.24 E-value=15 Score=17.05 Aligned_cols=40 Identities=33% Similarity=0.429 Sum_probs=18.6 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCH----HHHHHH Q ss_conf 45300389999999999987986999507888723----589989 Q gi|254780670|r 336 PNQIGSVSETLNTIEKAQMAGYPSIISHRSGETED----HTIADL 376 (424) Q Consensus 336 ~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD----~~iaDL 376 (424) ...|||--|++ ++..|-..|-.+|.++---+-|| +++.|| T Consensus 224 vDEIGt~~d~~-A~~ta~~~GVkli~TaHG~~iedl~krp~lkdl 267 (308) T COG3854 224 VDEIGTEEDAL-AILTALHAGVKLITTAHGNGIEDLIKRPTLKDL 267 (308) T ss_pred EECCCCHHHHH-HHHHHHHCCCEEEEEECCCCHHHHHCCHHHHHH T ss_conf 83436477799-999998548589995044117776508168998 No 97 >PRK00331 glucosamine--fructose-6-phosphate aminotransferase; Reviewed Probab=55.49 E-value=8.6 Score=18.91 Aligned_cols=33 Identities=24% Similarity=0.391 Sum_probs=28.5 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 522342545300389999999999987986999 Q gi|254780670|r 329 ANAILIKPNQIGSVSETLNTIEKAQMAGYPSII 361 (424) Q Consensus 329 ~NailiK~NQiGTvset~ea~~~a~~~g~~~iv 361 (424) -|.++|=..|-|.-.+|+++++.||+.|..++- T Consensus 335 ~~~l~i~iSQSGET~Dtl~al~~ak~~ga~tia 367 (604) T PRK00331 335 PNTLVIAISQSGETADTLAALRLAKELGAKTLA 367 (604) T ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHCCCCEEE T ss_conf 643899975788837899999999877994898 No 98 >PRK06245 cofG FO synthase subunit 1; Reviewed Probab=55.25 E-value=9 Score=18.76 Aligned_cols=15 Identities=27% Similarity=0.587 Sum_probs=5.8 Q ss_pred HHHHHHHHHHCCCCC Q ss_conf 999999997538531 Q gi|254780670|r 217 LDLIANSIEKAGYNA 231 (424) Q Consensus 217 L~ll~eAi~~aGy~~ 231 (424) ++-|.+.+..+|+++ T Consensus 286 ~~~l~~~~~~~g~~l 300 (336) T PRK06245 286 IDELKEILEEAGWQL 300 (336) T ss_pred HHHHHHHHHHCCCCC T ss_conf 999999999759965 No 99 >PRK05755 DNA polymerase I; Provisional Probab=55.06 E-value=16 Score=16.92 Aligned_cols=13 Identities=31% Similarity=0.455 Sum_probs=4.8 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999987520 Q gi|254780670|r 181 AEVFHTLKKELKS 193 (424) Q Consensus 181 ~ev~~~lk~~L~~ 193 (424) ...|..|++.|++ T Consensus 470 ~~L~~~L~~~L~~ 482 (889) T PRK05755 470 LRLHQVLKPELEE 482 (889) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999986 No 100 >PRK09265 aminotransferase AlaT; Validated Probab=54.94 E-value=16 Score=16.90 Aligned_cols=31 Identities=6% Similarity=0.098 Sum_probs=21.3 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 68656878999999987654486997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) .|..++.+++-++ .+|+.+|.|+-|.|=.++ T Consensus 181 TG~v~~~~~l~~l-~~~a~~~~i~visDEiY~ 211 (404) T PRK09265 181 TGAVYSKELLEEI-VEIARQHNLIIFADEIYD 211 (404) T ss_pred CCCCCCHHHHHHH-HHHHHHCEEEEEECCHHH T ss_conf 1200250468899-997631558998501145 No 101 >PRK07310 consensus Probab=54.89 E-value=16 Score=16.90 Aligned_cols=48 Identities=10% Similarity=0.199 Sum_probs=28.6 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC------CCHHHHHHHHHHCCCCE Q ss_conf 68656878999999987654486997767631------04237998864238836 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE------DDWHGWKILTNKIGSNC 305 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e------~D~~gw~~l~~~~g~~~ 305 (424) .|..++.+++-. +.+||.+|.++-|+|=.+. +.+..+..+.....+++ T Consensus 177 TG~v~s~~~l~~-l~~~a~~~~i~visDEiY~~l~~~g~~~~p~~~~~~~~~~~~ 230 (395) T PRK07310 177 TGMIYSAEELRA-IGEWAVEHDILILADDIYGRLVYNGNQFTPISSLSEAIRKQT 230 (395) T ss_pred CCCCCCCCHHHH-HHHHHHHCCEEEEECCCCEEEECCCCCCCCHHHHHHHHCCCC T ss_conf 562663541467-765565268799943652100148988888888412432573 No 102 >PRK05797 consensus Probab=54.67 E-value=16 Score=16.87 Aligned_cols=18 Identities=6% Similarity=0.281 Sum_probs=7.8 Q ss_pred CCEEEECCCCHHHHHHHH Q ss_conf 522342545300389999 Q gi|254780670|r 329 ANAILIKPNQIGSVSETL 346 (424) Q Consensus 329 ~NailiK~NQiGTvset~ 346 (424) .+++.|.....-.+++.+ T Consensus 484 ~~GI~VD~~~L~~l~~~l 501 (869) T PRK05797 484 SEGFKVDKDILDELSKKF 501 (869) T ss_pred HCCEEECHHHHHHHHHHH T ss_conf 468099999999999999 No 103 >PRK08443 consensus Probab=52.13 E-value=18 Score=16.59 Aligned_cols=30 Identities=17% Similarity=0.265 Sum_probs=20.1 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-++ .++|.+|.++-|+|=.+ T Consensus 175 TG~v~s~~~l~~l-~~~a~~~~~~ii~DE~Y 204 (388) T PRK08443 175 TGSVYSKEELEAI-AKVLKGTDIWVLSDEMY 204 (388) T ss_pred CCCCCCHHHHHHH-HHHHHCCCCEEECCHHH T ss_conf 8847644789999-99862357567422012 No 104 >COG1012 PutA NAD-dependent aldehyde dehydrogenases [Energy production and conversion] Probab=51.46 E-value=5.2 Score=20.55 Aligned_cols=64 Identities=23% Similarity=0.323 Sum_probs=32.7 Q ss_pred CCCCCCEEECCCCCCCCCCCCCEEEEEEC-CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC Q ss_conf 47543000003664556776610256514-88774899989999999999987520000136765576448868 Q gi|254780670|r 139 ILPVPLMNILNGGIHADNALDFQEFMIMP-VGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLK 211 (424) Q Consensus 139 ~lP~P~~NiinGG~HA~~~l~iQEfmIiP-~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~ 211 (424) .+|-=.+|++.|.... --+.++--| +...+|.-+-+.+..|++...+.++. -...-||=.|.+= T Consensus 187 glP~Gv~nvv~g~g~~----~g~~l~~~p~v~~v~FTGSt~~G~~i~~~Aa~~~k~-----~~lELGGk~p~IV 251 (472) T COG1012 187 GLPAGVLNVVTGGGAE----VGDALVAHPDVDAISFTGSTAVGRAIAAAAAANLKP-----VTLELGGKSPAIV 251 (472) T ss_pred CCCCCCEEEEECCCCH----HHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHCCCC-----EEEECCCCCCEEE T ss_conf 7998808999789846----889874299978799988779999999998605981-----8998589886698 No 105 >TIGR03470 HpnH hopanoid biosynthesis associated radical SAM protein HpnH. The sequences represented by this model are members of the radical SAM superfamily of enzymes (pfam04055). These enzymes utilize an iron-sulfur redox cluster and S-adenosylmethionine to carry out diverse radical mediated reactions. The members of this clade are frequently found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. The linkage between SHC and this radical SAM enzyme is strong; one is nearly always observed in the same genome where the other is found. A hopanoid biosynthesis locus was described in Zymomonas mobilis consisting of the genes HpnA-E and SHC (HpnF). Continuing past SHC are found a phosphorylase enzyme (ZMO0873, i.e. HpnG, TIGR03468) and this radical SAM enzyme (ZMO0874) which we name here HpnH. Granted, in Z. mobilis, HpnH is in a convergent orientation with respect to HpnA-G, but one gene beyond HpnH Probab=51.14 E-value=18 Score=16.49 Aligned_cols=118 Identities=13% Similarity=0.102 Sum_probs=55.4 Q ss_pred CCCHHHHHHHHHHHHHHCCEEEEE--CCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCC---EEEE Q ss_conf 568789999999876544869977--67631042379988642388368504300002148885210038652---2342 Q gi|254780670|r 260 ELQPNEMASYLANLVDQYPIYSIE--DGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVAN---AILI 334 (424) Q Consensus 260 ~~t~~elid~y~~l~~~yPI~sIE--DP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~N---aili 334 (424) .+|.+|.++-..++- =|+++|. +||=..|+..-..-....|..+.+ +||--.+++-++.-.-. .+-| T Consensus 58 ~Ls~ee~~~~~~e~G--ap~V~itGGEPLLr~dl~eIv~~a~~~g~~v~l------~TNG~Ll~k~i~~~~~~~~~~~~V 129 (318) T TIGR03470 58 RLSVEECLRAVDECG--APVVSIPGGEPLLHPEIDEIVRGLVARKKFVYL------CTNALLLEKKLDKFEPSPYLTFSV 129 (318) T ss_pred CCCHHHHHHHHHHCC--CCEEEECCCCCCCCCCHHHHHHHHHHCCCEEEE------ECCHHHHHHHHHHHHHCCCCEEEE T ss_conf 389999999999849--978995188745564799999999975997999------775520099999985188836999 Q ss_pred -----------CCCCHHHHHHHHHHHHHHHHCCCEEEEECCC--CCCCHHH--HHHHHHHHCCCHH Q ss_conf -----------5453003899999999999879869995078--8872358--9989998199801 Q gi|254780670|r 335 -----------KPNQIGSVSETLNTIEKAQMAGYPSIISHRS--GETEDHT--IADLAVATNCGQI 385 (424) Q Consensus 335 -----------K~NQiGTvset~ea~~~a~~~g~~~ivShRS--GETeD~~--iaDLAVg~~a~~i 385 (424) -..+-|+--.++++++.|++.|+.+-+..-- ++..+.. +-|++-.+|..-| T Consensus 130 sLDG~~e~HD~~r~~~G~Fd~av~aIr~ak~~G~~V~iN~Tvf~~~n~~~i~~~~d~~~~lgVdgi 195 (318) T TIGR03470 130 HLDGLREHHDASVCREGVFDRAVEAIREAKARGFRVTTNTTLFNDTDPEEVAEFFDYLTDLGVDGM 195 (318) T ss_pred ECCCCHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHCCCCEE T ss_conf 801787886688717977999999999999869946799897068999999999999987699738 No 106 >cd06314 PBP1_tmGBP Periplasmic sugar-binding domain of Thermotoga maritima glucose-binding protein (tmGBP) and its close homologs. Periplasmic sugar-binding domain of Thermotoga maritima glucose-binding protein (tmGBP) and its close homologs from other bacteria. They are a member of the type I periplasmic binding protein superfamily which consists of two domains connected by a three-stranded hinge. TmGBP is specific for glucose and its binding pocket is buried at the interface of the two domains. TmGBP also exhibits high thermostability and the highest structural similarity to E. coli glucose binding protein (ecGBP). Probab=50.17 E-value=19 Score=16.38 Aligned_cols=77 Identities=13% Similarity=0.134 Sum_probs=44.8 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHHHHHHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEE-EECC Q ss_conf 687899999998765448-699776763104237998864238--836850430000214888521003865223-4254 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDWHGWKILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAI-LIKP 336 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~gw~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~Nai-liK~ 336 (424) .+.++-.....+|+..|| |-.|--..+.... +-....+..| +++.|+|=|......+.+++|. ..+. .=.| T Consensus 161 ~~~~~a~~~~~~~L~~~pdi~~i~~~~~~~a~-ga~~Al~~~g~~~~i~vvg~D~~~~~~~~i~~G~----i~~tv~Q~p 235 (271) T cd06314 161 EDFAKAKSNAEDALNAHPDLKCMFGLYAYNGP-AIAEAVKAAGKLGKVKIVGFDEDPDTLQGVKEGT----IQGTVVQRP 235 (271) T ss_pred CCHHHHHHHHHHHHHHCCCCCEEEEECCCCHH-HHHHHHHHHCCCCCCEEEEECCCHHHHHHHHCCC----CEEEEECCH T ss_conf 63889999999999758998789980885169-9999999847589967998789899999998299----859983899 Q ss_pred CCHHHH Q ss_conf 530038 Q gi|254780670|r 337 NQIGSV 342 (424) Q Consensus 337 NQiGTv 342 (424) -.+|-. T Consensus 236 ~~~G~~ 241 (271) T cd06314 236 YQMGYL 241 (271) T ss_pred HHHHHH T ss_conf 999999 No 107 >PRK06507 consensus Probab=50.04 E-value=19 Score=16.37 Aligned_cols=31 Identities=23% Similarity=0.362 Sum_probs=20.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHCC-EEEEECCCCC Q ss_conf 6865687899999998765448-6997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYP-IYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yP-I~sIEDP~~e 288 (424) .|..++.+++-++ .+++.+|+ ++-|.|=.+. T Consensus 177 TG~v~s~~~l~~i-~~~a~~~~~v~visDEiY~ 208 (400) T PRK06507 177 TGAACTRAEMAAI-AEVMLRHPHVWIMTDDIYE 208 (400) T ss_pred HHHHHHHHHHHHH-HHHHHHCCCCEECCCCCHH T ss_conf 1288647799999-9866531682140566424 No 108 >pfam03102 NeuB NeuB family. NeuB is the prokaryotic N-acetylneuraminic acid (Neu5Ac) synthase. It catalyses the direct formation of Neu5Ac (the most common sialic acid) by condensation of phosphoenolpyruvate (PEP) and N-acetylmannosamine (ManNAc). This reaction has only been observed in prokaryotes; eukaryotes synthesize the 9-phosphate form, Neu5Ac-9-P, and utilize ManNAc-6-P instead of ManNAc. Such eukaryotic enzymes are not present in this family. This family also contains SpsE spore coat polysaccharide biosynthesis proteins. Probab=49.97 E-value=19 Score=16.36 Aligned_cols=120 Identities=16% Similarity=0.177 Sum_probs=69.7 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCC-CCEEEECCCEEECCHHHHHHHHHCCCCCEEEEC Q ss_conf 6865687899999998765448699776763104237998864238-836850430000214888521003865223425 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIG-SNCQLVGDDLFVTNPERLHKGINEEVANAILIK 335 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g-~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK 335 (424) +..+++.+++.++ .+.|++..|..+-.||++++.+ +..+++ +.+-|-..|+ +|...|++-.+.+ .-++|. T Consensus 49 kk~els~~~~~~l-~~~~kk~gi~f~~t~fd~~s~~----~l~~l~~~~~KIaS~d~--~n~~Li~~i~k~~--kpiiiS 119 (240) T pfam03102 49 KKLELPEEWHKEL-FEYCKEKGIEFFSTPFDLESVD----FLESLGVPAYKIASGEI--TNLPLLRYIAKTG--KPVILS 119 (240) T ss_pred HHHCCCHHHHHHH-HHHHHHCCCCEEECCCCHHHHH----HHHHCCCCEEEECCCCC--CCHHHHHHHHHCC--CCEEEE T ss_conf 9826899999999-9999982996898989899999----88755888699898864--6589999999739--968995 Q ss_pred CCCHHHHHHHHHHHHHHHHCCC-EEEEEC-C----------------------------CCCCCHHHHHHHHHHHCCCHH Q ss_conf 4530038999999999998798-699950-7----------------------------888723589989998199801 Q gi|254780670|r 336 PNQIGSVSETLNTIEKAQMAGY-PSIISH-R----------------------------SGETEDHTIADLAVATNCGQI 385 (424) Q Consensus 336 ~NQiGTvset~ea~~~a~~~g~-~~ivSh-R----------------------------SGETeD~~iaDLAVg~~a~~i 385 (424) . -..|+.|..++++..++.+. .+++=| - |.-|.+...+=.|++.||..| T Consensus 120 T-G~s~~~eI~~~i~~~~~~~~~~i~llhCvS~YPt~~~~~nL~~I~~lk~~f~~~iG~SdHs~g~~~~~~A~~~GA~ii 198 (240) T pfam03102 120 T-GMATLEEIEEAVETLREAGNEDITLLHCTSEYPAPFEDVNLRAIPTLKEAFGVPVGYSDHTLGIEAPIAAVALGASVI 198 (240) T ss_pred C-CCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHCCCCEEECCCCCCCHHHHHHHHCCCCEE T ss_conf 7-988899999999999963456776866588899974660099988999867996884798897077799998299399 Q ss_pred E Q ss_conf 0 Q gi|254780670|r 386 K 386 (424) Q Consensus 386 K 386 (424) - T Consensus 199 E 199 (240) T pfam03102 199 E 199 (240) T ss_pred E T ss_conf 9 No 109 >PRK05464 Na(+)-translocating NADH-quinone reductase subunit F; Provisional Probab=49.52 E-value=16 Score=16.97 Aligned_cols=35 Identities=23% Similarity=0.506 Sum_probs=27.8 Q ss_pred CCHHH--HHHHHHHHHHHCC----EEEEECCCCCCCHHHHH Q ss_conf 68789--9999998765448----69977676310423799 Q gi|254780670|r 261 LQPNE--MASYLANLVDQYP----IYSIEDGMSEDDWHGWK 295 (424) Q Consensus 261 ~t~~e--lid~y~~l~~~yP----I~sIEDP~~e~D~~gw~ 295 (424) .+.+| +.|.+..|.++|| +..+-||..|++|.|++ T Consensus 312 Rs~~DL~y~de~~~La~e~pNF~y~~aLS~p~~ed~W~G~t 352 (408) T PRK05464 312 RSLREMFYQEDFDQLAAENPNFVWHVALSDPLPEDNWTGYT 352 (408) T ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCC T ss_conf 76576233799999998789957999978889666888873 No 110 >cd06334 PBP1_ABC_ligand_binding_like_1 Type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. This subgroup includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. Members of this group are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters, such as leucine-isoleucine-valine-binding protein (LIVBP); however their ligand specificity has not been determined experimentally. Probab=49.34 E-value=20 Score=16.29 Aligned_cols=97 Identities=10% Similarity=0.042 Sum_probs=53.5 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHH Q ss_conf 89999999999753853111023431104653202785111146865687899999998765448-69977676310423 Q gi|254780670|r 214 DSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYP-IYSIEDGMSEDDWH 292 (424) Q Consensus 214 eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~ 292 (424) ...++.+.++.++.|++.-.+ +.| .....++...+.++-..-| .+.+--. -.|-. T Consensus 154 ~~~~~~~~~~~~~~G~~vv~~----------~~~------------~~~~~Dft~~l~~i~~a~pD~V~~~~~--~~~~~ 209 (351) T cd06334 154 KEPIEALKALAEKLGFEVVLE----------PVP------------PPGPNDQKAQWLQIRRSGPDYVILWGW--GVMNP 209 (351) T ss_pred HHHHHHHHHHHHHCCCEEEEE----------EEC------------CCCCCCHHHHHHHHHHCCCCEEEEECC--CHHHH T ss_conf 899999999999769979888----------806------------999835899999999769899999377--37899 Q ss_pred HHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCC Q ss_conf 799886423883685043000021488852100386522342545 Q gi|254780670|r 293 GWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPN 337 (424) Q Consensus 293 gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~N 337 (424) .+.+=..++|-+.+++|.+- ..++..++.+- .++++++.-.+ T Consensus 210 ~~~kqa~~~G~~~~~ig~~~-~~~~~~~~~aG--~aa~G~~~~~~ 251 (351) T cd06334 210 VAIKEAKRVGLDDKFIGNWW-SGDEEDVKPAG--DAAKGYKGVTP 251 (351) T ss_pred HHHHHHHHCCCCCCEEEEEC-CCCHHHHHHHH--HHCCCEEEECC T ss_conf 99999997599985797516-78899999721--11177389525 No 111 >pfam12103 Lipl32 Surface lipoprotein of Spirochaetales order. Lipl32 is an outer membrane surface lipoprotein of Leptospira like bacteria. Probab=48.87 E-value=4.9 Score=20.76 Aligned_cols=43 Identities=19% Similarity=0.194 Sum_probs=16.4 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHH Q ss_conf 522342545300389999999999987986999507888723589989 Q gi|254780670|r 329 ANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADL 376 (424) Q Consensus 329 ~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDL 376 (424) .|+++--..||++-+.++ .+..=.....+-++||.+.+|.|.. T Consensus 130 YNSL~R~~Sei~sPtKaL-----vrGlYRIsFTtyktgevkGsF~Atv 172 (183) T pfam12103 130 YNSLTRIKSEISSPTKAL-----VRGLYRISFTTYKTGEVKGSFVATV 172 (183) T ss_pred HHHHHHHHHHCCCCHHHH-----HHHEEEEEEEEECCCEEEEEEEEEE T ss_conf 145666664248922877-----6314577778631441534799984 No 112 >PRK08434 consensus Probab=48.83 E-value=20 Score=16.24 Aligned_cols=12 Identities=33% Similarity=0.614 Sum_probs=5.9 Q ss_pred HHHHHHHHCCEE Q ss_conf 999876544869 Q gi|254780670|r 269 YLANLVDQYPIY 280 (424) Q Consensus 269 ~y~~l~~~yPI~ 280 (424) .+.+|..+|||+ T Consensus 577 vLe~L~~~hpi~ 588 (887) T PRK08434 577 VLNALLDKHPVI 588 (887) T ss_pred HHHHHHHCCCCH T ss_conf 999988738738 No 113 >PRK08654 pyruvate carboxylase subunit A; Validated Probab=48.56 E-value=13 Score=17.59 Aligned_cols=24 Identities=4% Similarity=0.072 Sum_probs=11.4 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHC Q ss_conf 999999997538531110234311 Q gi|254780670|r 217 LDLIANSIEKAGYNAGKDLLIALD 240 (424) Q Consensus 217 L~ll~eAi~~aGy~~g~di~i~lD 240 (424) .+.-..+.+.+||...-.|-+.+| T Consensus 256 ~~~Av~la~~vgY~gaGTVEFl~d 279 (497) T PRK08654 256 GEAAVKAAKAINYENAGTVEFLYS 279 (497) T ss_pred HHHHHHHHHHHCEECCCEEEEEEE T ss_conf 999999888734302332799874 No 114 >PRK07543 consensus Probab=48.52 E-value=20 Score=16.20 Aligned_cols=27 Identities=11% Similarity=0.359 Sum_probs=14.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHCC-EEEEEC Q ss_conf 6865687899999998765448-699776 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYP-IYSIED 284 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yP-I~sIED 284 (424) .|..++.+++-++ .+++.+|+ +.-|.| T Consensus 177 TG~v~s~~~l~~l-~~~~~~~~~v~iisD 204 (400) T PRK07543 177 TGAAYTRAELKAL-TDVLVKHPHVWVLTD 204 (400) T ss_pred CCCCCCHHHHHHH-HHHHHHHCCCEECCH T ss_conf 8837641678999-998887538603131 No 115 >TIGR02525 plasmid_TraJ plasmid transfer ATPase TraJ; InterPro: IPR013364 Proteins in this entry are predicted ATPases associated with plasmid transfer loci in bacteria. This family is most similar to the DotB ATPase (IPR013363 from INTERPRO) of a type-IV secretion-like system of the obligate intracellular pathogens Legionella pneumophila and Coxiella burnetii.. Probab=48.50 E-value=18 Score=16.47 Aligned_cols=24 Identities=25% Similarity=0.434 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHCC---EEEEECCC Q ss_conf 7899999998765448---69977676 Q gi|254780670|r 263 PNEMASYLANLVDQYP---IYSIEDGM 286 (424) Q Consensus 263 ~~elid~y~~l~~~yP---I~sIEDP~ 286 (424) |..|...|.-..+.|| |+-+|||. T Consensus 163 STl~AaiY~~~l~t~pdRKivT~EDPv 189 (374) T TIGR02525 163 STLAAAIYRHCLETYPDRKIVTYEDPV 189 (374) T ss_pred HHHHHHHHHHHCCCCCCCEEEEEECCE T ss_conf 999999999850748897079865772 No 116 >PRK05794 consensus Probab=48.21 E-value=20 Score=16.17 Aligned_cols=30 Identities=13% Similarity=0.256 Sum_probs=21.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-++ .+||.+|.++-|+|=.+ T Consensus 176 tG~v~s~~~l~~l-~~la~~~~i~visDEiY 205 (397) T PRK05794 176 TGTVYSKEELEEI-AKFAKEHDLFIISDEIY 205 (397) T ss_pred CCCCCCCHHHHHH-HHHHHHCEEEEECHHHH T ss_conf 7824350679999-97654103688433345 No 117 >pfam00682 HMGL-like HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase. Probab=48.04 E-value=20 Score=16.15 Aligned_cols=13 Identities=15% Similarity=0.067 Sum_probs=5.1 Q ss_pred CCHHHHHHHHHHH Q ss_conf 6878999999987 Q gi|254780670|r 261 LQPNEMASYLANL 273 (424) Q Consensus 261 ~t~~elid~y~~l 273 (424) +|.++.++.+.+. T Consensus 101 ~s~~~~l~~~~~~ 113 (237) T pfam00682 101 KDREEVADRAVAA 113 (237) T ss_pred CCHHHHHHHHHHH T ss_conf 8999999999999 No 118 >pfam01380 SIS SIS domain. SIS (Sugar ISomerase) domains are found in many phosphosugar isomerases and phosphosugar binding proteins. SIS domains are also found in proteins that regulate the expression of genes involved in synthesis of phosphosugars. Presumably the SIS domains bind to the end-product of the pathway. Probab=46.81 E-value=16 Score=16.87 Aligned_cols=57 Identities=18% Similarity=0.130 Sum_probs=41.7 Q ss_pred CEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHE Q ss_conf 223425453003899999999999879869995078887235899899981998010 Q gi|254780670|r 330 NAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIK 386 (424) Q Consensus 330 NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK 386 (424) |.++|=..|-|.-.|++++++.|++.|..+|.=-...++.=.-.+|+.+-+.++--+ T Consensus 54 ~d~vi~iS~sG~t~~~~~~~~~a~~~g~~~i~iT~~~~S~la~~ad~~l~~~~~~e~ 110 (131) T pfam01380 54 DDLVIAISQSGETRDLLEAAKLLKARGAKIIAITDSKGSPLAREADHVLYIIAGPEA 110 (131) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHCCEEEECCCCCEE T ss_conf 999999548986576898789999849969999899999789968998987998756 No 119 >PRK13236 nitrogenase reductase; Reviewed Probab=46.62 E-value=21 Score=16.00 Aligned_cols=135 Identities=12% Similarity=0.073 Sum_probs=83.2 Q ss_pred ECCCCCCCCCCCC-CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 0036645567766-102565148877489998999999999998752000013676557644886846899999999997 Q gi|254780670|r 147 ILNGGIHADNALD-FQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIE 225 (424) Q Consensus 147 iinGG~HA~~~l~-iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~ 225 (424) |..||=-..-.-. -||+.|+-.|. ..+|..+-+|.+.+++.-... -.--||...|-.+.+.-.+++.+-.+ T Consensus 133 VVCGGFAmPir~g~A~evyiVtSge---~malyAANNI~~~i~~~a~~g-----~~rlgGiI~N~r~~~~e~~~v~~fa~ 204 (295) T PRK13236 133 VVCGGFAMPIREGKAQEIYIVTSGE---MMAMYAANNIARGILKYAHTG-----GVRLGGLICNSRNVDREIELIETLAK 204 (295) T ss_pred CCCCCCCCCCCCCCCCEEEEEECCC---HHHHHHHHHHHHHHHHHHHCC-----CCCEEEEEEECCCCCCHHHHHHHHHH T ss_conf 5326754656678764899995681---889999999999999997426-----97035899607888747999999999 Q ss_pred HCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHH---CCEEEEECCCCCCCHHHHH Q ss_conf 538531110234311046532027851111468656878999999987654---4869977676310423799 Q gi|254780670|r 226 KAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQ---YPIYSIEDGMSEDDWHGWK 295 (424) Q Consensus 226 ~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~---yPI~sIEDP~~e~D~~gw~ 295 (424) +.|-..=-.|--...+.-+|++..---.+.-+ .++.+.|.+|.++ -+-..+-.|+..|..+.|- T Consensus 205 ~~gt~ii~~iPr~~~V~~ae~~~~TviE~~P~------s~~A~~Yr~LA~~i~~n~~~~vP~Pl~~~eL~~l~ 271 (295) T PRK13236 205 RLNTQMIHFVPRDNIVQHAELRRMTVNEYAPD------SNQGNEYRILAKKIINNDKLTIPTPIEMEELEELL 271 (295) T ss_pred HHCCCEEEECCCCHHHHHHHHCCCEEEEECCC------CHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH T ss_conf 81992699657838888999739876887889------77999999999999858998678989988999999 No 120 >PRK07337 aminotransferase; Validated Probab=46.60 E-value=21 Score=16.00 Aligned_cols=31 Identities=16% Similarity=0.304 Sum_probs=22.7 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 68656878999999987654486997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) .|..++.+|+-+. .++|.+|.++-|+|=.+. T Consensus 176 tG~v~s~~el~~i-~~~a~~~~~~vIsDEiY~ 206 (388) T PRK07337 176 TGTSIEPDELRRI-VEAVRARGGFTIVDEIYQ 206 (388) T ss_pred CCEEEEHHHHHHH-HHHHHCCCEEEECCCCCC T ss_conf 8856267774335-544314580884576410 No 121 >PRK08068 transaminase; Reviewed Probab=46.38 E-value=22 Score=15.97 Aligned_cols=30 Identities=7% Similarity=-0.094 Sum_probs=20.7 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-++ .+|+.+|.++-|+|=.+ T Consensus 180 TG~v~~~~~l~~l-~~la~~~~i~visDEiY 209 (389) T PRK08068 180 TGAVATKAFFEET-VAFAKKHNIGVVHDFAY 209 (389) T ss_pred CCHHCCHHHHHHH-HHHHHCCCEEEEECCCC T ss_conf 4612059999999-99871077799962652 No 122 >PRK07568 aspartate aminotransferase; Provisional Probab=45.59 E-value=22 Score=15.89 Aligned_cols=32 Identities=13% Similarity=0.314 Sum_probs=21.0 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 686568789999999876544869977676310 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) .|..++.+++-.+ .+|+.+|.++-|+|=.+.+ T Consensus 175 TG~v~s~~~~~~l-~~la~~~~~~ii~De~Y~~ 206 (396) T PRK07568 175 TGVVYTKEELRQL-ADIAKEHDLFLISDEVYRE 206 (396) T ss_pred CCCCCCHHHHHHH-HHHHHHHCEEEECCCCHHH T ss_conf 8754015789999-9999871066742453555 No 123 >PRK05929 consensus Probab=45.50 E-value=22 Score=15.88 Aligned_cols=12 Identities=25% Similarity=0.224 Sum_probs=5.3 Q ss_pred HHHHHHHHHCCE Q ss_conf 999987654486 Q gi|254780670|r 268 SYLANLVDQYPI 279 (424) Q Consensus 268 d~y~~l~~~yPI 279 (424) +.+.+|..+||| T Consensus 555 evLe~L~~~hpi 566 (870) T PRK05929 555 EVLEALEGEHEI 566 (870) T ss_pred HHHHHHHCCCHH T ss_conf 999974060026 No 124 >TIGR01487 SPP-like SPP-like hydrolase, Archaeal; InterPro: IPR006382 This group of Archaeal sequences is most closely related to the sucrose-phosphate phosphatases (SPP) from plants and cyanobacteria. If it should be shown that the proteins in this family possess sucrose-phosphatase activity then SPP would represent a broad family of functionally related proteins.. Probab=45.16 E-value=15 Score=17.10 Aligned_cols=14 Identities=14% Similarity=0.328 Sum_probs=5.3 Q ss_pred HHHHHHHHHHCCCC Q ss_conf 99999999753853 Q gi|254780670|r 217 LDLIANSIEKAGYN 230 (424) Q Consensus 217 L~ll~eAi~~aGy~ 230 (424) ++.+.+-++..|.. T Consensus 124 ~d~~~~~~~~~gl~ 137 (223) T TIGR01487 124 VDEVRKIAKERGLE 137 (223) T ss_pred HHHHHHHHHHCCEE T ss_conf 88999998848848 No 125 >COG2084 MmsB 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases [Lipid metabolism] Probab=44.93 E-value=23 Score=15.82 Aligned_cols=93 Identities=17% Similarity=0.250 Sum_probs=47.1 Q ss_pred CCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCC------------------CCCCHHHHHHHHHHCCCCEEEECC Q ss_conf 78511114686568789999999876544869977676------------------310423799886423883685043 Q gi|254780670|r 249 KEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGM------------------SEDDWHGWKILTNKIGSNCQLVGD 310 (424) Q Consensus 249 ~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~------------------~e~D~~gw~~l~~~~g~~~~IvGD 310 (424) +++...+.. ..+++.-.+... .+++..+.+++=|. ++++++--.-+-+.+|.++.-+|+ T Consensus 87 ~G~i~IDmS--Tisp~~a~~~a~-~~~~~G~~~lDAPVsGg~~~A~~GtLtimvGG~~~~f~r~~pvl~~~g~~i~~~G~ 163 (286) T COG2084 87 PGAIVIDMS--TISPETARELAA-ALAAKGLEFLDAPVSGGVPGAAAGTLTIMVGGDAEAFERAKPVLEAMGKNIVHVGP 163 (286) T ss_pred CCCEEEECC--CCCHHHHHHHHH-HHHHCCCCEEECCCCCCCHHHHHCCEEEEECCCHHHHHHHHHHHHHHCCCEEEECC T ss_conf 997899878--999899999999-99966986894676588312320716999479999999889999986075698789 Q ss_pred CEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 00002148885210038652234254530038999999999998798 Q gi|254780670|r 311 DLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGY 357 (424) Q Consensus 311 DL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~ 357 (424) . -.|--.|.||.+++..+ +--..|++.++++.|. T Consensus 164 ~---------G~G~~~Kl~nn~l~~~~----~~a~aEAl~la~k~Gl 197 (286) T COG2084 164 V---------GAGQAAKLANNILLAGN----IAALAEALALAEKAGL 197 (286) T ss_pred C---------CHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHCCC T ss_conf 8---------70599999999999999----9999999999998097 No 126 >PRK08786 consensus Probab=44.76 E-value=23 Score=15.80 Aligned_cols=18 Identities=22% Similarity=0.370 Sum_probs=7.2 Q ss_pred CCEEEECCCCHHHHHHHH Q ss_conf 522342545300389999 Q gi|254780670|r 329 ANAILIKPNQIGSVSETL 346 (424) Q Consensus 329 ~NailiK~NQiGTvset~ 346 (424) .+++.|.....-.+++.+ T Consensus 541 ~~GI~vD~~~L~~l~~~l 558 (927) T PRK08786 541 ANGVCVDAAELRRQSADL 558 (927) T ss_pred HCCEEECHHHHHHHHHHH T ss_conf 568798899999999999 No 127 >PRK05957 aspartate aminotransferase; Provisional Probab=44.53 E-value=23 Score=15.78 Aligned_cols=31 Identities=16% Similarity=0.249 Sum_probs=20.6 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 68656878999999987654486997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) .|..++.+++-+ +.+||.+|.++-|+|=.+. T Consensus 173 TG~v~s~~~l~~-l~~la~~~~~~ii~DE~Y~ 203 (389) T PRK05957 173 TGVVYPEALLRA-VNQICAEHGIYHISDEAYE 203 (389) T ss_pred CCCCCCHHHHHH-HHHHHCCCCCEEEECCCCC T ss_conf 898876778999-9987606781899898432 No 128 >COG2024 Phenylalanyl-tRNA synthetase alpha subunit (archaeal type) [Translation, ribosomal structure and biogenesis] Probab=44.39 E-value=22 Score=15.89 Aligned_cols=57 Identities=16% Similarity=0.388 Sum_probs=45.5 Q ss_pred HHHHHHC--C--EEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCC Q ss_conf 9876544--8--6997767631042379988642388368504300002148885210038 Q gi|254780670|r 271 ANLVDQY--P--IYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEE 327 (424) Q Consensus 271 ~~l~~~y--P--I~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~ 327 (424) ..+.+++ | +.||.-.|+.+-.+.-+.|+.-.+..|.||++|+.|-.-+-+..|+-.+ T Consensus 198 s~i~~r~~~PlklFSIDRCFRREQ~ED~shLmtYhSASCVvvde~vtvD~GKaVAEglL~q 258 (536) T COG2024 198 SEILKREDPPLKLFSIDRCFRREQREDASHLMTYHSASCVVVDEDVTVDDGKAVAEGLLRQ 258 (536) T ss_pred HHHHHCCCCCCEEEEHHHHHHHHHHCCHHHHHHHCCCEEEEECCCCCCCCCHHHHHHHHHH T ss_conf 9887234898416425677665531015666432034079975864435438999999997 No 129 >PRK05586 biotin carboxylase; Validated Probab=44.37 E-value=17 Score=16.70 Aligned_cols=57 Identities=9% Similarity=0.183 Sum_probs=25.4 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECC------CC---CCCCHHHHHHHHHHHHHHCCE Q ss_conf 9999999975385311102343110465320278511114------68---656878999999987654486 Q gi|254780670|r 217 LDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKG------EN---LELQPNEMASYLANLVDQYPI 279 (424) Q Consensus 217 L~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~------~~---~~~t~~elid~y~~l~~~yPI 279 (424) .+.-....+.+||..--.|-+-+| ...+|| .++. +. -..|.-+|+..=..+..--|+ T Consensus 256 ~~~A~~la~~vgY~gaGTvEFl~d-~~~~fy-----FlEvNtRlQVEH~VTE~vtGvDLV~~Qi~iA~G~~L 321 (447) T PRK05586 256 GEIAVKAAKAVNYKNAGTIEFLLD-KDGNFY-----FMEMNTRIQVEHPITEMITGVDLVKEQIKIAYGEKL 321 (447) T ss_pred HHHHHHHHHHCCCCCCCEEEEEEC-CCCCEE-----EEEEECCCCCCCCCEEECCCCCHHHHHHHHHCCCCC T ss_conf 999998888606033324788875-899789-----993343555566411000377689999998679999 No 130 >PRK06749 replicative DNA helicase; Provisional Probab=44.14 E-value=23 Score=15.74 Aligned_cols=137 Identities=18% Similarity=0.198 Sum_probs=64.4 Q ss_pred HHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECC---CCCCCHHHHHHHHHHCCCCEEEECCC-- Q ss_conf 4311046532027851111468656878999999987654486997767---63104237998864238836850430-- Q gi|254780670|r 237 IALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDG---MSEDDWHGWKILTNKIGSNCQLVGDD-- 311 (424) Q Consensus 237 i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP---~~e~D~~gw~~l~~~~g~~~~IvGDD-- 311 (424) +++++|-+-. +.++-.+-+ ..+.+.+|+.+-+..-...-|...|.+| +..+||+-..+...++.+.-..+=|. T Consensus 202 ~alnia~~~a-~~g~~v~~f-SlEMs~~~l~~R~ls~~s~v~~~~i~~~~~~~~~~~~~~~~~a~~~l~~~~l~i~d~~~ 279 (428) T PRK06749 202 FALNVGLHAA-KSGAAVGLF-SLEMSSKQLLKRMASCVGEVSGGRLKNPKHRFAMEDWEKVSKAFAEIGELPLEIYDNAG 279 (428) T ss_pred HHHHHHHHHH-HCCCCEEEE-ECCCCHHHHHHHHHHHHCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCEEEECCCC T ss_conf 9999999999-649927998-37899999999999975499888862776779999999999999998559659975899 Q ss_pred EEECCHHHHHHHHHCCCCC---EEEE------C---------CCCHHHHHHHHHHHHHHHHCCCEEEEEC---CCCCC-C Q ss_conf 0002148885210038652---2342------5---------4530038999999999998798699950---78887-2 Q gi|254780670|r 312 LFVTNPERLHKGINEEVAN---AILI------K---------PNQIGSVSETLNTIEKAQMAGYPSIISH---RSGET-E 369 (424) Q Consensus 312 L~vTn~~rl~~gi~~~~~N---aili------K---------~NQiGTvset~ea~~~a~~~g~~~ivSh---RSGET-e 369 (424) +.++..+---+-+..+..+ -++| + ..|++.+|..+. .+|++.+.-+|+-+ |.-|. . T Consensus 280 ~ti~~i~~~~r~~~~~~g~~~~livIDYlqLi~~~~~~~~~r~~ev~~isr~lK--~lAkel~vpvi~lsQLnR~~e~r~ 357 (428) T PRK06749 280 VTVQDIWMQTRKLKRKHGDKKVLIIVDYLQLITGDPKHKGNRFQEISEISRKLK--LLARELNVCVVALSQLSRSVESRQ 357 (428) T ss_pred CCHHHHHHHHHHHHHHCCCCCEEEEEEHHHHCCCCCCCCCCHHHHHHHHHHHHH--HHHHHHCCCEEEECCCCCCCCCCC T ss_conf 767999999999999749987699976776505787777789999999999999--999996998999713785523378 Q ss_pred H--HHHHHHH Q ss_conf 3--5899899 Q gi|254780670|r 370 D--HTIADLA 377 (424) Q Consensus 370 D--~~iaDLA 377 (424) | ..++||- T Consensus 358 dkrP~lsDLr 367 (428) T PRK06749 358 DKRPLLSDLR 367 (428) T ss_pred CCCCCCCHHC T ss_conf 9997601210 No 131 >PRK12581 oxaloacetate decarboxylase; Provisional Probab=44.13 E-value=23 Score=15.74 Aligned_cols=29 Identities=28% Similarity=0.302 Sum_probs=14.1 Q ss_pred HHHHHHHHHHHHCC-CCCCCCCCCCCCCCC Q ss_conf 99999998752000-013676557644886 Q gi|254780670|r 182 EVFHTLKKELKSKG-YSTNVGDEGGFSPNL 210 (424) Q Consensus 182 ev~~~lk~~L~~~g-~~t~vgDEGGfap~~ 210 (424) +-|-.+-+.|.+.| ....+.|-.|...|. T Consensus 163 ~yy~~~ak~l~~~Gad~I~iKDmaGlL~P~ 192 (468) T PRK12581 163 NYYLSLVKELVEMGADSICIKDMAGILTPK 192 (468) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCHH T ss_conf 999999999997399989984787776889 No 132 >TIGR02090 LEU1_arch isopropylmalate/citramalate/homocitrate synthases; InterPro: IPR011830 Methanogenic archaea contain three closely related homologs of the 2-isopropylmalate synthases (LeuA) represented by IPR005671 from INTERPRO. Two of these in Methanococcus janaschii (MJ1392 - CimA ; MJ0503 - AksA ) have been characterised as catalyzing alternative reactions leaving the third (MJ1195) as the presumptive LeuA enzyme. CimA is citramalate (2-methylmalate) synthase, which condenses acetyl-CoA with pyruvate. This enzyme is believed to be involved in the biosynthesis of isoleucine in methanogens and possibly other species lacking threonine dehydratase. AksA is a homocitrate synthase which also produces (homo)2-citrate and (homo)3-citrate in the biosynthesis of Coenzyme B which is restricted solely to methanogenic archaea. Methanogens, then should and apparently do contain all three of these enzymes. Unfortunately, phylogenetic trees do not resolve into three unambiguous clades, making assignment of function to particular genes problematic. Other archaea, which lack a threonine dehydratase (mainly Euryarchaeota), should contain both CimA and LeuA genes. This is true for archaeoglobus fulgidis, but not for the Pyrococci which have none in this clade, but one in IPR005671 from INTERPRO and one in IPR005675 from INTERPRO which may fulfil these roles. Proteins from other species, which have only one hit to this entry and lack threonine dehydratase, are very likely to be LeuA enzymes.; GO: 0046912 transferase activity transferring acyl groups acyl groups converted into alkyl on transfer, 0019752 carboxylic acid metabolic process. Probab=43.94 E-value=23 Score=15.71 Aligned_cols=104 Identities=18% Similarity=0.259 Sum_probs=78.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC---CCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEE Q ss_conf 6865687899999998765448699776763---1042379988642388368504300002148885210038652234 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS---EDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAIL 333 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~---e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~Nail 333 (424) .|..||++|=++.-.+| ++-.+=-||=+|. |-|+++-++++++.|.+..|+| |-.-+++=|.++|+-+.-.=.+ T Consensus 15 PGVsLT~EqK~~IA~KL-DeLGVDvIEAGfpi~S~GE~~aiK~I~~~vGLnAEI~~--l~RA~k~DID~AidcgvdsIh~ 91 (371) T TIGR02090 15 PGVSLTVEQKVEIARKL-DELGVDVIEAGFPIASEGEFEAIKKIAEEVGLNAEICS--LARALKKDIDKAIDCGVDSIHT 91 (371) T ss_pred CCCCCCHHHHHHHHHHH-HHCCCCEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEE--EECCCCCCHHHHHHCCCCEEEE T ss_conf 69568865689999999-74698288547631451457899999986289635510--1026731001564369877899 Q ss_pred E----------CCCC---HHHHHHHHHHHHHHHHCCCEEEEEC Q ss_conf 2----------5453---0038999999999998798699950 Q gi|254780670|r 334 I----------KPNQ---IGSVSETLNTIEKAQMAGYPSIISH 363 (424) Q Consensus 334 i----------K~NQ---iGTvset~ea~~~a~~~g~~~ivSh 363 (424) + |.+. =-=+--.+||+++||++|..+=.|+ T Consensus 92 fiaTSpiH~KYKl~~K~~devle~~veAvEYAKEHGLiVEfSA 134 (371) T TIGR02090 92 FIATSPIHLKYKLKNKSRDEVLEKAVEAVEYAKEHGLIVEFSA 134 (371) T ss_pred EECCCHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECC T ss_conf 8048857872348887899999999998987752573553177 No 133 >pfam03641 Lysine_decarbox Possible lysine decarboxylase. The members of this family share a highly conserved motif PGGXGTXXE that is probably functionally important. This family includes proteins annotated as lysine decarboxylases, although the evidence for this is not clear. Probab=43.71 E-value=24 Score=15.69 Aligned_cols=23 Identities=22% Similarity=0.419 Sum_probs=9.2 Q ss_pred CCEEEECCCCHHHHHHHHHHHHH Q ss_conf 52234254530038999999999 Q gi|254780670|r 329 ANAILIKPNQIGSVSETLNTIEK 351 (424) Q Consensus 329 ~NailiK~NQiGTvset~ea~~~ 351 (424) ++|.++=|.=+||+-|..|+..+ T Consensus 53 sDafI~lPGG~GTLdElfevlt~ 75 (130) T pfam03641 53 ADAFVALPGGFGTLEELFEILTW 75 (130) T ss_pred CCEEEEECCCCCHHHHHHHHHHH T ss_conf 69899907975329999999999 No 134 >cd06335 PBP1_ABC_ligand_binding_like_2 Type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. This subgroup includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type active transport systems that are predicted to be involved in transport of amino acids, peptides, or inorganic ions. Members of this group are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters, such as leucine-isoleucine-valine-binding protein (LIVBP); however their ligand specificity has not been determined experimentally. Probab=43.69 E-value=24 Score=15.69 Aligned_cols=83 Identities=8% Similarity=0.009 Sum_probs=37.2 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHH Q ss_conf 89999999999753853111023431104653202785111146865687899999998765448-69977676310423 Q gi|254780670|r 214 DSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYP-IYSIEDGMSEDDWH 292 (424) Q Consensus 214 eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~ 292 (424) ....+.+.+++++.|.+.-.+ +. |. ....++..++.++-..=| .+.+- ..-.|-. T Consensus 152 ~~~~~~~~~~~~~~G~~vv~~----------~~-----~~-------~~~~Df~~~l~~i~~~~pD~V~~~--~~~~~~~ 207 (347) T cd06335 152 RSNRKDLTAALAARGLKPVAV----------EW-----FN-------WGDKDMTAQLLRAKAAGADAIIIV--GNGPEGA 207 (347) T ss_pred HHHHHHHHHHHHHCCCEEEEE----------EE-----EC-------CCCCCHHHHHHHHHHCCCCEEEEE--CCCHHHH T ss_conf 899999999998659867578----------84-----07-------998348999999986699999994--7761799 Q ss_pred HHHHHHHHCCCCEEEECCCEEECCHHHHH Q ss_conf 79988642388368504300002148885 Q gi|254780670|r 293 GWKILTNKIGSNCQLVGDDLFVTNPERLH 321 (424) Q Consensus 293 gw~~l~~~~g~~~~IvGDDL~vTn~~rl~ 321 (424) .+.+-..++|-+..++|.+- .+.+..++ T Consensus 208 ~~~~q~~~~G~~~~~~g~~~-~~~~~~~~ 235 (347) T cd06335 208 QIANGMAKLGWKVPIISHWG-LSGGNFIE 235 (347) T ss_pred HHHHHHHHCCCCCCEEEECC-CCCHHHHH T ss_conf 99999997199976794147-77589998 No 135 >cd06300 PBP1_ABC_sugar_binding_like_1 Periplasmic sugar-binding component of uncharacterized ABC-type transport systems that are members of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily. Periplasmic sugar-binding component of uncharacterized ABC-type transport systems that are members of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes; however their substrate specificity is not known in detail. Probab=43.27 E-value=24 Score=15.64 Aligned_cols=50 Identities=16% Similarity=0.183 Sum_probs=24.7 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHHHHHHHHHHCCCCE-EEECCCE Q ss_conf 687899999998765448-699776763104237998864238836-8504300 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDWHGWKILTNKIGSNC-QLVGDDL 312 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~gw~~l~~~~g~~~-~IvGDDL 312 (424) .+++.=.+.-.+|+.+|| |-.|-- -+. +--|=..-.+..|.+. .++|+|- T Consensus 167 ~~~~~a~~~~~~~L~~~pdid~I~~-~d~-~a~Ga~~Al~~aG~~~~~~~g~~~ 218 (272) T cd06300 167 WDQAVAQKAVADFLASNPDVDGIWT-QGG-DAVGAVQAFEQAGRDIPPVTGEDE 218 (272) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEE-CCC-CHHHHHHHHHHCCCCCCEEEEECC T ss_conf 8889999999999986899609997-782-189999999966999886999759 No 136 >PRK07625 consensus Probab=43.21 E-value=24 Score=15.64 Aligned_cols=15 Identities=20% Similarity=0.434 Sum_probs=5.4 Q ss_pred CEEEECCCCHHHHHH Q ss_conf 223425453003899 Q gi|254780670|r 330 NAILIKPNQIGSVSE 344 (424) Q Consensus 330 NailiK~NQiGTvse 344 (424) +++.|.....-.+++ T Consensus 537 ~GI~vD~~~L~~l~~ 551 (922) T PRK07625 537 TGVLIDADRLSRQSG 551 (922) T ss_pred HCCCCCHHHHHHHHH T ss_conf 284026999999999 No 137 >cd06320 PBP1_allose_binding Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. Periplasmic allose-binding domain of bacterial transport systems that function as a primary receptor of active transport and chemotaxis. The members of this group are belonging to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Like other periplasmic receptors of the ABC-type transport systems, the allose-binding protein consists of two alpha/beta domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Probab=43.16 E-value=24 Score=15.63 Aligned_cols=75 Identities=17% Similarity=0.245 Sum_probs=40.0 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCH--HHHHHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCC-EEEE Q ss_conf 687899999998765448-6997767631042--37998864238--8368504300002148885210038652-2342 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDW--HGWKILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVAN-AILI 334 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~--~gw~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~N-aili 334 (424) .+.++-.+...+|+.++| +..| |.-+|. -|...-.++.| .++.|+|=|......+.++.| ... .+.- T Consensus 164 ~~~~~~~~~~~~~L~~~pd~~ai---~~~nD~~A~Ga~~al~~~g~~~di~vvG~D~~~~~~~~i~~g----~~~~tv~q 236 (275) T cd06320 164 WDREKAYDVATTILQRNPDLKAI---YCNNDTMALGVVEAVKNAGKQGKVLVVGTDGIPEAYKSIRAG----ELTATVDS 236 (275) T ss_pred CHHHHHHHHHHHHHHHCCCCCEE---EECCCHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHCC----CCEEEECC T ss_conf 03899999999988628766779---966988999999999976999982999989769999998649----96699817 Q ss_pred CCCCHHHH Q ss_conf 54530038 Q gi|254780670|r 335 KPNQIGSV 342 (424) Q Consensus 335 K~NQiGTv 342 (424) .|-++|-. T Consensus 237 ~~~~~G~~ 244 (275) T cd06320 237 FPALIGEV 244 (275) T ss_pred CHHHHHHH T ss_conf 99999999 No 138 >PRK10355 xylF D-xylose transporter subunit XylF; Provisional Probab=42.79 E-value=24 Score=15.59 Aligned_cols=63 Identities=13% Similarity=0.195 Sum_probs=37.4 Q ss_pred CCHHHHHHHHHHHHHHCC--EEEEECCCCCCCHHHHHHHHHH-CCCCEEEECCCEEECCHHHHHHH Q ss_conf 687899999998765448--6997767631042379988642-38836850430000214888521 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP--IYSIEDGMSEDDWHGWKILTNK-IGSNCQLVGDDLFVTNPERLHKG 323 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP--I~sIEDP~~e~D~~gw~~l~~~-~g~~~~IvGDDL~vTn~~rl~~g 323 (424) .++++=...-.+|+..+| |-.|-=+.|+--.....-|.+. +..++.|+|-|-..--.++++.| T Consensus 192 w~~~~A~~~me~~L~a~~~~id~V~a~ND~mAlGA~~Al~~aG~~~~v~V~G~Dg~~~al~~I~~G 257 (330) T PRK10355 192 WLPENALKIMENALTANNNKIDAVVASNDATAGGAIQALSAQGLSGKVAISGQDADLAAIKRIVAG 257 (330) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCEEECCCCCHHHHHHHHCC T ss_conf 228899999999996099851699989971489999999966999997699027998999999749 No 139 >PRK09534 btuF corrinoid ABC transporter substrate-binding protein; Reviewed Probab=42.66 E-value=19 Score=16.45 Aligned_cols=70 Identities=13% Similarity=0.111 Sum_probs=44.7 Q ss_pred HHHHHHCC-EEEEECCCCC-CCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHH Q ss_conf 98765448-6997767631-0423799886423883685043000021488852100386522342545300389 Q gi|254780670|r 271 ANLVDQYP-IYSIEDGMSE-DDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVS 343 (424) Q Consensus 271 ~~l~~~yP-I~sIEDP~~e-~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvs 343 (424) +.+++.-| ++-+-+..++ .++++|+.+++--.++++.|-+|+.-.---|+-+|+++-+ -.|-|--.+|.+ T Consensus 243 E~iI~~nPDVII~~~~~~~i~~~~~~~~~~AVkn~rv~~vd~d~i~RpGPRiv~glE~lA---~~iHp~~~~~~~ 314 (364) T PRK09534 243 EVIVSQDPDVVVTGVSADRLDETASYRSTTAYETGNVVTVNTNHINQPAPRIVEPMARMA---NAFHNTTINTTL 314 (364) T ss_pred HHHHHHCCCEEEECCCHHHCCCCCCCCCCCCEECCCEEEECCCCCCCCCCHHHHHHHHHH---HHHCCCCCCCHH T ss_conf 999865998899678632203454645565263483899676445799871999999999---863823347421 No 140 >PRK13508 tagatose-6-phosphate kinase; Provisional Probab=42.62 E-value=24 Score=15.57 Aligned_cols=49 Identities=18% Similarity=0.331 Sum_probs=19.0 Q ss_pred HHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHH Q ss_conf 999999875200001367655764488684689999999999753853111023431104 Q gi|254780670|r 183 VFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCA 242 (424) Q Consensus 183 v~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~A 242 (424) ....+.+++++...-. =-|.+.|.+. .+...-|.+.+.+.| +.+.+|.. T Consensus 116 ll~~~~~~~~~~d~vv---isGSlP~g~~--~~~y~~li~~~~~~g------~~vilD~s 164 (309) T PRK13508 116 FLHHFKQLLEKVEVVA---ISGSLPAGLP--QDYYAQLIELANNKG------KKVVLDCS 164 (309) T ss_pred HHHHHHHHHCCCCEEE---EECCCCCCCC--HHHHHHHHHHHHHCC------CEEEEECC T ss_conf 9999998705599999---9688899998--599999999998559------98999897 No 141 >PRK06348 aspartate aminotransferase; Provisional Probab=42.34 E-value=25 Score=15.55 Aligned_cols=31 Identities=10% Similarity=0.144 Sum_probs=22.8 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 68656878999999987654486997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) .|..++.+++-+. .+||.+|.++-|+|=.+. T Consensus 175 TG~v~~~e~l~~l-~~la~~~~i~visDEiY~ 205 (383) T PRK06348 175 TGAVFSKETLEEI-AKVAIENDLVIISDEVYD 205 (383) T ss_pred CCCCCCHHHHHHH-HHHHHHCCEEEEECCCHH T ss_conf 8887777899999-976530686998514334 No 142 >pfam09806 CDK2AP Cyclin-dependent kinase 2-associated protein. Members of this family of proteins are cell-growth suppressors, associating with and influencing the biological activities of important cell cycle regulators in the S phase including monomeric non-phosphorylated cyclin-dependent kinase 2 (CDK2) and DNA polymerase alpha/primase. An association between mutations in the gene coding for this protein and oral cancer has been described. Probab=41.89 E-value=21 Score=16.00 Aligned_cols=17 Identities=29% Similarity=0.712 Sum_probs=7.7 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 89999999999998311 Q gi|254780670|r 395 RIAKYNQLIRIEESLGK 411 (424) Q Consensus 395 R~aKyNrLlrIee~Lg~ 411 (424) -+.||.+||.+-||+|. T Consensus 100 ivdkY~~LL~vi~em~~ 116 (156) T pfam09806 100 IVSKYADLLSVIEEMGR 116 (156) T ss_pred HHHHHHHHHHHHHHHHC T ss_conf 68999999999999714 No 143 >TIGR01979 sufS cysteine desulfurases, SufS subfamily; InterPro: IPR010970 Cysteine desulphurases are pyridoxal-phosphate enzymes which catalyse the removal of sulphur from L-cysteine to form L-alanine and elemental sulphur. These enzymes have been shown to play an important role in the biosynthesis of iron-sulphur clusters, thionucleosides in tRNA, thiamine, biotin, lipoate and molydopterin This entry represents a subfamily of NifS-related cysteine desulphurases that are often involved in iron-sulphur cluster formation, which is needed for nitrogen fixation and other vital functions . Many of these enzymes, in addition to cysteine desulphurase activity, also show selenocysteine lyase activity where L-selenocysteine is converted to L-alanine and selenide . Some of these enzymes are much more specific for selenocysteine than cysteine and are thus believed to function in selenium metabolism rather than sulphur metabolism. Structural studies of the E. coli and Synechocystis enzymes indicate that they are homodimers which show a similar fold to other members of the alpha family of pyridoxal-phosphate enzymes .; GO: 0030170 pyridoxal phosphate binding, 0031071 cysteine desulfurase activity, 0006534 cysteine metabolic process. Probab=41.35 E-value=25 Score=15.44 Aligned_cols=106 Identities=20% Similarity=0.283 Sum_probs=55.5 Q ss_pred CHHHHHHHHHHHHHHCCCCC----CCCHHHHHCHHHHHHCCCCCEEECCC-CCCCCHHHHHHHHHHHHHHCC--EEEEEC Q ss_conf 46899999999997538531----11023431104653202785111146-865687899999998765448--699776 Q gi|254780670|r 212 TADSALDLIANSIEKAGYNA----GKDLLIALDCAASAFFKKEKYILKGE-NLELQPNEMASYLANLVDQYP--IYSIED 284 (424) Q Consensus 212 ~~eeaL~ll~eAi~~aGy~~----g~di~i~lD~AAsefy~~~kY~~~~~-~~~~t~~elid~y~~l~~~yP--I~sIED 284 (424) +..++|.|+.-+--..-.+. |++|.|+. .| -..+=|=| .+|++-. |.+| T Consensus 89 gtTe~iNLvA~s~g~~~l~~av~pGDeIv~s~----------------mEHHaNiVPWQ------~l~~r~GA~lk~~-- 144 (409) T TIGR01979 89 GTTESINLVAYSWGRSRLKAAVDPGDEIVISE----------------MEHHANIVPWQ------LLAERTGATLKFI-- 144 (409) T ss_pred CCCHHHHHHHHHHCCCCCHHHCCCCCEEEECC----------------HHHHCCHHHHH------HHHHHHCCEEEEE-- T ss_conf 60278899999831101342538988798401----------------11001214799------9997609878997-- Q ss_pred CCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEEC Q ss_conf 7631042379988642388368504300002148885210038652234254530038999999999998798699950 Q gi|254780670|r 285 GMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISH 363 (424) Q Consensus 285 P~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivSh 363 (424) |++||-.=....|.+.+-+|++||. +.-==|=-||+.-.-|.+++|++.|-.++|=. T Consensus 145 ~l~~~G~ld~~~l~~~~~~~TKlVa----------------------~~HvSNVlGtvnP~~ei~~~AH~~GA~vlvDG 201 (409) T TIGR01979 145 PLDDDGTLDLDDLEELLTEKTKLVA----------------------ITHVSNVLGTVNPVEEIIKLAHQVGAKVLVDG 201 (409) T ss_pred ECCCCCEEEHHHHHHHHHCCCEEEE----------------------EEEEECCCCCCCCHHHHHHHHHHCCCEEEEEC T ss_conf 4488760518889988326982999----------------------97420220301487899999865797699862 No 144 >PRK05756 pyridoxamine kinase; Validated Probab=41.33 E-value=25 Score=15.44 Aligned_cols=79 Identities=16% Similarity=0.179 Sum_probs=38.4 Q ss_pred CCCHHHHHHHHHHHHHH----CC-EEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEE Q ss_conf 56878999999987654----48-69977676310423799886423883685043000021488852100386522342 Q gi|254780670|r 260 ELQPNEMASYLANLVDQ----YP-IYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILI 334 (424) Q Consensus 260 ~~t~~elid~y~~l~~~----yP-I~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~Naili 334 (424) .+.+.++++...+|+.+ .| +.++-||.=-|+-.| +||. ++....-.+.-.--|-+| T Consensus 83 Ylgs~~~~~~i~~~i~~~k~~np~~~~v~DPVmGD~g~g------------------~yv~-~~~~~~~~~~llp~Adii 143 (287) T PRK05756 83 YLGSAEQGEAILGAVRRVKAANPQALYLCDPVMGHPEKG------------------CIVA-PGVAEFLRDRALPAADII 143 (287) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCEEEECCEEECCCCC------------------EECC-HHHHHHHHHHCCCCCCEE T ss_conf 769999999999999999852998279966525048985------------------5328-769999998522535798 Q ss_pred CCCC----------HHHHHHHHHHHHHHHHCCC Q ss_conf 5453----------0038999999999998798 Q gi|254780670|r 335 KPNQ----------IGSVSETLNTIEKAQMAGY 357 (424) Q Consensus 335 K~NQ----------iGTvset~ea~~~a~~~g~ 357 (424) -||| |.|+.++++|++...+.|- T Consensus 144 TPN~fEle~LtG~~i~~~~~~~~A~~~l~~~g~ 176 (287) T PRK05756 144 TPNLFELEWLSGHPVETLEDAVAAARALIARGP 176 (287) T ss_pred ECCHHHHHHHCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 388899999769965999999999999997199 No 145 >TIGR02318 phosphono_phnM phosphonate metabolism protein PhnM; InterPro: IPR012696 This family consists of proteins from in the PhnM family. PhnM is a protein associated with phosphonate utilization in a number of bacterial species. Pseudomonas stutzeri WM88, has a related protein that is part of a system for the oxidation of phosphites (another form of reduced phosphorous compound), which is not contained in this entry.. Probab=41.26 E-value=17 Score=16.66 Aligned_cols=96 Identities=21% Similarity=0.271 Sum_probs=54.8 Q ss_pred CCCCEE-ECCCCCC--CCHHHHHHHHHHHHHHCCEEE-EECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHH Q ss_conf 278511-1146865--687899999998765448699-776763104237998864238836850430000214888521 Q gi|254780670|r 248 KKEKYI-LKGENLE--LQPNEMASYLANLVDQYPIYS-IEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKG 323 (424) Q Consensus 248 ~~~kY~-~~~~~~~--~t~~elid~y~~l~~~yPI~s-IEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~g 323 (424) +.++|. |-.+..- +|.+|+-++..++...--.++ | +.-...-..++.+-+= ..--=||= +++.++.. T Consensus 181 ~~~~yr~~~~~k~grvlsd~~~~~~~~~r~~~~~~~~~~-----~~~r~~i~ala~~~~i-~lASHDD~---T~e~V~e~ 251 (391) T TIGR02318 181 DLEKYREYYQKKRGRVLSDDEFDEIVEERIARSAEYADV-----EANRSEIAALARARGI-PLASHDDD---TPEHVAEA 251 (391) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCH-----HHHHHHHHHHHHHCCC-CEECCCCC---CHHHHHHH T ss_conf 257899987245267458999999999999973586325-----8999999999985799-53246888---88899888 Q ss_pred HHCCCCCEEEECCCCHHHHHH---HHHHHHHHHHCCCEEEEECC Q ss_conf 003865223425453003899---99999999987986999507 Q gi|254780670|r 324 INEEVANAILIKPNQIGSVSE---TLNTIEKAQMAGYPSIISHR 364 (424) Q Consensus 324 i~~~~~NailiK~NQiGTvse---t~ea~~~a~~~g~~~ivShR 364 (424) .+.+. |+|| |+||++.|++.|+.+++.+. T Consensus 252 ~~~Gv------------~isEFPtT~EAA~~Ar~~G~~~lMGAP 283 (391) T TIGR02318 252 HDLGV------------TISEFPTTLEAAKEARSLGMQILMGAP 283 (391) T ss_pred HHCCC------------EEEECHHHHHHHHHHHHCCCEEEEECC T ss_conf 75782------------674042348999999876980896088 No 146 >cd06321 PBP1_ABC_sugar_binding_like_11 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consist of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=40.14 E-value=27 Score=15.31 Aligned_cols=50 Identities=10% Similarity=0.200 Sum_probs=26.4 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHHHH--HHHHHHCC-CCEEEECCCEE Q ss_conf 687899999998765448-6997767631042379--98864238-83685043000 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDWHGW--KILTNKIG-SNCQLVGDDLF 313 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~gw--~~l~~~~g-~~~~IvGDDL~ 313 (424) .+.+.-.+...+|+.++| +..| |.-+|.-.. -.-.++.| +.+.++|=|-+ T Consensus 163 ~~~~~~~~~~~~ll~~~p~~~ai---fa~nD~~A~Ga~~a~~~~G~~~i~ivg~dg~ 216 (271) T cd06321 163 GSRDGGLRVMQGLLTRFPKLDGV---FAINDPTAIGADLAAKQAGRNDIKITSVDGA 216 (271) T ss_pred CCHHHHHHHHHHHHHCCCCCCEE---EECCCHHHHHHHHHHHHCCCCCCEEEEEECC T ss_conf 78999999999998508867789---9778689999999999759799989999696 No 147 >PRK12457 2-dehydro-3-deoxyphosphooctonate aldolase; Provisional Probab=39.01 E-value=28 Score=15.19 Aligned_cols=23 Identities=17% Similarity=0.234 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHH Q ss_conf 89999999999753853111023 Q gi|254780670|r 214 DSALDLIANSIEKAGYNAGKDLL 236 (424) Q Consensus 214 eeaL~ll~eAi~~aGy~~g~di~ 236 (424) ++.|++|.++-++.|+..-.||+ T Consensus 73 eegL~~L~~ik~~~gl~viTeVh 95 (281) T PRK12457 73 DEGLRIFEEVKARFGVPVITDVH 95 (281) T ss_pred HHHHHHHHHHHHHHCCCEEEEEC T ss_conf 99999999999987996799858 No 148 >PRK08176 pdxK pyridoxal kinase; Reviewed Probab=38.74 E-value=28 Score=15.16 Aligned_cols=18 Identities=6% Similarity=0.095 Sum_probs=8.3 Q ss_pred HHHHHHHHHHHHHHHCCC Q ss_conf 038999999999998798 Q gi|254780670|r 340 GSVSETLNTIEKAQMAGY 357 (424) Q Consensus 340 GTvset~ea~~~a~~~g~ 357 (424) .|+.+++++++...+.|- T Consensus 172 ~~~~~~~~aa~~L~~~G~ 189 (281) T PRK08176 172 RTLDEAIAAAKSLLSDTL 189 (281) T ss_pred CCHHHHHHHHHHHHHCCC T ss_conf 999999999999997199 No 149 >PRK08076 consensus Probab=38.45 E-value=28 Score=15.13 Aligned_cols=14 Identities=21% Similarity=0.207 Sum_probs=5.8 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999987520 Q gi|254780670|r 180 GAEVFHTLKKELKS 193 (424) Q Consensus 180 ~~ev~~~lk~~L~~ 193 (424) ....|+.|++.|++ T Consensus 455 t~~L~~~L~~~L~~ 468 (877) T PRK08076 455 IYDLKQTFVEELEK 468 (877) T ss_pred HHHHHHHHHHHHHH T ss_conf 99999999999865 No 150 >COG1879 RbsB ABC-type sugar transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=37.69 E-value=29 Score=15.05 Aligned_cols=75 Identities=20% Similarity=0.295 Sum_probs=49.8 Q ss_pred CCCCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHHH---HHHHHHHCCC--CEEEECCCEEECCHHHHHHHHHCCCCCEE Q ss_conf 65687899999998765448-699776763104237---9988642388--36850430000214888521003865223 Q gi|254780670|r 259 LELQPNEMASYLANLVDQYP-IYSIEDGMSEDDWHG---WKILTNKIGS--NCQLVGDDLFVTNPERLHKGINEEVANAI 332 (424) Q Consensus 259 ~~~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~g---w~~l~~~~g~--~~~IvGDDL~vTn~~rl~~gi~~~~~Nai 332 (424) ...++++-...-.+++.+|| |..| |..+|+.. ...+. ..|. ++.|+|-|+.....+.++.| ...-.+ T Consensus 199 ~~~~~~~a~~~~~~~L~~~pdi~~i---~~~~d~~a~ga~~a~~-~ag~~~~v~v~g~D~~~~~~~~i~~G---~~~~~v 271 (322) T COG1879 199 GDWDRDKALEVMEDLLAANPDIDGI---YAANDGMALGAIQALK-AAGRKGDVVVVGFDGTPDALKALKDG---KLDATV 271 (322) T ss_pred CCCCHHHHHHHHHHHHHHCCCCEEE---EECCCCHHHHHHHHHH-HCCCCCCEEEEEECCCHHHHHHHHCC---CEEEEE T ss_conf 8876889999999999758994499---9899706999999999-76998806999947979999999669---736996 Q ss_pred EECCCCHH Q ss_conf 42545300 Q gi|254780670|r 333 LIKPNQIG 340 (424) Q Consensus 333 liK~NQiG 340 (424) .-.|...| T Consensus 272 ~q~p~~~g 279 (322) T COG1879 272 LQDPAAQG 279 (322) T ss_pred CCCHHHHH T ss_conf 37988889 No 151 >pfam03444 DUF293 Domain of unknown function. This domain is always found with a pair of CBS domains pfam00571. this region may be distantly related to the HrcA proteins of prokaryotes (Bateman A pers. obs.). Probab=37.63 E-value=19 Score=16.31 Aligned_cols=52 Identities=27% Similarity=0.397 Sum_probs=37.9 Q ss_pred ECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCC-CCCCCCCCCCCCCHHHHHH Q ss_conf 1488774899989999999999987520000136-7655764488684689999 Q gi|254780670|r 166 MPVGAENIREAIRMGAEVFHTLKKELKSKGYSTN-VGDEGGFSPNLKTADSALD 218 (424) Q Consensus 166 iP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~-vgDEGGfap~~~~~eeaL~ 218 (424) -|++.+++.+.+.......++....|++-|+--+ -|.-|||.|..+- .++|. T Consensus 22 ~pVks~~IAe~l~~spgTIRN~M~~L~~LGLVe~~pgp~gGy~PT~~a-y~~l~ 74 (79) T pfam03444 22 RAVKGEEIADIIGRNPGTVRNQMQSLKALGLVEGVPGPKGGYKPTSKA-YEALG 74 (79) T ss_pred CCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHH-HHHHC T ss_conf 986679999998789088999999999858865788998887658999-98865 No 152 >COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms] Probab=37.33 E-value=29 Score=15.01 Aligned_cols=19 Identities=21% Similarity=0.389 Sum_probs=10.8 Q ss_pred EEEEEECCCCCCHHHHHHH Q ss_conf 0256514887748999899 Q gi|254780670|r 161 QEFMIMPVGAENIREAIRM 179 (424) Q Consensus 161 QEfmIiP~g~~sf~eal~~ 179 (424) |-|-|+-+....++|-.+. T Consensus 205 ~sFnIVDiKP~nmeeLteV 223 (460) T COG5170 205 GSFNIVDIKPHNMEELTEV 223 (460) T ss_pred CCEEEEECCCCCHHHHHHH T ss_conf 7347896167668999988 No 153 >PRK09389 (R)-citramalate synthase; Provisional Probab=37.31 E-value=29 Score=15.00 Aligned_cols=48 Identities=21% Similarity=0.258 Sum_probs=22.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH Q ss_conf 7489998999999999998752000013676557644886846899999999997 Q gi|254780670|r 171 ENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIE 225 (424) Q Consensus 171 ~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~ 225 (424) .|-++.+++..+....++ +.+....++-|-++-- +.+-+++++..|++ T Consensus 106 ~s~ee~l~~~~~~v~~ak----~~g~~v~~~~ED~sr~---~~~fl~e~~~~a~~ 153 (487) T PRK09389 106 KTREEVLEMAVEAVEYAK----DHGLIVELSGEDASRA---DLDFLKELYRAGIE 153 (487) T ss_pred CCHHHHHHHHHHHHHHHH----HCCCEEEEEEECCCCC---CHHHHHHHHHHHHH T ss_conf 999999999999999999----7497799921066555---77999999999997 No 154 >cd06308 PBP1_sensor_kinase_like Periplasmic binding domain of two-component sensor kinase signaling systems. Periplasmic binding domain of two-component sensor kinase signaling systems, some of which are fused with a C-terminal histidine kinase A domain (HisK) and/or a signal receiver domain (REC). Members of this group share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily and are predicted to be involved in sensing of environmental stimuli; their substrate specificities, however, are not known in detail. Probab=37.30 E-value=29 Score=15.00 Aligned_cols=72 Identities=13% Similarity=0.213 Sum_probs=38.5 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHHHH---HHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEEEE Q ss_conf 687899999998765448-6997767631042379---98864238--83685043000021488852100386522342 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDWHGW---KILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAILI 334 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~gw---~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~Naili 334 (424) .++++-.+--.+|+..|| |-.|- .-+|.-.. +.+ ++.| +++.|||=|-.......+ +.+..+...+... T Consensus 164 ~~~~~a~~~~~~~L~~~pdi~aI~---a~nD~~A~Ga~~Al-~~aG~~~~i~vvg~D~~~~~~~~l-~~i~~g~~~~t~~ 238 (270) T cd06308 164 WLKEKAEEKMEELLQANPDIDLVY---AHNDPMALGAYLAA-KRAGREKEIKFIGIDGLPGPGGGI-EAVRDGELDATFL 238 (270) T ss_pred CCHHHHHHHHHHHHHHCCCCCEEE---ECCCHHHHHHHHHH-HHCCCCCCCEEEEECCCCCHHHHH-HHHHCCCCEEEEE T ss_conf 679999999999985088888899---77865899999999-976999995899888993369999-9987386169998 Q ss_pred CCC Q ss_conf 545 Q gi|254780670|r 335 KPN 337 (424) Q Consensus 335 K~N 337 (424) -|- T Consensus 239 ~p~ 241 (270) T cd06308 239 YPT 241 (270) T ss_pred CHH T ss_conf 837 No 155 >PRK07367 consensus Probab=37.21 E-value=29 Score=14.99 Aligned_cols=31 Identities=23% Similarity=0.478 Sum_probs=22.1 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 68656878999999987654486997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) .|..++.+++-++ .++|.+|.+.-|+|=.+. T Consensus 174 TG~v~s~~~l~~l-~~~a~~~~~~iIsDEiY~ 204 (385) T PRK07367 174 TGMVYTPEEIAAL-AEVIVEHDLYVVSDEIYE 204 (385) T ss_pred CCCCCCHHHHHHH-HHHHHHCCEEEEECCCCH T ss_conf 7856657888899-888875586997113211 No 156 >cd00984 DnaB_C DnaB helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=37.06 E-value=29 Score=14.98 Aligned_cols=13 Identities=31% Similarity=0.378 Sum_probs=7.9 Q ss_pred HHHHCCCEEE-EEC Q ss_conf 9998798699-950 Q gi|254780670|r 351 KAQMAGYPSI-ISH 363 (424) Q Consensus 351 ~a~~~g~~~i-vSh 363 (424) +|++.+..+| +|| T Consensus 158 lA~e~~v~Vi~~sQ 171 (242) T cd00984 158 LAKELNVPVIALSQ 171 (242) T ss_pred HHHHHCCEEEEEEC T ss_conf 99997993999846 No 157 >PRK11241 gabD succinate-semialdehyde dehydrogenase I; Provisional Probab=36.85 E-value=11 Score=18.19 Aligned_cols=125 Identities=17% Similarity=0.265 Sum_probs=52.0 Q ss_pred CCCCCCCEEECCCCCC-CCCCCCCEEEEEEC-CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 6475430000036645-56776610256514-887748999899999999999875200001367655764488684689 Q gi|254780670|r 138 HILPVPLMNILNGGIH-ADNALDFQEFMIMP-VGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADS 215 (424) Q Consensus 138 ~~lP~P~~NiinGG~H-A~~~l~iQEfmIiP-~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~ee 215 (424) -.+|--.+|++.|+.. .+..+- --| +...+|.-+.+.+-.|.+...+.++.- +-.-||-.|-+=-..- T Consensus 199 AGlP~Gv~nvv~G~~~~~g~~l~-----~~~~v~~v~FTGS~~~G~~i~~~~a~~~~~~-----~lELGGk~p~IV~~dA 268 (482) T PRK11241 199 AGIPAGVFNVVTGSAGAVGGELT-----SNPLVRKLSFTGSTEIGRQLMEQCAKDIKKV-----SLELGGNAPFIVFDDA 268 (482) T ss_pred HCCCCCEEECCCCCHHHHHHHHH-----CCCCCCEEEEECCHHHHHHHHHHHCCCCCEE-----EECCCCCCEEEECCCC T ss_conf 48995767527897779999997-----1988376999898899999886312435301-----1215774338986877 Q ss_pred HHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 99999999975385311102343110465320278511114686568789999999876544869977676310 Q gi|254780670|r 216 ALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 216 aL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) -|+...+++-...+...-+.+.+. + ..|-- ..-.+++++.+...++++ .+-||++++ T Consensus 269 Dld~Av~~~~~~~f~n~GQ~C~a~----~-------rv~V~---~~i~d~f~~~l~~~~~~l---~vG~p~~~~ 325 (482) T PRK11241 269 DLDKAVEGALASKFRNAGQTCVCA----N-------RLYVQ---DGVYDRFAEKLQQAVSKL---HIGDGLEAG 325 (482) T ss_pred CHHHHHHHHHHHHHHCCCCCCEEC----C-------CEEEC---CCHHHHHHHHHHHHHHCC---CCCCCCCCC T ss_conf 899999998888852789976228----8-------42204---026999999999865315---578988766 No 158 >PRK11559 garR tartronate semialdehyde reductase; Provisional Probab=36.74 E-value=30 Score=14.94 Aligned_cols=13 Identities=15% Similarity=0.294 Sum_probs=4.5 Q ss_pred CCCCCCHHHHHHH Q ss_conf 7631042379988 Q gi|254780670|r 285 GMSEDDWHGWKIL 297 (424) Q Consensus 285 P~~e~D~~gw~~l 297 (424) ++.++||..-.++ T Consensus 272 G~g~~D~sal~~~ 284 (295) T PRK11559 272 GLGTADHSALACY 284 (295) T ss_pred CCCCCCHHHHHHH T ss_conf 9998789999999 No 159 >cd07084 ALDH_KGSADH-like ALDH subfamily: NAD(P)+-dependent alpha-ketoglutaric semialdehyde dehydrogenases and plant delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH family 12-like. ALDH subfamily which includes the NAD(P)+-dependent, alpha-ketoglutaric semialdehyde dehydrogenases (KGSADH, EC 1.2.1.26); plant delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH, EC=1.5.1.12 ), ALDH family 12; the N-terminal domain of the MaoC (monoamine oxidase C) dehydratase regulatory protein; and orthologs of MaoC, PaaZ and PaaN, which are putative ring-opening enzymes of the aerobic phenylacetic acid catabolic pathway. Probab=36.20 E-value=17 Score=16.82 Aligned_cols=59 Identities=19% Similarity=0.178 Sum_probs=29.4 Q ss_pred CCCCCEEECCCCCCCCCCCCCEEEEEEC-CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCC Q ss_conf 7543000003664556776610256514-8877489998999999999998752000013676557644886 Q gi|254780670|r 140 LPVPLMNILNGGIHADNALDFQEFMIMP-VGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNL 210 (424) Q Consensus 140 lP~P~~NiinGG~HA~~~l~iQEfmIiP-~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~ 210 (424) ||-=.+|++.|+...+..|- --| +...+|.-+.+.+-.|..... ...-+.--||--|.+ T Consensus 156 lP~gv~nvv~g~~~~g~~L~-----~hp~v~~V~FTGS~~~G~~i~~~a~-------~~~~~lElGGknp~i 215 (442) T cd07084 156 LPPEDVTLINGDGKTMQALL-----LHPNPKMVLFTGSSRVAEKLALDAK-------QARIYLELAGFNWKV 215 (442) T ss_pred CCCCEEEECCCCCHHHHHHH-----CCCCCCEEEEECCHHHHHHHHHHCC-------CCCEEEECCCCCCEE T ss_conf 99747995489840358885-----3977357987476499999998464-------585688427888669 No 160 >pfam03659 Glyco_hydro_71 Glycosyl hydrolase family 71. Family of alpha-1,3-glucanases. Probab=36.18 E-value=30 Score=14.88 Aligned_cols=72 Identities=15% Similarity=0.239 Sum_probs=33.6 Q ss_pred CHHHHHHHHH------HCCC--CEEEECCCEEE---CCHHHHHHH----HHCCCCCEEEECCCCHHHHHHHHHHHHHHHH Q ss_conf 4237998864------2388--36850430000---214888521----0038652234254530038999999999998 Q gi|254780670|r 290 DWHGWKILTN------KIGS--NCQLVGDDLFV---TNPERLHKG----INEEVANAILIKPNQIGSVSETLNTIEKAQM 354 (424) Q Consensus 290 D~~gw~~l~~------~~g~--~~~IvGDDL~v---Tn~~rl~~g----i~~~~~NailiK~NQiGTvset~ea~~~a~~ 354 (424) +.+||..|.. |-|. .-.|.-|-|+. ++|.--... -+...-+-...+|+-.-.+.+.+.++-+.++ T Consensus 254 pH~gw~~l~~yyI~~yK~G~~~~p~I~~d~l~~WYR~~p~~a~~s~~Ttgn~~~~~q~~~~P~~~e~~~D~Vf~~alL~~ 333 (386) T pfam03659 254 PHDGWRELLPYYIDAYKAGKTSDATITEEKLVYWYRPNPGAACSSGDTTGNNASQGQFEGRPNGWEVLQDRVFVSALLKS 333 (386) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCEEEEEEEEECC T ss_conf 94789987699999986699888845112799998078512466899777875667445699876644306999998079 Q ss_pred CCCEEEE Q ss_conf 7986999 Q gi|254780670|r 355 AGYPSII 361 (424) Q Consensus 355 ~g~~~iv 361 (424) ..-.+|. T Consensus 334 patv~Vt 340 (386) T pfam03659 334 PATVTVT 340 (386) T ss_pred CCEEEEE T ss_conf 9489999 No 161 >cd06313 PBP1_ABC_sugar_binding_like_5 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=35.99 E-value=30 Score=14.86 Aligned_cols=77 Identities=13% Similarity=0.112 Sum_probs=40.2 Q ss_pred CCCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHHHHHHHHHHCC-CCEEEECCCEEECCHHHHHHHHHCCCCCEEE-ECC Q ss_conf 5687899999998765448-699776763104237998864238-8368504300002148885210038652234-254 Q gi|254780670|r 260 ELQPNEMASYLANLVDQYP-IYSIEDGMSEDDWHGWKILTNKIG-SNCQLVGDDLFVTNPERLHKGINEEVANAIL-IKP 336 (424) Q Consensus 260 ~~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~gw~~l~~~~g-~~~~IvGDDL~vTn~~rl~~gi~~~~~Nail-iK~ 336 (424) ..++++-.+....|+..|| |-.|--.-+..-.-.-.-+. ..| .++.|+|=|-...-.+.++.| ...+.+ -.| T Consensus 164 ~~~~~~a~~~~~~~L~~~pdid~I~~~nD~~a~Ga~~Al~-~aG~~~v~v~g~Dg~~~~l~~I~~G----~~~atv~q~~ 238 (272) T cd06313 164 NWDVSKAARIWETWLTKYPQLDGAFCHNDSMALAAYQIMK-AAGRTKIVIGGVDGDPPAIQAVSDG----RMVATVRNPA 238 (272) T ss_pred CCCHHHHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHH-HCCCCCCEEEEECCCHHHHHHHHCC----CCEEEEECCH T ss_conf 8768999999999996499987899888477899999999-7699986899978999999999859----9189980699 Q ss_pred CCHHH Q ss_conf 53003 Q gi|254780670|r 337 NQIGS 341 (424) Q Consensus 337 NQiGT 341 (424) -.+|- T Consensus 239 ~~~G~ 243 (272) T cd06313 239 CRIHG 243 (272) T ss_pred HHHHH T ss_conf 99999 No 162 >TIGR03540 DapC_direct LL-diaminopimelate aminotransferase. This clade of the pfam00155 superfamily of aminotransferases includes several which are adjacent to elements of the lysine biosynthesis via diaminopimelate pathway (GenProp0125). Every member of this clade is from a genome which possesses most of the lysine biosynthesis pathway but lacks any of the known aminotransferases, succinylases, desuccinylases, acetylases or deacetylases typical of the acylated versions of this pathway nor do they have the direct, NADPH-dependent enzyme (ddh). Although there is no experimental characterization of any of the sequences in this clade, a direct pathway is known in plants and Chlamydia, so it seems quite reasonable that these enzymes catalyze the same transformation. Probab=35.19 E-value=31 Score=14.77 Aligned_cols=30 Identities=10% Similarity=0.092 Sum_probs=20.0 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-++ .++|.+|.++-|+|=.+ T Consensus 177 TG~v~s~~~l~~i-~~la~~~~i~iIsDEiY 206 (383) T TIGR03540 177 TGAVAPLKFFKEL-VEFAKEYNIIVCHDNAY 206 (383) T ss_pred CCCCCCCHHHHCC-CCCCCCCCCEEECCHHH T ss_conf 0701210222201-00124554057611025 No 163 >cd06248 M14_CPA_CPB_like Peptidase M14 Carboxypeptidase A/B-like subfamily: This is one of two main M14 carboxypeptidase subfamilies, defined by sequence and structural homology, the other being N/E. Carboxypeptidases (CPs) hydrolyze single, C-terminal amino acids from polypeptide chains. They have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Majority of the proteins in this subfamily have not been characterized as yet. The A/B enzymes are normally synthesized as inactive precursors containing preceding signal peptide, followed by a globular N-terminal pro-region linked to the enzyme; the proenzymes are called procarboxypeptidases. These enzymes exhibit distinct substrate specificity pattern; Carboxypeptidase A-like (CPA-like) enzymes favor hydrophobic residues while carboxypeptidase B-like (CPB-like) enzymes only cleave the basic residues lysine or arginine. There are nine members in the A/B family: CPA1, CPA2, CPA3, CPA4, CPA5, Probab=34.76 E-value=32 Score=14.72 Aligned_cols=29 Identities=21% Similarity=0.277 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 4899999998756789759885578988864 Q gi|254780670|r 61 GVLKAIAFVNDEIRTALLGCDARDQLLIDKI 91 (424) Q Consensus 61 gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~ 91 (424) +...++-.++..+.. -+.|+.-+.-+|+. T Consensus 70 s~~~~ly~i~~Ll~~--y~~d~~~~~lL~~~ 98 (304) T cd06248 70 STMTVEYLAYQLLTG--YGSDATVTALLDKF 98 (304) T ss_pred CHHHHHHHHHHHHHC--CCCCHHHHHHHHCC T ss_conf 399999999999864--57888899997477 No 164 >TIGR00273 TIGR00273 iron-sulfur cluster binding protein; InterPro: IPR004452 Members of this family have a perfect 4Fe-4S binding motif C-x(2)-C-x(2)-C-x(3)-CP followed by either a perfect or imperfect (the first Cys replaced by Ser) second copy. Members probably bind two 4fe-4S iron-sulphur clusters.. Probab=34.63 E-value=22 Score=15.88 Aligned_cols=139 Identities=19% Similarity=0.335 Sum_probs=70.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHCCCCC-CCCCCCCCEEECCCCCCCCCCCC-CEEEEEECCCCCCHHH-- Q ss_conf 3221232222024489987526634984888225324-56475430000036645567766-1025651488774899-- Q gi|254780670|r 100 NKSRLGANAILGVSLAVSKAAAQTSNLPLYKYLGGCS-AHILPVPLMNILNGGIHADNALD-FQEFMIMPVGAENIRE-- 175 (424) Q Consensus 100 nks~lGaNailAvSlA~akA~A~~~~~pLy~yl~~~~-~~~lP~P~~NiinGG~HA~~~l~-iQEfmIiP~g~~sf~e-- 175 (424) +.|++=+.--.---|+|-|+++-.+.=|.|+++||.. .-.-|=||.-|++- -|+ +.+|==+|.- .|.=- T Consensus 282 gRS~iL~~ekF~~~L~CiRCG~C~n~CPvY~~~Ggh~YGs~Y~GPiG~v~Sp------~l~g~~dYk~lPyl-ssLCg~C 354 (450) T TIGR00273 282 GRSNILATEKFREVLACIRCGACLNECPVYRHIGGHAYGSIYPGPIGAVVSP------LLGGYNDYKELPYL-SSLCGAC 354 (450) T ss_pred CCHHHHCCHHHHHHHHHHHHHHHHCCCCHHHCCCCCCCCCCCCCHHHEEHHH------HHHHHHHHHCCCCH-HCHHCCC T ss_conf 8211113656898741330123411486011146602453158712311113------54104433112331-0341067 Q ss_pred ------HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHH--HHHHC Q ss_conf ------9899999999999875200001367655764488684689999999999753853111023431104--65320 Q gi|254780670|r 176 ------AIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCA--ASAFF 247 (424) Q Consensus 176 ------al~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~A--Asefy 247 (424) -|-+ .+....+++..-++|...-.|-...-+|... .=||++ -|--.+. ++ -..+++-.| |..|| T Consensus 355 ~evCPv~IpL-~~Lir~~R~~~ve~GrGvikGakstq~~~~E--klA~K~--fa~~aS~--~~-lwk~g~k~a~~A~~~F 426 (450) T TIGR00273 355 REVCPVKIPL-AELIREHRSDIVEKGRGVIKGAKSTQASKAE--KLAMKM--FAKAASD--GA-LWKVGLKMAAAAAQFF 426 (450) T ss_pred CCCCCCCCCH-HHHHHHHHHHHHHCCCCEEECCCCCCCHHHH--HHHHHH--HHHHCCC--HH-HHHHHHHHHHHHCCCC T ss_conf 8888511087-9999998788630685144156432451567--788877--7642474--02-5567765547652656 Q ss_pred C-CCCEE Q ss_conf 2-78511 Q gi|254780670|r 248 K-KEKYI 253 (424) Q Consensus 248 ~-~~kY~ 253 (424) . .+|-. T Consensus 427 ~~~gK~~ 433 (450) T TIGR00273 427 SELGKLL 433 (450) T ss_pred CCCCCCC T ss_conf 6877523 No 165 >COG1806 Uncharacterized protein conserved in bacteria [Function unknown] Probab=34.50 E-value=32 Score=14.70 Aligned_cols=89 Identities=20% Similarity=0.200 Sum_probs=51.4 Q ss_pred EEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHH-----HHHHHHHHHHHH Q ss_conf 699776763104237998864238836850430000214888521003865223425453003-----899999999999 Q gi|254780670|r 279 IYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGS-----VSETLNTIEKAQ 353 (424) Q Consensus 279 I~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGT-----vset~ea~~~a~ 353 (424) |..--=||-.+|.+.-. |-+.+.. .|+| .+.+|+||.+==++. -+.+=+++|---+ -.|..++=++++ T Consensus 169 ikaAN~PlVpe~~~p~~-L~~~~~~--~i~G---Lti~peRL~~IR~eR-L~~~~~~~~s~Ya~~~~~~eEl~~ae~l~~ 241 (273) T COG1806 169 IKAANYPLVPEDPEPDE-LPAALKP--LLFG---LTISPERLSAIREER-LKSLGLRENSRYASLDQCREELAYAEALFR 241 (273) T ss_pred CHHCCCCCCCCCCCHHH-HHHCCCC--EEEE---EECCHHHHHHHHHHH-HHCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 03306785789988244-2103645--0788---863899999999987-421478875641249999999999999999 Q ss_pred HCCCEEE-EECCCCCCCHHHHH Q ss_conf 8798699-95078887235899 Q gi|254780670|r 354 MAGYPSI-ISHRSGETEDHTIA 374 (424) Q Consensus 354 ~~g~~~i-vShRSGETeD~~ia 374 (424) .+||-+| +|+||=|-.-+-|- T Consensus 242 r~~~pvidvt~~SIEEtAa~Il 263 (273) T COG1806 242 RNGIPVIDVTNKSIEETAAKIL 263 (273) T ss_pred HHCCCEEECCCCHHHHHHHHHH T ss_conf 8199778455115999999999 No 166 >COG2805 PilT Tfp pilus assembly protein, pilus retraction ATPase PilT [Cell motility and secretion / Intracellular trafficking and secretion] Probab=34.34 E-value=21 Score=16.02 Aligned_cols=29 Identities=21% Similarity=0.275 Sum_probs=18.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCC Q ss_conf 686568789999999876544869977676 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGM 286 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~ 286 (424) .|+.-|-..||||..+= ..+-|+.||||. T Consensus 136 SGKSTTlAamId~iN~~-~~~HIlTIEDPI 164 (353) T COG2805 136 SGKSTTLAAMIDYINKH-KAKHILTIEDPI 164 (353) T ss_pred CCHHHHHHHHHHHHHCC-CCCCEEEECCCH T ss_conf 96787999999998414-775168723746 No 167 >PRK00103 SPOUT methyltransferase superfamily protein; Provisional Probab=34.08 E-value=33 Score=14.65 Aligned_cols=36 Identities=14% Similarity=0.075 Sum_probs=28.5 Q ss_pred ECCCCCCCCHHHHHHHHHHHHHHCC-EEEEE---CCCCCC Q ss_conf 1146865687899999998765448-69977---676310 Q gi|254780670|r 254 LKGENLELQPNEMASYLANLVDQYP-IYSIE---DGMSED 289 (424) Q Consensus 254 ~~~~~~~~t~~elid~y~~l~~~yP-I~sIE---DP~~e~ 289 (424) ++-+|..+||.++.+++.+|.+... |++|. |+|+++ T Consensus 73 LDe~Gk~~sS~~fa~~l~~~~~~g~~i~FiIGGa~G~~~~ 112 (156) T PRK00103 73 LDIRGKPWTSEQFAKELERWRLDGRDVTFVIGGPEGLSPA 112 (156) T ss_pred EECCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCHH T ss_conf 9179980576999999999986399669999788876989 No 168 >cd07094 ALDH_F21_LactADH-like ALDH subfamily: NAD+-dependent, lactaldehyde dehydrogenase, ALDH family 21 A1, and related proteins. ALDH subfamily which includes Tortula ruralis aldehyde dehydrogenase ALDH21A1 (RNP123), and NAD+-dependent, lactaldehyde dehydrogenase (EC=1.2.1.22) and like sequences. Probab=34.07 E-value=15 Score=17.22 Aligned_cols=24 Identities=21% Similarity=0.343 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 789999999876544869977676310 Q gi|254780670|r 263 PNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 263 ~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) -+++++-+.+-++++ -+=||++++ T Consensus 277 ~d~f~~~l~~~~~~l---~vG~p~~~~ 300 (453) T cd07094 277 YDEFIEAFVAAVKKL---KVGDPLDED 300 (453) T ss_pred HHHHHHHHHHHHHHC---CCCCHHHCC T ss_conf 799999999877625---467922136 No 169 >TIGR00653 GlnA glutamine synthetase, type I; InterPro: IPR004809 Glutamine synthetase type I (or glutamate-ammonia ligase) has a dodecameric form, which can be subdivided into 1-alpha and 1-beta forms. The phylogeny of the 1-alpha and 1-beta forms appears polyphyletic. Escherichia coli, Synechocystis sp. (strain PCC 6803) , Aquifex aeolicus, and the crenarcheon Sulfolobus acidocaldarius have form 1-beta, while Bacillus subtilis, Thermotoga maritima, and various euryarchaea has form 1-alpha. The 1-beta dodecamer from the crenarcheon S. acidocaldarius differs from that in E. coli in that it is not regulated by adenylation. ; GO: 0004356 glutamate-ammonia ligase activity, 0009399 nitrogen fixation, 0005737 cytoplasm. Probab=33.78 E-value=15 Score=17.23 Aligned_cols=228 Identities=19% Similarity=0.222 Sum_probs=122.9 Q ss_pred ECCCCCCCCCCEEEEECCCCCCCC-CCHHHHHHHHHHHHHHHHHCC--CCH--------HH-HHHHHHHHHHCCCCCCCC Q ss_conf 344786647754676268986516-804899999998756789759--885--------57-898886465303664322 Q gi|254780670|r 35 MVPSGASTGIHEAFELRDQEKRYF-GKGVLKAIAFVNDEIRTALLG--CDA--------RD-QLLIDKIMIDLDGTPNKS 102 (424) Q Consensus 35 ~~PSGaStG~~Ea~elrD~~~~~~-G~gV~~Av~~in~~i~~~L~g--~~~--------~d-Q~~iD~~Li~lDgT~nks 102 (424) ..|.|.+. ...++.+|..- |+ =-=+.++. .++..|...|.. .++ +- |.+||=.--+|= T Consensus 169 ~~e~g~~~--n~g~~~~~kgG-YF~v~P~D~~~-D~R~e~v~~L~~~G~~~E~~HHEVA~aGQ~Eid~kf~~l~------ 238 (486) T TIGR00653 169 EYEGGSEL--NSGYKPRDKGG-YFPVPPTDTAV-DIRREMVLTLEQLGLDVEVHHHEVATAGQHEIDFKFDTLL------ 238 (486) T ss_pred CCCCCCCC--CCCCCCCCCCC-CCCCCCCCCHH-HHHHHHHHHHHHCCCEEEEEECCCCCCCCEEEECCHHHHH------ T ss_conf 46888861--24603477777-75876898888-9999999999865982888601014787425304647898------ Q ss_pred CCCCCCCCHHHHHHHHHHHHHCCCCHHHHCC-------CCCCCCCCCCCEEECCCCCCCCCCCCC------EEEEEECCC Q ss_conf 1232222024489987526634984888225-------324564754300000366455677661------025651488 Q gi|254780670|r 103 RLGANAILGVSLAVSKAAAQTSNLPLYKYLG-------GCSAHILPVPLMNILNGGIHADNALDF------QEFMIMPVG 169 (424) Q Consensus 103 ~lGaNailAvSlA~akA~A~~~~~pLy~yl~-------~~~~~~lP~P~~NiinGG~HA~~~l~i------QEfmIiP~g 169 (424) -.+=++.+|+|+. |.....||.|+++.=+-|.|--=.|.- +-.+-=|.+ T Consensus 239 ------------------~~AD~~~~yKyvvK~vA~~~Gk~ATFMPKPlfGdNGSGMH~H~SLWK~ePf~GenlF~g~~~ 300 (486) T TIGR00653 239 ------------------KTADDIQTYKYVVKNVAKKHGKTATFMPKPLFGDNGSGMHCHQSLWKGEPFDGENLFAGEEG 300 (486) T ss_pred ------------------HHHCCEEEHHHHHHHHHHHCCCEEEECCCCCCCCCCCCCEEEEEEEECCCCCCCEECCCCCC T ss_conf ------------------74050451478998999983887974587887726685548355632778888400068588 Q ss_pred CCC-H-HHHHHHHHHHHHHHHHHHHH---------C---CC-------------CCCC----CCCCCC-------CC-CC Q ss_conf 774-8-99989999999999987520---------0---00-------------1367----655764-------48-86 Q gi|254780670|r 170 AEN-I-REAIRMGAEVFHTLKKELKS---------K---GY-------------STNV----GDEGGF-------SP-NL 210 (424) Q Consensus 170 ~~s-f-~eal~~~~ev~~~lk~~L~~---------~---g~-------------~t~v----gDEGGf-------ap-~~ 210 (424) .-. + +.|+--..=|-+|-|.+..= | |+ |..| .-.-+= +| +. T Consensus 301 ~y~GLSe~ALyyIGGiLkHA~AL~A~tNPTvNSYKRLVPGYEAPv~lays~~NRSA~iRIP~~~~n~kA~R~E~R~PDPs 380 (486) T TIGR00653 301 EYAGLSETALYYIGGILKHAKALAAFTNPTVNSYKRLVPGYEAPVYLAYSARNRSALIRIPASRGNPKATRIEVRFPDPS 380 (486) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHCCCCCCCCCEEEECCEECCCCEEEEEEECCCCC T ss_conf 76543588888752087777888897656605522368861210230216778752587401640883369987078876 Q ss_pred CCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEEC-----CCCCCCCHHHHHHHHHHHHHHCCEEEEECC Q ss_conf 846899999999997538531110234311046532027851111-----468656878999999987654486997767 Q gi|254780670|r 211 KTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILK-----GENLELQPNEMASYLANLVDQYPIYSIEDG 285 (424) Q Consensus 211 ~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~-----~~~~~~t~~elid~y~~l~~~yPI~sIEDP 285 (424) .++.=|+ .|+=.||+-.=+. -...-|-++++-|.+. ..|..-.|..|-+=+..|-..|---..--- T Consensus 381 ~NPYLAF----AA~LmAGLDGIKn-----KI~PgeP~d~n~Y~l~peE~~e~GI~~lP~~L~eAL~~Le~D~Ef~~~~~v 451 (486) T TIGR00653 381 ANPYLAF----AALLMAGLDGIKN-----KIDPGEPVDKNLYELSPEELRELGIPQLPGSLEEALDELESDHEFLVLKGV 451 (486) T ss_pred CCHHHHH----HHHHHHHCCCCCC-----CCCCCCCCCCCCCCCCHHHHHHCCCCCCHHHHHHHHHHHHCCCCCHHHCCC T ss_conf 5678999----9999862012001-----458787310433358989997389011056799999997314620022032 Q ss_pred CCCCCHHHHHHHHH Q ss_conf 63104237998864 Q gi|254780670|r 286 MSEDDWHGWKILTN 299 (424) Q Consensus 286 ~~e~D~~gw~~l~~ 299 (424) |.++=++.|..+-. T Consensus 452 fg~~~~e~fie~Kr 465 (486) T TIGR00653 452 FGEEFIEAFIELKR 465 (486) T ss_pred CHHHHHHHHHHHHH T ss_conf 03789999987653 No 170 >PRK09276 aspartate aminotransferase; Provisional Probab=33.76 E-value=33 Score=14.61 Aligned_cols=30 Identities=10% Similarity=0.111 Sum_probs=22.0 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-.+ .+||.+|.+.-|+|=.+ T Consensus 179 TG~v~s~~~l~~l-~~la~~~~i~ii~DEiY 208 (385) T PRK09276 179 TGAVADLEFFEKV-VDFAKKYDIIVCHDNAY 208 (385) T ss_pred CCCHHHHHHHHHH-HHHHCCCCCEEEEHHHH T ss_conf 5511118899999-87632557569754442 No 171 >PRK13795 hypothetical protein; Provisional Probab=33.70 E-value=33 Score=14.61 Aligned_cols=17 Identities=24% Similarity=0.389 Sum_probs=7.4 Q ss_pred HHHHHHHHH--HHHHHHHH Q ss_conf 899999999--99987520 Q gi|254780670|r 177 IRMGAEVFH--TLKKELKS 193 (424) Q Consensus 177 l~~~~ev~~--~lk~~L~~ 193 (424) .||++.|.+ .+.+.+++ T Consensus 324 ~RWCckv~Kl~pl~~~i~~ 342 (630) T PRK13795 324 YRWCCKVCKLGPITRAIKS 342 (630) T ss_pred CCCCCCCCCCHHHHHHHHH T ss_conf 6435543100579999997 No 172 >TIGR01804 BADH betaine aldehyde dehydrogenase; InterPro: IPR011264 Under osmotic stress, betaine aldehyde dehydrogenase oxidises glycine betaine aldehyde into the osmoprotectant glycine betaine, via the second of two oxidation steps from exogenously supplied choline or betaine aldehyde. This choline-glycine betaine synthesis pathway can be found in Gram-positive and Gram-negative bacteria. In Escherichia coli, betaine aldehyde dehydrogenase (betB) is osmotically co-induced with choline dehydrogenase (betA) in the presence of choline. These dehydrogenases are located in a betaine gene cluster with the upstream choline transporter (betT) and transcriptional regulator (betI) . Similar to E. coli, betaine synthesis in Staphylococcus xylosus is also influenced by osmotic stress and the presence of choline with genes localised in a functionally equivalent gene cluster . Organisation of the betaine gene cluster in Sinorhizobium meliloti and Bacillus subtilis differs from that of E. coli by the absence of upstream choline transporter and transcriptional regulator homologues , . Additionally, Bacillus subtilis co-expresses a type II alcohol dehydrogenase with betaine aldehyde dehydrogenase instead of choline dehydrogenase as in E. coli, Staphylococcus xylosus, and Sinorhizobium meliloti . Betaine aldehyde dehydrogenase is a member of the aldehyde dehydrogenase family. ; GO: 0008802 betaine-aldehyde dehydrogenase activity, 0006578 betaine biosynthetic process. Probab=33.64 E-value=33 Score=14.60 Aligned_cols=299 Identities=22% Similarity=0.282 Sum_probs=173.2 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHC-------CCCHHHHHHHHHH-------------------HHHCCCCCCCCC-----C Q ss_conf 51680489999999875678975-------9885578988864-------------------653036643221-----2 Q gi|254780670|r 56 RYFGKGVLKAIAFVNDEIRTALL-------GCDARDQLLIDKI-------------------MIDLDGTPNKSR-----L 104 (424) Q Consensus 56 ~~~G~gV~~Av~~in~~i~~~L~-------g~~~~dQ~~iD~~-------------------Li~lDgT~nks~-----l 104 (424) ..+||=|+++++.+.+... .|- |+...+-...|-. .|++-++...++ | T Consensus 59 ~~Rgrilr~~ael~r~~~~-eLa~LEtLDTGK~l~Et~~aD~~~~Adv~~ffaGl~~~~~G~~i~l~~~~~f~~~~repl 137 (471) T TIGR01804 59 LERGRILRRIAELLRERNE-ELAKLETLDTGKALEETLVADMDDIADVFEFFAGLADKLGGEIIPLPIPDSFSYTRREPL 137 (471) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHCCCCCCCCCHHHHCCCHHHHHHHHHHHCCCCCCCCCCEECCCCCCCCCEEEEECC T ss_conf 4667999999999997556-765431466785202221015235654786631521036886404888875203786321 Q ss_pred CCCCC-----CHHHHHHHHHH-HHHCCC----------CHHH-HCCC-CCCCCCCCCCEEECCC-CCCCCCCCC-CEEEE Q ss_conf 32222-----02448998752-663498----------4888-2253-2456475430000036-645567766-10256 Q gi|254780670|r 105 GANAI-----LGVSLAVSKAA-AQTSNL----------PLYK-YLGG-CSAHILPVPLMNILNG-GIHADNALD-FQEFM 164 (424) Q Consensus 105 GaNai-----lAvSlA~akA~-A~~~~~----------pLy~-yl~~-~~~~~lP~P~~NiinG-G~HA~~~l~-iQEfm 164 (424) |.=+= +.+-+|++|.| |...+- ||=- -+.. ..+--||.=++||+.| |...+..|. -+++= T Consensus 138 Gv~~gIgaWNYP~~~A~WK~APALa~Gna~v~KPSE~TPLtal~~ae~~~EaGlP~Gv~Nvv~G~G~~vG~~L~~h~~va 217 (471) T TIGR01804 138 GVCVGIGAWNYPLQIASWKSAPALAAGNALVFKPSEITPLTALKVAELMEEAGLPDGVFNVVLGKGAEVGEELVNHKDVA 217 (471) T ss_pred EEEECCCCCHHHHHHHHHHCCHHHHHCCCEEECCCCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHCCCCEE T ss_conf 47744664303899986401457770582553588877368999999998548986527855088733337864187700 Q ss_pred EECCCCCCHHHHHHHHHHHHH-HHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHH Q ss_conf 514887748999899999999-9998752000013676557644886846899999999997538531110234311046 Q gi|254780670|r 165 IMPVGAENIREAIRMGAEVFH-TLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAA 243 (424) Q Consensus 165 IiP~g~~sf~eal~~~~ev~~-~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AA 243 (424) .+ ||.=+++.+-.|-. +-..-|++=.+ .-||=.|++= .++| |+-+|+|-|- T Consensus 218 ~v-----SfTGgv~tGkkim~~~Aa~~lk~vtl-----ELGGKsP~Iv-FdDA-----------------DLe~A~d~Al 269 (471) T TIGR01804 218 KV-----SFTGGVPTGKKIMANAAAEELKHVTL-----ELGGKSPLIV-FDDA-----------------DLELAVDQAL 269 (471) T ss_pred EE-----EECCCCHHHHHHHHHHHHHHHHHHHH-----HCCCCCCCCE-ECCC-----------------CHHHHHHHHH T ss_conf 58-----75277444789999877776533012-----0178987413-3156-----------------5788862666 Q ss_pred H-HHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCC-------------HHHHHHHHHHCCCCEEEEC Q ss_conf 5-3202785111146865687899999998765448699776763104-------------2379988642388368504 Q gi|254780670|r 244 S-AFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDD-------------WHGWKILTNKIGSNCQLVG 309 (424) Q Consensus 244 s-efy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D-------------~~gw~~l~~~~g~~~~IvG 309 (424) . .||..|.=.-.+.......+=--.|..+|+++--=+-|=||||++- ...+-+.-+.-|-++.+=| T Consensus 270 ~g~Ff~~GQVCs~GtRv~V~~~ik~~F~~~L~~R~~~I~~Gd~fd~~T~~GPliSa~~R~kv~~Yi~~gk~EgA~l~~GG 349 (471) T TIGR01804 270 LGNFFSAGQVCSNGTRVFVHKKIKEKFLARLVERVKRIKLGDGFDEATEMGPLISAEHRDKVLSYIEKGKEEGARLATGG 349 (471) T ss_pred HCCCCCCCCEECCCCEEECHHHHHHHHHHHHHHHHHHCCCCCCCCHHHCCCCCCCHHHHHHHHHHHHHHHHCCCEEEECC T ss_conf 22411467150477445324676599999999988621047886720247871017778899999987440498897778 Q ss_pred CCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHH---HHHHHHCCCHHE Q ss_conf 30000214888521003865223425453003899999999999879869995078887235899---899981998010 Q gi|254780670|r 310 DDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIA---DLAVATNCGQIK 386 (424) Q Consensus 310 DDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~ia---DLAVg~~a~~iK 386 (424) +-+ + ...--|+.-+.|.= .|+.-+.++.+|+.=|..|+|=-+-++||-.|. |=-.|+.++-. T Consensus 350 ~~p---~--------~~~l~~G~f~~PTv---f~d~~~~M~IvrEEiFGPV~~v~~F~~E~EvI~~ANdt~yGLAagv~- 414 (471) T TIGR01804 350 KVP---E--------DEGLDNGFFIEPTV---FTDCTDDMTIVREEIFGPVMSVLTFSSEDEVIARANDTEYGLAAGVF- 414 (471) T ss_pred CCC---C--------CCCCCCCCEECCEE---EECCCCCCEEEEEEECCCEEEEECCCCHHHHHHHCCCCCCCHHHHHH- T ss_conf 523---7--------77767872366736---75158997389875158676651358769999850775130367888- Q ss_pred ECCCCHHHHHHH Q ss_conf 075611448999 Q gi|254780670|r 387 TGSLARSDRIAK 398 (424) Q Consensus 387 ~G~~~R~ER~aK 398 (424) |-=+.|++|++. T Consensus 415 t~Dl~ra~rv~~ 426 (471) T TIGR01804 415 TADLERAHRVAA 426 (471) T ss_pred HHHHHHHHHHHH T ss_conf 876788899998 No 173 >COG0276 HemH Protoheme ferro-lyase (ferrochelatase) [Coenzyme metabolism] Probab=33.60 E-value=33 Score=14.59 Aligned_cols=67 Identities=12% Similarity=0.141 Sum_probs=35.0 Q ss_pred CCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCC Q ss_conf 61025651488774899989999999999987520000136765576448868468999999999975385311 Q gi|254780670|r 159 DFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAG 232 (424) Q Consensus 159 ~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g 232 (424) .+.+..++|.- +-|+.. ....+++.+.+.+++.... -+=-|.|++-+...-++++.+.|++.=-+.+ T Consensus 115 gv~~iv~~pLy-PqyS~s--Tt~s~~~~~~~al~~~~~~----~~i~~I~~~~~~p~yI~a~a~~I~~~~~~~~ 181 (320) T COG0276 115 GVERIVVLPLY-PQYSSS--TTGSYVDELARALKELRGQ----PKISTIPDYYDEPLYIEALADSIREKLAKHP 181 (320) T ss_pred CCCEEEEEECC-CCCCCC--CHHHHHHHHHHHHHHCCCC----CCEEEECCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 98759999777-630230--1899999999999853789----8668816755786899999999999997478 No 174 >KOG1354 consensus Probab=33.46 E-value=33 Score=14.58 Aligned_cols=19 Identities=26% Similarity=0.447 Sum_probs=10.3 Q ss_pred EEEEEECCCCCCHHHHHHH Q ss_conf 0256514887748999899 Q gi|254780670|r 161 QEFMIMPVGAENIREAIRM 179 (424) Q Consensus 161 QEfmIiP~g~~sf~eal~~ 179 (424) |-|-|+-+.-..++|-.+. T Consensus 197 ~sFnIVDIKP~nmEeLteV 215 (433) T KOG1354 197 QSFNIVDIKPANMEELTEV 215 (433) T ss_pred CCEEEEECCCCCHHHHHHH T ss_conf 7346897166678999988 No 175 >PRK07898 consensus Probab=33.39 E-value=33 Score=14.57 Aligned_cols=13 Identities=23% Similarity=0.337 Sum_probs=5.2 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9999999987520 Q gi|254780670|r 181 AEVFHTLKKELKS 193 (424) Q Consensus 181 ~ev~~~lk~~L~~ 193 (424) ...|..|++.|++ T Consensus 480 ~~L~~~l~~~L~~ 492 (902) T PRK07898 480 LDLADALDAELAR 492 (902) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999998616 No 176 >PRK11858 aksA trans-homoaconitate synthase; Reviewed Probab=33.24 E-value=34 Score=14.55 Aligned_cols=17 Identities=12% Similarity=0.036 Sum_probs=8.7 Q ss_pred CCHHHHHHHHHHHHHHH Q ss_conf 74899989999999999 Q gi|254780670|r 171 ENIREAIRMGAEVFHTL 187 (424) Q Consensus 171 ~sf~eal~~~~ev~~~l 187 (424) .|..+.+++..+..... T Consensus 109 ~t~~e~l~~~~~~v~~A 125 (378) T PRK11858 109 KTREEVLERLVEAVEYA 125 (378) T ss_pred CCHHHHHHHHHHHHHHH T ss_conf 99899999999999999 No 177 >PRK07778 consensus Probab=33.11 E-value=34 Score=14.54 Aligned_cols=32 Identities=19% Similarity=0.278 Sum_probs=23.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 686568789999999876544869977676310 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) .|..++.+++-++ .++|.+|.|+-|+|-.+++ T Consensus 175 TG~v~s~~~l~~l-~~la~~~~i~iisDEiY~~ 206 (386) T PRK07778 175 TGTVIAPEELAAI-ASWCEASGVRLISDEVYHG 206 (386) T ss_pred CCCCCHHHHHHHH-HHHHHHCCEEEEECCCHHH T ss_conf 8778507999999-9998537879994343253 No 178 >cd07669 BAR_SNX33 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexin 33. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. SNX33 interacts with Wiskott-Aldrich syndrome protein (WASP) and plays a role in the maintenance of cell shape and cell cycle progression. It modulates the shedding and endocytosis of cellular prion protein (PrP(c)) and amyloid precursor protein (APP). BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in Probab=33.10 E-value=9.1 Score=18.76 Aligned_cols=43 Identities=19% Similarity=0.328 Sum_probs=17.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHH Q ss_conf 7644886846899999999997538531110234311046532 Q gi|254780670|r 204 GGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAF 246 (424) Q Consensus 204 GGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsef 246 (424) |+|--.+...-+++..|..|.+.-++.+.....-+|...+..| T Consensus 44 ~~~KkEyqkiG~af~~Ls~afe~d~~~~~~~L~~Ai~~tG~~y 86 (207) T cd07669 44 GGFRKEFQKLGNAFQAISHSFQLDPPYSSEALNNAISHTGRTY 86 (207) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHH T ss_conf 3120999999999999999972698644247899999999999 No 179 >COG2145 ThiM Hydroxyethylthiazole kinase, sugar kinase family [Coenzyme metabolism] Probab=33.01 E-value=34 Score=14.53 Aligned_cols=22 Identities=5% Similarity=0.022 Sum_probs=17.2 Q ss_pred CCHHHHHHHHHHHHHHCCEEEE Q ss_conf 6878999999987654486997 Q gi|254780670|r 261 LQPNEMASYLANLVDQYPIYSI 282 (424) Q Consensus 261 ~t~~elid~y~~l~~~yPI~sI 282 (424) ...++.++....+..+|+.+-+ T Consensus 141 ~~~~~~~~~a~~~A~~~~~vvv 162 (265) T COG2145 141 DGAADAIEAAKKAAQKYGTVVV 162 (265) T ss_pred CCHHHHHHHHHHHHHHHCCEEE T ss_conf 4314699999999887591899 No 180 >PRK00090 bioD dithiobiotin synthetase; Reviewed Probab=32.61 E-value=34 Score=14.48 Aligned_cols=100 Identities=19% Similarity=0.222 Sum_probs=62.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEEC------CCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCC Q ss_conf 6865687899999998765448699776------7631042379988642388368504300002148885210038652 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIED------GMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVAN 330 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIED------P~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~N 330 (424) ++...+.+++.+.|.+|-++|.++-||= |+.++- .+..+.+.++-.+.+| T Consensus 84 ~g~~i~~~~i~~~~~~l~~~~d~vlvEGaGGl~~Pl~~~~--~~~Dla~~l~~pvILV---------------------- 139 (223) T PRK00090 84 EGVTIDLEKISAALRELAQQADLVLVEGAGGLLVPLTDDL--TLADLAAQLQLPVILV---------------------- 139 (223) T ss_pred HCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCCCCCCC--CHHHHHHHHCCCEEEE---------------------- T ss_conf 0984689999999999983189899946886556756787--8899999968898999---------------------- Q ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCC---EEEEECCCCCC--CHHHHHHHHHHHCCCH Q ss_conf 234254530038999999999998798---69995078887--2358998999819980 Q gi|254780670|r 331 AILIKPNQIGSVSETLNTIEKAQMAGY---PSIISHRSGET--EDHTIADLAVATNCGQ 384 (424) Q Consensus 331 ailiK~NQiGTvset~ea~~~a~~~g~---~~ivShRSGET--eD~~iaDLAVg~~a~~ 384 (424) --...||+..|+-.+..++..|. .+|+.+-.++. .++.+..|.--++.+. T Consensus 140 ----~~~~lG~inhtllt~eal~~~gl~v~GvI~N~~~~~~~~~~~~~~~l~~~~gvPv 194 (223) T PRK00090 140 ----VGVKLGCINHTLLTLEAIRARGLPLAGWVANGVPPEGLRHAENLATLAELLPAPL 194 (223) T ss_pred ----ECCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCCCE T ss_conf ----7698880999999899999689948999996858836677768999998549988 No 181 >PRK00366 ispG 4-hydroxy-3-methylbut-2-en-1-yl diphosphate synthase; Reviewed Probab=32.60 E-value=34 Score=14.48 Aligned_cols=17 Identities=35% Similarity=0.624 Sum_probs=9.3 Q ss_pred CCCCCCHHHHHHHHHHHCCC Q ss_conf 78887235899899981998 Q gi|254780670|r 364 RSGETEDHTIADLAVATNCG 383 (424) Q Consensus 364 RSGETeD~~iaDLAVg~~a~ 383 (424) --||.-| ||+.++-|.+ T Consensus 309 GPGEak~---ADiGiagg~~ 325 (367) T PRK00366 309 GPGEAKH---ADIGIAGGNG 325 (367) T ss_pred CCCHHHH---CCEEEECCCC T ss_conf 7650221---7772656988 No 182 >cd06301 PBP1_rhizopine_binding_like Periplasmic binding proteins specific to rhizopines. Periplasmic binding proteins specific to rhizopines, which are simple sugar-like compounds produced in the nodules induced by the symbiotic root nodule bacteria, such as Rhizobium and Sinorhizobium. Rhizopine-binding-like proteins from other bacteria are also included. Two inositol based rhizopine compounds are known to date: L-3-O-methly-scyllo-inosamine (3-O-MSI) and scyllo-inosamine. Bacterial strains that can metabolize rhizopine have a greater competitive advantage in nodulation and rhizopine synthesis is regulated by NifA/NtrA regulatory transcription activators which are maximally expressed at the onset of nitrogen fixation in bacteroids. The members of this group belong to the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily. Probab=32.58 E-value=34 Score=14.48 Aligned_cols=76 Identities=13% Similarity=0.120 Sum_probs=39.7 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCHHH---HHHHHHH--CCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEE Q ss_conf 687899999998765448-699776763104237---9988642--3883685043000021488852100386522342 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDWHG---WKILTNK--IGSNCQLVGDDLFVTNPERLHKGINEEVANAILI 334 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~~g---w~~l~~~--~g~~~~IvGDDL~vTn~~rl~~gi~~~~~Naili 334 (424) .++++-.+...+|+.++| +-.| |.-+|.-. ...+.+. .+.++.|+|=|......+.++.|. ..-.+.- T Consensus 165 ~~~~~a~~~~~~~L~~~~~~~aI---~~~nd~~a~Ga~~al~~~g~~~~~i~v~G~D~~~~~~~~i~~g~---~~~tv~q 238 (272) T cd06301 165 WSRAEAMDLMENWLSSGGKIDAV---VANNDEMALGAIMALKAAGKSDKDVPVAGIDGTPDALAAVKKGD---LDVTVFQ 238 (272) T ss_pred HHHHHHHHHHHHHHHHCCCCCEE---EECCCHHHHHHHHHHHHCCCCCCCEEEEEECCCHHHHHHHHCCC---CEEEEEC T ss_conf 23578999999999836777879---95582789999999998698999879999898599999987499---7699806 Q ss_pred CCCCHHHH Q ss_conf 54530038 Q gi|254780670|r 335 KPNQIGSV 342 (424) Q Consensus 335 K~NQiGTv 342 (424) .|-++|.. T Consensus 239 ~~~~~G~~ 246 (272) T cd06301 239 DAKGQGAG 246 (272) T ss_pred CHHHHHHH T ss_conf 99999999 No 183 >TIGR01735 FGAM_synt phosphoribosylformylglycinamidine synthase; InterPro: IPR010073 This entry represents a single-molecule form of phosphoribosylformylglycinamidine synthase, also called FGAM synthase, an enzyme of purine de novo biosynthesis. This form is found mostly in eukaryotes and proteobacteria. In Bacillus subtilis PurL (FGAM synthase II) and PurQ (FGAM synthase I) perform the equivalent function; the unrelated small protein PurS is also required and may be a third subunit .; GO: 0004642 phosphoribosylformylglycinamidine synthase activity, 0006189 'de novo' IMP biosynthetic process. Probab=32.49 E-value=34 Score=14.55 Aligned_cols=23 Identities=43% Similarity=0.592 Sum_probs=13.9 Q ss_pred CCEEEEECC-CCCCCCCCEEEEECCC Q ss_conf 976766344-7866477546762689 Q gi|254780670|r 29 GSTGRAMVP-SGASTGIHEAFELRDQ 53 (424) Q Consensus 29 g~~gra~~P-SGaStG~~Ea~elrD~ 53 (424) ..+-+|++| +||+||+- =|.||- T Consensus 313 HNhPTAiaPFpGA~TGaG--GeIRDe 336 (1401) T TIGR01735 313 HNHPTAIAPFPGASTGAG--GEIRDE 336 (1401) T ss_pred CCCCCCCCCCCCCCCCCC--CCEECC T ss_conf 788864677678887878--736436 No 184 >TIGR01761 thiaz-red thiazolinyl imide reductase; InterPro: IPR010091 This reductase is found associated with gene clusters for the biosynthesis of various non-ribosomal peptide derived natural products in which cysteine is cyclised to a thiazoline ring containing an imide double bond. Examples include yersiniabactin and pyochelin. Yersiniabactin is a virulence factor secreted by Yersinia pestis in iron-deficient microenvironments, in order to scavenge ferric ions . . Probab=32.28 E-value=35 Score=14.44 Aligned_cols=97 Identities=22% Similarity=0.350 Sum_probs=45.7 Q ss_pred CCHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCCCE---EECCCCCC---C----CCCCC-CEEEEEECCCCCCHHHHH Q ss_conf 2024489987526634984888225324564754300---00036645---5----67766-102565148877489998 Q gi|254780670|r 109 ILGVSLAVSKAAAQTSNLPLYKYLGGCSAHILPVPLM---NILNGGIH---A----DNALD-FQEFMIMPVGAENIREAI 177 (424) Q Consensus 109 ilAvSlA~akA~A~~~~~pLy~yl~~~~~~~lP~P~~---NiinGG~H---A----~~~l~-iQEfmIiP~g~~sf~eal 177 (424) |||===+=-|+-|...|+|||-+.-..... +-+=|+ .-|-||+= | ..+++ +||==+.|.. +.+-. T Consensus 43 lLAqGS~RSR~LA~~lGvpLy~~ve~lp~~-~~~ACvvvRSa~~Gg~G~~LA~aLL~RGi~VLqEHPl~p~d---~~~L~ 118 (378) T TIGR01761 43 LLAQGSERSRALAHRLGVPLYSEVEELPDD-IDIACVVVRSAIVGGKGSELARALLKRGIHVLQEHPLHPED---IAELL 118 (378) T ss_pred EEHHCCHHHHHHHHHHCCCCCCCHHHCCCC-CCEEEEEEEEEEECCCCHHHHHHHHHCCCEEEECCCCCHHH---HHHHH T ss_conf 100103689999998098410671004688-75558885225317972589999985675156328988778---99999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH Q ss_conf 9999999999987520000136765576448868468999999999 Q gi|254780670|r 178 RMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANS 223 (424) Q Consensus 178 ~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eA 223 (424) +. -.+.|--..| ..|=|.+.-...=+++..++ T Consensus 119 ~l-----------A~~~Gr~Y~v---NTFYPh~PAv~~Fi~~~~q~ 150 (378) T TIGR01761 119 RL-----------AERQGRRYLV---NTFYPHLPAVRRFIEYARQL 150 (378) T ss_pred HH-----------HHHCCCEEEE---CCCCCCCHHHHHHHHHHHHH T ss_conf 99-----------9970986642---06778733789999999999 No 185 >PRK09701 D-allose transporter subunit; Provisional Probab=32.26 E-value=35 Score=14.44 Aligned_cols=76 Identities=18% Similarity=0.277 Sum_probs=44.3 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCH--HHHHHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEEEEC Q ss_conf 687899999998765448-6997767631042--37998864238--836850430000214888521003865223425 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDW--HGWKILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAILIK 335 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~--~gw~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK 335 (424) .++++=.+.-.+|+..|| |..|- ..+|. -|..+--+..| .++.|+|=|....-.+.++.|- ..-.+.-. T Consensus 198 ~d~~~a~~~~~~~L~a~Pdi~~I~---~~nd~~a~Ga~~Al~~aG~~~~i~vvG~D~~~~~~~~I~~G~---i~atv~Q~ 271 (311) T PRK09701 198 WDRIKALDVATNVLQRNPNIKAIY---CANDTMAMGVAQAVANAGKTGKVLVVGTDGIPEARKMVEAGQ---MTATVAQN 271 (311) T ss_pred CCHHHHHHHHHHHHHCCCCCCEEE---ECCCHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHCCC---CEEEEEEC T ss_conf 658779999999995499976899---779726899999999669999839999789999999998598---27999739 Q ss_pred CCCHHHH Q ss_conf 4530038 Q gi|254780670|r 336 PNQIGSV 342 (424) Q Consensus 336 ~NQiGTv 342 (424) |-++|-. T Consensus 272 P~~~G~~ 278 (311) T PRK09701 272 PADIGAT 278 (311) T ss_pred HHHHHHH T ss_conf 9999999 No 186 >cd03870 M14_CPA Peptidase M14 Carboxypeptidase (CP) A (CPA) belongs to the A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPA enzymes generally favor hydrophobic residues. A/B subfamily enzymes are normally synthesized as inactive precursors containing preceding signal peptide, followed by a globular N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The procarboxypeptidase A (PCPA) is produced by the exocrine pancreas and stored as a stable zymogen in the pancreatic granules until secretion into the digestive tract occurs. This subfamily includes CPA1, CPA2 and CPA4 forms. Within these A forms, there are slightly different specificities, with CPA1 preferring aliphatic and small aromatic residues, and CPA2 p Probab=32.23 E-value=35 Score=14.44 Aligned_cols=29 Identities=14% Similarity=0.184 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 4899999998756789759885578988864 Q gi|254780670|r 61 GVLKAIAFVNDEIRTALLGCDARDQLLIDKI 91 (424) Q Consensus 61 gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~ 91 (424) +...++..++..+..- +.|+.-+.-+|+. T Consensus 69 s~~~~ly~i~~Ll~~y--~~d~~~~~ll~~~ 97 (301) T cd03870 69 TQATGVWFAKKITEDY--GQDPSFTAILDSM 97 (301) T ss_pred HHHHHHHHHHHHHHHH--CCCHHHHHHHHCC T ss_conf 8999999999999741--3588899998667 No 187 >PRK07777 aminotransferase; Validated Probab=32.20 E-value=35 Score=14.44 Aligned_cols=30 Identities=23% Similarity=0.258 Sum_probs=20.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-++ .+|+.+|.++-|+|=.. T Consensus 172 TG~v~s~e~l~~l-~~la~~~~i~ii~DE~Y 201 (386) T PRK07777 172 TGTVLSAAELAAI-AELAVEHDLLVITDEVY 201 (386) T ss_pred CCCCCCHHHHHHH-HHHHHHCCCEEECCCCH T ss_conf 8848889999999-99986429788657537 No 188 >TIGR00538 hemN oxygen-independent coproporphyrinogen III oxidase; InterPro: IPR004558 This family represents HemN, the oxygen-independent coproporphyrinogen III oxidase that replaces HemF function under anaerobic conditions. HemN catalyses the anaerobic transformation of coproporhyrinogen-III into protoporphyrinogen-IX during porphyrin biosynthesis. Several species, including Escherichia coli, Helicobacter pylori, and Aquifex aeolicus, have both a member of this family and a member of another, closely related family for which there is no evidence of coproporphyrinogen III oxidase activity, IPR004559 from INTERPRO. Members of this family have a perfectly conserved motif PYRT[SC]YP in a region N-terminal to the region of homology with the related uncharacterised protein.; GO: 0004109 coproporphyrinogen oxidase activity, 0006779 porphyrin biosynthetic process, 0005737 cytoplasm. Probab=31.86 E-value=35 Score=14.40 Aligned_cols=146 Identities=22% Similarity=0.265 Sum_probs=91.7 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHCCCC-CCCCHHHHHCHHHHHHCCCCCEEECCCCC-------------------CC Q ss_conf 55764488684689999999999753853-11102343110465320278511114686-------------------56 Q gi|254780670|r 202 DEGGFSPNLKTADSALDLIANSIEKAGYN-AGKDLLIALDCAASAFFKKEKYILKGENL-------------------EL 261 (424) Q Consensus 202 DEGGfap~~~~~eeaL~ll~eAi~~aGy~-~g~di~i~lD~AAsefy~~~kY~~~~~~~-------------------~~ 261 (424) +=||=.|++-|.++ ++.|++.|.++ +. .++|+-|++.+-=-++=++---.++-.|. .. T Consensus 109 HwGGGTP~YL~~~Q-~~~l~~~i~~~-F~nf~~daEiSiEidPR~~~~e~~~~L~~~GFNRlS~GvQDfd~~VQ~avnR~ 186 (462) T TIGR00538 109 HWGGGTPTYLSPEQ-IEELMKEIREA-FPNFSEDAEISIEIDPRYLTKEVIKALRDEGFNRLSFGVQDFDKEVQQAVNRI 186 (462) T ss_pred ECCCCCCCCCCHHH-HHHHHHHHHHH-HHHHCCCCEEEEEECCCCCCHHHHHHHHHCCCCEEECCEECCCHHHHHHHCCC T ss_conf 62789833378899-99999999987-32011584477652374137889999997589664235210785554443134 Q ss_pred CHHHHHHHHHHHHHHCCEEE----EECCCCCCCHHHHHHHHHHC----CCCEEEECCCEEECCHHHHHHHHHCCCCCEEE Q ss_conf 87899999998765448699----77676310423799886423----88368504300002148885210038652234 Q gi|254780670|r 262 QPNEMASYLANLVDQYPIYS----IEDGMSEDDWHGWKILTNKI----GSNCQLVGDDLFVTNPERLHKGINEEVANAIL 333 (424) Q Consensus 262 t~~elid~y~~l~~~yPI~s----IEDP~~e~D~~gw~~l~~~~----g~~~~IvGDDL~vTn~~rl~~gi~~~~~Nail 333 (424) -+.|||.-..+|+.++.+-| +.=+|-..--++|.+--++. ++|+= |=.-.||...+-.++-|++.+- T Consensus 187 QP~e~i~~~~~~~R~~Gf~SiN~DLIYGLP~Qt~esF~~Tl~~v~~LnPDRlA-vFnyAyvP~vk~~q~k~~~~~L---- 261 (462) T TIGR00538 187 QPEEMIFELMEKAREAGFESINLDLIYGLPKQTKESFEKTLEKVAELNPDRLA-VFNYAYVPWVKPAQRKIPEEAL---- 261 (462) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEH-HHHHHHHCCHHHHHCCCCHHCC---- T ss_conf 86899999999998669827874201388887867899999998531877001-2102221015778502762005---- Q ss_pred ECCCCHHHHHHHH----HHHHHHHHCCCEEE Q ss_conf 2545300389999----99999998798699 Q gi|254780670|r 334 IKPNQIGSVSETL----NTIEKAQMAGYPSI 360 (424) Q Consensus 334 iK~NQiGTvset~----ea~~~a~~~g~~~i 360 (424) -+.-|=| .+|..-.++||.-| T Consensus 262 ------PS~~~KL~Il~~~I~~L~~~gY~fI 286 (462) T TIGR00538 262 ------PSAEEKLEILEETIAFLTEAGYVFI 286 (462) T ss_pred ------CCHHHHHHHHHHHHHHHHHCCCEEE T ss_conf ------8878999999999999975797585 No 189 >PRK13794 hypothetical protein; Provisional Probab=31.80 E-value=35 Score=14.39 Aligned_cols=78 Identities=22% Similarity=0.319 Sum_probs=48.8 Q ss_pred HHHHHHHHHHHH------CCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 999999999975------38531110234311046532027851111468656878999999987654486997767631 Q gi|254780670|r 215 SALDLIANSIEK------AGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 215 eaL~ll~eAi~~------aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) +|..+|...+++ +.|..|+|=...|+.|...|-++ ...-+-++.+.--|.++|..++.++|.+-.+...- . T Consensus 232 eAv~fir~~~~~~~~pv~VSfSGGKDS~v~L~La~ka~~~~--~~vvF~DTglEfPeT~e~ve~v~~~~gv~ii~~~~-~ 308 (473) T PRK13794 232 NSIGFMRNTIVKIGKPPSVAYSGGKDSLATLLLALKAFGNE--FPVLFADTGLEFPETLENVEDVEELYGLEIIRTSS-G 308 (473) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCC--EEEEEEECCCCCHHHHHHHHHHHHHHCCCEEEECC-C T ss_conf 99999999998669983997068689999999999860897--16999608887676999999999985987899156-4 Q ss_pred CCHHHHH Q ss_conf 0423799 Q gi|254780670|r 289 DDWHGWK 295 (424) Q Consensus 289 ~D~~gw~ 295 (424) +-|+.+. T Consensus 309 ~fw~~~~ 315 (473) T PRK13794 309 EFWEKLE 315 (473) T ss_pred CHHHHHH T ss_conf 4778888 No 190 >TIGR00470 sepS O-phosphoseryl-tRNA(Cys) synthetase; InterPro: IPR005246 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology . The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric . Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices , and are mostly dimeric or multimeric, containing at least three conserved regions , , . However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases; these synthetases are further divided into three subclasses, a, b and c, according to sequence homology. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases . All known phenylalanyl-tRNA synthetases, excepting a monomeric mitochondrial form, have an alpha 2 beta 2 heterotetrameric structure. This family of archaeal proteins resembles known phenylalanyl-tRNA synthetase alpha chains but is longer and of unknown function; more generally, it resembles class 2 aminoacyl tRNA ligases. . Probab=31.75 E-value=35 Score=14.38 Aligned_cols=149 Identities=19% Similarity=0.289 Sum_probs=84.6 Q ss_pred CCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC----CCCCCCHH Q ss_conf 47543000003664556776610256514887748999899999999999875200001367655764----48868468 Q gi|254780670|r 139 ILPVPLMNILNGGIHADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGF----SPNLKTAD 214 (424) Q Consensus 139 ~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGf----ap~~~~~e 214 (424) .||.|=+=+=|+ ++.+=|= .|..--.+-.+---||+|.-||-=.+ ||+=-| |-++++ + T Consensus 109 gLPrPdvGlg~e------ki~~Ie~----~gid~~deK~E~L~Evfh~YKKG~id-------GDDLv~eia~aL~V~~-~ 170 (558) T TIGR00470 109 GLPRPDVGLGAE------KIRIIED----LGIDVSDEKKERLKEVFHRYKKGEID-------GDDLVYEIAEALDVED-E 170 (558) T ss_pred CCCCCCCCCCCH------HHHHHHH----CCCCCCCCCCHHHHHHHHHCCCCCCC-------CCHHHHHHHHHHCCCC-C T ss_conf 787886355602------7788887----07742101340357886314676667-------5418888876625675-1 Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCC----EEEEECCCCCCC Q ss_conf 9999999999753853111023431104653202785111146865687899999998765448----699776763104 Q gi|254780670|r 215 SALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYP----IYSIEDGMSEDD 290 (424) Q Consensus 215 eaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yP----I~sIEDP~~e~D 290 (424) .+|.+|.++.-+- +|. ++-.-.+..- ...|+.=+|.+ ..+.++|| +.||-=.|+.|- T Consensus 171 ~glkvLe~vFPEf-----kdL------------KP~S~tlTLR-SHMTsGWFItL-s~i~dk~~lPlKLFSIDRCFRREQ 231 (558) T TIGR00470 171 TGLKVLEEVFPEF-----KDL------------KPVSSTLTLR-SHMTSGWFITL-SSIADKLKLPLKLFSIDRCFRREQ 231 (558) T ss_pred CHHHHHHHHCCCH-----HCC------------CCCCCCCCCH-HHCCCHHHHHH-HHHHHHCCCCCEEECCCCCCCCCC T ss_conf 1021333308640-----016------------8865543200-10100278889-878751589711220032314466 Q ss_pred HHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHH Q ss_conf 2379988642388368504300002148885210 Q gi|254780670|r 291 WHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGI 324 (424) Q Consensus 291 ~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi 324 (424) -|.-+.|..=++..|-||+.|+-+=.=|-+.+|. T Consensus 232 ~ED~shLmtYhSASCVvVdeevsvDdGKaVAEgL 265 (558) T TIGR00470 232 KEDASHLMTYHSASCVVVDEEVSVDDGKAVAEGL 265 (558) T ss_pred CCCCCCCCCEEEEEEEEECCCCCCCCCHHHHHHH T ss_conf 4353100113454667857712436426889999 No 191 >PRK01122 potassium-transporting ATPase subunit B; Provisional Probab=31.73 E-value=35 Score=14.38 Aligned_cols=91 Identities=22% Similarity=0.322 Sum_probs=44.3 Q ss_pred EEEEECCCCCCCHHHHHHHHHHCCCCE-EEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 699776763104237998864238836-8504300002148885210038652234254530038999999999998798 Q gi|254780670|r 279 IYSIEDGMSEDDWHGWKILTNKIGSNC-QLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSETLNTIEKAQMAGY 357 (424) Q Consensus 279 I~sIEDP~~e~D~~gw~~l~~~~g~~~-~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset~ea~~~a~~~g~ 357 (424) ++.++||.+++=-+..+.|.+. |-++ .|-||.-.+...---+-||+.-.++. |=-+=++.++..|+.|. T Consensus 444 ~i~L~D~iRp~a~eaI~~Lr~~-GI~vvMITGDn~~TA~aIA~elGIDd~~A~~---------tPedKl~iVk~LQ~~G~ 513 (684) T PRK01122 444 VIYLKDIVKPGIKERFAQLRKM-GIKTVMITGDNPLTAAAIAAEAGVDDFLAEA---------TPEDKLALIRQEQAEGR 513 (684) T ss_pred EEEECCCCCHHHHHHHHHHHHC-CCEEEEECCCCHHHHHHHHHHHCCHHHHHCC---------CHHHHHHHHHHHHHCCC T ss_conf 9972688975499999999987-9929999898969999999984956556579---------99999999999984859 Q ss_pred EEEEECCCCC-CCH---HHHHHHHHHHCC Q ss_conf 6999507888-723---589989998199 Q gi|254780670|r 358 PSIISHRSGE-TED---HTIADLAVATNC 382 (424) Q Consensus 358 ~~ivShRSGE-TeD---~~iaDLAVg~~a 382 (424) .+.+- |+ +-| ---||..+++|+ T Consensus 514 vVAMt---GDGvNDAPALa~ADVGIAMgs 539 (684) T PRK01122 514 LVAMT---GDGTNDAPALAQADVGVAMNS 539 (684) T ss_pred EEEEE---CCCCCHHHHHHHCCEEEEECC T ss_conf 79996---288215888840997787377 No 192 >PTZ00295 glucosamine-fructose-6-phosphate aminotransferase; Provisional Probab=31.67 E-value=35 Score=14.41 Aligned_cols=32 Identities=22% Similarity=0.299 Sum_probs=27.5 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 52234254530038999999999998798699 Q gi|254780670|r 329 ANAILIKPNQIGSVSETLNTIEKAQMAGYPSI 360 (424) Q Consensus 329 ~NailiK~NQiGTvset~ea~~~a~~~g~~~i 360 (424) -+.++|=..|-|--.+|+.|++.|++.|..++ T Consensus 422 ~~~lvI~ISQSGETADTLaALr~AK~~G~~tl 453 (691) T PTZ00295 422 EHAGVIFISQSGETRDVIKACKLAEDLGIPKL 453 (691) T ss_pred CCCEEEEECCCCCCHHHHHHHHHHHHCCCCEE T ss_conf 77539997577885679999999997699389 No 193 >PRK05764 aspartate aminotransferase; Provisional Probab=31.57 E-value=36 Score=14.36 Aligned_cols=30 Identities=13% Similarity=0.241 Sum_probs=22.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-++ .+||.+|.++-|+|=.+ T Consensus 177 tG~v~s~~~l~~l-~~~a~~~~i~ii~DEiY 206 (389) T PRK05764 177 TGAVYSKEELEAI-ADVAVEHDIWVLSDEIY 206 (389) T ss_pred CCCCCCHHHHHHH-HHHHHHCCCEEEEECCC T ss_conf 8867577999999-99988635327510134 No 194 >TIGR02967 guan_deamin guanine deaminase; InterPro: IPR014311 This entry describes guanine deaminase, which hydrolyzes guanine to xanthine and ammonia. Xanthine can then be converted to urate by xanthine dehydrogenase. In some bacteria, the guanine deaminase gene is found near the xdhABC genes for xanthine dehydrogenase. Non-homologous forms of guanine deaminase also exist, as well as more distantly related homologs which are not included in this entry. . Probab=31.49 E-value=36 Score=14.35 Aligned_cols=116 Identities=21% Similarity=0.287 Sum_probs=70.4 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC-CHHHHHHHHHHHHHHCCCCC--CCCHHHHHC-HHHHHH Q ss_conf 74899989999999999987520000136765576448868-46899999999997538531--110234311-046532 Q gi|254780670|r 171 ENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLK-TADSALDLIANSIEKAGYNA--GKDLLIALD-CAASAF 246 (424) Q Consensus 171 ~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~-~~eeaL~ll~eAi~~aGy~~--g~di~i~lD-~AAsef 246 (424) ..|++ ..-+-+|+..+=+-|-..|-+|. --+- -..+.-+.+-++.++-+... || ..|| =-|=++ T Consensus 89 ~rF~D-~~~A~~vA~~Fl~eLLrnGTTTA--------~vf~TVH~~SvdaLF~~a~~~~~R~i~GK---V~MDlRnaP~~ 156 (426) T TIGR02967 89 ARFAD-PAHAEEVAEFFLDELLRNGTTTA--------LVFGTVHPESVDALFEAAEKRGMRMIAGK---VLMDLRNAPDY 156 (426) T ss_pred HHCCC-HHHHHHHHHHHHHHHHHCCCEEE--------EEEEEEHHHHHHHHHHHHHHCCCEEEECC---HHCCCCCCCHH T ss_conf 31268-78999999999999851696477--------33100028899999999987089078710---13153567642 Q ss_pred C-------------------C-CC-CEEECCC--CCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCHH-HH-HHHH Q ss_conf 0-------------------2-78-5111146--86568789999999876544869977676310423-79-9886 Q gi|254780670|r 247 F-------------------K-KE-KYILKGE--NLELQPNEMASYLANLVDQYPIYSIEDGMSEDDWH-GW-KILT 298 (424) Q Consensus 247 y-------------------~-~~-kY~~~~~--~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~-gw-~~l~ 298 (424) | . +| +-.|... ..--++.|+.+...+|.++||=++|-==+.|++|| .| ++|- T Consensus 157 L~DT~e~gy~~s~~LI~~WhgLkGnR~~YavTPRFAPTSs~eQL~~aG~L~~e~Pd~~~QTHLSEN~~Ei~wV~eLf 233 (426) T TIGR02967 157 LRDTAESGYDESKALIERWHGLKGNRLLYAVTPRFAPTSSPEQLAAAGELLKEYPDVYVQTHLSENKDEIAWVKELF 233 (426) T ss_pred HCCCHHHHHHHHHHHHHHHHHHCCCCCCCEECCCCCCCCCHHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHC T ss_conf 32888888899999999987420563673354667767868899999999975898179810004856899999848 No 195 >TIGR02050 gshA_cyan_rel uncharacterized enzyme; InterPro: IPR011793 This family represents a division of a larger family, the other branch of which is predicted to act as glutamate--cysteine ligase (the first of two enzymes in glutathione biosynthesis) in the cyanobacteria. Species containing this protein, however, are generally not believed to make glutathione, and the function is unknown.. Probab=31.49 E-value=26 Score=15.35 Aligned_cols=38 Identities=18% Similarity=0.123 Sum_probs=20.6 Q ss_pred CCEEEEE--ECCCCCCHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 6102565--1488774899989999999999987520000 Q gi|254780670|r 159 DFQEFMI--MPVGAENIREAIRMGAEVFHTLKKELKSKGY 196 (424) Q Consensus 159 ~iQEfmI--iP~g~~sf~eal~~~~ev~~~lk~~L~~~g~ 196 (424) .|+|.|| .-.-..+..||++...++-..|+....+-|+ T Consensus 48 E~~~~~vE~at~v~~~~~eA~~~~~~~r~~l~~~A~~~G~ 87 (297) T TIGR02050 48 ELFESQVELATPVCTTLAEALAELREVRARLVEAASDLGL 87 (297) T ss_pred HHHHCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 2221444465684478789999999999999999866486 No 196 >pfam03828 PAP_assoc Poly(A) polymerase. This domain is found in poly(A) polymerases and has been shown to have polynucleotide adenylyltransferase activity. Proteins in this family have been located to both the nucleus and cytoplasm. Probab=31.43 E-value=25 Score=15.44 Aligned_cols=10 Identities=30% Similarity=0.358 Sum_probs=8.3 Q ss_pred CEEEEECCCC Q ss_conf 8699776763 Q gi|254780670|r 278 PIYSIEDGMS 287 (424) Q Consensus 278 PI~sIEDP~~ 287 (424) ..++|||||+ T Consensus 47 ~~l~IeDPf~ 56 (56) T pfam03828 47 SPLCIEDPFD 56 (56) T ss_pred CEEEEECCCC T ss_conf 4799969999 No 197 >cd05014 SIS_Kpsf KpsF-like protein. KpsF is an arabinose-5-phosphate isomerase which contains SIS (Sugar ISomerase) domains. SIS domains are found in many phosphosugar isomerases and phosphosugar binding proteins. KpsF catalyzes the reversible reaction of ribulose 5-phosphate to arabinose 5-phosphate. This is the second step in the CMP-Kdo biosynthesis pathway. Probab=31.32 E-value=36 Score=14.33 Aligned_cols=55 Identities=16% Similarity=0.240 Sum_probs=43.9 Q ss_pred CCEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCC Q ss_conf 5223425453003899999999999879869995078887235899899981998 Q gi|254780670|r 329 ANAILIKPNQIGSVSETLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCG 383 (424) Q Consensus 329 ~NailiK~NQiGTvset~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~ 383 (424) -|-++|=..|-|.-.|++++++.|++.|..+|.=.+..+..=.-++|+.+-+.+. T Consensus 47 ~~Dv~i~iS~SG~T~e~~~~~~~ak~~g~~vI~iT~~~~S~La~~ad~~l~~~~~ 101 (128) T cd05014 47 PGDVVIAISNSGETDELLNLLPHLKRRGAPIIAITGNPNSTLAKLSDVVLDLPVE 101 (128) T ss_pred CCCEEEEEECCCCCHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHCCEEEECCCC T ss_conf 9999999979999867999999998637858999879999679968989977998 No 198 >COG4360 APA2 ATP adenylyltransferase (5',5'''-P-1,P-4-tetraphosphate phosphorylase II) [Nucleotide transport and metabolism] Probab=31.21 E-value=27 Score=15.24 Aligned_cols=40 Identities=20% Similarity=0.369 Sum_probs=31.0 Q ss_pred EEECCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHH Q ss_conf 0000366455677661025651488774899989999999 Q gi|254780670|r 145 MNILNGGIHADNALDFQEFMIMPVGAENIREAIRMGAEVF 184 (424) Q Consensus 145 ~NiinGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~~ev~ 184 (424) +-.-|+|.||+...+-.-.-|+|..+..|.+.+-.+.+-| T Consensus 140 lvFYNsGp~aGaSq~HkHLQi~pmPfv~~~~~lpn~~dhF 179 (298) T COG4360 140 LVFYNSGPIAGASQDHKHLQIVPMPFVAFQDQLPNGKDHF 179 (298) T ss_pred EEEECCCCCCCCCCCCCCEEEEECCCCCCCCCCCCHHHHC T ss_conf 3786578776767776620676036644654357627640 No 199 >KOG1016 consensus Probab=31.03 E-value=23 Score=15.78 Aligned_cols=82 Identities=26% Similarity=0.387 Sum_probs=48.7 Q ss_pred CCCCC--CCCEEEEEECCCCCCHHHHHH-------------HHHHHHHHHH-HHHHHCCCC--CCCCCCCCCCCC--CCC Q ss_conf 55677--661025651488774899989-------------9999999999-875200001--367655764488--684 Q gi|254780670|r 153 HADNA--LDFQEFMIMPVGAENIREAIR-------------MGAEVFHTLK-KELKSKGYS--TNVGDEGGFSPN--LKT 212 (424) Q Consensus 153 HA~~~--l~iQEfmIiP~g~~sf~eal~-------------~~~ev~~~lk-~~L~~~g~~--t~vgDEGGfap~--~~~ 212 (424) .++++ ..+-|+.|+-.+-++|..-.. ++.|.|+-|- +.+-++|.. |-.+--+||.-+ .|. T Consensus 339 ~sD~~vrpR~F~vf~LnD~~KT~~~Rakvi~~Wv~~GGVlLvGYemfRLL~lk~~~~~grpkkt~kr~~~~~i~~d~eD~ 418 (1387) T KOG1016 339 FSDTGVRPRSFEVFLLNDGVKTFDQRAKVIEQWVQTGGVLLVGYEMFRLLILKTLPKKGRPKKTLKRISSGFIKDDSEDQ 418 (1387) T ss_pred CCCCCCCCCEEEEEEECCCHHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCHHHH T ss_conf 66678765304789824730458999999998760397898618999999873023358742000146774037740146 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHH Q ss_conf 68999999999975385311102343 Q gi|254780670|r 213 ADSALDLIANSIEKAGYNAGKDLLIA 238 (424) Q Consensus 213 ~eeaL~ll~eAi~~aGy~~g~di~i~ 238 (424) ..|+..++.+|. .+||-|+.|+ T Consensus 419 ~qe~~~li~~AL----~~PGPDlVIC 440 (1387) T KOG1016 419 RQEAYSLIRSAL----LEPGPDLVIC 440 (1387) T ss_pred HHHHHHHHHHHH----CCCCCCEEEE T ss_conf 899999999984----3789976882 No 200 >pfam00239 Resolvase Resolvase, N terminal domain. The N-terminal domain of the resolvase family (this family) contains the active site and the dimer interface. The extended arm at the C-terminus of this domain connects to the C-terminal helix-turn-helix domain of resolvase - see pfam02796. Probab=31.00 E-value=36 Score=14.30 Aligned_cols=85 Identities=22% Similarity=0.230 Sum_probs=59.2 Q ss_pred CHHHHHHHHHHHHHHCCE-EEEECCCC--CCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCC Q ss_conf 878999999987654486-99776763--104237998864238836850430000214888521003865223425453 Q gi|254780670|r 262 QPNEMASYLANLVDQYPI-YSIEDGMS--EDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQ 338 (424) Q Consensus 262 t~~elid~y~~l~~~yPI-~sIEDP~~--e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQ 338 (424) +.+.+.+....++....| ++.|.... ..+..+|.+|...+.. ||-+.|+...|+.+ T Consensus 13 ~~e~Q~~~l~~~~~~~~i~~~~d~~~Sg~~~~Rp~~~~ll~~~~~-----gd~ivv~~~dRl~R---------------- 71 (139) T pfam00239 13 DLERQLEALRAYGKCEEIVEFEDKGVSGKKLDRPGLQRLLADLRE-----GDVLVVTKLDRLGR---------------- 71 (139) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHHHHHHHHCCC-----CCEEEEEECCHHCC---------------- T ss_conf 999999999998877698699984336886788899999972769-----99999986633028---------------- Q ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCCCCC Q ss_conf 0038999999999998798699950788872 Q gi|254780670|r 339 IGSVSETLNTIEKAQMAGYPSIISHRSGETE 369 (424) Q Consensus 339 iGTvset~ea~~~a~~~g~~~ivShRSGETe 369 (424) .+.+.+..++..++.|...++..-..++. T Consensus 72 --~~~~~~~~~~~l~~~gv~l~~~~~~~d~~ 100 (139) T pfam00239 72 --SLRDLLELLEELREKGVEVVSLDEGIDTS 100 (139) T ss_pred --CHHHHHHHHHHHHHCCCEEEEEECCCCCC T ss_conf --99999999999997898899911797798 No 201 >COG4018 Uncharacterized protein conserved in archaea [Function unknown] Probab=30.94 E-value=36 Score=14.29 Aligned_cols=23 Identities=26% Similarity=0.310 Sum_probs=12.5 Q ss_pred CCHHHHHHHHHHHHHCCCC--HHHH Q ss_conf 2024489987526634984--8882 Q gi|254780670|r 109 ILGVSLAVSKAAAQTSNLP--LYKY 131 (424) Q Consensus 109 ilAvSlA~akA~A~~~~~p--Ly~y 131 (424) |.-+++|+|.-+-+.--.| |++- T Consensus 91 iHvCaYAvADiAE~yGmrPiEl~r~ 115 (505) T COG4018 91 IHVCAYAVADIAENYGMRPIELFRR 115 (505) T ss_pred EEEHHHHHHHHHHHCCCCHHHHHHH T ss_conf 2311678877898739987999999 No 202 >COG1195 RecF Recombinational DNA repair ATPase (RecF pathway) [DNA replication, recombination, and repair] Probab=30.91 E-value=36 Score=14.29 Aligned_cols=60 Identities=23% Similarity=0.282 Sum_probs=44.6 Q ss_pred HHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCC Q ss_conf 999999876544869977676310423799886423883685043000021488852100386 Q gi|254780670|r 266 MASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEV 328 (424) Q Consensus 266 lid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~ 328 (424) .++++.+...+|||.-+-|=+.|=|-.--..|......++|+. .++|....+.....+++ T Consensus 292 e~~l~~~~~g~~PILLLDDv~seLD~~Rr~~Ll~~~~~~~Q~f---vT~t~~~~~~~~~~~~~ 351 (363) T COG1195 292 EIELLREETGEYPILLLDDVASELDDGRRAALLDTIELGVQVF---VTTTDLEDIDDNLDENA 351 (363) T ss_pred HHHHHHHHCCCCCEEEECHHHHHHCHHHHHHHHHHCCCCCEEE---EECCCHHHHHHHHHCCC T ss_conf 9999987428897898616567559899999974344587499---97257777666521254 No 203 >pfam03808 Glyco_tran_WecB Glycosyl transferase WecB/TagA/CpsF family. Probab=30.90 E-value=36 Score=14.29 Aligned_cols=48 Identities=27% Similarity=0.401 Sum_probs=27.7 Q ss_pred CHHHHHHHHHHHHHHCC---EEEEECCC-CCCCHHHHHHHHHHCCCCEEEEC Q ss_conf 87899999998765448---69977676-31042379988642388368504 Q gi|254780670|r 262 QPNEMASYLANLVDQYP---IYSIEDGM-SEDDWHGWKILTNKIGSNCQLVG 309 (424) Q Consensus 262 t~~elid~y~~l~~~yP---I~sIEDP~-~e~D~~gw~~l~~~~g~~~~IvG 309 (424) +.+.+......|-.+|| |+...+|+ +++|.+...+.-.+.+.++.+|| T Consensus 57 ~~~~~~~~~~~l~~~yP~l~i~g~~~g~f~~~e~~~i~~~I~~~~~div~vg 108 (172) T pfam03808 57 KPGVLEKAAARLRARYPGLRIVGTHDGYFSPEEEEAIIEAINASGPDLLFVG 108 (172) T ss_pred CHHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEEEEE T ss_conf 8899999999999887995599987999986899999999984599999995 No 204 >PRK09423 gldA glycerol dehydrogenase; Provisional Probab=30.83 E-value=37 Score=14.28 Aligned_cols=27 Identities=30% Similarity=0.468 Sum_probs=20.0 Q ss_pred CCCCCCCH----HHHHHHHHHHHHCCCCHHH Q ss_conf 23222202----4489987526634984888 Q gi|254780670|r 104 LGANAILG----VSLAVSKAAAQTSNLPLYK 130 (424) Q Consensus 104 lGaNailA----vSlA~akA~A~~~~~pLy~ 130 (424) -++..++| -+|=+|||.|...++|+.- T Consensus 83 ~~~D~IiavGGGS~iD~AKaia~~~~~P~i~ 113 (366) T PRK09423 83 NGCDVIIGIGGGKTLDTAKAVADYLGVPVVI 113 (366) T ss_pred CCCCEEEEECCHHHHHHHHHHHHHCCCCEEE T ss_conf 4999899937838877799999982899799 No 205 >PRK07556 consensus Probab=30.70 E-value=37 Score=14.26 Aligned_cols=18 Identities=11% Similarity=0.237 Sum_probs=7.7 Q ss_pred CCEEEECCCCHHHHHHHH Q ss_conf 522342545300389999 Q gi|254780670|r 329 ANAILIKPNQIGSVSETL 346 (424) Q Consensus 329 ~NailiK~NQiGTvset~ 346 (424) .|++.|.....-.+++.+ T Consensus 586 ~~GI~vD~~~L~~l~~~l 603 (977) T PRK07556 586 ERGISVDRQVLSRLSGEF 603 (977) T ss_pred HCCEEECHHHHHHHHHHH T ss_conf 569698699999999999 No 206 >TIGR01941 nqrF NADH:ubiquinone oxidoreductase, Na(+)-translocating, F subunit; InterPro: IPR010205 This entry represents the NqrF subunit of the six-protein, Na(+)-pumping NADH-quinone reductase of a number of marine and pathogenic Gram-negative bacteria.; GO: 0016655 oxidoreductase activity acting on NADH or NADPH quinone or similar compound as acceptor, 0051537 2 iron 2 sulfur cluster binding, 0006814 sodium ion transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=30.18 E-value=24 Score=15.69 Aligned_cols=37 Identities=24% Similarity=0.512 Sum_probs=26.6 Q ss_pred CCHHH--HHHHHHHHHHHCC-E---EEEECCCCCCCHHHHHHH Q ss_conf 68789--9999998765448-6---997767631042379988 Q gi|254780670|r 261 LQPNE--MASYLANLVDQYP-I---YSIEDGMSEDDWHGWKIL 297 (424) Q Consensus 261 ~t~~e--lid~y~~l~~~yP-I---~sIEDP~~e~D~~gw~~l 297 (424) +|..| .-+=|..|-.++| . +.+=||+-||-|.||.+= T Consensus 322 RS~~E~fY~edf~~L~~e~pNF~~H~aLSdP~PEDnw~Gwdkd 364 (425) T TIGR01941 322 RSKREIFYEEDFDQLEAENPNFKWHVALSDPQPEDNWTGWDKD 364 (425) T ss_pred CCCCCCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCC T ss_conf 2124432244478898648993799985486885100677767 No 207 >COG1247 Sortase and related acyltransferases [Cell envelope biogenesis, outer membrane] Probab=30.17 E-value=37 Score=14.20 Aligned_cols=123 Identities=17% Similarity=0.266 Sum_probs=64.6 Q ss_pred HHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCC--------CCEEEECCCEEECC-HHHHHHHHHCCCCCEEEE Q ss_conf 899999998765448699776763104237998864238--------83685043000021-488852100386522342 Q gi|254780670|r 264 NEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIG--------SNCQLVGDDLFVTN-PERLHKGINEEVANAILI 334 (424) Q Consensus 264 ~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g--------~~~~IvGDDL~vTn-~~rl~~gi~~~~~Naili 334 (424) ....++|...+..=-.++-+||.+-+.+..|-.-..+.| .+.+|+| |++- +=|-+.+...-.-+++-| T Consensus 13 ~~I~~IY~~~v~~~~a~~e~~~~~~~~~~~~~~~~~~~g~p~~V~~~~~g~v~G---~a~~~~fr~r~ay~~tve~SiYv 89 (169) T COG1247 13 EAILEIYNGAVENTAATFEEDPVSLEERAAWFSGRTRDGYPVVVAEEEDGKVLG---YASAGPFRERPAYRHTVELSIYL 89 (169) T ss_pred HHHHHHHHHHHHCCEEEEECCCCCHHHHHHHHHHCCCCCCEEEEEECCCCEEEE---EEEEEECCCCCCCCEEEEEEEEE T ss_conf 999999877653044798516999899999998336688549999747980999---99851004755655389999999 Q ss_pred CCCCHH--HHHHHH-HHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHH Q ss_conf 545300--389999-99999998798699950788872358998999819980100756114 Q gi|254780670|r 335 KPNQIG--SVSETL-NTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARS 393 (424) Q Consensus 335 K~NQiG--Tvset~-ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~ 393 (424) -|.+-| .=+..+ ..+..++..|+.++++....+.+ +-++.--..+|.+.|-+-.. T Consensus 90 ~~~~~g~GiG~~Ll~~Li~~~~~~g~~~lva~I~~~n~----aSi~lh~~~GF~~~G~~~~v 147 (169) T COG1247 90 DPAARGKGLGKKLLQALITEARALGVRELVAGIESDNL----ASIALHEKLGFEEVGTFPEV 147 (169) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCEEEEEEEECCCCC----HHHHHHHHCCCEEECCCCCC T ss_conf 85102466789999999999885884899999737873----76999997797773634534 No 208 >smart00148 PLCXc Phospholipase C, catalytic domain (part); domain X. Phosphoinositide-specific phospholipases C. These enzymes contain 2 regions (X and Y) which together form a TIM barrel-like structure containing the active site residues. Phospholipase C enzymes (PI-PLC) act as signal transducers that generate two second messengers, inositol-1,4,5-trisphosphate and diacylglycerol. The bacterial enzyme [6] appears to be a homologue of the mammalian PLCs. Probab=29.98 E-value=37 Score=14.24 Aligned_cols=46 Identities=26% Similarity=0.382 Sum_probs=28.2 Q ss_pred HHHHHHHHHHHHHCC-EEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEEC Q ss_conf 899999998765448-6997767631042379988642388368504300002 Q gi|254780670|r 264 NEMASYLANLVDQYP-IYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVT 315 (424) Q Consensus 264 ~elid~y~~l~~~yP-I~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vT 315 (424) =+-|..|.-....|| |+|+|+-...+ +-.++.+-+- .+.||-||.. T Consensus 75 l~~Ik~~aF~~s~yPvIlslE~hcs~~---qQ~~ma~~l~---~~~gd~L~~~ 121 (135) T smart00148 75 LEAIKDFAFVTSPYPVILSLENHCSPD---QQAKMAQMFK---EIFGDMLYTP 121 (135) T ss_pred HHHHHHHHCCCCCCCEEEEEECCCCHH---HHHHHHHHHH---HHHHHHHCCC T ss_conf 999998714389998899972689999---9999999999---9987765578 No 209 >TIGR03599 YloV DAK2 domain fusion protein YloV. This model describes a protein family that contains an N-terminal DAK2 domain (pfam02734), so named because of similarity to the dihydroxyacetone kinase family family. The GTP-binding protein CgtA (a member of the obg family) is a bacterial GTPase associated with ribosome biogenesis, and it has a characteristic extension (TIGR03595) in certain lineages. This protein family described here was found, by the method of partial phylognetic profiling, to have a phylogenetic distribution strongly correlated to that of TIGR03595. This correlation implies some form of functional coupling. Probab=29.96 E-value=38 Score=14.18 Aligned_cols=183 Identities=18% Similarity=0.249 Sum_probs=103.9 Q ss_pred HHHHHHHHCCCCC-CCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHH---HHHCCCCCEEECCCCCCC Q ss_conf 9998752000013-676557644886846899999999997538531110234311046---532027851111468656 Q gi|254780670|r 186 TLKKELKSKGYST-NVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAA---SAFFKKEKYILKGENLEL 261 (424) Q Consensus 186 ~lk~~L~~~g~~t-~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AA---sefy~~~kY~~~~~~~~~ 261 (424) .|++.|...|.+. -+||++-|-..+.++ ++-..|..+++- |-- .+|.| +--. .+.....+-. . T Consensus 241 ~lr~~L~~~GdSlvvv~~~~~~kVHvHT~-~pG~vle~~~~~-G~l--~~ikI--enM~~Q~~~~~~~~~~~-------~ 307 (530) T TIGR03599 241 KFRKELEKLGDSLVVVGDDDLVKVHVHTN-DPGLVLEYGQKY-GEL--IKIKI--ENMREQHEEILEGTSNE-------E 307 (530) T ss_pred HHHHHHHHHCCEEEEECCCCEEEEEEECC-CHHHHHHHHHHH-CCE--EEEEE--ECCHHHHHHHHCCCCCC-------C T ss_conf 99999997398599975798499999419-979999999983-936--79998--15648898763135434-------4 Q ss_pred CHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHH Q ss_conf 87899999998765448699776763104237998864238836850430000214888521003865223425453003 Q gi|254780670|r 262 QPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGS 341 (424) Q Consensus 262 t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGT 341 (424) -.....+|.|+++-.+ +|+.++.+.+|-...|.|..---...+-|-++|++--+..++|=||.-.. T Consensus 308 --------~~~~~~~~~vVAv~~G------~Gl~~if~~~Ga~~Vi~ggqt~NPSt~dil~Ai~~~~a~~ViiLPNn~Ni 373 (530) T TIGR03599 308 --------KKKEEKEYAIVAVAPG------EGIAELFKSLGADVVIEGGQTMNPSTEDILKAIEKVNAKNVFVLPNNKNI 373 (530) T ss_pred --------CCCCCCCEEEEEEECC------HHHHHHHHHCCCCEEECCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCHH T ss_conf --------5566677399999458------42999999779969965899899689999999985798739994799108 Q ss_pred HHHHHHHHHHHHHCCCEEEEECCC-------------C-CCCHHHHHHHHHHHCCCHHEECCCCHHHHHHHHH Q ss_conf 899999999999879869995078-------------8-8723589989998199801007561144899999 Q gi|254780670|r 342 VSETLNTIEKAQMAGYPSIISHRS-------------G-ETEDHTIADLAVATNCGQIKTGSLARSDRIAKYN 400 (424) Q Consensus 342 vset~ea~~~a~~~g~~~ivShRS-------------G-ETeD~~iaDLAVg~~a~~iK~G~~~R~ER~aKyN 400 (424) +.-+-++.++..+ . ..||..|| . ..+|+. ..+ .=.+..+++|...+..|-+++| T Consensus 374 ilaA~qaa~~~~k-~-V~VipT~si~qgiaAl~~fdp~~~~een~-~~M--~ea~~~v~tgeVT~Avrdt~~~ 441 (530) T TIGR03599 374 ILAAEQAAELADK-N-VVVIPTKTIVQGLAALLVFDPEASLEDNK-EAM--EEAIKAVRSGEVTYAVRDTKIN 441 (530) T ss_pred HHHHHHHHHHCCC-C-EEEEECCCHHHHHHHHHHHCCCCCHHHHH-HHH--HHHHHCCCEEEEEEEEEECEEC T ss_conf 8999999874289-7-89994798899999999718788878999-999--9999559789999998424567 No 210 >TIGR01977 am_tr_V_EF2568 cysteine desulfurase family protein; InterPro: IPR010969 This entry describes a subfamily of probable pyridoxal phosphate-dependent enzymes in the aminotransferase class V family. Related families contain members active as cysteine desulphurases, selenocysteine lyases, or both. The members of this family form a distinct clade and all are shorter at the N terminus. The function of members of this family are unknown.. Probab=29.93 E-value=38 Score=14.17 Aligned_cols=68 Identities=22% Similarity=0.304 Sum_probs=49.6 Q ss_pred HHHHHHC-CCCEEEE-CCCEEECCHHHHHHHHHCCCCCEEEECC--CCHHHHHHHHHHH-HHHHHCCCEEEEE Q ss_conf 9886423-8836850-4300002148885210038652234254--5300389999999-9999879869995 Q gi|254780670|r 295 KILTNKI-GSNCQLV-GDDLFVTNPERLHKGINEEVANAILIKP--NQIGSVSETLNTI-EKAQMAGYPSIIS 362 (424) Q Consensus 295 ~~l~~~~-g~~~~Iv-GDDL~vTn~~rl~~gi~~~~~NailiK~--NQiGTvset~ea~-~~a~~~g~~~ivS 362 (424) ..|.++. |=.|-|| ++.=..-++++|+++++++.+..++|-+ |=.||+-..=|.. ++|+++|.-+|+= T Consensus 107 ~~L~~~~~g~~~~iv~~~~~G~~~~e~i~~~~~~~kTk~iv~~HaSNV~G~ilpiEEIgR~lA~e~g~~~~~D 179 (384) T TIGR01977 107 EKLKEERIGVEITIVKCDNEGLISPERIKRAIKKNKTKLIVVSHASNVTGTILPIEEIGRELAKENGIFFILD 179 (384) T ss_pred HHHHHCCCCEEEEEECCCCCCCCCHHHHHHHCCCCCCCEEEEECHHHHCCCCCCHHHHHHHHHHHCCCEEEEE T ss_conf 9997359975899862788852167999985337883004322003010311257899999998679739983 No 211 >cd06302 PBP1_LsrB_Quorum_Sensing Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs. Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs from other bacteria. The members of this group are homologous to a family of periplasmic pentose/hexose sugar-binding proteins that function as the primary receptors for chemotaxis and transporters of many sugar based solutes in bacteria and archaea and that are a member of the type I periplasmic binding protein superfamily. LsrB binds a chemically distinct form of the AI-2 signal that lacks boron, in contrast to the Vibrio harveyi AI-2 signaling molecule that has an unusual furanosyl borate diester. Hence, many bacteria coordinate their gene expression according to the local density of their population by producing species specific AI-2. This process of quorum sensing allows LsrB to function as a periplasmic AI-2 binding p Probab=29.69 E-value=38 Score=14.14 Aligned_cols=76 Identities=14% Similarity=0.233 Sum_probs=44.4 Q ss_pred CCHHHHHHHHHHHHHHCC-EEEEECCCCCCCH--HHHHHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEEE-E Q ss_conf 687899999998765448-6997767631042--37998864238--8368504300002148885210038652234-2 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP-IYSIEDGMSEDDW--HGWKILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAIL-I 334 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP-I~sIEDP~~e~D~--~gw~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~Nail-i 334 (424) .++++=.+.-..|+.+|| |-.| |.-+|. .|=.+-.+..| +++.|+|-|....-.+.|+.| ...+.+ = T Consensus 167 ~~~~~a~~~~~~~L~~~Pdi~~I---~~~nd~~a~Ga~~Al~~aG~~~~V~vvG~D~~~~~~~~I~~G----~i~atv~Q 239 (298) T cd06302 167 DDADKSYQTAQELLKAYPDLKGI---IGPTSVGIPGAARAVEEAGLKGKVAVTGLGLPNQMAPYVKSG----AVKEFALW 239 (298) T ss_pred CCHHHHHHHHHHHHHHCCCCCEE---EECCCHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHHHHCC----CCEEEEEC T ss_conf 74899999999999759991299---978981699999999967999985999968879999999859----82699984 Q ss_pred CCCCHHHHH Q ss_conf 545300389 Q gi|254780670|r 335 KPNQIGSVS 343 (424) Q Consensus 335 K~NQiGTvs 343 (424) .|=++|..+ T Consensus 240 ~P~~~G~~a 248 (298) T cd06302 240 NPADLGYAA 248 (298) T ss_pred CHHHHHHHH T ss_conf 999999999 No 212 >cd02812 PcrB_like PcrB_like proteins. One member of this family, a protein from Archaeoglobus fulgidus, has been characterized as a (S)-3-O-geranylgeranylglyceryl phosphate synthase (AfGGGPS). AfGGGPS catalyzes the formation of an ether linkage between sn-glycerol-1-phosphate (G1P) and geranylgeranyl diphosphate (GGPP), the committed step in archaeal lipid biosynthesis. Therefore, it has been proposed that PcrB-like proteins are either prenyltransferases or are involved in lipoteichoic acid biosynthesis although the exact function is still unknown. Probab=29.66 E-value=31 Score=14.77 Aligned_cols=55 Identities=18% Similarity=0.297 Sum_probs=32.8 Q ss_pred CCCCCCCHHHHHHHHHHHHHHCCEEE---EECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHH Q ss_conf 46865687899999998765448699---77676310423799886423883685043000021488 Q gi|254780670|r 256 GENLELQPNEMASYLANLVDQYPIYS---IEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPER 319 (424) Q Consensus 256 ~~~~~~t~~elid~y~~l~~~yPI~s---IEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~r 319 (424) +.|. .-+.||+...++.+.+-|+++ |.+| +.-.. ..+.|-.+-|||.-++- +++. T Consensus 156 gSGa-~v~~e~V~~vk~~l~~~~LivGGGIrs~------e~a~~-~~~AgAD~IVvGn~iee-~~~~ 213 (219) T cd02812 156 YSGA-YGPPEVVRAVKKVLGDTPLIVGGGIRSG------EQAKE-MAEAGADTIVVGNIVEE-DPNA 213 (219) T ss_pred CCCC-CCCHHHHHHHHHHCCCCEEEEECCCCCH------HHHHH-HHHCCCCEEEECCCEEE-CHHH T ss_conf 6899-7999999999984679709992897999------99999-99869999998872240-6899 No 213 >pfam06924 DUF1281 Protein of unknown function (DUF1281). This family consists of several hypothetical enterobacterial proteins of around 170 residues in length. Members of this family are found in Escherichia coli, Salmonella typhimurium and Shigella species. The function of this family is unknown. Probab=29.65 E-value=35 Score=14.42 Aligned_cols=75 Identities=19% Similarity=0.325 Sum_probs=45.5 Q ss_pred HHHHHHHHHHHHCCCCHHHH---------CCCC--------CCCCCCCCCEEECCCCCCCCCCCCCEEEEEE-CCCCCCH Q ss_conf 44899875266349848882---------2532--------4564754300000366455677661025651-4887748 Q gi|254780670|r 112 VSLAVSKAAAQTSNLPLYKY---------LGGC--------SAHILPVPLMNILNGGIHADNALDFQEFMIM-PVGAENI 173 (424) Q Consensus 112 vSlA~akA~A~~~~~pLy~y---------l~~~--------~~~~lP~P~~NiinGG~HA~~~l~iQEfmIi-P~g~~sf 173 (424) |-+.+.+....-...|.|+| |+|. .....|.|...--+-|.-+..++.|+.++-. -.+..-- T Consensus 8 v~id~l~~wi~G~~~P~Yr~Ai~Qsi~LFLAGCAGiLkPt~~~~y~pyP~L~a~GtG~~s~~N~AF~~WL~lL~~~v~Ld 87 (134) T pfam06924 8 VFVDEIQQWVTGEVIPLYRHAIQQSIKLFLAGCAGILKPTKNIEYPPYPRLVAHGTGSASPENLAFEHWLTLLKKNVVLD 87 (134) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCC T ss_conf 68999999980688806999999999999970111156677776788718874588889842689999999998688788 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9998999999999 Q gi|254780670|r 174 REAIRMGAEVFHT 186 (424) Q Consensus 174 ~eal~~~~ev~~~ 186 (424) .+.+++.-++|++ T Consensus 88 ~~t~~~i~~l~lQ 100 (134) T pfam06924 88 EDTIRQIDRLYLQ 100 (134) T ss_pred HHHHHHHHHHHHH T ss_conf 7999999999997 No 214 >pfam02739 5_3_exonuc_N 5'-3' exonuclease, N-terminal resolvase-like domain. Probab=29.63 E-value=38 Score=14.14 Aligned_cols=94 Identities=22% Similarity=0.296 Sum_probs=58.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCE-EECCCCCCCCHHH---HHHHHHHHHHHCCEEEEECCCCC Q ss_conf 6899999999997538531110234311046532027851-1114686568789---99999987654486997767631 Q gi|254780670|r 213 ADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKY-ILKGENLELQPNE---MASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 213 ~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY-~~~~~~~~~t~~e---lid~y~~l~~~yPI~sIEDP~~e 288 (424) ..--+..|...+++ ++| ..+.++.|.-.+.| ..+-| .|+. ++..++++ ++.+..++++..+|..++-|=.| T Consensus 35 i~gf~~~l~~li~~--~~p-~~i~v~fD~~~~t~-R~elyp~YKa-~R~~~p~~l~~Q~~~i~~~l~~~gi~~~~~~g~E 109 (169) T pfam02739 35 VYGFLRMLLKLLKE--EKP-TYVAVVFDAGAPTF-RHELYEEYKA-NRAKTPDELPPQIPLIKELLDALGIPVLEVPGYE 109 (169) T ss_pred HHHHHHHHHHHHHH--CCC-CEEEEEECCCCCCC-HHHHHHHHHC-CCCCCCHHHHHHHHHHHHHHHHCEEEEEEECCCC T ss_conf 99999999999997--499-76999965999972-7887998861-9999937899999999999875014478524853 Q ss_pred -CCH-HHHHHHHHHCCCCEEEECCC Q ss_conf -042-37998864238836850430 Q gi|254780670|r 289 -DDW-HGWKILTNKIGSNCQLVGDD 311 (424) Q Consensus 289 -~D~-~gw~~l~~~~g~~~~IvGDD 311 (424) ||. ..+.......+.++.|+..| T Consensus 110 ADDvIatla~~~~~~~~~v~I~S~D 134 (169) T pfam02739 110 ADDVIGTLAKKAEKEGYDVRIVSGD 134 (169) T ss_pred HHHHHHHHHHHHHHCCCCEEEECCC T ss_conf 1567999999998589919998899 No 215 >cd07626 BAR_SNX9_like The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexin 9 and Similar Proteins. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. This subfamily consists of SNX9, SNX18, SNX33, and similar proteins. SNX9 is localized to plasma membrane endocytic sites and acts primarily in clathrin-mediated endocytosis, while SNX18 is localized to peripheral endosomal structures, and acts in a trafficking pathway that is clathrin-independent but relies on AP-1 and PACS1. BAR domains for Probab=29.49 E-value=34 Score=14.55 Aligned_cols=12 Identities=0% Similarity=0.254 Sum_probs=5.2 Q ss_pred HHHHHHHHHHHH Q ss_conf 899999998765 Q gi|254780670|r 264 NEMASYLANLVD 275 (424) Q Consensus 264 ~elid~y~~l~~ 275 (424) .++|+||...+. T Consensus 179 ~~QI~Fyq~I~~ 190 (199) T cd07626 179 QQQIEFYQKIAA 190 (199) T ss_pred HHHHHHHHHHHH T ss_conf 999999999999 No 216 >cd07668 BAR_SNX9 The Bin/Amphiphysin/Rvs (BAR) domain of Sorting Nexin 9. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. SNX9, also known as SH3PX1, is a cytosolic protein that interacts with proteins associated with clathrin-coated pits such as Cdc-42-associated tyrosine kinase 2 (ACK2). It binds class I polyproline sequences found in dynamin 1/2 and the WASP/N-WASP actin regulators. SNX9 is localized to plasma membrane endocytic sites and acts primarily in clathrin-mediated endocytosi Probab=28.60 E-value=39 Score=14.05 Aligned_cols=11 Identities=0% Similarity=0.317 Sum_probs=4.7 Q ss_pred HHHHHHHHHHH Q ss_conf 89999999876 Q gi|254780670|r 264 NEMASYLANLV 274 (424) Q Consensus 264 ~elid~y~~l~ 274 (424) .++|.||...+ T Consensus 187 ~~QI~Fyq~I~ 197 (210) T cd07668 187 EQQVQFYETIA 197 (210) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 217 >COG0761 lytB 4-Hydroxy-3-methylbut-2-enyl diphosphate reductase IspH [Lipid metabolism] Probab=28.40 E-value=40 Score=13.99 Aligned_cols=83 Identities=23% Similarity=0.221 Sum_probs=54.9 Q ss_pred CCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCC-------HHHHHHHHHHCCCCEEEECCCEEECCHHHH Q ss_conf 2785111146865687899999998765448699776763104-------237998864238836850430000214888 Q gi|254780670|r 248 KKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDD-------WHGWKILTNKIGSNCQLVGDDLFVTNPERL 320 (424) Q Consensus 248 ~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D-------~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl 320 (424) .+++..|- ..+.++.++..+....|..+||- |+.|..++= =+.-+.|..+. +-+.|||+ ---+|..|| T Consensus 156 ~~~~l~~~-tQTTls~ddt~~Iv~~l~~r~p~--~~~~~~~~ICyAT~nRQ~Avk~la~~~-Dl~iVVG~-~nSSNs~rL 230 (294) T COG0761 156 LPDKLAFV-TQTTLSVDDTAEIVAALKERFPK--IEVPPFNDICYATQNRQDAVKELAPEV-DLVIVVGS-KNSSNSNRL 230 (294) T ss_pred CCCCEEEE-EEEECCHHHHHHHHHHHHHHCCC--CCCCCCCCCCHHHHHHHHHHHHHHHCC-CEEEEECC-CCCCCHHHH T ss_conf 76427999-64314778899999999986833--348864551030066899999876038-89999789-898667899 Q ss_pred HHHHHCCCCCEEEEC Q ss_conf 521003865223425 Q gi|254780670|r 321 HKGINEEVANAILIK 335 (424) Q Consensus 321 ~~gi~~~~~NailiK 335 (424) .+--++...++.||- T Consensus 231 ~eiA~~~g~~aylId 245 (294) T COG0761 231 AEIAKRHGKPAYLID 245 (294) T ss_pred HHHHHHHCCCEEEEC T ss_conf 999998599818968 No 218 >COG5310 Homospermidine synthase [Secondary metabolites biosynthesis, transport, and catabolism] Probab=28.32 E-value=32 Score=14.67 Aligned_cols=109 Identities=27% Similarity=0.379 Sum_probs=54.3 Q ss_pred HHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCC---CCHHHH--HCHHHHHHCCCCCEEEC----- Q ss_conf 99987520000136765576448868468999999999975385311---102343--11046532027851111----- Q gi|254780670|r 186 TLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAG---KDLLIA--LDCAASAFFKKEKYILK----- 255 (424) Q Consensus 186 ~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g---~di~i~--lD~AAsefy~~~kY~~~----- 255 (424) ++|++|+- +-+++|- -||+.++.-+..++++|.- |.+.|--.- .+..+| .|+-|...-.+ .|.++ T Consensus 72 Nyk~vL~p--ll~~~~g-qgf~vnLSvd~~s~Dlmr~-crk~~vLYidTvVEpW~gfyfDa~adn~art-nyaLRet~lr 146 (481) T COG5310 72 NYKDVLKP--LLKGVGG-QGFCVNLSVDTSSLDLMRL-CRKHGVLYIDTVVEPWLGFYFDAQADNAART-NYALRETVLR 146 (481) T ss_pred HHHHHHHH--HHHCCCC-CEEEEEEEECCCHHHHHHH-HHHCCEEEEEEEECCCCCCCHHHHHHHHHHH-HHHHHHHHHH T ss_conf 69999999--8626788-5379996733661689999-9876858984200135563210222335544-3799999999 Q ss_pred -----CCCCC--------------CCHHHHHHHHHHHHHHCCEEEEECCCCCCCHHHHHHHHHHCCCCE Q ss_conf -----46865--------------687899999998765448699776763104237998864238836 Q gi|254780670|r 256 -----GENLE--------------LQPNEMASYLANLVDQYPIYSIEDGMSEDDWHGWKILTNKIGSNC 305 (424) Q Consensus 256 -----~~~~~--------------~t~~elid~y~~l~~~yPI~sIEDP~~e~D~~gw~~l~~~~g~~~ 305 (424) -.+.. +-...|+++-.+++-+| |.| .++|.+||++|.+++|-|. T Consensus 147 Ek~r~pgg~TaVs~cGANPGmvswFVKqaLvdlAad~~ld~-----~ep-~~ddr~gwAkLmkK~GVkg 209 (481) T COG5310 147 EKRRNPGGPTAVSTCGANPGMVSWFVKQALVDLAADLGLDF-----EEP-AQDDREGWAKLMKKAGVKG 209 (481) T ss_pred HHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHCCCC-----CCC-CCHHHHHHHHHHHHCCCCE T ss_conf 86049899825653589932889999999998898747676-----679-6011678999999719866 No 219 >PRK11183 D-lactate dehydrogenase; Provisional Probab=27.86 E-value=41 Score=13.92 Aligned_cols=18 Identities=22% Similarity=0.281 Sum_probs=9.9 Q ss_pred CCCCCCHHHHHCHHHHHH Q ss_conf 531110234311046532 Q gi|254780670|r 229 YNAGKDLLIALDCAASAF 246 (424) Q Consensus 229 y~~g~di~i~lD~AAsef 246 (424) -+-|+|.++.|+...+.+ T Consensus 313 ~kYGKDtfl~I~~lGT~~ 330 (565) T PRK11183 313 EKYGKDTFLMIDKLGTDK 330 (565) T ss_pred HHHCCHHHHHHHHHCCHH T ss_conf 885401757898758127 No 220 >PRK08462 biotin carboxylase; Validated Probab=27.84 E-value=41 Score=13.92 Aligned_cols=58 Identities=12% Similarity=0.291 Sum_probs=27.5 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCC---------CCCCCHHHHHHHHHHHHHHCCE Q ss_conf 99999999753853111023431104653202785111146---------8656878999999987654486 Q gi|254780670|r 217 LDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGE---------NLELQPNEMASYLANLVDQYPI 279 (424) Q Consensus 217 L~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~---------~~~~t~~elid~y~~l~~~yPI 279 (424) .+.-....+.+||...-.|-+.+|.-..+| |.++.- .-..|.=+|+..-..+..-+|+ T Consensus 258 ~~~A~~la~~vgY~gaGTvEFl~d~~~~~f-----yFlE~NtRlQVEH~VTE~vtGiDLV~~Qi~iA~G~~l 324 (446) T PRK08462 258 LETAIKAAKAIGYVGAGTFEFLLDSNMKDF-----YFMEMNTRLQVEHTVSEMVSGLDLIEWMIKIAEGEEL 324 (446) T ss_pred HHHHHHHHHHCCCCCCCEEEEEECCCCCEE-----EEEEEECCCCEECCCCCCCCCCCHHHHHHHHHCCCCC T ss_conf 889999998648365433899980799718-----9997433421001111221487799999998679998 No 221 >cd06246 M14_CPB2 Peptidase M14 Carboxypeptidase (CP) B2 (CPB2, also known as plasma carboxypeptidase B, carboxypeptidase U, and CPU), belongs to the carboxpeptidase A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPB2 enzyme displays B-like activity; it only cleaves the basic residues lysine or arginine. It is produced and secreted by the liver as the inactive precursor, procarboxypeptidase U or PCPB2, commonly referred to as thrombin-activatable fibrinolysis inhibitor (TAFI). It circulates in plasma as a zymogen bound to plasminogen, and the active enzyme, TAFIa, inhibits fibrinolysis. It is highly regulated, increased TAFI concentrations are thought to increase the risk of thrombosis and coronary artery disease by reducing fibrinolytic activity whil Probab=27.61 E-value=41 Score=13.89 Aligned_cols=35 Identities=20% Similarity=0.395 Sum_probs=17.7 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH------HHHCCC Q ss_conf 4899999998756789759885578988864------653036 Q gi|254780670|r 61 GVLKAIAFVNDEIRTALLGCDARDQLLIDKI------MIDLDG 97 (424) Q Consensus 61 gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~------Li~lDg 97 (424) +...++..++..+ ..-+.|+.-+.-+|+. ++.-|| T Consensus 69 s~~~~l~~i~~Ll--~~y~~d~~~~~ll~~~~~~ivP~~NPDG 109 (300) T cd06246 69 SPAFCLWFVGHAT--QFYGIDGQMTNLLRHMDFYIMPVMNVDG 109 (300) T ss_pred HHHHHHHHHHHHH--HHCCCCHHHHHHHHHCEEEEEEEECCCH T ss_conf 7999999999999--8416788999999708299995106964 No 222 >cd00008 53EXOc 5'-3' exonuclease; T5 type 5'-3' exonuclease domains may co-occur with DNA polymerase I (Pol I) domains, or be part of Pol I containing complexes. They digest dsDNA and ssDNA, releasing mono-,di- and tri-nucleotides, as well as oligonucleotides, and have also been reported to possess RNase H activity. Also called 5' nuclease family, involved in structure-specific cleavage of flaps formed by Pol I activity (similar to mammalian flap endonuclease I, FEN-1). A single nucleic acid strand may be threaded through the 5' nuclease enzyme before cleavage occurs. The domain binds two divalent metal ions which are necessary for activity. Probab=27.41 E-value=41 Score=13.87 Aligned_cols=93 Identities=23% Similarity=0.338 Sum_probs=57.6 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCE-EECCCCCCCCHHHH---HHHHHHHHHHCCEEEEECCCCC- Q ss_conf 899999999997538531110234311046532027851-11146865687899---9999987654486997767631- Q gi|254780670|r 214 DSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKY-ILKGENLELQPNEM---ASYLANLVDQYPIYSIEDGMSE- 288 (424) Q Consensus 214 eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY-~~~~~~~~~t~~el---id~y~~l~~~yPI~sIEDP~~e- 288 (424) .-.+..|...++. +.| ..+.++.|.....| ..+-| .|+ .++...++++ ++...++++..+|..++-|-.| T Consensus 35 ~gf~~~l~~l~~~--~~p-~~i~v~fD~~~~t~-R~~l~p~YK-anR~~~p~~l~~q~~~~~~~l~~~gi~~~~~~~~EA 109 (240) T cd00008 35 YGFLNMLLKLIKE--YKP-TYVAVVFDAGGKTF-RHELYPEYK-ANRKKMPEELREQIPLIKELLEALGIPVLEIEGYEA 109 (240) T ss_pred HHHHHHHHHHHHH--CCC-CEEEEEECCCCCCC-HHHHHHHHH-HCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCH T ss_conf 9999999999986--599-88999972899862-387728877-378999589999999999999977997897468438 Q ss_pred CCHHH-HHHHHHHCCCCEEEECCC Q ss_conf 04237-998864238836850430 Q gi|254780670|r 289 DDWHG-WKILTNKIGSNCQLVGDD 311 (424) Q Consensus 289 ~D~~g-w~~l~~~~g~~~~IvGDD 311 (424) ||.-+ +.+.....+.++.||.-| T Consensus 110 DD~ia~la~~~~~~~~~v~IvS~D 133 (240) T cd00008 110 DDVIGTLAKKAEAEGYKVVIVSGD 133 (240) T ss_pred HHHHHHHHHHHHHCCCEEEEECCC T ss_conf 999999999998669706996288 No 223 >cd05025 S-100A1 S-100A1: S-100A1 domain found in proteins similar to S100A1. S100A1 is a calcium-binding protein belonging to a large S100 vertebrate-specific protein family within the EF-hand superfamily of calcium-binding proteins. Note that the S-100 hierarchy, to which this S-100A1 group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately. As is the case with many other members of S100 protein family, S100A1 is implicated in intracellular and extracellular regulatory activities, including interaction with myosin-associated twitchin kinase, actin-capping protein CapZ, sinapsin I, and tubulin. Structural data suggests that S100A1 proteins exist within cells as antiparallel homodimers, while heterodimers with S100A4 and S100B also has been reported. Upon binding calcium S100A1 changes conformation to expose a hydrophobic cleft which is the interaction site of S100A1 with its more that 20 known target proteins. Probab=27.33 E-value=42 Score=13.86 Aligned_cols=48 Identities=27% Similarity=0.358 Sum_probs=31.0 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCC Q ss_conf 6268986516804899999998756789759885578988864653036643 Q gi|254780670|r 49 ELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMIDLDGTPN 100 (424) Q Consensus 49 elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~lDgT~n 100 (424) .-+||++....|.=.+ +.++..+..-| .++.|+..+|++|.+||.-.+ T Consensus 19 a~keGd~~~LsK~Elk--~Ll~~El~~~l--~~~kd~~~vd~~m~~LD~n~D 66 (92) T cd05025 19 SGKEGDKYKLSKKELK--DLLQTELSDFL--DAQKDADAVDKIMKELDENGD 66 (92) T ss_pred CCCCCCCCCCCHHHHH--HHHHHHHHHHH--CCCCCHHHHHHHHHHHCCCCC T ss_conf 4558985648799999--99999989988--377888999999998658999 No 224 >PRK11320 prpB 2-methylisocitrate lyase; Provisional Probab=26.99 E-value=42 Score=13.82 Aligned_cols=13 Identities=46% Similarity=0.603 Sum_probs=5.1 Q ss_pred CCCC-CEEECCCCC Q ss_conf 7543-000003664 Q gi|254780670|r 140 LPVP-LMNILNGGI 152 (424) Q Consensus 140 lP~P-~~NiinGG~ 152 (424) +++| +.|+.-||+ T Consensus 201 ~~~Pll~n~~e~g~ 214 (292) T PRK11320 201 VKVPILANITEFGA 214 (292) T ss_pred CCCCEEEEECCCCC T ss_conf 79998998489999 No 225 >PRK07683 aminotransferase A; Validated Probab=26.77 E-value=43 Score=13.79 Aligned_cols=31 Identities=19% Similarity=0.392 Sum_probs=22.1 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 68656878999999987654486997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) .|..++.+++-+. .++|.+|.++-|+|=.+. T Consensus 174 tG~v~s~e~l~~i-~~la~~~~i~ii~DEiY~ 204 (387) T PRK07683 174 TGVTLSKEELKDI-ANVLKDKNIFVLSDEIYS 204 (387) T ss_pred CCCCCCHHHHHHH-HHHHHCCCCEEECCCCHH T ss_conf 9788789999999-997613694475465135 No 226 >cd07101 ALDH_SSADH2_GabD2 Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 2-like. Succinate-semialdehyde dehydrogenase 2 (SSADH2) and similar proteins are in this CD. SSADH1 (GabD1, EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation of succinate semialdehyde to succinate. SSADH activity in Mycobacterium tuberculosis is encoded by both gabD1 (Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1 was shown to be much higher than that of GabD2, and GabD2 (SSADH2) is likely to serve physiologically as a dehydrogenase for a different aldehyde(s). Probab=26.66 E-value=43 Score=13.78 Aligned_cols=125 Identities=15% Similarity=0.154 Sum_probs=58.1 Q ss_pred CCCCCCCCEEECCCCCCCCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 56475430000036645567766102565148877489998999999999998752000013676557644886846899 Q gi|254780670|r 137 AHILPVPLMNILNGGIHADNALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSA 216 (424) Q Consensus 137 ~~~lP~P~~NiinGG~HA~~~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eea 216 (424) ...+|--.+|++.|... ++.+.|+-.+...+|.-+...+-.|.+...+.++.-. -.-||-.|.+=..+-- T Consensus 170 eaGlP~Gv~n~v~g~g~-----~~g~~l~~~vd~v~FTGS~~~G~~i~~~aa~~~~~~~-----lELGGk~p~IV~~dAD 239 (454) T cd07101 170 EAGLPRDLWQVVTGPGS-----EVGGAIVDNADYVMFTGSTATGRVVAERAGRRLIGCS-----LELGGKNPMIVLEDAD 239 (454) T ss_pred HHCCCCCEEEECCCCHH-----HHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHHCCEE-----EECCCCCHHHCCCCCC T ss_conf 80999563665368878-----8889998388889991884798876544787616204-----5338878023266348 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 999999997538531110234311046532027851111468656878999999987654486997767631 Q gi|254780670|r 217 LDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 217 L~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) |+...+.+-...|...-+.+++ . +..|--+. --+++++.+.+.+.++. +-||+++ T Consensus 240 l~~av~~~~~~~f~~~GQ~C~a----~-------~ri~V~~~---i~d~f~~~l~~~~~~l~---vG~p~d~ 294 (454) T cd07101 240 LDKAAAGAVRACFSNAGQLCVS----I-------ERIYVHES---VYDEFVRRFVARTRALR---LGAALDY 294 (454) T ss_pred HHHHHHHHHHHHHHHCCCCCCC----C-------CEEEEECC---HHHHHHHHHHHHHHHCC---CCCCCCC T ss_conf 9999986468888624886778----6-------28999612---28999999998775245---7897543 No 227 >TIGR03250 PhnAcAld_DH putative phosphonoacetaldehyde dehydrogenase. It seems reasonably certain then, that this enzyme catalyzes the NAD-dependent oxidation of phosphonoacetaldehyde to phosphonoacetate, bridging the metabolic gap between PhnW and PhnA. We propose the name phosphonoacetaldehyde dehydrogenase and the gene symbol PhnY for this enzyme. Probab=26.63 E-value=43 Score=13.77 Aligned_cols=120 Identities=18% Similarity=0.258 Sum_probs=52.1 Q ss_pred CCCCCCCEEECCCC-CCCCCCC----CCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCC Q ss_conf 64754300000366-4556776----610256514887748999899999999999875200001367655764488684 Q gi|254780670|r 138 HILPVPLMNILNGG-IHADNAL----DFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKT 212 (424) Q Consensus 138 ~~lP~P~~NiinGG-~HA~~~l----~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~ 212 (424) ..+|--.+|++.|+ .-.+..| +++-+ +|.-+.+.+-.|-.... + +.. .---||-.|.+=. T Consensus 192 aGlP~gv~nvv~G~g~~~g~~Lv~~~~v~~v--------~FTGS~~~G~~i~~~a~--~--~~~---~LELGGknp~IV~ 256 (472) T TIGR03250 192 AGLPPQMLQVVTGDPREIADELITNPHVDLV--------TFTGGVAIGKYIAARAG--Y--RRQ---VLELGGNDPLIVM 256 (472) T ss_pred HCCCCCEEECCCCCHHHHHHHHHCCCCCCEE--------EEECCCHHHHHHHHHHH--C--CEE---EECCCCCCCCCCC T ss_conf 3939671531678878999888508774679--------98688188888887764--0--403---4226776765468 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 68999999999975385311102343110465320278511114686568789999999876544869977676310 Q gi|254780670|r 213 ADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 213 ~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) ..--|+...+.+-...+...-+.+.+. ++.|--+ ---|++++-+...+.+++ +=||++++ T Consensus 257 ~dADld~a~~~~~~~~f~naGQ~C~a~-----------~rv~V~~---~i~d~f~~~l~~~~~~l~---~G~p~d~~ 316 (472) T TIGR03250 257 EDADLDRAADLAVKGSYKNSGQRCTAV-----------KRMLVQE---SVADRFTELLVEKTRAWR---YGDPMDPS 316 (472) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCCCCC-----------CEEEEEH---HHHHHHHHHHHHHHHCCC---CCCCCCCC T ss_conf 866899999865566631689968588-----------1799837---589999999997553165---67977677 No 228 >cd05026 S-100Z S-100Z: S-100Z domain found in proteins similar to S100Z. S100Z is a member of the S100 domain family within the EF-hand Ca2+-binding proteins superfamily. Note that the S-100 hierarchy, to which this S-100Z group belongs, contains only S-100 EF-hand domains, other EF-hands have been modeled separately.S100 proteins exhibit unique patterns of tissue- and cell type-specific expression and have been implicated in the Ca2+-dependent regulation of diverse physiological processes, including cell cycle regulation, differentiation, growth, and metabolic control. S100Z is normally expressed in various tissues, with its highest level of expression being in spleen and leukocytes. The function of S100Z remains unclear. Preliminary structural data suggests that S100Z is homodimer, however a heterodimer with S100P has been reported. S100Z is capable of binding calcium ions. When calcium binds to S110Z, the protein experiences a conformational change, which exposes hydrophobic surfac Probab=26.56 E-value=43 Score=13.76 Aligned_cols=47 Identities=32% Similarity=0.539 Sum_probs=29.2 Q ss_pred EECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCC Q ss_conf 626898651680489999999875678975988557898886465303664 Q gi|254780670|r 49 ELRDQEKRYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDKIMIDLDGTP 99 (424) Q Consensus 49 elrD~~~~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~Li~lDgT~ 99 (424) .-+||++...-|+=.+ +.++..+.+-| .++.|+..+|++|.+||.-. T Consensus 20 a~~~Gd~~tLsk~Elk--~Ll~~El~~fl--~~~~d~~~vd~~m~~LD~n~ 66 (93) T cd05026 20 SGKEGDRYKLSKGELK--ELLQRELTDFL--SSQKDPMLVDKIMNDLDSNK 66 (93) T ss_pred CCCCCCCCCCCHHHHH--HHHHHHHHHHH--CCCCCHHHHHHHHHHHCCCC T ss_conf 5768997748799999--99999988987--67998899999999864799 No 229 >pfam00465 Fe-ADH Iron-containing alcohol dehydrogenase. Probab=26.54 E-value=43 Score=13.76 Aligned_cols=36 Identities=28% Similarity=0.404 Sum_probs=21.1 Q ss_pred HHHHHHHHHHHH--CCCCHHHHCCCCCCCCCCCCCEEE Q ss_conf 448998752663--498488822532456475430000 Q gi|254780670|r 112 VSLAVSKAAAQT--SNLPLYKYLGGCSAHILPVPLMNI 147 (424) Q Consensus 112 vSlA~akA~A~~--~~~pLy~yl~~~~~~~lP~P~~Ni 147 (424) -+|-++|+.|.. ...+.|+|+.......-++|++.| T Consensus 90 S~iD~aK~ia~~~~~~~~~~~~~~~~~~~~~~~P~i~I 127 (312) T pfam00465 90 SVIDTAKAIALLLTNPGDVWDYLGGKKLTKPALPLIAI 127 (312) T ss_pred CCCHHHHHHHHHHCCCCCHHHHHCCCCCCCCCCCEEEE T ss_conf 61249999999971999889985576555678988997 No 230 >COG0439 AccC Biotin carboxylase [Lipid metabolism] Probab=26.49 E-value=43 Score=13.75 Aligned_cols=29 Identities=10% Similarity=0.123 Sum_probs=15.6 Q ss_pred CCCCCCCHHHH---HHHHHHHHHCCCCHHHHC Q ss_conf 23222202448---998752663498488822 Q gi|254780670|r 104 LGANAILGVSL---AVSKAAAQTSNLPLYKYL 132 (424) Q Consensus 104 lGaNailAvSl---A~akA~A~~~~~pLy~yl 132 (424) +|.|+-.--+| ..+|..++..++|+=.+- T Consensus 103 iGP~~~~i~~mgdK~~ar~~~~~aGVP~vpgs 134 (449) T COG0439 103 IGPSAEAIRRMGDKITARRLMAKAGVPVVPGS 134 (449) T ss_pred ECCCHHHHHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 08498999974458999999997499958997 No 231 >COG1503 eRF1 Peptide chain release factor 1 (eRF1) [Translation, ribosomal structure and biogenesis] Probab=26.40 E-value=43 Score=13.74 Aligned_cols=25 Identities=16% Similarity=0.315 Sum_probs=14.0 Q ss_pred EEEECCCCCCHHHHHHHHHHHHHHH Q ss_conf 5651488774899989999999999 Q gi|254780670|r 163 FMIMPVGAENIREAIRMGAEVFHTL 187 (424) Q Consensus 163 fmIiP~g~~sf~eal~~~~ev~~~l 187 (424) |-+.|.++.-+.|++..+.++-... T Consensus 249 vDv~y~~esg~~eli~~A~d~L~~~ 273 (411) T COG1503 249 VDVSYTGESGLRELIEKAEDALKDV 273 (411) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 2336333202799999749998766 No 232 >cd06305 PBP1_methylthioribose_binding_like Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. The sugar-binding domain of the periplasmic proteins in this group is also homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR), DNA-binding transcriptional repressors such as LacI and GalR. Probab=26.33 E-value=43 Score=13.73 Aligned_cols=77 Identities=19% Similarity=0.300 Sum_probs=38.8 Q ss_pred CHHHHHHHHHHHHHHCC---EEEEECCCCCCCH--HHHHHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEEEE Q ss_conf 87899999998765448---6997767631042--37998864238--83685043000021488852100386522342 Q gi|254780670|r 262 QPNEMASYLANLVDQYP---IYSIEDGMSEDDW--HGWKILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAILI 334 (424) Q Consensus 262 t~~elid~y~~l~~~yP---I~sIEDP~~e~D~--~gw~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~Naili 334 (424) +.++-......++.+|| +..| |.-+|. -|.-.-.+..| .++.|+|=|......+.++.+ +....-.+.- T Consensus 164 ~~~~~~~~~~~~L~~~pd~~~~ai---~~~nd~~a~Ga~~Al~~~G~~~~v~v~G~D~~~~~~~~i~~~-~g~~~atv~Q 239 (273) T cd06305 164 TAQDAAAQVEAVLKKYPKGGIDAI---WAAWDEFAKGAKQALDEAGRTDEIKIYGVDISPEDLQLMREP-DSPWVATAAQ 239 (273) T ss_pred HHHHHHHHHHHHHHHCCCCCEEEE---EECCCHHHHHHHHHHHHCCCCCCCEEEEECCCHHHHHHHHCC-CCCEEEEEEC T ss_conf 399999999999973888762799---935857899999999976999994899978998999999727-9955999947 Q ss_pred CCCCHHHH Q ss_conf 54530038 Q gi|254780670|r 335 KPNQIGSV 342 (424) Q Consensus 335 K~NQiGTv 342 (424) .|-.+|-+ T Consensus 240 ~p~~~G~~ 247 (273) T cd06305 240 DPALIGKV 247 (273) T ss_pred CHHHHHHH T ss_conf 99999999 No 233 >COG0630 VirB11 Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis [Cell motility and secretion / Intracellular trafficking and secretion] Probab=26.30 E-value=43 Score=13.73 Aligned_cols=26 Identities=23% Similarity=0.288 Sum_probs=11.1 Q ss_pred HHCCEEEEECCCCCCC-HHHHHHHHHH Q ss_conf 5448699776763104-2379988642 Q gi|254780670|r 275 DQYPIYSIEDGMSEDD-WHGWKILTNK 300 (424) Q Consensus 275 ~~yPI~sIEDP~~e~D-~~gw~~l~~~ 300 (424) ..+.|++|||.-...- .+.|..+..+ T Consensus 169 ~~~rivtIEdt~E~~~~~~n~~~l~~r 195 (312) T COG0630 169 PEERIVTIEDTPELKLPHENWVQLVTR 195 (312) T ss_pred CCCCEEEECCCCEECCCCCCEEEEEEC T ss_conf 522189952555414788765899832 No 234 >COG5054 ERV1 Mitochondrial sulfhydryl oxidase involved in the biogenesis of cytosolic Fe/S proteins [Posttranslational modification, protein turnover, chaperones] Probab=26.12 E-value=39 Score=14.03 Aligned_cols=42 Identities=17% Similarity=0.353 Sum_probs=31.0 Q ss_pred CCCEEEE-EECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 6610256-51488774899989999999999987520000136 Q gi|254780670|r 158 LDFQEFM-IMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTN 199 (424) Q Consensus 158 l~iQEfm-IiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~ 199 (424) -+||..| .-|.-..|=+++-.|++|||+.+.+.|.+-.+... T Consensus 127 ~~f~K~l~~~ppqv~SRea~~~W~CevHN~VNekL~Kp~~dC~ 169 (181) T COG5054 127 KHFQKLLDVYPPQVSSREAATTWACEVHNKVNEKLGKPKFDCD 169 (181) T ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 9999998618997541889999999987899998389875531 No 235 >TIGR01310 L7 60S ribosomal protein L7; InterPro: IPR005998 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Eukaryotic ribosomal protein, L7, contains an N-terminal bZIP DNA binding domain and a second, DNA-binding domain has been mapped to the 50 C-terminal amino acids of the protein . In addition to its role in translation, L7 has also been shown to be involved in nuclear-receptor mediated transcriptional control. There is no bacterial homologue of this protein. ; GO: 0003735 structural constituent of ribosome, 0030528 transcription regulator activity, 0006412 translation, 0015934 large ribosomal subunit. Probab=26.04 E-value=35 Score=14.41 Aligned_cols=39 Identities=26% Similarity=0.416 Sum_probs=27.3 Q ss_pred CCCCCEEECCCC----CCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 027851111468----656878999999987654486997767631 Q gi|254780670|r 247 FKKEKYILKGEN----LELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 247 y~~~kY~~~~~~----~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) |+.|=-.+++.. .-+|.+.|||- =.-+|.|++|||=.+| T Consensus 147 yKrGf~ki~g~r~Y~~~pl~dN~~iE~---~LGk~giic~EDlihE 189 (248) T TIGR01310 147 YKRGFAKIDGQRRYAKVPLTDNTIIEQ---ALGKYGIICIEDLIHE 189 (248) T ss_pred CCCCEEEECCCCCCCCCCCCCCHHHHH---HCCCCCCEEHHHHHHH T ss_conf 013147778901202345587448998---6385772121315753 No 236 >PRK09147 aminotransferase; Provisional Probab=25.94 E-value=44 Score=13.69 Aligned_cols=30 Identities=17% Similarity=0.154 Sum_probs=22.3 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-.+ .++|.+|.++-|+|=.+ T Consensus 179 TG~v~s~e~l~~l-~~la~~~~i~iisDEiY 208 (397) T PRK09147 179 TGAVLPLDDWKKL-FALSDRYGFVIASDECY 208 (397) T ss_pred CCCCCCHHHHHHH-HHHHCCCEEEEEECCCC T ss_conf 8998888999999-99632472899982686 No 237 >PRK12795 fliM flagellar motor switch protein FliM; Reviewed Probab=25.93 E-value=33 Score=14.57 Aligned_cols=70 Identities=20% Similarity=0.270 Sum_probs=35.5 Q ss_pred CCHHHHHHHHHHHH-HCCCCCCCCCCCCCCCCH---HHHH-------HHHHHHHHCCCCHHHHCCC-------------- Q ss_conf 88557898886465-303664322123222202---4489-------9875266349848882253-------------- Q gi|254780670|r 80 CDARDQLLIDKIMI-DLDGTPNKSRLGANAILG---VSLA-------VSKAAAQTSNLPLYKYLGG-------------- 134 (424) Q Consensus 80 ~~~~dQ~~iD~~Li-~lDgT~nks~lGaNailA---vSlA-------~akA~A~~~~~pLy~yl~~-------------- 134 (424) -.+.+|.+||.+|- +..+.....+-|..+++- ||.. +-.-.|......|+.++.. T Consensus 45 ~~~l~q~eid~~l~~~~~~~~~~~~~~~~~~~~~~~vs~eRmP~LeiI~dRfaR~~~~sL~n~~r~~veVsl~~V~~~kf 124 (378) T PRK12795 45 ERVLNQEEIDSLLGFSLGELSADDESGIRAIIDSAMVSYERLPMLEIVFDRLVRLLTTSLRNFTSDNVEVSLDRITSVRF 124 (378) T ss_pred CCCCCHHHHHHHHCCCCCCCCCCCCHHHHHHHHHCEECHHHCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCEEEEEH T ss_conf 50058889777628777788865411499987520210864725889999999999999988625771898530568899 Q ss_pred -CCCCCCCCC-CEEECC Q ss_conf -245647543-000003 Q gi|254780670|r 135 -CSAHILPVP-LMNILN 149 (424) Q Consensus 135 -~~~~~lP~P-~~Niin 149 (424) ..-.+||+| ++||+. T Consensus 125 gefl~sLp~PT~lnvf~ 141 (378) T PRK12795 125 GDYLNSIPLPALLAVFK 141 (378) T ss_pred HHHHHCCCCCCEEEEEE T ss_conf 89875388874799987 No 238 >PRK07682 hypothetical protein; Validated Probab=25.68 E-value=44 Score=13.65 Aligned_cols=30 Identities=17% Similarity=0.270 Sum_probs=20.5 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+|+-.+ .+||.+|.+.-|+|=.+ T Consensus 167 tG~v~s~~el~~l-~~la~~~~i~iisDEiY 196 (378) T PRK07682 167 TGAVLNKSELEEI-AVIVEKHNLIVLSDEIY 196 (378) T ss_pred CCCCCCHHHHHHH-HHHHHHCCEEEECCCHH T ss_conf 8888899999999-99764459679522014 No 239 >cd07149 ALDH_y4uC Uncharacterized ALDH (y4uC) with similarity to Tortula ruralis aldehyde dehydrogenase ALDH21A1. Uncharacterized aldehyde dehydrogenase (ORF name y4uC) with sequence similarity to the moss Tortula ruralis aldehyde dehydrogenase ALDH21A1 (RNP123) believed to play an important role in the detoxification of aldehydes generated in response to desiccation- and salinity-stress, and similar sequences are included in this CD. Probab=25.68 E-value=44 Score=13.65 Aligned_cols=15 Identities=27% Similarity=0.390 Sum_probs=10.3 Q ss_pred CCCCCCEEECCCCCC Q ss_conf 475430000036645 Q gi|254780670|r 139 ILPVPLMNILNGGIH 153 (424) Q Consensus 139 ~lP~P~~NiinGG~H 153 (424) .+|-=.+|++.|+.+ T Consensus 177 GlP~gv~n~v~g~~~ 191 (453) T cd07149 177 GLPKGALNVVTGSGE 191 (453) T ss_pred CCCCCCEEEECCCHH T ss_conf 978355256038678 No 240 >cd07088 ALDH_LactADH-AldA Escherichia coli lactaldehyde dehydrogenase AldA-like. Lactaldehyde dehydrogenase from Escherichia coli (AldA, LactADH, EC=1.2.1.22), an NAD(+)-dependent enzyme involved in the metabolism of L-fucose and L-rhamnose, and other similar sequences are present in this CD. Probab=25.57 E-value=45 Score=13.64 Aligned_cols=125 Identities=18% Similarity=0.257 Sum_probs=54.2 Q ss_pred CCCCCCCEEECCCC-CCCCCCCCCEEEEEEC-CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 64754300000366-4556776610256514-887748999899999999999875200001367655764488684689 Q gi|254780670|r 138 HILPVPLMNILNGG-IHADNALDFQEFMIMP-VGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADS 215 (424) Q Consensus 138 ~~lP~P~~NiinGG-~HA~~~l~iQEfmIiP-~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~ee 215 (424) ..+|--.+|++.|. ...+..|- --| +...+|.-+.+.+-.|.+...+.++.- +-.-||=.|.+=.... T Consensus 186 aGlP~Gv~n~v~g~g~~~g~~L~-----~~p~v~~v~FTGS~~~G~~i~~~aa~~~~~~-----~lELGGknp~iV~~dA 255 (468) T cd07088 186 AGLPAGVLNIVTGRGSVVGDALV-----AHPKVGMISLTGSTEAGQKIMEAAAENITKV-----SLELGGKAPAIVMKDA 255 (468) T ss_pred CCCCCCEEEECCCCCCCHHHHHH-----HCCCCCEEEEECCCHHHHHHHHHHHHHCCEE-----EECCCCCCCCCCCCCC T ss_conf 09992558752688741166897-----4989766996688068767988766535343-----3125677865467743 Q ss_pred HHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 99999999975385311102343110465320278511114686568789999999876544869977676310 Q gi|254780670|r 216 ALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 216 aL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) -|+...+++-...+...-+.+.+ .++.|--+ --.|++++.+..-++++. +=||++++ T Consensus 256 Dl~~A~~~~~~~~f~~aGQ~C~a-----------~~rv~V~~---~i~d~f~~~l~~~~~~l~---~G~p~~~~ 312 (468) T cd07088 256 DLDLAVKAIVDSRIINCGQVCTC-----------AERVYVHE---DIYDEFMEKLVEKMKAVK---VGDPFDAA 312 (468) T ss_pred CHHHHHHHHHHHHHHHCCCCCCC-----------CCEEEEEC---CHHHHHHHHHHHHHHHCC---CCCHHHCC T ss_conf 79999997799997523998838-----------84899971---537999999999986176---77957537 No 241 >TIGR02881 spore_V_K stage V sporulation protein K; InterPro: IPR014232 Proteins in this entry include the stage V sporulation protein K (SpoVK), a close homologue of the Rubisco expression protein CbbX (IPR000470 from INTERPRO), and are members of an ATPase family associated with various cellular activities. These proteins are strictly limited to bacterial endospore-forming species, but are not found universally among members of this group; they are missing from the Clostridium species.. Probab=25.55 E-value=45 Score=13.64 Aligned_cols=72 Identities=21% Similarity=0.414 Sum_probs=47.2 Q ss_pred CCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH---------CCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHH Q ss_conf 00136765576448868468999999999975---------385311102343110465320278511114686568789 Q gi|254780670|r 195 GYSTNVGDEGGFSPNLKTADSALDLIANSIEK---------AGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNE 265 (424) Q Consensus 195 g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~---------aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~e 265 (424) .+|-+.|=|-=| =.||+|.|+++.|. |||.---|.++.+|+-= +.+.-+..+.+.+|.+| T Consensus 114 AYSLaRGGEKDF------GKEAIDtLVK~mEd~~~~lvlILAGY~~EM~yFL~~NPGL-----~SRFPi~i~FPdY~~ee 182 (261) T TIGR02881 114 AYSLARGGEKDF------GKEAIDTLVKAMEDQRNELVLILAGYSDEMDYFLSLNPGL-----RSRFPISIDFPDYTVEE 182 (261) T ss_pred HHHHHCCCCCCC------CCHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHCCCCC-----CCCCCCEEECCCCCHHH T ss_conf 777614888876------6208889999876156986899708768999986207797-----77665054188998889 Q ss_pred HHHHHHHHHHHC Q ss_conf 999999876544 Q gi|254780670|r 266 MASYLANLVDQY 277 (424) Q Consensus 266 lid~y~~l~~~y 277 (424) |++....++++- T Consensus 183 L~~Ia~~m~~~R 194 (261) T TIGR02881 183 LMEIAERMVKER 194 (261) T ss_pred HHHHHHHHHHHH T ss_conf 999999998646 No 242 >PRK05942 aspartate aminotransferase; Provisional Probab=25.54 E-value=45 Score=13.64 Aligned_cols=32 Identities=9% Similarity=0.116 Sum_probs=21.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 686568789999999876544869977676310 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) .|...+.+++-++ .++|.+|.++-|+|=.+.+ T Consensus 183 TG~v~~~e~l~~i-~~~a~~~~i~vIsDEiY~~ 214 (394) T PRK05942 183 TTATAPREFFEEI-VAFARKYEILLVHDLCYAE 214 (394) T ss_pred CCHHCCHHHHHHH-HHHHHCCCCEEEECCCCCC T ss_conf 0122019999999-9988517867970374100 No 243 >pfam10231 DUF2315 Uncharacterized conserved protein (DUF2315). This is a family of small conserved proteins found from worms to humans. The function is not known. Probab=25.25 E-value=45 Score=13.60 Aligned_cols=19 Identities=21% Similarity=0.515 Sum_probs=15.9 Q ss_pred CCCCCHHHHHHHHHHHHHH Q ss_conf 8656878999999987654 Q gi|254780670|r 258 NLELQPNEMASYLANLVDQ 276 (424) Q Consensus 258 ~~~~t~~elid~y~~l~~~ 276 (424) ...++.+||.+||.++.++ T Consensus 74 ~~~~~a~eMseFYK~FLd~ 92 (127) T pfam10231 74 TEEVSAEEMSEFYKEFLDK 92 (127) T ss_pred CCCCCHHHHHHHHHHHHHH T ss_conf 4566889999999999998 No 244 >PRK07212 consensus Probab=25.09 E-value=45 Score=13.58 Aligned_cols=30 Identities=13% Similarity=0.241 Sum_probs=18.9 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCC Q ss_conf 6865687899999998765448699776763 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMS 287 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~ 287 (424) .|..++.+++-++ .++|.+|.++-|+|=.+ T Consensus 175 tG~v~s~~~l~~l-~~la~~~~~~ii~DEiY 204 (378) T PRK07212 175 TGAVFGKESLQGV-ADLAVDHDLYVLSDEIY 204 (378) T ss_pred CCCCCCCHHHHHH-HHHHHHHCCEECCCCCH T ss_conf 8988471768899-99777424501330011 No 245 >KOG3248 consensus Probab=24.96 E-value=46 Score=13.56 Aligned_cols=28 Identities=29% Similarity=0.428 Sum_probs=17.8 Q ss_pred CCCHHHHHHHHHHH-------HHHCCEEEEECCCC Q ss_conf 56878999999987-------65448699776763 Q gi|254780670|r 260 ELQPNEMASYLANL-------VDQYPIYSIEDGMS 287 (424) Q Consensus 260 ~~t~~elid~y~~l-------~~~yPI~sIEDP~~ 287 (424) .++.+|+..||+-- ...||=.|--|-+. T Consensus 232 ~LSrEEQAKYyElArKerqlH~qlYP~WSARdNYg 266 (421) T KOG3248 232 ALSREEQAKYYELARKERQLHMQLYPGWSARDNYG 266 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 53588888999998888887777567852200010 No 246 >COG2089 SpsE Sialic acid synthase [Cell envelope biogenesis, outer membrane] Probab=24.76 E-value=46 Score=13.53 Aligned_cols=13 Identities=23% Similarity=0.603 Sum_probs=6.2 Q ss_pred CCHHHHHHHHHHH Q ss_conf 6878999999987 Q gi|254780670|r 261 LQPNEMASYLANL 273 (424) Q Consensus 261 ~t~~elid~y~~l 273 (424) ++++++-++...+ T Consensus 251 ldP~efk~mv~~i 263 (347) T COG2089 251 LDPDEFKEMVDAI 263 (347) T ss_pred CCHHHHHHHHHHH T ss_conf 3889999999999 No 247 >cd06307 PBP1_uncharacterized_sugar_binding Periplasmic sugar-binding domain of uncharacterized transport systems. Periplasmic sugar-binding domain of uncharacterized transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. The members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes. Probab=24.68 E-value=46 Score=13.52 Aligned_cols=73 Identities=18% Similarity=0.273 Sum_probs=43.8 Q ss_pred CHHHHHHHHHHHHHHCC-EEEEECCCCCCC-HHHHHHHHHHCC--CCEEEECCCEEECCHHHHHHHHHCCCCCEE-EECC Q ss_conf 87899999998765448-699776763104-237998864238--836850430000214888521003865223-4254 Q gi|254780670|r 262 QPNEMASYLANLVDQYP-IYSIEDGMSEDD-WHGWKILTNKIG--SNCQLVGDDLFVTNPERLHKGINEEVANAI-LIKP 336 (424) Q Consensus 262 t~~elid~y~~l~~~yP-I~sIEDP~~e~D-~~gw~~l~~~~g--~~~~IvGDDL~vTn~~rl~~gi~~~~~Nai-liK~ 336 (424) +.++=.+...+|+..|| |..|= .-+| -.|...-.++.| .++.+||-|+...+.+.|+.|. ..++ --.| T Consensus 169 ~~~~a~~~~~~lL~~~Pdi~ai~---~~~~~~~ga~~Al~~~g~~~~v~vvg~D~~~~~~~~l~~G~----i~a~v~Q~P 241 (275) T cd06307 169 DPARAYEATRKLLARHPDLVGIY---NAGGGNRGVIRALREAGRAGKVVFVGHELTPETRAALRDGT----IDAVIDQDP 241 (275) T ss_pred CHHHHHHHHHHHHHHCCCCEEEE---ECCCCHHHHHHHHHHCCCCCCEEEEECCCCHHHHHHHHCCC----CEEEEECCH T ss_conf 65999999999997499962999---84871699999999739999869996279989999998198----379993599 Q ss_pred CCHHH Q ss_conf 53003 Q gi|254780670|r 337 NQIGS 341 (424) Q Consensus 337 NQiGT 341 (424) -+.|- T Consensus 242 ~~~G~ 246 (275) T cd06307 242 GHLAR 246 (275) T ss_pred HHHHH T ss_conf 99999 No 248 >TIGR01075 uvrD DNA helicase II; InterPro: IPR005753 The SOS response is required for efficient nucleotide excision repair (NER) of the major ultraviolet light (UV) induced DNA lesions. The expression of three NER genes, uvrA, uvrB, and uvrD, is upregulated as part of the SOS response. UvrD differs from the others in that it is not involved in lesion recognition but rather in promoting the post-incision steps of NER, including turnover of the UvrBC incision complex . The form of the UvrD helicase with optimal helicase activity is oligomeric with at least two sites for binding the DNA substrate, where one site contacts regions of the 3'-ssDNA tail that are distal from the single-stranded/double-stranded DNA junction.; GO: 0004003 ATP-dependent DNA helicase activity, 0006268 DNA unwinding during replication, 0005737 cytoplasm. Probab=24.56 E-value=46 Score=13.51 Aligned_cols=158 Identities=20% Similarity=0.267 Sum_probs=80.8 Q ss_pred EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCC-------------CCCCCCCCCCCCCCCCHH---------HHHH Q ss_conf 025651488774899989999999999987520000-------------136765576448868468---------9999 Q gi|254780670|r 161 QEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGY-------------STNVGDEGGFSPNLKTAD---------SALD 218 (424) Q Consensus 161 QEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~-------------~t~vgDEGGfap~~~~~e---------eaL~ 218 (424) |+|-|+ .| .|.++. +|++|+.-++ --+--|||==+..+.... ..-. T Consensus 108 ~~FQil----Ds-~DQlrL-------~KRlik~~n~~e~~~pPrq~~W~IN~qKdeGlRp~~i~~~~~d~~~~~~~~~Y~ 175 (741) T TIGR01075 108 EDFQIL----DS-DDQLRL-------IKRLIKSLNLDEKQFPPRQAMWYINNQKDEGLRPEHIEAFDNDPVERTWIKIYA 175 (741) T ss_pred CCCCEE----CC-HHHHHH-------HHHHHHHCCCCCCCCCCHHHHHCCCCCCCCCCCHHHCCCCCCCHHHHHHHHHHH T ss_conf 124236----40-657999-------999998638763336713442010366567888323577677768999999999 Q ss_pred HHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCCH--HHHHH Q ss_conf 9999997538531110234311046532027851111468656878999999987654486997767631042--37998 Q gi|254780670|r 219 LIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDDW--HGWKI 296 (424) Q Consensus 219 ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D~--~gw~~ 296 (424) .=+++|.++|+. |+.= |=.=|.|++.+... +..-|. ++|. --+.|=|.+.-- =.|-+ T Consensus 176 ~Yq~~C~RaGlV---DFAE-LLLRa~ELl~~~~~-------------~l~hY~---~RF~-~ILvDEFQDTN~iQYaw~r 234 (741) T TIGR01075 176 EYQEACDRAGLV---DFAE-LLLRAYELLRNKPH-------------ILQHYQ---ERFK-HILVDEFQDTNKIQYAWIR 234 (741) T ss_pred HHHHHHCCCCCH---HHHH-HHHHHHHHHCCCHH-------------HHHHHH---HHHC-CCCCCCCCCCHHHHHHHHH T ss_conf 987530637734---5899-99999997528958-------------999998---7610-0543232450069999999 Q ss_pred HHHHCCCCEEEECCC---------EEECCHHHHHHHHHCCCCCEEEECCCCHH-HHHHHHHHHHHHHHC Q ss_conf 864238836850430---------00021488852100386522342545300-389999999999987 Q gi|254780670|r 297 LTNKIGSNCQLVGDD---------LFVTNPERLHKGINEEVANAILIKPNQIG-SVSETLNTIEKAQMA 355 (424) Q Consensus 297 l~~~~g~~~~IvGDD---------L~vTn~~rl~~gi~~~~~NailiK~NQiG-Tvset~ea~~~a~~~ 355 (424) |.+--..++.||||| --|-|..++-+-+... --||+-|-- +.+..|+|++..=.+ T Consensus 235 llAg~~~~V~~VGDDDQsIYgwRGA~veni~~fl~dF~~~----~tirLEQNYRS~~nIL~AAN~lIan 299 (741) T TIGR01075 235 LLAGETGNVMIVGDDDQSIYGWRGAQVENIQKFLKDFPIA----ETIRLEQNYRSTANILAAANALIAN 299 (741) T ss_pred HHCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHCCCC----CCEEECCCCCHHHHHHHHHHHHHHC T ss_conf 9608996379885587420124576418889999746788----8534200322178899998899852 No 249 >PHA01633 putative glycosyl transferase group 1 Probab=24.45 E-value=40 Score=14.01 Aligned_cols=16 Identities=25% Similarity=0.364 Sum_probs=10.9 Q ss_pred CCCHHHHHHHHHHHHH Q ss_conf 5611448999999999 Q gi|254780670|r 389 SLARSDRIAKYNQLIR 404 (424) Q Consensus 389 ~~~R~ER~aKyNrLlr 404 (424) +-.|-||..|.-+|-. T Consensus 307 ~qdreers~~l~elak 322 (335) T PHA01633 307 LQDREERSMKLKELAK 322 (335) T ss_pred HCCHHHHHHHHHHHHH T ss_conf 0357888899999998 No 250 >PRK10060 RNase II stability modulator; Provisional Probab=24.39 E-value=47 Score=13.49 Aligned_cols=123 Identities=11% Similarity=0.075 Sum_probs=56.3 Q ss_pred HHHHHHHHHHHHHHCCEE----EEE---CCCCCCCHHHHH-HHHHHCCCCEEEECCCEEE--CCHHHHHHH-HHCCCCCE Q ss_conf 789999999876544869----977---676310423799-8864238836850430000--214888521-00386522 Q gi|254780670|r 263 PNEMASYLANLVDQYPIY----SIE---DGMSEDDWHGWK-ILTNKIGSNCQLVGDDLFV--TNPERLHKG-INEEVANA 331 (424) Q Consensus 263 ~~elid~y~~l~~~yPI~----sIE---DP~~e~D~~gw~-~l~~~~g~~~~IvGDDL~v--Tn~~rl~~g-i~~~~~Na 331 (424) ..++++.+.+++++|++- -+| --+- +|.+.-. .+..--..-+.|.=||+=+ ++..+|++- ++.-.... T Consensus 506 ~~~f~~~l~~~L~~~~l~p~~L~lEITEs~l~-~d~~~~~~~l~~Lr~lGv~iALDDFGTGySSLsyL~~lPvd~lKIDr 584 (663) T PRK10060 506 DQTIFTALKQVLQELNFEYCPIDVELTESCLI-ENEELALSVIQQFSQLGAQVHLDDFGTGYSSLSQLARFPIDAIKLDQ 584 (663) T ss_pred CCHHHHHHHHHHHHHCCCCCEEEEEEECCCHH-CCHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHCCCCEEEECH T ss_conf 83599999999998299824699998186122-39999999999999789989998999973369998428999899998 Q ss_pred EEEC-CCCHHHHHH--HHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCC Q ss_conf 3425-453003899--99999999987986999507888723589989998199801007561 Q gi|254780670|r 332 ILIK-PNQIGSVSE--TLNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLA 391 (424) Q Consensus 332 iliK-~NQiGTvse--t~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~ 391 (424) -+|+ ..+- .-+. .--.+.+|+.-|+.+|.- ..|+.--.++--.+||.++.-=-.+ T Consensus 585 sFV~~i~~d-~~~~aIV~aIi~LA~~Lgl~vVAE----GVEt~eQ~~~L~~lGcd~~QGy~fs 642 (663) T PRK10060 585 SFVRDIHKQ-PVSQSLVRAIVAVAQALNLQVIAE----GVETAKEDAFLTKNGVNERQGFLFA 642 (663) T ss_pred HHHHCCCCC-CCHHHHHHHHHHHHHHCCCEEEEE----CCCHHHHHHHHHHCCCCEEECCCCC T ss_conf 997160489-227999999999999869989991----3864999999997599998267536 No 251 >TIGR03286 methan_mark_15 putative methanogenesis marker protein 15. Members of this protein family, to date, are found in a completed prokaryotic genome if and only if the species is one of the archaeal methanogens. The exact function is unknown, but likely is linked to methanogenesis or a process closely connected to it. Related proteins include the BadF/BadG/BcrA/BcrD ATPase family (pfam01869), which includes an activator for (R)-2-hydroxyglutaryl-CoA dehydratase. Probab=24.27 E-value=47 Score=13.47 Aligned_cols=28 Identities=29% Similarity=0.513 Sum_probs=25.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 7644886846899999999997538531 Q gi|254780670|r 204 GGFSPNLKTADSALDLIANSIEKAGYNA 231 (424) Q Consensus 204 GGfap~~~~~eeaL~ll~eAi~~aGy~~ 231 (424) -||.|.-+-.+.|.+.+.+|.+.+|++. T Consensus 169 ~gw~~t~~v~~sa~~~~~~al~~ag~~~ 196 (404) T TIGR03286 169 TGWVPTTKVIESAEEAVERALEEAGVSL 196 (404) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHCCCCH T ss_conf 5765554678889999999999739757 No 252 >cd07146 ALDH_PhpJ Streptomyces putative phosphonoformaldehyde dehydrogenase PhpJ-like. Putative phosphonoformaldehyde dehydrogenase (PhpJ), an aldehyde dehydrogenase homolog reportedly involved in the biosynthesis of phosphinothricin tripeptides in Streptomyces viridochromogenes DSM 40736, and similar sequences are included in this CD. Probab=24.22 E-value=36 Score=14.33 Aligned_cols=123 Identities=19% Similarity=0.215 Sum_probs=54.8 Q ss_pred CCCCCCCEEECCCC-CCCCCCCCC-EEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 64754300000366-455677661-0256514887748999899999999999875200001367655764488684689 Q gi|254780670|r 138 HILPVPLMNILNGG-IHADNALDF-QEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADS 215 (424) Q Consensus 138 ~~lP~P~~NiinGG-~HA~~~l~i-QEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~ee 215 (424) ..+|--.+|+|+|. ...+..|-- .++ ...+|.-+-+.+-.|..... ++ . -+-.-||-.|.+=...- T Consensus 173 aGlP~gv~~~v~g~~~~~g~~L~~~~~i-----~~i~FTGS~~~G~~i~~~a~--~k--~---~~lELGGknp~IV~~dA 240 (451) T cd07146 173 AGLPPDMLSVVTGEPGEIGDELITHPDV-----DLVTFTGGVAVGKAIAATAG--YK--R---QLLELGGNDPLIVMDDA 240 (451) T ss_pred HCCCCCEEEECCCCHHHHHHHHHCCCCC-----CEEEEECCCHHHHHHHHHHH--CC--E---EEEECCCCCCCCCCCCC T ss_conf 0959563775479768888898509885-----67999598278888887760--66--0---67533787775557766 Q ss_pred HHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 99999999975385311102343110465320278511114686568789999999876544869977676310 Q gi|254780670|r 216 ALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 216 aL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) -|+...+++-...|...-+.+.+ -++.|--+ .--+|+++-+...++++. +-||++++ T Consensus 241 dld~a~~~~~~~~f~~~GQ~C~a-----------~~ri~V~~---~i~d~f~~~l~~~~~~l~---~G~p~~~~ 297 (451) T cd07146 241 DLERAATLAVAGSYANSGQRCTA-----------VKRILVHE---SVADEFVDLLVEKSAALV---VGDPMDPA 297 (451) T ss_pred CHHHHHHHHHHHHHCCCCCCCCC-----------CCEEEECC---CHHHHHHHHHHHHHHHCC---CCCHHHCC T ss_conf 89999999889873105987424-----------85257700---069999999999886155---77813226 No 253 >COG3096 MukB Uncharacterized protein involved in chromosome partitioning [Cell division and chromosome partitioning] Probab=23.90 E-value=38 Score=14.19 Aligned_cols=21 Identities=10% Similarity=0.246 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHHHCCCCH Q ss_conf 899999998756789759885 Q gi|254780670|r 62 VLKAIAFVNDEIRTALLGCDA 82 (424) Q Consensus 62 V~~Av~~in~~i~~~L~g~~~ 82 (424) +..++..||+--...++|... T Consensus 86 CYa~LD~iNSR~QRv~vgvrL 106 (1480) T COG3096 86 CYSMLDTINSRHQRVVVGVRL 106 (1480) T ss_pred HHHHHHHHCCCCCEEEEEEHH T ss_conf 699888751554347752027 No 254 >cd07082 ALDH_F11_NP-GAPDH NADP+-dependent non-phosphorylating glyceraldehyde 3-phosphate dehydrogenase and ALDH family 11. NADP+-dependent non-phosphorylating glyceraldehyde 3-phosphate dehydrogenase (NP-GAPDH, EC=1.2.1.9) catalyzes the irreversible oxidation of glyceraldehyde 3-phosphate to 3-phosphoglycerate generating NADPH for biosynthetic reactions. This CD also includes the Arabidopsis thaliana osmotic-stress-inducible ALDH family 11, ALDH11A3 and similar sequences. In autotrophic eukaryotes, NP-GAPDH generates NADPH for biosynthetic processes from photosynthetic glyceraldehyde-3-phosphate exported from the chloroplast and catalyzes one of the classic glycolytic bypass reactions unique to plants. Probab=23.61 E-value=36 Score=14.33 Aligned_cols=121 Identities=18% Similarity=0.255 Sum_probs=48.7 Q ss_pred CCCCCCCEEECCC-CCCCCCCCCCEEEEEE-C-CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHH Q ss_conf 6475430000036-6455677661025651-4-88774899989999999999987520000136765576448868468 Q gi|254780670|r 138 HILPVPLMNILNG-GIHADNALDFQEFMIM-P-VGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTAD 214 (424) Q Consensus 138 ~~lP~P~~NiinG-G~HA~~~l~iQEfmIi-P-~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~e 214 (424) ..||--.+|++.| |.-.+..| +- | +...+|.-+.+.+-.|.+... + + .-+.--||-.|.+=..+ T Consensus 194 aGlP~Gv~nvv~g~g~~~g~~l------~~~p~v~~v~FTGS~~~G~~i~~~a~--~--k---~~~lELGGknp~IV~~D 260 (473) T cd07082 194 AGFPKGVVNVVTGRGREIGDPL------VTHGRIDVISFTGSTEVGNRLKKQHP--M--K---RLVLELGGKDPAIVLPD 260 (473) T ss_pred CCCCCCCEEEEECCHHHHHHHH------HHCCCCCEEEEECCHHHHHHHHHHHH--C--C---EEEECCCCCCCCCCCCC T ss_conf 3999676046116879999999------83999576997697499999999750--4--5---66752588762112676 Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 99999999997538531110234311046532027851111468656878999999987654486997767631 Q gi|254780670|r 215 SALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 215 eaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) --|+...+++-...|...-+.+.+ .+.+|- -+ ---+++++.+.+-+++. .+=||+++ T Consensus 261 ADld~av~~~~~~af~~aGQ~C~a----~~Rv~V-------~~---~i~d~f~~~l~~~~~~l---~vGdp~~~ 317 (473) T cd07082 261 ADLELAAKEIVKGALSYSGQRCTA----IKRVLV-------HE---SVADELVELLKEEVAKL---KVGMPWDN 317 (473) T ss_pred CCHHHHHHHHHHHHHCCCCCCCCC----CCEEEE-------EH---HHHHHHHHHHHHHHHCC---CCCCHHHC T ss_conf 799999999999874568975677----862576-------12---03999999999998647---66796765 No 255 >COG2204 AtoC Response regulator containing CheY-like receiver, AAA-type ATPase, and DNA-binding domains [Signal transduction mechanisms] Probab=23.52 E-value=48 Score=13.37 Aligned_cols=110 Identities=25% Similarity=0.341 Sum_probs=48.1 Q ss_pred CCCCCCCEEECCCCCC--CCC--CCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCC---CCCC Q ss_conf 6475430000036645--567--76610256514887748999899999999999875200001367655764---4886 Q gi|254780670|r 138 HILPVPLMNILNGGIH--ADN--ALDFQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGF---SPNL 210 (424) Q Consensus 138 ~~lP~P~~NiinGG~H--A~~--~l~iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGf---ap~~ 210 (424) ..+|+-+|. +-|.. |-. +.--.+|+.-|.........++-+.+......+.-. .......++..+ .|.+ T Consensus 75 ~~~pVI~~T--g~g~i~~AV~A~k~GA~Dfl~KP~~~~~L~~~v~ral~~~~~~~e~~~--~~~~~~~~~~~liG~S~am 150 (464) T COG2204 75 PDLPVIVMT--GHGDIDTAVEALRLGAFDFLEKPFDLDRLLAIVERALELRELQRENRR--SLKRAKSLGGELVGESPAM 150 (464) T ss_pred CCCCEEEEE--CCCCHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHHHHHHHHHHHHHH--HHCCCCCCCCCCCCCCHHH T ss_conf 999889982--889999999998557033321899999999999999987653222100--0001345567752069999 Q ss_pred CCHHHH--------------------HHHHHHHHHHCCCCCCCCHHHHHCHHH-------HHHCCCCCE Q ss_conf 846899--------------------999999997538531110234311046-------532027851 Q gi|254780670|r 211 KTADSA--------------------LDLIANSIEKAGYNAGKDLLIALDCAA-------SAFFKKEKY 252 (424) Q Consensus 211 ~~~eea--------------------L~ll~eAi~~aGy~~g~di~i~lD~AA-------sefy~~~kY 252 (424) ....+. =+++..+|...+-.- +.=+++++||| ||+|-.+|. T Consensus 151 ~~l~~~i~kvA~s~a~VLI~GESGtGKElvAr~IH~~S~R~-~~PFVavNcaAip~~l~ESELFGhekG 218 (464) T COG2204 151 QQLRRLIAKVAPSDASVLITGESGTGKELVARAIHQASPRA-KGPFIAVNCAAIPENLLESELFGHEKG 218 (464) T ss_pred HHHHHHHHHHHCCCCCEEEECCCCCCHHHHHHHHHHHCCCC-CCCCEEEECCCCCHHHHHHHHHCCCCC T ss_conf 99999999984779978997789875899999998607445-899256334648988877776145656 No 256 >PRK07590 L,L-diaminopimelate aminotransferase; Validated Probab=23.51 E-value=48 Score=13.37 Aligned_cols=30 Identities=7% Similarity=0.078 Sum_probs=17.8 Q ss_pred CCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 8656878999999987654486997767631 Q gi|254780670|r 258 NLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 258 ~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) |..+|.+|+-++ .+|+.+|.++-|.|=.++ T Consensus 190 G~V~s~eel~~i-~~~a~~~~i~iisDEiY~ 219 (409) T PRK07590 190 GTVATKEQLKAW-VDYAKANGSLILFDAAYE 219 (409) T ss_pred CCCCCHHHHHHH-HHHHHHCCCEEEEECCCC T ss_conf 655799999999-999987497899707620 No 257 >COG2871 NqrF Na+-transporting NADH:ubiquinone oxidoreductase, subunit NqrF [Energy production and conversion] Probab=23.49 E-value=49 Score=13.37 Aligned_cols=38 Identities=24% Similarity=0.489 Sum_probs=27.9 Q ss_pred CCHHHH--HHHHHHHHHHCC-EE---EEECCCCCCCHHHHHHHH Q ss_conf 687899--999998765448-69---977676310423799886 Q gi|254780670|r 261 LQPNEM--ASYLANLVDQYP-IY---SIEDGMSEDDWHGWKILT 298 (424) Q Consensus 261 ~t~~el--id~y~~l~~~yP-I~---sIEDP~~e~D~~gw~~l~ 298 (424) .+-.|| .+-|.+|-.++| .. .+-||+.||.|++.+.|- T Consensus 314 RS~rE~fY~Ed~d~L~ae~pNF~wH~aLSdplpEDnW~g~TgFi 357 (410) T COG2871 314 RSLREMFYQEDFDQLQAENPNFHWHLALSDPLPEDNWDGYTGFI 357 (410) T ss_pred CHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCHHHH T ss_conf 07777777877888875389817999736888766854421289 No 258 >TIGR03217 4OH_2_O_val_ald 4-hydroxy-2-oxovalerate aldolase. Members of this protein family are 4-hydroxy-2-oxovalerate aldolase, also called 4-hydroxy-2-ketovalerate aldolase and 2-oxo-4-hydroxypentanoate aldolase. This enzyme, part of the pathway for the meta-cleavage of catechol, produces pyruvate and acetaldehyde. Acetaldehyde is then converted by acetaldehyde dehydrogenase (acylating) (DmpF; EC 1.2.1.10) to acetyl-CoA. The two enzymes are tightly associated. Probab=23.45 E-value=49 Score=13.36 Aligned_cols=28 Identities=29% Similarity=0.636 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHCHH Q ss_conf 9999999999753853111023431104 Q gi|254780670|r 215 SALDLIANSIEKAGYNAGKDLLIALDCA 242 (424) Q Consensus 215 eaL~ll~eAi~~aGy~~g~di~i~lD~A 242 (424) -+||.++.+.++.||+.|.|..--+|+| T Consensus 231 a~lE~lVa~l~~~g~~tgiDl~~l~d~a 258 (333) T TIGR03217 231 APLEVFVAVLDRLGWNTGCDLFKLMDAA 258 (333) T ss_pred CHHHHHHHHHHCCCCCCCCCHHHHHHHH T ss_conf 3499999999617986587989999999 No 259 >PRK07309 aromatic amino acid aminotransferase; Validated Probab=23.30 E-value=49 Score=13.34 Aligned_cols=31 Identities=16% Similarity=0.423 Sum_probs=22.9 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCEEEEECCCCC Q ss_conf 68656878999999987654486997767631 Q gi|254780670|r 257 ENLELQPNEMASYLANLVDQYPIYSIEDGMSE 288 (424) Q Consensus 257 ~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e 288 (424) .|..++.+++-.+ .+||.+|.++-|+|=.+. T Consensus 179 tG~v~s~e~l~~i-~~la~~~~v~iisDEiY~ 209 (390) T PRK07309 179 TGVTYSREQIKAL-AEVLKKYDIFVISDEVYS 209 (390) T ss_pred CCCCCCHHHHHHH-HHHHHHCCEEEEEECCCH T ss_conf 8887899999999-999976898999505511 No 260 >TIGR00003 TIGR00003 copper ion binding protein; InterPro: IPR006122 Proteins that transport heavy metals in micro-organisms and eukaryotes share similarities in their sequences and structures. These proteins provide an important focus for research, some being involved in bacterial resistance to toxic metals, such as lead and cadmium, while others are involved in inherited human syndromes, such as Wilson's and Menke's diseases . A conserved 30-residue domain has been found in a number of these heavy metal transport or detoxification proteins . The domain, which has been termed Heavy-Metal-Associated (HMA), contains two conserved cysteines that are probably involved in metal binding. This sub-domain is found in copper-binding proteins. ; GO: 0005507 copper ion binding, 0006825 copper ion transport. Probab=23.23 E-value=40 Score=14.03 Aligned_cols=24 Identities=21% Similarity=0.345 Sum_probs=18.1 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCC Q ss_conf 886846899999999997538531 Q gi|254780670|r 208 PNLKTADSALDLIANSIEKAGYNA 231 (424) Q Consensus 208 p~~~~~eeaL~ll~eAi~~aGy~~ 231 (424) ..||.+.-.-+.|.+||..+||+. T Consensus 42 V~Fd~~~v~~~~I~~Ai~d~GY~~ 65 (66) T TIGR00003 42 VEFDAPKVSAKEIKEAILDAGYEV 65 (66) T ss_pred EEECCCCCCHHHHHHHHHHCCCCC T ss_conf 875375344677888987366536 No 261 >cd06346 PBP1_ABC_ligand_binding_like_11 Type I periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems that are predicted to be involved in uptake of amino acids, peptides, or inorganic ions. This subgroup includes the type I periplasmic ligand-binding domain of uncharacterized ABC (Atpase Binding Cassette)-type active transport systems that are predicted to be involved in uptake of amino acids, peptides, or inorganic ions. This subgroup has high sequence similarity to members of the family of hydrophobic amino acid transporters (HAAT), such as leucine/isoleucine/valine binding protein (LIVBP); however its ligand specificity has not been determined experimentally. Probab=23.21 E-value=49 Score=13.33 Aligned_cols=46 Identities=13% Similarity=-0.010 Sum_probs=21.2 Q ss_pred HHHHHHHHHHHHHCC-EEEEECCCCCCCHHHHHHHHHHCCCCEEEECCC Q ss_conf 899999998765448-699776763104237998864238836850430 Q gi|254780670|r 264 NEMASYLANLVDQYP-IYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDD 311 (424) Q Consensus 264 ~elid~y~~l~~~yP-I~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDD 311 (424) .++..++.++-..-| .+.+- ...+|-..+.+-..+.|-+..+++-+ T Consensus 179 ~Dfs~~l~ki~~~~pD~v~~~--~~~~~~~~~~kq~~~~G~~~~~~~~~ 225 (312) T cd06346 179 SSYSSEVAAAAAGGPDALVVI--GYPETGSGILRSAYEQGLFDKFLLTD 225 (312) T ss_pred CCHHHHHHHHHHCCCCEEEEE--CCCHHHHHHHHHHHHCCCCCCEEEEC T ss_conf 455999999986699999994--67347999999999759999779644 No 262 >cd07103 ALDH_F5_SSADH_GabD Mitochondrial succinate-semialdehyde dehydrogenase and ALDH family members 5A1 and 5F1-like. Succinate-semialdehyde dehydrogenase, mitochondrial (SSADH, GabD, EC=1.2.1.24) catalyzes the NAD+-dependent oxidation of succinate semialdehyde (SSA) to succinate. This group includes the human aldehyde dehydrogenase family 5 member A1 (ALDH5A1) which is a mitochondrial homotetramer that converts SSA to succinate in the last step of 4-aminobutyric acid (GABA) catabolism. This CD also includes the Arabidopsis SSADH gene product ALDH5F1. Mutations in this gene result in the accumulation of H2O2, suggesting a role in plant defense against the environmental stress of elevated reactive oxygen species. Probab=23.03 E-value=49 Score=13.30 Aligned_cols=125 Identities=17% Similarity=0.236 Sum_probs=53.7 Q ss_pred CCCCCCEEECCCC-CCCCCCCC-CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 4754300000366-45567766-102565148877489998999999999998752000013676557644886846899 Q gi|254780670|r 139 ILPVPLMNILNGG-IHADNALD-FQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSA 216 (424) Q Consensus 139 ~lP~P~~NiinGG-~HA~~~l~-iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eea 216 (424) .+|--.+|++.|. ...+..|- =..+ ...+|.-+.+.+-.|.+...+.++.-- -.-||=.|.+=..+-- T Consensus 171 GlP~gv~~vv~g~~~~~g~~l~~~~~v-----~~v~FTGS~~~G~~i~~~aa~~lk~v~-----lELGGk~p~IV~~dAd 240 (451) T cd07103 171 GLPAGVLNVVTGSPAEIGEALCASPRV-----RKISFTGSTAVGKLLMAQAADTVKRVS-----LELGGNAPFIVFDDAD 240 (451) T ss_pred CCCHHHEEECCCCHHHHHHHHHHCCCC-----CEEEEECCCHHHHHHHHHHHHHCCEEE-----EECCCCEEEEECCCCC T ss_conf 969423754457778899998609996-----769984883143148999886585778-----9718931699879999 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCCC Q ss_conf 99999999753853111023431104653202785111146865687899999998765448699776763104 Q gi|254780670|r 217 LDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSEDD 290 (424) Q Consensus 217 L~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~D 290 (424) |+...+++-...|...-+.+.+ .++.|--+ ---+++++.+...++++ .+-||++++. T Consensus 241 l~~aa~~~~~~~f~naGQ~C~a-----------~~rv~V~~---~i~d~f~~~l~~~~~~l---~~G~p~~~~~ 297 (451) T cd07103 241 LDKAVDGAIASKFRNAGQTCVC-----------ANRIYVHE---SIYDEFVEKLVERVKKL---KVGNGLDEGT 297 (451) T ss_pred HHHHHHHHHHHHHCCCCCCCCC-----------CCCCEECC---CHHHHHHHHHHHHHHCC---CCCCCCCCCC T ss_conf 9999998765030007997776-----------87540120---35999999998543017---7579777667 No 263 >KOG0053 consensus Probab=22.96 E-value=50 Score=13.29 Aligned_cols=15 Identities=27% Similarity=0.178 Sum_probs=8.1 Q ss_pred CCCCHHHHHHHHHHH Q ss_conf 887235899899981 Q gi|254780670|r 366 GETEDHTIADLAVAT 380 (424) Q Consensus 366 GETeD~~iaDLAVg~ 380 (424) =|..|+-|+||.-|+ T Consensus 390 iEd~~dL~~d~~~Al 404 (409) T KOG0053 390 IEDIEDLIKDFQQAL 404 (409) T ss_pred CCCHHHHHHHHHHHH T ss_conf 677999999999999 No 264 >pfam00580 UvrD-helicase UvrD/REP helicase. The Rep family helicases are composed of four structural domains. The Rep family function as dimers. REP helicases catalyse ATP dependent unwinding of double stranded DNA to single stranded DNA. Bacillus subtilis addA and Escherichia coli exodeoxyribonuclease V beta have large insertions near to the carboxy-terminus relative to other members of the family. Probab=22.95 E-value=50 Score=13.29 Aligned_cols=151 Identities=25% Similarity=0.271 Sum_probs=72.7 Q ss_pred CCCCHHHHHHHHHHHHHH-----------CCEEEEECCCCCCCHHHHHHHHHHCC--CCEEEECCC---EE---ECCHHH Q ss_conf 656878999999987654-----------48699776763104237998864238--836850430---00---021488 Q gi|254780670|r 259 LELQPNEMASYLANLVDQ-----------YPIYSIEDGMSEDDWHGWKILTNKIG--SNCQLVGDD---LF---VTNPER 319 (424) Q Consensus 259 ~~~t~~elid~y~~l~~~-----------yPI~sIEDP~~e~D~~gw~~l~~~~g--~~~~IvGDD---L~---vTn~~r 319 (424) ..++-++|+-+..++..+ |+ .-+.|=|.+-+.-+|.-+..-.+ .++.+|||+ .| -.+++. T Consensus 184 ~~~Df~Dll~~~~~lL~~~~~~~~~~~~r~~-~ilVDEfQDtn~~Q~~il~~L~~~~~~l~~VGD~~QSIY~FRGAd~~~ 262 (494) T pfam00580 184 NALDFDDLLLLTLELLRSDPELLEAYRERFK-YILVDEFQDTNPIQYELLKLLAGGHPNLFLVGDPDQSIYGFRGADIEN 262 (494) T ss_pred CCCCHHHHHHHHHHHHHCCHHHHHHHHHHCC-EEECHHHCCCHHHHHHHHHHHHCCCCEEEEEECCCEEEEECCCCCHHH T ss_conf 9989899999999997319999999986072-654034302649999999998658983999817971442017999699 Q ss_pred HHHHHHCCCCCEEEECCCCH-HHHHHHHHHHHHHHHC----------------CC-EEEEECCCCCCCHHHHHHHHHHHC Q ss_conf 85210038652234254530-0389999999999987----------------98-699950788872358998999819 Q gi|254780670|r 320 LHKGINEEVANAILIKPNQI-GSVSETLNTIEKAQMA----------------GY-PSIISHRSGETEDHTIADLAVATN 381 (424) Q Consensus 320 l~~gi~~~~~NailiK~NQi-GTvset~ea~~~a~~~----------------g~-~~ivShRSGETeD~~iaDLAVg~~ 381 (424) +..-.+ ..-++-.+.+++- =+-.+.+++++..-++ +- ..+....+-+.|-.+||+---.+- T Consensus 263 ~~~f~~-~~~~~~~i~L~~NyRS~~~Il~~aN~l~~~~~~~~~~~~l~~~~~~~~~v~~~~~~~~~~Ea~~ia~~I~~l~ 341 (494) T pfam00580 263 ILKFEK-DFPAAKVITLEQNYRSTPEILNAANKLISNNTARKEKRLLWSQKGSGEKVTVYEAETEQEEAEFIADEIKALH 341 (494) T ss_pred HHHHHH-HCCCCCEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHH T ss_conf 999998-6665667258899898878999999999726022445435455788996079837999999999999999998 Q ss_pred -CC-HHEECCCCHHHHHHHHHHHHHHHHHHHHCC Q ss_conf -98-010075611448999999999999831104 Q gi|254780670|r 382 -CG-QIKTGSLARSDRIAKYNQLIRIEESLGKQA 413 (424) Q Consensus 382 -a~-~iK~G~~~R~ER~aKyNrLlrIee~Lg~~a 413 (424) .+ .++.+-.+=-=|. .++.-.+|+.|.... T Consensus 342 ~~~~~~~~~DiAIL~R~--~~~~~~l~~~L~~~g 373 (494) T pfam00580 342 KSGAAIPFGDIAVLVRS--NSQSRAIEEALLKAG 373 (494) T ss_pred HCCCCCCCCCEEEEEEC--CHHHHHHHHHHHHCC T ss_conf 77999885656899825--231899999999779 No 265 >TIGR01386 cztS_silS_copS heavy metal sensor kinase; InterPro: IPR006290 Members of this family contain a sensor histidine kinase domain (IPR003661 from INTERPRO) and a domain found in bacterial signal proteins (IPR003660 from INTERPRO). This group is separated phylogenetically from related proteins with similar architecture and contains a number of proteins associated with heavy metal resistance efflux systems for copper, silver, cadmium, and/or zinc. ; GO: 0000155 two-component sensor activity, 0016775 phosphotransferase activity nitrogenous group as acceptor, 0000160 two-component signal transduction system (phosphorelay), 0007600 sensory perception. Probab=22.94 E-value=49 Score=13.36 Aligned_cols=64 Identities=14% Similarity=0.302 Sum_probs=43.0 Q ss_pred HH-HHHHHHHHHHHHCC-EEEEECCCCCCCHHHHHHHHHHCCCC-EEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCH Q ss_conf 78-99999998765448-69977676310423799886423883-68504300002148885210038652234254530 Q gi|254780670|r 263 PN-EMASYLANLVDQYP-IYSIEDGMSEDDWHGWKILTNKIGSN-CQLVGDDLFVTNPERLHKGINEEVANAILIKPNQI 339 (424) Q Consensus 263 ~~-elid~y~~l~~~yP-I~sIEDP~~e~D~~gw~~l~~~~g~~-~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQi 339 (424) -+ .+++||+-|.++-. |++|+ +..+ |++.|| +..++++|.+-..| T Consensus 341 ~~l~~~~~fE~lAee~~Piv~l~-----------------v~G~P~~~~gD------~~lfrRA~sNLLsN--------- 388 (483) T TIGR01386 341 LALKVAEYFEPLAEERGPIVSLR-----------------VEGEPAEVRGD------ALLFRRALSNLLSN--------- 388 (483) T ss_pred HHHHHHHHHHCCCCCCCCEEEEE-----------------EECCCCEEECC------HHHHHHHHHHHHHH--------- T ss_conf 87888877420276788668999-----------------83242056024------78889999999999--------- Q ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCC Q ss_conf 038999999999998798699950788 Q gi|254780670|r 340 GSVSETLNTIEKAQMAGYPSIISHRSG 366 (424) Q Consensus 340 GTvset~ea~~~a~~~g~~~ivShRSG 366 (424) |++.+-..+...|--.|.. T Consensus 389 --------A~rhtp~g~~I~v~~~~~~ 407 (483) T TIGR01386 389 --------ALRHTPAGSTIDVRVERRA 407 (483) T ss_pred --------HHHHCCCCCEEEEEEEECC T ss_conf --------9860899885899996147 No 266 >PRK09140 2-dehydro-3-deoxy-6-phosphogalactonate aldolase; Reviewed Probab=22.91 E-value=50 Score=13.29 Aligned_cols=64 Identities=17% Similarity=0.213 Sum_probs=25.2 Q ss_pred CHHHHHHHHHHHHHHCCEEEEECCCCCCCH-HHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCC Q ss_conf 878999999987654486997767631042-3799886423883685043000021488852100386 Q gi|254780670|r 262 QPNEMASYLANLVDQYPIYSIEDGMSEDDW-HGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEV 328 (424) Q Consensus 262 t~~elid~y~~l~~~yPI~sIEDP~~e~D~-~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~ 328 (424) +.++...+-.-| -+-.|..||=.+.-.++ +....+.+++++++.|-.. +|++++-++++++-++ T Consensus 20 ~~~~a~~~~~al-~~~Gi~~iEVTl~tp~a~~~I~~l~~~~~~~~~iGAG--TVlt~e~~~~ai~aGA 84 (206) T PRK09140 20 TPDEALAHVGAL-IEAGFRAIEIPLNSPDPFDSIAALVKALGDDALIGAG--TVLSPEQVDRLADAGG 84 (206) T ss_pred CHHHHHHHHHHH-HHCCCCEEEEECCCCCHHHHHHHHHHHCCCCEEEEEE--ECCCHHHHHHHHHCCC T ss_conf 999999999999-9869988999179976999999999967986599862--0467999999998599 No 267 >COG1019 Predicted nucleotidyltransferase [General function prediction only] Probab=22.80 E-value=50 Score=13.27 Aligned_cols=62 Identities=24% Similarity=0.370 Sum_probs=39.1 Q ss_pred HHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCC-EEEEECCCCC Q ss_conf 999999753853111023431104653202785111146865687899999998765448-6997767631 Q gi|254780670|r 219 LIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYP-IYSIEDGMSE 288 (424) Q Consensus 219 ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yP-I~sIEDP~~e 288 (424) ||..|++- |.++.||| -+.||-++. |.+.......--..+.+|+..+-.+|+ |+-|.||+-. T Consensus 24 LL~~A~~~-----G~~v~IGl--TsDe~~k~~-k~~~i~p~~~R~~~l~~fl~~~~~~~~~iv~i~Dp~G~ 86 (158) T COG1019 24 LLEVAFEI-----GDRVTIGL--TSDELAKKK-KKEKIEPYEVRLRNLRNFLESIKADYEEIVPIDDPYGP 86 (158) T ss_pred HHHHHHHH-----CCEEEEEE--CCHHHHHHH-CCCCCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCC T ss_conf 99999985-----79179997--648999872-22567748999999999999846886549982589999 No 268 >smart00631 Zn_pept Zn_pept. Probab=22.73 E-value=50 Score=13.26 Aligned_cols=28 Identities=32% Similarity=0.455 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 489999999875678975988557898886 Q gi|254780670|r 61 GVLKAIAFVNDEIRTALLGCDARDQLLIDK 90 (424) Q Consensus 61 gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~ 90 (424) |...++..++..+..- +.++.-+.-+|+ T Consensus 65 ~~~~~l~~i~~l~~~~--~~~~~~~~ll~~ 92 (277) T smart00631 65 GPATALYLINQLLENY--GRDPRITKLLDK 92 (277) T ss_pred HHHHHHHHHHHHHHHC--CCCHHHHHHHHC T ss_conf 4899999999999860--476899998747 No 269 >PRK08674 bifunctional phosphoglucose/phosphomannose isomerase; Validated Probab=22.53 E-value=51 Score=13.24 Aligned_cols=28 Identities=18% Similarity=0.235 Sum_probs=12.2 Q ss_pred HHCCCCCEEEECCCCHHHHHHHHHHHHH Q ss_conf 0038652234254530038999999999 Q gi|254780670|r 324 INEEVANAILIKPNQIGSVSETLNTIEK 351 (424) Q Consensus 324 i~~~~~NailiK~NQiGTvset~ea~~~ 351 (424) ++........|.+.--..+...+..+-+ T Consensus 270 ~~~~~~~v~~i~~~g~s~L~rl~~li~l 297 (328) T PRK08674 270 LGASVIPVVEIDPRGNSPLARLLSLILL 297 (328) T ss_pred HHCCCCCEEEEECCCCCHHHHHHHHHHH T ss_conf 8436985699958998889999999999 No 270 >pfam09161 consensus Probab=22.41 E-value=35 Score=14.43 Aligned_cols=30 Identities=23% Similarity=0.368 Sum_probs=14.1 Q ss_pred EEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCE Q ss_conf 6997767631042379988642388368504300 Q gi|254780670|r 279 IYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDL 312 (424) Q Consensus 279 I~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL 312 (424) ++.+|--+.+.+ ||..|-..- --||+.+|| T Consensus 294 lvAlegi~~~~~--G~eafl~~~--GWkvl~~Dl 323 (391) T pfam09161 294 LVALEGMVSERK--GFEAFLDFG--GWKVLYDDL 323 (391) T ss_pred HHHHHHHHHHHH--HHHHHHCCC--CCEEEHHHH T ss_conf 997435654127--899986078--714318689 No 271 >KOG2599 consensus Probab=22.20 E-value=51 Score=13.19 Aligned_cols=16 Identities=19% Similarity=0.326 Sum_probs=6.7 Q ss_pred CCCCHHHHHHHHHHHH Q ss_conf 6568789999999876 Q gi|254780670|r 259 LELQPNEMASYLANLV 274 (424) Q Consensus 259 ~~~t~~elid~y~~l~ 274 (424) +.+.++|++-.|.+++ T Consensus 126 ~lYV~eelipvYr~~i 141 (308) T KOG2599 126 RLYVPEELIPVYRDLI 141 (308) T ss_pred CEECCHHHHHHHHHHH T ss_conf 1744587779999863 No 272 >cd03871 M14_CPB Peptidase M14 Carboxypeptidase B (CPB) belongs to the carboxypeptidase A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Carboxypeptidase B (CPB) enzymes only cleave the basic residues lysine or arginine. A/B subfamily enzymes are normally synthesized as inactive precursors containing preceding signal peptide, followed by a globular N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The procarboxypeptidase B (PCPB) is produced by the exocrine pancreas and stored as stable zymogen in the pancreatic granules until secretion into the digestive tract occurs. PCPB has been reported to be a good serum marker for the diagnosis of acute pancreatitis and graft rejection in pancreas transplant recipients. Probab=22.17 E-value=51 Score=13.19 Aligned_cols=19 Identities=37% Similarity=0.693 Sum_probs=10.2 Q ss_pred CCCHHHHHHHHHH------HHHCCC Q ss_conf 9885578988864------653036 Q gi|254780670|r 79 GCDARDQLLIDKI------MIDLDG 97 (424) Q Consensus 79 g~~~~dQ~~iD~~------Li~lDg 97 (424) +.++.-+.-+|+. ++.-|| T Consensus 85 ~~d~~~~~ll~~~d~~ivP~~NPDG 109 (300) T cd03871 85 GREAIMTELLDKLDFYVLPVLNIDG 109 (300) T ss_pred CCCHHHHHHHHCCEEEEEEEECCCC T ss_conf 7788899998567399998764760 No 273 >PRK06253 O-phosphoseryl-tRNA synthetase; Reviewed Probab=22.03 E-value=52 Score=13.17 Aligned_cols=75 Identities=16% Similarity=0.328 Sum_probs=50.7 Q ss_pred CCHHHHHHHHHHHHHHCC----EEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHHHHHCCCCC-EEEEC Q ss_conf 687899999998765448----6997767631042379988642388368504300002148885210038652-23425 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP----IYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHKGINEEVAN-AILIK 335 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP----I~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~gi~~~~~N-ailiK 335 (424) .|+. ..-.+..|.++.| +.||.-.|+.+-.+.++.|..-....|.|+|+|..+-+-+-+.+++-.+-.= -+-.+ T Consensus 188 mTsg-wfitL~~l~~k~~~P~klFSIDRcFRrEq~eD~~hL~~yhsascvv~~edVn~d~gkav~~~lL~~fGF~~frFr 266 (527) T PRK06253 188 MTSG-WFITLSALWERKPLPFKLFSIDRCFRREQKEDASRLMTYHSASCVVAGEDVTVDDGKAVAEGLLSQFGFTNFRFR 266 (527) T ss_pred CCCH-HHHHHHHHHHCCCCCEEEEEEEHEEECHHHCCHHHHHHHHEEEEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEC T ss_conf 0302-788799997267998368741010220000245563454432269975986677779999999997097537843 Q ss_pred C Q ss_conf 4 Q gi|254780670|r 336 P 336 (424) Q Consensus 336 ~ 336 (424) | T Consensus 267 P 267 (527) T PRK06253 267 P 267 (527) T ss_pred C T ss_conf 4 No 274 >KOG2792 consensus Probab=22.03 E-value=44 Score=13.71 Aligned_cols=75 Identities=13% Similarity=0.276 Sum_probs=38.4 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCCCC---HHHHHCHHHHHHCCCCCEEECCC----CCCCCHHHHHHHHHHHHHHCCE Q ss_conf 4886846899999999997538531110---23431104653202785111146----8656878999999987654486 Q gi|254780670|r 207 SPNLKTADSALDLIANSIEKAGYNAGKD---LLIALDCAASAFFKKEKYILKGE----NLELQPNEMASYLANLVDQYPI 279 (424) Q Consensus 207 ap~~~~~eeaL~ll~eAi~~aGy~~g~d---i~i~lD~AAsefy~~~kY~~~~~----~~~~t~~elid~y~~l~~~yPI 279 (424) +|++ .+ +-|+-|..+|+...-++|.. ++|.+|.+-..-.-=..|.-++. |..=|.+|+ .+.|++|-+ T Consensus 151 CPDI-CP-dELeKm~~~Vd~i~~~~~~~~~PlFIsvDPeRD~~~~~~eY~~eF~pkllGLTGT~eqv----k~vak~yRV 224 (280) T KOG2792 151 CPDI-CP-DELEKMSAVVDEIEAKPGLPPVPLFISVDPERDSVEVVAEYVSEFHPKLLGLTGTTEQV----KQVAKKYRV 224 (280) T ss_pred CCCC-CH-HHHHHHHHHHHHHHCCCCCCCCCEEEEECCCCCCHHHHHHHHHHCCHHHHCCCCCHHHH----HHHHHHHEE T ss_conf 9976-96-99999999999985258998622599847664879999999985086650566889999----999988078 Q ss_pred EEEECCCC Q ss_conf 99776763 Q gi|254780670|r 280 YSIEDGMS 287 (424) Q Consensus 280 ~sIEDP~~ 287 (424) ++---|=+ T Consensus 225 Yfs~gp~d 232 (280) T KOG2792 225 YFSTGPKD 232 (280) T ss_pred EECCCCCC T ss_conf 61468987 No 275 >pfam09226 Endonuc-HincII Restriction endonuclease HincII. Members of this family of prokaryotic restriction endonucleases recognize the double-stranded sequence 5'-GTYRAC-3' and cleave after Y-3. They catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Probab=21.89 E-value=17 Score=16.68 Aligned_cols=30 Identities=23% Similarity=0.426 Sum_probs=18.1 Q ss_pred CCHHHHCCCC-CCCCCCCCCEEECCCCCCCCCC Q ss_conf 8488822532-4564754300000366455677 Q gi|254780670|r 126 LPLYKYLGGC-SAHILPVPLMNILNGGIHADNA 157 (424) Q Consensus 126 ~pLy~yl~~~-~~~~lP~P~~NiinGG~HA~~~ 157 (424) .|.|+-+... -....|+|.---++| ||.+. T Consensus 4 ~~~~~~~~~~~~G~~V~rP~sGtLSG--HAaGE 34 (257) T pfam09226 4 KPIYQDINSILIGQKVKRPKSGTLSG--HAAGE 34 (257) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCCCC--CCCCC T ss_conf 12878767776045245788775366--54789 No 276 >PRK12833 acetyl-CoA carboxylase biotin carboxylase subunit; Provisional Probab=21.79 E-value=52 Score=13.13 Aligned_cols=31 Identities=19% Similarity=0.271 Sum_probs=14.1 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHCHHHHHHC Q ss_conf 9999999975385311102343110465320 Q gi|254780670|r 217 LDLIANSIEKAGYNAGKDLLIALDCAASAFF 247 (424) Q Consensus 217 L~ll~eAi~~aGy~~g~di~i~lD~AAsefy 247 (424) .+.-......+||..--.|-+-+|-...+|| T Consensus 258 ~~~a~~la~~v~y~gagTvEFl~d~~~~~fy 288 (458) T PRK12833 258 CASATRLARAVGYRGAGTLEYLFDDARGEFY 288 (458) T ss_pred HHHHHHHHHHCCCCEEEEEEEEEECCCCCEE T ss_conf 8899999885331000026667865889689 No 277 >COG5011 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.43 E-value=53 Score=13.08 Aligned_cols=69 Identities=17% Similarity=0.285 Sum_probs=26.3 Q ss_pred HHHHHHHHHHHHHHCCC--------CCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHH-HHHHCCEEEE Q ss_conf 68999999999975385--------3111023431104653202785111146865687899999998-7654486997 Q gi|254780670|r 213 ADSALDLIANSIEKAGY--------NAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLAN-LVDQYPIYSI 282 (424) Q Consensus 213 ~eeaL~ll~eAi~~aGy--------~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~-l~~~yPI~sI 282 (424) .-+.+.++..+|.+||+ .|+-.+.|+ ++-+---|+.|.|.+-.-....++++.++-+.+ |-.++|+..+ T Consensus 18 hLdlmRlidR~iRRAglpiayT~GFhP~prmsia-~alpvG~~ssge~fd~eL~e~v~~dk~~etl~~~lp~Dl~~~~v 95 (228) T COG5011 18 HLDLMRLIDRTIRRAGLPIAYTGGFHPHPRMSIA-QALPVGIYSSGEIFDFELTEEVSEDKFKETLNKALPNDLPAYDV 95 (228) T ss_pred HHHHHHHHHHHHHHCCCCEEECCCCCCCCCEEEC-CCCCCCCCCCCEEEEEEEEEECCCHHHHHHHHHHCCCCCCHHHE T ss_conf 8899999999988649853543788999742210-24433201476189998643117188999999757887551310 No 278 >cd03872 M14_CPA6 Carboxypeptidase (CP) A6 (CPA6, also known as CPAH; EC 3.4.17.1), belongs to the carboxypeptidase A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPA6 prefers large hydrophobic C-terminal amino acids as well as histidine, while peptides with a penultimate glycine or proline are very poorly cleaved. Several neuropeptides are processed by CPA6, including Met- and Leu-enkephalin, angiotensin I, and neurotensin. CPA6 converts enkephalin and neurotensin into forms known to be inactive toward their receptors, but converts inactive angiotensin I into the biologically active angiotensin II. Thus, CPA6 plays a possible role in the regulation of neuropeptides in the extracellular environment within the olfactory bulb where it is highly expresse Probab=21.40 E-value=53 Score=13.08 Aligned_cols=29 Identities=21% Similarity=0.148 Sum_probs=15.4 Q ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHC Q ss_conf 67655764488684689999999999753 Q gi|254780670|r 199 NVGDEGGFSPNLKTADSALDLIANSIEKA 227 (424) Q Consensus 199 ~vgDEGGfap~~~~~eeaL~ll~eAi~~a 227 (424) ..|.....+|+.+...++-.-..+||... T Consensus 197 P~gyt~~~~p~~~~l~~~a~~a~~ai~~~ 225 (300) T cd03872 197 PYSYKYATIPNFGCVESAAYNAVNALQSA 225 (300) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 35578889989899999999999999987 No 279 >TIGR03419 NifU_clost FeS cluster assembly scaffold protein NifU, Clostridium type. NifU and NifS form a pair of iron-sulfur (FeS) cluster biosynthesis proteins much simpler than the ISC and SUF systems. Members of this protein family are a distinct group of NifU-like proteins, found always to a NifS-like protein and restricted to species that lack a SUF system. Typically, NIF systems service a smaller number of FeS-containing proteins than do ISC or SUF. Members of this particular branch typically are found, almost half the time, near the mnmA gene, involved in the carboxymethylaminomethyl modification of U34 in some tRNAs (see GenProp0704). While other NifU proteins are associated with nitrogen fixation, this family is not. Probab=21.38 E-value=53 Score=13.07 Aligned_cols=65 Identities=20% Similarity=0.227 Sum_probs=34.6 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH--HHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCHHHHC Q ss_conf 51680489999999875678975988557898886--465303664322123222202448998752663498488822 Q gi|254780670|r 56 RYFGKGVLKAIAFVNDEIRTALLGCDARDQLLIDK--IMIDLDGTPNKSRLGANAILGVSLAVSKAAAQTSNLPLYKYL 132 (424) Q Consensus 56 ~~~G~gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~--~Li~lDgT~nks~lGaNailAvSlA~akA~A~~~~~pLy~yl 132 (424) +|.|.|..-.+.-- +.++..+.|+++.+-.+|+. ++-.|+|-| ...+-||..+....+.-|-.|. T Consensus 51 ~F~~~GCais~Asa-S~~~e~i~Gk~l~ea~~i~~~~i~~~l~~lp-----------p~r~~CA~La~~Al~~Al~~y~ 117 (121) T TIGR03419 51 KFKTFGCGAAIASS-SMATEMIKGKTLEEAWELTNKAVAEALDGLP-----------PVKMHCSVLAEEAIHKAINDYR 117 (121) T ss_pred EEEEECCHHHHHHH-HHHHHHHCCCCHHHHHHCCHHHHHHHHCCCC-----------CCCCHHHHHHHHHHHHHHHHHH T ss_conf 99965558899999-9999998499499998635489999975999-----------3116699999999999999998 No 280 >cd06333 PBP1_ABC-type_HAAT_like Type I periplasmic binding component of ABC (ATPase Binding Cassette)-type transport systems that are predicted to be involved in uptake of amino acids. This subgroup includes the type I periplasmic binding component of ABC (ATPase Binding Cassette)-type transport systems that are predicted to be involved in uptake of amino acids. Members of this subgroup are sequence-similar to members of the family of ABC-type hydrophobic amino acid transporters (HAAT), such as leucine-isoleucine-valine-binding protein (LIVBP); their ligand specificity has not been determined experimentally, however. Probab=21.37 E-value=53 Score=13.07 Aligned_cols=55 Identities=13% Similarity=-0.002 Sum_probs=22.6 Q ss_pred HHHHHHHHHHHHCC-EEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCEEECCHHHHHH Q ss_conf 99999998765448-69977676310423799886423883685043000021488852 Q gi|254780670|r 265 EMASYLANLVDQYP-IYSIEDGMSEDDWHGWKILTNKIGSNCQLVGDDLFVTNPERLHK 322 (424) Q Consensus 265 elid~y~~l~~~yP-I~sIEDP~~e~D~~gw~~l~~~~g~~~~IvGDDL~vTn~~rl~~ 322 (424) ++...+.++...=| .+.+--. ..|---..+-...+|-+.+++|.|- ..++..++. T Consensus 176 Dfs~~l~ki~~~~pD~v~~~~~--~~~~~~~~~q~~~~G~~~~~~~~~~-~~~~~~~~~ 231 (312) T cd06333 176 SVTAQLLKIRAARPDAVLIWGS--GTPAALPAKNLRERGYKGPIYQTHG-VASPDFLRL 231 (312) T ss_pred CHHHHHHHHHHCCCCEEEEECC--CCHHHHHHHHHHHCCCCCCEEECCC-CCCHHHHHH T ss_conf 7799999998569899999457--5169999999997699870881276-678799998 No 281 >PRK06427 phosphomethylpyrimidine kinase; Reviewed Probab=21.35 E-value=53 Score=13.07 Aligned_cols=18 Identities=22% Similarity=0.071 Sum_probs=9.7 Q ss_pred CCCHHHHHHHHHHHCCCH Q ss_conf 872358998999819980 Q gi|254780670|r 367 ETEDHTIADLAVATNCGQ 384 (424) Q Consensus 367 ETeD~~iaDLAVg~~a~~ 384 (424) -|-++|=+-+|..+.-++ T Consensus 212 GTGCtlsSAiAa~LA~G~ 229 (266) T PRK06427 212 GTGCTLSAAIAAELAKGA 229 (266) T ss_pred CCCHHHHHHHHHHHHCCC T ss_conf 850899999999998699 No 282 >pfam08963 DUF1878 Protein of unknown function (DUF1878). This domain is found in a set of hypothetical bacterial proteins. Probab=21.35 E-value=28 Score=15.14 Aligned_cols=39 Identities=26% Similarity=0.573 Sum_probs=29.3 Q ss_pred HHHHHHHHHH------HHCCEEE--EECCCCCCCHHHHHHHHHHCCC Q ss_conf 9999999876------5448699--7767631042379988642388 Q gi|254780670|r 265 EMASYLANLV------DQYPIYS--IEDGMSEDDWHGWKILTNKIGS 303 (424) Q Consensus 265 elid~y~~l~------~~yPI~s--IEDP~~e~D~~gw~~l~~~~g~ 303 (424) |..+||.+|. ++||..+ |+-.+.++|.+.+-.||.++.+ T Consensus 9 e~LEyy~~LLl~mid~~k~PFy~liI~k~lskeeve~~~~lCeeLs~ 55 (113) T pfam08963 9 EQLEYYISLLLKMVDMEKYPFYSLIIRKGLTKEEAEEILRLCEELSE 55 (113) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 99999999999974842388899999812669889999999999999 No 283 >cd02517 CMP-KDO-Synthetase CMP-KDO synthetase catalyzes the activation of KDO which is an essential component of the lipopolysaccharide. CMP-KDO Synthetase: 3-Deoxy-D-manno-octulosonate cytidylyltransferase (CMP-KDO synthetase) catalyzes the conversion of CTP and 3-deoxy-D-manno-octulosonate into CMP-3-deoxy-D-manno-octulosonate (CMP-KDO) and pyrophosphate. KDO is an essential component of the lipopolysaccharide found in the outer surface of gram-negative eubacteria. It is also a constituent of the capsular polysaccharides of some gram-negative eubacteria. Its presence in the cell wall polysaccharides of green algae and plant were also discovered. However, they have not been found in yeast and animals. The absence of the enzyme in mammalian cells makes it an attractive target molecule for drug design. Probab=21.23 E-value=53 Score=13.05 Aligned_cols=38 Identities=24% Similarity=0.288 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHH--HCCCEEEEECCCCCC-CHHHHHHHHH Q ss_conf 3899999999999--879869995078887-2358998999 Q gi|254780670|r 341 SVSETLNTIEKAQ--MAGYPSIISHRSGET-EDHTIADLAV 378 (424) Q Consensus 341 Tvset~ea~~~a~--~~g~~~ivShRSGET-eD~~iaDLAV 378 (424) |--|.+|-++.-| ++|+.+-+-.-..++ .=++-.||.- T Consensus 195 s~lE~~E~lEqLR~leng~~I~~~~~~~~~~~VDt~eDl~~ 235 (239) T cd02517 195 SPLEQIESLEQLRALENGYKIKVVETDHESIGVDTPEDLER 235 (239) T ss_pred CHHHHHHHHHHHHHHHCCCEEEEEEECCCCCCCCCHHHHHH T ss_conf 82577760899999983980689996999989889999999 No 284 >PRK03359 putative electron transfer flavoprotein FixA; Reviewed Probab=21.20 E-value=54 Score=13.05 Aligned_cols=28 Identities=21% Similarity=0.197 Sum_probs=13.8 Q ss_pred EEEEEEEECCCCCEEEEEEEECCCCEEEEECCC Q ss_conf 998688607978429999998489767663447 Q gi|254780670|r 6 IIAREVIDSRGSPTIEVDVCLEDGSTGRAMVPS 38 (424) Q Consensus 6 i~~r~IlDSRG~PTvEvev~~~~g~~gra~~PS 38 (424) |..+||-|.- ++.+...+|.+-|..++. T Consensus 5 VcvKqVPD~~-----~~~i~~~~g~l~~~~~~~ 32 (256) T PRK03359 5 TCYKCVPDEQ-----DIAVNNADGSLDFSKADA 32 (256) T ss_pred EEEEECCCCC-----CEEEECCCCEEEECCCCC T ss_conf 9988787864-----148707999398469863 No 285 >KOG0481 consensus Probab=21.12 E-value=54 Score=13.04 Aligned_cols=71 Identities=20% Similarity=0.325 Sum_probs=48.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCC-EEEEECCCCC-CC Q ss_conf 689999999999753853111023431104653202785111146865687899999998765448-6997767631-04 Q gi|254780670|r 213 ADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYP-IYSIEDGMSE-DD 290 (424) Q Consensus 213 ~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yP-I~sIEDP~~e-~D 290 (424) ..||++--.-+|.+||-+---+-.-++=+||+..| |+|+ -.+|++|-|||-.-+..+|. |+-+-|-.+| .| T Consensus 448 IHEAMEQQTISIAKAGITT~LNSRtSVLAAANpvf--GRyD-----d~Kt~~dNIDf~~TILSRFDmIFIVKD~h~~~~D 520 (729) T KOG0481 448 IHEAMEQQTISIAKAGITTTLNSRTSVLAAANPVF--GRYD-----DTKTGEDNIDFMPTILSRFDMIFIVKDEHDEERD 520 (729) T ss_pred HHHHHHHHHHHHHHHCCEEEECCHHHHHHHCCCCC--CCCC-----CCCCCCCCCCHHHHHHHHCCEEEEEECCCCCHHH T ss_conf 98998765577755052256424033445337866--5543-----4678555632234676451379998336760353 No 286 >cd07140 ALDH_F1L_FTFDH 10-formyltetrahydrofolate dehydrogenase, ALDH family 1L. 10-formyltetrahydrofolate dehydrogenase (FTHFDH, EC=1.5.1.6), also known as aldehyde dehydrogenase family 1 member L1 (ALDH1L1) in humans, is a multi-domain homotetramer with an N-terminal formyl transferase domain and a C-terminal ALDH domain. FTHFDH catalyzes an NADP+-dependent dehydrogenase reaction resulting in the conversion of 10-formyltetrahydrofolate to tetrahydrofolate and CO2. The ALDH domain is also capable of the oxidation of short chain aldehydes to their corresponding acids. Probab=21.05 E-value=54 Score=13.03 Aligned_cols=24 Identities=17% Similarity=0.282 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 789999999876544869977676310 Q gi|254780670|r 263 PNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 263 ~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) .+++++-+.+-+++ +-+-||++++ T Consensus 304 ~d~f~~~l~~~~~~---l~vG~p~d~~ 327 (486) T cd07140 304 HDEFVRRVVEEVKK---MKIGDPLDRS 327 (486) T ss_pred HHHHHHHHHHHHCC---CCCCCHHHCC T ss_conf 79999999987504---7646856536 No 287 >COG4301 Uncharacterized conserved protein [Function unknown] Probab=21.03 E-value=38 Score=14.19 Aligned_cols=76 Identities=16% Similarity=0.272 Sum_probs=40.5 Q ss_pred CHHHHHHHHHHHHHHCCCCCCCCHHHHHCHH--HHHH---CCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCC Q ss_conf 4689999999999753853111023431104--6532---0278511114686568789999999876544869977676 Q gi|254780670|r 212 TADSALDLIANSIEKAGYNAGKDLLIALDCA--ASAF---FKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGM 286 (424) Q Consensus 212 ~~eeaL~ll~eAi~~aGy~~g~di~i~lD~A--Asef---y~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~ 286 (424) ++++..-++. .+..+ ..||+-+-+|+|.- |.-+ |++-- -.|.+=-..++..+-..| =+| | T Consensus 168 tp~e~~~Fl~-~l~~a-~~pGd~~LlGvDl~k~Ae~Le~AYdDp~--------gVTa~FnlNvLa~lNr~f----~~n-F 232 (321) T COG4301 168 TPGECAVFLT-QLRGA-LRPGDYFLLGVDLRKPAERLEAAYDDPQ--------GVTAEFNLNVLAHLNRVF----GGN-F 232 (321) T ss_pred CHHHHHHHHH-HHHHC-CCCCCEEEEECCCCCHHHHHHHHHCCCC--------CHHHHHHHHHHHHHHHHH----CCC-C T ss_conf 8678999999-99842-8976669981212687888777415851--------338998788999989986----367-7 Q ss_pred CCCCHHHHHHHHHHCC Q ss_conf 3104237998864238 Q gi|254780670|r 287 SEDDWHGWKILTNKIG 302 (424) Q Consensus 287 ~e~D~~gw~~l~~~~g 302 (424) |++||+.|+.+.+..+ T Consensus 233 D~~dfeh~Avyne~~~ 248 (321) T COG4301 233 DVDDFEHVAVYNEDEG 248 (321) T ss_pred CCCHHHHHHHHHHHHH T ss_conf 7210255765510577 No 288 >cd07102 ALDH_EDX86601 Uncharacterized aldehyde dehydrogenase of Synechococcus sp. PCC 7335 (EDX86601). Uncharacterized aldehyde dehydrogenase of Synechococcus sp. PCC 7335 (locus EDX86601) and other similar sequences, are present in this CD. Probab=20.83 E-value=54 Score=13.00 Aligned_cols=125 Identities=17% Similarity=0.236 Sum_probs=53.2 Q ss_pred CCCCCCCEEECCCCCCCCCCCCCEEEEEEC-CCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 647543000003664556776610256514-8877489998999999999998752000013676557644886846899 Q gi|254780670|r 138 HILPVPLMNILNGGIHADNALDFQEFMIMP-VGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSA 216 (424) Q Consensus 138 ~~lP~P~~NiinGG~HA~~~l~iQEfmIiP-~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eea 216 (424) ..+|--.+|++.|+.-.+..+ +--| +...+|.-+.+.+-.|.+...+.++.-.+ --||-.|.+=...-- T Consensus 169 aglP~gv~~vv~g~~~~~~~l-----~~~p~v~~I~FTGS~~~G~~i~~~aa~~~~~v~l-----ELGGk~p~IV~~dAD 238 (452) T cd07102 169 AGLPEGVFQVLHLSHETSAAL-----IADPRIDHVSFTGSVAGGRAIQRAAAGRFIKVGL-----ELGGKDPAYVRPDAD 238 (452) T ss_pred HCCCCCEEEECCCCHHHHHHH-----HCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHH-----HCCCCCCEEECCCCC T ss_conf 589956687468887789987-----4499868799978717665544356787645565-----227876336689889 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 9999999975385311102343110465320278511114686568789999999876544869977676310 Q gi|254780670|r 217 LDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 217 L~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) |+...+++-...+...=+.+.+ .++.|--+ .--|++++.+.+.++++ .+-||++++ T Consensus 239 ld~A~~~~~~~~f~~aGQ~C~a-----------~~rv~V~~---~i~d~f~~~l~~~~~~~---~vG~p~d~~ 294 (452) T cd07102 239 LDAAAESLVDGAFFNSGQSCCS-----------IERIYVHE---SIYDAFVEAFVAVVKGY---KLGDPLDPS 294 (452) T ss_pred HHHHHHHHHHHHCCCCCCCCCC-----------CCEEEECH---HHHHHHHHHHHHHHHHC---CCCCHHHCC T ss_conf 9999988752330468998724-----------83589713---47999999999877644---668915316 No 289 >PRK10919 ATP-dependent DNA helicase Rep; Provisional Probab=20.83 E-value=54 Score=13.00 Aligned_cols=13 Identities=31% Similarity=0.529 Sum_probs=5.8 Q ss_pred ECCCCCCCCCCEE Q ss_conf 3447866477546 Q gi|254780670|r 35 MVPSGASTGIHEA 47 (424) Q Consensus 35 ~~PSGaStG~~Ea 47 (424) .|-+||=+|+... T Consensus 19 lVlAGaGSGKT~t 31 (672) T PRK10919 19 LVLAGAGSGKTRV 31 (672) T ss_pred EEEECCCHHHHHH T ss_conf 9997385587999 No 290 >PRK00915 2-isopropylmalate synthase; Validated Probab=20.74 E-value=55 Score=12.98 Aligned_cols=80 Identities=16% Similarity=0.215 Sum_probs=35.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCC Q ss_conf 74899989999999999987520000136765576448868468999999999975385311102343110465320278 Q gi|254780670|r 171 ENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSALDLIANSIEKAGYNAGKDLLIALDCAASAFFKKE 250 (424) Q Consensus 171 ~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eeaL~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~ 250 (424) .|.++.+++..+....+++ .+....++-|-++-.+ .+-+++++..|+ ++|-. -|.++ T Consensus 113 ~s~ee~l~~~~~~v~~ak~----~g~~V~f~~ED~srtd---~~~l~~~~~aa~-~aGa~---~i~l~------------ 169 (511) T PRK00915 113 MSREEVLEMAVEAVKYARS----YTDDVEFSAEDATRTD---LDFLCRVVEAAI-DAGAT---TINIP------------ 169 (511) T ss_pred CCHHHHHHHHHHHHHHHHH----CCCEEEEECCCCCCCC---HHHHHHHHHHHH-HCCCC---EEEEC------------ T ss_conf 9999999999999999998----0985999324057779---899999999987-64999---98645------------ Q ss_pred CEEECCCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 5111146865687899999998765448 Q gi|254780670|r 251 KYILKGENLELQPNEMASYLANLVDQYP 278 (424) Q Consensus 251 kY~~~~~~~~~t~~elid~y~~l~~~yP 278 (424) -..-..+|.++.++...+++.+| T Consensus 170 -----DTvG~~~P~~~~~~i~~l~~~~p 192 (511) T PRK00915 170 -----DTVGYTTPEEFGELIKTLRERVP 192 (511) T ss_pred -----CCCCCCCHHHHHHHHHHHHHHCC T ss_conf -----66677693899999999998648 No 291 >cd01538 PBP1_ABC_xylose_binding Periplasmic xylose-binding component of the ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Periplasmic xylose-binding component of the ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic xylose-binding protein is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=20.72 E-value=55 Score=12.98 Aligned_cols=74 Identities=9% Similarity=0.168 Sum_probs=33.0 Q ss_pred CCHHHHHHHHHHHHHHCC--EEEEECCCCCCCH---HHHHHHHHH-CCCCEEEECCCEEECCHHHHHHHHHCCCCCEE-E Q ss_conf 687899999998765448--6997767631042---379988642-38836850430000214888521003865223-4 Q gi|254780670|r 261 LQPNEMASYLANLVDQYP--IYSIEDGMSEDDW---HGWKILTNK-IGSNCQLVGDDLFVTNPERLHKGINEEVANAI-L 333 (424) Q Consensus 261 ~t~~elid~y~~l~~~yP--I~sIEDP~~e~D~---~gw~~l~~~-~g~~~~IvGDDL~vTn~~rl~~gi~~~~~Nai-l 333 (424) .++++=.....+|+.++| +..| |.-+|. .....+.+. +..++.|+|=|-.....+.++. +...+. . T Consensus 171 ~~~~~a~~~~~~~L~~~~~~i~aI---~a~nD~~A~Ga~~Al~~aG~~~~v~vvG~Dg~~~~l~~I~~----G~~~~tv~ 243 (288) T cd01538 171 WDPETAQKRMENALTANYNKVDGV---LAANDGTAGGAIAALKAAGLAGKPPVTGQDAELAAVQRIVE----GTQTMTVY 243 (288) T ss_pred CCHHHHHHHHHHHHHHCCCCCCEE---EECCCHHHHHHHHHHHHCCCCCCCEEEECCCCHHHHHHHHC----CCCEEEEE T ss_conf 577889999999998465557589---85585677999999997499998739835998999999973----98259995 Q ss_pred ECCCCHHH Q ss_conf 25453003 Q gi|254780670|r 334 IKPNQIGS 341 (424) Q Consensus 334 iK~NQiGT 341 (424) -.|-++|- T Consensus 244 Q~p~~~G~ 251 (288) T cd01538 244 KDIRELAE 251 (288) T ss_pred CCHHHHHH T ss_conf 59999999 No 292 >pfam05153 DUF706 Family of unknown function (DUF706). Family of uncharacterized eukaryotic function. Some members have a described putative function, but a common theme is not evident. Probab=20.58 E-value=55 Score=12.96 Aligned_cols=28 Identities=21% Similarity=0.439 Sum_probs=18.5 Q ss_pred CCEEECCC-CCCCCHHHHHHHHHHHHHHC Q ss_conf 85111146-86568789999999876544 Q gi|254780670|r 250 EKYILKGE-NLELQPNEMASYLANLVDQY 277 (424) Q Consensus 250 ~kY~~~~~-~~~~t~~elid~y~~l~~~y 277 (424) ++|++=.+ ....+.+++-.||.+|++|| T Consensus 213 NkyDLYSK~d~~pd~~~lkpYY~~LidKY 241 (248) T pfam05153 213 NPYDLYSKSDEPPDVEALKPYYQELIDKY 241 (248) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 77543157999878899999999999997 No 293 >cd06247 M14_CPO Peptidase M14 carboxypeptidase (CP) O (CPO, also known as metallocarboxypeptidase C; EC 3.4.17.) belongs to the carboxypeptidase A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPO has not been well characterized as yet, and little is known about it. Based on modeling studies, CPO has been suggested to have specificity for acidic residues rather than aliphatic/aromatic residues as in A-like enzymes or basic residues as in B-like enzymes. It remains to be demonstrated that CPO is functional as an MCP. Probab=20.21 E-value=56 Score=12.91 Aligned_cols=21 Identities=19% Similarity=0.411 Sum_probs=9.4 Q ss_pred EEEECCCCHH-HHHHHHHHHHH Q ss_conf 2342545300-38999999999 Q gi|254780670|r 331 AILIKPNQIG-SVSETLNTIEK 351 (424) Q Consensus 331 ailiK~NQiG-Tvset~ea~~~ 351 (424) +.++-++||- |..|+.+.++. T Consensus 272 GF~lP~~~I~p~~~E~~~gi~~ 293 (298) T cd06247 272 GFVLPEDQIQPTCEETMTAVMS 293 (298) T ss_pred CCCCCHHHHHHHHHHHHHHHHH T ss_conf 9648989946789999999999 No 294 >COG1139 Uncharacterized conserved protein containing a ferredoxin-like domain [Energy production and conversion] Probab=20.17 E-value=56 Score=12.90 Aligned_cols=19 Identities=21% Similarity=0.333 Sum_probs=8.1 Q ss_pred EEEEEEEECCCCCEEEEEE Q ss_conf 9986886079784299999 Q gi|254780670|r 6 IIAREVIDSRGSPTIEVDV 24 (424) Q Consensus 6 i~~r~IlDSRG~PTvEvev 24 (424) |.-++.|-++|+-.+|.|+ T Consensus 117 Igln~~Le~~G~ev~ETDL 135 (459) T COG1139 117 IGLNHYLEEKGIEVWETDL 135 (459) T ss_pred HHHHHHHHHCCCEEEECCH T ss_conf 5268999874984787258 No 295 >TIGR00109 hemH ferrochelatase; InterPro: IPR001015 Synonym(s): Protoheme ferro-lyase, Iron chelatase, etc. Ferrochelatase catalyzes the last step in heme biosynthesis: the chelation of a ferrous ion to proto-porphyrin IX, to form protoheme , . In eukaryotic cells, it binds to the mitochondrial inner membrane with its active site on the matrix side of the membrane. The X-ray structure of Bacillus subtilis and human ferrochelatase have been solved , . The human enzyme exists as a homodimer. Each subunit contains one [Fe2S2] cluster. The monomer is folded into two similar domains, each with a four-stranded parallel beta-sheet flanked by an alpha-helix in a beta-alpha-beta motif that is reminiscent of the fold found in the periplasmic binding proteins. The topological similarity between the domains suggests that they have arisen from a gene duplication event. However, significant differences exist between the two domains, including an N-terminal section (residues 80-130) that forms part of the active site pocket, and a C-terminal extension (residues 390-423) that is involved in coordination of the [Fe2S2]cluster and in stabilisation of the homodimer. The [Fe2S2] cluster ligands are Cys196, Cys403, Cys406 and Cys411. The experiments with Co(II) binding show that His230 and Asp383 are part of the enzyme active site . Ferrochelatase seems to have a structurally conserved core region that is common to the enzyme from bacteria, plants and mammals. Porphyrin binds in the identified cleft; this cleft also includes the metal-binding site of the enzyme. It is likely that the structure of the cleft region will have different conformations upon substrate binding and release .; GO: 0004325 ferrochelatase activity, 0006783 heme biosynthetic process. Probab=20.14 E-value=56 Score=12.90 Aligned_cols=60 Identities=15% Similarity=0.254 Sum_probs=26.2 Q ss_pred CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHC-CC-CCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 10256514887748999899999999999875200-00-136765576448868468999999999975 Q gi|254780670|r 160 FQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSK-GY-STNVGDEGGFSPNLKTADSALDLIANSIEK 226 (424) Q Consensus 160 iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~-g~-~t~vgDEGGfap~~~~~eeaL~ll~eAi~~ 226 (424) +++..++|.- +-|+ ....-.+++.+.+.+++. ++ .. .=-|.+..-+++.-+..+.+.|.+ T Consensus 126 v~~~V~l~lY-PhfS--~sTtgS~f~~l~~~~~~~~slR~~----~~~~I~~w~~~~~y~~a~~d~i~~ 187 (339) T TIGR00109 126 VERLVVLPLY-PHFS--SSTTGSSFNELAEALKKLRSLRSL----KISVIESWYDNEKYIKALADSIKE 187 (339) T ss_pred CCEEEEEECC-CCCC--CHHHHHHHHHHHHHHHHCCCCCCC----CEEEEECCCCCHHHHHHHHHHHHH T ss_conf 6488885148-7300--012788999999999834896576----314751445688799999999999 No 296 >TIGR03374 ABALDH 1-pyrroline dehydrogenase. Members of this protein family are 1-pyrroline dehydrogenase (1.5.1.35), also called gamma-aminobutyraldehyde dehydrogenase. This enzyme can follow putrescine transaminase (EC 2.6.1.82) for a two-step conversion of putrescine to gamma-aminobutyric acid (GABA). The member from Escherichia coli is characterized as a homotetramer that binds one NADH per momomer. This enzyme belongs to the medium-chain aldehyde dehydrogenases, and is quite similar in sequence to the betaine aldehyde dehydrogenase (EC 1.2.1.8) family. Probab=20.10 E-value=56 Score=12.89 Aligned_cols=124 Identities=19% Similarity=0.255 Sum_probs=58.5 Q ss_pred CCCCCCEEECCC-CCCCCCCCC-CEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 475430000036-645567766-102565148877489998999999999998752000013676557644886846899 Q gi|254780670|r 139 ILPVPLMNILNG-GIHADNALD-FQEFMIMPVGAENIREAIRMGAEVFHTLKKELKSKGYSTNVGDEGGFSPNLKTADSA 216 (424) Q Consensus 139 ~lP~P~~NiinG-G~HA~~~l~-iQEfmIiP~g~~sf~eal~~~~ev~~~lk~~L~~~g~~t~vgDEGGfap~~~~~eea 216 (424) .||--.+|++.| |.-.+..|- =++ +...+|.-+.+.+-.|.++..+.++.- +-.-||-.|-+=..+.- T Consensus 190 alP~Gv~nvv~G~g~~vg~~l~~~~~-----v~~v~FTGS~~~G~~i~~~aa~~~~~~-----~lELGGk~p~iV~~dAd 259 (472) T TIGR03374 190 IFPAGVVNILFGRGKTVGDPLTGHEK-----VRMVSLTGSIATGEHILSHTAPSIKRT-----HMELGGKAPVIVFDDAD 259 (472) T ss_pred HCCCCCEEECCCCHHHHHHHHHHCCC-----CCEEEEECCHHHHHHHHHHHHHCCCEE-----EEECCCCCCCCCCCCCC T ss_conf 08956256556882899999983989-----276998795299999999876047734-----77468987554588868 Q ss_pred HHHHHHHHHHCCCCCCCCHHHHHCHHHHHHCCCCCEEECCCCCCCCHHHHHHHHHHHHHHCCEEEEECCCCCC Q ss_conf 9999999975385311102343110465320278511114686568789999999876544869977676310 Q gi|254780670|r 217 LDLIANSIEKAGYNAGKDLLIALDCAASAFFKKEKYILKGENLELQPNEMASYLANLVDQYPIYSIEDGMSED 289 (424) Q Consensus 217 L~ll~eAi~~aGy~~g~di~i~lD~AAsefy~~~kY~~~~~~~~~t~~elid~y~~l~~~yPI~sIEDP~~e~ 289 (424) |+...+++-...+...-+.+.+. ++.|--+ .--|++++-+...++++ .+-||++++ T Consensus 260 l~~a~~~~~~~~f~naGQ~C~a~-----------~rv~V~~---~i~d~f~~~l~~~~~~l---~vG~p~~~~ 315 (472) T TIGR03374 260 IDAVVEGVRTFGFYNAGQDCTAA-----------CRIYAQR---GIYDTLVEKLGAAVATL---KSGAPDDES 315 (472) T ss_pred HHHHHHHHHHCCCCCCCCCCCCC-----------EEEEEEC---CHHHHHHHHHHHHHHHC---CCCCCCCCC T ss_conf 87874666631467889989888-----------6999964---07899999999766207---536854457 No 297 >cd06905 Peptidase_M14-like_8 A functionally uncharacterized subgroup of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Two major subfamilies of the M14 family, defined based on sequence and structural homology, are the A/B and N/E subfamilies. Enzymes belonging to the A/B subfamily are normally synthesized as inactive precursors containing preceding signal peptide, followed by an N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The A/B enzymes can be further divided based on their substrate specificity; Carboxypeptidase A-like (CPA-like) enzymes favor hydrophobic residues while carboxypeptidase B-like (CPB-like) enzymes only cleave the basic residues lysine or arginine. The A forms have slightly differe Probab=20.09 E-value=56 Score=12.89 Aligned_cols=37 Identities=22% Similarity=0.353 Sum_probs=17.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH------HHCCCCC Q ss_conf 48999999987567897598855789888646------5303664 Q gi|254780670|r 61 GVLKAIAFVNDEIRTALLGCDARDQLLIDKIM------IDLDGTP 99 (424) Q Consensus 61 gV~~Av~~in~~i~~~L~g~~~~dQ~~iD~~L------i~lDgT~ 99 (424) |...++-.++..+.. -|.++.-++-+|+.- ..-||-+ T Consensus 71 s~a~~ly~i~~Ll~~--Y~~d~~it~lLd~~d~yivP~~NPDGyE 113 (360) T cd06905 71 GSAVALYVIQTLLNG--YGSDPEVTRLLDGYTFYILPRLNPDGAE 113 (360) T ss_pred HHHHHHHHHHHHHHH--CCCHHHHHHHHHCCEEEEEEEECCCCHH T ss_conf 999999999999986--3872999999756749999402684065 No 298 >PRK10916 ADP-heptose:LPS heptosyltransferase II; Provisional Probab=20.08 E-value=56 Score=12.89 Aligned_cols=110 Identities=16% Similarity=0.190 Sum_probs=55.4 Q ss_pred HHCCEEEEECCCCC----CCH--HHHHHHHHHC---CCCEEEECCCEEECCHHHHHHHHHCCCCCEEEECCCCHHHHHHH Q ss_conf 54486997767631----042--3799886423---88368504300002148885210038652234254530038999 Q gi|254780670|r 275 DQYPIYSIEDGMSE----DDW--HGWKILTNKI---GSNCQLVGDDLFVTNPERLHKGINEEVANAILIKPNQIGSVSET 345 (424) Q Consensus 275 ~~yPI~sIEDP~~e----~D~--~gw~~l~~~~---g~~~~IvGDDL~vTn~~rl~~gi~~~~~NailiK~NQiGTvset 345 (424) .+.|++.| -|... -.| +.|.++...+ |.++.++|.--=.--.+.+..++..... .-++.+---.++.|+ T Consensus 178 ~~~~~i~i-~pGa~~~~~KrWp~e~fa~la~~L~~~g~~vvl~G~~~e~~~~~~i~~~l~~~~~-~~~~nl~GktsL~el 255 (348) T PRK10916 178 SERPIIGF-CPGAEFGPAKRWPHYHYAELAKQLIDEGYQVVLFGSAKDHEAGNEILAALNTEQQ-AWCRNLAGETQLEQA 255 (348) T ss_pred CCCCEEEE-ECCCCCCCCCCCCHHHHHHHHHHHHHCCCEEEEEECHHHHHHHHHHHHHHHHHCC-CCEECCCCCCCHHHH T ss_conf 79977998-1687666567798899999999999689979998172369999999985103315-651416788999999 Q ss_pred HHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHCCCHHEECCCCHHHHHH Q ss_conf 9999999987986999507888723589989998199801007561144899 Q gi|254780670|r 346 LNTIEKAQMAGYPSIISHRSGETEDHTIADLAVATNCGQIKTGSLARSDRIA 397 (424) Q Consensus 346 ~ea~~~a~~~g~~~ivShRSGETeD~~iaDLAVg~~a~~iK~G~~~R~ER~a 397 (424) ...++.|+ .+||. |+-..|||.++|.+.|=.=+|..-.++. T Consensus 256 ~ali~~a~-----l~I~n------DSGpmHlAaA~g~P~valFGpT~P~~~~ 296 (348) T PRK10916 256 VILIAACK-----AIVTN------DSGLMHVAAALNRPLVALYGPSSPDFTP 296 (348) T ss_pred HHHHHHCC-----EEEEC------CCHHHHHHHHCCCCEEEEECCCCCCCCC T ss_conf 99998559-----87844------8828999998099889998999933148 Done!