Query gi|254780558|ref|YP_003064971.1| hypothetical protein CLIBASIA_02225 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 396 No_of_seqs 176 out of 400 Neff 6.1 Searched_HMMs 39220 Date Sun May 29 22:38:17 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780558.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG2861 Uncharacterized protei 100.0 0 0 527.5 25.0 239 150-391 2-246 (250) 2 pfam04748 Polysacc_deac_2 Dive 100.0 0 0 509.4 26.6 213 175-390 1-213 (213) 3 TIGR02764 spore_ybaN_pdaB poly 96.1 0.099 2.5E-06 30.6 9.6 172 172-389 7-198 (198) 4 TIGR03212 uraD_N-term-dom puta 95.8 0.085 2.2E-06 31.0 8.1 186 187-390 82-289 (297) 5 PRK02134 hypothetical protein; 94.5 0.38 9.8E-06 26.9 12.5 204 172-387 4-252 (253) 6 TIGR03473 HpnK hopanoid biosyn 93.8 0.54 1.4E-05 26.0 17.5 209 172-389 3-282 (283) 7 pfam04794 YdjC YdjC-like prote 93.7 0.55 1.4E-05 25.9 15.6 181 172-361 2-225 (261) 8 COG5478 Predicted small integr 93.5 0.18 4.6E-06 29.0 5.1 61 326-386 63-127 (141) 9 PRK05406 LamB/YcsF family prot 89.4 1.7 4.2E-05 22.9 15.3 193 178-380 8-246 (246) 10 TIGR03471 HpnJ hopanoid biosyn 85.6 2.8 7.1E-05 21.5 12.6 128 246-378 225-368 (472) 11 TIGR02237 recomb_radB DNA repa 85.6 2.8 7.1E-05 21.5 6.8 113 212-358 32-155 (223) 12 COG0726 CDA1 Predicted xylanas 84.9 3 7.6E-05 21.3 13.5 178 171-391 65-259 (267) 13 TIGR00416 sms DNA repair prote 84.1 1.9 4.9E-05 22.5 4.4 81 268-365 166-253 (481) 14 PRK05434 phosphoglyceromutase; 83.4 3.5 8.8E-05 20.9 13.3 156 171-353 6-177 (511) 15 PRK06846 deaminase; Validated 78.2 5.2 0.00013 19.7 8.2 123 193-334 186-316 (410) 16 PRK07625 consensus 77.1 0.56 1.4E-05 25.8 -0.3 27 280-306 329-355 (922) 17 COG2390 DeoR Transcriptional r 75.6 6.1 0.00016 19.3 8.5 82 283-364 124-222 (321) 18 PRK13209 L-xylulose 5-phosphat 75.3 6.2 0.00016 19.2 14.7 157 203-384 17-186 (283) 19 pfam03740 PdxJ Pyridoxal phosp 75.2 6.3 0.00016 19.2 6.5 116 257-377 77-210 (239) 20 TIGR01418 PEP_synth phosphoeno 75.1 6.3 0.00016 19.2 4.8 123 251-383 701-858 (877) 21 pfam03796 DnaB_C DnaB-like hel 74.3 6.6 0.00017 19.1 9.0 119 216-360 44-179 (186) 22 pfam02092 tRNA_synt_2f Glycyl- 74.1 5.5 0.00014 19.6 4.1 176 164-356 40-236 (549) 23 cd06808 PLPDE_III Type III Pyr 73.1 7.1 0.00018 18.9 7.4 89 205-302 89-187 (211) 24 cd01125 repA Hexameric Replica 73.0 7.1 0.00018 18.9 10.7 124 213-362 21-160 (239) 25 PRK04004 translation initiatio 71.3 7.8 0.0002 18.6 5.2 33 344-376 419-451 (592) 26 PRK13886 conjugal transfer pro 70.5 8.1 0.00021 18.5 5.8 50 212-276 105-158 (241) 27 PRK08786 consensus 66.2 1.3 3.3E-05 23.6 -0.5 26 281-306 334-359 (927) 28 cd00984 DnaB_C DnaB helicase C 66.2 9.9 0.00025 18.0 11.0 121 215-360 37-172 (242) 29 cd02072 Glm_B12_BD B12 binding 65.8 10 0.00026 17.9 7.0 64 209-297 16-79 (128) 30 PRK07997 consensus 65.5 1.2 3.2E-05 23.6 -0.7 26 280-305 334-359 (928) 31 cd00003 PNPsynthase Pyridoxine 64.7 11 0.00027 17.8 6.0 98 275-377 102-207 (234) 32 COG1540 Uncharacterized protei 64.3 8.3 0.00021 18.5 3.3 189 188-386 22-252 (252) 33 TIGR01307 pgm_bpd_ind 2,3-bisp 63.9 11 0.00028 17.7 11.3 181 172-375 3-210 (529) 34 PRK07583 cytosine deaminase; V 63.0 11 0.00029 17.6 6.2 130 200-348 201-356 (437) 35 PRK07556 consensus 62.8 1.3 3.4E-05 23.5 -1.0 25 281-305 373-397 (977) 36 cd02071 MM_CoA_mut_B12_BD meth 62.3 12 0.0003 17.5 7.3 76 207-307 14-92 (122) 37 pfam03746 LamB_YcsF LamB/YcsF 61.6 12 0.00031 17.4 14.9 188 179-376 7-240 (242) 38 TIGR02873 spore_ylxY probable 59.3 12 0.00031 17.4 3.4 177 171-389 86-268 (269) 39 PRK11545 gntK gluconate kinase 55.1 15 0.00039 16.8 5.3 85 278-374 87-173 (177) 40 pfam06415 iPGM_N BPG-independe 54.8 16 0.0004 16.7 4.6 41 312-352 53-94 (223) 41 PRK09825 idnK D-gluconate kina 54.6 16 0.0004 16.7 5.5 84 282-377 86-171 (176) 42 TIGR01011 rpsB_bact ribosomal 54.3 11 0.00029 17.6 2.5 35 296-334 159-193 (227) 43 PRK07898 consensus 53.9 0.97 2.5E-05 24.4 -2.9 16 286-301 328-343 (902) 44 cd01122 GP4d_helicase GP4d_hel 53.4 16 0.00042 16.6 14.3 147 169-362 28-192 (271) 45 PRK05929 consensus 53.3 5.2 0.00013 19.7 0.7 19 279-297 295-313 (870) 46 PRK13479 2-aminoethylphosphona 52.7 17 0.00043 16.5 11.5 41 253-297 246-286 (368) 47 PRK10927 essential cell divisi 52.6 17 0.00042 16.6 3.2 21 29-49 37-57 (319) 48 TIGR03006 pepcterm_polyde poly 50.9 18 0.00046 16.3 3.8 102 207-327 60-163 (274) 49 TIGR02066 dsrB sulfite reducta 50.4 16 0.0004 16.7 2.8 32 278-310 74-105 (366) 50 pfam06506 PrpR_N Propionate ca 50.2 18 0.00047 16.3 8.3 118 195-351 51-168 (169) 51 pfam03979 Sigma70_r1_1 Sigma-7 50.2 17 0.00042 16.6 2.9 47 338-384 6-57 (79) 52 pfam01171 ATP_bind_3 PP-loop f 49.9 19 0.00048 16.2 10.2 90 281-370 11-115 (182) 53 PRK02261 methylaspartate mutas 49.6 19 0.00048 16.2 7.1 64 210-298 21-84 (137) 54 KOG0023 consensus 49.1 19 0.00049 16.2 7.5 151 170-388 182-334 (360) 55 PRK08928 consensus 48.9 5 0.00013 19.8 0.1 21 281-301 310-330 (861) 56 PRK09230 cytosine deaminase; P 48.8 19 0.00049 16.1 10.1 29 272-300 245-273 (426) 57 pfam04120 Iron_permease Low af 46.8 21 0.00053 15.9 5.1 48 331-378 65-113 (133) 58 KOG2703 consensus 46.4 16 0.00041 16.6 2.3 36 272-307 347-382 (460) 59 PRK04182 cytidylate kinase; Pr 46.3 16 0.00042 16.6 2.4 22 285-306 63-84 (178) 60 pfam01522 Polysacc_deac_1 Poly 45.6 22 0.00055 15.8 6.4 86 172-270 7-95 (123) 61 KOG4513 consensus 45.1 22 0.00055 15.8 2.8 81 174-263 23-104 (531) 62 PRK07572 cytosine deaminase; V 44.9 22 0.00057 15.7 10.1 78 208-301 191-269 (425) 63 pfam02353 CMAS Cyclopropane-fa 44.7 22 0.00057 15.7 5.5 170 174-378 65-236 (273) 64 COG4352 RPL13 Ribosomal protei 43.8 23 0.00059 15.6 3.4 37 345-381 73-111 (113) 65 PRK05692 hydroxymethylglutaryl 43.7 23 0.00059 15.6 12.5 75 209-295 122-196 (287) 66 TIGR02173 cyt_kin_arch cytidyl 43.7 19 0.00049 16.2 2.4 58 303-360 10-81 (173) 67 COG0542 clpA ATP-binding subun 43.0 24 0.00061 15.6 3.6 133 242-387 551-724 (786) 68 cd01992 PP-ATPase N-terminal d 42.9 24 0.00061 15.6 9.8 72 298-369 33-114 (185) 69 COG2243 CobF Precorrin-2 methy 42.6 24 0.00062 15.5 9.7 164 182-368 13-185 (234) 70 COG0052 RpsB Ribosomal protein 42.2 24 0.00062 15.5 4.6 148 171-334 34-192 (252) 71 PRK01233 glyS glycyl-tRNA synt 42.1 25 0.00063 15.5 5.0 175 165-356 44-239 (687) 72 COG1674 FtsK DNA segregation A 41.5 25 0.00064 15.4 4.9 88 272-363 582-685 (858) 73 pfam02645 DegV Uncharacterized 40.9 26 0.00066 15.4 4.2 123 244-389 64-195 (211) 74 cd02067 B12-binding B12 bindin 40.4 26 0.00067 15.3 10.0 90 210-325 17-109 (119) 75 pfam02633 Creatininase Creatin 40.4 26 0.00067 15.3 7.6 65 224-295 56-126 (235) 76 PRK08238 hypothetical protein; 40.4 26 0.00067 15.3 8.5 99 184-306 55-153 (481) 77 TIGR01457 HAD-SF-IIA-hyp2 HAD- 40.3 23 0.00058 15.7 2.4 53 285-337 22-78 (251) 78 TIGR03239 GarL 2-dehydro-3-deo 40.3 26 0.00067 15.3 4.6 85 279-387 145-231 (249) 79 PRK06886 hypothetical protein; 40.3 26 0.00067 15.3 7.1 58 325-384 179-241 (328) 80 COG1082 IolE Sugar phosphate i 40.1 26 0.00067 15.3 13.3 87 212-303 89-179 (274) 81 pfam11814 DUF3335 Domain of un 40.1 26 0.00067 15.3 5.4 56 326-387 100-158 (228) 82 PRK08434 consensus 40.0 3.4 8.6E-05 20.9 -2.0 28 279-306 302-329 (887) 83 TIGR00593 pola DNA polymerase 39.9 6.3 0.00016 19.2 -0.6 18 280-297 468-485 (1005) 84 cd06543 GH18_PF-ChiA-like PF-C 39.9 27 0.00068 15.3 4.6 98 194-301 140-258 (294) 85 TIGR03015 pepcterm_ATPase puta 39.7 27 0.00068 15.2 9.8 95 238-343 16-113 (269) 86 cd03768 SR_ResInv Serine Recom 39.6 27 0.00069 15.2 3.0 17 370-386 75-91 (126) 87 pfam04404 ERF ERF superfamily. 38.8 28 0.00071 15.2 3.8 48 255-302 3-52 (158) 88 COG1239 ChlI Mg-chelatase subu 37.5 29 0.00074 15.0 4.9 16 179-194 46-61 (423) 89 COG0391 Uncharacterized conser 37.2 29 0.00075 15.0 10.4 141 190-345 157-309 (323) 90 TIGR00624 tag DNA-3-methyladen 37.1 29 0.00074 15.0 2.5 32 283-319 143-174 (185) 91 PRK05755 DNA polymerase I; Pro 36.9 2 5E-05 22.4 -3.6 28 279-306 303-330 (889) 92 TIGR00055 uppS undecaprenyl di 36.9 23 0.00059 15.6 1.9 143 173-380 5-156 (237) 93 cd01120 RecA-like_NTPases RecA 36.3 30 0.00077 14.9 7.5 59 289-363 80-139 (165) 94 COG3265 GntK Gluconate kinase 36.2 30 0.00078 14.9 5.9 41 333-373 117-158 (161) 95 TIGR00707 argD acetylornithine 34.9 22 0.00055 15.8 1.5 146 173-387 220-374 (402) 96 PRK01045 ispH 4-hydroxy-3-meth 34.3 33 0.00083 14.7 3.9 184 175-377 69-284 (304) 97 PRK05797 consensus 33.9 6.1 0.00016 19.3 -1.4 95 277-385 302-401 (869) 98 KOG0087 consensus 33.6 25 0.00064 15.4 1.7 13 208-220 104-116 (222) 99 COG2183 Tex Transcriptional ac 33.6 33 0.00085 14.6 6.0 36 240-275 471-506 (780) 100 PRK13359 beta-ketoadipyl CoA t 33.4 34 0.00086 14.6 5.5 172 203-385 182-385 (400) 101 TIGR00086 smpB SsrA-binding pr 33.2 23 0.0006 15.6 1.5 24 367-390 84-107 (145) 102 pfam00455 DeoR Bacterial regul 33.1 30 0.00077 14.9 2.0 117 173-304 19-136 (157) 103 PRK08835 consensus 32.9 3.5 9E-05 20.8 -2.8 25 281-305 337-361 (931) 104 PRK06361 hypothetical protein; 32.7 35 0.00088 14.5 9.9 41 345-386 133-173 (216) 105 PRK05769 4-aminobutyrate amino 32.7 35 0.00088 14.5 5.5 12 209-220 245-256 (442) 106 PRK07300 consensus 32.2 5.7 0.00014 19.5 -1.8 13 341-353 754-766 (880) 107 cd00338 Ser_Recombinase Serine 31.4 36 0.00093 14.4 3.6 20 276-295 75-94 (137) 108 PRK11234 nfrB bacteriophage N4 31.3 32 0.00082 14.8 1.9 11 223-233 188-198 (646) 109 PRK10660 tilS tRNA(Ile)-lysidi 31.3 36 0.00093 14.4 11.5 66 243-309 111-185 (433) 110 COG1780 NrdI Protein involved 30.9 37 0.00094 14.4 3.2 84 267-351 43-136 (141) 111 pfam01936 DUF88 Protein of unk 30.8 37 0.00095 14.3 2.5 105 172-298 1-115 (140) 112 cd06305 PBP1_methylthioribose_ 30.6 37 0.00095 14.3 7.4 26 200-225 59-85 (273) 113 PRK13210 putative L-xylulose 5 30.3 38 0.00097 14.3 16.6 156 203-383 12-180 (284) 114 cd01306 PhnM PhnM is believed 30.2 38 0.00097 14.3 8.3 102 251-360 95-225 (325) 115 TIGR00542 hxl6Piso_put hexulos 29.7 39 0.00099 14.2 4.9 138 209-384 18-181 (290) 116 cd06307 PBP1_uncharacterized_s 29.5 39 0.001 14.2 8.3 158 199-370 61-230 (275) 117 TIGR00131 gal_kin galactokinas 29.2 34 0.00087 14.6 1.8 56 260-325 175-233 (500) 118 TIGR01428 HAD_type_II haloacid 29.0 34 0.00088 14.6 1.8 98 210-318 32-137 (207) 119 PRK07850 acetyl-CoA acetyltran 28.4 41 0.001 14.1 7.1 176 205-384 168-371 (387) 120 COG0749 PolA DNA polymerase I 28.3 19 0.00048 16.2 0.3 138 203-359 338-485 (593) 121 TIGR02884 spore_pdaA delta-lac 27.8 42 0.0011 14.0 3.8 170 179-389 44-225 (225) 122 PRK12569 hypothetical protein; 27.7 42 0.0011 14.0 14.4 185 178-376 9-243 (245) 123 PRK05093 argD bifunctional N-s 27.7 42 0.0011 14.0 3.9 13 275-287 280-292 (403) 124 pfam09989 DUF2229 CoA enzyme a 27.6 42 0.0011 14.0 3.2 53 335-387 162-217 (218) 125 cd03320 OSBS o-Succinylbenzoat 27.6 42 0.0011 14.0 13.3 110 249-365 109-240 (263) 126 pfam10686 DUF2493 Protein of u 27.5 42 0.0011 14.0 5.1 50 274-323 9-62 (71) 127 PRK08555 consensus 27.3 43 0.0011 14.0 2.2 15 207-221 243-257 (443) 128 pfam02401 LYTB LytB protein. T 26.9 43 0.0011 13.9 4.3 145 175-323 67-239 (280) 129 PRK13413 mpi multiple promoter 26.7 44 0.0011 13.9 3.6 124 257-391 52-184 (200) 130 PRK07323 consensus 26.6 36 0.00093 14.4 1.5 12 374-385 400-411 (443) 131 cd01121 Sms Sms (bacterial rad 26.3 44 0.0011 13.9 5.8 88 249-363 121-212 (372) 132 pfam11963 DUF3477 Protein of u 26.2 45 0.0011 13.8 2.2 19 194-214 168-186 (355) 133 TIGR00099 Cof-subfamily Cof-li 26.2 45 0.0011 13.8 4.0 71 207-296 20-104 (270) 134 TIGR00537 hemK_rel_arch methyl 26.2 45 0.0011 13.8 5.6 54 325-382 110-164 (183) 135 pfam01261 AP_endonuc_2 Xylose 26.1 45 0.0011 13.8 8.2 75 310-384 69-153 (201) 136 PRK00349 uvrA excinuclease ABC 25.9 45 0.0012 13.8 6.9 107 283-390 457-585 (944) 137 TIGR01090 apt adenine phosphor 25.8 44 0.0011 13.9 1.8 22 172-194 115-136 (175) 138 PRK09248 putative hydrolase; V 25.8 45 0.0012 13.8 10.3 113 248-374 109-234 (246) 139 COG1964 Predicted Fe-S oxidore 25.8 46 0.0012 13.8 3.0 48 210-272 189-237 (475) 140 PRK12616 pyridoxal kinase; Rev 25.2 47 0.0012 13.7 4.2 57 186-271 122-178 (270) 141 PRK13635 cbiO cobalt transport 25.2 47 0.0012 13.7 8.8 133 172-307 34-171 (279) 142 PRK06904 replicative DNA helic 25.1 47 0.0012 13.7 10.3 108 245-359 257-381 (472) 143 PRK10128 putative aldolase; Pr 24.9 47 0.0012 13.7 6.5 85 279-387 135-221 (250) 144 PRK12479 branched-chain amino 24.9 3.3 8.4E-05 21.0 -4.1 79 309-387 216-296 (299) 145 COG2230 Cfa Cyclopropane fatty 24.8 38 0.00098 14.3 1.4 167 173-378 74-242 (283) 146 pfam03971 IDH Monomeric isocit 24.8 48 0.0012 13.7 3.8 175 206-384 229-514 (735) 147 smart00633 Glyco_10 Glycosyl h 24.6 48 0.0012 13.7 11.8 44 338-381 135-184 (254) 148 TIGR00263 trpB tryptophan synt 24.2 49 0.0012 13.6 2.2 104 264-386 293-398 (412) 149 PRK06209 glutamate-1-semialdeh 24.1 49 0.0012 13.6 2.1 18 204-221 201-218 (431) 150 KOG0256 consensus 24.0 49 0.0013 13.6 7.0 112 211-341 215-329 (471) 151 PRK05595 replicative DNA helic 23.8 50 0.0013 13.6 8.1 119 215-358 225-357 (444) 152 cd06310 PBP1_ABC_sugar_binding 23.8 50 0.0013 13.6 6.1 26 200-225 61-87 (273) 153 TIGR00017 cmk cytidylate kinas 23.6 50 0.0013 13.5 3.1 17 320-336 141-157 (223) 154 TIGR00382 clpX ATP-dependent C 23.4 48 0.0012 13.7 1.6 98 249-346 91-210 (452) 155 PRK08760 replicative DNA helic 23.4 51 0.0013 13.5 8.3 107 245-358 265-385 (476) 156 PRK13640 cbiO cobalt transport 23.3 51 0.0013 13.5 8.9 125 172-306 35-174 (283) 157 PRK07369 dihydroorotase; Provi 23.1 51 0.0013 13.5 8.8 107 265-382 148-255 (419) 158 cd03315 MLE_like Muconate lact 23.0 51 0.0013 13.5 13.3 136 210-365 90-244 (265) 159 cd06316 PBP1_ABC_sugar_binding 22.9 52 0.0013 13.5 7.9 23 249-271 169-191 (294) 160 PRK08455 fliL flagellar basal 22.9 52 0.0013 13.4 4.7 12 36-47 32-43 (180) 161 COG0011 Uncharacterized conser 22.8 52 0.0013 13.4 6.9 79 250-347 19-98 (100) 162 TIGR01754 flav_RNR ribonucleot 22.8 52 0.0013 13.4 4.0 97 199-299 4-115 (145) 163 cd06451 AGAT_like Alanine-glyo 22.6 52 0.0013 13.4 9.2 106 252-376 240-352 (356) 164 COG0646 MetH Methionine syntha 22.5 52 0.0013 13.4 6.3 165 175-373 136-310 (311) 165 pfam03269 DUF268 Caenorhabditi 22.3 48 0.0012 13.7 1.4 73 243-315 43-124 (177) 166 COG0084 TatD Mg-dependent DNas 22.2 53 0.0014 13.4 10.8 71 203-294 13-85 (256) 167 smart00052 EAL Putative diguan 22.2 53 0.0014 13.4 10.2 144 213-374 75-226 (241) 168 COG0134 TrpC Indole-3-glycerol 22.1 54 0.0014 13.3 9.9 78 208-309 144-221 (254) 169 pfam07423 DUF1510 Protein of u 22.0 54 0.0014 13.3 2.4 22 28-49 18-39 (214) 170 pfam00682 HMGL-like HMGL-like. 21.8 54 0.0014 13.3 14.6 47 337-383 106-154 (237) 171 pfam07300 consensus 21.7 54 0.0014 13.3 3.8 50 326-375 57-110 (111) 172 COG1954 GlpP Glycerol-3-phosph 21.7 55 0.0014 13.3 8.7 93 188-331 16-108 (181) 173 PRK05265 pyridoxine 5'-phospha 21.7 55 0.0014 13.3 9.0 98 275-377 105-210 (240) 174 PRK10906 DNA-binding transcrip 21.5 52 0.0013 13.4 1.5 150 172-342 92-243 (252) 175 TIGR02009 PGMB-YQAB-SF beta-ph 21.5 55 0.0014 13.3 3.0 64 329-395 101-170 (211) 176 PRK08082 consensus 21.5 55 0.0014 13.3 7.0 122 216-359 228-361 (453) 177 PRK02551 hypothetical protein; 21.4 55 0.0014 13.3 3.8 49 298-346 98-151 (154) 178 cd07110 ALDH_F10_BADH Arabidop 21.3 56 0.0014 13.2 2.0 35 342-377 377-411 (456) 179 cd01394 radB RadB. The archaea 21.3 56 0.0014 13.2 8.0 53 294-363 103-160 (218) 180 PRK05636 replicative DNA helic 21.3 56 0.0014 13.2 8.0 105 246-359 304-424 (507) 181 TIGR03345 VI_ClpV1 type VI sec 21.2 56 0.0014 13.2 2.2 20 173-192 210-229 (852) 182 PRK10681 DNA-binding transcrip 21.0 56 0.0014 13.2 2.1 149 173-341 94-243 (252) 183 TIGR00089 TIGR00089 RNA modifi 20.8 57 0.0015 13.2 8.0 159 173-351 205-381 (455) 184 TIGR00830 PTBA PTS system, glu 20.7 42 0.0011 14.0 0.9 18 215-232 60-77 (129) 185 COG5016 Pyruvate/oxaloacetate 20.5 58 0.0015 13.1 6.8 106 241-352 300-426 (472) 186 COG1307 DegV Uncharacterized p 20.5 58 0.0015 13.1 10.2 112 244-381 135-253 (282) 187 KOG1389 consensus 20.4 58 0.0015 13.1 2.9 171 206-380 204-400 (435) 188 pfam01553 Acyltransferase Acyl 20.2 59 0.0015 13.1 6.1 81 266-357 15-96 (132) 189 PRK09061 D-glutamate deacylase 20.2 59 0.0015 13.1 11.7 36 262-297 226-261 (496) 190 PRK03600 nrdI ribonucleotide r 20.1 59 0.0015 13.1 3.3 36 310-345 94-129 (136) 191 PRK06918 4-aminobutyrate amino 20.1 59 0.0015 13.1 2.3 23 275-297 316-341 (454) 192 TIGR02902 spore_lonB ATP-depen 20.0 11 0.00028 17.6 -2.2 17 207-223 285-301 (532) No 1 >COG2861 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=0 Score=527.53 Aligned_cols=239 Identities=31% Similarity=0.541 Sum_probs=228.9 Q ss_pred CCCCCCCCCCCCC------CCCCCCCCCEEEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCE Q ss_conf 5646743455332------1123466865899973778771688999983793508987089876899999999859929 Q gi|254780558|r 150 VGREDTEVPAMDK------NFCSNASGARIAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEA 223 (396) Q Consensus 150 ~g~~P~~~~~~~~------~~~~~a~~~riAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~Ev 223 (396) .+.++++..++.+ .+|..++++|+||||||||+++.+++ +|.+||.+||+||+|+++++++|++.||++|||| T Consensus 2 ~~~l~~i~~~~~~~~~~~~~~~~~~~~~klaIVIddlG~~~~~~~-~i~~Lp~eVTlA~~P~~~~~~~~~~~A~~~G~ev 80 (250) T COG2861 2 QGFLKVIGADGRRPFLQYARPWSGARGPKLAIVIDDLGLSQTGTQ-AILALPPEVTLAFAPFAPHAREWAQKARNAGHEV 80 (250) T ss_pred CCCCCCHHHCCCCHHHHHCCCCCCCCCCEEEEEECCCCCCHHHHH-HHHHCCCCCEEEECCCCCHHHHHHHHHHHCCCEE T ss_conf 975221101110479883688776778638999777653577777-9982896634886678831689999998569879 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECC Q ss_conf 99817665689877878554467899899999999999855893598001230000498999999999850692899749 Q gi|254780558|r 224 ILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDG 303 (396) Q Consensus 224 llhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~ 303 (396) ||||||||++|+.+|+| +|..+|+.+||.++|+|+++++|+|+|+||||||+||+|+.+|+++|++|++||||||||+ T Consensus 81 lih~PmeP~~~~~~e~g--tL~~~~s~~e~~~rl~~a~~~v~~~~GlnNhmGs~~tsn~~aM~~~m~~Lk~r~l~flDs~ 158 (250) T COG2861 81 LIHMPMEPFSYPKIEPG--TLRPGMSAEEILRRLRKAMNKVPDAVGLNNHMGSRFTSNEDAMEKLMEALKERGLYFLDSG 158 (250) T ss_pred EEECCCCCCCCCCCCCC--CCCCCCCHHHHHHHHHHHHHHCCCCEEEHHHHHHHHCCCHHHHHHHHHHHHHCCEEEECCC T ss_conf 99656775568988988--7466888899999999998638651300123323301769999999999987796897266 Q ss_pred CCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 76322678989870994698458862899999999999999999987194899980887889999999756887691899 Q gi|254780558|r 304 SSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSVV 383 (396) Q Consensus 304 Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~lV 383 (396) |+.+|++.++|++.|||+..||||||+++++++|.+||++++++||++|++|+|||+|++|+.+|++|+.+|.++||+|| T Consensus 159 T~a~S~a~~iAk~~gVp~~~rdvfLD~e~~~~~V~kql~~~~~~Ark~G~ai~IGh~~~~Tv~vl~~~~~~l~~~gIelV 238 (250) T COG2861 159 TIANSLAGKIAKEIGVPVIKRDVFLDDEDTEAAVLKQLDAAEKLARKNGSAIGIGHPHKNTVAVLQQWLDELPARGIELV 238 (250) T ss_pred CCCCCHHHHHHHHCCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHHHHHHCCCCCEEEE T ss_conf 52020206667633875234013304867899999999999999986694378548866399999998875787874884 Q ss_pred EHHHHHHC Q ss_conf 67774306 Q gi|254780558|r 384 PLSCLAKL 391 (396) Q Consensus 384 pvS~l~~~ 391 (396) |+|+|+.. T Consensus 239 ~~s~L~~~ 246 (250) T COG2861 239 PVSALLGE 246 (250) T ss_pred CHHHHHCC T ss_conf 27876147 No 2 >pfam04748 Polysacc_deac_2 Divergent polysaccharide deacetylase. This family is divergently related to pfam01522 (personal obs:Yeats C). Probab=100.00 E-value=0 Score=509.40 Aligned_cols=213 Identities=29% Similarity=0.494 Sum_probs=208.2 Q ss_pred EEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHH Q ss_conf 99737787716889999837935089870898768999999998599299981766568987787855446789989999 Q gi|254780558|r 175 IVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLL 254 (396) Q Consensus 175 IVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~ 254 (396) |||||+|++ ..+.+++.+||+||||||+||.+++++|++.||++|||||||+||||++| .|+||++|+++|++++|+ T Consensus 1 IVIDD~G~~-~~~~~~l~~Lp~piT~Ai~P~~~~~~~~a~~a~~~g~EvllhlPMe~~~~--~~~gp~~L~~~~~~~~i~ 77 (213) T pfam04748 1 IVIDDLGYN-LATDEALLALPAPVTLAILPYAPHAAELAEAARAAGHEVLLHLPMEPLGY--KDPGPGTLTVGMSAEEIE 77 (213) T ss_pred CEEECCCCC-HHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHCCCEEEEECCCCCCCC--CCCCCCCCCCCCCHHHHH T ss_conf 979678999-78999998599875899879998669999999987994999745663468--999977588899999999 Q ss_pred HHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCH Q ss_conf 99999998558935980012300004989999999998506928997497632267898987099469845886289999 Q gi|254780558|r 255 NRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDR 334 (396) Q Consensus 255 ~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~ 334 (396) ++|+|+|+++|+|+||||||||+||+|+..|++||++|++||||||||+|+++|+++++|++.|+|+++||+||||+++. T Consensus 78 ~~l~~~l~~~p~avGvnNhmGS~~t~~~~~m~~vl~~l~~~gl~fvDS~Tt~~S~a~~~A~~~gvp~~~rdvfLD~~~~~ 157 (213) T pfam04748 78 KRLEAALSRVPYAVGVNNHMGSRFTADRAAMRWVMEELKKRGLFFVDSRTSGRSVAPKLAKELGVPAAKRDVFLDNEQTE 157 (213) T ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHCCCCEEEEEEECCCCCCH T ss_conf 99999998788848995466755416999999999999877988991477766589999998399867610314799999 Q ss_pred HHHHHHHHHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEHHHHHH Q ss_conf 99999999999999871948999808878899999997568876918996777430 Q gi|254780558|r 335 DKIREKLKGLEEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSVVPLSCLAK 390 (396) Q Consensus 335 ~~I~~qL~~l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~lVpvS~l~~ 390 (396) ++|++||++++++|+++|+||+|||+||+|+++|++|+++|+++||+|||+|+|++ T Consensus 158 ~~I~~ql~~~~~~A~~~G~aI~Igh~~p~Tl~~L~~~~~~l~~~gi~lVpvS~l~~ 213 (213) T pfam04748 158 AAIRRQLDQAAALARKEGSAIAIGHPHPETIAALKEWLPELRARGIELVPVSALLR 213 (213) T ss_pred HHHHHHHHHHHHHHHHCCCEEEEECCCHHHHHHHHHHHHHHHHCCEEEEEHHHHHC T ss_conf 99999999999999863957999779989999999986577658879996658439 No 3 >TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB; InterPro: IPR014132 This entry describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reduced sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterised polysaccharide deacetylase; the exact function this protein family is unknown.. Probab=96.07 E-value=0.099 Score=30.60 Aligned_cols=172 Identities=13% Similarity=0.085 Sum_probs=113.7 Q ss_pred EEEEEECCCCCCCHHHHHH---HHHCCCC-CEEEEE-CCCCCHHHHHHH---HHHCCCCEEEECCCCCCCCCCCCCCCCC Q ss_conf 5899973778771688999---9837935-089870-898768999999---9985992999817665689877878554 Q gi|254780558|r 172 RIAIVVSGLGISQTGTQRA---INLLPAN-ITLAFA-SNGNSLDRWMKE---AKKKGQEAILQIPMQAFDESYNEDDSYT 243 (396) Q Consensus 172 riAIVIddlG~~~~~~~~a---i~~LP~~-vT~A~~-P~~~~~~~~~~~---Ar~~G~EvllhlPMEp~~~~~~~pGp~~ 243 (396) .|||=.|+ +-+...+..- +.+--.+ -||=++ +.+..-.+.+++ ..+.||||=.| +|...| T Consensus 7 ~iAlTFDi-sWG~~~~~~IL~~L~~~~vk~ATFFlsG~Wae~hPelvk~lveI~~~GHEIGsH------gY~h~~----- 74 (198) T TIGR02764 7 KIALTFDI-SWGNQYTEPILDTLKEEDVKNATFFLSGSWAERHPELVKELVEIVKDGHEIGSH------GYRHKN----- 74 (198) T ss_pred CEEEEECC-CCCCCCCCHHHHHHHHCCCCCCCCEEEHHHHHCCHHHHHHHHHHHHCCCCCCCC------CCCCCC----- T ss_conf 17884158-987443006766676558853330243157612778999999998448430335------421242----- Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEECC----CCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCC Q ss_conf 4678998999999999998558935980012----300004989999999998506928997497632267898987099 Q gi|254780558|r 244 LKVTQTVQQLLNRLRYSLRRGTGYFGVMNYR----GAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNL 319 (396) Q Consensus 244 L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhm----Gs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gv 319 (396) .++++.|++.+-|..+-.-+-..+|.+--. .+.|.. .+++.+++.|+-.| T Consensus 75 -~~~l~~E~ikkd~~~a~~~i~~~~g~~p~LlRpP~G~fn~------~~~~~ae~~GY~~V------------------- 128 (198) T TIGR02764 75 -YTTLEDEKIKKDLLRAQEIIEKLTGKKPTLLRPPNGAFNK------AVLKLAESLGYTVV------------------- 128 (198) T ss_pred -CCCCCHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCH------HHHHHHHHCCCCEE------------------- T ss_conf -1258764899999998888865105550165688986046------89999997399189------------------- Q ss_pred CEEEEE--EEE-C-CCCCHHHHHHHHHHHHHHHHHCCEEEEEEE-C---CHHHHHHHHHHHHHHHHCCCEEEEHHHHH Q ss_conf 469845--886-2-899999999999999999987194899980-8---87889999999756887691899677743 Q gi|254780558|r 320 PYMVAD--LYL-D-DQVDRDKIREKLKGLEEIARTTGQAIGVAV-A---FDESIEVISQWLQQEHVRDVSVVPLSCLA 389 (396) Q Consensus 320 p~~~~d--vfL-D-~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh-~---~p~Ti~~L~~w~~~l~~~gi~lVpvS~l~ 389 (396) .=+ |.= | -.+..+.|-++.-+ .-+.| .|..=| + .+.|..+|.+.+++|+++|+++|.||+|+ T Consensus 129 ---~WsWHvdS~DWkNPG~e~iv~~V~~----~~~~G-dIiL~HDASd~~KqT~~ALp~Ii~~LK~~GY~fv~iseLi 198 (198) T TIGR02764 129 ---HWSWHVDSLDWKNPGVESIVDRVVK----NTKPG-DIILLHDASDSAKQTVKALPEIIKKLKEKGYEFVTISELI 198 (198) T ss_pred ---EEECCCCCCCCCCCCEEHHHHHHHC----CCCCC-CEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEHHCCC T ss_conf ---8624558835457861000111331----27998-5698763487974417789999899875495343422019 No 4 >TIGR03212 uraD_N-term-dom putative urate catabolism protein. This model represents a protein that is predominantly found just upstream of the UraD protein (OHCU decarboxylase) and in a number of instances as a N-terminal fusion with it. UraD itself catalyzes the last step in the catabolism of urate to allantoate. The function of this protein is presently unknown. It shows homology with the pfam01522 polysaccharide deacetylase domain family. Probab=95.76 E-value=0.085 Score=31.02 Aligned_cols=186 Identities=14% Similarity=0.108 Sum_probs=104.0 Q ss_pred HHHHHHHCCCCCEEEEECCC-CCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 89999837935089870898-76899999999859929998176656898778785544678998999999999998558 Q gi|254780558|r 187 TQRAINLLPANITLAFASNG-NSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGT 265 (396) Q Consensus 187 ~~~ai~~LP~~vT~A~~P~~-~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p 265 (396) -.+.+.++-.+.||.+--.. ....+.++...++||||..|= |-..+ ..+|++++-++.+..+++.+. T Consensus 82 lL~ll~~~gi~aT~fv~g~~ae~~P~~~~~i~~~GhEi~~HG------y~~~~------~~~l~~~~E~~~i~~~~~~l~ 149 (297) T TIGR03212 82 LLRLFTERGIPVTVFGVAMALARNPEAVAAMKEAGWEIASHG------LRWID------YQDMDEAQEREHIAEAIRLHT 149 (297) T ss_pred HHHHHHHCCCCCEEEHHHHHHHHCHHHHHHHHHCCCCHHHHH------HHCCC------CCCCCHHHHHHHHHHHHHHHH T ss_conf 999999759983685229999889999999997597377543------30337------666998999999999999999 Q ss_pred CCEEEEE--CCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHH----HHCCCEEE--EEE-EECC--CCC- Q ss_conf 9359800--1230000498999999999850692899749763226789898----70994698--458-8628--999- Q gi|254780558|r 266 GYFGVMN--YRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAP----KLNLPYMV--ADL-YLDD--QVD- 333 (396) Q Consensus 266 ~~vGvnN--hmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~----~~gvp~~~--~dv-fLD~--~~~- 333 (396) .++|.-- +.-++++.+ +.+-+ +++.|+.|.-|... .-.-+-+.. ..-||+.- +|. |.-. -.+ T Consensus 150 ~~tG~rP~Gw~s~~~s~~--T~~Ll---~~~~G~~Y~sd~~~-DD~Py~~~~~~~~~l~iPy~~e~nD~~~~~~~~~~~~ 223 (297) T TIGR03212 150 EVTGERPLGWYTGRTSPN--TRRLV---AEEGGFLYDSDSYA-DDLPYWDEVAGRPQLIVPYTLDANDMRFATPQGFNTG 223 (297) T ss_pred HHCCCCCCCCCCCCCCCC--HHHHH---HHHCCCEEECCCCC-CCCCEEEECCCCCEEEEEEECCCCCCHHHCCCCCCCH T ss_conf 865999881178999963--36899---99589689898888-8787668638984665540031573001045798987 Q ss_pred ---HHHHHHHHHHHHHHHHHCCEEEEEE-EC----CHHHHHHHHHHHHHHHHCC-CEEEEHHHHHH Q ss_conf ---9999999999999998719489998-08----8788999999975688769-18996777430 Q gi|254780558|r 334 ---RDKIREKLKGLEEIARTTGQAIGVA-VA----FDESIEVISQWLQQEHVRD-VSVVPLSCLAK 390 (396) Q Consensus 334 ---~~~I~~qL~~l~~~Ar~~G~AI~ig-h~----~p~Ti~~L~~w~~~l~~~g-i~lVpvS~l~~ 390 (396) .+.++.+|+.+-+-+...+....|+ || +|.-+..|.+++..+.+++ |-+..-+++.+ T Consensus 224 ~~~~~~~~d~Fd~l~~Eg~~~~r~~~i~lHP~i~G~P~Ri~~Le~~l~~i~~~~~VW~at~~eIA~ 289 (297) T TIGR03212 224 EQFFTYLRDAFDVLYAEGEGAPKMMSIGLHCRLVGRPGRIAALQRFLDYVQSHDKVWVARRIDIAR 289 (297) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHCCCCEEEECHHHHHH T ss_conf 999999999999999960899637988428664688899999999999997189989857899999 No 5 >PRK02134 hypothetical protein; Provisional Probab=94.53 E-value=0.38 Score=26.89 Aligned_cols=204 Identities=10% Similarity=0.046 Sum_probs=118.3 Q ss_pred EEEEEECCCCCCCHHHHHHHHHC---CCCCEEEEECCCCCHHHHHHHHHHC-CCCEEEECCCCCCCCCCC------C-CC Q ss_conf 58999737787716889999837---9350898708987689999999985-992999817665689877------8-78 Q gi|254780558|r 172 RIAIVVSGLGISQTGTQRAINLL---PANITLAFASNGNSLDRWMKEAKKK-GQEAILQIPMQAFDESYN------E-DD 240 (396) Q Consensus 172 riAIVIddlG~~~~~~~~ai~~L---P~~vT~A~~P~~~~~~~~~~~Ar~~-G~EvllhlPMEp~~~~~~------~-pG 240 (396) +|-|--||+|++..... +|.++ ..-=+.+++..+|...+.++.+++. +..|=||+=.=. +.|-. | -| T Consensus 4 ~LIinADDfG~s~~vn~-gI~~a~~~GivtstslM~n~p~~~~A~~l~~~~p~l~vGlHl~Lt~-g~Pl~~~psLv~~~G 81 (253) T PRK02134 4 LLIVNADDFGLSKGQNY-GIIEAHRNGVVTSTTAMVNMPAIDHAVQLSREYPSLGVGMHFVLTA-GKPLTAMPSLTDRDG 81 (253) T ss_pred EEEEECCCCCCCHHHHH-HHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHCCCCCEEEEEEECC-CCCCCCCCCCCCCCC T ss_conf 69997876789877889-9999998798378664258904999999997689997678776068-986655887678899 Q ss_pred C-------CCCCCCCCHHHHHHHHHHHHHHCCCC-----EEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCC Q ss_conf 5-------54467899899999999999855893-----59800123000049899999999985069289974976322 Q gi|254780558|r 241 S-------YTLKVTQTVQQLLNRLRYSLRRGTGY-----FGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRN 308 (396) Q Consensus 241 p-------~~L~~~~~~~~~~~~l~~~l~~~p~~-----vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~S 308 (396) - ....-..+.+|++.-++.-++++-.. .-+..|+..-.. +...+.+++..+++|+-+--++.... T Consensus 82 ~f~~~~~~~~~~~~~~~~ev~~El~AQi~~f~~~~G~~p~hiD~H~Hvh~~--P~v~~~~~~la~ey~lP~R~~~~~~~- 158 (253) T PRK02134 82 LLGKWIWQRAEEGTLPLEEIAQELAAQYERFLALFGRKPTHLDSHHHVHMF--PQIFPIVARFAAEYGVPLRIDRQEVS- 158 (253) T ss_pred CCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCEECCCCEEECC--HHHHHHHHHHHHHHCCCEECCCCCCC- T ss_conf 774679999871699999999999999999999629997513110000048--87999999999982998412542100- Q ss_pred HHHHHHHHHCCCEE--EEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEEEECCH--HHH------------------H Q ss_conf 67898987099469--8458862899999999999999999987194899980887--889------------------9 Q gi|254780558|r 309 LTRVLAPKLNLPYM--VADLYLDDQVDRDKIREKLKGLEEIARTTGQAIGVAVAFD--ESI------------------E 366 (396) Q Consensus 309 va~~~A~~~gvp~~--~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~p--~Ti------------------~ 366 (396) .......++++. ..+-|.+...+.+.+.+.|+++.. ...|..-.++||-- +++ . T Consensus 159 --~~~~~~~~~~~~~~f~~~~~~~~~~~~~~~~~l~~~~~--~~~~~~eim~HPg~~d~~l~~ss~~~~R~~El~~L~sp 234 (253) T PRK02134 159 --NAGDLPANLRSSQGFSSEFYGEEVSEDLFLQLLDRSKH--RGEQSLEVMCHPAFIDNTLRQSAYCYPRLTELEVLTSA 234 (253) T ss_pred --CCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHH--CCCCEEEEEEECCCCCHHHHHCCCCCCCHHHHHHHCCH T ss_conf --01111125873000240001344688999999999873--58880799961586786663078776639999997499 Q ss_pred HHHHHHHHHHHCCCEEEEHHH Q ss_conf 999997568876918996777 Q gi|254780558|r 367 VISQWLQQEHVRDVSVVPLSC 387 (396) Q Consensus 367 ~L~~w~~~l~~~gi~lVpvS~ 387 (396) .+++|+ +++||+|+--+. T Consensus 235 ~~k~~l---~~~~I~Li~y~d 252 (253) T PRK02134 235 SLKEAI---AERGYRLGSYLD 252 (253) T ss_pred HHHHHH---HHCCCEEECCCC T ss_conf 999999---987988953477 No 6 >TIGR03473 HpnK hopanoid biosynthesis associated protein HpnK. The sequences represented by this model are members of the pfam04794 "YdjC-like" family of uncharacterized proteins. The member of this clade from Acidithiobacillus ferrooxidans ATCC 23270 (AFE_0976) is found in the same locus as squalene-hopene cyclase (SHC, TIGR01507) and other genes associated with the biosynthesis of hopanoid natural products. Similarly, in Ralstonia eutropha JMP134 (Reut_B4902) this gene is adjacent to HpnAB, IspH and HpnH (TIGR03470), although SHC itself is elsewhere in the genome. Notably, this gene (here named HpnK) and three others form a conserved set (HpnIJKL) which occur in a subset of all genomes containing the SHC enzyme. This relationship was discerned using the method of partial phylogenetic profiling. This group includes Zymomonas mobilis, the organism where the initial hopanoid biosynthesis locus was described consisting of the genes HpnA-E and SHC (HpnF). Continuing past SHC are found a ph Probab=93.80 E-value=0.54 Score=25.97 Aligned_cols=209 Identities=11% Similarity=0.039 Sum_probs=123.0 Q ss_pred EEEEEECCCCCCCHHHHHHHHHC---CCCCEEEEECCCCCHHHHHHHHHHC-CCCEEEECCCCCCCCCCCCCC------- Q ss_conf 58999737787716889999837---9350898708987689999999985-992999817665689877878------- Q gi|254780558|r 172 RIAIVVSGLGISQTGTQRAINLL---PANITLAFASNGNSLDRWMKEAKKK-GQEAILQIPMQAFDESYNEDD------- 240 (396) Q Consensus 172 riAIVIddlG~~~~~~~~ai~~L---P~~vT~A~~P~~~~~~~~~~~Ar~~-G~EvllhlPMEp~~~~~~~pG------- 240 (396) +|-|--||+|++.... ++|.++ ..--+-+++-.+|...+.++.++.. +..|=||+=+=. +.|-..+. T Consensus 3 ~LIinADDfGl~~~vn-~gI~~~~~~GivtstS~Mvn~p~~~~A~~l~k~~p~l~vGlHltLt~-g~P~~~~~~ipsLvd 80 (283) T TIGR03473 3 RLIVTADDFGLSLEVN-EAVEKAHRDGILTAASLMVGAPAAEDAVERARRLPGLGVGLHLVLVD-GRPVLPPDQIPDLVD 80 (283) T ss_pred EEEEEECCCCCCHHHH-HHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHCCCCCEEEEEEECC-CCCCCCHHHCCCCCC T ss_conf 7999735678887789-99999998798467665368914999999997489998798787058-988999444876779 Q ss_pred -CCC------------CCCCCCHHHHHHHHHHHHHHCCC----CEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEE-- Q ss_conf -554------------46789989999999999985589----35980012300004989999999998506928997-- Q gi|254780558|r 241 -SYT------------LKVTQTVQQLLNRLRYSLRRGTG----YFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFD-- 301 (396) Q Consensus 241 -p~~------------L~~~~~~~~~~~~l~~~l~~~p~----~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flD-- 301 (396) .+. +......+|+++-++.-++++-. -.-+..||. ....+...+.+++..+++|+=.+= T Consensus 81 ~~G~f~~~~~~~~~~~~~~~~~~~ei~~E~~AQie~f~~~G~~ptHiD~H~h--vh~~P~v~~~~~~la~ey~ip~~r~p 158 (283) T TIGR03473 81 GQGRFGDDMVRDGVRYFFLPAVRAQLAREIRAQFEAFAATGLPLDHVNAHKH--FHLHPTVLSLILEIGREYGLRAVRLP 158 (283) T ss_pred CCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC--CCCCHHHHHHHHHHHHHHCCCEEECC T ss_conf 8997055289988887503235999999999999999980999753043320--12482699999999998299836435 Q ss_pred ----------CCCCCCCH-----------HHHHHHHHCCCEEEEEE--EECCCCCHHHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf ----------49763226-----------78989870994698458--86289999999999999999998719489998 Q gi|254780558|r 302 ----------DGSSPRNL-----------TRVLAPKLNLPYMVADL--YLDDQVDRDKIREKLKGLEEIARTTGQAIGVA 358 (396) Q Consensus 302 ----------s~Ts~~Sv-----------a~~~A~~~gvp~~~~dv--fLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~ig 358 (396) ......+. ....+.+.|+++.-... .-+...+.+.+.+.|+.+ ..|....++ T Consensus 159 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gi~~~d~~~g~~~~~~~~~~~l~~~l~~l-----~~g~~eim~ 233 (283) T TIGR03473 159 REPRALLTLEPVRRLGTLWDAFLFPWLGRMRARLDRAGIAHNDYVFGLADTGHMDEARLLAALERL-----PEGVSEIYF 233 (283) T ss_pred CCCCCHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCCCCCCCCCHHHHHHHHHHC-----CCCCEEEEE T ss_conf 665310013641123046666677778888999997499857101241343546999999999737-----998779997 Q ss_pred ECC-H-HHH--------HHHH--------HHHHHHHHCCCEEEEHHHHH Q ss_conf 088-7-889--------9999--------99756887691899677743 Q gi|254780558|r 359 VAF-D-ESI--------EVIS--------QWLQQEHVRDVSVVPLSCLA 389 (396) Q Consensus 359 h~~-p-~Ti--------~~L~--------~w~~~l~~~gi~lVpvS~l~ 389 (396) ||- . +.+ .+.. +...-++++||+|+--.+|+ T Consensus 234 HPg~~d~~l~~~~~~~~~R~~El~~l~s~~~~~~l~~~~I~Li~y~dL~ 282 (283) T TIGR03473 234 HPATAQDAEITPSMPGYRHADELAALLSPRVKAALKELGITLGGFADLV 282 (283) T ss_pred CCCCCCHHHHCCCCCCCCHHHHHHHHCCHHHHHHHHHCCCEEECHHHHC T ss_conf 8999971665169875255999999759999999998799995388851 No 7 >pfam04794 YdjC YdjC-like protein. Family of YdjC-like proteins. This region is possibly involved in the the cleavage of cellobiose-phosphate. Probab=93.72 E-value=0.55 Score=25.88 Aligned_cols=181 Identities=12% Similarity=-0.009 Sum_probs=107.9 Q ss_pred EEEEEECCCCCCCHHHHHHHHHC---CCCCEEEEECCCCCHHHHHHHHHHC-CCCEEEECCCCCCCCCCCCC-------- Q ss_conf 58999737787716889999837---9350898708987689999999985-99299981766568987787-------- Q gi|254780558|r 172 RIAIVVSGLGISQTGTQRAINLL---PANITLAFASNGNSLDRWMKEAKKK-GQEAILQIPMQAFDESYNED-------- 239 (396) Q Consensus 172 riAIVIddlG~~~~~~~~ai~~L---P~~vT~A~~P~~~~~~~~~~~Ar~~-G~EvllhlPMEp~~~~~~~p-------- 239 (396) +|-|-.||+|++.... ++|.++ ..-=+-+++..++...+.++.++.. +-.|=||+=+=.. .|-.++ T Consensus 2 ~Lii~ADD~G~s~~vn-~gI~~~~~~G~ltstS~M~n~p~~~~a~~~~k~~p~l~vGlHlnLt~g-~p~~~~~~~psLvd 79 (261) T pfam04794 2 KLIVNADDFGLSPGVN-RGIIEAHRAGIVTSTSLMVNMPAFEHAAALAKRNPDLGVGLHLTLTAG-KPLLPPTEVPSLVD 79 (261) T ss_pred EEEEECCCCCCCHHHH-HHHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHCCCCCEEEEEEECCC-CCCCCHHHCCCCCC T ss_conf 6999677688987788-999999987981678871078159999999972899987888773689-88898454897618 Q ss_pred --C---------CCCCCCCCCHHHHHHHHHHHHHHCCC----CEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCC Q ss_conf --8---------55446789989999999999985589----35980012300004989999999998506928997497 Q gi|254780558|r 240 --D---------SYTLKVTQTVQQLLNRLRYSLRRGTG----YFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGS 304 (396) Q Consensus 240 --G---------p~~L~~~~~~~~~~~~l~~~l~~~p~----~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~T 304 (396) | -..+.-..+.++++.-++.-+.++-. -.-+-.||+.- ..+...+.+++..+++|+-++-... T Consensus 80 ~~G~F~~~~~~~~~~~~~~~~~~ev~~Ei~AQi~~f~~~G~~p~hiDgH~hvh--~~P~i~~~~~~la~~~~lp~iR~~~ 157 (261) T pfam04794 80 ENGYFLRLSVLLRRALLRRLDRDEVERELRAQIERFLAFGLPPDHIDSHHHVH--LLPQVRPAVLELAREYGLPLVRLPR 157 (261) T ss_pred CCCCCCCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCCCEECCCCCEE--CCHHHHHHHHHHHHHHCCCEEECCC T ss_conf 89961752999999871499999999999999999998399964661655200--2878999999999994998772466 Q ss_pred CCC--------------CHHHHHHHHHCCCEEEEEEEEC--CCCCHHHHHHHHHHHHHHHHHCCEEEEEEECC Q ss_conf 632--------------2678989870994698458862--89999999999999999998719489998088 Q gi|254780558|r 305 SPR--------------NLTRVLAPKLNLPYMVADLYLD--DQVDRDKIREKLKGLEEIARTTGQAIGVAVAF 361 (396) Q Consensus 305 s~~--------------Sva~~~A~~~gvp~~~~dvfLD--~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~ 361 (396) ... -.......+.|++......-+. .....+.+.+.|.++ +.|....++||- T Consensus 158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gi~~~~~f~g~~~~~~~~~~~~~~~l~~l-----~~g~~eim~HPg 225 (261) T pfam04794 158 EPPAGAPKLALLVRSLAAAFRARFYRAGISTNDGFAGLYSSGADFEALLLALLARL-----PEGTTEIMCHPG 225 (261) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHC-----CCCCEEEEECCC T ss_conf 53322221013567678999999998699868543554566300799999999848-----999689997899 No 8 >COG5478 Predicted small integral membrane protein [Function unknown] Probab=93.45 E-value=0.18 Score=28.96 Aligned_cols=61 Identities=20% Similarity=0.285 Sum_probs=51.4 Q ss_pred EEECC--CCCHHHHHHHHHHHHH-HHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCE-EEEHH Q ss_conf 88628--9999999999999999-99871948999808878899999997568876918-99677 Q gi|254780558|r 326 LYLDD--QVDRDKIREKLKGLEE-IARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVS-VVPLS 386 (396) Q Consensus 326 vfLD~--~~~~~~I~~qL~~l~~-~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~-lVpvS 386 (396) +||-| ..|..+|+.+|+++.+ .-+.+--.|+|-|..+++++.+.+|+.+.-.+|.. .+++- T Consensus 63 ~liQNtq~rD~aai~aKLDElir~~~e~~n~~VgIEh~~~~~i~~~~~~~e~~a~~~~g~~~~~~ 127 (141) T COG5478 63 VLIQNTQNRDTAAIQAKLDELIRSLREARNDVVGIEHLKPEEIEEIRDRLEDEAGTGDGVPTSVA 127 (141) T ss_pred HHEECCCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCCCHHHHHHHHHHHHHHHCCCCCCCCCHH T ss_conf 62225442428999999999999712366762101038888999999999998267988433099 No 9 >PRK05406 LamB/YcsF family protein; Provisional Probab=89.40 E-value=1.7 Score=22.87 Aligned_cols=193 Identities=16% Similarity=0.110 Sum_probs=119.4 Q ss_pred CCCCCCC----HHHHHHHHHC--CCCCEEEEECCCC-CHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 3778771----6889999837--9350898708987-6899999999859929998176656898778785544678998 Q gi|254780558|r 178 SGLGISQ----TGTQRAINLL--PANITLAFASNGN-SLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTV 250 (396) Q Consensus 178 ddlG~~~----~~~~~ai~~L--P~~vT~A~~P~~~-~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~ 250 (396) -|||.+. .+.+++|+.+ -..|.+.|--..+ .-++.++.|.+.|-+|=-|- +||+.+ |=+=-...+++ T Consensus 8 ~DlGE~~g~~~~g~D~~lmp~IssaNIACG~HaGd~~~m~~tv~lA~~~~V~iGAHP-----sypD~~-gFGRr~~~~s~ 81 (246) T PRK05406 8 CDLGESFGAWKMGDDEALLPLVTSANIACGFHAGDPAVMRRTVRLAKENGVAIGAHP-----GYPDLE-GFGRRNMDLSP 81 (246) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEEECCC-----CCCCCC-CCCCCCCCCCH T ss_conf 226888888776785999857525888606367899999999999998499884789-----998767-89988898999 Q ss_pred HHHHHHHHHHHHHCCC---CEEE-EEC------CCCCCCCCHHHHHHHHHHHHCC--CCEEEECCCCCCCHHHHHHHHHC Q ss_conf 9999999999985589---3598-001------2300004989999999998506--92899749763226789898709 Q gi|254780558|r 251 QQLLNRLRYSLRRGTG---YFGV-MNY------RGAMLLSNKESAEVIFKEFAKR--GLLFFDDGSSPRNLTRVLAPKLN 318 (396) Q Consensus 251 ~~~~~~l~~~l~~~p~---~vGv-nNh------mGs~~t~~~~~m~~vl~~l~~r--gL~flDs~Ts~~Sva~~~A~~~g 318 (396) +|+.+.+..-++.+-+ ..|+ .+| +.-....|+...+.+++.++.. +|.++ ..++|.-.++|++.| T Consensus 82 ~el~~~i~~Qi~~l~~~a~~~g~~l~hvKpHGALYn~~~~d~~~a~~i~~~i~~~~~~l~l~---~~~~s~~~~~A~~~G 158 (246) T PRK05406 82 EELYALVLYQIGALQAIARAAGGRVRHVKPHGALYNMAAKDPELADAIAEAVAAVDPSLILV---GLAGSELIEAAKAAG 158 (246) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCEECCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEE---ECCCCHHHHHHHHCC T ss_conf 99999999999999999998499631324018999988459999999999999869995699---518819999999859 Q ss_pred CCEEEEEEEECCCCCH----------H----HHHHHHHHHHHHHHH------CCEEEE-----EE--ECCHHHHHHHHHH Q ss_conf 9469845886289999----------9----999999999999987------194899-----98--0887889999999 Q gi|254780558|r 319 LPYMVADLYLDDQVDR----------D----KIREKLKGLEEIART------TGQAIG-----VA--VAFDESIEVISQW 371 (396) Q Consensus 319 vp~~~~dvfLD~~~~~----------~----~I~~qL~~l~~~Ar~------~G~AI~-----ig--h~~p~Ti~~L~~w 371 (396) +++.. ..|.|-.-+. . ....-++++.++++. .|.-|- || .-.|..++.++.. T Consensus 159 l~~~~-E~FADR~Y~~dG~Lv~R~~~gAvi~d~e~~~~q~~~~~~~g~V~ti~G~~i~i~adTICvHgDtp~Av~iak~i 237 (246) T PRK05406 159 LRTAS-EVFADRAYTADGTLVPRSQPGAVIHDPEEAAAQVLQMVQEGRVTAIDGEWIPVEADTICVHGDGPHAVAFARRI 237 (246) T ss_pred CCEEE-EEEECCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEECCCCEEECCCCEEEECCCCHHHHHHHHHH T ss_conf 81898-88402337899987005687761589999999999999779689238988760489899899997899999999 Q ss_pred HHHHHHCCC Q ss_conf 756887691 Q gi|254780558|r 372 LQQEHVRDV 380 (396) Q Consensus 372 ~~~l~~~gi 380 (396) -..|++.|| T Consensus 238 r~~L~~~gI 246 (246) T PRK05406 238 RAALEAAGI 246 (246) T ss_pred HHHHHHCCC T ss_conf 999997769 No 10 >TIGR03471 HpnJ hopanoid biosynthesis associated radical SAM protein HpnJ. One of the well-described hopanoid intermediates is bacteriohopanetetrol. In the conversion from hopene several reactions must occur in the side chain for which a radical mechanism might be reasonable. These include the four (presumably anaerobic) hydroxylations and a methyl shift. Probab=85.60 E-value=2.8 Score=21.46 Aligned_cols=128 Identities=12% Similarity=0.156 Sum_probs=85.3 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEE-CCCCCCCHHHHHHHHHCCCEEEE Q ss_conf 78998999999999998558935980012300004989999999998506928997-49763226789898709946984 Q gi|254780558|r 246 VTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFD-DGSSPRNLTRVLAPKLNLPYMVA 324 (396) Q Consensus 246 ~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flD-s~Ts~~Sva~~~A~~~gvp~~~~ 324 (396) -.-+++.+.+-+++..++.++.--+. +.-.-||.+..-+..+++++++.|+-|-- .+..-+--.-++.++.|.-... T Consensus 225 R~RSpe~VvdEIe~l~~~y~gv~~~~-f~DD~Ft~~~~r~~eic~~i~~l~i~W~~~~Rv~~d~E~l~~mk~AGc~~v~- 302 (472) T TIGR03471 225 RTRSAESVIEEVKYALENFPEVREFF-FDDDTFTDDKPRAEEIARKLGPLGVTWSCNARANVDYETLKVMKENGLRLLL- 302 (472) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCEEE-EECCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHCCEEEE- T ss_conf 21599999999999998668975899-9477667899999999999987698278763034899999999983984899- Q ss_pred EEEECCCCCHHHHH---------HHHHHHHHHHHHCCEEE----EEEECC--HHHHHHHHHHHHHHHHC Q ss_conf 58862899999999---------99999999998719489----998088--78899999997568876 Q gi|254780558|r 325 DLYLDDQVDRDKIR---------EKLKGLEEIARTTGQAI----GVAVAF--DESIEVISQWLQQEHVR 378 (396) Q Consensus 325 dvfLD~~~~~~~I~---------~qL~~l~~~Ar~~G~AI----~igh~~--p~Ti~~L~~w~~~l~~~ 378 (396) +--+..-..|. .+..++.+.+++.|..+ +||.|. .+|++.--+++.++.-. T Consensus 303 ---~GiESgsq~iL~~i~K~~t~e~~~~av~~~k~~GI~v~~~FIiG~PgET~Eti~~Ti~fa~~l~~d 368 (472) T TIGR03471 303 ---VGYESGDQQILKNIKKGLTVEIARRFTRDCHKLGIKVHGTFILGLPGETRETIRKTIDFAKELNPH 368 (472) T ss_pred ---EECCCCCHHHHHHHCCCCCHHHHHHHHHHHHHCCCEEEEEEEECCCCCCHHHHHHHHHHHHHCCCC T ss_conf ---803758999999853899899999999988757987999998779999889999999999975989 No 11 >TIGR02237 recomb_radB DNA repair and recombination protein RadB; InterPro: IPR011939 This family consists exclusively of archaeal RadB protein, a homolog of bacterial RecA, eukaryotic RAD51 (IPR011941 from INTERPRO) and DMC1 (IPR011940 from INTERPRO), and archaeal RadA (IPR011938 from INTERPRO) ,, .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0008094 DNA-dependent ATPase activity, 0006281 DNA repair, 0006310 DNA recombination. Probab=85.56 E-value=2.8 Score=21.45 Aligned_cols=113 Identities=14% Similarity=0.089 Sum_probs=86.0 Q ss_pred HHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHH---HHHCCCCEEEEECCCCCCCCCHHHHHHH Q ss_conf 9999998599299981766568987787855446789989999999999---9855893598001230000498999999 Q gi|254780558|r 212 WMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYS---LRRGTGYFGVMNYRGAMLLSNKESAEVI 288 (396) Q Consensus 212 ~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~---l~~~p~~vGvnNhmGs~~t~~~~~m~~v 288 (396) .+-.|-++|..|+ |=|.+ | ++|.|.+++.+... .+++---+=|-| -..|..+..+.+.. T Consensus 32 ~a~~a~~~Gk~v~---------YiDTE-G------GLS~ER~~q~~~~~~~D~e~~~~~~iv~~--~~~f~eQ~~ai~~~ 93 (223) T TIGR02237 32 LAVNAARQGKKVV---------YIDTE-G------GLSPERFKQIAEDRALDPERVLSNVIVFE--VFDFDEQEVAIQKT 93 (223) T ss_pred HHHHHHHCCCCEE---------EEECC-C------CCHHHHHHHHHHCCCCCHHHHHCCEEEEC--CCCHHHHHHHHHHH T ss_conf 9999986189589---------99628-9------83289999986305889888841535523--53567899999999 Q ss_pred HHHHHCC----CCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCC----CCHHHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 9998506----928997497632267898987099469845886289----999999999999999998719489998 Q gi|254780558|r 289 FKEFAKR----GLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQ----VDRDKIREKLKGLEEIARTTGQAIGVA 358 (396) Q Consensus 289 l~~l~~r----gL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~----~~~~~I~~qL~~l~~~Ar~~G~AI~ig 358 (396) .+.++.. +|..|||-|+.. .+.+. |+. .-.....+||.-|..+|||+-.||.|. T Consensus 94 ~~~~~~~G~~~~LvVvDs~t~~Y----Rle~~------------~d~nk~~~~~~~l~~Ql~~Ll~lArk~~~AVviT 155 (223) T TIGR02237 94 SKLIDRDGDKADLVVVDSFTALY----RLERS------------DDRNKQISLNRELARQLTLLLSLARKKDLAVVIT 155 (223) T ss_pred HHHHHCCCCEEEEEEEECCHHHH----HHCCC------------CCCHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEE T ss_conf 99986068833148881533454----20257------------8602567999999999999999987649978997 No 12 >COG0726 CDA1 Predicted xylanase/chitin deacetylase [Carbohydrate transport and metabolism] Probab=84.89 E-value=3 Score=21.25 Aligned_cols=178 Identities=13% Similarity=0.006 Sum_probs=96.3 Q ss_pred CEEEEEECCCCCCCHHHHH---HHHHCCCCCEEEEECCCCCH-HHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 6589997377877168899---99837935089870898768-9999999985992999817665689877878554467 Q gi|254780558|r 171 ARIAIVVSGLGISQTGTQR---AINLLPANITLAFASNGNSL-DRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKV 246 (396) Q Consensus 171 ~riAIVIddlG~~~~~~~~---ai~~LP~~vT~A~~P~~~~~-~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~ 246 (396) ..|++-.||- +....+.+ .+.+.-.+-||-+.-..-.. ...+++..+.|||+..|- |. +--+. T Consensus 65 k~v~lTFDDg-~~~~~~~~il~iL~k~~i~ATfFv~g~~~~~~p~~~~~~~~~Gheig~H~------~~------h~~~~ 131 (267) T COG0726 65 KAVALTFDDG-PLDGNTPRILPLLKKYGIKATFFVVGSWIERNPDLVKRIAEAGHEIGNHG------YD------HPDLQ 131 (267) T ss_pred CEEEEEEECC-CCCCCHHHHHHHHHHHCCCEEEEEECHHHHHCHHHHHHHHHCCCEEEECC------CC------CCCHH T ss_conf 7799997557-66436599999999819976999733765559799999985896376452------40------13521 Q ss_pred CCCHHHHHHHHHH---HHHHCCCCE--EEEECCCCCCCCCHHHHHHHHHHHHCCCC-EEEECCCCCCCHHHHHHHHHCCC Q ss_conf 8998999999999---998558935--98001230000498999999999850692-89974976322678989870994 Q gi|254780558|r 247 TQTVQQLLNRLRY---SLRRGTGYF--GVMNYRGAMLLSNKESAEVIFKEFAKRGL-LFFDDGSSPRNLTRVLAPKLNLP 320 (396) Q Consensus 247 ~~~~~~~~~~l~~---~l~~~p~~v--GvnNhmGs~~t~~~~~m~~vl~~l~~rgL-~flDs~Ts~~Sva~~~A~~~gvp 320 (396) .++.+++...+.+ .+.++.+.. ++.--.|+.- .......++.|+ .++. - T Consensus 132 ~~~~~~~~~~i~~~~~~l~~~~g~~~~~~r~p~g~~~-------~~~~~~~~~~g~~~~i~--~---------------- 186 (267) T COG0726 132 DLSLEELGAEIARAHDILKKITGGRPRGFRPPYGSYN-------ESSLALARRLGYRAEIP--W---------------- 186 (267) T ss_pred HCCHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCCC-------HHHHHHHHHCCCCEEEE--E---------------- T ss_conf 0698999999999899999851899842878866537-------55999998479750333--0---------------- Q ss_pred EEEEEEEECCCCCHHHHHHHHHHHHHHHHH--CCEEEEEEECC-----HHHHHHHHHHHHHHHHCCCEEEEHHHHHHC Q ss_conf 698458862899999999999999999987--19489998088-----788999999975688769189967774306 Q gi|254780558|r 321 YMVADLYLDDQVDRDKIREKLKGLEEIART--TGQAIGVAVAF-----DESIEVISQWLQQEHVRDVSVVPLSCLAKL 391 (396) Q Consensus 321 ~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~--~G~AI~igh~~-----p~Ti~~L~~w~~~l~~~gi~lVpvS~l~~~ 391 (396) ..-..|-..... .........+..+ .+.+|...|.. +.+..+|.+.+..++.+|..++.++.+... T Consensus 187 ---~~~~~d~~~~~~--~~~~~~~~~~~~~~~~~~~iil~h~~~~~~~~~~~~~l~~~i~~~~~~g~~~~~~~~~~~~ 259 (267) T COG0726 187 ---TVDTFDWPYGTA--LGYNKDAIDILYKEGYKGAIILLHDRASPDRPGTVEALPRLIPYLKAKGYVFVTLDELIEI 259 (267) T ss_pred ---CCCCCCCCCCCC--CCHHHHHHHHHHHCCCCCEEEEECCCCCCCCCCHHHHHHHHHHHHHHHCCEEEEECCCHHH T ss_conf ---256644445653--3114789999985278825998367544566521666999989999708727870310001 No 13 >TIGR00416 sms DNA repair protein RadA; InterPro: IPR004504 RadA/Sms is a highly conserved eubacterial protein that shares sequence similarity with both RecA strand transferase and lon protease. The RadA/Sms family are probable ATP-dependent proteases involved in both DNA repair and degradation of proteins, peptides, glycopeptides. They are classified in as non-peptidase homologues and unassigned peptidases in MEROPS peptidase family S16 (lon protease family, clan SJ). RadA/Sms is involved in recombination and recombinational repair, most likely involving the stabilisation or processing of branched DNA molecules or blocked replication forks because of its genetic redundancy with RecG and RuvABC .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0006281 DNA repair. Probab=84.11 E-value=1.9 Score=22.47 Aligned_cols=81 Identities=23% Similarity=0.225 Sum_probs=57.6 Q ss_pred EEEEE--CCCCCCCCCHHHHHHHHHHHHCC--CCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCC-CHHHHHHHHH Q ss_conf 59800--12300004989999999998506--9289974976322678989870994698458862899-9999999999 Q gi|254780558|r 268 FGVMN--YRGAMLLSNKESAEVIFKEFAKR--GLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQV-DRDKIREKLK 342 (396) Q Consensus 268 vGvnN--hmGs~~t~~~~~m~~vl~~l~~r--gL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~-~~~~I~~qL~ 342 (396) -|+|| |-|..+.=++.-|+.+...+.+- -+..|||-.+-+.- .|-..+ +..-+|.-=. T Consensus 166 dGinnlahdG~L~~L~Et~~e~I~~~~e~~~P~~~ViDSIQ~ly~~-----------------di~SaPGSVsQVRE~t~ 228 (481) T TIGR00416 166 DGINNLAHDGNLYVLSETNLEQICAEIEELNPQVVVIDSIQTLYLP-----------------DISSAPGSVSQVRECTA 228 (481) T ss_pred HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHCCCEEEEECCCCCCCH-----------------HHCCCCCCHHHHHHHHH T ss_conf 3245430267532157579899999998529948999142100000-----------------00258884238889999 Q ss_pred HHHHHHHHCCEEEE-EEECCHH-HH Q ss_conf 99999987194899-9808878-89 Q gi|254780558|r 343 GLEEIARTTGQAIG-VAVAFDE-SI 365 (396) Q Consensus 343 ~l~~~Ar~~G~AI~-igh~~p~-Ti 365 (396) .|-++||+++.||. |||...+ +| T Consensus 229 ~Lmr~AKt~~iaifiVGHVTKeGsi 253 (481) T TIGR00416 229 ELMRLAKTRGIAIFIVGHVTKEGSI 253 (481) T ss_pred HHHHHHHHCCCCEEEEEEECCCCCC T ss_conf 9987652168657997004356754 No 14 >PRK05434 phosphoglyceromutase; Provisional Probab=83.38 E-value=3.5 Score=20.85 Aligned_cols=156 Identities=20% Similarity=0.237 Sum_probs=64.5 Q ss_pred CEEEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCC-----CCCCCCCC Q ss_conf 6589997377877168899998379350898708987689999999985992999817665689877-----87855446 Q gi|254780558|r 171 ARIAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYN-----EDDSYTLK 245 (396) Q Consensus 171 ~riAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~-----~pGp~~L~ 245 (396) |-+=||+||.|++.....+||. .| ..|..+++.+. .-| .+|+--=+..+.|+. +=|...|- T Consensus 6 pvvLiIlDGwG~~~~~~~NAi~-------~A---~tP~~D~L~~~---yP~-t~L~AsG~~VGLP~GqMGNSEVGHlniG 71 (511) T PRK05434 6 PVVLVILDGWGYREETEGNAIA-------LA---KTPNLDRLWAN---YPH-TLLSASGLAVGLPDGQMGNSEVGHLNIG 71 (511) T ss_pred CEEEEEECCCCCCCCCCCCHHH-------HC---CCCHHHHHHHH---CCC-CEEEECCCCCCCCCCCCCCHHHHHHCCC T ss_conf 7899995787779999988888-------65---99709999986---898-4733067735998888654566441016 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCC-----CEEEECCCCCCC-----HHHHHHH Q ss_conf 789989999999999985589359800123000049899999999985069-----289974976322-----6789898 Q gi|254780558|r 246 VTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRG-----LLFFDDGSSPRN-----LTRVLAP 315 (396) Q Consensus 246 ~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rg-----L~flDs~Ts~~S-----va~~~A~ 315 (396) .+---.|-..|+..++.. || |-.+ +++..+++.+++.| ++.|-++-.-.. -.-++|+ T Consensus 72 AGRiv~Qdl~rIn~aI~~-----------gs-f~~n-~~l~~~i~~~k~~~~~lHl~GLlSdGGVHSH~~Hl~al~~~a~ 138 (511) T PRK05434 72 AGRIVYQDLTRINKAIED-----------GS-FFEN-PALLDAIDKAKKNGGALHLMGLLSDGGVHSHIDHLFALLELAA 138 (511) T ss_pred CCEEECCCHHHHHHHHHC-----------CC-CCCC-HHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHH T ss_conf 662361544999999864-----------98-3359-9999999999973981699985238843133789999999999 Q ss_pred HHCCCEEEEEEEECCC-CCHHHHHHHHHHHHHHHHHCCE Q ss_conf 7099469845886289-9999999999999999987194 Q gi|254780558|r 316 KLNLPYMVADLYLDDQ-VDRDKIREKLKGLEEIARTTGQ 353 (396) Q Consensus 316 ~~gvp~~~~dvfLD~~-~~~~~I~~qL~~l~~~Ar~~G~ 353 (396) +.|++-...+.|+|.. ..+..-...|++++..-++.|. T Consensus 139 ~~gv~~v~vH~f~DGRDt~P~Sa~~yl~~le~~l~~~~~ 177 (511) T PRK05434 139 KEGVKKVYLHAFLDGRDTPPKSALGYLEELEAKLAELGV 177 (511) T ss_pred HCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 708884899985168888842589999999998886188 No 15 >PRK06846 deaminase; Validated Probab=78.15 E-value=5.2 Score=19.73 Aligned_cols=123 Identities=14% Similarity=0.055 Sum_probs=60.7 Q ss_pred HCCCCCEEEEECCC------CCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH-CC Q ss_conf 37935089870898------768999999998599299981766568987787855446789989999999999985-58 Q gi|254780558|r 193 LLPANITLAFASNG------NSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRR-GT 265 (396) Q Consensus 193 ~LP~~vT~A~~P~~------~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~-~p 265 (396) ++...+-=.+.|+. .+++...+.|.+.|..|=+|+ + ...++| ...++...+..+.. +. T Consensus 186 ~~Gad~vGGvdP~~~d~d~~~~L~~~f~LA~~~~~~vD~Hl-----d-E~~d~~---------~~~l~~la~~t~~~g~~ 250 (410) T PRK06846 186 QMGAHLVGGVDPATVDGAIEKSLDTMFQIAVEFNKGVDIHL-----H-DTNPLG---------VATIKKLVETTEEAQWK 250 (410) T ss_pred HHCCCEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCCEEEE-----C-CCCCHH---------HHHHHHHHHHHHHHCCC T ss_conf 85896798437766785799999999999998099923631-----5-889847---------78999999999982998 Q ss_pred CCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHH-HHHHHHHCCCEEEEEEEECCCCCH Q ss_conf 935980012300004989999999998506928997497632267-898987099469845886289999 Q gi|254780558|r 266 GYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLT-RVLAPKLNLPYMVADLYLDDQVDR 334 (396) Q Consensus 266 ~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva-~~~A~~~gvp~~~~dvfLD~~~~~ 334 (396) +- =.-.|--|.-..++.-.+.+++.|++-|.-++...-..+.+. -..-++.||+..... ||.+|. T Consensus 251 gr-Vt~sH~~sL~~~~~~~~~~~~~~la~agI~vvst~P~~r~v~pv~eL~~aGV~V~~g~---Dni~Dp 316 (410) T PRK06846 251 GR-VTISHAFALGDLNEEEVEELAERLAAQQISITSTVPIGRLHMPLPLLHEKGVKVSLGT---DSVTDH 316 (410) T ss_pred CC-EEEEHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCHHHHHHCCCEEEEEC---CCCCCC T ss_conf 98-8837344464399999999999999859939985899899787999997699199826---888689 No 16 >PRK07625 consensus Probab=77.10 E-value=0.56 Score=25.85 Aligned_cols=27 Identities=15% Similarity=0.218 Sum_probs=14.1 Q ss_pred CCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 498999999999850692899749763 Q gi|254780558|r 280 SNKESAEVIFKEFAKRGLLFFDDGSSP 306 (396) Q Consensus 280 ~~~~~m~~vl~~l~~rgL~flDs~Ts~ 306 (396) .+....+.+++.+...+++-+|.-|+. T Consensus 329 ~~~e~l~~~~~~l~~~~~~a~dtEtts 355 (922) T PRK07625 329 QTWEQFDAWLAKISAAELTAFDTETTS 355 (922) T ss_pred CCHHHHHHHHHHHHCCCEEEEEEECCC T ss_conf 789999999998640884899952277 No 17 >COG2390 DeoR Transcriptional regulator, contains sigma factor-related N-terminal domain [Transcription] Probab=75.55 E-value=6.1 Score=19.28 Aligned_cols=82 Identities=17% Similarity=0.265 Sum_probs=39.9 Q ss_pred HHHHHHHHHHH---CCCCEEE---------ECCCCCCCHHHHHHHHHCCC--EEEEEEEECCCCCHHHHHH--HHHHHHH Q ss_conf 99999999985---0692899---------74976322678989870994--6984588628999999999--9999999 Q gi|254780558|r 283 ESAEVIFKEFA---KRGLLFF---------DDGSSPRNLTRVLAPKLNLP--YMVADLYLDDQVDRDKIRE--KLKGLEE 346 (396) Q Consensus 283 ~~m~~vl~~l~---~rgL~fl---------Ds~Ts~~Sva~~~A~~~gvp--~~~~dvfLD~~~~~~~I~~--qL~~l~~ 346 (396) ++|..+.+.+. .+++.|| +..+.++.++..+|++++-+ +...=.|+|+....+.+.+ -+..... T Consensus 124 rTv~a~~~~l~~~~~~~~~vV~l~GG~~~~~~~~~~~~~~~~~A~k~~~~~~~l~aP~~~~s~e~r~~l~~e~~v~~vl~ 203 (321) T COG2390 124 RTLSAVVDNLPPAPLRDVKVVQLTGGVGHADGSYNANTIALRLAEKLGAESYLLPAPLVASSPELREALLQEPSVREVLD 203 (321) T ss_pred HHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHCCHHHHHHH T ss_conf 79999998559676688389988898887766667879999999984991786636700789999999984828999999 Q ss_pred HHHHCCEE-EEEEECCHHH Q ss_conf 99871948-9998088788 Q gi|254780558|r 347 IARTTGQA-IGVAVAFDES 364 (396) Q Consensus 347 ~Ar~~G~A-I~igh~~p~T 364 (396) .+++--.| ++||.+.+.. T Consensus 204 ~~~~ad~alvGIG~~~~~~ 222 (321) T COG2390 204 LARSADLALVGIGSLSANS 222 (321) T ss_pred HHHHCCEEEEECCCCCCCC T ss_conf 9975799999547776446 No 18 >PRK13209 L-xylulose 5-phosphate 3-epimerase; Reviewed Probab=75.34 E-value=6.2 Score=19.25 Aligned_cols=157 Identities=17% Similarity=0.174 Sum_probs=96.6 Q ss_pred ECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC---EEEEECCCCCC- Q ss_conf 08987689999999985992999817665689877878554467899899999999999855893---59800123000- Q gi|254780558|r 203 ASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGY---FGVMNYRGAML- 278 (396) Q Consensus 203 ~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~---vGvnNhmGs~~- 278 (396) +|..-+..+..+.|++.|++.+= +- +|.+ ...--..+.+.+++.. |...+....-. +-+..||.--| T Consensus 17 ~~~~~sw~e~~~~ak~~Gfd~iE-ls---iDe~----d~~~~rL~w~~~~~~~-ir~~~~~~gi~i~s~cls~~~~~Pl~ 87 (283) T PRK13209 17 LPAGECWLEKLRIAKTAGFDFVE-MS---VDET----DERLARLDWSREQRLA-LVNALVETGFRVNSMCLSAHRRFPLG 87 (283) T ss_pred CCCCCCHHHHHHHHHHCCCCEEE-EE---CCCC----CCCCCCCCCCHHHHHH-HHHHHHHCCCCCEEEECCCCCCCCCC T ss_conf 58999999999999985998799-84---2685----3100358999999999-99999981998603305455579999 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEE---EEEECCCCCHH---HHHHHHHHHHHHHHHCC Q ss_conf 0498999999999850692899749763226789898709946984---58862899999---99999999999998719 Q gi|254780558|r 279 LSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVA---DLYLDDQVDRD---KIREKLKGLEEIARTTG 352 (396) Q Consensus 279 t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~---dvfLD~~~~~~---~I~~qL~~l~~~Ar~~G 352 (396) ..|+...+.-++.+++ +-.+|..+|++...- |+.- ...+.+ .-...|.++...|.+.| T Consensus 88 S~D~~~R~~~~e~~~k---------------aI~lA~~LGi~~I~lag~dv~~-~~~~~e~~~~f~e~L~~~~~~A~~~g 151 (283) T PRK13209 88 SEDDAVRAQGLEIMRK---------------AIQLAQDLGIRVIQLAGYDVYY-EEANNETRRRFIDGLKESVEMASRYS 151 (283) T ss_pred CCCHHHHHHHHHHHHH---------------HHHHHHHCCCCEEEECCCCCCC-CCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9799999999999999---------------9999998099989968876678-87859999999999999999999859 Q ss_pred EEEEEEEC---CHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 48999808---878899999997568876918996 Q gi|254780558|r 353 QAIGVAVA---FDESIEVISQWLQQEHVRDVSVVP 384 (396) Q Consensus 353 ~AI~igh~---~p~Ti~~L~~w~~~l~~~gi~lVp 384 (396) .-|+|=.. +=+|++...++...+..-.+.+-+ T Consensus 152 V~L~iE~~~~~f~~t~~~~~~~i~~v~sP~l~v~~ 186 (283) T PRK13209 152 VTLAFEIMDTPFMGSISKALGYAHYLNSPWFQLYP 186 (283) T ss_pred CEEEEEECCHHHHCCHHHHHHHHHHCCCCCEEEEE T ss_conf 98999425534321599999999966997289994 No 19 >pfam03740 PdxJ Pyridoxal phosphate biosynthesis protein PdxJ. Members of this family belong to the PdxJ family that catalyses the condensation of 1-deoxy-d-xylulose-5-phosphate (DXP) and 1-amino-3-oxo-4-(phosphohydroxy)propan-2-one to form pyridoxine 5'-phosphate (PNP). This reaction is involved in de novo synthesis of pyridoxine (vitamin B6) and pyridoxal phosphate. Probab=75.22 E-value=6.3 Score=19.23 Aligned_cols=116 Identities=14% Similarity=0.197 Sum_probs=75.6 Q ss_pred HHHHHHHCCCCEEEE--------ECCCCCCCCCHHHHHHHHHHHHCCCC---EEEECCCCCCCHHHHHHHHHCCCEEEE- Q ss_conf 999998558935980--------01230000498999999999850692---899749763226789898709946984- Q gi|254780558|r 257 LRYSLRRGTGYFGVM--------NYRGAMLLSNKESAEVIFKEFAKRGL---LFFDDGSSPRNLTRVLAPKLNLPYMVA- 324 (396) Q Consensus 257 l~~~l~~~p~~vGvn--------NhmGs~~t~~~~~m~~vl~~l~~rgL---~flDs~Ts~~Sva~~~A~~~gvp~~~~- 324 (396) +.-++.--|..+-+. ..-|=-+..+...+..+++.|++.|. +|+|. .-..-..|++.|..+.+- T Consensus 77 i~ia~~~kP~qvtLVPE~r~elTTegGld~~~~~~~L~~~i~~lk~~girvSlFIDp----d~~~i~~a~~~Gad~VElh 152 (239) T pfam03740 77 LELALKTKPHQVTLVPEKREEITTEGGLDVVAQLEKLKPAIRRLKNAGIRVSLFIDP----DPEQIEAAKIVGADRIELH 152 (239) T ss_pred HHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEECC----CHHHHHHHHHCCCCEEEEE T ss_conf 999998499858988899987356888063340689999999986078538997079----9899999998092999850 Q ss_pred -----EEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEEEECCH-HHHHHHHHHHHHHHH Q ss_conf -----58862899999999999999999987194899980887-889999999756887 Q gi|254780558|r 325 -----DLYLDDQVDRDKIREKLKGLEEIARTTGQAIGVAVAFD-ESIEVISQWLQQEHV 377 (396) Q Consensus 325 -----dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~p-~Ti~~L~~w~~~l~~ 377 (396) +.|-+..........++.++.+.|.+.|.-|--||-.. +++..|.+ ++.+++ T Consensus 153 TG~YA~a~~~~~~~~~~~l~~i~~aa~~A~~lGL~VnAGHgLn~~Nl~~l~~-i~~i~E 210 (239) T pfam03740 153 TGPYADAHNDAEQAREELLERLAAGAALAFDLGLVVNAGHGLTYHNVKPVAK-IPPIAE 210 (239) T ss_pred CHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCEEECCCCCCHHHHHHHHC-CCCCEE T ss_conf 4778877513155579999999999999987498574678988766999852-899748 No 20 >TIGR01418 PEP_synth phosphoenolpyruvate synthase; InterPro: IPR006319 This family represents phosphoenolpyruvate synthase; also called pyruvate,water dikinase and PEP synthase. The member from Methanococcus jannaschii contains a large intein. This enzyme generates phosphoenolpyruvate (PEP) from pyruvate, hydrolyzing ATP to AMP and releasing inorganic phosphate in the process. The enzyme shows extensive homology to other enzymes that use PEP as substrate or product. This enzyme may provide PEP for gluconeogenesis, for PTS-type carbohydrate transport systems, or for other processes. ; GO: 0008986 pyruvate water dikinase activity, 0006094 gluconeogenesis. Probab=75.07 E-value=6.3 Score=19.21 Aligned_cols=123 Identities=16% Similarity=0.291 Sum_probs=62.7 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCC--CCCCHHHHHHHHHHHHCCCC--------EEEECCCCCCCHH-HHHHHH--H Q ss_conf 999999999998558935980012300--00498999999999850692--------8997497632267-898987--0 Q gi|254780558|r 251 QQLLNRLRYSLRRGTGYFGVMNYRGAM--LLSNKESAEVIFKEFAKRGL--------LFFDDGSSPRNLT-RVLAPK--L 317 (396) Q Consensus 251 ~~~~~~l~~~l~~~p~~vGvnNhmGs~--~t~~~~~m~~vl~~l~~rgL--------~flDs~Ts~~Sva-~~~A~~--~ 317 (396) ++--..=..|+.+|-+=-|+.| +.=- |..+..=.+.++++|.+.|| +|+--=+=++=+. .+.++. - T Consensus 701 ~eaF~lE~~A~k~VR~eMGLtN-V~vMiPFvRT~~E~~rv~~im~~~GL~~g~nGl~~~vM~E~PsnvlLAd~f~~~GfD 779 (877) T TIGR01418 701 EEAFRLECKAIKRVREEMGLTN-VEVMIPFVRTPEEGKRVLEIMAEEGLKRGKNGLEVYVMCEVPSNVLLADEFAKLGFD 779 (877) T ss_pred HHHHHHHHHHHHHHHHHHCCCC-CEECCCCCCCHHHHHHHHHHHHHCCCCCCCCCCEEEEEEEHHHHHHHHHHHHHHCCC T ss_conf 4799999999998876508513-311357548889999999999972888477855799988356899999989850877 Q ss_pred CCCEEEEE-----------------E-EECCCCCHHHHHHHHHHHHHHHHHCCEEEEEE----ECCHHHHHHHHHHHHHH Q ss_conf 99469845-----------------8-86289999999999999999998719489998----08878899999997568 Q gi|254780558|r 318 NLPYMVAD-----------------L-YLDDQVDRDKIREKLKGLEEIARTTGQAIGVA----VAFDESIEVISQWLQQE 375 (396) Q Consensus 318 gvp~~~~d-----------------v-fLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~ig----h~~p~Ti~~L~~w~~~l 375 (396) |.-..+|| + +.| +++ .++++-+.++++.||++|..++|| +-||+-++ ||=+ T Consensus 780 GFSiGSNDLTQLTLgvDRDSelVA~lGYFD-Ern-~AVL~li~~~I~aAk~~G~~vgICGQAPSDyPe~ve----fLVe- 852 (877) T TIGR01418 780 GFSIGSNDLTQLTLGVDRDSELVAHLGYFD-ERN-PAVLRLIEMAIKAAKEHGKKVGICGQAPSDYPEVVE----FLVE- 852 (877) T ss_pred EEEECCCHHHHHHHHHHCCCHHHHHCCCCC-CCC-HHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHH----HHHH- T ss_conf 156575168776642010431433037775-768-799999999999998739969870688887389999----9985- Q ss_pred HHCCCEEE Q ss_conf 87691899 Q gi|254780558|r 376 HVRDVSVV 383 (396) Q Consensus 376 ~~~gi~lV 383 (396) +||.=+ T Consensus 853 --~GIDSi 858 (877) T TIGR01418 853 --EGIDSI 858 (877) T ss_pred --CCCCEE T ss_conf --498769 No 21 >pfam03796 DnaB_C DnaB-like helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the Escherichia coli chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=74.27 E-value=6.6 Score=19.08 Aligned_cols=119 Identities=19% Similarity=0.152 Sum_probs=66.1 Q ss_pred HHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCC-CHHHHHHHHHHHHC Q ss_conf 99859929998176656898778785544678998999999999998558935980012300004-98999999999850 Q gi|254780558|r 216 AKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLS-NKESAEVIFKEFAK 294 (396) Q Consensus 216 Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~-~~~~m~~vl~~l~~ 294 (396) |.+.|+-|+. +...|+.+++..|+...++.++...=- ...++. +...+......+.+ T Consensus 44 a~~~g~~Vl~------------------~slEm~~~~~~~R~~a~~~~v~~~~i~----~~~~~~~~~~~~~~~~~~~~~ 101 (186) T pfam03796 44 ALKQDKPVLF------------------FSLEMSAEQLAERLLSSESRISSSKLR----SGQLSDEDWERLAEAAGELSE 101 (186) T ss_pred HHHCCCCEEE------------------ECCCCCHHHHHHHHHHHHHCCCHHHHH----CCCHHHHHHHHHHHHHHHHHC T ss_conf 9970996687------------------547552999999999986267655541----251216799999999999853 Q ss_pred CCCEEEECC-CCCCCH---HHHHHHHHCCCEEEEEEEECCC----------C-C-HHHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 692899749-763226---7898987099469845886289----------9-9-9999999999999998719489998 Q gi|254780558|r 295 RGLLFFDDG-SSPRNL---TRVLAPKLNLPYMVADLYLDDQ----------V-D-RDKIREKLKGLEEIARTTGQAIGVA 358 (396) Q Consensus 295 rgL~flDs~-Ts~~Sv---a~~~A~~~gvp~~~~dvfLD~~----------~-~-~~~I~~qL~~l~~~Ar~~G~AI~ig 358 (396) ..+++.|.. .+...+ ...+..+.++. -+|+|.- . + ..+|..--.+|-.+|++.+..|... T Consensus 102 ~~l~i~~~~~~t~~~i~~~i~~~~~~~~~~----~vvvDyl~l~~~~~~~~~~~r~~~v~~i~~~Lk~lA~e~~i~ii~~ 177 (186) T pfam03796 102 APLYIDDTPGLSLSELRAQARRLKREHGLG----LIVIDYLQLMQGSKASKSENRQQEISEISRSLKALAKELNIPVIAL 177 (186) T ss_pred CCEEEECCCCCCHHHHHHHHHHHHHHCCCC----EEEEEHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 986884799998999999999999855998----8997489863677888775599999999999999999979918997 Q ss_pred EC Q ss_conf 08 Q gi|254780558|r 359 VA 360 (396) Q Consensus 359 h~ 360 (396) +- T Consensus 178 sQ 179 (186) T pfam03796 178 SQ 179 (186) T ss_pred EC T ss_conf 22 No 22 >pfam02092 tRNA_synt_2f Glycyl-tRNA synthetase beta subunit. Probab=74.05 E-value=5.5 Score=19.59 Aligned_cols=176 Identities=16% Similarity=0.144 Sum_probs=94.9 Q ss_pred CCCCCCCCEEEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCC- Q ss_conf 1234668658999737787716889999837935089870898768999999998599299981766568987787855- Q gi|254780558|r 164 FCSNASGARIAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSY- 242 (396) Q Consensus 164 ~~~~a~~~riAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~- 242 (396) ...++..-|++++|.|+-..+.. .......+++..||...+.-.+...--+|..|-++ +-+.....+-|.. T Consensus 40 i~~f~TPRRLav~i~~l~~~q~d--~~~e~kGP~~~~AfD~~G~ptkA~~GFaks~gv~~------~~L~~~~t~kgey~ 111 (549) T pfam02092 40 VKVFATPRRLAVLVEGLAEKQPD--KEEEKKGPPVKIAFDADGNPTKAAEGFARSQGVSV------DDLEIRETGKGEYL 111 (549) T ss_pred CEEEECCCEEEEEECCCCCCCCC--HHHEEECCCHHHHCCCCCCCCHHHHHHHHHCCCCH------HHHEEEECCCEEEE T ss_conf 06987487888885376300866--24321387657654657991799999999869998------99254662886699 Q ss_pred ---CCCCCCC-HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCC----------------CEEEEC Q ss_conf ---4467899-89999999999985589359800123000049899999999985069----------------289974 Q gi|254780558|r 243 ---TLKVTQT-VQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRG----------------LLFFDD 302 (396) Q Consensus 243 ---~L~~~~~-~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rg----------------L~flDs 302 (396) ....+.. .+-+...+..++..+|.-.-+-= |+.=..--+..+|++..+..+- +-|+.. T Consensus 112 ~~~~~~~G~~~~elL~~ii~~~i~~l~~pKsMRW--g~~~~~fvRPirwi~aL~~~~vi~~~~~gi~s~n~T~GHRf~~~ 189 (549) T pfam02092 112 FAKKEEKGRPTAELLPEILPEAIKSLPFPKSMRW--GDGDLRFVRPIRWLVALLGDEVVPFEILGITSGNTTRGHRFLGP 189 (549) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCC--CCCCCEEECHHHHHHHHCCCEEEEEEEEEEECCCEECCCCCCCC T ss_conf 9998208841999999999999967999863140--89997676618989987078066899940322543314312578 Q ss_pred CCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEE Q ss_conf 976322678989870994698458862899999999999999999987194899 Q gi|254780558|r 303 GSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQAIG 356 (396) Q Consensus 303 ~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~ 356 (396) ..-.=+-+...-.. -..+.|.+|.+...+.|.+|++++ |+++|.-+- T Consensus 190 ~~i~I~~~~~Y~~~----L~~~~Vi~d~~eRk~~I~~~i~~~---a~~~~~~~~ 236 (549) T pfam02092 190 GPITIASADDYEET----LRKAGVIVDFEERKELIREQIEAL---AAELGGKVD 236 (549) T ss_pred CCEEECCHHHHHHH----HHHCCEEECHHHHHHHHHHHHHHH---HHHCCCEEC T ss_conf 98674899999999----852989926999999999999999---997695786 No 23 >cd06808 PLPDE_III Type III Pyridoxal 5-phosphate (PLP)-Dependent Enzymes. The fold type III PLP-dependent enzyme family is predominantly composed of two-domain proteins with similarity to bacterial alanine racemases (AR) including eukaryotic ornithine decarboxylases (ODC), prokaryotic diaminopimelate decarboxylases (DapDC), biosynthetic arginine decarboxylases (ADC), carboxynorspermidine decarboxylases (CANSDC), and similar proteins. AR-like proteins contain an N-terminal PLP-binding TIM-barrel domain and a C-terminal beta-sandwich domain. They exist as homodimers with active sites that lie at the interface between the TIM barrel domain of one subunit and the beta-sandwich domain of the other subunit. These proteins play important roles in the biosynthesis of amino acids and polyamine. The family also includes the single-domain YBL036c-like proteins, which contain a single PLP-binding TIM-barrel domain without any N- or C-terminal extensions. Due to the lack of a second domain, these p Probab=73.08 E-value=7.1 Score=18.90 Aligned_cols=89 Identities=12% Similarity=0.074 Sum_probs=56.9 Q ss_pred CCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC--CCEEEEECCCCCCCCCH Q ss_conf 9876899999999859929998176656898778785544678998999999999998558--93598001230000498 Q gi|254780558|r 205 NGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGT--GYFGVMNYRGAMLLSNK 282 (396) Q Consensus 205 ~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p--~~vGvnNhmGs~~t~~~ 282 (396) ....+.++.+.+++.+..+-+|+-+ |+|...-.-|++.++....+.+..+ .+ ...|+.-|+||...... T Consensus 89 s~~~l~~l~~~a~~~~~~~~v~lri--------ntg~~~~rfG~~~~e~~~~~~~~~~-~~~l~l~Gi~~H~gs~~~~~~ 159 (211) T cd06808 89 SLEELEKLEEAALKAGPPARVLLRI--------DTGDENGKFGVRPEELKALLERAKE-LPHLRLVGLHTHFGSADEDYS 159 (211) T ss_pred CHHHHHHHHHHHHHCCCCCCEEEEE--------ECCCCCCCCCCCHHHHHHHHHHHHH-CCCCCCEEEEEEEECCCCCHH T ss_conf 7788877899888538885147767--------2277767778899999999999996-898666114665434438989 Q ss_pred ------HHHHHHHHHHHCCCC--EEEEC Q ss_conf ------999999999850692--89974 Q gi|254780558|r 283 ------ESAEVIFKEFAKRGL--LFFDD 302 (396) Q Consensus 283 ------~~m~~vl~~l~~rgL--~flDs 302 (396) ..+..+++++++.|+ -|+|= T Consensus 160 ~~~~~~~~~~~~~~~l~~~g~~~~~i~i 187 (211) T cd06808 160 PFVEALSRFVAALDQLGELGIDLEQLSI 187 (211) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 9999999999999999975969798996 No 24 >cd01125 repA Hexameric Replicative Helicase RepA. RepA is encoded by a plasmid, which is found in most Gram negative bacteria. RepA is a 5'-3' DNA helicase which can utilize ATP, GTP and CTP to a lesser extent. Probab=73.04 E-value=7.1 Score=18.89 Aligned_cols=124 Identities=19% Similarity=0.115 Sum_probs=70.8 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCC-CCHHHHHHHHHHHHHHC-----CCCEEEEECC-----CCC--CC Q ss_conf 99999859929998176656898778785544678-99899999999999855-----8935980012-----300--00 Q gi|254780558|r 213 MKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVT-QTVQQLLNRLRYSLRRG-----TGYFGVMNYR-----GAM--LL 279 (396) Q Consensus 213 ~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~-~~~~~~~~~l~~~l~~~-----p~~vGvnNhm-----Gs~--~t 279 (396) +..+-+.|.+.+ ..++.+ ..+|.=.+... .+.+++.+||....... ++..-+++-. +.+ .. T Consensus 21 lAl~vA~G~~~~-g~~~~~-----~~~G~Vly~~~Ed~~~~l~rRl~a~~~~~~~~~~~~rl~~~~~~~~~l~~~~~~~~ 94 (239) T cd01125 21 LALAMALGKNLF-GGGLKV-----TEPGRVVYLSAEDPREEIHRRLEAILQHLEPDDAGDRLFIDSGRIQPISIAREGRI 94 (239) T ss_pred HHHHHHCCCCCC-CCCCCC-----CCCCEEEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCEEECCCCCCCCCCCCCCCC T ss_conf 999997599656-898546-----87761999978899899999999999863866555416641556664413357721 Q ss_pred CCHHHHHHHHHHHHCC--CCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEE Q ss_conf 4989999999998506--92899749763226789898709946984588628999999999999999999871948999 Q gi|254780558|r 280 SNKESAEVIFKEFAKR--GLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQAIGV 357 (396) Q Consensus 280 ~~~~~m~~vl~~l~~r--gL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~i 357 (396) .....++.+.+.+... .|..||-... -.+ .++++..++...+..+++++.+.|.+|.+ T Consensus 95 ~~~~~~~~i~~~~~~~~~~LVViDpL~~----------~~~----------~dENd~~~m~~~i~~l~~ia~~tg~aVl~ 154 (239) T cd01125 95 IVVPEFERIIEQLLIRRIDLVVIDPLVS----------FHG----------VSENDNGAMDAVIKALRRIAAQTGAAILL 154 (239) T ss_pred CCHHHHHHHHHHHHCCCCCEEEECCCHH----------HCC----------CCCCCHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 0148999999997458998999838177----------489----------99788999999999999999971999999 Q ss_pred EE-CCH Q ss_conf 80-887 Q gi|254780558|r 358 AV-AFD 362 (396) Q Consensus 358 gh-~~p 362 (396) .| ... T Consensus 155 vHHt~K 160 (239) T cd01125 155 VHHVRK 160 (239) T ss_pred EECCCC T ss_conf 706887 No 25 >PRK04004 translation initiation factor IF-2; Validated Probab=71.28 E-value=7.8 Score=18.64 Aligned_cols=33 Identities=18% Similarity=0.264 Sum_probs=17.3 Q ss_pred HHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHH Q ss_conf 999998719489998088788999999975688 Q gi|254780558|r 344 LEEIARTTGQAIGVAVAFDESIEVISQWLQQEH 376 (396) Q Consensus 344 l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~ 376 (396) +...|.++|.-|-....--.-++-+++|..+++ T Consensus 419 a~~~Ae~~gV~I~~~~IIY~LiDd~~~~~~~~~ 451 (592) T PRK04004 419 AEEEAESSDVKIFEGDVIYQLVEDYEKWVKEQK 451 (592) T ss_pred HHHHHHHCCCEEEEECHHHHHHHHHHHHHHHHH T ss_conf 999999749769980748768999999999756 No 26 >PRK13886 conjugal transfer protein TraL; Provisional Probab=70.51 E-value=8.1 Score=18.53 Aligned_cols=50 Identities=10% Similarity=0.074 Sum_probs=37.5 Q ss_pred HHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCC----CEEEEECCCC Q ss_conf 9999998599299981766568987787855446789989999999999985589----3598001230 Q gi|254780558|r 212 WMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTG----YFGVMNYRGA 276 (396) Q Consensus 212 ~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~----~vGvnNhmGs 276 (396) ..+.....||++.+|.+|.- |..-.+-..-|...++++|. .|.+|.|-|. T Consensus 105 v~~ll~e~G~~v~iH~vi~G---------------Gqa~~DT~~gl~~l~~~~~~~~~~VVWlN~~~G~ 158 (241) T PRK13886 105 VPALLQDMGHELVVHTVVTG---------------GQALLDTVSGFAQLASQFPAECLFVVWLNPYWGP 158 (241) T ss_pred HHHHHHHCCCEEEEEEEECC---------------CCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCC T ss_conf 69999977973999998459---------------8427789999999998658778789997488786 No 27 >PRK08786 consensus Probab=66.24 E-value=1.3 Score=23.56 Aligned_cols=26 Identities=15% Similarity=0.308 Sum_probs=13.5 Q ss_pred CHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 98999999999850692899749763 Q gi|254780558|r 281 NKESAEVIFKEFAKRGLLFFDDGSSP 306 (396) Q Consensus 281 ~~~~m~~vl~~l~~rgL~flDs~Ts~ 306 (396) +......++..++..+.+-+|.-|+. T Consensus 334 ~~e~L~~~i~~l~~~~~~afdtET~s 359 (927) T PRK08786 334 TQEQLDSWIARLRAAGQFAFDTETDS 359 (927) T ss_pred CHHHHHHHHHHHHHCCEEEEEEECCC T ss_conf 79999999999873683899972388 No 28 >cd00984 DnaB_C DnaB helicase C terminal domain. The hexameric helicase DnaB unwinds the DNA duplex at the chromosome replication fork. Although the mechanism by which DnaB both couples ATP hydrolysis to translocation along DNA and denatures the duplex is unknown, a change in the quaternary structure of the protein involving dimerization of the N-terminal domain has been observed and may occur during the enzymatic cycle. This C-terminal domain contains an ATP-binding site and is therefore probably the site of ATP hydrolysis. Probab=66.20 E-value=9.9 Score=17.98 Aligned_cols=121 Identities=18% Similarity=0.177 Sum_probs=68.5 Q ss_pred HHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHC Q ss_conf 99985992999817665689877878554467899899999999999855893598001230000498999999999850 Q gi|254780558|r 215 EAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAK 294 (396) Q Consensus 215 ~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~ 294 (396) .|.++|+-|++ +...|+.+++..|+...++.++...-.+ |..-..+..........+.+ T Consensus 37 ~a~~~g~~V~~------------------~SlEm~~~~~~~R~~s~~~~i~~~~i~~---~~~~~~~~~~~~~~~~~~~~ 95 (242) T cd00984 37 IAKKQGKPVLF------------------FSLEMSKEQLLQRLLASESGISLSKLRT---GSLSDEDWERLAEAIGELKE 95 (242) T ss_pred HHHHCCCCEEE------------------EECCCCHHHHHHHHHHHHHCCCHHHHHC---CCCCHHHHHHHHHHHHHHCC T ss_conf 99977995999------------------9333538899999999982977455302---65227999999999998616 Q ss_pred CCCEEEECCC-CCCCH---HHHHHHHHCCCEEEEEEEECC------C----CCHHHHHHHHHHHHHHHHHCCEEEE-EEE Q ss_conf 6928997497-63226---789898709946984588628------9----9999999999999999987194899-980 Q gi|254780558|r 295 RGLLFFDDGS-SPRNL---TRVLAPKLNLPYMVADLYLDD------Q----VDRDKIREKLKGLEEIARTTGQAIG-VAV 359 (396) Q Consensus 295 rgL~flDs~T-s~~Sv---a~~~A~~~gvp~~~~dvfLD~------~----~~~~~I~~qL~~l~~~Ar~~G~AI~-igh 359 (396) ..|++.|... +...+ +..+..+.++.. ||+|. . ....+|..-..+|-.+|++.+..|. +.| T Consensus 96 ~~l~i~d~~~~t~~~i~~~ir~~~~~~~~~~----vvvDylql~~~~~~~~~~~~~i~~i~~~Lk~lA~e~~v~Vi~~sQ 171 (242) T cd00984 96 LPIYIDDSSSLTVSDIRSRARRLKKEHGLGL----IVIDYLQLMSGSKKKGNRQQEVAEISRSLKLLAKELNVPVIALSQ 171 (242) T ss_pred CCEEEECCCCCCHHHHHHHHHHHHHHCCCCE----EEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCEEEEEEC T ss_conf 9889966999999999999999988369989----998269854677766579999999999999999997993999846 Q ss_pred C Q ss_conf 8 Q gi|254780558|r 360 A 360 (396) Q Consensus 360 ~ 360 (396) . T Consensus 172 l 172 (242) T cd00984 172 L 172 (242) T ss_pred C T ss_conf 7 No 29 >cd02072 Glm_B12_BD B12 binding domain of glutamate mutase (Glm). Glutamate mutase catalysis the conversion of (S)-glutamate with (2S,3S)-3-methylaspartate. The rearrangement reaction is initiated by the extraction of a hydrogen from the protein-bound substrate by a 5'-desoxyadenosyl radical, which is generated by the homolytic cleavage of the organometallic bond of the cofactor B12. Glm is a heterotetrameric molecule consisting of two alpha and two epsilon polypeptide chains. Probab=65.82 E-value=10 Score=17.93 Aligned_cols=64 Identities=11% Similarity=0.072 Sum_probs=48.1 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHH Q ss_conf 89999999985992999817665689877878554467899899999999999855893598001230000498999999 Q gi|254780558|r 209 LDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVI 288 (396) Q Consensus 209 ~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~v 288 (396) ..-.+..-+.+|+||+- |-...+++++ +..++..-..++|+...+|... ..|+.+ T Consensus 16 ~~iva~~l~~~GfeVi~------------------lG~~~~~e~~---~~~a~e~~ad~i~vSsl~g~~~----~~~~~l 70 (128) T cd02072 16 NKILDHAFTEAGFNVVN------------------LGVLSPQEEF---IDAAIETDADAILVSSLYGHGE----IDCKGL 70 (128) T ss_pred HHHHHHHHHHCCCEEEE------------------CCCCCCHHHH---HHHHHHCCCCEEEECCCCCCCC----HHHHHH T ss_conf 99999999978972984------------------7988999999---9999873999999823202562----489999 Q ss_pred HHHHHCCCC Q ss_conf 999850692 Q gi|254780558|r 289 FKEFAKRGL 297 (396) Q Consensus 289 l~~l~~rgL 297 (396) ++.|+++|+ T Consensus 71 ~~~L~e~G~ 79 (128) T cd02072 71 REKCDEAGL 79 (128) T ss_pred HHHHHHCCC T ss_conf 999996799 No 30 >PRK07997 consensus Probab=65.53 E-value=1.2 Score=23.65 Aligned_cols=26 Identities=15% Similarity=0.323 Sum_probs=15.0 Q ss_pred CCHHHHHHHHHHHHCCCCEEEECCCC Q ss_conf 49899999999985069289974976 Q gi|254780558|r 280 SNKESAEVIFKEFAKRGLLFFDDGSS 305 (396) Q Consensus 280 ~~~~~m~~vl~~l~~rgL~flDs~Ts 305 (396) .+...+..+++.+++.+.+-+|--|+ T Consensus 334 ~~~~~l~~~~~~l~~~~~~a~DtEt~ 359 (928) T PRK07997 334 LDEETLKAWIEKLKKAPVFAFDTETD 359 (928) T ss_pred CCHHHHHHHHHHHHCCCEEEEEEECC T ss_conf 78999999999864298699995127 No 31 >cd00003 PNPsynthase Pyridoxine 5'-phosphate (PNP) synthase domain; pyridoxal 5'-phosphate is the active form of vitamin B6 that acts as an essential, ubiquitous coenzyme in amino acid metabolism. In bacteria, formation of pyridoxine 5'-phosphate is a step in the biosynthesis of vitamin B6. PNP synthase, a homooctameric enzyme, catalyzes the final step in PNP biosynthesis, the condensation of 1-amino-acetone 3-phosphate and 1-deoxy-D-xylulose 5-phosphate. PNP synthase adopts a TIM barrel topology, intersubunit contacts are mediated by three ''extra'' helices, generating a tetramer of symmetric dimers with shared active sites; the open state has been proposed to accept substrates and to release products, while most of the catalytic events are likely to occur in the closed state; a hydrophilic channel running through the center of the barrel was identified as the essential structural feature that enables PNP synthase to release water molecules produced during the reaction from the closed, Probab=64.70 E-value=11 Score=17.80 Aligned_cols=98 Identities=17% Similarity=0.298 Sum_probs=68.6 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCC---EEEECCCCCCCHHHHHHHHHCCCEEEEEE--EEC--CCCCHHHHHHHHHHHHHH Q ss_conf 30000498999999999850692---89974976322678989870994698458--862--899999999999999999 Q gi|254780558|r 275 GAMLLSNKESAEVIFKEFAKRGL---LFFDDGSSPRNLTRVLAPKLNLPYMVADL--YLD--DQVDRDKIREKLKGLEEI 347 (396) Q Consensus 275 Gs~~t~~~~~m~~vl~~l~~rgL---~flDs~Ts~~Sva~~~A~~~gvp~~~~dv--fLD--~~~~~~~I~~qL~~l~~~ 347 (396) |=-+..+...++.+.+.|++.|. +|+|.. -..-..|++.|....+-+- |-+ +..+......++.++... T Consensus 102 Gld~~~~~~~L~~~i~~lk~~~IrvSLFIDPd----~~qi~~a~~~Gad~VElhTG~Ya~a~~~~~~~~el~~i~~aa~~ 177 (234) T cd00003 102 GLDVAGQAEKLKPIIERLKDAGIRVSLFIDPD----PEQIEAAKEVGADRVELHTGPYANAYDKAEREAELERIAKAAKL 177 (234) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCEEEEEECCC----HHHHHHHHHHCCCEEEEECHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 92665478899999999986598279972798----78999999849399998247878634810399999999999999 Q ss_pred HHHCCEEEEEEECCH-HHHHHHHHHHHHHHH Q ss_conf 987194899980887-889999999756887 Q gi|254780558|r 348 ARTTGQAIGVAVAFD-ESIEVISQWLQQEHV 377 (396) Q Consensus 348 Ar~~G~AI~igh~~p-~Ti~~L~~w~~~l~~ 377 (396) |++.|--|--||-.. +++..+.+ ++.+++ T Consensus 178 A~~lGL~VnAGHgLn~~Nl~~i~~-ip~i~E 207 (234) T cd00003 178 ARELGLGVNAGHGLNYENVKPIAK-IPGIAE 207 (234) T ss_pred HHHCCCEEECCCCCCHHHHHHHHC-CCCCEE T ss_conf 998598785478988767999855-899728 No 32 >COG1540 Uncharacterized proteins, homologs of lactam utilization protein B [General function prediction only] Probab=64.30 E-value=8.3 Score=18.46 Aligned_cols=189 Identities=14% Similarity=0.137 Sum_probs=122.5 Q ss_pred HHHHHHC--CCCCEEEEECCCCCH-HHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHC Q ss_conf 9999837--935089870898768-9999999985992999817665689877878554467899899999999999855 Q gi|254780558|r 188 QRAINLL--PANITLAFASNGNSL-DRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRG 264 (396) Q Consensus 188 ~~ai~~L--P~~vT~A~~P~~~~~-~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~ 264 (396) .++++.| -..|.+.|-...|.. .+..+.|.+.|--|=-|- +||+.. |=+-=....+++|+..-+..-++.+ T Consensus 22 De~~l~lvsSANIACGfHAGDp~~M~rtV~lA~e~gV~IGAHP-----gyPDl~-gFGRr~m~~~~~e~~a~~lYQiGAL 95 (252) T COG1540 22 DEALLPLVSSANIACGFHAGDPLTMRRTVRLAKENGVAIGAHP-----GYPDLV-GFGRREMALSPEELYAQVLYQIGAL 95 (252) T ss_pred CHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEECCCC-----CCCCCC-CCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 0999988756557540467887999999999998497663589-----986323-5674346899899999999999999 Q ss_pred CCC---E-EEEECC------CCCCCCCHHHHHHHHHHHHCCC--CEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCC- Q ss_conf 893---5-980012------3000049899999999985069--28997497632267898987099469845886289- Q gi|254780558|r 265 TGY---F-GVMNYR------GAMLLSNKESAEVIFKEFAKRG--LLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQ- 331 (396) Q Consensus 265 p~~---v-GvnNhm------Gs~~t~~~~~m~~vl~~l~~rg--L~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~- 331 (396) ... - +...|+ .-....|+..-+.|++.+..-. |.++- -+.|-....|++.|++... .+|-|.. T Consensus 96 ~a~~~a~G~~~~hVKpHGALYN~~a~D~~la~av~~av~~~dp~L~l~~---Lags~~~~~a~~~GL~~~~-EvFADR~Y 171 (252) T COG1540 96 QAFARAQGGVVQHVKPHGALYNQAAKDRALADAVAEAVAAFDPSLILMG---LAGSELLRAAKRAGLPVAE-EVFADRAY 171 (252) T ss_pred HHHHHHCCCEEEEECCCHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEE---CCCHHHHHHHHHCCCHHHH-HHHCCCCC T ss_conf 9999964975777553178898861189999999999997498956996---1857999999981954588-88413352 Q ss_pred ----------------CCHHHHHHHHHHHHHHH---HHCCEEEEE-----E-E-CCHHHHHHHHHHHHHHHHCCCEEEEH Q ss_conf ----------------99999999999999999---871948999-----8-0-88788999999975688769189967 Q gi|254780558|r 332 ----------------VDRDKIREKLKGLEEIA---RTTGQAIGV-----A-V-AFDESIEVISQWLQQEHVRDVSVVPL 385 (396) Q Consensus 332 ----------------~~~~~I~~qL~~l~~~A---r~~G~AI~i-----g-h-~~p~Ti~~L~~w~~~l~~~gi~lVpv 385 (396) .|++.+.+|-.+..+-. -..|..|.| | | --|..++...+.-..|++.||.+.++ T Consensus 172 ~~dG~Lv~R~~pgA~i~d~~~a~~qvl~m~~~g~v~a~dG~~v~v~adsiCvHGD~p~Al~~~~riR~~l~~~gi~i~~~ 251 (252) T COG1540 172 QPDGTLVPRSLPGAVIHDEEEALAQVLQMVREGKVTAIDGEWVAVEADSICVHGDNPHALAFARRIRAALEAEGIKVAAL 251 (252) T ss_pred CCCCCEECCCCCCCCCCCHHHHHHHHHHHHHCCCEEEECCCEEEEECCEEEECCCCHHHHHHHHHHHHHHHHCCCEEECC T ss_conf 88976711789985125999999999999864964832793886504468982897789999999999998769756237 Q ss_pred H Q ss_conf 7 Q gi|254780558|r 386 S 386 (396) Q Consensus 386 S 386 (396) + T Consensus 252 ~ 252 (252) T COG1540 252 A 252 (252) T ss_pred C T ss_conf 9 No 33 >TIGR01307 pgm_bpd_ind 2,3-bisphosphoglycerate-independent phosphoglycerate mutase; InterPro: IPR005995 This 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (iPGAM) is a metalloenzyme found particularly in eubacteria and higher plants. It is distantly related to archaeal iPGAM (IPR004456 from INTERPRO) and distinct from the unrelated cofactor-dependent PGAM (PIRSF001490 from PIRSF). Activity has been demonstrated for proteins from a variety of organisms, including Pseudomonas syringae pv. tomato , Bacillus subtilis , Bacillus stearothermophilus , maize , castor bean , and Trypanosoma brucei . The structure of the B. stearothermophilus enzyme (PDB:1EJJ) has two domains . Residues 1-76 and 311-511 form the phosphatase domain, containing the active site residue and two metal-binding sites. This domain is similar to alkaline phosphatase (PDB:1ALK) and arylsulphatase (PDB:1AUK), which are members of the SCOP alkaline phosphatase-like superfamily, but there is meagre sequence similarity outside of the metal-binding segments. Residues 77-310 form the phosphotransferase domain, which is poorly conserved (or perhaps unrelated) in the archaeal enzymes.; GO: 0004619 phosphoglycerate mutase activity, 0006007 glucose catabolic process. Probab=63.95 E-value=11 Score=17.71 Aligned_cols=181 Identities=14% Similarity=0.152 Sum_probs=120.9 Q ss_pred EEEEEECCCCCCCHH---HHHHHHHCCCCCEEEEECCCCCHHHHHHHHH--HCCCCEEEECCCCCCCCCCC-----CCCC Q ss_conf 589997377877168---8999983793508987089876899999999--85992999817665689877-----8785 Q gi|254780558|r 172 RIAIVVSGLGISQTG---TQRAINLLPANITLAFASNGNSLDRWMKEAK--KKGQEAILQIPMQAFDESYN-----EDDS 241 (396) Q Consensus 172 riAIVIddlG~~~~~---~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar--~~G~EvllhlPMEp~~~~~~-----~pGp 241 (396) -+=||+||.|++... ..+||.. ...|-.+++++..= ++|| +||.--=+..+.|+- +=|. T Consensus 3 ~vLvIlDG~G~~~~~srk~~nAi~~----------A~tP~~D~l~~~~pktaqG~-~ll~aSG~~VGLP~GQmGNSEVGH 71 (529) T TIGR01307 3 VVLVILDGWGYREDDSRKDGNAIFA----------AKTPVMDELIAAYPKTAQGY-SLLDASGLDVGLPDGQMGNSEVGH 71 (529) T ss_pred EEEEEECCCCCCCCCCCCHHHHHHH----------CCCCHHHHHHHCCCCCCCCC-EEEEECCCCCCCCCCCCCCCCHHH T ss_conf 6899976724788633101479871----------78775789986077510253-478610313578888636630521 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCH---------HHH Q ss_conf 54467899899999999999855893598001230000498999999999850692899749763226---------789 Q gi|254780558|r 242 YTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNL---------TRV 312 (396) Q Consensus 242 ~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sv---------a~~ 312 (396) -.|-.|---.+-..|+...+. -..|..+......+-++-+.++-.=|==.-|..-| --+ T Consensus 72 lniGAGRVVyQdl~rI~~~i~------------~g~f~~N~a~~~a~~~a~~~~~~lHl~GL~S~GGVHSh~~Hl~AL~~ 139 (529) T TIGR01307 72 LNIGAGRVVYQDLVRISQAIK------------DGEFFANPALLGAIDRAKDANGKLHLMGLVSDGGVHSHIDHLIALIE 139 (529) T ss_pred EECCCCCEEECCCEEEHHHHH------------CCCCCCCHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCHHHHHHHHH T ss_conf 002576356122032103542------------47321462799999999846891799973388950360889999999 Q ss_pred HHHHHCCCEEEEEEEECCC-CCHHHHHHHHHHHHHHHHHCCEEEE---EEECC----HHHHHHHHHHHHHH Q ss_conf 8987099469845886289-9999999999999999987194899---98088----78899999997568 Q gi|254780558|r 313 LAPKLNLPYMVADLYLDDQ-VDRDKIREKLKGLEEIARTTGQAIG---VAVAF----DESIEVISQWLQQE 375 (396) Q Consensus 313 ~A~~~gvp~~~~dvfLD~~-~~~~~I~~qL~~l~~~Ar~~G~AI~---igh~~----p~Ti~~L~~w~~~l 375 (396) +|++.|+.-..-|.|.|.. -.+.....-|+++....++.|.... +|+-| +.--++.+.-..-+ T Consensus 140 ~A~~~G~~k~~lH~f~DGRDt~P~s~~~~L~~l~~~~~~~g~g~Iasi~GRYYAMDRD~RWdRve~AY~~i 210 (529) T TIGR01307 140 LAAERGIEKVCLHAFTDGRDTAPKSAESYLEQLEAFLKEIGKGRIASISGRYYAMDRDQRWDRVEIAYKAI 210 (529) T ss_pred HHHHCCCEEEEEEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEEEEEEECCCCCCHHHHHHHHHHH T ss_conf 99668931799987607788888766789999999997249448999873445143467178999999986 No 34 >PRK07583 cytosine deaminase; Validated Probab=62.97 E-value=11 Score=17.60 Aligned_cols=130 Identities=18% Similarity=0.194 Sum_probs=65.2 Q ss_pred EEEECCCCC----HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH-CCCCEEEEECC Q ss_conf 987089876----8999999998599299981766568987787855446789989999999999985-58935980012 Q gi|254780558|r 200 LAFASNGNS----LDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRR-GTGYFGVMNYR 274 (396) Q Consensus 200 ~A~~P~~~~----~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~-~p~~vGvnNhm 274 (396) =++.+..++ +....+.|.+.|..|=+|+ +. ..++| ...++...+..... +.+-| .-.|- T Consensus 201 GGv~~~~~d~~~~l~~vf~lA~~~g~~vDiHl-----de-~~d~~---------~~~l~~la~~t~~~g~~grV-t~sH~ 264 (437) T PRK07583 201 GGVTYMNPDLDAQLDRLFALAAERGLDLDLHV-----DE-TGDPA---------SRTLKAVAEAALRNGFEGKV-TCGHC 264 (437) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC-----CC-CCCHH---------HHHHHHHHHHHHHCCCCCCE-EEEHH T ss_conf 38789885799999999999998099953235-----78-88717---------99999999999980999865-52023 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCCEEEECC------------CCCC--CH-HHHHHHHHCCCEEEEEEEECCCCCH----- Q ss_conf 30000498999999999850692899749------------7632--26-7898987099469845886289999----- Q gi|254780558|r 275 GAMLLSNKESAEVIFKEFAKRGLLFFDDG------------SSPR--NL-TRVLAPKLNLPYMVADLYLDDQVDR----- 334 (396) Q Consensus 275 Gs~~t~~~~~m~~vl~~l~~rgL~flDs~------------Ts~~--Sv-a~~~A~~~gvp~~~~dvfLD~~~~~----- 334 (396) -|.-..++...+.+++.|++.|+-++-.- ++|+ .+ --..-.+.||+.+... ||.+|. T Consensus 265 ~sL~~~~~~~~~~~~~~lA~agi~vvslP~~nl~Lq~R~~~~~P~~Rgv~pv~eL~~aGV~Valg~---Dni~Dpw~P~G 341 (437) T PRK07583 265 CSLAVQPEEQAQETLALVAEAGIAIVSLPMCNLYLQDRQPGRTPRWRGVTLVHELKAAGIPVAFAS---DNCRDPFYAYG 341 (437) T ss_pred HHHHCCCHHHHHHHHHHHHHCCCCEEECCCCCEECCCCCCCCCCCCCCCCCHHHHHHCCCEEEEEC---CCCCCCCCCCC T ss_conf 455329999999999999983994897565220003667777887688778699997898189937---98878877688 Q ss_pred -HHHHHHHHHHHHHH Q ss_conf -99999999999999 Q gi|254780558|r 335 -DKIREKLKGLEEIA 348 (396) Q Consensus 335 -~~I~~qL~~l~~~A 348 (396) ..+...+..+..++ T Consensus 342 ~gDmLe~~~~a~~~~ 356 (437) T PRK07583 342 DHDMLEVFREAVRIL 356 (437) T ss_pred CCCHHHHHHHHHHHH T ss_conf 988999999999997 No 35 >PRK07556 consensus Probab=62.82 E-value=1.3 Score=23.49 Aligned_cols=25 Identities=12% Similarity=0.292 Sum_probs=10.4 Q ss_pred CHHHHHHHHHHHHCCCCEEEECCCC Q ss_conf 9899999999985069289974976 Q gi|254780558|r 281 NKESAEVIFKEFAKRGLLFFDDGSS 305 (396) Q Consensus 281 ~~~~m~~vl~~l~~rgL~flDs~Ts 305 (396) +......++..+++.+.+-+|.-|+ T Consensus 373 ~~e~L~~li~~l~~~~~~a~DtETt 397 (977) T PRK07556 373 DLATLKAWIARARETGIVAFDTETT 397 (977) T ss_pred CHHHHHHHHHHHHHCCEEEEEEECC T ss_conf 8999999999987679589997318 No 36 >cd02071 MM_CoA_mut_B12_BD methylmalonyl CoA mutase B12 binding domain. This domain binds to B12 (adenosylcobamide), which initiates the conversion of succinyl CoA and methylmalonyl CoA by forming an adenosyl radical, which then undergoes a rearrangement exchanging a hydrogen atom with a group attached to a neighboring carbon atom. This family is present in both mammals and bacteria. Bacterial members are heterodimers and involved in the fermentation of pyruvate to propionate. Mammalian members are homodimers and responsible for the conversion of odd-chain fatty acids and branched-chain amino acids via propionyl CoA to succinyl CoA for further degradation. Probab=62.35 E-value=12 Score=17.53 Aligned_cols=76 Identities=13% Similarity=0.212 Sum_probs=53.2 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHH Q ss_conf 76899999999859929998176656898778785544678998999999999998558935980012300004989999 Q gi|254780558|r 207 NSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAE 286 (396) Q Consensus 207 ~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~ 286 (396) ....-.+...+.+|+||+. |....+++++ ++.+...=..++|++.++|+ +...|+ T Consensus 14 ~G~~iva~~l~d~GfeVi~------------------lG~~~s~eei---v~~A~~e~ad~IglSsL~g~----h~~~~~ 68 (122) T cd02071 14 RGAKVIARALRDAGFEVIY------------------TGLRQTPEEI---VEAAIQEDVDVIGLSSLSGG----HMTLFP 68 (122) T ss_pred HHHHHHHHHHHHCCCEEEE------------------CCCCCCHHHH---HHHHHHCCCCEEEEECCCCC----HHHHHH T ss_conf 7799999999978976996------------------7998899999---99999739989999646554----478999 Q ss_pred HHHHHHHCCCCE---EEECCCCCC Q ss_conf 999998506928---997497632 Q gi|254780558|r 287 VIFKEFAKRGLL---FFDDGSSPR 307 (396) Q Consensus 287 ~vl~~l~~rgL~---flDs~Ts~~ 307 (396) .+++.|+++|+- ++=-+.-+. T Consensus 69 ~l~~~L~e~G~~di~v~vGG~Ip~ 92 (122) T cd02071 69 EVIELLRELGAGDILVVGGGIIPP 92 (122) T ss_pred HHHHHHHHCCCCCCEEEEECCCCH T ss_conf 999999976999846999456498 No 37 >pfam03746 LamB_YcsF LamB/YcsF family. This family includes LamB. The lam locus of Aspergillus nidulans consists of two divergently transcribed genes, lamA and lamB, involved in the utilisation of lactams such as 2-pyrrolidinone. Both genes are under the control of the positive regulatory gene amdR and are subject to carbon and nitrogen metabolite repression. The exact molecular function of the proteins in this family is unknown. Probab=61.62 E-value=12 Score=17.45 Aligned_cols=188 Identities=15% Similarity=0.143 Sum_probs=111.2 Q ss_pred CCCCCC----HHHHHHHHHC--CCCCEEEEECCCC-CHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 778771----6889999837--9350898708987-68999999998599299981766568987787855446789989 Q gi|254780558|r 179 GLGISQ----TGTQRAINLL--PANITLAFASNGN-SLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQ 251 (396) Q Consensus 179 dlG~~~----~~~~~ai~~L--P~~vT~A~~P~~~-~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~ 251 (396) |||.+. .+..++|+.+ -..|.+.+-...+ .-.+.++.|.+.|-.|=-|- +||+.+ |=+=-...++++ T Consensus 7 DlGE~~g~~~~g~D~~lm~~I~saNIACG~HaGD~~~m~~tv~lA~~~~V~IGAHP-----sypD~~-~FGRr~~~~s~~ 80 (242) T pfam03746 7 DLGESFGAWTMGDDEALLPLVTSANIACGFHAGDPSVMRRTVRLAAEHGVRIGAHP-----GYPDLV-GFGRRAMALSAE 80 (242) T ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEEECCC-----CCCCCC-CCCCCCCCCCHH T ss_conf 07888788777881999867515888616357889999999999998499880889-----999767-899998889989 Q ss_pred HHHHHHHHHHHHCCCC---EEE-EECC---CC---CCCCCHHHHHHHHHHHHCC--CCEEEECCCCCCCHHHHHHHHHCC Q ss_conf 9999999999855893---598-0012---30---0004989999999998506--928997497632267898987099 Q gi|254780558|r 252 QLLNRLRYSLRRGTGY---FGV-MNYR---GA---MLLSNKESAEVIFKEFAKR--GLLFFDDGSSPRNLTRVLAPKLNL 319 (396) Q Consensus 252 ~~~~~l~~~l~~~p~~---vGv-nNhm---Gs---~~t~~~~~m~~vl~~l~~r--gL~flDs~Ts~~Sva~~~A~~~gv 319 (396) ++.+.+..-+..+-+. .|+ .+|. |. ....|+...+.+++.++.. +|.++- -+.|...++|++.|+ T Consensus 81 el~~~i~~Qi~al~~~a~~~g~~l~hVKPHGALYn~~~~d~~la~~i~~ai~~~~~~l~l~~---~~~s~~~~~A~~~Gl 157 (242) T pfam03746 81 ELYALVLYQIGALQAFARAQGLRLQHVKPHGALYNMAAKDRALARAVAEAVYDFDPALPLMG---LAGSALLDAAKEAGL 157 (242) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEEE---CCCHHHHHHHHHCCC T ss_conf 99999999999999999982996311353199999996599999999999998699852884---487599999987699 Q ss_pred CEEEEEEEECCCCCH----------H----HHHHHHHHHHHHHHH------CCEEEE-------EEECCHHHHHHHHHHH Q ss_conf 469845886289999----------9----999999999999987------194899-------9808878899999997 Q gi|254780558|r 320 PYMVADLYLDDQVDR----------D----KIREKLKGLEEIART------TGQAIG-------VAVAFDESIEVISQWL 372 (396) Q Consensus 320 p~~~~dvfLD~~~~~----------~----~I~~qL~~l~~~Ar~------~G~AI~-------igh~~p~Ti~~L~~w~ 372 (396) ++. ...|.|-.-+. . .....+++..++++. .|.-|- |-.-.|..++.++..- T Consensus 158 ~~~-~E~FADR~Y~~dG~Lv~R~~~gAvi~d~~~~~~qv~~~~~~g~V~t~~G~~i~i~adTiCvHgDtp~Ave~a~~ir 236 (242) T pfam03746 158 RLA-FEVFADRAYQPDGSLVPRSQPGAVIHDPEEAIAQVLQMVREGKVRAVDGEWVALEADTICVHGDNPHALAFARRIR 236 (242) T ss_pred CEE-EEEEECCCCCCCCCEEECCCCCCCCCCHHHHHHHHHHHHHCCCEEECCCCEEECCCCEEEECCCCHHHHHHHHHHH T ss_conf 715-8998515688899987413788766899999999999997798891589887526898998999978999999999 Q ss_pred HHHH Q ss_conf 5688 Q gi|254780558|r 373 QQEH 376 (396) Q Consensus 373 ~~l~ 376 (396) ..|+ T Consensus 237 ~~L~ 240 (242) T pfam03746 237 AALE 240 (242) T ss_pred HHHH T ss_conf 9997 No 38 >TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family; InterPro: IPR014228 Members of this entry include YlxY, and are related to IPR014132 from INTERPRO, which represents a subset of the polysaccharide deacetylase family found in a species if, and only if, the species forms endospores e.g. Bacillus subtilis or Clostridium tetani. Proteins in this entry are likewise restricted to spore-formers, but are not universally found among endospore-forming species.. Probab=59.26 E-value=12 Score=17.44 Aligned_cols=177 Identities=14% Similarity=0.149 Sum_probs=114.7 Q ss_pred CEEEEEECCCCCCCHHHHH---HHHHCCCCCEEEEEC-CCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCC Q ss_conf 6589997377877168899---998379350898708-987689999999985992999817665689877878554467 Q gi|254780558|r 171 ARIAIVVSGLGISQTGTQR---AINLLPANITLAFAS-NGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKV 246 (396) Q Consensus 171 ~riAIVIddlG~~~~~~~~---ai~~LP~~vT~A~~P-~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~ 246 (396) +-+|+.|- +--++.--.+ -+.+--..-||=+-= -...-.++|+..-.+||||==|- ++-|.=.+| T Consensus 86 ~MvalliN-VAWGnEYLp~~L~~L~Kh~VKATFFLeG~Wvk~n~~la~mI~~~GheiGNHa--------Y~HPdM~~L-- 154 (269) T TIGR02873 86 PMVALLIN-VAWGNEYLPKILKILKKHDVKATFFLEGKWVKENLQLAKMIVEEGHEIGNHA--------YNHPDMKTL-- 154 (269) T ss_pred CCEEEEEE-ECCCCCHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHHHHHCCCEECCCC--------CCCCCHHHC-- T ss_conf 51477875-1116101788987630004412575276255532798888876496463534--------588562230-- Q ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEE Q ss_conf 89989999999999985589359800123000049899999999985069289974976322678989870994698458 Q gi|254780558|r 247 TQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADL 326 (396) Q Consensus 247 ~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dv 326 (396) +.+++.+.|.. +=+.+=..+..+=..|==--=|-|-..-++|.+.+|-+.==.| T Consensus 155 --s~~~~~dql~~------------------------TN~~i~a~i~~~pkWFAPPSGSF~d~Vv~~Aad~~M~TimWtV 208 (269) T TIGR02873 155 --SKERISDQLDK------------------------TNEVIEATIGVKPKWFAPPSGSFRDEVVQIAADLQMGTIMWTV 208 (269) T ss_pred --CHHHHHHHHHH------------------------HHHHHHHHHCCCCCEECCCCCCHHHHHHHHHHHCCCCEEEEEE T ss_conf --34778999988------------------------8999999857987040288876415778778746896388743 Q ss_pred E-EC-CCCCHHHHHHHHHHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEHHHHH Q ss_conf 8-62-899999999999999999987194899980887889999999756887691899677743 Q gi|254780558|r 327 Y-LD-DQVDRDKIREKLKGLEEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSVVPLSCLA 389 (396) Q Consensus 327 f-LD-~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~lVpvS~l~ 389 (396) . || -.++++.+..+..+ |-|--|+..=||..-|-+.|...+-.++++|+.|=.|++|+ T Consensus 209 DTIDWk~P~p~vl~~Rvl~-----KiHpGA~vLMHPT~Ssa~gLe~lIt~ikekGy~iGtv~~Ll 268 (269) T TIGR02873 209 DTIDWKNPEPSVLVDRVLS-----KIHPGAMVLMHPTESSAEGLEELITKIKEKGYKIGTVTELL 268 (269) T ss_pred CCCCCCCCCHHHHHHHHHH-----HCCCCCEEECCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHC T ss_conf 1124778874689888774-----06898657648997668999999899976286300155324 No 39 >PRK11545 gntK gluconate kinase 1; Provisional Probab=55.08 E-value=15 Score=16.75 Aligned_cols=85 Identities=11% Similarity=0.105 Sum_probs=50.6 Q ss_pred CCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHH-HHHCCE-EE Q ss_conf 0049899999999985069289974976322678989870994698458862899999999999999999-987194-89 Q gi|254780558|r 278 LLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEI-ARTTGQ-AI 355 (396) Q Consensus 278 ~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~-Ar~~G~-AI 355 (396) ..+-.+.-+-++... ..++.||-=.-+..-+...+..+. ++--...-++.||+.||.- +.+.+. .| T Consensus 87 CSALKr~YRd~Lr~~-~~~~~fv~L~g~~~~i~~Rl~~R~-----------~HFmp~~LL~SQf~tLE~P~~~E~~~~~v 154 (177) T PRK11545 87 CSALKKHYRDLLREG-NPNLSFIYLKGDFDVIESRLKARK-----------GHFFKTQMLVTQFETLQEPGADETDVLVV 154 (177) T ss_pred ECCCCHHHHHHHHHC-CCCEEEEEECCCHHHHHHHHHHCC-----------CCCCCHHHHHHHHHHHCCCCCCCCCEEEE T ss_conf 701119999999806-997599997299999999997464-----------68998789998999817999888986999 Q ss_pred EEEECCHHHHHHHHHHHHH Q ss_conf 9980887889999999756 Q gi|254780558|r 356 GVAVAFDESIEVISQWLQQ 374 (396) Q Consensus 356 ~igh~~p~Ti~~L~~w~~~ 374 (396) =|..+..+-++.+.+|+++ T Consensus 155 di~~~~e~iv~~il~~l~~ 173 (177) T PRK11545 155 DIDQPLEGVVASTIEVIKK 173 (177) T ss_pred ECCCCHHHHHHHHHHHHHC T ss_conf 7899999999999999975 No 40 >pfam06415 iPGM_N BPG-independent PGAM N-terminus (iPGM_N). This family represents the N-terminal region of the 2,3-bisphosphoglycerate-independent phosphoglycerate mutase (or phosphoglyceromutase or BPG-independent PGAM) protein (EC:5.4.2.1). The family is found in conjunction with pfam01676 (located in the C-terminal region of the protein). Probab=54.79 E-value=16 Score=16.72 Aligned_cols=41 Identities=20% Similarity=0.261 Sum_probs=18.7 Q ss_pred HHHHHHCCCEEEEEEEECCC-CCHHHHHHHHHHHHHHHHHCC Q ss_conf 98987099469845886289-999999999999999998719 Q gi|254780558|r 312 VLAPKLNLPYMVADLYLDDQ-VDRDKIREKLKGLEEIARTTG 352 (396) Q Consensus 312 ~~A~~~gvp~~~~dvfLD~~-~~~~~I~~qL~~l~~~Ar~~G 352 (396) ++|.+.|++-..-+.|+|.. ..+..-...|++++..-.+.| T Consensus 53 ~~a~~~gv~~v~lH~f~DGRDt~P~Sa~~yl~~l~~~l~~~~ 94 (223) T pfam06415 53 KLAKERGVKKVYIHAFTDGRDVAPKSAKGFLEELEAFLKELG 94 (223) T ss_pred HHHHHHCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHCC T ss_conf 999971887089998606888883679999999999765405 No 41 >PRK09825 idnK D-gluconate kinase; Provisional Probab=54.55 E-value=16 Score=16.70 Aligned_cols=84 Identities=13% Similarity=0.109 Sum_probs=43.4 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHH-HCCE-EEEEEE Q ss_conf 89999999998506928997497632267898987099469845886289999999999999999998-7194-899980 Q gi|254780558|r 282 KESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIAR-TTGQ-AIGVAV 359 (396) Q Consensus 282 ~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar-~~G~-AI~igh 359 (396) .+.-+-++..- .-++.||-=..+...+...+..+.| |=....-++.||+.||.-+. +++. .|=|.+ T Consensus 86 K~~YRd~Lr~~-~~~v~fv~L~g~~~~i~~Rl~~R~g-----------HFMp~~LL~SQf~tLE~P~~dE~~v~~idi~~ 153 (176) T PRK09825 86 KKQYRDILRKS-SPNVHFLWLDGDYETILARMQRRAG-----------HFMPPDLLQSQFDALERPCADEHDIARIDVNH 153 (176) T ss_pred HHHHHHHHHCC-CCCEEEEEEECCHHHHHHHHHHCCC-----------CCCCHHHHHHHHHHHCCCCCCCCCEEEEECCC T ss_conf 79999999747-9987999971899999999974603-----------79997999989998179998889869997899 Q ss_pred CCHHHHHHHHHHHHHHHH Q ss_conf 887889999999756887 Q gi|254780558|r 360 AFDESIEVISQWLQQEHV 377 (396) Q Consensus 360 ~~p~Ti~~L~~w~~~l~~ 377 (396) +..+-++.+.+|++.+++ T Consensus 154 ~~e~iv~~~~~al~a~~~ 171 (176) T PRK09825 154 DIENVTEQCRQAVQAFRQ 171 (176) T ss_pred CHHHHHHHHHHHHHHHHH T ss_conf 999999999999999998 No 42 >TIGR01011 rpsB_bact ribosomal protein S2; InterPro: IPR005706 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the bacterial, mitochondrial and chloroplast forms of ribosomal protein S2.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=54.26 E-value=11 Score=17.64 Aligned_cols=35 Identities=29% Similarity=0.439 Sum_probs=19.8 Q ss_pred CCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCH Q ss_conf 928997497632267898987099469845886289999 Q gi|254780558|r 296 GLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDR 334 (396) Q Consensus 296 gL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~ 334 (396) |+.||+| |.-.-.|-..|+++|||... .+|..-|+ T Consensus 159 d~l~viD-~~~E~IAv~EA~kLgIPvvA---~vDTNCdP 193 (227) T TIGR01011 159 DLLFVID-PRKEKIAVAEARKLGIPVVA---IVDTNCDP 193 (227) T ss_pred CEEEEEC-CCCCHHHHHHHHHCCCCEEE---EECCCCCC T ss_conf 7089974-82106899997337997898---70478888 No 43 >PRK07898 consensus Probab=53.85 E-value=0.97 Score=24.35 Aligned_cols=16 Identities=6% Similarity=-0.076 Sum_probs=5.3 Q ss_pred HHHHHHHHCCCCEEEE Q ss_conf 9999998506928997 Q gi|254780558|r 286 EVIFKEFAKRGLLFFD 301 (396) Q Consensus 286 ~~vl~~l~~rgL~flD 301 (396) +++...+.....+.+| T Consensus 328 ~~l~~~~~~~~~~a~~ 343 (902) T PRK07898 328 QWLAEHAGDGRRLGLA 343 (902) T ss_pred HHHHHHHCCCCEEEEE T ss_conf 9999974359869999 No 44 >cd01122 GP4d_helicase GP4d_helicase is a homohexameric 5'-3' helicases. Helicases couple NTP hydrolysis to the unwinding of nucleic acid duplexes into their component strands. Probab=53.37 E-value=16 Score=16.58 Aligned_cols=147 Identities=19% Similarity=0.220 Sum_probs=82.8 Q ss_pred CCCEEEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 68658999737787716889999837935089870898768999999998599299981766568987787855446789 Q gi|254780558|r 169 SGARIAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQ 248 (396) Q Consensus 169 ~~~riAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~ 248 (396) +...+-||-..=|.+.+.-.. . -....+.+.|+-|++ +...| T Consensus 28 ~~GeL~viaarpg~GKT~f~~---~-----------------~a~~~~~~~g~~vl~------------------~SlEm 69 (271) T cd01122 28 RKGELIILTAGTGVGKTTFLR---E-----------------YALDLITQHGVRVGT------------------ISLEE 69 (271) T ss_pred CCCCEEEEEECCCCCHHHHHH---H-----------------HHHHHHHHCCCEEEE------------------EECCC T ss_conf 998089999689986999999---9-----------------999999976990899------------------97049 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCC---CCC---HHHHHHHHHCCCEE Q ss_conf 989999999999985589359800123000049899999999985069289974976---322---67898987099469 Q gi|254780558|r 249 TVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSS---PRN---LTRVLAPKLNLPYM 322 (396) Q Consensus 249 ~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts---~~S---va~~~A~~~gvp~~ 322 (396) +.+++..|+...++.++.. ...........-+...++.+...+.+|+.+... ... .+..+.++.|+.. T Consensus 70 ~~~~~~~Rlls~~~g~~~~-----~~~~~~~~~~e~~~~~~~~~~~~~~l~i~d~~~~~~~~~i~~~ir~~~~~~~~~~- 143 (271) T cd01122 70 PVVRTARRLLGQYAGKRLH-----LPDTVFIYTLEEFDAAFDEFEGTGRLFMYDSFGEYSMDSVLEKVRYMAVSHGIQH- 143 (271) T ss_pred CHHHHHHHHHHHHHCCCHH-----HCCCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCE- T ss_conf 9999999999998299711-----0344677809999999999707998088789999889999999999998289988- Q ss_pred EEEEEECCCC----------C-HHHHHHHHHHHHHHHHHCCEEE-EEEECCH Q ss_conf 8458862899----------9-9999999999999998719489-9980887 Q gi|254780558|r 323 VADLYLDDQV----------D-RDKIREKLKGLEEIARTTGQAI-GVAVAFD 362 (396) Q Consensus 323 ~~dvfLD~~~----------~-~~~I~~qL~~l~~~Ar~~G~AI-~igh~~p 362 (396) +|||.-+ + ..+|..-..+|..+|++.+..| .++|+.. T Consensus 144 ---vvIDylqll~~~~~~~~d~~~~i~~i~~~Lk~lAke~~v~Vi~lsQlnR 192 (271) T cd01122 144 ---IIIDNLSIMVSDERASGDERKALDEIMTKLRGFATEHGIHITLVSHLRR 192 (271) T ss_pred ---EEEEHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEECCCC T ss_conf ---9983178503678677318999999999999999997997799952676 No 45 >PRK05929 consensus Probab=53.31 E-value=5.2 Score=19.74 Aligned_cols=19 Identities=11% Similarity=0.188 Sum_probs=7.7 Q ss_pred CCCHHHHHHHHHHHHCCCC Q ss_conf 0498999999999850692 Q gi|254780558|r 279 LSNKESAEVIFKEFAKRGL 297 (396) Q Consensus 279 t~~~~~m~~vl~~l~~rgL 297 (396) ..+......++..++...+ T Consensus 295 i~~~~~l~~~~~~l~~~~~ 313 (870) T PRK05929 295 IKDPESLTNVLNLLQGGEV 313 (870) T ss_pred ECCHHHHHHHHHHHCCCCE T ss_conf 0898999999998517974 No 46 >PRK13479 2-aminoethylphosphonate--pyruvate transaminase; Provisional Probab=52.66 E-value=17 Score=16.51 Aligned_cols=41 Identities=10% Similarity=0.013 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 999999999855893598001230000498999999999850692 Q gi|254780558|r 253 LLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGL 297 (396) Q Consensus 253 ~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL 297 (396) ....|+.+|..+-.--|+.| |+.......+.+-+.+++-|| T Consensus 246 li~aL~eaL~~i~~EGgl~~----r~~r~~~~~~~~~~~~~~lGl 286 (368) T PRK13479 246 VVAAFYQALLQLEEEGGVPA----RGARYARNCRTLVAGMRALGF 286 (368) T ss_pred HHHHHHHHHHHHHHHCCHHH----HHHHHHHHHHHHHHHHHHCCC T ss_conf 99999999999998318899----999999999999999997387 No 47 >PRK10927 essential cell division protein; Provisional Probab=52.64 E-value=17 Score=16.57 Aligned_cols=21 Identities=10% Similarity=0.159 Sum_probs=9.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999998988750658 Q gi|254780558|r 29 LFLLFCTFIVGLSIYILISHA 49 (396) Q Consensus 29 ~~~l~~~~~~~~~~~~~~~~~ 49 (396) ++++++++++|++|+.+++|. T Consensus 37 ~ala~avlv~FiGGLyFith~ 57 (319) T PRK10927 37 VAIAAAVLVTFIGGLYFITHH 57 (319) T ss_pred HHHHHHHHHHHHCCEEEEECC T ss_conf 999999999975004788705 No 48 >TIGR03006 pepcterm_polyde polysaccharide deactylase family protein, PEP-CTERM locus subfamily; InterPro: IPR014344 Members of this entry belong to the family of polysaccharide deacetylases. All are found in species that encode the PEP-CTERM/exosortase system and are predicted to act in protein sorting in a number of Gram-negative bacteria, they are also found near the gene coding region for the EpsH homologue that is the putative exosortase gene.. Probab=50.92 E-value=18 Score=16.34 Aligned_cols=102 Identities=14% Similarity=0.136 Sum_probs=83.1 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCE--EEEECCCCCCCCCHHH Q ss_conf 76899999999859929998176656898778785544678998999999999998558935--9800123000049899 Q gi|254780558|r 207 NSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYF--GVMNYRGAMLLSNKES 284 (396) Q Consensus 207 ~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~v--GvnNhmGs~~t~~~~~ 284 (396) .-..+++++..++||||-=| +| -+-+-+++++++.+.-+..+..-+-+.. -|+-|.---|+-+.. T Consensus 60 ERyP~LVr~Iv~~GHElASH------Gy------~H~Rv~~qtp~~Fr~Di~rsK~lLEDl~G~~V~GYRAPSFSI~~~- 126 (274) T TIGR03006 60 ERYPELVRRIVAAGHELASH------GY------GHERVTTQTPEEFRADIRRSKKLLEDLSGQAVRGYRAPSFSIGKS- 126 (274) T ss_pred HCCHHHHHHHHHCCCEEEEC------CC------HHHHHHHCCHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCEECCC- T ss_conf 40528899999719826532------41------035554089879999999999997257886222344662113788- Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEE Q ss_conf 9999999850692899749763226789898709946984588 Q gi|254780558|r 285 AEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLY 327 (396) Q Consensus 285 m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvf 327 (396) --|.+++|.+.|.-| || ||.+--=.-+|||=+.|+-| T Consensus 127 n~WA~d~L~e~GY~Y-sS-----SiyPi~HD~YGmPdaPrfaf 163 (274) T TIGR03006 127 NLWALDVLAEAGYAY-SS-----SIYPIRHDHYGMPDAPRFAF 163 (274) T ss_pred CHHHHHHHHHCCCCC-CC-----CCCCCCCCCCCCCCCCCCCE T ss_conf 444899998637851-35-----51354266268877888874 No 49 >TIGR02066 dsrB sulfite reductase, dissimilatory-type beta subunit; InterPro: IPR011808 Dissimilatory sulphite reductase catalyzes the six-electron reduction of sulphite to sulphide, as the terminal reaction in dissimilatory sulphate reduction. It remains unclear however, whether trithionate and thiosulphate serve as intermediate compounds to sulphide, or as end products of sulphite reduction . Sulphite reductase is a multisubunit enzyme composed of dimers of either alpha/beta or alpha/beta/gamma subunits, each containing a siroheme and iron sulphur cluster prosthetic centre . Found in sulphate-reducing bacteria, these genes are commonly located in a unidirectional gene cluster . This entry describes the beta subunit of sulphite reductase.; GO: 0018551 hydrogensulfite reductase activity, 0051539 4 iron 4 sulfur cluster binding, 0006118 electron transport. Probab=50.39 E-value=16 Score=16.72 Aligned_cols=32 Identities=28% Similarity=0.471 Sum_probs=18.8 Q ss_pred CCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHH Q ss_conf 004989999999998506928997497632267 Q gi|254780558|r 278 LLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLT 310 (396) Q Consensus 278 ~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva 310 (396) |++|+.-.+.+.++|++||++|.=-||.. ||. T Consensus 74 lv~d~~K~~~li~~L~~RG~GFpvGGt~~-av~ 105 (366) T TIGR02066 74 LVSDESKVQPLIDELEKRGLGFPVGGTGG-AVA 105 (366) T ss_pred EECCHHHHHHHHHHHHHCCCEEECCCCHH-CCC T ss_conf 74587887799999971875000147100-010 No 50 >pfam06506 PrpR_N Propionate catabolism activator. This domain is found at the N terminus of several sigma54- dependent transcriptional activators including PrpR, which activates catabolism of propionate. Probab=50.19 E-value=18 Score=16.26 Aligned_cols=118 Identities=20% Similarity=0.215 Sum_probs=70.9 Q ss_pred CCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 93508987089876899999999859929998176656898778785544678998999999999998558935980012 Q gi|254780558|r 195 PANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYR 274 (396) Q Consensus 195 P~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhm 274 (396) +.|| +.+-..+-+.-+....|++.+.-+-+ .+|++.-+| .+.+...| |+. . T Consensus 51 ~iPV-v~I~~s~~Dil~al~~a~~~~~kiav------vg~~~~~~~-------------~~~~~~il-------~~~--i 101 (169) T pfam06506 51 SVPV-VEIKVSGFDLLRALARARRYGGRIGL------VGYENIIPG-------------LKALSELL-------GLD--I 101 (169) T ss_pred CCCE-EEEECCHHHHHHHHHHHHHHCCCEEE------EECCCCCHH-------------HHHHHHHH-------CCC--E T ss_conf 9988-99827886999999999975897999------927630368-------------99999996-------993--5 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 30000498999999999850692899749763226789898709946984588628999999999999999999871 Q gi|254780558|r 275 GAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTT 351 (396) Q Consensus 275 Gs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~ 351 (396) =-..-.++.-.+..+..+++.|.-.+=. .+++..+|+++|+++. +| ..+.+.|++.|+++.++++.. T Consensus 102 ~~~~~~~~~e~~~~i~~l~~~G~~vvVG----~~~~~~~A~~~Gl~~v----li--~sg~eSI~~A~~eA~~l~~~~ 168 (169) T pfam06506 102 VQRAYQSEEEARAAVKELKAQGIKVIVG----DGLVCDLAEQAGLQGV----LI--YSGKESVREAFEEALELARAL 168 (169) T ss_pred EEEEECCHHHHHHHHHHHHHCCCEEEEC----CCHHHHHHHHCCCCEE----EE--ECCHHHHHHHHHHHHHHHHHH T ss_conf 9999668899999999999869959985----8289999998399579----99--667899999999999999742 No 51 >pfam03979 Sigma70_r1_1 Sigma-70 factor, region 1.1. Region 1.1 modulates DNA binding by region 2 and 4 when sigma is unbound by the core RNA polymerase. Region 1.1 is also involved in promoter binding. Probab=50.16 E-value=17 Score=16.57 Aligned_cols=47 Identities=19% Similarity=0.171 Sum_probs=25.8 Q ss_pred HHHHHHHHHHHHHCCEEEE--EEECCH---HHHHHHHHHHHHHHHCCCEEEE Q ss_conf 9999999999987194899--980887---8899999997568876918996 Q gi|254780558|r 338 REKLKGLEEIARTTGQAIG--VAVAFD---ESIEVISQWLQQEHVRDVSVVP 384 (396) Q Consensus 338 ~~qL~~l~~~Ar~~G~AI~--igh~~p---~Ti~~L~~w~~~l~~~gi~lVp 384 (396) ..++++|...+|++|+.-- |-...| .+-+.+...+..|...||.+|- T Consensus 6 ~~~ik~LI~~GK~~GylTy~eIn~~Lp~~~~~~d~ie~i~~~L~~~gI~Vve 57 (79) T pfam03979 6 QAQVKKLIEQGKKRGYLTYDEINEALPPDDVDSEQIDDIISMLEDMGIEVVE 57 (79) T ss_pred HHHHHHHHHHHHHCCEEEHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCEEEE T ss_conf 9999999998764583639999976887679999999999999988996852 No 52 >pfam01171 ATP_bind_3 PP-loop family. This family of proteins belongs to the PP-loop superfamily. Probab=49.90 E-value=19 Score=16.24 Aligned_cols=90 Identities=18% Similarity=0.257 Sum_probs=53.3 Q ss_pred CHHHHHHHHHHHHCC-----CCEEEECCCCCCCH-----HHHHHHHHCCCEEEEEEEECCC--CCHHHHHHH--HHHHHH Q ss_conf 989999999998506-----92899749763226-----7898987099469845886289--999999999--999999 Q gi|254780558|r 281 NKESAEVIFKEFAKR-----GLLFFDDGSSPRNL-----TRVLAPKLNLPYMVADLYLDDQ--VDRDKIREK--LKGLEE 346 (396) Q Consensus 281 ~~~~m~~vl~~l~~r-----gL~flDs~Ts~~Sv-----a~~~A~~~gvp~~~~dvfLD~~--~~~~~I~~q--L~~l~~ 346 (396) |.-+|-.++..+++. ..++||-+.-+.|- +.+.++++|+|+....+.+... .+.++..+. ...+.+ T Consensus 11 DS~~Ll~~l~~~~~~~~~~~~~~hvnh~lr~~s~~~~~~v~~~~~~~~i~~~i~~~~~~~~~~~~~e~~aR~~Ry~~l~~ 90 (182) T pfam01171 11 DSMALLYLLKKLKPKFGIDLTAAHVDHGLREESDREAQFVKELCRQLNIPLEVLRVDVAKKSGLNLEEAAREARYDFFEE 90 (182) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99999999999999759978999987998654058999999999985997599997347667877757899999999999 Q ss_pred HHHHCCE-EEEEEECCHHHHHHHHH Q ss_conf 9987194-89998088788999999 Q gi|254780558|r 347 IARTTGQ-AIGVAVAFDESIEVISQ 370 (396) Q Consensus 347 ~Ar~~G~-AI~igh~~p~Ti~~L~~ 370 (396) .|+++|. .|..||-.++-++.+.- T Consensus 91 ~a~~~~~~~i~lgHh~DD~~ET~lm 115 (182) T pfam01171 91 IAKKNGAEVLLTAHHADDQAETFLM 115 (182) T ss_pred HHHHHCCCEEEEECCCCHHHHHHHH T ss_conf 8986176648874342329999999 No 53 >PRK02261 methylaspartate mutase subunit S; Provisional Probab=49.60 E-value=19 Score=16.21 Aligned_cols=64 Identities=14% Similarity=0.103 Sum_probs=47.1 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHH Q ss_conf 99999999859929998176656898778785544678998999999999998558935980012300004989999999 Q gi|254780558|r 210 DRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIF 289 (396) Q Consensus 210 ~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl 289 (396) .-.+...+.+|+||+- |-...+++++ +..++..-..++|+....|... ..++.++ T Consensus 21 ~iva~~l~~~GfeVi~------------------lG~~~~pe~~---v~~A~~~~ad~igiSsl~G~~~----~~~~~l~ 75 (137) T PRK02261 21 KILDRALTEAGFEVIN------------------LGVMTSQEEF---IDAAIETDADAILVSSLYGHGE----IDCRGLR 75 (137) T ss_pred HHHHHHHHHCCCEEEE------------------CCCCCCHHHH---HHHHHHCCCCEEEECCCCCCCC----HHHHHHH T ss_conf 9999999978974984------------------6887999999---9999873999999711112661----2799999 Q ss_pred HHHHCCCCE Q ss_conf 998506928 Q gi|254780558|r 290 KEFAKRGLL 298 (396) Q Consensus 290 ~~l~~rgL~ 298 (396) +.|+++|+. T Consensus 76 ~~l~~~g~~ 84 (137) T PRK02261 76 EKCIEAGLG 84 (137) T ss_pred HHHHHCCCC T ss_conf 999967999 No 54 >KOG0023 consensus Probab=49.07 E-value=19 Score=16.15 Aligned_cols=151 Identities=19% Similarity=0.277 Sum_probs=72.7 Q ss_pred CCEEEEE-ECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8658999-737787716889999837935089870898768999999998599299981766568987787855446789 Q gi|254780558|r 170 GARIAIV-VSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQ 248 (396) Q Consensus 170 ~~riAIV-IddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~ 248 (396) +-++.|+ .+||| -.+.+=| .++...||. |+ ....+.-+.-+.-|-|..+ ... T Consensus 182 G~~vgI~GlGGLG--h~aVq~A-KAMG~rV~v-is---~~~~kkeea~~~LGAd~fv--------------------~~~ 234 (360) T KOG0023 182 GKWVGIVGLGGLG--HMAVQYA-KAMGMRVTV-IS---TSSKKKEEAIKSLGADVFV--------------------DST 234 (360) T ss_pred CCEEEEECCCCCC--HHHHHHH-HHHCCEEEE-EE---CCCHHHHHHHHHCCCCEEE--------------------EEC T ss_conf 8689885576401--6899999-870867999-92---7853489999854962158--------------------842 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHC-CCEEEEEEE Q ss_conf 9899999999999855893598001230000498999999999850692899749763226789898709-946984588 Q gi|254780558|r 249 TVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLN-LPYMVADLY 327 (396) Q Consensus 249 ~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~g-vp~~~~dvf 327 (396) .+.++...+...++-+ ..|+.|. ...+|++.++.+|.+|-..+=....-. . ..- .|.....+- T Consensus 235 ~d~d~~~~~~~~~dg~--~~~v~~~-------a~~~~~~~~~~lk~~Gt~V~vg~p~~~-~------~~~~~~lil~~~~ 298 (360) T KOG0023 235 EDPDIMKAIMKTTDGG--IDTVSNL-------AEHALEPLLGLLKVNGTLVLVGLPEKP-L------KLDTFPLILGRKS 298 (360) T ss_pred CCHHHHHHHHHHHCCC--CEEEEEC-------CCCCHHHHHHHHCCCCEEEEEECCCCC-C------CCCCCCCEECCEE T ss_conf 7878999998752675--0232102-------324438888761039789998366776-5------6765000024379 Q ss_pred ECCCCCHHHHHHHHHHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEHHHH Q ss_conf 6289999999999999999998719489998088788999999975688769189967774 Q gi|254780558|r 328 LDDQVDRDKIREKLKGLEEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSVVPLSCL 388 (396) Q Consensus 328 LD~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~lVpvS~l 388 (396) | .|+.|| -+.+|-++|.-....+-.-.|++|++|.+ T Consensus 299 I----------------------~GS~vG---~~ket~E~Ldf~a~~~ik~~IE~v~~~~v 334 (360) T KOG0023 299 I----------------------KGSIVG---SRKETQEALDFVARGLIKSPIELVKLSEV 334 (360) T ss_pred E----------------------EECCCC---CHHHHHHHHHHHHCCCCCCCEEEEEHHHH T ss_conf 8----------------------740325---48879999999971788674699754277 No 55 >PRK08928 consensus Probab=48.87 E-value=5 Score=19.83 Aligned_cols=21 Identities=24% Similarity=0.286 Sum_probs=8.6 Q ss_pred CHHHHHHHHHHHHCCCCEEEE Q ss_conf 989999999998506928997 Q gi|254780558|r 281 NKESAEVIFKEFAKRGLLFFD 301 (396) Q Consensus 281 ~~~~m~~vl~~l~~rgL~flD 301 (396) +...+..++..+..-+++=+| T Consensus 310 ~~~~l~~~~~~~~~~~~~a~~ 330 (861) T PRK08928 310 NAEELEDFAEKCEKEGIFAIY 330 (861) T ss_pred CHHHHHHHHHHHHHCCEEEEE T ss_conf 989999999987657968999 No 56 >PRK09230 cytosine deaminase; Provisional Probab=48.82 E-value=19 Score=16.13 Aligned_cols=29 Identities=21% Similarity=0.243 Sum_probs=13.1 Q ss_pred ECCCCCCCCCHHHHHHHHHHHHCCCCEEE Q ss_conf 01230000498999999999850692899 Q gi|254780558|r 272 NYRGAMLLSNKESAEVIFKEFAKRGLLFF 300 (396) Q Consensus 272 NhmGs~~t~~~~~m~~vl~~l~~rgL~fl 300 (396) .|--|.-..++.....+++.+++.|+-++ T Consensus 245 sH~~sL~~~~~~~~~~~~~~laeagi~vv 273 (426) T PRK09230 245 SHTTAMHSYNGAYTSRLFRLLKMSGINFV 273 (426) T ss_pred ECCHHHHCCCHHHHHHHHHHHHHCCCCEE T ss_conf 00054533999999999999998298578 No 57 >pfam04120 Iron_permease Low affinity iron permease. Probab=46.79 E-value=21 Score=15.93 Aligned_cols=48 Identities=17% Similarity=0.173 Sum_probs=39.9 Q ss_pred CCCHHHHHHHHHHHHHHHH-HCCEEEEEEECCHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999999998-71948999808878899999997568876 Q gi|254780558|r 331 QVDRDKIREKLKGLEEIAR-TTGQAIGVAVAFDESIEVISQWLQQEHVR 378 (396) Q Consensus 331 ~~~~~~I~~qL~~l~~~Ar-~~G~AI~igh~~p~Ti~~L~~w~~~l~~~ 378 (396) ..|-.++..+|++|.+.-+ .++..|+|-|.-.++++.|.+.+.++..+ T Consensus 65 nrD~~Al~~KLdELi~a~~~A~n~~i~iE~l~e~el~~l~~~y~~~a~~ 113 (133) T pfam04120 65 NRDDAALQLKLDELIRATNNAKNPLIGLEDLTEEELEELRAEYERLAEA 113 (133) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 5117889999999999877400654052459999999999999999999 No 58 >KOG2703 consensus Probab=46.36 E-value=16 Score=16.65 Aligned_cols=36 Identities=14% Similarity=0.102 Sum_probs=30.8 Q ss_pred ECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCC Q ss_conf 012300004989999999998506928997497632 Q gi|254780558|r 272 NYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPR 307 (396) Q Consensus 272 NhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~ 307 (396) +..|+|||.-+..+.++-+.|..|-.+|-||..... T Consensus 347 ~~LgGrfTTiEGLl~~~~e~l~~~~f~~~DS~~~~~ 382 (460) T KOG2703 347 GTLGGRFTTIEGLLTQIREQLDGRSFTFGDSMDEGQ 382 (460) T ss_pred CCCCCEEEEHHHHHHHHHHHHHCCCEECCCCCCHHH T ss_conf 624532453799999999997257400245678889 No 59 >PRK04182 cytidylate kinase; Provisional Probab=46.30 E-value=16 Score=16.59 Aligned_cols=22 Identities=23% Similarity=0.235 Sum_probs=11.8 Q ss_pred HHHHHHHHHCCCCEEEECCCCC Q ss_conf 9999999850692899749763 Q gi|254780558|r 285 AEVIFKEFAKRGLLFFDDGSSP 306 (396) Q Consensus 285 m~~vl~~l~~rgL~flDs~Ts~ 306 (396) ...+..++.+.|=..++.+-.. T Consensus 63 ~~~~~~~~a~~~~~Vi~GR~~~ 84 (178) T PRK04182 63 IDRRQLELAKRGNVVLEGRLAG 84 (178) T ss_pred HHHHHHHHHHCCCEEEECCCCC T ss_conf 9999999985399899838887 No 60 >pfam01522 Polysacc_deac_1 Polysaccharide deacetylase. This domain is found in polysaccharide deacetylase. This family of polysaccharide deacetylases includes NodB (nodulation protein B from Rhizobium) which is a chitooligosaccharide deacetylase. It also includes chitin deacetylase from yeast, and endoxylanases which hydrolyses glucosidic bonds in xylan. Probab=45.64 E-value=22 Score=15.82 Aligned_cols=86 Identities=9% Similarity=0.027 Sum_probs=57.9 Q ss_pred EEEEEECCCCCCCH--HHHHHHHHCCCCCEEEEECCCCC-HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 58999737787716--88999983793508987089876-8999999998599299981766568987787855446789 Q gi|254780558|r 172 RIAIVVSGLGISQT--GTQRAINLLPANITLAFASNGNS-LDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQ 248 (396) Q Consensus 172 riAIVIddlG~~~~--~~~~ai~~LP~~vT~A~~P~~~~-~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~ 248 (396) .|+|-+|| |+... ...+.+.+.-.+.||-+....-. ..+.+++..+.||||-.|- |.. .-+..+ T Consensus 7 ~v~lTfDD-G~~~~~~~~l~~L~~~~i~aTfFv~~~~i~~~~~~~~~l~~~G~eig~Ht------~~H------~~~~~l 73 (123) T pfam01522 7 SVALTFDD-GPPDNTPRILDVLKKYGVPATFFVIGGNAETYPELVRRMVAEGHEIGNHT------YSH------PDLTAL 73 (123) T ss_pred EEEEEECC-CCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHCCCEEEECC------CCC------HHHHHH T ss_conf 89998747-97351999999999829986997247645368999999998799874210------456------415543 Q ss_pred CHHHHHHHHHHHHHHCCCCEEE Q ss_conf 9899999999999855893598 Q gi|254780558|r 249 TVQQLLNRLRYSLRRGTGYFGV 270 (396) Q Consensus 249 ~~~~~~~~l~~~l~~~p~~vGv 270 (396) +.++..+-|.++...+-...|. T Consensus 74 ~~~~~~~ei~~~~~~l~~~~g~ 95 (123) T pfam01522 74 SPEQIRAEILRAQEALKKATGK 95 (123) T ss_pred HHHHHHHHHHHHHHHHHHHHCC T ss_conf 1899999999999999998789 No 61 >KOG4513 consensus Probab=45.07 E-value=22 Score=15.82 Aligned_cols=81 Identities=17% Similarity=0.276 Sum_probs=30.5 Q ss_pred EEEECCCCCCCHHHH-HHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 999737787716889-9998379350898708987689999999985992999817665689877878554467899899 Q gi|254780558|r 174 AIVVSGLGISQTGTQ-RAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQ 252 (396) Q Consensus 174 AIVIddlG~~~~~~~-~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~ 252 (396) -|||||-|.|...+. ++|..-|-|.-=.+ .+.+-|... .+ |-.-+-|| |-. --+++-|..+|-.+---+| T Consensus 23 liViDGWG~se~d~ygNai~~a~tp~Md~L----~~g~~w~li-~A--HG~hVGLP-egl-MGNSEVGHl~iGaGRii~Q 93 (531) T KOG4513 23 LIVIDGWGESEPDQYGNAIHNAPTPAMDSL----KRGRTWFLI-KA--HGTHVGLP-EGL-MGNSEVGHLAIGAGRIIAQ 93 (531) T ss_pred EEEECCCCCCCCCCCCCHHHCCCCHHHHHH----HCCCCHHHH-HH--CCCCCCCC-CCC-CCCCCCCCCCCCCCEEEEC T ss_conf 999766677887522302214897688876----257621113-33--07112687-222-4652002101354415621 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999985 Q gi|254780558|r 253 LLNRLRYSLRR 263 (396) Q Consensus 253 ~~~~l~~~l~~ 263 (396) -..++..++.+ T Consensus 94 divrid~Av~~ 104 (531) T KOG4513 94 DIVRIDLAVAS 104 (531) T ss_pred CHHHHHHHHHC T ss_conf 04677778751 No 62 >PRK07572 cytosine deaminase; Validated Probab=44.87 E-value=22 Score=15.75 Aligned_cols=78 Identities=14% Similarity=0.053 Sum_probs=43.9 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH-CCCCEEEEECCCCCCCCCHHHHH Q ss_conf 68999999998599299981766568987787855446789989999999999985-58935980012300004989999 Q gi|254780558|r 208 SLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRR-GTGYFGVMNYRGAMLLSNKESAE 286 (396) Q Consensus 208 ~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~-~p~~vGvnNhmGs~~t~~~~~m~ 286 (396) +.+...+.|++.|..|=+|+ +. ..+++- ..++......... +.+ -=.-.|.-+.-..++...+ T Consensus 191 ~l~~~f~lA~~~g~~iDiHl-----dE-~~d~~~---------~~l~~la~~t~~~G~~g-rVt~sH~~~L~~~~~~~~~ 254 (425) T PRK07572 191 SVRLLCEIAAERGLRVDMHC-----DE-SDDPLS---------RHIETLAAETQRLGLQG-RVAGSHLTSMHSMDNYYVS 254 (425) T ss_pred HHHHHHHHHHHCCCCEEEEE-----CC-CCCHHH---------HHHHHHHHHHHHHCCCC-CEEECCCCCCCCCCHHHHH T ss_conf 99999999997499758734-----68-777157---------89999999999828988-5561363111349999999 Q ss_pred HHHHHHHCCCCEEEE Q ss_conf 999998506928997 Q gi|254780558|r 287 VIFKEFAKRGLLFFD 301 (396) Q Consensus 287 ~vl~~l~~rgL~flD 301 (396) .++..+++.|..++- T Consensus 255 ~~i~~lAeagi~vv~ 269 (425) T PRK07572 255 KLIPLMAEAGVAAIA 269 (425) T ss_pred HHHHHHHHCCCCEEE T ss_conf 999999982995552 No 63 >pfam02353 CMAS Cyclopropane-fatty-acyl-phospholipid synthase. This family consist of Cyclopropane-fatty-acyl-phospholipid synthase or CFA synthase EC:2.1.1.79 this enzyme catalyse the reaction: S-adenosyl-L-methionine + phospholipid olefinic fatty acid <= S-adenosyl-L-homocysteine + phospholipid cyclopropane fatty acid. Probab=44.67 E-value=22 Score=15.73 Aligned_cols=170 Identities=12% Similarity=0.160 Sum_probs=101.9 Q ss_pred EEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHH Q ss_conf 99973778771688999983793508987089876899999999859929998176656898778785544678998999 Q gi|254780558|r 174 AIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQL 253 (396) Q Consensus 174 AIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~ 253 (396) -|.=-|.|.+.. ...+..+....|| ++.-......-.-+++++.|.+-.+.+=+ .||-+. +| T Consensus 65 ~vLDiGCGWG~~-a~~~a~~~g~~v~-giTlS~~Q~~~~~~r~~~~gl~~~v~v~~--~Dyrd~-~~------------- 126 (273) T pfam02353 65 TLLDIGCGWGGL-MRRAAERYDVNVV-GLTLSKNQYKLARQRVAAEGLQRKVEVLL--QDYRDF-DE------------- 126 (273) T ss_pred EEEEECCCCHHH-HHHHHHHCCCEEE-EEECCHHHHHHHHHHHHHCCCCCCCCEEE--CCHHHC-CC------------- T ss_conf 899978880899-9999984795189-99797899999999998708743212000--626547-66------------- Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCC-C Q ss_conf 999999998558935980012300004989999999998506928997497632267898987099469845886289-9 Q gi|254780558|r 254 LNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQ-V 332 (396) Q Consensus 254 ~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~-~ 332 (396) -+++|- .+|+..|||-+. -+.-++.+-+.|+.-|++||..-|..+.- ..-......++..+.+|=... + T Consensus 127 ------~fD~Iv-Siem~Ehvg~~~--~~~~f~~i~~~LkpgG~~~iq~i~~~~~~-~~~~~~~~~~fi~kyIFPGG~lP 196 (273) T pfam02353 127 ------PFDRIV-SVGMFEHVGHEN--YDTFFKKLYNLLPPGGLMLLHTITGLHPD-ETSERGLPLKFIDKYIFPGGELP 196 (273) T ss_pred ------CCCEEE-EEHHHHHCCHHH--HHHHHHHHHHHCCCCCEEEEEEEECCCCC-CHHHCCCCCCHHHHHCCCCCCCC T ss_conf ------666677-651687628777--99999999986588973999999425732-10102774200777607998789 Q ss_pred CHHHHHHHHHHHHHHHHHCCEEEE-EEECCHHHHHHHHHHHHHHHHC Q ss_conf 999999999999999987194899-9808878899999997568876 Q gi|254780558|r 333 DRDKIREKLKGLEEIARTTGQAIG-VAVAFDESIEVISQWLQQEHVR 378 (396) Q Consensus 333 ~~~~I~~qL~~l~~~Ar~~G~AI~-igh~~p~Ti~~L~~w~~~l~~~ 378 (396) +...|. ..+.+.|.-|- +-+-...=...|..|...++++ T Consensus 197 s~~~~~-------~~~~~~~l~v~~~~~~g~HYa~TL~~W~~nf~~n 236 (273) T pfam02353 197 SISMIV-------ESSSEAGFTVEDVESLRPHYAKTLDLWAENLQAN 236 (273) T ss_pred CHHHHH-------HHHHHCCCEEEEEEECCHHHHHHHHHHHHHHHHH T ss_conf 999999-------9998689488676767365999999999999999 No 64 >COG4352 RPL13 Ribosomal protein L13E [Translation, ribosomal structure and biogenesis] Probab=43.78 E-value=23 Score=15.64 Aligned_cols=37 Identities=19% Similarity=0.173 Sum_probs=25.7 Q ss_pred HHHHHHCCEEEEEE--ECCHHHHHHHHHHHHHHHHCCCE Q ss_conf 99998719489998--08878899999997568876918 Q gi|254780558|r 345 EEIARTTGQAIGVA--VAFDESIEVISQWLQQEHVRDVS 381 (396) Q Consensus 345 ~~~Ar~~G~AI~ig--h~~p~Ti~~L~~w~~~l~~~gi~ 381 (396) .+.||+=|.+|-+- ..+++.++++.+|+.++.+..++ T Consensus 73 ~~~AR~LGI~VD~RRr~~~~en~eal~k~ik~ll~~~~~ 111 (113) T COG4352 73 ARKARTLGIAVDHRRRNRNPENFEALVKRIKELLEKIIV 111 (113) T ss_pred HHHHHHHCCCEEHHHCCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 888876086100010247888999999999999845866 No 65 >PRK05692 hydroxymethylglutaryl-CoA lyase; Provisional Probab=43.73 E-value=23 Score=15.64 Aligned_cols=75 Identities=8% Similarity=0.096 Sum_probs=32.9 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHH Q ss_conf 89999999985992999817665689877878554467899899999999999855893598001230000498999999 Q gi|254780558|r 209 LDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVI 288 (396) Q Consensus 209 ~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~v 288 (396) ..+.++.|++.|.++-..+-| .++-|.. | ..+.+.+.+.+..+.+.-...+.+-...|. .++..+..+ T Consensus 122 ~~~~i~~a~~~g~~v~~~i~~-afg~p~~--~------~~~~~~l~~~~~~~~~~Ga~~I~laDT~G~---a~P~~v~~~ 189 (287) T PRK05692 122 FEPVAEAAKQAGVRVRGYVSC-VLGCPYE--G------EVPPEAVADVAERLFALGCDEISLGDTIGV---GTPGQVRRV 189 (287) T ss_pred HHHHHHHHHHCCCEEEEEEEE-CCCCCCC--C------CCCHHHHHHHHHHHHHCCCCEEECCCCCCC---CCHHHHHHH T ss_conf 999999999769879998740-1367646--8------648999999999998579978544765566---699999999 Q ss_pred HHHHHCC Q ss_conf 9998506 Q gi|254780558|r 289 FKEFAKR 295 (396) Q Consensus 289 l~~l~~r 295 (396) ++.++++ T Consensus 190 i~~v~~~ 196 (287) T PRK05692 190 LEAVLAE 196 (287) T ss_pred HHHHHHH T ss_conf 9999986 No 66 >TIGR02173 cyt_kin_arch cytidylate kinase, putative; InterPro: IPR011892 Proteins in this family are believed to be cytidylate kinase. Members of this family are found in the archaea and in spirochaetes, and differ considerably from the common bacterial form of cytidylate kinase described by IPR003136 from INTERPRO.; GO: 0004127 cytidylate kinase activity, 0005524 ATP binding, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=43.66 E-value=19 Score=16.15 Aligned_cols=58 Identities=24% Similarity=0.312 Sum_probs=41.4 Q ss_pred CCCCCCHHHHHHHHHCCCEEEE----------EEEEC-CC--CCHHHHHHHHHHHHH-HHHHCCEEEEEEEC Q ss_conf 9763226789898709946984----------58862-89--999999999999999-99871948999808 Q gi|254780558|r 303 GSSPRNLTRVLAPKLNLPYMVA----------DLYLD-DQ--VDRDKIREKLKGLEE-IARTTGQAIGVAVA 360 (396) Q Consensus 303 ~Ts~~Sva~~~A~~~gvp~~~~----------dvfLD-~~--~~~~~I~~qL~~l~~-~Ar~~G~AI~igh~ 360 (396) |++-.|||-.+|+++++++.+. ++.|- +. -....|++.+++... +|++++..|.=||. T Consensus 10 GSGktTvA~~lA~~Lsl~~iSaG~iRelA~~~Gldl~E~~~aee~~eIDk~iD~~~~E~A~~~~nvvlEsrl 81 (173) T TIGR02173 10 GSGKTTVAKILAEKLSLKLISAGDIRELAEKMGLDLAESKYAEENPEIDKKIDRRIREIAEKEKNVVLESRL 81 (173) T ss_pred CCCHHHHHHHHHHHCCCCEECCCHHHHHHHHCCCCHHHHHHHCCCCCCCHHHHHHHHHHHCCCCCEEEEEHH T ss_conf 686478999999863983120200788986429887773443058631167537885543048966885205 No 67 >COG0542 clpA ATP-binding subunits of Clp protease and DnaK/DnaJ chaperones [Posttranslational modification, protein turnover, chaperones] Probab=43.03 E-value=24 Score=15.57 Aligned_cols=133 Identities=11% Similarity=0.156 Sum_probs=72.3 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCC------------------CCHHHHHHHHHHHHCCCC------ Q ss_conf 54467899899999999999855893598001230000------------------498999999999850692------ Q gi|254780558|r 242 YTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLL------------------SNKESAEVIFKEFAKRGL------ 297 (396) Q Consensus 242 ~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t------------------~~~~~m~~vl~~l~~rgL------ 297 (396) .-+..+||+-.-...+...++.-|||||-.. |+.+| +++..+..++++|..--| T Consensus 551 aliR~DMSEy~EkHsVSrLIGaPPGYVGyee--GG~LTEaVRr~PySViLlDEIEKAHpdV~nilLQVlDdGrLTD~~Gr 628 (786) T COG0542 551 ALIRIDMSEYMEKHSVSRLIGAPPGYVGYEE--GGQLTEAVRRKPYSVILLDEIEKAHPDVFNLLLQVLDDGRLTDGQGR 628 (786) T ss_pred CCEEECHHHHHHHHHHHHHHCCCCCCCEECC--CCCHHHHHHCCCCEEEEECHHHHCCHHHHHHHHHHHCCCCEECCCCC T ss_conf 4455456877778779987279998720065--54003766069986888412644088999999998467805548998 Q ss_pred --------EEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHH---HHHHHHHHHHHHHCCEEEEEEECC-HHHH Q ss_conf --------8997497632267898987099469845886289999999---999999999998719489998088-7889 Q gi|254780558|r 298 --------LFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKI---REKLKGLEEIARTTGQAIGVAVAF-DESI 365 (396) Q Consensus 298 --------~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I---~~qL~~l~~~Ar~~G~AI~igh~~-p~Ti 365 (396) .++.|-... ..+....+ . +.|-+.+...+.+ .++.-+-|++.|-.. |.+=|+. .+++ T Consensus 629 ~VdFrNtiIImTSN~Gs----~~i~~~~~----~-~~~~~~~~~~~~v~~~l~~~F~PEFLNRid~--II~F~~L~~~~l 697 (786) T COG0542 629 TVDFRNTIIIMTSNAGS----EEILRDAD----G-DDFADKEALKEAVMEELKKHFRPEFLNRIDE--IIPFNPLSKEVL 697 (786) T ss_pred EEECCEEEEEEECCCCH----HHHHHHCC----C-CCCCHHHHHHHHHHHHHHHHCCHHHHHHCCC--EEECCCCCHHHH T ss_conf 88430028998450265----98975313----4-3210046788999999985389989851261--785067998999 Q ss_pred HH-----HHHHHHHHHHCCCEEEEHHH Q ss_conf 99-----99997568876918996777 Q gi|254780558|r 366 EV-----ISQWLQQEHVRDVSVVPLSC 387 (396) Q Consensus 366 ~~-----L~~w~~~l~~~gi~lVpvS~ 387 (396) .. |.+....|..+||.|.---+ T Consensus 698 ~~Iv~~~L~~l~~~L~~~~i~l~~s~~ 724 (786) T COG0542 698 ERIVDLQLNRLAKRLAERGITLELSDE 724 (786) T ss_pred HHHHHHHHHHHHHHHHHCCCEEEECHH T ss_conf 999999999999999868955998889 No 68 >cd01992 PP-ATPase N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]. This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This domain has a strongly conserved motif SGGXD at the N terminus. Probab=42.94 E-value=24 Score=15.56 Aligned_cols=72 Identities=21% Similarity=0.283 Sum_probs=39.0 Q ss_pred EEEECCCCCCCH-----HHHHHHHHCCCEEEEEEEECCC--CCHHHHHHH--HHHHHHHHHHCCE-EEEEEECCHHHHHH Q ss_conf 899749763226-----7898987099469845886289--999999999--9999999987194-89998088788999 Q gi|254780558|r 298 LFFDDGSSPRNL-----TRVLAPKLNLPYMVADLYLDDQ--VDRDKIREK--LKGLEEIARTTGQ-AIGVAVAFDESIEV 367 (396) Q Consensus 298 ~flDs~Ts~~Sv-----a~~~A~~~gvp~~~~dvfLD~~--~~~~~I~~q--L~~l~~~Ar~~G~-AI~igh~~p~Ti~~ 367 (396) ..+|-+..+.|. +.+.++.+|+|+.......+.. .+.+...+. ..-+.+.++++|. .|+.||-.++-+|. T Consensus 33 ~hvdh~lr~~s~~~~~~v~~~~~~~~i~~~i~~~~~~~~~~~~~e~~aR~~Ry~~l~~~~~~~~~~~i~lgHh~dD~~ET 112 (185) T cd01992 33 VHVDHGLRPESDEEAAFVADLCAKLGIPLYILVVALAPKPGGNLEAAAREARYDFFAEIAKEHGADVLLTAHHADDQAET 112 (185) T ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCEEECCCHHHHHHH T ss_conf 99818988888899999999999859988999977536789999999999999999999987354504203630368999 Q ss_pred HH Q ss_conf 99 Q gi|254780558|r 368 IS 369 (396) Q Consensus 368 L~ 369 (396) +. T Consensus 113 ~l 114 (185) T cd01992 113 VL 114 (185) T ss_pred HH T ss_conf 99 No 69 >COG2243 CobF Precorrin-2 methylase [Coenzyme metabolism] Probab=42.63 E-value=24 Score=15.53 Aligned_cols=164 Identities=15% Similarity=0.178 Sum_probs=77.9 Q ss_pred CCCHHHHHHHHHCCCCCEEEEECC----CCCHHHHHHHHHHCCC-CEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHH Q ss_conf 771688999983793508987089----8768999999998599-29998176656898778785544678998999999 Q gi|254780558|r 182 ISQTGTQRAINLLPANITLAFASN----GNSLDRWMKEAKKKGQ-EAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNR 256 (396) Q Consensus 182 ~~~~~~~~ai~~LP~~vT~A~~P~----~~~~~~~~~~Ar~~G~-EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~ 256 (396) -..-.|.+|+..|- .+-.-+.|. ...+.+.++.....+. ++-+|.||.... +..+ ..--+++... T Consensus 13 DPeLlTlkAi~~L~-~adVi~~P~~~g~~slAr~Iv~~y~~~~~~~~~l~fPm~~~~-------~e~~--~~~~~e~a~~ 82 (234) T COG2243 13 DPELLTLKAIRALK-KADVVYVPSKKGKGSLAREIVEDYLTPGSRIVELHFPMTTDM-------REEL--EDAWEEAAAE 82 (234) T ss_pred CHHHHHHHHHHHHH-HCCEEEEECCCCCCCHHHHHHHHHCCCCCEEEEEEECCCCCH-------HHHH--HHHHHHHHHH T ss_conf 83462399999885-499899836887643699999986698861567870567862-------8899--9999999999 Q ss_pred HHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEEC---CCCC Q ss_conf 9999985589359800123000049899999999985069289974976322678989870994698458862---8999 Q gi|254780558|r 257 LRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLD---DQVD 333 (396) Q Consensus 257 l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD---~~~~ 333 (396) +...|+.--+ |++.. .|.=+.= .+.-.+.+.|+.+| |-+.---.=.| ....|...|+|....|-.|= .... T Consensus 83 va~~l~~G~~-VAf~~-lGDP~~Y--sTf~~l~~~l~~~~-~e~e~VPGVsS-~~a~aa~~~~pL~~g~~~l~Vlp~t~~ 156 (234) T COG2243 83 VAAELEAGRD-VAFLT-LGDPTFY--STFMYLLERLRERG-FEVEVVPGVSS-FSACAARLGVPLVEGDDSLSVLPATRP 156 (234) T ss_pred HHHHHHCCCE-EEEEE-CCCCCHH--HHHHHHHHHHHCCC-CCEEEECCCCH-HHHHHHHHCCCEECCCCEEEEEECCCC T ss_conf 9999976992-89998-1485288--87999999853069-73599589436-999999719851516862688741574 Q ss_pred HHHHHHHHHHHHHHHHHCCEEEEEEECC-HHHHHHH Q ss_conf 9999999999999998719489998088-7889999 Q gi|254780558|r 334 RDKIREKLKGLEEIARTTGQAIGVAVAF-DESIEVI 368 (396) Q Consensus 334 ~~~I~~qL~~l~~~Ar~~G~AI~igh~~-p~Ti~~L 368 (396) ...+.+.| .....+|.+--.+ ...+..+ T Consensus 157 ~~~~~~~l-------~~~d~~VvMK~~~~~~~i~~~ 185 (234) T COG2243 157 DEELERAL-------ADFDTAVVMKVGRNFEKLRRL 185 (234) T ss_pred HHHHHHHH-------HHCCEEEEEECCCCHHHHHHH T ss_conf 06677677-------527818999668767899999 No 70 >COG0052 RpsB Ribosomal protein S2 [Translation, ribosomal structure and biogenesis] Probab=42.24 E-value=24 Score=15.49 Aligned_cols=148 Identities=18% Similarity=0.206 Sum_probs=74.0 Q ss_pred CEEEEEECCCCCCCHHHHHH---HHHCCC-CCEEEEECCCCCHHHHH-HHHHHCCCCEEEECCCCCCCCCCCCCCCCCCC Q ss_conf 65899973778771688999---983793-50898708987689999-99998599299981766568987787855446 Q gi|254780558|r 171 ARIAIVVSGLGISQTGTQRA---INLLPA-NITLAFASNGNSLDRWM-KEAKKKGQEAILQIPMQAFDESYNEDDSYTLK 245 (396) Q Consensus 171 ~riAIVIddlG~~~~~~~~a---i~~LP~-~vT~A~~P~~~~~~~~~-~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~ 245 (396) .|=-|-|-||.-....-.+| +.+.-. +=+.-|.---.+.++.+ +.|.+.|--++ ...|.| ++|+ T Consensus 34 ~RngihIIDL~kT~~~l~~A~~~v~~~~~~~g~ILfVgTK~~a~~~V~~~A~r~g~~yV----------~~RwLg-G~LT 102 (252) T COG0052 34 ERNGIHIIDLQKTLERLREAYKFLRRIAANGGKILFVGTKKQAQEPVKEFAERTGAYYV----------NGRWLG-GMLT 102 (252) T ss_pred ECCCCEEEEHHHHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHHHHHHHCCCEE----------CCCCCC-CCCC T ss_conf 02771798799989999999999999972899799995208878999999999599513----------276437-4244 Q ss_pred CCCCHHHHHHHHH--HHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHC----CCCEEEECCCCCCCHHHHHHHHHCC Q ss_conf 7899899999999--999855893598001230000498999999999850----6928997497632267898987099 Q gi|254780558|r 246 VTQTVQQLLNRLR--YSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAK----RGLLFFDDGSSPRNLTRVLAPKLNL 319 (396) Q Consensus 246 ~~~~~~~~~~~l~--~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~----rgL~flDs~Ts~~Sva~~~A~~~gv 319 (396) .--+-..-.++|. ..+..-- .-+..-.-...+.....-++..+.-++. -++.||.+ +.....|-..|++.|| T Consensus 103 N~~ti~~si~rl~~lE~~~~~~-~~~~tKkE~l~l~re~~kL~k~lgGIk~m~~~Pd~l~ViD-p~~e~iAv~EA~klgI 180 (252) T COG0052 103 NFKTIRKSIKRLKELEKMEEDG-FDGLTKKEALMLTRELEKLEKSLGGIKDMKGLPDVLFVID-PRKEKIAVKEANKLGI 180 (252) T ss_pred CCHHHHHHHHHHHHHHHHHHCC-CCCCCHHHHHHHHHHHHHHHHHHCCHHHCCCCCCEEEEEC-CCHHHHHHHHHHHCCC T ss_conf 7405888999999999876414-3210078888777789999874014111267999899968-8176899999997599 Q ss_pred CEEEEEEEECCCCCH Q ss_conf 469845886289999 Q gi|254780558|r 320 PYMVADLYLDDQVDR 334 (396) Q Consensus 320 p~~~~dvfLD~~~~~ 334 (396) |... ++|...++ T Consensus 181 PVvA---lvDTn~dp 192 (252) T COG0052 181 PVVA---LVDTNCDP 192 (252) T ss_pred CEEE---EECCCCCC T ss_conf 9899---84189997 No 71 >PRK01233 glyS glycyl-tRNA synthetase subunit beta; Validated Probab=42.13 E-value=25 Score=15.48 Aligned_cols=175 Identities=16% Similarity=0.139 Sum_probs=90.4 Q ss_pred CCCCCCCEEEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCC-- Q ss_conf 234668658999737787716889999837935089870898768999999998599299981766568987787855-- Q gi|254780558|r 165 CSNASGARIAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSY-- 242 (396) Q Consensus 165 ~~~a~~~riAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~-- 242 (396) ..++..-|+++.|.|+-..+... ....-.+++..||...+.-.+....-+|..|-++ +-+.....+-|.. T Consensus 44 ~~~~TPRRLav~v~~l~~~q~d~--~~e~kGP~v~~afd~~G~ptkA~~GF~k~~gi~~------~dL~~~~~~kgey~~ 115 (687) T PRK01233 44 KVFATPRRLALLVEGLAEKQPDR--EEEKKGPAVKIAFDADGKPTKAAEGFARGQGITV------DDLERRETKKGEYLF 115 (687) T ss_pred EEEECCCEEEEEECCCCCCCCCH--HHEEECCCHHHHCCCCCCCCHHHHHHHHHCCCCH------HHCEEEECCCEEEEE T ss_conf 69985878788875863118661--3341376477754767895399999999879999------991778719972999 Q ss_pred --CCCCCCCHHH-HHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCC----------------CCEEEECC Q ss_conf --4467899899-9999999998558935980012300004989999999998506----------------92899749 Q gi|254780558|r 243 --TLKVTQTVQQ-LLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKR----------------GLLFFDDG 303 (396) Q Consensus 243 --~L~~~~~~~~-~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~r----------------gL~flDs~ 303 (396) ....+...++ +...+...+..+|...-+-= |+.=..--+..+|++.-+..+ |+-|+.+. T Consensus 116 ~~~~~kg~~~~elL~~ii~~~i~~l~~pKsMRW--g~~~~~F~RPIrwIvaLld~~vi~~~~~gi~s~~~t~GHRfl~~~ 193 (687) T PRK01233 116 YRKEVPGRPTKELLPEIVPEALAKLPIPKSMRW--GNNDLRFVRPVHWLVALLGDEVIPFEILGIKSGRVTRGHRFLGPG 193 (687) T ss_pred EEEEECCCCHHHHHHHHHHHHHHCCCCCCCCCC--CCCCCEEECHHHHHHHHCCCEEEEEEEEEEECCCEECCCCCCCCC T ss_conf 998568842999999999999976999830027--999985545279898870881678999514525411255346888 Q ss_pred CCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEE Q ss_conf 76322678989870994698458862899999999999999999987194899 Q gi|254780558|r 304 SSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQAIG 356 (396) Q Consensus 304 Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~ 356 (396) .-.=+-+...-..+ ....|.+|.+...+.|..|+.++ |++.|.-+- T Consensus 194 ~i~I~~~~~Y~~~L----~~~~Vivd~~eRk~~I~~~i~~~---a~~~~~~~~ 239 (687) T PRK01233 194 EFTIDNADDYEEKL----REAFVIADPEERKAIILAQIEEL---AAANGGEAD 239 (687) T ss_pred CEEECCHHHHHHHH----HHCCEECCHHHHHHHHHHHHHHH---HHHCCCEEC T ss_conf 85518999999998----51989836999999999999999---998696786 No 72 >COG1674 FtsK DNA segregation ATPase FtsK/SpoIIIE and related proteins [Cell division and chromosome partitioning] Probab=41.51 E-value=25 Score=15.42 Aligned_cols=88 Identities=18% Similarity=0.297 Sum_probs=61.4 Q ss_pred ECCCCCCCCCHH-----HHHHHHHHHHCCCCEEEECCCCCCCHHHHHHH------HHCCCEEEEEEEECCCC-----CHH Q ss_conf 012300004989-----99999999850692899749763226789898------70994698458862899-----999 Q gi|254780558|r 272 NYRGAMLLSNKE-----SAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAP------KLNLPYMVADLYLDDQV-----DRD 335 (396) Q Consensus 272 NhmGs~~t~~~~-----~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~------~~gvp~~~~dvfLD~~~-----~~~ 335 (396) -|+++-+..|+. +++|+.+++.+|--.|... .-++...--.+ ...+|+. .+++|+-. ... T Consensus 582 PHl~~~v~td~~~k~~~al~~~~~eme~R~~l~~~~--~vr~i~~yn~k~~~~~~~~~lP~i--viiiDe~adlm~~~~k 657 (858) T COG1674 582 PHLGDPVVTDEKEKAEKALAELVAEMERRYKLFSEK--GVRNIEGYNEKIAGAIPDEELPYI--VIIIDEYADLMMVAGK 657 (858) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH--CCCCHHHHHHHHCCCCCCCCCCCE--EEEECCHHHHHCCCCH T ss_conf 855772324747789999999999999999988885--666277776542024555679808--9994447888612317 Q ss_pred HHHHHHHHHHHHHHHCCEEEEEEECCHH Q ss_conf 9999999999999871948999808878 Q gi|254780558|r 336 KIREKLKGLEEIARTTGQAIGVAVAFDE 363 (396) Q Consensus 336 ~I~~qL~~l~~~Ar~~G~AI~igh~~p~ 363 (396) .|+..+.++.+.||..|.-+.++--+|. T Consensus 658 ~ve~~i~rLa~~ara~GIHlilatqRps 685 (858) T COG1674 658 ELEELIARLAQKGRAAGIHLILATQRPS 685 (858) T ss_pred HHHHHHHHHHHHHHHCCCEEEEECCCCC T ss_conf 6999999999978765826999748999 No 73 >pfam02645 DegV Uncharacterized protein, DegV family COG1307. The structure of this protein revealed a bound fatty-acid molecule in a pocket between the two protein domains. The structure indicates that this family has the molecular function of fatty-acid binding and may play a role in the cellular functions of fatty acid transport or metabolism. Probab=40.86 E-value=26 Score=15.36 Aligned_cols=123 Identities=20% Similarity=0.281 Sum_probs=68.6 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEE----CCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCC Q ss_conf 46789989999999999985589359800----12300004989999999998506928997497632267898987099 Q gi|254780558|r 244 LKVTQTVQQLLNRLRYSLRRGTGYFGVMN----YRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNL 319 (396) Q Consensus 244 L~~~~~~~~~~~~l~~~l~~~p~~vGvnN----hmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gv 319 (396) +..+++.+|+.+.++....+.-.|+-+.| +.|+|...-.. .+-+.|+=+=++-+|.+..- T Consensus 64 ~~~G~s~~ei~~~l~~~~~~~~~~~~v~~L~~L~kgGRis~~~~---~ig~lL~ikPIi~~~~g~~~------------- 127 (211) T pfam02645 64 AKAGKSFEEILKKLEEIRERTKAYFTVDDLDYLVKGGRISKAAA---FLGSLLKIKPILSFEDGGLV------------- 127 (211) T ss_pred HHCCCCHHHHHHHHHHHHHHCEEEEEECCHHHHHHCCCCHHHHH---HHHHHHCCEEEEEEECCEEE------------- T ss_conf 98298999999999999850679999597799987891248999---99861086799999799589------------- Q ss_pred CEEEEEEEECCC-CCHHH-HHHHHHHHHHHHH-HCCEEEEEEECC-HHHHHHHHHHHHHHHH-CCCEEEEHHHHH Q ss_conf 469845886289-99999-9999999999998-719489998088-7889999999756887-691899677743 Q gi|254780558|r 320 PYMVADLYLDDQ-VDRDK-IREKLKGLEEIAR-TTGQAIGVAVAF-DESIEVISQWLQQEHV-RDVSVVPLSCLA 389 (396) Q Consensus 320 p~~~~dvfLD~~-~~~~~-I~~qL~~l~~~Ar-~~G~AI~igh~~-p~Ti~~L~~w~~~l~~-~gi~lVpvS~l~ 389 (396) ..... ....+ +.+.++...+... .....+.|.|.. ++..+.+.+++.+.-. ..+.+.+++.++ T Consensus 128 -------~~~k~r~~~k~~~~~~~~~~~~~~~~~~~~~i~i~~~~~~e~~~~~~~~l~~~~~~~~i~~~~~~~vi 195 (211) T pfam02645 128 -------PLAKVRGGKKAAIKRLLEILLKKAAEGKGYRVAIIHANEEEAAELLKELLKAKYPIVDISISEIGPVI 195 (211) T ss_pred -------EEEEEECHHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHHHHCCCCEEEEEEECCEE T ss_conf -------96456225689999999999986267986399999289999999999999856799739999837599 No 74 >cd02067 B12-binding B12 binding domain (B12-BD). This domain binds different cobalamid derivates, like B12 (adenosylcobamide) or methylcobalamin or methyl-Co(III) 5-hydroxybenzimidazolylcobamide, it is found in several enzymes, such as glutamate mutase, methionine synthase and methylmalonyl-CoA mutase. Cobalamin undergoes a conformational change on binding the protein; the dimethylbenzimidazole group, which is coordinated to the cobalt in the free cofactor, moves away from the corrin and is replaced by a histidine contributed by the protein. The sequence Asp-X-His-X-X-Gly, which contains this histidine ligand, is conserved in many cobalamin-binding proteins. Probab=40.45 E-value=26 Score=15.32 Aligned_cols=90 Identities=11% Similarity=0.141 Sum_probs=60.4 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHH Q ss_conf 99999999859929998176656898778785544678998999999999998558935980012300004989999999 Q gi|254780558|r 210 DRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIF 289 (396) Q Consensus 210 ~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl 289 (396) .=.+..-+..|+||+- |-.+.+++++. +.+.+.=|..+|+.-.|++ +...|+.++ T Consensus 17 ~iv~~~l~~~G~~V~~------------------lG~~vp~e~~v---~~a~~~~~d~I~lS~~~~~----~~~~~~~~i 71 (119) T cd02067 17 NIVARALRDAGFEVID------------------LGVDVPPEEIV---EAAKEEDADAIGLSGLLTT----HMTLMKEVI 71 (119) T ss_pred HHHHHHHHHCCCEEEE------------------CCCCCCHHHHH---HHHHHCCCCEEEEECCCCC----CHHHHHHHH T ss_conf 9999999978998998------------------99999999999---9999709999999622024----268999999 Q ss_pred HHHHCCCC---EEEECCCCCCCHHHHHHHHHCCCEEEEE Q ss_conf 99850692---8997497632267898987099469845 Q gi|254780558|r 290 KEFAKRGL---LFFDDGSSPRNLTRVLAPKLNLPYMVAD 325 (396) Q Consensus 290 ~~l~~rgL---~flDs~Ts~~Sva~~~A~~~gvp~~~~d 325 (396) +.|+++|+ -.+=-+... +....++++.|.-....| T Consensus 72 ~~l~~~g~~~i~v~vGG~~~-~~~~~~~~~~Gad~~~~~ 109 (119) T cd02067 72 EELKEAGLDDIPVLVGGAIV-TRDFKFLKEIGVDAYFGP 109 (119) T ss_pred HHHHHCCCCCCEEEEECCCC-CHHHHHHHHCCCCEEECC T ss_conf 99997699998599989989-743999998699799778 No 75 >pfam02633 Creatininase Creatinine amidohydrolase. Creatinine amidohydrolase (EC:3.5.2.10), or creatininase, catalyses the hydrolysis of creatinine to creatine. Probab=40.44 E-value=26 Score=15.32 Aligned_cols=65 Identities=18% Similarity=0.116 Sum_probs=44.2 Q ss_pred EEECCCCCCCCCCCCCC-CCCCCCCCCHHHHHHHHHHHHHH-----CCCCEEEEECCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99817665689877878-55446789989999999999985-----58935980012300004989999999998506 Q gi|254780558|r 224 ILQIPMQAFDESYNEDD-SYTLKVTQTVQQLLNRLRYSLRR-----GTGYFGVMNYRGAMLLSNKESAEVIFKEFAKR 295 (396) Q Consensus 224 llhlPMEp~~~~~~~pG-p~~L~~~~~~~~~~~~l~~~l~~-----~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~r 295 (396) .+-+|--++++...--+ |+|+ +++.+.....|...+.+ |...+=||.|-| |...++.++++|+.+ T Consensus 56 ~lv~P~i~yG~s~~h~~fpGTi--sl~~~t~~~~l~di~~sl~~~Gf~~ivilNgHgG-----N~~~l~~~~~~l~~~ 126 (235) T pfam02633 56 VLVLPTIPYGYSPEHMGFPGTI--SLSPETLIALLRDIGRSLARHGFRRLVIVNGHGG-----NIALLQEAARELRAE 126 (235) T ss_pred EEEECCCCCCCCHHHCCCCCEE--EECHHHHHHHHHHHHHHHHHCCCCEEEEEECCCC-----CHHHHHHHHHHHHHH T ss_conf 6996887567786772889736--5099999999999999999779758999916877-----489999999999977 No 76 >PRK08238 hypothetical protein; Validated Probab=40.38 E-value=26 Score=15.31 Aligned_cols=99 Identities=12% Similarity=0.227 Sum_probs=72.0 Q ss_pred CHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH Q ss_conf 16889999837935089870898768999999998599299981766568987787855446789989999999999985 Q gi|254780558|r 184 QTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRR 263 (396) Q Consensus 184 ~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~ 263 (396) ...-.+.+.+ -..+-.+.+||.+..-+|++..|++|+++.| .+.+.+.+.+.+..+|+- T Consensus 55 ka~lK~~La~-~~~~d~~~LPyn~~vl~~l~~~k~~GR~vvL--------------------~Tas~~~~a~~IA~hLgl 113 (481) T PRK08238 55 KAALKRRLAE-RVDLDVATLPYNEEVLDYLRAERAAGRPIVL--------------------ATASDERLAQAVAAHLGL 113 (481) T ss_pred HHHHHHHHHH-HCCCCHHHCCCCHHHHHHHHHHHHCCCEEEE--------------------EECCCHHHHHHHHHHCCC T ss_conf 4999999974-0788977479898999999999986997999--------------------958869999999985798 Q ss_pred CCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 5893598001230000498999999999850692899749763 Q gi|254780558|r 264 GTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSP 306 (396) Q Consensus 264 ~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~ 306 (396) +.++.|-+++. .+.+ ..-.+.+-+...++|.-|+-+...- T Consensus 114 Fd~v~aSd~~~--NL~g-~~Ka~~L~~~fG~~gFdY~Gns~~D 153 (481) T PRK08238 114 FDGVLASDGTT--NLKG-AAKAAALVEAFGEKGFDYAGNSRAD 153 (481) T ss_pred CCEEECCCCCC--CCCC-HHHHHHHHHHCCCCCCCCCCCCHHH T ss_conf 55355168853--4676-4899999987587675304787245 No 77 >TIGR01457 HAD-SF-IIA-hyp2 HAD-superfamily subfamily IIA hydrolase, TIGR01457; InterPro: IPR006354 These sequences are all members of the Class IIA subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. The sequences are all from the Gram-positive (low-GC) bacteria.. Probab=40.34 E-value=23 Score=15.67 Aligned_cols=53 Identities=23% Similarity=0.451 Sum_probs=38.5 Q ss_pred HHHHHHHHHCCCC--EEEE--CCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHH Q ss_conf 9999999850692--8997--497632267898987099469845886289999999 Q gi|254780558|r 285 AEVIFKEFAKRGL--LFFD--DGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKI 337 (396) Q Consensus 285 m~~vl~~l~~rgL--~flD--s~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I 337 (396) -..+.++|++|+. +||. |-.||++|+..++..+.||.-.-+||==.--..++| T Consensus 22 a~~FV~~L~kr~~pYLFVTNNstrtPe~Va~~L~snfdipat~e~VfT~smAta~y~ 78 (251) T TIGR01457 22 AVEFVKELKKRDVPYLFVTNNSTRTPESVAELLASNFDIPATKEQVFTASMATADYL 78 (251) T ss_pred HHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH T ss_conf 899999886479863687278888817999998751689997210234478999999 No 78 >TIGR03239 GarL 2-dehydro-3-deoxyglucarate aldolase. In E. coli this enzyme (GarL, ) 2-dehydro-3-deoxyglucarate aldolase acts in the catabolism of several sugars including D-galactarate, D-glucarate and L-idarate. In fact, 5-dehydro-4-deoxy-D-glucarate aldolase is a synonym for this enzyme as it is unclear in the literature whether the enzyme acts on only one of these or, as seems likely, has no preference. (Despite the apparent large difference in substrate stucture indicated by their names, 2-DH-3DO- and 5-DH-4DO-glucarate differ only by the chirality of most central hydroxyl-bearing carbon and is alternately named 2-DH-3DO-galactarate.) The reported product of D-galactarate dehydratase (4.2.1.42) is the 5DH-4DO-glucarate isomer and this enzyme is found proximal to the aldolase in many genomes (GenProp0714) where no epimerase is known. Similarly, the product of D-glucarate dehydratase (4.2.1.40) is again the 5-DH-4DO isomer, so the provenance of the 2-DH-3DO-glucarate isomer for which Probab=40.29 E-value=26 Score=15.30 Aligned_cols=85 Identities=13% Similarity=0.221 Sum_probs=54.8 Q ss_pred CCCHHHHHHHHHHHHCCC--CEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEE Q ss_conf 049899999999985069--289974976322678989870994698458862899999999999999999987194899 Q gi|254780558|r 279 LSNKESAEVIFKEFAKRG--LLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQAIG 356 (396) Q Consensus 279 t~~~~~m~~vl~~l~~rg--L~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~ 356 (396) -.+.++++.+=+.++-.| .+|+- ...++..+|+|- +.+...+.+.+++..+.++++|..+| T Consensus 145 IEt~~av~nldeI~av~GvD~ifiG--------P~DLs~slG~~g---------~~~~p~v~~ai~~v~~~~~~~gk~~G 207 (249) T TIGR03239 145 IESQKGVDNVDEIAAVDGVDGIFVG--------PSDLAAALGHLG---------NPNHPDVQKAIRHIFDRAAAHGKPCG 207 (249) T ss_pred ECCHHHHHHHHHHHCCCCCCEEEEC--------HHHHHHHCCCCC---------CCCCHHHHHHHHHHHHHHHHCCCCEE T ss_conf 6588999879999678998889988--------489998769999---------99997999999999999998599669 Q ss_pred EEECCHHHHHHHHHHHHHHHHCCCEEEEHHH Q ss_conf 9808878899999997568876918996777 Q gi|254780558|r 357 VAVAFDESIEVISQWLQQEHVRDVSVVPLSC 387 (396) Q Consensus 357 igh~~p~Ti~~L~~w~~~l~~~gi~lVpvS~ 387 (396) |-..-++ -.++|. ++|+.++.++. T Consensus 208 i~~~~~~---~~~~~~----~~G~~~i~~g~ 231 (249) T TIGR03239 208 ILAPVEA---DARRYL----EWGATFVAVGS 231 (249) T ss_pred EECCCHH---HHHHHH----HCCCCEEEEHH T ss_conf 8279999---999999----86998999668 No 79 >PRK06886 hypothetical protein; Validated Probab=40.27 E-value=26 Score=15.30 Aligned_cols=58 Identities=9% Similarity=0.067 Sum_probs=22.2 Q ss_pred EEEECCCCCHHHHHHHHHHHHHHHHHCCE--EEEEEECCH---HHHHHHHHHHHHHHHCCCEEEE Q ss_conf 58862899999999999999999987194--899980887---8899999997568876918996 Q gi|254780558|r 325 DLYLDDQVDRDKIREKLKGLEEIARTTGQ--AIGVAVAFD---ESIEVISQWLQQEHVRDVSVVP 384 (396) Q Consensus 325 dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~--AI~igh~~p---~Ti~~L~~w~~~l~~~gi~lVp 384 (396) |+-+|...|+. .+.|+.+++.+.++|+ =|.++|+.. ..-+...+.+..+.+.||.+|. T Consensus 179 D~H~De~~dp~--~~~le~la~~t~~~G~~GRVt~sH~~sL~~~~~~~~~~l~~~la~agi~vis 241 (328) T PRK06886 179 HVHVDQFNTPK--EKETEQLCDKTIEHGMQGRVVAIHGISIGAHSKEYRYKLYEKMRDAQMMMIA 241 (328) T ss_pred EEECCCCCCCC--HHHHHHHHHHHHHHCCCCCEEEEEHHHHHCCCHHHHHHHHHHHHHCCCCEEE T ss_conf 76018999911--7689999999999577997775330146438999999999999984996998 No 80 >COG1082 IolE Sugar phosphate isomerases/epimerases [Carbohydrate transport and metabolism] Probab=40.14 E-value=26 Score=15.29 Aligned_cols=87 Identities=11% Similarity=0.105 Sum_probs=46.2 Q ss_pred HHHHHHHCCCC-EEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEE--CCCCCCCCCHHHHHHH Q ss_conf 99999985992-99981766568987787855446789989999999999985589359800--1230000498999999 Q gi|254780558|r 212 WMKEAKKKGQE-AILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMN--YRGAMLLSNKESAEVI 288 (396) Q Consensus 212 ~~~~Ar~~G~E-vllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnN--hmGs~~t~~~~~m~~v 288 (396) .++.|.+-|-+ +.+|...-+....... .-.......+....|.+......--+++-| |+|.-.......++.+ T Consensus 89 ~i~~a~~lg~~~vv~~~g~~~~~~~~~~----~~~~~~~~~~~l~~l~~~a~~~~i~l~~e~~~~~~~~~~~~~~~~~~~ 164 (274) T COG1082 89 AIELAKELGAKVVVVHPGLGAGADDPDS----PEEARERWAEALEELAEIAEELGIGLALENHHHPGNVVETGADALDLL 164 (274) T ss_pred HHHHHHHCCCCEEEEECCCCCCCCCCCC----CHHHHHHHHHHHHHHHHHHHHHCCEEEEEECCCCCCEECCHHHHHHHH T ss_conf 9999998499889995477767666656----045689999999999999998098079726457776206858999999 Q ss_pred HHHHHCCCCE-EEECC Q ss_conf 9998506928-99749 Q gi|254780558|r 289 FKEFAKRGLL-FFDDG 303 (396) Q Consensus 289 l~~l~~rgL~-flDs~ 303 (396) ++.-.. +++ .+|.. T Consensus 165 ~~~~~~-~v~~~lD~~ 179 (274) T COG1082 165 REVDSP-NVGLLLDTG 179 (274) T ss_pred HHCCCC-CEEEEEEHH T ss_conf 855688-779998867 No 81 >pfam11814 DUF3335 Domain of unknown function (DUF3335). This family of proteins are functionally uncharacterized. This family is only found in bacteria. This presumed domain is typically between 226 to 230 amino acids in length. Probab=40.12 E-value=26 Score=15.29 Aligned_cols=56 Identities=18% Similarity=0.204 Sum_probs=21.3 Q ss_pred EEECCCCCHH--HHHHHHHH-HHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEHHH Q ss_conf 8862899999--99999999-99999871948999808878899999997568876918996777 Q gi|254780558|r 326 LYLDDQVDRD--KIREKLKG-LEEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSVVPLSC 387 (396) Q Consensus 326 vfLD~~~~~~--~I~~qL~~-l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~lVpvS~ 387 (396) +|||++.+++ .+.+.+.+ ....+.+.|..+ |..+.|++.|.+|+.+ .++.||=+|. T Consensus 100 lFld~VR~~~Kk~v~~~V~~~f~~~~~~~gV~~---~~~~~~~~~l~~~l~~---G~~vlvLIS~ 158 (228) T pfam11814 100 LFLDSVRSENKKEVMRLVHEDFRQELEQLGVPV---HYRPLSLEDLEAALAQ---GAAVLVLIST 158 (228) T ss_pred EEECCCCCHHHHHHHHHHHHHHHHHHHHCCCCE---ECCCCCHHHHHHHHHC---CCEEEEEEEC T ss_conf 552155774278999999999999999779952---3189999999999977---9868999865 No 82 >PRK08434 consensus Probab=39.98 E-value=3.4 Score=20.94 Aligned_cols=28 Identities=21% Similarity=0.453 Sum_probs=18.4 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 0498999999999850692899749763 Q gi|254780558|r 279 LSNKESAEVIFKEFAKRGLLFFDDGSSP 306 (396) Q Consensus 279 t~~~~~m~~vl~~l~~rgL~flDs~Ts~ 306 (396) ..+......+++.+.+.+.+-+|.-|+. T Consensus 302 v~~~~~l~~~l~~l~~~~~~a~d~Ett~ 329 (887) T PRK08434 302 LDDENKLEEILNTLPKESIIAFDTETTG 329 (887) T ss_pred ECCHHHHHHHHHHCCCCCEEEEEEECCC T ss_conf 3899999999985311676999951278 No 83 >TIGR00593 pola DNA polymerase I; InterPro: IPR002298 DNA carries the biological information that instructs cells how to exist in an ordered fashion. Accurate replication is thus one of the most important events in the cell life cycle. This function is mediated by DNA-directed DNA polymerases, which add nucleotide triphosphate (dNTP) residues to the 5'-end of the growing DNA chain, using a complementary DNA as template. Small RNA molecules are generally used as primers for chain elongation, although terminal proteins may also be used. DNA-dependent DNA polymerases have been grouped into families, denoted A, B and X, on the basis of sequence similarities , . Members of family A, which includes bacterial and bacteriophage polymerases, share significant similarity to Escherichia coli polymerase I; hence family A is also known as the pol I family. The bacterial polymerases also contain an exonuclease activity, which is coded for in the N-terminal portion. Three motifs, A, B and C , are seen to be conserved across all DNA polymerases, with motifs A and C also seen in RNA polymerases. They are centred on invariant residues, and their structural significance was implied from the Klenow (E. coli) structure. Motif A contains a strictly-conserved aspartate at the junction of a beta-strand and an alpha-helix; motif B contains an alpha-helix with positive charges; and motif C has a doublet of negative charges, located in a beta-turn-beta secondary structure .; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication. Probab=39.88 E-value=6.3 Score=19.20 Aligned_cols=18 Identities=11% Similarity=0.385 Sum_probs=8.6 Q ss_pred CCHHHHHHHHHHHHCCCC Q ss_conf 498999999999850692 Q gi|254780558|r 280 SNKESAEVIFKEFAKRGL 297 (396) Q Consensus 280 ~~~~~m~~vl~~l~~rgL 297 (396) ....-.+.+|..|+.+|. T Consensus 468 ~~~~~~K~~~~~L~~~g~ 485 (1005) T TIGR00593 468 KVAHDAKFLMHLLKRKGI 485 (1005) T ss_pred CCHHHHHHHHHHHHHCCC T ss_conf 228899999999974377 No 84 >cd06543 GH18_PF-ChiA-like PF-ChiA is an uncharacterized chitinase found in the hyperthermophilic archaeon Pyrococcus furiosus with a glycosyl hydrolase family 18 (GH18) catalytic domain as well as a cellulose-binding domain. Members of this domain family are found not only in archaea but also in eukaryotes and prokaryotes. PF-ChiA exhibits hydrolytic activity toward both colloidal and crystalline (beta/alpha) chitins at high temperature. Probab=39.86 E-value=27 Score=15.26 Aligned_cols=98 Identities=14% Similarity=0.168 Sum_probs=61.2 Q ss_pred CCCCCEEEEECCCCCH--HHHHHHHHHCCCCE--EEECCCCCCCCCCCCCCCCCCCCCCCHHHH--HHHHH--------- Q ss_conf 7935089870898768--99999999859929--998176656898778785544678998999--99999--------- Q gi|254780558|r 194 LPANITLAFASNGNSL--DRWMKEAKKKGQEA--ILQIPMQAFDESYNEDDSYTLKVTQTVQQL--LNRLR--------- 258 (396) Q Consensus 194 LP~~vT~A~~P~~~~~--~~~~~~Ar~~G~Ev--llhlPMEp~~~~~~~pGp~~L~~~~~~~~~--~~~l~--------- 258 (396) |-.-+||.++|.+-.. ...++.|++.|.++ +=-|+|. |-...+. .+|....+ .+.++ T Consensus 140 L~vs~TLpv~p~GLt~~G~~vl~~a~~~Gv~~~~VNiMtMD---yG~~~~~-----~~mg~~ai~Aa~~~~~qL~~~y~~ 211 (294) T cd06543 140 LKISFTLPVLPTGLTPDGLNVLEAAAANGVDLDTVNIMTMD---YGSSAGS-----QDMGAAAISAAESLHDQLKDLYPK 211 (294) T ss_pred EEEEEECCCCCCCCCHHHHHHHHHHHHCCCCCCEEEEEECC---CCCCCCC-----CCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 18999727899998867899999999869985417777225---7788886-----547999999999999999986468 Q ss_pred ------HHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEE Q ss_conf ------9998558935980012300004989999999998506928997 Q gi|254780558|r 259 ------YSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFD 301 (396) Q Consensus 259 ------~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flD 301 (396) |.+--+.--+|+|+--+-.||.+. .+.|....+++||.+|- T Consensus 212 ~s~a~~w~~iGvTpMiG~nD~~~E~Ft~~d--a~~l~~fA~~~glg~ls 258 (294) T cd06543 212 LSDAELWAMIGVTPMIGVNDVGSEVFTLAD--AQTLVDFAKEKGLGRLS 258 (294) T ss_pred CCHHHHHHHCCCCCCCCCCCCCCCEEEHHH--HHHHHHHHHHCCCCEEE T ss_conf 788999976484554577787664652999--99999999867985585 No 85 >TIGR03015 pepcterm_ATPase putative secretion ATPase, PEP-CTERM locus subfamily. Members of this protein are marked as probable ATPases by the nucleotide binding P-loop motif GXXGXGKTT, a motif DEAQ similar to the DEAD/H box of helicases, and extensive homology to ATPases of MSHA-type pilus systems and to GspA proteins associated with type II protein secretion systems. Probab=39.70 E-value=27 Score=15.25 Aligned_cols=95 Identities=15% Similarity=0.220 Sum_probs=47.1 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCC-E--EEECCCCCCCHHHHHH Q ss_conf 878554467899899999999999855893598001230000498999999999850692-8--9974976322678989 Q gi|254780558|r 238 EDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGL-L--FFDDGSSPRNLTRVLA 314 (396) Q Consensus 238 ~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL-~--flDs~Ts~~Sva~~~A 314 (396) .|.|..+..+-.-++...+|.+.+.+-.+.+=|..--|+.=| ..++.+++.|..... . ..++.-+....-..++ T Consensus 16 ~pd~~~~y~s~~h~~al~~L~~~l~~~~g~~lltGe~GtGKT---tllr~l~~~l~~~~~~~~~i~~~~l~~~~ll~~i~ 92 (269) T TIGR03015 16 LPDPDFFYPSKGHKRAMAYLEYGLSQREGFILITGEVGAGKT---TLIRNLLKRLDQERVVAAKLVNTRVDAEDLLRMVA 92 (269) T ss_pred CCCHHHCCCCHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHH---HHHHHHHHHCCCCCEEEEEECCCCCCHHHHHHHHH T ss_conf 998454147866999999999999648965999729989889---99999998459345489997699999999999999 Q ss_pred HHHCCCEEEEEEEECCCCCHHHHHHHHHH Q ss_conf 87099469845886289999999999999 Q gi|254780558|r 315 PKLNLPYMVADLYLDDQVDRDKIREKLKG 343 (396) Q Consensus 315 ~~~gvp~~~~dvfLD~~~~~~~I~~qL~~ 343 (396) .++|++.. ..+...+.++|.+ T Consensus 93 ~~lg~~~~--------~~~~~~~~~~l~~ 113 (269) T TIGR03015 93 ADFGLETE--------GRDKAALLRELED 113 (269) T ss_pred HHCCCCCC--------CCCHHHHHHHHHH T ss_conf 98598988--------9899999999999 No 86 >cd03768 SR_ResInv Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain. Serine recombinases catalyze site-specific recombination of DNA molecules by a concerted, four-strand cleavage and rejoining mechanism which involves a transient phosphoserine linkage between DNA and the enzyme. They are functionally versatile and include resolvases, invertases, integrases, and transposases. Resolvases and invertases affect resolution or inversion and comprise a major phylogenic group. Resolvases (e.g. Tn3, gamma-delta, and Tn5044) normally recombine two sites in direct repeat causing deletion of the DNA between the sites. Invertases (e.g. Gin and Hin) recombine sites in inverted repeat to invert the DNA between the sites. Cointegrate resolution with gamma-delta resolvase requires the formation of a synaptosome of three resolvase dimers bound to each of two res sites on the DNA. Also included in this subfamily are some Probab=39.62 E-value=27 Score=15.24 Aligned_cols=17 Identities=18% Similarity=0.298 Sum_probs=6.8 Q ss_pred HHHHHHHHCCCEEEEHH Q ss_conf 99756887691899677 Q gi|254780558|r 370 QWLQQEHVRDVSVVPLS 386 (396) Q Consensus 370 ~w~~~l~~~gi~lVpvS 386 (396) .++..+..+||.|+.+. T Consensus 75 ~~~~~l~~~gv~l~~~~ 91 (126) T cd03768 75 EIVEELREKGVSLRSLT 91 (126) T ss_pred HHHHHHHHCCCEEEECC T ss_conf 99999997798799755 No 87 >pfam04404 ERF ERF superfamily. The DNA single-strand annealing proteins (SSAPs), such as RecT, Red-beta, ERF and Rad52, function in RecA-dependent and RecA-independent DNA recombination pathways. This family includes proteins related to ERF. Probab=38.76 E-value=28 Score=15.15 Aligned_cols=48 Identities=15% Similarity=0.038 Sum_probs=38.5 Q ss_pred HHHHHHHHHCCCC--EEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEEC Q ss_conf 9999999855893--59800123000049899999999985069289974 Q gi|254780558|r 255 NRLRYSLRRGTGY--FGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDD 302 (396) Q Consensus 255 ~~l~~~l~~~p~~--vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs 302 (396) ..|..+...++.. -+-|.|-++++..-+..++.|-..|.+.||.|.-. T Consensus 3 ~al~~~q~el~~~~K~~~n~~~kykYa~ledIl~ai~p~l~~~Gl~~~~~ 52 (158) T pfam04404 3 QKLAELQKELKQPKKDANNPFYKYKYASLEDILEAIKPALSKHGLSLVQR 52 (158) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEE T ss_conf 99999998755535467488878554669999999999999859379985 No 88 >COG1239 ChlI Mg-chelatase subunit ChlI [Coenzyme metabolism] Probab=37.51 E-value=29 Score=15.03 Aligned_cols=16 Identities=25% Similarity=0.297 Sum_probs=5.0 Q ss_pred CCCCCCHHHHHHHHHC Q ss_conf 7787716889999837 Q gi|254780558|r 179 GLGISQTGTQRAINLL 194 (396) Q Consensus 179 dlG~~~~~~~~ai~~L 194 (396) +=|+..+...+||..| T Consensus 46 ~kGtaKSt~~Rala~L 61 (423) T COG1239 46 EKGTAKSTLARALADL 61 (423) T ss_pred CCCCCHHHHHHHHHHH T ss_conf 8875277999999986 No 89 >COG0391 Uncharacterized conserved protein [Function unknown] Probab=37.23 E-value=29 Score=15.00 Aligned_cols=141 Identities=14% Similarity=0.075 Sum_probs=95.3 Q ss_pred HHHHCCCC--CEEEEE--CCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH-C Q ss_conf 99837935--089870--898768999999998599299981766568987787855446789989999999999985-5 Q gi|254780558|r 190 AINLLPAN--ITLAFA--SNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRR-G 264 (396) Q Consensus 190 ai~~LP~~--vT~A~~--P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~-~ 264 (396) .|.+++.+ .-+.|. ++..-+.+..+..+++- -|++ ||+-+.|+.-+-=+..-+..+|.+ . T Consensus 157 ~i~~~~~~~v~~V~~~~~~~~~a~~eaveAI~~AD-~Ivi--------------GPgSl~TSIlP~Lllp~I~eaLr~~~ 221 (323) T COG0391 157 WIAELGGPPVHRVRLEGPEKPSAAPEAVEAIKEAD-LIVI--------------GPGSLFTSILPILLLPGIAEALRETV 221 (323) T ss_pred CHHHCCCCCCEEEEEECCCCCCCCHHHHHHHHHCC-EEEE--------------CCCCCHHHHCHHHCCHHHHHHHHHCC T ss_conf 06443899740799836888988899999998589-8998--------------68832766430301057999998678 Q ss_pred CCCEEEEECCCCC-----CCCCHHHHHHHHHHHHCC--CCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHH Q ss_conf 8935980012300-----004989999999998506--928997497632267898987099469845886289999999 Q gi|254780558|r 265 TGYFGVMNYRGAM-----LLSNKESAEVIFKEFAKR--GLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKI 337 (396) Q Consensus 265 p~~vGvnNhmGs~-----~t~~~~~m~~vl~~l~~r--gL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I 337 (396) .-.++++|-||-. .-+......++++.+..+ ..+.+|..+.....-.+...+.+.+....+.-+|.......+ T Consensus 222 ap~i~v~n~~~~~g~~t~~~~~~d~i~~i~~~~g~~~iD~vivd~~~~~~~~~~~~~~~~~~~V~~~~~~~~~~~~~~~~ 301 (323) T COG0391 222 APIVYVCNLMTQAGKETDGLSVEDHIAALAQHYGAFVIDAVIVDNDDVEDEDLIRYVEEKGLEVEIDPTLLDREGLRRAL 301 (323) T ss_pred CCEEEECCCCCCCCCCCCCCCHHHHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHCCCEEEECHHHHHCHHHHHHH T ss_conf 97798614778887422456289999999998576658679978888557888777663283378545654121237999 Q ss_pred HHHHHHHH Q ss_conf 99999999 Q gi|254780558|r 338 REKLKGLE 345 (396) Q Consensus 338 ~~qL~~l~ 345 (396) .+.+.+++ T Consensus 302 ~~~~~~~~ 309 (323) T COG0391 302 ARNLLKLE 309 (323) T ss_pred HHHHHHHH T ss_conf 98753232 No 90 >TIGR00624 tag DNA-3-methyladenine glycosylase I; InterPro: IPR004597 One common form of damage to DNA occurs when alkyl groups become covalently linked to DNA, leading to the formation of mutagenic or toxic lesions. Alkylating agents are ubiquitous in the environment and can also exist endogenously; for example S-adenosylmethionine, a commonly used methyl donor in many cellualr pathways, can produce methylation damage. Given the prevalence of alkylating agents it is not surprising that all cellular organisms studied so far have developed repair systems to deal with DNA alkylation . DNA-3-methyladenine glycosylase I (also known as TagI) is part of the broader helix-hairpin-helix DNA repair glycosylase supefamily. It specifically catalyses the release of 3-methylated adenine and, to a lesser extent, guanosine bases from DNA by hydrolysis of the deoxyribose N-glycosidic bond , . A phylogenomic study suggests that TagI evolved in, and is specific to, the bacteria .; GO: 0008725 DNA-3-methyladenine glycosylase I activity, 0006284 base-excision repair. Probab=37.13 E-value=29 Score=15.01 Aligned_cols=32 Identities=25% Similarity=0.402 Sum_probs=23.9 Q ss_pred HHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCC Q ss_conf 9999999998506928997497632267898987099 Q gi|254780558|r 283 ESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNL 319 (396) Q Consensus 283 ~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gv 319 (396) ..-..+-++||+||.-||- -|+++.+.+..|| T Consensus 143 ~ea~Amsk~LKkrGF~FVG-----pTicYalMQA~Gm 174 (185) T TIGR00624 143 SEAKAMSKELKKRGFRFVG-----PTICYALMQATGM 174 (185) T ss_pred HHHHHHHHHHHHCCCCCCC-----HHHHHHHHHHHCC T ss_conf 6789999998756883001-----6899998864157 No 91 >PRK05755 DNA polymerase I; Provisional Probab=36.90 E-value=2 Score=22.42 Aligned_cols=28 Identities=25% Similarity=0.477 Sum_probs=17.7 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 0498999999999850692899749763 Q gi|254780558|r 279 LSNKESAEVIFKEFAKRGLLFFDDGSSP 306 (396) Q Consensus 279 t~~~~~m~~vl~~l~~rgL~flDs~Ts~ 306 (396) ..+......+++.+...+.+-+|--|+. T Consensus 303 I~~~~el~~ll~~l~~~~~va~dtEtts 330 (889) T PRK05755 303 ILDEEELEAWLAKLKKAGLVAFDTETTS 330 (889) T ss_pred ECCHHHHHHHHHHHHCCCEEEEEEECCC T ss_conf 1899999999998652884999984178 No 92 >TIGR00055 uppS undecaprenyl diphosphate synthase; InterPro: IPR001441 Synonym(s): Di-trans-poly-cis-undecaprenyl-diphosphate synthase, Undecaprenyl pyrophosphate synthetase, Undecaprenyl pyrophosphate synthase, UPP synthetase Di-trans-poly-cis-decaprenylcistransferase (2.5.1.31 from EC) (UPP synthetase) generates undecaprenyl pyrophosphate (UPP) from isopentenyl pyrophosphate (IPP) . This bacterial enzyme is also found in archaebacteria and in a number of uncharacterised proteins including some from yeasts.; GO: 0016740 transferase activity, 0008152 metabolic process. Probab=36.86 E-value=23 Score=15.63 Aligned_cols=143 Identities=19% Similarity=0.230 Sum_probs=76.6 Q ss_pred EEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 89997377877168899998379350898708987689999999985992999817665689877878554467899899 Q gi|254780558|r 173 IAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQ 252 (396) Q Consensus 173 iAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~ 252 (396) ||||+|| +... .+ ...++ +-+.=--......+....+++.|-|++= .=+|+- .|| .-+.+| T Consensus 5 Va~IMDG---NgRw-Ak-~~~k~--r~~GH~~G~~~l~~~l~~c~~~Gv~~lT---lYAFS~-ENw--------~RP~~E 65 (237) T TIGR00055 5 VAIIMDG---NGRW-AK-KKGKD--RKYGHEAGVKSLRRILRWCADLGVECLT---LYAFST-ENW--------KRPKEE 65 (237) T ss_pred EEEEECC---CHHH-HH-HCCCC--HHHHHHHHHHHHHHHHHHHHHCCCEEEE---EEEEEC-CCC--------CCCHHH T ss_conf 6787176---1699-98-53883--4455687789999999999866930575---332100-025--------797788 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHH---HCCCCEEEECCCCCCCHHHHHHHHHCCC-EEEEEEEE Q ss_conf 9999999998558935980012300004989999999998---5069289974976322678989870994-69845886 Q gi|254780558|r 253 LLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEF---AKRGLLFFDDGSSPRNLTRVLAPKLNLP-YMVADLYL 328 (396) Q Consensus 253 ~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l---~~rgL~flDs~Ts~~Sva~~~A~~~gvp-~~~~dvfL 328 (396) +. ..|.-+...| .+|+.- .-.+ +..|-.+| T Consensus 66 V~----------------------------~LM~Lf~~~L~eia~re~~------------------~~~~l~nvR~Rii 99 (237) T TIGR00055 66 VD----------------------------ALMELFEKKLDEIAEREVK------------------ELHKLYNVRIRII 99 (237) T ss_pred HH----------------------------HHHHHHHHHHHHHHHHHHH------------------HHHHHCCEEEEEE T ss_conf 99----------------------------9999999999997478998------------------5340238058998 Q ss_pred CC-CCCHHHHHHHHHHHHHHHHH-CCEEEEEEECC---HHHHHHHHHHHHHHHHCCC Q ss_conf 28-99999999999999999987-19489998088---7889999999756887691 Q gi|254780558|r 329 DD-QVDRDKIREKLKGLEEIART-TGQAIGVAVAF---DESIEVISQWLQQEHVRDV 380 (396) Q Consensus 329 D~-~~~~~~I~~qL~~l~~~Ar~-~G~AI~igh~~---p~Ti~~L~~w~~~l~~~gi 380 (396) =+ ..-...++++..++++.-+. ++.-+-||-+| .+=+.+.++.+.+-++..+ T Consensus 100 Gd~s~l~~~Lq~~I~~~ee~T~~n~~~tl~iA~nYGGr~ei~~a~k~~~e~v~~g~~ 156 (237) T TIGR00055 100 GDLSLLSKELQEKIKKAEEDTKNNTDFTLNIAFNYGGRNEILHAVKQIAEKVKKGKL 156 (237) T ss_pred ECCCCCCHHHHHHHHHHHHHHCCCCEEEEEEEECCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 556568988999999999741278847999850789868899999998763002688 No 93 >cd01120 RecA-like_NTPases RecA-like NTPases. This family includes the NTP binding domain of F1 and V1 H+ATPases, DnaB and related helicases as well as bacterial RecA and related eukaryotic and archaeal recombinases. This group also includes bacterial conjugation proteins and related DNA transfer proteins involved in type II and type IV secretion. Probab=36.28 E-value=30 Score=14.91 Aligned_cols=59 Identities=20% Similarity=0.187 Sum_probs=39.8 Q ss_pred HHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEE-EEEEECCHH Q ss_conf 999850692899749763226789898709946984588628999999999999999999871948-999808878 Q gi|254780558|r 289 FKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQA-IGVAVAFDE 363 (396) Q Consensus 289 l~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~A-I~igh~~p~ 363 (396) +.+-.+..|.++||-|+-..-.... ..-....+...+.++.+.|++.+.. |.|+|.+.. T Consensus 80 ~~~~~~~vliiiDSit~~~~a~~e~----------------~~g~~~~v~~~~~~L~~~Ak~~~itvi~i~~v~~d 139 (165) T cd01120 80 LRERGGDDLIILDELTRLVRALREI----------------REGYPGELDEELRELLERARKGGVTVIFTLQVPSG 139 (165) T ss_pred HHHCCCCEEEEEECHHHHHHHHHHC----------------CCCCHHHHHHHHHHHHHHHHCCCCEEEEEEEEECC T ss_conf 9986997799992889988774001----------------58867899999999999997798289999984337 No 94 >COG3265 GntK Gluconate kinase [Carbohydrate transport and metabolism] Probab=36.20 E-value=30 Score=14.90 Aligned_cols=41 Identities=17% Similarity=0.256 Sum_probs=18.4 Q ss_pred CHHHHHHHHHHHHHHHHHC-CEEEEEEECCHHHHHHHHHHHH Q ss_conf 9999999999999999871-9489998088788999999975 Q gi|254780558|r 333 DRDKIREKLKGLEEIARTT-GQAIGVAVAFDESIEVISQWLQ 373 (396) Q Consensus 333 ~~~~I~~qL~~l~~~Ar~~-G~AI~igh~~p~Ti~~L~~w~~ 373 (396) ...-++.||+.||.-.-.. ...|-|.++..+-++....|+. T Consensus 117 ~~~ll~SQfa~LE~P~~de~vi~idi~~~~e~vv~~~~~~l~ 158 (161) T COG3265 117 PASLLDSQFATLEEPGADEDVLTIDIDQPPEEVVAQALAWLK 158 (161) T ss_pred CHHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHH T ss_conf 889999899983589999887986489998999999999986 No 95 >TIGR00707 argD acetylornithine and succinylornithine aminotransferases; InterPro: IPR004636 This family of proteins, for which ornithine aminotransferases form an outgroup, consists mostly of proteins designated acetylornithine aminotransferase. However, the two very closely related members from Escherichia coli are assigned different enzymatic activities. One is acetylornithine aminotransferase (2.6.1.11 from EC), ArgD, an enzyme of arginine biosynthesis, while another is succinylornithine aminotransferase, an enzyme of the arginine succinyltransferase pathway, an ammonia-generating pathway of arginine catabolism . Members of this family may also act on ornithine, like ornithine aminotransferase (2.6.1.13 from EC) and on succinyldiaminopimelate, like N-succinyldiaminopmelate-aminotransferase (2.6.1.17 from EC, DapC, an enzyme of lysine biosynthesis) . ; GO: 0008483 transaminase activity, 0006525 arginine metabolic process. Probab=34.91 E-value=22 Score=15.84 Aligned_cols=146 Identities=10% Similarity=0.179 Sum_probs=88.7 Q ss_pred EEEEECCC--CCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCH Q ss_conf 89997377--8771688999983793508987089876899999999859929998176656898778785544678998 Q gi|254780558|r 173 IAIVVSGL--GISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTV 250 (396) Q Consensus 173 iAIVIddl--G~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~ 250 (396) +=+|+|=+ |+++++--=|..-++.+|+.=|+- +..+=-.|..| -.-+.. T Consensus 220 ~LLI~DEVQtG~GRTG~~fA~~Hygse~~PDi~T--------~AKgLGgG~Pi---------------------GA~~~~ 270 (402) T TIGR00707 220 ALLIFDEVQTGLGRTGKLFAYEHYGSEVEPDIIT--------LAKGLGGGVPI---------------------GAVLAK 270 (402) T ss_pred EEEEEEEEEECCCCCHHHHHHHCCCCCCCCHHHH--------HHHHHHCCCHH---------------------HHHEEH T ss_conf 1899944752877433223312068845523875--------40555278202---------------------232023 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECC Q ss_conf 99999999999855893598001230000498999999999850692899749763226789898709946984588628 Q gi|254780558|r 251 QQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDD 330 (396) Q Consensus 251 ~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~ 330 (396) +++.+.| =+=+| ||=|=.|+=+|...+.+|+- ...+.|||+ T Consensus 271 e~~a~~~-----------~~G~H-GsTFGGNPLAc~Aa~~vl~~---------------------------i~~~~~l~~ 311 (402) T TIGR00707 271 EEVAEAL-----------TPGDH-GSTFGGNPLACAAALAVLEV---------------------------IEKERLLEK 311 (402) T ss_pred HHHHHHC-----------CCCCC-CCCCCCCHHHHHHHHHHHHH---------------------------HHCCCHHHH T ss_conf 6787315-----------89898-66678738999998776665---------------------------505104789 Q ss_pred CCCHHH-HHHHHHHHHHHHH-----HCCEEEEEEECCHHH-HHHHHHHHHHHHHCCCEEEEHHH Q ss_conf 999999-9999999999998-----719489998088788-99999997568876918996777 Q gi|254780558|r 331 QVDRDK-IREKLKGLEEIAR-----TTGQAIGVAVAFDES-IEVISQWLQQEHVRDVSVVPLSC 387 (396) Q Consensus 331 ~~~~~~-I~~qL~~l~~~Ar-----~~G~AI~igh~~p~T-i~~L~~w~~~l~~~gi~lVpvS~ 387 (396) ...... ..++|+.+...-. =+|.-+|||--.... +.+ .+..++++.+|+-+.+.+. T Consensus 312 v~~~G~~f~~~L~~~~~~~~~l~k~VRG~GLmlG~e~~~~~~~a-~~~~~~a~~~GlLv~~Ag~ 374 (402) T TIGR00707 312 VKEKGDYFKERLEELIKKYIDLIKEVRGKGLMLGIELEAPSVAA-KDIVKEAREKGLLVNTAGP 374 (402) T ss_pred HHHHHHHHHHHHHHHCCCCCCCEEEEECCEEEEEEEECCCCCCH-HHHHHHHHHCCCEEECCCC T ss_conf 99988999999998617766740153231234655642786558-9999999848926881788 No 96 >PRK01045 ispH 4-hydroxy-3-methylbut-2-enyl diphosphate reductase; Reviewed Probab=34.28 E-value=33 Score=14.71 Aligned_cols=184 Identities=15% Similarity=0.146 Sum_probs=94.4 Q ss_pred EEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECC-----CCCC-CC-------------- Q ss_conf 997377877168899998379350898708987689999999985992999817-----6656-89-------------- Q gi|254780558|r 175 IVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIP-----MQAF-DE-------------- 234 (396) Q Consensus 175 IVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlP-----MEp~-~~-------------- 234 (396) +||---|.+... .+.+.+.-..|.=|--|+-...++.+++..+.|..++++== ++.. +| T Consensus 69 ViirAHGi~~~~-~~~~~~~g~~viDaTCP~V~k~~~~v~~~~~~Gy~iviiG~~~HpEv~gi~g~~~~~~~vv~~~~d~ 147 (304) T PRK01045 69 VIFSAHGVSPAV-REEAKERGLTVIDATCPLVTKVHKEVARMSREGYEIILIGHKGHPEVEGTMGQAPDGVYLVESPEDV 147 (304) T ss_pred EEEECCCCCHHH-HHHHHHCCCCEEECCCCCHHHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHCCCCEEEECCHHHH T ss_conf 999778998999-9999965997770508305899999999985898899981789889996997579973998869999 Q ss_pred ---CCCCCCCC--CCCCCCCHHHHHHHHHHHHHHCCCCEEE-EECCCCCCCCCHHHHHHHHHHH-HCC-CCEEEECCCCC Q ss_conf ---87787855--4467899899999999999855893598-0012300004989999999998-506-92899749763 Q gi|254780558|r 235 ---SYNEDDSY--TLKVTQTVQQLLNRLRYSLRRGTGYFGV-MNYRGAMLLSNKESAEVIFKEF-AKR-GLLFFDDGSSP 306 (396) Q Consensus 235 ---~~~~pGp~--~L~~~~~~~~~~~~l~~~l~~~p~~vGv-nNhmGs~~t~~~~~m~~vl~~l-~~r-gL~flDs~Ts~ 306 (396) +..++..- .--++|+.++....+.....++|+.... .|-. -..|.++. +.+ .+| ++- -++++-+..|+ T Consensus 148 ~~l~~~~~~ki~vvsQTT~~~~~~~~i~~~l~~~~~~~~~~~~nTI-C~aT~~RQ--~a~-~~La~~vD~miVvGg~nSs 223 (304) T PRK01045 148 AKLDVKDPDKLALVTQTTLSVDDTAEIIAALKARFPEIRGPRKDDI-CYATQNRQ--EAV-KELAKQVDLVIVVGSKNSS 223 (304) T ss_pred HHCCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCCCCCCCCCC-CHHHHHHH--HHH-HHHHHCCCEEEEECCCCCC T ss_conf 7577689971899972556599999999999976836525732352-07789999--999-9877418889997688996 Q ss_pred CCH-HHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEEEEE--CCH-HHHHHHHHHHHHHHH Q ss_conf 226-78989870994698458862899999999999999999987194899980--887-889999999756887 Q gi|254780558|r 307 RNL-TRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQAIGVAV--AFD-ESIEVISQWLQQEHV 377 (396) Q Consensus 307 ~Sv-a~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh--~~p-~Ti~~L~~w~~~l~~ 377 (396) ||. -.++|++.+.+... +++. +.|+..+- +.-.-|||.. -.| ..|+-+...+.++.. T Consensus 224 NT~~L~~i~~~~~~~t~~----Ie~~---~el~~~~~-------~~~~~VGITAGASTP~~lIeeVi~~l~~l~~ 284 (304) T PRK01045 224 NSNRLREVAERAGAPAYL----IDDA---SEIDPEWF-------KGVKTVGVTAGASAPEWLVQEVIARLKELGP 284 (304) T ss_pred CHHHHHHHHHHHCCCEEE----ECCH---HHCCHHHH-------CCCCEEEEECCCCCCHHHHHHHHHHHHHHCC T ss_conf 489999999987999599----6881---66999894-------7999899972337889999999999984189 No 97 >PRK05797 consensus Probab=33.88 E-value=6.1 Score=19.30 Aligned_cols=95 Identities=11% Similarity=0.200 Sum_probs=48.9 Q ss_pred CCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEE---EEEEE--CCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 00049899999999985069289974976322678989870994698---45886--28999999999999999999871 Q gi|254780558|r 277 MLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMV---ADLYL--DDQVDRDKIREKLKGLEEIARTT 351 (396) Q Consensus 277 ~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~---~dvfL--D~~~~~~~I~~qL~~l~~~Ar~~ 351 (396) ....+...+..++..++....+ +|.-|+..... .+...|+-+.. .-.++ .+..+...+...|..+. .+ T Consensus 302 ~~i~~~~~l~~l~~~l~~~~~~-~~~ett~~~~~--~~~~vg~s~~~~~~~a~yip~~~~~~~~~~L~~Lk~lL----ed 374 (869) T PRK05797 302 KNIVDIEELKDLFNNIKNNQVY-INFTLEDENLY--SKIEIKKIFINFGEKTYYIDFKELIDEEEFIEDLKEIF----EN 374 (869) T ss_pred EEECCHHHHHHHHHHHHCCCEE-EEEECCCCCHH--HHHHEEEEEEECCCEEEEEECHHHCCHHHHHHHHHHHH----HC T ss_conf 5538999999999861349879-99973897612--03041899996199799983145167789999999998----48 Q ss_pred CEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEH Q ss_conf 9489998088788999999975688769189967 Q gi|254780558|r 352 GQAIGVAVAFDESIEVISQWLQQEHVRDVSVVPL 385 (396) Q Consensus 352 G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~lVpv 385 (396) ...--|||-....+.+|.. .||.+-.+ T Consensus 375 ~~i~KIGhNlK~dl~vL~~-------~GI~l~gi 401 (869) T PRK05797 375 EEIKKIGHDIKNFLTILKK-------LGIEFKGL 401 (869) T ss_pred CCCCEEEECCHHHHHHHHH-------CCCCCCCH T ss_conf 9987898430699999997-------59855661 No 98 >KOG0087 consensus Probab=33.64 E-value=25 Score=15.45 Aligned_cols=13 Identities=31% Similarity=0.948 Sum_probs=6.0 Q ss_pred CHHHHHHHHHHCC Q ss_conf 6899999999859 Q gi|254780558|r 208 SLDRWMKEAKKKG 220 (396) Q Consensus 208 ~~~~~~~~Ar~~G 220 (396) +..+|..+.|..- T Consensus 104 nv~rWL~ELRdha 116 (222) T KOG0087 104 NVERWLKELRDHA 116 (222) T ss_pred HHHHHHHHHHHCC T ss_conf 8999999998558 No 99 >COG2183 Tex Transcriptional accessory protein [Transcription] Probab=33.56 E-value=33 Score=14.63 Aligned_cols=36 Identities=11% Similarity=0.021 Sum_probs=19.5 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 855446789989999999999985589359800123 Q gi|254780558|r 240 DSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRG 275 (396) Q Consensus 240 Gp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmG 275 (396) |-+-.--+.+..++.+.|+.+...+-|.|||+--.- T Consensus 471 GVgqyQHdv~q~~L~~~Ld~vved~VN~VGVdvNtA 506 (780) T COG2183 471 GVGQYQHDVSQKKLAESLDAVVEDCVNAVGVDVNTA 506 (780) T ss_pred CCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 555321468878999999999998731026113427 No 100 >PRK13359 beta-ketoadipyl CoA thiolase; Provisional Probab=33.36 E-value=34 Score=14.61 Aligned_cols=172 Identities=17% Similarity=0.175 Sum_probs=93.7 Q ss_pred ECCCCCHHHHHHHHHHCCC--CEEEECCCCCCCCC------CCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 0898768999999998599--29998176656898------778785544678998999999999998558935980012 Q gi|254780558|r 203 ASNGNSLDRWMKEAKKKGQ--EAILQIPMQAFDES------YNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYR 274 (396) Q Consensus 203 ~P~~~~~~~~~~~Ar~~G~--EvllhlPMEp~~~~------~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhm 274 (396) .-|+-.+++.+..|++.|+ +-++ |++-.+.. +.|.++. +.+. |-..+|.-.+.. -|....- T Consensus 182 D~~A~~Sh~rA~~A~~~G~f~~EIv--Pv~~~~~~g~~~~~~~De~~R----~tt~-e~La~L~p~f~~----~GtvTAg 250 (400) T PRK13359 182 DAFALRSQQKAARAQADGTLAQEIV--PVTIPQKKGDPVVVSRDEHPR----ETTL-EALAKLKGVVRP----DGTVTAG 250 (400) T ss_pred HHHHHHHHHHHHHHHHCCCCHHEEE--EEEECCCCCCEEEEECCCCCC----CCCH-HHHHCCCCCCCC----CCCEECC T ss_conf 9999999999999876598402588--677526778637880687866----6776-666326875345----6856354 Q ss_pred CCCCCCCHHHHHHHHHHHH--CCCCE----EEECCC----------CCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHH Q ss_conf 3000049899999999985--06928----997497----------6322678989870994698458862899999999 Q gi|254780558|r 275 GAMLLSNKESAEVIFKEFA--KRGLL----FFDDGS----------SPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIR 338 (396) Q Consensus 275 Gs~~t~~~~~m~~vl~~l~--~rgL~----flDs~T----------s~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~ 338 (396) -|--.+|-.+.-.|+++-+ +.||- +++..+ .+-.-..++-++.|+...--|+|==|+-...... T Consensus 251 NsS~~~DGAAavll~s~~~a~~~gl~p~ari~~~a~~g~dP~~m~~gp~~A~~k~L~~agl~~~Did~~EinEAFA~q~l 330 (400) T PRK13359 251 NASGVNDGACALLLASEAAARRHGLKPRARVLGIATAGVEPRVMGIGPAPATQKLLARLGMTLDQFDVIELNEAFASQGL 330 (400) T ss_pred CCCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEEEEEECCHHHHHHCHHHHHHHHHHHCCCCCCCCCEEEEEHHHHHHHH T ss_conf 37854326788998659999987998059999889998287885336999999999976998456877786046688899 Q ss_pred HHHHHHHH-----HHHHCCEEEEEEECCHHHHHHH-HHHHHHHHHCC--CEEEEH Q ss_conf 99999999-----9987194899980887889999-99975688769--189967 Q gi|254780558|r 339 EKLKGLEE-----IARTTGQAIGVAVAFDESIEVI-SQWLQQEHVRD--VSVVPL 385 (396) Q Consensus 339 ~qL~~l~~-----~Ar~~G~AI~igh~~p~Ti~~L-~~w~~~l~~~g--i~lVpv 385 (396) ..++++.. +---+|-+|++|||+--|=.+| ...+.+|+.+| +-|+.+ T Consensus 331 ~~~~~lg~~~d~~kvN~~GGaiAlGHP~GAsGarl~~~l~~~L~~~~~~~Gl~t~ 385 (400) T PRK13359 331 AVLRMLGVADDDARVNPNGGAIALGHPLGASGARLVTTALYQLHRTGGRFALCTM 385 (400) T ss_pred HHHHHCCCCCCCCCCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 9999819998777489886688748880224799999999999980999899996 No 101 >TIGR00086 smpB SsrA-binding protein; InterPro: IPR000037 In bacteria, SsrA RNA recognises ribosomes stalled on defective messages and acts as a tRNA and mRNA to mediate the addition of a short peptide tag to the C-terminus of the partially synthesized nascent polypeptide chain. The SsrA-tagged protein is then degraded by C-terminal-specific proteases. SmpB, a unique RNA-binding protein that is conserved throughout the bacterial kingdom is an essential component of the SsrA quality-control system. Deletion of the smpB gene in Escherichia coli results in the same phenotypes observed in ssrA-defective cells, including a variety of phage development defects and the failure to tag proteins translated from defective mRNAs. Purified SmpB binds specifically and with high affinity to SsrA RNA and is required for stable association of SsrA with ribosomes in vivo. Formation of an SmpB-SsrA complex appears to be critical in mediating SsrA activity after aminoacylation with alanine but prior to the transpeptidation reaction that couples this alanine to the nascent chain. SsrA RNA is present at wild-type levels in the smpB mutant arguing against a model of SsrA action that involves direct competition for transcription factors .; GO: 0003723 RNA binding, 0006412 translation. Probab=33.20 E-value=23 Score=15.62 Aligned_cols=24 Identities=21% Similarity=0.297 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHCCCEEEEHHHHHH Q ss_conf 999997568876918996777430 Q gi|254780558|r 367 VISQWLQQEHVRDVSVVPLSCLAK 390 (396) Q Consensus 367 ~L~~w~~~l~~~gi~lVpvS~l~~ 390 (396) -|.+|-......|++|||+|--++ T Consensus 84 EI~kL~~~~~~~GlTlVPl~~Yf~ 107 (145) T TIGR00086 84 EIAKLQGKIKEKGLTLVPLKLYFK 107 (145) T ss_pred HHHHHHHHHCCCCCEEEEEEEEEE T ss_conf 899999875126844644445560 No 102 >pfam00455 DeoR Bacterial regulatory proteins, deoR family. Probab=33.14 E-value=30 Score=14.93 Aligned_cols=117 Identities=18% Similarity=0.115 Sum_probs=61.2 Q ss_pred EEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 89997377877168899998379350898708987689999999985992999817665689877878554467899899 Q gi|254780558|r 173 IAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQ 252 (396) Q Consensus 173 iAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~ 252 (396) -.|.||. + +.+..-...||..--+-+. .++..-....++..+.+|++ .+. ..++.-+. .++....+ T Consensus 19 ~~Ifld~---G-TT~~~la~~L~~~~~ltVv--Tnsl~ia~~L~~~~~~~v~l------lGG-~~~~~~~~-~~G~~a~~ 84 (157) T pfam00455 19 DTIFLDA---G-TTVEELARALLGHINLTVI--TNSLPIANALSEKEHFRLIL------IGG-EYRRKTGS-FVGPLAES 84 (157) T ss_pred CEEEECC---C-HHHHHHHHHHHCCCCEEEE--ECCHHHHHHHHHCCCCEEEE------ECC-CCCCCCCE-EECHHHHH T ss_conf 9999909---6-8999999998538987999--89899999998589967998------689-22478873-89999999 Q ss_pred HHHHHHHHHHHCCCCEEEEECCC-CCCCCCHHHHHHHHHHHHCCCCEEEECCC Q ss_conf 99999999985589359800123-00004989999999998506928997497 Q gi|254780558|r 253 LLNRLRYSLRRGTGYFGVMNYRG-AMLLSNKESAEVIFKEFAKRGLLFFDDGS 304 (396) Q Consensus 253 ~~~~l~~~l~~~p~~vGvnNhmG-s~~t~~~~~m~~vl~~l~~rgL~flDs~T 304 (396) ..+.++..+ .|=+..|++..-| +-+..++..++..|-...++-....|+.. T Consensus 85 ~l~~~~~D~-afig~~gi~~~~G~~~~~~~~a~~k~~~~~~s~~~ill~D~sK 136 (157) T pfam00455 85 FLSKFRFDK-AFFSADGIDLDGGLTTSNLLEASVKRAMLEVAREVILLADHSK 136 (157) T ss_pred HHHHCCCCE-EEEECCEECCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECHHH T ss_conf 998127999-9980762668997554899999999999971898999980436 No 103 >PRK08835 consensus Probab=32.90 E-value=3.5 Score=20.79 Aligned_cols=25 Identities=16% Similarity=0.336 Sum_probs=11.7 Q ss_pred CHHHHHHHHHHHHCCCCEEEECCCC Q ss_conf 9899999999985069289974976 Q gi|254780558|r 281 NKESAEVIFKEFAKRGLLFFDDGSS 305 (396) Q Consensus 281 ~~~~m~~vl~~l~~rgL~flDs~Ts 305 (396) +....+.++..+...+.+-+|.-|+ T Consensus 337 ~~~~l~~~l~~l~~~~~~a~dtEt~ 361 (931) T PRK08835 337 DEASFQLWLDKLKAAELFAFDTETD 361 (931) T ss_pred CHHHHHHHHHHHHCCCEEEEEEECC T ss_conf 8899999999853388599995048 No 104 >PRK06361 hypothetical protein; Provisional Probab=32.74 E-value=35 Score=14.55 Aligned_cols=41 Identities=10% Similarity=0.013 Sum_probs=21.7 Q ss_pred HHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEHH Q ss_conf 999987194899980887889999999756887691899677 Q gi|254780558|r 345 EEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSVVPLS 386 (396) Q Consensus 345 ~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~lVpvS 386 (396) .+.|+++|.++-|- .+|.-.+--..|+...++.|+.|+=-| T Consensus 133 a~~Aae~gtaLEIN-a~p~rld~~~~~~r~A~~~Gv~i~I~s 173 (216) T PRK06361 133 AALAAENGVFLEIT-ARKGHNLTNGHVARLAREAGAPLVINT 173 (216) T ss_pred HHHHHHHCCEEEEE-CCCCCCCCCHHHHHHHHHCCCEEEEEC T ss_conf 99999819589996-897764777999999998699699978 No 105 >PRK05769 4-aminobutyrate aminotransferase; Provisional Probab=32.66 E-value=35 Score=14.54 Aligned_cols=12 Identities=33% Similarity=0.321 Sum_probs=4.3 Q ss_pred HHHHHHHHHHCC Q ss_conf 899999999859 Q gi|254780558|r 209 LDRWMKEAKKKG 220 (396) Q Consensus 209 ~~~~~~~Ar~~G 220 (396) .++..+..++.| T Consensus 245 l~~l~~lc~~~g 256 (442) T PRK05769 245 FKELRKLADKYG 256 (442) T ss_pred HHHHHHHHHHCC T ss_conf 989999998749 No 106 >PRK07300 consensus Probab=32.17 E-value=5.7 Score=19.50 Aligned_cols=13 Identities=15% Similarity=0.312 Sum_probs=4.0 Q ss_pred HHHHHHHHHHCCE Q ss_conf 9999999987194 Q gi|254780558|r 341 LKGLEEIARTTGQ 353 (396) Q Consensus 341 L~~l~~~Ar~~G~ 353 (396) +++....|+++|+ T Consensus 754 ~~~~~~~a~~~Gy 766 (880) T PRK07300 754 MENVVREAKDKGY 766 (880) T ss_pred HHHHHHHHHHCCC T ss_conf 9999999997688 No 107 >cd00338 Ser_Recombinase Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or C-terminal to the catalytic domain. These enzymes perform site-specific recombination of DNA molecules by a concerted, four-strand cleavage and rejoining mechanism which involves a transient phosphoserine linkage between DNA and serine recombinase. Serine recombinases demonstrate functional versatility and include resolvases, invertases, integrases, and transposases. Resolvases and invertases (i.e. Tn3, gamma-delta, Tn5044 resolvases, Gin and Hin invertases) in this family contain a C-terminal DNA binding domain and comprise a major phylogenic group. Also included are phage- and bacterial-encoded recombinases such as phiC31 integrase, SpoIVCA excisionase, and Tn4451 TnpX transposase. These integrases and transposases have larger C-terminal domains compared to resolvases/invertases and are referred to as large serine recombinases. Also belonging to this family are protei Probab=31.36 E-value=36 Score=14.41 Aligned_cols=20 Identities=10% Similarity=-0.006 Sum_probs=6.4 Q ss_pred CCCCCCHHHHHHHHHHHHCC Q ss_conf 00004989999999998506 Q gi|254780558|r 276 AMLLSNKESAEVIFKEFAKR 295 (396) Q Consensus 276 s~~t~~~~~m~~vl~~l~~r 295 (396) |||.-+......++..|+++ T Consensus 75 dRl~R~~~~~~~~~~~l~~~ 94 (137) T cd00338 75 DRLSRNLVDLLELLELLEAH 94 (137) T ss_pred CCCCCCHHHHHHHHHHHHHC T ss_conf 80788899999999999987 No 108 >PRK11234 nfrB bacteriophage N4 adsorption protein B; Provisional Probab=31.33 E-value=32 Score=14.76 Aligned_cols=11 Identities=18% Similarity=0.706 Sum_probs=5.7 Q ss_pred EEEECCCCCCC Q ss_conf 99981766568 Q gi|254780558|r 223 AILQIPMQAFD 233 (396) Q Consensus 223 vllhlPMEp~~ 233 (396) =|+|+|--|+. T Consensus 188 d~iQlPV~~l~ 198 (646) T PRK11234 188 DLIQLPVYPLE 198 (646) T ss_pred CEEECCCCCCC T ss_conf 56535611265 No 109 >PRK10660 tilS tRNA(Ile)-lysidine synthetase; Provisional Probab=31.30 E-value=36 Score=14.40 Aligned_cols=66 Identities=17% Similarity=0.266 Sum_probs=34.2 Q ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCC-CCCCHH--------HHHHHHHHHHCCCCEEEECCCCCCCH Q ss_conf 44678998999999999998558935980012300-004989--------99999999850692899749763226 Q gi|254780558|r 243 TLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAM-LLSNKE--------SAEVIFKEFAKRGLLFFDDGSSPRNL 309 (396) Q Consensus 243 ~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~-~t~~~~--------~m~~vl~~l~~rgL~flDs~Ts~~Sv 309 (396) .|.+.-..++...++--.|-|-.|..|+.. |..+ -..+-. .=+-+...++++|+-|++|-|...+. T Consensus 111 ~l~tAHh~dDQaETvLlrL~RGsG~~GL~g-m~~~r~~~~~~liRPLL~~~r~eI~~Y~~~~~l~~~eD~SN~d~~ 185 (433) T PRK10660 111 VLVTAQHLDDQCETFLLALKRGSGPAGLSA-MAEVSPFAGTQLLRPLLARTRGELEQWAQAHGLRWIEDESNQDDR 185 (433) T ss_pred EEEECCCCCHHHHHHHHHHHCCCCCCCCCC-CCCCCCCCCCEEECCCHHHHHHHHHHHHHHCCCCCEECCCCCCCC T ss_conf 799624565199999999865899644567-752244789707737446659999999998499803899999761 No 110 >COG1780 NrdI Protein involved in ribonucleotide reduction [Nucleotide transport and metabolism] Probab=30.95 E-value=37 Score=14.36 Aligned_cols=84 Identities=12% Similarity=0.060 Sum_probs=52.8 Q ss_pred CEEEEECCCCCCCCCHHHHHHHHHHHH-----CCCCEEEECCCCCC-----CHHHHHHHHHCCCEEEEEEEECCCCCHHH Q ss_conf 359800123000049899999999985-----06928997497632-----26789898709946984588628999999 Q gi|254780558|r 267 YFGVMNYRGAMLLSNKESAEVIFKEFA-----KRGLLFFDDGSSPR-----NLTRVLAPKLNLPYMVADLYLDDQVDRDK 336 (396) Q Consensus 267 ~vGvnNhmGs~~t~~~~~m~~vl~~l~-----~rgL~flDs~Ts~~-----Sva~~~A~~~gvp~~~~dvfLD~~~~~~~ 336 (396) |+=|.-..|+. -....++.+|.+-|. ++=++.+-|+.+-- --+..++++.++|++.+.=.+-...|... T Consensus 43 yvlitpTyg~G-~~~~~Vp~~vi~FLn~~~Nr~~~rGViaSGN~NfG~~f~~Ag~~iS~k~~vPlLy~FEL~GT~~Dv~~ 121 (141) T COG1780 43 YVLITPTYGGG-GTVGAVPKQVIRFLNNEHNRALCRGVIASGNRNFGDNFALAGDVISAKCGVPLLYRFELLGTAEDVAA 121 (141) T ss_pred EEEEECCCCCC-CCCCCCCHHHHHHHCCCCCHHHEEEEEECCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHH T ss_conf 69995332588-75672678899986364255446778852776277888887899998749977998752699899999 Q ss_pred HHHHHHHHHHHHHHC Q ss_conf 999999999999871 Q gi|254780558|r 337 IREKLKGLEEIARTT 351 (396) Q Consensus 337 I~~qL~~l~~~Ar~~ 351 (396) +++-+.++-+.+.++ T Consensus 122 v~~~v~~~~~~~~~~ 136 (141) T COG1780 122 VRKGVTEFWKRAPQN 136 (141) T ss_pred HHHHHHHHHHHCCCC T ss_conf 999999998748765 No 111 >pfam01936 DUF88 Protein of unknown function DUF88. This highly conserved bacterial protein has no known function. The alignment contains many conserved aspartates, suggesting an enzymatic function such as an endonuclease or glycosyl hydrolase (Bateman A pers. obs). Probab=30.82 E-value=37 Score=14.35 Aligned_cols=105 Identities=12% Similarity=0.227 Sum_probs=49.1 Q ss_pred EEEEEECCCCC------CCHHHHHHHHHCCCCCEEEEECCCC----CHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCC Q ss_conf 58999737787------7168899998379350898708987----6899999999859929998176656898778785 Q gi|254780558|r 172 RIAIVVSGLGI------SQTGTQRAINLLPANITLAFASNGN----SLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDS 241 (396) Q Consensus 172 riAIVIddlG~------~~~~~~~ai~~LP~~vT~A~~P~~~----~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp 241 (396) |+||-||+--+ +...-.+.+.+.. +|.. +.-|+. ..+.|.+..+..|.+ +.|.|.-. .+. T Consensus 1 rvaVfiD~~N~~~~~~~d~~~l~~~i~~~g-~v~~-~~aY~~~~~~~~~~~~~~L~~~Gi~-v~~~~~~~-------~~K 70 (140) T pfam01936 1 RVAVFIDGENCPVPAGIDYRKVLEEIKSGG-EVVR-ARAYGDWSDPKLRKFPDALSSTGIP-VQHKPLTK-------SGK 70 (140) T ss_pred CEEEEEECCCCCCCCCCCHHHHHHHHHCCC-CEEE-EEEEECCCCCCHHHHHHHHHHCCCE-EEEECCCC-------CCC T ss_conf 979999626587532479999999998279-7899-9999478861047799999986971-89945515-------876 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCE Q ss_conf 544678998999999999998558935980012300004989999999998506928 Q gi|254780558|r 242 YTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLL 298 (396) Q Consensus 242 ~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~ 298 (396) .+.+-.=..+.++|+...-+..+ --+|+|. -+.++++.|+++|.- T Consensus 71 ----n~~D~~l~vD~~~~~~~~~~d~~-------ilvsgD~-Df~~~~~~lr~~g~~ 115 (140) T pfam01936 71 ----NAVDVGLAVDALELAYDNNPDTF-------VLVSGDG-DFAPLLERLRERGKR 115 (140) T ss_pred ----CCHHHHHHHHHHHHHHCCCCCEE-------EEEECCH-HHHHHHHHHHHCCCE T ss_conf ----50349999999999733899989-------9992674-079999999988999 No 112 >cd06305 PBP1_methylthioribose_binding_like Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. The sugar-binding domain of the periplasmic proteins in this group is also homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR), DNA-binding transcriptional repressors such as LacI and GalR. Probab=30.64 E-value=37 Score=14.33 Aligned_cols=26 Identities=15% Similarity=0.377 Sum_probs=12.4 Q ss_pred EEEECCCCCH-HHHHHHHHHCCCCEEE Q ss_conf 9870898768-9999999985992999 Q gi|254780558|r 200 LAFASNGNSL-DRWMKEAKKKGQEAIL 225 (396) Q Consensus 200 ~A~~P~~~~~-~~~~~~Ar~~G~Evll 225 (396) +.+.|..+.. ....++|+++|-.|+. T Consensus 59 Iii~p~d~~~~~~~l~~a~~aGIPVV~ 85 (273) T cd06305 59 IIIQHGRAEVLKPWVKRALDAGIPVVA 85 (273) T ss_pred EEEECCCHHHHHHHHHHHHHCCCCEEE T ss_conf 999468714448999999985997899 No 113 >PRK13210 putative L-xylulose 5-phosphate 3-epimerase; Reviewed Probab=30.27 E-value=38 Score=14.29 Aligned_cols=156 Identities=16% Similarity=0.156 Sum_probs=72.9 Q ss_pred ECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCE-E--EEECCCCCC- Q ss_conf 089876899999999859929998176656898778785544678998999999999998558935-9--800123000- Q gi|254780558|r 203 ASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYF-G--VMNYRGAML- 278 (396) Q Consensus 203 ~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~v-G--vnNhmGs~~- 278 (396) +|..-+..+-...|++.|++.+ .+ .+|+.+. .--..+++.++..+.... +....=.+ + ..-||.--| T Consensus 12 ~p~~~sw~e~f~~Ak~~Gfd~I-E~---siDe~d~----~~~~l~~~~~~~~~i~~~-~~~~gl~I~s~~~s~~~~~pl~ 82 (284) T PRK13210 12 LPKDLSWPERLVLAKECGFDFV-EM---SVDETDE----RLARLDWSKEERLELVKA-IYETGVRIPSMCLSAHRRFPFG 82 (284) T ss_pred CCCCCCHHHHHHHHHHCCCCEE-EE---ECCCCCC----CCCCCCCCHHHHHHHHHH-HHHCCCEEEEEECCCCCCCCCC T ss_conf 6999999999999998699889-99---6067542----225789998999999999-9982983566415566689999 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEEC---CCCCHH---HHHHHHHHHHHHHHHCC Q ss_conf 049899999999985069289974976322678989870994698458862---899999---99999999999998719 Q gi|254780558|r 279 LSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLD---DQVDRD---KIREKLKGLEEIARTTG 352 (396) Q Consensus 279 t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD---~~~~~~---~I~~qL~~l~~~Ar~~G 352 (396) ..|+...+.-++.+++ +-.+|..+|+++..- ...| ...+.+ .....|.++..+|.++| T Consensus 83 s~d~~~r~~~le~l~k---------------aI~lA~~LGi~~I~l-~g~dv~~~~~~~~~~~rf~e~l~~~~~~Ae~~g 146 (284) T PRK13210 83 SRDEATRERALEIMKK---------------AIRLAQDLGIRTIQL-AGYDVYYEEKDEETRQRFIEGLAWAVEQAAAAQ 146 (284) T ss_pred CCCHHHHHHHHHHHHH---------------HHHHHHHCCCCEEEE-CCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9898999999999999---------------999999809978996-887666686988999999999999999999839 Q ss_pred EEEEEEE---CCHHHHHHHHHHHHHHHHCCCEEE Q ss_conf 4899980---887889999999756887691899 Q gi|254780558|r 353 QAIGVAV---AFDESIEVISQWLQQEHVRDVSVV 383 (396) Q Consensus 353 ~AI~igh---~~p~Ti~~L~~w~~~l~~~gi~lV 383 (396) .-|+|=- ++-.|++.+.+++.++...-+.+. T Consensus 147 V~L~iE~~~~~f~~t~~~~~~~i~~v~sp~l~v~ 180 (284) T PRK13210 147 VMLAVEIMDTPFMNSISKWKKWDEEIDSPWFTVY 180 (284) T ss_pred CEEEEEECCCCCCCCHHHHHHHHHHCCCCCEEEE T ss_conf 9899995676554779999999996499836999 No 114 >cd01306 PhnM PhnM is believed to be a subunit of the membrane associated C-P lyase complex. C-P lyase is thought to catalyze the direct cleavage of inactivated C-P bonds to yield inorganic phosphate and the corresponding hydrocarbons. It is responsible for cleavage of alkylphosphonates, which are utilized as sole phosphorus sources by many bacteria. Probab=30.25 E-value=38 Score=14.29 Aligned_cols=102 Identities=17% Similarity=0.152 Sum_probs=54.7 Q ss_pred HHHHHHHHHHHHH-CCCCEEEEECC-CCC---------------CCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHH Q ss_conf 9999999999985-58935980012-300---------------004989999999998506928997497632267898 Q gi|254780558|r 251 QQLLNRLRYSLRR-GTGYFGVMNYR-GAM---------------LLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVL 313 (396) Q Consensus 251 ~~~~~~l~~~l~~-~p~~vGvnNhm-Gs~---------------~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~ 313 (396) .+..+.|..++.. --.++-+|+|. |-+ .--+..-++..+..+..+..-+.. ..+...... T Consensus 95 ~~~~~~l~~~~~~~~v~lvS~mDHtPGqrQ~~~~~~~~~~~~~~~~~s~~e~~~~~~~~~~~~~~~~~---~~~~~l~~~ 171 (325) T cd01306 95 PAVLPELESLMADPRVHLVSLMDHTPGQRQFRDLEKYREYYAKKYGLSDEEVEEAILERKARAAAYAP---ANRSELAAL 171 (325) T ss_pred CHHHHHHHHHHCCCCCCEEEEECCCCHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH---HHHHHHHHH T ss_conf 04899999986088867698735883164434518999999875399999999999999998763239---999999999 Q ss_pred HHHHCCCEEEEEEEECCCCCHHHHHHHH------------HHHHHHHHHCCEEEEEEEC Q ss_conf 9870994698458862899999999999------------9999999871948999808 Q gi|254780558|r 314 APKLNLPYMVADLYLDDQVDRDKIREKL------------KGLEEIARTTGQAIGVAVA 360 (396) Q Consensus 314 A~~~gvp~~~~dvfLD~~~~~~~I~~qL------------~~l~~~Ar~~G~AI~igh~ 360 (396) |+..|++.++ .++.+.+.+.... .++++.||+.|-.|.+|-| T Consensus 172 a~~~gi~laS-----HDD~t~e~v~~~~~~G~~IaEFPtT~eAA~~A~~~G~~vvmGAP 225 (325) T cd01306 172 ARARGIPLAS-----HDDDTPEHVAEAHELGVVISEFPTTLEAAKAARELGLQTLMGAP 225 (325) T ss_pred HHHCCCCEEE-----CCCCCHHHHHHHHHCCCEEEECCCHHHHHHHHHHCCCCEEECCC T ss_conf 9977993730-----78999999999997798685352159999999975997782288 No 115 >TIGR00542 hxl6Piso_put hexulose-6-phosphate isomerase, putative; InterPro: IPR004560 This family shows similarity to other isomerases. Putative identification as hexulose-6-phosphate isomerase has been reported. This family is conserved at better than 40 0dentity among the four known examples from three species: Escherichia coli (SgbU and SgaU), Haemophilus influenzae, and Mycoplasma pneumoniae. The rarity of the family, high level of conservation, and proposed catabolic role suggests lateral transfer may be a part of the evolutionary history of this protein.; GO: 0016861 intramolecular oxidoreductase activity interconverting aldoses and ketoses, 0005975 carbohydrate metabolic process. Probab=29.68 E-value=39 Score=14.23 Aligned_cols=138 Identities=19% Similarity=0.229 Sum_probs=94.5 Q ss_pred HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHC--------CCCEEEEECC------ Q ss_conf 89999999985992999817665689877878554467899899999999999855--------8935980012------ Q gi|254780558|r 209 LDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRG--------TGYFGVMNYR------ 274 (396) Q Consensus 209 ~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~--------p~~vGvnNhm------ 274 (396) =.+....|++.|||.+ .| -++-++ +=..||+|.-.+. ..-++||. | T Consensus 18 W~erl~lAK~~gFdFi-Em-----------------SiDEsd-~RLsRL~Ws~~~r~~L~~~~~~~g~~~pS-MCLSAHR 77 (290) T TIGR00542 18 WKERLALAKQCGFDFI-EM-----------------SIDESD-DRLSRLDWSKTERLALVNAIIETGVCIPS-MCLSAHR 77 (290) T ss_pred CHHHHHHHHHCCCCEE-EE-----------------ECCCCH-HHHHHHCCCHHHHHHHHHHHHHCCCCCCC-CCCHHHC T ss_conf 4789999986697758-85-----------------113643-33113112689999999999867871353-0011113 Q ss_pred ----CCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCE---EEEEEEECCC--CCHHHHHHHHHHHH Q ss_conf ----30000498999999999850692899749763226789898709946---9845886289--99999999999999 Q gi|254780558|r 275 ----GAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPY---MVADLYLDDQ--VDRDKIREKLKGLE 345 (396) Q Consensus 275 ----Gs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~---~~~dvfLD~~--~~~~~I~~qL~~l~ 345 (396) ||+ |..+.+.-++.+. +|-.+|+..||.. +.=||+-... .+...-..-|...+ T Consensus 78 RFPlGS~---d~~~r~~~~eIm~---------------KAi~La~~lGIR~IQLAgYDVYYe~~d~eT~~~F~~gl~~~v 139 (290) T TIGR00542 78 RFPLGSK---DKAVRQQGLEIME---------------KAIQLARDLGIRIIQLAGYDVYYEEHDEETRRRFIEGLKKAV 139 (290) T ss_pred CCCCCCC---CHHHHHHHHHHHH---------------HHHHHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 4989885---1789999999999---------------999999757960662233300335576045899998899999 Q ss_pred HHHHHCCEEEE---EEECCHHHHHHHHHHHHHHHHCCCEEEE Q ss_conf 99987194899---9808878899999997568876918996 Q gi|254780558|r 346 EIARTTGQAIG---VAVAFDESIEVISQWLQQEHVRDVSVVP 384 (396) Q Consensus 346 ~~Ar~~G~AI~---igh~~p~Ti~~L~~w~~~l~~~gi~lVp 384 (396) ++|.+.-.-|+ +=.|+-.||...++|...+..-.|.|-| T Consensus 140 elA~~~qVtlA~EiMDtpfm~sIs~~~~~~~~l~~pwf~lYP 181 (290) T TIGR00542 140 ELAAKAQVTLAVEIMDTPFMSSISRWLKWDKILNSPWFTLYP 181 (290) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECC T ss_conf 999875465346664133478999998899986088806537 No 116 >cd06307 PBP1_uncharacterized_sugar_binding Periplasmic sugar-binding domain of uncharacterized transport systems. Periplasmic sugar-binding domain of uncharacterized transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. The members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes. Probab=29.55 E-value=39 Score=14.21 Aligned_cols=158 Identities=12% Similarity=0.054 Sum_probs=74.1 Q ss_pred EEEEECC-CCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH----CCCCEEEEEC Q ss_conf 8987089-8768999999998599299981766568987787855446789989999999999985----5893598001 Q gi|254780558|r 199 TLAFASN-GNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRR----GTGYFGVMNY 273 (396) Q Consensus 199 T~A~~P~-~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~----~p~~vGvnNh 273 (396) -+.+.|. .+......+.++++|..|+.-.-.- +. +...-.++.+..+.-+...+.|.. -.+-+++..+ T Consensus 61 giii~~~d~~~~~~~i~~a~~~GIPVV~~~~~~----~~---~~~~~~Vg~Dn~~aG~~aa~~l~~~l~~~~g~i~ii~g 133 (275) T cd06307 61 GVALVAPDHPQVRAAVARLAAAGVPVVTLVSDL----PG---SPRAGYVGIDNRAAGRTAAWLIGRFLGRRPGKVAVLAG 133 (275) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCEEEEECCCC----CC---CCCCEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 899989981778999999998798499972888----88---77433991487999999999999960768977999907 Q ss_pred CCCCCCCCHHHHHHHHHHHHCC--CCEEEECCCCC--CCHHHHHHHHHC--CCEEEEEEEECCCCCHHHHHHHHHHHHHH Q ss_conf 2300004989999999998506--92899749763--226789898709--94698458862899999999999999999 Q gi|254780558|r 274 RGAMLLSNKESAEVIFKEFAKR--GLLFFDDGSSP--RNLTRVLAPKLN--LPYMVADLYLDDQVDRDKIREKLKGLEEI 347 (396) Q Consensus 274 mGs~~t~~~~~m~~vl~~l~~r--gL~flDs~Ts~--~Sva~~~A~~~g--vp~~~~dvfLD~~~~~~~I~~qL~~l~~~ 347 (396) .-+ ......-++-+.+.|+++ ++=.++..... ...+.+.+..+= -|-.. -+|--+.... -+. ++.+. T Consensus 134 ~~~-~~~~~~R~~Gf~~~l~~~~p~~~iv~~~~~~~~~~~a~~~~~~lL~~~Pdi~-ai~~~~~~~~-ga~----~Al~~ 206 (275) T cd06307 134 SHR-FRGHEEREMGFRSVLREEFPGLRVLETLEGLDDPARAYEATRKLLARHPDLV-GIYNAGGGNR-GVI----RALRE 206 (275) T ss_pred CCC-CHHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCCE-EEEECCCCHH-HHH----HHHHH T ss_conf 987-5329999999999998658887402466178865999999999997499962-9998487169-999----99997 Q ss_pred HHHCCEEEEEEE-CCHHHHHHHHH Q ss_conf 987194899980-88788999999 Q gi|254780558|r 348 ARTTGQAIGVAV-AFDESIEVISQ 370 (396) Q Consensus 348 Ar~~G~AI~igh-~~p~Ti~~L~~ 370 (396) +.+.|....||+ ..++++++|++ T Consensus 207 ~g~~~~v~vvg~D~~~~~~~~l~~ 230 (275) T cd06307 207 AGRAGKVVFVGHELTPETRAALRD 230 (275) T ss_pred CCCCCCEEEEECCCCHHHHHHHHC T ss_conf 399998699962799899999981 No 117 >TIGR00131 gal_kin galactokinase; InterPro: IPR000705 Galactokinase catalyses the first reaction in the galactose metabolism pathway, the ATP-dependent phosphorylation of galactose, yielding galactose-1-phosphate , . Deficiency in this enzyme results in the disease galactosemia, which is responsible for the formation of cataracts in newborn babies, and is possibly responsible for presenile cataracts in adults . In yeast, the GAL3 gene product is required for the GAL4-mediated induction of other enzymes involved in galactose metabolism. The induction of GAL1 production then reinforces this process, increasing the expression of other galactose-inducible genes. GAL3 has been shown to be similar to the GAL1 protein .; GO: 0004335 galactokinase activity, 0005524 ATP binding, 0006012 galactose metabolic process, 0046835 carbohydrate phosphorylation. Probab=29.16 E-value=34 Score=14.58 Aligned_cols=56 Identities=14% Similarity=0.206 Sum_probs=36.4 Q ss_pred HHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCC-CCEEEECCCCCCCHHHHHHH--HHCCCEEEEE Q ss_conf 998558935980012300004989999999998506-92899749763226789898--7099469845 Q gi|254780558|r 260 SLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKR-GLLFFDDGSSPRNLTRVLAP--KLNLPYMVAD 325 (396) Q Consensus 260 ~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~r-gL~flDs~Ts~~Sva~~~A~--~~gvp~~~~d 325 (396) .-.---+|||||| ..|+++...+++. ..++||-+-+..+...++-+ ..+|-+...+ T Consensus 175 ~q~aenhfvGvNc----------G~MDQ~~s~~G~~dHaL~~~c~P~L~~~~~~~P~~KNH~~a~VI~N 233 (500) T TIGR00131 175 GQKAENHFVGVNC----------GIMDQLISALGKEDHALLIDCRPSLKTTPFKLPQLKNHGVAVVIAN 233 (500) T ss_pred HHHHHHCEEECCC----------CHHHHHHHHCCCCCCEEEEECCCCCCCEEEECCCCCCCCEEEEEEE T ss_conf 5556445251047----------7378899652752533555237777844100786225645899961 No 118 >TIGR01428 HAD_type_II haloacid dehalogenase, type II; InterPro: IPR006328 These proteins catalyze the hydrolytic dehalogenation of small L-2-haloalkanoic acids to yield the corresponding D-2-hydroxyalkanoic acids . They belong to the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases, class (subfamily) I. Note that the Type I HAD enzymes have not yet been fully characterised, but clearly utilise a substantially different catalytic mechanism and are thus unlikely to be related.; GO: 0019120 hydrolase activity acting on acid halide bonds in C-halide compounds, 0008152 metabolic process. Probab=29.04 E-value=34 Score=14.56 Aligned_cols=98 Identities=15% Similarity=0.071 Sum_probs=39.7 Q ss_pred HHHHHHHHHCCCC-EEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC-------CCCCCCC Q ss_conf 9999999985992-9998176656898778785544678998999999999998558935980012-------3000049 Q gi|254780558|r 210 DRWMKEAKKKGQE-AILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYR-------GAMLLSN 281 (396) Q Consensus 210 ~~~~~~Ar~~G~E-vllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhm-------Gs~~t~~ 281 (396) ++..++-|++-.| .-++-.|.+...+..+ -.+--++.|+..+++.+=.++..++. ..++... T Consensus 32 ~~~~~lWR~~~l~y~~~~~~mg~~w~~y~d----------f~~~~~~al~~~~~~~~~~~~~~~~~~~~L~~~~~~L~p~ 101 (207) T TIGR01428 32 EALSQLWRQKQLEYSWLRTLMGQNWTPYKD----------FWDLTAEALRYLLGRLGLASTLDEAAADRLAEAYLRLPPH 101 (207) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCC----------HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCC T ss_conf 999999999888898887625888888888----------8999999999999755877678988899998776358998 Q ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHC Q ss_conf 8999999999850692899749763226789898709 Q gi|254780558|r 282 KESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLN 318 (396) Q Consensus 282 ~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~g 318 (396) +++.. .|+.|+++|+-.+-=--+....-..+.+..| T Consensus 102 pD~~~-gL~~L~~~G~~l~iLSNg~~~~~~~~~~~aG 137 (207) T TIGR01428 102 PDVPA-GLRALKERGLRLAILSNGSPAMLKSLVKHAG 137 (207) T ss_pred CCCHH-HHHHHHHCCCEEEEECCCCHHHHHHHHHHCC T ss_conf 66188-9999976432365214898678999999628 No 119 >PRK07850 acetyl-CoA acetyltransferase; Provisional Probab=28.44 E-value=41 Score=14.09 Aligned_cols=176 Identities=18% Similarity=0.129 Sum_probs=87.4 Q ss_pred CCCCHHHHHHHHHHCCC--CEEEECCCC--CCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCE--EEEECCCCCC Q ss_conf 98768999999998599--299981766--56898778785544678998999999999998558935--9800123000 Q gi|254780558|r 205 NGNSLDRWMKEAKKKGQ--EAILQIPMQ--AFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYF--GVMNYRGAML 278 (396) Q Consensus 205 ~~~~~~~~~~~Ar~~G~--EvllhlPME--p~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~v--GvnNhmGs~~ 278 (396) |+-.+++.+..|++.|+ +-++ |.+ ..+......|...+ .+ .++.+++.-...|++++-+. |+...=-|-- T Consensus 168 ~A~~Sh~rA~~A~~~G~f~~EIv--Pv~~~~~d~~~~~~~~~~~-~~-~De~~r~tt~e~La~LkPvf~~GtvTAgNss~ 243 (387) T PRK07850 168 FGLASQRRAAQAWAEGRFDREIS--PVQAPVLDEQGQPTGERRL-VT-RDQGLRDTTMEGLAGLKPVLEGGIHTAGTSSQ 243 (387) T ss_pred HHHHHHHHHHHHHHCCCCCCEEE--EEEEECCCCCCCCCCCEEE-EC-CCCCCCCCCHHHHHHCCCCCCCCEEECCCCCC T ss_conf 99999999999987498310476--6650012345775587168-63-78886667665675228989997665676675 Q ss_pred CCCHHHHHHHHHH--HHCCCC----EEEEC----------CCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHH Q ss_conf 0498999999999--850692----89974----------9763226789898709946984588628999999999999 Q gi|254780558|r 279 LSNKESAEVIFKE--FAKRGL----LFFDD----------GSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLK 342 (396) Q Consensus 279 t~~~~~m~~vl~~--l~~rgL----~flDs----------~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~ 342 (396) .+|-.+.-.|+++ .++.|| -++++ +..|---..++-++.|+..--.|+|==|+.........++ T Consensus 244 ~~DGAAavll~s~~~a~~~gl~p~a~i~~~a~~g~dP~~m~~gPv~A~~k~L~~agl~~~Did~~EinEAFAaq~l~~~~ 323 (387) T PRK07850 244 ISDGAAAVLWMDEDRARALGLRPRARIVAQALVGAEPYYHLDGPVQSTAKVLEKAGMKIGDIDLVEINEAFASVVLSWAQ 323 (387) T ss_pred CCCHHHHHHHHHHHHHHHCCCCEEEEEEEEEEEECCHHHCCCCHHHHHHHHHHHCCCCCCCCCEEEECHHHHHHHHHHHH T ss_conf 55406898883699999879982599996678874857707068999999999859983468767860577889999998 Q ss_pred HH---HHHHHHCCEEEEEEECCHHHHHHH-HHHHHHHHHCC--CEEEE Q ss_conf 99---999987194899980887889999-99975688769--18996 Q gi|254780558|r 343 GL---EEIARTTGQAIGVAVAFDESIEVI-SQWLQQEHVRD--VSVVP 384 (396) Q Consensus 343 ~l---~~~Ar~~G~AI~igh~~p~Ti~~L-~~w~~~l~~~g--i~lVp 384 (396) .+ ..+---+|-+|++|||+--|=.+| ...+.+|+.+| +-|+. T Consensus 324 ~l~~d~~kvN~~GGaiAlGHP~GasGaRi~~~l~~~L~~~~~~~Gl~t 371 (387) T PRK07850 324 VHEPDMDRVNVNGGAIALGHPVGSTGARLITTALHELERTDKSTALIT 371 (387) T ss_pred HHCCCHHHCCCCCCHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 739979897998648875888121379999999999987299999999 No 120 >COG0749 PolA DNA polymerase I - 3'-5' exonuclease and polymerase domains [DNA replication, recombination, and repair] Probab=28.30 E-value=19 Score=16.23 Aligned_cols=138 Identities=17% Similarity=0.118 Sum_probs=85.3 Q ss_pred ECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEE-EEE----CCC-- Q ss_conf 0898768999999998599299981766568987787855446789989999999999985589359-800----123-- Q gi|254780558|r 203 ASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFG-VMN----YRG-- 275 (396) Q Consensus 203 ~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vG-vnN----hmG-- 275 (396) .|+..+..-.-+..|+-..-+.. . . .+++.. .+-.+|+-|+..++++-++..- ..| |.. T Consensus 338 dPNLQNIPiRse~Gr~IR~aFva----------~--~-g~~~i~-aDYSQIELRilAHls~D~~Ll~AF~~g~DiH~~TA 403 (593) T COG0749 338 DPNLQNIPIRSEEGRKIRKAFVA----------E--K-GYTLIS-ADYSQIELRILAHLSQDEGLLRAFTEGEDIHTATA 403 (593) T ss_pred CCCCCCCCCCCHHHHHHHHCEEC----------C--C-CCEEEE-ECHHHHHHHHHHHHCCCHHHHHHHHCCCCHHHHHH T ss_conf 99755677677767866640557----------9--9-986887-01579999999986188999998865863778999 Q ss_pred CC-CCCCHHHHHH-HHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCC-CCHHHHHHHHHHHHHHHHHCC Q ss_conf 00-0049899999-99998506928997497632267898987099469845886289-999999999999999998719 Q gi|254780558|r 276 AM-LLSNKESAEV-IFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQ-VDRDKIREKLKGLEEIARTTG 352 (396) Q Consensus 276 s~-~t~~~~~m~~-vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~-~~~~~I~~qL~~l~~~Ar~~G 352 (396) ++ |--+..-+.. .=+..|.-|.++|.- --+..+|+.+|||...+.-|||.- .+...|..-+++-.+.||++| T Consensus 404 ~~vFgv~~~~Vt~e~Rr~AKaINFGiiYG-----~safgLa~~L~I~~~eA~~~I~~YF~rypgv~~ym~~~~~~ar~~G 478 (593) T COG0749 404 AEVFGVPIEEVTSEQRRKAKAINFGLIYG-----MSAFGLAQQLGIPRKEAKEYIDRYFERYPGVKEYMERTKEEAREDG 478 (593) T ss_pred HHHHCCCHHHCCHHHHHHHHHHCCCEEEC-----CCHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCC T ss_conf 99949982338999875655303541434-----5133479874988688899999999868689999999999999749 Q ss_pred EEEEEEE Q ss_conf 4899980 Q gi|254780558|r 353 QAIGVAV 359 (396) Q Consensus 353 ~AI~igh 359 (396) ++--+-+ T Consensus 479 yV~Tl~g 485 (593) T COG0749 479 YVETLFG 485 (593) T ss_pred CEEECCC T ss_conf 3541026 No 121 >TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase; InterPro: IPR014235 Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores, in the peptidoglycan wall or spore cortex. This entry represents a subset of the larger polysaccharide deacetylase family that is specifically involved in delta-lactam biosynthesis. PdaA from Bacillus subtilis acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the subsequent transpeptidation that leads to lactam ring formation, as heterologous expression in Escherichia coli of CwlD and PdaA together is sufficient for delta-lactam production.. Probab=27.82 E-value=42 Score=14.02 Aligned_cols=170 Identities=14% Similarity=0.101 Sum_probs=107.1 Q ss_pred CCCCCCHHHHHHHHHC--CCCCEEEEE---CCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCC-CHHH Q ss_conf 7787716889999837--935089870---898768999999998599299981766568987787855446789-9899 Q gi|254780558|r 179 GLGISQTGTQRAINLL--PANITLAFA---SNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQ-TVQQ 252 (396) Q Consensus 179 dlG~~~~~~~~ai~~L--P~~vT~A~~---P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~-~~~~ 252 (396) |=||=+-=|.+ |++. --.|+=||. ||-....+++.|=.+.|| ++= |... .+.=++.+ +.+. T Consensus 44 DnGYEnGYT~k-ILDVLKe~~V~AaFFVTghY~K~~pdLvKRM~~EGH-ivG-------NHS~----hHPsl~~~~~~ek 110 (225) T TIGR02884 44 DNGYENGYTPK-ILDVLKENKVPAAFFVTGHYIKTQPDLVKRMVDEGH-IVG-------NHSV----HHPSLTTVLDDEK 110 (225) T ss_pred CCCCCCCCCHH-HHHHHHCCCCCEEEEEECCCCCCCHHHHHHHHHCCC-EEC-------CCCC----CCCCHHHHCCHHH T ss_conf 35533577033-423320378831146407731476676665554583-432-------5655----7755555134889 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHC-----CCEEEEEEE Q ss_conf 999999999855893598001230000498999999999850692899749763226789898709-----946984588 Q gi|254780558|r 253 LLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLN-----LPYMVADLY 327 (396) Q Consensus 253 ~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~g-----vp~~~~dvf 327 (396) ++.-|...-..+-..+|=..- +| +||=|.+-.+ +|+ ...+++| .-+|..|=- T Consensus 111 ~~~El~~V~e~~~~vTG~~~~---~Y------lRPPrG~fSE------------~tL--~~t~~LGY~~vFWSlAy~DW~ 167 (225) T TIGR02884 111 FKEELDGVEEEFKEVTGKKEM---KY------LRPPRGKFSE------------RTL--AYTKELGYKTVFWSLAYKDWE 167 (225) T ss_pred HHHHHHHHHHHHHHCCCCCCC---CE------ECCCCCCCHH------------HHH--HHHHHCCCCEEEEEEEECCCC T ss_conf 999988899887640576277---00------0787875405------------789--999974882110012202677 Q ss_pred ECCCCCHHHHHHHHHHHHHHHHHCCEEEEEEECC-HHHHHHHHHHHHHHHHCCCEEEEHHHHH Q ss_conf 6289999999999999999998719489998088-7889999999756887691899677743 Q gi|254780558|r 328 LDDQVDRDKIREKLKGLEEIARTTGQAIGVAVAF-DESIEVISQWLQQEHVRDVSVVPLSCLA 389 (396) Q Consensus 328 LD~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~-p~Ti~~L~~w~~~l~~~gi~lVpvS~l~ 389 (396) .|.+....+=..|.-+ |-+-=||..-|+- ...-++|.+.+.+|+.+|+++-.+-.|. T Consensus 168 ~D~Q~G~~~A~~~im~-----~~HpGai~LLHAVS~dNA~aLd~~i~~lk~~GY~FkSLDdL~ 225 (225) T TIGR02884 168 VDKQKGKEYAYKQIMK-----KIHPGAILLLHAVSKDNAEALDEIIKDLKEQGYTFKSLDDLM 225 (225) T ss_pred CCCCCCHHHHHHHHHH-----HCCCHHHHHHHCCCHHHHHHHHHHHHHHHHCCCEECCHHCCC T ss_conf 6987787778877754-----157024133231474589999999999986597503331159 No 122 >PRK12569 hypothetical protein; Provisional Probab=27.72 E-value=42 Score=14.01 Aligned_cols=185 Identities=18% Similarity=0.205 Sum_probs=110.9 Q ss_pred CCCCCCCH----H--HHHHHHHC--CCCCEEEEECCCCC-HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCC Q ss_conf 37787716----8--89999837--93508987089876-8999999998599299981766568987787855446789 Q gi|254780558|r 178 SGLGISQT----G--TQRAINLL--PANITLAFASNGNS-LDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQ 248 (396) Q Consensus 178 ddlG~~~~----~--~~~ai~~L--P~~vT~A~~P~~~~-~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~ 248 (396) -|||.+.. + ..++|+.+ -..|.+.|--..+. -.+.++.|.+.|-.|=-|- +||+.+ |=+=-...+ T Consensus 9 ~DlGE~~g~~~~g~~~D~~lmp~I~saNIACG~HaGD~~~m~~tv~lA~~~~V~IGAHP-----sypD~~-gFGRr~~~~ 82 (245) T PRK12569 9 SDMGEGFGPWTIGDGVDEELMPLISSANIATGFHAGDPNIMRRTVELAKAHGVGIGAHP-----GFRDLV-GFGRRHIAA 82 (245) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCEEECCC-----CCCCCC-CCCCCCCCC T ss_conf 30688878867788566999867525888616357789999999999998599881889-----999767-899898889 Q ss_pred CHHHHHHHHHHHHHHCCC---CEEEE-ECC---CC---CCCCCHHHHHHHHHHHHCCC--CEEEECCCCCCCHHHHHHHH Q ss_conf 989999999999985589---35980-012---30---00049899999999985069--28997497632267898987 Q gi|254780558|r 249 TVQQLLNRLRYSLRRGTG---YFGVM-NYR---GA---MLLSNKESAEVIFKEFAKRG--LLFFDDGSSPRNLTRVLAPK 316 (396) Q Consensus 249 ~~~~~~~~l~~~l~~~p~---~vGvn-Nhm---Gs---~~t~~~~~m~~vl~~l~~rg--L~flDs~Ts~~Sva~~~A~~ 316 (396) +++++.+.+.+-++.+-. ..|+. +|. |. ....|+...+.+++.++... |.++ ..+.|...++|++ T Consensus 83 s~~el~~~v~~Qi~al~~~a~~~g~~l~hVKPHGALYn~~~~d~~la~~i~~ai~~~~~~l~l~---~~~~s~~~~~A~~ 159 (245) T PRK12569 83 SAQELVNDILYQLGALREFARAQGVRLQHVKPHGALYMHAARDEALARLLVEALQRLDPGLPLY---CMEASVTYRIARE 159 (245) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEECCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCEEE---ECCCHHHHHHHHH T ss_conf 9899999999999999999998399301114219999998569999999999999849884388---5286399999998 Q ss_pred HCCCEEEEEEEECCCC---------------CHHHHHHHHHHHHHHHHHCCEE--------------EEEEECCHHHHHH Q ss_conf 0994698458862899---------------9999999999999999871948--------------9998088788999 Q gi|254780558|r 317 LNLPYMVADLYLDDQV---------------DRDKIREKLKGLEEIARTTGQA--------------IGVAVAFDESIEV 367 (396) Q Consensus 317 ~gvp~~~~dvfLD~~~---------------~~~~I~~qL~~l~~~Ar~~G~A--------------I~igh~~p~Ti~~ 367 (396) .|+++.. ..|.|-.- |++.+. +++.++++. |.. |-+-.-.|..++. T Consensus 160 ~Gl~~~~-E~FADR~Y~~dG~Lv~R~~~gAvd~~~~~---~q~~~~~~~-g~V~ti~G~~i~i~adTiCvHGDtp~Av~~ 234 (245) T PRK12569 160 LGQPVVR-EFYADRDYDRSGSIVFTRRVGALDPQQVA---AKVLRACRE-GKVRTVDGEDLDIDFDSICLHSDTPGALAL 234 (245) T ss_pred CCCCEEE-EEEECCCCCCCCCEECCCCCCCCCHHHHH---HHHHHHHHC-CCEEECCCCEEECCCCEEEECCCCHHHHHH T ss_conf 6997468-99765654899988418999999999999---999999977-987905898874378889989999789999 Q ss_pred HHHHHHHHH Q ss_conf 999975688 Q gi|254780558|r 368 ISQWLQQEH 376 (396) Q Consensus 368 L~~w~~~l~ 376 (396) ++..-..|+ T Consensus 235 a~~iR~~L~ 243 (245) T PRK12569 235 ARATRAALE 243 (245) T ss_pred HHHHHHHHH T ss_conf 999999986 No 123 >PRK05093 argD bifunctional N-succinyldiaminopimelate-aminotransferase/acetylornithine transaminase protein; Reviewed Probab=27.69 E-value=42 Score=14.01 Aligned_cols=13 Identities=23% Similarity=0.220 Sum_probs=5.6 Q ss_pred CCCCCCCHHHHHH Q ss_conf 3000049899999 Q gi|254780558|r 275 GAMLLSNKESAEV 287 (396) Q Consensus 275 Gs~~t~~~~~m~~ 287 (396) |+-|..++-+|.. T Consensus 280 g~T~~g~pla~aa 292 (403) T PRK05093 280 GSTYGGNPLACAV 292 (403) T ss_pred CCCCCCCHHHHHH T ss_conf 6589858999999 No 124 >pfam09989 DUF2229 CoA enzyme activase uncharacterized domain (DUF2229). Members of this family include various bacterial hypothetical proteins, as well as CoA enzyme activases. The exact function of this domain has not, as yet, been defined. Probab=27.57 E-value=42 Score=14.00 Aligned_cols=53 Identities=8% Similarity=0.038 Sum_probs=31.5 Q ss_pred HHHHHHHHHHHHHHHHCC-E-EEEEEECCHHHHH-HHHHHHHHHHHCCCEEEEHHH Q ss_conf 999999999999998719-4-8999808878899-999997568876918996777 Q gi|254780558|r 335 DKIREKLKGLEEIARTTG-Q-AIGVAVAFDESIE-VISQWLQQEHVRDVSVVPLSC 387 (396) Q Consensus 335 ~~I~~qL~~l~~~Ar~~G-~-AI~igh~~p~Ti~-~L~~w~~~l~~~gi~lVpvS~ 387 (396) ..++++..+....+++++ . .+.+||+|-.-=. .=......|.+.|+.+++... T Consensus 162 ~~l~~~~~~~l~~~~~~~~~~I~liGrpY~i~Dp~iN~~I~~~L~~~Gv~Vit~d~ 217 (218) T pfam09989 162 KDLRKKGEEALAYLEEEGKKGIVLLGRPYHIYDPEINHGIPEKLTSLGIAVLTEDS 217 (218) T ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCCCHHHHHHCCCEECCCCC T ss_conf 99999889999988865994699971886778842257629999878595617434 No 125 >cd03320 OSBS o-Succinylbenzoate synthase (OSBS) catalyzes the conversion of 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate (SHCHC) to 4-(2'-carboxyphenyl)-4-oxobutyrate (o-succinylbenzoate or OSB), a reaction in the menaquinone biosynthetic pathway. Menaquinone is an essential cofactor for anaerobic growth in eubacteria and some archaea. OSBS belongs to the enolase superfamily of enzymes, characterized by the presence of an enolate anion intermediate which is generated by abstraction of the alpha-proton of the carboxylate substrate by an active site residue and is stabilized by coordination to the essential Mg2+ ion. Probab=27.57 E-value=42 Score=14.00 Aligned_cols=110 Identities=18% Similarity=0.170 Sum_probs=63.2 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEE Q ss_conf 98999999999998558935980012300004989999999998506928997497632267898987099469845886 Q gi|254780558|r 249 TVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYL 328 (396) Q Consensus 249 ~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfL 328 (396) +.++-..+++.....++.-+-+.=---..+|.+.. ..+++.|...|+.|+-.-..+...........++|.+- T Consensus 109 ~~~~Di~~i~~ir~~~~~~~~l~vDaN~~wt~~~A--~~~~~~l~~~~l~~iEqP~~~~d~~~~~~~~~~ipi~~----- 181 (263) T cd03320 109 SFEEDLARLRALREALPADAKLRLDANGGWSLEEA--LAFLEALAAGRIEYIEQPLPPDDLAELRRLAAGVPIAL----- 181 (263) T ss_pred CHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHH--HHHHHHHCCCCEEEEECCCCCCCHHHHHHHCCCCCEEC----- T ss_conf 95999999999999669997799989877489999--99998715168358866789999999986037998543----- Q ss_pred CCC-CCHHHH---------------------HHHHHHHHHHHHHCCEEEEEEECCHHHH Q ss_conf 289-999999---------------------9999999999987194899980887889 Q gi|254780558|r 329 DDQ-VDRDKI---------------------REKLKGLEEIARTTGQAIGVAVAFDESI 365 (396) Q Consensus 329 D~~-~~~~~I---------------------~~qL~~l~~~Ar~~G~AI~igh~~p~Ti 365 (396) |.. .+.... ..+..+..++|+.+|.-++|++.+.-.| T Consensus 182 DEs~~~~~d~~~~~~~~a~d~i~iK~~~~GGi~~~~~i~~~a~~~gi~~~i~~~~Es~I 240 (263) T cd03320 182 DESLRRLDDPLALAAAGALGALVLKPALLGGPRALLELAEEARARGIPAVVSSALESSI 240 (263) T ss_pred CCCCCCHHHHHHHHHCCCCCEEEECCHHCCCHHHHHHHHHHHHHCCCCEEEECCCCHHH T ss_conf 56447878899999809998899672010899999999999998599699948541599 No 126 >pfam10686 DUF2493 Protein of unknown function (DUF2493). Members of this family are all Proteobacteria. The function is not known. Probab=27.52 E-value=42 Score=13.99 Aligned_cols=50 Identities=10% Similarity=0.228 Sum_probs=28.4 Q ss_pred CCCCCCCCHHHHHHHHHHHHCC--CCEEEECCC--CCCCHHHHHHHHHCCCEEE Q ss_conf 2300004989999999998506--928997497--6322678989870994698 Q gi|254780558|r 274 RGAMLLSNKESAEVIFKEFAKR--GLLFFDDGS--SPRNLTRVLAPKLNLPYMV 323 (396) Q Consensus 274 mGs~~t~~~~~m~~vl~~l~~r--gL~flDs~T--s~~Sva~~~A~~~gvp~~~ 323 (396) .|+|-..|...+...|..+..+ .+.++..+. .....|..-|++.||+... T Consensus 9 ~Ggrd~~D~~~i~~~Ld~~~~~~pd~vlihGG~~kGad~lA~~WA~~~gv~~i~ 62 (71) T pfam10686 9 SGGRDFNDHRLIWDALDKVHARHPDMVLLHGGAPKGAERIAARWARRRGVPQVA 62 (71) T ss_pred EECCCCCHHHHHHHHHHHHHHHCCCEEEEECCCCCCHHHHHHHHHHHCCCCEEE T ss_conf 808986509999999999998689879997798633799999999986997697 No 127 >PRK08555 consensus Probab=27.27 E-value=43 Score=13.96 Aligned_cols=15 Identities=33% Similarity=0.244 Sum_probs=7.7 Q ss_pred CCHHHHHHHHHHCCC Q ss_conf 768999999998599 Q gi|254780558|r 207 NSLDRWMKEAKKKGQ 221 (396) Q Consensus 207 ~~~~~~~~~Ar~~G~ 221 (396) .+.++..+..|+.|- T Consensus 243 ~yl~~l~~lc~~~gi 257 (443) T PRK08555 243 NFFKELKKLADKYGI 257 (443) T ss_pred HHHHHHHHHHHHCCC T ss_conf 887899999997494 No 128 >pfam02401 LYTB LytB protein. The mevalonate-independent 2-C-methyl-D-erythritol 4-phosphate (MEP) pathway for isoprenoid biosynthesis is essential in many eubacteria, plants, and the malaria parasite. The LytB gene is involved in the trunk line of the MEP pathway. Probab=26.88 E-value=43 Score=13.92 Aligned_cols=145 Identities=15% Similarity=0.135 Sum_probs=78.4 Q ss_pred EEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEE--------------CCCCCC-------- Q ss_conf 9973778771688999983793508987089876899999999859929998--------------176656-------- Q gi|254780558|r 175 IVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQ--------------IPMQAF-------- 232 (396) Q Consensus 175 IVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~Evllh--------------lPMEp~-------- 232 (396) +||---|.+...- +.+.+....|-=|--|+-...+..+++..+.|..+++. .|-... T Consensus 67 ViirAHGv~~~~~-~~a~~~gl~viDaTCP~V~kvh~~v~~~~~~Gy~iiiiG~~~HpEv~gi~g~~~~~~~vv~~~~d~ 145 (280) T pfam02401 67 VIFSAHGVSPEVR-EEAKERGLQVIDATCPLVTKVHNAVERMAKKGYHVILIGHKNHPEVEGTVGYAPEQVFVVENAEDV 145 (280) T ss_pred EEEECCCCCHHHH-HHHHHCCCEEEECCCCCHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHCCCCCEECCCHHHH T ss_conf 9993999998999-999974992993708015799999999986898799983798788886998757872431788788 Q ss_pred -CCCCCCCCCCC--CCCCCCHHHHHHHHHHHHHHCCCCEE-EEECCCCCCCCCHHHHHHHHHHHHCCC-CEEEECCCCCC Q ss_conf -89877878554--46789989999999999985589359-800123000049899999999985069-28997497632 Q gi|254780558|r 233 -DESYNEDDSYT--LKVTQTVQQLLNRLRYSLRRGTGYFG-VMNYRGAMLLSNKESAEVIFKEFAKRG-LLFFDDGSSPR 307 (396) Q Consensus 233 -~~~~~~pGp~~--L~~~~~~~~~~~~l~~~l~~~p~~vG-vnNhmGs~~t~~~~~m~~vl~~l~~rg-L~flDs~Ts~~ 307 (396) ..|..+..+-. =-|+|+.++....+.....++|+..+ +.|-. -..|+++. +.+.+-.++-. ++++-+..|+| T Consensus 146 ~~l~~~~~~k~~vvsQTT~~~~~~~~i~~~l~~~~~~~~~~v~ntI-C~aT~~RQ--~a~~~lA~~vD~miVVGg~nSsN 222 (280) T pfam02401 146 AALPLQAETKLAVVTQTTLSLDDTAEIVAALRERYPEVRGPPFNTI-CYATQNRQ--EAVRELAPEVDVVLVVGGKNSSN 222 (280) T ss_pred HHCCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHCHHCCCCCCCCC-CHHHHHHH--HHHHHHHHHCCEEEEECCCCCCC T ss_conf 4278888662699955655399999999999986621138885772-27588899--99997775498999957999954 Q ss_pred CH-HHHHHHHHCCCEEE Q ss_conf 26-78989870994698 Q gi|254780558|r 308 NL-TRVLAPKLNLPYMV 323 (396) Q Consensus 308 Sv-a~~~A~~~gvp~~~ 323 (396) |. -.++|++.|.++.. T Consensus 223 T~~L~eia~~~~~~t~~ 239 (280) T pfam02401 223 SNRLAEIAKEHGVPAYL 239 (280) T ss_pred HHHHHHHHHHHCCCEEE T ss_conf 89999999987999799 No 129 >PRK13413 mpi multiple promoter invertase; Provisional Probab=26.69 E-value=44 Score=13.90 Aligned_cols=124 Identities=11% Similarity=0.063 Sum_probs=60.5 Q ss_pred HHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEEC--CCCC-CCHHHHHH-HHHCCCEEEEEEEECCCC Q ss_conf 9999985589359800123000049899999999985069289974--9763-22678989-870994698458862899 Q gi|254780558|r 257 LRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDD--GSSP-RNLTRVLA-PKLNLPYMVADLYLDDQV 332 (396) Q Consensus 257 l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs--~Ts~-~Sva~~~A-~~~gvp~~~~dvfLD~~~ 332 (396) |...|+.+-.---|.=+-=+|+.-+..-+-.++..|.++|..++.- .... ......+- .-+++ =.+. T Consensus 52 l~~ll~~~~~gD~lvV~~LDRLgRs~~d~l~~l~~l~~~gv~l~~~~~~~~~~~~~~~~~~~~~l~~---------~Ae~ 122 (200) T PRK13413 52 LGKLLKRVRKGDILIVSELSRLGRNLFEIMEILNICMTKEVIVWTIKEGYELGDDIQSKVLAFAFGL---------SAEI 122 (200) T ss_pred HHHHHHHCCCCCEEEEEECCHHHCCHHHHHHHHHHHHHCCCEEEECCCCCCCCCCHHHHHHHHHHHH---------HHHH T ss_conf 9999986659998999767374288999999999999689789986678655884789999999999---------9999 Q ss_pred CHHHHHHHHHHHHHHHHHCCEEEEEEECC---H--HHHHHHHHHHHHHHHCCCEEEEHHHHHHC Q ss_conf 99999999999999998719489998088---7--88999999975688769189967774306 Q gi|254780558|r 333 DRDKIREKLKGLEEIARTTGQAIGVAVAF---D--ESIEVISQWLQQEHVRDVSVVPLSCLAKL 391 (396) Q Consensus 333 ~~~~I~~qL~~l~~~Ar~~G~AI~igh~~---p--~Ti~~L~~w~~~l~~~gi~lVpvS~l~~~ 391 (396) ..+.|..+-.+....|+++|..+ |.|. | ..+..-..++..+..+|....-+...+.+ T Consensus 123 Er~~i~eRt~~g~~~ak~~G~~~--GRpk~~~~~~~~~~~~~~~i~~~~~~g~s~~~iak~~gv 184 (200) T PRK13413 123 ERNLISQRTKEALARKKAEGAKL--GRPKGDRPEKYKLHGKEEVITKELKKGTTKSEIAKKLKV 184 (200) T ss_pred HHHHHHHHHHHHHHHHHHCCCCC--CCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHCC T ss_conf 99999999999999999748656--888889864112479999999999888999999999892 No 130 >PRK07323 consensus Probab=26.64 E-value=36 Score=14.40 Aligned_cols=12 Identities=0% Similarity=-0.014 Sum_probs=4.6 Q ss_pred HHHHCCCEEEEH Q ss_conf 688769189967 Q gi|254780558|r 374 QEHVRDVSVVPL 385 (396) Q Consensus 374 ~l~~~gi~lVpv 385 (396) .+..+|+-+.+. T Consensus 400 ~~~~~Gll~~~~ 411 (443) T PRK07323 400 LAFEKGLLLLGC 411 (443) T ss_pred HHHHCCEEEEEC T ss_conf 999799799836 No 131 >cd01121 Sms Sms (bacterial radA) DNA repair protein. This protein is not related to archael radA any more than is to other RecA-like NTPases. Sms has a role in recombination and recombinational repair and is responsible for the stabilization or processing of branched DNA molecules. Probab=26.32 E-value=44 Score=13.86 Aligned_cols=88 Identities=19% Similarity=0.179 Sum_probs=57.5 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCC--CCEEEECCCCCCCHHHHHHHHHCCCEEEEEE Q ss_conf 98999999999998558935980012300004989999999998506--9289974976322678989870994698458 Q gi|254780558|r 249 TVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKR--GLLFFDDGSSPRNLTRVLAPKLNLPYMVADL 326 (396) Q Consensus 249 ~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~r--gL~flDs~Ts~~Sva~~~A~~~gvp~~~~dv 326 (396) +.+|+.-|-+. | |+++ ..-+...+.-++.+++.+.+. .+..+||-.|-.|- T Consensus 121 S~~QIk~RA~R-L-------g~~~--~~l~l~set~le~Il~~i~~~kP~~lIIDSIQT~~~~----------------- 173 (372) T cd01121 121 SPEQIKLRADR-L-------GIST--ENLYLLAETNLEDILASIEELKPDLVIIDSIQTVYSS----------------- 173 (372) T ss_pred HHHHHHHHHHH-H-------CCCC--CCCEEEECCCHHHHHHHHHHHCCCEEEEECHHHCCCC----------------- T ss_conf 78999989998-5-------8788--7727884356999999999719988999562202037----------------- Q ss_pred EECCC-CCHHHHHHHHHHHHHHHHHCCEEEE-EEECCHH Q ss_conf 86289-9999999999999999987194899-9808878 Q gi|254780558|r 327 YLDDQ-VDRDKIREKLKGLEEIARTTGQAIG-VAVAFDE 363 (396) Q Consensus 327 fLD~~-~~~~~I~~qL~~l~~~Ar~~G~AI~-igh~~p~ 363 (396) -+|.. =+..-++.-=.+|.++||+++.++. |||...+ T Consensus 174 ~~~s~pGsvsQVReca~~L~~~AK~~~i~~~lVGHVTK~ 212 (372) T cd01121 174 ELTSAPGSVSQVRECTAELMRFAKERNIPIFIVGHVTKE 212 (372) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEEEECCC T ss_conf 767799878999999999999998619739999876268 No 132 >pfam11963 DUF3477 Protein of unknown function (DUF3477). This family of proteins is functionally uncharacterized. This protein is found in viruses. Proteins in this family are typically between 246 to 7162 amino acids in length. This protein is found associated with pfam08716, pfam01661, pfam05409, pfam08717, pfam01831, pfam08715, pfam08710. Probab=26.23 E-value=45 Score=13.84 Aligned_cols=19 Identities=11% Similarity=0.462 Sum_probs=10.2 Q ss_pred CCCCCEEEEECCCCCHHHHHH Q ss_conf 793508987089876899999 Q gi|254780558|r 194 LPANITLAFASNGNSLDRWMK 214 (396) Q Consensus 194 LP~~vT~A~~P~~~~~~~~~~ 214 (396) +..-|-++|+| .++++|.+ T Consensus 168 iGW~VP~~~~p--~ya~~w~q 186 (355) T pfam11963 168 IGWFVPVTFMP--EYAKKWLQ 186 (355) T ss_pred EEEEEEECCCH--HHHHHHHC T ss_conf 78986411263--88875420 No 133 >TIGR00099 Cof-subfamily Cof-like hydrolase; InterPro: IPR000150 The Haloacid Dehydrogenase (HAD) superfamily includes phosphatases, phosphonatases, P-type ATPases, beta-phosphoglucomutases, phosphomannomutases, and dehalogenases, which are involved in a variety of cellular processes ranging from amino acid biosynthesis to detoxification . Proteins in this entry are mostly uncharacterised, though they form a distinct subgroup within the HAD superfamily. Members are found almost exclusively in bacteria and many species contain several paralogs, for example Escherichia coli contains a total of six proteins from this entry. Sequence similarities suggest that these enzymes are phosphatases which work on phosphorylated sugars. The YbiV (P75792 from SWISSPROT) protein from E. coli has been experimentally characterised . This enzyme catalyzes the hydrolysis of sugar phosphate to sugar and inorganic phosphate. It has a wide substrate specificity, catalyzing the hydrolysis of ribose-5-phosphate and glucose-6-phosphate most efficiently, but it is not known if these are the real substrates in vivo. The protein appears to be a monomer that contains two domains, an alpha-beta hydrolase domain that forms a Rossman fold, and an alpha-beta domain. The active site is found in a negatively charged cavity found at the interface between the two domains. ; GO: 0016787 hydrolase activity, 0008152 metabolic process. Probab=26.20 E-value=45 Score=13.84 Aligned_cols=71 Identities=18% Similarity=0.198 Sum_probs=47.3 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCC---HHHHHHHHHHHHHHCC-CCEEEEECC----C--- Q ss_conf 7689999999985992999817665689877878554467899---8999999999998558-935980012----3--- Q gi|254780558|r 207 NSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQT---VQQLLNRLRYSLRRGT-GYFGVMNYR----G--- 275 (396) Q Consensus 207 ~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~---~~~~~~~l~~~l~~~p-~~vGvnNhm----G--- 275 (396) +...++.+++|++||+|+| .||-+ -......+.+.+.--. .+++.||-+ + T Consensus 20 ~~~~~al~~l~~~G~~~~l-------------------aTGR~~~g~~~~~~~~~~~~~~~~~~~i~~nGg~~~~~~~~~ 80 (270) T TIGR00099 20 PSTKEALAKLREKGIKVVL-------------------ATGRPLAGVYKELKEILKELGLDTDPYISANGGAVIDEGGGN 80 (270) T ss_pred HHHHHHHHHHHHCCCEEEE-------------------EECCCCCCHHHHHHHHHHHHCCCCCEEEEECCCEEEEECCCC T ss_conf 8899999999966988999-------------------818864347899999888506785568761784899846870 Q ss_pred ---CCCCCCHHHHHHHHHHHHCCC Q ss_conf ---000049899999999985069 Q gi|254780558|r 276 ---AMLLSNKESAEVIFKEFAKRG 296 (396) Q Consensus 276 ---s~~t~~~~~m~~vl~~l~~rg 296 (396) ...+=+...++.+++.+++.+ T Consensus 81 ~~~~~~~l~~~~~~~~~~~~~~~~ 104 (270) T TIGR00099 81 EILYKKPLDLDLVEELLNFLKKHG 104 (270) T ss_pred EEEEECCCCHHHHHHHHHHHHHCC T ss_conf 056302478889999999997348 No 134 >TIGR00537 hemK_rel_arch methylase, putative; InterPro: IPR004557 These proteins exhibit homology to the Saccharomyces cerevisiae (yeast) ORF YDR140w (FYV9). No known function has been described for this yeast ORF though the null mutant is viable and exhibits K1 killer toxin hypersensitivity. Members of this entry are weakly related to the yeast ORF YNL063w which is a putative S-adenosyl-methionine-dependent methyltransferase . Members of this entry are found in archaeal, bacterial and eukaryotic lineages.; GO: 0008168 methyltransferase activity. Probab=26.17 E-value=45 Score=13.84 Aligned_cols=54 Identities=19% Similarity=0.251 Sum_probs=39.3 Q ss_pred EEEECC-CCCHHHHHHHHHHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEE Q ss_conf 588628-9999999999999999998719489998088788999999975688769189 Q gi|254780558|r 325 DLYLDD-QVDRDKIREKLKGLEEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSV 382 (396) Q Consensus 325 dvfLD~-~~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~l 382 (396) |+-+|. ...+.-|+|.|+++-.+=++.|.++.|.+-...+=++|. .|+.+|+.. T Consensus 110 d~A~dGGkdGr~vidrFldelp~~lk~gGrv~l~~SSl~~e~~~~~----kl~~~GF~~ 164 (183) T TIGR00537 110 DLAIDGGKDGRKVIDRFLDELPEYLKEGGRVQLIQSSLSDEKDTLD----KLDELGFKV 164 (183) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHH----HHHHCCCCE T ss_conf 4333178730578888876568887059989999606688688998----876158847 No 135 >pfam01261 AP_endonuc_2 Xylose isomerase-like TIM barrel. This TIM alpha/beta barrel structure is found in xylose isomerase and in endonuclease IV (EC:3.1.21.2). This domain is also found in the N termini of bacterial myo-inositol catabolism proteins. These are involved in the myo-inositol catabolism pathway, and is required for growth on myo-inositol in Rhizobium leguminosarum bv. viciae. Probab=26.09 E-value=45 Score=13.83 Aligned_cols=75 Identities=17% Similarity=0.151 Sum_probs=45.6 Q ss_pred HHHHHHHHCCCEEEEEEEEC--C---CCCHHHHHHHHHHHHHHHHHCCEEEEEE-ECCH----HHHHHHHHHHHHHHHCC Q ss_conf 78989870994698458862--8---9999999999999999998719489998-0887----88999999975688769 Q gi|254780558|r 310 TRVLAPKLNLPYMVADLYLD--D---QVDRDKIREKLKGLEEIARTTGQAIGVA-VAFD----ESIEVISQWLQQEHVRD 379 (396) Q Consensus 310 a~~~A~~~gvp~~~~dvfLD--~---~~~~~~I~~qL~~l~~~Ar~~G~AI~ig-h~~p----~Ti~~L~~w~~~l~~~g 379 (396) .-.+|+.+|.++......-. + +...+.+.+.|.++...|+++|.-|++= |++. .|++-+...+..+...+ T Consensus 69 ~l~~a~~lG~~~i~~~~g~~~~~~~~~~~~~~~~~~l~~~~~~a~~~gi~i~iE~~~~~~~~~~~~~~~~~l~~~v~~~~ 148 (201) T pfam01261 69 AIELAAALGAKVVVVHPGGALPGEDREEALDRLAESLNELAELAEEYGVKLALENHPGTGVELGYFEEALRLIEEVDSPN 148 (201) T ss_pred HHHHHHHCCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCEEEEEEECCCCCCCCCCHHHHHHHHHHCCCCC T ss_conf 99999973995899826887889999999999999999999988755738999987998867899999999998649986 Q ss_pred CEEEE Q ss_conf 18996 Q gi|254780558|r 380 VSVVP 384 (396) Q Consensus 380 i~lVp 384 (396) +.++. T Consensus 149 ~~~~~ 153 (201) T pfam01261 149 VGLCL 153 (201) T ss_pred CCCCC T ss_conf 55110 No 136 >PRK00349 uvrA excinuclease ABC subunit A; Reviewed Probab=25.92 E-value=45 Score=13.81 Aligned_cols=107 Identities=21% Similarity=0.206 Sum_probs=53.3 Q ss_pred HHHHHHHHHHHCC-------CCEEEECCCCCCCHHH------HHHHHHCCCEEEEEEEECCC---CCHHHHHHHHHHHHH Q ss_conf 9999999998506-------9289974976322678------98987099469845886289---999999999999999 Q gi|254780558|r 283 ESAEVIFKEFAKR-------GLLFFDDGSSPRNLTR------VLAPKLNLPYMVADLYLDDQ---VDRDKIREKLKGLEE 346 (396) Q Consensus 283 ~~m~~vl~~l~~r-------gL~flDs~Ts~~Sva~------~~A~~~gvp~~~~dvfLD~~---~~~~~I~~qL~~l~~ 346 (396) ..-+.+++++.+| ||.||.=.-..+|.-+ .+|...|-.-.-.--+||.- ..+..+.+=+.-+.+ T Consensus 457 ~ia~~il~eI~~RL~fL~~VGLgYLtLdR~~~TLSGGE~QRirLa~qLgS~L~gvlYVLDEPSiGLHprD~~rLi~~L~~ 536 (944) T PRK00349 457 KIAELILKEIRERLKFLVDVGLDYLTLSRSAGTLSGGEAQRIRLATQIGSGLTGVLYVLDEPSIGLHQRDNDRLIETLKR 536 (944) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHH T ss_conf 99999999999999999974962477304567767148899999998656664649993687554798899999999999 Q ss_pred HHHHCCEEEEEEECCHHHHH---HHHHHHHHHHHCCCEEE---EHHHHHH Q ss_conf 99871948999808878899---99999756887691899---6777430 Q gi|254780558|r 347 IARTTGQAIGVAVAFDESIE---VISQWLQQEHVRDVSVV---PLSCLAK 390 (396) Q Consensus 347 ~Ar~~G~AI~igh~~p~Ti~---~L~~w~~~l~~~gi~lV---pvS~l~~ 390 (396) +-.+.-++|.+=|-. ++|. .|-...|..-..|=++| +..++++ T Consensus 537 Lrd~GNTVlVVEHD~-~~i~~AD~iIDlGPgAG~~GGeiv~~Gt~~~l~~ 585 (944) T PRK00349 537 LRDLGNTLIVVEHDE-DTIRAADYIVDIGPGAGVHGGEVVASGTPEEIMK 585 (944) T ss_pred HHHCCCEEEEEECCH-HHHHHCCEEEEECCCCCCCCCEEEEEECHHHHHC T ss_conf 985798599983568-7775287789717877777887999748999950 No 137 >TIGR01090 apt adenine phosphoribosyltransferase; InterPro: IPR005764 Adenine phosphoribosyltransferase (APRTase, 2.4.2.7 from EC) is a widely distributed enzyme, and its deficiency in humans causes the accumulation of 2,8-dihydroxyadenine. It is the sole catalyst for adenine recycling in most eukaryotes.AMP + diphosphate = adenine + 5-phospho-alpha-D-ribose 1-diphosphate ; GO: 0003999 adenine phosphoribosyltransferase activity, 0006168 adenine salvage. Probab=25.80 E-value=44 Score=13.90 Aligned_cols=22 Identities=27% Similarity=0.383 Sum_probs=13.7 Q ss_pred EEEEEECCCCCCCHHHHHHHHHC Q ss_conf 58999737787716889999837 Q gi|254780558|r 172 RIAIVVSGLGISQTGTQRAINLL 194 (396) Q Consensus 172 riAIVIddlG~~~~~~~~ai~~L 194 (396) .=.+||||| +--.+|.+|..+| T Consensus 115 ~RVliVDDL-lATGGT~~A~~eL 136 (175) T TIGR01090 115 QRVLIVDDL-LATGGTAEATLEL 136 (175) T ss_pred CEEEEEECC-CCCHHHHHHHHHH T ss_conf 908998322-0126789999999 No 138 >PRK09248 putative hydrolase; Validated Probab=25.79 E-value=45 Score=13.79 Aligned_cols=113 Identities=12% Similarity=0.104 Sum_probs=75.4 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEECCC-CCCCCCHHHHHHHHHHHHCCCCEEEECCCCCC----------CHHHHHHHH Q ss_conf 9989999999999985589359800123-00004989999999998506928997497632----------267898987 Q gi|254780558|r 248 QTVQQLLNRLRYSLRRGTGYFGVMNYRG-AMLLSNKESAEVIFKEFAKRGLLFFDDGSSPR----------NLTRVLAPK 316 (396) Q Consensus 248 ~~~~~~~~~l~~~l~~~p~~vGvnNhmG-s~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~----------Sva~~~A~~ 316 (396) .+.+++.+++..++.+ +++=+.-|.| .+|--| ++.+++.++++|...==.-++.+ ----++|++ T Consensus 109 ~~~~~~t~~~i~ai~n--~~vdiigHp~~~~~~~D---~e~v~~aa~e~g~~lEIN~s~~r~~r~~~~~~~~~il~~a~~ 183 (246) T PRK09248 109 GDKETNTQAMINAIKS--GRVDIIGHPGNPKYPID---IEAVVKAAAEHNVALEINNSSFKHSRKGSEDNCRAIAALCKK 183 (246) T ss_pred CCHHHHHHHHHHHHHC--CCCCEEECCCCCCCCCC---HHHHHHHHHHHCCEEEEECCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 3799999999999857--99768989999988858---999999999849699995886666778764468999999998 Q ss_pred HCCCEEEEEEEECCC-CCHHHHHHHHHHHHHHHHHCCEEE-EEEECCHHHHHHHHHHHHH Q ss_conf 099469845886289-999999999999999998719489-9980887889999999756 Q gi|254780558|r 317 LNLPYMVADLYLDDQ-VDRDKIREKLKGLEEIARTTGQAI-GVAVAFDESIEVISQWLQQ 374 (396) Q Consensus 317 ~gvp~~~~dvfLD~~-~~~~~I~~qL~~l~~~Ar~~G~AI-~igh~~p~Ti~~L~~w~~~ 374 (396) +|+++. |+.+ ...+.|- .++.++.+||+-|+-- -|-.. +.+.|.+|+.. T Consensus 184 ~Gv~i~-----i~SDAH~~~~vg-~~~~~~~~ar~~g~~~e~V~N~---~~e~~~~~l~~ 234 (246) T PRK09248 184 AGVWVA-----LGSDAHIAFDIG-RFEEALKLLDEVGFPEERILNV---SPRRLLDFLES 234 (246) T ss_pred CCCEEE-----EECCCCCHHHHC-CHHHHHHHHHHCCCCHHHEEEC---CHHHHHHHHHH T ss_conf 199399-----858988877754-5999999999859999996268---99999999998 No 139 >COG1964 Predicted Fe-S oxidoreductases [General function prediction only] Probab=25.77 E-value=46 Score=13.79 Aligned_cols=48 Identities=21% Similarity=0.204 Sum_probs=34.0 Q ss_pred HHHHHHHHHCCCC-EEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 9999999985992-99981766568987787855446789989999999999985589359800 Q gi|254780558|r 210 DRWMKEAKKKGQE-AILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMN 272 (396) Q Consensus 210 ~~~~~~Ar~~G~E-vllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnN 272 (396) ....+-.|++|+. |.| + -||--+..+.+..+.++.++..+-..-|||= T Consensus 189 k~alen~r~~g~~svVL-V--------------ptl~rgvNd~~lG~iirfa~~n~dvVrgVnf 237 (475) T COG1964 189 KQALENCRKAGLPSVVL-V--------------PTLIRGVNDHELGAIIRFALNNIDVVRGVNF 237 (475) T ss_pred HHHHHHHHHCCCCCEEE-E--------------EHHHCCCCHHHHHHHHHHHHHCCCCCCCCCE T ss_conf 89999877548872899-7--------------1052156747777899998751351224524 No 140 >PRK12616 pyridoxal kinase; Reviewed Probab=25.19 E-value=47 Score=13.72 Aligned_cols=57 Identities=16% Similarity=0.126 Sum_probs=28.6 Q ss_pred HHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCC Q ss_conf 88999983793508987089876899999999859929998176656898778785544678998999999999998558 Q gi|254780558|r 186 GTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGT 265 (396) Q Consensus 186 ~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p 265 (396) ...+++.+.=.|..--+.|+-+.+..+ .|.+ .-.+.++++..-+..++.-. T Consensus 122 ~~~~~l~~~L~P~a~liTPN~~Ea~~L------~g~~-----------------------~i~~~~~~~~~a~~l~~~G~ 172 (270) T PRK12616 122 EHAEALREQLAPLATVITPNLFEAGQL------SGMG-----------------------EIKTVEQMKEAAKKIHELGA 172 (270) T ss_pred HHHHHHHHHHHHHCCEECCCHHHHHHH------CCCC-----------------------CCCCHHHHHHHHHHHHHHCC T ss_conf 999999999872382847987999987------2998-----------------------67999999999999997079 Q ss_pred CCEEEE Q ss_conf 935980 Q gi|254780558|r 266 GYFGVM 271 (396) Q Consensus 266 ~~vGvn 271 (396) .+|=+. T Consensus 173 ~~Vlik 178 (270) T PRK12616 173 QYVLIT 178 (270) T ss_pred CEEEEC T ss_conf 879990 No 141 >PRK13635 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=25.18 E-value=47 Score=13.72 Aligned_cols=133 Identities=14% Similarity=0.100 Sum_probs=56.0 Q ss_pred EEEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHH--HHHHHHCCCCEEEECCCCCCCCCCCC--CCCCCCCCC Q ss_conf 58999737787716889999837935089870898768999--99999859929998176656898778--785544678 Q gi|254780558|r 172 RIAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRW--MKEAKKKGQEAILQIPMQAFDESYNE--DDSYTLKVT 247 (396) Q Consensus 172 riAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~--~~~Ar~~G~EvllhlPMEp~~~~~~~--pGp~~L~~~ 247 (396) .+--||+-=|-+.+...+.|..|=.|-.=.+.=.+.+..+. .+..++ .-++.|-|..-+-..... -.-+.-..+ T Consensus 34 E~vaivG~nGsGKSTL~k~l~Gl~~p~~G~I~i~G~~i~~~~~~~lr~~--ig~VfQ~P~~~l~~~tV~e~iafgl~~~g 111 (279) T PRK13635 34 EWVAIVGHNGSGKSTLAKLLNGLLLPEAGTITVGGMVLSEETVWDVRKQ--IGMVFQNPDNQFVGTTVQDDVAFGLENIG 111 (279) T ss_pred CEEEEECCCCCHHHHHHHHHHCCCCCCCCEEEECCEECCCCCHHHHHHH--EEEEECCHHHHCCCCHHHHHHHHHHHHCC T ss_conf 8999999999659999999972888889649999999985787999743--66882185652576268999988998779 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHH-HCCCCEEEECCCCCC Q ss_conf 998999999999998558935980012300004989999999998-506928997497632 Q gi|254780558|r 248 QTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEF-AKRGLLFFDDGSSPR 307 (396) Q Consensus 248 ~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l-~~rgL~flDs~Ts~~ 307 (396) ++.+++.+++.+++..+.- ..+.+.....+..-.+-.=.+...| .+-.++++|--|+.- T Consensus 112 ~~~~e~~~rv~~~l~~~gl-~~~~~~~p~~LSGGQrQRvaIAraL~~~P~iLilDEPTs~L 171 (279) T PRK13635 112 VPREEMVERVDQALRQVGM-EDFLNREPHRLSGGQKQRVAIAGVLALQPDILILDEATSML 171 (279) T ss_pred CCHHHHHHHHHHHHHHCCC-HHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCCC T ss_conf 9999999999999987799-78861793439999999999999997099989973874548 No 142 >PRK06904 replicative DNA helicase; Validated Probab=25.09 E-value=47 Score=13.71 Aligned_cols=108 Identities=19% Similarity=0.189 Sum_probs=55.8 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCC-CHHHHHHHHHHHHCCCCEEEECCCCC-----CCHHHHHHHHH- Q ss_conf 678998999999999998558935980012300004-98999999999850692899749763-----22678989870- Q gi|254780558|r 245 KVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLS-NKESAEVIFKEFAKRGLLFFDDGSSP-----RNLTRVLAPKL- 317 (396) Q Consensus 245 ~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~-~~~~m~~vl~~l~~rgL~flDs~Ts~-----~Sva~~~A~~~- 317 (396) ...|+.+++..|+....++++... +.+ |..++. +-..+...+..+++.-.+|+|+..+. ++.+.++.++. T Consensus 257 SLEM~~~~l~~R~ls~~s~v~~~~-i~~--g~~l~~~e~~~~~~~~~~l~~~~~l~idd~~~~t~~~i~~~~r~~~~~~~ 333 (472) T PRK06904 257 SLEMPAEQIMMRMLASLSRVDQTK-IRT--GQNLDQQDWAKISSTVGMFKQKPNLYIDDSSGLTPTELRSRARRVYRENG 333 (472) T ss_pred CCCCCHHHHHHHHHHHHCCCCHHH-HHC--CCCCHHHHHHHHHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHCC T ss_conf 787999999999999864999888-646--88560999999999999984689816846999999999999999998738 Q ss_pred CCCEEEEEEEECCCC----------CHHHHHHHHHHHHHHHHHCCEEEEEEE Q ss_conf 994698458862899----------999999999999999987194899980 Q gi|254780558|r 318 NLPYMVADLYLDDQV----------DRDKIREKLKGLEEIARTTGQAIGVAV 359 (396) Q Consensus 318 gvp~~~~dvfLD~~~----------~~~~I~~qL~~l~~~Ar~~G~AI~igh 359 (396) |+-. ||+|.-+ ....|-.-=.+|-.+||+-+..|...+ T Consensus 334 ~l~~----vvIDYLqL~~~~~~~~~r~~ei~~isr~LK~lAkel~ipvi~Ls 381 (472) T PRK06904 334 GLSL----IMVDYLQLMRAPGFEDNRTLEIAEISRSLKALAKELKVPVVALS 381 (472) T ss_pred CCCE----EEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEC T ss_conf 9978----99638866048887777889999999999999999799889973 No 143 >PRK10128 putative aldolase; Provisional Probab=24.91 E-value=47 Score=13.69 Aligned_cols=85 Identities=15% Similarity=0.255 Sum_probs=54.8 Q ss_pred CCCHHHHHHHHHHHHCCCC--EEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCCEEEE Q ss_conf 0498999999999850692--89974976322678989870994698458862899999999999999999987194899 Q gi|254780558|r 279 LSNKESAEVIFKEFAKRGL--LFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTTGQAIG 356 (396) Q Consensus 279 t~~~~~m~~vl~~l~~rgL--~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~ 356 (396) -.+..+++.+=+.++--|+ +|+- ...++..+|+|- +.+...+.+.+++..+.++++|...| T Consensus 135 IEt~~av~nldeI~av~GvD~~fiG--------p~DLs~slG~pg---------~~~~p~v~~ai~~v~~~~~~~gk~~G 197 (250) T PRK10128 135 VESKTALDNLDEILDVEGIDGVFIG--------PADLSASLGYPD---------NAGHPEVQRIIETSIRRIRAAGKAAG 197 (250) T ss_pred ECCHHHHHHHHHHHCCCCCCEEEEC--------HHHHHHHCCCCC---------CCCCHHHHHHHHHHHHHHHHCCCCEE T ss_conf 1558999879998588998889988--------488998659999---------99986999999999999998699768 Q ss_pred EEECCHHHHHHHHHHHHHHHHCCCEEEEHHH Q ss_conf 9808878899999997568876918996777 Q gi|254780558|r 357 VAVAFDESIEVISQWLQQEHVRDVSVVPLSC 387 (396) Q Consensus 357 igh~~p~Ti~~L~~w~~~l~~~gi~lVpvS~ 387 (396) |--.-++.. ++|+ ++|+.+|-++. T Consensus 198 ~~~~~~~~a---~~~~----~~G~~~v~~g~ 221 (250) T PRK10128 198 FLAVAPDMA---QQCL----AWGANFVAVGV 221 (250) T ss_pred EECCCHHHH---HHHH----HCCCCEEEEHH T ss_conf 837999999---9999----76999998657 No 144 >PRK12479 branched-chain amino acid aminotransferase; Provisional Probab=24.90 E-value=3.3 Score=20.99 Aligned_cols=79 Identities=15% Similarity=0.141 Sum_probs=58.5 Q ss_pred HHHHHHHHHCCCEEEEEEEECCC--CCHHHHHHHHHHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEHH Q ss_conf 67898987099469845886289--9999999999999999987194899980887889999999756887691899677 Q gi|254780558|r 309 LTRVLAPKLNLPYMVADLYLDDQ--VDRDKIREKLKGLEEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSVVPLS 386 (396) Q Consensus 309 va~~~A~~~gvp~~~~dvfLD~~--~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~lVpvS 386 (396) ..-++|++.|+++.++++-++.- -|+..|..-.....-+.+-+|..|+-|.+-|.|-.-.+.+.+-.+++|+.+-.+. T Consensus 216 ~viela~~~gi~v~E~~i~~~el~~ADEvFlt~s~~~i~PV~~id~~~ig~g~~Gpvt~~l~~~y~~~~~~~~~~~~~~~ 295 (299) T PRK12479 216 SVIELCERLSIPCEERPFTRHDVYVADEVFLTGTAAELIPVVKVDSREIGDGKPGSVTKQLTEEFKKLTRERGVRVPGLA 295 (299) T ss_pred HHHHHHHHCCCEEEEEECCHHHHHHCCEEEECCCCCCEEEEEEECCEECCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHH T ss_conf 99999998699699995789999719999983797738999999998956988698999999999999974198786345 Q ss_pred H Q ss_conf 7 Q gi|254780558|r 387 C 387 (396) Q Consensus 387 ~ 387 (396) + T Consensus 296 ~ 296 (299) T PRK12479 296 E 296 (299) T ss_pred H T ss_conf 4 No 145 >COG2230 Cfa Cyclopropane fatty acid synthase and related methyltransferases [Cell envelope biogenesis, outer membrane] Probab=24.82 E-value=38 Score=14.25 Aligned_cols=167 Identities=12% Similarity=0.137 Sum_probs=98.5 Q ss_pred EEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 89997377877168899998379350898708987689999999985992999817665689877878554467899899 Q gi|254780558|r 173 IAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQ 252 (396) Q Consensus 173 iAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~ 252 (396) .-+.=-|+|.+... ..+..+... =..++.=..+.....-+++++.|.|.-+++ +..||.+.+. + T Consensus 74 ~~lLDiGCGWG~l~-~~aA~~y~v-~V~GvTlS~~Q~~~~~~r~~~~gl~~~v~v--~l~d~rd~~e-~----------- 137 (283) T COG2230 74 MTLLDIGCGWGGLA-IYAAEEYGV-TVVGVTLSEEQLAYAEKRIAARGLEDNVEV--RLQDYRDFEE-P----------- 137 (283) T ss_pred CEEEEECCCHHHHH-HHHHHHCCC-EEEEEECCHHHHHHHHHHHHHCCCCCCCEE--EECCCCCCCC-C----------- T ss_conf 98987478844999-999998499-799966899999999999997599766079--9656210324-3----------- Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCC- Q ss_conf 9999999998558935980012300004989999999998506928997497632267898987099469845886289- Q gi|254780558|r 253 LLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQ- 331 (396) Q Consensus 253 ~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~- 331 (396) ++++- .||+-.|+|-. .-..=++.+-+.|+..|++++.+-+.++.--. .-.++..+.+|=+.. T Consensus 138 --------fDrIv-SvgmfEhvg~~--~~~~ff~~~~~~L~~~G~~llh~I~~~~~~~~-----~~~~~i~~yiFPgG~l 201 (283) T COG2230 138 --------FDRIV-SVGMFEHVGKE--NYDDFFKKVYALLKPGGRMLLHSITGPDQEFR-----RFPDFIDKYIFPGGEL 201 (283) T ss_pred --------CCEEE-EHHHHHHHCCC--CHHHHHHHHHHHCCCCCEEEEEEECCCCCCCC-----CCHHHHHHHCCCCCCC T ss_conf --------42055-60068873710--28999999996459996599999667885544-----3327899857899848 Q ss_pred CCHHHHHHHHHHHHHHHHHCCEEEE-EEECCHHHHHHHHHHHHHHHHC Q ss_conf 9999999999999999987194899-9808878899999997568876 Q gi|254780558|r 332 VDRDKIREKLKGLEEIARTTGQAIG-VAVAFDESIEVISQWLQQEHVR 378 (396) Q Consensus 332 ~~~~~I~~qL~~l~~~Ar~~G~AI~-igh~~p~Ti~~L~~w~~~l~~~ 378 (396) ++. ......+.+.|..|- +-+-++.=...|..|...++++ T Consensus 202 Ps~-------~~i~~~~~~~~~~v~~~~~~~~hYa~Tl~~W~~~f~~~ 242 (283) T COG2230 202 PSI-------SEILELASEAGFVVLDVESLRPHYARTLRLWRERFEAN 242 (283) T ss_pred CCH-------HHHHHHHHHCCCEEEHHHHHCHHHHHHHHHHHHHHHHH T ss_conf 977-------99999887568688427640677899999999999998 No 146 >pfam03971 IDH Monomeric isocitrate dehydrogenase. NADP(+)-dependent isocitrate dehydrogenase (ICD) is an important enzyme of the intermediary metabolism, as it controls the carbon flux within the citric acid cycle and supplies the cell with 2-oxoglutarate EC:1.1.1.42 and NADPH for biosynthetic purposes. Probab=24.77 E-value=48 Score=13.67 Aligned_cols=175 Identities=13% Similarity=0.092 Sum_probs=99.1 Q ss_pred CCCHHHHHHHHHHCCCCEEEECC--C----------------------------CCCCCCCCCCCCCCCCCCC------C Q ss_conf 87689999999985992999817--6----------------------------6568987787855446789------9 Q gi|254780558|r 206 GNSLDRWMKEAKKKGQEAILQIP--M----------------------------QAFDESYNEDDSYTLKVTQ------T 249 (396) Q Consensus 206 ~~~~~~~~~~Ar~~G~EvllhlP--M----------------------------Ep~~~~~~~pGp~~L~~~~------~ 249 (396) .....+.++.|++.|-=.=|||= | +..+. +-+-|-+.|+... . T Consensus 229 ~~F~~~qi~dAK~~gvLfSLHLKATMMKVSDPIIFGHaV~vffkdvFeKh~~~f~elGv-npNNGlgdl~~Ki~~Lp~~~ 307 (735) T pfam03971 229 ADFFEEQMQDAKETGVLFSLHVKATMMKVSDPIIFGHAVSVFYKDVFDKHGDLFDQLGV-NVNNGIGDLYAKIVSLPASQ 307 (735) T ss_pred HHHHHHHHHHHHHCCCEEEEHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCCCHHHHHHHHHHCCHHH T ss_conf 99999999999864937873022344333686006999999999999999999998499-85767899999886098768 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECC-----------------------CCCC----------------CCCHHHHHHHHH Q ss_conf 8999999999998558935980012-----------------------3000----------------049899999999 Q gi|254780558|r 250 VQQLLNRLRYSLRRGTGYFGVMNYR-----------------------GAML----------------LSNKESAEVIFK 290 (396) Q Consensus 250 ~~~~~~~l~~~l~~~p~~vGvnNhm-----------------------Gs~~----------------t~~~~~m~~vl~ 290 (396) .++|+..++..+..-|...-||..- |++. .+-...-+.+++ T Consensus 308 ~~eI~adi~~~~~~~P~lAMVnSDkGITNLHVPSDVIIDASMPAmIR~sGkmW~~dG~~~DtkavIPD~sYA~vYq~~I~ 387 (735) T pfam03971 308 RAEIEADIHAVYAHRPELAMVDSDKGITNLHVPSDVIVDASMPAMIRNSGQMWGKDGKLKDTKAVIPDSTYAGVYQEVID 387 (735) T ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCCCEEEECCCHHHHHHCCCEECCCCCCCCCEEECCCCHHHHHHHHHHH T ss_conf 99999999999850997376506788435668865567267489987146317999881543375466313789999999 Q ss_pred HHHCCCCEEEECCCCCCCHH-----HHHHHHHC-------CCEEEEEEEEC--------CCCC-----------HHHHHH Q ss_conf 98506928997497632267-----89898709-------94698458862--------8999-----------999999 Q gi|254780558|r 291 EFAKRGLLFFDDGSSPRNLT-----RVLAPKLN-------LPYMVADLYLD--------DQVD-----------RDKIRE 339 (396) Q Consensus 291 ~l~~rgL~flDs~Ts~~Sva-----~~~A~~~g-------vp~~~~dvfLD--------~~~~-----------~~~I~~ 339 (396) .|+++|-|=. |+-.||. .+-|.++| +|....-.++| ++-. ...|+. T Consensus 388 ~ck~nGafDP---ttmGsV~NVGLMAqKAEEYGSHdkTF~~~~~G~v~vvd~~g~vl~~h~Ve~GDIwR~cq~kd~pI~d 464 (735) T pfam03971 388 FCKTHGAFDP---TTMGSVPNVGLMAQKAEEYGSHDKTFEIQADGVVRVVDAAGEVLLEHSVEAGDIWRMCQTKDAPIRD 464 (735) T ss_pred HHHHCCCCCC---HHCCCCCCHHHHHHHHHHHCCCCCCEECCCCCEEEEEECCCCEEEECCCCCCCEEHHCCCCCCHHHH T ss_conf 9997499893---2216777427888878984789851555899379999799989997110588851121146717999 Q ss_pred HHHHHHHHHHHCCE-EEEEE---ECCH-HHHHHHHHHHHHHHHCCCEEEE Q ss_conf 99999999987194-89998---0887-8899999997568876918996 Q gi|254780558|r 340 KLKGLEEIARTTGQ-AIGVA---VAFD-ESIEVISQWLQQEHVRDVSVVP 384 (396) Q Consensus 340 qL~~l~~~Ar~~G~-AI~ig---h~~p-~Ti~~L~~w~~~l~~~gi~lVp 384 (396) -.+-+..+||..|. ||--- ++|+ +-|...+.++++-.-.|+.+-= T Consensus 465 WvkLav~Rar~t~~pavFWLD~~RaHD~~li~kV~~yL~~hdt~gldi~I 514 (735) T pfam03971 465 WVKLAVTRARLSNTPAVFWLDPARAHDVELIKKVKRYLKDHDTNGLDIRI 514 (735) T ss_pred HHHHHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCCEEE T ss_conf 99999999897499768952888862699999999998754898885174 No 147 >smart00633 Glyco_10 Glycosyl hydrolase family 10. Probab=24.61 E-value=48 Score=13.66 Aligned_cols=44 Identities=18% Similarity=0.203 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHHHCCEEE-EEE---ECC--HHHHHHHHHHHHHHHHCCCE Q ss_conf 999999999998719489-998---088--78899999997568876918 Q gi|254780558|r 338 REKLKGLEEIARTTGQAI-GVA---VAF--DESIEVISQWLQQEHVRDVS 381 (396) Q Consensus 338 ~~qL~~l~~~Ar~~G~AI-~ig---h~~--p~Ti~~L~~w~~~l~~~gi~ 381 (396) +..+.++++.-+++|.-| ||| |-. ..+.+.+..-+..+..-|+. T Consensus 135 ~~~~~~lv~~l~~~g~pIdgIG~Q~H~~~~~~~~~~~~~~l~~~~~~gl~ 184 (254) T smart00633 135 RQAIYELVKKLKAKGVPIDGIGLQSHLSLGSPNIAEIRAALDRFASLGLE 184 (254) T ss_pred HHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHCCCC T ss_conf 99999999999977998315876243047999999999999999965994 No 148 >TIGR00263 trpB tryptophan synthase, beta subunit; InterPro: IPR006654 Tryptophan synthase catalyzes the last step in the biosynthesis of tryptophan , :L-serine + 1-(indol-3-yl)glycerol 3-phosphate = L-tryptophan + glyceraldehyde 3-phosphate + H_2O It has two functional domains, each found in bacteria and plants on a separate subunit: alpha chain (IPR002028 from INTERPRO) is for the aldol cleavage of indoleglycerol phosphate to indole and glyceraldehyde 3-phosphate and beta chain is for the synthesis of tryptophan from indole and serine. In fungi the two domains are fused together on a single multifunctional protein . The beta chain of the enzyme, represented here, requires pyridoxal-phosphate as a cofactor. The pyridoxal-phosphate group is attached to a lysine residue. The region around this lysine residue also contains two histidine residues which are part of the pyridoxal-phosphate binding site.; GO: 0004834 tryptophan synthase activity, 0006568 tryptophan metabolic process. Probab=24.23 E-value=49 Score=13.61 Aligned_cols=104 Identities=16% Similarity=0.209 Sum_probs=67.4 Q ss_pred CCCC-EEEEECCCCCCCCCHHHHHHHHHHHH-CCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHH Q ss_conf 5893-59800123000049899999999985-069289974976322678989870994698458862899999999999 Q gi|254780558|r 264 GTGY-FGVMNYRGAMLLSNKESAEVIFKEFA-KRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKL 341 (396) Q Consensus 264 ~p~~-vGvnNhmGs~~t~~~~~m~~vl~~l~-~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL 341 (396) -.|- +||.--|=|.|.+|+.. +|...-. .=||=|- +|.++.|-=...-. ++-.+..|.++ | T Consensus 293 ~~G~t~Gv~HG~kt~lLQDedG--Qi~~~hSvSAGLDYP-------GVGP~HA~L~~~GR----a~Y~~iTD~EA----l 355 (412) T TIGR00263 293 SKGSTVGVLHGMKTYLLQDEDG--QILEAHSVSAGLDYP-------GVGPEHAYLHETGR----AEYEAITDDEA----L 355 (412) T ss_pred CCCCCCEEECCCCCCCCCCCCC--CCCCCEEEEEECCCC-------CCCHHHHHHHCCCC----EEEEECCHHHH----H T ss_conf 3686323641332010206567--603211278515788-------86867788750387----56630688999----9 Q ss_pred HHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEHH Q ss_conf 999999987194899980887889999999756887691899677 Q gi|254780558|r 342 KGLEEIARTTGQAIGVAVAFDESIEVISQWLQQEHVRDVSVVPLS 386 (396) Q Consensus 342 ~~l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~~gi~lVpvS 386 (396) +-+..++|++|..=+|=+.|. |+.+.+.+++|.+..|.+|-+| T Consensus 356 ~AF~~l~~~EGIIPALESSHA--lA~~~k~aP~l~kd~ivvvNLS 398 (412) T TIGR00263 356 EAFKLLSRNEGIIPALESSHA--LAYLEKLAPTLPKDQIVVVNLS 398 (412) T ss_pred HHHHHHHHHCCCCCCHHHHHH--HHHHHHHCCCCCCCCEEEEEEC T ss_conf 999998771687342356689--9999974788888877899842 No 149 >PRK06209 glutamate-1-semialdehyde 2,1-aminomutase; Provisional Probab=24.15 E-value=49 Score=13.60 Aligned_cols=18 Identities=17% Similarity=0.239 Sum_probs=12.5 Q ss_pred CCCCCHHHHHHHHHHCCC Q ss_conf 898768999999998599 Q gi|254780558|r 204 SNGNSLDRWMKEAKKKGQ 221 (396) Q Consensus 204 P~~~~~~~~~~~Ar~~G~ 221 (396) |...++++.-+..++.|- T Consensus 201 p~~~fl~~lr~lc~~~gi 218 (431) T PRK06209 201 PQDNFLHEVQRLCHENGA 218 (431) T ss_pred CCHHHHHHHHHHHHHCCC T ss_conf 978999999999987290 No 150 >KOG0256 consensus Probab=23.97 E-value=49 Score=13.58 Aligned_cols=112 Identities=16% Similarity=0.229 Sum_probs=75.5 Q ss_pred HHHHHHHHCCCC---EEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHH Q ss_conf 999999985992---99981766568987787855446789989999999999985589359800123000049899999 Q gi|254780558|r 211 RWMKEAKKKGQE---AILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEV 287 (396) Q Consensus 211 ~~~~~Ar~~G~E---vllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~ 287 (396) ..-++||+.|.- ||+-= | .+.|.+..++++....|.|+-..-=+.|-=-=||||.|... .... T Consensus 215 ~A~~~A~~~~~kVkGvlitN-----------P-sNPLG~~~~~e~L~~ll~Fa~~kniHvI~DEIya~sVF~~~--~F~S 280 (471) T KOG0256 215 AALNQARKLGLKVKGVLITN-----------P-SNPLGTTLSPEELISLLNFASRKNIHVISDEIYAGSVFDKS--EFRS 280 (471) T ss_pred HHHHHHHHHCCCEEEEEEEC-----------C-CCCCCCCCCHHHHHHHHHHHHHCCEEEEEEHHHHHCCCCCC--CCEE T ss_conf 99999998189524899827-----------9-99777746999999999988645627986032100203666--7137 Q ss_pred HHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHH Q ss_conf 999985069289974976322678989870994698458862899999999999 Q gi|254780558|r 288 IFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKL 341 (396) Q Consensus 288 vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL 341 (396) |++..+...+ |... --+.+.+.+.+|+|-.+.-+.--+..+....-+++ T Consensus 281 v~ev~~~~~~---~~~r--vHivyslSKD~GlpGfRvGviYs~ne~VvsaA~km 329 (471) T KOG0256 281 VLEVRKDPHL---DPDR--VHIVYSLSKDFGLPGFRVGVIYSNNEDVVSAATKM 329 (471) T ss_pred HHHHHHCCCC---CCCC--EEEEEEECCCCCCCCEEEEEEEECCHHHHHHHHHH T ss_conf 8987526456---8772--79999733556988637899986673899999987 No 151 >PRK05595 replicative DNA helicase; Provisional Probab=23.82 E-value=50 Score=13.56 Aligned_cols=119 Identities=14% Similarity=0.128 Sum_probs=64.4 Q ss_pred HHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHC Q ss_conf 99985992999817665689877878554467899899999999999855893598001230000498999999999850 Q gi|254780558|r 215 EAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAK 294 (396) Q Consensus 215 ~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~ 294 (396) .|-+.|+-|++ ....|+.+++..|+-...+.++...--+ |..--.+-..+...+..|.+ T Consensus 225 ~a~~~g~~V~~------------------fSlEMs~~ql~~R~ls~~s~i~~~~i~~---g~l~~~~~~~~~~a~~~l~~ 283 (444) T PRK05595 225 AALREGKSVVI------------------FSLEMSKEQLAYKLLCSEANVDMLRLRT---GNLEDKDWENIARASGPLAA 283 (444) T ss_pred HHHHCCCCEEE------------------EECCCCHHHHHHHHHHHHCCCCCCCCCC---CCCCHHHHHHHHHHHHHHHC T ss_conf 99866993799------------------9588999999999999646988442326---89799999999999999854 Q ss_pred CCCEEEECCC-C---CCCHHHHHHHHHCCCEEEEEEEECCCC----------CHHHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 6928997497-6---322678989870994698458862899----------99999999999999998719489998 Q gi|254780558|r 295 RGLLFFDDGS-S---PRNLTRVLAPKLNLPYMVADLYLDDQV----------DRDKIREKLKGLEEIARTTGQAIGVA 358 (396) Q Consensus 295 rgL~flDs~T-s---~~Sva~~~A~~~gvp~~~~dvfLD~~~----------~~~~I~~qL~~l~~~Ar~~G~AI~ig 358 (396) ..+++-|+.. + -++.+..+.++.|+.. +|+|.-+ ...+|..--.+|-.+||+-+..|... T Consensus 284 ~~l~i~d~~~~ti~~i~~~~r~~~~~~~~~l----iiiDYlqLi~~~~~~~~r~~ev~~isr~LK~lAkel~ipvi~l 357 (444) T PRK05595 284 AKIYIDDTAGVSVMEMRSKCRRLKIEHGIDL----ILIDYLQLMSGGKSSESRQQEVSEISRSIKALAKEMECPVIAL 357 (444) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHCCCCE----EEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 8970548999648999999999998739998----9982376357898888899999999999999999969979997 No 152 >cd06310 PBP1_ABC_sugar_binding_like_2 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=23.82 E-value=50 Score=13.56 Aligned_cols=26 Identities=31% Similarity=0.382 Sum_probs=12.4 Q ss_pred EEEECCCC-CHHHHHHHHHHCCCCEEE Q ss_conf 98708987-689999999985992999 Q gi|254780558|r 200 LAFASNGN-SLDRWMKEAKKKGQEAIL 225 (396) Q Consensus 200 ~A~~P~~~-~~~~~~~~Ar~~G~Evll 225 (396) +.+.|..+ ......+.+.++|..|++ T Consensus 61 Iii~p~~~~~~~~~l~~a~~~gIPvV~ 87 (273) T cd06310 61 ILLAPTDAKALVPPLKEAKDAGIPVVL 87 (273) T ss_pred EEEECCCCHHHHHHHHHHHHCCCCEEE T ss_conf 999168714479999999984998589 No 153 >TIGR00017 cmk cytidylate kinase; InterPro: IPR003136 This family consists of cytidylate kinase (2.7.4.14 from EC), which catalyzes the phosphorylation of cytidine 5-monophosphate (dCMP) to cytidine 5 -diphosphate (dCDP) in the presence of ATP or GTP. UMP and dCMP can also act as acceptors .; GO: 0004127 cytidylate kinase activity, 0005524 ATP binding, 0006139 nucleobase nucleoside nucleotide and nucleic acid metabolic process. Probab=23.56 E-value=50 Score=13.53 Aligned_cols=17 Identities=24% Similarity=0.495 Sum_probs=6.8 Q ss_pred CEEEEEEEECCCCCHHH Q ss_conf 46984588628999999 Q gi|254780558|r 320 PYMVADLYLDDQVDRDK 336 (396) Q Consensus 320 p~~~~dvfLD~~~~~~~ 336 (396) |-+.--+|||....+.+ T Consensus 141 PnA~~KiFLdAsve~RA 157 (223) T TIGR00017 141 PNAELKIFLDASVEERA 157 (223) T ss_pred CCCCCCEEECCCHHHHH T ss_conf 88751324317878998 No 154 >TIGR00382 clpX ATP-dependent Clp protease, ATP-binding subunit ClpX; InterPro: IPR004487 ClpX is a member of the HSP (heat-shock protein) 100 family. Gel filtration and electron microscopy showed that ClpX subunits associate to form a six-membered ring that is stabilized by binding of ATP or nonhydrolyzable analogs of ATP . It functions as an ATP-depedent molecular chaperone and is the regulatory subunit of the ClpXP protease . ClpXP is involved in DNA damage repair, stationary-phase gene expression, and ssrA-mediated protein quality control. To date more than 50 proteins include transcription factors, metabolic enzymes, and proteins involved in the starvation and oxidative stress responses have been identified as substrates . The N-terminal domain of ClpX is a C4-type zinc binding domain (ZBD) involved in substrate recognition. ZBD forms a very stable dimer that is essential for promoting the degradation of some typical ClpXP substrates such as lO and MuA . ; GO: 0005515 protein binding, 0005524 ATP binding, 0016887 ATPase activity, 0015031 protein transport. Probab=23.38 E-value=48 Score=13.67 Aligned_cols=98 Identities=22% Similarity=0.224 Sum_probs=56.2 Q ss_pred CHHHHHHHHHH-HHHHCC----CCEEEEECCC-----C-CCCCCHH----HHHHHH-HHHHCCCCEEEECCCCCCC-HHH Q ss_conf 98999999999-998558----9359800123-----0-0004989----999999-9985069289974976322-678 Q gi|254780558|r 249 TVQQLLNRLRY-SLRRGT----GYFGVMNYRG-----A-MLLSNKE----SAEVIF-KEFAKRGLLFFDDGSSPRN-LTR 311 (396) Q Consensus 249 ~~~~~~~~l~~-~l~~~p----~~vGvnNhmG-----s-~~t~~~~----~m~~vl-~~l~~rgL~flDs~Ts~~S-va~ 311 (396) .|.+++..|.. ..++=. -.|.|=||.= + .-..|.. -++.+- =||.+-|...+---=|.+| .|. T Consensus 91 ~P~eik~~LD~YVIGQe~AKKVLsVAVYNHYKRl~~~~~n~~~d~~D~nvelehleeVEL~KSNILLiGPTGSGKTLLAq 170 (452) T TIGR00382 91 TPKEIKAHLDEYVIGQEQAKKVLSVAVYNHYKRLNLKEKNKKSDNGDSNVELEHLEEVELSKSNILLIGPTGSGKTLLAQ 170 (452) T ss_pred CCHHHHHHHCCCCCCCCCCCEEEEEEEHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCEEEECCCCCCHHHHHH T ss_conf 82799997213612310105254324112466653243045558840002354444433300662454688852689999 Q ss_pred HHHHHHCCCEEEEEEEECCC-----CCHHHHHHHHHHHHH Q ss_conf 98987099469845886289-----999999999999999 Q gi|254780558|r 312 VLAPKLNLPYMVADLYLDDQ-----VDRDKIREKLKGLEE 346 (396) Q Consensus 312 ~~A~~~gvp~~~~dvfLD~~-----~~~~~I~~qL~~l~~ 346 (396) ++|+.+.|||+-+|--==.+ -|.+.|..+|-|... T Consensus 171 TLA~~L~VPfAiADATtLTEAGYVGEDVENIL~~Llq~ad 210 (452) T TIGR00382 171 TLARILNVPFAIADATTLTEAGYVGEDVENILLKLLQAAD 210 (452) T ss_pred HHHHHCCCCEEECCHHHHHCCCCCCCCHHHHHHHHHHHCC T ss_conf 9998738874211111020066424228899999987414 No 155 >PRK08760 replicative DNA helicase; Provisional Probab=23.36 E-value=51 Score=13.50 Aligned_cols=107 Identities=20% Similarity=0.144 Sum_probs=57.1 Q ss_pred CCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCC-CC---CCHHHHHHHHHCCC Q ss_conf 678998999999999998558935980012300004989999999998506928997497-63---22678989870994 Q gi|254780558|r 245 KVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGS-SP---RNLTRVLAPKLNLP 320 (396) Q Consensus 245 ~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~T-s~---~Sva~~~A~~~gvp 320 (396) ...|+.+++..|+....++++...=- .|..-..+-..+...+..+++..+|+-|+.. +. ++.+..+.++.++. T Consensus 265 SLEMs~~ql~~Rlls~~s~v~~~~i~---~g~l~~~e~~~~~~a~~~l~~~~l~idD~~~~t~~~ir~~~R~~k~~~~l~ 341 (476) T PRK08760 265 SMEMSASQLAMRLISSNGRINAQRLR---TGALEDEDWARVTGAIKMLKETKIFIDDTPGVSPEVLRSKCRRLKREHDLG 341 (476) T ss_pred ECCCCHHHHHHHHHHHHCCCCHHHHH---CCCCCHHHHHHHHHHHHHHHHCCCEEECCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 03699999999999983389767776---489999999999999999860881685799999999999999998727998 Q ss_pred EEEEEEEECCCC----------CHHHHHHHHHHHHHHHHHCCEEEEEE Q ss_conf 698458862899----------99999999999999998719489998 Q gi|254780558|r 321 YMVADLYLDDQV----------DRDKIREKLKGLEEIARTTGQAIGVA 358 (396) Q Consensus 321 ~~~~dvfLD~~~----------~~~~I~~qL~~l~~~Ar~~G~AI~ig 358 (396) . ||+|.-+ ....|-.--.+|-.+||+-+..|... T Consensus 342 l----vvIDYLqL~~~~~~~~~r~~~v~~isr~lK~lAkel~vpVi~L 385 (476) T PRK08760 342 L----IVIDYLQLMSVPGNSENRATEISEISRSLKGLAKELNVPVIAL 385 (476) T ss_pred E----EEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 7----9997076415888874488999999999999999979978996 No 156 >PRK13640 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=23.27 E-value=51 Score=13.49 Aligned_cols=125 Identities=15% Similarity=0.135 Sum_probs=57.6 Q ss_pred EEEEEECCCCCCCHHHHHHHHHCCCC-------CEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCC----CCC Q ss_conf 58999737787716889999837935-------0898708987689999999985992999817665689877----878 Q gi|254780558|r 172 RIAIVVSGLGISQTGTQRAINLLPAN-------ITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYN----EDD 240 (396) Q Consensus 172 riAIVIddlG~~~~~~~~ai~~LP~~-------vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~----~pG 240 (396) .+--||+.=|-+.+...+.+..|=.| |.+--...... ...+..++-| ++.|-|..-+-.... .-| T Consensus 35 e~vaiiG~nGsGKSTL~~~l~Gll~P~~~~~G~i~~~g~~i~~~--~~~~lr~~vg--~VfQ~P~~q~~~~tV~e~iafg 110 (283) T PRK13640 35 SWTALIGHNGSGKSTISKLINGLLLPDDNPKSKITVDGITLTAK--TVWDIREKVG--IVFQNPDNQFVGATVGDDVAFG 110 (283) T ss_pred CEEEEECCCCCHHHHHHHHHHCCCCCCCCCEEEEEECCEECCCC--CHHHHHHCEE--EEEECCCCCCCCCCHHHHHHHH T ss_conf 99999999998799999999640378888617999999999967--9889962618--9986887618878299999845 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCC---CCCCHHHHHHHHHHHH-CCCCEEEECCCCC Q ss_conf 5544678998999999999998558935980012300---0049899999999985-0692899749763 Q gi|254780558|r 241 SYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAM---LLSNKESAEVIFKEFA-KRGLLFFDDGSSP 306 (396) Q Consensus 241 p~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~---~t~~~~~m~~vl~~l~-~rgL~flDs~Ts~ 306 (396) + -..+++.+++.+++.++|..+ |+.++.... +..-..-.=.+...|. +-.++++|--|+. T Consensus 111 l--~n~~~~~~e~~~~v~~~l~~v----gl~~~~~~~p~~LSGGqkQRvaiA~aLa~~P~iLllDEPTs~ 174 (283) T PRK13640 111 L--ENRGVPRPEMIKIVADVLSDV----GMLDYIDSEPANLSGGQKQRVAIAGILAVEPQIIILDESTSM 174 (283) T ss_pred H--HHCCCCHHHHHHHHHHHHHHC----CCHHHHHCCHHHCCHHHHHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 7--537999999999999999877----997776479222999999999999999719999997687454 No 157 >PRK07369 dihydroorotase; Provisional Probab=23.09 E-value=51 Score=13.47 Aligned_cols=107 Identities=17% Similarity=0.121 Sum_probs=55.3 Q ss_pred CCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 89359800123000049899999999985069289974976322678989870994698458862899999999999999 Q gi|254780558|r 265 TGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGL 344 (396) Q Consensus 265 p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l 344 (396) .|++|.. .+....|...|..+|+.++..|..++-.--.. .+...-....|.-. -...+...+...+. .++.++ T Consensus 148 ~Gv~~f~---~~~~~~~~~~l~~~l~~~~~~~~~v~~h~ed~-~l~~~~~~~~g~~~--~~~~~~~rP~~~E~-~av~r~ 220 (419) T PRK07369 148 AGVVGFT---DGQPLENLALLRRLLEYLAPLGKPVALWPCDR-QLAGNGVMREGPDA--LRLGLPGIPASAET-AALAAL 220 (419) T ss_pred CCCEEEC---CCCCCCCHHHHHHHHHHHHHCCCEEEEECCCH-HHHHCHHHHCCCCC--HHCCCCCCCHHHHH-HHHHHH T ss_conf 7968904---89654899999999999986399189950689-88727265448640--20289988445546-999999 Q ss_pred HHHHHHCCEEEEEEECC-HHHHHHHHHHHHHHHHCCCEE Q ss_conf 99998719489998088-788999999975688769189 Q gi|254780558|r 345 EEIARTTGQAIGVAVAF-DESIEVISQWLQQEHVRDVSV 382 (396) Q Consensus 345 ~~~Ar~~G~AI~igh~~-p~Ti~~L~~w~~~l~~~gi~l 382 (396) ..+|+..|.-+-|.|.- .++++.|. +.+.+|+.+ T Consensus 221 i~lA~~~g~~lhi~HvSs~~~~e~i~----~ak~~G~~v 255 (419) T PRK07369 221 LELVAAIGTPVHLMRVSTARSVELIA----QAKARGLPI 255 (419) T ss_pred HHHHHHHCCCEEEECCCCHHHHHHHH----HHHHCCCCE T ss_conf 99988753848996168589999999----888639808 No 158 >cd03315 MLE_like Muconate lactonizing enzyme (MLE) like subgroup of the enolase superfamily. Enzymes of this subgroup share three conserved carboxylate ligands for the essential divalent metal ion (usually Mg2+), two aspartates and a glutamate, and residues that can function as general acid/base catalysts, a Lys-X-Lys motif and another conserved lysine. Despite these conserved residues, the members of the MLE subgroup, like muconate lactonizing enzyme, o-succinylbenzoate synthase (OSBS) and N-acylamino acid racemase (NAAAR), catalyze different reactions. Probab=23.02 E-value=51 Score=13.46 Aligned_cols=136 Identities=13% Similarity=0.094 Sum_probs=81.2 Q ss_pred HHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHH Q ss_conf 99999999859929998176656898778785544678998999999999998558935980012300004989999999 Q gi|254780558|r 210 DRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIF 289 (396) Q Consensus 210 ~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl 289 (396) .+.++.+.+.|+..+= +..+.+.++...+++...+.++.-+-+.=-.-..++.+ ....++ T Consensus 90 ~~~~~~~~~~G~~~~K------------------ikvg~~~~~d~~~v~~ir~~~g~~~~l~vDan~~~~~~--~A~~~~ 149 (265) T cd03315 90 AEEARRALEAGFRTFK------------------LKVGRDPARDVAVVAALREAVGDDAELRVDANRGWTPK--QAIRAL 149 (265) T ss_pred HHHHHHHHHCCCCEEE------------------EECCCCHHHHHHHHHHHHHHCCCCCEEEECCCCCCCHH--HHHHHH T ss_conf 9999999975999899------------------97689859999999999987299967985787788999--999999 Q ss_pred HHHHCCCCEEEECCCCCCCH--HHHHHHHHCCCEEEEEEEECCCCCHHHHH-----------------HHHHHHHHHHHH Q ss_conf 99850692899749763226--78989870994698458862899999999-----------------999999999987 Q gi|254780558|r 290 KEFAKRGLLFFDDGSSPRNL--TRVLAPKLNLPYMVADLYLDDQVDRDKIR-----------------EKLKGLEEIART 350 (396) Q Consensus 290 ~~l~~rgL~flDs~Ts~~Sv--a~~~A~~~gvp~~~~dvfLD~~~~~~~I~-----------------~qL~~l~~~Ar~ 350 (396) +.|.+.|++|+-.-..+... -.++.+...+|.+.-.-+.+...-...|. ....+...+|.. T Consensus 150 ~~l~~~~i~w~EeP~~~~d~~~~~~L~~~~~ipIa~gEs~~~~~~~~~~i~~~~~d~v~~d~~~~GGit~~~kia~~A~~ 229 (265) T cd03315 150 RALEDLGLDYVEQPLPADDLEGRAALARATDTPIMADESAFTPHDAFRELALGAADAVNIKTAKTGGLTKAQRVLAVAEA 229 (265) T ss_pred HHHCCCCCEEEECCCCCCCHHHHHHHHHCCCCCEEECCCCCCHHHHHHHHHHHCHHHEEECCCCCCHHHHHHHHHHHHHH T ss_conf 86400585488478998999999999851999999089879999999999872613414087764749999999999998 Q ss_pred CCEEEEEEECCHHHH Q ss_conf 194899980887889 Q gi|254780558|r 351 TGQAIGVAVAFDESI 365 (396) Q Consensus 351 ~G~AI~igh~~p~Ti 365 (396) +|.-+.++|...-.| T Consensus 230 ~gi~~~~~~~~~s~i 244 (265) T cd03315 230 LGLPVMVGSMIESGL 244 (265) T ss_pred CCCCEEECCCCCCHH T ss_conf 699589789653599 No 159 >cd06316 PBP1_ABC_sugar_binding_like_7 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=22.94 E-value=52 Score=13.45 Aligned_cols=23 Identities=9% Similarity=0.034 Sum_probs=12.6 Q ss_pred CHHHHHHHHHHHHHHCCCCEEEE Q ss_conf 98999999999998558935980 Q gi|254780558|r 249 TVQQLLNRLRYSLRRGTGYFGVM 271 (396) Q Consensus 249 ~~~~~~~~l~~~l~~~p~~vGvn 271 (396) ..++-....+..|.+-|...||- T Consensus 169 ~~~~a~~~~~~~L~~~Pdi~~I~ 191 (294) T cd06316 169 GPSKAEDIANAMLTQNPDLKGIY 191 (294) T ss_pred CHHHHHHHHHHHHHHCCCCCEEE T ss_conf 88999999999997399988899 No 160 >PRK08455 fliL flagellar basal body-associated protein FliL; Reviewed Probab=22.91 E-value=52 Score=13.45 Aligned_cols=12 Identities=25% Similarity=0.647 Sum_probs=5.0 Q ss_pred HHHHHHHHHHCC Q ss_conf 999989887506 Q gi|254780558|r 36 FIVGLSIYILIS 47 (396) Q Consensus 36 ~~~~~~~~~~~~ 47 (396) +++|+.+|.++. T Consensus 32 ~~gG~~~~~l~~ 43 (180) T PRK08455 32 LIGGVIAFLLMG 43 (180) T ss_pred HHHHHHHHHHHC T ss_conf 999999987617 No 161 >COG0011 Uncharacterized conserved protein [Function unknown] Probab=22.80 E-value=52 Score=13.44 Aligned_cols=79 Identities=16% Similarity=0.209 Sum_probs=58.6 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEEC Q ss_conf 89999999999985589359800123000049899999999985069289974976322678989870994698458862 Q gi|254780558|r 250 VQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLD 329 (396) Q Consensus 250 ~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD 329 (396) -.+...+....|... |+.+.+|.-+|.=+.-|.-+|+.+++ +.+..-+.|.+...-.+-+| T Consensus 19 vs~yVa~~i~~lk~~----glky~~~pm~T~iEg~~del~~~ik~---------------~~Ea~~~~g~~Rv~t~ikId 79 (100) T COG0011 19 VSKYVAEAIEILKES----GLKYQLGPMGTVIEGELDELMEAVKE---------------AHEAVFEKGAPRVSTVIKID 79 (100) T ss_pred HHHHHHHHHHHHHHC----CCCEEECCCCEEEEECHHHHHHHHHH---------------HHHHHHHCCCCEEEEEEEEE T ss_conf 899999999999974----99526067621898629999999999---------------99999862996489999866 Q ss_pred CCCCH-HHHHHHHHHHHHH Q ss_conf 89999-9999999999999 Q gi|254780558|r 330 DQVDR-DKIREKLKGLEEI 347 (396) Q Consensus 330 ~~~~~-~~I~~qL~~l~~~ 347 (396) +..|. ..|..++...+.. T Consensus 80 ~R~d~~~t~e~Kv~~v~ek 98 (100) T COG0011 80 ERRDKELTMEEKVKSVEEK 98 (100) T ss_pred EECCCCCCHHHHHHHHHHH T ss_conf 6058531299999999975 No 162 >TIGR01754 flav_RNR ribonucleotide reductase-associated flavodoxin, putative; InterPro: IPR010088 This entry represents a family of proteins found immediately downstream of ribonucleotide reductase genes in Xyella fastidiosa and some Gram-positive bacteria. It appears to be a highly divergent flavodoxin of the short chain type, more like the flavodoxins of the sulphate-reducing genus Desulfovibrio than like the NifF flavodoxins associated with nitrogen fixation.. Probab=22.78 E-value=52 Score=13.43 Aligned_cols=97 Identities=12% Similarity=0.118 Sum_probs=61.1 Q ss_pred EEEEECCCCCHHHHHHHHH----HCCCCE--E-------EECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHH-- Q ss_conf 8987089876899999999----859929--9-------981766568987787855446789989999999999985-- Q gi|254780558|r 199 TLAFASNGNSLDRWMKEAK----KKGQEA--I-------LQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRR-- 263 (396) Q Consensus 199 T~A~~P~~~~~~~~~~~Ar----~~G~Ev--l-------lhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~-- 263 (396) =+|+.-|.-+.++.|...+ +.|||| + +..|+.|.+|-=--.|--|.-.|.+|.|+++-+.....- T Consensus 4 llAy~slSGNT~eVA~~I~~~l~~~G~eVDW~~~r~~~La~~pldPe~ydL~~LGtwT~~~GrTP~e~KdFiaEl~~liG 83 (145) T TIGR01754 4 LLAYLSLSGNTKEVADIIRDILEKDGHEVDWVEFRISTLADAPLDPEDYDLYLLGTWTVERGRTPDEMKDFIAELAVLIG 83 (145) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCCCEEEEEEECCHHHHCCCCCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHC T ss_conf 65354148777899999999998479776766400046424678998631577431223578996668999999999836 Q ss_pred CCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEE Q ss_conf 589359800123000049899999999985069289 Q gi|254780558|r 264 GTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLF 299 (396) Q Consensus 264 ~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~f 299 (396) =|.-+-|- |..=|+- ..|+-.|+...+--.|| T Consensus 84 KP~nvaiF---GTGeTQw-GG~d~yCgAVdr~a~ff 115 (145) T TIGR01754 84 KPKNVAIF---GTGETQW-GGDDYYCGAVDRLAKFF 115 (145) T ss_pred CCCCEEEE---CCCCCCC-CCCCCCHHHHHHHHHHH T ss_conf 99824886---5887552-88654024788878650 No 163 >cd06451 AGAT_like Alanine-glyoxylate aminotransferase (AGAT) family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to alanine-glyoxylate aminotransferase (AGAT), serine-glyoxylate aminotransferase (SGAT), and 3-hydroxykynurenine transaminase (HKT). AGAT is a homodimeric protein, which catalyses the transamination of glyoxylate to glycine, and SGAT converts serine and glyoxylate to hydroxypyruvate and glycine. HKT catalyzes the PLP-dependent transamination of 3-hydroxykynurenine, a potentially toxic metabolite of the kynurenine pathway. Probab=22.60 E-value=52 Score=13.41 Aligned_cols=106 Identities=20% Similarity=0.200 Sum_probs=54.0 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEE-ECCCCCCCHHHHHHHHHCCCEEEEEEEECC Q ss_conf 9999999999855893598001230000498999999999850692899-749763226789898709946984588628 Q gi|254780558|r 252 QLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFF-DDGSSPRNLTRVLAPKLNLPYMVADLYLDD 330 (396) Q Consensus 252 ~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~fl-Ds~Ts~~Sva~~~A~~~gvp~~~~dvfLD~ 330 (396) ...-.|+.+|+.+-. .|+- .++.......+.+.+.|++.|+=.+ +..-. -|.. --+.++. T Consensus 240 ~~i~al~~AL~~i~~-~G~~----~~~~r~~~l~~~l~~~l~~lGl~~~~~~~~~-------------S~~v-t~~~~p~ 300 (356) T cd06451 240 NLLYALREALDLILE-EGLE----NRWARHRRLAKALREGLEALGLKLLAKPELR-------------SPTV-TAVLVPE 300 (356) T ss_pred HHHHHHHHHHHHHHH-CCHH----HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHC-------------CCEE-EEEECCC T ss_conf 688999999999986-0658----9999999999999999997599656893654-------------9827-9997999 Q ss_pred CCCHHHHHHHHHHH--HHHH----HHCCEEEEEEECCHHHHHHHHHHHHHHH Q ss_conf 99999999999999--9999----8719489998088788999999975688 Q gi|254780558|r 331 QVDRDKIREKLKGL--EEIA----RTTGQAIGVAVAFDESIEVISQWLQQEH 376 (396) Q Consensus 331 ~~~~~~I~~qL~~l--~~~A----r~~G~AI~igh~~p~Ti~~L~~w~~~l~ 376 (396) ..+...+.+.|.+- ..++ +-.|..+=|||....|.+-+...+.-++ T Consensus 301 g~~~~~~~~~l~~~~~i~i~~g~g~~~~~~fRIg~mG~~~~~di~~~l~al~ 352 (356) T cd06451 301 GVDGDEVVRRLMKRYNIEIAGGLGPTAGKVFRIGHMGEATREDVLGVLSALE 352 (356) T ss_pred CCCHHHHHHHHHHHCCEEEECCCCCCCCCEEEECCCCCCCHHHHHHHHHHHH T ss_conf 9999999999999789999788665689589861898999999999999999 No 164 >COG0646 MetH Methionine synthase I (cobalamin-dependent), methyltransferase domain [Amino acid transport and metabolism] Probab=22.55 E-value=52 Score=13.40 Aligned_cols=165 Identities=15% Similarity=0.094 Sum_probs=98.9 Q ss_pred EEECCCCCCCHHHHHHHHHCCCCCEEEEECCC-CCHHHHHHHHHHCC------CCEEEECCCCCCCCCCCCCCCCCCCCC Q ss_conf 99737787716889999837935089870898-76899999999859------929998176656898778785544678 Q gi|254780558|r 175 IVVSGLGISQTGTQRAINLLPANITLAFASNG-NSLDRWMKEAKKKG------QEAILQIPMQAFDESYNEDDSYTLKVT 247 (396) Q Consensus 175 IVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~-~~~~~~~~~Ar~~G------~EvllhlPMEp~~~~~~~pGp~~L~~~ 247 (396) +-.|++=.....+.+++.+-...+-|-=.=|. -.++.....+++.. ..||+|.-|+. .+.++.+ T Consensus 136 v~fd~l~~ay~eq~~~Li~gG~D~iLiET~~D~l~~KaA~~a~~~~~~~~~~~LPv~~s~Ti~~---------sG~tl~G 206 (311) T COG0646 136 VTFDELVEAYREQVEGLIDGGADLILIETIFDTLNAKAAVFAAREVFEELGVRLPVMISGTITD---------SGRTLSG 206 (311) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCEEEEEHHCCHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEEC---------CCEECCC T ss_conf 3599999999999999983787589975221689899999999999873277654799999803---------7612379 Q ss_pred CCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEE Q ss_conf 99899999999999855893598001230000498999999999850692899749763226789898709946984588 Q gi|254780558|r 248 QTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLY 327 (396) Q Consensus 248 ~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvf 327 (396) .+.++....|+..- +..+|+|-++|. ..|++.++++...--.||- .- -..|+|-.....+ T Consensus 207 q~~~a~~~~l~~~~---~~~vGlNCa~Gp------~~m~~~l~~ls~~~~~~vs--~~---------PNAGLP~~~g~~~ 266 (311) T COG0646 207 QTIEAFLNSLEHLG---PDAVGLNCALGP------DEMRPHLRELSRIADAFVS--VY---------PNAGLPNAFGERA 266 (311) T ss_pred CCHHHHHHHHHCCC---CCEEEECCCCCH------HHHHHHHHHHHHCCCCEEE--EE---------CCCCCCCCCCCCC T ss_conf 86899999866359---747853456688------9999999998741692499--80---------7999975468845 Q ss_pred ECCCCCHHHHHHHHHHHHHHHHHCCEEEEEEE---CCHHHHHHHHHHHH Q ss_conf 62899999999999999999987194899980---88788999999975 Q gi|254780558|r 328 LDDQVDRDKIREKLKGLEEIARTTGQAIGVAV---AFDESIEVISQWLQ 373 (396) Q Consensus 328 LD~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh---~~p~Ti~~L~~w~~ 373 (396) .= +.++++...++.. .++ .|.+=.||- -.|+-|.++.+-++ T Consensus 267 ~Y-~~~p~~~a~~~~~---f~~-~g~vnIvGGCCGTTPeHIraia~~v~ 310 (311) T COG0646 267 VY-DLTPEYMAEALAE---FAE-EGGVNIVGGCCGTTPEHIRAIAEAVK 310 (311) T ss_pred CC-CCCHHHHHHHHHH---HHH-HCCCEEECCCCCCCHHHHHHHHHHHC T ss_conf 26-8998999999999---998-37841604436999899999999842 No 165 >pfam03269 DUF268 Caenorhabditis protein of unknown function, DUF268. Probab=22.31 E-value=48 Score=13.68 Aligned_cols=73 Identities=10% Similarity=-0.056 Sum_probs=51.9 Q ss_pred CCCCCCCHHHHHHHHHHHHHHCCCCE--EEEECCCCC-------CCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHH Q ss_conf 44678998999999999998558935--980012300-------004989999999998506928997497632267898 Q gi|254780558|r 243 TLKVTQTVQQLLNRLRYSLRRGTGYF--GVMNYRGAM-------LLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVL 313 (396) Q Consensus 243 ~L~~~~~~~~~~~~l~~~l~~~p~~v--GvnNhmGs~-------~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~ 313 (396) .-++++.+.+..+.....-..+-.++ ....|-|=. --.|.++|+.+-..|++.|++|+---...+++.... T Consensus 43 ~rlssi~p~ef~~~~~~~~~~FD~a~SfSSiEH~GLGRYGDpidp~Gdlk~m~~i~~~Lk~~G~l~l~vPiG~d~v~fN~ 122 (177) T pfam03269 43 DRLSSILPTDFAKNFKKYAESFDFAASFSSIEHSGLGRYGDPIDPIGDLREMLKIKCVLKKGGLLFLGLPLGTDAIIFNA 122 (177) T ss_pred HCCCCCCHHHHHHHHHHHCCCEEEEEEEEEEEECCCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCEEEEC T ss_conf 10011568899999875235323899961244405655689779851099999999861789679999537776168703 Q ss_pred HH Q ss_conf 98 Q gi|254780558|r 314 AP 315 (396) Q Consensus 314 A~ 315 (396) -+ T Consensus 123 HR 124 (177) T pfam03269 123 HR 124 (177) T ss_pred CE T ss_conf 40 No 166 >COG0084 TatD Mg-dependent DNase [DNA replication, recombination, and repair] Probab=22.22 E-value=53 Score=13.36 Aligned_cols=71 Identities=11% Similarity=0.075 Sum_probs=44.5 Q ss_pred ECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHH--HHHHHHHHHCCCCEEEEECCCCCCCC Q ss_conf 0898768999999998599299981766568987787855446789989999--99999998558935980012300004 Q gi|254780558|r 203 ASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLL--NRLRYSLRRGTGYFGVMNYRGAMLLS 280 (396) Q Consensus 203 ~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~--~~l~~~l~~~p~~vGvnNhmGs~~t~ 280 (396) -.+..+..+.+++|++.|-..++.. +.+.+++. ..|......+-..+|+--+. .+.. T Consensus 13 ~~~~~d~~~vi~~a~~~gv~~~~~~-------------------g~~~~~~~~~~~la~~y~~v~~~~G~HP~~--~~~~ 71 (256) T COG0084 13 EEFDEDRDEVIARAREAGVKKMVVV-------------------GTDLEDFKRALELAEKYPNVYAAVGVHPLD--ADEH 71 (256) T ss_pred HHHCCCHHHHHHHHHHCCCCEEEEE-------------------ECCHHHHHHHHHHHHHCCCEEEEEEECCCC--CCCC T ss_conf 3326899999999998599379996-------------------158888899999998689868998668873--3336 Q ss_pred CHHHHHHHHHHHHC Q ss_conf 98999999999850 Q gi|254780558|r 281 NKESAEVIFKEFAK 294 (396) Q Consensus 281 ~~~~m~~vl~~l~~ 294 (396) .+....++.+.+.. T Consensus 72 ~~~~~~~l~~~~~~ 85 (256) T COG0084 72 SEEDLEELEQLAEH 85 (256) T ss_pred CHHHHHHHHHHHHC T ss_conf 78899999998721 No 167 >smart00052 EAL Putative diguanylate phosphodiesterase. Putative diguanylate phosphodiesterase, present in a variety of bacteria. Probab=22.20 E-value=53 Score=13.36 Aligned_cols=144 Identities=17% Similarity=0.209 Sum_probs=82.2 Q ss_pred HHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCC-----CEEEEECCCCCCCCCHHHHHH Q ss_conf 999998599299981766568987787855446789989999999999985589-----359800123000049899999 Q gi|254780558|r 213 MKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTG-----YFGVMNYRGAMLLSNKESAEV 287 (396) Q Consensus 213 ~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~-----~vGvnNhmGs~~t~~~~~m~~ 287 (396) ....+..+.++.+.+.+-+. .|. ..+....|...+.+.+- .+-++. ..+..+...+.. T Consensus 75 ~~~~~~~~~~~~l~iNls~~----------~l~----~~~f~~~l~~~l~~~~~~~~~lv~Ei~E---~~~~~~~~~~~~ 137 (241) T smart00052 75 LAEWQAQGPPLRISINLSAR----------QLI----SPDLVPRVLELLEETGLPPQRLELEITE---SVLLDDDESAVA 137 (241) T ss_pred HHHHHHHCCCEEEEEEECHH----------HHC----CCHHHHHHHHHHHHHCCCHHHEEEEECH---HHHHCCHHHHHH T ss_conf 99989738982699993689----------945----9359999999999809698892667235---776408999999 Q ss_pred HHHHHHCCC-CEEEECCCCCCCHHHHHHHHHCCCEEEEEE-EECCCCCHHHHHHHHHHHHHHHHHCCEEEE-EEECCHHH Q ss_conf 999985069-289974976322678989870994698458-862899999999999999999987194899-98088788 Q gi|254780558|r 288 IFKEFAKRG-LLFFDDGSSPRNLTRVLAPKLNLPYMVADL-YLDDQVDRDKIREKLKGLEEIARTTGQAIG-VAVAFDES 364 (396) Q Consensus 288 vl~~l~~rg-L~flDs~Ts~~Sva~~~A~~~gvp~~~~dv-fLD~~~~~~~I~~qL~~l~~~Ar~~G~AI~-igh~~p~T 364 (396) .++.|+++| .+-+|+--+..+ ....-....+-+.+-|. |+.+......-...+..+...|++.|..+. -|--..+. T Consensus 138 ~i~~l~~~G~~iaiDdfG~~~~-~~~~l~~l~~d~iKld~~li~~~~~~~~~~~~~~~l~~~a~~~g~~viaegVE~~~~ 216 (241) T smart00052 138 TLQRLRELGVRIALDDFGTGYS-SLSYLKRLPVDLLKIDKSFVRDLQTDPEDEAIVQSIIELAQKLGLQVVAEGVETPEQ 216 (241) T ss_pred HHHHHHHCCCCEEECCCCCCCC-HHHHHHHCCCHHHCCCHHHHHHCCCCHHHHHHHHHHHHHHHHCCCEEEEEECCCHHH T ss_conf 9999997099787537899810-189998657204224799996132684558999999999998599899971881999 Q ss_pred HHHHHHHHHH Q ss_conf 9999999756 Q gi|254780558|r 365 IEVISQWLQQ 374 (396) Q Consensus 365 i~~L~~w~~~ 374 (396) ++.+.++.-+ T Consensus 217 ~~~l~~~Gi~ 226 (241) T smart00052 217 LDLLRSLGCD 226 (241) T ss_pred HHHHHHCCCC T ss_conf 9999974999 No 168 >COG0134 TrpC Indole-3-glycerol phosphate synthase [Amino acid transport and metabolism] Probab=22.10 E-value=54 Score=13.35 Aligned_cols=78 Identities=21% Similarity=0.272 Sum_probs=60.2 Q ss_pred CHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHH Q ss_conf 68999999998599299981766568987787855446789989999999999985589359800123000049899999 Q gi|254780558|r 208 SLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEV 287 (396) Q Consensus 208 ~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~ 287 (396) .+.++.+.|+..|.|+|+.+ . + ..-|..++..-+-.+||||-==-.|..|....+. T Consensus 144 ~l~el~~~A~~LGm~~LVEV--------h------------~----~eEl~rAl~~ga~iIGINnRdL~tf~vdl~~t~~ 199 (254) T COG0134 144 QLEELVDRAHELGMEVLVEV--------H------------N----EEELERALKLGAKIIGINNRDLTTLEVDLETTEK 199 (254) T ss_pred HHHHHHHHHHHCCCCEEEEE--------C------------C----HHHHHHHHHCCCCEEEEECCCCCHHEECHHHHHH T ss_conf 99999999997699238997--------8------------9----9999999967998899837884021006889999 Q ss_pred HHHHHHCCCCEEEECCCCCCCH Q ss_conf 9999850692899749763226 Q gi|254780558|r 288 IFKEFAKRGLLFFDDGSSPRNL 309 (396) Q Consensus 288 vl~~l~~rgL~flDs~Ts~~Sv 309 (396) +...+.+.-.+..-||-+...- T Consensus 200 la~~~p~~~~~IsESGI~~~~d 221 (254) T COG0134 200 LAPLIPKDVILISESGISTPED 221 (254) T ss_pred HHHHCCCCCEEEECCCCCCHHH T ss_conf 8844877758996179899999 No 169 >pfam07423 DUF1510 Protein of unknown function (DUF1510). This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown. Probab=22.02 E-value=54 Score=13.34 Aligned_cols=22 Identities=5% Similarity=0.093 Sum_probs=9.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999999999998988750658 Q gi|254780558|r 28 GLFLLFCTFIVGLSIYILISHA 49 (396) Q Consensus 28 ~~~~l~~~~~~~~~~~~~~~~~ 49 (396) +++++++++++.+++.++|+.+ T Consensus 18 iaI~iV~llIiiva~~lf~~~~ 39 (214) T pfam07423 18 IAIGIVVVLIIIVAYQLFFPSS 39 (214) T ss_pred HHHHHHHHHHHHHHHHEEECCC T ss_conf 9999999999997664023588 No 170 >pfam00682 HMGL-like HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase. Probab=21.81 E-value=54 Score=13.31 Aligned_cols=47 Identities=9% Similarity=0.183 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHCCEEEEEEECC--HHHHHHHHHHHHHHHHCCCEEE Q ss_conf 9999999999998719489998088--7889999999756887691899 Q gi|254780558|r 337 IREKLKGLEEIARTTGQAIGVAVAF--DESIEVISQWLQQEHVRDVSVV 383 (396) Q Consensus 337 I~~qL~~l~~~Ar~~G~AI~igh~~--p~Ti~~L~~w~~~l~~~gi~lV 383 (396) +...+.+..+.|++.|.-+.++-++ ..+.+.+.++...+.+.|+..+ T Consensus 106 ~l~~~~~~i~~a~~~g~~v~f~~~~~~~~~~~~~~~~~~~~~~~G~~~i 154 (237) T pfam00682 106 VADRAVAAVEAARSAGIDVELGCEDAGRTDLAFLIEVVEVAQEAGATRI 154 (237) T ss_pred HHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEE T ss_conf 9999999999999869905884051232478899999999986198579 No 171 >pfam07300 consensus Probab=21.73 E-value=54 Score=13.30 Aligned_cols=50 Identities=20% Similarity=0.314 Sum_probs=38.9 Q ss_pred EEE-CCC--CCHHHHHHHHHHHHH-HHHHCCEEEEEEECCHHHHHHHHHHHHHH Q ss_conf 886-289--999999999999999-99871948999808878899999997568 Q gi|254780558|r 326 LYL-DDQ--VDRDKIREKLKGLEE-IARTTGQAIGVAVAFDESIEVISQWLQQE 375 (396) Q Consensus 326 vfL-D~~--~~~~~I~~qL~~l~~-~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l 375 (396) ||| -|. +|..+|..+|+++.+ ....+-.-|+|-|.-++-++.+.+-..++ T Consensus 57 VfLIQntqnRD~~Aiq~KLDELi~a~~~A~n~~i~iE~l~~~el~~~~~~~~~~ 110 (111) T pfam07300 57 VFLIQNTQNRDTAALQAKLDELIRATAEARNALIGIEHLDEEELEELRAKYEEL 110 (111) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH T ss_conf 999971011207899988999999999999986272139999999999999973 No 172 >COG1954 GlpP Glycerol-3-phosphate responsive antiterminator (mRNA-binding) [Transcription] Probab=21.69 E-value=55 Score=13.30 Aligned_cols=93 Identities=14% Similarity=0.151 Sum_probs=69.5 Q ss_pred HHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 99998379350898708987689999999985992999817665689877878554467899899999999999855893 Q gi|254780558|r 188 QRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGY 267 (396) Q Consensus 188 ~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~ 267 (396) .+-+.+..+.+-|-+.-+--+..+.+....+.|.++++|+ .|..|+ T Consensus 16 le~~les~~~~vflL~~~i~~ik~ivk~lK~~gK~vfiHv---------------DLv~Gl------------------- 61 (181) T COG1954 16 LEKALESESQYVFLLTGHILNIKEIVKKLKNRGKTVFIHV---------------DLVEGL------------------- 61 (181) T ss_pred HHHHHCCCCEEEEEEECHHHHHHHHHHHHHHCCCEEEEEE---------------HHHCCC------------------- T ss_conf 9998657871999982406309999999994796799985---------------784466------------------- Q ss_pred EEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCC Q ss_conf 5980012300004989999999998506928997497632267898987099469845886289 Q gi|254780558|r 268 FGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQ 331 (396) Q Consensus 268 vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~ 331 (396) ..|+-+++-+.++.+..|.. |.+|.+...|++.|+.+..|--.||.. T Consensus 62 -----------~~~e~~i~fi~~~~~pdGII------STk~~~i~~Akk~~~~aIqR~FilDS~ 108 (181) T COG1954 62 -----------SNDEVAIEFIKEVIKPDGII------STKSNVIKKAKKLGILAIQRLFILDSI 108 (181) T ss_pred -----------CCCHHHHHHHHHHCCCCEEE------ECCHHHHHHHHHCCCCEEEEEEEECHH T ss_conf -----------78538999999850677168------734889999997497326422001078 No 173 >PRK05265 pyridoxine 5'-phosphate synthase; Provisional Probab=21.65 E-value=55 Score=13.29 Aligned_cols=98 Identities=11% Similarity=0.198 Sum_probs=64.7 Q ss_pred CCCCCCCHHHHHHHHHHHHCCCC---EEEECCCCCCCHHHHHHHHHCCCEEEEEE--EE--CCCCCHHHHHHHHHHHHHH Q ss_conf 30000498999999999850692---89974976322678989870994698458--86--2899999999999999999 Q gi|254780558|r 275 GAMLLSNKESAEVIFKEFAKRGL---LFFDDGSSPRNLTRVLAPKLNLPYMVADL--YL--DDQVDRDKIREKLKGLEEI 347 (396) Q Consensus 275 Gs~~t~~~~~m~~vl~~l~~rgL---~flDs~Ts~~Sva~~~A~~~gvp~~~~dv--fL--D~~~~~~~I~~qL~~l~~~ 347 (396) |=-+..+...++.+.+.|++.|. +|+|. .-..-..|++.|....+-+- |- -+..+......++.++.+. T Consensus 105 Gld~~~~~~~L~~~i~~lk~~gIrvSLFiDP----d~~~i~~a~~~Gad~VElhTG~Ya~a~~~~~~~~el~~i~~aa~~ 180 (240) T PRK05265 105 GLDVAGQFDKLKPAIARLKDAGIRVSLFIDP----DPEQIEAAAEVGADRIELHTGPYADAKTEAEQAAELERIAEAAAL 180 (240) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCEEEEEECC----CHHHHHHHHHHCCCEEEEECHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 9377657899999999998659817997279----878999999849399998347878635752199999999999999 Q ss_pred HHHCCEEEEEEECCH-HHHHHHHHHHHHHHH Q ss_conf 987194899980887-889999999756887 Q gi|254780558|r 348 ARTTGQAIGVAVAFD-ESIEVISQWLQQEHV 377 (396) Q Consensus 348 Ar~~G~AI~igh~~p-~Ti~~L~~w~~~l~~ 377 (396) |.+.|--|--||-.. +++..|.+ ++.+++ T Consensus 181 A~~lGL~VnAGHgLn~~Nl~~i~~-ip~i~E 210 (240) T PRK05265 181 AASLGLGVNAGHGLNYHNVKPIAA-IPGIEE 210 (240) T ss_pred HHHCCCEEECCCCCCHHHHHHHHC-CCCCEE T ss_conf 998698785378988777899844-899748 No 174 >PRK10906 DNA-binding transcriptional repressor GlpR; Provisional Probab=21.53 E-value=52 Score=13.42 Aligned_cols=150 Identities=13% Similarity=0.131 Sum_probs=71.3 Q ss_pred EEEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 58999737787716889999837935089870898768999999998599299981766568987787855446789989 Q gi|254780558|r 172 RIAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQ 251 (396) Q Consensus 172 riAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~ 251 (396) .-.|.||+ +- .+..-...|+..--+-|.- ++..-..+.....+.+|++ + +. ...+.-+ -.+|.... T Consensus 92 g~~I~lD~---GT-T~~~la~~L~~~~~ltVvT--nsl~ia~~L~~~~~~~vil-~-----GG-~~~~~~~-~~~G~~a~ 157 (252) T PRK10906 92 GSTLFIDI---GT-TPEAVAHALLNHSNLRIVT--NNLNVANTLMAKEDFRIIL-A-----GG-ELRSRDG-GIIGEATL 157 (252) T ss_pred CCEEEECC---CH-HHHHHHHHCCCCCCEEEEE--CCHHHHHHHHHCCCCEEEE-E-----CC-EEECCCC-CEECHHHH T ss_conf 99999878---63-7999985333679869998--9899999998489964998-5-----88-8857887-43068899 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCCCC--HHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEEC Q ss_conf 999999999985589359800123000049--899999999985069289974976322678989870994698458862 Q gi|254780558|r 252 QLLNRLRYSLRRGTGYFGVMNYRGAMLLSN--KESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLD 329 (396) Q Consensus 252 ~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~--~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD 329 (396) +..+.++..+ .|=++-|++-. |+-.+.+ +..++..|-.-.++-.+..||..-.+.-...++. .-.-|.+|= T Consensus 158 ~~l~~~~~d~-afig~~gi~~~-~g~t~~~~~ea~~kr~mi~~a~~~illaDssKfg~~~~~~~~~-----l~~id~lIT 230 (252) T PRK10906 158 DFISQFRLDF-GILGISGIDSD-GSLLEFDYHEVRTKRAIIENSRHVMLVVDHSKFGRNAMVNMGS-----ISMVDAVYT 230 (252) T ss_pred HHHHHCCEEE-EEEECCEECCC-CCCCCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCEEEEC-----HHHCCEEEE T ss_conf 9997066029-99616512689-8733599999999999999619689998733038814169847-----798999998 Q ss_pred CCCCHHHHHHHHH Q ss_conf 8999999999999 Q gi|254780558|r 330 DQVDRDKIREKLK 342 (396) Q Consensus 330 ~~~~~~~I~~qL~ 342 (396) +..-...+.++|. T Consensus 231 D~~~~~~~~~~l~ 243 (252) T PRK10906 231 DQLPPASVMQVIT 243 (252) T ss_pred CCCCCHHHHHHHH T ss_conf 9499999999999 No 175 >TIGR02009 PGMB-YQAB-SF beta-phosphoglucomutase family hydrolase; InterPro: IPR010976 Phosphoglucomutases interconvert D-glucose 1-phosphate and D-glucose 6-phosphate, a reaction which is important for energy metabolism in many organisms and for cell wall biosynthesis in bacteria , . Beta-phosphoglucomutases are monomeric enzymes which interconvert the beta anomers of these compounds using Mg2+ as a cofactor. This entry groups together three clades: the characterised beta-phosphoglucomutases (bPGMs) (including those from Escherichia coli, Bacillus subtilus and Lactococcus lactis, a clade of putative bPGMs from mycobacteria and a clade including the uncharacterised Escherichia coli and Haemophilus influenzae yqaB genes which may prove to be beta-mutases of a related 1-phosphosugar. All of these are members of the larger Haloacid dehalogenase (HAD) subfamily IA and include the 'variant 3' Glu-Asp version of the third conserved HAD domain. The Lactococcus enzyme has been crystallised and studied in detail . It is composed of two distinct regions, an alpha/beta core domain similar to that found in other HAD family members, and a helical cap domain. The core domain contains a central six-stranded parallel beta sheet surrounded by six alpha helices, while the cap domain consists of an antiparallel four alpha-helix bundle. Overall, the monomer forms a "kidney-bean" shape similar to that observed in other HAD family members such as phosphoserine phosphatase. The active site of the enzyme is located at the interface of the two domains.. Probab=21.47 E-value=55 Score=13.27 Aligned_cols=64 Identities=13% Similarity=0.126 Sum_probs=45.1 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHCCEEEEEEECC--HHHHHHHHHHHHHHHHCCCEEEEHHHHHHC----CCCC Q ss_conf 289999999999999999998719489998088--788999999975688769189967774306----4788 Q gi|254780558|r 329 DDQVDRDKIREKLKGLEEIARTTGQAIGVAVAF--DESIEVISQWLQQEHVRDVSVVPLSCLAKL----SSPS 395 (396) Q Consensus 329 D~~~~~~~I~~qL~~l~~~Ar~~G~AI~igh~~--p~Ti~~L~~w~~~l~~~gi~lVpvS~l~~~----~~~~ 395 (396) +.+.+...|.--+..+.+..++++..||+|+-. .++-..|.+. +|...==.+|-.+. +|. ++|+ T Consensus 101 ~qE~~~~~~lp~~~~~l~~l~~~~i~~a~gS~SGP~Na~~~L~~~--~L~~~F~~~Vda~~-vk~~~~AgKP~ 170 (211) T TIGR02009 101 RQELTGAEVLPGIENLLKELKKKGIAVALGSSSGPKNADRILAKL--GLTDYFDAVVDASE-VKEEGLAGKPH 170 (211) T ss_pred HHCCCCCHHCCCHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHH--HHHHHHCEEECHHH-HHHHCCCCCCC T ss_conf 622672100365589999998669857884054502479999986--12658014327689-97614799887 No 176 >PRK08082 consensus Probab=21.46 E-value=55 Score=13.27 Aligned_cols=122 Identities=15% Similarity=0.198 Sum_probs=63.4 Q ss_pred HHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCC Q ss_conf 99859929998176656898778785544678998999999999998558935980012300004989999999998506 Q gi|254780558|r 216 AKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKR 295 (396) Q Consensus 216 Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~r 295 (396) |-+.|+-|++ ....|+.+++..|+-...++++...- .| |..-..+-..+...+..+.+. T Consensus 228 a~~~~~~V~~------------------fSlEM~~~~l~~R~la~~s~i~~~~i-~~--g~l~~~e~~~i~~a~~~l~~~ 286 (453) T PRK08082 228 ATKTDENVAI------------------FSLEMGADQLVMRMLCAEGNIDAQRL-RT--GSLTSDDWGKLTMAMGSLSNA 286 (453) T ss_pred HHHCCCCEEE------------------EECCCCHHHHHHHHHHCCCCCCHHHH-HC--CCCCHHHHHHHHHHHHHHCCC T ss_conf 9855994899------------------73138989999999971558886677-51--899999999999999985069 Q ss_pred CCEEEECCCC-----CCCHHHHHHHHHCCCEEEEEEE--ECCC----CC-HHHHHHHHHHHHHHHHHCCEEEEEEE Q ss_conf 9289974976-----3226789898709946984588--6289----99-99999999999999987194899980 Q gi|254780558|r 296 GLLFFDDGSS-----PRNLTRVLAPKLNLPYMVADLY--LDDQ----VD-RDKIREKLKGLEEIARTTGQAIGVAV 359 (396) Q Consensus 296 gL~flDs~Ts-----~~Sva~~~A~~~gvp~~~~dvf--LD~~----~~-~~~I~~qL~~l~~~Ar~~G~AI~igh 359 (396) .+++ |+..+ -++.+..+.++.|+.....|-. +... .+ ..+|..--..|-.+||+-+..|...+ T Consensus 287 ~l~i-dd~~~~~i~~i~~~~r~~~~~~~~~livIDYlqLi~~~~~~~~~r~~ev~~isr~LK~lAkel~ipvi~ls 361 (453) T PRK08082 287 GIYI-DDTPGIRVNEIRAKCRRLKQEQGLGMILIDYLQLIQGSGRSGENRQQEVSEISRTLKALARELEVPVIALS 361 (453) T ss_pred CEEE-ECCCCCCHHHHHHHHHHHHHHCCCCEEEEECHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEC T ss_conf 7389-78999989999999999998669988999507733778988878999999999999999999699799964 No 177 >PRK02551 hypothetical protein; Provisional Probab=21.36 E-value=55 Score=13.25 Aligned_cols=49 Identities=12% Similarity=0.122 Sum_probs=36.8 Q ss_pred EEEECCCCCC-----CHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHHH Q ss_conf 8997497632-----267898987099469845886289999999999999999 Q gi|254780558|r 298 LFFDDGSSPR-----NLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLEE 346 (396) Q Consensus 298 ~flDs~Ts~~-----Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~ 346 (396) +.+-|+-+-- --+..+|++.++|++.+.=+.-...|.+.|.+.|.+..+ T Consensus 98 GVigsGN~NFg~~f~~ag~~ia~k~~vP~Ly~FEL~GT~~DV~~v~~~l~e~~~ 151 (154) T PRK02551 98 GIIGSGNRNFNNQYCLTAKQYAKRFGFPMLADFELRGTPSDIERIAAIIKEALS 151 (154) T ss_pred EEEEECCCCHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHHHHHH T ss_conf 798406783789999999999997299769986405998999999999999997 No 178 >cd07110 ALDH_F10_BADH Arabidopsis betaine aldehyde dehydrogenase 1 and 2, ALDH family 10A8 and 10A9-like. Present in this CD are the Arabidopsis betaine aldehyde dehydrogenase (BADH) 1 (chloroplast) and 2 (mitochondria), also known as, aldehyde dehydrogenase family 10 member A8 and aldehyde dehydrogenase family 10 member A9, respectively, and are putative dehydration- and salt-inducible BADHs (EC 1.2.1.8) that catalyze the oxidation of betaine aldehyde to the compatible solute glycine betaine. Probab=21.31 E-value=56 Score=13.25 Aligned_cols=35 Identities=6% Similarity=0.040 Sum_probs=15.1 Q ss_pred HHHHHHHHHCCEEEEEEECCHHHHHHHHHHHHHHHH Q ss_conf 999999987194899980887889999999756887 Q gi|254780558|r 342 KGLEEIARTTGQAIGVAVAFDESIEVISQWLQQEHV 377 (396) Q Consensus 342 ~~l~~~Ar~~G~AI~igh~~p~Ti~~L~~w~~~l~~ 377 (396) +++.++|....+.+.- ..+-...+...++..+++. T Consensus 377 deai~~aN~~~yGL~a-svft~d~~~~~~~~~~l~~ 411 (456) T cd07110 377 DEAIALANDSEYGLAA-AVISRDAERCDRVAEALEA 411 (456) T ss_pred HHHHHHHHCCCCCCEE-EEECCCHHHHHHHHHHCCE T ss_conf 9999998489998869-9983899999999987877 No 179 >cd01394 radB RadB. The archaeal protein radB shares similarity radA, the archaeal functional homologue to the bacterial RecA. The precise function of radB is unclear. Probab=21.28 E-value=56 Score=13.24 Aligned_cols=53 Identities=21% Similarity=0.242 Sum_probs=33.6 Q ss_pred CCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCCCC----HHHHHHHHHHHHHHHHHCCEEE-EEEECCHH Q ss_conf 0692899749763226789898709946984588628999----9999999999999998719489-99808878 Q gi|254780558|r 294 KRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQVD----RDKIREKLKGLEEIARTTGQAI-GVAVAFDE 363 (396) Q Consensus 294 ~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~----~~~I~~qL~~l~~~Ar~~G~AI-~igh~~p~ 363 (396) +-.|..+||-++..+.- +++... ...+.+++..|.++|++++.+| .++|.+.+ T Consensus 103 ~~~lvViDSi~tl~~~e-----------------~~~~~~~~~~~r~l~~~~~~L~~~Ak~~~~~vil~nqVt~~ 160 (218) T cd01394 103 KVDLVVVDSATALYRLE-----------------LGDDDTTIKNYRELAKQLTFLLWLARKHDVAVVITNQVYSD 160 (218) T ss_pred CCCEEEEECCHHHHHHH-----------------CCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEEE T ss_conf 77299991404554554-----------------06896479999999999999999987669889999215884 No 180 >PRK05636 replicative DNA helicase; Provisional Probab=21.28 E-value=56 Score=13.24 Aligned_cols=105 Identities=16% Similarity=0.137 Sum_probs=55.2 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCC-HHHHHHHHHHHHCCCCEEEECCCC-----CCCHHHHHHHHHCC Q ss_conf 789989999999999985589359800123000049-899999999985069289974976-----32267898987099 Q gi|254780558|r 246 VTQTVQQLLNRLRYSLRRGTGYFGVMNYRGAMLLSN-KESAEVIFKEFAKRGLLFFDDGSS-----PRNLTRVLAPKLNL 319 (396) Q Consensus 246 ~~~~~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t~~-~~~m~~vl~~l~~rgL~flDs~Ts-----~~Sva~~~A~~~gv 319 (396) ..|+.+++..|+....++++...-- .++++.+ -..+...++.+.+..| |+|+..+ -++.+.++.++.++ T Consensus 304 LEMs~~ql~~Rlla~~s~V~~~~ir----~g~l~~~~~~~l~~a~~~l~~~pl-~IdD~~~lti~~Ira~aRrlk~~~~l 378 (507) T PRK05636 304 LEMSKSEIVMRLLSAEAEVRLADMR----GGKMDEDAWEKLVQRLGKIAQAPI-FIDDSANLTMMEIRSKARRLKQKHDL 378 (507) T ss_pred CCCCHHHHHHHHHHHHCCCCHHHHH----CCCCCHHHHHHHHHHHHHHHHCCE-EEECCCCCCHHHHHHHHHHHHHHCCC T ss_conf 5699899999999984798878885----588788999999999999861988-99849997699999999999861799 Q ss_pred CEEEEEEEECCC----------CCHHHHHHHHHHHHHHHHHCCEEEEEEE Q ss_conf 469845886289----------9999999999999999987194899980 Q gi|254780558|r 320 PYMVADLYLDDQ----------VDRDKIREKLKGLEEIARTTGQAIGVAV 359 (396) Q Consensus 320 p~~~~dvfLD~~----------~~~~~I~~qL~~l~~~Ar~~G~AI~igh 359 (396) -. +|+|.- ....+|-.--..|-.+||+-+..|...+ T Consensus 379 ~l----ivVDYLQLm~~~~~~~~R~~ev~~ISr~LK~lAkel~vpVi~Ls 424 (507) T PRK05636 379 KM----IVVDYLQLMSSGKRVESRQQEVSEFSRQLKLLAKELDVPLIAIS 424 (507) T ss_pred CE----EEEEHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCEEEEC T ss_conf 98----99845884568888766899999999999999999799889971 No 181 >TIGR03345 VI_ClpV1 type VI secretion ATPase, ClpV1 family. Members of this protein family are homologs of ClpB, an ATPase associated with chaperone-related functions. These ClpB homologs, designated ClpV1, are a key component of the bacterial pathogenicity-associated type VI secretion system. Probab=21.24 E-value=56 Score=13.24 Aligned_cols=20 Identities=15% Similarity=0.305 Sum_probs=12.1 Q ss_pred EEEEECCCCCCCHHHHHHHH Q ss_conf 89997377877168899998 Q gi|254780558|r 173 IAIVVSGLGISQTGTQRAIN 192 (396) Q Consensus 173 iAIVIddlG~~~~~~~~ai~ 192 (396) =.|+|++=|++.+.-.+.+. T Consensus 210 NPiLVGepGVGKTAIvEGLA 229 (852) T TIGR03345 210 NPILTGEAGVGKTAVVEGLA 229 (852) T ss_pred CCEEECCCCCCHHHHHHHHH T ss_conf 97465799987999999999 No 182 >PRK10681 DNA-binding transcriptional repressor DeoR; Provisional Probab=21.05 E-value=56 Score=13.21 Aligned_cols=149 Identities=15% Similarity=0.098 Sum_probs=74.1 Q ss_pred EEEEECCCCCCCHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCCCCCCHHH Q ss_conf 89997377877168899998379350898708987689999999985992999817665689877878554467899899 Q gi|254780558|r 173 IAIVVSGLGISQTGTQRAINLLPANITLAFASNGNSLDRWMKEAKKKGQEAILQIPMQAFDESYNEDDSYTLKVTQTVQQ 252 (396) Q Consensus 173 iAIVIddlG~~~~~~~~ai~~LP~~vT~A~~P~~~~~~~~~~~Ar~~G~EvllhlPMEp~~~~~~~pGp~~L~~~~~~~~ 252 (396) -.|.||+ + +.+...+..|+....+-+.- ++..-........+.+|++ + +. ...+. .....+...++ T Consensus 94 ~~IflD~---G-TT~~~la~~L~~~~~ltviT--nsl~~~~~l~~~~~~~vil-~-----GG-~~~~~-~~~~~g~~~~~ 159 (252) T PRK10681 94 QTVFFDC---G-TTTPWIIEAIDNELPFTAVC--YSLNTFLALQEKPHCRAIL-C-----GG-EFHAS-NAIFKPLDFQS 159 (252) T ss_pred CEEEECC---C-HHHHHHHHHHHCCCCEEEEE--CCHHHHHHHHHCCCCEEEE-E-----CC-EEECC-CCEEECHHHHH T ss_conf 9999908---6-89999999831138859996--4399999996389987998-0-----87-78528-84686689999 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCC-CCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCCCEEEEEEEECCC Q ss_conf 999999999855893598001230-0004989999999998506928997497632267898987099469845886289 Q gi|254780558|r 253 LLNRLRYSLRRGTGYFGVMNYRGA-MLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNLPYMVADLYLDDQ 331 (396) Q Consensus 253 ~~~~l~~~l~~~p~~vGvnNhmGs-~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gvp~~~~dvfLD~~ 331 (396) ..+.++..+ .|=++-|+...-|- -+..++..+...|-...++-....|+..-.+. +.-.-.+.-.-|.+|=+. T Consensus 160 ~l~~~~~d~-aFig~~gi~~~~G~t~~~~~ea~~k~~~i~~a~~~illaD~sKf~~~-----~~~~~~~l~~id~iITD~ 233 (252) T PRK10681 160 TLNNICPDI-AFYSAAGVHVSKGATCFNLEELPVKHWAMAMAQKHVLVVDHSKFGKV-----RPARMGDLKRFDTVVSDR 233 (252) T ss_pred HHHHHCCCE-EEEECCEECCCCCCCCCCHHHHHHHHHHHHHCCCEEEEECHHHCCCC-----CEEEEECHHHCCEEEECC T ss_conf 998514888-99807548378996559999999999999957969999873242881-----306985779899999895 Q ss_pred CCHHHHHHHH Q ss_conf 9999999999 Q gi|254780558|r 332 VDRDKIREKL 341 (396) Q Consensus 332 ~~~~~I~~qL 341 (396) .-.+.+.+.| T Consensus 234 ~~~~~~~~~l 243 (252) T PRK10681 234 CPDDEFVAYA 243 (252) T ss_pred CCCHHHHHHH T ss_conf 9999999999 No 183 >TIGR00089 TIGR00089 RNA modification enzyme, MiaB family; InterPro: IPR005839 This entry represents a family defined on the basis of sequence similarity. Most of these proteins are not yet characterised, but those that are include CDK5 regulatory subunit-associated protein 1, which specifically inhibits CDK5 activation by CDK5R1 . MiaB, a tRNA modification enzyme . The size of proteins in this entry ranges from 47 to 61 kDa and they contain six conserved cysteines, three of which are clustered. . Probab=20.77 E-value=57 Score=13.18 Aligned_cols=159 Identities=13% Similarity=0.152 Sum_probs=97.8 Q ss_pred EEEEECCC-CCCCH-------HHHHHHHH-CCCC--CEEEEECCCCCHHHHHHHHHHC--C-CCEEEECCCCCCCCCCCC Q ss_conf 89997377-87716-------88999983-7935--0898708987689999999985--9-929998176656898778 Q gi|254780558|r 173 IAIVVSGL-GISQT-------GTQRAINL-LPAN--ITLAFASNGNSLDRWMKEAKKK--G-QEAILQIPMQAFDESYNE 238 (396) Q Consensus 173 iAIVIddl-G~~~~-------~~~~ai~~-LP~~--vT~A~~P~~~~~~~~~~~Ar~~--G-~EvllhlPMEp~~~~~~~ 238 (396) ++.-++.- |.+.. .-.+.|.+ +++- |=|.-+=-..-..+..+.-.+. . --=.||||.|.-+ | T Consensus 205 ~Gqnv~~YgG~D~~~~~~~La~LL~~l~~ki~G~~RIR~~~~hP~~~~d~li~~~~~~~~~~v~~~lHlPvQSGS----d 280 (455) T TIGR00089 205 LGQNVNAYGGKDLEGKTNSLADLLRELSKKIDGIERIRFTSSHPDDVTDDLIELIAENKDPKVCKHLHLPVQSGS----D 280 (455) T ss_pred EEEEECCCCCCCCCCCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHCCCCCCCCEECCCEECCC----H T ss_conf 988525624778888976479999998400597026886046703268789999985078853520221266188----6 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHCCCCE----EEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHH Q ss_conf 785544678998999999999998558935----9800123000049899999999985069289974976322678989 Q gi|254780558|r 239 DDSYTLKVTQTVQQLLNRLRYSLRRGTGYF----GVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLA 314 (396) Q Consensus 239 pGp~~L~~~~~~~~~~~~l~~~l~~~p~~v----GvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A 314 (396) -=-..+.-+-+.++..+.++...+.+|+++ =|.-|-| .+++-.+..++.+++-++-.+-+ ..-. T Consensus 281 ~iLK~M~R~Y~~e~~~~~~~k~r~~~P~~~i~TDiIVGFPG----ETeEdF~~Tl~l~~ev~F~~~~~--------F~YS 348 (455) T TIGR00089 281 RILKRMNRKYTREEYLEIVEKLRAKVPDAAITTDIIVGFPG----ETEEDFEETLDLVEEVKFDKLHS--------FIYS 348 (455) T ss_pred HHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEECCEEECCCC----CCHHHHHHHHHHHHHCCCCEEEE--------ECCC T ss_conf 99970378988899999999999847881775026882899----98899999999985238443431--------2057 Q ss_pred HHHCCCEEEEEEEECCCCCHHHHHHHHHHHHHHHHHC Q ss_conf 8709946984588628999999999999999999871 Q gi|254780558|r 315 PKLNLPYMVADLYLDDQVDRDKIREKLKGLEEIARTT 351 (396) Q Consensus 315 ~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~ 351 (396) .+-|.|++. +++.-..+...++++.|.++.++. T Consensus 349 pR~gTpAa~----~~d~vP~~vk~~R~~~L~~l~~~i 381 (455) T TIGR00089 349 PRPGTPAAD----MKDQVPEEVKKERLERLIALQKEI 381 (455) T ss_pred CCCCCCCCC----CCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 888874635----678889889999999999999999 No 184 >TIGR00830 PTBA PTS system, glucose subfamily, IIA component; InterPro: IPR001127 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . ; GO: 0005351 sugar:hydrogen ion symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane. Probab=20.72 E-value=42 Score=14.04 Aligned_cols=18 Identities=17% Similarity=0.357 Sum_probs=11.3 Q ss_pred HHHHCCCCEEEECCCCCC Q ss_conf 999859929998176656 Q gi|254780558|r 215 EAKKKGQEAILQIPMQAF 232 (396) Q Consensus 215 ~Ar~~G~EvllhlPMEp~ 232 (396) .--+.|.|||+|+=.... T Consensus 60 i~s~~G~EiLiH~GIDTV 77 (129) T TIGR00830 60 IESDSGVEILIHIGIDTV 77 (129) T ss_pred EEECCCEEEEEECCCCEE T ss_conf 982898399998242358 No 185 >COG5016 Pyruvate/oxaloacetate carboxyltransferase [Energy production and conversion] Probab=20.49 E-value=58 Score=13.14 Aligned_cols=106 Identities=18% Similarity=0.123 Sum_probs=44.6 Q ss_pred CCCCCCCC----CHHHHHHHHHHHHHHCCCCEEEEEC-----CCCCCCCCHHHHHHHHH--------HHHCCCCEEEECC Q ss_conf 55446789----9899999999999855893598001-----23000049899999999--------9850692899749 Q gi|254780558|r 241 SYTLKVTQ----TVQQLLNRLRYSLRRGTGYFGVMNY-----RGAMLLSNKESAEVIFK--------EFAKRGLLFFDDG 303 (396) Q Consensus 241 p~~L~~~~----~~~~~~~~l~~~l~~~p~~vGvnNh-----mGs~~t~~~~~m~~vl~--------~l~~rgL~flDs~ 303 (396) |+..++.+ .++-..+++...|..+|...-=..| --|.+....+.|..++. +.+.. .-=..+ T Consensus 300 PGGMlSNl~sQLkeqnaldK~~eVLeEvprVredlGypPLVTPtSQiVGtQAvlNVl~GerYK~It~E~~~y--v~G~YG 377 (472) T COG5016 300 PGGMLSNLESQLKEQNALDKLEEVLEEVPRVREDLGYPPLVTPTSQIVGTQAVLNVLTGERYKVITKETKDY--VKGLYG 377 (472) T ss_pred CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHH--HCCCCC T ss_conf 717889999999873456679999998488886069998658544555499999987253154877888887--534569 Q ss_pred CCCCCHHHHHHHH-HCC---CEEEEEEEECCCCCHHHHHHHHHHHHHHHHHCC Q ss_conf 7632267898987-099---469845886289999999999999999998719 Q gi|254780558|r 304 SSPRNLTRVLAPK-LNL---PYMVADLYLDDQVDRDKIREKLKGLEEIARTTG 352 (396) Q Consensus 304 Ts~~Sva~~~A~~-~gv---p~~~~dvfLD~~~~~~~I~~qL~~l~~~Ar~~G 352 (396) .+|-.+-..+-++ .|= |..-|--.+ ....+++.=+++++.++..+ T Consensus 378 rtPapi~~el~~~ilg~~~~~i~~RpADl----l~pe~~k~k~e~~~~~~~~~ 426 (472) T COG5016 378 RTPAPINAELIEKILGDEEKPITCRPADL----LEPELDKLKKELEELAIEEE 426 (472) T ss_pred CCCCCCCHHHHHHHHCCCCCCCCCCHHHH----CCHHHHHHHHHHHHHHHHCC T ss_conf 99999898999997389888655784440----46579999999999863145 No 186 >COG1307 DegV Uncharacterized protein conserved in bacteria [Function unknown] Probab=20.49 E-value=58 Score=13.14 Aligned_cols=112 Identities=18% Similarity=0.253 Sum_probs=54.2 Q ss_pred CCCCCCHHHHHHHHHHHHHHCCCCEEEEEC----CCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCCCCHHHHHHHHHCC Q ss_conf 467899899999999999855893598001----2300004989999999998506928997497632267898987099 Q gi|254780558|r 244 LKVTQTVQQLLNRLRYSLRRGTGYFGVMNY----RGAMLLSNKESAEVIFKEFAKRGLLFFDDGSSPRNLTRVLAPKLNL 319 (396) Q Consensus 244 L~~~~~~~~~~~~l~~~l~~~p~~vGvnNh----mGs~~t~~~~~m~~vl~~l~~rgL~flDs~Ts~~Sva~~~A~~~gv 319 (396) +..+.+.+++.+.+.....+.--|+.|.|- -|+|...-. ..+-+-|+=+=++.+|+|. T Consensus 135 ~~~G~s~~ei~~~l~~~~~~t~~~~~v~~L~~L~kgGRIs~~~---a~lg~lL~ikPIl~~~~G~--------------- 196 (282) T COG1307 135 AKAGKSFEEILKKLEEIREKTKAYFVVDDLDNLVKGGRISKAA---AFLGNLLKIKPILSFEDGE--------------- 196 (282) T ss_pred HHCCCCHHHHHHHHHHHHHCCEEEEEECCHHHHHHCCCCCHHH---HHHHHHHCCEEEEEEECCE--------------- T ss_conf 9859999999999999864617999938647787679851139---9997622656999993870--------------- Q ss_pred CEEEEEEEECCCCCHHHHHHHHHHH-HHHHH-HCCEEEEEEECC-HHHHHHHHHHHHHHHHCCCE Q ss_conf 4698458862899999999999999-99998-719489998088-78899999997568876918 Q gi|254780558|r 320 PYMVADLYLDDQVDRDKIREKLKGL-EEIAR-TTGQAIGVAVAF-DESIEVISQWLQQEHVRDVS 381 (396) Q Consensus 320 p~~~~dvfLD~~~~~~~I~~qL~~l-~~~Ar-~~G~AI~igh~~-p~Ti~~L~~w~~~l~~~gi~ 381 (396) -..++-.++...-.++|.+. .+... ..+..+.+.|.- ++-.+.+.+ .+...++. T Consensus 197 -----~~~~~K~R~~kka~~~l~~~~~~~~~~~~~~~~~~~~~~~~e~~~~l~~---~l~~~~~~ 253 (282) T COG1307 197 -----LVLLGKVRGQKKAIKKLIELLKKEVKDGAGYRVAVLHGDAPEAAEQLKE---KLLNKFIE 253 (282) T ss_pred -----EEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHH---HHHHHCCC T ss_conf -----8885433658899999999999862456663899980776678999999---99862677 No 187 >KOG1389 consensus Probab=20.36 E-value=58 Score=13.12 Aligned_cols=171 Identities=18% Similarity=0.156 Sum_probs=78.8 Q ss_pred CCCHHHHHHHHHHCCCCEEEECCCCCC-CCCCCCCCCCCCCCCCC-----HHHHHHHHHHHHHHCCCCEEEEECCCCCCC Q ss_conf 876899999999859929998176656-89877878554467899-----899999999999855893598001230000 Q gi|254780558|r 206 GNSLDRWMKEAKKKGQEAILQIPMQAF-DESYNEDDSYTLKVTQT-----VQQLLNRLRYSLRRGTGYFGVMNYRGAMLL 279 (396) Q Consensus 206 ~~~~~~~~~~Ar~~G~EvllhlPMEp~-~~~~~~pGp~~L~~~~~-----~~~~~~~l~~~l~~~p~~vGvnNhmGs~~t 279 (396) +-++++.+..|.+.||=--=-+||... ..+..+.-|-+..++.. .-+-...|.-++.+= |....--|.-. T Consensus 204 Av~S~kkAa~A~~~G~f~dEIiPv~tkv~D~~g~~k~i~v~~DeGiRp~ttl~~l~klkpvFkkd----GttTagNsSQv 279 (435) T KOG1389 204 AVDSHKKAAAATAKGKFKDEIIPVKTKVVDPKGDEKPITVSVDEGIRPNTTLESLAKLKPVFKKD----GTTTAGNSSQV 279 (435) T ss_pred HHHHHHHHHHHHHCCCCHHHEEEEEEECCCCCCCCCCEEEECCCCCCCCCCHHHHHHCCHHHHCC----CCCCCCCHHHC T ss_conf 88878999987644863233211001215886775662673478878776566776343142217----82035763112 Q ss_pred CCHHHHHHHHH--HHHCCCCE----EEECCC----------CCCCHHHHHHHHHCCCEEEEEEEECCCCCH---HHHHHH Q ss_conf 49899999999--98506928----997497----------632267898987099469845886289999---999999 Q gi|254780558|r 280 SNKESAEVIFK--EFAKRGLL----FFDDGS----------SPRNLTRVLAPKLNLPYMVADLYLDDQVDR---DKIREK 340 (396) Q Consensus 280 ~~~~~m~~vl~--~l~~rgL~----flDs~T----------s~~Sva~~~A~~~gvp~~~~dvfLD~~~~~---~~I~~q 340 (396) +|-..---+|+ ...+.||= |.|--+ .|---.++.-+..|+-.-..|+|==|+-.. -+-.++ T Consensus 280 SDGa~aVLLmkRs~A~qlgLPvlgvfr~f~~vGv~P~iMGiGPavAIp~alk~aGL~v~did~FEINEAFAsQ~~yc~~K 359 (435) T KOG1389 280 SDGAGAVLLMKRSVAMQLGLPVLGVFRDFAAVGVPPAIMGIGPAVAIPKALKAAGLEVDDIDLFEINEAFASQALYCRNK 359 (435) T ss_pred CCCHHHHHHHHHHHHHHCCCCEEEEEEEEEEECCCHHHCCCCCHHHHHHHHHHCCCCCCCCCEEEHHHHHHHHHHHHHHH T ss_conf 55535644436656765499546776532440688898387724342889987498522354132338888889999987 Q ss_pred HHHHHHHHHHCCEEEEEEECCHHHHH-HHHHHHHHHHHCCC Q ss_conf 99999999871948999808878899-99999756887691 Q gi|254780558|r 341 LKGLEEIARTTGQAIGVAVAFDESIE-VISQWLQQEHVRDV 380 (396) Q Consensus 341 L~~l~~~Ar~~G~AI~igh~~p~Ti~-~L~~w~~~l~~~gi 380 (396) |.--.+.-..+|-||+||||.--|=. .....+.+++.+|. T Consensus 360 L~ld~ekVN~~GGaiAlGHPlGatGAR~VaTlL~emKrrgk 400 (435) T KOG1389 360 LGLDPEKVNVNGGAIALGHPLGATGARCVATLLHEMKRRGK 400 (435) T ss_pred HCCCHHHCCCCCCEEECCCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 38988884888862330687577417599999999986332 No 188 >pfam01553 Acyltransferase Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene. Probab=20.21 E-value=59 Score=13.10 Aligned_cols=81 Identities=15% Similarity=0.205 Sum_probs=41.9 Q ss_pred CCEEEEECCCCCCCCCHHHHHHHHHHHHCCCCEEEECC-CCCCCHHHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHH Q ss_conf 93598001230000498999999999850692899749-76322678989870994698458862899999999999999 Q gi|254780558|r 266 GYFGVMNYRGAMLLSNKESAEVIFKEFAKRGLLFFDDG-SSPRNLTRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGL 344 (396) Q Consensus 266 ~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL~flDs~-Ts~~Sva~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l 344 (396) .++-+.||. |-+ |.-.|-.++..... ...|+... -........+.+. ...+++|..... .-.+.+.+. T Consensus 15 ~~iiv~NH~-s~~--D~~~l~~~~~~~~~-~~~~~~~~~l~~~p~~~~~~~~------~g~i~V~R~~~~-~~~~~~~~~ 83 (132) T pfam01553 15 PAIVVANHQ-SWL--DPLVLSLLLYKRPR-RPVFVAKDELFTVPLLGWLMRL------LGHIFIDRKKAK-DARASLREL 83 (132) T ss_pred CEEEEECCC-CCH--HHHHHHHHHHHCCC-CEEEEEEHHHHCCCCCCHHEEE------CCCEEEECCCHH-HHHHHHHHH T ss_conf 999997479-944--59999999861499-8699986565138766700121------211778759828-789999999 Q ss_pred HHHHHHCCEEEEE Q ss_conf 9999871948999 Q gi|254780558|r 345 EEIARTTGQAIGV 357 (396) Q Consensus 345 ~~~Ar~~G~AI~i 357 (396) .+...++|..|.| T Consensus 84 ~~~l~~~g~~i~i 96 (132) T pfam01553 84 VELLLRAGKLVVI 96 (132) T ss_pred HHHHHHCCCEEEE T ss_conf 9999849988999 No 189 >PRK09061 D-glutamate deacylase; Validated Probab=20.16 E-value=59 Score=13.10 Aligned_cols=36 Identities=14% Similarity=0.088 Sum_probs=12.9 Q ss_pred HHCCCCEEEEECCCCCCCCCHHHHHHHHHHHHCCCC Q ss_conf 855893598001230000498999999999850692 Q gi|254780558|r 262 RRGTGYFGVMNYRGAMLLSNKESAEVIFKEFAKRGL 297 (396) Q Consensus 262 ~~~p~~vGvnNhmGs~~t~~~~~m~~vl~~l~~rgL 297 (396) ++..++--..-|+.+.-..+...+-.+++..+++|+ T Consensus 226 a~~~g~~vhi~H~~~~~~~~~~~~~~~i~~A~~~G~ 261 (496) T PRK09061 226 AAATGAHMHICHINSTSLQDIPLCAELVQTAQAQGL 261 (496) T ss_pred HHHHCCCEEEEEECCCCCCCHHHHHHHHHHHHHCCC T ss_conf 887399479996436664567899999999997799 No 190 >PRK03600 nrdI ribonucleotide reductase stimulatory protein; Reviewed Probab=20.12 E-value=59 Score=13.09 Aligned_cols=36 Identities=14% Similarity=0.211 Sum_probs=31.8 Q ss_pred HHHHHHHHCCCEEEEEEEECCCCCHHHHHHHHHHHH Q ss_conf 789898709946984588628999999999999999 Q gi|254780558|r 310 TRVLAPKLNLPYMVADLYLDDQVDRDKIREKLKGLE 345 (396) Q Consensus 310 a~~~A~~~gvp~~~~dvfLD~~~~~~~I~~qL~~l~ 345 (396) +..+|++.+||.+.+.=..-...|.+.|...|.+.- T Consensus 94 a~~Ia~k~~VP~L~~FEL~GT~~DV~~v~~~l~~~~ 129 (136) T PRK03600 94 GDVISAKCQVPLLYRFELSGTNEDVENVRKGVEEFW 129 (136) T ss_pred HHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHHHHH T ss_conf 999999809978997604699899999999999999 No 191 >PRK06918 4-aminobutyrate aminotransferase; Reviewed Probab=20.11 E-value=59 Score=13.09 Aligned_cols=23 Identities=13% Similarity=0.179 Sum_probs=9.7 Q ss_pred CCCCCCCHHHHHHH---HHHHHCCCC Q ss_conf 30000498999999---999850692 Q gi|254780558|r 275 GAMLLSNKESAEVI---FKEFAKRGL 297 (396) Q Consensus 275 Gs~~t~~~~~m~~v---l~~l~~rgL 297 (396) |+-|..++-+|... ++++.+.+| T Consensus 316 ~~Tf~gnpla~aaa~a~L~~i~~e~l 341 (454) T PRK06918 316 GGTYAGSPLGCAAALAVLDIIEKENL 341 (454) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCH T ss_conf 56179898999999999999986689 No 192 >TIGR02902 spore_lonB ATP-dependent protease LonB; InterPro: IPR014251 This entry represents LonB, a paralog of the ATP-dependent protease La (LonA, IPR004815 from INTERPRO). LonB proteins are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ) and are found strictly, and almost universally, in endospore-forming bacteria. This protease was shown, in Bacillus subtilis, to be expressed specifically in the forespore during sporulation, under control of sigmaF . The lonB gene, despite being located immediately upstream of lonA, was shown to be monocistronic. LonB appears to be involved in the post-translation control of sigmaH, but lonB mutation did not produce an obvious sporulation defect under the conditions tested . Note that additional paralogs of LonA and LonB occur in the Clostridium lineage and these are excluded from this entry. . Probab=20.00 E-value=11 Score=17.65 Aligned_cols=17 Identities=18% Similarity=0.208 Sum_probs=9.4 Q ss_pred CCHHHHHHHHHHCCCCE Q ss_conf 76899999999859929 Q gi|254780558|r 207 NSLDRWMKEAKKKGQEA 223 (396) Q Consensus 207 ~~~~~~~~~Ar~~G~Ev 223 (396) +++-++...+-..|+|+ T Consensus 285 ~~Al~~I~~Ya~nGREA 301 (532) T TIGR02902 285 KEALDLIAKYASNGREA 301 (532) T ss_pred HHHHHHHHHHHCCCCHH T ss_conf 54799999874054067 Done!