Query gi|254780402|ref|YP_003064815.1| 3-deoxy-D-manno-octulosonic-acid transferase [Candidatus Liberibacter asiaticus str. psy62] Match_columns 440 No_of_seqs 174 out of 2129 Neff 7.1 Searched_HMMs 39220 Date Sun May 29 17:11:23 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780402.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK05749 3-deoxy-D-manno-octul 100.0 0 0 885.6 38.5 418 8-425 1-422 (423) 2 COG1519 KdtA 3-deoxy-D-manno-o 100.0 0 0 858.6 35.8 414 9-422 1-419 (419) 3 pfam04413 Glycos_transf_N 3-De 100.0 0 0 375.5 16.4 179 38-216 1-185 (186) 4 cd03817 GT1_UGDG_like This fam 99.9 1.3E-19 3.3E-24 148.5 24.4 327 67-418 15-370 (374) 5 cd03814 GT1_like_2 This family 99.9 4.3E-19 1.1E-23 145.0 24.3 322 67-421 15-362 (364) 6 PRK10307 predicted glycosyl tr 99.9 6E-19 1.5E-23 144.1 24.3 336 67-423 16-412 (415) 7 cd03796 GT1_PIG-A_like This fa 99.9 4E-19 1E-23 145.2 23.2 332 67-427 15-370 (398) 8 cd03801 GT1_YqgM_like This fam 99.9 2.2E-18 5.7E-23 140.3 25.5 328 67-416 15-367 (374) 9 cd03807 GT1_WbnK_like This fam 99.9 5.2E-18 1.3E-22 137.9 27.1 334 63-420 8-362 (365) 10 cd03820 GT1_amsD_like This fam 99.9 1.1E-18 2.9E-23 142.3 22.9 317 67-418 14-346 (348) 11 cd03811 GT1_WabH_like This fam 99.9 1.2E-18 3.1E-23 142.1 23.0 321 67-407 13-350 (353) 12 cd03795 GT1_like_4 This family 99.9 6.8E-19 1.7E-23 143.7 21.4 316 67-408 15-351 (357) 13 cd03794 GT1_wbuB_like This fam 99.8 3.5E-18 9E-23 139.0 22.1 321 67-408 16-384 (394) 14 cd03808 GT1_cap1E_like This fa 99.8 1.1E-17 2.7E-22 135.9 24.3 321 66-416 10-355 (359) 15 cd04962 GT1_like_5 This family 99.8 3.5E-17 9E-22 132.4 25.8 329 67-421 13-367 (371) 16 cd03798 GT1_wlbH_like This fam 99.8 6.1E-18 1.5E-22 137.5 21.9 331 67-421 15-373 (377) 17 cd05844 GT1_like_7 Glycosyltra 99.8 2.1E-18 5.4E-23 140.5 18.9 280 112-419 66-365 (367) 18 cd03800 GT1_Sucrose_synthase T 99.8 2.6E-17 6.6E-22 133.4 23.9 323 67-409 22-388 (398) 19 cd03799 GT1_amsK_like This is 99.8 1.1E-17 2.7E-22 135.9 20.2 316 67-410 13-348 (355) 20 cd03822 GT1_ecORF704_like This 99.8 2.8E-17 7.1E-22 133.2 21.7 323 67-418 14-361 (366) 21 cd03812 GT1_CapH_like This fam 99.8 4.8E-17 1.2E-21 131.6 21.5 314 67-402 13-343 (358) 22 cd03809 GT1_mtfB_like This fam 99.8 3.2E-17 8.2E-22 132.7 20.4 316 67-406 16-353 (365) 23 PRK13609 diacylglycerol glucos 99.8 3.2E-16 8.2E-21 126.1 24.9 339 56-439 4-384 (388) 24 TIGR03449 mycothiol_MshA UDP-N 99.8 8.1E-17 2.1E-21 130.1 21.6 347 59-426 4-403 (405) 25 cd03819 GT1_WavL_like This fam 99.8 2.2E-16 5.6E-21 127.2 23.1 312 67-408 11-350 (355) 26 TIGR03088 stp2 sugar transfera 99.8 8E-16 2E-20 123.6 25.9 338 64-425 11-373 (374) 27 cd03821 GT1_Bme6_like This fam 99.8 1.8E-16 4.6E-21 127.8 21.8 328 67-416 15-371 (375) 28 cd03805 GT1_ALG2_like This fam 99.8 2E-16 5E-21 127.6 20.7 326 67-415 14-389 (392) 29 PRK13608 diacylglycerol glucos 99.8 9.8E-16 2.5E-20 123.0 23.9 340 56-438 5-385 (391) 30 cd03823 GT1_ExpE7_like This fa 99.8 5E-16 1.3E-20 124.9 22.0 297 67-403 16-343 (359) 31 cd04951 GT1_WbdM_like This fam 99.8 6.3E-16 1.6E-20 124.2 22.0 326 64-420 9-356 (360) 32 TIGR03087 stp1 sugar transfera 99.7 9.8E-15 2.5E-19 116.4 21.1 270 122-422 98-394 (397) 33 PRK00726 murG N-acetylglucosam 99.7 1.5E-13 3.9E-18 108.6 26.9 319 58-422 4-357 (359) 34 cd03825 GT1_wcfI_like This fam 99.7 5E-14 1.3E-18 111.8 24.4 315 67-417 14-357 (365) 35 cd04955 GT1_like_6 This family 99.7 1.8E-14 4.5E-19 114.7 21.0 319 67-417 16-357 (363) 36 cd03792 GT1_Trehalose_phosphor 99.6 3E-13 7.7E-18 106.7 22.4 322 67-419 13-367 (372) 37 cd04949 GT1_gtfA_like This fam 99.6 1.3E-13 3.4E-18 109.0 18.5 267 119-408 86-364 (372) 38 PRK09922 UDP-D-galactose:(gluc 99.6 4E-12 1E-16 99.3 24.7 319 67-418 16-355 (361) 39 cd03818 GT1_ExpC_like This fam 99.6 3.6E-13 9.1E-18 106.2 19.0 212 180-406 154-383 (396) 40 cd03785 GT1_MurG MurG is an N- 99.6 7.2E-12 1.8E-16 97.6 23.6 307 62-416 6-349 (350) 41 cd03813 GT1_like_3 This family 99.6 1.7E-13 4.4E-18 108.2 14.9 238 147-410 193-463 (475) 42 TIGR03492 conserved hypothetic 99.6 8.3E-12 2.1E-16 97.2 23.3 336 64-422 5-396 (396) 43 COG0707 MurG UDP-N-acetylgluco 99.5 2.8E-11 7.2E-16 93.7 24.2 320 58-422 3-355 (357) 44 PRK00025 lpxB lipid-A-disaccha 99.5 1.2E-11 3E-16 96.2 18.8 332 59-424 4-380 (382) 45 TIGR02149 glgA_Coryne glycogen 99.5 8.7E-13 2.2E-17 103.6 11.8 293 108-421 64-415 (416) 46 cd04946 GT1_AmsK_like This fam 99.5 4.9E-12 1.2E-16 98.7 15.1 216 173-416 174-404 (407) 47 PRK12446 N-acetylglucosaminyl 99.4 1.3E-09 3.2E-14 82.9 22.8 317 57-420 3-351 (352) 48 COG0381 WecB UDP-N-acetylgluco 99.4 8.8E-11 2.2E-15 90.5 15.7 330 65-425 12-375 (383) 49 pfam04007 DUF354 Protein of un 99.3 6.1E-09 1.6E-13 78.4 23.1 312 58-423 2-335 (335) 50 pfam02684 LpxB Lipid-A-disacch 99.3 1.3E-10 3.2E-15 89.5 13.9 301 70-407 12-358 (373) 51 cd03802 GT1_AviGT4_like This f 99.3 1E-08 2.7E-13 76.8 21.2 296 67-416 20-328 (335) 52 TIGR03568 NeuC_NnaA UDP-N-acet 99.2 5.9E-09 1.5E-13 78.5 19.2 324 59-418 3-365 (365) 53 pfam02350 Epimerase_2 UDP-N-ac 99.1 1.6E-08 4.2E-13 75.6 17.6 267 120-419 59-346 (346) 54 cd03786 GT1_UDP-GlcNAc_2-Epime 99.1 4.1E-08 1.1E-12 72.9 18.2 328 59-419 2-363 (363) 55 cd03784 GT1_Gtf_like This fami 99.1 6.4E-07 1.6E-11 65.1 24.0 109 302-420 287-400 (401) 56 pfam00534 Glycos_transf_1 Glyc 99.1 1.3E-09 3.2E-14 82.9 9.8 153 233-402 13-171 (172) 57 TIGR01133 murG undecaprenyldip 99.1 1.4E-08 3.5E-13 76.0 14.8 324 58-417 7-367 (368) 58 cd03804 GT1_wbaZ_like This fam 99.1 1.7E-09 4.3E-14 82.0 9.9 191 172-399 143-336 (351) 59 PRK10125 predicted glycosyl tr 98.9 7.8E-08 2E-12 71.1 14.0 248 125-408 121-390 (405) 60 cd03806 GT1_ALG11_like This fa 98.9 2.7E-08 6.9E-13 74.1 11.7 216 170-408 177-412 (419) 61 PRK01021 lpxB lipid-A-disaccha 98.9 2E-07 5E-12 68.5 14.1 318 70-420 240-604 (607) 62 cd03791 GT1_Glycogen_synthase_ 98.8 5.4E-06 1.4E-10 59.1 20.1 258 126-402 127-458 (476) 63 COG0763 LpxB Lipid A disacchar 98.8 8.4E-07 2.1E-11 64.4 15.6 324 63-422 5-379 (381) 64 TIGR03609 S_layer_CsaB polysac 98.8 1E-06 2.6E-11 63.8 15.2 253 71-362 17-286 (298) 65 PRK10017 putative pyruvyl tran 98.7 8.8E-06 2.3E-10 57.6 19.4 249 130-402 119-405 (426) 66 TIGR02472 sucr_P_syn_N sucrose 98.7 7.8E-07 2E-11 64.6 13.1 316 67-408 27-431 (445) 67 TIGR03590 PseG pseudaminic aci 98.6 6.4E-06 1.6E-10 58.5 16.4 259 65-364 14-280 (280) 68 COG1817 Uncharacterized protei 98.5 1.9E-05 4.9E-10 55.4 15.4 314 58-423 2-341 (346) 69 KOG1111 consensus 98.3 1.3E-05 3.4E-10 56.5 11.4 251 72-350 21-300 (426) 70 PRK00654 glgA glycogen synthas 98.3 4.4E-05 1.1E-09 53.1 13.7 256 126-402 129-455 (476) 71 pfam04101 Glyco_tran_28_C Glyc 98.3 1.8E-05 4.6E-10 55.6 11.3 103 304-413 56-166 (167) 72 cd03816 GT1_ALG1_like This fam 98.2 0.00015 3.8E-09 49.6 14.1 213 171-403 148-398 (415) 73 TIGR02468 sucrsPsyn_pln sucros 98.2 1.2E-05 3.1E-10 56.7 8.0 160 233-405 481-657 (1072) 74 COG0438 RfaG Glycosyltransfera 98.1 0.00074 1.9E-08 45.0 16.1 270 129-424 84-376 (381) 75 KOG0853 consensus 98.0 3.6E-05 9.2E-10 53.6 7.9 226 177-422 204-469 (495) 76 pfam04230 PS_pyruv_trans Polys 98.0 0.00088 2.2E-08 44.5 14.8 234 66-350 2-253 (258) 77 PRK10422 lipopolysaccharide co 98.0 0.0019 5E-08 42.3 16.3 264 53-353 3-291 (352) 78 pfam00201 UDPGT UDP-glucoronos 98.0 0.00025 6.3E-09 48.2 11.4 104 330-434 345-453 (501) 79 PRK10964 ADP-heptose:LPS hepto 97.9 0.0016 4E-08 42.9 15.1 279 58-388 3-321 (322) 80 COG4641 Uncharacterized protei 97.9 0.0021 5.3E-08 42.1 15.1 314 70-419 18-357 (373) 81 cd04950 GT1_like_1 Glycosyltra 97.8 0.0035 8.8E-08 40.6 16.0 229 129-390 102-341 (373) 82 KOG1387 consensus 97.8 0.0025 6.4E-08 41.6 14.4 268 124-416 146-453 (465) 83 cd01635 Glycosyltransferase_GT 97.8 0.0012 3.2E-08 43.5 12.2 200 66-360 13-219 (229) 84 PRK10916 ADP-heptose:LPS hepto 97.8 0.003 7.6E-08 41.0 14.2 261 59-356 4-293 (348) 85 COG2327 WcaK Polysaccharide py 97.7 0.0048 1.2E-07 39.7 20.8 302 64-395 12-357 (385) 86 pfam04464 Glyphos_transf CDP-G 97.5 0.0013 3.4E-08 43.3 9.6 161 233-419 9-185 (186) 87 COG1887 TagB Putative glycosyl 97.5 0.0097 2.5E-07 37.7 13.7 223 175-420 141-386 (388) 88 TIGR01426 MGT glycosyltransfer 97.4 0.0016 4.2E-08 42.7 9.3 110 305-423 314-428 (429) 89 COG1819 Glycosyl transferases, 97.4 0.01 2.7E-07 37.5 12.9 110 304-422 285-399 (406) 90 COG0859 RfaF ADP-heptose:LPS h 97.3 0.016 4E-07 36.3 13.2 259 57-356 3-283 (334) 91 TIGR00236 wecB UDP-N-acetylglu 97.2 0.0054 1.4E-07 39.3 10.0 330 65-420 9-378 (380) 92 PRK09814 hypothetical protein; 96.9 0.038 9.8E-07 33.8 20.6 247 119-409 54-322 (337) 93 cd03788 GT1_TPS Trehalose-6-Ph 96.8 0.044 1.1E-06 33.4 17.0 275 119-408 121-447 (460) 94 KOG2941 consensus 96.7 0.053 1.3E-06 32.9 15.1 205 178-403 164-423 (444) 95 PRK10117 trehalose-6-phosphate 96.6 0.068 1.7E-06 32.1 17.1 273 120-408 114-439 (474) 96 pfam06258 DUF1022 Protein of u 96.5 0.0018 4.7E-08 42.4 2.8 172 180-381 93-285 (308) 97 COG3980 spsG Spore coat polysa 96.0 0.14 3.6E-06 30.0 15.6 290 65-409 14-312 (318) 98 PHA01630 putative group 1 glyc 95.6 0.2 5E-06 29.1 10.3 199 181-416 95-327 (333) 99 cd03789 GT1_LPS_heptosyltransf 95.4 0.23 5.8E-06 28.7 13.5 56 59-116 3-58 (279) 100 KOG1192 consensus 95.3 0.21 5.3E-06 29.0 8.8 94 304-401 336-434 (496) 101 TIGR02400 trehalose_OtsA alpha 95.0 0.3 7.7E-06 27.9 15.3 266 119-399 130-454 (476) 102 pfam06925 MGDG_synth Monogalac 94.5 0.24 6.2E-06 28.5 7.4 80 119-209 80-166 (169) 103 pfam08660 Alg14 Oligosaccharid 94.4 0.42 1.1E-05 27.0 10.2 130 60-198 2-158 (166) 104 TIGR02095 glgA glycogen/starch 94.4 0.36 9.2E-06 27.4 8.2 169 218-402 302-498 (517) 105 pfam00982 Glyco_transf_20 Glyc 94.4 0.43 1.1E-05 26.9 16.6 276 120-407 129-455 (470) 106 COG3914 Spy Predicted O-linked 94.1 0.48 1.2E-05 26.6 16.1 295 68-397 274-585 (620) 107 PRK00742 chemotaxis-specific m 92.8 0.78 2E-05 25.2 11.0 140 235-378 156-309 (345) 108 COG4370 Uncharacterized protei 92.4 0.8 2E-05 25.1 7.2 342 57-421 7-410 (412) 109 PHA01633 putative glycosyl tra 91.5 0.15 3.9E-06 29.8 2.6 149 177-351 88-253 (335) 110 TIGR02470 sucr_synth sucrose s 91.5 0.12 3E-06 30.6 2.0 332 54-406 281-736 (790) 111 cd01425 RPS2 Ribosomal protein 89.6 0.73 1.9E-05 25.4 4.8 30 128-157 127-156 (193) 112 cd01149 HutB Hemin binding pro 87.9 2.1 5.4E-05 22.3 9.1 174 123-311 53-234 (235) 113 COG0595 mRNA degradation ribon 87.6 0.79 2E-05 25.2 3.8 18 411-428 525-542 (555) 114 COG2242 CobL Precorrin-6B meth 87.5 2.2 5.7E-05 22.2 7.2 95 259-369 54-153 (187) 115 pfam05159 Capsule_synth Capsul 87.2 0.96 2.5E-05 24.6 4.1 86 257-361 149-235 (268) 116 pfam00318 Ribosomal_S2 Ribosom 86.5 1.8 4.5E-05 22.8 5.1 12 338-349 153-164 (205) 117 cd06295 PBP1_CelR Ligand bindi 86.3 2.6 6.6E-05 21.7 12.8 180 70-289 27-211 (275) 118 pfam01497 Peripla_BP_2 Peripla 85.8 2.8 7E-05 21.6 8.7 177 117-310 50-234 (236) 119 PRK08287 cobalt-precorrin-6Y C 85.5 2.8 7.3E-05 21.5 6.3 114 233-369 30-148 (186) 120 pfam05686 DUF821 Arabidopsis t 85.4 1.7 4.4E-05 22.9 4.6 78 342-421 235-319 (396) 121 PRK00377 cbiT cobalt-precorrin 85.1 3 7.6E-05 21.4 6.4 94 262-368 64-162 (198) 122 PRK07402 precorrin-6B methylas 81.6 2 5E-05 22.5 3.6 89 233-342 40-128 (196) 123 TIGR02398 gluc_glyc_Psyn gluco 80.4 0.27 6.8E-06 28.2 -1.2 165 220-396 274-464 (495) 124 pfam00852 Glyco_transf_10 Glyc 77.7 5.1 0.00013 19.8 4.7 49 345-397 256-307 (353) 125 PRK08261 fabG 3-ketoacyl-(acyl 77.4 5.5 0.00014 19.6 7.0 50 34-88 9-68 (447) 126 cd06299 PBP1_LacI_like_13 Liga 76.4 5.8 0.00015 19.4 11.5 105 69-209 15-123 (265) 127 COG0380 OtsA Trehalose-6-phosp 76.3 5.9 0.00015 19.4 17.1 275 120-405 138-462 (486) 128 pfam03033 Glyco_transf_28 Glyc 75.8 6.1 0.00016 19.3 7.8 93 63-160 6-117 (136) 129 cd06292 PBP1_LacI_like_10 Liga 75.0 6.4 0.00016 19.2 13.8 109 69-212 15-133 (273) 130 pfam04016 DUF364 Domain of unk 74.8 4.5 0.00011 20.2 3.8 69 315-383 157-228 (229) 131 TIGR02884 spore_pdaA delta-lac 74.2 2.3 5.9E-05 22.1 2.2 99 193-291 107-219 (225) 132 COG0373 HemA Glutamyl-tRNA red 73.8 6.8 0.00017 19.0 7.9 89 231-345 175-264 (414) 133 cd06296 PBP1_CatR_like Ligand- 73.7 6.8 0.00017 19.0 11.2 105 70-209 16-124 (270) 134 pfam02571 CbiJ Precorrin-6x re 73.2 7 0.00018 18.9 9.5 129 58-209 2-136 (246) 135 TIGR00649 MG423 conserved hypo 72.7 7 0.00018 18.9 4.3 182 142-324 191-431 (593) 136 TIGR02469 CbiT precorrin-6Y C5 72.6 4.5 0.00012 20.2 3.4 83 258-346 39-123 (135) 137 PRK03379 vitamin B12-transport 72.5 7.3 0.00019 18.8 4.5 84 59-160 25-109 (265) 138 TIGR02195 heptsyl_trn_II lipop 71.6 4.1 0.0001 20.5 2.9 256 66-359 10-308 (361) 139 cd01536 PBP1_ABC_sugar_binding 70.9 7.9 0.0002 18.6 11.5 44 119-163 47-92 (267) 140 COG1834 N-Dimethylarginine dim 70.5 5.9 0.00015 19.4 3.6 41 268-308 83-126 (267) 141 TIGR01496 DHPS dihydropteroate 69.9 8.3 0.00021 18.4 8.9 146 67-242 58-213 (268) 142 cd06267 PBP1_LacI_sugar_bindin 68.9 8.7 0.00022 18.3 13.6 181 70-289 16-202 (264) 143 PRK05299 rpsB 30S ribosomal pr 68.4 8.9 0.00023 18.2 5.4 13 338-350 173-185 (255) 144 pfam01075 Glyco_transf_9 Glyco 66.3 9.8 0.00025 18.0 9.1 38 310-354 175-215 (249) 145 cd03793 GT1_Glycogen_synthase_ 65.8 10 0.00026 17.9 8.9 94 308-403 458-567 (590) 146 pfam10649 DUF2478 Protein of u 65.7 5.2 0.00013 19.8 2.5 66 300-379 91-156 (159) 147 COG0052 RpsB Ribosomal protein 65.0 10 0.00026 17.8 7.1 34 257-293 152-185 (252) 148 KOG0564 consensus 63.8 7.4 0.00019 18.8 3.0 122 54-186 117-247 (590) 149 KOG2619 consensus 63.4 8.8 0.00022 18.3 3.3 48 345-398 278-328 (372) 150 pfam05693 Glycogen_syn Glycoge 63.3 11 0.00028 17.6 8.8 127 268-403 413-562 (633) 151 cd06279 PBP1_LacI_like_3 Ligan 63.1 11 0.00029 17.6 13.6 98 71-209 22-123 (283) 152 cd06283 PBP1_RegR_EndR_KdgR_li 62.9 11 0.00029 17.6 9.2 105 69-209 15-123 (267) 153 cd06289 PBP1_MalI_like Ligand- 62.4 12 0.00029 17.5 12.9 182 70-289 16-203 (268) 154 cd06294 PBP1_ycjW_transcriptio 60.8 12 0.00031 17.3 12.7 105 71-210 22-130 (270) 155 TIGR02764 spore_ybaN_pdaB poly 60.4 10 0.00026 17.9 3.2 50 245-295 142-196 (198) 156 COG0297 GlgA Glycogen synthase 59.7 13 0.00033 17.2 5.1 213 174-402 200-459 (487) 157 cd01976 Nitrogenase_MoFe_alpha 59.6 13 0.00033 17.2 5.5 130 67-207 82-249 (421) 158 TIGR03172 probable selenium-de 59.6 11 0.00028 17.6 3.2 27 67-93 9-35 (232) 159 cd01574 PBP1_LacI Ligand-bindi 59.3 13 0.00033 17.1 12.6 107 67-209 13-123 (264) 160 KOG0780 consensus 58.8 13 0.00034 17.1 11.8 124 262-401 182-326 (483) 161 cd06272 PBP1_hexuronate_repres 58.7 13 0.00034 17.1 13.1 101 70-211 16-120 (261) 162 TIGR01011 rpsB_bact ribosomal 58.0 9.2 0.00024 18.1 2.6 50 255-307 151-200 (227) 163 cd01078 NAD_bind_H4MPT_DH NADP 57.6 14 0.00035 17.0 3.7 53 227-285 21-74 (194) 164 COG4671 Predicted glycosyl tra 57.1 14 0.00036 16.9 17.8 300 62-390 18-366 (400) 165 cd02809 alpha_hydroxyacid_oxid 55.6 15 0.00038 16.8 12.0 117 67-208 82-200 (299) 166 PRK13287 amiF formamidase; Pro 55.4 15 0.00039 16.7 4.5 62 304-374 116-187 (333) 167 cd06298 PBP1_CcpA_like Ligand- 54.9 15 0.00039 16.7 12.6 107 70-212 16-127 (268) 168 pfam01993 MTD methylene-5,6,7, 54.9 12 0.00031 17.4 2.8 83 61-162 7-97 (276) 169 TIGR00450 thdF tRNA modificati 54.0 16 0.0004 16.6 4.7 184 57-255 78-273 (473) 170 PRK12311 rpsB 30S ribosomal pr 51.6 17 0.00044 16.3 5.1 28 129-156 160-187 (332) 171 pfam01488 Shikimate_DH Shikima 51.5 12 0.00032 17.3 2.4 49 231-284 9-57 (134) 172 COG1448 TyrB Aspartate/tyrosin 51.4 17 0.00044 16.4 3.1 37 118-154 108-144 (396) 173 PRK08057 cobalt-precorrin-6x r 51.4 17 0.00044 16.3 8.9 125 58-209 3-133 (241) 174 TIGR02002 PTS-II-BC-glcB PTS s 51.4 9.6 0.00024 18.0 1.8 22 189-210 474-495 (518) 175 TIGR03531 selenium_SpcS O-phos 51.4 7.3 0.00018 18.8 1.2 73 268-353 216-288 (444) 176 COG1736 DPH2 Diphthamide synth 50.9 18 0.00045 16.3 4.5 37 59-96 27-64 (347) 177 PRK05333 NAD-dependent deacety 50.4 18 0.00046 16.2 5.4 34 317-350 211-246 (285) 178 cd00316 Oxidoreductase_nitroge 49.9 18 0.00047 16.2 5.4 20 67-86 64-83 (399) 179 cd01144 BtuF Cobalamin binding 49.6 19 0.00047 16.1 6.0 38 123-160 52-89 (245) 180 PRK11613 folP dihydropteroate 49.4 19 0.00048 16.1 10.0 20 141-160 123-142 (282) 181 smart00672 CAP10 Putative lipo 49.1 19 0.00048 16.1 6.0 65 342-408 165-233 (256) 182 COG0379 NadA Quinolinate synth 48.4 19 0.00049 16.0 7.8 162 185-382 141-311 (324) 183 cd01539 PBP1_GGBP Periplasmic 47.9 20 0.0005 16.0 3.2 49 120-169 50-100 (303) 184 cd02509 GDP-M1P_Guanylyltransf 47.6 20 0.00051 15.9 6.3 117 152-283 2-125 (274) 185 PRK12555 chemotaxis-specific m 47.4 20 0.00051 15.9 9.3 138 235-379 154-307 (340) 186 cd06270 PBP1_GalS_like Ligand 47.4 20 0.00051 15.9 12.0 106 71-212 17-127 (268) 187 TIGR02858 spore_III_AA stage I 47.4 16 0.00041 16.5 2.5 52 302-371 206-262 (282) 188 PRK05124 cysN sulfate adenylyl 47.2 20 0.00052 15.9 5.1 25 138-162 119-143 (475) 189 pfam00290 Trp_syntA Tryptophan 46.9 20 0.00052 15.9 9.1 157 233-411 85-257 (258) 190 COG0135 TrpF Phosphoribosylant 46.8 21 0.00052 15.9 8.5 150 186-350 7-180 (208) 191 COG0159 TrpA Tryptophan syntha 46.2 21 0.00053 15.8 9.1 154 233-410 93-262 (265) 192 cd01968 Nitrogenase_NifE_I Nit 45.2 22 0.00055 15.7 6.0 40 305-350 338-379 (410) 193 cd06310 PBP1_ABC_sugar_binding 44.7 22 0.00056 15.7 10.0 42 120-162 50-93 (273) 194 KOG3349 consensus 44.4 22 0.00057 15.6 5.5 68 313-386 73-144 (170) 195 cd06301 PBP1_rhizopine_binding 44.4 22 0.00057 15.6 4.2 43 120-163 49-93 (272) 196 TIGR02918 TIGR02918 conserved 44.1 16 0.00041 16.6 2.0 268 110-406 153-494 (511) 197 COG4213 XylF ABC-type xylose t 44.0 23 0.00058 15.6 3.3 74 85-158 27-113 (341) 198 cd01829 SGNH_hydrolase_peri2 S 43.8 23 0.00058 15.6 8.6 72 59-142 2-75 (200) 199 KOG0353 consensus 43.8 6.9 0.00018 19.0 0.1 50 378-428 503-563 (695) 200 pfam05889 SLA_LP_auto_ag Solub 43.0 23 0.0006 15.5 4.6 19 269-287 167-185 (389) 201 PRK04168 hypothetical protein; 43.0 23 0.0006 15.5 4.3 49 52-102 29-77 (336) 202 cd06312 PBP1_ABC_sugar_binding 42.9 23 0.0006 15.5 13.0 43 120-163 50-94 (271) 203 cd06285 PBP1_LacI_like_7 Ligan 42.7 24 0.0006 15.5 12.2 105 70-212 16-124 (265) 204 TIGR00315 cdhB CO dehydrogenas 42.6 24 0.00061 15.4 3.7 69 128-204 8-77 (170) 205 pfam11019 DUF2608 Protein of u 42.6 24 0.00061 15.4 5.4 30 131-161 77-106 (251) 206 TIGR01179 galE UDP-glucose 4-e 42.5 24 0.00061 15.4 4.4 86 74-159 14-126 (341) 207 TIGR00073 hypB hydrogenase acc 41.5 17 0.00043 16.4 1.8 119 189-319 72-216 (225) 208 PRK06163 hypothetical protein; 41.2 25 0.00063 15.3 6.4 31 356-389 130-162 (202) 209 TIGR01989 COQ6 Ubiquinone bios 41.1 8.1 0.00021 18.5 0.1 17 185-201 56-72 (481) 210 cd01391 Periplasmic_Binding_Pr 40.2 26 0.00066 15.2 8.5 149 119-288 50-206 (269) 211 PRK05632 phosphate acetyltrans 39.8 26 0.00067 15.2 7.7 41 245-289 398-438 (702) 212 TIGR00666 PBP4 D-alanyl-D-alan 39.6 26 0.00067 15.1 2.7 58 54-116 71-134 (427) 213 KOG0100 consensus 39.5 13 0.00033 17.2 1.0 15 334-348 393-407 (663) 214 TIGR00715 precor6x_red precorr 39.5 26 0.00068 15.1 4.3 241 68-387 8-259 (260) 215 cd06271 PBP1_AglR_RafR_like Li 39.5 26 0.00068 15.1 13.8 104 71-210 21-128 (268) 216 TIGR02477 PFKA_PPi diphosphate 39.3 15 0.00039 16.7 1.3 283 112-418 33-368 (566) 217 TIGR02733 desat_CrtD C-3',4' d 39.2 12 0.00032 17.3 0.9 77 345-423 333-416 (499) 218 KOG0832 consensus 38.1 28 0.00071 15.0 4.2 27 322-350 175-201 (251) 219 TIGR00314 cdhA CO dehydrogenas 38.0 20 0.00052 15.9 1.8 97 54-161 238-338 (795) 220 cd06317 PBP1_ABC_sugar_binding 37.9 28 0.00071 15.0 5.1 45 120-165 49-95 (275) 221 TIGR01487 SPP-like SPP-like hy 37.9 16 0.00041 16.5 1.3 22 142-163 27-48 (223) 222 TIGR02201 heptsyl_trn_III lipo 37.4 24 0.00061 15.4 2.1 40 310-355 253-294 (347) 223 pfam09353 DUF1995 Domain of un 37.3 29 0.00073 14.9 6.5 94 69-165 39-135 (205) 224 PRK13228 consensus 37.2 29 0.00073 14.9 4.8 61 320-388 169-229 (232) 225 PRK13224 consensus 37.1 29 0.00073 14.9 3.3 60 320-388 157-216 (216) 226 TIGR01916 F420_puta_dedox F420 37.0 29 0.00074 14.9 3.1 40 262-313 125-165 (260) 227 PRK13116 consensus 36.4 30 0.00075 14.8 10.0 66 344-411 205-277 (278) 228 pfam08267 Meth_synt_1 Cobalami 36.3 24 0.0006 15.5 1.9 43 302-346 229-274 (310) 229 PRK11052 malQ 4-alpha-glucanot 36.1 11 0.00029 17.5 0.3 22 66-87 161-182 (694) 230 pfam03192 DUF257 Pyrococcus pr 36.0 30 0.00076 14.8 3.1 39 121-159 2-47 (210) 231 PRK09257 aromatic amino acid a 35.3 31 0.00078 14.7 2.7 34 121-154 111-144 (397) 232 PRK12735 elongation factor Tu; 35.2 31 0.00079 14.7 4.7 55 139-217 88-142 (396) 233 PRK02145 consensus 35.1 31 0.00079 14.7 7.6 46 189-241 85-130 (257) 234 cd01541 PBP1_AraR Ligand-bindi 34.9 31 0.00079 14.7 7.7 103 70-208 16-127 (273) 235 cd02013 TPP_Xsc_like Thiamine 34.9 31 0.00079 14.7 5.6 18 370-387 146-163 (196) 236 cd01143 YvrC Periplasmic bindi 34.8 31 0.0008 14.7 5.4 33 126-159 58-90 (195) 237 pfam04392 ABC_sub_bind ABC tra 34.7 31 0.0008 14.6 3.7 38 61-99 38-75 (292) 238 cd01977 Nitrogenase_VFe_alpha 34.5 32 0.00081 14.6 4.3 25 321-351 358-382 (415) 239 PRK00994 F420-dependent methyl 34.3 32 0.00081 14.6 2.9 33 123-159 54-94 (276) 240 cd01407 SIR2-fam SIR2 family o 34.2 32 0.00082 14.6 3.1 29 183-213 80-109 (218) 241 pfam10820 DUF2543 Protein of u 33.2 33 0.00084 14.5 4.3 65 359-424 11-77 (81) 242 PRK06456 acetolactate synthase 33.2 33 0.00084 14.5 11.6 38 57-94 69-106 (572) 243 TIGR02885 spore_sigF RNA polym 33.1 29 0.00075 14.8 2.0 35 344-385 67-101 (231) 244 cd00423 Pterin_binding Pterin 32.9 34 0.00086 14.5 8.7 144 66-242 58-207 (258) 245 pfam07894 DUF1669 Protein of u 32.7 34 0.00086 14.4 2.8 52 107-158 121-181 (284) 246 PRK09701 D-allose transporter 32.6 34 0.00086 14.4 9.2 149 120-287 75-239 (311) 247 COG1171 IlvA Threonine dehydra 32.5 34 0.00087 14.4 6.6 57 332-389 210-277 (347) 248 cd06277 PBP1_LacI_like_1 Ligan 32.3 34 0.00088 14.4 13.6 104 71-211 20-127 (268) 249 PRK13962 bifunctional phosphog 32.1 35 0.00088 14.4 6.2 25 138-162 295-321 (653) 250 PRK08591 acetyl-CoA carboxylas 32.0 35 0.00088 14.4 6.0 126 65-203 9-154 (449) 251 PTZ00254 40S ribosomal protein 31.9 35 0.00089 14.4 6.4 92 62-157 47-147 (242) 252 PRK08155 acetolactate synthase 31.8 35 0.00089 14.3 9.5 39 56-94 76-114 (564) 253 TIGR01860 VNFD nitrogenase van 31.7 35 0.00089 14.3 2.3 24 61-85 104-127 (461) 254 PRK02261 methylaspartate mutas 31.6 35 0.0009 14.3 5.0 41 324-366 88-132 (137) 255 TIGR01371 met_syn_B12ind 5-met 31.5 21 0.00054 15.8 1.0 37 59-95 223-263 (778) 256 PRK13288 pyrophosphatase PpaX; 31.4 36 0.00091 14.3 5.3 54 322-386 157-213 (214) 257 cd06278 PBP1_LacI_like_2 Ligan 31.2 36 0.00091 14.3 9.0 103 70-209 16-122 (266) 258 COG3254 Uncharacterized conser 31.1 35 0.00088 14.4 2.1 22 137-158 23-44 (105) 259 CHL00071 tufA elongation facto 30.9 36 0.00092 14.2 5.4 56 138-217 87-142 (409) 260 cd06242 Peptidase_M14-like_1_5 30.8 36 0.00092 14.2 4.8 27 55-81 53-83 (268) 261 KOG1329 consensus 30.6 28 0.00072 14.9 1.5 20 258-277 382-401 (887) 262 pfam03320 FBPase_glpX Bacteria 30.3 37 0.00094 14.2 2.2 33 262-298 153-187 (308) 263 KOG0622 consensus 30.2 21 0.00053 15.8 0.8 29 184-212 103-133 (448) 264 COG3958 Transketolase, C-termi 30.2 37 0.00095 14.2 2.1 140 240-388 117-268 (312) 265 cd06323 PBP1_ribose_binding Pe 30.0 37 0.00095 14.2 6.5 39 122-161 50-90 (268) 266 cd01452 VWA_26S_proteasome_sub 29.8 21 0.00053 15.8 0.7 82 203-284 77-159 (187) 267 PRK12861 malic enzyme; Reviewe 29.5 38 0.00097 14.1 3.0 69 111-182 259-333 (762) 268 PRK06965 acetolactate synthase 29.5 38 0.00097 14.1 11.3 38 57-94 85-122 (587) 269 TIGR02625 YiiL_rotase L-rhamno 28.5 28 0.00071 15.0 1.2 19 137-155 21-39 (102) 270 COG3340 PepE Peptidase E [Amin 28.4 40 0.001 14.0 3.1 45 203-248 86-134 (224) 271 pfam04705 TSNR_N Thiostrepton- 28.3 22 0.00056 15.6 0.7 25 137-161 59-83 (115) 272 TIGR01392 homoserO_Ac_trn homo 28.1 6.7 0.00017 19.1 -2.0 167 79-284 26-214 (395) 273 pfam07997 DUF1694 Protein of u 28.1 40 0.001 13.9 2.1 63 236-298 35-99 (120) 274 PRK12999 pyruvate carboxylase; 28.0 40 0.001 13.9 5.5 19 74-93 523-541 (1147) 275 COG0647 NagD Predicted sugar p 27.9 41 0.001 13.9 3.8 77 130-211 16-100 (269) 276 cd03371 TPP_PpyrDC Thiamine py 27.7 41 0.001 13.9 5.2 19 370-388 133-151 (188) 277 cd06324 PBP1_ABC_sugar_binding 27.7 41 0.001 13.9 8.8 47 122-168 51-99 (305) 278 PRK13435 response regulator; P 27.7 41 0.001 13.9 6.5 104 264-388 2-116 (141) 279 TIGR01162 purE phosphoribosyla 27.6 40 0.001 13.9 1.9 124 238-399 3-143 (159) 280 PRK13229 consensus 27.6 41 0.001 13.9 2.6 58 321-387 168-226 (234) 281 PRK08124 flagellar motor prote 27.4 28 0.00071 15.0 1.1 45 380-425 205-249 (263) 282 cd01516 FBPase_glpX Bacterial 27.1 42 0.0011 13.8 2.3 33 262-298 154-188 (309) 283 cd01888 eIF2_gamma eIF2-gamma 26.8 42 0.0011 13.8 6.3 87 138-247 95-185 (203) 284 PRK00830 consensus 26.7 43 0.0011 13.8 4.5 26 343-368 215-240 (273) 285 PRK11230 glycolate oxidase sub 26.6 43 0.0011 13.8 5.9 83 56-152 55-142 (499) 286 pfam05211 NLBH Neuraminyllacto 26.3 43 0.0011 13.7 4.9 159 258-420 29-226 (228) 287 COG0041 PurE Phosphoribosylcar 26.2 41 0.001 13.9 1.7 52 237-290 6-60 (162) 288 pfam00128 Alpha-amylase Alpha 26.1 44 0.0011 13.7 2.6 67 69-156 3-70 (314) 289 pfam03599 CdhD CO dehydrogenas 25.9 44 0.0011 13.7 5.3 100 58-158 71-174 (384) 290 pfam01765 RRF Ribosome recycli 25.6 45 0.0011 13.6 1.9 23 306-328 22-44 (165) 291 cd01540 PBP1_arabinose_binding 25.2 45 0.0012 13.6 7.5 44 121-165 48-93 (289) 292 TIGR02617 tnaA_trp_ase tryptop 25.2 33 0.00084 14.5 1.1 201 143-366 209-441 (468) 293 PRK09140 2-dehydro-3-deoxy-6-p 24.7 46 0.0012 13.5 5.7 85 189-290 19-107 (206) 294 pfam01996 F420_ligase F420-0:G 24.6 46 0.0012 13.5 3.5 39 262-312 112-151 (219) 295 PRK01033 imidazole glycerol ph 24.5 47 0.0012 13.5 6.9 50 189-245 84-133 (253) 296 PRK05282 peptidase E; Validate 24.5 47 0.0012 13.5 5.0 11 86-96 4-14 (233) 297 cd06274 PBP1_FruR Ligand bindi 24.5 47 0.0012 13.5 12.5 104 70-209 16-123 (264) 298 PRK13478 phosphonoacetaldehyde 24.5 47 0.0012 13.5 5.4 61 321-386 177-257 (267) 299 cd06309 PBP1_YtfQ_like Peripla 24.2 47 0.0012 13.5 4.3 43 120-163 48-92 (273) 300 COG1080 PtsA Phosphoenolpyruva 24.1 48 0.0012 13.5 4.2 76 340-422 492-570 (574) 301 COG1618 Predicted nucleotide k 24.0 48 0.0012 13.4 3.8 75 302-389 100-177 (179) 302 COG0126 Pgk 3-phosphoglycerate 24.0 48 0.0012 13.4 7.2 90 41-137 16-116 (395) 303 cd06287 PBP1_LacI_like_8 Ligan 23.9 48 0.0012 13.4 13.1 101 71-213 25-130 (269) 304 TIGR02463 MPGP_rel mannosyl-3- 23.9 48 0.0012 13.4 2.7 30 68-98 17-46 (224) 305 TIGR02176 pyruv_ox_red pyruvat 23.9 26 0.00067 15.1 0.4 33 390-422 591-623 (1194) 306 PRK06263 sdhA succinate dehydr 23.8 48 0.0012 13.4 3.9 36 308-346 353-393 (539) 307 cd01141 TroA_d Periplasmic bin 23.8 48 0.0012 13.4 3.7 34 126-159 67-100 (186) 308 TIGR00670 asp_carb_tr aspartat 23.7 48 0.0012 13.4 1.9 262 64-375 9-322 (336) 309 cd06302 PBP1_LsrB_Quorum_Sensi 23.6 49 0.0012 13.4 6.8 151 119-288 48-209 (298) 310 PRK12388 fructose-1,6-bisphosp 23.4 49 0.0013 13.4 2.3 33 262-298 154-188 (321) 311 cd06284 PBP1_LacI_like_6 Ligan 23.3 49 0.0013 13.4 13.2 104 70-210 16-123 (267) 312 PTZ00141 elongation factor 1 a 23.2 49 0.0013 13.3 6.4 32 132-163 87-122 (443) 313 PRK13631 cbiO cobalt transport 22.9 50 0.0013 13.3 3.7 75 304-388 228-317 (320) 314 PRK13645 cbiO cobalt transport 22.8 50 0.0013 13.3 3.3 68 313-388 210-289 (289) 315 PRK07804 L-aspartate oxidase; 22.7 51 0.0013 13.3 5.8 38 307-346 352-394 (533) 316 cd06307 PBP1_uncharacterized_s 22.7 51 0.0013 13.3 5.5 36 128-163 58-95 (275) 317 cd00739 DHPS DHPS subgroup of 22.6 51 0.0013 13.3 8.9 146 64-242 56-206 (257) 318 pfam00809 Pterin_bind Pterin b 22.6 51 0.0013 13.3 9.6 146 64-243 51-203 (208) 319 cd06325 PBP1_ABC_uncharacteriz 22.5 51 0.0013 13.3 5.2 34 66-99 43-76 (281) 320 PRK09479 glpX fructose 1,6-bis 22.2 52 0.0013 13.2 2.3 33 262-298 155-189 (320) 321 pfam03614 Flag1_repress Repres 22.2 23 0.00057 15.6 -0.2 51 135-193 6-63 (165) 322 PHA00527 hypothetical protein 21.8 53 0.0013 13.2 2.8 37 167-203 4-41 (129) 323 PRK07232 malic enzyme; Reviewe 21.8 53 0.0013 13.2 3.7 69 111-182 258-332 (753) 324 pfam02601 Exonuc_VII_L Exonucl 21.7 53 0.0013 13.2 4.7 83 69-165 25-121 (295) 325 cd00885 cinA Competence-damage 21.7 53 0.0013 13.2 4.4 84 190-289 21-108 (170) 326 PRK12837 3-ketosteroid-delta-1 21.7 42 0.0011 13.8 1.1 44 362-408 366-409 (515) 327 PRK12415 fructose 1,6-bisphosp 21.7 53 0.0013 13.2 2.1 24 270-297 165-188 (323) 328 PRK05506 bifunctional sulfate 21.6 53 0.0014 13.2 5.8 58 346-403 445-507 (613) 329 pfam07925 RdRP_5 Reovirus RNA- 21.6 24 0.00062 15.4 -0.2 33 138-175 815-847 (1271) 330 PRK11891 aspartate carbamoyltr 21.5 53 0.0014 13.1 2.7 22 269-290 276-297 (430) 331 pfam02906 Fe_hyd_lg_C Iron onl 21.5 53 0.0014 13.1 4.7 115 59-203 51-170 (286) 332 PRK13135 consensus 21.3 54 0.0014 13.1 7.8 62 343-412 202-263 (267) 333 PRK04000 translation initiatio 21.3 54 0.0014 13.1 5.5 85 138-247 96-186 (410) 334 COG2099 CobK Precorrin-6x redu 21.2 54 0.0014 13.1 4.4 126 57-207 3-133 (257) 335 PRK13124 consensus 21.1 54 0.0014 13.1 7.1 63 343-412 193-255 (257) 336 TIGR02125 CytB-hydogenase Ni/F 20.8 45 0.0012 13.6 1.1 33 316-348 103-135 (229) 337 PRK01170 phosphopantetheine ad 20.8 55 0.0014 13.0 1.6 22 412-433 306-327 (328) 338 cd06305 PBP1_methylthioribose_ 20.6 56 0.0014 13.0 5.1 43 119-162 47-91 (273) 339 PRK07586 hypothetical protein; 20.5 56 0.0014 13.0 5.7 38 56-93 64-101 (514) 340 PRK13644 cbiO cobalt transport 20.4 56 0.0014 13.0 3.6 62 319-388 200-272 (274) 341 cd01537 PBP1_Repressors_Sugar_ 20.3 57 0.0014 13.0 11.0 149 121-289 49-203 (264) 342 KOG2648 consensus 20.2 57 0.0014 13.0 3.5 94 263-366 267-373 (453) 343 COG0145 HyuA N-methylhydantoin 20.1 57 0.0015 12.9 1.6 104 70-193 139-243 (674) No 1 >PRK05749 3-deoxy-D-manno-octulosonic-acid transferase; Reviewed Probab=100.00 E-value=0 Score=885.56 Aligned_cols=418 Identities=40% Similarity=0.656 Sum_probs=405.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCC--C Q ss_conf 99999999999999999999999974184400112454056777788887799981789999999999999986199--8 Q gi|254780402|r 8 ILLGIYRWGGIFFMPFLSVSLSLYRVFNRERGRKFGERLGYPTALRPIGPLIWFHASSVGETMALIGLIPAIRSRHV--N 85 (440) Q Consensus 8 ~~~~lY~~~~~l~~p~~~~~~~~~~~~~k~~~~~~~eR~g~~~~~~~~~~~IW~HaaSvGE~~~a~pli~~l~~~~~--~ 85 (440) ||+++||+++++++|++.+++++|..++|++++++.||||+++..+|++++||||||||||++++.||+++|++++| + T Consensus 1 m~~~lY~~l~~l~~p~~~~~~~~r~~~~~~~~~~~~eR~g~~~~~~~~~~~IW~HaaSvGE~~~~~pli~~l~~~~p~~~ 80 (423) T PRK05749 1 MLRLLYTALLYLALPLALLRLLLRSRKAPEYRKRWGERFGLYKPPVPKGPLIWFHAVSVGETRAAIPLIRALRKRYPDLP 80 (423) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCE T ss_conf 96999999999999999999999987497535428987589888788998599982879899999999999996299974 Q ss_pred EEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 99996277630012101477416885055720134455744202035640476214678887422455058641266542 Q gi|254780402|r 86 VLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRS 165 (440) Q Consensus 86 iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S 165 (440) ++|||+|+||++.+++.+++.+.|+|+|+|+++++++|+++|+|++++|+|+|+|||++.+|+++|||++++|||+|++| T Consensus 81 ilvTt~T~sg~~~~~~~~~~~~~~~ylP~D~~~~~~rfl~~~~P~~~i~~E~EiWPnli~~~~~~~Ip~~liNaR~s~~S 160 (423) T PRK05749 81 ILVTTMTPTGSERAKALFGDDVEHVYLPYDLPFAVRRFLRFWRPKLLIIMETELWPNLIAEAKKRGIPLVLANARLSERS 160 (423) T ss_pred EEEEECCCHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHCCCCEEEECCCCCCHH T ss_conf 99983783099999986689737999224787999999997398879986203108899999627886676525116336 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCC-CEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCC Q ss_conf 12100111123310264423664157788899860885-05872043101234422125689999861798617504655 Q gi|254780402|r 166 FKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQ-KLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTF 244 (440) Q Consensus 166 ~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~-~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth 244 (440) +++|++++++++.+|++||.|++||++|++||+++|++ ++.|+||+|||...++.+.+....+.+.++++++|+||||| T Consensus 161 ~~~y~~~~~~~~~~l~~~~~i~~qs~~~~~r~~~lG~~~~v~v~GnlKfd~~~~~~~~~~~~~~~~~~~~~~v~vagSth 240 (423) T PRK05749 161 FKRYAKFKRFYRLLFKNIDLVLAQSEEDAERFLALGAKNEVTVTGNLKFDIEIPPELAARAAALRRQLPNRPVWIAASTH 240 (423) T ss_pred HHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCCCEEECCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 76667669999999974276652699999999975999743876863224578711078899999981899689999288 Q ss_pred CCCCCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCC Q ss_conf 33210000000000-12341248897055211478887765300101100134444466416750676520255422082 Q gi|254780402|r 245 EGEEDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEI 323 (440) Q Consensus 245 ~~EE~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~Adi 323 (440) +|||++++++++.+ ++.|++++|||||||||++++.+++.+.|+++.+||+.+.+..+++|+|+||+|||+.||++||+ T Consensus 241 ~~EE~iil~~~~~l~~~~~~~~lIiaPRHpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~Vli~Dt~GeL~~~Y~~a~i 320 (423) T PRK05749 241 EGEEELVLDAHQALLKQFPNLLLILVPRHPERFKEVEELLKKAGLSYVRRSQGEAPSADTDVLLGDTMGELGLLYAIADI 320 (423) T ss_pred HHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCEEEEECCCCHHHHHHHHCCE T ss_conf 76999999999999740878289994787767999999999679977982799999988729998888758899985787 Q ss_pred EEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 06613554445458376896199499989844379999999978986995798999999999958989999999999999 Q gi|254780402|r 324 AFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINE 403 (440) Q Consensus 324 afVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~ 403 (440) |||||||.+.|||||+|||++|+||++|||++||+|+++.|.++|++.+|+|+++|.+.+..|++||+.+++|+++|+++ T Consensus 321 afvGGsf~~~GGHN~lEpa~~g~pvi~GP~~~nf~e~~~~L~~~g~~~~v~~~~eL~~~~~~ll~~~~~~~~~~~~a~~~ 400 (423) T PRK05749 321 AFVGGSLVKIGGHNPLEPAAFGVPVISGPHTFNFKEIFERLLQAGAAIQVEDAEDLAKAVSSLLTDPDAREAMGEAGVAF 400 (423) T ss_pred EEECCCCCCCCCCCHHHHHHCCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 89827768889959799998399889993832779999999978995896899999999999976999999999999999 Q ss_pred HHHCCCHHHHHHHHHHHHHHHH Q ss_conf 9704556999999999752378 Q gi|254780402|r 404 VKKMQGPLKITLRSLDSYVNPL 425 (440) Q Consensus 404 i~~~~ga~~~~~~~i~~~l~~~ 425 (440) +++|+||++|+++.|++||+++ T Consensus 401 v~~~~Gat~r~~~~i~~~L~~~ 422 (423) T PRK05749 401 LKQNRGALQRTLQLLKPYLPPL 422 (423) T ss_pred HHHCCCHHHHHHHHHHHHCCCC T ss_conf 9978479999999999763578 No 2 >COG1519 KdtA 3-deoxy-D-manno-octulosonic-acid transferase [Cell envelope biogenesis, outer membrane] Probab=100.00 E-value=0 Score=858.60 Aligned_cols=414 Identities=39% Similarity=0.670 Sum_probs=401.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHC-CCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCC-- Q ss_conf 999999999999999999999997418440011245405-67777888877999817899999999999999861998-- Q gi|254780402|r 9 LLGIYRWGGIFFMPFLSVSLSLYRVFNRERGRKFGERLG-YPTALRPIGPLIWFHASSVGETMALIGLIPAIRSRHVN-- 85 (440) Q Consensus 9 ~~~lY~~~~~l~~p~~~~~~~~~~~~~k~~~~~~~eR~g-~~~~~~~~~~~IW~HaaSvGE~~~a~pli~~l~~~~~~-- 85 (440) ++++|+++++++.|++.+++++|+.+.++|++++.|||| +....+|.++.||||||||||++++.||+++|++++|+ T Consensus 1 ~~~lY~~l~~~~~p~~~~~l~~R~~~~~~y~~r~~eRfg~~~~~~~~~~p~vWiHaaSVGEv~a~~pLv~~l~~~~P~~~ 80 (419) T COG1519 1 LRFLYRLLLTLALPFIAPRLLYRSFKGPKYRKRLGERFGFYKPPVKPEGPLVWIHAASVGEVLAALPLVRALRERFPDLR 80 (419) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 90399999999989999999999740757788999871557877788888079996446678888999999997689987 Q ss_pred EEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 99996277630012101477416885055720134455744202035640476214678887422455058641266542 Q gi|254780402|r 86 VLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRS 165 (440) Q Consensus 86 iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S 165 (440) +++||+||||++.+++.+++.+.|+|+|+|++.+++||+++|+|+++|++|||+|||+|.+|+++|||++++|||+|+|| T Consensus 81 ilvTt~T~Tg~e~a~~~~~~~v~h~YlP~D~~~~v~rFl~~~~P~l~Ii~EtElWPnli~e~~~~~~p~~LvNaRLS~rS 160 (419) T COG1519 81 ILVTTMTPTGAERAAALFGDSVIHQYLPLDLPIAVRRFLRKWRPKLLIIMETELWPNLINELKRRGIPLVLVNARLSDRS 160 (419) T ss_pred EEEEECCCCHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCCEEEEEEEECHHH T ss_conf 89995276379999987698708996576766889999974289879998001367899999876998999942302325 Q ss_pred CCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCC-CCEEECCCCC Q ss_conf 121001111233102644236641577888998608850587204310123442212568999986179-8617504655 Q gi|254780402|r 166 FKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAG-RYTWAAISTF 244 (440) Q Consensus 166 ~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~-r~v~vagSth 244 (440) +++|++++.+++.+|+++|+|++||+.|++||++||+.++.++||+|||..+++.+......++..+++ |++|+|+||| T Consensus 161 ~~~y~k~~~~~~~~~~~i~li~aQse~D~~Rf~~LGa~~v~v~GNlKfd~~~~~~~~~~~~~~r~~l~~~r~v~iaaSTH 240 (419) T COG1519 161 FARYAKLKFLARLLFKNIDLILAQSEEDAQRFRSLGAKPVVVTGNLKFDIEPPPQLAAELAALRRQLGGHRPVWVAASTH 240 (419) T ss_pred HHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCCEEEECCEEECCCCCHHHHHHHHHHHHHCCCCCCEEEEECCC T ss_conf 77798778999999974233345488889999964986138633324237787324899999998508888559995477 Q ss_pred CCCCCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCC Q ss_conf 33210000000000-12341248897055211478887765300101100134444466416750676520255422082 Q gi|254780402|r 245 EGEEDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEI 323 (440) Q Consensus 245 ~~EE~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~Adi 323 (440) +|||++++++|+.+ ++.|++++|+|||||||++++.+++++.||++++||+++.+..++||+|+|+||||+.+|++||| T Consensus 241 ~GEeei~l~~~~~l~~~~~~~llIlVPRHpERf~~v~~l~~~~gl~~~~rS~~~~~~~~tdV~l~DtmGEL~l~y~~adi 320 (419) T COG1519 241 EGEEEIILDAHQALKKQFPNLLLILVPRHPERFKAVENLLKRKGLSVTRRSQGDPPFSDTDVLLGDTMGELGLLYGIADI 320 (419) T ss_pred CCHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCCCCEEEEECHHHHHHHHHHCCE T ss_conf 86388999999999963899569991587556799999999759818861378999888868999628689999734327 Q ss_pred EEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 06613554445458376896199499989844379999999978986995798999999999958989999999999999 Q gi|254780402|r 324 AFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINE 403 (440) Q Consensus 324 afVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~ 403 (440) |||||||++.|||||||||++|+|||+|||++||.|+++.|.++|++.+|+|.+.|...+..+++|++.+++|++++..+ T Consensus 321 AFVGGSlv~~GGHN~LEpa~~~~pvi~Gp~~~Nf~ei~~~l~~~ga~~~v~~~~~l~~~v~~l~~~~~~r~~~~~~~~~~ 400 (419) T COG1519 321 AFVGGSLVPIGGHNPLEPAAFGTPVIFGPYTFNFSDIAERLLQAGAGLQVEDADLLAKAVELLLADEDKREAYGRAGLEF 400 (419) T ss_pred EEECCCCCCCCCCCHHHHHHCCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 99877446778988223787089788777513589999999866986997788999999999707889999999989999 Q ss_pred HHHCCCHHHHHHHHHHHHH Q ss_conf 9704556999999999752 Q gi|254780402|r 404 VKKMQGPLKITLRSLDSYV 422 (440) Q Consensus 404 i~~~~ga~~~~~~~i~~~l 422 (440) +.+++|+.+++++.|++|+ T Consensus 401 v~~~~gal~r~l~~l~~~~ 419 (419) T COG1519 401 LAQNRGALARTLEALKPYL 419 (419) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 9986779999999755119 No 3 >pfam04413 Glycos_transf_N 3-Deoxy-D-manno-octulosonic-acid transferase (kdotransferase). Members of this family transfer activated sugars to a variety of substrates, including glycogen, fructose-6-phosphate and lipopolysaccharides. Members of the family transfer UDP, ADP, GDP or CMP linked sugars. The Glycos_transf_N region is flanked at the N-terminus by a signal peptide and at the C-terminus by Glycos_transf_1 (pfam00534). The eukaryotic glycogen synthases may be distant members of this bacterial family. Probab=100.00 E-value=0 Score=375.52 Aligned_cols=179 Identities=42% Similarity=0.752 Sum_probs=168.6 Q ss_pred CHHHHHHHHCCCCC--CCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCC--CEEEEECCCCCCHHHHCCCCCCEEEEECC Q ss_conf 00112454056777--788887799981789999999999999986199--89999627763001210147741688505 Q gi|254780402|r 38 RGRKFGERLGYPTA--LRPIGPLIWFHASSVGETMALIGLIPAIRSRHV--NVLLTTMTATSAKVARKYLGQYAIHQYAP 113 (440) Q Consensus 38 ~~~~~~eR~g~~~~--~~~~~~~IW~HaaSvGE~~~a~pli~~l~~~~~--~iliT~~T~tg~~~~~~~~~~~~~~~y~P 113 (440) +++++.||+|++.. .++.+++|||||||+||++++.||+++|++++| +++|||+|+||++.+++.+++.+.|+|+| T Consensus 1 ~~~~~~eR~g~~~~~~~~~~~~~IWiHaaSvGE~~~~~~li~~l~~~~p~~~iliT~~T~sg~~~~~~~~~~~~~~~ylP 80 (186) T pfam04413 1 YRKRLRERFGFASPPPPQGGGPLIWLHAASVGEVLAALPLIEALKARYPGLRILVTTTTPTGAELARKLLPDRVIHQYLP 80 (186) T ss_pred CCCHHHHHCCCCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHCCCCCEEEECC T ss_conf 90224776688888888999988999839889999999999999986899629998358169999998678980799777 Q ss_pred CCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHH Q ss_conf 57201344557442020356404762146788874224550586412665421210011112331026442366415778 Q gi|254780402|r 114 LDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERY 193 (440) Q Consensus 114 ~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~ 193 (440) +|+++.+++|+++|+|+++||+|+|||||++.+|+++|||++|+|||+|++|+++|++++.+++.+|+.||.|++||++| T Consensus 81 ~D~~~~~~~fl~~~~P~~~i~~e~EiWPnli~~~~~~~ip~~linar~s~~s~~~~~~~~~~~~~~l~~~~~i~~qs~~~ 160 (186) T pfam04413 81 LDLPGAVRRFLKHWRPDLLVIVETELWPNLIAAAKKRGIPVVLVNARLSERSARRYRRFPGLFRPLLRRFDLILAQSEED 160 (186) T ss_pred CCCHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCEEEECCHHH T ss_conf 67779999999985998899986132099999999879999999777686422268889999999998459999689999 Q ss_pred HHHHHHHCCC--CEEEECCCCCCCC Q ss_conf 8899860885--0587204310123 Q gi|254780402|r 194 FRRYKELGAQ--KLIVSGNLKIDTE 216 (440) Q Consensus 194 ~~rl~~lG~~--~i~v~GnlK~d~~ 216 (440) ++||+++|++ ++.++||+|||.. T Consensus 161 ~~r~~~lG~~~~~v~v~GnlKfd~~ 185 (186) T pfam04413 161 AERFRALGAPPDRVEVTGNLKFDAA 185 (186) T ss_pred HHHHHHCCCCHHHEEEECCCCCCCC T ss_conf 9999984998357899187766789 No 4 >cd03817 GT1_UGDG_like This family is most closely related to the GT1 family of glycosyltransferases. UDP-glucose-diacylglycerol glucosyltransferase (UGDG; also known as 1,2-diacylglycerol 3-glucosyltransferase) catalyzes the transfer of glucose from UDP-glucose to 1,2-diacylglycerol forming 3-D-glucosyl-1,2-diacylglycerol. Probab=99.88 E-value=1.3e-19 Score=148.45 Aligned_cols=327 Identities=18% Similarity=0.151 Sum_probs=201.6 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCC----------CCCHHHHHHHHHHCCEEEEEEC Q ss_conf 999999999999986199899996277630012101477416885055----------7201344557442020356404 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPL----------DIQPAVSRFLKYWKPDCMILSE 136 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~----------D~~~~~~~fl~~~~P~~~i~~e 136 (440) |=-..+..|.+.|.+++.+|.|-|++..+... ......+.....|. -......+.++..+||++.. - T Consensus 15 G~~~~~~~La~~L~~~GheV~Vit~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~DvIh~-~ 91 (374) T cd03817 15 GVATSIRRLAEELEKRGHEVYVVAPSYPGAPE--EEEVVVVRPFRVPTFKYPDFRLPLPIPRALIIILKELGPDIVHT-H 91 (374) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCC--CCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEE-C T ss_conf 09999999999999779989999727988775--43576289843677765213455557999999998669999998-7 Q ss_pred CCCHHHH--HHHHHHCCCCEEEEEE-C--------CCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC-CC Q ss_conf 7621467--8887422455058641-2--------665421210011112331026442366415778889986088-50 Q gi|254780402|r 137 SDIWPLT--VFELSKQRIPQVLVNA-R--------MSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA-QK 204 (440) Q Consensus 137 ~ElWPnl--i~~~~~~~ip~~lina-r--------ls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~-~~ 204 (440) +.+|... ...+++.+||++..-- . ........+..+..+.+.+++..|.|++.|+..++.+.+.|. .+ T Consensus 92 ~~~~~~~~a~~~~~~~~ip~V~t~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ii~~S~~~~~~l~~~~~~~~ 171 (374) T cd03817 92 TPFSLGLLGLRVARKLGIPVVATYHTMYEDYTHYVPLGRLLARAVVRRKLSRRFYNRCDAVIAPSEKIADLLREYGVKRP 171 (374) T ss_pred CHHHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCEEEECHHHHHHHHHHCCCCCC T ss_conf 82588999999999749959999557779988763111013567899999999998599999780999999997089998 Q ss_pred EEEECCCCCCCCC-CCCCHHHHHHHHHHCCCCCEEEC-CCCCC-CCCCCCCCCCCCCC-CCCCEEEEEEECCCCHHHHHH Q ss_conf 5872043101234-42212568999986179861750-46553-32100000000001-234124889705521147888 Q gi|254780402|r 205 LIVSGNLKIDTES-LPCDKELLSLYQESIAGRYTWAA-ISTFE-GEEDKAVYVHNFIK-CRTDVLTIIVPRHPRRCDAIE 280 (440) Q Consensus 205 i~v~GnlK~d~~~-~~~~~~~~~~~~~~~~~r~v~va-gSth~-~EE~~il~a~~~l~-~~~~~~lIIvPRhpeR~~~i~ 280 (440) +.+..|- .|... .+..............++++++. |...+ .--+.+++++..++ +.++..|+|+..-|++ ++++ T Consensus 172 i~vI~ng-vd~~~f~~~~~~~~~~~~~~~~~~~~i~~~Grl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~G~~~-~~l~ 249 (374) T cd03817 172 IEVIPTG-IDLDRFEPVDGDDERRKLGIPEDEPVLLYVGRLAKEKNIDFLIRAFARLLKEEPDVKLVIVGDGPER-EELE 249 (374) T ss_pred EEEECCC-CCHHHCCCCCHHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCHHH-HHHH T ss_conf 8998696-0666439864178999818999984999970575421078999999988741898489998774476-5567 Q ss_pred HHHHHHCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHH Q ss_conf 7765300101100134444466416750676--52025542208206613554445458376896199499989844379 Q gi|254780402|r 281 RRLIAKGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFR 358 (440) Q Consensus 281 ~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~ 358 (440) +++++.|+.-. |.+...+ -++..+|+.||+ ||-.|..+.-|..++||+++|+|||... ....+ T Consensus 250 ~~~~~~~l~~~-------------V~f~G~v~~~~~~~~l~~adi-~v~pS~~E~fg~~~~EAma~G~PvI~s~-~gg~~ 314 (374) T cd03817 250 ELARELGLADR-------------VIFTGFVPREELPDYYKAADL-FVFASTTETQGLVLLEAMAAGLPVVAVD-APGLP 314 (374) T ss_pred HHHHHHCCCCC-------------EEECCCCCHHHHHHHHHHCCC-CCCCCCCCCCCHHHHHHHHCCCCEEEEC-CCCHH T ss_conf 88887424662-------------443588756677876442475-4477766577599999998199899917-99759 Q ss_pred HHHHHHHHCCCEEEE-CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 999999978986995-798999999999958989999999999999970455699999999 Q gi|254780402|r 359 DIYRRMVSSGAVRIV-EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSL 418 (440) Q Consensus 359 e~~~~L~~~g~~~~v-~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i 418 (440) |+.+ +...++.+ .+.++|++++..+++||+.+++|+++|++.+++.. -++.++.| T Consensus 315 e~i~---~g~~G~l~~~~~~~l~~~i~~l~~~~~~~~~~~~~a~~~a~~f~--~~~~~~~~ 370 (374) T cd03817 315 DLVA---DGENGFLFPPGDEALAEALLRLLQDPELRRRLSKNAEESAEKFS--FAKKVEKL 370 (374) T ss_pred HHHH---CCCEEEEECCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC--HHHHHHHH T ss_conf 9964---79859996978699999999997599999999999999999789--99999999 No 5 >cd03814 GT1_like_2 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=99.87 E-value=4.3e-19 Score=145.04 Aligned_cols=322 Identities=17% Similarity=0.135 Sum_probs=197.6 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEE----------ECCCCCCHHHHHHHHHHCCEEEEEEC Q ss_conf 99999999999998619989999627763001210147741688----------50557201344557442020356404 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQ----------YAPLDIQPAVSRFLKYWKPDCMILSE 136 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~----------y~P~D~~~~~~~fl~~~~P~~~i~~e 136 (440) |=-..+..|++.|.+++.+|.|-|++....... ........ .+.+......+++++..+||++... T Consensus 15 G~e~~~~~la~~L~~~Gh~V~v~t~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pDiIh~~- 90 (364) T cd03814 15 GVVRTLQRLVEHLRARGHEVLVIAPGPFRESEG---PARVVPVPSVPLPGYPEIRLALPPRRRVRRLLDAFAPDVVHIA- 90 (364) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCC---CCCEEEEEEEECCCCCCCEECCCHHHHHHHHHHHCCCCEEEEC- T ss_conf 499999999999997799899997899765555---6634678667468887630020329999999986599999987- Q ss_pred CCCHHHH--HHHHHHCCCCEEEEE-ECCCC---CCCCC-C-HHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEE Q ss_conf 7621467--888742245505864-12665---42121-0-011112331026442366415778889986088505872 Q gi|254780402|r 137 SDIWPLT--VFELSKQRIPQVLVN-ARMSR---RSFKN-W-KTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVS 208 (440) Q Consensus 137 ~ElWPnl--i~~~~~~~ip~~lin-arls~---~S~~~-~-~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~ 208 (440) +..+-.+ ...+++.++|.+..- ..+.+ ..... . +....+.+.+++..|.|++.|+..++.+...|.+++.+. T Consensus 91 ~~~~~~~~a~~~~~~~~ip~i~~~H~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ii~~S~~~~~~~~~~~~~~~~vi 170 (364) T cd03814 91 TPGPLGLAALRAARRLGIPVVTSYHTDFPEYLRYYGLGPLSWLAWAYLRWFHNRADRVLVPSPSLADELRARGFRRVRLW 170 (364) T ss_pred CCCHHHHHHHHHHHHCCCCEEEEECCCHHHHHHHHCCHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHHCCCCEEEE T ss_conf 84167899999999759978999747648887760320568999999999985079999899999999985099888996 Q ss_pred CCCCCCCCC-CCCCHHHHHHHHHHCCCCCEEE-CCCCC--CCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHH Q ss_conf 043101234-4221256899998617986175-04655--3321000000000012341248897055211478887765 Q gi|254780402|r 209 GNLKIDTES-LPCDKELLSLYQESIAGRYTWA-AISTF--EGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLI 284 (440) Q Consensus 209 GnlK~d~~~-~~~~~~~~~~~~~~~~~r~v~v-agSth--~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~ 284 (440) .|- .|... .+........-.....++++++ +|... +| -+.+++|.+.+++.++..++|+..-|++ +++++.. T Consensus 171 ~nG-vd~~~f~p~~~~~~~~~~~~~~~~~~i~~vGrl~~~Kg-~~~ll~a~~~l~~~~~~~l~ivG~G~~~-~~l~~~~- 246 (364) T cd03814 171 PRG-VDTELFHPRRRDEALRARLGPPDRPVLLYVGRLAPEKN-LEALLDADLPLRRRPPVRLVIVGDGPAR-ARLEARY- 246 (364) T ss_pred CCC-CCHHHCCCCCCCHHHHHHCCCCCCCEEEEEECCCCCCC-HHHHHHHHHHHHCCCCEEEEEECCCCCH-HHHHHCC- T ss_conf 896-16988487543266653026899838999645755578-9999999997300588599998476339-9998518- Q ss_pred HHCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHH Q ss_conf 300101100134444466416750676--520255422082066135544454583768961994999898443799999 Q gi|254780402|r 285 AKGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYR 362 (440) Q Consensus 285 ~~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~ 362 (440) .+|...... .|+..+|+.||+ ||-.|..+.=|..++||+++|+|||.-+ ....+|+.+ T Consensus 247 ------------------~~v~f~G~~~~~el~~~~~~adi-~v~pS~~E~fg~~~lEAma~G~PvI~s~-~gg~~Eiv~ 306 (364) T cd03814 247 ------------------PNVHFLGFLDGEELAAAYASADV-FVFPSRTETFGLVVLEAMASGLPVVAPD-AGGPADIVT 306 (364) T ss_pred ------------------CCEEEECCCCHHHHHHHHHHCCC-CCCCCCCCCCCHHHHHHHHCCCCEEECC-CCCHHHHHC T ss_conf ------------------98799078998999999982475-6788653457657999998399899958-997488831 Q ss_pred HHHHCCCEEEE--CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 99978986995--798999999999958989999999999999970455699999999975 Q gi|254780402|r 363 RMVSSGAVRIV--EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSY 421 (440) Q Consensus 363 ~L~~~g~~~~v--~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~ 421 (440) +.+.+..+ .|+++|++++..|++||+.+++||++|++.+++. .-+++.+.+.+. T Consensus 307 ---~~~~G~l~~~~d~~~la~~i~~l~~~~~~~~~mg~~ar~~~~~~--~w~~~~~~~~~~ 362 (364) T cd03814 307 ---DGENGLLVEPGDAEAFAAALAALLADPELRRRMAARARAEAERR--SWEAFLDNLLEA 362 (364) T ss_pred ---CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHC--CHHHHHHHHHHH T ss_conf ---79828997999999999999999769999999999999999968--999999999997 No 6 >PRK10307 predicted glycosyl transferase; Provisional Probab=99.87 E-value=6e-19 Score=144.07 Aligned_cols=336 Identities=14% Similarity=0.135 Sum_probs=200.7 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCC--------CCCHHHH--CCCCCCEEEE----ECCCCCCHHHHHH--------- Q ss_conf 999999999999986199899996277--------6300121--0147741688----5055720134455--------- Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTA--------TSAKVAR--KYLGQYAIHQ----YAPLDIQPAVSRF--------- 123 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~--------tg~~~~~--~~~~~~~~~~----y~P~D~~~~~~~f--------- 123 (440) |-...+..|.+.|.+++.+|-|-|..| .|+...+ ....+.+.+. |.|-. +...+|+ T Consensus 16 G~~~~~~~La~~L~~~GheV~Vit~~p~~p~~~~~~~~~~~~~~~e~~~gv~v~R~p~~~~~~-~~~~~r~~~~~~f~~~ 94 (415) T PRK10307 16 GIGKYTGEMAEWLAARGHEVRVITAPPYYPQWQVGEGYSAWRYRREERGGVTVWRCPLYVPKQ-PSGLKRLLHLGSFALS 94 (415) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCCCCEEEECCEEEEEEECCCCCC-CHHHHHHHHHHHHHHH T ss_conf 799999999999997899899997799887665577766654311367888999830456788-4089999999999999 Q ss_pred -------HHHHCCEEEEEECCCC--HHHHHHHHHHCCCCEEEE-EE----------CCCC-CCCCCCHHHHHHHHHCCCC Q ss_conf -------7442020356404762--146788874224550586-41----------2665-4212100111123310264 Q gi|254780402|r 124 -------LKYWKPDCMILSESDI--WPLTVFELSKQRIPQVLV-NA----------RMSR-RSFKNWKTVLSFSKKIFSQ 182 (440) Q Consensus 124 -------l~~~~P~~~i~~e~El--WPnli~~~~~~~ip~~li-na----------rls~-~S~~~~~~~~~l~~~~~~~ 182 (440) ...|+||.++.....+ .|-.....+..++|.++. .. .+.. +...-.+....+.+.++++ T Consensus 95 ~~~~~~~~~~~~pD~v~~~~p~~~~~~~~~~~~~~~~~~~v~~~~d~~~~~~~~~g~l~~~~~~~~~~~~~~~e~~~~~~ 174 (415) T PRK10307 95 SFPPLMAQRRWRPDRVIGVAPTLFCAPGARLLARLSGARTWLHIQDYEVDAAFGLGLLGKGKGGKVARLATAFERSLLRR 174 (415) T ss_pred HHHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHHHCCCCEEEEEEEEEHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 99999998476999899928778899999999996298889999002114566514011244569999999999999985 Q ss_pred CCEEEECCHHHHHHHHHHCC--CCEEEECCC-CCCCCCCCCCHHHHHHHHHHC---CCCCEEEC-CCCCC-CCCCCCCCC Q ss_conf 42366415778889986088--505872043-101234422125689999861---79861750-46553-321000000 Q gi|254780402|r 183 FSLVIVQSERYFRRYKELGA--QKLIVSGNL-KIDTESLPCDKELLSLYQESI---AGRYTWAA-ISTFE-GEEDKAVYV 254 (440) Q Consensus 183 ~~~i~~qs~~~~~rl~~lG~--~~i~v~Gnl-K~d~~~~~~~~~~~~~~~~~~---~~r~v~va-gSth~-~EE~~il~a 254 (440) .|.|.+.|+..++++++.|. +++.+..|- ..+.. .+........++..+ .++++++. |...+ ..-+.+++| T Consensus 175 ad~v~~~S~~~~~~l~~~g~~~~ki~vipNgvd~~~f-~p~~~~~~~~~r~~~g~~~~~~vvly~Grl~~~kg~~~li~A 253 (415) T PRK10307 175 FDNVSTISRSMMNKAREKGVAAEKVIFFPNWSDVARF-QDVADADVDALRAQLGLPDGKKIVLYSGNIGEKQGLELVIDA 253 (415) T ss_pred CCEEEECCHHHHHHHHHHCCCCCEEEEECCCCCCCCC-CCCCCCCHHHHHHHHCCCCCCEEEEEECCCHHHCCHHHHHHH T ss_conf 8989977999999999828998709997681510003-787852068999970999998799994776011187999999 Q ss_pred CCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECC--CCCCHHHHHHCCCEEEECCCCC Q ss_conf 00001234124889705521147888776530010110013444446641675067--6520255422082066135544 Q gi|254780402|r 255 HNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDT--IGEMGFYLRMTEIAFIGRSFCA 332 (440) Q Consensus 255 ~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt--~GeL~~lY~~AdiafVGGSl~~ 332 (440) ...++..|++.++|+..-|++ ++++++++..|+.- |.+.+. --++..+|+.||+ |+-.|..+ T Consensus 254 ~~~l~~~~~~~lvivG~G~~~-~~L~~~a~~~gl~~--------------V~f~g~~~~e~l~~~~~~aDv-~v~ps~~e 317 (415) T PRK10307 254 AARLRDRPDLIFVICGQGGGK-ARLEKMARCRGLRN--------------VHFLPLQPYDALPALLKMADC-HLLPQKAG 317 (415) T ss_pred HHHHHCCCCEEEEEECCCCHH-HHHHHHHHHCCCCC--------------EEECCCCCHHHHHHHHHHCCE-EEEECCCC T ss_conf 998312898699996887408-99999999709983--------------898188788999999984749-99704411 Q ss_pred CCCCC----HHHHHHHCCCEEECCCCCCHHHHHHHHHH---CCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 45458----37689619949998984437999999997---898699579899999999995898999999999999997 Q gi|254780402|r 333 SGGQN----PLEAAMLGCAILSGPNVENFRDIYRRMVS---SGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVK 405 (440) Q Consensus 333 ~GGhN----~lEpaa~G~pVi~GP~~~nf~e~~~~L~~---~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~ 405 (440) ..|.. ++|.+++|+||+....-. .+..+..++ +|......|+++|++++.+|++||+.+++||++|+++++ T Consensus 318 ~~~~v~Pskl~~~mA~G~PVva~~~~g--~~~~~~v~~~~~~G~~v~p~d~~~La~ai~~ll~d~~~~~~mg~~gr~~~~ 395 (415) T PRK10307 318 AADLVLPSKLTNMLASGRNVVATAEPG--TELGQLCETFPGIGVCVPPEDVEALVAAIRQLLALPKRRTALGAAAREYAE 395 (415) T ss_pred CCCCCCHHHHHHHHHCCCCEEEEECCC--CCCCHHEEEECCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 123457579999986699689992588--765201276278089978999999999999997799999999999999999 Q ss_pred HCCCHHHHHHHHHHHHHH Q ss_conf 045569999999997523 Q gi|254780402|r 406 KMQGPLKITLRSLDSYVN 423 (440) Q Consensus 406 ~~~ga~~~~~~~i~~~l~ 423 (440) ++-. -+++++.....+. T Consensus 396 ~~f~-~e~~~~~~~~~~~ 412 (415) T PRK10307 396 RTLD-RENVLRQFIADIR 412 (415) T ss_pred HHCC-HHHHHHHHHHHHH T ss_conf 9779-9999999999999 No 7 >cd03796 GT1_PIG-A_like This family is most closely related to the GT1 family of glycosyltransferases. Phosphatidylinositol glycan-class A (PIG-A), an X-linked gene in humans, is necessary for the synthesis of N-acetylglucosaminyl-phosphatidylinositol, a very early intermediate in glycosyl phosphatidylinositol (GPI)-anchor biosynthesis. The GPI-anchor is an important cellular structure that facilitates the attachment of many proteins to cell surfaces. Somatic mutations in PIG-A have been associated with Paroxysmal Nocturnal Hemoglobinuria (PNH), an acquired hematological disorder. Probab=99.87 E-value=4e-19 Score=145.22 Aligned_cols=332 Identities=15% Similarity=0.090 Sum_probs=208.2 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCC-------------HHHHHHHHHHCCEEEE Q ss_conf 999999999999986199899996277630012101477416885055720-------------1344557442020356 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQ-------------PAVSRFLKYWKPDCMI 133 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~-------------~~~~~fl~~~~P~~~i 133 (440) |=-..+..|.+.|.+++.+|.|-|.+..+.... ....+.+.+.++|+... ...++++...+||++- T Consensus 15 G~e~~v~~La~~L~~~Gh~V~Vit~~~~~~~~~-~~~~~g~~V~~~p~~~~~~~~~~~~~~~~~~~~r~~~~~~~~DIIH 93 (398) T cd03796 15 GVETHIYQLSQCLIKRGHKVVVITHAYGNRVGI-RYLTNGLKVYYLPFVVFYNQSTLPTFFGTFPLLRNILIRERITIVH 93 (398) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCC-EEECCCCEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEE T ss_conf 779999999999997699899996899988774-6853884699756633456311677988899999999767998899 Q ss_pred EECCCCHHH----HHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHH-HHCC--CCEE Q ss_conf 404762146----78887422455058641266542121001111233102644236641577888998-6088--5058 Q gi|254780402|r 134 LSESDIWPL----TVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYK-ELGA--QKLI 206 (440) Q Consensus 134 ~~e~ElWPn----li~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~-~lG~--~~i~ 206 (440) .-. - .+. .+..++..|+|++.-.-.+....-....+...+.+..+...|.+.+.|+..++.+. ..|. +++. T Consensus 94 ~H~-~-~~~l~~~~~~~ar~~g~~~V~T~H~~~~~~~~~~~~~~~~~~~~l~~~d~vIavS~~~~e~~~~~~~~~~~ki~ 171 (398) T cd03796 94 GHQ-A-FSALAHEALLHARTMGLKTVFTDHSLFGFADASSIHTNKLLRFSLADVDHVICVSHTSKENTVLRASLDPERVS 171 (398) T ss_pred ECC-H-HHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHHCCCCCCEE T ss_conf 896-2-68899999999875599789983443244631499999999999985799999977999999998489941099 Q ss_pred EECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEE-CCCCC--CCCCCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHHH Q ss_conf 720431012344221256899998617986175-04655--33210000000000-123412488970552114788877 Q gi|254780402|r 207 VSGNLKIDTESLPCDKELLSLYQESIAGRYTWA-AISTF--EGEEDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIERR 282 (440) Q Consensus 207 v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~v-agSth--~~EE~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~~ 282 (440) |..|- .|......+.. ....++++++ .|... +| -+.+++|+..+ ++.|++.++|+...|+| +.++++ T Consensus 172 vIpNG-Vd~~~f~p~~~------~~~~~~~~il~vGRL~~~KG-~d~Li~A~~~l~~~~p~~~lvIvGdGp~~-~~L~~l 242 (398) T cd03796 172 VIPNA-VDSSDFTPDPS------KRDNDKITIVVISRLVYRKG-IDLLVGIIPEICKKHPNVRFIIGGDGPKR-ILLEEM 242 (398) T ss_pred EECCC-CCHHHCCCCCC------CCCCCCEEEEEECCCCCHHH-HHHHHHHHHHHHHHCCCEEEEEECCCCCH-HHHHHH T ss_conf 98895-74764488722------15889869999706750300-99999999999965899599993787118-999999 Q ss_pred HHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHH Q ss_conf 65300101100134444466416750676520255422082066135544454583768961994999898443799999 Q gi|254780402|r 283 LIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYR 362 (440) Q Consensus 283 l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~ 362 (440) +++.|+.-...-.+..+ -.++..+|+.||+ ||-.|..+.-|..++||+++|+|||. .++...+|+.+ T Consensus 243 ~~~~~l~~~V~flG~v~-----------~~~l~~~~~~aDv-fv~PS~~Egfglv~lEAmA~G~PVVa-t~vgG~~Evv~ 309 (398) T cd03796 243 REKYNLQDRVELLGAVP-----------HERVRDVLVQGHI-FLNTSLTEAFCIAIVEAASCGLLVVS-TRVGGIPEVLP 309 (398) T ss_pred HHHHCCCCEEEECCCCC-----------HHHHHHHHHHHHE-EECCCCCCCCCHHHHHHHHCCCCEEE-CCCCCCCCEEE T ss_conf 98723367289758885-----------6777888877442-12765424666799999983998998-88998611341 Q ss_pred HHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH Q ss_conf 99978986995798999999999958989999999999999970455699999999975237878 Q gi|254780402|r 363 RMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYVNPLIF 427 (440) Q Consensus 363 ~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l~~~~~ 427 (440) +.+......|+++|++++..+++||+.+++|+++|++.++++- +-+++.+.+.+....++. T Consensus 310 ---~~~~~~~~~d~~~la~~l~~ll~~~~~~~~~~~~~r~~v~~~f-sw~~ia~~~~~vY~~vl~ 370 (398) T cd03796 310 ---PDMILLAEPDVESIVRKLEEAISILRTGKHDPWSFHNRVKKMY-SWEDVAKRTEKVYDRILQ 370 (398) T ss_pred ---CCCEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHC T ss_conf ---8936874899999999999997699999999999999999969-999999999999999981 No 8 >cd03801 GT1_YqgM_like This family is most closely related to the GT1 family of glycosyltransferases and named after YqgM in Bacillus licheniformis about which little is known. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. Probab=99.86 E-value=2.2e-18 Score=140.35 Aligned_cols=328 Identities=18% Similarity=0.134 Sum_probs=199.5 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCC----------CCCHHHHHHHHHHCCEEEEEEC Q ss_conf 999999999999986199899996277630012101477416885055----------7201344557442020356404 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPL----------DIQPAVSRFLKYWKPDCMILSE 136 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~----------D~~~~~~~fl~~~~P~~~i~~e 136 (440) |=-..+..|++.|.+++.+|.|-|....+........ .......... =......++++..+||++..-. T Consensus 15 G~e~~~~~la~~L~~~G~~V~vit~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Dii~~~~ 93 (374) T cd03801 15 GAERHVLELARALAARGHEVTVLTPGDGGLPDEEEVG-GIVVVRPPPLLRVRRLLLLLLLALRLRRLLRRERFDVVHAHD 93 (374) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHCC-CCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECC T ss_conf 1999999999999977998999960798875034237-716995676654200245678999999999855998999788 Q ss_pred CCCHHHHHHHHHHCCCCEEEEE-ECCCCCC----CCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHC-C--CCEEEE Q ss_conf 7621467888742245505864-1266542----121001111233102644236641577888998608-8--505872 Q gi|254780402|r 137 SDIWPLTVFELSKQRIPQVLVN-ARMSRRS----FKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELG-A--QKLIVS 208 (440) Q Consensus 137 ~ElWPnli~~~~~~~ip~~lin-arls~~S----~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG-~--~~i~v~ 208 (440) ...+.-....++..++|++.-- +-..+.. ......+..+.+.+++..|.|++.|+..++.+.+.+ . +++.+. T Consensus 94 ~~~~~~~~~~~~~~~~~~i~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ii~~S~~~~~~l~~~~~~~~~ki~vI 173 (374) T cd03801 94 WLALLAAALAARLLGIPLVLTVHGLEFGRPGNELGLLLKLARALERRALRRADRIIAVSEATREELRELGGVPPEKITVI 173 (374) T ss_pred CHHHHHHHHHHHHCCCCEEEEECCCCHHCCCCCHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCHHHEEEE T ss_conf 31789999999866997899967862100221002568999999999998389999989999999998619985689998 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHCCCCCEEE-CCCCCC-CCCCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHHHHHH Q ss_conf 0431012344221256899998617986175-046553-3210000000000-123412488970552114788877653 Q gi|254780402|r 209 GNLKIDTESLPCDKELLSLYQESIAGRYTWA-AISTFE-GEEDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIERRLIA 285 (440) Q Consensus 209 GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~v-agSth~-~EE~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~~l~~ 285 (440) .|- .|...................++++++ +|...+ .--+.++++++.+ ++.++..++|+...+++ +++.+.+++ T Consensus 174 ~ng-id~~~~~~~~~~~~~~~~~~~~~~~il~vGrl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~g~~~-~~~~~~~~~ 251 (374) T cd03801 174 PNG-VDTERFRPAPRAARRRLGIPEDEPVILFVGRLVPRKGVDLLLEALAKLRKEYPDVRLVIVGDGPLR-EELEALAAE 251 (374) T ss_pred CCC-CCHHHCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEECCCCHH-HHHHHHHHH T ss_conf 897-675541754177898718999982999995332002835789999999852887299995688136-699999997 Q ss_pred HCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHH Q ss_conf 00101100134444466416750676--5202554220820661355444545837689619949998984437999999 Q gi|254780402|r 286 KGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRR 363 (440) Q Consensus 286 ~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~ 363 (440) .++. ..|.+.+.+ .++..+|+.||+ ||-.|..+.-|..++||+++|+|||.- +....+|+. T Consensus 252 ~~l~-------------~~v~f~g~v~~~~~~~~~~~adi-~v~pS~~E~~~~~~lEAma~G~PvI~t-~~gg~~e~i-- 314 (374) T cd03801 252 LGLG-------------DRVTFLGFVPDEDLPALYAAADV-FVLPSLYEGFGLVLLEAMAAGLPVVAS-DVGGIPEVV-- 314 (374) T ss_pred CCCC-------------CEEEECCCCCCHHHHHHHHHHCC-CCCCCHHHHHHHHHHHHHHCCCCEEEC-CCCCHHHHH-- T ss_conf 3998-------------55997586421137788765403-658735543215899999769989997-899758885-- Q ss_pred HHHCCCEEEE--CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 9978986995--7989999999999589899999999999999704556999999 Q gi|254780402|r 364 MVSSGAVRIV--EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLR 416 (440) Q Consensus 364 L~~~g~~~~v--~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~ 416 (440) .+...++.+ .|.++|++++..+++|++.+++|+++|++.+.++-.- +++.+ T Consensus 315 -~~~~~G~l~~~~d~~~l~~~i~~l~~~~~~~~~~~~~~~~~~~~~fs~-~~~~~ 367 (374) T cd03801 315 -EDGETGLLVPPGDPEALAEAILRLLDDPELRRRLGEAARERVAERFSW-DRVAA 367 (374) T ss_pred -CCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCH-HHHHH T ss_conf -189718997899999999999999779999999999999999986899-99999 No 9 >cd03807 GT1_WbnK_like This family is most closely related to the GT1 family of glycosyltransferases. WbnK in Shigella dysenteriae has been shown to be involved in the type 7 O-antigen biosynthesis. Probab=99.86 E-value=5.2e-18 Score=137.92 Aligned_cols=334 Identities=14% Similarity=0.159 Sum_probs=199.7 Q ss_pred ECCHHH-HHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCC-------CHHHHHHHHHHCCEEEEE Q ss_conf 178999-999999999998619989999627763001210147741688505572-------013445574420203564 Q gi|254780402|r 63 ASSVGE-TMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDI-------QPAVSRFLKYWKPDCMIL 134 (440) Q Consensus 63 aaSvGE-~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~-------~~~~~~fl~~~~P~~~i~ 134 (440) .-+.|= -..+..|++.|.+++.++.|-+.+..|. ..+......+.+.+++... .+...++++..+||++.. T Consensus 8 ~l~~GG~e~~~~~la~~L~~~g~~v~vi~~~~~~~-~~~~~~~~~i~v~~l~~~~~~~~~~~~~~l~~~i~~~~~DiIh~ 86 (365) T cd03807 8 GLDVGGAERMLVRLLKGLDRDRFEHVVISLTDRGE-LGEELEEAGVPVYCLGKRPGRPDPGALLRLYKLIRRLRPDVVHT 86 (365) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCC-HHHHHHHCCCEEEECCCCCCCCCHHHHHHHHHHHHHHCCCEEEE T ss_conf 89994189999999999997799499999579985-57898748956999278766568899999999999839999998 Q ss_pred ECCCCHHHHH---HHHHHCCCCEEEEEECCCCCCCCCC--HHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC--CCEEE Q ss_conf 0476214678---8874224550586412665421210--011112331026442366415778889986088--50587 Q gi|254780402|r 135 SESDIWPLTV---FELSKQRIPQVLVNARMSRRSFKNW--KTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA--QKLIV 207 (440) Q Consensus 135 ~e~ElWPnli---~~~~~~~ip~~linarls~~S~~~~--~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~--~~i~v 207 (440) ... .++++ ......++|++ ...+-+......+ .+...+.+.+....|.+++.|+..++.+.+.|. +++.| T Consensus 87 ~~~--~~~~~~~l~~~~~~~~~~i-~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~ii~~S~~~~~~~~~~~~~~~~~~v 163 (365) T cd03807 87 WMY--HADLYGGLAARLAGVPPVI-WGIRHSDLDLGKKSTRLVARLRRLLSSFIPLIVANSAAAAEYHQAIGYPPKKIVV 163 (365) T ss_pred CCC--CHHHHHHHHHHHCCCCCEE-EEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHHCCCHHHEEE T ss_conf 774--2679999999975998289-9956885321010579999999999842999999499999999981998456899 Q ss_pred ECCCCCCCCCCCCCHHHHHHHHHH---CCCCCEEE-CCCCCCC-CCCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHH Q ss_conf 204310123442212568999986---17986175-0465533-210000000000-12341248897055211478887 Q gi|254780402|r 208 SGNLKIDTESLPCDKELLSLYQES---IAGRYTWA-AISTFEG-EEDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIER 281 (440) Q Consensus 208 ~GnlK~d~~~~~~~~~~~~~~~~~---~~~r~v~v-agSth~~-EE~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~ 281 (440) ..|- .|......+.......... ..++++++ +|+.++. --+.+++|+..+ ++.++..++|+...+++.+.... T Consensus 164 I~ng-id~~~~~~~~~~~~~~~~~~~~~~~~~~i~~vGrl~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~ 242 (365) T cd03807 164 IPNG-VDTERFSPDLDARARLREELGLPEDTFLIGIVARLHPQKDHATLLRAAALLLKKFPNARLLLVGDGPDRANLELL 242 (365) T ss_pred ECCC-CCHHHCCCCHHHHHHHHHHHCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHH T ss_conf 8998-678866987036799999829998886999950465310156788999998875888279998378558899998 Q ss_pred HHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHH Q ss_conf 76530010110013444446641675067652025542208206613554445458376896199499989844379999 Q gi|254780402|r 282 RLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIY 361 (440) Q Consensus 282 ~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~ 361 (440) ..++.|+. ..|.+....-++..+|+.|| +||-.|..+.-|..++||+++|+|||. ......+|+. T Consensus 243 ~~~~~~l~-------------~~v~f~G~~~d~~~~~~~ad-i~v~pS~~Egf~~~~lEAma~G~PvI~-s~~gg~~eii 307 (365) T cd03807 243 ALKELGLE-------------DKVILLGERSDVPALLNALD-VFVLSSLSEGFPNVLLEAMACGLPVVA-TDVGDNAELV 307 (365) T ss_pred HHHHCCCC-------------CEEEECCCCCCHHHHHHHCC-CCCCCCCCCCCCHHHHHHHHCCCCEEE-CCCCCCHHEE T ss_conf 99975998-------------73999366331899998716-033887000533279999985999998-6799841145 Q ss_pred HHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 99997898699579899999999995898999999999999997045569999999997 Q gi|254780402|r 362 RRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDS 420 (440) Q Consensus 362 ~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~ 420 (440) + +.|......|.++|++++.++++|++.+++|+++|++.+.++- +.+++++...+ T Consensus 308 ~---~~G~l~~~~d~~~l~~~i~~l~~~~~~~~~~~~~a~~~~~~~f-s~~~~~~~~~~ 362 (365) T cd03807 308 G---DTGFLVPPGDPEALAEAIEALLADPALRQALGEAARERIEENF-SIEAMVEAYEE 362 (365) T ss_pred C---CCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC-CHHHHHHHHHH T ss_conf 1---7679977999999999999997799999999999999999868-99999999999 No 10 >cd03820 GT1_amsD_like This family is most closely related to the GT1 family of glycosyltransferases. AmSD in Erwinia amylovora has been shown to be involved in the biosynthesis of amylovoran, the acidic exopolysaccharide acting as a virulence factor. This enzyme may be responsible for the formation of galactose alpha-1,6 linkages in amylovoran. Probab=99.85 E-value=1.1e-18 Score=142.27 Aligned_cols=317 Identities=17% Similarity=0.141 Sum_probs=197.5 Q ss_pred HHHHHHHHHHHHHHHCCCCE-EEEECCCCCCHHHHCCCCCCEEEEECC----------CCCCHHHHHHHHHHCCEEEEEE Q ss_conf 99999999999998619989-999627763001210147741688505----------5720134455744202035640 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNV-LLTTMTATSAKVARKYLGQYAIHQYAP----------LDIQPAVSRFLKYWKPDCMILS 135 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~i-liT~~T~tg~~~~~~~~~~~~~~~y~P----------~D~~~~~~~fl~~~~P~~~i~~ 135 (440) |=-..+..|++.|.+++.+| ++|+... +.... ...+.+.+..++ +......+++++..+||+++.. T Consensus 14 G~e~~~~~la~~L~~~G~~V~vit~~~~-~~~~~--~~~~~i~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~Dvi~~~ 90 (348) T cd03820 14 GAERVLSNLANALAEKGHEVTIISLDKG-EPPFY--ELDPKIKVIDLGDKRDSKLLARFKKLRRLRKLLKNNKPDVVISF 90 (348) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEEECCC-CCCCC--CCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 7899999999999877998999996699-98644--05897499988876542056789999999999997599999998 Q ss_pred CCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCC Q ss_conf 47621467888742245505864126654212100111123310264423664157788899860885058720431012 Q gi|254780402|r 136 ESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDT 215 (440) Q Consensus 136 e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~ 215 (440) ... ++.++.....+++|++.. -+.+........+...+.+..++..|.+.+.|+.+...+...+.+++.+..|.- |. T Consensus 91 ~~~-~~~~~~~~~~~~~~~i~~-~H~~~~~~~~~~~~~~~~~~~~~~~~~ii~~S~~~~~~~~~~~~~k~~vI~N~v-~~ 167 (348) T cd03820 91 LTS-LLTFLASLGLKIVKLIVS-EHNSPDAYKKRLRRLLLRRLLYRRADAVVVLTEEDRALYYKKFNKNVVVIPNPL-PF 167 (348) T ss_pred CCC-HHHHHHHHHCCCCCEEEE-ECCCHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCCCCEEEECCCC-CH T ss_conf 963-699999997599828999-757856630136799999999986899999699999987523788989988998-82 Q ss_pred CCCCCCHHHHHHHHHHCCCCCEEECCCCCC-CCCCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHH Q ss_conf 344221256899998617986175046553-3210000000000-12341248897055211478887765300101100 Q gi|254780402|r 216 ESLPCDKELLSLYQESIAGRYTWAAISTFE-GEEDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARR 293 (440) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~r~v~vagSth~-~EE~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~ 293 (440) ...... .....+.++.+|...+ .--+.++++++.+ ++.|++.++|+..-+++ +++.+.+++.++.- T Consensus 168 ~~~~~~--------~~~~~~~il~vGRl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~G~~~-~~l~~~i~~~~l~~--- 235 (348) T cd03820 168 PPEEPS--------SDLKSKRILAVGRLVPQKGFDLLIEAWAKIAKKHPDWKLRIVGDGPER-EALEALIKELGLED--- 235 (348) T ss_pred HHCCCC--------CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCC-CCHHHHHHHHCCCC--- T ss_conf 322654--------466798899993786324949998888988864898599999468753-20156777633577--- Q ss_pred CCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCC-CEEE Q ss_conf 134444466416750676520255422082066135544454583768961994999898443799999999789-8699 Q gi|254780402|r 294 SRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSG-AVRI 372 (440) Q Consensus 294 S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g-~~~~ 372 (440) .|.+....-++..+|+.||+ ||-.|..+.-|-.++||+++|+|||.-+......|+ +..| .+.. T Consensus 236 ----------~v~~~G~~~~~~~~~~~adv-~v~pS~~Egfgl~~lEAma~G~PvIas~~~gg~~e~----v~~~~~G~l 300 (348) T cd03820 236 ----------RVILLGFTKNIEEYYAKASI-FVLTSRFEGFPMVLLEAMAFGLPVISFDCPTGPSEI----IEDGVNGLL 300 (348) T ss_pred ----------EEEECCCCCCCCCCCCCCCC-CCCCCCCCCCCCHHHHHHHCCCCEEEECCCCCHHHH----HCCCCEEEE T ss_conf ----------36424752222332213575-314641245870899999869999996799884999----538962999 Q ss_pred E--CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 5--798999999999958989999999999999970455699999999 Q gi|254780402|r 373 V--EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSL 418 (440) Q Consensus 373 v--~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i 418 (440) + .|+++|++++..+++||+.+++|+++|++.+++ =+-+++++.- T Consensus 301 ~~~~d~~~la~~i~~ll~~~~~~~~~~~~a~~~~~~--fsw~~i~~~y 346 (348) T cd03820 301 VPNGDVEALAEALLRLMEDEELRKRMGANARESAER--FSIENIIKQW 346 (348) T ss_pred ECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH--CCHHHHHHHH T ss_conf 889999999999999977999999999999999996--9999999996 No 11 >cd03811 GT1_WabH_like This family is most closely related to the GT1 family of glycosyltransferases. WabH in Klebsiella pneumoniae has been shown to transfer a GlcNAc residue from UDP-GlcNAc onto the acceptor GalUA residue in the cellular outer core. Probab=99.85 E-value=1.2e-18 Score=142.07 Aligned_cols=321 Identities=18% Similarity=0.171 Sum_probs=193.4 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCC-CCEEEEECCCCCC------HHHHHHHHHHCCEEEEEECCCC Q ss_conf 99999999999998619989999627763001210147-7416885055720------1344557442020356404762 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLG-QYAIHQYAPLDIQ------PAVSRFLKYWKPDCMILSESDI 139 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~-~~~~~~y~P~D~~------~~~~~fl~~~~P~~~i~~e~El 139 (440) |=-..+..|++.|.+++.++-|-|++.++......... ........+.... ...+++++..+||+++.... . T Consensus 13 G~e~~~~~la~~L~~~G~~V~vi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DiI~~~~~-~ 91 (353) T cd03811 13 GAERVLLNLANGLDKRGYDVTLVVLRDEGDYLELLPSNVKLIPVRVLKLKSLRDLLAILRLRRLLRKEKPDVVISHLT-T 91 (353) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEEECCCCCCHHHCCCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHCCCCEEEECCC-C T ss_conf 599999999999997799799999779985133305673388613556553325999999999999749989999886-2 Q ss_pred HHHHH-HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHH-CC--CCEEEECCCCCCC Q ss_conf 14678-88742245505864126654212100111123310264423664157788899860-88--5058720431012 Q gi|254780402|r 140 WPLTV-FELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKEL-GA--QKLIVSGNLKIDT 215 (440) Q Consensus 140 WPnli-~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~l-G~--~~i~v~GnlK~d~ 215 (440) ++.+. ..++..++|++..-.-........+.+...+.+..++..|.+++.|+..++.+.+. +. +++.+..|- .|. T Consensus 92 ~~~~~~~~~~~~~~~~i~~~h~~~~~~~~~~~~~~~~~~~~~~~~d~ii~~S~~~~~~~~~~~~~~~~~i~vI~Ng-vd~ 170 (353) T cd03811 92 TPNVLALLAARLGTKLIVWEHNSLSLELKRKLRLLLLIRKLYRRADKIVAVSEGVKEDLLKLLGIPPDKIEVIYNP-IDI 170 (353) T ss_pred HHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCHHHEEEECCC-CCH T ss_conf 7899999997479978999798704432334669999999998689999959999999998619985689997675-686 Q ss_pred CCCCCCHHHHHHHHHHCCCCCEEE-CCCCCCC-CCCCCCCCCCCCC-CCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHH Q ss_conf 344221256899998617986175-0465533-2100000000001-234124889705521147888776530010110 Q gi|254780402|r 216 ESLPCDKELLSLYQESIAGRYTWA-AISTFEG-EEDKAVYVHNFIK-CRTDVLTIIVPRHPRRCDAIERRLIAKGLKVAR 292 (440) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~r~v~v-agSth~~-EE~~il~a~~~l~-~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~ 292 (440) ......... ........++++++ +|...+. --+.+++|++.++ +.++..++|+...|++ +++.+.+++.+++- T Consensus 171 ~~~~~~~~~-~~~~~~~~~~~~il~vGrl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~G~~~-~~l~~~~~~~~l~~-- 246 (353) T cd03811 171 EEIRALAEE-PLELGIPPDGPVILAVGRLSPQKGFDTLIRAFALLRKEGPDARLVILGDGPLR-EELEALAKELGLAD-- 246 (353) T ss_pred HHHCHHHHH-HHHHCCCCCCEEEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEECCCCHHH-HHHHHHHHHCCCCC-- T ss_conf 762324565-45306889986999982076642299999999976641873799974786039-99997788659860-- Q ss_pred HCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEE Q ss_conf 01344444664167506765202554220820661355444545837689619949998984437999999997898699 Q gi|254780402|r 293 RSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRI 372 (440) Q Consensus 293 ~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~ 372 (440) .|.+....-++..+|+.||+ ||-.|..+.-|..++||+++|+|||. .+....+|..+. -..|.... T Consensus 247 -----------~V~~~G~~~d~~~~~~~~di-~v~pS~~Egfg~~~lEAma~G~pvI~-s~~gg~~e~i~~-g~~G~l~~ 312 (353) T cd03811 247 -----------RVHFLGFQSNPYPYLKAADL-FVLSSRYEGFPNVLLEAMALGTPVVA-TDCPGPREILED-GENGLLVP 312 (353) T ss_pred -----------EEEECCCCCCHHHHHHHCCE-EECCCCCCCCCHHHHHHHHCCCCEEE-CCCCCHHHHHCC-CCCEEEEC T ss_conf -----------68754866430232420868-87154346885389999980998999-489984898448-98389978 Q ss_pred ECCHHHHHHHHHHH---HCCHHHHHHHHHHHHHHHHHC Q ss_conf 57989999999999---589899999999999999704 Q gi|254780402|r 373 VEEVGTLADMVYSL---LSEPTIRYEMINAAINEVKKM 407 (440) Q Consensus 373 v~d~~eL~~~l~~l---l~d~~~~~~~~~~a~~~i~~~ 407 (440) ..|+++|++++..+ ++||+.|++||++|++.++++ T Consensus 313 ~~d~~~la~~i~~l~~l~~~~~~~~~~g~~~~~~~~~~ 350 (353) T cd03811 313 VGDEAALAAAALALLDLLLDPELRERLAAAARERVARE 350 (353) T ss_pred CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999851499999999999999999986 No 12 >cd03795 GT1_like_4 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP-linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=99.85 E-value=6.8e-19 Score=143.71 Aligned_cols=316 Identities=17% Similarity=0.104 Sum_probs=189.0 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEE---------EECCCCCCHHHHHHHHHHCCEEEEEECC Q ss_conf 9999999999999861998999962776300121014774168---------8505572013445574420203564047 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIH---------QYAPLDIQPAVSRFLKYWKPDCMILSES 137 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~---------~y~P~D~~~~~~~fl~~~~P~~~i~~e~ 137 (440) |=-..+..|.++|.+++.++-|-|..+...... .. .+.... .+.|+-+....+......+||++.+--. T Consensus 15 G~e~~~~~L~~~L~~~Gh~V~v~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Diih~h~~ 92 (357) T cd03795 15 GIEQVIRDLAEGLAARGIEVAVLCASPEPKGRD-EE-RNGHRVIRAPSLLNVASTPFSPSFFKQLKKLAKKADVIHLHFP 92 (357) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCC-CC-CCCCEEEECCCCCCCCCCHHHHHHHHHHHHHCCCCCEEEECCC T ss_conf 299999999999997799899998279887765-02-5884799877433334420469999999997259999999476 Q ss_pred CCHH--HHHHHHHHCCCCEEEEE-E-CCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC--CCEEEECCC Q ss_conf 6214--67888742245505864-1-2665421210011112331026442366415778889986088--505872043 Q gi|254780402|r 138 DIWP--LTVFELSKQRIPQVLVN-A-RMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA--QKLIVSGNL 211 (440) Q Consensus 138 ElWP--nli~~~~~~~ip~~lin-a-rls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~--~~i~v~Gnl 211 (440) || .+...+...++|+++.- + -+.++ ..+..++++.+.++++.|.|++.|+..++.+..++. +++.+..|- T Consensus 93 --~~~~~~~~~~~~~~~~~v~t~H~~~~~~~--~~~~~~~~~~~~~~~~ad~ii~~S~~~~~~~~~~~~~~~k~~vIpng 168 (357) T cd03795 93 --NPLADLALLLLPRKKPVVVHWHSDIVKQK--LLLKLYRPLQRRFLRRADAIVATSPNYAETSPVLRRFRDKVRVIPLG 168 (357) T ss_pred --CHHHHHHHHHHHCCCCEEEEECCCCCCCH--HHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCCCEEEECCC T ss_conf --35999999998579979999878853205--67999999999999848999988999999999844776768998897 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCEEEC-CCCCCC-CCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCH Q ss_conf 10123442212568999986179861750-465533-2100000000001234124889705521147888776530010 Q gi|254780402|r 212 KIDTESLPCDKELLSLYQESIAGRYTWAA-ISTFEG-EEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLK 289 (440) Q Consensus 212 K~d~~~~~~~~~~~~~~~~~~~~r~v~va-gSth~~-EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~ 289 (440) .|....+..............++++++. |...+. --+.+++|.+ ..++..++|+...+++ +++.+.+++.++. T Consensus 169 -vd~~~~~~~~~~~~~~~~~~~~~~~i~~vGrl~~~Kg~~~li~a~~---~l~~~~l~i~G~G~~~-~~l~~~~~~~~~~ 243 (357) T cd03795 169 -LDPARYPRPDALEEAIWRRAAGRPFFLFVGRLVYYKGLDVLLEAAA---ALPDAPLVIVGEGPLE-AELEALAAALGLL 243 (357) T ss_pred -CCCCCCCCCCHHHHHHHHCCCCCCEEEEEECCCHHCCCHHHHHHHH---HCCCEEEEEEECCCCC-CCCCCHHHHCCCC T ss_conf -6623368873678887403589978999807804309578998987---6989099999567542-2210005551875 Q ss_pred HHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCC--CCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHH Q ss_conf 1100134444466416750676--52025542208206613554--4454583768961994999898443799999999 Q gi|254780402|r 290 VARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFC--ASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMV 365 (440) Q Consensus 290 ~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~--~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~ 365 (440) - .|.+...+ -++..+|+.||+ ||-.|.. +.-|-.++||+++|+|||.-. +....+....=. T Consensus 244 ~-------------~V~f~G~~~~~~~~~~~~~adi-~v~pS~~~~Egfg~~~lEAma~G~PVVat~-~gg~~~~i~~~~ 308 (357) T cd03795 244 D-------------RVRFLGRLDDEEKAALLAACDV-FVFPSVERSEAFGIVLLEAMAFGKPVISTE-IGTGGSYVNLHG 308 (357) T ss_pred C-------------EEEECCCCCHHHHHHHHHCCCE-EEEECCHHCCCCCHHHHHHHHCCCCEEEEC-CCCCHHHHEECC T ss_conf 1-------------4752586514557988626878-999464021356667999998799899935-999815605569 Q ss_pred HCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC Q ss_conf 7898699579899999999995898999999999999997045 Q gi|254780402|r 366 SSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 366 ~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ 408 (440) .+|......|.++|++++..+++||+.|++|+++|++.++++- T Consensus 309 ~~G~l~~~~d~~~l~~~i~~ll~~~~~~~~m~~~ar~~~~~~f 351 (357) T cd03795 309 VTGLVVPPGDPAALAEAIRRLLEDPELRERLGEAARERAEEEF 351 (357) T ss_pred CEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC T ss_conf 5799978999999999999997799999999999999999857 No 13 >cd03794 GT1_wbuB_like This family is most closely related to the GT1 family of glycosyltransferases. wbuB in E. coli is involved in the biosynthesis of the O26 O-antigen. It has been proposed to function as an N-acetyl-L-fucosamine (L-FucNAc) transferase. Probab=99.84 E-value=3.5e-18 Score=139.03 Aligned_cols=321 Identities=16% Similarity=0.148 Sum_probs=185.8 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHH-----HHCCCCCCEEEEECCC---CCCHHHHH---------------H Q ss_conf 99999999999998619989999627763001-----2101477416885055---72013445---------------5 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKV-----ARKYLGQYAIHQYAPL---DIQPAVSR---------------F 123 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~-----~~~~~~~~~~~~y~P~---D~~~~~~~---------------f 123 (440) +| ..+..|.++|.+++.+|-|-|+++..... ......+.+.+..+|. ......++ . T Consensus 16 ~~-~~~~~la~~L~~~Gh~V~v~t~~~~~~~~~~~~~~~~~~~~gi~v~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (394) T cd03794 16 GA-FRTTELAEELVKRGHEVTVITGSPNYPSGKIYKGYKREEVDGVRVHRVPLPPYKKNGLLKRLLNYLSFALSALLALL 94 (394) T ss_pred HH-HHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEEEECCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99-99999999999779979999547877643235666446648859999337766775278899999999999999999 Q ss_pred HHHHCCEEEEEECCC-C-HHHHHHHHHHCCCCEEEEEE----------CCCCCCCCCCHHHHHHHHHCCCCCCEEEECCH Q ss_conf 744202035640476-2-14678887422455058641----------26654212100111123310264423664157 Q gi|254780402|r 124 LKYWKPDCMILSESD-I-WPLTVFELSKQRIPQVLVNA----------RMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSE 191 (440) Q Consensus 124 l~~~~P~~~i~~e~E-l-WPnli~~~~~~~ip~~lina----------rls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~ 191 (440) .+..+||+++....- + +......+++.++|+++--- .......-.++.+..+.+.+++..|.+.+.|+ T Consensus 95 ~~~~~~Div~~~~~~~~~~~~~~~~~~~~~~p~v~~~hd~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ad~vi~~S~ 174 (394) T cd03794 95 KRRRRPDVIIATSPPLLIALAALLLARLKGAPFVLEVRDLWPESAVALGLLKNGSLLYRLLRKLERLIYRRADAIVVISP 174 (394) T ss_pred HHCCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHCCEEEECCH T ss_conf 85589988999178478899999999863996999968744678998367444448999999999999984899997729 Q ss_pred HHHHHHHHHCC--CCEEEECCCCCCCC-CCCCCHHHHHHHHHHCCCCCEEEC-CCCCCC-CCCCCCCCCCCCCCCCCEEE Q ss_conf 78889986088--50587204310123-442212568999986179861750-465533-21000000000012341248 Q gi|254780402|r 192 RYFRRYKELGA--QKLIVSGNLKIDTE-SLPCDKELLSLYQESIAGRYTWAA-ISTFEG-EEDKAVYVHNFIKCRTDVLT 266 (440) Q Consensus 192 ~~~~rl~~lG~--~~i~v~GnlK~d~~-~~~~~~~~~~~~~~~~~~r~v~va-gSth~~-EE~~il~a~~~l~~~~~~~l 266 (440) ..++.+.+.|. +++.+..|- .|.. ..+..............++++++. |+..+. .-+.++++.+.+++.+++.+ T Consensus 175 ~~~~~~~~~~~~~~~i~vipng-vd~~~~~~~~~~~~~~~~~~~~~~~~i~~~Grl~~~kg~~~li~a~~~l~~~~~~~l 253 (394) T cd03794 175 GMREYLVRRGVPPEKISVIPNG-VDLELFKPPPADESLRKELGLDDKFVVLYAGNIGRAQGLDTLLEAAALLKDRPDIRF 253 (394) T ss_pred HHHHHHHHHCCCCCCEEEEECC-CCHHHCCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHCCCCCEEE T ss_conf 9999999848992309999476-257652777504777874268998599996111000363799999997455898599 Q ss_pred EEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCCCCCCC-C---HHH Q ss_conf 897055211478887765300101100134444466416750676--5202554220820661355444545-8---376 Q gi|254780402|r 267 IIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFCASGGQ-N---PLE 340 (440) Q Consensus 267 IIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~~~GGh-N---~lE 340 (440) +|+..-|++ +++.+.+...+++ +|.+...+ .++..+|+.||+.++=-+-...++- . ++| T Consensus 254 ~ivG~G~~~-~~l~~~~~~~~~~--------------~V~~~G~v~~~~~~~~~~~adi~v~p~~~~~~~~~~~P~kllE 318 (394) T cd03794 254 LIVGDGPEK-EELKELAKALGLD--------------NVTFLGRVPKEELPELLAAADVGLVPLKPGPAFEGVSPSKLFE 318 (394) T ss_pred EEECCCHHH-HHHHHHHHHCCCC--------------EEEECCCCCCCHHHHHHHCCCEEEEECCCCCCCCCCCHHHHHH T ss_conf 995685167-8999999981999--------------4998163046136778742969999277754457735689999 Q ss_pred HHHHCCCEEECCCCCCHHHHHHHHHHCCCEEE--ECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC Q ss_conf 89619949998984437999999997898699--579899999999995898999999999999997045 Q gi|254780402|r 341 AAMLGCAILSGPNVENFRDIYRRMVSSGAVRI--VEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 341 paa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~--v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ 408 (440) |+++|+|||... .....|... +.+.+.. ..|+++|++++..+++|++.+++|+++|++.++++- T Consensus 319 Ama~G~PVV~s~-~gg~~e~i~---~~~~G~l~~~~d~~~la~~i~~ll~d~~~~~~~~~~ar~~~~~~f 384 (394) T cd03794 319 YMAAGKPVLASV-DGESAELVE---EAGAGLVVPPGDPEALAAAILELLDDPEERAEMGENGRRYVEEKF 384 (394) T ss_pred HHHCCCCEEEEC-CCCHHHHEE---CCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC T ss_conf 998499799958-998077612---188089977999999999999997799999999999999999858 No 14 >cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides. Probab=99.84 E-value=1.1e-17 Score=135.90 Aligned_cols=321 Identities=17% Similarity=0.150 Sum_probs=191.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEE-EECCCCCCHHHHCCCCCCEEEEECCCCCC-----------HHHHHHHHHHCCEEEE Q ss_conf 99999999999999861998999-96277630012101477416885055720-----------1344557442020356 Q gi|254780402|r 66 VGETMALIGLIPAIRSRHVNVLL-TTMTATSAKVARKYLGQYAIHQYAPLDIQ-----------PAVSRFLKYWKPDCMI 133 (440) Q Consensus 66 vGE~~~a~pli~~l~~~~~~ili-T~~T~tg~~~~~~~~~~~~~~~y~P~D~~-----------~~~~~fl~~~~P~~~i 133 (440) =|=-..+..|++.|.+++.+|.| |...+. ..... ...+....+|+... ....++++..+||++. T Consensus 10 GG~e~~~~~La~~L~~~Gh~V~vit~~~~~-~~~~~---~~gv~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~pDvIh 85 (359) T cd03808 10 GGLYSFRLPLIKALRAAGYEVHVVAPPGDE-LEELE---ALGVKVIPIPLDRRGINPFKDLKALLRLYRLLRKERPDIVH 85 (359) T ss_pred CHHHHHHHHHHHHHHHCCCEEEEEEECCCC-CCHHH---HCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEE T ss_conf 659999999999999769999999707987-43367---57988999278777788699999999999999984998999 Q ss_pred EECCCCHHHHHH--HHHHCCCCEEEEE--EC--CCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC---CC Q ss_conf 404762146788--8742245505864--12--665421210011112331026442366415778889986088---50 Q gi|254780402|r 134 LSESDIWPLTVF--ELSKQRIPQVLVN--AR--MSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA---QK 204 (440) Q Consensus 134 ~~e~ElWPnli~--~~~~~~ip~~lin--ar--ls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~---~~ 204 (440) .-.. -|+++. .++..+.|.++.. +- ...++...+..+..+.+..++..|.+++.|+..++.+.+.+. ++ T Consensus 86 ~~~~--~~~~~~~la~~~~~~~~~v~~~h~~~~~~~~~~~~~~~~~~~~k~~~~~~~~ii~~S~~~~~~~~~~~~~~~~~ 163 (359) T cd03808 86 THTP--KPGILGRLAARLAGVPKVIYTVHGLGFVFTSGGLKRRLYLLLERLALRFTDKVIFQNEDDRDLALKLGIIKKKK 163 (359) T ss_pred EECC--CHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHHHHCCCCCCC T ss_conf 9065--13578999998649986999956774012454778999999999999649999994989999999837997460 Q ss_pred -EEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEE-CCCCCCC-CCCCCCCCCCCCC-CCCCEEEEEEECCCCHHHHHH Q ss_conf -58720431012344221256899998617986175-0465533-2100000000001-234124889705521147888 Q gi|254780402|r 205 -LIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWA-AISTFEG-EEDKAVYVHNFIK-CRTDVLTIIVPRHPRRCDAIE 280 (440) Q Consensus 205 -i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~v-agSth~~-EE~~il~a~~~l~-~~~~~~lIIvPRhpeR~~~i~ 280 (440) +.+.+|- .|......... ....++++++ +|...+. --+.++++++.++ +.++..++|+..-+++..... T Consensus 164 ~~~i~~~g-vd~~~~~~~~~------~~~~~~~~i~~vGrl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~g~~~~~~~~ 236 (359) T cd03808 164 TVLIPGSG-VDLDRFSPSPE------PIPEDDPVFLFVARLLKDKGIDELLEAARILKAKGPNVRLLLVGDGDEENPAAI 236 (359) T ss_pred EEEECCCC-CCHHHCCCCCC------CCCCCCEEEEEECCCCHHCCHHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHH T ss_conf 89977997-68665386654------689898499998046322073999999999986399808999768872589999 Q ss_pred HHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHH Q ss_conf 77653001011001344444664167506765202554220820661355444545837689619949998984437999 Q gi|254780402|r 281 RRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDI 360 (440) Q Consensus 281 ~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~ 360 (440) ..+++.++. ..|......-++..+|+.||+ ||-.|..+.-|..++||+++|+|||.- +....+|+ T Consensus 237 ~~~~~~~~~-------------~~v~f~G~~~~~~~~~~~~di-~v~pS~~Egf~~~~lEAma~G~PvI~s-~~gg~~e~ 301 (359) T cd03808 237 LEIEKLGLE-------------GRVEFLGFRDDVPELLAAADV-FVLPSYREGLPRVLLEAMAMGRPVIAT-DVPGCREA 301 (359) T ss_pred HHHHHCCCC-------------CEEEEECCHHHHHHHHHHCCC-CCCCCCCCCCCCCHHHHHHCCCCEEEC-CCCCHHHH T ss_conf 999971889-------------869980757789999996021-578752135784289998669989994-89972888 Q ss_pred HHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 99999789869957989999999999589899999999999999704556999999 Q gi|254780402|r 361 YRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLR 416 (440) Q Consensus 361 ~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~ 416 (440) ... -++|......|+++|++++..+++|++.+++|+++|++.+.++-.. +++++ T Consensus 302 i~~-~~~G~l~~~~d~~~la~~i~~ll~~~~~~~~~~~~a~~~~~~~fs~-~~~~~ 355 (359) T cd03808 302 VID-GVNGFLVPPGDAEALADAIERLIEDPELRARMGQAARKRAEEEFDE-EIVVK 355 (359) T ss_pred HHC-CCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCH-HHHHH T ss_conf 607-9818998999999999999999889999999999999999987799-99999 No 15 >cd04962 GT1_like_5 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=99.83 E-value=3.5e-17 Score=132.45 Aligned_cols=329 Identities=20% Similarity=0.231 Sum_probs=198.4 Q ss_pred HHHHHHHHHHHHHHHCCCCE-EEEECCCCCCHHHHCCCCCCEEE----------EECCCCCC--HHHHHHHHHHCCEEEE Q ss_conf 99999999999998619989-99962776300121014774168----------85055720--1344557442020356 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNV-LLTTMTATSAKVARKYLGQYAIH----------QYAPLDIQ--PAVSRFLKYWKPDCMI 133 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~i-liT~~T~tg~~~~~~~~~~~~~~----------~y~P~D~~--~~~~~fl~~~~P~~~i 133 (440) |=-..+..|.+.|.+++.+| ++|+..+.-.+ ...+....| .+-++|.. ....+.++.++||++- T Consensus 13 G~e~~v~~La~~L~~~GHeV~vit~~~~~~~~---~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~DvvH 89 (371) T cd04962 13 GSGVVATELGKALARRGHEVHFITSSRPFRLD---EYSPNIFFHEVEVPQYPLFQYPPYDLALASKIAEVAKRYKLDLLH 89 (371) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCC---CCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCEEE T ss_conf 69999999999999759999999568987655---568973799846877653446721378999999999973998899 Q ss_pred EECCCCHHH----HH--HHHHHCCCCEEEE-EECCCCCCC-CCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC--C Q ss_conf 404762146----78--8874224550586-412665421-210011112331026442366415778889986088--5 Q gi|254780402|r 134 LSESDIWPL----TV--FELSKQRIPQVLV-NARMSRRSF-KNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA--Q 203 (440) Q Consensus 134 ~~e~ElWPn----li--~~~~~~~ip~~li-narls~~S~-~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~--~ 203 (440) .- + -+|+ ++ ..++..++|++.. .+. +-+. .....+..+.+..+.+.|.+.+.|+..++.+.+.+. + T Consensus 90 ~h-~-~~p~~~~~~l~~~~~~~~~~~~v~t~H~~--~~~~~~~~~~~~~~~~~~~~~a~~vi~~S~~~~~~~~~~~~~~~ 165 (371) T cd04962 90 VH-Y-AVPHAVAAYLAREILGKKDLPVVTTLHGT--DITLVGQDPSFQPATRFSIEKSDGVTAVSESLRQETYELFDITK 165 (371) T ss_pred EC-C-CCCHHHHHHHHHHHHHCCCCCEEEEECCC--CHHHHCCCHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCC T ss_conf 72-5-54267999999998644799789993798--55642147456689999997589899899999999999609998 Q ss_pred CEEEECCCCCCCCC-CCCCHHHHHHHHHHCCCCCEEECCCCCCCC--CCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHH Q ss_conf 05872043101234-422125689999861798617504655332--100000000001234124889705521147888 Q gi|254780402|r 204 KLIVSGNLKIDTES-LPCDKELLSLYQESIAGRYTWAAISTFEGE--EDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIE 280 (440) Q Consensus 204 ~i~v~GnlK~d~~~-~~~~~~~~~~~~~~~~~r~v~vagSth~~E--E~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~ 280 (440) ++.+..|- .|... .+..............+.++++.-+....+ -+.+++++..+.+..+..++|+..-|++ +++. T Consensus 166 ~i~vI~Ng-vd~~~f~~~~~~~~~~~~~~~~~~~~i~~vgrl~~~Kg~~~ll~a~~~l~~~~~~~l~ivG~G~~~-~~~~ 243 (371) T cd04962 166 EIEVIPNF-VDEDRFRPKPDEALKRRLGAPEGEKVLIHISNFRPVKRIDDVIRIFAKVRKEVPARLLLVGDGPER-SPAE 243 (371) T ss_pred CEEEECCC-CCHHHCCCCCCHHHHHHHCCCCCCEEEEEECCCCHHCCHHHHHHHHHHHHHCCCEEEEEECCCHHH-HHHH T ss_conf 88998797-573214888507899970999898599994265021496999999998630576599998264037-7888 Q ss_pred HHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHH Q ss_conf 77653001011001344444664167506765202554220820661355444545837689619949998984437999 Q gi|254780402|r 281 RRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDI 360 (440) Q Consensus 281 ~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~ 360 (440) +++++.|+.- .|......-++..+|+.||+ ||-.|..+.-|-.++||+++|+|||.- +.....|. T Consensus 244 ~~~~~~~l~~-------------~V~f~G~~~~~~~~~~~adi-~v~pS~~E~fg~vilEAma~G~PvIat-~~gg~~e~ 308 (371) T cd04962 244 RLARELGLQD-------------DVLFLGKQDHVEELLSIADL-FLLPSEKESFGLAALEAMACGVPVVAS-NAGGIPEV 308 (371) T ss_pred HHHHHHCCCC-------------EEEECCCHHHHHHHHHHCCC-CCCCCHHHHHHHHHHHHHHCCCCEEEC-CCCCHHHH T ss_conf 8887621031-------------36632736559999985511-138732443202599999569949986-89983899 Q ss_pred HHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 9999978986995798999999999958989999999999999970455699999999975 Q gi|254780402|r 361 YRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSY 421 (440) Q Consensus 361 ~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~ 421 (440) .++ -.+|......|+++|++++..+++||+.+++|+++|++.+.++-.. +++.+...+. T Consensus 309 v~~-g~~G~l~~~~d~~~la~~i~~ll~~~~~~~~~~~~a~~~~~~~fs~-~~~~~~~~~i 367 (371) T cd04962 309 VKH-GETGFLVDVGDVEAMAEYALSLLEDDELWQEFSRAARNRAAERFDS-ERIVPQYEAL 367 (371) T ss_pred HCC-CCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCH-HHHHHHHHHH T ss_conf 708-9718997899999999999999769999999999999999986899-9999999999 No 16 >cd03798 GT1_wlbH_like This family is most closely related to the GT1 family of glycosyltransferases. wlbH in Bordetella parapertussis has been shown to be required for the biosynthesis of a trisaccharide that, when attached to the B. pertussis lipopolysaccharide (LPS) core (band B), generates band A LPS. Probab=99.83 E-value=6.1e-18 Score=137.48 Aligned_cols=331 Identities=20% Similarity=0.170 Sum_probs=195.2 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEE-EEEC-CCC----------CCHH---HHHHH--HHHCC Q ss_conf 999999999999986199899996277630012101477416-8850-557----------2013---44557--44202 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAI-HQYA-PLD----------IQPA---VSRFL--KYWKP 129 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~-~~y~-P~D----------~~~~---~~~fl--~~~~P 129 (440) |--..+..++++|.+++.+|-|-|..+.+............. .... ... .... ..+++ +..+| T Consensus 15 G~e~~~~~la~~L~~~G~~V~Vit~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 94 (377) T cd03798 15 GGGIFVKELARALAKRGVEVTVLAPGPWGPKLLDLLKGRLVGVERLPVLLPVVPLLKGPLLYLLAARALLKLLKLKRFRP 94 (377) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEECCCCCCCHHHHCCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 49999999999999779969999537987531231257520003445455444331046677899999999999746998 Q ss_pred EEEEEECCCCHHHHH--HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC--CCE Q ss_conf 035640476214678--8874224550586412665421210011112331026442366415778889986088--505 Q gi|254780402|r 130 DCMILSESDIWPLTV--FELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA--QKL 205 (440) Q Consensus 130 ~~~i~~e~ElWPnli--~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~--~~i 205 (440) |++..- +..+..++ ...+..++|++..-=-..-..+..+..+..+.+.++++.|.+++.|+..++.+.+.|. +++ T Consensus 95 DvI~~~-~~~~~~~~~~~~~~~~~~~~v~~~h~~~~~~~~~~~~~~~~~~~~~~~ad~ii~~S~~~~~~~~~~~~~~~~i 173 (377) T cd03798 95 DLIHAH-FAYPDGFAAALLKRKLGIPLVVTLHGSDVNLLPRKRLLRALLRRALRRADAVIAVSEALADELKALGIDPEKV 173 (377) T ss_pred CEEEEC-CCCHHHHHHHHHHHHCCCCEEEEECCCHHHHCCCCHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCCCE T ss_conf 689978-8406799999999973998899967741431023168999999999858999988989999999858996559 Q ss_pred EEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEE-CCCCCCCC-CCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHHH Q ss_conf 8720431012344221256899998617986175-04655332-10000000000-123412488970552114788877 Q gi|254780402|r 206 IVSGNLKIDTESLPCDKELLSLYQESIAGRYTWA-AISTFEGE-EDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIERR 282 (440) Q Consensus 206 ~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~v-agSth~~E-E~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~~ 282 (440) .+..|- .|...................++++++ +|...+.- -+.+++++..+ ++.++..++|+...+++ +.+.+. T Consensus 174 ~vi~ng-id~~~f~~~~~~~~~~~~~~~~~~~i~~~Grl~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~-~~l~~~ 251 (377) T cd03798 174 TVIPNG-VDTERFSPADRAEARKLGLPEDKKVILFVGRLVPRKGIDYLIEALARLLKKRPDVHLVIVGDGPLR-EALEAL 251 (377) T ss_pred EEECCC-CCHHHCCCCCHHHHHHHCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCHHH-HHHHHH T ss_conf 998897-578754988777898608999985999982452001828999999998874888522432682788-899999 Q ss_pred HHHHCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHH Q ss_conf 65300101100134444466416750676--5202554220820661355444545837689619949998984437999 Q gi|254780402|r 283 LIAKGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDI 360 (440) Q Consensus 283 l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~ 360 (440) +++.++.- .|.+...+ -++..+|+.||+ ||-.|..+..|-.++||+++|+|||.-+ ....+|. T Consensus 252 ~~~~~l~~-------------~v~~~g~v~~~~~~~~~~~adv-~v~pS~~E~~~~~~lEama~G~PvI~~~-~gg~~e~ 316 (377) T cd03798 252 AAELGLED-------------RVTFLGAVPHEEVPAYYAAADV-FVLPSLREGFGLVLLEAMACGLPVVATD-VGGIPEI 316 (377) T ss_pred HHHCCCCC-------------EEEECCCEEHHHCCCCCCCCCC-CCCCCHHHHHHHHHHHHHHCCCCEEECC-CCCHHHH T ss_conf 88618873-------------6986052100101013333774-1378557651255899997599799958-9986998 Q ss_pred HHHHHHCCCEEEE--CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 9999978986995--798999999999958989999999999999970455699999999975 Q gi|254780402|r 361 YRRMVSSGAVRIV--EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSY 421 (440) Q Consensus 361 ~~~L~~~g~~~~v--~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~ 421 (440) . .+.+.++.+ .|+++|++++.++++|++. +|+++|++.+.++-. -+++.+.+.+. T Consensus 317 i---~~~~~G~~~~~~~~~~l~~~i~~l~~~~~~--~~~~~~~~~~~~~fs-~~~~~~~~~~~ 373 (377) T cd03798 317 I---TDGENGLLVPPGDPEALAEAILRLLADPWL--RLGRAARRRVAERFS-WENVAERLLEL 373 (377) T ss_pred H---CCCCEEEEECCCCHHHHHHHHHHHHCCHHH--HHHHHHHHHHHHHCC-HHHHHHHHHHH T ss_conf 5---179749997999999999999999879999--999999999998699-99999999999 No 17 >cd05844 GT1_like_7 Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center Probab=99.83 E-value=2.1e-18 Score=140.48 Aligned_cols=280 Identities=17% Similarity=0.117 Sum_probs=183.3 Q ss_pred CCCCCCHHHHHHHHHHCCEEEEEECCCCHHH--HHHHHHHCCCCEEEE-EE---CCCCC--CCCCCHHHHHHHHHCCCCC Q ss_conf 0557201344557442020356404762146--788874224550586-41---26654--2121001111233102644 Q gi|254780402|r 112 APLDIQPAVSRFLKYWKPDCMILSESDIWPL--TVFELSKQRIPQVLV-NA---RMSRR--SFKNWKTVLSFSKKIFSQF 183 (440) Q Consensus 112 ~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPn--li~~~~~~~ip~~li-na---rls~~--S~~~~~~~~~l~~~~~~~~ 183 (440) ++......++++++.++||++-.- .- ++. .+..+++.|||++.- -| ..... ...++.......+.+++.. T Consensus 66 ~~~~~~~~l~r~lr~~~pDlIHaH-~~-~~g~~~~~~a~~~~iP~V~T~Hg~d~~~~~~~~~~~~~~~~~~~~~~l~~~a 143 (367) T cd05844 66 LLTGSAPQLRRLLRRHRPDLVHAH-FG-FDGVYALPLARRLGVPLVVTFHGFDATTSLALLLRSRWALYARRRRRLARRA 143 (367) T ss_pred HCCCCCHHHHHHHHHCCCCEEEEC-CC-CHHHHHHHHHHHHCCCEEEEEECCCCCCCCCHHCCCHHHHHHHHHHHHHHHC T ss_conf 426874899999997699999976-86-0689999999996999999981364101410100110467899999999726 Q ss_pred CEEEECCHHHHHHHHHHCC--CCEEEECCC-CCCCCCCCCCHHHHHHHHHHCCCCCEEE-CCCCCCC-CCCCCCCCCCC- Q ss_conf 2366415778889986088--505872043-1012344221256899998617986175-0465533-21000000000- Q gi|254780402|r 184 SLVIVQSERYFRRYKELGA--QKLIVSGNL-KIDTESLPCDKELLSLYQESIAGRYTWA-AISTFEG-EEDKAVYVHNF- 257 (440) Q Consensus 184 ~~i~~qs~~~~~rl~~lG~--~~i~v~Gnl-K~d~~~~~~~~~~~~~~~~~~~~r~v~v-agSth~~-EE~~il~a~~~- 257 (440) |.|++.|+..++.+.++|+ +++.|..|- ..+...+ .... ...+.++ .|...+. --+.+++|+.. T Consensus 144 ~~iI~vS~~~~~~l~~~G~~~~ki~vi~~GvD~~~f~~-~~~~---------~~~~~il~vGRl~~~KG~~~li~A~~~l 213 (367) T cd05844 144 ALFIAVSQFIRDRLLALGFPPEKVHVHPIGVDTAKFTP-ATPA---------RRPPRILFVGRFVEKKGPLLLLEAFARL 213 (367) T ss_pred CEEEECCHHHHHHHHHCCCCHHHEEEECCCCCHHHCCC-CCCC---------CCCCEEEEEECCCCCCCHHHHHHHHHHH T ss_conf 99996999999999985989789999778636764699-9877---------7896899993573000769999999997 Q ss_pred CCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCC----- Q ss_conf 012341248897055211478887765300101100134444466416750676520255422082066135544----- Q gi|254780402|r 258 IKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCA----- 332 (440) Q Consensus 258 l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~----- 332 (440) .++.|++.++|+..-|++ +++++.+++.|+.-...-.+..+. .++..+|+.|| +||-.|... T Consensus 214 ~~~~p~~~l~ivG~Gp~~-~~l~~~~~~l~l~~~V~f~G~~~~-----------~~v~~~l~~ad-v~v~PS~~~~~g~~ 280 (367) T cd05844 214 ARRVPEVRLVIIGDGPLL-AALEALARALGLGGRVTFLGAQPH-----------AEVRELMRRAR-IFLQPSVTAPSGDA 280 (367) T ss_pred HHHCCCEEEEEEECCCCH-HHHHHHHHHCCCCCCEEECCCCCC-----------HHHHHHHHHCC-EEEEEEEECCCCCC T ss_conf 966869799999888378-999999997098763787788981-----------88999998578-79960020377885 Q ss_pred -CCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHH Q ss_conf -4545837689619949998984437999999997898699579899999999995898999999999999997045569 Q gi|254780402|r 333 -SGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPL 411 (440) Q Consensus 333 -~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~ 411 (440) .-|-.++||+++|+|||... .....|++.+ -.+|......|+++|++++..|++||+.+++|+.+|++.++++-.. T Consensus 281 Eg~~~~~lEAmA~G~PVVat~-~ggi~e~V~~-g~~G~lv~~~d~~~La~ai~~Ll~d~~~~~~m~~~gr~~v~~~f~~- 357 (367) T cd05844 281 EGLPVVLLEAQASGVPVVATR-HGGIPEAVED-GETGLLVPEGDVAALAAALGRLLADPDLRARMGAAGRRRVEERFDL- 357 (367) T ss_pred CCCCHHHHHHHHCCCCEEEEC-CCCCHHHHCC-CCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCH- T ss_conf 676379999998499789927-9986877207-9968997899999999999999849999999999999999981999- Q ss_pred HHHHHHHH Q ss_conf 99999999 Q gi|254780402|r 412 KITLRSLD 419 (440) Q Consensus 412 ~~~~~~i~ 419 (440) ++..+.|+ T Consensus 358 ~~~~~~Le 365 (367) T cd05844 358 RRQTAKLE 365 (367) T ss_pred HHHHHHHH T ss_conf 99999983 No 18 >cd03800 GT1_Sucrose_synthase This family is most closely related to the GT1 family of glycosyltransferases. The sucrose-phosphate synthases in this family may be unique to plants and photosynthetic bacteria. This enzyme catalyzes the synthesis of sucrose 6-phosphate from fructose 6-phosphate and uridine 5'-diphosphate-glucose, a key regulatory step of sucrose metabolism. The activity of this enzyme is regulated by phosphorylation and moderated by the concentration of various metabolites and light. Probab=99.83 E-value=2.6e-17 Score=133.36 Aligned_cols=323 Identities=15% Similarity=0.150 Sum_probs=185.8 Q ss_pred HHHHHHHHHHHHHHHCCCCEEE-EECCCCCCHHHHCCCCCCEEEEECCC---CCC----------HHHHHHHHH-----H Q ss_conf 9999999999999861998999-96277630012101477416885055---720----------134455744-----2 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLL-TTMTATSAKVARKYLGQYAIHQYAPL---DIQ----------PAVSRFLKY-----W 127 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~ili-T~~T~tg~~~~~~~~~~~~~~~y~P~---D~~----------~~~~~fl~~-----~ 127 (440) |=-..+..|.+.|.+++.+|-| |..+..+... .....+.+.+.-+|. +.. ..+++++.. . T Consensus 22 G~e~~v~~La~~L~~~GH~V~V~t~~~~~~~~~-~~~~~~gv~v~rl~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~~ 100 (398) T cd03800 22 GQNVYVLELARALARLGHEVDIFTRRIDDALPP-IVELAPGVRVVRVPAGPAEYLPKEELWPYLDEFADDLLRFLRREGG 100 (398) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCC-EEEECCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 879999999999998699699997247778888-0682498699995578854332777787899999999999998389 Q ss_pred CCEEEEEECCCCHHH-H--HHHHHHCCCCEEEEEECCC---CCCC---CCCHHHH--HHHHHCCCCCCEEEECCHHHHHH Q ss_conf 020356404762146-7--8887422455058641266---5421---2100111--12331026442366415778889 Q gi|254780402|r 128 KPDCMILSESDIWPL-T--VFELSKQRIPQVLVNARMS---RRSF---KNWKTVL--SFSKKIFSQFSLVIVQSERYFRR 196 (440) Q Consensus 128 ~P~~~i~~e~ElWPn-l--i~~~~~~~ip~~linarls---~~S~---~~~~~~~--~l~~~~~~~~~~i~~qs~~~~~r 196 (440) +||++-. -.|.. + ...++..++|++..-=-+. .... ..+.... ..-+.+++..|.|++.|+..++. T Consensus 101 ~pDvIH~---h~~~~~~~~~~~~~~~~ip~V~t~H~l~~~~~~~~~~~~~~~~~~r~~~e~~~~~~ad~via~S~~~~~~ 177 (398) T cd03800 101 RPDLIHA---HYWDSGLVALLLARRLGIPLVHTFHSLGAVKRRHLGAADTYEPARRIEAEERLLRAADRVIASTPQEAEE 177 (398) T ss_pred CCCEEEE---CCCCHHHHHHHHHHHHCCCEEEECCCCHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCEEEECCHHHHHH T ss_conf 9888998---8840789999999971999999637511444332023554234789999999998499999879999999 Q ss_pred HHHH-CC--CCEEEECC-CCCCCCCCCCCHHHHHHHHHHCCCCCEEEC-CCCCCC-CCCCCCCCCCCCC-CCCCEEEEEE Q ss_conf 9860-88--50587204-310123442212568999986179861750-465533-2100000000001-2341248897 Q gi|254780402|r 197 YKEL-GA--QKLIVSGN-LKIDTESLPCDKELLSLYQESIAGRYTWAA-ISTFEG-EEDKAVYVHNFIK-CRTDVLTIIV 269 (440) Q Consensus 197 l~~l-G~--~~i~v~Gn-lK~d~~~~~~~~~~~~~~~~~~~~r~v~va-gSth~~-EE~~il~a~~~l~-~~~~~~lIIv 269 (440) +.+. |. +++.|+.| ...+...+...............++++++. |...+. --+.+++++..++ ..++..|+|+ T Consensus 178 ~~~~~~~~~~~i~vI~nGvd~~~f~p~~~~~~~~~~~~~~~~~~~i~~vGRl~~~Kg~~~li~A~~~l~~~~~~~~l~iv 257 (398) T cd03800 178 LYSLYGAYPRRIRVVPPGVDLERFTPYGRAEARRARLLRDPDKPRILAVGRLDPRKGIDTLIRAYAELPELRERANLVIV 257 (398) T ss_pred HHHHHCCCCCCEEEECCCCCHHHCCCCCCHHHHHHHHCCCCCCCEEEEECCCCHHCCHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 99972999022899769867744388980589998608998994899982896020989999999998877899689999 Q ss_pred ECCCC-----HHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCCCCCCCCHHHHH Q ss_conf 05521-----1478887765300101100134444466416750676--5202554220820661355444545837689 Q gi|254780402|r 270 PRHPR-----RCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFCASGGQNPLEAA 342 (440) Q Consensus 270 PRhpe-----R~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~~~GGhN~lEpa 342 (440) ...++ ...++.++.+..|+.- .|.+...+ .++..+|+.|| +||-.|..+.-|-.++||+ T Consensus 258 Gg~~~~~~~~~~~~~~~~~~~~~l~~-------------~V~f~G~~~~~~~~~~~~~ad-v~v~pS~~E~fgl~~lEAm 323 (398) T cd03800 258 GGPRDDILAMDEEELRELARELGVID-------------RVDFPGRVSREDLPALYRAAD-VFVNPALYEPFGLTALEAM 323 (398) T ss_pred ECCCCCCCHHHHHHHHHHHHHCCCCC-------------EEEECCCCCHHHHHHHHHHCC-EEEECCCCCCHHHHHHHHH T ss_conf 68876531345999999999759987-------------499889899899999998578-8875451332214899999 Q ss_pred HHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 6199499989844379999999978986995798999999999958989999999999999970455 Q gi|254780402|r 343 MLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQG 409 (440) Q Consensus 343 a~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~g 409 (440) ++|+|||.- ......|+... -.+|..+...|+++|++++.++++||+.+++|+++|++.+.++-. T Consensus 324 a~G~PvIas-~~gg~~e~v~~-g~~G~l~~~~d~~~la~ai~~ll~d~~~~~~~g~~ar~~~~~~fs 388 (398) T cd03800 324 ACGLPVVAT-AVGGPRDIVVD-GVTGLLVDPRDPEALAAALRRLLTDPALRRRLSRAGLRRARARYT 388 (398) T ss_pred HCCCCEEEC-CCCCCHHHHCC-CCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCC T ss_conf 829999993-89980777417-971899789999999999999977999999999999999998689 No 19 >cd03799 GT1_amsK_like This is a family of GT1 glycosyltransferases found specifically in certain bacteria. amsK in Erwinia amylovora, has been reported to be involved in the biosynthesis of amylovoran, a exopolysaccharide acting as a virulence factor. Probab=99.82 E-value=1.1e-17 Score=135.91 Aligned_cols=316 Identities=19% Similarity=0.167 Sum_probs=188.8 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCC-CCCEEEEECCCCCC-----HHHHHHHHHHCCEEEEEECCCCH Q ss_conf 9999999999999861998999962776300121014-77416885055720-----13445574420203564047621 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYL-GQYAIHQYAPLDIQ-----PAVSRFLKYWKPDCMILSESDIW 140 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~-~~~~~~~y~P~D~~-----~~~~~fl~~~~P~~~i~~e~ElW 140 (440) ||. -+..+++.|.+++++|.|-|..+.......... .......|.+.... ....++++..+||++-.- +... T Consensus 13 ~et-fv~~la~~L~~~GHeV~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~DiIH~H-~~~~ 90 (355) T cd03799 13 SET-FILREILALEAAGHEVEIFSLRPPEDTLVHPEDRAELARTRYLARSLALLAQALVLARELRRLGIDHIHAH-FGTT 90 (355) T ss_pred CCH-HHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCEECCCCCCHHHHHHHHHHHHHHHHHHHCCCCEEEEC-CCCH T ss_conf 617-99999999996798499995348877730643021215521547779999999999999977799899976-8833 Q ss_pred HHHH--HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHH-CC--CCEEEECCCCCCC Q ss_conf 4678--88742245505864126654212100111123310264423664157788899860-88--5058720431012 Q gi|254780402|r 141 PLTV--FELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKEL-GA--QKLIVSGNLKIDT 215 (440) Q Consensus 141 Pnli--~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~l-G~--~~i~v~GnlK~d~ 215 (440) |..+ ...+..|+|++.-- .-.+- +.. ......+..++..|.+.+.|+..++.+.+. |. +++.|..|- .|. T Consensus 91 ~~~~~~~~~~~~~ip~v~t~-Hg~d~-~~~--~~~~~l~~~~~~ad~ii~vS~~~~~~l~~~~~~~~~ki~vi~ng-vd~ 165 (355) T cd03799 91 PATVAMLASRLGGIPYSFTA-HGKDI-FRS--PDAIDLDEKLARADFVVAISEYNRQQLIRLLGCDPDKIHVVHCG-VDL 165 (355) T ss_pred HHHHHHHHHHHCCCCEEEEE-CCCCC-CCC--CHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCHHHEEEECCC-CCH T ss_conf 79999999997499999998-16765-567--36899999998399999989999999998609991468998996-488 Q ss_pred CCCCCCHHHHHHHHHHCCCCC-EEECCCCCCC-CCCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHH Q ss_conf 344221256899998617986-1750465533-210000000000-1234124889705521147888776530010110 Q gi|254780402|r 216 ESLPCDKELLSLYQESIAGRY-TWAAISTFEG-EEDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVAR 292 (440) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~r~-v~vagSth~~-EE~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~ 292 (440) ......... ...++. ++..|...+. --+.+++|.+.+ .+.|+..|+|+...|++ +++.+++++.|+.-.. T Consensus 166 ~~f~~~~~~------~~~~~~~il~vGrl~~~Kg~~~li~A~~~l~~~~~~~~l~ivG~G~~~-~~l~~~~~~l~l~~~V 238 (355) T cd03799 166 ERFPPRPPP------PPGEPLRILSVGRLVEKKGLDYLLEALALLKDRGIDFRLDIVGDGPLR-DELEALIAELGLEDRV 238 (355) T ss_pred HHCCCCCCC------CCCCCEEEEEEECCCHHCCCHHHHHHHHHHHHHCCCEEEEEECCCHHH-HHHHHHHHHCCCCCCE T ss_conf 887998755------778986999981580211909999999999864999799996676048-8999999974998550 Q ss_pred HCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCC------CCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHH Q ss_conf 0134444466416750676520255422082066135544------4545837689619949998984437999999997 Q gi|254780402|r 293 RSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCA------SGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVS 366 (440) Q Consensus 293 ~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~------~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~ 366 (440) .-.+..+ -.++..+|+.||+ ||-.|+.. .-|-.++||+++|+|||... ....+|++++ -. T Consensus 239 ~f~G~v~-----------~~~l~~~~~~adv-~v~pS~~~~~~~~Eg~p~~~lEAma~G~PvVas~-~~g~~e~v~~-g~ 304 (355) T cd03799 239 TLLGAKS-----------QEEVRELLRAADL-FVLPSVTAADGDREGLPVVLMEAMAMGLPVISTD-VSGIPELVED-GE 304 (355) T ss_pred EECCCCC-----------CHHHHHHHHHCCE-EEECCCCCCCCCCCCCCHHHHHHHHCCCCEEECC-CCCCHHHHCC-CC T ss_conf 7644446-----------4767999974989-9984523356776687779999996699899917-9985766228-98 Q ss_pred CCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCH Q ss_conf 89869957989999999999589899999999999999704556 Q gi|254780402|r 367 SGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGP 410 (440) Q Consensus 367 ~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga 410 (440) +|......|+++|++++..+++||+.+++|+++|++.++++-.. T Consensus 305 ~G~l~~~~d~~~la~~i~~ll~d~~~~~~~~~~ar~~v~~~fs~ 348 (355) T cd03799 305 TGLLVPPGDPEALADAIERLLDDPELRREMGEAGRARVEEEFDI 348 (355) T ss_pred CEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCH T ss_conf 58997999999999999999879999999999999999986995 No 20 >cd03822 GT1_ecORF704_like This family is most closely related to the GT1 family of glycosyltransferases. ORF704 in E. coli has been shown to be involved in the biosynthesis of O-specific mannose homopolysaccharides. Probab=99.81 E-value=2.8e-17 Score=133.16 Aligned_cols=323 Identities=16% Similarity=0.169 Sum_probs=189.7 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCC---HHHHHHHHHHCCEEEEEE-CCCCH-H Q ss_conf 999999999999986199899996277630012101477416885055720---134455744202035640-47621-4 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQ---PAVSRFLKYWKPDCMILS-ESDIW-P 141 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~---~~~~~fl~~~~P~~~i~~-e~ElW-P 141 (440) |=...+..|.+.|.+++.+|.|-+.++..-..... .........+...+ ....++++..+||++.+- ++.+| | T Consensus 14 Gi~~~~~~La~~L~~~Gh~V~v~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~i~~~~~Dvvh~~~~~~~~~~ 91 (366) T cd03822 14 GIATFTTDLVNALSARGPDVLVVSVAALYPSLLYG--GEQEVVRVIVLDNPLDYRRAARAIRLSGPDVVVIQHEYGIFGG 91 (366) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCC--CCCCEEEECCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCHH T ss_conf 38999999999998679989999588888754446--7764167603667325999999998539999999365332106 Q ss_pred H----HHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHC-CCCEEEECCCCCCCC Q ss_conf 6----78887422455058641266542121001111233102644236641577888998608-850587204310123 Q gi|254780402|r 142 L----TVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELG-AQKLIVSGNLKIDTE 216 (440) Q Consensus 142 n----li~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG-~~~i~v~GnlK~d~~ 216 (440) + .....++.++|++...=.... ...+.+.+.+.+.+++..|.+++.|...++.+...+ ..++.+..|--.+.. T Consensus 92 ~~~~~~~~~~~~~~~p~v~t~H~~~~--~~~~~~~~~~~~~~~~~ad~vi~~s~~~~~~~~~~~~~~~i~vIpngv~~~~ 169 (366) T cd03822 92 EAGLYLLLLLRGLGIPVVVTLHTVLL--HEPRPGDRALLRLLLRRADAVIVMSSELLRALLLRAYPEKIAVIPHGVPDPP 169 (366) T ss_pred HHHHHHHHHHHHCCCCEEEEECCCCC--CCCHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCCEEEECCCCCCCC T ss_conf 68999999998559989999777765--5422779999999998679999957999999986469873999679987545 Q ss_pred CCCCCHHHHHHHHHHCCCCCEEEC-CCC--CCCCCCCCCCCCCCC-CCCCCEEEEEEEC-CC--CH-HHHHHHHHHHHCC Q ss_conf 442212568999986179861750-465--533210000000000-1234124889705-52--11-4788877653001 Q gi|254780402|r 217 SLPCDKELLSLYQESIAGRYTWAA-IST--FEGEEDKAVYVHNFI-KCRTDVLTIIVPR-HP--RR-CDAIERRLIAKGL 288 (440) Q Consensus 217 ~~~~~~~~~~~~~~~~~~r~v~va-gSt--h~~EE~~il~a~~~l-~~~~~~~lIIvPR-hp--eR-~~~i~~~l~~~gl 288 (440) . .............++++++. |.. ++| -+.+++|+..+ ++.|++.|+|+.. ++ ++ ..+..+++++.|+ T Consensus 170 ~---~~~~~~~~~~~~~~~~~il~~Grl~~~Kg-~~~li~A~~~l~~~~~~~~l~ivG~~~~~~~~~~~~~~~~~~~lgl 245 (366) T cd03822 170 A---EPPESLKALGGLDGRPVLLTFGLLRPYKG-LELLLEALPLLVAKHPDVRLLVAGETHPDLERYRGEAYALAERLGL 245 (366) T ss_pred C---CCHHHHHHHCCCCCCEEEEEEECCCCCCC-HHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHHHHHHHHCCC T ss_conf 5---88677887458999859999853405548-5999999999887689859999958987426678999999997399 Q ss_pred HHHHHCCCCCCCCCCEEEEEC---CCCCCHHHHHHCCCEEEECCCCCCCCC--CHHHHHHHCCCEEECCCCCCHHHHHHH Q ss_conf 011001344444664167506---765202554220820661355444545--837689619949998984437999999 Q gi|254780402|r 289 KVARRSRGDVINAEVDIFLGD---TIGEMGFYLRMTEIAFIGRSFCASGGQ--NPLEAAMLGCAILSGPNVENFRDIYRR 363 (440) Q Consensus 289 ~~~~~S~~~~~~~~~~V~i~D---t~GeL~~lY~~AdiafVGGSl~~~GGh--N~lEpaa~G~pVi~GP~~~nf~e~~~~ 363 (440) .-. |.+.+ .--++..+|+.||+ ||-.|.....+. ..+|||++|+|||.- ......++ T Consensus 246 ~~~-------------V~f~~g~v~~~~~~~~~~~adv-~v~Ps~~e~~~~s~v~~EAma~G~PvVat-~~gg~~ev--- 307 (366) T cd03822 246 ADR-------------VIFINRYLPDEELPELFSAADV-VVLPYRSADQTQSGVLAYAIGFGKPVIST-PVGHAEEV--- 307 (366) T ss_pred CCC-------------EEECCCCCCHHHHHHHHHHCCC-CCCCCCCCCCCHHHHHHHHHHCCCCEEEE-CCCCHHEE--- T ss_conf 765-------------5324788899999999995570-30555466544569999999749989990-89974408--- Q ss_pred HHHCCCEEEE--CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 9978986995--798999999999958989999999999999970455699999999 Q gi|254780402|r 364 MVSSGAVRIV--EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSL 418 (440) Q Consensus 364 L~~~g~~~~v--~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i 418 (440) .+.+.+..+ .|+++|++++..+++|++.|++|+++|++++++. +-+++.+.. T Consensus 308 -~~~~~G~lv~~~d~~~la~ai~~ll~d~~~r~~l~~~a~~~a~~~--sW~~ia~~~ 361 (366) T cd03822 308 -LDGGTGLLVPPGDPAALAEAIRRLLADPELAQALRARAREYARAM--SWERVAERY 361 (366) T ss_pred -ECCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHC--CHHHHHHHH T ss_conf -839968998999999999999999879999999999999999979--999999999 No 21 >cd03812 GT1_CapH_like This family is most closely related to the GT1 family of glycosyltransferases. capH in Staphylococcus aureus has been shown to be required for the biosynthesis of the type 1 capsular polysaccharide (CP1). Probab=99.80 E-value=4.8e-17 Score=131.57 Aligned_cols=314 Identities=13% Similarity=0.066 Sum_probs=185.7 Q ss_pred HHHHHHHHHHHHHHHCCCCE-EEEECCCCCCHHHHCCCCCCEEEEECCCCC------CHHHHHHHHHHCCEEEEEECCCC Q ss_conf 99999999999998619989-999627763001210147741688505572------01344557442020356404762 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNV-LLTTMTATSAKVARKYLGQYAIHQYAPLDI------QPAVSRFLKYWKPDCMILSESDI 139 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~i-liT~~T~tg~~~~~~~~~~~~~~~y~P~D~------~~~~~~fl~~~~P~~~i~~e~El 139 (440) |--..+..|++.|.+++.++ ++++....+. .........+.+.++|... ....+++++..+||++..- +.. T Consensus 13 GaE~~~~~L~~~L~~~g~~v~v~~~~~~~~~-~~~~~~~~g~~v~~~~~~~~~~~~~~~~l~~~i~~~~~DiIh~h-~~~ 90 (358) T cd03812 13 GIETFIMNYYRNLDRSKIQFDFLVTSKEEGD-YDDEIEKLGGKIYYIPARKKNPLKYFKKLYKLIKKNKYDIVHVH-GSS 90 (358) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEEECCCCCC-HHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEC-CCC T ss_conf 1899999999999876998999998798763-68999975997999389764289999999999998399999985-750 Q ss_pred HHHHH-HHHHHCCCCEEEEEECCCCCC--CCCCHHHHHHHHHC-CCCCCEEEECCHHHHHHHHH-HCCCCEEEECCCCCC Q ss_conf 14678-887422455058641266542--12100111123310-26442366415778889986-088505872043101 Q gi|254780402|r 140 WPLTV-FELSKQRIPQVLVNARMSRRS--FKNWKTVLSFSKKI-FSQFSLVIVQSERYFRRYKE-LGAQKLIVSGNLKID 214 (440) Q Consensus 140 WPnli-~~~~~~~ip~~linarls~~S--~~~~~~~~~l~~~~-~~~~~~i~~qs~~~~~rl~~-lG~~~i~v~GnlK~d 214 (440) |..+. ..++..++|..+...+-+... ...........+.+ ....+.+++.|+..++.+.. ...+++.|..|- .| T Consensus 91 ~~~~~~~~~~~~~~~~~I~h~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~iavS~~~~~~l~~~~~~~ki~vI~Ng-Id 169 (358) T cd03812 91 ASGFILLAAKKAGVKVRIAHSHNTSDSHDKKKKILKYKVLRKLINRLATDYLACSEEAGKWLFGKVKNKKFKVIPNG-ID 169 (358) T ss_pred HHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHCCEEEEECHHHHHHHHHHCCCCCEEEECCC-CC T ss_conf 68999999998599989998578744543167899999999999986999999588999999731687878998698-07 Q ss_pred CCCCCCCHHHHHHHHH--HCCCCCEEE-CCCCCCCC-CCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHHHHHHHCCH Q ss_conf 2344221256899998--617986175-04655332-10000000000-1234124889705521147888776530010 Q gi|254780402|r 215 TESLPCDKELLSLYQE--SIAGRYTWA-AISTFEGE-EDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIERRLIAKGLK 289 (440) Q Consensus 215 ~~~~~~~~~~~~~~~~--~~~~r~v~v-agSth~~E-E~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~ 289 (440) ......+.......+. ...++++++ .|..++.- -+.+++|++.+ ++.|+..++|+..-+++ +++.+.+.+.|+. T Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~vi~~vGRl~~~Kg~~~Li~A~~~l~~~~~~~~l~ivG~G~~~-~~l~~~i~~~~l~ 248 (358) T cd03812 170 LEKFIFNEEIRKKRRELGILEDKFVIGHVGRFSEQKNHEFLIEIFAELLKKNPNAKLLLVGDGELE-EEIKKKVKELGLE 248 (358) T ss_pred HHHCCCCHHHHHHHHHHCCCCCCEEEEEECCCHHHHCCHHHHHHHHHHHHHCCCCEEEEECCCHHH-HHHHHHHHHCCCC T ss_conf 442387546689999838998986999960476651718899999986510888239996275278-7899999982987 Q ss_pred HHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCC Q ss_conf 11001344444664167506765202554220820661355444545837689619949998984437999999997898 Q gi|254780402|r 290 VARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGA 369 (440) Q Consensus 290 ~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~ 369 (440) - .|.+.....++..+|+.|| +||=.|..+.-|-.++||+++|+|||.-- ...-.++..+ ..|. T Consensus 249 ~-------------~V~f~G~~~~~~~~~~~aD-i~v~pS~~Egf~~v~lEAma~G~PVVasd-~gg~~~ii~~--~~~~ 311 (358) T cd03812 249 D-------------KVIFLGVRNDVPELLQAMD-VFLFPSLYEGLPLVLIEAQASGLPCILSD-TITKEVDLTD--LVKF 311 (358) T ss_pred C-------------EEEECCCCCCHHHHHHHCC-EEEECCCCCCCCHHHHHHHHHCCCEEEEC-CCCCCEEEEC--CCCE T ss_conf 2-------------4997467013789997398-99974876788479999999499899965-9997469729--9557 Q ss_pred EEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 699579899999999995898999999999999 Q gi|254780402|r 370 VRIVEEVGTLADMVYSLLSEPTIRYEMINAAIN 402 (440) Q Consensus 370 ~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~ 402 (440) .....|++++++++.+|++||+ +++|+.++++ T Consensus 312 l~~~~~~~~~a~~I~~l~~~~~-~~~~~~~~~~ 343 (358) T cd03812 312 LSLDESPEIWAEEILKLKSEDR-RERSSESIKK 343 (358) T ss_pred EECCCCHHHHHHHHHHHHCCHH-HHHHHHHHHH T ss_conf 9689999999999999865836-7999999998 No 22 >cd03809 GT1_mtfB_like This family is most closely related to the GT1 family of glycosyltransferases. mtfB (mannosyltransferase B) in E. coli has been shown to direct the growth of the O9-specific polysaccharide chain. It transfers two mannoses into the position 3 of the previously synthesized polysaccharide. Probab=99.80 E-value=3.2e-17 Score=132.72 Aligned_cols=316 Identities=16% Similarity=0.099 Sum_probs=186.6 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEEC--------CCCCCHHHHHHHHHHCCEEEEEECCC Q ss_conf 9999999999999861998999962776300121014774168850--------55720134455744202035640476 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYA--------PLDIQPAVSRFLKYWKPDCMILSESD 138 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~--------P~D~~~~~~~fl~~~~P~~~i~~e~E 138 (440) |=-.-+..|++.|.+.+.++.+-+...................... ++-........+...+++++...... T Consensus 16 Gi~ry~~~L~~~L~~~g~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dv~h~~~~~ 95 (365) T cd03809 16 GIGRYARELLRALLKLDPEEVLLLLPGAPGLLLLPLRAALRLLLRLPRRLLWGLLFLLRAGDRLLLLLLGLDLLHSPHNT 95 (365) T ss_pred CHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCEEEECCCH T ss_conf 68999999999997459977999935864221111123541013566531133435778888999985599999989832 Q ss_pred CHHHHHHHHHHCCCCEEEEEECCC----CC--CCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHH-HCC--CCEEEEC Q ss_conf 214678887422455058641266----54--21210011112331026442366415778889986-088--5058720 Q gi|254780402|r 139 IWPLTVFELSKQRIPQVLVNARMS----RR--SFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKE-LGA--QKLIVSG 209 (440) Q Consensus 139 lWPnli~~~~~~~ip~~linarls----~~--S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~-lG~--~~i~v~G 209 (440) .+ .....++|++..-=-+. +. +.....++..+.+..++..|.|++.|+..++.+.+ +|. +++.|+. T Consensus 96 ~~-----~~~~~~~~~V~t~HD~~~~~~~~~~~~~~~~~~~~~~~~~~~~ad~ii~vS~~~~~~~~~~~~~~~~~i~vi~ 170 (365) T cd03809 96 AP-----LLRLRGVPVVVTIHDLIPLRFPEYFSPGFRRYFRRLLRRALRRADAIITVSEATKRDLLRYLGVPPDKIVVIP 170 (365) T ss_pred HH-----HHHHCCCCEEEEECCCHHHHCCCCCCHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHHCCCHHHEEEEC T ss_conf 65-----5643599899997885065382007977899999999999996999999799999999998498858989981 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHCCCCC-EEECCCCCCC-CCCCCCCCCCCC-CCCCCEEEEEEECCCCHHHHHHHHHHHH Q ss_conf 431012344221256899998617986-1750465533-210000000000-1234124889705521147888776530 Q gi|254780402|r 210 NLKIDTESLPCDKELLSLYQESIAGRY-TWAAISTFEG-EEDKAVYVHNFI-KCRTDVLTIIVPRHPRRCDAIERRLIAK 286 (440) Q Consensus 210 nlK~d~~~~~~~~~~~~~~~~~~~~r~-v~vagSth~~-EE~~il~a~~~l-~~~~~~~lIIvPRhpeR~~~i~~~l~~~ 286 (440) |- .|....+..............+++ ++..|+..+. --+.+++|+..+ ++.++..|+|+....+..+++.+.+... T Consensus 171 ~g-v~~~~~~~~~~~~~~~~~~~~~~~~il~vg~~~~~K~~~~li~a~~~l~~~~~~~~lvivG~~~~~~~~~~~~~~~~ 249 (365) T cd03809 171 LG-VDPRFRPPPAEAEVLRALYLLPRPYFLYVGTIEPRKNLERLLEAFARLPAKGPDPKLVIVGKRGWLNEELLARLREL 249 (365) T ss_pred CC-CCHHCCCCCCHHHHHHHHCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHC T ss_conf 55-55110588742678887438999889999536455799999999999887689938999778974179999999965 Q ss_pred CCHHHHHCCCCCCCCCCEEEEECC--CCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHH Q ss_conf 010110013444446641675067--652025542208206613554445458376896199499989844379999999 Q gi|254780402|r 287 GLKVARRSRGDVINAEVDIFLGDT--IGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRM 364 (440) Q Consensus 287 gl~~~~~S~~~~~~~~~~V~i~Dt--~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L 364 (440) ++.- .|..... -.||..+|+.|| +||-.|+.+.-|..++||+++|+|||. .+..-++|+.. T Consensus 250 ~~~~-------------~v~~~g~~~~~~l~~~y~~ad-~~v~PS~~EgfGl~~lEAma~G~Pvi~-s~~~~~~Ei~g-- 312 (365) T cd03809 250 GLGD-------------RVRFLGYVSDEELAALYRGAR-AFVFPSLYEGFGLPVLEAMACGTPVIA-SNISSLPEVAG-- 312 (365) T ss_pred CCCC-------------CEEEECCCCHHHHHHHHHCCC-EEEECCCCCCCCCHHHHHHHCCCCEEE-ECCCCCCEEEC-- T ss_conf 9998-------------589936879899999997177-435414335789689999985998999-07998730857-- Q ss_pred HHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 978986995798999999999958989999999999999970 Q gi|254780402|r 365 VSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKK 406 (440) Q Consensus 365 ~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~ 406 (440) +.|..+...|+++|++++.++++|++.+++|+++|++.+++ T Consensus 313 -~~g~~~~p~d~~~la~~i~~ll~d~~~~~~~~~~~~~~~~~ 353 (365) T cd03809 313 -DAALYFDPLDPEALAAAIERLLEDPALREELRERGLARAKR 353 (365) T ss_pred -CCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf -83799899999999999999987999999999999999996 No 23 >PRK13609 diacylglycerol glucosyltransferase; Provisional Probab=99.80 E-value=3.2e-16 Score=126.15 Aligned_cols=339 Identities=15% Similarity=0.193 Sum_probs=206.8 Q ss_pred CCEEEEEECCHHH--HHHHHHHHHHHHHCCCC-EEE-EEC---CCCCCHHHHCCCCCC------E-EEEEC----CCCCC Q ss_conf 8779998178999--99999999999861998-999-962---776300121014774------1-68850----55720 Q gi|254780402|r 56 GPLIWFHASSVGE--TMALIGLIPAIRSRHVN-VLL-TTM---TATSAKVARKYLGQY------A-IHQYA----PLDIQ 117 (440) Q Consensus 56 ~~~IW~HaaSvGE--~~~a~pli~~l~~~~~~-ili-T~~---T~tg~~~~~~~~~~~------~-~~~y~----P~D~~ 117 (440) ++.|.|=.+|.|. ..+|..|.+.+.+++++ +.+ ..+ +|.-.+.....+-.. . ...|. .+|.. T Consensus 4 ~kKVLILtas~G~GH~~AA~AL~e~l~~~~~~~v~v~D~~~~~~p~~~~~~~~~Yl~~~~~~p~l~~~~Y~~~~~~~~~~ 83 (388) T PRK13609 4 NPKVLILTAHYGNGHVQVAKTLEQTFRQKGIKDVIVCDLFGESHPVITDITKYLYLKSYTIGKELYRLFYYGVEKIYDKK 83 (388) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCEEEEEEHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH T ss_conf 99799997888827899999999999835998199985143027048889988888885535889999996432222156 Q ss_pred ----------HHHHHHHHHHCCEEEEEECCCCHHHH-HHHHHHC---CCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCC Q ss_conf ----------13445574420203564047621467-8887422---455058641266542121001111233102644 Q gi|254780402|r 118 ----------PAVSRFLKYWKPDCMILSESDIWPLT-VFELSKQ---RIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQF 183 (440) Q Consensus 118 ----------~~~~~fl~~~~P~~~i~~e~ElWPnl-i~~~~~~---~ip~~linarls~~S~~~~~~~~~l~~~~~~~~ 183 (440) ...+++++..+||++|.. .|.. +...+++ .+|++.+ ++|-+.- ..++.... T Consensus 84 ~~~~~~~~~~~~l~~li~~~kPDvII~T----~P~~~l~~lk~~~~~~iP~~tV---iTD~~~H--------~~Wi~~~~ 148 (388) T PRK13609 84 IASWYANFGRKRLKLLLQAEKPDIVINT----FPIIAVPELKKQTGISIPVYNV---LTDFCVH--------KIWIHREV 148 (388) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEC----CCHHHHHHHHHHCCCCCCEEEE---ECCCCCH--------HHHCCCCC T ss_conf 7899999979999999998295999988----8789999999845999988999---4785204--------64557899 Q ss_pred CEEEECCHHHHHHHHHHCC--CCEEEECCC---CCCCCCCCCCHHHHHHHHHHCCCCCE-EECCCCCC--CCCCCCCCCC Q ss_conf 2366415778889986088--505872043---10123442212568999986179861-75046553--3210000000 Q gi|254780402|r 184 SLVIVQSERYFRRYKELGA--QKLIVSGNL---KIDTESLPCDKELLSLYQESIAGRYT-WAAISTFE--GEEDKAVYVH 255 (440) Q Consensus 184 ~~i~~qs~~~~~rl~~lG~--~~i~v~Gnl---K~d~~~~~~~~~~~~~~~~~~~~r~v-~vagSth~--~EE~~il~a~ 255 (440) |..++-+++.++.+.+.|+ ++|.++|.. +|.. +.+.+........-.++++ ++.|..+- +..+ +.. T Consensus 149 D~y~Vase~~k~~l~~~Gv~~~kI~vtGiPVr~~F~~---~~~k~~~r~~lgL~~d~~~vLimgGg~G~~g~i~---~l~ 222 (388) T PRK13609 149 DRYFVATDHVKEVMVDIGVPAEQIVETGIPIRSSFEL---KINPDIIYNKYQLCKNKKILLIVAGAHGVLGNVK---ELC 222 (388) T ss_pred CEEEECCHHHHHHHHHHCCCHHHEEEECCCCCHHHCC---CCCHHHHHHHHCCCCCCCEEEEECCHHHHHHHHH---HHH T ss_conf 9799398999999998099888999889843878727---5887899998289987847999766012114799---999 Q ss_pred CCCCCCCCEEEEEE-ECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCC Q ss_conf 00012341248897-05521147888776530010110013444446641675067652025542208206613554445 Q gi|254780402|r 256 NFIKCRTDVLTIIV-PRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASG 334 (440) Q Consensus 256 ~~l~~~~~~~lIIv-PRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~G 334 (440) ..+...+++.++++ .|.- ++.+.+++.... ....|-+..=.-+|..||+.||+ +|+ ..| T Consensus 223 ~~L~~~~~~qiiVVcGrN~----~L~~~L~~~~~~-----------~~~~v~vlGf~~~~~~~~~~~d~-~i~----k~G 282 (388) T PRK13609 223 QSFMSVPDLQVVVVCGKNE----ALKQDLVGLQET-----------NPDALKVFGYVENIDELFRVTSC-MIT----KPG 282 (388) T ss_pred HHHHCCCCCEEEEEECCCH----HHHHHHHHHHHC-----------CCCCEEEEECHHHHHHHHHHCCE-EEE----CCC T ss_conf 9974589924999908998----999999988750-----------79946995045209999985759-995----786 Q ss_pred CCCHHHHHHHCCCEE-ECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 458376896199499-9898443799999999789869957989999999999589899999999999999704556999 Q gi|254780402|r 335 GQNPLEAAMLGCAIL-SGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKI 413 (440) Q Consensus 335 GhN~lEpaa~G~pVi-~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~ 413 (440) |-+..|++++|+|++ ++|-=.--.+-++.|.+.|++..++|.+++.+.+..|++||++.++|.+++++. ....|+++ T Consensus 283 g~t~~E~~~~~~P~i~~~~~pgqe~~N~~~~~~~g~~~~~~~~~~~~~~~~~ll~~~~~l~~m~~~~~~~--~~p~aa~~ 360 (388) T PRK13609 283 GITLSEAAALQVPVILYKPVPGQENENAMYFERKGAAVVIRDDSEVFAKTEALLQDDMKLLQMKEAMKSI--YLPEPADH 360 (388) T ss_pred HHHHHHHHHHCCCEEECCCCCCHHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHC--CCCCHHHH T ss_conf 4589999994899897068996167779999978987997999999999999976999999999999862--79858999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCC Q ss_conf 99999975237878501678997545 Q gi|254780402|r 414 TLRSLDSYVNPLIFQNHLLSKDPSFK 439 (440) Q Consensus 414 ~~~~i~~~l~~~~~~~~~~~~~~~~~ 439 (440) +++.|-. +-...++|+..|.|++- T Consensus 361 I~~~il~--~~~~~~~~~~~~~~~~~ 384 (388) T PRK13609 361 IVDTILA--ENHVEPNHIPIKSPALA 384 (388) T ss_pred HHHHHHH--HHCCCCCCCCCCCHHHH T ss_conf 9999998--63579764436677666 No 24 >TIGR03449 mycothiol_MshA UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase. Members of this protein family, found exclusively in the Actinobacteria, are MshA, the glycosyltransferase of mycothiol biosynthesis. Mycothiol replaces glutathione in these species. Probab=99.80 E-value=8.1e-17 Score=130.08 Aligned_cols=347 Identities=16% Similarity=0.122 Sum_probs=190.0 Q ss_pred EEEEECCH---------HHHHHHHHHHHHHHHCCCCEEE-EECCCCCCHHHHCCCCCCEE--EE-ECCCCC------CH- Q ss_conf 99981789---------9999999999999861998999-96277630012101477416--88-505572------01- Q gi|254780402|r 59 IWFHASSV---------GETMALIGLIPAIRSRHVNVLL-TTMTATSAKVARKYLGQYAI--HQ-YAPLDI------QP- 118 (440) Q Consensus 59 IW~HaaSv---------GE~~~a~pli~~l~~~~~~ili-T~~T~tg~~~~~~~~~~~~~--~~-y~P~D~------~~- 118 (440) |-+||+-. |--..+..|.++|.+++.+|-| |..+..+........ +.+. +. ..|+.. +. T Consensus 4 ~~~~~~p~~~~g~gd~GG~e~~v~~La~~La~rGheV~V~t~~~~~~~~~~~~~~-~gv~v~~~~~~p~~~~~~~~l~~~ 82 (405) T TIGR03449 4 ISMHTSPLQQPGTGDAGGMNVYILETATELARRGIEVDIFTRATRPSQPPVVEVA-PGVRVRNVVAGPYEGLDKEDLPTQ 82 (405) T ss_pred EECCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEEEEC-CCEEEEEECCCCCCCCCHHHHHHH T ss_conf 9833784546999995886999999999999789969999358887788846704-984999825786232456676999 Q ss_pred ---HHHHHHHHH---CC---EEEEEECCCCHH-HHH--HHHHHCCCCEEEEEECCCC---CCCC----CCHHHHH-HHHH Q ss_conf ---344557442---02---035640476214-678--8874224550586412665---4212----1001111-2331 Q gi|254780402|r 119 ---AVSRFLKYW---KP---DCMILSESDIWP-LTV--FELSKQRIPQVLVNARMSR---RSFK----NWKTVLS-FSKK 178 (440) Q Consensus 119 ---~~~~fl~~~---~P---~~~i~~e~ElWP-nli--~~~~~~~ip~~linarls~---~S~~----~~~~~~~-l~~~ 178 (440) .....++.+ +| |++- +-.|- +.+ ..+++.++|++..--.+.. .... .....+. ..+. T Consensus 83 l~~~~~~~l~~~~~~~~~~~DvIH---~h~~~~~~~~~~~~~~~~iP~V~t~H~l~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (405) T TIGR03449 83 LCAFTGGVLRAEARHEPGYYDLIH---SHYWLSGQVGWLLRDRWGVPLVHTAHTLAAVKNAALADGDTPEPEARRIGEQQ 159 (405) T ss_pred HHHHHHHHHHHHHHHCCCCCCEEE---ECCCCHHHHHHHHHHHCCCCEEEEECCHHHHCCHHHHCCCCCCHHHHHHHHHH T ss_conf 999999999999985689976899---88710789999999864999899814414313124432666441999999999 Q ss_pred CCCCCCEEEECCHHHHHHHHH-HCC--CCEEEECCC-CCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCC--CCCCC Q ss_conf 026442366415778889986-088--505872043-101234422125689999861798617504655332--10000 Q gi|254780402|r 179 IFSQFSLVIVQSERYFRRYKE-LGA--QKLIVSGNL-KIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGE--EDKAV 252 (440) Q Consensus 179 ~~~~~~~i~~qs~~~~~rl~~-lG~--~~i~v~Gnl-K~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~E--E~~il 252 (440) ++...|.+++.|+..++.+.+ .|+ +++.|..|- -.+.. .+.+............+++++++.+....+ -+.++ T Consensus 160 ~~~~ad~ii~~s~~~~~~l~~~~~~~~~ki~vi~nGvd~~~f-~p~~~~~~r~~~g~~~~~~~il~vGRl~~~Kg~~~li 238 (405) T TIGR03449 160 LVDNADRLIANTDEEARDLVRHYDADPDRIDVVAPGADLERF-RPGDRATERARLGLPLDTKVVAFVGRIQPLKAPDVLL 238 (405) T ss_pred HHHHCCEEEEECHHHHHHHHHHHCCCHHHEEEECCCCCCCEE-CCCCHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHH T ss_conf 997489999957899999999849886788997799770306-8888589999719898981899955885011489999 Q ss_pred CCCC-CCCCCCCE--EEEEEE----CCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEE Q ss_conf 0000-00123412--488970----5521147888776530010110013444446641675067652025542208206 Q gi|254780402|r 253 YVHN-FIKCRTDV--LTIIVP----RHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAF 325 (440) Q Consensus 253 ~a~~-~l~~~~~~--~lIIvP----RhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~Adiaf 325 (440) +|+. ++++.|+. .++++. ...++.+.+.++.++.|++-...-.+..+. -++..+|+.||+ | T Consensus 239 ~A~~~l~~~~p~~~l~~~v~Gg~~g~~~~~~~~l~~~~~~lgl~~~V~f~G~~~~-----------~~~~~~~~~adv-~ 306 (405) T TIGR03449 239 RAVAELLDRDPDRNLRVIVVGGPSGSGLATPDALIELAAELGIADRVRFLPPRPP-----------EELVHVYRAADV-V 306 (405) T ss_pred HHHHHHHHHCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEECCCCCH-----------HHHHHHHHHCCE-E T ss_conf 9999999868998789999838887536569999999998288875986799889-----------999999995787-6 Q ss_pred EECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 61355444545837689619949998984437999999997898699579899999999995898999999999999997 Q gi|254780402|r 326 IGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVK 405 (440) Q Consensus 326 VGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~ 405 (440) |-.|..+.-|-.++||+++|+|||.. ......|+.++ -.+|......|+++|++++..+++||+.+++|+++|++.++ T Consensus 307 v~PS~~E~fg~~~lEAma~G~PVVas-~~gg~~e~v~~-g~~G~lv~~~d~~~la~ai~~ll~d~~~~~~l~~~~~~~~~ 384 (405) T TIGR03449 307 AVPSYNESFGLVAMEAQACGTPVVAA-RVGGLPVAVAD-GETGLLVDGHDPADWADALARLLDDPRTRIRMGAAAVEHAA 384 (405) T ss_pred ECCCCCCCCCHHHHHHHHCCCCEEEE-CCCCCCEEEEC-CCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 35566678884799999869999991-79986112537-97389979899999999999997599999999999999999 Q ss_pred HCCCHHHHHHHHHHHHHHHHH Q ss_conf 045569999999997523787 Q gi|254780402|r 406 KMQGPLKITLRSLDSYVNPLI 426 (440) Q Consensus 406 ~~~ga~~~~~~~i~~~l~~~~ 426 (440) +. +-+++.+.+.....+++ T Consensus 385 ~f--sw~~~a~~~~~vY~~~L 403 (405) T TIGR03449 385 GF--SWAATADGLLSSYRDAL 403 (405) T ss_pred HC--CHHHHHHHHHHHHHHHH T ss_conf 69--99999999999999996 No 25 >cd03819 GT1_WavL_like This family is most closely related to the GT1 family of glycosyltransferases. WavL in Vibrio cholerae has been shown to be involved in the biosynthesis of the lipopolysaccharide core. Probab=99.79 E-value=2.2e-16 Score=127.23 Aligned_cols=312 Identities=17% Similarity=0.163 Sum_probs=189.8 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCC------C---CHHHHHHHHHHCCEEEEEEC- Q ss_conf 9999999999999861998999962776300121014774168850557------2---01344557442020356404- Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLD------I---QPAVSRFLKYWKPDCMILSE- 136 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D------~---~~~~~~fl~~~~P~~~i~~e- 136 (440) |=-..+..|.+.|.+++.++.|-|. .|.. ........+.+..+|.- . ....+++++..+||++..-. T Consensus 11 GaE~~~~~La~~L~~~Gh~V~vi~~--~~~~-~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~Divh~h~~ 87 (355) T cd03819 11 GVERGTLELARALVERGHRSLVASA--GGRL-VAELEAEGSRHIKLPFISKNPLRILLNVARLRRLIREEKVDIVHARSR 87 (355) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEEC--CCCC-CHHHHHCCCEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEECCC T ss_conf 1799999999999987998999968--9987-155663496699917877882899999999999999969989997786 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHH-HCC--CCEEEECC-C- Q ss_conf 762146788874224550586412665421210011112331026442366415778889986-088--50587204-3- Q gi|254780402|r 137 SDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKE-LGA--QKLIVSGN-L- 211 (440) Q Consensus 137 ~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~-lG~--~~i~v~Gn-l- 211 (440) ...|..+ ..++..++|++.--- ++. .....++..+..-+.+++.|+..++.+.+ .|. +++.+..| + T Consensus 88 ~~~~~~~-~a~~~~~~~~i~t~H-----~~~---~~~~~~~~~~~~~~~~i~~S~~~~~~~~~~~~~~~~ki~vI~ngid 158 (355) T cd03819 88 APAWSAY-LAARRTRPPFVTTVH-----GFY---SVNFRYNAIMARGDRVIAVSNFIADHIRENYGVDPDRIRVIPRGVD 158 (355) T ss_pred CHHHHHH-HHHHHCCCCEEEEEC-----CCC---CHHHHHHHHHHHCCEEEEECHHHHHHHHHHHCCCHHHEEEECCCCC T ss_conf 4499999-999853997899967-----877---4067999999727989994589999999973998789999788756 Q ss_pred --CCCCCCCCCCHHHHHHHHHH---CCCCCEEEC-CCCCC-CCCCCCCCCCCCC-CCCCCEEEEEEECCCCH---HHHHH Q ss_conf --10123442212568999986---179861750-46553-3210000000000-12341248897055211---47888 Q gi|254780402|r 212 --KIDTESLPCDKELLSLYQES---IAGRYTWAA-ISTFE-GEEDKAVYVHNFI-KCRTDVLTIIVPRHPRR---CDAIE 280 (440) Q Consensus 212 --K~d~~~~~~~~~~~~~~~~~---~~~r~v~va-gSth~-~EE~~il~a~~~l-~~~~~~~lIIvPRhpeR---~~~i~ 280 (440) +|+.. ............. ..+.++++. |...+ .--+.+++|+..+ ++.|+..++|+..-+++ .+++. T Consensus 159 ~~~f~~~--~~~~~~~~~~~~~~~~~~~~~~i~~vGRl~~~Kg~~~li~a~~~l~~~~~~~~l~i~G~g~~~~~~~~~~~ 236 (355) T cd03819 159 LDRFDPG--AVPPERILALAREWPLPKGKPVILLPGRLTRWKGQEVFIEALARLKKDDPDVHLLIVGDAQGRRFYYAELL 236 (355) T ss_pred HHHCCCC--CCCHHHHHHHHHHCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCHHHHHHHH T ss_conf 5423877--78778899999862899998799996166544576999999999986489979999707864168999999 Q ss_pred HHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCC-CCCCCCHHHHHHHCCCEEECCCCCCHHH Q ss_conf 776530010110013444446641675067652025542208206613554-4454583768961994999898443799 Q gi|254780402|r 281 RRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFC-ASGGQNPLEAAMLGCAILSGPNVENFRD 359 (440) Q Consensus 281 ~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~-~~GGhN~lEpaa~G~pVi~GP~~~nf~e 359 (440) +.+...|++- .|.+.....++..+|+.||+ ||--|.. +.-|-.++||+++|+|||.-. .....| T Consensus 237 ~~~~~~~l~~-------------~v~f~G~~~d~~~~~~~adi-~v~pS~~~Egf~~vllEAma~G~PvV~s~-~gg~~e 301 (355) T cd03819 237 ELIKRLGLQD-------------RVTFVGHCSDMPAAYALADI-VVSASTEPEAFGRTAVEAQAMGRPVIASD-HGGARE 301 (355) T ss_pred HHHHHCCCCC-------------CEEECCCCHHHHHHHHHHCC-CCCCCCCCCCCHHHHHHHHHCCCCEEEEC-CCCCHH T ss_conf 9999819976-------------28865762146899874032-55887771000078999998699899908-999476 Q ss_pred HHHHHHHCCCEEEECCHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHHCC Q ss_conf 99999978986995798999999999958-98999999999999997045 Q gi|254780402|r 360 IYRRMVSSGAVRIVEEVGTLADMVYSLLS-EPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 360 ~~~~L~~~g~~~~v~d~~eL~~~l~~ll~-d~~~~~~~~~~a~~~i~~~~ 408 (440) +++. -+.|......|.++|++++.++++ ||+.|++||++|++.++++- T Consensus 302 ii~~-~~~G~l~~~~d~~~l~~~i~~~l~~~~~~r~~~~~~ar~~~~~~f 350 (355) T cd03819 302 TVRP-GETGLLVPPGDAEALAQALDQILSLLPEGRAKMFAKARMCVETLF 350 (355) T ss_pred HCCC-CCCEEEECCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHC T ss_conf 6158-997899889999999999999986999999999999999999848 No 26 >TIGR03088 stp2 sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate. Probab=99.79 E-value=8e-16 Score=123.56 Aligned_cols=338 Identities=15% Similarity=0.127 Sum_probs=198.0 Q ss_pred CCHHH-HHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECC----CCCC--HHHHHHHHHHCCEEEEEEC Q ss_conf 78999-999999999998619989999627763001210147741688505----5720--1344557442020356404 Q gi|254780402|r 64 SSVGE-TMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAP----LDIQ--PAVSRFLKYWKPDCMILSE 136 (440) Q Consensus 64 aSvGE-~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P----~D~~--~~~~~fl~~~~P~~~i~~e 136 (440) -+.|= =..+..|++.|.+++.++.|.+.+..|. ..++...+.+.+..++ .|+. +...+.++.++||++-.- T Consensus 11 l~~GGaE~~~~~l~~~l~~~~~~~~vi~~~~~~~-~~~~~~~~~v~~~~l~~~~~~~~~~~~~l~~li~~~kpDiIh~~- 88 (374) T TIGR03088 11 FDVGGLENGLVNLINHLPADRYRHAVVALTEVSA-FRKRIQRPDVAFYALHKQPGKDVAVYPQLYRLLRQLRPDIVHTR- 88 (374) T ss_pred CCCCHHHHHHHHHHHHHHHCCCEEEEEEECCCCH-HHHHHHHCCCEEEECCCCCCCCHHHHHHHHHHHHHHCCCEEEEC- T ss_conf 9998279999999997887599399999789845-78999868988999078876479999999999998398489863- Q ss_pred CCCHHHHHHHHHHCCCCEEEEEEC---CCCCCCCC--CHHHHHHHHHCCCCCCEEEECCHHHHHHHHH-HCC--CCEEEE Q ss_conf 762146788874224550586412---66542121--0011112331026442366415778889986-088--505872 Q gi|254780402|r 137 SDIWPLTVFELSKQRIPQVLVNAR---MSRRSFKN--WKTVLSFSKKIFSQFSLVIVQSERYFRRYKE-LGA--QKLIVS 208 (440) Q Consensus 137 ~ElWPnli~~~~~~~ip~~linar---ls~~S~~~--~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~-lG~--~~i~v~ 208 (440) +--+......+...++|..+.... ..+...++ |+++.++. ....+.+++.|+..++.+.+ +|. +++.+. T Consensus 89 ~~~~~~~~~~~~~~~~~~~i~~~h~~~~~~~~~~~~~~~~~~k~~---~~~~~~~i~vs~~~~~~~~~~~~~~~~ki~~I 165 (374) T TIGR03088 89 NLAALEAQLPAALAGVPARIHGEHGRDVFDLDGSNWKYRWLRRLY---RPLIHHYVAVSRDLEDWLRGPVKVPPAKIHQI 165 (374) T ss_pred CHHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHH---HHHCCEEEECCHHHHHHHHHHHCCCHHHEEEE T ss_conf 611699999999849988999607875437431058999999999---98568899915899999998709987898996 Q ss_pred CCCCCCCCCCCCCHHHHHHHHH--H-CCCCCEEE-CCCCCCCC-CCCCCCCCCCC-CC----CCCEEEEEEECCCCHHHH Q ss_conf 0431012344221256899998--6-17986175-04655332-10000000000-12----341248897055211478 Q gi|254780402|r 209 GNLKIDTESLPCDKELLSLYQE--S-IAGRYTWA-AISTFEGE-EDKAVYVHNFI-KC----RTDVLTIIVPRHPRRCDA 278 (440) Q Consensus 209 GnlK~d~~~~~~~~~~~~~~~~--~-~~~r~v~v-agSth~~E-E~~il~a~~~l-~~----~~~~~lIIvPRhpeR~~~ 278 (440) .|- .|................ . ..++.+++ .|-.++.- -+.+++++..+ ++ .+++.++|+...+++ ++ T Consensus 166 ~Ng-id~~~f~~~~~~~~~~~~~~~~~~~~~~i~~vgRl~~~Kg~~~li~a~~~l~~~~~~~~~~~~l~i~G~G~~~-~~ 243 (374) T TIGR03088 166 YNG-VDTERFHPSRGDRSPILPPDFFADESVVVGTVGRLQAVKDQPTLVRAFALLVRQLPEGAERLRLVIVGDGPAR-GA 243 (374) T ss_pred CCC-CCHHHCCCCCCCHHHHHHHCCCCCCCEEEEEEEECEEECCHHHHHHHHHHHHHHCCCCCCCEEEEEECCCCCH-HH T ss_conf 687-7652158773106776543158987769999663403078799999999999867776688899998177659-99 Q ss_pred HHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHH Q ss_conf 88776530010110013444446641675067652025542208206613554445458376896199499989844379 Q gi|254780402|r 279 IERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFR 358 (440) Q Consensus 279 i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~ 358 (440) +++.+++.|++- .|.+.....++..+|+.|| +||-.|..+.-|-.++||+++|+|||. .++...+ T Consensus 244 l~~~i~~~~l~~-------------~v~f~G~~~~~~~~~~~~D-i~v~~S~~EGf~~~llEAma~g~PvIa-sdvgg~~ 308 (374) T TIGR03088 244 CEQMVRAAGLAH-------------LVWLPGERDDVPALMQALD-LFVLPSLAEGISNTILEAMASGLPVIA-TAVGGNP 308 (374) T ss_pred HHHHHHHCCCCC-------------EEECCCCCCCHHHHHHHCC-CEEECCCCCCCCHHHHHHHHCCCCEEE-CCCCCHH T ss_conf 999999718777-------------5853787468999999639-003134434467799999975997999-1899818 Q ss_pred HHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 9999999789869957989999999999589899999999999999704556999999999752378 Q gi|254780402|r 359 DIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYVNPL 425 (440) Q Consensus 359 e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l~~~ 425 (440) |+.++ -..|......|+++|++++..+++|++.+++|+++|++.+.++- +.++.++...+..+.+ T Consensus 309 eii~~-~~~G~l~~~~d~~~la~~i~~ll~~~~~~~~~~~~a~~~v~~~F-s~~~~~~~~~~lY~~l 373 (374) T TIGR03088 309 ELVQH-GVTGALVPPGDAVALARALQPYVSDPAARRAHGAAGRARAEQQF-SINAMVAAYAGLYDQL 373 (374) T ss_pred HHHCC-CCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHH T ss_conf 98617-98689978999999999999997799999999999999999878-9999999999999996 No 27 >cd03821 GT1_Bme6_like This family is most closely related to the GT1 family of glycosyltransferases. Bme6 in Brucella melitensis has been shown to be involved in the biosynthesis of a polysaccharide. Probab=99.79 E-value=1.8e-16 Score=127.81 Aligned_cols=328 Identities=15% Similarity=0.101 Sum_probs=185.8 Q ss_pred HHHHHHHHHHHHHHHCCCCE-EEEECCCCCCHHHHCCCCCCEEEEECC---------CCCCHHHHH--HHHHHCCEEEEE Q ss_conf 99999999999998619989-999627763001210147741688505---------572013445--574420203564 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNV-LLTTMTATSAKVARKYLGQYAIHQYAP---------LDIQPAVSR--FLKYWKPDCMIL 134 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~i-liT~~T~tg~~~~~~~~~~~~~~~y~P---------~D~~~~~~~--fl~~~~P~~~i~ 134 (440) |=-..+..|++.|.+++.+| ++|+. ..+...-.............+ .-+...... .....+||++.. T Consensus 15 G~e~~~~~la~~L~~~Gh~V~v~t~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Dvi~~ 93 (375) T cd03821 15 GPVRVVLNLSKALAKLGHEVTVATTD-AGGDPLLVALNGVPVKLFSINVAYGLNLARYLFPPSLLAWLRLNIREADIVHV 93 (375) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECC-CCCCCCCCCCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 89999999999999779989999707-98764310246751674166555420133322206899999985489999998 Q ss_pred ECCCCHHH--HHHHHHHCCCCEEEEE-ECCCC-----CCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC-CCE Q ss_conf 04762146--7888742245505864-12665-----421210011112331026442366415778889986088-505 Q gi|254780402|r 135 SESDIWPL--TVFELSKQRIPQVLVN-ARMSR-----RSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA-QKL 205 (440) Q Consensus 135 ~e~ElWPn--li~~~~~~~ip~~lin-arls~-----~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~-~~i 205 (440) -...-.+. ....+++.++|++..- +-+.. ++.++...+....+..+...+.+.+.++........... +++ T Consensus 94 ~~~~~~~~~~~~~~~~~~~~p~i~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~s~~~~~~~~~~~~~~k~ 173 (375) T cd03821 94 HGLWSYPSLAAARAARKYGIPYVVSPHGMLDPWALPHKALKKRLAWFLFERRLLQAAAAVHATSEQEAAEIRRLGLKAPI 173 (375) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCCE T ss_conf 68625669999999998499699997998603444434678889999999998740869997657999999962898888 Q ss_pred EEECCCCCCCCCCCC-CHHHHHHHHHHCCCCCEEE-CCCCCCC-CCCCCCCCCCCC-CCCCCEEEEEEECCC-CHHHHHH Q ss_conf 872043101234422-1256899998617986175-0465533-210000000000-123412488970552-1147888 Q gi|254780402|r 206 IVSGNLKIDTESLPC-DKELLSLYQESIAGRYTWA-AISTFEG-EEDKAVYVHNFI-KCRTDVLTIIVPRHP-RRCDAIE 280 (440) Q Consensus 206 ~v~GnlK~d~~~~~~-~~~~~~~~~~~~~~r~v~v-agSth~~-EE~~il~a~~~l-~~~~~~~lIIvPRhp-eR~~~i~ 280 (440) .+..|- .|...... +............++++++ .|...+. --+.+++|+..+ ++.++..++|+.-.+ +..+.+. T Consensus 174 ~vI~ng-id~~~~~~~~~~~~~~~~~~~~~~~~il~vGRl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~g~~~~~~~~~ 252 (375) T cd03821 174 AVIPNG-VDIPPFAALPSRGRRRKFPILPDKRIILFLGRLHPKKGLDLLIEAFAKLAERFPDWHLVIAGPDEGGYRAELK 252 (375) T ss_pred EEECCC-CCCCCCCCCCHHHHHHHCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHH T ss_conf 998697-2720148862378898548998983899987134321477899999999976798599999789826789999 Q ss_pred HHHHHHCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHH Q ss_conf 7765300101100134444466416750676--52025542208206613554445458376896199499989844379 Q gi|254780402|r 281 RRLIAKGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFR 358 (440) Q Consensus 281 ~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~ 358 (440) +.+++.++.- .|.+...+ -++..+|+.||+ ||-.|..+.-|-.++||+++|+|||.- +....+ T Consensus 253 ~~~~~~~l~~-------------~V~f~G~~~~~~~~~~~~~adi-~v~pS~~Egf~~~~lEAma~G~PvIas-~~gg~~ 317 (375) T cd03821 253 QIAAALGLED-------------RVTFTGMLYGEDKAAALADADL-FVLPSHSENFGIVVAEALACGTPVVTT-DKVPWQ 317 (375) T ss_pred HHHHHHCCCC-------------EEEECCCCCCHHHHHHHHHCCE-ECCCCHHHHHHHHHHHHHHCCCCEEEC-CCCCCH T ss_conf 9999826785-------------5853477683110989851500-146847766458999999859999992-799807 Q ss_pred HHHHHHHHCCCEEEE-CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 999999978986995-7989999999999589899999999999999704556999999 Q gi|254780402|r 359 DIYRRMVSSGAVRIV-EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLR 416 (440) Q Consensus 359 e~~~~L~~~g~~~~v-~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~ 416 (440) |+. +.|++..+ .|+++|++++.+|++|++.+++|+++|++++.++-. -+++.+ T Consensus 318 eii----~~~~g~~~~~~~~~l~~~i~~l~~d~~~~~~~~~~ar~~~~~~fs-w~~~~~ 371 (375) T cd03821 318 ELI----EYGCGWVVDDDVDALAAALRRALELPQRLKAMGENGRALVEERFS-WTAIAQ 371 (375) T ss_pred HEE----ECCCEEEECCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCC-HHHHHH T ss_conf 728----788489949299999999999976999999999999999999589-999999 No 28 >cd03805 GT1_ALG2_like This family is most closely related to the GT1 family of glycosyltransferases. ALG2, a 1,3-mannosyltransferase, in yeast catalyzes the mannosylation of Man(2)GlcNAc(2)-dolichol diphosphate and Man(1)GlcNAc(2)-dolichol diphosphate to form Man(3)GlcNAc(2)-dolichol diphosphate. A deficiency of this enzyme causes an abnormal accumulation of Man1GlcNAc2-PP-dolichol and Man2GlcNAc2-PP-dolichol, which is associated with a type of congenital disorders of glycosylation (CDG), designated CDG-Ii, in humans. Probab=99.78 E-value=2e-16 Score=127.58 Aligned_cols=326 Identities=16% Similarity=0.130 Sum_probs=176.8 Q ss_pred HHHHHHHHHHHHHHHCCCCEE-EEECCCCCCHHHHCCC-CCCEEEE---ECCCCCCHH-------HH--------HHHHH Q ss_conf 999999999999986199899-9962776300121014-7741688---505572013-------44--------55744 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVL-LTTMTATSAKVARKYL-GQYAIHQ---YAPLDIQPA-------VS--------RFLKY 126 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~il-iT~~T~tg~~~~~~~~-~~~~~~~---y~P~D~~~~-------~~--------~fl~~ 126 (440) |--..+..+++.|.+++.+|. +|+.-+.+... .... .....+. ++|-..... .+ .+++. T Consensus 14 GaEr~v~~la~~L~~~Gh~V~v~t~~~d~~~~~-~~~~~~~~~v~~~~~~~p~~~~~~~~~~~~~lr~~~~~~~~~~~~~ 92 (392) T cd03805 14 GAERLVVDAALALQSRGHEVTIYTSHHDPSHCF-EETKDGTLPVRVRGDWLPRSIFGRFHILCAYLRMLYLALYLLLLPD 92 (392) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCC-HHHCCCCEEEEEECCCCCCHHCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 699999999999997699399997268833240-5551785489992674670121237899999999999999986334 Q ss_pred HCCEEEEEECCCCHHHHHHHHHHCCCCEEEE----EECCCCC--CCCC-CHH-HHHHHHHCCCCCCEEEECCHHHHHHHH Q ss_conf 2020356404762146788874224550586----4126654--2121-001-111233102644236641577888998 Q gi|254780402|r 127 WKPDCMILSESDIWPLTVFELSKQRIPQVLV----NARMSRR--SFKN-WKT-VLSFSKKIFSQFSLVIVQSERYFRRYK 198 (440) Q Consensus 127 ~~P~~~i~~e~ElWPnli~~~~~~~ip~~li----narls~~--S~~~-~~~-~~~l~~~~~~~~~~i~~qs~~~~~rl~ 198 (440) .+|++++.-..-.|.-++..+... |++.. +..++.+ ..++ |++ +.++.+..++..|.|.+.|+.+++.+. T Consensus 93 ~~~Dvi~~~~~~~~~~~~~~~~~~--~ii~~~H~p~~~l~~~~~~~~~~y~~~~~~le~~~~~~~d~ii~~S~~~~~~~~ 170 (392) T cd03805 93 EKYDVFIVDQVSACVPLLKLFSPS--KILFYCHFPDQLLAQRGSLLKRLYRKPFDWLEEFTTGMADKIVVNSNFTASVFK 170 (392) T ss_pred CCCEEEEECCCCHHHHHHHHCCCC--CEEEEEECCCHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCEEEECCHHHHHHHH T ss_conf 798099988853479999974699--679996078402102666899999999999999988508899995678999999 Q ss_pred HH----CCCCEEEECCCCCCCCCCC-CCHHHHHHHHHHCCCCCEEECCCCCC--CCCCCCCCCCCCCCC-C---CCEEEE Q ss_conf 60----8850587204310123442-21256899998617986175046553--321000000000012-3---412488 Q gi|254780402|r 199 EL----GAQKLIVSGNLKIDTESLP-CDKELLSLYQESIAGRYTWAAISTFE--GEEDKAVYVHNFIKC-R---TDVLTI 267 (440) Q Consensus 199 ~l----G~~~i~v~GnlK~d~~~~~-~~~~~~~~~~~~~~~r~v~vagSth~--~EE~~il~a~~~l~~-~---~~~~lI 267 (440) +. ..+++.|.-|- .|....+ ..............+++++++.+... .--+.+++|+..+++ . ++++|+ T Consensus 171 ~~~~~~~~~~~~vi~ng-id~~~~~~~~~~~~~~~~~~~~~~~~il~vgRl~~~Kg~~~lI~A~~~l~~~~~~~~~~~Lv 249 (392) T cd03805 171 KTFPSLAKNPREVVYPC-VDTDSFESTSEDPDPGLLIPKSGKKTFLSINRFERKKNIALAIEAFAILKDKLAEFKNVRLV 249 (392) T ss_pred HHHHHHCCCCCEEECCC-CCHHHCCCCCCCHHHHHHCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCCCEEEE T ss_conf 98302065885797798-47776487641044555324678739999844453466899999999999856766885999 Q ss_pred EEECCCCHH-------HHHHHHHHHH-CCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCCCCCCCC Q ss_conf 970552114-------7888776530-0101100134444466416750676--52025542208206613554445458 Q gi|254780402|r 268 IVPRHPRRC-------DAIERRLIAK-GLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFCASGGQN 337 (440) Q Consensus 268 IvPRhpeR~-------~~i~~~l~~~-gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~~~GGhN 337 (440) |+.-.++|. +++.+++.+. |+.- .|...... .++..+|+.||+ ||-.|..+.-|-. T Consensus 250 i~Gg~~~r~~e~~~y~~eL~~l~~~~~~l~~-------------~V~Flg~~~~~~~~~l~~~ad~-~v~~s~~Egfg~v 315 (392) T cd03805 250 IAGGYDPRVAENVEYLEELQRLAEELLLLED-------------QVIFLPSISDSQKELLLSSARA-LLYTPSNEHFGIV 315 (392) T ss_pred EEECCCCCCCHHHHHHHHHHHHHHHHCCCCC-------------EEEECCCCCHHHHHHHHHHCCE-EEECCCCCCCCHH T ss_conf 9818755551018999999999998259878-------------5999888996999999985979-9988743466607 Q ss_pred HHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEE-CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 376896199499989844379999999978986995-798999999999958989999999999999970455699999 Q gi|254780402|r 338 PLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIV-EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITL 415 (440) Q Consensus 338 ~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v-~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~ 415 (440) ++||+++|+|||.-. ....+|+..+ ...++.| .|+++|++++.++++||+.|++||++|++.++++- +.+++. T Consensus 316 ~lEAma~G~PVVasd-~gG~~E~I~~---g~~G~Lv~~d~~~la~~i~~ll~d~~lr~~mg~~ar~~v~~~F-s~e~~a 389 (392) T cd03805 316 PLEAMYAGKPVIACN-SGGPLETVVD---GETGFLCEPTPEEFAEAMLKLANDPDLADRMGAAGRKRVKEKF-STEAFA 389 (392) T ss_pred HHHHHHCCCCEEEEC-CCCCHHHEEC---CCEEEEECCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC-CHHHHH T ss_conf 999997799999948-9986766457---9669995959999999999997899999999999999999866-999998 No 29 >PRK13608 diacylglycerol glucosyltransferase; Provisional Probab=99.77 E-value=9.8e-16 Score=122.98 Aligned_cols=340 Identities=15% Similarity=0.189 Sum_probs=211.7 Q ss_pred CCEEEEEECCHHH--HHHHHHHHHHHHHCCCC---EEEE----ECCCCCCHHHHCCC-------CCCEEEEE-C-CC--C Q ss_conf 8779998178999--99999999999861998---9999----62776300121014-------77416885-0-55--7 Q gi|254780402|r 56 GPLIWFHASSVGE--TMALIGLIPAIRSRHVN---VLLT----TMTATSAKVARKYL-------GQYAIHQY-A-PL--D 115 (440) Q Consensus 56 ~~~IW~HaaSvGE--~~~a~pli~~l~~~~~~---iliT----~~T~tg~~~~~~~~-------~~~~~~~y-~-P~--D 115 (440) ++.|.|=.+|.|. ..+|..|.+++.+.+++ +.+. ..+|.-....++.+ +..-.-+| . |- + T Consensus 5 ~KKVLILtas~G~GH~~AA~AL~~~l~~~~~~~~~v~~~D~~~~~~p~~~~~~~~~Yl~~~k~~p~ly~~~Y~~~~~~~~ 84 (391) T PRK13608 5 NKKILIITGSFGNGHMQVTQSIVNQLNDMNLDHLSVIEHDLFMEAHPILTSICKKWYINSFKYFRNMYKGFYYSRPDKLD 84 (391) T ss_pred CCEEEEEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 88799996898837999999999999850999669999137876484188889999999999999999989854840677 Q ss_pred -------CCHHHHHHHHHHCCEEEEEECCCCHHH-HHHHHHHC---CCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCC Q ss_conf -------201344557442020356404762146-78887422---4550586412665421210011112331026442 Q gi|254780402|r 116 -------IQPAVSRFLKYWKPDCMILSESDIWPL-TVFELSKQ---RIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFS 184 (440) Q Consensus 116 -------~~~~~~~fl~~~~P~~~i~~e~ElWPn-li~~~~~~---~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~ 184 (440) ...-..+++...+||++|.. .|. .+..++++ ++|++.+-.-++- -..|+....| T Consensus 85 ~~~~~~~~~~kl~~~L~~~kPDvII~T----~P~~~~s~lk~~~~~~iP~~tViTD~~~-----------H~~W~~~~~D 149 (391) T PRK13608 85 KCFYKYYGLNKLINLLIKEKPDLILLT----FPTPVMSVLTEQFNINIPVATVMTDYRL-----------HKNWITPYST 149 (391) T ss_pred HHHHHHHHHHHHHHHHHHHCCCEEEEC----CHHHHHHHHHHHCCCCCCEEEEECCHHH-----------HHHHCCCCCC T ss_conf 999999889999999998492999999----8289999999824999988999587133-----------2303689999 Q ss_pred EEEECCHHHHHHHHHHCC--CCEEEECC-C--CCCCCCCCCCHHHHHHHHHHCCCCCE-EECC-CCCCCC-CCCCCCCCC Q ss_conf 366415778889986088--50587204-3--10123442212568999986179861-7504-655332-100000000 Q gi|254780402|r 185 LVIVQSERYFRRYKELGA--QKLIVSGN-L--KIDTESLPCDKELLSLYQESIAGRYT-WAAI-STFEGE-EDKAVYVHN 256 (440) Q Consensus 185 ~i~~qs~~~~~rl~~lG~--~~i~v~Gn-l--K~d~~~~~~~~~~~~~~~~~~~~r~v-~vag-Sth~~E-E~~il~a~~ 256 (440) ..++-+++.++.+.+.|+ ++|.++|= + +|... .+............++++ ++.| |-.-+. .+.++ .. T Consensus 150 ~y~Va~~~~~~~l~~~Gi~~~kI~vtGIPV~~~F~~~---~~~~~~~~~~~l~~~~~~iLv~gG~~G~~~~~~~~i--~~ 224 (391) T PRK13608 150 RYYVATKETKQDFIDVGIDPSTVKVTGIPIDNKFETP---INQKQWLIDNNLDPDKQTILMSAGAFGVSKGFDTMI--TD 224 (391) T ss_pred EEEECCHHHHHHHHHCCCCHHHEEEEEEECCHHHCCC---CCHHHHHHHCCCCCCCCEEEEECCCCCHHHHHHHHH--HH T ss_conf 7996999999999984999768899834358677375---567899997189977768999688631024699999--99 Q ss_pred CCCCCCCEEEEEE-ECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCC Q ss_conf 0012341248897-055211478887765300101100134444466416750676520255422082066135544454 Q gi|254780402|r 257 FIKCRTDVLTIIV-PRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGG 335 (440) Q Consensus 257 ~l~~~~~~~lIIv-PRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GG 335 (440) ++...++..++++ .|.- ++.+.++.. + .....|.+..=.-++..|++.||+ +|. +-|| T Consensus 225 ll~~~~~~qivvvcGrN~----~L~~~L~~~-~-----------~~~~~v~vlG~t~~m~~lM~asDl-lIT----KpGG 283 (391) T PRK13608 225 ILAKSANAQVVMICGKSK----ELKRSLTAK-F-----------KSNENVLILGYTKHMNEWMASSQL-MIT----KPGG 283 (391) T ss_pred HHHCCCCCEEEEEECCCH----HHHHHHHHH-H-----------CCCCCEEEECCCCCHHHHHHHCCE-EEE----CCCH T ss_conf 971599965999908999----999999976-2-----------459976997070519999986529-996----7866 Q ss_pred CCHHHHHHHCCCEEE-CCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHH Q ss_conf 583768961994999-8984437999999997898699579899999999995898999999999999997045569999 Q gi|254780402|r 336 QNPLEAAMLGCAILS-GPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKIT 414 (440) Q Consensus 336 hN~lEpaa~G~pVi~-GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~ 414 (440) -+..||+++|+|++. +|--.-=.+-++-|.+.|++..+++++++.+.+..|++||++.+.|.+++++. ....|+.++ T Consensus 284 lT~sEAla~~lPmii~~piPGQEe~Na~~l~~~G~a~~~~~~~~~~~~v~~l~~~~~~l~~m~~~~~~~--~~p~a~~~I 361 (391) T PRK13608 284 ITITEGFARCIPMIFLNPAPGQELENALYFEEKGFGKIADTPEEAIKIVASLTNGNEQLTNMISTMEQD--KIKYATQTI 361 (391) T ss_pred HHHHHHHHHCCCEEECCCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHH--CCCCHHHHH T ss_conf 799999995899897579997446679999968976885999999999999855999999999999971--799629999 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 999997523787850167899754 Q gi|254780402|r 415 LRSLDSYVNPLIFQNHLLSKDPSF 438 (440) Q Consensus 415 ~~~i~~~l~~~~~~~~~~~~~~~~ 438 (440) .+.|...++..--+.+|-+|.|-| T Consensus 362 ~~~~~~l~~~~~~~~~~~~~~~~~ 385 (391) T PRK13608 362 CRDLLDLIGHSSQPQEIYGKVPLY 385 (391) T ss_pred HHHHHHHHHCCCCCHHHCCCCCHH T ss_conf 999999872425603430224188 No 30 >cd03823 GT1_ExpE7_like This family is most closely related to the GT1 family of glycosyltransferases. ExpE7 in Sinorhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucans (exopolysaccharide II). Probab=99.77 E-value=5e-16 Score=124.93 Aligned_cols=297 Identities=14% Similarity=0.119 Sum_probs=176.5 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEEC--------CC-------------CCCHHHHHHHH Q ss_conf 9999999999999861998999962776300121014774168850--------55-------------72013445574 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYA--------PL-------------DIQPAVSRFLK 125 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~--------P~-------------D~~~~~~~fl~ 125 (440) |=-..+..|.+.|.+++.+|-|-|.++......... ........ +. ......+++++ T Consensus 16 G~e~~~~~la~~L~~~Gh~V~v~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 93 (359) T cd03823 16 GAEVVAHDLAEALAKRGHEVAVLTAGEDPPRQDKEV--IGVVVYGRPIDEVLRSALPRDLFHLSDYDNPAVVAEFARLLE 93 (359) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCC--CCEEEEECCCCCCCCCCCCCHHHHHHHHCCHHHHHHHHHHHH T ss_conf 499999999999997799899995579876642135--761797047642003543101567764227899999999998 Q ss_pred HHCCEEEEEECCCCH-HHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC-- Q ss_conf 420203564047621-46788874224550586412665421210011112331026442366415778889986088-- Q gi|254780402|r 126 YWKPDCMILSESDIW-PLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA-- 202 (440) Q Consensus 126 ~~~P~~~i~~e~ElW-Pnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~-- 202 (440) .++||++.+-....| ...+..+++.++|++.--- .+-++......+.+..|.+++.|+..++.+.+.|. T Consensus 94 ~~~pDivh~h~~~~~~~~~~~~~~~~~~p~v~t~h--------~~~~~~~~~~~~~~~~d~vi~~S~~~~~~~~~~~~~~ 165 (359) T cd03823 94 DFRPDVVHFHHLQGLGVSILRAARDRGIPIVLTLH--------DYWLICPRQGLFKKGGDAVIAPSRFLLDRYVANGLFA 165 (359) T ss_pred HCCCCEEEECCCHHHHHHHHHHHHHCCCCEEEEEC--------CHHHCCCHHHHHHCCCCEEEECCHHHHHHHHHHCCCC T ss_conf 74999999888317679999999984998999972--------2211061778874589999999999999999808993 Q ss_pred CCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEE-CCCCCCC-CCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHH Q ss_conf 5058720431012344221256899998617986175-0465533-2100000000001234124889705521147888 Q gi|254780402|r 203 QKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWA-AISTFEG-EEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIE 280 (440) Q Consensus 203 ~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~v-agSth~~-EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~ 280 (440) +++.+..|- .|......... ....++.+++ +|...+. --+.+++|++.++ .++..|+|+..-|++-.. . T Consensus 166 ~~i~vI~ng-vd~~~~~~~~~------~~~~~~~~i~~vGRl~~~Kg~~~li~a~~~l~-~~~~~l~i~G~g~~~~~~-~ 236 (359) T cd03823 166 EKISVIRNG-IDLDRAKRPRR------APPGGRLRFGFIGQLTPHKGVDLLLEAFKRLP-RGDIELVIVGNGLELEEE-S 236 (359) T ss_pred CCEEEECCC-CCHHHCCCCCC------CCCCCCEEEEEECCCCCCCCHHHHHHHHHHCC-CCCCEEEEECCCCHHHHH-H T ss_conf 238998898-68454274333------45678749999588976259999999998555-578289997786056899-9 Q ss_pred HHHHHHCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCC-CCCCCCCHHHHHHHCCCEEECCCCCCH Q ss_conf 7765300101100134444466416750676--5202554220820661355-444545837689619949998984437 Q gi|254780402|r 281 RRLIAKGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSF-CASGGQNPLEAAMLGCAILSGPNVENF 357 (440) Q Consensus 281 ~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl-~~~GGhN~lEpaa~G~pVi~GP~~~nf 357 (440) .. .+. ...|.+...+ .++..+|+.|| +||-.|. .+.-|..++||+++|+|||... .... T Consensus 237 ~~---~~~-------------~~~v~f~G~~~~~~~~~~~~~ad-i~v~pS~~~E~fg~~~lEAma~G~PvIas~-~gg~ 298 (359) T cd03823 237 YE---LEG-------------DPRVEFLGAYPQEEIDDFYAEID-VLVVPSIWPENFPLVIREALAAGVPVIASD-IGGM 298 (359) T ss_pred HH---HHC-------------CCCCEECCCCCHHHHHHHHHHCC-CCCCCCCCCCCCCHHHHHHHHCCCCEEECC-CCCC T ss_conf 99---725-------------77648806567899999998657-365677565777479999998299899888-9981 Q ss_pred HHHHHHHHHCCCEEE--ECCHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 999999997898699--5798999999999958989999999999999 Q gi|254780402|r 358 RDIYRRMVSSGAVRI--VEEVGTLADMVYSLLSEPTIRYEMINAAINE 403 (440) Q Consensus 358 ~e~~~~L~~~g~~~~--v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~ 403 (440) +|+.+ +...+.. ..|+++|++++.+|++||+++++|++++++- T Consensus 299 ~e~i~---~g~~G~lv~~~d~~~la~ai~~ll~d~~~~~~~~~~~~~~ 343 (359) T cd03823 299 AELVR---DGVNGLLFPPGDAEDLAAALERLIDDPDLLERLRAGIEPP 343 (359) T ss_pred HHHHC---CCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHC T ss_conf 75603---7986799899999999999999984999999999999974 No 31 >cd04951 GT1_WbdM_like This family is most closely related to the GT1 family of glycosyltransferases and is named after WbdM in Escherichia coli. In general glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have Probab=99.77 E-value=6.3e-16 Score=124.24 Aligned_cols=326 Identities=16% Similarity=0.136 Sum_probs=190.7 Q ss_pred CCHHH-HHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEEC-----CCCCCH---HHHHHHHHHCCEEEEE Q ss_conf 78999-99999999999861998999962776300121014774168850-----557201---3445574420203564 Q gi|254780402|r 64 SSVGE-TMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYA-----PLDIQP---AVSRFLKYWKPDCMIL 134 (440) Q Consensus 64 aSvGE-~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~-----P~D~~~---~~~~fl~~~~P~~~i~ 134 (440) -+.|= -..+..|+++|.+++.+|-|-+.++..... .......+..+ |.++.. ..+++++.++||++.. T Consensus 9 l~~GG~e~~~~~la~~L~~~G~~V~v~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~pDvIh~ 85 (360) T cd04951 9 LGLGGAEKQVVDLADQFVAKGHQVAIISLTGESEVK---PPIDATIILNLNMSKNPLSFLLALWKLRKILRQFKPDVVHA 85 (360) T ss_pred CCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCCCCC---CCCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHCCCEEEE T ss_conf 998137999999999999769989999817985444---33457337863766676789999999999999829999998 Q ss_pred ECCCCHHHHHHHHH---HCCCCEEEEEECCCCCCCCCCHHHHH-HHHHCCCCCCEEEECCHHHHHHHHHHC---CCCEEE Q ss_conf 04762146788874---22455058641266542121001111-233102644236641577888998608---850587 Q gi|254780402|r 135 SESDIWPLTVFELS---KQRIPQVLVNARMSRRSFKNWKTVLS-FSKKIFSQFSLVIVQSERYFRRYKELG---AQKLIV 207 (440) Q Consensus 135 ~e~ElWPnli~~~~---~~~ip~~linarls~~S~~~~~~~~~-l~~~~~~~~~~i~~qs~~~~~rl~~lG---~~~i~v 207 (440) - -.+++++.... ..++|++. ... +.+...+++. .++..-...+.+...|+...+.+...+ .+++.+ T Consensus 86 h--~~~~~~~~~~~~~~~~~~~~i~-t~h----~~~~~~~~~~~~~~~~~~~~~~~~~vs~~~~~~~~~~~~~~~~ki~v 158 (360) T cd04951 86 H--MFHANIFARLLRLFLPSPPLIC-TAH----SKNEGGRLRMLAYRLTDFLSDLTTNVSKEALDYFIASKAFNANKSFV 158 (360) T ss_pred C--CCCHHHHHHHHHHHCCCCCEEE-EEC----CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHEEE T ss_conf 6--6307899999998579981999-858----88754179999999998887865233399999999855888444899 Q ss_pred ECCCCCCCCCCCCCHHHHHHHHHHC---CCCCEEEC-CCCCCC-CCCCCCCCCCC-CCCCCCEEEEEEECCCCHHHHHHH Q ss_conf 2043101234422125689999861---79861750-465533-21000000000-012341248897055211478887 Q gi|254780402|r 208 SGNLKIDTESLPCDKELLSLYQESI---AGRYTWAA-ISTFEG-EEDKAVYVHNF-IKCRTDVLTIIVPRHPRRCDAIER 281 (440) Q Consensus 208 ~GnlK~d~~~~~~~~~~~~~~~~~~---~~r~v~va-gSth~~-EE~~il~a~~~-l~~~~~~~lIIvPRhpeR~~~i~~ 281 (440) ..|- .|......+.......+..+ .+.++++. |...+. --+.++++++. .++.++..++|+..-|++ +++.+ T Consensus 159 I~ng-vd~~~f~~~~~~~~~~r~~~~~~~~~~~il~vgRl~~~Kg~~~li~a~~~l~~~~~~~~l~ivG~G~~~-~~l~~ 236 (360) T cd04951 159 VYNG-IDTDRFRKDPARRLKIRNALGVKNDTFVILAVGRLVEAKDYPNLLKAFAKLLSDYLDIKLLIAGDGPLR-ATLER 236 (360) T ss_pred ECCC-CCHHHCCCCHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCHHH-HHHHH T ss_conf 6687-344421876156788998619998987999984066331157899999999864899799996782567-88876 Q ss_pred HHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHH Q ss_conf 76530010110013444446641675067652025542208206613554445458376896199499989844379999 Q gi|254780402|r 282 RLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIY 361 (440) Q Consensus 282 ~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~ 361 (440) .+++.|+.- .|.+....-++..+|+.||+ ||-.|..+.=|..++||+++|+|||.. .....+|+. T Consensus 237 ~~~~~~l~~-------------~v~f~G~~~d~~~~~~~adi-~v~pS~~Egfg~~~lEAma~G~PvI~s-~~gg~~eii 301 (360) T cd04951 237 LIKALGLSN-------------RVKLLGLRDDIAAYYNAADL-FVLSSAWEGFGLVVAEAMACELPVVAT-DAGGVREVV 301 (360) T ss_pred HHHHHCCCC-------------CEEECCCCHHHHHHHHHCCC-CCCCCCCCCCCHHHHHHHHCCCCEEEC-CCCCHHHEE T ss_conf 677617776-------------15424751026898762142-558865578870899999859999987-899726557 Q ss_pred HHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 99997898699579899999999995898999999999999997045569999999997 Q gi|254780402|r 362 RRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDS 420 (440) Q Consensus 362 ~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~ 420 (440) + ++|......|+++|++++.++++|++.+++|+++|++.+.++- +.+++.+...+ T Consensus 302 ~---~~G~lv~~~d~~~la~~i~~ll~~~~~~~~~~~~~~~~v~~~f-s~~~~~~~~~~ 356 (360) T cd04951 302 G---DSGLIVPISDPEALANKIDEILKMSGEERDIIGARRERIVKKF-SINSIVQQWLT 356 (360) T ss_pred C---CCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC-CHHHHHHHHHH T ss_conf 4---8649983999999999999998791999999999999999869-99999999999 No 32 >TIGR03087 stp1 sugar transferase, PEP-CTERM/EpsH1 system associated. Members of this family include a match to the pfam00534 Glycosyl transferases group 1 domain. Nearly all are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria. In particular, these transferases are found proximal to a particular variant of exosortase, EpsH1, which appears to travel with a conserved group of genes summarized by Genome Property GenProp0652. The nature of the sugar transferase reaction catalyzed by members of this clade is unknown and may conceivably be variable with respect to substrate by species, but we hypothesize a conserved substrate. Probab=99.71 E-value=9.8e-15 Score=116.43 Aligned_cols=270 Identities=17% Similarity=0.205 Sum_probs=158.4 Q ss_pred HHHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEE--EE------CC--CCCCCCC------CHHHHHHHHHCCCCCCE Q ss_conf 557442020356404762146788874224550586--41------26--6542121------00111123310264423 Q gi|254780402|r 122 RFLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLV--NA------RM--SRRSFKN------WKTVLSFSKKIFSQFSL 185 (440) Q Consensus 122 ~fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~li--na------rl--s~~S~~~------~~~~~~l~~~~~~~~~~ 185 (440) +.+...++|.+++.-..+-+. +.. ..+++|.++- |. +. +.+..++ ++++..+-+.+++.+|. T Consensus 98 ~~~~~~~~D~i~~~~~~~a~y-l~~-~~~~~p~ild~hdv~s~~~~~~a~~~~~~~~~~~~~e~~~l~~~E~~~~~~~d~ 175 (397) T TIGR03087 98 ALLAAEPVDAIVVFSSAMAQY-VTP-HVRGVPRIVDFVDVDSDKWLQYARTKRWPLRWIYRREGRLLLAYERAIAARFDA 175 (397) T ss_pred HHHHCCCCCEEEEECHHHHHH-HHH-HHCCCCEEEEEECHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHCCE T ss_conf 996069986899916576775-344-523799899985343489999987345467789999999999999999996699 Q ss_pred EEECCHHHHHHHHHHCC---CCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCC-EEECCCC-CCCCCCCCC----CCCC Q ss_conf 66415778889986088---5058720431012344221256899998617986-1750465-533210000----0000 Q gi|254780402|r 186 VIVQSERYFRRYKELGA---QKLIVSGNLKIDTESLPCDKELLSLYQESIAGRY-TWAAIST-FEGEEDKAV----YVHN 256 (440) Q Consensus 186 i~~qs~~~~~rl~~lG~---~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~-v~vagSt-h~~EE~~il----~a~~ 256 (440) +++.|+.|++.+++++. .++.+..|- .|............ ....+++ ++-.||. |+...+.+. +++. T Consensus 176 ~~~vS~~d~~~~~~~~~~~~~~i~vipnG-vd~~~f~p~~~~~~---~~~~~~~~i~f~G~~~~~pN~da~~~f~~~v~p 251 (397) T TIGR03087 176 ATFVSRAEAELFRRLAPEAAGRITAFPNG-VDADFFSPDRDYPN---PYPPGKRVLVFTGAMDYWPNIDAVVWFAERVFP 251 (397) T ss_pred EEEECHHHHHHHHHHCCCCCCEEEECCCC-CCCCCCCCCCCCCC---CCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHH T ss_conf 99977999999987477767727765787-55123687644457---667889879999717872309999999999999 Q ss_pred CC-CCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCC Q ss_conf 00-12341248897055211478887765300101100134444466416750676520255422082066135544454 Q gi|254780402|r 257 FI-KCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGG 335 (440) Q Consensus 257 ~l-~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GG 335 (440) .+ ++.|++.+.|+.+.|.. ++.++.+.. .|.+...+-++..+|+.|+++++ .-..+.|- T Consensus 252 ~l~~~~p~~~~~ivG~~p~~--~~~~l~~~~-----------------~V~~~G~V~d~~~~~~~a~v~v~-Pl~~g~G~ 311 (397) T TIGR03087 252 AVRARRPAAEFYIVGAKPSP--AVRALAALP-----------------GVTVTGSVADVRPYLAHAAVAVA-PLRIARGI 311 (397) T ss_pred HHHHHCCCCEEEEEECCCCH--HHHHHCCCC-----------------CEEEEEECCCHHHHHHCCCEEEE-ECCCCCCC T ss_conf 99987899879999089629--999851799-----------------97997654986999961989999-46544575 Q ss_pred C-CHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHH Q ss_conf 5-837689619949998984437999999997898699579899999999995898999999999999997045569999 Q gi|254780402|r 336 Q-NPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKIT 414 (440) Q Consensus 336 h-N~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~ 414 (440) . -++|++++|+||+.-+.- .|.. .+.+.+.+...+|++++++++.+|++||+.+++||.+|++++.++- +=+++ T Consensus 312 ~~KilEama~g~PvVst~~g---~egl-~~~~g~~~lia~~~~~fa~~i~~Ll~d~~~~~~l~~~~r~~v~~~y-sW~~~ 386 (397) T TIGR03087 312 QNKVLEAMAMAKPVVASPEA---AEGI-DALPGAELLVAADPADFAAAILALLANPAEREELGQAARRRVLQHY-HWPRN 386 (397) T ss_pred CHHHHHHHHCCCCEEECCCC---CCCC-CCCCCCEEEECCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC-CHHHH T ss_conf 35799999769989977743---0244-3678970595799999999999998199999999999999999829-99999 Q ss_pred HHHHHHHH Q ss_conf 99999752 Q gi|254780402|r 415 LRSLDSYV 422 (440) Q Consensus 415 ~~~i~~~l 422 (440) .+.++..+ T Consensus 387 ~~~le~~~ 394 (397) T TIGR03087 387 LARLDALL 394 (397) T ss_pred HHHHHHHH T ss_conf 99999986 No 33 >PRK00726 murG N-acetylglucosaminyl transferase; Provisional Probab=99.71 E-value=1.5e-13 Score=108.60 Aligned_cols=319 Identities=15% Similarity=0.175 Sum_probs=189.9 Q ss_pred EEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCC-CEEEEECCCCC----------------C--- Q ss_conf 799981789999999999999986199899996277630012101477-41688505572----------------0--- Q gi|254780402|r 58 LIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQ-YAIHQYAPLDI----------------Q--- 117 (440) Q Consensus 58 ~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~-~~~~~y~P~D~----------------~--- 117 (440) .+-.-+-+-|-+.-+..+.++|++++.+|+.-+ |..|.+. +..+. ......+|..- . T Consensus 4 I~i~~GGTGGHi~Palala~~L~~~g~ev~~ig-~~~g~E~--~~v~~~~~~~~~i~~~~~~~~~~~~~~~~~~~l~~~~ 80 (359) T PRK00726 4 ILLAGGGTGGHVFPALALAEELKKRGWEVLWLG-TKRGMEA--RLVPKAGIEFHFIPIGGLRRKGSLANLKAPFKLLKGV 80 (359) T ss_pred EEEEECCCHHHHHHHHHHHHHHHHCCCEEEEEE-CCCHHHH--HHCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 999958868999999999999983879899997-8826865--4044149838997778889878799999999999999 Q ss_pred HHHHHHHHHHCCEEEEEECC-CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCC-CEEEECCHHHHH Q ss_conf 13445574420203564047-6214678887422455058641266542121001111233102644-236641577888 Q gi|254780402|r 118 PAVSRFLKYWKPDCMILSES-DIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQF-SLVIVQSERYFR 195 (440) Q Consensus 118 ~~~~~fl~~~~P~~~i~~e~-ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~-~~i~~qs~~~~~ 195 (440) ..+.++++.++|+++|-+.+ --.|-++ .++..+||+++-.... .+....+++.+| +.|++--++... T Consensus 81 ~~~~~il~~~kPd~Vig~GGY~s~P~~l-aA~l~~iP~iiHEqN~----------v~G~aNr~l~~~a~~i~~~f~~~~~ 149 (359) T PRK00726 81 LQARKILKRFKPDVVVGFGGYVSGPAGL-AARLLGIPLVIHEQNA----------VPGLANKLLARFAKKVATAFEETPK 149 (359) T ss_pred HHHHHHHHHCCCCEEEECCCHHHHHHHH-HHHHCCCCEEEECCCC----------CCCCCHHHHHHHCCEEEECCHHHHH T ss_conf 9999999974999999789741289999-9998299869974542----------3562337888850978997755540 Q ss_pred HHHHHCCCCEEEECCC-CCCCCCCCCCHHHHHHHHHHCCC-CCEEECCCCCCCC--CCCCCCCCCCCCCCCCEEEEEEEC Q ss_conf 9986088505872043-10123442212568999986179-8617504655332--100000000001234124889705 Q gi|254780402|r 196 RYKELGAQKLIVSGNL-KIDTESLPCDKELLSLYQESIAG-RYTWAAISTFEGE--EDKAVYVHNFIKCRTDVLTIIVPR 271 (440) Q Consensus 196 rl~~lG~~~i~v~Gnl-K~d~~~~~~~~~~~~~~~~~~~~-r~v~vagSth~~E--E~~il~a~~~l~~~~~~~lIIvPR 271 (440) .+. ..++.++||. |-.... .+.. .......+ ..+++.|.+.-.+ .+.+.++...+.+..+..++.... T Consensus 150 ~~~---~~k~~~~G~PvR~~~~~--~~~~---~~~~~~~~~~~iLV~GGSqGa~~~N~~v~~~l~~l~~~~~~~i~~~~G 221 (359) T PRK00726 150 AFP---KAKAVVTGNPVREEILA--LPAP---AFRLAGRGPPTLLVVGGSQGARVLNEAVPEALALLPEELRIQVIHQTG 221 (359) T ss_pred CCC---CCCEEEECCCCCHHHHC--CCHH---HHHHCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 376---24559967840277661--4333---332104788579997685204789999999999876525908999828 Q ss_pred CCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCC-HHHHHHHCCCEEE Q ss_conf 521147888776530010110013444446641675067652025542208206613554445458-3768961994999 Q gi|254780402|r 272 HPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQN-PLEAAMLGCAILS 350 (440) Q Consensus 272 hpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN-~lEpaa~G~pVi~ 350 (440) ++..+++.+...+.+. ++.+.+=+-+|..+|+.||+++. ++|.+ +.|.+++|+|.|. T Consensus 222 -~~~~~~~~~~~~~~~~---------------~~~v~~f~~~m~~~~~~aDlvIs------RaGa~Ti~E~~~~g~P~Il 279 (359) T PRK00726 222 -KKDLEEVRAAYAELGV---------------NAEVVPFIDDMAAAYAAADLVIC------RAGASTVAELAAAGLPAIL 279 (359) T ss_pred -CCHHHHHHHHHHHCCC---------------CEEECCCCCHHHHHHHCCCEEEE------CCCCCHHHHHHHHCCCEEE T ss_conf -4039999999986599---------------76975752318998740889998------8983269999982898699 Q ss_pred CCC---CCCH-HHHHHHHHHCCCEEEEC----CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 898---4437-99999999789869957----989999999999589899999999999999704556999999999752 Q gi|254780402|r 351 GPN---VENF-RDIYRRMVSSGAVRIVE----EVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYV 422 (440) Q Consensus 351 GP~---~~nf-~e~~~~L~~~g~~~~v~----d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l 422 (440) =|+ .+|. ..-++.+.+.|++..+. +++.|.+.+..+++||+.+++|+.+++++.. .+|++++.+.|++.. T Consensus 280 IP~p~a~~~HQ~~NA~~l~~~gaa~~i~e~~~~~~~L~~~i~~ll~d~~~l~~m~~~~~~~~~--~~a~~~i~~~i~~~~ 357 (359) T PRK00726 280 VPLPHAADDHQTANARALVDAGAAKLIPQSDLTPERLAEALLELLSDRERLEAMAEAARALGI--PDAAERLADLIEKLA 357 (359) T ss_pred EECCCCCCCHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC--CCHHHHHHHHHHHHH T ss_conf 836877753899999999978999995314699999999999998699999999999972489--789999999999985 No 34 >cd03825 GT1_wcfI_like This family is most closely related to the GT1 family of glycosyltransferases. wcfI in Bacteroides fragilis has been shown to be involved in the capsular polysaccharide biosynthesis. Probab=99.71 E-value=5e-14 Score=111.75 Aligned_cols=315 Identities=16% Similarity=0.132 Sum_probs=171.1 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCH-HHHCCCCCCEEEEE--C-CCCCCHHHHHHHHHHCCEEEEEECCCCHHH Q ss_conf 9999999999999861998999962776300-12101477416885--0-557201344557442020356404762146 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAK-VARKYLGQYAIHQY--A-PLDIQPAVSRFLKYWKPDCMILSESDIWPL 142 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~-~~~~~~~~~~~~~y--~-P~D~~~~~~~fl~~~~P~~~i~~e~ElWPn 142 (440) |=-..+..|.+.|.+++.++-+-+...++.. ......+ .+.|.- . .+-.....+++.+. .| +++.-.|.||- T Consensus 14 Gae~~~~~L~~~L~~~Gh~v~v~~~~~~~~~~~~~~~~~-dvvh~h~~~~~~~~~~~l~~l~~~-~p--~v~t~Hd~~~~ 89 (365) T cd03825 14 GAARAAYRLHRALQAAGVDSTMLVQEKKALISKIEIINA-DIVHLHWIHGGFLSIEDLSKLLDR-KP--VVWTLHDMWPF 89 (365) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHCCC-CEEEECCCCCCHHCHHHHHHHHCC-CC--EEEEECCCCHH T ss_conf 389999999999997799089999269847776655289-989984776102059999999708-99--89996356522 Q ss_pred HHHHHHHC----------CCCEEEEEECCCCCCCCCCHHHHHHHHHCCC-CCCEEEECCHHHHHHHHHHC-C--CCEEEE Q ss_conf 78887422----------4550586412665421210011112331026-44236641577888998608-8--505872 Q gi|254780402|r 143 TVFELSKQ----------RIPQVLVNARMSRRSFKNWKTVLSFSKKIFS-QFSLVIVQSERYFRRYKELG-A--QKLIVS 208 (440) Q Consensus 143 li~~~~~~----------~ip~~linarls~~S~~~~~~~~~l~~~~~~-~~~~i~~qs~~~~~rl~~lG-~--~~i~v~ 208 (440) .-.+.... .+|.+-. .......++.+.. .+..+. .-..+.+.|+..++.+.+.+ . .++.|+ T Consensus 90 tg~~~~~~~~~~~~~~~~~~p~l~~---~~~~~~~~~~~~~--~~~~~~~~~~~iv~~S~~~~~~~~~~~~~~~~ki~vI 164 (365) T cd03825 90 TGGCHYPGGCDRYKTECGNCPQLGS---YPEKDLSRWIWRR--KRKAWADLNLTIVAPSRWLADCARSSSLFKGIPIEVI 164 (365) T ss_pred HCCCHHCCHHHHHHHCCCCCCCCCC---CCHHHHHHHHHHH--HHHHHHHCCCEEEECCHHHHHHHHHCCCCCCCCEEEE T ss_conf 1510100011343020465777566---5534667999999--9999852599899869899999997248898978998 Q ss_pred CCCCCCCC-CCCCCHHHHHHHHHHCCCCCEEECCCC---C--CCCCCCCCCCCCCCCC--CCCEEEEEEECCCCHHHHHH Q ss_conf 04310123-442212568999986179861750465---5--3321000000000012--34124889705521147888 Q gi|254780402|r 209 GNLKIDTE-SLPCDKELLSLYQESIAGRYTWAAIST---F--EGEEDKAVYVHNFIKC--RTDVLTIIVPRHPRRCDAIE 280 (440) Q Consensus 209 GnlK~d~~-~~~~~~~~~~~~~~~~~~r~v~vagSt---h--~~EE~~il~a~~~l~~--~~~~~lIIvPRhpeR~~~i~ 280 (440) .|- .|.. ..+.+............+++++++|+. + +| -+.+++|+..+++ .++..|+++...+++.. T Consensus 165 ~Ng-id~~~f~p~~~~~~r~~~~~~~~~~vi~~~~~~~~~~~Kg-~~~li~A~~~l~~~~~~~~~lvi~G~~~~~~~--- 239 (365) T cd03825 165 PNG-IDTTIFRPRDKREARKRLGLPADKKIILFGAVGGTDPRKG-FDELIEALKRLAERWKDDIELVVFGASDPEIP--- 239 (365) T ss_pred CCC-CCHHHCCCCCHHHHHHHHCCCCCCCEEEEEEECCCCCCCC-HHHHHHHHHHHHHCCCCCEEEEEECCCCHHHH--- T ss_conf 997-3646449868899999839798885899953001564324-79999999987650689889999379858889--- Q ss_pred HHHHHHCCHHHHHCCCCCCCCCCEEEEECCC---CCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCH Q ss_conf 7765300101100134444466416750676---5202554220820661355444545837689619949998984437 Q gi|254780402|r 281 RRLIAKGLKVARRSRGDVINAEVDIFLGDTI---GEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENF 357 (440) Q Consensus 281 ~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~---GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf 357 (440) .. .+. +|...--+ -.+..+|+.|| +||-.|..+.-|-.++||+++|+|||.- ..... T Consensus 240 ~~---l~~---------------~v~flG~~~~~~~l~~~~~~aD-i~v~pS~~Egfg~v~lEAma~G~PVVas-d~gg~ 299 (365) T cd03825 240 PD---LPF---------------PVHYLGSLNDDESLALIYSAAD-VFVVPSLQENFPNTAIEALACGTPVVAF-DVGGI 299 (365) T ss_pred HH---CCC---------------CEEEECCCCCHHHHHHHHHHCC-EEECCCCCCCCCHHHHHHHHCCCCEEEC-CCCCH T ss_conf 66---899---------------7999268799899999997272-9951677688859999999719989973-89985 Q ss_pred HHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 999999997898699579899999999995898999999999999997045569999999 Q gi|254780402|r 358 RDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRS 417 (440) Q Consensus 358 ~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~ 417 (440) +|++++ -.+|......|+++|++++..+++||+.+++|+++|++.++++-.. +++.+. T Consensus 300 ~eiv~~-~~~G~lv~~~d~~~la~ai~~ll~d~~~~~~~~~~ar~~~~~~fs~-~~~~~~ 357 (365) T cd03825 300 PDIVDH-GVTGYLAKPGDPEDLAEGIEWLLADPDEREELGEAARELAENEFDS-RVQAKR 357 (365) T ss_pred HHHHCC-CCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCH-HHHHHH T ss_conf 999607-9827997999999999999999869999999999999999986899-999999 No 35 >cd04955 GT1_like_6 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=99.70 E-value=1.8e-14 Score=114.72 Aligned_cols=319 Identities=15% Similarity=0.074 Sum_probs=176.5 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCH---------HHHHHHHHHCCEEEEEECC Q ss_conf 9999999999999861998999962776300121014774168850557201---------3445574420203564047 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQP---------AVSRFLKYWKPDCMILSES 137 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~---------~~~~fl~~~~P~~~i~~e~ 137 (440) |=-..+..|.+.|.+++.+|-|-|.++........ .+.+.+.++|.-... .........+|+..++-- T Consensus 16 G~e~~v~~La~~L~~~Gh~V~v~t~~~~~~~~~~~--~~gv~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~h~- 92 (363) T cd04955 16 GFETFVEELAPRLVARGHEVTVYCRSPYPKQKETE--YNGVRLIHIPAPEIGGLGTIIYDILAILHALFVKRDIDHVHA- 92 (363) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCEE--ECCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEE- T ss_conf 78999999999999779979999878988887513--477799992764445167789899999999860899899997- Q ss_pred CCHHH---HHHHHHHCCCCEEEE-EECCCCCCCCC--CHH-----HHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEE Q ss_conf 62146---788874224550586-41266542121--001-----11123310264423664157788899860885058 Q gi|254780402|r 138 DIWPL---TVFELSKQRIPQVLV-NARMSRRSFKN--WKT-----VLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLI 206 (440) Q Consensus 138 ElWPn---li~~~~~~~ip~~li-narls~~S~~~--~~~-----~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~ 206 (440) .+|. +...++..++|++.- .| + ++.+ |.. ++...+...+..|.+.+.|+..++.+......+.. T Consensus 93 -~~~~~~~~~~~~~~~~~~~v~t~Hg-~---~~~~~~~~~~~~~~~~~~~~~~~~~ad~vi~~S~~~~~~l~~~~~~~~~ 167 (363) T cd04955 93 -LGPAIAPFLPLLRLKGKKVVVNMDG-L---EWKRAKWGRPAKRYLKFGEKLAVKFADRLIADSPGIKEYLKEKYGRDST 167 (363) T ss_pred -CCHHHHHHHHHHHHCCCEEEEEECC-C---CHHHHHHHHHHHHHHHHHHHHHHHHCCEEEECCHHHHHHHHHHCCCCCE T ss_conf -7816899999997339839999367-4---0113220178999999999999860899999988999999986499839 Q ss_pred EECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCC-CCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHH Q ss_conf 7204310123442212568999986179861750465533-210000000000123412488970552114788877653 Q gi|254780402|r 207 VSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEG-EEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIA 285 (440) Q Consensus 207 v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~-EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~ 285 (440) +..|- -|.............. ....++.++..|...+. --+.+++|+..+ .++..|+|+...+++.+-..++.+. T Consensus 168 vIpnG-vd~~~~~~~~~~~~~~-~~~~~~~il~vgRl~~~Kg~~~ll~A~~~l--~~~~~l~iiG~g~~~~~~~~~l~~~ 243 (363) T cd04955 168 YIPYG-ADHVVSSEEDEILKKY-GLEPGRYYLLVGRIVPENNIDDLIEAFSKS--NSGKKLVIVGNADHNTPYGKLLKEK 243 (363) T ss_pred EECCC-CCCCCCCCCHHHHHHC-CCCCCCEEEEEECCCCCCCHHHHHHHHHHH--CCCCEEEEECCCCCCCHHHHHHHHH T ss_conf 97898-7546777506679870-899898899994475304799999999852--6356199977777630899999997 Q ss_pred HCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHH Q ss_conf 00101100134444466416750676--5202554220820661355444545837689619949998984437999999 Q gi|254780402|r 286 KGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRR 363 (440) Q Consensus 286 ~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~ 363 (440) .+. +..|.+...+ -++..+|+.||+.++=.|..+.-|-.++|||++|+|||.. +.....|+ T Consensus 244 ~~~-------------~~~V~flG~~~~~~~~~~~~~ad~~v~pS~~~Eg~~~~~lEAma~G~PVVas-~~~~~~ev--- 306 (363) T cd04955 244 AAA-------------DPRIIFVGPIYDQELLELLRYAALFYLHGHSVGGTNPSLLEAMAYGCPVLAS-DNPFNREV--- 306 (363) T ss_pred HCC-------------CCCEEEECCCCCHHHHHHHHCCCEEECCCCCCCCCCHHHHHHHHHCCCEEEC-CCCCCCEE--- T ss_conf 346-------------9937970788847789863135446434566678768999999819999991-79987069--- Q ss_pred HHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 997898699579899999999995898999999999999997045569999999 Q gi|254780402|r 364 MVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRS 417 (440) Q Consensus 364 L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~ 417 (440) .+.+ +..+.++++|++++.++++|++.+++||++|++.+.++-.- +++.+. T Consensus 307 -~~~~-~~~~~~~~~la~~i~~ll~d~~~~~~~g~~ar~~v~~~fsw-~~~a~~ 357 (363) T cd04955 307 -LGDK-AIYFKVGDDLASLLEELEADPEEVSAMAKAARERIREKYTW-EKIADQ 357 (363) T ss_pred -ECCC-EEECCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCH-HHHHHH T ss_conf -7588-47779989999999999759999999999999999985899-999999 No 36 >cd03792 GT1_Trehalose_phosphorylase Trehalose phosphorylase (TP) reversibly catalyzes trehalose synthesis and degradation from alpha-glucose-1-phosphate (alpha-Glc-1-P) and glucose. The catalyzing activity includes the phosphorolysis of trehalose, which produce alpha-Glc-1-P and glucose, and the subsequent synthesis of trehalose. This family is most closely related to the GT1 family of glycosyltransferases. Probab=99.65 E-value=3e-13 Score=106.66 Aligned_cols=322 Identities=16% Similarity=0.141 Sum_probs=184.5 Q ss_pred HHHHHHHHHHHHHHHCCCCE--EEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHH------------HHHCCEEE Q ss_conf 99999999999998619989--99962776300121014774168850557201344557------------44202035 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNV--LLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFL------------KYWKPDCM 132 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~i--liT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl------------~~~~P~~~ 132 (440) |-.+....|++-+++.+.++ .|-...+ ++-...+.+++... =.|..+....+..+ ..-..|++ T Consensus 13 GVa~~l~~Lv~~~~~lGv~~~w~V~~~~~-~ff~~tk~~hn~Lq--g~~~~ls~~~~~~y~~~~~~na~~~~~~~~~DvV 89 (372) T cd03792 13 GVAEILHSLVPLMRDLGVDTRWEVIKGDP-EFFNVTKKFHNALQ--GADIELSEEEKEIYLEWNEENAERPLLDLDADVV 89 (372) T ss_pred CHHHHHHHHHHHHHHCCCCEEEEEECCCC-HHHHHHHHCCHHHC--CCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCEE T ss_conf 69999999999999669816999945983-57898875006541--9997679889999999999987313102799989 Q ss_pred EEECCCCHHHHHHH-HHHCCCCEEEEEE-CCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC-CCEEEEC Q ss_conf 64047621467888-7422455058641-2665421210011112331026442366415778889986088-5058720 Q gi|254780402|r 133 ILSESDIWPLTVFE-LSKQRIPQVLVNA-RMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA-QKLIVSG 209 (440) Q Consensus 133 i~~e~ElWPnli~~-~~~~~ip~~lina-rls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~-~~i~v~G 209 (440) ++ .+-||-.+.. .++.+.|++.=.= .+|....+.|. +.+..++..|.+....++ |..-+. .+..++. T Consensus 90 ~i--Hdpqp~~l~~~~~~~~~~~I~r~Hid~~~~~~~~w~----fl~~~i~~~d~~V~~~~~----~~~~~~~~~~~~ip 159 (372) T cd03792 90 VI--HDPQPLALPLFKKKRGRPWIWRCHIDLSSPNRRVWD----FLQPYIEDYDAAVFHLPE----YVPPQVPPRKVIIP 159 (372) T ss_pred EE--CCCCHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHH----HHHHHHHHCCEEEEECHH----HHCCCCCCCEEEEC T ss_conf 98--793667899986368995899968866885388999----999999857999997357----50436887647816 Q ss_pred -CCC-CCCCCCCCCHHHHHHHHHHC---CCCCEEECCCCCCC--CCCCCCCCCCCCCC-CCCEEEEEEEC----CCCHHH Q ss_conf -431-01234422125689999861---79861750465533--21000000000012-34124889705----521147 Q gi|254780402|r 210 -NLK-IDTESLPCDKELLSLYQESI---AGRYTWAAISTFEG--EEDKAVYVHNFIKC-RTDVLTIIVPR----HPRRCD 277 (440) Q Consensus 210 -nlK-~d~~~~~~~~~~~~~~~~~~---~~r~v~vagSth~~--EE~~il~a~~~l~~-~~~~~lIIvPR----hpeR~~ 277 (440) .+. .+....+....+.....+.. .++|+++.-|.... .-+.+++|++.+++ .|+.+|+|+.. .||... T Consensus 160 ~~IDpl~~kn~~l~~~~~~~~~~~~gi~~d~piIl~VgRl~~~Kg~~~li~A~~~~~~~~~d~~LvivG~g~~ddpe~~~ 239 (372) T cd03792 160 PSIDPLSGKNRELSPADIEYILEKYGIDPERPYITQVSRFDPWKDPFGVIDAYRKVKERVPDPQLVLVGSGATDDPEGWI 239 (372) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHHCCCCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHH T ss_conf 71066774345589899999999829898995899987256546869999999999976899789998998777814789 Q ss_pred HHHHHHHHHCCHHHHHCCCCCCCCCCEEEEEC----CCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCC Q ss_conf 88877653001011001344444664167506----76520255422082066135544454583768961994999898 Q gi|254780402|r 278 AIERRLIAKGLKVARRSRGDVINAEVDIFLGD----TIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPN 353 (440) Q Consensus 278 ~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~D----t~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~ 353 (440) -..+..+..+. +.+|.+.. .--++..+|+.|| +||--|..+.-|-.++||+++|+|||.++ T Consensus 240 ~~~~~~~~~~~-------------~~~i~~~~~~~~~~~~~~~l~~~ad-v~v~~S~~Egfgl~~lEAm~~G~PVVas~- 304 (372) T cd03792 240 VYEEVLEYAEG-------------DPDIHVLTLPPVSDLEVNALQRAST-VVLQKSIREGFGLTVTEALWKGKPVIAGP- 304 (372) T ss_pred HHHHHHHHHCC-------------CCCEEEEECCCCCHHHHHHHHHCCC-EEEECCCCCCCCHHHHHHHHCCCCEEECC- T ss_conf 99999997188-------------9966999368886789999995397-99957642344469999998699899837- Q ss_pred CCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 443799999999789869957989999999999589899999999999999704556999999999 Q gi|254780402|r 354 VENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLD 419 (440) Q Consensus 354 ~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~ 419 (440) ....++.+ .+...++.|.+.++++.++..+++||+.+++||++|++.+.++-......-+.+. T Consensus 305 vgGi~e~v---~dg~~G~Lv~~~d~~A~~i~~ll~d~~l~~~mg~~ar~~v~~~f~~~~~~~~yl~ 367 (372) T cd03792 305 VGGIPLQI---EDGETGFLVDTVEEAAVRILYLLRDPELRRKMGANAREHVRENFLITRHLKDYLY 367 (372) T ss_pred CCCCHHHC---CCCCCEEECCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 99837760---4898579889869999999999749999999999999999987899999999999 No 37 >cd04949 GT1_gtfA_like This family is most closely related to the GT1 family of glycosyltransferases and is named after gtfA in Streptococcus gordonii, where it plays a role in the O-linked glycosylation of GspB, a cell surface glycoprotein involved in platelet binding. In general glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltra Probab=99.63 E-value=1.3e-13 Score=108.99 Aligned_cols=267 Identities=16% Similarity=0.179 Sum_probs=167.1 Q ss_pred HHHHHHHHHC----CEEEEEECCCCHHHHHHHHHHCCCCEEEE-EECCCCCCCCCCHHHHHHHHHC---CCCCCEEEECC Q ss_conf 3445574420----20356404762146788874224550586-4126654212100111123310---26442366415 Q gi|254780402|r 119 AVSRFLKYWK----PDCMILSESDIWPLTVFELSKQRIPQVLV-NARMSRRSFKNWKTVLSFSKKI---FSQFSLVIVQS 190 (440) Q Consensus 119 ~~~~fl~~~~----P~~~i~~e~ElWPnli~~~~~~~ip~~li-narls~~S~~~~~~~~~l~~~~---~~~~~~i~~qs 190 (440) ...-||+... ++++|.=....+..-+..+++..-.++.+ |.-.++..--....+...++.. .+.+|.+++-+ T Consensus 86 L~~~fl~~l~~~~~~~v~i~DR~~~~~~~~~~~~~~~~~~~vlHs~H~~~~~~~~~~~~n~~Y~~~f~~~~~~d~~Iv~T 165 (372) T cd04949 86 LFAFFLDELNKDTKPDVFILDRPTLDGQALLNMKKAAKVVVVLHSNHVSDNNDPVHSLINNFYEYVFENLDKVDGVIVAT 165 (372) T ss_pred HHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHCCCCCEEEEECHHHHCCCCCCCCCCHHHHHHHHHHCHHHCCEEEECC T ss_conf 99999999851679998998786503789985068860899987388578887764302467999971743178899787 Q ss_pred HHHHHHHHH-HCC-CCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCC-CCCCCCCCCCCCC-CCCCCCEEE Q ss_conf 778889986-088-505872043101234422125689999861798617504655-3321000000000-012341248 Q gi|254780402|r 191 ERYFRRYKE-LGA-QKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTF-EGEEDKAVYVHNF-IKCRTDVLT 266 (440) Q Consensus 191 ~~~~~rl~~-lG~-~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth-~~EE~~il~a~~~-l~~~~~~~l 266 (440) +..++-+.+ ++. .+|.++.+.=.+....+.. ......+ .++.+|-.. +.--+.+++|+.. .++.|++.| T Consensus 166 ~~Q~~di~~~f~~~~~i~~IP~~~~~~~~~~~~------~~~r~~~-~ii~vgRL~~eK~~d~LI~A~~~v~~~~P~~~L 238 (372) T cd04949 166 EQQKQDLQKQFGNYNPIYTIPVGSIDPLKLPAQ------FKQRKPH-KIITVARLAPEKQLDQLIKAFAKVVKQVPDATL 238 (372) T ss_pred HHHHHHHHHHHCCCCCEEEECCCCCCHHCCCCC------CCCCCCC-EEEEEECCCCCCCHHHHHHHHHHHHHHCCCCEE T ss_conf 999999999717888589967824203116666------4358989-799996777402859999999999987899299 Q ss_pred EEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCC Q ss_conf 89705521147888776530010110013444446641675067652025542208206613554445458376896199 Q gi|254780402|r 267 IIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGC 346 (440) Q Consensus 267 IIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~ 346 (440) .|+..-|+| +++++++.+.|+.-. |.+..-.-.+..+|+.|+ +||-.|..+..|..++||+++|+ T Consensus 239 ~I~G~G~~~-~~L~~~i~~l~l~~~-------------V~f~G~~~~~~~~y~~a~-~~v~~S~~EGfgl~llEAma~Gl 303 (372) T cd04949 239 DIYGYGDEE-EKLKELIEELGLEDY-------------VFLKGYTRDLDEVYQKAQ-LSLLTSQSEGFGLSLMEALSHGL 303 (372) T ss_pred EEEECCHHH-HHHHHHHHHCCCCCE-------------EEECCCCCCHHHHHHHCC-EEEECCCCCCCCCHHHHHHHCCC T ss_conf 999734778-999999998299987-------------998899889899997579-99980200367658999998599 Q ss_pred CEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC Q ss_conf 49998984437999999997898699579899999999995898999999999999997045 Q gi|254780402|r 347 AILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 347 pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ 408 (440) |||.-.--+.-+|+.+. -.+|.+....|.++|++++..+++||+.+++|+++|++.+++.. T Consensus 304 PvIa~d~~yG~~eiI~~-g~nG~Lv~~~d~~~la~~i~~ll~~~~~~~~~s~~a~~~a~~fs 364 (372) T cd04949 304 PVISYDVNYGPSEIIED-GENGYLVPKGDIEALAEAIIELLNDPKLLQKFSEAAYENAERYS 364 (372) T ss_pred CEEEECCCCCCHHHHCC-CCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC T ss_conf 99980599996888458-98479968999999999999998699999999999999999559 No 38 >PRK09922 UDP-D-galactose:(glucosyl)lipopolysaccharide-1,6-D-galactosyltransferase; Provisional Probab=99.61 E-value=4e-12 Score=99.30 Aligned_cols=319 Identities=13% Similarity=0.033 Sum_probs=177.5 Q ss_pred HHH-HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCE-EEE--------ECCCCCCHHHHHHHHHHCCEEEEEEC Q ss_conf 999-99999999998619989999627763001210147741-688--------50557201344557442020356404 Q gi|254780402|r 67 GET-MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYA-IHQ--------YAPLDIQPAVSRFLKYWKPDCMILSE 136 (440) Q Consensus 67 GE~-~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~-~~~--------y~P~D~~~~~~~fl~~~~P~~~i~~e 136 (440) |=+ ..+..|++.|.+++.++.+++++..+... +....... .+. +--....+..+++++..+|+.++... T Consensus 16 GGaErvl~~La~~L~~~~~~~~v~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ik~~~~Dii~~~~ 94 (361) T PRK09922 16 GGMETVISNVINTFENSKINCEMFFFCRNDKMD-KAWLKKIKYAQSFSNIKLSFLRRAKHVYNFSQWLKETSPDIVICID 94 (361) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEEEEECCCCCC-HHHHHCCCCEEECCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEECC T ss_conf 804999999999998719987999993498541-5576447722433665520245789999999999970999999988 Q ss_pred CCCHHHHHHHH--HHCCCCEEEEEE-CCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC--CCEEEECCC Q ss_conf 76214678887--422455058641-2665421210011112331026442366415778889986088--505872043 Q gi|254780402|r 137 SDIWPLTVFEL--SKQRIPQVLVNA-RMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA--QKLIVSGNL 211 (440) Q Consensus 137 ~ElWPnli~~~--~~~~ip~~lina-rls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~--~~i~v~Gnl 211 (440) .+++++... +..+.|+-++.- ..+ .... ++... ..+...|.+.+.|+..++.+...|. +++.+..|. T Consensus 95 --~~~~~~~~~~~~~~~~~~~ii~~~h~~---~~~~-~~~~~--~~~~~~d~~i~vS~~~~~~~~~~~~~~~ki~vI~N~ 166 (361) T PRK09922 95 --VISCLYANKARKKSGKQFKIFSWPHFS---LDHK-KHAEC--ITLVYADYHLAISSGIKEQMIARGISAQRISVIYNP 166 (361) T ss_pred --HHHHHHHHHHHHHCCCCCEEEEECCCC---HHHH-HHHHH--HHHHHCCEEEEECHHHHHHHHHCCCCHHCEEEEECC T ss_conf --068999999999829995899975565---3426-78998--999858856995789999999709975429999599 Q ss_pred -CCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCC-CCC--CCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHC Q ss_conf -101234422125689999861798617504655-332--1000000000012341248897055211478887765300 Q gi|254780402|r 212 -KIDTESLPCDKELLSLYQESIAGRYTWAAISTF-EGE--EDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKG 287 (440) Q Consensus 212 -K~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth-~~E--E~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~g 287 (440) .......+... ....+.++..|... ++. -..++.++.. ..++..|+|+...+++ +++++++++.| T Consensus 167 i~~~~~~~~~~~--------~~~~~~il~vGRl~~~~qK~~~~li~a~~~--~~~~~~L~IvG~G~~~-~~L~~~i~~l~ 235 (361) T PRK09922 167 VEIKTIIIPPPE--------RDKPAVFLYVGRLKFEGQKNVKELFDGLSR--TTGEWQLHIIGDGSDF-EKCKAYSRELG 235 (361) T ss_pred CCHHHCCCCCHH--------HCCCCEEEEEECCCCCCCCCHHHHHHHHHH--HCCCCEEEEEEEECCH-HHHHHHHHHHC T ss_conf 173540467503--------157877999954445256899999999985--4899489999843889-99999999838 Q ss_pred CHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHC Q ss_conf 10110013444446641675067652025542208206613554445458376896199499989844379999999978 Q gi|254780402|r 288 LKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSS 367 (440) Q Consensus 288 l~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~ 367 (440) +.-...-.|-.. |....+..+|+.|| +||=.|..+.=|-..+||+++|+|||.---...-.|+.+. -.+ T Consensus 236 l~~~V~flG~~~---------np~~~l~~~~~~ad-ifVl~S~~EGfp~vllEAma~G~PvIatd~~~G~~EiI~d-g~n 304 (361) T PRK09922 236 IEQRIIWHGWQS---------NPWEVVQQKIKNVS-ALLLTSKFEGFPMTLLEAMSYGIPCISSDCMSGPRDIIKP-GLN 304 (361) T ss_pred CCCCEEECCCCC---------CHHHHHHHHHHHCC-EEEECCCCCCCCCHHHHHHHHCCCEEEECCCCCCHHHHCC-CCC T ss_conf 987389906759---------87999999985134-9996475568872899999959989997599990887158-983 Q ss_pred CCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCC--HHHHHHHHH Q ss_conf 986995798999999999958989999999999999970455--699999999 Q gi|254780402|r 368 GAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQG--PLKITLRSL 418 (440) Q Consensus 368 g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~g--a~~~~~~~i 418 (440) |......|.++|++++.+|++|+.+...- ++..-+++.-+ .+++..+.+ T Consensus 305 G~Lv~~~d~~~la~~i~~li~~e~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ 355 (361) T PRK09922 305 GELYTPGNIDEFVGKLNKVISGEVKYQHD--IIPGSIEKFYDVLYFKNLNNAL 355 (361) T ss_pred EEEECCCCHHHHHHHHHHHHHCHHHCCHH--HHHHHHHHHCHHHHHHHHHHHH T ss_conf 79977999999999999998482213999--9999998646899999999999 No 39 >cd03818 GT1_ExpC_like This family is most closely related to the GT1 family of glycosyltransferases. ExpC in Rhizobium meliloti has been shown to be involved in the biosynthesis of galactoglucan (exopolysaccharide II). Probab=99.61 E-value=3.6e-13 Score=106.17 Aligned_cols=212 Identities=15% Similarity=0.166 Sum_probs=130.9 Q ss_pred CCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHH---HCCCCCEEECCCC----CCCCCCCCC Q ss_conf 264423664157788899860885058720431012344221256899998---6179861750465----533210000 Q gi|254780402|r 180 FSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQE---SIAGRYTWAAIST----FEGEEDKAV 252 (440) Q Consensus 180 ~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~---~~~~r~v~vagSt----h~~EE~~il 252 (440) +...|.+.+.|+..++++.....+++.|+.|- .|......+.+....... ...+++++++.+- ++| -+.++ T Consensus 154 ~~~ad~~v~~s~~~~~~~~~~~~~~i~VipnG-VD~~~f~P~~~a~~~~~~~~~~~~~~~vvl~vGR~l~~~KG-~~~Ll 231 (396) T cd03818 154 LAQADAGVSPTRWQRSTFPAELRSRISVIHDG-IDTDRLRPDPQARLRLPNGRVLTPGDEVITFVARNLEPYRG-FHVFM 231 (396) T ss_pred HHHCCEEECCCHHHHHHHHHHHCCCEEEEECC-CCCCCCCCCHHHHHHHHHCCCCCCCCEEEEEECCCCCCCCC-HHHHH T ss_conf 98487887188999975267624837998268-77133388804555442114689998699997765130448-99999 Q ss_pred CCC-CCCCCCCCEEEEEEEC-----CCCH--HHHH-HHHHHHHCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHC Q ss_conf 000-0001234124889705-----5211--4788-87765300101100134444466416750676--5202554220 Q gi|254780402|r 253 YVH-NFIKCRTDVLTIIVPR-----HPRR--CDAI-ERRLIAKGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMT 321 (440) Q Consensus 253 ~a~-~~l~~~~~~~lIIvPR-----hpeR--~~~i-~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~A 321 (440) .|+ +++++.|+..|+||.. .+++ .... ++++++.+... + ...|.....+ .+|..+|+.| T Consensus 232 ~A~~~l~~~~p~~~lvivG~~~~~~g~~~~~~~~~~~~ll~~l~~~~---------~-~~rV~F~G~v~~~~l~~~l~~a 301 (396) T cd03818 232 RALPRLLRARPDARVVIVGGDGVSYGAPPPDGESWKQHMLDELGGRL---------D-LSRVHFLGRVPYDQYLALLQVS 301 (396) T ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCC---------C-CCEEEEECCCCHHHHHHHHHEE T ss_conf 99999998789968999926874456667654379999998632236---------7-6368970898589998875100 Q ss_pred CCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 82066135544454583768961994999898443799999999789869957989999999999589899999999999 Q gi|254780402|r 322 EIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAI 401 (440) Q Consensus 322 diafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~ 401 (440) | +||--|....-|-.++|||+||+|||.. ......|++++ -.+|......|+++|++++.++++||+.+++||++|+ T Consensus 302 d-v~v~PS~~~~~~~~llEAMA~G~PVVas-~~gg~~e~V~d-g~~G~Lvpp~d~~~LA~ai~~lL~dp~~r~~lg~aaR 378 (396) T cd03818 302 D-VHVYLTYPFVLSWSLLEAMACGCLVVGS-DTAPVREVITD-GENGLLVDFFDPDALAAAVIELLDDPARRARLRRAAR 378 (396) T ss_pred E-EEEECCEECCCCCHHHHHHHCCCCEEEE-CCCCCHHHHCC-CCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 3-9995314045576089999779989992-79982655259-9878996999999999999999759999999999999 Q ss_pred HHHHH Q ss_conf 99970 Q gi|254780402|r 402 NEVKK 406 (440) Q Consensus 402 ~~i~~ 406 (440) +.+++ T Consensus 379 ~~~~~ 383 (396) T cd03818 379 RTALR 383 (396) T ss_pred HHHHH T ss_conf 99999 No 40 >cd03785 GT1_MurG MurG is an N-acetylglucosaminyltransferase, the last enzyme involved in the intracellular phase of peptidoglycan biosynthesis. It transfers N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the C4 hydroxyl of a lipid-linked N-acetylmuramoyl pentapeptide (NAM). The resulting disaccharide is then transported across the cell membrane, where it is polymerized into NAG-NAM cell-wall repeat structure. MurG belongs to the GT-B structural superfamily of glycoslytransferases, which have characteristic N- and C-terminal domains, each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. Probab=99.58 E-value=7.2e-12 Score=97.61 Aligned_cols=307 Identities=16% Similarity=0.137 Sum_probs=177.1 Q ss_pred EECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHH--HHCCCCCCEEEEECCCCCC-------------------HHH Q ss_conf 8178999999999999998619989999627763001--2101477416885055720-------------------134 Q gi|254780402|r 62 HASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKV--ARKYLGQYAIHQYAPLDIQ-------------------PAV 120 (440) Q Consensus 62 HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~--~~~~~~~~~~~~y~P~D~~-------------------~~~ 120 (440) -+-+-|-+.-+..+.++|++++.+|+..+ +..|.+. .++. ......+|...+ ... T Consensus 6 ~GGTGGHi~Palala~~L~~~g~~V~~i~-~~~g~e~~~~~~~---g~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (350) T cd03785 6 GGGTGGHIFPALALAEELRERGAEVLFLG-TKRGLEARLVPKA---GIPLHTIPVGGLRRKGSLKKLKAPFKLLKGVLQA 81 (350) T ss_pred ECCCHHHHHHHHHHHHHHHHCCCEEEEEE-CCCHHHHCCCCCC---CCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 47858999999999999997879899998-7836864234413---9948997688878887399999999999999999 Q ss_pred HHHHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEE--EECCCCCCCCCCHHHHHHHHHCCCC-CCEEEECCHHHHHHH Q ss_conf 4557442020356404762146788874224550586--4126654212100111123310264-423664157788899 Q gi|254780402|r 121 SRFLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLV--NARMSRRSFKNWKTVLSFSKKIFSQ-FSLVIVQSERYFRRY 197 (440) Q Consensus 121 ~~fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~li--narls~~S~~~~~~~~~l~~~~~~~-~~~i~~qs~~~~~rl 197 (440) .++++.++|+++|.+.+=.-.=.+..++..+||+++- |+.+ ....+++.+ .+.|++..+.+... T Consensus 82 ~~~l~~~kPd~vi~~GGY~s~P~~laA~~~~iP~~ihEqN~v~------------G~anr~l~~~a~~i~~~f~~~~~~- 148 (350) T cd03785 82 RKILKKFKPDVVVGFGGYVSGPVGLAAKLLGIPLVIHEQNAVP------------GLANRLLARFADRVALSFPETAKY- 148 (350) T ss_pred HHHHHHCCCCEEEECCCCHHHHHHHHHHHCCCCCEEECCCCCC------------CHHHHHHCCCCCEEEECCHHHHHC- T ss_conf 9999964999999889810389999999729985565677225------------713233210039899857565412- Q ss_pred HHHCCCCEEEECCC-CCCCCCCCCCHHHHHHHHHHCCCCC-EEECCCCCCCC--CCCCCCCCCCCCCCCCEEEEEEECCC Q ss_conf 86088505872043-1012344221256899998617986-17504655332--10000000000123412488970552 Q gi|254780402|r 198 KELGAQKLIVSGNL-KIDTESLPCDKELLSLYQESIAGRY-TWAAISTFEGE--EDKAVYVHNFIKCRTDVLTIIVPRHP 273 (440) Q Consensus 198 ~~lG~~~i~v~Gnl-K~d~~~~~~~~~~~~~~~~~~~~r~-v~vagSth~~E--E~~il~a~~~l~~~~~~~lIIvPRhp 273 (440) .-..++.++||. |-.... .+. . ......-.+++ +++.|.+.-.. .+.+.++...+.. .++.++..-. + T Consensus 149 --~~~~k~~~vG~PvR~~~~~--~~~-~-~~~~~~~~~~~~iLv~GGSqGa~~ln~~v~~~~~~l~~-~~~~ii~~~G-~ 220 (350) T cd03785 149 --FPKDKAVVTGNPVREEILA--LDR-E-RARLGLRPGKPTLLVFGGSQGARAINEAVPEALAELLR-KRLQVIHQTG-K 220 (350) T ss_pred --CCCCCEEEECCCCCHHHHC--CCH-H-HHHCCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHHH-CCCEEEEECC-C T ss_conf --4667779968852266641--434-4-67527898973999984872047899999999998764-4968999838-4 Q ss_pred CHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCC-CHHHHHHHCCCEEECC Q ss_conf 114788877653001011001344444664167506765202554220820661355444545-8376896199499989 Q gi|254780402|r 274 RRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQ-NPLEAAMLGCAILSGP 352 (440) Q Consensus 274 eR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGh-N~lEpaa~G~pVi~GP 352 (440) +..+++.+..... ..++.+.+-+-+|..+|+.||+++- ++|. +..|.+++|+|.|.=| T Consensus 221 ~~~~~~~~~~~~~---------------~~~~~v~~f~~~m~~~l~~aDlvIs------raGa~Ti~E~~~~g~P~IlIP 279 (350) T cd03785 221 GDLEEVKKAYEEL---------------GVNYEVFPFIDDMAAAYAAADLVIS------RAGASTVAELAALGLPAILIP 279 (350) T ss_pred CHHHHHHHHHHHC---------------CCCEEEECHHHHHHHHHHHCCEEEE------CCCCCHHHHHHHCCCCEEEEE T ss_conf 0089999999866---------------9988992518899999861988997------798425999998199869984 Q ss_pred C---CCC-HHHHHHHHHHCCCEEEEC----CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 8---443-799999999789869957----989999999999589899999999999999704556999999 Q gi|254780402|r 353 N---VEN-FRDIYRRMVSSGAVRIVE----EVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLR 416 (440) Q Consensus 353 ~---~~n-f~e~~~~L~~~g~~~~v~----d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~ 416 (440) + .+| ...-++.+.+.|++..++ +++.|.+.+.++++||+.+++|+++++++.. .+|++++.+ T Consensus 280 ~p~a~d~hQ~~NA~~l~~~g~a~~i~e~~~~~~~L~~~i~~ll~d~~~l~~m~~~~~~~~~--~~a~~~i~~ 349 (350) T cd03785 280 LPYAADDHQTANARALVKAGAAVLIPQEELTPERLAAALLELLSDPERLKAMAEAARSLAR--PDAAERIAD 349 (350) T ss_pred CCCCCCCHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC--CCHHHHHHC T ss_conf 5877766599999999988999995002499999999999998799999999999874589--799999845 No 41 >cd03813 GT1_like_3 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog Probab=99.57 E-value=1.7e-13 Score=108.24 Aligned_cols=238 Identities=17% Similarity=0.184 Sum_probs=164.0 Q ss_pred HHHCCCCEEEE--------------EECCCCCCCCC-C-HHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC--CCEEEE Q ss_conf 74224550586--------------41266542121-0-011112331026442366415778889986088--505872 Q gi|254780402|r 147 LSKQRIPQVLV--------------NARMSRRSFKN-W-KTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA--QKLIVS 208 (440) Q Consensus 147 ~~~~~ip~~li--------------narls~~S~~~-~-~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~--~~i~v~ 208 (440) ..+.|+|.++- .|.-..-.+|+ | ++|..+.+-.+...|.|.+-.+...+.=+++|+ +|+.|+ T Consensus 193 ~~~~g~P~llTEHGIYtRER~i~l~~a~w~~~~~r~~wi~ff~~l~~~~Y~~Ad~I~~l~~~nr~~qi~~Ga~~~k~~vI 272 (475) T cd03813 193 KARRGTPFLLTEHGIYTRERKIELLQADWEMSYFRRLWIRFFESLGRLAYQAADRITTLYEGNRERQIEDGADPEKIRVI 272 (475) T ss_pred HHHHCCCEEEEECCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCEEECCCHHHHHHHHHHCCCHHHCEEC T ss_conf 88758987985067228999988864754358999999999999999999851700013424489999819899994411 Q ss_pred CC-C---CCCCCCCCCCHHHHHHHHHHCCCCCEEEC-CCCCC-CCCCCCCCCCCCC-CCCCCEEEEEEE---CCCCHHHH Q ss_conf 04-3---10123442212568999986179861750-46553-3210000000000-123412488970---55211478 Q gi|254780402|r 209 GN-L---KIDTESLPCDKELLSLYQESIAGRYTWAA-ISTFE-GEEDKAVYVHNFI-KCRTDVLTIIVP---RHPRRCDA 278 (440) Q Consensus 209 Gn-l---K~d~~~~~~~~~~~~~~~~~~~~r~v~va-gSth~-~EE~~il~a~~~l-~~~~~~~lIIvP---RhpeR~~~ 278 (440) .| + +|........ -+..+++++ |-..| .+.+.+|.|+..+ ++.|+++|.|+. ..|+-.++ T Consensus 273 pNGId~~~F~~~~~~~~----------~~~~~~v~~vgRv~p~Kdi~tlI~A~~~v~~~~p~~rl~I~Gp~d~~~~y~~e 342 (475) T cd03813 273 PNGIDPERFAPARRARP----------EKEPPVVGLIGRVVPIKDIKTFIRAAAIVRKKIPDAEGWVIGPTDEDPEYAEE 342 (475) T ss_pred CCCCCHHHCCCCCCCCC----------CCCCCEEEEEEECCCCCCHHHHHHHHHHHHHHCCCEEEEEECCCCCCHHHHHH T ss_conf 79832988488765666----------78998899997011166999999999999986898399997799888589999 Q ss_pred HHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHH Q ss_conf 88776530010110013444446641675067652025542208206613554445458376896199499989844379 Q gi|254780402|r 279 IERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFR 358 (440) Q Consensus 279 i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~ 358 (440) ..+++++.|+.-.....+. -....+|+.+| +||=.|..+.-+..++||+++|+|||.. .+..++ T Consensus 343 c~~lv~~lgL~~~V~F~G~--------------~dv~~~l~~~D-v~vl~S~~Eg~plvllEAmA~G~PvVaT-dVGg~~ 406 (475) T cd03813 343 CRELVESLGLEDNVKFTGF--------------QNVKEYLPKLD-VLVLTSISEGQPLVILEAMAAGIPVVAT-DVGSCR 406 (475) T ss_pred HHHHHHHHCCCCCEEECCC--------------CCHHHHHHHCC-EEEECCCCCCCCHHHHHHHHCCCCEEEC-CCCCCH T ss_conf 9999998299872798387--------------89899998579-9996573346757999999769988972-699818 Q ss_pred HHHHHH-----HHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCH Q ss_conf 999999-----9789869957989999999999589899999999999999704556 Q gi|254780402|r 359 DIYRRM-----VSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGP 410 (440) Q Consensus 359 e~~~~L-----~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga 410 (440) |+..+- -.+|.+....|+++|++++.+|++||+.|++||++|++.+++.-.- T Consensus 407 e~v~~~~~~~~G~~G~lvp~~d~~~LA~ai~~Ll~d~~~r~~~g~~ar~rv~~~f~~ 463 (475) T cd03813 407 ELIEGADDEALGPAGEVVPPADPEALARAILRLLKDPELRRAMGEAGRKRVERYYTL 463 (475) T ss_pred HHHCCCCCCCCCCCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCH T ss_conf 875386656778854896999999999999999739999999999999999996899 No 42 >TIGR03492 conserved hypothetical protein. This protein family is restricted to the Cyanobacteria, in one or two copies, save for instances in the genus Deinococcus. This protein shows some sequence similarity, especially toward the C-terminus, to lipid-A-disaccharide synthase (TIGR00215 or pfam02684). The function is unknown. Probab=99.57 E-value=8.3e-12 Score=97.20 Aligned_cols=336 Identities=15% Similarity=0.170 Sum_probs=204.3 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEECCCC--CCHHHHCCCCCCEEEEECC---------C----CC--------CHHH Q ss_conf 7899999999999999861998999962776--3001210147741688505---------5----72--------0134 Q gi|254780402|r 64 SSVGETMALIGLIPAIRSRHVNVLLTTMTAT--SAKVARKYLGQYAIHQYAP---------L----DI--------QPAV 120 (440) Q Consensus 64 aSvGE~~~a~pli~~l~~~~~~iliT~~T~t--g~~~~~~~~~~~~~~~y~P---------~----D~--------~~~~ 120 (440) --.||=+.+.-++++|++..|++-|....-- |...-+...+-.....-+| . |. ..-+ T Consensus 5 NGhGEDl~a~~i~~~L~~~~p~~~v~alPLVG~G~ay~~~gi~iig~~~~lpSGGf~~~~~~~l~~Dl~~Gl~~~~~~q~ 84 (396) T TIGR03492 5 NGHGEDLIAARIAKALLQLSPDLNLEALPLVGEGRAYQNLGIPIIGPTKELPSGGFSYQSLRGLLRDLRAGLVGLTLGQW 84 (396) T ss_pred CCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCCCHHHHHCCCCEECCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 76358899999999999629999669851347849999789948746643577462243677899999703699999999 Q ss_pred HHHHHHHC-CEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCC---------------CCCCHHHHHHHHHCC--CC Q ss_conf 45574420-2035640476214678887422455058641266542---------------121001111233102--64 Q gi|254780402|r 121 SRFLKYWK-PDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRS---------------FKNWKTVLSFSKKIF--SQ 182 (440) Q Consensus 121 ~~fl~~~~-P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S---------------~~~~~~~~~l~~~~~--~~ 182 (440) + .+..|. ..-.|++=+|+-|-++. ...|.|.+.+..--|+-. ....-++++ .++++ ++ T Consensus 85 ~-~~~~~~~~~~~ilavGD~~pl~~A--~~sg~p~~~~~~~~S~yy~~~~~~~~~~~~~~~~~g~~~~Pw-e~~lm~~~r 160 (396) T TIGR03492 85 R-ALRKWAKKGDLIVAVGDIVPLLFA--WLSGKPYAFVGTAKSDYYWESGPRRSPSDEYHRLEGSLYLPW-ERWLMRSRR 160 (396) T ss_pred H-HHHHHCCCCCEEEEECCHHHHHHH--HHCCCCCEEEEECCCCEEEECCCCCCCHHHHHCCCEEECCHH-HHHHHCCCC T ss_conf 9-999854458879996671888999--981898169970453236606877530123321551781677-999740966 Q ss_pred CCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCC-CCE-EECCCCCCCC---CCCCCCCCCC Q ss_conf 4236641577888998608850587204310123442212568999986179-861-7504655332---1000000000 Q gi|254780402|r 183 FSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAG-RYT-WAAISTFEGE---EDKAVYVHNF 257 (440) Q Consensus 183 ~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~-r~v-~vagSth~~E---E~~il~a~~~ 257 (440) +-.|++.|+.+++.|++-|++ +...||.=.|...++.... ...+ +.| ++-||--++= -..++++... T Consensus 161 c~~Vf~RD~lTA~~L~~~gi~-a~f~GnPmMD~l~~~~~~~-------~~~~~~~I~LLPGSR~pEa~~nl~~~L~a~~~ 232 (396) T TIGR03492 161 CLAVFVRDRLTARDLRRQGVR-ASYLGNPMMDGLEPPERKP-------LLTGRFRIALLPGSRPPEAYRNLKLLLRALEA 232 (396) T ss_pred CCEEECCCHHHHHHHHHCCCC-EEEECCCHHCCCCCCCCCC-------CCCCCCEEEEECCCCHHHHHHHHHHHHHHHHH T ss_conf 529950558879999977996-4960873413788888766-------78878679995898859999879999999996 Q ss_pred CCCCCCEEEEEEECCCC-HHHHHHHHHHHHCCHHHHHCCCCC-CC--CCCEEEEECCCCCCHHHHHHCCCEEEECCCCCC Q ss_conf 01234124889705521-147888776530010110013444-44--664167506765202554220820661355444 Q gi|254780402|r 258 IKCRTDVLTIIVPRHPR-RCDAIERRLIAKGLKVARRSRGDV-IN--AEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCAS 333 (440) Q Consensus 258 l~~~~~~~lIIvPRhpe-R~~~i~~~l~~~gl~~~~~S~~~~-~~--~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~ 333 (440) +.+... ...++|.-|. ..+.+.+.+...|+.....+.... .. .+..|.+. -|....+...||+++.- T Consensus 233 l~~~~~-~~f~~alap~l~~~~l~~~l~~~Gw~~~~~~~~~~~~~~~~~~~v~~~--~~~f~~~l~~adl~ia~------ 303 (396) T TIGR03492 233 LPDSQP-FVFLAAIVPSLSLEKLQAILEDLGWQLEGSSEDQTSLFQKGTLEVLLG--RGAFAEILHWADLGIAM------ 303 (396) T ss_pred HCCCCC-EEEEEEECCCCCHHHHHHHHHHCCCEECCCCCCCCHHEECCCEEEEEE--CCCHHHHHHHCHHHHHC------ T ss_conf 341488-699998689999899999999659700578654200010487689997--38489999855114443------ Q ss_pred CCCCHHHHHHHCCCEEE----CC-CCCCHHHHHHHHHHCCCEEEEC-CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHC Q ss_conf 54583768961994999----89-8443799999999789869957-989999999999589899999999999999704 Q gi|254780402|r 334 GGQNPLEAAMLGCAILS----GP-NVENFRDIYRRMVSSGAVRIVE-EVGTLADMVYSLLSEPTIRYEMINAAINEVKKM 407 (440) Q Consensus 334 GGhN~lEpaa~G~pVi~----GP-~~~nf~e~~~~L~~~g~~~~v~-d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~ 407 (440) -|--..|++.+|+|||. || .++.|.+.-..|. .+.+..++ +++++++.+..++.|++.++++.+++++-. -. T Consensus 304 AGTAteQ~vgLG~Pvv~l~g~GPQfT~~fA~~Q~RLL-G~sv~~~~~~p~~ia~~~~~lL~d~~~~~~~~~~gr~Rl-G~ 381 (396) T TIGR03492 304 AGTATEQAVGLGKPVIQLPGKGPQFTYGFAEAQSRLL-GGSVFLASKNPEQAAQVVRQLLADPELLERCRRNGQERM-GP 381 (396) T ss_pred CCCHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHC-CCEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHC-CC T ss_conf 7709999987189879972787277799999999862-753352689999999999998549999999999999855-88 Q ss_pred CCHHHHHHHHHHHHH Q ss_conf 556999999999752 Q gi|254780402|r 408 QGPLKITLRSLDSYV 422 (440) Q Consensus 408 ~ga~~~~~~~i~~~l 422 (440) .|+++|+.+.|.++| T Consensus 382 ~Gas~RiA~~il~~L 396 (396) T TIGR03492 382 PGASARIAESILKQL 396 (396) T ss_pred CCHHHHHHHHHHHHC T ss_conf 658899999999519 No 43 >COG0707 MurG UDP-N-acetylglucosamine:LPS N-acetylglucosamine transferase [Cell envelope biogenesis, outer membrane] Probab=99.55 E-value=2.8e-11 Score=93.73 Aligned_cols=320 Identities=15% Similarity=0.133 Sum_probs=187.6 Q ss_pred EEEEEECCHHHHHHHHHHHHHHHHCCCC-EEEEECCCCCCHHHHCCCCCCEEE---------EECCCCCC---------- Q ss_conf 7999817899999999999999861998-999962776300121014774168---------85055720---------- Q gi|254780402|r 58 LIWFHASSVGETMALIGLIPAIRSRHVN-VLLTTMTATSAKVARKYLGQYAIH---------QYAPLDIQ---------- 117 (440) Q Consensus 58 ~IW~HaaSvGE~~~a~pli~~l~~~~~~-iliT~~T~tg~~~~~~~~~~~~~~---------~y~P~D~~---------- 117 (440) .+-.=+-+-|-+-.+..++++|.+++.+ +.+ ..|..+.+....... .... .+.+++.. T Consensus 3 ivl~~gGTGGHv~pAlAl~~~l~~~g~~~v~~-~~~~~~~e~~l~~~~-~~~~~~I~~~~~~~~~~~~~~~~~~~~~~~~ 80 (357) T COG0707 3 IVLTAGGTGGHVFPALALAEELAKRGWEQVIV-LGTGDGLEAFLVKQY-GIEFELIPSGGLRRKGSLKLLKAPFKLLKGV 80 (357) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHCCCCEEEE-ECCCCHHHHHCCCCC-CEEEEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99996677665779999999999609717999-446634443205456-7079998646556565066788699999999 Q ss_pred HHHHHHHHHHCCEEEEEECCC-CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHH Q ss_conf 134455744202035640476-2146788874224550586412665421210011112331026442366415778889 Q gi|254780402|r 118 PAVSRFLKYWKPDCMILSESD-IWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRR 196 (440) Q Consensus 118 ~~~~~fl~~~~P~~~i~~e~E-lWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~r 196 (440) ..+++.++.++|++++-+.+- -=|-++ .+...+||+++.-...... .-|+++..+.+.++..|. . T Consensus 81 ~~a~~il~~~kPd~vig~Ggyvs~P~~~-Aa~~~~iPv~ihEqn~~~G--~ank~~~~~a~~V~~~f~-------~---- 146 (357) T COG0707 81 LQARKILKKLKPDVVIGTGGYVSGPVGI-AAKLLGIPVIIHEQNAVPG--LANKILSKFAKKVASAFP-------K---- 146 (357) T ss_pred HHHHHHHHHHCCCEEEECCCCCCCHHHH-HHHHCCCCEEEEEECCCCC--HHHHHHHHHEEEEEECCC-------H---- T ss_conf 9999999970998999579863464999-9861699879997346642--656453230125771251-------1---- Q ss_pred HHHHCC--CCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCC--CCCCCCCCCCCCCCCCEEEEEEECC Q ss_conf 986088--505872043101234422125689999861798617504655332--1000000000012341248897055 Q gi|254780402|r 197 YKELGA--QKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGE--EDKAVYVHNFIKCRTDVLTIIVPRH 272 (440) Q Consensus 197 l~~lG~--~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~E--E~~il~a~~~l~~~~~~~lIIvPRh 272 (440) ..-+. +++.++|+.-...... ............+...+++.|.++-.. .+.+.++...+.+ +..++..-.+ T Consensus 147 -~~~~~~~~~~~~tG~Pvr~~~~~--~~~~~~~~~~~~~~~~ilV~GGS~Ga~~ln~~v~~~~~~l~~--~~~v~~~~G~ 221 (357) T COG0707 147 -LEAGVKPENVVVTGIPVRPEFEE--LPAAEVRKDGRLDKKTILVTGGSQGAKALNDLVPEALAKLAN--RIQVIHQTGK 221 (357) T ss_pred -HHCCCCCCCEEEECCCCCHHHHC--CCHHHHHHHCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHCC--CEEEEEECCC T ss_conf -21157866437857846366521--635544320378984899988824279999999999872121--6699997697 Q ss_pred CCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECC Q ss_conf 21147888776530010110013444446641675067652025542208206613554445458376896199499989 Q gi|254780402|r 273 PRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGP 352 (440) Q Consensus 273 peR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP 352 (440) -. .+++.+...+.+. +-+..=+-.|..+|+.||+++. ..|+-.+-|.+++|+|+|+=| T Consensus 222 ~~-~~~~~~~~~~~~~----------------~~v~~f~~dm~~~~~~ADLvIs-----RaGa~Ti~E~~a~g~P~IliP 279 (357) T COG0707 222 ND-LEELKSAYNELGV----------------VRVLPFIDDMAALLAAADLVIS-----RAGALTIAELLALGVPAILVP 279 (357) T ss_pred CH-HHHHHHHHHCCCC----------------EEEEEHHHHHHHHHHHCCEEEE-----CCCHHHHHHHHHHCCCEEEEC T ss_conf 36-9999998720681----------------8997667539999986458986-----786649999999589889965 Q ss_pred CCCC---HHH-HHHHHHHCCCEEEECCHH----HHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 8443---799-999999789869957989----999999999589899999999999999704556999999999752 Q gi|254780402|r 353 NVEN---FRD-IYRRMVSSGAVRIVEEVG----TLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYV 422 (440) Q Consensus 353 ~~~n---f~e-~~~~L~~~g~~~~v~d~~----eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l 422 (440) +-.- .++ -++.|.+.|++..+.+++ ++.+.+.+++++|+..++|..++++... .++.+++.+.+.... T Consensus 280 ~p~~~~~~Q~~NA~~l~~~gaa~~i~~~~lt~~~l~~~i~~l~~~~~~l~~m~~~a~~~~~--p~aa~~i~~~~~~~~ 355 (357) T COG0707 280 YPPGADGHQEYNAKFLEKAGAALVIRQSELTPEKLAELILRLLSNPEKLKAMAENAKKLGK--PDAAERIADLLLALA 355 (357) T ss_pred CCCCCCCHHHHHHHHHHHCCCEEEECCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHCC--CCHHHHHHHHHHHHH T ss_conf 8987644189999999967976994255479999999999996598999999999987179--878999999999985 No 44 >PRK00025 lpxB lipid-A-disaccharide synthase; Reviewed Probab=99.50 E-value=1.2e-11 Score=96.23 Aligned_cols=332 Identities=18% Similarity=0.172 Sum_probs=199.4 Q ss_pred EEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCC---CCC----E-------EEEECCCCCCHHHHHHH Q ss_conf 999817899999999999999861998999962776300121014---774----1-------68850557201344557 Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYL---GQY----A-------IHQYAPLDIQPAVSRFL 124 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~---~~~----~-------~~~y~P~D~~~~~~~fl 124 (440) ||+=|...-==..+..|+++|++++|++-+. --.|-.+.+.-. -+. + .+...=+.....+.+.+ T Consensus 4 ifi~aGE~SGD~~ga~li~~Lk~~~~~~~~~--GiGG~~M~~~G~~~l~d~~~l~vmG~~evl~~~~~~~~~~~~~~~~i 81 (382) T PRK00025 4 IAIVAGEVSGDLLGAGLIRALKAQAPNLEFV--GIGGPRMEAAGFESLFDMEELAVMGLVEVLPRLPRLLKIRKELKRLL 81 (382) T ss_pred EEEEECCCCHHHHHHHHHHHHHHHCCCCEEE--EECCHHHHHCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999068418899999999998319896799--98829999769954477578313019999977999999999999999 Q ss_pred HHHCCEEEEEECCCCHHHH----HHHHHHC--CCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHH Q ss_conf 4420203564047621467----8887422--455058641266542121001111233102644236641577888998 Q gi|254780402|r 125 KYWKPDCMILSESDIWPLT----VFELSKQ--RIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYK 198 (440) Q Consensus 125 ~~~~P~~~i~~e~ElWPnl----i~~~~~~--~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~ 198 (440) ...+||++|++++ |.+ -+.+++. +||++-.-+ -.=|.|-++-.+.+-+.+|++++.=+...+.|+ T Consensus 82 ~~~~Pd~vi~ID~---pgFnlrlak~lkk~~~~ipvi~yv~------PqvWAWr~~R~k~~~~~~D~ll~ifPFE~~~f~ 152 (382) T PRK00025 82 LAEPPDVFILIDA---PDFNLRLAKKLKKALPGIPIIHYVS------PSVWAWRPGRAKKIAKATDHVLAILPFEAAFYD 152 (382) T ss_pred HHCCCCEEEEECC---CHHHHHHHHHHHHHCCCCCEEEEEC------CHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 8649999999778---3065999999997169998899947------156540641899999999876108765689998 Q ss_pred HHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEE-ECCCCCCCCCCCC----CCCCCCCC-CCCCEEEEEEECC Q ss_conf 6088505872043101234422125689999861798617-5046553321000----00000001-2341248897055 Q gi|254780402|r 199 ELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTW-AAISTFEGEEDKA----VYVHNFIK-CRTDVLTIIVPRH 272 (440) Q Consensus 199 ~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~-vagSth~~EE~~i----l~a~~~l~-~~~~~~lIIvPRh 272 (440) +.|.+ +..+|+.-.|......+..........-.+++++ +.-.++..|.... +++.+.++ +.|++..++ |-- T Consensus 153 ~~g~~-~~fVGHPl~d~~~~~~~~~~~~~~~~~~~~~~~i~lLPGSR~~EI~~~lPi~l~a~~~l~~~~p~~~fvi-p~~ 230 (382) T PRK00025 153 KLGVP-VTYVGHPLADEIPLEPDRAAARAELGLDPDARVLALLPGSRGSEIKRLLPTFLEAAQLLQQRYPDLRFVV-PLA 230 (382) T ss_pred HCCCC-CEECCCCHHHHCCCCCCHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEE-ECC T ss_conf 65998-1356981564322356879999873998556617870588589999978999999999998789939999-558 Q ss_pred CCHH-HHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEEC Q ss_conf 2114-788877653001011001344444664167506765202554220820661355444545837689619949998 Q gi|254780402|r 273 PRRC-DAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSG 351 (440) Q Consensus 273 peR~-~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~G 351 (440) ++.. +.+.+.++..+. +..+.+. -|+-..+++.||+|++.- |-..||.|.+|+|.++. T Consensus 231 ~~~~~~~i~~~~~~~~~-------------~~~i~~~--~~~~~~~l~~sd~ai~~S------GTaTLE~al~~~P~Vv~ 289 (382) T PRK00025 231 NPKRREQFEQLLAEYAP-------------DLSVTLL--DGQAREAMAAADAALAAS------GTVTLEAALLGVPMVVG 289 (382) T ss_pred CHHHHHHHHHHHHHCCC-------------CCCEEEE--CCCCHHHHHHCCEEEECC------CHHHHHHHHHCCCEEEE T ss_conf 87789999999984799-------------9838982--684177887388876537------77999999971985899 Q ss_pred CCCCCHHH-HHHHHHHCCCEEEEC----------------CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH-CCCHHHH Q ss_conf 98443799-999999789869957----------------98999999999958989999999999999970-4556999 Q gi|254780402|r 352 PNVENFRD-IYRRMVSSGAVRIVE----------------EVGTLADMVYSLLSEPTIRYEMINAAINEVKK-MQGPLKI 413 (440) Q Consensus 352 P~~~nf~e-~~~~L~~~g~~~~v~----------------d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~-~~ga~~~ 413 (440) =.+.-+.. +++.+....-.-.++ +++.+++.+.++++|++.++++.+...+.-+. ..||++| T Consensus 290 Yk~~~lt~~i~k~lvkv~~isL~Nii~~k~ivPEllQ~~~~~~~i~~~~~~ll~d~~~~~~~~~~~~~lr~~L~~gas~r 369 (382) T PRK00025 290 YKVKPLTFWIAKRLVKVPYISLPNLLAGEELVPELLQEEATPEKLARALLELLADGARRQALLEGFTELHQYLRCGAAER 369 (382) T ss_pred EECCHHHHHHHHHHHCCCEEEHHHHHCCCCCCHHHHCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 80789999999996569976524875499766134056699999999999996699999999999999999857899999 Q ss_pred HHHHHHHHHHH Q ss_conf 99999975237 Q gi|254780402|r 414 TLRSLDSYVNP 424 (440) Q Consensus 414 ~~~~i~~~l~~ 424 (440) ..+.|.++|++ T Consensus 370 aA~~Il~~l~~ 380 (382) T PRK00025 370 AAQAVLELLKQ 380 (382) T ss_pred HHHHHHHHHHH T ss_conf 99999999973 No 45 >TIGR02149 glgA_Coryne glycogen synthase, Corynebacterium family; InterPro: IPR011875 This entry describes Corynebacterium glutamicum GlgA and closely related proteins in other species. This enzyme is required for glycogen biosynthesis and appears to replace the distantly related IPR011835 from INTERPRO, family of ADP-glucose type glycogen synthase in Corynebacterium glutamicum, Mycobacterium tuberculosis, Bifidobacterium longum, and Streptomyces coelicolor .. Probab=99.48 E-value=8.7e-13 Score=103.64 Aligned_cols=293 Identities=18% Similarity=0.198 Sum_probs=201.5 Q ss_pred EEEECCCCCCHH-----HHHHHHHHCCEEEE--------EECCCCHHHHHH---HHHHCCCCEEEEEECCCCCCCCCCH- Q ss_conf 688505572013-----44557442020356--------404762146788---8742245505864126654212100- Q gi|254780402|r 108 IHQYAPLDIQPA-----VSRFLKYWKPDCMI--------LSESDIWPLTVF---ELSKQRIPQVLVNARMSRRSFKNWK- 170 (440) Q Consensus 108 ~~~y~P~D~~~~-----~~~fl~~~~P~~~i--------~~e~ElWPnli~---~~~~~~ip~~linarls~~S~~~~~- 170 (440) ...|=|+.-..- .+..|+-+.-|+.+ ++=+=.|+-.+. ...-.++|.++--=.| --+|-|| T Consensus 64 v~~Y~~~~~L~~GldPran~aL~tfSvDL~m~~d~~~~~vvHsHTWYa~LAG~LAk~Lyd~PlVvTaHSL--EPLRPWK~ 141 (416) T TIGR02149 64 VLGYRPWSELKEGLDPRANKALKTFSVDLAMANDVEDADVVHSHTWYAALAGHLAKLLYDVPLVVTAHSL--EPLRPWKK 141 (416) T ss_pred EEECCCCHHHCCCCCHHHHHHHCCHHHHHHHHHHHCCCCEEECHHHHHHHHHHHHHHHCCCCEEEEEECC--CCCCCCHH T ss_conf 8740787432256772257540231157887611002715532078887899999996699839973037--88871317 Q ss_pred --------HHHHHHHHCCCCCCEEEECCHHHHHHHHH----HCCCCEEEECC---CCCCCCCCCCCHHHHHHHHHHC-CC Q ss_conf --------11112331026442366415778889986----08850587204---3101234422125689999861-79 Q gi|254780402|r 171 --------TVLSFSKKIFSQFSLVIVQSERYFRRYKE----LGAQKLIVSGN---LKIDTESLPCDKELLSLYQESI-AG 234 (440) Q Consensus 171 --------~~~~l~~~~~~~~~~i~~qs~~~~~rl~~----lG~~~i~v~Gn---lK~d~~~~~~~~~~~~~~~~~~-~~ 234 (440) .-.|.-|.-+..-|.|+|.|+.+++-+.+ |-.++|.|+-| ++-=..+-+.+... ..-+-.+ +. T Consensus 142 EQLGgGY~lSsW~EktA~~aAd~vIAVS~amr~DiL~~YP~lD~~kv~Vv~NGId~~~y~~~~~~~~~~-v~~~~Gid~~ 220 (416) T TIGR02149 142 EQLGGGYKLSSWAEKTAIEAADRVIAVSGAMREDILKVYPDLDPEKVHVVYNGIDTKEYKPAADDDGNK-VLDRYGIDRS 220 (416) T ss_pred HHCCCCCCCCHHHHHHHHHHCCCEEEHHHHCHHHHHCCCCCCCCCCEEEEECCCCHHHHCCCCCCCHHH-HHHHHCCCCC T ss_conf 565897420247888899850406531110335583158688846468886476457606888874113-4663267998 Q ss_pred CCEEEC-C--CCCCCCCCCCCCCCCCCCCCCCEEEEE---EECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEE Q ss_conf 861750-4--655332100000000001234124889---7055211478887765300101100134444466416750 Q gi|254780402|r 235 RYTWAA-I--STFEGEEDKAVYVHNFIKCRTDVLTII---VPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLG 308 (440) Q Consensus 235 r~v~va-g--Sth~~EE~~il~a~~~l~~~~~~~lII---vPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~ 308 (440) ||.+++ | |-++| -..++.|.+.+. ++.++++ +|-=||=.+|++....+..- ....-+.|- T Consensus 221 rP~~lFVGRItRQKG-v~~L~~A~~~~~--~dvqvVLCAgapDTPEv~~Ev~~~~a~l~~-----------~R~gv~WI~ 286 (416) T TIGR02149 221 RPYVLFVGRITRQKG-VPHLLKAVHLIS--KDVQVVLCAGAPDTPEVAEEVRQAVAKLDR-----------EREGVIWIE 286 (416) T ss_pred CCEEEEEEEEECCCC-HHHHHHHHHHCC--CCCEEEEECCCCCCCCHHHHHHHHHHHHHH-----------CCCCEEECC T ss_conf 887898520203165-589999996255--203598706767872068999999998876-----------169838635 Q ss_pred C--CCCCCHHHHHHCCCEEEECCCC-CCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECC---------- Q ss_conf 6--7652025542208206613554-44545837689619949998984437999999997898699579---------- Q gi|254780402|r 309 D--TIGEMGFYLRMTEIAFIGRSFC-ASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEE---------- 375 (440) Q Consensus 309 D--t~GeL~~lY~~AdiafVGGSl~-~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d---------- 375 (440) + +.+|+..+|+.|+ |||+.|.- |-|-=| |||||||.||+- ..+...+|++-+ -.+|.+....+ T Consensus 287 ~ml~~~~~~~L~~~A~-vFvCPSvYEPLGIvN-LEAMAC~tpVVA-S~~GGIpEVV~d-g~TG~LV~~~~lhdGtGtP~d 362 (416) T TIGR02149 287 KMLPKEELVELLSNAE-VFVCPSVYEPLGIVN-LEAMACGTPVVA-SAVGGIPEVVVD-GETGFLVPIDDLHDGTGTPDD 362 (416) T ss_pred CCCCHHHHHHHHHCCC-EEECCCCCCCCCHHH-HHHHHCCCCEEE-CCCCCCCCEEEC-CCCCCCCCCCCCCCCCCCCCC T ss_conf 6588789999984694-786484425420556-878850786344-036895526833-744312470145577888887 Q ss_pred ----HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCC---HHHHHHHHHHHH Q ss_conf ----8999999999958989999999999999970455---699999999975 Q gi|254780402|r 376 ----VGTLADMVYSLLSEPTIRYEMINAAINEVKKMQG---PLKITLRSLDSY 421 (440) Q Consensus 376 ----~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~g---a~~~~~~~i~~~ 421 (440) .++|+++|+.++.||+.+++||++|++-+.+.-+ ..+||++.=++. T Consensus 363 ~d~f~~~LA~ai~~ll~dp~~A~k~G~aGr~R~~~~FSW~~iA~kT~~~Y~~v 415 (416) T TIGR02149 363 DDKFEADLAKAIDALLADPELAKKMGEAGRKRAVEEFSWESIAKKTVELYRKV 415 (416) T ss_pred CCHHHHHHHHHHHHHHCCHHHHHHHCHHHHHHHHHCCCHHHHHHHHHHHHHHC T ss_conf 40568999999999742957898834434655421257578999999998741 No 46 >cd04946 GT1_AmsK_like This family is most closely related to the GT1 family of glycosyltransferases. AmsK is involved in the biosynthesis of amylovoran, which functions as a virulence factor. It functions as a glycosyl transferase which transfers galactose from UDP-galactose to a lipid-linked amylovoran-subunit precursor. The members of this family are found mainly in bacteria and Archaea. Probab=99.47 E-value=4.9e-12 Score=98.74 Aligned_cols=216 Identities=15% Similarity=0.120 Sum_probs=144.0 Q ss_pred HHHHHHCCCCCCEEEECCHHHHHHHHHH-CC--CCEEEE--CCCCCCCCCCCCCHHHHHHHHHHCCCC-CEEECCCCCC- Q ss_conf 1123310264423664157788899860-88--505872--043101234422125689999861798-6175046553- Q gi|254780402|r 173 LSFSKKIFSQFSLVIVQSERYFRRYKEL-GA--QKLIVS--GNLKIDTESLPCDKELLSLYQESIAGR-YTWAAISTFE- 245 (440) Q Consensus 173 ~~l~~~~~~~~~~i~~qs~~~~~rl~~l-G~--~~i~v~--GnlK~d~~~~~~~~~~~~~~~~~~~~r-~v~vagSth~- 245 (440) ..+.+.+++..|.|++.|+.-.+.++++ |. +++.+. |--.......+. ..+. .++..|...+ T Consensus 174 ~~~~~~~~~~~d~v~~vS~~~~~~l~~~~~~~~~ki~v~~~Gv~~~~~~~~~~-----------~~~~~~i~svgrlv~~ 242 (407) T cd04946 174 IPLRRYLLSSLDAVFPCSEQGRNYLQKRYPAYKEKIKVSYLGVSDPGIISKPS-----------KDDTLRIVSCSYLVPV 242 (407) T ss_pred HHHHHHHHHHCCEEEEECHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCC-----------CCCCEEEEEEECCCHH T ss_conf 47899998526879997778899998736997575899968967433467888-----------7896599996177300 Q ss_pred CCCCCCCCCCCCCC-CCCCEEE--EEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHH-- Q ss_conf 32100000000001-2341248--897055211478887765300101100134444466416750676520255422-- Q gi|254780402|r 246 GEEDKAVYVHNFIK-CRTDVLT--IIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRM-- 320 (440) Q Consensus 246 ~EE~~il~a~~~l~-~~~~~~l--IIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~-- 320 (440) .--+.+++|...+. +.|+..+ .|+..-|++ +++++++++.|+.-...-.|..++. |+..||+. T Consensus 243 Kg~~~li~A~~~l~~~~~~~~~~~~iiG~G~~~-~~l~~~~~~l~l~~~v~f~G~~~~~-----------~v~~~~~~~~ 310 (407) T cd04946 243 KRVDLIIKALAALAKARPSIKIKWTHIGGGPLE-DTLKELAESKPENISVNFTGELSNS-----------EVYKLYKENP 310 (407) T ss_pred CCHHHHHHHHHHHHHHCCCCEEEEEEEECCHHH-HHHHHHHHHCCCCCEEEEECCCCHH-----------HHHHHHHHCC T ss_conf 388999999999986589938999998268128-9999999977998879992789969-----------9999998569 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEE--C-CHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 08206613554445458376896199499989844379999999978986995--7-98999999999958989999999 Q gi|254780402|r 321 TEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIV--E-EVGTLADMVYSLLSEPTIRYEMI 397 (440) Q Consensus 321 AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v--~-d~~eL~~~l~~ll~d~~~~~~~~ 397 (440) +| +||-.|..+.-+..++||+++|+|||.- .+...+|++.+ ...+..+ + ++++|++++.++++||+.+++|+ T Consensus 311 ~d-~~~~~s~~eg~p~~~~Eama~g~pvi~t-~~~g~~e~v~~---~~~g~l~~~~~~~~~la~~i~~l~~~~~~~~~~~ 385 (407) T cd04946 311 VD-VFVNLSESEGLPVSIMEAMSFGIPVIAT-NVGGTPEIVDN---GGNGLLLSKDPTPNELVSSLSKFIDNEEEYQTMR 385 (407) T ss_pred CC-EEEECCCCCCCCHHHHHHHHCCCCEEEC-CCCCCHHHEEC---CCCEEEECCCCCHHHHHHHHHHHHHCHHHHHHHH T ss_conf 61-9995663346417999999749999984-79998412306---9827996899999999999999980999999999 Q ss_pred HHHHHHHHHCCCHHHHHHH Q ss_conf 9999999704556999999 Q gi|254780402|r 398 NAAINEVKKMQGPLKITLR 416 (440) Q Consensus 398 ~~a~~~i~~~~ga~~~~~~ 416 (440) .+|++.++++-.+....-+ T Consensus 386 ~~ar~~v~~~f~~~~ny~~ 404 (407) T cd04946 386 EKAREKWEENFNASKNYRE 404 (407) T ss_pred HHHHHHHHHHCCHHHHHHH T ss_conf 9999999986798877999 No 47 >PRK12446 N-acetylglucosaminyl transferase; Reviewed Probab=99.38 E-value=1.3e-09 Score=82.89 Aligned_cols=317 Identities=14% Similarity=0.089 Sum_probs=172.2 Q ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHH--HHCCCCCCEEEEECCCCCC----------------H Q ss_conf 779998178999999999999998619989999627763001--2101477416885055720----------------1 Q gi|254780402|r 57 PLIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKV--ARKYLGQYAIHQYAPLDIQ----------------P 118 (440) Q Consensus 57 ~~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~--~~~~~~~~~~~~y~P~D~~----------------~ 118 (440) +.+-.-+-+-|-+.-|..++++|.+++.+|..- +|..|.+. +.+..-+....---++.-. . T Consensus 3 kIii~~GGTGGHi~Palala~~L~~~~~~v~~i-g~~~g~e~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~l~~~~~ 81 (352) T PRK12446 3 KIVFTGGGSAGHVTPNLAIIPKLIEDNWDISYI-GSHQGIEKTIIEKENIPYYSISSGKLRRYFDLKNIKDPFLVMKGVM 81 (352) T ss_pred EEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEE-ECCCHHHHCCCCCCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 799995875888999999999998489959999-8896054304450499689954477278552999999999999999 Q ss_pred HHHHHHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCC-CEEEECCHHHHHHH Q ss_conf 34455744202035640476214678887422455058641266542121001111233102644-23664157788899 Q gi|254780402|r 119 AVSRFLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQF-SLVIVQSERYFRRY 197 (440) Q Consensus 119 ~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~-~~i~~qs~~~~~rl 197 (440) ...++++.++|++++-+.+=.=-=.+..++..+||+++-.... .+....+++.++ +.|++--+++.+.+ T Consensus 82 ~s~~il~~~kPd~Vig~GGY~S~P~~lAA~ll~iP~~ihEqNa----------v~G~aNr~l~~~a~~i~~~f~~~~~~~ 151 (352) T PRK12446 82 DAYVRIRKLKPDVIFSKGGFVSVPVVIGGWLNRVPVLLHESDM----------TPGLANKIALRFASKIFVTFEEAAKHL 151 (352) T ss_pred HHHHHHHHCCCCEEEECCCHHHHHHHHHHHHCCCCEEEECCCH----------HHHHHHHHHHHHHCCEEECCHHHHHCC T ss_conf 9999999639999997498777999999998599969988746----------777899999987071289962455208 Q ss_pred HHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCC-CEEECCCCCCCC--CCCCCCCCCCCCCCCCEEEEEEECCCC Q ss_conf 86088505872043101234422125689999861798-617504655332--100000000001234124889705521 Q gi|254780402|r 198 KELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGR-YTWAAISTFEGE--EDKAVYVHNFIKCRTDVLTIIVPRHPR 274 (440) Q Consensus 198 ~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r-~v~vagSth~~E--E~~il~a~~~l~~~~~~~lIIvPRhpe 274 (440) ..+++.++||+--... ...+..........-.++ .+++.|.+.-.+ .+.+.++...+.+ +...+-.- ... T Consensus 152 ---~~~k~~~tGnPvR~~i-~~~~~~~~~~~~~~~~~~~~iLV~GGSqGA~~iN~~v~~~l~~l~~--~~~iih~~-g~~ 224 (352) T PRK12446 152 ---PKEKVIYTGSPVREEV-LKGNREKGLAFLGFSRKKPVITIMGGSLGAKKINETVREALPELLL--KYQIVHLC-GKG 224 (352) T ss_pred ---CCCCEEECCCCCCHHH-HCCCHHHHHHHCCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHH--CCEEEEEC-CCC T ss_conf ---8673687488620765-4035566787548887785799973751179999999999999851--97799992-877 Q ss_pred HHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCC-CCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCC Q ss_conf 1478887765300101100134444466416750676-520255422082066135544454583768961994999898 Q gi|254780402|r 275 RCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTI-GEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPN 353 (440) Q Consensus 275 R~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~-GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~ 353 (440) +.++ .+.... .....+-+ .+|..+|+.||+|+. -.|.-++-|-+++|+|.|.=|+ T Consensus 225 ~~~~---~~~~~~----------------~~~~~~~~~~~m~~~~~~aDlvIs-----RAGAsTiaEl~~~g~PsIlIP~ 280 (352) T PRK12446 225 NLDD---SLQNKE----------------GYRQFEYVHGELPDILAMTDFVIS-----RAGSNAIFEFLTLQKPMLLIPL 280 (352) T ss_pred CHHH---HHHCCC----------------CCEECCCHHHHHHHHHHHCCEEEE-----CCCCHHHHHHHHHCCCEEEECC T ss_conf 1568---985013----------------607657245549999984988997-----7870289999982998899628 Q ss_pred ---C-CCHH-HHHHHHHHCCCEEEEC----CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf ---4-4379-9999999789869957----9899999999995898999999999999997045569999999997 Q gi|254780402|r 354 ---V-ENFR-DIYRRMVSSGAVRIVE----EVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDS 420 (440) Q Consensus 354 ---~-~nf~-e~~~~L~~~g~~~~v~----d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~ 420 (440) . +|.+ .-++.|.+.|++..+. +++.|.+.+.+++.|++. |.++++++. +.+|++++++.|+. T Consensus 281 p~~a~~nHQ~~NA~~l~~~gaa~vi~e~~l~~~~L~~~i~~l~~n~~~---~~~~~kk~~--~p~aa~~I~d~i~e 351 (352) T PRK12446 281 SKFASRGDQILNAESFERQGYASVLYEEDVTVNSLIKHVEELSHNNEK---YKTALKKYN--GKEAIQTIIDHISE 351 (352) T ss_pred CCCCCCCHHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCHHH---HHHHHHHCC--CCCHHHHHHHHHHC T ss_conf 987775759999999997798899641469999999999999849999---999998507--95599999999851 No 48 >COG0381 WecB UDP-N-acetylglucosamine 2-epimerase [Cell envelope biogenesis, outer membrane] Probab=99.36 E-value=8.8e-11 Score=90.48 Aligned_cols=330 Identities=19% Similarity=0.224 Sum_probs=199.8 Q ss_pred CHHHHHHHHHHHHHHHHCC-CC-EEEEECCCCCCHHHHCCC-------CCCEEEEECCCCC--------CHHHHHHHHHH Q ss_conf 8999999999999998619-98-999962776300121014-------7741688505572--------01344557442 Q gi|254780402|r 65 SVGETMALIGLIPAIRSRH-VN-VLLTTMTATSAKVARKYL-------GQYAIHQYAPLDI--------QPAVSRFLKYW 127 (440) Q Consensus 65 SvGE~~~a~pli~~l~~~~-~~-iliT~~T~tg~~~~~~~~-------~~~~~~~y~P~D~--------~~~~~~fl~~~ 127 (440) .-=|.--..||++++.+.. .+ +++-|.---.+++....+ |+...-.--|-+. -..+.+++... T Consensus 12 TRPE~iKmapli~~~~~~~~~~~~vi~TGQH~d~em~~~~le~~~i~~pdy~L~i~~~~~tl~~~t~~~i~~~~~vl~~~ 91 (383) T COG0381 12 TRPEAIKMAPLVKALEKDPDFELIVIHTGQHRDYEMLDQVLELFGIRKPDYDLNIMKPGQTLGEITGNIIEGLSKVLEEE 91 (383) T ss_pred CCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHHHHHHCCCCCCCCHHCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 58999987099999985899735999706654277899999982898888313216668888999999999999999862 Q ss_pred CCEEEEEECCCCHHHHH--HHHHHCCCCEEEEEECCCCCCCCCCHHHHH-HHHHCCCCC-CEEEECCHHHHHHHHHHCCC Q ss_conf 02035640476214678--887422455058641266542121001111-233102644-23664157788899860885 Q gi|254780402|r 128 KPDCMILSESDIWPLTV--FELSKQRIPQVLVNARMSRRSFKNWKTVLS-FSKKIFSQF-SLVIVQSERYFRRYKELGAQ 203 (440) Q Consensus 128 ~P~~~i~~e~ElWPnli--~~~~~~~ip~~linarls~~S~~~~~~~~~-l~~~~~~~~-~~i~~qs~~~~~rl~~lG~~ 203 (440) +||++++ -+|-=-.+- ..|..++||+.-+-|-+ +|+.+| +|- .-|.+-+.+ +.-+|.++.++++|.+-|.+ T Consensus 92 kPD~VlV-hGDT~t~lA~alaa~~~~IpV~HvEAGl--Rt~~~~--~PEE~NR~l~~~~S~~hfapte~ar~nLl~EG~~ 166 (383) T COG0381 92 KPDLVLV-HGDTNTTLAGALAAFYLKIPVGHVEAGL--RTGDLY--FPEEINRRLTSHLSDLHFAPTEIARKNLLREGVP 166 (383) T ss_pred CCCEEEE-ECCCCHHHHHHHHHHHHCCCEEEEECCC--CCCCCC--CCHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCC T ss_conf 9998999-1785368899999998689368874254--447877--8379878788776523037719999999976999 Q ss_pred --CEEEECCCCCCCCCCCCCHHH--HHHHHH-HCCCCCEEECCCCCCCCCC-----CCCCCC-CCCCCCCCEEEEEEECC Q ss_conf --058720431012344221256--899998-6179861750465533210-----000000-00012341248897055 Q gi|254780402|r 204 --KLIVSGNLKIDTESLPCDKEL--LSLYQE-SIAGRYTWAAISTFEGEED-----KAVYVH-NFIKCRTDVLTIIVPRH 272 (440) Q Consensus 204 --~i~v~GnlK~d~~~~~~~~~~--~~~~~~-~~~~r~v~vagSth~~EE~-----~il~a~-~~l~~~~~~~lIIvPRh 272 (440) +|.|+||+-.|......+... ...... ......-++..+.|+.|-. -+.++. +++.++++ .-+|-|-| T Consensus 167 ~~~IfvtGnt~iDal~~~~~~~~~~~~~~~~~~~~~~~~~iLvT~HRreN~~~~~~~i~~al~~i~~~~~~-~~viyp~H 245 (383) T COG0381 167 EKRIFVTGNTVIDALLNTRDRVLEDSKILAKGLDDKDKKYILVTAHRRENVGEPLEEICEALREIAEEYPD-VIVIYPVH 245 (383) T ss_pred CCCEEEECCHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCEEEEECCHHHCCCCCHHHHHHHHHHHHHHCCC-CEEEEECC T ss_conf 55168859739999998776410004667766324567389997055540364299999999999986789-56997479 Q ss_pred CCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCE-EEECCCCCCCCCCHHHHHHHCCCEEEC Q ss_conf 2114788877653001011001344444664167506765202554220820-661355444545837689619949998 Q gi|254780402|r 273 PRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIA-FIGRSFCASGGQNPLEAAMLGCAILSG 351 (440) Q Consensus 273 peR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~Adia-fVGGSl~~~GGhN~lEpaa~G~pVi~G 351 (440) |. ..+.+.. ..- ......|.++|.+|-+...+-...-- .++ +.||- .=||..+|+||+.= T Consensus 246 ~~--~~v~e~~-~~~-----------L~~~~~v~li~pl~~~~f~~L~~~a~~ilt----DSGgi-qEEAp~lg~Pvl~l 306 (383) T COG0381 246 PR--PRVRELV-LKR-----------LKNVERVKLIDPLGYLDFHNLMKNAFLILT----DSGGI-QEEAPSLGKPVLVL 306 (383) T ss_pred CC--HHHHHHH-HHH-----------HCCCCCEEEECCCCHHHHHHHHHHCEEEEE----CCCCH-HHHHHHCCCCEEEE T ss_conf 97--6666889-998-----------389876798688366989999974509995----48713-54477619927761 Q ss_pred CCCCCHHHHHHHHHHCCCEEEEC-CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 98443799999999789869957-989999999999589899999999999999704556999999999752378 Q gi|254780402|r 352 PNVENFRDIYRRMVSSGAVRIVE-EVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYVNPL 425 (440) Q Consensus 352 P~~~nf~e~~~~L~~~g~~~~v~-d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l~~~ 425 (440) -....-+ +..++|....|. +.+.+.+.+..++++++..++|....-.|-. .+ |++|+++.|+.+.+.+ T Consensus 307 R~~TERP----E~v~agt~~lvg~~~~~i~~~~~~ll~~~~~~~~m~~~~npYgd-g~-as~rIv~~l~~~~~~~ 375 (383) T COG0381 307 RDTTERP----EGVEAGTNILVGTDEENILDAATELLEDEEFYERMSNAKNPYGD-GN-ASERIVEILLNYFDSL 375 (383) T ss_pred CCCCCCC----CCEECCCEEEECCCHHHHHHHHHHHHHCHHHHHHHHCCCCCCCC-CC-HHHHHHHHHHHHHHHC T ss_conf 3677784----10003704871765899999999986295889987425588867-50-5799999999885203 No 49 >pfam04007 DUF354 Protein of unknown function (DUF354). Members of this family are around 350 amino acids in length. They are found in archaebacteria and have no known function. Probab=99.32 E-value=6.1e-09 Score=78.39 Aligned_cols=312 Identities=17% Similarity=0.169 Sum_probs=185.3 Q ss_pred EEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCC-CCEEEEECCCCCC----------HHHHHHHHH Q ss_conf 79998178999999999999998619989999627763001210147-7416885055720----------134455744 Q gi|254780402|r 58 LIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLG-QYAIHQYAPLDIQ----------PAVSRFLKY 126 (440) Q Consensus 58 ~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~-~~~~~~y~P~D~~----------~~~~~fl~~ 126 (440) .|||--..-.-+.--+++|++|.+++.+|+||+---......-+.++ +.....--.-+.. .-.-++.+. T Consensus 2 kIwiDI~~p~hvhfFk~iI~eL~k~GheV~iTaR~~~~~~~LL~~y~i~~~~iG~~g~s~~~Kl~~~~~R~~~L~~~~~~ 81 (335) T pfam04007 2 KVWIDITNAPHVHFFKPIISELEKEGYEVLLTCRKFGELPELLRSLGFQVKSIGKHGATLIKKLLSSAERVYLLTKLIPE 81 (335) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 39997899508888999999998689889999961351999999769976997588888899999999999999999886 Q ss_pred HCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCC-E Q ss_conf 202035640476214678887422455058641266542121001111233102644236641577888998608850-5 Q gi|254780402|r 127 WKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQK-L 205 (440) Q Consensus 127 ~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~-i 205 (440) .+||+++-.-+ |-.-+.+.-.|||.+.++= ++.+.. ..+-.+.-.+.+++...-+...++++|.++ + T Consensus 82 ~~PDv~is~~S---~~a~~va~~LgipsI~f~D--teha~~-------~~~Lt~Pf~~~i~~P~~~~~~~~~~~G~~~~i 149 (335) T pfam04007 82 KKPDVAIMKNS---MELPRVAFGLRIPSIIVLD--NEHALA-------ANKLTFPLADYILVPEIIDDEFLRFFGADNRI 149 (335) T ss_pred HCCCEEEECCC---HHHHHHHHHCCCCEEEEEC--CHHHCC-------CCEEEEECCCEEECCCCCCHHHHHHHCCCCCE T ss_conf 29978994488---0199999882998799947--755412-------33023123868881244677899860877856 Q ss_pred E-EECCCCCCCC--CCCCCHHHHHHHHHHCC-CCCEEE-----CCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHH Q ss_conf 8-7204310123--44221256899998617-986175-----0465533210000000000123412488970552114 Q gi|254780402|r 206 I-VSGNLKIDTE--SLPCDKELLSLYQESIA-GRYTWA-----AISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRC 276 (440) Q Consensus 206 ~-v~GnlK~d~~--~~~~~~~~~~~~~~~~~-~r~v~v-----agSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~ 276 (440) . .-| .+-..- ....+.+....+. +. ++.|++ -||-..|.+.++-...+.+++..+ .+|+.||..+.. T Consensus 150 ~~y~g-~~E~a~l~~F~Pd~~vl~~lg--l~~~~yIvvR~~~~~A~y~~g~~~i~~~ii~~l~~~~~-~iv~~pr~~~q~ 225 (335) T pfam04007 150 RTYPG-IKEIANISDYVPDPEILKKLG--LEFEEYIVMRPEPLASSYVNGHESILPEIIEMLTKEGV-NIIYFPRNKEQR 225 (335) T ss_pred EEECC-CCEEEECCCCCCCHHHHHHCC--CCCCCEEEEEECCCCCEECCCCCCHHHHHHHHHHHCCC-CEEEECCCCHHH T ss_conf 76668-441432166689865787649--98798899961645560011442159999999987598-199975870366 Q ss_pred HHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEE-CCCCC Q ss_conf 78887765300101100134444466416750676520255422082066135544454583768961994999-89844 Q gi|254780402|r 277 DAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILS-GPNVE 355 (440) Q Consensus 277 ~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~-GP~~~ 355 (440) + ..++. ++.+.+.--+.-.+...||+ |||||=+ =--|||.+|+|+|. -|..- T Consensus 226 ~----~~~~~-----------------~v~i~~~~vd~~~Lly~adl-~Ig~GgT-----Ma~EAAlLGtPaIs~~p~~~ 278 (335) T pfam04007 226 E----IFRGF-----------------DVIIPKKPVDTLSLLYYSDL-VIGAGGT-----MNREAALLGTPAVSCYPGKL 278 (335) T ss_pred H----HHCCC-----------------CEECCCCCCCHHHHHHHCCE-EECCCHH-----HHHHHHHHCCCEEEECCCCC T ss_conf 7----75047-----------------70367888777888865468-9727568-----99999982898798438852 Q ss_pred CHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 37999999997898699579899999999995898999999999999997045569999999997523 Q gi|254780402|r 356 NFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYVN 423 (440) Q Consensus 356 nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l~ 423 (440) +.- -+.|++.|..+..+|++++.+.+.+.+.. .....+.+..+-++.+++.++.+++ T Consensus 279 ~~v--d~~l~~~gl~~~~~d~~~i~~~v~~~~~~---------~~~~~~~~~edp~~~i~~~v~~~~~ 335 (335) T pfam04007 279 LAV--DKYLIEKGEMYHSTDPREIVNYVISNLKV---------RKKPRTKEGEDPKAYIIELVNNLID 335 (335) T ss_pred CHH--HHHHHHCCCEEEECCHHHHHHHHHHHHHC---------CCCCCCCCCCCHHHHHHHHHHHHCC T ss_conf 136--79998679879618989999999998604---------4566742335859999999997429 No 50 >pfam02684 LpxB Lipid-A-disaccharide synthetase. This is a family of lipid-A-disaccharide synthetases, EC:2.4.2.128. These enzymes catalyse the reaction: UDP-2,3-bis(3-hydroxytetradecanoyl) glucosamine + 2,3-bis(3-hydroxytetradecanoyl)-beta-D-glucosaminyl 1-phosphate <= UDP + 2,3-bis(3-hydroxytetradecanoyl)-D-glucosaminyl-1,6 -beta-D-2,3-bis(3-hydroxytetradecanoyl)-beta-D-glucosaminyl 1-phosphate. These enzymes catalyse the fist disaccharide step in the synthesis of lipid-A-disaccharide. Probab=99.31 E-value=1.3e-10 Score=89.47 Aligned_cols=301 Identities=14% Similarity=0.072 Sum_probs=179.6 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHC----CCCCC--EEEEE------C--CCCCCHHHHHHHHHHCCEEEEEE Q ss_conf 99999999998619989999627763001210----14774--16885------0--55720134455744202035640 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARK----YLGQY--AIHQY------A--PLDIQPAVSRFLKYWKPDCMILS 135 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~----~~~~~--~~~~y------~--P~D~~~~~~~fl~~~~P~~~i~~ 135 (440) ..+..|+++|++++|++-+. --.|-.+.+. .++-. ....+ + =+-......+.+...+||++|++ T Consensus 12 ~~ga~Li~~Lk~~~p~i~~~--GvGG~~M~~~G~~~l~d~~~lsvmG~~evl~~l~~l~~~~~~~~~~i~~~~PD~vIlI 89 (373) T pfam02684 12 ILGGELIQNLKEHYPNLRFI--GVGGPKMEAEGFESLANMEEISVMGFIEVLGRLFKLLKIYQKVVKNILKKKPDTLILI 89 (373) T ss_pred HHHHHHHHHHHHCCCCCEEE--EECCHHHHHCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 99999999998309894899--9880899977995347727841401999998999999999999998742699889997 Q ss_pred CCCC-HHHHHHHHHHCCC--CEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCC Q ss_conf 4762-1467888742245--505864126654212100111123310264423664157788899860885058720431 Q gi|254780402|r 136 ESDI-WPLTVFELSKQRI--PQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLK 212 (440) Q Consensus 136 e~El-WPnli~~~~~~~i--p~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK 212 (440) ++.= =--+-+.++++|+ |++-.-+ -.=|.|-++-.+.+-+.+|++++-=+...+.|++.|++. ...|+.- T Consensus 90 D~pgFNlrlak~lkk~g~~ipvi~yV~------PqvWAWr~~R~k~i~~~~D~ll~IfPFE~~~y~~~gv~~-~fVGHPl 162 (373) T pfam02684 90 DAPDFNLRLAKKLRKLGPKLKIIHYVS------PSVWAWRPKRAKIIAKYTDLLLAILPFEKGFFQKFGLDC-RYVGHPL 162 (373) T ss_pred CCCCHHHHHHHHHHHCCCCCCEEEEEC------CCEEEECHHHHHHHHHHHHHHCCCCCCCHHHHHHHCCCE-EECCCCH T ss_conf 176155999999997199987899968------842212715899999999873128988789998609971-5759811 Q ss_pred CCCCCCCCCHHHHHHHHHHCCCCCEE-ECCCCCCCCCCCCC----CCCCCCC-CCCCEEEEEEECCCCHHHHHHH-HHHH Q ss_conf 01234422125689999861798617-50465533210000----0000001-2341248897055211478887-7653 Q gi|254780402|r 213 IDTESLPCDKELLSLYQESIAGRYTW-AAISTFEGEEDKAV----YVHNFIK-CRTDVLTIIVPRHPRRCDAIER-RLIA 285 (440) Q Consensus 213 ~d~~~~~~~~~~~~~~~~~~~~r~v~-vagSth~~EE~~il----~a~~~l~-~~~~~~lIIvPRhpeR~~~i~~-~l~~ 285 (440) .|......+....... ....+++++ +.-.++++|.+..+ ++.+.++ +.|++..+| |.-++..+++.+ .... T Consensus 163 ~d~~~~~~~~~~~~~~-~~~~~~~~i~lLPGSR~~EI~~~lPi~l~aa~~l~~~~~~~~~~i-p~~~~~~~~~~~~~~~~ 240 (373) T pfam02684 163 LDEIKLYKKRASAKEL-LIDHDEPILALLPGSRKGEIRRLLPPIQVAAQQLSSQFPNLKLLV-PLVNKKYEEQIEDTLAL 240 (373) T ss_pred HHHHCCCCCHHHHHHH-CCCCCCCEEEECCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEE-ECCCHHHHHHHHHHHHH T ss_conf 6540137765899974-689877557767888699999999999999999997689918999-65887899999999986 Q ss_pred HCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCC--------- Q ss_conf 00101100134444466416750676520255422082066135544454583768961994999898443--------- Q gi|254780402|r 286 KGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVEN--------- 356 (440) Q Consensus 286 ~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~n--------- 356 (440) .+ .+.++.+. -|+-...++.||+|++.- |-..||.|.+|+|.+.+=.+.- T Consensus 241 ~~-------------~~~~i~~~--~~~~~~~~~~sd~al~~S------GTaTLE~al~~~P~vV~Yk~~~lty~i~k~l 299 (373) T pfam02684 241 EN-------------CDVSLLEI--SGERYEAMFAADAALAKS------GTATLEAALAGTPMVVAYRVKPLTFFLAKRL 299 (373) T ss_pred CC-------------CCCCEEEC--CCCHHHHHHHCCEEEECC------CHHHHHHHHCCCCEEEEEECCHHHHHHHHHH T ss_conf 49-------------99878980--572499998486501216------7699999981999899995778999999999 Q ss_pred -------------HHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHC Q ss_conf -------------799999999789869957989999999999589899999999999999704 Q gi|254780402|r 357 -------------FRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKM 407 (440) Q Consensus 357 -------------f~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~ 407 (440) -++++.++++..+ +++.++..+..+|+|++.|++|....+++.+.. T Consensus 300 vkv~~isL~Nii~~k~ivPEllQ~~~-----t~~~ia~~~~~lL~d~~~~~~~~~~~~~~~~~l 358 (373) T pfam02684 300 VKIDYISLPNIILNREIVPEFIQGEC-----DAHLEAIALLLLLLNGLKAKKQKDRCRKFKQLL 358 (373) T ss_pred HCCCEEEHHHHHCCCCCCHHHHCCCC-----CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH T ss_conf 83895443488669986735647517-----899999999999679999999999999999998 No 51 >cd03802 GT1_AviGT4_like This family is most closely related to the GT1 family of glycosyltransferases. aviGT4 in Streptomyces viridochromogenes has been shown to be involved in biosynthesis of oligosaccharide antibiotic avilamycin A. Inactivation of aviGT4 resulted in a mutant that accumulated a novel avilamycin derivative lacking the terminal eurekanate residue. Probab=99.25 E-value=1e-08 Score=76.85 Aligned_cols=296 Identities=14% Similarity=0.014 Sum_probs=152.5 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCC--CCHHHHCCCCCC-----EEEEECCCCCCHHHHHHHHHHCCEEEEEECCCC Q ss_conf 9999999999999861998999962776--300121014774-----168850557201344557442020356404762 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTAT--SAKVARKYLGQY-----AIHQYAPLDIQPAVSRFLKYWKPDCMILSESDI 139 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~t--g~~~~~~~~~~~-----~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~El 139 (440) |=-..+..|.+.|.+++.+|.|-++..+ ..... ...+.. .......+......+++++..+||++..- + T Consensus 20 G~e~~~~~La~~L~~~Gh~V~v~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~Dvvh~~-~-- 95 (335) T cd03802 20 GTERVVAALTEGLVARGHEVTLFASGDSKTAAPLV-PVVPEPLRLDAPGRDRAEAEALALAERALAAGDFDIVHNH-S-- 95 (335) T ss_pred CHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCC-CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCEEEEC-C-- T ss_conf 79999999999999769989999628987788500-4567665445442212456899999999975798589989-7-- Q ss_pred HHHHHHHHHHCCCCEEEEE-ECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCCC Q ss_conf 1467888742245505864-126654212100111123310264423664157788899860885058720431012344 Q gi|254780402|r 140 WPLTVFELSKQRIPQVLVN-ARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTESL 218 (440) Q Consensus 140 WPnli~~~~~~~ip~~lin-arls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~ 218 (440) |.-+...+...++|++... ++... +.....+. ...-..+.+.|+..++.. ....++.++.|- .|.... T Consensus 96 ~~~~~~~~~~~~~p~v~t~H~~~~~-------~~~~~~~~-~~~~~~~i~vS~~~~~~~--~~~~~~~vI~ng-vd~~~f 164 (335) T cd03802 96 LHLPLPFARPLPVPVVTTLHGPPDP-------ELLKLYYA-ARPDVPFVSISDAQRRPW--PPLPWVATVHNG-IDLDDY 164 (335) T ss_pred CHHHHHHHHHCCCCEEEEECCCCCH-------HHHHHHHH-HCCCCEEEECCHHHHHHC--CCCCCEEEECCC-CCHHHC T ss_conf 1789999862799789998999706-------77999975-235868999459999545--776777998799-888767 Q ss_pred CCCHHHHHHHHHHCCCCCEEECCCCCCCC-CCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHH-HHCCHHHHHCCC Q ss_conf 22125689999861798617504655332-1000000000012341248897055211478887765-300101100134 Q gi|254780402|r 219 PCDKELLSLYQESIAGRYTWAAISTFEGE-EDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLI-AKGLKVARRSRG 296 (440) Q Consensus 219 ~~~~~~~~~~~~~~~~r~v~vagSth~~E-E~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~-~~gl~~~~~S~~ 296 (440) +.. .-++..++..|..++.- -+.++++.+ .++..|+|+..-++.. +...... ..++ T Consensus 165 ~~~---------~~~~~~~l~vGRl~~~KG~~~li~a~~----~~~~~L~i~G~~~~~~-~~~~~~~~~~~~-------- 222 (335) T cd03802 165 PFR---------GPKGDYLLFLGRISPEKGPHLAIRAAR----RAGIPLKLAGPVSDPD-YFYREIAPELLD-------- 222 (335) T ss_pred CCC---------CCCCCEEEEEEECCCCCCHHHHHHHHH----HCCCEEEEEECCCCHH-HHHHHHHHHCCC-------- T ss_conf 988---------899978999993363347699999987----4598089994767379-999999996178-------- Q ss_pred CCCCCCCEEEEECCC--CCCHHHHHHCCCEEEECCC-CCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEE Q ss_conf 444466416750676--5202554220820661355-4445458376896199499989844379999999978986995 Q gi|254780402|r 297 DVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGRSF-CASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIV 373 (440) Q Consensus 297 ~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGGSl-~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v 373 (440) +.+|.+...+ -++..+|+.||+. +--|. .+.-|-..+||+++|+|||.-. ....+|++. +...++.| T Consensus 223 -----~~~V~f~G~v~~~~~~~~~~~a~~~-v~pS~~~E~fglv~lEAma~G~PVVat~-~gG~~E~v~---~g~~G~lv 292 (335) T cd03802 223 -----GPDIEYLGEVGGAEKAELLGNARAL-LFPILWEEPFGLVMIEAMACGTPVIAFR-RGAVPEVVE---DGVTGFLV 292 (335) T ss_pred -----CCCEEEEECCCHHHHHHHHHHHCCC-CCCCCCCCCCCHHHHHHHHCCCCEEECC-CCCHHHHHC---CCCEEEEC T ss_conf -----9958995046839999999974124-5676532467479999998499899928-998145423---89618981 Q ss_pred CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 7989999999999589899999999999999704556999999 Q gi|254780402|r 374 EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLR 416 (440) Q Consensus 374 ~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~ 416 (440) .|.++|++++.++...+. +++++.++++-.. +++.+ T Consensus 293 ~~~~~la~ai~~~~~~~~------~~~r~~~~~~fs~-~~~~~ 328 (335) T cd03802 293 DSVEELAAAVARADRLDR------AACRRRAERRFSA-ARMVD 328 (335) T ss_pred CCHHHHHHHHHHHHHCCH------HHHHHHHHHHCCH-HHHHH T ss_conf 999999999998752899------9999999986899-99999 No 52 >TIGR03568 NeuC_NnaA UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing. This family of enzymes catalyzes the combined epimerization and UDP-hydrolysis of UDP-N-acetylglucosamine to N-acetylmannosamine. This is in contrast to the related enzyme WecB (TIGR00236) which retains the UDP moiety. NeuC acts in concert with NeuA and NeuB to synthesize CMP-N5-acetyl-neuraminate. Probab=99.23 E-value=5.9e-09 Score=78.49 Aligned_cols=324 Identities=15% Similarity=0.169 Sum_probs=179.2 Q ss_pred EEEEECCHHHHHHHHHHHHHHHHCC-CC-EEEEECCC----CCC---HHHHCCCCC-CEEEEECCCCCCH---------- Q ss_conf 9998178999999999999998619-98-99996277----630---012101477-4168850557201---------- Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAIRSRH-VN-VLLTTMTA----TSA---KVARKYLGQ-YAIHQYAPLDIQP---------- 118 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l~~~~-~~-iliT~~T~----tg~---~~~~~~~~~-~~~~~y~P~D~~~---------- 118 (440) |-+=.-+-.|+.-+.|+++.++++. .+ .++.|..- -|. +...+.+.. .....++.-|.+. T Consensus 3 I~~v~GtRpe~iklapli~~l~~~~~~~~~li~TGqH~~~~~g~t~~~i~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (365) T TIGR03568 3 ICVVTGTRADYGLLRPLLKALQDDPDLELQLIVTGMHLSPEYGNTVNEIEKDGFDIDEKIEILLDSDSNAGMAKSMGLTI 82 (365) T ss_pred EEEEEECCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHCCHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99998507729999999999972899888999907778411070899999757987655765456898533999999999 Q ss_pred -HHHHHHHHHCCEEEEEECCCCHHHHH--HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCC-CEEEECCHHHH Q ss_conf -34455744202035640476214678--887422455058641266542121001111233102644-23664157788 Q gi|254780402|r 119 -AVSRFLKYWKPDCMILSESDIWPLTV--FELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQF-SLVIVQSERYF 194 (440) Q Consensus 119 -~~~~fl~~~~P~~~i~~e~ElWPnli--~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~-~~i~~qs~~~~ 194 (440) ...+++...+||+++.. +|-.=.+- -.|.+++||++-+-|-+ +|+..+ --..|.+..++ +..+|.++..+ T Consensus 83 ~~~~~~l~~~kPD~VlV~-GDt~stla~alaA~~~~Ipv~HveaGl--rs~~~~---dE~~R~~i~~lS~~hf~~t~~a~ 156 (365) T TIGR03568 83 IGFSDAFERLKPDLVVVL-GDRFEMLAAAIAAALLNIPIAHIHGGE--VTEGAI---DESIRHAITKLSHLHFVATEEYR 156 (365) T ss_pred HHHHHHHHHCCCCEEEEE-CCCCHHHHHHHHHHHHCCEEEEEECCC--CCCCCC---CHHHHHHHHHHHHHHHCCCHHHH T ss_conf 999999854399899994-898607799999998198189996786--458988---65888898998777732335788 Q ss_pred HHHHHHCC--CCEEEECCCCCCCCCCCC--CHHHH-HHHHHHCCCCCEEECCCCCC----CCC--CCCCCCCCCCCCCCC Q ss_conf 89986088--505872043101234422--12568-99998617986175046553----321--000000000012341 Q gi|254780402|r 195 RRYKELGA--QKLIVSGNLKIDTESLPC--DKELL-SLYQESIAGRYTWAAISTFE----GEE--DKAVYVHNFIKCRTD 263 (440) Q Consensus 195 ~rl~~lG~--~~i~v~GnlK~d~~~~~~--~~~~~-~~~~~~~~~r~v~vagSth~----~EE--~~il~a~~~l~~~~~ 263 (440) ++|.+.|. ++|.++||+-.|...... +.... ..+....+++.+++ ..|| .++ +.+....+.+.+... T Consensus 157 ~nL~~eG~~~~~I~~vGn~~iD~l~~~~~~~~~~~~~~~~~~~~~~~~Lv--T~Hp~~~~~~~~~~~l~~il~al~~~~~ 234 (365) T TIGR03568 157 QRVIQMGEDPDRVFNVGSPGLDNILSLDLLSKEELEEKLGIDLDKPYALV--TFHPVTLEKESAEEQIKELLKALDELNK 234 (365) T ss_pred HHHHHCCCCCCEEEEECCHHHHHHHHCCCCCHHHHHHHHHHCCCCCEEEE--EECCCCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 99986247867089827718999862221378899987412136876999--9535325665689999999999972088 Q ss_pred EEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCC--HHHHHHCCCEEEECCCCCCCCCCHHHH Q ss_conf 248897055211478887765300101100134444466416750676520--255422082066135544454583768 Q gi|254780402|r 264 VLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEM--GFYLRMTEIAFIGRSFCASGGQNPLEA 341 (440) Q Consensus 264 ~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL--~~lY~~AdiafVGGSl~~~GGhN~lEp 341 (440) -.+++-|+.......+.+.+++... ....|.+++.+|-+ -.+.+.|+ .+||-| .||+= |+ T Consensus 235 ~~~~i~Pn~d~~~~~i~~~i~~~~~------------~~~ni~~i~pl~y~~fl~ll~~a~-~vitdS---sggi~--Ea 296 (365) T TIGR03568 235 NYIFTYPNADAGSRIINEAIEEYVN------------EHPNFRLFKSLGQERYLSLLKNAD-AVIGNS---SSGII--EA 296 (365) T ss_pred CCEEEECCCCHHHHHHHHHHHHHHC------------CCCCEEEECCCCHHHHHHHHHHCC-EEEECC---CCCEE--EE T ss_conf 8179826986027889999999970------------799889966788899999998701-999858---86546--70 Q ss_pred HHHCCCEEE-CCCCCCHHHHHHHHHHCCCEE-EECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 961994999-898443799999999789869-95798999999999958989999999999999970455699999999 Q gi|254780402|r 342 AMLGCAILS-GPNVENFRDIYRRMVSSGAVR-IVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSL 418 (440) Q Consensus 342 aa~G~pVi~-GP~~~nf~e~~~~L~~~g~~~-~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i 418 (440) ..+|+|+|. |.. ++. ....+... .-.|++++.+++..++ +++....+... .+-.- ...|++|+++.| T Consensus 297 ~~l~~P~i~i~~R----q~~---r~~~~nvi~v~~~~~~I~~ai~~~~-~~~~~~~~~~~-~npyG-DG~as~rIv~iL 365 (365) T TIGR03568 297 PSFGVPTINIGTR----QKG---RLRADSVIDVDPDKEEIVKAIEKLL-DPAFKKSLKNV-KNPYG-DGNSSERIIEIL 365 (365) T ss_pred CCCCCCEEEECCC----CCC---CCCCCEEEEECCCHHHHHHHHHHHH-CHHHHHHHHCC-CCCCC-CCHHHHHHHHHC T ss_conf 1049867883788----555---5247608981799999999999974-87898755308-99896-998999999869 No 53 >pfam02350 Epimerase_2 UDP-N-acetylglucosamine 2-epimerase. This family consists of UDP-N-acetylglucosamine 2-epimerases EC:5.1.3.14 this enzyme catalyses the production of UDP-ManNAc from UDP-GlcNAc. Note that some of the enzymes is this family are bifunctional, in this instance Pfam matches only the N-terminal half of the protein suggesting that the additional C-terminal part (when compared to mono-functional members of this family) is responsible for the UPD-N-acetylmannosamine kinase activity of these enzymes. This hypothesis is further supported by the assumption that the C-terminal part of rat bifunctional UDP-N-acetylglucosamine 2-epimerase/N-acetylmannosamine kinase is the kinase domain. Probab=99.14 E-value=1.6e-08 Score=75.56 Aligned_cols=267 Identities=21% Similarity=0.243 Sum_probs=158.7 Q ss_pred HHHHHHHHCCEEEEEECCCCHHHHH--HHHHHCCCCEEEEEECCCCCCCCCCH-HHHHHHHHCCCCC-CEEEECCHHHHH Q ss_conf 4455744202035640476214678--88742245505864126654212100-1111233102644-236641577888 Q gi|254780402|r 120 VSRFLKYWKPDCMILSESDIWPLTV--FELSKQRIPQVLVNARMSRRSFKNWK-TVLSFSKKIFSQF-SLVIVQSERYFR 195 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~e~ElWPnli--~~~~~~~ip~~linarls~~S~~~~~-~~~~l~~~~~~~~-~~i~~qs~~~~~ 195 (440) ..+++...+||++++. +|-+=-+- -.|..++||++-+-|-+ +|+.... ..=-..|.+..++ +..+|.++.+++ T Consensus 59 ~~~~l~~~~PD~vlv~-GDr~e~la~aiaa~~~~ipi~HiegG~--RS~d~t~g~~de~~R~~isklS~~hf~~t~~~~~ 135 (346) T pfam02350 59 LEDVLEEEKPDLVLVL-GDTNETLAGALAAFYLRIPVAHVEAGL--RSFDLTEGMPEEINRHLIDKLSDLHFAPTEEARE 135 (346) T ss_pred HHHHHHHHCCCEEEEE-CCCCHHHHHHHHHHHHCCCEEEEECCC--CCCCCCCCCCHHHHHCCCCCCCCEEECCCHHHHH T ss_conf 9999998299999996-897158899999998198489952687--4455679993054311466541377246499999 Q ss_pred HHHHHCC--CCEEEECCCCCCCCCCCCCHHHHHH-HHH--HCCCCCEEECCCCCCCCC-------CCCCCCCCCCCCCCC Q ss_conf 9986088--5058720431012344221256899-998--617986175046553321-------000000000012341 Q gi|254780402|r 196 RYKELGA--QKLIVSGNLKIDTESLPCDKELLSL-YQE--SIAGRYTWAAISTFEGEE-------DKAVYVHNFIKCRTD 263 (440) Q Consensus 196 rl~~lG~--~~i~v~GnlK~d~~~~~~~~~~~~~-~~~--~~~~r~v~vagSth~~EE-------~~il~a~~~l~~~~~ 263 (440) |+.++|. ++|.++||+-.|......+...... ... ...+++. +....||.|- ..+.++.+.+.+..+ T Consensus 136 ~L~~~G~~~~~If~vG~~~iD~i~~~~~~~~~~~~~~~~~~~~~~~~-iLvt~Hr~en~~~~~~~~~i~~~l~~l~~~~~ 214 (346) T pfam02350 136 NLLQEGEPPERIFVTGNTVIDALLLSREEIEESLEISGILADLGKRY-ILVTFHRRENEDDPERLEQILEALKALAERPD 214 (346) T ss_pred HHHHCCCCCCCEEEECCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCE-EEEEECCCCCCCCHHHHHHHHHHHHHHHHCCC T ss_conf 99981999472899797199999999987310155666403456877-99996775344764489999999999983568 Q ss_pred EEEEEEECC--CCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCC--CCHHHHHHCCCEEEECCCCCCCCCCHH Q ss_conf 248897055--2114788877653001011001344444664167506765--202554220820661355444545837 Q gi|254780402|r 264 VLTIIVPRH--PRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIG--EMGFYLRMTEIAFIGRSFCASGGQNPL 339 (440) Q Consensus 264 ~~lIIvPRh--peR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~G--eL~~lY~~AdiafVGGSl~~~GGhN~l 339 (440) .. +|.|-| |.-...+.+.+... ..+.+++.+| +.-.+.+.|++ +||-| . || .. T Consensus 215 ~~-~i~~~~n~d~~~~~i~~~l~~~----------------~ni~~~~~l~~~~fl~ll~~s~~-vigdS--s-~~--~~ 271 (346) T pfam02350 215 VP-VVFPVHNNPRTRKRINELLEEL----------------PNIRLIEPLGYLDFLSLLKNADL-VITDS--G-GI--QE 271 (346) T ss_pred CC-EEEEECCCCHHHHHHHHHHHCC----------------CCEEEECCCCHHHHHHHHHHCCE-EEECC--C-CH--HH T ss_conf 60-9998379920778999998347----------------98799656899999999985188-98368--6-21--66 Q ss_pred HHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEE-CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 6896199499989844379999999978986995-798999999999958989999999999999970455699999999 Q gi|254780402|r 340 EAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIV-EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSL 418 (440) Q Consensus 340 Epaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v-~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i 418 (440) ||.++|+|+|.=-....-++ ....|....| .|++++.+++.+.++++..++.+... .... -...|++|+++.| T Consensus 272 Ea~~l~~P~iniR~~geRqe----gr~~g~nvlv~~~~~~I~~ai~~~l~~~~~~~~~~~~-~npy-GdG~as~rI~~il 345 (346) T pfam02350 272 EAPSLGKPVLNLRDTTERPE----GREAGTNVLVGTDKEAILAAIEKLLDDEEEYEKMSNA-VNPY-GDGNASERIVDIL 345 (346) T ss_pred HHHHHCCCEEEECCCCCCHH----HHHHCEEEEECCCHHHHHHHHHHHHHCHHHHHHHCCC-CCCC-CCCHHHHHHHHHH T ss_conf 66650896898278888875----6953846997899999999999997196777641247-8989-8986999999996 Q ss_pred H Q ss_conf 9 Q gi|254780402|r 419 D 419 (440) Q Consensus 419 ~ 419 (440) + T Consensus 346 ~ 346 (346) T pfam02350 346 K 346 (346) T ss_pred C T ss_conf 9 No 54 >cd03786 GT1_UDP-GlcNAc_2-Epimerase Bacterial members of the UDP-N-Acetylglucosamine (GlcNAc) 2-Epimerase family are known to catalyze the reversible interconversion of UDP-GlcNAc and UDP-N-acetylmannosamine (UDP-ManNAc). The enzyme serves to produce an activated form of ManNAc residues (UDP-ManNAc) for use in the biosynthesis of a variety of cell surface polysaccharides; The mammalian enzyme is bifunctional, catalyzing both the inversion of stereochemistry at C-2 and the hydrolysis of the UDP-sugar linkage to generate free ManNAc. It also catalyzes the phosphorylation of ManNAc to generate ManNAc 6-phosphate, a precursor to salic acids. In mammals, sialic acids are found at the termini of oligosaccharides in a large variety of cell surface glycoconjugates and are key mediators of cell-cell recognition events. Mutations in human members of this family have been associated with Sialuria, a rare disease caused by the disorders of sialic acid metabolism. This family belongs to the GT-B st Probab=99.10 E-value=4.1e-08 Score=72.94 Aligned_cols=328 Identities=19% Similarity=0.198 Sum_probs=181.5 Q ss_pred EEEEECCHHHHHHHHHHHHHHHHC-CCC-EEEEECCCC----CCHHHHCCC-CCCEEEEECCCCCCHH----------HH Q ss_conf 999817899999999999999861-998-999962776----300121014-7741688505572013----------44 Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAIRSR-HVN-VLLTTMTAT----SAKVARKYL-GQYAIHQYAPLDIQPA----------VS 121 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l~~~-~~~-iliT~~T~t----g~~~~~~~~-~~~~~~~y~P~D~~~~----------~~ 121 (440) |-+=..+--|+--..|++++|+++ ..+ .++.|.+-- |.......+ ........+.-|.... .. T Consensus 2 I~~vtGtRae~~kl~pl~~~l~~~~~~~~~li~TGqH~~~~~g~t~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (363) T cd03786 2 ILVVTGTRPEYIKLAPLIRALKKDPGFELVLVVTGQHYDMEMGVTFFEILFIIKPDYDLLLGSDSQSLGAQTAGLLIGLE 81 (363) T ss_pred EEEEEECHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCHHCCCHHHHHHCCCCCCEEEECCCCCCCHHHHHHHHHHHHH T ss_conf 89999313719999999999974899988999938976701088899982688887785459999769999999999999 Q ss_pred HHHHHHCCEEEEEECCCCHHHHH--HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCC-CEEEECCHHHHHHHH Q ss_conf 55744202035640476214678--887422455058641266542121001111233102644-236641577888998 Q gi|254780402|r 122 RFLKYWKPDCMILSESDIWPLTV--FELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQF-SLVIVQSERYFRRYK 198 (440) Q Consensus 122 ~fl~~~~P~~~i~~e~ElWPnli--~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~-~~i~~qs~~~~~rl~ 198 (440) +.+...+||++++. +|-.=.|- ..+...+||++-+-|-+ +|+.... .-...|.+..++ +..+|.++.+++|+. T Consensus 82 ~~l~~~kPD~VlV~-GDr~e~la~Alaa~~~~Ipi~HiegG~--rs~~~~~-~de~~R~~i~kls~lhf~~t~~~~~~L~ 157 (363) T cd03786 82 AVLLEEKPDLVLVL-GDTNETLAAALAAFKLGIPVAHVEAGL--RSFDRGM-PDEENRHAIDKLSDLHFAPTEEARRNLL 157 (363) T ss_pred HHHHHHCCCEEEEE-CCCCHHHHHHHHHHHHCCCEEEEECCC--CCCCCCC-CCHHHHHHHHHCCCEEECCCHHHHHHHH T ss_conf 99997299999994-888428799999998198189962643--3476799-8779875522101256146199999998 Q ss_pred HHCC--CCEEEECCCCCCCCCCCCCH---HHHHHHHHHCCCCCEEECCCCCCCCCC----CCCCCCCCCCCCCC-EEEEE Q ss_conf 6088--50587204310123442212---568999986179861750465533210----00000000012341-24889 Q gi|254780402|r 199 ELGA--QKLIVSGNLKIDTESLPCDK---ELLSLYQESIAGRYTWAAISTFEGEED----KAVYVHNFIKCRTD-VLTII 268 (440) Q Consensus 199 ~lG~--~~i~v~GnlK~d~~~~~~~~---~~~~~~~~~~~~r~v~vagSth~~EE~----~il~a~~~l~~~~~-~~lII 268 (440) +.|. ++|.++||+-.|........ ...........++.+ ....||.|-. .+-...+.+++..+ -..++ T Consensus 158 ~~G~~~~~I~~vG~~~iD~l~~~~~~~~~~~~~~~~~~~~~~~~--lvt~Hr~~n~~~~~~~~~i~~al~~~~~~~~~v~ 235 (363) T cd03786 158 QEGEPPERIFVVGNTMIDALLRLLELAKKELILELLGLLPKKYI--LVTLHRVENVDDGEQLEEILEALAELAEEDVPVV 235 (363) T ss_pred HHCCCCCCEEECCCCHHHHHHHHHHHHHCCHHHHHCCCCCCCEE--EEEECCCCCCCCHHHHHHHHHHHHHHHHCCCEEE T ss_conf 61547552577386199999988764103266774454558779--9996452333568999999999999874396899 Q ss_pred EECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHH--HHHCCCEEEECCCCCCCCCCHHH-HHHHC Q ss_conf 7055211478887765300101100134444466416750676520255--42208206613554445458376-89619 Q gi|254780402|r 269 VPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFY--LRMTEIAFIGRSFCASGGQNPLE-AAMLG 345 (440) Q Consensus 269 vPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~l--Y~~AdiafVGGSl~~~GGhN~lE-paa~G 345 (440) .|-||.-...+.+...+. . .....|.+++..|....+ ...|+ .+||-| | .+.| |..+| T Consensus 236 ~pn~d~~~~~i~~~~~~~----~--------~~~~~i~~~~~l~~~~~~~ll~~a~-~vigdS----s--Gi~Eea~~l~ 296 (363) T cd03786 236 FPNHPRTRPRIREAGLEF----L--------GHHPNVLLISPLGYLYFLLLLKNAD-LVLTDS----G--GIQEEASFLG 296 (363) T ss_pred EECCCCHHHHHHHHHHHH----H--------CCCCEEEEECCCCHHHHHHHHHCCE-EEEECC----C--CCEEEECCCC T ss_conf 977972577899999998----5--------5787799978877499999995073-998258----8--8688502069 Q ss_pred CCEEECCCCCCHHHHHHHHHHCCCEEEEC-CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 94999898443799999999789869957-989999999999589899999999999999704556999999999 Q gi|254780402|r 346 CAILSGPNVENFRDIYRRMVSSGAVRIVE-EVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLD 419 (440) Q Consensus 346 ~pVi~GP~~~nf~e~~~~L~~~g~~~~v~-d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~ 419 (440) +|++. +.+-++- .+-.+.|....|. |.+++.+++...++++.....+.. .-. -...|++|+++.|+ T Consensus 297 ~P~i~---ir~rqe~-re~~~~~~~~~v~~~~~~I~~~i~~~l~~~~~~~~~~~---npy-GdG~as~rI~~iL~ 363 (363) T cd03786 297 VPVLN---LRDRTER-PETVESGTNVLVGTDPEAILAAIEKLLSDEFAYSLMSI---NPY-GDGNASERIVEILK 363 (363) T ss_pred CCEEE---ECCCCCC-CEEHHCCEEEECCCCHHHHHHHHHHHHCCHHHHHCCCC---CCC-CCCHHHHHHHHHHC T ss_conf 87898---2687767-34210054886589999999999999757034535589---989-79879999999869 No 55 >cd03784 GT1_Gtf_like This family includes the Gtfs, a group of homologous glycosyltransferases involved in the final stages of the biosynthesis of antibiotics vancomycin and related chloroeremomycin. Gtfs transfer sugar moieties from an activated NDP-sugar donor to the oxidatively cross-linked heptapeptide core of vancomycin group antibiotics. The core structure is important for the bioactivity of the antibiotics. Probab=99.09 E-value=6.4e-07 Score=65.12 Aligned_cols=109 Identities=16% Similarity=0.147 Sum_probs=83.4 Q ss_pred CCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCC-CHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEEC----CH Q ss_conf 64167506765202554220820661355444545-83768961994999898443799999999789869957----98 Q gi|254780402|r 302 EVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQ-NPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVE----EV 376 (440) Q Consensus 302 ~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGh-N~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~----d~ 376 (440) ...+++.+-+ -...++..+|+ || .+||+ +..|++++|+|++.=|......+.+..+.+.|++..+. ++ T Consensus 287 ~~nv~i~~~~-pq~~iL~~~~~-~I-----tHgG~~s~~Eal~~GvP~v~~P~~~DQ~~nA~rv~~~G~G~~l~~~~~t~ 359 (401) T cd03784 287 PDNVRVVDFV-PHDWLLPRCAA-VV-----HHGGAGTTAAALRAGVPQLVVPFFGDQPFWAARVAELGAGPALDPRELTA 359 (401) T ss_pred CCCEEEECCC-CHHHHHHHCCE-EE-----ECCCHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHCCCCCCCCCCCCCH T ss_conf 9978995678-98999743799-99-----66875899999981999895377556899999999879712778356999 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 99999999995898999999999999997045569999999997 Q gi|254780402|r 377 GTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDS 420 (440) Q Consensus 377 ~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~ 420 (440) ++|.+++.++|+|+..+ +..+.++++ +..|+.++..+.|++ T Consensus 360 e~l~~av~~lL~~~~~~-~a~~~~~~~--~~~~g~~~aa~~ie~ 400 (401) T cd03784 360 ERLAAALRRLLDPPSRR-RAAALLRRI--REEDGVPSAADVIER 400 (401) T ss_pred HHHHHHHHHHHCCHHHH-HHHHHHHHH--HHCCCHHHHHHHHHC T ss_conf 99999999994899999-999999998--755888999999843 No 56 >pfam00534 Glycos_transf_1 Glycosyl transferases group 1. Mutations in this domain of subunit A of phosphatidylinositol N-acetylglucosaminyltransferase lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharides. Members of this family transfer UDP, ADP, GDP or CMP linked sugars. The eukaryotic glycogen synthases may be distant members of this family. Probab=99.08 E-value=1.3e-09 Score=82.91 Aligned_cols=153 Identities=14% Similarity=0.160 Sum_probs=101.8 Q ss_pred CCCCEEECCCCC-C-CCCCCCCCCCCCCCC--CCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEE Q ss_conf 798617504655-3-321000000000012--341248897055211478887765300101100134444466416750 Q gi|254780402|r 233 AGRYTWAAISTF-E-GEEDKAVYVHNFIKC--RTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLG 308 (440) Q Consensus 233 ~~r~v~vagSth-~-~EE~~il~a~~~l~~--~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~ 308 (440) .++++++..+.. + .--+.++++.+.+.+ .++..++|+.-.+.+. .....+...++. ..|.+. T Consensus 13 ~~~~vi~~~G~~~~~Kg~~~~i~a~~~l~~~~~~~~~l~i~G~~~~~~-~~~~~~~~~~~~-------------~~i~~~ 78 (172) T pfam00534 13 EDKPIILFVGRLVPEKGLDLLLEAFALLKEQLHPNLKLVIVGDGEEEK-KLKKLALKLGLE-------------DNVIFV 78 (172) T ss_pred CCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEECCCCHH-HHHHHHHHCCCC-------------CCEEEE T ss_conf 999599996548554498999998998887408985999983783267-899999983999-------------868995 Q ss_pred CC--CCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHH Q ss_conf 67--6520255422082066135544454583768961994999898443799999999789869957989999999999 Q gi|254780402|r 309 DT--IGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSL 386 (440) Q Consensus 309 Dt--~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~l 386 (440) .. ..++..+|+.||+.++ .|..+..|..++|++++|+|||.-. ...+.|+.+.- ..|..+...|++++++++..+ T Consensus 79 ~~~~~~~~~~~l~~sdi~i~-ps~~E~~~~~~~Eam~~G~pvI~s~-~~~~~eii~~~-~~G~~~~~~~~~~l~~~i~~l 155 (172) T pfam00534 79 GFVPREDLIELYRIADLFVL-PSRYEGFGLVLLEAMAAGVPVIASD-VGGPAEIVKDG-ETGLLVDPGDAEALAEAIEKL 155 (172) T ss_pred CCCCHHHHHHHHHHHHHCCC-CCHHHCCCCHHHHHHHCCCEEEEEC-CCCHHHHHHCC-CEEEEECCCCHHHHHHHHHHH T ss_conf 78898999999997241047-7366515711899996797199956-99729997189-839997899999999999999 Q ss_pred HCCHHHHHHHHHHHHH Q ss_conf 5898999999999999 Q gi|254780402|r 387 LSEPTIRYEMINAAIN 402 (440) Q Consensus 387 l~d~~~~~~~~~~a~~ 402 (440) ++||+.+.+|+++|++ T Consensus 156 i~n~~~~~~m~~n~~~ 171 (172) T pfam00534 156 LKDEELRERLGENARK 171 (172) T ss_pred HCCHHHHHHHHHHHCC T ss_conf 8799999999998448 No 57 >TIGR01133 murG undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase; InterPro: IPR006009 The murG gene of Escherichia coli encodes the N-acetylglucosaminyltransferase, UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase, responsible for the final step in the formation of the lipid-linked disaccharide-pentapeptide subunit of peptidoglycan. The enzyme is peripherally associated with the inner face of the cytoplasmic membrane. Therefore, the peptidoglycan subunit is completely assembled before it traverses the cytoplasmic membrane .; GO: 0050511 undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase activity, 0019277 UDP-N-acetylgalactosamine biosynthetic process, 0009276 1-2nm peptidoglycan-based cell wall. Probab=99.07 E-value=1.4e-08 Score=76.05 Aligned_cols=324 Identities=16% Similarity=0.130 Sum_probs=200.0 Q ss_pred EEEEEECCHHHHHHHHHHHHHHHHCCCCEEEE-ECCCCCCH--HHH-CCCCCC--EEE-----------EECCC-CC--- Q ss_conf 79998178999999999999998619989999-62776300--121-014774--168-----------85055-72--- Q gi|254780402|r 58 LIWFHASSVGETMALIGLIPAIRSRHVNVLLT-TMTATSAK--VAR-KYLGQY--AIH-----------QYAPL-DI--- 116 (440) Q Consensus 58 ~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT-~~T~tg~~--~~~-~~~~~~--~~~-----------~y~P~-D~--- 116 (440) .|-+=+--=|=+--|..++++|.++++++-|. -.|.-|.+ .++ +..-+. +.+ ...|+ -+ T Consensus 7 ~~~~~gGGTGG~fPAlA~a~~l~~~~~~~~v~~lG~~~g~e~~lv~~~~~~~~~~i~~~gl~~~~~~~~~~~~~~~~~~~ 86 (368) T TIGR01133 7 KVALAGGGTGGIFPALAVAEELIKRGPEVEVVWLGTKRGLEERLVPSKEGIEFLTIPVGGLRRKGSKKLLKLPLLKLLKA 86 (368) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCHHHHHHHHHHHHHH T ss_conf 89997278302689999999999748936999850677500003432157417777401003655101467889999999 Q ss_pred CHHHHHHHHHHCCEEEEEECCCC-HHHHHHHHHHCC-CCEEEE-EECCCCCCCCCCHHHHHHHHHCCCCC-CEEEECCHH Q ss_conf 01344557442020356404762-146788874224-550586-41266542121001111233102644-236641577 Q gi|254780402|r 117 QPAVSRFLKYWKPDCMILSESDI-WPLTVFELSKQR-IPQVLV-NARMSRRSFKNWKTVLSFSKKIFSQF-SLVIVQSER 192 (440) Q Consensus 117 ~~~~~~fl~~~~P~~~i~~e~El-WPnli~~~~~~~-ip~~li-narls~~S~~~~~~~~~l~~~~~~~~-~~i~~qs~~ 192 (440) -+.++++|+.++|++++-+.+=. =|-.+.. +..| ||+++- |++ +.++.++++++ +.|+...+. T Consensus 87 ~~~a~~~l~~~~p~~v~G~GGY~s~P~~~AA-~l~g~iP~~~EQN~~------------pG~~Nk~ls~~A~~V~~~f~~ 153 (368) T TIGR01133 87 VLQARRILKKFKPDVVVGFGGYVSGPAGLAA-KLLGRIPLILEQNAV------------PGLTNKLLSRFAKRVLVSFPG 153 (368) T ss_pred HHHHHHHHHHCCCCEEEECCCHHHHHHHHHH-HHCCCCCEEEEECCH------------HHHHHHHHHHHHCEEEEECHH T ss_conf 9999999860087479874736789999998-766799489861541------------257888878874431110513 Q ss_pred HHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCC--CCCCCCCCCCCCCC-CCEEEEEE Q ss_conf 8889986088505872043101234422125689999861798617504655332--10000000000123-41248897 Q gi|254780402|r 193 YFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGE--EDKAVYVHNFIKCR-TDVLTIIV 269 (440) Q Consensus 193 ~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~E--E~~il~a~~~l~~~-~~~~lIIv 269 (440) +...|.. .+++.++||.................+...-+.-.|+|.|-|.-.+ .+.+.++...|.+. .....+.. T Consensus 154 ~~~~~~~--~~~~~~~g~pvr~~~~~~~~~~~~~~~~~~~~~~~ilv~GGSQGA~~lN~~vp~~~~~L~~~~~~~~~~~~ 231 (368) T TIGR01133 154 AKKFFPA--AEKVVVVGNPVREEIRSLPAARARKRFKLRPGKLRILVLGGSQGAKILNELVPKALAKLAEKGLILVIIQG 231 (368) T ss_pred HHCCCCC--CCCEEEECCHHHHHHHCCCCHHHHHHHCCCCCCCEEEEECCCHHHHHHHHHHHHHHHHHHHCCCEEEEEEC T ss_conf 3226766--67568701413454303782568886216899827999627376899999999999886401652588876 Q ss_pred ECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEE Q ss_conf 05521147888776530010110013444446641675067652025542208206613554445458376896199499 Q gi|254780402|r 270 PRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAIL 349 (440) Q Consensus 270 PRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi 349 (440) + ....+++.+...+.|+........ ++ --+|...|+.||+++. --|--.+-|=++.|+|-| T Consensus 232 g--~~~~~~~~~~y~~~~l~~~~~~~f--~~----------~~dm~~~y~~ADLvIs-----RAGA~T~~El~a~G~PaI 292 (368) T TIGR01133 232 G--KGDLEKVKNVYSELGLVAAKITPF--ID----------NEDMAAAYAAADLVIS-----RAGASTVAELAAAGVPAI 292 (368) T ss_pred C--HHHHHHHHHHHHHCCCHHHHHHCC--CC----------HHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHCCCEE T ss_conf 6--377999999852137102221037--78----------7579999987404002-----333789999997177737 Q ss_pred ECCCC-----CCHHHHHHHHHHCCCEEEEC----CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 98984-----43799999999789869957----9899999999995898999999999999997045569999999 Q gi|254780402|r 350 SGPNV-----ENFRDIYRRMVSSGAVRIVE----EVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRS 417 (440) Q Consensus 350 ~GP~~-----~nf~e~~~~L~~~g~~~~v~----d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~ 417 (440) +=|+- +|...-++.|.+.|++..+. ..+.|.+.+..|..|+..+++|+++++++.+. +|.+++.+. T Consensus 293 liPyP~a~~r~~Q~~NA~~l~~~gag~~~~q~~~~~e~l~~~~~~l~~~~~~~~~~~~~~~~~~~~--~a~~~~~~~ 367 (368) T TIGR01133 293 LIPYPYAAKRDDQYYNAKFLEDAGAGIVIEQKELLTEKLVSALLKLLKDPARLEEMAEAARKLAKP--DAAKRIAEA 367 (368) T ss_pred ECCCCCCCCHHHHHHHHHHHHHCCCCEEEHHHCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCC--CHHHHHHHH T ss_conf 625875468178999999997344654640220476899999874161089999999999986157--277999994 No 58 >cd03804 GT1_wbaZ_like This family is most closely related to the GT1 family of glycosyltransferases. wbaZ in Salmonella enterica has been shown to possess the mannosyl transferase activity. The members of this family are found in certain bacteria and Archaea. Probab=99.06 E-value=1.7e-09 Score=82.05 Aligned_cols=191 Identities=15% Similarity=0.117 Sum_probs=116.4 Q ss_pred HHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCC-CCCCCC Q ss_conf 1112331026442366415778889986088505872043101234422125689999861798617504655-332100 Q gi|254780402|r 172 VLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTF-EGEEDK 250 (440) Q Consensus 172 ~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth-~~EE~~ 250 (440) ++.+.....+..|.|++.|+..+++++++--....|.-| +++.+.... ....+..++..|-.. ..--+. T Consensus 143 lr~~~~~~~~~~d~iianS~~t~~~i~~~~~~~~~Viyp--------pvd~~~~~~--~~~~~~~~l~vgRl~~~K~~~~ 212 (351) T cd03804 143 LRIWDRRSAARVDYFIANSRFVARRIKKYYGRDATVIYP--------PVDTDRFTP--AEEKEDYYLSVGRLVPYKRIDL 212 (351) T ss_pred HHHHHHHHHHCCCEEEECCHHHHHHHHHHHCCCCCEECC--------CCCCCCCCC--CCCCCCCEEEEECCCHHCCCHH T ss_conf 999999887438899987989999999985899735582--------983543575--7666772488551504218089 Q ss_pred CCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCC--CCCHHHHHHCCCEEEEC Q ss_conf 0000000012341248897055211478887765300101100134444466416750676--52025542208206613 Q gi|254780402|r 251 AVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTI--GEMGFYLRMTEIAFIGR 328 (440) Q Consensus 251 il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~--GeL~~lY~~AdiafVGG 328 (440) +++|++.+. ..|+|+..-+++ +++++.+. .+|.+...+ .++..+|+.|+ +||-. T Consensus 213 ~i~a~~~~~----~~L~i~G~g~~~-~~l~~~~~------------------~~V~f~g~~~~~~~~~~~~~a~-~~v~p 268 (351) T cd03804 213 AIEAFNKLG----KRLVVIGDGPEL-DRLRAKAG------------------PNVTFLGRVSDEELRDLYARAR-AFLFP 268 (351) T ss_pred HHHHHHHCC----CCEEEEECCHHH-HHHHHHCC------------------CCEEEEECCCHHHHHHHHHHCC-EEEEC T ss_conf 999998479----978999847379-99996678------------------9879980389899999998388-69951 Q ss_pred CCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 55444545837689619949998984437999999997898699579899999999995898999999999 Q gi|254780402|r 329 SFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINA 399 (440) Q Consensus 329 Sl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~ 399 (440) |. +.-|-.++||+++|+|||. .+.....|++.+ -.+|......|+++|++++.++++||+.+.+..++ T Consensus 269 s~-E~FGi~~vEAma~G~PvIa-~~~gG~~e~v~~-g~tG~l~~~~~~~~la~ai~~~~~~~~~~~~~~r~ 336 (351) T cd03804 269 AE-EDFGIVPVEAMASGTPVIA-YGKGGALETVID-GVTGILFEEQTVESLAAAVERFEKNEDFDPQAIRA 336 (351) T ss_pred CH-HCCCCHHHHHHHCCCCEEE-ECCCCCCCEECC-CCCEEEECCCCHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 64-2079759999976998898-289997550158-99789959899999999999998595015999999 No 59 >PRK10125 predicted glycosyl transferase; Provisional Probab=98.93 E-value=7.8e-08 Score=71.14 Aligned_cols=248 Identities=14% Similarity=0.126 Sum_probs=135.3 Q ss_pred HHHCCE-EEEEECCCCHHH------------HHHHHHHCCCCEEEEE-ECCCCCCCCCCHHHHHHHHHCCCCCCEEEECC Q ss_conf 442020-356404762146------------7888742245505864-12665421210011112331026442366415 Q gi|254780402|r 125 KYWKPD-CMILSESDIWPL------------TVFELSKQRIPQVLVN-ARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQS 190 (440) Q Consensus 125 ~~~~P~-~~i~~e~ElWPn------------li~~~~~~~ip~~lin-arls~~S~~~~~~~~~l~~~~~~~~~~i~~qs 190 (440) +..++. -+||.=.|.||- +-..|. .||..--. ...-++|.+-|++-+.+++.+.+....+.+.| T Consensus 121 ki~k~~kPvVWTLHDmW~fTG~Chy~~~C~~wk~gC~--~CP~L~~yp~~~~D~s~~~~~~K~~~~~~~~~~~~~~V~ps 198 (405) T PRK10125 121 KNHKPDVTLVWTLHDHWSVTGRCAFTDGCEGWKTGCQ--KCPTLNNYPPVRVDRAHQLVAGKRQLFREMLRLGCQFISPS 198 (405) T ss_pred HHCCCCCCEEEEECCCCCCCCCCCCCCCHHHHHHCCC--CCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCEEEECC T ss_conf 4304799789994367655677788784376762556--56445678853357489999999999875352177199788 Q ss_pred HHHHHHHHH-HCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCC--CC-CCC-CCCCCCCCCCCCCCCCEE Q ss_conf 778889986-0885058720431012344221256899998617986175046--55-332-100000000001234124 Q gi|254780402|r 191 ERYFRRYKE-LGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAIS--TF-EGE-EDKAVYVHNFIKCRTDVL 265 (440) Q Consensus 191 ~~~~~rl~~-lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagS--th-~~E-E~~il~a~~~l~~~~~~~ 265 (440) ..-++...+ ++..++.|+.|. .|........+ ....+ ...+++++.++. .. ++. -..++++...+ ..+.. T Consensus 199 ~~~~~~~~sl~~~~~v~vIpNg-ID~~~~~~~~~-~~~~~-~~~~~~~i~~~a~~~~~~~k~~~~ll~~l~~l--~~~~~ 273 (405) T PRK10125 199 QHVADAFNSLYGPGRCRIINNG-IDMATEAILAE-LPPVR-ETQGKPRIAVVAHDLRYDGKTDQQLVREMMAL--GDKIE 273 (405) T ss_pred HHHHHHHHHHCCCCCEEEECCC-CCCCCCCCCHH-HHHHC-CCCCCCEEEEEECCCCCCCHHHHHHHHHHHHC--CCCEE T ss_conf 9999998756179986783899-78543444304-55404-37899769999545445633389999999844--89708 Q ss_pred EEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEE---ECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHH Q ss_conf 889705521147888776530010110013444446641675---06765202554220820661355444545837689 Q gi|254780402|r 266 TIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFL---GDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAA 342 (440) Q Consensus 266 lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i---~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpa 342 (440) ++++..+-.. ...+|+- +..--+|..+|+.|| +||-.|+.+.-|...+||+ T Consensus 274 l~~~G~~~~~-------------------------~~~~v~~lg~~~d~~~La~~YsaAd-~~v~ps~~e~~~~~~~Ea~ 327 (405) T PRK10125 274 LHTFGKFSPF-------------------------TAGNVVNHGFETDKRKLMSALNQMD-ALVFSSRVDNYPLILCEAL 327 (405) T ss_pred EEEEECCCCC-------------------------CCCCEEECCCCCCHHHHHHHHHHCC-EEEECCHHHCCCHHHHHHH T ss_conf 9997257644-------------------------6887464687589999999996378-8972746754650899999 Q ss_pred HHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC Q ss_conf 619949998984437999999997898699579899999999995898999999999999997045 Q gi|254780402|r 343 MLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 343 a~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ 408 (440) +||+||+. -+....+|+.+. +.|......|.++|++.+...++++.......+..++..++.. T Consensus 328 acg~pvv~-~~~~g~~~~~~~--~~g~~~~~~d~~~la~~i~~~~~~~~~~~~~~~~~~~~~~~fs 390 (405) T PRK10125 328 SIGVPVIA-THSDAAREVLQK--SGGKTVSEEEVLQLAQLSKPEIAQAVFGTTLAEFSQRSRAAYS 390 (405) T ss_pred HCCCCEEE-ECCCCCHHHEEC--CCCEEECCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCC T ss_conf 74998898-359997365115--8745966889999999899998460467789999999998679 No 60 >cd03806 GT1_ALG11_like This family is most closely related to the GT1 family of glycosyltransferases. ALG11 in yeast is involved in adding the final 1,2-linked Man to the Man5GlcNAc2-PP-Dol synthesized on the cytosolic face of the ER. The deletion analysis of ALG11 was shown to block the early steps of core biosynthesis that takes place on the cytoplasmic face of the ER and lead to a defect in the assembly of lipid-linked oligosaccharides. Probab=98.93 E-value=2.7e-08 Score=74.15 Aligned_cols=216 Identities=19% Similarity=0.151 Sum_probs=133.9 Q ss_pred HHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCC-CCCEEECCCCC--CC Q ss_conf 0111123310264423664157788899860885058720431012344221256899998617-98617504655--33 Q gi|254780402|r 170 KTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIA-GRYTWAAISTF--EG 246 (440) Q Consensus 170 ~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~-~r~v~vagSth--~~ 246 (440) +.|-+++....+..|.|++-|.-++.+++++--. +.+.....||++.+.......... ..+.++--+-- +. T Consensus 177 ~~f~~ly~~~~~~ad~v~vNS~~T~~~i~~~w~~------~~~~~VvYPP~d~~~~~~~~~~~~~~~~~ilSi~rfrpeK 250 (419) T cd03806 177 RLFAFLYGLAGSFADVVMVNSTWTRNHIRSLWKR------NTKPSIVYPPCDVEELLKLPLDEKTRENQILSIAQFRPEK 250 (419) T ss_pred HHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHCC------CCCCCCCCCCCCHHHHCCCCCCCCCCCCEEEEEECCCCCC T ss_conf 9999999997257749998578699999998388------8886414799776774536654235766799986455456 Q ss_pred CCCCCCCCCCCCCC-C-----CCEEEEEEE--CCC---CHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEEC-CCCCC Q ss_conf 21000000000012-3-----412488970--552---114788877653001011001344444664167506-76520 Q gi|254780402|r 247 EEDKAVYVHNFIKC-R-----TDVLTIIVP--RHP---RRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGD-TIGEM 314 (440) Q Consensus 247 EE~~il~a~~~l~~-~-----~~~~lIIvP--Rhp---eR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~D-t~GeL 314 (440) .-+++++++..+++ . ++.+|+|+. |.. +|.+++++++.+.+++- +. .++.| +..|+ T Consensus 251 n~~L~i~af~~l~~~~~~~~~~~~~Lvi~Gg~R~~ed~~~~~~L~~la~~l~l~~-----------~V-~f~~~~s~~e~ 318 (419) T cd03806 251 NHPLQLRAFAKLLKRLPEEIKEKIKLVLIGSCRNEDDEKRVEDLKLLAKELGLED-----------KV-EFVVNAPFEEL 318 (419) T ss_pred CHHHHHHHHHHHHHHCCHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCC-----------CE-EEEECCCHHHH T ss_conf 8699999999998746301056855999948876556999999999999729988-----------76-99815998999 Q ss_pred HHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEE----CCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCC- Q ss_conf 255422082066135544454583768961994999----898443799999999789869957989999999999589- Q gi|254780402|r 315 GFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILS----GPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSE- 389 (440) Q Consensus 315 ~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~----GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d- 389 (440) ..+++-|.+. +=---.++-|-.|+|+|++|+|||. ||.. +++..-.+...++..++++|.++++.++++. T Consensus 319 ~~lL~~a~~~-l~T~~nEHFGI~pVEaMaaG~pvvA~nSGGP~e----dIV~~~~~~~tGfL~~~~~e~a~a~~~~l~~~ 393 (419) T cd03806 319 LEELSTASIG-LHTMWNEHFGIGVVEYMAAGLIPLAHASGGPLL----DIVVPWDGGPTGFLASTAEEYAEAIEKILSLS 393 (419) T ss_pred HHHHHHCEEE-EECCCCCCCCCHHHHHHHCCCCEEEECCCCCCC----CEEEECCCCCCCCCCCCHHHHHHHHHHHHCCC T ss_conf 9999739798-855732566858999986699579978889753----07760589985112798799999999998199 Q ss_pred HHHHHHHHHHHHHHHHHCC Q ss_conf 8999999999999997045 Q gi|254780402|r 390 PTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 390 ~~~~~~~~~~a~~~i~~~~ 408 (440) ++.+..|..+|+..+++-+ T Consensus 394 ~~~~~~~~~~ar~~~~rFS 412 (419) T cd03806 394 EEERLRIRRAARSSVKRFS 412 (419) T ss_pred HHHHHHHHHHHHHHHHHCC T ss_conf 8789999999999998468 No 61 >PRK01021 lpxB lipid-A-disaccharide synthase; Reviewed Probab=98.86 E-value=2e-07 Score=68.49 Aligned_cols=318 Identities=11% Similarity=-0.003 Sum_probs=177.4 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCC----CC-CEEE-EEC-CC-------CCCHHHHHHHHHHCCEEEEEE Q ss_conf 9999999999861998999962776300121014----77-4168-850-55-------720134455744202035640 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYL----GQ-YAIH-QYA-PL-------DIQPAVSRFLKYWKPDCMILS 135 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~----~~-~~~~-~y~-P~-------D~~~~~~~fl~~~~P~~~i~~ 135 (440) ..+..|+++|++++|++ ++.-..|-.+.+.-+ +- ...+ .+. ++ -......+.+...+||++|.+ T Consensus 240 ~lGa~Li~~Lk~~~P~i--~F~GVGGp~M~~eGl~slf~me~lsVMG~~EVL~~lp~L~~~~~~l~~~i~~~~PD~~I~I 317 (607) T PRK01021 240 TLGGNLLKEIKALYPDI--RCFGVGGPQMRQEGLEPLFNMEEFQVSGFWEVLLSLFKLYYRYRKLYKTILKENPETVICI 317 (607) T ss_pred HHHHHHHHHHHHHCCCC--EEEECCHHHHHHCCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 77999999998549898--8996377999876886124467710114899998899999999999999986199999995 Q ss_pred CCC-CHHHHHHHHHHCCC--CEEE-EEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCC Q ss_conf 476-21467888742245--5058-6412665421210011112331026442366415778889986088505872043 Q gi|254780402|r 136 ESD-IWPLTVFELSKQRI--PQVL-VNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNL 211 (440) Q Consensus 136 e~E-lWPnli~~~~~~~i--p~~l-inarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~Gnl 211 (440) ++. +=-.+-+.++++|+ |++- |.- .=|.|-+.-.+.+-+.+|++++-=+...+.|++-|.+... .|+. T Consensus 318 D~PdFNlrlak~lkk~gi~ik~vhYVsP-------sVWAWr~~R~k~i~~~vD~~l~lfPFE~~~~~~~g~~~~y-VGHP 389 (607) T PRK01021 318 DFPDFHFLLIKKLRKCGYKGKIIHYVCP-------SIWAWRPKRKTILEKYLDLLLLILPFEQGLFKNSPLRTVY-LGHP 389 (607) T ss_pred CCCCCCHHHHHHHHHCCCCCCEEEEECC-------CEEEECCCHHHHHHHHHHHHEECCCCCHHHHHCCCCCCEE-ECCC T ss_conf 8998788999999972899986899788-------3688662179999998867305267788999507998379-7896 Q ss_pred CCCCCCCCCCHHHHHHHHHHCCCCCEEE-CCCCCCCCCCCCCCC----CCCCCCCCCEEEEEEECCCCHHHHHHHHHHHH Q ss_conf 1012344221256899998617986175-046553321000000----00001234124889705521147888776530 Q gi|254780402|r 212 KIDTESLPCDKELLSLYQESIAGRYTWA-AISTFEGEEDKAVYV----HNFIKCRTDVLTIIVPRHPRRCDAIERRLIAK 286 (440) Q Consensus 212 K~d~~~~~~~~~~~~~~~~~~~~r~v~v-agSth~~EE~~il~a----~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~ 286 (440) -.|..+...+...... .....+++++. .-.++.+|....+.. ........+..+++.+-.|+..+.+.+.+++. T Consensus 390 l~e~i~~~~~~~~~r~-~l~~~~k~IIALLPGSR~SEI~RlLPI~~~ai~~~~l~~~~~~lV~~a~p~~~~~i~e~l~~~ 468 (607) T PRK01021 390 LVETISNFQPCASWKE-QLLPSDKPIVAAFPGSRRGDILRNLTIQVQAFLASSLAQTHQLLVSSANPKYDHLILDVLQQE 468 (607) T ss_pred CHHHCCCCCCCHHHHH-HCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHHHC T ss_conf 0120223477456998-617788988999089978999987499999999872446764999568766889999999864 Q ss_pred CCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHH-HHHHHH Q ss_conf 0101100134444466416750676520255422082066135544454583768961994999898443799-999999 Q gi|254780402|r 287 GLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRD-IYRRMV 365 (440) Q Consensus 287 gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e-~~~~L~ 365 (440) +.. .. . +++. +.-....+.||+|.+.-+ --.||.|-+|+|.+.+=.+.-+.. +.+.+. T Consensus 469 ~~~------------~~-~-II~~-~~kyeam~aSDaALaASG------TATLE~ALagvPmVVaYKlnpLTyfIAK~Lv 527 (607) T PRK01021 469 GCL------------HS-K-IVPS-QFRYELMRECDCALAKCG------TIVLETALNQTPTIVTCQLRPFDTFLAKYIF 527 (607) T ss_pred CCC------------CC-E-ECCC-HHHHHHHHHCCEEEECCC------HHHHHHHHHCCCEEEEEECCHHHHHHHHHHH T ss_conf 898------------84-5-7173-258999985889988877------8999999838898999967827999999999 Q ss_pred HC-----C--------CE-------EEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH---CCCHHHHHHHHHHH Q ss_conf 78-----9--------86-------995798999999999958989999999999999970---45569999999997 Q gi|254780402|r 366 SS-----G--------AV-------RIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKK---MQGPLKITLRSLDS 420 (440) Q Consensus 366 ~~-----g--------~~-------~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~---~~ga~~~~~~~i~~ 420 (440) +. + .+ ..=-++|+++.++. +|+||+.+++....+++..+. +.-+++..++.|-. T Consensus 528 KI~lp~vsLPNILagr~VVPElI~gQ~D~~PE~iAaAl~-lL~~p~~~ekq~~~c~~~~~~~~~~~~~~~e~~~~~~~ 604 (607) T PRK01021 528 KILLPAYSLPNIILGSVIFPEFIGGKKDFHPEEVAAALD-ILKTSQSKEKQKEACRDLYKAMTTSAITSKECLKRIFE 604 (607) T ss_pred HHCCCCCCCCHHHCCCCCCCCCCCCCCCCCHHHHHHHHH-HHHCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 725751211301169986666367766589899999999-87191567999999999999987178878999999986 No 62 >cd03791 GT1_Glycogen_synthase_DULL1_like This family is most closely related to the GT1 family of glycosyltransferases. Glycogen synthase catalyzes the formation and elongation of the alpha-1,4-glucose backbone using ADP-glucose, the second and key step of glycogen biosynthesis. This family includes starch synthases of plants, such as DULL1 in Zea mays and glycogen synthases of various organisms. Probab=98.81 E-value=5.4e-06 Score=59.06 Aligned_cols=258 Identities=16% Similarity=0.160 Sum_probs=135.6 Q ss_pred HHCCEEEEEEC--CCCHHHHHHHHHH----CCCCEEEE--E----ECCCCCCCCCCH-------------HH--HHHHHH Q ss_conf 42020356404--7621467888742----24550586--4----126654212100-------------11--112331 Q gi|254780402|r 126 YWKPDCMILSE--SDIWPLTVFELSK----QRIPQVLV--N----ARMSRRSFKNWK-------------TV--LSFSKK 178 (440) Q Consensus 126 ~~~P~~~i~~e--~ElWPnli~~~~~----~~ip~~li--n----arls~~S~~~~~-------------~~--~~l~~~ 178 (440) .++||++-.=+ |-+=|.+++...+ .++|+++. | |..+...+.++. +. -.+.+. T Consensus 127 ~~~pDVIH~HDW~tal~~~~lk~~~~~~~~~~~p~V~TIHnl~~qG~~~~~~~~~~~~~~~~~~~~~~~~~~~~in~lk~ 206 (476) T cd03791 127 GWKPDIIHCHDWHTGLVPALLKEKYADPFFKNIKTVFTIHNLAYQGVFPLEALEDLGLPWEELFHIDGLEFYGQVNFLKA 206 (476) T ss_pred CCCCCEEEECCHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCHHCCCCCHHHHHHCCCCCHHHHHHCCCCCCCCHHHHHH T ss_conf 99988799765379999999999653002469988999643010687874358761897066530001002450359999 Q ss_pred CCCCCCEEEECCHHHHHHHHH--HC----------CCCEEEECC-CCCCCCCCCC---------------CHHHHHHHHH Q ss_conf 026442366415778889986--08----------850587204-3101234422---------------1256899998 Q gi|254780402|r 179 IFSQFSLVIVQSERYFRRYKE--LG----------AQKLIVSGN-LKIDTESLPC---------------DKELLSLYQE 230 (440) Q Consensus 179 ~~~~~~~i~~qs~~~~~rl~~--lG----------~~~i~v~Gn-lK~d~~~~~~---------------~~~~~~~~~~ 230 (440) -+..-|.|.+.|+..++.... .| ..++.++-| +-.+.-.+.. .......+.+ T Consensus 207 gi~~ad~V~tVS~~ya~Ei~~~~~g~~l~~~l~~~~~kl~gI~NGiD~~~~~P~~d~~l~~~~~~~~~~~k~~~k~~l~~ 286 (476) T cd03791 207 GIVYADAVTTVSPTYAREILTPEFGEGLDGLLRARAGKLSGILNGIDYDVWNPATDPHLPANYSADDLEGKAENKAALQE 286 (476) T ss_pred HHHHCCEEEEECHHHHHHHCCCCCCCCHHHHHHHCCCCEEEEECCCCHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 98747877980777887764721366666675304687699858988643388777531056671135517999999999 Q ss_pred HC-----CCCCEEECCCC---CCCCCCCCCCCCC-CCCCCCCEEEEEEECCCCHH-HHHHHHHHHHCCHHHHHCCCCCCC Q ss_conf 61-----79861750465---5332100000000-00123412488970552114-788877653001011001344444 Q gi|254780402|r 231 SI-----AGRYTWAAIST---FEGEEDKAVYVHN-FIKCRTDVLTIIVPRHPRRC-DAIERRLIAKGLKVARRSRGDVIN 300 (440) Q Consensus 231 ~~-----~~r~v~vagSt---h~~EE~~il~a~~-~l~~~~~~~lIIvPRhpeR~-~~i~~~l~~~gl~~~~~S~~~~~~ 300 (440) .+ .+.|+++.-+- ++| -+.++++.. .+++ +.++||+..-.... +.+.++....+ T Consensus 287 ~~gl~~d~~~pl~~~vgRl~~qKG-~dll~~a~~~~~~~--~~~~vi~G~G~~~~e~~~~~l~~~~~------------- 350 (476) T cd03791 287 ELGLPVDPDAPLFGFVGRLTEQKG-IDLLLEALPELLEL--GGQLVILGSGDPEYEEALRELAARYP------------- 350 (476) T ss_pred HHCCCCCCCCCEEEEEECCCCCCC-HHHHHHHHHHHHHC--CCEEEEEECCCHHHHHHHHHHHHHCC------------- T ss_conf 959897899868999423520248-99999999999963--98899994697789999999997689------------- Q ss_pred CCCEEEEECCCCCCH-HHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHH-----HCCCEEEEC Q ss_conf 664167506765202-55422082066135544454583768961994999898443799999999-----789869957 Q gi|254780402|r 301 AEVDIFLGDTIGEMG-FYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMV-----SSGAVRIVE 374 (440) Q Consensus 301 ~~~~V~i~Dt~GeL~-~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~-----~~g~~~~v~ 374 (440) .+..+++. --.+|. .+|+.||+ ||-.|..+--|-+.+||+++|+|+|. ..+...+|.+.+.. .+|..+.-. T Consensus 351 ~~~~~~~~-~~e~l~~~lya~aD~-~l~PS~~EP~Gl~qleAm~~GtppIa-~~tGGL~dtV~d~~~~~~~~tGf~f~~~ 427 (476) T cd03791 351 GRVAVLIG-YDEALAHLIYAGADF-FLMPSRFEPCGLTQMYAMRYGTVPIV-RATGGLADTVIDYNEDTGEGTGFVFEGY 427 (476) T ss_pred CCEEEEEE-CCHHHHHHHHHHCCE-EECCCCCCCCCHHHHHHHHCCCCEEE-CCCCCCCCEEECCCCCCCCCCEEEECCC T ss_conf 95999980-687889999984999-74255457854899999866997598-0699986510036667777745886799 Q ss_pred CHHHHHHHHHHHH---CCHHHHHHHHHHHHH Q ss_conf 9899999999995---898999999999999 Q gi|254780402|r 375 EVGTLADMVYSLL---SEPTIRYEMINAAIN 402 (440) Q Consensus 375 d~~eL~~~l~~ll---~d~~~~~~~~~~a~~ 402 (440) |+++|..++.+.+ .||+..++|.++|.+ T Consensus 428 ~~~~l~~ai~~al~~~~~~~~~~~l~~~am~ 458 (476) T cd03791 428 NADALLAALRRALALYRDPEAWRKLQRNAMA 458 (476) T ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 9999999999999985799999999998864 No 63 >COG0763 LpxB Lipid A disaccharide synthetase [Cell envelope biogenesis, outer membrane] Probab=98.80 E-value=8.4e-07 Score=64.36 Aligned_cols=324 Identities=18% Similarity=0.160 Sum_probs=192.9 Q ss_pred ECCHHHH---HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCC-CEEE-----------EECCCCCCHHHHHHHH-- Q ss_conf 1789999---999999999986199899996277630012101477-4168-----------8505572013445574-- Q gi|254780402|r 63 ASSVGET---MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQ-YAIH-----------QYAPLDIQPAVSRFLK-- 125 (440) Q Consensus 63 aaSvGE~---~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~-~~~~-----------~y~P~D~~~~~~~fl~-- 125 (440) +.|.||. .....|++.|+++++++=. .-..|-.+.+.-+.. ...+ -.+| -+....++..+ T Consensus 5 ~i~AGE~SGDllGa~LikaLk~~~~~~ef--vGvgG~~m~aeG~~sl~~~~elsvmGf~EVL~~lp-~llk~~~~~~~~i 81 (381) T COG0763 5 ALSAGEASGDLLGAGLIKALKARYPDVEF--VGVGGEKMEAEGLESLFDMEELSVMGFVEVLGRLP-RLLKIRRELVRYI 81 (381) T ss_pred EEEECCCCHHHHHHHHHHHHHHHCCCEEE--EEECCHHHHHCCCCCCCCHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHH T ss_conf 99904443114689999999863898389--98176788865576555889987822999999889-9999999999999 Q ss_pred -HHCCEEEEEECCC-CHHHHHHHHHHCC--CCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHC Q ss_conf -4202035640476-2146788874224--55058641266542121001111233102644236641577888998608 Q gi|254780402|r 126 -YWKPDCMILSESD-IWPLTVFELSKQR--IPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELG 201 (440) Q Consensus 126 -~~~P~~~i~~e~E-lWPnli~~~~~~~--ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG 201 (440) ..+||.+|++++. +=-.+...+++.+ ||++-.=+ -.=|.|-++-...+-+..|++++.=+.+.+.|.+.| T Consensus 82 ~~~kpD~~i~IDsPdFnl~vak~lrk~~p~i~iihYV~------PsVWAWr~~Ra~~i~~~~D~lLailPFE~~~y~k~g 155 (381) T COG0763 82 LANKPDVLILIDSPDFNLRVAKKLRKAGPKIKIIHYVS------PSVWAWRPKRAVKIAKYVDHLLAILPFEPAFYDKFG 155 (381) T ss_pred HHCCCCEEEEECCCCCCHHHHHHHHHHCCCCCEEEEEC------CCEEEECHHHHHHHHHHHHHEEEECCCCHHHHHHCC T ss_conf 85599889996789886499999997089998699978------530565521689999976172143677889998559 Q ss_pred CCCEEEECCCCCCCCCCCCCHHH-HHHHHHHCCCCCEEECCCCCCCCCCCCCC----CCCCCC-CCCCEEEEEEECCCCH Q ss_conf 85058720431012344221256-89999861798617504655332100000----000001-2341248897055211 Q gi|254780402|r 202 AQKLIVSGNLKIDTESLPCDKEL-LSLYQESIAGRYTWAAISTFEGEEDKAVY----VHNFIK-CRTDVLTIIVPRHPRR 275 (440) Q Consensus 202 ~~~i~v~GnlK~d~~~~~~~~~~-~~~~~~~~~~r~v~vagSth~~EE~~il~----a~~~l~-~~~~~~lIIvPRhpeR 275 (440) .+ .+..|+.=-|..+...+.+. ...+....+.+.+.+.-.|+.+|...++. +.+.++ +.|++.+++ |--++- T Consensus 156 ~~-~~yVGHpl~d~i~~~~~r~~ar~~l~~~~~~~~lalLPGSR~sEI~rl~~~f~~a~~~l~~~~~~~~~vl-p~~~~~ 233 (381) T COG0763 156 LP-CTYVGHPLADEIPLLPDREAAREKLGIDADEKTLALLPGSRRSEIRRLLPPFVQAAQELKARYPDLKFVL-PLVNAK 233 (381) T ss_pred CC-EEEECCHHHHHCCCCCCHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEE-ECCCHH T ss_conf 97-0896780564234434578999982899778769981688588999877899999999986589965999-568478 Q ss_pred HHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCC Q ss_conf 47888776530010110013444446641675067652025542208206613554445458376896199499989844 Q gi|254780402|r 276 CDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVE 355 (440) Q Consensus 276 ~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~ 355 (440) .+.+.....+. . .....+++.| |+-...+..||+|++.- |-..||+|.+|+|.+.+=.+. T Consensus 234 ~~~~~~~~~~~-------~-----~~~~~~~~~~--~~~~~a~~~aD~al~aS------GT~tLE~aL~g~P~Vv~Yk~~ 293 (381) T COG0763 234 YRRIIEEALKW-------E-----VAGLSLILID--GEKRKAFAAADAALAAS------GTATLEAALAGTPMVVAYKVK 293 (381) T ss_pred HHHHHHHHHHC-------C-----CCCCEEEEEC--CHHHHHHHHHHHHHHHC------CHHHHHHHHHCCCEEEEEECC T ss_conf 89999998623-------3-----4576078407--45789999841899844------679999998299979999445 Q ss_pred CHH----------------------HHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC-C-HH Q ss_conf 379----------------------99999997898699579899999999995898999999999999997045-5-69 Q gi|254780402|r 356 NFR----------------------DIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQ-G-PL 411 (440) Q Consensus 356 nf~----------------------e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~-g-a~ 411 (440) -+. +++.++++.. + .++.+++++..++.|+..++++.+.-.+.-+..+ + +. T Consensus 294 ~it~~iak~lvk~~yisLpNIi~~~~ivPEliq~~----~-~pe~la~~l~~ll~~~~~~~~~~~~~~~l~~~l~~~~~~ 368 (381) T COG0763 294 PITYFIAKRLVKLPYVSLPNILAGREIVPELIQED----C-TPENLARALEELLLNGDRREALKEKFRELHQYLREDPAS 368 (381) T ss_pred HHHHHHHHHHCCCCCCCCHHHHCCCCCCHHHHHHH----C-CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCHH T ss_conf 88999999861677244368861873046777531----6-999999999998348676999999999999997589688 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999752 Q gi|254780402|r 412 KITLRSLDSYV 422 (440) Q Consensus 412 ~~~~~~i~~~l 422 (440) ++..+.+...+ T Consensus 369 e~aA~~vl~~~ 379 (381) T COG0763 369 EIAAQAVLELL 379 (381) T ss_pred HHHHHHHHHHH T ss_conf 99999999874 No 64 >TIGR03609 S_layer_CsaB polysaccharide pyruvyl transferase CsaB. The CsaB protein (cell surface anchoring B) of Bacillus anthracis adds a pyruvoyl group to peptidoglycan-associated polysaccharide. This addition is required for proteins with an S-layer homology domain (pfam00395) to bind. Within the larger group of proteins described by pfam04230, this model represents a distinct clade that nearly exactly follows the phylogenetic distribution of the S-layer homology domain (pfam00395). Probab=98.76 E-value=1e-06 Score=63.76 Aligned_cols=253 Identities=11% Similarity=0.110 Sum_probs=146.1 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHH-CCEEEEEECCCC---------- Q ss_conf 999999999861998999962776300121014774168850557201344557442-020356404762---------- Q gi|254780402|r 71 ALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYW-KPDCMILSESDI---------- 139 (440) Q Consensus 71 ~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~-~P~~~i~~e~El---------- 139 (440) ....++++|+ .+.++.+.+..|. + ..+.++-.... .++. .+.++.. +-|++|+-.+.+ T Consensus 17 ~l~a~l~~l~-~~~~i~vls~~p~--~-t~~~~~v~av~---r~~~----~~~~~al~~~d~~I~GGG~llqD~ts~~s~ 85 (298) T TIGR03609 17 LLAALLRELP-PGVEPTVLSNDPA--E-TAKLYGVEAVN---RRSL----LAVLRALRRADVVIWGGGSLLQDVTSFRSL 85 (298) T ss_pred HHHHHHHHCC-CCCCEEEECCCHH--H-HHHHCCCEEEC---CCCH----HHHHHHHHHCCEEEECCCCCCCCCCCCCCH T ss_conf 9999998558-9981899529869--9-99863946675---5699----999999987799998585414588754347 Q ss_pred --HHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCC Q ss_conf --146788874224550586412665421210011112331026442366415778889986088505872043101234 Q gi|254780402|r 140 --WPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTES 217 (440) Q Consensus 140 --WPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~ 217 (440) |-.++..++..|.|+++..--+.+- ++ .+-+.+.+.+++..+.|.++|+.|.+.++++|+ ++.+++|+=|.... T Consensus 86 ~yy~~~~~la~~~gkpv~~~gqgiGP~--~~-~~~r~l~r~~l~~~~~i~vRD~~S~~~l~~lGv-~~~l~~D~af~l~~ 161 (298) T TIGR03609 86 LYYLGLMRLARLFGKPVILWGQGIGPL--RR-RLSRWLVRRVLRGCRAISVRDAASYRLLKRLGI-PAELAADPVWLLPP 161 (298) T ss_pred HHHHHHHHHHHHCCCCEEEEEECCCCC--CC-HHHHHHHHHHHHHCCEEEECCHHHHHHHHHCCC-CEEEECCEEECCCC T ss_conf 899999999998299889994268876--78-789999999984199999778888999997499-84896771320577 Q ss_pred CCCCHHHHHHHHHHCCCCCEEECCCCCCC-CCCC---CCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHH Q ss_conf 42212568999986179861750465533-2100---0000000012341248897055211478887765300101100 Q gi|254780402|r 218 LPCDKELLSLYQESIAGRYTWAAISTFEG-EEDK---AVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARR 293 (440) Q Consensus 218 ~~~~~~~~~~~~~~~~~r~v~vagSth~~-EE~~---il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~ 293 (440) .+..... ....+.+-+....|++ +++. +..+...+-+..+..++++|.|.+.=.++.+.+.+... T Consensus 162 ~~~~~~~------~~~~~~i~v~~r~~~~~~~~~~~~~~~~l~~l~~~~g~~V~~lp~~~~~D~~~~~~l~~~~~----- 230 (298) T TIGR03609 162 EPWPGGE------PLPEPVIVVSLRPWPLLDVSRLLRLLRALDRLQRDTGAFVLFLPFQQPQDLPLARALRDQLL----- 230 (298) T ss_pred CCCCCCC------CCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHHHHHHCC----- T ss_conf 5544444------56799899997888878999999999999999983598699996887854999999997578----- Q ss_pred CCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHH Q ss_conf 134444466416750676520255422082066135544454583768961994999898443799999 Q gi|254780402|r 294 SRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYR 362 (440) Q Consensus 294 S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~ 362 (440) ....+....+..|+...++-+|+ +||-- -|..+-|+..|+|++.=.|-.+.....+ T Consensus 231 -------~~~~i~~~~~~~e~~~~i~~~~~-vI~~R-----lH~~I~A~~~gvP~i~isY~~Kv~~f~~ 286 (298) T TIGR03609 231 -------GPAEVLSPLDPEELLGLFASARL-VIGMR-----LHALILAAAAGVPFVALSYDPKVRAFAA 286 (298) T ss_pred -------CCCEEECCCCHHHHHHHHHHCCE-EEEEC-----CHHHHHHHHCCCCEEEECCHHHHHHHHH T ss_conf -------86365378999999999960998-99807-----0899999977999899530799999999 No 65 >PRK10017 putative pyruvyl transferase; Provisional Probab=98.74 E-value=8.8e-06 Score=57.64 Aligned_cols=249 Identities=13% Similarity=0.089 Sum_probs=140.0 Q ss_pred EEEEEECCCC------HHHHHH--HHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHC Q ss_conf 0356404762------146788--87422455058641266542121001111233102644236641577888998608 Q gi|254780402|r 130 DCMILSESDI------WPLTVF--ELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELG 201 (440) Q Consensus 130 ~~~i~~e~El------WPnli~--~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG 201 (440) |++|-..+.+ ||.+.. .+.-.+-|+++..--+.+ +++ .+.+.+.+.+++++|.|.++++.+.+.+.++| T Consensus 119 D~vIs~GGs~~~D~yg~~~~~~~L~a~l~kKpv~~~aQgIGP--~~~-~~~~~l~~~vl~~~d~ItvRD~~S~~~L~~lG 195 (426) T PRK10017 119 DAIIQVGGSFFVDLYGVPQFEHALCTFMAKKPLFMIGHSVGP--FQD-EQFNQLANYVFGHCDALILRESVSLDLMKRSN 195 (426) T ss_pred CEEEEECCCCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCC--CCC-HHHHHHHHHHHHCCCEEEECCHHHHHHHHHCC T ss_conf 789971776201476852168999999739968999044688--087-88999999998419789976587899999859 Q ss_pred CC--CEEEECCCCCCCCCCCCCHHHHHH---HHHHCCCCCEEE-CCCCCC--------CC---CCCCCCCCC-CCCCCCC Q ss_conf 85--058720431012344221256899---998617986175-046553--------32---100000000-0012341 Q gi|254780402|r 202 AQ--KLIVSGNLKIDTESLPCDKELLSL---YQESIAGRYTWA-AISTFE--------GE---EDKAVYVHN-FIKCRTD 263 (440) Q Consensus 202 ~~--~i~v~GnlK~d~~~~~~~~~~~~~---~~~~~~~r~v~v-agSth~--------~E---E~~il~a~~-~l~~~~~ 263 (440) ++ +|.++-|.=|.......+...... +....++++++. ..+-|. .+ ++.+.++.. ++.+ . T Consensus 196 v~~~~i~~taDpAF~l~~~~~~~~~~~~~~~~l~~~~~~~~VgisVr~~~~~~~~~~~~~~~y~~a~a~~~d~l~~~--G 273 (426) T PRK10017 196 ITTAKVEHGVDTAWLVDHHTEDFTASYAVQHWLDVAAQQKTVAITLRELAPFDKRLGTTQQAYEKAFAGVVNRILDE--G 273 (426) T ss_pred CCCCCEEEECCCCEECCCCCCCCCCCHHHHHHCCCCCCCCEEEEEEECCCCCCCCCCCCHHHHHHHHHHHHHHHHHC--C T ss_conf 99786289458211025654332212355641365666877999970366301124410899999999999999977--9 Q ss_pred EEEEEEECCC-------CHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEE-EECC--CCCCHHHHHHCCCEEEECCCCCC Q ss_conf 2488970552-------114788877653001011001344444664167-5067--65202554220820661355444 Q gi|254780402|r 264 VLTIIVPRHP-------RRCDAIERRLIAKGLKVARRSRGDVINAEVDIF-LGDT--IGEMGFYLRMTEIAFIGRSFCAS 333 (440) Q Consensus 264 ~~lIIvPRhp-------eR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~-i~Dt--~GeL~~lY~~AdiafVGGSl~~~ 333 (440) ...+..|.|. +... +...+.+ ... ....+. +-|. .-|+..+++-+|+ ||| . T Consensus 274 ~~Vv~lp~~~~i~~~~~dD~~-~~~~i~~---------~m~---~~~~~~il~~~~~~~E~~~ii~~~dl-~IG-----~ 334 (426) T PRK10017 274 YQVIALSTCTGIDSYNKDDRM-VALNLRQ---------HIS---DPARYHVVMDELNDLEMGKILGACEL-TVG-----T 334 (426) T ss_pred CEEEEECCCCCCCCCCCHHHH-HHHHHHH---------HHC---CCCCEEEECCCCCHHHHHHHHHHCHH-HHH-----H T ss_conf 879996056687777802599-9999999---------726---87636983899998999999973922-998-----8 Q ss_pred CCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEEC--CHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 54583768961994999898443799999999789869957--9899999999995898999999999999 Q gi|254780402|r 334 GGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVE--EVGTLADMVYSLLSEPTIRYEMINAAIN 402 (440) Q Consensus 334 GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~--d~~eL~~~l~~ll~d~~~~~~~~~~a~~ 402 (440) --|..+=++..|+|++-=+|-.+...+.+.+--.+-+.-++ +.++|...+...+.+-+...+.-.+|-+ T Consensus 335 RLHslIfA~~~gvP~i~IsYd~K~~g~~~~lGl~~~~~di~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 405 (426) T PRK10017 335 RLHSAIISMNFATPAIAINYEHKSAGIMQQLGLPEMAIDIRHLLDGSLQAMVADTLGQLPALNARLAEAVS 405 (426) T ss_pred HHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCCHHEECHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999975999698402287899999759930030377669278999999999769999999999999 No 66 >TIGR02472 sucr_P_syn_N sucrose-phosphate synthase, putative, glycosyltransferase domain; InterPro: IPR012822 This family consists of the N-terminal regions, or in some cases the entirety, of bacterial proteins closely related to plant sucrose-phosphate synthases (SPS). The C-terminal domain (IPR012821 from INTERPRO), found with most members of this family, resembles both bona fide plant sucrose-phosphate phosphatases (SPP) and the SPP-like domain of plant sucrose phosphate synthase (SPS). At least two members of this family lack the SPP-like domain, which may have binding or regulatory rather than enzymatic activity by analogy to plant SPS. This enzyme produces sucrose 6-phosphate and UDP from UDP-glucose and D-fructose 6-phosphate, and may be encoded near the gene for fructokinase.. Probab=98.71 E-value=7.8e-07 Score=64.57 Aligned_cols=316 Identities=18% Similarity=0.180 Sum_probs=191.2 Q ss_pred HHHHHHHHHHHHHHHCCC--CE-EEEE-----CCCCCCHHHHCCCCCCE-EE-------EECCCCCCH-----HHHHHHH Q ss_conf 999999999999986199--89-9996-----27763001210147741-68-------850557201-----3445574 Q gi|254780402|r 67 GETMALIGLIPAIRSRHV--NV-LLTT-----MTATSAKVARKYLGQYA-IH-------QYAPLDIQP-----AVSRFLK 125 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~--~i-liT~-----~T~tg~~~~~~~~~~~~-~~-------~y~P~D~~~-----~~~~fl~ 125 (440) |=+.-+..|++.|-++-. +| |||= -=.+.|+.-.+...+.. ++ -||+=+..| .+.+.|. T Consensus 27 GQ~~YVlELar~La~~~ev~qVdlvTR~I~D~~vspDYA~~~E~i~p~ArIvRl~fGP~rYLRKE~LWPyLD~~~D~~l~ 106 (445) T TIGR02472 27 GQTKYVLELARALARRSEVEQVDLVTRLIKDAKVSPDYAQPIERIAPKARIVRLPFGPRRYLRKELLWPYLDELADQLLS 106 (445) T ss_pred CHHHHHHHHHHHHHCCCCCCEEEEEEEEEECCCCCCHHHHHHHHCCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHHH T ss_conf 33778888899861589751476775034317768023347764089955997457888557401566007899999999 Q ss_pred HHCCEEEEEECCCCHHHHH------------HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHH----------------- Q ss_conf 4202035640476214678------------88742245505864126654212100111123----------------- Q gi|254780402|r 126 YWKPDCMILSESDIWPLTV------------FELSKQRIPQVLVNARMSRRSFKNWKTVLSFS----------------- 176 (440) Q Consensus 126 ~~~P~~~i~~e~ElWPnli------------~~~~~~~ip~~linarls~~S~~~~~~~~~l~----------------- 176 (440) |++ +.|=-|.+| .-|...+||.+.-. .|..|-|+-+-+. T Consensus 107 ylr-------~~g~lPdlIH~HYADAGYVG~~ls~~L~vPlvfTG-----HSLGR~Kr~RLLa~G~~skaPkP~~~IE~~ 174 (445) T TIGR02472 107 YLR-------QQGRLPDLIHAHYADAGYVGARLSRLLGVPLVFTG-----HSLGREKRRRLLAAGLKSKAPKPPQEIEKQ 174 (445) T ss_pred HHH-------HCCCCCCCCHHHHCCHHHHHHHHHHHCCCCEEEEC-----CCCCHHHHHHHHHCCCCCCCCCCHHHHHHH T ss_conf 997-------35888873101010155899999862589848837-----535778999998434002678877899986 Q ss_pred ----H------HCCCCCCEEEEC-CHHHHHHHHHHC---CCCEEEE--C-C-CCCCCCC-CCCCHHHHHHHHHHC--CCC Q ss_conf ----3------102644236641-577888998608---8505872--0-4-3101234-422125689999861--798 Q gi|254780402|r 177 ----K------KIFSQFSLVIVQ-SERYFRRYKELG---AQKLIVS--G-N-LKIDTES-LPCDKELLSLYQESI--AGR 235 (440) Q Consensus 177 ----~------~~~~~~~~i~~q-s~~~~~rl~~lG---~~~i~v~--G-n-lK~d~~~-~~~~~~~~~~~~~~~--~~r 235 (440) + ..|...+.|+|- .||..++|...+ .++..|+ | | .||-... .+.+......+.-++ ..+ T Consensus 175 f~is~RI~AEE~tL~~AslvitST~QEi~~QY~~Y~~y~P~r~~VIPPGvD~~rFyp~~~~~~~~~i~~~L~rFL~~p~K 254 (445) T TIGR02472 175 FNISRRIEAEEETLAHASLVITSTHQEIEEQYALYDSYDPERMEVIPPGVDLSRFYPPQSSEEDSEIDKLLARFLKDPEK 254 (445) T ss_pred HCCCCCHHHHHHHHHHCCEEEEECCCEEHHHHCCCCCCCCCCCEECCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCC T ss_conf 12264147889998514745861451032121014786702113517888875434788888875888752231147887 Q ss_pred CEEECCCC--CCCCCCCCCCCC---CCCCCCCCEEEEEEECC----CCH-----HHHHHHHHHHHCCHHHHHCCCCCCCC Q ss_conf 61750465--533210000000---00012341248897055----211-----47888776530010110013444446 Q gi|254780402|r 236 YTWAAIST--FEGEEDKAVYVH---NFIKCRTDVLTIIVPRH----PRR-----CDAIERRLIAKGLKVARRSRGDVINA 301 (440) Q Consensus 236 ~v~vagSt--h~~EE~~il~a~---~~l~~~~~~~lIIvPRh----peR-----~~~i~~~l~~~gl~~~~~S~~~~~~~ 301 (440) |+++|-|- .....+.+++|+ ..|++..|++||+=-|. .|. +.+|..++...+| |-.--.|.. T Consensus 255 P~ilaisRpd~RKNi~~Lv~aYG~~p~L~~~aNLVlvlG~RdD~r~me~~qR~Vl~~vl~~iD~YDL----YGkvAyPK~ 330 (445) T TIGR02472 255 PPILAISRPDRRKNIPALVEAYGRSPKLQEMANLVLVLGSRDDIRKMESSQREVLTEVLLLIDRYDL----YGKVAYPKH 330 (445) T ss_pred CEEEEECCCCCCCCHHHHHHHHCCCHHHHHHCCEEEEECCCCCHHHCHHHHHHHHHHHHHHHCCCCC----CCCCCCCCC T ss_conf 8388722788766745556200788667652080887527788531215789999999876300024----564026888 Q ss_pred CCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEE----CCCCCCHHHHHHHHHHCCCEEEECCHH Q ss_conf 6416750676520255422082066135544454583768961994999----898443799999999789869957989 Q gi|254780402|r 302 EVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILS----GPNVENFRDIYRRMVSSGAVRIVEEVG 377 (440) Q Consensus 302 ~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~----GP~~~nf~e~~~~L~~~g~~~~v~d~~ 377 (440) .=.|.+=++-.|=+...=+||=.-|++--|-+.|||||||.|+|- || .||... +++|.++-+-|.+ T Consensus 331 ----H~~~dvP~lYRLAA~~rGiFVNPALTEPFGLTLlEAAAcGLPivAT~DGGP-----~dI~~~-C~NGLLvd~ld~e 400 (445) T TIGR02472 331 ----HRPDDVPELYRLAARRRGIFVNPALTEPFGLTLLEAAACGLPIVATEDGGP-----RDIIAN-CRNGLLVDVLDLE 400 (445) T ss_pred ----CCCCCCHHHHHHHHHCCCEEECCCCCCCCHHHHHHHHHCCCCEEECCCCCC-----HHHHHH-CCCCCEECCCCHH T ss_conf ----881123267899986598676272125301689999976997210786486-----688842-8887500578989 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999995898999999999999997045 Q gi|254780402|r 378 TLADMVYSLLSEPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 378 eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ 408 (440) ++..++.+.++|+++-+.-+.||.+=|+..= T Consensus 401 ~i~~AL~~alsd~~QW~~Ws~NGi~gV~~HY 431 (445) T TIGR02472 401 AIASALEQALSDSSQWQTWSDNGIEGVRRHY 431 (445) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHCCCC T ss_conf 9999999733890667899985577640155 No 67 >TIGR03590 PseG pseudaminic acid biosynthesis-associated protein PseG. This protein is found in association with enzymes involved in the biosynthesis of pseudaminic acid, a component of polysaccharide in certain Pseudomonas strains as well as a modification of flagellin in Campylobacter and Hellicobacter. The role of this protein is unclear, although it may participate in N-acetylation in conjunction with, or in the absence of PseH (TIGR03585) as it often scores above the trusted cutoff to pfam00583 representing a family of acetyltransferases. Probab=98.65 E-value=6.4e-06 Score=58.54 Aligned_cols=259 Identities=13% Similarity=0.037 Sum_probs=141.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEECCCCCC--HHHHCCCCCCEEE--EECCCCCCHHHHHHHHHHCCEEEEEECCCCH Q ss_conf 89999999999999986199899996277630--0121014774168--8505572013445574420203564047621 Q gi|254780402|r 65 SVGETMALIGLIPAIRSRHVNVLLTTMTATSA--KVARKYLGQYAIH--QYAPLDIQPAVSRFLKYWKPDCMILSESDIW 140 (440) Q Consensus 65 SvGE~~~a~pli~~l~~~~~~iliT~~T~tg~--~~~~~~~~~~~~~--~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElW 140 (440) ..|-+.-...|+++|.+++.++..-+.+..+. +......-+.... .....|......+.++..+|+.+|+-.+++= T Consensus 14 G~GH~~RclaLA~~l~~~g~~v~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~vIiD~y~~~ 93 (280) T TIGR03590 14 GLGHVMRCLTLARELRKRGAEVAFACKTLPGDLIDLILSAGFPVYVLPDTSSWQDDALELINLLEKEKFDILIVDHYALD 93 (280) T ss_pred EHHHHHHHHHHHHHHHHCCCEEEEEEECCCHHHHHHHHHCCCCEEECCCCCCCCCCHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 32089999999999998899499999279588999999759817981675652012999999997379799999259999 Q ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCH-HHHHHHHHHCCCC-EEEECCCCCCCCCC Q ss_conf 467888742245505864126654212100111123310264423664157-7888998608850-58720431012344 Q gi|254780402|r 141 PLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSE-RYFRRYKELGAQK-LIVSGNLKIDTESL 218 (440) Q Consensus 141 Pnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~-~~~~rl~~lG~~~-i~v~GnlK~d~~~~ 218 (440) +.+...+++.+.+++.++-.- ++. .. .|.++-++. .+...+..+-..+ -...| .+|..... T Consensus 94 ~~~~~~lk~~~~~~i~iDD~~-~~~---------~~------~d~vin~~~~~~~~~y~~~~~~~~~~l~G-~~Y~~lr~ 156 (280) T TIGR03590 94 ADWEKLIKHFGRKLLVIDDLA-DRP---------HD------CDLLLDQNPGADALDYQGLVPANCRLLLG-PSYALLRE 156 (280) T ss_pred HHHHHHHHHHCCEEEEEECCC-CCC---------CC------CCEEECCCCCCCHHHHCCCCCCCCEEEEC-CCCCCCCH T ss_conf 799999998398399993676-546---------56------14254145444756636448867669865-75343578 Q ss_pred CCCHHHHHHHHHHCCCCCE-EECCCCCCCCC-CCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCC Q ss_conf 2212568999986179861-75046553321-000000000012341248897055211478887765300101100134 Q gi|254780402|r 219 PCDKELLSLYQESIAGRYT-WAAISTFEGEE-DKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRG 296 (440) Q Consensus 219 ~~~~~~~~~~~~~~~~r~v-~vagSth~~EE-~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~ 296 (440) .... ...........+.| +..|.|-+... .-++++........+..+|+-|.++. .+++.+.+... T Consensus 157 ~F~~-~~~~~~~~~~~~~Ili~~GGsD~~~lt~~il~~l~~~~~~~~i~vvig~~~~~-~~~i~~~~~~~---------- 224 (280) T TIGR03590 157 EFYQ-LRTANKRRKPLRRVLVSFGGADPDNLTLKLLSALAESQINISITLVTGSSNPN-LDELKKFAKEY---------- 224 (280) T ss_pred HHHH-HHHHHHCCCCCCEEEEEECCCCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCC-HHHHHHHHHHC---------- T ss_conf 8876-30322103655328999778770008999999998516685679998679876-69999999728---------- Q ss_pred CCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHH Q ss_conf 44446641675067652025542208206613554445458376896199499989844379999999 Q gi|254780402|r 297 DVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRM 364 (440) Q Consensus 297 ~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L 364 (440) . .+-+......|..+++-||+|+.+|| .+.+|.+++|+|+|.-|-.+|....++.| T Consensus 225 ----~--~~~~~~~~~~m~~~m~~aDlaI~agG------~t~~El~~~gvP~i~i~~adNQ~~~a~~l 280 (280) T TIGR03590 225 ----P--NIILFIDVENMAELMNEADLAIGAAG------STSWERCCLGLPSLSIVLAENQQSNSQQL 280 (280) T ss_pred ----C--CEEEECCHHHHHHHHHHCCEEEECCC------HHHHHHHHHCCCEEEEEECCCHHHHHHHC T ss_conf ----9--96996598899999997799998596------58999999499989999702569998509 No 68 >COG1817 Uncharacterized protein conserved in archaea [Function unknown] Probab=98.50 E-value=1.9e-05 Score=55.43 Aligned_cols=314 Identities=15% Similarity=0.173 Sum_probs=180.3 Q ss_pred EEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCC-CCE---EEEECCCC--------CCHHHHHHHH Q ss_conf 79998178999999999999998619989999627763001210147-741---68850557--------2013445574 Q gi|254780402|r 58 LIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLG-QYA---IHQYAPLD--------IQPAVSRFLK 125 (440) Q Consensus 58 ~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~-~~~---~~~y~P~D--------~~~~~~~fl~ 125 (440) .|||-..---++--.+++|.++.+++.+++|||----+....-+.++ +.. .|...++- .-....++.. T Consensus 2 kVwiDI~n~~hvhfFk~lI~elekkG~ev~iT~rd~~~v~~LLd~ygf~~~~Igk~g~~tl~~Kl~~~~eR~~~L~ki~~ 81 (346) T COG1817 2 KVWIDIGNPPHVHFFKNLIWELEKKGHEVLITCRDFGVVTELLDLYGFPYKSIGKHGGVTLKEKLLESAERVYKLSKIIA 81 (346) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHCCEEEEEEEEECCCHHHHHHHHCCCEEEECCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 37997589610238999999998578499999851275889999839970764045774478999999999999999875 Q ss_pred HHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCE Q ss_conf 42020356404762146788874224550586412665421210011112331026442366415778889986088505 Q gi|254780402|r 126 YWKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKL 205 (440) Q Consensus 126 ~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i 205 (440) .++|++.+-.-+ |-+=+.+.-.|+|.+.++-.- .+.- --+-++.-.+.|++....+.+-++..|.+.. T Consensus 82 ~~kpdv~i~~~s---~~l~rvafgLg~psIi~~D~e--hA~~-------qnkl~~Pla~~ii~P~~~~~~~~~~~G~~p~ 149 (346) T COG1817 82 EFKPDVAIGKHS---PELPRVAFGLGIPSIIFVDNE--HAEA-------QNKLTLPLADVIITPEAIDEEELLDFGADPN 149 (346) T ss_pred HCCCCEEEECCC---CCHHHHHHHCCCCEEEECCCH--HHHH-------HHHCCHHHHHHEECCCCCCHHHHHHHCCCCC T ss_conf 229855752278---105567765288638964875--4777-------8630002442150643443577887088955 Q ss_pred EEECC--CCCC-CCC-CCCCHHHHHHHHHHCCCCCEEE----CCCCCC-CCCC--CCCCCCCCCCCCCCEEEEEEECCCC Q ss_conf 87204--3101-234-4221256899998617986175----046553-3210--0000000001234124889705521 Q gi|254780402|r 206 IVSGN--LKID-TES-LPCDKELLSLYQESIAGRYTWA----AISTFE-GEED--KAVYVHNFIKCRTDVLTIIVPRHPR 274 (440) Q Consensus 206 ~v~Gn--lK~d-~~~-~~~~~~~~~~~~~~~~~r~v~v----agSth~-~EE~--~il~a~~~l~~~~~~~lIIvPRhpe 274 (440) ...|. +..- ... ...+.+..+.+.-..+...|++ .+|+|. |... ...+..+.+++.+ .|++||.-+ T Consensus 150 ~i~~~~giae~~~v~~f~pd~evlkeLgl~~~~~yIVmRpe~~~A~y~~g~~~~~~~~~li~~l~k~g---iV~ipr~~~ 226 (346) T COG1817 150 KISGYNGIAELANVYGFVPDPEVLKELGLEEGETYIVMRPEPWGAHYDNGDRGISVLPDLIKELKKYG---IVLIPREKE 226 (346) T ss_pred CEECCCCEEEEEECCCCCCCHHHHHHCCCCCCCCEEEEEECCCCCEEECCCCCHHHHHHHHHHHHHCC---EEEECCCHH T ss_conf 21135662677310267988789987588879866999643445423433222566888999887572---899557556 Q ss_pred HHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEE-ECCCCCCCCCCHHHHHHHCCCEEEC-C Q ss_conf 1478887765300101100134444466416750676520255422082066-1355444545837689619949998-9 Q gi|254780402|r 275 RCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFI-GRSFCASGGQNPLEAAMLGCAILSG-P 352 (440) Q Consensus 275 R~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafV-GGSl~~~GGhN~lEpaa~G~pVi~G-P 352 (440) .. +..+.... .-.|.+ .+|| |..+| +|++++= |||+ --|||.+|+|.|+= | T Consensus 227 ~~----eife~~~n-------~i~pk~-----~vD~---l~Lly-ya~lvig~ggTM-------arEaAlLGtpaIs~~p 279 (346) T COG1817 227 QA----EIFEGYRN-------IIIPKK-----AVDT---LSLLY-YATLVIGAGGTM-------AREAALLGTPAISCYP 279 (346) T ss_pred HH----HHHHHHCC-------CCCCCC-----CCCH---HHHHH-HHHEEECCCCHH-------HHHHHHHCCCEEEECC T ss_conf 89----98741011-------058855-----5227---87886-541564177037-------8888872883478538 Q ss_pred C-CCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 8-4437999999997898699579899999999995898999999999999997045569999999997523 Q gi|254780402|r 353 N-VENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYVN 423 (440) Q Consensus 353 ~-~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l~ 423 (440) . ..++. +.+++.|...--.|+.+..+...+++.++...+ ....+...-++-+++.++..+. T Consensus 280 Gkll~vd---k~lie~G~~~~s~~~~~~~~~a~~~l~~~~~kK-------~~~~k~e~~~~~ii~~ve~~~e 341 (346) T COG1817 280 GKLLAVD---KYLIEKGLLYHSTDEIAIVEYAVRNLKYRRLKK-------TGVLKLEDPTRLIIDVVEEMLE 341 (346) T ss_pred CCCCCCC---HHHHHCCCEEECCCHHHHHHHHHHHHHCHHHCC-------CCCCCCCCHHHHHHHHHHHHHH T ss_conf 8533223---898866843431788899999999842500023-------3442200379999999999864 No 69 >KOG1111 consensus Probab=98.34 E-value=1.3e-05 Score=56.49 Aligned_cols=251 Identities=17% Similarity=0.133 Sum_probs=148.1 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCC--------------CCCHHHHHHHHHHCCEEEEEECC Q ss_conf 9999999986199899996277630012101477416885055--------------72013445574420203564047 Q gi|254780402|r 72 LIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPL--------------DIQPAVSRFLKYWKPDCMILSES 137 (440) Q Consensus 72 a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~--------------D~~~~~~~fl~~~~P~~~i~~e~ 137 (440) ..-|-+.|.+.+..|++.|-.- |-..--....+..-+.|+|+ |+|..-.-+++. +.-|+.+- T Consensus 21 iy~lSq~li~lghkVvvithay-g~r~girylt~glkVyylp~~v~~n~tT~ptv~~~~Pllr~i~lrE---~I~ivhgh 96 (426) T KOG1111 21 IYALSQCLIRLGHKVVVITHAY-GNRVGIRYLTNGLKVYYLPAVVGYNQTTFPTVFSDFPLLRPILLRE---RIEIVHGH 96 (426) T ss_pred HHHHHHHHHHCCCEEEEEECCC-CCCCCEEEECCCCEEEEEEEEEEECCCCCHHHHCCCHHHHHHHHHH---CEEEEECC T ss_conf 8887511665187699984323-6712505654772699985376554641202104670122676541---16899657 Q ss_pred CCH----HHHHHHHHHCCCCEEEEEECCCCCCCCCCH-----HHHHHHHHCCCCCCEEEECCHHHHHHHH---HHCCCCE Q ss_conf 621----467888742245505864126654212100-----1111233102644236641577888998---6088505 Q gi|254780402|r 138 DIW----PLTVFELSKQRIPQVLVNARMSRRSFKNWK-----TVLSFSKKIFSQFSLVIVQSERYFRRYK---ELGAQKL 205 (440) Q Consensus 138 ElW----Pnli~~~~~~~ip~~linarls~~S~~~~~-----~~~~l~~~~~~~~~~i~~qs~~~~~rl~---~lG~~~i 205 (440) --. -+.+.-++.+|.+++.-+ .|+-... ++..+.+-.+...|+++|.|..++++-. .|..+++ T Consensus 97 s~fS~lahe~l~hartMGlktVfTd-----HSlfGfad~~si~~n~ll~~sL~~id~~IcVshtskentvlr~~L~p~kv 171 (426) T KOG1111 97 SPFSYLAHEALMHARTMGLKTVFTD-----HSLFGFADIGSILTNKLLPLSLANIDRIICVSHTSKENTVLRGALAPAKV 171 (426) T ss_pred CHHHHHHHHHHHHHHHCCCEEEEEC-----CCCCCCCCHHHHHHCCEEEEEECCCCCEEEEEECCCCCEEEEECCCHHHE T ss_conf 7088899999998874582599852-----43114233124421112136723788079986137886179721477675 Q ss_pred EEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCC--CCCCCCCCCCCC-CCCCCCCCEEEEEEECCCCHHHHHHHH Q ss_conf 87204310123442212568999986179861750465--533210000000-000123412488970552114788877 Q gi|254780402|r 206 IVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAIST--FEGEEDKAVYVH-NFIKCRTDVLTIIVPRHPRRCDAIERR 282 (440) Q Consensus 206 ~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSt--h~~EE~~il~a~-~~l~~~~~~~lIIvPRhpeR~~~i~~~ 282 (440) .+..|---.....|.+.. ..-++-.+++++|- +..-.+.++++. +...++|+.+.||+..-|.|. .+++. T Consensus 172 svIPnAv~~~~f~P~~~~------~~S~~i~~ivv~sRLvyrKGiDll~~iIp~vc~~~p~vrfii~GDGPk~i-~lee~ 244 (426) T KOG1111 172 SVIPNAVVTHTFTPDAAD------KPSADIITIVVASRLVYRKGIDLLLEIIPSVCDKHPEVRFIIIGDGPKRI-DLEEM 244 (426) T ss_pred EECCCEEECCCCCCCCCC------CCCCCEEEEEEEEEEEECCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCC-HHHHH T ss_conf 533523540334658434------68887069999741111242678999999997359873699956886502-19999 Q ss_pred HHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEE Q ss_conf 65300101100134444466416750676520255422082066135544454583768961994999 Q gi|254780402|r 283 LIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILS 350 (440) Q Consensus 283 l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~ 350 (440) +++..+.-...-.+..+. .++...|.-+| +|+--|+++.-|-.++|||+||.+|+. T Consensus 245 lEk~~l~~rV~~lG~v~h-----------~~Vr~vl~~G~-IFlntSlTEafc~~ivEAaScGL~VVs 300 (426) T KOG1111 245 LEKLFLQDRVVMLGTVPH-----------DRVRDVLVRGD-IFLNTSLTEAFCMVIVEAASCGLPVVS 300 (426) T ss_pred HHHHHCCCCEEEECCCCH-----------HHHHHHHHCCC-EEECCHHHHHHHHHHHHHHHCCCEEEE T ss_conf 998500480588614661-----------78888876385-796207888889999998707977997 No 70 >PRK00654 glgA glycogen synthase; Provisional Probab=98.32 E-value=4.4e-05 Score=53.09 Aligned_cols=256 Identities=17% Similarity=0.161 Sum_probs=133.0 Q ss_pred HHCCEEEEEEC--CCCHHHHHHHHHHCCCCEEEE--EE----CCCCCCCC---------------CCHHHHHHHHHCCCC Q ss_conf 42020356404--762146788874224550586--41----26654212---------------100111123310264 Q gi|254780402|r 126 YWKPDCMILSE--SDIWPLTVFELSKQRIPQVLV--NA----RMSRRSFK---------------NWKTVLSFSKKIFSQ 182 (440) Q Consensus 126 ~~~P~~~i~~e--~ElWPnli~~~~~~~ip~~li--na----rls~~S~~---------------~~~~~~~l~~~~~~~ 182 (440) .|+||++-.=+ |-+=|.+++......+|+++- |. +....++. .|..+ .+.+.-... T Consensus 129 ~~~pDIIH~HDW~t~lvp~~lk~~~~~~~~~V~TIHnl~yQG~~~~~~~~~~gl~~~~~~~~~~e~~~~~-n~lk~gi~~ 207 (476) T PRK00654 129 GWRPDIVHAHDWHTGLVPALLKERGRPPIKSVFTIHNLAYQGLFPAEHLGALGLPDERFHLEGLEFYGQI-SFLKAGLYY 207 (476) T ss_pred CCCCCEEEECCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCHHHHHHCCCCHHHHCCCHHHHCCCH-HHHHHHHHH T ss_conf 7888867854647789999999743789977999734201676688789665995676243000104502-788898875 Q ss_pred CCEEEECCHHHHHHHHH--HC--------C--CCEEEECC-C-------CCCCCCC---CC-----CHHHHHHHHHHCC- Q ss_conf 42366415778889986--08--------8--50587204-3-------1012344---22-----1256899998617- Q gi|254780402|r 183 FSLVIVQSERYFRRYKE--LG--------A--QKLIVSGN-L-------KIDTESL---PC-----DKELLSLYQESIA- 233 (440) Q Consensus 183 ~~~i~~qs~~~~~rl~~--lG--------~--~~i~v~Gn-l-------K~d~~~~---~~-----~~~~~~~~~~~~~- 233 (440) -|.|.+.|+..++.+.. .| . .++.++-| + ..|...+ .. .......+++.++ T Consensus 208 aD~v~tVS~~Ya~Ei~~~~~g~gL~~~l~~~~~kl~gI~NGiD~~~wnP~tD~~l~~~y~~~~~~~K~~~k~~l~~~~gl 287 (476) T PRK00654 208 ADRVTTVSPTYAREITTPEFGYGLEGLLRARQGKLSGILNGIDYDIWNPETDPYLPANYSADDLEGKAENKRALQERFGL 287 (476) T ss_pred CCEEEECCHHHHHHHCCCCCCCCHHHHHHHCCCCEEEECCCCCHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 78748737888876616334666678876356666684488771003998786511666810033169899999999498 Q ss_pred ----CCCEEECCCC---CCCCCCCCCCCCC-CCCCCCCEEEEEEECCCCHHH-HHHHHHHHHCCHHHHHCCCCCCCCCCE Q ss_conf ----9861750465---5332100000000-001234124889705521147-888776530010110013444446641 Q gi|254780402|r 234 ----GRYTWAAIST---FEGEEDKAVYVHN-FIKCRTDVLTIIVPRHPRRCD-AIERRLIAKGLKVARRSRGDVINAEVD 304 (440) Q Consensus 234 ----~r~v~vagSt---h~~EE~~il~a~~-~l~~~~~~~lIIvPRhpeR~~-~i~~~l~~~gl~~~~~S~~~~~~~~~~ 304 (440) +.|++.+-|. .+| -+.++++.. .++. +.++||+..-..+.+ .+.++..+. + .+.. T Consensus 288 ~~~~~~pl~~~vgRl~~qKG-~~ll~~a~~~~~~~--~~~~vi~G~G~~~~~~~l~~l~~~~------------~-~~~~ 351 (476) T PRK00654 288 PVDDDAPLFAMVSRLTEQKG-LDLVLEALDEILEQ--GGQLVLLGTGDPELEEAFRALAARY------------P-GRVG 351 (476) T ss_pred CCCCCCCEEEEEEECCCCCC-HHHHHHHHHHHHHC--CCEEEEEECCCHHHHHHHHHHHHHC------------C-CCEE T ss_conf 97899748999851645678-89999999999970--9989999459789999999999877------------9-8889 Q ss_pred EEEECCCCCCH-HHHHHCCCEEEECCCCCCCCCCHHHHHHHCC-CEEECCCCCCHHHHHHHH---HHCCCEEEE--CCHH Q ss_conf 67506765202-5542208206613554445458376896199-499989844379999999---978986995--7989 Q gi|254780402|r 305 IFLGDTIGEMG-FYLRMTEIAFIGRSFCASGGQNPLEAAMLGC-AILSGPNVENFRDIYRRM---VSSGAVRIV--EEVG 377 (440) Q Consensus 305 V~i~Dt~GeL~-~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~-pVi~GP~~~nf~e~~~~L---~~~g~~~~v--~d~~ 377 (440) +.+. =-.+|. .+|+.||+ |+-.|+.+--|-+.+||+++|+ ||+.. +...+|.+.+. .+.+.++.+ .|++ T Consensus 352 ~~~g-f~e~l~~~iya~aD~-~lmPS~~EP~Gl~qleAm~~Gt~Pvv~~--tGGL~dtV~d~~~~~~~~tGf~f~~~~~~ 427 (476) T PRK00654 352 VQIG-YDEALAHRIYAGADF-FLMPSRFEPCGLTQLYALRYGTLPIVRR--TGGLADTVIDYDPEDGGATGFVFDDFNAE 427 (476) T ss_pred EEEC-CCHHHHHHHHHHCCE-EECCCCCCCCCHHHHHHHHCCCCCEEEC--CCCCCCCCCCCCCCCCCCCEEEECCCCHH T ss_conf 9957-886898999872887-8645611367768999987699858817--99975514566667787634873799999 Q ss_pred HHHHHHH---HHHCCHHHHHHHHHHHHH Q ss_conf 9999999---995898999999999999 Q gi|254780402|r 378 TLADMVY---SLLSEPTIRYEMINAAIN 402 (440) Q Consensus 378 eL~~~l~---~ll~d~~~~~~~~~~a~~ 402 (440) +|..++. .+++||+..++|.++|.+ T Consensus 428 ~l~~ai~~al~~~~~~~~~~~l~~~~m~ 455 (476) T PRK00654 428 DLLTALRRALELYRQPDLWRALQRQAMA 455 (476) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 9999999999885699999999999852 No 71 >pfam04101 Glyco_tran_28_C Glycosyltransferase family 28 C-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). Structural analysis suggests the C-terminal domain contains the UDP-GlcNAc binding site. Probab=98.30 E-value=1.8e-05 Score=55.62 Aligned_cols=103 Identities=20% Similarity=0.252 Sum_probs=76.9 Q ss_pred EEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCC---C-HHHHHHHHHHCCCEEEEC----C Q ss_conf 1675067652025542208206613554445458376896199499989844---3-799999999789869957----9 Q gi|254780402|r 304 DIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVE---N-FRDIYRRMVSSGAVRIVE----E 375 (440) Q Consensus 304 ~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~---n-f~e~~~~L~~~g~~~~v~----d 375 (440) ++.+.+=+-+|..+|+.||+++- ..|.-++.|.+++|+|.|+=|+-+ | ..+-++.|.+.|++..+. + T Consensus 56 ~~~~~~f~~~m~~~~~~adlvIs-----RaGa~Ti~E~~~~g~P~IliP~p~~~~~hQ~~NA~~l~~~gaa~~i~e~~~~ 130 (167) T pfam04101 56 NVEVFPFIDNMAEYIKAADLVIS-----RAGAGTIAELLALGKPAILVPRPKAAGEHQDNNALELVKAGAALVLLQKELT 130 (167) T ss_pred CEEEEEHHHHHHHHHHHCCHHHH-----CCCCHHHHHHHHHCCCEEEECCCCCCCCCHHHHHHHHHHCCCEEEEECCCCC T ss_conf 88997125559999996606886-----5762279999994899899707655656399999999987998996426799 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 89999999999589899999999999999704556999 Q gi|254780402|r 376 VGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKI 413 (440) Q Consensus 376 ~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~ 413 (440) +++|.+.+..+++|++.+++|++++++... ..|+++ T Consensus 131 ~~~L~~~i~~l~~~~~~l~~m~~~a~~~~~--~da~~~ 166 (167) T pfam04101 131 PEKLVEALLKLLLKPLRLYEMNKAAKGSRL--KDAIKR 166 (167) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHCCC--CCHHHC T ss_conf 999999999998699999999999984489--484541 No 72 >cd03816 GT1_ALG1_like This family is most closely related to the GT1 family of glycosyltransferases. The yeast gene ALG1 has been shown to function as a mannosyltransferase that catalyzes the formation of dolichol pyrophosphate (Dol-PP)-GlcNAc2Man from GDP-Man and Dol-PP-Glc-NAc2, and participates in the formation of the lipid-linked precursor oligosaccharide for N-glycosylation. In humans ALG1 has been associated with the congenital disorders of glycosylation (CDG) designated as subtype CDG-Ik. Probab=98.19 E-value=0.00015 Score=49.56 Aligned_cols=213 Identities=16% Similarity=0.113 Sum_probs=125.9 Q ss_pred HHHHHHHHCCC-CCCEEEECCHHHHHHHHHHCC--CCEEEECCCCCCCCCCCCCHHHHHHHHH----------------H Q ss_conf 11112331026-442366415778889986088--5058720431012344221256899998----------------6 Q gi|254780402|r 171 TVLSFSKKIFS-QFSLVIVQSERYFRRYKELGA--QKLIVSGNLKIDTESLPCDKELLSLYQE----------------S 231 (440) Q Consensus 171 ~~~~l~~~~~~-~~~~i~~qs~~~~~rl~~lG~--~~i~v~GnlK~d~~~~~~~~~~~~~~~~----------------~ 231 (440) ++.+++-..|. ..|.-+|.|++.++.+.+.+. .++.|.-|=-...-.+...++....+.+ . T Consensus 148 ~~~k~~E~~fgr~a~~~lcVT~AM~~~L~~~~~~~~~~~vlyDrP~~~F~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 227 (415) T cd03816 148 RLAKWYEKLFGRLADYNLCVTKAMKEDLQQFNNWKIRATVLYDRPPEQFRPLPLEEKHELFLKLAKTFLTRELRIGAVQL 227 (415) T ss_pred HHHHHHHHHHHCCCCEEECCCHHHHHHHHHHHCCCCCEEEEECCCHHHCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCC T ss_conf 99999998755036460540689999998602668887998228889759886789999997435542323346653113 Q ss_pred CCCCCEEECCCC-CCCCCCC--CCCCCCCC-------CCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCC Q ss_conf 179861750465-5332100--00000000-------1234124889705521147888776530010110013444446 Q gi|254780402|r 232 IAGRYTWAAIST-FEGEEDK--AVYVHNFI-------KCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINA 301 (440) Q Consensus 232 ~~~r~v~vagSt-h~~EE~~--il~a~~~l-------~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~ 301 (440) ..+|+.++..|| |..+|++ +++|.... ...|++.+||...-|.+. ...+.+++..++-.... . T Consensus 228 ~~~r~~llVSSTSWTpDEDF~iLl~AL~~Yd~~~~~~~~~p~ll~iITGKGP~K~-~y~~~I~~~~l~~V~i~---t--- 300 (415) T cd03816 228 SEERPALLVSSTSWTPDEDFGILLDALVAYEKSAATGPKLPKLLCIITGKGPLKE-KYLERIKELKLKKVTIR---T--- 300 (415) T ss_pred CCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCHHHH-HHHHHHHHCCCCCEEEE---E--- T ss_conf 5777338995167777655678999999997653214789987999968853089-99999986288821997---2--- Q ss_pred CCEEEEECCCCCCHHHHHHCCCEEEECCCCCC--CC---CCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCC-CEEEECC Q ss_conf 64167506765202554220820661355444--54---583768961994999898443799999999789-8699579 Q gi|254780402|r 302 EVDIFLGDTIGEMGFYLRMTEIAFIGRSFCAS--GG---QNPLEAAMLGCAILSGPNVENFRDIYRRMVSSG-AVRIVEE 375 (440) Q Consensus 302 ~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~--GG---hN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g-~~~~v~d 375 (440) ..=+..+...+.+.||+ |=|+... |- .-..+--.||.||+. .+|+.+ .+|++.| -+....| T Consensus 301 -----~wL~~eDYP~lL~~ADL---GVsLHtSSSGlDLPMKVVDMfG~GlPV~A----~~f~~i-~ELVk~~~NG~~F~~ 367 (415) T cd03816 301 -----PWLSAEDYPKLLASADL---GVSLHTSSSGLDLPMKVVDMFGCGLPVCA----LDFKCI-DELVKHGENGLVFGD 367 (415) T ss_pred -----CCCCHHHHHHHHHCCCC---CEEECCCCCCCCCCHHHHHHCCCCCCEEE----ECCCCH-HHHHCCCCCCCEECC T ss_conf -----57887889998741534---72421268766777021010268875798----337517-877226878766578 Q ss_pred HHHHHHHHHHHHCC---HHHHHHHHHHHHHH Q ss_conf 89999999999589---89999999999999 Q gi|254780402|r 376 VGTLADMVYSLLSE---PTIRYEMINAAINE 403 (440) Q Consensus 376 ~~eL~~~l~~ll~d---~~~~~~~~~~a~~~ 403 (440) ++||++.+..++.+ .+..+.+.+++.+. T Consensus 368 ~~eL~~~l~~l~~~~p~~~~l~~lk~~a~~~ 398 (415) T cd03816 368 SEELAEQLIDLLSNFPNRGKLNSLKKGAQEE 398 (415) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 9999999999985599668999999777774 No 73 >TIGR02468 sucrsPsyn_pln sucrose phosphate synthase; InterPro: IPR012819 Sucrose occupies a central position in the metabolic pathways of all plants and plays a variety of roles including transport sugar, storage reserve, compatible solute, and signal compound . This compound is produced by the combined action of two enzymes, sucrose-phosphate synthase (2.4.1.14 from EC) and sucrose-phosphate phosphatase (3.1.3.24 from EC), via the intermediate sucrose 6-phosphate. Several studies have shown a positive correlation between sucrose-phosphate synthase activity and plant growth rate and yield in agronomically important plants, though direct proof of a causal link is lacking. This entry represents sucrose-phosphate synthase from plants, which is known to exist in multigene families in several species of both monocots and dicots. The enzyme contains an N-terminal domain glucosyltransferase domain, a variable linker region, and a C-terminal domain similar to that of sucrose-phosphate phosphatase, the next and final enzyme of sucrose biosynthesis. The C-terminal domain is likely to serve a binding - not a catalytic - function, as sucrose-phosphate phosphatase is always encoded by a distinct protein. ; GO: 0046524 sucrose-phosphate synthase activity, 0005985 sucrose metabolic process. Probab=98.16 E-value=1.2e-05 Score=56.72 Aligned_cols=160 Identities=14% Similarity=0.142 Sum_probs=111.7 Q ss_pred CCCCEEECCCCCCCC--CCCCCCCC---CCCCCCCCEEEEEEECCC-CHHH--------HHHHHHHHHCCHHHHHCCCCC Q ss_conf 798617504655332--10000000---000123412488970552-1147--------888776530010110013444 Q gi|254780402|r 233 AGRYTWAAISTFEGE--EDKAVYVH---NFIKCRTDVLTIIVPRHP-RRCD--------AIERRLIAKGLKVARRSRGDV 298 (440) Q Consensus 233 ~~r~v~vagSth~~E--E~~il~a~---~~l~~~~~~~lIIvPRhp-eR~~--------~i~~~l~~~gl~~~~~S~~~~ 298 (440) +.||+++|-|--.++ ...+++|| +-|++.-|+-||+==|.- +... .|.+++.+++|--+. ...+ T Consensus 481 p~KPmILALaRPDPkKNiTTLvKAFGECRpLRELANLtLImGNRD~IDems~~~~sVL~svLkLID~YDLYGqV--AYPK 558 (1072) T TIGR02468 481 PRKPMILALARPDPKKNITTLVKAFGECRPLRELANLTLIMGNRDDIDEMSSGSASVLTSVLKLIDKYDLYGQV--AYPK 558 (1072) T ss_pred CCCCEEEECCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHCCCHHHHHHHHHHHHHHCCCCCC--CCCC T ss_conf 89877973268787301477887633786146776577441475223545155558999999886200356554--6776 Q ss_pred CCCCCEEEEECCCCCCHHHHHHC---CCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECC Q ss_conf 44664167506765202554220---820661355444545837689619949998984437999999997898699579 Q gi|254780402|r 299 INAEVDIFLGDTIGEMGFYLRMT---EIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEE 375 (440) Q Consensus 299 ~~~~~~V~i~Dt~GeL~~lY~~A---diafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d 375 (440) -...+| ....|++| .=|||=..|++--|-++||||++|.|+|--.|=.- -+| ...+++|.++--+| T Consensus 559 HHkqsD---------VP~IYRLAAktKGVFINPA~~EPFGLTLIEAAa~GLP~VATKNGGP-VDI-~~vL~NGLLvDPHd 627 (1072) T TIGR02468 559 HHKQSD---------VPDIYRLAAKTKGVFINPALIEPFGLTLIEAAAYGLPMVATKNGGP-VDI-HRVLDNGLLVDPHD 627 (1072) T ss_pred CCCCCC---------CCHHHHHHHCCCCEEECCCCCCCCCHHHHHHHHCCCCEEEECCCCC-HHH-HHHHHCCCCCCCCC T ss_conf 788888---------7248899742796575232104564368999863997798358681-338-87731787336776 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 899999999995898999999999999997 Q gi|254780402|r 376 VGTLADMVYSLLSEPTIRYEMINAAINEVK 405 (440) Q Consensus 376 ~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~ 405 (440) .+.++++|.+|++|+.+-.+-.+||.+=|- T Consensus 628 q~aIa~ALLkLvadK~LW~~CR~NGLkNIH 657 (1072) T TIGR02468 628 QEAIADALLKLVADKQLWAKCRQNGLKNIH 657 (1072) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 688999999986215778999736450245 No 74 >COG0438 RfaG Glycosyltransferase [Cell envelope biogenesis, outer membrane] Probab=98.10 E-value=0.00074 Score=45.02 Aligned_cols=270 Identities=20% Similarity=0.163 Sum_probs=136.1 Q ss_pred CEEEEEECCCCHH---HHHHHHHHCCCCEEEEEECCCC-------CCC-CCCH--HHHHHHHHCCCCCCEEEECCHHHHH Q ss_conf 2035640476214---6788874224550586412665-------421-2100--1111233102644236641577888 Q gi|254780402|r 129 PDCMILSESDIWP---LTVFELSKQRIPQVLVNARMSR-------RSF-KNWK--TVLSFSKKIFSQFSLVIVQSERYFR 195 (440) Q Consensus 129 P~~~i~~e~ElWP---nli~~~~~~~ip~~linarls~-------~S~-~~~~--~~~~l~~~~~~~~~~i~~qs~~~~~ 195 (440) ++..........| .........+.+.+...-.... .+. .... .+...........+.+..+++.... T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (381) T COG0438 84 YDIIHAHSLLLAPGGLLALLLLKLLGIPLVVTLHGLIPRILLLPRLLLLLGLLRLLLKRLKKALRLLADRVIAVSPALKE 163 (381) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEEECHHHHH T ss_conf 50898602112320267899988617867999703431000000001124578999999987642144599994588899 Q ss_pred HHHHHCCCC-EEEECCCCCCCCCCCCCHHHHHHHHHHCCC---CCEEECCCCCC-CCCCCCCCCCCCCCCC-CCEEEEEE Q ss_conf 998608850-587204310123442212568999986179---86175046553-3210000000000123-41248897 Q gi|254780402|r 196 RYKELGAQK-LIVSGNLKIDTESLPCDKELLSLYQESIAG---RYTWAAISTFE-GEEDKAVYVHNFIKCR-TDVLTIIV 269 (440) Q Consensus 196 rl~~lG~~~-i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~---r~v~vagSth~-~EE~~il~a~~~l~~~-~~~~lIIv 269 (440) .+...+... +.+..+. .+....... ....... ..+...|...+ ..-+.++++.+.+... ++..++++ T Consensus 164 ~~~~~~~~~~~~~~~~~-~~~~~~~~~------~~~~~~~~~~~~i~~~g~~~~~k~~~~~i~~~~~~~~~~~~~~~~~~ 236 (381) T COG0438 164 LLEALGVPNKIVVIPNG-IDTEKFAPA------RIGLLPEGGKFVVLYVGRLDPEKGLDLLIEAAAKLKKRGPDIKLVIV 236 (381) T ss_pred HHHHHCCCCCEEEECCC-CCHHHCCCC------CCCCCCCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHHCCCEEEEEE T ss_conf 99984677887997588-587874700------01445356987999964886547999999999985321586489999 Q ss_pred ECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCC--CCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCC Q ss_conf 0552114788877653001011001344444664167506765--20255422082066135544454583768961994 Q gi|254780402|r 270 PRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIG--EMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCA 347 (440) Q Consensus 270 PRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~G--eL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~p 347 (440) -.-++..+.+...+++.+. ...|.....+. ++..+|+.|++ ++-.|....-|...+|++++|+| T Consensus 237 g~~~~~~~~~~~~~~~~~~-------------~~~v~~~g~~~~~~~~~~~~~~~~-~v~ps~~e~~~~~~~Ea~a~g~p 302 (381) T COG0438 237 GDGPERREELEKLAKKLGL-------------EDNVKFLGYVPDEELAELLASADV-FVLPSLSEGFGLVLLEAMAAGTP 302 (381) T ss_pred EECCCHHHHHHHHHHHCCC-------------CCCEEECCCCCHHHHHHHHHHCCE-EEECCCCCCCCHHHHHHHHHCCE T ss_conf 9534128899999997088-------------887899177898999999972809-99178644558899999984986 Q ss_pred EEECCCCCCHHHHHHHHHHCCCEEEEC--CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 999898443799999999789869957--98999999999958989999999999999970455699999999975237 Q gi|254780402|r 348 ILSGPNVENFRDIYRRMVSSGAVRIVE--EVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYVNP 424 (440) Q Consensus 348 Vi~GP~~~nf~e~~~~L~~~g~~~~v~--d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l~~ 424 (440) |+....- ...+... +.+.+..+. +.++++.++..++++++.++.++.++++.+. ..=..++..+.+.+.+.. T Consensus 303 vi~~~~~-~~~~~~~---~~~~g~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 376 (381) T COG0438 303 VIASDVG-GIPEVVE---DGETGLLVPPGDVEELADALEQLLEDPELREELGEAARERVE-EEFSWERIAEQLLELYEE 376 (381) T ss_pred EEECCCC-CHHHHHC---CCCEEEEECCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH-HHCCHHHHHHHHHHHHHH T ss_conf 9995899-8688430---697069988999999999999998697999999999999999-866999999999999999 No 75 >KOG0853 consensus Probab=98.00 E-value=3.6e-05 Score=53.63 Aligned_cols=226 Identities=15% Similarity=0.110 Sum_probs=124.9 Q ss_pred HHCCCCCCEEEECCHHHHHHHH----HHCCCCEEEEC---CC-CCCCCCCCCCHH--HHHHHHHHCC--CCCEEECCCCC Q ss_conf 3102644236641577888998----60885058720---43-101234422125--6899998617--98617504655 Q gi|254780402|r 177 KKIFSQFSLVIVQSERYFRRYK----ELGAQKLIVSG---NL-KIDTESLPCDKE--LLSLYQESIA--GRYTWAAISTF 244 (440) Q Consensus 177 ~~~~~~~~~i~~qs~~~~~rl~----~lG~~~i~v~G---nl-K~d~~~~~~~~~--~~~~~~~~~~--~r~v~vagSth 244 (440) .......+.+++.+...+.+|. .|+..++.++- |. +++.-......+ +......... ++...-..+.. T Consensus 204 ~e~~~~~~~~~~ns~~~~~~f~~~~~~L~~~d~~~~y~ei~~s~~~~~~~~~~~~~~~~~r~~~~v~~~d~~~~siN~~~ 283 (495) T KOG0853 204 EETTGLAWKILVNSYFTKRQFKATFVSLSNSDITSTYPEIDGSWFTYGQYESHLELRLPVRLYRGVSGIDRFFPSINRFE 283 (495) T ss_pred HHHHHHCCEEECCHHHHHHHHHHHHHHCCCCCCCEEECCCCHHHCCCCCCCCCHHCCCCCCEEEECCCCCEEEEEEEECC T ss_conf 45431015172214666654453553057888614632424011255542111010255422430046646754566417 Q ss_pred CCC-CCCCCCCCCCCCC-C-----CCEEEEEEE------CCCCH---HHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEE Q ss_conf 332-1000000000012-3-----412488970------55211---478887765300101100134444466416750 Q gi|254780402|r 245 EGE-EDKAVYVHNFIKC-R-----TDVLTIIVP------RHPRR---CDAIERRLIAKGLKVARRSRGDVINAEVDIFLG 308 (440) Q Consensus 245 ~~E-E~~il~a~~~l~~-~-----~~~~lIIvP------RhpeR---~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~ 308 (440) ||- -.+++.++..+.. - ++..++++. |-+|. ..++.+++++.++.-. .|+++ T Consensus 284 pgkd~~l~l~a~~~~~~~i~~~~~~~~hl~~~g~~G~d~~~sen~~~~~el~~lie~~~l~g~------------~v~f~ 351 (495) T KOG0853 284 PGKDQDLALPAFTLLHDSIPEPSISSEHLVVAGSRGYDERDSENVEYLKELLSLIEEYDLLGQ------------FVWFL 351 (495) T ss_pred CCCCCEEEHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC------------EEEEE T ss_conf 999846644447764003578887711699943787644553558999999999997276665------------69984 Q ss_pred CCCCCC--HHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEE----CCCCCCHHHHHHHHHHCCCEEEECCHH---HH Q ss_conf 676520--255422082066135544454583768961994999----898443799999999789869957989---99 Q gi|254780402|r 309 DTIGEM--GFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILS----GPNVENFRDIYRRMVSSGAVRIVEEVG---TL 379 (440) Q Consensus 309 Dt~GeL--~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~----GP~~~nf~e~~~~L~~~g~~~~v~d~~---eL 379 (440) -..-.- -.+++-+..+|+=+.+ +.-|-.|+|||+||+||+- || .|++... .+|-.... +.+ ++ T Consensus 352 ~s~~~~~~yrl~adt~~v~~qPa~-E~FGiv~IEAMa~glPvvAt~~GGP-----~EiV~~~-~tG~l~dp-~~e~~~~~ 423 (495) T KOG0853 352 PSTTRVAKYRLAADTKGVLYQPAN-EHFGIVPIEAMACGLPVVATNNGGP-----AEIVVHG-VTGLLIDP-GQEAVAEL 423 (495) T ss_pred CCCCHHHHHHHHHHCCEEEECCCC-CCCCCEEHHHHHCCCCEEEECCCCC-----EEEEECC-CCCEEECC-CHHHHHHH T ss_conf 577638889998744357726888-7756333987855998899669996-----5789848-85044577-45778999 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHCC---CHHHHHHHHHHHHH Q ss_conf 99999995898999999999999997045---56999999999752 Q gi|254780402|r 380 ADMVYSLLSEPTIRYEMINAAINEVKKMQ---GPLKITLRSLDSYV 422 (440) Q Consensus 380 ~~~l~~ll~d~~~~~~~~~~a~~~i~~~~---ga~~~~~~~i~~~l 422 (440) ++++.++..||+.+.+|+++|++.|.+.- --.+|+...+..++ T Consensus 424 a~~~~kl~~~p~l~~~~~~~G~krV~e~fs~~~~~~ri~~~~~~~~ 469 (495) T KOG0853 424 ADALLKLRRDPELWARMGKNGLKRVKEMFSWQHYSERIASVLGKYL 469 (495) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999998139899999988788999998707779999999877507 No 76 >pfam04230 PS_pyruv_trans Polysaccharide pyruvyl transferase. Pyruvyl-transferases involved in peptidoglycan-associated polymer biosynthesis. CsaB in Bacillus anthracis is necessary for the non-covalent anchoring of proteins containing an SLH (S-layer homology) domain to peptidoglycan-associated pyruvylated polysaccharides. WcaK and AmsJ are involved in the biosynthesis of colanic acid in Escherichia coli and of amylovoran in Erwinia amylovora. Probab=97.99 E-value=0.00088 Score=44.53 Aligned_cols=234 Identities=15% Similarity=0.129 Sum_probs=124.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECC-------- Q ss_conf 999999999999998619989999627763001210147741688505572013445574420203564047-------- Q gi|254780402|r 66 VGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSES-------- 137 (440) Q Consensus 66 vGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~-------- 137 (440) .|.......+.+.|.+.+++..++..+....... .......... +++..+ T Consensus 2 ~GD~li~~~~~~~L~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~---------------------vii~Gg~~~~~~~~ 59 (258) T pfam04230 2 YGDELILEALLKLLKKLGPDAEIIDYSPEPSEDR-KRLSDKRAKS---------------------VIIVGGGGDLLRLL 59 (258) T ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEECCCHHHHH-HHHCCCCCCE---------------------EEEECCCCCCCCCH T ss_conf 5579999999999998589942999579979999-8806555561---------------------89978864664479 Q ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCC--CEEEECCCCCCC Q ss_conf 621467888742245505864126654212100111123310264423664157788899860885--058720431012 Q gi|254780402|r 138 DIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQ--KLIVSGNLKIDT 215 (440) Q Consensus 138 ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~--~i~v~GnlK~d~ 215 (440) ..|-.....++..+.|+++...-+++. . -.....+.+.++++++.|.++++.+.+.++++|.. ++.+++|+=|.. T Consensus 60 ~~~~~~~~~~~~~~kp~i~~g~gigp~--~-~~~~~~~~~~~l~~~~~i~vRD~~S~~~l~~~g~~~~~v~~~~D~af~l 136 (258) T pfam04230 60 LLYLRLAFVARKLGKPVILLGQGIGPF--L-NEALRKLLKKLLSKADLISVRDELSAELLEKLGGLSANVELVPDPAFLL 136 (258) T ss_pred HHHHHHHHHHHHCCCCEEEECCCCCCC--C-CHHHHHHHHHHHHCCCEEEECCHHHHHHHHHHCCCCCCEEEECCCCEEC T ss_conf 999999999998699899993566887--9-9889999999983699999868877999999289987859963730115 Q ss_pred CCCCCCHHHHHHHHHHCCCCCEEECCCCCCCC---CCCCCCCC-CCCCCCCCEE-EEEEECCCCHHHHHHHHHHHHCCHH Q ss_conf 34422125689999861798617504655332---10000000-0001234124-8897055211478887765300101 Q gi|254780402|r 216 ESLPCDKELLSLYQESIAGRYTWAAISTFEGE---EDKAVYVH-NFIKCRTDVL-TIIVPRHPRRCDAIERRLIAKGLKV 290 (440) Q Consensus 216 ~~~~~~~~~~~~~~~~~~~r~v~vagSth~~E---E~~il~a~-~~l~~~~~~~-lIIvPRhpeR~~~i~~~l~~~gl~~ 290 (440) ...... ..............+ .+...... ....+..... +.+.+.+................ T Consensus 137 ~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 203 (258) T pfam04230 137 DKLYPK-----------KKDYLVIAKSSRVDNKDLNELLALLYDLLLEKGYRILLIPIVSKGGLKDDDDLIELYAALL-- 203 (258) T ss_pred CCCCCC-----------CCCEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCEEEEEEECCCCCCCHHHHHHHHHHHH-- T ss_conf 765445-----------6640346415662267799999999999985799669997220444320256888876544-- Q ss_pred HHHCCCCCCCCCCEEEEEC---CCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEE Q ss_conf 1001344444664167506---76520255422082066135544454583768961994999 Q gi|254780402|r 291 ARRSRGDVINAEVDIFLGD---TIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILS 350 (440) Q Consensus 291 ~~~S~~~~~~~~~~V~i~D---t~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~ 350 (440) .. .....+.+.+ +.-++..+++-++ .+||..|. -.+-++.+|+|++. T Consensus 204 ----~~---~~~~~~~~~~~~~~~~~~~~~i~~~~-~vi~~RlH-----~~I~a~~~gvP~i~ 253 (258) T pfam04230 204 ----LA---AKKEKVVIIPRFYSPEELLALISQAD-FVITTRLH-----GLIFALLLGVPFIV 253 (258) T ss_pred ----HH---HCCCCEEEECCCCCHHHHHHHHHCCC-EEEEECHH-----HHHHHHHCCCCEEE T ss_conf ----43---03675488479999999999997098-99983737-----99999975999899 No 77 >PRK10422 lipopolysaccharide core biosynthesis protein; Provisional Probab=97.96 E-value=0.0019 Score=42.26 Aligned_cols=264 Identities=12% Similarity=0.125 Sum_probs=114.4 Q ss_pred CCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCH---HHHHH------ Q ss_conf 888877999817899999999999999861998999962776300121014774168850557201---34455------ Q gi|254780402|r 53 RPIGPLIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQP---AVSRF------ 123 (440) Q Consensus 53 ~~~~~~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~---~~~~f------ 123 (440) +|.++.+=+.-.++|.+..+.|+++.|++++|+.-||..+.....-+-+..|....+..++-.... ..+.+ T Consensus 3 ~~~kkILIir~~~iGD~il~tP~i~~Lk~~~P~a~I~~l~~~~~~~ll~~~P~id~i~~~~~k~~~~~~~~~~~~~l~~~ 82 (352) T PRK10422 3 KPFRRILIIKMRFHGDMLLTTPVISSLKKNYPDAKIDVLLYQDTIPILSENPEINALYGIKNKKAGASEKIKNFFSLIKV 82 (352) T ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHHHHH T ss_conf 89977999758860499999999999999889988999978047999833999627988667554456779999999999 Q ss_pred HHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCC Q ss_conf 74420203564047621467888742245505864126654212100111123310264423664157788899860885 Q gi|254780402|r 124 LKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQ 203 (440) Q Consensus 124 l~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~ 203 (440) ++.-+.|.++-+..-.--.++... .+++...- .+-..+....| ..++........ . ...+...+.+..+|.+ T Consensus 83 Lr~~~yD~vi~l~~~~~~~~l~~~--~~~~~~ig-~~~~~~~~~~~---~~~~~~~~~~~~-~-h~v~~~l~ll~~l~~~ 154 (352) T PRK10422 83 LRANKYDLIVNLTDQWMVALLVRL--LNARVKIS-QDYHHRQSAFW---RKSFTHLVPLQG-G-HVVESNLSVLTPLGLS 154 (352) T ss_pred HHHCCCCEEEECCCCHHHHHHHHH--HCCCEEEC-CCCCCCHHHHH---HHHHCCCCCCCC-H-HHHHHHHHHHHHCCCC T ss_conf 855488778866766499999998--38985865-66652101455---655314688754-1-5999999998646998 Q ss_pred CEEEECCCCCCCCCCCCCHHHHHHHHHH--CCCCCEEEC-CCC-----CCCCCCCCCCCCCCCCCC-CCEEEEEEECCCC Q ss_conf 0587204310123442212568999986--179861750-465-----533210000000000123-4124889705521 Q gi|254780402|r 204 KLIVSGNLKIDTESLPCDKELLSLYQES--IAGRYTWAA-IST-----FEGEEDKAVYVHNFIKCR-TDVLTIIVPRHPR 274 (440) Q Consensus 204 ~i~v~GnlK~d~~~~~~~~~~~~~~~~~--~~~r~v~va-gSt-----h~~EE~~il~a~~~l~~~-~~~~lIIvPRhpe 274 (440) ... .+......+.+.+........ .+++.|.+. |++ |+.| . ..+..+.+.+. .+..++.-|-.-| T Consensus 155 ~~~----~~~~l~~~~~~~~~~~~~l~~~~~~~~~ivi~pga~~~~K~Wp~e-~-~a~l~~~L~~~g~~vvl~ggp~~~e 228 (352) T PRK10422 155 SLV----KETTMSYRPESWKRMRRQLDHLGVTQNYVVIQPTARQIFKCWDND-K-FSAVIDALQARGYEVVLTSGPAKDD 228 (352) T ss_pred CCC----CCCCCCCCHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCCCHH-H-HHHHHHHHHHCCCEEEEECCCCHHH T ss_conf 667----555667887899999987674488898799967899856779999-9-9999999984799199972898899 Q ss_pred H--HHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEE--CCCCCCHHHHHHCCCEEEECCCCCCC-CCCHHHHHHHCCCEE Q ss_conf 1--478887765300101100134444466416750--67652025542208206613554445-458376896199499 Q gi|254780402|r 275 R--CDAIERRLIAKGLKVARRSRGDVINAEVDIFLG--DTIGEMGFYLRMTEIAFIGRSFCASG-GQNPLEAAMLGCAIL 349 (440) Q Consensus 275 R--~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~--Dt~GeL~~lY~~AdiafVGGSl~~~G-GhN~lEpaa~G~pVi 349 (440) . .+++.+.+.. ...+-+. -++.||..+.+.|++ |||+ +.| .|= |+|+|+|+| T Consensus 229 ~~~~~~i~~~~~~----------------~~v~~l~G~tsL~el~ali~~a~l-~I~n---DSGpmHl---AaAlg~P~V 285 (352) T PRK10422 229 LACVNEIAQGCQT----------------PPVTALAGKTTFPELGALIDHAQL-FIGV---DSAPAHI---AAAVNTPLI 285 (352) T ss_pred HHHHHHHHHHCCC----------------CCCEECCCCCCHHHHHHHHHHCCE-EEEC---CCHHHHH---HHHCCCCEE T ss_conf 9999999974679----------------870423578889999999981787-5605---9818999---998299989 Q ss_pred --ECCC Q ss_conf --9898 Q gi|254780402|r 350 --SGPN 353 (440) Q Consensus 350 --~GP~ 353 (440) |||- T Consensus 286 alFGpT 291 (352) T PRK10422 286 CLFGAT 291 (352) T ss_pred EEECCC T ss_conf 998999 No 78 >pfam00201 UDPGT UDP-glucoronosyl and UDP-glucosyl transferase. Probab=97.95 E-value=0.00025 Score=48.16 Aligned_cols=104 Identities=15% Similarity=0.220 Sum_probs=85.4 Q ss_pred CCCCCC-CCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEEC----CHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 544454-583768961994999898443799999999789869957----989999999999589899999999999999 Q gi|254780402|r 330 FCASGG-QNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVE----EVGTLADMVYSLLSEPTIRYEMINAAINEV 404 (440) Q Consensus 330 l~~~GG-hN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~----d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i 404 (440) |+-+|| .+.+|++.+|+|+|.=|-+.+....++.+.+.|++..++ +.++|.+++.++++||.-++.+.+-++-+- T Consensus 345 FITHgG~~S~~Eai~~GVP~v~iP~f~DQ~~Na~~~~~~G~g~~l~~~~lt~~~l~~ai~~vl~n~~Y~~na~~~s~~~~ 424 (501) T pfam00201 345 FVTHAGSNGVYEAICHGVPMVGMPLFGDQMDNAKHMEAKGAAVTLNVLTMTSEDLLNALKTVINDPSYKENIMRLSSIHH 424 (501) T ss_pred EEECCCHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHH T ss_conf 99658730699999859898971563446999999997797899632119999999999999709889999999999986 Q ss_pred HHCCCHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 704556999999999752378785016789 Q gi|254780402|r 405 KKMQGPLKITLRSLDSYVNPLIFQNHLLSK 434 (440) Q Consensus 405 ~~~~ga~~~~~~~i~~~l~~~~~~~~~~~~ 434 (440) .+-..+.++.+.-++-.+.- --..|+.+. T Consensus 425 d~P~~p~~~av~w~E~v~r~-~g~~~l~~~ 453 (501) T pfam00201 425 DQPVKPLDRAVFWIEFVMRH-KGAKHLRPA 453 (501) T ss_pred CCCCCHHHHHHHHHHHHHHC-CCCCCCCCC T ss_conf 59999899999999999835-997313666 No 79 >PRK10964 ADP-heptose:LPS heptosyl transferase I; Provisional Probab=97.93 E-value=0.0016 Score=42.89 Aligned_cols=279 Identities=13% Similarity=0.127 Sum_probs=127.0 Q ss_pred EEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHH---------------HHH Q ss_conf 79998178999999999999998619989999627763001210147741688505572013---------------445 Q gi|254780402|r 58 LIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPA---------------VSR 122 (440) Q Consensus 58 ~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~---------------~~~ 122 (440) ..-+-..++|.+..+.|+++.|++.+|+.-||..+..+++.+-+..|..+.+ +|++.... ..+ T Consensus 3 ILiIr~~~lGDvilttP~l~~Lr~~~P~a~I~~lv~~~~~~l~~~~P~id~v--i~~~~~~~~k~~~~~~~~~~~~~~~~ 80 (322) T PRK10964 3 VLIVKTSSMGDVLHTLPALTDAQQAIPGIQFDWVVEEGFAQIPSWHSAVDRV--IPVAIRRWRKAWFSAPIKAERKAFRR 80 (322) T ss_pred EEEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHCCCCCCEE--EECCCCHHHHCCCCHHHHHHHHHHHH T ss_conf 9999167568999899999999998899889999772578875109862588--74242012212231157999999999 Q ss_pred HHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHH--- Q ss_conf 57442020356404762146788874224550586412665421210011112331026442366415778889986--- Q gi|254780402|r 123 FLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKE--- 199 (440) Q Consensus 123 fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~--- 199 (440) -++.-+-|++|-+..-++-.++...-..+.+... + .++.+. ....+++. .-..+. ..+...+++.+ T Consensus 81 ~lr~~~yD~vidlq~~~rsa~l~~~~~~~~r~g~-~----~~~~r~-~~~~~~~~----~~~~~~-~~~h~v~r~~~l~~ 149 (322) T PRK10964 81 ALQAEQYDAVIDAQGLVKSAALVTRLAHGVKHGM-D----WQSARE-PLASLFYN----RKHHIA-KQQHAVERTRELFA 149 (322) T ss_pred HHHHCCCCEEEECCCCHHHHHHHHHHCCCCEECC-C----CCCCCC-CHHHHHCC----CCCCCC-CCCCHHHHHHHHHH T ss_conf 9874589799988531778999998636861046-6----433444-01454303----530687-31039999999999 Q ss_pred --HCCCCEEEECCCCCCCCCCCCCHHHHHHHHHH-CCCCCE--EECCCCCCCC---CCCCCCCCCCCCCCCCEEEEEE-- Q ss_conf --08850587204310123442212568999986-179861--7504655332---1000000000012341248897-- Q gi|254780402|r 200 --LGAQKLIVSGNLKIDTESLPCDKELLSLYQES-IAGRYT--WAAISTFEGE---EDKAVYVHNFIKCRTDVLTIIV-- 269 (440) Q Consensus 200 --lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~-~~~r~v--~vagSth~~E---E~~il~a~~~l~~~~~~~lIIv-- 269 (440) +|.+.-...++.. .. ....... ...++. ++-+++|+.- .+-..+..+.+.+. ....++. T Consensus 150 ~~lg~~~p~~~~~~~-------~~---~~~~~~~~~~~~~~vv~~~~~s~~~K~Wp~e~f~~La~~L~~~-g~~v~l~~G 218 (322) T PRK10964 150 KSLGYSKPQTQGDYA-------IA---QHFLTNLPADAGPYLVFLHATTRDDKHWPEAHWRELIGLLADS-GLRIKLPWG 218 (322) T ss_pred HHCCCCCCCCCHHHH-------HH---HHHHHHCCCCCCCEEEEECCCCCCCCCCCHHHHHHHHHHHHHC-CCEEEEECC T ss_conf 974998987502256-------65---9987412125698499973787412589989999999999967-997999478 Q ss_pred -ECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCC-CCCHHHHHHHCCC Q ss_conf -05521147888776530010110013444446641675067652025542208206613554445-4583768961994 Q gi|254780402|r 270 -PRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASG-GQNPLEAAMLGCA 347 (440) Q Consensus 270 -PRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~G-GhN~lEpaa~G~p 347 (440) |.+.++.+++. .+.. ..++.--.++.||..+.+.|++ |||+ +.| .|= |+++|+| T Consensus 219 ~~~e~~~~~~i~-----~~~~------------~v~~~g~~sL~elaall~~a~l-~I~n---DSG~mHl---AaAlg~P 274 (322) T PRK10964 219 APHEEARAKRLA-----EGFD------------YVEVLPKMSLEEVARVLAGAKA-VVSV---DTGLSHL---TAALDRP 274 (322) T ss_pred CHHHHHHHHHHH-----HCCC------------CCEECCCCCHHHHHHHHHHCCE-EEEC---CCHHHHH---HHHCCCC T ss_conf 989999999998-----0699------------6124589999999999970999-9966---9759999---9983999 Q ss_pred EE----------ECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHC Q ss_conf 99----------989844379999999978986995798999999999958 Q gi|254780402|r 348 IL----------SGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLS 388 (440) Q Consensus 348 Vi----------~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~ 388 (440) ++ +||+-.|...+ ...|....-=++++..+++.++++ T Consensus 275 ~v~LFGpT~P~~~gP~g~~~~~~----~~~~~~~~~~~~~~v~~~~~~~~~ 321 (322) T PRK10964 275 NITLYGPTDPGLIGGYGKNQHAC----RSEGKSMANLSAETVFQKLETLIS 321 (322) T ss_pred EEEEECCCCCCCCCCCCCCCEEE----ECCCCCCCCCCHHHHHHHHHHHHC T ss_conf 89998889940307888882489----689987021999999999999745 No 80 >COG4641 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.88 E-value=0.0021 Score=42.10 Aligned_cols=314 Identities=13% Similarity=0.062 Sum_probs=157.8 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHH-HHHHHHHCCEEEEEECCC-----CH-HH Q ss_conf 999999999986199899996277630012101477416885055720134-455744202035640476-----21-46 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAV-SRFLKYWKPDCMILSESD-----IW-PL 142 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~-~~fl~~~~P~~~i~~e~E-----lW-Pn 142 (440) .--+-+.+.|..++.+++---.-+.+..--.....+..+..++==|.+... ....+.++|+.++.+-.+ .| ++ T Consensus 18 ~~~~~~~~~l~~~g~kvlflE~~~~~~~k~rd~~~~~~~~~~~~~~~~e~~~~~~i~~fk~d~iv~~~~~~~~~~~~~~~ 97 (373) T COG4641 18 EYYRGLLRALKMDGMKVLFLESGDFWDYKNRDIDAEDGCTEAFYKDQPELESLLYIREFKPDIIVNMSGDDQPDEESTID 97 (373) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCHHHHHCCCCCCCCCHHHEEECCCHHHHHHHHHHHCCCCEEEEECCCCCCCCEEHHH T ss_conf 64999999998565548998326277540644567656001000574789999989862874799951566644100179 Q ss_pred HHHHHHHCCCCEEEEEECCCCCCCCCCHHHHH-HHHHCCCCCCEEEECCHHH-HHHHH-HHCCCCEEEECCCCCCC-CCC Q ss_conf 78887422455058641266542121001111-2331026442366415778-88998-60885058720431012-344 Q gi|254780402|r 143 TVFELSKQRIPQVLVNARMSRRSFKNWKTVLS-FSKKIFSQFSLVIVQSERY-FRRYK-ELGAQKLIVSGNLKIDT-ESL 218 (440) Q Consensus 143 li~~~~~~~ip~~linarls~~S~~~~~~~~~-l~~~~~~~~~~i~~qs~~~-~~rl~-~lG~~~i~v~GnlK~d~-~~~ 218 (440) ++..|++..+|++.--.-..-.....-++... +.+..+..||.|++-++.. +.++. +.|..+....+ ...|- ... T Consensus 98 ~~a~l~~~~l~~~~w~te~p~~~~~~~~~~~~~~~~~~l~~fd~v~~~g~~l~~~~yyq~~~~~~~~~~~-~a~d~~~~~ 176 (373) T COG4641 98 LWAWLKRKCLPVIVWYTEDPYDTDIFSQVAEEQLARRPLFIFDNVLSFGGGLVANKYYQEGGARNCYYLP-WAVDDSLFH 176 (373) T ss_pred HHHHHHCCCCCEEEEEECCCHHHHHHHHHHHHHHHCCCCCHHHHHHHCCCHHHHHHHHHHHCCCCEECCC-CCCCCHHCC T ss_conf 9998526786269997146002466653017776105551334233126407889998762355113057-567823206 Q ss_pred CCC--HHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCC-CCCCC-CE-EEEEEECCCCHHHHHHHHHHHHCCHHHHH Q ss_conf 221--256899998617986175046553321000000000-01234-12-48897055211478887765300101100 Q gi|254780402|r 219 PCD--KELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNF-IKCRT-DV-LTIIVPRHPRRCDAIERRLIAKGLKVARR 293 (440) Q Consensus 219 ~~~--~~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~-l~~~~-~~-~lIIvPRhpeR~~~i~~~l~~~gl~~~~~ 293 (440) +.. ... .. -+.+.|+.+|.-.+.+-+.+.. ..+.. +. .....||-|... ....-.....+.-+ T Consensus 177 ~i~~da~~--------~~-dL~~ign~~pDr~e~~ke~~~~ps~kl~v~rr~~~~g~~y~~~~---~~~~~~~~~~yIg~ 244 (373) T COG4641 177 PIPPDASY--------DV-DLNLIGNPYPDRVEEIKEFFVEPSFKLMVDRRFYVLGPRYPDDI---WGRTWEPNVQYIGY 244 (373) T ss_pred CCCCCCCC--------EE-EEEEECCCCCCHHHHHHHHHHCCCHHHHCCCEEEECCCCCCHHH---HCCCCCCHHHHHHC T ss_conf 69854130--------13-36773588855789999986152011100653455077652344---21133656666332 Q ss_pred CCCCCCCCCCEEEEECCCCCCHHHHHHCC---------CEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHH Q ss_conf 13444446641675067652025542208---------206613554445458376896199499989844379999999 Q gi|254780402|r 294 SRGDVINAEVDIFLGDTIGEMGFYLRMTE---------IAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRM 364 (440) Q Consensus 294 S~~~~~~~~~~V~i~Dt~GeL~~lY~~Ad---------iafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L 364 (440) ...-+.....|+..+ ++-++-|+..+ ..|+|+||.+.+ .+..+.++.+ T Consensus 245 --------------~~~~~~v~~~~~~~~~~~n~~r~~~~~~l~~~~~R----vFeiagc~~~li-----T~~~~~~e~~ 301 (373) T COG4641 245 --------------YNPKDGVPNAFKRDDVTLNINRASIANALFSPTNR----VFEIAGCGGFLI-----TDYWKDLEKF 301 (373) T ss_pred --------------CCCCCHHHHCCCCCCEEEEECHHHHHHHCCCCHHH----HHHHHHCCCCCC-----CCCHHHHHHH T ss_conf --------------68500000003554435641378887614785056----888761587501-----5427889872 Q ss_pred HHCC-CEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 9789-869957989999999999589899999999999999704556999999999 Q gi|254780402|r 365 VSSG-AVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLD 419 (440) Q Consensus 365 ~~~g-~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~ 419 (440) ...| .....+|.+++.+.+..++..++.|++++..|.+-+..--.-.+++.+.++ T Consensus 302 f~pgk~~iv~~d~kdl~~~~~yll~h~~erkeiae~~ye~V~~~ht~~~r~~~~~~ 357 (373) T COG4641 302 FKPGKDIIVYQDSKDLKEKLKYLLNHPDERKEIAECAYERVLARHTYEERIFKLLN 357 (373) T ss_pred CCCCHHEEEECCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 59860258963789999999998448306899998669999874327888999999 No 81 >cd04950 GT1_like_1 Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center Probab=97.81 E-value=0.0035 Score=40.63 Aligned_cols=229 Identities=12% Similarity=0.082 Sum_probs=121.4 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEE Q ss_conf 20356404762146788874224550586412665421210011112331026442366415778889986088505872 Q gi|254780402|r 129 PDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVS 208 (440) Q Consensus 129 P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~ 208 (440) .+.++|.-+..|..++..+... .-++=..=.++.-..- .......-+.+++..|.|++.|+.-.+.++..+ +++.+. T Consensus 102 ~~~ilw~~~P~~~~~~~~l~~~-~vVYdcvDd~~~~~~~-~~~~~~~e~~l~~~ad~v~~ts~~L~~~~~~~~-~~~~~i 178 (373) T cd04950 102 GRPILWYYTPYTLPVAALLQAS-LVVYDCVDDLSAFPGG-PPELLEAERRLLKRADLVFTTSPSLYEAKRRLN-PNVVLV 178 (373) T ss_pred CCEEEEECCCCHHHHHHHCCCC-CEEEEEECCHHHCCCC-CHHHHHHHHHHHHHCCEEEECCHHHHHHHHHCC-CCEEEE T ss_conf 9739998173068898753788-3899950612213798-689999999999977999985999999887469-998998 Q ss_pred CCC-CCCCCCCCCCHH-HHHHHHHHCCCCCEEE-CCCCCC-CCCCCCCCCCCCCCCCCCEEEEEEE-CCCCHHHHHHHHH Q ss_conf 043-101234422125-6899998617986175-046553-3210000000000123412488970-5521147888776 Q gi|254780402|r 209 GNL-KIDTESLPCDKE-LLSLYQESIAGRYTWA-AISTFE-GEEDKAVYVHNFIKCRTDVLTIIVP-RHPRRCDAIERRL 283 (440) Q Consensus 209 Gnl-K~d~~~~~~~~~-~~~~~~~~~~~r~v~v-agSth~-~EE~~il~a~~~l~~~~~~~lIIvP-RhpeR~~~i~~~l 283 (440) +|- -++--....... ....+ ...++|++. .|+..+ -+-+++. +..+.+|+..++++. ..+. . .. ..+ T Consensus 179 ~nGvD~~~F~~~~~~~~~~~~~--~~~~~p~igy~G~i~~~~D~~ll~---~~a~~~p~~~~~liGp~~~~-~-~~-~~l 250 (373) T cd04950 179 PNGVDYEHFAAARDPPPPPADL--AALPRPVIGYYGAIAEWLDLELLE---ALAKARPDWSFVLIGPVDVS-I-DP-SAL 250 (373) T ss_pred CCEECHHHHHHHCCCCCCCHHH--CCCCCCEEEEEECCCCCCCHHHHH---HHHHHCCCCEEEEEECCCCC-C-CH-HHH T ss_conf 8821788841001576880455--047998899992575214899999---99998899689999438875-5-83-456 Q ss_pred HHHCCHHHHHCCCCCCCCCCEEEEECC--CCCCHHHHHHCCCEEEECCCCC-CCCCCH---HHHHHHCCCEEECCCCCCH Q ss_conf 530010110013444446641675067--6520255422082066135544-454583---7689619949998984437 Q gi|254780402|r 284 IAKGLKVARRSRGDVINAEVDIFLGDT--IGEMGFYLRMTEIAFIGRSFCA-SGGQNP---LEAAMLGCAILSGPNVENF 357 (440) Q Consensus 284 ~~~gl~~~~~S~~~~~~~~~~V~i~Dt--~GeL~~lY~~AdiafVGGSl~~-~GGhN~---lEpaa~G~pVi~GP~~~nf 357 (440) .+ ...|..... ..+|..+.+.+|++.+-=...+ +.+-|| +|-+|+|+||+.-+ + T Consensus 251 ~~----------------~~Nv~~lG~~~~~~lp~~l~~~Dv~l~P~~~~~~t~~~~P~Kl~EYlA~G~PVVst~----~ 310 (373) T cd04950 251 LR----------------LPNVHYLGPKPYKELPAYLAGFDVAILPFRLNELTRATSPLKLFEYLAAGKPVVATP----L 310 (373) T ss_pred HC----------------CCCEEECCCCCHHHHHHHHHHCCEEEEECCCCHHHHCCCCCHHHHHHHCCCCEEECC----C T ss_conf 25----------------998798489899999999985787774120554542468637999986699889658----4 Q ss_pred HHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCH Q ss_conf 999999997898699579899999999995898 Q gi|254780402|r 358 RDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEP 390 (440) Q Consensus 358 ~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~ 390 (440) .+..+ + ........+|.+++.+++...+..+ T Consensus 311 ~~~~~-~-~~~~~~~~~~~~~F~~~i~~al~~~ 341 (373) T cd04950 311 PEVRR-Y-EDEVVLIADDPEEFVAAIEKALLED 341 (373) T ss_pred HHHHH-C-CCCEEEECCCHHHHHHHHHHHHHCC T ss_conf 88773-1-6867997799999999999998438 No 82 >KOG1387 consensus Probab=97.80 E-value=0.0025 Score=41.55 Aligned_cols=268 Identities=14% Similarity=0.113 Sum_probs=153.1 Q ss_pred HHHHCCEEEE-EECCCCHHHHHHHHHHCCCCEEE------EEECC-------------CCCCCCCCHHHHHHHHHCCCCC Q ss_conf 7442020356-40476214678887422455058------64126-------------6542121001111233102644 Q gi|254780402|r 124 LKYWKPDCMI-LSESDIWPLTVFELSKQRIPQVL------VNARM-------------SRRSFKNWKTVLSFSKKIFSQF 183 (440) Q Consensus 124 l~~~~P~~~i-~~e~ElWPnli~~~~~~~ip~~l------inarl-------------s~~S~~~~~~~~~l~~~~~~~~ 183 (440) +-++-|+..| .+.+.+=.-.++.++ +||+.- |+..| +-....-|++|..+++..-+.. T Consensus 146 i~r~~Pdi~IDtMGY~fs~p~~r~l~--~~~V~aYvHYP~iS~DML~~l~qrq~s~~l~~~KlaY~rlFa~lY~~~G~~a 223 (465) T KOG1387 146 IIRFPPDIFIDTMGYPFSYPIFRRLR--RIPVVAYVHYPTISTDMLKKLFQRQKSGILVWGKLAYWRLFALLYQSAGSKA 223 (465) T ss_pred HHHCCCHHEEECCCCCCHHHHHHHHC--CCCEEEEEECCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 98188331685478740008999871--6953899845603288999998621012113577899999999998614642 Q ss_pred CEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCC--CEEECCCCCCCCCCCCCCCCCC-CCC Q ss_conf 2366415778889986088505872043101234422125689999861798--6175046553321000000000-012 Q gi|254780402|r 184 SLVIVQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGR--YTWAAISTFEGEEDKAVYVHNF-IKC 260 (440) Q Consensus 184 ~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r--~v~vagSth~~EE~~il~a~~~-l~~ 260 (440) |.+++.|.=+.....++=-. .+...+.||++.+..+.....-++| .++..|.-+|+..--+|..+.. +.+ T Consensus 224 d~vm~NssWT~nHI~qiW~~-------~~~~iVyPPC~~e~lks~~~te~~r~~~ll~l~Q~RPEKnH~~Lql~Al~~~~ 296 (465) T KOG1387 224 DIVMTNSSWTNNHIKQIWQS-------NTCSIVYPPCSTEDLKSKFGTEGERENQLLSLAQFRPEKNHKILQLFALYLKN 296 (465) T ss_pred EEEEECCHHHHHHHHHHHHC-------CCEEEECCCCCHHHHHHHHCCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHC T ss_conf 29996266567789998602-------62148728988788887742457760478987603765555889999999751 Q ss_pred ------CCCEEEEEEE--CCC---CHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECC Q ss_conf ------3412488970--552---11478887765300101100134444466416750676520255422082066135 Q gi|254780402|r 261 ------RTDVLTIIVP--RHP---RRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRS 329 (440) Q Consensus 261 ------~~~~~lIIvP--Rhp---eR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGS 329 (440) -+..+|+||. |.- ||...++++.+...++-..-.+.+.| .-+|-.+|+-|.+-. -+= T Consensus 297 ~pl~a~~~~iKL~ivGScRneeD~ervk~Lkd~a~~L~i~~~v~F~~N~P-----------y~~lv~lL~~a~iGv-h~M 364 (465) T KOG1387 297 EPLEASVSPIKLIIVGSCRNEEDEERVKSLKDLAEELKIPKHVQFEKNVP-----------YEKLVELLGKATIGV-HTM 364 (465) T ss_pred CCHHHCCCCCEEEEEECCCCHHHHHHHHHHHHHHHHCCCCCCEEEEECCC-----------HHHHHHHHCCCEEEH-HHH T ss_conf 82010468825999705478113999998887898628754538995598-----------799999861155545-244 Q ss_pred CCCCCCCCHHHHHHHCCCEEE----CCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHC-CHHHHHHHHHHHHHHH Q ss_conf 544454583768961994999----89844379999999978986995798999999999958-9899999999999999 Q gi|254780402|r 330 FCASGGQNPLEAAMLGCAILS----GPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLS-EPTIRYEMINAAINEV 404 (440) Q Consensus 330 l~~~GGhN~lEpaa~G~pVi~----GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~-d~~~~~~~~~~a~~~i 404 (440) ..++-|-.+.|.||+|.-.+. ||.++=..+ -.....++...+.+|-++++.+.+. |++.|..|+++|++-+ T Consensus 365 wNEHFGIsVVEyMAAGlIpi~h~SgGP~lDIV~~----~~G~~tGFla~t~~EYaE~iLkIv~~~~~~r~~~r~~AR~s~ 440 (465) T KOG1387 365 WNEHFGISVVEYMAAGLIPIVHNSGGPLLDIVTP----WDGETTGFLAPTDEEYAEAILKIVKLNYDERNMMRRNARKSL 440 (465) T ss_pred HHHHCCHHHHHHHHCCCEEEEECCCCCCEEEEEC----CCCCCCEEECCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 5520355689987557268870789973236404----578601011587289999999999719888888899999999 Q ss_pred HHCCC-HHHHHHH Q ss_conf 70455-6999999 Q gi|254780402|r 405 KKMQG-PLKITLR 416 (440) Q Consensus 405 ~~~~g-a~~~~~~ 416 (440) .+-.. ..++-++ T Consensus 441 ~RFsE~~F~kd~~ 453 (465) T KOG1387 441 ARFGELKFDKDWE 453 (465) T ss_pred HHHHHHHHHHHHH T ss_conf 8866888877676 No 83 >cd01635 Glycosyltransferase_GTB_type Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. The structures of the formed glycoconjugates are extremely diverse, reflecting a wide range of biological functions. The members of this family share a common GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. Probab=97.75 E-value=0.0012 Score=43.54 Aligned_cols=200 Identities=20% Similarity=0.199 Sum_probs=103.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHH- Q ss_conf 9999999999999986199899996277630012101477416885055720134455744202035640476214678- Q gi|254780402|r 66 VGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTV- 144 (440) Q Consensus 66 vGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli- 144 (440) -|....+..+.+.+.+++.++-+-+ + .+....+.++.++||++..-.. ||..+ T Consensus 13 GG~~~~~~~l~~~l~~~gh~v~v~~----------------------~--~~~~~~~~~~~~~pDiIH~h~~--~~~~~~ 66 (229) T cd01635 13 GGVELVLLDLAKALARRGHEVEVVA----------------------L--LLLLLLRILRGFKPDVVHAHGY--YPAPLA 66 (229) T ss_pred CCHHHHHHHHHHHHHHCCCEEEEEE----------------------C--CHHHHHHHHHHCCCCEEEECCC--CHHHHH T ss_conf 9599999999999998699999993----------------------8--8499999998099888998997--279999 Q ss_pred --HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCH Q ss_conf --887422455058641266542121001111233102644236641577888998608850587204310123442212 Q gi|254780402|r 145 --FELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDK 222 (440) Q Consensus 145 --~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~ 222 (440) ......++|++...--........+. ...........+ .. T Consensus 67 ~~~~~~~~~~~~v~t~H~~~~~~~~~~~--~~~~~~~~~~~~--------------------~~---------------- 108 (229) T cd01635 67 LLLAARLLGIPLVLTVHGVNRSLLEGVP--LSLLALSIGLAD--------------------KV---------------- 108 (229) T ss_pred HHHHHHHCCCCEEEEECCCCHHHHHHHH--HHHHHHCCCCEE--------------------EE---------------- T ss_conf 9999984899199998999803424455--555542113618--------------------99---------------- Q ss_pred HHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCC-CCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCC Q ss_conf 5689999861798617504655332100000000001-234124889705521147888776530010110013444446 Q gi|254780402|r 223 ELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNFIK-CRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINA 301 (440) Q Consensus 223 ~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~l~-~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~ 301 (440) ++..-+...-.+.++++++.++ +.++..++++..-+++. ...+.....+. T Consensus 109 ---------------~vgrl~~~K~~~~li~a~~~l~~~~~~~~l~i~G~~~~~~-~~~~~~~~~~~------------- 159 (229) T cd01635 109 ---------------FVGRLAPEKGLDDLIEAFALLKERGPDLKLVIAGDGPERE-YLEELLAALLL------------- 159 (229) T ss_pred ---------------EEEECCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCHHH-HHHHHHHHCCC------------- T ss_conf ---------------9995462129999999999988678994899996880688-99999997288------------- Q ss_pred CCEEEEECC---CCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHH Q ss_conf 641675067---65202554220820661355444545837689619949998984437999 Q gi|254780402|r 302 EVDIFLGDT---IGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDI 360 (440) Q Consensus 302 ~~~V~i~Dt---~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~ 360 (440) ...+....- --++..+|+.||+ |+--|..+.-|...+||+++|+|||.-+. ...+|+ T Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~d~-~v~pS~~E~~~~~~~EA~a~G~pvi~~~~-gg~~e~ 219 (229) T cd01635 160 LDRVIFLGGLDPEELLALLLAAADV-FVLPSLREGFGLVVLEAMACGLPVIATDV-GGPPEI 219 (229) T ss_pred CCCEEECCCCCCHHHHHHHHHHCCE-EEECCCCCCCCHHHHHHHHCCCCEEECCC-CCCHHH T ss_conf 7746323622106789999970680-66056667888899999982998998789-983776 No 84 >PRK10916 ADP-heptose:LPS heptosyltransferase II; Provisional Probab=97.75 E-value=0.003 Score=41.04 Aligned_cols=261 Identities=14% Similarity=0.093 Sum_probs=120.7 Q ss_pred EEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCH----HHHHH---HHHHCCEE Q ss_conf 999817899999999999999861998999962776300121014774168850557201----34455---74420203 Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQP----AVSRF---LKYWKPDC 131 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~----~~~~f---l~~~~P~~ 131 (440) +-+.-.++|.+..+.|+++.|++.+|+.-||..++.+.+-.-+..|..+.+..+|..... ..++| |+.-+-|. T Consensus 4 Lvi~~~~iGDvvlttP~l~aLr~~~P~a~I~~l~~~~~~~l~~~~P~id~vi~~~~~~~~~~~~~~~~l~~~Lr~~~yD~ 83 (348) T PRK10916 4 LVIGPSWVGDMMMSQSLYRTLKARYPQAIIDVMAPAWCRPLLSRMPEVNEAIPMPLGHGALEIGERRRLGHSLREKRYDR 83 (348) T ss_pred EEECCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHHHHHHHCCCE T ss_conf 99888764699999999999998789988999978626999950998448997467554000679999999998738998 Q ss_pred EEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCC--EEEEC Q ss_conf 5640476214678887422455058641266542121001111233102644236641577888998608850--58720 Q gi|254780402|r 132 MILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQK--LIVSG 209 (440) Q Consensus 132 ~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~--i~v~G 209 (440) ++.+-+-+...++... .|+|.-+-- +...+|..+ .. ...++.-. .....+++..+..+. ..... T Consensus 84 ~i~l~~s~rsal~~~l--ag~~~riG~-----~~~~r~~l~----~~-~~~~~~~~--~~~~v~~~~~l~~~~~~~~~~~ 149 (348) T PRK10916 84 AYVLPNSFKSALVPFF--AGIPHRTGW-----RGEMRYGLL----ND-LRVLDKEA--WPLMVERYVALAYDKGVMRTAQ 149 (348) T ss_pred EEECCCCHHHHHHHHH--CCCCCCCCC-----CHHHHHHHC----CC-CCCCCCCC--CCHHHHHHHHHHHHHCCCCCHH T ss_conf 9999986799999986--377721244-----302344431----66-52367433--4078999998775402211100 Q ss_pred CCCCCC--CCCCCCHHHHHHHHHH---CCCCCEEE--CCCCC------CCCCCCCCCCCCCCCCCCCEEEEEE--ECCCC Q ss_conf 431012--3442212568999986---17986175--04655------3321000000000012341248897--05521 Q gi|254780402|r 210 NLKIDT--ESLPCDKELLSLYQES---IAGRYTWA--AISTF------EGEEDKAVYVHNFIKCRTDVLTIIV--PRHPR 274 (440) Q Consensus 210 nlK~d~--~~~~~~~~~~~~~~~~---~~~r~v~v--agSth------~~EE~~il~a~~~l~~~~~~~lIIv--PRhpe 274 (440) ++.... ......+.+....... .+++++++ -||+| |.|- + .+..+.+.+. +..+++. |.+-+ T Consensus 150 ~~p~p~~~p~l~~~~~~~~~~~~~~~~~~~~~~i~i~pGa~~~~~KrWp~e~-f-a~la~~L~~~-g~~vvl~G~~~e~~ 226 (348) T PRK10916 150 DLPQPLLWPQLQVSEGEKSYTCNQFSLSSERPIIGFCPGAEFGPAKRWPHYH-Y-AELAKQLIDE-GYQVVLFGSAKDHE 226 (348) T ss_pred CCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCCCCHHH-H-HHHHHHHHHC-CCEEEEEECHHHHH T ss_conf 0575445556678989999999864877799779981687666567798899-9-9999999968-99799981723699 Q ss_pred HHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEEC--CCCCCHHHHHHCCCEEEECCCCCCC-CCCHHHHHHHCCCEE-- Q ss_conf 14788877653001011001344444664167506--7652025542208206613554445-458376896199499-- Q gi|254780402|r 275 RCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGD--TIGEMGFYLRMTEIAFIGRSFCASG-GQNPLEAAMLGCAIL-- 349 (440) Q Consensus 275 R~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~D--t~GeL~~lY~~AdiafVGGSl~~~G-GhN~lEpaa~G~pVi-- 349 (440) ..+++.+.+....- ..-+-++. ++.|+..+.+.|+ .|||+ +.| .|= |+++|+|+| T Consensus 227 ~~~~i~~~l~~~~~-------------~~~~nl~GktsL~el~ali~~a~-l~I~n---DSGpmHl---AaA~g~P~val 286 (348) T PRK10916 227 AGNEILAALNTEQQ-------------AWCRNLAGETQLEQAVILIAACK-AIVTN---DSGLMHV---AAALNRPLVAL 286 (348) T ss_pred HHHHHHHHHHHHCC-------------CCEECCCCCCCHHHHHHHHHHCC-EEEEC---CCHHHHH---HHHCCCCEEEE T ss_conf 99999985103315-------------65141678899999999998559-87844---8828999---99809988999 Q ss_pred ECCCCCC Q ss_conf 9898443 Q gi|254780402|r 350 SGPNVEN 356 (440) Q Consensus 350 ~GP~~~n 356 (440) |||.--. T Consensus 287 FGpT~P~ 293 (348) T PRK10916 287 YGPSSPD 293 (348) T ss_pred ECCCCCC T ss_conf 8999933 No 85 >COG2327 WcaK Polysaccharide pyruvyl transferase family protein [Cell wall/membrane/envelope biogenesis] Probab=97.72 E-value=0.0048 Score=39.69 Aligned_cols=302 Identities=12% Similarity=0.125 Sum_probs=164.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCC--CEEEEECCCCCCH--HHHCCCCCCEEEEECCCCCCHHHHHHHHH------------- Q ss_conf 789999999999999986199--8999962776300--12101477416885055720134455744------------- Q gi|254780402|r 64 SSVGETMALIGLIPAIRSRHV--NVLLTTMTATSAK--VARKYLGQYAIHQYAPLDIQPAVSRFLKY------------- 126 (440) Q Consensus 64 aSvGE~~~a~pli~~l~~~~~--~iliT~~T~tg~~--~~~~~~~~~~~~~y~P~D~~~~~~~fl~~------------- 126 (440) ---|+--..+.+++.|++..| ++++-+..|+-.. ......|+.. .|+ ..+.++++++. T Consensus 12 gN~Gdeail~all~~l~~~~~~~~~~~~~~~p~~i~~p~~~~~~p~~~----~~~-l~g~~k~v~R~~~k~~~~~~il~~ 86 (385) T COG2327 12 GNIGDEAILKALLDMLRRLNPDAKVLVMGRRPPVIVDPVFLSANPEGS----AAG-LNGRVKSVLRRRLKHPGLVSILSA 86 (385) T ss_pred CCCCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCEEECCCCCCC----CHH-HHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 876459999999999985386644356504786001313552786667----203-438899999975515308999998 Q ss_pred -HCCEEEEEECCC----------CHHHH--HHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHH Q ss_conf -202035640476----------21467--88874224550586412665421210011112331026442366415778 Q gi|254780402|r 127 -WKPDCMILSESD----------IWPLT--VFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERY 193 (440) Q Consensus 127 -~~P~~~i~~e~E----------lWPnl--i~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~ 193 (440) -+-|.+|+..++ +||-. +..++..+.|+++++-...+-.- ..++++.+..+...+.+.++|+.| T Consensus 87 l~~~d~~I~~Gg~l~~d~~~~~~~~~~~~~~~la~l~~kp~~~~g~svGP~~~---~~s~~~~~~~~~~~s~i~vRD~~S 163 (385) T COG2327 87 LGKADLIIIGGGGLLQDVTSSRSIIYYGGSILLARLAGKPTFFFGQSVGPLKH---PLSRQLLNYVLGGCSAISVRDPVS 163 (385) T ss_pred HHHCCEEEECCCCCCCCCCCCCEEHHHHHHHHHHHHCCCCEEEEECCCCCCCC---HHHHHHHHHHHCCCCEEEEECHHH T ss_conf 50178899848530157656612110688999999759987999556787667---779999998856774899836875 Q ss_pred HHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCC-CC-CCCC-----CCCCCCCCC-CC-CCCE Q ss_conf 88998608850587204310123442212568999986179861750465-53-3210-----000000000-12-3412 Q gi|254780402|r 194 FRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAIST-FE-GEED-----KAVYVHNFI-KC-RTDV 264 (440) Q Consensus 194 ~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSt-h~-~EE~-----~il~a~~~l-~~-~~~~ 264 (440) .++++++|.+.+.++ |.-|.......+.... ......+.+-+.-.- |+ ++-+ .+-++.+.+ .+ .... T Consensus 164 ~~llk~~gi~a~l~~-D~Af~L~~~~~~~~~~---~~~~~~~~~~i~lr~~~~~~t~~~~~~~~~~~~l~~~~~~~~~~~ 239 (385) T COG2327 164 YELLKQLGINARLVT-DPAFLLPASSQNATAS---DVEAREKTVAITLRGLHPDNTAQRSILKYVNEALDLVERQVKALW 239 (385) T ss_pred HHHHHHCCCCEEEEC-CCCEECCCCCCCCCCC---CCCCCCCEEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCE T ss_conf 999998099837605-8402234544465521---100345537999414677515667899989999999998641044 Q ss_pred EEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEE-CCC-CCCHHHHHHCCCEEEECCCCCCCCCCHHHHH Q ss_conf 48897055211478887765300101100134444466416750-676-5202554220820661355444545837689 Q gi|254780402|r 265 LTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLG-DTI-GEMGFYLRMTEIAFIGRSFCASGGQNPLEAA 342 (440) Q Consensus 265 ~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~-Dt~-GeL~~lY~~AdiafVGGSl~~~GGhN~lEpa 342 (440) ++.+.|.+...--.+.+.+...-. ...+|++. |.. -+++...+.+|+ +||.= -|+.+=++ T Consensus 240 ~i~~~~~~~s~d~~va~~i~~~~~------------~~~~i~~~~d~~~~~~~~~l~~~dl-~Vg~R-----~HsaI~al 301 (385) T COG2327 240 RITLIDYGASDDLAVADAIAQLVL------------DSAEILVSSDEYAEELGGILAACDL-IVGMR-----LHSAIMAL 301 (385) T ss_pred EEEEEECCCCCHHHHHHHHHHHCC------------CCCCEEEECCHHHHHHHHHHCCCCE-EEEEH-----HHHHHHHH T ss_conf 887320455421678999876438------------7200686143278898877515745-98621-----68999998 Q ss_pred HHCCCEEECCCCCCHHHHHHHHHHCCCEEEEC--CHHHHHHHHHHHHC-CHHHHHH Q ss_conf 61994999898443799999999789869957--98999999999958-9899999 Q gi|254780402|r 343 MLGCAILSGPNVENFRDIYRRMVSSGAVRIVE--EVGTLADMVYSLLS-EPTIRYE 395 (440) Q Consensus 343 a~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~--d~~eL~~~l~~ll~-d~~~~~~ 395 (440) ++|+|+|-=-|-.++....+++-..+.+..+. |.+.+.+.+.+-+. +++.+++ T Consensus 302 ~~g~p~i~i~Y~~K~~~l~~~~gl~~~~~~i~~~~~~~l~~~~~e~~~~~~~~~~~ 357 (385) T COG2327 302 AFGVPAIAIAYDPKVRGLMQDLGLPGFAIDIDPLDAEILSAVVLERLTKLDELRER 357 (385) T ss_pred HCCCCEEEEEECHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHCCHHHHHH T ss_conf 65997589860577789999729975222477875677899999997345877765 No 86 >pfam04464 Glyphos_transf CDP-Glycerol:Poly(glycerophosphate) glycerophosphotransferase. Wall-associated teichoic acids are a heterogeneous class of phosphate-rich polymers that are covalently linked to the cell wall peptidoglycan of gram-positive bacteria. They consist of a main chain of phosphodiester-linked polyols and/or sugar moieties attached to peptidoglycan via a linkage unit. CDP-glycerol:poly(glycerophosphate) glycerophosphotransferase is responsible for the polymerisation of the main chain of the teichoic acid by sequential transfer of glycerol-phosphate units from CDP-glycerol to the linkage unit lipid. Probab=97.51 E-value=0.0013 Score=43.34 Aligned_cols=161 Identities=12% Similarity=0.101 Sum_probs=95.3 Q ss_pred CCCCEEECCCCCCCCCCC---C------CC-CCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCC Q ss_conf 798617504655332100---0------00-0000012341248897055211478887765300101100134444466 Q gi|254780402|r 233 AGRYTWAAISTFEGEEDK---A------VY-VHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAE 302 (440) Q Consensus 233 ~~r~v~vagSth~~EE~~---i------l~-a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~ 302 (440) .++++++-+-||.+.... . .. ..+.++ .+..+|+=||+-.. ..... .... T Consensus 9 ~~kkvILYaPT~R~~~~~~~~~~~~~~~~~~l~~~l~--~n~~liik~Hp~~~-----~~~~~-------------~~~~ 68 (186) T pfam04464 9 KDKKVILYAPTFRDDGYYSAGSILFNLDLEKLLEKLG--ENYVILVKLHPLVS-----NSIIN-------------KRYD 68 (186) T ss_pred CCCEEEEECCCCCCCCCCCCCCCCCCCCHHHHHHHHC--CCEEEEEEECHHHH-----CCHHH-------------HCCC T ss_conf 9897999869733886655444323013999998727--68399997266764-----00022-------------0257 Q ss_pred CEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEE-CCCCCCHHHH---HHHHHHCCCEEEECCHHH Q ss_conf 416750676520255422082066135544454583768961994999-8984437999---999997898699579899 Q gi|254780402|r 303 VDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILS-GPNVENFRDI---YRRMVSSGAVRIVEEVGT 378 (440) Q Consensus 303 ~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~-GP~~~nf~e~---~~~L~~~g~~~~v~d~~e 378 (440) ..+++++.-..+..++..||+.+-==|=+ .+|.+.+++|||+ -|...++... +..+.+.+.+-.|.+.++ T Consensus 69 ~~~~~~~~~~di~~ll~~aDiLITDYSSi------~fD~lll~kPii~~~~D~~~Y~~~rg~~~d~~~~~~g~~v~~~~e 142 (186) T pfam04464 69 SDVIDVSDYSDIQDLFLASDILITDYSSV------FFDFALLDKPIIFYAPDLEEYRELRGFYFDYEKEAPGPVVKTFEE 142 (186) T ss_pred CCEEECCCCCCHHHHHHHHCEEEEEHHHH------HHHHHHHCCCEEEEECCHHHHHHCCCCCCCHHHCCCCCEECCHHH T ss_conf 76797889858999999843677646889------999998799789981878999752586105878078762598999 Q ss_pred HHHHHHHHHCCHHHHHHHHHH-HHHHHHHC-CCHHHHHHHHHH Q ss_conf 999999995898999999999-99999704-556999999999 Q gi|254780402|r 379 LADMVYSLLSEPTIRYEMINA-AINEVKKM-QGPLKITLRSLD 419 (440) Q Consensus 379 L~~~l~~ll~d~~~~~~~~~~-a~~~i~~~-~ga~~~~~~~i~ 419 (440) |.+++...+.++....+..++ ...+..-+ ..+++|+++.|. T Consensus 143 L~~~i~~~~~~~~~~~~~~~~~~~~~~~~~DG~s~eRv~~~I~ 185 (186) T pfam04464 143 LLDALKNYMENDEEYAEKRRAFRDKFFPYDDGKSSERVLNRIF 185 (186) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 9999999875776779999999998288758808999999983 No 87 >COG1887 TagB Putative glycosyl/glycerophosphate transferases involved in teichoic acid biosynthesis TagF/TagB/EpsJ/RodC [Cell envelope biogenesis, outer membrane] Probab=97.49 E-value=0.0097 Score=37.69 Aligned_cols=223 Identities=13% Similarity=0.139 Sum_probs=133.5 Q ss_pred HHHHCCCCCCEEEECCHHHHHHHHH-HCC--CCEEEECCCCCCCCCCCCCHHHHHH-HH---HHCCCCCEEECCCCCCCC Q ss_conf 2331026442366415778889986-088--5058720431012344221256899-99---861798617504655332 Q gi|254780402|r 175 FSKKIFSQFSLVIVQSERYFRRYKE-LGA--QKLIVSGNLKIDTESLPCDKELLSL-YQ---ESIAGRYTWAAISTFEGE 247 (440) Q Consensus 175 l~~~~~~~~~~i~~qs~~~~~rl~~-lG~--~~i~v~GnlK~d~~~~~~~~~~~~~-~~---~~~~~r~v~vagSth~~E 247 (440) .........|.+.+.+..+..-+.+ .|. +++..+|=.+.|............. .. ....+++|++.++||.+. T Consensus 141 ~~~~~~~~~dy~~~~~~~~~~if~~~f~~~~~~i~~~G~Pr~D~~~~~~~~~~~~~~~~~~~~~~~~k~vIlyaPTfr~~ 220 (388) T COG1887 141 EFIYVRNHWDYLISPNPESTAIFAEAFNIDKENILETGYPRNDKLFDEAGKTEDILLIQLALPLPQDKKVILYAPTFRDN 220 (388) T ss_pred CCEEEEEECCCEECCCCHHHHHHHHHHCCCCCCEEECCCCCCHHHHHHCCHHHHHHHHHHHCCCCCCCCEEEECCCCCCC T ss_conf 53576521030311781066899987053300044336585056544122146678877604775557769965765788 Q ss_pred C--------CCCCCCCCCCCCC--CCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHH Q ss_conf 1--------0000000000123--41248897055211478887765300101100134444466416750676520255 Q gi|254780402|r 248 E--------DKAVYVHNFIKCR--TDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFY 317 (440) Q Consensus 248 E--------~~il~a~~~l~~~--~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~l 317 (440) + +..+...+..+.. .+..+|+-| ||-..+.+..... ..+..+.+.| --+...+ T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~ii~k~-Hp~is~~~~~~~~---------------~~~~~~~vs~-~~di~dl 283 (388) T COG1887 221 DVLIGTQFFNLDIDIEKLKEKLGENEYVIIVKP-HPLISDKIDKRYA---------------LDDFVLDVSD-NADINDL 283 (388) T ss_pred CCCCCCHHHCCHHHHHHHHHHHCCCCEEEEEEC-CHHHCCCCHHHHC---------------CCCCEEECCC-CHHHHHH T ss_conf 554330110001459999876166876999955-8534042100020---------------3660465466-3449999 Q ss_pred HHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCC----CCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHH Q ss_conf 422082066135544454583768961994999898----4437999999997898699579899999999995898999 Q gi|254780402|r 318 LRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPN----VENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIR 393 (440) Q Consensus 318 Y~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~----~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~ 393 (440) |..+|+.+--=|-+ ++|.|.+.+||++.-. +...+..+.+.....=+..+.+.+++..++..-..+++.+ T Consensus 284 l~~sDiLITDySSv------~fdf~~l~KPiify~~D~~~y~~~rg~~~d~~~~~Pg~~~~~~~~li~ai~~~~~~~~~~ 357 (388) T COG1887 284 LLVSDILITDYSSV------IFDFMLLDKPIIFYTYDLEQYDELRGFYLDYKFEAPGEVVETQEELIDAIKPYDEDGNYD 357 (388) T ss_pred HHHCCEEEECCCCH------HHHHHHHCCCEEEEECCCHHHHHCCCHHHHHHHCCCCCHHCCHHHHHHHHHHHHCCCHHH T ss_conf 87548888516300------466886559679996470475540112445776199511036899999877531022156 Q ss_pred HHHHHHHHHHHHHC--CCHHHHHHHHHHH Q ss_conf 99999999999704--5569999999997 Q gi|254780402|r 394 YEMINAAINEVKKM--QGPLKITLRSLDS 420 (440) Q Consensus 394 ~~~~~~a~~~i~~~--~ga~~~~~~~i~~ 420 (440) .+.-....+.+.+. ..+++|+++.+-+ T Consensus 358 ~~k~~~~~~~~~~~~dg~ss~ri~~~i~~ 386 (388) T COG1887 358 LEKLRVFNDKFNSYEDGRSSERILKLIFK 386 (388) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHC T ss_conf 88898777775123473077899999850 No 88 >TIGR01426 MGT glycosyltransferase, MGT family; InterPro: IPR006326 These sequences belong to the MGT (macroside glycosyltransferase) subfamily of the UDP-glucuronosyltransferase family. Members include a number of glucosyl transferases for macrolide antibiotic inactivation, but also include transferases of glucose-related sugars for macrolide antibiotic production. ; GO: 0016758 transferase activity transferring hexosyl groups, 0016999 antibiotic metabolic process. Probab=97.44 E-value=0.0016 Score=42.74 Aligned_cols=110 Identities=15% Similarity=0.251 Sum_probs=87.0 Q ss_pred EEEECCCCCCHHHHHHCCCEEEECCCCCCCCC-CHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEEC----CHHHH Q ss_conf 67506765202554220820661355444545-83768961994999898443799999999789869957----98999 Q gi|254780402|r 305 IFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQ-NPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVE----EVGTL 379 (440) Q Consensus 305 V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGh-N~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~----d~~eL 379 (440) |.+=+-+=. ......||. |+-+||- +..|+++.|+|+|.=|-+....-.++...+.|++..+. +++.| T Consensus 314 v~VR~~VPq-~evL~~A~l------fvTHgGmnSt~EaL~~gVP~va~P~~adQ~~~A~R~~ELGlg~~l~~e~vTa~~L 386 (429) T TIGR01426 314 VEVRRWVPQ-LEVLEKADL------FVTHGGMNSTMEALAAGVPLVAVPQGADQPMTARRIAELGLGRVLPKEEVTAEKL 386 (429) T ss_pred EEEECCCCH-HHHHHHHHH------HHHCCCCHHHHHHHHCCCCEEEECCCCCCHHHHHHHHHCCCEEECCCCCCCHHHH T ss_conf 788546562-778988888------8631660158999964996898517888013765751356211137655278999 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 99999995898999999999999997045569999999997523 Q gi|254780402|r 380 ADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYVN 423 (440) Q Consensus 380 ~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l~ 423 (440) -+.+.++++|++.+++..+=-.+ .+.+|.+.|-.+.|+++|- T Consensus 387 R~~v~~v~~D~~~~~~~~~~r~~--~~eAGG~~rAAdeiE~~l~ 428 (429) T TIGR01426 387 REAVLAVLSDDEYLERLKKIRAE--IREAGGARRAADEIEGFLA 428 (429) T ss_pred HHHHHHHHCCHHHHHHHHHHHHH--HHHCCCHHHHHHHHHHHHC T ss_conf 99999860588899999999999--9850453389999999746 No 89 >COG1819 Glycosyl transferases, related to UDP-glucuronosyltransferase [Carbohydrate transport and metabolism / Signal transduction mechanisms] Probab=97.39 E-value=0.01 Score=37.48 Aligned_cols=110 Identities=17% Similarity=0.160 Sum_probs=80.2 Q ss_pred EEEEECCCCCCHHHHHHCCCEEEECCCCCCCCC-CHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEEC----CHHH Q ss_conf 167506765202554220820661355444545-83768961994999898443799999999789869957----9899 Q gi|254780402|r 304 DIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQ-NPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVE----EVGT 378 (440) Q Consensus 304 ~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGh-N~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~----d~~e 378 (440) .+.+.| ..-...+...||+++ .+||| ++.|++.+|+|++.=|...........+.+.|++.... +++. T Consensus 285 n~~v~~-~~p~~~~l~~ad~vI------~hGG~gtt~eaL~~gvP~vv~P~~~DQ~~nA~rve~~G~G~~l~~~~l~~~~ 357 (406) T COG1819 285 NVIVAD-YVPQLELLPRADAVI------HHGGAGTTSEALYAGVPLVVIPDGADQPLNAERVEELGAGIALPFEELTEER 357 (406) T ss_pred CEEEEC-CCCHHHHHHCCCEEE------ECCCHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCCEECCCCCCCHHH T ss_conf 747861-575898874079899------1798579999997399989827873078799999974988312756588899 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 99999999589899999999999999704556999999999752 Q gi|254780402|r 379 LADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYV 422 (440) Q Consensus 379 L~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l 422 (440) |.+++.+.|+|+..++...+.+.. .++..| .++..+.|+... T Consensus 358 l~~av~~vL~~~~~~~~~~~~~~~-~~~~~g-~~~~a~~le~~~ 399 (406) T COG1819 358 LRAAVNEVLADDSYRRAAERLAEE-FKEEDG-PAKAADLLEEFA 399 (406) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHH-HHHCCC-HHHHHHHHHHHH T ss_conf 999999996739999999999999-765553-799999999998 No 90 >COG0859 RfaF ADP-heptose:LPS heptosyltransferase [Cell envelope biogenesis, outer membrane] Probab=97.32 E-value=0.016 Score=36.30 Aligned_cols=259 Identities=15% Similarity=0.106 Sum_probs=124.5 Q ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCC-CC---C---HHHHHHHHHHCC Q ss_conf 7799981789999999999999986199899996277630012101477416885055-72---0---134455744202 Q gi|254780402|r 57 PLIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPL-DI---Q---PAVSRFLKYWKP 129 (440) Q Consensus 57 ~~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~-D~---~---~~~~~fl~~~~P 129 (440) +..-+.-.++|.+..+.|+++.|++.+|+.-++..++.+..-+-+..|....+...+. +. . ....+.++.-+- T Consensus 3 kIliir~~~iGD~vlt~p~~~~lk~~~P~a~i~~~~~~~~~~i~~~~p~I~~vi~~~~~~~~~~~~~~~~l~~~lr~~~y 82 (334) T COG0859 3 KILVIRLSKLGDVVLTLPLLRTLKKAYPNAKIDVLVPKGFAPILKLNPEIDKVIIIDKKKKGLGLKERLALLRTLRKERY 82 (334) T ss_pred EEEEEECCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHCCCCHHEEEECCCCCCCHHHHHHHHHHHHHCCCCC T ss_conf 79999557401377699999999987899779999554314677559524168511211212138999999998314686 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHH----CCCCE Q ss_conf 03564047621467888742245505864126654212100111123310264423664157788899860----88505 Q gi|254780402|r 130 DCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKEL----GAQKL 205 (440) Q Consensus 130 ~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~l----G~~~i 205 (440) |+++-+-+.+--.++.... ++|...-=.+ ++.+ + .+......+ .-....+...+++..+ |.... T Consensus 83 D~vidl~~~~ksa~l~~~~--~~~~r~g~~~---~~~r-~----~~~~~~~~~--~~~~~~~~~~~~~~~l~~~~~~~~~ 150 (334) T COG0859 83 DAVIDLQGLLKSALLALLL--GIPFRIGFDK---KSAR-E----LLLNKFYPR--LDKPEGQHVVERYLALLEDLGLYPP 150 (334) T ss_pred CEEEECHHHHHHHHHHHHH--CCCCEECCCC---HHHH-H----HHHHHCCCC--CCCCCHHHHHHHHHHHHHHHCCCCC T ss_conf 7899740258899999973--8983631450---7777-7----777630244--5654024599999999876079876 Q ss_pred EEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEEC--C-CC-----CCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHH Q ss_conf 87204310123442212568999986179861750--4-65-----5332100000000001234124889705521147 Q gi|254780402|r 206 IVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAA--I-ST-----FEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCD 277 (440) Q Consensus 206 ~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~va--g-St-----h~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~ 277 (440) . ........+.............. ++.++. | |. |+.| ...+..+.+.+.. ..+++.. .++..+ T Consensus 151 ~----~~~~~~~~~~~~~~~~~~~~~~~-~~~i~i~pg~s~~~~K~wp~e--~~~~l~~~l~~~~-~~Vvl~g-~~~e~e 221 (334) T COG0859 151 P----EPQLDFPLPRPPIELAKNLAKFD-RPYIVINPGASRGSAKRWPLE--HYAELAELLIAKG-YQVVLFG-GPDEEE 221 (334) T ss_pred C----CCCCCCCCCCCHHHHHHHHHHCC-CCEEEEECCCCCCCCCCCCHH--HHHHHHHHHHHCC-CEEEEEE-CHHHHH T ss_conf 6----77555455547888986542037-987999647346677889999--9999999999769-9899940-878999 Q ss_pred HHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCC-CCCHHHHHHHCCCEE--ECCCC Q ss_conf 888776530010110013444446641675067652025542208206613554445-458376896199499--98984 Q gi|254780402|r 278 AIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASG-GQNPLEAAMLGCAIL--SGPNV 354 (440) Q Consensus 278 ~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~G-GhN~lEpaa~G~pVi--~GP~~ 354 (440) ...++....+-... +.--.++.++..+-+.||+ |||+ +.| .|= |+++|+|+| +||.- T Consensus 222 ~~~~i~~~~~~~~~-------------l~~k~sL~e~~~li~~a~l-~I~~---DSg~~Hl---AaA~~~P~I~iyg~t~ 281 (334) T COG0859 222 RAEEIAKGLPNAVI-------------LAGKTSLEELAALIAGADL-VIGN---DSGPMHL---AAALGTPTIALYGPTS 281 (334) T ss_pred HHHHHHHHCCCCCC-------------CCCCCCHHHHHHHHHCCCE-EECC---CCHHHHH---HHHCCCCEEEEECCCC T ss_conf 99999973676612-------------1799999999999966989-9914---8879999---9873998899988987 Q ss_pred CC Q ss_conf 43 Q gi|254780402|r 355 EN 356 (440) Q Consensus 355 ~n 356 (440) .+ T Consensus 282 ~~ 283 (334) T COG0859 282 PA 283 (334) T ss_pred CC T ss_conf 55 No 91 >TIGR00236 wecB UDP-N-acetylglucosamine 2-epimerase; InterPro: IPR003331 UDP-N-acetylglucosamine 2-epimerase 5.1.3.14 from EC catalyses the production of UDP-ManNAc from UDP-GlcNAc. Some of the enzymes is this family are bifunctional. In microorganisms the epimerase is involved in in the synthesis of the capsule precursor UDP-ManNAcA , . The protein from rat liver displays both epimerase and kinase activity .; GO: 0008761 UDP-N-acetylglucosamine 2-epimerase activity, 0006047 UDP-N-acetylglucosamine metabolic process, 0009103 lipopolysaccharide biosynthetic process. Probab=97.24 E-value=0.0054 Score=39.35 Aligned_cols=330 Identities=18% Similarity=0.201 Sum_probs=187.1 Q ss_pred CHHHHHHHHHHHHHHHHC-CCCEEEEECCCCC-CHHHHCCCC-------CCEEEEECCC--------CCCHHHHHHHHHH Q ss_conf 899999999999999861-9989999627763-001210147-------7416885055--------7201344557442 Q gi|254780402|r 65 SVGETMALIGLIPAIRSR-HVNVLLTTMTATS-AKVARKYLG-------QYAIHQYAPL--------DIQPAVSRFLKYW 127 (440) Q Consensus 65 SvGE~~~a~pli~~l~~~-~~~iliT~~T~tg-~~~~~~~~~-------~~~~~~y~P~--------D~~~~~~~fl~~~ 127 (440) +--|.....|+++.+.+. +.+.-++.+...- .++....+. +...-..-|- ....-....+... T Consensus 9 ~~p~~~~~~p~~~~~~~~p~~~~~~~~~~~h~~~~~~~~~~~~~~~~~p~~~~~~~~~g~~~~~~~~~~l~~~~~~~~~~ 88 (380) T TIGR00236 9 TRPEAIKLAPLIRALLKDPGIDWKVLHTGQHRDEEMLDQVLDLFSLPNPDYDLNIGSPGQTLGEITGGLLEGLEELLLEE 88 (380) T ss_pred CCCHHHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHC T ss_conf 76203566578999862688522567503200146789999886225754444235766304778888887678886414 Q ss_pred CCEEEEEECCCCHHHHHH--HHHHCCCCEEEEEECCCCCCCCCCHHHH-HHHHHCCCCC-CEEEECCHHHHHHHHHHCC- Q ss_conf 020356404762146788--8742245505864126654212100111-1233102644-2366415778889986088- Q gi|254780402|r 128 KPDCMILSESDIWPLTVF--ELSKQRIPQVLVNARMSRRSFKNWKTVL-SFSKKIFSQF-SLVIVQSERYFRRYKELGA- 202 (440) Q Consensus 128 ~P~~~i~~e~ElWPnli~--~~~~~~ip~~linarls~~S~~~~~~~~-~l~~~~~~~~-~~i~~qs~~~~~rl~~lG~- 202 (440) +|++++.- +|.--.+-. .+....||+.-+.+-+ +++.+|.-++ ..-+.+...+ +..++.++....++..-|. T Consensus 89 ~p~~~~~~-gd~~~~~~~~l~~~~~~~~~gh~~~gl--~~~~~~~p~p~~~~~~~~~~~~~~~~~p~~~~~~~l~~~~~~ 165 (380) T TIGR00236 89 KPDVVLVQ-GDTTTTLAGALAAFYLQIPVGHVEAGL--RTGDLYSPFPEELNRVLTGHIAKLHFAPTELAKENLLREGVE 165 (380) T ss_pred CCCEEEEE-CCCHHHHHHHHHHHHHHCCHHHHHHCC--CCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCC T ss_conf 88678971-662146777887776530101332011--124455776056777898877764203215667777630555 Q ss_pred -CCEEEECCCCCCCCCCCCCHH-HHHHHHHHCC-------CCCEEECCCCCCCCC-------CCCCCCC-CCCCCCCCEE Q ss_conf -505872043101234422125-6899998617-------986175046553321-------0000000-0001234124 Q gi|254780402|r 203 -QKLIVSGNLKIDTESLPCDKE-LLSLYQESIA-------GRYTWAAISTFEGEE-------DKAVYVH-NFIKCRTDVL 265 (440) Q Consensus 203 -~~i~v~GnlK~d~~~~~~~~~-~~~~~~~~~~-------~r~v~vagSth~~EE-------~~il~a~-~~l~~~~~~~ 265 (440) ..+.++||+-.|......+.. .......... +..+ ++...|..|. +-++.+. +....+++ . T Consensus 166 ~~~~~~~g~~~~d~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~-~~~~~h~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 243 (380) T TIGR00236 166 PGKIFVTGNTVIDALLTNLEIAEPSSPVLSEFGENYPFVPDDRV-LLLTLHRRENVGNPEPLENILEAILEILEEFPD-V 243 (380) T ss_pred CCEEEEECHHHHHHHHHHHHHHHHCCCCHHHHCCCCCCCCCCCE-EEEEEHHHHCCCCHHHHHHHHHHHHHHHHHCCC-C T ss_conf 45057622056777877665421015502342256773204544-465400110467601468999999998741243-0 Q ss_pred EEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHC Q ss_conf 88970552114788877653001011001344444664167506765202554220820661355444545837689619 Q gi|254780402|r 266 TIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLG 345 (440) Q Consensus 266 lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G 345 (440) -++.|-|+....... ..|+ .........|.+++.+|.+.........-++- .+.||- --|+..+| T Consensus 244 ~~~~p~h~~~~~~~~----~~~~-------~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~l---~d~gg~-~~~~~~~~ 308 (380) T TIGR00236 244 AIVFPVHPNPKVREP----EFGL-------YKALGDAKRVFLIEPLGYLDFLLLLSNAYLIL---TDSGGL-QEEAPSLG 308 (380) T ss_pred EEEEECCCCCCCCHH----HHHH-------HHHHCCCCCEEEECCHHHHHHHHHHHCCEEEE---ECCCCC-CHHHHHCC T ss_conf 356621456421002----3104-------66632566357755134788888753241687---347763-00112127 Q ss_pred CCEEECCCCCCHHHHHHHHHHCCCEEEE-CCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 9499989844379999999978986995-79899999999995898999999999999997045569999999997 Q gi|254780402|r 346 CAILSGPNVENFRDIYRRMVSSGAVRIV-EEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDS 420 (440) Q Consensus 346 ~pVi~GP~~~nf~e~~~~L~~~g~~~~v-~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~ 420 (440) +|++.=.....-++ ..++|+...+ .|.+.+......++++|+.+++|.....-+ . ...+.+++++.+.. T Consensus 309 ~p~~~~~~~~~~p~----~~~~g~~~l~g~~~~~~~~~~~~~l~~p~~~~~~~~~~~p~-g-~g~~~~~~~~~~~~ 378 (380) T TIGR00236 309 KPVLVLRDTTERPE----AVEAGTNKLVGTDKEKITKAAGRLLSDPDEYRKMSNAENPY-G-DGEASERIVEELLN 378 (380) T ss_pred CCEEEEECCCCCCC----HHHCCCEEEECCCHHHHHHHHHHHHCCHHHHHHHHHHCCCC-C-CCHHHHHHHHHHHH T ss_conf 74377503566642----01003200104667889999998731726788886412656-7-74146899999860 No 92 >PRK09814 hypothetical protein; Provisional Probab=96.92 E-value=0.038 Score=33.77 Aligned_cols=247 Identities=14% Similarity=0.099 Sum_probs=138.9 Q ss_pred HHHHHHHHHCCEEEEEECCCCHHH------HHHHHHHCCCCEEEEEECCCCCCCCCCHH---HHHHHHHCCCCCCEEEEC Q ss_conf 344557442020356404762146------78887422455058641266542121001---111233102644236641 Q gi|254780402|r 119 AVSRFLKYWKPDCMILSESDIWPL------TVFELSKQRIPQVLVNARMSRRSFKNWKT---VLSFSKKIFSQFSLVIVQ 189 (440) Q Consensus 119 ~~~~fl~~~~P~~~i~~e~ElWPn------li~~~~~~~ip~~linarls~~S~~~~~~---~~~l~~~~~~~~~~i~~q 189 (440) .+.+.+...+..-++++-+..|.. ++..++++|++++++==-+ -|+|...- ...-.-.+|+.+|.+++. T Consensus 54 ~~~~i~~~l~~gDivi~QyP~~~~~~~~~~l~~~lk~~~~K~i~~IHDi--e~LR~~~~~~~~~~~ei~~ln~aD~iIvh 131 (337) T PRK09814 54 RIDGILASLNRGDIVVFQYPTYNGFEFDLLFIDKLKKKQIKIIILIHDI--EPLRFGKSNYYLMKEEIDMLNLSDVVIVH 131 (337) T ss_pred HHHHHHHCCCCCCEEEEECCCCCHHHHHHHHHHHHHHCCCEEEEEECCC--HHHHCCCCCHHHHHHHHHHHHHCCEEEEC T ss_conf 9999995779999999978987078899999999997598799997765--89865677703679999998528999979 Q ss_pred CHHHHHHHHHHCCC--CEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCCC-CCCEEE Q ss_conf 57788899860885--058720431012344221256899998617986175046553321000000000012-341248 Q gi|254780402|r 190 SERYFRRYKELGAQ--KLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNFIKC-RTDVLT 266 (440) Q Consensus 190 s~~~~~rl~~lG~~--~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~l~~-~~~~~l 266 (440) ++...+.|++-|++ ++...|= ||-..+... +. .--++.|..||+-.+. . .|.+ .++..+ T Consensus 132 n~~M~~~L~~~G~~~~kiv~l~i--fDYl~~~~~-~~------~~~~k~I~fAGNL~Ks--~-------fL~~~~~~~~l 193 (337) T PRK09814 132 NKKMTDWLVEEGVTVDKIIDQEI--FDYLVDIDL-ET------PSFQKKIIFAGNLSKS--P-------FLKKWSENIKL 193 (337) T ss_pred CHHHHHHHHHCCCCCCCEEEECC--CCCCCCCCC-CC------CCCCCEEEEECCHHHH--H-------HHHHCCCCCCE T ss_conf 99999999977998686478578--775887555-66------5677439981672434--6-------78847668877 Q ss_pred EEE-ECCCCHHHHHHHHHHHHCCHHHHHCCCCCCC---CCCEEEEECC------CCCCHHHHHHCCCEEEECCCCCCCCC Q ss_conf 897-0552114788877653001011001344444---6641675067------65202554220820661355444545 Q gi|254780402|r 267 IIV-PRHPRRCDAIERRLIAKGLKVARRSRGDVIN---AEVDIFLGDT------IGEMGFYLRMTEIAFIGRSFCASGGQ 336 (440) Q Consensus 267 IIv-PRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~---~~~~V~i~Dt------~GeL~~lY~~AdiafVGGSl~~~GGh 336 (440) -+- |...+. +..+.+.-.|. .+..+.+. ...=-+++|. -|..+.++++-. -| T Consensus 194 ~lYG~~~~~~--~~~~nv~Y~G~----~~pdeL~~~l~~g~FGLVWDgds~~tc~G~~g~Ylk~Nn------------PH 255 (337) T PRK09814 194 NVFGPGPSSL--ESSANVIYKGS----FDPEELPLELSKGGFGLVWDGDSSETNDGEYGEYYKYNN------------PH 255 (337) T ss_pred EEECCCCCCC--CCCCCEEEECC----CCHHHHHHHHHCCCEEEEECCCCCCCCCCCCCCHHEEEC------------CH T ss_conf 9978885455--66566379476----681888998714985689769865346786211001315------------05 Q ss_pred CHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 8376896199499989844379999999978986995798999999999958989999999999999970455 Q gi|254780402|r 337 NPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQG 409 (440) Q Consensus 337 N~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~g 409 (440) =.-==+++|.|||. ++-...++-..+.|.++.|++-+|+.+.+..+ +.+...+|.+|+.++.++-+. T Consensus 256 K~SLYLaaGiPVIV----W~~aAla~fV~~n~lGi~V~sL~ei~~~l~~i--t~~eY~~m~~Nv~~i~~klr~ 322 (337) T PRK09814 256 KLSLYLAAGLPVIV----WSKAAIADFIVENNLGFVVDSLEELPEKINNM--TEEEYNEMVENVKKIAQKLRS 322 (337) T ss_pred HHHHHHHCCCCEEE----CCCHHHHHHHHHCCCEEEECCHHHHHHHHHHC--CHHHHHHHHHHHHHHHHHHHC T ss_conf 78899874998898----27315678999769769987899999999858--999999999999999999854 No 93 >cd03788 GT1_TPS Trehalose-6-Phosphate Synthase (TPS) is a glycosyltransferase that catalyses the synthesis of alpha,alpha-1,1-trehalose-6-phosphate from glucose-6-phosphate using a UDP-glucose donor. It is a key enzyme in the trehalose synthesis pathway. Trehalose is a nonreducing disaccharide present in a wide variety of organisms and may serve as a source of energy and carbon. It is characterized most notably in insect, plant, and microbial cells. Its production is often associated with a variety of stress conditions, including desiccation, dehydration, heat, cold, and oxidation. This family represents the catalytic domain of the TPS. Some members of this domain family coexist with a C-terminal trehalose phosphatase domain. Probab=96.85 E-value=0.044 Score=33.38 Aligned_cols=275 Identities=15% Similarity=0.220 Sum_probs=143.6 Q ss_pred HHHHHHHHHCCEEEEEE-CCCCH--HHHHHHHHHCCCCEEEE-EECC-CCCCCCCCHHHHHHHHHCCCCCCEEEECCHHH Q ss_conf 34455744202035640-47621--46788874224550586-4126-65421210011112331026442366415778 Q gi|254780402|r 119 AVSRFLKYWKPDCMILS-ESDIW--PLTVFELSKQRIPQVLV-NARM-SRRSFKNWKTVLSFSKKIFSQFSLVIVQSERY 193 (440) Q Consensus 119 ~~~~fl~~~~P~~~i~~-e~ElW--Pnli~~~~~~~ip~~li-narl-s~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~ 193 (440) ++++..+.++|+-+||+ ++-++ |.++++... .+++.+. -.-+ |.--|+-..+-..+.+.++. .|.|.-|+..+ T Consensus 121 FA~~v~~~~~~~D~VWVHDYhL~llP~~LR~~~~-~~~igfFlHiPFPs~eifr~LP~r~eil~glL~-~DlIGF~t~~y 198 (460) T cd03788 121 FADAIAEVLRPGDLVWVHDYHLLLLPQMLRERGP-DARIGFFLHIPFPSSEIFRCLPWREELLRGLLG-ADLIGFQTERY 198 (460) T ss_pred HHHHHHHHCCCCCEEEEECCHHHHHHHHHHHHCC-CCEEEEEECCCCCCHHHHHHCCCHHHHHHHHHH-CCEEEECCHHH T ss_conf 9999998568998799964167768999998589-984898870799998999769767999999874-77666468899 Q ss_pred HHHHHH-----HCCCC-----EEEECC-CC------------CCC-CCCCCCHHHHHHHHHHCCCCCEEECCCCC---CC Q ss_conf 889986-----08850-----587204-31------------012-34422125689999861798617504655---33 Q gi|254780402|r 194 FRRYKE-----LGAQK-----LIVSGN-LK------------IDT-ESLPCDKELLSLYQESIAGRYTWAAISTF---EG 246 (440) Q Consensus 194 ~~rl~~-----lG~~~-----i~v~Gn-lK------------~d~-~~~~~~~~~~~~~~~~~~~r~v~vagSth---~~ 246 (440) +++|.+ +|.+. +...|. ++ |+. ...+...+....+++..+++++++..-.- +| T Consensus 199 ~r~Fl~~~~r~l~~~~~~~~~v~~~gr~v~v~~~PigId~~~~~~~~~~~~~~~~~~~l~~~~~~~~li~gvDRlDy~KG 278 (460) T cd03788 199 ARNFLSCCSRLLGLEVTDDGGVEYGGRRVRVGAFPIGIDPDAFRKLAASPEVQERAAELRERLGGRKLIVGVDRLDYSKG 278 (460) T ss_pred HHHHHHHHHHHCCCCCCCCCEEEECCEEEEEEEEECCCCHHHHHHHHCCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCC T ss_conf 99999999997098524798599999899998980401889999986283247999999987379719996253211148 Q ss_pred CCCCCCCCC-CCCCCCCCE----EE--EEEECCC--CHHHHHHHHHHHHCCHHHHH-CCCCCCCCCCEEEEEC--CCCCC Q ss_conf 210000000-000123412----48--8970552--11478887765300101100-1344444664167506--76520 Q gi|254780402|r 247 EEDKAVYVH-NFIKCRTDV----LT--IIVPRHP--RRCDAIERRLIAKGLKVARR-SRGDVINAEVDIFLGD--TIGEM 314 (440) Q Consensus 247 EE~~il~a~-~~l~~~~~~----~l--IIvPRhp--eR~~~i~~~l~~~gl~~~~~-S~~~~~~~~~~V~i~D--t~GeL 314 (440) =.+ -+.|+ +.|+++|+. .| |.+|... +.-+++.+.++.. +.+. .+...++-..-.|+-. ...|+ T Consensus 279 i~~-kl~Afe~fL~~~Pe~~~kvvlvQia~psr~~v~~y~~l~~~i~~~---v~~IN~~fg~~~w~PI~y~~~~~~~~el 354 (460) T cd03788 279 IPE-RLLAFERLLERYPEWRGKVVLVQIAVPSRTDVPEYQELRREVEEL---VGRINGKFGTLDWTPVRYLYRSLPREEL 354 (460) T ss_pred HHH-HHHHHHHHHHHCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHH---HHHHHHCCCCCCCEEEEEEECCCCHHHH T ss_conf 788-999999999869143377799999258776755789999999999---9998614378997029999179999999 Q ss_pred HHHHHHCCCEEEECCCCCCCCCCH--HHHHHHCC---CEEECCCCCCHHHHHHHHHHCCCEEEEC--CHHHHHHHHHHHH Q ss_conf 255422082066135544454583--76896199---4999898443799999999789869957--9899999999995 Q gi|254780402|r 315 GFYLRMTEIAFIGRSFCASGGQNP--LEAAMLGC---AILSGPNVENFRDIYRRMVSSGAVRIVE--EVGTLADMVYSLL 387 (440) Q Consensus 315 ~~lY~~AdiafVGGSl~~~GGhN~--lEpaa~G~---pVi~GP~~~nf~e~~~~L~~~g~~~~v~--d~~eL~~~l~~ll 387 (440) ..+|+.||++.|- |+ +.|.|. .|-.++.. .|++ .+.|...++.| +++..|+ |.+++++++..-| T Consensus 355 ~aly~~ADv~lVT-~l--rDGMNLvakEyva~q~~~~GvLI---LSefaGaa~~L---~~Al~VNP~d~~~~a~ai~~AL 425 (460) T cd03788 355 AALYRAADVALVT-PL--RDGMNLVAKEYVACQDDDPGVLI---LSEFAGAAEEL---SGALLVNPYDIDEVADAIHRAL 425 (460) T ss_pred HHHHHHHHEEEEC-CC--CCCCCCCCHHHEEEECCCCCEEE---EECCCCCHHHH---CCCEEECCCCHHHHHHHHHHHH T ss_conf 9999861057855-34--23325341311367559995599---96552436672---8787979999899999999997 Q ss_pred C-CHHHHHHHHHHHHHHHHHCC Q ss_conf 8-98999999999999997045 Q gi|254780402|r 388 S-EPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 388 ~-d~~~~~~~~~~a~~~i~~~~ 408 (440) + .++++++--++-++++.++. T Consensus 426 ~M~~~Er~~R~~~l~~~v~~~~ 447 (460) T cd03788 426 TMPLEERRERHRKLREYVRTHD 447 (460) T ss_pred CCCHHHHHHHHHHHHHHHHHCC T ss_conf 5999999999999999988579 No 94 >KOG2941 consensus Probab=96.74 E-value=0.053 Score=32.86 Aligned_cols=205 Identities=13% Similarity=0.095 Sum_probs=117.0 Q ss_pred HCCCCC-CEEEECCHHHHHHHH-HHCCCCEEEECCCCCCCCC---------------------------CCCCHHHHHHH Q ss_conf 102644-236641577888998-6088505872043101234---------------------------42212568999 Q gi|254780402|r 178 KIFSQF-SLVIVQSERYFRRYK-ELGAQKLIVSGNLKIDTES---------------------------LPCDKELLSLY 228 (440) Q Consensus 178 ~~~~~~-~~i~~qs~~~~~rl~-~lG~~~i~v~GnlK~d~~~---------------------------~~~~~~~~~~~ 228 (440) ..|.++ +.-+|.+.+..+.+. .-|+.+..|.-| +++ +.....+...+ T Consensus 164 ~~fgk~a~~nLcVT~AMr~dL~qnWgi~ra~v~YD----rPps~~~~l~~~H~lf~~l~~d~~~f~ar~~q~~~~~~taf 239 (444) T KOG2941 164 KYFGKLADYNLCVTKAMREDLIQNWGINRAKVLYD----RPPSKPTPLDEQHELFMKLAGDHSPFRAREPQDKALERTAF 239 (444) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHCCCCEEEEEEC----CCCCCCCCHHHHHHHHHHHCCCCCHHHHCCCCCCHHHHHHH T ss_conf 98613343222458889999987338740489822----79888985567788886533555424422433301224567 Q ss_pred HH--------HCCCCCEEECCCC-CCCCCC--CCCCCCCC------CC--CCCCEEEEEEECCCCHHHHHHHHHHHHCCH Q ss_conf 98--------6179861750465-533210--00000000------01--234124889705521147888776530010 Q gi|254780402|r 229 QE--------SIAGRYTWAAIST-FEGEED--KAVYVHNF------IK--CRTDVLTIIVPRHPRRCDAIERRLIAKGLK 289 (440) Q Consensus 229 ~~--------~~~~r~v~vagSt-h~~EE~--~il~a~~~------l~--~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~ 289 (440) .+ ....|+.++..|| |.++|+ |+++|... .+ ..|.++.||....|.+. ...+.+++..+. T Consensus 240 ~~k~~s~~v~~~~~~pallvsSTswTpDEdf~ILL~AL~~y~~~~~~~~~~lP~llciITGKGPlkE-~Y~~~I~~~~~~ 318 (444) T KOG2941 240 TKKDASGDVQLLPERPALLVSSTSWTPDEDFGILLEALVIYEEQLYDKTHNLPSLLCIITGKGPLKE-KYSQEIHEKNLQ 318 (444) T ss_pred HHHCCCCHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCCHHH-HHHHHHHHHCCC T ss_conf 6513441233146787589953787876327899999986455540135799737999927883168-999998771532 Q ss_pred HHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCC---CHHHHHHHCCCEEECCCCCCHHHHHHHHHH Q ss_conf 11001344444664167506765202554220820661355444545---837689619949998984437999999997 Q gi|254780402|r 290 VARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQ---NPLEAAMLGCAILSGPNVENFRDIYRRMVS 366 (440) Q Consensus 290 ~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGh---N~lEpaa~G~pVi~GP~~~nf~e~~~~L~~ 366 (440) -... ..+.- ...+-..+.+.||.-+.=-+. ..|-- -+..--.||.||+. -||+-+-+.... T Consensus 319 ~v~~---~tpWL--------~aEDYP~ll~saDlGVcLHtS-SSGLDLPMKVVDMFGcglPvcA----~~fkcl~ELVkh 382 (444) T KOG2941 319 HVQV---CTPWL--------EAEDYPKLLASADLGVCLHTS-SSGLDLPMKVVDMFGCGLPVCA----VNFKCLDELVKH 382 (444) T ss_pred CEEE---EECCC--------CCCCCHHHHHCCCCCEEEEEC-CCCCCCCHHHHHHHCCCCCEEE----ECCHHHHHHHHC T ss_conf 1036---50223--------211006676314313476304-7666764667776257874366----456538999856 Q ss_pred CCCEEEECCHHHHHHHHHHHHC----CHHHHHHHHHHHHHH Q ss_conf 8986995798999999999958----989999999999999 Q gi|254780402|r 367 SGAVRIVEEVGTLADMVYSLLS----EPTIRYEMINAAINE 403 (440) Q Consensus 367 ~g~~~~v~d~~eL~~~l~~ll~----d~~~~~~~~~~a~~~ 403 (440) ..-+..+.|++||++.+.-+.+ |.+...+..+++++. T Consensus 383 ~eNGlvF~Ds~eLa~ql~~lf~~fp~~a~~l~~lkkn~~e~ 423 (444) T KOG2941 383 GENGLVFEDSEELAEQLQMLFKNFPDNADELNQLKKNLREE 423 (444) T ss_pred CCCCEEECCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 77853750599999999999854999878999999865787 No 95 >PRK10117 trehalose-6-phosphate synthase; Provisional Probab=96.58 E-value=0.068 Score=32.13 Aligned_cols=273 Identities=11% Similarity=0.114 Sum_probs=136.2 Q ss_pred HHHHHHHHCCEEEEEE-CCCCH--HHHHHHHHHCCCCEEEE-EECC-CCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHH Q ss_conf 4455744202035640-47621--46788874224550586-4126-654212100111123310264423664157788 Q gi|254780402|r 120 VSRFLKYWKPDCMILS-ESDIW--PLTVFELSKQRIPQVLV-NARM-SRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYF 194 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~-e~ElW--Pnli~~~~~~~ip~~li-narl-s~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~ 194 (440) +.+....++|+-+||+ .+-++ |.++++..-. .++... ---+ |.--|+...+-..+.+.++. .|.|.-|+..++ T Consensus 114 A~~v~~~~~~~D~VWVHDYHL~llp~~LR~~~~~-~~IgfFlHiPFPs~eifr~LP~r~ell~glL~-aDlIGFqt~~~~ 191 (474) T PRK10117 114 ADKLLPLLQDDDIIWIHDYHLLPFASELRKRGVN-NRIGFFLHIPFPTPEIFNALPPHDELLEQLCD-YDLLGFQTENDR 191 (474) T ss_pred HHHHHHHHCCCCEEEECCCHHHCCHHHHHHHCCC-CCEEEEECCCCCCHHHHHHCCCHHHHHHHHHC-CCEEEECCHHHH T ss_conf 9999986288987998154022132999961899-82798856899997998339966999998750-786787578999 Q ss_pred HHHHHH-----CCC-----CEEEECC--------C-----CCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCC---CCCC Q ss_conf 899860-----885-----0587204--------3-----101234422125689999861798617504655---3321 Q gi|254780402|r 195 RRYKEL-----GAQ-----KLIVSGN--------L-----KIDTESLPCDKELLSLYQESIAGRYTWAAISTF---EGEE 248 (440) Q Consensus 195 ~rl~~l-----G~~-----~i~v~Gn--------l-----K~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth---~~EE 248 (440) ++|.+. +.+ .+...|. + +|..............++..++++++++..--- +|= T Consensus 192 ~~Fl~~~~~~~~~~~~~~~~~~~~Gr~v~v~~~PigID~~~~~~~a~~~~~~~~~~l~~~~~~~~lIlgVDRLDytKGi- 270 (474) T PRK10117 192 LAFLDCLSNQTRVTTRSGKSHTAWGKAFRTEVYPIGIEPDEIAKQAAGPLPPKLAQLKAELKNVQNIFSVERLDYSKGL- 270 (474) T ss_pred HHHHHHHHHHCCCCCCCCCEEEECCEEEEEEEEECEECHHHHHHHHCCHHHHHHHHHHHHHCCCEEEEEECCCCCCCCH- T ss_conf 9999999986186404797599668089999995657599999984142789999999985798499993564232588- Q ss_pred CCCCCCC-CCCCCCCCE----E-EEEE-ECCCCHHH---HHHHHHHHHCCHHHHH-CCCCCCCCCCEEEEECCC--CCCH Q ss_conf 0000000-000123412----4-8897-05521147---8887765300101100-134444466416750676--5202 Q gi|254780402|r 249 DKAVYVH-NFIKCRTDV----L-TIIV-PRHPRRCD---AIERRLIAKGLKVARR-SRGDVINAEVDIFLGDTI--GEMG 315 (440) Q Consensus 249 ~~il~a~-~~l~~~~~~----~-lIIv-PRhpeR~~---~i~~~l~~~gl~~~~~-S~~~~~~~~~~V~i~Dt~--GeL~ 315 (440) ..-+.|+ +.|+++|+. . +-|+ |.. ++.+ ++.+.++.. +.+. ++...++-..-.|+-.++ -+|. T Consensus 271 ~~rl~Afe~fL~~~Pe~~gkvvlvQia~psR-~~v~~Y~~l~~ev~~~---VgrINg~fg~~~w~Pi~y~~~~~~~~~l~ 346 (474) T PRK10117 271 PERFLAYEALLEKYPQHHGKIRYTQIAPTSR-GDVQAYQDIRHQLETE---AGRINGKYGQLGWTPLYYLNQHFDRKLLM 346 (474) T ss_pred HHHHHHHHHHHHHCCCCCCCEEEEEECCCCC-CCHHHHHHHHHHHHHH---HHHHHCCCCCCCCCEEEEEECCCCHHHHH T ss_conf 9999999999985900158679999827886-5539999999999999---99985645889971499982689999999 Q ss_pred HHHHHCCCEEEECCCCCCCCCCH--HHHHHHC---C-CEEECCCCCCHHHHHHHHHHCCCEEEEC--CHHHHHHHHHHHH Q ss_conf 55422082066135544454583--7689619---9-4999898443799999999789869957--9899999999995 Q gi|254780402|r 316 FYLRMTEIAFIGRSFCASGGQNP--LEAAMLG---C-AILSGPNVENFRDIYRRMVSSGAVRIVE--EVGTLADMVYSLL 387 (440) Q Consensus 316 ~lY~~AdiafVGGSl~~~GGhN~--lEpaa~G---~-pVi~GP~~~nf~e~~~~L~~~g~~~~v~--d~~eL~~~l~~ll 387 (440) .+|+.||++.|- |+ +.|.|. .|=.++- . -|++ .+.|..+++.|. | +..|+ |.+++++++.+-| T Consensus 347 alyr~ADv~lVT-pl--RDGMNLVAkEyVa~q~~~~~GvLI---LSeFaGaa~~L~--~-AllVNP~d~~~~A~ai~~AL 417 (474) T PRK10117 347 KIFRYSDVGLVT-PL--RDGMNLVAKEYVAAQDPANPGVLV---LSQFAGAANELT--S-ALIVNPYDRDEVAAALDRAL 417 (474) T ss_pred HHHHHCCEEEEC-CC--CCCCCCCCHHEEEEECCCCCEEEE---EECCCCCHHHHC--C-CEEECCCCHHHHHHHHHHHH T ss_conf 999746689964-54--342355530105785589980399---853413266747--6-86879999799999999997 Q ss_pred C-CHHHHHHHHHHHHHHHHHCC Q ss_conf 8-98999999999999997045 Q gi|254780402|r 388 S-EPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 388 ~-d~~~~~~~~~~a~~~i~~~~ 408 (440) + .+++|++--++.++.+.++- T Consensus 418 ~Mp~~Er~~R~~~l~~~v~~~d 439 (474) T PRK10117 418 TMPLAERISRHAEMLDVIVKND 439 (474) T ss_pred CCCHHHHHHHHHHHHHHHHHCC T ss_conf 6999999999999999998678 No 96 >pfam06258 DUF1022 Protein of unknown function (DUF1022). This family consists of several hypothetical eukaryotic and prokaryotic proteins. The function of this family is unknown. Probab=96.53 E-value=0.0018 Score=42.43 Aligned_cols=172 Identities=17% Similarity=0.238 Sum_probs=88.5 Q ss_pred CCCCCEEEECCHHHHHHHHHHC-CCCEE-EECCCCC-CCCCCCCCHHHHHHHHHHC-CCCC---EEECCCCCC---CCC- Q ss_conf 2644236641577888998608-85058-7204310-1234422125689999861-7986---175046553---321- Q gi|254780402|r 180 FSQFSLVIVQSERYFRRYKELG-AQKLI-VSGNLKI-DTESLPCDKELLSLYQESI-AGRY---TWAAISTFE---GEE- 248 (440) Q Consensus 180 ~~~~~~i~~qs~~~~~rl~~lG-~~~i~-v~GnlK~-d~~~~~~~~~~~~~~~~~~-~~r~---v~vagSth~---~EE- 248 (440) .+.||+|++.... .+- -+||. ..|.+-. +.... .+....+.+.. ..++ +++.|++.. .++ T Consensus 93 ~~~FDliv~P~HD------~~~~g~NVi~t~gal~~i~~~~l---~~~~~~~~~~~~~~~p~i~vLIGG~sk~~~~~~~~ 163 (308) T pfam06258 93 LGRFDLVIAPEHD------GVPPGPNVLLTVGALHRVTPQRL---AEAAAAWPELAALPRPRVAVLVGGPSKHFRWDADA 163 (308) T ss_pred CCCCCCEECCCCC------CCCCCCCEEEECCCCCCCCHHHH---HHHHHHHHHHCCCCCCEEEEEECCCCCCCCCCHHH T ss_conf 1347710257455------88899978962575554787788---77776655402477876999965578788889999 Q ss_pred -CCCCCC-CCCCCCCCCEEEEEE--ECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCC--CCHHHHHHCC Q ss_conf -000000-000012341248897--0552114788877653001011001344444664167506765--2025542208 Q gi|254780402|r 249 -DKAVYV-HNFIKCRTDVLTIIV--PRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIG--EMGFYLRMTE 322 (440) Q Consensus 249 -~~il~a-~~~l~~~~~~~lIIv--PRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~G--eL~~lY~~Ad 322 (440) .-+++. .++.+.. +..++|. +|-|+...+..+.... ....+++.|.-| -...+++.|| T Consensus 164 ~~~l~~~i~~l~~~~-~~~l~it~SRRTP~~~~~~l~~~~~---------------~~~~~~~~~~~~~Npy~~~L~~Ad 227 (308) T pfam06258 164 ARRLLEQLQALLEAY-GGSLLITTSRRTPEAAEAALRKLLG---------------PRPGLYVWDGTGPNPYFGFLAWAD 227 (308) T ss_pred HHHHHHHHHHHHHHC-CCEEEEEECCCCCHHHHHHHHHHHC---------------CCCEEEECCCCCCCHHHHHHHHCC T ss_conf 999999999999877-9729999468896999999998608---------------997289827988645899998588 Q ss_pred CEEEE-CCCCCCCCCCHHHHHHHCCCEEECC---CCCCHHHHHHHHHHCCCEEEECCHHHHHH Q ss_conf 20661-3554445458376896199499989---84437999999997898699579899999 Q gi|254780402|r 323 IAFIG-RSFCASGGQNPLEAAMLGCAILSGP---NVENFRDIYRRMVSSGAVRIVEEVGTLAD 381 (440) Q Consensus 323 iafVG-GSl~~~GGhN~lEpaa~G~pVi~GP---~~~nf~e~~~~L~~~g~~~~v~d~~eL~~ 381 (440) .++|- -|-. . +-||++.|+||..=| .-..|..-.+.|.+.|.+....+..++.+ T Consensus 228 ~iiVT~DSvS-M----isEA~~tGkPV~i~~l~~~~~r~~~f~~~L~~~g~~r~f~~~~~~~~ 285 (308) T pfam06258 228 AVVVTADSVS-M----VSEAAATGAPVGVLPLEGKRGKFARFHDSLEERGRVRPFTGWASLSQ 285 (308) T ss_pred EEEEECCHHH-H----HHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHCCCEEECCCCCCCCC T ss_conf 6899067188-9----99998649977999677666599999999998799067775123245 No 97 >COG3980 spsG Spore coat polysaccharide biosynthesis protein, predicted glycosyltransferase [Cell envelope biogenesis, outer membrane] Probab=95.97 E-value=0.14 Score=30.03 Aligned_cols=290 Identities=16% Similarity=0.127 Sum_probs=154.8 Q ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHH-HHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHH Q ss_conf 8999999999999998619989999627763001-210147741688505572013445574420203564047621467 Q gi|254780402|r 65 SVGETMALIGLIPAIRSRHVNVLLTTMTATSAKV-ARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLT 143 (440) Q Consensus 65 SvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~-~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnl 143 (440) ..|-+.-...|.++|.+.+.. +++.|.+--+. ..+.+ + .+.+......- -++..+++++|+-.+.+=--. T Consensus 14 GmGHV~R~l~LA~~l~k~~~~--~~fl~k~~~e~~~~~~~-~----~f~~~~~~~~n--~ik~~k~d~lI~Dsygl~~dd 84 (318) T COG3980 14 GMGHVMRTLTLARELEKRGFA--CLFLTKQDIEAIIHKVY-E----GFKVLEGRGNN--LIKEEKFDLLIFDSYGLNADD 84 (318) T ss_pred CCCHHHHHHHHHHHHHHCCCE--EEEECCCCHHHHHHHHH-H----HCCCEEEECCC--CCCCCCCCEEEEECCCCCHHH T ss_conf 751345599999999851746--88840662564215666-5----10430023364--100366778999426888789 Q ss_pred HHHHH-HCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCE-EEECCCCCCCCCCCCC Q ss_conf 88874-224550586412665421210011112331026442366415778889986088505-8720431012344221 Q gi|254780402|r 144 VFELS-KQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKL-IVSGNLKIDTESLPCD 221 (440) Q Consensus 144 i~~~~-~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i-~v~GnlK~d~~~~~~~ 221 (440) .+-.+ +.+.|+++++-- .++|+. + .|.+.-+.-. +.+...+-..+. ...| .+| .+.. T Consensus 85 ~k~ik~e~~~k~l~fDd~-~~~~~~-d-------------~d~ivN~~~~-a~~~y~~v~~k~~~~lG-p~y----~~lr 143 (318) T COG3980 85 FKLIKEEAGSKILIFDDE-NAKSFK-D-------------NDLIVNAILN-ANDYYGLVPNKTRYYLG-PGY----APLR 143 (318) T ss_pred HHHHHHHHCCCEEEECCC-CCCCHH-H-------------HHHHHHHHHC-CHHHCCCCCCCEEEEEC-CCC----EECC T ss_conf 999889738817996477-764225-6-------------6735455535-11220536766379965-871----1416 Q ss_pred HHHHHHHHHHC--CCCCEEE-CCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCC Q ss_conf 25689999861--7986175-04655332100000000001234124889705521147888776530010110013444 Q gi|254780402|r 222 KELLSLYQESI--AGRYTWA-AISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDV 298 (440) Q Consensus 222 ~~~~~~~~~~~--~~r~v~v-agSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~ 298 (440) ++....-.... ..|.|++ .|.|-+.- +.++....|.+.+--.-|+++---....++.+..++. T Consensus 144 ~eF~~~r~~~~~r~~r~ilI~lGGsDpk~--lt~kvl~~L~~~~~nl~iV~gs~~p~l~~l~k~~~~~------------ 209 (318) T COG3980 144 PEFYALREENTERPKRDILITLGGSDPKN--LTLKVLAELEQKNVNLHIVVGSSNPTLKNLRKRAEKY------------ 209 (318) T ss_pred HHHHHHHHHHHHCCHHEEEEECCCCCHHH--HHHHHHHHHHCCCEEEEEEECCCCCCHHHHHHHHHHC------------ T ss_conf 99998689986353112899716887244--5999999840357049999468885466788888657------------ Q ss_pred CCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEE---CC Q ss_conf 446641675067652025542208206613554445458376896199499989844379999999978986995---79 Q gi|254780402|r 299 INAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIV---EE 375 (440) Q Consensus 299 ~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v---~d 375 (440) .++-+.-..-.|..+..-||.+++.| |....|++..|+|-++=|..+|.......+.+.|..... -. T Consensus 210 ----~~i~~~~~~~dma~LMke~d~aI~Aa------GstlyEa~~lgvP~l~l~~a~NQ~~~a~~f~~lg~~~~l~~~l~ 279 (318) T COG3980 210 ----PNINLYIDTNDMAELMKEADLAISAA------GSTLYEALLLGVPSLVLPLAENQIATAKEFEALGIIKQLGYHLK 279 (318) T ss_pred ----CCEEEEECCHHHHHHHHHCCHHEECC------CHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHCCCHHHCCCCCC T ss_conf ----88026862245899998603331446------35799999826982587633017888778986686002267776 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCC Q ss_conf 8999999999958989999999999999970455 Q gi|254780402|r 376 VGTLADMVYSLLSEPTIRYEMINAAINEVKKMQG 409 (440) Q Consensus 376 ~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~g 409 (440) ...+...+.++.+|+++|.......+ .+-..+| T Consensus 280 ~~~~~~~~~~i~~d~~~rk~l~~~~~-~i~dg~g 312 (318) T COG3980 280 DLAKDYEILQIQKDYARRKNLSFGSK-LIGDGRG 312 (318) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHCCC-EEECCCC T ss_conf 18789999986407777642221110-2203504 No 98 >PHA01630 putative group 1 glycosyl transferase Probab=95.62 E-value=0.2 Score=29.11 Aligned_cols=199 Identities=16% Similarity=0.104 Sum_probs=97.4 Q ss_pred CCCCEEEECCHHHHHHHHHHCC---CCEEEEC-CCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCC---CCCCCC Q ss_conf 6442366415778889986088---5058720-43101234422125689999861798617504655332---100000 Q gi|254780402|r 181 SQFSLVIVQSERYFRRYKELGA---QKLIVSG-NLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGE---EDKAVY 253 (440) Q Consensus 181 ~~~~~i~~qs~~~~~rl~~lG~---~~i~v~G-nlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~E---E~~il~ 253 (440) +..|.+.+.|+=+..-|..+|. +.+.++. |+.-+- ....+.+ ..-+++++.+-|..+ .+++-. T Consensus 95 ~~~D~ivv~SqWS~naf~~sgl~I~~PiY~IpHn~nprm--~~~~~ke--------k~~~~Vl~~l~HS~~RKG~Di~~~ 164 (333) T PHA01630 95 QPVDEIVVPSQWSKNAFYTSGLKIPQPIYVIPHNLNPRM--FEYKPKE--------KPHPCVLAILPHSWDRKGGDIVVK 164 (333) T ss_pred CCCCEEEECCHHHHHHHHHCCCCCCCCEEECCCCCCHHH--HHCCHHH--------CCCCEEEEECCCCCCCCCCHHHHH T ss_conf 875406742444365787528999986276456799356--4076321--------887569998566545465688999 Q ss_pred CCCCC-CCCCCE-EEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEE---EC Q ss_conf 00000-123412-48897055211478887765300101100134444466416750676520255422082066---13 Q gi|254780402|r 254 VHNFI-KCRTDV-LTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFI---GR 328 (440) Q Consensus 254 a~~~l-~~~~~~-~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafV---GG 328 (440) +++.+ +...+. .||.+- +.+ ..+-.|++. ...|.+..| +-++|+.||+.|. || T Consensus 165 v~~elqke~~d~Y~Lvkss-------n~~-d~Rl~~l~g-----vk~plp~dd---------~~~lf~~~Di~f~p~RGG 222 (333) T PHA01630 165 IFHELQNEGYDFYFLIKSS-------NML-DPRLFGLNG-----VKTPLPDDD---------IYSLFAGCDILFYPVRGG 222 (333) T ss_pred HHHHHHHCCCCEEEEEEEC-------CCC-CCEEECCCC-----CCCCCCCHH---------HHHHHHHCCEEEEECCCC T ss_conf 9999984578569999841-------556-730212335-----447898167---------899874063799841586 Q ss_pred CCCCCCCCCHHHHHHHCCCEEE---CCCCCCHHH-HHHHHHHCCC--------------EEEECCHHHHHHHHHHHHCC- Q ss_conf 5544454583768961994999---898443799-9999997898--------------69957989999999999589- Q gi|254780402|r 329 SFCASGGQNPLEAAMLGCAILS---GPNVENFRD-IYRRMVSSGA--------------VRIVEEVGTLADMVYSLLSE- 389 (440) Q Consensus 329 Sl~~~GGhN~lEpaa~G~pVi~---GP~~~nf~e-~~~~L~~~g~--------------~~~v~d~~eL~~~l~~ll~d- 389 (440) ||. -|.+||.+.|.|++. |+-..=+.. .-..+++.+. .... |.++..+.+...|+| T Consensus 223 aFE----i~~iEAl~~gl~~v~te~GaWsE~~~~~~~~~~ik~~~~~k~~~~NpiHvG~~le~-~~eda~qKll~~L~n~ 297 (333) T PHA01630 223 AFE----IPVIEALALGLDVVVTEKGAWSEWVLSNLDVYWIKSGRKPKLWYTNPIHVGYFLDP-DIEDAYQKLLEALANW 297 (333) T ss_pred EEE----CCHHHHHHCCCCEEECCCCCHHHHCCCCCHHHHHHHCCCCEECCCCCEEECCCCCC-CHHHHHHHHHHHHHCC T ss_conf 034----31799987079767627864065247521002212068724426797322266678-7689999999998625 Q ss_pred -HHH-HHHHHHHHHHHHHHCC-CHHHHHHH Q ss_conf -899-9999999999997045-56999999 Q gi|254780402|r 390 -PTI-RYEMINAAINEVKKMQ-GPLKITLR 416 (440) Q Consensus 390 -~~~-~~~~~~~a~~~i~~~~-ga~~~~~~ 416 (440) |+. -++...++.=+.++.+ .+..+.++ T Consensus 298 ~~ek~Ke~le~~~~~~rEnYsy~ai~k~~~ 327 (333) T PHA01630 298 TPEKKKENLEGRAILYRENYSYNAIAKMWE 327 (333) T ss_pred CHHHHHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 837777655531211245347899999999 No 99 >cd03789 GT1_LPS_heptosyltransferase Lipopolysaccharide heptosyltransferase is involved in the biosynthesis of lipooligosaccharide (LOS). Lipopolysaccharide (LPS) is a major component of the outer membrane of gram-negative bacteria. LPS heptosyltransferase transfers heptose molecules from ADP-heptose to 3-deoxy-D-manno-octulosonic acid (KDO), a part of the inner core component of LPS. This family belongs to the GT-B structural superfamily of glycoslytransferases, which have characteristic N- and C-terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility. Probab=95.42 E-value=0.23 Score=28.67 Aligned_cols=56 Identities=13% Similarity=0.118 Sum_probs=38.1 Q ss_pred EEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCC Q ss_conf 9998178999999999999998619989999627763001210147741688505572 Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDI 116 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~ 116 (440) +=++-.++|.+....|+++.|++.+|+.-||..|.......-+..+..+.+ .++|. T Consensus 3 Lii~~~~iGD~i~~~p~i~~lk~~~P~~~I~~l~~~~~~~l~~~~p~id~v--i~~~~ 58 (279) T cd03789 3 LVIRLSWIGDVVLATPLLRALKARYPDARITVLAPPWFAPLLELMPEVDRV--IVLPK 58 (279) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHCCCCEEEEEECHHHHHHHHHCCCCCEE--EEECC T ss_conf 999478503999999999999998879989999893689999639985799--99535 No 100 >KOG1192 consensus Probab=95.31 E-value=0.21 Score=28.97 Aligned_cols=94 Identities=15% Similarity=0.234 Sum_probs=58.3 Q ss_pred EEEEECCCCCCHHHHHHCCCEEEECCCCCCCCC-CHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEEC----CHHH Q ss_conf 167506765202554220820661355444545-83768961994999898443799999999789869957----9899 Q gi|254780402|r 304 DIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQ-NPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVE----EVGT 378 (440) Q Consensus 304 ~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGh-N~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~----d~~e 378 (440) .|+..+=......+ +++ .-|| .|.-+||- +.+|++.+|+|++.-|-+.....-.+.+.+.|++.... +..+ T Consensus 336 nV~~~~W~PQ~~ll--l~H-~~v~-~FvTHgG~nSt~E~~~~GvP~v~~Plf~DQ~~Na~~i~~~g~~~v~~~~~~~~~~ 411 (496) T KOG1192 336 NVVLSKWAPQNDLL--LDH-PAVG-GFVTHGGWNSTLESIYSGVPMVCVPLFGDQPLNARLLVRHGGGGVLDKRDLVSEE 411 (496) T ss_pred CEEECCCCCCHHHH--CCC-CCCC-EEEECCCHHHHHHHHHCCCCEECCCCCCCCHHHHHHHHHCCCEEEEEECCCCCHH T ss_conf 18412665853353--568-7417-0885186888999986599843478633538899999972754899621233125 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 99999999589899999999999 Q gi|254780402|r 379 LADMVYSLLSEPTIRYEMINAAI 401 (440) Q Consensus 379 L~~~l~~ll~d~~~~~~~~~~a~ 401 (440) +..++...+.+++..+...+-+. T Consensus 412 ~~~~~~~il~~~~y~~~~~~l~~ 434 (496) T KOG1192 412 LLEAIKEILENEEYKEAAKRLSE 434 (496) T ss_pred HHHHHHHHCCCHHHHHHHHHHHH T ss_conf 67777765076999999999999 No 101 >TIGR02400 trehalose_OtsA alpha,alpha-trehalose-phosphate synthase [UDP-forming]; InterPro: IPR012766 This enzyme catalyzes the key, penultimate step in biosynthesis of trehalose, a compatible solute made as an osmoprotectant in some species in all three domains of life. The gene symbol OtsA stands for osmotically regulated trehalose synthesis A. Trehalose helps protect against both osmotic and thermal stresses, and is made from two glucose subunits. This entry excludes glucosylglycerol-phosphate synthase, an enzyme of an analogous osmoprotectant system in many cyanobacterial strains. This entry does not identify archaeal examples, as they are more divergent than glucosylglycerol-phosphate synthase. Sequences that score in the gray zone between the trusted and noise cut offs include a number of yeast multidomain proteins in which the N-terminal domain may be functionally equivalent to this family. The gray zone also includes the OtsA of Cornyebacterium glutamicum (and related species), shown to be responsible for synthesis of only trace amounts of trehalose while the majority is synthesized by the TreYZ pathway; the significance of OtsA in this species is unclear (see ).; GO: 0003825 alphaalpha-trehalose-phosphate synthase (UDP-forming) activity, 0005992 trehalose biosynthetic process. Probab=95.02 E-value=0.3 Score=27.88 Aligned_cols=266 Identities=15% Similarity=0.180 Sum_probs=147.7 Q ss_pred HHHHHHHHHCCEEEEEECCC----CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHH----HHHHHCCCCCCEEEECC Q ss_conf 34455744202035640476----21467888742245505864126654212100111----12331026442366415 Q gi|254780402|r 119 AVSRFLKYWKPDCMILSESD----IWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVL----SFSKKIFSQFSLVIVQS 190 (440) Q Consensus 119 ~~~~fl~~~~P~~~i~~e~E----lWPnli~~~~~~~ip~~linarls~~S~~~~~~~~----~l~~~~~~~~~~i~~qs 190 (440) ++++.....+|+=+||+ -| +=|-++++..-.++++-..+=-==+ |.-=|+.+| .+.+.++. .|+|.-|+ T Consensus 130 FA~~~~~~~~~gD~~Wv-HDYHL~~lp~~LRe~~~~n~~iGFFLHiPfP-S~ei~~~lP~~~~~ll~gll~-yDLvGFqT 206 (476) T TIGR02400 130 FAEALAPLLQPGDIVWV-HDYHLMLLPAMLRELGVENVKIGFFLHIPFP-SSEIYRTLPEWRRELLEGLLA-YDLVGFQT 206 (476) T ss_pred HHHHHHHHCCCCCEEEE-ECCHHHHHHHHHHHHCCCCCEEEEECCCCCC-CHHHHHCCHHHHHHHHHHHHH-CCCCCCCC T ss_conf 99999986389989998-4414568999999717785300104178998-579985180579999999871-58302456 Q ss_pred HHHHHHHHH-----HCC---CC-EEE--ECCCCC----------------CCCCCCCCHHHHHHHHHHCCCCCEEECCCC Q ss_conf 778889986-----088---50-587--204310----------------123442212568999986179861750465 Q gi|254780402|r 191 ERYFRRYKE-----LGA---QK-LIV--SGNLKI----------------DTESLPCDKELLSLYQESIAGRYTWAAIST 243 (440) Q Consensus 191 ~~~~~rl~~-----lG~---~~-i~v--~GnlK~----------------d~~~~~~~~~~~~~~~~~~~~r~v~vagSt 243 (440) ++|+++|.+ +|. ++ +.. +..-|. ++...+........+++.+++.++++--=- T Consensus 207 ~~d~~nF~~~~~~~~~~~~~~~~~~~~~~~~~R~~~v~aFPIGID~~~f~~~a~~~~~~~~~~~l~~~l~~~k~i~GVDR 286 (476) T TIGR02400 207 YEDARNFLSAVSRELGYETLENGVESGGGAGGRTVRVGAFPIGIDVDRFAEQAKKPSVQKRIAELRESLRGRKLIIGVDR 286 (476) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCEECCCCCCCCEEEEEEEEECCCHHHHHHHHHCHHHHHHHHHHHHHHCCCEEEEEEEE T ss_conf 88999999999998511112662011775643369998874200779999873140038999999998379749998420 Q ss_pred ---CCCCCCCCCCCC-CCCCCCCCE-----EEEEEECCCCH--HHHHHHHHHHHCCHHHHHC-CCCCCCCCCEEEEECCC Q ss_conf ---533210000000-000123412-----48897055211--4788877653001011001-34444466416750676 Q gi|254780402|r 244 ---FEGEEDKAVYVH-NFIKCRTDV-----LTIIVPRHPRR--CDAIERRLIAKGLKVARRS-RGDVINAEVDIFLGDTI 311 (440) Q Consensus 244 ---h~~EE~~il~a~-~~l~~~~~~-----~lIIvPRhpeR--~~~i~~~l~~~gl~~~~~S-~~~~~~~~~~V~i~Dt~ 311 (440) -+| -.--+.|| ++|+++|+. .+=||| |-| .++-+++=.+-...+.+.. +....+-..-.|+-.++ T Consensus 287 LDYsKG-~~~R~~AFe~fL~~~P~~rgkV~lvQIA~--PSR~~V~~Y~~Lr~~~~~~VGrING~fg~~~WtPi~Y~~~~~ 363 (476) T TIGR02400 287 LDYSKG-LPERLKAFERFLEEHPEWRGKVTLVQIAV--PSRGDVEEYQQLRRQVEELVGRINGRFGTLDWTPIRYLNRSY 363 (476) T ss_pred CHHHCC-HHHHHHHHHHHHHHCHHHCCCEEEEEEEE--CCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEECCCC T ss_conf 201122-77899999999874734207279999850--668887899999999999864542021788853678723888 Q ss_pred C--CCHHHHHHCCCEEEECCCCCCCCCCH--HHHHHHCCC----EEECCCCCCHHHHHHHHHHCCCEEEEC--CHHHHHH Q ss_conf 5--20255422082066135544454583--768961994----999898443799999999789869957--9899999 Q gi|254780402|r 312 G--EMGFYLRMTEIAFIGRSFCASGGQNP--LEAAMLGCA----ILSGPNVENFRDIYRRMVSSGAVRIVE--EVGTLAD 381 (440) Q Consensus 312 G--eL~~lY~~AdiafVGGSl~~~GGhN~--lEpaa~G~p----Vi~GP~~~nf~e~~~~L~~~g~~~~v~--d~~eL~~ 381 (440) . +|..||+.|||++|=. + +-|.|. -|=.|+-.| |++ .+.|.-++++|. .+..|+ |.+++++ T Consensus 364 ~~~~L~aLYr~adV~LVTp-L--RDGMNLVAKEYVAaQ~~~~PGVLi---LSefAGAA~eL~---~ALlVNP~d~~g~a~ 434 (476) T TIGR02400 364 PREELMALYRAADVGLVTP-L--RDGMNLVAKEYVAAQDPEDPGVLI---LSEFAGAAQELK---GALLVNPYDIEGMAD 434 (476) T ss_pred CHHHHHHHHHHCCEEEECC-C--CCCCCHHHHHHHHCCCCCCCCCEE---CCHHHHHHHHHC---CCCCCCCCCHHHHHH T ss_conf 8689999974103266545-6--567513535878528988478543---002666998850---156218978899999 Q ss_pred HHHHHHC--CHHHHHHHHHH Q ss_conf 9999958--98999999999 Q gi|254780402|r 382 MVYSLLS--EPTIRYEMINA 399 (440) Q Consensus 382 ~l~~ll~--d~~~~~~~~~~ 399 (440) +|.+-|+ ..++.+++.+- T Consensus 435 Ai~~AL~Mp~~eR~~R~~~l 454 (476) T TIGR02400 435 AIARALTMPLEEREERHRAL 454 (476) T ss_pred HHHHHHCCCHHHHHHHHHHH T ss_conf 99998529988999999999 No 102 >pfam06925 MGDG_synth Monogalactosyldiacylglycerol (MGDG) synthase. This family represents a conserved region of approximately 180 residues within plant and bacterial monogalactosyldiacylglycerol (MGDG) synthase (EC:2.4.1.46). In Arabidopsis, there are two types of MGDG synthase which differ in their N-terminal portion: type A and type B. Probab=94.51 E-value=0.24 Score=28.49 Aligned_cols=80 Identities=18% Similarity=0.204 Sum_probs=52.7 Q ss_pred HHHHHHHHHCCEEEEEECCCCHHHH-HHHHHHCC----CCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHH Q ss_conf 3445574420203564047621467-88874224----550586412665421210011112331026442366415778 Q gi|254780402|r 119 AVSRFLKYWKPDCMILSESDIWPLT-VFELSKQR----IPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERY 193 (440) Q Consensus 119 ~~~~fl~~~~P~~~i~~e~ElWPnl-i~~~~~~~----ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~ 193 (440) ..+++++..+||++|.. ..+-..+ +..++++| +|++-+ ++|-+. --..++-...|..++-|++. T Consensus 80 ~l~~~i~~~~PD~IV~T-hp~~~~~~l~~lk~~~~~~~~p~~tV---iTD~~~-------~H~~W~~~~~D~y~Va~ee~ 148 (169) T pfam06925 80 ELAALLKEFQPDIIIST-HPLPAAVPLSVLKSKGLLKRVLVVTV---VTDFRT-------CHPFWLHPEIDRYYVPSKEV 148 (169) T ss_pred HHHHHHHHHCCCEEEEC-CHHHHHHHHHHHHHCCCCCCCCEEEE---ECCCCC-------CCCCCCCCCCCEEEECCHHH T ss_conf 99999998493999999-76266789999998387889978999---898866-------65781689999899799999 Q ss_pred HHHHHHHCC--CCEEEEC Q ss_conf 889986088--5058720 Q gi|254780402|r 194 FRRYKELGA--QKLIVSG 209 (440) Q Consensus 194 ~~rl~~lG~--~~i~v~G 209 (440) ++.+.+.|+ ++|.++| T Consensus 149 ~~~l~~~Gi~~~kI~vtG 166 (169) T pfam06925 149 KKEALEKGIDPSNIKVTG 166 (169) T ss_pred HHHHHHCCCCHHHEEEEC T ss_conf 999998599988978837 No 103 >pfam08660 Alg14 Oligosaccharide biosynthesis protein Alg14 like. Alg14 is involved dolichol-linked oligosaccharide biosynthesis and anchors the catalytic subunit Alg13 to the ER membrane. Probab=94.45 E-value=0.42 Score=26.97 Aligned_cols=130 Identities=18% Similarity=0.215 Sum_probs=77.5 Q ss_pred EEEECCHHHHHHHHHHHHHHHHCCC--CEEEEECCCCCCHHHHCCCCCCEEEEECCC----------CCCHHHH------ Q ss_conf 9981789999999999999986199--899996277630012101477416885055----------7201344------ Q gi|254780402|r 60 WFHASSVGETMALIGLIPAIRSRHV--NVLLTTMTATSAKVARKYLGQYAIHQYAPL----------DIQPAVS------ 121 (440) Q Consensus 60 W~HaaSvGE~~~a~pli~~l~~~~~--~iliT~~T~tg~~~~~~~~~~~~~~~y~P~----------D~~~~~~------ 121 (440) -+=+.|-|-+.....|++.+.+++. ..++|.++..+.+.+++...+.....-.|- +.....+ T Consensus 2 ~vv~GSGGHt~eml~L~~~l~~~~~~~~yvv~~~D~~s~~~~~~~~~~~~~i~~~~r~R~v~qs~~~s~~~~~~~~~~s~ 81 (166) T pfam08660 2 CVVLGSGGHTAEMLRLLPFLKNLEDPRRYVVTETDKDSLSKASKFEKSRGGILKIPRAREVGQSYLRSIFTTLRNLLSAL 81 (166) T ss_pred EEEEECHHHHHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHCCCCCCEEEECCCEEECCCHHHHHHHHHHHHHHHHH T ss_conf 99994827899999999987756897389998898078999886055552376446315738556745999999999999 Q ss_pred HHHHHHCCEEEEEECCCCHHHHHHHH------HHCCCCEEEEEE--CCCCCCCCCCHHHHHHHHHC-CCCCCEEEECCHH Q ss_conf 55744202035640476214678887------422455058641--26654212100111123310-2644236641577 Q gi|254780402|r 122 RFLKYWKPDCMILSESDIWPLTVFEL------SKQRIPQVLVNA--RMSRRSFKNWKTVLSFSKKI-FSQFSLVIVQSER 192 (440) Q Consensus 122 ~fl~~~~P~~~i~~e~ElWPnli~~~------~~~~ip~~lina--rls~~S~~~~~~~~~l~~~~-~~~~~~i~~qs~~ 192 (440) +.+...+||+++....-.---....+ .-.+.+++.|.. |....| ++.++ +.-.|.+++|-++ T Consensus 82 ~il~k~kPdvii~tG~g~~vp~~~~a~ll~~~~~~~~k~i~iES~~r~~~~s---------ltgkll~~~ad~f~vqW~~ 152 (166) T pfam08660 82 KLLRRERPDVILCNGPGTCVPFCLAAKLLKILGLKGTKIVYIESFARVKTLS---------LSGKLLYPLADRFIVQWPE 152 (166) T ss_pred HHHHHHCCCEEEECCCCEEEHHHHHHHHHHHHHCCCCCEEEEEEEECCCCCC---------HHHHHHHHHCCEEEEECHH T ss_conf 9998539989997799603099999999986401588589998442137864---------7777699868988983799 Q ss_pred HHHHHH Q ss_conf 888998 Q gi|254780402|r 193 YFRRYK 198 (440) Q Consensus 193 ~~~rl~ 198 (440) -++++. T Consensus 153 l~~~yp 158 (166) T pfam08660 153 LKKKYP 158 (166) T ss_pred HHHHCC T ss_conf 994689 No 104 >TIGR02095 glgA glycogen/starch synthases, ADP-glucose type; InterPro: IPR011835 This family consists of glycogen (or starch) synthases that use ADP-glucose (2.4.1.21 from EC), rather than UDP-glucose (2.4.1.11 from EC) as in animals, as the glucose donor. This enzyme is found in bacteria and plants. Whether the name given is glycogen synthase or starch synthase depends on context, and therefore on substrate.; GO: 0009011 starch synthase activity, 0009250 glucan biosynthetic process. Probab=94.44 E-value=0.36 Score=27.37 Aligned_cols=169 Identities=14% Similarity=0.033 Sum_probs=98.9 Q ss_pred CCCCHHH-HHHHHHHCC------CCCEEECCCC---CCCCCCCCCCCCCCCCCCCC-EEEEEEECCCCHHH-HHHHHH-- Q ss_conf 4221256-899998617------9861750465---53321000000000012341-24889705521147-888776-- Q gi|254780402|r 218 LPCDKEL-LSLYQESIA------GRYTWAAIST---FEGEEDKAVYVHNFIKCRTD-VLTIIVPRHPRRCD-AIERRL-- 283 (440) Q Consensus 218 ~~~~~~~-~~~~~~~~~------~r~v~vagSt---h~~EE~~il~a~~~l~~~~~-~~lIIvPRhpeR~~-~i~~~l-- 283 (440) .....+. +..+.+.++ +.|++...|. .+| -++++++...+-+..| .++|+...-....+ .+.++. T Consensus 302 ~~~K~~ncK~aLq~~lGL~~~Y~~~Pl~~~isRL~~QKG-~Dl~~~a~~~ll~~~~~~Qlv~lG~Gdp~le~~l~~la~~ 380 (517) T TIGR02095 302 LSGKAKNCKEALQEELGLPVDYDDVPLFGVISRLVEQKG-VDLLLAALPELLELGDFGQLVVLGTGDPELEEALRELADH 380 (517) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCC-HHHHHHHHHHHHCCCCCEEEEEEECCCHHHHHHHHHHHHH T ss_conf 666788758999998198878888537999822562442-7899999999971179668999704887999999999999 Q ss_pred HHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHH Q ss_conf 53001011001344444664167506765202554220820661355444545837689619949998984437999999 Q gi|254780402|r 284 IAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRR 363 (440) Q Consensus 284 ~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~ 363 (440) ++..=++..+-..+.. .=-.+|+-||+ |+=.|-.+-=|-|-|.++.+|++.|. -.+..-.|.+.+ T Consensus 381 ~~~p~~~~~~~~yde~-------------LAh~iyAgaD~-~lmPSrFEPCGL~Ql~amRYGt~PiV-r~tGGL~DTV~d 445 (517) T TIGR02095 381 ERYPGKVRVIIGYDEA-------------LAHRIYAGADF-FLMPSRFEPCGLTQLYAMRYGTVPIV-RRTGGLADTVVD 445 (517) T ss_pred HHCCCCEEEEEECCHH-------------HHHHHHHCCCE-EECCCCCCCCHHHHHHHHHCCCCEEE-ECCCCCCEEEEC T ss_conf 6378948999625879-------------99989723776-88078557312579989734995387-158895201003 Q ss_pred ----------HHHCCCEEEECCHHHHHHHHHHHH---C-CHHHHHHHHHHHHH Q ss_conf ----------997898699579899999999995---8-98999999999999 Q gi|254780402|r 364 ----------MVSSGAVRIVEEVGTLADMVYSLL---S-EPTIRYEMINAAIN 402 (440) Q Consensus 364 ----------L~~~g~~~~v~d~~eL~~~l~~ll---~-d~~~~~~~~~~a~~ 402 (440) -..+|-.+.-.|+++|..++.+-| . +|+..++|.++|-+ T Consensus 446 ~~~~~~~aP~~~~tGF~F~~~~~~~L~~a~~rAl~lY~~~~~~w~~l~~~aM~ 498 (517) T TIGR02095 446 ADPENLAAPAGSGTGFLFEEYDPEALLAALSRALRLYRQDPELWKALQKNAMS 498 (517) T ss_pred CCCCCCCCCCCCCCCEEECCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC T ss_conf 87764447787765417236888999999999999872397899999998513 No 105 >pfam00982 Glyco_transf_20 Glycosyltransferase family 20. Members of this family belong to glycosyl transferase family 20. OtsA (Trehalose-6-phosphate synthase) is homologous to regions in the subunits of yeast trehalose-6-phosphate synthase/phosphate complex,. Probab=94.38 E-value=0.43 Score=26.88 Aligned_cols=276 Identities=13% Similarity=0.180 Sum_probs=133.8 Q ss_pred HHHHHHHHCCEEEEEE-CCCCH--HHHHHHHHHCCCCEEEE-EECC-CCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHH Q ss_conf 4455744202035640-47621--46788874224550586-4126-654212100111123310264423664157788 Q gi|254780402|r 120 VSRFLKYWKPDCMILS-ESDIW--PLTVFELSKQRIPQVLV-NARM-SRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYF 194 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~-e~ElW--Pnli~~~~~~~ip~~li-narl-s~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~ 194 (440) +.+....++|+-+||+ .+-++ |.++++.... .++... -.-+ |.--|+-..+-..+.+.++ ..|.|.-|+..++ T Consensus 129 A~~i~~~~~~~D~VWVHDYHL~llp~~LR~~~~~-~~IgfFlHiPFPs~eifr~lP~r~eil~glL-~~DliGFqt~~y~ 206 (470) T pfam00982 129 ADKIVEVYKDGDLIWVHDYHLMLLPQMLRKRLPD-AKIGFFLHIPFPSSEIFRCLPVREEILRGLL-GADLIGFHTYDYA 206 (470) T ss_pred HHHHHHHCCCCCEEEECCCHHHHHHHHHHHHCCC-CEEEEEECCCCCCHHHHHHCCCHHHHHHHHH-HCCEEEECCHHHH T ss_conf 9999985588998998263366689999984899-8389886189999899976987699999987-5888896788999 Q ss_pred HHHHH-----HCCC---C--EEEECC-CCC-------C-----C-CCCCCCHHHHHHHHHHCCCC-CEEECCCCC---CC Q ss_conf 89986-----0885---0--587204-310-------1-----2-34422125689999861798-617504655---33 Q gi|254780402|r 195 RRYKE-----LGAQ---K--LIVSGN-LKI-------D-----T-ESLPCDKELLSLYQESIAGR-YTWAAISTF---EG 246 (440) Q Consensus 195 ~rl~~-----lG~~---~--i~v~Gn-lK~-------d-----~-~~~~~~~~~~~~~~~~~~~r-~v~vagSth---~~ 246 (440) ++|.+ +|.+ + +...|. ++. | . ...+...+....+++.++++ ++++..-.- +| T Consensus 207 r~Fl~~~~rllg~~~~~~~~v~~~gr~v~v~~~PigId~~~~~~~~~~~~~~~~~~~lr~~~~~~~kli~gvDRlDytKG 286 (470) T pfam00982 207 RHFLSCCSRLLGLETTSDGGVEYGGRTVSVGAFPIGIDPGRIESGLKSPSVQEKVKELKERFGNKKKLILGVDRLDYIKG 286 (470) T ss_pred HHHHHHHHHHHCCCCCCCCEEEECCEEEEEEEEECEECHHHHHHHHCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCC T ss_conf 99999999971874058973999998999999813318899999863914689999999983799669997366402258 Q ss_pred CCCCCCCCC-CCCCCCCCEE-----EEEE-ECCCCHHHHHHHHHHHHCCHHHHH-CCCCCCCCCCEEEEEC--CCCCCHH Q ss_conf 210000000-0001234124-----8897-055211478887765300101100-1344444664167506--7652025 Q gi|254780402|r 247 EEDKAVYVH-NFIKCRTDVL-----TIIV-PRHPRRCDAIERRLIAKGLKVARR-SRGDVINAEVDIFLGD--TIGEMGF 316 (440) Q Consensus 247 EE~~il~a~-~~l~~~~~~~-----lIIv-PRhpeR~~~i~~~l~~~gl~~~~~-S~~~~~~~~~~V~i~D--t~GeL~~ 316 (440) =.+- +.|+ +.|+++|+.+ +-|+ |-. ++.++..++-.+-.-.+.+. ++...++-..-+++-. +.-|+.. T Consensus 287 i~~k-l~Afe~fL~~~Pe~~gkv~lvQia~psr-~~v~~y~~l~~~v~~~v~rIN~~fg~~~w~Pv~y~~~~~~~~el~a 364 (470) T pfam00982 287 IPQK-LLAFERFLEEYPEWRGKVVLVQIAVPSR-GDVEEYQNLRSQVEELVGRINGEFGTLDYTPVHHLHRSLDFDELIA 364 (470) T ss_pred HHHH-HHHHHHHHHHCCCCCCCEEEEEECCCCC-CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCHHHHHH T ss_conf 8999-9999999986914468779999815887-5627899999999999999961447899742999907999999999 Q ss_pred HHHHCCCEEEECCCCCCCCCCH--HHHHHHC---CCEEECCCCCCHHHHHHHHHHCCCEEEEC--CHHHHHHHHHHHHC- Q ss_conf 5422082066135544454583--7689619---94999898443799999999789869957--98999999999958- Q gi|254780402|r 317 YLRMTEIAFIGRSFCASGGQNP--LEAAMLG---CAILSGPNVENFRDIYRRMVSSGAVRIVE--EVGTLADMVYSLLS- 388 (440) Q Consensus 317 lY~~AdiafVGGSl~~~GGhN~--lEpaa~G---~pVi~GP~~~nf~e~~~~L~~~g~~~~v~--d~~eL~~~l~~ll~- 388 (440) +|+.||++.|- |+ +.|.|. .|=.++- .-|++ .+.|...++.|. +++..|+ |.+++++++..-|+ T Consensus 365 ly~~adv~lVT-pl--rDGMNLvakEyva~q~~~~GvLI---LSefaGaa~~L~--~gAllVNP~d~~~~a~ai~~AL~M 436 (470) T pfam00982 365 LYAIADVCLVT-SL--RDGMNLVAYEYVACQQDRKGVLI---LSEFAGAAQSLN--DGAILVNPWDIEEVAEAINEALTM 436 (470) T ss_pred HHHHHHEEEEC-CC--CCCCCCCCHHHEEEECCCCCEEE---EECCCCHHHHHC--CCCEEECCCCHHHHHHHHHHHHCC T ss_conf 99851235745-33--34335330221366559995499---952403176755--970898999989999999999759 Q ss_pred CHHHHHHHHHHHHHHHHHC Q ss_conf 9899999999999999704 Q gi|254780402|r 389 EPTIRYEMINAAINEVKKM 407 (440) Q Consensus 389 d~~~~~~~~~~a~~~i~~~ 407 (440) .++++++--++-++.+.++ T Consensus 437 ~~~Er~~R~~~l~~~v~~~ 455 (470) T pfam00982 437 SEEERQKRHRKLFKYISKH 455 (470) T ss_pred CHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999998857 No 106 >COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones] Probab=94.14 E-value=0.48 Score=26.55 Aligned_cols=295 Identities=8% Similarity=0.048 Sum_probs=150.6 Q ss_pred HHHHHHHHHHHHHHCCCCEEE-EECCCCCCHHHHCCCCCCEEEEE--CCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHH Q ss_conf 999999999999861998999-96277630012101477416885--055720134455744202035640476214678 Q gi|254780402|r 68 ETMALIGLIPAIRSRHVNVLL-TTMTATSAKVARKYLGQYAIHQY--APLDIQPAVSRFLKYWKPDCMILSESDIWPLTV 144 (440) Q Consensus 68 E~~~a~pli~~l~~~~~~ili-T~~T~tg~~~~~~~~~~~~~~~y--~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli 144 (440) +.-...++.+.+-++..+|.. .++++ +-+....+....+.|.| -+.|.--.+.+... =..+.+|-+.+=.--+.- T Consensus 274 vg~l~~~v~e~hDRdkfEvfay~~g~~-~~dal~~rI~a~~~~~~~~~~~dd~e~a~~I~~-d~IdILvDl~g~T~d~r~ 351 (620) T COG3914 274 VGFLLRWVFEYHDRDKFEVFAYSLGPP-HTDALQERISAAVEKWYPIGRMDDAEIANAIRT-DGIDILVDLDGHTVDTRC 351 (620) T ss_pred HHHHHHHHHHHHCHHHEEEEEEECCCC-CCHHHHHHHHHHHHHEECCCCCCHHHHHHHHHH-CCCEEEEECCCCEECCCH T ss_conf 899999999873500158999965887-731678888876531410588688999999872-587099965672103400 Q ss_pred HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEE----ECCHHHHHHHHHHC--CCCEEEECCCCCCCCCC Q ss_conf 8874224550586412665421210011112331026442366----41577888998608--85058720431012344 Q gi|254780402|r 145 FELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVI----VQSERYFRRYKELG--AQKLIVSGNLKIDTESL 218 (440) Q Consensus 145 ~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~----~qs~~~~~rl~~lG--~~~i~v~GnlK~d~~~~ 218 (440) .....+--|+. |+ |++.....-....|-++ +.-++..++|.+-- .+ -...++-++....+ T Consensus 352 ~v~A~RpAPiq-vs------------wlGy~aT~g~p~~DY~I~D~y~vPp~ae~yysEkl~RLp-~cy~p~d~~~~v~p 417 (620) T COG3914 352 QVFAHRPAPIQ-VS------------WLGYPATTGSPNMDYFISDPYTVPPTAEEYYSEKLWRLP-QCYQPVDGFEPVTP 417 (620) T ss_pred HHHHCCCCCEE-EE------------ECCCCCCCCCCCCEEEEECCEECCCHHHHHHHHHHHHCC-CCCCCCCCCCCCCC T ss_conf 36635777648-76------------226566668876307850760368147789989987330-13688777565889 Q ss_pred CCCHHHHHHHHHHCCCCCEEECC-CCCCCCCCCCCCCCCCCCCCCCEEEEEEEC--CCCHHHHHHHHHHHHCCHHHHHC- Q ss_conf 22125689999861798617504-655332100000000001234124889705--52114788877653001011001- Q gi|254780402|r 219 PCDKELLSLYQESIAGRYTWAAI-STFEGEEDKAVYVHNFIKCRTDVLTIIVPR--HPRRCDAIERRLIAKGLKVARRS- 294 (440) Q Consensus 219 ~~~~~~~~~~~~~~~~r~v~vag-Sth~~EE~~il~a~~~l~~~~~~~lIIvPR--hpeR~~~i~~~l~~~gl~~~~~S- 294 (440) +.+.+.. ...++-.|++.+ ...+--.+++---.++++.-|+-+|.|-+- .++-...+.++.++.|+...|.- T Consensus 418 ~~sR~~l----glp~~avVf~c~~n~~K~~pev~~~wmqIL~~vP~Svl~L~~~~~~~~~~~~l~~la~~~Gv~~eRL~f 493 (620) T COG3914 418 PPSRAQL----GLPEDAVVFCCFNNYFKITPEVFALWMQILSAVPNSVLLLKAGGDDAEINARLRDLAEREGVDSERLRF 493 (620) T ss_pred CCCHHHC----CCCCCEEEEEECCCCCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHCCCCHHHEEE T ss_conf 9432105----999980899966886447879999999999848985799826898688999999999970898133462 Q ss_pred CCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCC-CHHHHHHHCCCEEECCCCCCHHH--HHHHHHHCCCEE Q ss_conf 344444664167506765202554220820661355444545-83768961994999898443799--999999789869 Q gi|254780402|r 295 RGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQ-NPLEAAMLGCAILSGPNVENFRD--IYRRMVSSGAVR 371 (440) Q Consensus 295 ~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGh-N~lEpaa~G~pVi~GP~~~nf~e--~~~~L~~~g~~~ 371 (440) ....+++ +...-|.+||++ =-|+ |.||| +.+||.-.|+||++=+.- .|.. -+..+..+|.-. T Consensus 494 ~p~~~~~-----------~h~a~~~iADlv--LDTy-PY~g~TTa~daLwm~vPVlT~~G~-~FasR~~~si~~~agi~e 558 (620) T COG3914 494 LPPAPNE-----------DHRARYGIADLV--LDTY-PYGGHTTASDALWMGVPVLTRVGE-QFASRNGASIATNAGIPE 558 (620) T ss_pred CCCCCCH-----------HHHHHHCHHHEE--EECC-CCCCCCCHHHHHHHCCCEEEECCH-HHHHHHHHHHHHHCCCCH T ss_conf 6999988-----------999862313246--5246-678864267778735844651117-788760599998669802 Q ss_pred E-ECCHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 9-5798999999999958989999999 Q gi|254780402|r 372 I-VEEVGTLADMVYSLLSEPTIRYEMI 397 (440) Q Consensus 372 ~-v~d~~eL~~~l~~ll~d~~~~~~~~ 397 (440) . +.+.++-.+.-..+=+|...+++.. T Consensus 559 ~vA~s~~dYV~~av~~g~dral~q~~r 585 (620) T COG3914 559 LVADSRADYVEKAVAFGSDRALRQQVR 585 (620) T ss_pred HHCCCHHHHHHHHHHHCCCHHHHHHHH T ss_conf 421888899999998534177787557 No 107 >PRK00742 chemotaxis-specific methylesterase; Provisional Probab=92.85 E-value=0.78 Score=25.17 Aligned_cols=140 Identities=18% Similarity=0.143 Sum_probs=67.1 Q ss_pred CCEEECC-CCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCH-HHHHHHHHHH-HCCHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 8617504-6553321000000000012341248897055211-4788877653-00101100134444466416750676 Q gi|254780402|r 235 RYTWAAI-STFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRR-CDAIERRLIA-KGLKVARRSRGDVINAEVDIFLGDTI 311 (440) Q Consensus 235 r~v~vag-Sth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR-~~~i~~~l~~-~gl~~~~~S~~~~~~~~~~V~i~Dt~ 311 (440) .++++.| ||- |=+ .+......|.....+-++||-|=|.. ...+.+++.+ .++.+..-++++... ...|||+-.= T Consensus 156 ~~vV~IGaStG-Gp~-aL~~il~~Lp~~~p~pivivQHm~~~f~~~la~~L~~~~~l~V~~a~~g~~l~-pG~vYiap~~ 232 (345) T PRK00742 156 EKLVAIGASTG-GPE-ALQKVLTQLPANFPAPIVIVQHMPAGFTASFAERLNRLCQIPVKEAEDGERVK-PGHAYLAPGG 232 (345) T ss_pred CCEEEEEECCC-CHH-HHHHHHHHCCCCCCCCEEEEECCCCCHHHHHHHHHHHHCCCCEEECCCCCEEE-CCEEEECCCC T ss_conf 71899983756-899-99999985898899748999658984678999998614699769926998754-8989982799 Q ss_pred CCCHHHHHHCCCEEEECCCCCCCCCCH-----HHHHH--HCCC----EEECCCCCCHHHHHHHHHHCCCEEEECCHHH Q ss_conf 520255422082066135544454583-----76896--1994----9998984437999999997898699579899 Q gi|254780402|r 312 GEMGFYLRMTEIAFIGRSFCASGGQNP-----LEAAM--LGCA----ILSGPNVENFRDIYRRMVSSGAVRIVEEVGT 378 (440) Q Consensus 312 GeL~~lY~~AdiafVGGSl~~~GGhN~-----lEpaa--~G~p----Vi~GP~~~nf~e~~~~L~~~g~~~~v~d~~e 378 (440) -.|.---.-++..+---.-.+..||.| ++.++ +|.- |++|=.-+.-.. ...+.++|+...+.|++. T Consensus 233 ~hl~i~~~~~~~~~~l~~~~~~~~~rPsiD~lf~S~A~~~g~~~v~viLTGmG~DGa~G-~~~ik~~Gg~tiaQd~~s 309 (345) T PRK00742 233 KHMMLARSGANYRIKLTDGPPVNRHRPSVDVLFESVAKAAGGNALGVILTGMGRDGAAG-LKAMRQAGATTIAQDEAS 309 (345) T ss_pred CEEEEEECCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECCCCCHHHHHH-HHHHHHCCCEEEEECHHH T ss_conf 76999974881699988888545878976699999999708988999956586508999-999998799799989266 No 108 >COG4370 Uncharacterized protein conserved in bacteria [Function unknown] Probab=92.44 E-value=0.8 Score=25.11 Aligned_cols=342 Identities=15% Similarity=0.126 Sum_probs=176.0 Q ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCC-C-CEEEEEC-CCCCCHHHH---------CCCCCCEEEEECCCCCCHH----- Q ss_conf 779998178999999999999998619-9-8999962-776300121---------0147741688505572013----- Q gi|254780402|r 57 PLIWFHASSVGETMALIGLIPAIRSRH-V-NVLLTTM-TATSAKVAR---------KYLGQYAIHQYAPLDIQPA----- 119 (440) Q Consensus 57 ~~IW~HaaSvGE~~~a~pli~~l~~~~-~-~iliT~~-T~tg~~~~~---------~~~~~~~~~~y~P~D~~~~----- 119 (440) +-|.+-.-..||-..+..++++|.++. | ++++.-- --.|-.... +.+|.. -..|.-+|..|. T Consensus 7 ~kiLllSNGHgEDlia~~i~qal~rra~p~eila~LPLVGeG~aYq~l~i~ligpv~~mPSG-Gf~y~~~~~l~rDvrgG 85 (412) T COG4370 7 PKILLLSNGHGEDLIAVAIAQALRRRAPPDEILAALPLVGEGGAYQNLNIELIGPVLTMPSG-GFIYMDLRNLWRDVRGG 85 (412) T ss_pred CCEEEEECCCHHHHHHHHHHHHHHHHCCCCCEEECCCCCCCCHHHCCCCCCEECCCCCCCCC-CCCCCCHHHHHHHHHCC T ss_conf 10478606852778999999999963798660421642257412125685000035447888-82345599999997521 Q ss_pred -------HHHHHHHHCCE-----EE--EEECCCCHHHHHHHHHHCCCCEEE---------EEECCCCCCCC--CC-HHHH Q ss_conf -------44557442020-----35--640476214678887422455058---------64126654212--10-0111 Q gi|254780402|r 120 -------VSRFLKYWKPD-----CM--ILSESDIWPLTVFELSKQRIPQVL---------VNARMSRRSFK--NW-KTVL 173 (440) Q Consensus 120 -------~~~fl~~~~P~-----~~--i~~e~ElWPnli~~~~~~~ip~~l---------inarls~~S~~--~~-~~~~ 173 (440) --.-++.|.|. .. +..-+|+=|-.+........+-+. +-++++-.+-. || +++. T Consensus 86 LvqlT~~Qi~alrkq~~q~~~~g~~~~ilAvGdivpla~a~lg~~~y~~v~~a~seyyvr~~~g~~l~~t~a~rwen~lg 165 (412) T COG4370 86 LVQLTLGQILALRKQGPQLERVGVVGDILAVGDIVPLAFAILGGLAYAFVGTAKSEYYVRVEYGLGLPLTGALRWENALG 165 (412) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECCHHHHHHHHHCCCCCCCEEECCCCEEEEECCCCCCCCCHHHHHHHHCC T ss_conf 78852999999987431655335555358851125889986167764303321440566201234776400445543025 Q ss_pred HHHH--H--CCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEE--ECCCCCCCC Q ss_conf 1233--1--026442366415778889986088505872043101234422125689999861798617--504655332 Q gi|254780402|r 174 SFSK--K--IFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTW--AAISTFEGE 247 (440) Q Consensus 174 ~l~~--~--~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~--vagSth~~E 247 (440) ..+. + .-++.-.+++.|..+++.+..-|+.- .-.||.-.|..+++...... ...+.+++ +-||.-++- T Consensus 166 ~~y~pwwlm~~rrc~~vf~rD~~Taq~L~~rgvna-~~vGnpmmD~L~p~~~~~q~-----l~~g~~viaLLPGsR~pea 239 (412) T COG4370 166 AVYMPWWLMLRRRCWAVFPRDALTAQHLANRGVNA-AYVGNPMMDGLPPPERDPQL-----LLTGVPVIALLPGSRVPEA 239 (412) T ss_pred CCHHHHHHHHCCCCEEEECCCCCCHHHHHHCCCCH-HHCCCHHHCCCCCCCCCCHH-----HCCCCCEEEECCCCCCHHH T ss_conf 40105898715520255055531578897469766-65067143069976678134-----2268725886689888178 Q ss_pred C---CCCCCCCCCC-CCCCC--EEEEEEECCCCHHHHHHHHHHHHCCHHHHH-CCCCCCCCCCEEEEECCCCCCHHHHHH Q ss_conf 1---0000000000-12341--248897055211478887765300101100-134444466416750676520255422 Q gi|254780402|r 248 E---DKAVYVHNFI-KCRTD--VLTIIVPRHPRRCDAIERRLIAKGLKVARR-SRGDVINAEVDIFLGDTIGEMGFYLRM 320 (440) Q Consensus 248 E---~~il~a~~~l-~~~~~--~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~-S~~~~~~~~~~V~i~Dt~GeL~~lY~~ 320 (440) + .+++.....+ ....+ ..--|+|--| ..-...+.+..|+.+.-. .+. +..+-++.-+++ .+.... T Consensus 240 ~~nl~~il~slcal~~~~a~vvfw~ai~~~lp--l~~l~~l~e~~gWq~~ad~~~k---dnc~l~lsqqsf---adiLH~ 311 (412) T COG4370 240 QTNLAVILGSLCALPAMFALVVFWAAIAPELP--LLLLWTLEERQGWQPLADRFGK---DNCSLWLSQQSF---ADILHA 311 (412) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC--HHHHHHHHHHCCCCHHHHHHCC---CCEEEEEEHHHH---HHHHHH T ss_conf 76399999998642777788898760376798--7899999985386225666466---763899758889---999988 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHCCCEE----ECCCC-CCHHHHHHHHHHCCCEEEECCHHHHHHHH-HHHHCCHHHHH Q ss_conf 08206613554445458376896199499----98984-43799999999789869957989999999-99958989999 Q gi|254780402|r 321 TEIAFIGRSFCASGGQNPLEAAMLGCAIL----SGPNV-ENFRDIYRRMVSSGAVRIVEEVGTLADMV-YSLLSEPTIRY 394 (440) Q Consensus 321 AdiafVGGSl~~~GGhN~lEpaa~G~pVi----~GP~~-~nf~e~~~~L~~~g~~~~v~d~~eL~~~l-~~ll~d~~~~~ 394 (440) ||.+. | .-|--.=.++-+|+||| .||.+ ..|.+.-..|..+.. ..++.+..-+..+ ..++.||+..+ T Consensus 312 adaal-g-----mAGTAtEQavGLGkPvi~fPg~GPQy~pgFA~rQ~rLLG~sl-tlv~~~aq~a~~~~q~ll~dp~r~~ 384 (412) T COG4370 312 ADAAL-G-----MAGTATEQAVGLGKPVIGFPGQGPQYNPGFAERQQRLLGASL-TLVRPEAQAAAQAVQELLGDPQRLT 384 (412) T ss_pred HHHHH-H-----HCCCHHHHHHCCCCCEEECCCCCCCCCHHHHHHHHHHHCCEE-EECCCCHHHHHHHHHHHHCCHHHHH T ss_conf 99998-7-----544167776336986243689898758179999999852534-5417754568999999844817778 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 999999999970455699999999975 Q gi|254780402|r 395 EMINAAINEVKKMQGPLKITLRSLDSY 421 (440) Q Consensus 395 ~~~~~a~~~i~~~~ga~~~~~~~i~~~ 421 (440) ...+++++-+. +.|+..++.+.+.+. T Consensus 385 air~nGqrRiG-qaGaa~rIAe~l~e~ 410 (412) T COG4370 385 AIRHNGQRRIG-QAGAARRIAEELGEM 410 (412) T ss_pred HHHHCCHHHCC-CCCHHHHHHHHHHHH T ss_conf 88753443316-762378999999874 No 109 >PHA01633 putative glycosyl transferase group 1 Probab=91.51 E-value=0.15 Score=29.84 Aligned_cols=149 Identities=14% Similarity=0.186 Sum_probs=84.3 Q ss_pred HHCCCCCCEEEECCHHHHHHHHHHCCC-CEEEECCCCCCCCCCCCCHHHHHHHHHH----CCC--CCEEECCCCCCCCCC Q ss_conf 310264423664157788899860885-0587204310123442212568999986----179--861750465533210 Q gi|254780402|r 177 KKIFSQFSLVIVQSERYFRRYKELGAQ-KLIVSGNLKIDTESLPCDKELLSLYQES----IAG--RYTWAAISTFEGEED 249 (440) Q Consensus 177 ~~~~~~~~~i~~qs~~~~~rl~~lG~~-~i~v~GnlK~d~~~~~~~~~~~~~~~~~----~~~--r~v~vagSth~~EE~ 249 (440) +.+++.+ ..+..|..+++++...|.+ .+-|-.-+-|..+. ..+.....+++. +.+ +.=++-|-|.....+ T Consensus 88 ~~l~~dv-tfipnsk~sa~nlq~agl~vdlpvfhginfk~ve--kae~lvpqlkqkl~kdfpdt~kfgvvsg~tkrknid 164 (335) T PHA01633 88 KYLLQDV-KFIPNSKFSAENLQEVGLQVDLPVFHGINFKIVE--NAEKLVPQLKQKLDKDFPDTIKFGIVSGLTKRKNMD 164 (335) T ss_pred HHHHHHC-EECCCCCCCHHHHHHCCCCCCCCCCCCCCHHHHH--HHHHHHHHHHHHHHCCCCCCEEEEEEECCCCCCCHH T ss_conf 9987433-7577876435558653731154203255523245--477661899877533488644554664443234456 Q ss_pred CCCCCCCCCCC-CCC----EEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCC-----CCHHHHH Q ss_conf 00000000012-341----2488970552114788877653001011001344444664167506765-----2025542 Q gi|254780402|r 250 KAVYVHNFIKC-RTD----VLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIG-----EMGFYLR 319 (440) Q Consensus 250 ~il~a~~~l~~-~~~----~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~G-----eL~~lY~ 319 (440) +++.+++.|.. .|+ .....+. |- + ..+.+.|. .|-.+-.+| +...+|+ T Consensus 165 l~lq~~n~lntkypd~ak~ihffvis-he----d--------------f~k~evpa---nvhfva~fg~q~~e~i~afy~ 222 (335) T PHA01633 165 LMLQVFNELNTKYPDIAKKIHFFVIS-HK----Q--------------FTQLEVPA---NVHFVAEFGHNSREYIFAFYG 222 (335) T ss_pred HHHHHHHHHHCCCCCHHHCEEEEEEE-HH----H--------------HHHCCCCC---CEEEHHHHCCCCHHHHHHHHH T ss_conf 99999988630188766714799962-67----7--------------65424874---031067647784899999860 Q ss_pred HCCCEEEECCCCCCCCCCHHHHHHHCCCEEEC Q ss_conf 20820661355444545837689619949998 Q gi|254780402|r 320 MTEIAFIGRSFCASGGQNPLEAAMLGCAILSG 351 (440) Q Consensus 320 ~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~G 351 (440) .-|..||-.+- +.-|.-.+|.++.|.|||+- T Consensus 223 amdf~~vpsg~-egfglpvlesmamgtpvihq 253 (335) T PHA01633 223 AMDFTIVPSGT-EGFGMPVLESMAMGTPVIHQ 253 (335) T ss_pred HCEEEECCCCC-CCCCCHHHHHHHCCCHHHHH T ss_conf 03068735886-64582678776037633764 No 110 >TIGR02470 sucr_synth sucrose synthase; InterPro: IPR012820 This entry represents sucrose synthase an enzyme that despite its name, generally uses rather produces sucrose. Sucrose plus UDP (or ADP) becomes D-fructose plus UDP-glucose (or ADP-glucose), which is then available for cell wall (or starch) biosynthesis. The enzyme is homologous to sucrose phosphate synthase, which catalyses the penultimate step in sucrose synthesis. Sucrose synthase is found, so far, exclusively in plants and cyanobacteria.; GO: 0016157 sucrose synthase activity. Probab=91.49 E-value=0.12 Score=30.60 Aligned_cols=332 Identities=15% Similarity=0.174 Sum_probs=159.9 Q ss_pred CCCCEEEEEECCHHHHHHHHH-HHHHHHHCC-------CCEEEEE-------CCCCCCHHHHCCCCCCEEEEE-CC---- Q ss_conf 888779998178999999999-999998619-------9899996-------277630012101477416885-05---- Q gi|254780402|r 54 PIGPLIWFHASSVGETMALIG-LIPAIRSRH-------VNVLLTT-------MTATSAKVARKYLGQYAIHQY-AP---- 113 (440) Q Consensus 54 ~~~~~IW~HaaSvGE~~~a~p-li~~l~~~~-------~~iliT~-------~T~tg~~~~~~~~~~~~~~~y-~P---- 113 (440) ..|-+|.|== =+.|... ++++++.++ |+|||-| .|..+.. .+|..+-...|+. .| T Consensus 281 TGGQVVYiLD----QVRaLE~em~~ri~~~GL~vL~~~PkIlIvTRLiPdA~GT~CnqR-LEKv~Gt~~~~ILRVPFr~~ 355 (790) T TIGR02470 281 TGGQVVYILD----QVRALENEMLERIKLQGLEVLEITPKILIVTRLIPDAEGTTCNQR-LEKVSGTEHAHILRVPFRTE 355 (790) T ss_pred CCCEEEEEEH----HHHHHHHHHHHHHHHCCCCEECCCCEEEEEECCCCCCCCCCCCCE-EECCCCCCCCEEEECCCCCC T ss_conf 8865775104----469999999999987384011565528998501768988605724-52012798531642166434 Q ss_pred --CCCCHHHHHH-----------------HH--HHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEE-CCCCCC------ Q ss_conf --5720134455-----------------74--4202035640476214678887422455058641-266542------ Q gi|254780402|r 114 --LDIQPAVSRF-----------------LK--YWKPDCMILSESDIWPLTVFELSKQRIPQVLVNA-RMSRRS------ 165 (440) Q Consensus 114 --~D~~~~~~~f-----------------l~--~~~P~~~i~~e~ElWPnli~~~~~~~ip~~lina-rls~~S------ 165 (440) -++..++.|| .. .-+||++|--=+| =||++.+=.++..|.-.|- --=||+ T Consensus 356 nG~~~~~WISRFeiWPYLE~FA~D~~~e~~~el~g~PDLIIGNYSD--GNLVA~LLA~kLgVTQC~IAHALEKtKY~~Sd 433 (790) T TIGR02470 356 NGIILRKWISRFEIWPYLETFAEDAEKEILAELQGKPDLIIGNYSD--GNLVASLLARKLGVTQCTIAHALEKTKYPDSD 433 (790) T ss_pred CCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCC--HHHHHHHHHHHCCCCCCCHHHHCCCCCCCCCC T ss_conf 7730036633012472045668999999999846899706732563--58999999853362222033313343676645 Q ss_pred ------CCCCHHHHHHHHHC--CCCCCEEEEC-------CHHHHHHHHH---HCCCCE--EEEC----CCCCCCCCCCCC Q ss_conf ------12100111123310--2644236641-------5778889986---088505--8720----431012344221 Q gi|254780402|r 166 ------FKNWKTVLSFSKKI--FSQFSLVIVQ-------SERYFRRYKE---LGAQKL--IVSG----NLKIDTESLPCD 221 (440) Q Consensus 166 ------~~~~~~~~~l~~~~--~~~~~~i~~q-------s~~~~~rl~~---lG~~~i--~v~G----nlK~d~~~~~~~ 221 (440) -.+|-.=--|+-.+ ++.-|+|++- |.++.-++++ ...+.. .|.| |.||..++|=.+ T Consensus 434 ~yW~~~e~kYHFS~QFTADLIAMN~aDFIITSTYQEIaGt~~~~GQYESh~aFTMP~LYrVV~GIDvFdPKFNiVsPGad 513 (790) T TIGR02470 434 IYWQELEDKYHFSCQFTADLIAMNAADFIITSTYQEIAGTKDSVGQYESHQAFTMPGLYRVVHGIDVFDPKFNIVSPGAD 513 (790) T ss_pred CHHHHCCCCCCCCHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCCCCCCCCCCC T ss_conf 40420257656430578999964158965871122014688778842122110588750476103225877554687887 Q ss_pred HHH--------------HHHHHHHC-C-CCCEEECCCCCCCCCCCCCC---------------CC---CCCCCCCCEEEE Q ss_conf 256--------------89999861-7-98617504655332100000---------------00---000123412488 Q gi|254780402|r 222 KEL--------------LSLYQESI-A-GRYTWAAISTFEGEEDKAVY---------------VH---NFIKCRTDVLTI 267 (440) Q Consensus 222 ~~~--------------~~~~~~~~-~-~r~v~vagSth~~EE~~il~---------------a~---~~l~~~~~~~lI 267 (440) .+. ...+.+++ . ..+-=..|-....+-.++.. ++ +.|++. +-|| T Consensus 514 ~~iYFPY~~~~~Rl~~l~~~ie~LLf~~~~~~~~~G~L~d~~KPiiFsMARLDrvKNlTGLvE~yg~s~~Lr~l--~NLv 591 (790) T TIGR02470 514 ESIYFPYSDKEKRLTSLHPEIEELLFSDEDNDEHIGYLKDPNKPIIFSMARLDRVKNLTGLVELYGRSKKLREL--ANLV 591 (790) T ss_pred CCEECCCCCHHHCCHHHHHHHHHHCCCCCCCCCEEEEECCCCCCCEEEECCCCCCCCCHHHHHHHCCCHHHHHH--CCEE T ss_conf 82003767701046236899998617845873007872076878266306300101347788485048447816--6888 Q ss_pred EE-E-------CCCCHHHHHH---HHHHHHCCHHH-HHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCC Q ss_conf 97-0-------5521147888---77653001011-00134444466416750676520255422082066135544454 Q gi|254780402|r 268 IV-P-------RHPRRCDAIE---RRLIAKGLKVA-RRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGG 335 (440) Q Consensus 268 Iv-P-------RhpeR~~~i~---~~l~~~gl~~~-~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GG 335 (440) || . +..|..++|. +++.+.+|.-. ||=....+.. .-|||-.+-+--.=+||=..|.+.-| T Consensus 592 ~VaG~~~~~~S~DrEE~~Ei~kmh~lid~Y~L~G~iRWi~~q~~~~--------~~GElYR~IAD~~GiFVQPA~yEAFG 663 (790) T TIGR02470 592 VVAGKLDAKESKDREEQAEIEKMHNLIDQYQLHGQIRWIGAQLNRV--------RNGELYRYIADTKGIFVQPALYEAFG 663 (790) T ss_pred EEECCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCC--------CCCEEEEEEECCCCCEECCCHHHHHH T ss_conf 8806568644678789999999999888646697167700006853--------14646764425775033330444532 Q ss_pred CCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEEC--CH----HHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 583768961994999898443799999999789869957--98----999999999958989999999999999970 Q gi|254780402|r 336 QNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVE--EV----GTLADMVYSLLSEPTIRYEMINAAINEVKK 406 (440) Q Consensus 336 hN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~--d~----~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~ 406 (440) -+++|||.||.|..--- +..=.||.+ +..-++-|+ +. +.|++.+.+.-.||+.-++..+.|.+-|.+ T Consensus 664 LTv~EAM~cGLPTFAT~-~GGP~EII~---dgvsGFhIdP~~~~~~A~~l~~Ff~kC~~dP~~W~~iS~~gl~Ri~~ 736 (790) T TIGR02470 664 LTVVEAMTCGLPTFATR-FGGPLEIIQ---DGVSGFHIDPYHGEEAAEKLVDFFEKCDEDPSYWQKISQGGLQRIYE 736 (790) T ss_pred HHHHHHHHCCCHHHHHH-CCCCEEEEE---ECCCCCEECCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 57999863011246762-689636877---28835422675589999999999986203836899999988676664 No 111 >cd01425 RPS2 Ribosomal protein S2 (RPS2), involved in formation of the translation initiation complex, where it might contact the messenger RNA and several components of the ribosome. It has been shown that in Escherichia coli RPS2 is essential for the binding of ribosomal protein S1 to the 30s ribosomal subunit. In humans, most likely in all vertebrates, and perhaps in all metazoans, the protein also functions as the 67 kDa laminin receptor (LAMR1 or 67LR), which is formed from a 37 kDa precursor, and is overexpressed in many tumors. 67LR is a cell surface receptor which interacts with a variety of ligands, laminin-1 and others. It is assumed that the ligand interactions are mediated via the conserved C-terminus, which becomes extracellular as the protein undergoes conformational changes which are not well understood. Specifically, a conserved palindromic motif, LMWWML, may participate in the interactions. 67LR plays essential roles in the adhesion of cells to the basement membrane an Probab=89.64 E-value=0.73 Score=25.35 Aligned_cols=30 Identities=27% Similarity=0.318 Sum_probs=18.6 Q ss_pred CCEEEEEECCCCHHHHHHHHHHCCCCEEEE Q ss_conf 020356404762146788874224550586 Q gi|254780402|r 128 KPDCMILSESDIWPLTVFELSKQRIPQVLV 157 (440) Q Consensus 128 ~P~~~i~~e~ElWPnli~~~~~~~ip~~li 157 (440) .|++++++...-=.+.+.||++.|||++-+ T Consensus 127 ~Pdliiv~d~~~~~~ai~Ea~~l~IPvI~i 156 (193) T cd01425 127 LPDLVIVLDPRKEHQAIREASKLGIPVIAI 156 (193) T ss_pred CCCEEEEECCCCCHHHHHHHHHHCCCEEEE T ss_conf 999899988984168999998618755788 No 112 >cd01149 HutB Hemin binding protein HutB. These proteins have been shown to function as initial receptors in ABC transport of hemin and hemoproteins in many eubacterial species. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. Probab=87.92 E-value=2.1 Score=22.33 Aligned_cols=174 Identities=11% Similarity=0.090 Sum_probs=73.8 Q ss_pred HHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCC---EEEECCHHHHHHHHH Q ss_conf 57442020356404762146788874224550586412665421210011112331026442---366415778889986 Q gi|254780402|r 123 FLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFS---LVIVQSERYFRRYKE 199 (440) Q Consensus 123 fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~---~i~~qs~~~~~rl~~ 199 (440) -+-..+||+++..+...-+..+..+.+.|||++.+++.- +...+..--.....++..=+ .+...-+...+..++ T Consensus 53 ~il~l~PDLVi~~~~~~~~~~~~~L~~~gi~v~~~~~~~---~~~~~~~~i~~lg~i~g~e~~A~~li~~~~~~l~~v~~ 129 (235) T cd01149 53 GVLSLKPTLVIASDEAGPPEALDQLRAAGVPVVTVPSTP---TLDGLLTKIRQVAQALGVPEKGEALAQEVRQRLAALRK 129 (235) T ss_pred HHHHCCCCEEEEECCCCHHHHHHHHHHCCCCEEECCCCC---CHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 997379988998177683999999996299079558988---87899999999745066568999999999999999997 Q ss_pred ----HCC-CCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCC Q ss_conf ----088-505872043101234422125689999861798617504655332100000000001234124889705521 Q gi|254780402|r 200 ----LGA-QKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPR 274 (440) Q Consensus 200 ----lG~-~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpe 274 (440) .+. +++.+.-...-+.....-.......+....+++.+. +.+++-.. +.....+..+||..++ ..+..+ T Consensus 130 ~~~~~~~~~~v~~~~~~~~~~~~~~g~~s~~~~li~~aG~~Nv~---~~~~~~~~--vs~E~i~~~nPDvI~v-~~~~~~ 203 (235) T cd01149 130 TVAAHKKPPRVLFLLSHGGGAAMAAGRNTAADAIIALAGAVNAA---AGFRGYKP--LSAEALIAAQPDVILV-MSRGLD 203 (235) T ss_pred HHHHCCCCCEEEEEEECCCCCCEECCCCCHHHHHHHHCCCCCCC---CCCCCCCC--CCHHHHHHHCCCEEEE-ECCCCC T ss_conf 64021368538999980899755449985589999986894254---56778776--7999999869999999-589876 Q ss_pred HHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCC Q ss_conf 1478887765300101100134444466416750676 Q gi|254780402|r 275 RCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTI 311 (440) Q Consensus 275 R~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~ 311 (440) .......+...-+ |+..+ ...+.+||.+|.. T Consensus 204 ~~~~~~~~~~~p~-----~~~l~-AVKn~rVy~vd~~ 234 (235) T cd01149 204 AVGGVDGLLKLPG-----LAQTP-AGRNKRILAMDDL 234 (235) T ss_pred CHHHHHHHHHCCC-----HHCCC-HHHCCCEEEECCC T ss_conf 4667999983910-----40599-8777978995999 No 113 >COG0595 mRNA degradation ribonucleases J1/J2 (metallo-beta-lactamase superfamily) [Translation, ribosomal structure and biogenesis; Replication, recombination and repair] Probab=87.56 E-value=0.79 Score=25.16 Aligned_cols=18 Identities=11% Similarity=0.224 Sum_probs=7.6 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999752378785 Q gi|254780402|r 411 LKITLRSLDSYVNPLIFQ 428 (440) Q Consensus 411 ~~~~~~~i~~~l~~~~~~ 428 (440) ...+-+.+...|.+.+++ T Consensus 525 ~~~lk~~ir~~l~~~~~~ 542 (555) T COG0595 525 WKELKESIRRALRKFLYK 542 (555) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 899999999999999988 No 114 >COG2242 CobL Precorrin-6B methylase 2 [Coenzyme metabolism] Probab=87.49 E-value=2.2 Score=22.16 Aligned_cols=95 Identities=18% Similarity=0.207 Sum_probs=56.5 Q ss_pred CCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCH Q ss_conf 12341248897055211478887765300101100134444466416750676520255422082066135544454583 Q gi|254780402|r 259 KCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNP 338 (440) Q Consensus 259 ~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~ 338 (440) ...|..+.+=+=|++++.+.+.+.+++.|.+ +..|+-.|..-.|..+= -.|.+||||| .+-+.+ T Consensus 54 ~~~p~~~v~AIe~~~~a~~~~~~N~~~fg~~------------n~~vv~g~Ap~~L~~~~-~~daiFIGGg---~~i~~i 117 (187) T COG2242 54 LAGPSGRVIAIERDEEALELIERNAARFGVD------------NLEVVEGDAPEALPDLP-SPDAIFIGGG---GNIEEI 117 (187) T ss_pred HHCCCCEEEEEECCHHHHHHHHHHHHHHCCC------------CEEEEECCCHHHHCCCC-CCCEEEECCC---CCHHHH T ss_conf 7398855999925888999999999984999------------67999546457636999-9999998798---777899 Q ss_pred HHHHHHC-----CCEEECCCCCCHHHHHHHHHHCCC Q ss_conf 7689619-----949998984437999999997898 Q gi|254780402|r 339 LEAAMLG-----CAILSGPNVENFRDIYRRMVSSGA 369 (440) Q Consensus 339 lEpaa~G-----~pVi~GP~~~nf~e~~~~L~~~g~ 369 (440) +|.+.-- .-|+.=-...+...+.+.+.+.|+ T Consensus 118 le~~~~~l~~ggrlV~naitlE~~~~a~~~~~~~g~ 153 (187) T COG2242 118 LEAAWERLKPGGRLVANAITLETLAKALEALEQLGG 153 (187) T ss_pred HHHHHHHCCCCCEEEEEEECHHHHHHHHHHHHHCCC T ss_conf 999999718687699986008889999999997298 No 115 >pfam05159 Capsule_synth Capsule polysaccharide biosynthesis protein. This family includes export proteins involved in capsule polysaccharide biosynthesis, such as KpsS and LipB. Probab=87.20 E-value=0.96 Score=24.58 Aligned_cols=86 Identities=19% Similarity=0.077 Sum_probs=53.2 Q ss_pred CCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCC Q ss_conf 00123412488970552114788877653001011001344444664167506765202554220820661355444545 Q gi|254780402|r 257 FIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQ 336 (440) Q Consensus 257 ~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGh 336 (440) ..+.+|+..+|+=+ ||..... ..+...++. .. ...+.+...=|.+..+...|+-|++-.|.++ T Consensus 149 fa~~~Pda~lv~K~-HP~~~g~---------~~~~~~~~~--~~-~~~~~~~~~~~~l~~Ll~~~~~VvtvnSt~G---- 211 (268) T pfam05159 149 FAEENPDAQIIIKP-HPDVLGG---------NKYGLITEL--PG-LERVDVIVHDVNLISLLRHVDEVVTITSTVG---- 211 (268) T ss_pred HHHHCCCCEEEEEC-CCCCCCC---------CCCCCCCHH--HC-CCCEEEEECCCCHHHHHHHCCEEEEECCHHH---- T ss_conf 99878898399968-9875467---------764545754--55-6881999369999999985799999655689---- Q ss_pred CHHHHHHHCCCEE-ECCCCCCHHHHH Q ss_conf 8376896199499-989844379999 Q gi|254780402|r 337 NPLEAAMLGCAIL-SGPNVENFRDIY 361 (440) Q Consensus 337 N~lEpaa~G~pVi-~GP~~~nf~e~~ 361 (440) +||+..|+||+ +|.-+++..... T Consensus 212 --~eALl~gkpV~~~G~~fYa~wGLT 235 (268) T pfam05159 212 --FEALLLGKPVITLGNPFYAGWGLT 235 (268) T ss_pred --HHHHHCCCCEEEEEEECCCCCCCC T ss_conf --999985997389401356778775 No 116 >pfam00318 Ribosomal_S2 Ribosomal protein S2. Probab=86.50 E-value=1.8 Score=22.84 Aligned_cols=12 Identities=33% Similarity=0.465 Sum_probs=5.7 Q ss_pred HHHHHHHCCCEE Q ss_conf 376896199499 Q gi|254780402|r 338 PLEAAMLGCAIL 349 (440) Q Consensus 338 ~lEpaa~G~pVi 349 (440) +-|+..+|.|+| T Consensus 153 i~Ea~~l~IP~I 164 (205) T pfam00318 153 IKEASKLGIPVI 164 (205) T ss_pred HHHHHHCCCCEE T ss_conf 999987599756 No 117 >cd06295 PBP1_CelR Ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. This group includes the ligand binding domain of a transcription regulator of cellulose genes, CelR, which is highly homologous to the LacI-GalR family of bacterial transcription regulators. The binding of CelR to the celE promoter is inhibited specifically by cellobiose. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn chang Probab=86.29 E-value=2.6 Score=21.74 Aligned_cols=180 Identities=14% Similarity=0.120 Sum_probs=89.0 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHH Q ss_conf 99999999998619989999627763001210147741688505572013445574420203564047621467888742 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSK 149 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~ 149 (440) ..+..+-+.+.+.+.+++|+++.+... ....+++...+.+.+|++-+..=-..+..+.+ T Consensus 27 ~l~~gie~~~~~~gY~~li~~~~~~~e---------------------~~~~~~l~~~~vdGiIi~~~~~~~~~~~~l~~ 85 (275) T cd06295 27 SLLGGIADALAERGYDLLLSFVSSPDR---------------------DWLARYLASGRADGVILIGQHDQDPLPERLAE 85 (275) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCHH---------------------HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHH T ss_conf 999999999998599899994898689---------------------99999998489988999799899799999995 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHH Q ss_conf 2455058641266542121001111233102644236641----577888998608850587204310123442212568 Q gi|254780402|r 150 QRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELL 225 (440) Q Consensus 150 ~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~ 225 (440) .+||+++++....+... +.|.+- .....++|.+.|-+++...|...-.........-.. T Consensus 86 ~~iPvV~~d~~~~~~~~-----------------~~V~~d~~~a~~~~~~~L~~~G~~~I~~i~~~~~~~~~~~R~~Gf~ 148 (275) T cd06295 86 TGLPFVVWGRPLPGQPY-----------------CYVGSDNVGGGRLATEHLLARGRRRIAFLGGPQDMPEGEERLEGYR 148 (275) T ss_pred CCCCEEEECCCCCCCCC-----------------CEEEECHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHHH T ss_conf 79999999862689999-----------------7898287999999999999809987987058866726999999999 Q ss_pred HHHHHH-CCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCH Q ss_conf 999986-1798617504655332100000000001234124889705521147888776530010 Q gi|254780402|r 226 SLYQES-IAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLK 289 (440) Q Consensus 226 ~~~~~~-~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~ 289 (440) ..+.+. ......++.-+.+..| ...-.+.+.+.+.+..--|++.... -+-.+.+.+.+.|+. T Consensus 149 ~a~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~ai~~~nD~-~A~g~~~~~~~~g~~ 211 (275) T cd06295 149 EALAEAGLPLDPRLVAPGDFTEE-SGRAAMRALLERGPDFDAVFAASDL-MALGALRALREAGRR 211 (275) T ss_pred HHHHHCCCCCCCCEEEECCCCHH-HHHHHHHHHHHCCCCCCEEEECCCH-HHHHHHHHHHHCCCC T ss_conf 99998699999417996577668-7999988898549998703414758-789999999974999 No 118 >pfam01497 Peripla_BP_2 Periplasmic binding protein. This family includes bacterial periplasmic binding proteins. Several of which are involved in iron transport. Probab=85.76 E-value=2.8 Score=21.57 Aligned_cols=177 Identities=14% Similarity=0.157 Sum_probs=78.0 Q ss_pred CHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCH-HHHHHHHHCCCC---CCEEEECCHH Q ss_conf 013445574420203564047621467888742245505864126654212100-111123310264---4236641577 Q gi|254780402|r 117 QPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWK-TVLSFSKKIFSQ---FSLVIVQSER 192 (440) Q Consensus 117 ~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~-~~~~l~~~~~~~---~~~i~~qs~~ 192 (440) .....+. -..+||+++.....-++....+..+.++|++.++.. .++..|. ++..+ ..++.. .+.+...-+. T Consensus 50 ~~n~E~i-~~l~PDlVi~~~~~~~~~~~~~~~~~gipvv~~~~~---~~~~~~~~~i~~l-g~~~g~~~~a~~~~~~~~~ 124 (236) T pfam01497 50 EIDVEKL-AALKPDLIIGSAGIGLTDKAYELLSLIIPTVIFESS---STGEGILEQIRAL-GELLGLEDEAEELVAELDS 124 (236) T ss_pred CCCHHHH-HHCCCCEEEEECCCCCHHHHHHHHHCCCCEEEECCC---CCHHHHHHHHHHH-HHHHCCCHHHHHHHHHHHH T ss_conf 9899999-715999899726767368999999579978994289---9989999999999-9996981789999999999 Q ss_pred HHHHHHH----HCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEE Q ss_conf 8889986----088505872043101234422125689999861798617504655332100000000001234124889 Q gi|254780402|r 193 YFRRYKE----LGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTII 268 (440) Q Consensus 193 ~~~rl~~----lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lII 268 (440) ..+.+++ .+.++..+.....-..............+....+.+.+.-.. .++.-.-+....++..+||..++. T Consensus 125 ~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~i~~~G~~n~~~~~---~~~~~~~vs~E~i~~~nPD~I~~~ 201 (236) T pfam01497 125 AIDAAKARIDSLKPKPVLVFGYADGGGYVVFGSGSYIGDLLDALGGENIAAET---KGSESAPISFENILAADPDVIIVS 201 (236) T ss_pred HHHHHHHHHHCCCCCEEEEEEEECCCCEEEECCCCCHHHHHHHCCCCCCCCCC---CCCCCCCCCHHHHHHHCCCEEEEE T ss_conf 99999987404689839999998899638835886568999982996554445---778888789899998599999997 Q ss_pred EECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECC Q ss_conf 705521147888776530010110013444446641675067 Q gi|254780402|r 269 VPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDT 310 (440) Q Consensus 269 vPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt 310 (440) .+.-+. ...++... -.|+..+ ...+.+||.+|. T Consensus 202 -~~~~~~--~~~~l~~~-----p~~~~l~-AVk~~~V~~vd~ 234 (236) T pfam01497 202 -GRETKT--GVDELKAN-----PLWANLP-AVKNGRVYTLDS 234 (236) T ss_pred -CCCCHH--HHHHHHHC-----CCCCCCC-HHHCCCEEEECC T ss_conf -898557--89998839-----4805689-857898899489 No 119 >PRK08287 cobalt-precorrin-6Y C(15)-methyltransferase; Validated Probab=85.49 E-value=2.8 Score=21.49 Aligned_cols=114 Identities=16% Similarity=0.088 Sum_probs=56.6 Q ss_pred CCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 79861750465533210000000000123412488970552114788877653001011001344444664167506765 Q gi|254780402|r 233 AGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIG 312 (440) Q Consensus 233 ~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~G 312 (440) .+.-+|=.|+=- | .+-+++ .+..|+...+=+=+++++.+-+.+.+++.|..-. +++-.+... T Consensus 30 ~~~vvwDIGaGs-G--svsiEa---a~~~p~~~V~AvE~~~~~~~~i~~N~~~fg~~nv------------~~i~g~Ap~ 91 (186) T PRK08287 30 RAKHLIDVGAGT-G--SVSIEA---ALQFPSLQVTAIERNPAALRLIKENRQRFGCGNI------------DIIPGEAPI 91 (186) T ss_pred CCCEEEEECCCC-C--HHHHHH---HHHCCCCEEEEEECCHHHHHHHHHHHHHHCCCCE------------EEEECCCCH T ss_conf 999999957887-7--899999---9978998899993798999999998997299987------------999377811 Q ss_pred CCHHHHHHCCCEEEECCCCCCCCCCHHHHHHH-----CCCEEECCCCCCHHHHHHHHHHCCC Q ss_conf 20255422082066135544454583768961-----9949998984437999999997898 Q gi|254780402|r 313 EMGFYLRMTEIAFIGRSFCASGGQNPLEAAML-----GCAILSGPNVENFRDIYRRMVSSGA 369 (440) Q Consensus 313 eL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~-----G~pVi~GP~~~nf~e~~~~L~~~g~ 369 (440) .|.. ..|.+|||||= .....+++.+.. |.-|+..--..+..++.+.+.+.|. T Consensus 92 ~l~~---~pD~vFIGGsg--g~l~~il~~~~~~L~~gGriVinavtlet~~~a~~~~~~~~~ 148 (186) T PRK08287 92 TLTG---KADAIFMGGSG--GHLTAIIDWALGHLHPGGRLVLNFILQENLHSALAHLEKIGA 148 (186) T ss_pred HCCC---CCCEEEEECCC--CCHHHHHHHHHHHCCCCCEEEEEEEEHHHHHHHHHHHHHCCC T ss_conf 0357---89849997478--988999999997579998999982608779999999997699 No 120 >pfam05686 DUF821 Arabidopsis thaliana protein of unknown function (DUF821). This family consists of a group of Arabidopsis thaliana proteins with no known function. Probab=85.37 E-value=1.7 Score=22.90 Aligned_cols=78 Identities=14% Similarity=0.138 Sum_probs=53.4 Q ss_pred HHHCCCEE-ECCCCCCHHHHHHHHHHCCCEEEEC---CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCC---HHHHH Q ss_conf 96199499-9898443799999999789869957---98999999999958989999999999999970455---69999 Q gi|254780402|r 342 AMLGCAIL-SGPNVENFRDIYRRMVSSGAVRIVE---EVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQG---PLKIT 414 (440) Q Consensus 342 aa~G~pVi-~GP~~~nf~e~~~~L~~~g~~~~v~---d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~g---a~~~~ 414 (440) ++|++.|+ .||.++.|= +..|.-..--..|+ +-.++..+|...-+++++++++|++|++|+.++-- ..+-- T Consensus 235 LaCdSlvl~v~~~yyeFF--y~~L~P~~HY~Pi~~~~~~~dl~~~v~W~~~hd~~Aq~Ia~~g~~Fi~e~L~m~~VycY~ 312 (396) T pfam05686 235 LACDSVVLHVGPEYYDFF--YRGLQPWVHYWPIKSDDDCRDLKEAVDWGNAHDDKAQEIAERGSQFIRENLKMDDVYDYM 312 (396) T ss_pred EECCCEEEECCCCHHHHH--HCCCCCCCCEECCCCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 205744784277055444--325665310210467652778999999988688999999999999999852702789999 Q ss_pred HHHHHHH Q ss_conf 9999975 Q gi|254780402|r 415 LRSLDSY 421 (440) Q Consensus 415 ~~~i~~~ 421 (440) ..+|+.| T Consensus 313 ~~LL~eY 319 (396) T pfam05686 313 FHLLTEY 319 (396) T ss_pred HHHHHHH T ss_conf 9999999 No 121 >PRK00377 cbiT cobalt-precorrin-6Y C(15)-methyltransferase; Provisional Probab=85.07 E-value=3 Score=21.36 Aligned_cols=94 Identities=12% Similarity=0.098 Sum_probs=50.9 Q ss_pred CCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHH Q ss_conf 41248897055211478887765300101100134444466416750676520255422082066135544454583768 Q gi|254780402|r 262 TDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEA 341 (440) Q Consensus 262 ~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEp 341 (440) +...+.-+=|.+++.+.+++.+++.|+. .+.+++-.|....|..+-...|.+|||||-... ..++|. T Consensus 64 ~~~~V~aIE~~~e~~~~~~~N~~kfg~~-----------~nv~~i~G~ap~~l~~l~p~pD~vFIGG~~g~l--~~il~~ 130 (198) T PRK00377 64 EGGKVYAIDKDELAVELTKKNAEKFGVG-----------DNVVLIEGEAPEVLPKLNPKSDRYFIGGGGEEL--PEIIQA 130 (198) T ss_pred CCCEEEEEECCHHHHHHHHHHHHHCCCC-----------CCEEEEECCHHHHHHCCCCCCCEEEEECCCCCH--HHHHHH T ss_conf 7875999967888999999999980999-----------885999525488772089988989997887778--999999 Q ss_pred HHH-----CCCEEECCCCCCHHHHHHHHHHCC Q ss_conf 961-----994999898443799999999789 Q gi|254780402|r 342 AML-----GCAILSGPNVENFRDIYRRMVSSG 368 (440) Q Consensus 342 aa~-----G~pVi~GP~~~nf~e~~~~L~~~g 368 (440) +.. |.-|+..--..+..++.+.|.+.| T Consensus 131 ~~~~L~~gGriVinaVtlet~~~~~~~l~~~~ 162 (198) T PRK00377 131 ALEKIGKGGRIVADAILLESLNKALSALEELG 162 (198) T ss_pred HHHHCCCCCEEEEEEECHHHHHHHHHHHHHCC T ss_conf 99857999899998362988999999999769 No 122 >PRK07402 precorrin-6B methylase; Provisional Probab=81.64 E-value=2 Score=22.53 Aligned_cols=89 Identities=16% Similarity=0.116 Sum_probs=50.8 Q ss_pred CCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 79861750465533210000000000123412488970552114788877653001011001344444664167506765 Q gi|254780402|r 233 AGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIG 312 (440) Q Consensus 233 ~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~G 312 (440) .+.-+|=.|. |-=.+-+++. +..|+...+=+=|++++.+-+.+.+++.|+. +.+++-.|... T Consensus 40 ~~~~vwDIGa---GtGsVsiEaa---~~~~~g~V~AIE~~~~a~~li~~N~~rfgv~------------nv~ii~g~ape 101 (196) T PRK07402 40 PDSVLWDIGA---GTGTIPVEAA---LLCPKGRVIAIERDEEVANLIRRNCDRFGVK------------NVEVIEGSAPE 101 (196) T ss_pred CCCEEEEECC---CCCHHHHHHH---HHCCCCEEEEEECCHHHHHHHHHHHHHHCCC------------CEEEEECCCHH T ss_conf 9999999478---8779999999---8789988999976888999999989972999------------87999726366 Q ss_pred CCHHHHHHCCCEEEECCCCCCCCCCHHHHH Q ss_conf 202554220820661355444545837689 Q gi|254780402|r 313 EMGFYLRMTEIAFIGRSFCASGGQNPLEAA 342 (440) Q Consensus 313 eL~~lY~~AdiafVGGSl~~~GGhN~lEpa 342 (440) .|..+-..-|.+|||||- +-..++|.+ T Consensus 102 ~L~~l~p~pD~vFIGGg~---~l~~il~~~ 128 (196) T PRK07402 102 CLAQLAPAPDRICIEGGR---PIKEILQAV 128 (196) T ss_pred HHHHCCCCCCEEEECCCC---CHHHHHHHH T ss_conf 684089999999984896---889999999 No 123 >TIGR02398 gluc_glyc_Psyn glucosylglycerol-phosphate synthase; InterPro: IPR012764 Glucosylglycerol-phosphate synthase catalyzes the key step in the biosynthesis of the osmolyte glucosylglycerol. It is known in several cyanobacteria and in Pseudomonas anguilliseptica. The enzyme is closely related to the alpha,alpha-trehalose-phosphate synthase, likewise involved in osmolyte biosynthesis, of Escherichia coli and many other bacteria. A close homolog from Xanthomonas campestris is excluded from this family.; GO: 0016758 transferase activity transferring hexosyl groups, 0051473 glucosylglycerol biosynthetic process. Probab=80.39 E-value=0.27 Score=28.23 Aligned_cols=165 Identities=16% Similarity=0.214 Sum_probs=81.4 Q ss_pred CCHHHHHHHHHHCC-CCCEEECCCC---CCCCCCCCCCCCCCCCCCCCE----EEEEE--E--CCCCHHHHHHHHHHHH- Q ss_conf 21256899998617-9861750465---533210000000000123412----48897--0--5521147888776530- Q gi|254780402|r 220 CDKELLSLYQESIA-GRYTWAAIST---FEGEEDKAVYVHNFIKCRTDV----LTIIV--P--RHPRRCDAIERRLIAK- 286 (440) Q Consensus 220 ~~~~~~~~~~~~~~-~r~v~vagSt---h~~EE~~il~a~~~l~~~~~~----~lIIv--P--RhpeR~~~i~~~l~~~- 286 (440) .-++....+++.++ +++.++-+|- -+|-+|+++.-.++|.++|++ .|++| | ....=-++.+..+++. T Consensus 274 ~~~e~~e~ir~el~kg~KLILSa~RvDYtKG~~E~L~ayERLLErrpeL~gkV~Lv~acv~AA~gM~iY~e~Q~~IE~~v 353 (495) T TIGR02398 274 SVRERIEKIREELAKGRKLILSAERVDYTKGIKELLVAYERLLERRPELLGKVVLVVACVKAASGMKIYEELQSEIERLV 353 (495) T ss_pred CHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHHHHHHHCCCCCCCCCEEEEEEEHHHCCCCEEHHHHHHHHHHHH T ss_conf 47899999999860485268850100110441577888887522582305413402444266067772687668899860 Q ss_pred CCHHHHHCCCCCCCCCCEEEEEC---CCCCCHHHHHHCCCEEEECCCCCCCCCCHH--HH-HH----HCCCEEECCCCCC Q ss_conf 01011001344444664167506---765202554220820661355444545837--68-96----1994999898443 Q gi|254780402|r 287 GLKVARRSRGDVINAEVDIFLGD---TIGEMGFYLRMTEIAFIGRSFCASGGQNPL--EA-AM----LGCAILSGPNVEN 356 (440) Q Consensus 287 gl~~~~~S~~~~~~~~~~V~i~D---t~GeL~~lY~~AdiafVGGSl~~~GGhN~l--Ep-aa----~G~pVi~GP~~~n 356 (440) |===.|.++.+-.+ |.+.- .+-||..||..|||+-|-. | +-|-|.. |= +| =|.-|++ .+- T Consensus 354 GRINGRFa~~~W~P----~~fftr~lpyEE~~awf~~ADv~WITP-L--RDGLNLVAKEyvaA~g~~dGrGVLv---LSE 423 (495) T TIGR02398 354 GRINGRFARVDWTP----VRFFTRSLPYEELVAWFAAADVAWITP-L--RDGLNLVAKEYVAARGLRDGRGVLV---LSE 423 (495) T ss_pred CCCCCEECCCCCEE----HHHHHHCCCHHHHHHHHHCCCCEECCC-C--CCCHHHHHHHHHHHCCCCCCCEEEE---EEC T ss_conf 64244031346540----104421166789999974143112067-6--3421478899999515889966898---511 Q ss_pred HHHHHHHHHHCCCEEEE-CCHHHHHHHHHHHHC--CHHHHHHH Q ss_conf 79999999978986995-798999999999958--98999999 Q gi|254780402|r 357 FRDIYRRMVSSGAVRIV-EEVGTLADMVYSLLS--EPTIRYEM 396 (440) Q Consensus 357 f~e~~~~L~~~g~~~~v-~d~~eL~~~l~~ll~--d~~~~~~~ 396 (440) |....-+|. ||+.+- .++.++.+.+..-|+ ..|.+++| T Consensus 424 FaGaaveL~--gA~LTNPy~~~rMde~i~~ALaMP~~Eqq~Rm 464 (495) T TIGR02398 424 FAGAAVELK--GAVLTNPYDAKRMDEAIEVALAMPKEEQQRRM 464 (495) T ss_pred CCCCCEEEC--CCEECCCCCCCCHHHHHHHHHHCCHHHHHHHH T ss_conf 125401216--64312888755357999998506388999999 No 124 >pfam00852 Glyco_transf_10 Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10. Probab=77.68 E-value=5.1 Score=19.82 Aligned_cols=49 Identities=18% Similarity=0.243 Sum_probs=37.3 Q ss_pred CCCEEECCCCCCHHHHHHHHHHCCCEEEE---CCHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 99499989844379999999978986995---798999999999958989999999 Q gi|254780402|r 345 GCAILSGPNVENFRDIYRRMVSSGAVRIV---EEVGTLADMVYSLLSEPTIRYEMI 397 (440) Q Consensus 345 G~pVi~GP~~~nf~e~~~~L~~~g~~~~v---~d~~eL~~~l~~ll~d~~~~~~~~ 397 (440) -+||+.||.-.|+.. +.-.+..+-| .++++|++.|..|-+|++...+.= T Consensus 256 ~VPVv~G~~r~~y~~----~~Pp~SfI~v~DF~s~~~La~yL~~Ld~n~~~Y~~Yf 307 (353) T pfam00852 256 AVPVVLGPPRANYEN----FVPPDSFIHVDDFASPKELAEYLKSLDKNDDEYRRYF 307 (353) T ss_pred CEEEEECCCCCCHHH----HCCCCCEEECCCCCCHHHHHHHHHHHHCCHHHHHHHH T ss_conf 653997986115856----0899877873028999999999999808999999998 No 125 >PRK08261 fabG 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional Probab=77.37 E-value=5.5 Score=19.61 Aligned_cols=50 Identities=24% Similarity=0.371 Sum_probs=23.6 Q ss_pred CCCCCHHHHHHHHCCCCCCC-----C-----CCCEEEEEECCHHHHHHHHHHHHHHHHCCCCEEE Q ss_conf 18440011245405677778-----8-----8877999817899999999999999861998999 Q gi|254780402|r 34 FNRERGRKFGERLGYPTALR-----P-----IGPLIWFHASSVGETMALIGLIPAIRSRHVNVLL 88 (440) Q Consensus 34 ~~k~~~~~~~eR~g~~~~~~-----~-----~~~~IW~HaaSvGE~~~a~pli~~l~~~~~~ili 88 (440) -+....+.+..++|.+.+.. + .|+ +.+-++ ...+.++.+.|...+..+.. T Consensus 9 ~~~~~g~~l~~~lglp~p~~l~r~~~~~~~~~g~-~l~~~~----~~~~~~~~~~l~~~~~~~~~ 68 (447) T PRK08261 9 VNSGPGRFLAKALGLPQPVPLRRYQPGQPPLEGP-VLIGGA----GRLAEALAAALAGLGADVYN 68 (447) T ss_pred HCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCC-EEEECC----CHHHHHHHHHHHHCCCEEEE T ss_conf 4882789999975998597776789999888786-799126----14589999999746840142 No 126 >cd06299 PBP1_LacI_like_13 Ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This group includes the ligand-binding domain of DNA-binding regulatory protein from Corynebacterium glutamicum which has a unique ability to produce significant amounts of L-glutamate directly from cheap sugar and ammonia. This regulatory protein is a member of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial trans Probab=76.41 E-value=5.8 Score=19.44 Aligned_cols=105 Identities=13% Similarity=0.134 Sum_probs=65.4 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHH Q ss_conf 99999999999861998999962776300121014774168850557201344557442020356404762146788874 Q gi|254780402|r 69 TMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELS 148 (440) Q Consensus 69 ~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~ 148 (440) ...+..+-+.+.+.+.++++.++.. ..+.-.+ .-+.+...+++.+|+..++.-+..+..+. T Consensus 15 ~~~~~gi~~~~~~~gy~~ll~~s~~-~~~~e~~------------------~l~~l~~~~vDGiIi~~~~~~~~~~~~l~ 75 (265) T cd06299 15 ASLATAIQDAASAAGYSTIIGNSDE-NPETENR------------------YLDNLLSQRVDGIIVVPHEQSAEQLEDLL 75 (265) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCC-CHHHHHH------------------HHHHHHHCCCCEEEEEECCCCHHHHHHHH T ss_conf 9999999999998599899997899-9899999------------------99999947999999980679989999999 Q ss_pred HCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHHHHHHHCCCCEEEEC Q ss_conf 22455058641266542121001111233102644236641----57788899860885058720 Q gi|254780402|r 149 KQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFRRYKELGAQKLIVSG 209 (440) Q Consensus 149 ~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~rl~~lG~~~i~v~G 209 (440) +.++|+++++....+.. ++.|..- .....++|.+.|-+++-+.| T Consensus 76 ~~~iPvV~id~~~~~~~-----------------~~~V~~d~~~~~~~a~~~L~~~G~~~i~~i~ 123 (265) T cd06299 76 KRGIPVVFVDREITGSP-----------------IPFVTSDPQPGMTEAVSLLVALGHKKIGYIS 123 (265) T ss_pred HCCCCEEEECCCCCCCC-----------------CCEEEECHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 66998998378689998-----------------8689967699999999999974998388604 No 127 >COG0380 OtsA Trehalose-6-phosphate synthase [Carbohydrate transport and metabolism] Probab=76.26 E-value=5.9 Score=19.41 Aligned_cols=275 Identities=13% Similarity=0.185 Sum_probs=130.8 Q ss_pred HHHHHHHHCCEEEEEE-CCCCH--HHHHHHHHHCCCCEEEEEECCCCC-CCCCCHHHHHHHHHCCCCCCEEEECCHHHHH Q ss_conf 4455744202035640-47621--467888742245505864126654-2121001111233102644236641577888 Q gi|254780402|r 120 VSRFLKYWKPDCMILS-ESDIW--PLTVFELSKQRIPQVLVNARMSRR-SFKNWKTVLSFSKKIFSQFSLVIVQSERYFR 195 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~-e~ElW--Pnli~~~~~~~ip~~linarls~~-S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~ 195 (440) +.+..+.++|.-+||+ ++-|+ |.|+++..-...=.+..-+-+... =|+-..+-..+.+.++. .|.|.-|++.+++ T Consensus 138 Ad~i~~~~~~gDiIWVhDYhL~L~P~mlR~~~~~~~IgfFlHiPfPssEvfr~lP~r~eIl~gll~-~dligFqt~~y~~ 216 (486) T COG0380 138 ADKIVEIYEPGDIIWVHDYHLLLVPQMLRERIPDAKIGFFLHIPFPSSEVFRCLPWREEILEGLLG-ADLIGFQTESYAR 216 (486) T ss_pred HHHHHHHCCCCCEEEEEECHHHHHHHHHHHHCCCCEEEEEEECCCCCHHHHHHCCHHHHHHHHHHC-CCEEEECCHHHHH T ss_conf 999997467899899970416666899997488765898896799998998657309999987532-7866853778999 Q ss_pred HHHHH-----CC---CCEEEE---------CCC-------CCCCC-CCCCCHHHHHHHHHHCCC-CCEEECCC-C--CCC Q ss_conf 99860-----88---505872---------043-------10123-442212568999986179-86175046-5--533 Q gi|254780402|r 196 RYKEL-----GA---QKLIVS---------GNL-------KIDTE-SLPCDKELLSLYQESIAG-RYTWAAIS-T--FEG 246 (440) Q Consensus 196 rl~~l-----G~---~~i~v~---------Gnl-------K~d~~-~~~~~~~~~~~~~~~~~~-r~v~vagS-t--h~~ 246 (440) +|.++ |. ..+... +.. .|+.. ..+.-......+++..++ +++++.-- + -.| T Consensus 217 nF~~~~~r~~~~~~~~~~~~~~~~~~~v~v~a~PIgID~~~~~~~~~~~~v~~~~~el~~~~~~~~kiivgvDRlDy~kG 296 (486) T COG0380 217 NFLDLCSRLLGVTGDADIRFNGADGRIVKVGAFPIGIDPEEFERALKSPSVQEKVLELKAELGRNKKLIVGVDRLDYSKG 296 (486) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCEEEEEEEEEECCHHHHHHHHCCCCHHHHHHHHHHHHCCCCEEEEEEHHCCCCCC T ss_conf 99999998636554443202466786578989940148899987616873156799999984688669997401012468 Q ss_pred CCCCCCCCCCCCCCCCCE-----EEEEEECCCCHHHHHHHHHHHHCCHHHH-HCCCCCCCCCCEEEEEC---CCCCCHHH Q ss_conf 210000000000123412-----4889705521147888776530010110-01344444664167506---76520255 Q gi|254780402|r 247 EEDKAVYVHNFIKCRTDV-----LTIIVPRHPRRCDAIERRLIAKGLKVAR-RSRGDVINAEVDIFLGD---TIGEMGFY 317 (440) Q Consensus 247 EE~~il~a~~~l~~~~~~-----~lIIvPRhpeR~~~i~~~l~~~gl~~~~-~S~~~~~~~~~~V~i~D---t~GeL~~l 317 (440) --+-+....++|..+|+. .+-|+|---++.++.+++..+..-.+.+ -++...++- +.|...+ ..-++..+ T Consensus 297 i~~rl~Afe~lL~~~Pe~~~kvvliQi~~pSr~~v~~y~~~~~~i~~~V~rIN~~fG~~~~-~Pv~~l~~~~~~~~l~al 375 (486) T COG0380 297 IPQRLLAFERLLEEYPEWRGKVVLLQIAPPSREDVEEYQALRLQIEELVGRINGEFGSLSW-TPVHYLHRDLDRNELLAL 375 (486) T ss_pred CHHHHHHHHHHHHHCHHHHCCEEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCC-CEEEEEECCCCHHHHHHH T ss_conf 3899999999998585652746899841777556388999999999999998750078886-416888625898899999 Q ss_pred HHHCCCEEEECCCCCCCCCCH--HHHHHHCC---CEEECCCCCCHHHHHHHHHHCCCEEEEC--CHHHHHHHHHHHHC-C Q ss_conf 422082066135544454583--76896199---4999898443799999999789869957--98999999999958-9 Q gi|254780402|r 318 LRMTEIAFIGRSFCASGGQNP--LEAAMLGC---AILSGPNVENFRDIYRRMVSSGAVRIVE--EVGTLADMVYSLLS-E 389 (440) Q Consensus 318 Y~~AdiafVGGSl~~~GGhN~--lEpaa~G~---pVi~GP~~~nf~e~~~~L~~~g~~~~v~--d~~eL~~~l~~ll~-d 389 (440) |+.||+.+|- |+ +.|.|. .|-.++-- .++. .+.|...+..|.+ +..|+ |.++.++++..-|+ . T Consensus 376 ~~~aDv~lVt-pl--rDGMNLvakEyVa~q~~~~G~Li---LSeFaGaa~~L~~---AliVNP~d~~~va~ai~~AL~m~ 446 (486) T COG0380 376 YRAADVMLVT-PL--RDGMNLVAKEYVAAQRDKPGVLI---LSEFAGAASELRD---ALIVNPWDTKEVADAIKRALTMS 446 (486) T ss_pred HHHHCEEEEC-CC--CCCCCHHHHHHHHHHCCCCCCEE---EECCCCCHHHHCC---CEEECCCCHHHHHHHHHHHHCCC T ss_conf 9630344532-33--45420778889986147897289---8526565556433---87678887699999999984599 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 8999999999999997 Q gi|254780402|r 390 PTIRYEMINAAINEVK 405 (440) Q Consensus 390 ~~~~~~~~~~a~~~i~ 405 (440) ++++++--+.-++.+. T Consensus 447 ~eEr~~r~~~~~~~v~ 462 (486) T COG0380 447 LEERKERHEKLLKQVL 462 (486) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999999999999998 No 128 >pfam03033 Glyco_transf_28 Glycosyltransferase family 28 N-terminal domain. The glycosyltransferase family 28 includes monogalactosyldiacylglycerol synthase (EC 2.4.1.46) and UDP-N-acetylglucosamine transferase (EC 2.4.1.-). This N-terminal domain contains the acceptor binding site and likely membrane association site. This family also contains a large number of proteins that probably have quite distinct activities. Probab=75.75 E-value=6.1 Score=19.33 Aligned_cols=93 Identities=18% Similarity=0.133 Sum_probs=60.9 Q ss_pred ECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCH-HHHCCCCCCEEEEECCCCC-----C-------------HHHHHH Q ss_conf 17899999999999999861998999962776300-1210147741688505572-----0-------------134455 Q gi|254780402|r 63 ASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAK-VARKYLGQYAIHQYAPLDI-----Q-------------PAVSRF 123 (440) Q Consensus 63 aaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~-~~~~~~~~~~~~~y~P~D~-----~-------------~~~~~f 123 (440) +-|-|-+.-+..+.++|+++++++.+. |..+.+ .+.+ .......+|.+. . ..+.++ T Consensus 6 GGTGGHv~Palala~~L~~~g~~v~ig--t~~~~e~~v~~---~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (136) T pfam03033 6 GGTRGHVFPAVALAWALRRRGHEVRLG--TPPGLEEFVEE---AGLPFVPIGGDGLRRKSLKNLKEPLEGGRALRQAKEI 80 (136) T ss_pred CCCHHHHHHHHHHHHHHHHCCCEEEEC--CCCCHHHHHHH---CCCEEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 415799999999999999859977121--58028888753---5981899627985467599999999999999999999 Q ss_pred HHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEEC Q ss_conf 7442020356404762146788874224550586412 Q gi|254780402|r 124 LKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNAR 160 (440) Q Consensus 124 l~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linar 160 (440) ++..+|+.++-+.+=.-.-.+..++..++|+++-... T Consensus 81 l~~~kp~~vig~GGy~s~p~~~aa~~~~ip~~ihEqN 117 (136) T pfam03033 81 LKEFKPDLVIGFGGYVAVPALIAAPLAGIPLIVHEQN 117 (136) T ss_pred HHHCCCCEEECCCCCCCHHHHHHHHHCCCCEEEECCC T ss_conf 9856998897438854228999999839988998798 No 129 >cd06292 PBP1_LacI_like_10 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=75.01 E-value=6.4 Score=19.20 Aligned_cols=109 Identities=17% Similarity=0.106 Sum_probs=65.6 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEEC-----CCCHHHH Q ss_conf 99999999999861998999962776300121014774168850557201344557442020356404-----7621467 Q gi|254780402|r 69 TMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSE-----SDIWPLT 143 (440) Q Consensus 69 ~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e-----~ElWPnl 143 (440) ...+..+-+.+.+.+..+++.++... .+.-++ .+. .+...+++.+|++. .+..+.. T Consensus 15 ~~l~~gie~~~~~~Gy~~li~~s~~~-~~~e~~-----------------~i~-~l~~~~vdGiIi~~~~~~~~~~~~~~ 75 (273) T cd06292 15 PAFAEAIEAALAQYGYTVLLCNTYRG-GVSEAD-----------------YVE-DLLARGVRGVVFISSLHADTHADHSH 75 (273) T ss_pred HHHHHHHHHHHHHCCCEEEEEECCCC-HHHHHH-----------------HHH-HHHHCCCCEEEEECCCCCCCCCCHHH T ss_conf 99999999999986998999978999-799999-----------------999-99964998499924766633157899 Q ss_pred HHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCH----HHHHHHHHHCCCCEEEE-CCCC Q ss_conf 888742245505864126654212100111123310264423664157----78889986088505872-0431 Q gi|254780402|r 144 VFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSE----RYFRRYKELGAQKLIVS-GNLK 212 (440) Q Consensus 144 i~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~----~~~~rl~~lG~~~i~v~-GnlK 212 (440) +..+.+.++|+++++....+.. .++.|..-+. .-.+++.+.|.+++-+. |+.. T Consensus 76 ~~~~~~~~iPvV~~d~~~~~~~----------------~~~~V~~Dn~~a~~~a~~~L~~~G~~~i~~i~~~~~ 133 (273) T cd06292 76 YERLAERGLPVVLVNGRAPPPL----------------KVPHVSTDDALAMRLAVRHLVALGHRRIGFASGPGR 133 (273) T ss_pred HHHHHHCCCCEEEEECCCCCCC----------------CCCEEEECHHHHHHHHHHHHHHHCCCEEEEECCCCC T ss_conf 9999966998899933579988----------------874899688999999999999859980899517877 No 130 >pfam04016 DUF364 Domain of unknown function (DUF364). Archaeal domain of unknown function. Probab=74.84 E-value=4.5 Score=20.20 Aligned_cols=69 Identities=16% Similarity=0.095 Sum_probs=41.1 Q ss_pred HHHHHHCCCEEEEC-CCCCCCCCCHHHHHH-HCCCEEECCCCCCHHHHHHH-HHHCCCEEEECCHHHHHHHH Q ss_conf 25542208206613-554445458376896-19949998984437999999-99789869957989999999 Q gi|254780402|r 315 GFYLRMTEIAFIGR-SFCASGGQNPLEAAM-LGCAILSGPNVENFRDIYRR-MVSSGAVRIVEEVGTLADMV 383 (440) Q Consensus 315 ~~lY~~AdiafVGG-Sl~~~GGhN~lEpaa-~G~pVi~GP~~~nf~e~~~~-L~~~g~~~~v~d~~eL~~~l 383 (440) ..+..-||++|+-| |+++..--.+|+-+. ...-|++||...-.++...+ -++.=++..|.|++.+.+.+ T Consensus 157 ~~lLp~~D~viITgstlvN~Tl~~lL~~~~~~~~vvl~GPS~p~~P~~l~~~Gv~~lag~~V~d~d~~~~~i 228 (229) T pfam04016 157 EELLPEADVVIITGSTLVNGTLERLLALARKAAEVVLVGPSAPLLPEALFGYGVDVLAGVRVTDPDKLLRAI 228 (229) T ss_pred HHHCCCCCEEEEEECHHHCCCHHHHHHHCCCCCEEEEECCCCCCCHHHHHHCCCCEEEEEEEECHHHHHHHH T ss_conf 875124889999840354189899997476587799989986357789963897678677871899998854 No 131 >TIGR02884 spore_pdaA delta-lactam-biosynthetic de-N-acetylase; InterPro: IPR014235 Muramic delta-lactam is an unusual constituent of peptidoglycan, found only in bacterial spores, in the peptidoglycan wall or spore cortex. This entry represents a subset of the larger polysaccharide deacetylase family that is specifically involved in delta-lactam biosynthesis. PdaA from Bacillus subtilis acts immediately after CwlD, an N-acetylmuramoyl-L-alanine amidase and performs a de-N-acetylation. PdaA may also perform the subsequent transpeptidation that leads to lactam ring formation, as heterologous expression in Escherichia coli of CwlD and PdaA together is sufficient for delta-lactam production.. Probab=74.22 E-value=2.3 Score=22.06 Aligned_cols=99 Identities=16% Similarity=0.235 Sum_probs=57.0 Q ss_pred HHHHHH----HHCCCCEEEEC--CCCCCCCCCCCCHHHHHHHHHHCCCCCE-EECC-CCCCCCCCCCC-CCC-CCCCC-C Q ss_conf 888998----60885058720--4310123442212568999986179861-7504-65533210000-000-00012-3 Q gi|254780402|r 193 YFRRYK----ELGAQKLIVSG--NLKIDTESLPCDKELLSLYQESIAGRYT-WAAI-STFEGEEDKAV-YVH-NFIKC-R 261 (440) Q Consensus 193 ~~~rl~----~lG~~~i~v~G--nlK~d~~~~~~~~~~~~~~~~~~~~r~v-~vag-Sth~~EE~~il-~a~-~~l~~-~ 261 (440) +.++|+ ++-..--.||| ..||=++|.=.=.+.--...+.+|-+.| |=+| .=|.-++.-=- .|+ +.+++ | T Consensus 107 ~~ek~~~El~~V~e~~~~vTG~~~~~YlRPPrG~fSE~tL~~t~~LGY~~vFWSlAy~DW~~D~Q~G~~~A~~~im~~~H 186 (225) T TIGR02884 107 DDEKFKEELDGVEEEFKEVTGKKEMKYLRPPRGKFSERTLAYTKELGYKTVFWSLAYKDWEVDKQKGKEYAYKQIMKKIH 186 (225) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCEECCCCCCCHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCHHHHHHHHHHHCC T ss_conf 48899999888998876405762770007878754057899999748821100122026776987787778877754157 Q ss_pred CCEEEEE--E-ECCCCHHHHHHHHHHHHCCHHH Q ss_conf 4124889--7-0552114788877653001011 Q gi|254780402|r 262 TDVLTII--V-PRHPRRCDAIERRLIAKGLKVA 291 (440) Q Consensus 262 ~~~~lII--v-PRhpeR~~~i~~~l~~~gl~~~ 291 (440) |...++| | .--.+=.++|.+.|++.|..+. T Consensus 187 pGai~LLHAVS~dNA~aLd~~i~~lk~~GY~Fk 219 (225) T TIGR02884 187 PGAILLLHAVSKDNAEALDEIIKDLKEQGYTFK 219 (225) T ss_pred CHHHHHHHCCCHHHHHHHHHHHHHHHHCCCEEC T ss_conf 024133231474589999999999986597503 No 132 >COG0373 HemA Glutamyl-tRNA reductase [Coenzyme metabolism] Probab=73.78 E-value=6.8 Score=19.01 Aligned_cols=89 Identities=13% Similarity=0.081 Sum_probs=52.7 Q ss_pred HCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECC Q ss_conf 61798617504655332100000000001234124889705521147888776530010110013444446641675067 Q gi|254780402|r 231 SIAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDT 310 (440) Q Consensus 231 ~~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt 310 (440) .+.++.+++.| -||-... .+ +.|....-..++|+-|-.+|+.++++.+.. . +.+ T Consensus 175 ~L~~~~vlvIG---AGem~~l-va-~~L~~~g~~~i~IaNRT~erA~~La~~~~~---~------------------~~~ 228 (414) T COG0373 175 SLKDKKVLVIG---AGEMGEL-VA-KHLAEKGVKKITIANRTLERAEELAKKLGA---E------------------AVA 228 (414) T ss_pred CCCCCEEEEEC---CCHHHHH-HH-HHHHHCCCCEEEEECCCHHHHHHHHHHHCC---E------------------EEC T ss_conf 54467699986---5189999-99-999858987799975878999999998387---0------------------221 Q ss_pred CCCCHHHHHHCCCEEEE-CCCCCCCCCCHHHHHHHC Q ss_conf 65202554220820661-355444545837689619 Q gi|254780402|r 311 IGEMGFYLRMTEIAFIG-RSFCASGGQNPLEAAMLG 345 (440) Q Consensus 311 ~GeL~~lY~~AdiafVG-GSl~~~GGhN~lEpaa~G 345 (440) +-||..++.-+|++|.+ ||-.+.=++..+|.+... T Consensus 229 l~el~~~l~~~DvVissTsa~~~ii~~~~ve~a~~~ 264 (414) T COG0373 229 LEELLEALAEADVVISSTSAPHPIITREMVERALKI 264 (414) T ss_pred HHHHHHHHHHCCEEEEECCCCCCCCCHHHHHHHHHC T ss_conf 877887652079999906998554078889988741 No 133 >cd06296 PBP1_CatR_like Ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation. This group includes the ligand-binding domain of a LacI-like transcriptional regulator, CatR which is involved in catechol degradation. This group belongs to the the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=73.70 E-value=6.8 Score=18.99 Aligned_cols=105 Identities=19% Similarity=0.165 Sum_probs=63.8 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHH Q ss_conf 99999999998619989999627763001210147741688505572013445574420203564047621467888742 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSK 149 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~ 149 (440) ..+..+-+.+.+.+.++++.++....... . ..++ .+...+++.+|+.-++..+..+..+++ T Consensus 16 ~l~~gi~~~~~~~gy~~ll~~~~~~~~~e-~-----------------~~i~-~l~~~~vdgiIi~~~~~~~~~~~~l~~ 76 (270) T cd06296 16 EVLRGVEEAAAAAGYDVVLSESGRRTSPE-R-----------------QWVE-RLSARRTDGVILVTPELTSAQRAALRR 76 (270) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCHHHH-H-----------------HHHH-HHHHCCCCEEEEECCCCCHHHHHHHHH T ss_conf 99999999999849989999799996999-9-----------------9999-999549999999068899999999997 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHHHHHHHCCCCEEEEC Q ss_conf 2455058641266542121001111233102644236641----57788899860885058720 Q gi|254780402|r 150 QRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFRRYKELGAQKLIVSG 209 (440) Q Consensus 150 ~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~rl~~lG~~~i~v~G 209 (440) .++|+++++....+.. .+..|..- .....++|.+.|-++|...+ T Consensus 77 ~~iPvV~id~~~~~~~----------------~~~~V~~Dn~~a~~~a~~~L~~~G~~~I~~i~ 124 (270) T cd06296 77 TGIPFVVVDPAGDPDA----------------DVPSVGATNWAGGLAATEHLLELGHRRIGFIT 124 (270) T ss_pred CCCCEEEECCCCCCCC----------------CCCEEEECHHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 4999999878888888----------------98689957699999999999985897288845 No 134 >pfam02571 CbiJ Precorrin-6x reductase CbiJ/CobK. This family consists of Precorrin-6x reductase EC:1.3.1.54. This enzyme catalyses the reaction: precorrin-6Y + NADP(+) <= precorrin-6X + NADPH. CbiJ and CobK both catalyse the reduction of macocycle in the colbalmin biosynthesis pathway. Probab=73.20 E-value=7 Score=18.92 Aligned_cols=129 Identities=13% Similarity=0.108 Sum_probs=77.8 Q ss_pred EEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEE-- Q ss_conf 799981789999999999999986199899996277630012101477416885055720134455744202035640-- Q gi|254780402|r 58 LIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILS-- 135 (440) Q Consensus 58 ~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~-- 135 (440) .||+=+-. .=++.+++.|.+.+. +|++|..|.+...... ..+.+..=++........|++..+.+.+|=. T Consensus 2 ~IlilgGT----~e~r~la~~L~~~g~---~~v~t~~~~~~~~~~~-~~~~~~~G~l~~~~~m~~~i~~~~i~~vIDATH 73 (246) T pfam02571 2 RILILGGT----TEARALAAALAAAGV---VSVVTSLAGRTAAPRL-PPLPVRVGGFGGADGLAAYLREEGIDAVIDATH 73 (246) T ss_pred EEEEEEEC----HHHHHHHHHHHHCCC---EEEEECCCHHHCCCCC-CCCCEEECCCCCHHHHHHHHHHCCCCEEEECCC T ss_conf 69999736----899999999985698---7999847554437667-885089799899999999999779979998999 Q ss_pred --CCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHH-HHHCCCCEEE-EC Q ss_conf --47621467888742245505864126654212100111123310264423664157788899-8608850587-20 Q gi|254780402|r 136 --ESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRY-KELGAQKLIV-SG 209 (440) Q Consensus 136 --e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl-~~lG~~~i~v-~G 209 (440) -.++==|....|++.|||.+-..= +........ ..+.+.|-+++..+ .+++-++|.. +| T Consensus 74 PfA~~is~na~~a~~~~~ipyiR~eR-------------p~~~~~~~d--~~~~v~s~~ea~~~l~~~~~~~VllttG 136 (246) T pfam02571 74 PFAAQISRNAAAACKELGVPLLRLER-------------PAWQPGPGD--RWIYVDSLAEAAAALAELPGKRVFLTTG 136 (246) T ss_pred CCHHHHHHHHHHHHHHHCCCEEEECC-------------CCCCCCCCC--CEEEECCHHHHHHHHHHCCCCEEEEEEC T ss_conf 96899999999999985996899606-------------212678898--7799799999999876467865898405 No 135 >TIGR00649 MG423 conserved hypothetical protein; InterPro: IPR004613 This is a family of conserved hypothetical proteins, the members of which contain an ATP-binding domain at the N-terminal end of the protein. It is possibly part of a superfamily of beta-lactmases.. Probab=72.73 E-value=7 Score=18.95 Aligned_cols=182 Identities=18% Similarity=0.185 Sum_probs=74.8 Q ss_pred HHHHHHHHCCCCEEEEEECCCC--CCCCCC--HH----HHHHHHHCCCCCC-EEEECCHHHHHHHHHH---CC---CCEE Q ss_conf 6788874224550586412665--421210--01----1112331026442-3664157788899860---88---5058 Q gi|254780402|r 142 LTVFELSKQRIPQVLVNARMSR--RSFKNW--KT----VLSFSKKIFSQFS-LVIVQSERYFRRYKEL---GA---QKLI 206 (440) Q Consensus 142 nli~~~~~~~ip~~linarls~--~S~~~~--~~----~~~l~~~~~~~~~-~i~~qs~~~~~rl~~l---G~---~~i~ 206 (440) +-+.+.-+.|+-++|.+.-.++ ++-.-- .. +..+++..-..-+ .|+|--.....|..++ +. .++. T Consensus 191 ~~~~~~G~~GVL~LlsdsT~~~er~~g~TpSE~~~~~~l~~~~~~a~gr~~G~i~ttFaSni~Rv~~~~~~A~k~~R~~~ 270 (593) T TIGR00649 191 NRIAELGKKGVLLLLSDSTNVEERNEGKTPSEAVVSELLDDIFKNAQGRKDGVIVTTFASNIHRVQQIIQIARKNGRKVA 270 (593) T ss_pred HHHHHHHCCCCEEEEECCEECCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCCEEEEECHHHHHHHHHHHHHHHHCCCEEE T ss_conf 99999732884899950202267578868718899899999862114787735998601148899999999985698299 Q ss_pred EEC-CC-CC-------CCCCCCC------CH-H---HHHHHHHHCCCCCEEECCCCCCCCCCCCCC-----CCCCCCCC- Q ss_conf 720-43-10-------1234422------12-5---689999861798617504655332100000-----00000123- Q gi|254780402|r 207 VSG-NL-KI-------DTESLPC------DK-E---LLSLYQESIAGRYTWAAISTFEGEEDKAVY-----VHNFIKCR- 261 (440) Q Consensus 207 v~G-nl-K~-------d~~~~~~------~~-~---~~~~~~~~~~~r~v~vagSth~~EE~~il~-----a~~~l~~~- 261 (440) |.| ++ ++ .....+. ++ + ..+...+ .+.+.+++..--|+||.-.+|. .|++++=. T Consensus 271 v~GrSm~~~~~~A~~lG~i~~p~~~riygsP~~~~i~~~e~~~-~P~~~~lii~TGsQGEp~A~L~RiAn~~h~~~~i~~ 349 (593) T TIGR00649 271 VIGRSMEKVLGIARKLGYIKLPEDARIYGSPKNLFISLKEINK-YPKENYLIITTGSQGEPLAALTRIANGEHEQIRIRK 349 (593) T ss_pred EECCCHHHHHHHHHHCCCEECCCCCEEECCHHHHHHHHHHHHH-CCCCCEEEEEECCCCCHHHHHHHCCCCCCCCEEECC T ss_conf 9800178999999864812278755031380556799999971-888637999855989412555010078876310048 Q ss_pred CCEEEEEEECCCCHH----HHHHH-HHHHHCCHHHHHCCCC------CCCC--------CCEEEEECCCCCCHHHHHHCC Q ss_conf 412488970552114----78887-7653001011001344------4446--------641675067652025542208 Q gi|254780402|r 262 TDVLTIIVPRHPRRC----DAIER-RLIAKGLKVARRSRGD------VINA--------EVDIFLGDTIGEMGFYLRMTE 322 (440) Q Consensus 262 ~~~~lIIvPRhpeR~----~~i~~-~l~~~gl~~~~~S~~~------~~~~--------~~~V~i~Dt~GeL~~lY~~Ad 322 (440) .|...+=+|.=|.-. ..+.+ .+...|-........+ ..++ -..=|++-+-||...+-.++. T Consensus 350 ~DtVVfSa~~IPG~~n~~~~~~~~~~L~~~ga~v~~~~~~~~HvSGHAs~Ed~k~llrllkPkyiiP~hGeyrm~~~h~~ 429 (593) T TIGR00649 350 GDTVVFSAPPIPGNENIAVSNTLDVRLARAGARVIKGELDDIHVSGHASQEDHKLLLRLLKPKYIIPVHGEYRMLAAHTK 429 (593) T ss_pred CCEEEEECCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHCCCEEECCCCCHHHHHHHHH T ss_conf 73489856899873779999999999998777874111187167132673469999986298555117865678999999 Q ss_pred CE Q ss_conf 20 Q gi|254780402|r 323 IA 324 (440) Q Consensus 323 ia 324 (440) +| T Consensus 430 ~A 431 (593) T TIGR00649 430 LA 431 (593) T ss_pred HH T ss_conf 99 No 136 >TIGR02469 CbiT precorrin-6Y C5,15-methyltransferase (decarboxylating), CbiT subunit; InterPro: IPR014008 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CbiT subunit of precorrin-6Y C5,15-methyltransferase (2.1.1.132 from EC) from the anaerobic pathway, a bifunctional enzyme that catalyses two methylations (at C-5 and C-15) in precorrin-6Y, as well as the decarboxylation of the acetate side chain located in ring C, in order to generate precorrin-8X. In the anaerobic pathway, two enzymes are required to produce precorrin-8X: CbiE and CbiT, which can be fused as CbiET (sometimes called CobL) . In the aerobic pathway, the bifunctional enzyme is CobL .; GO: 0008276 protein methyltransferase activity, 0009236 cobalamin biosynthetic process. Probab=72.58 E-value=4.5 Score=20.17 Aligned_cols=83 Identities=20% Similarity=0.123 Sum_probs=49.5 Q ss_pred CCCCCC-EEEEEEECCCCHHHHHHHHHHHH-CCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCC Q ss_conf 012341-24889705521147888776530-0101100134444466416750676520255422082066135544454 Q gi|254780402|r 258 IKCRTD-VLTIIVPRHPRRCDAIERRLIAK-GLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGG 335 (440) Q Consensus 258 l~~~~~-~~lIIvPRhpeR~~~i~~~l~~~-gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GG 335 (440) .+..|+ ..+.=+=|.+++.+-+.+.+++. +++-..--.++.|..-.+...- +-.....-.|.+|||||= .-- T Consensus 39 ~~~~p~~g~v~aiEr~~~~~~~~~~N~~~~c~~~~~~i~~g~ap~~~~~~~~~----~~~~~~~~~Da~fvGGs~--~~~ 112 (135) T TIGR02469 39 ARLVPNSGRVYAIERNPEALRLIERNLRRFCGVSNIVIVEGDAPEELLNSDAP----EDSAKLPEPDAVFVGGSG--GKL 112 (135) T ss_pred HHHCCCCCEEEEEECCHHHHHHHHHHHHHHCCCCCCEEEECCCCCCCCCCCCC----HHHCCCCCCCEEEECCCC--HHH T ss_conf 97359860799985376898799999998289996325635568433367777----100588746888883897--178 Q ss_pred CCHHHHHHHCC Q ss_conf 58376896199 Q gi|254780402|r 336 QNPLEAAMLGC 346 (440) Q Consensus 336 hN~lEpaa~G~ 346 (440) +.+||++.-+. T Consensus 113 ~~il~~~~~~l 123 (135) T TIGR02469 113 EEILEAVERRL 123 (135) T ss_pred HHHHHHHHHCC T ss_conf 99999998505 No 137 >PRK03379 vitamin B12-transporter protein BtuF; Provisional Probab=72.52 E-value=7.3 Score=18.81 Aligned_cols=84 Identities=17% Similarity=0.150 Sum_probs=47.8 Q ss_pred EEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCC-HHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECC Q ss_conf 99981789999999999999986199899996277630-01210147741688505572013445574420203564047 Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSA-KVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSES 137 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~-~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ 137 (440) |=-=+-|+=|..-+..+ +. |-+|.++. | +.+++ .+ .+- .|...| +.+. -..+||+++..++ T Consensus 25 IVSL~Ps~TEil~alG~-------~~-Vgv~~~~d--yP~~a~~-lp-~VG-~~~~~~----~E~I-laL~PDLVia~~~ 86 (265) T PRK03379 25 VITLSPANTELAFAAGI-------TP-VGVSSYSD--YPPQAQK-IE-QVS-TWQGMN----LERI-VALKPDLVLAWRG 86 (265) T ss_pred EEECCCHHHHHHHHCCC-------CC-EEEECCCC--CCHHHHC-CC-CCC-CCCCCC----HHHH-HHHCCCEEEEECC T ss_conf 79818469999998699-------98-88668899--9867854-87-027-888989----9999-9639998999568 Q ss_pred CCHHHHHHHHHHCCCCEEEEEEC Q ss_conf 62146788874224550586412 Q gi|254780402|r 138 DIWPLTVFELSKQRIPQVLVNAR 160 (440) Q Consensus 138 ElWPnli~~~~~~~ip~~linar 160 (440) .-=+..+.++++.|||++.++.+ T Consensus 87 ~~~~~~~~~L~~~GI~v~~~~~~ 109 (265) T PRK03379 87 GNAERQVDQLASLGIKVMWVDAT 109 (265) T ss_pred CCCHHHHHHHHHCCCEEEECCCC T ss_conf 89589999998169758835999 No 138 >TIGR02195 heptsyl_trn_II lipopolysaccharide heptosyltransferase II; InterPro: IPR011910 This family consists of examples of ADP-heptose:LPS heptosyltransferase II, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria.; GO: 0016757 transferase activity transferring glycosyl groups, 0009103 lipopolysaccharide biosynthetic process. Probab=71.56 E-value=4.1 Score=20.46 Aligned_cols=256 Identities=16% Similarity=0.168 Sum_probs=134.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCC-------CHHHHHHHHHHCCEEEEEECCC Q ss_conf 999999999999998619989999627763001210147741688505572-------0134455744202035640476 Q gi|254780402|r 66 VGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDI-------QPAVSRFLKYWKPDCMILSESD 138 (440) Q Consensus 66 vGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~-------~~~~~~fl~~~~P~~~i~~e~E 138 (440) ||.+..+.+|-+.|+++||++.|=..+|.=.+=.-+.+|+.-.....||-+ -+-..+-|+.-+=|.+++. T Consensus 10 VGDmvMaQ~Lf~~Lk~~yP~~~IDV~APaW~~PlL~RMPEi~~~~~~PlgHGaL~l~~R~rlg~~LR~~~YD~AiVL--- 86 (361) T TIGR02195 10 VGDMVMAQSLFRLLKKRYPDAVIDVLAPAWCKPLLERMPEIRQAIDMPLGHGALELTERYRLGKSLREERYDQAIVL--- 86 (361) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHCCCCHHHHHHCCCCCCCCCHHHHHHHHHHHHHHCCCCEEEEC--- T ss_conf 65556768999999985898389720753301333127015787458878762126767899999975589889986--- Q ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCC---CCC-EEEECCHHHHHHHHHHCCCC-EEEEC-CCC Q ss_conf 2146788874224550586412665421210011112331026---442-36641577888998608850-58720-431 Q gi|254780402|r 139 IWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFS---QFS-LVIVQSERYFRRYKELGAQK-LIVSG-NLK 212 (440) Q Consensus 139 lWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~---~~~-~i~~qs~~~~~rl~~lG~~~-i~v~G-nlK 212 (440) ||=++.|= ||- =|++-.|+ .|.+-.--.+++ .+| ..=..-....+||..|+-+| +.-.| +|- T Consensus 87 --PNSlKSAL---iPf---FA~IP~Rt----Gw~GEmRYGLLND~r~LdGkak~~~P~m~~rY~ALAy~K~~i~~~~~LP 154 (361) T TIGR02195 87 --PNSLKSAL---IPF---FAGIPLRT----GWRGEMRYGLLNDIRALDGKAKERLPLMIERYIALAYDKDVIQKGADLP 154 (361) T ss_pred --CCCHHHHH---HHH---HCCCCCCC----CCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf --77216676---367---61886123----5563235666644220677111433579999998864034467765576 Q ss_pred --CC---CCCCCCCHHH----HHHHH---HHCC-CCCEEEC--CC------CCCCCCCCCCCCCCCCCCCCCEEEEEEEC Q ss_conf --01---2344221256----89999---8617-9861750--46------55332100000000001234124889705 Q gi|254780402|r 213 --ID---TESLPCDKEL----LSLYQ---ESIA-GRYTWAA--IS------TFEGEEDKAVYVHNFIKCRTDVLTIIVPR 271 (440) Q Consensus 213 --~d---~~~~~~~~~~----~~~~~---~~~~-~r~v~va--gS------th~~EE~~il~a~~~l~~~~~~~lIIvPR 271 (440) |. .+.+..+.+. ..++. ..-. .||++.+ |+ -||.|-=.-+ |.+.+.+. ...++|..- T Consensus 155 ~sf~plp~P~L~~~~~~~~~~~~kF~k~taL~~P~rP~ialCPGAEfGpAKRWP~~HyA~L-A~~~~~~G-gy~V~lFGS 232 (361) T TIGR02195 155 ESFQPLPRPQLQIDPAEQAAALAKFGKQTALDTPERPIIALCPGAEFGPAKRWPAEHYAEL-AKKLIAQG-GYQVVLFGS 232 (361) T ss_pred CCCCCCCCCCCEECHHHHHHHHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCHHHHHHH-HHHHHHCC-CCEEEEECC T ss_conf 1011377745254788999999985122126677887687587766775567853899999-99998706-918998428 Q ss_pred CCCH--HHH-HHHHHHHHCCHHHHHCC---CCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCC-CCHHHHHHH Q ss_conf 5211--478-88776530010110013---4444466416750676520255422082066135544454-583768961 Q gi|254780402|r 272 HPRR--CDA-IERRLIAKGLKVARRSR---GDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGG-QNPLEAAML 344 (440) Q Consensus 272 hpeR--~~~-i~~~l~~~gl~~~~~S~---~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GG-hN~lEpaa~ 344 (440) .-++ .++ |.+.....- ..+++ |++. .-|=-.+.+.|.. +|-. +.|- |- |||+ T Consensus 233 ~kD~~~~~~fI~~~~~~~~---~~~~~NLaG~T~-----------L~EAvdLia~a~a-vV~N---DSGLMHV---AAAL 291 (361) T TIGR02195 233 KKDKPVGDEFIEALAPGEL---REYCDNLAGETS-----------LDEAVDLIALAKA-VVSN---DSGLMHV---AAAL 291 (361) T ss_pred CCCHHHHHHHHHHHHHHHH---HHHHHHCCCCCC-----------HHHHHHHHHHCCE-EEEC---CHHHHHH---HHHC T ss_conf 4557899999998634468---899741047888-----------8899999871560-1215---6067999---9962 Q ss_pred CCCEE--ECCCCCCHHH Q ss_conf 99499--9898443799 Q gi|254780402|r 345 GCAIL--SGPNVENFRD 359 (440) Q Consensus 345 G~pVi--~GP~~~nf~e 359 (440) +.|+| +||---.|.. T Consensus 292 ~rPlVAlYGsTsP~fTP 308 (361) T TIGR02195 292 NRPLVALYGSTSPDFTP 308 (361) T ss_pred CCCEEEEECCCCCCCCC T ss_conf 79889973685688888 No 139 >cd01536 PBP1_ABC_sugar_binding_like Periplasmic sugar-binding domain of active transport systems that are members of the type I periplasmic binding protein (PBP1) superfamily. Periplasmic sugar-binding domain of active transport systems that are members of the type I periplasmic binding protein (PBP1) superfamily. The members of this family function as the primary receptors for chemotaxis and transport of many sugar based solutes in bacteria and archaea. The sugar binding domain is also homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Moreover, this periplasmic binding domain, also known as Venus flytrap domain, undergoes transition from an open to a closed conformational state upon the binding of ligands such as lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. This family also includes the periplasmic binding domain of autoinducer-2 (AI-2 Probab=70.85 E-value=7.9 Score=18.57 Aligned_cols=44 Identities=16% Similarity=0.231 Sum_probs=29.1 Q ss_pred HHHHHHHHHCCEEEEEECC--CCHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 3445574420203564047--62146788874224550586412665 Q gi|254780402|r 119 AVSRFLKYWKPDCMILSES--DIWPLTVFELSKQRIPQVLVNARMSR 163 (440) Q Consensus 119 ~~~~fl~~~~P~~~i~~e~--ElWPnli~~~~~~~ip~~linarls~ 163 (440) .++.++.. +++.+|+.-. +-....+..+...|||++++|..+.. T Consensus 47 ~i~~~~~~-~vdgiI~~~~~~~~~~~~~~~~~~~~iPvV~~d~~~~~ 92 (267) T cd01536 47 QIEDLIAQ-GVDGIIISPVDSAALTPALKKANAAGIPVVTVDSDIDG 92 (267) T ss_pred HHHHHHHC-CCCEEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 99999975-99999993158415589999999879969998256678 No 140 >COG1834 N-Dimethylarginine dimethylaminohydrolase [Amino acid transport and metabolism] Probab=70.46 E-value=5.9 Score=19.40 Aligned_cols=41 Identities=22% Similarity=0.330 Sum_probs=16.5 Q ss_pred EEECCCCHHH---HHHHHHHHHCCHHHHHCCCCCCCCCCEEEEE Q ss_conf 9705521147---8887765300101100134444466416750 Q gi|254780402|r 268 IVPRHPRRCD---AIERRLIAKGLKVARRSRGDVINAEVDIFLG 308 (440) Q Consensus 268 IvPRhpeR~~---~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~ 308 (440) -.++||.|.. .+++.+++.|++..-+-+........|+++. T Consensus 83 ~r~~~p~R~gE~~~~~~~~~~lgi~i~~~~~~~~~eG~GD~l~~ 126 (267) T COG1834 83 ARMGAPERRGEEEAIKETLESLGIPIYPRVEAGVFEGAGDVLMD 126 (267) T ss_pred ECCCCHHHCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCCCEEEE T ss_conf 54687454368779999999739952034568773256648996 No 141 >TIGR01496 DHPS dihydropteroate synthase; InterPro: IPR006390 This domain is present in sequences representing dihydropteroate synthase, the enzyme that catalyzes the second to last step in folic acid biosynthesis. Dihydropteroate synthase (2.5.1.15 from EC) (DHPS) catalyses the condensation of 6-hydroxymethyl-7,8-dihydropteridine pyrophosphate to para-aminobenzoic acid to form 7,8-dihydropteroate. This is the second step in the three-step pathway leading from 6-hydroxymethyl-7,8-dihydropterin to 7,8-dihydrofolate. DHPS is the target of sulphonamides, which are substrate analogues that compete with para-aminobenzoic acid. Bacterial DHPS (gene sul or folP) is a protein of about 275 to 315 amino acid residues that is either chromosomally encoded or found on various antibiotic resistance plasmids. In the lower eukaryote Pneumocystis carinii, DHPS is the C-terminal domain of a multifunctional folate synthesis enzyme (gene fas) .; GO: 0004156 dihydropteroate synthase activity, 0009396 folic acid and derivative biosynthetic process. Probab=69.93 E-value=8.3 Score=18.44 Aligned_cols=146 Identities=18% Similarity=0.208 Sum_probs=81.1 Q ss_pred HHHHHHHHHHHHHHHCC--CC--EEEEECC--CCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCH Q ss_conf 99999999999998619--98--9999627--763001210147741688505572013445574420203564047621 Q gi|254780402|r 67 GETMALIGLIPAIRSRH--VN--VLLTTMT--ATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIW 140 (440) Q Consensus 67 GE~~~a~pli~~l~~~~--~~--iliT~~T--~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElW 140 (440) -|..=+.|+|++|++++ ++ +.||.=| +.=++.|-+.++-..+ -|.... ..= T Consensus 58 eE~~Rv~Pv~~~~~~~~~~~dQC~~iSvDT~~a~Va~~Al~~~Ga~ii-----NDv~G~------------------~~d 114 (268) T TIGR01496 58 EELNRVVPVIKALREQYLFSDQCVPISVDTYRAEVARAALEAFGADII-----NDVSGG------------------QFD 114 (268) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEEEECCCCHHHHHHHHHHCCCCEE-----EECCCC------------------CCC T ss_conf 999988999999997448999841477618828999999986798678-----604114------------------668 Q ss_pred HHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCC-CEEEECCCCCCCC-CC Q ss_conf 467888742245505864126654212100111123310264423664157788899860885-0587204310123-44 Q gi|254780402|r 141 PLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQ-KLIVSGNLKIDTE-SL 218 (440) Q Consensus 141 Pnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~-~i~v~GnlK~d~~-~~ 218 (440) |.++.-+++.++|+++-.-+=++++.+.-..+......+ ...=++.++.+++.|+. +|....=+-|.-. .. T Consensus 115 p~m~~vaae~~~~~vlMH~~g~P~~~q~~~~Y~dv~~e~-------~~fl~~~~~~~~~~Gv~~~I~LDPG~GF~K~~t~ 187 (268) T TIGR01496 115 PDMLEVAAEYGVPLVLMHMRGTPETMQENPRYEDVVEEV-------LRFLEARAEELLAAGVAERIILDPGIGFGKSDTV 187 (268) T ss_pred HHHHHHHHHHCCCEEEECCCCCCCCCCCCCCCCCHHHHH-------HHHHHHHHHHHHHCCCCCCEEECCCCCCCCCCCH T ss_conf 358999998489889876876388766677765668999-------9999999999997588665787177577888887 Q ss_pred CCCHHHHHHHHHHCC--CCCEEECCC Q ss_conf 221256899998617--986175046 Q gi|254780402|r 219 PCDKELLSLYQESIA--GRYTWAAIS 242 (440) Q Consensus 219 ~~~~~~~~~~~~~~~--~r~v~vagS 242 (440) ..+-+..+.+.++.. +-|++++.| T Consensus 188 ~~Nl~ll~~l~~f~~~lg~PlL~G~S 213 (268) T TIGR01496 188 EHNLELLKRLEEFKAVLGYPLLVGVS 213 (268) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEECC T ss_conf 67899998689999985770212011 No 142 >cd06267 PBP1_LacI_sugar_binding_like Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. Ligand binding domain of the LacI tanscriptional regulator family belonging to the type I periplasmic-binding fold protein superfamily. In most cases, ligands are monosaccharide including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor. In this case, the domain sugar binding changes the DNA binding activity of the repressor domain. Probab=68.95 E-value=8.7 Score=18.31 Aligned_cols=181 Identities=19% Similarity=0.151 Sum_probs=81.9 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHH Q ss_conf 99999999998619989999627763001210147741688505572013445574420203564047621467888742 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSK 149 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~ 149 (440) ..+..+.+.+.+.+.++++.++....... ...-+.+...+++.+|++..+.=...+..+.+ T Consensus 16 ~~~~gi~~~a~~~gy~~~i~~~~~~~~~~-------------------~~~i~~l~~~~vDgiIi~~~~~~~~~~~~~~~ 76 (264) T cd06267 16 ELLRGIEEAAREAGYSVLLCNSDEDPEKE-------------------REALELLLSRRVDGIILAPSRLDDELLEELAA 76 (264) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCHHHH-------------------HHHHHHHHHCCCCEEEECCCCCCHHHHHHHHH T ss_conf 99999999999869989999789998999-------------------99999999579999996788899899999998 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHHHHHHHCCCCEEEEC-CCCCCCCCCCCCHHH Q ss_conf 2455058641266542121001111233102644236641----57788899860885058720-431012344221256 Q gi|254780402|r 150 QRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFRRYKELGAQKLIVSG-NLKIDTESLPCDKEL 224 (440) Q Consensus 150 ~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~rl~~lG~~~i~v~G-nlK~d~~~~~~~~~~ 224 (440) .++|+++++....... ++.|..- .....++|.+.|.+++-+.+ +... ........-. T Consensus 77 ~~iPvV~~~~~~~~~~-----------------~~~V~~D~~~a~~~a~~~L~~~G~~~i~~i~~~~~~-~~~~~R~~gf 138 (264) T cd06267 77 LGIPVVLVDRPLDGLG-----------------VDSVGIDNRAGAYLAVEHLIELGHRRIAFIGGPPDL-STARERLEGY 138 (264) T ss_pred CCCCEEEEEECCCCCC-----------------CCEEEECHHHHHHHHHHHHHHHCCCCEEEECCCCCC-CHHHHHHHHH T ss_conf 6998899960479999-----------------879997879999999999987188856896389877-0699999999 Q ss_pred HHHHHHH-CCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCH Q ss_conf 8999986-1798617504655332100000000001234124889705521147888776530010 Q gi|254780402|r 225 LSLYQES-IAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLK 289 (440) Q Consensus 225 ~~~~~~~-~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~ 289 (440) ...+.+. ......++..+.+..|+ ..-.+.+.++.+++.--|++.-. .-+-.+.+.+++.|++ T Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~ai~~~~D-~~A~g~~~~l~~~g~~ 202 (264) T cd06267 139 REALEEAGIPLDEELIVEGDFSEES-GYEAARELLASGERPTAIFAAND-LMAIGALRALRELGLR 202 (264) T ss_pred HHHHHHCCCCCCHHEEECCCCCHHH-HHHHHHHHHHCCCCCCEEEECCH-HHHHHHHHHHHHCCCC T ss_conf 9999986999885307527897799-99999999965999989987887-9999999999982999 No 143 >PRK05299 rpsB 30S ribosomal protein S2; Provisional Probab=68.42 E-value=8.9 Score=18.24 Aligned_cols=13 Identities=31% Similarity=0.409 Sum_probs=7.2 Q ss_pred HHHHHHHCCCEEE Q ss_conf 3768961994999 Q gi|254780402|r 338 PLEAAMLGCAILS 350 (440) Q Consensus 338 ~lEpaa~G~pVi~ 350 (440) +-||..+|.|||- T Consensus 173 V~EA~kl~IPvI~ 185 (255) T PRK05299 173 VKEARKLGIPVVA 185 (255) T ss_pred HHHHHHCCCCEEE T ss_conf 9999975998887 No 144 >pfam01075 Glyco_transf_9 Glycosyltransferase family 9 (heptosyltransferase). Members of this family belong to glycosyltransferase family 9. Lipopolysaccharide is a major component of the outer leaflet of the outer membrane in Gram-negative bacteria. It is composed of three domains; lipid A, Core oligosaccharide and the O-antigen. All of these enzymes transfer heptose to the lipopolysaccharide core. Probab=66.28 E-value=9.8 Score=17.96 Aligned_cols=38 Identities=24% Similarity=0.353 Sum_probs=29.2 Q ss_pred CCCCCHHHHHHCCCEEEECCCCCCC-CCCHHHHHHHCCCEE--ECCCC Q ss_conf 7652025542208206613554445-458376896199499--98984 Q gi|254780402|r 310 TIGEMGFYLRMTEIAFIGRSFCASG-GQNPLEAAMLGCAIL--SGPNV 354 (440) Q Consensus 310 t~GeL~~lY~~AdiafVGGSl~~~G-GhN~lEpaa~G~pVi--~GP~~ 354 (440) ++.|+..+.+.|+ .|||. +.| .|= |+++|+|++ |||.. T Consensus 175 sL~el~ali~~a~-l~I~n---DSGp~Hi---AaA~g~Pti~ifGpT~ 215 (249) T pfam01075 175 SLEQAAALLAGAD-LVVGN---DSGLMHL---AAALDRPVIGLYGPTD 215 (249) T ss_pred CHHHHHHHHHHCC-EEEEC---CCHHHHH---HHHCCCCEEEEECCCC T ss_conf 9999999998506-68857---9859999---9983998899978898 No 145 >cd03793 GT1_Glycogen_synthase_GSY2_like Glycogen synthase, which is most closely related to the GT1 family of glycosyltransferases, catalyzes the transfer of a glucose molecule from UDP-glucose to a terminal branch of a glycogen molecule, a rate-limit step of glycogen biosynthesis. GSY2, the member of this family in S. cerevisiae, has been shown to possess glycogen synthase activity. Probab=65.77 E-value=10 Score=17.90 Aligned_cols=94 Identities=15% Similarity=0.196 Sum_probs=50.2 Q ss_pred ECCCCCCHH----HHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHC---CCEEEEC----CH Q ss_conf 067652025----542208206613554445458376896199499989844379999999978---9869957----98 Q gi|254780402|r 308 GDTIGEMGF----YLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSS---GAVRIVE----EV 376 (440) Q Consensus 308 ~Dt~GeL~~----lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~---g~~~~v~----d~ 376 (440) ..+=|.++. +-+-+|+ =|-.|.-+--|=+|+|++++|+|.|+ .+...|-.-.++.++. -|+.+|+ +. T Consensus 458 s~~dgl~~m~Y~efv~Gc~L-gVFPSyYEPWGYTPlE~~a~gvPsIT-TdLsGFG~~~~~~~~~~~~~GV~VvdR~~~~~ 535 (590) T cd03793 458 SSTNPLLGLDYEEFVRGCHL-GVFPSYYEPWGYTPAECTVMGIPSIT-TNLSGFGCFMEEHIEDPESYGIYIVDRRFKSP 535 (590) T ss_pred CCCCCCCCCCHHHHHCCCCC-CCCCCCCCCCCCCHHHHHHCCCCEEE-CCCHHHHHHHHHHHCCCCCCCEEEEECCCCCH T ss_conf 68998778769999645765-55765348878886988762798022-16603779999864164237479997898997 Q ss_pred HHHHHHHHHHHC-----CHHHHHHHHHHHHHH Q ss_conf 999999999958-----989999999999999 Q gi|254780402|r 377 GTLADMVYSLLS-----EPTIRYEMINAAINE 403 (440) Q Consensus 377 ~eL~~~l~~ll~-----d~~~~~~~~~~a~~~ 403 (440) +|-++++...+- ++.+|..+.+++.+. T Consensus 536 ~Esv~~l~~~~~~f~~~s~~qr~~~R~rae~l 567 (590) T cd03793 536 DESVQQLTQYMYEFCQLSRRQRIIQRNRTERL 567 (590) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 99999999999999749999999999999999 No 146 >pfam10649 DUF2478 Protein of unknown function (DUF2478). This is a family of hypothetical bacterial proteins found in the vicinity of Molybdenum ABC transporter ATP-binding gene-products MobA MobB and MobC. However the function could not be confirmed. This family appears to belong to the P-loop superfamily by alignment to pfam03266. However, the characteristic P-loop sequence motif appears to have diverged beyond recognition in this family. Probab=65.65 E-value=5.2 Score=19.75 Aligned_cols=66 Identities=21% Similarity=0.288 Sum_probs=41.9 Q ss_pred CCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHH Q ss_conf 46641675067652025542208206613554445458376896199499989844379999999978986995798999 Q gi|254780402|r 300 NAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTL 379 (440) Q Consensus 300 ~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL 379 (440) ....|++|++.||-.-. -|++|-+. +.||++.|.||+++-+-.|+.. ...+...-+....-|.+.+ T Consensus 91 ~~g~dLlIvNkFGk~Ea---------~G~Glr~~----i~~Al~~giPvLt~V~~~~~~~-W~~Fagg~a~~L~~d~~al 156 (159) T pfam10649 91 AAGADLLIVNKFGKQEA---------EGRGLRAE----IAEALAAGIPVLTAVPPRNLEA-WRAFAGGLAEELPPDREAL 156 (159) T ss_pred HCCCCEEEECCCHHHHH---------CCCCHHHH----HHHHHHCCCCEEEEECHHHHHH-HHHHCCCCCCCCCCCHHHH T ss_conf 42998999814246661---------49976999----9999976998899889888899-9985277676579998888 No 147 >COG0052 RpsB Ribosomal protein S2 [Translation, ribosomal structure and biogenesis] Probab=64.98 E-value=10 Score=17.81 Aligned_cols=34 Identities=26% Similarity=0.285 Sum_probs=16.2 Q ss_pred CCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHH Q ss_conf 0012341248897055211478887765300101100 Q gi|254780402|r 257 FIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARR 293 (440) Q Consensus 257 ~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~ 293 (440) -++..|+++.|+=|++-. .-...+.+.|+++.-. T Consensus 152 ~m~~~Pd~l~ViDp~~e~---iAv~EA~klgIPVvAl 185 (252) T COG0052 152 DMKGLPDVLFVIDPRKEK---IAVKEANKLGIPVVAL 185 (252) T ss_pred HCCCCCCEEEEECCCHHH---HHHHHHHHCCCCEEEE T ss_conf 126799989996881768---9999999759998998 No 148 >KOG0564 consensus Probab=63.80 E-value=7.4 Score=18.76 Aligned_cols=122 Identities=13% Similarity=0.192 Sum_probs=76.2 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHHCCCC-EEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHH---HHHHCC Q ss_conf 88877999817899999999999999861998-99996277630012101477416885055720134455---744202 Q gi|254780402|r 54 PIGPLIWFHASSVGETMALIGLIPAIRSRHVN-VLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRF---LKYWKP 129 (440) Q Consensus 54 ~~~~~IW~HaaSvGE~~~a~pli~~l~~~~~~-iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~f---l~~~~P 129 (440) |.|..=|.+.. |++.-|.+|++.+++++-+ +-|++ .||-......+... .--|++..-+|. -+.+-. T Consensus 117 P~g~d~~~~~e--~gF~yA~DLVr~Irs~YGDyF~IgV---AgYPEghpe~~~~~----~~~Dl~yLk~KvdaGaDFIiT 187 (590) T KOG0564 117 PIGQDKWVEEE--GGFRYAVDLVRYIRSKYGDYFCIGV---AGYPEGHPEAPSHD----YLADLPYLKEKVDAGADFIIT 187 (590) T ss_pred CCCCCCCCCCC--CCCHHHHHHHHHHHHHHCCEEEEEE---CCCCCCCCCCCCCC----HHHHHHHHHHHHCCCCHHHHH T ss_conf 99865320035--7712579999999998288478875---03787776776544----122168999852154214355 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHH-----HCCCCCCEE Q ss_conf 035640476214678887422455058641266542121001111233-----102644236 Q gi|254780402|r 130 DCMILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSK-----KIFSQFSLV 186 (440) Q Consensus 130 ~~~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~-----~~~~~~~~i 186 (440) + +|.|.|..-++...|+..||-+=++-|-|.=.|++.+.+.-.+.+ .+...++-| T Consensus 188 Q--lFYd~e~flkfv~~cR~~gi~~PIvPGIMPI~~Y~sf~R~~kls~~~IP~~~~~~L~pi 247 (590) T KOG0564 188 Q--LFYDVETFLKFVKDCRAAGINVPIVPGIMPIQSYRSFLRIAKLSGVSIPQHLMDRLEPI 247 (590) T ss_pred H--HHCCHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHCCCC T ss_conf 4--64079999999999998378888555543102089999999872787879999733667 No 149 >KOG2619 consensus Probab=63.37 E-value=8.8 Score=18.28 Aligned_cols=48 Identities=19% Similarity=0.320 Sum_probs=34.9 Q ss_pred CCCEEECCCCCCHHHHHHHHHHCCCEEEE---CCHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 99499989844379999999978986995---7989999999999589899999999 Q gi|254780402|r 345 GCAILSGPNVENFRDIYRRMVSSGAVRIV---EEVGTLADMVYSLLSEPTIRYEMIN 398 (440) Q Consensus 345 G~pVi~GP~~~nf~e~~~~L~~~g~~~~v---~d~~eL~~~l~~ll~d~~~~~~~~~ 398 (440) -+||++|| .|.+ ..+-.++.+-| .+++||+..++.|-+|+....+.-+ T Consensus 278 sVPVvlg~--~n~e----~fvP~~SfI~vdDF~s~~ela~ylk~L~~n~~~Y~~Yf~ 328 (372) T KOG2619 278 SVPVVLGP--PNYE----NFVPPDSFIHVDDFQSPQELAAYLKKLDKNPAAYLSYFE 328 (372) T ss_pred CCCEEECC--CCCC----CCCCCCCEEEHHHCCCHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 30179787--5201----468970137634359999999999986328899999999 No 150 >pfam05693 Glycogen_syn Glycogen synthase. This family consists of the eukaryotic glycogen synthase proteins GYS1, GYS2 and GYS3. Glycogen synthase (GS) is the enzyme responsible for the synthesis of -1,4-linked glucose chains in glycogen. It is the rate limiting enzyme in the synthesis of the polysaccharide, and its activity is highly regulated through phosphorylation at multiple sites and also by allosteric effectors, mainly glucose 6-phosphate (G6P). Probab=63.26 E-value=11 Score=17.60 Aligned_cols=127 Identities=16% Similarity=0.195 Sum_probs=66.3 Q ss_pred EEECCC--CHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEE-----EECCCCCCHHH----HHHCCCEEEECCCCCCCCC Q ss_conf 970552--114788877653001011001344444664167-----50676520255----4220820661355444545 Q gi|254780402|r 268 IVPRHP--RRCDAIERRLIAKGLKVARRSRGDVINAEVDIF-----LGDTIGEMGFY----LRMTEIAFIGRSFCASGGQ 336 (440) Q Consensus 268 IvPRhp--eR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~-----i~Dt~GeL~~l----Y~~AdiafVGGSl~~~GGh 336 (440) |+.|.. +-.|.|...++..+|.-. +.....|+ |..+=|.++.= -+-+|+ =|-.|.-+--|= T Consensus 413 i~THnl~dd~~DpIL~~~rr~~L~N~-------~~drVKVIF~P~fL~~~d~lf~l~Y~efv~Gc~L-gVFPSYYEPWGY 484 (633) T pfam05693 413 VCTHNMIDDANDPILNQIRRVGLFNS-------PSDRVKVIFHPEFLSSTSPLLGLDYEEFVRGCHL-GVFPSYYEPWGY 484 (633) T ss_pred EEEECCCCCCCCHHHHHHHHCCCCCC-------CCCCEEEEECCHHCCCCCCCCCCCHHHHHCCCCC-CCCCCCCCCCCC T ss_conf 13335776766889999987378688-------7773589962001268898778779998524664-446543589888 Q ss_pred CHHHHHHHCCCEEECCCCCCHHHHHHHHHHCC---CEEEEC----CHHHHHHHHHHHHC-----CHHHHHHHHHHHHHH Q ss_conf 83768961994999898443799999999789---869957----98999999999958-----989999999999999 Q gi|254780402|r 337 NPLEAAMLGCAILSGPNVENFRDIYRRMVSSG---AVRIVE----EVGTLADMVYSLLS-----EPTIRYEMINAAINE 403 (440) Q Consensus 337 N~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g---~~~~v~----d~~eL~~~l~~ll~-----d~~~~~~~~~~a~~~ 403 (440) .|+|.+++|+|.|+ .+...|--=.+.+++.. |+.+|+ +.+|-++++...+- ++.+|..+.+++.+. T Consensus 485 TPlE~~a~gVPsIT-TdLSGFG~~~~~~~~~~~~~GI~VvdR~~~n~~Esv~ql~~~m~~f~~~s~rqri~~RnrterL 562 (633) T pfam05693 485 TPAECTVMGIPSVT-TNLSGFGCFMEEHIEDPKDYGIYIVDRRFKSPDESVQQLAQFMYEFCQQSRRQRIIQRNRTERL 562 (633) T ss_pred CHHHHHHHCCCCEE-CCCHHHHHHHHHHHCCCCCCCEEEEECCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 85998651788011-3751578999987306434757999788898799999999999999749999999999999999 No 151 >cd06279 PBP1_LacI_like_3 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=63.06 E-value=11 Score=17.58 Aligned_cols=98 Identities=18% Similarity=0.210 Sum_probs=59.6 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHC Q ss_conf 99999999986199899996277630012101477416885055720134455744202035640476214678887422 Q gi|254780402|r 71 ALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQ 150 (440) Q Consensus 71 ~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~ 150 (440) .+..+-+.+.+.+.+++|.++.. ..+. .+ .+.....+-+|+..++.-...+..+++. T Consensus 22 i~~gie~~~~~~gy~l~l~~~~~-~~~~---------------------~~-~~~~~~vDGiIi~~~~~~~~~~~~l~~~ 78 (283) T cd06279 22 FLAGVAEVLDAAGVNLLLLPASS-EDSD---------------------SA-LVVSALVDGFIVYGVPRDDPLVAALLRR 78 (283) T ss_pred HHHHHHHHHHHCCCEEEEEECCC-CHHH---------------------HH-HHHHCCCCEEEEECCCCCHHHHHHHHHC T ss_conf 99999999998499899996999-9999---------------------99-9997799999991798998999999857 Q ss_pred CCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECC----HHHHHHHHHHCCCCEEEEC Q ss_conf 4550586412665421210011112331026442366415----7788899860885058720 Q gi|254780402|r 151 RIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQS----ERYFRRYKELGAQKLIVSG 209 (440) Q Consensus 151 ~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs----~~~~~rl~~lG~~~i~v~G 209 (440) ++|+++++....+ .++.|.+-+ ....++|.+.|-++|...| T Consensus 79 ~iPvV~idr~~~~------------------~~~~V~~Dn~~~~~~a~~~Li~~Gh~~Ia~i~ 123 (283) T cd06279 79 GLPVVVVDQPLPP------------------GVPSVGIDDRAAAREAARHLLDLGHRRIGILG 123 (283) T ss_pred CCCEEEECCCCCC------------------CCCEEEECCHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 9978998876889------------------99989967288799999999972998699995 No 152 >cd06283 PBP1_RegR_EndR_KdgR_like Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR. Ligand-binding domain of DNA transcription repressor RegR and other putative regulators such as KdgR and EndR, all of which are members of the LacI-GalR family of bacterial transcription regulators. RegR regulates bacterial competence and the expression of virulence factors, including hyaluronidase. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA b Probab=62.90 E-value=11 Score=17.56 Aligned_cols=105 Identities=15% Similarity=0.241 Sum_probs=60.0 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHH Q ss_conf 99999999999861998999962776300121014774168850557201344557442020356404762146788874 Q gi|254780402|r 69 TMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELS 148 (440) Q Consensus 69 ~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~ 148 (440) ...+..+-+.+.+.+.++++.+ |....+.-. ..+++| ...+.+.+|+.-++.=...+..+. T Consensus 15 ~~i~~gi~~~a~~~gy~l~i~~-s~~~~~~e~-----------------~~i~~l-~~~~vDgiIl~~~~~~~~~~~~~~ 75 (267) T cd06283 15 SLVLKGIEDVCRAHGYQVLVCN-SDNDPEKEK-----------------EYLESL-LAYQVDGLIVNPTGNNKELYQRLA 75 (267) T ss_pred HHHHHHHHHHHHHCCCEEEEEE-CCCCHHHHH-----------------HHHHHH-HHCCCCEEEECCCCCCHHHHHHHH T ss_conf 9999999999998699899997-899989999-----------------999999-966999999858779989999999 Q ss_pred HCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHHHHHHHCCCCEEEEC Q ss_conf 22455058641266542121001111233102644236641----57788899860885058720 Q gi|254780402|r 149 KQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFRRYKELGAQKLIVSG 209 (440) Q Consensus 149 ~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~rl~~lG~~~i~v~G 209 (440) +.+||+++++....+.. ++.|..- ...-.++|.+.|.+++.+.| T Consensus 76 ~~~iPvV~i~~~~~~~~-----------------~~~V~~D~~~~~~~~~~~L~~~G~~~i~~i~ 123 (267) T cd06283 76 KNGKPVVLVDRKIPELG-----------------VDTVTLDNYEAAKEAVDHLIEKGYERILFVT 123 (267) T ss_pred HCCCCEEEECCCCCCCC-----------------CCEEEECHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 76998999688578999-----------------9889977799999999999975997299995 No 153 >cd06289 PBP1_MalI_like Ligand-binding domain of MalI, a transcription regulator of the maltose system of Escherichia coli and its close homologs from other bacteria. This group includes the ligand-binding domain of MalI, a transcription regulator of the maltose system of Escherichia coli and its close homologs from other bacteria. They are members of the LacI-GalR family of repressor proteins which are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=62.37 E-value=12 Score=17.50 Aligned_cols=182 Identities=15% Similarity=0.095 Sum_probs=86.1 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEEC-CCCHHHHHHHHH Q ss_conf 9999999999861998999962776300121014774168850557201344557442020356404-762146788874 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSE-SDIWPLTVFELS 148 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e-~ElWPnli~~~~ 148 (440) ..+..+-+.+.+.+.+++|.++.. ..+.-++ .++ .+...+++.+|+.. ++-=+..+..++ T Consensus 16 ~~~~gi~~~a~~~gy~lll~~t~~-~~~~e~~-----------------~i~-~l~~~~vDGiIi~~~~~~~~~~~~~l~ 76 (268) T cd06289 16 ELAAGLEEVLEEAGYTVFLANSGE-DVERQEQ-----------------LLS-TMLEHGVAGIILCPAAGTSPDLLKRLA 76 (268) T ss_pred HHHHHHHHHHHHCCCEEEEEECCC-CHHHHHH-----------------HHH-HHHHCCCCEEEEECCCCCCHHHHHHHH T ss_conf 999999999998699899995899-9899999-----------------999-999659998999468889999999999 Q ss_pred HCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECC----HHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHH Q ss_conf 224550586412665421210011112331026442366415----7788899860885058720431012344221256 Q gi|254780402|r 149 KQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQS----ERYFRRYKELGAQKLIVSGNLKIDTESLPCDKEL 224 (440) Q Consensus 149 ~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs----~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~ 224 (440) +.++|+++++..+.... ++.|.+-+ ....++|.+.|-+++...|...-.........-. T Consensus 77 ~~~iPvV~i~~~~~~~~-----------------~~~V~~d~~~~~~~a~~~L~~~G~~~i~~i~~~~~~~~~~~R~~Gf 139 (268) T cd06289 77 ESGIPVVLVAREVAGAP-----------------FDYVGPDNAAGARLATEHLISLGHRRIAFIGGLEDSSTRRERLAGY 139 (268) T ss_pred HCCCCEEEECCCCCCCC-----------------CCEEEECHHHHHHHHHHHHHHHCCCEEEEECCCCCCCHHHHHHHHH T ss_conf 75998998367679999-----------------8789977799999999999973987099952898870699999999 Q ss_pred HHHHHH-HCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCH Q ss_conf 899998-61798617504655332100000000001234124889705521147888776530010 Q gi|254780402|r 225 LSLYQE-SIAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLK 289 (440) Q Consensus 225 ~~~~~~-~~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~ 289 (440) ...+.. ....+..|+.-+.+..| ...-.+.+.+.++|+.-.|++... .-+-.+.+-+.+.|++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~l~~~~~~~aii~~~D-~~A~g~l~~l~~~gi~ 203 (268) T cd06289 140 RAALAEAGLPFDSELVVEGPPSRQ-GGAEAVAQLLDLPPRPTAIVCFND-LVAFGAMSGLRRAGLT 203 (268) T ss_pred HHHHHHCCCCCCCCEEECCCCCHH-HHHHHHHHHHHCCCCCCEECCCCH-HHHHHHHHHHHHCCCC T ss_conf 999997699866434531688655-699999999846999862035768-9999999999984999 No 154 >cd06294 PBP1_ycjW_transcription_regulator_like Ligand-binding domain of uncharacterized transcription regulator ycjW which is a member of the LacI-GalR family repressors. This group includes the ligand-binding domain of uncharacterized transcription regulator ycjW which is a member of the LacI-GalR family repressors that are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=60.78 E-value=12 Score=17.32 Aligned_cols=105 Identities=16% Similarity=0.178 Sum_probs=62.2 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHC Q ss_conf 99999999986199899996277630012101477416885055720134455744202035640476214678887422 Q gi|254780402|r 71 ALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQ 150 (440) Q Consensus 71 ~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~ 150 (440) .+..+-+.+.+.+.++++.++.. ..+. ...+.+++...+.+-+|++.++.-...+..+++. T Consensus 22 l~~gie~~~~~~Gy~lli~~~~~-~~~~------------------~~~~~~~l~~~~vDGiIi~~~~~~~~~~~~l~~~ 82 (270) T cd06294 22 VLRGISAVANENGYDISLATGKN-EEEL------------------LEEVKKMIQQKRVDGFILLYSREDDPIIDYLKEE 82 (270) T ss_pred HHHHHHHHHHHCCCEEEEEECCC-CHHH------------------HHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHC T ss_conf 99999999998599899991899-6999------------------9999999982799889995798984999999976 Q ss_pred CCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHHHHHHHCCCCEEEECC Q ss_conf 455058641266542121001111233102644236641----577888998608850587204 Q gi|254780402|r 151 RIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFRRYKELGAQKLIVSGN 210 (440) Q Consensus 151 ~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~rl~~lG~~~i~v~Gn 210 (440) ++|+++++-..... ..+..|..- .....++|.+.|-++|-..|. T Consensus 83 ~iP~V~i~r~~~~~----------------~~~~~V~~Dn~~~~~~a~~~L~~~Ghr~I~~i~~ 130 (270) T cd06294 83 KFPFVVIGKPEDDK----------------ENITYVDNDNIQAGYDATEYLIKLGHKKIAFVGG 130 (270) T ss_pred CCCEEEECCCCCCC----------------CCCEEEECCHHHHHHHHHHHHHHCCCCEEEEECC T ss_conf 99999987756888----------------9984993777999999999999769956999818 No 155 >TIGR02764 spore_ybaN_pdaB polysaccharide deacetylase family sporulation protein PdaB; InterPro: IPR014132 This entry describes the YbaN protein family, also called PdaB and SpoVIE, of Gram-positive bacteria. Although ybaN null mutants have only a mild sporulation defect, ybaN/ytrI double mutants show drastically reduced sporulation efficiencies. This synthetic defect suggests the role of this sigmaE-controlled gene in sporulation had been masked by functional redundancy. Members of this family are homologous to a characterised polysaccharide deacetylase; the exact function this protein family is unknown.. Probab=60.45 E-value=10 Score=17.89 Aligned_cols=50 Identities=20% Similarity=0.181 Sum_probs=21.8 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEE-EECC----CCHHHHHHHHHHHHCCHHHHHCC Q ss_conf 332100000000001234124889-7055----21147888776530010110013 Q gi|254780402|r 245 EGEEDKAVYVHNFIKCRTDVLTII-VPRH----PRRCDAIERRLIAKGLKVARRSR 295 (440) Q Consensus 245 ~~EE~~il~a~~~l~~~~~~~lII-vPRh----peR~~~i~~~l~~~gl~~~~~S~ 295 (440) ||+|.++-.+.+..+. -+..|.= |--- ++-.+.|.+.|++.|..+..-|+ T Consensus 142 PG~e~iv~~V~~~~~~-GdIiL~HDASd~~KqT~~ALp~Ii~~LK~~GY~fv~ise 196 (198) T TIGR02764 142 PGVESIVDRVVKNTKP-GDIILLHDASDSAKQTVKALPEIIKKLKEKGYEFVTISE 196 (198) T ss_pred CCEEHHHHHHHCCCCC-CCEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEEEHHC T ss_conf 8610001113312799-856987634879744177899998998754953434220 No 156 >COG0297 GlgA Glycogen synthase [Carbohydrate transport and metabolism] Probab=59.69 E-value=13 Score=17.19 Aligned_cols=213 Identities=18% Similarity=0.123 Sum_probs=104.3 Q ss_pred HHHHHCCCCCCEEEECCHHHHHHHH--HHC-----C---CCEEEE----------CCCCCCCCC--------CCCCHHHH Q ss_conf 1233102644236641577888998--608-----8---505872----------043101234--------42212568 Q gi|254780402|r 174 SFSKKIFSQFSLVIVQSERYFRRYK--ELG-----A---QKLIVS----------GNLKIDTES--------LPCDKELL 225 (440) Q Consensus 174 ~l~~~~~~~~~~i~~qs~~~~~rl~--~lG-----~---~~i~v~----------GnlK~d~~~--------~~~~~~~~ 225 (440) .+.+.-+...|.|-+.|...++-+. ..| . ..-.+. -|.+.|... .+...... T Consensus 200 ~~lK~gi~~ad~vttVSptYa~Ei~t~~~g~gl~g~l~~~~~~l~GI~NgiD~~~wnp~~d~~~~~~y~~~~~~~k~~nk 279 (487) T COG0297 200 SFLKGGLYYADAVTTVSPTYAGEIYTPEYGEGLEGLLSWRSGKLSGILNGIDYDLWNPETDPYIAANYSAEVLPAKAENK 279 (487) T ss_pred HHHHHHHEECCEEEEECHHHHHHHCCCCCCCCCHHHHHHCCCCEEEEEEEEEECCCCCCCCCCHHCCCCCCCHHHHHHHH T ss_conf 26654304222899988778776526002555210555104518998744773231866443300147701033467899 Q ss_pred HHHHHHCC-----CCCEEECCCCCCCC--CCCCCCCC-CCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCC Q ss_conf 99998617-----98617504655332--10000000-000123412488970552114788877653001011001344 Q gi|254780402|r 226 SLYQESIA-----GRYTWAAISTFEGE--EDKAVYVH-NFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGD 297 (440) Q Consensus 226 ~~~~~~~~-----~r~v~vagSth~~E--E~~il~a~-~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~ 297 (440) ..+.+.++ +.|++.+-|.-+.. -+.++.+. ..+.+. .+++|...- ++ .+...+... ++.. T Consensus 280 ~~L~~~~gL~~~~~~pl~~~vsRl~~QKG~dl~~~~i~~~l~~~--~~~vilG~g-d~--~le~~~~~l------a~~~- 347 (487) T COG0297 280 VALQERLGLDVDLPGPLFGFVSRLTAQKGLDLLLEAIDELLEQG--WQLVLLGTG-DP--ELEEALRAL------ASRH- 347 (487) T ss_pred HHHHHHHCCCCCCCCCEEEEEECCCCCCCHHHHHHHHHHHHHHC--CEEEEEECC-CH--HHHHHHHHH------HHHC- T ss_conf 99999809877999758999422420134558999999999708--369998268-27--899999999------9866- Q ss_pred CCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHH-------HHHHCCCE Q ss_conf 44466416750676520255422082066135544454583768961994999898443799999-------99978986 Q gi|254780402|r 298 VINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYR-------RMVSSGAV 370 (440) Q Consensus 298 ~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~-------~L~~~g~~ 370 (440) +. ..-+.+.=....-...|+.|| +|+=.|..+-=|-+-++++.+|+..|.=+ +..-.+.+. .-...|.. T Consensus 348 -~~-~~~~~i~~~~~la~~i~agaD-~~lmPSrfEPcGL~ql~amryGtvpIv~~-tGGLadTV~~~~~~~~~~~gtGf~ 423 (487) T COG0297 348 -PG-RVLVVIGYDEPLAHLIYAGAD-VILMPSRFEPCGLTQLYAMRYGTLPIVRE-TGGLADTVVDRNEWLIQGVGTGFL 423 (487) T ss_pred -CC-EEEEEEEECHHHHHHHHHCCC-EEEECCCCCCCCHHHHHHHHCCCCCEECC-CCCCCCEECCCCCHHCCCCEEEEE T ss_conf -76-099996336899999984498-99957767687589999987188446754-688654103766110037503799 Q ss_pred EEECCHHHHHHHHHHHHC---CHHH-HHHHHHHHHH Q ss_conf 995798999999999958---9899-9999999999 Q gi|254780402|r 371 RIVEEVGTLADMVYSLLS---EPTI-RYEMINAAIN 402 (440) Q Consensus 371 ~~v~d~~eL~~~l~~ll~---d~~~-~~~~~~~a~~ 402 (440) +.-.|++++..++.+-+. +++. .+.+..++.. T Consensus 424 f~~~~~~~l~~al~rA~~~y~~~~~~w~~~~~~~m~ 459 (487) T COG0297 424 FLQTNPDHLANALRRALVLYRAPPLLWRKVQPNAMG 459 (487) T ss_pred EECCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCC T ss_conf 826998999999999999960878999999986424 No 157 >cd01976 Nitrogenase_MoFe_alpha Nitrogenase_MoFe_alpha_II: Nitrogenase MoFe protein, beta subunit. A group of proteins similar to the alpha subunit of the MoFe protein of the molybdenum (Mo-) nitrogenase. The nitrogenase enzyme catalyzes the ATP-dependent reduction of dinitrogen to ammonia. The Mo-nitrogenase is the most widespread and best characterized of these systems. Mo-nitrogenase consists of the MoFe protein (component 1) and the Fe protein (component 2). MoFe is an alpha2beta2 tetramer. Each alphabeta pair of MoFe contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha subunit. The Fe protein contains a single [4Fe-4S] cluster. Electrons are transferred from the [4Fe-4S] cluster of the Fe protein to the P-cluster of the MoFe and in turn to FeMoCo, the site of substrate reduction. Probab=59.64 E-value=13 Score=17.19 Aligned_cols=130 Identities=13% Similarity=0.117 Sum_probs=60.6 Q ss_pred HHHHHHHHHHHHHHHCCC-C-EEEEECCCCCC---H---HHHCCCCCCEEEEECCCCCCHHHH------------HHHHH Q ss_conf 999999999999986199-8-99996277630---0---121014774168850557201344------------55744 Q gi|254780402|r 67 GETMALIGLIPAIRSRHV-N-VLLTTMTATSA---K---VARKYLGQYAIHQYAPLDIQPAVS------------RFLKY 126 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~-~-iliT~~T~tg~---~---~~~~~~~~~~~~~y~P~D~~~~~~------------~fl~~ 126 (440) ||=.....+.+...+-.| + |.|.+|-++|. + .+++. .+....-.+|++.|.+.. .++++ T Consensus 82 GekKL~~aI~ea~~~~~P~kaIfV~sTC~~glIGDDI~aV~~~~-~~~~gipVIpv~~~GF~g~s~~~G~~~a~~al~~~ 160 (421) T cd01976 82 GDKKLAKAIDEAYELFPLNKGISVQSECPVGLIGDDIEAVARKA-SKELGIPVVPVRCEGFRGVSQSLGHHIANDAIRDH 160 (421) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEEECCCHHHHCCCHHHHHHHH-HHHCCCCEEEEECCCCCCCCHHHHHHHHHHHHHHH T ss_conf 79999999999998579730899992373786556899999998-77529988998689857752306889999999998 Q ss_pred H----CC------EEEEEECC----CCHHHHHHHHHHCCCCEEEEEECCCCC-CCCCCHHHHHHHHHCCCCCCEEEECCH Q ss_conf 2----02------03564047----621467888742245505864126654-212100111123310264423664157 Q gi|254780402|r 127 W----KP------DCMILSES----DIWPLTVFELSKQRIPQVLVNARMSRR-SFKNWKTVLSFSKKIFSQFSLVIVQSE 191 (440) Q Consensus 127 ~----~P------~~~i~~e~----ElWPnli~~~~~~~ip~~linarls~~-S~~~~~~~~~l~~~~~~~~~~i~~qs~ 191 (440) + .| ++-++-|+ |+| .+-..+.+.||.+ |+.++.. |+. .+.+.--..++.+.|... T Consensus 161 vv~~~~~~~~~~~~iNiige~ni~gd~~-~i~~lL~~mGi~V---~~~~tgd~t~~------di~~~~~A~lNvv~C~~~ 230 (421) T cd01976 161 ILGKRNEFEPTPYDVNIIGDYNIGGDAW-ASRILLEEMGLRV---VAQWSGDGTLN------EMENAHKAKLNLIHCYRS 230 (421) T ss_pred HCCCCCCCCCCCCCEEEECCCCCCCCHH-HHHHHHHHCCCCE---EEEEECCCCHH------HHHHCCCCCEEEEECHHH T ss_conf 4166887788984179975566763299-9999999839837---99980899799------997320076998853579 Q ss_pred HH--HHHHH-HHCCCCEEE Q ss_conf 78--88998-608850587 Q gi|254780402|r 192 RY--FRRYK-ELGAQKLIV 207 (440) Q Consensus 192 ~~--~~rl~-~lG~~~i~v 207 (440) .. ++.++ +.|++-+.+ T Consensus 231 ~~~~A~~le~~yGiP~~~~ 249 (421) T cd01976 231 MNYIARMMEEKYGIPWMEY 249 (421) T ss_pred HHHHHHHHHHHHCCCEEEE T ss_conf 9999999999869896951 No 158 >TIGR03172 probable selenium-dependent hydroxylase accessory protein YqeC. This uncharacterized protein family includes YqeC from Escherichia coli. A phylogenetic profiling analysis shows correlation with SelD, the selenium donor protein, even in species where SelD contributes to neither selenocysteine nor selenouridine biosynthesis. Instead, this family, and families TIGR03309 and TIGR03310 appear to mark selenium-dependent molybdenum hydroxylase maturation systems. Probab=59.63 E-value=11 Score=17.64 Aligned_cols=27 Identities=33% Similarity=0.379 Sum_probs=23.4 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 999999999999986199899996277 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTA 93 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~ 93 (440) |-+....-|.+++.+++.+|+|||||- T Consensus 9 GKTS~m~~LA~e~~~~G~~VlvTTTT~ 35 (232) T TIGR03172 9 GKTSTMFWLAAEYRKEGYRVLVTTTTR 35 (232) T ss_pred CHHHHHHHHHHHHHHCCCCEEEECCCC T ss_conf 799999999999987699099988966 No 159 >cd01574 PBP1_LacI Ligand-binding domain of DNA transcription repressor LacI specific for lactose, a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor LacI specific for lactose, a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of LacI is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA b Probab=59.29 E-value=13 Score=17.15 Aligned_cols=107 Identities=21% Similarity=0.214 Sum_probs=66.7 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHH Q ss_conf 99999999999998619989999627763001210147741688505572013445574420203564047621467888 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFE 146 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~ 146 (440) --...+..+-+.+.+.+.+++|.+++..-.+..++. ++. +...+.+.+|++.+......... T Consensus 13 f~~~~~~gie~~~~~~Gy~~~l~~~~~~~~~~~~~~-----------------i~~-l~~~~vdGiI~~~~~~~~~~~~~ 74 (264) T cd01574 13 GPSSTLAAIESAAREAGYAVTLSMLAEADEEALRAA-----------------VRR-LLAQRVDGVIVNAPLDDADAALA 74 (264) T ss_pred HHHHHHHHHHHHHHHCCCEEEEEECCCCCHHHHHHH-----------------HHH-HHHCCCCEEEEECCCCCHHHHHH T ss_conf 999999999999998499799997999989999999-----------------999-99649999999168899699999 Q ss_pred HHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECC----HHHHHHHHHHCCCCEEEEC Q ss_conf 74224550586412665421210011112331026442366415----7788899860885058720 Q gi|254780402|r 147 LSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQS----ERYFRRYKELGAQKLIVSG 209 (440) Q Consensus 147 ~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs----~~~~~rl~~lG~~~i~v~G 209 (440) ..+.+||+++++....+. +..|..-+ ..-.+.|.+.|.+++...| T Consensus 75 ~~~~~iPvV~i~~~~~~~------------------~~~V~~Dn~~~~~~a~~~L~~~G~~~i~~i~ 123 (264) T cd01574 75 AAPADVPVVFVDGSPSPR------------------VSTVSVDQEGGARLATEHLLELGHRTIAHVA 123 (264) T ss_pred HHHCCCCEEEECCCCCCC------------------CCEEEECHHHHHHHHHHHHHHCCCCCEEECC T ss_conf 985899689986679999------------------9889978899999999999984998273258 No 160 >KOG0780 consensus Probab=58.80 E-value=13 Score=17.10 Aligned_cols=124 Identities=19% Similarity=0.258 Sum_probs=58.1 Q ss_pred CCEEEEEE---ECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEEC-CCCCCHHHHHHC--CCEEEECCCC---- Q ss_conf 41248897---0552114788877653001011001344444664167506-765202554220--8206613554---- Q gi|254780402|r 262 TDVLTIIV---PRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGD-TIGEMGFYLRMT--EIAFIGRSFC---- 331 (440) Q Consensus 262 ~~~~lIIv---PRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~D-t~GeL~~lY~~A--diafVGGSl~---- 331 (440) .+.-+||+ .||-....-..++..-. + ...++.-|++.| ++|.-..--+-| +.+=||+-.. T Consensus 182 e~fdvIIvDTSGRh~qe~sLfeEM~~v~--------~--ai~Pd~vi~VmDasiGQaae~Qa~aFk~~vdvg~vIlTKlD 251 (483) T KOG0780 182 ENFDVIIVDTSGRHKQEASLFEEMKQVS--------K--AIKPDEIIFVMDASIGQAAEAQARAFKETVDVGAVILTKLD 251 (483) T ss_pred CCCCEEEEECCCCHHHHHHHHHHHHHHH--------H--HCCCCEEEEEEECCCCHHHHHHHHHHHHHHCCCEEEEEECC T ss_conf 3972899827873012489999999998--------5--15987389998562007679999988776154037997225 Q ss_pred --CCCCCCHHHHHHHCCCEEE---CCCCCCHHHH-----HHHHHHCCCEEEECCHHHHHHHHHHHH-CCHHHHHHHHHHH Q ss_conf --4454583768961994999---8984437999-----999997898699579899999999995-8989999999999 Q gi|254780402|r 332 --ASGGQNPLEAAMLGCAILS---GPNVENFRDI-----YRRMVSSGAVRIVEEVGTLADMVYSLL-SEPTIRYEMINAA 400 (440) Q Consensus 332 --~~GGhN~lEpaa~G~pVi~---GP~~~nf~e~-----~~~L~~~g~~~~v~d~~eL~~~l~~ll-~d~~~~~~~~~~a 400 (440) .+||--+---|+-++||+| |-|++.+..- +..|... .|-+.|.+.+..+- .|.+...+..+.+ T Consensus 252 GhakGGgAlSaVaaTksPIiFIGtGEhmdDlE~F~pk~FvsrlLGm------GDi~glvek~~ev~~~d~~el~~kl~~g 325 (483) T KOG0780 252 GHAKGGGALSAVAATKSPIIFIGTGEHMDDLEPFDPKPFVSRLLGM------GDIEGLVEKVQEVGKDDAKELVEKLKQG 325 (483) T ss_pred CCCCCCCEEEEHHHHCCCEEEEECCCCCCCCCCCCHHHHHHHHHCC------CCHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 6777773453035407987998167551115777807799987156------5289999999998552089999998828 Q ss_pred H Q ss_conf 9 Q gi|254780402|r 401 I 401 (440) Q Consensus 401 ~ 401 (440) + T Consensus 326 k 326 (483) T KOG0780 326 K 326 (483) T ss_pred C T ss_conf 7 No 161 >cd06272 PBP1_hexuronate_repressor_like Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor for the hexuronate utilization operon from Bacillus species and its close homologs from other bacteria, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor Probab=58.74 E-value=13 Score=17.09 Aligned_cols=101 Identities=21% Similarity=0.175 Sum_probs=64.0 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHH Q ss_conf 99999999998619989999627763001210147741688505572013445574420203564047621467888742 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSK 149 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~ 149 (440) ..+..+-+.+.+.+.++++.++ ....+ .. +.+...+.+-+|++....-+..+..+.+ T Consensus 16 ~l~~gi~~~~~~~GY~~ll~~~-~~~~~---------------------~~-~~l~~~~vDGiIi~~~~~~~~~~~~~~~ 72 (261) T cd06272 16 ELVTGINQAISKNGYNMNVSIT-PSLAE---------------------AE-DLFKENRFDGVIIFGESASDVEYLYKIK 72 (261) T ss_pred HHHHHHHHHHHHCCCEEEEEEC-CCHHH---------------------HH-HHHHHCCCCEEEEECCCCCHHHHHHHHH T ss_conf 9999999999984998999968-98599---------------------99-9999679989999789899799999974 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECC----HHHHHHHHHHCCCCEEEECCC Q ss_conf 24550586412665421210011112331026442366415----778889986088505872043 Q gi|254780402|r 150 QRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQS----ERYFRRYKELGAQKLIVSGNL 211 (440) Q Consensus 150 ~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs----~~~~~rl~~lG~~~i~v~Gnl 211 (440) .++|+++++ |-.+. .++.|.+-+ ....+.|.+.|-+++-..|.. T Consensus 73 ~~iPvV~~~-~~~~~-----------------~~~~V~~Dn~~~~~~a~~~L~~~G~~~i~~i~~~ 120 (261) T cd06272 73 LAIPVVSYG-VDYDL-----------------KYPIVNVDNEKAMELAVLYLAEKGHKKIAYIGDL 120 (261) T ss_pred CCCCEEEEC-CCCCC-----------------CCCEEEECHHHHHHHHHHHHHHCCCCCEEEECCC T ss_conf 899399983-67899-----------------9989996889999999999998699828996698 No 162 >TIGR01011 rpsB_bact ribosomal protein S2; InterPro: IPR005706 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . This family describes the bacterial, mitochondrial and chloroplast forms of ribosomal protein S2.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015935 small ribosomal subunit. Probab=58.00 E-value=9.2 Score=18.14 Aligned_cols=50 Identities=20% Similarity=0.150 Sum_probs=28.2 Q ss_pred CCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEE Q ss_conf 00001234124889705521147888776530010110013444446641675 Q gi|254780402|r 255 HNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFL 307 (440) Q Consensus 255 ~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i 307 (440) .+-+++.|+++.||=|++ |+ --..-+++.|+++.---+-...+...|+.| T Consensus 151 Ik~M~~lPd~l~viD~~~-E~--IAv~EA~kLgIPvvA~vDTNCdPd~vD~~I 200 (227) T TIGR01011 151 IKDMKKLPDLLFVIDPRK-EK--IAVAEARKLGIPVVAIVDTNCDPDVVDYPI 200 (227) T ss_pred HHHHHCCCCEEEEECCCC-CH--HHHHHHHHCCCCEEEEECCCCCCCCCCCCC T ss_conf 254531697089974821-06--899997337997898704788888665025 No 163 >cd01078 NAD_bind_H4MPT_DH NADP binding domain of methylene tetrahydromethanopterin dehydrogenase. Methylene Tetrahydromethanopterin Dehydrogenase (H4MPT DH) NADP binding domain. NADP-dependent H4MPT DH catalyzes the dehydrogenation of methylene- H4MPT and methylene-tetrahydrofolate (H4F) with NADP+ as cofactor. H4F and H4MPT are both cofactors that carry the one-carbon units between the formyl and methyl oxidation level. H4F and H4MPT are structurally analogous to each other with respect to the pterin moiety, but each has distinct side chain. H4MPT is present only in anaerobic methanogenic archaea and aerobic methylotrophic proteobacteria. H4MPT seems to have evolved independently from H4F and functions as a distinct carrier in C1 metabolism. Amino acid DH-like NAD(P)-binding domains are members of the Rossmann fold superfamily and include glutamate, leucine, and phenylalanine DHs, methylene tetrahydrofolate DH, methylene-tetrahydromethanopterin DH, methylene-tetrahydropholate DH/cyclo Probab=57.65 E-value=14 Score=16.97 Aligned_cols=53 Identities=19% Similarity=0.234 Sum_probs=35.4 Q ss_pred HHHHHCCCCCEEECCCCCC-CCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHH Q ss_conf 9998617986175046553-3210000000000123412488970552114788877653 Q gi|254780402|r 227 LYQESIAGRYTWAAISTFE-GEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIA 285 (440) Q Consensus 227 ~~~~~~~~r~v~vagSth~-~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~ 285 (440) .+...+.++.+.+.|.|-+ |-... .++ ......++|+-|..||.+++.+.++. T Consensus 21 ~~g~dl~g~~~~V~G~tG~vG~~~A-----~~l-A~~Ga~v~lv~R~~ek~~~~a~~i~~ 74 (194) T cd01078 21 LMGKDLKGKTAVVLGGTGPVGQRAA-----VLL-AREGARVVLVGRDLERAQKAADSLRA 74 (194) T ss_pred HHCCCCCCCEEEEECCCCHHHHHHH-----HHH-HHCCCEEEEEECCHHHHHHHHHHHHH T ss_conf 8299867988999858857899999-----999-98399799995878889999999999 No 164 >COG4671 Predicted glycosyl transferase [General function prediction only] Probab=57.05 E-value=14 Score=16.91 Aligned_cols=300 Identities=15% Similarity=0.142 Sum_probs=163.0 Q ss_pred EECCHHHHHHHHHHHHHHHHCC--CCEEEEE-CCC-CCCHHHHC----CCCC---CE----EEEECCCCCCHH------- Q ss_conf 8178999999999999998619--9899996-277-63001210----1477---41----688505572013------- Q gi|254780402|r 62 HASSVGETMALIGLIPAIRSRH--VNVLLTT-MTA-TSAKVARK----YLGQ---YA----IHQYAPLDIQPA------- 119 (440) Q Consensus 62 HaaSvGE~~~a~pli~~l~~~~--~~iliT~-~T~-tg~~~~~~----~~~~---~~----~~~y~P~D~~~~------- 119 (440) |.-.+|-..-+..+++.|.+++ .+|++-+ .|+ +|+..... ..|. .. ...+.-.|.-.+ T Consensus 18 d~~GlGHlrR~~~Ia~aLv~d~~~~~Il~IsG~~~~~~F~~~~gVd~V~LPsl~k~~~G~~~~~d~~~~l~e~~~~Rs~l 97 (400) T COG4671 18 DLLGLGHLRRALRIAHALVEDYLGFDILIISGGPPAGGFPGPAGVDFVKLPSLIKGDNGEYGLVDLDGDLEETKKLRSQL 97 (400) T ss_pred HHCCCHHHHHHHHHHHHHHHCCCCCEEEEEECCCCCCCCCCCCCCCEEECCCEEECCCCCEEEEECCCCHHHHHHHHHHH T ss_conf 10130489999999999852556843999958975688977456756854845745887334445688899999999999 Q ss_pred HHHHHHHHCCEEEEE------ECCCCHHHHHHHHHHCCCCEEE-E-EECCCCCCC-CCCHHHHHHHHHCCCCCCEEEECC Q ss_conf 445574420203564------0476214678887422455058-6-412665421-210011112331026442366415 Q gi|254780402|r 120 VSRFLKYWKPDCMIL------SESDIWPLTVFELSKQRIPQVL-V-NARMSRRSF-KNWKTVLSFSKKIFSQFSLVIVQS 190 (440) Q Consensus 120 ~~~fl~~~~P~~~i~------~e~ElWPnli~~~~~~~ip~~l-i-narls~~S~-~~~~~~~~l~~~~~~~~~~i~~qs 190 (440) +..-++.++||.+|. +..|+=|- +...+.++-+.++ + +-|-..... +-|++ ..-.+.+=+.||.|.+.. T Consensus 98 il~t~~~fkPDi~IVd~~P~Glr~EL~pt-L~yl~~~~t~~vL~lr~i~D~p~~~~~~w~~-~~~~~~I~r~yD~V~v~G 175 (400) T COG4671 98 ILSTAETFKPDIFIVDKFPFGLRFELLPT-LEYLKTTGTRLVLGLRSIRDIPQELEADWRR-AETVRLINRFYDLVLVYG 175 (400) T ss_pred HHHHHHHCCCCEEEEECCCCCHHHHHHHH-HHHHHHCCCCCEEEHHHHHHCHHHHCCCHHH-HHHHHHHHHHHEEEEEEC T ss_conf 99999832997899935554136546679-9998605986255447655254441351445-679999987540799946 Q ss_pred HHHHHHHH-HHCC-----CCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEEC-CCCCCCCC--CCCCCCCCCCCCC Q ss_conf 77888998-6088-----50587204310123442212568999986179861750-46553321--0000000000123 Q gi|254780402|r 191 ERYFRRYK-ELGA-----QKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAA-ISTFEGEE--DKAVYVHNFIKCR 261 (440) Q Consensus 191 ~~~~~rl~-~lG~-----~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~va-gSth~~EE--~~il~a~~~l~~~ 261 (440) .-+...+. +++. .++..+|=++-+....+...... ..++.|+|. |.-..|.| +-.++|.+.+..- T Consensus 176 dP~f~d~~~~~~~~~~i~~k~~ytG~vq~~~~~~~~p~~~~------pE~~~Ilvs~GGG~dG~eLi~~~l~A~~~l~~l 249 (400) T COG4671 176 DPDFYDPLTEFPFAPAIRAKMRYTGFVQRSLPHLPLPPHEA------PEGFDILVSVGGGADGAELIETALAAAQLLAGL 249 (400) T ss_pred CCCCCCHHHCCCCCHHHHHHEEEEEEEECCCCCCCCCCCCC------CCCCEEEEECCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 95415732227860765632667677613676788987678------763339995488720599999999875507788 Q ss_pred -CCEEEEEEECCCCH-HHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHH Q ss_conf -41248897055211-4788877653001011001344444664167506765202554220820661355444545837 Q gi|254780402|r 262 -TDVLTIIVPRHPRR-CDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPL 339 (440) Q Consensus 262 -~~~~lIIvPRhpeR-~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~l 339 (440) +..++|.-|-=|+. .+.+..... ...+|-|..=.-++..+.+-|+.++--|+.. +.- T Consensus 250 ~~~~~ivtGP~MP~~~r~~l~~~A~----------------~~p~i~I~~f~~~~~~ll~gA~~vVSm~GYN-----TvC 308 (400) T COG4671 250 NHKWLIVTGPFMPEAQRQKLLASAP----------------KRPHISIFEFRNDFESLLAGARLVVSMGGYN-----TVC 308 (400) T ss_pred CCCEEEEECCCCCHHHHHHHHHHCC----------------CCCCEEEEEHHHHHHHHHHHHHEEEECCCCH-----HHH T ss_conf 7433898489998899999987425----------------6997289973303999987644035204622-----266 Q ss_pred HHHHHCCCEEECCCCCCHHHH---HHHHHHCCCEEEEC----CHHHHHHHHHHHHCCH Q ss_conf 689619949998984437999---99999789869957----9899999999995898 Q gi|254780402|r 340 EAAMLGCAILSGPNVENFRDI---YRRMVSSGAVRIVE----EVGTLADMVYSLLSEP 390 (440) Q Consensus 340 Epaa~G~pVi~GP~~~nf~e~---~~~L~~~g~~~~v~----d~~eL~~~l~~ll~d~ 390 (440) |-..+|||.+.=|...--+|- ++.|.+.|.+-... +++.|++++...++.| T Consensus 309 eILs~~k~aLivPr~~p~eEQliRA~Rl~~LGL~dvL~pe~lt~~~La~al~~~l~~P 366 (400) T COG4671 309 EILSFGKPALIVPRAAPREEQLIRAQRLEELGLVDVLLPENLTPQNLADALKAALARP 366 (400) T ss_pred HHHHCCCCEEEECCCCCCHHHHHHHHHHHHCCCCEEECCCCCCHHHHHHHHHHCCCCC T ss_conf 8873799559843678738999999999866951030735589689999998542689 No 165 >cd02809 alpha_hydroxyacid_oxid_FMN Family of homologous FMN-dependent alpha-hydroxyacid oxidizing enzymes. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). In green plants, glycolate oxidase is one of the key enzymes in photorespiration where it oxidizes glycolate to glyoxylate. LMO catalyzes the oxidation of L-lactate to acetate and carbon dioxide. MDH oxidizes (S)-mandelate to phenylglyoxalate. It is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate. Probab=55.65 E-value=15 Score=16.76 Aligned_cols=117 Identities=15% Similarity=0.113 Sum_probs=46.9 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCC-CEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHH Q ss_conf 999999999999986199899996277630012101477-4168850557201344557442020356404762146788 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQ-YAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVF 145 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~-~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~ 145 (440) ||...++ .-.+.+-...++|.+.++.+.+.+..+. ...-.|.|-|.. ... .++. T Consensus 82 GE~~~Ar----aA~~~gi~~~lSt~ss~slEei~~~~~~~~wfQLY~~~d~~-~~~--------------------~li~ 136 (299) T cd02809 82 GELATAR----AAAAAGIPFTLSTVSTTSLEEVAAAAPGPRWFQLYVPRDRE-ITE--------------------DLLR 136 (299) T ss_pred HHHHHHH----HHHHHCCCEECCCCCCCCHHHHHHHCCCCEEEEEECCCCHH-HHH--------------------HHHH T ss_conf 2699999----99970564311376656689999744898467764369999-999--------------------9999 Q ss_pred HHHHCCCCEEEEEECCCCCCC-CCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEE Q ss_conf 874224550586412665421-210011112331026442366415778889986088505872 Q gi|254780402|r 146 ELSKQRIPQVLVNARMSRRSF-KNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVS 208 (440) Q Consensus 146 ~~~~~~ip~~linarls~~S~-~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~ 208 (440) .+.+.|.+.+.+..-..-... -.|..+.++.+..=.++-.=-.++.+|+++-.+.|++-|.|+ T Consensus 137 rA~~aG~~al~lTvD~p~~g~R~~w~~i~~l~~~~~~p~i~KGi~~~~DA~~a~~~G~dgI~VS 200 (299) T cd02809 137 RAEAAGYKALVLTVDTPVLGRRLTWDDLAWLRSQWKGPLILKGILTPEDALRAVDAGADGIVVS 200 (299) T ss_pred HHHHCCCCEEEEECCCCCCCCCCCHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHCCCCEEEEC T ss_conf 9998599989997058987887999999999986699879972788999999998599889972 No 166 >PRK13287 amiF formamidase; Provisional Probab=55.38 E-value=15 Score=16.73 Aligned_cols=62 Identities=23% Similarity=0.353 Sum_probs=38.0 Q ss_pred EEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCC----------CCHHHHHHHHHHCCCEEEE Q ss_conf 167506765202554220820661355444545837689619949998984----------4379999999978986995 Q gi|254780402|r 304 DIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNV----------ENFRDIYRRMVSSGAVRIV 373 (440) Q Consensus 304 ~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~----------~nf~e~~~~L~~~g~~~~v 373 (440) .-+|+|.-||+...|+--. ..+|.. ++||--++.||.-||.- .||+|+.+++.-.||=..+ T Consensus 116 T~iLIn~~Geiv~kYRKi~------PWvPiE---~W~PGd~~~pV~dgPkG~KI~liIC~DG~fPEiaRe~A~kGAEvii 186 (333) T PRK13287 116 TAIIIDDQGEIILKYRKLH------PWVPVE---PWEPGDLGIPVCDGPGGSKLAVCICHDGMFPEMAREAAYKGANVMI 186 (333) T ss_pred EEEEECCCCCEEEEEEECC------CCCCCC---CCCCCCCCCEEEECCCCCEEEEEEECCCCCHHHHHHHHHCCCEEEE T ss_conf 2899889981888998225------655677---7358986515654688757899997589981899998746887999 Q ss_pred C Q ss_conf 7 Q gi|254780402|r 374 E 374 (440) Q Consensus 374 ~ 374 (440) + T Consensus 187 R 187 (333) T PRK13287 187 R 187 (333) T ss_pred E T ss_conf 8 No 167 >cd06298 PBP1_CcpA_like Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation. Ligand-binding domain of the catabolite control protein A (CcpA), which functions as the major transcriptional regulator of carbon catabolite repression/regulation (CCR), a process in which enzymes necessary for the metabolism of alternative sugars are inhibited in the presence of glucose. In gram-positive bacteria, CCR is controlled by HPr, a phosphoenolpyruvate:sugar phsophotrasnferase system (PTS) and a transcriptional regulator CcpA. Moreover, CcpA can regulate sporulation and antibiotic resistance as well as play a role in virulence development of certain pathogens such as the group A streptococcus. The ligand binding domain of CcpA is a member of the LacI-GalR family of bacterial transcription regulators. Probab=54.93 E-value=15 Score=16.68 Aligned_cols=107 Identities=15% Similarity=0.119 Sum_probs=64.9 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHH Q ss_conf 99999999998619989999627763001210147741688505572013445574420203564047621467888742 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSK 149 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~ 149 (440) ..+..+-+.+.+.+.++++.++... .+ ++. ..++ .+...+++.+|++.++.=+..+..+.+ T Consensus 16 ~l~~gie~~~~~~gy~~li~~s~~~-~~--~e~---------------~~i~-~l~~~~vDGiI~~~~~~~~~~~~~l~~ 76 (268) T cd06298 16 ELARGIDDIATMYKYNIILSNSDND-KE--KEL---------------KVLN-NLLAKQVDGIIFMGGKISEEHREEFKR 76 (268) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCC-HH--HHH---------------HHHH-HHHHCCCCEEEEECCCCCHHHHHHHHH T ss_conf 9999999999987998999989999-89--999---------------9999-999669999998267799999999996 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECC----HHHHHHHHHHCCCCEEEE-CCCC Q ss_conf 24550586412665421210011112331026442366415----778889986088505872-0431 Q gi|254780402|r 150 QRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQS----ERYFRRYKELGAQKLIVS-GNLK 212 (440) Q Consensus 150 ~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs----~~~~~rl~~lG~~~i~v~-GnlK 212 (440) .++|+++++....+. .++.|.+-+ ....++|.+.|-++|-.. |+.. T Consensus 77 ~~~PvV~i~~~~~~~-----------------~~~~V~~Dn~~~~~~a~~~L~~~G~~~I~~i~~~~~ 127 (268) T cd06298 77 SPTPVVLAGSVDEDN-----------------ELPSVNIDYKKAAFEATELLIKNGHKKIAFISGPLE 127 (268) T ss_pred CCCCEEEECCCCCCC-----------------CCCEEEECHHHHHHHHHHHHHHCCCCEEEEEECCCC T ss_conf 699899957757899-----------------987898488999999999999729966999967877 No 168 >pfam01993 MTD methylene-5,6,7,8-tetrahydromethanopterin dehydrogenase. This enzyme family is involved in formation of methane from carbon dioxide EC:1.5.99.9. The enzyme requires coenzyme F420. Probab=54.86 E-value=12 Score=17.39 Aligned_cols=83 Identities=17% Similarity=0.247 Sum_probs=45.8 Q ss_pred EEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCH Q ss_conf 98178999999999999998619989999627763001210147741688505572013445574420203564047621 Q gi|254780402|r 61 FHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIW 140 (440) Q Consensus 61 ~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElW 140 (440) +.+-..|-...+.-++.+.-. +.++-+-+.+ ||.++-. -+.-..+.+.++.|+|+++|++. T Consensus 7 iK~GNIg~s~~idl~LDErAd-RedI~vrv~g-sGaKm~p-------------e~~e~~~~~~l~~~~pDf~i~is---- 67 (276) T pfam01993 7 IKCGNIGTSPVVDLLLDERAD-REDIEVRVVG-SGAKMDP-------------ECVEEVVLDMLEEFEPDFVIYIS---- 67 (276) T ss_pred EEECCCCHHHHHHHHHHHHHC-CCCCEEEEEC-CCCCCCH-------------HHHHHHHHHHHHHHCCCEEEEEC---- T ss_conf 974452159999998776523-4686499952-6666798-------------88999999999861899899978---- Q ss_pred HH--------HHHHHHHCCCCEEEEEECCC Q ss_conf 46--------78887422455058641266 Q gi|254780402|r 141 PL--------TVFELSKQRIPQVLVNARMS 162 (440) Q Consensus 141 Pn--------li~~~~~~~ip~~linarls 162 (440) || -=..++..|||++.|+-.-+ T Consensus 68 PN~a~PGP~~ARE~l~~~giP~IvI~D~p~ 97 (276) T pfam01993 68 PNPAAPGPKKAREMLSDSGYPAVIIGDAPG 97 (276) T ss_pred CCCCCCCCHHHHHHHHHCCCCEEEECCCCC T ss_conf 998899956799999756998799837863 No 169 >TIGR00450 thdF tRNA modification GTPase TrmE; InterPro: IPR004520 The GTP-binding domain of all TrmE/ThdF orthologues is found in the C-terminal portion of the molecule. The N-terminal half can be removed without affecting the GTP-binding/hydrolysis function of the GTP-binding domain. The last four amino acids of all orthologues of ThdF/TrmE are highly conserved, being either CIGK or CLGK. This matches the Caax (where 'a' represents an aliphatic amino acid, and 'x' represents any amino acid) motif for isoprenylation that anchors small GTP-binding proteins to cell membranes in eukaryotic cells. However, protein isoprenylation has never been shown to occur in bacteria. Interestingly, biochemical experiments have shown that the Escherichia coli TrmE protein peripherally associates with the membrane fraction . Although the biochemical properties of TrmE have been investigated for the E. coli and Thermotoga maritima proteins, nothing is known about the relationship of this protein to tRNA modification. Orthologues of TrmE are present in eukaryotes and bacteria, but are not present in archaea. In Saccharomyces cerevisiae, Mss1p is a nuclear-encoded mitochondrial protein that is the yeast orthologue of TrmE. Mss1p interacts with the 15S rRNA of the yeast mitochondria, which is equivalent to the 16S rRNA of bacteria. Subsequent analysis of the S. cerevisiae MTO1 gene suggests that MSS1 and MTO1 act together in a pathway involved in optimizing mitochondrial protein synthesis. TrmE may play a role in tRNA processing and may be directly or indirectly involved in regulating ribosome function.; GO: 0003924 GTPase activity, 0005525 GTP binding, 0006400 tRNA modification, 0005622 intracellular. Probab=54.05 E-value=16 Score=16.59 Aligned_cols=184 Identities=12% Similarity=0.039 Sum_probs=76.0 Q ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCC-CCEEEE-ECCCCCCHHHHCCCC--C-CEEEEECCCCCCHHHHHHHHHHCCEE Q ss_conf 779998178999999999999998619-989999-627763001210147--7-41688505572013445574420203 Q gi|254780402|r 57 PLIWFHASSVGETMALIGLIPAIRSRH-VNVLLT-TMTATSAKVARKYLG--Q-YAIHQYAPLDIQPAVSRFLKYWKPDC 131 (440) Q Consensus 57 ~~IW~HaaSvGE~~~a~pli~~l~~~~-~~iliT-~~T~tg~~~~~~~~~--~-~~~~~y~P~D~~~~~~~fl~~~~P~~ 131 (440) .+|=+.| .|-+..+..+++-+.+.+ ...-=- =||.-+|- ..++. . ..++...=-..+....-.++..+=++ T Consensus 78 D~iEi~C--HGs~~~v~~i~~L~L~~GGar~A~PGEFt~raFL--n~k~dL~~aea~~~Lil~~~~~~~~~~l~~L~G~l 153 (473) T TIGR00450 78 DVIEIQC--HGSVLIVQEILQLLLKSGGARLAKPGEFTQRAFL--NGKMDLTQAEAINELILAPNEKVKDIALNKLAGEL 153 (473) T ss_pred CEEEEEE--CCCCHHHHHHHHHHHHCCCCEECCCCCCHHHHHH--HCCCCHHHHHHHHHHHCCCCHHHHHHHHHHCCCCH T ss_conf 3589960--7870478899999983588023267652046664--12447789998877543888779999998506712 Q ss_pred EEEECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCH--HHHHHHHHHCC-CCEEEE Q ss_conf 564047621467888742245505864126654212100111123310264423664157--78889986088-505872 Q gi|254780402|r 132 MILSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSE--RYFRRYKELGA-QKLIVS 208 (440) Q Consensus 132 ~i~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~--~~~~rl~~lG~-~~i~v~ 208 (440) -+.+| ++--+++.-+....+-+= -.--..+-++.--.-+..........+..++--++ .+.+-|+++.- -++.+. T Consensus 154 ~~~i~-~~~~~~l~ll~~~ev~iD-Y~~~~~e~d~~~~~~~~~~~~~~~~~L~~i~~~~~aq~~~~vl~~l~~g~k~ai~ 231 (473) T TIGR00450 154 DQKIE-ELRKSLLQLLAQVEVNID-YEEDDDELDQLELVSLNQKLEKIIAELKDILNSANAQRSKKVLEKLKDGFKLAIV 231 (473) T ss_pred HHHHH-HHHHHHHHHHHHHHHCCC-CCCCCCCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEE T ss_conf 25689-999999888887431026-7545775311200017899999999999999876410034589984089479996 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHC-CCCCEEE--CCCCCCC-CCCCCCCCC Q ss_conf 043101234422125689999861-7986175--0465533-210000000 Q gi|254780402|r 209 GNLKIDTESLPCDKELLSLYQESI-AGRYTWA--AISTFEG-EEDKAVYVH 255 (440) Q Consensus 209 GnlK~d~~~~~~~~~~~~~~~~~~-~~r~v~v--agSth~~-EE~~il~a~ 255 (440) |++= +. .-..+...+ .+|-++- =|||++- |+.+.++-| T Consensus 232 G~~N-------vG--KSSLLNa~l~~DrAiVS~~kGtTRD~vE~~~~L~G~ 273 (473) T TIGR00450 232 GKPN-------VG--KSSLLNALLKQDRAIVSDIKGTTRDVVEGDFELNGI 273 (473) T ss_pred CCCC-------CC--HHHHHHHHHHCCCEEEECCCCCCCCEEEEEEEECCE T ss_conf 4788-------75--789999876228705527668832044205777467 No 170 >PRK12311 rpsB 30S ribosomal protein S2; Provisional Probab=51.61 E-value=17 Score=16.34 Aligned_cols=28 Identities=18% Similarity=0.349 Sum_probs=13.3 Q ss_pred CEEEEEECCCCHHHHHHHHHHCCCCEEE Q ss_conf 2035640476214678887422455058 Q gi|254780402|r 129 PDCMILSESDIWPLTVFELSKQRIPQVL 156 (440) Q Consensus 129 P~~~i~~e~ElWPnli~~~~~~~ip~~l 156 (440) |++++++...-=.+-+.+|++.|||++= T Consensus 160 PDlvfViD~~kE~iAV~EA~kLgIPvIa 187 (332) T PRK12311 160 PDLIFVIDTNKEDIAIQEAQRLGIPVAA 187 (332) T ss_pred CCEEEEECCCCCHHHHHHHHHCCCCEEE T ss_conf 9989996897338999999980979899 No 171 >pfam01488 Shikimate_DH Shikimate / quinate 5-dehydrogenase. This family contains both shikimate and quinate dehydrogenases. Shikimate 5-dehydrogenase catalyses the conversion of shikimate to 5-dehydroshikimate. This reaction is part of the shikimate pathway which is involved in the biosynthesis of aromatic amino acids. Quinate 5-dehydrogenase catalyses the conversion of quinate to 5-dehydroquinate. This reaction is part of the quinate pathway where quinic acid is exploited as a source of carbon in prokaryotes and microbial eukaryotes. Both the shikimate and quinate pathways share two common pathway metabolites 3-dehydroquinate and dehydroshikimate. Probab=51.51 E-value=12 Score=17.30 Aligned_cols=49 Identities=10% Similarity=0.145 Sum_probs=27.3 Q ss_pred HCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHH Q ss_conf 617986175046553321000000000012341248897055211478887765 Q gi|254780402|r 231 SIAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLI 284 (440) Q Consensus 231 ~~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~ 284 (440) .+.++++++.|. ||-... +.+.|....--.+.++-|.++|++++.+.+. T Consensus 9 ~l~~~~vlVIGa---G~~~~~--~~~~L~~~g~~~i~v~nRt~~ka~~la~~~~ 57 (134) T pfam01488 9 DLKGKKVLLIGA---GEMARL--AAKHLLSKGAKKITIANRTLEKAKELAEEFG 57 (134) T ss_pred CCCCCEEEEECC---CHHHHH--HHHHHHHCCCCEEEEECCCHHHHHHHHHHCC T ss_conf 814898999996---099999--9999997599889995475789999999849 No 172 >COG1448 TyrB Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism] Probab=51.44 E-value=17 Score=16.36 Aligned_cols=37 Identities=16% Similarity=0.313 Sum_probs=31.7 Q ss_pred HHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHCCCCE Q ss_conf 1344557442020356404762146788874224550 Q gi|254780402|r 118 PAVSRFLKYWKPDCMILSESDIWPLTVFELSKQRIPQ 154 (440) Q Consensus 118 ~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~ 154 (440) ...-+|++++.|+..||+...-|||....-...|.++ T Consensus 108 ~~~A~fl~~~~~~~~vwis~PtW~NH~~If~~aGl~v 144 (396) T COG1448 108 RVAADFLARFFPDATVWISDPTWPNHKAIFEAAGLEV 144 (396) T ss_pred HHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHCCCCE T ss_conf 9999999984888628847998275899999669814 No 173 >PRK08057 cobalt-precorrin-6x reductase; Reviewed Probab=51.40 E-value=17 Score=16.32 Aligned_cols=125 Identities=10% Similarity=0.002 Sum_probs=66.8 Q ss_pred EEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEE-- Q ss_conf 799981789999999999999986199899996277630012101477416885055720134455744202035640-- Q gi|254780402|r 58 LIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILS-- 135 (440) Q Consensus 58 ~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~-- 135 (440) .||+=+-. +|. +.|++.|. +.++.+|+.+.+.. .. ...+.+..=.+........|++..+.+.+|=. T Consensus 3 ~IlilgGT-~Eg---r~la~~L~--~~~~~~s~ag~~~~---~~--~~~~~i~~G~~~~~~~l~~~l~~~~i~~VIDATH 71 (241) T PRK08057 3 RILLLGGT-SEA---RALARALA--PDDTVTSLAGRTLK---PA--DLPGPVRVGGFGGAEGLAAYLREEGIDLVVDATH 71 (241) T ss_pred EEEEEECH-HHH---HHHHHHHH--CCCEEEEEEEECCC---CC--CCCCCEEECCCCCHHHHHHHHHHCCCCEEEECCC T ss_conf 59999670-899---99999974--89989998530255---65--6887679888899999999999679989998999 Q ss_pred --CCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCC-CEEEECCHHHHHHHHHHCCCCEEE-EC Q ss_conf --476214678887422455058641266542121001111233102644-236641577888998608850587-20 Q gi|254780402|r 136 --ESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQF-SLVIVQSERYFRRYKELGAQKLIV-SG 209 (440) Q Consensus 136 --e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~-~~i~~qs~~~~~rl~~lG~~~i~v-~G 209 (440) -.++=-|....|+..|||.+=.. | -.| ..... ..+.+.|-+++..+.....++|.. +| T Consensus 72 PfA~~is~~a~~a~~~~~ipylR~e-R--------p~~-------~~~~~d~~~~v~s~~ea~~~l~~~~~~vllttG 133 (241) T PRK08057 72 PYAAQISANAAAACRALGIPYLRLE-R--------PPW-------QPQPGDDWIEVDDIAAAAEALAPFGRRVLLTTG 133 (241) T ss_pred CCHHHHHHHHHHHHHHHCCEEEEEE-C--------CCC-------CCCCCCCEEEECCHHHHHHHHHHCCCCEEEECC T ss_conf 7089999999999998697079996-7--------865-------678887789979999999987643885798316 No 174 >TIGR02002 PTS-II-BC-glcB PTS system, glucose-specific IIBC component; InterPro: IPR011299 This entry represents the combined B and C domains of the PTS transport system enzyme II specific for glucose transport . Many of the genes in this family also include an A domain as part of the same polypeptide and thus should be given the name 'PTS system, glucose-specific IIABC component' while the Bacillus subtilus enzyme also contains an enzyme III domain which appears to act independently of the enzyme II domains . This group is most closely related to the N-acetylglucosamine-specific PTS enzymes (IPR010974 from INTERPRO).; GO: 0005355 glucose transmembrane transporter activity, 0015758 glucose transport, 0016021 integral to membrane. Probab=51.39 E-value=9.6 Score=18.03 Aligned_cols=22 Identities=32% Similarity=0.409 Sum_probs=12.7 Q ss_pred CCHHHHHHHHHHCCCCEEEECC Q ss_conf 1577888998608850587204 Q gi|254780402|r 189 QSERYFRRYKELGAQKLIVSGN 210 (440) Q Consensus 189 qs~~~~~rl~~lG~~~i~v~Gn 210 (440) ++.-|++||++||+.-|.|.|| T Consensus 474 ~~~Vd~~~LK~LGA~GVLvvGn 495 (518) T TIGR02002 474 IKKVDKAKLKKLGAAGVLVVGN 495 (518) T ss_pred HCCCCHHHHHHCCCCCEEEECC T ss_conf 0013436665216771388868 No 175 >TIGR03531 selenium_SpcS O-phosphoseryl-tRNA(Sec) selenium transferase. In the archaea and eukaryotes, the conversion of the mischarged serine to selenocysteine (Sec) on its tRNA is accomplished in two steps. This enzyme, O-phosphoseryl-tRNA(Sec) selenium transferase, acts second, after a phosphophorylation step catalyzed by a homolog of the bacterial SelA protein. Probab=51.35 E-value=7.3 Score=18.82 Aligned_cols=73 Identities=12% Similarity=0.154 Sum_probs=39.0 Q ss_pred EEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCC Q ss_conf 97055211478887765300101100134444466416750676520255422082066135544454583768961994 Q gi|254780402|r 268 IVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCA 347 (440) Q Consensus 268 IvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~p 347 (440) -+||-|++..+|.++|++.+.....-...... .+ ..+..+...++.+-|-++=-|- -.|++=|. |=+ T Consensus 216 FAPR~~D~i~eiAklC~~~~IpHvVNNAYGlQ--~~-----~~~~~i~~A~r~GRVDa~VQSt----DKNflVPV--GGa 282 (444) T TIGR03531 216 FAPRSPDDIEEIAKICAKYDIPHVVNNAYGLQ--SN-----KYMEKINKAIKVGRVDAVVSST----DKNFMVPV--GGA 282 (444) T ss_pred CCCCCCCCHHHHHHHHHHCCCCEEECCHHHHH--HH-----HHHHHHHHHHHCCCCEEEEEEC----CCCEEEEC--CCE T ss_conf 59999766999999999749983784203661--79-----9999999998649831999706----86625434--874 Q ss_pred EEECCC Q ss_conf 999898 Q gi|254780402|r 348 ILSGPN 353 (440) Q Consensus 348 Vi~GP~ 353 (440) ||.|++ T Consensus 283 II~s~d 288 (444) T TIGR03531 283 IIYSFD 288 (444) T ss_pred EEEECC T ss_conf 897188 No 176 >COG1736 DPH2 Diphthamide synthase subunit DPH2 [Translation, ribosomal structure and biogenesis] Probab=50.93 E-value=18 Score=16.27 Aligned_cols=37 Identities=11% Similarity=0.165 Sum_probs=21.7 Q ss_pred EEEEECCHHHHHHHHHHHHHHHHC-CCCEEEEECCCCCC Q ss_conf 999817899999999999999861-99899996277630 Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAIRSR-HVNVLLTTMTATSA 96 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l~~~-~~~iliT~~T~tg~ 96 (440) |-+.+-- |=-.-+..+++.|.+. +.+++|+..+.=|+ T Consensus 27 V~LQ~Pe-GLk~~~~~ia~~le~~~~~~v~i~gd~~yGA 64 (347) T COG1736 27 VLLQFPE-GLKRYAIEIADILEANLGAEVIISGDPVYGA 64 (347) T ss_pred EEEECCH-HHHHHHHHHHHHHHHHHCCEEEEECCCCCCC T ss_conf 8998886-7877789999999885295599967764244 No 177 >PRK05333 NAD-dependent deacetylase; Provisional Probab=50.35 E-value=18 Score=16.22 Aligned_cols=34 Identities=24% Similarity=0.211 Sum_probs=19.9 Q ss_pred HHHHCCCEEE-ECCCCCC-CCCCHHHHHHHCCCEEE Q ss_conf 5422082066-1355444-54583768961994999 Q gi|254780402|r 317 YLRMTEIAFI-GRSFCAS-GGQNPLEAAMLGCAILS 350 (440) Q Consensus 317 lY~~AdiafV-GGSl~~~-GGhN~lEpaa~G~pVi~ 350 (440) ...-||+.+| |-|+.-. ...=|..+...|.+|+. T Consensus 211 ~~~~aDlllviGTSl~V~PA~~l~~~a~~~g~~vvi 246 (285) T PRK05333 211 ALDAADAVLVVGSSLMVYSGYRFCVWAARQGKPIAA 246 (285) T ss_pred HHHCCCEEEEECCCCCCCCHHHHHHHHHHCCCEEEE T ss_conf 986299999978784225213689999977992999 No 178 >cd00316 Oxidoreductase_nitrogenase The nitrogenase enzyme system catalyzes the ATP-dependent reduction of dinitrogen to ammonia. This group contains both alpha and beta subunits of component 1 of the three known genetically distinct types of nitrogenase systems: a molybdenum-dependent nitrogenase (Mo-nitrogenase), a vanadium-dependent nitrogenase (V-nitrogenase), and an iron-only nitrogenase (Fe-nitrogenase) and, both subunits of Protochlorophyllide (Pchlide) reductase and chlorophyllide (chlide) reductase. The nitrogenase systems consist of component 1 (MoFe protein, VFe protein or, FeFe protein respectively) and, component 2 (Fe protein). The most widespread and best characterized nitrogenase is the Mo-nitrogenase. MoFe is an alpha2beta2 tetramer, the alternative nitrogenases are alpha2beta2delta2 hexamers whose alpha and beta subunits are similar to the alpha and beta subunits of MoFe. For MoFe, each alphabeta pair contains one P-cluster (at the alphabeta interface) and, one molec Probab=49.87 E-value=18 Score=16.17 Aligned_cols=20 Identities=20% Similarity=0.290 Sum_probs=11.1 Q ss_pred HHHHHHHHHHHHHHHCCCCE Q ss_conf 99999999999998619989 Q gi|254780402|r 67 GETMALIGLIPAIRSRHVNV 86 (440) Q Consensus 67 GE~~~a~pli~~l~~~~~~i 86 (440) |+-+....+.+...+.+|++ T Consensus 64 g~ekl~~~i~~~~~~~~p~~ 83 (399) T cd00316 64 GGEKLLEAIINELKRYKPKV 83 (399) T ss_pred CHHHHHHHHHHHHHHCCCCE T ss_conf 78999999999987439988 No 179 >cd01144 BtuF Cobalamin binding protein BtuF. These proteins have been shown to function as initial receptors in ABC transport of vitamin B12 (cobalamin) in eubacterial and some archaeal species. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence). Probab=49.59 E-value=19 Score=16.14 Aligned_cols=38 Identities=21% Similarity=0.153 Sum_probs=27.2 Q ss_pred HHHHHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEEC Q ss_conf 57442020356404762146788874224550586412 Q gi|254780402|r 123 FLKYWKPDCMILSESDIWPLTVFELSKQRIPQVLVNAR 160 (440) Q Consensus 123 fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~~linar 160 (440) -+-..+||++|.....-=+..+..+++.|||+++.+.. T Consensus 52 ~I~~L~PDLVi~~~~~~~~~~~~~l~~~gi~v~~~~~~ 89 (245) T cd01144 52 RVLALKPDLVIAWDDCNVCAVVDQLRAAGIPVLVSEPQ 89 (245) T ss_pred HHHCCCCCEEEEECCCCHHHHHHHHHCCCCEEEEECCC T ss_conf 99525996477426877688999876049769984899 No 180 >PRK11613 folP dihydropteroate synthase; Provisional Probab=49.36 E-value=19 Score=16.12 Aligned_cols=20 Identities=15% Similarity=0.306 Sum_probs=8.7 Q ss_pred HHHHHHHHHCCCCEEEEEEC Q ss_conf 46788874224550586412 Q gi|254780402|r 141 PLTVFELSKQRIPQVLVNAR 160 (440) Q Consensus 141 Pnli~~~~~~~ip~~linar 160 (440) |.++..+++.++|+++.-.+ T Consensus 123 ~~~~~~va~~~~~~vlmH~~ 142 (282) T PRK11613 123 PGALEAAAETGLPVCLMHMQ 142 (282) T ss_pred HHHHHHHHHCCCCEEEEECC T ss_conf 65999999729988998068 No 181 >smart00672 CAP10 Putative lipopolysaccharide-modifying enzyme. Probab=49.06 E-value=19 Score=16.09 Aligned_cols=65 Identities=8% Similarity=0.026 Sum_probs=42.4 Q ss_pred HHHCCCEEECCCCCCHHHHHHHHHHCC-CEEEEC-CH--HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC Q ss_conf 961994999898443799999999789-869957-98--99999999995898999999999999997045 Q gi|254780402|r 342 AMLGCAILSGPNVENFRDIYRRMVSSG-AVRIVE-EV--GTLADMVYSLLSEPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 342 aa~G~pVi~GP~~~nf~e~~~~L~~~g-~~~~v~-d~--~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ 408 (440) +++|+.|+.=. ..+.|-+..+..-+ --+.|+ |. ++|.+.+..+.++++.+++++++|+++++++- T Consensus 165 l~~~Svvl~~~--~~~~e~f~~~L~P~~HYvPv~~d~sd~~l~~~i~w~~~~d~~a~~Ia~~~~~f~~~~L 233 (256) T smart00672 165 LACDSVVLKVK--PEYYEFFSRGLQPWVHYWPIKSDLSCRELKEAVDWGNEHDKKAQEIGKRGSEFIQQNL 233 (256) T ss_pred HHCCCEEEEEC--CHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC T ss_conf 72684699628--4189998701265421788666875666999999986098999999999999999974 No 182 >COG0379 NadA Quinolinate synthase [Coenzyme metabolism] Probab=48.43 E-value=19 Score=16.02 Aligned_cols=162 Identities=13% Similarity=0.095 Sum_probs=84.7 Q ss_pred EEEECCHHHHHHHHH-HCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEEC-C--CCCCCCCCCCCCC-CCCCC Q ss_conf 366415778889986-08850587204310123442212568999986179861750-4--6553321000000-00001 Q gi|254780402|r 185 LVIVQSERYFRRYKE-LGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAA-I--STFEGEEDKAVYV-HNFIK 259 (440) Q Consensus 185 ~i~~qs~~~~~rl~~-lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~va-g--Sth~~EE~~il~a-~~~l~ 259 (440) -+.+-|....+-..+ ++.++|... .|+.......+..+.+.+++. | +.|+ .+-.+- .+.-+ T Consensus 141 Di~cTSsNAvkvVe~~~~~~~Iif~-----------PD~~Lg~yva~~tg~k~ii~w~G~C~VH~---~ft~~~i~~~k~ 206 (324) T COG0379 141 DICCTSSNAVKVVESALDGDKILFL-----------PDKNLGRYVAKQTGAKKIILWPGHCIVHE---EFTAEDIEELKE 206 (324) T ss_pred CEEEECCCHHHHHHHCCCCCCEEEC-----------CCHHHHHHHHHHCCCCCEEEECCCCCHHH---HCCHHHHHHHHH T ss_conf 9389463099999721579838983-----------70779999999759982898787410101---108999999998 Q ss_pred CCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCC-CEEEECCCCCCCCCCH Q ss_conf 234124889705521147888776530010110013444446641675067652025542208-2066135544454583 Q gi|254780402|r 260 CRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTE-IAFIGRSFCASGGQNP 338 (440) Q Consensus 260 ~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~Ad-iafVGGSl~~~GGhN~ 338 (440) .+|+..++. |||--.++.+...-.|-+.. ..+.-...+...++|+-..|....+.+.++ .-|+. T Consensus 207 ~~Pda~vlv---HPEC~~~Vv~~AD~vGST~~-ii~~~~~~~~~~~iv~TE~g~~~~l~~~~P~k~~~~----------- 271 (324) T COG0379 207 KYPDAEVLV---HPECPPEVVELADFVGSTSQ-IIKAVKASPAQKFIVGTERGIVHRLQKEAPDKEFIP----------- 271 (324) T ss_pred HCCCCEEEE---CCCCCHHHHHHCCCCCCHHH-HHHHHHCCCCCEEEEEECHHHHHHHHHHCCCCEEEC----------- T ss_conf 789987997---89999889975545466899-999986299764999964789999999789973881----------- Q ss_pred HHHHHHCCCEEECCCCCC--HHHHHHHHHHCCC-EEEECCHHHHHHH Q ss_conf 768961994999898443--7999999997898-6995798999999 Q gi|254780402|r 339 LEAAMLGCAILSGPNVEN--FRDIYRRMVSSGA-VRIVEEVGTLADM 382 (440) Q Consensus 339 lEpaa~G~pVi~GP~~~n--f~e~~~~L~~~g~-~~~v~d~~eL~~~ 382 (440) +--.|+ +=|+|.. -..++..|.+.+. -+.| ++++++. T Consensus 272 --~~~~~~---~C~~Mk~itL~~i~~~L~~~~~~eV~V--~~~i~~~ 311 (324) T COG0379 272 --LPTAGA---VCPTMKMITLEKILEALEEGGNNEVTV--DEEIAER 311 (324) T ss_pred --CCCCCC---CCHHHHHHCHHHHHHHHHHCCCCEEEE--CHHHHHH T ss_conf --688778---585456507999999999578846982--9999999 No 183 >cd01539 PBP1_GGBP Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. Periplasmic glucose/galactose-binding protein (GGBP) involved in chemotaxis towards, and active transport of, glucose and galactose in various bacterial species. GGBP is a member of the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. Moreover, the periplasmic GGBP is homologous to the ligand-binding domain of eukaryotic receptors such as glutamate receptor (GluR) and DNA-binding transcriptional repressors such as LacI and GalR. Probab=47.93 E-value=20 Score=15.97 Aligned_cols=49 Identities=12% Similarity=0.242 Sum_probs=34.8 Q ss_pred HHHHHHHHCCEEEEEE--CCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCC Q ss_conf 4455744202035640--4762146788874224550586412665421210 Q gi|254780402|r 120 VSRFLKYWKPDCMILS--ESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNW 169 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~--e~ElWPnli~~~~~~~ip~~linarls~~S~~~~ 169 (440) +..|+.. +++.+++. +++-+...+..++..|||++.+|-...+.....| T Consensus 50 Ie~~I~q-gvD~Iii~p~d~~~~~~~i~~A~~agIPVI~~d~~~~~~~~~~~ 100 (303) T cd01539 50 IDTALAK-GVDLLAVNLVDPTAAQTVINKAKQKNIPVIFFNREPEEEDIKSY 100 (303) T ss_pred HHHHHHC-CCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCCCCHHHCCCC T ss_conf 9999975-99999994687334899999999669849998787661220345 No 184 >cd02509 GDP-M1P_Guanylyltransferase GDP-M1P_Guanylyltransferase catalyzes the formation of GDP-Mannose. GDP-mannose-1-phosphate guanylyltransferase, also called GDP-mannose pyrophosphorylase (GDP-MP), catalyzes the formation of GDP-Mannose from mannose-1-phosphate and GTP. Mannose is a key monosaccharide for glycosylation of proteins and lipids. GDP-Mannose is the activated donor for mannosylation of various biomolecules. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase and mannose-1-phosphate guanylyltransferase. This CD covers the N-terminal GDP-mannose-1-phosphate guanylyltransferase domain, whereas the isomerase function is located at the C-terminal half. GDP-MP is a member of the nucleotidyltransferase family of enzymes. Probab=47.56 E-value=20 Score=15.94 Aligned_cols=117 Identities=15% Similarity=0.150 Sum_probs=53.4 Q ss_pred CCEEEEEE---CCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHH-CCCCEEEECCCCCCCCCCCCCHHHHHH Q ss_conf 55058641---26654212100111123310264423664157788899860-885058720431012344221256899 Q gi|254780402|r 152 IPQVLVNA---RMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKEL-GAQKLIVSGNLKIDTESLPCDKELLSL 227 (440) Q Consensus 152 ip~~lina---rls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~l-G~~~i~v~GnlK~d~~~~~~~~~~~~~ 227 (440) +|++|..| |+.+.|-+.+ |+-+-.+++.- ..=+.+.+|+..+ +.+++.|+-|-++- ...... T Consensus 2 ~~VIlaGG~GtRLWPlSr~~~---PKQf~~~~~~~----sLlq~T~~R~~~~~~~~~i~IvTn~~~~-------~~v~~q 67 (274) T cd02509 2 YPVILAGGSGTRLWPLSRESY---PKQFLKLFGDK----SLLQQTLDRLKGLVPPDRILVVTNEEYR-------FLVREQ 67 (274) T ss_pred EEEEECCCCCCCCCCCCCCCC---CCCCCCCCCCC----CHHHHHHHHHHCCCCCCCCEEEECCHHH-------HHHHHH T ss_conf 899947986044687558889---97752179999----8999999987468986780999572179-------999999 Q ss_pred HHHHCCCCCEEECCCCCCCCC-CCCCCCCCCCCCCCCEEEEEEEC--CCCHHHHHHHHH Q ss_conf 998617986175046553321-00000000001234124889705--521147888776 Q gi|254780402|r 228 YQESIAGRYTWAAISTFEGEE-DKAVYVHNFIKCRTDVLTIIVPR--HPRRCDAIERRL 283 (440) Q Consensus 228 ~~~~~~~r~v~vagSth~~EE-~~il~a~~~l~~~~~~~lIIvPR--hpeR~~~i~~~l 283 (440) +.+......+++= -...+-- .+.+.+....++.|+..+++.|- .++..+...+.+ T Consensus 68 l~~~~~~~~ii~E-P~~rnTApAI~laa~~~~~~~~d~~l~v~PsDH~I~d~~~F~~~i 125 (274) T cd02509 68 LPEGLPEENIILE-PEGRNTAPAIALAALYLAKRDPDAVLLVLPSDHLIEDVEAFLKAV 125 (274) T ss_pred HHHCCCCCCEEEC-CCCCCCHHHHHHHHHHHHHHCCCCEEEEEECCCCCCCHHHHHHHH T ss_conf 8843986648967-688897999999999999759997799984740200889999999 No 185 >PRK12555 chemotaxis-specific methylesterase; Provisional Probab=47.40 E-value=20 Score=15.92 Aligned_cols=138 Identities=17% Similarity=0.129 Sum_probs=64.8 Q ss_pred CCEEECC-CCCCCCCCCCCCCCCCCCC-CCCEEEEEEECCCCH-HHHHHHHHHH-HCCHHHHHCCCCCCCCCCEEEEECC Q ss_conf 8617504-6553321000000000012-341248897055211-4788877653-0010110013444446641675067 Q gi|254780402|r 235 RYTWAAI-STFEGEEDKAVYVHNFIKC-RTDVLTIIVPRHPRR-CDAIERRLIA-KGLKVARRSRGDVINAEVDIFLGDT 310 (440) Q Consensus 235 r~v~vag-Sth~~EE~~il~a~~~l~~-~~~~~lIIvPRhpeR-~~~i~~~l~~-~gl~~~~~S~~~~~~~~~~V~i~Dt 310 (440) .++++.| ||--.+ .+-.....+.. .| . ++||-|=|.. ...+.+++.+ ..+.+..-++++... ...|||+-. T Consensus 154 ~~vv~IGaSTGGp~--aL~~il~~LP~~~p-~-ivivQHmp~~F~~~la~~L~~~~~l~V~~a~~g~~l~-~G~vyiapg 228 (340) T PRK12555 154 EKLVAIGASAGGTE--ALAALLRGLPADFP-A-VVIVQHVPAAFAAGFAERLDGLTRLPVREARDGDRPL-PGHALLAPG 228 (340) T ss_pred CCEEEEEECCCCHH--HHHHHHHHCCCCCC-C-EEEEECCCCCHHHHHHHHHHHHCCCCEEECCCCCEEE-CCEEEECCC T ss_conf 87899996889999--99999986786689-6-8999768986768999998604688559947998740-897998389 Q ss_pred CCCCHHHHHHCCCEEEECCC-CCCCCCCH-----HHHHH--HCCC----EEECCCCCCHHHHHHHHHHCCCEEEECCHHH Q ss_conf 65202554220820661355-44454583-----76896--1994----9998984437999999997898699579899 Q gi|254780402|r 311 IGEMGFYLRMTEIAFIGRSF-CASGGQNP-----LEAAM--LGCA----ILSGPNVENFRDIYRRMVSSGAVRIVEEVGT 378 (440) Q Consensus 311 ~GeL~~lY~~AdiafVGGSl-~~~GGhN~-----lEpaa--~G~p----Vi~GP~~~nf~e~~~~L~~~g~~~~v~d~~e 378 (440) =-.|.--..-++ .++.=+- .+..+|.| ++.++ +|.- |++|=.-+.-.. ...+.++|+...+.|++. T Consensus 229 ~~hl~v~~~g~~-~~~~l~~~~~~~~~rPsvD~lf~S~a~~~~~~~igviLTGMG~DGa~G-~~~i~~~Gg~tiaQd~~s 306 (340) T PRK12555 229 DDHLRLTRRGAN-YRLRYTRGPPVTRHRPSVDVLFRSVADHWGGNAIGVLLTGMGRDGARG-LKAMRQAGAYTIAQDEAT 306 (340) T ss_pred CCEEEEEECCCE-EEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHH-HHHHHHCCCEEEEECHHH T ss_conf 976999966970-799956787323889986699999999708877999903686408999-999998799799989255 Q ss_pred H Q ss_conf 9 Q gi|254780402|r 379 L 379 (440) Q Consensus 379 L 379 (440) - T Consensus 307 ~ 307 (340) T PRK12555 307 S 307 (340) T ss_pred C T ss_conf 3 No 186 >cd06270 PBP1_GalS_like Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Ligand binding domain of DNA transcription iso-repressor GalS, which is one of two regulatory proteins involved in galactose transport and metabolism. Transcription of the galactose regulon genes is regulated by Gal iso-repressor (GalS) and Gal repressor (GalR) in different ways, but both repressors recognize the same DNA binding site in the absence of D-galactose. GalS is a dimeric protein like GalR,and its major role is in regulating expression of the high-affinity galactose transporter encoded by the mgl operon, whereas GalR is the exclusive regulator of galactose permease, the low-affinity galactose transporter. GalS and GalR are members of the LacI-GalR family of transcription regulators and both contain the type I periplasmic binding protein-like fold. Hence, they are homologous to the periplasmic sugar bindi Probab=47.37 E-value=20 Score=15.92 Aligned_cols=106 Identities=23% Similarity=0.264 Sum_probs=60.9 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHC Q ss_conf 99999999986199899996277630012101477416885055720134455744202035640476214678887422 Q gi|254780402|r 71 ALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQ 150 (440) Q Consensus 71 ~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~ 150 (440) .+..+-+.+.+.+.++++..+... .+. +. ..++ .+...+++.+|+...+.=...+..+.+. T Consensus 17 i~~~ie~~~~~~gy~~ll~~s~~~-~~~--e~---------------~~l~-~l~~~~vdGiIi~~~~~~~~~~~~l~~~ 77 (268) T cd06270 17 LLSGVESVARKAGKHLIITAGHHS-AEK--ER---------------EAIE-FLLERRCDALILHSKALSDDELIELAAQ 77 (268) T ss_pred HHHHHHHHHHHCCCEEEEEECCCC-HHH--HH---------------HHHH-HHHHCCCCEEEEECCCCCHHHHHHHHHC T ss_conf 999999999985999999958999-899--99---------------9999-9996599999995277998999999964 Q ss_pred CCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHHHHHHHCCCCEEEE-CCCC Q ss_conf 455058641266542121001111233102644236641----5778889986088505872-0431 Q gi|254780402|r 151 RIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFRRYKELGAQKLIVS-GNLK 212 (440) Q Consensus 151 ~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~rl~~lG~~~i~v~-GnlK 212 (440) ++|+++++..+.+... +.|.+- .....++|.+.|-+++.+. |+.. T Consensus 78 ~~PvV~i~~~~~~~~~-----------------~~V~~Dn~~~~~~a~~~L~~~G~~~i~~i~~~~~ 127 (268) T cd06270 78 VPPLVLINRHIPGLAD-----------------RCIWLDNEQGGYLATEHLIELGHRKIACITGPLT 127 (268) T ss_pred CCCEEEECCCCCCCCC-----------------CEEEECHHHHHHHHHHHHHHCCCCCEEEECCCCC T ss_conf 9989998676899989-----------------8898388999999999999839983898459888 No 187 >TIGR02858 spore_III_AA stage III sporulation protein AA; InterPro: IPR014217 Proteins in this entry include the stage III sporulation protein AA that is encoded by one of several genes in the spoIIIA locus. This protein is only found in species that are capable of endospore formation.. Probab=47.35 E-value=16 Score=16.53 Aligned_cols=52 Identities=19% Similarity=0.365 Sum_probs=35.6 Q ss_pred CCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHH-----HHHHHHCCCEE Q ss_conf 64167506765202554220820661355444545837689619949998984437999-----99999789869 Q gi|254780402|r 302 EVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDI-----YRRMVSSGAVR 371 (440) Q Consensus 302 ~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~-----~~~L~~~g~~~ 371 (440) ..||+++|.||-.-+. --++||+-.|+.||+--|=.+..|. ..+|.+.++.. T Consensus 206 SP~Viv~DEIGr~ED~------------------~Al~eA~naGV~~I~TaHg~~~~Dl~kRP~fk~l~e~~~Fe 262 (282) T TIGR02858 206 SPDVIVVDEIGREEDV------------------EALLEALNAGVSVIATAHGRDLEDLKKRPVFKELLEQKAFE 262 (282) T ss_pred CCCEEEEECCCCHHHH------------------HHHHHHHCCCCEEEEEECCCCHHHHHCCHHHHHHHHHCCCE T ss_conf 9857998148895338------------------99999861675688764048812665076679999724314 No 188 >PRK05124 cysN sulfate adenylyltransferase subunit 1; Provisional Probab=47.21 E-value=20 Score=15.90 Aligned_cols=25 Identities=16% Similarity=0.084 Sum_probs=22.0 Q ss_pred CCHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 6214678887422455058641266 Q gi|254780402|r 138 DIWPLTVFELSKQRIPQVLVNARMS 162 (440) Q Consensus 138 ElWPnli~~~~~~~ip~~linarls 162 (440) ++=|||+..+.+..+-+++|+|+-. T Consensus 119 ~f~~NMitGas~aD~aiLvVdA~~G 143 (475) T PRK05124 119 QYTRNMATGASTCDLAILLIDARKG 143 (475) T ss_pred HHHHHHHHHHHHCCEEEEEEECCCC T ss_conf 7788898888767889999989889 No 189 >pfam00290 Trp_syntA Tryptophan synthase alpha chain. Probab=46.88 E-value=20 Score=15.87 Aligned_cols=157 Identities=11% Similarity=0.119 Sum_probs=75.4 Q ss_pred CCCCEEECCCC----CCCCCCCCCCCCCCCCCCCCEEEEEEECCC-CHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEE Q ss_conf 79861750465----533210000000000123412488970552-1147888776530010110013444446641675 Q gi|254780402|r 233 AGRYTWAAIST----FEGEEDKAVYVHNFIKCRTDVLTIIVPRHP-RRCDAIERRLIAKGLKVARRSRGDVINAEVDIFL 307 (440) Q Consensus 233 ~~r~v~vagSt----h~~EE~~il~a~~~l~~~~~~~lIIvPRhp-eR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i 307 (440) .+.|+++.+-. +.|.|.++-.+. ....--+|+|.=| |..+++.+.+++.|+.....-.-..+.. T Consensus 85 ~~~pivlM~Y~N~i~~~G~e~F~~~~~-----~~GvdGvIipDLP~eE~~~~~~~~~~~~l~~I~lvsPtt~~~------ 153 (258) T pfam00290 85 TSVPIVLMTYYNPVLNYGIERFYAQAA-----EAGVDGLIIPDLPPEEADPLREAAEKHGIDLIFLVAPTTSDE------ 153 (258) T ss_pred CCCCEEEEEECHHHHHCCHHHHHHHHH-----HCCCCEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHH------ T ss_conf 998889985208898729999999999-----759977870799988999999999845843588845888199------ Q ss_pred ECCCCCCHHHHHHCC-CEEEECCCCCCCCCCH--------HHHH--HHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCH Q ss_conf 067652025542208-2066135544454583--------7689--6199499989844379999999978986995798 Q gi|254780402|r 308 GDTIGEMGFYLRMTE-IAFIGRSFCASGGQNP--------LEAA--MLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEV 376 (440) Q Consensus 308 ~Dt~GeL~~lY~~Ad-iafVGGSl~~~GGhN~--------lEpa--a~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~ 376 (440) -+..+-+.|+ .+|+-.+..-+|..+. ++-. ...+||+.|=-+++=.++ +.+.+.--+.+|. T Consensus 154 -----Ri~~i~~~s~gFiY~vs~~GvTG~~~~~~~~~~~~i~~ik~~t~~Pv~vGFGIs~~e~v-~~~~~~aDGvIVG-- 225 (258) T pfam00290 154 -----RLKTISEAASGFVYLVSRAGVTGARNAFNAQLDELVERLKKYTNVPVAVGFGISTPEHV-KKIAAGADGVIVG-- 225 (258) T ss_pred -----HHHHHHHHCCCEEEEEECCCCCCCCCCCCHHHHHHHHHHHHCCCCCEEEEECCCCHHHH-HHHHCCCCEEEEC-- T ss_conf -----99999960898089985344567655563889999999986069984899457999999-9998159999984-- Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHH Q ss_conf 99999999995898999999999999997045569 Q gi|254780402|r 377 GTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPL 411 (440) Q Consensus 377 ~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~ 411 (440) ..+.+.+.+- .+..+.+.++-.+|+++.+.|+ T Consensus 226 Saiv~~i~~~---~~~~~~~~~~v~~fv~~lk~a~ 257 (258) T pfam00290 226 SAIVDIIEEN---LDDPEQMLAKLEEFVGKLKAAT 257 (258) T ss_pred HHHHHHHHHC---CCCHHHHHHHHHHHHHHHHHHH T ss_conf 9999999970---4068899999999999999963 No 190 >COG0135 TrpF Phosphoribosylanthranilate isomerase [Amino acid transport and metabolism] Probab=46.79 E-value=21 Score=15.86 Aligned_cols=150 Identities=13% Similarity=0.091 Sum_probs=75.0 Q ss_pred EEECCHHHHHHHHHHCCCCEEEECCCCCCC-CCCCCCHHHHHHHHHHCCC-CCEEECCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 664157788899860885058720431012-3442212568999986179-86175046553321000000000012341 Q gi|254780402|r 186 VIVQSERYFRRYKELGAQKLIVSGNLKIDT-ESLPCDKELLSLYQESIAG-RYTWAAISTFEGEEDKAVYVHNFIKCRTD 263 (440) Q Consensus 186 i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~-~~~~~~~~~~~~~~~~~~~-r~v~vagSth~~EE~~il~a~~~l~~~~~ 263 (440) +-..+.+|++...++|++-+=++ |+. .+-..+.+....+...... +.|-|+-..- .+.+.+....+ .++ T Consensus 7 CGlt~~eda~~a~~~gad~iG~I----f~~~SpR~Vs~~~a~~i~~~v~~~~~VgVf~n~~---~~~i~~i~~~~--~ld 77 (208) T COG0135 7 CGLTRLEDAKAAAKAGADYIGFI----FVPKSPRYVSPEQAREIASAVPKVKVVGVFVNES---IEEILEIAEEL--GLD 77 (208) T ss_pred CCCCCHHHHHHHHHCCCCEEEEE----ECCCCCCCCCHHHHHHHHHHCCCCCEEEEECCCC---HHHHHHHHHHC--CCC T ss_conf 78898999999997699889999----7479987179999999998278887799987999---89999999833--998 Q ss_pred EEEEEEECCCCHHHHHHHHHHHHCCHHH---HHCC------CCCCCCCCEEEEECCC--------CCCHHHHHHCCC--- Q ss_conf 2488970552114788877653001011---0013------4444466416750676--------520255422082--- Q gi|254780402|r 264 VLTIIVPRHPRRCDAIERRLIAKGLKVA---RRSR------GDVINAEVDIFLGDTI--------GEMGFYLRMTEI--- 323 (440) Q Consensus 264 ~~lIIvPRhpeR~~~i~~~l~~~gl~~~---~~S~------~~~~~~~~~V~i~Dt~--------GeL~~lY~~Adi--- 323 (440) ..=+=-+-.++.++++.... ++.+. ..+. ........+.++.|+. |....|=.+... T Consensus 78 ~VQlHG~e~~~~~~~l~~~~---~~~v~kai~v~~~~~~~~~~~~~~~~d~~LlDa~~~~~~GGtG~~fDW~~l~~~~~~ 154 (208) T COG0135 78 AVQLHGDEDPEYIDQLKEEL---GVPVIKAISVSEEGDLELAAREEGPVDAILLDAKVPGLPGGTGQTFDWNLLPKLRLS 154 (208) T ss_pred EEEECCCCCHHHHHHHHHHC---CCCEEEEEEECCCCCHHHHHHCCCCCCEEEECCCCCCCCCCCCCEECHHHCCCCCCC T ss_conf 89988989999999998534---875699998677622445544148774899818887778978867797770542456 Q ss_pred --EEEECCCCCCCCCCHHHHHHHCCCEEE Q ss_conf --066135544454583768961994999 Q gi|254780402|r 324 --AFIGRSFCASGGQNPLEAAMLGCAILS 350 (440) Q Consensus 324 --afVGGSl~~~GGhN~lEpaa~G~pVi~ 350 (440) +++.|++.+. |.-|++..++|-.. T Consensus 155 ~~~~LAGGL~p~---NV~~ai~~~~p~gv 180 (208) T COG0135 155 KPVMLAGGLNPD---NVAEAIALGPPYGV 180 (208) T ss_pred CCEEEECCCCHH---HHHHHHHHCCCCEE T ss_conf 987997788989---99999983598379 No 191 >COG0159 TrpA Tryptophan synthase alpha chain [Amino acid transport and metabolism] Probab=46.18 E-value=21 Score=15.80 Aligned_cols=154 Identities=14% Similarity=0.128 Sum_probs=80.2 Q ss_pred CCCCEEECC----CCCCCCCCCCCCCCCCCCCCCCEEEEEEECCC-CHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEE Q ss_conf 798617504----65533210000000000123412488970552-1147888776530010110013444446641675 Q gi|254780402|r 233 AGRYTWAAI----STFEGEEDKAVYVHNFIKCRTDVLTIIVPRHP-RRCDAIERRLIAKGLKVARRSRGDVINAEVDIFL 307 (440) Q Consensus 233 ~~r~v~vag----Sth~~EE~~il~a~~~l~~~~~~~lIIvPRhp-eR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i 307 (440) .+-|+.+.+ .-+.|.|.++-.+ + .....-+|+|.=| |..+++.+-+++.|+.....-.-..++. T Consensus 93 ~~~Pivlm~Y~Npi~~~Gie~F~~~~----~-~~GvdGlivpDLP~ee~~~~~~~~~~~gi~~I~lvaPtt~~~------ 161 (265) T COG0159 93 VKVPIVLMTYYNPIFNYGIEKFLRRA----K-EAGVDGLLVPDLPPEESDELLKAAEKHGIDPIFLVAPTTPDE------ 161 (265) T ss_pred CCCCEEEEEECCHHHHHHHHHHHHHH----H-HCCCCEEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCHH------ T ss_conf 99988998701188773599999999----9-759987985789866777899999976986798869999989------ Q ss_pred ECCCCCCHHHHHHC-CCEEEECCCCCCCCCCHHHHH----------HHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCH Q ss_conf 06765202554220-820661355444545837689----------6199499989844379999999978986995798 Q gi|254780402|r 308 GDTIGEMGFYLRMT-EIAFIGRSFCASGGQNPLEAA----------MLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEV 376 (440) Q Consensus 308 ~Dt~GeL~~lY~~A-diafVGGSl~~~GGhN~lEpa----------a~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~ 376 (440) -|...-+.| ..+|..+...-+|-.|+.... ..++||+.|=.+++-..+.+ +.+.--+..|. T Consensus 162 -----rl~~i~~~a~GFiY~vs~~GvTG~~~~~~~~~~~~v~~vr~~~~~Pv~vGFGIs~~e~~~~-v~~~ADGVIVG-- 233 (265) T COG0159 162 -----RLKKIAEAASGFIYYVSRMGVTGARNPVSADVKELVKRVRKYTDVPVLVGFGISSPEQAAQ-VAEAADGVIVG-- 233 (265) T ss_pred -----HHHHHHHHCCCCEEEEECCCCCCCCCCCCHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHH-HHHHCCEEEEC-- T ss_conf -----9999997479858999666666777653046999999999744897387448699999999-99768857973-- Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCH Q ss_conf 9999999999589899999999999999704556 Q gi|254780402|r 377 GTLADMVYSLLSEPTIRYEMINAAINEVKKMQGP 410 (440) Q Consensus 377 ~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga 410 (440) ..+.+.+.+-.. +++-+..++++++.+.+ T Consensus 234 SAiV~~i~~~~~-----~~~~~~~~~l~~~l~~~ 262 (265) T COG0159 234 SAIVKIIEEGLD-----EEALEELRALVKELKAA 262 (265) T ss_pred HHHHHHHHHCCC-----HHHHHHHHHHHHHHHHH T ss_conf 999999995551-----44699999999999987 No 192 >cd01968 Nitrogenase_NifE_I Nitrogenase_NifE_I: a subgroup of the NifE subunit of the NifEN complex: NifE forms an alpha2beta2 tetramer with NifN. NifE and NifN are structurally homologous to nitrogenase MoFe protein alpha and beta subunits respectively. NifEN participates in the synthesis of the iron-molybdenum cofactor (FeMoco) of the MoFe protein. NifB-co (an iron and sulfur containing precursor of the FeMoco) from NifB is transferred to the NifEN complex where it is further processed to FeMoco. The NifEN bound precursor of FeMoco has been identified as a molybdenum-free, iron- and sulfur- containing analog of FeMoco. It has been suggested that this NifEN bound precursor also acts as a cofactor precursor in nitrogenase systems which require a cofactor other than FeMoco: i.e. iron-vanadium cofactor (FeVco) or iron only cofactor (FeFeco). Probab=45.25 E-value=22 Score=15.71 Aligned_cols=40 Identities=15% Similarity=0.155 Sum_probs=19.4 Q ss_pred EEEECC-CCCCHHHHHHCCC-EEEECCCCCCCCCCHHHHHHHCCCEEE Q ss_conf 675067-6520255422082-066135544454583768961994999 Q gi|254780402|r 305 IFLGDT-IGEMGFYLRMTEI-AFIGRSFCASGGQNPLEAAMLGCAILS 350 (440) Q Consensus 305 V~i~Dt-~GeL~~lY~~Adi-afVGGSl~~~GGhN~lEpaa~G~pVi~ 350 (440) +++.|. .-|+..+-.-.+. .++|+|-. -.-+...|.|++- T Consensus 338 ~vi~d~d~~el~~~i~~~~~Dliig~s~~------~~~a~rlGiP~~~ 379 (410) T cd01968 338 VIVDDANPRELKKLLKEKKADLLVAGGKE------RYLALKLGIPFCD 379 (410) T ss_pred EEEECCCHHHHHHHHHHCCCCEEEECCCH------HHHHHHHCCCEEE T ss_conf 89848999999999865699999957732------4889980898898 No 193 >cd06310 PBP1_ABC_sugar_binding_like_2 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=44.74 E-value=22 Score=15.66 Aligned_cols=42 Identities=19% Similarity=0.360 Sum_probs=29.2 Q ss_pred HHHHHHHHCCEEEEEEC--CCCHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 44557442020356404--76214678887422455058641266 Q gi|254780402|r 120 VSRFLKYWKPDCMILSE--SDIWPLTVFELSKQRIPQVLVNARMS 162 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~e--~ElWPnli~~~~~~~ip~~linarls 162 (440) ++.++.. +++.+++.- .+-....+..+...|||++++|..+. T Consensus 50 i~~~i~~-~vDgIii~p~~~~~~~~~l~~a~~~gIPvV~~d~~~~ 93 (273) T cd06310 50 LENAIAR-GPDAILLAPTDAKALVPPLKEAKDAGIPVVLIDSGLN 93 (273) T ss_pred HHHHHHC-CCCEEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCC T ss_conf 9999974-9999999168714479999999984998589803334 No 194 >KOG3349 consensus Probab=44.41 E-value=22 Score=15.62 Aligned_cols=68 Identities=18% Similarity=0.261 Sum_probs=50.6 Q ss_pred CCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCC---CCC-HHHHHHHHHHCCCEEEECCHHHHHHHHHHH Q ss_conf 20255422082066135544454583768961994999898---443-799999999789869957989999999999 Q gi|254780402|r 313 EMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPN---VEN-FRDIYRRMVSSGAVRIVEEVGTLADMVYSL 386 (440) Q Consensus 313 eL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~---~~n-f~e~~~~L~~~g~~~~v~d~~eL~~~l~~l 386 (440) .|....+-||+|+ | ..|--+.+|-...|+|.|+=+| |+| ..|.++.|.+.|....+. +.+|.+.+.++ T Consensus 73 sl~e~I~~AdlVI-s----HAGaGS~letL~l~KPlivVvNd~LMDNHQ~ELA~qL~~egyL~~C~-ps~L~~~L~~~ 144 (170) T KOG3349 73 SLTEDIRSADLVI-S----HAGAGSCLETLRLGKPLIVVVNDSLMDNHQLELAKQLAEEGYLYYCT-PSTLPAGLAKL 144 (170) T ss_pred CHHHHHHHCCEEE-E----CCCCCHHHHHHHCCCCEEEEECHHHHHHHHHHHHHHHHHCCCEEEEE-CCCHHHHHHHH T ss_conf 1788875345887-4----58742099999749977999275764448999999998659689960-56567898760 No 195 >cd06301 PBP1_rhizopine_binding_like Periplasmic binding proteins specific to rhizopines. Periplasmic binding proteins specific to rhizopines, which are simple sugar-like compounds produced in the nodules induced by the symbiotic root nodule bacteria, such as Rhizobium and Sinorhizobium. Rhizopine-binding-like proteins from other bacteria are also included. Two inositol based rhizopine compounds are known to date: L-3-O-methly-scyllo-inosamine (3-O-MSI) and scyllo-inosamine. Bacterial strains that can metabolize rhizopine have a greater competitive advantage in nodulation and rhizopine synthesis is regulated by NifA/NtrA regulatory transcription activators which are maximally expressed at the onset of nitrogen fixation in bacteroids. The members of this group belong to the pentose/hexose sugar-binding protein family of the type I periplasmic binding protein superfamily. Probab=44.37 E-value=22 Score=15.62 Aligned_cols=43 Identities=26% Similarity=0.305 Sum_probs=30.8 Q ss_pred HHHHHHHHCCEEEEEE--CCCCHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 4455744202035640--4762146788874224550586412665 Q gi|254780402|r 120 VSRFLKYWKPDCMILS--ESDIWPLTVFELSKQRIPQVLVNARMSR 163 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~--e~ElWPnli~~~~~~~ip~~linarls~ 163 (440) ++.++.+ +++.+|+. +.+-....+.++.+.|||++.+|..... T Consensus 49 i~~li~~-~vDgIii~p~d~~~~~~~~~~a~~agIPvV~~d~~~~~ 93 (272) T cd06301 49 VENFIAQ-GVDAIIVVPVDTAATAPIVKAANAAGIPLVYVNRRPEN 93 (272) T ss_pred HHHHHHC-CCCEEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCCC T ss_conf 9999985-99999995588211189999999869988999356677 No 196 >TIGR02918 TIGR02918 conserved hypothetical protein TIGR02918; InterPro: IPR014267 Members of this entry are found only in Gram-positive bacteria of the Firmicutes lineage, including several species of Staphylococcus, Streptococcus, and Lactobacillus.. Probab=44.14 E-value=16 Score=16.56 Aligned_cols=268 Identities=15% Similarity=0.128 Sum_probs=145.4 Q ss_pred EECCCCCCHHH--HHHHHH------------------HCCE-EEEEECCCCHHHHHHHHHHCCCCEEEEE---------- Q ss_conf 85055720134--455744------------------2020-3564047621467888742245505864---------- Q gi|254780402|r 110 QYAPLDIQPAV--SRFLKY------------------WKPD-CMILSESDIWPLTVFELSKQRIPQVLVN---------- 158 (440) Q Consensus 110 ~y~P~D~~~~~--~~fl~~------------------~~P~-~~i~~e~ElWPnli~~~~~~~ip~~lin---------- 158 (440) -|+|.|....+ |+|.+- .=|+ .++-.+.||=-.|+.++.--+==+++++ T Consensus 153 YYAP~Dn~A~LY~R~FYNEDGSIAY~e~~dqG~~s~F~F~dG~iLYsK~eliayFl~~L~LT~~Di~IlDR~~~dknlii 232 (511) T TIGR02918 153 YYAPADNKAKLYQRTFYNEDGSIAYEEYVDQGDESVFVFKDGKILYSKQELIAYFLKQLNLTKKDIIILDRSTGDKNLII 232 (511) T ss_pred ECCCCCCCCEEEECEEECCCCCHHHHHCCCCCCCEEEECCCCCEECCHHHHHHHHHHHCCCCCCCEEEEECCCCCCEEEE T ss_conf 11776798547101124258834344203798843678379745418699999999960698566899973689930587 Q ss_pred ----------EC---------CCCCCC-CCCHHHHHHHHHCCCC---CCEEEECCHHHHHHH----HHH-CC-CCEE--E Q ss_conf ----------12---------665421-2100111123310264---423664157788899----860-88-5058--7 Q gi|254780402|r 159 ----------AR---------MSRRSF-KNWKTVLSFSKKIFSQ---FSLVIVQSERYFRRY----KEL-GA-QKLI--V 207 (440) Q Consensus 159 ----------ar---------ls~~S~-~~~~~~~~l~~~~~~~---~~~i~~qs~~~~~rl----~~l-G~-~~i~--v 207 (440) || .|+... ..+=..-.++-..|++ +|.+++-++...+-+ .+- +. |+|. . T Consensus 233 GQ~Vl~Nkg~AklgVVvHAEHf~~~~t~e~~ILWNNyYEYqF~nA~~iDFFItATd~Q~~~l~~QF~kY~~~~P~i~TIP 312 (511) T TIGR02918 233 GQAVLENKGPAKLGVVVHAEHFSESATNETYILWNNYYEYQFSNADYIDFFITATDIQNQILLEQFKKYKNIEPKIYTIP 312 (511) T ss_pred EEEEEEECCCEEEEEEEEEEECCCCCCCCCCCCCCCCHHHHCCCCCEEEEEEECHHHHHHHHHHHHHHHCCCCCEEEEEE T ss_conf 13888750870289999746368886168602214652553156866511340057889999998887479987788850 Q ss_pred ECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCC--CCCCCCCC-CCCCCCCCEEEEEEECCCCHHHHHHHHHH Q ss_conf 2043101234422125689999861798617504655332--10000000-00012341248897055211478887765 Q gi|254780402|r 208 SGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGE--EDKAVYVH-NFIKCRTDVLTIIVPRHPRRCDAIERRLI 284 (440) Q Consensus 208 ~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~E--E~~il~a~-~~l~~~~~~~lIIvPRhpeR~~~i~~~l~ 284 (440) +|++.- ...|. -..++ ..=++-||.-..| .+-+++|. +.-+..|++.+=|=..--|+ ..+.+++. T Consensus 313 VGSl~~--L~~p~-W~~Rk--------p~SiiTASRLA~EKHiDWLV~AVv~Ak~~~P~l~FDIYG~GgE~-~~L~~iI~ 380 (511) T TIGR02918 313 VGSLDE--LQYPE-WQERK--------PFSIITASRLAKEKHIDWLVKAVVKAKKSVPELTFDIYGEGGEK-SKLKKIIN 380 (511) T ss_pred CCCHHH--CCCCC-CCCCC--------CHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCEEEEECCCCHHH-HHHHHHHH T ss_conf 575433--57886-34645--------21567773413767126888999995133885110003563788-99999876 Q ss_pred HHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEE-ECCCCCCHHHHHHH Q ss_conf 30010110013444446641675067652025542208206613554445458376896199499-98984437999999 Q gi|254780402|r 285 AKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAIL-SGPNVENFRDIYRR 363 (440) Q Consensus 285 ~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi-~GP~~~nf~e~~~~ 363 (440) +.-= .+.+.-.. --+|...|+-=. +|+-||=.+.=|-+.|||.--|.|+| +==++.|-.-+-+ T Consensus 381 ~n~A-------~DYI~LkG-------H~~L~~vY~~Ye-lyLsaStSEGFGLTLmEAvGSGLgmIGFDV~YGN~TFI~d- 444 (511) T TIGR02918 381 ENQA-------EDYIRLKG-------HKNLSEVYKDYE-LYLSASTSEGFGLTLMEAVGSGLGMIGFDVNYGNPTFIKD- 444 (511) T ss_pred HHCC-------CCHHHHCC-------CCCHHHHHCCCC-EEEEECCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCEEEC- T ss_conf 3120-------01243115-------433566623232-2345212144115799997504332366187438870240- Q ss_pred HHHCCCEEEECCH--------HHHHHHHHHHHCCHHHHHHHHHHHHHHHHH Q ss_conf 9978986995798--------999999999958989999999999999970 Q gi|254780402|r 364 MVSSGAVRIVEEV--------GTLADMVYSLLSEPTIRYEMINAAINEVKK 406 (440) Q Consensus 364 L~~~g~~~~v~d~--------~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~ 406 (440) .++|-.++.+.. +.||+.+..+.-+.+...+|.+...+.++. T Consensus 445 -~~NGYLIP~~~~~~~~~~I~~~lA~~Iv~~Fv~~~~~~~~~~~SY~IA~~ 494 (511) T TIGR02918 445 -NKNGYLIPIDEEEDDEDEIITALAEKIVEYFVNENDIDAFHEVSYQIAEG 494 (511) T ss_pred -CCCCEECCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH T ss_conf -88840433457878878999999999898612678600265631788866 No 197 >COG4213 XylF ABC-type xylose transport system, periplasmic component [Carbohydrate transport and metabolism] Probab=44.01 E-value=23 Score=15.58 Aligned_cols=74 Identities=11% Similarity=0.078 Sum_probs=44.9 Q ss_pred CEEEEECCCCCCHHHHC---------CCCCCEEEEECCCCCCHHHHHHHHHH--CCEEEEEEC--CCCHHHHHHHHHHCC Q ss_conf 89999627763001210---------14774168850557201344557442--020356404--762146788874224 Q gi|254780402|r 85 NVLLTTMTATSAKVARK---------YLGQYAIHQYAPLDIQPAVSRFLKYW--KPDCMILSE--SDIWPLTVFELSKQR 151 (440) Q Consensus 85 ~iliT~~T~tg~~~~~~---------~~~~~~~~~y~P~D~~~~~~~fl~~~--~P~~~i~~e--~ElWPnli~~~~~~~ 151 (440) .|-+|..+++.....++ .+...+..+|.-=|-+.-...+=+.+ .|+++++.- ++---+.+..++..| T Consensus 27 ~IGis~~d~~~eRW~~D~~~~~~~~e~~g~k~~~q~A~~~~~~Q~~qien~i~qg~~vlvi~a~d~~~l~~~i~~A~~~g 106 (341) T COG4213 27 VIGISMPDLRSERWIKDRDAFVKKAEALGAKVDVQSADGDEEKQLAQIENMINQGVKVLVIGAIDGGVLSNAVEKAKSEG 106 (341) T ss_pred EEEEECCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHCCCHHHHHHHHHHHHHCCCCEEEEEECCCHHHHHHHHHHHHCC T ss_conf 59997687357666532688999998624323034322571679999999873599789998135325899999998759 Q ss_pred CCEEEEE Q ss_conf 5505864 Q gi|254780402|r 152 IPQVLVN 158 (440) Q Consensus 152 ip~~lin 158 (440) ||++-.+ T Consensus 107 ikViaYD 113 (341) T COG4213 107 IKVIAYD 113 (341) T ss_pred CEEEEEE T ss_conf 9499740 No 198 >cd01829 SGNH_hydrolase_peri2 SGNH_peri2; putative periplasmic member of the SGNH-family of hydrolases, a diverse family of lipases and esterases. The tertiary fold of the enzyme is substantially different from that of the alpha/beta hydrolase family and unique among all known hydrolases; its active site closely resembles the Ser-His-Asp(Glu) triad found in other serine hydrolases. Probab=43.80 E-value=23 Score=15.56 Aligned_cols=72 Identities=17% Similarity=0.255 Sum_probs=45.5 Q ss_pred EEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECC- Q ss_conf 9998178999999999999998619989999627763001210147741688505572013445574420203564047- Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSES- 137 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~- 137 (440) |++=.=|+.+..+. .+.+.+ ...+.+-+...|..|--++.. -++|.|..++..+...+|+++|++=+ T Consensus 2 Vlv~GDSl~~gl~~-~l~~~l-~~~~~i~~~~~s~~stGL~r~----------~~~dW~~~~~~~~~~~~pd~vVv~lG~ 69 (200) T cd01829 2 VLVIGDSLAQGLAP-GLLRAL-ADNPGIRVINRSKGSSGLVRP----------DFFDWPEKLKELIAEEKPDVVVVFLGA 69 (200) T ss_pred EEEEECHHHHHHHH-HHHHHH-HCCCCEEEEECCCCCCCCCCC----------CCCCHHHHHHHHHHCCCCCEEEEEECC T ss_conf 89991318887899-999985-216982999877457686679----------857879999998745799989999547 Q ss_pred -CCHHH Q ss_conf -62146 Q gi|254780402|r 138 -DIWPL 142 (440) Q Consensus 138 -ElWPn 142 (440) |-|+- T Consensus 70 ND~~~~ 75 (200) T cd01829 70 NDRQDI 75 (200) T ss_pred CCCCCC T ss_conf 777442 No 199 >KOG0353 consensus Probab=43.75 E-value=6.9 Score=18.95 Aligned_cols=50 Identities=16% Similarity=0.163 Sum_probs=29.1 Q ss_pred HHHHHHHHHHCCHHHHHHHHHH-----------HHHHHHHCCCHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999995898999999999-----------99999704556999999999752378785 Q gi|254780402|r 378 TLADMVYSLLSEPTIRYEMINA-----------AINEVKKMQGPLKITLRSLDSYVNPLIFQ 428 (440) Q Consensus 378 eL~~~l~~ll~d~~~~~~~~~~-----------a~~~i~~~~ga~~~~~~~i~~~l~~~~~~ 428 (440) .+++...... +|+.+++|-.+ ..++-+....+...+.+.|+.+|||+..- T Consensus 503 ~laehfde~w-~~~~c~k~cd~c~~~n~f~~~n~~ey~~dl~e~~kt~~~~i~e~ln~~k~~ 563 (695) T KOG0353 503 KLAEHFDEAW-EPEACNKMCDNCCKDNAFEGKNIKEYCRDLIEAAKTQAEEIEEHLNPAKDG 563 (695) T ss_pred HHHHHHHHHC-CHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 9998888641-889898876403257642366658888999999998999999734840036 No 200 >pfam05889 SLA_LP_auto_ag Soluble liver antigen/liver pancreas antigen (SLA/LP autoantigen). This family consists of several eukaryotic and archaeal proteins which are related to the human soluble liver antigen/liver pancreas antigen (SLA/LP autoantigen). Autoantibodies are a hallmark of autoimmune hepatitis, but most are not disease specific. Autoantibodies to soluble liver antigen (SLA) and to liver and pancreas antigen (LP) have been described as disease specific, occurring in about 30% of all patients with autoimmune hepatitis. The function of SLA/LP is unknown, however, it has been suggested that the protein may function as a serine hydroxymethyltransferase and may be an important enzyme in the thus far poorly understood selenocysteine pathway. Some archaeal members are annotated as being pyridoxal phosphate-dependent enzymes. Probab=42.97 E-value=23 Score=15.48 Aligned_cols=19 Identities=21% Similarity=0.357 Sum_probs=7.2 Q ss_pred EECCCCHHHHHHHHHHHHC Q ss_conf 7055211478887765300 Q gi|254780402|r 269 VPRHPRRCDAIERRLIAKG 287 (440) Q Consensus 269 vPRhpeR~~~i~~~l~~~g 287 (440) +||-|++..+|.++|++.+ T Consensus 167 aPR~pD~v~~IAklC~~~~ 185 (389) T pfam05889 167 APRSPDNVKEIAKICAEYD 185 (389) T ss_pred CCCCCCCHHHHHHHHHHCC T ss_conf 9999667999999999749 No 201 >PRK04168 hypothetical protein; Provisional Probab=42.96 E-value=23 Score=15.48 Aligned_cols=49 Identities=24% Similarity=0.226 Sum_probs=35.8 Q ss_pred CCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCC Q ss_conf 788887799981789999999999999986199899996277630012101 Q gi|254780402|r 52 LRPIGPLIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKY 102 (440) Q Consensus 52 ~~~~~~~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~ 102 (440) ..+.++++=+||+|+.+.... +.+...+++|++=+......+...+++. T Consensus 29 ~~~~~~l~Vf~AGSL~~pf~e--i~~~Fe~~~p~v~v~~e~~GS~~~ar~I 77 (336) T PRK04168 29 AEPPGKLKIFHAGSLSVPFEE--YEKEFEAYYPNVDVQREAGGSVACVRKI 77 (336) T ss_pred CCCCEEEEEEECCCCHHHHHH--HHHHHHHHCCCCEEEEEECCHHHHHHHH T ss_conf 788615999975525899999--9999998789965999847389999999 No 202 >cd06312 PBP1_ABC_sugar_binding_like_4 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=42.89 E-value=23 Score=15.47 Aligned_cols=43 Identities=19% Similarity=0.310 Sum_probs=32.5 Q ss_pred HHHHHHHHCCEEEEEE--CCCCHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 4455744202035640--4762146788874224550586412665 Q gi|254780402|r 120 VSRFLKYWKPDCMILS--ESDIWPLTVFELSKQRIPQVLVNARMSR 163 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~--e~ElWPnli~~~~~~~ip~~linarls~ 163 (440) ++.++.. +|+.+++. ..+-....+..+...|||++.+|..... T Consensus 50 i~~~i~~-~vDaIii~p~~~~~~~~~i~~a~~agIpVv~~ds~~~~ 94 (271) T cd06312 50 IEAAIAA-KPDGIVVTIPDPDALDPAIKRAVAAGIPVISFNAGDPK 94 (271) T ss_pred HHHHHHC-CCCEEEEECCCCCHHHHHHHHHHHCCCEEEEEECCCCC T ss_conf 9999975-99989993788300269999999659869999478864 No 203 >cd06285 PBP1_LacI_like_7 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=42.72 E-value=24 Score=15.46 Aligned_cols=105 Identities=18% Similarity=0.192 Sum_probs=63.2 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHH Q ss_conf 99999999998619989999627763001210147741688505572013445574420203564047621467888742 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSK 149 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~ 149 (440) ..+..+-+.+.+.+.++++.++.. ..+ ++ ...++ .+...+++.+|+.......+.+..+.+ T Consensus 16 ~l~~~i~~~~~~~gy~~~i~~s~~-~~~--~e---------------~~~i~-~l~~~~vdGiIi~~~~~~~~~~~~l~~ 76 (265) T cd06285 16 TMYEGIEEAAAERGYSTFVANTGD-NPD--AQ---------------RRAIE-MLLDRRVDGLILGDARSDDHFLDELTR 76 (265) T ss_pred HHHHHHHHHHHHCCCEEEEEECCC-CHH--HH---------------HHHHH-HHHHCCCCEEEECCCCCCHHHHHHHHH T ss_conf 999999999998699899997999-989--99---------------99999-999569999997688799899999996 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCH----HHHHHHHHHCCCCEEEECCCC Q ss_conf 245505864126654212100111123310264423664157----788899860885058720431 Q gi|254780402|r 150 QRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSE----RYFRRYKELGAQKLIVSGNLK 212 (440) Q Consensus 150 ~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~----~~~~rl~~lG~~~i~v~GnlK 212 (440) .++|+++++..... +..|.+-+. ...++|.+.|-+++.+.+... T Consensus 77 ~~iPvV~~~~~~~~-------------------~~~v~~dn~~~~~~~~~~Li~~G~~~i~~l~~~~ 124 (265) T cd06285 77 RGVPFVLVLRHAGT-------------------SPAVTGDDVLGGRLATRHLLDLGHRRIAVLAGPD 124 (265) T ss_pred CCCCEEEECCCCCC-------------------CCEEEECHHHHHHHHHHHHHHHCCCCEEECCCCC T ss_conf 79978998556899-------------------9979877799999999999983886587547887 No 204 >TIGR00315 cdhB CO dehydrogenase/acetyl-CoA synthase complex, epsilon subunit; InterPro: IPR003704 Carbon monoxide dehydrogenase (Cdh) from Methanosarcina frisia Go1 is a Ni2+-, Fe2+-, and S2-containing alpha2beta2 heterotetramer . The CO dehydrogenase enzyme complex from Methanosarcina thermophila contains a corrinoid/iron-sulphur enzyme composed of two subunits (delta and gamma) . This family consists of carbon monoxide dehydrogenase I/II beta subunit 1.2.99.2 from EC and CO dehydrogenase (acetyl-CoA synthase epsilon subunit).; GO: 0003824 catalytic activity, 0008152 metabolic process. Probab=42.60 E-value=24 Score=15.44 Aligned_cols=69 Identities=17% Similarity=0.366 Sum_probs=57.1 Q ss_pred CCEEEEEECC-CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCC Q ss_conf 0203564047-6214678887422455058641266542121001111233102644236641577888998608850 Q gi|254780402|r 128 KPDCMILSES-DIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQK 204 (440) Q Consensus 128 ~P~~~i~~e~-ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~ 204 (440) .|+.+-+++- +.=-|||+.|+| |++++...+.+. -++.-+..++..+||.=.+.+..+.++|.+-|++. T Consensus 8 ~~~~A~~~~pG~~v~~MIkkAkr---PLLv~G~~l~~~-----Ek~~E~~~K~iekF~l~vvaTa~~~~~lieaGi~~ 77 (170) T TIGR00315 8 GPKEATLVEPGKLVAMMIKKAKR---PLLVVGPNLEKD-----EKLLELVVKFIEKFDLPVVATADAIKALIEAGIEE 77 (170) T ss_pred CCCEEEEECCCCHHHHHHHHCCC---CEEEECCCCHHC-----CHHHHHHHHHHHHCCCCEEECCHHHHHHHHCCCCC T ss_conf 75516753688425676776288---605336100121-----21345555556432886474731138888546621 No 205 >pfam11019 DUF2608 Protein of unknown function (DUF2608). This family is conserved in Bacteria. The function is not known. Probab=42.55 E-value=24 Score=15.44 Aligned_cols=30 Identities=30% Similarity=0.273 Sum_probs=25.2 Q ss_pred EEEEECCCCHHHHHHHHHHCCCCEEEEEECC Q ss_conf 3564047621467888742245505864126 Q gi|254780402|r 131 CMILSESDIWPLTVFELSKQRIPQVLVNARM 161 (440) Q Consensus 131 ~~i~~e~ElWPnli~~~~~~~ip~~linarl 161 (440) ...++|.+ ||+++..++++|+|++-+++|- T Consensus 77 ~~~lie~~-~~~~i~~l~~~~~~v~glT~~~ 106 (251) T pfam11019 77 KMELIESD-VPSLINKLQNKGITVLGLTERP 106 (251) T ss_pred CEEEECHH-HHHHHHHHHHCCCCEEEECCCC T ss_conf 64871573-8999999997899399983898 No 206 >TIGR01179 galE UDP-glucose 4-epimerase; InterPro: IPR005886 Synonym: UDP-galactose 4-epimerase UDP-glucose 4-epimerase (5.1.3.2 from EC) interconverts UDP-glucose and UDP-galactose which are precursors of glucose- and galactose-containing exopolysaccharides (EPS). A set of related proteins, some of which are tentatively identified as UDP-glucose-4-epimerase in Thermotoga maritima, Bacillus halodurans, and several archaea, but deeply branched from this set and lacking experimental evidence, are excluded and described by a separate model. ; GO: 0003978 UDP-glucose 4-epimerase activity, 0006012 galactose metabolic process. Probab=42.48 E-value=24 Score=15.43 Aligned_cols=86 Identities=10% Similarity=0.062 Sum_probs=55.2 Q ss_pred HHHHHHHHCCCCEEEEECCCCCCHH----HHCCCCCCEEEEECCCCCCHHHHHHHH----HHCCEEEEEE-------CCC Q ss_conf 9999998619989999627763001----210147741688505572013445574----4202035640-------476 Q gi|254780402|r 74 GLIPAIRSRHVNVLLTTMTATSAKV----ARKYLGQYAIHQYAPLDIQPAVSRFLK----YWKPDCMILS-------ESD 138 (440) Q Consensus 74 pli~~l~~~~~~iliT~~T~tg~~~----~~~~~~~~~~~~y~P~D~~~~~~~fl~----~~~P~~~i~~-------e~E 138 (440) -.+++|.+++.+|+|==+=.+|... .++..+..+...--=++-...++++|. .-+|+++|=+ |+= T Consensus 14 Ht~~~Ll~~G~ev~vlDNLs~G~~~~l~~~~~~~G~~~~fv~gDL~D~~~l~~~f~kqql~~~idAViHFAg~~~VgESv 93 (341) T TIGR01179 14 HTVRQLLESGYEVVVLDNLSNGSAEALKRGEEITGKEVTFVEGDLRDRELLRRVFEKQQLEHKIDAVIHFAGLIAVGESV 93 (341) T ss_pred HHHHHHHHCCCEEEEEECCCCCCHHHHCCCCEECCCCCEEEEECCHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHH T ss_conf 88788763597289981578884887500234148532058717515799999987743116754675201121252557 Q ss_pred CH------------HHHHHHHHHCCCCEEEEEE Q ss_conf 21------------4678887422455058641 Q gi|254780402|r 139 IW------------PLTVFELSKQRIPQVLVNA 159 (440) Q Consensus 139 lW------------Pnli~~~~~~~ip~~lina 159 (440) .= -||+.+|.+-||.-++.+. T Consensus 94 ~~Pl~YY~NNv~nTl~L~~~m~~~GV~~~iFSS 126 (341) T TIGR01179 94 QKPLKYYRNNVVNTLNLLEAMQETGVKKFIFSS 126 (341) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCEEECC T ss_conf 524544000468999999999981897415304 No 207 >TIGR00073 hypB hydrogenase accessory protein HypB; InterPro: IPR004392 The hydrogenase accessory protein HypB is a GTP hydrolase for assembly of nickel metallocentre of hydrogenase. A similar protein, ureG, is an accessory protein for urease, which also uses nickel.; GO: 0016151 nickel ion binding, 0006461 protein complex assembly. Probab=41.48 E-value=17 Score=16.41 Aligned_cols=119 Identities=21% Similarity=0.284 Sum_probs=56.8 Q ss_pred CCHHHHHHHHHHCCCCEEE-ECCC-CCCCCC----------CCCCHHHHHHHHHH-----------CCCCCEEECCCCCC Q ss_conf 1577888998608850587-2043-101234----------42212568999986-----------17986175046553 Q gi|254780402|r 189 QSERYFRRYKELGAQKLIV-SGNL-KIDTES----------LPCDKELLSLYQES-----------IAGRYTWAAISTFE 245 (440) Q Consensus 189 qs~~~~~rl~~lG~~~i~v-~Gnl-K~d~~~----------~~~~~~~~~~~~~~-----------~~~r~v~vagSth~ 245 (440) +|..|++|+++.|++-+.+ ||.- =.|... .+.+....-.+.+. +|...-++.-|+-+ T Consensus 72 ~t~~DA~RlR~~G~~a~~~nTGk~CHLdA~mv~G~~~~L~~~~ld~~~DlL~IENVGNLvCP~~FdLGe~~rVvllSVTE 151 (225) T TIGR00073 72 QTKNDADRLRKYGVPAIQINTGKECHLDAHMVAGAIHALKDLPLDDISDLLLIENVGNLVCPADFDLGEHMRVVLLSVTE 151 (225) T ss_pred HHHHHHHHHHHCCCEEEEECCCCCCCCHHHHHHHHHHHHHCCCCCCCCCEEEEEECCCCEECCCCCCCCCCEEEEEEECC T ss_conf 22556999986498688636886444016678658755421688871462688644761006731123563079998658 Q ss_pred CCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCHHHHHC-CCCCCCCCCEEEEEC-CCC-CCHHHHH Q ss_conf 3210000000000123412488970552114788877653001011001-344444664167506-765-2025542 Q gi|254780402|r 246 GEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLKVARRS-RGDVINAEVDIFLGD-TIG-EMGFYLR 319 (440) Q Consensus 246 ~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~~~~~S-~~~~~~~~~~V~i~D-t~G-eL~~lY~ 319 (440) ||+...-. -..-+ .+-+|++ .=.++++..|+...+-. +-...+++.+|+.++ +.| =|..||. T Consensus 152 GdDk~lKy--P~~F~--~Ad~~~i--------nK~DL~~~v~~D~ek~~~d~~~~nP~a~Ii~~S~ktg~Gl~~w~~ 216 (225) T TIGR00073 152 GDDKVLKY--PAMFK--EADLILI--------NKVDLAEAVGFDVEKMKADARKINPEAEIILVSAKTGKGLDEWLE 216 (225) T ss_pred CCCCCCCC--CHHHH--HHHHHHH--------CHHHHHHHHCCCHHHHHHHHHHHCCCEEEEEEECCCCCCHHHHHH T ss_conf 99965466--15887--4445621--------478899770736789999998628950799863689734789999 No 208 >PRK06163 hypothetical protein; Provisional Probab=41.23 E-value=25 Score=15.31 Aligned_cols=31 Identities=19% Similarity=0.246 Sum_probs=19.6 Q ss_pred CHHHHHHHHHHCC--CEEEECCHHHHHHHHHHHHCC Q ss_conf 3799999999789--869957989999999999589 Q gi|254780402|r 356 NFRDIYRRMVSSG--AVRIVEEVGTLADMVYSLLSE 389 (440) Q Consensus 356 nf~e~~~~L~~~g--~~~~v~d~~eL~~~l~~ll~d 389 (440) +|..+++ ..| ....|++.++|.+++...++. T Consensus 130 Df~~iA~---a~Gy~~~~~v~~~~el~~~l~~~l~~ 162 (202) T PRK06163 130 DVVAIAR---GAGLENSHWAADEAHFEALVDQALSG 162 (202) T ss_pred CHHHHHH---HCCCCCEEEECCHHHHHHHHHHHHHC T ss_conf 9999999---77998579977999999999999838 No 209 >TIGR01989 COQ6 Ubiquinone biosynthesis mono0xygenase COQ6; InterPro: IPR000689 Ubiquinone (Q) functions as an electron carrier in the respiratory chain in mitochondria. Q biosynthesis involves a series of enzymatic steps, which are catalysed by the enzymes COQ1-COQ8 in Saccharomyces cerevisiae. COQ6, or ubiquinone biosynthesis monooxygenase, is a flavin-dependent enzyme localised to the matrix side of the inner mitochondrial membrane that is required for one or more steps in Q biosynthesis , .; GO: 0004497 monooxygenase activity. Probab=41.07 E-value=8.1 Score=18.50 Aligned_cols=17 Identities=18% Similarity=0.239 Sum_probs=6.9 Q ss_pred EEEECCHHHHHHHHHHC Q ss_conf 36641577888998608 Q gi|254780402|r 185 LVIVQSERYFRRYKELG 201 (440) Q Consensus 185 ~i~~qs~~~~~rl~~lG 201 (440) +|.+.|+.+...|+++| T Consensus 56 Rvssitp~si~~f~~~G 72 (481) T TIGR01989 56 RVSSITPASIEFFKKIG 72 (481) T ss_pred CEEECCCHHHHHHHHCC T ss_conf 35853750799985348 No 210 >cd01391 Periplasmic_Binding_Protein_Type_1 Type 1 periplasmic binding fold superfamily. Type 1 periplasmic binding fold superfamily. This model and hierarchy represent the ligand binding domains of the LacI family of transcriptional regulators, periplasmic binding proteins of the ABC-type transport systems, the family C G-protein couples receptors (GPCRs), membrane bound guanylyl cyclases including the family of natriuretic peptide receptors (NPRs), and the N-terminal leucine/isoleucine/valine- binding protein (LIVBP)-like domains of the ionotropic glutamate receptors (iGluRs). In LacI-like transcriptional regulator and the bacterial periplasmic binding proteins the ligands are monosaccharides including lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars, with a few exceptions. Periplasmic sugar binding proteins are one of the components of ABC transporters and are involved in the active transport of water-soluble ligands. The LacI family of proteins con Probab=40.23 E-value=26 Score=15.21 Aligned_cols=149 Identities=13% Similarity=0.046 Sum_probs=70.7 Q ss_pred HHHHHHHHHCCEEEEEECCC-CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECC----HHH Q ss_conf 34455744202035640476-2146788874224550586412665421210011112331026442366415----778 Q gi|254780402|r 119 AVSRFLKYWKPDCMILSESD-IWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQS----ERY 193 (440) Q Consensus 119 ~~~~fl~~~~P~~~i~~e~E-lWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs----~~~ 193 (440) .+++++++ +++.+|..-+. --......+++.+||++.++....+.+. ...+..+...+ ... T Consensus 50 ~i~~l~~~-~vd~ii~~~~~~~~~~~~~~~~~~~iPvv~~~~~~~~~~~-------------~~~~~~v~~~~~~~~~~~ 115 (269) T cd01391 50 ALRDLIQQ-GVDGIIGPPSSSSALAVVELAAAAGIPVVSLDATAPDLTG-------------YPYVFRVGPDNEQAGEAA 115 (269) T ss_pred HHHHHHHC-CCCEEEECCCCCCHHHHHHHHHHCCCEEEEECCCCCCCCC-------------CCCEEEEEECHHHHHHHH T ss_conf 99999974-9999994688744489999999769819996788787778-------------886589985889999999 Q ss_pred HHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHC---CCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEE Q ss_conf 889986088505872043101234422125689999861---79861750465533210000000000123412488970 Q gi|254780402|r 194 FRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESI---AGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVP 270 (440) Q Consensus 194 ~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~---~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvP 270 (440) .+.+.+.|.+++.+.+....+ .......-+...+ +...+...-..+..++ -.-.+.+.+++.++...|++- T Consensus 116 ~~~l~~~g~~~i~~i~~~~~~-----~~~~r~~g~~~~~~~~g~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~~ai~~~ 189 (269) T cd01391 116 AEYLAEKGWKRVALIYGDDGA-----YGRERLEGFKAALKKAGIEVVAIEYGDLDTEK-GFQALLQLLKAAPKPDAIFAC 189 (269) T ss_pred HHHHHHHCCCEEEEECCCCCH-----HHHHHHHHHHHHHHHCCCCEEEEEEECCCCCH-HHHHHHHHHHHCCCCEEEEEC T ss_conf 868987189616997379857-----79999999999999869986799971343214-899999998618997399988 Q ss_pred CCCCHHHHHHHHHHHHCC Q ss_conf 552114788877653001 Q gi|254780402|r 271 RHPRRCDAIERRLIAKGL 288 (440) Q Consensus 271 RhpeR~~~i~~~l~~~gl 288 (440) -. .-...+.+.+++.|+ T Consensus 190 ~d-~~a~g~~~a~~~~G~ 206 (269) T cd01391 190 ND-EMAAGALKAAREAGL 206 (269) T ss_pred CH-HHHHHHHHHHHHCCC T ss_conf 88-999999999998698 No 211 >PRK05632 phosphate acetyltransferase; Reviewed Probab=39.80 E-value=26 Score=15.17 Aligned_cols=41 Identities=20% Similarity=0.291 Sum_probs=21.1 Q ss_pred CCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHCCH Q ss_conf 332100000000001234124889705521147888776530010 Q gi|254780402|r 245 EGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKGLK 289 (440) Q Consensus 245 ~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~gl~ 289 (440) +|||+-++.|...+.+..=...|++.+. ++|.+.+++.|+. T Consensus 398 Eg~d~rvl~Aa~~~~~~gia~~iLlG~~----~~i~~~~~~~gl~ 438 (702) T PRK05632 398 EGDEPRTLKAAAICAERGIADCVLLGNP----EEIRRVAAAQGVD 438 (702) T ss_pred CCCCHHHHHHHHHHHHCCCEEEEEECCH----HHHHHHHHHCCCC T ss_conf 9898699999999998698059997899----9999999975999 No 212 >TIGR00666 PBP4 D-alanyl-D-alanine carboxypeptidase/D-alanyl-D-alanine-endopeptidase; InterPro: IPR000667 Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes . They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence . Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases . Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base . The geometric orientations of the catalytic residues are similar between families, despite different protein folds . The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) , . Peptidases are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. This family of serine peptidases belong to MEROPS peptidase family S13 (D-Ala-D-Ala carboxypeptidase C, clan SE). The predicted active site residues for members of this family and family S12 occur in the motif SXXK. D-Ala-D-Ala carboxypeptidase C is involved in the metabolism of cell components ; it is synthesised with a leader peptide to target it to the cell membrane . After cleavage of the leader peptide, the enzyme is retained in the membrane by a C-terminal anchor . There are three families of serine-type D-Ala-D-Ala peptidase (designated S11, S12 and S13), which are also known as low molecular weight penicillin-binding proteins . Family S13 comprises D-Ala-D-Ala peptidases that have sufficient sequence similarity around their active sites to assume a distant evolutionary relationship to other clan members; members of the S13 family also bind penicillin and have D-amino-peptidase activity. Proteases of family S11 have exclusive D-Ala-D-Ala peptidase activity, while some members of S12 are C beta-lactamases . ; GO: 0004185 serine carboxypeptidase activity, 0006508 proteolysis. Probab=39.63 E-value=26 Score=15.15 Aligned_cols=58 Identities=24% Similarity=0.367 Sum_probs=44.2 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHHCCC-----CEEEEECCCCCCHHHHCCCCCCEEEEE-CCCCC Q ss_conf 8887799981789999999999999986199-----899996277630012101477416885-05572 Q gi|254780402|r 54 PIGPLIWFHASSVGETMALIGLIPAIRSRHV-----NVLLTTMTATSAKVARKYLGQYAIHQY-APLDI 116 (440) Q Consensus 54 ~~~~~IW~HaaSvGE~~~a~pli~~l~~~~~-----~iliT~~T~tg~~~~~~~~~~~~~~~y-~P~D~ 116 (440) ++|.--|.|+|. -...|++.|++++- +++|=++..+++..+..+-.+....|| .|..- T Consensus 71 dPGqdtWyhGaa-----r~~~l~~~LK~~GV~~I~g~~~~Dts~f~~~~~aPgW~W~D~~~~~~aP~~A 134 (427) T TIGR00666 71 DPGQDTWYHGAA-----RLDKLVEELKKSGVKKIEGNVLVDTSAFSDHDRAPGWPWNDLTTGYAAPIAA 134 (427) T ss_pred CCCCCCCHHHHH-----HHHHHHHHHHHCCCCEECCCEEEEEECCCCCCCCCCCCCCHHHCCCCCCHHH T ss_conf 388652012478-----9999999998648802026377631035773337566810544267788566 No 213 >KOG0100 consensus Probab=39.47 E-value=13 Score=17.15 Aligned_cols=15 Identities=47% Similarity=0.645 Sum_probs=8.1 Q ss_pred CCCCHHHHHHHCCCE Q ss_conf 545837689619949 Q gi|254780402|r 334 GGQNPLEAAMLGCAI 348 (440) Q Consensus 334 GGhN~lEpaa~G~pV 348 (440) -|-||-||.++|.+| T Consensus 393 kGinPdEAVAYGAAV 407 (663) T KOG0100 393 KGINPDEAVAYGAAV 407 (663) T ss_pred CCCCHHHHHHHHHHH T ss_conf 798867877765666 No 214 >TIGR00715 precor6x_red precorrin-6x reductase; InterPro: IPR003723 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobK and CbiJ precorrin-6x reductase (1.3.1.54 from EC). In the aerobic pathway, CobK catalyses the reduction of the macrocycle of precorrin-6X to produce precorrin-6Y; while in the anaerobic pathway CbiJ catalyses the reduction of the macrocycle of cobalt-precorrin-6X into cobalt-precorrin-6Y , .; GO: 0016994 precorrin-6A reductase activity, 0009236 cobalamin biosynthetic process. Probab=39.46 E-value=26 Score=15.13 Aligned_cols=241 Identities=19% Similarity=0.187 Sum_probs=134.1 Q ss_pred HHHHHHHHHHHHHHCC-CCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCC----CHHH Q ss_conf 9999999999998619-9899996277630012101477416885055720134455744202035640476----2146 Q gi|254780402|r 68 ETMALIGLIPAIRSRH-VNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESD----IWPL 142 (440) Q Consensus 68 E~~~a~pli~~l~~~~-~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~E----lWPn 142 (440) =+.-++.++++|.... -+|+.|.||..|....++.-...+ .-=|||-..-.+.|++..+-+++|=.-.. +=-| T Consensus 8 GT~dsr~~~~~L~~~~~~~i~~t~tt~~~~~l~~~~~a~~v--~~gaL~~~EGL~~~l~~~~i~~~vDAtHPFA~~~t~~ 85 (260) T TIGR00715 8 GTKDSRAIAKKLRALGDVEILVTVTTEEGKKLLEISQASKV--VTGALDKDEGLRELLKEESIDILVDATHPFAAQITKN 85 (260) T ss_pred CCHHHHHHHHHHCCCCCEEEEEEECCCCCCCCCCCCCCCCE--EECCCCCCCCHHHHHHHCCCCEEEECCCHHHHHHHHH T ss_conf 77789999974037875899987415776300100377715--8757488875378886449638985797358999999 Q ss_pred HHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHH-CCCCCCEEEECCHHHHHHHHHHC--C-CCEEE-ECCCCCCCCC Q ss_conf 788874224550586412665421210011112331-02644236641577888998608--8-50587-2043101234 Q gi|254780402|r 143 TVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKK-IFSQFSLVIVQSERYFRRYKELG--A-QKLIV-SGNLKIDTES 217 (440) Q Consensus 143 li~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~-~~~~~~~i~~qs~~~~~rl~~lG--~-~~i~v-~GnlK~d~~~ 217 (440) -+..|+..+||.+-+. | .. ....-.-|.+.|=+++.+..+-+ - ++|.. +|- T Consensus 86 a~~vc~E~~~~Yvrfe-R----------------p~e~~~~~~ii~V~d~~~a~~~a~q~~~~g~~V~l~~G~------- 141 (260) T TIGR00715 86 ALEVCKELGIPYVRFE-R----------------PEELVLGKNIIEVSDIEEAARVAEQGYLKGKRVFLTAGV------- 141 (260) T ss_pred HHHHHHHCCCEEEEEC-C----------------CCCCCCCCCEEEECCHHHHHHHHHCCCCCCCEEEEEECH------- T ss_conf 9999976198179960-8----------------835369986688567788877653034453124440111------- Q ss_pred CCCCHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHH-HHHHHHCCHHHHHCCC Q ss_conf 422125689999861798617504655332100000000001234124889705521147888-7765300101100134 Q gi|254780402|r 218 LPCDKELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIE-RRLIAKGLKVARRSRG 296 (440) Q Consensus 218 ~~~~~~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~-~~l~~~gl~~~~~S~~ 296 (440) .....+....+... +++-+-|.- .-+-+++++ -.|.-+ ||+-|-|- +.+.. .++++. T Consensus 142 -----~~L~~~~~~~~~~~--v~~rvlP~~-~al~~~~~~--G~p~~~-Iva~~G~~-~~~~e~aLlr~y---------- 199 (260) T TIGR00715 142 -----STLAAVVNSQDKEK--VIVRVLPDG-TALEKALKL--GLPSDR-IVAMRGPF-SEELEKALLREY---------- 199 (260) T ss_pred -----HHHHHHHHHCCCCE--EEEEECCCC-HHHHHHHHC--CCCCCC-EEECCCCC-CCHHHHHHHHHH---------- T ss_conf -----77899974037727--999988880-237999965--898112-78727897-713678888862---------- Q ss_pred CCCCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCH Q ss_conf 44446641675067652025542208206613554445458376896199499989844379999999978986995798 Q gi|254780402|r 297 DVINAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEV 376 (440) Q Consensus 297 ~~~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~ 376 (440) ..|.+|-=.=|+- ||...+ .-=|.++|.+||. +.+-....|+...++|- T Consensus 200 -----~~~avvTK~SG~~------------Gg~~~K-----~~~A~~lgi~vi~---------i~RP~~~~~G~~~~D~~ 248 (260) T TIGR00715 200 -----RIDAVVTKESGEK------------GGLKEK-----VEAAEALGIKVIL---------IARPKVDLGGVAVFDDI 248 (260) T ss_pred -----CCCEEEECCCCCC------------CCCHHH-----HHHHHHCCCEEEE---------EECCCCCCCCEEEECCH T ss_conf -----0696898576886------------774588-----8999856942899---------85687598975998177 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999995 Q gi|254780402|r 377 GTLADMVYSLL 387 (440) Q Consensus 377 ~eL~~~l~~ll 387 (440) +++.+.+.+++ T Consensus 249 ~~~~~~v~~~~ 259 (260) T TIGR00715 249 DELAKFVAKLL 259 (260) T ss_pred HHHHHHHHHCC T ss_conf 99999996148 No 215 >cd06271 PBP1_AglR_RafR_like Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressors specific for raffinose (RafR) and alpha-glucosides (AglR) which are members of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the represso Probab=39.45 E-value=26 Score=15.13 Aligned_cols=104 Identities=17% Similarity=0.146 Sum_probs=62.2 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHC Q ss_conf 99999999986199899996277630012101477416885055720134455744202035640476214678887422 Q gi|254780402|r 71 ALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQ 150 (440) Q Consensus 71 ~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~ 150 (440) .+..+-+.+.+.+.++++.++.. ..+.- ..+.+++...+.+.+|+..+......+..+.+. T Consensus 21 l~~gie~~~~~~Gy~lll~~~~~-~~~~~------------------~~~~~~l~~~~vDGiIl~~~~~~~~~~~~l~~~ 81 (268) T cd06271 21 FLSGLSEALAEHGYDLVLLPVDP-DEDPL------------------EVYRRLVESGLVDGVIISRTRPDDPRVALLLER 81 (268) T ss_pred HHHHHHHHHHHCCCEEEEEECCC-CHHHH------------------HHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHC T ss_conf 99999999998499899995899-98999------------------999999984898889996788981999999977 Q ss_pred CCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHHHHHHHCCCCEEEECC Q ss_conf 455058641266542121001111233102644236641----577888998608850587204 Q gi|254780402|r 151 RIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFRRYKELGAQKLIVSGN 210 (440) Q Consensus 151 ~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~rl~~lG~~~i~v~Gn 210 (440) ++|+++++-...+.. ++.|.+- .....++|.+.|-++|-..|. T Consensus 82 ~iP~V~~~r~~~~~~-----------------~~~V~~Dn~~~~~~a~~~L~~~G~~~I~~i~~ 128 (268) T cd06271 82 GFPFVTHGRTELGDP-----------------HPWVDFDNEAAAYQAVRRLIALGHRRIALLNP 128 (268) T ss_pred CCCEEEECCCCCCCC-----------------CCEEEECHHHHHHHHHHHHHHHCCCEEEECCC T ss_conf 999999766568999-----------------96899576999999999999859874875488 No 216 >TIGR02477 PFKA_PPi diphosphate--fructose-6-phosphate 1-phosphotransferase; InterPro: IPR011183 Diphosphate--fructose-6-phosphate 1-phosphotransferase catalyses the addition of phosphate from diphosphate (PPi) to fructose 6-phosphate to give fructose 1,6-bisphosphate (2.7.1.90 from EC). The enzyme is also known as pyrophosphate-dependent phosphofructokinase. The usage of PPi-dependent enzymes in glycolysis presumably frees up ATP for other processes . IPR012828 from INTERPRO represents the ATP-dependent 6-phosphofructokinase enzyme contained within Phosphofructokinase. This entry contains primarily bacterial, plant alpha, and plant beta sequences. These may be dimeric nonallosteric enzymes (bacteria) or allosteric heterotetramers (plants) . For additional information please see , , , .; GO: 0005524 ATP binding, 0047334 diphosphate-fructose-6-phosphate 1-phosphotransferase activity, 0006096 glycolysis, 0005945 6-phosphofructokinase complex. Probab=39.30 E-value=15 Score=16.68 Aligned_cols=283 Identities=16% Similarity=0.160 Sum_probs=126.0 Q ss_pred CCCCCCHHHHHHHHHH--CCEEEEEECCCCHH--HHHHHHHHCCCCEEEEEECCCCC-------------CCCCCHHHHH Q ss_conf 0557201344557442--02035640476214--67888742245505864126654-------------2121001111 Q gi|254780402|r 112 APLDIQPAVSRFLKYW--KPDCMILSESDIWP--LTVFELSKQRIPQVLVNARMSRR-------------SFKNWKTVLS 174 (440) Q Consensus 112 ~P~D~~~~~~~fl~~~--~P~~~i~~e~ElWP--nli~~~~~~~ip~~linarls~~-------------S~~~~~~~~~ 174 (440) .|-|.+..+++++.+- .|-+-+.-.++-=| ..+...+..+|-++|=.|. ++. .+.+-.++-. T Consensus 33 ~~~dr~~~l~~~FP~Tyg~P~~~f~~g~~~~ldaqii~~~~~l~~GviLSGGq-APGGHNVi~GLFD~lk~~np~SkLyG 111 (566) T TIGR02477 33 AVADRQEELKELFPNTYGLPIITFVPGEAKPLDAQIIDEHQPLKVGVILSGGQ-APGGHNVISGLFDALKKLNPESKLYG 111 (566) T ss_pred CCCCCHHHHHHHCCCCCCCCCEEEECCCCCCCCHHHHHCCCCCEEEEEEECCC-CCCCHHHHHHHHHHHHHHCCCCCCCC T ss_conf 10264789998630124775378864667883214532168756889875798-97711467778999997377672001 Q ss_pred HHHHCCCCCCEEE-ECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHH--HCC-CCCEEECCCCCCCCCCC Q ss_conf 2331026442366-4157788899860885058720431012344221256899998--617-98617504655332100 Q gi|254780402|r 175 FSKKIFSQFSLVI-VQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQE--SIA-GRYTWAAISTFEGEEDK 250 (440) Q Consensus 175 l~~~~~~~~~~i~-~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~--~~~-~r~v~vagSth~~EE~~ 250 (440) |.+.--.-++.=. =-+++--..|+..|==++.-+|=+|.. ..++..+.+.. .++ +--|++.|-.=-- ... T Consensus 112 F~~Gp~GL~~~~~~elT~~~i~~YRN~GGFD~iGSGRtKI~-----T~Eq~~~al~~~k~l~LdgLVIIGGDdSNT-nAA 185 (566) T TIGR02477 112 FLGGPEGLLDNNYVELTKELIDEYRNTGGFDIIGSGRTKIE-----TEEQFAKALETAKKLKLDGLVIIGGDDSNT-NAA 185 (566) T ss_pred CCCCCCCCCCCCEEEECHHHHHCCCCCCCEEEECCCCCCCC-----CHHHHHHHHHHHHHHCCCEEEEECCCCCHH-HHH T ss_conf 02074443158715763788722278877420016854556-----888999999998760896489974798679-999 Q ss_pred CCCCCCCCCCCCCEEEEEEECCCCHHHHHHHH-H-HHHCCHH--HHHCCCCCCCC-CCEEEEECCCCCCHHHHHHCCCEE Q ss_conf 00000000123412488970552114788877-6-5300101--10013444446-641675067652025542208206 Q gi|254780402|r 251 AVYVHNFIKCRTDVLTIIVPRHPRRCDAIERR-L-IAKGLKV--ARRSRGDVINA-EVDIFLGDTIGEMGFYLRMTEIAF 325 (440) Q Consensus 251 il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~-l-~~~gl~~--~~~S~~~~~~~-~~~V~i~Dt~GeL~~lY~~Adiaf 325 (440) +| |....++.-+.++|=||-=++. +++.. + ...||.. ..+|+. +.+ .. |.+=- ..+|-- |=. T Consensus 186 ~L-AEyF~~~~~~t~viGVPKTIDG--DLKn~~iEtsFGFDTacK~YSEl--IGNi~r-----DA~S~-~KYwHF--IrL 252 (566) T TIGR02477 186 LL-AEYFAKKGLKTQVIGVPKTIDG--DLKNQYIETSFGFDTACKVYSEL--IGNICR-----DALSA-KKYWHF--IRL 252 (566) T ss_pred HH-HHHHHHCCCCCEEEEEECCCCH--HHCCCCCCCCCCCHHHHHHHHHH--HHHHHH-----HHHHC-CCCEEE--EEE T ss_conf 99-9999973899227864025472--11027104587610478899887--557888-----76642-871367--774 Q ss_pred EECCCCCCCCCCHHHHHHHCCCEE--ECCCCC-------C-HHHHHHHHH--------------HCCCEEEECCHHHHHH Q ss_conf 613554445458376896199499--989844-------3-799999999--------------7898699579899999 Q gi|254780402|r 326 IGRSFCASGGQNPLEAAMLGCAIL--SGPNVE-------N-FRDIYRRMV--------------SSGAVRIVEEVGTLAD 381 (440) Q Consensus 326 VGGSl~~~GGhN~lEpaa~G~pVi--~GP~~~-------n-f~e~~~~L~--------------~~g~~~~v~d~~eL~~ 381 (440) ||-| --|-.||.|----|=+ .|=++. . .+++++... =.|...=+-+...|.+ T Consensus 253 MGRs----ASHiaLECALqThPN~ciigEEv~~Kk~tL~~~~~~I~d~i~~Ra~~G~NfGvvLiPEGLIEFIPEv~~L~~ 328 (566) T TIGR02477 253 MGRS----ASHIALECALQTHPNVCIIGEEVAAKKMTLSQVTDYIADVIVKRAAKGKNFGVVLIPEGLIEFIPEVQALIK 328 (566) T ss_pred CCCC----HHHHHHHHHCCCCCCEEEHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCEEEEEECCCCEECCHHHHHHHH T ss_conf 3763----158999873214797675123355325848899999999999998568942799976985420266889999 Q ss_pred HHHHHHCCHHHH---HHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 999995898999---9999999999970455699999999 Q gi|254780402|r 382 MVYSLLSEPTIR---YEMINAAINEVKKMQGPLKITLRSL 418 (440) Q Consensus 382 ~l~~ll~d~~~~---~~~~~~a~~~i~~~~ga~~~~~~~i 418 (440) .|+.+|.+...- -+......-+.++.+....++.+.| T Consensus 329 ELn~~L~~~~~~F~~i~~~~~~~~~~~kLS~~~~~~f~~L 368 (566) T TIGR02477 329 ELNNLLAQNVEEFSKIKFEGRKDLVKSKLSPESKALFESL 368 (566) T ss_pred HHHHHHHCCCCHHCCCCHHHHHHHHHHHCCHHHHHHHHHC T ss_conf 9986540476200146778899999862688899997407 No 217 >TIGR02733 desat_CrtD C-3',4' desaturase CrtD; InterPro: IPR014104 Members of this family are encoded by slr1293, a carotenoid biosynthesis protein which was shown to be the C-3',4' desaturase (CrtD) of myxoxanthophyll biosynthesis in Synechocystis sp. (strain PCC 6803), and close homologues (presumed to be functionally equivalent) from other cyanobacteria, where myxoxanthophyll biosynthesis is either known or expected. This enzyme can act on neurosporene and so presumably catalyses the first step that is committed to myxoxanthophyll.. Probab=39.21 E-value=12 Score=17.28 Aligned_cols=77 Identities=12% Similarity=0.110 Sum_probs=35.3 Q ss_pred CCCEEECCCCCCHHHHHHHHHHCCCEE-EECCHH------HHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 994999898443799999999789869-957989------9999999995898999999999999997045569999999 Q gi|254780402|r 345 GCAILSGPNVENFRDIYRRMVSSGAVR-IVEEVG------TLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRS 417 (440) Q Consensus 345 G~pVi~GP~~~nf~e~~~~L~~~g~~~-~v~d~~------eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~ 417 (440) ..|+.--||++-..|...-+-+++-++ .|+.+. --++.|-....|.+.--. .+..+|..+.+..++.+.+. T Consensus 333 alP~~cp~HlQf~~D~~GP~gENNSLFVSiS~egDGRAP~G~~TlIAS~FTd~~~W~~--l~~~~Y~~~K~~y~~~~~~~ 410 (499) T TIGR02733 333 ALPVDCPPHLQFLSDHQGPIGENNSLFVSISQEGDGRAPAGEATLIASSFTDVADWSS--LDEEDYTAKKKQYTETILER 410 (499) T ss_pred HCCCCCCCCCEEECCCCCCCCCCCCEEEEECCCCCCCCCCCCEEEEEEECCCCHHHCC--CCHHHHHHHHHHHHHHHHHH T ss_conf 3777787860450078888200364027865786888775753678742137122117--72678999999999999998 Q ss_pred HHHHHH Q ss_conf 997523 Q gi|254780402|r 418 LDSYVN 423 (440) Q Consensus 418 i~~~l~ 423 (440) |+.++| T Consensus 411 L~~~Fd 416 (499) T TIGR02733 411 LGEYFD 416 (499) T ss_pred HHHHCC T ss_conf 432067 No 218 >KOG0832 consensus Probab=38.13 E-value=28 Score=15.00 Aligned_cols=27 Identities=15% Similarity=0.142 Sum_probs=19.0 Q ss_pred CCEEEECCCCCCCCCCHHHHHHHCCCEEE Q ss_conf 82066135544454583768961994999 Q gi|254780402|r 322 EIAFIGRSFCASGGQNPLEAAMLGCAILS 350 (440) Q Consensus 322 diafVGGSl~~~GGhN~lEpaa~G~pVi~ 350 (440) |+++|-.++.++- -++||+-.+.|+|- T Consensus 175 D~vvvln~~e~~s--AilEA~K~~IPTIg 201 (251) T KOG0832 175 DLVVVLNPEENHS--AILEAAKMAIPTIG 201 (251) T ss_pred CEEEECCCCCCCH--HHHHHHHHCCCEEE T ss_conf 6267627544318--99999970798588 No 219 >TIGR00314 cdhA CO dehydrogenase/acetyl-CoA synthase complex, epsilon subunit; InterPro: IPR004460 Acetyl-CoA decarbonylase/synthase (ACDS) is a multienzyme complex (carbon monoxide dehydrogenase is a synonym). The ACDS complex carries out an unusual reaction involving the reversible cleavage and synthesis of acetyl-CoA in methanogens. The family contains the 4Fe-4S ferredoxin domain [C-x(2)-C-x(2)-C-x(3)-C-[PEG]] between residues 448-462 of the model.; GO: 0018492 carbon-monoxide dehydrogenase (acceptor) activity, 0006084 acetyl-CoA metabolic process. Probab=38.01 E-value=20 Score=15.87 Aligned_cols=97 Identities=20% Similarity=0.203 Sum_probs=60.5 Q ss_pred CCCCEEEEEECCHHHHHHHHHHHHHHHHCCC--CEEE--EECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCC Q ss_conf 8887799981789999999999999986199--8999--96277630012101477416885055720134455744202 Q gi|254780402|r 54 PIGPLIWFHASSVGETMALIGLIPAIRSRHV--NVLL--TTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKP 129 (440) Q Consensus 54 ~~~~~IW~HaaSvGE~~~a~pli~~l~~~~~--~ili--T~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P 129 (440) .++++|.+=+ =.+.-+.++++.|.+... +|=| -|-|. .++.+-+-...+ +-|+-.. =||++.==| T Consensus 238 ~sKp~ilviG---Hnv~p~~~i~dYleEn~~eD~ve~~GiCCtA--~D~TRy~~kaKv---vgpls~Q---L~~iRaG~a 306 (795) T TIGR00314 238 KSKPVILVIG---HNVLPGADIADYLEENGMEDEVELCGICCTA--IDMTRYNEKAKV---VGPLSRQ---LRVIRAGVA 306 (795) T ss_pred CCCCEEEEEC---CCCCCHHHHHHHHHHCCCCCCEEEECCCCCC--CCHHHHCCCCCE---ECCCHHC---EEEEECCCC T ss_conf 6788899966---8564512578887516840304530300043--101001047834---5320101---044351787 Q ss_pred EEEEEECCCCHHHHHHHHHHCCCCEEEEEECC Q ss_conf 03564047621467888742245505864126 Q gi|254780402|r 130 DCMILSESDIWPLTVFELSKQRIPQVLVNARM 161 (440) Q Consensus 130 ~~~i~~e~ElWPnli~~~~~~~ip~~linarl 161 (440) |++|.=|-=+.--+|.+++|.|+|++--|-.+ T Consensus 307 DV~v~DEQCiRaD~Le~~~K~g~~~IaT~~k~ 338 (795) T TIGR00314 307 DVIVVDEQCIRADILEEVKKLGIPLIATNDKA 338 (795) T ss_pred CEEEECCCCHHHHHHHHHHHCCCCEEECCHHH T ss_conf 37997473012789999986586067341788 No 220 >cd06317 PBP1_ABC_sugar_binding_like_8 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. Pperiplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=37.94 E-value=28 Score=14.98 Aligned_cols=45 Identities=22% Similarity=0.387 Sum_probs=32.4 Q ss_pred HHHHHHHHCCEEEEE--ECCCCHHHHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 445574420203564--0476214678887422455058641266542 Q gi|254780402|r 120 VSRFLKYWKPDCMIL--SESDIWPLTVFELSKQRIPQVLVNARMSRRS 165 (440) Q Consensus 120 ~~~fl~~~~P~~~i~--~e~ElWPnli~~~~~~~ip~~linarls~~S 165 (440) ++.++.+ +++.+++ ...+--...+..+...|||++.+|....+.. T Consensus 49 i~~~i~~-~vd~Iii~p~d~~~~~~~i~~a~~agIpVv~~d~~~~~~~ 95 (275) T cd06317 49 VEDLIAQ-KVDGIILWPTDGQAYIPGLRKAKQAGIPVVITNSNISEKG 95 (275) T ss_pred HHHHHHC-CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEECCCCCCCC T ss_conf 9999975-9999999678712457999999986994999768887667 No 221 >TIGR01487 SPP-like SPP-like hydrolase, Archaeal; InterPro: IPR006382 This group of Archaeal sequences is most closely related to the sucrose-phosphate phosphatases (SPP) from plants and cyanobacteria. If it should be shown that the proteins in this family possess sucrose-phosphatase activity then SPP would represent a broad family of functionally related proteins.. Probab=37.89 E-value=16 Score=16.54 Aligned_cols=22 Identities=18% Similarity=0.167 Sum_probs=17.4 Q ss_pred HHHHHHHHCCCCEEEEEECCCC Q ss_conf 6788874224550586412665 Q gi|254780402|r 142 LTVFELSKQRIPQVLVNARMSR 163 (440) Q Consensus 142 nli~~~~~~~ip~~linarls~ 163 (440) ..|+.+.+.|||++|+.|+.++ T Consensus 27 e~iRk~~k~GI~V~LvtGN~~~ 48 (223) T TIGR01487 27 EAIRKAEKKGIKVSLVTGNVVP 48 (223) T ss_pred HHHHHHHHCCCEEEEEECCHHH T ss_conf 9998675278369998088279 No 222 >TIGR02201 heptsyl_trn_III lipopolysaccharide heptosyltransferase III, putative; InterPro: IPR011916 This family consists of examples of the putative ADP-heptose:LPS heptosyltransferase III, an enzyme of LPS inner core region biosynthesis. LPS, composed of lipid A, a core region, and O antigen, is found in the outer membrane of Gram-negative bacteria. This enzyme may be less widely distributed than heptosyltransferases I and II.. Probab=37.44 E-value=24 Score=15.40 Aligned_cols=40 Identities=25% Similarity=0.394 Sum_probs=29.7 Q ss_pred CCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEE--ECCCCC Q ss_conf 7652025542208206613554445458376896199499--989844 Q gi|254780402|r 310 TIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAIL--SGPNVE 355 (440) Q Consensus 310 t~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi--~GP~~~ 355 (440) |+-+|+.+--.|.+ |||===+| -| =|||.|+|.| |||.-- T Consensus 253 Tl~~LaA~IDhArl-FIGVDSvP--mH---~AAAlg~plvaLFGpsk~ 294 (347) T TIGR02201 253 TLRQLAAVIDHARL-FIGVDSVP--MH---MAAALGTPLVALFGPSKL 294 (347) T ss_pred CHHHHHHHHCCCEE-EECCCCHH--HH---HHHHCCCCEEEECCCCCC T ss_conf 55666557311103-42237767--89---998618871750177566 No 223 >pfam09353 DUF1995 Domain of unknown function (DUF1995). This family of proteins are functionally uncharacterized. Probab=37.34 E-value=29 Score=14.92 Aligned_cols=94 Identities=18% Similarity=0.153 Sum_probs=47.5 Q ss_pred HHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCC--HHHHHHH Q ss_conf 99999999999861998999962776300121014774168850557201344557442020356404762--1467888 Q gi|254780402|r 69 TMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDI--WPLTVFE 146 (440) Q Consensus 69 ~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~El--WPnli~~ 146 (440) ...+..+++.+.+.+..+.+.+-.....+.++..+++....+ .|+.....+.-..=..+++|++--.- =+-+-.. T Consensus 39 ~~la~~~~~~l~~~~~~~~~~fpDaGaaalA~r~~~~~~~~i---~~~~d~~~~~~~~~~d~~~i~v~P~~~~~~~ve~~ 115 (205) T pfam09353 39 MPLALRLARALAEKGTRLLVVFPDAGAAALARRDWGDLAFKI---LGLSDLSGRKEIEESDDLLILVAPQPSDLEEVEAL 115 (205) T ss_pred CHHHHHHHHHHHHCCCEEEEEECCCCHHHHHCCCCCCCHHHC---CCCCCCCCCCCCCCCCCEEEEECCCHHHHHHHHHH T ss_conf 299999999987539705999569758888507564314450---24110134456789985899981874229999999 Q ss_pred HHHC-CCCEEEEEECCCCCC Q ss_conf 7422-455058641266542 Q gi|254780402|r 147 LSKQ-RIPQVLVNARMSRRS 165 (440) Q Consensus 147 ~~~~-~ip~~linarls~~S 165 (440) |... +-|++|+|+|+.+-. T Consensus 116 ~e~~~~rpvvllNprLed~~ 135 (205) T pfam09353 116 CELAGGRPVIMLNPRLEDAG 135 (205) T ss_pred HHHHCCCEEEEECCCCCCCC T ss_conf 99837975999867534566 No 224 >PRK13228 consensus Probab=37.20 E-value=29 Score=14.90 Aligned_cols=61 Identities=13% Similarity=0.177 Sum_probs=41.0 Q ss_pred HCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHC Q ss_conf 208206613554445458376896199499989844379999999978986995798999999999958 Q gi|254780402|r 320 MTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLS 388 (440) Q Consensus 320 ~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~ 388 (440) -.+.++||-|..+ +.-|-++|..+|.=++-++-. ++|.+.|+-..|+|.+||.+.|...=+ T Consensus 169 p~~~v~VgDs~~d-----i~aA~~AGi~~i~V~~G~~~~---~~l~~~~ad~vv~~l~el~~lL~~~~~ 229 (232) T PRK13228 169 PANMVLVGDSAND-----VIAARAAGCPVFCVPYGYADG---HGLAALDCDAIVDSLPEAYALLTPQRN 229 (232) T ss_pred CCCEEEECCCHHH-----HHHHHHCCCEEEEECCCCCCH---HHHHHCCCCEEECCHHHHHHHHHHCCC T ss_conf 3408988488879-----999999599399987999997---789878999999999999999751614 No 225 >PRK13224 consensus Probab=37.14 E-value=29 Score=14.90 Aligned_cols=60 Identities=18% Similarity=0.200 Sum_probs=38.1 Q ss_pred HCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHC Q ss_conf 208206613554445458376896199499989844379999999978986995798999999999958 Q gi|254780402|r 320 MTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLS 388 (440) Q Consensus 320 ~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~ 388 (440) ..+.++||-|-.+ ..=|.++|+++|.=.+= |.+ ..+.+.+....++|..||.+.+.+||+ T Consensus 157 ~~~~l~VGDs~~D-----~~aA~~AGi~~i~v~~G--y~~--~~~~~~~~~~~i~~~~el~~~l~~l~~ 216 (216) T PRK13224 157 DEPVLYVGDSEVD-----AETAENAGVPFLLFTEG--YRK--APVHELPHHGAFDDHAELPDLVARLLA 216 (216) T ss_pred CCCEEEECCCHHH-----HHHHHHHCCCEEEECCC--CCC--CCHHHCCCCCHHCCHHHHHHHHHHHHC T ss_conf 9998997598878-----99999949988997779--799--896773846261799989999999849 No 226 >TIGR01916 F420_puta_dedox F420-dependent oxidoreductase, putative; InterPro: IPR008225 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.. Probab=37.03 E-value=29 Score=14.89 Aligned_cols=40 Identities=23% Similarity=0.454 Sum_probs=28.0 Q ss_pred CCEEEEEEECCCCH-HHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 41248897055211-47888776530010110013444446641675067652 Q gi|254780402|r 262 TDVLTIIVPRHPRR-CDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIGE 313 (440) Q Consensus 262 ~~~~lIIvPRhpeR-~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~Ge 313 (440) ++-.+++-|+.|++ +..|.+.+++. ...+.-|+|-||+|= T Consensus 125 ~~G~~llLP~dPD~SA~~ir~~L~el------------~G~~VgVIITDT~GR 165 (260) T TIGR01916 125 EEGELLLLPEDPDASAEKIRRGLREL------------LGKDVGVIITDTNGR 165 (260) T ss_pred CCCEEECCCCCHHHHHHHHHHHHHHH------------HCCEEEEEEECCCCC T ss_conf 78567358988378999999999986------------398478999718888 No 227 >PRK13116 consensus Probab=36.35 E-value=30 Score=14.82 Aligned_cols=66 Identities=11% Similarity=0.108 Sum_probs=36.6 Q ss_pred HCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHH-------HHHHHHHHHHHHHHHCCCHH Q ss_conf 199499989844379999999978986995798999999999958989-------99999999999997045569 Q gi|254780402|r 344 LGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPT-------IRYEMINAAINEVKKMQGPL 411 (440) Q Consensus 344 ~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~-------~~~~~~~~a~~~i~~~~ga~ 411 (440) ..+||+.|=-+++-.++.+.+...--+.+|. ..+.+.+.+-+++++ ....+.+.-.++++..+.|+ T Consensus 205 t~~Pv~vGFGIs~~e~v~~~~~~~aDGVIVG--SAiVk~Ie~~~~~~~~~~~~~~~~~~~~~~l~~~v~~~k~~~ 277 (278) T PRK13116 205 DGAPILLGFGISSPQHVADAIAAGASGAITG--SAITKIIASHCEGEHPNPSTIRDMDGLKKDLTEFISAMKAAT 277 (278) T ss_pred CCCCEEEECCCCCHHHHHHHHHCCCCEEEEC--HHHHHHHHHCCCCCCCCCHHHCCHHHHHHHHHHHHHHHHHHC T ss_conf 7998799816798999999986689999987--799999985276555550112369999999999999987652 No 228 >pfam08267 Meth_synt_1 Cobalamin-independent synthase, N-terminal domain. The N-terminal domain and C-terminal domains of cobalamin-independent synthases together define a catalytic cleft in the enzyme. The N-terminal domain is thought to bind the substrate, in particular, the negatively charged polyglutamate chain. The N-terminal domain is also thought to stabilize a loop from the C-terminal domain. Probab=36.28 E-value=24 Score=15.46 Aligned_cols=43 Identities=21% Similarity=0.332 Sum_probs=21.8 Q ss_pred CCEEEEECCCCCCHHHHHHC---CCEEEECCCCCCCCCCHHHHHHHCC Q ss_conf 64167506765202554220---8206613554445458376896199 Q gi|254780402|r 302 EVDIFLGDTIGEMGFYLRMT---EIAFIGRSFCASGGQNPLEAAMLGC 346 (440) Q Consensus 302 ~~~V~i~Dt~GeL~~lY~~A---diafVGGSl~~~GGhN~lEpaa~G~ 346 (440) +.++++.--+|.+...|... ++.-+|=-|+ +|..| +|....|. T Consensus 229 ~~kill~TYFg~~~~~~~~l~~lpv~gl~lDlv-~~~~~-l~~v~~~~ 274 (310) T pfam08267 229 GLKLLLATYFGSVADALELLASLPVDGLGLDLV-RGPEN-LAALKKGF 274 (310) T ss_pred CCCEEEECCCCCHHHHHHHHHCCCCCEEEEECC-CCHHH-HHHHHHHC T ss_conf 986899825567888999997699747998605-78657-99999618 No 229 >PRK11052 malQ 4-alpha-glucanotransferase; Provisional Probab=36.10 E-value=11 Score=17.54 Aligned_cols=22 Identities=14% Similarity=0.282 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHHHHCCCCEE Q ss_conf 9999999999999986199899 Q gi|254780402|r 66 VGETMALIGLIPAIRSRHVNVL 87 (440) Q Consensus 66 vGE~~~a~pli~~l~~~~~~il 87 (440) +|.+-....|++...+.+-+++ T Consensus 161 iGDf~DL~~l~~~aa~~Gadfv 182 (694) T PRK11052 161 IGDFGDLKQMLVDVAKRGGDFI 182 (694) T ss_pred CCCHHHHHHHHHHHHHCCCCEE T ss_conf 7789999999999997699879 No 230 >pfam03192 DUF257 Pyrococcus protein of unknown function, DUF257. Probab=36.00 E-value=30 Score=14.78 Aligned_cols=39 Identities=18% Similarity=0.180 Sum_probs=33.1 Q ss_pred HHHHHHHCCEEEEEECCC-------CHHHHHHHHHHCCCCEEEEEE Q ss_conf 455744202035640476-------214678887422455058641 Q gi|254780402|r 121 SRFLKYWKPDCMILSESD-------IWPLTVFELSKQRIPQVLVNA 159 (440) Q Consensus 121 ~~fl~~~~P~~~i~~e~E-------lWPnli~~~~~~~ip~~lina 159 (440) ..|++..+|.=.+++|++ +|..++..++++|+|+++.+- T Consensus 2 ~e~~~~~k~GEtVLvEysS~~~~~l~~~~ii~~~~e~g~~ilI~Di 47 (210) T pfam03192 2 EEFIDKIKFGETVLIEYSSSSIPELLFYEIIKWAKEKGIPIVIDDI 47 (210) T ss_pred HHHHHCCCCCCEEEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 5687268899889999479984788999999999976993899954 No 231 >PRK09257 aromatic amino acid aminotransferase; Provisional Probab=35.33 E-value=31 Score=14.71 Aligned_cols=34 Identities=21% Similarity=0.309 Sum_probs=29.8 Q ss_pred HHHHHHHCCEEEEEECCCCHHHHHHHHHHCCCCE Q ss_conf 4557442020356404762146788874224550 Q gi|254780402|r 121 SRFLKYWKPDCMILSESDIWPLTVFELSKQRIPQ 154 (440) Q Consensus 121 ~~fl~~~~P~~~i~~e~ElWPnli~~~~~~~ip~ 154 (440) -.|++++.|...||+...-|||....++..|.++ T Consensus 111 a~fl~~~~p~~~v~is~PtW~Nh~~If~~aGl~v 144 (397) T PRK09257 111 ADFLKRAFPDAKVWVSDPTWPNHRAIFEAAGLEV 144 (397) T ss_pred HHHHHHHCCCCEEEECCCCCHHHHHHHHHCCCEE T ss_conf 9999985899869976999620389999779805 No 232 >PRK12735 elongation factor Tu; Reviewed Probab=35.16 E-value=31 Score=14.70 Aligned_cols=55 Identities=13% Similarity=-0.001 Sum_probs=41.9 Q ss_pred CHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCC Q ss_conf 2146788874224550586412665421210011112331026442366415778889986088505872043101234 Q gi|254780402|r 139 IWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTES 217 (440) Q Consensus 139 lWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~ 217 (440) +=||||.-+.....-+++|+|+- .++.|+.+.....+.||++++.|.= .|-|... T Consensus 88 fiknMI~Ga~~aD~alLVV~A~~-----------------------G~~~QTrEHl~l~~~lgv~~~iV~v-nK~D~v~ 142 (396) T PRK12735 88 YVKNMITGAAQMDGAILVVSAAD-----------------------GPMPQTREHILLARQVGVPYIVVFL-NKCDMVD 142 (396) T ss_pred HHHHHHHCCCCCCEEEEEEECCC-----------------------CCCHHHHHHHHHHHHHCCCEEEEEE-ECCCCCC T ss_conf 77666410042567999998687-----------------------8753169999999983998589999-8758888 No 233 >PRK02145 consensus Probab=35.14 E-value=31 Score=14.69 Aligned_cols=46 Identities=17% Similarity=0.216 Sum_probs=21.7 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECC Q ss_conf 15778889986088505872043101234422125689999861798617504 Q gi|254780402|r 189 QSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAI 241 (440) Q Consensus 189 qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vag 241 (440) .|.++++++.+.|+++|.+..-. -.+++....+.+.++.+.|.++- T Consensus 85 rs~e~~~~ll~~GadkVii~s~a-------~~np~~v~~~~~~fG~q~Iv~si 130 (257) T PRK02145 85 RAVEDVRRLLNAGADKVSMNSSA-------VANPQLVRDAADKYGSQCIVVAI 130 (257) T ss_pred CCHHHHHHHHHCCCCEEEEHHHH-------HHCCCHHHHHHHHCCCCCEEEEE T ss_conf 04688999998199889841556-------65930224578766983449999 No 234 >cd01541 PBP1_AraR Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. Ligand-binding domain of DNA transcription repressor specific for arabinose (AraR) which is a member of the LacI-GalR family of bacterial transcription regulators. The ligand-binding domain of AraR is structurally homologous to the periplasmic sugar-binding domain of ABC-type transporters and both domains contain the type I periplasmic binding protein-like fold. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the type I periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which i Probab=34.92 E-value=31 Score=14.67 Aligned_cols=103 Identities=20% Similarity=0.303 Sum_probs=58.7 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEEC--CCCH-H--HHH Q ss_conf 9999999999861998999962776300121014774168850557201344557442020356404--7621-4--678 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSE--SDIW-P--LTV 144 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e--~ElW-P--nli 144 (440) ..+..+-+.+.+.+.+++++++..+-... + ..++.+++ -+++.+|++- +... | .++ T Consensus 16 ~~~~gi~~~~~~~gy~~li~~s~~~~~~e-~-----------------~~l~~l~~-~~vdGiIl~~~~~~~~~~~~~~~ 76 (273) T cd01541 16 SIIRGIESVLSEKGYSLLLASTNNDPERE-R-----------------KCLENMLS-QGIDGLIIEPTKSALPNPNIDLY 76 (273) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCCHHHH-H-----------------HHHHHHHH-CCCCEEEEECCCCCCCCCCHHHH T ss_conf 99999999999869989999789998999-9-----------------99999996-59998999253103578777999 Q ss_pred HHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECC----HHHHHHHHHHCCCCEEEE Q ss_conf 8874224550586412665421210011112331026442366415----778889986088505872 Q gi|254780402|r 145 FELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQS----ERYFRRYKELGAQKLIVS 208 (440) Q Consensus 145 ~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs----~~~~~rl~~lG~~~i~v~ 208 (440) ..+++.++|+++++..+.+... ..|.+-+ ....++|.+.|-+++-.. T Consensus 77 ~~l~~~~iPvV~i~~~~~~~~~-----------------~~V~~Dn~~a~~~a~~~L~~~G~~~i~~i 127 (273) T cd01541 77 LKLEKLGIPYVFINASYEELNF-----------------PSLVLDDEKGGYKATEYLIELGHRKIAGI 127 (273) T ss_pred HHHHHCCCCEEEEECCCCCCCC-----------------CEEEECCHHHHHHHHHHHHHCCCCCEEEE T ss_conf 9999769989999567788999-----------------88997859999999999985699608998 No 235 >cd02013 TPP_Xsc_like Thiamine pyrophosphate (TPP) family, Xsc-like subfamily, TPP-binding module; composed of proteins similar to Alcaligenes defragrans sulfoacetaldehyde acetyltransferase (Xsc). Xsc plays a key role in the degradation of taurine, catalyzing the desulfonation of 2-sulfoacetaldehyde into sulfite and acetyl phosphate. This enzyme requires TPP and divalent metal ions for activity. Probab=34.90 E-value=31 Score=14.67 Aligned_cols=18 Identities=6% Similarity=0.023 Sum_probs=10.5 Q ss_pred EEEECCHHHHHHHHHHHH Q ss_conf 699579899999999995 Q gi|254780402|r 370 VRIVEEVGTLADMVYSLL 387 (440) Q Consensus 370 ~~~v~d~~eL~~~l~~ll 387 (440) +..|++++||..++.+-+ T Consensus 146 g~~V~~~~el~~al~~Al 163 (196) T cd02013 146 GITVDKPEDVGPALQKAI 163 (196) T ss_pred EEEECCHHHHHHHHHHHH T ss_conf 899789999999999999 No 236 >cd01143 YvrC Periplasmic binding protein YvrC. These proteins are predicted to function as initial receptors in ABC transport of metal ions in eubacteria and archaea. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. Probab=34.77 E-value=31 Score=14.65 Aligned_cols=33 Identities=33% Similarity=0.387 Sum_probs=18.2 Q ss_pred HHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 4202035640476214678887422455058641 Q gi|254780402|r 126 YWKPDCMILSESDIWPLTVFELSKQRIPQVLVNA 159 (440) Q Consensus 126 ~~~P~~~i~~e~ElWPnli~~~~~~~ip~~lina 159 (440) ..+||+++.. +...+..+..+.+.|||++.++. T Consensus 58 ~l~PDLVi~~-~~~~~~~~~~L~~~gi~v~~~~~ 90 (195) T cd01143 58 ALKPDLVIVS-SSSLAELLEKLKDAGIPVVVLPA 90 (195) T ss_pred CCCCCEEEEE-CCCCHHHHHHHHHCCCEEEEECC T ss_conf 1699999982-78747799998641885999758 No 237 >pfam04392 ABC_sub_bind ABC transporter substrate binding protein. This family contains many hypothetical proteins and some ABC transporter substrate binding proteins. Probab=34.66 E-value=31 Score=14.64 Aligned_cols=38 Identities=5% Similarity=0.140 Sum_probs=30.5 Q ss_pred EEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHH Q ss_conf 981789999999999999986199899996277630012 Q gi|254780402|r 61 FHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVA 99 (440) Q Consensus 61 ~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~ 99 (440) ++-| -|+...+..+++++..++|++++|..||...... T Consensus 38 ~~na-~gd~~~~~~ia~~l~~~~~Dli~a~~Tpaa~a~~ 75 (292) T pfam04392 38 VKNA-EGDPSKAAQIARQLVTDKNDLIIGIATPVAQILK 75 (292) T ss_pred EEEC-CCCHHHHHHHHHHHHHCCCCEEEECCCHHHHHHH T ss_conf 9507-8999999999999973799899987509999999 No 238 >cd01977 Nitrogenase_VFe_alpha Nitrogenase_VFe_alpha -like: Nitrogenase VFe protein, alpha subunit like. This group contains proteins similar to the alpha subunits of, the VFe protein of the vanadium-dependent (V-) nitrogenase and the FeFe protein of the iron only (Fe-) nitrogenase Nitrogenase catalyzes the ATP-dependent reduction of dinitrogen (N2) to ammonia. In addition to V- and Fe- nitrogenases there is a molybdenum (Mo)-dependent nitrogenase which is the most widespread and best characterized of these systems. These systems consist of component 1 (VFe protein, FeFe protein or, MoFe protein respectively) and, component 2 (Fe protein). MoFe is an alpha2beta2 tetramer, V-and Fe- nitrogenases are alpha2beta2delta2 hexamers. The alpha and beta subunits of VFe and FeFe are similar to the alpha and beta subunits of MoFe. For MoFe each alphabeta pair contains one P-cluster (at the alphabeta interface) and, one molecule of iron molybdenum cofactor (FeMoco) contained within the alpha sub Probab=34.50 E-value=32 Score=14.63 Aligned_cols=25 Identities=4% Similarity=-0.126 Sum_probs=13.8 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHCCCEEEC Q ss_conf 0820661355444545837689619949998 Q gi|254780402|r 321 TEIAFIGRSFCASGGQNPLEAAMLGCAILSG 351 (440) Q Consensus 321 AdiafVGGSl~~~GGhN~lEpaa~G~pVi~G 351 (440) ..=.++|||.... - +..+|.|.+-- T Consensus 358 ~pDlliG~~~~~~---i---a~klgiP~~~~ 382 (415) T cd01977 358 KPDIILTGPRVGE---L---VKKLHVPYVNI 382 (415) T ss_pred CCCEEEECCCCCE---E---EEECCCCEEEE T ss_conf 9999996587441---1---02248987980 No 239 >PRK00994 F420-dependent methylenetetrahydromethanopterin dehydrogenase; Provisional Probab=34.33 E-value=32 Score=14.61 Aligned_cols=33 Identities=33% Similarity=0.547 Sum_probs=23.4 Q ss_pred HHHHHCCEEEEEECCCCHHH--------HHHHHHHCCCCEEEEEE Q ss_conf 57442020356404762146--------78887422455058641 Q gi|254780402|r 123 FLKYWKPDCMILSESDIWPL--------TVFELSKQRIPQVLVNA 159 (440) Q Consensus 123 fl~~~~P~~~i~~e~ElWPn--------li~~~~~~~ip~~lina 159 (440) -++.|+|+++|++. || -=..++..|||++.|+- T Consensus 54 ~~~~~~pDf~i~is----PN~a~PGP~~ARE~l~~~~iP~IvI~D 94 (276) T PRK00994 54 KMKEWKPDFIIVIS----PNPAAPGPTKAREILSAAGIPCIVIGD 94 (276) T ss_pred HHHHHCCCEEEEEC----CCCCCCCCHHHHHHHHHCCCCEEEECC T ss_conf 98840899899978----998899956799999756998899748 No 240 >cd01407 SIR2-fam SIR2 family of proteins includes silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation, where the acetyl group from the lysine epsilon-amino group is transferred to the ADP-ribose moiety of NAD+, producing nicotinamide and the novel metabolite O-acetyl-ADP-ribose. Sir2 proteins, also known as sirtuins, are found in all eukaryotes and many archaea and prokaryotes and have been shown to regulate gene silencing, DNA repair, metabolic enzymes, and life span. The most-studied function, gene silencing, involves the inactivation of chromosome domains containing key regulatory genes by packaging them into a specialized chromatin structure that is inaccessible to DNA-binding proteins. The oligomerization state of Sir2 appears to be organism-dependent, sometimes occurring as a monomer and sometimes as a multimer. Probab=34.17 E-value=32 Score=14.59 Aligned_cols=29 Identities=24% Similarity=0.303 Sum_probs=16.3 Q ss_pred CCEEEECCHHHHHHHHHHCCCC-EEEECCCCC Q ss_conf 4236641577888998608850-587204310 Q gi|254780402|r 183 FSLVIVQSERYFRRYKELGAQK-LIVSGNLKI 213 (440) Q Consensus 183 ~~~i~~qs~~~~~rl~~lG~~~-i~v~GnlK~ 213 (440) ...+++|+-.... .+-|.++ +.+-||+.. T Consensus 80 ~~~viTQNIDgLh--~~AG~~~vielHG~~~~ 109 (218) T cd01407 80 LKRVITQNVDGLH--QRAGSPKVIELHGSLFR 109 (218) T ss_pred CCEEEECCCHHHH--HHCCCCHHHHHHCCCCE T ss_conf 7548952607778--77599334076486364 No 241 >pfam10820 DUF2543 Protein of unknown function (DUF2543). This family of proteins with unknown function appear to be restricted to Enterobacteriaceae. The family has a highly conserved sequence. Probab=33.23 E-value=33 Score=14.49 Aligned_cols=65 Identities=18% Similarity=0.225 Sum_probs=40.5 Q ss_pred HHHHHHHHCCCEEEECCHH--HHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH Q ss_conf 9999999789869957989--99999999958989999999999999970455699999999975237 Q gi|254780402|r 359 DIYRRMVSSGAVRIVEEVG--TLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYVNP 424 (440) Q Consensus 359 e~~~~L~~~g~~~~v~d~~--eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l~~ 424 (440) +++++ .....+..|++++ .|+...+-|+..--..++....|++-..+.+|.-++-++.|-.+||+ T Consensus 11 Di~dE-Y~tE~a~pVs~~Er~~LAhYFQlLitRLmnneEIsEeaQ~eMA~eAgi~~~RID~IA~FLNq 77 (81) T pfam10820 11 DIADE-YSTETAEPVSDSEREPLAHYFQLLITRLMNNEEISEEAQHEMAREAGINENRIDEIATFLNQ 77 (81) T ss_pred HHHHH-HHHHHCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 43888-87752367425665189999999999980407523999999999729977768999999987 No 242 >PRK06456 acetolactate synthase catalytic subunit; Reviewed Probab=33.23 E-value=33 Score=14.49 Aligned_cols=38 Identities=26% Similarity=0.220 Sum_probs=26.5 Q ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 77999817899999999999999861998999962776 Q gi|254780402|r 57 PLIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTAT 94 (440) Q Consensus 57 ~~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~t 94 (440) +-+-+=.+..|..+++..+......+-|=++||-..++ T Consensus 69 ~gv~~~t~GPG~~N~~~gl~~A~~d~~Pvl~i~G~~~~ 106 (572) T PRK06456 69 PGVCTATSGPGVTNLVTGLITAYWDSSPVVAITGQVVR 106 (572) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCEEEEECCCCH T ss_conf 88999803636999999999999749998999578872 No 243 >TIGR02885 spore_sigF RNA polymerase sigma-F factor; InterPro: IPR014236 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . RNA polymerase recruits alternative sigma factors as a means of switching on specific regulons. Most bacteria express a multiplicity of sigma factors. Two of these factors, sigma-70 (gene rpoD), generally known as the major or primary sigma factor, and sigma-54 (gene rpoN or ntrA) direct the transcription of a wide variety of genes. The other sigma factors, known as alternative sigma factors, are required for the transcription of specific subsets of genes. With regard to sequence similarity, sigma factors can be grouped into two classes, the sigma-54 and sigma-70 families. Sequence alignments of the sigma70 family members reveal four conserved regions that can be further divided into subregions eg. sub-region 2.2, which may be involved in the binding of the sigma factor to the core RNA polymerase; and sub-region 4.2, which seems to harbor a DNA-binding 'helix-turn-helix' motif involved in binding the conserved -35 region of promoters recognized by the major sigma factors , . Members of this entry represent RNA polymerase sigma factor F. Within the Firmicutes, Sigma-F is specifically, and universally, a component of the endospore formation program, and is expressed in the forespore to turn on the expression of dozens of genes. It is closely related to sigma-G, which is also expressed in the forespore.; GO: 0003677 DNA binding, 0003700 transcription factor activity, 0016987 sigma factor activity, 0006352 transcription initiation, 0006355 regulation of transcription DNA-dependent. Probab=33.06 E-value=29 Score=14.83 Aligned_cols=35 Identities=17% Similarity=0.365 Sum_probs=14.7 Q ss_pred HCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHH Q ss_conf 199499989844379999999978986995798999999999 Q gi|254780402|r 344 LGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYS 385 (440) Q Consensus 344 ~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ 385 (440) |.+|.|+| |+-+-|.+.|...+-++-.+++..+.. T Consensus 67 YAVPmI~G-------EIkRFLRDDG~ikVSRSlK~la~k~~~ 101 (231) T TIGR02885 67 YAVPMIMG-------EIKRFLRDDGIIKVSRSLKELARKIRY 101 (231) T ss_pred CCCCHHHH-------HHHHHHCCCCCEEEEHHHHHHHHHHHH T ss_conf 12220221-------222310268855751527899999987 No 244 >cd00423 Pterin_binding Pterin binding enzymes. This family includes dihydropteroate synthase (DHPS) and cobalamin-dependent methyltransferases such as methyltetrahydrofolate, corrinoid iron-sulfur protein methyltransferase (MeTr) and methionine synthase (MetH). DHPS, a functional homodimer, catalyzes the condensation of p-aminobenzoic acid (pABA) in the de novo biosynthesis of folate, which is an essential cofactor in both nucleic acid and protein biosynthesis. Prokaryotes (and some lower eukaryotes) must synthesize folate de novo, while higher eukaryotes are able to utilize dietary folate and therefore lack DHPS. Sulfonamide drugs, which are substrate analogs of pABA, target DHPS. Cobalamin-dependent methyltransferases catalyze the transfer of a methyl group via a methyl- cob(III)amide intermediate. These include MeTr, a functional heterodimer, and the folate binding domain of MetH. Probab=32.87 E-value=34 Score=14.46 Aligned_cols=144 Identities=17% Similarity=0.154 Sum_probs=64.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECC--CCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHH Q ss_conf 999999999999998619989999627--763001210147741688505572013445574420203564047621467 Q gi|254780402|r 66 VGETMALIGLIPAIRSRHVNVLLTTMT--ATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLT 143 (440) Q Consensus 66 vGE~~~a~pli~~l~~~~~~iliT~~T--~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnl 143 (440) --|..-+.|+++.++++ +++.+|.=| +.=.+.+-+..-+-+ -|.... ..| |++ T Consensus 58 ~eE~~Rl~pvl~~i~~~-~~v~iSIDT~~~~Va~~al~~G~~iI------NDVsg~----------------~~d--~~m 112 (258) T cd00423 58 EEELERVIPVLRALAGE-PDVPISVDTFNAEVAEAALKAGADII------NDVSGG----------------RGD--PEM 112 (258) T ss_pred HHHHHHHHHHHHHHHCC-CCCEEEEECCCHHHHHHHHHCCCCEE------ECCHHH----------------HCC--HHH T ss_conf 88888850056887427-99609997988899999998599868------240031----------------065--579 Q ss_pred HHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCC--CCEEEECCCCCCCCCCCCC Q ss_conf 88874224550586412665421210011112331026442366415778889986088--5058720431012344221 Q gi|254780402|r 144 VFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGA--QKLIVSGNLKIDTESLPCD 221 (440) Q Consensus 144 i~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~--~~i~v~GnlK~d~~~~~~~ 221 (440) +..+++.++|+++.-.+-.+.....+.........+...| ++..+++.+.|+ ++|.+..-+-|.-. ...+ T Consensus 113 ~~~va~~~~~~ilmH~~~~p~~~~~~~~~~~~~~~v~~~~-------~~~i~~~~~~Gi~~~~IiiDPGiGFgK~-~~~n 184 (258) T cd00423 113 APLAAEYGAPVVLMHMDGTPQTMQNNPYYADVVDEVVEFL-------EERVEAATEAGIPPEDIILDPGIGFGKT-EEHN 184 (258) T ss_pred HHHHHHCCCCEEEEECCCCCCCCCCCCCCCCHHHHHHHHH-------HHHHHHHHHCCCCHHEEEECCCCCCCCC-HHHH T ss_conf 9999974998898305788655666898664899999999-------9999999986999300887477677888-7889 Q ss_pred HHHHHHHHHHC--CCCCEEECCC Q ss_conf 25689999861--7986175046 Q gi|254780402|r 222 KELLSLYQESI--AGRYTWAAIS 242 (440) Q Consensus 222 ~~~~~~~~~~~--~~r~v~vagS 242 (440) -+..+.+..+. .+.|+++..| T Consensus 185 ~~ll~~l~~~~~~~g~PilvG~S 207 (258) T cd00423 185 LELLRRLDAFRELPGLPLLLGVS 207 (258) T ss_pred HHHHHHHHHHHHCCCCCEEEEEC T ss_conf 99999799997215994698602 No 245 >pfam07894 DUF1669 Protein of unknown function (DUF1669). This family is composed of sequences derived from hypothetical eukaryotic proteins of unknown function. Some members of this family are annotated as being potential phospholipases but no literature was found to support this. Probab=32.66 E-value=34 Score=14.43 Aligned_cols=52 Identities=21% Similarity=0.319 Sum_probs=30.1 Q ss_pred EEEEECC-CCCCHHH----HHHHHHHCCEEEE----EECCCCHHHHHHHHHHCCCCEEEEE Q ss_conf 1688505-5720134----4557442020356----4047621467888742245505864 Q gi|254780402|r 107 AIHQYAP-LDIQPAV----SRFLKYWKPDCMI----LSESDIWPLTVFELSKQRIPQVLVN 158 (440) Q Consensus 107 ~~~~y~P-~D~~~~~----~~fl~~~~P~~~i----~~e~ElWPnli~~~~~~~ip~~lin 158 (440) +.+.+-| -|-...+ |+.++.-+--++| |...||---++..+.+|+|||+++= T Consensus 121 vtv~~qpp~~~~~~iKe~vRr~I~~A~kVIAvVMD~FTD~dIf~DlleAa~kR~VpVYiLL 181 (284) T pfam07894 121 VTVYFQPPKDGQPHIKEVVRRMIQEAQKVIAVVMDVFTDVDIFCDLLEAASKRRVPVYILL 181 (284) T ss_pred EEEECCCCCCCCCCHHHHHHHHHHHCCEEEEEEEECCCCHHHHHHHHHHHHCCCCCEEEEE T ss_conf 8997369988898889999999963285999997244449999999999862796189997 No 246 >PRK09701 D-allose transporter subunit; Provisional Probab=32.62 E-value=34 Score=14.43 Aligned_cols=149 Identities=11% Similarity=0.121 Sum_probs=69.3 Q ss_pred HHHHHHHHCCEEEEEE--CCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCH-----H Q ss_conf 4455744202035640--47621467888742245505864126654212100111123310264423664157-----7 Q gi|254780402|r 120 VSRFLKYWKPDCMILS--ESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSE-----R 192 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~--e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~-----~ 192 (440) +..++.. +++.+++. ..+-.-..+..+++.|||++.+|..+...+..+.. ...+.....|. . T Consensus 75 Ie~~I~~-gvdaIii~p~d~~a~~~~i~~A~~aGIpVV~~D~~v~~~~~~~~~----------~~~~~~v~~Dn~~~G~~ 143 (311) T PRK09701 75 FEDLSNK-NYKGIAFAPLSSVNLVMPVARAWKKGIYLVNLDEKIDMDNLKKAG----------GNVEAFVTTDNVAVGAK 143 (311) T ss_pred HHHHHHC-CCCEEEEECCCHHHHHHHHHHHHHCCCCEEECCCCCCCHHHHCCC----------CCEEEEECCCHHHHHHH T ss_conf 9999975-999999918987788999999997799189636766731121136----------64589981568999999 Q ss_pred HHHHH-HHHCCC--CEE-EECCCCCCCCCCCCCHHHHHHHHHHC-C-CCCEEECC--CCCCCCCCCCCC-CCCCCCCCCC Q ss_conf 88899-860885--058-72043101234422125689999861-7-98617504--655332100000-0000012341 Q gi|254780402|r 193 YFRRY-KELGAQ--KLI-VSGNLKIDTESLPCDKELLSLYQESI-A-GRYTWAAI--STFEGEEDKAVY-VHNFIKCRTD 263 (440) Q Consensus 193 ~~~rl-~~lG~~--~i~-v~GnlK~d~~~~~~~~~~~~~~~~~~-~-~r~v~vag--Sth~~EE~~il~-a~~~l~~~~~ 263 (440) .++.+ .++|.+ ++. +.|+.-... . .+...-+.+.+ . ...-++.. ..|..++ ... +...|..+|+ T Consensus 144 ~a~~l~~~l~~~~g~v~ii~g~~g~~~----~-~~R~~G~~~~~~~~~~~~iv~~~~~~~d~~~--a~~~~~~~L~a~Pd 216 (311) T PRK09701 144 GASFIIDKLGAEGGEVAIIEGKAGNAS----G-EARRNGATEAFKKASQIKLVASQPADWDRIK--ALDVATNVLQRNPN 216 (311) T ss_pred HHHHHHHHHCCCCCCEEEECCCCCCHH----H-HHHHHHHHHHHHHCCCCEEEEEECCCCCHHH--HHHHHHHHHHCCCC T ss_conf 999999984677881789727887617----9-9999889999875798467732037765877--99999999954999 Q ss_pred EEEEEEECCCCHHHHHHHHHHHHC Q ss_conf 248897055211478887765300 Q gi|254780402|r 264 VLTIIVPRHPRRCDAIERRLIAKG 287 (440) Q Consensus 264 ~~lIIvPRhpeR~~~i~~~l~~~g 287 (440) +..|+++-..- .-...+-+++.| T Consensus 217 i~~I~~~nd~~-a~Ga~~Al~~aG 239 (311) T PRK09701 217 IKAIYCANDTM-AMGVAQAVANAG 239 (311) T ss_pred CCEEEECCCHH-HHHHHHHHHHCC T ss_conf 76899779726-899999999669 No 247 >COG1171 IlvA Threonine dehydratase [Amino acid transport and metabolism] Probab=32.46 E-value=34 Score=14.41 Aligned_cols=57 Identities=12% Similarity=0.226 Sum_probs=29.5 Q ss_pred CCCCCCHHHHHHHCCCEEECCCCCCHHHH----------HHHHHHC-CCEEEECCHHHHHHHHHHHHCC Q ss_conf 44545837689619949998984437999----------9999978-9869957989999999999589 Q gi|254780402|r 332 ASGGQNPLEAAMLGCAILSGPNVENFRDI----------YRRMVSS-GAVRIVEEVGTLADMVYSLLSE 389 (440) Q Consensus 332 ~~GGhN~lEpaa~G~pVi~GP~~~nf~e~----------~~~L~~~-g~~~~v~d~~eL~~~l~~ll~d 389 (440) +.|.+..-+....|.++.+-+....|.+- ++.+.+. .-...| |.++++.++..++++ T Consensus 210 p~~a~~~~~Sl~~G~~~~~~~~~~tiaDG~av~~~g~~tf~i~~~~vd~~v~V-~e~ei~~am~~l~~~ 277 (347) T COG1171 210 PEGAPSMYASLKAGKIVVVLPDVGTIADGLAVKRPGDLTFEILRELVDDIVLV-DEDEICAAMRDLFER 277 (347) T ss_pred ECCCHHHHHHHHCCCCEEECCCCCCCCCCCCCCCCCHHHHHHHHHCCCCEEEE-CHHHHHHHHHHHHHC T ss_conf 48885899999759943536987800254204888777899999709968998-889999999999732 No 248 >cd06277 PBP1_LacI_like_1 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=32.26 E-value=34 Score=14.39 Aligned_cols=104 Identities=18% Similarity=0.168 Sum_probs=62.9 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHC Q ss_conf 99999999986199899996277630012101477416885055720134455744202035640476214678887422 Q gi|254780402|r 71 ALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQ 150 (440) Q Consensus 71 ~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~ 150 (440) .+..+-+.+.+.+.++++.++....... ...-+.+...+.+-+|++..+- +..+..+.+. T Consensus 20 l~~gie~~~~~~Gy~lll~~~~~~~~~e-------------------~~~~~~l~~~~vdGiI~~~~~~-~~~~~~l~~~ 79 (268) T cd06277 20 IYRAIEEEAKKYGYNLILKFVSDEDEEE-------------------FELPSFLEDGKVDGIILLGGIS-TEYIKEIKEL 79 (268) T ss_pred HHHHHHHHHHHCCCEEEEEECCCCHHHH-------------------HHHHHHHHHCCCCEEEEECCCC-HHHHHHHHHC T ss_conf 9999999999849989999389998999-------------------9999999948987899978999-7999999975 Q ss_pred CCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCH----HHHHHHHHHCCCCEEEECCC Q ss_conf 45505864126654212100111123310264423664157----78889986088505872043 Q gi|254780402|r 151 RIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSE----RYFRRYKELGAQKLIVSGNL 211 (440) Q Consensus 151 ~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~----~~~~rl~~lG~~~i~v~Gnl 211 (440) ++|+++++...... .++.|..-+. ...++|.+.|-++|-..|.. T Consensus 80 ~~P~V~id~~~~~~-----------------~~~~V~~Dn~~a~~~a~~~L~~~Ghr~I~~i~~~ 127 (268) T cd06277 80 GIPFVLVDHYIPNE-----------------KADCVLTDNYSGAYAATEYLIEKGHRKIGFVGDP 127 (268) T ss_pred CCCEEEECCCCCCC-----------------CCCEEEECHHHHHHHHHHHHHHCCCCCEEEEECC T ss_conf 99899967768999-----------------9988996659999999999997099836997278 No 249 >PRK13962 bifunctional phosphoglycerate kinase/triosephosphate isomerase; Provisional Probab=32.10 E-value=35 Score=14.37 Aligned_cols=25 Identities=28% Similarity=0.280 Sum_probs=13.2 Q ss_pred CCHHHHHHHHHH--CCCCEEEEEECCC Q ss_conf 621467888742--2455058641266 Q gi|254780402|r 138 DIWPLTVFELSK--QRIPQVLVNARMS 162 (440) Q Consensus 138 ElWPnli~~~~~--~~ip~~linarls 162 (440) |+=|+-+....+ ++..+++.||-|. T Consensus 295 DIGp~Ti~~~~~~I~~a~TI~WNGP~G 321 (653) T PRK13962 295 DIGPETVELFKEKLSDAKTIVWNGPMG 321 (653) T ss_pred CCCHHHHHHHHHHHHHCCEEEEECCCC T ss_conf 688999999999997799999978964 No 250 >PRK08591 acetyl-CoA carboxylase biotin carboxylase subunit; Validated Probab=31.97 E-value=35 Score=14.36 Aligned_cols=126 Identities=15% Similarity=0.105 Sum_probs=53.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCCEEEEECCCC-CCHHHHCCCCCCEEEEEC----CCCCCHHHHHHH--------HHHCCEE Q ss_conf 899999999999999861998999962776-300121014774168850----557201344557--------4420203 Q gi|254780402|r 65 SVGETMALIGLIPAIRSRHVNVLLTTMTAT-SAKVARKYLGQYAIHQYA----PLDIQPAVSRFL--------KYWKPDC 131 (440) Q Consensus 65 SvGE~~~a~pli~~l~~~~~~iliT~~T~t-g~~~~~~~~~~~~~~~y~----P~D~~~~~~~fl--------~~~~P~~ 131 (440) .-|| .|.-+++..++.+..-+..++++. ....+ + ..|... .+ +-+....+.+.+ +.+-|-- T Consensus 9 NRGE--IA~RiiRt~~elgi~tVavys~~D~~a~hv-~-~ADeav--~ig~~~~~~sYln~~~Ii~~A~~~g~dAihPGY 82 (449) T PRK08591 9 NRGE--IALRILRACKELGIKTVAVHSTADRDALHV-Q-LADEAV--CIGPAPSKKSYLNIPAIISAAEITGADAIHPGY 82 (449) T ss_pred CCCH--HHHHHHHHHHHCCCCEEEECCHHHCCCCHH-H-HCCEEE--EECCCCCCCCCCCHHHHHHHHHHHCCCEEECCC T ss_conf 7849--999999999984994999868575278528-8-598889--958988433304899999999982999897274 Q ss_pred EEEECCCCHHHHHHHHHHCCCCEEEEEE------CCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHH-HHHCCC Q ss_conf 5640476214678887422455058641------26654212100111123310264423664157788899-860885 Q gi|254780402|r 132 MILSESDIWPLTVFELSKQRIPQVLVNA------RMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRY-KELGAQ 203 (440) Q Consensus 132 ~i~~e~ElWPnli~~~~~~~ip~~lina------rls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl-~~lG~~ 203 (440) -++.|. |.+-..|.+.|| ..|.- +|.+|...|- .....--+++...+ -..+|.+++..+ .++|-+ T Consensus 83 GFLSEn---a~FA~~~~~~Gi--~fIGP~~~~i~~~GdK~~ar~-~a~~agVPvvpgs~-~~~~~~~ea~~~a~~iGyP 154 (449) T PRK08591 83 GFLSEN---ADFAEICEDSGF--TFIGPSAETIRLMGDKVTAKA-TMKKAGVPVVPGSD-GPVDDEEEALAIAKEIGYP 154 (449) T ss_pred CHHHCC---HHHHHHHHHCCC--EEECCCHHHHHHHHCHHHHHH-HHHHCCCCCCCCCC-CCCCCHHHHHHHHHHCCCC T ss_conf 243258---999999998799--999929999998769899999-99983999788876-6556899999999874996 No 251 >PTZ00254 40S ribosomal protein SA; Provisional Probab=31.88 E-value=35 Score=14.35 Aligned_cols=92 Identities=15% Similarity=0.264 Sum_probs=46.1 Q ss_pred EECCHHHH----HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCC----CCCCEEEEECCCCCCH-HHHHHHHHHCCEEE Q ss_conf 81789999----999999999986199899996277630012101----4774168850557201-34455744202035 Q gi|254780402|r 62 HASSVGET----MALIGLIPAIRSRHVNVLLTTMTATSAKVARKY----LGQYAIHQYAPLDIQP-AVSRFLKYWKPDCM 132 (440) Q Consensus 62 HaaSvGE~----~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~----~~~~~~~~y~P~D~~~-~~~~fl~~~~P~~~ 132 (440) |--.+..+ ..|.-++..+. ..-+|++-.+-+.|...+.+. ....+...|+|==+-- ..++| ..|+++ T Consensus 47 hIIdL~kT~e~L~~Aa~~i~~i~-~~~~Il~V~tr~~~q~aV~kfA~~tG~~~i~~Rw~pGtlTN~i~~~f---~eP~ll 122 (242) T PTZ00254 47 HIINLAKTWEKIKLAARVIAAVE-NPADVVVVSSRPYGQRAILKFSQYTGASFIAGRFTPGTFTNQIQKKF---QEPRLL 122 (242) T ss_pred EEEEHHHHHHHHHHHHHHHHHHC-CCCCEEEEECCHHHHHHHHHHHHHHCCCEEECCCCCCCCCCHHHHCC---CCCCEE T ss_conf 79679999999999999999864-79977999838898799999999979942405127866468766524---688889 Q ss_pred EEECCCCHHHHHHHHHHCCCCEEEE Q ss_conf 6404762146788874224550586 Q gi|254780402|r 133 ILSESDIWPLTVFELSKQRIPQVLV 157 (440) Q Consensus 133 i~~e~ElWPnli~~~~~~~ip~~li 157 (440) |++..-.=-+-+.+|.+.|||++-+ T Consensus 123 iV~DP~~d~~AV~EA~~~nIPvIal 147 (242) T PTZ00254 123 IVTDPRTDHQAIREASYVNIPVIAL 147 (242) T ss_pred EEECCCCCHHHHHHHHHCCCCEEEE T ss_conf 9966754408999999859988986 No 252 >PRK08155 acetolactate synthase catalytic subunit; Validated Probab=31.79 E-value=35 Score=14.34 Aligned_cols=39 Identities=15% Similarity=0.130 Sum_probs=25.2 Q ss_pred CCEEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 877999817899999999999999861998999962776 Q gi|254780402|r 56 GPLIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTAT 94 (440) Q Consensus 56 ~~~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~t 94 (440) ++-+-+=.+-.|..+++-.+......+-|=+++|-.+++ T Consensus 76 k~~v~~~t~GpG~~N~~~al~~A~~~~~Pll~itg~~~~ 114 (564) T PRK08155 76 KPAVCMACSGPGATNLVTAIADARLDSIPLICITGQVPA 114 (564) T ss_pred CCEEEEEECCCHHHHHHHHHHHHHHHCCCEEEEECCCCH T ss_conf 988999943857998999999999709984999688976 No 253 >TIGR01860 VNFD nitrogenase vanadium-iron protein, alpha chain; InterPro: IPR010142 This entry represents the alpha chain of the vanadium-containing component of the vanadium-iron nitrogenase compound I. The complex also includes a second alpha chain, two beta chains and two delta chains. Component I interacts with compound II also known as the iron-protein which transfers electrons to compound I where the catalysis occurs.; GO: 0016163 nitrogenase activity, 0051212 vanadium ion binding, 0051536 iron-sulfur cluster binding, 0009399 nitrogen fixation. Probab=31.70 E-value=35 Score=14.33 Aligned_cols=24 Identities=17% Similarity=0.090 Sum_probs=15.6 Q ss_pred EEECCHHHHHHHHHHHHHHHHCCCC Q ss_conf 9817899999999999999861998 Q gi|254780402|r 61 FHASSVGETMALIGLIPAIRSRHVN 85 (440) Q Consensus 61 ~HaaSvGE~~~a~pli~~l~~~~~~ 85 (440) -|.+--||-..-+.+.+.+.+ .|+ T Consensus 104 ~h~vfG~ek~l~k~~~eaf~e-~P~ 127 (461) T TIGR01860 104 SHVVFGGEKQLEKSIREAFKE-FPD 127 (461) T ss_pred CEEEECCHHHHHHHHHHHHHH-CCH T ss_conf 226754325578899999862-201 No 254 >PRK02261 methylaspartate mutase subunit S; Provisional Probab=31.61 E-value=35 Score=14.32 Aligned_cols=41 Identities=12% Similarity=0.357 Sum_probs=27.4 Q ss_pred EEEECCCCCCCCCCHHHHH----HHCCCEEECCCCCCHHHHHHHHHH Q ss_conf 0661355444545837689----619949998984437999999997 Q gi|254780402|r 324 AFIGRSFCASGGQNPLEAA----MLGCAILSGPNVENFRDIYRRMVS 366 (440) Q Consensus 324 afVGGSl~~~GGhN~lEpa----a~G~pVi~GP~~~nf~e~~~~L~~ 366 (440) +|+||.+ ..+.+.|=|-. .+|..-+|||.+. ..++.+.+.+ T Consensus 88 vvvGG~i-~i~~~dp~~~~~~L~~~Gv~~VF~pGT~-~~~ii~~i~~ 132 (137) T PRK02261 88 LYVGGNL-VVGKHDFEEVEKKFKEMGFDRVFAPGTD-LEEVIDDLKA 132 (137) T ss_pred EEECCCC-CCCCCCCHHHHHHHHHCCCCEEECCCCC-HHHHHHHHHH T ss_conf 9983621-6788783999999997797988797889-9999999999 No 255 >TIGR01371 met_syn_B12ind 5-methyltetrahydropteroyltriglutamate--homocysteine S-methyltransferase; InterPro: IPR006276 This group represents cobalamin-independent methionine synthase. A family of uncharacterised archaeal proteins having substantial homology to the C-terminal region of this family, are not included in this group, though they are contained in a larger family of vitamin-B12 independent methionine synthase.; GO: 0003871 5-methyltetrahydropteroyltriglutamate-homocysteine S-methyltransferase activity, 0009086 methionine biosynthetic process. Probab=31.53 E-value=21 Score=15.76 Aligned_cols=37 Identities=14% Similarity=0.115 Sum_probs=17.3 Q ss_pred EEEEECCHHHHHHHHHHHHHHHH----CCCCEEEEECCCCC Q ss_conf 99981789999999999999986----19989999627763 Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAIRS----RHVNVLLTTMTATS 95 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l~~----~~~~iliT~~T~tg 95 (440) +.+--.|--+..+++..=++|.+ ..++|+|+|.=.+. T Consensus 223 ~LV~Dl~~~~~~~~~~~Y~~l~~~~ss~~~~l~l~TYF~~~ 263 (778) T TIGR01371 223 ALVTDLSKEELALFKEAYTELEEALSSKDLKLLLQTYFDSV 263 (778) T ss_pred EEEECCCHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCCCH T ss_conf 43405988999999999999874210245751788246776 No 256 >PRK13288 pyrophosphatase PpaX; Provisional Probab=31.36 E-value=36 Score=14.30 Aligned_cols=54 Identities=13% Similarity=0.111 Sum_probs=37.2 Q ss_pred CCEEEECCCCCCCCCCHHHHHHHCCCEE---ECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHH Q ss_conf 8206613554445458376896199499---9898443799999999789869957989999999999 Q gi|254780402|r 322 EIAFIGRSFCASGGQNPLEAAMLGCAIL---SGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSL 386 (440) Q Consensus 322 diafVGGSl~~~GGhN~lEpaa~G~pVi---~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~l 386 (440) +.++||-|..+ +.-|-++|..++ .|.+- .+.|.+.++-..+++..||.+.|.+| T Consensus 157 e~l~VGDs~~D-----i~aA~~aG~~~i~v~~g~~~------~~~l~~~~ad~vi~~~~eL~~il~~l 213 (214) T PRK13288 157 EALMVGDNYHD-----ILAGKNAGTKTAGVAWTIKG------REYLEQYKPDFMLDKMSDLLAIVGDL 213 (214) T ss_pred CEEEEECCHHH-----HHHHHHCCCEEEEEECCCCC------HHHHHHCCCCEEECCHHHHHHHHHHC T ss_conf 68999689899-----99999959959999389998------76797689999989999999999861 No 257 >cd06278 PBP1_LacI_like_2 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=31.25 E-value=36 Score=14.28 Aligned_cols=103 Identities=19% Similarity=0.252 Sum_probs=63.6 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHH Q ss_conf 99999999998619989999627763001210147741688505572013445574420203564047621467888742 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSK 149 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~ 149 (440) ..+..+-+.+.+.+.++++.+ |....+. + ..++.++. .+.+.+|++.++.=+..+..+.+ T Consensus 16 ~i~~gie~~~~~~gy~~ll~~-~~~~~~~-e-----------------~~l~~l~~-~~vDGiIi~~~~~~~~~~~~~~~ 75 (266) T cd06278 16 ELLEALSRALQARGYQPLLIN-TDDDEDL-D-----------------AALRQLLQ-YRVDGVIVTSGTLSSELAEECRR 75 (266) T ss_pred HHHHHHHHHHHHCCCEEEEEE-CCCCHHH-H-----------------HHHHHHHH-CCCCEEEEECCCCCHHHHHHHHH T ss_conf 999999999998699999997-9998899-9-----------------99999995-59999999489899999999997 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHH----HHHHHHHHCCCCEEEEC Q ss_conf 2455058641266542121001111233102644236641577----88899860885058720 Q gi|254780402|r 150 QRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSER----YFRRYKELGAQKLIVSG 209 (440) Q Consensus 150 ~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~----~~~rl~~lG~~~i~v~G 209 (440) .++|+++++...... .++.|..-+.. ..++|.+.|-+++-+.| T Consensus 76 ~~iPvV~i~~~~~~~-----------------~~~~V~~Dn~~~~~~a~~~L~~~Gh~~I~~i~ 122 (266) T cd06278 76 NGIPVVLINRYVDGP-----------------GVDAVCSDNYEAGRLAAELLLAKGCRRIAFIG 122 (266) T ss_pred CCCCEEEECCCCCCC-----------------CCCEEEECHHHHHHHHHHHHHHCCCCEEEEEE T ss_conf 699899978768999-----------------99899977899999999999980998899996 No 258 >COG3254 Uncharacterized conserved protein [Function unknown] Probab=31.14 E-value=35 Score=14.36 Aligned_cols=22 Identities=18% Similarity=0.341 Sum_probs=17.7 Q ss_pred CCCHHHHHHHHHHCCCCEEEEE Q ss_conf 7621467888742245505864 Q gi|254780402|r 137 SDIWPLTVFELSKQRIPQVLVN 158 (440) Q Consensus 137 ~ElWPnli~~~~~~~ip~~lin 158 (440) .++||.++..+++.||.-+.|= T Consensus 23 ~~vWPE~~a~lk~agi~nYSIf 44 (105) T COG3254 23 DAVWPELLALLKEAGIRNYSIF 44 (105) T ss_pred CCCCHHHHHHHHHCCCCEEEEE T ss_conf 3336899999998498503688 No 259 >CHL00071 tufA elongation factor Tu Probab=30.86 E-value=36 Score=14.24 Aligned_cols=56 Identities=16% Similarity=0.046 Sum_probs=41.6 Q ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCC Q ss_conf 62146788874224550586412665421210011112331026442366415778889986088505872043101234 Q gi|254780402|r 138 DIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTES 217 (440) Q Consensus 138 ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~ 217 (440) ++=|||+.-+.....-+++|+|+- .++.|+.+.......||++.+.|.= .|.|... T Consensus 87 ~fv~nmi~Gas~aD~alLVV~A~~-----------------------G~~~QTkEHl~l~~~lgV~~~IVav-nKmD~v~ 142 (409) T CHL00071 87 DYVKNMITGAAQMDGAILVVSAAD-----------------------GPMPQTKEHILLAKQVGVPNIVVFL-NKEDQVD 142 (409) T ss_pred HHHHHHHHHHCCCCCEEEEEECCC-----------------------CCCCCHHHHHHHHHHCCCCCEEEEE-ECCCCCC T ss_conf 999998752301581289998687-----------------------8850049999999973999365555-5679854 No 260 >cd06242 Peptidase_M14-like_1_5 Peptidase M14-like domain of a functionally uncharacterized subgroup of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Two major subfamilies of the M14 family, defined based on sequence and structural homology, are the A/B and N/E subfamilies. Enzymes belonging to the A/B subfamily are normally synthesized as inactive precursors containing preceding signal peptide, followed by an N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The A/B enzymes can be further divided based on their substrate specificity; Carboxypeptidase A-like (CPA-like) enzymes favor hydrophobic residues while carboxypeptidase B-like (CPB-like) enzymes only cleave the basic residues lysine or arginine. Th Probab=30.81 E-value=36 Score=14.24 Aligned_cols=27 Identities=19% Similarity=0.391 Sum_probs=14.0 Q ss_pred CCCEEEEEECCHH-H---HHHHHHHHHHHHH Q ss_conf 8877999817899-9---9999999999986 Q gi|254780402|r 55 IGPLIWFHASSVG-E---TMALIGLIPAIRS 81 (440) Q Consensus 55 ~~~~IW~HaaSvG-E---~~~a~pli~~l~~ 81 (440) .++.||+-+.=.| | ..++..++++|.. T Consensus 53 ~K~~V~i~~giHGnE~~g~ea~l~l~~~L~~ 83 (268) T cd06242 53 KKLRVWLQGGVHGNEPAGDEAALALLGKLDN 83 (268) T ss_pred CCCEEEEECCCCCCCCCHHHHHHHHHHHHHH T ss_conf 8857999667479997549999999999983 No 261 >KOG1329 consensus Probab=30.61 E-value=28 Score=14.94 Aligned_cols=20 Identities=25% Similarity=0.381 Sum_probs=13.9 Q ss_pred CCCCCCEEEEEEECCCCHHH Q ss_conf 01234124889705521147 Q gi|254780402|r 258 IKCRTDVLTIIVPRHPRRCD 277 (440) Q Consensus 258 l~~~~~~~lIIvPRhpeR~~ 277 (440) ...+++...+.-|||+.+.. T Consensus 382 ~~lH~nV~vlr~P~~~~~~~ 401 (887) T KOG1329 382 FFLHPNVKVLRCPRHPGSGP 401 (887) T ss_pred HHCCCCEEEEECCCCCCCCC T ss_conf 51389718998988767677 No 262 >pfam03320 FBPase_glpX Bacterial fructose-1,6-bisphosphatase, glpX-encoded. Probab=30.26 E-value=37 Score=14.18 Aligned_cols=33 Identities=30% Similarity=0.557 Sum_probs=20.1 Q ss_pred CCEEEEEE--ECCCCHHHHHHHHHHHHCCHHHHHCCCCC Q ss_conf 41248897--05521147888776530010110013444 Q gi|254780402|r 262 TDVLTIIV--PRHPRRCDAIERRLIAKGLKVARRSRGDV 298 (440) Q Consensus 262 ~~~~lIIv--PRhpeR~~~i~~~l~~~gl~~~~~S~~~~ 298 (440) .++..++. ||| +++.+.+++.|-...+.++++. T Consensus 153 ~dltV~vLdRpRH----~~lI~eir~~Gari~Li~DGDV 187 (308) T pfam03320 153 EDLTVVVLDRPRH----AELIEEIREAGARVKLISDGDV 187 (308) T ss_pred HHEEEEEEECCHH----HHHHHHHHHCCCEEEEECCCCH T ss_conf 6838999975267----9999999973984999656789 No 263 >KOG0622 consensus Probab=30.18 E-value=21 Score=15.83 Aligned_cols=29 Identities=17% Similarity=0.145 Sum_probs=15.0 Q ss_pred CEEEECCHHHHHHHHHHCC--CCEEEECCCC Q ss_conf 2366415778889986088--5058720431 Q gi|254780402|r 184 SLVIVQSERYFRRYKELGA--QKLIVSGNLK 212 (440) Q Consensus 184 ~~i~~qs~~~~~rl~~lG~--~~i~v~GnlK 212 (440) ..+.+.|....+...++|+ +++...++.| T Consensus 103 ~gfdcaSk~E~~lvl~~gv~P~riIyanpcK 133 (448) T KOG0622 103 CGFDCASKNELDLVLSLGVSPERIIYANPCK 133 (448) T ss_pred CCCEECCHHHHHHHHHCCCCHHHEEECCCCC T ss_conf 6601037179999986698856668627875 No 264 >COG3958 Transketolase, C-terminal subunit [Carbohydrate transport and metabolism] Probab=30.15 E-value=37 Score=14.16 Aligned_cols=140 Identities=23% Similarity=0.258 Sum_probs=62.2 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHH-HHHCCHHHHHCCCCCCCCCCEEEEECCCCCCHHHH Q ss_conf 04655332100000000001234124889705521147888776-53001011001344444664167506765202554 Q gi|254780402|r 240 AISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRL-IAKGLKVARRSRGDVINAEVDIFLGDTIGEMGFYL 318 (440) Q Consensus 240 agSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l-~~~gl~~~~~S~~~~~~~~~~V~i~Dt~GeL~~lY 318 (440) -||||..=|++.+ ++-.|| ..|++|-++-...++...+ .-.|..|.|.+..+.+..-.+=---=++|--..+- T Consensus 117 dG~sHq~~EDiai-----mR~lpn-~~V~~P~D~v~~~~i~~~~~~~~GP~Y~Rl~R~~~p~~~~~~~~~F~iGka~vLr 190 (312) T COG3958 117 DGSSHQALEDIAI-----MRGLPN-MTVIAPADAVETRAILDQIADYKGPVYMRLGRGKVPVVVDEGGYTFEIGKANVLR 190 (312) T ss_pred CCCCCHHHHHHHH-----HHCCCC-CEEECCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCEECCCCCEEECCCEEEEE T ss_conf 8864116789999-----860788-5587048589999999999860797799805778885033787257336346764 Q ss_pred HHCCCEEEECCCCCCCCCCHHHHHH------HCCCEEECCCCCCHHH--HHHHHHHCCCEEEECCH---HHHHHHHHHHH Q ss_conf 2208206613554445458376896------1994999898443799--99999978986995798---99999999995 Q gi|254780402|r 319 RMTEIAFIGRSFCASGGQNPLEAAM------LGCAILSGPNVENFRD--IYRRMVSSGAVRIVEEV---GTLADMVYSLL 387 (440) Q Consensus 319 ~~AdiafVGGSl~~~GGhN~lEpaa------~G~pVi~GP~~~nf~e--~~~~L~~~g~~~~v~d~---~eL~~~l~~ll 387 (440) .-+|+++|.-+.. .|--||||. ..+.||.=|.+.-..+ +.....+.|..+.+.+- .-|..++...+ T Consensus 191 dG~D~tiiA~G~m---v~~al~AA~~L~~~GIsa~Vi~m~tIKPiD~~~i~~~A~~t~~IvT~EeHsi~GGlGsaVAEvl 267 (312) T COG3958 191 DGSDLTIIATGVM---VAEALEAAEILKKEGISAAVINMFTIKPIDEQAILKAARETGRIVTAEEHSIIGGLGSAVAEVL 267 (312) T ss_pred CCCCEEEEECCCC---HHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHHHHCCCEEEEECCEEECCHHHHHHHHH T ss_conf 1772699951716---6999999999986698879986476677878999998742583899863322356269999999 Q ss_pred C Q ss_conf 8 Q gi|254780402|r 388 S 388 (440) Q Consensus 388 ~ 388 (440) . T Consensus 268 s 268 (312) T COG3958 268 S 268 (312) T ss_pred H T ss_conf 8 No 265 >cd06323 PBP1_ribose_binding Periplasmic sugar-binding domain of the thermophilic Thermoanaerobacter tengcongensis ribose binding protein (ttRBP) and its mesophilic homologs. Periplasmic sugar-binding domain of the thermophilic Thermoanaerobacter tengcongensis ribose binding protein (ttRBP) and its mesophilic homologs. Members of this group are belonging to the type I periplasmic binding protein superfamily, whose members are involved in chemotaxis, ATP-binding cassette transport, and intercellular communication in central nervous system. The thermophilic and mesophilic ribose-binding proteins are structurally very similar, but differ substantially in thermal stability. Probab=30.03 E-value=37 Score=14.15 Aligned_cols=39 Identities=18% Similarity=0.257 Sum_probs=29.7 Q ss_pred HHHHHHCCEEEEE--ECCCCHHHHHHHHHHCCCCEEEEEECC Q ss_conf 5574420203564--047621467888742245505864126 Q gi|254780402|r 122 RFLKYWKPDCMIL--SESDIWPLTVFELSKQRIPQVLVNARM 161 (440) Q Consensus 122 ~fl~~~~P~~~i~--~e~ElWPnli~~~~~~~ip~~linarl 161 (440) .++ ..+++.+|+ ..++--+..+..+.+.|||++++|..+ T Consensus 50 ~l~-~~~vdgiI~~p~~~~~~~~~i~~~~~~~iPvV~id~~~ 90 (268) T cd06323 50 DLI-TRGVDAIIINPTDSDAVVPAVKAANEAGIPVFTIDREA 90 (268) T ss_pred HHH-HCCCCEEEECCCCCCCCHHHHHHHHHCCCCEEEECCCC T ss_conf 999-64999899756542124699999997699689963666 No 266 >cd01452 VWA_26S_proteasome_subunit 26S proteasome plays a major role in eukaryotic protein breakdown, especially for ubiquitin-tagged proteins. It is an ATP-dependent protease responsible for the bulk of non-lysosomal proteolysis in eukaryotes, often using covalent modification of proteins by ubiquitylation. It consists of a 20S proteolytic core particle (CP) and a 19S regulatory particle (RP). The CP is an ATP independent peptidase consisting of hydrolyzing activities. One or both ends of CP carry the RP that confers both ubiquitin and ATP dependence to the 26S proteosome. The RP's proposed functions include recognition of substrates and translocation of these to CP for proteolysis. The RP can dissociate into a stable lid and base subcomplexes. The base is composed of three non-ATPase subunits (Rpn 1, 2 and 10). A single residue in the vWA domain of Rpn10 has been implicated to be responsible for stabilizing the lid-base association. Probab=29.76 E-value=21 Score=15.82 Aligned_cols=82 Identities=13% Similarity=0.012 Sum_probs=32.3 Q ss_pred CCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCCCCC-CEEEEEEECCCCHHHHHHH Q ss_conf 505872043101234422125689999861798617504655332100000000001234-1248897055211478887 Q gi|254780402|r 203 QKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNFIKCRT-DVLTIIVPRHPRRCDAIER 281 (440) Q Consensus 203 ~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~l~~~~-~~~lIIvPRhpeR~~~i~~ 281 (440) .++.+.|+..|-.......-...-+..+....|-|.-.||-...+|+.+....+.++++. .+-+|-..-+.+-.+.+.. T Consensus 77 h~i~~~G~~~~~~~IqiA~LALKHRqnk~~~qRIv~FVgSPi~~~ek~l~~laKklKKnnV~vDII~FGe~~~n~~kL~~ 156 (187) T cd01452 77 HDVQPKGKANFITGIQIAQLALKHRQNKNQKQRIVAFVGSPIEEDEKDLVKLAKRLKKNNVSVDIINFGEIDDNTEKLTA 156 (187) T ss_pred CCCCCCCEECHHHHHHHHHHHHHCCCCCCCCEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHH T ss_conf 26771876518879999999972346777544799997898755789999999987555853589994688899899999 Q ss_pred HHH Q ss_conf 765 Q gi|254780402|r 282 RLI 284 (440) Q Consensus 282 ~l~ 284 (440) .++ T Consensus 157 f~~ 159 (187) T cd01452 157 FID 159 (187) T ss_pred HHH T ss_conf 999 No 267 >PRK12861 malic enzyme; Reviewed Probab=29.49 E-value=38 Score=14.09 Aligned_cols=69 Identities=14% Similarity=0.156 Sum_probs=42.8 Q ss_pred ECCCCCCHH-HHHHHHHHCCEEEEEE----CCCCHHHHHHHHHHCCCCEEEEEECC-CCCCCCCCHHHHHHHHHCCCC Q ss_conf 505572013-4455744202035640----47621467888742245505864126-654212100111123310264 Q gi|254780402|r 111 YAPLDIQPA-VSRFLKYWKPDCMILS----ESDIWPLTVFELSKQRIPQVLVNARM-SRRSFKNWKTVLSFSKKIFSQ 182 (440) Q Consensus 111 y~P~D~~~~-~~~fl~~~~P~~~i~~----e~ElWPnli~~~~~~~ip~~linarl-s~~S~~~~~~~~~l~~~~~~~ 182 (440) |+=+-.+.. ....++...++-+||. .-||||....+++. -.++.-||- .+.--..-.-||.+||..+.. T Consensus 259 fiG~S~~~~~~~~mv~~Ma~~pivfalANP~PEI~p~~a~~~r~---~~i~aTGRSD~pNQvNNvL~FP~iFRGaLdv 333 (762) T PRK12861 259 FLGLSAGGVLKAEMLKAMAARPLILALANPTPEIFPELAHATRD---DVVIATGRSDYPNQVNNVLCFPYIFRGALDV 333 (762) T ss_pred EEECCCCCCCCHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHCC---CEEEECCCCCCCCCCCCEEECHHHHHHHHHH T ss_conf 99767899899999985377876997269998889899854168---6898348878853124532041877668760 No 268 >PRK06965 acetolactate synthase 3 catalytic subunit; Validated Probab=29.49 E-value=38 Score=14.09 Aligned_cols=38 Identities=21% Similarity=0.145 Sum_probs=26.1 Q ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 77999817899999999999999861998999962776 Q gi|254780402|r 57 PLIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTAT 94 (440) Q Consensus 57 ~~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~t 94 (440) .-+.+=.+..|+.+++..+.....++-|=++||-..++ T Consensus 85 ~gv~~~t~GPG~~N~~~gia~A~~d~~Pvl~i~G~~~~ 122 (587) T PRK06965 85 VGVALVTSGPGVTNAVTGIATAYMDSIPMVVISGQVPT 122 (587) T ss_pred CEEEEECCCHHHHHHHHHHHHHHHHCCCEEEECCCCCC T ss_conf 88999756704999999999999749986998688970 No 269 >TIGR02625 YiiL_rotase L-rhamnose 1-epimerase; InterPro: IPR013448 This entry contains rhamnose mutarotase from Escherichia coli, previously designated YiiL as an uncharacterised protein, and close homologs associated with rhamnose dissimilation operons in other bacterial genomes. Mutarotase is a term for an epimerase that changes optical activity. This enzyme was shown experimentally to interconvert alpha and beta stereoisomers of the pyranose form of L-rhamnose . The crystal structure of this small (104 amino acid) protein shows a locally asymmetric dimer with active site residues of His, Tyr, and Trp .. Probab=28.48 E-value=28 Score=15.01 Aligned_cols=19 Identities=26% Similarity=0.405 Sum_probs=15.2 Q ss_pred CCCHHHHHHHHHHCCCCEE Q ss_conf 7621467888742245505 Q gi|254780402|r 137 SDIWPLTVFELSKQRIPQV 155 (440) Q Consensus 137 ~ElWPnli~~~~~~~ip~~ 155 (440) .||||-|-..+|+.|.--+ T Consensus 21 ~eiWPEl~~~LK~HGAhnY 39 (102) T TIGR02625 21 NEIWPELKEVLKKHGAHNY 39 (102) T ss_pred CCCCHHHHHHHHHCCCCCC T ss_conf 8556458999876075110 No 270 >COG3340 PepE Peptidase E [Amino acid transport and metabolism] Probab=28.40 E-value=40 Score=13.97 Aligned_cols=45 Identities=22% Similarity=0.234 Sum_probs=17.8 Q ss_pred CCEEEE-CCCCCCCCCCCCCHHHHHHHHHHC-CCCCE--EECCCCCCCCC Q ss_conf 505872-043101234422125689999861-79861--75046553321 Q gi|254780402|r 203 QKLIVS-GNLKIDTESLPCDKELLSLYQESI-AGRYT--WAAISTFEGEE 248 (440) Q Consensus 203 ~~i~v~-GnlK~d~~~~~~~~~~~~~~~~~~-~~r~v--~vagSth~~EE 248 (440) +-|.|. ||+.+= ...-.+......+++.+ .+.+. |-||+.--+++ T Consensus 86 d~IyVgGGNTF~L-L~~lke~gld~iIr~~vk~G~~YiG~SAGA~ia~p~ 134 (224) T COG3340 86 DIIYVGGGNTFNL-LQELKETGLDDIIRERVKAGTPYIGWSAGANIAGPT 134 (224) T ss_pred CEEEECCCHHHHH-HHHHHHHCCHHHHHHHHHCCCCEEEECCCCEEECCC T ss_conf 5899888518999-999998475899999997698368743672465475 No 271 >pfam04705 TSNR_N Thiostrepton-resistance methylase, N terminus. This region is found in some members of the SpoU-type rRNA methylase family (pfam00588). Probab=28.30 E-value=22 Score=15.64 Aligned_cols=25 Identities=20% Similarity=0.088 Sum_probs=14.8 Q ss_pred CCCHHHHHHHHHHCCCCEEEEEECC Q ss_conf 7621467888742245505864126 Q gi|254780402|r 137 SDIWPLTVFELSKQRIPQVLVNARM 161 (440) Q Consensus 137 ~ElWPnli~~~~~~~ip~~linarl 161 (440) +++=|.++..|.++|||+-++++-+ T Consensus 59 ~p~~~~ll~~c~~r~i~vrli~~~i 83 (115) T pfam04705 59 VPFPGDLLDLCEKRGIPVRLIDASI 83 (115) T ss_pred CCCCHHHHHHHHHCCCCEEEECHHH T ss_conf 8798489999986598416655889 No 272 >TIGR01392 homoserO_Ac_trn homoserine O-acetyltransferase; InterPro: IPR006296 This family describes homoserine-O-acetyltransferase, an enzyme of methionine biosynthesis. The family does not contain a number of homologs that appear to represent sequences derived from within the family but to have changed substantially.; GO: 0004414 homoserine O-acetyltransferase activity, 0009086 methionine biosynthetic process. Probab=28.11 E-value=6.7 Score=19.06 Aligned_cols=167 Identities=13% Similarity=0.015 Sum_probs=81.1 Q ss_pred HHHC-CCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHH---HHHCCEEEEEECCCCHHHHHHHHHHCCCCE Q ss_conf 9861-998999962776300121014774168850557201344557---442020356404762146788874224550 Q gi|254780402|r 79 IRSR-HVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFL---KYWKPDCMILSESDIWPLTVFELSKQRIPQ 154 (440) Q Consensus 79 l~~~-~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl---~~~~P~~~i~~e~ElWPnli~~~~~~~ip~ 154 (440) |..+ +-+++|-|=.-||=..+...+.+... .|.++..++ +-+.|+..++|= +|-|=-|+-.-=|. T Consensus 26 Ln~~~rsNavLvCHALTG~aHvag~~~~~~~-------~pGWW~~l~GPGraiDT~ryFvvc----~NvlGgC~GStGP~ 94 (395) T TIGR01392 26 LNAEDRSNAVLVCHALTGDAHVAGRADDDDK-------TPGWWDDLIGPGRAIDTDRYFVVC----SNVLGGCYGSTGPL 94 (395) T ss_pred CCCCCCCCEEEEEECCCCCCCCCCCCCCCCC-------CCCCCCCCCCCCCCCCCCCCEEEE----CCCCCCCCCCCCCC T ss_conf 1514699779998425788676688755556-------678876667888746747446887----27687888678878 Q ss_pred EEEEECCCCCC--CCCCH-HHHHHHHHCCCCCCEEEECCHHH--HHHHHHHCCCCEE--EECCCC------CCCCCCCCC Q ss_conf 58641266542--12100-11112331026442366415778--8899860885058--720431------012344221 Q gi|254780402|r 155 VLVNARMSRRS--FKNWK-TVLSFSKKIFSQFSLVIVQSERY--FRRYKELGAQKLI--VSGNLK------IDTESLPCD 221 (440) Q Consensus 155 ~linarls~~S--~~~~~-~~~~l~~~~~~~~~~i~~qs~~~--~~rl~~lG~~~i~--v~GnlK------~d~~~~~~~ 221 (440) - +| +.+ -+.|. .||.++ .+|.-- ++.+.+||++++. |.|++- +....+. T Consensus 95 S-~n----P~te~G~pYG~~FP~~T-----------irD~V~~q~~llD~LGi~~~aaVvGGSmGGM~aLEWA~~YPe-- 156 (395) T TIGR01392 95 S-IN----PDTETGKPYGPDFPLIT-----------IRDIVRAQRLLLDHLGIERIAAVVGGSMGGMQALEWAISYPE-- 156 (395) T ss_pred C-CC----CCCCCCCCCCCCCCCCC-----------HHHHHHHHHHHHHHCCCEEEEEEECCCCHHHHHHHHHHCCHH-- T ss_conf 8-86----76888872523468644-----------565899999999875971899875167314689998722766-- Q ss_pred HHHHHHHHHHCCCCCEEECCCCCCCCCCCCCC-CC-CCCCCCCCE---EEEEEECCCCHHHHHHHHHH Q ss_conf 25689999861798617504655332100000-00-000123412---48897055211478887765 Q gi|254780402|r 222 KELLSLYQESIAGRYTWAAISTFEGEEDKAVY-VH-NFIKCRTDV---LTIIVPRHPRRCDAIERRLI 284 (440) Q Consensus 222 ~~~~~~~~~~~~~r~v~vagSth~~EE~~il~-a~-~~l~~~~~~---~lIIvPRhpeR~~~i~~~l~ 284 (440) . -++-|.+|.|-...=+.|-.+ +. +.+...|+. ..----++|++.-.+++++- T Consensus 157 --~--------v~~~~~~A~saR~sa~~Ia~~~~~RqAI~~DP~w~~G~Y~~~~~~P~~GLa~ARm~a 214 (395) T TIGR01392 157 --R--------VRAIVVLATSARHSAWCIAFNEVQRQAILADPDWNDGDYYEEGQQPDKGLALARMLA 214 (395) T ss_pred --H--------HHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf --8--------888887514201588777899999999751702156887988876156899999973 No 273 >pfam07997 DUF1694 Protein of unknown function (DUF1694). This family contains many hypothetical proteins. Probab=28.07 E-value=40 Score=13.93 Aligned_cols=63 Identities=5% Similarity=-0.117 Sum_probs=41.5 Q ss_pred CEEECCCCCCCCCCCCCCCC-CCCCCCCCEEEEEEECCCCH-HHHHHHHHHHHCCHHHHHCCCCC Q ss_conf 61750465533210000000-00012341248897055211-47888776530010110013444 Q gi|254780402|r 236 YTWAAISTFEGEEDKAVYVH-NFIKCRTDVLTIIVPRHPRR-CDAIERRLIAKGLKVARRSRGDV 298 (440) Q Consensus 236 ~v~vagSth~~EE~~il~a~-~~l~~~~~~~lIIvPRhpeR-~~~i~~~l~~~gl~~~~~S~~~~ 298 (440) .|+++-+...-++..+...+ +.++++++..++|-+.=+.. .....+++.+.|.++...++... T Consensus 35 RV~lalt~~qv~~~~~~~~~~~~l~~~~~~~l~ing~l~~~~~~~YiklA~~~~i~fTiV~~~~~ 99 (120) T pfam07997 35 RVVLALTIKQVLDPKVYKEFEQELKDYPNYTLLLNGNLDYDIQSQYIKLAKKANVPFTIVNDPEA 99 (120) T ss_pred HHHHEEEHHHHHCHHHHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 86522409998173179999999864888689985899878999999999985998699578866 No 274 >PRK12999 pyruvate carboxylase; Reviewed Probab=28.04 E-value=40 Score=13.93 Aligned_cols=19 Identities=37% Similarity=0.504 Sum_probs=10.9 Q ss_pred HHHHHHHHCCCCEEEEECCC Q ss_conf 99999986199899996277 Q gi|254780402|r 74 GLIPAIRSRHVNVLLTTMTA 93 (440) Q Consensus 74 pli~~l~~~~~~iliT~~T~ 93 (440) .+.+.++++. .+++|-||- T Consensus 523 ~~~~~~~~~~-~~liTDTT~ 541 (1147) T PRK12999 523 GFADWLREQK-RVLLTDTTM 541 (1147) T ss_pred HHHHHHHHCC-CEEEEECCC T ss_conf 9999998558-667663332 No 275 >COG0647 NagD Predicted sugar phosphatases of the HAD superfamily [Carbohydrate transport and metabolism] Probab=27.88 E-value=41 Score=13.91 Aligned_cols=77 Identities=14% Similarity=0.113 Sum_probs=44.9 Q ss_pred EEEEEECCCCHHH---HHHHHHHCCCCEEEE--EECCCCCCCCCCHHHHHHHHHCC--CCCCEEEECCHHHHHHHHHHC- Q ss_conf 0356404762146---788874224550586--41266542121001111233102--644236641577888998608- Q gi|254780402|r 130 DCMILSESDIWPL---TVFELSKQRIPQVLV--NARMSRRSFKNWKTVLSFSKKIF--SQFSLVIVQSERYFRRYKELG- 201 (440) Q Consensus 130 ~~~i~~e~ElWPn---li~~~~~~~ip~~li--narls~~S~~~~~~~~~l~~~~~--~~~~~i~~qs~~~~~rl~~lG- 201 (440) +-+++-..+.-|. .+..++++|+|++++ |++.|+ +-+ -..+..... ...+.|++....+++.+.+.. T Consensus 16 DGvl~~G~~~ipga~e~l~~L~~~g~~~iflTNn~~~s~---~~~--~~~L~~~~~~~~~~~~i~TS~~at~~~l~~~~~ 90 (269) T COG0647 16 DGVLYRGNEAIPGAAEALKRLKAAGKPVIFLTNNSTRSR---EVV--AARLSSLGGVDVTPDDIVTSGDATADYLAKQKP 90 (269) T ss_pred CCCEEECCCCCCHHHHHHHHHHHCCCEEEEEECCCCCCH---HHH--HHHHHHHCCCCCCHHHEECHHHHHHHHHHHHCC T ss_conf 671683892381599999999986993999958998998---999--999986269988877766389999999985378 Q ss_pred CCCEEEECCC Q ss_conf 8505872043 Q gi|254780402|r 202 AQKLIVSGNL 211 (440) Q Consensus 202 ~~~i~v~Gnl 211 (440) ..++.+.|.- T Consensus 91 ~~kv~viG~~ 100 (269) T COG0647 91 GKKVYVIGEE 100 (269) T ss_pred CCEEEEECCC T ss_conf 9879997781 No 276 >cd03371 TPP_PpyrDC Thiamine pyrophosphate (TPP) family, PpyrDC subfamily, TPP-binding module; composed of proteins similar to phosphonopyruvate decarboxylase (PpyrDC) proteins. PpyrDC is a homotrimeric enzyme which functions in the biosynthesis of C-P compounds such as bialaphos tripeptide in Streptomyces hygroscopicus. These proteins require TPP and divalent metal cation cofactors. Probab=27.69 E-value=41 Score=13.89 Aligned_cols=19 Identities=21% Similarity=0.170 Sum_probs=9.6 Q ss_pred EEEECCHHHHHHHHHHHHC Q ss_conf 6995798999999999958 Q gi|254780402|r 370 VRIVEEVGTLADMVYSLLS 388 (440) Q Consensus 370 ~~~v~d~~eL~~~l~~ll~ 388 (440) ...|++.+||.+++.+.++ T Consensus 133 ~~~v~~~~el~~al~~al~ 151 (188) T cd03371 133 VYEVPSLEELVAALAKALA 151 (188) T ss_pred EEEECCHHHHHHHHHHHHH T ss_conf 7997899999999999984 No 277 >cd06324 PBP1_ABC_sugar_binding_like_13 Periplasmic sugar-binding domain of uncharacterized ABC-type transport systems. This group includes the periplasmic sugar-binding domain of uncharacterized ABC-type transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily, which consists of two domains connected by a three-stranded hinge. The substrate specificity of this group is not known, but it is predicted to be involved in the transport of sugar-containing molecules and chemotaxis. Probab=27.67 E-value=41 Score=13.89 Aligned_cols=47 Identities=19% Similarity=0.289 Sum_probs=31.8 Q ss_pred HHHHH-HCCEEEEEEC-CCCHHHHHHHHHHCCCCEEEEEECCCCCCCCC Q ss_conf 55744-2020356404-76214678887422455058641266542121 Q gi|254780402|r 122 RFLKY-WKPDCMILSE-SDIWPLTVFELSKQRIPQVLVNARMSRRSFKN 168 (440) Q Consensus 122 ~fl~~-~~P~~~i~~e-~ElWPnli~~~~~~~ip~~linarls~~S~~~ 168 (440) .++.. .+++.+++.- ...=|..+..+++.|||++.+|..+++...+. T Consensus 51 ~~I~~~~~vd~Iii~p~~~~~~~~l~~A~~agIPVv~~d~~~~~~~~~~ 99 (305) T cd06324 51 TILQRPDKPDALIFTNEKSVAPELLRLAEGAGVKLFLVNSGLTEAQARE 99 (305) T ss_pred HHHHCCCCCEEEEECCCCCCHHHHHHHHHHCCCEEEEECCCCCHHHHHH T ss_conf 9983637970999879620018999999976983999658888365544 No 278 >PRK13435 response regulator; Provisional Probab=27.65 E-value=41 Score=13.89 Aligned_cols=104 Identities=16% Similarity=0.144 Sum_probs=50.1 Q ss_pred EEEEEEECCCCHHHHHHHHHHHHCCHHHH-HCCCC----C-CCCCCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCC Q ss_conf 24889705521147888776530010110-01344----4-446641675067652025542208206613554445458 Q gi|254780402|r 264 VLTIIVPRHPRRCDAIERRLIAKGLKVAR-RSRGD----V-INAEVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQN 337 (440) Q Consensus 264 ~~lIIvPRhpeR~~~i~~~l~~~gl~~~~-~S~~~----~-~~~~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN 337 (440) ++.+||--++.=...+.+.+++.|+.+.- -+.++ . -....|+++.|-. +.+++ .|.. T Consensus 2 mRILIVEDe~~i~~~l~~~L~~~G~~vv~~A~~~~eAl~~~~~~~PDlvllDi~--------------LpdG~---~G~e 64 (141) T PRK13435 2 LRVLIVEDEALIALELEKLLEEAGHQVVGIASTSEQALALGRRAQPDVALVDIR--------------LADGP---TGVE 64 (141) T ss_pred CEEEEECCCHHHHHHHHHHHHHCCCEEEEEECCHHHHHHHHHCCCCCEEEECCC--------------CCCCC---CHHH T ss_conf 789998998999999999999879979997599999999976599989997887--------------89999---8999 Q ss_pred HHH--HHHHCCCEEECCCCCCHHHHHHHHHHCCCE-EEEC--CHHHHHHHHHHHHC Q ss_conf 376--896199499989844379999999978986-9957--98999999999958 Q gi|254780402|r 338 PLE--AAMLGCAILSGPNVENFRDIYRRMVSSGAV-RIVE--EVGTLADMVYSLLS 388 (440) Q Consensus 338 ~lE--paa~G~pVi~GP~~~nf~e~~~~L~~~g~~-~~v~--d~~eL~~~l~~ll~ 388 (440) ... ....+.|||+-.- +.+....+.. |+. +.++ +.++|.+++.+.+. T Consensus 65 ~~r~l~~~~~ipvI~lTa---~~d~~~~~~a-GA~dyl~KPf~~~eL~~ai~~~~~ 116 (141) T PRK13435 65 VARRLSIDGGVEVIFMTA---NPERVGHDFA-GALGVIAKPYSLRQVARALSYVAA 116 (141) T ss_pred HHHHHHHCCCCCEEEEEC---CHHHHHHHHC-CCCCEEECCCCHHHHHHHHHHHHH T ss_conf 999998759983899977---7998601345-998589899998999999999997 No 279 >TIGR01162 purE phosphoribosylaminoimidazole carboxylase, catalytic subunit; InterPro: IPR000031 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR, the sixth step of de novo purine biosynthesis. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. Some members of this family contain two copies of this domain . The crystal structure of PurE indicates a unique quaternary structure that confirms the octameric nature of the enzyme .; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process, 0009320 phosphoribosylaminoimidazole carboxylase complex. Probab=27.61 E-value=40 Score=13.93 Aligned_cols=124 Identities=15% Similarity=0.172 Sum_probs=66.7 Q ss_pred EECCCCCCCCCCCCCCCCCCCCCC---CCEEEEEEECCCCHHHHHHHHHHHHCC-HHHHHCCCCCCCCCCEEEEECCCCC Q ss_conf 750465533210000000000123---412488970552114788877653001-0110013444446641675067652 Q gi|254780402|r 238 WAAISTFEGEEDKAVYVHNFIKCR---TDVLTIIVPRHPRRCDAIERRLIAKGL-KVARRSRGDVINAEVDIFLGDTIGE 313 (440) Q Consensus 238 ~vagSth~~EE~~il~a~~~l~~~---~~~~lIIvPRhpeR~~~i~~~l~~~gl-~~~~~S~~~~~~~~~~V~i~Dt~Ge 313 (440) ++.||. .|-+.+-++.+.|++. .+++.+=|.|=||+..++.+..++.|+ .+ +| T Consensus 3 iIMGS~--SD~~~m~~a~~~L~~fgi~~e~~V~SAHRTP~~~~~ya~~a~~~G~P~V---------------iI------ 59 (159) T TIGR01162 3 IIMGSD--SDLETMKKAAEILEEFGIPYELRVVSAHRTPELMFEYAKEAEERGIPKV---------------II------ 59 (159) T ss_pred EECCCC--CCHHHHHHHHHHHHHCCCCEEEEEECCCCCHHHHHHHHHHHHHCCCCEE---------------EE------ T ss_conf 853672--5478999999999855996678986067780889999999986789979---------------98------ Q ss_pred CHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHH-----CC------CEEEE--CCHHHHH Q ss_conf 02554220820661355444545837689619949998984437999999997-----89------86995--7989999 Q gi|254780402|r 314 MGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVS-----SG------AVRIV--EEVGTLA 380 (440) Q Consensus 314 L~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~-----~g------~~~~v--~d~~eL~ 380 (440) +.=||| .+ --.+=++.--+|||==|=-...-.-.+.|.. .| +.-.. .|+.-|+ T Consensus 60 ----------AgAGga--AH--LPGmvAa~T~~PVIGVPv~~~~l~G~DSLlSiVQMP~GvPVATvaig~aGA~NAAllA 125 (159) T TIGR01162 60 ----------AGAGGA--AH--LPGMVAALTTLPVIGVPVPSKALSGLDSLLSIVQMPAGVPVATVAIGNAGAKNAALLA 125 (159) T ss_pred ----------ECCCCC--CC--CCCCCCCCCCCCEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEECCCHHHHHHHHHH T ss_conf ----------403511--33--4010011478775503477342452135445430758974653003713278899999 Q ss_pred HHHHHHHCCHHHHHHHHHH Q ss_conf 9999995898999999999 Q gi|254780402|r 381 DMVYSLLSEPTIRYEMINA 399 (440) Q Consensus 381 ~~l~~ll~d~~~~~~~~~~ 399 (440) ..+..+ .|++.++++.+- T Consensus 126 a~ILa~-~~~~l~~kl~~~ 143 (159) T TIGR01162 126 AQILAI-KDPELAEKLKEY 143 (159) T ss_pred HHHHCC-CCHHHHHHHHHH T ss_conf 999726-898899999999 No 280 >PRK13229 consensus Probab=27.55 E-value=41 Score=13.87 Aligned_cols=58 Identities=14% Similarity=0.148 Sum_probs=34.3 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHH-HHHH Q ss_conf 082066135544454583768961994999898443799999999789869957989999999-9995 Q gi|254780402|r 321 TEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMV-YSLL 387 (440) Q Consensus 321 AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l-~~ll 387 (440) .+.++||-|..+ +.-|-+.|.++|.=.+- |.+ ..+.+.++-..+++-+||...+ .+++ T Consensus 168 ~~~l~VGDs~~D-----i~aA~~AGi~~i~v~~G--~~~--~~l~~~~pd~ii~~~~EL~~~l~e~~~ 226 (234) T PRK13229 168 DRALMVGDSRTD-----IDTAKAAGIPVVAVDFG--YSD--VPVQHLEPSRVISHFDELTPELAEDLI 226 (234) T ss_pred CCEEEEECCHHH-----HHHHHHCCCEEEEECCC--CCC--CCHHHCCCCEEECCHHHHHHHHHHHHH T ss_conf 367999268989-----99999969908998689--996--657676999998999998999999998 No 281 >PRK08124 flagellar motor protein MotA; Validated Probab=27.38 E-value=28 Score=14.97 Aligned_cols=45 Identities=16% Similarity=0.163 Sum_probs=19.2 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH Q ss_conf 9999999589899999999999999704556999999999752378 Q gi|254780402|r 380 ADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDSYVNPL 425 (440) Q Consensus 380 ~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l~~~ 425 (440) +..+....++.....+|...+---+.++. .-..+.+.++.||+|- T Consensus 205 A~kl~~~~~~e~~~~~~i~egil~i~~g~-~P~~i~e~L~~~L~p~ 249 (263) T PRK08124 205 ANKLKRKSKEEIEKKYIIIEGVLAIQEGN-APRIIEEKLLGYLSPK 249 (263) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CHHHHHHHHHHCCCHH T ss_conf 99999899999999899999999986899-8799999998609962 No 282 >cd01516 FBPase_glpX Bacterial fructose-1,6-bisphosphatase, glpX-encoded. A dimeric enzyme dependent on Mg(2+). glpX-encoded FPBase (FBPase class II) differs from other members of the inositol-phosphatase superfamily by permutation of secondary structure elements. The core structure around the active site is well preserved. In E. coli, FBPase II is part of the glp regulon, which mediates growth on glycerol or sn-glycerol 3-phosphate as the sole carbon source. Probab=27.05 E-value=42 Score=13.82 Aligned_cols=33 Identities=27% Similarity=0.560 Sum_probs=21.3 Q ss_pred CCEEEEEE--ECCCCHHHHHHHHHHHHCCHHHHHCCCCC Q ss_conf 41248897--05521147888776530010110013444 Q gi|254780402|r 262 TDVLTIIV--PRHPRRCDAIERRLIAKGLKVARRSRGDV 298 (440) Q Consensus 262 ~~~~lIIv--PRhpeR~~~i~~~l~~~gl~~~~~S~~~~ 298 (440) .++..++. ||| +++.+.+++.|-...+.++++. T Consensus 154 ~dltV~vLdRpRH----~~lI~eir~~Gari~Li~DGDV 188 (309) T cd01516 154 EDLTVVVLDRPRH----AALIEEIREAGARIKLIPDGDV 188 (309) T ss_pred HHCEEEEEECCHH----HHHHHHHHHCCCCEEEECCCCH T ss_conf 4718999865617----9999999975993899667787 No 283 >cd01888 eIF2_gamma eIF2-gamma (gamma subunit of initiation factor 2). eIF2 is a heterotrimeric translation initiation factor that consists of alpha, beta, and gamma subunits. The GTP-bound gamma subunit also binds initiator methionyl-tRNA and delivers it to the 40S ribosomal subunit. Following hydrolysis of GTP to GDP, eIF2:GDP is released from the ribosome. The gamma subunit has no intrinsic GTPase activity, but is stimulated by the GTPase activating protein (GAP) eIF5, and GDP/GTP exchange is stimulated by the guanine nucleotide exchange factor (GEF) eIF2B. eIF2B is a heteropentamer, and the epsilon chain binds eIF2. Both eIF5 and eIF2B-epsilon are known to bind strongly to eIF2-beta, but have also been shown to bind directly to eIF2-gamma. It is possible that eIF2-beta serves simply as a high-affinity docking site for eIF5 and eIF2B-epsilon, or that eIF2-beta serves a regulatory role. eIF2-gamma is found only in eukaryotes and archaea. It is closely related to SelB, the sel Probab=26.84 E-value=42 Score=13.79 Aligned_cols=87 Identities=14% Similarity=0.035 Sum_probs=53.3 Q ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCCCCCC Q ss_conf 62146788874224550586412665421210011112331026442366415778889986088505872043101234 Q gi|254780402|r 138 DIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKIDTES 217 (440) Q Consensus 138 ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~ 217 (440) ++=+||++.+....+-+++|+|.- . .. -.|+.+....++.+|++++.|.=| |.|... T Consensus 95 df~~nmi~Gas~aD~aiLvVdA~e--G----------~~----------~~QT~eH~~l~~~lgv~~iIV~vN-KmDlv~ 151 (203) T cd01888 95 ILMATMLSGAAVMDGALLLIAANE--P----------CP----------QPQTSEHLAALEIMGLKHIIIVQN-KIDLVK 151 (203) T ss_pred HHHHHHHHHHHHCCEEEEEEECCC--C----------CC----------HHHHHHHHHHHHHCCCCCEEEEEE-CCCCCC T ss_conf 999999976643476689864366--7----------75----------077999999999849986367750-777788 Q ss_pred CCCCHHHHHHHHHHCC----CCCEEECCCCCCCC Q ss_conf 4221256899998617----98617504655332 Q gi|254780402|r 218 LPCDKELLSLYQESIA----GRYTWAAISTFEGE 247 (440) Q Consensus 218 ~~~~~~~~~~~~~~~~----~r~v~vagSth~~E 247 (440) .....+....+...+. +..-++--|...|+ T Consensus 152 ~~~~~~~~~ei~~~l~~~~~~~~~iIPiSA~~G~ 185 (203) T cd01888 152 EEQALENYEQIKKFVKGTIAENAPIIPISAQLKY 185 (203) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCEEEEEECCCCC T ss_conf 6789999999999855216899859991478897 No 284 >PRK00830 consensus Probab=26.74 E-value=43 Score=13.78 Aligned_cols=26 Identities=15% Similarity=0.262 Sum_probs=14.9 Q ss_pred HHCCCEEECCCCCCHHHHHHHHHHCC Q ss_conf 61994999898443799999999789 Q gi|254780402|r 343 MLGCAILSGPNVENFRDIYRRMVSSG 368 (440) Q Consensus 343 a~G~pVi~GP~~~nf~e~~~~L~~~g 368 (440) ..++|||..=-..+..++.+.+...+ T Consensus 215 ~~~iPvIasGGv~~~~di~~~~~~~~ 240 (273) T PRK00830 215 EVDIPVIASGGVGNPEHIYEGFSDGK 240 (273) T ss_pred CCCCCEEEECCCCCHHHHHHHHHHCC T ss_conf 37998899889999999999998389 No 285 >PRK11230 glycolate oxidase subunit GlcD; Provisional Probab=26.59 E-value=43 Score=13.76 Aligned_cols=83 Identities=11% Similarity=0.068 Sum_probs=43.7 Q ss_pred CCEEEEEECCHHHHHHHHHHHHHHHHCCCCEEE-EEC---CCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEE Q ss_conf 877999817899999999999999861998999-962---7763001210147741688505572013445574420203 Q gi|254780402|r 56 GPLIWFHASSVGETMALIGLIPAIRSRHVNVLL-TTM---TATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDC 131 (440) Q Consensus 56 ~~~IW~HaaSvGE~~~a~pli~~l~~~~~~ili-T~~---T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~ 131 (440) -|..=+...|..|+..+..+..+. ++-| ... +.+|-... ....+. +|+.. .++.+..-.-+. T Consensus 55 ~P~aVv~P~s~eeV~~iv~~a~~~-----~ipvvprGggTgL~Gga~p---~~~gIv-----l~l~r-mn~Ileid~~~~ 120 (499) T PRK11230 55 TPLLVVLPKQMEQVQALLAVCHRL-----RVPVVARGAGTGLSGGALP---LEKGVL-----LVMAR-FNRILDIDPVGR 120 (499) T ss_pred CCCEEEEECCHHHHHHHHHHHHHC-----CCCEEEECCCCCCCCCCCC---CCCEEE-----EEECC-CCCEEEEECCCC T ss_conf 988899409999999999999987-----9909997898322658126---898299-----98001-277068826899 Q ss_pred EEEECCC-CHHHHHHHHHHCCC Q ss_conf 5640476-21467888742245 Q gi|254780402|r 132 MILSESD-IWPLTVFELSKQRI 152 (440) Q Consensus 132 ~i~~e~E-lWPnli~~~~~~~i 152 (440) .+.+|.- +.+.+-..+++.|. T Consensus 121 ~v~VeaGv~~~~l~~~l~~~Gl 142 (499) T PRK11230 121 RARVQPGVRNLAISQAAAPHGL 142 (499) T ss_pred EEEEECCCCHHHHHHHHHHCCC T ss_conf 7999458107999998986698 No 286 >pfam05211 NLBH Neuraminyllactose-binding hemagglutinin precursor (NLBH). This family is comprised of several flagellar sheath adhesin proteins also called neuraminyllactose-binding hemagglutinin precursor (NLBH) or N-acetylneuraminyllactose-binding fibrillar hemagglutinin receptor-binding subunits. NLBH is found exclusively in Helicobacter which are gut colonising bacteria and bind to sialic acid rich macromolecules present on the gastric epithelium. Probab=26.33 E-value=43 Score=13.73 Aligned_cols=159 Identities=16% Similarity=0.089 Sum_probs=84.1 Q ss_pred CCCCCCEEEEEEECC------CCH---------HHHHHHHHHHHCCHHHHHCCCCCCCC---CCEEEEECCCCCCHHHHH Q ss_conf 012341248897055------211---------47888776530010110013444446---641675067652025542 Q gi|254780402|r 258 IKCRTDVLTIIVPRH------PRR---------CDAIERRLIAKGLKVARRSRGDVINA---EVDIFLGDTIGEMGFYLR 319 (440) Q Consensus 258 l~~~~~~~lIIvPRh------peR---------~~~i~~~l~~~gl~~~~~S~~~~~~~---~~~V~i~Dt~GeL~~lY~ 319 (440) ..++..+.+|+.|+- |+. +..+...+++.|.++...++.+.... ..-.++.|-=|+++-+=- T Consensus 29 ~p~n~h~v~ll~P~iq~sdni~k~Y~~kF~n~l~~qi~~ilek~GY~v~~fkD~~~l~~~~Kkk~~lvL~m~G~v~ileD 108 (228) T pfam05211 29 VPANDHKVLLLRPAIQYSDNIAKEYENKFKNQLTLQVEEILENQGYKVINFSDKDDFSFAQKKKGYLVLAMNGEVVILED 108 (228) T ss_pred CCCCCCEEEEECCCCHHCCCCCHHHHHHHHHHHHHHHHHHHHHCCCEEEEECCHHHCCHHHHHHHEEEEECCCCEEEECC T ss_conf 67667259996353301123367899999999999999999856935776245222366765310689962574887126 Q ss_pred H------------CCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHH---HCCCEEEECC-H-----HH Q ss_conf 2------------082066135544454583768961994999898443799999999---7898699579-8-----99 Q gi|254780402|r 320 M------------TEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMV---SSGAVRIVEE-V-----GT 378 (440) Q Consensus 320 ~------------AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~---~~g~~~~v~d-~-----~e 378 (440) + -.++-+-.+++ --|++||.+--+-=-||-.++.|+...+.+. ...+++.+++ . .. T Consensus 109 ~k~~~~~~~~~~~~~~vd~SSG~v---~~nf~EP~S~~vvhsF~vdvs~~~a~~~~~~~t~s~sgGf~~kt~vh~~ke~n 185 (228) T pfam05211 109 PKRTIQKKSGLDKMEGVDISSGFV---KVNFLEPMSGESLHSFTMDVSELDAQEKFLKTTHSSSGGFLVKTMVHRIKGTN 185 (228) T ss_pred HHHHCCCCCCCCHHHCEECCCCEE---EEEEECCCCCCCHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCC T ss_conf 364335889888211152247758---99986577665077631236778888988741345778864277776530247 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 999999995898999999999999997045569999999997 Q gi|254780402|r 379 LADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLRSLDS 420 (440) Q Consensus 379 L~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~ 420 (440) =-+++...| |.--..-|.+.-.+.-++|-++.++....++. T Consensus 186 ~~dAIkkil-Nk~ya~vM~k~~keLtkkni~~Y~k~~~emK~ 226 (228) T pfam05211 186 SNDAIKKIL-NKIYASVMQEMDKELTKRNLESYQKDAKEMKN 226 (228) T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 789999999-99999999999999989889999999998626 No 287 >COG0041 PurE Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism] Probab=26.20 E-value=41 Score=13.90 Aligned_cols=52 Identities=15% Similarity=0.093 Sum_probs=33.1 Q ss_pred EEECCCCCCCCCCCCCCCCCCCCCC---CCEEEEEEECCCCHHHHHHHHHHHHCCHH Q ss_conf 1750465533210000000000123---41248897055211478887765300101 Q gi|254780402|r 237 TWAAISTFEGEEDKAVYVHNFIKCR---TDVLTIIVPRHPRRCDAIERRLIAKGLKV 290 (440) Q Consensus 237 v~vagSth~~EE~~il~a~~~l~~~---~~~~lIIvPRhpeR~~~i~~~l~~~gl~~ 290 (440) -++.||.- +-+.+-.+...|.+. -+.+.+=+.|-||+..++.+..++.|+.+ T Consensus 6 ~IIMGS~S--D~~~mk~Aa~~L~~fgv~ye~~VvSAHRTPe~m~~ya~~a~~~g~~v 60 (162) T COG0041 6 GIIMGSKS--DWDTMKKAAEILEEFGVPYEVRVVSAHRTPEKMFEYAEEAEERGVKV 60 (162) T ss_pred EEEECCCC--HHHHHHHHHHHHHHCCCCEEEEEEECCCCHHHHHHHHHHHHHCCCEE T ss_conf 99955702--39999999999998499759999844679899999999998789759 No 288 >pfam00128 Alpha-amylase Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain. Probab=26.08 E-value=44 Score=13.70 Aligned_cols=67 Identities=13% Similarity=0.175 Sum_probs=35.7 Q ss_pred HHHHHHHHHHHHHCCCC-EEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHH Q ss_conf 99999999999861998-99996277630012101477416885055720134455744202035640476214678887 Q gi|254780402|r 69 TMALIGLIPAIRSRHVN-VLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFEL 147 (440) Q Consensus 69 ~~~a~pli~~l~~~~~~-iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~ 147 (440) ...+..-+..|++-+.+ |.++-..+++ ...|.|.|.|+...-.+ .-.+-| .=.|+.+| T Consensus 3 ~~gi~~kLdyl~~LGv~~i~l~P~~~~~----------~~~~GY~~~d~~~vd~~----------~Gt~~e-~~~Lv~~~ 61 (314) T pfam00128 3 LQGIIDKLDYLKDLGVTAIWLSPIFDSP----------QSYHGYDITDYYKIDPH----------FGTMDD-FKELIDKA 61 (314) T ss_pred HHHHHHHHHHHHHCCCCEEEECCCCCCC----------CCCCCCCCCCCCCCCCC----------CCCHHH-HHHHHHHH T ss_conf 8899982489997599989979671799----------99999683567876823----------299999-99999998 Q ss_pred HHCCCCEEE Q ss_conf 422455058 Q gi|254780402|r 148 SKQRIPQVL 156 (440) Q Consensus 148 ~~~~ip~~l 156 (440) +++||.+++ T Consensus 62 H~~Gi~Vil 70 (314) T pfam00128 62 HERGIKVIL 70 (314) T ss_pred HHCCCEEEE T ss_conf 466988860 No 289 >pfam03599 CdhD CO dehydrogenase/acetyl-CoA synthase delta subunit. Probab=25.93 E-value=44 Score=13.69 Aligned_cols=100 Identities=14% Similarity=0.123 Sum_probs=36.9 Q ss_pred EEEEEECCHHHHHHHHHHHHHHHHC-CCCEEEEECCCCCCHHHHCCCCCCEEEEE-CCCCCCHHHHHHHHHHCCEEEEEE Q ss_conf 7999817899999999999999861-99899996277630012101477416885-055720134455744202035640 Q gi|254780402|r 58 LIWFHASSVGETMALIGLIPAIRSR-HVNVLLTTMTATSAKVARKYLGQYAIHQY-APLDIQPAVSRFLKYWKPDCMILS 135 (440) Q Consensus 58 ~IW~HaaSvGE~~~a~pli~~l~~~-~~~iliT~~T~tg~~~~~~~~~~~~~~~y-~P~D~~~~~~~fl~~~~P~~~i~~ 135 (440) .|=+|++| |.-.....++++..+. +..+++-++.|.-.+.+-+...+.-...| +-.|....+-..-+.++--++++. T Consensus 71 ~iav~~~s-~dp~~fa~~vk~V~~~~~~pliL~~~dp~vl~aale~~~~~rPLlyaAt~~N~~~m~~lA~~~~~Pv~v~a 149 (384) T pfam03599 71 MIAIRLIS-TDPKEFAKAVEKVLQAVDVPLVIGGSDPEVLKAALEVAEDERPLLYAATLDNYKEIAELALEYKHPVLLWS 149 (384) T ss_pred EEEEEECC-CCHHHHHHHHHHHHHHCCCCEEEEECCHHHHHHHHHHHCCCCCEEEECCHHHHHHHHHHHHHCCCCEEEEC T ss_conf 89999078-99999999999999646998899808999999999974658977854899899999999997498289974 Q ss_pred CCC--CHHHHHHHHHHCCCCEEEEE Q ss_conf 476--21467888742245505864 Q gi|254780402|r 136 ESD--IWPLTVFELSKQRIPQVLVN 158 (440) Q Consensus 136 e~E--lWPnli~~~~~~~ip~~lin 158 (440) ..+ -=-.|...+.+.|++-++++ T Consensus 150 ~~dl~~l~~L~~~l~~~GikdlVLD 174 (384) T pfam03599 150 INDLNELKNLNRKLLKAGVKDIVLD 174 (384) T ss_pred CCCHHHHHHHHHHHHHCCCCCEEEC T ss_conf 8889999999999997698548988 No 290 >pfam01765 RRF Ribosome recycling factor. The ribosome recycling factor (RRF / ribosome release factor) dissociates the ribosome from the mRNA after termination of translation, and is essential bacterial growth. Thus ribosomes are "recycled" and ready for another round of protein synthesis. Probab=25.62 E-value=45 Score=13.65 Aligned_cols=23 Identities=13% Similarity=0.008 Sum_probs=9.3 Q ss_pred EEECCCCCCHHHHHHCCCEEEEC Q ss_conf 75067652025542208206613 Q gi|254780402|r 306 FLGDTIGEMGFYLRMTEIAFIGR 328 (440) Q Consensus 306 ~i~Dt~GeL~~lY~~AdiafVGG 328 (440) +.|+..|.-..+=.+|.|+..++ T Consensus 22 i~V~~yg~~~pL~~lA~Isv~~~ 44 (165) T pfam01765 22 IRVDYYGSPTPLNQLASISVPEA 44 (165) T ss_pred EEEEECCCCCCHHHHHHCCCCCC T ss_conf 69856899765899853027898 No 291 >cd01540 PBP1_arabinose_binding Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Periplasmic L-arabinose-binding protein (ABP), a member of a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. ABP is only involved in transport contrary to other related sugar-binding proteins such as the glucose/galactose-binding protein (GGBP) and the ribose-binding protein (RBP), both of which are involved in chemotaxis as well as transport. The periplasmic ABP consists of two alpha/beta globular domains connected by a three-stranded hinge, a Venus flytrap-like domain, which undergoes a transition from an open to a closed conformational state upon ligand binding. Moreover, ABP is homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR) and DNA-binding transcriptional repressors such a Probab=25.24 E-value=45 Score=13.60 Aligned_cols=44 Identities=11% Similarity=0.167 Sum_probs=32.2 Q ss_pred HHHHHHHCCEEEEEE--CCCCHHHHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 455744202035640--476214678887422455058641266542 Q gi|254780402|r 121 SRFLKYWKPDCMILS--ESDIWPLTVFELSKQRIPQVLVNARMSRRS 165 (440) Q Consensus 121 ~~fl~~~~P~~~i~~--e~ElWPnli~~~~~~~ip~~linarls~~S 165 (440) ..|+.+ +++.+++. +.+..|..+..|+..|||++.++.++.+.. T Consensus 48 e~~i~q-gvd~iii~p~d~~~~~~~v~~A~~agIpVv~~D~~~~~~~ 93 (289) T cd01540 48 DNLGAQ-GAKGFVICVPDVKLGPAIVAKAKAYNMKVVAVDDRLVDAD 93 (289) T ss_pred HHHHHC-CCCEEEECCCCCCHHHHHHHHHHHCCCEEEEECCCCCCCC T ss_conf 999985-8988999278710018999999987986999346455655 No 292 >TIGR02617 tnaA_trp_ase tryptophanase; InterPro: IPR013440 Proteins in this entry belong to the beta-eliminating lyase family and are thought to act as tryptophanases (4.1.99.1 from EC) (also known as L-tryptophan indole-lyases). The genes encoding these proteins are, as a rule, found with a tryptophanase leader peptide TnaC (IPR013439 from INTERPRO) encoded upstream. ; GO: 0009034 tryptophanase activity, 0006568 tryptophan metabolic process. Probab=25.18 E-value=33 Score=14.49 Aligned_cols=201 Identities=14% Similarity=0.201 Sum_probs=88.2 Q ss_pred HHHHHHHCCCCEEEEEECCCCCCC---------CCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCCCEEEECCCCC Q ss_conf 788874224550586412665421---------21001111233102644236641577888998608850587204310 Q gi|254780402|r 143 TVFELSKQRIPQVLVNARMSRRSF---------KNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQKLIVSGNLKI 213 (440) Q Consensus 143 li~~~~~~~ip~~linarls~~S~---------~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~~i~v~GnlK~ 213 (440) +...|++..|||++=+||+.+..+ +.|. +..+.|.+++.-|...+.-.+| -+ |.+.|=+=| T Consensus 209 vY~IAk~yDIPVv~DSARFaENAyFIkQRE~~Y~~~s-i~~I~~E~Y~YaD~L~MSAKKD--~~-------V~~GGL~C~ 278 (468) T TIGR02617 209 VYEIAKKYDIPVVMDSARFAENAYFIKQREKEYENWS-IEQITREMYKYADMLAMSAKKD--AL-------VNMGGLLCF 278 (468) T ss_pred HHHHHHHCCCCEEECCHHHHHHHHHHHHHHHHHCCCC-HHHHHHHHHHHCCHHHHHHCCC--CC-------CCCCCEEEE T ss_conf 9999864599868623022334577775330000331-8898888654202011010125--40-------004762552 Q ss_pred CCCCCCCCHHHHHHHHHHCCCCCEEECC-CCCCC----C-CCCCCCCCCCCCCCCCEEEEEEECCCCHHHHHHHHHHHHC Q ss_conf 1234422125689999861798617504-65533----2-1000000000012341248897055211478887765300 Q gi|254780402|r 214 DTESLPCDKELLSLYQESIAGRYTWAAI-STFEG----E-EDKAVYVHNFIKCRTDVLTIIVPRHPRRCDAIERRLIAKG 287 (440) Q Consensus 214 d~~~~~~~~~~~~~~~~~~~~r~v~vag-Sth~~----E-E~~il~a~~~l~~~~~~~lIIvPRhpeR~~~i~~~l~~~g 287 (440) .|..... +-.....+.|..=| .|.-| + |.+.+-...-.++ + . +|= .++..+-+.+.+++.| T Consensus 279 ------~Ddsf~D-~y~E~RT~CV~~EGF~TYGGL~G~~ME~LAvGL~~GM~~--~--~-LAY-RI~QV~YL~~~L~~~G 345 (468) T TIGR02617 279 ------KDDSFFD-VYTEVRTLCVVQEGFPTYGGLEGGAMERLAVGLYDGMNE--D--Y-LAY-RINQVAYLVNGLEDIG 345 (468) T ss_pred ------CCCCHHH-HHHCCCCEEECCCCCCCCCCCCCHHHHHHHHHHHHHCCC--C--H-HHH-HHHHHHHHHHHHHHCC T ss_conf ------0352245-533312223214675555777604578887514530034--3--2-778-8999999985576678 Q ss_pred CHHHHHCCCCCCCCCCEEEE----ECCCC---CCHHHHHHCCCEEE-ECCCC----CC-CCC--CHHHHHHHCCCEE--E Q ss_conf 10110013444446641675----06765---20255422082066-13554----44-545--8376896199499--9 Q gi|254780402|r 288 LKVARRSRGDVINAEVDIFL----GDTIG---EMGFYLRMTEIAFI-GRSFC----AS-GGQ--NPLEAAMLGCAIL--S 350 (440) Q Consensus 288 l~~~~~S~~~~~~~~~~V~i----~Dt~G---eL~~lY~~AdiafV-GGSl~----~~-GGh--N~lEpaa~G~pVi--~ 350 (440) ..++.-.-+-..--+..-+| +|++= --..||..|-|==| =|||- |. |=| .|.|=+.+-.|=- + T Consensus 346 v~~Q~P~GGHA~FVDA~~LLPHIPa~QFPA~AL~~ELY~~AGIRaVEIGSlLLGRDP~TGKQ~P~P~ELLRLTIPR~~YT 425 (468) T TIGR02617 346 VVIQQPAGGHAAFVDAGKLLPHIPADQFPAHALACELYKVAGIRAVEIGSLLLGRDPKTGKQKPCPAELLRLTIPRATYT 425 (468) T ss_pred CEEECCCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHHCCCEEEEECHHHHCCCCCCCCCCCCHHHHHHHCCCCHHHH T ss_conf 05764998637676311027888865554888855567761711453101421678878788887067764026620110 Q ss_pred CCCCCCHHHHHHHHHH Q ss_conf 8984437999999997 Q gi|254780402|r 351 GPNVENFRDIYRRMVS 366 (440) Q Consensus 351 GP~~~nf~e~~~~L~~ 366 (440) --||+=.-|++..+.+ T Consensus 426 ~~HMDfiiEAF~~~K~ 441 (468) T TIGR02617 426 QTHMDFIIEAFKSVKE 441 (468) T ss_pred CCHHHHHHHHHHHHHH T ss_conf 1124789999999987 No 293 >PRK09140 2-dehydro-3-deoxy-6-phosphogalactonate aldolase; Reviewed Probab=24.70 E-value=46 Score=13.54 Aligned_cols=85 Identities=16% Similarity=0.166 Sum_probs=47.7 Q ss_pred CCHHHHHH----HHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCE Q ss_conf 15778889----98608850587204310123442212568999986179861750465533210000000000123412 Q gi|254780402|r 189 QSERYFRR----YKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDV 264 (440) Q Consensus 189 qs~~~~~r----l~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~ 264 (440) .+.+++.. +.+-|++-+.|+=++ +...+..+.+++..++..++-||+.-..|+ +.+.. .-.. T Consensus 19 ~~~~~a~~~~~al~~~Gi~~iEVTl~t-------p~a~~~I~~l~~~~~~~~~iGAGTVlt~e~-----~~~ai--~aGA 84 (206) T PRK09140 19 ITPDEALAHVGALIEAGFRAIEIPLNS-------PDPFDSIAALVKALGDDALIGAGTVLSPEQ-----VDRLA--DAGG 84 (206) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEECCC-------CCHHHHHHHHHHHCCCCEEEEEEECCCHHH-----HHHHH--HCCC T ss_conf 999999999999998699889991799-------769999999999679865998620467999-----99999--8599 Q ss_pred EEEEEECCCCHHHHHHHHHHHHCCHH Q ss_conf 48897055211478887765300101 Q gi|254780402|r 265 LTIIVPRHPRRCDAIERRLIAKGLKV 290 (440) Q Consensus 265 ~lIIvPRhpeR~~~i~~~l~~~gl~~ 290 (440) ..|+.| +.+ +++.+.+.+.|+.+ T Consensus 85 ~FiVSP-~~~--~~vi~~a~~~~i~~ 107 (206) T PRK09140 85 RLIVTP-NID--PEVIRRAVAYGMTV 107 (206) T ss_pred CEEECC-CCC--HHHHHHHHHCCCCC T ss_conf 999999-998--99999999829965 No 294 >pfam01996 F420_ligase F420-0:Gamma-glutamyl ligase. F420-0:Gamma-glutamyl ligase (EC:6.3.2.-) is an enzyme involved in F420 biosynthesis pathway. It catalyses the GTP-dependent successive addition of multiple gamma-linked L-glutamates to the L-lactyl phosphodiester of 7,8-didemethyl-8-hydroxy-5-deazariboflavin (F420-0). This reaction produces polyglutamated F420 derivatives. GTP + F420-0 + n L-glutamate - GDP + phosphate + F420-n Probab=24.62 E-value=46 Score=13.53 Aligned_cols=39 Identities=18% Similarity=0.419 Sum_probs=25.5 Q ss_pred CCEEEEEEECCCCH-HHHHHHHHHHHCCHHHHHCCCCCCCCCCEEEEECCCC Q ss_conf 41248897055211-4788877653001011001344444664167506765 Q gi|254780402|r 262 TDVLTIIVPRHPRR-CDAIERRLIAKGLKVARRSRGDVINAEVDIFLGDTIG 312 (440) Q Consensus 262 ~~~~lIIvPRhpeR-~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~i~Dt~G 312 (440) ++-..++.|++|+. ++++.+.+.+. ...+..|+|.||+| T Consensus 112 ~~~~v~LlP~dPd~sA~~i~~~l~~~------------~g~~v~ViI~Ds~g 151 (219) T pfam01996 112 PGGYVVLLPEDPDASARELRAALKER------------TGKDVGVIITDTNG 151 (219) T ss_pred CCCEEEECCCCHHHHHHHHHHHHHHH------------HCCCEEEEEECCCC T ss_conf 99848987889899999999999998------------59976899966998 No 295 >PRK01033 imidazole glycerol phosphate synthase subunit HisF; Provisional Probab=24.55 E-value=47 Score=13.52 Aligned_cols=50 Identities=18% Similarity=0.213 Sum_probs=27.4 Q ss_pred CCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCC Q ss_conf 157788899860885058720431012344221256899998617986175046553 Q gi|254780402|r 189 QSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFE 245 (440) Q Consensus 189 qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~ 245 (440) .|.++++++.+.|+++|.+. +.=+ .+.+....+.+.++.+.+.++--... T Consensus 84 rs~e~~~~ll~~GadkViig-s~a~------~~p~~i~~~~~~fG~q~IvvsiD~k~ 133 (253) T PRK01033 84 KTVEQAKRIFSLGVEKVSIS-TAAL------EDPKLITEAAEIYGSQSVVVSIDVKK 133 (253) T ss_pred CCHHHHHHHHHCCCCEEEEC-CHHH------HCCHHHHHHHHHCCCCCEEEEEEEEC T ss_conf 21688899986798669999-8786------37416578998779976999999824 No 296 >PRK05282 peptidase E; Validated Probab=24.53 E-value=47 Score=13.52 Aligned_cols=11 Identities=27% Similarity=0.389 Sum_probs=5.4 Q ss_pred EEEEECCCCCC Q ss_conf 99996277630 Q gi|254780402|r 86 VLLTTMTATSA 96 (440) Q Consensus 86 iliT~~T~tg~ 96 (440) +|+|++|..|. T Consensus 4 LLlS~St~~g~ 14 (233) T PRK05282 4 LLLSNSTLPGT 14 (233) T ss_pred EEEECCCCCCC T ss_conf 99824767998 No 297 >cd06274 PBP1_FruR Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs. Ligand binding domain of DNA transcription repressor specific for fructose (FruR) and its close homologs, all of which are a member of the LacI-GalR family of bacterial transcription regulators. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to members of the type I periplasmic binding protein superfamily. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor Probab=24.51 E-value=47 Score=13.51 Aligned_cols=104 Identities=15% Similarity=0.143 Sum_probs=60.3 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHH Q ss_conf 99999999998619989999627763001210147741688505572013445574420203564047621467888742 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSK 149 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~ 149 (440) ..+..+-+.+.+.+.++++.++-.. .+. +. ..++ .+...+++.+|+.-+..=...+..+.+ T Consensus 16 ~l~~~ie~~~~~~Gy~~ll~~s~~d-~~~--e~---------------~~i~-~l~~~~vdGiIi~~~~~~~~~~~~~~~ 76 (264) T cd06274 16 RIAKRLEALARERGYQLLIACSDDD-PET--ER---------------ETVE-TLIARQVDALIVAGSLPPDDPYYLCQK 76 (264) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCC-HHH--HH---------------HHHH-HHHHCCCCEEEECCCCCCHHHHHHHHH T ss_conf 9999999999986998999938999-899--99---------------9999-999669987997678898799999997 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHH----HHHHHHHHCCCCEEEEC Q ss_conf 2455058641266542121001111233102644236641577----88899860885058720 Q gi|254780402|r 150 QRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSER----YFRRYKELGAQKLIVSG 209 (440) Q Consensus 150 ~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~----~~~rl~~lG~~~i~v~G 209 (440) .++|+++++...... .++.|.+-+.. ..++|.+.|-++|-..| T Consensus 77 ~~iPvV~idr~~~~~-----------------~~~~V~~Dn~~~~~~a~~~L~~~Ghr~I~~i~ 123 (264) T cd06274 77 AGLPVVALDRPGDPS-----------------RFPSVVSDNRDGAAELTRELLAAPPEEVLFLG 123 (264) T ss_pred CCCCEEEEECCCCCC-----------------CCCEEEECHHHHHHHHHHHHHHCCCCEEEEEC T ss_conf 599889985058989-----------------99789976699999999999985996699976 No 298 >PRK13478 phosphonoacetaldehyde hydrolase; Provisional Probab=24.50 E-value=47 Score=13.51 Aligned_cols=61 Identities=10% Similarity=0.173 Sum_probs=36.1 Q ss_pred CCCEEEECCCCCCCCCCHHHHHHHCCCEEE----CCCCC----------------CHHHHHHHHHHCCCEEEECCHHHHH Q ss_conf 082066135544454583768961994999----89844----------------3799999999789869957989999 Q gi|254780402|r 321 TEIAFIGRSFCASGGQNPLEAAMLGCAILS----GPNVE----------------NFRDIYRRMVSSGAVRIVEEVGTLA 380 (440) Q Consensus 321 AdiafVGGSl~~~GGhN~lEpaa~G~pVi~----GP~~~----------------nf~e~~~~L~~~g~~~~v~d~~eL~ 380 (440) ..++.||-|..+ +.|...+|+..+= |..+. ....+.+.|.++|+-.++++-.||- T Consensus 177 ~~vvkVgDT~~d-----I~~G~nAG~~tVGv~~~~~~~gl~~~e~~~l~~~e~~~~~~~~~~~l~~aGa~yvi~~i~dLp 251 (267) T PRK13478 177 AACVKVDDTVPG-----IEEGRNAGMWTVGLILSGNELGLTEEEYQALDAAELAARRERARAKLRAAGAHYVIDTIADLP 251 (267) T ss_pred HHEEEECCCHHH-----HHHHHHCCCEEEEEECCCCCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHCCCCEEECCHHHHH T ss_conf 893777898889-----999997899289997056555879999984899999999999999999749998944898889 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254780402|r 381 DMVYSL 386 (440) Q Consensus 381 ~~l~~l 386 (440) ..|..+ T Consensus 252 ~~i~~i 257 (267) T PRK13478 252 AVIADI 257 (267) T ss_pred HHHHHH T ss_conf 999999 No 299 >cd06309 PBP1_YtfQ_like Periplasmic binding domain of ABC-type YtfQ-like transport systems. Periplasmic binding domain of ABC-type YtfQ-like transport systems. The YtfQ protein from Escherichia coli is up-regulated under glucose-limited conditions and shares homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein superfamily. Members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes; however their ligand specificity is not determined experimentally. Probab=24.22 E-value=47 Score=13.48 Aligned_cols=43 Identities=23% Similarity=0.385 Sum_probs=30.2 Q ss_pred HHHHHHHHCCEEEEEE--CCCCHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 4455744202035640--4762146788874224550586412665 Q gi|254780402|r 120 VSRFLKYWKPDCMILS--ESDIWPLTVFELSKQRIPQVLVNARMSR 163 (440) Q Consensus 120 ~~~fl~~~~P~~~i~~--e~ElWPnli~~~~~~~ip~~linarls~ 163 (440) ++.++.. +++.+|+. +.+-....+.+++..|||++++|..+.. T Consensus 48 i~~li~~-~vDgIii~p~~~~~~~~~l~~a~~agIPvV~~d~~~~~ 92 (273) T cd06309 48 IRSFIAQ-GVDVIILAPVVETGWDPVLKEAKAAGIPVILVDRGVDV 92 (273) T ss_pred HHHHHHC-CCCEEEECCCCCCCHHHHHHHHHHCCCCEEEECCCCCC T ss_conf 9999975-99999980576543589999999749988998267767 No 300 >COG1080 PtsA Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [Carbohydrate transport and metabolism] Probab=24.07 E-value=48 Score=13.46 Aligned_cols=76 Identities=9% Similarity=0.017 Sum_probs=34.9 Q ss_pred HHHHHCCCEEECCCCCCHHHHHHHHHHCCCEE---EECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 68961994999898443799999999789869---957989999999999589899999999999999704556999999 Q gi|254780402|r 340 EAAMLGCAILSGPNVENFRDIYRRMVSSGAVR---IVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLKITLR 416 (440) Q Consensus 340 Epaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~---~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~~~~~ 416 (440) .+...|++|=+==.+..-......|...|.-. ...+-......+.++ |-..++++++++.. ....+.+.+ T Consensus 492 ~ah~~gkwvgmCGElAgD~~a~plLlGlGldElSms~~si~~vK~~i~~l--~~~~~~~la~~aL~-----~~ta~ev~~ 564 (574) T COG1080 492 AAHRHGKWVGMCGELAGDPAATPLLLGLGLDELSMSAPSIPAVKARIRKL--DSEECRELAEQALA-----CRTAEEVEE 564 (574) T ss_pred HHHHCCCEEEECHHHCCCHHHHHHHHHCCCCHHCCCCHHHHHHHHHHHHC--CHHHHHHHHHHHHC-----CCCHHHHHH T ss_conf 99973984664330115823679998638332306802389999999965--89999999999976-----998999999 Q ss_pred HHHHHH Q ss_conf 999752 Q gi|254780402|r 417 SLDSYV 422 (440) Q Consensus 417 ~i~~~l 422 (440) .+.++. T Consensus 565 l~~~~~ 570 (574) T COG1080 565 LLAQFI 570 (574) T ss_pred HHHHHH T ss_conf 999852 No 301 >COG1618 Predicted nucleotide kinase [Nucleotide transport and metabolism] Probab=23.97 E-value=48 Score=13.45 Aligned_cols=75 Identities=20% Similarity=0.311 Sum_probs=47.2 Q ss_pred CCEEEEECCCCCCHHHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEE-E--CCHHH Q ss_conf 64167506765202554220820661355444545837689619949998984437999999997898699-5--79899 Q gi|254780402|r 302 EVDIFLGDTIGEMGFYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRI-V--EEVGT 378 (440) Q Consensus 302 ~~~V~i~Dt~GeL~~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~-v--~d~~e 378 (440) +.||+|+|.+|-|-..- ..|... .=|.+-+++|+|.-=|-..-....+.....+.... + .|.+. T Consensus 100 ~aDvIIIDEIGpMElks---------~~f~~~----ve~vl~~~kpliatlHrrsr~P~v~~ik~~~~v~v~lt~~NR~~ 166 (179) T COG1618 100 EADVIIIDEIGPMELKS---------KKFREA----VEEVLKSGKPLIATLHRRSRHPLVQRIKKLGGVYVFLTPENRNR 166 (179) T ss_pred CCCEEEEECCCCHHHCC---------HHHHHH----HHHHHCCCCCEEEEEECCCCCHHHHHHHHCCCEEEEECCCHHHH T ss_conf 49989994336330200---------889999----99996589937999962567758998642487799975121567 Q ss_pred HHHHHHHHHCC Q ss_conf 99999999589 Q gi|254780402|r 379 LADMVYSLLSE 389 (440) Q Consensus 379 L~~~l~~ll~d 389 (440) +...+..+|.+ T Consensus 167 i~~~Il~~L~~ 177 (179) T COG1618 167 ILNEILSVLKG 177 (179) T ss_pred HHHHHHHHHCC T ss_conf 99999998524 No 302 >COG0126 Pgk 3-phosphoglycerate kinase [Carbohydrate transport and metabolism] Probab=23.97 E-value=48 Score=13.45 Aligned_cols=90 Identities=16% Similarity=0.179 Sum_probs=45.8 Q ss_pred HHHHHHCCCCCCCCCCCEEEEEECCHHHHHHHHHHHHHHHHCCCCEEE-EE-CCCCCCHH------HHCCCCC--CEEEE Q ss_conf 124540567777888877999817899999999999999861998999-96-27763001------2101477--41688 Q gi|254780402|r 41 KFGERLGYPTALRPIGPLIWFHASSVGETMALIGLIPAIRSRHVNVLL-TT-MTATSAKV------ARKYLGQ--YAIHQ 110 (440) Q Consensus 41 ~~~eR~g~~~~~~~~~~~IW~HaaSvGE~~~a~pli~~l~~~~~~ili-T~-~T~tg~~~------~~~~~~~--~~~~~ 110 (440) +..=|.-+..+... |+ | .+-....++.|-|+.|.++++.|++ +. .-|.++.- ..+.+.. ...+. T Consensus 16 ~VlvRvD~NvP~~d-G~-I----~dd~RI~a~lpTIk~l~~~ga~Vvl~SHlGRPk~~~~~~SL~pva~~Ls~ll~~~V~ 89 (395) T COG0126 16 RVLVRVDFNVPVDD-GK-I----TDDTRIRAALPTIKYLLEKGAKVVLLSHLGRPKEYSDKTSLEPVAKRLSELLGKEVK 89 (395) T ss_pred EEEEEECCCCCCCC-CE-E----CCCHHHHHHHHHHHHHHHCCCEEEEEECCCCCCCCCCCCCHHHHHHHHHHHCCCCEE T ss_conf 89999435786669-81-6----771879871689999995899599982478989998764479999999986299618 Q ss_pred ECCCC-CCHHHHHHHHHHCCEEEEEECC Q ss_conf 50557-2013445574420203564047 Q gi|254780402|r 111 YAPLD-IQPAVSRFLKYWKPDCMILSES 137 (440) Q Consensus 111 y~P~D-~~~~~~~fl~~~~P~~~i~~e~ 137 (440) |.| | +...+++-++..+|--++++|+ T Consensus 90 f~~-d~~g~~a~~~v~~l~~GevlLLEN 116 (395) T COG0126 90 FVD-DCVGPEARQAVAELKDGEVLLLEN 116 (395) T ss_pred ECC-CCCCHHHHHHHHCCCCCCEEEEEE T ss_conf 544-446879999996168986799863 No 303 >cd06287 PBP1_LacI_like_8 Ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. This group includes the ligand-binding domain of uncharacterized DNA-binding regulatory proteins that are members of the LacI-GalR family of bacterial transcription repressors. The LacI-GalR family repressors are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding affinity of the repressor. Probab=23.93 E-value=48 Score=13.44 Aligned_cols=101 Identities=19% Similarity=0.177 Sum_probs=60.5 Q ss_pred HHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHHC Q ss_conf 99999999986199899996277630012101477416885055720134455744202035640476214678887422 Q gi|254780402|r 71 ALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSKQ 150 (440) Q Consensus 71 ~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~~ 150 (440) .+..+-+...+++..+++.++.. +-+ -++..+-|-+|++.++.=--.+..+.++ T Consensus 25 ~~~~ve~~A~~~gy~liL~~~~~-~~~-------------------------~l~~~~VDGvIl~~~~~~d~~i~~L~~~ 78 (269) T cd06287 25 VAAAAAESALERGLALCLVPPHE-ADS-------------------------PLDALDIDGAILVEPMADDPQVARLRQR 78 (269) T ss_pred HHHHHHHHHHHCCCEEEEECCCC-CHH-------------------------HHHHCCCCEEEEECCCCCCHHHHHHHHC T ss_conf 99999999998699899947998-655-------------------------6875879989990378897799999976 Q ss_pred CCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECC----HHHHHHHHHHCCCCEE-EECCCCC Q ss_conf 4550586412665421210011112331026442366415----7788899860885058-7204310 Q gi|254780402|r 151 RIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQS----ERYFRRYKELGAQKLI-VSGNLKI 213 (440) Q Consensus 151 ~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs----~~~~~rl~~lG~~~i~-v~GnlK~ 213 (440) |+|+++++ |... ... .+..|-.-+ ....++|.++|-++|- ++|+... T Consensus 79 ~iP~V~ig-r~~~-~~~--------------~~~~Vd~Dn~~~~~~a~~hLi~~GhrrIa~i~g~~~~ 130 (269) T cd06287 79 GIPVVSIG-RPPG-DRT--------------DVPYVDLQSAATARMLLEHLRAQGARQIALIVGSARR 130 (269) T ss_pred CCCEEEEC-CCCC-CCC--------------CCCEEECCHHHHHHHHHHHHHHCCCCEEEEEECCCCC T ss_conf 99999989-8787-789--------------9828966659999999999998799879999689867 No 304 >TIGR02463 MPGP_rel mannosyl-3-phosphoglycerate phosphatase homolog; InterPro: IPR012815 This family consists of members of the HAD superfamily, subfamily IIB. All members are closely related to mannosyl-3-phosphoglycerate phosphatase, the second enzyme in a two-step pathway for biosynthesis of mannosylglycerate, a compatible solute present in some thermophiles and in Dehalococcoides ethenogenes. However, members of this family are separable in a neighbour-joining tree constructed from a multiple sequence alignment and are found only in mesophiles that lack the companion mannosyl-3-phosphoglycerate synthase (IPR012812 from INTERPRO). Members of this family are likely to act on a compound related to yet distinct from mannosyl-3-phosphoglycerate.; GO: 0050531 mannosyl-3-phosphoglycerate phosphatase activity, 0005737 cytoplasm. Probab=23.93 E-value=48 Score=13.44 Aligned_cols=30 Identities=17% Similarity=0.246 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHHHCCCCEEEEECCCCCCHH Q ss_conf 9999999999998619989999627763001 Q gi|254780402|r 68 ETMALIGLIPAIRSRHVNVLLTTMTATSAKV 98 (440) Q Consensus 68 E~~~a~pli~~l~~~~~~iliT~~T~tg~~~ 98 (440) ++..|.+.++.|.+.+-.|++ |||.|++|+ T Consensus 17 dw~PA~~~l~~L~e~~iPvI~-CtSKTAAEv 46 (224) T TIGR02463 17 DWQPAAPWLTRLQEAGIPVIL-CTSKTAAEV 46 (224) T ss_pred CHHHHHHHHHHHHHCCCCEEE-CCCCCHHHH T ss_conf 546779999999975897642-587307899 No 305 >TIGR02176 pyruv_ox_red pyruvate:ferredoxin (flavodoxin) oxidoreductase; InterPro: IPR011895 The oxidative decarboxylation of pyruvate to acetyl-CoA, a central step in energy metabolism, can occur by two different mechanisms . In mitochondria and aerobic bacteria this reaction is catalysed by the multienzyme complex pyruvate dehydrogenase using NAD as electron acceptor. In anaerobic organisms, however, this reaction is reversibly catalysed by a single enzyme using either ferrodoxin or flavodoxin as the electron acceptor. Pyruvate:ferrodoxin/flavodoxin reductases (PFORs) in this entry occur in both obligately and facultatively anaerobic bacteria and also some eukaryotic microorganisms. These proteins are single-chain enzymes containing a thiamin pyrophosphate cofactor for the cleavage of carbon-carbon bonds next to a carbonyl group, and iron-sulphur clusters for electron transfer. The Desulfovibrio africanus enzyme is currently the only PFOR whose three dimensional structure is known , . It is a homodimer where each subunit contains one thiamin pyrophosphate cofactor and two ferrodoxin-like 4Fe-S clusters and an atypical 4Fe-S cluster. Each monomer is composed of seven domains - domains I, II and VI make intersubunit contacts, while domains III, IV and V are located at the suface of the dimer, and domain VII forms a long arm extending over the other subunit. The cofactor is bound at the interface of domains I and VI and is proximal to the atypical 4Fe-S bound by domain VI, while the ferrodoxin-like 4Fe-S clusters are bound by domain V. Comparison of this enzyme with the multi-chain PFORs shows a correspondance between the domains in this enzyme and the subunits of the multi-chain enzymes.; GO: 0005506 iron ion binding, 0016903 oxidoreductase activity acting on the aldehyde or oxo group of donors, 0006118 electron transport. Probab=23.86 E-value=26 Score=15.14 Aligned_cols=33 Identities=15% Similarity=0.064 Sum_probs=28.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 899999999999999704556999999999752 Q gi|254780402|r 390 PTIRYEMINAAINEVKKMQGPLKITLRSLDSYV 422 (440) Q Consensus 390 ~~~~~~~~~~a~~~i~~~~ga~~~~~~~i~~~l 422 (440) +.....+|+++.++|++|..|.|++++.+.+.= T Consensus 591 ~~I~ksYGKKG~~vV~~N~~AvD~~v~~l~~v~ 623 (1194) T TIGR02176 591 KSIEKSYGKKGEEVVQKNIKAVDKAVESLHEVK 623 (1194) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHCCEECC T ss_conf 998850489777889877999977733245324 No 306 >PRK06263 sdhA succinate dehydrogenase flavoprotein subunit; Reviewed Probab=23.80 E-value=48 Score=13.43 Aligned_cols=36 Identities=17% Similarity=0.220 Sum_probs=24.1 Q ss_pred ECCCC--CCHHHHHHCCCEEEECCCCC---CCCCCHHHHHHHCC Q ss_conf 06765--20255422082066135544---45458376896199 Q gi|254780402|r 308 GDTIG--EMGFYLRMTEIAFIGRSFCA---SGGQNPLEAAMLGC 346 (440) Q Consensus 308 ~Dt~G--eL~~lY~~AdiafVGGSl~~---~GGhN~lEpaa~G~ 346 (440) +|+-| -+..||+.++++ || +.. -||-..+|.+.+|. T Consensus 353 id~~~~t~I~GLyAaGE~a--gG-~hGanRlggnsl~e~~VfG~ 393 (539) T PRK06263 353 INEDCETNIPGLFACGEVA--GG-VHGANRLGGNALADTQVFGA 393 (539) T ss_pred ECCCCCCCCCCEEECCCCC--CC-CCCCCCCCHHHHHHHHHHHH T ss_conf 9988760578737756645--47-68898860247788899999 No 307 >cd01141 TroA_d Periplasmic binding protein TroA_d. These proteins are predicted to function as initial receptors in the ABC metal ion uptake in eubacteria and archaea. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind their ligands in the cleft between these domains. Probab=23.79 E-value=48 Score=13.43 Aligned_cols=34 Identities=29% Similarity=0.266 Sum_probs=17.7 Q ss_pred HHCCEEEEEECCCCHHHHHHHHHHCCCCEEEEEE Q ss_conf 4202035640476214678887422455058641 Q gi|254780402|r 126 YWKPDCMILSESDIWPLTVFELSKQRIPQVLVNA 159 (440) Q Consensus 126 ~~~P~~~i~~e~ElWPnli~~~~~~~ip~~lina 159 (440) ..+||+++.....--...+..+.+.|||++.++. T Consensus 67 aL~PDlVi~~~~~~~~~~~~~L~~~gI~v~~~~~ 100 (186) T cd01141 67 ALKPDLVILYGGFQAQTILDKLEQLGIPVLYVNE 100 (186) T ss_pred HHCCCEEEEECCCCCHHHHHHHHHCCCCEEEEEC T ss_conf 7099999995887867899999964995799606 No 308 >TIGR00670 asp_carb_tr aspartate carbamoyltransferase; InterPro: IPR002082 Aspartate carbamoyltransferase (ATCase) catalyses the formation of carbamoyl-aspartate in the pyrimidine biosynthesis pathway, by the association of aspartate and carbamoyl-phosphate. This is the commitment step in the Escherichia coli pathway and is regulated by feedback inhibition by CTP, the final product of the pathway. The structural organisation of the ATCase protein varies considerably between different organisms. In bacteria such as E. coli, Salmonella typhimurium and Serratia marcescens, the ATCase is a dodecamer of 2 catalytic (c) trimers and 3 regulatory (r) dimers. The catalytic domains are coded for by the pyrB gene, and the regulatory domains by pyrI. In Gram-positive bacteria such as Bacillus subtilis, ATCase exists as a trimer of catalytic subunits, but unlike in E. coli, it neither contains nor binds to regulatory subunits. In eukaryotes, ATCase is found as a single domain in a multifunctional enzyme that contains activity for glutamine amidotransferase, carbamoylphosphate synthetase, dihydroorotase, and aspartate carbamoyltransferase. ; GO: 0004070 aspartate carbamoyltransferase activity, 0006207 'de novo' pyrimidine base biosynthetic process. Probab=23.73 E-value=48 Score=13.42 Aligned_cols=262 Identities=18% Similarity=0.170 Sum_probs=146.9 Q ss_pred CCHHHHHHHHHHHHHHHHC----C-------CCEEE--------EECCCCCCHHHHCCCCC-CEEEEECCCCCCHHHH-- Q ss_conf 7899999999999999861----9-------98999--------96277630012101477-4168850557201344-- Q gi|254780402|r 64 SSVGETMALIGLIPAIRSR----H-------VNVLL--------TTMTATSAKVARKYLGQ-YAIHQYAPLDIQPAVS-- 121 (440) Q Consensus 64 aSvGE~~~a~pli~~l~~~----~-------~~ili--------T~~T~tg~~~~~~~~~~-~~~~~y~P~D~~~~~~-- 121 (440) =|.+|.+.+..-+++|++. . -+|+- ||=|-.++|.|.|.++= +..+-+.+-....+.| T Consensus 9 l~~Eei~~lL~~A~~l~~~~~~~~~~~~~L~gKi~a~YilFFEPSTRTR~SFE~A~KRLGgPn~v~~~~~~~~sS~~KGE 88 (336) T TIGR00670 9 LSREEIELLLETAEELEQVASGEKKLKEALKGKIVANYILFFEPSTRTRLSFETAMKRLGGPNDVVNFSDSETSSVAKGE 88 (336) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCCCHHHHCCCEEEEEEEEECCCCCCHHHHHHHHHHHCCCCCEEECCCCCCCCHHCCC T ss_conf 78556999999999998631576621586639779999998607855013588988872798545622556341121054 Q ss_pred ------HHHHHHC-CEEEEE-ECC----CCHHHHHHHHH-HCCCCEE-EEEECCCCCCCCCCHHHHHHHHHCCCCCCEEE Q ss_conf ------5574420-203564-047----62146788874-2245505-86412665421210011112331026442366 Q gi|254780402|r 122 ------RFLKYWK-PDCMIL-SES----DIWPLTVFELS-KQRIPQV-LVNARMSRRSFKNWKTVLSFSKKIFSQFSLVI 187 (440) Q Consensus 122 ------~fl~~~~-P~~~i~-~e~----ElWPnli~~~~-~~~ip~~-linarls~~S~~~~~~~~~l~~~~~~~~~~i~ 187 (440) +.+..+. +|++|+ =.. |--|-++++-. +.+|-+= +|||== .+- -.=++.+|.-++..- T Consensus 89 tL~DTi~~~~~y~D~D~iViRH~~~GvCegaar~~ae~~~R~~~~vPsviNAGD--G~~------qHPTQ~LLDLyTi~~ 160 (336) T TIGR00670 89 TLADTIKTLSAYGDSDAIVIRHPLEGVCEGAARLAAEVSDRLGIEVPSVINAGD--GSG------QHPTQTLLDLYTIYE 160 (336) T ss_pred CHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHHHHHHHCCCCCCEEEECCC--CCC------CCCCHHHHHHHHHHH T ss_conf 478899988540475569983586541346889998863110457871663687--878------797534788999998 Q ss_pred ECCHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 41577888998608850587204310123442212568999986179861750465533210000000000123412488 Q gi|254780402|r 188 VQSERYFRRYKELGAQKLIVSGNLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTI 267 (440) Q Consensus 188 ~qs~~~~~rl~~lG~~~i~v~GnlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lI 267 (440) +-.-|---+..+.--+|-+.||+||.++... ....+ ..+++..| .. T Consensus 161 -~fGPDnP~l~~~~Gl~iA~vGDlkygRtvhS----~~~~L-~~f~~~~v----------------------------~l 206 (336) T TIGR00670 161 -EFGPDNPALERLDGLKIALVGDLKYGRTVHS----LIKAL-ALFGNAEV----------------------------YL 206 (336) T ss_pred -HHCCCCCCHHHCCCCEEEEEEECCCCHHHHH----HHHHH-HHHCCCEE----------------------------EE T ss_conf -7188863032228857999843120247667----88998-65189569----------------------------98 Q ss_pred EEECCCCHHHHHHHHHHHHCCHHHHHCCCCCCCCCCEEE---EE-----CCCC--CCHHHHHHCCCEEEECCCCCCCCCC Q ss_conf 970552114788877653001011001344444664167---50-----6765--2025542208206613554445458 Q gi|254780402|r 268 IVPRHPRRCDAIERRLIAKGLKVARRSRGDVINAEVDIF---LG-----DTIG--EMGFYLRMTEIAFIGRSFCASGGQN 337 (440) Q Consensus 268 IvPRhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~~~~V~---i~-----Dt~G--eL~~lY~~AdiafVGGSl~~~GGhN 337 (440) ++|-..-==+++.+.+++.|..+...+..++.-.+.||+ -+ ..++ .+..+= =+.|++.=+ +. T Consensus 207 ~sP~~LrmP~~~~e~~~~~G~~~~~~~~l~e~~~~~DVlkiWY~tRiQkER~~~~d~~ey~------~~~~~Y~i~--~~ 278 (336) T TIGR00670 207 ISPEELRMPKEILEDLKAKGVKVRETESLEEVIDEADVLKIWYVTRIQKERFPEDDPEEYE------KVRGSYGIT--AE 278 (336) T ss_pred ECCHHHHCHHHHHHHHHHCCCEEEEEEEHHHHHCCCCEEEEEEEECCCCCCCCCCCHHHHH------HHHCEEEEE--HH T ss_conf 6772430617889998528962899522434421578478864200111237887889998------410014662--88 Q ss_pred HHHHHHHC--CCEEE-CC--CCCCHHHHHHHHHHCCCEE-EECC Q ss_conf 37689619--94999-89--8443799999999789869-9579 Q gi|254780402|r 338 PLEAAMLG--CAILS-GP--NVENFRDIYRRMVSSGAVR-IVEE 375 (440) Q Consensus 338 ~lEpaa~G--~pVi~-GP--~~~nf~e~~~~L~~~g~~~-~v~d 375 (440) .+|.+.-| .-|+| || .+.-....+.+...+.... |+.+ T Consensus 279 ~L~~~~~~~e~ivlHPlPv~RvdEI~~~vd~~~~~~~f~~Q~~n 322 (336) T TIGR00670 279 RLEAAKKGKEVIVLHPLPVHRVDEIDPEVDDTPHAKYFKDQAFN 322 (336) T ss_pred HHHHCCCCCCEEEECCCCCCCCCCCCCCCCCCHHHHHHHHHHHC T ss_conf 98630689936884788978856677432366446788887515 No 309 >cd06302 PBP1_LsrB_Quorum_Sensing Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs. Periplasmic binding domain of autoinducer-2 (AI-2) receptor LsrB from Salmonella typhimurium and its close homologs from other bacteria. The members of this group are homologous to a family of periplasmic pentose/hexose sugar-binding proteins that function as the primary receptors for chemotaxis and transporters of many sugar based solutes in bacteria and archaea and that are a member of the type I periplasmic binding protein superfamily. LsrB binds a chemically distinct form of the AI-2 signal that lacks boron, in contrast to the Vibrio harveyi AI-2 signaling molecule that has an unusual furanosyl borate diester. Hence, many bacteria coordinate their gene expression according to the local density of their population by producing species specific AI-2. This process of quorum sensing allows LsrB to function as a periplasmic AI-2 binding p Probab=23.56 E-value=49 Score=13.40 Aligned_cols=151 Identities=11% Similarity=0.109 Sum_probs=66.8 Q ss_pred HHHHHHHHHCCEEEEEEC--CCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHH Q ss_conf 344557442020356404--762146788874224550586412665421210011112331026442366415778889 Q gi|254780402|r 119 AVSRFLKYWKPDCMILSE--SDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRR 196 (440) Q Consensus 119 ~~~~fl~~~~P~~~i~~e--~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~r 196 (440) .++.++.. +|+.+++.- .+-.-..+.+++..|||++.+|....+.+..-| ....|.. ......++. T Consensus 48 ~ie~~Ia~-gvDaIii~p~d~~a~~~~i~~A~~aGIpVv~~Ds~~~~~~~~~~----------v~~~dn~-~~G~~~a~~ 115 (298) T cd06302 48 IIEDLIAQ-GVDAIAVVPNDPDALEPVLKKAREAGIKVVTHDSDVQPDNRDYD----------IEQADNK-AIGETLMDS 115 (298) T ss_pred HHHHHHHC-CCCEEEEECCCHHHHHHHHHHHHHCCCEEEEECCCCCCCCEEEE----------EECCCHH-HHHHHHHHH T ss_conf 99999973-99889990687788899999998679869982687887780599----------9717889-999999999 Q ss_pred H-HHHCCC-CEE-EECCCCCCCCCCCCCHHHHHHHHHHCC-C--CCEEECC---CCCCCCCCCCCCCCCCCCCCCCEEEE Q ss_conf 9-860885-058-720431012344221256899998617-9--8617504---65533210000000000123412488 Q gi|254780402|r 197 Y-KELGAQ-KLI-VSGNLKIDTESLPCDKELLSLYQESIA-G--RYTWAAI---STFEGEEDKAVYVHNFIKCRTDVLTI 267 (440) Q Consensus 197 l-~~lG~~-~i~-v~GnlK~d~~~~~~~~~~~~~~~~~~~-~--r~v~vag---Sth~~EE~~il~a~~~l~~~~~~~lI 267 (440) + ..+|-+ ++. +.|+.-.... .....-+...+. + ..+-+.+ +-|..++. .-.+...|.++|++..| T Consensus 116 l~~~~g~~g~v~i~~g~~~~~~~-----~~R~~g~~~~l~~~~~~~~~vv~~~~~~~~~~~a-~~~~~~~L~~~Pdi~~I 189 (298) T cd06302 116 LAEQMGGKGEYAIFVGSLTATNQ-----NAWIDAAKAYQKEKYYPMLELVDRQYGDDDADKS-YQTAQELLKAYPDLKGI 189 (298) T ss_pred HHHHHCCCCCEEEEECCCCCHHH-----HHHHHHHHHHHHHCCCCCEEEEEEECCCCCHHHH-HHHHHHHHHHCCCCCEE T ss_conf 99971379958999858898679-----9999999999863489987999861378748999-99999999759991299 Q ss_pred EEECCCCHHHHHHHHHHHHCC Q ss_conf 970552114788877653001 Q gi|254780402|r 268 IVPRHPRRCDAIERRLIAKGL 288 (440) Q Consensus 268 IvPRhpeR~~~i~~~l~~~gl 288 (440) +..- ..-.....+-+++.|+ T Consensus 190 ~~~n-d~~a~Ga~~Al~~aG~ 209 (298) T cd06302 190 IGPT-SVGIPGAARAVEEAGL 209 (298) T ss_pred EECC-CHHHHHHHHHHHHCCC T ss_conf 9789-8169999999996799 No 310 >PRK12388 fructose-1,6-bisphosphatase II-like protein; Reviewed Probab=23.38 E-value=49 Score=13.37 Aligned_cols=33 Identities=27% Similarity=0.355 Sum_probs=19.1 Q ss_pred CCEEEEEE--ECCCCHHHHHHHHHHHHCCHHHHHCCCCC Q ss_conf 41248897--05521147888776530010110013444 Q gi|254780402|r 262 TDVLTIIV--PRHPRRCDAIERRLIAKGLKVARRSRGDV 298 (440) Q Consensus 262 ~~~~lIIv--PRhpeR~~~i~~~l~~~gl~~~~~S~~~~ 298 (440) .++..+|. ||| +++.+.+++.|-...+.++++. T Consensus 154 ~dltV~vLDRpRH----~~lI~eiR~~Gari~LI~DGDV 188 (321) T PRK12388 154 DKLRMVTLDKPRL----SAAIEEATQLGVKVFALPDGDV 188 (321) T ss_pred HHEEEEEEECCHH----HHHHHHHHHCCCCEEEECCCCH T ss_conf 8949999826508----9999999975993799656669 No 311 >cd06284 PBP1_LacI_like_6 Ligand-binding domain of an uncharacterized transcription regulator from Actinobacillus succinogenes and its close homologs from other bacteria. This group includes the ligand-binding domain of an uncharacterized transcription regulator from Actinobacillus succinogenes and its close homologs from other bacteria. This group belongs to the the LacI-GalR family repressors and are composed of two functional domains: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal ligand-binding domain, which is homologous to the sugar-binding domain of ABC-type transport systems that contain the type I periplasmic binding protein-like fold. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcription repressor undergoes a conformational change upon ligand binding which in turn changes the DNA binding. Probab=23.27 E-value=49 Score=13.36 Aligned_cols=104 Identities=17% Similarity=0.213 Sum_probs=62.3 Q ss_pred HHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHHH Q ss_conf 99999999998619989999627763001210147741688505572013445574420203564047621467888742 Q gi|254780402|r 70 MALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELSK 149 (440) Q Consensus 70 ~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~~ 149 (440) ..+..+.+.+.+.+.++++.++... .+.-.+ .+ +.+...+++.+|++.++..+..+....+ T Consensus 16 ~~~~gi~~~~~~~gy~~~i~~s~~~-~~~e~~-----------------~i-~~l~~~~vdGiIi~~~~~~~~~~~~~~~ 76 (267) T cd06284 16 EILKGIEDEAREAGYGVLLGDTRSD-PEREQE-----------------YL-DLLRRKQADGIILLDGSLPPTALTALAK 76 (267) T ss_pred HHHHHHHHHHHHCCCEEEEEECCCC-HHHHHH-----------------HH-HHHHHCCCCEEEECCCCCCHHHHHHHHC T ss_conf 9999999999986998999978999-899999-----------------99-9998479985998278899999999847 Q ss_pred CCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHHHHHHHCCCCEEEECC Q ss_conf 2455058641266542121001111233102644236641----577888998608850587204 Q gi|254780402|r 150 QRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFRRYKELGAQKLIVSGN 210 (440) Q Consensus 150 ~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~rl~~lG~~~i~v~Gn 210 (440) ++|+++++-..++.. ++.|..- .....++|.+.|-+++.+.+. T Consensus 77 -~~PvV~i~~~~~~~~-----------------~~~V~~Dn~~a~~~~~~~L~~~G~~~i~~i~~ 123 (267) T cd06284 77 -LPPIVQACEYIPGLA-----------------VPSVSIDNVAAARLAVDHLISLGHRRIALITG 123 (267) T ss_pred -CCCEEEECCCCCCCC-----------------CCEEEECHHHHHHHHHHHHHHHCCCEEEEECC T ss_conf -999999867788899-----------------98899678999999999999709976999628 No 312 >PTZ00141 elongation factor 1 alpha; Provisional Probab=23.17 E-value=49 Score=13.35 Aligned_cols=32 Identities=13% Similarity=0.173 Sum_probs=25.0 Q ss_pred EEEECC----CCHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 564047----62146788874224550586412665 Q gi|254780402|r 132 MILSES----DIWPLTVFELSKQRIPQVLVNARMSR 163 (440) Q Consensus 132 ~i~~e~----ElWPnli~~~~~~~ip~~linarls~ 163 (440) +.++.. ++.|||+.-+....+-+++|+|+-.. T Consensus 87 ~~iiD~PGH~~fi~nmi~Gas~aD~ailvVdA~~G~ 122 (443) T PTZ00141 87 YTVIDAPGHRDFIKNMITGTSQADVALLVVPAEVGG 122 (443) T ss_pred EEEEECCCHHHHHHHHHHHHCCCCEEEEEEECCCCC T ss_conf 999989972888999996341077589999867785 No 313 >PRK13631 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=22.86 E-value=50 Score=13.31 Aligned_cols=75 Identities=11% Similarity=0.225 Sum_probs=49.5 Q ss_pred EEEEECCCCCCHHHHHHCCCEEE--ECCCCCCC-------CCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEE-- Q ss_conf 16750676520255422082066--13554445-------45837689619949998984437999999997898699-- Q gi|254780402|r 304 DIFLGDTIGEMGFYLRMTEIAFI--GRSFCASG-------GQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRI-- 372 (440) Q Consensus 304 ~V~i~Dt~GeL~~lY~~AdiafV--GGSl~~~G-------GhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~-- 372 (440) .|+++ --.|...+.+||-++| -|..+..| -+..++.+....|-+ .+.+..|.+.|.-+. T Consensus 228 Tiilv--THdm~~v~~~aDrviVm~~GkIv~~Gtp~eiF~~~~~l~~~~l~~P~~--------~~l~~~L~~~g~~~~~~ 297 (320) T PRK13631 228 TVFVI--THTMEHVLEVADEVIVMDKGKILKTGTPYEIFTDQHIINSTSIQVPRV--------IQVINDLIKKDPKYKKL 297 (320) T ss_pred EEEEE--CCCHHHHHHHCCEEEEEECCEEEEECCHHHHHCCHHHHHHCCCCCCCH--------HHHHHHHHHCCCCCHHH T ss_conf 99999--478999999799999998988999758899865999999779999929--------99999999739983330 Q ss_pred ----ECCHHHHHHHHHHHHC Q ss_conf ----5798999999999958 Q gi|254780402|r 373 ----VEEVGTLADMVYSLLS 388 (440) Q Consensus 373 ----v~d~~eL~~~l~~ll~ 388 (440) ..+.++|++++.+.+. T Consensus 298 ~~~~~~~~~~l~~~~~~~~~ 317 (320) T PRK13631 298 YQKQPRTIEQLADAINEFIK 317 (320) T ss_pred CCCCCCCHHHHHHHHHHHHC T ss_conf 47886469999999999862 No 314 >PRK13645 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=22.82 E-value=50 Score=13.30 Aligned_cols=68 Identities=15% Similarity=0.231 Sum_probs=48.0 Q ss_pred CCHHHHHHCCCEEE--ECCCCCCC-------CCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCE---EEECCHHHHH Q ss_conf 20255422082066--13554445-------458376896199499989844379999999978986---9957989999 Q gi|254780402|r 313 EMGFYLRMTEIAFI--GRSFCASG-------GQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAV---RIVEEVGTLA 380 (440) Q Consensus 313 eL~~lY~~AdiafV--GGSl~~~G-------GhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~---~~v~d~~eL~ 380 (440) +|...+.+||-++| .|..+..| -+..++.+....|.+ ...+..|.+.|.- ..+.+.+|++ T Consensus 210 dm~~v~~~aDrviVm~~G~iv~~G~p~evf~~~~~l~~~~l~~P~~--------~~l~~~L~~~g~~~~~~~~~t~ee~~ 281 (289) T PRK13645 210 NMDQVLRIADEVIVMHEGKVISIGSPFEIFSNQELLTKIEIDPPKL--------YQLMYKLKNKGIDLLNKNIRTIEEFA 281 (289) T ss_pred CHHHHHHHCCEEEEEECCEEEEECCHHHHHCCHHHHHHCCCCCCHH--------HHHHHHHHHCCCCCCCCCCCCHHHHH T ss_conf 9999999799999998998999878899867999999779999859--------99999999769988899979999999 Q ss_pred HHHHHHHC Q ss_conf 99999958 Q gi|254780402|r 381 DMVYSLLS 388 (440) Q Consensus 381 ~~l~~ll~ 388 (440) +.+.+++. T Consensus 282 ~~l~~~~~ 289 (289) T PRK13645 282 KELAKVLK 289 (289) T ss_pred HHHHHHHC T ss_conf 99999759 No 315 >PRK07804 L-aspartate oxidase; Provisional Probab=22.66 E-value=51 Score=13.28 Aligned_cols=38 Identities=21% Similarity=0.217 Sum_probs=22.8 Q ss_pred EECCCC--CCHHHHHHCCCEEEECCCC--C-CCCCCHHHHHHHCC Q ss_conf 506765--2025542208206613554--4-45458376896199 Q gi|254780402|r 307 LGDTIG--EMGFYLRMTEIAFIGRSFC--A-SGGQNPLEAAMLGC 346 (440) Q Consensus 307 i~Dt~G--eL~~lY~~AdiafVGGSl~--~-~GGhN~lEpaa~G~ 346 (440) .+|.-| .+..||+..+++-. ++. + .||-..+|.+.+|. T Consensus 352 ~vD~~~~T~IpGLyAaGEva~~--G~hGAnRLggnsl~e~~VfG~ 394 (533) T PRK07804 352 VTDVYGRTEVPGLYAAGEVART--GVHGANRLASNSLLEGLVVGG 394 (533) T ss_pred EECCCCCCCCCCCEECCCCCCC--CCCCCHHHHHHHHHHHHHHHH T ss_conf 6788886087784625775547--776523477778999999999 No 316 >cd06307 PBP1_uncharacterized_sugar_binding Periplasmic sugar-binding domain of uncharacterized transport systems. Periplasmic sugar-binding domain of uncharacterized transport systems that share homology with a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. The members of this group are predicted to be involved in the transport of sugar-containing molecules across cellular and organellar membranes. Probab=22.65 E-value=51 Score=13.28 Aligned_cols=36 Identities=19% Similarity=0.257 Sum_probs=25.1 Q ss_pred CCEEEEEECCC--CHHHHHHHHHHCCCCEEEEEECCCC Q ss_conf 02035640476--2146788874224550586412665 Q gi|254780402|r 128 KPDCMILSESD--IWPLTVFELSKQRIPQVLVNARMSR 163 (440) Q Consensus 128 ~P~~~i~~e~E--lWPnli~~~~~~~ip~~linarls~ 163 (440) +++.++++-.+ -.--.+.++..+|||++.++..+.+ T Consensus 58 ~vDgiii~~~d~~~~~~~i~~a~~~GIPVV~~~~~~~~ 95 (275) T cd06307 58 RSDGVALVAPDHPQVRAAVARLAAAGVPVVTLVSDLPG 95 (275) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCEEEEECCCCCC T ss_conf 89889998998177899999999879849997288888 No 317 >cd00739 DHPS DHPS subgroup of Pterin binding enzymes. DHPS (dihydropteroate synthase), a functional homodimer, catalyzes the condensation of p-aminobenzoic acid (pABA) in the de novo biosynthesis of folate, which is an essential cofactor in both nucleic acid and protein biosynthesis. Prokaryotes (and some lower eukaryotes) must synthesize folate de novo, while higher eukaryotes are able to utilize dietary folate and therefore lack DHPS. Sulfonamide drugs, which are substrate analogs of pABA, target DHPS. Probab=22.59 E-value=51 Score=13.28 Aligned_cols=146 Identities=18% Similarity=0.186 Sum_probs=67.4 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEEE--CCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHH Q ss_conf 789999999999999986199899996--277630012101477416885055720134455744202035640476214 Q gi|254780402|r 64 SSVGETMALIGLIPAIRSRHVNVLLTT--MTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWP 141 (440) Q Consensus 64 aSvGE~~~a~pli~~l~~~~~~iliT~--~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWP 141 (440) ..--|..-+.|+++.++++. ++++|. +.+.=++.+-+...+-+ -|... ...=| T Consensus 56 s~~eE~~Rl~pvi~~l~~~~-~~~iSIDT~~~~Va~~al~~Ga~iI------NDisg------------------~~~d~ 110 (257) T cd00739 56 SVEEELERVIPVLEALRGEL-DVLISVDTFRAEVARAALEAGADII------NDVSG------------------GSDDP 110 (257) T ss_pred CHHHHHHHHHHHHHHHHHCC-CCCEEEECCCHHHHHHHHHCCCCEE------ECCCC------------------CCCCH T ss_conf 98888988999999998607-9828997997599999998499899------75342------------------44777 Q ss_pred HHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCC--CEEEECCCCCCCCCCC Q ss_conf 67888742245505864126654212100111123310264423664157788899860885--0587204310123442 Q gi|254780402|r 142 LTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQ--KLIVSGNLKIDTESLP 219 (440) Q Consensus 142 nli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~--~i~v~GnlK~d~~~~~ 219 (440) +|+..+++.++|+++--.+-.++....... +..++.. |..-=++..+++.+.|++ +|.+..-+-|.-. .. T Consensus 111 ~m~~~va~~~~~~ilmH~~g~p~~m~~~~~----~~dv~~~---i~~~f~~~i~~l~~~Gi~~~~IiiDPG~GFgKt-~~ 182 (257) T cd00739 111 AMLEVAAEYGAPLVLMHMRGTPKTMQENPY----YEDVVDE---VLSFLEARLEAAESAGVARNRIILDPGIGFGKT-PE 182 (257) T ss_pred HHHHHHHHHCCCEEEECCCCCCCCCCCCCC----CCHHHHH---HHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCC-HH T ss_conf 899999984999999768998422233786----3309999---999999999999987998251997088787888-79 Q ss_pred CCHHHHHHHHHHCC-CCCEEECCC Q ss_conf 21256899998617-986175046 Q gi|254780402|r 220 CDKELLSLYQESIA-GRYTWAAIS 242 (440) Q Consensus 220 ~~~~~~~~~~~~~~-~r~v~vagS 242 (440) .+-+..+.+..+.. +.|+++..| T Consensus 183 ~n~~ll~~l~~f~~~~~PiLvG~S 206 (257) T cd00739 183 HNLELLRRLDELKQLGLPVLVGAS 206 (257) T ss_pred HHHHHHHHHHHHHHCCCCEEEECC T ss_conf 999999989999528998277502 No 318 >pfam00809 Pterin_bind Pterin binding enzyme. This family includes a variety of pterin binding enzymes that all adopt a TIM barrel fold. The family includes dihydropteroate synthase EC:2.5.1.15 as well as a group methyltransferase enzymes including methyltetrahydrofolate, corrinoid iron-sulfur protein methyltransferase (MeTr) that catalyses a key step in the Wood-Ljungdahl pathway of carbon dioxide fixation. It transfers the N5-methyl group from methyltetrahydrofolate (CH3-H4folate) to a cob(I)amide centre in another protein, the corrinoid iron-sulfur protein. MeTr is a member of a family of proteins that includes methionine synthase and methanogenic enzymes that activate the methyl group of methyltetra-hydromethano(or -sarcino)pterin. Probab=22.58 E-value=51 Score=13.27 Aligned_cols=146 Identities=16% Similarity=0.136 Sum_probs=73.9 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCEEEE--ECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHH Q ss_conf 78999999999999998619989999--6277630012101477416885055720134455744202035640476214 Q gi|254780402|r 64 SSVGETMALIGLIPAIRSRHVNVLLT--TMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWP 141 (440) Q Consensus 64 aSvGE~~~a~pli~~l~~~~~~iliT--~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWP 141 (440) ...-|..-+.|+++.+++. ++.|| |+.+.-++.+-+...+.+ -|... .+.=| T Consensus 51 ~~~eE~~Rl~pvl~~l~~~--~~~iSIDT~~~~v~~~al~~G~~iI------NDvsg------------------~~~d~ 104 (208) T pfam00809 51 SGEEELERLVPVLEALRDQ--DVPISIDTFNAEVAEAALKAGADII------NDSSG------------------GKDDP 104 (208) T ss_pred CHHHHHHHHHHHHHHHHCC--CCCEEEECCCHHHHHHHHHCCCCEE------EECHH------------------HHCCH T ss_conf 8899999999999986357--9828997984999999998199389------87012------------------10377 Q ss_pred HHHHHHHHCCCCEEEEEECCCCCCCCCCHHHH-HHHHHCCCCCCEEEECCHHHHHHHHHHCC--CCEEEECCCCCCCCCC Q ss_conf 67888742245505864126654212100111-12331026442366415778889986088--5058720431012344 Q gi|254780402|r 142 LTVFELSKQRIPQVLVNARMSRRSFKNWKTVL-SFSKKIFSQFSLVIVQSERYFRRYKELGA--QKLIVSGNLKIDTESL 218 (440) Q Consensus 142 nli~~~~~~~ip~~linarls~~S~~~~~~~~-~l~~~~~~~~~~i~~qs~~~~~rl~~lG~--~~i~v~GnlK~d~~~~ 218 (440) +++..+++.++|+++.-.+-.+..+....... .....+... =++..+++.+.|+ ++|.+..-+-|.. .. T Consensus 105 ~~~~~~a~~~~~~vlmh~~~~p~~~~~~~~~~~dv~~~i~~~-------~~~~i~~~~~~Gi~~~~IiiDPGiGF~k-~~ 176 (208) T pfam00809 105 EMAPLAAEYGAPVVLMHMDGNPQGMQENPEYRVDVVEELLRF-------LTERLEAAEAAGVPPERIILDPGIGFGK-TE 176 (208) T ss_pred HHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCHHHHHHHHH-------HHHHHHHHHHCCCCHHHEEECCCCCCCC-CH T ss_conf 999999973999998668999865332676530699999999-------9999999998798977799718878889-86 Q ss_pred CCCHHHHHHHHHH--CCCCCEEECCCC Q ss_conf 2212568999986--179861750465 Q gi|254780402|r 219 PCDKELLSLYQES--IAGRYTWAAIST 243 (440) Q Consensus 219 ~~~~~~~~~~~~~--~~~r~v~vagSt 243 (440) ..+-+..+.+... +.+.|+++.+|. T Consensus 177 ~~n~~ll~~l~~~~~~~~~Pilvg~SR 203 (208) T pfam00809 177 EHNLELLRTLDALRELRGYPLLVGGSR 203 (208) T ss_pred HHHHHHHHHHHHHHHCCCCEEEEEECC T ss_conf 778999986999972489828998246 No 319 >cd06325 PBP1_ABC_uncharacterized_transporter Type I periplasmic ligand-binding domain of uncharacterized ABC-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This group includes the type I periplasmic ligand-binding domain of uncharacterized ABC (ATPase Binding Cassette)-type transport systems that are predicted to be involved in the uptake of amino acids, peptides, or inorganic ions. This subgroup has high sequence similarity to members of the family of hydrophobic amino acid transporters (HAAT), such as leucine/isoleucine/valine binding protein (LIVBP); its ligand specificity has not been determined experimentally. Probab=22.47 E-value=51 Score=13.26 Aligned_cols=34 Identities=12% Similarity=0.198 Sum_probs=26.5 Q ss_pred HHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHH Q ss_conf 9999999999999986199899996277630012 Q gi|254780402|r 66 VGETMALIGLIPAIRSRHVNVLLTTMTATSAKVA 99 (440) Q Consensus 66 vGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~ 99 (440) -|+...+..+++++..+++++++|..||...... T Consensus 43 ~gd~~~l~~ia~~l~~~~~DlIva~gTpaaqa~~ 76 (281) T cd06325 43 QGDQSNLPTIARKFVADKPDLIVAIATPAAQAAA 76 (281) T ss_pred CCCHHHHHHHHHHHHHCCCCEEEECCHHHHHHHH T ss_conf 9999999999999985499999987709999999 No 320 >PRK09479 glpX fructose 1,6-bisphosphatase II; Reviewed Probab=22.23 E-value=52 Score=13.23 Aligned_cols=33 Identities=30% Similarity=0.563 Sum_probs=18.1 Q ss_pred CCEEEEEE--ECCCCHHHHHHHHHHHHCCHHHHHCCCCC Q ss_conf 41248897--05521147888776530010110013444 Q gi|254780402|r 262 TDVLTIIV--PRHPRRCDAIERRLIAKGLKVARRSRGDV 298 (440) Q Consensus 262 ~~~~lIIv--PRhpeR~~~i~~~l~~~gl~~~~~S~~~~ 298 (440) .++..+|. ||| +++.+.+++.|-...+.++++. T Consensus 155 ~dltV~vLdRpRH----~~lI~eir~~Gari~LI~DGDV 189 (320) T PRK09479 155 SDLTVVVLDRPRH----AELIAEIREAGARVKLISDGDV 189 (320) T ss_pred HHCEEEEEECCHH----HHHHHHHHHCCCEEEEECCCCH T ss_conf 6828999866267----9999999973984999656789 No 321 >pfam03614 Flag1_repress Repressor of phase-1 flagellin. Probab=22.19 E-value=23 Score=15.59 Aligned_cols=51 Identities=20% Similarity=0.273 Sum_probs=27.5 Q ss_pred ECCCCHHHHHH-------HHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHH Q ss_conf 04762146788-------874224550586412665421210011112331026442366415778 Q gi|254780402|r 135 SESDIWPLTVF-------ELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERY 193 (440) Q Consensus 135 ~e~ElWPnli~-------~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~ 193 (440) .|.|+||-=.. -.+.+.||+-+++-. ..++-.| .....+.=..|+|.++.+ T Consensus 6 reaevwprdys~larrlqFlRfndiPvrlvs~n--gqvi~gY------i~kfn~~eN~iLasd~~k 63 (165) T pfam03614 6 REAEVWPRDYSNLARRLQFLRFNDIPVRLVSEN--GQVIIGY------IQKFNSSENKILASDPFK 63 (165) T ss_pred CCCCCCCCHHHHHHHHHHHHHHCCCCEEEEECC--CEEEEEE------EHHHCCCCCEEECCCCCC T ss_conf 611017616789999988863157636999428--8089866------001076346686379877 No 322 >PHA00527 hypothetical protein Probab=21.81 E-value=53 Score=13.17 Aligned_cols=37 Identities=16% Similarity=0.370 Sum_probs=32.0 Q ss_pred CCCHHHHHHHHHCCCCCCEEEECCHHHHHHH-HHHCCC Q ss_conf 2100111123310264423664157788899-860885 Q gi|254780402|r 167 KNWKTVLSFSKKIFSQFSLVIVQSERYFRRY-KELGAQ 203 (440) Q Consensus 167 ~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl-~~lG~~ 203 (440) -+|.++|+++-++|..-+.++++|...-+|+ ++||+. T Consensus 4 ~~~~~~Pr~~VPLF~S~~v~~CR~~~EW~~~~~~LGV~ 41 (129) T PHA00527 4 TKWPKLPRFFVPLFHSANVYLCRSKEEWDQACIHLGVG 41 (129) T ss_pred CCCCCCCHHHHEEECCCEEEEECCHHHHHHHHHHHCCC T ss_conf 54666532442003153189977828999999993988 No 323 >PRK07232 malic enzyme; Reviewed Probab=21.80 E-value=53 Score=13.17 Aligned_cols=69 Identities=10% Similarity=0.145 Sum_probs=40.8 Q ss_pred ECCCCCCHHH-HHHHHHHCCEEEEEE----CCCCHHHHHHHHHHCCCCEEEEEECC-CCCCCCCCHHHHHHHHHCCCC Q ss_conf 5055720134-455744202035640----47621467888742245505864126-654212100111123310264 Q gi|254780402|r 111 YAPLDIQPAV-SRFLKYWKPDCMILS----ESDIWPLTVFELSKQRIPQVLVNARM-SRRSFKNWKTVLSFSKKIFSQ 182 (440) Q Consensus 111 y~P~D~~~~~-~~fl~~~~P~~~i~~----e~ElWPnli~~~~~~~ip~~linarl-s~~S~~~~~~~~~l~~~~~~~ 182 (440) |+=+-.+..+ ...++...++-+||. .-||||....+++. -.++.-||- .+.--..-.-||.+||..+.. T Consensus 258 fig~S~~~~~~~~mv~~Ma~~pivfalANP~PEI~p~~a~~~r~---d~i~aTGRsD~pNQvNNvl~FP~ifRGaLdv 332 (753) T PRK07232 258 FLGLSAAGVLTPEMVKSMADNPIIFALANPDPEITPEEAKAVRP---DAIIATGRSDYPNQVNNVLCFPYIFRGALDV 332 (753) T ss_pred EEECCCCCCCCHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHC---CEEEECCCCCCCCCCCHHHCCCHHHHHHHHC T ss_conf 99747799789999986287877985689897769999997604---3798348889854111111240877668861 No 324 >pfam02601 Exonuc_VII_L Exonuclease VII, large subunit. This family consist of exonuclease VII, large subunit EC:3.1.11.6 This enzyme catalyses exonucleolytic cleavage in either 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. This exonuclease VII enzyme is composed of one large subunit and 4 small ones. Probab=21.71 E-value=53 Score=13.16 Aligned_cols=83 Identities=13% Similarity=0.174 Sum_probs=47.9 Q ss_pred HHHHHHHHHHHHHCCCC--EEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHH---HHHCCEEEEEEC-----CC Q ss_conf 99999999999861998--999962776300121014774168850557201344557---442020356404-----76 Q gi|254780402|r 69 TMALIGLIPAIRSRHVN--VLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFL---KYWKPDCMILSE-----SD 138 (440) Q Consensus 69 ~~~a~pli~~l~~~~~~--iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl---~~~~P~~~i~~e-----~E 138 (440) .-+...+++.+.+++|. +.+--.+..|...+.... ...+.+- ...+||++|+.- .| T Consensus 25 gAa~~Di~~~~~~r~p~~~i~l~p~~VQG~~a~~~I~--------------~ai~~~~~~~~~~~~Dviii~RGGGS~eD 90 (295) T pfam02601 25 GAAYQDFLRTARRRGPLVEIEIYPTLVQGDGAAESIV--------------SALERANERETALDYDVIVIIRGGGSKED 90 (295) T ss_pred CHHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHH--------------HHHHHHHHCCCCCCCCEEEEECCCCCHHH T ss_conf 0899999999998199967999473576504899999--------------99999984689899838999578688898 Q ss_pred CHH----HHHHHHHHCCCCEEEEEECCCCCC Q ss_conf 214----678887422455058641266542 Q gi|254780402|r 139 IWP----LTVFELSKQRIPQVLVNARMSRRS 165 (440) Q Consensus 139 lWP----nli~~~~~~~ip~~linarls~~S 165 (440) ||+ .+.+...+..||++..=|--.|.+ T Consensus 91 L~~FN~e~laraI~~~~iPvisaIGHEtD~T 121 (295) T pfam02601 91 LWVFNDEELARAIANSPIPVITGIGHETDTT 121 (295) T ss_pred HHHCCHHHHHHHHHHCCCCEEECCCCCCCCC T ss_conf 7455889999999838998780678899961 No 325 >cd00885 cinA Competence-damaged protein. CinA is the first gene in the competence- inducible (cin) operon and is thought to be specifically required at some stage in the process of transformation. This domain is closely related to a domain, found in a variety of proteins involved in biosynthesis of molybdopterin cofactor, where the domain is presumed to bind molybdopterin. Probab=21.68 E-value=53 Score=13.16 Aligned_cols=84 Identities=12% Similarity=0.144 Sum_probs=45.7 Q ss_pred CHHHHHHHHHHCCC--CEEEECCCCCCCCCCCCCHHHHHHHHHHC-CCCCEEECCCCCCCCCCCCCCCC-CCCCCCCCEE Q ss_conf 57788899860885--05872043101234422125689999861-79861750465533210000000-0001234124 Q gi|254780402|r 190 SERYFRRYKELGAQ--KLIVSGNLKIDTESLPCDKELLSLYQESI-AGRYTWAAISTFEGEEDKAVYVH-NFIKCRTDVL 265 (440) Q Consensus 190 s~~~~~rl~~lG~~--~i~v~GnlK~d~~~~~~~~~~~~~~~~~~-~~r~v~vagSth~~EE~~il~a~-~~l~~~~~~~ 265 (440) +..-++++.++|.+ ...+++|-. ......+.... ....|++.|..-|.++++-.++. +.+ +.. T Consensus 21 ~~~l~~~L~~~G~~v~~~~~v~D~~---------~~I~~al~~~~~~~d~vi~tGGlGPT~DDiT~eavA~a~----g~~ 87 (170) T cd00885 21 AAFLAKELAELGIEVYRVTVVGDDE---------DRIAEALRRASERADLVITTGGLGPTHDDLTREAVAKAF----GRP 87 (170) T ss_pred HHHHHHHHHHCCCEEEEEEECCCCH---------HHHHHHHHHHHHCCCEEEEECCCCCCCCCCHHHHHHHHH----CCC T ss_conf 9999999997798266899939889---------999999999974199999947877899853899999994----898 Q ss_pred EEEEECCCCHHHHHHHHHHHHCCH Q ss_conf 889705521147888776530010 Q gi|254780402|r 266 TIIVPRHPRRCDAIERRLIAKGLK 289 (440) Q Consensus 266 lIIvPRhpeR~~~i~~~l~~~gl~ 289 (440) |+. |++-.+.+.+...+.+.. T Consensus 88 l~~---~~~~~~~i~~~~~~~~~~ 108 (170) T cd00885 88 LVL---DEEALERIEARFARRGRE 108 (170) T ss_pred CEE---CHHHHHHHHHHHHHCCCC T ss_conf 278---999999999999861764 No 326 >PRK12837 3-ketosteroid-delta-1-dehydrogenase; Provisional Probab=21.68 E-value=42 Score=13.83 Aligned_cols=44 Identities=9% Similarity=-0.016 Sum_probs=26.2 Q ss_pred HHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCC Q ss_conf 99997898699579899999999995898999999999999997045 Q gi|254780402|r 362 RRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQ 408 (440) Q Consensus 362 ~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ 408 (440) +...+.|.....++-+||++.+. -|++..++..++--++++++. T Consensus 366 ~~~~~~g~~~~adTleeLA~~~g---i~~~~L~~Tv~~yN~~~~~G~ 409 (515) T PRK12837 366 AKYRAAGLWRTADTLEELAAKIG---VPADALTATVARFNGFAAAGV 409 (515) T ss_pred HHHHHCCCEEECCCHHHHHHHHC---CCHHHHHHHHHHHHHHHHCCC T ss_conf 87742581664598999999869---599999999999999985399 No 327 >PRK12415 fructose 1,6-bisphosphatase II; Reviewed Probab=21.67 E-value=53 Score=13.16 Aligned_cols=24 Identities=42% Similarity=0.537 Sum_probs=12.2 Q ss_pred ECCCCHHHHHHHHHHHHCCHHHHHCCCC Q ss_conf 0552114788877653001011001344 Q gi|254780402|r 270 PRHPRRCDAIERRLIAKGLKVARRSRGD 297 (440) Q Consensus 270 PRhpeR~~~i~~~l~~~gl~~~~~S~~~ 297 (440) ||| +++.+.+++.|-...+.++++ T Consensus 165 pRH----~~lI~eir~~Gari~Li~DGD 188 (323) T PRK12415 165 ERH----QDIIDRVRAKGARVKLFGDGD 188 (323) T ss_pred CHH----HHHHHHHHHCCCEEEEECCCC T ss_conf 457----999999998698399966677 No 328 >PRK05506 bifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein; Provisional Probab=21.63 E-value=53 Score=13.15 Aligned_cols=58 Identities=17% Similarity=0.098 Sum_probs=33.5 Q ss_pred CCEEECCCCCCHHHHH----HHHHHCCCEEEECCHHHHHHHHHHHHC-CHHHHHHHHHHHHHH Q ss_conf 9499989844379999----999978986995798999999999958-989999999999999 Q gi|254780402|r 346 CAILSGPNVENFRDIY----RRMVSSGAVRIVEEVGTLADMVYSLLS-EPTIRYEMINAAINE 403 (440) Q Consensus 346 ~pVi~GP~~~nf~e~~----~~L~~~g~~~~v~d~~eL~~~l~~ll~-d~~~~~~~~~~a~~~ 403 (440) +-.++|..-..=..++ +.|.+.|.-..+=|.+.+-.-+.+=|. +++-|.+..++..++ T Consensus 445 ~iw~tGlsgsGKstiA~~le~~L~~~g~~~~~LDGd~lR~gl~~dlgf~~~dR~enirR~~ev 507 (613) T PRK05506 445 TVWFTGLSGSGKSTIANLVERRLHALGRHTYVLDGDNVRHGLNRDLGFTDADRVENIRRVAEV 507 (613) T ss_pred EEEEECCCCCCHHHHHHHHHHHHHHCCCCEEEECCHHHHHHHCCCCCCCHHHHHHHHHHHHHH T ss_conf 999977898974799999999999779987998808987410457797989999999999999 No 329 >pfam07925 RdRP_5 Reovirus RNA-dependent RNA polymerase lambda 3. The sequences in this family are similar to the reoviral minor core protein lambda 3, which functions as a RNA-dependent RNA polymerase within the protein capsid. It is organized into 3 domains. N- and C-terminal domains create a 'cage' that encloses a conserved central catalytic domain within a hollow centre; this catalytic domain is arranged to form 'fingers', 'palm' and 'thumb' subdomains. Unlike other RNA polymerases, like HIV reverse transcriptase and T7 RNA polymerase, lambda 3 protein binds template and substrate with only localized rearrangements, and catalytic activity can occur with little structural change. However, the structure of the catalytic complex is similar to that of other polymerase catalytic complexes with known structure. Probab=21.57 E-value=24 Score=15.39 Aligned_cols=33 Identities=18% Similarity=0.367 Sum_probs=12.0 Q ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHH Q ss_conf 62146788874224550586412665421210011112 Q gi|254780402|r 138 DIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSF 175 (440) Q Consensus 138 ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l 175 (440) |+||.++..... + .+||-..----|+|.+|.+- T Consensus 815 EiWPslidi~~G--~---f~NGV~D~l~Wr~WlRfsWA 847 (1271) T pfam07925 815 EIWPSLLDIIMG--I---FFNGVTDGLQWRRWLRFSWA 847 (1271) T ss_pred CCCHHHHHHHHH--H---HHCCCCCCCCHHHHHHHHHH T ss_conf 225688988876--2---21132244439999999999 No 330 >PRK11891 aspartate carbamoyltransferase; Provisional Probab=21.53 E-value=53 Score=13.14 Aligned_cols=22 Identities=14% Similarity=0.123 Sum_probs=8.3 Q ss_pred EECCCCHHHHHHHHHHHHCCHH Q ss_conf 7055211478887765300101 Q gi|254780402|r 269 VPRHPRRCDAIERRLIAKGLKV 290 (440) Q Consensus 269 vPRhpeR~~~i~~~l~~~gl~~ 290 (440) .|....=-+++.+.+...|..+ T Consensus 276 SP~~L~mP~~i~~~l~~~g~~~ 297 (430) T PRK11891 276 SPPTLEMPAYIVDQISTNGHVI 297 (430) T ss_pred CCHHHCCCHHHHHHHHHCCCEE T ss_conf 9213079889999999779869 No 331 >pfam02906 Fe_hyd_lg_C Iron only hydrogenase large subunit, C-terminal domain. Probab=21.47 E-value=53 Score=13.13 Aligned_cols=115 Identities=11% Similarity=0.095 Sum_probs=69.8 Q ss_pred EEEEECCHHHHHHHHHHHHHH----HHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHH-HHHHHHCCEEEE Q ss_conf 999817899999999999999----861998999962776300121014774168850557201344-557442020356 Q gi|254780402|r 59 IWFHASSVGETMALIGLIPAI----RSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVS-RFLKYWKPDCMI 133 (440) Q Consensus 59 IW~HaaSvGE~~~a~pli~~l----~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~-~fl~~~~P~~~i 133 (440) -+++-+++|.-..+..-.+++ .+.....++|+.=|+=...+++.+|+.+ -...|...|..+. ++++...| T Consensus 51 ~~V~d~a~gad~~~~~~~~E~~~~~~~~~~~P~isS~CP~~V~~iEk~~PeLi-~~ls~v~SPm~~~g~l~K~~~~---- 125 (286) T pfam02906 51 DYVFDTAFGADLTIMEEASEFLERLKKGEKLPMFTSCCPAWVKYVEKYYPELL-PNLSTCKSPMQMFGALIKTDPE---- 125 (286) T ss_pred CEEEECHHHHHHHHHHHHHHHHHHHHCCCCCCEEECCCHHHHHHHHHHCHHHC-CCCCCCCCHHHHHHHHHCCCCC---- T ss_conf 89998478899999999999999985599975485379899999998675434-8878888989999999615899---- Q ss_pred EECCCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHHHCCC Q ss_conf 4047621467888742245505864126654212100111123310264423664157788899860885 Q gi|254780402|r 134 LSESDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKELGAQ 203 (440) Q Consensus 134 ~~e~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~lG~~ 203 (440) +|.++.|---+++|.-..- ...-..+|.+++-.+- .+.|.+.|++ T Consensus 126 -----------------~i~~V~I~PC~aKK~Ea~r-------~~~~~~VD~VLT~~El-~~~l~~~~i~ 170 (286) T pfam02906 126 -----------------DIFVVSIMPCTAKKFEAAR-------PEMKGDVDAVLTTREL-AAMIKEAGID 170 (286) T ss_pred -----------------CEEEEEEECCCCHHHHHCC-------CCCCCCCCEEEEHHHH-HHHHHHCCCC T ss_conf -----------------7799999556004767508-------3014765299768999-9999984999 No 332 >PRK13135 consensus Probab=21.33 E-value=54 Score=13.11 Aligned_cols=62 Identities=8% Similarity=0.086 Sum_probs=33.3 Q ss_pred HHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 6199499989844379999999978986995798999999999958989999999999999970455699 Q gi|254780402|r 343 MLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLK 412 (440) Q Consensus 343 a~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~ 412 (440) ...+||+.|=-+++-.++ +.+.+.--+.+|. ..+.+.+..- ..+++.++..+|+++.+.|.+ T Consensus 202 ~t~~Pv~vGFGI~~~e~v-~~i~~~ADGvIVG--SaiVk~ie~~-----~~~~~~~~i~~fv~~lk~ai~ 263 (267) T PRK13135 202 KITVPVVVGFGISTPQQA-ADVAAMADGVVVG--SALVKLFELH-----RGEELRQEVATFVASLRQAIP 263 (267) T ss_pred HCCCCEEEEECCCCHHHH-HHHHCCCCEEEEC--HHHHHHHHHC-----CCHHHHHHHHHHHHHHHHHHH T ss_conf 068984898167999999-9998059999987--8999999860-----818789999999999999975 No 333 >PRK04000 translation initiation factor IF-2 subunit gamma; Validated Probab=21.28 E-value=54 Score=13.10 Aligned_cols=85 Identities=13% Similarity=0.087 Sum_probs=53.3 Q ss_pred CCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEE-EECCHHHHHHHHHHCCCCEEEECCCCCCCC Q ss_conf 6214678887422455058641266542121001111233102644236-641577888998608850587204310123 Q gi|254780402|r 138 DIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLV-IVQSERYFRRYKELGAQKLIVSGNLKIDTE 216 (440) Q Consensus 138 ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i-~~qs~~~~~rl~~lG~~~i~v~GnlK~d~~ 216 (440) ++=.||+.-+.....-+++|.|.- .+ ..|+.+...-++.+|++++.|.= .|.|.+ T Consensus 96 ~fi~nMi~Gas~~D~alLVVaA~e-----------------------G~p~pQT~EHl~i~~~lgi~~~iV~l-nK~Dlv 151 (410) T PRK04000 96 TLMATMLSGAALMDGAVLVIAANE-----------------------PCPQPQTKEHLMALDIIGIKNIVIVQ-NKIDLV 151 (410) T ss_pred HHHHHHHHHHCCCCEEEEEEECCC-----------------------CCCCCCHHHHHHHHHHCCCCCEEEEE-ECCCCC T ss_conf 999999840212667999986577-----------------------87677149999999980998379999-625678 Q ss_pred CCCCCHHHHHHHHHHCC-----CCCEEECCCCCCCC Q ss_conf 44221256899998617-----98617504655332 Q gi|254780402|r 217 SLPCDKELLSLYQESIA-----GRYTWAAISTFEGE 247 (440) Q Consensus 217 ~~~~~~~~~~~~~~~~~-----~r~v~vagSth~~E 247 (440) ......+....+.+++. +-|+ +-.|...++ T Consensus 152 ~~e~~~~~~~ei~~~l~g~~~~~~pi-ipvSa~~g~ 186 (410) T PRK04000 152 SKEKALENYEQIKEFVKGTVAENAPI-IPVSALHGV 186 (410) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCE-EEEECCCCC T ss_conf 98999999999999870676568999-996477788 No 334 >COG2099 CobK Precorrin-6x reductase [Coenzyme metabolism] Probab=21.15 E-value=54 Score=13.09 Aligned_cols=126 Identities=16% Similarity=0.125 Sum_probs=80.6 Q ss_pred CEEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEEC Q ss_conf 77999817899999999999999861998999962776300121014774168850557201344557442020356404 Q gi|254780402|r 57 PLIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSE 136 (440) Q Consensus 57 ~~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e 136 (440) ..||+=+ =+.-++.|.++|....-.+++|+.|..|.+.+++..+ .+.. =++|- .-.+.|++.-+-+++|=.- T Consensus 3 ~~ilvlG----GT~Dar~la~~L~~~~~~~~~ss~t~~g~~l~~~~~~-~~~~--G~l~~-e~l~~~l~e~~i~llIDAT 74 (257) T COG2099 3 MRILLLG----GTSDARALAKKLAAAPVDIILSSLTGYGAKLAEQIGP-VRVG--GFLGA-EGLAAFLREEGIDLLIDAT 74 (257) T ss_pred CEEEEEE----CCHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHCCCC-EEEC--CCCCH-HHHHHHHHHCCCCEEEECC T ss_conf 3599982----6389999999862068617999703444241110588-6652--76887-8999999974988899788 Q ss_pred ----CCCHHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHHHHHHHH-HCCCCEEE Q ss_conf ----762146788874224550586412665421210011112331026442366415778889986-08850587 Q gi|254780402|r 137 ----SDIWPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERYFRRYKE-LGAQKLIV 207 (440) Q Consensus 137 ----~ElWPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~~~rl~~-lG~~~i~v 207 (440) .++=-|.+..|+..|||.+-.- |=.|.+. .=..+.+-|-+++.+... .| ++|.. T Consensus 75 HPyAa~iS~Na~~aake~gipy~r~e---------RP~~~~~-------gd~~~~V~d~~ea~~~~~~~~-~rVfl 133 (257) T COG2099 75 HPYAARISQNAARAAKETGIPYLRLE---------RPPWAPN-------GDNWIEVADIEEAAEAAKQLG-RRVFL 133 (257) T ss_pred CHHHHHHHHHHHHHHHHHCCCEEEEE---------CCCCCCC-------CCCEEEECCHHHHHHHHHCCC-CCEEE T ss_conf 75799998999999998599679987---------7755457-------996698458999999875047-73799 No 335 >PRK13124 consensus Probab=21.15 E-value=54 Score=13.09 Aligned_cols=63 Identities=11% Similarity=0.136 Sum_probs=34.7 Q ss_pred HHCCCEEECCCCCCHHHHHHHHHHCCCEEEECCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 6199499989844379999999978986995798999999999958989999999999999970455699 Q gi|254780402|r 343 MLGCAILSGPNVENFRDIYRRMVSSGAVRIVEEVGTLADMVYSLLSEPTIRYEMINAAINEVKKMQGPLK 412 (440) Q Consensus 343 a~G~pVi~GP~~~nf~e~~~~L~~~g~~~~v~d~~eL~~~l~~ll~d~~~~~~~~~~a~~~i~~~~ga~~ 412 (440) ....||+.|=-+++-.++ +.+.+.--+..|. ..+.+.+.+-.+ .++...+..+|+++.+.|.+ T Consensus 193 ~t~~Pv~vGFGI~~~e~v-~~~~~~ADGvIVG--Saivk~i~~~~~----~~~~~~~v~~fv~~lk~ald 255 (257) T PRK13124 193 YSNVPVAVGFGISTPEQV-QKMKEIADGVVVG--SALVEKIEEPEE----REEALAEVEEFASSLRESLD 255 (257) T ss_pred CCCCCEEEEECCCCHHHH-HHHHHHCCEEEEC--HHHHHHHHHCCC----HHHHHHHHHHHHHHHHHHHC T ss_conf 179983898446999999-9998019999982--899999985687----57899999999999999877 No 336 >TIGR02125 CytB-hydogenase Ni/Fe-hydrogenase, b-type cytochrome subunit; InterPro: IPR000516 Bacterial membrane-bound nickel-dependent hydrogenases , , seem to be associated with a b-type cytochrome involved in electron transfer from hydrogen to oxygen. This cytochrome is a protein of about 28 kDa that seems to have four transmembrane regions, which include several histidine residues that may be involved in coordination of the haem iron group. The gene coding for this cytochrome is adjacent to that coding for the large subunit of the hydrogenase. It has been assigned a variety of names in different species: hupC, hyaC, hydC or hoxZ. Every genome which contains a member of this family posesses a Ni/Fe hydrogenase according to Genome Properties (GenProp0177), and most are gene clustered with other hydrogenase components. Some Ni/Fe hydrogenase-containing species lack a member of this family but contain other CytB homologs, which may substitute for it.; GO: 0016491 oxidoreductase activity, 0006118 electron transport, 0016020 membrane. Probab=20.80 E-value=45 Score=13.59 Aligned_cols=33 Identities=27% Similarity=0.394 Sum_probs=21.5 Q ss_pred HHHHHCCCEEEECCCCCCCCCCHHHHHHHCCCE Q ss_conf 554220820661355444545837689619949 Q gi|254780402|r 316 FYLRMTEIAFIGRSFCASGGQNPLEAAMLGCAI 348 (440) Q Consensus 316 ~lY~~AdiafVGGSl~~~GGhN~lEpaa~G~pV 348 (440) .+|++.---|+|+--...|||||+-.+++=.=. T Consensus 103 ~~~q~~~YlFL~~~p~~~g~yNP~q~~ay~~~~ 135 (229) T TIGR02125 103 FLYQLKWYLFLGKHPHKKGGYNPLQFVAYFGFI 135 (229) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHH T ss_conf 999999886405899876153728999999999 No 337 >PRK01170 phosphopantetheine adenylyltransferase/unknown domain fusion protein; Provisional Probab=20.76 E-value=55 Score=13.03 Aligned_cols=22 Identities=5% Similarity=0.153 Sum_probs=14.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9999999975237878501678 Q gi|254780402|r 412 KITLRSLDSYVNPLIFQNHLLS 433 (440) Q Consensus 412 ~~~~~~i~~~l~~~~~~~~~~~ 433 (440) +-+..++-+.|||-+|...-+| T Consensus 306 ~~v~mA~iprlnp~~Y~~~~~~ 327 (328) T PRK01170 306 ESIRNAFIPRIDPDFYDTTYTP 327 (328) T ss_pred HHHHHHHHCCCCHHHCCCCCCC T ss_conf 9999875034697770166689 No 338 >cd06305 PBP1_methylthioribose_binding_like Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily. Methylthioribose-binding protein-like of ABC-type transport systems that belong to a family of pentose/hexose sugar-binding proteins of the type I periplasmic binding protein (PBP1) superfamily, which consists of two alpha/beta globular domains connected by a three-stranded hinge. This Venus flytrap-like domain undergoes transition from an open to a closed conformational state upon ligand binding. The sugar-binding domain of the periplasmic proteins in this group is also homologous to the ligand-binding domain of eukaryotic receptors such as metabotropic glutamate receptor (mGluR), DNA-binding transcriptional repressors such as LacI and GalR. Probab=20.62 E-value=56 Score=13.02 Aligned_cols=43 Identities=16% Similarity=0.231 Sum_probs=32.0 Q ss_pred HHHHHHHHHCCEEEEEE--CCCCHHHHHHHHHHCCCCEEEEEECCC Q ss_conf 34455744202035640--476214678887422455058641266 Q gi|254780402|r 119 AVSRFLKYWKPDCMILS--ESDIWPLTVFELSKQRIPQVLVNARMS 162 (440) Q Consensus 119 ~~~~fl~~~~P~~~i~~--e~ElWPnli~~~~~~~ip~~linarls 162 (440) .++.++.. +++.+++. ..+-....+..+...|||++.+|.... T Consensus 47 ~i~~~i~~-~vDgIii~p~d~~~~~~~l~~a~~aGIPVV~~d~~~~ 91 (273) T cd06305 47 QIDQAIAQ-KVDAIIIQHGRAEVLKPWVKRALDAGIPVVAFDVDSD 91 (273) T ss_pred HHHHHHHC-CCCEEEEECCCHHHHHHHHHHHHHCCCCEEEECCCCC T ss_conf 99999985-9999999468714448999999985997899817767 No 339 >PRK07586 hypothetical protein; Validated Probab=20.54 E-value=56 Score=13.00 Aligned_cols=38 Identities=24% Similarity=0.098 Sum_probs=26.2 Q ss_pred CCEEEEEECCHHHHHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 87799981789999999999999986199899996277 Q gi|254780402|r 56 GPLIWFHASSVGETMALIGLIPAIRSRHVNVLLTTMTA 93 (440) Q Consensus 56 ~~~IW~HaaSvGE~~~a~pli~~l~~~~~~iliT~~T~ 93 (440) ++-+.+=-+..|-.+++..+.....++-|=++||-..+ T Consensus 64 ~pgv~~~t~GPG~tN~~~gl~~A~~d~~Pvl~i~G~~~ 101 (514) T PRK07586 64 KPAATLLHLGPGLANGLANLHNARRARSPIVNIVGDHA 101 (514) T ss_pred CCEEEEEECCHHHHHHHHHHHHHHHHCCCEEEEECCCC T ss_conf 98899981380799999999999840978699966887 No 340 >PRK13644 cbiO cobalt transporter ATP-binding subunit; Provisional Probab=20.44 E-value=56 Score=12.99 Aligned_cols=62 Identities=13% Similarity=0.153 Sum_probs=39.9 Q ss_pred HHCCCEEE--ECCCCCCC------CCCHHHHHHHCCCEEECCCCCCHHHHHHHHHHCCCEEE---ECCHHHHHHHHHHHH Q ss_conf 22082066--13554445------45837689619949998984437999999997898699---579899999999995 Q gi|254780402|r 319 RMTEIAFI--GRSFCASG------GQNPLEAAMLGCAILSGPNVENFRDIYRRMVSSGAVRI---VEEVGTLADMVYSLL 387 (440) Q Consensus 319 ~~AdiafV--GGSl~~~G------GhN~lEpaa~G~pVi~GP~~~nf~e~~~~L~~~g~~~~---v~d~~eL~~~l~~ll 387 (440) +.||-++| .|-.+..| .+.-++.+....|.+ -+.+..|.+.|.-.. +.+.++|++.+++|+ T Consensus 200 ~~aDrvivl~~G~Iv~~G~p~eif~~~~l~~~~l~~P~~--------~~~~~~L~~~g~~~~~~~~~~~~~~~~~~~~~~ 271 (274) T PRK13644 200 HDADRIIVMDRGKIVLEGEPENVLSDVSLQTLGLTPPSL--------IELAENLKMHGVVIPWENTSSPSSFAEEICRLF 271 (274) T ss_pred HCCCEEEEEECCEEEEECCHHHHHCCHHHHHCCCCCCHH--------HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 719989999899999987989984896288769999849--------999999997699899534689999999999985 Q ss_pred C Q ss_conf 8 Q gi|254780402|r 388 S 388 (440) Q Consensus 388 ~ 388 (440) . T Consensus 272 ~ 272 (274) T PRK13644 272 L 272 (274) T ss_pred C T ss_conf 4 No 341 >cd01537 PBP1_Repressors_Sugar_Binding_like Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems. Ligand-binding domain of the LacI-GalR family of transcription regulators and the sugar-binding domain of ABC-type transport systems, all of which contain the type I periplasmic binding protein-like fold. Their specific ligands include lactose, ribose, fructose, xylose, arabinose, galactose/glucose, and other sugars. The LacI family of proteins consists of transcriptional regulators related to the lac repressor; in general the sugar binding domain in this family binds a sugar, which in turn changes the DNA binding activity of the repressor domain. The core structure of the periplasmic binding proteins is classified into two types and they differ in number and order of beta strands in each domain: type I, which has six beta strands, and type II, which has five beta strands. These two distinct structural arrangem Probab=20.26 E-value=57 Score=12.97 Aligned_cols=149 Identities=17% Similarity=0.150 Sum_probs=66.6 Q ss_pred HHHHHHHCCEEEEEECCCC-HHHHHHHHHHCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEEC----CHHHHH Q ss_conf 4557442020356404762-14678887422455058641266542121001111233102644236641----577888 Q gi|254780402|r 121 SRFLKYWKPDCMILSESDI-WPLTVFELSKQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQ----SERYFR 195 (440) Q Consensus 121 ~~fl~~~~P~~~i~~e~El-WPnli~~~~~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~q----s~~~~~ 195 (440) +.++. .+++.+|+...+. =+..+..+.+.+||++.++....+.. .++.|..- .....+ T Consensus 49 ~~~~~-~~vDgiIi~~~~~~~~~~~~~~~~~~ipvV~~~~~~~~~~----------------~~~~V~~d~~~~g~~~~~ 111 (264) T cd01537 49 ENLIA-RGVDGIIIAPSDLTAPTIVKLARKAGIPVVLVDRDIPDGD----------------RVPSVGSDNEQAGYLAGE 111 (264) T ss_pred HHHHH-CCCCEEEEECCCCCCHHHHHHHHHCCCCEEEECCCCCCCC----------------CCEEEEECHHHHHHHHHH T ss_conf 99997-6999999967988868999999975997999856678888----------------871999777999999999 Q ss_pred HHHHHCCCCEEEEC-CCCCCCCCCCCCHHHHHHHHHHCCCCCEEECCCCCCCCCCCCCCCCCCCCCCCCEEEEEEECCCC Q ss_conf 99860885058720-43101234422125689999861798617504655332100000000001234124889705521 Q gi|254780402|r 196 RYKELGAQKLIVSG-NLKIDTESLPCDKELLSLYQESIAGRYTWAAISTFEGEEDKAVYVHNFIKCRTDVLTIIVPRHPR 274 (440) Q Consensus 196 rl~~lG~~~i~v~G-nlK~d~~~~~~~~~~~~~~~~~~~~r~v~vagSth~~EE~~il~a~~~l~~~~~~~lIIvPRhpe 274 (440) ++.+.|-.++-+.+ +...... ..........+.+.......+..-..+..++ ..-.+.+.+.++|+.--|++.-. . T Consensus 112 ~l~~~g~~~i~~i~~~~~~~~~-~~R~~g~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~l~~~~~~dai~~~~d-~ 188 (264) T cd01537 112 HLAEKGHRRIALLAGPLGSSTA-RERVAGFKDALKEAGPIEIVLVQEGDWDAEK-GYQAAEELLTAHPDPTAIFAAND-D 188 (264) T ss_pred HHHHCCCCCEEECCCCCCCCHH-HHHHHHHHHHHHHCCCCCCEEEEECCCCHHH-HHHHHHHHHHHCCCCEEEEECCH-H T ss_conf 9986298624530488777069-9999999999997799985379957999899-99999999850899769998986-9 Q ss_pred HHHHHHHHHHHHCCH Q ss_conf 147888776530010 Q gi|254780402|r 275 RCDAIERRLIAKGLK 289 (440) Q Consensus 275 R~~~i~~~l~~~gl~ 289 (440) -+..+.+.+++.|++ T Consensus 189 ~A~gv~~al~~~g~~ 203 (264) T cd01537 189 MALGALRALREAGLR 203 (264) T ss_pred HHHHHHHHHHHCCCC T ss_conf 999999999974999 No 342 >KOG2648 consensus Probab=20.18 E-value=57 Score=12.96 Aligned_cols=94 Identities=18% Similarity=0.247 Sum_probs=59.1 Q ss_pred CEEEEEE---E--CCCCHHHHHHHHHHHHCCHHHHHCCCCCCCC------CCEEEEECCCCCCHHHHHHCCCEEE-ECCC Q ss_conf 1248897---0--5521147888776530010110013444446------6416750676520255422082066-1355 Q gi|254780402|r 263 DVLTIIV---P--RHPRRCDAIERRLIAKGLKVARRSRGDVINA------EVDIFLGDTIGEMGFYLRMTEIAFI-GRSF 330 (440) Q Consensus 263 ~~~lIIv---P--RhpeR~~~i~~~l~~~gl~~~~~S~~~~~~~------~~~V~i~Dt~GeL~~lY~~AdiafV-GGSl 330 (440) +..-||+ . .|++-...+++++++.|.+..+.+.++.... +.||++.=.=.. ++.- |+.| T Consensus 267 ~~iGlivGTLG~qg~~~vl~~L~~~~~~~Gkk~y~l~~g~inPaKLAnF~eIDvfV~iaCp~---------lsid~s~~F 337 (453) T KOG2648 267 RTIGLIVGTLGRQGNREVLEHLRKLLKAAGKKSYVLALGEINPAKLANFPEIDVFVQIACPR---------LSIDWSKEF 337 (453) T ss_pred CEEEEEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHCCCCCCCEEEEEECCC---------CCHHHHHHH T ss_conf 86899986131347799999999999973882499990677878955765454899970765---------313344440 Q ss_pred CCCCCCCHHHHHHHCCC-EEECCCCCCHHHHHHHHHH Q ss_conf 44454583768961994-9998984437999999997 Q gi|254780402|r 331 CASGGQNPLEAAMLGCA-ILSGPNVENFRDIYRRMVS 366 (440) Q Consensus 331 ~~~GGhN~lEpaa~G~p-Vi~GP~~~nf~e~~~~L~~ 366 (440) .+---+|+|+..+..+ .-+|++...|.+.+..+.. T Consensus 338 -~kPiltPfEa~~Al~~~~W~~~~~~~~~~~~~~~~~ 373 (453) T KOG2648 338 -YKPLLTPFEAEVALNPIAWTGDYLAPFVTAIKLLLK 373 (453) T ss_pred -CCCCCCHHHHHHHCCCCCCCCCCCCCHHHHHHHHHC T ss_conf -565455899987618654576556205667766521 No 343 >COG0145 HyuA N-methylhydantoinase A/acetone carboxylase, beta subunit [Amino acid transport and metabolism / Secondary metabolites biosynthesis, transport, and catabolism] Probab=20.09 E-value=57 Score=12.94 Aligned_cols=104 Identities=15% Similarity=0.163 Sum_probs=59.0 Q ss_pred HHHHHHHHHHHHCCCC-EEEEECCCCCCHHHHCCCCCCEEEEECCCCCCHHHHHHHHHHCCEEEEEECCCCHHHHHHHHH Q ss_conf 9999999999861998-999962776300121014774168850557201344557442020356404762146788874 Q gi|254780402|r 70 MALIGLIPAIRSRHVN-VLLTTMTATSAKVARKYLGQYAIHQYAPLDIQPAVSRFLKYWKPDCMILSESDIWPLTVFELS 148 (440) Q Consensus 70 ~~a~pli~~l~~~~~~-iliT~~T~tg~~~~~~~~~~~~~~~y~P~D~~~~~~~fl~~~~P~~~i~~e~ElWPnli~~~~ 148 (440) ..+...+++++.++.+ |.|++ .|.|.=-++-..++..++..-|++-+....|+||-+ .+-. T Consensus 139 ~~v~~~~~~l~~~gv~siAVs~-----------------~~S~~NP~HE~~v~eiire~~~~i~V~~shev~p~~-~~~e 200 (674) T COG0145 139 EEVREAAAALKAAGVEAIAVSS-----------------LFSYRNPEHELRVAEIIREIGPDIPVSLSHEVSPEI-GEYE 200 (674) T ss_pred HHHHHHHHHHHHCCCCEEEEEE-----------------ECCCCCCHHHHHHHHHHHHHCCCCEEEECHHCCHHC-CCCC T ss_conf 9999999999867985899998-----------------636578477999999999865994598601324001-8530 Q ss_pred HCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCCEEEECCHHH Q ss_conf 224550586412665421210011112331026442366415778 Q gi|254780402|r 149 KQRIPQVLVNARMSRRSFKNWKTVLSFSKKIFSQFSLVIVQSERY 193 (440) Q Consensus 149 ~~~ip~~linarls~~S~~~~~~~~~l~~~~~~~~~~i~~qs~~~ 193 (440) + .-+.++||++++..-+--..+....+.---.....++||.-. T Consensus 201 R--~~TavlnA~L~pi~~~yl~~v~~~l~~~g~~~~l~~m~sdGg 243 (674) T COG0145 201 R--ANTAVLNAYLSPILRRYLEAVKDALKERGIKARLMVMQSDGG 243 (674) T ss_pred C--HHHHEEEEEEHHHHHHHHHHHHHHHHHCCCCCEEEEEECCCC T ss_conf 0--140123114249899999999999985699850699965886 Done!